AV1 Decoder dav1d Lands 10-bit AVX2 Assembly For Big Speed-Up, Thanks Facebook + Netflix

Written by Michael Larabel in Multimedia on 11 May 2021 at 06:28 PM EDT. 34 Comments
For those making use of 10-bit AV1 content and using dav1d as the performant CPU-based decoder, the performance on modern Intel and AMD processors is about to be a heck of a lot better.

Dav1d has enjoyed speedy 10-bit decoding on AArch64 hardware thanks to hand-written Assembly while finally dav1d is seeing AVX2-optimized 10-bit decode. Both Facebook and Netflix provided the funding to make the AVX2-optimized 10-bit decode happen for dav1d.

As of today that Assembly code optimized for AVX2 at high bit depths was merged.

I'm told the performance is a "big" improvement. The dav1d developers are also planning on issuing their next feature release with this code included in the next week or so.

When that next release happens, I'll surely be running benchmarks. For now you can find current CPU performance figures with different AV1 inputs for dav1d via this OpenBenchmarking.org composite listing.
Related News
About The Author
Michael Larabel

Michael Larabel is the principal author of Phoronix.com and founded the site in 2004 with a focus on enriching the Linux hardware experience. Michael has written more than 20,000 articles covering the state of Linux hardware support, Linux performance, graphics drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated benchmarking software. He can be followed via Twitter, LinkedIn, or contacted via MichaelLarabel.com.

Popular News This Week