AVX2 Tuning Paying Off Big Time For Dav1d 10b/12b Video Decode

Written by Michael Larabel in Multimedia on 17 May 2021 at 08:08 AM EDT.
The new dav1d 0.9 AV1 decoder release brings AVX2 Assembly for higher bit depth videos, and the performance improvements are very pronounced on modern Intel and AMD systems.

Following this weekend's release of dav1d 0.9, I immediately set off to benchmark this updated CPU-based AV1 video decoder. dav1d is used by Google Chrome, Mozilla Firefox, and other software for processor-based AV1 decoding, since all but the very latest hardware does not yet offer GPU-accelerated AV1 handling.
dav1d 0.9 comparison

The dav1d 0.9 performance is largely unchanged on x86_64 until getting to the high bit depth content...
dav1d 0.9 comparison

For 10-bit AV1 videos, dav1d 0.9 is running multiple times faster thanks to the new hand-written AVX2 Assembly, work that was funded by Netflix and Facebook. That is a big win for dav1d 0.9. As a reminder, AVX2 is found on Intel CPUs going back to Haswell and on AMD processors since Excavator.
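
For readers wondering whether their own system can benefit from the AVX2 code paths, here is a minimal Python sketch for Linux. It assumes an x86 kernel that lists CPU features on a "flags" line in /proc/cpuinfo; the helper names cpu_flags and has_avx2 are made up for this illustration and are not part of dav1d.

```python
from pathlib import Path


def cpu_flags(cpuinfo_text: str) -> set[str]:
    """Collect the feature flags found in /proc/cpuinfo-style text."""
    flags: set[str] = set()
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        # Assumption: x86 Linux kernels label the feature line "flags".
        if sep and key.strip() == "flags":
            flags.update(value.split())
    return flags


def has_avx2(cpuinfo_text: str) -> bool:
    """Return True if the 'avx2' flag is listed for any CPU entry."""
    return "avx2" in cpu_flags(cpuinfo_text)


if __name__ == "__main__":
    path = Path("/proc/cpuinfo")
    if path.exists():
        print("AVX2 supported:", has_avx2(path.read_text()))
    else:
        print("/proc/cpuinfo not found; this check is Linux-only")
```

This only reports what the kernel exposes. Which code path dav1d actually selects is decided by its own runtime CPU detection.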

More dav1d 0.9 benchmarks incoming at OpenBenchmarking.org.
About The Author
Michael Larabel

Michael Larabel is the principal author of Phoronix.com and founded the site in 2004 with a focus on enriching the Linux hardware experience. Michael has written more than 20,000 articles covering the state of Linux hardware support, Linux performance, graphics drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated benchmarking software. He can be followed via Twitter, LinkedIn, or contacted via MichaelLarabel.com.
