Mesa 24.3 Sees "Substantial Improvement" To AMD Clear/Copy-Buffer Compute Shader

Written by Michael Larabel in Mesa on 10 August 2024 at 06:27 AM EDT.
Well-known AMD Mesa developer Marek Olšák continues relentlessly optimizing the RadeonSI Gallium3D driver and related code to ensure the AMD graphics stack can reach peak performance.

Recently Marek has been working to optimize the clear/copy_buffer compute shader, moving it into AMD common code and, as part of that work, adding support for unaligned buffer clears and copies.

In the merge request opened a few weeks ago, Marek describes this work as a "substantial improvement," and overnight it was merged for Mesa 24.3. Marek notes in the merge request:
"This is a substantial improvement of the clear/copy_buffer compute shader in radeonsi, which is also moved to src/amd/common.

This adds support for unaligned buffer clears and copies while maintaining the same performance as aligned clears and copies. The optimal alignment for buffer offsets is 256, not 4.

More chip-specific tuning will follow, but this is already optimal for Navi31."
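
For readers wondering what the 256-byte figure means in practice, here is a minimal sketch in C of the arithmetic involved. It is not taken from Mesa and does not describe how the actual compute shader handles unaligned data; the names `OPTIMAL_ALIGNMENT`, `copy_split`, and `split_copy` are hypothetical. It only shows how a byte range starting at an arbitrary buffer offset can be thought of as an unaligned head, a run of whole 256-byte blocks, and a leftover tail.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical illustration only; none of these names come from Mesa.
 * 256 is the optimal buffer-offset alignment quoted in the merge request. */
#define OPTIMAL_ALIGNMENT 256u

struct copy_split {
    size_t head; /* bytes before the first 256-byte boundary */
    size_t body; /* bytes covered by whole 256-byte blocks */
    size_t tail; /* leftover bytes after the last whole block */
};

/* Split a range of `size` bytes starting at `offset` into head/body/tail. */
static struct copy_split split_copy(uint64_t offset, size_t size)
{
    struct copy_split s = {0, 0, 0};
    size_t misalign = (size_t)(offset % OPTIMAL_ALIGNMENT);
    size_t to_boundary = misalign ? OPTIMAL_ALIGNMENT - misalign : 0;

    /* The head never exceeds the total size of the range. */
    s.head = to_boundary < size ? to_boundary : size;

    size_t rest = size - s.head;
    s.body = rest - rest % OPTIMAL_ALIGNMENT;
    s.tail = rest - s.body;

    /* Sanity check: the three pieces cover the range exactly. */
    assert(s.head + s.body + s.tail == size);
    return s;
}
```

Under these assumptions, `split_copy(3, 1000)` gives a 253-byte head, a 512-byte body, and a 235-byte tail, while a range that already starts on a 256-byte boundary has no head at all.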

Great to see more of Marek's optimizations ready for Mesa Git. It will be interesting to see what more tuning Marek achieves in time for Mesa 24.3 stable due out in Q4.