AMD Working On More Precise GPU Reset Capabilities For Their Linux Driver

Written by Michael Larabel in Radeon on 25 July 2024 at 11:20 AM EDT.
While the hope remains that GPU resets are needed only rarely, AMD Linux driver engineers have recently been working on per-queue GC reset support so that a reset, when one is needed, can be more precise.

With the latest patches for the AMDGPU kernel driver code, the driver would be able to reset only the graphics/compute ring where an error occurs, losing just the jobs pending in that queue rather than carrying out a full GPU reset. If the per-queue/ring reset doesn't work, the AMDGPU driver would still fall back to a full GPU reset as needed.
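The try-the-ring-first, fall-back-to-full-reset flow described above can be modeled in a few lines. This is only an illustrative sketch in Python, not the AMDGPU kernel code: the `Ring` class, `recover_from_hang`, and the `try_ring_reset` callback are hypothetical names, and the model assumes that a full reset discards the pending jobs on every ring.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Ring:
    """Hypothetical stand-in for one graphics/compute queue."""
    name: str
    pending_jobs: List[str] = field(default_factory=list)


def recover_from_hang(
    rings: List[Ring],
    hung: Ring,
    try_ring_reset: Callable[[Ring], bool],
) -> str:
    """Attempt a per-ring reset first; fall back to a full reset on failure.

    Returns "ring" if the targeted reset succeeded, otherwise "full".
    """
    if try_ring_reset(hung):
        # Targeted reset: only the jobs queued on the hung ring are lost.
        hung.pending_jobs.clear()
        return "ring"

    # Fallback (assumption of this model): a full reset drops pending
    # work on every ring, not just the one that hung.
    for ring in rings:
        ring.pending_jobs.clear()
    return "full"
```

In this model, calling `recover_from_hang([gfx, compute], gfx, lambda r: True)` empties only the `gfx` ring's job list, while a callback that returns `False` empties both.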

An updated set of 53 patches was posted today introducing reset capabilities on a per-ring basis in the AMDGPU driver. The support is wired up across the recent generations of AMD graphics processors, including the upcoming GFX12 (RDNA4) hardware.

The hope is that this per-ring reset capability makes for a more pleasant experience when a specific job has hung, compared with going the route of a full "legacy" GPU reset.

Those interested in learning more can do so via this patch series now under review for GC per-queue resets.