AMDGPU Reset Recovery To Be Flipped On By Default For Newer Radeon GPUs

Written by Michael Larabel in Radeon on 28 October 2018 at 07:00 AM EDT. 52 Comments
RADEON
The AMDGPU DRM Linux kernel driver has offered GPU reset recovery for a while now in case of hangs that can be toggled by a module parameter, but the default behavior for the next kernel release is slated to change where it will be enabled by default for the newer Radeon GPUs.

With the amdgpu-drm-next code for what will eventually be either Linux 4.21 or Linux 5.1, that's being changed where by default it's on for GFX8/GFX9 GPUs. That GPU reset recovery code in the mainline Linux driver has been found to be "working for the most part" on Topaz, Tonga, Fiji, Polaris, and Vega graphics processors, according to this commit added on Friday to the AMDGPU DRM driver's testing branch. Previously it was just on by default for GPU SR-IOV setups. The recovery is triggered after a ten second job timeout.

At least until if/when that GPU reset recovery code is improved upon for the older AMD GCN GPUs, there is still the gpu_recovery module parameter whereby amdgpu.gpu_recovery=1 will enable the GPU recovery path unconditionally. But hopefully your Radeon GPU support is stable enough these days on the open-source driver to not have to worry about such recovery in the first place... It actually seems now like it's been a few kernel releases since I last had a Radeon GPU hang under Linux regardless of the GPU model (sans Raven), so for the most part hopefully this change won't end up being noticeable to end-users.

Have you had any Radeon GPU hangs under Linux recently? Or any particular Radeon Linux driver bugs still biting you? Let us know in the forums.
Related News
About The Author
Michael Larabel

Michael Larabel is the principal author of Phoronix.com and founded the site in 2004 with a focus on enriching the Linux hardware experience. Michael has written more than 20,000 articles covering the state of Linux hardware support, Linux performance, graphics drivers, and other topics. Michael is also the lead developer of the Phoronix Test Suite, Phoromatic, and OpenBenchmarking.org automated benchmarking software. He can be followed via Twitter, LinkedIn, or contacted via MichaelLarabel.com.

Popular News This Week