Announcement

Collapse
No announcement yet.

AMDGPU Reset Recovery To Be Flipped On By Default For Newer Radeon GPUs

Collapse
X
  • Filter
  • Time
  • Show
Clear All
new posts

  • AMDGPU Reset Recovery To Be Flipped On By Default For Newer Radeon GPUs

    Phoronix: AMDGPU Reset Recovery To Be Flipped On By Default For Newer Radeon GPUs

    The AMDGPU DRM Linux kernel driver has offered GPU reset recovery for a while now in case of hangs that can be toggled by a module parameter, but the default behavior for the next kernel release is slated to change where it will be enabled by default for the newer Radeon GPUs...

    http://www.phoronix.com/scan.php?pag...et-Recover-Def

  • #2
    I did laugh when I saw GPU reset patches for the intel gpus. The AMD drm-next-4.21-wip kernel started to hang the system when waking up from monitor blanking and sleeping after 30.9.2018 (4.19.rc5->rc6). The gpu reset patch has no effect with RX560 and system must be rebooted with the power button. My distribution has latest wip kernel available with Synaptic.
    Last edited by debianxfce; 10-28-2018, 07:19 AM.

    Comment


    • #3
      Well, till this spring i had gpu hangs that needed the reset button treatment on my Tonga gpu very often, but to be frank the situation has vastly improved with the latest Kernel/Mesa. I get the feeling that things are far more stable now. Still this reset recovery feature seems like a nice thing to have.

      Comment


      • #4
        Only hangs I've had recently is with DXVK due to me still being on LLVM6 rather than LLVM7. A few kernel versions back hangs were common but they seem to have gone away.

        Comment


        • #5
          This is good news. I consistently get hangs with games. Typically once every few hours. I do not bother reporting them as I cannot reproduce on demand and the bug tracker is full of them with similar logs ([drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, last signaled seq=145101, last emitted seq=145103). I have activated the option, we will see whether that at least prevents the power button treatment.

          Comment


          • #6
            I sometimes have random hangs with my TAHITI (GCN 1.0) card. Although I suspect that this is hardware issue, since it happens with both radeon and amdgpu drivers. The only difference is that with amdgpu it lets me login via ssh and get dmesg logs, while with radeon it hangs completely.

            Comment


            • #7
              Is there anything similar for other vendors? My work laptop with Intel+Nvidia hangs almost every week.

              Comment


              • #8
                Originally posted by Med_ View Post
                This is good news. I consistently get hangs with games. Typically once every few hours. I do not bother reporting them as I cannot reproduce on demand and the bug tracker is full of them with similar logs ([drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, last signaled seq=145101, last emitted seq=145103). I have activated the option, we will see whether that at least prevents the power button treatment.
                I have found the only reason my GPU hangs (vega64) is because it gets too hot. happens under both windows and linux - so, I have got my fan profiles cranked up. I can get many hour gaming sessions in, albeit with more fan noise.

                Comment


                • #9
                  Originally posted by Med_ View Post
                  This is good news. I consistently get hangs with games. Typically once every few hours. I do not bother reporting them as I cannot reproduce on demand and the bug tracker is full of them with similar logs ([drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, last signaled seq=145101, last emitted seq=145103). I have activated the option, we will see whether that at least prevents the power button treatment.
                  What GPU?

                  Comment


                  • #10
                    Originally posted by Med_ View Post
                    This is good news. I consistently get hangs with games. Typically once every few hours. I do not bother reporting them as I cannot reproduce on demand and the bug tracker is full of them with similar logs ([drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, last signaled seq=145101, last emitted seq=145103). I have activated the option, we will see whether that at least prevents the power button treatment.
                    What is your system specs? Do you measure temps and do you have good airflow in the case? Is your computer clean? That reset patch does not solve anything, tested with RX560. When X freezes, it does not help X to recover.

                    Comment

                    Working...
                    X