Announcement

Collapse
No announcement yet.

AMDGPU Reset Recovery To Be Flipped On By Default For Newer Radeon GPUs

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
    Brisse
    Senior Member

  • Brisse
    replied
    Originally posted by FireBurn View Post
    This'll be handy on my PRIME system, when it overheats it usually leads to lockups where a GPU reset might fix it
    That's not a fix. A fix is to fix the overheating problem. Overheating is not a normal operating condition for a computer or pretty much for any system electric or otherwise. That's why it's called OVERheating.

    Leave a comment:

  • finalzone
    Senior Member

  • finalzone
    replied
    Originally posted by wizard69 View Post
    Interesting as my Ryzen Mobile does hang randomly but infrequently. I’m not even sure it is a GPU hang though it certainly feels like it.
    For those well versed in Linux what is the best way to turn this on with a new distro like Fedora 29?
    Current suggestion is to do fresh install to resolve the issue from users running on updated beta release.
    Test done on HP Envy x360 Ryzen 2500U.

    Leave a comment:

  • TheYoshiGuy
    Junior Member

  • TheYoshiGuy
    replied
    Everyone who tries to run Mario Kart 8 under Cemu in wine on a Vega64/56, will get a system-wide freeze which can only be recovered from via REISUB or the reset switch.
    They're not making any progress here either: https://bugs.freedesktop.org/show_bug.cgi?id=105251#c17

    I test this every Sunday morning after updating everything. Hopefully this patch will save me from having to reset at least.

    Leave a comment:

  • FireBurn
    Senior Member

  • FireBurn
    replied
    This'll be handy on my PRIME system, when it overheats it usually leads to lockups where a GPU reset might fix it

    Leave a comment:

  • shmerl
    Senior Member

  • shmerl
    replied
    Originally posted by dwagner View Post
    Not sure if I missed an irony tag, but the same hope was stated when amdgpu.dc=1 became the default.
    I don't think anyone declared recovery ready before.

    Leave a comment:

  • dwagner
    Senior Member

  • dwagner
    replied
    Originally posted by shmerl View Post
    It wasn't ready. Apparently now it is.
    Not sure if I missed an irony tag, but the same hope was stated when amdgpu.dc=1 became the default.

    Leave a comment:

  • shmerl
    Senior Member

  • shmerl
    replied
    Originally posted by dwagner View Post
    I tried amdgpu.gpu_recovery=1 in the past, but for the system crashes I experience when using amdgpu.dc=1 it makes no difference whether amdgpu.gpu_recovery is set to 0 or 1 - the system crashes hard either way.
    It wasn't ready. Apparently now it is.

    Leave a comment:

  • dwagner
    Senior Member

  • dwagner
    replied
    I tried amdgpu.gpu_recovery=1 in the past, but for the system crashes I experience when using amdgpu.dc=1 it makes no difference whether amdgpu.gpu_recovery is set to 0 or 1 - the system crashes hard either way.

    Leave a comment:

  • raonlinux
    Phoronix Member

  • raonlinux
    replied
    Originally posted by Med_ View Post
    This is good news. I consistently get hangs with games. Typically once every few hours. I do not bother reporting them as I cannot reproduce on demand and the bug tracker is full of them with similar logs ([drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, last signaled seq=145101, last emitted seq=145103). I have activated the option, we will see whether that at least prevents the power button treatment.
    As you I' m having the same issues, I just need to hard reboot, and happen in the same card RX480, I tested everything, but is getting less with the update of the kernel. The main problem as you said, this cannot reproduce if happen you make a hard reboot and then the card is fine. I hope this option is soon, I will like that you can reset the gpu without freeze the system.

    Leave a comment:

  • RavFX
    Senior Member

  • RavFX
    replied
    I do experience crash often with the game EVERSPACE.

    It append at random time, usually it take less than 1h30 to occur. I thought it was overclocked related but it seam it's the only game I crash and it take the whole system with it. Reset required. Sometime using the reset button still leave anomalies like, extra stuff that does not work properly in the OS. So when that game use to crash my system, I did use the power supply switch to "reset" instead.

    I do not play that game anymore due to these crash, with is unfortunate as it was one of my favorite that have a Linux version on Steam..

    GPU : RX480 @ 1500ish Mhz 1.35V (It's not the overclocking, it also append with stock clock/volt)
    CPU : Ryzen 7 2700x @ 4.3Ghz (was also appening with my R7 1700 OC or not OC, but less than with the 2700x)
    Kernel : usually latest, release or git, it's variable
    Mesa : latest git updated regulary

    The GPU is watercooled with a custom loop, it does not usually reach 65C.
    No other games take my system down, at lease for now.

    Leave a comment:

Working...
X