So I've got a Vega FE some time ago, and I'm having a major trouble with the open driver.
The issue is that after ~3-30 minutes of uptime, under X, the card hangs and the tach turns fully on. And I'm unable to recover a console (not even via SysRq).
I'm still able to use the hwmon, though.
amdgpu_gpu_reset only prints "[drm] No hardware hang detected. Did some blocks stall?" in the dmesg.
Memory usage has been fine, so it is not a memory leak.
The card works perfectly under AMDGPU-PRO.
I'm using the kernel driver from AMDGPU-PRO, with the Mesa userspace driver, because I don't want to quit using the awesome RT patches.
Oh, and the system's specs:
- Motherboard: MSI Z170A GAMING PRO
- Processor: Intel® Core™ i7-6700K (stock freqs)
- Distro: Arch Linux
- Kernel version: 4.9.20-4-rt16-rt-bfq
- LLVM/Clang version: 5.0 RC2
- Mesa version: 17.2-rc4
I'll try to provide more information if needed. Thanks for almost any help.
Oh, one more thing: I don't want to use Debian testing with XFCE and Ubuntu packages.
The issue is that after ~3-30 minutes of uptime, under X, the card hangs and the tach turns fully on. And I'm unable to recover a console (not even via SysRq).
I'm still able to use the hwmon, though.
amdgpu_gpu_reset only prints "[drm] No hardware hang detected. Did some blocks stall?" in the dmesg.
Memory usage has been fine, so it is not a memory leak.
The card works perfectly under AMDGPU-PRO.
I'm using the kernel driver from AMDGPU-PRO, with the Mesa userspace driver, because I don't want to quit using the awesome RT patches.
Oh, and the system's specs:
- Motherboard: MSI Z170A GAMING PRO
- Processor: Intel® Core™ i7-6700K (stock freqs)
- Distro: Arch Linux
- Kernel version: 4.9.20-4-rt16-rt-bfq
- LLVM/Clang version: 5.0 RC2
- Mesa version: 17.2-rc4
I'll try to provide more information if needed. Thanks for almost any help.
Oh, one more thing: I don't want to use Debian testing with XFCE and Ubuntu packages.
Comment