Radeon RX 5700 XT driver crashes persist

Hello,

For some reason my system will randomly crash on me whenever I’m doing light-duty work. I’ve had these problems ever since I got the video card, and these cards have been known to crash from the beginning.
This is what I know so far:

  • Recently, it has usually crashed on weekday mornings

  • Warm rebooting might be a trigger to make the crashes happen

Normally, 1 of 2 scenarios happen:

  1. The screen freezes, goes black for <5 seconds, and then shows a frozen background of my work environment with red and green dots scattered around the screen. I can still move my mouse but can’t launch any applications, click on stuff, etc.

  2. The screen freezes, goes black, and immediately reboots.

Browsing through the internet, it appears that AMD has fixed alot of these problems for others, but I’ve seen no improvements whatsoever. Does anyone know of a solution?
I’ve tried:

  • running a second power cable to the GPU
  • upgrading the CPU
  • upgrading the motherboard (and the chipset)
  • overclocking and de-overclocking the rest of the system
  • going from 2 monitors to 1

I’m wondering if I need to change my window manager or desktop environment - would Mate be something to try?

for reference, here’s the output of lspci:

lspci

[colson@archlabs ~]$ lspci
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Root Complex
00:01.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:01.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse GPP Bridge
00:01.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse GPP Bridge
00:02.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:03.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:03.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse GPP Bridge
00:04.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:05.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:07.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:07.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Internal PCIe GPP Bridge 0 to bus[E:B]
00:08.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Host Bridge
00:08.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Internal PCIe GPP Bridge 0 to bus[E:B]
00:08.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Internal PCIe GPP Bridge 0 to bus[E:B]
00:08.3 PCI bridge: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Internal PCIe GPP Bridge 0 to bus[E:B]
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 61)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 51)
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse Device 24: Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse Device 24: Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse Device 24: Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse Device 24: Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse Device 24: Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse Device 24: Function 5
00:18.6 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse Device 24: Function 6
00:18.7 Host bridge: Advanced Micro Devices, Inc. [AMD] Matisse Device 24: Function 7
01:00.0 Non-Volatile memory controller: Samsung Electronics Co Ltd NVMe SSD Controller SM981/PM981/PM983
02:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Matisse Switch Upstream
03:05.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Matisse PCIe GPP Bridge
03:08.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Matisse PCIe GPP Bridge
03:09.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Matisse PCIe GPP Bridge
03:0a.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] Matisse PCIe GPP Bridge
04:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 15)
05:00.0 Non-Essential Instrumentation [1300]: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Reserved SPP
05:00.1 USB controller: Advanced Micro Devices, Inc. [AMD] Matisse USB 3.0 Host Controller
05:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Matisse USB 3.0 Host Controller
06:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 51)
07:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 51)
08:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Upstream Port of PCI Express Switch (rev c1)
09:00.0 PCI bridge: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 XL Downstream Port of PCI Express Switch
0a:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 [Radeon RX 5600 OEM/5600 XT / 5700/5700 XT] (rev c1)
0a:00.1 Audio device: Advanced Micro Devices, Inc. [AMD/ATI] Navi 10 HDMI Audio
0b:00.0 Non-Essential Instrumentation [1300]: Advanced Micro Devices, Inc. [AMD] Starship/Matisse PCIe Dummy Function
0c:00.0 Non-Essential Instrumentation [1300]: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Reserved SPP
0c:00.1 Encryption controller: Advanced Micro Devices, Inc. [AMD] Starship/Matisse Cryptographic Coprocessor PSPCPP
0c:00.3 USB controller: Advanced Micro Devices, Inc. [AMD] Matisse USB 3.0 Host Controller
0c:00.4 Audio device: Advanced Micro Devices, Inc. [AMD] Starship/Matisse HD Audio Controller
0d:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 51)
0e:00.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 51)
[colson@archlabs ~]$

and my system info:

image

Thank you in advance to anyone who can help me!

Radeon Navi cards still seem to have a problem with the current kernel. My Ryzen 7 3750H + Radeon Pro 5500M + Radeon Vega 10 tends to hang with no reason. For work I use the LTS kernel.

I’ve been also experimenting with the kernel parameters, and at the moment I use a set like this:

GRUB_CMDLINE_LINUX_DEFAULT="acpi_backlight=vendor radeon.dpm=1 idle=nomwait amdgpu.dc=1 amdgpu.vm_update_mode=3"

What seems to make a difference is amdgpu.vm_update_mode=3.

1 Like

OK, I’ll try that and report back if it helps. Thank you!!!

Also, following this wiki page, I pre-load amdgpu and radeon modules:

Make sure amdgpu has been set as first module in the Mkinitcpio#MODULES array, e.g. MODULES=(amdgpu radeon) .

No idea if it makes any difference.

1 Like

Is this another GRUB command-line addition like you listed above? If so, where does it go in the GRUB launch file?

At the very beginning of the /etc/mkinitcpio.conf file:

MODULES=(amdgpu radeon)

[edit] Update grub having entered the changes.

1 Like

OK, thanks!
The

MODULES(amdgpu radeon)

was blank to start off, so that should definitely help

Don t forget to update grub after being saved, guess that you know it, just a reminder;

Edited, no good command.

1 Like

Just did!

grub-mkconfig -o /boot/grub/grub.cfg
1 Like

This should be empty by default.

1 Like

@altman, in Arch update-grub exists as an AUR package only. :rofl:

Lol, you bad boy !

Thx for the reminder @nwg

1 Like

[colson@archlabs ~]$ sudo update-grub
sudo: update-grub: command not found, moron!

Thx for me being a moron, will remember that one @ColsonP2 .

https://aur.archlinux.org/packages/update-grub executes grub-mkconfig -o /boot/grub/grub.cfg. I used to use it some time ago.

1 Like

Probably written in <5 lines of code but gets used by a ton of people :laughing:

#!/bin/sh
set -e
exec grub-mkconfig -o /boot/grub/grub.cfg "$@"

Just for curiosity: all day long the 5.6.12-arch1-1 kernel would crash for me after several seconds. Now it works, but the graphics performance is low, same as on the LTS kernel. I suspect that Steam uses the Vega card.

[edit] It seems to vary between consecutive boots. This time it started well.

I think the command to update grub is also a shell alias if you selected zsh as your default shell during ArchLabs installation.

2 Likes

Interesting. I must give zsh a try one day.