Anyway, I was playing Starfield, and had been for several hours (pause game a few times to smoke and eat etc.). I was on dessicated, barren, desert Earth heading back to my ship after a mission at an ancient NASA launch tower, bounding over dunes, jet pack jumping, breaking my legs on a high grav planet lol, when the screen froze (audio was still working). I sighed, and waited for the inevitable game crash, but it was more serious. The display went into "no signal" mode and powered off, I had no keyboard input to switch to a new tty on console. You can see the sequence of events, the ACPI power button shut down worked, but something wouldn't unhook and the filesystems didn't get unmounted and the kernel didn't shut down. The very last entry is me pressing the power button again, and the event registered but at that point there would have been nothing to dispatch the event to (acpid already shut down). The system wasn't halted, but the kernel was fuct. After that, I held the power button until power off.
I am not pleased. I have not had anything like that happen in 4 years. Back to driver recovery not working correctly. (Yesterday's kernel? Too hard to say. I'll have to see if it's an isolated incident. Starfield has crashed a few times since I've had it, but nothing like this has ever happened)
Code: Select all
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:2 pasid:32776, for process Starfield.exe pid 10223 thread vkd3d_queue pid 10291)
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x00008000e0400000 from client 0x1b (UTCL2)
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00201431
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: SQC (data) (0xa)
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x1
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x3
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:24 vmid:2 pasid:32776, for process Starfield.exe pid 10223 thread vkd3d_queue pid 10291)
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x00008000e0400000 from client 0x1b (UTCL2)
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x00000000
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: CB/DB (0x0)
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0x0
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
Apr 04 02:12:53 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: RW: 0x0
Apr 04 02:13:04 nicetry kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=26306643, emitted seq=26306645
Apr 04 02:13:04 nicetry kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Starfield.exe pid 10223 thread vkd3d_queue pid 10291
Apr 04 02:13:04 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: GPU reset begin!
Apr 04 02:13:08 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: failed to suspend display audio
Apr 04 02:13:08 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: MODE1 reset
Apr 04 02:13:08 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: GPU mode1 reset
Apr 04 02:13:08 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: GPU smu mode1 reset
Apr 04 02:13:19 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: GPU reset succeeded, trying to resume
Apr 04 02:13:19 nicetry kernel: [drm] PCIE GART of 512M enabled (table at 0x0000008000900000).
Apr 04 02:13:19 nicetry kernel: [drm] VRAM is lost due to GPU reset!
Apr 04 02:13:19 nicetry kernel: [drm] PSP is resuming...
Apr 04 02:13:24 nicetry kernel: [drm:psp_v11_0_memory_training [amdgpu]] *ERROR* send training msg failed.
Apr 04 02:13:24 nicetry kernel: [drm:psp_resume [amdgpu]] *ERROR* Failed to process memory training!
Apr 04 02:13:24 nicetry kernel: [drm:amdgpu_device_fw_loading [amdgpu]] *ERROR* resume of IP block <psp> failed -62
Apr 04 02:13:24 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: GPU reset(2) failed
Apr 04 02:13:24 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: GPU reset end with ret = -62
Apr 04 02:13:24 nicetry kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* GPU Recovery Failed: -62
Apr 04 02:13:24 nicetry kernel: [drm] Skip scheduling IBs!
Apr 04 02:13:24 nicetry kernel: [drm] Skip scheduling IBs!
..... snipped multiple lines of the same shit
Apr 04 02:13:24 nicetry kernel: [drm:amdgpu_cs_ioctl [amdgpu]] *ERROR* Failed to initialize parser -125!
Apr 04 02:13:34 nicetry kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=522409, emitted seq=522411
Apr 04 02:13:34 nicetry kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process pid 0 thread pid 0
Apr 04 02:13:34 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: GPU reset begin!
Apr 04 02:13:38 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: failed to suspend display audio
Apr 04 02:13:38 nicetry kernel: amdgpu 0000:03:00.0: amdgpu: Failed to disallow df cstate
Apr 04 02:14:37 nicetry systemd-logind[485]: Power key pressed short.
Apr 04 02:14:37 nicetry systemd-logind[485]: Powering off...
Apr 04 02:14:37 nicetry systemd-logind[485]: System is powering down.
Apr 04 02:14:37 nicetry systemd[1]: Stopping Session c3 of User grogan...
Apr 04 02:14:37 nicetry systemd[1]: Removed slice Slice /system/modprobe.
Apr 04 02:14:37 nicetry systemd[1]: Stopped target Graphical Interface.
Apr 04 02:14:37 nicetry systemd[1]: Stopped target Multi-User System.
Apr 04 02:14:37 nicetry systemd[1]: Stopped target Login Prompts.
Apr 04 02:14:37 nicetry systemd[1]: Stopped target Sound Card.
Apr 04 02:14:37 nicetry systemd[1]: Stopped target Timer Units.
Apr 04 02:14:37 nicetry systemd[1]: archlinux-keyring-wkd-sync.timer: Deactivated successfully.
Apr 04 02:14:37 nicetry systemd[1]: Stopped Refresh existing PGP keys of archlinux-keyring regularly.
Apr 04 02:14:37 nicetry systemd[1]: man-db.timer: Deactivated successfully.
Apr 04 02:14:37 nicetry systemd[1]: Stopped Daily man-db regeneration.
Apr 04 02:14:37 nicetry systemd[1]: shadow.timer: Deactivated successfully.
Apr 04 02:14:37 nicetry systemd[1]: Stopped Daily verification of password and group files.
Apr 04 02:14:37 nicetry systemd[1]: systemd-tmpfiles-clean.timer: Deactivated successfully.
Apr 04 02:14:37 nicetry systemd[1]: Stopped Daily Cleanup of Temporary Directories.
Apr 04 02:14:37 nicetry systemd[1]: Stopping ACPI event daemon...
Apr 04 02:14:37 nicetry systemd[1]: Stopping Getty on tty1...
Apr 04 02:14:37 nicetry login[8985]: pam_unix(login:session): session closed for user grogan
Apr 04 02:14:37 nicetry systemd[1]: Starting Generate shutdown-ramfs...
Apr 04 02:14:37 nicetry systemd[1]: Stopping Authorization Manager...
Apr 04 02:14:37 nicetry systemd[1]: Stopping RealtimeKit Scheduling Policy Service...
Apr 04 02:14:37 nicetry systemd[1]: Stopping Load/Save OS Random Seed...
Apr 04 02:14:37 nicetry systemd[1]: Stopping Daemon for power management...
Apr 04 02:14:37 nicetry systemd[1]: rtkit-daemon.service: Deactivated successfully.
Apr 04 02:14:37 nicetry systemd[1]: Stopped RealtimeKit Scheduling Policy Service.
Apr 04 02:14:37 nicetry systemd[1]: getty@tty1.service: Deactivated successfully.
Apr 04 02:14:37 nicetry systemd[1]: Stopped Getty on tty1.
Apr 04 02:14:37 nicetry systemd[1]: upower.service: Deactivated successfully.
Apr 04 02:14:37 nicetry systemd[1]: Stopped Daemon for power management.
Apr 04 02:14:37 nicetry systemd[1]: upower.service: Consumed 1.891s CPU time.
Apr 04 02:14:37 nicetry systemd-logind[485]: Session c3 logged out. Waiting for processes to exit.
Apr 04 02:14:37 nicetry systemd[1]: Removed slice Slice /system/getty.
Apr 04 02:14:37 nicetry systemd[1]: polkit.service: Deactivated successfully.
Apr 04 02:14:37 nicetry systemd[1]: Stopped Authorization Manager.
Apr 04 02:14:37 nicetry acpid[483]: exiting
Apr 04 02:14:37 nicetry systemd[1]: acpid.service: Deactivated successfully.
Apr 04 02:14:37 nicetry systemd[1]: Stopped ACPI event daemon.
Apr 04 02:14:37 nicetry systemd[1]: acpid.service: Consumed 2.559s CPU time.
Apr 04 02:14:38 nicetry mkinitcpio[11134]: ==> Starting build: 'none'
Apr 04 02:14:38 nicetry mkinitcpio[11134]: -> Running build hook: [sd-shutdown]
Apr 04 02:14:38 nicetry systemd[1]: systemd-random-seed.service: Deactivated successfully.
Apr 04 02:14:38 nicetry systemd[1]: Stopped Load/Save OS Random Seed.
Apr 04 02:14:38 nicetry mkinitcpio[11134]: ==> Build complete.
Apr 04 02:14:38 nicetry systemd[1]: mkinitcpio-generate-shutdown-ramfs.service: Deactivated successfully.
Apr 04 02:14:38 nicetry systemd[1]: Finished Generate shutdown-ramfs.
Apr 04 02:15:08 nicetry systemd-logind[485]: Power key pressed short.