Arch Linux

Please read this before reporting a bug:
https://wiki.archlinux.org/title/Bug_reporting_guidelines

Do NOT report bugs when a package is just outdated, or it is in the AUR. Use the 'flag out of date' link on the package page, or the Mailing List.

REPEAT: Do NOT report bugs for outdated packages!
Tasklist

FS#73816 - Random system freeze with Nouveau

Attached to Project: Arch Linux
Opened by Katya (prima) - Wednesday, 16 February 2022, 13:20 GMT
Last edited by David Thurstenson (thurstylark) - Sunday, 01 May 2022, 14:59 GMT
Task Type Bug Report
Category Kernel
Status Closed
Assigned To No-one
Architecture x86_64
Severity High
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 0
Private No

Details

Description:

From time to time the computer hangs completely. Screen is frozen, mouse and keyboard not responding. The computer has to be restarted by pressing the reset button.
Everything used to work flawlessly until a few months ago and I suspect a possible kernel regression bug.

------------------------------------------

Additional info:

- It almost always happens when VLC is playing a movie. The problem happens at random intervals and cannot be easily reproduced, for example by playing the same movie again.
- I am using the nouveau driver, and never used the proprietary drivers.
- Environment is XFCE and the following packages were installed when setting up the PC:
pacman -S mesa mesa-libgl xf86-video-nouveau

------------------------------------------

Nouveau details:

pacman -Q --info xf86-video-nouveau

Name : xf86-video-nouveau
Version : 1.0.17-2
Description : Open Source 3D acceleration driver for nVidia cards
Architecture : x86_64
URL : https://nouveau.freedesktop.org/
Licenses : GPL
Groups : xorg-drivers
Provides : None
Depends On : systemd-libs mesa
Optional Deps : None
Required By : None
Optional For : None
Conflicts With : xorg-server<21.1.1 X-ABI-VIDEODRV_VERSION<25 X-ABI-VIDEODRV_VERSION>=26
Replaces : None
Installed Size : 222.91 KiB
Packager : Andreas Radke <andyrtr@archlinux.org>
Build Date : Sun 07 Nov 2021 12:07:13 PM CET
Install Date : Tue 16 Nov 2021 11:41:43 PM CET
Install Reason : Explicitly installed
Install Script : No
Validated By : Signature

------------------------------------------

Relevant kernel log collected after resetting the computer:

Feb 14 15:11:27 central kernel: ------------[ cut here ]------------
Feb 14 15:11:27 central kernel: nouveau 0000:06:00.0: timeout
Feb 14 15:11:27 central kernel: WARNING: CPU: 1 PID: 49751 at drivers/gpu/drm/nouveau/nvkm/subdev/mmu/vmmgf100.c:220 gf100_vmm_invalidate+0x21c/0x230 [nouveau]
Feb 14 15:11:27 central kernel: Modules linked in: tun nft_objref nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv>
Feb 14 15:11:27 central kernel: gpio_generic wmi acpi_cpufreq vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) sg fuse bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 uas us>
Feb 14 15:11:27 central kernel: CPU: 1 PID: 49751 Comm: kworker/1:1 Tainted: G OE 5.15.22-1-lts #1 ab185e4cf312395c72bf4109dc86c55cf26a8c06
Feb 14 15:11:27 central kernel: Hardware name: System manufacturer System Product Name/TUF B450M-PLUS GAMING, BIOS 2006 11/13/2019
Feb 14 15:11:27 central kernel: Workqueue: events nouveau_cli_work [nouveau]
Feb 14 15:11:27 central kernel: RIP: 0010:gf100_vmm_invalidate+0x21c/0x230 [nouveau]
Feb 14 15:11:27 central kernel: Code: 8b 40 10 48 8b 78 10 4c 8b 67 50 4d 85 e4 75 03 4c 8b 27 e8 06 72 80 d4 4c 89 e2 48 c7 c7 61 7a 25 c1 48 89 c6 e8 68 23 bb d4 <0f> 0b e9 56 ff ff ff e8>
Feb 14 15:11:27 central kernel: RSP: 0018:ffffae9e880c7a60 EFLAGS: 00010286
Feb 14 15:11:27 central kernel: RAX: 0000000000000000 RBX: ffff92f4ca8dcc00 RCX: 0000000000000027
Feb 14 15:11:27 central kernel: RDX: ffff92fbbea60728 RSI: 0000000000000001 RDI: ffff92fbbea60720
Feb 14 15:11:27 central kernel: RBP: ffff92f4c6de3600 R08: 0000000000000000 R09: ffffae9e880c7888
Feb 14 15:11:27 central kernel: R10: ffffae9e880c7880 R11: ffff92fbdf302318 R12: ffff92f4c18a25e0
Feb 14 15:11:27 central kernel: R13: ffff92f5058ff7c0 R14: ffff92f4f68b1a80 R15: ffff92f5061de000
Feb 14 15:11:27 central kernel: FS: 0000000000000000(0000) GS:ffff92fbbea40000(0000) knlGS:0000000000000000
Feb 14 15:11:27 central kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 14 15:11:27 central kernel: CR2: 00007fb4e3600950 CR3: 0000000130cac000 CR4: 0000000000350ee0
Feb 14 15:11:27 central kernel: Call Trace:
Feb 14 15:11:27 central kernel: <TASK>
Feb 14 15:11:27 central kernel: nvkm_vmm_iter.constprop.0+0x357/0x860 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: ? update_sd_lb_stats.constprop.0+0xf1/0x790
Feb 14 15:11:27 central kernel: ? nvkm_vmm_ptes_sparse+0x1c0/0x1c0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: ? gf100_vmm_new+0x90/0x90 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nvkm_vmm_put_locked+0x103/0x270 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: ? nvkm_vmm_ptes_sparse+0x1c0/0x1c0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: ? gf100_vmm_new+0x90/0x90 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nvkm_uvmm_mthd+0x6c9/0x6f0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nvkm_ioctl+0xd5/0x180 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nvif_object_mthd+0x168/0x200 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nvif_vmm_put+0x65/0x80 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nouveau_vma_del+0x89/0xd0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nouveau_gem_object_delete_work+0x36/0x60 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nouveau_cli_work+0xcc/0x120 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: process_one_work+0x1f1/0x390
Feb 14 15:11:27 central kernel: worker_thread+0x53/0x3e0
Feb 14 15:11:27 central kernel: ? process_one_work+0x390/0x390
Feb 14 15:11:27 central kernel: kthread+0x127/0x150
Feb 14 15:11:27 central kernel: ? set_kthread_struct+0x40/0x40
Feb 14 15:11:27 central kernel: ret_from_fork+0x22/0x30
Feb 14 15:11:27 central kernel: ------------[ cut here ]------------
Feb 14 15:11:27 central kernel: nouveau 0000:06:00.0: timeout
Feb 14 15:11:27 central kernel: WARNING: CPU: 1 PID: 49751 at drivers/gpu/drm/nouveau/nvkm/subdev/mmu/vmmgf100.c:220 gf100_vmm_invalidate+0x21c/0x230 [nouveau]
Feb 14 15:11:27 central kernel: Modules linked in: tun nft_objref nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv>
Feb 14 15:11:27 central kernel: gpio_generic wmi acpi_cpufreq vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) sg fuse bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 uas us>
Feb 14 15:11:27 central kernel: CPU: 1 PID: 49751 Comm: kworker/1:1 Tainted: G OE 5.15.22-1-lts #1 ab185e4cf312395c72bf4109dc86c55cf26a8c06
Feb 14 15:11:27 central kernel: Hardware name: System manufacturer System Product Name/TUF B450M-PLUS GAMING, BIOS 2006 11/13/2019
Feb 14 15:11:27 central kernel: Workqueue: events nouveau_cli_work [nouveau]
Feb 14 15:11:27 central kernel: RIP: 0010:gf100_vmm_invalidate+0x21c/0x230 [nouveau]
Feb 14 15:11:27 central kernel: Code: 8b 40 10 48 8b 78 10 4c 8b 67 50 4d 85 e4 75 03 4c 8b 27 e8 06 72 80 d4 4c 89 e2 48 c7 c7 61 7a 25 c1 48 89 c6 e8 68 23 bb d4 <0f> 0b e9 56 ff ff ff e8>
Feb 14 15:11:27 central kernel: RSP: 0018:ffffae9e880c7a60 EFLAGS: 00010286
Feb 14 15:11:27 central kernel: RAX: 0000000000000000 RBX: ffff92f4ca8dcc00 RCX: 0000000000000027
Feb 14 15:11:27 central kernel: RDX: ffff92fbbea60728 RSI: 0000000000000001 RDI: ffff92fbbea60720
Feb 14 15:11:27 central kernel: RBP: ffff92f4c6de3600 R08: 0000000000000000 R09: ffffae9e880c7888
Feb 14 15:11:27 central kernel: R10: ffffae9e880c7880 R11: ffff92fbdf302318 R12: ffff92f4c18a25e0
Feb 14 15:11:27 central kernel: R13: ffff92f5058ff7c0 R14: ffff92f4f68b1a80 R15: ffff92f5061de000
Feb 14 15:11:27 central kernel: FS: 0000000000000000(0000) GS:ffff92fbbea40000(0000) knlGS:0000000000000000
Feb 14 15:11:27 central kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 14 15:11:27 central kernel: CR2: 00007fb4e3600950 CR3: 0000000130cac000 CR4: 0000000000350ee0
Feb 14 15:11:27 central kernel: Call Trace:
Feb 14 15:11:27 central kernel: <TASK>
Feb 14 15:11:27 central kernel: nvkm_vmm_iter.constprop.0+0x357/0x860 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: ? update_sd_lb_stats.constprop.0+0xf1/0x790
Feb 14 15:11:27 central kernel: ? nvkm_vmm_ptes_sparse+0x1c0/0x1c0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: ? gf100_vmm_new+0x90/0x90 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nvkm_vmm_put_locked+0x103/0x270 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: ? nvkm_vmm_ptes_sparse+0x1c0/0x1c0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: ? gf100_vmm_new+0x90/0x90 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nvkm_uvmm_mthd+0x6c9/0x6f0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nvkm_ioctl+0xd5/0x180 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nvif_object_mthd+0x168/0x200 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nvif_vmm_put+0x65/0x80 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nouveau_vma_del+0x89/0xd0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nouveau_gem_object_delete_work+0x36/0x60 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: nouveau_cli_work+0xcc/0x120 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:27 central kernel: process_one_work+0x1f1/0x390
Feb 14 15:11:27 central kernel: worker_thread+0x53/0x3e0
Feb 14 15:11:27 central kernel: ? process_one_work+0x390/0x390
Feb 14 15:11:27 central kernel: kthread+0x127/0x150
Feb 14 15:11:27 central kernel: ? set_kthread_struct+0x40/0x40
Feb 14 15:11:27 central kernel: ret_from_fork+0x22/0x30
Feb 14 15:11:27 central kernel: </TASK>
Feb 14 15:11:27 central kernel: ---[ end trace dd1c078793b92f19 ]---
Feb 14 15:11:29 central kernel: ------------[ cut here ]------------
Feb 14 15:11:29 central kernel: nouveau 0000:06:00.0: timeout
Feb 14 15:11:29 central kernel: WARNING: CPU: 2 PID: 1406 at drivers/gpu/drm/nouveau/nvkm/subdev/mmu/vmmgf100.c:220 gf100_vmm_invalidate+0x21c/0x230 [nouveau]
Feb 14 15:11:29 central kernel: Modules linked in: tun nft_objref nf_conntrack_netbios_ns nf_conntrack_broadcast nft_fib_inet nft_fib_ipv4 nft_fib_ipv6 nft_fib nft_reject_inet nf_reject_ipv>
Feb 14 15:11:29 central kernel: gpio_generic wmi acpi_cpufreq vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) sg fuse bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 uas us>
Feb 14 15:11:29 central kernel: CPU: 2 PID: 1406 Comm: Renderer Tainted: G W OE 5.15.22-1-lts #1 ab185e4cf312395c72bf4109dc86c55cf26a8c06
Feb 14 15:11:29 central kernel: Hardware name: System manufacturer System Product Name/TUF B450M-PLUS GAMING, BIOS 2006 11/13/2019
Feb 14 15:11:29 central kernel: RIP: 0010:gf100_vmm_invalidate+0x21c/0x230 [nouveau]
Feb 14 15:11:29 central kernel: Code: 8b 40 10 48 8b 78 10 4c 8b 67 50 4d 85 e4 75 03 4c 8b 27 e8 06 72 80 d4 4c 89 e2 48 c7 c7 61 7a 25 c1 48 89 c6 e8 68 23 bb d4 <0f> 0b e9 56 ff ff ff e8>
Feb 14 15:11:29 central kernel: RSP: 0018:ffffae9e83247700 EFLAGS: 00010286
Feb 14 15:11:29 central kernel: RAX: 0000000000000000 RBX: ffff92f4ca8dcc00 RCX: 0000000000000027
Feb 14 15:11:29 central kernel: RDX: ffff92fbbeaa0728 RSI: 0000000000000001 RDI: ffff92fbbeaa0720
Feb 14 15:11:29 central kernel: RBP: ffff92f4ee10a800 R08: 0000000000000000 R09: ffffae9e83247528
Feb 14 15:11:29 central kernel: R10: ffffae9e83247520 R11: ffff92fbdf302720 R12: ffff92f4c18a25e0
Feb 14 15:11:29 central kernel: R13: ffff92f4ee121080 R14: 0000000000000000 R15: ffff92f4f698d000
Feb 14 15:11:29 central kernel: FS: 00007fb4f15ff640(0000) GS:ffff92fbbea80000(0000) knlGS:0000000000000000
Feb 14 15:11:29 central kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 14 15:11:29 central kernel: CR2: 00007f7edb165000 CR3: 0000000108df2000 CR4: 0000000000350ee0
Feb 14 15:11:29 central kernel: Call Trace:
Feb 14 15:11:29 central kernel: <TASK>
Feb 14 15:11:29 central kernel: nvkm_vmm_iter.constprop.0+0x357/0x860 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: ? ktime_get+0x38/0x90
Feb 14 15:11:29 central kernel: ? nvkm_vmm_ref_sptes.isra.0+0x1b0/0x1b0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: ? gf100_vmm_pgt_dma+0x2f0/0x2f0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: nvkm_vmm_ptes_get_map+0x2c/0x90 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: ? nvkm_vmm_ref_sptes.isra.0+0x1b0/0x1b0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: ? gf100_vmm_pgt_dma+0x2f0/0x2f0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: nvkm_vmm_map+0x1d4/0x350 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: nvkm_vram_map+0x56/0x80 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: nvkm_uvmm_mthd+0x5c2/0x6f0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: nvkm_ioctl+0xd5/0x180 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: nvif_object_mthd+0x168/0x200 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: nvif_vmm_map+0x11e/0x130 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: nouveau_mem_map+0x82/0xe0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: nouveau_vma_new+0x1df/0x200 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: nouveau_gem_object_open+0xd0/0x150 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: drm_gem_handle_create_tail+0xd4/0x180
Feb 14 15:11:29 central kernel: nouveau_gem_ioctl_new+0x92/0x100 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: ? nouveau_gem_new+0xe0/0xe0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: drm_ioctl_kernel+0xb2/0x100
Feb 14 15:11:29 central kernel: drm_ioctl+0x22a/0x3d0
Feb 14 15:11:29 central kernel: ? nouveau_gem_new+0xe0/0xe0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: nouveau_drm_ioctl+0x55/0xa0 [nouveau 076acf09498154a0dfd79cd191c7836805dd3e9c]
Feb 14 15:11:29 central kernel: __x64_sys_ioctl+0x82/0xb0
Feb 14 15:11:29 central kernel: do_syscall_64+0x5c/0x80
Feb 14 15:11:29 central kernel: ? handle_mm_fault+0xcf/0x2a0
Feb 14 15:11:29 central kernel: ? do_user_addr_fault+0x1d9/0x670
Feb 14 15:11:29 central kernel: ? exc_page_fault+0x72/0x150
Feb 14 15:11:29 central kernel: entry_SYSCALL_64_after_hwframe+0x44/0xae
Feb 14 15:11:29 central kernel: RIP: 0033:0x7fb51324459b
Feb 14 15:11:29 central kernel: Code: ff ff ff 85 c0 79 9b 49 c7 c4 ff ff ff ff 5b 5d 4c 89 e0 41 5c c3 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01>
Feb 14 15:11:29 central kernel: RSP: 002b:00007fb4f15fcf88 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Feb 14 15:11:29 central kernel: RAX: ffffffffffffffda RBX: 00007fb4f15fcfe0 RCX: 00007fb51324459b
Feb 14 15:11:29 central kernel: RDX: 00007fb4f15fcfe0 RSI: 00000000c0306480 RDI: 0000000000000028
Feb 14 15:11:29 central kernel: RBP: 00000000c0306480 R08: 0000000000000000 R09: 0000000000000000
Feb 14 15:11:29 central kernel: R10: 00000000ffffffff R11: 0000000000000246 R12: 00007fb4fb988870
Feb 14 15:11:29 central kernel: R13: 0000000000000028 R14: 00007fb4f15fcfe0 R15: 000000000000a000
Feb 14 15:11:29 central kernel: </TASK>
Feb 14 15:11:29 central kernel: ---[ end trace dd1c078793b92f1a ]---
Feb 14 15:11:30 central kernel: nouveau 0000:06:00.0: fifo: SCHED_ERROR 0a [CTXSW_TIMEOUT]
Feb 14 15:11:30 central kernel: nouveau 0000:06:00.0: fifo: runlist 0: scheduled for recovery
Feb 14 15:11:30 central kernel: nouveau 0000:06:00.0: fifo: channel 8: killed
Feb 14 15:11:30 central kernel: nouveau 0000:06:00.0: fifo: engine 0: scheduled for recovery

------------------------------------------

Hardware details: NVIDIA GeForce GT 710

Output of lspci -v:

06:00.0 VGA compatible controller: NVIDIA Corporation GK208B [GeForce GT 710] (rev a1) (prog-if 00 [VGA controller])
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 8c93
Flags: bus master, fast devsel, latency 0, IRQ 61, IOMMU group 16
Memory at f5000000 (32-bit, non-prefetchable) [size=16M]
Memory at e8000000 (64-bit, prefetchable) [size=128M]
Memory at f0000000 (64-bit, prefetchable) [size=32M]
I/O ports at f000 [size=128]
Expansion ROM at 000c0000 [disabled] [size=128K]
Capabilities: [60] Power Management version 3
Capabilities: [68] MSI: Enable+ Count=1/1 Maskable- 64bit+
Capabilities: [78] Express Legacy Endpoint, MSI 00
Capabilities: [100] Virtual Channel
Capabilities: [128] Power Budgeting <?>
Capabilities: [600] Vendor Specific Information: ID=0001 Rev=1 Len=024 <?>
Kernel driver in use: nouveau
Kernel modules: nouveau

06:00.1 Audio device: NVIDIA Corporation GK208 HDMI/DP Audio Controller (rev a1)
Subsystem: Micro-Star International Co., Ltd. [MSI] Device 8c93
Flags: bus master, fast devsel, latency 0, IRQ 59, IOMMU group 16
Memory at f6080000 (32-bit, non-prefetchable) [size=16K]
Capabilities: [60] Power Management version 3
Capabilities: [68] MSI: Enable- Count=1/1 Maskable- 64bit+
Capabilities: [78] Express Endpoint, MSI 00
Kernel driver in use: snd_hda_intel
Kernel modules: snd_hda_intel

------------------------------------------

Frame buffer details:

cat /proc/fb
0 nouveaudrmfb

------------------------------------------

I am using the LTS kernel:

cat /proc/version
Linux version 5.15.23-2-lts (linux-lts@archlinux) (gcc (GCC) 11.2.0, GNU ld (GNU Binutils) 2.38) #1 SMP Tue, 15 Feb 2022 12:04:53 +0000

------------------------------------------

Steps to reproduce: the problem occurs sporadically but VLC may increase the odds of a freeze.

------------------------------------------

What I attempted to resolve the problem:

1. Add kernel parameter: nouveau.noaccel=1
as suggested here: https://fedoraproject.org/wiki/Common_kernel_problems#Systems_with_nVidia_adapters_using_the_nouveau_driver_lock_up_randomly and here: https://wiki.archlinux.org/title/Nouveau#Random_lockups_with_kernel_error_messages

2. Try another kernel setting: modeset=0 to disable KMS
Source: https://nouveau.freedesktop.org/KernelModeSetting.html

I removed nouveau.noaccel=1 and added modeset=0, so my current cmdline is as follows:

cat /proc/cmdline
initrd=\initramfs-linux-lts.img cryptdevice=UUID=f627282b-e088-4014-b737-b85f03540abe:luks root=/dev/mapper/rootvg-root rw modeset=0

=> A freeze has occurred again after making that change, so it does not seem to fix the problem. However, it may be worthy to note some possible changes:
- VLC was not running at the time
- the mouse was still moving, but everything else was frozen

------------------------------------------

Config files:

/etc/X11/xorg.conf.d/20-nouveau.conf:

Section "Device"
Identifier "NVIDIA Card"
Driver "nouveau"
EndSection


Attached files:
- current running Xorg log

------------------------------------------

Related bug reports:

 FS#59089  - [linux] Nouveau freezes system completely
This task depends upon

Closed by  David Thurstenson (thurstylark)
Sunday, 01 May 2022, 14:59 GMT
Reason for closing:  No response
Comment by Andreas Radke (AndyRTR) - Wednesday, 16 February 2022, 16:48 GMT
Please try with mainline 5.16.x kernel to find, if kernel drm causes this.

Loading...