FS#72620 - linux-lts 5.10.77-1 from testing freezes with amdgpu issue

Attached to Project: Arch Linux
Opened by Matthias Bodenbinder (mbod) - Wednesday, 03 November 2021, 07:11 GMT
Last edited by Andreas Radke (AndyRTR) - Thursday, 04 November 2021, 18:42 GMT
Task Type Bug Report
Category Kernel
Status Closed
Assigned To Andreas Radke (AndyRTR)
Architecture All
Severity Low
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 0
Private No

Details

Description: linux-lts 5.10.77-1 from testing freezes with amdgpu issue

Steps to reproduce:
I don't know how to reproduce it. It happened in the middle of a web surfing session.


1# journalctl -b -1 | grep WARNING
Nov 03 07:45:41 rakete kernel: WARNING: CPU: 2 PID: 4962 at lib/refcount.c:28 refcount_warn_saturate+0xa6/0xf0
Nov 03 07:45:41 rakete kernel: WARNING: CPU: 8 PID: 3461 at lib/refcount.c:25 refcount_warn_saturate+0x68/0xf0
Nov 03 07:45:41 rakete kernel: WARNING: CPU: 0 PID: 3461 at lib/refcount.c:22 refcount_warn_saturate+0x49/0xf0
Nov 03 07:45:45 rakete kernel: WARNING: CPU: 10 PID: 4038 at include/linux/dma-fence.h:475 amdgpu_sync_keep_later+0x72/0xb0 [amdgpu]
Nov 03 07:45:45 rakete kernel: WARNING: CPU: 10 PID: 4038 at include/linux/dma-fence.h:475 amdgpu_sync_keep_later+0x72/0xb0 [amdgpu]
[...]


The last warning, from amdgpu_sync_keep_later, is repeated several thousand times. Here are more details:


Nov 03 07:45:41 rakete kernel: ------------[ cut here ]------------
Nov 03 07:45:41 rakete kernel: refcount_t: underflow; use-after-free.
Nov 03 07:45:41 rakete kernel: WARNING: CPU: 2 PID: 4962 at lib/refcount.c:28 refcount_warn_saturate+0xa6/0xf0
Nov 03 07:45:41 rakete kernel: Modules linked in: cfg80211 ccm algif_aead cbc des_generic libdes ecb algif_skcipher cmac md4 algif_hash af_alg it87 hwmon_vid rc_tt_1500 stb6100 isl6423 amdgpu stb0899 mxm_wmi wmi_bmof dvb_usb_pctv452e(OE) dvb_usb(OE) amd64_edac_mod edac_mce_amd snd_hda_codec_realtek btusb kvm_amd snd_hda_codec_generic btrtl btbcm ledtrig_audio snd_hda_codec_hdmi btintel ttpci_eeprom kvm bluetooth snd_hda_intel dvb_core snd_intel_dspcfg soundwire_intel soundwire_generic_allocation videobuf2_vmalloc soundwire_cadence videobuf2_memops videobuf2_common snd_hda_codec videodev snd_hda_core snd_hwdep irqbypass ecdh_generic crct10dif_pclmul soundwire_bus rfkill crc32_pclmul ghash_clmulni_intel aesni_intel mc vfat gpu_sched ecc fat snd_soc_core crc16 crypto_simd ttm snd_compress cryptd ac97_bus glue_helper snd_pcm_dmaengine rapl snd_pcm zfs(POE) drm_kms_helper snd_timer ccp cec snd k10temp syscopyarea sysfillrect i2c_piix4 sysimgblt soundcore fb_sys_fops igb i2c_algo_bit dca tpm_crb tpm_tis
Nov 03 07:45:41 rakete kernel: tpm_tis_core tpm zunicode(POE) wmi rng_core pinctrl_amd zzstd(OE) acpi_cpufreq zlua(OE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) drm pkcs8_key_parser sg fuse crypto_user agpgart zram bpf_preload ip_tables x_tables xfs libcrc32c crc32c_generic usbhid crc32c_intel xhci_pci xhci_pci_renesas sr_mod cdrom
Nov 03 07:45:41 rakete kernel: CPU: 2 PID: 4962 Comm: darktable-cli Tainted: P OE 5.10.77-1-lts #1
Nov 03 07:45:41 rakete kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ULTRA/X570 AORUS ULTRA, BIOS F35d 10/13/2021
Nov 03 07:45:41 rakete kernel: RIP: 0010:refcount_warn_saturate+0xa6/0xf0
Nov 03 07:45:41 rakete kernel: Code: 05 fb 12 71 01 01 e8 bc fd 55 00 0f 0b c3 80 3d e9 12 71 01 00 75 95 48 c7 c7 e0 97 99 9d c6 05 d9 12 71 01 01 e8 9d fd 55 00 <0f> 0b c3 80 3d c8 12 71 01 00 0f 85 72 ff ff ff 48 c7 c7 38 98 99
Nov 03 07:45:41 rakete kernel: RSP: 0018:ffffbaa7710b7700 EFLAGS: 00010286
Nov 03 07:45:41 rakete kernel: RAX: 0000000000000000 RBX: ffff9cb8d3501f90 RCX: 0000000000000027
Nov 03 07:45:41 rakete kernel: RDX: ffff9cc7bea98bb8 RSI: 0000000000000001 RDI: ffff9cc7bea98bb0
Nov 03 07:45:41 rakete kernel: RBP: ffff9cb8d3501f80 R08: 0000000000000000 R09: ffffbaa7710b7520
Nov 03 07:45:41 rakete kernel: R10: ffffbaa7710b7518 R11: ffff9cc7ff3227a8 R12: ffff9cb916dcae80
Nov 03 07:45:41 rakete kernel: R13: ffff9cb8e7026228 R14: ffff9cb8e7026000 R15: ffffbaa7710b7828
Nov 03 07:45:41 rakete kernel: FS: 00007fd4a9c82640(0000) GS:ffff9cc7bea80000(0000) knlGS:0000000000000000
Nov 03 07:45:41 rakete kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 03 07:45:41 rakete kernel: CR2: 00007fd37813a778 CR3: 0000000163370000 CR4: 0000000000350ee0
Nov 03 07:45:41 rakete kernel: Call Trace:
Nov 03 07:45:41 rakete kernel: amdgpu_sa_bo_remove_locked+0xab/0xb0 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_sa_bo_try_free+0x6d/0x80 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_sa_bo_new+0x12b/0x580 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_ib_get+0x3b/0x90 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_job_alloc_with_ib+0x53/0x80 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_vm_sdma_prepare+0x28/0x60 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_vm_clear_bo+0x1a9/0x410 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_vm_update_ptes+0x7f6/0x900 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_vm_bo_update_mapping.constprop.0+0x1c9/0x230 [amdgpu]
Nov 03 07:45:41 rakete kernel: ? ttm_bo_validate+0x56/0x170 [ttm]
Nov 03 07:45:41 rakete kernel: amdgpu_vm_bo_update+0x2b0/0x6e0 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_amdkfd_gpuvm_map_memory_to_gpu+0x2e6/0x660 [amdgpu]
Nov 03 07:45:41 rakete kernel: kfd_ioctl_map_memory_to_gpu+0x105/0x2c0 [amdgpu]
Nov 03 07:45:41 rakete kernel: kfd_ioctl+0x2f3/0x400 [amdgpu]
Nov 03 07:45:41 rakete kernel: ? kfd_ioctl_unmap_memory_from_gpu+0x200/0x200 [amdgpu]
Nov 03 07:45:41 rakete kernel: __x64_sys_ioctl+0x82/0xb0
Nov 03 07:45:41 rakete kernel: do_syscall_64+0x33/0x40
Nov 03 07:45:41 rakete kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
Nov 03 07:45:41 rakete kernel: RIP: 0033:0x7fd4bb98d59b
Nov 03 07:45:41 rakete kernel: Code: ff ff ff 85 c0 79 9b 49 c7 c4 ff ff ff ff 5b 5d 4c 89 e0 41 5c c3 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a5 a8 0c 00 f7 d8 64 89 01 48
Nov 03 07:45:41 rakete kernel: RSP: 002b:00007fd4a9c72d58 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Nov 03 07:45:41 rakete kernel: RAX: ffffffffffffffda RBX: 00007fd4a9c72da0 RCX: 00007fd4bb98d59b
Nov 03 07:45:41 rakete kernel: RDX: 00007fd4a9c72da0 RSI: 00000000c0184b18 RDI: 0000000000000009
Nov 03 07:45:41 rakete kernel: RBP: 00000000c0184b18 R08: 0000000000000000 R09: 00007fd488562660
Nov 03 07:45:41 rakete kernel: R10: 0000000004389000 R11: 0000000000000246 R12: 00007fd4aa4c0340
Nov 03 07:45:41 rakete kernel: R13: 0000000000000009 R14: 00007fd488562660 R15: 00007fd4a9c72ee8
Nov 03 07:45:41 rakete kernel: ---[ end trace 35af0a0cb598059e ]---
Nov 03 07:45:41 rakete kernel: ------------[ cut here ]------------
Nov 03 07:45:41 rakete kernel: refcount_t: addition on 0; use-after-free.
Nov 03 07:45:41 rakete kernel: WARNING: CPU: 8 PID: 3461 at lib/refcount.c:25 refcount_warn_saturate+0x68/0xf0
Nov 03 07:45:41 rakete kernel: Modules linked in: cfg80211 ccm algif_aead cbc des_generic libdes ecb algif_skcipher cmac md4 algif_hash af_alg it87 hwmon_vid rc_tt_1500 stb6100 isl6423 amdgpu stb0899 mxm_wmi wmi_bmof dvb_usb_pctv452e(OE) dvb_usb(OE) amd64_edac_mod edac_mce_amd snd_hda_codec_realtek btusb kvm_amd snd_hda_codec_generic btrtl btbcm ledtrig_audio snd_hda_codec_hdmi btintel ttpci_eeprom kvm bluetooth snd_hda_intel dvb_core snd_intel_dspcfg soundwire_intel soundwire_generic_allocation videobuf2_vmalloc soundwire_cadence videobuf2_memops videobuf2_common snd_hda_codec videodev snd_hda_core snd_hwdep irqbypass ecdh_generic crct10dif_pclmul soundwire_bus rfkill crc32_pclmul ghash_clmulni_intel aesni_intel mc vfat gpu_sched ecc fat snd_soc_core crc16 crypto_simd ttm snd_compress cryptd ac97_bus glue_helper snd_pcm_dmaengine rapl snd_pcm zfs(POE) drm_kms_helper snd_timer ccp cec snd k10temp syscopyarea sysfillrect i2c_piix4 sysimgblt soundcore fb_sys_fops igb i2c_algo_bit dca tpm_crb tpm_tis
Nov 03 07:45:41 rakete kernel: tpm_tis_core tpm zunicode(POE) wmi rng_core pinctrl_amd zzstd(OE) acpi_cpufreq zlua(OE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) drm pkcs8_key_parser sg fuse crypto_user agpgart zram bpf_preload ip_tables x_tables xfs libcrc32c crc32c_generic usbhid crc32c_intel xhci_pci xhci_pci_renesas sr_mod cdrom
Nov 03 07:45:41 rakete kernel: CPU: 8 PID: 3461 Comm: Xorg:cs0 Tainted: P W OE 5.10.77-1-lts #1
Nov 03 07:45:41 rakete kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ULTRA/X570 AORUS ULTRA, BIOS F35d 10/13/2021
Nov 03 07:45:41 rakete kernel: RIP: 0010:refcount_warn_saturate+0x68/0xf0
Nov 03 07:45:41 rakete kernel: Code: 05 38 13 71 01 01 e8 fa fd 55 00 0f 0b c3 80 3d 28 13 71 01 00 75 d3 48 c7 c7 b0 97 99 9d c6 05 18 13 71 01 01 e8 db fd 55 00 <0f> 0b c3 80 3d 0b 13 71 01 00 75 b4 48 c7 c7 88 97 99 9d c6 05 fb
Nov 03 07:45:41 rakete kernel: RSP: 0018:ffffbaa75c14bb20 EFLAGS: 00010282
Nov 03 07:45:41 rakete kernel: RAX: 0000000000000000 RBX: ffff9cb95f489878 RCX: 0000000000000027
Nov 03 07:45:41 rakete kernel: RDX: ffff9cc7bec18bb8 RSI: 0000000000000001 RDI: ffff9cc7bec18bb0
Nov 03 07:45:41 rakete kernel: RBP: ffff9cb902a08040 R08: 0000000000000000 R09: ffffbaa75c14b940
Nov 03 07:45:41 rakete kernel: R10: ffffbaa75c14b938 R11: ffff9cc7ff3227a8 R12: ffff9cb8cb3ca2a0
Nov 03 07:45:41 rakete kernel: R13: ffff9cb8f5b05000 R14: ffff9cb95f489878 R15: ffff9cb8f5b05000
Nov 03 07:45:41 rakete kernel: FS: 00007f469b64e640(0000) GS:ffff9cc7bec00000(0000) knlGS:0000000000000000
Nov 03 07:45:41 rakete kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 03 07:45:41 rakete kernel: CR2: 00007f4643700000 CR3: 0000000127c76000 CR4: 0000000000350ee0
Nov 03 07:45:41 rakete kernel: Call Trace:
Nov 03 07:45:41 rakete kernel: amdgpu_sync_fence+0xe3/0xf0 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_sync_resv+0x39/0x1e0 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_cs_sync_rings+0x6d/0x90 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_cs_ioctl+0x17df/0x1e70 [amdgpu]
Nov 03 07:45:41 rakete kernel: ? amdgpu_cs_find_mapping+0x110/0x110 [amdgpu]
Nov 03 07:45:41 rakete kernel: drm_ioctl_kernel+0xb2/0x100 [drm]
Nov 03 07:45:41 rakete kernel: drm_ioctl+0x22a/0x3d0 [drm]
Nov 03 07:45:41 rakete kernel: ? amdgpu_cs_find_mapping+0x110/0x110 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
Nov 03 07:45:41 rakete kernel: __x64_sys_ioctl+0x82/0xb0
Nov 03 07:45:41 rakete kernel: do_syscall_64+0x33/0x40
Nov 03 07:45:41 rakete kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
Nov 03 07:45:41 rakete kernel: RIP: 0033:0x7f46a654259b
Nov 03 07:45:41 rakete kernel: Code: ff ff ff 85 c0 79 9b 49 c7 c4 ff ff ff ff 5b 5d 4c 89 e0 41 5c c3 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a5 a8 0c 00 f7 d8 64 89 01 48
Nov 03 07:45:41 rakete kernel: RSP: 002b:00007f469b64d898 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Nov 03 07:45:41 rakete kernel: RAX: ffffffffffffffda RBX: 00007f469b64d900 RCX: 00007f46a654259b
Nov 03 07:45:41 rakete kernel: RDX: 00007f469b64d900 RSI: 00000000c0186444 RDI: 000000000000000e
Nov 03 07:45:41 rakete kernel: RBP: 00000000c0186444 R08: 00007f469b64da40 R09: 00007f469b64d9e8
Nov 03 07:45:41 rakete kernel: R10: 00005614006be290 R11: 0000000000000246 R12: 000056140063a000
Nov 03 07:45:41 rakete kernel: R13: 000000000000000e R14: 00000000fffffffd R15: 00005614006bea80
Nov 03 07:45:41 rakete kernel: ---[ end trace 35af0a0cb598059f ]---
Nov 03 07:45:41 rakete kernel: ------------[ cut here ]------------
Nov 03 07:45:41 rakete kernel: refcount_t: saturated; leaking memory.
Nov 03 07:45:41 rakete kernel: WARNING: CPU: 0 PID: 3461 at lib/refcount.c:22 refcount_warn_saturate+0x49/0xf0
Nov 03 07:45:41 rakete kernel: Modules linked in: cfg80211 ccm algif_aead cbc des_generic libdes ecb algif_skcipher cmac md4 algif_hash af_alg it87 hwmon_vid rc_tt_1500 stb6100 isl6423 amdgpu stb0899 mxm_wmi wmi_bmof dvb_usb_pctv452e(OE) dvb_usb(OE) amd64_edac_mod edac_mce_amd snd_hda_codec_realtek btusb kvm_amd snd_hda_codec_generic btrtl btbcm ledtrig_audio snd_hda_codec_hdmi btintel ttpci_eeprom kvm bluetooth snd_hda_intel dvb_core snd_intel_dspcfg soundwire_intel soundwire_generic_allocation videobuf2_vmalloc soundwire_cadence videobuf2_memops videobuf2_common snd_hda_codec videodev snd_hda_core snd_hwdep irqbypass ecdh_generic crct10dif_pclmul soundwire_bus rfkill crc32_pclmul ghash_clmulni_intel aesni_intel mc vfat gpu_sched ecc fat snd_soc_core crc16 crypto_simd ttm snd_compress cryptd ac97_bus glue_helper snd_pcm_dmaengine rapl snd_pcm zfs(POE) drm_kms_helper snd_timer ccp cec snd k10temp syscopyarea sysfillrect i2c_piix4 sysimgblt soundcore fb_sys_fops igb i2c_algo_bit dca tpm_crb tpm_tis
Nov 03 07:45:41 rakete kernel: tpm_tis_core tpm zunicode(POE) wmi rng_core pinctrl_amd zzstd(OE) acpi_cpufreq zlua(OE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) drm pkcs8_key_parser sg fuse crypto_user agpgart zram bpf_preload ip_tables x_tables xfs libcrc32c crc32c_generic usbhid crc32c_intel xhci_pci xhci_pci_renesas sr_mod cdrom
Nov 03 07:45:41 rakete kernel: CPU: 0 PID: 3461 Comm: Xorg:cs0 Tainted: P W OE 5.10.77-1-lts #1
Nov 03 07:45:41 rakete kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ULTRA/X570 AORUS ULTRA, BIOS F35d 10/13/2021
Nov 03 07:45:41 rakete kernel: RIP: 0010:refcount_warn_saturate+0x49/0xf0
Nov 03 07:45:41 rakete kernel: Code: 71 01 00 0f 84 a4 00 00 00 c3 85 f6 74 3e 80 3d 48 13 71 01 00 75 f2 48 c7 c7 88 97 99 9d c6 05 38 13 71 01 01 e8 fa fd 55 00 <0f> 0b c3 80 3d 28 13 71 01 00 75 d3 48 c7 c7 b0 97 99 9d c6 05 18
Nov 03 07:45:41 rakete kernel: RSP: 0018:ffffbaa75c14bb20 EFLAGS: 00010282
Nov 03 07:45:41 rakete kernel: RAX: 0000000000000000 RBX: ffff9cb970f27878 RCX: 0000000000000027
Nov 03 07:45:41 rakete kernel: RDX: ffff9cc7bea18bb8 RSI: 0000000000000001 RDI: ffff9cc7bea18bb0
Nov 03 07:45:41 rakete kernel: RBP: ffff9cb902a08040 R08: 0000000000000000 R09: ffffbaa75c14b940
Nov 03 07:45:41 rakete kernel: R10: ffffbaa75c14b938 R11: ffff9cc7ff3227a8 R12: ffff9cb8d6582020
Nov 03 07:45:41 rakete kernel: R13: ffff9cb8f5b05000 R14: ffff9cb970f27878 R15: ffff9cb8f5b05000
Nov 03 07:45:41 rakete kernel: FS: 00007f469b64e640(0000) GS:ffff9cc7bea00000(0000) knlGS:0000000000000000
Nov 03 07:45:41 rakete kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 03 07:45:41 rakete kernel: CR2: 00007fd2a7ee000c CR3: 0000000127c76000 CR4: 0000000000350ef0
Nov 03 07:45:41 rakete kernel: Call Trace:
Nov 03 07:45:41 rakete kernel: amdgpu_sync_fence+0xd7/0xf0 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_sync_resv+0x39/0x1e0 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_cs_sync_rings+0x6d/0x90 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_cs_ioctl+0x17df/0x1e70 [amdgpu]
Nov 03 07:45:41 rakete kernel: ? amdgpu_cs_find_mapping+0x110/0x110 [amdgpu]
Nov 03 07:45:41 rakete kernel: drm_ioctl_kernel+0xb2/0x100 [drm]
Nov 03 07:45:41 rakete kernel: drm_ioctl+0x22a/0x3d0 [drm]
Nov 03 07:45:41 rakete kernel: ? amdgpu_cs_find_mapping+0x110/0x110 [amdgpu]
Nov 03 07:45:41 rakete kernel: amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
Nov 03 07:45:41 rakete kernel: __x64_sys_ioctl+0x82/0xb0
Nov 03 07:45:41 rakete kernel: do_syscall_64+0x33/0x40
Nov 03 07:45:41 rakete kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
Nov 03 07:45:41 rakete kernel: RIP: 0033:0x7f46a654259b
Nov 03 07:45:41 rakete kernel: Code: ff ff ff 85 c0 79 9b 49 c7 c4 ff ff ff ff 5b 5d 4c 89 e0 41 5c c3 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a5 a8 0c 00 f7 d8 64 89 01 48
Nov 03 07:45:41 rakete kernel: RSP: 002b:00007f469b64d898 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Nov 03 07:45:41 rakete kernel: RAX: ffffffffffffffda RBX: 00007f469b64d900 RCX: 00007f46a654259b
Nov 03 07:45:41 rakete kernel: RDX: 00007f469b64d900 RSI: 00000000c0186444 RDI: 000000000000000e
Nov 03 07:45:41 rakete kernel: RBP: 00000000c0186444 R08: 00007f469b64da40 R09: 00007f469b64d9e8
Nov 03 07:45:41 rakete kernel: R10: 00005614006be290 R11: 0000000000000246 R12: 000056140063a000
Nov 03 07:45:41 rakete kernel: R13: 000000000000000e R14: 00000000fffffffd R15: 00005614006beb90
Nov 03 07:45:41 rakete kernel: ---[ end trace 35af0a0cb59805a0 ]---
Nov 03 07:45:45 rakete kernel: ------------[ cut here ]------------
Nov 03 07:45:45 rakete kernel: WARNING: CPU: 10 PID: 4038 at include/linux/dma-fence.h:475 amdgpu_sync_keep_later+0x72/0xb0 [amdgpu]
Nov 03 07:45:45 rakete kernel: Modules linked in: cfg80211 ccm algif_aead cbc des_generic libdes ecb algif_skcipher cmac md4 algif_hash af_alg it87 hwmon_vid rc_tt_1500 stb6100 isl6423 amdgpu stb0899 mxm_wmi wmi_bmof dvb_usb_pctv452e(OE) dvb_usb(OE) amd64_edac_mod edac_mce_amd snd_hda_codec_realtek btusb kvm_amd snd_hda_codec_generic btrtl btbcm ledtrig_audio snd_hda_codec_hdmi btintel ttpci_eeprom kvm bluetooth snd_hda_intel dvb_core snd_intel_dspcfg soundwire_intel soundwire_generic_allocation videobuf2_vmalloc soundwire_cadence videobuf2_memops videobuf2_common snd_hda_codec videodev snd_hda_core snd_hwdep irqbypass ecdh_generic crct10dif_pclmul soundwire_bus rfkill crc32_pclmul ghash_clmulni_intel aesni_intel mc vfat gpu_sched ecc fat snd_soc_core crc16 crypto_simd ttm snd_compress cryptd ac97_bus glue_helper snd_pcm_dmaengine rapl snd_pcm zfs(POE) drm_kms_helper snd_timer ccp cec snd k10temp syscopyarea sysfillrect i2c_piix4 sysimgblt soundcore fb_sys_fops igb i2c_algo_bit dca tpm_crb tpm_tis
Nov 03 07:45:45 rakete kernel: tpm_tis_core tpm zunicode(POE) wmi rng_core pinctrl_amd zzstd(OE) acpi_cpufreq zlua(OE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) drm pkcs8_key_parser sg fuse crypto_user agpgart zram bpf_preload ip_tables x_tables xfs libcrc32c crc32c_generic usbhid crc32c_intel xhci_pci xhci_pci_renesas sr_mod cdrom
Nov 03 07:45:45 rakete kernel: CPU: 10 PID: 4038 Comm: xfwm4:cs0 Tainted: P W OE 5.10.77-1-lts #1
Nov 03 07:45:45 rakete kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ULTRA/X570 AORUS ULTRA, BIOS F35d 10/13/2021
Nov 03 07:45:45 rakete kernel: RIP: 0010:amdgpu_sync_keep_later+0x72/0xb0 [amdgpu]
Nov 03 07:45:45 rakete kernel: Code: 4c 85 d2 7e 35 48 85 db 74 19 48 8d 7b 38 b8 01 00 00 00 f0 0f c1 43 38 85 c0 74 3d 8d 50 01 09 c2 78 0b 48 89 5d 00 5b 5d c3 <0f> 0b eb c0 be 01 00 00 00 e8 40 68 36 db eb e9 be 03 00 00 00 e8
Nov 03 07:45:45 rakete kernel: RSP: 0018:ffffbaa758cffb88 EFLAGS: 00010293
Nov 03 07:45:45 rakete kernel: RAX: ffff9cb8f6447900 RBX: ffff9cb8f6447cc0 RCX: 0000000000000093
Nov 03 07:45:45 rakete kernel: RDX: 0000000000000000 RSI: ffff9cb8f6447cc0 RDI: ffff9cb95ae47cf8
Nov 03 07:45:45 rakete kernel: RBP: ffff9cb95ae47cf8 R08: ffff9cb91d043398 R09: ffff9cb8c43207e0
Nov 03 07:45:45 rakete kernel: R10: ffff9cb951283b80 R11: 0000000000000000 R12: ffff9cb95ae47c78
Nov 03 07:45:45 rakete kernel: R13: ffff9cb8e7029218 R14: 0000000000000000 R15: ffff9cb8e7020000
Nov 03 07:45:45 rakete kernel: FS: 00007f7337fff640(0000) GS:ffff9cc7bec80000(0000) knlGS:0000000000000000
Nov 03 07:45:45 rakete kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 03 07:45:45 rakete kernel: CR2: 00007f0c4c426000 CR3: 00000001689e4000 CR4: 0000000000350ee0
Nov 03 07:45:45 rakete kernel: Call Trace:
Nov 03 07:45:45 rakete kernel: amdgpu_sync_vm_fence+0x1f/0x30 [amdgpu]
Nov 03 07:45:45 rakete kernel: amdgpu_cs_ioctl+0x16ee/0x1e70 [amdgpu]
Nov 03 07:45:45 rakete kernel: ? amdgpu_cs_find_mapping+0x110/0x110 [amdgpu]
Nov 03 07:45:45 rakete kernel: drm_ioctl_kernel+0xb2/0x100 [drm]
Nov 03 07:45:45 rakete kernel: drm_ioctl+0x22a/0x3d0 [drm]
Nov 03 07:45:45 rakete kernel: ? amdgpu_cs_find_mapping+0x110/0x110 [amdgpu]
Nov 03 07:45:45 rakete kernel: amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
Nov 03 07:45:45 rakete kernel: __x64_sys_ioctl+0x82/0xb0
Nov 03 07:45:45 rakete kernel: do_syscall_64+0x33/0x40
Nov 03 07:45:45 rakete kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
Nov 03 07:45:45 rakete kernel: RIP: 0033:0x7f7341a4259b
Nov 03 07:45:45 rakete kernel: Code: ff ff ff 85 c0 79 9b 49 c7 c4 ff ff ff ff 5b 5d 4c 89 e0 41 5c c3 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a5 a8 0c 00 f7 d8 64 89 01 48
Nov 03 07:45:45 rakete kernel: RSP: 002b:00007f7337ffe858 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Nov 03 07:45:45 rakete kernel: RAX: ffffffffffffffda RBX: 00007f7337ffe8c0 RCX: 00007f7341a4259b
Nov 03 07:45:45 rakete kernel: RDX: 00007f7337ffe8c0 RSI: 00000000c0186444 RDI: 000000000000000d
Nov 03 07:45:45 rakete kernel: RBP: 00000000c0186444 R08: 00007f7337ffea00 R09: 00007f7337ffe9a8
Nov 03 07:45:45 rakete kernel: R10: 000055fc42c24a40 R11: 0000000000000246 R12: 000055fc42d7ad30
Nov 03 07:45:45 rakete kernel: R13: 000000000000000d R14: 00000000fffffffd R15: 000055fc42ca8700
Nov 03 07:45:45 rakete kernel: ---[ end trace 35af0a0cb59805a1 ]---
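
Since the amdgpu_sync_keep_later warning repeats several thousand times, it may be easier to tally the warnings per call site than to read the raw grep output. The following is only a rough sketch: the function name tally_warnings and the regular expression are my own assumptions, based on the "WARNING: CPU: ... PID: ... at <file:line> <symbol>" lines shown above, and are not part of any existing tool.

```python
import re
from collections import Counter

# Assumed pattern, derived from the journal lines in this report:
#   "WARNING: CPU: <n> PID: <n> at <file:line> <symbol+offset/length>"
WARN_RE = re.compile(r"WARNING: CPU: \d+ PID: \d+ at (\S+) (\S+)")


def tally_warnings(lines):
    """Count kernel WARNING lines, grouped by (source location, symbol)."""
    counts = Counter()
    for line in lines:
        match = WARN_RE.search(line)
        if match:
            counts[(match.group(1), match.group(2))] += 1
    return counts


# Small demonstration using two lines copied from the log above.
sample = [
    "Nov 03 07:45:41 rakete kernel: WARNING: CPU: 2 PID: 4962 at "
    "lib/refcount.c:28 refcount_warn_saturate+0xa6/0xf0",
    "Nov 03 07:45:45 rakete kernel: WARNING: CPU: 10 PID: 4038 at "
    "include/linux/dma-fence.h:475 amdgpu_sync_keep_later+0x72/0xb0 [amdgpu]",
]
for (site, symbol), n in tally_warnings(sample).most_common():
    print(n, site, symbol)
```

To run it against the previous boot, the output of "journalctl -b -1" could be passed in line by line, for example by calling tally_warnings(sys.stdin) in a small script and piping the journal into it.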
Closed by  Andreas Radke (AndyRTR)
Thursday, 04 November 2021, 18:42 GMT
Reason for closing:  Fixed
Comment by Andreas Radke (AndyRTR) - Wednesday, 03 November 2021, 18:11 GMT
please try 5.10.77-2
Comment by Matthias Bodenbinder (mbod) - Wednesday, 03 November 2021, 19:02 GMT
5.10.77-2 from testing does not solve the issue. I see an immediate freeze when I start "darktable-cli bench.SRW test.jpg --core --configdir /tmp -d perf -d opencl"

Nov 03 19:54:05 rakete kernel: BUG: unable to handle page fault for address: 00000000ffff9610
Nov 03 19:54:05 rakete kernel: #PF: supervisor read access in kernel mode
Nov 03 19:54:05 rakete kernel: #PF: error_code(0x0000) - not-present page
Nov 03 19:54:05 rakete kernel: PGD 0 P4D 0
Nov 03 19:54:05 rakete kernel: Oops: 0000 [#1] SMP NOPTI
Nov 03 19:54:05 rakete kernel: CPU: 0 PID: 11273 Comm: darktable-cli Tainted: P OE 5.10.77-2-lts #1
Nov 03 19:54:05 rakete kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ULTRA/X570 AORUS ULTRA, BIOS F35d 10/13/2021
Nov 03 19:54:05 rakete kernel: RIP: 0010:amdgpu_sa_bo_try_free+0x3e/0x80 [amdgpu]
Nov 03 19:54:05 rakete kernel: Code: 47 20 48 8b 28 4c 39 ed 74 35 48 8b 5d 00 49 39 ed 74 2c 4c 8b 65 30 4d 85 e4 74 23 49 8b 44 24 30 a8 01 75 29 49 8b 44 24 08 <48> 8b 40 20 48 85 c0 74 0c 4c 89 e7 e8 b1 e1 39 c4 84 c0 75 07 5b
Nov 03 19:54:05 rakete kernel: RSP: 0018:ffffbf2cb281f938 EFLAGS: 00010246
Nov 03 19:54:05 rakete kernel: RAX: 00000000ffff95f0 RBX: ffffa03236481340 RCX: 000000008040003e
Nov 03 19:54:05 rakete kernel: RDX: 000000008040003f RSI: 000000008040003e RDI: ffffa03200043b00
Nov 03 19:54:05 rakete kernel: RBP: ffffa03236481b80 R08: 0000000000000001 R09: 0000000000000000
Nov 03 19:54:05 rakete kernel: R10: 0000000000000001 R11: ffffa032992e9000 R12: ffffa03249683d80
Nov 03 19:54:05 rakete kernel: R13: ffffa0321e306228 R14: ffffa0321e306000 R15: ffffbf2cb281fa40
Nov 03 19:54:05 rakete kernel: FS: 00007f18c16e3640(0000) GS:ffffa040fea00000(0000) knlGS:0000000000000000
Nov 03 19:54:05 rakete kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 03 19:54:05 rakete kernel: CR2: 00000000ffff9610 CR3: 0000000187194000 CR4: 0000000000350ef0
Nov 03 19:54:05 rakete kernel: Call Trace:
Nov 03 19:54:05 rakete kernel: amdgpu_sa_bo_new+0x12b/0x580 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_ib_get+0x3b/0x90 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_job_alloc_with_ib+0x53/0x80 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_vm_sdma_prepare+0x28/0x60 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_vm_bo_update_mapping.constprop.0+0x156/0x230 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_vm_clear_freed+0xfe/0x240 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_amdkfd_gpuvm_unmap_memory_from_gpu+0x17b/0x2a0 [amdgpu]
Nov 03 19:54:05 rakete kernel: kfd_ioctl_unmap_memory_from_gpu+0x108/0x200 [amdgpu]
Nov 03 19:54:05 rakete kernel: kfd_ioctl+0x2f3/0x400 [amdgpu]
Nov 03 19:54:05 rakete kernel: ? kfd_ioctl_set_cu_mask+0x1f0/0x1f0 [amdgpu]
Nov 03 19:54:05 rakete kernel: __x64_sys_ioctl+0x82/0xb0
Nov 03 19:54:05 rakete kernel: do_syscall_64+0x33/0x40
Nov 03 19:54:05 rakete kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
Nov 03 19:54:05 rakete kernel: RIP: 0033:0x7f18d33ee59b
Nov 03 19:54:05 rakete kernel: Code: ff ff ff 85 c0 79 9b 49 c7 c4 ff ff ff ff 5b 5d 4c 89 e0 41 5c c3 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a5 a8 0c 00 f7 d8 64 89 01 48
Nov 03 19:54:05 rakete kernel: RSP: 002b:00007f18c16d3db8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Nov 03 19:54:05 rakete kernel: RAX: ffffffffffffffda RBX: 00007f18c16d3e00 RCX: 00007f18d33ee59b
Nov 03 19:54:05 rakete kernel: RDX: 00007f18c16d3e00 RSI: 00000000c0184b19 RDI: 0000000000000009
Nov 03 19:54:05 rakete kernel: RBP: 00000000c0184b19 R08: 00007f18a0485870 R09: 00007f18a0485870
Nov 03 19:54:05 rakete kernel: R10: 0000000000001000 R11: 0000000000000246 R12: 00007f18c1f21340
Nov 03 19:54:05 rakete kernel: R13: 0000000000000009 R14: 0000000000000000 R15: 0000000000000000
Nov 03 19:54:05 rakete kernel: Modules linked in: cfg80211 ccm algif_aead cbc des_generic libdes ecb algif_skcipher cmac md4 algif_hash af_alg it87 hwmon_vid rc_tt_1500 stb6100 isl6423 stb0899 dvb_usb_pctv452e(OE) mxm_wmi snd_hda_codec_realtek wmi_bmof dvb_usb(OE) snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi ttpci_eeprom amd64_edac_mod amdgpu edac_mce_amd dvb_core snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation videobuf2_vmalloc soundwire_cadence videobuf2_memops videobuf2_common snd_hda_codec btusb btrtl btbcm videodev kvm_amd snd_hda_core snd_hwdep btintel soundwire_bus vfat mc fat kvm bluetooth snd_soc_core irqbypass crct10dif_pclmul gpu_sched crc32_pclmul ghash_clmulni_intel snd_compress ttm ac97_bus snd_pcm_dmaengine aesni_intel snd_pcm drm_kms_helper ecdh_generic crypto_simd rfkill cryptd snd_timer ecc glue_helper crc16 rapl snd cec k10temp syscopyarea ccp sysfillrect soundcore zfs(POE) sysimgblt i2c_piix4 fb_sys_fops igb i2c_algo_bit dca tpm_crb tpm_tis
Nov 03 19:54:05 rakete kernel: tpm_tis_core tpm wmi rng_core pinctrl_amd acpi_cpufreq zunicode(POE) zzstd(OE) zlua(OE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) drm pkcs8_key_parser sg fuse crypto_user agpgart zram bpf_preload ip_tables x_tables xfs libcrc32c crc32c_generic usbhid crc32c_intel sr_mod xhci_pci xhci_pci_renesas cdrom
Nov 03 19:54:05 rakete kernel: CR2: 00000000ffff9610
Nov 03 19:54:05 rakete kernel: ---[ end trace 1d94f2a406459926 ]---
Nov 03 19:54:05 rakete kernel: RIP: 0010:amdgpu_sa_bo_try_free+0x3e/0x80 [amdgpu]
Nov 03 19:54:05 rakete kernel: Code: 47 20 48 8b 28 4c 39 ed 74 35 48 8b 5d 00 49 39 ed 74 2c 4c 8b 65 30 4d 85 e4 74 23 49 8b 44 24 30 a8 01 75 29 49 8b 44 24 08 <48> 8b 40 20 48 85 c0 74 0c 4c 89 e7 e8 b1 e1 39 c4 84 c0 75 07 5b
Nov 03 19:54:05 rakete kernel: RSP: 0018:ffffbf2cb281f938 EFLAGS: 00010246
Nov 03 19:54:05 rakete kernel: RAX: 00000000ffff95f0 RBX: ffffa03236481340 RCX: 000000008040003e
Nov 03 19:54:05 rakete kernel: RDX: 000000008040003f RSI: 000000008040003e RDI: ffffa03200043b00
Nov 03 19:54:05 rakete kernel: RBP: ffffa03236481b80 R08: 0000000000000001 R09: 0000000000000000
Nov 03 19:54:05 rakete kernel: R10: 0000000000000001 R11: ffffa032992e9000 R12: ffffa03249683d80
Nov 03 19:54:05 rakete kernel: R13: ffffa0321e306228 R14: ffffa0321e306000 R15: ffffbf2cb281fa40
Nov 03 19:54:05 rakete kernel: FS: 00007f18c16e3640(0000) GS:ffffa040fea00000(0000) knlGS:0000000000000000
Nov 03 19:54:05 rakete kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 03 19:54:05 rakete kernel: CR2: 00000000ffff9610 CR3: 0000000187194000 CR4: 0000000000350ef0
Nov 03 19:54:05 rakete kernel: ------------[ cut here ]------------
Nov 03 19:54:05 rakete kernel: refcount_t: addition on 0; use-after-free.
Nov 03 19:54:05 rakete kernel: WARNING: CPU: 2 PID: 8435 at lib/refcount.c:25 refcount_warn_saturate+0x68/0xf0
Nov 03 19:54:05 rakete kernel: Modules linked in: cfg80211 ccm algif_aead cbc des_generic libdes ecb algif_skcipher cmac md4 algif_hash af_alg it87 hwmon_vid rc_tt_1500 stb6100 isl6423 stb0899 dvb_usb_pctv452e(OE) mxm_wmi snd_hda_codec_realtek wmi_bmof dvb_usb(OE) snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi ttpci_eeprom amd64_edac_mod amdgpu edac_mce_amd dvb_core snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation videobuf2_vmalloc soundwire_cadence videobuf2_memops videobuf2_common snd_hda_codec btusb btrtl btbcm videodev kvm_amd snd_hda_core snd_hwdep btintel soundwire_bus vfat mc fat kvm bluetooth snd_soc_core irqbypass crct10dif_pclmul gpu_sched crc32_pclmul ghash_clmulni_intel snd_compress ttm ac97_bus snd_pcm_dmaengine aesni_intel snd_pcm drm_kms_helper ecdh_generic crypto_simd rfkill cryptd snd_timer ecc glue_helper crc16 rapl snd cec k10temp syscopyarea ccp sysfillrect soundcore zfs(POE) sysimgblt i2c_piix4 fb_sys_fops igb i2c_algo_bit dca tpm_crb tpm_tis
Nov 03 19:54:05 rakete kernel: tpm_tis_core tpm wmi rng_core pinctrl_amd acpi_cpufreq zunicode(POE) zzstd(OE) zlua(OE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) drm pkcs8_key_parser sg fuse crypto_user agpgart zram bpf_preload ip_tables x_tables xfs libcrc32c crc32c_generic usbhid crc32c_intel sr_mod xhci_pci xhci_pci_renesas cdrom
Nov 03 19:54:05 rakete kernel: CPU: 2 PID: 8435 Comm: Xorg:cs0 Tainted: P D OE 5.10.77-2-lts #1
Nov 03 19:54:05 rakete kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ULTRA/X570 AORUS ULTRA, BIOS F35d 10/13/2021
Nov 03 19:54:05 rakete kernel: RIP: 0010:refcount_warn_saturate+0x68/0xf0
Nov 03 19:54:05 rakete kernel: Code: 05 38 13 71 01 01 e8 fa fd 55 00 0f 0b c3 80 3d 28 13 71 01 00 75 d3 48 c7 c7 b0 97 59 86 c6 05 18 13 71 01 01 e8 db fd 55 00 <0f> 0b c3 80 3d 0b 13 71 01 00 75 b4 48 c7 c7 88 97 59 86 c6 05 fb
Nov 03 19:54:05 rakete kernel: RSP: 0018:ffffbf2c80f7bb20 EFLAGS: 00010282
Nov 03 19:54:05 rakete kernel: RAX: 0000000000000000 RBX: ffffa0326c79f478 RCX: 0000000000000027
Nov 03 19:54:05 rakete kernel: RDX: ffffa040fea98bb8 RSI: 0000000000000001 RDI: ffffa040fea98bb0
Nov 03 19:54:05 rakete kernel: RBP: ffffa03249682c40 R08: 0000000000000000 R09: ffffbf2c80f7b940
Nov 03 19:54:05 rakete kernel: R10: ffffbf2c80f7b938 R11: ffffa0413f3227a8 R12: ffffa03206e201e0
Nov 03 19:54:05 rakete kernel: R13: ffffa0324a1f5000 R14: ffffa0326c79f478 R15: ffffa0324a1f5000
Nov 03 19:54:05 rakete kernel: FS: 00007ff25d632640(0000) GS:ffffa040fea80000(0000) knlGS:0000000000000000
Nov 03 19:54:05 rakete kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 03 19:54:05 rakete kernel: CR2: 00007f189f3fe000 CR3: 0000000127294000 CR4: 0000000000350ee0
Nov 03 19:54:05 rakete kernel: Call Trace:
Nov 03 19:54:05 rakete kernel: amdgpu_sync_fence+0xe3/0xf0 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_sync_resv+0x39/0x1e0 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_cs_sync_rings+0x6d/0x90 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_cs_ioctl+0x17df/0x1e70 [amdgpu]
Nov 03 19:54:05 rakete kernel: ? amdgpu_cs_find_mapping+0x110/0x110 [amdgpu]
Nov 03 19:54:05 rakete kernel: drm_ioctl_kernel+0xb2/0x100 [drm]
Nov 03 19:54:05 rakete kernel: drm_ioctl+0x22a/0x3d0 [drm]
Nov 03 19:54:05 rakete kernel: ? amdgpu_cs_find_mapping+0x110/0x110 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
Nov 03 19:54:05 rakete kernel: __x64_sys_ioctl+0x82/0xb0
Nov 03 19:54:05 rakete kernel: do_syscall_64+0x33/0x40
Nov 03 19:54:05 rakete kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
Nov 03 19:54:05 rakete kernel: RIP: 0033:0x7ff26852659b
Nov 03 19:54:05 rakete kernel: Code: ff ff ff 85 c0 79 9b 49 c7 c4 ff ff ff ff 5b 5d 4c 89 e0 41 5c c3 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a5 a8 0c 00 f7 d8 64 89 01 48
Nov 03 19:54:05 rakete kernel: RSP: 002b:00007ff25d631898 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Nov 03 19:54:05 rakete kernel: RAX: ffffffffffffffda RBX: 00007ff25d631900 RCX: 00007ff26852659b
Nov 03 19:54:05 rakete kernel: RDX: 00007ff25d631900 RSI: 00000000c0186444 RDI: 000000000000000e
Nov 03 19:54:05 rakete kernel: RBP: 00000000c0186444 R08: 00007ff25d631a40 R09: 00007ff25d6319e8
Nov 03 19:54:05 rakete kernel: R10: 00005646aab50410 R11: 0000000000000246 R12: 00005646aaace760
Nov 03 19:54:05 rakete kernel: R13: 000000000000000e R14: 00000000fffffffd R15: 00005646aab50d10
Nov 03 19:54:05 rakete kernel: ---[ end trace 1d94f2a406459927 ]---
Nov 03 19:54:05 rakete kernel: ------------[ cut here ]------------
Nov 03 19:54:05 rakete kernel: refcount_t: underflow; use-after-free.
Nov 03 19:54:05 rakete kernel: WARNING: CPU: 7 PID: 1551 at lib/refcount.c:28 refcount_warn_saturate+0xa6/0xf0
Nov 03 19:54:05 rakete kernel: Modules linked in: cfg80211 ccm algif_aead cbc des_generic libdes ecb algif_skcipher cmac md4 algif_hash af_alg it87 hwmon_vid rc_tt_1500 stb6100 isl6423 stb0899 dvb_usb_pctv452e(OE) mxm_wmi snd_hda_codec_realtek wmi_bmof dvb_usb(OE) snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi ttpci_eeprom amd64_edac_mod amdgpu edac_mce_amd dvb_core snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation videobuf2_vmalloc soundwire_cadence videobuf2_memops videobuf2_common snd_hda_codec btusb btrtl btbcm videodev kvm_amd snd_hda_core snd_hwdep btintel soundwire_bus vfat mc fat kvm bluetooth snd_soc_core irqbypass crct10dif_pclmul gpu_sched crc32_pclmul ghash_clmulni_intel snd_compress ttm ac97_bus snd_pcm_dmaengine aesni_intel snd_pcm drm_kms_helper ecdh_generic crypto_simd rfkill cryptd snd_timer ecc glue_helper crc16 rapl snd cec k10temp syscopyarea ccp sysfillrect soundcore zfs(POE) sysimgblt i2c_piix4 fb_sys_fops igb i2c_algo_bit dca tpm_crb tpm_tis
Nov 03 19:54:05 rakete kernel: tpm_tis_core tpm wmi rng_core pinctrl_amd acpi_cpufreq zunicode(POE) zzstd(OE) zlua(OE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) drm pkcs8_key_parser sg fuse crypto_user agpgart zram bpf_preload ip_tables x_tables xfs libcrc32c crc32c_generic usbhid crc32c_intel sr_mod xhci_pci xhci_pci_renesas cdrom
Nov 03 19:54:05 rakete kernel: CPU: 7 PID: 1551 Comm: gfx_0.0.0 Tainted: P D W OE 5.10.77-2-lts #1
Nov 03 19:54:05 rakete kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ULTRA/X570 AORUS ULTRA, BIOS F35d 10/13/2021
Nov 03 19:54:05 rakete kernel: RIP: 0010:refcount_warn_saturate+0xa6/0xf0
Nov 03 19:54:05 rakete kernel: Code: 05 fb 12 71 01 01 e8 bc fd 55 00 0f 0b c3 80 3d e9 12 71 01 00 75 95 48 c7 c7 e0 97 59 86 c6 05 d9 12 71 01 01 e8 9d fd 55 00 <0f> 0b c3 80 3d c8 12 71 01 00 0f 85 72 ff ff ff 48 c7 c7 38 98 59
Nov 03 19:54:05 rakete kernel: RSP: 0018:ffffbf2c80b17df0 EFLAGS: 00010286
Nov 03 19:54:05 rakete kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000027
Nov 03 19:54:05 rakete kernel: RDX: ffffa040febd8bb8 RSI: 0000000000000001 RDI: ffffa040febd8bb0
Nov 03 19:54:05 rakete kernel: RBP: 00000000ffffffff R08: 0000000000000000 R09: ffffbf2c80b17c10
Nov 03 19:54:05 rakete kernel: R10: ffffbf2c80b17c08 R11: ffffa0413f3227a8 R12: ffffa03249682c40
Nov 03 19:54:05 rakete kernel: R13: ffffa0326c79f4b0 R14: ffffa0326c79f4f8 R15: ffffa032126bca08
Nov 03 19:54:05 rakete kernel: FS: 0000000000000000(0000) GS:ffffa040febc0000(0000) knlGS:0000000000000000
Nov 03 19:54:05 rakete kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 03 19:54:05 rakete kernel: CR2: 00007f189f4fe000 CR3: 00000001944d8000 CR4: 0000000000350ee0
Nov 03 19:54:05 rakete kernel: Call Trace:
Nov 03 19:54:05 rakete kernel: amdgpu_sync_get_fence+0xec/0xf0 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_job_dependency+0x2d/0xd0 [amdgpu]
Nov 03 19:54:05 rakete kernel: drm_sched_entity_pop_job+0x3d/0x3b0 [gpu_sched]
Nov 03 19:54:05 rakete kernel: drm_sched_main+0x11c/0x460 [gpu_sched]
Nov 03 19:54:05 rakete kernel: ? add_wait_queue_exclusive+0x70/0x70
Nov 03 19:54:05 rakete kernel: ? drm_sched_select_entity+0xc0/0xc0 [gpu_sched]
Nov 03 19:54:05 rakete kernel: kthread+0x11b/0x140
Nov 03 19:54:05 rakete kernel: ? kthread_associate_blkcg+0xa0/0xa0
Nov 03 19:54:05 rakete kernel: ret_from_fork+0x22/0x30
Nov 03 19:54:05 rakete kernel: ---[ end trace 1d94f2a406459928 ]---
Nov 03 19:54:05 rakete kernel: ------------[ cut here ]------------
Nov 03 19:54:05 rakete kernel: refcount_t: saturated; leaking memory.
Nov 03 19:54:05 rakete kernel: WARNING: CPU: 2 PID: 8435 at lib/refcount.c:22 refcount_warn_saturate+0x49/0xf0
Nov 03 19:54:05 rakete kernel: Modules linked in: cfg80211 ccm algif_aead cbc des_generic libdes ecb algif_skcipher cmac md4 algif_hash af_alg it87 hwmon_vid rc_tt_1500 stb6100 isl6423 stb0899 dvb_usb_pctv452e(OE) mxm_wmi snd_hda_codec_realtek wmi_bmof dvb_usb(OE) snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi ttpci_eeprom amd64_edac_mod amdgpu edac_mce_amd dvb_core snd_hda_intel snd_intel_dspcfg soundwire_intel soundwire_generic_allocation videobuf2_vmalloc soundwire_cadence videobuf2_memops videobuf2_common snd_hda_codec btusb btrtl btbcm videodev kvm_amd snd_hda_core snd_hwdep btintel soundwire_bus vfat mc fat kvm bluetooth snd_soc_core irqbypass crct10dif_pclmul gpu_sched crc32_pclmul ghash_clmulni_intel snd_compress ttm ac97_bus snd_pcm_dmaengine aesni_intel snd_pcm drm_kms_helper ecdh_generic crypto_simd rfkill cryptd snd_timer ecc glue_helper crc16 rapl snd cec k10temp syscopyarea ccp sysfillrect soundcore zfs(POE) sysimgblt i2c_piix4 fb_sys_fops igb i2c_algo_bit dca tpm_crb tpm_tis
Nov 03 19:54:05 rakete kernel: tpm_tis_core tpm wmi rng_core pinctrl_amd acpi_cpufreq zunicode(POE) zzstd(OE) zlua(OE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) drm pkcs8_key_parser sg fuse crypto_user agpgart zram bpf_preload ip_tables x_tables xfs libcrc32c crc32c_generic usbhid crc32c_intel sr_mod xhci_pci xhci_pci_renesas cdrom
Nov 03 19:54:05 rakete kernel: CPU: 2 PID: 8435 Comm: Xorg:cs0 Tainted: P D W OE 5.10.77-2-lts #1
Nov 03 19:54:05 rakete kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ULTRA/X570 AORUS ULTRA, BIOS F35d 10/13/2021
Nov 03 19:54:05 rakete kernel: RIP: 0010:refcount_warn_saturate+0x49/0xf0
Nov 03 19:54:05 rakete kernel: Code: 71 01 00 0f 84 a4 00 00 00 c3 85 f6 74 3e 80 3d 48 13 71 01 00 75 f2 48 c7 c7 88 97 59 86 c6 05 38 13 71 01 01 e8 fa fd 55 00 <0f> 0b c3 80 3d 28 13 71 01 00 75 d3 48 c7 c7 b0 97 59 86 c6 05 18
Nov 03 19:54:05 rakete kernel: RSP: 0018:ffffbf2c80f7bb20 EFLAGS: 00010282
Nov 03 19:54:05 rakete kernel: RAX: 0000000000000000 RBX: ffffa0326c79f478 RCX: 0000000000000027
Nov 03 19:54:05 rakete kernel: RDX: ffffa040fea98bb8 RSI: 0000000000000001 RDI: ffffa040fea98bb0
Nov 03 19:54:05 rakete kernel: RBP: ffffa03249682c40 R08: 0000000000000000 R09: ffffbf2c80f7b940
Nov 03 19:54:05 rakete kernel: R10: ffffbf2c80f7b938 R11: ffffa0413f3227a8 R12: ffffa03206e209c0
Nov 03 19:54:05 rakete kernel: R13: ffffa0324a1f5000 R14: ffffa0326c79f478 R15: ffffa0324a1f5000
Nov 03 19:54:05 rakete kernel: FS: 00007ff25d632640(0000) GS:ffffa040fea80000(0000) knlGS:0000000000000000
Nov 03 19:54:05 rakete kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Nov 03 19:54:05 rakete kernel: CR2: 00007f189f3fe000 CR3: 0000000127294000 CR4: 0000000000350ee0
Nov 03 19:54:05 rakete kernel: Call Trace:
Nov 03 19:54:05 rakete kernel: amdgpu_sync_fence+0xd7/0xf0 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_sync_resv+0x39/0x1e0 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_cs_sync_rings+0x6d/0x90 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_cs_ioctl+0x17df/0x1e70 [amdgpu]
Nov 03 19:54:05 rakete kernel: ? amdgpu_cs_find_mapping+0x110/0x110 [amdgpu]
Nov 03 19:54:05 rakete kernel: drm_ioctl_kernel+0xb2/0x100 [drm]
Nov 03 19:54:05 rakete kernel: drm_ioctl+0x22a/0x3d0 [drm]
Nov 03 19:54:05 rakete kernel: ? amdgpu_cs_find_mapping+0x110/0x110 [amdgpu]
Nov 03 19:54:05 rakete kernel: amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
Nov 03 19:54:05 rakete kernel: __x64_sys_ioctl+0x82/0xb0
Nov 03 19:54:05 rakete kernel: do_syscall_64+0x33/0x40
Nov 03 19:54:05 rakete kernel: entry_SYSCALL_64_after_hwframe+0x44/0xa9
Nov 03 19:54:05 rakete kernel: RIP: 0033:0x7ff26852659b
Nov 03 19:54:05 rakete kernel: Code: ff ff ff 85 c0 79 9b 49 c7 c4 ff ff ff ff 5b 5d 4c 89 e0 41 5c c3 66 0f 1f 84 00 00 00 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a5 a8 0c 00 f7 d8 64 89 01 48
Nov 03 19:54:05 rakete kernel: RSP: 002b:00007ff25d631898 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Nov 03 19:54:05 rakete kernel: RAX: ffffffffffffffda RBX: 00007ff25d631900 RCX: 00007ff26852659b
Nov 03 19:54:05 rakete kernel: RDX: 00007ff25d631900 RSI: 00000000c0186444 RDI: 000000000000000e
Nov 03 19:54:05 rakete kernel: RBP: 00000000c0186444 R08: 00007ff25d631a40 R09: 00007ff25d6319e8
Nov 03 19:54:05 rakete kernel: R10: 00005646aab50410 R11: 0000000000000246 R12: 00005646aaace760
Nov 03 19:54:05 rakete kernel: R13: 000000000000000e R14: 00000000fffffffd R15: 00005646aab50c00
Nov 03 19:54:05 rakete kernel: ---[ end trace 1d94f2a406459929 ]---
Comment by Andreas Radke (AndyRTR) - Wednesday, 03 November 2021, 19:13 GMT
I'm also affected. I've started to bisect this.
Comment by Jan Alexander Steffens (heftig) - Thursday, 04 November 2021, 12:51 GMT
Comment by Andreas Radke (AndyRTR) - Thursday, 04 November 2021, 16:43 GMT
Please try again with -3.
Comment by Matthias Bodenbinder (mbod) - Thursday, 04 November 2021, 18:31 GMT
Looks good. No crash so far with 5.10.77-3-lts.
darktable, Firefox, Netflix ... all good. No warnings in the journal.
