FS#72188 - [5.14.6-arch1-1] No sound output and unkillable pulseaudio with snd_hdsp driver

Attached to Project: Arch Linux
Opened by Steven Fosdick (Amphitryon) - Monday, 20 September 2021, 23:55 GMT
Last edited by Jan Alexander Steffens (heftig) - Friday, 24 September 2021, 20:37 GMT
Task Type Bug Report
Category Kernel
Status Closed
Assigned To Jan Alexander Steffens (heftig)
Architecture x86_64
Severity High
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 0
Private No

Details

Description:

With any 5.14 kernel, up to and including 5.14.6 as reported here, there is an issue that seems to be related to the snd_hdsp driver. In GNOME, the volume control shows "muted" and pavucontrol shows only a dummy output.

On shutdown, /var fails to unmount because it is busy, and the pulseaudio process does not die even though the shutdown process sends it several SIGTERM and SIGKILL signals, so it appears to be blocked somewhere in the kernel. The shutdown eventually continues anyway, but only after several timeouts.

The stuck pulseaudio process appears to be owned by gdm, so the likely reason there is no sound in GNOME for a normal user is that this process is not releasing the sound hardware for the user session to use.
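
One way to check whether a process that ignores SIGKILL really is blocked in the kernel is to look at the "State:" line of /proc/<pid>/status. The sketch below is only an illustration: the helper names are hypothetical, and it is an assumption, not something confirmed in this report, that the stuck pulseaudio process would show state "D" (uninterruptible sleep).

```python
import re
from typing import Optional


def parse_proc_state(status_text: str) -> Optional[str]:
    """Return the one-letter state code from /proc/<pid>/status content.

    The kernel writes a line such as "State:\tD (disk sleep)"; only the
    first letter after the label is returned. None means no State line.
    """
    match = re.search(r"^State:\s+(\S)", status_text, flags=re.MULTILINE)
    return match.group(1) if match else None


def is_uninterruptible(status_text: str) -> bool:
    """True if the process is in uninterruptible sleep (state "D")."""
    return parse_proc_state(status_text) == "D"
```

To use it on a live system, read /proc/647/status (647 being the pulseaudio PID from the log in this report) and pass the text to is_uninterruptible(). A process in state "D" does not act on signals until the kernel call it is waiting in returns.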

Additional info:
* package version(s)
Linux 5.14.6-arch1-1 #1 SMP PREEMPT
pulseaudio-15.0-1
* config and/or log files etc.
Sep 20 23:32:13 ecrins kernel: BUG: kernel NULL pointer dereference, address: 0000000000000000
Sep 20 23:32:13 ecrins kernel: #PF: supervisor read access in kernel mode
Sep 20 23:32:13 ecrins kernel: #PF: error_code(0x0000) - not-present page
Sep 20 23:32:14 ecrins kernel: PGD 0 P4D 0
Sep 20 23:32:14 ecrins kernel: Oops: 0000 [#1] PREEMPT SMP PTI
Sep 20 23:32:14 ecrins kernel: CPU: 2 PID: 647 Comm: pulseaudio Not tainted 5.14.6-arch1-1 #1 5afad94a3d6c5ce03c4ad5c416037ac9efb92f5d
Sep 20 23:32:14 ecrins kernel: Hardware name: System manufacturer System Product Name/P8Z77-V LX, BIOS 1502 11/09/2012
Sep 20 23:32:14 ecrins kernel: RIP: 0010:snd_dma_buffer_mmap+0x5/0x40 [snd_pcm]
Sep 20 23:32:14 ecrins kernel: Code: 83 fa 06 77 1c 48 8b 04 c5 60 d5 a5 c0 48 85 c0 74 0e 48 8b 40 08 48 85 c0 74 05 e9 b5 70 9a ee c3 0f 0b c3 90 0f 1f 44 00 00 <48> 63 07 8d 50 ff 83 f>
Sep 20 23:32:14 ecrins kernel: RSP: 0018:ffffa8814157fcb8 EFLAGS: 00010246
Sep 20 23:32:14 ecrins kernel: RAX: ffff8cfd2548d000 RBX: ffff8cfd074c8a00 RCX: 0000000000000000
Sep 20 23:32:14 ecrins kernel: RDX: 00000000001ac000 RSI: ffff8cfd2c2b37d0 RDI: 0000000000000000
Sep 20 23:32:14 ecrins kernel: RBP: 00007fd1a3b24000 R08: 00000000040400fb R09: 0000000000004000
Sep 20 23:32:14 ecrins kernel: R10: ffff8cfd25510e10 R11: 00007fd1a8ce0000 R12: ffff8cfd2c2b35a8
Sep 20 23:32:14 ecrins kernel: R13: 0000000000000008 R14: ffff8cfd2c2b37d0 R15: ffff8cfd047b6a40
Sep 20 23:32:14 ecrins kernel: FS: 00007fd1a7e84840(0000) GS:ffff8d03fff00000(0000) knlGS:0000000000000000
Sep 20 23:32:14 ecrins kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 20 23:32:14 ecrins kernel: CR2: 0000000000000000 CR3: 0000000100d56005 CR4: 00000000001706e0
Sep 20 23:32:14 ecrins kernel: Call Trace:
Sep 20 23:32:14 ecrins kernel: snd_pcm_mmap_data+0x127/0x140 [snd_pcm 91d99f05cb0931ca64b3bc46c68b3deda000284c]
Sep 20 23:32:14 ecrins kernel: mmap_region+0x415/0x690
Sep 20 23:32:14 ecrins kernel: do_mmap+0x35f/0x550
Sep 20 23:32:14 ecrins kernel: ? security_mmap_file+0x81/0xb0
Sep 20 23:32:14 ecrins kernel: vm_mmap_pgoff+0xe2/0x190
Sep 20 23:32:14 ecrins kernel: ksys_mmap_pgoff+0x1c8/0x230
Sep 20 23:32:14 ecrins kernel: do_syscall_64+0x5c/0x80
Sep 20 23:32:14 ecrins kernel: ? __audit_syscall_exit+0x24d/0x2a0
Sep 20 23:32:14 ecrins kernel: ? syscall_exit_to_user_mode+0x23/0x40
Sep 20 23:32:14 ecrins kernel: ? do_syscall_64+0x69/0x80
Sep 20 23:32:14 ecrins kernel: ? do_syscall_64+0x69/0x80
Sep 20 23:32:14 ecrins kernel: ? do_syscall_64+0x69/0x80
Sep 20 23:32:14 ecrins kernel: entry_SYSCALL_64_after_hwframe+0x44/0xae
Sep 20 23:32:14 ecrins kernel: RIP: 0033:0x7fd1a88d5336
Sep 20 23:32:14 ecrins kernel: Code: 01 00 66 90 f3 0f 1e fa 41 f7 c1 ff 0f 00 00 75 2b 55 48 89 fd 53 89 cb 48 85 ff 74 37 41 89 da 48 89 ef b8 09 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 6>
Sep 20 23:32:14 ecrins kernel: RSP: 002b:00007ffc55d15968 EFLAGS: 00000246 ORIG_RAX: 0000000000000009
Sep 20 23:32:14 ecrins kernel: RAX: ffffffffffffffda RBX: 0000000000000001 RCX: 00007fd1a88d5336
Sep 20 23:32:14 ecrins kernel: RDX: 0000000000000003 RSI: 0000000000004000 RDI: 0000000000000000
Sep 20 23:32:14 ecrins kernel: RBP: 0000000000000000 R08: 000000000000001f R09: 0000000000000000
Sep 20 23:32:14 ecrins kernel: R10: 0000000000000001 R11: 0000000000000246 R12: 000055ebbaa4ed90
Sep 20 23:32:14 ecrins kernel: R13: 00000000fffffffe R14: 0000000000000001 R15: 0000000000000000
Sep 20 23:32:14 ecrins kernel: Modules linked in: nct6775 hwmon_vid intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal snd_hda_codec_hdmi intel_powerclamp coretemp snd_hda_codec_realte>
Sep 20 23:32:14 ecrins kernel: sysfillrect sysimgblt fb_sys_fops cec drm intel_agp intel_gtt agpgart
Sep 20 23:32:14 ecrins kernel: CR2: 0000000000000000
Sep 20 23:32:14 ecrins kernel: ---[ end trace 46bd0b841edd7f40 ]---
Sep 20 23:32:14 ecrins kernel: RIP: 0010:snd_dma_buffer_mmap+0x5/0x40 [snd_pcm]
Sep 20 23:32:14 ecrins kernel: Code: 83 fa 06 77 1c 48 8b 04 c5 60 d5 a5 c0 48 85 c0 74 0e 48 8b 40 08 48 85 c0 74 05 e9 b5 70 9a ee c3 0f 0b c3 90 0f 1f 44 00 00 <48> 63 07 8d 50 ff 83 f>
Sep 20 23:32:14 ecrins kernel: RSP: 0018:ffffa8814157fcb8 EFLAGS: 00010246
Sep 20 23:32:14 ecrins kernel: RAX: ffff8cfd2548d000 RBX: ffff8cfd074c8a00 RCX: 0000000000000000
Sep 20 23:32:14 ecrins kernel: RDX: 00000000001ac000 RSI: ffff8cfd2c2b37d0 RDI: 0000000000000000
Sep 20 23:32:14 ecrins kernel: RBP: 00007fd1a3b24000 R08: 00000000040400fb R09: 0000000000004000
Sep 20 23:32:14 ecrins kernel: R10: ffff8cfd25510e10 R11: 00007fd1a8ce0000 R12: ffff8cfd2c2b35a8
Sep 20 23:32:14 ecrins kernel: R13: 0000000000000008 R14: ffff8cfd2c2b37d0 R15: ffff8cfd047b6a40
Sep 20 23:32:14 ecrins kernel: FS: 00007fd1a7e84840(0000) GS:ffff8d03fff00000(0000) knlGS:0000000000000000
Sep 20 23:32:14 ecrins kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 20 23:32:14 ecrins kernel: CR2: 0000000000000000 CR3: 0000000100d56005 CR4: 00000000001706e0
...
Sep 20 23:34:35 ecrins systemd[1]: user@120.service: State 'stop-sigterm' timed out. Killing.
Sep 20 23:34:35 ecrins systemd[1]: user@120.service: Killing process 575 (systemd) with signal SIGKILL.
Sep 20 23:34:35 ecrins systemd[1]: user@120.service: Killing process 647 (pulseaudio) with signal SIGKILL.
Sep 20 23:34:35 ecrins systemd[1]: user@120.service: Killing process 777 (alsa-sink-ALC88) with signal SIGKILL.
Sep 20 23:34:35 ecrins systemd[1]: user@120.service: Killing process 779 (alsa-source-ALC) with signal SIGKILL.
Sep 20 23:34:35 ecrins systemd[1]: user@120.service: Main process exited, code=killed, status=9/KILL
Sep 20 23:34:35 ecrins systemd[1]: user@120.service: Killing process 647 (pulseaudio) with signal SIGKILL.
Sep 20 23:34:35 ecrins systemd[1]: user@120.service: Killing process 777 (alsa-sink-ALC88) with signal SIGKILL.
Sep 20 23:34:35 ecrins systemd[1]: user@120.service: Killing process 779 (alsa-source-ALC) with signal SIGKILL.
* link to upstream bug report, if any
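
For comparing this oops against other reports (for example a suspected duplicate), it can help to pull the faulting symbol out of the "RIP:" line automatically. The following is a minimal sketch, assuming the log format shown above; the names OopsLocation and find_faulting_location are hypothetical and not part of any existing tool.

```python
import re
from typing import NamedTuple, Optional


class OopsLocation(NamedTuple):
    symbol: str            # function containing the faulting instruction
    offset: int            # byte offset of the instruction in the function
    size: int              # total size of the function in bytes
    module: Optional[str]  # kernel module name, or None for built-in code


# Matches e.g. "RIP: 0010:snd_dma_buffer_mmap+0x5/0x40 [snd_pcm]".
# A user-space line such as "RIP: 0033:0x7fd1a88d5336" has no symbol
# and is deliberately not matched.
_RIP_RE = re.compile(
    r"RIP:\s+[0-9a-f]{4}:"
    r"(?P<symbol>[A-Za-z_][\w.]*)"
    r"\+0x(?P<offset>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
    r"(?:\s+\[(?P<module>[^\s\]]+)[^\]]*\])?"
)


def find_faulting_location(log_text: str) -> Optional[OopsLocation]:
    """Return the first kernel-mode RIP location found in the log text."""
    for line in log_text.splitlines():
        m = _RIP_RE.search(line)
        if m:
            return OopsLocation(
                symbol=m["symbol"],
                offset=int(m["offset"], 16),
                size=int(m["size"], 16),
                module=m["module"],
            )
    return None
```

Applied to the log above, this should report snd_dma_buffer_mmap at offset 0x5 in module snd_pcm, which is the location where the NULL pointer was dereferenced.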

Steps to reproduce:

For me, all I have to do is log into GNOME to see the issue. However, if I blacklist the snd_hdsp driver, the problem disappears, though obviously I then cannot get any sound from the sound card concerned. The sound card is an RME Multiface, a studio sound card with an external box. It is quite old and PCI rather than PCI Express, so it is probably not very common. I have not yet been able to determine whether loading the driver without the hardware present still shows the issue.
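
When testing with and without the blacklist, it is worth confirming that snd_hdsp is really absent (or present) before logging in. The sketch below parses text in the format of /proc/modules, where the first field of each line is the module name. The function names are hypothetical, and the sample values in the comment are made up for illustration.

```python
from typing import Set


def loaded_modules(proc_modules_text: str) -> Set[str]:
    """Return the set of module names listed in /proc/modules content.

    Each non-empty line looks roughly like
    "snd_hdsp 73728 0 - Live 0x0000000000000000" (illustrative values);
    only the first whitespace-separated field is used.
    """
    return {
        line.split()[0]
        for line in proc_modules_text.splitlines()
        if line.strip()
    }


def is_module_loaded(name: str, proc_modules_text: str) -> bool:
    """Check for a module by name, treating "-" and "_" as equivalent.

    /proc/modules reports names with underscores, so a name written
    with hyphens is normalised before the lookup.
    """
    return name.replace("-", "_") in loaded_modules(proc_modules_text)
```

On a running system, pass the contents of /proc/modules as the second argument. With a line such as "blacklist snd_hdsp" in a file under /etc/modprobe.d/, is_module_loaded("snd_hdsp", text) would be expected to return False after a reboot, unless something loads the module explicitly.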

Closed by Jan Alexander Steffens (heftig)
Friday, 24 September 2021, 20:37 GMT
Reason for closing: Duplicate
Additional comments about closing: FS#72059
