FS#57429 - [linux] linux-4.15.1-2 Kernel crash with ipmi_si on boot

Attached to Project: Arch Linux
Opened by Mario Korte (emkay1) - Thursday, 08 February 2018, 08:09 GMT
Last edited by freswa (frederik) - Sunday, 13 September 2020, 14:04 GMT
Task Type Bug Report
Category Kernel
Status Closed
Assigned To Tobias Powalowski (tpowa)
Jan Alexander Steffens (heftig)
Architecture x86_64
Severity Critical
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 3
Private No

Details

Description:
When booting arch with the current 4.15.1-2 kernel the system hangs on boot while loading/initializing the module ipmi_si.
The system is an HP ProLiant G7 MicroServer N54L.

(AMD Turion II Neo N54L, 2x 2.20GHz TDP 25W with AMD RS785E/SB820M)

Additional info:
Excerpt from the boot log:

Feb 07 11:41:05 alarm kernel: IPMI System Interface driver.
Feb 07 11:41:05 alarm kernel: ipmi_si dmi-ipmi-si.0: ipmi_platform: probing via SMBIOS
Feb 07 11:41:05 alarm kernel: ipmi_si: SMBIOS: io 0xca8 regsize 1 spacing 1 irq 0
Feb 07 11:41:05 alarm kernel: ipmi_si: Adding SMBIOS-specified kcs state machine
Feb 07 11:41:05 alarm kernel: ipmi_platform: probing via SPMI
Feb 07 11:41:05 alarm kernel: ipmi_si: SPMI: mem 0x0 regsize 1 spacing 1 irq 0
Feb 07 11:41:05 alarm kernel: ipmi_si: Adding SPMI-specified smic state machine
Feb 07 11:41:05 alarm kernel: ipmi_si: Trying SMBIOS-specified kcs state machine at i/o address 0xca8, slave address 0x20, irq 0
Feb 07 11:41:05 alarm kernel: ipmi_si dmi-ipmi-si.0: Interface detection failed
Feb 07 11:41:05 alarm kernel: raid6: sse2x1 xor() 2894 MB/s
Feb 07 11:41:05 alarm kernel: raid6: sse2x2 gen() 5000 MB/s
Feb 07 11:41:05 alarm systemd[1]: Started Monitoring of LVM2 mirrors, snapshots etc. using dmeventd or progress polling.
Feb 07 11:41:05 alarm systemd[1]: Reached target Local File Systems (Pre).
Feb 07 11:41:05 alarm kernel: raid6: sse2x2 xor() 4974 MB/s
Feb 07 11:41:05 alarm kernel: raid6: sse2x4 gen() 5808 MB/s
Feb 07 11:41:05 alarm kernel: raid6: sse2x4 xor() 3130 MB/s
Feb 07 11:41:05 alarm kernel: raid6: using algorithm sse2x4 gen() 5808 MB/s
Feb 07 11:41:05 alarm kernel: raid6: .... xor() 3130 MB/s, rmw enabled
Feb 07 11:41:05 alarm kernel: raid6: using intx1 recovery algorithm
Feb 07 11:41:05 alarm kernel: async_tx: api initialized (async)
Feb 07 11:41:05 alarm kernel: xor: measuring software checksum speed
Feb 07 11:41:05 alarm kernel: ipmi_si: Trying SPMI-specified smic state machine at mem address 0x0, slave address 0x0, irq 0
Feb 07 11:41:05 alarm kernel: ipmi_si ipmi_si.0: Could not set up I/O space
Feb 07 11:41:05 alarm kernel: prefetch64-sse: 8632.000 MB/sec
Feb 07 11:41:05 alarm kernel: generic_sse: 8204.000 MB/sec
Feb 07 11:41:05 alarm kernel: xor: using function: prefetch64-sse (8632.000 MB/sec)
Feb 07 11:41:05 alarm kernel: BUG: unable to handle kernel NULL pointer dereference at 0000000000000070
Feb 07 11:41:05 alarm kernel: IP: kernfs_find_ns+0x11/0xb0
Feb 07 11:41:05 alarm kernel: PGD 0 P4D 0
Feb 07 11:41:05 alarm kernel: Oops: 0000 [#1] SMP NOPTI
Feb 07 11:41:05 alarm kernel: Modules linked in: async_xor drm_kms_helper(+) xor async_tx ipmi_si(+) raid6_pq irqbypass tg3(+) drm ipmi_devintf md_mod ipmi_msghandler ptp pps_core k10temp joydev sp5100_tco libphy i2c_piix4 mousedev input_leds led_class agpgart syscopyarea
Feb 07 11:41:05 alarm kernel: CPU: 1 PID: 243 Comm: systemd-udevd Not tainted 4.15.1-2-ARCH #1
Feb 07 11:41:05 alarm kernel: Hardware name: HP ProLiant MicroServer, BIOS O41 07/29/2011
Feb 07 11:41:05 alarm kernel: RIP: 0010:kernfs_find_ns+0x11/0xb0
Feb 07 11:41:05 alarm kernel: RSP: 0018:ffff9324406bbb28 EFLAGS: 00010246
Feb 07 11:41:05 alarm kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000004000
Feb 07 11:41:05 alarm kernel: RDX: 0000000000000000 RSI: ffffffffa0c87ee8 RDI: 0000000000000000
Feb 07 11:41:05 alarm kernel: RBP: ffffffffa0c87ee8 R08: 0000000000026120 R09: ffffffffc0642a97
Feb 07 11:41:05 alarm kernel: R10: ffffefb601fdad80 R11: 000000000000000c R12: 0000000000000000
Feb 07 11:41:05 alarm kernel: R13: ffff8f63bab31980 R14: 0000000000000000 R15: 0000000000000000
Feb 07 11:41:05 alarm kernel: FS: 00007f3761ab18c0(0000) GS:ffff8f63be280000(0000) knlGS:0000000000000000
Feb 07 11:41:05 alarm kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 07 11:41:05 alarm kernel: CR2: 0000000000000070 CR3: 000000007a9ec000 CR4: 00000000000006e0
Feb 07 11:41:05 alarm kernel: Call Trace:
Feb 07 11:41:05 alarm kernel: kernfs_find_and_get_ns+0x2c/0x50
Feb 07 11:41:05 alarm kernel: sysfs_unmerge_group+0x18/0x60
Feb 07 11:41:05 alarm kernel: dpm_sysfs_remove+0x1d/0x60
Feb 07 11:41:05 alarm kernel: device_del+0x56/0x350
Feb 07 11:41:05 alarm kernel: platform_device_del.part.3+0x1e/0x70
Feb 07 11:41:05 alarm kernel: platform_device_unregister+0x13/0x20
Feb 07 11:41:05 alarm kernel: try_smi_init+0x8c8/0xdde [ipmi_si]
Feb 07 11:41:05 alarm kernel: init_ipmi_si+0x180/0x1b0 [ipmi_si]
Feb 07 11:41:05 alarm kernel: ? ipmi_si_add_smi+0x220/0x220 [ipmi_si]
Feb 07 11:41:05 alarm kernel: do_one_initcall+0x4e/0x190
Feb 07 11:41:05 alarm kernel: ? _cond_resched+0x15/0x40
Feb 07 11:41:05 alarm kernel: ? kmem_cache_alloc_trace+0xaa/0x1b0
Feb 07 11:41:05 alarm kernel: ? do_init_module+0x22/0x1fb
Feb 07 11:41:05 alarm kernel: do_init_module+0x5b/0x1fb
Feb 07 11:41:05 alarm kernel: load_module+0x259c/0x2aa0
Feb 07 11:41:05 alarm kernel: ? SyS_init_module+0x168/0x1a0
Feb 07 11:41:05 alarm kernel: ? _cond_resched+0x15/0x40
Feb 07 11:41:05 alarm kernel: SyS_init_module+0x168/0x1a0
Feb 07 11:41:05 alarm kernel: do_syscall_64+0x67/0x1a0
Feb 07 11:41:05 alarm kernel: entry_SYSCALL64_slow_path+0x25/0x25
Feb 07 11:41:05 alarm kernel: RIP: 0033:0x7f37613d46ca
Feb 07 11:41:05 alarm kernel: RSP: 002b:00007fff77e356c8 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
Feb 07 11:41:05 alarm kernel: RAX: ffffffffffffffda RBX: 000055d0f8938300 RCX: 00007f37613d46ca
Feb 07 11:41:05 alarm kernel: RDX: 00007f3760c8dcb5 RSI: 0000000000020830 RDI: 000055d0f917c9a0
Feb 07 11:41:05 alarm kernel: RBP: 00007f3760c8dcb5 R08: 0000000000000005 R09: 00007fff77e33d90
Feb 07 11:41:05 alarm kernel: R10: 0000000000000005 R11: 0000000000000246 R12: 000055d0f917c9a0
Feb 07 11:41:05 alarm kernel: R13: 000055d0f891fb00 R14: 0000000000020000 R15: 00007fff77e361c0
Feb 07 11:41:05 alarm kernel: Code: 41 5f c3 48 83 fa 01 0f 85 b0 fe ff ff e9 43 fe ff ff 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 41 55 41 54 48 85 d2 55 53 0f 95 c1 <0f> b7 47 70 49 89 d4 49 89 f5 66 83 e0 20 0f 95 c2 38 d1 75 4f
Feb 07 11:41:05 alarm kernel: RIP: kernfs_find_ns+0x11/0xb0 RSP: ffff9324406bbb28
Feb 07 11:41:05 alarm kernel: CR2: 0000000000000070
Feb 07 11:41:05 alarm kernel: ---[ end trace ed219fb2ca0aee2b ]---
Feb 07 11:41:05 alarm systemd-udevd[232]: worker [243] terminated by signal 9 (KILL)
Feb 07 11:41:05 alarm systemd-udevd[232]: worker [243] failed while handling '/devices/platform/dmi-ipmi-si.0'


Steps to reproduce:
Install the current kernel on an HP ProLiant G7 MicroServer N54L.
This task depends upon

Closed by  freswa (frederik)
Sunday, 13 September 2020, 14:04 GMT
Reason for closing:  Works for me
Additional comments about closing:  ipmi_si seems to work fine with recent kernels
Comment by Frederic Lebouc (flbzh) - Friday, 30 March 2018, 10:42 GMT
Hi,

I see the same problem and the only (unacceptable) workaround I have found so far to use a recent kernel is to turn acpi off in the kernel boot line.

Have you found a better solution ?

Thanks,
FL
Comment by loqs (loqs) - Friday, 30 March 2018, 17:53 GMT
https://bbs.archlinux.org/viewtopic.php?id=234231
Issue is waiting for someone affected to bisect the kernel and works with upstream to resolve it.
Comment by Frederic Lebouc (flbzh) - Sunday, 01 April 2018, 12:35 GMT
Thanks,

I the meantime, because I really need latest kernel, I have simply blacklisted this module. 4.15.14-1 boots successfully.

Cheers,
FL

Loading...