FS#40603 - [linux] 3.14.x Random kernel panics on Lenovo ThinkPad T500 (Fatal exception in interrupt).

Attached to Project: Arch Linux
Opened by archuser_4573 (archuser_4573) - Thursday, 29 May 2014, 23:00 GMT
Last edited by Tobias Powalowski (tpowa) - Monday, 16 June 2014, 20:24 GMT
Task Type Bug Report
Category Kernel
Status Closed
Assigned To Tobias Powalowski (tpowa)
Thomas Bächler (brain0)
Architecture x86_64
Severity High
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 0
Private No

Details

Description:

- Kernel panic occurs randomly within 1 - 2 hours on Lenovo ThinkPad T500 (Fatal exception in interrupt).
- memtest86+ reports no errors (2 passes completed).
- I even tried changing thermal paste on CPU and ATI/Intel switchable graphics without major improvements.
- Can't get Kdump working properly yet. Crash kernel doesn't load after echo c > /proc/sysrq-trigger (followed https://wiki.archlinux.org/index.php/Kdump). Reading Linux Kernel Crash Book by Dedoimedo right now.

Additional info:
* package version(s)

cat /proc/version
Linux version 3.14.4-1-ARCH (nobody@var-lib-archbuild-testing-x86_64-tobias) (gcc version 4.9.0 20140507 (prerelease) (GCC) ) #1 SMP PREEMPT Tue May 13 16:41:39 CEST 2014

* config and/or log files etc.

lsmod
Module Size Used by
reiserfs 239818 1
xt_CHECKSUM 1231 1
iptable_mangle 1616 1
ipt_MASQUERADE 2186 3
iptable_nat 3454 1
nf_nat_ipv4 3728 1 iptable_nat
nf_nat 13005 3 ipt_MASQUERADE,nf_nat_ipv4,iptable_nat
ipt_REJECT 2505 2
xt_tcpudp 3207 6
tun 20995 1
bridge 99735 0
stp 1653 1 bridge
llc 3729 2 stp,bridge
ctr 3927 3
ccm 8278 3
xt_LOG 13124 1
nf_conntrack_ipv4 9474 3
nf_defrag_ipv4 1499 1 nf_conntrack_ipv4
xt_conntrack 3425 2
nf_conntrack 75656 6 ipt_MASQUERADE,nf_nat,nf_nat_ipv4,xt_conntrack,iptable_nat,nf_conntrack_ipv4
iptable_filter 1552 1
ip_tables 17923 3 iptable_filter,iptable_mangle,iptable_nat
x_tables 17344 9 xt_CHECKSUM,ip_tables,xt_tcpudp,ipt_MASQUERADE,xt_conntrack,xt_LOG,iptable_filter,ipt_REJECT,iptable_mangle
btusb 19720 0
bluetooth 352753 2 btusb
6lowpan_iphc 11556 1 bluetooth
iTCO_wdt 5535 0
joydev 10367 0
mousedev 10912 0
ppdev 7278 0
iTCO_vendor_support 1929 1 iTCO_wdt
coretemp 6550 0
kvm_intel 134316 0
kvm 419846 1 kvm_intel
evdev 11784 14
microcode 17157 0
mac_hid 3273 0
psmouse 92968 0
serio_raw 5009 0
arc4 2064 2
i2c_i801 11364 0
iwldvm 171439 0
pcmcia 46612 0
mac80211 510593 1 iwldvm
radeon 1330458 4
r852 9777 0
sm_common 8709 1 r852
nand 55793 2 r852,sm_common
nand_ecc 3679 1 nand
nand_ids 5745 1 nand
mtd 41348 2 nand,sm_common
ttm 66913 1 radeon
lpc_ich 13560 0
drm_kms_helper 35720 1 radeon
snd_hda_codec_conexant 38779 1
drm 242043 6 ttm,drm_kms_helper,radeon
i2c_algo_bit 5480 1 radeon
i2c_core 25400 5 drm,i2c_i801,drm_kms_helper,i2c_algo_bit,radeon
r592 11983 0
snd_hda_codec_generic 53860 2 snd_hda_codec_conexant
memstick 7664 1 r592
yenta_socket 34233 0
pcmcia_rsrc 9392 1 yenta_socket
pcmcia_core 14655 3 pcmcia,pcmcia_rsrc,yenta_socket
iwlwifi 151777 1 iwldvm
cfg80211 459335 3 iwlwifi,mac80211,iwldvm
thinkpad_acpi 65040 0
snd_hda_intel 38728 2
nvram 6034 1 thinkpad_acpi
snd_hda_codec 101816 3 snd_hda_codec_conexant,snd_hda_codec_generic,snd_hda_intel
snd_hwdep 6396 1 snd_hda_codec
snd_pcm 81607 2 snd_hda_codec,snd_hda_intel
snd_timer 19038 1 snd_pcm
snd 60086 12 snd_hwdep,snd_timer,snd_hda_codec_conexant,snd_pcm,snd_hda_codec_generic,snd_hda_codec,snd_hda_intel,thinkpad_acpi
soundcore 5551 1 snd
parport_pc 20023 0
thermal 8812 0
battery 7821 0
wmi 8539 0
shpchp 25706 0
parport 30901 2 ppdev,parport_pc
e1000e 228148 0
rfkill 15971 4 cfg80211,thinkpad_acpi,bluetooth
hwmon 3153 3 coretemp,thinkpad_acpi,radeon
ptp 8404 1 e1000e
pps_core 8993 1 ptp
intel_agp 11504 0
ac 3366 0
mei_me 9904 0
mei 65600 1 mei_me
intel_gtt 12856 1 intel_agp
tpm_tis 9310 0
tpm 23363 1 tpm_tis
video 12057 0
button 4765 0
acpi_cpufreq 10170 1
processor 25217 3 acpi_cpufreq
pci_stub 1381 1
vboxpci 14995 0
vboxnetflt 17700 0
vboxnetadp 18547 0
vboxdrv 278190 3 vboxnetadp,vboxnetflt,vboxpci
ext4 505509 4
crc16 1359 2 ext4,bluetooth
mbcache 6266 1 ext4
jbd2 86487 1 ext4
dm_mod 85256 15
hid_generic 1217 0
usbhid 41089 0
hid 92246 2 hid_generic,usbhid
sd_mod 37234 4
sr_mod 15026 0
crc_t10dif 1135 1 sd_mod
cdrom 35191 1 sr_mod
crct10dif_common 1436 1 crc_t10dif
mmc_block 27169 2
atkbd 16934 0
libps2 4507 2 atkbd,psmouse
sdhci_pci 12475 0
firewire_ohci 33053 0
sdhci 29492 1 sdhci_pci
led_class 3611 3 sdhci,iwldvm,thinkpad_acpi
mmc_core 100418 3 mmc_block,sdhci,sdhci_pci
firewire_core 53548 1 firewire_ohci
crc_itu_t 1363 1 firewire_core
ahci 24107 3
libahci 21708 1 ahci
libata 174140 2 ahci,libahci
scsi_mod 137184 3 libata,sd_mod,sr_mod
ehci_pci 4152 0
uhci_hcd 34795 0
ehci_hcd 64875 1 ehci_pci
usbcore 187240 5 btusb,uhci_hcd,ehci_hcd,ehci_pci,usbhid
usb_common 1712 1 usbcore
i8042 13135 1 libps2
serio 10785 7 serio_raw,atkbd,i8042,psmouse

---------------

lspci
00:00.0 Host bridge: Intel Corporation Mobile 4 Series Chipset Memory Controller Hub (rev 07)
00:01.0 PCI bridge: Intel Corporation Mobile 4 Series Chipset PCI Express Graphics Port (rev 07)
00:03.0 Communication controller: Intel Corporation Mobile 4 Series Chipset MEI Controller (rev 07)
00:19.0 Ethernet controller: Intel Corporation 82567LM Gigabit Network Connection (rev 03)
00:1a.0 USB controller: Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #4 (rev 03)
00:1a.1 USB controller: Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #5 (rev 03)
00:1a.2 USB controller: Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #6 (rev 03)
00:1a.7 USB controller: Intel Corporation 82801I (ICH9 Family) USB2 EHCI Controller #2 (rev 03)
00:1b.0 Audio device: Intel Corporation 82801I (ICH9 Family) HD Audio Controller (rev 03)
00:1c.0 PCI bridge: Intel Corporation 82801I (ICH9 Family) PCI Express Port 1 (rev 03)
00:1c.1 PCI bridge: Intel Corporation 82801I (ICH9 Family) PCI Express Port 2 (rev 03)
00:1c.3 PCI bridge: Intel Corporation 82801I (ICH9 Family) PCI Express Port 4 (rev 03)
00:1c.4 PCI bridge: Intel Corporation 82801I (ICH9 Family) PCI Express Port 5 (rev 03)
00:1d.0 USB controller: Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #1 (rev 03)
00:1d.1 USB controller: Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #2 (rev 03)
00:1d.2 USB controller: Intel Corporation 82801I (ICH9 Family) USB UHCI Controller #3 (rev 03)
00:1d.7 USB controller: Intel Corporation 82801I (ICH9 Family) USB2 EHCI Controller #1 (rev 03)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev 93)
00:1f.0 ISA bridge: Intel Corporation ICH9M-E LPC Interface Controller (rev 03)
00:1f.2 SATA controller: Intel Corporation 82801IBM/IEM (ICH9M/ICH9M-E) 4 port SATA Controller [AHCI mode] (rev 03)
00:1f.3 SMBus: Intel Corporation 82801I (ICH9 Family) SMBus Controller (rev 03)
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] RV635/M86 [Mobility Radeon HD 3650]
03:00.0 Network controller: Intel Corporation PRO/Wireless 5100 AGN [Shiloh] Network Connection
15:00.0 CardBus bridge: Ricoh Co Ltd RL5c476 II (rev ba)
15:00.1 FireWire (IEEE 1394): Ricoh Co Ltd R5C832 IEEE 1394 Controller (rev 04)
15:00.2 SD Host controller: Ricoh Co Ltd R5C822 SD/SDIO/MMC/MS/MSPro Host Adapter (rev 21)
15:00.4 System peripheral: Ricoh Co Ltd R5C592 Memory Stick Bus Host Adapter (rev 11)
15:00.5 System peripheral: Ricoh Co Ltd xD-Picture Card Controller (rev 11)


Steps to reproduce:

Not reproducible kernel panic.
This task depends upon

Closed by  Tobias Powalowski (tpowa)
Monday, 16 June 2014, 20:24 GMT
Reason for closing:  Upstream
Comment by Anatol Pomozov (anatolik) - Sunday, 01 June 2014, 17:15 GMT
It sounds like an USB stack kernel bug. I would suggest to report the problem to this maillist http://vger.kernel.org/vger-lists.html#linux-usb

PS KDump would be useful indeed. Could you open a topic at Arch forum about KDump problems? I'll try to help you to setup it.
Comment by archuser_4573 (archuser_4573) - Tuesday, 03 June 2014, 00:32 GMT
Thanks for your comment.

This problem started after updating to 3.14.4-1 kernel. I will report this bug to http://vger.kernel.org/vger-lists.html#linux-usb as soon as possible.

I opened a topic about my Kdump issue: https://bbs.archlinux.org/viewtopic.php?id=182324. I would very much appreciate your help.
Comment by Tobias Powalowski (tpowa) - Monday, 16 June 2014, 07:39 GMT
You could try 3.15 from testing repository, else please report upstream.
Comment by archuser_4573 (archuser_4573) - Monday, 16 June 2014, 20:02 GMT
Thanks for the comment.

My Kdump problems are solved now, but I'm not able to reproduce the kernel panic on my custom build kernel. Something is different between two kernels.

I reported this bug upstream: https://bugzilla.kernel.org/show_bug.cgi?id=78131

Loading...