FS#26614 - [linux] Dell XPS 17 702X System Hangs Without Information

Attached to Project: Arch Linux
Opened by James Kay (Twey) - Tuesday, 25 October 2011, 15:31 GMT
Last edited by Roman Kyrylych (Romashka) - Tuesday, 27 December 2011, 16:05 GMT
Task Type Bug Report
Category Packages: Extra
Status Closed
Assigned To Tobias Powalowski (tpowa)
Thomas Bächler (brain0)
Architecture x86_64
Severity Critical
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 1
Private No

Details

Description:
The system occasionally (approx. once or twice an hour) completely locks up randomly, without caps-lock lights or responding to SysRq (which is enabled). Sounds being played loop, graphics output freezes & does not change, & input devices have no effect. I did not see any unusual activity in iotop, though due to the unpredictable behaviour of the bug, this would be difficult to capture effectively.

Additional info:
* package version(s)
linux 3.0.7-1
xf86-video-intel 2.16.0-1

* config and/or log files etc.
to attach

Steps to reproduce:
- Boot
- Use system
- Wait
This task depends upon

Closed by  Roman Kyrylych (Romashka)
Tuesday, 27 December 2011, 16:05 GMT
Reason for closing:  Fixed
Additional comments about closing:  works in 3.1.5
Comment by James Kay (Twey) - Tuesday, 25 October 2011, 15:34 GMT
dmesg showing nothing unusual as far as I can tell (this is recovered from a crashed system).

This bug appeared a few (perhaps three?) days ago, & another person (EdwardIII) appears to have started encountering the same problem at around the same time.

A Windows installation on the same system does not appear to evince the problem.
Comment by James Kay (Twey) - Tuesday, 25 October 2011, 15:36 GMT
uname -a:
Linux algiz 3.0-ARCH #1 SMP PREEMPT Wed Oct 19 10:27:51 CEST 2011 x86_64 Intel(R) Core(TM) i7-2630QM CPU @ 2.00GHz GenuineIntel GNU/Linux

lspci:

00:00.0 Host bridge: Intel Corporation 2nd Generation Core Processor Family DRAM Controller (rev 09)
00:01.0 PCI bridge: Intel Corporation 2nd Generation Core Processor Family PCI Express Root Port (rev 09)
00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09)
00:16.0 Communication controller: Intel Corporation 6 Series Chipset Family MEI Controller #1 (rev 04)
00:1a.0 USB Controller: Intel Corporation 6 Series Chipset Family USB Enhanced Host Controller #2 (rev 05)
00:1b.0 Audio device: Intel Corporation 6 Series Chipset Family High Definition Audio Controller (rev 05)
00:1c.0 PCI bridge: Intel Corporation 6 Series Chipset Family PCI Express Root Port 1 (rev b5)
00:1c.1 PCI bridge: Intel Corporation 6 Series Chipset Family PCI Express Root Port 2 (rev b5)
00:1c.3 PCI bridge: Intel Corporation 6 Series Chipset Family PCI Express Root Port 4 (rev b5)
00:1c.4 PCI bridge: Intel Corporation 6 Series Chipset Family PCI Express Root Port 5 (rev b5)
00:1c.5 PCI bridge: Intel Corporation 6 Series Chipset Family PCI Express Root Port 6 (rev b5)
00:1d.0 USB Controller: Intel Corporation 6 Series Chipset Family USB Enhanced Host Controller #1 (rev 05)
00:1f.0 ISA bridge: Intel Corporation HM67 Express Chipset Family LPC Controller (rev 05)
00:1f.2 SATA controller: Intel Corporation 6 Series Chipset Family 6 port SATA AHCI Controller (rev 05)
00:1f.3 SMBus: Intel Corporation 6 Series Chipset Family SMBus Controller (rev 05)
01:00.0 VGA compatible controller: nVidia Corporation Device 0dcd (rev a1)
03:00.0 Network controller: Intel Corporation Centrino Advanced-N 6230 (rev 34)
04:00.0 USB Controller: NEC Corporation uPD720200 USB 3.0 Host Controller (rev 04)
0a:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev 06)

lsmod:

Module Size Used by
ipv6 290983 22
ext2 64314 1
snd_hda_codec_hdmi 22092 1
snd_hda_codec_realtek 294320 1
arc4 1410 2
uvcvideo 64963 0
videodev 78006 1 uvcvideo
snd_hda_intel 22122 0
snd_hda_codec 77927 3 snd_hda_codec_hdmi,snd_hda_codec_realtek,snd_hda_intel
joydev 9895 0
iwlagn 236787 0
mac80211 215908 1 iwlagn
media 10437 2 uvcvideo,videodev
snd_hwdep 6325 1 snd_hda_codec
snd_pcm 73952 3 snd_hda_codec_hdmi,snd_hda_intel,snd_hda_codec
snd_timer 19416 1 snd_pcm
snd 57818 7 snd_hda_codec_hdmi,snd_hda_codec_realtek,snd_hda_intel,snd_hda_codec,snd_hwdep,snd_pcm,snd_timer
soundcore 6146 1 snd
dell_wmi 1517 0
sparse_keymap 3088 1 dell_wmi
i915 707339 2
drm_kms_helper 25409 1 i915
drm 183380 3 i915,drm_kms_helper
dell_laptop 7947 0
i2c_algo_bit 5199 1 i915
intel_agp 10904 1 i915
ecryptfs 90505 0
intel_gtt 14423 3 i915,intel_agp
serio_raw 4294 0
r8169 42643 0
btusb 11577 0
cfg80211 160772 2 iwlagn,mac80211
bluetooth 139297 1 btusb
wmi 8347 1 dell_wmi
snd_page_alloc 7121 2 snd_hda_intel,snd_pcm
v4l2_compat_ioctl32 8292 1 videodev
rfkill 15402 3 dell_laptop,cfg80211,bluetooth
mei 31313 0
processor 24256 0
thermal 7863 0
button 4470 1 i915
ac 2376 0
video 11228 1 i915
battery 6317 0
i2c_i801 8187 0
psmouse 55192 0
evdev 9530 9
acpi_call 4058 0
i2c_core 20133 6 videodev,i915,drm_kms_helper,drm,i2c_algo_bit,i2c_i801
iTCO_wdt 12717 0
mii 3995 1 r8169
iTCO_vendor_support 1929 1 iTCO_wdt
dcdbas 5488 1 dell_laptop
ext4 370462 2
mbcache 5817 2 ext2,ext4
jbd2 71074 1 ext4
crc16 1297 2 bluetooth,ext4
aesni_intel 47826 34
cryptd 8213 9 aesni_intel
aes_x86_64 7476 1 aesni_intel
aes_generic 26106 2 aesni_intel,aes_x86_64
xts 2493 8
gf128mul 5890 1 xts
dm_crypt 15945 1
dm_mod 67038 12 dm_crypt
sr_mod 14951 0
cdrom 36329 1 sr_mod
sd_mod 28307 3
xhci_hcd 70783 0
ahci 21217 2
libahci 18885 1 ahci
libata 173297 2 ahci,libahci
ehci_hcd 39543 0
scsi_mod 131546 3 sr_mod,sd_mod,libata
usbcore 142576 5 uvcvideo,btusb,xhci_hcd,ehci_hcd
Comment by Edward Prendergast (EdwardIII) - Tuesday, 25 October 2011, 15:42 GMT
Same symptoms, machine differs slightly: Samsung R580 laptop.

Tried switching from nvidia proprietary drivers to open source nouveau but no change.

[edward@eddarch ~]$ uname -a
Linux eddarch 3.0-ARCH #1 SMP PREEMPT Wed Oct 19 10:27:51 CEST 2011 x86_64 Intel(R) Core(TM) i3 CPU M 370 @ 2.40GHz GenuineIntel GNU/Linux



[edward@eddarch ~]$ lspci
00:00.0 Host bridge: Intel Corporation Core Processor DRAM Controller (rev 02)
00:01.0 PCI bridge: Intel Corporation Core Processor PCI Express x16 Root Port (rev 02)
00:1a.0 USB Controller: Intel Corporation 5 Series/3400 Series Chipset USB2 Enhanced Host Controller (rev 05)
00:1b.0 Audio device: Intel Corporation 5 Series/3400 Series Chipset High Definition Audio (rev 05)
00:1c.0 PCI bridge: Intel Corporation 5 Series/3400 Series Chipset PCI Express Root Port 1 (rev 05)
00:1c.2 PCI bridge: Intel Corporation 5 Series/3400 Series Chipset PCI Express Root Port 3 (rev 05)
00:1c.3 PCI bridge: Intel Corporation 5 Series/3400 Series Chipset PCI Express Root Port 4 (rev 05)
00:1d.0 USB Controller: Intel Corporation 5 Series/3400 Series Chipset USB2 Enhanced Host Controller (rev 05)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev a5)
00:1f.0 ISA bridge: Intel Corporation Mobile 5 Series Chipset LPC Interface Controller (rev 05)
00:1f.2 SATA controller: Intel Corporation 5 Series/3400 Series Chipset 4 port SATA AHCI Controller (rev 05)
00:1f.3 SMBus: Intel Corporation 5 Series/3400 Series Chipset SMBus Controller (rev 05)
02:00.0 VGA compatible controller: nVidia Corporation GT218 [GeForce 310M] (rev a2)
02:00.1 Audio device: nVidia Corporation High Definition Audio Controller (rev a1)
03:00.0 Network controller: Atheros Communications Inc. AR9285 Wireless Network Adapter (PCI-Express) (rev 01)
07:00.0 Ethernet controller: Marvell Technology Group Ltd. Yukon Optima 88E8059 [PCIe Gigabit Ethernet Controller with AVB] (rev 11)
3f:00.0 Host bridge: Intel Corporation Core Processor QuickPath Architecture Generic Non-core Registers (rev 02)
3f:00.1 Host bridge: Intel Corporation Core Processor QuickPath Architecture System Address Decoder (rev 02)
3f:02.0 Host bridge: Intel Corporation Core Processor QPI Link 0 (rev 02)
3f:02.1 Host bridge: Intel Corporation Core Processor QPI Physical 0 (rev 02)
3f:02.2 Host bridge: Intel Corporation Core Processor Reserved (rev 02)
3f:02.3 Host bridge: Intel Corporation Core Processor Reserved (rev 02)


[edward@eddarch ~]$ lsmod
Module Size Used by
ipv6 290983 39
ext3 128661 1
jbd 48592 1 ext3
snd_hda_codec_hdmi 22092 4
joydev 9895 0
uvcvideo 64963 0
videodev 78006 1 uvcvideo
media 10437 2 uvcvideo,videodev
v4l2_compat_ioctl32 8292 1 videodev
snd_hda_codec_realtek 294320 1
nouveau 698547 2
ttm 54360 1 nouveau
snd_hda_intel 22122 2
snd_hda_codec 77927 3 snd_hda_codec_hdmi,snd_hda_codec_realtek,snd_hda_intel
serio_raw 4294 0
psmouse 55192 0
snd_hwdep 6325 1 snd_hda_codec
snd_pcm 73952 3 snd_hda_codec_hdmi,snd_hda_intel,snd_hda_codec
snd_timer 19416 1 snd_pcm
snd 57818 11 snd_hda_codec_hdmi,snd_hda_codec_realtek,snd_hda_intel,snd_hda_codec,snd_hwdep,snd_pcm,snd_timer
drm_kms_helper 25409 1 nouveau
pcspkr 1819 0
drm 183380 4 nouveau,ttm,drm_kms_helper
i2c_algo_bit 5199 1 nouveau
sky2 46875 0
arc4 1410 2
soundcore 6146 1 snd
snd_page_alloc 7121 2 snd_hda_intel,snd_pcm
iTCO_wdt 12717 0
i2c_i801 8187 0
mxm_wmi 1393 1 nouveau
intel_agp 10904 0
intel_gtt 14423 1 intel_agp
ath9k 86568 0
evdev 9530 7
mac80211 215908 1 ath9k
ath9k_common 1770 1 ath9k
ath9k_hw 275596 2 ath9k,ath9k_common
ath 14667 2 ath9k,ath9k_hw
wmi 8347 1 mxm_wmi
i2c_core 20133 6 videodev,nouveau,drm_kms_helper,drm,i2c_algo_bit,i2c_i801
thermal 7863 0
fan 2426 0
btusb 11577 0
bluetooth 139297 1 btusb
processor 24256 0
video 11228 1 nouveau
iTCO_vendor_support 1929 1 iTCO_wdt
cfg80211 160772 3 ath9k,mac80211,ath
rfkill 15402 2 bluetooth,cfg80211
ac 2376 0
button 4470 1 nouveau
battery 6317 0
ext4 370462 1
mbcache 5817 2 ext3,ext4
jbd2 71074 1 ext4
crc16 1297 2 bluetooth,ext4
sd_mod 28307 4
sr_mod 14951 0
cdrom 36329 1 sr_mod
ahci 21217 3
libahci 18885 1 ahci
libata 173297 2 ahci,libahci
ehci_hcd 39543 0
scsi_mod 131546 3 sd_mod,sr_mod,libata
usbcore 142576 4 uvcvideo,btusb,ehci_hcd
Comment by Matt Earnshaw (mearnshaw) - Tuesday, 25 October 2011, 18:19 GMT
Also encountering this on my laptop.

linux 3.0.7-1
xf86-video-intel 2.16.0-1

Linux matt-earnshaw 3.0-ARCH #1 SMP PREEMPT Wed Oct 19 10:27:51 CEST 2011 x86_64 Intel(R) Core(TM) i3-2310M CPU @ 2.10GHz GenuineIntel GNU/Linux

No clues in /var/log/everything.log. Hangs with symptoms and frequency as described by Twey.

lsmod:
Module Size Used by
fuse 67290 3
ipv6 290983 24
cryptd 8213 0
aes_x86_64 7476 1
aes_generic 26106 1 aes_x86_64
ext3 128661 2
jbd 48592 1 ext3
snd_hda_codec_hdmi 22092 1
snd_hda_codec_conexant 46356 1
joydev 9895 0
btusb 11577 0
bluetooth 139297 1 btusb
uvcvideo 64963 0
videodev 78006 1 uvcvideo
media 10437 2 uvcvideo,videodev
v4l2_compat_ioctl32 8292 1 videodev
snd_hda_intel 22122 0
snd_hda_codec 77927 3 snd_hda_codec_hdmi,snd_hda_codec_conexant,snd_hda_intel
snd_hwdep 6325 1 snd_hda_codec
snd_pcm 73952 3 snd_hda_codec_hdmi,snd_hda_intel,snd_hda_codec
arc4 1410 2
snd_timer 19416 1 snd_pcm
i2c_i801 8187 0
i915 707339 2
drm_kms_helper 25409 1 i915
drm 183380 3 i915,drm_kms_helper
iwlagn 236787 0
i2c_algo_bit 5199 1 i915
thinkpad_acpi 62743 0
mac80211 215908 1 iwlagn
snd 57818 8 snd_hda_codec_hdmi,snd_hda_codec_conexant,snd_hda_intel,snd_hda_codec,snd_hwdep,snd_pcm,snd_timer,thinkpad_acpi
i2c_core 20133 6 videodev,i2c_i801,i915,drm_kms_helper,drm,i2c_algo_bit
serio_raw 4294 0
psmouse 55192 0
sg 25557 0
sdhci_pci 8530 0
sdhci 22194 1 sdhci_pci
iTCO_wdt 12717 0
soundcore 6146 1 snd
iTCO_vendor_support 1929 1 iTCO_wdt
e1000e 142545 0
evdev 9530 8
cfg80211 160772 2 iwlagn,mac80211
snd_page_alloc 7121 2 snd_hda_intel,snd_pcm
nvram 5805 1 thinkpad_acpi
mei 31313 0
intel_agp 10904 1 i915
intel_gtt 14423 3 i915,intel_agp
mmc_core 73682 1 sdhci
wmi 8347 0
video 11228 1 i915
thermal 7863 0
battery 6317 0
ac 2376 0
processor 24256 0
rfkill 15402 3 bluetooth,thinkpad_acpi,cfg80211
tpm_tis 8193 0
tpm 11653 1 tpm_tis
tpm_bios 5057 1 tpm
button 4470 1 i915
cpufreq_conservative 5329 0
tp_smapi 20651 0
thinkpad_ec 4189 1 tp_smapi
ext4 370462 2
mbcache 5817 2 ext3,ext4
jbd2 71074 1 ext4
crc16 1297 2 bluetooth,ext4
sd_mod 28307 6
ahci 21217 5
libahci 18885 1 ahci
libata 173297 2 ahci,libahci
ehci_hcd 39543 0
usbcore 142576 4 btusb,uvcvideo,ehci_hcd
scsi_mod 131546 3 sg,sd_mod,libata

lspci
00:00.0 Host bridge: Intel Corporation 2nd Generation Core Processor Family DRAM Controller (rev 09)
00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09)
00:16.0 Communication controller: Intel Corporation 6 Series Chipset Family MEI Controller #1 (rev 04)
00:19.0 Ethernet controller: Intel Corporation 82579LM Gigabit Network Connection (rev 04)
00:1a.0 USB Controller: Intel Corporation 6 Series Chipset Family USB Enhanced Host Controller #2 (rev 04)
00:1b.0 Audio device: Intel Corporation 6 Series Chipset Family High Definition Audio Controller (rev 04)
00:1c.0 PCI bridge: Intel Corporation 6 Series Chipset Family PCI Express Root Port 1 (rev b4)
00:1c.1 PCI bridge: Intel Corporation 6 Series Chipset Family PCI Express Root Port 2 (rev b4)
00:1c.3 PCI bridge: Intel Corporation 6 Series Chipset Family PCI Express Root Port 4 (rev b4)
00:1c.4 PCI bridge: Intel Corporation 6 Series Chipset Family PCI Express Root Port 5 (rev b4)
00:1d.0 USB Controller: Intel Corporation 6 Series Chipset Family USB Enhanced Host Controller #1 (rev 04)
00:1f.0 ISA bridge: Intel Corporation 6 Series Chipset Family LPC Controller (rev 04)
00:1f.2 SATA controller: Intel Corporation 6 Series Chipset Family 6 port SATA AHCI Controller (rev 04)
00:1f.3 SMBus: Intel Corporation 6 Series Chipset Family SMBus Controller (rev 04)
03:00.0 Network controller: Intel Corporation Centrino Wireless-N 1000
0d:00.0 System peripheral: Ricoh Co Ltd Device e823 (rev 04)
Comment by Jelle van der Waa (jelly) - Wednesday, 26 October 2011, 09:01 GMT
This sounds like not a bug in archlinux, but maybe in the kernel or your gfx drivers.
So what can you do.

a) Run the laptop without X
b) is it not overheating?
c) Downgrade a kernel version? (Just one)
Comment by Matt Earnshaw (mearnshaw) - Wednesday, 26 October 2011, 16:16 GMT
Downgraded to linux 3.0.6-2 for the time being, everything seems stable so far.
Comment by Jelle van der Waa (jelly) - Wednesday, 26 October 2011, 17:20 GMT
Then it might be a kernel issue, PLEASE report it at lkml mailing list
Comment by Matt Earnshaw (mearnshaw) - Saturday, 29 October 2011, 18:07 GMT
Ok, I reported this at lkml (https://lkml.org/lkml/2011/10/29/77 ... hope I did it right.)
We'll see what happens.

Edit:
LKML suggested that this patch may fix the problem.
https://lkml.org/lkml/2011/10/18/100

I have not tested it.
Comment by jstjohn (jstjohn) - Thursday, 03 November 2011, 05:03 GMT
I have also experienced this problem two times, one of which was around an hour ago. Both times I was working on things in Chromium (which I don't believe is related to this bug) and the system was 100% non-responsive to any sort of control input. Both times I was able to hear my CPU fans ramp up to full speed, indicating that there was probably some sort of thermal event or some process(es) pegging my CPU to full load. I have a Dell Inspiron 1764 with an Intel Core i5 M430 (stepping 2) with integrated graphics.

I first noticed this issue sometime after 2011-10-24 because I remember being uneasy about an upgrade to a pre-release version of Xorg (which was released to [extra] on 2011-10-24) and made sure to pay attention to lock-ups I might have after that upgrade.

I am running linux 3.0.7-1, xorg-server 1.11.1.902-1, and xf86-video-intel 2.16.0-1.

I have not yet tested the patch that Matt Earnshaw linked to from the LKML.
Comment by Matt Earnshaw (mearnshaw) - Sunday, 18 December 2011, 17:17 GMT
I have been running 3.1.5-1 without any problems. I suggest those encountering this problem upgrade past 3.0.7-1 to the latest kernel (or downgrade past 3.0.7-1) as this was almost certainly a kernel issue.

Loading...