FS#41443 - [xf86-video-intel] Occasional X server crashes since 1.16 update

Attached to Project: Arch Linux
Opened by John Lindgren (jlindgren) - Sunday, 03 August 2014, 03:34 GMT
Last edited by Laurent Carlier (lordheavy) - Saturday, 13 September 2014, 18:20 GMT
Task Type Bug Report
Category Packages: Extra
Status Closed
Assigned To Jan de Groot (JGC)
Andreas Radke (AndyRTR)
Laurent Carlier (lordheavy)
Architecture x86_64
Severity Medium
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 1
Private No

Details

Description:

I'm not able to say what triggers the crash as it's happened just twice or three times since the recent X.org 1.16 update, but the backtrace is the same each time:

[ 38.448] (EE) Backtrace:
[ 38.450] (EE) 0: /usr/bin/Xorg.bin (xorg_backtrace+0x56) [0x593966]
[ 38.450] (EE) 1: /usr/bin/Xorg.bin (0x400000+0x197b69) [0x597b69]
[ 38.450] (EE) 2: /usr/lib/libc.so.6 (0x7f8e21202000+0x33df0) [0x7f8e21235df0]
[ 38.450] (EE) 3: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7f8e1cc73000+0x1076b0) [0x7f8e1cd7a6b0]
[ 38.450] (EE) 4: /usr/bin/Xorg.bin (0x400000+0x162f83) [0x562f83]
[ 38.450] (EE) 5: /usr/bin/Xorg.bin (0x400000+0x163dbb) [0x563dbb]
[ 38.450] (EE) 6: /usr/bin/Xorg.bin (DRI2GetBuffersWithFormat+0xb) [0x5641eb]
[ 38.450] (EE) 7: /usr/bin/Xorg.bin (0x400000+0x165c8b) [0x565c8b]
[ 38.450] (EE) 8: /usr/bin/Xorg.bin (0x400000+0x376d7) [0x4376d7]
[ 38.450] (EE) 9: /usr/bin/Xorg.bin (0x400000+0x3b866) [0x43b866]
[ 38.450] (EE) 10: /usr/lib/libc.so.6 (__libc_start_main+0xf0) [0x7f8e21222000]
[ 38.450] (EE) 11: /usr/bin/Xorg.bin (0x400000+0x25d0e) [0x425d0e]
[ 38.450] (EE)
[ 38.450] (EE) Segmentation fault at address 0x10

Searching for "DRI2GetBuffersWithFormat" produced a few old bug reports, but nothing recent/open.

This is on a Dell Latitude E6430 laptop with hybrid Intel/Nvidia graphics. I believe the Intel chip is an HD 4000, not sure what the NVidia chip is exactly (I disable it at boot time with bbswitch). I have also been connecting an external VGA monitor lately, not sure if that's related.

Additional info:
* package version(s)
xf86-video-intel 2.99.914-3
xorg-server 1.16.0-6
linux 3.15.8-1
bbswitch 0.8-14

* config and/or log files etc.
See attached logs.

Steps to reproduce:
Unknown
This task depends upon

Closed by  Laurent Carlier (lordheavy)
Saturday, 13 September 2014, 18:20 GMT
Reason for closing:  Fixed
Additional comments about closing:  xf86-video-intel-2.99.916-1
Comment by Jeremy LaCroix (jlacroix) - Monday, 04 August 2014, 17:02 GMT
Same problem here. Randomly X will crash and I'll see my login manager rather than what I was working on. I've disabled xscreensaver and that might have made it more stable, but it's too soon for me to tell if removing xscreensaver helped or if it just hasn't decided to crash since. The longest I've gone is 24 hours without a crash so far.
Comment by Kevin Brodsky (Corax) - Thursday, 07 August 2014, 21:57 GMT
Same problem here too, with an Optimus laptop (Intel HD4600/NVidia GTX 765). Same package versions except linux (3.15.7 for now). However the stack trace is different, PictureDestroyWindow this time:

[ 92635.603] (EE) Backtrace:
[ 92635.605] (EE) 0: /usr/bin/Xorg.bin (xorg_backtrace+0x56) [0x593966]
[ 92635.605] (EE) 1: /usr/bin/Xorg.bin (0x400000+0x197b69) [0x597b69]
[ 92635.605] (EE) 2: /usr/lib/libc.so.6 (0x7fce0dcd1000+0x33df0) [0x7fce0dd04df0]
[ 92635.605] (EE) 3: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7fce09742000+0x10a3f8) [0x7fce0984c3f8]
[ 92635.605] (EE) 4: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7fce09742000+0x2f641) [0x7fce09771641]
[ 92635.605] (EE) 5: /usr/bin/Xorg.bin (PictureDestroyWindow+0xd1) [0x509131]
[ 92635.605] (EE) 6: /usr/bin/Xorg.bin (0x400000+0x1192e8) [0x5192e8]
[ 92635.605] (EE) 7: /usr/bin/Xorg.bin (0x400000+0xd96d9) [0x4d96d9]
[ 92635.605] (EE) 8: /usr/bin/Xorg.bin (0x400000+0x114add) [0x514add]
[ 92635.605] (EE) 9: /usr/bin/Xorg.bin (0x400000+0xc94d7) [0x4c94d7]
[ 92635.605] (EE) 10: /usr/bin/Xorg.bin (0x400000+0xef030) [0x4ef030]
[ 92635.605] (EE) 11: /usr/bin/Xorg.bin (0x400000+0x6197e) [0x46197e]
[ 92635.605] (EE) 12: /usr/bin/Xorg.bin (DeleteWindow+0x232) [0x4645c2]
[ 92635.605] (EE) 13: /usr/bin/Xorg.bin (0x400000+0x5a622) [0x45a622]
[ 92635.605] (EE) 14: /usr/bin/Xorg.bin (FreeResource+0x9c) [0x45b15c]
[ 92635.605] (EE) 15: /usr/bin/Xorg.bin (0x400000+0x31cbf) [0x431cbf]
[ 92635.605] (EE) 16: /usr/bin/Xorg.bin (0x400000+0x376d7) [0x4376d7]
[ 92635.605] (EE) 17: /usr/bin/Xorg.bin (0x400000+0x3b866) [0x43b866]
[ 92635.605] (EE) 18: /usr/lib/libc.so.6 (__libc_start_main+0xf0) [0x7fce0dcf1000]
[ 92635.605] (EE) 19: /usr/bin/Xorg.bin (0x400000+0x25d0e) [0x425d0e]
[ 92635.605] (EE)
[ 92635.605] (EE) Segmentation fault at address 0x74

It occurs randomly too, I think I've had it twice in 5 days so it's pretty hard to reproduce...
Comment by John Lindgren (jlindgren) - Thursday, 07 August 2014, 23:33 GMT
Just crashed again for me with a different backtrace:

[ 75704.559] (EE) Backtrace:
[ 75704.563] (EE) 0: /usr/bin/Xorg.bin (xorg_backtrace+0x56) [0x593966]
[ 75704.563] (EE) 1: /usr/bin/Xorg.bin (0x400000+0x197b69) [0x597b69]
[ 75704.563] (EE) 2: /usr/lib/libc.so.6 (0x7f6a4e2a0000+0x33df0) [0x7f6a4e2d3df0]
[ 75704.563] (EE) 3: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7f6a49d11000+0x1069b3) [0x7f6a49e179b3]
[ 75704.563] (EE) 4: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7f6a49d11000+0x109d11) [0x7f6a49e1ad11]
[ 75704.563] (EE) 5: /usr/bin/Xorg.bin (DRI2SwapBuffers+0x1c8) [0x5646f8]
[ 75704.563] (EE) 6: /usr/bin/Xorg.bin (0x400000+0x165f6c) [0x565f6c]
[ 75704.563] (EE) 7: /usr/bin/Xorg.bin (0x400000+0x376d7) [0x4376d7]
[ 75704.563] (EE) 8: /usr/bin/Xorg.bin (0x400000+0x3b866) [0x43b866]
[ 75704.563] (EE) 9: /usr/lib/libc.so.6 (__libc_start_main+0xf0) [0x7f6a4e2c0000]
[ 75704.563] (EE) 10: /usr/bin/Xorg.bin (0x400000+0x25d0e) [0x425d0e]
[ 75704.563] (EE)
[ 75704.563] (EE) Segmentation fault at address 0x200000070

This time it was when logging in. X was already running but compton and a bunch of other things would have been starting up.
Comment by persson (persson) - Wednesday, 20 August 2014, 08:42 GMT
I get the same error, random X crash after a variable amount of time.

[ 39119.990] (EE)
[ 39120.024] (EE) Backtrace:
[ 39120.153] (EE) 0: /usr/bin/Xorg.bin (xorg_backtrace+0x56) [0x593966]
[ 39120.153] (EE) 1: /usr/bin/Xorg.bin (0x400000+0x197b69) [0x597b69]
[ 39120.153] (EE) 2: /usr/lib/libc.so.6 (0x7f66f46e0000+0x33df0) [0x7f66f4713df0]
[ 39120.153] (EE) 3: /usr/lib/libc.so.6 (0x7f66f46e0000+0x7af28) [0x7f66f475af28]
[ 39120.153] (EE) 4: /usr/lib/libc.so.6 (__libc_malloc+0x60) [0x7f66f475d0c0]
[ 39120.164] (EE) 5: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7f66f0151000+0x8bcfb) [0x7f66f01dccfb]
[ 39120.164] (EE) 6: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7f66f0151000+0x8ee47) [0x7f66f01dfe47]
[ 39120.164] (EE) 7: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7f66f0151000+0x831e2) [0x7f66f01d41e2]
[ 39120.164] (EE) 8: /usr/bin/Xorg.bin (0x400000+0x110e7c) [0x510e7c]
[ 39120.165] (EE) 9: /usr/bin/Xorg.bin (0x400000+0x376d7) [0x4376d7]
[ 39120.165] (EE) 10: /usr/bin/Xorg.bin (0x400000+0x3b866) [0x43b866]
[ 39120.165] (EE) 11: /usr/lib/libc.so.6 (__libc_start_main+0xf0) [0x7f66f4700000]
[ 39120.165] (EE) 12: /usr/bin/Xorg.bin (0x400000+0x25d0e) [0x425d0e]
[ 39120.165] (EE)
[ 39120.165] (EE) Segmentation fault at address 0x10
[ 39120.165] (EE)
Fatal server error:
[ 39120.165] (EE) Caught signal 11 (Segmentation fault). Server aborting
[ 39120.165] (EE)
[ 39120.166] (EE)
Please consult the The X.Org Foundation support
at http://wiki.x.org
for help.
[ 39120.166] (EE) Please also check the log file at "/home/persson/.local/share/xorg/Xorg.0.log" for additional information.
[ 39120.166] (EE)
[ 39120.167] (II) AIGLX: Suspending AIGLX clients for VT switch
[ 47584.990] (EE)
[ 47584.992] (EE) Backtrace:
[ 47584.995] (EE) 0: /usr/bin/Xorg.bin (xorg_backtrace+0x56) [0x593966]
[ 47584.998] (EE) 1: /usr/bin/Xorg.bin (0x400000+0x197b69) [0x597b69]
[ 47585.000] (EE) 2: /usr/lib/libc.so.6 (0x7f66f46e0000+0x33df0) [0x7f66f4713df0]
[ 47585.003] (EE) 3: /usr/lib/libc.so.6 (0x7f66f46e0000+0xf5e0b) [0x7f66f47d5e0b]
[ 47585.006] (EE) 4: /usr/lib/libc.so.6 (0x7f66f46e0000+0x7f592) [0x7f66f475f592]
[ 47585.008] (EE) 5: /usr/lib/libc.so.6 (__libc_malloc+0x55) [0x7f66f475d0b5]
[ 47585.011] (EE) 6: /usr/lib/libdbus-1.so.3 (0x7f66f6181000+0x27012) [0x7f66f61a8012]
[ 47585.013] (EE) 7: /usr/lib/libdbus-1.so.3 (0x7f66f6181000+0x1f39a) [0x7f66f61a039a]
[ 47585.016] (EE) 8: /usr/lib/libdbus-1.so.3 (dbus_connection_send_with_reply+0x8f) [0x7f66f61900ef]
[ 47585.018] (EE) 9: /usr/lib/libdbus-1.so.3 (dbus_connection_send_with_reply_and_block+0xb0) [0x7f66f61903f0]
[ 47585.020] (EE) 10: /usr/bin/Xorg.bin (0x400000+0xa19dd) [0x4a19dd]
[ 47585.023] (EE) 11: /usr/bin/Xorg.bin (ddxGiveUp+0x6a) [0x47847a]
[ 47585.025] (EE) 12: /usr/bin/Xorg.bin (0x400000+0x19fc42) [0x59fc42]
[ 47585.027] (EE) 13: /usr/bin/Xorg.bin (0x400000+0x1a0aad) [0x5a0aad]
[ 47585.030] (EE) 14: /usr/bin/Xorg.bin (0x400000+0x197bbc) [0x597bbc]
[ 47585.032] (EE) 15: /usr/lib/libc.so.6 (0x7f66f46e0000+0x33df0) [0x7f66f4713df0]
[ 47585.034] (EE) 16: /usr/lib/libc.so.6 (0x7f66f46e0000+0x7af28) [0x7f66f475af28]
[ 47585.036] (EE) 17: /usr/lib/libc.so.6 (__libc_malloc+0x60) [0x7f66f475d0c0]
[ 47585.038] (EE) 18: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7f66f0151000+0x8bcfb) [0x7f66f01dccfb]
[ 47585.040] (EE) 19: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7f66f0151000+0x8ee47) [0x7f66f01dfe47]
[ 47585.042] (EE) 20: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7f66f0151000+0x831e2) [0x7f66f01d41e2]
[ 47585.044] (EE) 21: /usr/bin/Xorg.bin (0x400000+0x110e7c) [0x510e7c]
[ 47585.046] (EE) 22: /usr/bin/Xorg.bin (0x400000+0x376d7) [0x4376d7]
[ 47585.048] (EE) 23: /usr/bin/Xorg.bin (0x400000+0x3b866) [0x43b866]
[ 47585.050] (EE) 24: /usr/lib/libc.so.6 (__libc_start_main+0xf0) [0x7f66f4700000]
[ 47585.052] (EE) 25: /usr/bin/Xorg.bin (0x400000+0x25d0e) [0x425d0e]
[ 47585.054] (EE)
[ 47585.056] (EE) Received signal 3 sent by process 12364, uid 1000
[ 47585.058] (EE)
FatalError re-entered, aborting
[ 47585.062] (EE) Caught signal 3 (Quit). Server aborting
[ 47585.063] (EE)

Machine is a Dell Vostro 260s, grahpic card is Intel:

00:02.0 VGA compatible controller: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller (rev 09) (prog-if 00 [VGA controller])
Subsystem: Dell Device 04ee
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
Latency: 0
Interrupt: pin A routed to IRQ 45
Region 0: Memory at fe000000 (64-bit, non-prefetchable) [size=4M]
Region 2: Memory at c0000000 (64-bit, prefetchable) [size=256M]
Region 4: I/O ports at f000 [size=64]
Expansion ROM at <unassigned> [disabled]
Capabilities: [90] MSI: Enable+ Count=1/1 Maskable- 64bit-
Address: fee0f00c Data: 4122
Capabilities: [d0] Power Management version 2
Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
Capabilities: [a4] PCI Advanced Features
AFCap: TP+ FLR+
AFCtrl: FLR-
AFStatus: TP-
Kernel driver in use: i915
Kernel modules: i915
Comment by Kevin Brodsky (Corax) - Wednesday, 20 August 2014, 19:52 GMT
Interesting, your backtrace shows that if fails when calling malloc...

I've had one or two more crashes recently with similar backtraces as the one I posted (none since I've installed 3.16 though, we'll see how it develops). For what it's worth, it never happens when I'm using my computer, it either happens when I'm going out of hibernation or when the screensaver is on. I'm surprised nobody has a single clue about this problem!
Comment by persson (persson) - Wednesday, 20 August 2014, 21:28 GMT
Yes, same here! As long as I'm working and doing stuff, everything is fine. If I leave it (xscreensaver starts by itself after a few minutes), chances are very high that it crashes within 3 or 4 hours.

Unfortunately I am on 3.16 already:

core/linux 3.16.1-1 (base) [installed]

And xorg 1.16:

extra/xorg-server 1.16.0-6 (xorg) [installed]
extra/xorg-server-common 1.16.0-6 [installed]
Comment by Glyn Kennington (tinglyneurons) - Wednesday, 20 August 2014, 22:24 GMT
I'm getting this with a different stack trace each time, but it's reliably happening after primusrun finishes.

Intel(R) HD Graphics 4600/NVIDIA GPU GeForce GT 750M (GK107)

linux 3.16.1-1
nvidia 340.32-1
xorg-server 1.16.0-6
xorg-server-common 1.16.0-6
xf86-video-intel 2.99.914-4



[ 2644.722] (EE) Backtrace:
[ 2644.976] (EE) 0: /usr/bin/Xorg.bin (xorg_backtrace+0x56) [0x593966]
[ 2644.976] (EE) 1: /usr/bin/Xorg.bin (0x400000+0x197b69) [0x597b69]
[ 2644.976] (EE) 2: /usr/lib/libc.so.6 (0x7faaddfea000+0x33df0) [0x7faade01ddf0]
[ 2644.976] (EE) 3: /usr/lib/libc.so.6 (0x7faaddfea000+0x7b4f3) [0x7faade0654f3]
[ 2644.976] (EE) 4: /usr/lib/libc.so.6 (__libc_malloc+0x60) [0x7faade0670c0]
[ 2644.976] (EE) 5: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7faad9a5b000+0x228de) [0x7faad9a7d8de]
[ 2644.976] (EE) 6: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7faad9a5b000+0x262d0) [0x7faad9a812d0]
[ 2644.976] (EE) 7: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7faad9a5b000+0x2a804) [0x7faad9a85804]
[ 2644.976] (EE) 8: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7faad9a5b000+0xdd12d) [0x7faad9b3812d]
[ 2644.976] (EE) 9: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7faad9a5b000+0x48c8c) [0x7faad9aa3c8c]
[ 2644.976] (EE) 10: /usr/lib/xorg/modules/drivers/intel_drv.so (0x7faad9a5b000+0x4acf4) [0x7faad9aa5cf4]
[ 2644.976] (EE) 11: /usr/bin/Xorg.bin (0x400000+0x11b4d8) [0x51b4d8]
[ 2644.976] (EE) 12: /usr/bin/Xorg.bin (doPolyText+0x2c3) [0x43a053]
[ 2644.976] (EE) 13: /usr/bin/Xorg.bin (PolyText+0x59) [0x43ac79]
[ 2644.977] (EE) 14: /usr/bin/Xorg.bin (0x400000+0x34b01) [0x434b01]
[ 2644.977] (EE) 15: /usr/bin/Xorg.bin (0x400000+0x376d7) [0x4376d7]
[ 2644.977] (EE) 16: /usr/bin/Xorg.bin (0x400000+0x3b866) [0x43b866]
[ 2644.977] (EE) 17: /usr/lib/libc.so.6 (__libc_start_main+0xf0) [0x7faade00a000]
[ 2644.977] (EE) 18: /usr/bin/Xorg.bin (0x400000+0x25d0e) [0x425d0e]
[ 2644.977] (EE)
[ 2644.977] (EE) Segmentation fault at address 0x18


[ 101.257] (EE) Backtrace:
[ 101.279] (EE) 0: /usr/bin/Xorg.bin (xorg_backtrace+0x56) [0x593966]
[ 101.279] (EE) 1: /usr/bin/Xorg.bin (0x400000+0x197b69) [0x597b69]
[ 101.279] (EE) 2: /usr/lib/libc.so.6 (0x7f6bf7e40000+0x33df0) [0x7f6bf7e73df0]
[ 101.279] (EE) 3: /usr/lib/libc.so.6 (0x7f6bf7e40000+0x799bd) [0x7f6bf7eb99bd]
[ 101.279] (EE) 4: /usr/lib/libc.so.6 (0x7f6bf7e40000+0x7ae14) [0x7f6bf7ebae14]
[ 101.279] (EE) 5: /usr/lib/libc.so.6 (__libc_calloc+0xc7) [0x7f6bf7ebdac7]
[ 101.279] (EE) 6: /usr/bin/Xorg.bin (0x400000+0x6085d) [0x46085d]
[ 101.279] (EE) 7: /usr/bin/Xorg.bin (CloseDownClient+0x68) [0x436ba8]
[ 101.279] (EE) 8: /usr/bin/Xorg.bin (0x400000+0x3770e) [0x43770e]
[ 101.279] (EE) 9: /usr/bin/Xorg.bin (0x400000+0x3b866) [0x43b866]
[ 101.279] (EE) 10: /usr/lib/libc.so.6 (__libc_start_main+0xf0) [0x7f6bf7e60000]
[ 101.279] (EE) 11: /usr/bin/Xorg.bin (0x400000+0x25d0e) [0x425d0e]
[ 101.279] (EE)
[ 101.279] (EE) Segmentation fault at address 0x18


Comment by John Lindgren (jlindgren) - Saturday, 23 August 2014, 03:53 GMT Comment by John Lindgren (jlindgren) - Saturday, 23 August 2014, 03:58 GMT
As a workaround, disable SNA acceleration and use UXA instead:
https://wiki.archlinux.org/index.php/Intel_graphics#SNA_issues
Comment by John Lindgren (jlindgren) - Saturday, 23 August 2014, 14:43 GMT
Upstream has a patch which fixes the problem for me:
https://bugs.freedesktop.org/attachment.cgi?id=105160
Comment by Kevin Brodsky (Corax) - Sunday, 24 August 2014, 23:04 GMT
Thanks John for reporting the issue upstream! For now I'll use the UXA workaround, I'll wait for the next release of xorg to get the actual fix.
Comment by Laurent Carlier (lordheavy) - Monday, 08 September 2014, 12:35 GMT
please test with xf86-video-intel-2.99.916-1 in testing
Comment by Kevin Brodsky (Corax) - Thursday, 11 September 2014, 12:50 GMT
2.99.916 is now in extra, I installed it today and I switched back to SNA. I've had no segfault with the UXA workaround. I'll test the new version for a week, after that I think we can consider the issue resolved.
Comment by John Lindgren (jlindgren) - Saturday, 13 September 2014, 17:33 GMT
2.99.916-1 seems good here.

Loading...