FS#46462 - [linux] [drm:intel_dp_start_link_train [i915]] *ERROR* too many full retries, give up

Attached to Project: Arch Linux
Opened by Hatem Masmoudi (hmasmoudi) - Monday, 28 September 2015, 14:53 GMT
Last edited by Jan de Groot (JGC) - Tuesday, 03 October 2017, 11:40 GMT
Task Type Bug Report
Category Kernel
Status Closed
Assigned To Tobias Powalowski (tpowa)
Architecture x86_64
Severity High
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 10
Private No

Details

Description:
Impossible to use an external monitor.

Additional info:
* package version(s)
* config and/or log files etc.


Steps to reproduce:
- Connect and external monitor.
- [drm:intel_dp_start_link_train [i915]] *ERROR* too many full retries, give up
This task depends upon

Closed by  Jan de Groot (JGC)
Tuesday, 03 October 2017, 11:40 GMT
Reason for closing:  Fixed
Additional comments about closing:  2017-10-02: A task closure has been requested. Reason for request: upstream fixed + cannot reproduce it
Comment by Hatem Masmoudi (hmasmoudi) - Monday, 28 September 2015, 14:54 GMT
Attached the dmesg output after connecting and disconnect the monitor.
Comment by Hatem Masmoudi (hmasmoudi) - Monday, 28 September 2015, 14:54 GMT
Hi,

Please note that he same issue is present in other distribution: (FC-23: https://bugzilla.redhat.com/show_bug.cgi?id=1265606).
Comment by - (Lupus) - Monday, 28 September 2015, 20:03 GMT
I'm having the exact same issue with my external monitors connected to the docking station. Activating any of my three connected monitors with xrandr will lead to this. This only occurs while I'm on the 4.2.1-1 kernel from the arch repo. Downgrading the kernel seems to fix it for the time beeing.
Comment by Hatem Masmoudi (hmasmoudi) - Monday, 28 September 2015, 20:29 GMT
Hi,
Please note that this issue happened only after the update to the new kernel version 4.2.1.
Comment by Hatem Masmoudi (hmasmoudi) - Wednesday, 30 September 2015, 12:25 GMT
Hi,
I tried the linux 4.2.2-1 version and the external screen is now displayed.
But I got the following logs from dmesg:
[ 48.082740] gnome-shell[724]: segfault at 0 ip 00007fd89a7a4a22 sp 00007ffcddccff50 error 4 in libmutter.so.0.0.0[7fd89a756000+ee000]
[ 48.714424] [drm:check_crtc_state [i915]] *ERROR* mismatch in ips_enabled (expected 1, found 0)
[ 48.714429] ------------[ cut here ]------------
[ 48.714441] WARNING: CPU: 1 PID: 655 at drivers/gpu/drm/i915/intel_display.c:12324 check_crtc_state+0x8df/0xf80 [i915]()
[ 48.714443] pipe state doesn't match!
[ 48.714444] Modules linked in: fuse uvcvideo videobuf2_vmalloc videobuf2_memops btusb btrtl btbcm btintel videobuf2_core v4l2_common videodev bluetooth media joydev mousedev iTCO_wdt iTCO_vendor_support dell_wmi snd_hda_codec_realtek snd_hda_codec_hdmi sparse_keymap ppdev snd_hda_codec_generic snd_hda_intel snd_hda_codec dell_laptop dcdbas dell_smm_hwmon snd_hda_core arc4 intel_rapl i915 iosf_mbi x86_pkg_temp_thermal intel_powerclamp drm_kms_helper evdev input_leds mac_hid coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul snd_hwdep glue_helper snd_pcm ablk_helper iwlmvm cryptd mac80211 iwlwifi cfg80211 serio_raw psmouse pcspkr snd_timer drm e1000e snd dell_rbtn battery rfkill mei_me mei thermal wmi parport_pc parport intel_gtt
[ 48.714496] ac button shpchp video ptp soundcore lpc_ich i2c_i801 i2c_algo_bit pps_core processor dell_smo8800 sch_fq_codel ip_tables x_tables ext4 crc16 mbcache jbd2 sr_mod cdrom sd_mod hid_generic usbhid hid atkbd libps2 ahci libahci libata xhci_pci ehci_pci sdhci_pci xhci_hcd ehci_hcd sdhci scsi_mod led_class mmc_core usbcore usb_common i8042 serio
[ 48.714527] CPU: 1 PID: 655 Comm: Xorg Not tainted 4.2.2-1-ARCH #1
[ 48.714529] Hardware name: Dell Inc. Latitude E5540/077XV9, BIOS A12 05/19/2015
[ 48.714530] 0000000000000000 000000001dd8bea8 ffff8800cb003788 ffffffff8156b9da
[ 48.714534] 0000000000000000 ffff8800cb0037e0 ffff8800cb0037c8 ffffffff81074886
[ 48.714537] 00000001cb0037d8 ffff880214c7b350 ffff8800dbac1000 ffff880214c7b000
[ 48.714540] Call Trace:
[ 48.714546] [<ffffffff8156b9da>] dump_stack+0x4c/0x6e
[ 48.714550] [<ffffffff81074886>] warn_slowpath_common+0x86/0xc0
[ 48.714553] [<ffffffff81074915>] warn_slowpath_fmt+0x55/0x70
[ 48.714564] [<ffffffffa0827f7f>] check_crtc_state+0x8df/0xf80 [i915]
[ 48.714571] [<ffffffffa036e4ba>] ? drm_calc_vbltimestamp_from_scanoutpos+0x17a/0x330 [drm]
[ 48.714585] [<ffffffffa083a9f6>] intel_modeset_check_state+0x216/0xb50 [i915]
[ 48.714596] [<ffffffffa083500c>] ? __intel_set_mode+0x92c/0xb60 [i915]
[ 48.714607] [<ffffffffa083c037>] intel_crtc_set_config+0x4c7/0x580 [i915]
[ 48.714615] [<ffffffffa03779f6>] drm_mode_set_config_internal+0x66/0x100 [drm]
[ 48.714623] [<ffffffffa037c809>] drm_mode_setcrtc+0x3e9/0x500 [drm]
[ 48.714629] [<ffffffffa036d575>] drm_ioctl+0x125/0x610 [drm]
[ 48.714635] [<ffffffffa037c420>] ? drm_mode_setplane+0x1b0/0x1b0 [drm]
[ 48.714639] [<ffffffff811e29e5>] do_vfs_ioctl+0x295/0x480
[ 48.714642] [<ffffffff811ecad7>] ? __fget+0x77/0xb0
[ 48.714645] [<ffffffff811e2c49>] SyS_ioctl+0x79/0x90
[ 48.714647] [<ffffffff811e4e21>] ? SyS_poll+0x71/0x130
[ 48.714651] [<ffffffff81570f6e>] entry_SYSCALL_64_fastpath+0x12/0x71
[ 48.714653] ---[ end trace 9b5e97fb8c723bad ]---
[ 49.461435] [drm:check_crtc_state [i915]] *ERROR* mismatch in ips_enabled (expected 1, found 0)
[ 49.461438] ------------[ cut here ]------------
[ 49.461447] WARNING: CPU: 2 PID: 655 at drivers/gpu/drm/i915/intel_display.c:12324 check_crtc_state+0x8df/0xf80 [i915]()
[ 49.461448] pipe state doesn't match!
[ 49.461449] Modules linked in: fuse uvcvideo videobuf2_vmalloc videobuf2_memops btusb btrtl btbcm btintel videobuf2_core v4l2_common videodev bluetooth media joydev mousedev iTCO_wdt iTCO_vendor_support dell_wmi snd_hda_codec_realtek snd_hda_codec_hdmi sparse_keymap ppdev snd_hda_codec_generic snd_hda_intel snd_hda_codec dell_laptop dcdbas dell_smm_hwmon snd_hda_core arc4 intel_rapl i915 iosf_mbi x86_pkg_temp_thermal intel_powerclamp drm_kms_helper evdev input_leds mac_hid coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul snd_hwdep glue_helper snd_pcm ablk_helper iwlmvm cryptd mac80211 iwlwifi cfg80211 serio_raw psmouse pcspkr snd_timer drm e1000e snd dell_rbtn battery rfkill mei_me mei thermal wmi parport_pc parport intel_gtt
[ 49.461486] ac button shpchp video ptp soundcore lpc_ich i2c_i801 i2c_algo_bit pps_core processor dell_smo8800 sch_fq_codel ip_tables x_tables ext4 crc16 mbcache jbd2 sr_mod cdrom sd_mod hid_generic usbhid hid atkbd libps2 ahci libahci libata xhci_pci ehci_pci sdhci_pci xhci_hcd ehci_hcd sdhci scsi_mod led_class mmc_core usbcore usb_common i8042 serio
[ 49.461508] CPU: 2 PID: 655 Comm: Xorg Tainted: G W 4.2.2-1-ARCH #1
[ 49.461509] Hardware name: Dell Inc. Latitude E5540/077XV9, BIOS A12 05/19/2015
[ 49.461510] 0000000000000000 000000001dd8bea8 ffff8800cb003788 ffffffff8156b9da
[ 49.461512] 0000000000000000 ffff8800cb0037e0 ffff8800cb0037c8 ffffffff81074886
[ 49.461514] 00000001cb0037d8 ffff880214c7b350 ffff8800dbac1000 ffff880214c7b000
[ 49.461516] Call Trace:
[ 49.461522] [<ffffffff8156b9da>] dump_stack+0x4c/0x6e
[ 49.461526] [<ffffffff81074886>] warn_slowpath_common+0x86/0xc0
[ 49.461528] [<ffffffff81074915>] warn_slowpath_fmt+0x55/0x70
[ 49.461536] [<ffffffffa0827f7f>] check_crtc_state+0x8df/0xf80 [i915]
[ 49.461543] [<ffffffffa036e4ba>] ? drm_calc_vbltimestamp_from_scanoutpos+0x17a/0x330 [drm]
[ 49.461553] [<ffffffffa083a9f6>] intel_modeset_check_state+0x216/0xb50 [i915]
[ 49.461561] [<ffffffffa083500c>] ? __intel_set_mode+0x92c/0xb60 [i915]
[ 49.461570] [<ffffffffa083c037>] intel_crtc_set_config+0x4c7/0x580 [i915]
[ 49.461576] [<ffffffffa03779f6>] drm_mode_set_config_internal+0x66/0x100 [drm]
[ 49.461582] [<ffffffffa037c809>] drm_mode_setcrtc+0x3e9/0x500 [drm]
[ 49.461586] [<ffffffffa036d575>] drm_ioctl+0x125/0x610 [drm]
[ 49.461591] [<ffffffffa037c420>] ? drm_mode_setplane+0x1b0/0x1b0 [drm]
[ 49.461594] [<ffffffff811e29e5>] do_vfs_ioctl+0x295/0x480
[ 49.461596] [<ffffffff811ecad7>] ? __fget+0x77/0xb0
[ 49.461598] [<ffffffff811e2c49>] SyS_ioctl+0x79/0x90
[ 49.461600] [<ffffffff8108300e>] ? SyS_rt_sigprocmask+0x8e/0xc0
[ 49.461603] [<ffffffff81570f6e>] entry_SYSCALL_64_fastpath+0x12/0x71
[ 49.461605] ---[ end trace 9b5e97fb8c723bae ]---
Comment by Simon Perry (pezz) - Wednesday, 30 September 2015, 13:57 GMT
Hey Hatem,

I don't think this is the same bug we're seeing.

Nevertheless, the 4.2 i915 driver does seemed stuffed.
Comment by Simon Perry (pezz) - Wednesday, 30 September 2015, 14:01 GMT
Sorry.

My comment was based on what you pasted, not your bug.log
Comment by Gabriel Maduro (gmp042) - Thursday, 01 October 2015, 12:33 GMT
I'm having the same issue with kernel 4.2.1-1.
Comment by Sree Harsha Totakura (tsh) - Thursday, 01 October 2015, 16:29 GMT
Are you able to use the external monitor if you switch to virtual consoles and switch back to X? i.e pressing Ctrl+Alt+F1 should bring the console on both the primary and external monitor. If it doesn't happen on the first try, try switching to another virtual console (Ctrl+Alt+F2). Once the console shows up, switch back to X (Ctrl+Alt+F7). This currently works for me.

This bug is similar to https://bugs.freedesktop.org/show_bug.cgi?id=89589 except that this bug is also triggered when X is locked.
Comment by Gabriel Maduro (gmp042) - Friday, 02 October 2015, 12:28 GMT
With kernel 4.2.2-1 my external monitor is working again. However, i still have this error:

[drm:check_crtc_state [i915]] *ERROR* mismatch in ips_enabled (expected 1, found 0)
Comment by Swen Thümmler (grmblfrz) - Friday, 02 October 2015, 12:53 GMT
kernel 4.2.2-1 fixed the problem for me.
Comment by - (Lupus) - Friday, 02 October 2015, 17:20 GMT
The 4.2.2-1 kernel fixed the problem for me too. No additional error messages or warning, everything is workthing smoothly as ever.
Comment by Sunny Wong (ArchDoctor_needed) - Thursday, 08 October 2015, 17:38 GMT
  • Field changed: Percent Complete (100% → 0%)
Hello,
I wanted to ask, if the topic could be reopened. The update of one these packages results in the same problems (probably xf86-video-intel)
* wayland
* mesa
* xf86-video-intel
* mesa-libgl
Comment by Sunny Wong (ArchDoctor_needed) - Monday, 12 October 2015, 12:39 GMT
The new package (I believe xf86-video-intel 1:2.99.917+478+gdf72bc5-2) fixed it. Thanks a lot !
Comment by Sunny Wong (ArchDoctor_needed) - Tuesday, 20 October 2015, 07:32 GMT
The bug ist back again:

Nothing is displayed on my second monitor. Also it is possible to drag something into it. Opening gnome display settings results in a white error screen, where i have to relogin. This results into a complete freeze of the system.

Here is an extract from my log:

Oct 20 09:22:00 <Host> colord[994]: /usr/lib/colord/colord-sane: error while loading shared libraries: libsane.so.1: cannot open shared object file: No such file or director
Oct 20 09:22:00 <Host> kernel: [drm:intel_dp_complete_link_train [i915]] *ERROR* failed to train DP, aborting
Oct 20 09:22:00 <Host> avahi-daemon[952]: Server startup complete. Host name is <host>.local. Local service cookie is 218317807.
Oct 20 09:22:01 <Host> kernel: ------------[ cut here ]------------
Oct 20 09:22:01 <Host> kernel: WARNING: CPU: 0 PID: 848 at drivers/gpu/drm/i915/i915_gem.c:5269 i915_gem_track_fb+0x129/0x140 [i915]()
Oct 20 09:22:01 <Host> kernel: WARN_ON(!(old->frontbuffer_bits & frontbuffer_bits))
Oct 20 09:22:01 <Host> kernel: Modules linked in:
Oct 20 09:22:01 <Host> kernel: cmac ecb fuse xt_conntrack ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable
Oct 20 09:22:01 <Host> kernel: parport_pc snd_pcm thermal wmi parport e1000e battery snd_timer drm tpm hp_accel hp_wireless snd video led_class mei_me ac intel_gtt button p
Oct 20 09:22:01 <Host> kernel: CPU: 0 PID: 848 Comm: gnome-shell Tainted: G W 4.2.3-1-ARCH #1
Oct 20 09:22:01 <Host> kernel: Hardware name: Hewlett-Packard HP EliteBook 840 G2/2216, BIOS M71 Ver. 01.05 03/26/2015
Oct 20 09:22:01 <Host> kernel: 0000000000000000 000000009155a9f0 ffff88042ff0fa28 ffffffff8156c0ca
Oct 20 09:22:01 <Host> kernel: 0000000000000000 ffff88042ff0fa80 ffff88042ff0fa68 ffffffff81074886
Oct 20 09:22:01 <Host> kernel: ffff880439580000 0000000000000001 ffff88042cc6ef40 ffff8800a44e1680
Oct 20 09:22:01 <Host> kernel: Call Trace:
Oct 20 09:22:01 <Host> kernel: [<ffffffff8156c0ca>] dump_stack+0x4c/0x6e
Oct 20 09:22:01 <Host> kernel: [<ffffffff81074886>] warn_slowpath_common+0x86/0xc0
Oct 20 09:22:01 <Host> kernel: [<ffffffff81074915>] warn_slowpath_fmt+0x55/0x70
Oct 20 09:22:01 <Host> kernel: [<ffffffffa06bec19>] i915_gem_track_fb+0x129/0x140 [i915]
Oct 20 09:22:01 <Host> kernel: [<ffffffffa0706477>] intel_prepare_plane_fb+0xe7/0x1a0 [i915]
Oct 20 09:22:01 <Host> kernel: [<ffffffffa0629e39>] drm_atomic_helper_prepare_planes+0x59/0xe0 [drm_kms_helper]
Oct 20 09:22:01 <Host> kernel: [<ffffffffa06fe88e>] __intel_set_mode+0x1ae/0xb60 [i915]
Oct 20 09:22:01 <Host> kernel: [<ffffffffa070050f>] ? intel_modeset_compute_config+0x3af/0xb60 [i915]
Oct 20 09:22:01 <Host> kernel: [<ffffffffa0705e26>] intel_crtc_set_config+0x2b6/0x580 [i915]
Oct 20 09:22:01 <Host> kernel: [<ffffffffa04c79f6>] drm_mode_set_config_internal+0x66/0x100 [drm]
Oct 20 09:22:01 <Host> kernel: [<ffffffffa04cc809>] drm_mode_setcrtc+0x3e9/0x500 [drm]
Oct 20 09:22:01 <Host> kernel: [<ffffffffa04bd575>] drm_ioctl+0x125/0x610 [drm]
Oct 20 09:22:01 <Host> kernel: [<ffffffff810d6527>] ? call_rcu+0x17/0x20
Oct 20 09:22:01 <Host> kernel: [<ffffffffa04cc420>] ? drm_mode_setplane+0x1b0/0x1b0 [drm]
Oct 20 09:22:01 <Host> kernel: [<ffffffff811e6c41>] ? __dentry_kill+0x151/0x1e0
Oct 20 09:22:01 <Host> kernel: [<ffffffff811e29e5>] do_vfs_ioctl+0x295/0x480
Oct 20 09:22:01 <Host> kernel: [<ffffffff811ef8c4>] ? mntput+0x24/0x40
Oct 20 09:22:01 <Host> kernel: [<ffffffff811ecad7>] ? __fget+0x77/0xb0
Oct 20 09:22:01 <Host> kernel: [<ffffffff811e2c49>] SyS_ioctl+0x79/0x90
Oct 20 09:22:01 <Host> kernel: [<ffffffff8157162e>] entry_SYSCALL_64_fastpath+0x12/0x71
Oct 20 09:22:01 <Host> kernel: ---[ end trace da1552dd9f39d375 ]---
Oct 20 09:22:01 <Host> kernel: [drm:intel_dp_complete_link_train [i915]] *ERROR* failed to train DP, aborting
Oct 20 09:22:01 <Host> kernel: [drm:intel_dp_start_link_train [i915]] *ERROR* too many voltage retries, give up
Oct 20 09:22:01 <Host> kernel: [drm:intel_dp_complete_link_train [i915]] *ERROR* failed to train DP, aborting
Oct 20 09:22:01 <Host> avahi-daemon[952]: Service "<host>" (/services/ssh.service) successfully established.
Oct 20 09:22:01 <Host> avahi-daemon[952]: Service "<host>" (/services/sftp-ssh.service) successfully established.
Oct 20 09:22:02 <Host> kernel: [drm:intel_dp_complete_link_train [i915]] *ERROR* failed to train DP, aborting
Oct 20 09:22:02 <Host> kernel: [drm:intel_dp_complete_link_train [i915]] *ERROR* failed to train DP, aborting
Comment by Alex Forencich (alex.forencich) - Thursday, 05 November 2015, 04:42 GMT Comment by Sunny Wong (ArchDoctor_needed) - Thursday, 05 November 2015, 09:53 GMT
A short update on this matter.
At the moment it works with a second monitor, but highly instable.

At boot everything works, but releasing the laptop from the docking station or putting it back in, results in problems.
The latter one results constantly in gnome crashing with a white screen and the message that something went wrong and I have to login again (Relogin results in a freeze of the OS).
Also a constantly message that I still have in my journal log is:
> Nov 05 10:08:44 <hostname> kernel: [drm:gen8_irq_handler [i915]] *ERROR* The master control interrupt lied (SDE)!

Also opening the Gnome Desktop setting from the Settings menu results in a black screen for a short while.

Furthermore sometimes the screen just stop working and even after a reboot the matter persists. Just after several reboots it works again. But I can't reproduce what causes this. (I think it is related to pacman, e.g. when I update some libraries through it)

I hope it helps.
Thanks for looking into this!

Loading...