FS#42475 - [linux] [nvidia] fails to resume from suspend

Attached to Project: Arch Linux
Opened by Artur Jaroschek (arturj) - Tuesday, 21 October 2014, 11:44 GMT
Last edited by Sven-Hendrik Haase (Svenstaro) - Sunday, 10 May 2015, 16:54 GMT
Task Type Bug Report
Category System
Status Closed
Assigned To Tobias Powalowski (tpowa)
Thomas Bächler (brain0)
Ionut Biru (wonder)
Sven-Hendrik Haase (Svenstaro)
Architecture x86_64
Severity High
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 4
Private No

Details

Description:
after last update few days ago which pulled e.g. gnome 3.14, kernel 3.16 as well as new nvidia driver suspend2ram/resume does not work anymore (i.e. PC resumes, but with strange graphical errors along the borders of the screen and some applications like firefox beeing frozen). Previous update was about 4 weeks ago and all was fine. Meanwhile (4 weeks ago and now) something changed it to bad.

I already tried to install linux-lts along with nvidia-lts -> same
Also installed legacy nvidia driver package "nvidia-304xx" -> same

My system:
Intel i5
Nvidia 660

I also have a eeepc (intel atom with intel onboard graphics) which works very well even after the same update procedure.

Additional info:
* package version(s)
* config and/or log files etc.


Steps to reproduce:
do full update
suspend then resume pc
This task depends upon

Closed by  Sven-Hendrik Haase (Svenstaro)
Sunday, 10 May 2015, 16:54 GMT
Reason for closing:  Fixed
Comment by Artur Jaroschek (arturj) - Tuesday, 21 October 2014, 21:17 GMT
Here is my dmesg after a suspend/resume cycle:

[ 160.341696] PM: Syncing filesystems ... done.
[ 160.522703] PM: Preparing system for mem sleep
[ 161.799734] Freezing user space processes ... (elapsed 0.003 seconds) done.
[ 161.803107] Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
[ 161.804198] PM: Entering mem sleep
[ 161.804249] Suspending console(s) (use no_console_suspend to debug)
[ 161.806816] sd 5:0:0:0: [sdc] Synchronizing SCSI cache
[ 161.806857] sd 4:0:0:0: [sdb] Synchronizing SCSI cache
[ 161.806874] sd 2:0:0:0: [sda] Synchronizing SCSI cache
[ 161.807174] sd 5:0:0:0: [sdc] Stopping disk
[ 161.808483] sd 2:0:0:0: [sda] Stopping disk
[ 161.832996] serial 00:07: disabled
[ 161.833002] serial 00:07: System wakeup disabled by ACPI
[ 161.833022] i8042 kbd 00:06: System wakeup enabled by ACPI
[ 161.887079] sd 4:0:0:0: [sdb] Stopping disk
[ 162.510948] PM: suspend of devices complete after 707.290 msecs
[ 162.511426] PM: late suspend of devices complete after 0.475 msecs
[ 162.512047] r8169 0000:03:00.0: System wakeup enabled by ACPI
[ 162.512147] ehci-pci 0000:00:1d.0: System wakeup enabled by ACPI
[ 162.512356] xhci_hcd 0000:00:14.0: System wakeup enabled by ACPI
[ 162.512372] ehci-pci 0000:00:1a.0: System wakeup enabled by ACPI
[ 162.524279] PM: noirq suspend of devices complete after 12.864 msecs
[ 162.524662] ACPI: Preparing to enter system sleep state S3
[ 162.525137] PM: Saving platform NVS memory
[ 162.526283] Disabling non-boot CPUs ...
[ 162.526341] intel_pstate CPU 1 exiting
[ 162.527565] kvm: disabling virtualization on CPU1
[ 162.527583] smpboot: CPU 1 is now offline
[ 162.528000] intel_pstate CPU 2 exiting
[ 162.529214] kvm: disabling virtualization on CPU2
[ 162.630756] smpboot: CPU 2 is now offline
[ 162.631047] intel_pstate CPU 3 exiting
[ 162.632190] kvm: disabling virtualization on CPU3
[ 162.733989] smpboot: CPU 3 is now offline
[ 162.735427] ACPI: Low-level resume complete
[ 162.735462] PM: Restoring platform NVS memory
[ 162.735908] Enabling non-boot CPUs ...
[ 162.735952] x86: Booting SMP configuration:
[ 162.735952] smpboot: Booting Node 0 Processor 1 APIC 0x2
[ 162.747270] kvm: enabling virtualization on CPU1
[ 162.749463] Intel pstate controlling: cpu 1
[ 162.749502] CPU1 is up
[ 162.749514] smpboot: Booting Node 0 Processor 2 APIC 0x4
[ 162.760817] kvm: enabling virtualization on CPU2
[ 162.763012] Intel pstate controlling: cpu 2
[ 162.763044] CPU2 is up
[ 162.763055] smpboot: Booting Node 0 Processor 3 APIC 0x6
[ 162.774361] kvm: enabling virtualization on CPU3
[ 162.776561] Intel pstate controlling: cpu 3
[ 162.776594] CPU3 is up
[ 162.778925] ACPI: Waking up from system sleep state S3
[ 162.792331] xhci_hcd 0000:00:14.0: System wakeup disabled by ACPI
[ 162.792506] ehci-pci 0000:00:1d.0: System wakeup disabled by ACPI
[ 162.792661] ehci-pci 0000:00:1a.0: System wakeup disabled by ACPI
[ 162.792724] PM: noirq resume of devices complete after 13.503 msecs
[ 162.793012] PM: early resume of devices complete after 0.269 msecs
[ 162.793079] mei_me 0000:00:16.0: irq 45 for MSI/MSI-X
[ 162.793194] r8169 0000:03:00.0: System wakeup disabled by ACPI
[ 162.793392] i8042 kbd 00:06: System wakeup disabled by ACPI
[ 162.793460] snd_hda_intel 0000:00:1b.0: irq 46 for MSI/MSI-X
[ 162.794029] serial 00:07: activated
[ 162.817423] sd 5:0:0:0: [sdc] Starting disk
[ 162.817425] sd 2:0:0:0: [sda] Starting disk
[ 162.817425] sd 4:0:0:0: [sdb] Starting disk
[ 162.908528] r8169 0000:03:00.0 enp3s0: link down
[ 163.131843] ata10: SATA link down (SStatus 0 SControl 330)
[ 163.131880] ata9: SATA link down (SStatus 0 SControl 330)
[ 163.141813] ata7: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[ 163.144533] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20140424/psargs-359)
[ 163.144536] ACPI Error: Method parse/execution failed [\_SB_.PCI0.SAT0.SPT4._GTF] (Node ffff88021604c2f8), AE_NOT_FOUND (20140424/psparse-536)
[ 163.145142] ata3: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[ 163.148362] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20140424/psargs-359)
[ 163.148365] ACPI Error: Method parse/execution failed [\_SB_.PCI0.SAT0.SPT4._GTF] (Node ffff88021604c2f8), AE_NOT_FOUND (20140424/psparse-536)
[ 163.148368] ata7.00: configured for UDMA/100
[ 163.150864] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20140424/psargs-359)
[ 163.150867] ACPI Error: Method parse/execution failed [\_SB_.PCI0.SAT0.SPT0._GTF] (Node ffff88021604c118), AE_NOT_FOUND (20140424/psparse-536)
[ 163.160854] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20140424/psargs-359)
[ 163.160856] ACPI Error: Method parse/execution failed [\_SB_.PCI0.SAT0.SPT0._GTF] (Node ffff88021604c118), AE_NOT_FOUND (20140424/psparse-536)
[ 163.161625] ata3.00: configured for UDMA/133
[ 163.520403] PM: resume of devices complete after 728.096 msecs
[ 163.520529] PM: Finishing wakeup.
[ 163.520530] Restarting tasks ...
[ 163.520696] pci_bus 0000:05: Allocating resources
[ 163.520712] pci 0000:04:00.0: bridge window [io 0x1000-0x0fff] to [bus 05] add_size 1000
[ 163.520714] pci 0000:04:00.0: bridge window [mem 0x00100000-0x000fffff 64bit pref] to [bus 05] add_size 200000
[ 163.520716] pci 0000:04:00.0: bridge window [mem 0x00100000-0x000fffff] to [bus 05] add_size 200000
[ 163.520718] pci 0000:04:00.0: res[14]=[mem 0x00100000-0x000fffff] get_res_add_size add_size 200000
[ 163.520719] pci 0000:04:00.0: res[15]=[mem 0x00100000-0x000fffff 64bit pref] get_res_add_size add_size 200000
[ 163.520720] pci 0000:04:00.0: res[13]=[io 0x1000-0x0fff] get_res_add_size add_size 1000
[ 163.520725] pci 0000:04:00.0: BAR 14: assigned [mem 0xe0100000-0xe02fffff]
[ 163.520730] pci 0000:04:00.0: BAR 15: assigned [mem 0xe0300000-0xe04fffff 64bit pref]
[ 163.520733] pci 0000:04:00.0: BAR 13: assigned [io 0x2000-0x2fff]
[ 163.520735] pci 0000:04:00.0: PCI bridge to [bus 05]
[ 163.520738] pci 0000:04:00.0: bridge window [io 0x2000-0x2fff]
[ 163.520745] pci 0000:04:00.0: bridge window [mem 0xe0100000-0xe02fffff]
[ 163.520750] pci 0000:04:00.0: bridge window [mem 0xe0300000-0xe04fffff 64bit pref]
[ 163.525697] done.
[ 164.086052] IPv6: ADDRCONF(NETDEV_UP): enp3s0: link is not ready
[ 164.187024] r8169 0000:03:00.0 enp3s0: link down
[ 164.187082] IPv6: ADDRCONF(NETDEV_UP): enp3s0: link is not ready
[ 165.815481] r8169 0000:03:00.0 enp3s0: link up
[ 165.815491] IPv6: ADDRCONF(NETDEV_CHANGE): enp3s0: link becomes ready
[ 167.457645] ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[ 167.462648] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20140424/psargs-359)
[ 167.462656] ACPI Error: Method parse/execution failed [\_SB_.PCI0.SAT0.SPT3._GTF] (Node ffff88021604c280), AE_NOT_FOUND (20140424/psparse-536)
[ 167.464441] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20140424/psargs-359)
[ 167.464450] ACPI Error: Method parse/execution failed [\_SB_.PCI0.SAT0.SPT3._GTF] (Node ffff88021604c280), AE_NOT_FOUND (20140424/psparse-536)
[ 167.464781] ata6.00: configured for UDMA/133
[ 168.163607] ata5: link is slow to respond, please be patient (ready=0)
[ 171.307221] ata5: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[ 171.308334] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20140424/psargs-359)
[ 171.308342] ACPI Error: Method parse/execution failed [\_SB_.PCI0.SAT0.SPT2._GTF] (Node ffff88021604c208), AE_NOT_FOUND (20140424/psparse-536)
[ 171.309712] ACPI Error: [DSSP] Namespace lookup failure, AE_NOT_FOUND (20140424/psargs-359)
[ 171.309719] ACPI Error: Method parse/execution failed [\_SB_.PCI0.SAT0.SPT2._GTF] (Node ffff88021604c208), AE_NOT_FOUND (20140424/psparse-536)
[ 171.309900] ata5.00: configured for UDMA/133
Comment by Artur Jaroschek (arturj) - Tuesday, 21 October 2014, 21:18 GMT
Here is my lspci for a list of hardware components:

00:00.0 Host bridge: Intel Corporation Xeon E3-1200 v2/3rd Gen Core processor DRAM Controller (rev 09)
00:01.0 PCI bridge: Intel Corporation Xeon E3-1200 v2/3rd Gen Core processor PCI Express Root Port (rev 09)
00:14.0 USB controller: Intel Corporation 7 Series/C210 Series Chipset Family USB xHCI Host Controller (rev 04)
00:16.0 Communication controller: Intel Corporation 7 Series/C210 Series Chipset Family MEI Controller #1 (rev 04)
00:1a.0 USB controller: Intel Corporation 7 Series/C210 Series Chipset Family USB Enhanced Host Controller #2 (rev 04)
00:1b.0 Audio device: Intel Corporation 7 Series/C210 Series Chipset Family High Definition Audio Controller (rev 04)
00:1c.0 PCI bridge: Intel Corporation 7 Series/C210 Series Chipset Family PCI Express Root Port 1 (rev c4)
00:1c.4 PCI bridge: Intel Corporation 7 Series/C210 Series Chipset Family PCI Express Root Port 5 (rev c4)
00:1c.5 PCI bridge: Intel Corporation 82801 PCI Bridge (rev c4)
00:1c.6 PCI bridge: Intel Corporation 7 Series/C210 Series Chipset Family PCI Express Root Port 7 (rev c4)
00:1d.0 USB controller: Intel Corporation 7 Series/C210 Series Chipset Family USB Enhanced Host Controller #1 (rev 04)
00:1f.0 ISA bridge: Intel Corporation Z77 Express Chipset LPC Controller (rev 04)
00:1f.2 SATA controller: Intel Corporation 7 Series/C210 Series Chipset Family 6-port SATA Controller [AHCI mode] (rev 04)
00:1f.3 SMBus: Intel Corporation 7 Series/C210 Series Chipset Family SMBus Controller (rev 04)
01:00.0 VGA compatible controller: NVIDIA Corporation GK106 [GeForce GTX 660] (rev a1)
01:00.1 Audio device: NVIDIA Corporation GK106 HDMI Audio Controller (rev a1)
03:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 09)
04:00.0 PCI bridge: ASMedia Technology Inc. ASM1083/1085 PCIe to PCI Bridge (rev 03)
06:00.0 SATA controller: Marvell Technology Group Ltd. 88SE9120 SATA 6Gb/s Controller (rev 12)
06:00.1 IDE interface: Marvell Technology Group Ltd. 88SE912x IDE Controller (rev 12)
Comment by Artur Jaroschek (arturj) - Tuesday, 21 October 2014, 21:23 GMT
Here is a screenshot from the graphical error (it should be the default wallpaper) from one of the corners of my desktop:
Comment by Doug Newgard (Scimmia) - Thursday, 23 October 2014, 03:13 GMT
Taking the shotgun approach and assigning to both the kernel and the driver.
Comment by Sven-Hendrik Haase (Svenstaro) - Thursday, 23 October 2014, 04:43 GMT
Since you get the same results on linux-lts (and supposedly nvidia-lts right?), I think we can rule out that this is either a kernel or nvidia problem.
Comment by Artur Jaroschek (arturj) - Thursday, 23 October 2014, 04:59 GMT
agree. Even tried old nvidia driver with same result. Maybe DRM or xserver/wayland?
Comment by Doug Newgard (Scimmia) - Thursday, 23 October 2014, 05:57 GMT
@Svenstaro, then help me out here. I asked for opinions on #archlinux-bugs and the only response I got was that it was probably the kernel or nvidia.
Comment by naveen (naveen) - Thursday, 23 October 2014, 11:38 GMT
I have been having (similar) problems since the last week. Lenovo T420 with nvidia NVS4200M (discrete mode), awesome wm.
The problem initially started with gnome-terminal greying/blacking out after suspend/resume or after a display switch (dock/undock). I have also been trying to use autorandr and disper and I initially attributed the problem to be specific to those, but since then, I have tried only using nvidia-settings, but I still see this. I switched away from gnome-terminal to xfce4-terminal and that solved issues with the terminal. But, the problem now shows up with firefox. I suspend/resume or change display configuration and firefox window is greyed out forcing me to kill firefox and start it afresh.
FWIW, I have tried various linux kernel versions, nvidia driver versions as well as nouveau and I still see this. It looks to be related to gtk3, but today, I downgraded gtk3 to 3.12, but the problem continues. I'm wondering if I need to revert a lot more packages.
Comment by naveen (naveen) - Wednesday, 05 November 2014, 08:17 GMT
Last few updates seem to have resolved this problem for me. I am only using nvidia-settings to change display configuration and the system has had no such issues in the last week.
Comment by Artur Jaroschek (arturj) - Wednesday, 05 November 2014, 18:49 GMT
For me the issue still persists. The only thing is that firefox and other apps do not freeze anymore so I can use my system after resuming. After some minutes the graphical erros disappear (live wallpaper gets updated maybe).
Comment by Jeremy M. (jskier) - Saturday, 15 November 2014, 13:44 GMT
Same issue with 3.17.2-1-ARCH, nvidia 343.22-4 (Geforce 550 ti card).
Comment by Artur Jaroschek (arturj) - Wednesday, 10 December 2014, 19:18 GMT
Still have this issue:

nvidia 343.22-5
kernel 3.17.4-1-ARCH

Any new ideas? Never ever had this on any distro...
Has someone tried a clean install?
I already have tried creating a new fresh user - still the same.
Comment by Matheus Caldas Santos (mtcs) - Friday, 16 January 2015, 13:01 GMT
Same here,

I have a GTX 640 w/ optimus working in prime mode. Maybe it has some relation
Comment by Jeremy M. (jskier) - Saturday, 31 January 2015, 04:19 GMT
Still crashing on linux-3.18.4-1, nvidia-346.35-4.
Comment by Artur Jaroschek (arturj) - Friday, 01 May 2015, 19:03 GMT
Issue has been resolved (for me) with a pacman -Syu a few days ago.

[2015-04-24 10:40] [PACMAN] Running 'pacman -Syu'
[2015-04-24 10:40] [PACMAN] synchronizing package lists
[2015-04-24 10:40] [PACMAN] starting full system upgrade
[2015-04-24 10:42] [ALPM] transaction started
[2015-04-24 10:42] [ALPM] upgraded glibc (2.21-2 -> 2.21-3)
[2015-04-24 10:42] [ALPM] upgraded git (2.3.5-1 -> 2.3.6-1)
[2015-04-24 10:42] [ALPM] upgraded libsystemd (219-5 -> 219-6)
[2015-04-24 10:42] [ALPM] upgraded systemd (219-5 -> 219-6)
[2015-04-24 10:42] [ALPM] upgraded nvidia-utils (346.59-1 -> 349.16-1)
[2015-04-24 10:42] [ALPM-SCRIPTLET] If you run into trouble with CUDA not being available, run nvidia-modprobe first.
[2015-04-24 10:42] [ALPM] upgraded nvidia-libgl (346.59-1 -> 349.16-1)
[2015-04-24 10:42] [ALPM] upgraded hplip (3.15.2-3 -> 3.15.4-1)
[2015-04-24 10:42] [ALPM] upgraded lib32-nvidia-utils (346.59-1 -> 349.16-1)
[2015-04-24 10:42] [ALPM] upgraded lib32-nvidia-libgl (346.59-1 -> 349.16-1)
[2015-04-24 10:42] [ALPM] upgraded libinput (0.13.0-2 -> 0.14.1-1)
[2015-04-24 10:42] [ALPM] upgraded man-pages (3.82-1 -> 3.83-1)
[2015-04-24 10:42] [ALPM] upgraded nvidia (346.59-1 -> 349.16-1)
[2015-04-24 10:42] [ALPM] upgraded systemd-sysvcompat (219-5 -> 219-6)
[2015-04-24 10:42] [ALPM] upgraded tcpdump (4.7.3-2 -> 4.7.4-1)
[2015-04-24 10:42] [ALPM] upgraded xf86-input-void (1.4.0-7 -> 1.4.1-1)
[2015-04-24 10:42] [ALPM] transaction completed

Loading...