Arch Linux


FS#72765 - Failed to start GNOME Display Manager.

Attached to Project: Arch Linux
Opened by Mihuleac Sergiu (physicalit) - Tuesday, 16 November 2021, 13:00 GMT
Last edited by Jonas Witschel (diabonas) - Wednesday, 17 November 2021, 10:07 GMT
Task Type Bug Report
Category Packages: Extra
Status Assigned
Assigned To Jan Alexander Steffens (heftig)
Architecture x86_64
Severity High
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 0%
Votes 2
Private No

Details

Description:
I have a system (nvidia gpu) with minimal packages on it. I use it for docker, but I also have a desktop environment (gnome) on it, and that is it; no custom modifications or anything.
After the update I did a few days ago, the gdm service won't start; someone fucked it up. I am very curious why this is not posted on the main page of the community, because if I have this issue on a very basic system, most likely all gnome users have it.


Additional info:
* package version(s) GDM 41.0
* config and/or log files etc. Standard, nothing was modified.
* link to upstream bug report, if any, not found.

Steps to reproduce:
Start the computer and the system will fail to start gdm.

× gdm.service - GNOME Display Manager
Loaded: loaded (/usr/lib/systemd/system/gdm.service; enabled; vendor preset: disabled)
Active: failed (Result: core-dump) since Tue 2021-11-16 14:50:25 EET; 5s ago
Process: 14308 ExecStart=/usr/bin/gdm (code=dumped, signal=SEGV)
Main PID: 14308 (code=dumped, signal=SEGV)
CPU: 27ms

Nov 16 14:50:25 node1 systemd[1]: gdm.service: Failed with result 'core-dump'.
Nov 16 14:50:25 node1 systemd[1]: gdm.service: Scheduled restart job, restart counter is at 5.
Nov 16 14:50:25 node1 systemd[1]: Stopped GNOME Display Manager.
Nov 16 14:50:25 node1 systemd[1]: gdm.service: Start request repeated too quickly.
Nov 16 14:50:25 node1 systemd[1]: gdm.service: Failed with result 'core-dump'.
Nov 16 14:50:25 node1 systemd[1]: Failed to start GNOME Display Manager.
Nov 16 14:50:25 node1 systemd[1]: gdm.service: Triggering OnFailure= dependencies.
Nov 16 14:50:25 node1 systemd[1]: gdm.service: Failed to enqueue OnFailure= job, ignoring: Unit plymouth-quit.service no


node1 :: /home/giumbai » gdm
[1] 22020 segmentation fault (core dumped) gdm

The error is so general that there is nothing I can investigate. No other service is down or affected and the system works normally; the only issue is that I cannot boot into graphical mode.
Please do not push updates that are not tested as stable!!!!!!!!!!!!!!!!!!! I feel like I should not have to say this, but please stop pushing bugs!!!!!

Comment by Antonio Rojas (arojas) - Tuesday, 16 November 2021, 13:38 GMT
Please refrain from using the bug tracker to rant. "most likely all gnome users have it" is obviously false, as this is the first bug report about this a week after the release. If you want to help, start by posting a proper backtrace.
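For reference, a backtrace for a crash like this can usually be pulled out of the stored core dump. The following is only a minimal sketch, assuming systemd-coredump captured the dump and gdb is installed; the function name gdm_backtrace and the temporary-file handling are illustrative, not part of any package. Without debug symbols for gdm and its libraries, most frames will be unnamed.

```shell
#!/bin/sh
# Sketch: print a backtrace from the most recent core dump of an executable.
# Assumption: systemd-coredump stored the dump; gdm_backtrace is a made-up name.
gdm_backtrace() {
    exe="${1:-/usr/bin/gdm}"

    # Bail out early if the required tools are missing.
    for tool in coredumpctl gdb; do
        if ! command -v "$tool" >/dev/null 2>&1; then
            echo "missing required tool: $tool" >&2
            return 127
        fi
    done

    core="$(mktemp)" || return 1

    # Write the newest core dump matching the executable path to a file...
    if coredumpctl dump "$exe" --output="$core" >/dev/null 2>&1; then
        # ...then have gdb print a full backtrace of every thread and exit.
        gdb -batch -ex 'thread apply all bt full' "$exe" "$core"
        status=$?
    else
        echo "no core dump found for $exe" >&2
        status=1
    fi

    rm -f "$core"
    return "$status"
}
```

Calling gdm_backtrace /usr/bin/gdm as root (core dumps of system services are normally not readable by regular users) should produce output that can be attached to the task.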
Comment by Mihuleac Sergiu (physicalit) - Tuesday, 16 November 2021, 14:18 GMT
So, apparently every Nvidia driver version above 470 kills gdm. Also, driver version 470 is not compatible with gdm 41, so both packages need to be downgraded.
Comment by Doug Newgard (Scimmia) - Tuesday, 16 November 2021, 14:34 GMT
What GPU? Nvidia just dropped support for some after 470.
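One way to answer this, and to see which kernel driver is actually bound to the card, is sketched below. It assumes lspci (pciutils) is available; the helper names loaded_gpu_modules and gpu_driver_report are hypothetical and only illustrate the idea.

```shell
#!/bin/sh
# Sketch: report the GPU model and which kernel driver is in use.
# Both function names are made up for this example.

# Print the names of nvidia/nouveau modules found in a /proc/modules-style file.
loaded_gpu_modules() {
    modules_file="${1:-/proc/modules}"
    awk '$1 ~ /^(nvidia|nouveau)/ { print $1 }' "$modules_file"
}

gpu_driver_report() {
    if command -v lspci >/dev/null 2>&1; then
        # -nn adds vendor:device IDs, -k adds the "Kernel driver in use" line.
        lspci -nnk | grep -E -A3 'VGA compatible controller|3D controller'
    else
        echo "lspci not found (is pciutils installed?)" >&2
    fi

    echo "Loaded GPU kernel modules:"
    loaded_gpu_modules
}
```

Running gpu_driver_report prints the card's model with its PCI ID, which can be compared against Nvidia's list of GPUs supported by a given driver branch, and shows whether nouveau or the proprietary nvidia module is loaded.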
Comment by Mihuleac Sergiu (physicalit) - Tuesday, 16 November 2021, 15:04 GMT
So, it worked earlier because it actually used nouveau instead of the proprietary driver. No matter the version, I think gdm does not speak with the Nvidia API. I have attached the full debug log of gdm.
I could not find a single reason in the log why it fails. There are some references to this bug https://bugzilla.redhat.com/show_bug.cgi?id=1924079, but I don't think it crashes because of that.

I think that all users that use the nvidia, nvidia-lts or the manual install of the nvidia driver from their website are affected.
Comment by Mihuleac Sergiu (physicalit) - Tuesday, 16 November 2021, 15:08 GMT
I also tested whether I can use the GPU from the PyTorch framework, and it works fine, so other processes can access the GPU; only gdm can't, and it doesn't give any error in the log either. It just says core dumped, with nothing more to go on. I enabled debug mode, as seen in the above attachment.
