FS#20918 - 2.6.35.5-1 Fails To Boot: ERROR 24: Attempt to access block outside partition

Attached to Project: Arch Linux
Opened by David C. Rankin (drankinatty) - Wednesday, 22 September 2010, 17:32 GMT
Last edited by Tobias Powalowski (tpowa) - Wednesday, 29 September 2010, 06:21 GMT
Task Type Bug Report
Category Packages: Core
Status Closed
Assigned To No-one
Architecture All
Severity Critical
Priority Normal
Reported Version
Due in Version Undecided
Due Date Undecided
Percent Complete 100%
Votes 0
Private No

Details

Description:

Just updated and 2.6.35.5-1 fails to boot. It fails at the very start of boot. The screen output is:

kernel /vmlinuz root=/dev/mapper/nvidia_baaccajap5 ro vga=0x31a

[Linux-bzImage setup=0x3200, size=0x20ac50]

Error 24: Attempt to access bloack outside partition

Press any key to continue...


This is on my test server and looks like the same/similar issue to bug  FS#20499 . I don't know what Error 24 is, but I suspect it has something to do with the dmraid setup and not being able to find the partitions. The update complete perfectly:

:: Retrieving packages from core...
grep-2.7-1-x86_64 158.0K 204.0K/s 00:00:01 [#####################################] 100%
kernel26-2.6.35.5-1-x86_64 23.5M 507.3K/s 00:00:47 [#####################################] 100%
kernel26-headers-2.6.35.5-1-x86_64 3.9M 482.3K/s 00:00:08 [#####################################] 100%
kernel26-manpages-2.6.35.5-1-x86_64 2036.9K 466.8K/s 00:00:04 [#####################################] 100%
libldap-2.4.23-1-x86_64 357.3K 313.7K/s 00:00:01 [#####################################] 100%
mlocate-0.23.1-1-x86_64 74.8K 154.2K/s 00:00:00 [#####################################] 100%
:: Retrieving packages from extra...
openldap-2.4.23-1-x86_64 797.3K 389.9K/s 00:00:02 [#####################################] 100%
checking package integrity...
(7/7) checking for file conflicts [#####################################] 100%
(1/7) upgrading grep [#####################################] 100%
(2/7) upgrading kernel26 [#####################################] 100%
>>> Updating module dependencies. Please wait ...
>>> MKINITCPIO SETUP
>>> ----------------
>>> If you use LVM2, Encrypted root or software RAID,
>>> Ensure you enable support in /etc/mkinitcpio.conf .
>>> More information about mkinitcpio setup can be found here:
>>> http://wiki.archlinux.org/index.php/Mkinitcpio

>>> Generating initial ramdisk, using mkinitcpio. Please wait...
==> Building image "default"
==> Running command: /sbin/mkinitcpio -k 2.6.35-ARCH -c /etc/mkinitcpio.conf -g /boot/kernel26.img
:: Begin build
:: Parsing hook [base]
:: Parsing hook [udev]
:: Parsing hook [autodetect]
:: Parsing hook [pata]
:: Parsing hook [scsi]
:: Parsing hook [sata]
:: Parsing hook [dmraid]
:: Parsing hook [filesystems]
:: Generating module dependencies
:: Generating image '/boot/kernel26.img'...SUCCESS
==> SUCCESS
==> Building image "fallback"
==> Running command: /sbin/mkinitcpio -k 2.6.35-ARCH -c /etc/mkinitcpio.conf -g /boot/kernel26-fallback.img -S autodetect
:: Begin build
:: Parsing hook [base]
:: Parsing hook [udev]
:: Parsing hook [pata]
:: Parsing hook [scsi]
:: Parsing hook [sata]
:: Parsing hook [dmraid]
:: Parsing hook [filesystems]
:: Generating module dependencies
:: Generating image '/boot/kernel26-fallback.img'...SUCCESS
==> SUCCESS
(3/7) upgrading kernel26-headers [#####################################] 100%
(4/7) upgrading kernel26-manpages [#####################################] 100%
(5/7) upgrading libldap [#####################################] 100%
(6/7) upgrading mlocate [#####################################] 100%
(7/7) upgrading openldap [#####################################] 100%

So what to try, what to check? 2.6.35.4-1 worked perfectly, so whatever changed for 2.6.35.5-1 is the issue. Let me know what else you need. This box hardware info is:

The server is based on a MSI K9N2 board (MS-7374) with a Phenom 9850 proc & 8G of ram. The box has 2 dmraid arrays:

[22:00 ecstasy:/mnt/arch] # dmraid -r
/dev/sdd: nvidia, "nvidia_baaccaja", mirror, ok, 1465149166 sectors, data@ 0
/dev/sdc: nvidia, "nvidia_fdaacfde", mirror, ok, 976773166 sectors, data@ 0
/dev/sdb: nvidia, "nvidia_baaccaja", mirror, ok, 1465149166 sectors, data@ 0
/dev/sda: nvidia, "nvidia_fdaacfde", mirror, ok, 976773166 sectors, data@ 0

[22:00 ecstasy:/mnt/arch] # dmraid -s
*** Active Set
name : nvidia_baaccaja
size : 1465149056
stride : 128
type : mirror
status : ok
subsets: 0
devs : 2
spares : 0
*** Active Set
name : nvidia_fdaacfde
size : 976773120
stride : 128
type : mirror
status : ok
subsets: 0
devs : 2
spares : 0

The lspci information for the soft-raid controller is:

00:09.0 RAID bus controller: nVidia Corporation MCP78S [GeForce 8200] SATA Controller (RAID mode) (rev a2)
Subsystem: Micro-Star International Co., Ltd. Device 7374
Flags: bus master, 66MHz, fast devsel, latency 0, IRQ 28
I/O ports at b080 [size=8]
I/O ports at b000 [size=4]
I/O ports at ac00 [size=8]
I/O ports at a880 [size=4]
I/O ports at a800 [size=16]
Memory at f9e76000 (32-bit, non-prefetchable) [size=8K]
Capabilities: [44] Power Management version 2
Capabilities: [8c] SATA HBA v1.0
Capabilities: [b0] MSI: Enable+ Count=1/8 Maskable- 64bit+
Capabilities: [ec] HyperTransport: MSI Mapping Enable+ Fixed+
Kernel driver in use: ahci
Kernel modules: ahci

Let me know what else to send. I'll also open a bug with this info...

Steps to reproduce: Just trying to boot after update :(
This task depends upon

Closed by  Tobias Powalowski (tpowa)
Wednesday, 29 September 2010, 06:21 GMT
Reason for closing:  Fixed
Comment by Jan de Groot (JGC) - Wednesday, 22 September 2010, 20:48 GMT
This is an error message from grub, it has nothing to do with the kernel. Possible things could be:
- wrong hd device specified
- corrupted filesystem
- dmraid configuration broken, either inside grub or the dmraid stuff on your system itself
Comment by David C. Rankin (drankinatty) - Wednesday, 22 September 2010, 22:03 GMT
Huh? Nothing changed in /boot/grub/menu.lst I'll double-check in case we had an intermittent dmraid failure and report back. Right now the box is running just fine on LTS. Thanks.

# (0) Arch Linux
title Arch Linux on Archangel
root (hd1,5)
kernel /vmlinuz26 root=/dev/mapper/nvidia_baaccajap5 ro vga=0x31a
initrd /kernel26.img

# (1) Arch Linux
title Arch Linux Fallback
root (hd1,5)
kernel /vmlinuz26 root=/dev/mapper/nvidia_baaccajap5 ro
initrd /kernel26-fallback.img

# (2) Arch Linux LTS
title Arch Linux LTS on Archangel
root (hd1,5)
kernel /vmlinuz26-lts root=/dev/mapper/nvidia_baaccajap5 ro vga=0x31a
initrd /kernel26-lts.img

# (3) Arch Linux LTS-Fallback
title Arch Linux LTS Fallback
root (hd1,5)
kernel /vmlinuz26-lts root=/dev/mapper/nvidia_baaccajap5 ro
initrd /kernel26-lts-fallback.img

# (4) memtest86+
title Memtest86+ on Archangel
kernel (hd1,5)/memtest86+/memtest.bin
Comment by David C. Rankin (drankinatty) - Wednesday, 22 September 2010, 22:09 GMT
Nope - still broken. It's a bug somewhere Jan. I just tried to boot it again and it fails at the same location. Reboot to LTS - just fine. I don't know what the heck grub is trying to do, but it is really messed up. 3 or 4 of the past 6 kernels have had boot issues on this box. All OS's work fine on the box, except for these boot issues. What say the masters as what to try next?
Comment by Jan de Groot (JGC) - Wednesday, 22 September 2010, 22:23 GMT
This looks more like a problem with the location of the kernel on your filesystem, not the kernel itself. Searching google for your error, I've seen Ubuntu users having the same problem with random Ubuntu kernels.

Is this grub 0.x with your kernel on ext4? If so, you might want to consider installing grub2 instead. I've been very unlucky with ext4 and grub 0.x on my systems also.
Comment by David C. Rankin (drankinatty) - Thursday, 23 September 2010, 01:11 GMT
Thanks Jan,

I read the issues on ext4 and /boot so all my /boot partitions are still ext3. I'll google and see what comes up. We may need Tobias in on this as well as it could be a dmraid issue as well. I know there has been 1 dmraid update between 2.6.35.4-1 and 2.6.35.5-1 so maybe the update of the kernel tripped a problem with dmraid that wasn't present with 2.6.35.4-1.

17:54 archangel:~> mount
proc on /proc type proc (rw,relatime)
sys on /sys type sysfs (rw,relatime)
udev on /dev type devtmpfs (rw,nosuid,relatime,size=10240k,nr_inodes=1022847,mode=755)
/dev/mapper/nvidia_baaccajap5 on / type ext3 (rw,relatime,errors=continue,data=writeback)
none on /dev/pts type devpts (rw)
none on /dev/shm type tmpfs (rw)
/dev/mapper/nvidia_baaccajap6 on /boot type ext3 (rw)
/dev/mapper/nvidia_baaccajap7 on /home type ext3 (rw)
/dev/mapper/nvidia_baaccajap9 on /var type ext3 (rw)
/dev/mapper/nvidia_baaccajap10 on /srv type ext3 (rw)
/dev/mapper/nvidia_baaccajap12 on /data type ext4 (rw)
/dev/mapper/nvidia_baaccajap11 on /home/david/pvt type ext4 (rw)
Comment by David C. Rankin (drankinatty) - Thursday, 23 September 2010, 05:43 GMT
Jan,

This has got to be a dmraid/kernel/grub issue with my boards chipset. I went ahead and updated my production x86_64 box (Identical s/w config) and I have no issues booting the other box.

However, the box I opened this bug on is still DEAD. I haven't found anything relevant googling yet except that Error 24 supposedly means you have the grub (drive,part) information messed up. That is NOT the case here. This is a problem with the way the boot process inits dmraid with kernel 2.6.35.5-1. If I downgrade to 2.6.35.4-1 everything works FINE. If I upgrade to 2.6.35.5-1, the box fails to boot and I have to reboot into LTS to even bring it up.

Let me know if I can provide anything else that narrows this issue down. If any one of you want direct access to the box, email me privately and I'll send you credentials and port info. I can check anything you like at this end, I just need to know what to look at. Thanks.
Comment by David C. Rankin (drankinatty) - Friday, 24 September 2010, 07:18 GMT
Just did another test on the box that fails to boot with 2.6.35.5-1 where I downgraded it again to 2.6.35.4-2. The box boots beautifully with 2.6.35.4-2. The downgrade output is identical to the upgrade output, but this time the box boots. Here is the downgrade output:

01:59 archangel:/home/backup/pkg-old> sudo pacman -U kernel26-2.6.35.4-2-x86_64.pkg.tar.xz kernel26-headers-2.6.35.4-2-x86_64.pkg.tar.xz kernel26-manpages-2.6.35.4-2-x86_64.pkg.tar.xz
resolving dependencies...
looking for inter-conflicts...

Targets (3): kernel26-2.6.35.4-2 kernel26-headers-2.6.35.4-2 kernel26-manpages-2.6.35.4-2

Total Download Size: 0.00 MB
Total Installed Size: 161.81 MB

Proceed with installation? [Y/n]
checking package integrity...
(3/3) checking for file conflicts [#####################################] 100%
(1/3) upgrading kernel26 [#####################################] 100%
>>> Updating module dependencies. Please wait ...
>>> MKINITCPIO SETUP
>>> ----------------
>>> If you use LVM2, Encrypted root or software RAID,
>>> Ensure you enable support in /etc/mkinitcpio.conf .
>>> More information about mkinitcpio setup can be found here:
>>> http://wiki.archlinux.org/index.php/Mkinitcpio

>>> Generating initial ramdisk, using mkinitcpio. Please wait...
==> Building image "default"
==> Running command: /sbin/mkinitcpio -k 2.6.35-ARCH -c /etc/mkinitcpio.conf -g /boot/kernel26.img
:: Begin build
:: Parsing hook [base]
:: Parsing hook [udev]
:: Parsing hook [autodetect]
:: Parsing hook [pata]
:: Parsing hook [scsi]
:: Parsing hook [sata]
:: Parsing hook [dmraid]
:: Parsing hook [filesystems]
:: Generating module dependencies
:: Generating image '/boot/kernel26.img'...SUCCESS
==> SUCCESS
==> Building image "fallback"
==> Running command: /sbin/mkinitcpio -k 2.6.35-ARCH -c /etc/mkinitcpio.conf -g /boot/kernel26-fallback.img -S autodetect
:: Begin build
:: Parsing hook [base]
:: Parsing hook [udev]
:: Parsing hook [pata]
:: Parsing hook [scsi]
:: Parsing hook [sata]
:: Parsing hook [dmraid]
:: Parsing hook [filesystems]
:: Generating module dependencies
:: Generating image '/boot/kernel26-fallback.img'...SUCCESS
==> SUCCESS
(2/3) upgrading kernel26-headers [#####################################] 100%
(3/3) upgrading kernel26-manpages [#####################################] 100%
02:02 archangel:/home/backup/pkg-old> x
logout
Connection to archangel closed.
02:02 alchemy:~> sar 'sudo shutdown -r now'

The box popped right up running 2.6.35.4-2, so there is a problem here with 2.6.35.5-1. (I don't know what it is, but it is there bigger than Texas. Boots with 2.6.35.4-2 perfectly. However if I upgrade the 3 packages to 2.6.35.5-1 (kernel26, kernel26-headers, kernel26-manpages) then the box Locks-Up on boot. That makes no sense unless somebody dorked with KMS again between 2.6.35.4-2 and 2.6.35.5-1. Where there any changes there whereby my kernel line with 0x31a now prevents KMS early in 2.6.35.5-1 where it did not with 2.6.35.4-2??

What say the master archers on what to check next. The grub config hasn't changed in months so that isn't relevant. This thing smells like a dmraid init problem such that grub doesn't see the dmraid paritions and then grub throws a grub error, where the error all along was a kernel dmraid init issue.

I'm basically out of ideas on what I can try next. Can you throw a brother a bone of wisdom here? Thanks.
Comment by David C. Rankin (drankinatty) - Monday, 27 September 2010, 15:02 GMT
Just a FYI follow-up - updated to kernel26-lts-2.6.32.22-1-x86_64 and it works just fine. 2.6.35.5-1 still won't boot.

There was one strange curiosity when I did the last update. pacman updated 2.6.35.5-1 again even though there was no minor version change. Here is the update output:

FORGET THE ABOVE COMMENT - I DOWNGRADED SO THE UPGRADE WAS EXPECTED (duh...)

<snip>

Targets (13): bluez-4.72-1 ed-1.5-1 electricsheep-2.7b12-3 gparted-0.6.3-1 kernel26-2.6.35.5-1
kernel26-headers-2.6.35.5-1 kernel26-lts-2.6.32.22-1 kernel26-manpages-2.6.35.5-1
konq-plugins-svn1179246-1 mingw32-binutils-2.20.51-1 sysvinit-2.88-1 transcode-1.1.5-5
xscreensaver-5.12-1

Total Download Size: 28.70 MB
Total Installed Size: 303.24 MB

<snip>

( 5/13) upgrading kernel26 [#####################################] 100%
>>> Updating module dependencies. Please wait ...
>>> MKINITCPIO SETUP
>>> ----------------
>>> If you use LVM2, Encrypted root or software RAID,
>>> Ensure you enable support in /etc/mkinitcpio.conf .
>>> More information about mkinitcpio setup can be found here:
>>> http://wiki.archlinux.org/index.php/Mkinitcpio

>>> Generating initial ramdisk, using mkinitcpio. Please wait...
==> Building image "default"
==> Running command: /sbin/mkinitcpio -k 2.6.35-ARCH -c /etc/mkinitcpio.conf -g /boot/kernel26.img
:: Begin build
:: Parsing hook [base]
:: Parsing hook [udev]
:: Parsing hook [autodetect]
:: Parsing hook [pata]
:: Parsing hook [scsi]
:: Parsing hook [sata]
:: Parsing hook [dmraid]
:: Parsing hook [filesystems]
:: Generating module dependencies
:: Generating image '/boot/kernel26.img'...SUCCESS
==> SUCCESS
==> Building image "fallback"
==> Running command: /sbin/mkinitcpio -k 2.6.35-ARCH -c /etc/mkinitcpio.conf -g /boot/kernel26-fallback.img -S autodetect
:: Begin build
:: Parsing hook [base]
:: Parsing hook [udev]
:: Parsing hook [pata]
:: Parsing hook [scsi]
:: Parsing hook [sata]
:: Parsing hook [dmraid]
:: Parsing hook [filesystems]
:: Generating module dependencies
:: Generating image '/boot/kernel26-fallback.img'...SUCCESS
==> SUCCESS
( 6/13) upgrading kernel26-headers [#####################################] 100%

I tried booting 2.6.35.5-1 and it failed at the same point - first few lines on the screen and then locks up. (the grub Error 24 is gone now, but it still hangs)

I agree it looks like grub reads the boot code from the MBR and then can't find the dmraid partition to boot from. However -- nothing, I mean nothing has changed there between 2.6.35.4-2 and 2.6.35.5-1.

Since this box is a server, I'd rather not move to grub2 until I am more comfortable with it. Why is normal old grub failing?

Here is a bit of additional info about my config.

mkinitcpio.conf:

[09:52 archangel:/home/david] # cat /etc/mkinitcpio.conf
# vim:set ft=sh
# MODULES
# The following modules are loaded before any boot hooks are
# run. Advanced users may wish to specify all system modules
# in this array. For instance:
# MODULES="piix ide_disk reiserfs"
MODULES="dm_mod dm_mirror sata_nv"

rc.conf:

MOD_AUTOLOAD="yes"
#MOD_BLACKLIST=() #deprecated
MODULES=(dm_mod dm_mirror sata_sil nvidia)

# Scan for LVM volume groups at startup, required if you use LVM
USELVM="no"

These settings are the same settings I have always used with this box and it 2 nvraid dmraid arrays.

Let me know what else to check. I'd like to figure out why 2.6.35.5-1 won't boot and get back on the normal Arch kernel.
Comment by David C. Rankin (drankinatty) - Monday, 27 September 2010, 15:09 GMT
Oops forgot the HOOKS line:

HOOKS="base udev autodetect pata scsi sata dmraid filesystems"
Comment by David C. Rankin (drankinatty) - Monday, 27 September 2010, 15:18 GMT
could this be related to the sata_sil issue in:

https://bugs.archlinux.org/task/20614
Comment by David C. Rankin (drankinatty) - Monday, 27 September 2010, 15:51 GMT
In rc.conf I've tried changing the MODULES line sata inclusion from sata_sil with sata_nv and rebooted - no change. The normal kernel still fails and LTS still boots fine.

I misstated in my recent comment that the GRUB ERROR: 24 was gone -- it's not, it's still there, I just shut down after the hang but before the grub timeout. GRUB ERROR: 24 is still present. Back to square 1...
Comment by David C. Rankin (drankinatty) - Monday, 27 September 2010, 19:29 GMT
Jan,

Any other thoughts/guesses here? I'll try anything. I know you said it looked like grub can't find the kernel, but why? My menu.lst hasn't changed since August 2009 when I added LTS and the contents of /boot look fine:

[14:32 archangel:/home/david] # l /boot
total 28777
drwxr-xr-x 5 root root 1024 Sep 26 18:55 .
drwxr-xr-x 22 root root 4096 Sep 21 15:45 ..
-rw-r--r-- 1 root root 1400334 Sep 21 04:57 System.map26
-rw-r--r-- 1 root root 1345754 Sep 21 04:57 System.map26-lts
drwxr-xr-x 2 root root 1024 Aug 23 13:18 grub
-rw-r--r-- 1 root root 9098203 Sep 26 18:55 kernel26-fallback.img
-rw-r--r-- 1 root root 8305205 Sep 26 18:56 kernel26-lts-fallback.img
-rw-r--r-- 1 root root 2476409 Sep 26 18:55 kernel26-lts.img
-rw-r--r-- 1 root root 2501522 Sep 26 18:54 kernel26.img
drwx------ 2 root root 12288 Apr 20 2009 lost+found
drwxr-xr-x 2 root root 1024 Jul 4 14:40 memtest86+
-rw-r--r-- 1 root root 2154576 Sep 21 04:57 vmlinuz26
-rw-r--r-- 1 root root 2034832 Sep 21 04:57 vmlinuz26-lts

There is something in the way 2.6.35.5-1 inits that's just broken, but what? Any help appreciated.
Comment by David C. Rankin (drankinatty) - Monday, 27 September 2010, 19:52 GMT
Also, I looked at the initrd images with 'bsdtar -t -f /boot/kernel26.img' and compared it to LTS 'bsdtar -t -f /boot/kernel26-lts.img'.

The only difference between them other than the obvious '2.6.35-ARCH' and '2.6.32-lts' is the 2.6.35-ARCH initrd contains:

/lib/modules/2.6.35-ARCH/kernel/drivers/ata/ahci.ko

which LTS doesn't. This line is right before the '/dev/mapper' entry. Could ahci.ko be part of the problem?

Comment by David C. Rankin (drankinatty) - Tuesday, 28 September 2010, 01:39 GMT
Guys - update: 2.6.35.6-1 boot but hangs on UDEV activation until the 120 sec timeout occurs. Here is the dmesg error:

INFO: task modprobe:1547 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
modprobe D 0000000000000000 0 1547 1545 0x00000000
ffff880226c91a28 0000000000000086 ffff88022fca0000 ffff880200000000
0000000000014f40 0000000000014f40 ffff880226c91fd8 ffff880226c91fd8
ffff880226c91fd8 ffff880226ec6120 ffff880226c91fd8 0000000000014f40
Call Trace:
[<ffffffffa01f7ba5>] usb_kill_urb+0x85/0xc0 [usbcore]
[<ffffffff810719b0>] ? autoremove_wake_function+0x0/0x40
[<ffffffffa0275831>] usbhid_init_reports+0xb1/0x120 [usbhid]
[<ffffffffa0275d53>] usbhid_start+0x4b3/0x5a0 [usbhid]
[<ffffffffa025f6d8>] hid_device_probe+0x98/0xe0 [hid]
[<ffffffff81287cfa>] ? driver_sysfs_add+0x5a/0x90
[<ffffffff81287fd6>] driver_probe_device+0x96/0x1c0
[<ffffffff812881a0>] ? __device_attach+0x0/0x60
[<ffffffff812881eb>] __device_attach+0x4b/0x60
[<ffffffff81286bb4>] bus_for_each_drv+0x64/0x90
[<ffffffff81287e6f>] device_attach+0x8f/0xb0
[<ffffffff81287625>] bus_probe_device+0x25/0x40
[<ffffffff8128536f>] device_add+0x4ff/0x5e0
[<ffffffffa025f0a7>] hid_add_device+0x87/0x1b0 [hid]
[<ffffffffa02734b9>] usbhid_probe+0x329/0x500 [usbhid]
[<ffffffffa01fc9fb>] usb_probe_interface+0xfb/0x1f0 [usbcore]
[<ffffffff81287fd6>] driver_probe_device+0x96/0x1c0
[<ffffffff8128819b>] __driver_attach+0x9b/0xa0
[<ffffffff81288100>] ? __driver_attach+0x0/0xa0
[<ffffffff81286f0e>] bus_for_each_dev+0x5e/0x90
[<ffffffff81287c99>] driver_attach+0x19/0x20
[<ffffffff812877a7>] bus_add_driver+0xc7/0x2e0
[<ffffffff81288411>] driver_register+0x71/0x140
[<ffffffffa01fb6c8>] usb_register_driver+0xb8/0x170 [usbcore]
[<ffffffffa01c9000>] ? hid_init+0x0/0xd1 [usbhid]
[<ffffffffa01c9093>] hid_init+0x93/0xd1 [usbhid]
[<ffffffff81002149>] do_one_initcall+0x39/0x1a0
[<ffffffff8108cfdb>] sys_init_module+0xbb/0x200
[<ffffffff81009e82>] system_call_fastpath+0x16/0x1b
INFO: task modprobe:1547 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
modprobe D 0000000000000000 0 1547 1545 0x00000000
ffff880226c91a28 0000000000000086 ffff88022fca0000 ffff880200000000
0000000000014f40 0000000000014f40 ffff880226c91fd8 ffff880226c91fd8
ffff880226c91fd8 ffff880226ec6120 ffff880226c91fd8 0000000000014f40
Call Trace:
[<ffffffffa01f7ba5>] usb_kill_urb+0x85/0xc0 [usbcore]
[<ffffffff810719b0>] ? autoremove_wake_function+0x0/0x40
[<ffffffffa0275831>] usbhid_init_reports+0xb1/0x120 [usbhid]
[<ffffffffa0275d53>] usbhid_start+0x4b3/0x5a0 [usbhid]
[<ffffffffa025f6d8>] hid_device_probe+0x98/0xe0 [hid]
[<ffffffff81287cfa>] ? driver_sysfs_add+0x5a/0x90
[<ffffffff81287fd6>] driver_probe_device+0x96/0x1c0
[<ffffffff812881a0>] ? __device_attach+0x0/0x60
[<ffffffff812881eb>] __device_attach+0x4b/0x60
[<ffffffff81286bb4>] bus_for_each_drv+0x64/0x90
[<ffffffff81287e6f>] device_attach+0x8f/0xb0
[<ffffffff81287625>] bus_probe_device+0x25/0x40
[<ffffffff8128536f>] device_add+0x4ff/0x5e0
[<ffffffffa025f0a7>] hid_add_device+0x87/0x1b0 [hid]
[<ffffffffa02734b9>] usbhid_probe+0x329/0x500 [usbhid]
[<ffffffffa01fc9fb>] usb_probe_interface+0xfb/0x1f0 [usbcore]
[<ffffffff81287fd6>] driver_probe_device+0x96/0x1c0
[<ffffffff8128819b>] __driver_attach+0x9b/0xa0
[<ffffffff81288100>] ? __driver_attach+0x0/0xa0
[<ffffffff81286f0e>] bus_for_each_dev+0x5e/0x90
[<ffffffff81287c99>] driver_attach+0x19/0x20
[<ffffffff812877a7>] bus_add_driver+0xc7/0x2e0
[<ffffffff81288411>] driver_register+0x71/0x140
[<ffffffffa01fb6c8>] usb_register_driver+0xb8/0x170 [usbcore]
[<ffffffffa01c9000>] ? hid_init+0x0/0xd1 [usbhid]
[<ffffffffa01c9093>] hid_init+0x93/0xd1 [usbhid]
[<ffffffff81002149>] do_one_initcall+0x39/0x1a0
[<ffffffff8108cfdb>] sys_init_module+0xbb/0x200
[<ffffffff81009e82>] system_call_fastpath+0x16/0x1b
INFO: task usbhid-ups:2023 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
usbhid-ups D 0000000000000000 0 2023 1 0x00000000
ffff8802251d3b98 0000000000000082 ffff8802251d3ae8 ffffffff00000000
0000000000014f40 0000000000014f40 ffff8802251d3fd8 ffff8802251d3fd8
ffff8802251d3fd8 ffff8802258f6810 ffff8802251d3fd8 0000000000014f40
Call Trace:
[<ffffffff81371fc9>] __mutex_lock_slowpath+0x139/0x310
[<ffffffff813721b1>] mutex_lock+0x11/0x30
[<ffffffffa0201165>] usbdev_open+0x145/0x330 [usbcore]
[<ffffffff81127d17>] chrdev_open+0x127/0x270
[<ffffffff81127bf0>] ? chrdev_open+0x0/0x270
[<ffffffff8112223b>] __dentry_open+0x11b/0x3c0
[<ffffffff811a2b2a>] ? security_inode_permission+0x1a/0x20
[<ffffffff81123474>] nameidata_to_filp+0x54/0x70
[<ffffffff81131700>] do_last+0x480/0x740
[<ffffffff81131bad>] do_filp_open+0x1ed/0x680
[<ffffffff8113d184>] ? alloc_fd+0xf4/0x150
[<ffffffff811234f4>] do_sys_open+0x64/0x130
[<ffffffff811235db>] sys_open+0x1b/0x20
[<ffffffff81009e82>] system_call_fastpath+0x16/0x1b
INFO: task modprobe:1547 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
modprobe D 0000000000000000 0 1547 1545 0x00000000
ffff880226c91a28 0000000000000086 ffff88022fca0000 ffff880200000000
0000000000014f40 0000000000014f40 ffff880226c91fd8 ffff880226c91fd8
ffff880226c91fd8 ffff880226ec6120 ffff880226c91fd8 0000000000014f40
Call Trace:
[<ffffffffa01f7ba5>] usb_kill_urb+0x85/0xc0 [usbcore]
[<ffffffff810719b0>] ? autoremove_wake_function+0x0/0x40
[<ffffffffa0275831>] usbhid_init_reports+0xb1/0x120 [usbhid]
[<ffffffffa0275d53>] usbhid_start+0x4b3/0x5a0 [usbhid]
[<ffffffffa025f6d8>] hid_device_probe+0x98/0xe0 [hid]
[<ffffffff81287cfa>] ? driver_sysfs_add+0x5a/0x90
[<ffffffff81287fd6>] driver_probe_device+0x96/0x1c0
[<ffffffff812881a0>] ? __device_attach+0x0/0x60
[<ffffffff812881eb>] __device_attach+0x4b/0x60
[<ffffffff81286bb4>] bus_for_each_drv+0x64/0x90
[<ffffffff81287e6f>] device_attach+0x8f/0xb0
[<ffffffff81287625>] bus_probe_device+0x25/0x40
[<ffffffff8128536f>] device_add+0x4ff/0x5e0
[<ffffffffa025f0a7>] hid_add_device+0x87/0x1b0 [hid]
[<ffffffffa02734b9>] usbhid_probe+0x329/0x500 [usbhid]
[<ffffffffa01fc9fb>] usb_probe_interface+0xfb/0x1f0 [usbcore]
[<ffffffff81287fd6>] driver_probe_device+0x96/0x1c0
[<ffffffff8128819b>] __driver_attach+0x9b/0xa0
[<ffffffff81288100>] ? __driver_attach+0x0/0xa0
[<ffffffff81286f0e>] bus_for_each_dev+0x5e/0x90
[<ffffffff81287c99>] driver_attach+0x19/0x20
[<ffffffff812877a7>] bus_add_driver+0xc7/0x2e0
[<ffffffff81288411>] driver_register+0x71/0x140
[<ffffffffa01fb6c8>] usb_register_driver+0xb8/0x170 [usbcore]
[<ffffffffa01c9000>] ? hid_init+0x0/0xd1 [usbhid]
[<ffffffffa01c9093>] hid_init+0x93/0xd1 [usbhid]
[<ffffffff81002149>] do_one_initcall+0x39/0x1a0
[<ffffffff8108cfdb>] sys_init_module+0xbb/0x200
[<ffffffff81009e82>] system_call_fastpath+0x16/0x1b
INFO: task usbhid-ups:2023 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
usbhid-ups D 0000000000000000 0 2023 1 0x00000000
ffff8802251d3b98 0000000000000082 ffff8802251d3ae8 ffffffff00000000
0000000000014f40 0000000000014f40 ffff8802251d3fd8 ffff8802251d3fd8
ffff8802251d3fd8 ffff8802258f6810 ffff8802251d3fd8 0000000000014f40
Call Trace:
[<ffffffff81371fc9>] __mutex_lock_slowpath+0x139/0x310
[<ffffffff813721b1>] mutex_lock+0x11/0x30
[<ffffffffa0201165>] usbdev_open+0x145/0x330 [usbcore]
[<ffffffff81127d17>] chrdev_open+0x127/0x270
[<ffffffff81127bf0>] ? chrdev_open+0x0/0x270
[<ffffffff8112223b>] __dentry_open+0x11b/0x3c0
[<ffffffff811a2b2a>] ? security_inode_permission+0x1a/0x20
[<ffffffff81123474>] nameidata_to_filp+0x54/0x70
[<ffffffff81131700>] do_last+0x480/0x740
[<ffffffff81131bad>] do_filp_open+0x1ed/0x680
[<ffffffff8113d184>] ? alloc_fd+0xf4/0x150
[<ffffffff811234f4>] do_sys_open+0x64/0x130
[<ffffffff811235db>] sys_open+0x1b/0x20
[<ffffffff81009e82>] system_call_fastpath+0x16/0x1b
INFO: task modprobe:1547 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
modprobe D 0000000000000000 0 1547 1545 0x00000000
ffff880226c91a28 0000000000000086 ffff88022fca0000 ffff880200000000
0000000000014f40 0000000000014f40 ffff880226c91fd8 ffff880226c91fd8
ffff880226c91fd8 ffff880226ec6120 ffff880226c91fd8 0000000000014f40
Call Trace:
[<ffffffffa01f7ba5>] usb_kill_urb+0x85/0xc0 [usbcore]
[<ffffffff810719b0>] ? autoremove_wake_function+0x0/0x40
[<ffffffffa0275831>] usbhid_init_reports+0xb1/0x120 [usbhid]
[<ffffffffa0275d53>] usbhid_start+0x4b3/0x5a0 [usbhid]
[<ffffffffa025f6d8>] hid_device_probe+0x98/0xe0 [hid]
[<ffffffff81287cfa>] ? driver_sysfs_add+0x5a/0x90
[<ffffffff81287fd6>] driver_probe_device+0x96/0x1c0
[<ffffffff812881a0>] ? __device_attach+0x0/0x60
[<ffffffff812881eb>] __device_attach+0x4b/0x60
[<ffffffff81286bb4>] bus_for_each_drv+0x64/0x90
[<ffffffff81287e6f>] device_attach+0x8f/0xb0
[<ffffffff81287625>] bus_probe_device+0x25/0x40
[<ffffffff8128536f>] device_add+0x4ff/0x5e0
[<ffffffffa025f0a7>] hid_add_device+0x87/0x1b0 [hid]
[<ffffffffa02734b9>] usbhid_probe+0x329/0x500 [usbhid]
[<ffffffffa01fc9fb>] usb_probe_interface+0xfb/0x1f0 [usbcore]
[<ffffffff81287fd6>] driver_probe_device+0x96/0x1c0
[<ffffffff8128819b>] __driver_attach+0x9b/0xa0
[<ffffffff81288100>] ? __driver_attach+0x0/0xa0
[<ffffffff81286f0e>] bus_for_each_dev+0x5e/0x90
[<ffffffff81287c99>] driver_attach+0x19/0x20
[<ffffffff812877a7>] bus_add_driver+0xc7/0x2e0
[<ffffffff81288411>] driver_register+0x71/0x140
[<ffffffffa01fb6c8>] usb_register_driver+0xb8/0x170 [usbcore]
[<ffffffffa01c9000>] ? hid_init+0x0/0xd1 [usbhid]
[<ffffffffa01c9093>] hid_init+0x93/0xd1 [usbhid]
[<ffffffff81002149>] do_one_initcall+0x39/0x1a0
[<ffffffff8108cfdb>] sys_init_module+0xbb/0x200
[<ffffffff81009e82>] system_call_fastpath+0x16/0x1b
INFO: task usbhid-ups:2023 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
usbhid-ups D 0000000000000000 0 2023 1 0x00000000
ffff8802251d3b98 0000000000000082 ffff8802251d3ae8 ffffffff00000000
0000000000014f40 0000000000014f40 ffff8802251d3fd8 ffff8802251d3fd8
ffff8802251d3fd8 ffff8802258f6810 ffff8802251d3fd8 0000000000014f40
Call Trace:
[<ffffffff81371fc9>] __mutex_lock_slowpath+0x139/0x310
[<ffffffff813721b1>] mutex_lock+0x11/0x30
[<ffffffffa0201165>] usbdev_open+0x145/0x330 [usbcore]
[<ffffffff81127d17>] chrdev_open+0x127/0x270
[<ffffffff81127bf0>] ? chrdev_open+0x0/0x270
[<ffffffff8112223b>] __dentry_open+0x11b/0x3c0
[<ffffffff811a2b2a>] ? security_inode_permission+0x1a/0x20
[<ffffffff81123474>] nameidata_to_filp+0x54/0x70
[<ffffffff81131700>] do_last+0x480/0x740
[<ffffffff81131bad>] do_filp_open+0x1ed/0x680
[<ffffffff8113d184>] ? alloc_fd+0xf4/0x150
[<ffffffff811234f4>] do_sys_open+0x64/0x130
[<ffffffff811235db>] sys_open+0x1b/0x20
[<ffffffff81009e82>] system_call_fastpath+0x16/0x1b
INFO: task modprobe:1547 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
modprobe D 0000000000000000 0 1547 1545 0x00000000
ffff880226c91a28 0000000000000086 ffff88022fca0000 ffff880200000000
0000000000014f40 0000000000014f40 ffff880226c91fd8 ffff880226c91fd8
ffff880226c91fd8 ffff880226ec6120 ffff880226c91fd8 0000000000014f40
Call Trace:
[<ffffffffa01f7ba5>] usb_kill_urb+0x85/0xc0 [usbcore]
[<ffffffff810719b0>] ? autoremove_wake_function+0x0/0x40
[<ffffffffa0275831>] usbhid_init_reports+0xb1/0x120 [usbhid]
[<ffffffffa0275d53>] usbhid_start+0x4b3/0x5a0 [usbhid]
[<ffffffffa025f6d8>] hid_device_probe+0x98/0xe0 [hid]
[<ffffffff81287cfa>] ? driver_sysfs_add+0x5a/0x90
[<ffffffff81287fd6>] driver_probe_device+0x96/0x1c0
[<ffffffff812881a0>] ? __device_attach+0x0/0x60
[<ffffffff812881eb>] __device_attach+0x4b/0x60
[<ffffffff81286bb4>] bus_for_each_drv+0x64/0x90
[<ffffffff81287e6f>] device_attach+0x8f/0xb0
[<ffffffff81287625>] bus_probe_device+0x25/0x40
[<ffffffff8128536f>] device_add+0x4ff/0x5e0
[<ffffffffa025f0a7>] hid_add_device+0x87/0x1b0 [hid]
[<ffffffffa02734b9>] usbhid_probe+0x329/0x500 [usbhid]
[<ffffffffa01fc9fb>] usb_probe_interface+0xfb/0x1f0 [usbcore]
[<ffffffff81287fd6>] driver_probe_device+0x96/0x1c0
[<ffffffff8128819b>] __driver_attach+0x9b/0xa0
[<ffffffff81288100>] ? __driver_attach+0x0/0xa0
[<ffffffff81286f0e>] bus_for_each_dev+0x5e/0x90
[<ffffffff81287c99>] driver_attach+0x19/0x20
[<ffffffff812877a7>] bus_add_driver+0xc7/0x2e0
[<ffffffff81288411>] driver_register+0x71/0x140
[<ffffffffa01fb6c8>] usb_register_driver+0xb8/0x170 [usbcore]
[<ffffffffa01c9000>] ? hid_init+0x0/0xd1 [usbhid]
[<ffffffffa01c9093>] hid_init+0x93/0xd1 [usbhid]
[<ffffffff81002149>] do_one_initcall+0x39/0x1a0
[<ffffffff8108cfdb>] sys_init_module+0xbb/0x200
[<ffffffff81009e82>] system_call_fastpath+0x16/0x1b
INFO: task usbhid-ups:2023 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
usbhid-ups D 0000000000000000 0 2023 1 0x00000000
ffff8802251d3b98 0000000000000082 ffff8802251d3ae8 ffffffff00000000
0000000000014f40 0000000000014f40 ffff8802251d3fd8 ffff8802251d3fd8
ffff8802251d3fd8 ffff8802258f6810 ffff8802251d3fd8 0000000000014f40
Call Trace:
[<ffffffff81371fc9>] __mutex_lock_slowpath+0x139/0x310
[<ffffffff813721b1>] mutex_lock+0x11/0x30
[<ffffffffa0201165>] usbdev_open+0x145/0x330 [usbcore]
[<ffffffff81127d17>] chrdev_open+0x127/0x270
[<ffffffff81127bf0>] ? chrdev_open+0x0/0x270
[<ffffffff8112223b>] __dentry_open+0x11b/0x3c0
[<ffffffff811a2b2a>] ? security_inode_permission+0x1a/0x20
[<ffffffff81123474>] nameidata_to_filp+0x54/0x70
[<ffffffff81131700>] do_last+0x480/0x740
[<ffffffff81131bad>] do_filp_open+0x1ed/0x680
[<ffffffff8113d184>] ? alloc_fd+0xf4/0x150
[<ffffffff811234f4>] do_sys_open+0x64/0x130
[<ffffffff811235db>] sys_open+0x1b/0x20
[<ffffffff81009e82>] system_call_fastpath+0x16/0x1b
INFO: task modprobe:1547 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
modprobe D 0000000000000000 0 1547 1545 0x00000000
ffff880226c91a28 0000000000000086 ffff88022fca0000 ffff880200000000
0000000000014f40 0000000000014f40 ffff880226c91fd8 ffff880226c91fd8
ffff880226c91fd8 ffff880226ec6120 ffff880226c91fd8 0000000000014f40
Call Trace:
[<ffffffffa01f7ba5>] usb_kill_urb+0x85/0xc0 [usbcore]
[<ffffffff810719b0>] ? autoremove_wake_function+0x0/0x40
[<ffffffffa0275831>] usbhid_init_reports+0xb1/0x120 [usbhid]
[<ffffffffa0275d53>] usbhid_start+0x4b3/0x5a0 [usbhid]
[<ffffffffa025f6d8>] hid_device_probe+0x98/0xe0 [hid]
[<ffffffff81287cfa>] ? driver_sysfs_add+0x5a/0x90
[<ffffffff81287fd6>] driver_probe_device+0x96/0x1c0
[<ffffffff812881a0>] ? __device_attach+0x0/0x60
[<ffffffff812881eb>] __device_attach+0x4b/0x60
[<ffffffff81286bb4>] bus_for_each_drv+0x64/0x90
[<ffffffff81287e6f>] device_attach+0x8f/0xb0
[<ffffffff81287625>] bus_probe_device+0x25/0x40
[<ffffffff8128536f>] device_add+0x4ff/0x5e0
[<ffffffffa025f0a7>] hid_add_device+0x87/0x1b0 [hid]
[<ffffffffa02734b9>] usbhid_probe+0x329/0x500 [usbhid]
[<ffffffffa01fc9fb>] usb_probe_interface+0xfb/0x1f0 [usbcore]
[<ffffffff81287fd6>] driver_probe_device+0x96/0x1c0
[<ffffffff8128819b>] __driver_attach+0x9b/0xa0
[<ffffffff81288100>] ? __driver_attach+0x0/0xa0
[<ffffffff81286f0e>] bus_for_each_dev+0x5e/0x90
[<ffffffff81287c99>] driver_attach+0x19/0x20
[<ffffffff812877a7>] bus_add_driver+0xc7/0x2e0
[<ffffffff81288411>] driver_register+0x71/0x140
[<ffffffffa01fb6c8>] usb_register_driver+0xb8/0x170 [usbcore]
[<ffffffffa01c9000>] ? hid_init+0x0/0xd1 [usbhid]
[<ffffffffa01c9093>] hid_init+0x93/0xd1 [usbhid]
[<ffffffff81002149>] do_one_initcall+0x39/0x1a0
[<ffffffff8108cfdb>] sys_init_module+0xbb/0x200
[<ffffffff81009e82>] system_call_fastpath+0x16/0x1b

Full dmesg output is here:

http://www.3111skyline.com/dl/Archlinux/bugs/dmesg20100927.txt
Comment by David C. Rankin (drankinatty) - Tuesday, 28 September 2010, 23:45 GMT
Whatever the problem was, it looks fixed in kernel26 2.6.35.6-2. Oh well, there are just somethings in the world you just can't explain :p
Comment by David C. Rankin (drankinatty) - Tuesday, 28 September 2010, 23:47 GMT
It would be nice to get at least some explanation of what was broken in kernel26 2.6.35.5-1 and fixed in kernel26 2.6.35.6-2 so I would at least know where to look if we suffer a regression in the future...

Loading...