Arch Linux


FS#55106 - EDAC amd64: Error: Error probing instance: 0 after upgrade to 4.12.4

Attached to Project: Arch Linux
Opened by Richard PALO (risto3) - Friday, 11 August 2017, 09:46 GMT
Last edited by Doug Newgard (Scimmia) - Thursday, 24 August 2017, 12:35 GMT
Task Type: Bug Report
Category: Kernel
Status: Closed
Assigned To: No-one
Architecture: All
Severity: Low
Priority: Normal
Reported Version:
Due in Version: Undecided
Due Date: Undecided
Percent Complete: 100%
Votes: 0
Private: No

Details

Description:

Since upgrading to 4.12.4 this morning, I'm seeing the above error repeated during boot.
Filtering dmesg for EDAC gives the following:

richard@sarchx64:/home/richard$ dmesg |grep EDAC
[ 1.400411] EDAC MC: Ver: 3.0.0
[ 2.762843] ghes_edac: This EDAC driver relies on BIOS to enumerate memory and get error reports.
[ 2.763017] EDAC MC0: Giving out device to module ghes_edac.c controller ghes_edac: DEV ghes (INTERRUPT)
[ 2.763067] EDAC MC1: Giving out device to module ghes_edac.c controller ghes_edac: DEV ghes (INTERRUPT)
[ 18.385923] EDAC amd64: Node 0: DRAM ECC enabled.
[ 18.385926] EDAC amd64: F15h detected (node 0).
[ 18.385996] EDAC MC: DCT0 chip selects:
[ 18.385998] EDAC amd64: MC: 0: 0MB 1: 0MB
[ 18.386000] EDAC amd64: MC: 2: 4096MB 3: 4096MB
[ 18.386002] EDAC amd64: MC: 4: 0MB 5: 0MB
[ 18.386003] EDAC amd64: MC: 6: 0MB 7: 0MB
[ 18.386004] EDAC MC: DCT1 chip selects:
[ 18.386006] EDAC amd64: MC: 0: 0MB 1: 0MB
[ 18.386007] EDAC amd64: MC: 2: 4096MB 3: 4096MB
[ 18.386009] EDAC amd64: MC: 4: 0MB 5: 0MB
[ 18.386010] EDAC amd64: MC: 6: 0MB 7: 0MB
[ 18.386011] EDAC amd64: using x8 syndromes.
[ 18.386012] EDAC amd64: MCT channel count: 2
[ 18.386055] EDAC amd64: Error: Error probing instance: 0
[ 18.441569] EDAC amd64: Node 0: DRAM ECC enabled.
[ 18.441573] EDAC amd64: F15h detected (node 0).
[ 18.441641] EDAC MC: DCT0 chip selects:
[ 18.441643] EDAC amd64: MC: 0: 0MB 1: 0MB
[ 18.441646] EDAC amd64: MC: 2: 4096MB 3: 4096MB
[ 18.441647] EDAC amd64: MC: 4: 0MB 5: 0MB
[ 18.441649] EDAC amd64: MC: 6: 0MB 7: 0MB
[ 18.441650] EDAC MC: DCT1 chip selects:
[ 18.441651] EDAC amd64: MC: 0: 0MB 1: 0MB
[ 18.441653] EDAC amd64: MC: 2: 4096MB 3: 4096MB
[ 18.441654] EDAC amd64: MC: 4: 0MB 5: 0MB
[ 18.441656] EDAC amd64: MC: 6: 0MB 7: 0MB
[ 18.441657] EDAC amd64: using x8 syndromes.
[ 18.441658] EDAC amd64: MCT channel count: 2
[ 18.441694] EDAC amd64: Error: Error probing instance: 0
[ 18.501576] EDAC amd64: Node 0: DRAM ECC enabled.
[ 18.501581] EDAC amd64: F15h detected (node 0).
[ 18.501649] EDAC MC: DCT0 chip selects:
[ 18.501651] EDAC amd64: MC: 0: 0MB 1: 0MB
[ 18.501653] EDAC amd64: MC: 2: 4096MB 3: 4096MB
[ 18.501655] EDAC amd64: MC: 4: 0MB 5: 0MB
[ 18.501656] EDAC amd64: MC: 6: 0MB 7: 0MB
[ 18.501658] EDAC MC: DCT1 chip selects:
[ 18.501659] EDAC amd64: MC: 0: 0MB 1: 0MB
[ 18.501661] EDAC amd64: MC: 2: 4096MB 3: 4096MB
[ 18.501662] EDAC amd64: MC: 4: 0MB 5: 0MB
[ 18.501663] EDAC amd64: MC: 6: 0MB 7: 0MB
[ 18.501665] EDAC amd64: using x8 syndromes.
[ 18.501666] EDAC amd64: MCT channel count: 2
[ 18.501705] EDAC amd64: Error: Error probing instance: 0
[ 18.581374] EDAC amd64: Node 0: DRAM ECC enabled.
[ 18.581378] EDAC amd64: F15h detected (node 0).
[ 18.581448] EDAC MC: DCT0 chip selects:
[ 18.581450] EDAC amd64: MC: 0: 0MB 1: 0MB
[ 18.581453] EDAC amd64: MC: 2: 4096MB 3: 4096MB
[ 18.581455] EDAC amd64: MC: 4: 0MB 5: 0MB
[ 18.581457] EDAC amd64: MC: 6: 0MB 7: 0MB
[ 18.581458] EDAC MC: DCT1 chip selects:
[ 18.581460] EDAC amd64: MC: 0: 0MB 1: 0MB
[ 18.581462] EDAC amd64: MC: 2: 4096MB 3: 4096MB
[ 18.581464] EDAC amd64: MC: 4: 0MB 5: 0MB
[ 18.581466] EDAC amd64: MC: 6: 0MB 7: 0MB
[ 18.581467] EDAC amd64: using x8 syndromes.
[ 18.581469] EDAC amd64: MCT channel count: 2
[ 18.581518] EDAC amd64: Error: Error probing instance: 0
[ 18.648046] EDAC amd64: Node 0: DRAM ECC enabled.
[ 18.648049] EDAC amd64: F15h detected (node 0).
[ 18.648113] EDAC MC: DCT0 chip selects:
[ 18.648115] EDAC amd64: MC: 0: 0MB 1: 0MB
[ 18.648117] EDAC amd64: MC: 2: 4096MB 3: 4096MB
[ 18.648119] EDAC amd64: MC: 4: 0MB 5: 0MB
[ 18.648120] EDAC amd64: MC: 6: 0MB 7: 0MB
[ 18.648122] EDAC MC: DCT1 chip selects:
[ 18.648123] EDAC amd64: MC: 0: 0MB 1: 0MB
[ 18.648125] EDAC amd64: MC: 2: 4096MB 3: 4096MB
[ 18.648126] EDAC amd64: MC: 4: 0MB 5: 0MB
[ 18.648128] EDAC amd64: MC: 6: 0MB 7: 0MB
[ 18.648129] EDAC amd64: using x8 syndromes.
[ 18.648131] EDAC amd64: MCT channel count: 2
[ 18.648165] EDAC amd64: Error: Error probing instance: 0
[ 18.714648] EDAC amd64: Node 0: DRAM ECC enabled.
[ 18.714652] EDAC amd64: F15h detected (node 0).
[ 18.714719] EDAC MC: DCT0 chip selects:
[ 18.714721] EDAC amd64: MC: 0: 0MB 1: 0MB
[ 18.714724] EDAC amd64: MC: 2: 4096MB 3: 4096MB
[ 18.714726] EDAC amd64: MC: 4: 0MB 5: 0MB
[ 18.714728] EDAC amd64: MC: 6: 0MB 7: 0MB
[ 18.714729] EDAC MC: DCT1 chip selects:
[ 18.714731] EDAC amd64: MC: 0: 0MB 1: 0MB
[ 18.714733] EDAC amd64: MC: 2: 4096MB 3: 4096MB
[ 18.714735] EDAC amd64: MC: 4: 0MB 5: 0MB
[ 18.714737] EDAC amd64: MC: 6: 0MB 7: 0MB
[ 18.714738] EDAC amd64: using x8 syndromes.
[ 18.714740] EDAC amd64: MCT channel count: 2
[ 18.714787] EDAC amd64: Error: Error probing instance: 0

richard@sarchx64:/home/richard$ uname -a
Linux sarchx64 4.12.4-1-ARCH #1 SMP PREEMPT Fri Jul 28 18:54:18 UTC 2017 x86_64 GNU/Linux
richard@sarchx64:/home/richard$ sudo dmidecode -tbaseboard -t bios -t processor -tmemory
# dmidecode 3.1
Getting SMBIOS data from sysfs.
SMBIOS 2.6 present.

Handle 0x0000, DMI type 0, 24 bytes
BIOS Information
Vendor: American Megatrends Inc.
Version: 3.5b
Release Date: 03/18/2016
Address: 0xF0000
Runtime Size: 64 kB
ROM Size: 2048 kB
Characteristics:
ISA is supported
PCI is supported
PNP is supported
BIOS is upgradeable
BIOS shadowing is allowed
ESCD support is available
Boot from CD is supported
Selectable boot is supported
BIOS ROM is socketed
EDD is supported
5.25"/1.2 MB floppy services are supported (int 13h)
3.5"/720 kB floppy services are supported (int 13h)
3.5"/2.88 MB floppy services are supported (int 13h)
Print screen service is supported (int 5h)
8042 keyboard services are supported (int 9h)
Serial services are supported (int 14h)
Printer services are supported (int 17h)
CGA/mono video services are supported (int 10h)
ACPI is supported
USB legacy is supported
LS-120 boot is supported
ATAPI Zip drive boot is supported
BIOS boot specification is supported
Targeted content distribution is supported
BIOS Revision: 8.16

Handle 0x0002, DMI type 2, 15 bytes
Base Board Information
Manufacturer: Supermicro
Product Name: H8SGL
Version: 1234567890
Serial Number: NM144S601234
Asset Tag: To Be Filled By O.E.M.
Features:
Board is a hosting board
Board is replaceable
Location In Chassis: To Be Filled By O.E.M.
Chassis Handle: 0x0003
Type: Motherboard
Contained Object Handles: 0

Handle 0x0004, DMI type 4, 42 bytes
Processor Information
Socket Designation: CPU 1
Type: Central Processor
Family: Opteron 6300
Manufacturer: AMD
ID: 20 0F 60 00 FF FB 8B 17
Signature: Family 21, Model 2, Stepping 0
Flags:
FPU (Floating-point unit on-chip)
VME (Virtual mode extension)
DE (Debugging extension)
PSE (Page size extension)
TSC (Time stamp counter)
MSR (Model specific registers)
PAE (Physical address extension)
MCE (Machine check exception)
CX8 (CMPXCHG8 instruction supported)
APIC (On-chip APIC hardware supported)
SEP (Fast system call)
MTRR (Memory type range registers)
PGE (Page global enable)
MCA (Machine check architecture)
CMOV (Conditional move instruction supported)
PAT (Page attribute table)
PSE-36 (36-bit page size extension)
CLFSH (CLFLUSH instruction supported)
MMX (MMX technology supported)
FXSR (FXSAVE and FXSTOR instructions supported)
SSE (Streaming SIMD extensions)
SSE2 (Streaming SIMD extensions 2)
HTT (Multi-threading)
Version: AMD Opteron(tm) Processor 6338P
Voltage: 1.0 V
External Clock: 200 MHz
Max Speed: 2300 MHz
Current Speed: 2300 MHz
Status: Populated, Enabled
Upgrade: Socket G34
L1 Cache Handle: 0x0005
L2 Cache Handle: 0x0006
L3 Cache Handle: 0x0007
Serial Number: To Be Filled By O.E.M.
Asset Tag: To Be Filled By O.E.M.
Part Number: To Be Filled By O.E.M.
Core Count: 12
Core Enabled: 12
Thread Count: 12
Characteristics:
64-bit capable

Handle 0x000F, DMI type 13, 22 bytes
BIOS Language Information
Language Description Format: Long
Installable Languages: 1
en|US|iso8859-1
Currently Installed Language: en|US|iso8859-1

Handle 0x0011, DMI type 16, 15 bytes
Physical Memory Array
Location: System Board Or Motherboard
Use: System Memory
Error Correction Type: Multi-bit ECC
Maximum Capacity: 64 GB
Error Information Handle: Not Provided
Number Of Devices: 8

Handle 0x0013, DMI type 17, 28 bytes
Memory Device
Array Handle: 0x0011
Error Information Handle: Not Provided
Total Width: Unknown
Data Width: Unknown
Size: No Module Installed
Form Factor: <OUT OF SPEC>
Set: None
Locator: DIMM1B
Bank Locator: BANK0
Type: Unknown
Type Detail: None
Speed: Unknown
Manufacturer: Manufacturer00
Serial Number: SerNum00
Asset Tag: AssetTagNum0
Part Number: ModulePartNumber00
Rank: Unknown

Handle 0x0015, DMI type 17, 28 bytes
Memory Device
Array Handle: 0x0011
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: 8192 MB
Form Factor: DIMM
Set: None
Locator: DIMM1A
Bank Locator: BANK1
Type: DDR3
Type Detail: Synchronous
Speed: 1600 MT/s
Manufacturer: Samsung
Serial Number: 12280B5E
Asset Tag: AssetTagNum1
Part Number: M391B1G73QH0-YK0
Rank: 2

Handle 0x0017, DMI type 17, 28 bytes
Memory Device
Array Handle: 0x0011
Error Information Handle: Not Provided
Total Width: Unknown
Data Width: Unknown
Size: No Module Installed
Form Factor: <OUT OF SPEC>
Set: None
Locator: DIMM2B
Bank Locator: BANK2
Type: Unknown
Type Detail: None
Speed: Unknown
Manufacturer: Manufacturer02
Serial Number: SerNum02
Asset Tag: AssetTagNum2
Part Number: ModulePartNumber02
Rank: Unknown

Handle 0x0019, DMI type 17, 28 bytes
Memory Device
Array Handle: 0x0011
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: 8192 MB
Form Factor: DIMM
Set: None
Locator: DIMM2A
Bank Locator: BANK3
Type: DDR3
Type Detail: Synchronous
Speed: 1600 MT/s
Manufacturer: Samsung
Serial Number: 12280B35
Asset Tag: AssetTagNum3
Part Number: M391B1G73QH0-YK0
Rank: 2

Handle 0x001B, DMI type 17, 28 bytes
Memory Device
Array Handle: 0x0011
Error Information Handle: Not Provided
Total Width: Unknown
Data Width: Unknown
Size: No Module Installed
Form Factor: <OUT OF SPEC>
Set: None
Locator: DIMM3B
Bank Locator: BANK4
Type: Unknown
Type Detail: None
Speed: Unknown
Manufacturer: Manufacturer04
Serial Number: SerNum04
Asset Tag: AssetTagNum4
Part Number: ModulePartNumber04
Rank: Unknown

Handle 0x001D, DMI type 17, 28 bytes
Memory Device
Array Handle: 0x0011
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: 8192 MB
Form Factor: DIMM
Set: None
Locator: DIMM3A
Bank Locator: BANK5
Type: DDR3
Type Detail: Synchronous
Speed: 1600 MT/s
Manufacturer: Samsung
Serial Number: 12280B22
Asset Tag: AssetTagNum5
Part Number: M391B1G73QH0-YK0
Rank: 2

Handle 0x001F, DMI type 17, 28 bytes
Memory Device
Array Handle: 0x0011
Error Information Handle: Not Provided
Total Width: Unknown
Data Width: Unknown
Size: No Module Installed
Form Factor: <OUT OF SPEC>
Set: None
Locator: DIMM4B
Bank Locator: BANK6
Type: Unknown
Type Detail: None
Speed: Unknown
Manufacturer: Manufacturer06
Serial Number: SerNum06
Asset Tag: AssetTagNum6
Part Number: ModulePartNumber06
Rank: Unknown

Handle 0x0021, DMI type 17, 28 bytes
Memory Device
Array Handle: 0x0011
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: 8192 MB
Form Factor: DIMM
Set: None
Locator: DIMM4A
Bank Locator: BANK7
Type: DDR3
Type Detail: Synchronous
Speed: 1600 MT/s
Manufacturer: Samsung
Serial Number: 12280B34
Asset Tag: AssetTagNum7
Part Number: M391B1G73QH0-YK0
Rank: 2

Handle 0x0025, DMI type 41, 11 bytes
Onboard Device
Reference Designation: To Be Filled By O.E.M.
Type: Video
Status: Enabled
Type Instance: 0

Handle 0x0026, DMI type 41, 11 bytes
Onboard Device
Reference Designation: To Be Filled By O.E.M.
Type: SCSI Controller
Status: Disabled
Type Instance: 0


Additional info:
* package version(s)
* config and/or log files etc.


Steps to reproduce:

Closed by Doug Newgard (Scimmia) - Thursday, 24 August 2017, 12:35 GMT
Reason for closing: Not a bug
Comment by Jan de Groot (JGC) - Friday, 11 August 2017, 18:17 GMT
This is log spam from ghes-edac and is probably caused by incomplete BIOS support. Last month there was an upstream discussion about a whitelist for the ghes driver.

It's probably best to add ghes.disable=1 to your kernel command line.
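
A rough sketch of how one might check whether the parameter is already in effect, assuming a POSIX shell. The helper name has_ghes_disable is made up for illustration, and the GRUB hint in the comments assumes GRUB is the boot loader, which the report does not state:

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the given kernel command line
# contains the exact parameter ghes.disable=1.
has_ghes_disable() {
    case " $1 " in
        *" ghes.disable=1 "*) return 0 ;;
        *) return 1 ;;
    esac
}

# /proc/cmdline holds the parameters the running kernel was booted with.
if [ -r /proc/cmdline ]; then
    if has_ghes_disable "$(cat /proc/cmdline)"; then
        echo "ghes.disable=1 is active"
    else
        echo "ghes.disable=1 is not set"
        # Assumption: GRUB is the boot loader. In that case, append the
        # parameter to GRUB_CMDLINE_LINUX_DEFAULT in /etc/default/grub
        # and regenerate the config:
        #   grub-mkconfig -o /boot/grub/grub.cfg
    fi
fi
```

After a reboot with the parameter set, rerunning the dmesg filter from the report should show whether the repeated probe errors are gone.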
