Supermicro X9 and X10 series motherboards
Occasionally you may receive events in the IPMI system event log after a memory failure that do not point to a particular faulted component. The output of the errors can vary depending on how you're accessing the event log, and ipmitool in particular has been known to encounter bugs occasionally when attempting to decode Supermicro's event logs.
Here are some examples of such vague event log entries:
11 | 10/10/2000 | 12:00:00 | Memory | Correctable ECC | Asserted | CPU 0 DIMM 8
12 | 10/10/2000 | 12:00:00 | Memory | Uncorrectable ECC | Asserted
In the examples above, the first indicates a corrected ECC error on CPU 0 DIMM 8, which isn't a slot number on the motherboard. In the second example, the CPU/DIMM locator is missing entirely. Obviously there is a hardware problem, but you need a slot number that looks like P1-DIMMA1, or P2 DIMM1B, as these follow the naming convention used by the motherboard slots.
Luckily, most motherboards in the X9 and X10 series will have an additional event log stored in the BIOS, which tends to give more concise error reporting. However, this log is only accessible from the BIOS, so you will need to reboot your system in order to view it.
- Reboot your system and hit <Delete> during POST to enter the BIOS configuration.
- Navigate to Event Log > View Smbios Event Log, and hit <Enter> to open the event log viewer.
- Look for errors that give slot numbers, and use these to correlate to faulty RAM. If your event log does not show slot numbers, or if the errors do not immediately indicate a RAM fault, then contact Support for further instructions.
Supermicro Update Manager (SUM)
Introduction The Supermicro Update Manager (SUM) can be used to manage the BIOS and BMC firmware image update and configuration update for select systems. In addition, system checks as well as event log management are also supported. Moreover, ...
Introduction to IPMI
Q: What is IPMI? A: IPMI stands for Intelligent Platform Management Interface. It is in essence a web server that runs internally on your motherboard, powered by a separate ARM-based chip, also known as the baseboard management controller (BMC). The ...
RAM models and serials
Q: I have a failed DIMM, but I do not know the model and serial. How do I obtain this information? A: This information is written into DMI into the motherboard. To access it, simply use dmidecode, available on most UNIX and Linux operating systems by ...
Checking IP address configuration
Q: How can I check the IP address that IPMI is using? A: The motherboard will list the IP address in the BIOS, usually under IPMI > Set LAN Configuration. By default we leave DHCP turned off and set 0.0.0.0 as the static IP before shipping, for ...
BIOS recovery procedure
Prerequisites: Failed motherboard with F9 POST code Latest BIOS release for your motherboard A FreeDOS / MS-DOS formatted USB drive (see our FreeDOS USB creation guide here) Instructions: If you do not already have a copy of the latest BIOS, head to ...