DIMM Replacement Guidelines
Replace a DIMM when any of the following occurs:
- The DIMM fails memory testing under BIOS due to Uncorrectable Memory Errors (UCEs).
- UCEs occur and investigation shows that the errors originated from memory.
- A single DIMM logs more than 24 Correctable Errors (CEs) within 24 hours, and no other DIMM is showing CEs.
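The CE criterion above can be expressed as a small check. The following is a minimal sketch, not an official tool: the function name, the DIMM labels, and the input format (a mapping of DIMM label to CE count over the last 24 hours) are all hypothetical, and it assumes that "no other DIMM is showing CEs" means every other DIMM logged zero CEs in the same window.

```python
from typing import Dict, Optional

# From the guideline: more than 24 CEs in 24 hours from a single DIMM.
CE_THRESHOLD = 24


def dimm_to_replace(ce_counts_24h: Dict[str, int],
                    threshold: int = CE_THRESHOLD) -> Optional[str]:
    """Return the label of the DIMM that meets the CE criterion, or None.

    ce_counts_24h maps a DIMM label (hypothetical, e.g. "DIMMA1") to the
    number of correctable errors it logged during the last 24 hours.
    """
    # DIMMs that logged at least one CE in the window.
    noisy = [label for label, count in ce_counts_24h.items() if count > 0]

    # Assumption: the rule applies only when exactly one DIMM reports CEs.
    if len(noisy) != 1:
        return None

    label = noisy[0]
    # "More than 24" is a strict comparison, so exactly 24 does not qualify.
    return label if ce_counts_24h[label] > threshold else None
```

For example, `dimm_to_replace({"DIMMA1": 30, "DIMMB1": 0})` returns `"DIMMA1"`, while a result of `None` means the CE criterion alone does not call for a replacement. CEs spread across several DIMMs may point to a cause other than a single bad module, so that case is left for manual investigation.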
Locating a faulty DIMM:
Use the following Help Center article for help with DIMM identification and replacement:
https://siliconmechanics.zendesk.com/hc/en-us/articles/208891966-Identify-Bad-DIMM-from-EDAC
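As a quick first look before working through the article, the per-DIMM correctable error counters that the Linux EDAC driver exposes in sysfs can be listed with a short script. This is a rough sketch under several assumptions: an EDAC driver is loaded, the kernel provides the per-DIMM layout (`mc*/dimm*/dimm_ce_count` and `dimm_label`), and the function name is hypothetical. The counters are cumulative since the driver was loaded or the counters were last reset, not a 24-hour total.

```python
from pathlib import Path
from typing import Dict

# Standard sysfs location for EDAC memory controllers on Linux.
EDAC_MC_ROOT = Path("/sys/devices/system/edac/mc")


def read_dimm_ce_counts(root: Path = EDAC_MC_ROOT) -> Dict[str, int]:
    """Map each DIMM found under `root` to its correctable error count.

    Keys look like "mc0/dimm3" with the DIMM label appended in parentheses
    when the driver or platform has populated one.
    """
    counts: Dict[str, int] = {}
    for dimm_dir in sorted(root.glob("mc*/dimm*")):
        count_file = dimm_dir / "dimm_ce_count"
        if not count_file.is_file():
            continue  # layout not provided by this kernel or driver

        label_file = dimm_dir / "dimm_label"
        label = label_file.read_text().strip() if label_file.is_file() else ""

        key = f"{dimm_dir.parent.name}/{dimm_dir.name}"
        if label:
            key = f"{key} ({label})"
        counts[key] = int(count_file.read_text().strip())
    return counts
```

Calling `read_dimm_ce_counts()` on a system without EDAC support returns an empty dictionary. Labels depend on the motherboard and driver and may not match the silkscreen on the board, so confirm the physical slot using the article above before pulling a module.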