I have identified an issue with memory speeds and latency on my server. I am renting this server from OVH and one of their requirements before getting the RAM replaced is
To launch an intervention, you need to send logs in your ticket showing the identifiant and the affected RAM module.
How am I able to detect the DRAM chip which is faulty without running memtest86+ for days as this is a large (1TB RAM) production server.
sysbench --test=memory --memory-block-size=4G --memory-total-size=32G run
WARNING: the --test option is deprecated. You can pass a script name or path on the command line without any options.
sysbench 1.0.18 (using system LuaJIT 2.1.0-beta3)
Running the test with following options:
Number of threads: 1
Initializing random number generator from current time
Running memory speed test with the following options:
block size: 4194304KiB
total size: 32768MiB
operation: write
scope: global
Initializing worker threads...
Threads started!
Total operations: 2 ( 0.15 per second)
8192.00 MiB transferred (630.16 MiB/sec)
General statistics:
total time: 12.9937s
total number of events: 2
Latency (ms):
min: 6338.94
avg: 6496.29
max: 6653.64
95th percentile: 6594.16
sum: 12992.58
Threads fairness:
events (avg/stddev): 2.0000/0.00
execution time (avg/stddev): 12.9926/0.00
sysbench --test=memory --memory-block-size=1K --memory-total-size=100G --num-threads=1 run
WARNING: the --test option is deprecated. You can pass a script name or path on the command line without any options.
sysbench 1.0.18 (using system LuaJIT 2.1.0-beta3)
Running the test with following options:
Number of threads: 1
Initializing random number generator from current time
Running memory speed test with the following options:
block size: 1KiB
total size: 102400MiB
operation: write
scope: global
Initializing worker threads...
Threads started!
Total operations: 48693603 (4868132.10 per second)
47552.35 MiB transferred (4754.04 MiB/sec)
General statistics:
total time: 10.0002s
total number of events: 48693603
Latency (ms):
min: 0.00
avg: 0.00
max: 0.47
95th percentile: 0.00
sum: 4155.88
Threads fairness:
events (avg/stddev): 48693603.0000/0.00
execution time (avg/stddev): 4.1559/0.00
sudo lshw -short -C memory
H/W path Device Class Description
==========================================================
/0/0 memory 64KiB BIOS
/0/20 memory 1TiB System Memory
/0/20/0 memory [empty]
/0/20/1 memory 128GiB DIMM DDR4 Synchronous LRDIMM 2933 MHz (0.3 ns)
/0/20/2 memory [empty]
/0/20/3 memory 128GiB DIMM DDR4 Synchronous LRDIMM 2933 MHz (0.3 ns)
/0/20/4 memory [empty]
/0/20/5 memory 128GiB DIMM DDR4 Synchronous LRDIMM 2933 MHz (0.3 ns)
/0/20/6 memory [empty]
/0/20/7 memory 128GiB DIMM DDR4 Synchronous LRDIMM 2933 MHz (0.3 ns)
/0/20/8 memory [empty]
/0/20/9 memory 128GiB DIMM DDR4 Synchronous LRDIMM 2933 MHz (0.3 ns)
/0/20/a memory [empty]
/0/20/b memory 128GiB DIMM DDR4 Synchronous LRDIMM 2933 MHz (0.3 ns)
/0/20/c memory [empty]
/0/20/d memory 128GiB DIMM DDR4 Synchronous LRDIMM 2933 MHz (0.3 ns)
/0/20/e memory [empty]
/0/20/f memory 128GiB DIMM DDR4 Synchronous LRDIMM 2933 MHz (0.3 ns)
/0/23 memory 3MiB L1 cache
/0/24 memory 24MiB L2 cache
/0/25 memory 256MiB L3 cache