Hi all
I have 2 x 8280L sockets box with 768 GB DRAM and 6144 GB PMEM running under RHEL 8.6 with latest available ml kernel 5.19.11-1.el8.elrepo.x86_64
When I setting up tiered memory, I got next warning
[root@memverge ~]# daxctl reconfigure-device --mode=system-ram all
dax1.0:
WARNING: detected a race while onlining memory
Some memory may not be in the expected zone. It is
recommended to disable any other onlining mechanisms,
and retry. If onlining is to be left to other agents,
use the --no-online option to suppress this warning
dax1.0: 2 memory sections already online
dax0.0:
WARNING: detected a race while onlining memory
Some memory may not be in the expected zone. It is
recommended to disable any other onlining mechanisms,
and retry. If onlining is to be left to other agents,
use the --no-online option to suppress this warning
dax0.0: 1 memory section already online
[root@memverge ~]# cat /sys/devices/system/memory/auto_online_blocks
offline
[root@memverge ~]#
Optance system-ram devices are going to existing NUMA nodes and don't create additional CPU-free NUMA nodes.
[root@memverge ~]# daxctl list
[
{
"chardev":"dax1.0",
"size":3183575302144,
"target_node":1,
"align":2097152,
"mode":"system-ram",
"movable":true
},
{
"chardev":"dax0.0",
"size":3183575302144,
"target_node":0,
"align":2097152,
"mode":"system-ram",
"movable":true
}
]
[root@memverge ~]#
[root@memverge ~]# numactl -H
available: 2 nodes (0-1)
node 0 cpus: 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27
node 0 size: 3419704 MB
node 0 free: 3139765 MB
node 1 cpus: 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55
node 1 size: 3420152 MB
node 1 free: 3141870 MB
node distances:
node 0 1
0: 10 21
1: 21 10
[root@memverge ~]# lsmem
RANGE SIZE STATE REMOVABLE BLOCK
0x0000000000000000-0x000000607fffffff 386G online yes 0-192
0x0000006c80000000-0x000003b17fffffff 3.3T online yes 217-1890
0x000003bd80000000-0x000006a1ffffffff 2.9T online yes 1915-3395
Memory block size: 2G
Total online memory: 6.6T
Total offline memory: 0B
[root@memverge ~]# free -h
total used free shared buff/cache available
Mem: 6.5Ti 544Gi 6.0Ti 13Mi 711Mi 5.8Ti
Swap: 4.0Gi 0B 4.0Gi
[root@memverge ~]#
Any ideas why Optane system-ram devices are going to existing NUMA nodes and don't create their own CPU-free NUMA nodes ?
Anton