As far as I understand, if the requested data is not found in DRAM (the DCPMM cache), the next step is to try to find it in DCPMM. That means in this case the total time to find the data will be the sum of the DRAM latency and the DCPMM latency. And if the requested data is not in DCPMM either, it must be read from disk; will it then be written to DRAM or to DCPMM? Can we read data directly into the CPU L1 cache, bypassing the L3/L2 CPU caches?
Anton
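(A minimal sketch of that lookup model in Python; the latencies below are made-up placeholders for illustration, not measured values for any real platform.)

# Toy model of a Memory Mode read, where DRAM acts as a cache in
# front of DCPMM. All latencies are illustrative placeholders.
DRAM_NS = 80        # assumed DRAM access latency, ns
DCPMM_NS = 300      # assumed DCPMM read latency, ns

def read_latency_ns(hit_in_dram: bool) -> int:
    """Time to return data to the CPU under the reading above."""
    if hit_in_dram:
        return DRAM_NS               # served from the DRAM cache
    # Miss: the DRAM cache is probed first, then DCPMM is read,
    # so both latencies are paid in sequence.
    return DRAM_NS + DCPMM_NS

print(read_latency_ns(True))    # 80
print(read_latency_ns(False))   # 380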
linux-4185:~ # numactl --cpunodebind=1 --membind=1 fio --filename=/dev/sda --rw=read --ioengine=sync --bs=128k --iodepth=1 --numjobs=1 --runtime=60 --group_reporting --name=perf_test
perf_test: (g=0): rw=read, bs=(R) 128KiB-128KiB, (W) 128KiB-128KiB, (T) 128KiB-128KiB, ioengine=sync, iodepth=1
fio-3.13-27-gef32d
Starting 1 process
Jobs: 1 (f=1): [R(1)][100.0%][r=2121MiB/s][r=16.0k IOPS][eta 00m:00s]
perf_test: (groupid=0, jobs=1): err= 0: pid=3315: Tue Apr 9 18:49:58 2019
read: IOPS=14.2k, BW=1777MiB/s (1863MB/s)(104GiB/60001msec)
clat (usec): min=22, max=2939, avg=70.07, stdev=122.08
lat (usec): min=22, max=2939, avg=70.10, stdev=122.08
So each time, due to the double miss, it reads data from DISK into DCPMM and then writes it to DRAM. That means the total latency should equal DISK latency + (2 x DRAM latency) + (2 x DCPMM latency). That's what I was asking about.
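Spelling that sum out (a rough Python sketch; all numbers are placeholders, since the real DRAM/DCPMM/disk latencies depend on the platform):

# Anton's double-miss accounting, with placeholder latencies in us.
DRAM_US = 0.08      # assumed DRAM access
DCPMM_US = 0.30     # assumed DCPMM access
DISK_US = 60.00     # assumed block-device read

# Miss in the DRAM cache, miss in DCPMM, read from disk, then the
# data is written back through DCPMM and DRAM on the fill path.
total_us = DISK_US + 2 * DRAM_US + 2 * DCPMM_US
print(round(total_us, 2))       # 60.76 with these assumptions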
|---------------------------------------||---------------------------------------|
|--             Socket 0              --||--             Socket 1              --|
|---------------------------------------||---------------------------------------|
|--     Memory Channel Monitoring     --||--     Memory Channel Monitoring     --|
|---------------------------------------||---------------------------------------|
|-- Mem Ch 0: Reads (MB/s):      4.91 --||-- Mem Ch 0: Reads (MB/s):    754.15 --|
|--           Writes(MB/s):      6.16 --||--           Writes(MB/s):   1128.36 --|
|--       PMM Reads(MB/s) :      0.00 --||--       PMM Reads(MB/s) :    371.25 --|
|--      PMM Writes(MB/s) :      0.00 --||--      PMM Writes(MB/s) :      0.00 --|
|-- Mem Ch 1: Reads (MB/s):      2.57 --||-- Mem Ch 1: Reads (MB/s):    753.91 --|
|--           Writes(MB/s):      3.12 --||--           Writes(MB/s):   1127.98 --|
|--       PMM Reads(MB/s) :      0.00 --||--       PMM Reads(MB/s) :    371.25 --|
|--      PMM Writes(MB/s) :      0.00 --||--      PMM Writes(MB/s) :      0.00 --|
|-- Mem Ch 2: Reads (MB/s):      2.28 --||-- Mem Ch 2: Reads (MB/s):    754.09 --|
|--           Writes(MB/s):      1.89 --||--           Writes(MB/s):   1128.32 --|
|--       PMM Reads(MB/s) :      0.00 --||--       PMM Reads(MB/s) :    371.25 --|
|--      PMM Writes(MB/s) :      0.00 --||--      PMM Writes(MB/s) :      0.00 --|
|-- Mem Ch 3: Reads (MB/s):      1.66 --||-- Mem Ch 3: Reads (MB/s):    754.86 --|
|--           Writes(MB/s):      1.81 --||--           Writes(MB/s):   1130.06 --|
|--       PMM Reads(MB/s) :      0.00 --||--       PMM Reads(MB/s) :    371.25 --|
|--      PMM Writes(MB/s) :      0.00 --||--      PMM Writes(MB/s) :      0.00 --|
|-- Mem Ch 4: Reads (MB/s):      2.17 --||-- Mem Ch 4: Reads (MB/s):    755.66 --|
|--           Writes(MB/s):      2.96 --||--           Writes(MB/s):   1131.09 --|
|--       PMM Reads(MB/s) :      0.00 --||--       PMM Reads(MB/s) :    371.25 --|
|--      PMM Writes(MB/s) :      0.00 --||--      PMM Writes(MB/s) :      0.00 --|
|-- Mem Ch 5: Reads (MB/s):      1.75 --||-- Mem Ch 5: Reads (MB/s):    756.39 --|
|--           Writes(MB/s):      2.17 --||--           Writes(MB/s):   1132.16 --|
|--       PMM Reads(MB/s) :      0.00 --||--       PMM Reads(MB/s) :    371.25 --|
|--      PMM Writes(MB/s) :      0.00 --||--      PMM Writes(MB/s) :      0.00 --|
|-- NODE 0 Mem Read (MB/s) :    15.34 --||-- NODE 1 Mem Read (MB/s) :  4529.07 --|
|-- NODE 0 Mem Write(MB/s) :    18.10 --||-- NODE 1 Mem Write(MB/s) :  6777.98 --|
|-- NODE 0 PMM Read (MB/s):      0.00 --||-- NODE 1 PMM Read (MB/s):   2227.51 --|
|-- NODE 0 PMM Write(MB/s):      0.00 --||-- NODE 1 PMM Write(MB/s):      0.00 --|
|-- NODE 0.0 NM read hit rate :  0.82 --||-- NODE 1.0 NM read hit rate :  0.51 --|
|-- NODE 0.1 NM read hit rate :  0.71 --||-- NODE 1.1 NM read hit rate :  0.51 --|
|-- NODE 0 Memory (MB/s):       33.45 --||-- NODE 1 Memory (MB/s):    13534.56 --|
|---------------------------------------||---------------------------------------|
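As a sanity check on the monitor output, the per-channel rates do add up to the node totals (quick Python, with the Socket 1 numbers typed in from the table above):

# Socket 1 per-channel rates (MB/s), copied from the monitor output.
reads = [754.15, 753.91, 754.09, 754.86, 755.66, 756.39]
writes = [1128.36, 1127.98, 1128.32, 1130.06, 1131.09, 1132.16]
pmm_reads = [371.25] * 6

print(round(sum(reads), 2))      # 4529.06  vs. reported NODE 1 Mem Read  4529.07
print(round(sum(writes), 2))     # 6777.97  vs. reported NODE 1 Mem Write 6777.98
print(round(sum(pmm_reads), 2))  # 2227.5   vs. reported NODE 1 PMM Read  2227.51

Note also that NODE 1 DRAM writes (~6778 MB/s) are roughly DRAM reads plus PMM reads (4529 + 2228 = 6757 MB/s), consistent with misses being filled from DCPMM into the DRAM cache.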
Hi Otto,
I have been working on pmem solutions since they appeared on the market three years ago, such as NVDIMM-N, Scalable Persistent Memory, etc. Yes, I understand that pmem is something completely new. But I'm talking, for example, about the simplest workloads, such as non-DAX 4k random reads/writes. Publishing incorrect results will just confuse people.
Anton
--
Amnon Izhar
Hi Andy,
I have a few questions:
"In the current Intel® Optane™ DC persistent memory product, there is only one DRAM access on a cache read hit because the data & tags are arranged so that they can be fetched together via a single DDR transaction."
1. The above answer to Amnon's question implies that the cache tags are indeed stored in the DRAM; am I right?
2. Can you please help me understand how it is possible to fetch both data and tags together via a single DDR transaction? AFAIK, every DDR transaction fetches 64 bytes (BL=8), i.e. one complete cache line. How do you stuff extra bits in there to include the cache tags? I can only think of one way: use x72 ECC DRAM DIMMs and repurpose the ECC bits as cache tags. In that case, each 8-cycle burst gives you an extra 64 bits for use as cache tags. However, you lose ECC capability. Am I right? (Some back-of-the-envelope arithmetic on questions 2 and 4 follows below.)
"Memory mode uses Optane DC to expand main memory capacity without persistence. It combines a Optane DCPMM with a conventional DRAM DIMM that serves as a direct-mapped cache for the Optane DC PMM. The cacheblock size is 4 KB, and the CPU’s memory controller manages the cache transparently."
3. The above quote appears on page 5 of the NVSL paper (https://arxiv.org/pdf/1903.05714.pdf). Since the CPU's L1/L2/L3 caches all have a block size of 64 bytes, I infer that on a read miss, 4KB (one block) is fetched from Optane and placed in the DRAM cache only. In other words, it is more like a transparent memory-side cache than a traditional L4 cache that also participates in the processor's cache-coherency protocols. Am I right?
4. Won't fetching a large 4KB block and placing it in DRAM on every read miss hog the memory channel, considering that Optane reads are much slower than DRAM? And, upon eventual eviction from the DRAM cache, won't it take a very long time to write 4KB blocks back to Optane, since writes are even slower than reads?
5. And if the program does not take advantage of spatial locality, won't all that prefetched data go to waste?
Thanks!
KH
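Some back-of-the-envelope arithmetic behind questions 2 and 4 (a Python sketch; the module sizes and Optane bandwidth are purely hypothetical inputs, and the ECC-repurposing idea is KH's hypothesis, not a confirmed layout):

import math

# Q2: tag bits needed by a direct-mapped cache with 4 KB blocks.
# Capacities are hypothetical; real module sizes vary.
dcpmm_bytes = 512 << 30    # 512 GiB of DCPMM behind the cache
dram_bytes = 64 << 30      # 64 GiB of DRAM acting as the cache
block = 4096               # 4 KB cache block (per the NVSL paper)

sets = dram_bytes // block
tag_bits = math.ceil(math.log2(dcpmm_bytes // block)) \
         - math.ceil(math.log2(sets))
print(tag_bits)            # 3 tag bits per block at these sizes

# A x72 ECC DIMM carries 8 extra bits per 64-bit beat, i.e. 64 spare
# bits per 64-byte burst, so a few tag bits would fit easily; whether
# Intel actually does this is not public (see Andy's reply below).

# Q4: time a 4 KB cache fill occupies the Optane DIMM, at an assumed
# sequential read bandwidth.
optane_read_gbs = 6.0      # hypothetical GB/s
fill_us = block / (optane_read_gbs * 1e9) * 1e6
print(round(fill_us, 2))   # 0.68 us per 4 KB fill with this assumption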
Thought I would share some basic price research, thinking of this availability question.
The best summary I've got is from (as usual) AnandTech, here:
https://www.anandtech.com/show/14146/intel-xeon-scalable-cascade-lake-deep-dive-now-with-optane
which shows Intel has moved high-density memory (DRAM and Optane) into upcharges in the Xeon line.
The key sentences:
    Out of the new letter configurations, users will notice that the no-letter designation now has support for 1.5 TB of memory, double the first generation. This goes up to 2 TB for M, and 4.5 TB for L. These values include parts that are fitted with Optane, and as such an 'L' CPU can support 3.0 TB of Optane plus 1.5 TB of DDR4 memory, to give that 4.5 TB total.
So if you want 3 TB of Optane, you have to make sure you get an L part. Anand doesn't say how much a base-level CPU can handle, whether anything that CAN do Optane can do 1.5 TB, or whether there are different values (like 0.5 TB for a base part). Still looking for that info.
Further down in the list, you see the price schedule, and L-rated parts carry a serious premium. The cheapest is the 5215L, which Anand shows as a $9119 part, compared to the base models, which are in the $1500 to $2k range, and the M, which is $4k.
This also shows that there are some Cascade Lakes without Optane DIMM support, but it's mostly the "Silver" and "Bronze" parts; Gold and better should do it.
And there is some leaked pricing (Anand again; I can't say whether it is correct or not, because Intel never told me):
https://www.anandtech.com/show/14180/pricing-of-intels-optane-dc-persistent-memory-modules-leaks
Regarding doing your own build to build a database: good luck! When I built what became Aerospike (which is an open-source database that supports PMEM), I had to spend about $25k in 2008, out of my own pocket, on a server substantial enough to prove 100x faster than Oracle. I hope you wouldn't expect starting a new database to be cheap.
-brian
First, I'll reiterate that a memory-side cache hit means exactly one fetch from DDR. The details of where all the required information lives are not public, but the cache lines do still have ECC protection. I'll point out that there are many ways to do this, analogous to how directory information is stored for each cache line on various CPU architectures; the details of Intel's exact layout are just not currently public.
Your characterization of the cache as a "transparent memory-side cache" is exactly right.
No, I am not sure; I am quoting AnandTech, which may be wrong. I look forward to better information.
The problem with the ARK statements is that it's unclear whether "memory" means DRAM or Optane + DRAM. The Anand article (which already seems faulty) appears to imply otherwise.
The ARK page says "Max Memory Size (dependent on memory type)". Are the types DRAM and Optane, and does the max depend on whether it is Optane?
Reading the ARK statement itself, it says literally: "Intel® Optane™ DC persistent memory is a revolutionary tier of non-volatile memory that sits between memory and storage to provide large, affordable memory capacity that is comparable to DRAM performance."
That sentence is wrong on its face: it doesn't sit between anything and anything, it's right on the bus. And it can't be memory that sits between memory and storage; that's just a logically impossible statement.
It would be nice if ARK could say whether "Max Memory Size" includes Optane or not.
I look forward to less confusion ....
-brian