Hi all! we want to implement DMA between GPU and PMEM (to be more specific, we want to enable GPU to directly access PMEM). We have tried to use CUDA API to pin the page of mmap-ed PMEM on fsdax so that we can utilize the CUDA Copy engine, but the results make me think this method doesn't help. There are some discussions suggesting using devdax instead of fsdax for RDMA, I am wondering what is the reason behind it.
If anyone ever has experience with (R)DMA over PMEM, could you share some useful articles/papers to read, or any insights about (R)DMA over PMEM? I would really appreciate any help or insight.
Thanks a lot in advance!
Cheers,
Xindi