Hi,
The Jetson TX1 / TX2 share the same physical RAM between the CPU and GPU, which should ideally reduce data transfer time between the two. However, it also seems that memory accessed this way (zero-copy) is uncached, which could hurt performance in some applications, for example those that read the same data repeatedly.
I would like to know whether ArrayFire uses the zero-copy feature available on the TX1 / TX2, given its pros and cons, or whether something more intelligent is happening under the hood (perhaps depending on memory-access patterns).
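To be concrete about what I mean by zero-copy, here is a minimal sketch using the plain CUDA runtime API (not ArrayFire, and not a claim about how ArrayFire works internally). The kernel `scale`, the helper `zero_copy_demo`, and the buffer size are just names and values I made up for illustration. The host buffer is allocated as pinned, mapped memory and the kernel works on it directly, with no `cudaMemcpy`:

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Illustrative kernel (hypothetical name): doubles each element in place.
__global__ void scale(float *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= 2.0f;
}

// Hypothetical helper: returns true if the GPU doubled every element
// of a host buffer that was never explicitly copied to the device.
bool zero_copy_demo(int n) {
    // Request that pinned host allocations can be mapped into the
    // device address space. The return value is ignored on purpose,
    // since this fails harmlessly if a context is already active.
    cudaSetDeviceFlags(cudaDeviceMapHost);

    // Pinned host allocation that is also mapped for device access.
    float *h_ptr = nullptr;
    if (cudaHostAlloc((void **)&h_ptr, n * sizeof(float),
                      cudaHostAllocMapped) != cudaSuccess) {
        return false;
    }
    for (int i = 0; i < n; ++i) h_ptr[i] = 1.0f;

    // Device-side pointer that aliases the same physical memory.
    float *d_ptr = nullptr;
    if (cudaHostGetDevicePointer((void **)&d_ptr, h_ptr, 0) != cudaSuccess) {
        cudaFreeHost(h_ptr);
        return false;
    }

    // The kernel reads and writes the host buffer directly.
    scale<<<(n + 255) / 256, 256>>>(d_ptr, n);
    bool ok = (cudaDeviceSynchronize() == cudaSuccess);

    // Check the result on the CPU side, again without any copy.
    for (int i = 0; ok && i < n; ++i) ok = (h_ptr[i] == 2.0f);

    cudaFreeHost(h_ptr);
    return ok;
}

int main() {
    bool ok = zero_copy_demo(1 << 20);
    printf("zero-copy demo %s\n", ok ? "succeeded" : "failed");
    return ok ? 0 : 1;
}
```

This should build with something like `nvcc zero_copy.cu -o zero_copy` (the file name is arbitrary). The alternative I am comparing against is the usual `cudaMalloc` plus `cudaMemcpy` path, which on the TX1 / TX2 would copy within the same physical RAM.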
FYI, I am on ArrayFire v3.4.2.
Thanks,
Shobhit