Hi,
I'm using Macsim to simulate CPU+GPU traces. I'm getting a higher number of GPU L1,L2 accesses when I run 1CPU + 1GPU, as compared to 1GPU only. I'm not able to reason why this should be so. In my understanding, CPU-GPU interference shouldn't affect GPU private cache accesses. Can anyone please help me understand this?
I've attached the stat files (all are dumped when the GPU finishes the first time), and also the params.out file. I'm running zeusmp (from SPECCPU2006) and hotspot (from Rodinia 3.1). L*_MISS_GPU+L*_HIT_GPU is different, while INST_COUNT_CORE_1 is the same in both. I observed similar behaviour on almost all other GPU benchmarks also.
Any help will be much appreciated. Thanks!
Sincerely,
Abhinav Sharma
Indian Institute of Science