Hi Sihang,
If you use --access-host-pagetable there is no "scheduler" that coordinates shared memory. The GPU can access the DRAM exactly the same way as the CPU, with loads and stores to virtual addresses. Without --access-host-pagetable, the GPU uses a separate virtual address, but accesses the same physical addresses.
If you want to investigate the data within the caches, you will need to look into the Ruby subsystem and Ruby caches of gem5. For the GPU shared memory (GPU scratchpad) you need to look into its implementation in GPGPU-Sim.
Let us know if you have more questions,