Hi all,
First, thank you for your work. Accel-sim is a powerful tool for simulating GPU environments.
I have already verified that Accel-sim supports cuDNN and tensor cores (the HMMA.884 instruction), using both the code you provide (such as DeepBench) and the latest CUTLASS code from NVIDIA built for the Volta generation.
However, when I run the latest CUTLASS code with the HMMA.1688 or IMMA.8816 instructions, which are the FP16 and integer instructions for the Turing generation, I get the following error:
--------------------------------------------------------------------------------
accel-sim.out: abstract_hardware_model.cc:316: void warp_inst_t::generate_mem_accesses(): Assertion `0' failed.
./accel-sim/accel-sim-framework/util/job_launching/../../sim_run_11.0/cutlass_profiler/4096_4096_1024/RTX2060/slurm.sim: line 51: 7199 Aborted (core dumped) ./accel-sim/accel-sim-framework/util/job_launching/../../sim_run_11.0/gpgpu-sim-builds/accelsim-commit-4c2bf09a79d6b57bb10fe1898700930a5dd5531f_modified_0.0/accel-sim.out -config ./gpgpusim.config -trace ./traces/kernelslist.g
--------------------------------------------------------------------------------
So I would like to know whether Accel-sim supports the HMMA.1688 and IMMA.8816 instructions.
I would also like to know whether Accel-sim does support these instructions but not the newly added LDSM instruction, as you mentioned in other conversations. If that is the case, I will try replacing the LDSM instructions with LDS instructions.
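
To help narrow this down, I could check which of these opcodes actually appear in my traces with something like the sketch below. It is only a rough helper, and it assumes the trace files are plain text in which each opcode appears as a whitespace-separated token. I have not confirmed that against the tracer's format. The function name `count_opcodes` and the prefix list are my own, not part of Accel-sim.

```python
from collections import Counter

# Opcode prefixes mentioned in this post; adjust as needed.
PREFIXES = ("HMMA", "IMMA", "LDSM")

def count_opcodes(lines, prefixes=PREFIXES):
    """Count whitespace-separated tokens that start with one of the prefixes.

    Assumption: each opcode shows up as its own token in a text line.
    """
    counts = Counter()
    for line in lines:
        for token in line.split():
            if token.startswith(prefixes):
                counts[token] += 1
    return counts

# Usage (the path is a placeholder for one kernel trace file):
#   with open("path/to/kernel.trace") as f:
#       for opcode, n in sorted(count_opcodes(f).items()):
#           print(opcode, n)
```

If LDSM shows up in the counts for the failing kernel, that would suggest the assertion comes from LDSM rather than from HMMA.1688 or IMMA.8816 themselves.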