Hi,
I'm trying to use the garnet network model. But it generates a deadlock; see the output log below:
GPGPU-Sim API: Stream Manager State
GPGPU-Sim API: stream 0 has 1 operations
GPGPU-Sim API: 0 : stream operation memcpy host-to-device
panic: Possible Deadlock detected. Aborting!
version: 12 request.paddr: 0x[0x7e8ec7f0, line 0x7e8ec780] m_readRequestTable: 1 current time: 5437930752500 issue_time: 5437680752500 difference: 250000000
@ cycle 5437930752500
[wakeup:build/VI_hammer/mem/ruby/system/Sequencer.cc, line 107]
Memory Usage: 2928700 KBytes
Program aborted at cycle 5437930752500
I use VI_hammer to take a checkpoint after linux boot; and then restore with the following command:
/research/wangh/noc/codebase/v03/gem5/build/VI_hammer/gem5.opt --outdir=m5out_ckpt /research/wangh/noc/codebase/v03/gem5-gpu/configs/fs_fusion.py --checkpoint-restore=1 --kernel=x86_64-vmlinux-2.6.28.4-smp --script=./backprop.rcS --cpu-type=detailed --restore-with-cpu=timing --num-cpus=4 --clusters=8 --topology=Cluster --garnet-network=fixed
Some more info:
I've tested backprop & srad, both deadlock.
It was running correctly before I add the last option "--garnet-network=fixed";
With smaller input (e.g. 2048 backprop), it runs successfully (only this one test, not sure if the input size matters).
With "--garnet-network=flexible", it seems also working.
I wonder if there is a known issue regarding this?
Thanks.
Hao