Hi Sean,
Try the --use-systolic-array command-line flag to enable it. It will use the systolic array of the backend whenever possible.
Currently we don’t have a mailing list for SMAUG; you can either ask questions here or file issues/questions on the SMAUG repo.
Thanks,
Yuan
--
You received this message because you are subscribed to the Google Groups "gem5-Aladdin users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to gem5-aladdin-us...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/gem5-aladdin-users/2f8febd8-aa42-4786-98b8-760d2d56e96bn%40googlegroups.com.
Hi Sean,
Thanks for reporting this. The command lines for generating the trace and running the gem5 simulation look right to me. That systolic_array.cfg should also be correct.
I tried to reproduce this using the same trace and config, and for me it was a segfault during stage 3 of ResNet50. So something must be wrong in the systolic array. Root-causing this is a little challenging, as it takes a couple of hours to crash, but I’ll let you know if I find the bug.
I can reproduce the same error now and have made some progress. In the systolic array, the commit unit collects finished data from the PE array, buffers it until it fills one memory request, and then sends a write request to the output scratchpad. The assertion failed because, for some reason, when the commit unit received new data from the PE array, the previous data in the buffer had not finished writing to the scratchpad. This seems to be a memory/bus bandwidth issue, but I’m not sure yet. I’ll dig further to root-cause it.
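To make the failure condition above concrete, here is a toy Python model of a commit unit. It is only a sketch under stated assumptions: the class name, the `request_size` and `write_latency` parameters, and the cycle-level timing are all hypothetical and are not taken from the SMAUG or gem5-Aladdin sources.

```python
class ToyCommitUnit:
    """Hypothetical model of a commit unit; not the actual simulator code."""

    def __init__(self, request_size, write_latency):
        if write_latency < 1:
            raise ValueError("write_latency must be at least 1 cycle")
        self.request_size = request_size    # values per write request (assumed)
        self.write_latency = write_latency  # cycles per scratchpad write (assumed)
        self.buffer = []
        self.busy_cycles = 0                # cycles left on the in-flight write
        self.writes_completed = 0

    def tick(self):
        """Advance one cycle; retire the in-flight write once its latency elapses."""
        if self.busy_cycles > 0:
            self.busy_cycles -= 1
            if self.busy_cycles == 0:
                self.buffer.clear()
                self.writes_completed += 1

    def receive(self, value):
        """Accept one finished value from the PE array."""
        # Mirrors the failing condition described in the thread: new data
        # arrives while the previous buffer is still being written out.
        if self.busy_cycles > 0:
            raise RuntimeError("new data arrived before the previous write finished")
        self.buffer.append(value)
        if len(self.buffer) == self.request_size:
            self.busy_cycles = self.write_latency  # issue the write request


def run(unit, num_values, cycles_between_values):
    """Feed num_values into the unit, ticking a fixed number of cycles after each."""
    for value in range(num_values):
        unit.receive(value)
        for _ in range(cycles_between_values):
            unit.tick()
    return unit.writes_completed
```

In this model, `run(ToyCommitUnit(4, 2), 8, 2)` completes both writes, while raising `write_latency` to 3 with the same arrival rate triggers the error, which is the kind of bandwidth mismatch suspected above.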
Some suggestions that would help debug:
Sam Xi
Google Inc., Software Engineer
http://www.samxi.org
Hi Sean,
Sorry for the late response. This issue is caused by a bug in the TensorIndexIterator class, which is used to index data in a tensor. Please pull the latest changes from the master branch; the bug should be fixed there. Let us know if you run into any more issues.
Thanks,
Yuan
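For readers unfamiliar with the idea, the following Python sketch shows what a tensor index iterator typically does: it walks a multi-dimensional index in row-major order and maps it to a flat offset. This is a hypothetical illustration only. The class below is not SMAUG's TensorIndexIterator, and the thread does not say what the actual bug was.

```python
class ToyTensorIndexIterator:
    """Hypothetical row-major index iterator; not SMAUG's implementation."""

    def __init__(self, shape):
        self.shape = list(shape)
        self.index = [0] * len(self.shape)
        # A tensor with any zero-sized dimension has nothing to iterate over.
        self.at_end = any(dim == 0 for dim in self.shape)

    def flat_offset(self):
        """Convert the current multi-dimensional index to a row-major offset."""
        offset = 0
        for i, dim in zip(self.index, self.shape):
            offset = offset * dim + i
        return offset

    def advance(self):
        """Step to the next element, carrying into outer dimensions as needed."""
        for axis in reversed(range(len(self.shape))):
            self.index[axis] += 1
            if self.index[axis] < self.shape[axis]:
                return
            self.index[axis] = 0  # wrap this axis and carry to the next one
        self.at_end = True


def all_offsets(shape):
    """Collect the flat offsets visited when iterating over a tensor of this shape."""
    it = ToyTensorIndexIterator(shape)
    offsets = []
    while not it.at_end:
        offsets.append(it.flat_offset())
        it.advance()
    return offsets
```

An off-by-one in the carry step of such an iterator would produce out-of-range offsets only for certain shapes, which is one way a bug of this kind could surface late in a long run.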