There are some problems with your code:
1. In line 123 of main.s you're using a1. But in your matmul.s you didn't follow the calling convention and you have changed the a1 value. So if you print s2, after line 123, it shows a wrong value.
2. After fixing the first problem, double-check that you're not reaching out-of-bound elements in your relu.s.
Basically, inserting some int_print in your code would be helpful for finding the bugs of the code. I have inserted one after calling "matmul" and one inside relu.s, and the first one helped me to find out that s2 has a wrong value.