I understood the way to compute the forward part in Deep learning. Now, I want to understand the backward part. Let's take `X(2,2)` as an example. The backward at the position `X(2,2)` can compute as the figure bellow
![](https://lh3.googleusercontent.com/-N6W0JnlPjCY/WJFjOK9goJI/AAAAAAAAAAk/9TUwxrDNMrIQI1GRCbezEenMeZO-cxzewCLcB/s320/Screenshot%2Bfrom%2B2017-02-01%2B13%253A10%253A24.png)
My question is that How to compute `dE/dY` (such as `dE/dY(1,1)`,`dE/dY(1,2)`...) at the first iteration? Does it randomly initial?