Aug 4, 2021
Thank you for the kind words Sarah :) Basically, we have Y=f(X,W,B). In other words, W is contained inside Y. So the chain rule states that: dE/dW = dE/dY * dY/dW. Since we have multiple Y values, it turns into a sum. It's the formula written just after what you have highlighted. Does that answer your question ?