Neural networks and the derivatives of the cost function

In summary: this thread concerns deriving the derivatives of the quadratic cost function in an artificial neural network. The original poster can derive the forward-propagation equations without trouble but struggles with the derivative of the cost function with respect to the weight matrices, and asks for resources offering a thorough derivation, or for linear algebra references relevant to the question. A helpful derivation from stats.stackexchange, centered on the chain rule for derivatives, is also mentioned.
  • #1
2sin54
Hello. I need some guidance on the derivation of the derivatives of the quadratic cost function (CF) in an artificial neural network. I can derive the equations for the forward propagation with no trouble, but when it comes to finding the derivative of the CF with respect to the weight matrix (matrices), I struggle to distinguish where to use the Hadamard product, where to use the ordinary matrix (dot) product, and in what order the factors should be multiplied. Does anyone know a good resource where I could see a thorough derivation of this, or a linear algebra resource relevant to my question?
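For reference, here is a sketch of the equations the question is aiming at, under the usual textbook conventions (these are assumptions, not from the thread): quadratic cost $C = \tfrac12\|a^L - y\|^2$ for a single training example, elementwise activation $\sigma$, and layer pre-activations $z^l = W^l a^{l-1} + b^l$ with $a^l = \sigma(z^l)$:

$$
\begin{aligned}
\delta^L &= (a^L - y) \odot \sigma'(z^L),\\
\delta^l &= \big((W^{l+1})^{\mathsf T}\, \delta^{l+1}\big) \odot \sigma'(z^l),\\
\frac{\partial C}{\partial W^l} &= \delta^l\,(a^{l-1})^{\mathsf T},\qquad
\frac{\partial C}{\partial b^l} = \delta^l .
\end{aligned}
$$

The rule of thumb: the Hadamard product $\odot$ appears exactly where the elementwise activation is differentiated; everywhere else the products are ordinary matrix products, with the transpose of $W^{l+1}$ carrying the error backwards through a layer.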
 
  • #2

1. What is a neural network and how does it work?

A neural network is a type of machine learning algorithm inspired by the structure and function of the human brain. It consists of interconnected nodes, or neurons, that process and transmit information. The network learns by adjusting the strength of connections between neurons based on the data it is trained on.
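As a concrete illustration (a minimal sketch, not from the thread; the column-vector convention and sigmoid activation are illustrative assumptions), a fully connected network's forward pass is just alternating matrix products and elementwise activations:

```python
import numpy as np

def sigmoid(z):
    """Elementwise logistic activation."""
    return 1.0 / (1.0 + np.exp(-z))

def forward(x, weights, biases):
    """Propagate column vector x through a fully connected network.

    weights[l] has shape (n_out, n_in) and biases[l] has shape (n_out, 1),
    so each layer computes a = sigmoid(W @ a + b).
    """
    a = x
    for W, b in zip(weights, biases):
        a = sigmoid(W @ a + b)
    return a
```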

2. What is the cost function in a neural network?

The cost function in a neural network is a mathematical expression that measures the difference between the predicted output of the network and the actual output. It is used to evaluate the performance of the network and guide the learning process by minimizing the cost through the adjustment of parameters.
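For the quadratic cost discussed in this thread, a single-example version might look like the following sketch; the factor of 1/2 is a common convention that makes the derivative come out as simply $a - y$:

```python
import numpy as np

def quadratic_cost(a, y):
    """Quadratic cost for one example: C = 0.5 * ||a - y||^2,
    where a is the network's output and y is the target."""
    return 0.5 * np.sum((a - y) ** 2)
```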

3. Why do we need to calculate the derivatives of the cost function in a neural network?

The derivatives of the cost function are used to update the parameters of the neural network during the learning process. By calculating the derivatives, we can determine the direction and magnitude of the change needed to minimize the cost function and improve the performance of the network.
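In code, the resulting update is plain gradient descent. This sketch assumes the gradients have already been computed (for example, by the backpropagation sketch further down); the learning rate eta is an illustrative hyperparameter, not a value from the thread:

```python
def gradient_descent_step(weights, biases, grad_w, grad_b, eta=0.1):
    """One gradient descent update: parameter <- parameter - eta * gradient.

    grad_w[l] and grad_b[l] are dC/dW and dC/db for layer l;
    eta is the learning rate (an assumed value here).
    """
    for l in range(len(weights)):
        weights[l] -= eta * grad_w[l]
        biases[l] -= eta * grad_b[l]
```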

4. How are the derivatives of the cost function calculated in a neural network?

The derivatives of the cost function are typically calculated using the chain rule, which involves taking the partial derivative of the cost function with respect to each parameter in the network. This process is repeated for each layer in the network, starting from the output layer and working backwards.
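Written out for a single weight (using the common notation $z^l = W^l a^{l-1} + b^l$ and $\delta^l_j \equiv \partial C / \partial z^l_j$), the chain rule gives

$$
\frac{\partial C}{\partial w^l_{jk}}
= \frac{\partial C}{\partial z^l_j}\,\frac{\partial z^l_j}{\partial w^l_{jk}}
= \delta^l_j\, a^{l-1}_k ,
$$

and collecting all the components turns this into the outer-product form $\partial C / \partial W^l = \delta^l (a^{l-1})^{\mathsf T}$, which is exactly the matrix expression in the sketch after post #1.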

5. What is the role of backpropagation in calculating the derivatives of the cost function?

Backpropagation is an algorithm used to efficiently calculate the derivatives of the cost function in a neural network. It works by propagating the error from the output layer back through the network, using the chain rule to determine each layer's contribution to the overall error. This makes computing the derivatives far cheaper and more accurate than, for example, estimating each partial derivative separately by finite differences.
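To make the Hadamard-versus-matrix-product distinction explicit (the original question), here is a minimal numpy sketch of backpropagation for the quadratic cost. The conventions (column vectors, sigmoid activation) are illustrative assumptions: elementwise (Hadamard) products appear as `*`, matrix products as `@`:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def sigmoid_prime(z):
    s = sigmoid(z)
    return s * (1.0 - s)

def backprop(x, y, weights, biases):
    """Return (grad_w, grad_b) for the quadratic cost on one example.

    weights[l] has shape (n_out, n_in); x and y are column vectors.
    """
    # Forward pass, storing pre-activations z and activations a per layer.
    a, activations, zs = x, [x], []
    for W, b in zip(weights, biases):
        z = W @ a + b
        zs.append(z)
        a = sigmoid(z)
        activations.append(a)

    grad_w = [None] * len(weights)
    grad_b = [None] * len(biases)

    # Output layer: delta^L = (a^L - y) * sigma'(z^L)  -- Hadamard product.
    delta = (activations[-1] - y) * sigmoid_prime(zs[-1])
    grad_w[-1] = delta @ activations[-2].T  # outer product: dC/dW^L
    grad_b[-1] = delta

    # Hidden layers: delta^l = (W^{l+1}.T @ delta^{l+1}) * sigma'(z^l).
    for l in range(len(weights) - 2, -1, -1):
        delta = (weights[l + 1].T @ delta) * sigmoid_prime(zs[l])
        grad_w[l] = delta @ activations[l].T
        grad_b[l] = delta

    return grad_w, grad_b
```

Note how the two kinds of product never mix roles: `*` only ever multiplies an error vector by the activation derivative evaluated at the same layer's z, while `@` handles everything that moves information between layers.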
