SUMMARY
The discussion focuses on deriving the derivatives of the quadratic cost function in artificial neural networks. The user has difficulty knowing when to apply the Hadamard (elementwise) product versus the ordinary matrix product when computing the derivative of the cost with respect to the weight matrices. A relevant derivation from stats.stackexchange is referenced, emphasizing the central role of the chain rule. Understanding these distinctions is essential for correctly implementing backpropagation and for effective neural network training and optimization.
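The derivative discussed above can be sketched for a single layer with a sigmoid activation. The layer sizes, variable names, and choice of sigmoid here are illustrative assumptions, not details from the discussion; the point is where the Hadamard product and the matrix product each appear in the chain rule.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Illustrative shapes (assumed): 3 inputs, 2 outputs
rng = np.random.default_rng(0)
W = rng.standard_normal((2, 3))   # weight matrix
b = rng.standard_normal((2, 1))   # bias vector
x = rng.standard_normal((3, 1))   # input
y = rng.standard_normal((2, 1))   # target

# Forward pass: z = W x + b uses the ordinary matrix (dot) product
z = W @ x + b
a = sigmoid(z)

# Quadratic cost C = 0.5 * ||a - y||^2
# Chain rule: dC/dW = delta @ x.T, where
# delta = (a - y) ⊙ sigma'(z) uses the Hadamard (elementwise) product;
# for the sigmoid, sigma'(z) = a * (1 - a)
delta = (a - y) * a * (1 - a)     # '*' is elementwise (Hadamard) in NumPy
dC_dW = delta @ x.T               # matrix product gives a gradient shaped like W
```

Note the pattern: the Hadamard product pairs same-shaped vectors (error times activation derivative), while the matrix product with the input produces a gradient with the same shape as the weight matrix.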
PREREQUISITES
- Understanding of artificial neural networks and their architecture
- Familiarity with quadratic cost functions in machine learning
- Knowledge of linear algebra, specifically matrix operations
- Proficiency in calculus, particularly the chain rule for derivatives
NEXT STEPS
- Study the Hadamard product and its applications in neural networks
- Learn about the dot product and its role in weight updates
- Explore detailed derivations of the quadratic cost function in neural networks
- Investigate resources on the chain rule in multivariable calculus
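A practical way to build confidence in a derivation like this is a finite-difference gradient check, comparing the chain-rule gradient against a numerical estimate. The setup below (sigmoid activation, layer sizes, variable names) is an illustrative assumption, not taken from the discussion.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def cost(W, b, x, y):
    # Quadratic cost C = 0.5 * ||a - y||^2 for a single sigmoid layer
    a = sigmoid(W @ x + b)
    return 0.5 * np.sum((a - y) ** 2)

rng = np.random.default_rng(1)
W = rng.standard_normal((2, 3))
b = rng.standard_normal((2, 1))
x = rng.standard_normal((3, 1))
y = rng.standard_normal((2, 1))

# Analytic gradient via the chain rule:
# delta = (a - y) ⊙ sigma'(z), with sigma'(z) = a(1 - a) for the sigmoid
a = sigmoid(W @ x + b)
delta = (a - y) * a * (1 - a)     # Hadamard product
analytic = delta @ x.T            # matrix product, shaped like W

# Central-difference estimate of each entry of dC/dW
eps = 1e-6
numeric = np.zeros_like(W)
for i in range(W.shape[0]):
    for j in range(W.shape[1]):
        Wp = W.copy(); Wp[i, j] += eps
        Wm = W.copy(); Wm[i, j] -= eps
        numeric[i, j] = (cost(Wp, b, x, y) - cost(Wm, b, x, y)) / (2 * eps)
```

If the analytic and numerical gradients agree entrywise, the placement of the Hadamard and matrix products in the derivation is almost certainly correct.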
USEFUL FOR
This discussion is beneficial for machine learning practitioners, data scientists, and students studying artificial neural networks who seek to deepen their understanding of cost function derivatives and optimization techniques.