Understanding the Cost Function in Machine Learning: A Practical Guide

Summary: The discussion centers on differentiating a cost function in a neural-network context, specifically the loss function ##L = wE##, where ##E = (G - G_{est})^2## and ##G = F'F##. The original poster is struggling to derive the derivative of the loss with respect to ##F##, which the paper states is proportional to ##F'(G - G_{est})##. Respondents ask for clarification of the notation, in particular what the primes mean and why ##G - G## would not simply be zero, and note that more context is needed to pin down the mathematical framework. A specific paper is referenced for further detail.
emmasaunders12
Could someone please help me work through the differentiation in a paper (not homework)? I am having trouble figuring out how they came up with their cost function.

The loss function is ##L = wE##, where ##E = (G - G_{est})^2## and ##G = F'F##.

The derivative of the loss function with respect to ##F## is stated to be proportional to ##F'(G - G_{est})##.

I can't seem to figure it out.

Thanks

Emma
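For what it's worth, here is how the differential works out under one reading of the notation (a sketch only, assuming ##'## denotes transpose and ##G_{est}## is symmetric, so that ##E## is the squared Frobenius norm ##\operatorname{tr}\big((G - G_{est})'(G - G_{est})\big)##):

$$
dE = 2\operatorname{tr}\!\big((G - G_{est})\,d(F'F)\big)
   = 2\operatorname{tr}\!\big((G - G_{est})(dF'\,F + F'\,dF)\big),
$$

and collecting the coefficient of ##dF## gives

$$
\frac{\partial E}{\partial F} = 4\,F\,(G - G_{est}),
$$

i.e. proportional to ##F(G - G_{est})##. The paper's ##F'(G - G_{est})## may reflect a different convention for ##G## (e.g. ##FF'##) or for the gradient layout; without the paper's exact definitions this derivation is only one plausible reading.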
 
I'm having some trouble understanding you:

Do all functions depend on, say, time ##t##, to which the primes refer? And why isn't ##G - G = 0##? I first thought it could be strange notation for a function, but then you defined a single ##G##. And lastly, could it be ##L \propto F(G-G)'##?
 
fresh_42 said:
I'm having some trouble understanding you:

Do all functions depend on, say, time ##t##, to which the primes refer? And why isn't ##G - G = 0##? I first thought it could be strange notation for a function, but then you defined a single ##G##. And lastly, could it be ##L \propto F(G-G)'##?

Thanks for the response. It's the loss function of a neural network, so I've corrected the notation to ##G## and ##G_{est}##; the primes refer to transpose.
 
emmasaunders12 said:
Thanks for the response. It's the loss function of a neural network, so I've corrected the notation to ##G## and ##G_{est}##; the primes refer to transpose.
Perhaps someone else can help, but without a lot more context I have no idea what mathematically we are dealing with here.
 
PeroK said:
Perhaps someone else can help, but without a lot more context I have no idea what mathematically we are dealing with here.

The specific problem is described on page 4 here: https://arxiv.org/pdf/1505.07376v3.pdf
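If it helps, one way to sanity-check a matrix-calculus identity like this is a finite-difference test. Below is a quick NumPy sketch under one reading of the notation (the matrix shapes and the symmetric ##G_{est}## are my assumptions, not necessarily the paper's): it compares the analytic gradient ##4F(G - G_{est})## of ##E = \lVert F'F - G_{est}\rVert_F^2## against a numerical gradient.

```python
import numpy as np

# Finite-difference check of a matrix-calculus identity (a sketch, not the
# paper's exact setup): E = ||F'F - Gest||_F^2 with ' meaning transpose.
# For symmetric Gest, the analytic gradient dE/dF is 4 F (F'F - Gest).

rng = np.random.default_rng(0)
F = rng.standard_normal((5, 3))          # shapes chosen for illustration
Gest = rng.standard_normal((3, 3))
Gest = (Gest + Gest.T) / 2               # symmetric target, as assumed above

def E(F):
    D = F.T @ F - Gest
    return np.sum(D * D)                 # squared Frobenius norm

analytic = 4 * F @ (F.T @ F - Gest)

# Central finite differences, entry by entry.
eps = 1e-6
numeric = np.zeros_like(F)
for i in range(F.shape[0]):
    for j in range(F.shape[1]):
        Fp, Fm = F.copy(), F.copy()
        Fp[i, j] += eps
        Fm[i, j] -= eps
        numeric[i, j] = (E(Fp) - E(Fm)) / (2 * eps)

print(np.max(np.abs(analytic - numeric)))  # expect a very small number
```

If the printed discrepancy is near machine precision, the analytic form is correct for this reading; a large discrepancy would suggest the paper uses a different convention (e.g. ##G = FF'## or a transposed gradient layout).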
 