Minimization of objective function

kiuhnm · Nov 21, 2015

Hi,
I need to minimize, with respect to \hat{y}(x), the following function:
\tilde{J}_x = \mathbb{E}_{p(x,y)}[(\hat{y}(x)-y)^2] + \nu \mathbb{E}_{p(x,y)}[(\hat{y}(x)-y)tr(\nabla_x^2\hat{y}(x))] + \nu \mathbb{E}_{p(x,y)}[||\nabla_x\hat{y}(x)||^2],
where x is a vector and y a scalar.
I found this in a book about Deep Learning (Machine Learning). I'm studying on my own and this math is a bit over my head. If you want more context, see pages 215-216 here: http://goodfeli.github.io/dlbook/contents/regularization.html
First of all, do I need to learn the Calculus of Variations to solve this?
The expression I wrote here is slightly different from the one on the book, because I think the authors forgot a "trace" (tr).
Thank you for your time.

MP. · Nov 21, 2015

Hey,
I think you're looking for the Lagrange multiplier

MP

EDIT: Sorry, misread the post, I thought you already wanted a solution within calculus. The answer to whether or not you'll have to learn it is not really, as you will have a machine do it for you anyway. So you don't have to understand why this solution works as long as you manage to code it once / get somebody else to do it.

MP. · Nov 21, 2015

And please don't forget that the function doesn't neccesarily have to obtain a min/max value, unless you're working with a closed set.

For example the function f(x) = x obtains no minima/maxima for x∈(0,1), although you can get "infinitely close" to both infimum and supremum (0 and 1). You may have to consider these cases separately, that really depends on what you're doing.

MP

Minimization of objective function

Thread 'Distance between a Clock's hands when the distance is increasing most rapidly'

Similar threads

Prove that the integral is equal to ##\pi^2/8##

Limit of piecewise function using epsilon delta

Volume with spherical coordinates

Use greedy vertex coloring algorithm to prove the upper bound of χ

Does this series converge uniformly?

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers