Lagrange undetermined multipliers

unscientific · Jul 24, 2012

Homework Statement

This section describes the "Lagrange undetermined multipliers" method to find a maxima/minima point, which i have several problems at the end.

The Attempt at a Solution

Why are they adding the respective contributions d(f + λg), instead of equating df = λdg ?

Imagine f(x,y) as the function in the 2nd picture attached, and g(x,y) = c as an equation of a circle. We know that the constraint is g(x,y) = c so therefore all possible points (x,y) from the origin must follow g(x,y) = c.

Then somewhere in f(x,y) there is a minima point (Point B) that also lie on g(x,y). We know that:

=> This point B must satisfy df = (∂f/∂x)dx + (∂f/∂y)dy = 0 and must satisfy g(x,y) = c

To solve for this point B, we simply equate df = λdg.

Why are they adding them? It's like adding the graph of y = sin x + cos x to find the intersection between them, instead of equating sin x = cos x.

ehild · Jul 24, 2012

df=λdg is equivalent to d(f-λg) =0. The value of lambda would be the negative of the one, obtained with d(f+λg) =0.
(When I learned about the method of Lagrange multiplier, we used the form d(f-λg) =0.

)

ehild

unscientific · Jul 24, 2012

ehild said:

df=λdg is equivalent to d(f-λg) =0. The value of lambda would be the negative of the one, obtained with d(f+λg) =0.
(When I learned about the method of Lagrange multiplier, we used the form d(f-λg) =0.)

ehild

YES! i knew it! thanks so much! it made more sense to equate than to add them, right? (adding them is only for special cases when both = 0)

ehild · Jul 24, 2012

Well, both of them should be zero. The point moves along g=const, so dg=0, but at the same time it must be an extreme, so df=0...

ehild

HallsofIvy · Jul 24, 2012

No, with the restriction g= constant, neither df nor dg is necessarily 0.

More correctly, rather than "df" we have [itex]\nabla f[/itex], the gradient vector. For any f, [itex]\nabla f[/itex] points in the direction of fastest increase, so if we want to go to the point of maximum f, we should move in that direction, moving until [itex]\nabla f= 0[/tex] so there is no "direction" in which to move. But if we are required to stay on the surface g(x,y,z)= constant, we can't do that. We can, rather, argue that we could move in the direction of the projection of [itex]\nabla f[/itex] on the surface. We can do that until there is no such direction- when [itex]\nabla f[/itex] is perpendicular to the surface. Since [itex]\nabla g[/itex] is perpendicular to g(x,y,z)= constant at every point, that means we must have [itex]\nabla f[/itex] parallel to [itex]\nabla g[/itex]- hence [itex]\nabla f[/itex] is a multiple of [itex]\nabla g[/itex].[/itex]

ehild · Jul 24, 2012

HallsofIvy,

dg is meant the change of g along the curve g=const, not the gradient of g which is perpendicular to g=const. Naturally dg=0. df is the change of f when a point (x,y) shifts by (dx,dy). (See the OP where df were defined: the dot product of the gradient(f) with the vector (dx,dy).) If f has an extreme on g df must be zero with the appropriate (dx,dy). As you pointed out, grad(f) must be perpendicular to g(x,y)=const, that is grad (f)=λgrad(g) instead of df=λdg.

ehild

Ray Vickson · Jul 24, 2012

unscientific said:

Homework Statement

This section describes the "Lagrange undetermined multipliers" method to find a maxima/minima point, which i have several problems at the end.

The Attempt at a Solution

Why are they adding the respective contributions d(f + λg), instead of equating df = λdg ?

Imagine f(x,y) as the function in the 2nd picture attached, and g(x,y) = c as an equation of a circle. We know that the constraint is g(x,y) = c so therefore all possible points (x,y) from the origin must follow g(x,y) = c.

Then somewhere in f(x,y) there is a minima point (Point B) that also lie on g(x,y). We know that:

=> This point B must satisfy df = (∂f/∂x)dx + (∂f/∂y)dy = 0 and must satisfy g(x,y) = c

To solve for this point B, we simply equate df = λdg.

Why are they adding them? It's like adding the graph of y = sin x + cos x to find the intersection between them, instead of equating sin x = cos x.

It does not matter whether we write df - λdg = 0 or df + λdg = 0; they just use λ of opposite signs. It DOES matter when usinig the interpretation of λ in a post-optimality analysis (which is often as important as the solution itself). In the problem max/min f, subject to g = c, the λ in the df = λdg form represents the *rate of change of the optimal value as a function of c*; that is, if we regard the problem as having a solution x(c), giving a value F(c) = f(x(c)), then λ = dF/dc at the original value of c. If we use the df+λdg = 0 form, we have λ = -dF/dc.

Also: in _inequality_ constrained problems, the sign of the Lagrange multiplier is determined (so having a λ of the wrong sign tells you your point is not optimal---an important test used in optimization algorithms for numerical solution). Of course, you need to write the correct form of optimality condition so that the sign of λ is properly examined.

RGV

Lagrange undetermined multipliers

Homework Help Overview

Discussion Character

Approaches and Questions Raised

Discussion Status

Contextual Notes

Homework Statement

The Attempt at a Solution

Attachments

Homework Statement

The Attempt at a Solution

Similar threads

Distance between a Clock's hands when the distance is increasing most rapidly

Polar integral

Deriving spatial derivatives

Is this the correct general solution of the given PDE?

J_1(x) = (x^2/10)*(J_1(x) + J_3(x)) How to solve?

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight