Lagrange undetermined multipliers

  • Thread starter Thread starter unscientific
  • Start date Start date
  • Tags Tags
    Lagrange
Click For Summary

Homework Help Overview

This discussion revolves around the method of Lagrange undetermined multipliers, specifically focusing on finding maxima and minima points under constraints. Participants are exploring the mathematical reasoning behind the formulation of the method, particularly the relationship between the differentials of the functions involved.

Discussion Character

  • Conceptual clarification, Mathematical reasoning, Assumption checking

Approaches and Questions Raised

  • Participants are questioning why the formulation involves adding the contributions of the functions rather than equating their differentials directly. There is a discussion about the implications of using different forms of the equations, such as df = λdg versus df + λdg = 0.

Discussion Status

The discussion is active, with participants providing insights and clarifications regarding the interpretation of the Lagrange multiplier and its significance in optimization problems. There is an exploration of different perspectives on the mathematical expressions used in the method.

Contextual Notes

Some participants note the importance of understanding the implications of the sign of the Lagrange multiplier in relation to optimality conditions, particularly in the context of inequality constraints. There is also mention of specific cases where the formulation might differ based on the context of the problem.

unscientific
Messages
1,728
Reaction score
13

Homework Statement



This section describes the "Lagrange undetermined multipliers" method to find a maxima/minima point, which i have several problems at the end.

The Attempt at a Solution



Why are they adding the respective contributions d(f + λg), instead of equating df = λdg ?

Imagine f(x,y) as the function in the 2nd picture attached, and g(x,y) = c as an equation of a circle. We know that the constraint is g(x,y) = c so therefore all possible points (x,y) from the origin must follow g(x,y) = c.

Then somewhere in f(x,y) there is a minima point (Point B) that also lie on g(x,y). We know that:

=> This point B must satisfy df = (∂f/∂x)dx + (∂f/∂y)dy = 0 and must satisfy g(x,y) = c

To solve for this point B, we simply equate df = λdg.

Why are they adding them? It's like adding the graph of y = sin x + cos x to find the intersection between them, instead of equating sin x = cos x.
 

Attachments

  • lagrange1.jpg
    lagrange1.jpg
    45.2 KB · Views: 923
  • lagrange2.jpg
    lagrange2.jpg
    24.2 KB · Views: 528
Physics news on Phys.org
df=λdg is equivalent to d(f-λg) =0. The value of lambda would be the negative of the one, obtained with d(f+λg) =0.
(When I learned about the method of Lagrange multiplier, we used the form d(f-λg) =0.:smile:)

ehild
 
ehild said:
df=λdg is equivalent to d(f-λg) =0. The value of lambda would be the negative of the one, obtained with d(f+λg) =0.
(When I learned about the method of Lagrange multiplier, we used the form d(f-λg) =0.:smile:)

ehild

YES! i knew it! thanks so much! it made more sense to equate than to add them, right? (adding them is only for special cases when both = 0)
 
Well, both of them should be zero. The point moves along g=const, so dg=0, but at the same time it must be an extreme, so df=0...

ehild
 
Last edited:
No, with the restriction g= constant, neither df nor dg is necessarily 0.

More correctly, rather than "df" we have \nabla f, the gradient vector. For any f, \nabla f points in the direction of fastest increase, so if we want to go to the point of maximum f, we should move in that direction, moving until \nabla f= 0[/tex] so there is no &quot;direction&quot; in which to move.<br /> <br /> But if we are required to stay on the surface g(x,y,z)= constant, we can&#039;t do that. We can, rather, argue that we could move in the direction of the <b>projection</b> of \nabla f on the surface. We can do that until there is no such direction- when \nabla f is <b>perpendicular</b> to the surface. Since \nabla g is perpendicular to g(x,y,z)= constant at every point, that means we must have \nabla f <b>parallel</b> to \nabla g- hence \nabla f is a multiple of \nabla g.
 
HallsofIvy,

dg is meant the change of g along the curve g=const, not the gradient of g which is perpendicular to g=const. Naturally dg=0. df is the change of f when a point (x,y) shifts by (dx,dy). (See the OP where df were defined: the dot product of the gradient(f) with the vector (dx,dy).) If f has an extreme on g df must be zero with the appropriate (dx,dy). As you pointed out, grad(f) must be perpendicular to g(x,y)=const, that is grad (f)=λgrad(g) instead of df=λdg.

ehild
 
Last edited:
unscientific said:

Homework Statement



This section describes the "Lagrange undetermined multipliers" method to find a maxima/minima point, which i have several problems at the end.



The Attempt at a Solution



Why are they adding the respective contributions d(f + λg), instead of equating df = λdg ?

Imagine f(x,y) as the function in the 2nd picture attached, and g(x,y) = c as an equation of a circle. We know that the constraint is g(x,y) = c so therefore all possible points (x,y) from the origin must follow g(x,y) = c.

Then somewhere in f(x,y) there is a minima point (Point B) that also lie on g(x,y). We know that:

=> This point B must satisfy df = (∂f/∂x)dx + (∂f/∂y)dy = 0 and must satisfy g(x,y) = c

To solve for this point B, we simply equate df = λdg.

Why are they adding them? It's like adding the graph of y = sin x + cos x to find the intersection between them, instead of equating sin x = cos x.

It does not matter whether we write df - λdg = 0 or df + λdg = 0; they just use λ of opposite signs. It DOES matter when usinig the interpretation of λ in a post-optimality analysis (which is often as important as the solution itself). In the problem max/min f, subject to g = c, the λ in the df = λdg form represents the *rate of change of the optimal value as a function of c*; that is, if we regard the problem as having a solution x(c), giving a value F(c) = f(x(c)), then λ = dF/dc at the original value of c. If we use the df+λdg = 0 form, we have λ = -dF/dc.

Also: in _inequality_ constrained problems, the sign of the Lagrange multiplier is determined (so having a λ of the wrong sign tells you your point is not optimal---an important test used in optimization algorithms for numerical solution). Of course, you need to write the correct form of optimality condition so that the sign of λ is properly examined.

RGV
 

Similar threads

  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 16 ·
Replies
16
Views
3K
  • · Replies 11 ·
Replies
11
Views
3K
  • · Replies 18 ·
Replies
18
Views
3K
  • · Replies 10 ·
Replies
10
Views
2K
Replies
3
Views
2K
Replies
3
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K