A Lagrange multiplier approach to the catenary problem

In summary: The fact that you do not want to change c means that you can find an arbitrary number of directions in which ##c## does not change (i.e., where ##\nabla c = 0##) but only one direction where ##s## does not change (i.e., where ##\nabla s = 0##). You want to find the minimum of ##s## in the directions where ##c## does not change, so you look at the directions where ##\nabla h## is zero. This is the same as saying that you want to find the extremum of ##s## subject to the constraint that ##c## is a constant, which is exactly the same situation as
  • #1
Coffee_
259
2
In general, when dealing with mechanics problems using a function ##f(q1,q2,...)=0## that represent constraints one is minimizing the action ##S## while adding a term to the Lagrangian of the not-independent coordinates ##L + \lambda f ##. One can show that this addition doesn't change the minimal action and that the desired equations follow under a correct choice of ##\lambda##. Anyway this is the known theoretic approach.

Now consider the following treatment of the catenary problem on page 5: http://physics.ucsd.edu/students/courses/fall2009/managed/physics110a/documents/Lecture11-9.pdf

Don't waste the time on the details because my question is about the method:

Why does one add the ##\lambda f## term to the function ##V## one wants to minimize? As you see from my introduction above, when minimizing the action ##S##, that term is not added to the action but to the Lagrangian. Here ##V## is supposed to be the analogue of the action since this is what we want to minimize. So it seems to me that in this problem Lagrange multipliers are used differently than we had used in previous problems and different from the description of the theory.

Please help me lift the confusion. To show that I have thought about it, this is my idea but I'm not sure:

Since we are talking about the potential energy function ##V## and the kinetic energy is 0, we are basically talking about the Lagrangian and not the analog of the action. As we know from the theory we add this ##\lambda f## term to the Lagrangian, or the potential energy here. We know that the physics haven't changed from this addition. And now we minimize the new expression with the added term and forget that we even ever used lagrange multipliers?
 
Last edited by a moderator:
Physics news on Phys.org
  • #2
The constraint that the chain should have a fixed length is not a holonomic constraint and you should not be surprised that it appears differently. In fact, I think that this type of Lagrange multiplier is more reminiscent of how Lagrange multipliers are used in multivariable calculus. I agree that it is confusing mainly based upon the document not discussing this type of problem earlier but only discussing holonomic constraints.
 
  • #3
Orodruin said:
The constraint that the chain should have a fixed length is not a holonomic constraint and you should not be surprised that it appears differently. In fact, I think that this type of Lagrange multiplier is more reminiscent of how Lagrange multipliers are used in multivariable calculus. I agree that it is confusing mainly based upon the document not discussing this type of problem earlier but only discussing holonomic constraints.

Do you mind elaborating on the reasoning they use for this problem then? I seem to understand it for holonomic constraints but here something is different as I mentioned and just seem to not see it.
 
  • #4
Coffee_ said:
Do you mind elaborating on the reasoning they use for this problem then? I seem to understand it for holonomic constraints but here something is different as I mentioned and just seem to not see it.

Consider a functional ##S[f]## which you want to find the extreme values of under the condition ##C[f] = C_0##, where ##C## is another functional of ##f##. By finding the extreme values of ##H = S- \lambda C##, you will be setting
$$
\delta H = \delta S - \lambda \delta C = 0.
$$
This of course is equivalent to setting ##\delta S = \lambda \delta C##, i.e., the variation of ##S## is proportional to the variation of ##C##. If this is true, a variation that does not change ##C## does not change ##S## either, which is what must be true for an extreme value of ##S## under the condition that ##C## is a given constant. If ##\delta S## is not proportional to ##\delta C##, then you can find a variation such that ##\delta S## is non-zero while ##\delta C = 0##, implying that this variation will change the value of ##S## while keeping the value of ##C## equal to ##C_0##. The reasoning is exactly equivalent to the corresponding reasoning in multivariable calculus.
 
  • #5
Orodruin said:
Consider a functional ##S[f]## which you want to find the extreme values of under the condition ##C[f] = C_0##, where ##C## is another functional of ##f##. By finding the extreme values of ##H = S- \lambda C##, you will be setting
$$
\delta H = \delta S - \lambda \delta C = 0.
$$
This of course is equivalent to setting ##\delta S = \lambda \delta C##, i.e., the variation of ##S## is proportional to the variation of ##C##. If this is true, a variation that does not change ##C## does not change ##S## either, which is what must be true for an extreme value of ##S## under the condition that ##C## is a given constant. If ##\delta S## is not proportional to ##\delta C##, then you can find a variation such that ##\delta S## is non-zero while ##\delta C = 0##, implying that this variation will change the value of ##S## while keeping the value of ##C## equal to ##C_0##. The reasoning is exactly equivalent to the corresponding reasoning in multivariable calculus.

Thanks for the answer. I think the reason for me having troubles with this is because I have never seen functionals not variations formally. I had to deduce how one works with them from examples in mechanics. This means I'm certainly going to have look over your explanation a few times to really grasp it.
 
  • #6
Coffee_ said:
Thanks for the answer. I think the reason for me having troubles with this is because I have never seen functionals not variations formally. I had to deduce how one works with them from examples in mechanics. This means I'm certainly going to have look over your explanation a few times to really grasp it.

Compare to the multivariable case. Let us say you want to minimise some function ##s(x)## where ##x \in \mathbb R^n## under the condition that ##c(x) = c_0##. This is basically the same situation, only with functions instead of functionals. Now, in order to perform this minimisation, you construct ##h = s-\lambda c## and minimise it, i.e., you find out where ##\nabla h = \nabla s - \lambda \nabla c = 0##, which is equivalent to ##\nabla s = \lambda \nabla c##. This means that the gradient of ##s## is parallel to the gradient of ##c##. So what is the geometrical interpretation of this? In order to stay on the surface ##c = c_0##, you can only go in directions perpendicular to the surface normal, i.e., ##\nabla c## so if you have a point ##x_0## fulfilling ##\nabla h = 0## (and ##c(x_0) = c_0##), then any infinitesimal displacement ##\delta## within the surface is going to lead to ##\delta\cdot \nabla s = \lambda \delta\cdot \nabla c = 0##, which means that ##x_0## is an extremum of ##s## on the surface ##c = c_0##. The functional case is completely analogous to this.
 

1. What is the catenary problem?

The catenary problem is a mathematical problem that involves finding the shape of a hanging chain or cable under its own weight. It was first studied by mathematicians like Galileo and Huygens in the 17th century and has applications in fields such as architecture, engineering, and physics.

2. What is a Lagrange multiplier approach?

A Lagrange multiplier approach is a mathematical technique used to optimize a function subject to constraints. It involves using a Lagrange multiplier, which is a constant that helps incorporate the constraints into the optimization problem, to find the maximum or minimum value of the function.

3. How does the Lagrange multiplier approach apply to the catenary problem?

In the catenary problem, the Lagrange multiplier approach is used to find the shape of the hanging chain that minimizes its potential energy. The constraint in this case is the length of the chain, and the Lagrange multiplier helps incorporate it into the optimization problem.

4. What are the benefits of using a Lagrange multiplier approach for the catenary problem?

Using a Lagrange multiplier approach allows for a more efficient and systematic solution to the catenary problem. It also provides a rigorous mathematical framework for finding the optimal solution and can be applied to more complex problems with multiple constraints.

5. Are there any limitations to using a Lagrange multiplier approach for the catenary problem?

The Lagrange multiplier approach assumes that the catenary is a continuous and smooth curve, which may not always be the case in real-life scenarios. Additionally, it may not be suitable for finding the shape of a hanging chain under non-uniform gravitational fields or other external forces.

Similar threads

Replies
13
Views
1K
  • Calculus and Beyond Homework Help
Replies
8
Views
479
  • Special and General Relativity
Replies
3
Views
2K
Replies
7
Views
1K
  • Classical Physics
Replies
5
Views
1K
Replies
3
Views
1K
  • Advanced Physics Homework Help
Replies
2
Views
831
Replies
1
Views
946
  • Classical Physics
Replies
1
Views
990
Replies
5
Views
885
Back
Top