Is the Variation of a Functional in Calculus of Variation Correctly Calculated?

Click For Summary

Discussion Overview

The discussion revolves around the calculation of the variation of a functional in the context of the calculus of variations. Participants explore different approaches to defining the variation, particularly focusing on the differences between a physicist's intuitive method and a more formal mathematical approach.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • One participant questions whether the variation of the functional I with respect to g can be expressed as a specific integral involving partial derivatives of L.
  • Another participant confirms the correctness of the initial variation calculation presented.
  • A participant expresses confusion regarding the definition of "first variation" and contrasts their approach with a textbook definition that involves an external function h and a parameter ε.
  • One participant explains the differences in notation and rigor between the two approaches, noting that the physicist's method is more intuitive but less formal than the mathematical approach.
  • Another participant attempts to summarize the differences, suggesting that both methods ultimately yield similar results despite their differing levels of rigor and formalism.
  • A later reply acknowledges that while the intuitive approach is heuristic and useful, it lacks the precision required for formal proofs.

Areas of Agreement / Disagreement

Participants express differing views on the rigor and validity of the two approaches to calculating variations. While some agree on the correctness of the intuitive method, others emphasize the importance of formal definitions and the role of the function h in the mathematical approach. The discussion remains unresolved regarding the superiority of one method over the other.

Contextual Notes

Participants note that the intuitive approach using infinitesimals may not be well-defined in a rigorous mathematical sense, which could limit its applicability in formal proofs.

lennyleonard
Messages
21
Reaction score
0
Hi everyone!

Here's my problem:
Let's suppose that we have a functional [itex]I[f,g]=\int{L(f,\dot{f},g,\dot{g},x)\,dx}[/itex].

Is it right to say that the variation of [itex]I[/itex] whit respect to [itex]g[/itex] (thus taking [itex]g\;\rightarrow\;g+\delta g[/itex]) is [tex]\delta I=\int{[L(f,\dot{f},g+\delta g,\dot{g}+\delta \dot g,x)-L(f,\dot{f},g,\dot{g},x)]\,dx}=\int{(\frac{\partial L}{\partial g}\delta g+\frac{\partial L}{\partial \dot{g}}\delta \dot{g})\,dx}[/tex]??
Thanks for your disponibility!
 
Physics news on Phys.org
Yes, it is true.
 
Thank you petr!

I had this doubt when i saw the definition of "first variation" of a functional: according to the textbook i red (and to wikipedia: http://en.wikipedia.org/wiki/First_variation" ); taken a functional [itex]J(y)[/itex] it's said that I need a function [itex]h[/itex] to define the variaton of [itex]y:\;y\rightarrow y+\varepsilon h[/itex] where [itex]\varepsilon \in R[/itex].
Then the first variation is:[tex]\delta J=lim_{\varepsilon\rightarrow 0}\frac{J(y+\varepsilon h)-J(y)}{\varepsilon}[/tex]That's very different from the form I've written before!
In fact where i took [itex]y\rightarrow y+\delta y[/itex] the books take [itex]y\rightarrow y+\varepsilon h[/itex]; i.e. they use an "external" function [itex]h[/itex] and a parameter [itex]\varepsilon[/itex], to be sent to zero in the operation.

Is it becaouse my [itex]\delta y[/itex] can actually take the place of [itex]\varepsilon h[/itex] (being [itex]\delta y[/itex] an infinitesimal and a function it could work as wel..doesn'it?) or it's a whole different story??
The problem is that in some exercises (see the link to wikipedia above) the result depends on [itex]h[/itex], and [itex]h[/itex] alone it's not an infinitesimal, so it cannot be "replaced" by [itex]\delta y[/itex]!

Is there someone who could explain that to me??
 
Last edited by a moderator:
You seem to have understood the ideas but there are some differences here in notation and level of rigour, which add up to be quite confusing when you compare the two approaches.

In your opening post, you run through the first stages of what I might call a 'physicist's derivation' of the Euler-Lagrange equations, where you expand things up to first order and write an equality when you have ignored terms of quadratic order and higher. You regard [itex]\delta y[/itex] as an 'infinitesimal function' and you only bother to keep an intuitive idea of [itex]\delta I[/itex] as an 'infinitesimal change' rather than defining it properly. And this is fine, up to a point. The wiki article is a little more careful.

Here, we fix a definite function h, and a scale epsilon, which we imagine to be some very small but nonzero number. Then we see how the functional changes if we change its argument by [itex]\epsilon h[/itex], similar to your [itex]\delta y[/itex] as your instincts correctly suggested. Think of h as fixing the 'shape' of the perturbation and epsilon its 'size'. This change in the function is some small but finite number, and is similar to your [itex]\delta I[/itex]. For epsilon very small, this change will be linear in epsilon, so by dividing through by epsilon and taking the limit, just like taking ordinary derivatives, we get a measure of the rate of change of the functional in the 'direction' of h. This is what they call [itex]\delta J[/itex], which is a number which depends on h (Just as your [itex]\delta I[/itex] depends on [itex]\delta y[/itex]). Notice that nowhere here have we appealed to the hand-waving ideas of 'infinitesimals'.

Hopefully that helps; if you are still confused try doing the two approaches with ordinary functions of a single real variable.
 
You've been very kind to spend some of your time on my problem henry_m! I'm very thankful!
But please, be patient enough to see if my conclusions are right:

Basically the difference in the two method (aside for the derivations) is that the former is the "quick-straight to the result" one, which doesn't cure for the mathematical formalism and the latter is the "elegant, precise" one, which avoids (as you said) the clutter concept of differential.

Anyway basically the two things are quite the same: in "my" method I used an expansion to first order so my [itex]\delta I[/itex] depends on the infinitesimal [itex]\delta g[/itex] (and its derivatives):[tex]\delta I=\int{[L(g+\delta g,\dot{g}+\delta \dot{g},x)-L(g,\dot{g},x)]\;dx}=\int{(\frac{\partial L}{\partial g}\delta g+\frac{\partial L}{\partial \dot{g}}\delta \dot{g})\;dx}[/tex]while in the wiki-mathematician way there's no expansion employment whatsoever and thus the result depends on the non-infinitesimal funcion h, but besides this they're equal.
In fact it is also true that, using wiki method:[tex]\delta J(h) = \int{[(\frac{\partial L}{\partial g}h+\frac{\partial L}{\partial \dot{g}}\dot{h})]\;dx}[/tex]

Is this right?
 
Last edited:
lennyleonard said:
Anyway basically the two things are quite the same: in "my" method I used an expansion to first order so my [itex]\delta I[/itex] depends on the infinitesimal [itex]\delta g[/itex] (and its derivatives):[tex]\delta I=\int{[L(g+\delta g,\dot{g}+\delta \dot{g},x)-L(g,\dot{g},x)]\;dx}=\int{(\frac{\partial L}{\partial g}\delta g+\frac{\partial L}{\partial \dot{g}}\delta \dot{g})\;dx}[/tex]while in the wiki-mathematician way there's no expansion employment whatsoever and thus the result depends on the non-infinitesimal funcion h, but besides this they're equal.
In fact it is also true that, using wiki method:[tex]\delta J(h) = \int{[(\frac{\partial L}{\partial g}h+\frac{\partial L}{\partial \dot{g}}\dot{h})]\;dx}[/tex]

Is this right?

Yes, you seem to have things pretty much spot on. Just bear in mind that your [itex]\delta I[/itex], [itex]\delta g[/itex] are not well-defined objects so your approach is only heuristic and can't be used in a proper proof. But this sort of argument can be a very quick and useful way to derive and motivate results, and often appears in physics literature and textbooks.
 
Your explanations have been extremely clarifying, sir.

Thank you very very much!
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 14 ·
Replies
14
Views
2K
  • · Replies 4 ·
Replies
4
Views
1K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
Replies
6
Views
3K