Is the Variation of a Functional in Calculus of Variation Correctly Calculated?

lennyleonard · May 25, 2011

Hi everyone!

Here's my problem:
Let's suppose that we have a functional [itex]I[f,g]=\int{L(f,\dot{f},g,\dot{g},x)\,dx}[/itex].

Is it right to say that the variation of [itex]I[/itex] whit respect to [itex]g[/itex] (thus taking [itex]g\;\rightarrow\;g+\delta g[/itex]) is [tex]\delta I=\int{[L(f,\dot{f},g+\delta g,\dot{g}+\delta \dot g,x)-L(f,\dot{f},g,\dot{g},x)]\,dx}=\int{(\frac{\partial L}{\partial g}\delta g+\frac{\partial L}{\partial \dot{g}}\delta \dot{g})\,dx}[/tex]??
Thanks for your disponibility!

Petr Mugver · May 26, 2011

Yes, it is true.

lennyleonard · May 26, 2011

Thank you petr!

I had this doubt when i saw the definition of "first variation" of a functional: according to the textbook i red (and to wikipedia: http://en.wikipedia.org/wiki/First_variation" ); taken a functional [itex]J(y)[/itex] it's said that I need a function [itex]h[/itex] to define the variaton of [itex]y:\;y\rightarrow y+\varepsilon h[/itex] where [itex]\varepsilon \in R[/itex].
Then the first variation is:[tex]\delta J=lim_{\varepsilon\rightarrow 0}\frac{J(y+\varepsilon h)-J(y)}{\varepsilon}[/tex]That's very different from the form I've written before!
In fact where i took [itex]y\rightarrow y+\delta y[/itex] the books take [itex]y\rightarrow y+\varepsilon h[/itex]; i.e. they use an "external" function [itex]h[/itex] and a parameter [itex]\varepsilon[/itex], to be sent to zero in the operation.

Is it becaouse my [itex]\delta y[/itex] can actually take the place of [itex]\varepsilon h[/itex] (being [itex]\delta y[/itex] an infinitesimal and a function it could work as wel..doesn'it?) or it's a whole different story??
The problem is that in some exercises (see the link to wikipedia above) the result depends on [itex]h[/itex], and [itex]h[/itex] alone it's not an infinitesimal, so it cannot be "replaced" by [itex]\delta y[/itex]!

Is there someone who could explain that to me??

henry_m · May 26, 2011

You seem to have understood the ideas but there are some differences here in notation and level of rigour, which add up to be quite confusing when you compare the two approaches.

In your opening post, you run through the first stages of what I might call a 'physicist's derivation' of the Euler-Lagrange equations, where you expand things up to first order and write an equality when you have ignored terms of quadratic order and higher. You regard [itex]\delta y[/itex] as an 'infinitesimal function' and you only bother to keep an intuitive idea of [itex]\delta I[/itex] as an 'infinitesimal change' rather than defining it properly. And this is fine, up to a point. The wiki article is a little more careful.

Here, we fix a definite function h, and a scale epsilon, which we imagine to be some very small but nonzero number. Then we see how the functional changes if we change its argument by [itex]\epsilon h[/itex], similar to your [itex]\delta y[/itex] as your instincts correctly suggested. Think of h as fixing the 'shape' of the perturbation and epsilon its 'size'. This change in the function is some small but finite number, and is similar to your [itex]\delta I[/itex]. For epsilon very small, this change will be linear in epsilon, so by dividing through by epsilon and taking the limit, just like taking ordinary derivatives, we get a measure of the rate of change of the functional in the 'direction' of h. This is what they call [itex]\delta J[/itex], which is a number which depends on h (Just as your [itex]\delta I[/itex] depends on [itex]\delta y[/itex]). Notice that nowhere here have we appealed to the hand-waving ideas of 'infinitesimals'.

Hopefully that helps; if you are still confused try doing the two approaches with ordinary functions of a single real variable.

lennyleonard · May 27, 2011

You've been very kind to spend some of your time on my problem henry_m! I'm very thankful!
But please, be patient enough to see if my conclusions are right:

Basically the difference in the two method (aside for the derivations) is that the former is the "quick-straight to the result" one, which doesn't cure for the mathematical formalism and the latter is the "elegant, precise" one, which avoids (as you said) the clutter concept of differential.

Anyway basically the two things are quite the same: in "my" method I used an expansion to first order so my [itex]\delta I[/itex] depends on the infinitesimal [itex]\delta g[/itex] (and its derivatives):[tex]\delta I=\int{[L(g+\delta g,\dot{g}+\delta \dot{g},x)-L(g,\dot{g},x)]\;dx}=\int{(\frac{\partial L}{\partial g}\delta g+\frac{\partial L}{\partial \dot{g}}\delta \dot{g})\;dx}[/tex]while in the wiki-mathematician way there's no expansion employment whatsoever and thus the result depends on the non-infinitesimal funcion h, but besides this they're equal.
In fact it is also true that, using wiki method:[tex]\delta J(h) = \int{[(\frac{\partial L}{\partial g}h+\frac{\partial L}{\partial \dot{g}}\dot{h})]\;dx}[/tex]

Is this right?

henry_m · May 27, 2011

lennyleonard said:

Anyway basically the two things are quite the same: in "my" method I used an expansion to first order so my [itex]\delta I[/itex] depends on the infinitesimal [itex]\delta g[/itex] (and its derivatives):[tex]\delta I=\int{[L(g+\delta g,\dot{g}+\delta \dot{g},x)-L(g,\dot{g},x)]\;dx}=\int{(\frac{\partial L}{\partial g}\delta g+\frac{\partial L}{\partial \dot{g}}\delta \dot{g})\;dx}[/tex]while in the wiki-mathematician way there's no expansion employment whatsoever and thus the result depends on the non-infinitesimal funcion h, but besides this they're equal.
In fact it is also true that, using wiki method:[tex]\delta J(h) = \int{[(\frac{\partial L}{\partial g}h+\frac{\partial L}{\partial \dot{g}}\dot{h})]\;dx}[/tex]

Is this right?

Yes, you seem to have things pretty much spot on. Just bear in mind that your [itex]\delta I[/itex], [itex]\delta g[/itex] are not well-defined objects so your approach is only heuristic and can't be used in a proper proof. But this sort of argument can be a very quick and useful way to derive and motivate results, and often appears in physics literature and textbooks.

lennyleonard · May 27, 2011

Your explanations have been extremely clarifying, sir.

Thank you very very much!

Is the Variation of a Functional in Calculus of Variation Correctly Calculated?

1. What is the definition of Calculus of Variation?

2. How is Calculus of Variation different from traditional calculus?

3. What are some applications of Calculus of Variation?

4. What are some techniques used in Calculus of Variation?

5. Are there any real-life examples that can be solved using Calculus of Variation?

Similar threads

Hot Threads

Recent Insights