How do Lie symmetries help find canonical coordinates in ODEs?

  • Context: Graduate 
  • Thread starter Thread starter dydxforsn
  • Start date Start date
  • Tags Tags
    Symmetries
Click For Summary
SUMMARY

This discussion focuses on the application of Lie symmetries to find canonical coordinates in ordinary differential equations (ODEs), referencing the textbook "Symmetry Methods for Differential Equations: A Beginner's Guide" by Peter Hydon. The participants explore the transformation of coordinates to achieve translational symmetry, specifically through the equations involving tangent vectors and the method of characteristics. Key equations discussed include the characteristic equations and their relationship to first-order PDEs, highlighting the confusion surrounding the derivation of these equations and their implications in solving ODEs.

PREREQUISITES
  • Understanding of Lie symmetries in differential equations
  • Familiarity with ordinary differential equations (ODEs)
  • Knowledge of first-order partial differential equations (PDEs)
  • Proficiency in using the method of characteristics
NEXT STEPS
  • Study the method of characteristics in detail for first-order PDEs
  • Learn about Lie group transformations and their applications in ODEs
  • Explore canonical coordinates and their significance in differential equations
  • Review the textbook "Symmetry Methods for Differential Equations" by Peter Hydon for practical examples
USEFUL FOR

Mathematicians, physicists, and engineers interested in advanced differential equations, particularly those working with symmetry analysis and canonical transformations in ODEs.

dydxforsn
Messages
104
Reaction score
0
I'm currently reading a textbook on the application of Lie symmetries to differential equations (Symmetry Methods for Differential Equations: A Beginner's Guide Hydon, Peter. Cambridge University Press. 2000.) I'm somewhat at the beginning (pg. 22-25) where a method is being discussed to find canonical coordinates (coordinates that allow you to have some translational symmetry to and ODE, I.E. (\hat{x}, \hat{y}) = (x, y + \epsilon).) Let's assume that the coordinate system we are working in does not have a translational symmetry. Currently we're trying to find some coordinate system that does have such a symmetry (because ODEs are solvable via integration in the presence of said symmetries..) So we're trying to find some (r, s) = (r(x, y), s(x, y)) such that (\hat{r}, \hat{s}) = (r(\hat{x}, \hat{y}), s(\hat{x}, \hat{y})) = (r, s + \epsilon). Anything variable with a "hat" on it is in the transformed coordinate system (or the destination of the mapping in the same coordinate system, whatever you want to call it...) However, this book appears to be clinically terse in "explanations". The section in the book I'm confused on is as follows:

In the new coordinates (\hat{r}, \hat{s}) the tangent vector to an orbit traced out by the continuum of epsilon values has a tangent vector at the point (r, s) of (0, 1):\frac{d \hat{r}}{d \epsilon} = 0, \quad \frac{d \hat{s}}{d \epsilon} = 1
These should both be evaluated at \epsilon = 0, but I don't know how to do that in LaTeX.
By the chain rule (\hat{r} is a function of 'x' and 'y' which are in turn functions of '\epsilon'):EQN\#1a: \quad \xi{(x, y)} r_{x} + \eta{(x, y)} r_{y} = 0EQN\#2b: \quad \xi{(x, y)} s_{x} + \eta{(x, y)} s_{y} = 1Canonical coordinates can be obtained from EQUATIONS #1 by using the method of characteristics.
Note that I believe \frac{dx}{{d}{\epsilon}} = \xi{(x, y)} and \frac{dy}{{d}{\epsilon}} = \eta{(x, y)}. I KNOW that r_{x} = \frac{{\partial}{r}}{{\partial}{x}}. Thus EQN#1 is a chain rule.
The characteristic equations are:\frac{dx}{\xi{(x, y)}} = \frac{dy}{\eta{(x, y)}} = dsA first integral of a given first-order ODEEQN\#2: \quad \frac{dy}{dx} = f(x, y)is a non-constant function \phi(x, y) whose value is constant on any solution y = y(x) of the ODE (Equation #2). Therefore:EQN\#3: \quad \phi_{x} + f(x, y) \phi_{y} = 0 \quad \phi_{y} \neq 0The general solution of the ODE (Equation #2) is:\phi (x, y) = cSuppose that \xi (x, y) \neq 0. Comparing equations #1 and #3, we see that the invariant canonical coordinate 'r' is a first integral ofEQN\#4: \quad \frac{dy}{dx} = \frac{\eta{(x, y)}}{\xi{(x, y)}}So r = \phi (x, y) is found by solving equation #4...
My question is: I'm VERY confused as to how the equation \frac{dx}{\xi{(x, y)}} = \frac{dy}{\eta{(x, y)}} = ds comes up. Where did this thing come from? What's all this talk about \phi (x, y)? I realize EQN#3 is the "characteristic equation", but I don't see how it takes the form that it does. Previously the book had said that the characteristic equation was something like:Q = \eta (x, y) - y' \xi (x, y)where \quad \frac{dy}{dx} = f(x, y) Why is the general solution to the ODE simply \phi (x, y) = c after the book said that the function wasn't constant... How in the world do equations #1 and #3 combine to get equation #4? I'm so utterly confused I had to make this entirely lengthy verbatim post because I can't break down this problem into a simple question.. I will say that I suspect my problems have something to do with the functional relationships of x, y, \hat{x}, \hat{y}, r, s, \hat{r}, and \hat{s}, because that's the added complication to this problem that a previous problem did not have where there was a translational symmetry in just straight 2D cartesian coordinates without having to transform to a canonical coordinate space.

I guess can anybody fill in the gaps on this explanation? I know I left things out, I'm hoping this is a standard/familiar enough problem that people can guide me the right direction even if I didn't quite have the ability to explain things correctly (after all I'm pretty confused about the whole thing..)
 
Physics news on Phys.org
In symmetry analysis, you arrive at first order linear PDE's, e.g. your equations 1a,2b. So some background of first order PDE's is necessary:

Consider the following general first order PDE:
F = P(x,y,z)z_{x}+Q(x,y,z)z_{y}-R(x,y,z)=0

The PDE simply says that the scalar product of [P,Q,R] and [z_x,z_y,-1] vanishes. Because they vanish, these two vectors are perpendicular.

We also have the relationship that dz=z_xdx +z_ydy due to the chain rule.
Now, the normal vector in a point B(x,y,z) on the solution curve z(x,y) is simply [z_x,z_y,-1]. You might see this intuitively, but it is also given by the chain rule above.

Now, because [z_x,z_y,-1] is normal to the solution surface, and because this vector is perpendicular to [P,Q,R], therefore [P,Q,R] must be tangent to the solution surface z(x,y).

Now, define a curve s on the solution z(x,y). We can interpret s as a coordinate axis projected onto the solution and we can express x,y,z in terms of s: x(s),y(s),z(s). We now know that the gradients of x(s),y(s),z(s) with respect to s (which is lying on the solution curve) have to obey:
dx/ds = P
dy/ds = Q
dz/ds = R
because these gradients are given by the direction vector (P,Q,R) above. You can rewrite this as:
or: dx/P = dy/Q = dz/R

You can also derive this more formally by differentiating your first order PDE with respect to s and determine when dx/ds etc will satisfy the pde F=0.

Note that in Q =\eta -y' \xi, you satisfy the characteristic equations if Q=0 (y'=dy/dx). So, yes it is just the characteristic equation, slightly rewritten.

Also, the first integral leads to an equation of the form \phi(x,y)=C. It doesn't mean that the first integral itself is constant (except on the solution curve!), for example the first integral can be \phi(x,y)=x^2+y^2=C.

Hope this helps. I found Hydon's book useful, but the classic is from Bluman and Kumei. Also, The book of Hans Stephani was quite enjoyable.
 
Thank you for the reply bigfooted, but I'm afraid your response raises some more questions, but it does indeed answer and shed light on a lot! heh, let me try to progress the discussion:

bigfooted said:
In symmetry analysis, you arrive at first order linear PDE's, e.g. your equations 1a,2b. So some background of first order PDE's is necessary:

Consider the following general first order PDE:
F = P(x,y,z)z_{x}+Q(x,y,z)z_{y}-R(x,y,z)=0

The PDE simply says that the scalar product of [P,Q,R] and [z_x,z_y,-1] vanishes. Because they vanish, these two vectors are perpendicular.

Looking at these things with a vector interpretation (which my book NEVER talked about *sigh*) does seem necessary, but how do we interpret the RHS of these equations as a vector? Here it could be easy, all of the components would be zero, but what about my equation 1b (there was a typo where I called it "2b"..) where the RHS is equal to 1?

bigfooted said:
We also have the relationship that dz=z_xdx +z_ydy due to the chain rule.
Now, the normal vector in a point B(x,y,z) on the solution curve z(x,y) is simply [z_x,z_y,-1]. You might see this intuitively, but it is also given by the chain rule above.

Now, because [z_x,z_y,-1] is normal to the solution surface,

I don't think I quite follow how we know the part I have bolded.

bigfooted said:
Now, define a curve s on the solution z(x,y). We can interpret s as a coordinate axis projected onto the solution and we can express x,y,z in terms of s: x(s),y(s),z(s). We now know that the gradients of x(s),y(s),z(s) with respect to s (which is lying on the solution curve) have to obey:
dx/ds = P
dy/ds = Q
dz/ds = R
because these gradients are given by the direction vector (P,Q,R) above. You can rewrite this as:
or: dx/P = dy/Q = dz/R

Okay, I think I follow this although we would need another curve (say 'r') to map a surface to this new curvilinear axes. I could see the relation of this to what I'm doing, but I think I need to think about both of our posts a lot longer to really "get it"..

However, this part brings up another question. It seems all well and good to do this with our vector interpretation of the PDE, but only if all of the components on the RHS are zero. In my post, there was that pesky equation 1b that I'm on about that was equal to "1". How would I come up with such a clean set of 3 equations if I didn't have the RHS of the PDE equal to 0? This whole seem sounds very PDE specific. Other than that I do indeed see how dx/P = dy/Q = dz/R = ds comes about.

bigfooted said:
You can also derive this more formally by differentiating your first order PDE with respect to s and determine when dx/ds etc will satisfy the pde F=0.

Again I agree as long as it's homogeneous.

bigfooted said:
Note that in Q =\eta -y' \xi, you satisfy the characteristic equations if Q=0 (y'=dy/dx). So, yes it is just the characteristic equation, slightly rewritten.

Not sure about the links between my equation #3 and the characteristic equation still :/

bigfooted said:
Also, the first integral leads to an equation of the form \phi(x,y)=C. It doesn't mean that the first integral itself is constant (except on the solution curve!), for example the first integral can be \phi(x,y)=x^2+y^2=C.

Ohhhh, they're referring to a "level curve". It was written out in plain mathematical language there for me, but I guess because I didn't see the words "level curve" or any description like you gave so I didn't see it. Thanks a lot for this, not sure why they chose to word it the way they did...

bigfooted said:
Hope this helps. I found Hydon's book useful, but the classic is from Bluman and Kumei. Also, The book of Hans Stephani was quite enjoyable.

Oh okay, so the classic is from Bluman and Kumei, I'll have to check it out!

Thanks for trying to get to the bottom of this seemingly intractable problem with me! I feel like you've shown me most of the way and when I completely figure it out I'm just going to sit awestruck at how impossible it would have been to figure this out from the book :(
 

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 0 ·
Replies
0
Views
4K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
4K