Why Recast Second-Order ODEs as Sturm-Liouville Problems?

  • Context: Graduate 
  • Thread starter Thread starter bolbteppa
  • Start date Start date
Click For Summary

Discussion Overview

The discussion revolves around the motivation and justification for recasting second-order ordinary differential equations (ODEs) into Sturm-Liouville problems. Participants explore the implications of this transformation, particularly in relation to eigenvalues and eigenvectors, as well as the role of integrating factors and exactness in the context of Sturm-Liouville theory.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant expresses confusion about the transition from the equation L[y] = f to L[y] = λy, questioning the justification for this transformation.
  • Another participant emphasizes that Sturm-Liouville theory focuses on studying eigenvalues and eigenvectors, suggesting that this is the primary reason for the shift in focus from L[y] = f to L[y] = λy.
  • A participant mentions that the integrating factor is a method to convert a second-order ODE into the Sturm-Liouville form, which allows for the identification of the correct weight in the inner product space.
  • There is a request for clarification on the integration form of the Sturm-Liouville problem, indicating that it may not be widely recognized or accepted.
  • One participant proposes that under certain conditions, the eigenvectors of L[y] can form an orthonormal basis, allowing the projection of f onto this basis, which leads to a sum of ODEs of the form L[y] = λy.
  • Another participant points out a potential typo in the representation of the Sturm-Liouville form, suggesting a correction to the equation presented.

Areas of Agreement / Disagreement

Participants express differing views on the justification for the transformation of L[y] = f to L[y] = λy, indicating that the discussion remains unresolved. There is also disagreement regarding the correct representation of the Sturm-Liouville form.

Contextual Notes

Participants highlight the importance of exactness and integrating factors in the transformation process, but the discussion does not resolve the underlying assumptions or conditions necessary for these transformations.

bolbteppa
Messages
300
Reaction score
41
In thinking about Sturm-Liouville theory a bit I see I have no actual idea what is going on.

The first issue I have is that my book began with the statement that given

$$L[y] = a(x)y'' + b(x)y' + c(x)y = f(x)$$

the problem L[y] \ = \ f can be re-cast in the form L[y] \ = \ \lambda y.

Now it could be a typo on their part but I see no justification for the way you can just do that!

More importantly though is the motivation for Sturm-Liouville theory in the first place. The story as I know it is as follows:

Given a linear second order ode

$$F(x,y,y',y'') = L[y] = a(x)y'' + b(x)y' + c(x)y = f(x)$$

it is an exact equation if it is derivable from a differential equation of one order lower, i.e.

$$F(x,y,y',y'') = \frac{d}{dx}g(x,y,y').$$

The equation is exact iff

$$a''(x) - b'(x) + c(x) = 0. $$

If F is not exact it can be made exact on multiplication by a suitable integrating factor $\alpha(x)$.

This equation is exact iff

$$(\alpha(x)a(x))'' - (\alpha(x)b(x))' + \alpha(x)c(x) = 0 $$

If you expand this out you get the Adjoint operator

$$L^*[\alpha(x)] \ = \ (\alpha(x)a(x))'' \ - \ (\alpha(x)b(x))' \ + \ \alpha(x)c(x) \ = 0 $$

If you expand L^* you see that we can satisfy L \ = \ L^* if a'(x) \ = \ b(x) & a''(x) \ = \ b'(x) which then turns L[y] into something of the form

$$L[y] \ = \ \frac{d}{dx}[a(x)y'] \ + \ c(x)y \ = \ f(x).$$

Thus we seek an integratiing factor \alpha(x) so that we can satisfy this & the condition this will hold is that \alpha(x) \ = \ \frac{1}{a(x)}e^{\int\frac{b(x)}{a(x)}dx}

Then we're dealing with:

$$\frac{d}{dx}[\alpha(x)a(x)y'] \ + \ \alpha(x)c(x)y \ = \ \alpha(x)f(x)$$

But again, by what my book said they magically re-cast this problem as

$$\frac{d}{dx}[\alpha(x)a(x)y'] \ + \ \alpha(x)c(x)y \ = \ \lambda \alpha(x) y(x)$$

Then calling

$$\frac{d}{dx}[\alpha(x)a(x)y'] \ + \ ( \alpha(x)c(x)y \ - \ \lambda \alpha(x) )y(x) \ = \ 0$$

a Sturm-Liouville problem.

My question is, how can I make sense of everything I wrote above? How can I clean it up & interpret it, like at one stage I thought we were turning our 2nd order ode into something so that it reduces to the derivative of a first order ode so we can easily find first integrals then the next moment we're pulling out eigenvalues & finding full solutions - what's going on? I want to be able to look at a(x)y'' \ + \ b(x)y' \ + \ c(x)y \ = \ f(x) & know how & why we're turning this into a Sturm-Liouville problem in a way that makes sense of exactness & integrating factors, thanks for reading!
 
Physics news on Phys.org
First and foremost, its important to keep in mind that the purpose of Strum-Liouville theory is to study the eigenvalues and eigenvectors of an ode. It turn out that you can learn a lot about the solution to an ode, and thus gain a significant amount of physical inside into a variety of problems, by identify and studying the eigenvalues and eigenvectors.

So when your book writes Ly = \lambda y this is not an identity. Instead it is an assertion that we are only going to study the eigenvalue problem.


Second, the procedure on applying the integrating factor is a method used to covert a second order ODE into the traditional S-L form.

\frac{-1}{\alpha}\int dx \left( p \frac{dy}{dx} + q y\right) = \lambda y

This form is useful because the differential operator is self-adjoint over the inner product space defined by
g(x) \cdot f(x) = \int dx g(x) f(x) \alpha(x)

and the above form of the S-L problem allows you to identify the correct weight to use in the inner product.
 
Thank you, I don't see how that explains why L[y] \ = \ f can be re-cast in the form L[y] \ = \ \lambda y though, I know we're studying eigenvalues of the operator but how that justifies L[y] \ = \ f becoming L[y] \ = \ \lambda y is beyond me - what does the author mean when he says that? He does it more than once, even at a crucial step in the derivation of the sturm-liouville problem as a whole, so I think there's more to it than just saying we're studying the operator.

Also I've never seen a sturm-liouville form written as you have done in terms of integration - do you have a reference?

Is there nothing more one can say as regards sturm-liouvlle theory when looking at a(x)y'' \ + \ b(x)y' \ + \ c(x)y \ = \ f(x) other than to say finding the eigenvalues of the differential operator (homogeneous differential operator?) sheds light on the solutions?
 
Also I've never seen a sturm-liouville form written as you have done in terms of integration - do you have a reference?

That would be because its a typo. It should be of the form:

\frac{1}{a}\left(-\frac{d}{dx}\left( p(x) \frac{dy}{dx} \right) + q(x) y \right)=\lambda y

Thank you, I don't see how that explains why L[y] = f can be re-cast in the form L[y] = λy

As I understand it, the formal S-L problem is to consider the eigenvalue problem for the operator L. I may be wrong,
However, under certain conditions the eigenvectors of the L[y] form an orthonormal basis. In this case you can project f onto that basis and the ODE L[y] = f becomes equivalent to a sum of ODEs of the from L[y] = λy.
 

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 0 ·
Replies
0
Views
4K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 2 ·
Replies
2
Views
3K