Why Recast Second-Order ODEs as Sturm-Liouville Problems?

bolbteppa · Jul 8, 2013

In thinking about Sturm-Liouville theory a bit I see I have no actual idea what is going on.

The first issue I have is that my book began with the statement that given

$$L[y] = a(x)y'' + b(x)y' + c(x)y = f(x)$$

the problem L[y] \ = \ f can be re-cast in the form L[y] \ = \ \lambda y.

Now it could be a typo on their part but I see no justification for the way you can just do that!

More importantly though is the motivation for Sturm-Liouville theory in the first place. The story as I know it is as follows:

Given a linear second order ode

$$F(x,y,y',y'') = L[y] = a(x)y'' + b(x)y' + c(x)y = f(x)$$

it is an exact equation if it is derivable from a differential equation of one order lower, i.e.

$$F(x,y,y',y'') = \frac{d}{dx}g(x,y,y').$$

The equation is exact iff

$$a''(x) - b'(x) + c(x) = 0. $$

If F is not exact it can be made exact on multiplication by a suitable integrating factor $\alpha(x)$.

This equation is exact iff

$$(\alpha(x)a(x))'' - (\alpha(x)b(x))' + \alpha(x)c(x) = 0 $$

If you expand this out you get the Adjoint operator

$$L^*[\alpha(x)] \ = \ (\alpha(x)a(x))'' \ - \ (\alpha(x)b(x))' \ + \ \alpha(x)c(x) \ = 0 $$

If you expand L^* you see that we can satisfy L \ = \ L^* if a'(x) \ = \ b(x) & a''(x) \ = \ b'(x) which then turns L[y] into something of the form

$$L[y] \ = \ \frac{d}{dx}[a(x)y'] \ + \ c(x)y \ = \ f(x).$$

Thus we seek an integratiing factor \alpha(x) so that we can satisfy this & the condition this will hold is that \alpha(x) \ = \ \frac{1}{a(x)}e^{\int\frac{b(x)}{a(x)}dx}

Then we're dealing with:

$$\frac{d}{dx}[\alpha(x)a(x)y'] \ + \ \alpha(x)c(x)y \ = \ \alpha(x)f(x)$$

But again, by what my book said they magically re-cast this problem as

$$\frac{d}{dx}[\alpha(x)a(x)y'] \ + \ \alpha(x)c(x)y \ = \ \lambda \alpha(x) y(x)$$

Then calling

$$\frac{d}{dx}[\alpha(x)a(x)y'] \ + \ ( \alpha(x)c(x)y \ - \ \lambda \alpha(x) )y(x) \ = \ 0$$

a Sturm-Liouville problem.

My question is, how can I make sense of everything I wrote above? How can I clean it up & interpret it, like at one stage I thought we were turning our 2nd order ode into something so that it reduces to the derivative of a first order ode so we can easily find first integrals then the next moment we're pulling out eigenvalues & finding full solutions - what's going on? I want to be able to look at a(x)y'' \ + \ b(x)y' \ + \ c(x)y \ = \ f(x) & know how & why we're turning this into a Sturm-Liouville problem in a way that makes sense of exactness & integrating factors, thanks for reading!

the_wolfman · Jul 9, 2013

First and foremost, its important to keep in mind that the purpose of Strum-Liouville theory is to study the eigenvalues and eigenvectors of an ode. It turn out that you can learn a lot about the solution to an ode, and thus gain a significant amount of physical inside into a variety of problems, by identify and studying the eigenvalues and eigenvectors.

So when your book writes Ly = \lambda y this is not an identity. Instead it is an assertion that we are only going to study the eigenvalue problem.

Second, the procedure on applying the integrating factor is a method used to covert a second order ODE into the traditional S-L form.

\frac{-1}{\alpha}\int dx \left( p \frac{dy}{dx} + q y\right) = \lambda y

This form is useful because the differential operator is self-adjoint over the inner product space defined by
g(x) \cdot f(x) = \int dx g(x) f(x) \alpha(x)

and the above form of the S-L problem allows you to identify the correct weight to use in the inner product.

bolbteppa · Jul 10, 2013

Thank you, I don't see how that explains why L[y] \ = \ f can be re-cast in the form L[y] \ = \ \lambda y though, I know we're studying eigenvalues of the operator but how that justifies L[y] \ = \ f becoming L[y] \ = \ \lambda y is beyond me - what does the author mean when he says that? He does it more than once, even at a crucial step in the derivation of the sturm-liouville problem as a whole, so I think there's more to it than just saying we're studying the operator.

Also I've never seen a sturm-liouville form written as you have done in terms of integration - do you have a reference?

Is there nothing more one can say as regards sturm-liouvlle theory when looking at a(x)y'' \ + \ b(x)y' \ + \ c(x)y \ = \ f(x) other than to say finding the eigenvalues of the differential operator (homogeneous differential operator?) sheds light on the solutions?

the_wolfman · Jul 10, 2013

Also I've never seen a sturm-liouville form written as you have done in terms of integration - do you have a reference?

That would be because its a typo. It should be of the form:

\frac{1}{a}\left(-\frac{d}{dx}\left( p(x) \frac{dy}{dx} \right) + q(x) y \right)=\lambda y

Thank you, I don't see how that explains why L[y] = f can be re-cast in the form L[y] = λy

As I understand it, the formal S-L problem is to consider the eigenvalue problem for the operator L. I may be wrong,
However, under certain conditions the eigenvectors of the L[y] form an orthonormal basis. In this case you can project f onto that basis and the ODE L[y] = f becomes equivalent to a sum of ODEs of the from L[y] = λy.

Why Recast Second-Order ODEs as Sturm-Liouville Problems?

Similar threads

Undergrad A question about variables of integration

Graduate About ellipticity and a proof that a system of PDEs is elliptic

Graduate Should the boundary condition have to satisfy dimensional consistency?

Undergrad Direction Fields and Isoclines

Graduate Volterra equation of the second kind

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers