Is there a formula for calculating the derivative of the largest eigenvalue?

Leo321 · Oct 20, 2010

Assume A=f(t) is an nxn matrix with all elements being non-negative. The eigenvalue with the largest absolute value is real and positive. We will call it r.
Is there an analytical way to calculate dr/dt, or at least find the values of t for which dr/dt=0?

mattrb · Nov 20, 2010

Leo,

The largest eigenvalue of a matrix is a continuous function of A. (Demmel Applied Numerical Linear Algebra). If the largest eigenvalue has multiplicity 1 then it should be possible to calculate the derivative of the largest eigenvalue should be defined. I would consider your matrix as being [tex]A[t]=A[t_0]+A'[t_0](t-t_0)+...[/tex] (i.e. ignore second order time derivative terms). Then consider [tex]A=X\Sigma X^{-1}[/tex] (the diagonalization). Differentiating you get.
[tex]d\lambda_1/dt=y_1^*A'[t_0]x_1[/tex]. Where [tex]y_1[/tex] and [tex]x_1[/tex] are the left and right eigenvectors of [tex]A[t_0][/tex] respectively corresponding to the largest eigenvalue. They are the first row of [tex]X^{-1}[/tex] and the first column of [tex]X[/tex] respectively if the eigenvalues are sorted in descending order in [tex]\Sigma[/tex].
"www.ics.forth.gr/cvrl/publications/.../2000_eccv_SVD_jacobian.pdf"[/URL] gives the procedure for doing this with the SVD. I just generalized the result. Please check it over and do the detailed computations before using this formula.

I hope I was able to help!

Matthew

AlephZero · Nov 20, 2010

mattrb said:

Leo,

The largest eigenvalue of a matrix is a continuous function of A. (Demmel Applied Numerical Linear Algebra).
Matthew

I'm not convinced about that for a general matrix (see below) - though it may be true if the largest eigenvalue is always real.

But the largest EV is not a differentiable function of A (at least, not everywhere) even if it is continuous.

Counterexample: a 2x2 diagonal matrix with A11 = 1, A12 = t

The max eigenvalue = 1 if t <= 1, t if t >= 1. This is not differentiable when t = 1.

The reason is connected to the fact that the corresponding eigenvector is not even a continuous function. It "flips" from (1, 0) to (0, 1) when t passes through 1.

It is easy to construct an 4x4 matrix with two pairs of complex conjugate eigenvalues, where the largest eigenvalue (maximum absolute value) is not even continuous when it "jumps" from being a member of one pair to a member of the other pair. The OP said the largest eigenvalue is real, but it's hard to characterize what that means for a completely general matrix.

mattrb · Nov 20, 2010

Aleph, you're perfectly right about the complex example and the general non-differentiability of eigenvalues. Eigenvectors are ill-posed when there are duplicate eigenvalues, since any linear combination of the two (or more) eigenvectors is also an eigenvector. However, Demmel states that eigenvalues are always continuous functions of the matrix entries, even if they are not differentiable. The max function applied to real functions does not affect continuity though it might affect differentiability (if two eigenvalues are equal). 'largest' is subjective when using the complex number system.

Matthew

AlephZero · Nov 22, 2010

If you know an eigenvalue [tex]\lambda[/tex] and the corresponding normalized eigenvector [tex]x[/tex] you can find derivative like this:

You know [tex]Ax = \lambda x[/tex] and [tex]x^T x = 1[/tex]

[tex](A + dA)(x + dx) = (\lambda + d\lambda)(x + dx)[/tex]

Ignoring second order terms, that expands to

[tex]A.dx + dA.x = \lambda.dx + d\lambda.x[/tex]

You also want [tex](x + dx)[/tex] to be normalized. That gives

[tex](x + dx)^T(x+dx) = 1[/tex]

Ignoring second order terms, that expands to

[tex]x^T dx = 0[/tex] (which is an interesting fact in its own right)

So you have [tex]n+1[/tex] equations that you can solve for the [tex]n[/tex] components of [tex]dx[/tex], and the value of [tex]d\lambda[/tex].

If [tex]A = A(t)[/tex] you can obviously recast the idea in terms of derivatives w.r.t. [tex]t[/tex].

If all the eigenvalues are distinct, this works for any eigenpair, not just the largest. And if all the eigenvalues are distinct there is no ambiguity about which is the largest, so my earlier quibbles don't apply.

Is there a formula for calculating the derivative of the largest eigenvalue?

Discussion Overview

Discussion Character

Main Points Raised

Areas of Agreement / Disagreement

Contextual Notes

Similar threads

Undergrad The vector to which a dual vector corresponds

Graduate Confusion about the Moyal-Weyl twist

Undergrad How to define a vector field?

Undergrad 2 interpretations of bra-ket expression: equal, & isomorphic, but...

Undergrad Erroneously finding discrepancy in transpose rule

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight