Is there a formula for calculating the derivative of the largest eigenvalue?

Click For Summary

Discussion Overview

The discussion revolves around the calculation of the derivative of the largest eigenvalue of a time-dependent matrix, specifically focusing on whether there exists an analytical method to compute this derivative or identify points where it equals zero. The scope includes theoretical considerations and mathematical reasoning related to eigenvalues and their properties.

Discussion Character

  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • Some participants propose that the largest eigenvalue of a matrix is a continuous function of the matrix itself, particularly when the largest eigenvalue has multiplicity 1.
  • Others argue that the largest eigenvalue may not be differentiable everywhere, providing a counterexample of a 2x2 diagonal matrix where the eigenvalue is not differentiable at a specific point.
  • A participant mentions that eigenvectors can be ill-posed when there are duplicate eigenvalues, complicating the differentiation process.
  • Another viewpoint suggests that while eigenvalues are continuous functions of the matrix entries, their differentiability can be affected by the presence of equal eigenvalues.
  • One participant describes a method for calculating the derivative of an eigenvalue using perturbation theory, emphasizing the importance of normalization of eigenvectors in the process.
  • It is noted that if all eigenvalues are distinct, the method for finding derivatives applies to any eigenpair, not just the largest eigenvalue.

Areas of Agreement / Disagreement

Participants express differing views on the differentiability of the largest eigenvalue, with some asserting continuity and others highlighting potential non-differentiability in certain cases. No consensus is reached regarding the conditions under which the derivative can be calculated.

Contextual Notes

The discussion highlights limitations related to the assumptions about the matrix structure, the implications of eigenvalue multiplicity, and the conditions under which eigenvalues and eigenvectors remain continuous or differentiable.

Leo321
Messages
38
Reaction score
0
Assume A=f(t) is an nxn matrix with all elements being non-negative. The eigenvalue with the largest absolute value is real and positive. We will call it r.
Is there an analytical way to calculate dr/dt, or at least find the values of t for which dr/dt=0?
 
Last edited:
Physics news on Phys.org
Leo,

The largest eigenvalue of a matrix is a continuous function of A. (Demmel Applied Numerical Linear Algebra). If the largest eigenvalue has multiplicity 1 then it should be possible to calculate the derivative of the largest eigenvalue should be defined. I would consider your matrix as being [tex]A[t]=A[t_0]+A'[t_0](t-t_0)+...[/tex] (i.e. ignore second order time derivative terms). Then consider [tex]A=X\Sigma X^{-1}[/tex] (the diagonalization). Differentiating you get.
[tex]d\lambda_1/dt=y_1^*A'[t_0]x_1[/tex]. Where [tex]y_1[/tex] and [tex]x_1[/tex] are the left and right eigenvectors of [tex]A[t_0][/tex] respectively corresponding to the largest eigenvalue. They are the first row of [tex]X^{-1}[/tex] and the first column of [tex]X[/tex] respectively if the eigenvalues are sorted in descending order in [tex]\Sigma[/tex].
"www.ics.forth.gr/cvrl/publications/.../2000_eccv_SVD_jacobian.pdf"[/URL] gives the procedure for doing this with the SVD. I just generalized the result. Please check it over and do the detailed computations before using this formula.

I hope I was able to help!

Matthew
 
Last edited by a moderator:
mattrb said:
Leo,

The largest eigenvalue of a matrix is a continuous function of A. (Demmel Applied Numerical Linear Algebra).
Matthew

I'm not convinced about that for a general matrix (see below) - though it may be true if the largest eigenvalue is always real.

But the largest EV is not a differentiable function of A (at least, not everywhere) even if it is continuous.

Counterexample: a 2x2 diagonal matrix with A11 = 1, A12 = t

The max eigenvalue = 1 if t <= 1, t if t >= 1. This is not differentiable when t = 1.

The reason is connected to the fact that the corresponding eigenvector is not even a continuous function. It "flips" from (1, 0) to (0, 1) when t passes through 1.

It is easy to construct an 4x4 matrix with two pairs of complex conjugate eigenvalues, where the largest eigenvalue (maximum absolute value) is not even continuous when it "jumps" from being a member of one pair to a member of the other pair. The OP said the largest eigenvalue is real, but it's hard to characterize what that means for a completely general matrix.
 
Last edited:
Aleph, you're perfectly right about the complex example and the general non-differentiability of eigenvalues. Eigenvectors are ill-posed when there are duplicate eigenvalues, since any linear combination of the two (or more) eigenvectors is also an eigenvector. However, Demmel states that eigenvalues are always continuous functions of the matrix entries, even if they are not differentiable. The max function applied to real functions does not affect continuity though it might affect differentiability (if two eigenvalues are equal). 'largest' is subjective when using the complex number system.

Matthew
 
If you know an eigenvalue [tex]\lambda[/tex] and the corresponding normalized eigenvector [tex]x[/tex] you can find derivative like this:

You know [tex]Ax = \lambda x[/tex] and [tex]x^T x = 1[/tex]

[tex](A + dA)(x + dx) = (\lambda + d\lambda)(x + dx)[/tex]

Ignoring second order terms, that expands to

[tex]A.dx + dA.x = \lambda.dx + d\lambda.x[/tex]

You also want [tex](x + dx)[/tex] to be normalized. That gives

[tex](x + dx)^T(x+dx) = 1[/tex]

Ignoring second order terms, that expands to

[tex]x^T dx = 0[/tex] (which is an interesting fact in its own right)

So you have [tex]n+1[/tex] equations that you can solve for the [tex]n[/tex] components of [tex]dx[/tex], and the value of [tex]d\lambda[/tex].

If [tex]A = A(t)[/tex] you can obviously recast the idea in terms of derivatives w.r.t. [tex]t[/tex].

If all the eigenvalues are distinct, this works for any eigenpair, not just the largest. And if all the eigenvalues are distinct there is no ambiguity about which is the largest, so my earlier quibbles don't apply.
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
7
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 2 ·
Replies
2
Views
12K
  • · Replies 18 ·
Replies
18
Views
4K
  • · Replies 6 ·
Replies
6
Views
1K
  • · Replies 6 ·
Replies
6
Views
7K
  • · Replies 8 ·
Replies
8
Views
3K