Is there a formula for calculating the derivative of the largest eigenvalue?

In summary, the largest eigenvalue of a matrix is a continuous function of A, but it is not differentiable.
  • #1
Leo321
38
0
Assume A=f(t) is an nxn matrix with all elements being non-negative. The eigenvalue with the largest absolute value is real and positive. We will call it r.
Is there an analytical way to calculate dr/dt, or at least find the values of t for which dr/dt=0?
 
Last edited:
Physics news on Phys.org
  • #2
Leo,

The largest eigenvalue of a matrix is a continuous function of A. (Demmel Applied Numerical Linear Algebra). If the largest eigenvalue has multiplicity 1 then it should be possible to calculate the derivative of the largest eigenvalue should be defined. I would consider your matrix as being [tex]A[t]=A[t_0]+A'[t_0](t-t_0)+...[/tex] (i.e. ignore second order time derivative terms). Then consider [tex] A=X\Sigma X^{-1}[/tex] (the diagonalization). Differentiating you get.
[tex]d\lambda_1/dt=y_1^*A'[t_0]x_1[/tex]. Where [tex]y_1[/tex] and [tex]x_1[/tex] are the left and right eigenvectors of [tex]A[t_0][/tex] respectively corresponding to the largest eigenvalue. They are the first row of [tex]X^{-1}[/tex] and the first column of [tex]X[/tex] respectively if the eigenvalues are sorted in descending order in [tex] \Sigma[/tex].
"www.ics.forth.gr/cvrl/publications/.../2000_eccv_SVD_jacobian.pdf"[/URL] gives the procedure for doing this with the SVD. I just generalized the result. Please check it over and do the detailed computations before using this formula.

I hope I was able to help!

Matthew
 
Last edited by a moderator:
  • #3
mattrb said:
Leo,

The largest eigenvalue of a matrix is a continuous function of A. (Demmel Applied Numerical Linear Algebra).
Matthew

I'm not convinced about that for a general matrix (see below) - though it may be true if the largest eigenvalue is always real.

But the largest EV is not a differentiable function of A (at least, not everywhere) even if it is continuous.

Counterexample: a 2x2 diagonal matrix with A11 = 1, A12 = t

The max eigenvalue = 1 if t <= 1, t if t >= 1. This is not differentiable when t = 1.

The reason is connected to the fact that the corresponding eigenvector is not even a continuous function. It "flips" from (1, 0) to (0, 1) when t passes through 1.

It is easy to construct an 4x4 matrix with two pairs of complex conjugate eigenvalues, where the largest eigenvalue (maximum absolute value) is not even continuous when it "jumps" from being a member of one pair to a member of the other pair. The OP said the largest eigenvalue is real, but it's hard to characterize what that means for a completely general matrix.
 
Last edited:
  • #4
Aleph, you're perfectly right about the complex example and the general non-differentiability of eigenvalues. Eigenvectors are ill-posed when there are duplicate eigenvalues, since any linear combination of the two (or more) eigenvectors is also an eigenvector. However, Demmel states that eigenvalues are always continuous functions of the matrix entries, even if they are not differentiable. The max function applied to real functions does not affect continuity though it might affect differentiability (if two eigenvalues are equal). 'largest' is subjective when using the complex number system.

Matthew
 
  • #5
If you know an eigenvalue [tex]\lambda[/tex] and the corresponding normalized eigenvector [tex]x[/tex] you can find derivative like this:

You know [tex]Ax = \lambda x[/tex] and [tex]x^T x = 1[/tex]

[tex](A + dA)(x + dx) = (\lambda + d\lambda)(x + dx)[/tex]

Ignoring second order terms, that expands to

[tex]A.dx + dA.x = \lambda.dx + d\lambda.x[/tex]

You also want [tex](x + dx)[/tex] to be normalized. That gives

[tex](x + dx)^T(x+dx) = 1[/tex]

Ignoring second order terms, that expands to

[tex]x^T dx = 0[/tex] (which is an interesting fact in its own right)

So you have [tex]n+1[/tex] equations that you can solve for the [tex]n[/tex] components of [tex]dx[/tex], and the value of [tex]d\lambda[/tex].

If [tex]A = A(t)[/tex] you can obviously recast the idea in terms of derivatives w.r.t. [tex]t[/tex].

If all the eigenvalues are distinct, this works for any eigenpair, not just the largest. And if all the eigenvalues are distinct there is no ambiguity about which is the largest, so my earlier quibbles don't apply.
 

1. What is the formula for calculating the derivative of the largest eigenvalue?

The formula for calculating the derivative of the largest eigenvalue is known as the Rayleigh quotient gradient formula. It states that the derivative of the largest eigenvalue with respect to a matrix A is equal to the product of the inverse of the matrix A and its corresponding eigenvector.

2. How is the Rayleigh quotient gradient formula derived?

The Rayleigh quotient gradient formula is derived using the matrix calculus technique known as the chain rule. It involves taking the derivative of the Rayleigh quotient, which is a function of the matrix A and its corresponding eigenvector, with respect to the matrix A.

3. Can the Rayleigh quotient gradient formula be used for any type of matrix?

Yes, the Rayleigh quotient gradient formula can be used for any type of matrix, including square matrices, symmetric matrices, and non-symmetric matrices. However, it is most commonly used for symmetric matrices as it simplifies the calculation process.

4. Are there any limitations to using the Rayleigh quotient gradient formula?

One limitation of the Rayleigh quotient gradient formula is that it can only be used to calculate the derivative of the largest eigenvalue. It cannot be used to calculate the derivative of other eigenvalues. Additionally, it may not be suitable for matrices with repeated eigenvalues.

5. How is the derivative of the largest eigenvalue used in applications?

The derivative of the largest eigenvalue has various applications in fields such as physics, engineering, and economics. It can be used to optimize functions, identify critical points, and analyze stability in systems. It is also used in machine learning algorithms for dimensionality reduction and feature extraction.

Similar threads

  • Linear and Abstract Algebra
Replies
1
Views
596
  • Linear and Abstract Algebra
Replies
2
Views
2K
  • Engineering and Comp Sci Homework Help
Replies
18
Views
2K
  • Quantum Physics
Replies
2
Views
965
Replies
3
Views
2K
  • Linear and Abstract Algebra
Replies
7
Views
2K
  • Linear and Abstract Algebra
Replies
1
Views
1K
  • Calculus and Beyond Homework Help
Replies
2
Views
521
Replies
5
Views
1K
  • Linear and Abstract Algebra
Replies
8
Views
2K
Back
Top