Trace of a matrix and an expansion of eigenvectors

Click For Summary
SUMMARY

This discussion focuses on deriving the Kullback-Leibler divergence between two multivariate Gaussian distributions using the property of orthonormal eigenvectors. The key conclusion is that if E has orthonormal eigenvectors \( u_{i} \) and eigenvalues \( \lambda_{i} \), then the trace of the product \( A*E \) can be expressed as \( \text{trace}(A*E) = \sum_{i} u^{T}_{i}*A*u_{i}*\lambda_{i} \). The method involves transforming to a basis defined by the eigenvectors and then returning to a general basis to achieve the desired result.

PREREQUISITES
  • Understanding of multivariate Gaussian distributions
  • Knowledge of eigenvalues and eigenvectors
  • Familiarity with matrix operations and properties
  • Concept of trace in linear algebra
NEXT STEPS
  • Study the derivation of Kullback-Leibler divergence for multivariate distributions
  • Learn about orthonormal bases in linear algebra
  • Explore the properties of the trace operator in matrix theory
  • Investigate block matrix techniques for simplifying matrix expressions
USEFUL FOR

Mathematicians, data scientists, and statisticians who are working with multivariate Gaussian distributions and require a deeper understanding of eigenvector properties and matrix operations.

CuppoJava
Messages
23
Reaction score
0
Hi,
I'm trying to derive the Kullback-Leibler divergence between two multi-variate gaussian distributions, and I need the following property. Is there a simple way to understand this?

Prove that:
Given that E has orthonormal eigenvectors u_{i} and eigenvalues \lambda_{i}

Then:
trace(A*E) = \sum_{i}u^{T}_{i}*A*u_{i}*\lambda_{i}

I'm not quite sure how to start. I suspected that it can be proven by looking at block matrices but I didn't get anywhere with that. Thanks a lot for your help.
-Patrick
 
Physics news on Phys.org
Hi Patrick! :smile:

(have a sigma: ∑ and a lambda: λ and try using the X2 tag just above the Reply box :wink:)

If the ui are orthonormal, then they can be used as a basis, and in that basis tr(AE) = ∑ λiAii

Then transform back to a general basis, and you get the result given.

(there's probably a more direct way of doing it, also! :wink:)
 
Thanks Tiny_Tim. It took me a while to figure out the details but I finally got it. Your advice worked perfectly!

PS: And thanks for the generous greek letters. =)
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
4K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 4 ·
Replies
4
Views
6K
Replies
2
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 6 ·
Replies
6
Views
5K
  • · Replies 6 ·
Replies
6
Views
5K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 6 ·
Replies
6
Views
5K
Replies
8
Views
24K