Mahalanobis Distance using Eigen-Values of the Covariance Matrix

Click For Summary
The discussion centers on the Mahalanobis Distance formula and its simplification using Eigen-value decomposition of the covariance matrix. The user encounters issues when the covariance matrix has more variables than observations, leading to zero eigen-values, which disrupts the equivalence of the simplified expression to the original Mahalanobis Distance. It is noted that for the simplified expression to be valid, the covariance matrix must be positive or negative definite, meaning it cannot have zero eigen-values. Suggestions include using Principal Component Analysis or removing problematic variables to address the singularity of the covariance matrix. The simplified expression may not functionally represent the Mahalanobis Distance in cases of zero eigen-values.
orajput
Messages
1
Reaction score
0
Given the formula of Mahalanobis Distance:

D^2_M = (\mathbf{x} - \mathbf{\mu})^T \mathbf{S}^{-1} (\mathbf{x} - \mathbf{\mu})

If I simplify the above expression using Eigen-value decomposition (EVD) of the Covariance Matrix:

S = \mathbf{P} \Lambda \mathbf{P}^T

Then,

D^2_M = (\mathbf{x} - \mathbf{\mu})^T \mathbf{P} \Lambda^{-1} \mathbf{P}^T (\mathbf{x} - \mathbf{\mu})

Let, the projections of (\mathbf{x}-\mu) on all eigen-vectors present in \mathbf{P} be \mathbf{b}, then:

\mathbf{b} = \mathbf{P}^T(\mathbf{x} - \mathbf{\mu})

And,

D^2_M = \mathbf{b}^T \Lambda^{-1} \mathbf{b}

D^2_M = \sum_i{\frac{b^2_i}{\lambda_i}}

The problem that I am facing right now is as follows:

The covariance matrix \mathbf{S} is calculated on a dataset, in which no. of observations are less than the no. of variables. This causes some zero-valued eigen-values after EVD of \mathbf{S}.

In these cases the above simplified expression does not result in the same Mahalanobis Distance as the original expression, i.e.:

(\mathbf{x} - \mathbf{\mu})^T \mathbf{S}^{-1} (\mathbf{x} - \mathbf{\mu}) \neq \sum_i{\frac{b^2_i}{\lambda_i}} (for non-zero \lambda_i)

My question is: Is the simplified expression still functionally represents the Mahalanobis Distance?

P.S.: Motivation to use the simplified expression of Mahalanbis Distance is to calculate its gradient wrt b.
 
Physics news on Phys.org
Hello,

In order to be invertible, S mustn't have zero eigen values, that is , must be positive definite or negative definite. Apart from that , that expression must work...

All the best

GoodSpirit
 
Hey orajput and welcome to the forums.

For your problem, if you do have a singular or ill-conditioned covariance matrix, I would try and do something like Principal Components, or to remove the offending variable from your system and re-do the analysis.
 
I am studying the mathematical formalism behind non-commutative geometry approach to quantum gravity. I was reading about Hopf algebras and their Drinfeld twist with a specific example of the Moyal-Weyl twist defined as F=exp(-iλ/2θ^(μν)∂_μ⊗∂_ν) where λ is a constant parametar and θ antisymmetric constant tensor. {∂_μ} is the basis of the tangent vector space over the underlying spacetime Now, from my understanding the enveloping algebra which appears in the definition of the Hopf algebra...

Similar threads

  • · Replies 1 ·
Replies
1
Views
507
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 12 ·
Replies
12
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 2 ·
Replies
2
Views
871
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 38 ·
2
Replies
38
Views
1K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K