Newbie question: Algebra of Mahalanobis distance

  • Context: Undergrad 
  • Thread starter Thread starter anja.ende
  • Start date Start date
  • Tags Tags
    Algebra
Click For Summary

Discussion Overview

The discussion revolves around the algebraic formulation of the Mahalanobis distance, specifically the placement of the covariance matrix within the expression. Participants explore the mathematical and intuitive reasoning behind this formulation in the context of multidimensional statistics.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification

Main Points Raised

  • Anja questions the intuitive reason for the covariance matrix being placed in the middle of the dot product in the Mahalanobis distance formula.
  • One participant explains that the Mahalanobis distance measures how many standard deviations from the mean a point is, relating the covariance matrix to the concept of standard deviation in multidimensional cases.
  • Another participant elaborates that the expression defines a family of hyperellipsoids in N-dimensional space, suggesting that the Mahalanobis distance reflects the number of standard deviations a point is from the mean.

Areas of Agreement / Disagreement

Participants express different aspects of the Mahalanobis distance without reaching a consensus on the intuitive reasoning behind the placement of the covariance matrix in the formula. The discussion remains exploratory with multiple viewpoints presented.

Contextual Notes

The discussion does not resolve the intuitive understanding of the covariance matrix's placement and relies on various interpretations of the mathematical formulation.

anja.ende
Messages
5
Reaction score
0
Hello,

The Mahalanobis distance or rather its square is defined as :

[itex](X-\mu)^2/\Sigma[/itex] which is then written as

[itex](X-\mu)^{T} Ʃ^{-1}(X-\mu)[/itex]

Ʃ is the covariance matrix. My silly question is why is the sigma placed in the middle of the dot product of the (X-μ) vector with itself. I am sure this makes sense mathematically (this reduces the output to a scalar) but I would like to know the intuitive reason behind it.

Thanks a lot!
Anja
 
Physics news on Phys.org
The idea behind the Mahalanobis distance is that you are measuring how many standard deviations from the mean X is in the one dimensional case. In multidimensional cases, [itex]\Sigma[/itex] is going to be a positive (semi)definite matrix, which will have a unique positive (semi)definite square root which I will call S. S serves the same role as the standard deviation. Then the expression above is the same as

[tex]\left( S^{-1}(X-\mu) \right)^T \left(S^{-1}(X-\mu) \right)[/tex]

basically, you scale the random vector [itex]X-\mu[/itex] by the standard deviation, the same as you would in the one dimensional case.
 
anja.ende said:
[itex](X-\mu)^{T} Ʃ^{-1}(X-\mu)[/itex]

Ʃ is the covariance matrix. My silly question is why is the sigma placed in the middle of the dot product of the (X-μ) vector with itself. I am sure this makes sense mathematically (this reduces the output to a scalar) but I would like to know the intuitive reason behind it.
The expression ##(X-\mu)^T \Sigma^{-1}(X-\mu) = \sigma^2## defines a family of hyperellipsoids in the N-dimensional space in which X and μ live, characterized by the scalar parameter σ. I used σ intentionally. Think of σ as representing "standard deviations". For example, ##(X-\mu)^T \Sigma^{-1}(X-\mu) = 1## is the one sigma hyperellipsoid.

The Mahalanobis distance is essentially a measure of how many standard deviations a point X is from the mean μ.
 
Thank you guys!
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
5K
  • · Replies 7 ·
Replies
7
Views
1K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 4 ·
Replies
4
Views
5K