Newbie question: Algebra of Mahalanobis distance

anja.ende · Nov 11, 2013

Hello,

The Mahalanobis distance or rather its square is defined as :

(X-\mu)^2/\Sigma which is then written as

(X-\mu)^{T} Ʃ^{-1}(X-\mu)

Ʃ is the covariance matrix. My silly question is why is the sigma placed in the middle of the dot product of the (X-μ) vector with itself. I am sure this makes sense mathematically (this reduces the output to a scalar) but I would like to know the intuitive reason behind it.

Thanks a lot!
Anja

Office_Shredder · Nov 11, 2013

The idea behind the Mahalanobis distance is that you are measuring how many standard deviations from the mean X is in the one dimensional case. In multidimensional cases, \Sigma is going to be a positive (semi)definite matrix, which will have a unique positive (semi)definite square root which I will call S. S serves the same role as the standard deviation. Then the expression above is the same as

\left( S^{-1}(X-\mu) \right)^T \left(S^{-1}(X-\mu) \right)

basically, you scale the random vector X-\mu by the standard deviation, the same as you would in the one dimensional case.

D H · Nov 11, 2013

anja.ende said:

(X-\mu)^{T} Ʃ^{-1}(X-\mu)

Ʃ is the covariance matrix. My silly question is why is the sigma placed in the middle of the dot product of the (X-μ) vector with itself. I am sure this makes sense mathematically (this reduces the output to a scalar) but I would like to know the intuitive reason behind it.

The expression ##(X-\mu)^T \Sigma^{-1}(X-\mu) = \sigma^2## defines a family of hyperellipsoids in the N-dimensional space in which X and μ live, characterized by the scalar parameter σ. I used σ intentionally. Think of σ as representing "standard deviations". For example, ##(X-\mu)^T \Sigma^{-1}(X-\mu) = 1## is the one sigma hyperellipsoid.

The Mahalanobis distance is essentially a measure of how many standard deviations a point X is from the mean μ.

anja.ende · Nov 11, 2013

Thank you guys!

Newbie question: Algebra of Mahalanobis distance

Thread 'Trouble understanding an online solution to an exercise in Dummit & Foote'

Thread 'Questions about non existence of GCDs vs (coimages, cokernels)'

Thread 'Decomposition into irreps of compact Lie group'

Similar threads

Hot Threads

I How to show ##p(x)=g(x)x\pm 1\in\Bbb{Q}[x]## is irreducible in ##\Bbb{Q}_{\Bbb{Z}}[x]##?

A Question about ##FG## modules

I Showing ##k[x_1,\ldots,x_n]/\mathfrak{a}## is finite dimensional

A Near-Rings with Noncommutative Addition and Two-Sided Distributivity

I How do we distinguish two different notations for cokernel and coimage?

Recent Insights

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers

Insights Fermat's Last Theorem

Insights Why Vector Spaces Explain The World: A Historical Perspective