Understanding Mahalanobis distance

  • Context: Undergrad 
  • Thread starter Thread starter Avatrin
  • Start date Start date
  • Tags Tags
    Normal distribution
Click For Summary

Discussion Overview

The discussion centers around the Mahalanobis distance, particularly its rigorous understanding in the context of multivariable normal distribution and its applications in pattern recognition. Participants explore the mathematical properties of the metric, its relation to ellipsoids, and its advantages over other distance measures.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • One participant seeks a rigorous understanding of Mahalanobis distance, emphasizing the need for a mathematical foundation rather than intuitive explanations.
  • Another participant provides links to resources, suggesting that Mahalanobis distance is simply a metric but does not elaborate on its deeper properties.
  • A participant notes that Mahalanobis distance relates to the N-dimensional normal distribution and questions why it is preferred in statistics over other metrics.
  • One participant argues that Euclidean distance is inadequate when dealing with variables on different scales, proposing that Mahalanobis distance should be viewed more as a measurement of intensity rather than a traditional distance.
  • It is mentioned that Mahalanobis distance accounts for variance and covariances, which makes it more suitable for certain statistical applications.

Areas of Agreement / Disagreement

Participants express differing views on the nature and interpretation of Mahalanobis distance, with some emphasizing its metric properties while others challenge its classification as a distance measure. The discussion remains unresolved regarding the best approach to rigorously understand the metric.

Contextual Notes

Participants highlight the need for a deeper exploration of the mathematical properties of Mahalanobis distance, indicating that existing resources may not fully address the complexities involved. There is also mention of varying interpretations of distance in the context of different scales and types of data.

Who May Find This Useful

This discussion may be useful for students and professionals in statistics, pattern recognition, and data analysis who are looking to deepen their understanding of Mahalanobis distance and its applications in multivariable contexts.

Avatrin
Messages
242
Reaction score
6
I am currently taking a course in pattern recognition, and several times I have encountered the multivariable normal distribution and thus, Mahalanobis distance. I want to understand Mahalanobis distance; Primarily for understanding the normal distribution, but also to understand the measure itself.

I have read several intuitive explanations here and in books, but how can I do this rigorously? I have had some, but not much, measure theory (in the part of a real analysis course that covered integration theory). I have had one introductory statistics and probability theory course.

What should I read to understand the Mahalanobis distance? Clearly it is related to ellipoids, but how? Again, I don't want a rough intuitive explanation.
 
Physics news on Phys.org
MarneMath said:
I'm not really sure what else you're looking for with regards to this. It's simply a metric.
Well, I did write I want to understand it in regard to the N dimensional normal distribution. According to Wikipedia, it tells me how many standard deviations a point x is from the mean of the deviation. The books on pattern recognition don't even really mention it.

So, I guess, what I am looking for is the property of the metric. Why is it used in statistics rather than some other metric?
 
Imagine you have units on different scales, then the euclidian distance doesn't really make sense, since you're simply adding the squared units of that measurement. So if it's all the same, then we're good. However, if you have units in different scales and types, the idea of distance becomes a bit more complicated. In fact, I don't really like to think about the Mahalonbis distance as a distance but rather a measurement of intensity.

The number one answer here does a good job of explaining how Mahalonbis does a good job at transforming the data into something reasonable: http://stats.stackexchange.com/questions/62092/bottom-to-top-explanation-of-the-mahalanobis-distance

Overall, though the main advantages are that it considers variance, covariances and unitizes uncorrelated variables for the Euclidian distance.
 

Similar threads

Replies
8
Views
6K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 24 ·
Replies
24
Views
4K
  • · Replies 4 ·
Replies
4
Views
4K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K