Mean Absolute Deviation/Standard Deviation Ratio

Click For Summary

Discussion Overview

The discussion revolves around the relationship between the Mean Absolute Deviation (MAD) and Standard Deviation (SD) of a Normal Distribution, specifically exploring the ratio MAD:SD and its implications. Participants raise questions about the underlying reasons for this ratio, its significance in statistical analysis, and related inquiries about higher moments of random variables.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant states that the MAD of a Normal Distribution is 0.7979 of its SD, questioning why this is true and how it relates to the distribution's characteristics.
  • Another participant explains that for any arbitrary probability distribution, MAD is always less than SD, and discusses the implications of this for the Normal Distribution, noting that it has lighter tails compared to other distributions like the Double Exponential.
  • A later reply provides a mathematical derivation of the MAD for the Normal Distribution, confirming the ratio using integrals and properties of even functions.
  • Participants discuss the challenges of reconciling different statistical measures, such as skewness and kurtosis, and propose the idea of a joint test to compare these characteristics.
  • Several participants pose related questions about the relationships between higher moments of random variables and their absolute values, with one providing a counterexample to illustrate potential misconceptions.

Areas of Agreement / Disagreement

Participants express varying views on the implications of the MAD:SD ratio and its relationship to other statistical measures. There is no consensus on the best approach to reconcile these differences, and the discussion remains unresolved regarding the broader implications of the findings.

Contextual Notes

Some participants note that the relationship between MAD and SD may depend on the specific characteristics of the distribution being analyzed, and there are unresolved questions regarding the assumptions behind the higher moments of random variables.

kimberley
Messages
14
Reaction score
0
I ran across an interesting statistic today while doing some research, but it was stated as a matter of fact without explanation and there appears to be a dearth of material on it. It was stated that the Mean Absolute Deviation ("MAD") of a Normal (Gaussian) Distribution is .7979 of a Normal Distribution's Standard Deviation ("SD"). The simple equation offered was MAD:SD=SQRT (2/pi).

Question 1: Assuming this statement is true, why is it true? That is, what is it about the Normal Distribution that would cause a MAD to be .7979 of the SD?

Question 2: Again, assuming this statement is true, how would you reconcile two samples, one of which has a more favorable Jarque-Bera Test Statistic than another, but a less favorable MAD/SD Ratio?

Thank you in advance.

Kimberley
 
Physics news on Phys.org
For any arbitrary probability distribution F, MAD(F) < SD(F) is always the case.

As for Q.1., the normal dist. has the characteristic that as its "spread" increases one unit as measured by squared deviations (i.e., the variance), its spread increases 0.7979 of a unit as measured by absolute deviations. Remember that SD is "the sqrt of Var" = "sqrt of average squared error," and squaring inflates outliers.

(This link shows a MAD of 0.681 for the Normal -- my guess is it is simulated data; which contain some error.) Contrast this with the Double Exponential, whose MAD is about half of its SD. Compared to SD, MAD gives less weight to outliers, so distributions with light tails tend to have a MAD/SD ratio closer to 1. This is confirmed by the observation that the Normal does indeed have lighter tails than the Double Exp'l.

Q.2 is challenging. MAD and Skew measure different characteristics: MAD is dispersion, which is a second-order moment, so is SD. But the ratio MAD/SD is akin to a fourth-order moment (Kurtosis). Skew is third-order. It should be possible to devise a joint test of skewness and Kurtosis, which would be a Golden Key, but I don't have a ready formula for that.

A practical approach may be to say "a third-order statistic is obviously more important than a fourth-order one," and devise an ad-hoc two-step test: If distribution F has excess skew closer to 0 than distribution G, then we conclude F is more normal than G. If distributions F and G have "practically" the same excess skew, then we compare their MAD/SD ratios; the one closer to 0.7979 is more normal.

Again, the Golden Key would be to devise a joint test.
 
Last edited:
kimberley said:
The simple equation offered was MAD:SD=SQRT (2/pi).

Question 1: Assuming this statement is true, why is it true? That is, what is it about the Normal Distribution that would cause a MAD to be .7979 of the SD?

It's an easy calculation. The mean absolute deviation is the expected value of the absolute value of the random variable:

E(|x|) = \int_{-\infty}^{\infty}|x|\, p(x) dx

As both absolute value and the standard Gaussian distribution are even functions,

E(|x|) =<br /> 2\int_0^{\infty}x \frac 1 {\sigma\sqrt{2\pi}}<br /> e^{-\,\frac {x^2} {2\sigma^2} } dx

A simple u-substitution does the trick here, u=\exp(-x^2/(2\sigma^2)):

E(|x|) =<br /> \left.-\,\sigma \sqrt{\frac 2 {\pi}} e^{-\,\frac {x^2} {2\sigma^2} }\right|_0^{\infty}<br /> = \sigma \sqrt{\frac 2 {\pi}}
 
i have a vaguely related question...
consider random variables X and Y with
E[X]=E[Y]=0
E[X^2]=E[Y^2] (ie the same standard deviation)
and for n>=3
E[X^n]>=E[Y^n]>=0

is there a way to conclude that
E[|X|]>=E[|Y|]
?
 
judoudo said:
i have a vaguely related question...
consider random variables X and Y with
E[X]=E[Y]=0
E[X^2]=E[Y^2] (ie the same standard deviation)
and for n>=3
E[X^n]>=E[Y^n]>=0

is there a way to conclude that
E[|X|]>=E[|Y|]
?


I assume you mean that the higher moments (3, 4, etc.) of |X| are always higher for X than Y. If that is what you mean, then I think I can provide a counter example. Consider X to be the absolute value of a rescaled version (standard deviation 1) of a central T distribution on 9 degrees of freedom. Suppose X is the absolute value of a Standard mean zero normal variable. Then I believe (from simulation) that higher moments (2, 3, ...) of the absolute value of the rescaled t variable are all greater than the corresponding higher moments of the absolute normal. Yet the expected value of the absolute T-variable is LESS than that of the absolute normal. Indeed you would not intuitively expect the relationship to be "higher, equal, higher, higher..." as the exponent goes 1, 2, 3, 4, ... rather it goes "less, equal, higher, higher, ..."
 
judoudo said:
i have a vaguely related question...
consider random variables X and Y with
E[X]=E[Y]=0
E[X^2]=E[Y^2] (ie the same standard deviation)
and for n>=3
E[X^n]>=E[Y^n]>=0

is there a way to conclude that
E[|X|]>=E[|Y|]
?


After I sent my post to the question you did NOT ask (with absolute value signs), I realized you might have meant it just the way you posed it! Here is a counterexample to the question you ACTUALLY posed.

Let X and Y both have 2-point distributions.

X is 9 with probability 0.1 and -1 with probability 0.9. E(X) = 0, Var(X) = 9

Y is 4.5 with probability 4/13 and -2 with probability 9/13. E(Y) = 0, Var(Y) = 9

E( |X| ) = 1.8 and E ( |Y| ) is clearly greater than 1.8, since it has to be between 2 and 4.5. Yet the higher moments of X are clearly all greater than those of Y.

I apologize for assuming you did not mean to leave off the absolute value signs.
 

Similar threads

  • · Replies 4 ·
Replies
4
Views
2K
Replies
1
Views
1K
  • · Replies 6 ·
Replies
6
Views
2K
Replies
20
Views
3K
  • · Replies 24 ·
Replies
24
Views
6K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K