# The Replacement of Squaring the Individual Distances with Absolute Values in Variance

1. Apr 9, 2012

### vanmaiden

The variance equation basically sums up all the distances between each data value and the mean of the set. The interesting thing is that each distance and squared for a reason that I believe is to make the distance positive, but why don't the statisticians just take the absolute value of each distance to give a smaller number? Is there some benefit to having a large number to work with? I mean, the smaller numbers nevertheless have decimals that can be used to compare magnitude and such.

Thank you.

2. Apr 9, 2012

### Stephen Tashi

Re: The Replacement of Squaring the Individual Distances with Absolute Values in Vari

That's a good question and I don't think it has a simple answer. There are many reasons why the mean squared deviation is very useful. It is directly related to a parameter in the often used Normal Distribution, while the "Mean Absolute Deviation" (which is what you are proposing as an alternative) is not. If X and Y are independent random variables then variances obey the law Var(X+Y) = Var(X) + Var(Y), but I don't think the Mean Absolute Deviation obeys such a nice law.

The Mean Absolute Deviation has been studied and used, so you can't really say that statisiticans haven't tried it.

If you want to talk about things like the mean squared deviation or the mean absolute deviation, you need to be clear which of the 3 different meanings you are discussing. Each of these things can be 1) A parameter of a probability distribution, 2) a statistic computed from a sample or 3) a formula for estimating a parameter in a probability distribution by using values from a sample. Each of those 3 things can be discussed as 1) a random variable or 2) a specific value of random variable.