Replacement of Squaring in Variance Equation: Benefits?

vanmaiden
Messages
101
Reaction score
1
The variance equation basically sums up all the distances between each data value and the mean of the set. The interesting thing is that each distance and squared for a reason that I believe is to make the distance positive, but why don't the statisticians just take the absolute value of each distance to give a smaller number? Is there some benefit to having a large number to work with? I mean, the smaller numbers nevertheless have decimals that can be used to compare magnitude and such.

Thank you.
 
Physics news on Phys.org


vanmaiden said:
why don't the statisticians just take the absolute value of each distance to give a smaller number?

That's a good question and I don't think it has a simple answer. There are many reasons why the mean squared deviation is very useful. It is directly related to a parameter in the often used Normal Distribution, while the "Mean Absolute Deviation" (which is what you are proposing as an alternative) is not. If X and Y are independent random variables then variances obey the law Var(X+Y) = Var(X) + Var(Y), but I don't think the Mean Absolute Deviation obeys such a nice law.

The Mean Absolute Deviation has been studied and used, so you can't really say that statisiticans haven't tried it.

If you want to talk about things like the mean squared deviation or the mean absolute deviation, you need to be clear which of the 3 different meanings you are discussing. Each of these things can be 1) A parameter of a probability distribution, 2) a statistic computed from a sample or 3) a formula for estimating a parameter in a probability distribution by using values from a sample. Each of those 3 things can be discussed as 1) a random variable or 2) a specific value of random variable.
 
Hi all, I've been a roulette player for more than 10 years (although I took time off here and there) and it's only now that I'm trying to understand the physics of the game. Basically my strategy in roulette is to divide the wheel roughly into two halves (let's call them A and B). My theory is that in roulette there will invariably be variance. In other words, if A comes up 5 times in a row, B will be due to come up soon. However I have been proven wrong many times, and I have seen some...
Thread 'Detail of Diagonalization Lemma'
The following is more or less taken from page 6 of C. Smorynski's "Self-Reference and Modal Logic". (Springer, 1985) (I couldn't get raised brackets to indicate codification (Gödel numbering), so I use a box. The overline is assigning a name. The detail I would like clarification on is in the second step in the last line, where we have an m-overlined, and we substitute the expression for m. Are we saying that the name of a coded term is the same as the coded term? Thanks in advance.
Back
Top