View Full Version : Statistic deviation.
icystrike
Jun24-09, 03:11 PM
I went for a lecture and the lecturer said that the square of the difference between the x sub i and the mean is the take precaution of the negative value. This has been bugging me , i was wondering why dont they just take absolute because there is a difference between :
\sqrt{\frac{\sum(x-\mu)^2}{f}} and\frac{\sum \left|(x-\mu)\right|}{f}
statdad
Jun24-09, 05:07 PM
Yes, there is a difference, as
\sqrt{\sum(x-\mu)^2} \ne \sum |x - \mu |
There is actually quite a history about whether a measure based on
\sqrt{\frac{\sum (x-\mu)^2 }{f}}
or
\sqrt{\frac{\sum |x-\mu|}{f}}
should be used. Basically, the measure based on the sum of squared deviations won out because, statistically, when it is assumed that the data are drawn from a normal distribution (equivalently, when it is assumed the random noise is Gaussian).
icystrike
Jun25-09, 02:49 AM
Basically, the measure based on the sum of squared deviations won out because, statistically, when it is assumed that the data are drawn from a normal distribution (equivalently, when it is assumed the random noise is Gaussian).
Thanks for your help :smile:
Random noise, i got to check this out !
heh, i remember my stats lecturer said that too.
an analogy can be drawn with why we take the squares of the sides (pythagoras) to work out the hypotenuse and not the absolute value.
daviddoria
Jun25-09, 08:47 AM
The squared distance is also used because it is continuous, where the absolute distance function has a discontinuity. This is a big problem in optimization.
statdad
Jun25-09, 09:54 PM
The squared distance is also used because it is continuous, where the absolute distance function has a discontinuity. This is a big problem in optimization.
Not really the case in statistics - the median, median deviation, and other procedures use the absolute value.
HallsofIvy
Jun26-09, 06:12 AM
The squared distance is also used because it is continuous, where the absolute distance function has a discontinuity. This is a big problem in optimization.
The absolute distance function does not have a derivative at a point. There is no discontinuity.
HallsofIvy
Jun26-09, 06:14 AM
Yes, there is a difference, as
\sqrt{\sum(x-\mu)^2} \ne \sum |x - \mu |
There is actually quite a history about whether a measure based on
\sqrt{\frac{\sum (x-\mu)^2 }{f}}
or
\sqrt{\frac{\sum |x-\mu|}{f}}
When you sum the absolute values, you should not have a square root.
should be used. Basically, the measure based on the sum of squared deviations won out because, statistically, when it is assumed that the data are drawn from a normal distribution (equivalently, when it is assumed the random noise is Gaussian).
There is a third used occasionally:
\frac{max |x-\mu|}{f}
The end of your last sentence seems to be missing!
statdad
Jun26-09, 08:07 AM
Halls, i wish i had your proof-reading skills. Thanks for catching my missed comment.
You are also correct that the absolute value expression has no derivative, but again, for statistics, I'd add that really isn't a problem.
Why did I miss the unneeded square root? Let me know when you figure it out, because I can't.
vBulletin® v3.7.6, Copyright ©2000-2009, Jelsoft Enterprises Ltd.