Hi,I was looking at the way variance is calcullated. Let us say V

  • Thread starter Thread starter Santosh_J
  • Start date Start date
  • Tags Tags
    Variance
AI Thread Summary
Variance is calculated by averaging the squared differences between each data point and the mean, which emphasizes larger deviations. Squaring prevents the cancellation of negative and positive differences, ensuring that all deviations contribute positively to the variance. Using absolute differences instead of squared differences would not capture the same statistical properties, particularly in normally distributed data. The relationship between variance and standard deviation is crucial, as it helps define the distribution of data points around the mean. Overall, the squaring of differences is essential for accurately representing variability in a dataset.
Santosh_J
Messages
2
Reaction score
0
Hi,

I was looking at the way variance is calcullated. Let us say V represents variance, then

V = average of the sum of squares of the difference between Xi and the mean (μ)

What I do not understand is why square the difference? By squaring are we not magnifying the difference by many folds?

If squaring is only to prevent canceling of negative and positive differences, then we can instead average the sum of modulus of the difference between Xi and the mean (μ), which I believe is an accurate representation, i.e.

V = (Ʃ|Xi-μ|)/N


Am I missing something here or my understanding is wrong about variance?

Thanks,
Santosh
 
Physics news on Phys.org


Santosh_J said:
Hi,

I was looking at the way variance is calcullated. Let us say V represents variance, then

V = average of the sum of squares of the difference between Xi and the mean (μ)

What I do not understand is why square the difference? By squaring are we not magnifying the difference by many folds?

If squaring is only to prevent canceling of negative and positive differences, then we can instead average the sum of modulus of the difference between Xi and the mean (μ), which I believe is an accurate representation, i.e.

V = (Ʃ|Xi-μ|)/N


Am I missing something here or my understanding is wrong about variance?

Thanks,
Santosh

Squaring the deviation emphasizes larger differences. Additionally, for a normal distribution, 68% of values lie with 1 standard deviation of the mean, 95% of values lie within 2 standard deviations of the mean, and 99.7% of the values lie within 3 standard deviations of the mean. For some reason, this only seems to work if we use squared deviations as part of our definition of variance.
 
I was reading documentation about the soundness and completeness of logic formal systems. Consider the following $$\vdash_S \phi$$ where ##S## is the proof-system making part the formal system and ##\phi## is a wff (well formed formula) of the formal language. Note the blank on left of the turnstile symbol ##\vdash_S##, as far as I can tell it actually represents the empty set. So what does it mean ? I guess it actually means ##\phi## is a theorem of the formal system, i.e. there is a...
Back
Top