Hi,I was looking at the way variance is calcullated. Let us say V

  • Context: Undergrad 
  • Thread starter Thread starter Santosh_J
  • Start date Start date
  • Tags Tags
    Variance
Click For Summary
SUMMARY

The discussion centers on the calculation of variance, defined as V = average of the sum of squares of the difference between Xi and the mean (μ). The user, Santosh, questions the necessity of squaring the differences, suggesting that using the average of the absolute differences could suffice. However, it is established that squaring the deviations emphasizes larger differences and is essential for accurately representing data distribution, particularly in normal distributions where a significant percentage of values fall within specific standard deviations from the mean.

PREREQUISITES
  • Understanding of statistical concepts such as variance and standard deviation.
  • Familiarity with normal distribution properties.
  • Knowledge of mathematical operations including squaring and averaging.
  • Basic grasp of data analysis techniques.
NEXT STEPS
  • Study the properties of variance and standard deviation in depth.
  • Learn about the implications of using absolute differences versus squared differences in statistical analysis.
  • Explore the concept of normal distribution and its significance in statistics.
  • Investigate alternative measures of variability, such as mean absolute deviation (MAD).
USEFUL FOR

Statisticians, data analysts, and students studying statistics who seek to deepen their understanding of variance and its calculation methods.

Santosh_J
Messages
2
Reaction score
0
Hi,

I was looking at the way variance is calcullated. Let us say V represents variance, then

V = average of the sum of squares of the difference between Xi and the mean (μ)

What I do not understand is why square the difference? By squaring are we not magnifying the difference by many folds?

If squaring is only to prevent canceling of negative and positive differences, then we can instead average the sum of modulus of the difference between Xi and the mean (μ), which I believe is an accurate representation, i.e.

V = (Ʃ|Xi-μ|)/N


Am I missing something here or my understanding is wrong about variance?

Thanks,
Santosh
 
Physics news on Phys.org


Santosh_J said:
Hi,

I was looking at the way variance is calcullated. Let us say V represents variance, then

V = average of the sum of squares of the difference between Xi and the mean (μ)

What I do not understand is why square the difference? By squaring are we not magnifying the difference by many folds?

If squaring is only to prevent canceling of negative and positive differences, then we can instead average the sum of modulus of the difference between Xi and the mean (μ), which I believe is an accurate representation, i.e.

V = (Ʃ|Xi-μ|)/N


Am I missing something here or my understanding is wrong about variance?

Thanks,
Santosh

Squaring the deviation emphasizes larger differences. Additionally, for a normal distribution, 68% of values lie with 1 standard deviation of the mean, 95% of values lie within 2 standard deviations of the mean, and 99.7% of the values lie within 3 standard deviations of the mean. For some reason, this only seems to work if we use squared deviations as part of our definition of variance.
 

Similar threads

  • · Replies 7 ·
Replies
7
Views
6K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 4 ·
Replies
4
Views
6K
  • · Replies 4 ·
Replies
4
Views
5K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 13 ·
Replies
13
Views
2K