What divisor does Microsoft Excel use for standard deviation?

Click For Summary

Discussion Overview

The discussion centers around the divisor used in the standard deviation calculations within Microsoft Excel, specifically addressing the difference between the sample standard deviation (using N-1) and the population standard deviation (using N). Participants explore the implications of these definitions and their potential for causing confusion among users.

Discussion Character

  • Debate/contested
  • Conceptual clarification
  • Technical explanation

Main Points Raised

  • Some participants assert that the standard deviation is defined with divisor N, while Excel's "STDEV" function uses N-1, which they find misleading.
  • Others clarify that the term "standard deviation" can refer to different formulas depending on whether one is dealing with a sample or a population.
  • It is noted that Excel provides both STDEV for sample standard deviation and STDEVP for population standard deviation, suggesting that users typically want the sample standard deviation.
  • One participant emphasizes that there is no single standard deviation formula, as the appropriate one depends on context.
  • Another participant discusses the maximum likelihood estimator (MLE) for standard deviation under the Gaussian model, indicating that the interpretation of variance differs based on whether it is treated as a sample or population statistic.
  • Concerns are raised about teaching statistics without clear references to population or estimation, particularly for students using Excel.

Areas of Agreement / Disagreement

Participants express various viewpoints on the definitions and implications of standard deviation in Excel, with no consensus reached on whether the terminology used by Excel is misleading or appropriate. The discussion remains unresolved regarding the best approach to teaching these concepts.

Contextual Notes

Participants highlight that the definitions and applications of standard deviation can vary based on context, and there are nuances in how variance is treated in different statistical frameworks. Some argue that textbooks often present a singular definition before addressing more complex contexts.

ssd
Messages
268
Reaction score
6
By definition standard deviation has divisor 'N' .
sdmjgp.png

But in Microsoft Excel the "STDEV" function gives 'N-1' as divisor.
This may not be major issue to many of the people. But the command "STDEV"
makes many users feel that this is standard deviation, i.e. it has divisor 'N'. Misleading...is not it?
 
Physics news on Phys.org
In texts you will see the one with divisor N defined as the population standard deviation: the one with divisor N-1 is referred to as the sample standard deviation.

There is no single "standard deviation formula" - the appropriate formula depends on context.
 
Excel has STDEV, the sample standard deviation, and STDEVP, the population standard deviation. Usually you want the sample standard deviation, which is why the commands are named as they are.

I'm pretty sure this is all in the Excel help.
 
At the very outset of this reply let me say that all of you are correct from your
respective point of view. But my point was a little different which EnumaElish has got almost correctly.
To clarify my point I will say a word or two.

EnumaElish said:
"the square root of the bias-corrected variance [estimator] is
sometimes also known as the standard deviation"
http://mathworld.wolfram.com/StandardDeviation.html

True. You rightly got it: "sometimes" not in general- or I may say, not commonly.
That is why I used the term "misleading" instead of "bugged".

CRGreathouse said:
Excel has STDEV, the sample standard deviation,
and STDEVP, the population standard deviation. Usually you want the sample
standard deviation, which is why the commands are named as they are.

Point well taken.

statdad said:
In texts you will see the one with divisor N defined as the population
standard deviation: the one with divisor N-1 is referred to as the sample standard deviation.
There is no single "standard deviation formula" - the appropriate formula depends on context.


In the first step of stating the descriptive measures of dispersion all the textbooks, as I have gone through
( ex: attachment -1), use a single and unique "standard deviation formula" (and no context dependent definition). And this is done much before coming to the chapters of "point estimation" or "sample survey" (where we start talking about "expectation/unbiasedness" or "population/sample".

Then again, as learned from textbooks( attachment-2), the sample variance (also) have divisor
n, not n-1). Hence it is stated, sample variance is not unbiased estimator of population variance.

30nf95w.jpg

attachment -1

2rdu3h5.jpg

attachment-2
 
Last edited by a moderator:
The standard deviation with the divisor of n is the MLE when the Gaussian model is assumed and both [tex]\mu[/tex] and [/tex] \sigma [/tex] are unknown. (It is a function of the variance, the MLE of [tex]\sigma ^2[/tex], so is the MLE of the standard deviation. The sample variance with the divisor of n -1 comes from the unbiased estimator of variance.

But: this expression
[tex] \frac 1 n \sum_{i=1}^n (x_i-\bar x)^2,[/tex]

are also given as formulas for population variance and sample variance, respectively, for discrete populations, since in this case the population mean is defined as

[tex] \mu = \sum 1 n \sum_{i=1}^n[/tex]

and so is identical in form to the arithmetic mean.

So, when you use the variance with n in the denominator, you must be clear whether you are dealing with a sample or population: the interpretation differs. That is the context.
 
statdad said:
The standard deviation with the divisor of n is the MLE when the Gaussian model is assumed and both [tex]\mu[/tex] and [/tex] \sigma [/tex] are unknown. (It is a function of the variance, the MLE of [tex]\sigma ^2[/tex], so is the MLE of the standard deviation. The sample variance with the divisor of n -1 comes from the unbiased estimator of variance.

But: this expression
[tex] \frac 1 n \sum_{i=1}^n (x_i-\bar x)^2,[/tex]

are also given as formulas for population variance and sample variance, respectively, for discrete populations, since in this case the population mean is defined as

[tex] \mu = \sum 1 n \sum_{i=1}^n[/tex]

and so is identical in form to the arithmetic mean.

So, when you use the variance with n in the denominator, you must be clear whether you are dealing with a sample or population: the interpretation differs. That is the context.

Agree. But my point differs a little from yours.

What if you are given with 20 no.s and asked to find their mean, variance, skewness and kurtosis... without any reference to population/estimation. Remember what we did in our stating course of descriptive statistics. What I am trying to say is that, there are situations when one can look upon variance as a purely descriptive measure (of dispersion). And this measure itself is meaningful some times (apart from the fact that this measure will be used for inferential data analysis).
 
"without any reference to population/estimation" - I can't imagine why you you would want any set of statistics without wondering about any descriptive or inferential purpose (unless it is simply an exercise in calculation) - but so be it.
 
statdad said:
"without any reference to population/estimation" - I can't imagine why you you would want any set of statistics without wondering about any descriptive or inferential purpose (unless it is simply an exercise in calculation) - but so be it.

Right. I was more concerned about my students who are going through the initial stages of learning descriptive statistics. Most of them (if not all) use excel for their course work.
 

Similar threads

  • · Replies 42 ·
2
Replies
42
Views
6K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 24 ·
Replies
24
Views
7K
  • · Replies 15 ·
Replies
15
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 13 ·
Replies
13
Views
2K