Standard Deviation Conceptual [intro. Stats]

Click For Summary
The discussion centers on the application of standard deviation within a bounded range of test scores (0% to 100%) in a normal distribution context. It highlights that a random variable with such limits cannot be normally distributed, leading to questions about the appropriateness of using the three-sigma rule. The concept of a truncated normal distribution is introduced, indicating that standard deviations in this context would yield different probabilities compared to a standard normal distribution. Participants agree that while normal distribution approximations are common in textbooks, they may not accurately reflect real-world scenarios. Ultimately, the conversation emphasizes the need for careful consideration of distribution types when dealing with bounded data.
END
Messages
36
Reaction score
4
Hello, PF!

[My question pertains to a non-rigorous, undergraduate introductory Probability and Statistics course. I'm no math major, so please correct me if I've mishandled any terms or concepts as I try to express myself. I'm always eager to learn!]

* * *​

In a discussion of the standard deviation of a sample in relation to the 68-95-99.7 rule, the following "conceptual" example was given—or rather, made up on the spot—by our professor:

Assume \bar{x}=50 \% and s=20 \% for test scores (in units of percent correct), and assume that the sample represents the normal distribution (symmetrical and bell-shaped) of a test where no test score range below 0 \% and none above 100 \% (sorry, fellas, no extra credit).

It occurred to me that any score beyond 2.5 standard deviations would be a score of more than 100 \% or less than 0 \%. According to the three-sigma rule, this would still only encompass approximately 98.7\% of the scores meaning that approximately 1.3\% of the scores fall outside this possible range.

My question:

Is the above example even possible given the "parameters" (limits?—I can't find the right word) {0 \%}≤x_i≤{100 \%}?

And

Extrapolating this question to the overall concept, can any standard deviation s of a normal distribution ever exceed the possible range of data points/values within that distribution?

My guess is that this was simple oversight and an error on the part of my professor.

Thank you!
 
Physics news on Phys.org
A random variable with a bounded range (such as 0 to 100) is not actually normally distributed. However, a normal distribution can often be used to get approximate answers to questions about such a random variable. Many problems in textbooks expect students to make such an approximation. I'd call this type of approximation a tradition, not an oversight.
 
Ah, bounded was the word I was looking for! Thank you, Stephen.


A random variable with a bounded range (such as 0 to 100) is not actually normally distributed.

Would the example then be considered a Truncated normal distribution ?

If this is the case, what would 2.5 or 3 standard deviations imply in relation to the three-sigma rule when the values simply cannot extend beyond the boundaries? Does the three-sigma/68-95-99.7 rule simply not apply to this (and other truncated distributions); i.e., this example would have a different set of probabilities in relation to the various standard deviations: Pr(\bar{x} - 2.5s ≤x≤ \bar{x} + 2.5s) = 1.00 \ ?



Thank you.
 
END said:
Would the example then be considered a Truncated normal distribution ?

The example used the normal distribution as an approximation. If you want to make a different example, you could use a different distribution. A truncated normal distribution is but one example of what could be used.

i.e., this example would have a different set of probabilities in relation to the various standard deviations: Pr(\bar{x} - 2.5s ≤x≤ \bar{x} + 2.5s) = 1.00 \ ?


In general, a truncated normal distribution would have a different set of probability values for such an interval than a non-truncated normal distribution. (Keep in mind that a truncated normal distribution has a different "s" than the normal distribution that was truncated.)
 
There is no such thing as a normal distribution in the real world. It is a mathematical ideal. Quite often there are deviations in the tails. Often it is close enough for jazz.
 
The standard _A " operator" maps a Null Hypothesis Ho into a decision set { Do not reject:=1 and reject :=0}. In this sense ( HA)_A , makes no sense. Since H0, HA aren't exhaustive, can we find an alternative operator, _A' , so that ( H_A)_A' makes sense? Isn't Pearson Neyman related to this? Hope I'm making sense. Edit: I was motivated by a superficial similarity of the idea with double transposition of matrices M, with ## (M^{T})^{T}=M##, and just wanted to see if it made sense to talk...

Similar threads

Replies
20
Views
2K
  • · Replies 24 ·
Replies
24
Views
6K
  • · Replies 6 ·
Replies
6
Views
6K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 11 ·
Replies
11
Views
3K
Replies
6
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K
Replies
4
Views
2K