Why a normal distribution is not a good approximation for these exam scores?

Click For Summary
SUMMARY

The discussion centers on the inappropriateness of using a normal distribution to model exam scores, particularly when the maximum score is capped at 100. The calculated probabilities indicate that the range of μ - 3σ to μ + 3σ (26.6 to 118.4) does not align with the actual score limits, leading to a significant probability (3.61%) of invalid scores exceeding 100. This discrepancy highlights that normal distribution fails to accurately represent the data, as the theoretical probabilities do not match the observed constraints of the exam scores.

PREREQUISITES
  • Understanding of normal distribution properties
  • Familiarity with statistical concepts such as mean (μ) and standard deviation (σ)
  • Ability to calculate probabilities using the normal distribution
  • Knowledge of truncation effects on statistical analysis
NEXT STEPS
  • Study the implications of truncating data on statistical distributions
  • Learn about alternative distributions suitable for bounded data, such as the beta distribution
  • Explore the Central Limit Theorem and its applications in statistics
  • Investigate methods for assessing the goodness-of-fit for statistical models
USEFUL FOR

Statisticians, data analysts, educators assessing exam scores, and anyone involved in statistical modeling and analysis of bounded datasets.

songoku
Messages
2,508
Reaction score
402
Homework Statement
The average score for an exam is 72.5 out of 100 and the standard deviation is 15.3
Why a normal distribution would not give a good approximation to the distribution of the score?
Relevant Equations
Not sure
I am not really sure what the reason is but my argument would be if normal distribution is appropriate, then almost all the score will fall in the range of μ - 3σ to μ + 3σ

For this case, the range of μ - 3σ to μ + 3σ is 26.6 to 118.4 and all the score is unlikely to be within the range.

I feel my argument is not strong enough to justify why normal distribution is not appropriate. Since the max score is 100, the range can be truncated to 26.6 to 100. One can argue that this is a plausible range for exam score

Is there a certain condition to justify use of normal distribution, maybe something like the standard deviation should be at most a % of the mean?

Thanks
 
Last edited:
  • Like
Likes atyy
Physics news on Phys.org
I think you are on the right track but can make a stronger argument. Specifically, what is wrong with the range of 26.6 to 118.4? Concentrate on the problem with that and give a specific probability for the problem values of a normal distribution.
 
  • Like
Likes songoku, atyy, pbuk and 1 other person
FactChecker said:
I think you are on the right track but can make a stronger argument. Specifically, what is wrong with the range of 26.6 to 118.4? Concentrate on the problem with that and give a specific probability for the problem values of a normal distribution.
For normal distribution to be appropriate:
P(μ - σ < X < μ + σ) = 68.26%

P(μ - 2σ < X < μ + 2σ) = 95.44%

P(μ - 3σ < X < μ + 3σ) = 99.73%In this question:
P(μ - σ < X < μ + σ) =P(57.2 < X < 87.8) = 68.26%

P(μ - 2σ < X < μ + 2σ) = P(41.9 < X < 103.1) = P(41.9 < X < 100) = 94.11%

P(μ - 3σ < X < μ + 3σ) = P(26.6 < X < 118.4) = P(26.6 < X < 100) = 96.25%

Since P(μ - 3σ < X < μ + 3σ) does not match with theoretical value, so normal distribution is not good approximation?

Thanks
 
Last edited:
Assuming a normal distribution with that mean and standard deviation, I would calculate the probability, P(X>100), which all are invalid results. It may be a matter of preference, but I think that is the strongest argument.
 
  • Like
Likes songoku
FactChecker said:
Assuming a normal distribution with that mean and standard deviation, I would calculate the probability, P(X>100), which all are invalid results. It may be a matter of preference, but I think that is the strongest argument.
P(X > 100) = 0.0361= 3.61%

There will be around 3.61% of invalid data in the region of upper tail of normal distribution so normal distribution is not good approximation.

Is this what you mean?

Thanks
 
Yes.
 
Thank you very much FactChecker
 
  • Like
Likes berkeman

Similar threads

Replies
2
Views
5K
  • · Replies 1 ·
Replies
1
Views
10K
Replies
3
Views
2K
  • · Replies 5 ·
Replies
5
Views
4K
Replies
6
Views
4K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 6 ·
Replies
6
Views
3K
Replies
2
Views
5K
  • · Replies 4 ·
Replies
4
Views
27K
Replies
10
Views
3K