Question about Chi-Square Test Regarding Normal Distribution

Click For Summary
SUMMARY

The discussion centers on the Chi-Square Test and its sensitivity to data grouping, specifically in relation to normal distribution. Two different groupings of data intervals were analyzed, leading to conflicting conclusions regarding the null hypothesis. The participants confirmed that varying bin sizes can indeed yield different results due to changes in degrees of freedom. Additionally, the Shapiro-Wilk test was recommended as a robust alternative for assessing normality, particularly when sample sizes are limited.

PREREQUISITES
  • Understanding of Chi-Square Test methodology
  • Familiarity with data grouping and binning techniques
  • Knowledge of the Shapiro-Wilk test for normality
  • Basic statistics concepts, including degrees of freedom
NEXT STEPS
  • Research the implications of bin size on Chi-Square Test results
  • Learn how to perform the Shapiro-Wilk test for normality
  • Explore the Central Limit Theorem (CLT) and its applications
  • Study the concept of degrees of freedom in statistical tests
USEFUL FOR

Statisticians, data analysts, and researchers conducting hypothesis testing and analyzing data distributions.

songoku
Messages
2,509
Reaction score
393
TL;DR
Let say I have 50 raw data of height of students. I want to do goodness of fit test to check whether normal distribution is appropriate model for the data at a certain significance level
The first step is to group the data and make a table so I can get the observed frequency for each data interval. I did two different groupings (something like 150 - 160 , 160 - 170 , etc and the other is 150 - 170, 170 - 190, etc) and found out that the conclusion of the hypothesis is different, one resulting in accepting null hypothesis and the other rejecting the null hypothesis.

Is it possible different grouping resulting in different conclusion? Or there should be mistake in my working?

Thanks
 
Physics news on Phys.org
You suffer from low statistics -- 50 events isn't much to confirm a distribution.
 
  • Like
Likes   Reactions: songoku
In reality, as everyone knows the height of individuals has finite variance, you can just rely on the CLT with n=50 to assume normality
 
  • Like
Likes   Reactions: songoku
It certainly is possible to get different results. Your first grouping would show more detail than your second grouping. It would also have twice the degrees of freedom, so the Chi-Squared distribution is different.
 
  • Like
Likes   Reactions: songoku
Thank you very much for the help and explanation BWV, BvU, FactChecker
 
  • Like
Likes   Reactions: BvU

Similar threads

  • · Replies 7 ·
Replies
7
Views
2K
Replies
20
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 1 ·
Replies
1
Views
1K
Replies
1
Views
3K
  • · Replies 17 ·
Replies
17
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K