Why does Chi-square formula have two different ones?

  • Context: Undergrad 
  • Thread starter Thread starter EigenCake
  • Start date Start date
  • Tags Tags
    Formula
Click For Summary
SUMMARY

The discussion clarifies the existence of two distinct chi-square formulas: one defined as the sum of squared standard-normal random variables divided by variance, and the other as the sum of squared differences between observed and expected frequencies divided by the expectation value. The first formula is a theoretical definition of the chi-squared distribution, while the second is specifically applied for goodness-of-fit tests. For hypothesis testing, the second formula is recommended to determine whether to reject or accept the null hypothesis.

PREREQUISITES
  • Understanding of chi-squared distribution
  • Familiarity with hypothesis testing concepts
  • Knowledge of observed versus expected frequencies
  • Basic statistics, particularly regarding variance and standard-normal random variables
NEXT STEPS
  • Study the derivation of the chi-squared distribution from standard-normal random variables
  • Learn about goodness-of-fit tests using the chi-square formula
  • Explore the implications of the Central Limit Theorem in relation to chi-squared tests
  • Investigate software tools for performing chi-square tests, such as R or Python's SciPy library
USEFUL FOR

Statisticians, data analysts, researchers conducting hypothesis testing, and anyone involved in statistical modeling or data analysis will benefit from this discussion.

EigenCake
Messages
12
Reaction score
0
One is:
http://en.wikipedia.org/wiki/Chi-squared_distribution

where at the bottom of the page, chi-square = sum of something devided by variance.

However, here:
http://www.napce.org/documents/research-design-yount/23_chisq_4th.pdf

where the chi-square formula is that the sum of the same thing devided by expectation value.

Which formula is right then?
 
Last edited by a moderator:
Physics news on Phys.org
The first reference is the definition of Chi-squared :- the distribution of a RV formed from the sum of squared standard-normal random variables.

The second reference is an application of the above distribution to the case of expect versus observed frequencies, where (O-E)/sqrt(E) is asymptotically standard-normal and hence (O-E)^2/E is asymptotically Chi-squared.
 
Thanks!

It seems that for goodness of fit I should use the second formula to either reject or accept the null hypothesis, rather than use the first formula.
 

Similar threads

  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 3 ·
Replies
3
Views
4K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 5 ·
Replies
5
Views
9K
  • · Replies 6 ·
Replies
6
Views
1K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K