Undergrad Calculating Degrees of Freedom for Chi-Squared & P Value

Click For Summary
SUMMARY

This discussion focuses on calculating degrees of freedom (dof) for chi-squared tests and p-values using observed and expected frequencies. The correct formula for dof is established as dof = (rows - 1) * (columns - 1), which yields a value of 4 for the provided data set. An alternative method involving the number of restrictions also confirms this result, with five restrictions identified. Additionally, the discussion clarifies that for a single-row table with ten columns, the degrees of freedom is 9, following the formula D = I - P, where I represents independent sample points and P represents independent parameters.

PREREQUISITES
  • Understanding of chi-squared tests and p-values
  • Familiarity with observed and expected frequencies
  • Basic knowledge of statistical parameters and independent observations
  • Ability to calculate degrees of freedom using formulas
NEXT STEPS
  • Study the application of chi-squared tests in different statistical scenarios
  • Learn about the implications of degrees of freedom in various statistical tests
  • Explore the concept of restrictions in statistical models
  • Investigate the use of random number generators in statistical analysis
USEFUL FOR

Statisticians, data analysts, researchers, and students seeking to deepen their understanding of chi-squared tests and the calculation of degrees of freedom in statistical models.

lola2000
Messages
12
Reaction score
0
I am trying to understand how to decide the number of degrees of freedom when calculating a chi-squared and p value.

I have the data:

England:
people with no pets = 665
people with 1 pet = 976
people with 2+ pets = 913

Scotland
people with no pets = 313
people with 1 pet = 527
people with 2+ pets = 506

Wales
people with no pets =302
people with 1 pet = 440
people with 2+ pets = 358

I've calculated the expected frequency and therefore the (observed - expected)^2 / expected for each cell but I am stuck with degrees of freedom

One thing I've found says dof = (rows-1 ) * (col -1 ) which would = 2 * 2 = 4
another thing says dof = number of cells - number of restrictions = 9 - 2 = 7
where number of restrictions is number of things you are categorising by

can someone clarify this please!
 
Physics news on Phys.org
Both methods are correct and give the same answer, which is 4.
To apply the 'number of restrictions' method we need to count the restrictions carefully. There are five restrictions, being:
  • items in column 1 must add to column total 1
  • items in column 2 must add to column total 2
  • items in column 3 must add to column total 3
  • items in row 1 must add to row total 1
  • items in row 2 must add to row total 2
We don't need a restriction on row 3 because it is already implied by the five restrictions above, as
total row 3 = tot col 1 + tot col 2 + tot col 3 - tot row 1 - tot row 2

I suggest sticking with the (r-1) x (c-1) mnemonic, as it's easier to remember.
 
Thanks so much for the help. I see where I went wrong with the number of restrictions now
 
Similar question but how would this hold true for a situation where you have a random number generator producing numbers between 0 and 9 and you are counting their frequency? So you would have a table of one row and 10 columns.

Degrees of freedom is 9?
 
lola2000 said:
Similar question but how would this hold true for a situation where you have a random number generator producing numbers between 0 and 9 and you are counting their frequency? So you would have a table of one row and 10 columns.

Degrees of freedom is 9?
Yes.
 
Hey lola2000.

The degrees of freedom is a function of the number of independent parameters in the model in addition to how many independent observations exist within the sample.

If you can understand this then you will be able to get an arbitrary value for the degrees of freedom of any statistic.

You use D = I - P where I is the number of independent sample points and P is the number of independent parameters being assessed.

Different test statistics use different rules to get it but the idea above is central to that of all statistics.
 
If there are an infinite number of natural numbers, and an infinite number of fractions in between any two natural numbers, and an infinite number of fractions in between any two of those fractions, and an infinite number of fractions in between any two of those fractions, and an infinite number of fractions in between any two of those fractions, and... then that must mean that there are not only infinite infinities, but an infinite number of those infinities. and an infinite number of those...

Similar threads

  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 20 ·
Replies
20
Views
4K
  • · Replies 22 ·
Replies
22
Views
5K
  • · Replies 4 ·
Replies
4
Views
5K
  • · Replies 3 ·
Replies
3
Views
4K
  • · Replies 5 ·
Replies
5
Views
9K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 1 ·
Replies
1
Views
3K