- #1
bitty
- 14
- 0
Homework Statement
I know that for a Chi Square test to adequately describe a distribution, you need each bin to have an estimated frequency > 5. As a rule of thumb though, do I want to pool the bins with less than 5 estimated frequency so as to maximize the number of bins or minimize them?
For instance say we have:
(first column is the bin number, second column is estimated frequency of that bin)
Bin# Estimated Frequency
1 6
2 3
3 4
4 3
5 3
Which way would we pool bins?
Method 1)
Bin # Estimated Frequency
1 6
2+ 13
or
Method 2)
Bin # Estimated Frequency
1 6
2-3 7
4-5 6