Statistics Chi Square test of normally distributed data

Liesl
Messages
1
Reaction score
0

Homework Statement


The following data is hypothesized to possesses a normal distribution, is that hypothesis sustained by a Chi-square test at .05 significance:

<66...4
67-68...24
68-70...35
70-72...15
72-74...8
>74...4


The Attempt at a Solution



1. I know that I should calculate the expected values for each bin for a normally distributed data set using the equation f(x)=2.66*e^-[(x-μ)^2/(2σ^2)] To do this, I'lll need to calculate μ and s from the data using estimators.

2. Then I can simply plug those values into the Chi Squared equation and see if the resulting value is less than the X^2 value from the table. (I will have to make sure np > 5 for all bins, and subtract 2 from the number of final bins to get the degrees of freedom).

My question is, how do is use estimators when the data is continuously distributed in bins as it is in the problem? Thank you.
 
Physics news on Phys.org
I would try to minimize that Chi-squared value in a fit to get mean and standard deviation (and Chi2) at the same time.
 
Liesl said:

Homework Statement


The following data is hypothesized to possesses a normal distribution, is that hypothesis sustained by a Chi-square test at .05 significance:

<66...4
67-68...24
68-70...35
70-72...15
72-74...8
>74...4


The Attempt at a Solution



1. I know that I should calculate the expected values for each bin for a normally distributed data set using the equation f(x)=2.66*e^-[(x-μ)^2/(2σ^2)] To do this, I'lll need to calculate μ and s from the data using estimators.

2. Then I can simply plug those values into the Chi Squared equation and see if the resulting value is less than the X^2 value from the table. (I will have to make sure np > 5 for all bins, and subtract 2 from the number of final bins to get the degrees of freedom).

My question is, how do is use estimators when the data is continuously distributed in bins as it is in the problem? Thank you.

You calculate the bin probabilities in the usual way: ##P\{ a < X < b\} = P\{ (a - \mu)/\sigma < Z < (b - \mu)/\sigma\}##, where ##Z## is a standard normal random variable (mean= 0, variance = 1). You need to use normal tables or a computer package, or something similar.
 
There are two things I don't understand about this problem. First, when finding the nth root of a number, there should in theory be n solutions. However, the formula produces n+1 roots. Here is how. The first root is simply ##\left(r\right)^{\left(\frac{1}{n}\right)}##. Then you multiply this first root by n additional expressions given by the formula, as you go through k=0,1,...n-1. So you end up with n+1 roots, which cannot be correct. Let me illustrate what I mean. For this...

Similar threads

Replies
14
Views
2K
Replies
7
Views
2K
Replies
2
Views
2K
Replies
23
Views
2K
Replies
1
Views
4K
Replies
1
Views
1K
Replies
8
Views
2K
Back
Top