- #1
Vital
- 108
- 4
Hello.
I am doing problem sets on very basic stat topics. When solving problems that require the use of chi-squared table, I stumbled upon an unexpected issue. I seem to miss something important about how to correctly choose the right critical value using the chi-squared table.
Below are two examples, and both have totally confused me. My questions are below these examples.
(1) During a 10-year period, the standard deviation of annual returns on a portfolio you are analyzing was 7 percent a year. You want to see whether this record is sufficient evidence to support the conclusion that the portfolio’s underlying variance of return was less than 395, the return variance of the portfolio’s benchmark.
Question: Identify the rejection point or points at the 0.05 significance level for the hypothesis H0: σ2 ≥ 400 versus Ha: σ2 < 395, where 395 is the hypothesized value of variance, σ2.
Solution from the book:
The book provides the table which is named Values of chi-squared (degrees of freedom, level of significance) Probability in Right Tail. I am attaching the picture of the table.
The rejection point is found across degrees of freedom of 9, under the 0.95 column (95 percent of probability above the value). It is 3.325. We will reject the null hypothesis if we find that χ2 < 3.325.
(2)
For example, we will consider a goodness of fit test for a twelve-sided die. Our null hypothesis is that all sides are equally likely to be rolled, and so each side has a probability of 1/12 of being rolled. Since there are 12 outcomes, there are 12 -1 = 11 degrees of freedom. This means that we will use the row marked 11 for our calculations.
A goodness of fit test is a one-tailed test. The tail that we use for this is the right tail. Suppose that the level of significance is 0.05 = 5%. This is the probability in the right tail of the distribution. Our table is set up for probability in the left tail. So the left of our critical value should be 1 – 0.05 = 0.95. This means that we use the column corresponding to 0.95 and row 11 to give a critical value of 19.675.
My questions:
Doing a few problems, I came to understanding that if, for example, we take 5% significance level, it means that we have a critical value X, and 5% of data above this critical value X is where we would reject our null hypothesis, because it would mean that our actual observations do not fit into 95% of data below that critical value.
So, the first example "looks" at the table that is set for the right tale, and the second "looks" at the table set to the left tale, but both use the 5% significance level. This is so confusing.
If we take a 5% significance level for the first problem, than why do we take the critical value in the column for 95%? I thought that, as I explained above, that I have to look at the value in column 0.05 (given certain degrees of freedom), meaning that if my computed test statistic is lower than that critical value, then I reject the null, but if it is not, then I do not reject the null and it would mean that the data is actually in that right 5% segment.
I am completely confused by these two problems and by tables that show probabilities in the right or left tale for chi-squared distribution.
EDIT:
I have just found one more example, which uses the same 5% level of significance, and chooses the critical value in the column with 0.05, not in column 0.95.
Here is the link to this third example.
And here is another link to the youtube video, where he explains that when you are looking for a critical value of say 5% with say 5 degrees of freedom, you have to look in the column with 0.05 in the table to find the critical value of 11.070, and not in the column with 0.95.
Please, help) Thank you very much.
I am doing problem sets on very basic stat topics. When solving problems that require the use of chi-squared table, I stumbled upon an unexpected issue. I seem to miss something important about how to correctly choose the right critical value using the chi-squared table.
Below are two examples, and both have totally confused me. My questions are below these examples.
(1) During a 10-year period, the standard deviation of annual returns on a portfolio you are analyzing was 7 percent a year. You want to see whether this record is sufficient evidence to support the conclusion that the portfolio’s underlying variance of return was less than 395, the return variance of the portfolio’s benchmark.
Question: Identify the rejection point or points at the 0.05 significance level for the hypothesis H0: σ2 ≥ 400 versus Ha: σ2 < 395, where 395 is the hypothesized value of variance, σ2.
Solution from the book:
The book provides the table which is named Values of chi-squared (degrees of freedom, level of significance) Probability in Right Tail. I am attaching the picture of the table.
The rejection point is found across degrees of freedom of 9, under the 0.95 column (95 percent of probability above the value). It is 3.325. We will reject the null hypothesis if we find that χ2 < 3.325.
(2)
For example, we will consider a goodness of fit test for a twelve-sided die. Our null hypothesis is that all sides are equally likely to be rolled, and so each side has a probability of 1/12 of being rolled. Since there are 12 outcomes, there are 12 -1 = 11 degrees of freedom. This means that we will use the row marked 11 for our calculations.
A goodness of fit test is a one-tailed test. The tail that we use for this is the right tail. Suppose that the level of significance is 0.05 = 5%. This is the probability in the right tail of the distribution. Our table is set up for probability in the left tail. So the left of our critical value should be 1 – 0.05 = 0.95. This means that we use the column corresponding to 0.95 and row 11 to give a critical value of 19.675.
My questions:
Doing a few problems, I came to understanding that if, for example, we take 5% significance level, it means that we have a critical value X, and 5% of data above this critical value X is where we would reject our null hypothesis, because it would mean that our actual observations do not fit into 95% of data below that critical value.
So, the first example "looks" at the table that is set for the right tale, and the second "looks" at the table set to the left tale, but both use the 5% significance level. This is so confusing.
If we take a 5% significance level for the first problem, than why do we take the critical value in the column for 95%? I thought that, as I explained above, that I have to look at the value in column 0.05 (given certain degrees of freedom), meaning that if my computed test statistic is lower than that critical value, then I reject the null, but if it is not, then I do not reject the null and it would mean that the data is actually in that right 5% segment.
I am completely confused by these two problems and by tables that show probabilities in the right or left tale for chi-squared distribution.
EDIT:
I have just found one more example, which uses the same 5% level of significance, and chooses the critical value in the column with 0.05, not in column 0.95.
Here is the link to this third example.
And here is another link to the youtube video, where he explains that when you are looking for a critical value of say 5% with say 5 degrees of freedom, you have to look in the column with 0.05 in the table to find the critical value of 11.070, and not in the column with 0.95.
Please, help) Thank you very much.
Attachments
Last edited: