Formulating null and alternative hypotheses for a chi-squared test

In summary: The goal of the experiment is to quantify the degree to which observations agree with Fraunhofer theory. The null hypothesis is that there is no difference between the results expected from Fraunhofer theory and observations of diffraction phenomena in the Fraunhofer regime; the alternative hypothesis is that the two disagree at some particular level of precision. As the discussion below concludes, the χ2 value should only be used to decide whether or not to reject the null hypothesis, not to calculate any other statistic.
  • #1
Eruditio
I am attempting to investigate to what quantitative degree a physical theory agrees with observations of the phenomena it predicts (specifically, Fraunhofer theory).

I want to use the chi-squared test to produce some confidence levels in the measurements made in different sections of the experiment.

The chi-squared test, as far as I'm aware, is just like any other statistical test in that it requires both a null and an alternative hypothesis. I believe that these need to be quite specific in order to draw valid conclusions. What I would like is some advice on how to word these hypotheses. Currently I have:

H0: No difference exists between the results expected from Fraunhofer theory and observations made of diffraction phenomena in the Fraunhofer regime.

H1: The results expected from Fraunhofer theory and observations made of diffraction phenomena in the Fraunhofer regime disagree at a particular level of precision.

I'm a little unsure on the alternative hypothesis in particular. I'm not quite sure how to word it; essentially what we are expecting is that to some quantitative degree, such as 1 in 50, 1 in 100 etc. the measured results will not line up with the expected results. Any and all help will be much appreciated.
 
  • #2
Eruditio said:
What I would like is some advice on how to word these hypotheses. [...] I'm a little unsure on the alternative hypothesis in particular.

Hey Eruditio and welcome to the forums.

In terms of wording hypotheses, the best way to do this is to give an interval for the quantity being tested to fall in. As an example, suppose we are testing whether the population mean is less than 50 (null) or greater than or equal to 50 (alternative). We would write this as:

H0: μ < 50
HA (or H1): μ >= 50

For your chi-square, you need to figure out the interval for the statistic. It might be two-sided or one-sided. An example of a two-sided test would look like this (for chi-square):

H0: χ2 < 10 OR χ2 > 20
H1: 10 <= χ2 <= 20

where χ2 is the calculated statistic (for a chi-square test, χ2 is always >= 0).

So essentially you need to find out the interval and state that. I have a feeling you haven't done much statistics before, and I don't know much about Fraunhofer theory, so maybe you could tell us what you are trying to calculate from your book or source, or point us to a relevant description of the problem if it's on a website.
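As a minimal sketch (not part of chiro's post) of how such a goodness-of-fit statistic might be computed for measurements against theory predictions, assuming Python with numpy/scipy is available, each measurement carries a known Gaussian error, and all numbers below are made up:

Code:
# Hypothetical example: chi-square comparison of measured diffraction
# intensities against Fraunhofer-theory predictions, with known errors.
import numpy as np
from scipy.stats import chi2

observed = np.array([0.98, 0.52, 0.11, 0.04])   # made-up measurements
expected = np.array([1.00, 0.50, 0.10, 0.05])   # made-up theory values
sigma    = np.array([0.02, 0.02, 0.01, 0.01])   # known measurement errors

chi2_stat = np.sum(((observed - expected) / sigma) ** 2)
dof = len(observed)                  # no fitted parameters subtracted here
p_value = chi2.sf(chi2_stat, dof)    # P(chi2 >= chi2_stat) under H0

print(f"chi2 = {chi2_stat:.2f}, dof = {dof}, p = {p_value:.3f}")
# Reject H0 at the 5% level if p_value < 0.05.

Here the one-sided rejection region (large χ2 values) plays the role of the interval in the hypotheses above.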
 
  • #3
Hi Chiro and thank you for the prompt response!

You're right in assuming that I haven't done much statistics before. This isn't a question from a book or other source; I'm performing an independent investigation at university. The goal of the experiment is as described: to quantify to what degree observations agree with Fraunhofer theory. I don't think an understanding of Fraunhofer theory specifically is necessary for the task at hand, but I expect some experience in using statistics in an experimental environment is required (which I'm lacking, unfortunately!).

I see what you're saying with regards to the wording of the hypotheses and the examples you've given certainly are specific. The null hypothesis I've provided is essentially equivalent to stating that χ2 = 0, right?

The difficulty I'm having is that the aim of the experiment isn't as specific as setting out to prove that the theory agrees with observation to a particular quantitative degree, but that it does agree to some quantitative degree, meaning that χ2 > 0, I suppose...?

I'm a little confused about using the test statistic in the hypothesis itself, though. I thought part of the point of the hypotheses was as a sort of justification for using a particular test i.e., the χ2 test requires a null and alternative hypothesis, and its value allows us to reject or fail to reject the null. Including the test statistic in the hypothesis seems a little... circular. I may be wrong, that's why I'm here and asking!
 
  • #4
Eruditio said:
I see what you're saying with regards to the wording of the hypotheses and the examples you've given certainly are specific. The null hypothesis I've provided is essentially equivalent to stating that χ2 = 0, right? [...] Including the test statistic in the hypothesis seems a little... circular.

No, your hypotheses have to involve the quantity that the statistic for that particular test is about (you can test multiple things, but for this test you are only using one statistic). So in other words you would write something like:

H0: σ2 < 4
H1: σ2 >= 4

where σ2 is the population variance. But this can be expressed via the chi-square if you are testing variance, by noting that (n-1)S^2/σ2 ~ χ2_(n-1), where S^2 is the sample variance. You could state the hypotheses in terms of the final statistic and then re-arrange to get σ, or you could just write them first in terms of σ, which is what is usually done because people automatically know that σ refers to the standard deviation parameter.

Does this help?
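A minimal sketch (my own illustration, with made-up data) of a one-sided version of the variance test chiro describes, assuming Python with scipy, using (n-1)S^2/σ2 ~ χ2_(n-1) to test whether the population variance exceeds 4:

Code:
# Hypothetical one-sided test of a population variance using chi-square.
import numpy as np
from scipy.stats import chi2

sample = np.array([3.1, 5.2, 4.8, 6.0, 2.9, 4.4, 5.5, 3.8])  # made-up data
n = len(sample)
s2 = sample.var(ddof=1)               # sample variance S^2
sigma0_sq = 4.0                       # boundary value of the null hypothesis

stat = (n - 1) * s2 / sigma0_sq       # ~ chi2 with n-1 dof at the boundary
critical = chi2.ppf(0.95, df=n - 1)   # 5% significance level
print(f"statistic = {stat:.2f}, critical value = {critical:.2f}")
print("reject H0" if stat > critical else "fail to reject H0")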
 
  • #5
I'm afraid I'm having a bit of trouble understanding your last post chiro.

I can't say that I've seen a lot of null/alternative hypothesis statements before, but the ones I have seen haven't been quite as terse as simply stating an expected value for a statistic. Stating a statistic doesn't seem to be quite as meaningful as making a statement about what is physically expected, but using it to back up those physical claims seems intuitive.

I'm also unsure of how you intend one to use σ2. The issue in the experiment isn't the value of σ2 as every datum carries Gaussian random error drawn from a distribution of known standard deviation. The point of the experiment isn't to determine a value with an associated uncertainty, but to determine to what degree theory and reality are out of line. I'm using chi-squared to assign a confidence level to said degree (e.g., say some measurements agree to within 1 part in 50 with theory, but chi-squared gives a probability of said agreement of 90%).

Sorry if I'm being unclear but this is all quite confusing for me!
 
  • #6
Eruditio said:
I'm using chi-squared to assign a confidence level to said degree (e.g., say some measurements agree to within 1 part in 50 with theory, but chi-squared gives a probability of said agreement of 90%).

Chiro is a very diplomatic and patient person and hasn't criticized your writing and vocabulary - but I will.

What do you mean by "measurements agree to within 1 part in 50 with theory"? You haven't clearly described the problem. One guess is that you have some deterministic theoretical formula y = f(x). You have taken several different values of the control variable x, say x0, x1, ..., and for each value you have measured several different values of y. So, for example, for x = x1, you have a set of measurements y11, y12, y13, ... If that is the case, there are two things involved in a "disagreement" with theory: a given y value can differ from the predicted y value, and, for a difference of a given size, a certain number of the measurements can produce that difference or smaller. What are you asking about? Is this a question about analyzing the set of y's for an individual value of x as a separate problem? Or are you asking about how to test some hypothesis about the entire set of data?

It is advisable, when discussing situations involving probability, to simply use the word "probability" to refer to probability instead of a variety of other words (such as "chance", "uncertainty", "confidence"). In particular, "confidence level" has a technical meaning in statistics: it is a term used in the theory of estimation. What meaning do you wish to convey by the term "confidence level"?
 
  • #7
Eruditio said:
I'm also unsure of how you intend one to use σ2. [...] The point of the experiment isn't to determine a value with an associated uncertainty, but to determine to what degree theory and reality are out of line.

If you want to construct an interval for a hypothesis test of a parameter, it means that one hypothesis specifies an interval of values for that parameter and the other hypothesis covers everything else. Together the two hypotheses cover all the possibilities in your total constraint space.

Since the variance is greater than zero we don't consider all of the real line, so in a typical hypothesis test for the population variance using chi-square we might have a two-sided test, in which one hypothesis relates to one interval and the other relates to everything to the left and right of that interval.

As Stephen Tashi pointed out above, you need to figure out what the interval is and how it corresponds to the parameter you are trying to estimate. An example of testing the population variance given a sample might look something like this:

H0: σ2 = 4
H1: σ2 != 4 (given that σ2 is always >= 0).

Now, if you have a certain confidence level and a value for your sample variance along with your degrees of freedom, you would find the interval that corresponds to σ2 by using the definition of what the chi-square distribution with the appropriate degrees of freedom actually represents.

The chi-squared distribution is used for many things, and only one of them has to do with the classical calculation of the estimator of the population variance, assuming the initial assumptions are met. If the chi-square model is appropriate then we have:

(n-1)S^2/σ2 ~ χ2_(n-1), which can be seen here:

http://en.wikipedia.org/wiki/Variance#Distribution_of_the_sample_variance

What you would do is take the chi-square values corresponding to the interval relating to the 'confidence' (as many scientists and engineers call it), and then find the corresponding interval for your parameter σ2, or even σ. You then put that interval into your H0 and H1 definitions, do the appropriate test, get a test statistic, and evaluate it.

Now I want to say that statistics is not this easy and you can't just treat these tests like 'magic boxes' that work all the time, but if you are asked to perform a test for whatever reason, and there is some level of confidence that it gives a useful answer or some other useful metric, then this is what you do.

So basically, for a 90% interval, the lower endpoint is the value a satisfying P(χ2_n < a) = 0.05 and the upper endpoint is the value b satisfying P(χ2_n < b) = 0.95. This leaves 10% in the tails, which gives you the 90% interval (this is why it's called a 90% one). This is not necessarily the only such interval, but it is the conventional way it's done. You find these values in statistical tables for a given n, or you use a computer to calculate them directly for any probability.
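A minimal sketch (my own illustration, assuming scipy and made-up data) of turning the 90% chi-square interval described above into a 90% interval for the population variance σ2:

Code:
# Hypothetical 90% interval for sigma^2 from the chi-square quantiles.
import numpy as np
from scipy.stats import chi2

sample = np.array([9.8, 10.3, 10.1, 9.7, 10.4, 10.0, 9.9])  # made-up data
n = len(sample)
s2 = sample.var(ddof=1)             # sample variance S^2

chi2_lo = chi2.ppf(0.05, df=n - 1)  # 5% in the lower tail
chi2_hi = chi2.ppf(0.95, df=n - 1)  # 5% in the upper tail

# Since (n-1)*S^2/sigma^2 ~ chi2_(n-1), invert to get the sigma^2 interval.
var_hi = (n - 1) * s2 / chi2_lo     # small chi2 quantile -> large sigma^2
var_lo = (n - 1) * s2 / chi2_hi
print(f"90% interval for sigma^2: ({var_lo:.3f}, {var_hi:.3f})")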
 

1. What is a null hypothesis in a chi-squared test?

A null hypothesis in a chi-squared test is a statement that suggests there is no significant difference between the observed and expected frequencies of a categorical variable. It assumes that any differences between the observed and expected data are due to chance or random error.

2. What is an alternative hypothesis in a chi-squared test?

An alternative hypothesis in a chi-squared test is a statement that suggests there is a significant difference between the observed and expected frequencies of a categorical variable. It is the opposite of the null hypothesis and suggests that any differences between the observed and expected data are not due to chance.

3. How do you determine the expected frequencies in a chi-squared test?

The expected frequencies in a chi-squared test come from the null hypothesis, not from the observed data alone. In a goodness-of-fit test, the expected frequency for each category is the probability of that category under the hypothesized distribution multiplied by the total sample size. In a test of independence on a contingency table, the expected frequency for each cell is its row total multiplied by its column total, divided by the grand total.
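As a minimal sketch (my own made-up numbers) of the expected-frequency calculation for a test of independence on a contingency table:

Code:
# Hypothetical 2x3 contingency table: expected count in each cell is
# (row total * column total) / grand total under the independence hypothesis.
import numpy as np

observed = np.array([[12, 30, 18],
                     [ 8, 25, 27]])

row_totals = observed.sum(axis=1, keepdims=True)
col_totals = observed.sum(axis=0, keepdims=True)
grand_total = observed.sum()

expected = row_totals * col_totals / grand_total
print(expected)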

4. What is the purpose of a chi-squared test?

A chi-squared test is used to determine whether there is a significant association between two categorical variables, or whether observed frequencies are consistent with a hypothesized distribution. It compares the observed data to the data expected under the null hypothesis and calculates a p-value, which is the probability of obtaining results at least as extreme as those observed if the null hypothesis is true. A low p-value suggests that the observed results are unlikely to be due to chance and supports the alternative hypothesis.

5. How do you interpret the results of a chi-squared test?

The results of a chi-squared test are typically interpreted by comparing the calculated p-value to a predetermined significance level (usually 0.05). If the p-value is less than the significance level, the null hypothesis is rejected, and it can be concluded that there is a significant association between the two variables. If the p-value is greater than the significance level, the null hypothesis is not rejected, and there is not enough evidence to suggest a significant association between the variables.
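A minimal sketch (my own illustration, with made-up counts) of running the whole test and comparing the p-value to a 0.05 significance level using scipy:

Code:
# Hypothetical chi-squared test of independence with scipy.
import numpy as np
from scipy.stats import chi2_contingency

observed = np.array([[12, 30, 18],
                     [ 8, 25, 27]])

stat, p_value, dof, expected = chi2_contingency(observed)
alpha = 0.05
if p_value < alpha:
    print(f"p = {p_value:.3f} < {alpha}: reject H0 (evidence of an association)")
else:
    print(f"p = {p_value:.3f} >= {alpha}: fail to reject H0")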
