Hypotheses construction for significance testing

Click For Summary

Homework Help Overview

The discussion revolves around formulating null and alternative hypotheses in the context of a decision-making process based on students' scores from a standardized math exam. The scenario involves determining whether the average score is less than the statewide average of 89, which would trigger curriculum changes for math teachers.

Discussion Character

  • Conceptual clarification, Assumption checking

Approaches and Questions Raised

  • Participants explore the appropriateness of using a population parameter (mu) when discussing a sample of junior high students. There is a focus on understanding the implications of rejecting the null hypothesis based on sample data and the potential randomness affecting sample averages.

Discussion Status

Some participants have offered insights into the nature of hypothesis testing and its empirical justification. There is an ongoing exploration of the conventions used in hypothesis formulation, particularly regarding the null hypothesis and its relationship to the alternative hypothesis. Multiple interpretations of the problem and its requirements are being considered.

Contextual Notes

Participants note the subjective nature of applying statistical methods to real-life problems and the potential lack of rigorous justification in such applications. The discussion also highlights the importance of understanding textbook conventions regarding hypothesis testing.

Mr Davis 97
Messages
1,461
Reaction score
44

Homework Statement


State the null and alternative hypotheses.

The decision to implement changes in the current math program at a junior high will be based on a sample of students' scores on a standardized math exam. If the average is less than the statewide average of 89, all math teachers will have to participate in a workshop to revise the curriculum.

Homework Equations


None.

The Attempt at a Solution


Based on previous examples I've seen, I would write:
##\textrm{H}_{0}: \mu = 89##
##\textrm{H}_{a}: \mu < 89##

However, I have no idea why this would be correct or incorrect. Essentially, I want to know why this would be the correct answer (if it is). Since the problem is talking about a small subset of the population of junior high school students, I don't see how talking about mu (a population parameter) in this context would be correct. If the sample of students at the junior high school scored lower than 89 simply because they're not so smart (i.e. not due to random variation), why would we reject the null and conclude that the mu, which refers to the entire population, is lower based on that one sample? I don't understand.
 
Physics news on Phys.org
Mr Davis 97 said:
Since the problem is talking about a small subset of the population of junior high school students, I don't see how talking about mu (a population parameter) in this context would be correct.

Applying statistics to real life problems is subjective, so don't expect to find rigorous mathematical justification for using a particular method in real life. The concepts of "reject" and "accept" are not the same as the mathematical concepts of "prove" and "disprove". Hypothesis testing is a procedure not a method of proof. Decisions reached by applying it are not guaranteed to be correct. The justification for using hypothesis testing is empirical. Forms of it have been found useful. There are many different ways to do hypothesis testing. Particular ways of doing it have become traditional in some fields of science.

Applying statistics to textbook exercises is reasonably objective because the author of the textbook expects the problem to be solved in a particular way!

Is this is merely a verbal problem about the concept of "null hypothesis", a problem that gives no specific test scores?

You have to understand the conventions of the textbook. The textbook may want you to say that the null hypothesis is \mu \ge 89 because there would be no justification for revising the curriculum if it was "better than average". On the other hand, to work a problem with given data, you must have a hypothesis that gives enough information to do numerical computations. For example, if you had the students' test scores, then a numerical calculation could be done with a probability distribution that had \mu = 89 , while the information \mu \ge 89 doesn't describe a specific distribution.

You have to determine what convention the textbook uses. Does it insist that the null hypothesis is precisely the logical negation of the "alternative hypothesis"? Or does it say the null hypothesis is the information used to do a numerical calculation? There may be books that aren't consistent in this terminology, in which case you have to figure out what's going on in the current chapter.
 
Mr Davis 97 said:

Homework Statement


State the null and alternative hypotheses.

The decision to implement changes in the current math program at a junior high will be based on a sample of students' scores on a standardized math exam. If the average is less than the statewide average of 89, all math teachers will have to participate in a workshop to revise the curriculum.

Homework Equations


None.

The Attempt at a Solution


Based on previous examples I've seen, I would write:
##\textrm{H}_{0}: \mu = 89##
##\textrm{H}_{a}: \mu < 89##

However, I have no idea why this would be correct or incorrect. Essentially, I want to know why this would be the correct answer (if it is). Since the problem is talking about a small subset of the population of junior high school students, I don't see how talking about mu (a population parameter) in this context would be correct. If the sample of students at the junior high school scored lower than 89 simply because they're not so smart (i.e. not due to random variation), why would we reject the null and conclude that the mu, which refers to the entire population, is lower based on that one sample? I don't understand.

Try to think of the effects of "randomness". If the current program had a theoretical average of 89 or better, we would be happy to keep it, but even in that case, the average in a small sample (say in a single junior high) could come out a bit less than 89, just by "chance". Realizing this, you might say that if the sample's average is a bit less than 89 there is not enough justification to warrant the inconvenience, disruption and expense of revising the curriculum. However, if the school's average comes out a lot less than 89, you would be strongly inclined to say the curriculum needs revision. So, in a problem like this you might typically test ##\text{H}_0: \mu = 89## vs. ##\text{H}_1: \mu < 89##, and reject ##\text{H}_0##, hence, accept ##\text{H}_1##---thus taking some possibly expensive revision actions---if the sample mean falls below some critical value ##m < 89##. You would typically like to choose ##m## so that there would not be too great a chance of adjusting the process unneccessarily. That is, even though we might wrongly adjust a perfectly good current system occasionally (just because of random fluctuations), we do not want to do that too often. So, if ##\mu = 89## is true, you want to use a critical value ##m < 89## so that ##P(\text{sample average} \leq m)= p##, for some "small" ##p##, such as ##p =## 0.10 or .05 or .01, for example.
 
Ray Vickson said:
Try to think of the effects of "randomness". If the current program had a theoretical average of 89 or better, we would be happy to keep it, but even in that case, the average in a small sample (say in a single junior high) could come out a bit less than 89, just by "chance". Realizing this, you might say that if the sample's average is a bit less than 89 there is not enough justification to warrant the inconvenience, disruption and expense of revising the curriculum. However, if the school's average comes out a lot less than 89, you would be strongly inclined to say the curriculum needs revision. So, in a problem like this you might typically test ##\text{H}_0: \mu = 89## vs. ##\text{H}_1: \mu < 89##, and reject ##\text{H}_0##, hence, accept ##\text{H}_1##---thus taking some possibly expensive revision actions---if the sample mean falls below some critical value ##m < 89##. You would typically like to choose ##m## so that there would not be too great a chance of adjusting the process unneccessarily. That is, even though we might wrongly adjust a perfectly good current system occasionally (just because of random fluctuations), we do not want to do that too often. So, if ##\mu = 89## is true, you want to use a critical value ##m < 89## so that ##P(\text{sample average} \leq m)= p##, for some "small" ##p##, such as ##p =## 0.10 or .05 or .01, for example.

Stephen Tashi said:
Applying statistics to real life problems is subjective, so don't expect to find rigorous mathematical justification for using a particular method in real life. The concepts of "reject" and "accept" are not the same as the mathematical concepts of "prove" and "disprove". Hypothesis testing is a procedure not a method of proof. Decisions reached by applying it are not guaranteed to be correct. The justification for using hypothesis testing is empirical. Forms of it have been found useful. There are many different ways to do hypothesis testing. Particular ways of doing it have become traditional in some fields of science.

Applying statistics to textbook exercises is reasonably objective because the author of the textbook expects the problem to be solved in a particular way!

Is this is merely a verbal problem about the concept of "null hypothesis", a problem that gives no specific test scores?

You have to understand the conventions of the textbook. The textbook may want you to say that the null hypothesis is \mu \ge 89 because there would be no justification for revising the curriculum if it was "better than average". On the other hand, to work a problem with given data, you must have a hypothesis that gives enough information to do numerical computations. For example, if you had the students' test scores, then a numerical calculation could be done with a probability distribution that had \mu = 89 , while the information \mu \ge 89 doesn't describe a specific distribution.

You have to determine what convention the textbook uses. Does it insist that the null hypothesis is precisely the logical negation of the "alternative hypothesis"? Or does it say the null hypothesis is the information used to do a numerical calculation? There may be books that aren't consistent in this terminology, in which case you have to figure out what's going on in the current chapter.

Thank you for the input, it helps. However, my main question still seems to be unanswered. My main question is why in this problem are we basing what the population mean test score is, 89, on whether this specific junior high school in question has a mean below or around 89? Why should we reject the null hypothesis that the mean test score in the state is 89 given that the one specific school scores much lower than 89? Wouldn't this just indicate that the students in the specific school just aren't very smart, rather than indicate that the population mean is lower than 89?
 

Similar threads

Replies
6
Views
4K
  • · Replies 5 ·
Replies
5
Views
4K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 43 ·
2
Replies
43
Views
6K
  • · Replies 27 ·
Replies
27
Views
3K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
Replies
24
Views
3K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 5 ·
Replies
5
Views
3K