nomadreid said:
I was assuming (always dangerous) that I had sufficient information from the sampling distribution in order to figure out standard deviations and means, not merely the one statistic. Otherwise put, I assumed that I would have more than 30 samples, each one giving me its value of that statistic. In this case I could figure out whether the given statistic was at least two standard deviations from the mean. But one phrase you used puzzled me: you wrote of a "normally distributed statistic". Since a statistic is a single number,
In statistics, a "statistic" is not a single number. (In layman's terms a "statistic" is a single number as in "2.4 children per family". Even statisticians lapse into this kind of speech, so perhaps we must regard "statistic" as being an ambiguous term. However, it has only one "high class" definition in mathematical statistics.)
Since you have a background in pure math, you should appreciate the need for precise terminology. Technically a statistic is a function, not a single value. For example, the mean of a sample of 60 indpendent realizations of a random variable X is a statistic and it is defined by the formula that adds up the 60 realizations and divides by 60.
You could consider an individual realization of a random variable X as a statistic. However, unless you know that X is normally distributed, you don't know that the statistic defined by one realization is a normal random variable. By contrast, the Central Limit Theorem tells you that the statistic defined by the mean of (say) 60 independent samples of X has an approximately normal distribution even if X doesn't.
do you mean:
(a) a statistic calculated from a normally distributed sample (from a population which is assumed to be normally distributed), or
(b) a statistic which is one of N>30 statistics, so that the N statistics are themselves normally distributed?
Thanks again for any further help.
I mean neither of the above. A statistic may have a normal distribution without being a function of a sample from a normally distributed population. A statistic based on fewer that 30 realizations of a normal random variable may be normally distributed. So the two choices above imply restrictions that do not apply. By normally distributed statistic, I simply mean a random variable S with a normal distribution. (Since a statistic S os a function of sample values, it is a function of random variables, hence the statistic S is a random varible itself.)
I think you are incorrectly narrowing the scenario for hypothesis testing in your mind. You think of it only in terms of using a statistic S that is the mean value of some realizations. A statistic can be any function of the sample values. For example, we can define a statistic S1 to be the minimum value that appears in 60 independent realizations of a random variable. (See "order statistics".)
The fact that a statistic is a random variable causes mathematical statistics to have a recursive nature. A statistic S is a function of realizations of a random variable X, so S is a random variable itself. It has its own probability distribution. The distribution usually has its own mean, variance etc. In general, these parameters need not be the same as those of the distribution of X. (There is also no requirement that a statistic be a function of
independent samples of a random variable even though the most of commonly used statistics are. )