The t-test and the central limit theorem

TytoAlba95 · May 15, 2019

Ans is 3.

I know the basic t-test, but I have no clue how to solve this question.
Thanks.

mjc123 · May 15, 2019

It would appear that by "what dry weight values would lead...?" they mean "what hypothesised values of the mean dry weight would lead...?" Which seems odd when they've given you a hypothesised value already. It sounds as if they're talking about measured values, but that would make no sense. Mean, standard deviation and hypothesised mean are all you need for a t-test; if they were talking about rejecting outliers that would be a different test.

TytoAlba95 · May 17, 2019

I'm sorry, I couldn't understand what you said.
The book I'm following provides a solution to this question, but it is too complex for me. It talks about the central limit theorem.

mjc123 · May 17, 2019

What in particular don't you understand? You say you "know basic t-test". Do you mean you know a formula that you apply blindly, or do you understand where it comes from?
The question as you state it (have you copied it correctly?) is badly expressed and unclear. The answer also contains mistakes, e.g. in the last line "does not contain" should be "contains". The paragraph immediately after the diagram should read
"... there is a 95% probability that the interval μ ± 2.1(σ/√n) contains Y. Likewise, there is a 95% probability that the interval specified by Y ± 2.1(σ/√n) contains μ."
If the value of the hypothesised mean was greater than 5.22 or less than 4.38, you would reject the hypothesis on the basis of the data. This is what answer 3 must mean. But they tell you that the hypothesised mean is 4.5. It makes no sense.
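
For anyone wanting to check where 4.38 and 5.22 come from, here is a minimal sketch. It assumes the sample figures used in this thread (mean 4.8 mg, s = 0.8 mg, critical t = 2.1) and n = 16, which is inferred from df = 15 rather than stated outright; the function name is only illustrative.

```python
import math

# Figures from the thread; n = 16 is an assumption inferred from df = 15.
sample_mean = 4.8   # mg
sample_sd = 0.8     # mg
n = 16
t_crit = 2.1        # two-tailed critical value quoted in the question

def non_rejection_interval(mean, sd, n, t_crit):
    """Range of hypothesised means that would NOT be rejected."""
    half_width = t_crit * sd / math.sqrt(n)
    return mean - half_width, mean + half_width

low, high = non_rejection_interval(sample_mean, sample_sd, n, t_crit)
print(f"Reject any hypothesised mean outside [{low:.2f}, {high:.2f}]")
print("Is mu = 4.5 rejected?", not (low <= 4.5 <= high))
```

Under these assumptions the hypothesised mean of 4.5 falls inside the interval, so it would not be rejected.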

TytoAlba95 · May 17, 2019

Sorry, I realise that saying 'I know basic t-test' was too vague. I only know how to apply the formulae; I should have mentioned that.
I have copied the sum correctly (there's no mistake...).

My Attempt :
Ho= There's no difference between the sample mean (ȳ =4.8 mg) and the population mean μ=4.5.
Ha= There's difference between the sample mean and population mean. The sample doesn't belong to the population.

t_cal= (ȳ -μ)/SE
here SE=s/√n-1.
t_cal= {4.8-4.5}/(0.8/3.8) = 0.3/.21 = 0.38

My t_cal is greater than the given t_0.05 = 2.1 (though t_0.05, df=15 = 1.7), so the Ho is rejected. The sample does not belong to the population.

what dry weight values would lead to rejection of the null hypothesis at p = 0.05 level?

From the above quote it appears to me that the Ho should not have been rejected, and the question is asking the hypothetical μ for which it will be rejected.

Then again, after checking the solution I got more confused about confidence intervals and the central limit theorem. (Could you suggest some easy reads on these terms?)

I hope I could make my current understanding of this sum more clear.

mjc123 · May 20, 2019

I'm afraid your post illustrates that "only knowing how to apply formulae" without understanding them means that you will sometimes apply them wrongly.
Your H₀ is nonsensical - of course there is a difference between 4.8 and 4.5. Do you mean "the difference is not statistically significant at the 0.05 level"? Similarly with H_a: "There is a statistically significant difference..."
What do you mean by SE = s/√n - 1. As written it is ambiguous. Do you mean s/√(n - 1), or s/(√n - 1), or (s/√n) - 1? I suspect the first, but that is wrong - it should be s/√n.
0.3/.21 is not 0.38, it is 1.43 - which is still not greater than 2.1, so why do you reject the hypothesis? And "t_0.05,15 = 1.7" is wrong - that is a single-tailed value, while you are looking for the two-tailed value.
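
As a rough check of the corrected calculation, the sketch below recomputes the statistic with SE = s/√n. It assumes n = 16 (inferred from df = 15) and uses the two-tailed critical value 2.1 given in the question; the function name is hypothetical.

```python
import math

def one_sample_t(sample_mean, hypothesised_mean, sd, n):
    """t statistic for a one-sample test, using SE = s / sqrt(n)."""
    standard_error = sd / math.sqrt(n)
    return (sample_mean - hypothesised_mean) / standard_error

# Figures from the thread; n = 16 is an assumption inferred from df = 15.
t_cal = one_sample_t(4.8, 4.5, 0.8, 16)
t_crit = 2.1  # two-tailed critical value at the 0.05 level, as given

# Two-tailed test: compare the absolute value of t with the critical value.
reject = abs(t_cal) > t_crit
print(f"t_cal = {t_cal:.2f}, reject H0: {reject}")
```

With s/√n the statistic is 0.3/0.2 = 1.5, which is still below 2.1, so the hypothesis is not rejected.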

SanjuktaGhosh said:

From the above quote it appears to me that the Ho should not have been rejected, and the question is asking the hypothetical μ for which it will be rejected.

I agree with you. That was what I was saying, I'm sorry if it wasn't clear.
There's no real substitute for understanding where the t distribution comes from and how the t test should be used. Unfortunately, I can't recommend a simple source for you, as the book I learned from is probably unobtainable now, and the Wikipedia article looks fearsomely mathematical. Perhaps someone else could help with suggestions?
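
In the meantime, a simulation can give some feel for where the t distribution comes from. The sketch below is only an illustration under stated assumptions: it draws many samples of n = 16 from a normal population whose true mean really is 4.5 (σ = 0.8 is borrowed from the sample SD, purely for convenience), computes the t statistic for each, and counts how often |t| exceeds 2.131, the two-tailed 5% critical value for 15 degrees of freedom. All function and parameter names are hypothetical.

```python
import math
import random

def t_statistic(sample, mu):
    """One-sample t statistic with s computed using the n - 1 divisor."""
    n = len(sample)
    mean = sum(sample) / n
    sum_sq = sum((x - mean) ** 2 for x in sample)
    s = math.sqrt(sum_sq / (n - 1))
    return (mean - mu) / (s / math.sqrt(n))

def rejection_rate(n=16, mu=4.5, sigma=0.8, t_crit=2.131,
                   trials=20000, seed=0):
    """Fraction of samples rejected when the null hypothesis is true."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    rejections = 0
    for _ in range(trials):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        if abs(t_statistic(sample, mu)) > t_crit:
            rejections += 1
    return rejections / trials

rate = rejection_rate()
print(f"Rejected a true H0 in {rate:.1%} of simulated samples")
```

The rejection rate should come out close to 5%, which is what "significant at the 0.05 level" means: a true hypothesis is rejected about one time in twenty by chance alone.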

TytoAlba95 · May 21, 2019

mjc123 said:

I'm afraid your post illustrates that "only knowing how to apply formulae" without understanding them means that you will sometimes apply them wrongly.
Your H₀ is nonsensical - of course there is a difference between 4.8 and 4.5. Do you mean "the difference is not statistically significant at the 0.05 level"? Similarly with H_a: "There is a statistically significant difference..."
What do you mean by SE = s/√n - 1. As written it is ambiguous. Do you mean s/√(n - 1), or s/(√n - 1), or (s/√n) - 1? I suspect the first, but that is wrong - it should be s/√n.

Oh! I didn't know it should be s/√n, I was taught SE=s/√(n-1).
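
One possible source of this mismatch (an assumption; the thread does not say how s was defined in the course) is that s/√(n-1) is correct when s is computed with divisor n, while s/√n is correct when s is computed with divisor n - 1. The sketch below, using made-up weights, shows that the two conventions give the same standard error.

```python
import math
import statistics

# Hypothetical dry weights in mg, for illustration only.
weights = [4.1, 5.3, 4.8, 5.6, 3.9, 4.7, 5.2, 4.4]
n = len(weights)

sd_divisor_n_minus_1 = statistics.stdev(weights)   # divisor n - 1
sd_divisor_n = statistics.pstdev(weights)          # divisor n

# Each SD must be paired with its matching denominator.
se_from_sample_sd = sd_divisor_n_minus_1 / math.sqrt(n)
se_from_population_sd = sd_divisor_n / math.sqrt(n - 1)

print(f"s (n-1 divisor) / sqrt(n)   = {se_from_sample_sd:.4f}")
print(f"s (n divisor)   / sqrt(n-1) = {se_from_population_sd:.4f}")
```

Mixing the conventions, for example dividing an s that already uses n - 1 by √(n-1), overstates the standard error slightly.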

0.3/.21 is not 0.38, it is 1.43 - which is still not greater than 2.1, so why do you reject the hypothesis? And "t_0.05,15 = 1.7" is wrong - that is a single-tailed value, while you are looking for the two-tailed value.

Sorry, those were silly mistakes.

I agree with you. That was what I was saying, I'm sorry if it wasn't clear.
There's no real substitute for understanding where the t distribution comes from and how the t test should be used. Unfortunately, I can't recommend a simple source for you, as the book I learned from is probably unobtainable now, and the Wikipedia article looks fearsomely mathematical. Perhaps someone else could help with suggestions?

Yes, Wikipedia is too mathematical.
Can I create a post and ask other biologists for book recommendations?

mjc123 · May 21, 2019

I suggest posting a request in the biology forum, in case they aren't looking here.

mjc123 · May 21, 2019

The book I used was "Statistics for Mathematicians" by D.J.Finney; it appears that second-hand copies are available on Amazon. The same author also appears to have written a somewhat shorter "Statistics for Biologists", which costs about £100 new, but used copies are available much cheaper. I don't know how much easier it is.

TytoAlba95 · May 22, 2019

mjc123 said:

The book I used was "Statistics for Mathematicians" by D.J.Finney; it appears that second-hand copies are available on Amazon. The same author also appears to have written a somewhat shorter "Statistics for Biologists", which costs about £100 new, but used copies are available much cheaper. I don't know how much easier it is.

Thank you for helping me so much and bearing with me.
I'll post in Biology/Medical forum.
