Which is the mean value of the sample?

  • Context: MHB 
  • Thread starter Thread starter mathmari
  • Start date Start date
  • Tags Tags
    Mean Value
Click For Summary

Discussion Overview

The discussion revolves around determining the mean value of a sample in the context of hypothesis testing. Participants explore the relationship between p-values, null and alternative hypotheses, and the use of t-distribution versus normal distribution for a sample size of 25.

Discussion Character

  • Technical explanation
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • One participant sets up the hypotheses and questions the interpretation of the p-value as a conditional probability.
  • Another participant agrees with the setup and emphasizes the importance of understanding p-values, suggesting the use of the t-distribution due to the sample size being less than 30.
  • Several participants express confusion about the definition of the p-value and its relationship to the null hypothesis, with questions about whether it can be defined as \( P(\mu < 60 \mid H_0) \).
  • There is a discussion about the implications of rejecting the null hypothesis and the interpretation of probabilities under the null hypothesis.
  • Participants explore the calculation of the p-value using the t-distribution and express uncertainty about the correct formulation and execution of calculations.
  • One participant shares their use of an online R compiler to compute values related to the t-distribution, seeking confirmation on their approach.
  • There is clarification on the notation used for sample means versus random variables, with a consensus that both notations refer to the same concept in this context.

Areas of Agreement / Disagreement

Participants generally agree on the setup of the problem and the interpretation of p-values, but there is ongoing confusion and debate regarding the correct application of statistical concepts and calculations. No consensus is reached on the interpretation of certain probabilities or the use of distributions.

Contextual Notes

Participants express uncertainty about the assumptions underlying their calculations, particularly regarding the choice between t-distribution and normal distribution, and the implications of sample size on these choices.

mathmari
Gold Member
MHB
Messages
4,984
Reaction score
7
Hey! :o

We have the following hypotheses: $$H_0: \mu\geq 60 \ \ \ , \ \ \ H_1:\mu<60$$

A test is executed with a sample of size $25$ and an estimated standard deviation $S'=8$.

From the test we get a p-value of 5%. I want to determine the value of the mean of the sample $\overline{X}$.
The p-value is equal to $P(T\leq t\mid H_0)$. So is this in this case equal to
$P(\mu\leq 60\mid H_0)$ ?

If yes, then we have the following: $$p=0.05\Rightarrow \Phi \left (\frac{\overline{X}-60}{\frac{S'}{\sqrt{n}}}\right )=0.05 \Rightarrow \Phi \left (\frac{\overline{X}-60}{\frac{8}{\sqrt{25}}}\right )=0.05\Rightarrow \frac{\overline{X}-60}{\frac{8}{\sqrt{25}}}=\Phi^{-1}(0.05)$$ Or am I thinking wrong?

Do we use here the table of t-distribution?
 
Physics news on Phys.org
Hey mathmari,

I agree with your problem set up. Most people really fail to understand the concept of a p-value as a conditional probability and you very nicely stated how it is one.

There are different rules of thumb for when you can use a normal distribution to approximate a t-distribution, but the sample size seems to be $n > 30$ from what I read right now. Given that our sample is smaller than that, you might be better off using the inverse t function instead.

Overall though your understanding of the solution looks good to me. :)
 
I thought about that again and now I got stuck.

We have the null hypothesis $H_0: \mu\geq 60$ and the alternative hypothesis $H_1:\mu<60$.

The p-value is defined as the probability, under the null hypothesis, of obtaining a result equal to or more extreme than what was actually observed.

At the alternative hypothesis we have that what we've observed, so we have observes that $\mu<60$, or not?

So, is the p-value defined as $$P(\mu< 60\mid H_0)=P(\mu< 60\mid \mu\geq 60)$$ ? Or am I thinking wrong? (Wondering)
 
mathmari said:
I thought about that again and now I got stuck.

We have the null hypothesis $H_0: \mu\geq 60$ and the alternative hypothesis $H_1:\mu<60$.

The p-value is defined as the probability, under the null hypothesis, of obtaining a result equal to or more extreme than what was actually observed.

At the alternative hypothesis we have that what we've observed, so we have observes that $\mu<60$, or not?

So, is the p-value defined as $$P(\mu< 60\mid H_0)=P(\mu< 60\mid \mu\geq 60)$$ ? Or am I thinking wrong? (Wondering)

To reject $H_0$, we want to be really sure that the probability that we are wrong is less than $\alpha$.
The expected value $\mu$ could be greater than $60$, but we pick $\mu=60$ so that if we err, it will be on the safe side.Additionally we cannot observe $\mu$ directly. That is, not unless we measure the entire population.
After all, isn't $P(\mu< 60\mid \mu\geq 60)=0$? $\mu$ cannot both be greater and less than $60$ can it? (Wondering)

Instead we observe the mean $\overline x$ of a sample, which is an instance of the random variable $\overline X$ of sample means.
So we have:
$$p\text{-value} = P(\overline X < \overline x\mid \mu=60)$$
(Thinking)
 
I like Serena said:
To reject $H_0$, we want to be really sure that the probability that we are wrong is less than $\alpha$.
The expected value $\mu$ could be greater than $60$, but we pick $\mu=60$ so that if we err, it will be on the safe side.Additionally we cannot observe $\mu$ directly. That is, not unless we measure the entire population.
After all, isn't $P(\mu< 60\mid \mu\geq 60)=0$? $\mu$ cannot both be greater and less than $60$ can it? (Wondering)

Instead we observe the mean $\overline x$ of a sample, which is an instance of the random variable $\overline X$ of sample means.
So we have:
$$p\text{-value} = P(\overline X < \overline x\mid \mu=60)$$
(Thinking)

Ah ok! So we have the following: $$p\text{-value} = P(\overline X < \overline x\mid \mu=60)=t_{25}\left (\frac{\overline{x}-\mu}{\frac{S'}{\sqrt{n}}}\right )=t_{25}\left (\frac{\overline{x}-60}{\frac{8}{\sqrt{25}}}\right )=t_{25}\left (\frac{\overline{x}-60}{\frac{8}{5}}\right )=t_{25}\left (\frac{5}{8}\left[\overline{x}-60\right ]\right )$$ or not? (Wondering)

So it must be $$t_{25}\left (\frac{5}{8}\left[\overline{x}-60\right ]\right )=0.05$$ I used the online R compiler and got this result. Is this correct, or do I have to write something else there? (Wondering)
 
Last edited by a moderator:
mathmari said:
Ah ok! So we have the following: $$p\text{-value} = P(\overline X < \overline x\mid \mu=60)=t_{25}\left (\frac{\overline{x}-\mu}{\frac{S'}{\sqrt{n}}}\right )=t_{25}\left (\frac{\overline{x}-60}{\frac{8}{\sqrt{25}}}\right )=t_{25}\left (\frac{\overline{x}-60}{\frac{8}{5}}\right )=t_{25}\left (\frac{5}{8}\left[\overline{x}-60\right ]\right )$$ or not?

So it must be $$t_{25}\left (\frac{5}{8}\left[\overline{x}-60\right ]\right )=0.05$$

Since we have $n=25$ we have $df=n-1=25-1$ don't we? (Wondering)
I think we need:
$$t_{25-1}\left (\frac{5}{8}\left[\overline{x}-60\right ]\right )=0.05$$

mathmari said:
I used the online R compiler and got this result. Is this correct, or do I have to write something else there?

If I follow that link, I only see some default commands. (Worried)
What did you execute, and what was its result?
 
I like Serena said:
Since we have $n=25$ we have $df=n-1=25-1$ don't we? (Wondering)
I think we need:
$$t_{25-1}\left (\frac{5}{8}\left[\overline{x}-60\right ]\right )=0.05$$

Ah ok!

I like Serena said:
If I follow that link, I only see some default commands. (Worried)
What did you execute, and what was its result?

Now I wrote [m]qt(0.05, 24)[/m]
Is this wrong? (Wondering)
 
mathmari said:
Now I wrote [m]qt(0.05, 24)[/m]
Is this wrong? (Wondering)

That seems fine to me.
So now we can solve for $\overline x$. (Happy)
 
I like Serena said:
That seems fine to me.
So now we can solve for $\overline x$. (Happy)

We have that \begin{align*}&t_{24}\left (\frac{5}{8}\left[\overline{x}-60\right ]\right )=0.05\Rightarrow \frac{5}{8}\left[\overline{x}-60\right ]=-1.710882 \Rightarrow \frac{5}{8}\overline{x}-37.5=-1.710882 \Rightarrow \frac{5}{8}\overline{x}=35.789118 \\ & \Rightarrow \overline{x}=57.2625888\end{align*}
mathmari said:
I want to determine the value of the mean of the sample $\overline{X}$.

Is $\overline{x}$ what we want to calculate or $\overline{X}$ ? (Wondering)
 
  • #10
It's the same thing.
The question asks for the mean of the observed sample. And that is what you found.
Usually we use lower case for observed measurements, and upper case for random variables. But it's not unusual either that this convention is ignored, or applied sloppily. (Nerd)
 
  • #11
I like Serena said:
It's the same thing.
The question asks for the mean of the observed sample. And that is what you found.
Usually we use lower case for observed measurements, and upper case for random variables. But it's not unusual either that this convention is ignored, or applied sloppily. (Nerd)

Ah ok! Thank you very much! (Sun)
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
2K
Replies
20
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 24 ·
Replies
24
Views
6K
Replies
10
Views
3K
  • · Replies 1 ·
Replies
1
Views
10K
Replies
1
Views
4K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 0 ·
Replies
0
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K