Average of 3 normally distributed random samples

Click For Summary

Homework Help Overview

The discussion revolves around calculating the average and sum of three normally distributed random samples. Participants are exploring the implications of combining independent random variables and how this affects the mean and standard deviation of the resulting distributions.

Discussion Character

  • Conceptual clarification, Mathematical reasoning, Assumption checking

Approaches and Questions Raised

  • Participants discuss the calculation of z-scores and the derivation of means and standard deviations for averages and sums of random variables. There are questions about the underlying principles of combining distributions and the application of the Central Limit Theorem.

Discussion Status

Some participants have provided insights into the calculations for parts (ii) and (iii), discussing the expected values and variances. There is an ongoing exploration of the assumptions regarding independence and covariance among the variables.

Contextual Notes

Participants note that certain concepts, such as the derivation of standard deviations from averages and sums of random variables, may not have been fully covered in class, leading to confusion. There is also mention of the need to clarify the treatment of covariance in this context.

TheBigDig
Messages
65
Reaction score
2
Homework Statement
A new university teacher of a large first year class is uncertained how to distribute grades. She decides to assume that the scores will be normally distributed with mean ##\mu## and standard deviation ##\sigma##. There are 5 grades (A,B,C,D,E) with A being the highest.

(i) Suppose that the scores are normally distributed with mean 50 and standard deviation 15. What percentage of students will fail if the pass mark is 40%?

(ii) If a random sample of three scripts is selected by the external examiner for detailed scrutiny, what is the probability that the average score for these scripts will be less than 40?

(iii) Explain why it would be surprising if the sum of the 3 scores exceeds 250.
Relevant Equations
$$Z = \frac {x- \mu} {\sigma},$$
I've found part (i) by calculating the z-score for 40
$$Z = \frac {40- 50} {15} = -0.67$$
$$N(-0.67) = 1- N(0.67) $$
$$1- N(0.67) = 1-0.7486 = 0.2514$$

But parts (ii) and (iii) are confusing me. I have answers provided by my professor that say the mean and std deviation for (ii) and (iii) are
(ii) ##\mu = 50, \sigma = 15/\sqrt(3)##
(iii) ##\mu = 150, \sigma = 15\sqrt(3)##
but I'm not sure where the standard deviations come from as it's not something we've covered in class.
 
Physics news on Phys.org
Hi there,

TheBigDig said:
But parts (ii) and (iii) are confusing me. I have answers provided by my professor that say the mean and std deviation for (ii) and (iii) are
(ii) ##\mu = 50, \sigma = 15/\sqrt(3)##
(iii) ##\mu = 150, \sigma = 15\sqrt(3)##
but I'm not sure where the standard deviations come from as it's not something we've covered in class.

So these results come from the combination of independent random variables. We realize that we need to find the parameters for the new normal distributions.

For part (ii), we are picking three tests at random X_1, X_2, and X_3. We are taking the scores and averaging them so we can define a new variable Y to be our average score
Y = \frac{X_1 + X_2 + X_3}{3}
Now if we want to find the mean (expected value) and variance of our new distribution, then we can use the fact that (where a and b are constants)
E[aX + b] = aE[X] + b
and
Var(aX + b) = a^2 Var(X)
Therefore, given that E[X_i] = 50 and Var(X) = 15:
E[Y] = \frac{X_1 + X_2 + X_3}{3} = \frac{1}{3} E[X_1] + \frac{1}{3} E[X_2] + \frac{1}{3} E[X_3] = 50
and
Var(Y) = \left( \frac{1}{3} \right)^2 \left( Var(X_1) + Var(X_2) + Var(X_3) \right) = \frac{15^2}{3}
Therefore, the standard deviation is: \sigma_Y = \sqrt{Var(Y)} = \frac{15}{\sqrt{3}}

For part(iii), we now don't want the average of the three scores, but instead just the sum. So we can define the random variable Z as:
Z = X_1 + X_2 + X_3
Using similar results to above, the standard deviation can be derived. Let me know if you aren't able to get to the result. Please note that we are adding the three test scores rather than multiplying by 3 - there is a subtle difference when it comes to the resultant variance (can discuss this below, but don't want to confuse you unnecessarily).

These are results from combining random variables and align with results from the 'Central Limit Theorem', which I would recommend looking up (there is plenty of material out there) and can be generalized for a general number n of distributions combined.

Hope that was of some help. There are some general formulae for combining normal distributions
 
Did you not cover the facts that if X_1 and X_2 are independent and normally distributed with means \mu_1 and \mu_2 and variances \sigma_1^2 and \sigma_2^2 then
  • X_1 + X_2 is normally distributed with mean \mu_1 + \mu_2 and mean \sigma_1^2 + \sigma_2^2, and
  • \alpha X_1 is normally distributed with mean \alpha\mu_1 and variance \alpha^2 \sigma_1^2
and the consequence that if a sample of size n is taken from a normal distribution with mean \mu and variance \sigma^2 then the sample mean is normally distributed with mean \mu and variance \sigma^2/n?
 
Master1022 said:
Hi there,
So these results come from the combination of independent random variables. We realize that we need to find the parameters for the new normal distributions.

For part (ii), we are picking three tests at random X_1, X_2, and X_3. We are taking the scores and averaging them so we can define a new variable Y to be our average score
Y = \frac{X_1 + X_2 + X_3}{3}
Now if we want to find the mean (expected value) and variance of our new distribution, then we can use the fact that (where a and b are constants)
E[aX + b] = aE[X] + b
and
Var(aX + b) = a^2 Var(X)
Therefore, given that E[X_i] = 50 and Var(X) = 15:
E[Y] = \frac{X_1 + X_2 + X_3}{3} = \frac{1}{3} E[X_1] + \frac{1}{3} E[X_2] + \frac{1}{3} E[X_3] = 50
and
Var(Y) = \left( \frac{1}{3} \right)^2 \left( Var(X_1) + Var(X_2) + Var(X_3) \right) = \frac{15^2}{3}
Therefore, the standard deviation is: \sigma_Y = \sqrt{Var(Y)} = \frac{15}{\sqrt{3}}

For part(iii), we now don't want the average of the three scores, but instead just the sum. So we can define the random variable Z as:
Z = X_1 + X_2 + X_3
Using similar results to above, the standard deviation can be derived. Let me know if you aren't able to get to the result. Please note that we are adding the three test scores rather than multiplying by 3 - there is a subtle difference when it comes to the resultant variance (can discuss this below, but don't want to confuse you unnecessarily).

These are results from combining random variables and align with results from the 'Central Limit Theorem', which I would recommend looking up (there is plenty of material out there) and can be generalized for a general number n of distributions combined.

Hope that was of some help. There are some general formulae for combining normal distributions

Hi there! Thank you so much for your help. As it turns out we have covered this before but I didn't consider that these manipulations could be performed when it was three variables taken from the same data set if you understand me. I thought it only applied to normal variables with different means and variances but I see now why it makes sense. I got the standard deviation in part (iii) but I looked into it and see what you mean about it being more nuanced than just multiplying them by three

$$var(aX + bY +cZ) = a^2Var(X)+b^2Var(Y)+c^2Var(Z)+2abCov(X,Y)+2acCov(X,Z)+2bcCov(Y,Z)$$

I assume in this example since they all come from the dataset we can take covariance to be zero?

pasmith said:
Did you not cover the facts that if X_1 and X_2 are independent and normally distributed with means \mu_1 and \mu_2 and variances \sigma_1^2 and \sigma_2^2 then
  • X_1 + X_2 is normally distributed with mean \mu_1 + \mu_2 and mean \sigma_1^2 + \sigma_2^2, and
  • \alpha X_1 is normally distributed with mean \alpha\mu_1 and variance \alpha^2 \sigma_1^2
and the consequence that if a sample of size n is taken from a normal distribution with mean \mu and variance \sigma^2 then the sample mean is normally distributed with mean \mu and variance \sigma^2/n?

Thank you as well for your help!
 
TheBigDig said:
I assume in this example since they all come from the dataset we can take covariance to be zero?

Yes, we can assume they are independent so that the covariance is 0
 

Similar threads

Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
Replies
7
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 8 ·
Replies
8
Views
4K
  • · Replies 6 ·
Replies
6
Views
2K
Replies
1
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
Replies
12
Views
2K
  • · Replies 6 ·
Replies
6
Views
3K