Distribution of the sample mean of an exponential distribution

Click For Summary

Discussion Overview

The discussion revolves around finding the distribution of the sample mean (Ybar) of a random sample drawn from an exponential distribution with a mean of 8. Participants explore the theoretical implications of this distribution, particularly focusing on its variance and potential forms, such as the Erlang distribution.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant questions how to find the distribution of Ybar, noting that the variance of the sample mean cannot also be 64 if the mean is 8.
  • Another participant suggests computing the five-fold convolution of the exponential distribution, prompting a question about the participant's familiarity with convolutions.
  • Some participants express uncertainty about the distribution of Ybar, with one suggesting it may have a gamma distribution, specifically the Erlang distribution.
  • A later reply elaborates on the Erlang distribution, explaining that the distribution of the sum of k independent identically distributed (iid) exponential variables is described by this distribution.
  • One participant provides a formula for the probability density function (pdf) of the Erlang distribution and discusses how to derive the distribution of the sample mean from it.
  • Another participant raises a concern about a potential mistake in the provided formula and mentions a separate problem regarding the distribution of the sample variance of iid exponential distributions.

Areas of Agreement / Disagreement

Participants generally agree that the distribution of Ybar is not exponential and explore the possibility of it being Erlang or gamma. However, there is no consensus on the exact formulation or approach to derive the distribution, and some participants express uncertainty about their contributions.

Contextual Notes

There are unresolved assumptions regarding the application of convolution and the specific parameters of the Erlang distribution. The discussion also highlights the complexity of deriving the distribution of the sample variance, which remains unaddressed.

Who May Find This Useful

This discussion may be useful for students and practitioners interested in statistical distributions, particularly those dealing with exponential distributions and their properties in sampling contexts.

buggy418
Messages
2
Reaction score
0
Let's say you have a random sample of 5 values that are drawn from an exponential distribution with a mean of 8.

How do I find the distribution of Ybar, which is the sample mean of the 5 random variables? [Note: Ybar = 1/5(Y₁+Y₂+Y₃+Y₄+Y₅)]

I know that for an exponential distribution with mean 8 (i.e. Y~exp(8)), the variance would be 64.
So it seems like the distribution of Ybar can't also be exponential, since the variance is supposed to be the mean squared. I figure the mean of Ybar will be 8, but the variance must be something other than 64.
I don't know what approach to take...this seems harder than the approach for a normal distribution.
 
Physics news on Phys.org
You need to compute the five fold convolution of the exponential distribution. Have you studied convolutions?
 
buggy418 said:
How do I find the distribution of Ybar, which is the sample mean of the 5 random variables? [Note: Ybar = 1/5(Y₁+Y₂+Y₃+Y₄+Y₅)]


So it seems like the distribution of Ybar can't also be exponential

True. What distribution would you expect for the sample means?
 
I think it may have a gamma distribution of some sort. We've learned the method of distribution functions. So maybe if I let V=5Y, and then found the distribution of V/5 that would work? But it seems like that's just going to make me end up at the exponential distribution again, since V/5 = 5Y/5 = Y.
 
buggy418 said:
So maybe if I let V=5Y

No, that would imply you had 5 copies of the same outcome, not 5 independent outcomes. You're going to have to do a convolution or look up the answer in a reference. Generating functions might help with a convolution.
 
Sorry to dig up a two-month old thread, but in the course of solving a different problem, I may have found a straight-forward answer to the problem presented by the OP. Here it is, for future reference.

The OP was on the right lines with the Gamma distribution. In particular, it is the Erlang distribution, which is a special case of the Gamma distribution, that is appropriate in this case.

Recall that the distribution of the sum of k iid exponential distributions is described by the Erlang distribution. That is,
Erl(k,r) ~ Exp(r) + … + Exp(r). (Sum of k exponential distros.)
Here we use r to denote the rate parameter (more commonly denoted by lamda), where r = 1/mean.

The pdf of the Erlang distro is given by
f( x; k, r ) = ( r^k * x^(k-1) * e^(-r*x) ) / (k-1)! .
(Wiki page for more info on the Erlang distro: http://en.wikipedia.org/wiki/Erlang_distribution )

So, to find the distribution of the sample mean of k values drawn from k iid exponential distributions we simply need to find
1/k * Erl(k,r).
This is a scalar multiple of a random variable. The transformation of the random variable yields a distribution with pdf
f'( x; k, r ) = f( x*k; k, r ) = ( r^k * (x*k)^(k-1) * e^(-r*x*k) ) / (k-1)!.

In the OP's case we have k=5; plugging this into the pdf gives
f'( x; 5, r ) = (625/24) * e^(-5*x*r) * r^5 * x^4.

This is my first post here. I may have made a mistake. A quick numerical test gives similar results. Also the convolution method for k=2 gives the same result.

In case anyone's interested, my own problem is to find the distribution of the sample *variance* of k iid exponential distributions. I have yet to find the solution.
 
mattjw said:
Sorry to dig up a two-month old thread, but in the course of solving a different problem, I may have found a straight-forward answer to the problem presented by the OP. Here it is, for future reference.

The OP was on the right lines with the Gamma distribution. In particular, it is the Erlang distribution, which is a special case of the Gamma distribution, that is appropriate in this case.

Recall that the distribution of the sum of k iid exponential distributions is described by the Erlang distribution. That is,
Erl(k,r) ~ Exp(r) + … + Exp(r). (Sum of k exponential distros.)
Here we use r to denote the rate parameter (more commonly denoted by lamda), where r = 1/mean.

The pdf of the Erlang distro is given by
f( x; k, r ) = ( r^k * x^(k-1) * e^(-r*x) ) / (k-1)! .
(Wiki page for more info on the Erlang distro: http://en.wikipedia.org/wiki/Erlang_distribution )

So, to find the distribution of the sample mean of k values drawn from k iid exponential distributions we simply need to find
1/k * Erl(k,r).
This is a scalar multiple of a random variable. The transformation of the random variable yields a distribution with pdf
f'( x; k, r ) = f( x*k; k, r ) = ( r^k * (x*k)^(k-1) * e^(-r*x*k) ) / (k-1)!.

In the OP's case we have k=5; plugging this into the pdf gives
f'( x; 5, r ) = (625/24) * e^(-5*x*r) * r^5 * x^4.

This is my first post here. I may have made a mistake. A quick numerical test gives similar results. Also the convolution method for k=2 gives the same result.

In case anyone's interested, my own problem is to find the distribution of the sample *variance* of k iid exponential distributions. I have yet to find the solution.

A factor of k seems to be missing from your f', but anyway if you just need to find the sample variance for sample size k, it's just (1/k) times the variance of the exponential distribution.
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
3K
Replies
1
Views
1K
  • · Replies 31 ·
2
Replies
31
Views
4K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 5 ·
Replies
5
Views
3K
Replies
5
Views
6K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K