Estimate p from sample of two Binomially Distributions

Click For Summary
SUMMARY

The discussion centers on estimating the parameter p from two independent Binomial distributions, X~B(5,p) and Y~B(7,p), given sample values x=3 and y=5. The best minimum-variance unbiased estimate of p is derived using Maximum Likelihood Estimation (MLE), leading to the conclusion that p=2/3. The participants clarify the correct likelihood function to use, emphasizing that the convolution of distributions is not applicable in this context since only one outcome from each distribution is sampled.

PREREQUISITES
  • Understanding of Binomial distributions, specifically B(5,p) and B(7,p).
  • Familiarity with Maximum Likelihood Estimation (MLE) techniques.
  • Knowledge of the Cramer-Rao lower bound and its implications for unbiased estimators.
  • Ability to differentiate and solve equations involving likelihood functions.
NEXT STEPS
  • Study the derivation of Maximum Likelihood Estimators in detail.
  • Learn about the Cramer-Rao lower bound and its applications in statistics.
  • Explore the concept of convolution in probability distributions and its limitations.
  • Practice solving problems involving multiple independent Binomial distributions.
USEFUL FOR

Statisticians, data analysts, and students studying probability theory, particularly those focusing on estimation techniques and Binomial distributions.

Scootertaj
Messages
97
Reaction score
0
1. Suppose X~B(5,p) and Y~(7,p) independent of X. Sampling once from each population gives x=3,y=5. What is the best (minimum-variance unbiased) estimate of p?

Homework Equations


P(X=x)=\binom{n}{x}p^x(1-p)^{n-x}

The Attempt at a Solution


My idea is that Maximum Likelihood estimators are unbiased, and have asymptotic variance = Cramer-Rao lower bound. Also, C-R lower bound = minimum variance of unbiased estimators.
So, since X and Y independent, X+Y \sim B(5+7,p)=B(12,p)
Thus, can we just compute the likelihood function and take the derivative?
L = \binom{12}{8}p^8(1-p)^4
\frac{dL}{dp} = 8p^7(1-p)^4 - 4p^8(1-p)^3
Thus, p=8/12=2/3

Is that legit?
 
Last edited:
Physics news on Phys.org
The method is OK as far as I know (not an expert on unbiased estimators), but I have a problem with this step:
Scootertaj said:
\frac{dL}{dp} = 8p^7(1-p)^4 - 4p^8(1-p)^3
Thus, p=8/12=3/4
That's not what I get by putting dL/dp =0.
 
\frac{dL}{dp} = 8p^7(1-p)^4 - 4p^8(1-p)^3=0
p^7(1-p)^3[8(1-p)]-4p]=0
8-8p-4p=0 ignoring p=0,p=1
8=12p \Rightarrow p=8/12=2/3

Did I miss something?
Note: I screwed up my fraction simplification previously if that's what you meant.
 
Scootertaj said:
I screwed up my fraction simplification previously if that's what you meant.
Yes.
 
Scootertaj said:
1. Suppose X~B(5,p) and Y~(7,p) independent of X. Sampling once from each population gives x=3,y=5. What is the best (minimum-variance unbiased) estimate of p?



Homework Equations


P(X=x)=\binom{n}{x}p^x(1-p)^{n-x}


The Attempt at a Solution


My idea is that Maximum Likelihood estimators are unbiased, and have asymptotic variance = Cramer-Rao lower bound. Also, C-R lower bound = minimum variance of unbiased estimators.
So, since X and Y independent, X+Y \sim B(5+7,p)=B(12,p)
Thus, can we just compute the likelihood function and take the derivative?
L = \binom{12}{8}p^8(1-p)^4
\frac{dL}{dp} = 8p^7(1-p)^4 - 4p^8(1-p)^3
Thus, p=8/12=2/3

Is that legit?

I disagree with your expression for L, but agree with the final answer you get later. You only get B(n,p)+B(m,p) = B(n+m,p) when you perform a convolution (that is, sum over all the outcomes for each term). In your case you just have one outcome from each binomial, so you should use
L = {5 \choose 3}p^3(1-p)^2 {7 \choose 5} p^5 (1-p)^2. Of course, this has the form ##c p^8(1-p)^4,## so will give the same result you got.
 
Ahh yes, I thought about that also. It's nice to see I wasn't off-base by considering that way.
Why does it not make statistical sense to use the L I suggested (which is based off the fact that X+Y~B(12,p) ?
 
Scootertaj said:
Ahh yes, I thought about that also. It's nice to see I wasn't off-base by considering that way.
Why does it not make statistical sense to use the L I suggested (which is based off the fact that X+Y~B(12,p) ?

I already gave the explanation, although it was very brief. I mean that P(X+Y=8) = P(X=1,Y=7) + P(X=2,Y=6) + P(X=3,Y=5) + P(X=4,Y=4)+ P(X=5,Y=3), a sum of 5 terms, but you have only the term P(X=3,Y=5). Each separate term has p^8 * (1-p)^4, but with different coefficients.
 
Last edited:

Similar threads

  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 5 ·
Replies
5
Views
1K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
15
Views
4K
Replies
1
Views
2K
Replies
4
Views
2K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 13 ·
Replies
13
Views
2K