Uncertainty for p = 0 for binomial distribution?

Click For Summary

Discussion Overview

The discussion revolves around estimating the uncertainty of a probability value derived from a binomial distribution, specifically when the observed successes are zero. Participants explore various methods for calculating this uncertainty and the implications of their estimates.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant estimates the probability p as 0 based on their data and seeks a method to calculate uncertainty, proposing a total of 40 trials with one assumed success.
  • Another participant suggests using the binomial probability mass function (PMF) to calculate the probability of observing zero successes and discusses the variance and confidence intervals.
  • A later reply questions the validity of the initial integral approach and suggests using a Poisson approximation for better handling of the situation where p is near zero.
  • One participant proposes a method to find a confidence interval for p by solving an integral equation, leading to a specific confidence level for p.
  • Another participant corrects a misunderstanding regarding the integral and emphasizes the need to consider non-zero values for x in the context of the PMF.
  • A participant introduces the Wilson interval as a solution for estimating uncertainty in binomial problems, particularly for cases with zero successes, and references relevant literature.
  • One participant reiterates the initial uncertainty calculation and suggests a confidence interval based on the PMF, indicating a specific confidence level for p.

Areas of Agreement / Disagreement

Participants express differing views on the appropriate methods for calculating uncertainty and confidence intervals for the probability p. There is no consensus on a single approach, and various models and techniques are discussed without resolution.

Contextual Notes

Participants acknowledge limitations in their approaches, including assumptions about the distribution and the nature of the data. The discussion highlights the complexity of estimating probabilities when observed successes are zero.

Who May Find This Useful

This discussion may be useful for statisticians, researchers, and students dealing with binomial distributions, particularly in contexts where zero successes are observed and uncertainty estimation is required.

apsiloritis
Messages
5
Reaction score
0
I have some data (4 runs each of about 10 trials) which is binomial with n_hits/N_trials

n/N = 0/11, 0/9, 0/10, 0/10

So, I estimate the probability p = n/N = 0
But how can I calculate an uncertainty on this value?

I thought to try
total N_tot=40 and n_tot=1, so p_tot=1/40 = 0.025
(i.e., assume one of the trials happened to be successful instead of not)

Then s = sqrt[ 0.025 * (1-0.025)/40] = 0.0247 (approx. 1/40)

is this a valid way of doing this? Or should I be looking at some confidence test?

Hope someone can help! Thanks.
 
Physics news on Phys.org
apsiloritis said:
I have some data (4 runs each of about 10 trials) which is binomial with n_hits/N_trials

n/N = 0/11, 0/9, 0/10, 0/10

So, I estimate the probability p = n/N = 0
But how can I calculate an uncertainty on this value?

I thought to try
total N_tot=40 and n_tot=1, so p_tot=1/40 = 0.025
(i.e., assume one of the trials happened to be successful instead of not)

Then s = sqrt[ 0.025 * (1-0.025)/40] = 0.0247 (approx. 1/40)

is this a valid way of doing this? Or should I be looking at some confidence test?

If you're asking what the probability of x=0 is given a binomial distribution, the calculation (using the binomial PMF) reduces to [tex](1-p)^n[/tex]. The variance is np(1-p). Using the square root of the variance, you can construct the right confidence interval using a Z score value if the distribution is reasonably symmetric. Otherwise you need to do a Poisson approximation where np=[tex]\lambda[/tex] and the sd estimate is [tex]\sqrt\lambda[/tex]
 
Last edited:
SW VandeCarr said:
If you're asking what the probability of x=0 is given a binomial distribution, the calculation (using the binomial PMF) reduces to [tex](1-p)^n[/tex]. The variance is np(1-p). Using the square root of the variance, you can construct the right confidence interval using a Z score value if the distribution is reasonably symmetric. Otherwise you need to do a Poisson approximation where np=[tex]\lambda[/tex] and the sd estimate is [tex]\sqrt\lambda[/tex]

Thanks for your reply: I've been mulling over your suggestions.

One problem I have is that I don't know what the population p is, I estimate p = 0 from the results. I know my distribution is totally asymmetric: I know that p[tex]\geq[/tex]0. The poisson approximation is asymmetric, so I see how that could work.

But... I have only used t-test for comparing normal distributions, so not sure how to apply it properly here. I'm hoping to say something like "My estimated value of p is 0, and I am confident (to say 90%) that the true value of p is less than Y".
 
OK, how about this... I'd be very grateful if someone could check this.

p is the true (population) probability. Given that, with binomial, prob. that I measure k = 0 from N trials is [tex](1-p)^N[/tex].

We know [tex]0\leq p\leq 1[/tex]. So an experimental measurement of p_exp = 0 could have come from any value of p in that range.

So I know [tex]\int_0^1 (1-p)^N dp=1=1/(N+1)[/tex]

Then I need to solve [tex](N+1)\int_0^a (1-p)^N dp=b[/tex] for a, where b is the confidence I choose, say b = 0.9.

For N = 50, I find a = 0.044, so I'm 90% confident [tex]p\leq 0.044[/tex]

and clearly we check for b = 1, a = 1, i.e., must be 100% confident that [tex]p\leq 1[/tex].
 
apsiloritis said:
So I know [tex]\int_0^1 (1-p)^N dp=1=1/(N+1)[/tex]

That's wrong.[tex]P(x=0)=(1-p)^N[/tex]

For P=1 you must sum over the full PMF, not the special case where P(x)=0. That's just a point for which the integral is 0. You have to choose a non zero value for x (or k since you working with a discrete distribution). Since you are actually looking for the expectation to be very close to zero, I would use the Poisson approximation to the binomial.
 
Last edited:
SW VandeCarr said:
That's wrong.[tex]P(x=0)=(1-p)^N[/tex]

For P=1 you must sum over the full PMF, not the special case where P(x)=0. That's just a point for which the integral is 0. You have to choose a non zero value for x (or k since you working with a discrete distribution). Since you are actually looking for the expectation to be very close to zero, I would use the Poisson approximation to the binomial.

Sorry, I meant to write [tex]\int_0^1 (1-p)^N dp = 1/(N+1)[/tex], where N is fixed (total number of trials). The integrand looks just like an exp. decay exp(-Np), i.e., Poisson function, as you pointed out.

I can treat the integrand like a density function, and want to find the value of p = a, which accounts for 90% of the value of this integral. This is equivalent to saying that if I do measure p_exp = 0, then there's a 90% chance that it happened because the true value of [tex]p \leq a[/tex].

So I renormalize the function in the integrand to [tex](N+1)(1-p)^N[/tex], and solve the equation [tex]\int_0^a (N+1)(1-p)^N dp = 0.9[/tex] to find a.

The method actually gives a value very close to the original approx. solution in my first post. For single s.d., confidence=68% and for N=40, I calculate a = 0.0274 (answer above was 0.0247).

What do you think?
 
apsiloritis said:
Sorry, I meant to write [tex]\int_0^1 (1-p)^N dp = 1/(N+1)[/tex], where N is fixed (total number of trials). The integrand looks just like an exp. decay exp(-Np), i.e., Poisson function, as you pointed out.

I can treat the integrand like a density function, and want to find the value of p = a, which accounts for 90% of the value of this integral. This is equivalent to saying that if I do measure p_exp = 0, then there's a 90% chance that it happened because the true value of [tex]p \leq a[/tex].

So I renormalize the function in the integrand to [tex](N+1)(1-p)^N[/tex], and solve the equation [tex]\int_0^a (N+1)(1-p)^N dp = 0.9[/tex] to find a.

The method actually gives a value very close to the original approx. solution in my first post. For single s.d., confidence=68% and for N=40, I calculate a = 0.0274 (answer above was 0.0247).

What do you think?

If the probability of 0 is 0.9, then P(>0) is 0.1. So are you saying that lambda = a = 0.0247? In that case, it makes sense since lambda will be small, but I can't say whether or not it's correct.
 
Last edited:
For those that may be interested, the solution to this problem is to use the Wilson interval. Details are available in the following paper. This interval can be calculated for any value of n and k, including for k = 0 (which was what I was interested in).

http://projecteuclid.org/euclid.ss/1009213286"

Note that the textbook interval [tex]\sqrt{np(1-p)}[/tex] is called the Wald interval and is generally acknowledged to be the wrong thing to use for any binomial problem!

The paper also discusses other solutions, and it is worthwhile looking at the paper by Agresti and Coull for a readable discussion of these.

Cheers,
A
 
Last edited by a moderator:
apsiloritis said:
I have some data (4 runs each of about 10 trials) which is binomial with n_hits/N_trials

n/N = 0/11, 0/9, 0/10, 0/10

So, I estimate the probability p = n/N = 0
But how can I calculate an uncertainty on this value?

I thought to try
total N_tot=40 and n_tot=1, so p_tot=1/40 = 0.025
(i.e., assume one of the trials happened to be successful instead of not)

Then s = sqrt[ 0.025 * (1-0.025)/40] = 0.0247 (approx. 1/40)

is this a valid way of doing this? Or should I be looking at some confidence test?

Hope someone can help! Thanks.

The 95% confidence would be [0,p1] where p1 is the solution to (1-p1)^40=0.95.
 

Similar threads

  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
Replies
1
Views
3K
  • · Replies 0 ·
Replies
0
Views
2K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 12 ·
Replies
12
Views
3K
  • · Replies 5 ·
Replies
5
Views
3K