How do I express that a 100% occurrence in a small sample is low "confidence"?

Click For Summary

Discussion Overview

The discussion revolves around how to quantify the confidence associated with observing a 100% occurrence rate in small sample sizes compared to larger ones, specifically in the context of statistical analysis and confidence intervals. Participants explore both frequentist and Bayesian approaches to express confidence levels in experimental results.

Discussion Character

  • Exploratory
  • Technical explanation
  • Mathematical reasoning

Main Points Raised

  • Some participants note that while both experiments yield a 100% frequency and a 95% confidence interval of (1, 1), the intuition suggests that the result from a larger sample (experiment B) is more reliable than that from a smaller sample (experiment A).
  • One participant proposes calculating the probability that the observed results are due to chance under a null hypothesis, suggesting a frequency of 50% as a baseline.
  • Another participant suggests performing a Bayesian analysis, indicating that the 95% credible interval would differ between the two experiments, being broader for the smaller sample and narrower for the larger sample.
  • A participant explains that using a beta distribution as a conjugate prior for a binomial random variable allows for the calculation of credible intervals, providing specific intervals for both experiments based on observed successes and failures.
  • One participant expresses satisfaction with the beta distribution approach, sharing their development of a Python function to visualize frequency and credibility across varying sample sizes.
  • Another participant reflects on the intuitive expectations of confidence levels, suggesting that with only two observations, one might reasonably assert a probability greater than 30%, while with 100 observations, a probability greater than 96% seems reasonable.
  • A later reply connects the Bayesian approach to Laplace's formula, discussing its application to estimating probabilities based on past observations.

Areas of Agreement / Disagreement

Participants do not reach a consensus on a single method for expressing confidence levels, as multiple approaches (frequentist and Bayesian) are discussed, and differing intuitions about confidence in small versus large samples are expressed.

Contextual Notes

Limitations include the dependence on the choice of null hypothesis and the assumptions underlying the statistical methods discussed. The discussion also highlights the unresolved nature of how best to quantify confidence in the context of varying sample sizes.

Archosaur
Messages
333
Reaction score
4
TL;DR
How do I express that a 100% frequency occurrence in a small sample is low "confidence", when, strictly speaking, its 95% confidence interval is (1,1)?
In experiment A: I observe an event 2 times in 2 trials.
In experiment B: I observe an event 100 times in 100 trials.

In both cases, I calculate a frequency of 100%
In both cases, I calculate a 95% confidence interval of (1, 1).

But intuitively the result of experiment B is "stronger" than that of A. How can I express this as a number?
 
Physics news on Phys.org
Archosaur said:
TL;DR Summary: How do I express that a 100% frequency occurrence in a small sample is low "confidence", when, strictly speaking, its 95% confidence interval is (1,1)?

In experiment A: I observe an event 2 times in 2 trials.
In experiment B: I observe an event 100 times in 100 trials.

In both cases, I calculate a frequency of 100%
In both cases, I calculate a 95% confidence interval of (1, 1).

But intuitively the result of experiment B is "stronger" than that of A. How can I express this as a number?

Assume a null hypothesis of whatever frequency you think is appropriate. 50% maybe. Then calculate the probability that such an experimental result is due to chance, ie. that your null hypothesis is true. This will usually be very close to zero in the second case.
 
  • Like
Likes   Reactions: Agent Smith
Or you could do a Bayesian analysis and the 95% credible interval would not be (1,1) in either case, but it would be quite broad in the low data case and quite narrow in the high data case
 
In the Bayesian case a beta distribution is the conjugate prior for a binomial random variable. The posterior is ##\beta(a+1,b+1)## where ##a## is the number of successes observed and ##b## is the number of failures observed.

From that you can calculate the credible interval. For ##(a=2,b=0)## we find that the 95% credible interval for ##\beta(3,1)## is 0.368 to 1.000. In contrast, for ##(a=100,b=0)## we find that the 95% credible interval for ##\beta(101,1)## is 0.971 to 1.000
 
  • Like
Likes   Reactions: Agent Smith
This is awesome. Thanks very much for pointing me to the Beta distribution - this is exactly what I was looking for. I made a python function that calculated frequency and "credibility" (1 - width of 95% credible interval) for O observations in N trials up to 100, because I was curious what a credibility heatmap would look like in this space.
credibility.png
 
  • Like
Likes   Reactions: Agent Smith, berkeman and Dale
Excellent! Personally, I think that this behaves as a reasonable person would expect. With just 2 observations it seems reasonable to say “I am pretty sure the probability is greater than 30%”. And with 100 observations it also seems reasonable to say “I am pretty sure the probability is greater than 96%”.
 
  • Like
Likes   Reactions: Agent Smith
Dale said:
In the Bayesian case a beta distribution is the conjugate prior for a binomial random variable. The posterior is β(a+1,b+1) where a is the number of successes observed and b is the number of failures observed.
This looks like Laplace's "formula" which he invented to answer the question "what is the likelihood that the sun will rise tomorrow?" If out of ##b## observations, the sun rose ##a## of those times, ##\text{P(sun will rise tomorrow)} = \frac{a + 1}{b + 1}##
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 24 ·
Replies
24
Views
6K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 18 ·
Replies
18
Views
3K