Applying Bayesian Inference to Test Hypothesis on 100 Samples of Random Numbers

Click For Summary

Discussion Overview

The discussion revolves around the application of Bayesian inference to test a hypothesis regarding potential biases in a series of random numbers. Participants explore how to define and quantify bias using Bayesian methods, particularly in the context of a sample of 100 random numbers ranging from 1 to 10.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • Some participants inquire about the practical application of Bayesian inference in testing the hypothesis of bias towards specific numbers (5 and 7) in a sample of random numbers.
  • One participant emphasizes the importance of incorporating prior knowledge about potential biases and suggests defining a probability distribution for bias, indicating that any vector of probabilities should sum to 1 and lie between 0 and 1.
  • Another participant proposes that bias could be expressed as a condition where the probability of a number exceeds a user-defined threshold, such as 0.15, and discusses the need to analyze different historical sample sizes to identify when bias is most evident.
  • There is a mention that Bayesian statistics provides a probability of a probability rather than a definitive yes-or-no answer, contrasting it with non-Bayesian statistics that rely on fixed distributions and arbitrary limits for acceptance or rejection of hypotheses.
  • Concerns are raised about the complexities introduced by varying sample sizes and the potential for misinterpretation when looking for trends in data.
  • Participants note the necessity of clearly defining specific examples and translating ambiguous language into precise mathematical terms for effective Bayesian analysis.

Areas of Agreement / Disagreement

Participants express differing views on the application and interpretation of Bayesian inference, particularly regarding how to define and measure bias. There is no consensus on a specific approach or methodology, and the discussion remains unresolved.

Contextual Notes

Limitations include the need for clear definitions of bias and the challenges of translating everyday language into mathematical terms for Bayesian analysis. The discussion also highlights the complexities of analyzing trends over varying sample sizes.

scalpmaster
Messages
35
Reaction score
0
how is bayesian inference actually applied?
Say I have (100samples) a series of random numbers between 1 to 10.
How do I test for the hypothesis that "there is a bias for the numbers 5,7" ?
 
Physics news on Phys.org
scalpmaster said:
how is bayesian inference actually applied?
Say I have (100samples) a series of random numbers between 1 to 10.
How do I test for the hypothesis that "there is a bias for the numbers 5,7" ?

The strength of Bayesian analysis is that it encourages experts to use their knowledge instead of leaving out details of the problem in order to fit it into some textbook type of exercise. If this is a real world problem, you have to consider what you know about the causes of the bias or examples of other series of numbers where you understand the bias.

If you are making the problem up merely to work an example of Bayesian analysis, then we can consider how to define a probability distribution on "bias". You could use a "probability distribution of probability distributions". For example, let p_i be the probability of the number i and assume that any vector of probabilities p_1, p_2,...p_n is equally likely, subject to the condition that the probabilities add to 1 and are each between 0 and 1. If you are trying to make a "yes or no" judgment on "bias", you have to define what that means. For example, does a "bias" in favor of 5 mean that p_5 was at least 0.15?.

We can discuss this further if you can refine your goals. It's often most convenient to do Bayesian analysis by Monte-Carlo simulations.
 
Stephen Tashi said:
If you are making the problem up merely to work an example of Bayesian analysis, then we can consider how to define a probability distribution on "bias". You could use a "probability distribution of probability distributions". For example, let p_i be the probability of the number i and assume that any vector of probabilities p_1, p_2,...p_n is equally likely, subject to the condition that the probabilities add to 1 and are each between 0 and 1. If you are trying to make a "yes or no" judgment on "bias", you have to define what that means. For example, does a "bias" in favor of 5 mean that p_5 was at least 0.15?.
We can discuss this further if you can refine your goals. It's often most convenient to do Bayesian analysis by Monte-Carlo simulations.

This is a basic example(1to10) just to try Bayesian analysis.
Let bias be expressed as Pi>Pb, where Pb is user defined, e.g 0.15
The goal is simply to find out if there was user defined level of bias for certain numbers in different historical sample set sizes, e.g last 30draws, last 70draws, last 100draws, etc besides merely looking at the corresponding frequencies for each number for each timeframe.
I.e. At which timeframe, there was most number of bias shown even though in the long run Pi for all numbers converges to 0.1?
 
Last edited:
Neither Bayesian or non-Bayesian statistics gives you a definite yes-or-no answer to most problems. In Bayesian statistics, the answer to the question "Is P_5 > 0.15" will have a certain probability. (It's a "probability of a probability" in this case, which might be a confusing thought.)

In non-Bayesian statistics, you would assume a definite distribution for the numbers, you would compute the probability of observing your data and you would set some abritary limit on how improbable the data would be in order to "reject" or "accept" your original assumption.

When you start mentioning varying sample sizes and "trends", you are getting into complications that you need to be clear about. People who look for "trends" in data can often fool themselves. Are you assuming the "bias" varies over time?

If you are looking for an example to use to understand Bayesian analysis then you must define a specific example and do so precisely. Bayesian analysis doesn't do a translation from the ambiguous language of everyday speech into mathematics. The user of Bayesian analysis must do that.
 

Similar threads

  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 30 ·
2
Replies
30
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 3 ·
Replies
3
Views
849
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 11 ·
Replies
11
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 10 ·
Replies
10
Views
2K