Applying Bayesian Inference to Test Hypothesis on 100 Samples of Random Numbers

AI Thread Summary
Bayesian inference can be applied to test hypotheses about biases in random number samples by defining a probability distribution for the bias. In this context, bias can be expressed as the probability of a number exceeding a user-defined threshold, such as 0.15 for numbers 5 and 7. The analysis often involves Monte Carlo simulations to estimate the probabilities of these biases over different sample sizes. Unlike traditional statistics, Bayesian methods provide a probability of a hypothesis rather than a definitive yes-or-no answer. Clear definitions and precise examples are crucial for effective Bayesian analysis, as it requires translating ambiguous concepts into mathematical terms.
scalpmaster
Messages
35
Reaction score
0
how is bayesian inference actually applied?
Say I have (100samples) a series of random numbers between 1 to 10.
How do I test for the hypothesis that "there is a bias for the numbers 5,7" ?
 
Physics news on Phys.org
scalpmaster said:
how is bayesian inference actually applied?
Say I have (100samples) a series of random numbers between 1 to 10.
How do I test for the hypothesis that "there is a bias for the numbers 5,7" ?

The strength of Bayesian analysis is that it encourages experts to use their knowledge instead of leaving out details of the problem in order to fit it into some textbook type of exercise. If this is a real world problem, you have to consider what you know about the causes of the bias or examples of other series of numbers where you understand the bias.

If you are making the problem up merely to work an example of Bayesian analysis, then we can consider how to define a probability distribution on "bias". You could use a "probability distribution of probability distributions". For example, let p_i be the probability of the number i and assume that any vector of probabilities p_1, p_2,...p_n is equally likely, subject to the condition that the probabilities add to 1 and are each between 0 and 1. If you are trying to make a "yes or no" judgment on "bias", you have to define what that means. For example, does a "bias" in favor of 5 mean that p_5 was at least 0.15?.

We can discuss this further if you can refine your goals. It's often most convenient to do Bayesian analysis by Monte-Carlo simulations.
 
Stephen Tashi said:
If you are making the problem up merely to work an example of Bayesian analysis, then we can consider how to define a probability distribution on "bias". You could use a "probability distribution of probability distributions". For example, let p_i be the probability of the number i and assume that any vector of probabilities p_1, p_2,...p_n is equally likely, subject to the condition that the probabilities add to 1 and are each between 0 and 1. If you are trying to make a "yes or no" judgment on "bias", you have to define what that means. For example, does a "bias" in favor of 5 mean that p_5 was at least 0.15?.
We can discuss this further if you can refine your goals. It's often most convenient to do Bayesian analysis by Monte-Carlo simulations.

This is a basic example(1to10) just to try Bayesian analysis.
Let bias be expressed as Pi>Pb, where Pb is user defined, e.g 0.15
The goal is simply to find out if there was user defined level of bias for certain numbers in different historical sample set sizes, e.g last 30draws, last 70draws, last 100draws, etc besides merely looking at the corresponding frequencies for each number for each timeframe.
I.e. At which timeframe, there was most number of bias shown even though in the long run Pi for all numbers converges to 0.1?
 
Last edited:
Neither Bayesian or non-Bayesian statistics gives you a definite yes-or-no answer to most problems. In Bayesian statistics, the answer to the question "Is P_5 > 0.15" will have a certain probability. (It's a "probability of a probability" in this case, which might be a confusing thought.)

In non-Bayesian statistics, you would assume a definite distribution for the numbers, you would compute the probability of observing your data and you would set some abritary limit on how improbable the data would be in order to "reject" or "accept" your original assumption.

When you start mentioning varying sample sizes and "trends", you are getting into complications that you need to be clear about. People who look for "trends" in data can often fool themselves. Are you assuming the "bias" varies over time?

If you are looking for an example to use to understand Bayesian analysis then you must define a specific example and do so precisely. Bayesian analysis doesn't do a translation from the ambiguous language of everyday speech into mathematics. The user of Bayesian analysis must do that.
 
I was reading documentation about the soundness and completeness of logic formal systems. Consider the following $$\vdash_S \phi$$ where ##S## is the proof-system making part the formal system and ##\phi## is a wff (well formed formula) of the formal language. Note the blank on left of the turnstile symbol ##\vdash_S##, as far as I can tell it actually represents the empty set. So what does it mean ? I guess it actually means ##\phi## is a theorem of the formal system, i.e. there is a...

Similar threads

Replies
13
Views
6K
Replies
5
Views
2K
Replies
2
Views
2K
Replies
7
Views
2K
Replies
30
Views
2K
Back
Top