How Can I Calculate the Probability in a Multi-Level Box System?

  • Context: Undergrad 
  • Thread starter Thread starter Unconscious
  • Start date Start date
  • Tags Tags
    Box Probability System
Click For Summary
SUMMARY

This discussion focuses on calculating the probability of achieving a sum greater than or equal to a threshold M in a multi-level box system, where each box contains a binary number with a probability p of being '1'. The formula used for this calculation is p' = ∑_{j=M}^N (N choose j) p^j (1-p)^{N-j}. The user implements a MATLAB script to compute the probabilities for various configurations, ultimately obtaining a result of 0.09562 for N = 8 and M = 4. The conversation also highlights the correlation between numbers in the blue boxes and suggests that for larger N, a Monte Carlo simulation may be a more efficient approach.

PREREQUISITES
  • Understanding of binomial probability distributions
  • Familiarity with MATLAB programming
  • Knowledge of combinatorial mathematics
  • Concept of Monte Carlo simulations
NEXT STEPS
  • Study binomial probability distributions in depth
  • Learn advanced MATLAB techniques for statistical simulations
  • Explore combinatorial optimization methods
  • Investigate Monte Carlo simulation applications in probability calculations
USEFUL FOR

Mathematicians, data scientists, and engineers interested in probability theory, statistical modeling, and those working with binary systems in computational environments.

Unconscious
Messages
77
Reaction score
12
Good morning,
I have a system that consists of a huge number of boxes, it is not important to know how many, each containing a binary number:

ba034c3043cfb69f4dada3bdd55d537394b5ed43_3_650.png

In every cell, the probability that there is a '1' is p, so the probability that there is a '0' is equal to (1-p).
What I do is take them in groups of N and add them up. For each of these sums, I calculate the probability that it is at least M.
So:
aa390b34ccb6157cb8244f28122afbd9990f5d01_3_650.png

if I do not make errors in the reasoning, the probability p' that the generic blue box contains a number greater than M is:

## p' = \sum_{j=M}^N \binom{N}{M}p^j (1-p)^{N-j} ##

In each of the blue boxes, I insert a '1' if the threshold M has been exceeded (or equaled), otherwise '0'. For example, in the previous drawing (where N = 3) if I use M = 2:

376728bb5ba916090e4a2ba81eea593af8764d83_3_650.png


At this point I take another step, and this is where my doubts come from.
I again cover the blue boxes in groups of N, separately though:

d9791f7cd205e5959466de0a849df3dec31cf236_3_650.png

I want to calculate the probability p' that is the probability that in the generic red box there is a number greater than or equal to 1. Surely I can write it like this:

d6e1645ecc8c9b92a3658a688654e50b.png


where obviously (1-p'') is the probability that in the generic red box there is a '0'. This is the probability that in N consecutive blue boxes there is always '0'. In my opinion, I cannot simply say that
d7d0690f37c92a7c9df4e7c7f3238133.png
because it does not seem to me that the events are independent.

I thought then to 'count' all the cases in my favor, using only the elementary events corresponding to the black boxes of the first row. Since this count is not at all trivial, at least for me, I had Matlab run it, with this code:

Code:
M = 4 ;
N = 8 ;
P_in = 1e-2 ;

M_results = zeros(2^(2*N-1),2*N-1);
M_prob = zeros(2^(2*N-1),2*N-1);
for i=2:2*N
    M_results(:,i-1)=repmat([ zeros(2^(2*N-i),1) ; ones(2^(2*N-i),1) ],2^(i-2),1);
    M_prob(:,i-1)=repmat([ (1-P_in)+zeros(2^(2*N-i),1) ; P_in*ones(2^(2*N-i),1) ],2^(i-2),1);
end

Probs = prod(M_prob,2);
P = 0;

for k=1:2^(2*N-1)
bool = 1;
    for j=1:N-1
        bool = bool && sum(M_results(k,j:N+(j-1)))<M;
    end
    if (bool)
        P = P + Probs(k);
    end
end

1-P

The result provided is 0.09562, in the case where N = 8 and M = 4.
I'd be curious to know an opinion on this calculation, do you think I made a mistake or is it all correct?

If you need further clarifications, I am obviously available to provide them.

Thanks in advance.
 
Last edited:
Physics news on Phys.org
The numbers in the blue box are correlated, indeed. If N is not too big the easiest approach is to go through all options, as you did. For larger N a Monte Carlo approach might be better.
 
Ok, thank you.
 

Similar threads

  • · Replies 29 ·
Replies
29
Views
6K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 13 ·
Replies
13
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 4 ·
Replies
4
Views
1K
  • · Replies 10 ·
Replies
10
Views
1K
  • · Replies 13 ·
Replies
13
Views
2K