Calculating Expected Collisions: Probability Question on Keys and Boxes

  • Context: Graduate 
  • Thread starter Thread starter economist13
  • Start date Start date
  • Tags Tags
    Probability
Click For Summary
SUMMARY

The discussion focuses on calculating the expected number of collisions when distributing r keys into k boxes, where each key is placed in box i with probability pi. The initial approach incorrectly suggests that the expected number of collisions is r - k, which is not valid when the number of boxes exceeds the number of keys. The correct method involves using an indicator function to determine the expected value of Y, defined as Y = max(X - 1, 0), where X is the number of keys in a box. This approach accurately accounts for the scenario of multiple keys being placed in the same box.

PREREQUISITES
  • Understanding of probability theory, specifically expected values.
  • Familiarity with indicator functions in probability.
  • Basic knowledge of random variables and their distributions.
  • Concept of collisions in probabilistic models.
NEXT STEPS
  • Study the application of indicator functions in probability theory.
  • Learn about expected value calculations for random variables.
  • Explore the concept of collisions in probability, particularly in the context of the "balls and bins" problem.
  • Investigate advanced probability topics such as the Law of Large Numbers and its implications for collision probabilities.
USEFUL FOR

Mathematicians, statisticians, computer scientists, and anyone interested in probability theory and its applications in collision analysis.

economist13
Messages
13
Reaction score
0
A total of r keys are to be put one at a time, in k boxes, with each key
independently being put in box i with probability pi ,
∑pi = 1. Each time a key is put in a nonempty box, we say that a collision occurs. Find
the expected number of collisions.

My professor hinted that we should use an indicator function, but I went about it a different way...(which I know to be wrong)

let X be the number of keys put into box i,

then E[X] = rpi. Since any key after key 1 will result in a collision, the rpi - 1 represents the expected number of collision for box i. then ∑(rpi -1) for all i and we get then that the total expected number of collisions is r-k.

Now, I know that cannot be correct, simply by thinking of the case where we have more boxes than keys (we cannot have negative collisions)

any hints?? what indicator function should I focus on?
 
Physics news on Phys.org
economist13 said:
let X be the number of keys put into box i

one way is to determine E[Y] where Y=max(X-1,0)
 

Similar threads

  • · Replies 29 ·
Replies
29
Views
6K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 7 ·
Replies
7
Views
1K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K