Regarding indicators in Statistics.

  • Thread starter Thread starter peripatein
  • Start date Start date
  • Tags Tags
    Statistics
peripatein
Messages
868
Reaction score
0
Hi,

Homework Statement


n different balls are distributed independently between m boxes with unlimited capacity each. I am asked to find the expectation and variance of the number of empty boxes.


Homework Equations





The Attempt at a Solution


The probability of i-th box being empty at the end is (1-1/m)n. Ergo, E[Xi] = P(Xi) = (1-1/m)n. Hence, E[X] = m(1-1/m)n.
As for the Variance, I used Var(Xi) = E[Xi](1-E[Xi])=(1-1/m)n(1-(1-1/m)n). Therefore, Var(X) = m*Var(Xi).
Is that correct?
 
Physics news on Phys.org
I am wondering why no one has yet replied. Is my formulation inappropriate/incomprehensible?
 
If the Xi were independent, then your variance would be correct. They aren't though, so you will have terms like E(Xi Xj) that you will have to deal with (luckily it's not too hard to calculate these) when you expand E(X2)
 
So should it be:
m(1-1/m)n(1-(1-1/m)n) + 2m(1-2/m)n?
(I used the expression for the Covariance)
 
peripatein said:
I am wondering why no one has yet replied. Is my formulation inappropriate/incomprehensible?

Not incomprehensible, just sloppy and incomplete. What is the meaning of ##X_i##? If you mean that ##X_i = 1## if box i is empty and ##X_i = 0## if box i is not empty, then you should say so. Also, the notation ##P(X_i)## is meaningless; if you mean ##P(X_i = 1)## you should write that.
 
I am sorry, I was using my mobile to post that one. I did mean everything you thought I might have meant.
I'd very much appreciate it if you could comment on my attempt at solution now.
Actually, both this one and the Statistics problem I posted earlier.
 
I'd truly appreciate some feedback on my recent attempt at solution, namely:
The probability of the i-th box being empty at the end is (1-1/m)n. Ergo, E[Xi] = P(Xi) = (1-1/m)n. Hence, E[X] = m(1-1/m)n.
As for the Variance, I used the following:
Var(X) = m*V(Xi) + 2*SIGMA(where i<j)*Cov(Xi,Xj) = m(1-1/m)n(1-(1-1/m)n) + mC2*(1-2/m)n
I am not sure this is correct but would certainly appreciate any comments.
 

Similar threads

Back
Top