Regarding indicators in Statistics.

  • Thread starter peripatein
  • Start date
  • #1
880
0
Hi,

Homework Statement


n different balls are distributed independently between m boxes with unlimited capacity each. I am asked to find the expectation and variance of the number of empty boxes.


Homework Equations





The Attempt at a Solution


The probability of i-th box being empty at the end is (1-1/m)n. Ergo, E[Xi] = P(Xi) = (1-1/m)n. Hence, E[X] = m(1-1/m)n.
As for the Variance, I used Var(Xi) = E[Xi](1-E[Xi])=(1-1/m)n(1-(1-1/m)n). Therefore, Var(X) = m*Var(Xi).
Is that correct?

Homework Statement





Homework Equations





The Attempt at a Solution

 

Answers and Replies

  • #2
880
0
I am wondering why no one has yet replied. Is my formulation inappropriate/incomprehensible?
 
  • #3
Office_Shredder
Staff Emeritus
Science Advisor
Gold Member
3,831
143
If the Xi were independent, then your variance would be correct. They aren't though, so you will have terms like E(Xi Xj) that you will have to deal with (luckily it's not too hard to calculate these) when you expand E(X2)
 
  • #4
880
0
So should it be:
m(1-1/m)n(1-(1-1/m)n) + 2m(1-2/m)n?
(I used the expression for the Covariance)
 
  • #5
Ray Vickson
Science Advisor
Homework Helper
Dearly Missed
10,706
1,728
I am wondering why no one has yet replied. Is my formulation inappropriate/incomprehensible?
Not incomprehensible, just sloppy and incomplete. What is the meaning of ##X_i##? If you mean that ##X_i = 1## if box i is empty and ##X_i = 0## if box i is not empty, then you should say so. Also, the notation ##P(X_i)## is meaningless; if you mean ##P(X_i = 1)## you should write that.
 
  • #6
880
0
I am sorry, I was using my mobile to post that one. I did mean everything you thought I might have meant.
I'd very much appreciate it if you could comment on my attempt at solution now.
Actually, both this one and the Statistics problem I posted earlier.
 
  • #7
880
0
I'd truly appreciate some feedback on my recent attempt at solution, namely:
The probability of the i-th box being empty at the end is (1-1/m)n. Ergo, E[Xi] = P(Xi) = (1-1/m)n. Hence, E[X] = m(1-1/m)n.
As for the Variance, I used the following:
Var(X) = m*V(Xi) + 2*SIGMA(where i<j)*Cov(Xi,Xj) = m(1-1/m)n(1-(1-1/m)n) + mC2*(1-2/m)n
I am not sure this is correct but would certainly appreciate any comments.
 

Related Threads on Regarding indicators in Statistics.

Replies
10
Views
6K
  • Last Post
Replies
0
Views
676
  • Last Post
Replies
8
Views
1K
  • Last Post
Replies
11
Views
2K
  • Last Post
Replies
1
Views
1K
  • Last Post
Replies
0
Views
1K
  • Last Post
Replies
2
Views
1K
  • Last Post
Replies
12
Views
1K
  • Last Post
Replies
1
Views
502
  • Last Post
2
Replies
27
Views
11K
Top