Regarding indicators in Statistics.

  • Thread starter Thread starter peripatein
  • Start date Start date
  • Tags Tags
    Statistics
Click For Summary

Homework Help Overview

The discussion revolves around a statistics problem involving the distribution of n different balls into m boxes and the calculation of the expectation and variance of the number of empty boxes.

Discussion Character

  • Mathematical reasoning, Assumption checking

Approaches and Questions Raised

  • The original poster attempts to calculate the expectation and variance of empty boxes using probabilities, but questions arise regarding the independence of the variables involved and the correct formulation of the variance.

Discussion Status

Some participants have provided feedback on the original poster's formulation, suggesting that while the approach may be on the right track, there are issues with the independence assumption and notation that need clarification. There is ongoing exploration of the correct variance calculation.

Contextual Notes

Participants have noted potential confusion in the original poster's notation and definitions, particularly regarding the meaning of the variables used and the implications of independence in the calculations.

peripatein
Messages
868
Reaction score
0
Hi,

Homework Statement


n different balls are distributed independently between m boxes with unlimited capacity each. I am asked to find the expectation and variance of the number of empty boxes.


Homework Equations





The Attempt at a Solution


The probability of i-th box being empty at the end is (1-1/m)n. Ergo, E[Xi] = P(Xi) = (1-1/m)n. Hence, E[X] = m(1-1/m)n.
As for the Variance, I used Var(Xi) = E[Xi](1-E[Xi])=(1-1/m)n(1-(1-1/m)n). Therefore, Var(X) = m*Var(Xi).
Is that correct?
 
Physics news on Phys.org
I am wondering why no one has yet replied. Is my formulation inappropriate/incomprehensible?
 
If the Xi were independent, then your variance would be correct. They aren't though, so you will have terms like E(Xi Xj) that you will have to deal with (luckily it's not too hard to calculate these) when you expand E(X2)
 
So should it be:
m(1-1/m)n(1-(1-1/m)n) + 2m(1-2/m)n?
(I used the expression for the Covariance)
 
peripatein said:
I am wondering why no one has yet replied. Is my formulation inappropriate/incomprehensible?

Not incomprehensible, just sloppy and incomplete. What is the meaning of ##X_i##? If you mean that ##X_i = 1## if box i is empty and ##X_i = 0## if box i is not empty, then you should say so. Also, the notation ##P(X_i)## is meaningless; if you mean ##P(X_i = 1)## you should write that.
 
I am sorry, I was using my mobile to post that one. I did mean everything you thought I might have meant.
I'd very much appreciate it if you could comment on my attempt at solution now.
Actually, both this one and the Statistics problem I posted earlier.
 
I'd truly appreciate some feedback on my recent attempt at solution, namely:
The probability of the i-th box being empty at the end is (1-1/m)n. Ergo, E[Xi] = P(Xi) = (1-1/m)n. Hence, E[X] = m(1-1/m)n.
As for the Variance, I used the following:
Var(X) = m*V(Xi) + 2*SIGMA(where i<j)*Cov(Xi,Xj) = m(1-1/m)n(1-(1-1/m)n) + mC2*(1-2/m)n
I am not sure this is correct but would certainly appreciate any comments.
 

Similar threads

  • · Replies 8 ·
Replies
8
Views
3K
Replies
0
Views
1K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 36 ·
2
Replies
36
Views
6K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 5 ·
Replies
5
Views
6K