Coefficient of correletion problem

  • Thread starter Thread starter ENgez
  • Start date Start date
  • Tags Tags
    Coefficient
AI Thread Summary
The discussion centers on calculating the coefficient of correlation between the number of red balls (R) and black balls (B) drawn from a vase containing red, white, and black balls. The correlation is defined as the covariance of R and B divided by the product of their standard deviations. Participants express challenges in calculating the covariance, particularly the expectation E[RB], and explore various methods, including conditional probability and multinomial distributions. There is confusion regarding the application of the smoothing theorem and the independence of probabilities in the context of the problem. The conversation highlights the complexities of deriving E[RB] and the impact of the presence of white balls on the calculations.
ENgez
Messages
69
Reaction score
0
a vase contains one red ball two white balls and three black balls. n balls are take out of the vase and (each ball returned to it afterwards). let B denote the number of black balls taken out and R denote the number of red balls taken out. what is the coefficient of correlation between R and B?

well i know that coeffieicnt of correlation = \frac{cov(R,B)}{\sigma R*\sigma B}
and that R and B are simply bernouli trials with x and n-x trials respectivly.

my problem is calculating the covariance = E[RB] -E[R]E (E is the mean).
E[R] and E are straightforward but E[RB] is a bit trickier for me.

i thought of using the definition of the mean with a multinomial vector for the shared probability for R and B and summing over 0<x<n, but is there an easier way?
 
Physics news on Phys.org
It would be easy if it weren't for the white balls... As it is, I don't see any way of doing it short of writing out the sum.
 
The number of red balls is R = sum_{i=1}^n I[ball i is red] where I is the indicator function so to calculate E[RB] you just need to expand the product.
 
bpet, i am sorry, but can you elaborate?

and i thought of using the smoothing theorom,
Due to the fact that it is easier to think in terms of conditional probability in this problem.

E[RB]=E[E[RB|B]]=E[B*E[R|B]=E[B*(n-B)*P(R)]

=P(R)*(nE-E[B^2])

B is distributed binomially with n trials and probability P(B).

the problem here is that i get a different answer for E[E[RB|R]]
what am i doing wrong?
 
sry for the double post but believe i understood the part about expanding the product :).

tell me if this is correct

R = R1+R2 +R3 ... + Rn

where:
R = 1 , p=1/6
R = 0 , p=5/6

and likewise for B

E[RB] = E[(R1+R2+...+Rn)(B1+B2+...+Bn)= (n^2-n)*P(B)*P(R)]
Ri*Bi is always 0 becuase one being 1 implies the other being 0.

E[RB]-E[R]E<b> = (n^2-n)*P(B)*P(R)-n^2*P(B)*P(R) =- n*P(R)*P(B) = -n/12</b>
 
Last edited:
ENgez said:
E[RB]=E[E[RB|B]]=E[B*E[R|B]=E[B*(n-B)*P(R)]

=P(R)*(nE-E[B^2])


If you change the RHS to E[B*(n-B)*P(R|B)] does that work?
 
i don't see how P(R|B) changes anything becuase (n-B)*P(R) is the mean of R|B (where B=s , 0<s<n).
P(R) is the probability that in a given "trial" (pulling a ball out) you get red, which is independent from the amount of black balls you took out.
 
Back
Top