Coefficient of correletion problem

  • Context: Graduate 
  • Thread starter Thread starter ENgez
  • Start date Start date
  • Tags Tags
    Coefficient
Click For Summary

Discussion Overview

The discussion revolves around calculating the coefficient of correlation between the number of red balls (R) and the number of black balls (B) drawn from a vase containing red, white, and black balls. Participants explore the mathematical formulation of covariance and expectations in the context of this probability problem, which involves Bernoulli trials and conditional probabilities.

Discussion Character

  • Mathematical reasoning
  • Technical explanation
  • Debate/contested

Main Points Raised

  • One participant states the formula for the coefficient of correlation and expresses difficulty in calculating the covariance, particularly E[RB].
  • Another participant suggests that the presence of white balls complicates the calculation, indicating a need for a more detailed summation approach.
  • A different participant proposes using the indicator function to express R and suggests that expanding the product could simplify the calculation of E[RB].
  • One participant introduces the smoothing theorem and conditional probability to approach E[RB], but expresses confusion over discrepancies in results when comparing E[E[RB|R]] and E[E[RB|B]].
  • Another participant attempts to clarify their understanding of the expansion of E[RB] and presents a calculation involving probabilities and expectations, leading to a negative result for the covariance.
  • A later reply questions the impact of changing the conditional probability in the expression for E[RB], suggesting that the independence of R and B may not be affected.

Areas of Agreement / Disagreement

Participants express differing views on the best approach to calculate E[RB] and the implications of conditional probabilities. There is no consensus on the correct method or final result, as participants continue to challenge and refine their calculations.

Contextual Notes

Participants note the complexity introduced by the presence of white balls and the need for careful consideration of conditional probabilities. There are unresolved mathematical steps and assumptions regarding the independence of R and B.

ENgez
Messages
69
Reaction score
0
a vase contains one red ball two white balls and three black balls. n balls are take out of the vase and (each ball returned to it afterwards). let B denote the number of black balls taken out and R denote the number of red balls taken out. what is the coefficient of correlation between R and B?

well i know that coeffieicnt of correlation = [itex]\frac{cov(R,B)}{\sigma R*\sigma B}[/itex]
and that R and B are simply bernouli trials with x and n-x trials respectivly.

my problem is calculating the covariance = E[RB] -E[R]E (E is the mean).
E[R] and E are straightforward but E[RB] is a bit trickier for me.

i thought of using the definition of the mean with a multinomial vector for the shared probability for R and B and summing over 0<x<n, but is there an easier way?
 
Physics news on Phys.org
It would be easy if it weren't for the white balls... As it is, I don't see any way of doing it short of writing out the sum.
 
The number of red balls is R = sum_{i=1}^n I[ball i is red] where I is the indicator function so to calculate E[RB] you just need to expand the product.
 
bpet, i am sorry, but can you elaborate?

and i thought of using the smoothing theorom,
Due to the fact that it is easier to think in terms of conditional probability in this problem.

E[RB]=E[E[RB|B]]=E[B*E[R|B]=E[B*(n-B)*P(R)]

=P(R)*(nE-E[B^2])

B is distributed binomially with n trials and probability P(B).

the problem here is that i get a different answer for E[E[RB|R]]
what am i doing wrong?
 
sry for the double post but believe i understood the part about expanding the product :).

tell me if this is correct

[itex]R = R1+R2 +R3 ... + Rn[/itex]

where:
[itex]R = 1 , p=1/6[/itex]
[itex]R = 0 , p=5/6[/itex]

and likewise for B

[itex]E[RB] = E[(R1+R2+...+Rn)(B1+B2+...+Bn)= (n^2-n)*P(B)*P(R)][/itex]
Ri*Bi is always 0 becuase one being 1 implies the other being 0.

[itex]E[RB]-E[R]E<b> = (n^2-n)*P(B)*P(R)-n^2*P(B)*P(R) =- n*P(R)*P(B) = -n/12</b>[/itex]
 
Last edited:
ENgez said:
E[RB]=E[E[RB|B]]=E[B*E[R|B]=E[B*(n-B)*P(R)]

=P(R)*(nE-E[B^2])


If you change the RHS to E[B*(n-B)*P(R|B)] does that work?
 
i don't see how P(R|B) changes anything becuase (n-B)*P(R) is the mean of R|B (where B=s , 0<s<n).
P(R) is the probability that in a given "trial" (pulling a ball out) you get red, which is independent from the amount of black balls you took out.
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 16 ·
Replies
16
Views
3K
  • · Replies 11 ·
Replies
11
Views
4K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 18 ·
Replies
18
Views
4K