Having trouble understaning the notation co-variances

  • Context: Undergrad 
  • Thread starter Thread starter Rabolisk
  • Start date Start date
  • Tags Tags
    Notation
Click For Summary
SUMMARY

The discussion focuses on the formula for covariance, specifically Cov(X,Y) = [X - E(X)][Y - E(Y)], and its implications under ordinary least squares (OLS) assumptions. Participants clarify that if X and Y are matrices, an additional factor of 1/N is necessary for accurate calculation. The conversation also draws parallels between vector lengths and variances of random variables, emphasizing the relationship between covariance and the cosine of the angle between vectors.

PREREQUISITES
  • Understanding of covariance and its mathematical representation
  • Familiarity with ordinary least squares (OLS) regression assumptions
  • Knowledge of matrix operations and properties
  • Basic concepts of variance and standard deviation
NEXT STEPS
  • Review the Wikipedia page on covariance for detailed definitions and examples
  • Study the implications of OLS assumptions on covariance calculations
  • Learn about matrix algebra and its applications in statistics
  • Explore the relationship between covariance and correlation coefficients
USEFUL FOR

Statisticians, data analysts, and students studying regression analysis and covariance in statistical models.

Rabolisk
Messages
6
Reaction score
0
Can someone explain to me how
Cov(x,y)=[X-E(x))(Y-E(y)]

and this would equal 0 under the normal OLS assumption.


I know how to calculate the covariance, but X here is a matrix. So I don't understand the logic of this formula...
 
Physics news on Phys.org
Welcome to PF, Rabolisk! :smile:

Did you already look at the wiki page?
http://en.wikipedia.org/wiki/Covariance

Your formula does not seem to be quite right.
X and Y could be random variables, but in that case an extra E should be present in your formula (see wiki).

If your X and Y are matrices then your formula should have an extra 1/N in it (see wiki).
 
There is an interesting analogy between lengths-of-vectors and the variances-of-random-variables.

For vectors X and Y, we have
[itex]| X + Y |^2 = |X|^2 + |Y|^2 - 2|X||Y| \cos \theta[/itex] where [itex]\theta[/itex] is the angle between the vectors.

For random variables X and Y we have

variance(X+Y) = variance(X) + variance(Y) + 2 Covariance(X,Y)

The analogy is even better if we express it in standard deviations:

[itex]( std. dev(X+Y))^2 = (std. dev(X))^2 + (std. dev(Y))^2 + 2 Covariance(X,Y)[/itex]

The covariance is roughly analagous to the cosine term. The quantity that would be analagous to [itex]cos(\theta)[/itex] is [itex]\frac{- Covariance(X,Y) }{(std. dev(X)) (std. dev(Y))}[/itex].
 

Similar threads

  • · Replies 23 ·
Replies
23
Views
4K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 14 ·
Replies
14
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 43 ·
2
Replies
43
Views
6K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 1 ·
Replies
1
Views
5K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 24 ·
Replies
24
Views
3K