Question about sample covariance matrix

sanctifier · May 23, 2012

Suppose [itex]X_1, X_2, \dots, X_N[/itex] are mutually independent random vectors. (By this I mean that there is a random vector [itex]X[/itex] whose components are random variables labeled by the component index, and each [itex]X_i[/itex] is a sample of [itex]X[/itex], i.e. a vector of constants that are possible values of those random variables.) The sample mean is [itex]\hat{M} = \frac{1}{N}\sum_{i=1}^{N} X_i[/itex], and the true mean of all the [itex]X_i[/itex] is [itex]M[/itex]. To estimate the covariance matrix of the [itex]X_i[/itex], we use the following formula:
[itex]\hat{\Sigma} = \frac{1}{N}\sum_{i=1}^{N} (X_i - \hat{M})(X_i - \hat{M})^T[/itex]
[itex]\quad = \frac{1}{N}\sum_{i=1}^{N} \left((X_i - M) - (\hat{M} - M)\right)\left((X_i - M) - (\hat{M} - M)\right)^T[/itex]
[itex]\quad = \frac{1}{N}\sum_{i=1}^{N} (X_i - M)(X_i - M)^T - (\hat{M} - M)(\hat{M} - M)^T[/itex]
My question is: why does the equality in the last step hold?
I did some work on this. First, I note that the transpose is a linear transformation, i.e., for two vectors [itex]V[/itex] and [itex]U[/itex], [itex](V + U)^T = V^T + U^T[/itex], so the following expansion is valid:
[itex](V - U)(V - U)^T = VV^T - VU^T - UV^T + UU^T[/itex]
Let [itex]V = X_i - M[/itex] and [itex]U = \hat{M} - M[/itex]. The terms missing in the last step of [itex]\hat{\Sigma}[/itex] are [itex]-VU^T[/itex] and [itex]-UV^T[/itex]. I know that the entries of [itex]E[VU^T][/itex] are the covariances between the components of [itex]X_i[/itex] and [itex]\hat{M}[/itex], and I assume they are all zero, so these two terms would drop out after taking the expectation of [itex]\hat{\Sigma}[/itex]. But in the last step they vanish before any expectation is taken. Why?
Finally, I also notice that the sign of [itex]UU^T = (\hat{M} - M)(\hat{M} - M)^T[/itex] has changed from + to -. How does this happen?

sanctifier · May 23, 2012

OK, if 1/N can be regarded as an approximate probability for each entry of [itex]VU^T[/itex], that could explain why [itex]-VU^T[/itex] and [itex]-UV^T[/itex] vanish without taking an expectation. But how can the sign change of [itex]UU^T[/itex] in the last step be explained?
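
A sketch of one way to see both points at once: because [itex]\frac{1}{N}\sum_{i=1}^{N}(X_i - M) = \hat{M} - M[/itex] and [itex]U = \hat{M} - M[/itex] does not depend on [itex]i[/itex], averaging each cross term over [itex]i[/itex] gives exactly [itex]-UU^T[/itex]. The three pieces involving [itex]U[/itex] then add up to [itex]-UU^T - UU^T + UU^T = -UU^T[/itex], so no expectation is needed and the sign flips. The identity is purely algebraic and should hold for every sample. The Python snippet below checks it numerically; the helper names, the value used for M, and the Gaussian samples are assumptions chosen only for illustration.

```python
import random


def outer(u, v):
    """Outer product u v^T of two vectors, returned as a list of rows."""
    return [[a * b for b in v] for a in u]


def sub(u, v):
    """Component-wise difference u - v."""
    return [a - b for a, b in zip(u, v)]


def mean_of_matrices(mats):
    """Entry-wise average of a list of equally sized matrices."""
    n = len(mats)
    rows, cols = len(mats[0]), len(mats[0][0])
    return [[sum(m[r][c] for m in mats) / n for c in range(cols)]
            for r in range(rows)]


def covariance_two_ways(samples, true_mean):
    """Return both sides of the last equality in the derivation."""
    n = len(samples)
    dim = len(true_mean)
    # Sample mean M_hat.
    m_hat = [sum(x[k] for x in samples) / n for k in range(dim)]
    # Left-hand side: average of (X_i - M_hat)(X_i - M_hat)^T.
    lhs = mean_of_matrices(
        [outer(sub(x, m_hat), sub(x, m_hat)) for x in samples])
    # Right-hand side: average of (X_i - M)(X_i - M)^T minus U U^T,
    # where U = M_hat - M.
    u = sub(m_hat, true_mean)
    first = mean_of_matrices(
        [outer(sub(x, true_mean), sub(x, true_mean)) for x in samples])
    uu = outer(u, u)
    rhs = [[first[r][c] - uu[r][c] for c in range(dim)] for r in range(dim)]
    return lhs, rhs


random.seed(0)
true_mean = [1.0, -2.0, 0.5]  # hypothetical M, chosen only for the demo
samples = [[random.gauss(m, 1.0) for m in true_mean] for _ in range(50)]

lhs, rhs = covariance_two_ways(samples, true_mean)
max_gap = max(abs(lhs[r][c] - rhs[r][c]) for r in range(3) for c in range(3))
print(max_gap)  # expected to be at floating-point rounding level
```

Passing any other vector in place of `true_mean` should leave the gap at rounding level as well, which fits the observation that the step uses only algebra and no statistical property of M.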

