Variance captured in coordinate axis.

simpleton · Jul 26, 2013

Hi all,

Note: The text below is the motivation for my question. To jump to the question immediately, please skip to the line that says HI!

I have a set of data points, let's call it A, and I ran principal component analysis to get the top 3 principal components to be able to represent the data points as a 3D plot.

Now, I have another set of data points, let's call it B, and I want to see how B differs from A. To do so, I want to plot B along the top 3 principal components of A. However, this coordinate system may be unfair to B, because most of the variance of B may not be captured in the first 3 principal components of A. Therefore, I want to measure how much of the variance of B is captured in the first 3 principal components of A. Since the principal components of A may not be eigenvectors of the covariance matrix of B, I cannot simply read the variance off the corresponding eigenvalues (the squared singular values), as one does in PCA.

Therefore, my question is:

HI! <---- For those who have been reading this post in its entirety, please ignore this

Suppose you are given a matrix M of data points. How do you measure how much variance in the dataset is captured in a particular coordinate of M?

As an example, suppose all my points are of the form (a,1) for different values of a and all a are distinct. Then the first coordinate will capture 100% of the variance while the second coordinate will capture 0% of the variance.
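The example above can be checked numerically. The following is a minimal sketch, assuming NumPy and assuming that "variance captured by a coordinate" means that coordinate's sample variance divided by the sum of the sample variances over all coordinates. The function name `variance_share_per_coordinate` and the values of `a` are made up for illustration.

```python
import numpy as np

def variance_share_per_coordinate(M):
    """Fraction of the total variance carried by each column (coordinate) of M."""
    M = np.asarray(M, dtype=float)
    per_coord = M.var(axis=0, ddof=1)   # sample variance of each coordinate
    total = per_coord.sum()             # total variance = trace of the covariance matrix
    if total == 0:
        raise ValueError("all points are identical; total variance is zero")
    return per_coord / total

# Points of the form (a, 1) with distinct, arbitrary values of a.
a = np.array([0.5, 1.0, 2.0, 3.5, 7.0])
M = np.column_stack([a, np.ones_like(a)])

shares = variance_share_per_coordinate(M)
print(shares)  # first coordinate carries everything, second carries nothing
```

The shares always sum to 1, because the total variance is just the sum of the per-coordinate variances.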

fresh_42 · Sep 11, 2019

As in your example: consider the coordinates separately and form random variables for each coordinate.
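Applied to the original motivation, one way to read this suggestion is: express B in the coordinate system given by the top principal directions of A, then look at the variance of each of those coordinates separately. Below is a rough sketch under that reading, assuming NumPy. The helper names `top_components` and `variance_captured` are hypothetical, and the data sets are synthetic, generated only to have something to run.

```python
import numpy as np

def top_components(A, k=3):
    """Top-k principal directions of A, returned as orthonormal columns."""
    A = np.asarray(A, dtype=float)
    Ac = A - A.mean(axis=0)                       # center A
    # Rows of Vt are principal directions, ordered by decreasing singular value.
    _, _, Vt = np.linalg.svd(Ac, full_matrices=False)
    return Vt[:k].T

def variance_captured(B, directions):
    """Fraction of B's total variance along each given orthonormal direction."""
    B = np.asarray(B, dtype=float)
    Bc = B - B.mean(axis=0)                       # variance does not depend on the shift
    scores = Bc @ directions                      # coordinates of B in the new axes
    per_axis = scores.var(axis=0, ddof=1)         # one random variable per coordinate
    total = Bc.var(axis=0, ddof=1).sum()          # total variance of B
    return per_axis / total

# Synthetic data (assumption): A varies mostly in the first axes, B in the last ones.
rng = np.random.default_rng(0)
A = rng.normal(size=(200, 5)) * np.array([5.0, 3.0, 2.0, 0.5, 0.1])
B = rng.normal(size=(150, 5)) * np.array([0.1, 0.5, 2.0, 3.0, 5.0])

V = top_components(A, k=3)
frac_A = variance_captured(A, V)
frac_B = variance_captured(B, V)
print("A captured:", frac_A.sum())
print("B captured:", frac_B.sum())
```

Because the directions are orthonormal, the fractions for B add up to at most 1, and a small sum is a sign that the 3D plot in A's axes hides most of what is going on in B. Note that this does not require the directions to be eigenvectors of B's covariance matrix.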
