Is There a Linear Transformation to Map Data Set X to Y in PCA?

Trentkg · Oct 9, 2013

This question broadly relates to principal component analysis (PCA).

Say you have some data vector X, and a linear transformation K that maps X to some new data vector Z:

K*X → Z

Now say you have another linear transformation P that maps Z to a new data vector Y:

P*Z → Y

Is there a linear transformation, call it M, that maps X to Y?

M*X → Y?

If Y, Z, P and X are known, can we solve for M? I would think we could find M by simple substitution...

M*X → Y,
P*Z → Y,
M → X^-1*P*Z = X^-1 (P*K*X) ?

We'll run into serious problems here if X is not square.

WHY I'M ASKING THIS QUESTION AND HOW IT RELATES TO PCA:

Without going into too much detail, PCA is a dimensionality reduction technique. It seeks a linear transformation P that maps Z to Y such that the matrix Cy, defined as:

Cy = (1/n) Y*Y^T is diagonal.

Cy is the covariance matrix: the diagonal terms represent the variances of the system, while the off-diagonal terms represent the covariances. (To see why this is true, write Y as an m x n matrix with rows Yi. If these rows are mean zero, what does the element 1/n Yi x Yj look like? What about 1/n Yi x Yi?) When Cy is diagonalized, the diagonal terms (the variances of the system) are maximized, while the off-diagonal terms (the covariances of the system) are minimized (set to zero).

Y is Z in a new basis, with the highest variance of the system aligned along the first eigenvector, the second highest variance aligned along the second, and so on. The idea is that if there are 20 measurements in a system, yet you can express 99% of the variance of the system with only the first 4 eigenvectors, then your 20-dimensional system can probably be reduced to 4. (A better explanation can be found here: http://www.snl.salk.edu/~shlens/pca.pdf )
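A minimal NumPy sketch of the diagonalization described above, assuming the rows of P are taken to be the eigenvectors of the covariance of Z. The data, the sizes m and n, and all variable names are made up for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical data: m = 3 measurements (rows), n = 500 trials (columns).
m, n = 3, 500
A = rng.normal(size=(m, m))              # arbitrary mixing to create correlations
Z = A @ rng.normal(size=(m, n))
Z = Z - Z.mean(axis=1, keepdims=True)    # make each row mean zero

Cz = (Z @ Z.T) / n                       # covariance matrix of Z

# eigh is meant for symmetric matrices; eigenvalues come back ascending.
eigvals, eigvecs = np.linalg.eigh(Cz)
order = np.argsort(eigvals)[::-1]        # put the largest variance first
eigvals, eigvecs = eigvals[order], eigvecs[:, order]

P = eigvecs.T                            # rows of P are the eigenvectors of Cz
Y = P @ Z                                # Z expressed in the new basis
Cy = (Y @ Y.T) / n                       # off-diagonal terms vanish up to round-off

explained = eigvals / eigvals.sum()      # fraction of variance per component
print(np.round(Cy, 6))
print(explained)
```

The diagonal of Cy holds the eigenvalues of Cz, so `explained` is the quantity one would inspect to decide how many components to keep.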

Usually X is set up as an m x n matrix, where m is the number of different measurements and n the number of trials. The first transformation, K, could be standardization/normalization or a change of units. The fear is that the variance of one measurement will dominate. If m1 has variance 1 and m2 has variance 10000, then m2 will dominate the covariance matrix, even if the two are simply recorded in different units (say, cm and km). Hence we must standardize the variables with a map K so that they are comparable.
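One way to picture K, sketched under the assumption that the data have already been centered (subtracting the mean is a shift, not a linear map) and that standardization means dividing each measurement by its standard deviation. K is then a diagonal matrix; the numbers below are invented:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 1000

# Hypothetical measurements: row 0 has std ~1, row 1 has std ~100.
X = np.vstack([rng.normal(scale=1.0, size=n),
               rng.normal(scale=100.0, size=n)])
X = X - X.mean(axis=1, keepdims=True)    # centering, done before applying K

stds = X.std(axis=1)                     # per-measurement standard deviation
K = np.diag(1.0 / stds)                  # linear map that rescales each row
Z = K @ X

Cx = (X @ X.T) / n                       # row 1 dominates this matrix
Cz = (Z @ Z.T) / n                       # both rows now have unit variance
print(np.diag(Cx))
print(np.diag(Cz))
```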

The problem, then, is that the eigenvectors defining the transformation from Z to Y are expressed in terms of data set Z. Data set Z may not be of any interest to the experimenter (in this case, ME!). I'm interested in the eigenvectors/principal components of data set X!

Anyways, thanks for any help!

Erland · Oct 10, 2013

M=P*K of course!

Trentkg · Oct 10, 2013

Erland said:

M=P*K of course!

M = X^-1 (P*K)X

so you're saying

X^-1 (P*K)X = (P*K)X^-1 *X ?

My linear algebra is a little rusty, but isn't matrix multiplication non-commutative?

Erland · Oct 10, 2013

K*X=Z, P*Z=Y. Hence, Y=P*Z=P*(K*X)=(P*K)*X, so we can set M=P*K. Matrix multiplication is not commutative, but it is associative.

Trentkg · Oct 10, 2013

Ah yes! Of course, how simple. Thank you, Erland!
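Erland's associativity argument can be checked numerically. The sketch below uses random matrices as stand-ins for K, P and a non-square X (all hypothetical), and shows that M = P*K reproduces Y without ever inverting X:

```python
import numpy as np

rng = np.random.default_rng(2)
m, n = 4, 50

X = rng.normal(size=(m, n))   # non-square data matrix, so X^-1 does not exist
K = rng.normal(size=(m, m))   # any linear map taking X to Z
P = rng.normal(size=(m, m))   # any linear map taking Z to Y

Z = K @ X
Y = P @ Z
M = P @ K                     # composed map: Y = P (K X) = (P K) X

same = np.allclose(M @ X, Y)           # associativity holds
commutes = np.allclose(P @ K, K @ P)   # commutativity generally does not
print(same, commutes)
```

Note that the order matters: M is P*K, not K*P, because K is applied to X first.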
