Understanding Markov Chains and Different Types of Approximation

Homework Help Overview

The discussion revolves around singular value decomposition (SVD) and its properties, and around Markov chains and their applications in probability. Participants examine the relationships between matrix products and their eigenvalues, as well as the distinctions between different approximation methods in statistics.

Discussion Character

  • Mixed

Approaches and Questions Raised

  • Participants clarify the components of the SVD, including the ordering of singular values and the relationship between the eigenvalues of different matrix products. Questions are also raised about the ordering of eigenvectors and the implications of matrix dimensions. In addition, the definitions of several approximation methods are compared, and a Markov chain is applied to predicting weather probabilities.

Discussion Status

The discussion includes various attempts to clarify concepts related to SVD and Markov chains. Some participants offer a geometric interpretation of the SVD and explain how its factors are arranged, while others are unsure of the difference between data fitting and global approximation. The probability calculations behind Markov chains are still being worked through, with participants questioning their understanding and seeking further clarification.

Contextual Notes

Participants note the complexity of transitioning from linear algebra concepts to statistical applications, indicating a potential gap in foundational knowledge. There is also a suggestion to create separate threads for distinct topics to facilitate more focused discussions.

Jamin2112

Homework Statement



Quick question, bros.

Homework Equations



Tell me if I'm right.

The Attempt at a Solution



[itex]A = U\Sigma V^T[/itex]

where [itex]\Sigma[/itex] contains on its diagonal the singular values of [itex]A^TA[/itex],
[itex]U[/itex] contains the corresponding eigenvectors for those singular values, and
[itex]V[/itex] contains the eigenvectors corresponding to the singular values of [itex]AA^T[/itex].
 
Close but not exactly right.

[itex]\Sigma[/itex] contains on its diagonal the singular values of A. These in turn are the square roots of the eigenvalues of [itex]A^TA[/itex]. You can take square roots because [itex]A^TA[/itex] is positive semidefinite, hence has nonnegative eigenvalues.

The columns of U are the eigenvectors of [itex]AA^T[/itex].

The columns of V are the eigenvectors of [itex]A^TA[/itex].

If A is complex, you have to use Hermitian transposes above.

In addition to what you wrote, U and V are orthogonal matrices. (Or unitary matrices, if A is complex.)

The singular value decomposition has a nice geometric interpretation if A is real and square. It says that A can be decomposed into the following steps: rotate by some angle, then stretch or shrink along the standard axes, then rotate by some other angle. (The "rotation" can also include a reflection if U or V has determinant -1 instead of 1.)
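As a quick numerical sanity check, here is a minimal NumPy sketch (using a small random matrix, purely for illustration) verifying these facts:

[code]
import numpy as np

# Illustrative example: a small random square matrix.
rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))

U, s, Vt = np.linalg.svd(A)

# The singular values are the square roots of the eigenvalues of A^T A.
eig_AtA = np.linalg.eigvalsh(A.T @ A)            # returned in ascending order
assert np.allclose(np.sort(s**2), eig_AtA)

# U and V are orthogonal.
assert np.allclose(U.T @ U, np.eye(3))
assert np.allclose(Vt @ Vt.T, np.eye(3))

# The factors reconstruct A.
assert np.allclose(U @ np.diag(s) @ Vt, A)
[/code]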
 
jbunniii said:
Close but not exactly right. [...]

Thanks!
 
jbunniii said:
The columns of U are the eigenvectors of [itex]AA^T[/itex].

How do I know which order to arrange them in?
 
Jamin2112 said:
How do I know which order to arrange them in?

The standard (canonical) arrangement is to sort the singular values from high to low, so the upper left corner of [itex]\Sigma[/itex] has the largest singular value, and the lower right corner (assuming a square matrix) has the smallest singular value.

Then arrange the rows of [itex]V^T[/itex] and the columns of U in the same order: the left column of U goes with the largest singular value, the right column with the smallest. The top row of [itex]V^T[/itex] goes with the largest singular value, the bottom row with the smallest.
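For what it's worth, NumPy's np.linalg.svd already returns this canonical ordering, which you can check with a small sketch like this (random matrix for illustration):

[code]
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((4, 4))
U, s, Vt = np.linalg.svd(A)

# Singular values come back sorted from high to low.
assert np.all(s[:-1] >= s[1:])

# Column k of U pairs with row k of V^T and singular value s[k]:
# A v_k = s_k u_k.
for k in range(4):
    assert np.allclose(A @ Vt[k], s[k] * U[:, k])
[/code]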
 
jbunniii said:
The standard (canonical) arrangement is to sort the singular values from high to low [...]

Will [itex]AA^T[/itex] and [itex]A^TA[/itex] have the same eigenvalues?
 
Jamin2112 said:
Will [itex]AA^T[/itex] and [itex]A^TA[/itex] have the same eigenvalues?

Yes. You can actually prove this using the SVD.

[tex]AA^T = (U \Sigma V^T)(U \Sigma V^T)^T = U \Sigma V^T V \Sigma^T U^T = U \Sigma \Sigma^T U^T = UDU^T[/tex]

where D is a diagonal matrix whose diagonal elements are the squares of the diagonal elements of [itex]\Sigma[/itex]. Similarly, you can show that

[tex]A^TA = V D V^T[/tex]

These two equations tell you that the diagonal elements of D are the eigenvalues of both [itex]AA^T[/itex] and [itex]A^T A[/itex], and that the eigenvectors of these matrices are the columns of U and V, respectively.
 
P.S. The above assumes that A is square. If A is not square, then [itex]AA^T[/itex] and [itex]A^TA[/itex] have the same nonzero eigenvalues, but they can have different multiplicities of 0 as an eigenvalue.
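Here is a minimal NumPy check of that last point, with a deliberately non-square matrix (made up for illustration):

[code]
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((3, 5))        # non-square on purpose

ev_AAt = np.linalg.eigvalsh(A @ A.T)   # 3 eigenvalues, ascending
ev_AtA = np.linalg.eigvalsh(A.T @ A)   # 5 eigenvalues, ascending

# The nonzero eigenvalues agree; A^T A just has two extra zeros.
assert np.allclose(ev_AAt, ev_AtA[2:])
assert np.allclose(ev_AtA[:2], 0.0, atol=1e-10)
[/code]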
 
Thanks for the info, brah. I have a final tomorrow. Maybe you can answer a few more questions for me, since you seem to like this stuff.

(1) My professor says, "We study three kinds of approximation: Interpolation, Data fitting, and Global approximation." The definitions of data fitting and global approximation he then gives are eerily similar; in fact, I cannot tell the difference. Perhaps you could explain that.

(2) Markov chains! I'm trying to use my basic knowledge of statistics to understand these. We have a problem that says the probability of rain or sun tomorrow depends on the weather today: P(R→R) = .2, P(R→S) = .8, P(S→R) = .5, P(S→S) = .5. The Markov chain looks like

[tex]\begin{bmatrix} .2 & .5 \\ .8 & .5 \end{bmatrix} \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} = \begin{bmatrix} .2x_1 + .5x_2 \\ .8x_1 + .5x_2 \end{bmatrix}[/tex]

I'm trying to get a grasp of this. It says the probability of rain tomorrow is .2x1 + .5x2. I think of it as P(rain tomorrow) = P(rain tomorrow | rain today) + P(rain tomorrow | sun today) ... Am I right?

Oh, wait ... is it P(rain tomorrow) = P(rain tomorrow & rain today) + P(rain tomorrow & sun today) = P(rain tomorrow | sun today) * P(sun today) + P(rain tomorrow | rain today) * P(rain today) = .5x2 + .2x1?
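In matrix form, that computation is just one step of the chain. A quick NumPy check (the starting distribution x1 = .3, x2 = .7 is made up, purely for illustration):

[code]
import numpy as np

# Column j holds the transition probabilities out of state j
# (column 1 = rain today, column 2 = sun today).
P = np.array([[0.2, 0.5],    # row 1: rain tomorrow
              [0.8, 0.5]])   # row 2: sun tomorrow

x = np.array([0.3, 0.7])     # hypothetical: P(rain today), P(sun today)

tomorrow = P @ x
print(tomorrow)              # [0.41 0.59]: 0.41 = .2*.3 + .5*.7, as above
[/code]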
 
Jamin2112 said:
Thanks for the info, brah. I have a final tomorrow. Maybe you can answer a few more questions for me [...]

Offhand I'm not sure about these; we have now jumped from linear algebra/matrix theory to some sort of statistics. I studied Markov chains, but it was many moons ago and I don't remember most of the details anymore. You should start a new thread for each question, with an appropriate title, so others with expertise in this area will have a better chance of seeing the questions.
 
