[ALGEBRA] Unitary Matrices and length preservation

libelec

Homework Statement



Prove that unitary matrices are the only matrices that preserve the length of vectors.

The Attempt at a Solution



It's an iff, so I have to prove both directions: a) if the matrix is unitary, then it preserves lengths, and b) if the matrix preserves lengths, then it is unitary.

I could only solve a) (using the canonical inner product on R^n):

Let A in R^{n x n} be a unitary matrix and x in R^n. Then (Ax, Ax) = (Ax)^T (Ax) = x^T A^T A x = x^T x, because A^T A = I since A is unitary, and x^T x = (x, x). Therefore, A preserves the length.

But I don't know how to prove b).
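As a numerical sanity check of part a) (my own illustration, not part of the thread; the rotation matrix and the test vector are arbitrary choices), an orthogonal matrix should leave the Euclidean norm of any vector unchanged:

```python
import numpy as np

# A 2x2 rotation is orthogonal: A^T A = I, so it should preserve ||x||.
theta = 0.7  # arbitrary angle, chosen for illustration
A = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])
assert np.allclose(A.T @ A, np.eye(2))  # A^T A = I

x = np.array([3.0, -4.0])  # ||x|| = 5
# part a): ||Ax|| equals ||x|| because x^T A^T A x = x^T x
assert np.isclose(np.linalg.norm(A @ x), np.linalg.norm(x))
```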
 
You can start from the fact that it preserves the length: (Ax, Ax) = (Ax)^T (Ax) = x^T A^T A x = x^T x (as you already said).
What does this tell you about A^T A?

(Sorry, didn't bother to make the superscripts, I'm sure you see what I mean).
 
That A^T A must be I? Because it could also happen that x is an eigenvector of A and of A^T associated with the eigenvalue 1 (which is the reason why I thought that reasoning was going nowhere).
 
That can happen for some particular x. But in general, it does not.
Remember, A^T A is always the same fixed matrix, but x is an arbitrary vector.
 
libelec said:
...But I don't know how to prove b).

It is critical that you use the fact that it is unitary only if it preserves the lengths of all vectors.

Clearly you can have a non-unitary matrix which leaves one dimension alone but say doubles length in another.

Given this, it is then sufficient to show it is true for a basis since the action is linear.

That should be sufficient for you to solve the second part.

[EDIT: I may have been hasty there... let me think about whether this is a good approach.]
[EDIT2: Actually you don't need to invoke a basis per se. You should be able to use the fact that, given (x, Mx) = (x, x) for all x, M must be the identity. That one is easy enough to show by resolving it in terms of a basis and inner-product properties, so you should be able to take it as a standing lemma.]
 
It may be useful to recall this trick, which helps in a lot of problems:

If you have an expression (like A^T A), and you think it should be equal to some other expression (like I), then it is useful to study how the two expressions differ.

e.g. one might look at the properties of A^T A - I.
 
jambaugh said:
[EDIT2: Actually you don't need to invoke a basis per se. You should be able to use the fact that, given (x, Mx) = (x, x) for all x, M must be the identity. That one is easy enough to show by resolving it in terms of a basis and inner-product properties, so you should be able to take it as a standing lemma.]
I assume you'll catch this error yourself shortly, but just in case -- that lemma is very much not true. (Consider pretty much any nontrivial example.)
 
Hurkyl said:
I assume you'll catch this error yourself shortly, but just in case -- that lemma is very much not true. (Consider pretty much any nontrivial example.)

Hmmm... I guess I should have said: (y, Mx) = (y, x) for all x and all y implies M = 1.

But despite the counterexample to my error, clearly it is true that if M is a normal operator, then (x, Mx) = (x, x) for all x must imply that all eigenvalues are one and M is I.

What is more, if it is not true, then there are non-unitary length-preserving matrices... the proof would spoil the homework, so I'll PM it to you.

[EDIT] It's true and usable if M is normal, or in particular Hermitian, which is sufficient for the exercise here. I requested a counterexample via PM, but feel free to present it here. I'm stumped.
 
jambaugh said:
... I'm stumped.

Arrrg! I got it now. 1 + shift operator.
 
  • #10
jambaugh said:
Arrrg! I got it now. 1 + shift operator.

Do you mean 1 + skew-symmetric?
 
  • #11
Dick said:
Do you mean 1 + skew-symmetric?
No, that would still be a normal operator. The shift operator is nilpotent.
(e.g. S_+ = S_x + iS_y, the Pauli spin raising operator)

A general counterexample is 1 + N for any nilpotent matrix N (N^k = 0 for some k). Thus (x, Nx) = 0 since (x, N^k x) = 0, so (x, (1+N)x) = (x, 1x).
Yes I am an idgit!
 
  • #12
jambaugh said:
No, that would still be a normal operator. The shift operator is nilpotent.
(e.g. S_+ = S_x + iS_y, the Pauli spin raising operator)

A general counterexample is 1 + N for any nilpotent matrix N (N^k = 0 for some k). Thus (x, Nx) = 0 since (x, N^k x) = 0, so (x, (1+N)x) = (x, 1x).
Yes I am an idgit!

Sure, it's normal, but what's wrong with that? We are working over a real space. The eigenvectors might not be real. M = [[1,-1],[1,1]] satisfies x^T M x = x^T x for all x. The antisymmetric part drops out. I don't see what nilpotent buys you.
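Dick's counterexample can be checked numerically (a sketch I added; the random sampling is just for illustration): over the reals, the antisymmetric part of a matrix contributes nothing to the quadratic form, so this M passes the x^T M x = x^T x test for every x despite not being the identity.

```python
import numpy as np

# Dick's M = I + (antisymmetric part): x^T M x = x^T x for every real x,
# yet M != I, so "x^T M x = x^T x for all x" alone cannot force M = I.
M = np.array([[1.0, -1.0],
              [1.0,  1.0]])

rng = np.random.default_rng(0)
for _ in range(100):
    x = rng.standard_normal(2)
    assert np.isclose(x @ M @ x, x @ x)  # antisymmetric part drops out

assert not np.allclose(M, np.eye(2))  # ...but M is not the identity
```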
 
  • #13
Dick said:
Sure, it's normal, but what's wrong with that? We are working over a real space. The eigenvectors might not be real. M = [[1,-1],[1,1]] satisfies x^T M x = x^T x for all x. The antisymmetric part drops out. I don't see what nilpotent buys you.

So it does (in real space). And my example doesn't!
Arrrg!
I'm a bigger igit than I thought! (My excuse is I'm trying to absorb CUDA programming right now and it's taking up all my neural resources!)

I assumed (since the OP referred to a unitary rather than an orthogonal matrix) that we were talking about general complex Hilbert spaces and not a real (or complex orthogonal) inner product space. But I see that he did specify a real space.

Your example clearly wouldn't be true for the eigenvectors, e.g. x = (1, i)^T. (Trying to save a little face here!)

Back to the OP's problem...
I can see my suggestions were all wrong.
Perhaps the trick of (A(x+y) | A(x+y)) expanded.
Ahhh yes, that will do the trick and use only the properties of the inner product!
Hope I didn't give too much away there.
 
  • #14
Aha! You never said that you were defining M = A^T A -- I thought you were using M to refer to what the OP called A, so I misunderstood what you were getting at.
 
  • #15
jambaugh said:
So it does (in real space). And my example doesn't!
Arrrg!
I'm a bigger igit than I thought! (My excuse is I'm trying to absorb CUDA programming right now and it's taking up all my neural resources!)

I assumed (since the OP referred to a unitary rather than an orthogonal matrix) that we were talking about general complex Hilbert spaces and not a real (or complex orthogonal) inner product space. But I see that he did specify a real space.

Your example clearly wouldn't be true for the eigenvectors, e.g. x = (1, i)^T. (Trying to save a little face here!)

Back to the OP's problem...
I can see my suggestions were all wrong.
Perhaps the trick of (A(x+y) | A(x+y)) expanded.
Ahhh yes, that will do the trick and use only the properties of the inner product!
Hope I didn't give too much away there.

My example would be true for (1,i)^T (though it's not part of the real space)! Just plug it in. Remember the inner product is the REAL inner product (x,x)=x^Tx. Not the COMPLEX inner product x^(T*)x. If you promote the whole problem to a complex space, then yes, you are right. But be careful when you do that. What's true in the complex space is not necessarily true in the real space.
 
  • #16
Dick said:
My example would be true for (1,i)^T (though it's not part of the real space)! Just plug it in. Remember the inner product is the REAL inner product (x,x)=x^Tx. Not the COMPLEX inner product x^(T*)x. If you promote the whole problem to a complex space, then yes, you are right. But be careful when you do that. What's true in the complex space is not necessarily true in the real space.

If we're talking about unitary operators in the complex extension, then the inner product ( | ) they preserve must be the Hermitian inner product of the Hilbert space (2nd case). The promotion to complex space is ambiguous: either R-orthogonal -> C-orthogonal or R-orthogonal -> unitary. I was keying off the "unitary" term in the OP.

With respect to terminology, I think of your "COMPLEX" one as the "real" inner product in the sense that it yields real norms on all (complex) vectors. Or, more precisely, that the invariance group is a real Lie group U(V) rather than the complex group SO(V;C).
But I think we understand each other beyond the choice of terms.
 
  • #17
Hurkyl said:
It may be useful to recall this trick, which helps in a lot of problems:

If you have an expression (like A^T A), and you think it should be equal to some other expression (like I), then it is useful to study how the two expressions differ.

e.g. one might look at the properties of A^T A - I.

You say I should study x^T A^T A x = x^T x, then x^T A^T A x - x^T x = 0, then x^T (A^T A - I) x = 0?

The problem with that is that x could belong to the kernel of A^T A - I, and I don't see how I could manage to disregard that (the other possibilities: that A^T A is I, or that x is the zero vector of its vector space).
 
  • #18
libelec,
I think you'll find a simpler demonstration of b) by starting with the fact that A will preserve the length of (x+y) for arbitrary x and y. Expand and see what happens.
 
  • #19
You really ought to use that A^T A is self-adjoint.
 
  • #20
libelec said:
You say I should study x^T A^T A x = x^T x, then x^T A^T A x - x^T x = 0, then x^T (A^T A - I) x = 0?

The problem with that is that x could belong to the kernel of A^T A - I, and I don't see how I could manage to disregard that (the other possibilities: that A^T A is I, or that x is the zero vector of its vector space).

If x belongs to the kernel of A^T A - I for all vectors x, then the kernel of A^T A - I is all of the vector space. Undoubtedly you have seen that the only map with this property is the null map O.
 
  • #21
libelec said:
You say I should study x^T A^T A x = x^T x, then x^T A^T A x - x^T x = 0, then x^T (A^T A - I) x = 0?

The problem with that is that x could belong to the kernel of A^T A - I, and I don't see how I could manage to disregard that (the other possibilities: that A^T A is I, or that x is the zero vector of its vector space).

Here's another possibility. Let N be the matrix [[0,1],[-1,0]] in R^2. Then x^T N x = 0 for all x. Why can't A^T A - I be that kind of map? I really don't understand why SOMEONE doesn't bring up the notion of self-adjointness.
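Dick's point can also be checked numerically (my own sketch, not from the thread): the quadratic form of an antisymmetric matrix vanishes identically, so x^T (A^T A - I) x = 0 alone does not pin A^T A - I down to the zero map; what rules out maps like N is that A^T A - I is symmetric.

```python
import numpy as np

# The antisymmetric N = [[0,1],[-1,0]]: x^T N x = 0 for every real x,
# even though N is not the zero map. Symmetry is what excludes such maps,
# since A^T A - I is always symmetric while N is not.
N = np.array([[0.0, 1.0],
              [-1.0, 0.0]])

rng = np.random.default_rng(1)
for _ in range(100):
    x = rng.standard_normal(2)
    assert np.isclose(x @ N @ x, 0.0)  # quadratic form vanishes identically

assert not np.allclose(N, N.T)  # N is not symmetric, unlike A^T A - I
```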
 
  • #22
jambaugh said:
libelec,
I think you'll find a simpler demonstration of b) by starting with the fact that A will preserve the length of (x+y) for arbitrary x and y. Expand and see what happens.

I haven't seen anything that would help me:

<A(x+y)|A(x+y)> = <Ax + Ay|Ax + Ay> = x^T A^T A x + x^T A^T A y + y^T A^T A x + y^T A^T A y = <(x + y)|(x + y)> = x^T x + x^T y + y^T x + y^T y. So:

x^T A^T A x + x^T A^T A y + y^T A^T A x + y^T A^T A y = x^T x + x^T y + y^T x + y^T y.

And then I could only figure out the same thing I tried before with <Ax|Ax>.

Dick said:
You really ought to use that A^T A is self-adjoint.

How? I don't see how that helps. A^T A = (A^T A)^T, then what?
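A numerical illustration of where that expansion leads (a sketch of mine, with a random orthogonal A built via a QR factorization just to have a concrete length-preserving matrix): after the diagonal terms cancel, the cross terms compare M = A^T A against the identity, and probing a matrix with standard basis vectors reads off its entries one by one.

```python
import numpy as np

rng = np.random.default_rng(2)
# A concrete length-preserving A: the Q factor of a QR decomposition is orthogonal.
A, _ = np.linalg.qr(rng.standard_normal((4, 4)))
M = A.T @ A

# Cross terms of the (x+y) expansion: x^T M y + y^T M x = x^T y + y^T x.
x = rng.standard_normal(4)
y = rng.standard_normal(4)
assert np.isclose(x @ M @ y + y @ M @ x, x @ y + y @ x)

# Probing with standard basis vectors e_i, e_j extracts M entrywise:
# e_i^T M e_j = M[i, j], which here must equal the identity's entries.
E = np.eye(4)
for i in range(4):
    for j in range(4):
        assert np.isclose(E[i] @ M @ E[j], np.eye(4)[i, j])
```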
 
  • #23
libelec said:
I haven't seen anything that would help me:

<A(x+y)|A(x+y)> = <Ax + Ay|Ax + Ay> = x^T A^T A x + x^T A^T A y + y^T A^T A x + y^T A^T A y = <(x + y)|(x + y)> = x^T x + x^T y + y^T x + y^T y. So:

x^T A^T A x + x^T A^T A y + y^T A^T A x + y^T A^T A y = x^T x + x^T y + y^T x + y^T y.

And then I could only figure out the same thing I tried before with <Ax|Ax>.



How? I don't see how that helps. A^T A = (A^T A)^T, then what?
You almost have it! Remember your assumption that A is norm-preserving, and remember that your conclusion is the definition of unitarity, i.e. being inner-product preserving. Apply your assumption and seek your conclusion.
 
  • #24
libelec said:
I haven't seen anything that would help me:

<A(x+y)|A(x+y)> = <Ax + Ay|Ax + Ay> = x^T A^T A x + x^T A^T A y + y^T A^T A x + y^T A^T A y = <(x + y)|(x + y)> = x^T x + x^T y + y^T x + y^T y. So:

x^T A^T A x + x^T A^T A y + y^T A^T A x + y^T A^T A y = x^T x + x^T y + y^T x + y^T y.

And then I could only figure out the same thing I tried before with <Ax|Ax>.



How? I don't see how that helps. A^T A = (A^T A)^T, then what?

You have been getting a lot of bad advice and I'm not sure why. Some of these people should know better. You have <x|Mx> = <x|x> where M = A^T A, right? So M is self-adjoint. A self-adjoint operator has a complete set of eigenvectors. Use that.
 
  • #25
Dick said:
You have been getting a lot of bad advice and I'm not sure why. Some of these people should know better. You have <x|Mx>=<x|x> where M=A^TA, right? So M is self adjoint. A self adjoint operator has a complete set of eigenvectors. Use that.

OK, let me know if I just made a mistake, please:

<Ax|Ax> = x^T A^T A x. Since A^T A is always a symmetric matrix over R, it's diagonalizable by an orthogonal matrix P. Then

<Ax|Ax> = x^T A^T A x = x^T P D P^T x = x^T x, with D the diagonal matrix that carries the eigenvalues of A^T A.

Then, each column i of the product matrix is x_i^T P_i sigma_i P_i^T x_i, with sigma_i an eigenvalue of A^T A. Since it's a scalar number, this is the same as sigma_i x_i^T P_i P_i^T x_i, and since P P^T = I, for it's orthogonal, this is equal to sigma_i x_i^T x_i = x_i^T x_i iff sigma_i = 1 for every i. Then A^T A has to be I, iff A is orthogonal (unitary in C).

I think there's something wrong (especially when I consider each column, where I commute the eigenvalue); I don't know if this is what you meant?
 
  • #26
libelec said:
OK, let me know if I just made a mistake, please:

<Ax|Ax> = xTATAx. Since ATA is always a symmetric matrix in R, then it's diagonalizable by an orthogonal matrix P. Then

<Ax|Ax> = xTATAx = xTPDATAPTx = xTx, with DATA the diagonal matrix of ATA that loads its eigenvalues.

Then, each column i of the product matrix is xTiPi\sigmaiPTixi, with \sigmai an eigenvalue of ATA. Since it's a scalar number, this is the same as \sigmaixTiPiPTixi, and since PPT = I, for it's orthogonal, then this is equal to \sigmaixTixi = xiTxi, iff \sigmai = 1 for every i. Then ATA has to be I, iff A is orthogonal (unitary in C).

I think there's something wrong (especially when I consider each column, I commute the eigenvalue), I don't know if this is what you meant?

That's what I meant, all right. But you are overcomplicating the notation. The eigenvectors e_i of A^T A span the vector space (that's what diagonalizable means). If (A^T A) e_i = r_i e_i (r_i being the eigenvalue of e_i), then e_i^T (A^T A) e_i = r_i e_i^T e_i = e_i^T e_i (since x^T (A^T A) x = x^T x). So, sure, all of the eigenvalues are 1. But now any vector v can be written as a linear combination of those eigenvectors. So (A^T A) v = v for all v. So A^T A = I.
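Dick's eigenvector argument can be mirrored numerically (a sketch I added; A is generated via QR just to have a concrete orthogonal example): A^T A is symmetric, its eigenvalues are forced to be 1 by length preservation, and since the eigenvectors span the space, A^T A must be the identity.

```python
import numpy as np

rng = np.random.default_rng(3)
A, _ = np.linalg.qr(rng.standard_normal((5, 5)))  # a sample orthogonal A
M = A.T @ A

# M is symmetric, so eigh returns real eigenvalues and orthonormal eigenvectors.
eigenvalues, eigenvectors = np.linalg.eigh(M)

# Length preservation forces every eigenvalue to be 1...
assert np.allclose(eigenvalues, 1.0)
# ...and since the eigenvectors span the space, M v = v for all v, i.e. M = I.
assert np.allclose(M, np.eye(5))
```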
 
  • #27
Dick said:
That's what I meant, all right. But you are overcomplicating the notation. The eigenvectors e_i of A^T A span the vector space (that's what diagonalizable means). If (A^T A) e_i = r_i e_i (r_i being the eigenvalue of e_i), then e_i^T (A^T A) e_i = r_i e_i^T e_i = e_i^T e_i (since x^T (A^T A) x = x^T x). So, sure, all of the eigenvalues are 1. But now any vector v can be written as a linear combination of those eigenvectors. So (A^T A) v = v for all v. So A^T A = I.

Thanks, finally I understood.
 