Matrix and Projection: Gram-Schmidt Process and Projection Matrix

In summary: A = QR, then you can use it to show p = QQTb, but not the other way around. (How could you possibly get from p = QQTb to A = QR?) So the question is, how do you show QQT = A(ATA)-1AT directly?Here's a hint: writeA = QRand substitute this into the right-hand side of QQT = (QR)(1)RTQT. What do you get?Sorry, my original response assumed that Q is m x m and R is m x n (the standard QR factorization). But the more natural factorization associated with Gram-Schmidt is A = QR where Q
  • #1
Denver Dang
148
1

Homework Statement


Let A be a m x n matrix of rank n and let [tex]\textbf{b} \in R^{m}[/tex]. If Q and R are the matrices derived from applying the Gram-Schmidt process to the column vectors of A and

p = c1q1 + c2q2 + ... + cnqn

is the projection of b onto R(A), then show that:

a) c = QTb
b) p = QQTb
c) QQT = A(ATA)-1AT


Homework Equations


I'm not quite sure about the (a).
But in (b) I have, from my book:
Let S be a nonzero subspace of Rm and let [tex]\textbf{b} \in R^{m}[/tex]. If {u1, u2,..., uk} is an orthonormal basis for S and U = (u1, u2,..., uk), then the projection p of b onto S is given by

p = UUTb

In (c) I have (Some place in my book) that:

[tex]\textbf{p} = A\hat{x} = A\left(A^{T}A\right)^{-1}A^{T}\textbf{}[/tex] which is called the projection matrix.


The Attempt at a Solution


As I said, I don't know about (a), but in (b) my thought was that the projection p of b is given by:

p=c1u1 + ... + ckuk = Uc, which is pretty much the same as given in the problem, and

c = (c1, c2,...,ck) = (u1Tb, u2Tb,..., ukTb) = UTb

and then I get: p = UUTb, which is what I needed to find (except for some other letters used in this problem).

In (c) we have the p from (b) which were p = UUTb. So by applying p on p's place in the equation given in "Rel. eq" I get

UUTb = [tex]A\hat{x} = A\left(A^{T}A\right)^{-1}A^{T}\textbf{}[/tex] which is pretty much the thing I needed to show, except for the two b's on each side.
But I don't know if I just can remove them, just if they were numbers? Because that would give me the correct equation.


Hope I haven't made it to confusing. At least I hope for some hints for (a), and maybe a hint about (b) and (c)'s correctness ? :)


Regards
 
Physics news on Phys.org
  • #2
No one ? :S
 
  • #3
For part (a), you have

cj = <b, qj> = bTqj = qjTb for each j. Stack these into a column vector and you're done.

You can use (a) in a similar way to show (b). For (c), try writing A in terms of Q and R and substituting into the right-hand side.
 
  • #4
jbunniii said:
For part (a), you have

cj = <b, qj> = bTqj = qjTb for each j. Stack these into a column vector and you're done.

You can use (a) in a similar way to show (b). For (c), try writing A in terms of Q and R and substituting into the right-hand side.

Thank you :)

I will try that.
 
  • #5
Hi again.

I'm just thinking. In c, why bother writing A it in terms of Q and R ?
I've almost proved it, except the b's. So don't I just need an argument for removing the b's on each side ?
 
  • #6
Denver Dang said:
Hi again.

I'm just thinking. In c, why bother writing A it in terms of Q and R ?
I've almost proved it, except the b's. So don't I just need an argument for removing the b's on each side ?

Well, it's true that if S and T are matrices, and Sb = Tb for every vector b in R^m, then S = T. (i.e., if S and T map every b the same way, then by definition they are the same map).

But what's your justification for writing

[tex]p = A(A^TA)^{-1}A^T[/tex]

in the first place? Correct me if I'm wrong, but seems to me that you are trying to prove the formula is true by saying "I found it in the book, so it must be true." The point of writing A in terms of Q and R is that it will allow you to prove part (c) directly.
 
  • #7
You have a point :)

So you want me to write:

A = QR

QQT = A(ATA)-1AT
QQT = (QR)((QR)T(QR))-1(QR)T
QQT = (QR)(RTQTQR)-1RTQT
QQT = (QR)(1)RTQT
QQT = (QR)RTQT
QQT = Q(RRT)QT
QQT = Q(1)QT
QQT = QQT

And then I have it ?
 
  • #8
Denver Dang said:
You have a point :)

So you want me to write:

A = QR

QQT = A(ATA)-1AT
QQT = (QR)((QR)T(QR))-1(QR)T
QQT = (QR)(RTQTQR)-1RTQT
QQT = (QR)(1)RTQT
QQT = (QR)RTQT
QQT = Q(RRT)QT
QQT = Q(1)QT
QQT = QQT

And then I have it ?

I don't think this is quite right. You're assuming that

[tex]R^T R = R R^T = I[/tex]

which means that R is an orthogonal matrix. But is that true? The Gram-Schmidt procedure gives you a factorization

[tex]A = QR[/tex]

where Q is orthogonal, but what kind of matrix is R? It's not orthogonal in general.
 
  • #9
jbunniii said:
I don't think this is quite right. You're assuming that

[tex]R^T R = R R^T = I[/tex]

which means that R is an orthogonal matrix. But is that true? The Gram-Schmidt procedure gives you a factorization

[tex]A = QR[/tex]

where Q is orthogonal, but what kind of matrix is R? It's not orthogonal in general.
Hmmm, then R is an upper triangular matrix as far as I can read in my book.
But what is the last step, if RRT=RTR isn't not equal 1 ?
An upper triangular matrix * a transposed upper triangular matrix don't give anything I can use, does it ? :S
 
  • #10
I think I maybe figured it out.
Since the middle brackets have ^-1, then when the first QQT becomes 1, since they are orthogonal, then you have RTR left, which will be R-1(RT)-1. And then we have RR-1 and (RT)-1R-1 which should be one, since the inverse matrix has nothing to do with ortogonality.

Right ?
 
  • #11
Denver Dang said:
I think I maybe figured it out.
Since the middle brackets have ^-1, then when the first QQT becomes 1, since they are orthogonal, then you have RTR left, which will be R-1(RT)-1. And then we have RR-1 and (RT)-1R-1 which should be one, since the inverse matrix has nothing to do with ortogonality.

Right ?

[EDIT]: Sorry, my original response assumed that Q is m x m and R is m x n (the standard QR factorization). But the more natural factorization associated with Gram-Schmidt is

A = QR

where Q is m x n and R is n x n. This assumes n <= m, which is implied by the fact that rank(A) = n. (Why?)

Looking more carefully at your problem statement, in particular 1(a), it's clear that Q in your case is m x n, and therefore R must be n x n. Since A has rank n, so also must R have rank n. (Why?) Therefore your assumption that R is invertible is valid, and what you have written is correct, assuming you meant:

[tex]\begin{align*}
A(A^T A)^{-1} A^T &= QR((QR)^T QR)^{-1}(QR)^T \\
&= QR(R^T Q^T Q R)^{-1} R^T Q^T \\
&= QR(R^T R)^{-1} R^T Q^T \\
&= QR R^{-1} (R^T)^{-1} R^T Q^T \\
&= Q(R R^{-1})((R^T)^{-1}R^T)Q^T \\
&= Q I I Q^T \\
&= QQ^T
\end{align*}[/tex]

If instead the "other" factorization was used, A = QR where Q is m x m and R is m x n, then R is no longer necessarily even square, let alone invertible. In that case, 1(c) is still true, and you still obtain it by substituting A = QR, but you have to argue more delicately in that case because R itself isn't invertible. That was the content of my original response, but now that I see it isn't the case for your problem, I deleted that response. Sorry if you read it before I removed it, and for any confusion it might have caused.
 
Last edited:

1. What is a matrix in the context of projection?

In the context of projection, a matrix is a mathematical tool used to transform or project points from one coordinate system to another. It is represented as a rectangular array of numbers, with each number representing a specific transformation or scaling factor.

2. How is a projection matrix different from a regular matrix?

A projection matrix is specifically used to project points from one coordinate system to another, while a regular matrix can be used for a variety of mathematical operations. A projection matrix typically has a specific structure and set of values that are used for projection calculations.

3. What is the purpose of using a projection matrix?

The purpose of using a projection matrix is to transform or project points from one coordinate system to another. This is especially useful in computer graphics and 3D rendering, where objects in a 3D space need to be projected onto a 2D screen for display.

4. How is a projection matrix calculated?

A projection matrix is calculated by first defining the 3D space that needs to be projected onto a 2D plane. Then, specific values are assigned to the matrix to determine the scaling, rotation, and translation of the points in the 3D space. These values are typically based on the properties of the projection, such as the field of view and aspect ratio.

5. Can a projection matrix be used for non-linear projections?

Yes, a projection matrix can be used for non-linear projections, such as perspective and orthographic projections. These types of projections result in non-uniform scaling, meaning that the scale of the object changes depending on its distance from the projection plane.

Similar threads

  • Calculus and Beyond Homework Help
Replies
17
Views
984
  • Calculus and Beyond Homework Help
Replies
6
Views
295
  • Calculus and Beyond Homework Help
Replies
16
Views
1K
  • Calculus and Beyond Homework Help
Replies
4
Views
2K
  • Calculus and Beyond Homework Help
Replies
8
Views
938
  • Calculus and Beyond Homework Help
Replies
2
Views
1K
  • Linear and Abstract Algebra
Replies
14
Views
1K
  • Calculus and Beyond Homework Help
Replies
1
Views
1K
  • Calculus and Beyond Homework Help
Replies
23
Views
8K
  • Calculus and Beyond Homework Help
Replies
5
Views
1K
Back
Top