Understanding the Geometric Interpretation of the Gram-Schmidt Process

  • Context: Graduate 
  • Thread starter Thread starter EngWiPy
  • Start date Start date
  • Tags Tags
    Process
Click For Summary

Discussion Overview

The discussion centers on the geometric interpretation of the Gram-Schmidt orthonormalization process, exploring how the procedure ensures orthogonality among vectors. Participants seek to understand the underlying geometric concepts and representations involved in the process.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • One participant questions how the second step of the Gram-Schmidt process ensures that the vector u2 is orthogonal to e1, seeking a geometric interpretation.
  • Another participant explains that the expression e1 represents the projection of a2 onto e1, and that subtracting this projection from a2 yields a vector orthogonal to e1.
  • A further reply elaborates on the orthogonality condition by showing that the inner product of e1 and the adjusted vector (a2 - e1) equals zero, confirming their orthogonality.
  • One participant introduces a geometric interpretation based on the work of Hestenes and Sobczyk, discussing the use of the wedge product to represent subspaces spanned by vectors and how this relates to the Gram-Schmidt process.
  • Another participant describes the process as "shaving off" components of vectors until they are orthogonal, using a metaphor of planks of wood and shadows to illustrate the concept of removing projections to achieve orthogonality.

Areas of Agreement / Disagreement

Participants express various interpretations and explanations of the Gram-Schmidt process, with no consensus reached on a single geometric interpretation. Multiple competing views and methods of explanation are presented.

Contextual Notes

Some participants express uncertainty about the foundational concepts involved in the Gram-Schmidt process, indicating a potential gap in understanding the geometric implications of the procedure.

EngWiPy
Messages
1,361
Reaction score
61
Hi,

What is the geometric interpretation of the Gram-Schmidt orthonormalization process? I mean, you will find everywhere that the procedure is as follows:

1. set u1=a1 => e1=u1/||u1||
2. u2=a2-<a2,e1>e1 => e2=u2/||u2||
.
.
.

As a comment on the second line, you will read that this is done to ensure that u2 is orthogonal on e1. But how is that? What is the geometric interpretation of this?

Thanks in advance
 
Physics news on Phys.org
S_David said:
Hi,

What is the geometric interpretation of the Gram-Schmidt orthonormalization process? I mean, you will find everywhere that the procedure is as follows:

1. set u1=a1 => e1=u1/||u1||
2. u2=a2-<a2,e1>e1 => e2=u2/||u2||
.
.
.

As a comment on the second line, you will read that this is done to ensure that u2 is orthogonal on e1. But how is that? What is the geometric interpretation of this?

Thanks in advance

<a2,e1>e1 is the projection of a2 onto e1, then a2-<a2,e1>e1 is the vector orthogonal to e1
 
yifli said:
<a2,e1>e1 is the projection of a2 onto e1, then a2-<a2,e1>e1 is the vector orthogonal to e1

How is that? Can you elaborate more? Obviously I have problems in the basics.!
 
Here's the simplest way to see it:

1. We know that two vectors a and b are orthogonal if and only if <a,b> = 0.

2. We can see that <e1, a2 - <a2,e1>e1> = <e1,a2> - <e1,<a2,e1>e1> = <e1,a2> - <a2,e1><e1,e1> = <e1,a2> - <a2,e1> = <e1,a2> - <e1,a2> = 0

3. Therefore, e1 and a2 - <a2,e1>e1 are orthogonal.
 
If you're looking for a geometric interpretation of Gram-Schmidt, the finest I've seen comes originally from Hestenes and Sobczyk, "Clifford Algebra to Geometric Calculus". It involves the wedge product. Here's a quick exposition of the main ideas: both in algebraic form, and in plain English.

I'll call the original (non-orthogonal) frame [itex]a_i[/itex], and the orthogonal frame [itex]c_i[/itex]. Assume there are [itex]n[/itex] vectors.

First, we need a way to represent "the subspace spanned by the first [itex]k[/itex] vectors". This is given by
[tex]A_k = a_1 \wedge a_2 \wedge \ldots \wedge a_k[/tex]
If you don't know what the wedge product means, simply think of [itex]A_k[/itex] as "the subspace spanned by the first [itex]k[/itex] vectors.

Now, the subspace spanned by all the vectors is of course [itex]A_n[/itex]. We will write this down, then multiply by unity in a clever way.
[tex] \begin{align}<br /> a_1 \wedge a_2 \wedge \ldots \wedge a_n &= A_n \\<br /> &= (A_1A_1^{-1})(A_2A_2^{-1}) \ldots (A_{n-1}A_{n-1}^{-1})A_n \\<br /> &= A_1(A_1^{-1}A_2)(A_2^{-1}\ldots)\ldots(\ldots A_{n-1})(A_{n-1}^{-1}A_n) \\<br /> &= c_1(c_2)(c_3)\ldots(c_{n-1})(c_n) \\<br /> \end{align}[/tex]
Here, the expression for the [itex]k[/itex]th vector is
[tex] c_k = A_{k-1}^{-1}A_k[/tex]
What we have done is to write a pure [itex]n[/itex]-vector (i.e. [itex]A_n[/itex]) as the geometric product of [itex]n[/itex] different vectors (i.e. the [itex]c_k[/itex]). These vectors must therefore all be mutually orthogonal.

What did we do, in plain English? Well, to find the k'th Gram-Schmidt vector,
1) Take the subspace spanned by the first [itex]k[/itex] vectors
2) Remove the subspace spanned by the first [itex](k-1)[/itex] vectors
In other words: we keep only what the [itex]k[/itex]th vector "adds", only what it gets us that we couldn't get before.

Note that the usual Gram-Schmidt is an iterative procedure: you actually need to calculate [itex]c_1, c_2, \ldots, c_{k-1}[/itex] before you can get [itex]c_k[/itex]. Not so with this exposition: we can directly write down an expression for any [itex]c_k[/itex] involving only the original [itex]a_i[/itex]. Moreover, the expression is a simple one with clear geometric meaning. A very elegant take on this well-known algorithm -- I was delighted when I first read it.
 
You're basically shaving off from a vector until it's orthogonal to previous vectors. Orthogonal vectors have an inner product of 0, so that if your inner product is not 0 it has some projection on original vectors. If you subtract off that part so that the inner product is indeed 0, then you'll leave a vector that's orthogonal.

Think of having two planks of wood that are obliquely aligned. If you shine a light from above you'll see a shadow, so you shave down the wood by the amount of the shadow casted on it. This leaves no shadow meaning the two planks are normal.
 

Similar threads

  • · Replies 14 ·
Replies
14
Views
4K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 7 ·
Replies
7
Views
14K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 10 ·
Replies
10
Views
4K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 2 ·
Replies
2
Views
3K