Schwarz Inequality is your friend

samalkhaiat · Dec 29, 2015

I would like to show you how to use the Schwarz inequality to prove some important general theorems and solve problems about vectors in Minkowski spacetime.
Okay, the Schwarz inequality states that
[tex]\left| U^{k}V^{k}\right| \leq \sqrt{(U^{i})^{2}(V^{j})^{2}}. \ \ i,j,k =1,2,3 \ \ \ (1)[/tex]
And, the equality holds if and only if [itex]U^{i} \propto V^{i}[/itex].
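As a quick sanity check (not part of the original post), here is a minimal Python sketch of inequality (1) in squared form, so that integer arithmetic stays exact. The helper names `dot3` and `schwarz_gap` and the sample vectors are illustrative assumptions only.

```python
def dot3(u, v):
    """Euclidean dot product U^k V^k, summed over k = 1, 2, 3."""
    return sum(a * b for a, b in zip(u, v))

def schwarz_gap(u, v):
    """(U^i)^2 (V^j)^2 - (U^k V^k)^2, which (1) says is never negative."""
    return dot3(u, u) * dot3(v, v) - dot3(u, v) ** 2

U = (1, 2, 3)
V = (4, -5, 6)   # not proportional to U
W = (2, 4, 6)    # W = 2 U, the equality case

print(schwarz_gap(U, V))  # 934
print(schwarz_gap(U, W))  # 0
```

The gap vanishes only for the proportional pair, which is the equality case used later in the proof of Theorem (E).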
Let us see how we apply this to 4-vectors in Minkowski spacetime [itex]M^{4}(+1, -3)[/itex]. With this signature:
i) a vector [itex]\mathbf{U} \in M^{4}[/itex] for which
[tex]\mathbf{U} \cdot \mathbf{U} \equiv \eta_{\mu\nu}U^{\mu}U^{\nu} > 0 ,[/tex]
is called a timelike vector;
ii) a vector [itex]\mathbf{U} \in M^{4}[/itex] for which
[tex]\mathbf{U} \cdot \mathbf{U} \equiv \eta_{\mu\nu}U^{\mu}U^{\nu} < 0 ,[/tex]
is called a spacelike vector; and
iii) a vector [itex]\mathbf{U} \in M^{4}[/itex] that satisfies
[tex]\mathbf{U} \cdot \mathbf{U} \equiv \eta_{\mu\nu}U^{\mu}U^{\nu} = 0 ,[/tex]
is called a null (or lightlike) vector.
Now, suppose that [itex]\mathbf{U}[/itex] and [itex]\mathbf{V}[/itex] are two timelike vectors. Then, from the definition (i), we have
[tex] \begin{align*} \left(U^{i}\right)^{2} &< \left(U^{0}\right)^{2} \\ \left(V^{j}\right)^{2} &< \left(V^{0}\right)^{2} . \end{align*}[/tex]
These inequalities can be combined to give
[tex]\sqrt{\left(U^{i}\right)^{2}\left(V^{j}\right)^{2}} < \left| U^{0} V^{0} \right| . \ \ \ \ \ \ \ (2)[/tex]
Now, from the Schwarz inequality (1), we conclude that
[tex] \left| U^{k}V^{k} \right| < \left| U^{0}V^{0}\right| , \ \ \ \ \ \ \ \ \ \ \ (3)[/tex]
holds for any two timelike vectors. Thus, we have proved the following theorem:
Theorem (A): No two timelike vectors can be orthogonal (in the Minkowski sense).
For,
[tex]\mathbf{U}\cdot \mathbf{V} = 0 \ \ \Rightarrow \ \ \left| U^{k}V^{k}\right| = \left| U^{0}V^{0}\right| ,[/tex] which contradicts (3). qed.

Theorem (B): If [itex]\mathbf{U},\mathbf{V}[/itex] are two timelike vectors such that [itex]U^{0} > 0, \ \ V^{0} > 0[/itex], then [itex]\mathbf{U}\cdot \mathbf{V} > 0[/itex].
Proof:
[tex]U^{k}V^{k} \leq \left|U^{k}V^{k}\right| \leq \sqrt{(U^{i})^{2}(V^{j})^{2}} .[/tex]
Using (2) or (3), and the fact that [itex]U^{0}>0[/itex] and [itex]V^{0}>0[/itex], we obtain
[tex]U^{k}V^{k} < \left|U^{0}V^{0}\right| = U^{0}V^{0} .[/tex]
Thus
[tex]U^{0}V^{0} - U^{k}V^{k} = \mathbf{U}\cdot \mathbf{V} > 0 .[/tex] qed
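Theorem (B) can also be spot-checked numerically. The sketch below is only an illustration under stated assumptions: the helper names, the sampling scheme (a random spatial part with U^0 chosen strictly larger than its length) and the seed are all made up for this example, and a random test is of course no substitute for the proof.

```python
import random

def minkowski_dot(u, v):
    """Minkowski product U^0 V^0 - U^k V^k, signature (+,-,-,-)."""
    return u[0] * v[0] - sum(a * b for a, b in zip(u[1:], v[1:]))

def random_future_timelike(rng):
    """Pick a spatial part, then set U^0 = |spatial part| + positive margin."""
    spatial = [rng.uniform(-1.0, 1.0) for _ in range(3)]
    length = sum(x * x for x in spatial) ** 0.5
    return [length + rng.uniform(0.1, 1.0)] + spatial

rng = random.Random(0)  # fixed seed so the run is reproducible
samples = [(random_future_timelike(rng), random_future_timelike(rng))
           for _ in range(1000)]

# Every sampled pair should satisfy U.V > 0, as Theorem (B) asserts.
all_positive = all(minkowski_dot(u, v) > 0 for u, v in samples)
print(all_positive)  # True
```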

Theorem (C): A timelike vector cannot be orthogonal to a non-zero null vector.
Proof:
Let [itex]\mathbf{T}[/itex] be a timelike vector, and [itex]\mathbf{N}[/itex] a non-zero null vector. This means that
[tex]\begin{align*} \left(T^{i}\right)^{2} &< \left(T^{0}\right)^{2} \\ \left(N^{j}\right)^{2} &= \left(N^{0}\right)^{2} . \end{align*}[/tex]
Thus, since [itex]N^{0} \neq 0[/itex] for a non-zero null vector,
[tex] \left(T^{i}\right)^{2} \left(N^{j}\right)^{2} < \left( T^{0}N^{0} \right)^{2} .[/tex]
Using the Schwarz inequality (1), we get
[tex]\left( T^{k}N^{k} \right)^{2} < \left( T^{0}N^{0} \right)^{2} . \ \ \ \ \ \ (4)[/tex]
Now, suppose [itex]\mathbf{T}\cdot \mathbf{N} = 0[/itex]. This implies [itex]\left( T^{k}N^{k} \right)^{2} = \left( T^{0}N^{0} \right)^{2}[/itex] which contradicts (4). qed.

Theorem (D): If [itex]\mathbf{T}[/itex] is a timelike vector and [itex]\mathbf{S}[/itex] is a non-zero vector with [itex]\mathbf{T} \cdot \mathbf{S} = 0[/itex], then [itex]\mathbf{S}[/itex] is a spacelike vector.
Proof:
From theorem (A), [itex]\mathbf{S}[/itex] cannot be a timelike vector. And from theorem (C), [itex]\mathbf{S}[/itex] cannot be a non-zero null vector. Since [itex]\mathbf{S}[/itex] is non-zero, it must be a spacelike vector. qed.
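A concrete instance of Theorem (D), sketched in Python with exact rational arithmetic. The vectors and the helper `orthogonal_part` (which subtracts from an arbitrary vector its component along T) are assumptions chosen for illustration, not something taken from the argument above.

```python
from fractions import Fraction

def minkowski_dot(u, v):
    """Minkowski product U^0 V^0 - U^k V^k, signature (+,-,-,-)."""
    return u[0] * v[0] - sum(a * b for a, b in zip(u[1:], v[1:]))

def orthogonal_part(t, x):
    """Return x - (t.x / t.t) t, which is Minkowski-orthogonal to t."""
    coeff = minkowski_dot(t, x) / minkowski_dot(t, t)
    return [xi - coeff * ti for xi, ti in zip(x, t)]

T = [Fraction(5), Fraction(1), Fraction(2), Fraction(3)]   # T.T = 25 - 14 = 11 > 0
X = [Fraction(1), Fraction(4), Fraction(0), Fraction(-2)]  # arbitrary starting vector
S = orthogonal_part(T, X)

print(minkowski_dot(T, S))  # 0
print(minkowski_dot(S, S))  # -2838/121
```

Here S is non-zero and orthogonal to the timelike T, and S.S comes out negative, so S is spacelike as the theorem requires.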

Theorem (E): Two non-zero null vectors are orthogonal, if and only if they are proportional.
This theorem shows the bizarre and non-intuitive nature of null vectors and “orthogonality” in Minkowski spacetime.
Proof:
The “if” part is trivially obvious: Assume that the null vector [itex]\mathbf{P}[/itex] is proportional to the null vector [itex]\mathbf{N}[/itex]. Then, [itex]\mathbf{P} = \lambda \mathbf{N}[/itex] for some [itex]\lambda \in \mathbb{R}[/itex]. Thus [itex]\mathbf{P} \cdot \mathbf{N} = \lambda \left(\mathbf{N} \cdot \mathbf{N} \right) = 0 .[/itex]
The “only if” part:
[tex]\mathbf{P} \cdot \mathbf{P} = 0 \ \Rightarrow \left(P^{0}\right)^{2} = \left(P^{i}\right)^{2},[/tex]
[tex]\mathbf{N} \cdot \mathbf{N} = 0 \ \Rightarrow \left(N^{0}\right)^{2} = \left(N^{j}\right)^{2},[/tex]
and
[tex]\mathbf{P} \cdot \mathbf{N} = 0 \ \Rightarrow \ P^{0}N^{0} = P^{k} N^{k}.[/tex]
From these relations, we obtain
[tex]\left(P^{k}N^{k}\right)^{2} = \left(P^{0}N^{0}\right)^{2} = \left(P^{i}\right)^{2}\left(N^{j}\right)^{2} .[/tex]
Thus
[tex]\left| P^{k}N^{k} \right| = \sqrt{\left(P^{i}\right)^{2}\left(N^{j}\right)^{2}} .[/tex]
Since this equation is the equality case of Schwarz inequality, we conclude that [itex]P^{k} = \lambda N^{k}[/itex] for some scalar [itex]\lambda \neq 0[/itex]. Since [itex]N^{0} \neq 0[/itex], we have
[tex]P^{0} = \frac{P^{k}N^{k}}{N^{0}} = \lambda \frac{N^{k}N^{k}}{N^{0}} = \lambda N^{0} .[/tex]
Thus [itex]\mathbf{P} = \left(\lambda N^{0} , \lambda N^{k}\right) = \lambda \mathbf{N}[/itex]. qed.
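To make Theorem (E) concrete, here is a small Python sketch with hand-picked integer null vectors (the particular vectors are assumptions for illustration): a proportional pair has vanishing Minkowski product, while a non-proportional pair does not.

```python
def minkowski_dot(u, v):
    """Minkowski product U^0 V^0 - U^k V^k, signature (+,-,-,-)."""
    return u[0] * v[0] - sum(a * b for a, b in zip(u[1:], v[1:]))

N = (1, 1, 0, 0)   # null: 1 - 1 = 0
P = (3, 3, 0, 0)   # P = 3 N, also null
Q = (5, 3, 4, 0)   # null: 25 - 9 - 16 = 0, not proportional to N

print(minkowski_dot(N, P))  # 0
print(minkowski_dot(N, Q))  # 2
```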
Okay, almost all other results follow from the above theorems. So, good luck to you all.

andrewkirk · Jan 1, 2016

What notation are you using? The C-S inequality has two sums over indices on the right-hand side and one sum on the left-hand side. But the statement above has no explicit sums, and no implicit sums under any notation system I have yet learned. If one relaxes the rule in the Einstein summation convention that identical indices are summed over only when one is up and one down, then there will be a sum on the left-hand side, but I have not yet encountered a notation convention or summation rule that will imply the necessary sums on the right-hand side. Could you please explain what convention you are using and how it implies the three sums? Thank you.

strangerep · Jan 1, 2016

Heh, physicists often play loose with the Einstein summation convention in that way. I.e., repeated indices are summed, even if both are up or both are down. Afaict, Sam's formulas are correct under this convention. :oldsmile:

BTW, it's notable that Sam's post got 8 "likes" very quickly. I wonder if (or how many) other posts in the technical forums have done so well.

samalkhaiat · Jan 2, 2016

andrewkirk said:

What notation are you using? The C-S inequality has two sums over indices on the right-hand side and one sum on the left-hand side. But the statement above has no explicit sums, and no implicit sums under any notation system I have yet learned. If one relaxes the rule in the Einstein summation convention that identical indices are summed over only when one is up and one down, then there will be a sum on the left-hand side, but I have not yet encountered a notation convention or summation rule that will imply the necessary sums on the right-hand side. Could you please explain what convention you are using and how it implies the three sums? Thank you.

The vectors in Schwarz inequality are Euclidean 3-vectors. So, there is no difference between upper and lower indices. So, in Euclidean space, the Einstein summation convention is simply “repeated indices are summed over”:
[tex]U^{k}V^{k} \equiv \sum_{k =1}^{3} U^{k}V^{k} ,[/tex]
[tex](U^{i})^{2} = U^{i}U^{i} \equiv \sum_{i=1}^{3} U^{i}U^{i} ,[/tex]
and
[tex](V^{j})^{2} = V^{j}V^{j} \equiv \sum_{j =1}^{3} V^{j}V^{j} .[/tex]
For 4-vectors [itex]\mathbf{V} = (V^{0}, V^{j})[/itex], some care is needed:
[tex] \begin{align*} \mathbf{U} \cdot \mathbf{V} &= \eta_{\mu \nu} U^{\mu}V^{\nu} \\ &= U^{0}V^{0} - \sum_{j = 1}^{3} U^{j}V^{j} . \end{align*}[/tex]
So, I used the above-mentioned summation convention, i.e., dropped the summation sign, and wrote
[tex] \mathbf{U} \cdot \mathbf{V} = U^{0}V^{0} - U^{i}V^{i} ,[/tex]
which is correct because if you use [itex]U^{0} = U_{0} , \ U^{j} = - U_{j}[/itex] in the above, you get
[tex] \begin{align*} \mathbf{U} \cdot \mathbf{V} &= U_{0}V^{0} + U_{i}V^{i} \\ &= U_{\mu}V^{\mu} \\ &= \eta_{\mu\nu}U^{\mu}V^{\nu} . \end{align*}[/tex]
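The three ways of writing the product can be compared in a short Python sketch. The function names and the sample components are illustrative assumptions; the metric is the mostly-minus one used in this thread.

```python
# Minkowski metric eta_{mu nu} with signature (+,-,-,-)
ETA = [[1, 0, 0, 0],
       [0, -1, 0, 0],
       [0, 0, -1, 0],
       [0, 0, 0, -1]]

def dot_via_metric(u, v):
    """eta_{mu nu} U^mu V^nu, with both sums written out."""
    return sum(ETA[m][n] * u[m] * v[n] for m in range(4) for n in range(4))

def dot_via_split(u, v):
    """U^0 V^0 - U^i V^i, with the spatial sum over i = 1, 2, 3."""
    return u[0] * v[0] - sum(u[i] * v[i] for i in range(1, 4))

def lower(u):
    """U_mu = eta_{mu nu} U^nu, so U_0 = U^0 and U_j = -U^j."""
    return [sum(ETA[m][n] * u[n] for n in range(4)) for m in range(4)]

def dot_via_lowered(u, v):
    """U_mu V^mu, using the lowered components of U."""
    u_low = lower(u)
    return sum(u_low[m] * v[m] for m in range(4))

U = [2, 1, -1, 3]
V = [4, 0, 2, 1]

print(dot_via_metric(U, V), dot_via_split(U, V), dot_via_lowered(U, V))  # 7 7 7
```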
I hope it is clear enough for you now. Let me know if it is not.

samalkhaiat · Jan 2, 2016

strangerep said:

...

BTW, it's notable that Sam's post got 8 "likes" very quickly. I wonder if (or how many) other posts in the technical forums have done so well.

Does that mean "wow"?

strangerep · Jan 2, 2016

samalkhaiat said:

The vectors in Schwarz inequality are Euclidean 3-vectors. So, there is no difference between upper and lower indices.

But you're using the "west coast" (hep) metric convention (+,-,-,-), hence ##U^j = -U_j##, as you noted later in your post #4. (I didn't mention this before because it doesn't affect your original post -- for which I see the "like" count is now up to 10.)

This is one of the reasons why, some years ago, I became a convert to the "east coast" (relativist) convention (-,+,+,+). :oldwink:

For other readers: these conventions are somewhat tangential to the main topic, but you can read more about them in Peter Woit's blog article. :oldbiggrin:

dextercioby · Jan 3, 2016

I think it is fair to claim that the name of this thread could have been chosen a little differently:

https://en.wikipedia.org/wiki/Talk:Cauchy–Schwarz_inequality#common_name

Where I come from we call it the CBS inequality.

samalkhaiat · Jan 3, 2016

strangerep said:

But you're using the "west coast" (hep) metric convention (+,-,-,-), hence ##U^j = -U_j##, as you noted later in your post #4. (I didn't mention this before because it doesn't affect your original post -- for which I see the "like" count is now up to 10.)

This is one of the reasons why, some years ago, I became a convert to the "east coast" (relativist) convention (-,+,+,+).

Well, the inequality I wrote deals with Euclidean vectors, so it is meaningless to bother about the Minkowski metric signature. However, one needs to be careful when one applies it to the spatial components of 4-vectors, as I mentioned.
As for the mostly-plus or mostly-minus signatures, personally I don’t mind working with either of them. I believe people prefer one over the other because they are lazy. Put it this way, had Weinberg written his GR text in the mostly-minus metric, you would see all relativists using it now. And the same (I believe) applies to the Bjorken & Drell texts: had they used the mostly-plus signature, we would see particle physicists using it now.

pyroartist · Jan 4, 2016

May the Schwarz be with you.

(Sorry, couldn't resist.)

George Jones · Jan 4, 2016

samalkhaiat said:

Thus, we have proved the following theorem
Theorem (A): No two timelike vectors can be orthogonal (in the Minkowski sense)

How do you prove an axiom/definition?

samalkhaiat · Jan 4, 2016

George Jones said:

How do you prove an axiom/definition?

Well, if you call that an axiom, then I obviously did. My only axiom was the Schwarz inequality and the only definition used in the proof is that of timelike vectors.

George Jones · Jan 5, 2016

Minkowski spacetime:

Minkowski spacetime [itex]\left( V,g\right)[/itex] is a 4-dimensional vector space [itex]V[/itex] together with a symmetric, non-degenerate, bilinear mapping [itex]g:V\times V\rightarrow\mathbb{R}[/itex]. A vector in [itex]V[/itex] is called a 4-vector, and a 4-vector [itex]v[/itex] is called timelike if [itex]g\left(v,v\right) >0[/itex], lightlike if [itex]g\left(v,v\right) =0[/itex], and spacelike if [itex]g\left(v,v\right) <0[/itex]. [itex]\left( V,g\right)[/itex] is such that: 1) timelike vectors exist; 2) [itex]v[/itex] is spacelike whenever [itex]u[/itex] is timelike and [itex]g\left( u,v\right)=0[/itex].

samalkhaiat · Jan 5, 2016

George Jones said:

and a 4-vector [itex]v[/itex] is called timelike if [itex]g\left(v,v\right) >0[/itex], lightlike if [itex]g\left(v,v\right) =0[/itex], and spacelike if [itex]g\left(v,v\right) <0[/itex].

Aren’t these the definitions (i)-(iii) in #1?
The “existence” of these three types of vectors follows from the fact that the quadratic form [itex]Q: M^{4} \to \mathbb{R}[/itex], associated with the inner product [itex]g[/itex], [itex]\left( Q(v) = g(v,v) = v^{2} \right)[/itex], on the Minkowski vector space [itex]M^{4}[/itex], is indefinite.

.. 2) [itex]v[/itex] is spacelike whenever [itex]u[/itex] is timelike and [itex]g\left( u,v\right)=0[/itex].

This statement is provable. It is the content of Theorem D.

*****
My purpose in this thread was to show people the usefulness of the Schwarz inequality (on the spatial subspace [itex]V^{3} \equiv \big \{ \mathbf{v} \in M^{4}, \ v^{0} = 0 \big \}[/itex]) for proving statements about Lorentz vectors. Clearly, people liked what I did.
This is an [I-type] thread. So, I had no intention of laying down here the axiomatic structure of the Minkowski vector space [itex]M^{4}[/itex] or that of the Minkowski spacetime (as a 4-dimensional differentiable manifold). And, if I wanted to, I could have done a pretty good job of that as well.
