Schwarz Inequality is your friend

In summary: the Schwarz inequality, applied to the spatial components of 4-vectors, gives short proofs of several general theorems about timelike, spacelike, and null vectors in Minkowski spacetime.
  • #1
samalkhaiat
I would like to show you how to use the Schwarz inequality to prove some important general theorems and solve problems about vectors in Minkowski spacetime.
Okay, the Schwarz inequality states that
[tex]\left| U^{k}V^{k}\right| \leq \sqrt{(U^{i})^{2}(V^{j})^{2}}. \ \ i,j,k =1,2,3 \ \ \ (1)[/tex]
And, the equality holds if and only if [itex]U^{i} \propto V^{i}[/itex].
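As a quick numerical sanity check of (1), here is a minimal Python sketch. The helper names `dot3` and `schwarz_gap` are made up for this illustration, and a random test is of course no substitute for the proof of the inequality.

```python
import math
import random

def dot3(u, v):
    """Euclidean dot product U^k V^k of two 3-vectors (sum over k = 1, 2, 3)."""
    return sum(a * b for a, b in zip(u, v))

def schwarz_gap(u, v):
    """Right-hand side of (1) minus its left-hand side; should never be negative."""
    return math.sqrt(dot3(u, u) * dot3(v, v)) - abs(dot3(u, v))

random.seed(0)  # arbitrary seed, only for reproducibility
for _ in range(1000):
    u = [random.uniform(-1, 1) for _ in range(3)]
    v = [random.uniform(-1, 1) for _ in range(3)]
    assert schwarz_gap(u, v) >= -1e-12  # small tolerance for rounding

# Equality case: for proportional vectors the gap closes.
u = [1.0, 2.0, 2.0]
v = [3.0 * a for a in u]
print(schwarz_gap(u, v))  # 0.0
```

For orthogonal unit vectors such as (1, 0, 0) and (0, 1, 0) the gap takes its largest possible value, 1.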
Let us see how we apply this to 4-vectors in Minkowski spacetime [itex]M^{4}(+1, -3)[/itex]. With this signature:
i) a vector [itex]\mathbf{U} \in M^{4}[/itex] for which
[tex]\mathbf{U} \cdot \mathbf{U} \equiv \eta_{\mu\nu}U^{\mu}U^{\nu} > 0 ,[/tex]
is called a timelike vector;
ii) a vector [itex]\mathbf{U} \in M^{4}[/itex] for which
[tex]\mathbf{U} \cdot \mathbf{U} \equiv \eta_{\mu\nu}U^{\mu}U^{\nu} < 0 ,[/tex]
is called a spacelike vector; and
iii) a vector [itex]\mathbf{U} \in M^{4}[/itex] that satisfies
[tex]\mathbf{U} \cdot \mathbf{U} \equiv \eta_{\mu\nu}U^{\mu}U^{\nu} = 0 ,[/tex]
is called a null (or lightlike) vector.
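Definitions (i)-(iii) can be sketched in a few lines of Python. The function names `minkowski_dot` and `classify` are assumptions made for this illustration. The exact comparison with zero is only reliable for integer (or exact rational) components, and note that by (iii) the zero vector counts as null.

```python
def minkowski_dot(u, v):
    """U.V = U^0 V^0 - U^k V^k for the signature (+, -, -, -)."""
    return u[0] * v[0] - sum(a * b for a, b in zip(u[1:], v[1:]))

def classify(u):
    """Return 'timelike', 'spacelike' or 'null' according to the sign of U.U."""
    s = minkowski_dot(u, u)
    if s > 0:
        return "timelike"
    if s < 0:
        return "spacelike"
    return "null"

print(classify((2, 1, 0, 0)))  # timelike:  4 - 1 > 0
print(classify((1, 2, 0, 0)))  # spacelike: 1 - 4 < 0
print(classify((1, 1, 0, 0)))  # null:      1 - 1 = 0
```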
Now, suppose that [itex]\mathbf{U}[/itex] and [itex]\mathbf{V}[/itex] are two timelike vectors. Then, from the definition (i), we have
[tex]
\begin{align*}
\left(U^{i}\right)^{2} &< \left(U^{0}\right)^{2} \\
\left(V^{j}\right)^{2} &< \left(V^{0}\right)^{2} .
\end{align*}
[/tex]
These inequalities can be combined to give
[tex]\sqrt{\left(U^{i}\right)^{2}\left(V^{j}\right)^{2}} < \left| U^{0} V^{0} \right| . \ \ \ \ \ \ \ (2) [/tex]
Now, from the Schwarz inequality (1), we conclude that
[tex]
\left| U^{k}V^{k} \right| < \left| U^{0}V^{0}\right| , \ \ \ \ \ \ \ \ \ \ \ (3)
[/tex]
holds for any two timelike vectors. Thus, we have proved the following theorem
Theorem (A): No two timelike vectors can be orthogonal (in the Minkowski sense).
For,
[tex]\mathbf{U}\cdot \mathbf{V} = 0 \ \ \Rightarrow \ \ \left| U^{k}V^{k}\right| = \left| U^{0}V^{0}\right| ,[/tex] which contradicts (3). qed.
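For readers who like to see numbers, inequality (3) and Theorem (A) can be spot-checked on random timelike vectors. This is only a sketch: `random_timelike` and `spatial_dot` are hypothetical helpers, and the construction (spatial part first, then a time component of larger magnitude) is just one convenient way to generate timelike vectors of either time orientation.

```python
import math
import random

def random_timelike(rng):
    """Random 4-vector [U^0, U^1, U^2, U^3] with (U^0)^2 > (U^i)^2."""
    spatial = [rng.uniform(-1, 1) for _ in range(3)]
    norm = math.sqrt(sum(x * x for x in spatial))
    sign = rng.choice([-1.0, 1.0])               # either time orientation
    u0 = sign * (norm + rng.uniform(0.1, 1.0))   # |U^0| exceeds the spatial norm
    return [u0] + spatial

def spatial_dot(u, v):
    """U^k V^k, summed over the spatial components k = 1, 2, 3."""
    return sum(a * b for a, b in zip(u[1:], v[1:]))

rng = random.Random(1)  # arbitrary seed, only for reproducibility
for _ in range(1000):
    u, v = random_timelike(rng), random_timelike(rng)
    assert abs(spatial_dot(u, v)) < abs(u[0] * v[0])  # inequality (3)
    assert u[0] * v[0] - spatial_dot(u, v) != 0       # hence U.V cannot vanish
print("inequality (3) held in all samples")
```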

Theorem (B): If [itex]\mathbf{U},\mathbf{V}[/itex] are two timelike vectors such that [itex]U^{0} > 0, \ \ V^{0} > 0[/itex], then [itex] \mathbf{U}\cdot \mathbf{V} > 0[/itex].
Proof:
[tex]U^{k}V^{k} \leq \left|U^{k}V^{k}\right| \leq \sqrt{(U^{i})^{2}(V^{j})^{2}} .[/tex]
Using (2) or (3), and the fact that [itex]U^{0}>0[/itex] and [itex]V^{0}>0[/itex], we obtain
[tex]U^{k}V^{k} < \left|U^{0}V^{0}\right| = U^{0}V^{0} .[/tex]
Thus
[tex]U^{0}V^{0} - U^{k}V^{k} = \mathbf{U}\cdot \mathbf{V} > 0 .[/tex] qed
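A small worked example of Theorem (B), written as a Python sketch with made-up vectors (the name `minkowski_dot` is an assumption of this illustration). It also shows why the condition on the time components matters: flipping the time orientation of one vector, i.e. applying the theorem to U and -V, flips the sign of the product.

```python
def minkowski_dot(u, v):
    """U.V = U^0 V^0 - U^k V^k for the signature (+, -, -, -)."""
    return u[0] * v[0] - sum(a * b for a, b in zip(u[1:], v[1:]))

U = (2, 1, 0, 0)    # U.U = 4 - 1 = 3 > 0, U^0 > 0
V = (3, -2, 1, 0)   # V.V = 9 - 5 = 4 > 0, V^0 > 0
minus_V = tuple(-x for x in V)  # still timelike, but with negative time component

print(minkowski_dot(U, V))        # 8
print(minkowski_dot(U, minus_V))  # -8
```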

Theorem (C): A timelike vector cannot be orthogonal to a non-zero null vector.
Proof:
Let [itex]\mathbf{T}[/itex] be a timelike vector and [itex]\mathbf{N}[/itex] a non-zero null vector. This means that
[tex]\begin{align*}
\left(T^{i}\right)^{2} &< \left(T^{0}\right)^{2} \\
\left(N^{j}\right)^{2} &= \left(N^{0}\right)^{2} .
\end{align*}
[/tex]
Since [itex]N^{0} \neq 0[/itex] for a non-zero null vector, multiplying these relations gives
[tex]
\left(T^{i}\right)^{2} \left(N^{j}\right)^{2} < \left( T^{0}N^{0} \right)^{2} .
[/tex]
Using the Schwarz inequality (1), we get
[tex]\left( T^{k}N^{k} \right)^{2} < \left( T^{0}N^{0} \right)^{2} . \ \ \ \ \ \ (4)[/tex]
Now, suppose [itex]\mathbf{T}\cdot \mathbf{N} = 0[/itex]. This implies [itex]\left( T^{k}N^{k} \right)^{2} = \left( T^{0}N^{0} \right)^{2}[/itex] which contradicts (4). qed.

Theorem (D): If [itex]\mathbf{T}[/itex] is a timelike vector and [itex]\mathbf{S}[/itex] is a non-zero vector with [itex]\mathbf{T} \cdot \mathbf{S} = 0[/itex], then [itex]\mathbf{S}[/itex] is a spacelike vector.
Proof:
From theorem (A), [itex]\mathbf{S}[/itex] cannot be a timelike vector. And from theorem (C), [itex]\mathbf{S}[/itex] cannot be a non-zero null vector. Thus, [itex]\mathbf{S}[/itex] must be a spacelike vector. qed.
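To see Theorem (D) at work, one can take any vector W, subtract its component along a timelike T, and check that what is left is spacelike. The sketch below does this in exact rational arithmetic. The helper `orthogonal_part` and the vectors T and W are illustrative choices, and the projection formula is the usual one for a non-degenerate bilinear form.

```python
from fractions import Fraction

def minkowski_dot(u, v):
    """U.V = U^0 V^0 - U^k V^k for the signature (+, -, -, -)."""
    return u[0] * v[0] - sum(a * b for a, b in zip(u[1:], v[1:]))

def orthogonal_part(t, w):
    """S = W - (T.W / T.T) T, which satisfies T.S = 0 when T.T is non-zero."""
    coeff = Fraction(minkowski_dot(t, w), minkowski_dot(t, t))
    return tuple(wi - coeff * ti for wi, ti in zip(w, t))

T = (3, 1, 1, 1)    # timelike: T.T = 9 - 3 = 6
W = (1, 2, 0, -1)   # arbitrary vector, not proportional to T
S = orthogonal_part(T, W)

assert minkowski_dot(T, S) == 0  # S is orthogonal to T
assert minkowski_dot(S, S) < 0   # and therefore spacelike
print(minkowski_dot(S, S))       # -14/3
```

If W is chosen proportional to T, the construction returns the zero vector, which is orthogonal to T without being spacelike.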

Theorem (E): Two non-zero null vectors are orthogonal, if and only if they are proportional.
This theorem shows the bizarre and non-intuitive nature of null vectors and “orthogonality” in Minkowski spacetime.
Proof:
The “if” part is trivially obvious: Assume that the null vector [itex]\mathbf{P}[/itex] is proportional to the null vector [itex]\mathbf{N}[/itex]. Then, [itex]\mathbf{P} = \lambda \mathbf{N}[/itex] for some [itex]\lambda \in \mathbb{R}[/itex]. Thus [itex]\mathbf{P} \cdot \mathbf{N} = \lambda \left(\mathbf{N} \cdot \mathbf{N} \right) = 0 .[/itex]
The “only if” part:
[tex]\mathbf{P} \cdot \mathbf{P} = 0 \ \Rightarrow \left(P^{0}\right)^{2} = \left(P^{i}\right)^{2},[/tex]
[tex]\mathbf{N} \cdot \mathbf{N} = 0 \ \Rightarrow \left(N^{0}\right)^{2} = \left(N^{j}\right)^{2},[/tex]
and
[tex]\mathbf{P} \cdot \mathbf{N} = 0 \ \Rightarrow \ P^{0}N^{0} = P^{k} N^{k}.[/tex]
From these relations, we obtain
[tex]\left(P^{k}N^{k}\right)^{2} = \left(P^{0}N^{0}\right)^{2} = \left(P^{i}\right)^{2}\left(N^{j}\right)^{2} .[/tex]
Thus
[tex]\left| P^{k}N^{k} \right| = \sqrt{\left(P^{i}\right)^{2}\left(N^{j}\right)^{2}} .[/tex]
Since this equation is the equality case of Schwarz inequality, we conclude that [itex]P^{k} = \lambda N^{k}[/itex] for some scalar [itex]\lambda \neq 0[/itex]. Since [itex]N^{0} \neq 0[/itex], we have
[tex]P^{0} = \frac{P^{k}N^{k}}{N^{0}} = \lambda \frac{N^{k}N^{k}}{N^{0}} = \lambda N^{0} .[/tex]
Thus [itex]\mathbf{P} = \left(\lambda N^{0} , \lambda N^{k}\right) = \lambda \mathbf{N}[/itex]. qed.
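Theorem (E) is easy to test on a handful of null vectors with integer components. In this Python sketch the list of vectors and the helper `proportional` (which checks linear dependence through the vanishing of all 2x2 minors) are assumptions made for the illustration.

```python
from itertools import combinations

def minkowski_dot(u, v):
    """U.V = U^0 V^0 - U^k V^k for the signature (+, -, -, -)."""
    return u[0] * v[0] - sum(a * b for a, b in zip(u[1:], v[1:]))

def proportional(p, n):
    """True if p and n are linearly dependent, i.e. all 2x2 minors vanish."""
    return all(p[i] * n[j] == p[j] * n[i]
               for i in range(4) for j in range(i + 1, 4))

null_vectors = [
    (1, 1, 0, 0), (2, 2, 0, 0), (1, 0, 1, 0), (1, 0, 0, -1),
    (3, 1, 2, 2), (-3, -1, -2, -2), (5, 3, 4, 0),
]
assert all(minkowski_dot(n, n) == 0 for n in null_vectors)

# Orthogonal exactly when proportional, for every pair in the list.
for p, n in combinations(null_vectors, 2):
    assert (minkowski_dot(p, n) == 0) == proportional(p, n)

print(minkowski_dot((1, 1, 0, 0), (2, 2, 0, 0)))  # 0, proportional pair
print(minkowski_dot((1, 1, 0, 0), (1, 0, 1, 0)))  # 1, not proportional
```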
Okay, almost all other results follow from the above theorems. So, good luck to you all.
 
  • #2
What notation are you using? The C-S inequality has two sums over indices on the right-hand side and one sum on the left-hand side. But the statement above has no explicit sums, and no implicit sums under any notation system I have yet learned. If one relaxes the rule in the Einstein summation convention that identical indices are summed over only when one is up and one down, then there will be a sum on the left-hand side, but I have not yet encountered a notation convention or summation rule that will imply the necessary sums on the right-hand side. Could you please explain what convention you are using and how it implies the three sums? Thank you.
 
  • #3
Heh, physicists often play loose with the Einstein summation convention in that way. I.e., repeated indices are summed, even if both are up or both are down. Afaict, Sam's formulas are correct under this convention. :oldsmile:

BTW, it's notable that Sam's post got 8 "likes" very quickly. I wonder if (or how many) other posts in the technical forums have done so well. :biggrin:
 
  • #4
andrewkirk said:
What notation are you using? The C-S inequality has two sums over indices on the right-hand side and one sum on the left-hand side. But the statement above has no explicit sums, and no implicit sums under any notation system I have yet learned. If one relaxes the rule in the Einstein summation convention that identical indices are summed over only when one is up and one down, then there will be a sum on the left-hand side, but I have not yet encountered a notation convention or summation rule that will imply the necessary sums on the right-hand side. Could you please explain what convention you are using and how it implies the three sums? Thank you.

The vectors in Schwarz inequality are Euclidean 3-vectors. So, there is no difference between upper and lower indices. So, in Euclidean space, the Einstein summation convention is simply “repeated indices are summed over”:
[tex]U^{k}V^{k} \equiv \sum_{k =1}^{3} U^{k}V^{k} ,[/tex]
[tex](U^{i})^{2} = U^{i}U^{i} \equiv \sum_{i=1}^{3} U^{i}U^{i} ,[/tex]
and
[tex](V^{j})^{2} = V^{j}V^{j} \equiv \sum_{j =1}^{3} V^{j}V^{j} .[/tex]
For 4-vectors [itex]\mathbf{V} = (V^{0}, V^{j})[/itex], some care is needed:
[tex]
\begin{align*}
\mathbf{U} \cdot \mathbf{V} &= \eta_{\mu \nu} U^{\mu}V^{\nu} \\
&= U^{0}V^{0} - \sum_{j = 1}^{3} U^{j}V^{j} .
\end{align*}
[/tex]
So, I used the above-mentioned summation convention, i.e., dropped the summation sign, and wrote
[tex]
\mathbf{U} \cdot \mathbf{V} = U^{0}V^{0} - U^{i}V^{i} ,[/tex]
which is correct because if you use [itex]U^{0} = U_{0} , \ U^{j} = - U_{j}[/itex] in the above, you get
[tex]
\begin{align*}
\mathbf{U} \cdot \mathbf{V} &= U_{0}V^{0} + U_{i}V^{i} \\
&= U_{\mu}V^{\mu} \\
&= \eta_{\mu\nu}U^{\mu}V^{\nu} .
\end{align*}
[/tex]
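The three ways of writing the product can also be compared numerically. This is a minimal Python sketch with illustrative names (`ETA`, `dot_full`, `dot_short`, `lower`, `dot_lowered`) and arbitrary sample vectors. Python indices 0..3 stand for the spacetime indices, so the spatial sums run over 1..3.

```python
# Minkowski metric eta_{mu nu} with signature (+, -, -, -)
ETA = [
    [1, 0, 0, 0],
    [0, -1, 0, 0],
    [0, 0, -1, 0],
    [0, 0, 0, -1],
]

def dot_full(u, v):
    """eta_{mu nu} U^mu V^nu with both sums written out explicitly."""
    return sum(ETA[m][n] * u[m] * v[n] for m in range(4) for n in range(4))

def dot_short(u, v):
    """U^0 V^0 - U^i V^i, with the repeated spatial index summed over 1..3."""
    return u[0] * v[0] - sum(u[k] * v[k] for k in range(1, 4))

def lower(u):
    """U_mu = eta_{mu nu} U^nu, so U_0 = U^0 and U_j = -U^j."""
    return [sum(ETA[m][n] * u[n] for n in range(4)) for m in range(4)]

def dot_lowered(u, v):
    """U_mu V^mu, one index down and one up."""
    return sum(a * b for a, b in zip(lower(u), v))

U = [2, 1, -3, 4]
V = [5, 0, 2, -1]
print(dot_full(U, V), dot_short(U, V), dot_lowered(U, V))  # 20 20 20
```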
I hope it is clear enough for you now. Let me know if it is not.
 
  • #5
strangerep said:
...

BTW, it's notable that Sam's post got 8 "likes" very quickly. I wonder if (or how many) other posts in the technical forums have done so well. :biggrin:
Does that mean "wow"? :wink:
 
  • #6
samalkhaiat said:
The vectors in Schwarz inequality are Euclidean 3-vectors. So, there is no difference between upper and lower indices.
But you're using the "west coast" (hep) metric convention (+,-,-,-), hence ##U^j = -U_j##, as you noted later in your post #4. (I didn't mention this before because it doesn't affect your original post -- for which I see the "like" count is now up to 10.)

This is one of the reasons why, some years ago, I became a convert to the "east coast" (relativist) convention (-,+,+,+). :oldwink:
For other readers: these conventions are somewhat tangential to the main topic, but you can read more about them in Peter Woit's blog article. :oldbiggrin:
 
  • #8
strangerep said:
But you're using the "west coast" (hep) metric convention (+,-,-,-), hence ##U^j = -U_j##, as you noted later in your post #4. (I didn't mention this before because it doesn't affect your original post -- for which I see the "like" count is now up to 10.)

This is one of the reasons why, some years ago, I became a convert to the "east coast" (relativist) convention (-,+,+,+). :oldwink:


Well, the inequality I wrote deals with Euclidean vectors, so it is meaningless to bother about the Minkowski metric signature. However, one needs to be careful when one applies it to the spatial components of 4-vectors, as I mentioned.
As for the mostly-plus or mostly-minus signatures, personally I don’t mind working with either of them. I believe people prefer one over the other because they are lazy. Put it this way, had Weinberg written his GR text in the mostly-minus metric, you would see all relativists using it now. And the same (I believe) applies to the Bjorken & Drell texts: had they used the mostly-plus signature, we would see particle physicists using it now.
 
  • #9
May the Schwarz be with you.

(Sorry, couldn't resist.)
 
  • #10
samalkhaiat said:
Thus, we have proved the following theorem
Theorem (A): No two timelike vectors can be orthogonal (in the Minkowski sense)

How do you prove an axiom/definition? :wink:
 
  • #11
George Jones said:
How do you prove an axiom/definition? :wink:
Well, if you call that an axiom, then I obviously did. My only axiom was the Schwarz inequality and the only definition used in the proof is that of timelike vectors.
 
  • #12
Minkowski spacetime:

Minkowski spacetime [itex]\left( V,g\right)[/itex] is a 4-dimensional vector space [itex]V[/itex] together with a symmetric, non-degenerate, bilinear mapping [itex]g:V\times V\rightarrow\mathbb{R}[/itex]. A vector in [itex]V[/itex] is called a 4-vector, and a 4-vector [itex]v[/itex] is called timelike if [itex]g\left(v,v\right) >0[/itex], lightlike if [itex]g\left(v,v\right) =0[/itex], and spacelike if [itex]g\left(v,v\right) <0[/itex]. [itex]\left( V,g\right)[/itex] is such that: 1) timelike vectors exist; 2) [itex]v[/itex] is spacelike whenever [itex]u[/itex] is timelike and [itex]g\left( u,v\right)=0[/itex].
 
  • #13
George Jones said:
and a 4-vector [itex]v[/itex] is called timelike if [itex]g\left(v,v\right) >0[/itex], lightlike if [itex]g\left(v,v\right) =0[/itex], and spacelike if [itex]g\left(v,v\right) <0[/itex].
Aren’t these the definitions (i)-(iii) in #1?
The “existence” of these three types of vectors follows from the fact that the quadratic form [itex]Q: M^{4} \to \mathbb{R}[/itex], associated with the inner product [itex]g[/itex], [itex]\left( Q(v) = g(v,v) = v^{2} \right)[/itex], on the Minkowski vector space [itex]M^{4}[/itex], is indefinite.
.. 2) [itex]v[/itex] is spacelike whenever [itex]u[/itex] is timelike and [itex]g\left( u,v\right)=0[/itex].
This statement is provable. It is the content of Theorem D.

*****
My purpose in this thread was to show people the usefulness of the Schwarz inequality (on the spatial subspace [itex]V^{3} \equiv \big \{ \mathbf{v} \in M^{4}, \ v^{0} = 0 \big \}[/itex]) for proving statements about Lorentz vectors. Clearly, people liked what I did.
This is an [I-type] thread. So, I had no intention of laying down, in here, the axiomatic structure of the Minkowski vector space [itex]M^{4}[/itex] or that of Minkowski spacetime (as a 4-dimensional differentiable manifold). And, if I had wanted to, I could have done a pretty good job of that as well. :wink:
 

FAQ: Schwarz Inequality is your friend

1. What is Schwarz Inequality?

Schwarz Inequality, also known as Cauchy-Schwarz Inequality, is a mathematical inequality that relates the inner product of two vectors to their norms. It states that the absolute value of the inner product of two vectors is less than or equal to the product of their norms.

2. How is Schwarz Inequality used in science?

Schwarz Inequality is a fundamental tool in many branches of science, including physics, engineering, and statistics. It is commonly used to prove the existence of solutions to equations, establish bounds on quantities, and analyze data.

3. Why is Schwarz Inequality referred to as "your friend"?

Schwarz Inequality is often referred to as "your friend" because it is a helpful and powerful tool that can simplify and provide insight into complex problems. It is also widely applicable and can be used in a variety of situations, making it a valuable ally for scientists.

4. Can you provide an example of Schwarz Inequality in action?

One example of Schwarz Inequality in action is in quantum mechanics, where it is used to prove the uncertainty principle - that the product of the uncertainties of certain pairs of physical quantities, such as position and momentum, cannot be smaller than a certain limit.

5. Are there any limitations to Schwarz Inequality?

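While Schwarz Inequality is a powerful and useful tool, it does have some limitations. It holds in every inner product space, real or complex, finite- or infinite-dimensional, but it relies on the inner product being positive (semi-)definite. For an indefinite form such as the Minkowski metric it does not hold in its usual form, which is why the thread above applies it only to the Euclidean spatial components of 4-vectors. Additionally, it provides only a bound on the inner product, not its exact value.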
