Four vectors in complex notation

Click For Summary

Discussion Overview

The discussion revolves around the use of four vectors in complex notation versus real components in the context of electrodynamics and special relativity. Participants explore the implications of different conventions for representing four vectors and their scalar products, examining both theoretical and practical aspects.

Discussion Character

  • Debate/contested
  • Technical explanation
  • Conceptual clarification

Main Points Raised

  • Some participants note that the use of real components for four vectors is common in textbooks and exams, while others argue for the elegance of using complex notation with an imaginary unit.
  • One participant mentions that the "four dimensional scalar product" has a minus sign in the real convention, which they find awkward compared to the complex convention that eliminates this issue.
  • Another participant states that the choice between real and complex representations is largely a matter of taste, with no definitive preference established in the community.
  • Some argue that the use of complex numbers is not consistent with the rest of the theory, as they are not typically employed elsewhere in special relativity.
  • A participant references the historical context of the use of the imaginary unit in four vectors, noting that it was abandoned due to confusion and the need to acknowledge the indefinite metric in relativity.
  • One participant introduces a more abstract perspective, discussing the relationship between complex numbers, quaternions, and their implications for understanding space-time and phase space, suggesting that limiting the investigation to these frameworks can lead to complications.

Areas of Agreement / Disagreement

Participants express differing views on the use of complex versus real vectors, with no consensus reached on which convention is superior or more widely accepted. The discussion remains unresolved regarding the implications and preferences for each approach.

Contextual Notes

Some limitations are noted, such as the dependence on definitions and the historical context of conventions used in different texts. The discussion highlights unresolved mathematical steps and the implications of using different metrics.

Nick89
Messages
553
Reaction score
0
Hi,

I recently completed a course on Electrodynamics, with a short introduction to special relativity (and E/B fields in special relativity).

Both in the book we were asked to study, and the final exam, the four vectors we used used real components, with the time coordinate being the zeroth component. For example:
[tex](x^\mu) = (ct, x, y, z)[/tex]
[tex](A^\mu) = (\phi / c, A_x, A_y, A_z)[/tex]
(A = vectorpotential, [itex]\phi[/itex] = scalar potential)
etc...

Also, what the book calls the 'four dimensional scalar product' has a, in my opinion, rather strange minus sign in the zeroth component:
[tex]\sum_{\mu = 0}^3 a_\mu b^\mu = -a^0b^0 + a^1b^1+a^2b^2+a^3b^3[/tex]

In the classes we were given however, we used a different convention (well, I guess it's a convention), where the time coordinate as the fourth component (counting from 1 to 4), carrying a factor of [itex]i[/itex]:
[tex](x^\mu) = (x, y, z, ict)[/tex]
[tex](A^\mu) = (A_x, A_y, A_z, i\phi / c)[/tex]
etc...

Now, the product doesn't need the pesky minus sign, because it comes from the factor [itex]i^2[/itex]...

I could only find the first convention (real vectors) in practice, I couldn't find the complex vector convention anywhere I looked, except in our classes. But, I think it's a much neater definition, because everything generalizes from 3D, without the pesky minus sign...

I was just wondering, is there any particular reason for using the real vectors, instead of the complex ones? If not, is the real vector definition really used more often, or did I just not look hard enough?

I just find it a bit strange that everything seems to be about generalization (putting time and space into a single vector, and putting vector potential and scalar potential into one vector, etc), yet an, in my opinion, beautiful use of complex vectors is not used, so that an awkward definition of the scalar product has to be used..?
 
Physics news on Phys.org
It's just a matter of taste. Some people do special relativity one way, some do it the other. In general relativity, the coefficients occurring in the inner product are varying rather than constant, so it becomes pointless to worry about trying to make them all equal +1.
 
As bcrowell said, it's a matter of taste. I personally find the "i" weird and awkward, mainly because we wouldn't use complex numbers anywhere else in the entire theory. I have never seen that convention used in a book on differential geometry or general relativity. Also note that any bilinear form g on [itex]\mathbb R^4[/itex] can be defined as [itex]g(x,y)=x^T M y[/itex], where M is a 4×4 matrix and x and y are 4×1 matrices. We have g(x,y)=g(y,x) if and only if MT=M. When M=I (the identity matrix), g is the standard inner product, and when [itex]M=\eta[/itex], it's the Minkowski bilinear form. (I don't want to call it an inner product or a scalar product, because it's not positive definite). I think this is a much more attractive way to deal with these things.

In case you don't know already, [itex]\eta[/itex] is defined by

[tex]\eta=\begin{pmatrix}-1 & 0 & 0 & 0\\ 0 & 1 & 0 & 0\\ 0 & 0 & 1 & 0\\ 0 & 0 & 0 & 1\end{pmatrix}[/tex]
 
Use of the i that way was dropped about 30 years ago, because it avoided the fact that there really is a minus sign in the metric, which is called an 'indefinite metric'. I think Panofsky and Phillips used it in their first edition, but went to the indefinite metric for their second edition. The i leads to confusion in relativistic QM, and can't be extended to GR.
The metric used by most physicists now is +1,-1,-1,-1, because this leads to
p.p=m^2 for the length of the momentum 4-vector.
 
Fredrik said:
As bcrowell said, it's a matter of taste. I personally find the "i" weird and awkward, mainly because we wouldn't use complex numbers anywhere else in the entire theory.

I agree with this strongly. If we are going to introduce complex variables we should not limit ourselves to pure complex numbers. What would 3+2i seconds look like? 4+3i meters would be a 3-4-5 triangle, but what significance is that?

Fringe warning, proceed with caution.
I like to consider from time to time that these imaginary values might have meaning in a phase space frequency relationship. I also like to think that the metric looks too similar to quaternion arithmetic to be a coincidence, but we still need the pesky pure i's there (which then generates a signature of (+---), rather than the (++++) of the underlying quaternions ).

Complex numbers, quaternions and split complex numbers are part of a family of algebras related by Cayley-Dickson doubling. One can observe that split doublings naturally generate algebras with Lorentz boosts.

http://en.wikipedia.org/wiki/Cayley–Dickson_construction
http://en.wikipedia.org/wiki/Split-octonion

If one uses Cayley-Dickson doubling three times from the real numbers using split values for each additional step one obtains the three split primary components, three generated proper roots value and an additional split root with a combined signature of (++++----).

If the real value is taken as time and the three original split roots as real space, we get a metric of (+---), but we have ignored the phase space.

The split algebra generated after the third cycle have a signature (+++-). Using the phase-space analogy, the three proper roots correspond to some type of spatial frequency and the split root a temporal frequency. If we are basing a theory on observed peaks and troughs of waves this is all of the algebra we are using.

Thus if we limit our investigations to space-time or phase space-frequency we will make observations consistent with SR. Only when we attempt to combine the two will complications arise.

This algebra obeys the property that a metric is preserved under multiplication but breaks the associative property, requiring some type of bra-ket-like notation to be employed. It also contains zero divisors that are similar to the light cones of SR. (A spatial direction with an equal spatial frequency form a new class of cones, as does a temporal frequency locked to the real time. The latter obviously represents a dynamic system appearing static, the former is not so clear.)

Then I through up my hands and say "whatever."
 

Similar threads

  • · Replies 7 ·
Replies
7
Views
1K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 124 ·
5
Replies
124
Views
10K
Replies
3
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 22 ·
Replies
22
Views
5K
  • · Replies 12 ·
Replies
12
Views
3K