Lorentz invariance and equation of motion for a scalar field

eoghan · Oct 7, 2018

Hi there,

I just saw some lectures where they claim that the Klein Gordon equation is the lowest order equation which is Lorentz invariant for a scalar field.
But I could easily come up with a Lorentz invariant equation that is first order, e.g.
$$
(M^\mu\partial_\mu + m^2)\phi=0
$$
where M is a generic matrix.
Now, something should be wrong with this equation, because, as Dirac showed, if we want a first order equation the field needs to be a spinor.

But I don't clearly understand why this first order equation is not Lorentz invariant. I mean, $$M^\mu\partial_\mu$$ is a scalar, so the equation is invariant, isn't it?
Is it maybe because the matrix M changes form by changing reference system, so that we could find privileged systems (e.g. a reference where the matrix is diagonal)?

hilbert2 · Oct 7, 2018

Let's say there's only one space coordinate, ##x##, in addition to the time coordinate ##t##. Then that first order equation would be

##\left(a\frac{\partial}{\partial t} - b\frac{\partial}{\partial x} + m^2 \right)\phi (x,t) = 0##,

with ##a## and ##b## some constants. This kind of an equation is called advection equation, as it has a first time derivative and first space derivative, but there's also the ##m^2 \phi## term which acts like a source term that depends on how large the ##\phi## already is at some point.

You could first assume that ##m=0## and read a bit about the advection equation to deduce whether it would be of any use as a field equation in physics.

vanhees71 · Oct 8, 2018

For the scalar field to be interpretible as free particles it should also imply the "on-shell condition", i.e.,
$$(\Box+m^2) \Phi(x)=0.$$
The heuristic approach, however leads in the most simple case to Dirac spinors, i.e., spin-1/2 particles+antiparticles rather than spin-0 particles.

For a systematic understanding, it's necessary to study the famous analysis on the unitary representations of the Poincare group and their realization through local field operators. A very good introduction is given in

R. Sexl, H. Urbandtke, Relativity, Groups, Particles, Springer

eoghan · Oct 9, 2018

hilbert2 said:

Let's say there's only one space coordinate, ##x##, in addition to the time coordinate ##t##. Then that first order equation would be

##\left(a\frac{\partial}{\partial t} - b\frac{\partial}{\partial x} + m^2 \right)\phi (x,t) = 0##,

with ##a## and ##b## some constants. This kind of an equation is called advection equation, as it has a first time derivative and first space derivative, but there's also the ##m^2 \phi## term which acts like a source term that depends on how large the ##\phi## already is at some point.

You could first assume that ##m=0## and read a bit about the advection equation to deduce whether it would be of any use as a field equation in physics.

I've read something about this equation, but still I don't get why we cannot use it. It has to do with the fact that, as pointed out by vanhees71, with such an equation the on shell condition is not met?

vanhees71 said:

For the scalar field to be interpretible as free particles it should also imply the "on-shell condition", i.e.,
$$(\Box+m^2) \Phi(x)=0.$$
The heuristic approach, however leads in the most simple case to Dirac spinors, i.e., spin-1/2 particles+antiparticles rather than spin-0 particles.

For a systematic understanding, it's necessary to study the famous analysis on the unitary representations of the Poincare group and their realization through local field operators. A very good introduction is given in

R. Sexl, H. Urbandtke, Relativity, Groups, Particles, Springer

Thanks for the reference, I admit my knowledge of group theory is still in its infancy. I still have to read the book, but as far as I remember the Lorentz group being noncompact has not representations that are unitary, isn't it?

vanhees71 · Oct 10, 2018

The Lorentz group or, more importantly, the entire Poincare group has no unitary finite-dimensional representations but fortunately it has many physically useful "infinite-dimensional" unitary representations. That's why the quantum mechanical Hilbert spaces of relativistic (as well as nonrelativistic) systems has an infinite dimension.

Demystifier · Oct 10, 2018

eoghan said:

But I could easily come up with a Lorentz invariant equation that is first order, e.g.
$$
(M^\mu\partial_\mu + m^2)\phi=0
$$
where M is a generic matrix.
Now, something should be wrong with this equation, because, as Dirac showed, if we want a first order equation the field needs to be a spinor.

But I don't clearly understand why this first order equation is not Lorentz invariant. I mean, $$M^\mu\partial_\mu$$ is a scalar, so the equation is invariant, isn't it?
Is it maybe because the matrix M changes form by changing reference system, so that we could find privileged systems (e.g. a reference where the matrix is diagonal)?

Since you suggest that ##\phi## transforms as a scalar and ##M^\mu## as a vector under Lorentz transformations, you will probably find interesting that Dirac equation can also be interpreted in that way: https://lanl.arxiv.org/abs/1309.7070

hilbert2 · Oct 10, 2018

eoghan said:

I've read something about this equation, but still I don't get why we cannot use it. It has to do with the fact that, as pointed out by vanhees71, with such an equation the on shell condition is not met?

If you have a 1D advection equation for function ##\phi (x,t)##, the time evolution of an initial state ##\phi (x,t_0 )## is just a translation with constant speed ##v##:

##\phi (x,t_0 + \Delta t) = \phi (x + v\Delta t,t_0 )##.

In the 2D or 3D cases, it is a similar translation to the direction of some velocity vector ##\vec{v}##. There's not much room for any interesting physics in that kind of time evolution. The term dependent on ##m^2##, if not zero, will only make the norm of the function ##\phi## grow or decrease exponentially (if it's normalizable in the first place).

samalkhaiat · Oct 13, 2018

eoghan said:

But I could easily come up with a Lorentz invariant equation that is first order, e.g.
$$
(M^\mu\partial_\mu + m^2)\phi=0
$$
where M is a generic matrix.
Now, something should be wrong with this equation

How about, almost everything is wrong with that equation:
1) If, as you say, [itex]M^{\mu}[/itex] is not [itex]\partial^{\mu}[/itex] but a “generic matrix”, then the expression [itex]M^{\mu}\partial_{\mu} + m^{2}[/itex] is meaningless because each term has different physical unit. In the natural units, the dimension of the first term is [itex]\mbox{cm}^{-1}[/itex] while the dimension of [itex]m^{2}[/itex] is [itex]\mbox{cm}^{-2}[/itex].
2) You said that [itex]\phi[/itex] is a scalar field. In 4-dimensional spacetime, a (real) scalar field can be described either by a single function or (equivalently) by a 5-component field treating [itex](\phi , \partial_{\mu}\phi )[/itex] as independent variables. In the first case (i.e., when [itex]\phi[/itex] is a 1-component field), your “matrices” [itex]M^{\mu}[/itex] must be [itex]1 \times 1[/itex] matrices. Thus, you must take [itex]M^{\mu} = \partial^{\mu}[/itex] so that the correct dispersion relation [itex]E^{2} = P^{2} + m^{2}[/itex] holds. In the second (5-component) case, the correct first-order equation (for a real scalar field) looks exactly like Dirac equation [tex]\left( i \Gamma^{\mu} \partial_{\mu} - m \right) \Psi (x) = 0 , \ \ \ \ \ \ \ \ \ \ (1)[/tex] with [tex]\Psi = \left( \varphi , \psi_{0} , \psi_{1}, \psi_{2} , \psi_{3} \right)^{T} ,[/tex] and the [itex]\Gamma[/itex]’s are a set of four [itex]5 \times 5[/itex] matrices satisfying the Duffin-Kemmer algebra [tex]\Gamma^{\mu}\Gamma^{\rho}\Gamma^{\nu} + \Gamma^{\nu}\Gamma^{\rho}\Gamma^{\mu} = \eta^{\mu \rho}\Gamma^{\nu} + \eta^{\nu\rho}\Gamma^{\mu} . \ \ \ \ \ \ (2)[/tex] Notice that (the Duffin-Kemmer equation) Eq(1) has no [itex]m^{2}[/itex] term in it. Within the representation theory, the Duffin-Kemmer equation has beautiful interpretation. However, since your knowledge in group theory is (unfortunately) poor, bellow I will only show you how to obtain the DK equation (1) from the following Klein-Gordon equation of real (1-component) scalar field [tex]\partial^{\mu}\partial_{\mu} \phi (x) + m^{2} \phi (x) = 0 . \ \ \ \ \ \ \ \ \ \ \ \ \ (3)[/tex] So we want to reduce (3) into a system of five (coupled) first-order equations. To do this we define a scalar field by [tex]\varphi (x) = \sqrt{m} \phi (x) ,[/tex] and a 4-vector field by [tex]\psi_{\mu}(x) = \frac{1}{\sqrt{m}} \partial_{\mu}\phi (x) .[/tex] Thus, the KG equation (3) can be replaced by the following equivalent system of first-order equations [tex]\partial^{\mu}\psi_{\mu} (x) + m \varphi (x) = 0 ,[/tex][tex]\partial_{\mu} \varphi (x) - m \psi_{\mu}(x) = 0 .[/tex] These can easily be rewritten as matrix equation
[tex]\begin{pmatrix} - m & - \partial_{0} & \partial_{1} & \partial_{2} & \partial_{3} \\ \partial_{0} & - m & 0 & 0 & 0 \\ \partial_{1} & 0 & - m & 0 & 0 \\ \partial_{2} & 0 & 0 & - m & 0 \\ \partial_{3} & 0 & 0 & 0 & - m \end{pmatrix} \begin{pmatrix} \varphi (x) \\ \psi_{0}(x) \\ \psi_{1}(x) \\ \psi_{2}(x) \\ \psi_{3}(x) \end{pmatrix} = 0 .[/tex] This is exactly the Duffin-Kemmer equation (1) with the [itex]\Gamma[/itex]’s given by [tex]\Gamma^{0} = \begin{pmatrix} 0 & i & 0 & 0 & 0 \\ - i & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \end{pmatrix} , \ \ \ \Gamma^{1} = \begin{pmatrix} 0 & 0 & - i & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ - i & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \end{pmatrix} ,[/tex] [tex]\Gamma^{2} = \begin{pmatrix} 0 & 0 & 0 & - i & 0 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ - i & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \end{pmatrix} , \ \ \ \Gamma^{3} = \begin{pmatrix} 0 & 0 & 0 & 0 & - i \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 \\ - i & 0 & 0 & 0 & 0 \end{pmatrix} ,[/tex] and the Duffin-Kemmer field [itex]\Psi (x)[/itex] is given by [tex]\Psi (x) = \begin{pmatrix} \varphi (x) \\ \psi_{0}(x) \\ \psi_{1}(x) \\ \psi_{2}(x) \\ \psi_{3}(x) \end{pmatrix} .[/tex]
Similarly, you can equivalently write the Proca equation [tex]\partial_{\mu}F^{\mu\nu} + m^{2}A^{\nu} = 0 ,[/tex] which describes a free massive spin-1 field, as a Duffin-Kemmer equation, with four [itex]10 \times 10[/itex] [itex]\Gamma[/itex]'s. I leave this as exercise for you.

hilbert2 · Oct 14, 2018

samalkhaiat said:

How about, almost everything is wrong with that equation:
1) If, as you say, [itex]M^{\mu}[/itex] is not [itex]\partial^{\mu}[/itex] but a “generic matrix”, then the expression [itex]M^{\mu}\partial_{\mu} + m^{2}[/itex] is meaningless because each term has different physical unit. In the natural units, the dimension of the first term is [itex]\mbox{cm}^{-1}[/itex] while the dimension of [itex]m^{2}[/itex] is [itex]\mbox{cm}^{-2}[/itex]..

I was thinking that the ##M^\mu## is just a four-vector where the components are simple numbers with freely chosen dimensions. It can be called a "matrix" with only one row or column in it. I guess you're assuming here that the components of ##M^\mu## can be matrices that act in the space of some indices other than the Minkowski ones.

samalkhaiat · Oct 15, 2018

hilbert2 said:

I was thinking that the ##M^\mu## is just a four-vector where the components are simple numbers with freely chosen dimensions.

This thread is about relativistic field equations. So, in relativistic field theories, the expression [itex]M^{\mu}\partial_{\mu} + m^{2}[/itex] is meaningful differential operator if and only if [itex]M^{\mu} = \partial^{\mu}[/itex].

I guess you're assuming here that the components of ##M^\mu## can be matrices that act in the space of some indices other than the Minkowski ones.

No, I made no such assumption because it is not true in general. The indices on the Duffin-Kemmer field are spacetime indices acted upon by the matrices [itex]\Gamma^{\mu}[/itex]. So, the lessons from #8 are: (1) First-order relativistic field equations have no [itex]m^{2}[/itex] term in them, and (2) All relativistic multi-component fields satisfy the Klein-Gordon equation (which has [itex]m^{2}[/itex] term) component by component.

hilbert2 · Oct 16, 2018

I was thinking of it just as any time evolution equation with a differential operator explicitly written as

##M^0 \partial_0 - M^1 \partial_1 - M^2 \partial_2 - M^3 \partial_3 + m^2##,

and the 0-component interpreted as time. Even as such it doesn't really describe anything more complicated than motion with constant velocity.

Shouldn't it become a proper relativistic equation if the components of ##M^\mu## are functions of the Lorentz frame, i.e. transform as a vector?

Lorentz invariance and equation of motion for a scalar field

Discussion Overview

Discussion Character

Main Points Raised

Areas of Agreement / Disagreement

Contextual Notes

Similar threads

High School Interesting paper on QM in Scientific American

Undergrad ##r-##independent angular momentum in quantum mechanics

Graduate Consistency of Relativistic QM

Graduate Some derivation in QFT in Curved SpaceTime by Birrell and Davies

High School Seemingly odd quantum tunneling

Insights Revisiting the Velocity-Time Function

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect