Dismiss Notice
Join Physics Forums Today!
The friendliest, high quality science and math community on the planet! Everyone who loves science is here!

Featured A Is Gravity A Gauge Theory

  1. Jan 29, 2018 #1


    Staff: Mentor

    I have been reviewing GR lately because as a mentor I find myself now answering more of those questions. I learnt GR years ago from Wald and other sources, but since then have been exposed to the symmetries of the Standard Model. What struck me during this review is I now have a different perspective. I now suspect like all the other fundamental forces it is a gauge theory. Its gauge being invariance to coordinate transformations. Is this the right way of looking at it or is an I missing something? Exactly what gauge group would it be? Or is this a matter of exactly what one considers a gauge theory? I am not sure it can be formulated as a Yang-Mills theory (if so it would of course it would be one in the ordinary sense) - but came across the following I need to study:

  2. jcsd
  3. Jan 29, 2018 #2


    Staff: Mentor

  4. Jan 29, 2018 #3


    Staff: Mentor

    I don’t know if it is a gauge theory, but if it is then I don’t think that the gauge can be invariance to coordinate transformations. That comes from writing the theory in terms of tensors, but you can do that for the Standard Model too.
  5. Jan 29, 2018 #4


    Staff: Mentor

    That's interesting because in linearised gravity its gauge invariance is exactly that - invariance to infinitesimal coordinate transformations - Chapter 7 - Ohanian Gravitation and Space Time. Kretchmann proved merely writing a theory in the form of tensors is vacuous - it says nothing - and was eventually agreed to by Einstein. The exact foundation of GR was that the content of laws had to be invariant to coordinate transformations - merely formulating a theory in the form of tensors does not guarantee that. The Standard Model symmetries are formulated in terms of SR tensors which I don't think are general coordinate transformations.

    Have quickly scanned my posted link - they have a theory that generalizes GR - but isn't exactly GR.

    I am groping in the dark here - not sure exactly whats going on.

    Might need some high powered help from people like Urs or Sal (cant remember his full name).

    Last edited: Jan 29, 2018
  6. Jan 29, 2018 #5


    Staff: Mentor


    Had a look at both - but the above looks more reasonable than the others (my posted paper and the other one you posted). After reading them I am now suspicious its not a Yang-Mills theory - that seems to lead to theories that are a bit strange - not that it is a criteria for being correct - but it doesn't 'excite' me much

    The above paper is something that looks really interesting to study.

  7. Jan 29, 2018 #6


    User Avatar
    Science Advisor

    Maybe you like my insight about this:


    Short answer: GR can be seen as a gauge theory of the Poincaré algebra. It differs from Yang-Mills significantly, because (1) you put one gauge curvature to zero (that of the translations), and (2) the action is not quadratic in the remaining field strength (that of the Lorentz transformations). Because of the curvature constraint in (1) the local translations are just general coordinate transformations plus local Lorentz transformations, leaving you with the right types of transformations.
  8. Jan 29, 2018 #7


    Staff: Mentor

    Great article - loved it. but I have a lot of reading to do. However I was gladdened when I read - People like to say that gravity is the result of locally gauging Lorentz symmetry. That was my gut feeling sense.

    However a lot of work on my part understanding the detail.

  9. Jan 30, 2018 #8


    User Avatar
    Science Advisor

    Yes, it is a bit formal. On top of that, it is quite different from GR and subtely different from Yang-Mills :P

    The reason why it is a great approach however, is because it is easily extended to other symmetries, giving you all kinds of different theories of gravity. E.g., if you apply the same gauging procedure to the Bargmann algebra (which is the Galilei algebra + central extension), you obtain Newton-Cartan theory. In my PhD research I developed, in a similar way, Newton-Cartan theory for gravitating strings instead of point particles. And if you like supergravity (SUGRA): the D=4 minimal SUGRA can be obtained by gauging the super-Poincaré algebra.
  10. Feb 1, 2018 #9
    Enjoyed your Insight article. It immediately reminded me of the Bacry & Levy-Leblond classification of the 8 possible kinematical groups.

    If I recall correctly, John Baez, and more prominently, Derek Wise did some nice work on MacDowell-Mansouri GR using Cartan geometry. Moreover, almost 25 years ago, i.e. prior to Wise's work, Roger Penrose used Newton-Cartan theory to describe Diosi-Penrose quantum state reduction; how difficult would it exactly be to extend his argument to (MacDowell-Mansouri) GR? I'm assuming that this has already been tried.
  11. Feb 2, 2018 #10


    User Avatar
    Science Advisor

    Thanks! I'm not familiar with the Diosi-Penrose quantum state reduction, to be honest, but I also don't see why people would first try Newton-Cartan instead of ordinary Newtonian gravity. Newton-Cartan theory is more complicated than GR (although the solutions of course are simpler): you need degenerate geometry, two metrics, the connection contains an extra 2-form, the action principle is much more complicated, etc.

    Also, the question arises what conclusions one can draw from Newton-Cartan theory. E.g., people have written papers about its quantization, but Newton-Cartan theory doesn't contain any gravitational waves/propagating degrees of freedom, so I don't see how the quantization of NC-theory gives one any insight into quantum gravity in general (relativity :P ).
  12. Feb 2, 2018 #11


    User Avatar
    Science Advisor

    I don't know what a gauge theory is, but my impression is that mathematically it is described by a principal bundle over space-time and a choice of a connection. All of the mathematical notions having physical interpretation such as: the connection form is the field potential, the curvature form is the field strength and so on. General relativity as usually presented looks different, but since the connection used is a linear connection it can be viewed as a connection on the ##GL(4)## bundle of linear frames. And since it is a pseudo-Riemannian it reduces to the bundle of orthogonal frames.
  13. Feb 2, 2018 #12
    DP quantum state reduction is a prediction of intrinsic mass-dependent wave-function collapse as an actual phenomenon in an as yet to be discovered non-linear extension of QM, of which QM is the limiting case with quantum state reduction mass ##m_r \rightarrow 0## with the actual value for ##m_r## being the Planck mass.

    The actual argument used is quite subtle, namely not quantization of gravity (i.e. quantum gravity) but gravitization of QM, i.e. not reformulating GR respecting the principles of QM, but instead reformulating QM respecting the principles of GR i.e. in curved spacetime using the equivalence and general covariance principles.

    Moreover, Penrose explicitly chooses NC because he needs curved spacetime and a mathematically tractable way of dealing with the prohibition of pointwise identification of space-times due to the principle of general covariance. NC doesn't exactly resolve this prohibition but it at least makes a preliminary investigation of the problem mathematically more tractable; the fact that NC is a Newtonian approximation isn't problematic at all since quantum state reduction is independent of ##c##. Last but not least, there are papers arguing exactly the opposite i.e. that the NC framework does in fact shed light on the role of gravity in the measurement problem.
  14. Feb 2, 2018 #13


    User Avatar

    I have always been partly symphatetic with Penrose direction of thinking of a deeper connection between QM and GR. As is also supported by recent ponderings of "ER=EPR" and "QM=GR" as argued by Susskind and others.

    So there is no doubt in my mind that his intuition was onto something. But I think to develop the ideas and to understand exactly in what sense QM gravitates, one needs a serious revision of our constructing principles.

    I think trying to analyse the constructing principles of GR as well as the SM, trying to find the "common denominator" is indeed a rational approach.

    The first paragraph och haushofers nice insight articler describes the three steps on howto typically USE this gauge principle as a construction tool to build theories.

    One of the things often overlooked thinkgs - probably because it fringes to philosophy - is how to interpret the "ontological status" of these gauge symmetries in the first place? And the real question is not how to just "interpret them" to make us feel better and the usual discussions that follow, the core question is which is the most fruitful WAY of interpreting them in order to arrive at a coherent construction of the theory of all interactions. The way of interpreting this that leads to progress is thus the "right" one.
    (This was partly discussed from Wittens perspetive in https://www.physicsforums.com/threads/ed-witten-on-symmetry-and-emergence.927897/)

    But in order to state things more clearly, for me at least it has always been clear that the physical essence of the gauge principles observer equivalence. This means that the laws of physics "seen" by any observer in our universe, must be consistent with the views of the other observers. Equivalence may be a better word than invariance, because invariance makes it sounds like the gauges are non-physical and simply in the mathematical realm which imo is wrong. To think that all observers are "equivalent" is much more SANE that to think that the theory is invariant with respect to the observer choice.

    I would like to recode Haushofers steps in general terms

    (1) Encode/Represent the laws of physics with respect to an observer where we know how to do this
    (2) Now, try to define the manifold of all physically existing observers, and find transformations that transforms one observer to another one. Transform the theory
    the a general observer and note the appearance of additional terms in the theory that was not there before.
    (3) Introduce counterterms to compensate for the additional terms implied by (2). These counterterms are identified with a NEW interaction.

    So what to we need to know here?
    A) The laws of physics with respect to one observer. (We are free to choose the observer where the theory just "happens" to be simlpest)
    B) We need to know the full manifold of physically existing observers, and we need to understand the transformation groups that relates them.

    The full understanding of (B) requires complementing the lossy renormalisation flow with the reverse step. It also needs to be complemented with details of the actual way laws are encoded, and how the internal structure both mirrorors and constraints the laws. This is indeed a holographic view, but given the observer equivalence view, rather than observer invariance view, i am personally convinced that the laws are a result of negotiation, and thus a kind of attractor in theory space. When all observers can agree to disagree without killing off each other, we have a stable law that is consistently coded throughout the equivalence class.

    When we have sorted this mess out, I think the explicit connection between how information gravitates will be clear as well.

    If we dont get stuck on the surface of current theories, and step back and look at the constructing principles, such as the observer equivalence, I see great hope to find a coherent picture which embraces both views.

  15. Feb 3, 2018 #14


    User Avatar
    Science Advisor

    Thanks, I've done a little reading also by myself.

    It was a bit confusing perhaps of me to mention quantization of gravity. I'm well aware that Penrose's approach is not this. I merely wanted to emphasize the fact that one has to be carefull to use Newton-Cartan theory as a playground. But if it's about general covariance, I see why Penrose choose it.
  16. Feb 3, 2018 #15


    User Avatar
    Science Advisor

    Hoping to give an answer which replies to this post (the "ontological status"):

    The way I see gauge "redundancy", is that it enables us to make certain other symmetries manifest.

    E.g., take U(1) gauge theory. We use this gauge "redundancy" such that we can put the photon in a spin-1 representation of the Lorentz algebra. E.g., we make the symmetries of special relativity manifest. The same can be said of GR: we put the graviton in a spin-2 representation (Fierz-Pauli theory) and we need certain gauge symmetries to avoid ghosts. The special thing about GR however is that these gauge "redundancies" are transformations in spacetime instead of some internal space ("fiber") . More generally, we refer to this as the Stückelberg trick. I regard Newton-Cartan theory also as a very sophisticated Stückelberg trick. Sophisticated, because it makes Newtonian gravity general covariant in a very non-trivial way, namely via geometrization.

    See also

  17. Feb 3, 2018 #16


    User Avatar

    i will be away for a day or so, so i will be back with more comments on this later.

    But I agree this is an important difference, but with ontological status, i meant where/how the information about these equivalence classes are physically encoded. Here we face another issue, which indeed is related to the internal vs external point. In QFT we study small susystems, and the observer is dominant. In GR we have a cosmological theory where the environment dominates the observer. How can we make sense of this? in the holographic sense?

  18. Feb 14, 2018 #17


    User Avatar
    Science Advisor

    A) If by gauge theory one means the Yang-Mills type theory with its compact symmetry group, then claim that GR can be formulated as a Yang-Mills theory is just meaningless, because the symmetry group of GR is non-compact. However, the “limited similarities” between the dynamical variables of the two theories (Claim 1) and, in particular, the tetrad formalism of GR make general relativity very “similar” to a gauge theory. In fact, while the 4-dimensional GR action integral is definitely not a gauge theory action (Claim 2), the (1+2)-dimensional GR, without cosmological constant, is “equivalent” to a gauge theory with gauge group [itex]\mathcal{P}(1,2) = T(3) \rtimes SO(1,2)[/itex] and a Chern-Simons action integral (Claim 3). Even in this case, by “equivalence” one means on-shell equivalence between the gauge group [itex]\mathcal{P}(1,2)[/itex] and the group of diffeomorphisms, the symmetry group of the E-H action [itex]\int d^{3}x \sqrt{|g|} R[/itex]. Adding a cosmological constant [itex]\Lambda[/itex] to the 3-dimensional GR will only change the gauge group from the Poincare’ group [itex]\mathcal{P}(1,2)[/itex] to the de Sitter [itex]SO(1,3)[/itex] or anti-de Sitter [itex]SO(2,2)[/itex] group, depending on the sign of [itex]\Lambda[/itex]. The point is the following: the invariant metric (inner product) on the Lie algebra in question needs to be non-degenerate so that the action integral contains kinetic terms for each component of the gauge field. In particular, a Chern-Simons action exists for the gauge group [itex]\mathcal{P}(1,n-1)[/itex] if and only if [itex]n = 3[/itex]: For a general [itex]n[/itex], a Lorentz invariant bilinear expression in the generators would have to be of the form [itex]C = c_{1} P^{a}P_{a} + c_{2} J^{ab}J_{ab}[/itex], for some constants [itex]c_{1}[/itex] and [itex]c_{2}[/itex]. However, [itex][C , P_{b}] = 0[/itex] forces us to set [itex]c_{2} = 0[/itex] and with it disappear our hope in constructing a non-degenerate bilinear form on the Lie algebra. So, for a general [itex]n[/itex], there will be no Chern-Simons 3-form [itex]\mbox{tr} \left(\mathbb{A} \wedge \mbox{d} \mathbb{A} \right) = \mbox{tr} \left( X_{a}X_{b}\right) A^{a} \wedge \mbox{d} A^{b}[/itex] for [itex]\mathcal{P}(1,n-1)[/itex]. For [itex]n = 3[/itex] we are lucky because, in this case we can take [itex]C = \frac{1}{2} \epsilon_{abc}P^{a}J^{bc} \equiv P^{a}J_{a}[/itex]. It is easy to see that [itex][C , P] = [C , J] = 0[/itex] (i.e., Poincare invariant) as well as non-degenerate. Thus we are led to define the following invariant inner product on the Lie algebra [itex]\mathfrak{p}(1,2)[/itex]: [tex]\mbox{tr} \left( P_{a} J_{b}\right) = \eta_{ab} \ , \ \ \ \mbox{tr} \left( P_{a}P_{b}\right) = \mbox{tr} \left( J_{a}J_{b}\right) = 0 . \ \ \ \ (A.0)[/tex] With this inner product, a Chern-Simons action for the gauge group [itex]\mathcal{P}(1,2)[/itex] will exists and, as we shall see below, coincide with Einstein-Hilbert action on [itex](1+2)[/itex]-dimensional space-time.

    As usual, since I don’t like to use ambiguous language in my posts, I will try to mathematically clarify the three claims that I made above, at least in a sketchy way.

    (Claim 1) about the “limited similarities”:

    Basically, one would like to set up a 1-to-1 correspondence (dictionary if you like) between gravitational variables and gauge theory variables and see if such a dictionary is complete. If for every GR variable one can find a gauge theory variable, one then concludes that GR is certainly equivalent to a gauge theory. Even though we have already spoiled the fun, let us see how far one can go with analogy. Recall that under a general coordinate transformation, the gauge potential [itex]\mathbb{A}_{\alpha}(x) = A_{\alpha}^{C}(x)T_{C}[/itex], which is a (matrix) field taking values in the Lie algebra of the gauge group, transforms as a co-vector up to a gauge transformation. That is, when [itex]x^{\alpha} \to \bar{x}^{\alpha}(x)[/itex], the Yang-Mills connection transforms according to [tex]\bar{\mathbb{A}}_{\alpha} (\bar{x}) = \frac{\partial x^{\beta}}{\partial \bar{x}^{\alpha}} \left( U(x) \mathbb{A}_{\beta}(x) U^{-1}(x) + U(x) \partial_{\beta} U^{-1}(x)\right) \ . \ \ \ (A.1)[/tex] This means that the space-time index [itex]\alpha[/itex] carried by the gauge field matrix [itex](\mathbb{A}_{\alpha})^{a}{}_{b}(x)[/itex] transforms (as it should) by the inverse Jacobian matrix [itex]\frac{\partial x^{\beta}}{\partial \bar{x}^{\alpha}}[/itex], while the matrix (i.e., internal) indices [itex](a,b)[/itex] of [itex](\mathbb{A}_{\alpha})^{a}{}_{b}(x)[/itex] transform by the arbitrary gauge functions [itex]U^{a}{}_{b}(x)[/itex]. The corresponding suspect in GR is the Christoffel connection [itex]\Gamma^{\mu}_{\alpha \nu}(x)[/itex] with its usual transformation law [tex]\bar{\Gamma}^{\mu}_{\alpha \nu}(\bar{x}) = \left( \frac{\partial \bar{x}^{\mu}}{\partial x^{\rho}} \Gamma^{\rho}_{\beta \sigma} (x) \frac{\partial x^{\sigma}}{\partial \bar{x}^{\nu}} + \frac{\partial \bar{x}^{\mu}}{\partial x^{\sigma}} \frac{\partial}{\partial x^{\beta}} ( \frac{\partial x^{\sigma}}{\partial \bar{x}^{\nu}} ) \right) \frac{\partial x^{\beta}}{\partial \bar{x}^{\alpha}} \ . \ \ \ (A.2)[/tex] Now, for each [itex]\alpha = 0,1,2,3[/itex], let us define the following [itex]4 \times 4[/itex] field matrices [tex]\Gamma^{\mu}_{\alpha \nu}(x) \equiv ( \Gamma_{\alpha})^{\mu}{}_{\nu} (x) \ .[/tex] Let us also introduce the following [itex]4 \times 4[/itex] matrix (and its inverse) [tex]\frac{\partial \bar{x}^{\nu}}{\partial x^{\rho}} = V^{\nu}{}_{\rho} (x) \ , \ \ \frac{\partial x^{\sigma}}{\partial \bar{x}^{\nu}} = (V^{-1})^{\sigma}{}_{\nu} (x) \ .[/tex] With these definitions, equation (A.2) takes on the following matrix form [tex]\bar{\Gamma}_{\alpha}(\bar{x}) = \left( V(x) \Gamma_{\beta}(x) V^{-1}(x) + V(x) \partial_{\beta} V^{-1}(x) \right) \frac{\partial x^{\beta}}{ \partial \bar{x}^{\alpha}} \ . \ \ \ (A.3)[/tex] Now, I will make the following false argument: Since Eq(A.1) (which is the transformation law of the gauge field matrix [itex]\mathbb{A}_{\alpha}(x)[/itex]) is identical to Eq(A.3) (which is the transformation law of the Christoffel matrix field [itex]\Gamma_{\alpha}(x)[/itex]) then [itex]\Gamma_{\alpha}(x)[/itex] is the gauge field (i.e., connection) associated the [itex]GL(4 , \mathbb{R})[/itex] gauge group. In other words, GR is a gauge theory! Can you figure out why my argument is false? What is wrong with the saying that Eq(A.1) is identical to Eq(A.3)? So, [itex]\Gamma_{\alpha} \leftrightarrow \mathbb{A}_{\alpha}[/itex] is the first entry in our Gravity-Gauge dictionary. Okay, let us populate the dictionary with more objects. For the Riemann tensor [itex]R^{\mu}{}_{\nu \alpha \beta}[/itex] we define the anti-symmetric matrix [tex]R^{\mu}{}_{\nu \alpha \beta} (x) \equiv ( \mathbb{B}_{\alpha \beta})^{\mu}{}_{\nu}(x) \ .[/tex] Thus, the definition of the Riemann tensor, in terms of the Christoffel connection, translates to the following matrix equation [tex]\mathbb{B}_{\alpha \beta} = \partial_{\alpha} \Gamma_{\beta} - \partial_{\beta}\Gamma_{\alpha} + [ \Gamma_{\alpha} , \Gamma_{\beta} ] \ .[/tex] But this is exactly like the definition of the Yang-Mills field strength matrix [itex]\mathbb{F}_{\alpha \beta}(x) = F^{C}_{\alpha \beta}(x) T_{C}[/itex] [tex]\mathbb{F}_{\alpha \beta}(x) = \partial_{\alpha} \mathbb{A}_{\beta} - \partial_{\beta}\mathbb{A}_{\alpha} + [ \mathbb{A}_{\alpha} , \mathbb{A}_{\beta} ] \ .[/tex] And so we have [itex]R^{\mu}{}_{\nu \alpha \beta} \leftrightarrow \mathbb{F}_{\alpha \beta}[/itex]. This seems easy and you can carry on adding more objects, for example a (1,1)-type GR tensor [itex]T^{\mu}{}_{\nu}[/itex] corresponds to a Yang-Mills matrix [itex]M[/itex] with values in the adjoint representation of the gauge group, and then the GR covariant derivative [itex]\nabla_{\alpha}[/itex] translates according to [tex]\nabla_{\alpha} T^{\mu}{}_{\nu} \leftrightarrow D_{\alpha} M = \partial_{\alpha} M + [ \mathbb{A}_{\alpha} , M] \ .[/tex] However, our dictionary ends when we try to contract the space-time indices [itex](\alpha , \beta , ...)[/itex] with the would be “gauge indices” [itex](\mu , \nu , ...)[/itex] (why is that?). This means that the Ricci tensor [itex]R_{\nu \beta} = R^{\mu}{}_{\nu \mu \beta}[/itex] and the scalar curvature [itex]R = g^{\nu \beta} R_{\nu \beta}[/itex] do not correspond to well-defined objects in gauge theories. Thus, the Einstein-Hilbert action [itex]\int d^{4}x \sqrt{|g|} R[/itex] does not exist and we conclude that in 4 dimensions GR is not equivalent to a gauge theory as I claimed in (Claim 2) above. We will reach the same conclusion about (Claim 2) in the tetrad formalism below.

    B) The tetrad formalism of GR, (Claim 2) and (Claim 3):

    Since GR can be, equivalently, formulated in terms of the tetrad field and spin connection, let me say few words about the geometrical meaning and the functioning of the tetrad field and the spin connection. This is important because it is this tetrad formalism that led many people to try to interpret GR as a gauge theory of the Poincare’ group [itex]\mathcal{P}(1,3)[/itex]. The idea is to interpret the tetrad field [itex]e^{a}{}_{\alpha}(x)[/itex] as the gauge field associated with translation, while the spin connection [itex]\omega_{\alpha}{}^{a}{}_{b}(x)[/itex] is taken to be the gauge field associated with the Lorentz group [itex]SO(1,3)[/itex]. So, what are these fields and in what vector spaces do their indices take values in? We say that space-time of dimension [itex]n[/itex] can be modelled by a smooth [itex]n[/itex]-manifold [itex]M[/itex] if and only if [itex]M[/itex] admits metric of Lorentzian signature. So, we introduce an abstract vector bundle [itex]\mathcal{V}^{n}[/itex] with structure group [itex]SO(1,n-1)[/itex]. This means that [itex]\mathcal{V}^{n}[/itex] is equipped with a Minkowskian metric [itex]\eta_{ab} = \mbox{diag}(1,-1,-1, \cdots , -1)[/itex] and a volume form [itex]\epsilon_{a_{1} a_{2} \cdots a_{n}}[/itex]. We assume that [itex]\mathcal{V}^{n}[/itex] has the same topological structure as that of the tangent bundle [itex]TM[/itex] so that isomorphisms exist between [itex]\mathcal{V}^{n}[/itex] and [itex]TM[/itex]. At any [itex]p \in M[/itex], the “tetrad” [itex]e_{\mu}{}^{a}[/itex] provides a choice of (vector space) isomorphism [itex]e_{\mu}{}^{a}(p) : M_{p}T \to \mathcal{V}^{n}[/itex], i.e., a [itex]\mathcal{V}^{n}[/itex]-valued 1-form on [itex]M[/itex] [itex]e^{a} (x) = e_{\mu}{}^{a}(x) \mbox{d}x^{\mu}[/itex]. I think it is clear to you that I am using [itex](a,b,c, \cdots )[/itex] as Lorentz indices (i.e., they define the geometrical objects on [itex]\mathcal{V}^{n}[/itex]), and [itex](\mu , \nu \cdots )[/itex] as tangent space indices (i.e., they define geometrical object on the spacetime manifold [itex]M[/itex]). The metric [itex]\eta[/itex] and the volume form [itex]\epsilon[/itex] on [itex]\mathcal{V}^{n}[/itex] together with isomorphism [itex]e_{\mu}{}^{a}[/itex] between [itex]MT[/itex] and [itex]\mathcal{V}^{n}[/itex] give a metric [tex]g_{\mu\nu} = e_{\mu}{}^{a} \ e_{\nu}{}^{b} \ \eta_{ab} \ ,[/tex] on [itex]M[/itex] having the same signature as [itex]\eta_{ab}[/itex] and a volume form [tex]\sqrt{|g|} \epsilon_{\mu_{1} \mu_{2} \cdots \mu_{n}} = e_{\mu_{1}}{}^{a_{1}} \ e_{\mu_{2}}{}^{a_{2}} \ \cdots \ e_{\mu_{n}}{}^{a_{n}} \ \epsilon_{a_{1} a_{2} \cdots a_{n}} \ . \ \ \ (B.1)[/tex] The spin connection [itex]\omega[/itex], which is an [itex]\mathfrak{so}(1,n-1)[/itex]-valued connection on [itex]\mathcal{V}^{n}[/itex], can be regarded as a 1-form on [itex]M[/itex] with values in the Lie algebra of [itex]SO(1,n-1)[/itex] [tex]\omega^{a}{}_{b}(x) = \omega_{\mu}{}^{a}{}_{b}(x) \ \mbox{d}x^{\mu} \ . \ \ \ \ \ \ (B.2)[/tex] Now, instead of the fields [itex](g_{\mu\nu} , \Gamma^{\rho}_{\mu\nu})[/itex], GR can be (equivalently) described in terms of the fields [itex](e_{\mu}{}^{a} , \omega_{\mu}{}^{a}{}_{b})[/itex]. The curvature tensor is defined by [tex]R_{\alpha \beta}{}^{ab}( \omega ) = e_{\mu}{}^{a} \ e_{\nu}{}^{b} R_{\alpha \beta}{}^{\mu\nu}( \Gamma ) = \partial_{[ \alpha} \omega_{ \beta ]}{}^{ab} + [ \omega_{\alpha} , \omega_{\beta} ]^{ab} \ ,[/tex] or simply as a 2-form on [itex]M[/itex] with values in [itex]\wedge^{2}\mathcal{V}^{n}[/itex]: [tex]\mathcal{R}^{a}{}_{b} \equiv \frac{1}{2} R_{\alpha \beta}{}^{a}{}_{b} \ \mbox{d}x^{\alpha} \wedge \mbox{d}x^{\beta} = \left( \mbox{d} \omega + \omega \wedge \omega \right)^{a}{}_{b} \ . \ \ (B.3)[/tex]

    B1) 4-dimensional Gravity and (Claim 2):

    In this subsection we will put [itex]n = 4[/itex] and show that one cannot hope to interpret GR as a gauge theory. Using the above formalism we can now use the 1-form [itex]e^{a}[/itex] and the 2-form [itex]\mathcal{R}^{cd}[/itex] together with the volume form [itex]\epsilon_{abcd}[/itex] on [itex]\mathcal{V}^{(1,3)}[/itex] to construct an invariant (action) integral on [itex]M^{(1,3)}[/itex] (the 4-form [itex]e \wedge e \wedge \mathcal{R}[/itex] on [itex]M[/itex] which takes values in [itex]\mathcal{V} \otimes \mathcal{V} \otimes \wedge^{2} \mathcal{V}[/itex] and maps to [itex]\wedge^{4} \mathcal{V}^{(1,3)}[/itex] can be used to form an invariant integral on [itex]M[/itex] because section of [itex]\wedge^{4} \mathcal{V}^{(1,3)}[/itex] is just a function) [tex]S(e , \omega ) = - \frac{1}{2} \int_{M^{(1,3)}} \ \epsilon_{abcd} \ e^{a} \wedge e^{b} \wedge \mathcal{R}^{cd} \ , \ \ \ \ \ \ (B.4)[/tex] Indeed, this is nothing but the Einstein-Hilbert action written in terms of the fields [itex]( e ,\omega )[/itex]: To obtain eq(B.4) from the E-H action, write the Ricci scalar in the form [tex]\begin{align*}R & = \frac{1}{2} \left( \delta^{\mu}_{\alpha} \ \delta^{\nu}_{\beta} - \delta^{\mu}_{\beta} \ \delta^{\nu}_{\alpha}\right) R_{\mu \nu}{}^{\alpha \beta} \\ & = - \frac{1}{4} \epsilon^{\mu \nu \rho \sigma} \ \epsilon_{\alpha \beta \rho \sigma} \ R_{\mu \nu}{}^{\alpha \beta} \end{align*} [/tex] Now multiply this with [itex]\sqrt{|g|}[/itex] and use the 4-dimensional version of eq(B.1): [tex]\sqrt{|g|} \ \epsilon_{\alpha \beta \rho \sigma} = \epsilon_{abcd} \ e_{\alpha}{}^{c} \ e_{\beta}{}^{d} \ e_{\rho}{}^{a} \ e_{\sigma}{}^{b} \ ,[/tex] then integrate over [itex]M[/itex] [tex]\begin{align*} \int d^{4}x \ \sqrt{|g|} \ R &= - \frac{1}{2} \int \left( d^{4}x \ \epsilon^{\mu \nu \rho \sigma}\right) \ \epsilon_{abcd} \ e_{\rho}{}^{a} \ e_{\sigma}{}^{b} \left( \frac{1}{2} e_{\alpha}{}^{c} \ e_{\beta}{}^{d} \ R_{\mu \nu}{}^{\alpha \beta}\right) \\ &= - \frac{1}{2} \int \ \epsilon_{abcd} \left( e_{\rho}{}^{a} \mbox{d}x^{\rho}\right) \wedge \left( e_{\sigma}{}^{a} \mbox{d}x^{\sigma}\right) \wedge \left( \frac{1}{2} R_{\mu\nu}{}^{cd} \ \mbox{d}x^{\mu} \wedge \mbox{d}x^{\nu}\right) \\ &= - \frac{1}{2} \int_{M^{(1,3)}} \ \epsilon_{abcd} \ e^{a} \wedge e^{b} \wedge \mathcal{R}^{cd} \ . \end{align*} [/tex] So, on the 4-dimensional spacetime [itex]M^{(1,3)}[/itex], the E-H action is of the general form [tex]S_{EH}(e,\omega ) \sim \int_{M^{(1,3)}} \ \epsilon \ e \wedge e \wedge \left( \mbox{d} \omega + \omega^{2} \right) \ .[/tex] Now if we interpret the fields [itex](e , \omega )[/itex] as components of a gauge connection [itex]\mathbb{A}[/itex], then the above action will have the following general form [tex]S( \mathbb{A} ) \sim \int_{M^{(1,3)}} \ \mathbb{A} \wedge \mathbb{A} \wedge \left( \mbox{d} \mathbb{A} + \mathbb{A}^{2} \right) \ .[/tex] But there is no such action in gauge theories and, therefore, 4-dimensional gravity is not a gauge theory.

    B2) 3-dimensional Gravity is a gauge theory (Claim 3):

    Let us now repeat what we have done in (B1) for the (1+2)-dimensional spacetime [itex]M^{(1,2)}[/itex]. I am sure you can follow all the steps in the derivation of the following E-H action on [itex]M^{(1,2)}[/itex]:
    S_{EH}(e , \omega ) & = \frac{1}{2} \int \left(d^{3}x \ \epsilon^{\mu \nu \rho} \right) \ e_{\rho}{}^{a} \left(\epsilon_{abc} \ R_{\mu \nu}{}^{bc} \right) \\
    & = \int \left( e_{\rho}{}^{a} \mbox{d}x^{\rho} \right) \wedge \left( \frac{1}{2} \epsilon_{abc} \ R_{\mu \nu}{}^{bc} \mbox{d}x^{\mu} \wedge \mbox{d}x^{\nu} \right) \\
    & = \int_{M^{(1,2)}} \ e^{a} \wedge \left( \epsilon_{abc} \ \mathcal{R}^{bc} \right) \\
    & = 2 \int_{M^{(1,2)}} \ e^{a} \wedge \mathcal{R}_{a} \ \ \ \ \ \ \ \ \ \ (B.5) \\
    & = 2 \int_{M^{(1,2)}} \ e^{a} \wedge \left( \mbox{d} \omega_{a} + \frac{1}{2} \epsilon_{abc} \ \omega^{b} \wedge \omega^{c} \right) \ \ \ (B.5)
    where [tex]\omega_{a} = \frac{1}{2} \epsilon_{abc} \ \omega^{bc} =\frac{1}{2} \ \epsilon_{abc} \ \omega_{\mu}{}^{bc} \mbox{d}x^{\mu} \ ,[/tex] and [tex]\mathcal{R}_{a} = \frac{1}{2} \epsilon_{abc} \ \mathcal{R}^{bc} = \mbox{d} \omega_{a} + \frac{1}{2} \epsilon_{abc} \ \omega^{b} \wedge \omega^{c} \ .[/tex] Now, if we interpret [itex](e, \omega )[/itex] as components of gauge field matrix [itex]\mathbb{A}[/itex], then the E-H action of 3-dimentional gravity will be of the form [tex]S_{EH}( \mathbb{A}) \sim \int_{M^{(1,2)}} \mathbb{A} \wedge \left( \mbox{d}\mathbb{A} + \mathbb{A}^{2} \right) \ . [/tex] But this looks very much like gauge theory action of Chern-Simons type. So, it is conceivable to interpret (1+2)-gravity as a gauge theory of the Poincare’ group [itex]\mathcal{P}(1,2)[/itex] with a pure Chern-Simons action. Indeed, you can show that the action eq(B.5) is invariant under local [itex]SO(1,2)[/itex] (Lorentz) transformations generated by the infinitesimal parameter [itex]\beta^{a}(x)[/itex]:
    [tex]\delta e^{a} (x) = \epsilon^{abc} \ e_{b}(x) \beta_{c}(x) \ ,[/tex] [tex]\delta \omega^{a} (x) = \mbox{d} \beta^{a} + \epsilon^{abc} \ \omega_{b} (x) \beta_{c}(x) \ ,[/tex] as well as local translations generated by the infinitesimal [itex]T^{(3)}[/itex]-parameter [itex]\alpha^{a}(x)[/itex]:
    [tex]\delta e^{a} (x) = \mbox{d} \alpha^{a} + \epsilon^{abc} \ \omega_{b}(x) \ \alpha_{c}(x) \ , \ \ \delta \omega^{a} (x) = 0 \ .[/tex] Moreover, when the fields [itex](e^{a} , \omega^{a})[/itex] satisfy their equations of motion (i.e., on shell), one can show that the combination of transformations with parameters [itex]\alpha^{a}(x) = \chi \ e^{a}(x)[/itex] and [itex]\beta^{a}(x) = \chi \ \omega^{a}(x)[/itex] is equivalent to a [itex]M^{(1,2)}[/itex]-diffeomorphism generated by the vector field [itex]\chi[/itex]. It remains to prove that 3D-gravity action eq(B.5) is the Chern-Simons action associated with the gauge group [itex]\mathcal{P}(1,2)[/itex]. Below, I will present you with two methods for proving that statement. Both methods will require the Poincare’ algebra as well as an invariant inner product on it. In 3 dimensions it is convenient to work with the Lorentz generator [itex]J_{a} = \frac{1}{2} \epsilon_{abc} J^{bc}[/itex] instead of [itex]J^{ab}[/itex]. With this definition, the Lie algebra of [itex]\mathcal{P}(1,2)[/itex] can be rewritten as [tex][P_{a} , P_{b}] = 0 \ , \ \ [J_{a} , P_{b} ] = \epsilon_{abc}P^{c} \ ,[/tex][tex][J_{a} , J_{b}] = \epsilon_{abc}J^{c} \ , [/tex] and the relevant invariant inner product on the algebra is then [tex]\mbox{Tr}(J_{a}P_{b}) = \eta_{ab} \ , \ \ \ \mbox{Tr}(P_{a}P_{b}) = \mbox{Tr}(J_{a}J_{b}) = 0 \ .[/tex]

    Method 1: In this method, we will borrow the topological Chern-Simon action from ordinary Yang-Mills theory, express the Yang-Mills potential in terms of the fields [itex]( e , \omega )[/itex] and use the Poincare’ algebra and its inner product to show that it is equivalent to the (1+2)-dimensional GR action integral eq(B.5). In an ordinary gauge theory with compact Lie group and algebra
    [tex][X_{a} , X_{b}] = C_{ab}{}^{c} X_{c}, \ \ \ a, b, c = 1, 2, \cdots r \ ,[/tex]
    the topological Chern-Simons action is given (up to a constant which I set equal to 1) by [tex]S_{CS}(A) = \int_{M^{(1,2)}} \mbox{tr} \left( \mathbb{A} \wedge \mbox{d} \mathbb{A} + \frac{2}{3} \mathbb{A} \wedge \mathbb{A} \wedge \mathbb{A}\right) \ , \ \ \ (B.6)[/tex] where [tex]\mathbb{A} = \mathbb{A}_{\mu} \mbox{d}x^{\mu} = A_{\mu}^{a} X_{a} \mbox{d}x^{\mu} = A^{a} X_{a} \ . \ \ \ \ (B.7)[/tex] I hope Eq(B.7) makes it clear to you that [itex]\mathbb{A}[/itex] is a matrix-valued 1-form, [itex]\mathbb{A}_{\mu}[/itex] is a matrix-valued gauge field, [itex]A_{\mu}^{a}[/itex] are real numbers denoting the components of the gauge field matrix [itex]\mathbb{A}_{\mu}[/itex] in the Lie algebra basis [itex]X_{a}[/itex] and finally the [itex]A^{a}[/itex]’s are a set of 1-forms representing the “components” of the 1-form matrix [itex]\mathbb{A}[/itex] in the Lie algebra basis [itex]X_{a}[/itex]. So, with this notation, we have [tex]\mathbb{A} \wedge \mathbb{A} = \frac{1}{2}[X_{a} , X_{b}] \ A^{a} \wedge A^{b} \ .[/tex] Therefore [tex]\mbox{tr} \left(\mathbb{A} \wedge \mathbb{A} \wedge \mathbb{A}\right) = \frac{1}{2} \mbox{tr} \left( X_{a}[X_{b} , X_{c}] \right) A^{a} \wedge A^{b} \wedge A^{c} \ . \ \ \ (B.8b)[/tex] And we also write [tex]\mbox{tr} \left( \mathbb{A} \wedge \mbox{d} \mathbb{A} \right) = \mbox{tr} \left( X_{a} X_{b} \right) \ A^{a} \wedge \mbox{d} A^{b} \ . \ \ \ \ (B.8a)[/tex] We will now identify the generators [itex]X_{a}, \ a = 1, \cdots r[/itex] with six Poincare’ generators [itex](P_{a} , J_{a})[/itex], and the set of [itex]r[/itex] one forms [itex]A^{a}[/itex] with [itex](e^{a} , \omega^{a})[/itex], the Poincare-valued one forms on [itex]M^{(1,2)}[/itex]. In other words, we simply write [tex]\mathbb{A} (x) = e^{a}(x) \ P_{a} + \omega^{a}(x) \ J_{a} \ . \ \ \ \ \ (B.9)[/tex] Now, using the inner product on the Poincare’ algebra, we see that the RHS of eq(B.8a) contains only two non-zero terms [tex]\begin{align*} \mbox{tr} \left( \mathbb{A} \wedge \mbox{d} \mathbb{A} \right) &= \mbox{tr}(P_{a}J_{b}) \ e^{a} \wedge \mbox{d} \omega^{b} + \mbox{tr}(J_{a}P_{b}) \ \omega^{a} \wedge \mbox{d} e^{b} \\ & = e^{a} \ \wedge \ \mbox{d} \omega_{a} + \omega_{a} \ \wedge \ \mbox{d} e^{a} \ . \end{align*}[/tex] Also, the Poincare’ algebra and inner product reduce the RHS of eq(B.8b) to only three non-zero and equal terms [tex]\begin{align*} \mbox{RHS of (B.8b)} & = \frac{1}{2} \mbox{tr} \left( [P_{a} , J_{b}] J_{c} \right) e^{a} \wedge \omega^{b} \wedge \omega^{c} \\
    & + \frac{1}{2} \mbox{tr} \left( [J_{a} , P_{b}] J_{c} \right) \omega^{a} \wedge e^{b} \wedge \omega^{c} \\
    & + \frac{1}{2} \mbox{tr} \left( [J_{a} , J_{b}] P_{c} \right) \omega^{a} \wedge \omega^{b} \wedge e^{c} \\
    & = \frac{3}{2} \epsilon_{abc} \ e^{a} \wedge \omega^{b} \wedge \omega^{c} \ . \end{align*}[/tex] Substituting these results in eq(B.6) we obtain [tex]S_{CS}(e, \omega) = \int_{M^{(1,2)}} \left( \omega_{a} \wedge \mbox{d} e^{a} + e^{a} \wedge \mbox{d} \omega_{a} + \epsilon_{abc} \ e^{a} \wedge \omega^{b} \wedge \omega^{c}\right) \ .[/tex] And, finally we integrate the first term by part and ignore total derivative to obtain the Chern-Simons action associated with gauge group [itex]\mathcal{P}(1,2) = T(3) \rtimes SO(1,2)[/itex] (integrating the second term instead of the first produces another story)[tex]S_{CS}(e , \omega ) = 2 \int_{M^{(1,2)}} \ e^{a} \wedge \left( \mbox{d} \omega_{a} + \frac{1}{2} \epsilon_{abc} \omega^{b} \wedge \omega^{c} \right) = 2 \int_{M^{(1,2)}} \ e^{a} \wedge \mathcal{R}_{a} \ .[/tex] Comparing this with eq(B.5), we see that [itex]S_{EH} (e , \omega ) = S_{CS} (e , \omega )[/itex]. Thus, in (1+2)-dimensional spacetime, GR is equivalent to a gauge theory with gauge group [itex]\mathcal{P}(1,2)[/itex] and topological Chern-Simons action.

    Method 2: In here we will follow the usual method for constructing Chern-Simons action from topological invariant integral. Recall that, under gauge transformation by [itex]U(x) = e^{- \Theta (x)} \ , \Theta = \theta^{a}(x) X_{a}[/itex], the gauge field matrix changes according to [tex]\mathbb{A}_{\mu} \to U \left( \partial_{\mu} + \mathbb{A}_{\mu} \right) U^{-1} \ .[/tex] The infinitesimal version of this transformation is simply [tex]\delta \mathbb{A}_{\mu} = \partial_{\mu} \Theta + [\mathbb{A}_{\mu} , \Theta ] \equiv D_{\mu} \Theta \ . \ \ \ \ (B.10)[/tex] Now, we take [itex]U(x)[/itex] to be local Poincare transformation [tex]U( \alpha , \beta ) = e^{- ( \alpha^{a} (x) \ P_{a} + \beta^{a} (x) \ J_{a} )} \ ,[/tex] and write the gauge field in terms of the triad field [itex]e_{\mu}{}^{a}[/itex] and spin connection [itex]\omega_{\mu}{}^{a}[/itex] on [itex]M^{(1,2)}[/itex]: [tex]\mathbb{A}_{\mu} = e_{\mu}{}^{a} (x) \ P_{a} + \omega_{\mu}{}^{a}(x) \ J_{a} \ . \ \ \ \ \ (B.11)[/tex] Substituting these in eq(B.10) and using the Poincare algebra, we obtain the following local (Poincare) gauge transformations [tex]\delta e_{\mu}{}^{a} = \partial_{\mu} \alpha^{a} + \epsilon^{abc} \left( e_{\mu b} \beta_{c} + \omega_{\mu b} \ \alpha_{c}\right) \ , \ \ \ (B.12a)[/tex][tex]\delta \omega_{\mu}{}^{a} = \partial_{\mu} \beta^{a} + \epsilon^{abc} \ \omega_{\mu b} \ \beta_{c} \ . \ \ \ \ \ \ \ (B.12b)[/tex] The matrix-valued field tensor is defined as usual [tex]\mathbb{F}_{\mu\nu} = [ D_{\mu} , D_{\nu}] = \partial_{[ \mu} \mathbb{A}_{\nu ]} + [ \mathbb{A}_{\mu} , \mathbb{A}_{\nu}] \ .[/tex] Substituting eq(B.11) and using the Poincare algebra, we get [tex]\mathbb{F}_{\mu\nu} (x) = \mathcal{E}_{\mu\nu}{}^{a} (x) \ P_{a} + \mathcal{R}_{\mu\nu}{}^{a} (x) \ J_{a} \ ,[/tex] where [tex]\mathcal{E}_{\mu\nu}{}^{a} = \partial_{[\mu} e_{\nu ]}{}^{a} + \epsilon^{abc} \left( e_{\mu b} \ \omega_{\nu c} + \omega_{\mu b} \ e_{\nu c}\right) \ ,[/tex][tex]\mathcal{R}_{\mu\nu}{}^{a} = \partial_{[\mu} \omega_{\nu ]}{}^{a} + \epsilon^{abc} \ \omega_{\mu b} \ \omega_{\nu c} \ .[/tex] Now, suppose we want to put this [itex]\mathcal{P}(1,2)[/itex] gauge theory on a 4-dimensional manifold [itex]N^{4}[/itex]. Then, on [itex]N^{4}[/itex] there will be a topological invariant given by the integral [tex]\mathcal{J} = \frac{1}{2} \int_{N^{4}} d^{4}x \ \epsilon^{\mu\nu\rho\sigma} \ \mbox{tr} \left( \mathbb{F}_{\mu\nu} \mathbb{F}_{\rho\sigma}\right) \ .[/tex] using the inner product on the algebra, we get [tex]\mathcal{J} = \int_{N^{4}} d^{4}x \ \epsilon^{\mu\nu\rho\sigma} \ \mathcal{E}_{\mu\nu a} \ \mathcal{R}_{\rho \sigma}{}^{a} \ . \ \ \ \ \ \ \ \ \ \ \ (B.13)[/tex] Now, with extremely painful algebra we can show that the integrand in eq(B.13) is actually a total divergence, and we use the divergence theorem to obtain [tex]\mathcal{J} = \int_{N^{4}} d^{4}x \ \partial_{\sigma} \left( \epsilon^{\mu\nu\rho\sigma} \ \mathcal{S}_{\mu\nu\rho}\right) = \int_{\partial N^{4}} d \Sigma_{\sigma} \ \epsilon^{\mu\nu\rho\sigma} \ \mathcal{S}_{\mu\nu\rho} \ ,[/tex] where [tex]\mathcal{S}_{\mu\nu\rho} = e_{\rho}{}^{a} \left( \partial_{[ \mu} \omega_{\nu ] a} + \epsilon_{abc} \ \omega_{\mu}{}^{b} \ \omega_{\nu}{}^{c}\right) \ .[/tex] Now, if we identify our (1+2)-spacetime [itex]M^{(1,2)}[/itex] with the boundary [itex]\partial N^{4}[/itex] we obtain [itex]\int_{M} d^{3}x \ \epsilon^{\mu\nu\rho} \ \mathcal{S}_{\mu\nu\rho}[/itex], as a gauge invariant integral on [itex]M^{(1,2)}[/itex] and this, by definition, is the Chern-Simons action [tex]\begin{align*} S_{CS}(e , \omega ) & = 2 \int_{M^{(1,2)}} d^{3}x \ \epsilon^{\mu\nu\rho} \ e_{\rho}{}^{a} \left( \frac{1}{2} \partial_{[ \mu} \omega_{\nu ] a} + \frac{1}{2} \ \epsilon_{abc} \ \omega_{\mu}{}^{b} \ \omega_{\nu}{}^{c} \right) \\ & = 2 \int_{M^{(1,2)}} \ e^{a} \wedge \left( \mbox{d} \omega_{a} + \frac{1}{2} \ \epsilon_{abc} \ \omega^{b} \wedge \omega^{c}\right) \\ & = S_{EH} ( e , \omega ) \ . \end{align*}[/tex]

    I think that is enough for now.
    Last edited: Feb 14, 2018
  19. Feb 16, 2018 #18

    Paul Colby

    User Avatar
    Gold Member

    The book is dated but it might be worth a look "Group Theory and General Relativity" by Moshe Carmeli pub McGraw-Hill 1977. Much in there about the Lorentz group representations, Yang Mills, spinors SL(2,C) and so forth. I've enjoyed reading sections over the years.
Share this great discussion with others via Reddit, Google+, Twitter, or Facebook

Have something to add?
Draft saved Draft deleted