(Simple) Derivation of Yang-Mills Equations

  • Context: Graduate 
  • Thread starter Thread starter ozone
  • Start date Start date
  • Tags Tags
    Derivation Yang-mills
Click For Summary
SUMMARY

The discussion centers on the derivation of the Yang-Mills equations, specifically addressing the variation of the action given by the Lagrangian \( S = \frac{1}{4} \int \text{tr}(F_{\mu\nu} F^{\mu\nu}) d^4x \). The participant explores the implications of taking the trace in the context of non-commuting potentials and the resulting field equations. The final derived equation is \( \partial_\mu F^{\mu\nu} = A_\mu F^{\mu\nu} - F^{\mu\nu} A_\mu \), which aligns with the standard form \( D_\mu F^{\mu\nu} = 0 \), where \( D_\mu \) represents the covariant derivative.

PREREQUISITES
  • Understanding of Yang-Mills theory and its equations
  • Familiarity with Lagrangian mechanics and variational principles
  • Knowledge of matrix operations, particularly traces and commutators
  • Basic concepts of differential geometry as applied in physics
NEXT STEPS
  • Study the derivation of the covariant derivative \( D_\mu G = \partial_\mu - [A_\mu, G] \)
  • Explore the relationship between Yang-Mills theory and general relativity (GR)
  • Investigate the properties of traces in matrix calculus, particularly in the context of variations
  • Review literature on differential forms and their applications in theoretical physics
USEFUL FOR

The discussion is beneficial for theoretical physicists, graduate students in physics, and researchers focusing on gauge theories and their mathematical foundations.

ozone
Messages
121
Reaction score
0
Hello all, my teacher assigned a problem related to the yang-mills equation in my general relativity class and I just wanted to ask a couple of questions about this problem. I believe it is a simplified version of the Yang-Mills you encounter in particle physics.

basically assuming that F_{\mu\nu} → F_{\mu\nu} + A_\mu A_\nu - A_\nu A_\mu (for some non-commuting potentials) we were instructed to derive the field equations from our lagrangian S = \frac{1}{4} \int tr ( F_{\mu\nu}F^{\mu \nu}) d^4x. Firstly I wondered if we could just ignore the fact that we are taking a trace.. I am really not sure what difference this makes to the equations. He presented the problem with basically only the details I described above so perhaps the trace is used when dealing with a more complex problem.. I'm not entirely sure. Anyways I decided to try varying with respect to \delta A_\nu the math went like:

\delta S = \frac{1}{4} \int tr (( F_{\mu\nu} \delta F^{\mu \nu} + \delta F^{\mu\nu} F_{\mu \nu} ) d^4x

\delta S = \frac{1}{2} \int tr ((\delta F_{\mu\nu} F^{\mu \nu} )) d^4x

\delta S = \frac{1}{2} \int tr (F^{\mu\nu} (\partial_\mu \delta A_\nu - \partial_\nu \delta A_\mu + \delta(A_\mu A_\nu - A_\nu A_\mu))) d^4x

\delta S = \int tr (F^{\mu\nu} (\partial_\mu \delta A_\nu + (\delta A_\mu A_\nu + A_\mu \delta A_\nu))) d^4x

Now using integration by parts and boundary conditions on the first term and similar index swapping from above I made it about as far as I could on my own and came upon

\delta S = \int tr ((-\partial_\mu F^{\mu\nu} \delta A_\nu +F^{\mu\nu} (-\delta A_\nu A_\mu + A_\mu \delta A_\nu))) d^4x

So then I supposed that I could multiply by some sort of inverse matrix or something of the sort. Also I assumed that we could ignore the trace (which again I'm not sure is correct, but I have never varied a trace of a matrix before). My final equation came out to be.

\partial_\mu F^{\mu\nu} + (\delta A_\nu A_\mu (\delta A_\nu)^{-1} - A_\mu) F^{\mu\nu} = 0

However something has definitely gone wrong, I know I need to get rid of that pesky \delta A_\nu.. I also tried using Euler-Lagrange equations but I had even less luck than here.. I would just like to bounce some ideas off you guys and perhaps get pointed in the direction of some good literature.
 
Last edited:
Physics news on Phys.org
In your second-to-last equation, why don't you split up the trace using tr(X+Y+Z) = tr(X) + tr(Y) + tr(Z) and then use the cyclic property of the trace on the Y term to get the ##\delta A## in the same spot as it is in the X and Z terms? Then you can combine the three terms back together again and get a nice expression for ##\delta S/\delta A##, as desired.

Regarding the trace: I bet you can convince yourself that your method for taking the variation of a trace is correct if you try a few concrete examples. Pick a specific matrix ##M## and a variation ##\delta M##. Then compute the variation ##\delta ({\rm tr} (M)) = {\rm tr}(M + \delta M) - {\rm tr} (M)## and compare to your assumption that ##\delta ({\rm tr} M) = {\rm tr} (\delta M)##.

In the end you get an equation of the form ##{\rm tr} (M \delta A) = 0## which must hold for any value of the matrix ##\delta A##. See if you can think up examples of matrices ##\delta A## that force particular entries in the matrix M to be zero. If you can do so for every entry in ##M##, then the equation reduces to ##M = 0##, as desired.
 
Last edited:
  • Like
Likes   Reactions: 1 person
Ahh, I have never seen this property before. Using that I arrived at

\partial_\mu F^{\mu\nu} =A_\mu F^{\mu\nu} - F^{\mu\nu} A_\mu

Is this a reasonable answer for such a problem? Sorry but I have no intuition for the yang-mills and it is a very dense subject.

I just saw your edit about the trace, I will play around with it and see if I can't convince myself, I understand that the trace is linear and hence I felt this was an okay way to approach it, but it just felt odd that in the end I completely disregard the fact that I am taking a trace in order to reach this conclusion, really the above line should be enclosed in a trace operator but I feel as if that makes no diference.
 
Of course, stating that the trace of two matrices is equal is a much weaker statement than being able to say that the two matrices are equal, so it's very important that the equation you get is not just relating the trace of both sides.

The key is that the variation of A is totally arbitrary - you want the the action to be an extremum for any possible variation. So you just need to convince yourself that δA can be chosen such that tr(MδA)=0 implies an arbitrary chosen component M_{ij} must be zero. So for each component of M, you can find a variation of A forcing that component to zero.
 
  • Like
Likes   Reactions: 1 person
ozone said:
Ahh, I have never seen this property before. Using that I arrived at

\partial_\mu F^{\mu\nu} =A_\mu F^{\mu\nu} - F^{\mu\nu} A_\mu

Is this a reasonable answer for such a problem? Sorry but I have no intuition for the yang-mills and it is a very dense subject.

Looks about right. Often this would be written as

##D_\mu F^{\mu \nu} = 0##

where the covariant derivative ##D## acts on a general matrix-valued field ##G## as

##D_\mu G = \partial_\mu - [A_\mu, G]##

(The brackets are a matrix commutator.)

The covariant derivative in Yang-Mills theory is supposed to closely parallel the covariant derivative in GR. And ##A## and ##F## in Yang-Mills theory are respectively the analogs of the Christofel symbol ##\Gamma## and the Riemann curvature tensor ##R_{\mu\nu\rho\lambda}##
 
  • Like
Likes   Reactions: 1 person
Alright thank you guys. This does seem to closely parallel the stuff were about to be getting into with GR so I am pretty glad to have worked through the problem.. also thank you for linking to that more geometrical derivation.. it was a bit advanced for my taste but it was interesting to see none the less.
 

Similar threads

  • · Replies 30 ·
2
Replies
30
Views
5K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 0 ·
Replies
0
Views
1K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 3 ·
Replies
3
Views
5K
  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 38 ·
2
Replies
38
Views
5K
  • · Replies 9 ·
Replies
9
Views
1K