Intro to elementary index notation

In summary, index notation is a useful tool for simplifying complicated expressions involving cross and dot products. It allows for easy manipulation and derivation of formulas, and the summation convention helps to reduce the amount of writing required. The Kronecker delta and Levi-Civita symbols are important symbols to know and are used frequently in index notation.
  • #1
fantispug
This is a brief tutorial to cover the basics of index notation which are useful when handling complicated expressions involving cross and dot products.
I will skip over a lot of technicalities (such as covariant and contravariant vectors) and focus on 3 dimensions - but all of what I say here can easily be generalised and extended, and I encourage anyone with the background to do so.

Conventions and notation:
I use bold symbols to indicate vectors (invariably 3 dimensional) and use [tex]\bold{\hat{i}},\bold{\hat{j}},\bold{\hat{k}}[/tex] as the unit vectors in the x, y and z directions. The gradient operator is

[tex]\nabla = \frac{\partial}{\partial x}\bold{\hat{i}} + \frac{\partial}{\partial y}\bold{\hat{j}} + \frac{\partial}{\partial z}\bold{\hat{k}}[/tex]

The problem:
Suppose we have some complicated expression, for example
[tex]\nabla \cdot (\bold A \times (\nabla V))[/tex]
where [tex]\bold A[/tex] is some 3 dimensional vector and [tex]V=V(\bold r)[/tex] is some scalar function, and we want to write it in a simpler form.

There are formulas for this sort of thing such as the BAC CAB rule:
[tex]\bold A \times (\bold B \times \bold C) = \bold{B}(\bold{A} \cdot \bold{C}) - \bold{C}(\bold{A} \cdot \bold{B})[/tex]
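but these are derived assuming that the components of the vectors commute with one another. That assumption fails when one of the vectors is [tex]\nabla[/tex], since

[tex](\nabla V)[/tex] does not equal [tex](V \nabla)[/tex]

so such formulas cannot be applied blindly to expressions containing [tex]\nabla[/tex]. However, index notation provides a quick and easy way to derive these types of expressions.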

Indices and the summation convention

Indices allow us to rewrite an expression component by component. For example
[tex]\bold{A}=(A_1,A_2,A_3)[/tex]
[tex]\bold{B}=(B_1,B_2,B_3)[/tex]

So
[tex]\bold{A}\cdot\bold{B}=A_1 B_1 + A_2 B_2 + A_3 B_3=\sum_{i=1}^3 A_i B_i[/tex]
(Clearly this can be generalised to any number of components)

Now for compactness we introduce the (Einstein) summation convention: If an index is repeated we sum over it. So
[tex]A_i B_i=\sum_{i=1}^3 A_i B_i[/tex]
This cuts down on a lot of writing. Note that there must be conservation of unpaired indices, for example
[tex]A_i=B_j C_j D_i[/tex]
is a fine expression - it says the ith component of A is the ith component of D (pre)multiplied by the dot product of B and C, that is
[tex]\bold{A}=(\bold{B} \cdot \bold{C}) \bold{D}[/tex]
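The summation convention maps directly onto loops: a repeated index becomes a sum, a free index becomes the position in the output. Here is a minimal Python sketch of the expression above, with made-up component values and with indices running over 0, 1, 2 instead of 1, 2, 3 (both are choices made for this illustration only).

```python
# Components of three example vectors (arbitrary values chosen for illustration).
B = [1.0, 2.0, 3.0]
C = [4.0, 5.0, 6.0]
D = [7.0, 8.0, 9.0]

# A_i = B_j C_j D_i : j is repeated, so it is summed over; i is the free index.
A = [sum(B[j] * C[j] for j in range(3)) * D[i] for i in range(3)]

# The same thing in vector language: A = (B . C) D
dot_BC = sum(b * c for b, c in zip(B, C))
A_vector = [dot_BC * d for d in D]

print(A)         # [224.0, 256.0, 288.0]
print(A_vector)  # [224.0, 256.0, 288.0]
```

The free index i appears once on each side of the assignment, which is exactly the conservation of unpaired indices described above.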

However
[tex]A_i=B_j C_j [/tex]
only makes sense if it means that all components of A are the same. Even then this is bad notation and it is much better to use
[tex]A_i = B_j C_j I_i [/tex] where [tex]\bold{I}=(1,1,1)[/tex]

If we stick to this kind of convention we always get the same unpaired indices on either side of an expression, in the above case i.

An expression like
[tex]A_i B_i C_i [/tex]
makes no sense with this convention, because the index i appears three times. If you are evaluating an expression such as [tex](\bold{A} \cdot \bold{B})(\bold{C} \cdot \bold{D})[/tex] you must use different indices for each sum, so [tex]A_i B_i C_j D_j[/tex].

Finally, note that paired indices are dummy indices. We can rename them (as long as we rename both members of the pair) to whatever we want, provided the new name is not already being used, without altering the result (because they are summed over). Unpaired indices are not dummy indices.
So we can write:
[tex]A_i B_j C_j = A_i B_k C_k = A_i B_{(cats)} C_{(cats)}[/tex]
(where I take (cats) to represent a single variable) but NOT
[tex] A_i B_j C_j = A_i B_i C_i [/tex] or [tex] A_i B_j C_j = A_k B_j C_j [/tex]

(Note that in the first of the two wrong expressions the right hand side has the index i three times, so it must be wrong, and in the second expression the unpaired index is not conserved - i is on the left hand side but not the right - so it too must be wrong).

So that's a lot of boring detail without much gain, but stick with it and we'll get there.


Multiple indices and Symmetry
It's often useful to have expressions with multiple indices (these represent tensors, in general). If we stick to indices only taking values 1,2,3 then a two index object
[tex]A_{ij}[/tex] represents the elements of a 3x3 matrix (the ith row and the jth column).

If we have two matrices A and B, then their product is (by definition)
[tex](AB)_{ij}=\sum_{k=1}^3A_{ik} B_{kj} = A_{ik} B_{kj}[/tex]
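As an illustration of the formula above, here is a small Python sketch of the matrix product written as an explicit sum over the repeated index k. The matrix entries are arbitrary, and indices run over 0, 1, 2 rather than 1, 2, 3.

```python
# Two example 3x3 matrices stored as lists of rows (values are arbitrary).
A = [[1, 2, 0],
     [0, 1, 3],
     [4, 0, 1]]
B = [[2, 0, 1],
     [1, 1, 0],
     [0, 5, 2]]

# (AB)_ij = A_ik B_kj : k is repeated (summed), i and j are free.
AB = [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
      for i in range(3)]

for row in AB:
    print(row)
# [4, 2, 1]
# [1, 16, 6]
# [8, 5, 6]
```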

Objects with more than 2 indices are not as easy to interpret, so I won't, I'll just use them.

An object with 2 or more indices is symmetric if it is unchanged under interchange of any two indices, e.g.
[tex]S_{ij}=S_{ji}[/tex] is symmetric, as is
[tex]S_{ijk}=S_{kij}=S_{jki}=S_{ikj}=S_{jik}=S_{kji}[/tex]

Note that, if we view [tex]S_{ij}[/tex] as the i-jth matrix element then [tex]S_{ji}=S^{T}_{ij}[/tex] is the i-jth element of the transpose. So a 2 index object is symmetric iff it corresponds to a symmetric matrix.

An antisymmetric object is one that changes sign every time two indices are interchanged, e.g.
[tex]A_{ij}=-A_{ji}[/tex] and
[tex]A_{ijk}=A_{kij}=A_{jki}=-A_{ikj}=-A_{jik}=-A_{kji}[/tex]
(note that the 2nd and 3rd terms in the latter expression each correspond to TWO successive interchanges of indices, so the two negative signs cancel).

Finally, if a symmetric object is contracted (i.e. summed over 2 or more indices) with an antisymmetric object the result is zero. By this I mean if S is symmetric and A is antisymmetric then
[tex]A_{ij}S_{ij}=-A_{ji}S_{ij}=-A_{ji}S_{ji}=-A_{ij}S_{ij}[/tex]
where in the last step I have renamed the dummy indices - switching i and j. The expression equals its own negative, so [tex]A_{ij}S_{ij}=0[/tex]

(This also implies [tex]A_{ijk}S_{ij}=0[/tex] and similar results; all we need is two indices summed over for the argument to work)
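The vanishing of a symmetric-antisymmetric contraction is easy to check numerically. The Python sketch below uses one arbitrarily chosen symmetric matrix S and one arbitrarily chosen antisymmetric matrix A (both are assumptions of this example), with indices running over 0, 1, 2.

```python
R = range(3)

# S_ij = S_ji (symmetric) and A_ij = -A_ji (antisymmetric); entries are arbitrary.
S = [[2, 5, -1],
     [5, 3, 4],
     [-1, 4, 7]]
A = [[0, 1, -2],
     [-1, 0, 6],
     [2, -6, 0]]

# Confirm the symmetry properties before contracting.
is_symmetric = all(S[i][j] == S[j][i] for i in R for j in R)
is_antisymmetric = all(A[i][j] == -A[j][i] for i in R for j in R)

# A_ij S_ij : both indices are repeated, so both are summed over.
contraction = sum(A[i][j] * S[i][j] for i in R for j in R)

print(is_symmetric, is_antisymmetric, contraction)  # True True 0
```

Each off-diagonal term is cancelled by its transposed partner, and the diagonal of A is zero, which is the index-renaming argument above carried out term by term.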


Kronecker Delta and Levi-Civita Symbols

It is handy to use the symbols

Kronecker delta: [tex]\delta_{ij}[/tex] which is 1 if i=j and 0 otherwise.

The Kronecker delta is symmetric
[tex]\delta_{ij}=\delta_{ji}[/tex]
and corresponds to the matrix elements of the 3x3 identity matrix (diag{1,1,1}).

In the sum [tex]\delta_{ij}A_i[/tex] (summed over i) each term is equal to [tex]A_i[/tex] when i=j and 0 otherwise, so only the i=j term survives. So
[tex]\delta_{ij}A_i=A_j[/tex]

(A common mistake is to say [tex]\delta_{ii}=1[/tex] but this is wrong. Why?
[tex]\delta_{ii}=\sum_{i=1}^3 1=3[/tex])
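Both properties of the Kronecker delta can be confirmed with a few lines of Python. This is only a sketch: the function name `delta` and the example vector are choices made here, and indices run over 0, 1, 2.

```python
def delta(i, j):
    """Kronecker delta: 1 if the indices are equal, 0 otherwise."""
    return 1 if i == j else 0

R = range(3)
A = [3, -1, 4]  # arbitrary example vector

# delta_ij A_i : i is summed, j is free, so the result is a vector indexed by j.
contracted = [sum(delta(i, j) * A[i] for i in R) for j in R]

# delta_ii : i is repeated, so this is a sum over the diagonal, not a single entry.
trace = sum(delta(i, i) for i in R)

print(contracted)  # [3, -1, 4]
print(trace)       # 3
```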

Levi-Civita symbol [tex]\varepsilon_{ijk}[/tex] which is 1 if ijk=123 or 312 or 231 and -1 if ijk=132 or 213 or 321 and 0 otherwise. (Sorry for writing this out so horribly).

The Levi-Civita symbol is antisymmetric:
[tex]\varepsilon_{ijk}=\varepsilon_{kij}=\varepsilon_{jki}=-\varepsilon_{ikj}=-\varepsilon_{jik}=-\varepsilon_{kji}[/tex]

The Levi-Civita symbol is related to the Kronecker Delta:
[tex]\varepsilon_{ijk}\varepsilon_{lmn} = \begin{vmatrix} \delta_{il} & \delta_{im}& \delta_{in}\\ \delta_{jl} & \delta_{jm}& \delta_{jn}\\ \delta_{kl} & \delta_{km}& \delta_{kn}\\ \end{vmatrix}[/tex]
Although I have not found this expression to be too useful in practice, contracting the first index of each symbol (setting l=i, summing over i, and then relabelling the remaining indices) gives a very useful expression:
[tex]\varepsilon_{ijk}\varepsilon_{ilm}=\delta_{jl}\delta_{km}-\delta_{jm}\delta_{kl}[/tex]
(Note that the positive term pairs indices that sit in the same position of the two Levi-Civita symbols, j with l and k with m, while the negative term pairs indices in opposite positions, j with m and k with l).

From this you can derive expressions for more summed indices, such as:
[tex]\varepsilon_{ijk}\varepsilon_{ijl}=\delta_{jj}\delta_{kl}-\delta_{jl}\delta_{kj}=3\delta_{kl}-\delta_{kl}=2\delta_{kl}[/tex]
And
[tex]\varepsilon_{ijk}\varepsilon_{ijk}=2\delta_{kk}=6[/tex]
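Since every index only takes three values, these identities can be checked by brute force over all index combinations. The Python sketch below does that; the closed-form expression used for the Levi-Civita symbol is a convenience chosen for this example (it is valid for indices 0, 1, 2), not something taken from the text above.

```python
def delta(i, j):
    """Kronecker delta."""
    return 1 if i == j else 0

def eps(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}.

    The product (i-j)(j-k)(k-i) is +2 for cyclic orders, -2 for
    anticyclic orders, and 0 whenever two indices coincide.
    """
    return (i - j) * (j - k) * (k - i) // 2

R = range(3)

# eps_ijk eps_ilm = delta_jl delta_km - delta_jm delta_kl  (i summed)
one_sum_ok = all(
    sum(eps(i, j, k) * eps(i, l, m) for i in R)
    == delta(j, l) * delta(k, m) - delta(j, m) * delta(k, l)
    for j in R for k in R for l in R for m in R
)

# eps_ijk eps_ijl = 2 delta_kl  (i and j summed)
two_sums_ok = all(
    sum(eps(i, j, k) * eps(i, j, l) for i in R for j in R) == 2 * delta(k, l)
    for k in R for l in R
)

# eps_ijk eps_ijk  (all three indices summed)
full_contraction = sum(eps(i, j, k) ** 2 for i in R for j in R for k in R)

print(one_sum_ok, two_sums_ok, full_contraction)  # True True 6
```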

The Levi-Civita symbol is useful because of its relation to the determinant and the cross product:
[tex]\det A =\varepsilon_{ijk} A_{1i} A_{2j} A_{3k} [/tex]
and more importantly:
[tex](\bold{A} \times \bold{B})_i=\varepsilon_{ijk} A_j B_k [/tex]
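To see both formulas in action, here is a short Python sketch that evaluates the determinant and the cross product directly from the Levi-Civita sums. The matrix M, the vectors, and the closed-form `eps` helper are all assumptions of this example; indices run over 0, 1, 2.

```python
def eps(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2

R = range(3)

# det M = eps_ijk M_1i M_2j M_3k (rows 1, 2, 3 are rows 0, 1, 2 here).
M = [[1, 2, 0],
     [0, 1, 3],
     [4, 0, 1]]
det_M = sum(eps(i, j, k) * M[0][i] * M[1][j] * M[2][k]
            for i in R for j in R for k in R)

# (A x B)_i = eps_ijk A_j B_k : j and k summed, i free.
A = [1, 2, 3]
B = [4, 5, 6]
cross = [sum(eps(i, j, k) * A[j] * B[k] for j in R for k in R) for i in R]

print(det_M)  # 25
print(cross)  # [-3, 6, -3]
```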

That pretty much covers everything we're going to need.

Evaluating Expressions
Let's start with a very easy one:
[tex]\bold{A} \times \bold{A} = 0[/tex]
This is well known, but provides an easy check:
[tex](\bold{A} \times \bold{A})_i=\varepsilon_{ijk}A_jA_k[/tex]
Now [tex]\varepsilon_{ijk}[/tex] is antisymmetric under interchange of j and k, but since [tex]A_j A_k = A_k A_j [/tex] the product [tex]A_j A_k [/tex] is symmetric under interchange of j and k. So the whole expression is zero.

What about the BAC CAB rule?
[tex](\bold A \times (\bold B \times \bold C))_i =\varepsilon_{ijk}A_j(\bold B \times \bold C)_k=\varepsilon_{ijk}A_j\varepsilon_{klm}B_l C_m [/tex]

[tex]=\varepsilon_{kij}\varepsilon_{klm}A_j B_l C_m = (\delta_{il}\delta_{jm}-\delta_{im}\delta_{jl})(A_j B_l C_m)
[/tex]

[tex]=A_m B_i C_m - A_l B_l C_i = B_i A_m C_m - C_i A_l B_l= B_i (\bold A \cdot \bold C) - C_i(\bold A \cdot \bold B)[/tex]
That is, dropping indices:
[tex]\bold A \times (\bold B \times \bold C) = \bold{B}(\bold{A} \cdot \bold{C}) - \bold{C}(\bold{A} \cdot \bold{B})[/tex]
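A quick numerical spot check of the BAC CAB rule, written as a Python sketch with the cross product built from the Levi-Civita sum. The three vectors are arbitrary integers, so the comparison is exact; a check on one set of vectors is of course not a proof.

```python
def eps(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2

R = range(3)

def cross(U, W):
    """(U x W)_i = eps_ijk U_j W_k."""
    return [sum(eps(i, j, k) * U[j] * W[k] for j in R for k in R) for i in R]

def dot(U, W):
    """U . W = U_i W_i."""
    return sum(U[i] * W[i] for i in R)

A = [1, 2, 3]
B = [4, 5, 6]
C = [7, 8, 10]

lhs = cross(A, cross(B, C))                              # A x (B x C)
rhs = [B[i] * dot(A, C) - C[i] * dot(A, B) for i in R]   # B (A.C) - C (A.B)

print(lhs)  # [-12, 9, -2]
print(rhs)  # [-12, 9, -2]
```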

This may look a tad messy, but it is much quicker than the normal way of doing this - expanding it out component by component.

I will write [tex](\nabla)_i=\frac{\partial}{\partial r_i}=\partial_i[/tex]

So let's try a slightly harder one

[tex](\nabla \times (\nabla \times \bold{A}))_i=\varepsilon_{ijk}\partial_j \varepsilon_{klm} \partial_l A_m = \varepsilon_{kij}\varepsilon_{klm}\partial_l\partial_j A_m[/tex]
[tex]=\partial_i\partial_m A_m - \partial_l \partial_l A_i=(\nabla)_i(\nabla \cdot \bold{A}) - \nabla^2 A_i[/tex]
or: [tex]\nabla \times (\nabla \times \bold{A})=\nabla(\nabla \cdot \bold{A}) - \nabla^2 \bold{A}[/tex]
(I have suppressed most of the detail here - once you get the hang of it you should be able to see these steps straight off, but for now, work through them in detail).
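For anyone who wants to compare against their own working, one way to fill in the suppressed step is sketched here. It uses the contracted identity for two Levi-Civita symbols and assumes the partial derivatives commute:
[tex]\varepsilon_{kij}\varepsilon_{klm}\partial_j\partial_l A_m = (\delta_{il}\delta_{jm}-\delta_{im}\delta_{jl})\partial_j\partial_l A_m[/tex]
[tex]=\partial_m\partial_i A_m - \partial_l\partial_l A_i = \partial_i(\partial_m A_m) - (\partial_l\partial_l) A_i[/tex]
Here [tex]\partial_m A_m = \nabla \cdot \bold{A}[/tex] and [tex]\partial_l \partial_l = \nabla^2[/tex].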

I will do one more example, an identity I doubt you'd find in most books and would have to derive for yourself anyway:
[tex]\nabla \cdot (\bold A \times (\nabla V))=\partial_i (\varepsilon_{ijk} A_j \partial_k V)
= \varepsilon_{ijk} (\partial_i (A_j) \partial_k V + A_j \partial_i \partial_k V)[/tex]
where the last step follows from the product rule for derivatives. Note that [tex]\partial_i \partial_k V = \partial_k \partial_i V[/tex] (assuming V is a sufficiently nice function - that is, its second partial derivatives are continuous. This assumption is ok most of the time.) Consequently
[tex]\varepsilon_{ijk} \partial_i \partial_k V = 0[/tex] (Why?)
So
[tex]\nabla \cdot (\bold A \times (\nabla V)) = (\varepsilon_{kij}\partial_i (A_j)) \partial_k V = (\nabla \times \bold{A}) \cdot \nabla V[/tex]
(Again: work through it)
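If you want a sanity check on this identity before trusting your algebra, it can be tested numerically. The Python sketch below approximates the derivatives with central differences and compares both sides at one point. The fields V and A, the step size H, and the evaluation point are all hypothetical choices made for this example, and agreement is only up to finite-difference error.

```python
def eps(i, j, k):
    """Levi-Civita symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2

R = range(3)
H = 1e-3  # finite-difference step (an arbitrary choice for this sketch)

def partial(f, i, r):
    """Central-difference estimate of the derivative of f along r_i at r."""
    up, down = list(r), list(r)
    up[i] += H
    down[i] -= H
    return (f(up) - f(down)) / (2 * H)

def V(r):  # hypothetical scalar field
    x, y, z = r
    return x * y * y + x * z * z

def A(r):  # hypothetical vector field
    x, y, z = r
    return [y * z, x * x, x * y * z]

def grad_V(r):
    return [partial(V, k, r) for k in R]

def curl_A(r):
    # (curl A)_k = eps_kij d_i A_j
    return [sum(eps(k, i, j) * partial(lambda s: A(s)[j], i, r)
                for i in R for j in R) for k in R]

def A_cross_grad_V(r):
    # (A x grad V)_i = eps_ijk A_j d_k V
    a, g = A(r), grad_V(r)
    return [sum(eps(i, j, k) * a[j] * g[k] for j in R for k in R) for i in R]

point = [0.5, -1.2, 0.8]

# Left side: d_i (A x grad V)_i.  Right side: (curl A)_k d_k V.
lhs_value = sum(partial(lambda s: A_cross_grad_V(s)[i], i, point) for i in R)
rhs_value = sum(c * g for c, g in zip(curl_A(point), grad_V(point)))

print(lhs_value, rhs_value)  # the two numbers should agree to several decimal places
```

For these particular fields the exact value of both sides at the chosen point works out to 1.28, so the printed numbers should sit close to that.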

Finally I would like to point out that this is extremely powerful on non-commuting linear operators (see: Quantum Mechanics - particularly useful in deriving commutators) and is a prelude to the notation that is used in relativity.

Do a few examples; you'll find that once you get the hang of it you can derive identities very quickly.
 
  • #2
i just found this from a google search regarding tensors and index notation. it's very helpful, thanks.
 

1. What is elementary index notation?

Elementary index notation is a mathematical notation used to represent and manipulate numbers and variables in a concise and efficient way. It involves using indices or superscripts to represent repeated multiplication or division.

2. What are the benefits of using elementary index notation?

Using elementary index notation can help simplify and condense mathematical expressions, making them easier to understand and work with. It also allows for easier manipulation of variables and calculations involving repeated operations.

3. How do you read and interpret indices in elementary index notation?

The index or superscript in elementary index notation represents the number of times the base number is multiplied or divided by itself. For example, [tex]2^3[/tex] would be read as "2 to the power of 3" or "2 cubed".

4. Can index notation be used for any type of mathematical operation?

Yes, index notation can be used for addition, subtraction, multiplication, and division. It can also be used for more complex operations such as roots and logarithms.

5. How is index notation related to exponents?

Indices in elementary index notation are essentially the same as exponents. They both represent repeated multiplication or division of a base number. In fact, exponents are often used interchangeably with indices in mathematical expressions.
