Very specific question about index notation

mindarson · Jul 31, 2013

I am reading through this text

http://www.ita.uni-heidelberg.de/~dullemond/lectures/tensor/tensor.pdf

and am having a bit of trouble with one of the arguments that is put in index notation. Specifically, equation (3.3). I was wondering if anyone could have a look at it and clear up a confusion for me.

I understand the argument, i.e. that the 'old definition' (eqn (3.2) in the text) of the inner product is not invariant under coordinate transformation in general, which is why we need covectors, covariant components, etc.

My specific question is about how the index notation is used in eqn (3.3). The authors write that

s' = <a',b'> = A^μ _αa^αA^μ _βb^β = (A^T)^μ _αA^μ _βa^αb^β (3.3)

They then argue that this shows that only if A^-1 = A^T (so the 2 matrices together equal δ_βα) (i.e. only if the transformation is orthonormal) will the inner product actually come out to the same value that it had in the untransformed coordinate system.

My question is how to express the inverse and transpose of a matrix in index notation. Where did the transpose come from in the 3rd equality, and why did the indices on it not change position at all? How is the relationship between a matrix, its transpose, and its inverse expressed in index notation? How, exactly, do the authors read off from (3.3) the fact that A^-1 must equal A^T?

I do understand that, to complete the argument, we ultimately need α = β, but how does one get, in practice, from 2 matrices to the Kronecker delta? What would the multiplication of a matrix by its inverse to get the Kronecker delta actually look like when written out?

I understand the argument, but I need clarification on how the argument is being expressed specifically using index notation.

Thanks for any help you can give!

Bill_K · Jul 31, 2013

This author is being very careless. Eq 3.3 is invalid, since it has two μ's upstairs. It should be written

A^μ_α a^α A_μ^β b_β

The transpose of a tensor is obtained the same way as the transpose of a matrix - by interchanging rows and columns. So A^μ_α = (A^T)_α^μ. Thus we have

(A^T)_α^μ A_μ^β a^α b_β

The inverse of A_μ^β is defined as the tensor (A^-1)_α^μ such that

(A^-1)_α^μ A_μ^β = δ^β_α

and thus comparing the last two eqs we have (A^T)_α^μ = (A^-1)_α^μ

dextercioby · Jul 31, 2013

Hi Bill, it's valid as he didn't assume that <mu> upstairs differs from <mu> downstairs. Actually he uses the metric as the unit matrix, so he's free to place the indices wherever he wants. It's like special relativity with x₄=ict.

Bill_K · Jul 31, 2013

Are you sure? He does draw a distinction between co- and contravariant indices. In fact he says earlier

To make further distinction between contravariant and covariant vectors we will put the contravariant indices (i.e. the indices of contravariant vectors) as superscript and the covariant indices (i.e. the indices of covariant vectors) with subscripts

dextercioby · Jul 31, 2013

Then you're right and the notes are badly written.

Very specific question about index notation

FAQ: Very specific question about index notation

1. What is index notation and how is it used in scientific equations?

2. How do I convert a formula written in traditional notation to index notation?

3. Are there any advantages to using index notation over traditional notation?

4. Can index notation be used in all scientific fields?

5. Are there any common mistakes to avoid when using index notation?

Similar threads

Hot Threads

Recent Insights