Insights What Is a Tensor? The mathematical point of view

fresh_42 · Jun 18, 2017

Introduction

Let me start with a counter-question. What is a number? Before you laugh, there is more to this question than one might think. A number can be something we use to count or more advanced an element of a field like real numbers. Students might answer that a number is a scalar. This is the appropriate answer when vector spaces are around. But what is a scalar? A scalar can be viewed as the coordinate of a one-dimensional vector space, the component of a basis vector. It means we stretch or compress a vector. But this manipulation is a transformation, a homomorphism, a linear mapping. So the number represents a linear mapping of a one-dimensional vector space. It also transports other numbers to new ones. Thus it is an element of ##\mathbb{R}^*##, the dual space: ##c \mapsto (a \mapsto \langle c,a \rangle##). Wait! Linear mapping? Aren't those represented by matrices? Yes, it is a ##1 \times 1## matrix and even a vector itself.

Continue reading ...

burakumin · Jun 20, 2017

fresh_42 said:

Definition: A tensor product of vector spaces U⊗V is a vector space structure on the Cartesian product U×V that satisfies ...

If I interpret correctly this sentence says the underlying set of U⊗V is U×V. This is not correct. This works for the direct sum U⊕V but the tensor product is a "bigger" set than U×V. One usual way to encode it is to quotient ##\mathbb{R}^{(U \times V)}## (the set of finitely-supported functions from U×V to ##\mathbb{R}##) by the appropriate equivalence relation.

lavinia · Jun 20, 2017

- Students of General Relativity learn about tensors from their transformation properties. Tensors are arrays of number assigned to each coordinate system that transform according to certain rules. Arrays that do not transform according to these rules are not tensors. I think that it would be helpful to connect this General Relativity approach to the mathematical approach that you have explained in this Insight.

Also these students need to understand how a metric allows one to pass back and forth between covariant and contra-variant tensors. One might show how this is the same as passing between a vector space and its dual.

- Students of Quantum Mechanics learn about tensors to describe the states of several particles e.g. two entangled electrons. In this case, the mathematical definition is more like the Quantum Mechanics definition but for the Quantum Mechanics student it is also important to understand how linear operators act on tensor products of vector spaces.

- If one wants to discuss tensor products purely mathematically, then one might show how they are defined when the scalars are not in a field but in a commutative ring - or even a non-commutative ring. The formal properties do not depend on a field per se.

lavinia · Jun 20, 2017

burakumin said:

If I interpret correctly this sentence says the underlying set of U⊗V is U×V. This is not correct. This works for the direct sum U⊕V but the tensor product is a "bigger" set than U×V. One usual way to encode it is to quotient ##\mathbb{R}^{(U \times V)}## (the set of finitely-supported functions from U×V to ##\mathbb{R}##) by the appropriate equivalence relation.

The tensor product of two 1 dimensional vector spaces is 1 dimensional so it is smaller not bigger than the direct sum. The tensor product of two 2 dimensional vector spaces is 4 dimensional so this is the the same size as the direct sum not bigger.

burakumin · Jun 20, 2017

lavinia said:

The tensor product of two 1 dimensional vector spaces is 1 dimensional so it is smaller not bigger than the direct sum. The tensor product tof two 2 dimensional vector spaces is 4 dimensional so this is the the same size as the direct sum not bigger.

This is correct but missing the relevant point: that the presentation contains a false statement. The fact that you can indeed find counter examples where the direct sum is bigger than the tensor product does not makes the insight presentation any more correct.

fresh_42 · Jun 20, 2017

burakumin said:

If I interpret correctly this sentence says the underlying set of U⊗V is U×V. This is not correct. This works for the direct sum U⊕V but the tensor product is a "bigger" set than U×V. One usual way to encode it is to quotient ##\mathbb{R}^{(U \times V)}## (the set of finitely-supported functions from U×V to ##\mathbb{R}##) by the appropriate equivalence relation.

No, this interpretation was of course not intended, rather a quotient of the free linear span of the set ##U \times V##.
I added an explanation to close this trapdoor. Thank you.

burakumin · Jun 20, 2017

fresh_42 said:

No, this interpretation was of course not intended, rather a quotient of the free linear span of the set ##U \times V##.
I added an explanation to close this trapdoor. Thank you.

Thank you

fresh_42 · Jun 20, 2017

lavinia said:

- Students of General Relativity learn about tensors ...

- Students of Quantum Mechanics learn about tensors ...

- If one wants to discuss tensor products purely mathematically ...

I know, or at least assumed all this. And I was tempted to explain a lot of these aspects. However, as I recognized that this would lead to at least three or four parts, I concentrated on my initial purpose again, which was to explain what kind of object tensors are, rather than to cover all aspects of their applications. It was meant to answer this basic question which occasionally comes up on PF and I got bored retyping the same stuff over and over again. That's why I've chosen Strassen's algorithm as an example, because it uses linear functionals as well as vectors to form a tensor product on a very basic level, which could easily be followed.

Greg Bernhardt · Jun 20, 2017

Great FAQ @fresh_42!

Orodruin · Jun 21, 2017

Let me first say that I think that the Insight is well written in general. However, I must say that I have had a lot of experience with students not grasping what tensors are based on them being introduced as multidimensional arrays. Sure, you can represent a tensor by a multidimensional array, but this does not mean that a tensor is a multidimensional array or that a multidimensional array is a tensor. Let us take the case of tensors in ##V\otimes V## for definiteness. A basis change in ##V## can be described by a matrix that will tell you how the tensor components transform, but in itself this matrix is not a tensor.

Furthermore, you can represent a tensor of any rank with a row or column vector - or (in the case of rank > 1) a matrix for that matter (just choose suitable bases). This may even be more natural if you consider tensors as multilinear maps. An example of a rank 4 tensor being used in solid mechanics is the compliance/stiffness tensors that give a linear relation between the stress tensor and the strain tensor (both symmetric rank 2 tensors). This is often represented as a 6x6 matrix using the basis ##\vec e_1 \otimes \vec e_1##, ##\vec e_2 \otimes \vec e_2##, ##\vec e_3 \otimes \vec e_3##, ##\vec e_{\{1} \otimes \vec e_{2\}}##, ##\vec e_{\{1} \otimes \vec e_{3\}}##, ##\vec e_{\{2} \otimes \vec e_{3\}}## for the symmetric rank 2 tensors. In the same language, the stress and strain tensors are described as column matrices with 6 elements.

fresh_42 · Jun 21, 2017

Orodruin said:

Sure, you can represent a tensor by a multidimensional array, but this does not mean that a tensor is a multidimensional array or that a multidimensional array is a tensor. Let us take the case of tensors in ##V\otimes V## for definiteness. A basis change in ##V## can be described by a matrix that will tell you how the tensor components transform, but in itself this matrix is not a tensor.

You can consider every matrix as a tensor (defining the matrix rank by tensors) or write a tensor in columns, as it is a vector (element of a vector space) in the end. Personally I like to view a tensor product as the solution of a couniversal mapping problem. As I said, I was tempted to write more about the aspect of "How to use a tensor" instead of "What is a tensor" but this would have led to several chapters and the problem "Where to draw the line" were still an open one. Therefore I simply wanted to take away the fears of the term and answer what it is, as I did before in a few threads, where the basic question was about multilinearity and linear algebra and the constituencies of tensors. The intro with the numbers should show that the degree of complexity depends on the complexity of purpose. I simply wanted to shortcut future answers to threads rather than write a book about tensor calculus. That was the main reason for the examples, which can be understood on a very basic level. Otherwise I would have written about the Ricci tensor and tensor fields which I find far more exciting. And I would have started with rings and modules and not with vector spaces. Thus I only mentioned them, because I wanted to keep it short and to keep it easy: an answer for a thread. Nobody on an "A" and probably as well on an "I" level reads a text about what a tensor is.

Orodruin · Jun 21, 2017

Perhaps I was mislead regarding the intended audience from the beginning. I am pretty sure most engineering students will not remember what a homomorphism is without looking it up. Certainly a person at B-level cannot be expected to know this?

In the end, I suspect we would give different answers to the question in the title based on our backgrounds and the expected audience. My students would (generally) not prefer me to give them the mathematical explanation, but instead the physical application and interpretation, more to the effect of how I think you would interpret "how can you use tensors in physics?" or "how do I interpret the meaning of a tensor?"

fresh_42 said:

Nobody on an "A" and probably as well on an "I" level reads a text about what a tensor is.

This must mean I am B-level.

fresh_42 · Jun 21, 2017

Orodruin said:

Perhaps I was mislead regarding the intended audience from the beginning. I am pretty sure most engineering students will not remember what a homomorphism is without looking it up. Certainly a person at B-level cannot be expected to know this?

In the end, I suspect we would give different answers to the question in the title based on our backgrounds and the expected audience. My students would (generally) not prefer me to give them the mathematical explanation, but instead the physical application and interpretation, more to the effect of how I think you would interpret "how can you use tensors in physics?" or "how do I interpret the meaning of a tensor?"

Yes, you are right. My goal was really to say "Hey look, a tensor is nothing to be afraid of." and that's why I wrote

Depending on whom you ask, how many room and time there is for an answer, where the emphases lie or what you want to use them for, the answers may vary significantly.

And to be honest, I'm bad at basis changes, i.e. frame changes and this whole rising and lowering indices is mathematically completely boring stuff. I first wanted to touch all these questions but I saw, that would need a lot of more space. So I decided to write a simple answer and leave the "several parts" article about tensors for the future. Do you want to know where I gave it up? I tried to get my head around the covariant and contravariant parts. Of course I know what this means in general, but what does it mean here? How is it related? Is there a natural way how the ##V's## come up contravariant and the ## V^{*'}s## covariant? Without coordinate transformations? In a categorial sense, it is again a different situation. And as I've found a source where it was just the other way around, I labeled it "deliberate". Which makes sense, as you can always switch between a vector space and its dual - mathematically. I guess it depends on whether one considers ##\operatorname{Hom}(V,V^*)## or ##\operatorname{Hom}(V^*,V)##. But if you know a good answer, I really like to hear it.

This must mean I am B-level.

Well, your motivation can't have been to learn what a tensor is. That's for sure.

Maybe you have been curious about another point of view. As I started, I found there are so many of them, that it would be carrying me away more and more (and thus couldn't be used as a short answer anymore). It is as if you start an article "What is a matrix?" by the sentence: "The Killing form is used to classify all simple Lie Groups, which are classical matrix groups. There is nothing special about it, all we need is the natural representation and traces ... etc." Could be done this way, why not.

This is the skeleton I originally planned:

\subsection*{Covariance and Contravariance}
\subsection*{To Rise and to Lower Indices}
\subsection*{Natural Isomorphisms and Representations}
\subsection*{Tensor Algebra}
\section*{Stress Energy Tensor}
\section*{Cauchy Stress Tensor}
\section*{Metric Tensor}
\section*{Curvature Tensor}
\section*{The Co-Universal Property}
\subsection*{Graßmann Algebras}
\subsection*{Clifford Algebras}
\subsection*{Lie Algebras}
\section*{Tensor Fields}

fresh_42 · Jun 21, 2017

Orodruin said:

I am pretty sure most engineering students will not remember what a homomorphism is without looking it up.

Corrected. Thanks.

WWGD · Jun 21, 2017

I thought it would be nice to have a good understanding of what a singleton ## a \otimes b ## represents in a tensor product. It is one of these things that I have understood and then forgotten many times over.

WWGD · Jun 21, 2017

burakumin said:

If I interpret correctly this sentence says the underlying set of U⊗V is U×V. This is not correct. This works for the direct sum U⊕V but the tensor product is a "bigger" set than U×V. One usual way to encode it is to quotient ##\mathbb{R}^{(U \times V)}## (the set of finitely-supported functions from U×V to ##\mathbb{R}##) by the appropriate equivalence relation.

I think this is done before the moding out and arranging into equivalence classes is done.

fresh_42 · Jun 21, 2017

WWGD said:

I think this is done before the moding out and arranging into equivalence classes is done.

It's the freely generated vector space (module) on the set ##U \times V##. The factorization indeed guarantees the multilinearity and the finiteness of sums which could as well be formulated as conditions to hold.

WWGD · Jun 21, 2017

fresh_42 said:

It's the freely generated vector space (module) on the set ##U \times V##. The factorization indeed guarantees the multilinearity and the finiteness of sums which could as well be formulated as conditions to hold.

I was replying to someone else's post.

fresh_42 · Jun 21, 2017

WWGD said:

I was replying to someone else's post.

Sorry, was a bit in "defensive mode".

WWGD · Jun 21, 2017

fresh_42 said:

Sorry, was a bit in "defensive mode".

No problem.

stevendaryl · Jun 22, 2017

The way that tensors are manipulated implicitly assumes isomorphisms between certain spaces.

If A is a vector space, then A^* is the set of linear functions of type A \rightarrow S (where S means "scalar", which can mean real numbers or complex numbers or maybe something else depending on the setting).

The first isomorphism is A^{**} is isomorphic to A.

The second isomorphism is A^* \otimes B^* is isomorphic to those function of type (A \times B) \rightarrow S that are linear in both arguments.

So this means that a tensor of type T^p_q can be thought of as a linear function that takes q vectors and p covectors and returns a scalar, or as a function that takes q vectors and returns an element of V \otimes V \otimes ... \otimes V (p of them), or as a function that takes p covectors and returns an element of V^* \otimes ... \otimes V^* (p of them), etc.

WWGD · Jun 22, 2017

stevendaryl said:

The way that tensors are manipulated implicitly assumes isomorphisms between certain spaces.

If A is a vector space, then A^* is the set of linear functions of type A \rightarrow S (where S means "scalar", which can mean real numbers or complex numbers or maybe something else depending on the setting).

The first isomorphism is A^{**} is isomorphic to A.

The second isomorphism is A^* \otimes B^* is isomorphic to those function of type (A \times B) \rightarrow S that are linear in both arguments.

So this means that a tensor of type T^p_q can be thought of as a linear function that takes q vectors and p covectors and returns a scalar, or as a function that takes q vectors and returns an element of V \otimes V \otimes ... \otimes V (p of them), or as a function that takes p covectors and returns an element of V^* \otimes ... \otimes V^* (p of them), etc.

Good point; same is the case with Tensor Contraction, i.e., it assumes/makes use of , an isomorphism.

lavinia · Jun 22, 2017

fresh_42 said:

Is there a natural way how the ##V's## come up contravariant and the ## V^{*'}s## covariant? Without coordinate transformations?

Given a linear map between two vector spaces ##L:V →W## then ##L## determines a map of the algebra of tensor products of vectors in ##V## to the algebra of tensor products of vectors in ##W##. This is correspondence is a covariant functor. ##L## also determines a map of the algebra of tensor products of dual vectors in ##W## to the algebra of tensor products of dual vectors in ##V##. This correspondence is a contravariant functor.

One might guess that this is the reason for the terms covariant and contravariant tensor though I do not know the history.

fresh_42 · Jun 22, 2017

lavinia said:

Given a linear map between two vector spaces ##L:V →W## then ##L## determines a map of the algebra of tensor products of vectors in ##V## to the algebra of tensor products of vectors in ##W##. This is correspondence is a covariant functor. ##L## also determines a map of the algebra of tensor products of dual vectors in ##W## to the algebra of tensor products of dual vectors in ##V##. This correspondence is a contravariant functor.

One might guess that this is the reason for the terms covariant and contra-variant tensor though I do not know the history.

Yes, but one could as well say ##T_q^p(V) = \underbrace{V \otimes \ldots \otimes V}_{p-times} \otimes \underbrace{V^* \otimes \ldots \otimes V^*}_{q-times}## has ##p## covariant factors ##V## and ##q## contravariant factors ##V^*## and in this source
http://www.math.tu-dresden.de/~timmerma/texte/tensoren2.pdf (see beginning of section 2.1)
it is done. So what are the reasons for one or the other? The fact which are noted first? Are the first ones always considered contravariant? As someone who tends to confuse left and right I was looking for some possibility to remember a convention, one or the other. So I'm still looking for a kind of natural, or if not possible, at least a canonical deduction.

lavinia · Jun 22, 2017

fresh_42 said:

Yes, but one could as well say ##T_q^p(V) = \underbrace{V \otimes \ldots \otimes V}_{p-times} \otimes \underbrace{V^* \otimes \ldots \otimes V^*}_{q-times}## has ##p## covariant factors ##V## and ##q## contravariant factors ##V^*## and in this source
http://www.math.tu-dresden.de/~timmerma/texte/tensoren2.pdf (see beginning of section 2.1)
it is done. So what are the reasons for one or the other? The fact which are noted first? Are the first ones always considered contravariant? As someone who tends to confuse left and right I was looking for some possibility to remember a convention, one or the other. So I'm still looking for a kind of natural, or if not possible, at least a canonical deduction.

I think I said the same thing. The covariant factors are the tensor products of the vectors, the contravariant are the tensors of the dual vectors.

fresh_42 · Jun 22, 2017

lavinia said:

I think I said the same thing. The covariant factors are the tensor products of the vectors, the contravariant are the tensors of the dual vectors.

I agree. This would be a natural way to look at it. However, the German Wikipedia does it the other way around and the English speaks of considering ##V## as ##V^{**}## and refers to basis transformations as the origin of terminology. I find this a bit unsatisfactory as motivation but failed to find a good reason for a different convention.

lavinia · Jun 22, 2017

fresh_42 said:

I agree. This would be a natural way to look at it. However, the German Wikipedia does it the other way around and the English speaks of considering ##V## as ##V^{**}## and refers to basis transformations as the origin of terminology. I find this a bit unsatisfactory as motivation but failed to find a good reason for a different convention.

In Physics a contravariant vector is thought of as a displacement dx. In Mathematics this corresponds to a 1 form and at each point in space this is a dual vector to the vector space of tangent vectors.

In primitive terms one does not think of the tangent space as its double dual.

WWGD · Jun 22, 2017

fresh_42 said:

I agree. This would be a natural way to look at it. However, the German Wikipedia does it the other way around and the English speaks of considering ##V## as ##V^{**}## and refers to basis transformations as the origin of terminology. I find this a bit unsatisfactory as motivation but failed to find a good reason for a different convention.

I don't know if this matters in terms of equating the two, but the isomorphism between ## V , V^{**} ## is not a natural one.

WWGD · Jun 22, 2017

lavinia said:

I think I said the same thing. The covariant factors are the tensor products of the vectors, the contravariant are the tensors of the dual vectors.

Is there a reason why we group together the (contra/co) variant factors? Why not have , e.g., ## T^p_q = V \ \otimes V^{*} \otimes V... ## , etc ?

lavinia · Jun 22, 2017

WWGD said:

I don't know if this matters in terms of equating the two, but the isomorphism between ## V , V^{**} ## is not a natural one.

One has the isomorphism between ##V## and ##V^{**}##, ##v→v^{**}##, defined by ##v^{**}(w) = w(v)##.

WWGD · Jun 22, 2017

lavinia said:

One has the isomorphism between ##V## and ##V^{**}##, ##v→v^{**}## ,defined by ##v^{**}(w) = w(v)##.

Yes, but AFAIK is not a natural isomorphism, meaning it is not basis-independent. I wonder to what effects/ when this makes a difference.

EDIT: My bad, this isomorphism between a vector space and its double dual _is_ natural; it is the isomorphism ## V \rightarrow V^{*} ## that is not natural. EDIT 2, there may be a natural pairing if V is equipped with a non-degenerate form.

scottdave · Jun 22, 2017

I learned about tensors in college - fluids or thermodynamics, maybe, I cannot recall for sure. I "sort of" got it, but later on in life, I came across this video, which I found useful.

StoneTemplePython · Jun 22, 2017

I liked this article. I am well aware of the proof of correctness of Strassen's algorithm, but had never seen where the idea came from -- nice.

It occurs to me that if people are only interested in certain properties of higher rank tensors and they don't want the object to jump off the page, they may be interested in things like Kronecker products or wedge products.

It probably should be noted that when moving from a 2-D matrix to something like a 3-D or 4-D (or n-D) tensor, is a bit like moving from 2-SAT to 3-SAT... most of the interesting things you'd want to do computationally (e.g. numerically finding eigenvalues or singular values) become NP Hard ( E.g. see: https://arxiv.org/pdf/0911.1393.pdf )

fresh_42 · Jun 22, 2017

WWGD said:

Is there a reason why we group together the (contra/co) variant factors? Why not have , e.g., ## T^p_q = V \ \otimes V^{*} \otimes V... ## , etc ?

The grouping allows a far better handling. There is no advantage in mixing the factors, so why should it be done? Perhaps in case where one considers tensors of ##V = U \otimes U^*##. The applications I know are all for low values of ##p,q## and it only matters how the application of a tensor is defined on another object. Formally one could even establish a bijection like the transposition of matrices. But all of this only means more work in writing without any benefits. E.g. Strassen's algorithm can equally be written as ##\sum u^*\otimes v^* \otimes W## or ##\sum W \otimes u^*\otimes v^*##. Only switching ##u^*,v^*## would make a difference, namely between ##A\cdot B## and ##B \cdot A##.

I once calculated the group of all ##(\varphi^*,\psi^*,\chi)## with ##[X,Y]=\chi([\varphi(X),\psi(Y)])## for all semisimple Lie algebras. Nothing interesting except that ##\mathfrak{su}(2)## produced an exception - as usual. But I found a pretty interesting byproduct for non-semisimple Lie algebras. Unfortunately this excludes physics, I guess.

WWGD · Jun 22, 2017

fresh_42 said:

The grouping allows a far better handling. There is no advantage in mixing the factors, so why should it be done? Perhaps in case where one considers tensors of ##V = U \otimes U^*##. The applications I know are all for low values of ##p,q## and it only matters how the application of a tensor is defined on another object. Formally one could even establish a bijection like the transposition of matrices. But all of this only means more work in writing without any benefits. E.g. Strassen's algorithm can equally be written as ##\sum u^*\otimes v^* \otimes W## or ##\sum W \otimes u^*\otimes v^*##. Only switching ##u^*,v^*## would make a difference, namely between ##A\cdot B## and ##B \cdot A##.

I once calculated the group of all ##(\varphi^*,\psi^*,\chi)## with ##[X,Y]=\chi([\varphi(X),\psi(Y)])## for all semisimple Lie algebras. Nothing interesting except that ##\mathfrak{su}(2)## produced an exception - as usual. But I found a pretty interesting byproduct for non-semisimple Lie algebras. Unfortunately this excludes physics, I guess.

Thanks, but aren't there naturally-occurring tensors in which the factors are mixed? What do you then do?

fresh_42 · Jun 22, 2017

StoneTemplePython said:

It probably should be noted that when moving from a 2-D matrix to something like a 3-D or 4-D (or n-D) tensor, is a bit like moving from 2-SAT to 3-SAT... most of the interesting things you'd want to do computationally (e.g. numerically finding eigenvalues or singular values) become NP Hard ( E.g. see: https://arxiv.org/pdf/0911.1393.pdf )

Yes, interesting, isn't it? This tiny difference between ##2## and ##3## which decides, whether we're too stupid to handle those problems, or whether there is a system immanent difficulty. And lower bounds are generally hard to prove. I know that Strassen lost a bet on ##NP = P##. I've forgotten the exact year, but he thought we would have found out something in the 90's. But I guess he enjoyed the journey in a balloon over the Alps anyway.

fresh_42 · Jun 22, 2017

WWGD said:

Thanks, but aren't there naturally-occurring tensors in which the factors are mixed? What do you then do?

Perhaps if you consider tensor algebras of ##\operatorname{Hom}(V,V^*)## or similar. I would group them pairwise in such a case: all even indexed ##V## and all odd indexed ##V^*##. This is what I really learned about tensors: it all heavily depends on what you want to do.

WWGD · Jun 22, 2017

fresh_42 said:

Perhaps if you consider tensor algebras of ##\operatorname{Hom}(V,V^*)## or similar. I would group them pairwise in such a case: all even indexed ##V## and all odd indexed ##V^*##. This is what I really learned about tensors: it all heavily depends on what you want to do.

Maybe we are referring to different things, but if we have a multilinear map defined on , say, ## V \otimes V^{*} \otimes V ## then the map would be altered by defining it on ## V \otimes V \otimes V^{*} ##, wouldn't it?

fresh_42 · Jun 22, 2017

Say we have a map ##V \otimes V^* \otimes V = V_1 \otimes V^* \otimes V_2 \longrightarrow W##, then it is an element of ##V_1 \otimes V^* \otimes V_2 \otimes W## which could probably be grouped as ##V^* \otimes V_1 \otimes V_2 \otimes W## and we have the original grouping again. I don't know of an example, where the placing of ##V^*## depends on the fact, that it is in between the copies of ##V##. As soon as algebras play a role, we factor their multiplication rules anyway. Or even better in a way such that the contravariance of ##W## is respected.

WWGD · Jun 22, 2017

I don't know if you mentioned this, but I think another useful perspective here is that the tensor product also defines a map taking a k-linear map into a linear map ( on the tensor product ; let's stick to vector spaces over ## \mathbb R ## and maps into the Reals, to keep it simple for now) , so that there is a map taking, e.g., the dot product ( as a bilinear map, i.e., k=2 ) on ## \mathbb R^2 \times \mathbb R^2 ## into a linear map defined on ## \mathbb R^2 \otimes \mathbb R^2 ## ( Into the Reals, in this case ), so we have a map from {## K :V_1 \times V_2 \times...\times V_k ##} to {## L:V_1 \otimes V_2 \otimes...\otimes V_k ##} , where K is a k-linear map and L is linear. This perspective helps me understand things better.

fresh_42 · Jun 22, 2017

I listed my originally intended chapters here:
https://www.physicsforums.com/threads/what-is-a-tensor-comments.917927/#post-5788263
where universality, natural isomorphisms and what else comes to mind considering tensors would have been included, but this tended to became about 40-50 pages and I wasn't really prepared for such a long explanation ... And after this debate here, I'm sure that even then there would have been some who thought I left out an essential part or described something differently from what they are used to and so on. Would have been interesting to learn more about the physical part of it, the more as a tensor to me is merely a multilinear product, which only gets interesting if a subspace is factored out. If there only wasn't these coordinate transformations and indices wherever you look. :wideeyed:

stevendaryl · Jun 23, 2017

WWGD said:

Is there a reason why we group together the (contra/co) variant factors? Why not have , e.g., ## T^p_q = V \ \otimes V^{*} \otimes V... ## , etc ?

Isn't this just a matter of convention? If you have a tensor t of type V \otimes V^* \otimes V, anything you want to do with t, you can do the analogous thing with the tensor t' of type V \otimes V \otimes V^*. There is only a notational difficulty, which is indicating which arguments of one tensor are contracted with which other arguments of a different tensor. But the Einstein summation convention makes this explicit.

WWGD · Jun 23, 2017

stevendaryl said:

Isn't this just a matter of convention? If you have a tensor t of type V \otimes V^* \otimes V, anything you want to do with t, you can do the analogous thing with the tensor t' of type V \otimes V \otimes V^*. There is only a notational difficulty, which is indicating which arguments of one tensor are contracted with which other arguments of a different tensor. But the Einstein summation convention makes this explicit.

I meant not just for contraction but for describing the general type ( co- and contra- variant) of the tensor; I was wondering if a tensor with mixed components , like ## V \otimes V^{*} \otimes V \otimes V^{*}... ## could always be expressible as ## V \otimes V \otimes ...V^{*} \otimes V^{*}... ## ., though I agree that when contracting the order does not matter. Basically, could we use contraction to show the two above types are equivalent? I am being kind of lazy, let me try it.

burakumin · Jun 25, 2017

Concerning this point:

lavinia said:

Given a linear map between two vector spaces ##L:V →W## then ##L## determines a map of the algebra of tensor products of vectors in ##V## to the algebra of tensor products of vectors in ##W##. This is correspondence is a covariant functor. ##L## also determines a map of the algebra of tensor products of dual vectors in ##W## to the algebra of tensor products of dual vectors in ##V##. This correspondence is a contravariant functor.

One might guess that this is the reason for the terms covariant and contravariant tensor though I do not know the history.

fresh_42 said:

I agree. This would be a natural way to look at it. However, the German Wikipedia does it the other way around and the English speaks of considering ##V## as ##V^{**}## and refers to basis transformations as the origin of terminology. I find this a bit unsatisfactory as motivation but failed to find a good reason for a different convention.

This is what Laurent Schwartz writes in "Les Tenseurs" in 1975:

Ces règles sont bien commodes pour les calculs techniques, mais elles ont pour base une erreur historique, qui n'a pas fini de canuler l'humanité pour plusieurs siècles. Elles furent établies à une époque où on manipulait plus les coordonnées que les vecteurs. Elles aboutissent ainsi à appeler contravariant (contra = contre) ce qui est relatif à ##E##, covariant (co = avec) ce qui est relatif à ##E^*## ! Dans tous les raisonnements théoriques utilisant des produits tensoriels (et ils couvrent aujourd'hui toutes les mathématiques), c'est une catastrophe. Un vecteur de ##E## (resp. ##E^*##) est appelé tenseur contravariant (resp. covariant) ! Ce qui est vrai (et c'est de là que vient l'apellation) c'est que le système de coordonnées d'un vecteur (i.e. "le tenseur ##x^i = \langle \epsilon^i, x \rangle##", ##\epsilon^i## formant la base duale) est contravariant ; mais, dans les mathématiques modernes, un vecteur est autre chose que le système de ses coordonnées ! Il aurait fallu appeler tenseur covariant un élément de ##E##, tenseur contravariant un élément de ##E^*##, quitte à faire remarquer que les coordonnées varient en sens inverse.

which can be translated by:

These rules are very convenient for technical calculations, but they are based on a historical error, which will continue to play a joke on humanity for several centuries. They were established at a time when coordinates were manipulated more than vectors. They results in calling contravariant (contra = against) what is relative to ##E##, covariant (co = with), which is relative to ##E ^ *## ! In every theoretical reasoning using tensorial products (and they cover all mathematics today), it is a catastrophe. A vector of ##E## (respectively ##E ^ *##) is called a contravariant tensor (respectively a covariant tensor)! What is correct (and this is the origin of such a terminology) is that the system of coordinates of a vector (ie, the tensor ##x^i = \langle \epsilon^i, x \rangle##", ##\epsilon^i## being the dual basis) is contravariant; But in modern mathematics a vector is something else than the system of its coordinates! An element of ##E## should have been called covariant tensor, an element of ##E ^ *## contravariant tensor, even if we point out that the coordinates vary in the opposite direction.

lavinia · Jun 25, 2017

If I understand the point: if one writes a vector in terms of a basis then its coefficients are picked out by the dual basis. So the coefficients are contravariant.

lavinia · Jun 25, 2017

An clear exposition of the Physics approach to tensors is in Leonard Susskind's Lectures on General Relativity starting somewhere around minute 40 in lecture 3.

Deepak Solanki · Aug 29, 2017

"A scalar can be viewed as the coordinate of one dimensional vector space, the component of a basis Vector."
Respected Sir,
can you please explain this statement that you made in your answer?

fresh_42 · Aug 29, 2017

Deepak Solanki said:

"A scalar can be viewed as the coordinate of one dimensional vector space, the component of a basis Vector."
Respected Sir,
can you please explain this statement that you made in your answer?

If we have a ##1-##dimensional vector space ##V## with a basis vector ##\vec{b}##, then all vectors ##\vec{v}## can be written ##\vec{v}=c \cdot \vec{b}##. This means ##c## is the scalar, which transforms ##\vec{b}## to ##\vec{v}##, the coordinate of ##\vec{v}## in the basis ##\{\vec{b}\}## and the component of ##\vec{v}## with respect to ##\vec{b}##. And it constitutes an isomorphism ##c \leftrightarrow \vec{v}## between the field ##\mathbb{F}## and ##V##.

WWGD · Sep 2, 2017

Deepak Solanki said:

"A scalar can be viewed as the coordinate of one dimensional vector space, the component of a basis Vector."
Respected Sir,
can you please explain this statement that you made in your answer?

Think of the Reals as a vector space over itself. Then any vector/Real number is a multiple of any non-zero number. Generalize this to any other 1D v. space over the Reals.

Thuring · Oct 13, 2017

This cut from Wikipedia shows a motive of using tensors:

"Because they express a relationship between vectors, tensors themselves must be independent of a particular choice of basis. The basis independence of a tensor then takes the form of a https://www.physicsforums.com/x-dictionary:r:'Covariant_transformation?lang=en&signature=com.apple.DictionaryApp.Wikipedia' that relates the array computed in one basis to that computed in another one. "

I believe this might be one of the most important characteristics of tensors for differential geometry and general relativity. (both essentially over my head)

Thanks for taking the time and effort to write this article.

Insights What Is a Tensor? The mathematical point of view

Introduction

Similar threads

Hot Threads

I How to show ##p(x)=g(x)x\pm 1\in\Bbb{Q}[x]## is irreducible in ##\Bbb{Q}_{\Bbb{Z}}[x]##?

I Showing ##k[x_1,\ldots,x_n]/\mathfrak{a}## is finite dimensional

A Near-Rings with Noncommutative Addition and Two-Sided Distributivity

I How do we distinguish two different notations for cokernel and coimage?

I Localising a non integral domain at a prime

Recent Insights

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers

Insights Fermat's Last Theorem

Insights Why Vector Spaces Explain The World: A Historical Perspective

Insights What Is a Tensor? The mathematical point of view

Introduction​

Similar threads

Hot Threads

I How to show ##p(x)=g(x)x\pm 1\in\Bbb{Q}[x]## is irreducible in ##\Bbb{Q}_{\Bbb{Z}}[x]##?

I Showing ##k[x_1,\ldots,x_n]/\mathfrak{a}## is finite dimensional

A Near-Rings with Noncommutative Addition and Two-Sided Distributivity

I How do we distinguish two different notations for cokernel and coimage?

I Localising a non integral domain at a prime

Recent Insights

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers

Insights Fermat's Last Theorem

Insights Why Vector Spaces Explain The World: A Historical Perspective

Introduction