Matrix Representations of Linear Transformations - Comments

• Insights

Related Linear and Abstract Algebra News on Phys.org
fresh_42
Mentor
Maybe I'm wrong and it's my browser but it seems there are non compiled latex lines in the text.

Maybe I'm wrong and it's my browser but it seems there are non compiled latex lines in the text.
Fixed, thanks!

S.G. Janssens
Thank you for the nice article!

I hope it will help beginning students to avoid the kind of confusion that I used to experience. (I think that part of this confusion is due to the fact that in physics literature, one usually doesn't distinguish between an operator and its matrix representation and, often, one also omits the specification of the underlying bases. For trained readers this is usually not a problem, but for students just coming from an LA course and looking to apply the theory in physics problems, I believe this can cause unnecessary difficulties.)

One typo:

In the line starting with: "We just define T to be the unique linear ##T:X→Y## such that (...)" you probably meant to write
$$T e_j = \sum_{i=1}^m{T_{ij}f_i}$$
since at this point in the text you have not yet assumed that ##X = Y##, etc.

Two suggestions:

1. I think it would make the article even better if you would also discuss a second example, this time of an operator acting on an abstract (but still finite dimensional) vector space, such as a space of polynomials or so. This way, it becomes clear that a vector / matrix and its representation w.r.t. a basis are really two different things, and it also showcases the power of matrix representations when doing computations with abstract operators.

2. Perhaps, alluding to your remark on QM at the end, it would be nice if you would write a follow-up on how this generalizes quite easily to bounded linear operators on separable Hilbert spaces. Then, you could also comment on what happens when you replace a bounded operator with an unbounded (differential) operator, which is typically the case physicists encounter when studying QM.
Hopefully you do not consider these comments an interference, but rather an expression of my enthusiasm for the subject and the attention that it has recently received on PF.

Last edited:
lavinia
Gold Member
As pointed out by Krylov

"Given an m×n matrix M, there’s a simple way to define a linear transformation T:X→Y such that the matrix representation of T with respect to (A,B) is M. We just define T to be the unique linear ##T:X→Y ## such that ##Tej=∑ni=1Tijei## for all j∈{1,…,n}."

only works if ##X=Y##. A matrix determines a linear transformation for each choice of basis for ##X## and ##Y##. Without a choice of bases, the matrix does not determine a linear transformation.

fresh_42
Mentor
A matrix determines a linear transformation for each choice of basis for XX and YY. Without a choice of bases, the matrix does not determine a linear transformation.
But given a matrix the unity vectors in both spaces always define a natural basis to which the matrix is a linear transformation.

Staff Emeritus
Gold Member
One typo:

In the line starting with: "We just define T to be the unique linear ##T:X→Y## such that (...)" you probably meant to write
$$T e_j = \sum_{i=1}^m{T_{ij}f_i}$$
Good catch. That line isn't present in the last draft I that I discussed with other people (in February 2013) before I turned it into a FAQ post (in June 2013...I'm pretty slow apparently), so I must have put it in later and not proofread it well enough.

Two suggestions:

1. I think it would make the article even better if you would also discuss a second example, this time of an operator acting on an abstract (but still finite dimensional) vector space, such as a space of polynomials or so. This way, it becomes clear that a vector / matrix and its representation w.r.t. a basis are really two different things, and it also showcases the power of matrix representations when doing computations with abstract operators.

2. Perhaps, alluding to your remark on QM at the end, it would be nice if you would write a follow-up on how this generalizes quite easily to bounded linear operators on separable Hilbert spaces. Then, you could also comment on what happens when you replace a bounded operator with an unbounded (differential) operator, which is typically the case physicists encounter when studying QM.
Hopefully you do not consider these comments an interference, but rather an expression of my enthusiasm for the subject and the attention that it has recently received on PF.
Your comments are welcome, and I like your suggestions. Unfortunately I don't have a lot of time to improve this post right now. If you would like to do it, I'm more than OK with that.

The LaTeX can be improved. When I wrote this in 2013, LaTeX behaved differently here. There was no automatic numbering of equations for example. I would like to make sure that only those equations that should be numbered are numbered. Removing all the numbers is also an option. Also, the equation that begins with Tx= wasn't split over two lines before. It needs an explicit line break followed by an alignment symbol. (I could edit the post when it was a normal FAQ post. I don't think I can now that it's an Insights post).

Greg Bernhardt
WWGD
Gold Member
2019 Award
I also think the bases selected in each of ##X,Y ## both have to be ordered bases for there to be an isomorphism between ## L(X,Y) ## , linear maps between ## X,Y ## and ##M_{n \times m}(R)## , where ##R## is the Ring; ## M_{n \times m}(R) ## is the space of matrices with coefficients in the ring and##X,Y ## are (free, of course) ##R ##-modules (both right- or left-, I think); I think this is the most general scope of the isomorphism

Last edited:
lavinia
Gold Member
But given a matrix the unity vectors in both spaces always define a natural basis to which the matrix is a linear transformation.
Yes if one already has two bases then the matrix defines a linear map. But there is no natural given basis for a vector space. You need to select one. Not sure what you mean by the unity vectors.

fresh_42
Mentor
Yes if one already has two bases then the matrix defines a linear map. But there is no natural given basis for a vector space. You need to select one. Not sure what you mean by the unity vectors.
Physicists probably write them ##e_i = (δ_{ij})_j##. I learned unit vectors. Ok, it's not the i-th basis vector but the coordinate representation of the i-th basis vector. But that is hair-splitting. To awake the impression that a matrix isn't a linear transformation is negligent. There is always a basis to which the matrix is a linear transformation. And in the finite dimensional case even without the use of the axiom of choice. I just wanted to avoid someone saying: "But I've read on the internet that a matrix isn't a linear transformation." The discussion distinguishing between the vectors themselves and their coordinate representation is in my opinion something for specialists and logicians.

lavinia
Gold Member
Physicists probably write them ##e_i = (δ_{ij})_j##. I learned unit vectors. Ok, it's not the i-th basis vector but the coordinate representation of the i-th basis vector. But that is hair-splitting. To awake the impression that a matrix isn't a linear transformation is negligent. There is always a basis to which the matrix is a linear transformation. And in the finite dimensional case even without the use of the axiom of choice. I just wanted to avoid someone saying: "But I've read on the internet that a matrix isn't a linear transformation." The discussion distinguishing between the vectors themselves and their coordinate representation is in my opinion something for specialists and logicians.
Hmmm. The point I was trying to make is that a matrix determines a continuum of linear transformations each of which depends on a choice of basis.

fresh_42
Mentor
Hmmm. The point I was trying to make is that a matrix determines a continuum of linear transformations each of which depends on a choice of basis.
Good point. I remember I had my difficulties, too, when I first learned the concept. All of a sudden there surfaced matrices ##T## and ##T^{-1}## surrounding my original ##A##....or even worse ##T## and ##S^{-1}##

WWGD
Gold Member
2019 Award
Physicists probably write them ##e_i = (δ_{ij})_j##. I learned unit vectors. Ok, it's not the i-th basis vector but the coordinate representation of the i-th basis vector. But that is hair-splitting. To awake the impression that a matrix isn't a linear transformation is negligent. There is always a basis to which the matrix is a linear transformation. And in the finite dimensional case even without the use of the axiom of choice. I just wanted to avoid someone saying: "But I've read on the internet that a matrix isn't a linear transformation." The discussion distinguishing between the vectors themselves and their coordinate representation is in my opinion something for specialists and logicians.
But a matrix does not necessarily describe a linear transformation (sorry if this is not what you mean). It can represent the adjacency conditions of a graph, a Markov process, etc. If you mean that there is a bijection ( isomorphism) between linear maps and matrices, then I agree.

fresh_42
Mentor
But a matrix does not necessarily describe a linear transformation (sorry if this is not what you mean). It can represent the adjacency conditions of a graph, a Markov process, etc. If you mean that there is a bijection ( isomorphism) between linear maps and matrices, then I agree.
That's a good one. But to be honest, e.g. Markov processes didn't come to my mind in a thread about linear transformations.
It reminds me on a test I once recorded. The student could perfectly define a linear transformation and was asked about an example. The professor would had been satisfied with a rotation or just a matrix. Unfortunately for the student he couldn't tell one. I remember this because I still wonder what the professor would have answered on my example. I would have answered: 0. (And 1 next.)
But of course you are completely right: a matrix is nothing else as any elements of any set ordered in a rectangle. Or a movie ...

Last edited:
WWGD
Gold Member
2019 Award
Hmmm. The point I was trying to make is that a matrix determines a continuum of linear transformations each of which depends on a choice of basis.
I don't know if this gets you into Philosophy, but isn't a linear transformation expressed in different bases essentially the same linear transformation, i.e., given ##L## in any one basis, then the set { ## S^{-1}LS ##} for any (invertible) matrix S just one linear transformation?

lavinia
Gold Member
I don't know if this gets you into Philosophy, but isn't a linear transformation expressed in different bases essentially the same linear transformation, i.e., given ##L## in any one basis, then the set { ## S^{-1}LS ##} for any (invertible) matrix S just one linear transformation?
Yes it is the same. But its matrix representation changes by a conjugation - at least for a linear map of a vector space into itself.
Perhaps the insight should explain this by showing how the matrix changes for a change of basis.

S.G. Janssens
Physicists probably write them ##e_i = (δ_{ij})_j##. I learned unit vectors. Ok, it's not the i-th basis vector but the coordinate representation of the i-th basis vector. But that is hair-splitting.
Really? Take ##P_2##, the vector space of, say, real polynomials of order ##\le 2##, including the zero polynomial. There are no unit vectors here. (We cannot even normalize to unity because there is no norm chosen yet.) Let's pick a basis, perhaps ##\mathcal{A} := \{1, x, x^2\}## and consider ##p \in P_2## defined by ##p(x) = 6 - x^2##. Then its coordinate vector is ##[p]_{\mathcal{A}} = [6, 0, -1] \in \mathbb{R}^3##. However, with respect to the basis ##\mathcal{B} = \{2 - x, x, -x^2\}## we have ##[p]_{\mathcal{B}} = [3, 3, 1]_{\mathcal{B}} \in \mathbb{R}^3##. Also, the representation of the first basis vector in ##\mathcal{A}## with respect to ##\mathcal{B}## is ##[1]_{\mathcal{B}} = [\tfrac{1}{2},\tfrac{1}{2},0] \in \mathbb{R}^3##, etc.

Especially when learning LA, it is very important that students, in mathematics and physics alike, distinguish between ##p##, ##[p]_{\mathcal{A}}## and ##p_{\mathcal{B}}##. (Later on, they may learn that these vectors are related through isomorphisms, but that is not how one starts.) It is also crucial when doing computations, for example in numerical analysis.
To awake the impression that a matrix isn't a linear transformation is negligent.
Nobody awoke this impression, but as we have just seen, one has to be precise, especially when dealing with vector spaces different from ##\mathbb{R}^n## or ##\mathbb{C}^n##.
There is always a basis to which the matrix is a linear transformation. And in the finite dimensional case even without the use of the axiom of choice.
In combination with your earlier comments on unit vectors, you seem to suggest that for infinite dimensional vector spaces every matrix defines a linear transformation on the space. This is already false for separable Hilbert spaces. Take the sequence space ##\ell_2## with the canonical basis (yes indeed, the one consisting of the unit vectors ##\{(\delta_{mn})_{m = 1}^{\infty} \,:\,n \in \mathbb{N}\}##) and consider the infinite matrix ##M = (\delta_{mn}n)_{m,n=1}^{\infty}##. Then ##x = [n^{-1}]_{n=1}^{\infty}## is in ##\ell_2## but ##Mx = [1]_{n=1}^{\infty}## is not.

In fact, by Parseval's identity there is no orthonormal basis of ##\ell_2## with respect to which ##M## represents a linear operator.
The discussion distinguishing between the vectors themselves and their coordinate representation is in my opinion something for specialists and logicians.
No, it is not, as we have already seen in the example on ##P_2##. It is part of any decent first course on linear algebra. On the other hand, if by "specialists" are meant people that actually know what they are talking about, then I agree.

Yes, I'm quite irritated. Your post lacks any, well... insight, and is one of those that sometimes makes me wonder whether I'm wasting my time here.

Last edited:
micromass
Staff Emeritus
Gold Member
I have made some minor edits. (Greg showed me how). I fixed the mistake that Krylov found, removed the equation numbers, and made some minor changes to the language.

I got a comment about my usage of the term "n-tuple". I have always felt that it's unnecessary to say "ordered n-tuple", since no one uses the term "n-tuple" to refer to a set of cardinality n. How do you guys feel about this? Do you feel that my usage is like saying "line" instead of "straight line", or that it's plain wrong?

Greg Bernhardt
Great work Fredrik!

fresh_42
Mentor
since no one uses the term "n-tuple" to refer to a set of cardinality n
Haven't met anyone either. If it's not ordered you won't say tuple. Even the notation in round brackets implies it's ordered, imao.