MHB Notation for vector coordinates in a given basis

Click For Summary
The discussion focuses on the computation of vector coordinates during changes of basis and the application of linear operators, emphasizing the need for clarity without using summation notation. It highlights the distinction between expressing "old" coordinates through "new" ones during a change of basis and vice versa when applying a linear operator. The notation $[v]_{\mathcal{E}}$ is proposed for denoting coordinates of vector $v$ in basis $\mathcal{E}$, facilitating the understanding of transformations and operator matrices. The conversation also touches on the challenges of defining coordinates when bases are not linearly independent and suggests a potential categorical approach to generalizing the concept of coordinates. Overall, the thread seeks to refine the explanation of these mathematical concepts for better comprehension.
Evgeny.Makarov
Gold Member
MHB
Messages
2,434
Reaction score
4
Sorry for a long post. I am looking for a clear and concise way to explain how to compute coordinates when changes of basis or linear operators are involved. I would like to avoid the summation notation as much as possible and use the definition of matrix multiplication only in the beginning, when it is indeed necessary. I would like to be able to explain things like the following.

  • Why is it that when a change of basis occurs, we express the "old" coordinates through the "new" ones, but when a linear operator is applied, we express the "new" coordinates through the "old" ones?
  • How to find the matrix of a linear operator in a different basis?
  • Suppose a linear operator $\varphi$ on $\Bbb R^n$ maps a sequence of vectors $\mathcal{A}=(a_1,\dots,a_n)$ to $\mathcal{B}=(b_1,\dots,b_n)$ and $\mathcal{A}$ is linear independent. How to find the matrix of $\varphi$ in basis $\mathcal{E}$ given coordinates of $\mathcal{A}$ and $\mathcal{B}$ in $\mathcal{E}$?

While it is possible to explain the change of basis referring to "new" and "old" coordinates of a single vector in two bases, applying an operator $\varphi$ to a vector $v$ involves four sets of coordinates:
  • coordinates of $v$ in the initial basis $\mathcal{E}$,
  • coordinates of $\varphi v$ in the initial basis $\mathcal{E}$,
  • coordinates of $v$ in the new basis $\varphi\mathcal{E}$ and
  • coordinates of $\varphi v$ in the new basis $\varphi\mathcal{E}$.
It's no longer enough to use $x$ and $x'$ for coordinates. It really helps if we can say precisely which vector in which basis has which coordinates.

A popular idea is to write $[v]_{\mathcal{E}}$ to refer to coordinates of vector $v$ in basis $\mathcal{E}$. Similarly, $[\varphi]_{\mathcal{E}}$ denotes the matrix of $\varphi$ in $\mathcal{E}$ and if $\mathcal{E}'=(e_1',\dots,e_n')$, then $[\mathcal{E}']_{\mathcal{E}}$ is the matrix with columns $[e_1']_{\mathcal{E}},\dots,[e_n']_{\mathcal{E}}$, i.e., the transition matrix from $\mathcal{E}$ to $\mathcal{E}'$. By definition,
\[
[\varphi]_{\mathcal{E}}=[\varphi\mathcal{E}]_{\mathcal{E}}.\tag{1}
\]
Then we can state and prove the following properties.
\begin{align}
&[\mathcal{E}']_{\mathcal{E}}[v]_{\mathcal{E}'}=[v]_{\mathcal{E}}\tag{2}\\
&[v]_{\mathcal{E}}=[\varphi v]_{\mathcal{\varphi E}}\tag{3}
\end{align}

Using this, we can prove that
\[
[\varphi v]_{\mathcal{E}}=[\varphi]_{\mathcal{E}}[v]_{\mathcal{E}}.\tag{4}
\]
Indeed,
\[
[\varphi v]_{\mathcal{E}}\overset{(2)}{=}[\varphi\mathcal{E}]_{\mathcal{E}}[\varphi v]_{\mathcal{\varphi E}}
\overset{(1)}{=}[\varphi]_{\mathcal{E}}[\varphi v]_{\varphi\mathcal{E}}\overset{(3)}{=}[\varphi]_{\mathcal{E}}[v]_{\mathcal{E}}.\tag{5}
\]

For another example, here is the summary of Deveno's explanation that $[\varphi]_{\mathcal{E}'}=[\mathcal{E}]_{\mathcal{E}'}[\varphi]_{\mathcal{E}}[\mathcal{E}']_{\mathcal{E}}$ https://driven2services.com/staging/mh/index.php?posts/55983/. For any $v$,
\[
[\mathcal{E}]_{\mathcal{E}'}[\varphi]_{\mathcal{E}}[\mathcal{E}']_{\mathcal{E}}[v]_{\mathcal{E}'}
\overset{(2)}{=}
[\mathcal{E}]_{\mathcal{E}'}[\varphi]_{\mathcal{E}}[v]_{\mathcal{E}}
\overset{(4)}{=}
[\mathcal{E}]_{\mathcal{E}'}[\varphi v]_{\mathcal{E}}
\overset{(2)}{=}
[\varphi v]_{\mathcal{E}'}.
\]

This notation seems short and expressive, but unfortunately $[v]_{\mathcal{E}}$ does not make sense if $\mathcal{E}$ is not a basis. So if $\varphi$ is not an isomorphism, then the proof (5) does not quite work.

It is possible to define the inverse operation: if $x$ is a column of numbers, then $(x)_{\mathcal{E}}\overset{\text{def}}{=}\mathcal{E}x$ is the linear combination of vectors from $\mathcal{E}$ with coefficients $x$. This operation is well-defined even if $\mathcal{E}$ are linearly dependent. I have not yet finished rewriting (1)-(4) using this notation, but even if this is possible, I am wondering if the proofs would not be too obscure and giving little insight.

How do authors and lecturers usually deal with this? Also, I am wondering if there is a generalization of the operation of taking coordinates. Perhaps coordinates can be thought of as a morphism in category theory from $V\times\dots\times V$ to $V$ taking a basis into a vector. Maybe such a generalization can give a hint for a suitable notation.

Thank you.
 
Physics news on Phys.org
ThePerfectHacker said:
Who needs change-of-basis when one has commutative diagrams?

Suppose $\alpha: \mathcal A \to \mathcal E$ and $\beta: \mathcal B \to \mathcal E$ are the canonical transformations. And suppose $M_\phi$ is the requested matrix.

View attachment 3078

Then:
$$M_\phi = \beta \circ \varphi \circ \alpha^{-1}$$
 

Attachments

  • Commutator_diagram.png
    Commutator_diagram.png
    1.3 KB · Views: 103
Thread 'How to define a vector field?'
Hello! In one book I saw that function ##V## of 3 variables ##V_x, V_y, V_z## (vector field in 3D) can be decomposed in a Taylor series without higher-order terms (partial derivative of second power and higher) at point ##(0,0,0)## such way: I think so: higher-order terms can be neglected because partial derivative of second power and higher are equal to 0. Is this true? And how to define vector field correctly for this case? (In the book I found nothing and my attempt was wrong...

Similar threads

  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 34 ·
2
Replies
34
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
Replies
8
Views
2K
  • · Replies 27 ·
Replies
27
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K