Calculating Matrix Representation of Linear Function in New Basis

In summary: M. Similarly, ##c_i=N e_i=\sum_j (Ne_i)_j e_j =\sum_j ([N]_{E,E})_{ji} e_j##. It's convenient and relatively harmless to simplify the notation from ##[N]_{E,E}## to... N.So, how do ##[f]_{C,B}## and ##[f]_{E,E}## relate to M and N? (Note that there's no reason why M and N should be the same linear operator. It could be that some of the ejs are vectors in ##\mathbb R^n## and some of them are vectors in ##\mathbb R^m##. It could also be that n
  • #1
squenshl
479
4

Homework Statement


Let ##f : \mathbb{R}^n \rightarrow \mathbb{R}^m## be a linear function. Suppose that with the standard bases for ##\mathbb{R}^n## and ##\mathbb{R}^m## the function ##f## is represented by the matrix ##A##. Let ##b_1, b_2, \ldots, b_n## be a new set of basis vectors for ##\mathbb{R}^n## and ##c_1, c_2, \ldots, c_m## be a new set of
basis vectors for ##\mathbb{R}^m##. What is the matrix that represents ##f## when the linear spaces are described in terms of the new basis vectors?

Homework Equations

The Attempt at a Solution


Suppose ##f : \mathbb{R}^n \rightarrow \mathbb{R}^n## is represented by the matrix ##A## when we describe ##\mathbb{R}^n## in terms of the standard basis vectors ##e_1, e_2, \ldots, e_n## and that we have a new set of basis vectors ##b_1, b_2, \ldots, b_n##. Then when ##\mathbb{R}^n## is described in terms of these new basis vectors the linear function ##f## will be represented by the matrix ##B^{-1}AB##.
 
Physics news on Phys.org
  • #2
What about the set of basis vectors ##c_1, c_2, \ldots, c_m## for ##\mathbb{R}^m##?
 
  • #3
Here's an example with ##n=m=2##. Suppose that with the standard bases for ##\mathbb{R}^n## and ##\mathbb{R}^m## the function ##f## is represented by the matrix ##\begin{bmatrix}
3 & 1 \\
1 & 2
\end{bmatrix}##. Let ##
\begin{bmatrix}
3 \\
2
\end{bmatrix}## and ##
\begin{bmatrix}
1 \\
1
\end{bmatrix}## be a new set of vectors for ##\mathbb{R}^2##. The matrix that represents ##f## when ##\mathbb{R}^2## is described in terms of the new basis vectors is $$B^{-1}AB =
\begin{bmatrix}
1 & -1 \\
-2 & 3
\end{bmatrix}
\begin{bmatrix}
3 & 1 \\
1 & 2
\end{bmatrix}
\begin{bmatrix}
3 & 1 \\
2 & 1
\end{bmatrix}
=
\begin{bmatrix}
4 & 1 \\
-1 & 1
\end{bmatrix}.
$$
My question is how exactly do I incorporate the matrix ##C## into this?
It looks quite simple but I just can't see it!
 
  • #4
Maybe ##C^{-1}B^{-1}ABC##.
 
  • #5
Suppose that ##T:X\to Y## is linear, that ##U=(u_1,\dots,u_n)## is an ordered basis for ##X##, and that ##V=(v_1,\dots,v_m)## is an ordered basis for ##Y##. Your book must have given you a formula that associates a matrix with the linear transformation T (and the two ordered bases U and V). Something like this:
$$\left([T]_{V,U}\right)_{ij}=(Tu_j)_i.$$ The right-hand side denotes the ##i##th component of ##u_j## with respect to ##V##. If ##X=Y##, you can use this formula to determine a relationship between ##[T]_{V,V}## and ##[T]_{U,U}##. Edit: That last sentence is irrelevant to this problem, because it's asking for something else. See my next post below.
 
Last edited:
  • #6
The only formula given is ##B^{-1}AB##
I'm completely lost on this.
 
  • #7
Would like to point out that ##f## is not represented by a matrix, ##f## is a function, a vector's image can be represented in matrix form with respect to ##f##. There exists a matrix ##A## such that ##f(x) = Ax##.

The transformation matrix can be written with column vectors, where the first column is the image of the first initial basis vector, second column is the image of the second initial basis vector and so on. This is by definition.

It turns out that the matrices with respect to the initial basis and the new basis respectively, are similar, that is the meaning of ##B^{-1}AB##, where ##B## is regular, because it is invertible.
 
Last edited:
  • #8
squenshl said:
The only formula given is ##B^{-1}AB##
I'm completely lost on this.
Given where? There's no B in the problem statement. How do you define your B?

In post #1, you're letting us know that you know that if two matrices represent the same linear operator in different bases, then they are similar (i.e. if one of them is A, then there's a matrix B such that the other one is ##B^{-1}AB##). But if the book hasn't proved a theorem that gives you a formula for the matrix B, then you will have to work directly with the definitions in post #5, and the definition of matrix multiplication: ##(AB)_{ij} =\sum_k A_{ik}B_{kj}##. Edit: Also, since the problem involves two changes of ordered bases, you should expect two matrices to show up in the answer.

This isn't an easy problem, because it requires you to understand the definitions well enough to use them correctly, and because it involves two changes of ordered basis instead of just one. But it's fairly straightforward in the sense that if you choose a good notation and apply the definitions correctly in every step, you will get the result you want.

The notations I used in post #5 are good enough, but we also need something to distinguish between the ith component of a vector x with respect to U, and the ith component of the same vector with respect to V. You could e.g. denote the former by ##x_i## and the latter by ##x_i'##. In your case, that would be ##x_i## for the ith component with respect to the standard ordered basis, and ##x_i'## for the ith component with respect to the other one.

In general, when you ask for assistance here, you should post your attempt up to the point where you're stuck. If you're stuck right at the start, then you should at least explain what definitions and/or theorems you think you should be using, and what it is about them that confuses you or makes you think they won't solve the problem.

Edit: These are some suggestions to get you started. I will denote the standard ordered bases for ##\mathbb R^n## and ##\mathbb R^m## by E. (If that feels weird, you can use notations like ##E_n## and ##E_m##). I will denote the other two ordered bases by B and C. In my notation, what's given in the problem is that ##[f]_{E,E}=A##. The problem is asking you to use that to find ##[f]_{C,B}##. Post #5 gives you a formula for its ij component. It's ##([f]_{C,B})_{ij}=(fb_j)_i'##. How do you proceed from this? A good start would be to multiply by ##c_i## and sum over i. Then you can use the definitions (and what I'm saying in the last paragraph below) to simplify the right-hand side. When you've taken that as far as you can, you should look for another way to rewrite the left-hand side.

Some other useful notations: Let M be the the linear operator such that ##b_i=Me_i## for all i. Let N be the linear operator such that ##c_i=Ne_i## for all i. Note that ##b_i=Me_i=\sum_j (Me_i)_j e_j =\sum_j ([M]_{E,E})_{ji} e_j##. It's convenient and relatively harmless to simplify the notation from ##[M]_{E,E}## to just M. So we can write ##b_i=\sum_j M_{ji} e_j##.
 
Last edited:

1. What is a matrix representation of a linear function in a new basis?

A matrix representation of a linear function in a new basis is a way of expressing the same linear function using a different set of basis vectors. This can be useful for simplifying calculations and understanding the behavior of the function in a different coordinate system.

2. How is the matrix representation of a linear function in a new basis calculated?

The matrix representation of a linear function in a new basis is calculated by first expressing the linear function in terms of the new basis vectors, and then constructing a matrix where the columns are the coordinates of the new basis vectors in terms of the original basis. This matrix is known as the transformation matrix.

3. What is the purpose of calculating the matrix representation of a linear function in a new basis?

The purpose of calculating the matrix representation of a linear function in a new basis is to simplify calculations and provide a better understanding of the function in a different coordinate system. This can be particularly useful in applications such as computer graphics and physics.

4. Can the matrix representation of a linear function in a new basis be used to transform vectors?

Yes, the matrix representation of a linear function in a new basis can be used to transform vectors. The transformation can be done by multiplying the transformation matrix with the vector, resulting in a new vector with coordinates in the new basis.

5. Are there any limitations to calculating the matrix representation of a linear function in a new basis?

One limitation is that the linear function must be defined in terms of the new basis vectors. Additionally, the new basis vectors must be linearly independent and form a basis for the vector space. If these conditions are not met, the calculation of the matrix representation may not be possible.

Similar threads

  • Calculus and Beyond Homework Help
Replies
0
Views
449
  • Calculus and Beyond Homework Help
Replies
4
Views
1K
  • Calculus and Beyond Homework Help
Replies
8
Views
794
  • Calculus and Beyond Homework Help
Replies
14
Views
592
  • Calculus and Beyond Homework Help
Replies
3
Views
569
  • Calculus and Beyond Homework Help
Replies
17
Views
2K
  • Calculus and Beyond Homework Help
Replies
6
Views
1K
  • Calculus and Beyond Homework Help
Replies
11
Views
1K
  • Calculus and Beyond Homework Help
Replies
2
Views
1K
  • Calculus and Beyond Homework Help
Replies
9
Views
1K
Back
Top