How do you determine the coefficients for a linear transformation matrix?

Click For Summary

Discussion Overview

The discussion revolves around understanding how to determine the coefficients for a linear transformation matrix, particularly in the context of transforming geometric shapes such as squares into parallelograms. Participants explore the implications of matrix coefficients in linear transformations and how they relate to vector representations.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant describes confusion regarding the meaning of coefficients in a shear transformation matrix and seeks clarification on how to determine these coefficients for transforming shapes.
  • Another participant proposes a method for defining a parallelogram from a square by establishing relationships between the vectors and forming a matrix, leading to a specific matrix representation.
  • A different participant explains that a 2x2 matrix represents a linear transformation with respect to chosen bases on the source and target spaces, suggesting a conceptual framework for understanding transformations.
  • One participant suggests that the simplest case involves transforming a square into a parallelogram by using the vectors defining the sides as columns of the transformation matrix.
  • A participant expresses appreciation for the clarification that a matrix represents the necessary numbers to transform vectors, while also critiquing the lack of emphasis in textbooks on the evolving definitions of matrices throughout the learning process.
  • Another participant discusses the relationship between a matrix of coefficients from a system of equations and its interpretation as a transformation, illustrating how vectors are mapped to new coordinates.

Areas of Agreement / Disagreement

Participants express varying degrees of understanding and confusion regarding the transformation process and the role of coefficients in matrices. There is no consensus on a single method or interpretation, as multiple approaches and explanations are presented.

Contextual Notes

Some participants highlight the importance of understanding the changing definitions of matrices in different contexts, indicating potential limitations in how these concepts are presented in educational materials.

Felix83
Messages
75
Reaction score
0
Im a second semester engineering student and I am a few weeks into a linear algebra class. I understand most of it, but my teacher has to work to speak english so she doesn't explain things very well. We just started linear transformations and a few things seem unclear to me.

Take a shear transformation for example. You multiply a vector by a matrix and end up with another vector - the 'image' of the original vector. There is an example in my book that considers the square formed by area under y=2 from x=0 to x=2. It considers every point as a vector, and then multiplies them by a matrix to get a transformation that looks like a parallelogram - the height of it is still two, but it is horizontally stretched. It shows calculations for a couple of the corner points. The first is the vector [0,2] ([x1,x2] - written vertical just hard to type that way :) ) multiplied by the 2x2 matrix ( [1,3] for the top row and [0,1] for the bottom ) to get [6,2] . Next the vector [2,2] is multiplied by the same matrix to get [8,2] .

What does the matrix mean - what do the coefficients actually represent? If you consider a shape such as a square and you want to transform it to a parallelogram, how do you know what coefficients to use in the matrix?
 
Last edited:
Physics news on Phys.org
Suppose you want a square whose sides have length a to be transformed into a parallelogram with height a, but (horizontal) length b, with the upper left vertex at (c,a). This uniquely defines the parallelogram.

So you want the vector [a,0] to be transformed into [b,0] (lenght b), and [0,a] must be transformed into [c,a] (still height a, and upperleft vertex (c,a)).

In general this can be done by forming a 2x2 matrix A and solving: [itex]A{a \choose 0}={b \choose 0}[/itex] and [itex]A{0 \choose a}={c \choose a}[/itex] for A.
This'll lead to 4 equations in 4 unknowns.

Since this example is a lot easier than a general case, some insight might show you that since [itex]A{a \choose 0}={b \choose 0}[/itex] the first column must be [itex]b/a \choose 0[/itex].
Likewise, since [itex]A{0 \choose a}={c \choose a}[/itex], the second column must be [itex]c/a \choose 1[/itex]. so that:

[tex]A=\left(\begin{array}{cc}b/a & c/a \\ 0& 1\end{array}\right)[/tex].

For a=2, b=2 and c=6 you get
[tex]A=\left(\begin{array}{cc}1 & 3 \\ 0 & 1\end{array} \right)[/tex].

which is the matrix in your example, giving the parallelogram with vertices (0,0),(2,0),(6,2) and (8,2).
 
Last edited:
Just like a 2-tuple is a representation of a vector with respect to a chosen basis, a 2x2 matrix is a representation of a linear transformation with respect to the chosen bases on the source and target spaces.
 
the easiest case is to transform the square with sides given by the vectors (1,0) and (0,1). then to transform it into the parallelgram with sides given by vectors (a,b) and (c,d), just put (a,b) and (c,d) in as the columns of the matrix. try it.
 
ok thanks, so in a transformation, a matrix simply represents the numbers requred to transform a set of vectors to another set. this makes a lot more sense now. i think the problem is the book doesn't emphasize enough when it changes what the matrix is defined as. it starts out as the coefficients of a set of linear equations, then a couple sections later it represents sets of vectors, and then with transformations it represents the set of numbers needed to transform a vector to another vector. books and teachers really need to point this out before all other things. if you were going through this and didnt know that this was changing, it would be very confusing and frustrating.
 
suppose you have the equations ax+by = e, and cx+dy = f, and you want to solve them.

If you form the matrix of coefficients, you have first row (a b) and second row (c d).

Thus the first column is (a,c) and the second column is (b,d).

then this matrix can be viewed as a transformation that takes the vector (1,0) to the vector (a,c), and takes the vector (0,1) to the vector (b,d).

Then by asking for a solution to the equations above,

ax+by = e, and cx+dy = f,

you are asking whether any vector gets taken to the vector (e,f) by this matrix.

i.e. since (x,y) can be written as x(1,0) + y(0,1), and since (1,0) and (0,1) go to

(a,c) and (b,d), it follows that (x,y) goes to x(a,c) + y(b,d).

so you are asking whether it is possible to wqrite (e,f) in the form x(a,c) + y(b,d).


enough for now.
 

Similar threads

  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 19 ·
Replies
19
Views
4K
  • · Replies 12 ·
Replies
12
Views
5K
  • · Replies 1 ·
Replies
1
Views
5K
  • · Replies 4 ·
Replies
4
Views
4K
  • · Replies 27 ·
Replies
27
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 3 ·
Replies
3
Views
4K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 20 ·
Replies
20
Views
4K