Linear transformations: function arguments

pamparana
Messages
123
Reaction score
0
I have a small confusion about functions and variables. So, on doing a bit of reading, a linear transformation is a function that maps inputs from one vector space to another.

So, let us take for example a simple rotation matrix. This matrix takes a point in 2D space and maps it to another point in this 2D space. So, this is fine and the argument to this rotation function is a 2D point.

Now, in many applications, we want to find the optimal rotation matrix i.e. we want to find the optimal angle \theta to do the rotation. So, my question is can we view this rotation matrix now as a function of two variables i.e. \theta and the input 2D point.

All the explanations seem to treat the angle of rotation as a constant or in our optimisation problem case some unknown (but constant) quantity to estimate. So, in most optimisation cases it will involve taking the derivative of the rotation matrix wrt to the variable of interest i.e. \theta and I was wondering whether I can view the rotation matrix as a function of \theta i.e. write it as R(\theta, p) where R is the rotation function and p is the point in 2D space.
 
Physics news on Phys.org
pamparana said:
I have a small confusion about functions and variables. So, on doing a bit of reading, a linear transformation is a function that maps inputs from one vector space to another.

So, let us take for example a simple rotation matrix. This matrix takes a point in 2D space and maps it to another point in this 2D space. So, this is fine and the argument to this rotation function is a 2D point.

Now, in many applications, we want to find the optimal rotation matrix i.e. we want to find the optimal angle \theta to do the rotation. So, my question is can we view this rotation matrix now as a function of two variables i.e. \theta and the input 2D point.

All the explanations seem to treat the angle of rotation as a constant or in our optimisation problem case some unknown (but constant) quantity to estimate. So, in most optimisation cases it will involve taking the derivative of the rotation matrix wrt to the variable of interest i.e. \theta and I was wondering whether I can view the rotation matrix as a function of \theta i.e. write it as R(\theta, p) where R is the rotation function and p is the point in 2D space.
As you said, most explanations treat the angle of rotation as an unknown constant, or parameter, which means that θ isn't considered to be a variable. If θ is allowed to vary (as in the optimization problems you cite), the rotation matrix is a function of only this rotation angle θ.
 
  • Like
Likes 1 person
Thanks for that explanation. As far as I understood, the optimisation is going (well, according to some local minima/maxima) to select the best rotation function which optimises the given cost function. So, in this case \theta can be seen as a variable as we evaluate the cost function by varying \theta continuously around the current value.
 
pamparana said:
So, on doing a bit of reading, a linear transformation is a function that maps inputs from one vector space to another.
A function T from a vector space V to a vector space W also has to satisfy the condition ##T(ax+by)=aT(x)+bT(y)## for all x,y in V and all real numbers a,b.

pamparana said:
Now, in many applications, we want to find the optimal rotation matrix i.e. we want to find the optimal angle \theta to do the rotation. So, my question is can we view this rotation matrix now as a function of two variables i.e. \theta and the input 2D point.
...
I was wondering whether I can view the rotation matrix as a function of \theta i.e. write it as R(\theta, p) where R is the rotation function and p is the point in 2D space.
You can certainly define a function f by
$$f(\theta,x,y)=\begin{pmatrix}\cos\theta & -\sin\theta\\ \sin\theta & \cos\theta\end{pmatrix}\begin{pmatrix} x\\ y\end{pmatrix},$$ but if you're looking for the optimal angle for a given (x,y), you might prefer to to define functions ##f_{x,y}## (one for each (x,y)) by
$$f_{x,y}(\theta)=\begin{pmatrix}\cos\theta & -\sin\theta\\ \sin\theta & \cos\theta\end{pmatrix}\begin{pmatrix} x\\ y\end{pmatrix},$$
 
Last edited:
  • Like
Likes 1 person
It is a misconception that linear matrices/operators that represent such transformations as rotation and shear take points as arguments. They take and produce vectors. These are affine transformations (D × D+1 matrices) that take and produce points.
 
Last edited:
Incnis Mrsi said:
It is a misconception that linear matrices/operators that represent such transformations as rotation and shear take points as arguments. They take and produce vectors. These are affine transformations (D × D+1 matrices) that take and produce points.
"Point" is a standard term for elements of (the underlying sets of) topological spaces. All finite-dimensional vector spaces can be given a topology in a simple way. Hilbert spaces are equipped with inner produces, which are used to define topologies in a standard way. So unless you're talking about some very exotic vector space, vectors are points.

Some authors also use the term "point" to mean any element of any set. With that terminology, vectors are points even if you are talking about those exotic vector spaces.

Pamparana was talking about ##\mathbb R^2##, which is just a plane with vector space operations defined on it. So even if we ignore the stuff I said about topology, it's still more than OK to call those vectors points, because they are points in the sense of Euclidean geometry.
 
points or vectors?

Fredrik said:
Pamparana was talking about ##\mathbb R^2##…
Didn’t spot the word “ℝ2”. Was my sight failed, or the browser maybe?

Fredrik said:
… because they are points in the sense of Euclidean geometry.
Ῑ’m not willing to stuff the thread with off-topic. Just notice: “matrix takes a point” is a (formally correct, but) counter-intuitive and confusingly ungeometric terminology.
 
Incnis Mrsi said:
Didn’t spot the word “ℝ2”. Was my sight failed, or the browser maybe?
He said "point in 2D" space and then "2D point". So he was talking about ##\mathbb R^2## even though he never used that symbol.
 
Fredrik: there is neither addition nor scalar multiplication for points on Euclidean plane, without an additional structure (such as distinguished point). ℝ2 defines a Euclidean plane, but an abstract Euclidean plane is neither ℝ2 nor a vector space.
 
  • #10
Incnis Mrsi said:
Fredrik: there is neither addition nor scalar multiplication for points on Euclidean plane, without an additional structure (such as distinguished point). ℝ2 defines a Euclidean plane, but an abstract Euclidean plane is neither ℝ2 nor a vector space.

Formally correct of course. But in the context, I think it's very clear that the OP meant ##\mathbb{R}^2##.
 
Back
Top