Linear transformations: function arguments

Click For Summary

Discussion Overview

The discussion revolves around the nature of linear transformations, specifically focusing on rotation matrices and their arguments. Participants explore whether a rotation matrix can be viewed as a function of both the rotation angle and a 2D point, particularly in the context of optimization problems.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested

Main Points Raised

  • Some participants propose that a rotation matrix can be expressed as a function of two variables, the angle θ and a 2D point, suggesting a notation R(θ, p).
  • Others argue that in many explanations, θ is treated as a constant or parameter, which leads to the interpretation of the rotation matrix as a function of only θ.
  • A participant suggests defining a function that incorporates both θ and the coordinates of the point, indicating a preference for a function f_{x,y}(θ) for optimization purposes.
  • Some participants clarify that linear transformations like rotation matrices take vectors as arguments rather than points, introducing the concept of affine transformations.
  • There is a discussion about the terminology used, with some participants noting that referring to vectors as points can be misleading or confusing in certain contexts.
  • One participant emphasizes that without additional structure, points in the Euclidean plane do not support operations like addition or scalar multiplication, distinguishing between abstract and concrete representations of Euclidean spaces.

Areas of Agreement / Disagreement

Participants express differing views on whether rotation matrices should be considered functions of both angle and point or solely of the angle. There is no consensus on the terminology used to describe the inputs to these transformations, leading to a contested discussion.

Contextual Notes

Some statements reflect assumptions about the nature of points and vectors in various mathematical contexts, which may not be universally accepted. The discussion also highlights the potential for confusion arising from terminology related to linear transformations and their arguments.

pamparana
Messages
123
Reaction score
0
I have a small confusion about functions and variables. So, on doing a bit of reading, a linear transformation is a function that maps inputs from one vector space to another.

So, let us take for example a simple rotation matrix. This matrix takes a point in 2D space and maps it to another point in this 2D space. So, this is fine and the argument to this rotation function is a 2D point.

Now, in many applications, we want to find the optimal rotation matrix i.e. we want to find the optimal angle \theta to do the rotation. So, my question is can we view this rotation matrix now as a function of two variables i.e. \theta and the input 2D point.

All the explanations seem to treat the angle of rotation as a constant or in our optimisation problem case some unknown (but constant) quantity to estimate. So, in most optimisation cases it will involve taking the derivative of the rotation matrix wrt to the variable of interest i.e. \theta and I was wondering whether I can view the rotation matrix as a function of \theta i.e. write it as R(\theta, p) where R is the rotation function and p is the point in 2D space.
 
Physics news on Phys.org
pamparana said:
I have a small confusion about functions and variables. So, on doing a bit of reading, a linear transformation is a function that maps inputs from one vector space to another.

So, let us take for example a simple rotation matrix. This matrix takes a point in 2D space and maps it to another point in this 2D space. So, this is fine and the argument to this rotation function is a 2D point.

Now, in many applications, we want to find the optimal rotation matrix i.e. we want to find the optimal angle \theta to do the rotation. So, my question is can we view this rotation matrix now as a function of two variables i.e. \theta and the input 2D point.

All the explanations seem to treat the angle of rotation as a constant or in our optimisation problem case some unknown (but constant) quantity to estimate. So, in most optimisation cases it will involve taking the derivative of the rotation matrix wrt to the variable of interest i.e. \theta and I was wondering whether I can view the rotation matrix as a function of \theta i.e. write it as R(\theta, p) where R is the rotation function and p is the point in 2D space.
As you said, most explanations treat the angle of rotation as an unknown constant, or parameter, which means that θ isn't considered to be a variable. If θ is allowed to vary (as in the optimization problems you cite), the rotation matrix is a function of only this rotation angle θ.
 
  • Like
Likes   Reactions: 1 person
Thanks for that explanation. As far as I understood, the optimisation is going (well, according to some local minima/maxima) to select the best rotation function which optimises the given cost function. So, in this case \theta can be seen as a variable as we evaluate the cost function by varying \theta continuously around the current value.
 
pamparana said:
So, on doing a bit of reading, a linear transformation is a function that maps inputs from one vector space to another.
A function T from a vector space V to a vector space W also has to satisfy the condition ##T(ax+by)=aT(x)+bT(y)## for all x,y in V and all real numbers a,b.

pamparana said:
Now, in many applications, we want to find the optimal rotation matrix i.e. we want to find the optimal angle \theta to do the rotation. So, my question is can we view this rotation matrix now as a function of two variables i.e. \theta and the input 2D point.
...
I was wondering whether I can view the rotation matrix as a function of \theta i.e. write it as R(\theta, p) where R is the rotation function and p is the point in 2D space.
You can certainly define a function f by
$$f(\theta,x,y)=\begin{pmatrix}\cos\theta & -\sin\theta\\ \sin\theta & \cos\theta\end{pmatrix}\begin{pmatrix} x\\ y\end{pmatrix},$$ but if you're looking for the optimal angle for a given (x,y), you might prefer to to define functions ##f_{x,y}## (one for each (x,y)) by
$$f_{x,y}(\theta)=\begin{pmatrix}\cos\theta & -\sin\theta\\ \sin\theta & \cos\theta\end{pmatrix}\begin{pmatrix} x\\ y\end{pmatrix},$$
 
Last edited:
  • Like
Likes   Reactions: 1 person
It is a misconception that linear matrices/operators that represent such transformations as rotation and shear take points as arguments. They take and produce vectors. These are affine transformations (D × D+1 matrices) that take and produce points.
 
Last edited:
Incnis Mrsi said:
It is a misconception that linear matrices/operators that represent such transformations as rotation and shear take points as arguments. They take and produce vectors. These are affine transformations (D × D+1 matrices) that take and produce points.
"Point" is a standard term for elements of (the underlying sets of) topological spaces. All finite-dimensional vector spaces can be given a topology in a simple way. Hilbert spaces are equipped with inner produces, which are used to define topologies in a standard way. So unless you're talking about some very exotic vector space, vectors are points.

Some authors also use the term "point" to mean any element of any set. With that terminology, vectors are points even if you are talking about those exotic vector spaces.

Pamparana was talking about ##\mathbb R^2##, which is just a plane with vector space operations defined on it. So even if we ignore the stuff I said about topology, it's still more than OK to call those vectors points, because they are points in the sense of Euclidean geometry.
 
points or vectors?

Fredrik said:
Pamparana was talking about ##\mathbb R^2##…
Didn’t spot the word “ℝ2”. Was my sight failed, or the browser maybe?

Fredrik said:
… because they are points in the sense of Euclidean geometry.
Ῑ’m not willing to stuff the thread with off-topic. Just notice: “matrix takes a point” is a (formally correct, but) counter-intuitive and confusingly ungeometric terminology.
 
Incnis Mrsi said:
Didn’t spot the word “ℝ2”. Was my sight failed, or the browser maybe?
He said "point in 2D" space and then "2D point". So he was talking about ##\mathbb R^2## even though he never used that symbol.
 
Fredrik: there is neither addition nor scalar multiplication for points on Euclidean plane, without an additional structure (such as distinguished point). ℝ2 defines a Euclidean plane, but an abstract Euclidean plane is neither ℝ2 nor a vector space.
 
  • #10
Incnis Mrsi said:
Fredrik: there is neither addition nor scalar multiplication for points on Euclidean plane, without an additional structure (such as distinguished point). ℝ2 defines a Euclidean plane, but an abstract Euclidean plane is neither ℝ2 nor a vector space.

Formally correct of course. But in the context, I think it's very clear that the OP meant ##\mathbb{R}^2##.
 

Similar threads

  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 1 ·
Replies
1
Views
4K
  • · Replies 7 ·
Replies
7
Views
1K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 26 ·
Replies
26
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 12 ·
Replies
12
Views
3K
  • · Replies 1 ·
Replies
1
Views
3K