Vectors - are the definitions truly equivalent?

In summary: In the latter case, the components of the vector are the numbers which are the coefficients of the basis vectors in the representation. This is the idea of decomposing a vector into components. In physics, we want to do this so that the components transform in a simple, predictable way under rotations or other coordinate transformations. But, as you know, this leads to the idea of a pseudovector, which is something that behaves like a vector under rotations, but has some additional sign change properties when we do coordinate transformations.In the mathematical view, we have a vector space over a field. If the field is R or C, then we can pick a basis, and this leads to a decomposition of the vector into components,
  • #1
1MileCrash
1,342
41
There are two "formal" definitions of vectors (and tensors in general) which I've learned.

The first is what I consider the "better" definition, one I learned in linear algebra. We call a set X a vector space over a field F whenever that set has properly defined operations of scalar multiplication and addition, and follows certain properties. The elements of X are then called vectors.

The second definition, the one I learned as a physics major, is the one I consider the "pain in the ***" definition, that is, it is a vector is a "group of numbers" that transform a certain way (transformation law) into a new coordinate system.

I've always wondered if these definitions were, in fact, equivalent and I've seen it mentioned that they are.

However, what makes me doubt that they are really equivalent definitions is the emergence of "pseudovector" in the case of the second definition. These are vector-like objects which do not transform correctly upon inversion.

My questions are:
Consider a vector that meets the first definition; does it meet the second?

Consider a vector that meets the second definition. Does it meet the first?
 
Physics news on Phys.org
  • #2
[edit] I had never heard of pseudovectors or any definition of "vector" that refers to transformations. But I found this Wiki entry which gave some insight. As usual the physicists decided to do things a bit differently.

http://en.wikipedia.org/wiki/Pseudovector

The definition of a "vector" in physics (including both polar vectors and pseudovectors) is more specific than the mathematical definition of "vector" (namely, any element of an abstract vector space). Under the physics definition, a "vector" is required to have components that "transform" in a certain way under a proper rotation: In particular, if everything in the universe were rotated, the vector would rotate in exactly the same way. (The coordinate system is fixed in this discussion; in other words this is the perspective of active transformations.) Mathematically, if everything in the universe undergoes a rotation described by a rotation matrix R, so that a displacement vector x is transformed to x′ = Rx, then any "vector" v must be similarly transformed to v′ = Rv. This important requirement is what distinguishes a vector (which might be composed of, for example, the x-, y-, and z-components of velocity) from any other triplet of physical quantities (For example, the length, width, and height of a rectangular box cannot be considered the three components of a vector, since rotating the box does not appropriately transform these three components.)
I'm not sure I fully understand the "definition" above (for one thing, what is the difference between "displacement vector" and "vector"?), but already it's clear that this is not equivalent to the usual definition of a vector space, for a couple of reasons:

(1) In a general vector space, it's not necessarily possible to express a vector as an array of finitely many numbers/coordinates. This is only possible in a finite-dimensional vector space. Many interesting vector spaces have infinite dimensions, for example, the space of all real-valued sequences, or the space of all real-valued functions defined on ##\mathbb{R}##.

(2) In a general vector space, there need not be a notion of "rotation" or "angle" - this requires some additional structure such as an inner product.

(In the language of differential geometry, this requirement is equivalent to defining a vector to be a tensor of contravariant rank one.)
Hopefully someone who knows something about tensors (i.e. not me :tongue: ) can give some more insight here.
 
Last edited:
  • #3
The first definition is the "Linear Algebra" definition- it is more general than the second or "physics definition". That is mainly because the first definition allows infinite dimensional vector spaces. For example, if X is the set of all polynomials, with scalar multiplication defined to be multiplying every coefficient by that number and vector addition defined to be adding coefficients of the same power of the variable, then X is a vector space. It is NOT "group of numbers" because there is no upper bound on the degree of the polynomial- it is "infinite dimensional".

On the other hand, if V is a finite dimensional vector space, using the first definition, then we can choose a basis for the vector space, write every vector as in terms of those basis vectors, [itex]v= a_1v_1+ a_2v_2+ \cdot\cdot\cdot+ a_nv_n[/itex], and then identify vector v with the "group of numbers" [itex](a_1, a_2, \cdot\cdot\cdot, a_n)[/itex]. If we were to select a different basis for the vector space, we would have a different set of numbers: [itex]v= b_1u_1+ b_2u_2+ \cdot\cdot\cdot+ b_nu_n[/itex]. However, the set of numbers [itex](b_1, b_2, \cdot\cdot\cdot, b_n[/itex] and [itex](a_1, a_2, \cdot\cdot\cdot a_n)[/itex] are connected- each is a matrix times the other. That matrix multiplication is the "transformation law" you refer to.

Going the other way, given a "group of numbers", [itex](a_1, a_2, \cdot\cdot\cdot, a_n)[/itex], with the usual scalar multiplication and addition, does satisfy all of the requirements for a "vector space".

Now, you say that a "pseudovector" does not transform correctly upon inversion. What do you mean by "inversion"? Does that even correspond to an allowable operation in a vector space?
 
  • #4
If you know something about differential forms I give you some citations of Frankel's book:

"...Half of the "forms", "vectors," and "scalars" that occur in physics are in fact "pseudo-objects" that make sense only when an orientation prescribed...for example the magnetic field pseudovector B is perhaps the most famous example..."

"Definition: A pseudo-p-form [itex]\alpha[/itex] on a vector space E assigns, for each orientation [itex]\circ[/itex] of E, an exterior p-form [itex]\alpha_{\circ}[/itex] such that if the orientation is reversed the exterior form is replaced by it's negative [itex]\alpha_{-\circ}=-\alpha_{\circ}[/itex]...Similarly we can define define pseudovectors, pseudoscalars and so on, pseudo always reffering to a change of sign with a change of orientation"

Maybe that helps !

Greets
 
  • #5
I think the motivation for the different approaches is that mathematicians want to consider the properties of vectors in an abstract sense, but physicists and engineers want to do calculations, which in practice means representing a finite-dimensional vector in some definite (sually orthogonal) basis, or "coordinate system".

The physics/engineering definition of vectors (and similarly of tensors) focuses on the transformations required to do the calculations, when the underlying field is ##\mathbb{R}## or ##\mathbb{C}##, and the underlying vector space corresponds to n-dimensional Euclidean geometry.

Of course, physicists and engineers had been successfully doing the calculations for a few centuries before mathematicians decided to make the subject more complicated :biggrin:
 
  • #6
AlephZero said:
I think the motivation for the different approaches is that mathematicians want to consider the properties of vectors in an abstract sense, but physicists and engineers want to do calculations, which in practice means representing a finite-dimensional vector in some definite (sually orthogonal) basis, or "coordinate system".

The physics/engineering definition of vectors (and similarly of tensors) focuses on the transformations required to do the calculations, when the underlying field is ##\mathbb{R}## or ##\mathbb{C}##, and the underlying vector space corresponds to n-dimensional Euclidean geometry.

Of course, physicists and engineers had been successfully doing the calculations for a few centuries before mathematicians decided to make the subject more complicated :biggrin:

Please put comments like this in this thread: https://www.physicsforums.com/showthread.php?t=761983
 
  • #7
To add onto Aleph's point, mathematicians tend to like to make concrete things abstract (vector is a set with over a field along with the vector properties), whereas physicists like to stick to the concrete examples (vector is an arrow, or an object with magnitude and direction, or an n-tuple of numbers) . The difference lies in the ultimate goal differences between a mathematician and a physicist.

To a mathematician, making a concept abstract makes it possible to describe a much broader set of mathematical objects in the same way. This was hinted at in Ivy's post. Because mathematicians decided to define a vector in such an abstract way, they can analyze objects that we wouldn't normally consider vectors (e.g. functions spaces) using vector analysis. In this way a mathematician can find mathematical insight into objects that he wouldn't be able easily get by analyzing it otherwise.

To a physicist, we want to stick to the concrete examples because that helps us deal with the physical problems much easier and allows us physical insight into a problem. If we think of a velocity vector as an arrow in space, for example, we can much more quickly figure out what is going on physically than if we consider a velocity vector as an abstract object which, combined with the other velocity vectors (and the real numbers), obey the vector space postulates.

Because the "physicist's definition" originated first (as is the case for many mathematical objects because the "physicist's definition" is more likely to be the more intuitive one and less abstract one), it is usually found to be a subset of the "mathematician's definition". As is the case here.
 
  • #8
Moderator note: Please keep this thread on topic. This is not supposed to be a thread on physicists and mathematicians and who do it better. For any such discussions, see the link in my previous reply.
 
  • #9
1MileCrash said:
There are two "formal" definitions of vectors (and tensors in general) which I've learned.

The first is what I consider the "better" definition, one I learned in linear algebra. We call a set X a vector space over a field F whenever that set has properly defined operations of scalar multiplication and addition, and follows certain properties. The elements of X are then called vectors.

The second definition, the one I learned as a physics major, is the one I consider the "pain in the ***" definition, that is, it is a vector is a "group of numbers" that transform a certain way (transformation law) into a new coordinate system.

I've always wondered if these definitions were, in fact, equivalent and I've seen it mentioned that they are.

However, what makes me doubt that they are really equivalent definitions is the emergence of "pseudovector" in the case of the second definition. These are vector-like objects which do not transform correctly upon inversion.

My questions are:
Consider a vector that meets the first definition; does it meet the second?

Consider a vector that meets the second definition. Does it meet the first?
Any n-tuple of real numbers is a vector in the sense of linear algebra, but to get a vector (or pseudovector) in the sense of the awful definition, you need something that associates an n-tuple of real numbers with each coordinate system. It's the function that makes that association that should be called a tensor/pseudotensor, not one specific n-tuple.
 
  • #10
Fredrik said:
Any n-tuple of real numbers is a vector in the sense of linear algebra, but to get a vector (or pseudovector) in the sense of the awful definition, you need something that associates an n-tuple of real numbers with each coordinate system. It's the function that makes that association that should be called a tensor/pseudotensor, not one specific n-tuple.

So rigorously, what would be the definition of a pseudovector?
 
  • #11
HallsofIvy said:
Now, you say that a "pseudovector" does not transform correctly upon inversion. What do you mean by "inversion"? Does that even correspond to an allowable operation in a vector space?
Inversion is just the map ##x\mapsto -x##. Physics books often say that a vector transforms according to the tensor transformation law under O(3) transformations, while a pseudovector transforms that way under SO(3) transformations, and does something else (flips the sign?) under inversion.
 
  • #12
micromass said:
So rigorously, what would be the definition of a pseudovector?
I don't think I've ever seen a rigorous version of the awful definitions (of "vector" and "pseudovector"), but something like this should work:

A function ##V:O(3)\to\mathbb R^3## is said to be a vector (under O(3) transformations) if
$$V(R)_i=\sum_{j=1}^3 R_{ij}V(I)_j$$ for all ##i\in\{1,2,3\}## and all ##R\in O(3)##.

A function ##V:O(3)\to\mathbb R^3## is said to be a pseudovector (under O(3) transformations) if
$$V(R)_i=(\det R) \sum_{j=1}^3 R_{ij}V(I)_j$$ for all ##i\in\{1,2,3\}## and all ##R\in O(3)##.
 
  • Like
Likes 1 person
  • #13
I already posted a definition of pseudoforms...
 
  • #14
JonnyMaddox said:
I already posted a definition of pseudoforms...

Ah yes, I missed that. Which book by Frankel is this exactly?
 
  • #17
Fredrik said:
Inversion is just the map ##x\mapsto -x##. Physics books often say that a vector transforms according to the tensor transformation law under O(3) transformations, while a pseudovector transforms that way under SO(3) transformations, and does something else (flips the sign?) under inversion.

From what I understand, under inversion, a pseudovector will "stay the same" (or flip the sign an additional time if you prefer to say it that way).

Suppose O and P are coordinate systems such that by the transformation law, a vector V in O would be expressed as -V in P.

Then for vectors A, B in O, A x B does not transform according to the transformation law because:

A x B = - B x A = - A x -B

If A x B were to be a vector by the poopy definition, it would have to be the case that -A x -B = -(A x B).

Right?
 
  • #18
1MileCrash said:
From what I understand, under inversion, a pseudovector will "stay the same" (or flip the sign an additional time if you prefer to say it that way).

Suppose O and P are coordinate systems such that by the transformation law, a vector V in O would be expressed as -V in P.

Then for vectors A, B in O, A x B does not transform according to the transformation law because:

A x B = - B x A = - A x -B

If A x B were to be a vector by the poopy definition, it would have to be the case that -A x -B = -(A x B).

Right?
This sounds right to me, assuming that the transformation flips the sign of all elements of ##\mathbb R^3##. ##A\times B## is only one element of ##\mathbb R^3##, but if we also require that for all O(3) transformations R, the "transformed" ##A\times B## is ##RA\times RB##, then we have associated an element of ##\mathbb R^3## with all coordinate systems (of the relevant kind), and we can check the transformation properties of this association. You suggested (if I understand you correctly) that we consider R=-I, where I is the identity. We have
$$(A\times B)'=(-IA\times -IB)=(-A)\times(-B)= A\times B,$$ and therefore
$$(A\times B)'_i =(A\times B)_i=I_{ij}(A\times B)_j =\det(-I)\big((-I)_{ij}(A\times B)_j\big) \neq(-I)_{ij}(A\times B)_j.$$ So the association defined above certainly doesn't define a "vector" under O(3) transformations, but it may define a pseudovector under O(3) transformations. To know for sure, we would have to check what happens with an arbitrary O(3) transformation instead of just -I.
 

1. What is a vector?

A vector is a mathematical quantity that has both magnitude and direction. It is often represented by an arrow, with its length representing the magnitude and its direction indicating the direction.

2. How are vectors defined?

Vectors can be defined in multiple ways, but the most common definition is that they are any mathematical object that can be added and multiplied by a scalar to produce another vector that follows the same rules of addition and scalar multiplication.

3. Are all the definitions of vectors equivalent?

Yes, all the definitions of vectors are equivalent. They may differ in their terminology and notation, but they all describe the same mathematical concept.

4. What is the difference between a vector and a scalar?

A vector has both magnitude and direction, while a scalar only has magnitude. In other words, a scalar is a single quantity, whereas a vector is a combination of multiple quantities.

5. How are vectors used in science?

Vectors are used in various fields of science, including physics, engineering, and computer science. They are used to represent physical quantities such as force, velocity, and acceleration, and are essential in solving problems involving motion and forces.

Similar threads

  • Linear and Abstract Algebra
Replies
4
Views
2K
  • Linear and Abstract Algebra
Replies
2
Views
996
  • Topology and Analysis
Replies
5
Views
174
  • Linear and Abstract Algebra
2
Replies
43
Views
5K
  • Linear and Abstract Algebra
Replies
4
Views
1K
  • Linear and Abstract Algebra
Replies
16
Views
4K
  • Linear and Abstract Algebra
Replies
1
Views
1K
  • Linear and Abstract Algebra
Replies
6
Views
1K
Replies
4
Views
1K
  • Linear and Abstract Algebra
Replies
3
Views
2K
Back
Top