Angular momentum - kleppner and kolenkow - derivation or definition?

In summary, the definition of angular momentum as the cross product between the radius vector and momentum of a particle is usually taken as a definition and not derived. It is considered useful due to its conservation and is used in many calculations, such as in the solution of planetary orbits. The choice to use a cross product was arbitrary, but it has been observed to be a useful mathematical tool in physics. It is not known how physicists came across this particular formula, but it is likely that they discovered its usefulness through experimentation and observation.
  • #1
3102
19
0
I am reading "An Introduction to Mechanics" by Kleppner and Kolenkow (2014). On page 241 is the definition of the angular momentum:

"Here is the formal definition of the angular momentum $$\vec{L}$$ of a particle that has momentum $$\vec{p}$$ and is at position $$\vec{r}$$ with respect to a given coordinate system: $$\vec{L}=\vec{r} \times \vec{p}$$"

In the book there is no explanation why this formula should be true. From this equation the formulas for torque and moment of inertia are derived.

My question is: Why is the formula above correct? Why isn't the formula for angular momentum something completely different, like $$\vec{L}=\sqrt{(\vec{r} \cdot \vec{p})2\pi M}$$? Is the formula for angular momentum just an arbitrary definition? If not, how to derive it? How did people come across that particular formula?

================================================

If anyone read the book mentioned above: Is there any derivation that I haven't found?
 
Physics news on Phys.org
  • #2
That formula is usually taken as a definition. You don't prove definitions. They just associate an English word with something mathematical.
 
  • #3
3102 said:
In the book there is no explanation why this formula should be true.
Definitions are always true, by definition. You cannot derive a definition, and it doesn't make sense to ask why it is true, it is tautologically true because it is defined to be true.

The better question about a definition is whether or not it is useful.
 
  • #4
BUT: An gyroscope behaves in a way that is not influenced by our definitions. If I took another definition for angular momentum (e.g.: the one mentioned above) then I would compute a totally wrong "forecast" about the gyroscopes motion. So, at least for my part, it does not seem like an arbitrary definition.
 
  • #5
3102 said:
Is the formula for angular momentum just an arbitrary definition? If not, how to derive it? How did people come across that particular formula?

The quantity ##\vec{r}\times\vec{p}## has been observed to be conserved, which makes useful enough to be worth studying. The arbitrary choice happened when we decided to call it "angular momentum" instead of something else.

(As an aside, there are deeper reasons why this thing we call angular momentum is conserved, but they were not discovered until centuries after angular momentum had been discovered and named).
 
  • #6
Just because the quantity is useful doesn't mean the definition's not arbitrary. By arbitrary we mean that it could have been called anything, not necessarily angular momentum (and obviously not necessarily represented by [itex]\vec{L}[/itex]). This quantity, the cross product between the radius vector and the momentum of a particle, turns out to be pretty useful, so we decide to make it an official physical quantity by giving it a name and all that. That part was arbitrary.
 
  • #7
3102 said:
BUT: An gyroscope behaves in a way that is not influenced by our definitions. If I took another definition for angular momentum (e.g.: the one mentioned above) then I would compute a totally wrong "forecast" about the gyroscopes motion. So, at least for my part, it does not seem like an arbitrary definition.
Regardless of what term you associate with ##\vec r\times\vec p## and what piece of mathematics you associate with the term "angular momentum", it would be ##\vec r\times\vec p## that you use in the calculation. For example, the formula ##\frac{d}{dt}\vec r\times\vec p=\vec r\times\vec F## holds regardless of what terms you associate with mathematical things.
 
  • #8
guitarphysics said:
Just because the quantity is useful doesn't mean the definition's not arbitrary. By arbitrary we mean that it could have been called anything, not necessarily angular momentum (and obviously not necessarily represented by [itex]\vec{L}[/itex]). This quantity, the cross product between the radius vector and the momentum of a particle, turns out to be pretty useful, so we decide to make it an official physical quantity by giving it a name and all that. That part was arbitrary.

But how do you know, that this particular quantity r x p is so useful? Is there any passage in the book that shoes you that this quantity is so much more useful than any other quantity?
 
  • #9
DaleSpam said:
Definitions are always true, by definition. You cannot derive a definition, and it doesn't make sense to ask why it is true, it is tautologically true because it is defined to be true.

The better question about a definition is whether or not it is useful.

okay, why is the particular quantity "r x p" so useful? And for me way more important: How to find out which quantity is important and which not?
 
  • #10
3102 said:
okay, why is the particular quantity "r x p" so useful? And for me way more important: How to find out which quantity is important and which not?

It is so useful because it is conserved. For example, the solution to the problem of planetary orbits depends on conservation of angular momentum.
 
  • #11
3102 said:
But how do you know, that this particular quantity r x p is so useful? Is there any passage in the book that shoes you that this quantity is so much more useful than any other quantity?
I haven't looked inside K & K in many years, but you're the one who talked about using it to calculate the motion of a gyroscope, so you must be aware of some uses for it already. If not, the sections or chapter(s) about angular momentum in the book should make it clear in what ways it's useful.

More useful than any other quantity? Useful for what? You'd have to answer the latter question to get an answer to the former. Since I just proved that ##\frac{d}{dt}\vec r\times\vec p=\vec r\times F## (I didn't post the proof), I can tell you that it's extremely useful for that.
 
  • #12
Nugatory said:
It is so useful because it is conserved. For example, the solution to the problem of planetary orbits depends on conservation of angular momentum.

Thanks for your answer. Now I have got 2 more questions:
1. How did physicists find out that the quantity "r x p" is the useful quantity? I assume that they did not try all possible combinations of quantities.
2. How did they know that they should use the cross product? I don't think that they were just randomly combining mathematical tools.

A short answer to my two questions would be very nice.
 
  • #13
3102 said:
Thanks for your answer. Now I have got 2 more questions:
1. How did physicists find out that the quantity "r x p" is the useful quantity? I assume that they did not try all possible combinations of quantities.
2. How did they know that they should use the cross product? I don't think that they were just randomly combining mathematical tools.

Newton described the concept of angular momentum using a combination of geometrical reasoning and his three laws. At the time he was looking for an explanation of Kepler's Law of Areas, which was an empirical observation about planetary orbits.

The modern formulation in terms of ##\vec{r}\times\vec{p}## came later, with the development of modern vector calculus.

If you google for "angular momentum history" you'll find a few more detailed descriptions. I don't know of any comprehensive explanation that's available online though.
 
  • Like
Likes 3102
  • #14
3102 said:
1. How did physicists find out that the quantity "r x p" is the useful quantity? I assume that they did not try all possible combinations of quantities.
I don't know historically how it happened for angular momentum. However, typically the way this works is that they don't have the concept, but the quantity keeps on showing up in calculations. Once it has shown up 3 or 4 times they get tired of writing it so they assign it to a variable to make their formulas easier.

3102 said:
2. How did they know that they should use the cross product? I don't think that they were just randomly combining mathematical tools.
This is just a matter of recognition. If you expand out the cross product in terms of vector components you get a very characteristic form of expression. Whenever you see that form you recognize it as the cross product.
 
  • Like
Likes 3102
  • #15
A deep answer:
  • The laws of physics haven't changed since yesterday, last week, or billions of years ago.
    That the laws of physics are invariant with respect to time means that energy is a conserved quantity.
  • The laws of physics are the same in London and Moscow, and on Mars, and in the Andromeda Galaxy.
    That the laws of physics are invariant with respect to displacement means that linear momentum is a conserved quantity.
  • The laws of physics are the same whether you look south or north, east or west, up or down.
    That the laws of physics are invariant with respect to orientation means that angular momentum is a conserved quantity.
These deep answers are a consequence of Noether's[/PLAIN] theorem. Emmy Noether developed her theorems four hundreds of years after Kepler first proposed his laws of planetary motion. A number of physicists worked on these problems between Kepler and Noether. There's a whole lot of science that was developed between Kepler and Noether.

Here's a dirty little secret of science education: The nice picture you are presented with in physics classes in high school and the undergraduate college level represents hundreds of years of stumbling around by physicists. You aren't taught that "stumbling around" because teaching that has been deemed to be counterproductive. Once you hit graduate school, you'll see exactly how much stumbling around scientists do before arriving at a simple and consistent explanation. This is the case not just in physics but throughout the sciences.

With regard to the cross product, Kepler, Newton, d'Alembert, Euler, Lagrange, Laplace, along with many others who worked on rotational motion did not use the cross product. They didn't even use vectors! They instead "stumbled around." The concept of a vector is a rather recent invention, dating to the final part of the ##19^\text{th}## century -- just a few decades before Emmy Noether developed her theorems. You'll see an amazing amount of "stumbling around" if you read texts or journal articles written prior to the ##20^\text{th}## century.
 
Last edited by a moderator:
  • Like
Likes 3102 and Nugatory
  • #16
Nugatory said:
The modern formulation in terms of ##\vec{r}\times\vec{p}## came later, with the development of modern vector calculus.

This was during the 1890-1910 period, more or less, as I recall reading. The physics of angular momentum (at least at the first-year introductory level) had surely been settled long before that (probably in the 1700s, or at the latest, early 1800s). I don't know what kind of notation people were using before then, but I suspect some of them were using quaternions.
 
  • Like
Likes 3102
  • #17
jtbell said:
I don't know what kind of notation people were using before then, but I suspect some of them were using quaternions.
Hamilton developed the quaternions in 1843. What those people who preceded Hamilton used falls into the category of what I not so nicely called "stumbling around".

Not that many people used Hamilton's quaternions at first.They remained a rather esoteric concept for a good twenty or so years after Hamilton first described them. Maxwell took interest and noted that his description of electrodynamics might be a lot easier if expressed in terms of quaternions. Followers of Hamilton, most notably Peter Tait, tried to eschew the value of quaternions. Others, most notably Josiah Gibbs and Oliver Heaviside, absolutely hated Hamilton's quaternions and instead developed the vector calculus that is now used almost universally in the teaching of introductory level physics.
 
  • Like
Likes 3102
  • #18
D H said:
Here's a dirty little secret of science education: The nice picture you are presented with in physics classes in high school and the undergraduate college level represents hundreds of years of stumbling around by physicists. You aren't taught that "stumbling around" because teaching that has been deemed to be counterproductive.

Interestingly, a lot more of the stumbling around is shown when talking about elementary QM at that level: Bohr atoms, wave-particle duality, uncertainty caused by measurements disturbing the system, photons like little bullets, observers collapsing the wave function, cats neither dead nor alive... And it seems pretty clear to me that it is counterproductive - an amazing number of threads in the QM forum are about getting people to unlearn these concepts.
 
  • Like
Likes Dale and 3102
  • #19
Wow, thanks for all your answers.

But, now I've got 2 more questions:
A) Was the cross product because of the long formulas in physics or was it independently developed in mathematics?
B) If the cross product was developed in physics, how did they get to these long formulas for rotational motion?
 
  • #20
The three dimensional dot product and cross product were originally minor side shows in Hamilton's quaternions. Gibbs and Heaviside worked on developing a quaternion-free notation that centralized these minor side shows. The result is the vector analysis that students are now taught, worldwide.

With regard to those "long formulas for rotational motion", what long formulae? Euler's equations are long and there are three of them. The modern version is rather compact in comparison: ##\boldsymbol {\tau}_{\text{ext}} = \mathbf I \dot{\boldsymbol \omega} + \boldsymbol \omega \times (\mathbf I \boldsymbol \omega)##.

This happens a lot. Developments in mathematics help make physics more compact, and developments in physics motivates mathematical developments. For example, if you proceed in studying physics you'll inevitably run across "Maxwell's equations," four short equations that succinctly represent all of classical electromagnetism. Maxwell did not write those four equations. He instead wrote 21 scalar equations, some of them rather long. If you proceed in studying physics even further, you'll find that you can express those four short equations in an even shorter form, as one single equation.
 
Last edited:
  • Like
Likes 3102
  • #21
D H said:
With regard to those "long formulas for rotational motion", what long formulae? Euler's equations are long and there are three of them. The modern version is rather compact in comparison: ##\boldsymbol {\tau}_{\text{ext}} = \mathbf I \dot{\boldsymbol \omega} + \boldsymbol \omega \times (\mathbf I \boldsymbol \omega)##.

By this I mean the 3 formulae, one for each coordinate direction x,y,z; the expansion of the cross product.
##\tau_{z}=xF_y-yF_x##

Where do these equations come from?
 
  • #22
They come from the definition of angular momentum and from the strong form of Newton's third law.

Suppose Physicist A writes about some particular obscure concept. Suppose that concept is used very little afterwards, and whenever it is used, it is used in its original form. It's going to stay in its original form.

Suppose on the other hand that Physicist B writes about some concept, Physicist C writes about some apparently different concept, Physicist D writes about yet another apparently different concept. Physicist E comes along and says "Hey! You guys were writing about the exact same thing! My paper presents a generic framework for analyzing this class of problems."

Next, Physicists F, G, H, I, and J, each of whom is unaware of Physicist E's unifying work, write about some other apparently different concepts. Physicist K comes along and says "Hey! All of you guys were writing about the exact same thing! Did you not read Physicist E's paper? My paper presents an even more generic framework for analyzing this class of problems."

Things that pop up over and over and over are given special names, special analyses. Physicists look for common features and create a common nomenclature when this happens. Other physicists look for ways to reduce the mathematics to a canonical form. This is what happened with rotational motion. The initial descriptions were rather verbose and perhaps a bit unclear. Now it is extremely compact.
 
  • Like
Likes 3102
  • #23
3102 said:
But how do you know, that this particular quantity r x p is so useful? Is there any passage in the book that shoes you that this quantity is so much more useful than any other quantity?
It's important because angular momentum stands side-by-side with energy and linear momentum (and also electrical charge, and a very small number of other things) that are conserved quantities. One of the key differences between Newton's mechanics and the Lagrangian / Hamiltonian formulations of Newtonian mechanics was a focus on those conserved quantities. This became even more important in the development of quantum mechanics.
 
  • Like
Likes 3102

Related to Angular momentum - kleppner and kolenkow - derivation or definition?

1. What is angular momentum?

Angular momentum is a physical quantity that describes the amount of rotational motion an object has. It is a vector quantity, meaning it has both magnitude and direction.

2. How is angular momentum defined?

Angular momentum is defined as the product of an object's moment of inertia and its angular velocity. It is represented by the symbol L and has units of kilogram meters squared per second (kg·m^2/s).

3. What is the formula for angular momentum?

The formula for angular momentum is L = Iω, where L is the angular momentum, I is the moment of inertia, and ω is the angular velocity.

4. How is angular momentum derived?

Angular momentum can be derived from Newton's Second Law of Motion, which states that the net torque acting on an object is equal to the rate of change of its angular momentum. This can be expressed as τ = dL/dt, where τ is the torque, L is the angular momentum, and t is time.

5. What is the difference between linear momentum and angular momentum?

Linear momentum refers to the motion of an object in a straight line, while angular momentum refers to the rotational motion of an object around a fixed axis. Linear momentum is represented by the symbol p, while angular momentum is represented by the symbol L.

Similar threads

Replies
17
Views
1K
Replies
42
Views
4K
Replies
5
Views
1K
Replies
3
Views
1K
Replies
3
Views
761
Replies
1
Views
4K
Replies
11
Views
12K
Replies
2
Views
4K
Replies
5
Views
2K
Back
Top