Dividing two differentials gives a total derivative how?

Click For Summary

Discussion Overview

The discussion revolves around the relationship between differentials in the context of differential geometry and their application in special relativity, particularly regarding the operation of dividing differentials to obtain a total derivative. Participants explore the mathematical implications of this operation and its conceptual underpinnings.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • Some participants express understanding of the differential as a 1-form and question how it relates to the total derivative, particularly in the context of Lorentz transformations.
  • There is a discussion on whether the transformation of differentials can be treated similarly to coordinate differences, with some arguing that using differentials is appropriate only under certain conditions, such as when velocity is constant.
  • Some participants propose that dividing one tensor by another can yield a scalar if one tensor is a scalar multiple of another, while others challenge the validity of this operation in the context of tensors and matrices.
  • Concerns are raised about the mathematical operations involved, particularly regarding the interpretation of division when applied to tensors and matrices, with suggestions that the operation may not be a true "division" in the conventional sense.
  • Participants discuss the nature of differentials as linear maps and their role in approximating changes in functions, emphasizing the complexity of higher-dimensional cases.

Areas of Agreement / Disagreement

Participants exhibit a range of views on the mathematical treatment of differentials and tensors, with no consensus reached on the validity of dividing differentials or the implications of such operations. The discussion remains unresolved regarding the conceptual and mathematical foundations of these ideas.

Contextual Notes

Limitations include the dependence on specific definitions of differentials and tensors, as well as unresolved mathematical steps regarding the operations discussed. The discussion reflects a variety of interpretations and assumptions that are not universally accepted.

ArthurB
Messages
17
Reaction score
0
Hello
I am studying some differential geometry. I think I have understood the meaning of "differential" of a function:

\text{d}f (V) = V(f)

It is a 1-form, an operator that takes a vector and outputs a real number.
But how is it related to the operation of "total derivative" ?
For example, in special relativity we can perform a boost with velocity v along the x axis and transform the differentials like this:
<br /> \text{d}x&#039; = \gamma (\text{d}x-v \, \text{d}t) \qquad \text{d}y&#039;=\text{d}y <br />
<br /> \text{d}t&#039; = \gamma\left(\text{d}t-v\, \text{d}x\right) \qquad \text{d}z&#039;=\text{d}z<br />
then the x component of the velocity
<br /> u^1=\frac{dx}{dt}<br />
in the new frame will have the expression
<br /> u&#039;^1=\frac{dx&#039;}{dt&#039;} = ? = \frac{\text{d}x&#039;}{\text{d}t&#039;}= \frac{u^1 -v}{1-{u^1 v}}<br />
but how is it mathematically possible? a differential is a tensor, not a scalar, how can I divide a tensor by another tensor and obtain a scalar?

P.S. notice the difference between \text{d} and d
 
Last edited:
Physics news on Phys.org
ArthurB said:
For example, in special relativity we can perform a boost with velocity v along the x axis and transform the differentials like this:
<br /> \text{d}x&#039; = \gamma (\text{d}x-v \, \text{d}t) \qquad \text{d}y&#039;=\text{d}y <br />
<br /> \text{d}t&#039; = \gamma\left(\text{d}t-v\, \text{d}x\right) \qquad \text{d}z&#039;=\text{d}z<br />
then the x component of the velocity
<br /> u^1=\frac{dx}{dt}<br />
in the new frame will have the expression
<br /> u&#039;^1=\frac{dx&#039;}{dt&#039;} = ? = \frac{\text{d}x&#039;}{\text{d}t&#039;}= \frac{u^1 -v}{1-{u^1 v}}<br />
but how is it mathematically possible? a differential is a tensor, not a scalar, how can I divide a tensor by another tensor and obtain a scalar?
Note that if you just drop the d's from this calculation, all the calculation does is to determine the velocity associated with the (straight) world line from (0,0) to (t,x) in another inertial coordinate system. (A Lorentz transformation leaves (0,0) unchanged and takes (t,x) to (t',x'), so x/t becomes x'/t'). I would say that the calculation with the d's in place is exactly the same thing, with a weird notation for the variables. It doesn't have anything to do with 1-forms.
 
Fredrik said:
I would say that the calculation with the d's in place is exactly the same thing, with a weird notation for the variables. It doesn't have anything to do with 1-forms.

It's conceptually the same thing only if u is constant I think. If the world line of the particle is not straight I don't think it's correct to use the transformations without the d's.
And those must be 1-forms, since Lorentz transformations leave the metric invariant:
<br /> \text{d}t^2 - \text{d}x^2 - \text{d}y^2 - \text{d}z^2 = <br /> \text{d}t&#039;^2 - \text{d}x&#039;^2 - \text{d}y&#039;^2 - \text{d}z&#039;^2<br />
in which the notation means, for example
<br /> \text{d}y^2 = \text{d}y \otimes \text{d}y<br />
that is a tensor product of two 1-forms.
 
ArthurB said:
how can I divide a tensor by another tensor and obtain a scalar?
You could observe that one tensor is a scalar multiple of another tensor. This happens somewhat frequently when you have a one-dimensional space of tensors...
 
ArthurB said:
It's conceptually the same thing only if u is constant I think. If the world line of the particle is not straight I don't think it's correct to use the transformations without the d's.
Right, but you can take your dx, dy, etc to be coordinate differences between two points on a tangent to the world line. In that case, you're just dealing with numbers, not differentials.

ArthurB said:
And those must be 1-forms, since Lorentz transformations leave the metric invariant:
<br /> \text{d}t^2 - \text{d}x^2 - \text{d}y^2 - \text{d}z^2 = <br /> \text{d}t&#039;^2 - \text{d}x&#039;^2 - \text{d}y&#039;^2 - \text{d}z&#039;^2<br />
in which the notation means, for example
<br /> \text{d}y^2 = \text{d}y \otimes \text{d}y<br />
that is a tensor product of two 1-forms.
Sure, the metric can be expressed as g=g_{\mu\nu}\text{d}x^\mu\otimes\text{d}y^\nu, but that doesn't mean that the dx, dy, etc in the velocity calculation must be considered differentials.


Since \text{d}x^\mu/\text{d}x^0 doesn't make sense, I guess the real question is if velocity components can be calculated as \frac{\text{d}x^\mu(v)}{\text{d}x^0(v)}=\frac{v(x^\mu)}{v(x^0)}=\frac{v^\mu}{v^0}, where v is some tangent vector. There's obviously a choice of v involved. I suggest v=\dot C(s), the tangent vector at C(s) of the world line C, defined by \dot C(s)f=(f\circ C)&#039;(s). Its components in the coordinate system x are \dot C^\mu(s)=\dot C(s)x^\mu=(x^\mu\circ C)&#039;(s) (The first equality defines the notation on the left). So \frac{\text{d}x^\mu(\dot C(s))}{\text{d}x^0(\dot C(s))} =\frac{(x^\mu\circ C)&#039;(s)}{(x^0\circ C)&#039;(s)}. The numerator is a velocity component. The whole thing isn't.
 
as others have said, v/w = t means that tw = v, so if it is possible for a tensor to be a scalar multiple of another tensor, then it is equally possible for the quotient of those tensors to be a scalar.
 
mathwonk said:
as others have said, v/w = t means that tw = v, so if it is possible for a tensor to be a scalar multiple of another tensor, then it is equally possible for the quotient of those tensors to be a scalar.

This seems reasonable, but as you know tensors can be represented by matrices, so are you saying that if a matrix equals a scalar multiplied by another matrix then it implies that the quotient of those matrices is a scalar?
<br /> M = a N \Rightarrow M/N = a ?<br />
My only problem is that the operator "/" usually takes two scalars as input and then outputs another scalar, right? How can you feed it matrices?
I'm not saying it's wrong, just that probably we can't call it "quotient", and that the operation we are doing is not "dividing", it must be something else, but what?
 
Just like 1/x or x-1 is the inverse of a number x with respect to multiplication, A-1 is the inverse of a matrix A with respect to matrix multiplication.

Wether you use A-1 or 1/A is a matter of convention (and like for the numbers, A/B is just a shorthand notation for A * 1/B).
 
kith said:
Just like 1/x or x-1 is the inverse of a number x with respect to multiplication, A-1 is the inverse of a matrix A with respect to matrix multiplication.

Wether you use A-1 or 1/A is a matter of convention (and like for the numbers, A/B is just a shorthand notation for A * 1/B).
In this case, we're talking about cotangent vectors, so the matrices would be 1×4. I think what mathwonk is saying is just that the notation v/w=t can be defined to mean v=wt. But I don't understand what Hurkyl has in mind. There's a theorem that says that two (bounded) linear functionals on the same (normed) vector space have the same kernel if and only if one of them is a number times the other. So for \text{d}x^3 to be a number times \text{d}x^0, the "coordinate system" would have to be such the 3 component of a tangent vector is 0 if and only if the 0 component is 0. Wouldn't the 3 axis have to coincide with the 0 axis for that to happen? (That would mean that x doesn't qualify as a coordinate system at all, let alone as an inertial coordinate system in SR).
 
  • #10
I hope what I'm saying is not too trivial, but it took me a while to figure this out myself:

but the differential of f is a linear map

that describes the local change of the values of f , e.g., the differential of f(x)=x^2

is 2xdx , so that the line with slope 2xo is the line that best approximates (in a delta-eps.

sense) the change of f(x) in a 'hood (neighborhood) of xo. This is the simplest case, i.e.,

of maps from Reals to Reals. In higher-dimensional cases, you have many more directions

to take care of, so you deal with a full tangent space . So your differential gives you the

best approximation in a given direction , and this direction is described by a vector in

the tangent space. And, a technical point about M/N=a : the quotient of objects in

a structure should be an object in that structure , so M/N =a would mean the scalar

matrix a, not the number a, since M,N are matrices.

So, formally, as a 1-form, the

map assigns to each point
 
  • #11
Fredrik said:
But I don't understand what Hurkyl has in mind.

I think he simply said that in a 1-dimensional vector space every vector is a multiple of the other ones. Right. But aren't we reasoning in 4 dimensions?
About the whole "differential-derivative" stuff I think I'm figuring it out, thanks everyone for the help.
 
  • #12
ArthurB said:
About the whole "differential-derivative" stuff I think I'm figuring it out,
I actually didn't quite understand what it was you wanted to know there. Was it that the formula df=\frac{\partial f}{\partial x^i}dx^i can be interpreted as \text{d}f|_p(v)=v(f)=v^i\frac{\partial}{\partial x^i}\bigg|_p f=\frac{\partial f(p)}{\partial x^i}v^i\ ?
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 36 ·
2
Replies
36
Views
6K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 9 ·
Replies
9
Views
4K
  • · Replies 6 ·
Replies
6
Views
4K
  • · Replies 14 ·
Replies
14
Views
5K
  • · Replies 15 ·
Replies
15
Views
3K
  • · Replies 10 ·
Replies
10
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 16 ·
Replies
16
Views
2K