Tangent vectors as directional derivatives

In summary, the first question is about directional derivatives in general. The second question is about how to not consider the parameter t to be a coordinate for a curve in a region. The answer to the first question is that the curve specifies the direction (in a sense). The answer to the second question is that you need a coordinate system to specify the location of the point on the curve.
  • #1
"Don't panic!"
601
8
I have a few conceptual questions that I'd like to clear up if possible.

The first is about directional derivatives in general. If one has a function [itex]f[/itex] defined in some region and one wishes to know the rate of change of that function (i.e. its derivative) along a particular direction in that region, is the reason why one specifies a curve along the direction one wishes to consider because the curve specifies the direction (in a sense)? That is, if we choose a curve [itex]\gamma[/itex] (parametrised by [itex]t[/itex]) along some direction in the region in which [itex]f[/itex] is defined, then we can evaluate the function along that curve by composing [itex]f[/itex] with [itex]\gamma[/itex], i.e. [itex]f\circ\gamma[/itex]. Then for each value of [itex]\gamma[/itex] we can evaluate [itex]f[/itex] at that point and as such, the rate of change of the function along the direction defined by the curve (at a particular point) [itex]\gamma[/itex] is given by [tex]\frac{d}{dt}(f\circ\gamma)[/tex] Would this be correct?
When it comes to defining tangent vectors on manifolds, is the point that we define a curve [itex]\gamma : (-\varepsilon, \varepsilon)\rightarrow M[/itex] such that a particular direction along the manifold, at a given point [itex]p\in M[/itex], is specified. Then we can consider function [itex]f:M\rightarrow\mathbb{R}[/itex] and evaluate this function along the curve [itex]\gamma[/itex] at the point [itex]p\in M[/itex]. This involves composing the function with the curve [itex]\gamma[/itex] and noting that [itex]\gamma (0)=p[/itex]. Then, [tex]\frac{d}{dt}(f\circ\gamma)\bigg\vert_{p}[/tex] which is the derivative of [itex]f[/itex] along a particular direction (specified by [itex]\gamma[/itex]) on the manifold at a given point [itex]p\in M[/itex]. We note that, in general, there will be more than one curve that will have the same tangent at a given point, and so we identify a tangent vector at the point [itex]p\in M[/itex] as an equivalence class of curves passing through [itex]p\in M[/itex] and satisfying [itex](\phi\circ\gamma_{1})'(0)=(\phi\circ\gamma_{2})'(0)[/itex] (where [itex]\phi[/itex] is some coordinate chart)?!
Adding to this, I was asked a question as to way we don't consider the parameter [itex]t[/itex] to be a one-dimensional coordinate system for the curve [itex]\gamma[/itex]?! My response was that [itex]t[/itex] simply parametrises a curve in [itex]M[/itex], each value of [itex]t\in(-\varepsilon, \varepsilon)\subset\mathbb{R}[/itex] is mapped to a specific point on the manifold, i.e. [itex]t\mapsto \gamma (t)=p\in M[/itex], however, this doesn't specify the actual location of the point on the manifold and therefore [itex]t[/itex] is not a coordinate; one requires a mapping from [itex]M[/itex] to [itex]\mathbb{R}^{n}[/itex] in order to specify the actual location of the point in terms of an [itex]n[/itex]-tuple of coordinate values. I'm unsure whether this is a valid argument though?!

My second question is, given a function [itex]F: \mathbb{R}^{n}\rightarrow\mathbb{R}[/itex] is it valid to consider a curve [itex]\gamma :[0,1]\rightarrow\mathbb{R}^{n}[/itex] defined such that [itex]\gamma (0)=a\in\mathbb{R}^{n} [/itex] and [itex]\gamma (1)=x\in\mathbb{R}^{n} [/itex], and express it as [tex]\gamma(t)=(x^{1}(t),\ldots,x^{n}(t))=\gamma (0)+t\left(\gamma (1)-\gamma (0)\right)=a+t(x-a)[/tex] where the [itex]x^{i}:[0,1]\rightarrow\mathbb{R}[/itex] are coordinate functions defined by [tex]x^{i}(t)=a^{i}+t\left(x^{i}-a^{i}\right)[/tex] Then one can write [tex]\frac{d}{dt}\left((F\circ\gamma)(t)\right)= \frac{d}{dt}\left(F(\gamma(t))\right)=\frac{d}{dt}\left(F((x^{1}(t),\ldots,x^{n}(t)))\right) \\ \qquad\qquad\qquad\qquad\qquad\qquad\;\;\;=\sum_{i=1}^{n}\frac{\partial F(a+t(x-a))}{\partial x^{i}}\frac{dx^{i}}{dt} \\ \qquad\qquad\qquad\qquad\qquad\qquad\;\;\;=\sum_{i=1}^{n}\frac{\partial F(a+t(x-a))}{\partial x^{i}}\left(x^{i}-a^{i}\right)[/tex] and as [itex]x\in\mathbb{R}^{n} [/itex] was chosen arbitrarily, this result holds [itex]\forall x\in\mathbb{R}^{n} [/itex]. Would this be valid?
 
Physics news on Phys.org
  • #2
"Don't panic!" said:
The first is about directional derivatives in general. If one has a function [itex]f[/itex] defined in some region and one wishes to know the rate of change of that function (i.e. its derivative) along a particular direction in that region, is the reason why one specifies a curve along the direction one wishes to consider because the curve specifies the direction (in a sense)?
Yes. Also, the composition ##f\circ\gamma## is a real-valued function defined on a subset of ##\mathbb R##, so the definition of "derivative" from calculus can be applied.

"Don't panic!" said:
When it comes to defining tangent vectors on manifolds,...
No objections to this part, other than to the notation ##|_p##. It should be ##|_0##. I would also put that notation right next to the d/dt, but that's a matter of taste.
$$(f\circ\gamma)'(0)=\frac{d}{dt}\bigg|_0 f(\gamma(t)).$$
"Don't panic!" said:
Adding to this, I was asked a question as to way we don't consider the parameter [itex]t[/itex] to be a one-dimensional coordinate system for the curve [itex]\gamma[/itex]?!
You can view the range of the curve ##\gamma:(-\varepsilon,\varepsilon)\to M## as a 1-dimensional submanifold. If this map is injective, then its inverse can be thought of as a coordinate system.

"Don't panic!" said:
...this doesn't specify the actual location of the point on the manifold and therefore [itex]t[/itex] is not a coordinate; one requires a mapping from [itex]M[/itex] to [itex]\mathbb{R}^{n}[/itex] in order to specify the actual location of the point in terms of an [itex]n[/itex]-tuple of coordinate values.
A coordinate system on M is a map into ##\mathbb R^n##. A coordinate system on a 1-dimensional submanifold of M is a map into ##\mathbb R##.

"Don't panic!" said:
...express it as [tex]\gamma(t)=(x^{1}(t),\ldots,x^{n}(t))=\gamma (0)+t\left(\gamma (1)-\gamma (0)\right)=a+t(x-a)[/tex]
This would be an approximation, and it might be a bad one.

It's confusing that you're using the symbol x for two different things.
 
  • #3
Fredrik said:
A coordinate system on M is a map into Rn\mathbb R^n. A coordinate system on a 1-dimensional submanifold of M is a map into R\mathbb R.

But isn't the map meant to be coordinate independent? I had a friend question me on this last weekend as he thought the argument for defining a vector using this approach was a bit circular as he thought that it implicitly introduced a coordinate system from the start?! I wasn't able to give a particularly convincing argument, the best I was able to do was to give the explanation I put in my first post

Fredrik said:
This would be an approximation, and it might be a bad one.

It's confusing that you're using the symbol x for two different things.

I included this bit in reference to Hadamard's lemma for smooth functions, that I've been trying to prove, i.e. For any smooth function [itex] F:\mathbb{R} ^{n} \rightarrow\mathbb{R}[/itex] there exist smooth functions [itex] H_{\mu} [/itex] such that for all [itex] x\in\mathbb{R}^{n} [/itex] [tex] F(x) =F(a) +\sum_{i=1}^{n}(x^{\mu}-a^{\mu})H_{\mu}(x)[/tex] where [itex] a\in\mathbb{R}^{n} [/itex].
 
  • #4
"Don't panic!" said:
But isn't the map meant to be coordinate independent? I had a friend question me on this last weekend as he thought the argument for defining a vector using this approach was a bit circular as he thought that it implicitly introduced a coordinate system from the start?! I wasn't able to give a particularly convincing argument, the best I was able to do was to give the explanation I put in my first post
I don't quite understand what your friend is arguing against or what his argument is.

"Don't panic!" said:
I included this bit in reference to Hadamard's lemma for smooth functions, that I've been trying to prove, i.e. For any smooth function [itex] F:\mathbb{R} ^{n} \rightarrow\mathbb{R}[/itex] there exist smooth functions [itex] H_{\mu} [/itex] such that for all [itex] x\in\mathbb{R}^{n} [/itex] [tex] F(x) =F(a) +\sum_{i=1}^{n}(x^{\mu}-a^{\mu})H_{\mu}(x)[/tex] where [itex] a\in\mathbb{R}^{n} [/itex].
There's a nice proof of that lemma in Isham's book. The same proof can be found in Wald. It involves a trick that's easy to understand but difficult to find.
 
  • #5
Fredrik said:
I don't quite understand what your friend is arguing against or what his argument is.

I think the issue came up when I was trying to introduce the concept of vectors on a manifold to him. I said that we need to develop a notion of a tangent vector when we have no notion of a specific origin or what a straight line is, or how to relate nearby points on a manifold. I said that we want to develop a coordinate independent definition (as vectors are coordinate independent quantities) and at this point started talking about curves on a manifold and this is where the issue came up, as he said how can we introduce vectors in this way when it appears that one is introducing a 1-dimensional coordinate system to do so?!

Fredrik said:
There's a nice proof of that lemma in Isham's book. The same proof can be found in Wald. It involves a trick that's easy to understand but difficult to find.

I was actually trying to do the problem in Wald's book where he asks the reader to prove it, and this was the approach that I took, but I was really unsure about it.
 
  • #6
"Don't panic!" said:
...he said how can we introduce vectors in this way when it appears that one is introducing a 1-dimensional coordinate system to do so?!
I would agree that there's a problem if we had used one specific curve ##\gamma## to define ##T_pM##, and a different choice of ##\gamma## would have given us a different ##T_pM##. But that's not the case. The definition of ##T_pM## goes roughly like this: We start with a set that contains lots of curves through p. We define an equivalence relation on that set. Then we define a vector space structure (an addition operation and a scalar multiplication operation) on the set of equivalence classes. The vector space we end up with is denoted by ##T_pM##. Note in particular that an equivalence class of curves doesn't define a coordinate system on a 1-dimensional submanifold.

"Don't panic!" said:
I was actually trying to do the problem in Wald's book where he asks the reader to prove it, and this was the approach that I took, but I was really unsure about it.
I actually don't remember what Wald said about this theorem. When I saw the proof in Isham a few years ago, it made me think that I had previously studied the same proof in Wald. But maybe I had just done the exercise you mentioned. Isham's proof can be read at Google Books. Link.
 
  • #7
Fredrik said:
Note in particular that an equivalence class of curves doesn't define a coordinate system on a 1-dimensional submanifold.

Is this because each curve in the equivalence class can, in principle, be parametrised by a different parameter and so the equivalence class just captures the properties that they each pass through the same point and have the same derivative at that point, without needing to specify any particular coordinate system on a 1-dimensional submanifold?

Fredrik said:
I actually don't remember what Wald said about this theorem. When I saw the proof in Isham a few years ago, it made me think that I had previously studied the same proof in Wald

Wald quotes the result in chapter 2 of his book and then asks the reader to prove it by induction at the end of this chapter. I get the 1-dimensional case as we can write [tex]F(x)=F(a)+(F(x)-F(a))=F(a)+F(a+t(x-a))\big\vert_{t=0}^{t=1}=F(a)+\int_{0}^{1}\frac{dF(a+t(x-a))}{dt}dt\\ =F(a)+\int_{0}^{1}\frac{\partial F(a+t(x-a))}{\partial u}\frac{du}{dt}dt =F(a)+\int_{0}^{1}\frac{\partial F(a+t(x-a))}{\partial u}(x-a)dt\\ =F(a)+(x-a)\int_{0}^{1}\frac{\partial F(a+t(x-a))}{\partial u}dt[/tex]
where [itex]u(t)=a+t(x-a)[/itex].

I assume that Isham is doing something similar to this, although I don't quite follow his proof. Is the point that he is adding and subtracting the same functions (reducing in dimension by 1 each time) such that he can write an [itex]n[/itex]-dimensional function as a sum of functions (decreasing in dimension)? Also, when he goes from [tex]F(a^{1},\ldots,a^{n})=F(0,\ldots,0)+\sum_{\mu =1}^{n}F(a^{1},\ldots,ta^{\mu},0,\ldots,0)\big\vert_{t=0}^{t=1}[/tex] to [tex]F(a^{1},\ldots,a^{n})=F(0,\ldots,0)+\sum_{\mu =1}^{n}\int_{0}^{1}F(a^{1},\ldots,ta^{\mu},0,\ldots,0)dt\\ =F(0,\ldots,0)+\sum_{\mu =1}^{n}\int_{0}^{1}\frac{\partial F(a^{1},\ldots,ta^{\mu},0,\ldots,0)}{\partial u^{\mu}}a^{\mu}dt[/tex] I assume he's using the fundamental theorem of calculus and the chain rule (with [itex]u^{\mu}(t)=ta^{\mu}[/itex])? My confusion arises from reading other texts where they just define a function [itex]h(t)=F(a+t(x-a))[/itex] and then note that [tex]\frac{dh}{dt}=\sum_{\mu =1}^{n}\frac{\partial F(a+t(x-a))}{\partial x^{\mu}}(x^{\mu}-a^{\mu})[/tex] and then note that [tex]h(1)-h(0)=F(x)-F(a)=\int_{0}^{1}\frac{dh}{dt}dt=\sum_{\mu =1}^{n}(x^{\mu}-a^{\mu})\int_{0}^{1}\frac{\partial F(a+t(x-a))}{\partial x^{\mu}}dt[/tex] but I don't see how this follows (unless they are abusing notation)?!
 
  • #8
"Don't panic!" said:
Is this because each curve in the equivalence class can, in principle, be parametrised by a different parameter...
No, it's because two different curves in the same equivalence class define coordinate systems on different submanifolds. The point p may be the only point in M that's in the range of both curves.

The possibility of reparametrization means that each curve defines infinitely many coordinate systems (on the same 1-dimensional submanifold), rather than just one.

I will think about that proof and post my comments later.
 
  • #9
Fredrik said:
it's because two different curves in the same equivalence class define coordinate systems on different submanifolds

Ah, so is the point that two different curves in the same equivalence class are mappings from different submanifolds to the manifold, with their particular parameter defining a 1-dimensional coordinate system for each of the respective submanifolds. The two curves may wildly differ at in general, but are such that they both pass through a particular point p, and the value of their derivatives (evaluated at this point are equal). Would this be correct?

Fredrik said:
The possibility of reparametrization means that each curve defines infinitely many coordinate systems (on the same 1-dimensional submanifold), rather than just one.

Is the point here that how a given curve is parametrised is somewhat arbitrary and as such we are free to choose how we parametrise it and hence making it coordinate independent? How does one show that the derivative of a curve at a point is independent of parametrisation?

Fredrik said:
I will think about that proof and post my comments later.

Thanks, really appreciate you taking the time to look at it :)
 
  • #10
As far as I can tell, the rewrite
$$F(x)=F(0)+\sum_{k=1} F(x_1,\dots,x_{k+1},tx_k,0,\dots,0)\big|_{t=0}^{t=1}$$ is in no way better than the much simpler rewrite
$$F(x)=F(0)+F(x)-F(0)= F(0)+F(tx)\big|_{t=0}^{t=1}.$$ They both seem to get the job done, in the sense that they both give us ways to write
$$F(x)=F(0)+\sum_{k=1}^n x_k F_k(x).$$ They give us different sets of functions ##F_k##, but for each k, both ##F_k## have the same value at 0. I guess the ##F_k## aren't unique. Does that make sense? Let's denote the members of the second set of such functions by ##G_k## instead of ##F_k##. For all x, we have
$$F(0)+\sum_{k=1}^n x_k F_k(x)=F(0)+\sum_{k=1}^n x_k G_k(x)$$ and therefore
$$\sum_{k=1}^n x_k (F_k(x)-G_k(x))=0.$$ This implies in particular that for all ##k##, we must have ##F_k(x)=G_k(x)## when ##x## is the ##k##th standard basis vector. But it doesn't seem to imply that ##F_k=G_k##. So it seems plausible that both of those rewrites get the job done.
 
  • #11
"Don't panic!" said:
Ah, so is the point that two different curves in the same equivalence class are mappings from different submanifolds to the manifold,
They map intervals in ##\mathbb R## (possibly the same one) to different 1-dimensional submanifolds of M. If they are injective, their inverses map those submanifolds back to those intervals. Those inverses can be thought of as coordinate systems on 1-dimensional submanifolds.

"Don't panic!" said:
The two curves may wildly differ at in general, but are such that they both pass through a particular point p, and the value of their derivatives (evaluated at this point are equal). Would this be correct?
"Their derivatives" is the tangent vector that we're trying to define, so we shouldn't be talking about it when we're discussing the very early parts of the construction. But given any two curves C,D in the same equivalence class, and any coordinate system x such that x(p)=0, the maps ##x\circ C## and ##x\circ D## (curves in ##\mathbb R^n##) have the same derivative at 0. (Now I'm just talking about the kind of derivative encountered in calculus). The fact that we're mentioning a coordinate system here is a problem if and only if the equivalence relation somehow depends on x. That's why one of the first steps in this construction is to prove that it doesn't.

"Don't panic!" said:
Is the point here that how a given curve is parametrised is somewhat arbitrary and as such we are free to choose how we parametrise it and hence making it coordinate independent? How does one show that the derivative of a curve at a point is independent of parametrisation?
It's not. The derivative (of the composition of the coordinate system and the curve) can be interpreted as a velocity, and most reparametrizations will change it. A reparametrization of a curve ##\gamma:(a,b)\to M## is a map ##s:(c,d)\to(a,b)##. The reparametrized curve is the map ##\gamma\circ s##. If x is a coordinate system as above, then we have
$$(x\circ\gamma\circ s)'(0)=(x\circ\gamma)'(s(0))s'(0).$$ So if ##s'(0)\neq 1##, ##x\circ\gamma\circ s## and ##x\circ\gamma## have different velocities.
 
  • #12
Fredrik said:
They map intervals in R\mathbb R (possibly the same one) to different 1-dimensional submanifolds of M. If they are injective, their inverses map those submanifolds back to those intervals. Those inverses can be thought of as coordinate systems on 1-dimensional submanifolds.

But I thought the mapping was from a subset of [itex]\mathbb{R}[/itex] to the manifold, i.e. [itex]\gamma : (-\varepsilon, \varepsilon)\rightarrow M[/itex]?

Fredrik said:
It's not. The derivative (of the composition of the coordinate system and the curve) can be interpreted as a velocity, and most reparametrizations will change it. A reparametrization of a curve γ:(a,b)→M\gamma:(a,b)\to M is a map s:(c,d)→(a,b)s:(c,d)\to(a,b). The reparametrized curve is the map γs\gamma\circ s. If x is a coordinate system as above, then we have
(xγs)′(0)=(xγ)′(s(0))s′(0).​
(x\circ\gamma\circ s)'(0)=(x\circ\gamma)'(s(0))s'(0). So if s′(0)≠1s'(0)\neq 1, the reparametrized curve doesn't have the same velocity vector.

So is the reason why we need not worry about the introduction of these 1-dimensional coordinate systems the fact that a tangent vector at a point is defined as an equivalence class of curves which will, in general, be parametrised differently (corresponding to them having different 1-dimensional coordinate systems), and will, in general, vary very differently at other points over the manifold. As such, as the definition of a tangent vector is not dependent on anyone particular curve, we do not need to even consider how these curves are parametrised (i.e. we don't need to introduce such 1-dimensional coordinate systems into the definition) and thus the definition is completely coordinate independent?!

Also, I have to confess, I'm a little confused by your previous post (sorry). Is there any reference for the identity that Isham uses, or is it literally noticing that you can write out an alternating sum of functions (decreasing in dimension) such that [tex]F(a^{1},\ldots ,a^{n})=F(a^{1},\ldots ,a^{n})-F(a^{1},\ldots ,a^{n-1},0)+F(a^{1},\ldots ,a^{n-1},0)-F(a^{1},\ldots ,a^{n-2},0,0)\\ \qquad\qquad\qquad+\cdots +F(a^{1},0,\ldots ,0,0)-F(0,0,\ldots ,0,0)+F(0,0\ldots ,0,0)[/tex] It just seems a little unsatisfactory (non-rigorous) to write it out this way.
I'm slightly confused by his notation as well. I see that the above sum can be written as [tex]F(a^{1},\ldots ,a^{n})=F(a^{1},\ldots ,ta^{n})\bigg\vert_{t=0}^{t=1}+\cdots +F(ta^{1},0,\ldots ,0,0)\bigg\vert_{t=0}^{t=1}+F(0,0,\ldots,0,0)[/tex]
but I'm not sure how one can write this as [tex]F(a^{1},\ldots ,a^{n})=F(0,0,\ldots,0,0)+\sum_{\mu =1}^{n}F(a^{1},\ldots ,ta^{\mu},0,\ldots, 0)\bigg\vert_{t=0}^{t=1}[/tex]
Is this just compact notation for noting that as [itex]\mu[/itex] the dimension of the function increases?!
Finally, assuming this notation I see how he can write it as [tex]F(a^{1},\ldots ,a^{n})=F(0,0,\ldots,0,0)+\sum_{\mu =1}^{n}F(a^{1},\ldots ,ta^{\mu},0,\ldots, 0)\bigg\vert_{t=0}^{t=1}\\=F(0,0,\ldots,0,0)+\sum_{\mu =1}^{n}\int_{0}^{1}\frac{d}{dt}F(a^{1},\ldots ,ta^{\mu},0,\ldots, 0)dt[/tex] but in the next step he re-writes this as [tex]F(a^{1},\ldots ,a^{n})=F(0,0,\ldots,0,0)+\sum_{\mu =1}^{n}F(a^{1},\ldots ,ta^{\mu},0,\ldots, 0)\bigg\vert_{t=0}^{t=1}\\ =F(0,0,\ldots,0,0)+\sum_{\mu =1}^{n}\int_{0}^{1}\frac{\partial}{\partial u^{\mu}}F(a^{1},\ldots ,ta^{\mu},0,\ldots, 0)a^{\mu}dt[/tex] Is this simply the chain rule upon defining a function [itex]u^{\mu}(t)=ta^{\mu}[/itex] such that [tex]\frac{d}{dt}F(a^{1},\ldots ,ta^{\mu},0,\ldots, 0)=\frac{d}{dt}F(u)\\=\frac{\partial}{\partial u^{\mu}}F(a^{1},\ldots ,u^{\mu}(t),0,\ldots, 0)\frac{du^{\mu}(t)}{dt}\\ =\frac{\partial}{\partial u^{\mu}}F(a^{1},\ldots ,u^{\mu}(t),0,\ldots, 0)a^{\mu}[/tex]
or is there another reason. Can one assert from this, that as [itex]a=(a^{1},\ldots ,a^{n})[/itex] was chosen arbitrarily from the open ball that we are considering, that this is true [itex]\forall x=(x^{1},\ldots ,x^{n})[/itex] in this open ball, such that [tex]F(x)=F(0,0,\ldots,0,0)+\sum_{\mu =1}^{n}\int_{0}^{1}\frac{\partial}{\partial u^{\mu}}F(x^{1},\ldots ,tx^{\mu},0,\ldots, 0)x^{\mu}dt[/tex]
 
  • #13
"Don't panic!" said:
But I thought the mapping was from a subset of [itex]\mathbb{R}[/itex] to the manifold, i.e. [itex]\gamma : (-\varepsilon, \varepsilon)\rightarrow M[/itex]?
Right, that's correct, and consistent with what I said, but not with the statement I was replying to. I said that the range of an injective curve is a 1-dimensional submanifold of M. That doesn't contradict the statement that the codomain of the curve is M. But in the text I was replying to, you said that the domain of the curve is a submanifold of M (which would make it a subset of M rather than ##\mathbb R##).

"Don't panic!" said:
So is the reason why we need not worry about the introduction of these 1-dimensional coordinate systems the fact that a tangent vector at a point is defined as an equivalence class of curves which will, in general, be parametrised differently (corresponding to them having different 1-dimensional coordinate systems), and will, in general, vary very differently at other points over the manifold. As such, as the definition of a tangent vector is not dependent on anyone particular curve, we do not need to even consider how these curves are parametrised (i.e. we don't need to introduce such 1-dimensional coordinate systems into the definition) and thus the definition is completely coordinate independent?!
You seem to have a pretty good understanding of what's going on, but the statements about parametrization look strange to me. It only makes sense to talk about the parametrization of a curve in M when we use the word "curve" to refer to a set of points in M. But our "curves" are maps ##\gamma:(a,b)\to M##. The range of such a map is a set of points in M. The curve is the parametrization of that set. So you should be talking about different curves, not different parametrizations.

"Don't panic!" said:
Is there any reference for the identity that Isham uses, or is it literally noticing that you can write out an alternating sum of functions (decreasing in dimension) such that [tex]F(a^{1},\ldots ,a^{n})=F(a^{1},\ldots ,a^{n})-F(a^{1},\ldots ,a^{n-1},0)+F(a^{1},\ldots ,a^{n-1},0)-F(a^{1},\ldots ,a^{n-2},0,0)\\ \qquad\qquad\qquad+\cdots +F(a^{1},0,\ldots ,0,0)-F(0,0,\ldots ,0,0)+F(0,0\ldots ,0,0)[/tex] It just seems a little unsatisfactory (non-rigorous) to write it out this way.
I think he just expects people to be familiar enough with induction proofs to recognize this as an argument that can easily be turned into a rigorous proof using induction.

"Don't panic!" said:
I'm slightly confused by his notation as well. I see that the above sum can be written as [tex]F(a^{1},\ldots ,a^{n})=F(a^{1},\ldots ,ta^{n})\bigg\vert_{t=0}^{t=1}+\cdots +F(ta^{1},0,\ldots ,0,0)\bigg\vert_{t=0}^{t=1}+F(0,0,\ldots,0,0)[/tex]
but I'm not sure how one can write this as [tex]F(a^{1},\ldots ,a^{n})=F(0,0,\ldots,0,0)+\sum_{\mu =1}^{n}F(a^{1},\ldots ,ta^{\mu},0,\ldots, 0)\bigg\vert_{t=0}^{t=1}[/tex]
Is this just compact notation for noting that as [itex]\mu[/itex] the dimension of the function increases?!
I'm not sure what the concern is here, because you seem to understand exactly what's going on. Is the issue that a notation like ##F(a^1,\dots,a^{\mu-1},ta^\mu,0\dots,0)## appears to suggest that ##\mu\geq 3##? That's not how it's supposed to be interpreted.

"Don't panic!" said:
Finally, assuming this notation I see how he can write it as [tex]F(a^{1},\ldots ,a^{n})=F(0,0,\ldots,0,0)+\sum_{\mu =1}^{n}F(a^{1},\ldots ,ta^{\mu},0,\ldots, 0)\bigg\vert_{t=0}^{t=1}\\=F(0,0,\ldots,0,0)+\sum_{\mu =1}^{n}\int_{0}^{1}\frac{d}{dt}F(a^{1},\ldots ,ta^{\mu},0,\ldots, 0)dt[/tex] but in the next step he re-writes this as [tex]F(a^{1},\ldots ,a^{n})=F(0,0,\ldots,0,0)+\sum_{\mu =1}^{n}F(a^{1},\ldots ,ta^{\mu},0,\ldots, 0)\bigg\vert_{t=0}^{t=1}\\ =F(0,0,\ldots,0,0)+\sum_{\mu =1}^{n}\int_{0}^{1}\frac{\partial}{\partial u^{\mu}}F(a^{1},\ldots ,ta^{\mu},0,\ldots, 0)a^{\mu}dt[/tex] Is this simply the chain rule
Yes.

"Don't panic!" said:
Can one assert from this, that as [itex]a=(a^{1},\ldots ,a^{n})[/itex] was chosen arbitrarily from the open ball that we are considering, that this is true [itex]\forall x=(x^{1},\ldots ,x^{n})[/itex] in this open ball, such that [tex]F(x)=F(0,0,\ldots,0,0)+\sum_{\mu =1}^{n}\int_{0}^{1}\frac{\partial}{\partial u^{\mu}}F(x^{1},\ldots ,tx^{\mu},0,\ldots, 0)x^{\mu}dt[/tex]
Yes.
 
Last edited:
  • #14
As I said in post #10 (the one you found confusing), I don't see a reason to prefer Isham's approach over the other one you posted. I see nothing wrong with this (my version of the alternative approach):

Let B be an open ball in ##\mathbb R^n## around 0. Let F be a real-valued function on a subset of ##\mathbb R^n## that contains B. If F is smooth on B, then for all x in B, we have
\begin{align}
&F(x)=F(0)+F(x)-F(0)=F(0)+F(tx)\big|_{t=0}^{t=1} =F(0)+\int_0^1\frac{d}{dt}F(tx)dt \\
& =F(0)+\int_0^1\left(\sum_{k=1}^n F_{,k}(tx)x_k\right)dt =F(0)+\sum_{k=1}^n x_k \int_0^1 F_{,k}(tx)dt.
\end{align}
For each k, define ##F_k## by ##F_k(x)=\int_0^1 F_{,k}(tx) dt## for all x in B. Now we can write the above as
$$F(x)=F(0)+\sum_{k=1}^n x_k F_k(x).$$
Now suppose that ##f:M\to\mathbb R## is smooth, that ##p\in M##, and that ##x:U\to\mathbb R^n## is a coordinate system such that ##x(p)=0##. Define ##F=f\circ x^{-1}:x(U)\to\mathbb R##. For each k, define ##f_k=F_k\circ x##. Let ##B## be an open ball around 0 that's also a subset of ##x(U)##. For all ##q\in x^{-1}(B)##, we have
\begin{align}
&f(q)=(f\circ x^{-1}\circ x)(q)=F(x(q)) =F(0)+\sum_{k=1}^n x^k(q)F_k(x(q))\\
&=f(p) +\sum_{k=1}^n x^k(q)f_k(q).
\end{align} We can allow ourselves to write
$$f=f(p)+\sum_{k=1}^n x^kf_k.$$ (The first term should be interpreted as the number f(p) times the identity map). This equality is strictly speaking not true, but it doesn't need to be. What matters is that there's an open neighborhood of ##p## such that the restriction of the left-hand side to that neighborhood is equal to the restriction of the right-hand side to that neighborhood. That's all we need for the final calculation below to be valid.

Before we do it, note that
\begin{align}
&f_k(p)=F_k(x(p))= \int_0^1 F_{,k}(tx(p))dt =\int_0^1 F_{,k}(0)dt =F_{,k}(0)\\
&=(f\circ x^{-1})_{,k}(x(p)) =\left(\frac{\partial}{\partial x^k}\right)_p f.
\end{align}
Now let's wrap things up. For all ##v\in T_pM##, we have
\begin{align}
v(f)=v\left(f(p)+\sum_{k=1}^n x^kf_k\right) =\sum_{k=1}^n \big(v(x^k)f_k(p)+\underbrace{x^k(p)}_{=0} v(f_k)\big) =\sum_{k=1}^n v(x^k)\left(\frac{\partial}{\partial x^k}\right)_p f.
\end{align} Since f is an arbitrary smooth function, this implies that
$$v=\sum_{k=1}^n v(x^k)\left(\frac{\partial}{\partial x^k}\right)_p.$$
 
  • #15
Fredrik said:
Right, that's correct, and consistent with what I said, but not with the statement I was replying to. I said that the range of an injective curve is a 1-dimensional submanifold of M. That doesn't contradict the statement that the codomain of the curve is M. But in the text I was replying to, you said that the domain of the curve is a submanifold of M (which would make it a subset of M rather than R\mathbb R).

I don't quite see how the range is 1-dimensional, the curve maps to points on [itex] M[/itex] which would have the same dimension as the manifold wouldn't they? (sorry if I'm just being really stupid).

Fredrik said:
But our "curves" are maps γ:(a,b)→M\gamma:(a,b)\to M. The range of such a map is a set of points in M. The curve is the parametrization of that set. So you should be talking about different curves, not different parametrizations.

This is what I was trying to explain to my friend, but he when I said that each [itex] t\in (a, b) [/itex] maps to a point on the manifold I couldn't convince him that [itex] t[/itex] doesn't define a 1-dimensional coordinate system, I think he was thinking in terms of classical cases (in Euclidean space), but I tried to explain that the case is the same there as well, as one can always parameterise a curve in Euclidean space such that each value of [itex] t[/itex] corresponds to a set of coordinate values, but the parameter [itex] t[/itex] itself isn't considered as a coordinate.

Fredrik said:
F(0)+∫10(∑k=1nF,k(tx)xk)dt

Is this correct, [itex] F_{, k} (tx)=\frac{\partial F(tx)} {\partial x^{k}} [/itex]? My original confusion came about because in Wald (and a couple of other texts that I've read) he denotes it as [itex] H_{\mu} (x) [/itex] and that [tex] H_{\mu} (x) =\int_{0}^{1} \frac{\partial F(tx)} {\partial x^{\mu}} dt[/tex]
 
Last edited:
  • #16
"Don't panic!" said:
I don't quite see how the range is 1-dimensional, the curve maps to points on [itex] M[/itex] which would have the same dimension as the manifold wouldn't they? (sorry if I'm just being really stupid)
The dimension of a manifold is by definition the (vector space) dimension of the ##\mathbb R^n## that's the codomain of all the coordinate systems. If ##\gamma:(a,b)\to M## is a curve such that ##\gamma^{-1}:\gamma(M)\to(a,b)## can be considered a coordinate system on ##\gamma(M)##, then the fact that ##(a,b)## is a subset of ##\mathbb R## makes ##\gamma(M)## 1-dimensional.

Edit: The notation ##\gamma(M)## doesn't make sense. I should have written ##\gamma\big((a,b)\big)##.

"Don't panic!" said:
Is this correct, [itex] F_{, k} (tx)=\frac{\partial F(tx)} {\partial x^{k}} [/itex]?
It depends on what you mean by what you wrote on the right.

Interpretation 1: Compute the partial derivative of ##F## with respect to the ##k##th variable slot. The result is a function. The notation represents the value of that function at ##tx##.

Interpretation 2: Compute the partial derivative with respect to ##x^k## of the function of ##x## defined by the expression ##F(tx)##. (To be more precise, compute the partial derivative of the function ##x\mapsto F(tx)## with respect to the ##k##th variable slot). The result is a function. The notation represents the value of that function at ##x##.

If you meant what I called "interpretation 1", then yes.

I'm inclined to interpret the notation ##\frac{\partial F(tx)} {\partial x^{k}}## according to interpretation 1 (because it's a partial derivative of F, right?), but I think the only interpretation of the notation ##\frac{\partial}{\partial x^k} F(tx)## that makes sense is interpretation 2. This would give us the horrendously ugly result
$$\frac{\partial}{\partial x^k} F(tx) =t\frac{\partial F(tx)} {\partial x^k}.$$ This sort of thing is why I don't use the ##\partial/\partial x^k## notation in calculus.
 
Last edited:
  • #17
Fredrik said:
γ−1:γ(M)→(a,b)\gamma^{-1}:\gamma(M)\to(a,b) can be considered a coordinate system on γ(M)\gamma(M), then the fact that (a,b)(a,b) is a subset of R\mathbb R makes γ(M)\gamma(M) 1-dimensional.

Isn't [itex] \gamma (M) [/itex] the image set of [itex]\gamma [/itex] though, and as they are points on [/itex] which is locally homeomorphic to [itex] \mathbb {R} ^{n} [/itex], so I would've thought it would be n-dimensional?! (sorry to go on a bit, I'm just getting more confused over the whole situation).
Also, would what I've put here be correct at all?

"Don't panic!" said:
This is what I was trying to explain to my friend, but he when I said that each t∈(a,b) t\in (a, b) maps to a point on the manifold I couldn't convince him that t t doesn't define a 1-dimensional coordinate system, I think he was thinking in terms of classical cases (in Euclidean space), but I tried to explain that the case is the same there as well, as one can always parameterise a curve in Euclidean space such that each value of t t corresponds to a set of coordinate values, but the parameter t t itself isn't considered as a coordinate.

Fredrik said:
Interpretation 1: Compute the partial derivative of FF with respect to the kkth variable slot. The result is a function. The notation represents the value of that function at txtx.

Yes, I assumed by the notation they give that it's the [itex] \mu[/itex]th variable slot, as [itex] F : \mathbb{R} ^{n} \rightarrow \mathbb{R} [/itex] and so the notation [itex] \frac{\partial F} {\partial x^{\mu}} [/itex] is, symbolically, the derivative of [itex] F[/itex] with respect to its [itex] \mu[/itex]th "coordinate" (i.e. the derivative of [itex] F[/itex] with respect to [itex] tx^{\mu} [/itex])?
 
  • #18
Fredrik said:
The dimension of a manifold is by definition the (vector space) dimension of the ##\mathbb R^n## that's the codomain of all the coordinate systems.

While correct, it misses that a manifold also has a very intrinsic dimension. You don't need to have coordinate systems to be able to determine the dimension. In fact, we have something like topological dimension of a manifold. It is intrinsic to the manifold and agrees with the codomain of the coordinate systems. So I'm inclined to take that as the definition of the dimension (although - agreed - most books take your definition, which skips over the subtle point that perhaps two coordinate systems might exist which gives a different dimension, it's difficult to prove this can't occur).

Back to curves then. The image of a curve does not need to be one-dimensional (see space-filling curves). Even with smooth curves things can go wrong. The point is the difference between an immersed and an embedded submanifold. While an (injective) smooth curve always is an immersed submanifold, it doesn't need to be embedded (that is: the topology of the manifolds need not agree). Luckily, a corollary of the inverse function theorem says that we can always limit the domain of a curve so that it becomes embedded. So if you only care about arbitrary small domains of curves (this is the germ approach which is popular in algebraic geometry); then you're ok.
 
  • #19
"Don't panic!" said:
Isn't [itex] \gamma (M) [/itex] the image set of [itex]\gamma [/itex] though, and as they are points on [/itex] which is locally homeomorphic to [itex] \mathbb {R} ^{n} [/itex], so I would've thought it would be n-dimensional?! (sorry to go on a bit, I'm just getting more confused over the whole situation).
Also, would what I've put here be correct at all?
I think you're missing that "local homeomorphic" is a property of an open set, while you're actuing like it is a property of a point. The codomain ##\gamma(M)## is not an open set, so you can't use homeomorphisms.
 
  • #20
Would this be correct though?

This is what I was trying to explain to my friend, but he when I said that each [itex] t\in (a, b) [/itex] maps to a point on the manifold I couldn't convince him that [itex] t[/itex] doesn't define a 1-dimensional coordinate system, I think he was thinking in terms of classical cases (in Euclidean space), but I tried to explain that the case is the same there as well, as one can always parameterise a curve in Euclidean space such that each value of [itex] t[/itex] corresponds to a set of coordinate values, but the parameter [itex] t[/itex] itself isn't considered as a coordinate.

I was trying to rationalise with him why the definition of a tangent vector using this approach is intrinsically coordinate independent? Is what I put correct, or is it more that as it is defined as an equivalence class of curves and therefore not dependent on anyone particular curve it is independent of any coordinate system introduced when specifying the form of a particular curve?!
 
  • #21
"Don't panic!" said:
Isn't [itex] \gamma (M) [/itex] the image set of [itex]\gamma [/itex] though, and as they are points on [/itex] which is locally homeomorphic to [itex] \mathbb {R} ^{n} [/itex], so I would've thought it would be n-dimensional?! (sorry to go on a bit, I'm just getting more confused over the whole situation).
I don't understand why you think the dimension of a submanifold has to be the same as the dimension of the manifold. Have you seen a definition of "dimension" that makes you think that this is the case? This would be very different from how things work with vector spaces. For example, a straight line though the origin in ##\mathbb R^3## is 1-dimensional, not 3-dimensional.

And oops, I see now that I wrote "##\gamma(M)##". That notation makes no sense.. Maybe that has contributed to the confusion. The range of ##\gamma:(a,b)\to M## is of course ##\gamma\big((a,b)\big)##. If ##\gamma## is a smooth injective curve, then ##\gamma^{-1}:\gamma\big((a,b)\big)\to(a,b)## is a homeomorphism. Since the range of this map is ##(a,b)##, which is a subset of ##\mathbb R##, the submanifold ##\gamma\big((a,b)\big)## is 1-dimensional.

"Don't panic!" said:
Also, would what I've put here be correct at all?
##\gamma## takes each ##t\in (a,b)## to a point ##\gamma(t)## in M. The inverse of of ##\gamma## (assuming that this is an injective curve) is a coordinate system on a 1-dimensional submanifold, but it's not one of the coordinate systems associated with M itself. So you got that last part right at least.
 
Last edited:
  • #22
micromass said:
While correct, it misses that a manifold also has a very intrinsic dimension. You don't need to have coordinate systems to be able to determine the dimension. In fact, we have something like topological dimension of a manifold. It is intrinsic to the manifold and agrees with the codomain of the coordinate systems.
Cool. I have heard of the concept, but I have never studied it.

micromass said:
Back to curves then. The image of a curve does not need to be one-dimensional (see space-filling curves).
I was thinking that if we restrict the domain of a smooth curve sufficiently, it will be injective...and I see that your comment supports that idea.
 
  • #23
Fredrik said:
I was thinking that if we restrict the domain of a smooth curve sufficiently, it will be injective...and I see that your comment supports that idea.

Well no, take the constant curve for example.
 
  • #24
micromass said:
While correct, it misses that a manifold also has a very intrinsic dimension. You don't need to have coordinate systems to be able to determine the dimension. In fact, we have something like topological dimension of a manifold. It is intrinsic to the manifold and agrees with the codomain of the coordinate systems. So I'm inclined to take that as the definition of the dimension (although - agreed - most books take your definition, which skips over the subtle point that perhaps two coordinate systems might exist which gives a different dimension, it's difficult to prove this can't occur).

Back to curves then. The image of a curve does not need to be one-dimensional (see space-filling curves). Even with smooth curves things can go wrong. The point is the difference between an immersed and an embedded submanifold. While an (injective) smooth curve always is an immersed submanifold, it doesn't need to be embedded (that is: the topology of the manifolds need not agree). Luckily, a corollary of the inverse function theorem says that we can always limit the domain of a curve so that it becomes embedded. So if you only care about arbitrary small domains of curves (this is the germ approach which is popular in algebraic geometry); then you're ok.

What do you mean by intrinsic dimension? By the definition I know, a space is an n- manifold if every point has a neighborhood that is homeomorphic to ## \mathbb R^n ##.
 
  • #25
Fredrik said:
I don't understand why you think the dimension of a submanifold has to be the same as the dimension of the manifold. Have you seen a definition of "dimension" that makes you think that this is the case? This would be very different from how things work with vector spaces. For example, a straight line though the origin in R3\mathbb R^3 is 1-dimensional, not 3-dimensional.

Yes, sorry I think I was just stressing out about it a bit and started conflating ideas. I think I sometimes get a bit lost in the abstraction, as I can easily see how a curve in [itex] \mathbb{R} ^{3}[/itex] as it can be described in terms of a single parameter (hence 1-dimensional).

Fredrik said:
And oops, I see now that I wrote "γ(M)\gamma(M)". That notation makes no sense.. Maybe that has contributed to the confusion. The range of γ:(a,b)→M\gamma:(a,b)\to M is of course γ((a,b))\gamma\big((a,b)\big). If γ\gamma is a smooth injective curve, then γ−1:γ((a,b))→(a,b)\gamma^{-1}:\gamma\big((a,b)\big)\to(a,b) is a homeomorphism. Since the range of this map is (a,b)(a,b), which is a subset of R\mathbb R, the submanifold γ((a,b))\gamma\big((a,b)\big) is 1-dimensional.

Fredrik said:
γ\gamma takes each t∈(a,b)t\in (a,b) to a point γ(t)\gamma(t) in M. The inverse of of γ\gamma (assuming that this is an injective curve) is a coordinate system on a 1-dimensional submanifold, but it's not one of the coordinate systems associated with M itself. So you got that last part right at least.

I think this all makes sense now, thanks for your patience!
 
  • #26
Sorry to bring up a previous question, but why would it be incorrect to do a proof of Hadamard's lemma using the following approach?!

Let [itex]\gamma :[0,1]\rightarrow\mathbb{R}^{n}[/itex] be a curve in [itex]\mathbb{R}^{n}[/itex] such that [itex]\gamma (0)=a\in\mathbb{R}^{n}[/itex] and [itex]\gamma (1)=x\in\mathbb{R}^{n}[/itex], hence a fixed value of [itex]t\in [0,1][/itex] identifies a point in [itex]\mathbb{R}^{n}[/itex] by [itex]\gamma (t)[/itex]. The curve [itex]\gamma[/itex] is such that it passes through the fixed point [itex]a\in\mathbb{R}^{n}[/itex] and some arbitrary point [itex]x\in\mathbb{R}^{n}[/itex]. The coordinate representation of the curve is given by $$\gamma (t)=\left(x^{1}(t),\ldots,x^{n}(t)\right)=\gamma (0)+t\left(\gamma (1)-\gamma (0)\right)=a+t\left(x-a\right)$$ where [itex]x^{\mu}:[0,1]\rightarrow\mathbb{R}[/itex] are the coordinate functions that specify the coordinates of the curve [itex]\gamma[/itex] in [itex]\mathbb{R}^{n}[/itex] for each value of [itex]t\in [0,1][/itex]. We thus define the coordinate functions [itex]x^{\mu}[/itex] such that $$x^{\mu}(t)=a^{\mu}+t\left(x^{\mu}-a^{\mu}\right)$$ where [itex]a^{\mu}, x^{\mu}\in\mathbb{R}[/itex].

Now consider the function [itex]h:[0,1]\rightarrow\mathbb{R}[/itex], defined such that $$h(t)=(F\circ\gamma)(t)=F(\gamma(t))=F(a+t(x-a))$$ where [itex]F:\mathbb{R}^{n}\rightarrow\mathbb{R}[/itex] is [itex]C^{\infty}[/itex]. It follows that $$\frac{dh}{dt}=h'(t)=\frac{d}{dt}\left[(F\circ\gamma)(t)\right]=\frac{d}{dt}\left[F\left((x^{1}(t),\ldots,x^{n}(t))\right)\right]\\ \qquad=\sum_{\mu =1}^{n}\frac{\partial F}{\partial x^{\mu}}\frac{dx^{\mu}}{dt}=\sum_{\mu =1}^{n}\frac{\partial}{\partial x^{\mu}}\left[F(a+t(x-a))\right]\left(x^{\mu}-a^{\mu}\right).$$ Observe that $$h(1)-h(0)=F(x)-F(a)=\int_{0}^{1}h'(t)dt=\int_{0}^{1}\sum_{\mu =1}^{n}\frac{\partial}{\partial x^{\mu}}\left[F(a+t(x-a))\right]\left(x^{\mu}-a^{\mu}\right)dt =\sum_{\mu =1}^{n}\left(x^{\mu}-a^{\mu}\right)\int_{0}^{1}\frac{\partial}{\partial x^{\mu}}\left[F(a+t(x-a))\right]dt \\ =\sum_{\mu =1}^{n}\left(x^{\mu}-a^{\mu}\right)H_{\mu}(x)$$ where [itex]H_{\mu}(x)= \int_{0}^{1}\frac{\partial}{\partial x^{\mu}}\left[F(a+t(x-a))\right]dt[/itex].
Note that [itex] F[/itex] is [itex]C^{\infty}[/itex] and so it follows that [itex] H_{\mu}[/itex] are also [itex]C^{\infty}[/itex].

Finally, taking derivatives of both sides with respect to [itex]x^{\mu}[/itex] we have that $$\frac{\partial F}{\partial x^{\mu}}=H_{\mu}(x)+\left(x^{\nu}-a^{\nu}\right)\frac{\partial H_{\nu}}{\partial x^{\mu}}$$ and so evaluating this at [itex] x=a[/itex] gives $$\frac{\partial F}{\partial x^{\mu}}\bigg\vert_{x=a}=H_{\mu}(a)$$ as claimed.
 
  • #27
I don't think there's anything wrong with it. I think the following version is slightly simpler though. I would write
\begin{align}
F(x)-F(a) &=\int_0^1 \frac{d}{dt}F\big(a+t(x-a)\big) dt =\int_0^1 \left(\sum_{i=1}^n F_{,i}\big(a+t(x-a)\big) (x-a)_i \right)dt\\
&=\sum_{i=1}^n (x-a)_i \int_0^1 F_{,i}(a+t(x-a)) dt,
\end{align} then define ##H_i## by
$$H_i(x)=\int_0^1 F_{,i}(a+t(x-a)) dt$$ for all ##x##, and then note that
$$H_i(a)=\int_0^1 F_{,i}(a) dt =F_{,i}(a)\int_0^1 dt =F_{,i}(a).$$ This is essentially the same thing you did in a different notation.
 
Last edited:
  • #28
Thanks for taking a look at it. I know I was quite explicit with all the steps, but I just wanted to check that I was doing all correctly really. Was the first part of, with defining the curve [itex] \gamma[/itex], etc?!
 
  • #29
That part is strangely worded. Your "let ##\gamma##" statement makes it sound like ##\gamma## is an arbitrary curve such that ##\gamma(0)=a## and ##\gamma(1)=x##, and later you're using that ##\gamma(t)=a+t(x-a)##, so it looks like you're making the claim that every curve ##\gamma## such that ##\gamma(0)=a## and ##\gamma(1)=x## is a straight line. It would be better to start with something like this: Let ##\gamma:\mathbb R\to\mathbb R^n## be the curve defined by ##\gamma(t)=a+t(x-a)## for all ##t\in\mathbb R##. (The domain of this curve doesn't have to be ##\mathbb R##, but if you want to say that the curve is smooth on [0,1], the domain must be a set that contains an open set that contains [0,1]).
 
  • #30
Fredrik said:
It would be better to start with something like this: Let γ:R→Rn\gamma:\mathbb R\to\mathbb R^n be the curve defined by γ(t)=a+t(x−a)\gamma(t)=a+t(x-a) for all t∈Rt\in\mathbb R.

So would it be better to start with this and then require that [itex] \gamma [/itex] is smooth in the interval [itex] [0,1][/itex], and then proceed with the rest of the proof as I have in the previous post (apart from the change in notation as you suggested)?
 
  • #31
If you define ##\gamma:\mathbb R\to\mathbb R^n## the way I did, you don't have to require it to be smooth. It will simply be smooth.

The notation is a matter of taste.
 
  • #32
Ok, great. Thanks for your help :)
 
  • #33
"Don't panic!" said:
Would this be correct though?

This is what I was trying to explain to my friend, but he when I said that each [itex] t\in (a, b) [/itex] maps to a point on the manifold I couldn't convince him that [itex] t[/itex] doesn't define a 1-dimensional coordinate system, I think he was thinking in terms of classical cases (in Euclidean space), but I tried to explain that the case is the same there as well, as one can always parameterise a curve in Euclidean space such that each value of [itex] t[/itex] corresponds to a set of coordinate values, but the parameter [itex] t[/itex] itself isn't considered as a coordinate.

A curve does not define a coordinate chart on a 1-dimensional sumbamnifold for a couple of reasons.

- A coordinate chart on a 1 dimensional manifold maps an open neighborhood on the manifold into R. But a curve maps an open neighborhood in R into the manifold. This is called a " parameterization" of a neighborhood if it is invertible and its inverse is smooth.

- But for an arbitrary smooth curve, its inverse may not even be differentiable. For instance inverse of the curve ## t-> t^3## is not differentiable at 0.

- For a submanifold of a larger dimensional manifold, the a coordinate chart on the submanifold alone is not considered to be a coordinate chart on the submanifold unless it can be smoothly extended to an open neighborhood in the larger dimensional manifold as well. In general, a mapping of any subset of a smooth manifold is called smooth if it can be smoothly extended to an open neighborhood in the ambient manifold.

So a submanifold is defined as a subset so that around each point there is an open neighborhood in the subspace topology that is homeomorphic to an open set in euclidean space and such a homeomorphism can be chosen so that it and its inverse are smooth. This means that the homeomorphism must be smoothly extendable to an open set in the larger manifold.
I was trying to rationalise with him why the definition of a tangent vector using this approach is intrinsically coordinate independent? Is what I put correct, or is it more that as it is defined as an equivalence class of curves and therefore not dependent on anyone particular curve it is independent of any coordinate system introduced when specifying the form of a particular curve?!

Your expression "intrinsically coordinate independent" seems vague to me. Can you define it more precisely?

While in my mind, intrinsic can have more than one meaning, one idea is that the calculation of the quantity always gives the same answer in any coordinate system. To me this is what "coordinate independent" means. It does not mean that you have to define it without coordinate systems.

While it is true that defining tangent vectors as equivalence classes of curves appears not to use coordinates, one is assuming the idea of differentiability and differentiable can not be defined without reference to coordinate systems. How do you know that a function on a manifold is differentiable? When its composition with a parameterization is a differentiable function on Euclidean space. That requires the idea of coordinate systems.

- If one does use coordinate systems then for each parameter neighborhood of a point, one can define the directional derivative of a function by composing it with the parameter mapping. This has the great advantage of immediately showing that the differential of the function is a linear map.

If you do it this way, you need to somehow identify directions in different parameterizations. This is done using the Chain Rule. So tangent vectors are thought of directions in different parameterizations that are pasted together - identified. This shows that tangent vectors are intrinsic.

It also shows that tangent vectors form a vector space since the differentials of coordinate transformations are linear maps.
 
Last edited:
  • #34
lavinia said:
- A coordinate chart on a 1 dimensional manifold maps an open neighborhood on the manifold into R. But a curve maps an open neighborhood in R into the manifold. This is called a " parameterization" of a neighborhood if it is invertible and its inverse is smooth.

My (perhaps incorrect?!) intuition behind this was that a curve is a map that assigns a real value [itex]t[/itex] to each point [itex]p[/itex] on a manifold [itex]M[/itex], such that the points trace out a curve on [itex]M[/itex], however, it does not localise the position of each point that it maps to on [itex]M[/itex]; this requires the introduction of a local coordinate chart such that we can describe the positions of the points of the points on [itex]M[/itex] in [itex]\mathbb{R}^{n}[/itex].

lavinia said:
Your expression "intrinsically coordinate independent" seems vague to me. Can you define it more precisely?

By this I basically meant what you have said, i.e. "the calculation of the quantity always gives the same answer in any coordinate system", but you're right, my wording was a little vague, apologies for that.

lavinia said:
If you do it this way, you need to somehow identify directions in different parameterizations. This is done using the Chain Rule. So tangent vectors are thought of directions in different parameterizations that are pasted together - identified.

By this do you is it mean that the curves in each equivalence class identify a particular direction along the manifold (at some point on the manifold) thus providing a notion of direction to the tangent vector that each equivalence class is identified with?!
 
  • #35
"Don't panic!" said:
My (perhaps incorrect?!) intuition behind this was that a curve is a map that assigns a real value [itex]t[/itex] to each point [itex]p[/itex] on a manifold [itex]M[/itex], such that the points trace out a curve on [itex]M[/itex], however, it does not localise the position of each point that it maps to on [itex]M[/itex]; this requires the introduction of a local coordinate chart such that we can describe the positions of the points of the points on [itex]M[/itex] in [itex]\mathbb{R}^{n}[/itex].
I have not seen it defined this way. I think of a curve as the path of a particle.

By this do you is it mean that the curves in each equivalence class identify a particular direction along the manifold (at some point on the manifold) thus providing a notion of direction to the tangent vector that each equivalence class is identified with?!

Well if you mean by direction a vector.

Given a parameterization,φ, of a domain on a manifold, the composed function ## F(x) = f(φ(x))## can be differentiated with respect to a vector,h, by taking the limit of the Newton quotient

(F( x + th) - F(x))/t.

(By directional derivative I mean this rather than restricting h to be just a unit vector.)

This definition works with respect to any parameterization but one needs to compare vectors in two different parameterizations in order to say when the vectors are the same
This is done with coordinate transformations. The differential of a coordinate transformation will map some vector,v, in another parameterization to the vector,h,. v and h are considered to be the same in the tangent space to the manifold.
 
Last edited:
<h2>1. What are tangent vectors?</h2><p>Tangent vectors are vectors that are tangent to a curve or surface at a specific point. They represent the direction and magnitude of change at that point.</p><h2>2. How are tangent vectors related to directional derivatives?</h2><p>Tangent vectors can be thought of as the direction in which the directional derivative is taken. The directional derivative measures the rate of change of a function in a specific direction, and the tangent vector represents that direction.</p><h2>3. Can tangent vectors be negative?</h2><p>Yes, tangent vectors can have a negative direction. This means that the function is decreasing in that direction, and the directional derivative will also be negative.</p><h2>4. How are tangent vectors used in real-world applications?</h2><p>Tangent vectors are used in many fields of science, such as physics, engineering, and computer graphics. They are used to model the motion of objects, calculate rates of change, and determine the direction of forces.</p><h2>5. How are tangent vectors calculated?</h2><p>Tangent vectors can be calculated using calculus, specifically the derivative. For a curve, the tangent vector is the derivative of the curve's equation at a specific point. For a surface, the tangent vector is calculated using the partial derivatives of the surface's equation at a specific point.</p>

1. What are tangent vectors?

Tangent vectors are vectors that are tangent to a curve or surface at a specific point. They represent the direction and magnitude of change at that point.

2. How are tangent vectors related to directional derivatives?

Tangent vectors can be thought of as the direction in which the directional derivative is taken. The directional derivative measures the rate of change of a function in a specific direction, and the tangent vector represents that direction.

3. Can tangent vectors be negative?

Yes, tangent vectors can have a negative direction. This means that the function is decreasing in that direction, and the directional derivative will also be negative.

4. How are tangent vectors used in real-world applications?

Tangent vectors are used in many fields of science, such as physics, engineering, and computer graphics. They are used to model the motion of objects, calculate rates of change, and determine the direction of forces.

5. How are tangent vectors calculated?

Tangent vectors can be calculated using calculus, specifically the derivative. For a curve, the tangent vector is the derivative of the curve's equation at a specific point. For a surface, the tangent vector is calculated using the partial derivatives of the surface's equation at a specific point.

Similar threads

  • Differential Geometry
Replies
2
Views
511
Replies
9
Views
3K
Replies
4
Views
1K
Replies
4
Views
2K
  • Differential Geometry
Replies
10
Views
639
  • Differential Geometry
Replies
21
Views
2K
  • Differential Geometry
Replies
21
Views
553
Replies
4
Views
1K
  • Differential Geometry
Replies
3
Views
1K
  • Differential Geometry
Replies
5
Views
2K
Back
Top