Dismiss Notice
Join Physics Forums Today!
The friendliest, high quality science and math community on the planet! Everyone who loves science is here!

Why the basis of the tangent space of a manifold is some partials?

  1. Jun 20, 2010 #1
    it is quite peculiar

    i know you do not want to embed the manifold into a R^n Euclidean space

    but still it is too peculiar

    it is hard to develop some intuition
  2. jcsd
  3. Jun 20, 2010 #2


    User Avatar
    Science Advisor
    Homework Helper
    Gold Member

    Are you really asking why the basis of the tangent space of a manifold is some partials like the title of the thread suggests, or why the tangent space of a manifold is defined the way it is as the vector space of derivations at p on the germs of smooth functions?
  4. Jun 20, 2010 #3

    i mean the latter
  5. Jun 20, 2010 #4


    User Avatar
    Staff Emeritus
    Science Advisor
    Gold Member

    There's a more intuitive definition. If A and B are curves such that A(0)=B(0)=p, and x is a coordinate system, then we can say that A and B are equivalent if [itex]x\circ A[/itex] and [itex]x\circ B[/itex] have the same derivative at 0. (These are curves in [itex]\mathbb R^n[/itex], so this doesn't involve differential geometry). Then we define the tangent space at p as the set of equivalence classes of curves that pass through p at parameter value 0. It's easy to show that the equivalence relation doesn't depend on which coordinate system is used in its definition.

    That set can be given a vector space structure, again using coordinate systems, and then proving that the definition is independent of the coordinate systems used. If we call that vector space TpM, and the vector space of derivations DpM, we can show that the function [itex]\phi:T_pM\rightarrow D_pM[/itex] defined by [itex]\phi([A])(f)=(f\circ A)'(0)[/itex] is a vector space isomorphism. Of course, first we have to prove that it's a well-defined function, by showing that the right-hand side doesn't depend on which member A we choose from the equivalence class [A].

    See Isham for details. (It also has a nice proof of the (non-trivial) claim that the set of partial derivative operators associated with a coordinate system is a basis for DpM).
  6. Jun 20, 2010 #5
    The basis doesn't have to be the partials, but the partials are the most natural choice. You can convert between the basis of partials and any other basis by the vielbein.

    I'm not sure if this is accurate to say or not, but the basis in classical differential geometry is:

    [tex]e_i=\frac{\partial \vec_{r}}{\partial x^i} [/tex]

    where r is the position vector in a euclidean embedding space. modern differential geometry just gets rid of the r:

    [tex]e_i=\frac{\partial}{\partial x^i} [/tex]
  7. Jun 20, 2010 #6
    I don't know if this is the type of answer that you are looking for, but, following
    up on Fredrik's post, you can show that every derivation is a partial derivative,
    (and the fact that the space of derivations has the same dimension as the
    manifold in which it lives) so that the partial derivatives, as derivations/operators,
    span the tangent space. Sorry, my knowledge of physics is too limited to give
    you a physical explanation.
  8. Jun 20, 2010 #7

    i understand this literally

    but i mean it is a bit peculiar, it is non-intuitive

    maybe i need to get accustomed to this
  9. Jun 20, 2010 #8


    User Avatar
    Science Advisor
    Homework Helper
    Gold Member

    The most difficult part is perhaps to identify what it mean to define a notion of tangent space for an abstract manifold.

    We have the notion of the tangent space for a k-manifold M embedded in some R^n: if [itex]\sigma:U\subset \mathbb{R}^k\rightarrow \mathbb{R}^n[/itex] is a local parametrization of M with [itex]\sigma(0)=p[/itex], then T_pM is defined as the k-dimensional subspace [itex]\mathrm{Im}(D\sigma_0)[/itex]. A basis for it is [itex]\left(\frac{\partial\sigma/}{\partial x^i}(0)\right)[/itex], and if [itex]\tau:V\subset \mathbb{R}^k\rightarrow \mathbb{R}^n[/itex] is another local parametrization of M with [itex]\tau(0)=p[/itex], then by the chain rule, the relation between the basis [itex]\left(\frac{\partial\sigma}{\partial x^i}(0)\right)[/itex] and [itex]\left(\frac{\partial\tau}{\partial x^i}(0)\right)[/itex] is

    [tex]\frac{\partial\tau}{\partial x^i}(0)=\sum_{j=1}^k\frac{\partial(\sigma^{-1}\circ\tau)^j}{\partial x^i}(0)\frac{\partial\sigma}{\partial x^j}(0)[/tex]

    My guess then is that the "tangent space problem" is (or rather, was to the pioneers of the theory of calculus on manifolds) a way to associate a k-dimensional vector space T_pM at every point p of an abstract smooth k-manifold M in such a way that each local parametrization of M induces a basis in such a way that if (v_1,...,v_k) and (w_1,...,w_k) are two basis induced by local parametrization [itex]\sigma[/itex] and [itex]\tau[/itex], then

    [tex]w_i=\sum_{j=1}^k\frac{\partial(\sigma^{-1}\circ\tau)^j}{\partial x^i}(0)v_i[/tex]

    (compare with the above relation for the classical tangent space)

    Indeed, if this program is carried out succesfully, then we will have found way to associate a vector space to each point of a manifold that coincides for manifold embedded in R^n with the classical notion of the tangent space (up to the canonical isomorphism [itex]v_i\leftrightarrow \partial\sigma/\partial x^i[/itex]).

    There are at least 3 substantially different ways to solve the tangent space problem, all of which are pretty much equally abstract and initially unintuitive. One of which is the "derivation at p" solution.
  10. Jun 20, 2010 #9


    User Avatar
    Science Advisor
    Homework Helper
    Gold Member

    It does seem to get better with time :)
  11. Jun 21, 2010 #10
    I've only dipped into this so far, so there may be an explanation in there that I haven't got to yet... but in Isham's definition 2.4, pp. 73-74, part 2, he writes

    (I changed the notation slightly in eq. 2.3.2, as I don't know how to do that full-length vertical line in LaTex.)

    My question: suppose we have a curve in the Euclidean plane, such as a circle, paramaterized in orthonormal coordinates by [itex](\cos(t),\sin(t)), t \in [0,2pi)[/itex], and another curve, say [itex](\cos(t),2\sin(t)), t \in [0,2pi)[/itex]. Would these curves be called tangent at [itex]t=0[/itex]? I would expect yes, as the derivatives are parallel here, but the derivative of the latter curve with respect to t, at this point, has twice the magnitude of the derivative of the unit circle, and so seems not to meet condition (b). Does this definition assume that t is arc length so that any curve has unit speed?
  12. Jun 22, 2010 #11


    User Avatar
    Staff Emeritus
    Science Advisor
    Gold Member

    I don't think I even thought of that when I read the book. :redface: I don't really have time to think about it now, but if you figure it out on your own, let me know.

    You can use \bigg| to get a result like this:

    [tex]\frac{d}{dt}\bigg|_0 f(t)=f'(0)[/tex]
  13. Jun 22, 2010 #12
    Here is my shot at explaining why partials are used as the basis for the tangent space. First it is useful to think a little bit about the manifold itself:

    There are two related approaches that can be used to describe a manifold. We can represent it in terms of a set of n coordinate functions, where n is the dimension of the manifold (such as x, y, and z in euclidean 3 space). The coordinate functions may be embedded in a larger space (such as functions to describe the 2D surface of a sphere in 3D space).

    Another way we tend to represent a manifold is in terms of coordinate lines (i.e. parameterized coordinate curves). The coordinate curves can be derived from the set of coordinate functions, and for each coordinate function, there is a corresponding set of coordinate curves. If you draw a picture of the coordinate functions, you can visualize the coordinate curves. For example, in polar coordinates, the coordinate curves associated with the coordinate r are drawn along lines of constant [tex]\theta[/tex]. The parameterization of the coordinate lines is defined by the function r, or in other words, the rate of change of the coordinate curve parameter exactly agrees with the rate of change of the function r. The coordinate curve can be written as:

    [tex] [x,y] = [r cos(\theta_0), r sin(\theta_0)] [/tex]

    Since this is a one dimensional curve, [tex]\theta_0[/tex] is held constant while r varies. The associated family of curves is obtained by adjusting the constant [tex]\theta_0[/tex]. The coordinate curve for [tex]\theta[/tex] is given by:

    [tex][x,y] = [r_0 cos(\theta), r_0 sin(\theta)] [/tex]

    where [tex]r_0[/tex] is held constant.

    The coordinate functions and coordinate curves each define a basis for the manifold, and are referred to as duals of one another, partially because if one is given, the other can be derived. A basis for the tangent space can be derived by taking the derivative of the coordinate line with respect to the curve parameter. For the r coordinate curve above, the curve parameter is r, and we get a basis vector:

    [tex][\frac{dx}{dr}, \frac{dy}{dr}] = [cos(\theta_0), sin(\theta_0)] [/tex]

    Another way to write this is the partial notation (that comes from the chain rule):

    [tex] \frac{d}{dr} = \frac{dx}{dr}\frac{\partial}{\partial x} + \frac{dy}{dr}\frac{\partial}{\partial y} = cos(\theta_0) \frac{\partial}{\partial x} + sin(\theta_0) \frac{\partial}{\partial y} [/tex]

    So, why the partial notation? Since the coordinate curve for r is a parameterized curve in space, I can use it to take a directional derivative. That is, I can overlay a function from space onto the curve and compare the rate of change of the function compared to the curve parameter (i.e. I can calculate [tex]\frac{df}{dr}[/tex]). I can do this by applying [tex]\frac{d}{dr}[/tex] as defined above to some function of x and y. For example if f=x then:

    [tex]\frac{df}{dr} = cos(\theta_0) \frac{\partial(x)}{\partial x} + sin(\theta_0) \frac{\partial (x)}{\partial y} = cos(\theta_0) [/tex]

    The second basis tangent vector comes from the equation for the [tex]\theta[/tex] curve, and comes again by using the chain rule:

    [tex]\frac{d}{d \theta} = \frac{dx}{d \theta}\frac{\partial }{\partial x} + \frac{dy}{d \theta}\frac{\partial}{\partial y} = - r_0 sin(\theta) \frac{\partial}{\partial x} + r_0 cos(\theta) \frac{\partial}{\partial y} [/tex]

    So in general, the tangent basis can be derived once a coordinate chart is chosen, and the coordinate curves are calculated. The tangent basis can be thought of a set of directional derivatives along the coordinate curves with respect to the curve parameter (which is defined by the associated coordinate function). It is a way of defining derivatives with respect to the selected coordinate functions.
    Last edited: Jun 22, 2010
  14. Jun 23, 2010 #13
    The following is something that is not exact in so far as ignoring the tangle of maps, but may be an intuitive start to the graphically minded. The action of a vector on a dual vector is the scalar [itex]df/d\lambda[/itex]. This is the action of a directional derivative on a scalar valued function.

    [tex]\frac{df}{d\lambda} = \frac{\partial f}{\partial x^i}\frac{dx^i}{d \lambda} [/tex]

    [tex]\frac{df}{d\lambda} = \frac{\partial f}{\partial x^i}\ \frac{\partial x^i}{\partial x^j} \frac{dx^j}{d \lambda} [/tex]

    [tex]\frac{df}{d\lambda} = \frac{\partial f}{\partial x^i}\ \partial_j dx^i \ \frac{dx^j}{d \lambda} [/tex]

    We will demand that
    [tex]\partial_j dx^i} = \delta^i_j[/tex]​
    for the reason that follows. We can rearrange, and partition the right side of the equation, so long as we remember that the action of [itex]\partial_j[/itex] is only upon [itex]dx^i[/itex].

    [tex] \frac{df}{d\lambda} = \left[ \frac{dx^j}{d \lambda} \partial_j \right] \left[ \frac{\partial f}{\partial x^i} dx^i \right] [/tex]

    [tex] V^j \hat{e}_j = \left[ \frac{dx^j} {d \lambda} \partial_j \right] [/tex]​
    is a vector with basis [itex]\partial_j[/itex] acting upon the dual vector, or covector, or one-form (or anyway you find it named),

    [tex] U_i \hat{e}^i = \left[ \frac{\partial f}{\partial x^i} dx^i \right] [/tex]​
    with bais [itex]dx^i[/itex].

    The demand that
    [tex]\partial_j dx^i} = \delta^i_j[/tex]​
    is exactly the requirement of dual vector basis.

    The action of a vector on a one-from is the action of the vector basis upon the one-form basis, where the vector and dual vector coefficents act though direct multiplication.
    Last edited: Jun 23, 2010
  15. Jun 23, 2010 #14
    One thing you will begin to notice is that the chain rule plays a fundamental role in differential geometry. In differential geometry, we don't want to restrict ourselves to the viewpoint of a single coordinate system, and the chain rule is the rule of transforming between coordinate systems. Because of this it is useful to represent differential objects in a notation that is natural for the chain rule.

    It does seem a little disconcerting to use partials as basis vectors at first, particularly if you are thinking of a tangent basis at a point on the manifold. This notation is more naturally suited to expressing vector fields that fill the manifold rather the basis at a point.

    You may remember that an ordinary differential equation can be expressed in first order form (i.e. a form in which no derivative higher than first order appears), and that this can be represented as a vector field on state space. Commonly ordinary differential equations are represented in the notation:

    [tex] \frac {dx_1}{dt} = f_1(x), \frac {dx_2}{dt} = f_2(x),..., \frac {dx_n}{dt} = f_n(x) [/tex]

    It is trivial to convert this to vector field form when you are using partial notation, because the vector field form looks just like the chain rule (on purpose):

    [tex]\frac{d}{dt} = \frac{dx_1}{dt} \frac{\partial}{\partial x_1} + \frac{dx_2}{dt} \frac{\partial}{\partial x_2} + ... + \frac{dx_n}{dt} \frac{\partial}{\partial x_n} \\
    = f_1(x) \frac{\partial}{\partial x_1} + f_2(x) \frac{\partial}{\partial x_2} + ... + f_n(x) \frac{\partial}{\partial x_n} [/tex]

    When the differential equation is written in this form, the rule for transforming the differential equation to a new coordinate system becomes obvious. Suppose we would like to transform to a new coordinate system y, then the differential equation can be written as:

    [tex] \frac{d}{dt} = \frac{dx_1}{dt} \frac{\partial y_i}{\partial x_1} \frac{\partial}{\partial y_i} + \frac{dx_2}{dt} \frac{\partial y_i}{\partial x_2} \frac{\partial}{\partial y_i} + ... + \frac{dx_n}{dt} \frac{\partial y_i}{\partial x_n} \frac{\partial}{\partial y_i}[/tex]

    Or this can be represented in the more compact notation:

    [tex] \frac{d}{dt} = \frac{dx_j}{dt} \frac{\partial y_i}{\partial x_j} \frac{\partial}{\partial y_i} [/tex]

    Note that the differential equation can be integrated, and that the solution curves are paratmeterized lines that fill the space. The derivative of the solution curves is the differential equation, or the vector field. The vector field can be used as a derivative operator (the Lie derivative), which has a natural interpretation as comparing a function q with the solution parameter t (i.e. [tex]\frac{dq}{dt}[/tex]) along the solution curve of the differential equation. Differential geometry is filled with chain rule operations when dealing with differential objects (such as ODEs or tensor fields), so the partial notation for a basis of the tangent space is natural and convenient when working with these kind of objects.
    Last edited: Jun 23, 2010
Share this great discussion with others via Reddit, Google+, Twitter, or Facebook