General vectors and tangent space

In summary: In this case, we have the four basis vectors \hat{U}(x^\mu), \hat{U}(y^\mu), \hat{U}(z^\mu), \hat{U}(t^\mu) (where x^\mu, y^\mu, z^\mu, and t^\mu are the components of the directional derivative in the coordinate system).
  • #1
sunrah
199
22
Given a scalar function g defined on a manifold and a curve f:λ -> xa, the change of the function along the curve is

[itex]\frac{dg}{d\lambda} = \frac{dg}{dx^{a}}\frac{dx^{a}}{d\lambda} = T^{a}\frac{dg}{dx^{a}}[/itex]

where

[itex]\frac{dx^{a}}{d\lambda} = T^{a}[/itex] is the tangent to the curve.

The argument that I don't understand is that this

[itex]T^{a}\frac{d}{dx^{a}}[/itex]

is a vector. To me it looks like the inner product of two vectors, [itex]\vec{T} = (T_{x}, T_{y})[/itex] and ∇x,y, so looks like a scalar to me.

Also who do the coefficients of the gradient necessarily form a basis for the tangent space?
 
Physics news on Phys.org
  • #2
sunrah said:
The argument that I don't understand is that this

[itex]T^{a}\frac{d}{dx^{a}}[/itex]

is a vector.
Let M be an ##n##-dimensional smooth manifold. Let F be the set of smooth real-valued functions with domain M. The set of all ##v:F\to\mathbb R## satisfies the definition of a vector space. So all its elements can be called "vectors". For each ##p\in M##, the set of all linear ##v:F\to\mathbb R## that satisfy the condition ##v(fg)=v(f)g(p)+f(p)v(g)## is an ##n##-dimensional subspace of that vector space. This subspace is denoted by ##T_pM## and is called the tangent space of M at p. The elements of ##T_pM## are called tangent vectors at p.

For each coordinate system (=chart) ##x:U\to\mathbb R## and each ##i\in\{1,\dots,n\}##, the map ##\frac{\partial}{\partial x^i}\big|_p:F\to\mathbb R## defined by
$$\frac{\partial}{\partial x^i}\bigg|_p f=(f\circ x^{-1})_{,i}(x(p))$$ for all ##f\in F##, is an element of ##T_pM##. The comma denotes the kind of partial differentiation that you are familiar with from calculus. For example, if ##g:\mathbb R^n\to\mathbb R##, I would denote the partial derivative of g with respect to its second variable as ##g_{,2}##.

It's possible to show that the set
$$\bigg\{\frac{\partial}{\partial x^1}\bigg|_p,\dots,\frac{\partial}{\partial x^n}\bigg|_p\bigg\}$$ is a basis for ##T_pM##. I've seen the proof in a couple of different books. I posted a version of it in post #14 in this thread.

So the thing you're asking about is a tangent vector at p because it's a linear combination of tangent vectors at p.
 
Last edited:
  • Like
Likes Greg Bernhardt
  • #3
sunrah said:
The argument that I don't understand is that this

[itex]T^{a}\frac{d}{dx^{a}}[/itex]

is a vector. To me it looks like the inner product of two vectors, [itex]\vec{T} = (T_{x}, T_{y})[/itex] and ∇x,y, so looks like a scalar to me.

This is really just a convention, and if it bothers you, you don't have to adopt it, but you should understand it.

I think it's clear in terms of coordinates that if you have a parametrized path, [itex]x^\mu(s)[/itex], giving a path through space (or spacetime) as a function of a parameter [itex]s[/itex] (not necessarily time, nor even proper time, just a real-valued parameter that increases as you move down the path), then there is a corresponding vector [itex]U[/itex] whose components are given by [itex]U^\mu = \dfrac{dx^\mu}{ds}[/itex]. This vector is called the tangent vector to the path [itex]x^\mu(s)[/itex]. For each tangent vector [itex]U[/itex] with components [itex]U^\mu[/itex], there is a corresponding operator, the "directional derivative", that can be defined by: [itex](U \cdot \nabla) \phi = U^\mu \dfrac{d \phi}{dx^\mu}[/itex]

The above definitions are all in terms of components, which are specific to a coordinate system. Is there a coordinate-independent way to talk about directional derivatives, without mentioning components of the vector? Yes. Let [itex]\mathcal{P}(s)[/itex] be a smooth function from a real number [itex]s[/itex] to points in space (or spacetime). We define the directional derivative along path [itex]\mathcal{P}[/itex], [itex]\frac{d \mathcal{P}}{ds}[/itex], to be the operator [itex]\hat{U}[/itex] defined by:

For any scalar field [itex]\phi[/itex] (that is, function that assigns a real number to each point in space, or spacetime),

[itex]\hat{U}(\phi) = \dfrac{d\phi(\mathcal{P}(s))}{ds}[/itex]

This definition of "directional derivative" doesn't mention coordinates or components. It only mentions scalar fields and parametrized paths and derivatives of real-valued functions.

At this point, we note that there is a one-to-one correspondence between directional derivatives (which are operators) and tangent vectors (which are...I don't know...abstract objects that can be represented by column matrices that transform in some particular way under coordinate transformations). We have a coordinate-free definition of a directional derivative, and every tangent vector corresponds to a directional derivative, so there is really no reason not to identify the two: A tangent vector simply IS a directional derivative.

This way of looking at it flips the idea of what is fundamental. Instead of defining a vector in terms of components, and using components to define a directional derivative, we view the directional derivative as fundamental, and components to be a derived concept:

Pick a coordinate system. Then since a scalar field is any function from points in space [itex]\mathcal{P}[/itex] to real numbers, then a coordinate system is equivalent to a collection of four scalar fields [itex]X(\mathcal{P}), Y(\mathcal{P}), Z(\mathcal{P}), T(\mathcal{P})[/itex], where [itex]X(\mathcal{P})[/itex] gives the value of the x-coordinate at point [itex]\mathcal{P}[/itex], etc. Then we can define the components of a directional derivative [itex]\hat{U}[/itex] via:

[itex]U^x = \hat{U}(X)[/itex]
[itex]U^y = \hat{U}(Y)[/itex]
[itex]U^z = \hat{U}(Z)[/itex]
[itex]U^t = \hat{U}(T)[/itex]

or more compactly: [itex]U^\mu = \hat{U}(x^\mu)[/itex] (where [itex]x^\mu[/itex] is understood as the scalar field corresponding to that coordinate).

Once we identify directional derivatives with tangent vectors, we can pick out a set of basis vectors corresponding to a coordinate system as follows:

[itex]e_x = \frac{d}{dx}[/itex]
[itex]e_y = \frac{d}{dy}[/itex]
[itex]e_z = \frac{d}{dz}[/itex]
[itex]e_t = \frac{d}{dt}[/itex]

Those are the directional derivatives in the x, y, z, and t-directions. An arbitrary vector [itex]U[/itex] can be written as a linear combination of these basis vectors:

[itex]U = U^x e_x + U^y e_y + U^z e_z + U^t e_z = U^x \frac{d}{dx} + U^y \frac{d}{dy} + U^z \frac{d}{dz} + U^t \frac{d}{dt} = U^\mu \frac{d}{dx^\mu}[/itex]

So in the expression [itex]U^\mu \frac{d}{dx^\mu}[/itex], you're just creating a linear combination of basis vectors.
 
Last edited:
  • Like
Likes Nugatory and bcrowell
  • #4
stevendaryl said:
So in the expression [itex]U^\mu \frac{d}{dx^\mu}[/itex], you're just creating a linear combination of basis vectors.

Just another word in clarification: when someone writes [itex]U = U^\mu e_\mu[/itex], the right-hand side looks like a scalar product of a vector with components [itex]U^\mu[/itex] and another vector with components [itex]e_\mu[/itex]. But the index [itex]\mu[/itex] on [itex]e_\mu[/itex] doesn't indicate which component, it indicates which vector. [itex]e_x, e_y, e_z, e_t[/itex] are 4 different vectors, not 4 components of one vector.
 
  • #5
That's why I like the convention in which the mu-components on e are written with brackets () to indicate they label vectors, not just components. It's confusing otherwise.
 
  • #6
stevendaryl said:
Just another word in clarification: when someone writes [itex]U = U^\mu e_\mu[/itex], the right-hand side looks like a scalar product of a vector with components [itex]U^\mu[/itex] and another vector with components [itex]e_\mu[/itex]. But the index [itex]\mu[/itex] on [itex]e_\mu[/itex] doesn't indicate which component, it indicates which vector. [itex]e_x, e_y, e_z, e_t[/itex] are 4 different vectors, not 4 components of one vector.

I usually use the following notation, which I think is helpful. ##e_\mu## is a vector, the components of that vector are ##(e_\mu)^\nu##, where ##\mu## picks out "which vector" and ##\nu## picks out "which component".
 
  • Like
Likes stevendaryl
  • #7
Thanks everyone! You've been really helpful! So I still need to convince myself that [itex]T^{a}\frac{d}{dx^{a}}[/itex] is really a vector, but that just means going through the proof. I understand now that the [itex]\frac{d}{dx^{a}}[/itex] form a basis which spans the tangent space, but my problem was that it wasn't so obvious, having always viewed [itex]\frac{d}{dx^{a}}[/itex] as the coefficient of some vector-like operator, e.g. ∇.
 

1. What is a vector in general?

A vector in general is a mathematical object that represents both magnitude and direction. It can be represented using coordinates, and is typically used to describe physical quantities such as velocity, force, and acceleration.

2. What is the difference between a vector and a scalar?

A vector has both magnitude and direction, while a scalar only has magnitude. This means that a vector can be added, subtracted, and multiplied by another vector, while a scalar can only be multiplied by a vector.

3. What is a tangent space?

A tangent space is a mathematical concept used in differential geometry and multivariate calculus. It is a vector space that approximates the behavior of a curved surface at a specific point, and is used to study the behavior of functions on that surface.

4. How is a tangent space related to a vector space?

A tangent space is a type of vector space, specifically a vector space that is tangent to a curved surface. It is similar to a regular vector space in that it has a basis and can be operated on by linear transformations, but it also has additional properties that make it useful for studying curved surfaces.

5. What is the importance of tangent spaces in physics?

Tangent spaces are important in physics because they allow us to study the behavior of physical quantities on curved surfaces, such as trajectories of objects moving on a curved path. They are also used in general relativity to describe the curvature of spacetime and the behavior of objects in a gravitational field.

Similar threads

  • Special and General Relativity
Replies
20
Views
1K
  • Special and General Relativity
Replies
7
Views
1K
  • Special and General Relativity
2
Replies
38
Views
4K
  • Special and General Relativity
Replies
11
Views
406
  • Special and General Relativity
Replies
6
Views
1K
  • Special and General Relativity
Replies
12
Views
2K
  • Special and General Relativity
Replies
27
Views
4K
  • Special and General Relativity
Replies
11
Views
1K
  • Special and General Relativity
Replies
13
Views
2K
Replies
1
Views
851
Back
Top