Coordinate charts and change of basis

I think it is clear what you mean and I could just use the notation in your first post.In summary, the conversation discusses the use of the chain rule in proving a coordinate independent expression for the components of a vector in the tangent space of a manifold. The conversation also explores the relationship between different coordinate systems and their corresponding bases in the tangent space.
  • #1
"Don't panic!"
601
8
So I know that this involves using the chain rule, but is the following attempt at a proof correct.

Let [itex]M[/itex] be an [itex]n[/itex]-dimensional manifold and let [itex](U,\phi)[/itex] and [itex](V,\psi)[/itex] be two overlapping coordinate charts (i.e. [itex]U\cap V\neq\emptyset[/itex]), with [itex]U,V\subset M[/itex], covering a neighbourhood of [itex]p\in M[/itex], such that [itex]p\in U\cap V[/itex]. Consider a function [itex]f:M\rightarrow\mathbb{R}[/itex], and let [itex]x=\phi(p)[/itex], [itex]y=\psi(p)[/itex]. It follows then that $$\frac{\partial f}{\partial x^{\mu}}(p)=\frac{\partial}{\partial x^{\mu}}\left((f\circ\phi^{-1})(\phi(p))\right)=\frac{\partial}{\partial x^{\mu}}\left[(f\circ\psi^{-1})\left((\psi\circ\phi^{-1})(\phi(p))\right)\right]\\ \qquad \quad\; =\frac{\partial}{\partial y^{\nu}}\left[(f\circ\psi^{-1})\left((\psi\circ\phi^{-1})(\phi(p))\right)\right]\frac{\partial}{\partial x^{\mu}}\left[\left((\psi\circ\phi^{-1})^{\nu}(\phi(p))\right)\right]\\ =\frac{\partial f}{\partial y^{\nu}}(p)\frac{\partial y^{\nu}}{\partial x^{\mu}}(p)\qquad\qquad\qquad\qquad\qquad\qquad\qquad\quad\,$$ where [itex]y(x)=(\psi\circ\phi^{-1})(\phi(p))[/itex].

Hence, as [itex]f[/itex] is an arbitrary differentiable function, we conclude that $$\frac{\partial }{\partial x^{\mu}}=\frac{\partial y^{\nu}}{\partial x^{\mu}}\frac{\partial }{\partial y^{\nu}}$$ From this, we note that as [itex]\lbrace\frac{\partial }{\partial x^{\mu}}\rbrace[/itex] and [itex]\lbrace\frac{\partial }{\partial y^{\nu}}\rbrace[/itex] are two coordinate bases for the tangent space [itex]T_{p}M[/itex] at the point [itex]p[/itex], the two bases must be related by the formula above.
 
Physics news on Phys.org
  • #2
Looks fine to me. Note that ##(\psi \circ \phi^{-1})(\phi(p)) = \psi(\phi^{-1}(\phi(p))) = \psi(p)##. It is a bit shorter to write ...
 
  • #3
Cheers for taking a look. Yeah, apologies for the explicitness, just wrote it out in full to make sure that I was understanding it correctly and show that the transition functions behave as the "new" coordinates as functions of the "old" coordinates...
 
  • #4
So, is my intuition correct in the following analysis.

In the intersection of two coordinate charts (with coordinates [itex]x[/itex] and [itex]y[/itex] for simplicity) we can equally well describe functions (0-forms) in terms of the [itex]x[/itex]-coordinates or the [itex]y[/itex]-coordinates. Similarly, the tangent space to a point [itex]p[/itex] in the intersection can equally be described in terms of the coordinate basis [itex]\frac{\partial}{\partial x^{\mu}}[/itex] induced by the [itex]x[/itex] coordinate chart, or in terms of the coordinate basis [itex]\frac{\partial}{\partial y^{\nu}}[/itex] induced by the [itex]y[/itex] coordinate chart. As such, we have a situation in which a given function [itex]f[/itex] could be described in terms of the [itex]x[/itex] or the [itex]y[/itex] coordinate chart at a given point in the intersection, thus we must express one as a function of the other in order for the function to be coordinate independent. Similarly, either of the coordinate bases [itex]\frac{\partial}{\partial x^{\mu}}[/itex] and [itex]\frac{\partial}{\partial y^{\nu}}[/itex] could be used to describe a tangent vector [itex]X[/itex] in the tangent space at that point. Again this tangent vector must be coordinate independent, and so, in this intersection we must have that [tex]X[f]=X^{\mu}\frac{\partial f(y)}{\partial x^{\mu}}=X^{\mu}\frac{\partial f(y(x))}{\partial x^{\mu}}=X^{\mu}\frac{\partial y^{\nu}(x)}{\partial x^{\mu}}\frac{\partial f(y)}{\partial y^{\nu}}=\tilde{X}^{\nu}\frac{\partial f(y)}{\partial y^{\nu}}[/tex] where [itex]X^{\mu}[/itex] and [itex]\tilde{X}^{\mu}[/itex] are the components of [itex]X[/itex] with respect to the two bases [itex]\frac{\partial}{\partial x^{\mu}}[/itex] and [itex]\frac{\partial}{\partial y^{\nu}}[/itex] respectively. Hence, this implies that the components of the vector transform as [tex]\tilde{X}^{\nu}=\frac{\partial y^{\nu}(x)}{\partial x^{\mu}}X^{\nu}.[/tex] Would this be a correct understanding at all?
 
Last edited:
  • #5
Just one detail, the ##X^\mu## are the components of the vector (in the appropriate basis), not coordinates.
 
  • #6
Orodruin said:
Just one detail, the X^\mu are the components of the vector (in the appropriate basis), not coordinates.

Sorry, that's what I'd meant (typed coordinates by mistake). Have corrected the post now.

so otherwise, have I understood the notion correctly?
 
  • #7
The notation in post #1 is ambiguous and very confusing.

"Don't panic!" said:
$$\frac{\partial f}{\partial x^{\mu}}(p)$$
Here ##\frac{\partial}{\partial x^\mu}## apparently denotes the ##\mu##th partial derivative with respect to the coordinate system ##\phi##. Why a notation that involves ##x## instead of one that involves ##\phi##? I don't think the fact that you chose to denote ##\phi(p)## by ##x## automatically makes this notation appropriate. Consider e.g. the situation where ##\phi(p)=\psi(p)## (for the specific p that we're considering). Then your notation suggests that ##\frac{\partial}{\partial x^\mu}=\frac{\partial}{\partial y^\mu}##.

"Don't panic!" said:
$$\frac{\partial}{\partial x^{\mu}}\left((f\circ\phi^{-1})(\phi(p))\right)$$
I can't tell what the intention is here. Are you using the definition of the previous expression, or are you just inserting ##\phi^{-1}\circ\phi## in the middle? If it's the latter, then this step doesn't take you closer to where you want to go. If it's the former, then the notation doesn't make sense. The notational issues only get bigger as get further into the calculation.

I would use a notation like this:

\begin{align*}
&\frac{\partial}{\partial\phi^\mu}\bigg|_p f =(f\circ\phi^{-1})_{,\mu}(\phi(p)) = (f\circ\psi^{-1}\circ\psi\circ \phi^{-1})_{,\mu}(\phi(p)) =(f\circ\psi^{-1})_{,\nu}\big((\psi\circ\phi^{-1})(\phi(p))\big) (\psi\circ\phi^{-1})^\nu{}_{,\mu}(\phi(p))\\
&=(f\circ\psi^{-1})_{,\nu}(\psi(p)) (\psi^\nu\circ\phi^{-1})_{,\mu}(\phi(p)) = \bigg(\frac{\partial}{\partial\phi^\mu}\bigg|_p \psi^\nu\bigg) \frac{\partial}{\partial\psi^\nu}\bigg|_p f.
\end{align*}
Edit: Actually, as you have seen in some of my posts before, I like to call the coordinate systems x and y instead of ##\phi## and ##\psi##, and I like to use Latin indices instead of Greek, to save myself some typing. So I'd write this result as
$$\frac{\partial}{\partial y^i}\bigg|_p f =\bigg(\frac{\partial}{\partial y^i}\bigg|_p x^j\bigg) \frac{\partial}{\partial x^j}\bigg|_p f.$$
 
Last edited:
  • #8
Ok, thanks Fredrik, sorry for the notational issues.

Would the intuitive description behind why this is so be correct at all? (Quoted below from one of my previous posts)

"Don't panic!" said:
In the intersection of two coordinate charts (with coordinates xx and yy for simplicity) we can equally well describe functions (0-forms) in terms of the xx-coordinates or the yy-coordinates. Similarly, the tangent space to a point pp in the intersection can equally be described in terms of the coordinate basis ∂∂xμ\frac{\partial}{\partial x^{\mu}} induced by the xx coordinate chart, or in terms of the coordinate basis ∂∂yν\frac{\partial}{\partial y^{\nu}} induced by the yy coordinate chart. As such, we have a situation in which a given function ff could be described in terms of the xx or the yy coordinate chart at a given point in the intersection, thus we must express one as a function of the other in order for the function to be coordinate independent. Similarly, either of the coordinate bases ∂∂xμ\frac{\partial}{\partial x^{\mu}} and ∂∂yν\frac{\partial}{\partial y^{\nu}} could be used to describe a tangent vector XX in the tangent space at that point. Again this tangent vector must be coordinate independent, and so, in this intersection we must have that
X[f]=Xμ∂f(y)∂xμ=Xμ∂f(y(x))∂xμ=Xμ∂yν(x)∂xμ∂f(y)∂yν=X~ν∂f(y)∂yν​
X[f]=X^{\mu}\frac{\partial f(y)}{\partial x^{\mu}}=X^{\mu}\frac{\partial f(y(x))}{\partial x^{\mu}}=X^{\mu}\frac{\partial y^{\nu}(x)}{\partial x^{\mu}}\frac{\partial f(y)}{\partial y^{\nu}}=\tilde{X}^{\nu}\frac{\partial f(y)}{\partial y^{\nu}} where XμX^{\mu} and X~μ\tilde{X}^{\mu} are the components of XX with respect to the two bases ∂∂xμ\frac{\partial}{\partial x^{\mu}} and ∂∂yν\frac{\partial}{\partial y^{\nu}} respectively. Hence, this implies that the components of the vector transform as
X~ν=∂yν(x)∂xμXν.​
 
  • #9
"Don't panic!" said:
Ok, thanks Fredrik, sorry for the notational issues.

Would the intuitive description behind why this is so be correct at all? (Quoted below from one of my previous posts)
The sentence that involves the words "one as a function of the other" is a bit odd, and the notation y(x) is too. But you found the correct formula for how the components of a vector transforms under the change of coordinates ##x\to y##.
 
Last edited:
  • #10
So is the point that in the coordinate chart overlap one represent a function in terms of either set of coordinates and then transition between these two descriptions via transition functions. Then if we use the coordinate basis [itex]\lbrace\frac{\partial}{\partial x^{\mu}}\rbrace[/itex] induced by one coordinate system [itex]\phi[/itex] to act on a function [itex]f\circ\psi^{-1}[/itex] that is represented in the other coordinate system [itex]\psi[/itex], then we must use transition functions [itex]\psi\circ\phi^{-1}[/itex] so that we can relate this coordinate basis to the coordinate basis [itex]\lbrace\frac{\partial}{\partial y^{\nu}}\rbrace[/itex] induced by the coordinate system [itex]\psi[/itex] that the function is represented in?
 
  • #11
##\frac{\partial}{\partial x^\mu}\big|_p## acts on ##f##, not on ##f\circ\psi^{-1}##.
 
  • #12
Fredrik said:
∂∂xμ∣∣p\frac{\partial}{\partial x^\mu}\big|_p acts on ff, not on f∘ψ−1f\circ\psi^{-1}.

I thought one needed to introduce a coordinate chart before doing calculus though? That is, don't we need to have [itex]f\circ\psi^{-1}:\mathbb{R}^{n}\rightarrow\mathbb{R}[/itex]?
 
  • #13
"Don't panic!" said:
I thought one needed to introduce a coordinate chart before doing calculus though? That is, don't we need to have [itex]f\circ\psi^{-1}:\mathbb{R}^{n}\rightarrow\mathbb{R}[/itex]?
That's correct. That's why we define ##\frac{\partial}{\partial x^\mu}\big|_p## by
$$\frac{\partial}{\partial x^\mu}\bigg|_p f=(f\circ\phi^{-1})_{,\mu}(\phi(p))$$ for all smooth ##f:M\to\mathbb R##.

Since the ##\frac{\partial}{\partial x^\mu}\big|_p## defined this way is the partial derivative functional associated with the point p and the coordinate system ##\phi##, I would prefer to denote it by ##\frac{\partial}{\partial\phi^\mu}\big|_p##. Another option is to denote the cordinate system by ##x## instead of ##\phi##.
 
Last edited:
  • #14
Ah ok. I think I was really trying to justify why one needs to do the whole procedure in the first place? Is the point that in the coordinate chart overlap we have two different way of representing the same object and so we require a way of relating these to representations (achieved via the appropriate change of basis between the two coordinate bases)?
 
  • #15
I guess you could say that (if the last "to" is a typo, supposed to be "two"), since the n-tuple of components of a vector v with respect to the ordered basis associated with a particular coordinate system can be thought of as a representation of the vector v. Change the coordinate system, and you change the ordered basis, which changes the components.
 
  • #16
Fredrik said:
I guess you could say that (if the last "to" is a typo, supposed to be "two"), since the n-tuple of components of a vector v with respect to the ordered basis associated with a particular coordinate system can be thought of as a representation of the vector v. Change the coordinate system, and you change the ordered basis, which changes the components.

Yes sorry it was meant to be "two" and not "to".

Could one also derive the change of coordinate basis by considering to coordinate systems [itex]x[/itex] and [itex]y[/itex] and noting that as [itex]\lbrace\frac{\partial}{\partial x^{i}}\rbrace[/itex] and [itex]\lbrace\frac{\partial}{\partial y^{j}}\rbrace[/itex] are bases for the same tangent space we can express each basis vector [itex]\frac{\partial}{\partial x^{i}}[/itex] as a linear combination of the basis vectors [itex]\lbrace\frac{\partial}{\partial y^{j}}\rbrace[/itex] such that [tex]\frac{\partial}{\partial x^{i}}=A^{i}_{j}\frac{\partial}{\partial y^{j}}[/tex] Then if we act on the [itex]j[/itex]th coordinate function [itex]y^{j}[/itex] (of the [itex]y[/itex] coordinate system) we find that [tex]\frac{\partial y^{j}}{\partial x^{i}}=A^{i}_{j}[/tex] as required. Alternatively, we can equally express each basis vector [itex]\frac{\partial}{\partial y^{j}}[/itex] as a linear combination of the basis vectors [itex]\lbrace\frac{\partial}{\partial x^{i}}\rbrace[/itex] such that [tex]\frac{\partial}{\partial y^{j}}=\tilde{A}^{i}_{j}\frac{\partial}{\partial x^{i}}[/tex] Then, acting on the [itex]i[/itex]th coordinate function [itex]x^{i}[/itex] (of the [itex]x[/itex] coordinate system) we have that [tex]\frac{\partial x^{i}}{\partial y^{j}}=\tilde{A}^{i}_{j}.[/tex] We also note that this implies that [tex]A^{i}_{j}\tilde{A}^{j}_{k}=\delta^{i}_{k}[/tex] and so [itex]\tilde{A}^{i}_{j}=(A^{-1})^{i}_{j}[/itex]
 
  • #17
Yes, that's accurate, and is proved by using the definition of the ##\frac{\partial}{\partial x^\mu}\big|_p## notation and the chain rule, as discussed above.
 
  • #18
ok great, thanks.
 

What is a coordinate chart?

A coordinate chart is a mathematical tool used in geometry and topology to define a local coordinate system on a manifold. It is a function that maps points on the manifold to points in a coordinate space, typically a subset of Euclidean space. This allows for the calculation of coordinates and the measurement of distances and angles on the manifold.

How do coordinate charts relate to change of basis?

A change of basis is a transformation that converts coordinates from one basis to another. This is important when working with vector spaces, as different bases can represent the same vector in different ways. Coordinate charts are used to define a basis on a manifold, and a change of basis can be used to transform the coordinates of a point on the manifold from one chart to another.

What is the purpose of using multiple coordinate charts?

Using multiple coordinate charts allows for a more comprehensive understanding of a manifold. By defining different local coordinate systems, different regions of the manifold can be studied in more detail. This is especially useful in understanding the global properties of a manifold, as well as in calculations involving curvature and other geometric properties.

How are coordinate charts chosen?

Coordinate charts are typically chosen to cover the entire manifold in a way that overlaps with other charts. This ensures that every point on the manifold can be described by at least one chart. In addition, the charts are usually chosen to be smooth, meaning that the transition functions between charts are differentiable. This allows for the use of calculus in working with the manifold.

What are the advantages of using coordinate charts?

Coordinate charts provide a powerful tool for studying and understanding manifolds. By defining local coordinate systems, they allow for the use of geometric and analytical methods, such as calculus, in working with these often complex mathematical objects. In addition, they provide a way to visualize and represent these abstract spaces, making them easier to work with and understand.

Similar threads

  • Differential Geometry
Replies
2
Views
594
  • Differential Geometry
Replies
6
Views
3K
  • Differential Geometry
Replies
1
Views
2K
Replies
4
Views
1K
  • Differential Geometry
Replies
12
Views
3K
Replies
9
Views
3K
  • Differential Geometry
Replies
20
Views
2K
Replies
5
Views
1K
  • Differential Geometry
Replies
7
Views
3K
  • Differential Geometry
Replies
3
Views
3K
Back
Top