Coordinate charts and change of basis

Click For Summary

Discussion Overview

The discussion centers on the relationship between coordinate charts and change of basis in the context of differentiable manifolds. Participants explore the implications of using different coordinate systems to describe functions and tangent vectors, focusing on the mathematical formulation of these transformations and the associated notational conventions.

Discussion Character

  • Technical explanation
  • Conceptual clarification
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant presents a proof involving the chain rule to show how partial derivatives transform between two overlapping coordinate charts.
  • Another participant confirms the proof's validity but suggests a more concise notation for a specific expression.
  • A different participant expresses their understanding of the independence of functions and tangent vectors from the choice of coordinate system, emphasizing the need for coordinate transformations.
  • Concerns are raised about the clarity and appropriateness of the notation used in the initial proof, particularly regarding the representation of partial derivatives.
  • One participant proposes an alternative notation that they find clearer, indicating a preference for using specific symbols and indices.
  • Another participant seeks confirmation of their intuitive understanding of the transformation of tangent vector components between coordinate bases.

Areas of Agreement / Disagreement

Participants generally agree on the importance of understanding how functions and tangent vectors relate across different coordinate systems. However, there is disagreement regarding the clarity and appropriateness of the notation used, with some participants finding it confusing.

Contextual Notes

Notational issues are highlighted, particularly concerning the representation of partial derivatives and the choice of symbols. There is also a discussion about the implications of coordinate independence and the transformation of vector components.

"Don't panic!"
Messages
600
Reaction score
8
So I know that this involves using the chain rule, but is the following attempt at a proof correct.

Let M be an n-dimensional manifold and let (U,\phi) and (V,\psi) be two overlapping coordinate charts (i.e. U\cap V\neq\emptyset), with U,V\subset M, covering a neighbourhood of p\in M, such that p\in U\cap V. Consider a function f:M\rightarrow\mathbb{R}, and let x=\phi(p), y=\psi(p). It follows then that $$\frac{\partial f}{\partial x^{\mu}}(p)=\frac{\partial}{\partial x^{\mu}}\left((f\circ\phi^{-1})(\phi(p))\right)=\frac{\partial}{\partial x^{\mu}}\left[(f\circ\psi^{-1})\left((\psi\circ\phi^{-1})(\phi(p))\right)\right]\\ \qquad \quad\; =\frac{\partial}{\partial y^{\nu}}\left[(f\circ\psi^{-1})\left((\psi\circ\phi^{-1})(\phi(p))\right)\right]\frac{\partial}{\partial x^{\mu}}\left[\left((\psi\circ\phi^{-1})^{\nu}(\phi(p))\right)\right]\\ =\frac{\partial f}{\partial y^{\nu}}(p)\frac{\partial y^{\nu}}{\partial x^{\mu}}(p)\qquad\qquad\qquad\qquad\qquad\qquad\qquad\quad\,$$ where y(x)=(\psi\circ\phi^{-1})(\phi(p)).

Hence, as f is an arbitrary differentiable function, we conclude that $$\frac{\partial }{\partial x^{\mu}}=\frac{\partial y^{\nu}}{\partial x^{\mu}}\frac{\partial }{\partial y^{\nu}}$$ From this, we note that as \lbrace\frac{\partial }{\partial x^{\mu}}\rbrace and \lbrace\frac{\partial }{\partial y^{\nu}}\rbrace are two coordinate bases for the tangent space T_{p}M at the point p, the two bases must be related by the formula above.
 
Physics news on Phys.org
Looks fine to me. Note that ##(\psi \circ \phi^{-1})(\phi(p)) = \psi(\phi^{-1}(\phi(p))) = \psi(p)##. It is a bit shorter to write ...
 
Cheers for taking a look. Yeah, apologies for the explicitness, just wrote it out in full to make sure that I was understanding it correctly and show that the transition functions behave as the "new" coordinates as functions of the "old" coordinates...
 
So, is my intuition correct in the following analysis.

In the intersection of two coordinate charts (with coordinates x and y for simplicity) we can equally well describe functions (0-forms) in terms of the x-coordinates or the y-coordinates. Similarly, the tangent space to a point p in the intersection can equally be described in terms of the coordinate basis \frac{\partial}{\partial x^{\mu}} induced by the x coordinate chart, or in terms of the coordinate basis \frac{\partial}{\partial y^{\nu}} induced by the y coordinate chart. As such, we have a situation in which a given function f could be described in terms of the x or the y coordinate chart at a given point in the intersection, thus we must express one as a function of the other in order for the function to be coordinate independent. Similarly, either of the coordinate bases \frac{\partial}{\partial x^{\mu}} and \frac{\partial}{\partial y^{\nu}} could be used to describe a tangent vector X in the tangent space at that point. Again this tangent vector must be coordinate independent, and so, in this intersection we must have that X[f]=X^{\mu}\frac{\partial f(y)}{\partial x^{\mu}}=X^{\mu}\frac{\partial f(y(x))}{\partial x^{\mu}}=X^{\mu}\frac{\partial y^{\nu}(x)}{\partial x^{\mu}}\frac{\partial f(y)}{\partial y^{\nu}}=\tilde{X}^{\nu}\frac{\partial f(y)}{\partial y^{\nu}} where X^{\mu} and \tilde{X}^{\mu} are the components of X with respect to the two bases \frac{\partial}{\partial x^{\mu}} and \frac{\partial}{\partial y^{\nu}} respectively. Hence, this implies that the components of the vector transform as \tilde{X}^{\nu}=\frac{\partial y^{\nu}(x)}{\partial x^{\mu}}X^{\nu}. Would this be a correct understanding at all?
 
Last edited:
Just one detail, the ##X^\mu## are the components of the vector (in the appropriate basis), not coordinates.
 
Orodruin said:
Just one detail, the X^\mu are the components of the vector (in the appropriate basis), not coordinates.

Sorry, that's what I'd meant (typed coordinates by mistake). Have corrected the post now.

so otherwise, have I understood the notion correctly?
 
The notation in post #1 is ambiguous and very confusing.

"Don't panic!" said:
$$\frac{\partial f}{\partial x^{\mu}}(p)$$
Here ##\frac{\partial}{\partial x^\mu}## apparently denotes the ##\mu##th partial derivative with respect to the coordinate system ##\phi##. Why a notation that involves ##x## instead of one that involves ##\phi##? I don't think the fact that you chose to denote ##\phi(p)## by ##x## automatically makes this notation appropriate. Consider e.g. the situation where ##\phi(p)=\psi(p)## (for the specific p that we're considering). Then your notation suggests that ##\frac{\partial}{\partial x^\mu}=\frac{\partial}{\partial y^\mu}##.

"Don't panic!" said:
$$\frac{\partial}{\partial x^{\mu}}\left((f\circ\phi^{-1})(\phi(p))\right)$$
I can't tell what the intention is here. Are you using the definition of the previous expression, or are you just inserting ##\phi^{-1}\circ\phi## in the middle? If it's the latter, then this step doesn't take you closer to where you want to go. If it's the former, then the notation doesn't make sense. The notational issues only get bigger as get further into the calculation.

I would use a notation like this:

\begin{align*}
&\frac{\partial}{\partial\phi^\mu}\bigg|_p f =(f\circ\phi^{-1})_{,\mu}(\phi(p)) = (f\circ\psi^{-1}\circ\psi\circ \phi^{-1})_{,\mu}(\phi(p)) =(f\circ\psi^{-1})_{,\nu}\big((\psi\circ\phi^{-1})(\phi(p))\big) (\psi\circ\phi^{-1})^\nu{}_{,\mu}(\phi(p))\\
&=(f\circ\psi^{-1})_{,\nu}(\psi(p)) (\psi^\nu\circ\phi^{-1})_{,\mu}(\phi(p)) = \bigg(\frac{\partial}{\partial\phi^\mu}\bigg|_p \psi^\nu\bigg) \frac{\partial}{\partial\psi^\nu}\bigg|_p f.
\end{align*}
Edit: Actually, as you have seen in some of my posts before, I like to call the coordinate systems x and y instead of ##\phi## and ##\psi##, and I like to use Latin indices instead of Greek, to save myself some typing. So I'd write this result as
$$\frac{\partial}{\partial y^i}\bigg|_p f =\bigg(\frac{\partial}{\partial y^i}\bigg|_p x^j\bigg) \frac{\partial}{\partial x^j}\bigg|_p f.$$
 
Last edited:
Ok, thanks Fredrik, sorry for the notational issues.

Would the intuitive description behind why this is so be correct at all? (Quoted below from one of my previous posts)

"Don't panic!" said:
In the intersection of two coordinate charts (with coordinates xx and yy for simplicity) we can equally well describe functions (0-forms) in terms of the xx-coordinates or the yy-coordinates. Similarly, the tangent space to a point pp in the intersection can equally be described in terms of the coordinate basis ∂∂xμ\frac{\partial}{\partial x^{\mu}} induced by the xx coordinate chart, or in terms of the coordinate basis ∂∂yν\frac{\partial}{\partial y^{\nu}} induced by the yy coordinate chart. As such, we have a situation in which a given function ff could be described in terms of the xx or the yy coordinate chart at a given point in the intersection, thus we must express one as a function of the other in order for the function to be coordinate independent. Similarly, either of the coordinate bases ∂∂xμ\frac{\partial}{\partial x^{\mu}} and ∂∂yν\frac{\partial}{\partial y^{\nu}} could be used to describe a tangent vector XX in the tangent space at that point. Again this tangent vector must be coordinate independent, and so, in this intersection we must have that
X[f]=Xμ∂f(y)∂xμ=Xμ∂f(y(x))∂xμ=Xμ∂yν(x)∂xμ∂f(y)∂yν=X~ν∂f(y)∂yν​
X[f]=X^{\mu}\frac{\partial f(y)}{\partial x^{\mu}}=X^{\mu}\frac{\partial f(y(x))}{\partial x^{\mu}}=X^{\mu}\frac{\partial y^{\nu}(x)}{\partial x^{\mu}}\frac{\partial f(y)}{\partial y^{\nu}}=\tilde{X}^{\nu}\frac{\partial f(y)}{\partial y^{\nu}} where XμX^{\mu} and X~μ\tilde{X}^{\mu} are the components of XX with respect to the two bases ∂∂xμ\frac{\partial}{\partial x^{\mu}} and ∂∂yν\frac{\partial}{\partial y^{\nu}} respectively. Hence, this implies that the components of the vector transform as
X~ν=∂yν(x)∂xμXν.​
 
"Don't panic!" said:
Ok, thanks Fredrik, sorry for the notational issues.

Would the intuitive description behind why this is so be correct at all? (Quoted below from one of my previous posts)
The sentence that involves the words "one as a function of the other" is a bit odd, and the notation y(x) is too. But you found the correct formula for how the components of a vector transforms under the change of coordinates ##x\to y##.
 
Last edited:
  • #10
So is the point that in the coordinate chart overlap one represent a function in terms of either set of coordinates and then transition between these two descriptions via transition functions. Then if we use the coordinate basis \lbrace\frac{\partial}{\partial x^{\mu}}\rbrace induced by one coordinate system \phi to act on a function f\circ\psi^{-1} that is represented in the other coordinate system \psi, then we must use transition functions \psi\circ\phi^{-1} so that we can relate this coordinate basis to the coordinate basis \lbrace\frac{\partial}{\partial y^{\nu}}\rbrace induced by the coordinate system \psi that the function is represented in?
 
  • #11
##\frac{\partial}{\partial x^\mu}\big|_p## acts on ##f##, not on ##f\circ\psi^{-1}##.
 
  • #12
Fredrik said:
∂∂xμ∣∣p\frac{\partial}{\partial x^\mu}\big|_p acts on ff, not on f∘ψ−1f\circ\psi^{-1}.

I thought one needed to introduce a coordinate chart before doing calculus though? That is, don't we need to have f\circ\psi^{-1}:\mathbb{R}^{n}\rightarrow\mathbb{R}?
 
  • #13
"Don't panic!" said:
I thought one needed to introduce a coordinate chart before doing calculus though? That is, don't we need to have f\circ\psi^{-1}:\mathbb{R}^{n}\rightarrow\mathbb{R}?
That's correct. That's why we define ##\frac{\partial}{\partial x^\mu}\big|_p## by
$$\frac{\partial}{\partial x^\mu}\bigg|_p f=(f\circ\phi^{-1})_{,\mu}(\phi(p))$$ for all smooth ##f:M\to\mathbb R##.

Since the ##\frac{\partial}{\partial x^\mu}\big|_p## defined this way is the partial derivative functional associated with the point p and the coordinate system ##\phi##, I would prefer to denote it by ##\frac{\partial}{\partial\phi^\mu}\big|_p##. Another option is to denote the cordinate system by ##x## instead of ##\phi##.
 
Last edited:
  • #14
Ah ok. I think I was really trying to justify why one needs to do the whole procedure in the first place? Is the point that in the coordinate chart overlap we have two different way of representing the same object and so we require a way of relating these to representations (achieved via the appropriate change of basis between the two coordinate bases)?
 
  • #15
I guess you could say that (if the last "to" is a typo, supposed to be "two"), since the n-tuple of components of a vector v with respect to the ordered basis associated with a particular coordinate system can be thought of as a representation of the vector v. Change the coordinate system, and you change the ordered basis, which changes the components.
 
  • #16
Fredrik said:
I guess you could say that (if the last "to" is a typo, supposed to be "two"), since the n-tuple of components of a vector v with respect to the ordered basis associated with a particular coordinate system can be thought of as a representation of the vector v. Change the coordinate system, and you change the ordered basis, which changes the components.

Yes sorry it was meant to be "two" and not "to".

Could one also derive the change of coordinate basis by considering to coordinate systems x and y and noting that as \lbrace\frac{\partial}{\partial x^{i}}\rbrace and \lbrace\frac{\partial}{\partial y^{j}}\rbrace are bases for the same tangent space we can express each basis vector \frac{\partial}{\partial x^{i}} as a linear combination of the basis vectors \lbrace\frac{\partial}{\partial y^{j}}\rbrace such that \frac{\partial}{\partial x^{i}}=A^{i}_{j}\frac{\partial}{\partial y^{j}} Then if we act on the jth coordinate function y^{j} (of the y coordinate system) we find that \frac{\partial y^{j}}{\partial x^{i}}=A^{i}_{j} as required. Alternatively, we can equally express each basis vector \frac{\partial}{\partial y^{j}} as a linear combination of the basis vectors \lbrace\frac{\partial}{\partial x^{i}}\rbrace such that \frac{\partial}{\partial y^{j}}=\tilde{A}^{i}_{j}\frac{\partial}{\partial x^{i}} Then, acting on the ith coordinate function x^{i} (of the x coordinate system) we have that \frac{\partial x^{i}}{\partial y^{j}}=\tilde{A}^{i}_{j}. We also note that this implies that A^{i}_{j}\tilde{A}^{j}_{k}=\delta^{i}_{k} and so \tilde{A}^{i}_{j}=(A^{-1})^{i}_{j}
 
  • #17
Yes, that's accurate, and is proved by using the definition of the ##\frac{\partial}{\partial x^\mu}\big|_p## notation and the chain rule, as discussed above.
 
  • #18
ok great, thanks.
 

Similar threads

  • · Replies 6 ·
Replies
6
Views
5K
  • · Replies 0 ·
Replies
0
Views
2K
  • · Replies 36 ·
2
Replies
36
Views
6K
  • · Replies 1 ·
Replies
1
Views
4K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 12 ·
Replies
12
Views
5K
  • · Replies 15 ·
Replies
15
Views
3K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 9 ·
Replies
9
Views
4K
  • · Replies 0 ·
Replies
0
Views
2K