Coordinate charts and change of basis

"Don't panic!" · Jun 3, 2015

So I know that this involves using the chain rule, but is the following attempt at a proof correct?

Let [itex]M[/itex] be an [itex]n[/itex]-dimensional manifold and let [itex](U,\phi)[/itex] and [itex](V,\psi)[/itex] be two overlapping coordinate charts (i.e. [itex]U\cap V\neq\emptyset[/itex]), with [itex]U,V\subset M[/itex], covering a neighbourhood of [itex]p\in M[/itex], such that [itex]p\in U\cap V[/itex]. Consider a function [itex]f:M\rightarrow\mathbb{R}[/itex], and let [itex]x=\phi(p)[/itex], [itex]y=\psi(p)[/itex]. It follows then that $$\frac{\partial f}{\partial x^{\mu}}(p)=\frac{\partial}{\partial x^{\mu}}\left((f\circ\phi^{-1})(\phi(p))\right)=\frac{\partial}{\partial x^{\mu}}\left[(f\circ\psi^{-1})\left((\psi\circ\phi^{-1})(\phi(p))\right)\right]\\ \qquad \quad\; =\frac{\partial}{\partial y^{\nu}}\left[(f\circ\psi^{-1})\left((\psi\circ\phi^{-1})(\phi(p))\right)\right]\frac{\partial}{\partial x^{\mu}}\left[\left((\psi\circ\phi^{-1})^{\nu}(\phi(p))\right)\right]\\ =\frac{\partial f}{\partial y^{\nu}}(p)\frac{\partial y^{\nu}}{\partial x^{\mu}}(p)\qquad\qquad\qquad\qquad\qquad\qquad\qquad\quad\,$$ where [itex]y(x)=(\psi\circ\phi^{-1})(\phi(p))[/itex].

Hence, as [itex]f[/itex] is an arbitrary differentiable function, we conclude that $$\frac{\partial }{\partial x^{\mu}}=\frac{\partial y^{\nu}}{\partial x^{\mu}}\frac{\partial }{\partial y^{\nu}}$$ From this, we note that as [itex]\lbrace\frac{\partial }{\partial x^{\mu}}\rbrace[/itex] and [itex]\lbrace\frac{\partial }{\partial y^{\nu}}\rbrace[/itex] are two coordinate bases for the tangent space [itex]T_{p}M[/itex] at the point [itex]p[/itex], the two bases must be related by the formula above.
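As a quick numerical sanity check of the chain-rule identity above, here is a minimal sketch. Everything concrete in it is an assumption made for illustration: the manifold is taken to be the plane minus the non-positive horizontal axis, [itex]\phi[/itex] is the Cartesian chart, [itex]\psi[/itex] is the polar chart, and the test function `f` and the point `p` are arbitrary choices. Partial derivatives are estimated with central differences, so agreement is only up to discretisation error.

```python
import math

# Hypothetical charts on the plane minus the non-positive horizontal axis.
def phi(p):            # Cartesian chart: x = phi(p)
    return p

def phi_inv(x):
    return x

def psi(p):            # polar chart: y = psi(p) = (r, theta)
    u, v = p
    return (math.hypot(u, v), math.atan2(v, u))

def psi_inv(y):
    r, theta = y
    return (r * math.cos(theta), r * math.sin(theta))

def partial(g, a, k, h=1e-6):
    """Central-difference estimate of the k-th partial derivative of g at a."""
    up, dn = list(a), list(a)
    up[k] += h
    dn[k] -= h
    return (g(tuple(up)) - g(tuple(dn))) / (2 * h)

def f(p):              # arbitrary smooth test function on M
    u, v = p
    return u * u * v + math.sin(v)

p = (1.0, 2.0)
x, y = phi(p), psi(p)

def f_in_x(a):         # f o phi^{-1}
    return f(phi_inv(a))

def f_in_y(b):         # f o psi^{-1}
    return f(psi_inv(b))

def transition(a):     # psi o phi^{-1}, i.e. y as a function of x
    return psi(phi_inv(a))

def lhs(mu):           # df/dx^mu at p
    return partial(f_in_x, x, mu)

def rhs(mu):           # (df/dy^nu)(dy^nu/dx^mu) at p, summed over nu
    return sum(
        partial(f_in_y, y, nu) * partial(lambda a: transition(a)[nu], x, mu)
        for nu in range(2)
    )

max_error = max(abs(lhs(mu) - rhs(mu)) for mu in range(2))
print(max_error)
```

The two sides should agree to within the finite-difference error for any smooth `f` and any `p` in the overlap of the two charts.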

Orodruin · Jun 8, 2015

Looks fine to me. Note that ##(\psi \circ \phi^{-1})(\phi(p)) = \psi(\phi^{-1}(\phi(p))) = \psi(p)##. It is a bit shorter to write ...

"Don't panic!" · Jun 8, 2015

Cheers for taking a look. Yeah, apologies for the explicitness, just wrote it out in full to make sure that I was understanding it correctly and show that the transition functions behave as the "new" coordinates as functions of the "old" coordinates...

"Don't panic!" · Jun 8, 2015

So, is my intuition correct in the following analysis?

In the intersection of two coordinate charts (with coordinates [itex]x[/itex] and [itex]y[/itex] for simplicity) we can equally well describe functions (0-forms) in terms of the [itex]x[/itex]-coordinates or the [itex]y[/itex]-coordinates. Similarly, the tangent space at a point [itex]p[/itex] in the intersection can equally be described in terms of the coordinate basis [itex]\frac{\partial}{\partial x^{\mu}}[/itex] induced by the [itex]x[/itex] coordinate chart, or in terms of the coordinate basis [itex]\frac{\partial}{\partial y^{\nu}}[/itex] induced by the [itex]y[/itex] coordinate chart. As such, we have a situation in which a given function [itex]f[/itex] could be described in terms of the [itex]x[/itex] or the [itex]y[/itex] coordinate chart at a given point in the intersection, thus we must express one as a function of the other in order for the function to be coordinate independent. Similarly, either of the coordinate bases [itex]\frac{\partial}{\partial x^{\mu}}[/itex] and [itex]\frac{\partial}{\partial y^{\nu}}[/itex] could be used to describe a tangent vector [itex]X[/itex] in the tangent space at that point. Again this tangent vector must be coordinate independent, and so, in this intersection we must have that [tex]X[f]=X^{\mu}\frac{\partial f(y)}{\partial x^{\mu}}=X^{\mu}\frac{\partial f(y(x))}{\partial x^{\mu}}=X^{\mu}\frac{\partial y^{\nu}(x)}{\partial x^{\mu}}\frac{\partial f(y)}{\partial y^{\nu}}=\tilde{X}^{\nu}\frac{\partial f(y)}{\partial y^{\nu}}[/tex] where [itex]X^{\mu}[/itex] and [itex]\tilde{X}^{\nu}[/itex] are the components of [itex]X[/itex] with respect to the two bases [itex]\frac{\partial}{\partial x^{\mu}}[/itex] and [itex]\frac{\partial}{\partial y^{\nu}}[/itex] respectively. Hence, this implies that the components of the vector transform as [tex]\tilde{X}^{\nu}=\frac{\partial y^{\nu}(x)}{\partial x^{\mu}}X^{\mu}.[/tex] Would this be a correct understanding at all?
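To make the component rule concrete, here is a small worked check, assuming (purely for illustration) that [itex]x[/itex] are Cartesian coordinates [itex](u,v)[/itex] on the plane, [itex]y[/itex] are polar coordinates [itex](r,\theta)[/itex], the function is [itex]f=uv^{2}=r^{3}\cos\theta\sin^{2}\theta[/itex], and the components of [itex]X[/itex] in the Cartesian basis are an arbitrary choice. It transforms the components with the Jacobian [itex]\frac{\partial y^{\nu}}{\partial x^{\mu}}[/itex] and compares [itex]X[f][/itex] computed in each basis.

```python
import math

# Point p in the chart overlap, in Cartesian coordinates (u, v).
u, v = 1.0, 2.0
r, theta = math.hypot(u, v), math.atan2(v, u)

# Jacobian J[nu][mu] = dy^nu/dx^mu with y = (r, theta) and x = (u, v).
J = [
    [u / r, v / r],
    [-v / r**2, u / r**2],
]

# Components of X in the Cartesian coordinate basis (arbitrary choice).
X = [3.0, -1.0]

# Components in the polar coordinate basis: X~^nu = (dy^nu/dx^mu) X^mu.
X_tilde = [sum(J[nu][mu] * X[mu] for mu in range(2)) for nu in range(2)]

# Test function f = u v^2 = r^3 cos(theta) sin(theta)^2, differentiated by hand.
df_dx = [v**2, 2 * u * v]
df_dy = [
    3 * r**2 * math.cos(theta) * math.sin(theta) ** 2,
    r**3 * (2 * math.cos(theta) ** 2 * math.sin(theta) - math.sin(theta) ** 3),
]

# X[f] evaluated in each basis; the two numbers should coincide.
Xf_cartesian = sum(X[mu] * df_dx[mu] for mu in range(2))
Xf_polar = sum(X_tilde[nu] * df_dy[nu] for nu in range(2))
print(Xf_cartesian, Xf_polar)
```

Both expressions evaluate to 8 at this point (up to rounding), which is what coordinate independence of [itex]X[f][/itex] requires.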

Orodruin · Jun 8, 2015

Just one detail, the ##X^\mu## are the components of the vector (in the appropriate basis), not coordinates.

"Don't panic!" · Jun 8, 2015

Orodruin said:

Just one detail, the ##X^\mu## are the components of the vector (in the appropriate basis), not coordinates.

Sorry, that's what I'd meant (typed coordinates by mistake). Have corrected the post now.

So otherwise, have I understood the notion correctly?

Fredrik · Jun 8, 2015

The notation in post #1 is ambiguous and very confusing.

"Don't panic!" said:

$$\frac{\partial f}{\partial x^{\mu}}(p)$$

Here ##\frac{\partial}{\partial x^\mu}## apparently denotes the ##\mu##th partial derivative with respect to the coordinate system ##\phi##. Why a notation that involves ##x## instead of one that involves ##\phi##? I don't think the fact that you chose to denote ##\phi(p)## by ##x## automatically makes this notation appropriate. Consider e.g. the situation where ##\phi(p)=\psi(p)## (for the specific p that we're considering). Then your notation suggests that ##\frac{\partial}{\partial x^\mu}=\frac{\partial}{\partial y^\mu}##.

"Don't panic!" said:

$$\frac{\partial}{\partial x^{\mu}}\left((f\circ\phi^{-1})(\phi(p))\right)$$

I can't tell what the intention is here. Are you using the definition of the previous expression, or are you just inserting ##\phi^{-1}\circ\phi## in the middle? If it's the latter, then this step doesn't take you closer to where you want to go. If it's the former, then the notation doesn't make sense. The notational issues only get bigger as we get further into the calculation.

I would use a notation like this:

\begin{align*}
&\frac{\partial}{\partial\phi^\mu}\bigg|_p f =(f\circ\phi^{-1})_{,\mu}(\phi(p)) = (f\circ\psi^{-1}\circ\psi\circ \phi^{-1})_{,\mu}(\phi(p)) =(f\circ\psi^{-1})_{,\nu}\big((\psi\circ\phi^{-1})(\phi(p))\big) (\psi\circ\phi^{-1})^\nu{}_{,\mu}(\phi(p))\\
&=(f\circ\psi^{-1})_{,\nu}(\psi(p)) (\psi^\nu\circ\phi^{-1})_{,\mu}(\phi(p)) = \bigg(\frac{\partial}{\partial\phi^\mu}\bigg|_p \psi^\nu\bigg) \frac{\partial}{\partial\psi^\nu}\bigg|_p f.
\end{align*}
Edit: Actually, as you have seen in some of my posts before, I like to call the coordinate systems x and y instead of ##\phi## and ##\psi##, and I like to use Latin indices instead of Greek, to save myself some typing. So I'd write this result as
$$\frac{\partial}{\partial y^i}\bigg|_p f =\bigg(\frac{\partial}{\partial y^i}\bigg|_p x^j\bigg) \frac{\partial}{\partial x^j}\bigg|_p f.$$

"Don't panic!" · Jun 8, 2015

Ok, thanks Fredrik, sorry for the notational issues.

Would the intuitive description behind why this is so be correct at all? (Quoted below from one of my previous posts)

"Don't panic!" said:

In the intersection of two coordinate charts (with coordinates [itex]x[/itex] and [itex]y[/itex] for simplicity) we can equally well describe functions (0-forms) in terms of the [itex]x[/itex]-coordinates or the [itex]y[/itex]-coordinates. Similarly, the tangent space at a point [itex]p[/itex] in the intersection can equally be described in terms of the coordinate basis [itex]\frac{\partial}{\partial x^{\mu}}[/itex] induced by the [itex]x[/itex] coordinate chart, or in terms of the coordinate basis [itex]\frac{\partial}{\partial y^{\nu}}[/itex] induced by the [itex]y[/itex] coordinate chart. As such, we have a situation in which a given function [itex]f[/itex] could be described in terms of the [itex]x[/itex] or the [itex]y[/itex] coordinate chart at a given point in the intersection, thus we must express one as a function of the other in order for the function to be coordinate independent. Similarly, either of the coordinate bases [itex]\frac{\partial}{\partial x^{\mu}}[/itex] and [itex]\frac{\partial}{\partial y^{\nu}}[/itex] could be used to describe a tangent vector [itex]X[/itex] in the tangent space at that point. Again this tangent vector must be coordinate independent, and so, in this intersection we must have that [tex]X[f]=X^{\mu}\frac{\partial f(y)}{\partial x^{\mu}}=X^{\mu}\frac{\partial f(y(x))}{\partial x^{\mu}}=X^{\mu}\frac{\partial y^{\nu}(x)}{\partial x^{\mu}}\frac{\partial f(y)}{\partial y^{\nu}}=\tilde{X}^{\nu}\frac{\partial f(y)}{\partial y^{\nu}}[/tex] where [itex]X^{\mu}[/itex] and [itex]\tilde{X}^{\nu}[/itex] are the components of [itex]X[/itex] with respect to the two bases [itex]\frac{\partial}{\partial x^{\mu}}[/itex] and [itex]\frac{\partial}{\partial y^{\nu}}[/itex] respectively. Hence, this implies that the components of the vector transform as [tex]\tilde{X}^{\nu}=\frac{\partial y^{\nu}(x)}{\partial x^{\mu}}X^{\mu}.[/tex]

Fredrik · Jun 9, 2015

"Don't panic!" said:

Ok, thanks Fredrik, sorry for the notational issues.

Would the intuitive description behind why this is so be correct at all? (Quoted below from one of my previous posts)

The sentence that involves the words "one as a function of the other" is a bit odd, and the notation y(x) is too. But you found the correct formula for how the components of a vector transform under the change of coordinates ##x\to y##.

"Don't panic!" · Jun 9, 2015

So is the point that in the coordinate chart overlap one can represent a function in terms of either set of coordinates and then transition between these two descriptions via transition functions? Then if we use the coordinate basis [itex]\lbrace\frac{\partial}{\partial x^{\mu}}\rbrace[/itex] induced by one coordinate system [itex]\phi[/itex] to act on a function [itex]f\circ\psi^{-1}[/itex] that is represented in the other coordinate system [itex]\psi[/itex], must we then use the transition functions [itex]\psi\circ\phi^{-1}[/itex] so that we can relate this coordinate basis to the coordinate basis [itex]\lbrace\frac{\partial}{\partial y^{\nu}}\rbrace[/itex] induced by the coordinate system [itex]\psi[/itex] that the function is represented in?

Fredrik · Jun 9, 2015

##\frac{\partial}{\partial x^\mu}\big|_p## acts on ##f##, not on ##f\circ\psi^{-1}##.

"Don't panic!" · Jun 9, 2015

Fredrik said:

##\frac{\partial}{\partial x^\mu}\big|_p## acts on ##f##, not on ##f\circ\psi^{-1}##.

I thought one needed to introduce a coordinate chart before doing calculus though? That is, don't we need to have [itex]f\circ\psi^{-1}:\mathbb{R}^{n}\rightarrow\mathbb{R}[/itex]?

Fredrik · Jun 9, 2015

"Don't panic!" said:

I thought one needed to introduce a coordinate chart before doing calculus though? That is, don't we need to have [itex]f\circ\psi^{-1}:\mathbb{R}^{n}\rightarrow\mathbb{R}[/itex]?

That's correct. That's why we define ##\frac{\partial}{\partial x^\mu}\big|_p## by
$$\frac{\partial}{\partial x^\mu}\bigg|_p f=(f\circ\phi^{-1})_{,\mu}(\phi(p))$$ for all smooth ##f:M\to\mathbb R##.

Since the ##\frac{\partial}{\partial x^\mu}\big|_p## defined this way is the partial derivative functional associated with the point p and the coordinate system ##\phi##, I would prefer to denote it by ##\frac{\partial}{\partial\phi^\mu}\big|_p##. Another option is to denote the coordinate system by ##x## instead of ##\phi##.
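The definition can be sketched in a few lines of code, which also shows why the functional should be labelled by the chart rather than by the value ##\phi(p)##. The setup here is an assumption chosen for illustration only: ##M=(0,\infty)##, with two hypothetical charts ##\phi(p)=p## and ##\psi(p)=p^3##, the function ##f(p)=p^2##, and the helper name `coordinate_partial`. At ##p=1## the two charts assign the same coordinate value, yet the two derivative functionals give different numbers.

```python
def coordinate_partial(f, chart, chart_inv, p, mu, h=1e-6):
    """Estimate (f o chart_inv)_{,mu} at chart(p) by a central difference."""
    a = list(chart(p))
    up, dn = a[:], a[:]
    up[mu] += h
    dn[mu] -= h
    return (f(chart_inv(tuple(up))) - f(chart_inv(tuple(dn)))) / (2 * h)

# M = (0, infinity); a point of M is stored as a positive float.
def phi(p):            # hypothetical chart: phi(p) = p
    return (p,)

def phi_inv(x):
    return x[0]

def psi(p):            # hypothetical chart: psi(p) = p^3
    return (p**3,)

def psi_inv(y):
    return y[0] ** (1.0 / 3.0)

def f(p):              # smooth function on M
    return p**2

p = 1.0
same_coordinate_value = phi(p) == psi(p)           # both charts give (1.0,)
d_phi = coordinate_partial(f, phi, phi_inv, p, 0)  # close to 2
d_psi = coordinate_partial(f, psi, psi_inv, p, 0)  # close to 2/3
print(same_coordinate_value, d_phi, d_psi)
```

The derivative depends on the whole chart near p (through its inverse), not just on the coordinate value at p.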

"Don't panic!" · Jun 9, 2015

Ah ok. I think I was really trying to justify why one needs to do the whole procedure in the first place? Is the point that in the coordinate chart overlap we have two different ways of representing the same object and so we require a way of relating these to representations (achieved via the appropriate change of basis between the two coordinate bases)?

Fredrik · Jun 9, 2015

I guess you could say that (if the last "to" is a typo, supposed to be "two"), since the n-tuple of components of a vector v with respect to the ordered basis associated with a particular coordinate system can be thought of as a representation of the vector v. Change the coordinate system, and you change the ordered basis, which changes the components.

"Don't panic!" · Jun 9, 2015

Fredrik said:

I guess you could say that (if the last "to" is a typo, supposed to be "two"), since the n-tuple of components of a vector v with respect to the ordered basis associated with a particular coordinate system can be thought of as a representation of the vector v. Change the coordinate system, and you change the ordered basis, which changes the components.

Yes sorry it was meant to be "two" and not "to".

Could one also derive the change of coordinate basis by considering two coordinate systems [itex]x[/itex] and [itex]y[/itex] and noting that as [itex]\lbrace\frac{\partial}{\partial x^{i}}\rbrace[/itex] and [itex]\lbrace\frac{\partial}{\partial y^{j}}\rbrace[/itex] are bases for the same tangent space we can express each basis vector [itex]\frac{\partial}{\partial x^{i}}[/itex] as a linear combination of the basis vectors [itex]\lbrace\frac{\partial}{\partial y^{j}}\rbrace[/itex] such that [tex]\frac{\partial}{\partial x^{i}}=A^{j}_{i}\frac{\partial}{\partial y^{j}}[/tex] Then if we act on the [itex]j[/itex]th coordinate function [itex]y^{j}[/itex] (of the [itex]y[/itex] coordinate system) we find that [tex]\frac{\partial y^{j}}{\partial x^{i}}=A^{j}_{i}[/tex] as required. Alternatively, we can equally express each basis vector [itex]\frac{\partial}{\partial y^{j}}[/itex] as a linear combination of the basis vectors [itex]\lbrace\frac{\partial}{\partial x^{i}}\rbrace[/itex] such that [tex]\frac{\partial}{\partial y^{j}}=\tilde{A}^{i}_{j}\frac{\partial}{\partial x^{i}}[/tex] Then, acting on the [itex]i[/itex]th coordinate function [itex]x^{i}[/itex] (of the [itex]x[/itex] coordinate system) we have that [tex]\frac{\partial x^{i}}{\partial y^{j}}=\tilde{A}^{i}_{j}.[/tex] We also note that this implies that [tex]A^{i}_{j}\tilde{A}^{j}_{k}=\delta^{i}_{k}[/tex] and so [itex]\tilde{A}^{i}_{j}=(A^{-1})^{i}_{j}[/itex]?
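The inverse relationship between the two matrices of partial derivatives can be checked on a concrete case. This is only a sketch under assumed choices: [itex]x[/itex] is taken to be Cartesian coordinates [itex](u,v)[/itex] on the plane, [itex]y[/itex] polar coordinates [itex](r,\theta)[/itex], and the point is arbitrary. Here `A` holds the entries [itex]\frac{\partial y^{j}}{\partial x^{i}}[/itex] (row index [itex]j[/itex]) and `A_tilde` holds [itex]\frac{\partial x^{i}}{\partial y^{j}}[/itex] (row index [itex]i[/itex]), both written out by hand.

```python
import math

# Point in the chart overlap, in Cartesian coordinates (u, v).
u, v = 1.0, 2.0
r, theta = math.hypot(u, v), math.atan2(v, u)

# A[j][i] = dy^j/dx^i, with y = (r, theta) and x = (u, v).
A = [
    [u / r, v / r],
    [-v / r**2, u / r**2],
]

# A_tilde[i][j] = dx^i/dy^j, from u = r cos(theta), v = r sin(theta).
A_tilde = [
    [math.cos(theta), -r * math.sin(theta)],
    [math.sin(theta), r * math.cos(theta)],
]

# Matrix product A * A_tilde; by the chain rule it should be the identity.
product = [
    [sum(A[i][j] * A_tilde[j][k] for j in range(2)) for k in range(2)]
    for i in range(2)
]
print(product)
```

Up to rounding, `product` is the 2x2 identity matrix, which is the statement that the two Jacobians are inverse to each other at the chosen point.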

Fredrik · Jun 9, 2015

Yes, that's accurate, and is proved by using the definition of the ##\frac{\partial}{\partial x^\mu}\big|_p## notation and the chain rule, as discussed above.

"Don't panic!" · Jun 9, 2015

ok great, thanks.
