On inverse function theorem in Spivak's CoM

  • Context: Undergrad 
  • Thread starter Thread starter psie
  • Start date Start date
  • Tags Tags
    Multivariable calculus
Click For Summary

Discussion Overview

The discussion centers on the Inverse Function Theorem as presented in Spivak's Calculus on Manifolds. Participants explore the assumptions and implications of the theorem, particularly regarding the conditions under which it holds and the nature of the proof. The scope includes theoretical aspects of differentiability and the behavior of functions under linear transformations.

Discussion Character

  • Technical explanation
  • Conceptual clarification
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • Some participants question why the proof can assume the derivative of the function is the identity map, particularly when the linear transformation is not the identity.
  • Others argue that if the theorem holds for a transformed function, it can be shown to hold for the original function through composition with a linear transformation.
  • There is a discussion about the validity of a specific inequality derived in the proof, with some participants suggesting it follows from the reverse triangle inequality.
  • A participant presents a specific case of a function and questions the applicability of the derived inequality, leading to a discussion about the conditions under which the theorem is proven.
  • Some participants clarify that the proof does indeed consider the case when the derivative is the identity, and that the general result follows from this special case.

Areas of Agreement / Disagreement

Participants express differing views on the assumptions of the theorem and the implications of the proof. There is no consensus on the necessity of proving the theorem for the transformed function or the validity of the inequalities presented.

Contextual Notes

Limitations include the dependence on specific assumptions about the linear transformation and the conditions under which the inequalities hold. The discussion does not resolve these limitations.

Who May Find This Useful

This discussion may be useful for students and professionals interested in advanced calculus, particularly those studying the properties of differentiable functions and the implications of the Inverse Function Theorem.

psie
Messages
315
Reaction score
40
TL;DR
I have two questions on Spivak's proof of the inverse function theorem in Calculus on Manifolds.
I know of a thread on this site with a similar question, but no definite answer. I will not state the whole proof, as it is quite long.

2-11 Theorem (Inverse Function Theorem). Suppose that ##f: \mathbb{R}^n\to\mathbb{R}^n## is continuously differentiable in an open set containing ##a##, and ##\det f'(a)\neq 0##. Then there is an open set ##V## containing ##a## and an open set ##W## containing ##f(a)## such that ##f:V\to W## has a continuous inverse ##f^{-1}:W\to V## which is differentiable and for all ##y\in W## satisfies $$(f^{-1})'(y) = [f'(f^{-1}(y))]^{-1}$$ Proof. Let ##\lambda## be the linear transformation ##Df(a)##. Then ##\lambda## is non-singular, since ##\det f'(a)\neq 0##. Now ##D(\lambda^{-1}\circ f)(a) = D(\lambda^{-1})(f(a))\circ Df(a) = \lambda^{-1}\circ Df(a)## is the identity linear transformation. If the theorem is true for ##\lambda^{-1}\circ f##, it is clearly true for ##f##. Therefor we may assume at the outset that ##\lambda## is the identity.

... ... ...

Note that (3) and Lemma 2-10 applied to ##g(x)=f(x)-x## imply for ##x_1,x_2\in U## that $$|f(x_1)-x_1-(f(x_2)-x_2)|\leq\frac12|x_1-x_2|.$$ Since \begin{align*}
|x_1-x_2|-|f(x_1)-f(x_2)|&\leq |f(x_1)-x_1-(f(x_2)-x_2)| \\ &\leq\frac12 |x_1-x_2|,
\end{align*}
... ... ...

1. Why can we assume ##f## to have the identity map as its derivative? I understand how if the theorem is true for ##g = \lambda^{-1} \circ f##, then its true for ##f## and I also understand that ##Dg(a)=\mathrm{id}##, where ##\mathrm{id}## is the identity map. If ##\lambda=\mathrm{id}##, then simply ##g=f## and this is how the proof proceeds. But what when ##\lambda\neq \mathrm{id}##?

2. Why is $$|x_1-x_2|-|f(x_1)-f(x_2)|\leq |f(x_1)-x_1-(f(x_2)-x_2)|,$$ true? I'm tempted to say it is the reverse triangle inequality, but shouldn't there be an extra pair of absolute value signs around the LHS of this inequality?
 
Physics news on Phys.org
Hi @psie,

1. Suppose the assertion holds true for ##g##. Then there are open neighborhoods ##V \ni a## and ##U\ni g(a)## such that ##g : V \to W## is a ##C^1##-diffeomorphism. Since ##\lambda : \mathbb{R}^n \to \mathbb{R}^n## is a ##C^1##-diffeomorphism, the image ##W := \lambda(U)## is an open neighborhood of ##f(a)## and the composition ##V \xrightarrow[\approx]{g} U \xrightarrow[\approx]{\lambda} W## is a ##C^1##-diffeomorphism. But this composition is ##f : V \to W##. The inverse ##f^{-1} : W \to V## is ##g^{-1} \circ \lambda^{-1}##, so the formula for ##(f^{-1})'(y)## is deduced from the formula for ##(g^{-1})'(z)## and the chain rule.

2. You are right that this is due to the reverse triangle inequality. Absolute value signs around the LHS are not needed since ##|x| \ge x## for all real numbers ##x##.
 
  • Like
Likes   Reactions: psie
Thank you, @Euge. Regarding 1, I understand what you’re writing, but shouldn’t we then prove the theorem for ##g##? Spivak only seems to prove a very special case, namely when ##g=f##, i.e. when ##\lambda=\mathrm{id}##. How do we modify the proof when ##\lambda\neq\mathrm{id}##?
 
If you multiply your function by a linear transformation the derivate is also multiplied by that linear transformation. So if you multiply it by the inverse of its derivative, you get a new function g whose derivative is the identity map.
 
  • Like
Likes   Reactions: psie
psie said:
Thank you, @Euge. Regarding 1, I understand what you’re writing, but shouldn’t we then prove the theorem for ##g##?
That is what he did. Specifically, he proves the result when ##\lambda = \text{id}##. The general result is then obtained by considering ##\lambda^{-1} \circ f##. Since ##D(\lambda^{-1}\circ f)(a)## is the identity, by what he has shown, ##\lambda^{-1}\circ f## is ##C^1## locally invertible. Thus ##f = \lambda \circ (\lambda^{-1}\circ f)## is also ##C^1## locally invertible.
 
  • Like
Likes   Reactions: Greg Bernhardt and psie
Hi,
I don't want to create a new thread because my question is related to this one. One of the steps in the proof of this theorem is the following relation:
2\cdot|f(x_{1})-f(x_{2})|\geq|x_{1}-x_{2}|
Let's assume the simplest case n=1, and the function:
f(x)=\frac{1}{3}x
It is 𝐶^{1} and invertible: f^{-1}(y)=3y
Using this relation, we get, for example when dx = 1 that \frac{2}{3} >= 1. How to understand it?
 
Last edited:
stan_s said:
Hi,
I don't want to create a new thread because my question is related to this one. One of the steps in the proof of this theorem is the following relation:
2\cdot|f(x_{1})-f(x_{2})|\geq|x_{1}-x_{2}|
Let's assume the simplest case n=1, and the function:
f(x)=\frac{1}{3}x
It is 𝐶^{1} and invertible: f^{-1}(y)=3y
Using this relation, we get, for example when dx = 1 that \frac{2}{3} >= 1. How to understand it?
It was assumed in the proof that ##Df(a)## is the identity, from which the inequality ##2|f(x_1) - f(x_2)| \ge |x_1 - x_2|## was obtained. Your function has derivative ##1/3##, not ##1##.
 
  • Like
Likes   Reactions: stan_s and psie

Similar threads

  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 22 ·
Replies
22
Views
3K
  • · Replies 0 ·
Replies
0
Views
1K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 18 ·
Replies
18
Views
3K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 30 ·
2
Replies
30
Views
4K
  • · Replies 25 ·
Replies
25
Views
4K