MHB How does Lemma 2.2.3 Demonstrate the Uniqueness of the Derivative?

  • Thread starter Thread starter Math Amateur
  • Start date Start date
  • Tags Tags
    Differentiability
Click For Summary
Lemma 2.2.3 in "Multidimensional Real Analysis I: Differentiation" demonstrates the uniqueness of the derivative by showing that for any element \( h \) in the domain, the expression \( Df(a)h \) depends solely on the function \( f \) and the points \( a \) and \( h \). The proof involves taking the limit as \( t \) approaches zero, allowing for the expression \( \frac{1}{t}(f(a + th) - f(a)) \) to converge to \( Df(a)(h) \). This approach maintains \( h \) as fixed while manipulating \( t \) to establish the limit, thereby ensuring that \( Df(a) \) is uniquely determined. The discussion also clarifies the role of norms in evaluating the limit of the error term \( \epsilon_a(th) \) as \( t \) approaches zero. Overall, the lemma confirms that if two derivatives at point \( a \) yield the same result for all \( h \), they must be identical.
Math Amateur
Gold Member
MHB
Messages
3,920
Reaction score
48
I am reading "Multidimensional Real Analysis I: Differentiation" by J. J. Duistermaat and J. A. C. Kolk ...

I am focused on Chapter 2: Differentiation ... ...

I need help with the proof of Lemma 2.2.3 ... ...

Duistermaat and Kolk's Lemma 2.2.3 and its proof read as follows:View attachment 7796
I do not understand the strategy (or overall idea) of this proof ... how does considering $$a + th \in U$$ lead to demonstrating that $$\text{Df}(a)$$ is uniquely determined ...

and

how do we get

$$\text{Df}(a) h = \frac{1}{t} ( f ( a + th ) - f(a) ) - \frac{1}{t} \epsilon_a (th)$$ ... ... ... ... ... (1)follow from (2.10) ... ... and then, how does (1) lead to ...$$\text{Df}(a) h = \lim_{ t \rightarrow 0 } ( f ( a + th ) - f(a) )$$and, indeed, how does the above show that $$\text{Df}(a)$$ is uniquely determined ... .. ?
Hope someone can help ... ...

Peter==========================================================================================The above post mentions (2.10) which is in the notes following Definition 2.2.2 ... so I am providing Definition 2.2.2 and the accompanying notes ... ... as follows ... ...
View attachment 7797
https://www.physicsforums.com/attachments/7798
 
Physics news on Phys.org
Peter said:
I do not understand the strategy (or overall idea) of this proof ... how does considering $$a + th \in U$$ lead to demonstrating that $$\text{Df}(a)$$ is uniquely determined ...

If we show that for any $h \in \mathbb{R}^n$ (i.e. any element in the domain of the linear mapping $Df(a)$) the expression $Df(a)h$ depends only on $f$ (and of course on the points $a$ and $h$), then $Df(a)$ is uniquely determined by $f$. For, any other derivative $L$ of $f$ at $a$ is then such that
\[
Lh = Df(a)h
\]
for all $h \in \mathbb{R}^n$.

Peter said:
and

how do we get

$$\text{Df}(a) h = \frac{1}{t} ( f ( a + th ) - f(a) ) - \frac{1}{t} \epsilon_a (th)$$ ... ... ... ... ... (1)follow from (2.10) ... ...

This is a matter of taking the first equality in (2.10) with $th$ in place of $h$.

Peter said:
and then, how does (1) lead to ...$$\text{Df}(a) h = \lim_{ t \rightarrow 0 } ( f ( a + th ) - f(a) )$$

Note that
\[
\left\|\frac{1}{t}\epsilon_a(th)\right\| = \|h\| \cdot \frac{\|\epsilon_a(th)\|}{\|t h\|},
\]
and now take the limit $t \to 0$ (keeping $h$ fixed!) and use the second equality in (2.10).

Peter said:
and, indeed, how does the above show that $$\text{Df}(a)$$ is uniquely determined ... .. ?

See above.
 
Krylov said:
If we show that for any $h \in \mathbb{R}^n$ (i.e. any element in the domain of the linear mapping $Df(a)$) the expression $Df(a)h$ depends only on $f$ (and of course on the points $a$ and $h$), then $Df(a)$ is uniquely determined by $f$. For, any other derivative $L$ of $f$ at $a$ is then such that
\[
Lh = Df(a)h
\]
for all $h \in \mathbb{R}^n$.
This is a matter of taking the first equality in (2.10) with $th$ in place of $h$.
Note that
\[
\left\|\frac{1}{t}\epsilon_a(th)\right\| = \|h\| \cdot \frac{\|\epsilon_a(th)\|}{\|t h\|},
\]
and now take the limit $t \to 0$ (keeping $h$ fixed!) and use the second equality in (2.10).
See above.
Thanks for the help, Krylov ...

Working through your post line by line and reflecting ...

... just a small clarification ... at the start of your reply, you write:

" ... ... If we show that for any $h \in \mathbb{R}^n$ (i.e. any element in the domain of the linear mapping $Df(a)$) the expression $Df(a)h$ depends only on $f$ (and of course on the points $a$ and $h$), then $Df(a)$ is uniquely determined by $f$. For, any other derivative $L$ of $f$ at $a$ is then such that
\[
Lh = Df(a)h
\]
for all $h \in \mathbb{R}^n$. ... ... "
If we wish to show that the Lemma holds for any $$a + h \in U$$ why don't we begin the proof with a statement like the following:

"Consider any $$h \in \mathbb{R}$$ such that $$a + h \in U$$ ... ... "

then surely what we prove will hold for any/every $$h$$ such that $$a + h \in U$$ ... so why do we need to bring $$t$$ into the proof ... ...

Can you help further ... ... ?

Peter
 
Krylov said:
If we show that for any $h \in \mathbb{R}^n$ (i.e. any element in the domain of the linear mapping $Df(a)$) the expression $Df(a)h$ depends only on $f$ (and of course on the points $a$ and $h$), then $Df(a)$ is uniquely determined by $f$. For, any other derivative $L$ of $f$ at $a$ is then such that
\[
Lh = Df(a)h
\]
for all $h \in \mathbb{R}^n$.
This is a matter of taking the first equality in (2.10) with $th$ in place of $h$.
Note that
\[
\left\|\frac{1}{t}\epsilon_a(th)\right\| = \|h\| \cdot \frac{\|\epsilon_a(th)\|}{\|t h\|},
\]
and now take the limit $t \to 0$ (keeping $h$ fixed!) and use the second equality in (2.10).
See above.
Hi Krylov ... thanks again for the help ...

Just another clarification ...

You write:

" ... ... Note that
\[
\left\|\frac{1}{t}\epsilon_a(th)\right\| = \|h\| \cdot \frac{\|\epsilon_a(th)\|}{\|t h\|},
\]
and now take the limit $t \to 0$ (keeping $h$ fixed!) and use the second equality in (2.10)."The above expression is expressed in norms ... but neither

$$\text{Df}(a) h = \frac{1}{t} ( f ( a + th ) - f(a) ) - \frac{1}{t} \epsilon_a (th)$$ ... ... ... ... ... (1)

nor

$$\text{Df}(a) h = \lim_{ t \rightarrow 0 } ( f ( a + th ) - f(a) )$$

have norm signs in them ...Can you explain what is going on with the taking of norms in your explanation ...Hope that my question is clear ...

Peter
 
Peter said:
If we wish to show that the Lemma holds for any $$a + h \in U$$ why don't we begin the proof with a statement like the following:

"Consider any $$h \in \mathbb{R}$$ such that $$a + h \in U$$ ... ... "

then surely what we prove will hold for any/every $$h$$ such that $$a + h \in U$$ ... so why do we need to bring $$t$$ into the proof ... ...

The purpose of the proof is to derive an expression for $Df(a)h$ where $h$ is arbitrary but fixed. This expression should depend only on $f$, $a$ and $h$. Namely, we will then have established that the derivative of $f$ at $a$ (if it exists) acting on an arbitrary element of its domain is uniquely determined.

However, the proof works by taking a limit. So in order to keep the direction $h$ arbitrary but fixed while at the same time being able to take the limit, we introduce the parameter $t$. Then we show that
\[
\frac{1}{t}{\left(f(a + th) - f(a)\right)} \to Df(a)(h),
\]
as $t \to 0$. Without $t$ in the game, we would have to somehow consider the limit $h \to 0$, which would spoil the purpose of keeping $h$ fixed.

Peter said:
You write:

" ... ... Note that
\[
\left\|\frac{1}{t}\epsilon_a(th)\right\| = \|h\| \cdot \frac{\|\epsilon_a(th)\|}{\|t h\|},
\]
and now take the limit $t \to 0$ (keeping $h$ fixed!) and use the second equality in (2.10)."The above expression is expressed in norms ... but neither

$$\text{Df}(a) h = \frac{1}{t} ( f ( a + th ) - f(a) ) - \frac{1}{t} \epsilon_a (th)$$ ... ... ... ... ... (1)

nor

$$\text{Df}(a) h = \lim_{ t \rightarrow 0 } ( f ( a + th ) - f(a) )$$

have norm signs in them ...Can you explain what is going on with the taking of norms in your explanation ...

Sure. I wanted to argue that in the limit $t \to 0$, the quantity $\frac{1}{t}\epsilon_a(th)$ that appears in the displayed equation in the proof actually goes to zero, because that is what the proof uses in its penultimate line.

Now, the above quantity goes to zero precisely when $\left\|\frac{1}{t}\epsilon_a(th)\right\| \to 0$. The latter expression however is easier to deal with, because we can divide by $\|h\|$ but not by $h$ itself. (In turn, the purpose of this division-and-multiplication manipulation is to recover the second equality in (2.10) in your text, but with $\hat{h} := th$ in place of $h$. Note that for fixed $h \in \mathbb{R}^n$ we have that $\hat{h} \to 0$ if $t \to 0$, so it would follow that $\frac{\|\epsilon_a(\hat{h})\|}{\|\hat{h}\|} \to 0$)
 
Krylov said:
The purpose of the proof is to derive an expression for $Df(a)h$ where $h$ is arbitrary but fixed. This expression should depend only on $f$, $a$ and $h$. Namely, we will then have established that the derivative of $f$ at $a$ (if it exists) acting on an arbitrary element of its domain is uniquely determined.

However, the proof works by taking a limit. So in order to keep the direction $h$ arbitrary but fixed while at the same time being able to take the limit, we introduce the parameter $t$. Then we show that
\[
\frac{1}{t}{\left(f(a + th) - f(a)\right)} \to Df(a)(h),
\]
as $t \to 0$. Without $t$ in the game, we would have to somehow consider the limit $h \to 0$, which would spoil the purpose of keeping $h$ fixed.
Sure. I wanted to argue that in the limit $t \to 0$, the quantity $\frac{1}{t}\epsilon_a(th)$ that appears in the displayed equation in the proof actually goes to zero, because that is what the proof uses in its penultimate line.

Now, the above quantity goes to zero precisely when $\left\|\frac{1}{t}\epsilon_a(th)\right\| \to 0$. The latter expression however is easier to deal with, because we can divide by $\|h\|$ but not by $h$ itself. (In turn, the purpose of this division-and-multiplication manipulation is to recover the second equality in (2.10) in your text, but with $\hat{h} := th$ in place of $h$. Note that for fixed $h \in \mathbb{R}^n$ we have that $\hat{h} \to 0$ if $t \to 0$, so it would follow that $\frac{\|\epsilon_a(\hat{h})\|}{\|\hat{h}\|} \to 0$)
Thanks so much for the help, Krylov ...

Just now reflecting on what you have written ...

Thanks again,

Peter
 
We all know the definition of n-dimensional topological manifold uses open sets and homeomorphisms onto the image as open set in ##\mathbb R^n##. It should be possible to reformulate the definition of n-dimensional topological manifold using closed sets on the manifold's topology and on ##\mathbb R^n## ? I'm positive for this. Perhaps the definition of smooth manifold would be problematic, though.

Similar threads

Replies
4
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
Replies
1
Views
1K
  • · Replies 2 ·
Replies
2
Views
2K
Replies
2
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
1
Views
2K