Second derivative of chained function

In summary, we discussed the concept of multivariable linear functions and single variable functions and how to evaluate the second derivative of a function composed of these two types of functions. We used the product rule and chain rule to calculate the second derivative and also introduced the notation for partial derivatives and total derivatives.
  • #1
lriuui0x0
101
25
Let's say we have a function ##M(f(x))## where ##M: \mathbb{R}^2 \to \mathbb{R}^2## is a multivariable linear function, and ##f: \mathbb{R} \to \mathbb{R}^2## is a single variable function. Now I'm getting confused with evaluating the following second derivative of the function:

$$
[M(f(x))]'' = [M'(f(x)) \circ f'(x)]'
$$

How do we continue to evaluate the second derivative? This may be a very basic question, but I'm trying to get clear on the dimension of functions.
 
Last edited:
Physics news on Phys.org
  • #2
lriuui0x0 said:
Let's say we have a function ##M(f(x))## where ##M: \mathbb{R}^2 \to \mathbb{R}^2## is a multivariable linear function, and ##f: \mathbb{R} \to \mathbb{R}## is a single variable function. Now I'm getting confused with evaluating the following second derivative of the function:

$$
[M(f(x))]'' = [M'(f(x)) \circ f'(x)]'
$$

How do we continue to evaluate the second derivative? This may be a very basic question, but I'm trying to get clear on the dimension of functions.
This doesn't make sense. If ##M## is a function on ##\mathbb R^2##, then it needs two argumemts. You've given it only one, namely ##f(x)##.
 
  • #3
lriuui0x0 said:
Let's say we have a function ##M(f(x))## where ##M: \mathbb{R}^2 \to \mathbb{R}^2## is a multivariable linear function, and ##f: \mathbb{R} \to \mathbb{R}## is a single variable function. Now I'm getting confused with evaluating the following second derivative of the function:

$$
[M(f(x))]'' = [M'(f(x)) \circ f'(x)]'
$$

How do we continue to evaluate the second derivative? This may be a very basic question, but I'm trying to get clear on the dimension of functions.
Assuming ##M: \mathbb{R} \to \mathbb{R}##, and we want the second derivative of the function ##g(x) = M(f(x))##, then the first derivative is: $$g'(x) = M'(f(x))f'(x)$$ Note that this is not what you've written.

The second derivative is then evaluated using the product rule and the chain rule again.
 
  • #4
PeroK said:
This doesn't make sense. If ##M## is a function on ##\mathbb R^2##, then it needs two argumemts. You've given it only one, namely ##f(x)##.
Sorry I made a mistake in my question, it should be ##f: \mathbb{R} \to \mathbb{R}^2## instead.
 
  • #5
lriuui0x0 said:
Sorry I made a mistake in my question, it should be ##f: \mathbb{R} \to \mathbb{R}^2## instead.
You have partial derivatives, not ordinary derivatives, for ##M(y, z)##. Where ##f(x) = (y(x), z(x))##.
 
  • #6
PeroK said:
You have partial derivatives, not ordinary derivatives, for ##M(y, z)##. Where ##f(x) = (y(x), z(x))##.
I'm thinking about the total derivatives instead, ##M'## is a map ##\mathbb{R}^2 \to \mathbb{R}^2## at every ##\mathbb{R}^2## point. Therefore ##M'(f(x)) : \mathbb{R}^2 \to \mathbb{R}^2## and it can be composed with ##f'(x): \mathbb{R} \to \mathbb{R}^2##.
 
  • #7
lriuui0x0 said:
Let's say we have a function ##M(f(x))## where ##M: \mathbb{R}^2 \to \mathbb{R}^2## is a multivariable linear function, and ##f: \mathbb{R} \to \mathbb{R}^2## is a single variable function. Now I'm getting confused with evaluating the following second derivative of the function:

$$
[M(f(x))]'' = [M'(f(x)) \circ f'(x)]'
$$

How do we continue to evaluate the second derivative? This may be a very basic question, but I'm trying to get clear on the dimension of functions.

[itex]\circ[/itex] is function composition; the inner product is [itex]\cdot[/itex] (\cdot)

The derivative of [itex]M[/itex] is a 2x2 matrix [itex]J_{ij} = \partial_jM_i[/itex]; the second derivative is a 2x2x2 tensor [itex]K_{ijk} = \partial_j\partial_k M_i[/itex] where [itex]\delta_i[/itex] denotes differentiation with respect to the [itex]i[/itex]th argument. It is difficult to write expressions involving these without using suffix notation:
[tex]
\begin{split}
(M \circ f)_i'(t) &= (J_{ij} \circ f)(t)f_j'(t) \\
(M \circ f)_i''(t) &= (J_{ij} \circ f)(t)f_j''(t) + f_j'(t)f_k'(t)(K_{ijk} \circ f)(t)
\end{split}
[/tex] Now the first derivative and the first term of the second derivative can be written as matrix products of [itex](J \circ f)(t))[/itex] and [itex]f'(t)[/itex] or [itex]f''(t)[/itex] respectively; it's much harder to do that with the second term of the second derivative, but [itex]((K \circ f)(t) \cdot f'(t)) \cdot f'(t)[/itex] is an attempt, on the understanding that dotting implies contraction over the last index of the first factor and the first index of the second factor.
 
  • #8
lriuui0x0 said:
I'm thinking about the total derivatives instead, ##M'## is a map ##\mathbb{R}^2 \to \mathbb{R}^2## at every ##\mathbb{R}^2## point. Therefore ##M'(f(x)) : \mathbb{R}^2 \to \mathbb{R}^2## and it can be composed with ##f'(x): \mathbb{R} \to \mathbb{R}^2##.
You've lost me. I think you need to specify carefully (without typos) what functions you are dealing with and what derivatives you want to calculate. ##M'(x)## is generally only used for a simple function of one variable.
 
  • #9
pasmith said:
[itex]\circ[/itex] is function composition; the inner product is [itex]\cdot[/itex] (\cdot)

The derivative of [itex]M[/itex] is a 2x2 matrix [itex]J_{ij} = \partial_jM_i[/itex]; the second derivative is a 2x2x2 tensor [itex]K_{ijk} = \partial_j\partial_k M_i[/itex] where [itex]\delta_i[/itex] denotes differentiation with respect to the [itex]i[/itex]th argument. It is difficult to write expressions involving these without using suffix notation:
[tex]
\begin{split}
(M \circ f)_i'(t) &= (J_{ij} \circ f)(t)f_j'(t) \\
(M \circ f)_i''(t) &= (J_{ij} \circ f)(t)f_j''(t) + f_j'(t)f_k'(t)(K_{ijk} \circ f)(t)
\end{split}
[/tex] Now the first derivative and the first term of the second derivative can be written as matrix products of [itex](J \circ f)(t))[/itex] and [itex]f'(t)[/itex] or [itex]f''(t)[/itex] respectively; it's much harder to do that with the second term of the second derivative, but [itex]((K \circ f)(t) \cdot f'(t)) \cdot f'(t)[/itex] is an attempt, on the understanding that dotting implies contraction over the last index of the first factor and the first index of the second factor.
Is there some reference of chain rule using index notation? Also what do you mean by ##J_{ij} \circ f##, isn't ##J_{ij}## a single number?
 
  • #10
PeroK said:
You've lost me. I think you need to specify carefully (without typos) what functions you are dealing with and what derivatives you want to calculate. ##M'(x)## is generally only used for a simple function of one variable.
Sorry about that! I guess I'm just randomly coming up with the notation...
 
  • #11
lriuui0x0 said:
where ##M: \mathbb{R}^2 \to \mathbb{R}^2## is a multivariable linear function
Do you mean that ##M## is linear as a function of its argument in ##\mathbb{R}^2##? If yes, then its role in computing the derivative of ##M \circ f : \mathbb{R} \to \mathbb{R}^2## is trivial and you may want to forget about it for the purpose of the question.
 
  • #12
S.G. Janssens said:
Do you mean that ##M## is linear as a function of its argument in ##\mathbb{R}^2##? If yes, then its role in computing the derivative of ##M \circ f : \mathbb{R} \to \mathbb{R}^2## is trivial and you may want to forget about it for the purpose of the question.
I'd like to know for both ##M## is linear or not. When ##M## is linear, it's basically a matrix. Why being linear makes the question trivial?
 
  • #13
lriuui0x0 said:
I'd like to know for both ##M## is linear or not. Why being linear makes the question trivial?
Because then the answer is
$$
(M \circ f)^{(r)}(x) = M(f^{(r)}(x)) \qquad \forall\,r\in \mathbb{Z}_+,\,\forall\,x\in \mathbb{R},
$$
where the superscript indicates the ##r##th derivative with respect to the scalar variable ##x##. I would also interpret ##M## as a matrix and write the RHS as ##M f^{(r)}(x)##. (##M## just transforms the ##r##th derivative of the curve in ##\mathbb{R}^2## parameterized by ##f##.)
 
  • #14
S.G. Janssens said:
Because then the answer is
$$
(M \circ f)^{(r)}(x) = M(f^{(r)}(x)) \qquad \forall\,r\in \mathbb{Z}_+,\,\forall\,x\in \mathbb{R},
$$
where the superscript indicates the ##r##th derivative with respect to the scalar variable ##x##. I would also interpret ##M## as a matrix and write the RHS as ##M f^{(r)}(x)##. (##M## just transforms the ##r##th derivative of the curve in ##\mathbb{R}^2## parameterized by ##f##.)
Oh is this just the analogy of ##(sf)'## where ##s## is a scalar. Here we just replace the scalar multiplication with a matrix multiplication but it still behaves like a scalar.
 
  • #15
lriuui0x0 said:
Oh is this just the analogy of ##(sf)'## where ##s## is a scalar. Here we just replace the scalar multiplication with a matrix multiplication but it still behaves like a scalar.
Exactly.
 
  • Like
Likes lriuui0x0

1. What is the second derivative of a chained function?

The second derivative of a chained function is the derivative of the first derivative of the function. It measures the rate of change of the slope of the original function.

2. How is the second derivative of a chained function calculated?

The second derivative of a chained function can be calculated by first finding the first derivative of the function, and then taking the derivative of that result. This can be done using the chain rule or by using the product rule and the chain rule.

3. What does the second derivative of a chained function tell us?

The second derivative of a chained function can tell us about the concavity of the original function. If the second derivative is positive, the function is concave up, and if it is negative, the function is concave down. It can also help us identify points of inflection and local extrema.

4. Why is the second derivative of a chained function important?

The second derivative of a chained function is important because it helps us understand the behavior of the original function. It can provide information about the curvature of the function and help us identify important points on the graph.

5. What are some real-world applications of the second derivative of a chained function?

The second derivative of a chained function has many real-world applications, such as in physics, economics, and engineering. It can be used to analyze the motion of objects, predict changes in economic trends, and optimize processes in engineering. It is also used in machine learning and data analysis to identify patterns and make predictions.

Similar threads

Replies
3
Views
1K
Replies
14
Views
2K
Replies
2
Views
791
Replies
2
Views
1K
Replies
2
Views
961
Replies
1
Views
1K
Replies
5
Views
1K
Replies
2
Views
298
Replies
1
Views
207
Replies
1
Views
960
Back
Top