Derivative of a function whose variable is a matrix

Click For Summary

Homework Help Overview

The problem involves the differentiability of a function defined on the space of matrices, specifically the function \( f: M_{n \times n} \rightarrow M_{n \times n} \) given by \( f(X) = X^2 \). Participants are tasked with showing that this function is differentiable and finding its differential.

Discussion Character

  • Exploratory, Conceptual clarification, Mathematical reasoning

Approaches and Questions Raised

  • Participants discuss the use of the difference quotient to identify a linear transformation that would satisfy the conditions for differentiability. There is an exploration of the terms involved in the limit and attempts to simplify the expression to isolate the linear part. Questions arise regarding the nature of the linear transformation and the implications of treating \( h \) as a matrix versus a vector.

Discussion Status

Some participants have provided insights into the concept of the Fréchet derivative and its application to the problem. There is recognition of the distinction between linear transformations and matrices, leading to clarifications in understanding. However, there is no explicit consensus on the necessity of rewriting the differential in a specific form, and the discussion remains open-ended.

Contextual Notes

Participants note the complexity of treating \( h \) as a matrix rather than a vector, which affects the formulation of the derivative. There is also mention of potential constraints related to the dimensionality of the matrices involved.

lus1450
Messages
40
Reaction score
1

Homework Statement


Let ##f: M_{n \times n} \rightarrow M_{n \times n}## with ##f(X) = X^2##, where ##M_{n \times n}## denotes the vector space of ##n \times n## matrices. Show ##f## is differentiable and find its differential.


Homework Equations





The Attempt at a Solution


So far, I've been looking at the difference quotient in order to "guess" linear transformation ##A## that will satisfy it. We have:
$$\lim_{|h|\to 0} \frac{|f(X+h) - f(X) - Ah|}{|h|} = \lim_{|h|\to 0} \frac{|Xh + hX + h^2 - Ah|}{|h|}$$after a little simplifying.
And I was thinking for a fixed ##X##, I have ##A(h) = Xh + hX##, as I wanted to get rid of the ##Xh + hX## term in the quotient. However, I'm stuck now since it's ##Ah## and not just ##A##. I was thinking of throwing in an ##h^{-1}## to my ##A##, but that would make it non-linear. Any suggestions?
 
Physics news on Phys.org
Zaculus said:

Homework Statement


Let ##f: M_{n \times n} \rightarrow M_{n \times n}## with ##f(X) = X^2##, where ##M_{n \times n}## denotes the vector space of ##n \times n## matrices. Show ##f## is differentiable and find its differential.


Homework Equations





The Attempt at a Solution


So far, I've been looking at the difference quotient in order to "guess" linear transformation ##A## that will satisfy it. We have:
$$\lim_{|h|\to 0} \frac{|f(X+h) - f(X) - Ah|}{|h|} = \lim_{|h|\to 0} \frac{|Xh + hX + h^2 - Ah|}{|h|}$$after a little simplifying.
And I was thinking for a fixed ##X##, I have ##A(h) = Xh + hX##, as I wanted to get rid of the ##Xh + hX## term in the quotient. However, I'm stuck now since it's ##Ah## and not just ##A##. I was thinking of throwing in an ##h^{-1}## to my ##A##, but that would make it non-linear. Any suggestions?


The concept you need is that of the Frechet Derivative; see http://en.wikipedia.org/wiki/Fréchet_derivative or
http://www.maths.lse.ac.uk/Courses/MA409/Notes-Part2.pdf . If ##h## is a matrix, then for ##f(x) = x^2## we have
f(X+h) - f(X) = (X+h)(X+h) - X^2 = hX + Xh + h^2,
so the linear operator ##D_X: h \rightarrow M_{n \times n},## defined as ##D_X(h) = Xh + hX##, is the Frechet derivative. Since ##D_X## is linear, it will always be possible to write ##D_X(h)## as ##A_X h## for some matrix ##A_X##, if you really want to, but I am not sure if your question really requires that you do so.
 
Ah okay, thank you! I just realized I was swapping linear transformation and matrix in my thought process, since in my linear algebra class "A" is normally a matrix while in my analysis class, "A" is a linear transformation. As such, it is saying A(h), not A*h. My confusion is cleared up =]
 
Zaculus said:
Ah okay, thank you! I just realized I was swapping linear transformation and matrix in my thought process, since in my linear algebra class "A" is normally a matrix while in my analysis class, "A" is a linear transformation. As such, it is saying A(h), not A*h. My confusion is cleared up =]

I take back what I said about it always being possible to write ##D_X(h)## as ##A_Xh## for some matrix ##A_X##. I forgot that ##h## is a matrix, not a vector. If it were a vector, that re-write would be possible, but maybe not if it is a matrix. (Actually, ##h## is an ##n^2##-dimensional vector, so if we are willing to re-write all matrices as ##n^2##-vectors, there would, indeed, be an ##n^2 \times n^2## matrix giving us what we need, but that seems excessive.)
 

Similar threads

  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 20 ·
Replies
20
Views
3K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 13 ·
Replies
13
Views
2K
  • · Replies 8 ·
Replies
8
Views
1K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 9 ·
Replies
9
Views
6K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 4 ·
Replies
4
Views
1K