Derivative of determinant wrt matrix

Click For Summary

Discussion Overview

The discussion focuses on deriving the derivative of the determinant of a matrix expression, specifically det(A+O'XO), with respect to the matrix X. Participants explore various mathematical approaches and formulas related to matrix calculus and determinants.

Discussion Character

  • Technical explanation
  • Mathematical reasoning
  • Exploratory

Main Points Raised

  • One participant introduces the relationship det(A+O'XO) = exp(tr(log(A+O'XO))) as a starting point for the derivation.
  • Another participant discusses using the matrix partial derivative dX and its implications for deriving the determinant.
  • There is mention of the adjugate of a matrix and its role in the derivative of the determinant, with references to definitions and properties from external sources.
  • A participant shares an alternative approach using the general formula d(det(Y))/dX = dtr(QY)/dX, where Q is defined in relation to Y.
  • Questions arise regarding the transition between steps in the derivation, particularly how the partial derivative dX interacts with the trace function.

Areas of Agreement / Disagreement

Participants express agreement on certain derivations and approaches, but there are also questions and clarifications sought regarding specific steps in the calculations. The discussion remains open with no consensus on all aspects of the derivation.

Contextual Notes

Some participants note that the derivation relies on properties of matrix calculus and determinants, but there are unresolved questions about the application of these properties in specific steps.

bakav
Messages
7
Reaction score
0
Hi there,
I want to derive the derivative of the det(A+O'XO) with respect to X, where A, O', O and X are all matrix.
Any suggestions,
Thanks
Baska
 
Physics news on Phys.org
First note that
det(A+O'XO) = exp(tr(log(A+O'XO)))

Then define the matrix partial derivative dX such that
dX tr(Xn) = n Xn-1
In terms of components, this means
(dX f(X))ij = dXji f(X)
where f(X) is an arbitrary scalar function of X or some component of a vector/matrix/tensor function of X.

Then we take the derivative
dX det(A+O'XO)
= dX exp(tr(log(A+O'XO)))
= exp(tr(log(A+O'XO))) dXtr(log(A+O'XO))
= det(A+O'XO) tr(dXlog(A+O'XO))
= det(A+O'XO) tr((A+O'XO)-1dX(A+O'XO))
= det(A+O'XO) O(A+O'XO)-1O'
= O adj(A+O'XO) O'

where adj(A) = det(A) A-1 is the http://en.wikipedia.org/wiki/Adjugate" of A. (Note that in older texts you see it called the adjoint - but that gets confusing with other adjoints).

Although the above definition goes via log(A+O'XO), the result does not require its existence. By looking at the definition of determinant, you can find it's derivative directly in terms of the adjugate. http://en.wikipedia.org/wiki/Determinant#Derivative
Note the transposition of the indices in their formula - which justifies my definition of matrix partial derivative.

Also see
http://en.wikipedia.org/wiki/Matrix_calculus#Derivative_of_matrix_determinant
 
Last edited by a moderator:
Thanks man
I also tried to use the general formula as:

d(det(Y))/dX=dtr(QY)/dX where Q is the det(Y)inv(Y) and apparently Y is a function of X.
I got the same derivation as you did.
Thanks a lot for your great help.
 
thanks dude appreciate it.
Can you guide me that how you got, in your derivation, from
= exp(tr(log(A+O'XO))) dXtr(log(A+O'XO)
= det(A+O'XO) tr(dXlog(A+O'XO))
= det(A+O'XO) tr((A+O'XO)-1dX(A+O'XO))
= det(A+O'XO) O(A+O'XO)-1O'
My question addresses how did you bring in the partial derivative dX to the trace function and then how you got rid of the trace in the last equation.
 

Similar threads

  • · Replies 20 ·
Replies
20
Views
4K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 14 ·
Replies
14
Views
5K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 20 ·
Replies
20
Views
3K
  • · Replies 5 ·
Replies
5
Views
4K
Replies
15
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
13
Views
4K
  • · Replies 15 ·
Replies
15
Views
5K