Quick question on matrix calculus

  • Context: Graduate 
  • Thread starter Thread starter majon
  • Start date Start date
  • Tags Tags
    Calculus Matrix
Click For Summary

Discussion Overview

The discussion revolves around the differentiation of expressions involving matrices with complex entries, specifically focusing on the expression \(\frac{\partial}{\partial A} (A^\dagger B A)\). Participants explore the implications of using complex variables and the appropriate definitions of derivatives in this context.

Discussion Character

  • Technical explanation
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • One participant questions the validity of the expression \(\frac{\partial}{\partial A} (A^\dagger B A)\) given that \(A\) has complex entries.
  • Another participant clarifies that if \(A\) consists of complex variables, the differentiation makes sense, and they confirm the use of the conjugate transpose \(A^\dagger\) in their context.
  • A participant provides a formulation of derivatives for complex variables, stating that \(\frac{\partial z}{\partial z} = 1\) and other related derivatives are zero, suggesting that \(\frac{\partial}{\partial A} A^\dagger = 0\).
  • Further, they express a preference for using differential forms to handle the differentiation, leading to a proposed simplification of the expression to \(A^\dagger B\) under certain assumptions.
  • Another participant contrasts this with the case of real variables, providing a different formulation for the derivative of \(A^T B A\) and noting the differences in handling real versus complex variables.

Areas of Agreement / Disagreement

Participants express differing views on the interpretation of derivatives with respect to complex variables and the implications of using the conjugate transpose. There is no consensus on the correctness of the initial expression or the appropriate approach to differentiation.

Contextual Notes

Participants highlight the importance of definitions and the specific context in which the matrices are used, indicating that assumptions about the nature of the variables (complex vs. real) significantly affect the discussion.

majon
Messages
8
Reaction score
0
I have a quick question. Say we have the following matrices, A = \begin{pmatrix} a \\ b \end{pmatrix}
A^\dagger = \begin{pmatrix} a & b \end{pmatrix}
B = \begin{pmatrix} \alpha & \beta \\ \gamma & \delta \end{pmatrix} where the entries can be complex.

Now is the following expression correct?

\frac{\partial}{\partial A} (A^\dagger B A) = B A + B^\dagger A ?
 
Last edited:
Physics news on Phys.org
\partial/\partial A doesn't make sense if A's entries are complex numbers. If they are complex variables, it would make sense, though.

Did you really mean to use A^\dagger for the transpose of A, rather than the conjugate transpose of A?
 
Thanks for your reply Hurkyl. The entries are complex variables, I'll edit my question. And I meant to use the dagger (conjugate transpose) because this form of terms has a specific meaning where I'm using it.
 
The reason I asked about the dagger is because in your post, you wrote an equation stating A^\dagger is the transpose of A. I wanted to clarify if that's what you meant or if that was just a typo.With complex variables, we normally define the derivatives so that
\frac{\partial z}{\partial z} = 1<br /> \qquad \qquad<br /> \frac{\partial z}{\partial \bar{z}} = 0<br /> \qquad \qquad<br /> \frac{\partial \bar{z}}{\partial z} = 0<br /> \qquad \qquad<br /> \frac{\partial \bar{z}}{\partial \bar{z}} = 1

Following this, I imagine you want to have
\frac{\partial}{\partial A} A^\dagger = 0I find these things much easier to deal with as differential forms:

d(A^\dagger B A) = (d A^\dagger) B A + A^\dagger (dB) A + A^\dagger B (dA)

and so we would have

\frac{\partial}{\partial A} (A^\dagger B A) = 0 + 0 + A^\dagger B

As a sanity check, this answer is a row vector. Which is what we want, since the derivative of a scalar with respect to a column is a row, IIRC.
If we had real variables, then

d(A^T B A) = (d A^T) B A + A^T (dB) A + A^T B (dA) = A^T B^T (dA) + A^T (dB) A + A^T B (dA) = A^T (dB) A + A^T (B + B^T) dA

and so

\frac{\partial}{\partial A} (A^T B A) = A^T B + A^T B^T
 
Last edited:

Similar threads

  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 12 ·
Replies
12
Views
3K
  • · Replies 12 ·
Replies
12
Views
3K
Replies
2
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
Replies
9
Views
2K
  • · Replies 7 ·
Replies
7
Views
4K