Why is d(x^TCx) equal to x^T(C+C^T)dx?

  • Context: Graduate 
  • Thread starter Thread starter hotvette
  • Start date Start date
  • Tags Tags
    Calculus Matrix
Click For Summary

Discussion Overview

The discussion centers on the mathematical expression for the differential of a quadratic form, specifically why d(x^TCx) equals x^T(C+C^T)dx. Participants explore the application of the product rule in matrix calculus and seek clarification on the underlying principles and assumptions involved in this derivation.

Discussion Character

  • Technical explanation
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • Some participants reference a specific online resource to illustrate the equality d(x^TCx) = x^T(C+C^T)dx, questioning the application of the product rule in this context.
  • One participant attempts to apply the product rule but expresses confusion regarding the terms involved, particularly in relation to the differentiation of x^T.
  • Another participant notes that the expression d(x^TCx) can be rewritten as d(x^TC^Tx) and provides a detailed expansion, emphasizing the scalar nature of the resulting expressions.
  • There is a discussion about the implications of C being symmetric, leading to the conclusion that if C is symmetric, the expression simplifies to 2x^TCdx.
  • One participant acknowledges a key insight regarding the scalar evaluation of the expressions, attributing this understanding to contributions from others in the thread.

Areas of Agreement / Disagreement

Participants express varying levels of understanding regarding the derivation, with some agreeing on the scalar nature of the expressions while others remain uncertain about specific steps in the differentiation process. No consensus is reached on the clarity of the product rule application.

Contextual Notes

Some assumptions about the properties of matrices, such as symmetry, are discussed but not universally accepted or clarified. The discussion also reflects uncertainty about the implications of differentiating with respect to dx and the treatment of transposes in the context of matrix calculus.

hotvette
Homework Helper
Messages
1,001
Reaction score
11
Referring to:

http://www.ee.ic.ac.uk/hp/staff/dmb/matrix/calculus.html#deriv_quad

Is there an easy way to illustrate why the following is true?

[tex]d(x^TCx) = x^T(C+C^T)dx[/tex]

My attempt at using the product rule doesn't seem to work:

[tex]A = x^TC[/tex]

[tex]B = x[/tex]

[tex]d(AB) = (dA)B + A(dB) = d(x^TC)x + x^TC[/tex]
 
Last edited:
Physics news on Phys.org
hotvette said:
Referring to:

http://www.ee.ic.ac.uk/hp/staff/dmb/matrix/calculus.html#deriv_quad

Is there an easy way to illustrate why the following is true?

[tex]d(x^TCx) = x^T(C+C^T)dx[/tex]

My attempt at using the product rule doesn't seem to work:

[tex]A = x^TC[/tex]

[tex]B = x[/tex]

[tex]d(AB) = (dA)B + A(dB) = d(x^TC)x + x^TC[/tex]

From your link:

[tex]d(X^TCX): = (X^TCdX): + (d(X^T) CX): = (I ¤ X^TC) dX: + (X^TC^T ¤ I) dX^T:[/tex]

Anyway, it looks like they did exactly what you did, except that they didn't set dx to the identity matrix. You are assuming differentiation by dx. This might not be the case. Also notice that (d(X^T) CX) is scaler and thus invariant to the transpose operator. They multiplied this scaler by the identity matrix so adding the two matrices make sense. Anyway, nice link. It is a good reference.
 
Last edited:
I was just trying to understand why the 2nd item under the heading "Differential of Quadratic Products":

[tex]d(x^TCx) = x^T(C+C^T)dx = [C=C^T]2x^TCdx[/tex]

is true. Is there any way to illustrate?

Assuming the standard product rule is valid, it means:

[tex]d/dx(x^TCx) = x^TC + (d/dx(x^T))Cx = x^TC + x^TC^T=x^T(C+C^T)[/tex]

What I don't understand is why:

[tex](d/dx(x^T))Cx = x^TC^T[/tex]
 
[tex]d(x^TCx)=d(x^TC^Tx)[/tex]

[tex]x^TCdx + (dx)^TCx = x^TC^Tdx + (dx)^TC^Tx[/tex]

Since they are scalars

[tex]x^TCdx + \left((dx)^TCx\right)^T = x^TC^Tdx + \left((dx)^TC^Tx\right)^T[/tex]

[tex]x^TCdx + \left(x^TC^Tdx\right) = x^TC^Tdx + \left(x^TCdx\right)[/tex]

[tex]x^T\left(C+C^T\right)dx = x^T\left(C^T+C\right)dx[/tex]

If [tex]C[/tex] is symmetric then that equals [tex]2x^TCdx[/tex]
 
Thanks, I get it now. The key is the fact that the expressions evaluate to scalars. John Creighto mentioned that also. I didn't see that before. Thanks to both of you.
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 5 ·
Replies
5
Views
5K