Vector/Matrix Differentiation: Where to Put the Transpose?

  • Thread starter Thread starter daviddoria
  • Start date Start date
  • Tags Tags
    Transpose
Click For Summary
When differentiating a function f(x) that is a vector, the placement of the transpose in the expression for the gradient can be ambiguous. The derivative of f(x)^2 results in 2*f(x)*del(f(x)), but the correct placement of the transpose depends on whether f is treated as a column or row vector. Geometric algebra provides a clearer framework, allowing for expressions like 2 f(x)^T del(f(x)) or 2 (del(f(x)))^T f(x), with the latter requiring restrictions on the gradient's operation. The discussion emphasizes the importance of understanding vector operations and suggests that using the dot product may simplify the differentiation process. Ultimately, clarity in the representation of vectors and gradients is crucial for accurate differentiation.
daviddoria
Messages
96
Reaction score
0
if you want to find the derivative (gradient) of f(x)^2 when f is a vector, you would get

2*f(x)*del(f(x))

I never know where to put the transpose! sometimes its clear because another term in the equation will be a scalar, so you know an inner product is needed, but if you don't have a hint like that, how do you know if you should put the transpose on the f(x) or the del(f(x))? I suppose it depends on if f is a column or row vector, but a lot of times this is not given in the statement of the problem.

Any thoughts on this? Does anyone have a good online tutorial on vector/matrix differentiation?

Thanks!

Dave
 
Physics news on Phys.org
In terms of geometric algebra (since f is a vector grad and the the vector don't commute) one has:

<br /> \nabla f(x)^2 = (\nabla f(x)) f(x) + f(x) \nabla(f(x)) = 2 f(x) \cdot \nabla f(x)<br />

(this works for a scalar f too since the dot product will just be scalar multiplication and you get 2 f(x) \nabla f(x).

Since you are talking about transposes, I'm assuming you've taken coordinate for your f and grad in some basis, in which case you can do it either way:

<br /> 2 f(x)^\text{T}\nabla f(x)<br />

or:
<br /> 2 ({\nabla{f(x)}})^\text{T} f(x)<br />

but in the second case you have to restrict the gradient to operating just on the first f(x).
 
ps. I don't know of an online tutorial. I'm learning about this now from two places:

Hestenes's New Foundations for Classical Mechanics.
Doran/Lasenby's Geometric Algebra for Physicists.
 
pps. If you choose to use coordinate vectors, and tranposition I think that basis also has to be orthonormal. Better to express using the dot product directly.
 
It occurred to me that what I initially wrote is wrong (has to be since it was a scalar result when it should be a vector).

Using ticks to mark what the grad is operating on when separated one can write:

<br /> \nabla \lvert f \rvert ^2 = \nabla f^2 = \nabla f f = \acute{\nabla}f\acute{f} + (\nabla f)f<br />

(again using the geometric product to multiply the two vectors).

Expanding this I get:

<br /> \nabla \lvert f \rvert ^2 = f (\nabla \cdot f) + (f \cdot \nabla) f - f \cdot (\nabla \wedge f) - (f \wedge \nabla) \cdot f<br />

Which in the three dimensional case can be written in terms of the "normal" vector products.

<br /> \nabla \lvert f \rvert ^2 = f (\nabla \cdot f) + (f \cdot \nabla) f + f \times (\nabla \times f) + (f \times \nabla) \times f<br />

Here we have a divergence, curl, directional derivative, and a ``normal'' directional derivative term (name?).

In my original post I allowed grad to commute with the vector, which means they are colinear (not generally true). Note that with the correction I don't really see how one would express this naturally with matrixes at all, so I've no idea now to answer your question of the where to put the transpose except for the colinear case.
 
I am studying the mathematical formalism behind non-commutative geometry approach to quantum gravity. I was reading about Hopf algebras and their Drinfeld twist with a specific example of the Moyal-Weyl twist defined as F=exp(-iλ/2θ^(μν)∂_μ⊗∂_ν) where λ is a constant parametar and θ antisymmetric constant tensor. {∂_μ} is the basis of the tangent vector space over the underlying spacetime Now, from my understanding the enveloping algebra which appears in the definition of the Hopf algebra...

Similar threads

  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 14 ·
Replies
14
Views
3K
  • · Replies 3 ·
Replies
3
Views
4K
  • · Replies 12 ·
Replies
12
Views
5K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 33 ·
2
Replies
33
Views
2K
  • · Replies 7 ·
Replies
7
Views
747
  • · Replies 25 ·
Replies
25
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K