Gradient on Riemannian Manifold

In summary, zhentil said that he is trying to compute a gradient of a function in a local coordinate system, but is having trouble understanding why any of the terms might not be real. He is also confused about what the directional derivative is and how to use it in this situation.
  • #1
Kreizhn
743
1
Hopefully this is a simple enough question.

Let (M,g) be a matrix Riemannian manifold and [itex] f: M \to \mathbb R[/itex] a smooth function. Take [itex] p \in M [/itex] and let [itex] \{ X_1,\ldots, X_n \} [/itex] be a local orthonormal frame for a neighbourhood of p. We can define a gradient of f in a neighbourhood of p as
[tex] \nabla f = \sum_{i=1}^n X_i f X_i [/tex]

Now in my situation, I'm actually trying to compute this is a local coordinate system. Let [itex] \{ x^{jk} \} [/itex] be a local coordinate system and write [itex] X_i = a_i^{jk} \frac{\partial}{\partial x^{jk}} [/itex]. Then our gradient, evaluated at (for simplicity) p becomes
[tex] \begin{align*}
(\nabla f)_p &= \sum_{i=1}^n \underbrace{\left( a_i^{jk} \left.\frac{\partial}{\partial x^{jk}}\right|_p \right)}_{X_i} f \underbrace{\left(a_i^{mn} \left.\frac{\partial}{\partial x^{jk}}\right|_p\right)}_{X_i} \\
&= \sum_{i=1}^n a_i^{jk} a_i^{mn} \frac{\partial f}{\partial x^{jk}}(p) \left.\frac{\partial}{\partial x^{jk}}\right|_p
\end{align*}
[/tex]

Now we know that [itex] (X_i f)(p) = X_i(p) f \in \mathbb R [/itex] if we use the derivation definition of tangent vectors. But this term simply corresponds to
[tex] a_i^{jk} \frac{\partial f}{\partial x^{jk}}(p) \in \mathbb R[/tex]

Now here is my problem. Let's take [itex] M = U(N) [/itex] the unitary group of dimension N, and [itex] f(p) = \text{tr}[y^\dagger p]\text{tr}[p^\dagger y ] [/itex]
for some [itex] y \in U(N)[/itex]. We know that [itex] f(p) \in \mathbb R [/itex] since [itex] \langle y,p \rangle = \text{tr}[y^\dagger p ] [/itex] is just the Hilbert-Schmidt inner-product, so f(p) is nothing more that [itex] f(p) = |\langle y,p \rangle|^2. [/itex]. But when I compute the terms
[tex] a_i^{jk} \frac{\partial f}{\partial x^{jk}}(p) [/tex]
I get values in [itex] \mathbb C [/itex]. In fact, I can't see any reason why this HAS to be in [itex] \mathbb R [/itex]. Did I just make a silly mistake?
 
Physics news on Phys.org
  • #2
I'm quite confused. How can any of the terms not be real? By definition of local coordinates, the X_i are R-linear combinations of the basis vectors.
 
  • #3
Hey zhentil,

I'm glad that you've somehow tracked down my other posts as well and responded to them!

It might be that there is a flaw in my understanding. So if I may, perhaps you could tell me if what I'm doing here is correct:

My issue first came when trying to figure out how elements of the tangent space acted as derivations if we assigned them a matrix representation. I figured that the matrix, like the vector, simply represents the coefficients of the tangent element in a prescribed set of coordinates, so that if I have a matrix [itex] V \in T_p M [/itex] for some [itex] p \in M[/itex] then if V has a representation
[tex] V = \begin{pmatrix} v^{11} & v^{12} & \cdots & v^{1n} \\ v^{21} & v^{22} & \cdots & v^{2n} \\
\vdots & \vdots & \ddots & \vdots \\ v^{n1} & v^{n2} & \cdots & v^{nn} \end{pmatrix} [/tex]
then what I'm really expressing is
[tex] V = v^{ij} \frac{\partial}{\partial x^{ij} } [/tex]
for some local coordinate system in a neighbourhood of p. Is this correct?

If it is correct, then the next issue is what happens when we look at matrices in [itex] U(N) [/itex], whose elements are naturally allowed to be in [itex] \mathbb C[/itex] despite the fact that it is a real-Lie group with real-Lie algeba [itex] \mathfrak{u}(N) [/itex]? Is it that each element [itex] v^{ij} = v_{\mathbb R}^{ij} + i v_{i\mathbb R}^{ij} [/itex] and this requires me to actually double the size of my coordinate system?
 
  • #4
If you're talking about trivializing the tangent space and computing a Jacobian, you need to use real tangent vectors, i.e. embed U(n) into GL(2n, R), etc.

But why not try computing it using the definition of the directional derivative? I imagine it would be much easier here. You could use complex matrices and avoid the headache.
 
  • #5
I'm not sure what you mean by "trivializing." Do you mean in the same sense as we assign a trivialization to a fibre-bundle in order to associate each fibre to a vector space?

I'm actually trying to use the "directional derivative" approach here. Namely, I'm using the definition of the tangent space that recognizes tangent elements as derivations. I'm just not certain how to make a matrix representation of a tangent element into a derivation.

I realize I made a mistake in the above post, and the V should be
[tex] V = v^{ij} \left. \frac{\partial}{\partial x^{ij}} \right|_p [/tex]
which we can see will assign a function to it's directional derivative in the direction of V, albeit in this case the [itex] v^{ij} [/itex] are complex.

Can I mix both suggestions and embed U(n) into GL(2n,R) and then use a coordinate basis to make the [itex] v^{ij} [/itex] real? What is the best way to do this. If memory serves, we normally use a symplectic representation of [itex] \mathbb C [/itex] right? Namely
[tex] \mathbb C = \left\{ \begin{pmatrix} a & b \\ -b & a \end{pmatrix} : a,b \in \mathbb R \right\} [/tex]
So that the decomposition [itex] a + ib [/itex] becomes [itex] a I + bJ [/itex] where I is the identity element and J is the symplectic element which endows our space with almost-complex structure. Yes?
 

1. What is a gradient in coordinates?

A gradient in coordinates refers to the change in a function with respect to its coordinates. It is a vector that points in the direction of the steepest increase of the function and has a magnitude equal to the rate of change at that point.

2. How is the gradient calculated?

The gradient is calculated as a vector of partial derivatives, where each component represents the rate of change in a specific coordinate direction. It is denoted by the symbol ∇ and is calculated using the formula ∇f(x,y) = (df/dx, df/dy).

3. What is the significance of the gradient in coordinates?

The gradient in coordinates is important in determining the direction of the steepest increase of a function at a specific point. It is also used in optimization problems to find the minimum or maximum value of a function.

4. How is the gradient used in vector calculus?

In vector calculus, the gradient is used to define the del operator, which is a vector operator that represents the gradient of a function. It is used in various equations and theorems, such as the gradient theorem and the vector Laplacian.

5. Can the gradient be negative?

Yes, the gradient can have a negative value. This indicates that the function is decreasing in that particular direction. The magnitude of the gradient determines the rate of decrease, with a larger magnitude indicating a steeper decrease.

Similar threads

Replies
4
Views
1K
  • Differential Geometry
Replies
2
Views
590
  • Differential Geometry
Replies
11
Views
3K
  • Differential Geometry
Replies
12
Views
3K
Replies
6
Views
357
Replies
4
Views
1K
Replies
5
Views
388
Replies
1
Views
1K
Replies
3
Views
1K
  • Differential Geometry
Replies
7
Views
2K
Back
Top