The Gradient and the Hessian of a Function of Two Vectors

Click For Summary
To find the gradient and Hessian of a function f(x, y) of two n-dimensional vectors, treat it as a function of 2n variables. The gradient will be a 1-by-2n vector, while the Hessian will be a 2n-by-2n matrix. The first derivative acts as a linear map from R^n x R^n to R, and the second derivative is a symmetric bilinear map. When decomposing the source space into two n vectors, you can derive two partial gradients and a 2-by-2 Hessian matrix composed of four n-by-n matrices of second derivatives. This approach allows for a structured understanding of the function's behavior in multidimensional space.
EngWiPy
Messages
1,361
Reaction score
61
Hi,

Suppose we have a function of two n-dimensional vectors f(\mathbf{x},\mathbf{y}). How can we find the gradient and Hessian of this function?

Regards
 
Mathematics news on Phys.org
i would just treat it as a function of 2n variables and take the vector of 1st, then the matrix of 2nd derivatives.
 
mathwonk said:
i would just treat it as a function of 2n variables and take the vector of 1st, then the matrix of 2nd derivatives.

So, the gradient will be 1-by-2n vector, and the Hessian will be 2n-by-2n matrix?
 
yep. basically the gradient is a linear map approximating the original map. so you could also view it as a linear map from R^n x R^n -->R, and the Hessian I suppose as a bilinear such map.

I.e. if R^n = V, the 1st derivative is a linear map VxV-->R, and the second derivative is a symmetric bilinear map (VxV)x(VxV)-->R.

So if you really want to break up your source space into a pair of n vectors, then
you get two "partial" gradients, each a 1byn matrix, and you get a 2 by 2 Hessian matrix, where each block is an nbyn matrix, i.e. 4 nbyn matrices of vector second derivatives
 
mathwonk said:
yep. basically the gradient is a linear map approximating the original map. so you could also view it as a linear map from R^n x R^n -->R, and the Hessian I suppose as a bilinear such map.

I.e. if R^n = V, the 1st derivative is a linear map VxV-->R, and the second derivative is a symmetric bilinear map (VxV)x(VxV)-->R.

So if you really want to break up your source space into a pair of n vectors, then
you get two "partial" gradients, each a 1byn matrix, and you get a 2 by 2 Hessian matrix, where each block is an nbyn matrix, i.e. 4 nbyn matrices of vector second derivatives

Ok, I see. Thank you.

Regards
 
Here is a little puzzle from the book 100 Geometric Games by Pierre Berloquin. The side of a small square is one meter long and the side of a larger square one and a half meters long. One vertex of the large square is at the center of the small square. The side of the large square cuts two sides of the small square into one- third parts and two-thirds parts. What is the area where the squares overlap?

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 24 ·
Replies
24
Views
4K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
2
Views
3K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 3 ·
Replies
3
Views
1K
  • · Replies 17 ·
Replies
17
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K