Lagrange multipliers with vectors and matrices

Click For Summary
SUMMARY

The discussion focuses on the application of Lagrange multipliers in vector and matrix contexts, specifically using the function F(w, λ) = wCwT - λ(wuT - 1). The first-order necessary condition is derived as 2wC - λu = 0, confirmed through the gradient ∇F = 2wC - λu. The covariance matrix C is emphasized as a critical component in the calculations, which utilize Einstein notation for clarity and efficiency. The discussion highlights the importance of index notation in simplifying complex calculations involving symmetric matrices.

PREREQUISITES
  • Understanding of Lagrange multipliers
  • Familiarity with covariance matrices
  • Knowledge of Einstein notation
  • Proficiency in vector calculus
NEXT STEPS
  • Study the derivation of Lagrange multipliers in multivariable calculus
  • Explore the properties and applications of covariance matrices
  • Learn more about Einstein notation and its applications in physics and mathematics
  • Investigate advanced topics in vector calculus, including covariant and contravariant indices
USEFUL FOR

Mathematicians, physicists, and engineers who are working with optimization problems involving vectors and matrices, particularly those interested in advanced calculus and linear algebra applications.

IniquiTrance
Messages
185
Reaction score
0
My textbook is using Lagrange multipliers in a way I'm not familiar with.

F(w,λ)=wCwT-λ(wuT-1)

Why is the first order necessary condition?:

2wC-λu=0

Is it because:
[itex]\nabla[/itex]F=2wC-λu

Why does [itex]\nabla[/itex]F equal this?

Many thanks!

Edit: C is a covariance matrix
 
Physics news on Phys.org
The easiest way to think about these things is to use http://en.wikipedia.org/wiki/Einstein_notation" .

F = wmCmnwn-λ(wmum-1)

where repeated indices are summed over their range.

Then using ∂wiwj = δij, where δij is the http://en.wikipedia.org/wiki/Kronecker_delta"

We can calculate ∇F:

(∇F)i = ∂wiF = δimCmnwn+wmCmnδin-λ(δimum-0) = Cinwn+wmCmi-λui = 2wmCmi-λui = (2wC - λu)i

where we have used the fact that C and δ are symmetric matrices.

This calculation can also be done symbolically, but I find it easier (and often make less errors with transposes etc) using index notation. It also generalises to more complicated situations where you have covariant and contravariant indices, and different classes of indices (such as holomorphic and antiholomorphic in complex cases), http://en.wikipedia.org/wiki/DeWitt_notation" .
 
Last edited by a moderator:

Similar threads

  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 25 ·
Replies
25
Views
3K
  • · Replies 13 ·
Replies
13
Views
2K
Replies
4
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 6 ·
Replies
6
Views
6K
  • · Replies 18 ·
Replies
18
Views
3K
  • · Replies 2 ·
Replies
2
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K