View Full Version : Lagrange multipliers with vectors and matrices
IniquiTrance
Sep22-11, 02:04 PM
My textbook is using Lagrange multipliers in a way I'm not familiar with.
F(w,λ)=wCwT-λ(wuT-1)
Why is the first order necessary condition?:
2wC-λu=0
Is it because:
\nablaF=2wC-λu
Why does \nablaF equal this?
Many thanks!
Edit: C is a covariance matrix
Simon_Tyler
Sep24-11, 08:04 PM
The easiest way to think about these things is to use index notation (http://en.wikipedia.org/wiki/Einstein_notation).
F = wmCmnwn-λ(wmum-1)
where repeated indices are summed over their range.
Then using ∂wiwj = δij, where δij is the Kronecker delta (http://en.wikipedia.org/wiki/Kronecker_delta)
We can calculate ∇F:
(∇F)i = ∂wiF = δimCmnwn+wmCmnδin-λ(δimum-0) = Cinwn+wmCmi-λui = 2wmCmi-λui = (2wC - λu)i
where we have used the fact that C and δ are symmetric matrices.
This calculation can also be done symbolically, but I find it easier (and often make less errors with transposes etc) using index notation. It also generalises to more complicated situations where you have covariant and contravariant indices, and different classes of indices (such as holomorphic and antiholomorphic in complex cases), combined (http://en.wikipedia.org/wiki/DeWitt_notation) and more abstract indices (http://en.wikipedia.org/wiki/Abstract_index_notation).
vBulletin® v3.8.7, Copyright ©2000-2012, vBulletin Solutions, Inc.