Lagrangian Multiplier with Matrices

CuppoJava · Sep 17, 2009

Hi,
I'm trying to use calculus of variations to solve for the probability distribution with highest entropy for a given covariance matrix. I want to maximize this:

[tex]H[p(\vec{x})] = -\int p(\vec{x})*ln(p(\vec{x}))d\vec{x}[/tex]

with the following constraints:

[tex]\int p(\vec{x}) = 1[/tex]
[tex]\int \vec{x}p(\vec{x})d\vec{x} = \vec{u}[/tex]
[tex]\int (\vec{x}-\vec{u})(\vec{x}-\vec{u})^{T}p(\vec{x})d\vec{x} = \Sigma[/tex]

Using Lagrangian multipliers, the proposed maximization function is:

[tex]F[p(\vec{x})] = -\int p(\vec{x})*ln(p(\vec{x}))d\vec{x} + \lambda_{1}(\int p(\vec{x})d\vec{x}-1) + \vec{m}^{T}(\int \vec{x}p(\vec{x})d\vec{x} - \vec{u}) + Tr\{L(\int (\vec{x}-\vec{u})(\vec{x}-\vec{u})^{T}p(\vec{x})d\vec{x} - \Sigma)\}[/tex]

I understand that m is needed because there is D constraints imposed by the mean. And L is needed because there is DxD constraints imposed by the covariances. But what is the trace operator doing in there?

Thanks for the help
-Patrick

fresh_42 · Dec 3, 2019

The method by Lagrange multipliers involves an inner product for the constraints. The trace is such a product.

Lagrangian Multiplier with Matrices

Similar threads

Undergrad Finding the minimum distance between two curves

Undergrad Why ##a^0=1##?

High School Straightforward integration…

High School Arc Length for Hyperbolic Sin

Undergrad Ambiguity of the term "indefinite integral"

Insights Revisiting the Velocity-Time Function

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect