A Lagrange multipliers on Banach spaces (in Dirac notation)

Rabindranath
Messages
10
Reaction score
1
I want to prove Cauchy–Schwarz' inequality, in Dirac notation, ##\left<\psi\middle|\psi\right> \left<\phi\middle|\phi\right> \geq \left|\left<\psi\middle|\phi\right>\right|^2##, using the Lagrange multiplier method, minimizing ##\left|\left<\psi\middle|\phi\right>\right|^2## subject to the constraint ##\left<\phi\middle|\phi\right> - c = 0##, where ##c## is a constant.

I'm completely new to Lagrange multipliers (although the idea is perfectly clear in simpler cases like e.g. ##f : \mathbf R^2 \to \mathbf R##), and the Fréchet derivative etc., and have tried to consult https://en.wikipedia.org/wiki/Lagrange_multipliers_on_Banach_spaces, but am still quite confused, conceptually.

This is my sketchy thinking thus far (trying to follow the Wikipedia exposition, adapted to my problem):

We have a Banach space ##B_\phi##. We then let ##f = \left|\left<\psi\middle|\phi\right>\right|^2 : B_\phi \to \mathbf C##, which we want to minimize. The constraint is given by ##g = \left<\phi\middle|\phi\right> - c : B_\phi \to \mathbf C##, which is set to zero. The Wikipedia article goes on to suppose that "##u_0##" (would "##\left|\phi_0\right>##" be a logical label in my case?) is a constrained extremum of ##f##, i.e. an extremum of ##f## on ##g^{-1}(0) = \big\{\left|\phi\right> \in B_\phi## ##|## ##g(\left|\phi\right>) = 0 \in \mathbf C \big\} \subseteq B_\phi##. The problem is then formulated as $$Df(u_0) = \lambda \circ Dg(u_0)$$ where ##\lambda## is the Lagrange multiplier, and ##D## the Fréchet derivative. Is it a complete misconception if I write this as (given ##f## and ##g## above, and my assumption that ##u_0 = \left|\phi_0\right>##) $$D \left|\left<\psi|\phi_0\right>\right|^2 = \lambda \circ D\big(\left<\phi_0|\phi_0\right> -c\big)$$?

My main questions at the moment are:

1. What are the conceptual errors above? (I guess there are plenty)
2. How do I evaluate the Fréchet derivative, e.g. ##D \left|\left<\psi|\phi_0\right>\right|^2##?

Thanks in advance!
 
Physics news on Phys.org
Rabindranath said:
2. How do I evaluate the Fréchet derivative, e.g. ##D \left|\left<\psi|\phi_0\right>\right|^2##?

As an example, suppose that you work in the complex Hilbert space ##H = \mathbb{C}## and let ##z_0 \in H## be fixed. If I read you correctly, you would be interested in differentiability of ##z \mapsto \left|\left<z |z_0 \right>\right|^2##. However, unless ##z_0 = 0## (which makes everything trivial), this map is not differentiable except at ##z = 0##. (You can check this using the Cauchy-Riemann equations.)

For the case of a real Hilbert space, it would be different. It is my impression that - since complex differentiability is such a strong requirement - Fréchet derivatives of operators and functionals are usually discussed in the context of real normed spaces, although I think that the basic definitions work fine in either case.
 
  • Like
Likes Rabindranath
Krylov said:
As an example, suppose that you work in the complex Hilbert space ##H = \mathbb{C}## and let ##z_0 \in H## be fixed. If I read you correctly, you would be interested in differentiability of ##z \mapsto \left|\left<z |z_0 \right>\right|^2##.

Rather ##z \mapsto \left|\left<z_0 |z \right>\right|^2## if I use your example. That is, going back to my example again, for some arbitrarily chosen fixed element ##\left|\psi\right>## in the Banach space ##B_\phi##, I'm interested in the map ## \left|\phi\right> \mapsto \left|\left<\psi\middle|\phi\right>\right|^2 ##. This is what I wanted to minimize, by means of the Lagrange multiplier method (with constraint ##g = \left<\phi\middle|\phi\right> - c = 0 ##). What I called ##\left|\phi_0\right>## was meant as "the element that minimizes ## \left|\phi\right> \mapsto \left|\left<\psi\middle|\phi\right>\right|^2 ## given the constraint".

Thanks for your reply anyway! I will look into it deeper when I have more time.
 
Thread 'How to define a vector field?'
Hello! In one book I saw that function ##V## of 3 variables ##V_x, V_y, V_z## (vector field in 3D) can be decomposed in a Taylor series without higher-order terms (partial derivative of second power and higher) at point ##(0,0,0)## such way: I think so: higher-order terms can be neglected because partial derivative of second power and higher are equal to 0. Is this true? And how to define vector field correctly for this case? (In the book I found nothing and my attempt was wrong...

Similar threads

  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 8 ·
Replies
8
Views
1K
  • · Replies 0 ·
Replies
0
Views
2K
Replies
15
Views
1K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K