Convex Optimization: Dual Function Definition

Click For Summary

Homework Help Overview

The discussion revolves around the definition of the dual function in the context of a convex optimization problem. The original poster presents an optimization problem involving the minimization of a quadratic function subject to a quadratic inequality constraint. The focus is on understanding the dual function derived from the Lagrangian and the implications of its definition.

Discussion Character

  • Conceptual clarification, Assumption checking

Approaches and Questions Raised

  • Participants discuss the formulation of the Lagrangian and the subsequent definitions of the primal and dual problems. Questions arise regarding the conditions under which the dual function is defined and the implications of the constraints on the Lagrange multiplier.

Discussion Status

The discussion is ongoing, with participants seeking clarification on specific aspects of the dual function's definition. There is a recognition of the need to define the dual function over a broader range of values for the Lagrange multiplier, even when the optimization is constrained to non-negative values. Some participants express confusion regarding the presence of negative infinity in the dual function definition.

Contextual Notes

Participants note that the original problem involves constraints that may affect the interpretation of the dual function. There is also mention of a potential typo in the definition of the dual function, which has led to further questions about its implications.

Master1022
Messages
590
Reaction score
116
Homework Statement
Given the following problem: find the dual and the dual solution.
Relevant Equations
Lagrange dual
Hi,

I was working through the following problem and I am getting confused with the solution's definition of the dual.

Problem:
Given the optimization problem:
minimize ## x^2 + 1 ##
s.t. ## (x - 2) (x - 4) \leq 0 ##

Attempt:
I can define the Lagrangian as:
L(x, \lambda) = (x^2 + 1) + \lambda (x^2 - 6x + 8)
where ## \lambda \geq 0 ## (this is where the question arises in a later part of the question)

Now we can define the primal problem as: ## min_{x} \left( max_{\lambda} \left[ L(x, \lambda) \right] \right) ##
and thus the dual problem is: ## max_{\lambda} \left( min_{x} \left[ L(x, \lambda) \right] \right) ##

if we follow the algebra through, we see that the minimum of ## L(x, \lambda) ## occurs at ## x_{min} = \frac{3 \lambda}{\lambda + 1} ## and therefore the dual function ## g(\lambda) ## is defined as :
g(\lambda) = L(x_{min}, \lambda) = \frac{1 + 9\lambda - \lambda^2}{1 + \lambda}

The answer now specifies that the dual function as:
g(\lambda) = \begin{cases} \frac{1 + 9\lambda - \lambda^2}{1 + \lambda} & \text{if } \lambda > -1 \\ -\infty & \text{if } x \leq - \infty \end{cases}

Question: Why are we concerned with ## \lambda \leq -1 ## when we have restricted ## \lambda \geq 0 ##?
- Is this because we are dealing with ## g(\lambda) ##, which is inside the maximum function and thus the restriction isn't applied yet?

Thanks in advance.

[EDIT]: Also, I don't fully understand where the ## -\infty## comes from. We can substitute values ## \lambda < -1 ## and get finite numbers, so am slightly confused...
 
Physics news on Phys.org
Master1022 said:
Homework Statement:: Given the following problem: find the dual and the dual solution.
Relevant Equations:: Lagrange dual

Hi,

I was working through the following problem and I am getting confused with the solution's definition of the dual.

Problem:
Given the optimization problem:
minimize ## x^2 + 1 ##
s.t. ## (x - 2) (x - 4) \leq 0 ##

Attempt:
I can define the Lagrangian as:
L(x, \lambda) = (x^2 + 1) + \lambda (x^2 - 6x + 8)
where ## \lambda \geq 0 ## (this is where the question arises in a later part of the question)

Now we can define the primal problem as: ## min_{x} \left( max_{\lambda} \left[ L(x, \lambda) \right] \right) ##
and thus the dual problem is: ## max_{\lambda} \left( min_{x} \left[ L(x, \lambda) \right] \right) ##

if we follow the algebra through, we see that the minimum of ## L(x, \lambda) ## occurs at ## x_{min} = \frac{3 \lambda}{\lambda + 1} ## and therefore the dual function ## g(\lambda) ## is defined as :
g(\lambda) = L(x_{min}, \lambda) = \frac{1 + 9\lambda - \lambda^2}{1 + \lambda}

The answer now specifies that the dual function as:
g(\lambda) = \begin{cases} \frac{1 + 9\lambda - \lambda^2}{1 + \lambda} &amp; \text{if } \lambda &gt; -1 \\ -\infty &amp; \text{if } x \leq - \infty \end{cases}

Question: Why are we concerned with ## \lambda \leq -1 ## when we have restricted ## \lambda \geq 0 ##?
- Is this because we are dealing with ## g(\lambda) ##, which is inside the maximum function and thus the restriction isn't applied yet?

Thanks in advance.

[EDIT]: Also, I don't fully understand where the ## -\infty## comes from. We can substitute values ## \lambda < -1 ## and get finite numbers, so am slightly confused...
Quick sanity check. ##(x - 2)(x - 4) \le 0 \Leftrightarrow 2 \le x \le 4##, so the minimal value of ##x^2 + 1## is attained when x = 2, thus ##x^2 + 1 \ge 5## is the minimum value on the interval I listed.

g(\lambda) = \begin{cases} \frac{1 + 9\lambda - \lambda^2}{1 + \lambda} &amp; \text{if } \lambda &gt; -1 \\ -\infty &amp; \text{if } x \leq - \infty \end{cases}
Part of this makes no sense to me. How can ##x \le -\infty##? Was this a typo, but should say ##\lambda \le -1##?
 
Mark44 said:
Part of this makes no sense to me. How can ##x \le -\infty##? Was this a typo, but should say ##\lambda \le -1##?
Yes, apologies. This was a typo. It should read:

g(\lambda) = \begin{cases} \frac{1 + 9\lambda - \lambda^2}{1 + \lambda} &amp; \text{if } \lambda &gt; -1 \\ -\infty &amp; \text{if } \lambda \leq - 1 \end{cases}
 
  • Informative
Likes   Reactions: BvU
What about the other part of what I said, about the maximal value being 5? If my understanding is correct, any work you do should agree with that.
 
Thanks for the response @Mark44 !
Mark44 said:
What about the other part of what I said, about the maximal value being 5? If my understanding is correct, any work you do should agree with that.
Yeh that makes sense - I understood that part of the problem. We can differentiate that expression for ## g(\lambda) ## and then the optimal lambda (which is 2) leads us to dual problem optimal of 5. The strong duality makes sense because Slater's condition holds.

The parts that confused me were:
1. The definition of ## g(\lambda) ## : I was unsure why we needed to define it over all ## \lambda ## when we are optimizing over ## \lambda \geq 0 ##. The only reason I have is that the maximization (and hence restriction on ## \lambda ##) is applied after defining ## g (\lambda)## and therefore we need to define it over the whole range
2. Where did the ##- \infty## come from in the case of ## \lambda \leq -1 ##. I can understand for the case of -1, but not for numbers less than that.
 

Similar threads

Replies
6
Views
2K
  • · Replies 9 ·
Replies
9
Views
2K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
Replies
2
Views
2K
  • · Replies 47 ·
2
Replies
47
Views
4K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 7 ·
Replies
7
Views
1K
Replies
2
Views
2K