How to maximize when two variables depend on each others

  • Context: Graduate 
  • Thread starter Thread starter Kolmin
  • Start date Start date
  • Tags Tags
    Variables
Click For Summary

Discussion Overview

The discussion revolves around maximizing a function of multiple variables, specifically in the context where one variable is dependent on another. Participants explore the formulation of gradients and Hessians for functions of the form F(y, x1, x2, ..., xn) with y defined as a function of x1. The scope includes theoretical considerations and mathematical reasoning related to optimization without constraints.

Discussion Character

  • Exploratory
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • One participant questions how to organize the gradient and Hessian in the context of a function F(y, x1, x2) where y=f(x1), expressing uncertainty about the correct formulation.
  • Another participant suggests that F is primarily a function of x1,...,xn and that derivatives should be calculated with respect to these variables, mentioning the need for the chain rule.
  • A later reply introduces the idea of treating F as a utility function and discusses the implications of y being an endogenous function, raising questions about the appropriate gradient formulation.
  • One participant proposes rewriting F as F(f(x), x, z) to clarify the dependency and discusses how to compute the derivative with respect to x, emphasizing the use of partial derivatives and the chain rule.
  • Another participant expresses confusion about their understanding of the problem and seeks clarification on maximizing a specific function, indicating that the relationship between variables is complex and cannot be easily decomposed.

Areas of Agreement / Disagreement

Participants do not reach a consensus on the correct formulation of the gradient and Hessian, and multiple competing views on how to approach the problem remain. There is ongoing uncertainty regarding the dependencies and the implications for optimization.

Contextual Notes

Participants note that the notation and formulation of the problem may be unclear, and there are unresolved questions about how to handle the dependencies between variables, particularly in the context of maximizing the function.

Kolmin
Messages
66
Reaction score
0
Hi all.

I was thinking on how we can maximize a function with the following form:

F(y, x1, x2,..., xn) with y=f(x1)

I know that I should use the total derivative to find out the effective rate of y, but I don't see how gradient and Hessian should be organized in this context.

For example, if we have F(y, x1, x2) with y=f(x1), how should we set gradient and Hessian?
Should the gradient be [itex]\nabla[/itex]F= (F1, F2)?
Or [itex]\nabla[/itex]F= (Fy, F1, F2)
What about the Hessian?

Sorry for the question (maybe not exactly challenging...:redface:), but this problem is not explicitly mentioned in any book.

Thanks. :smile:
 
Physics news on Phys.org
Just to put me on the right path, is the question completely meaningless or there is some sort of amazing mistake in it that I cannot see that makes the answer trivial?
 
F is really only a function of x1,...,xn so you would only calculate derivatives with respect to these variables

To calculate the gradients and hessians you may have to use the chain rule and knowledge of [itex]\frac{\partial F}{\partial y}[/itex] depending on the problem.

It would probably be clearer if you tried to choose specific functions F and f which you think are confusing and we can work through it
 
Office_Shredder said:
F is really only a function of x1,...,xn so you would only calculate derivatives with respect to these variables

To calculate the gradients and hessians you may have to use the chain rule and knowledge of [itex]\frac{\partial F}{\partial y}[/itex] depending on the problem.

It would probably be clearer if you tried to choose specific functions F and f which you think are confusing and we can work through it

First of all, thanks. :smile:

Let's put that F(y,x,z) is a Utility function and y=f(x) is an endogenous probability functions that depends on whatever you want (the important thing is that it is endogenous).
The decision maker has to choose the variables in order to maximize is problem (without constraints).
This is not the typical Von Neumann-Morgenstern Utility function, so how do we maximize it?

Do we set gradient [itex]\nabla[/itex]F= ( [itex]\frac{\partial F}{\partial y}[/itex], [itex]\frac{\partial F}{\partial x}[/itex], [itex]\frac{\partial F}{\partial z}[/itex] ) or do we set it [itex]\nabla[/itex]F= ([itex]\frac{\partial F}{\partial x}[/itex], [itex]\frac{\partial F}{\partial z}[/itex] ), which honestly doesn't make much sense to me?

Hessian is the next step, so gradient is the main problem I guess.
 
If we re-write this as F(f(x),x,z) then it's clear that F is a function of two variables. To calculate the gradient we need to know what [itex]\frac{\partial F}{\partial x}[/itex] is (hopefully it's clear what dF/dz is). Unfortunately the notation is a bit unclear so I am going to re-define things. We will assume z is a constant and what we want to calculate is
[tex]\frac{dF(y(x),x,z)}{dx}[/tex]
where this is the derivative of F as a function of x (which is what you normally think of as the partial derivative. The reason for this is that I will use [itex]\frac{\partial F(y,x,z)}{\partial x}[/itex] to denote the partial derivative of F with respect to the second variable, treating y as a constant value independent of x. Then the chain rule for multivariable calculus says that
[tex]\frac{dF}{dx} = \frac{\partial F}{\partial y} \frac{ dy}{dx} + \frac{\partial F}{\partial x}[/tex]

So if you just have a function F(y,x,z) whose partial derivatives (as a function of 3 variables) are known, you can calculate the derivative of F with respect to x even when y is a function of x.
 
Great!
Thanks a lot and sorry for the (now I see why...) trivial question. :smile:
 
Another trivial question on this topic after that I started to play around a bit with it (assuming that hopefully I got the point after the last answer…).

Following the notation introduced by the fourth post I guess the gradient should look like

[itex]\nabla[/itex]F = ( [itex]\frac{dF}{dx}[/itex],[itex]\frac{\partial F}{\partial z}[/itex])

right? Then set it equal zero and it should work. But still I find some problems in my understanding of the situation.

What I realize lately is that the way in which I previously stated the question is really bad and doesn't emphasize one thing that is probably the most important to me. The fact is that in my problem not only s influences p, but at the same time s stands in itself in the hypothetical formula (with which I mean that you cannot simply decompose y(x) and that's it). Hopefully an example will make my terms of my problem more clear.

[itex]F(y(x),x,z)=(1-y)(x+z)+y(x-z)[/itex]

[This is what I mean with my peculiar - and probably wrong - jargon "not only s influences p, but at the same time s stands in itself"]

How do we maximize something like this?
Still with the gradient I wrote few lines above?

And then, if the gradient is that one, how the heck does the Hessian look like?

Many thanks to the one who will help this soul in the reign of shadows. :smile:
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 10 ·
Replies
10
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 24 ·
Replies
24
Views
7K
  • · Replies 3 ·
Replies
3
Views
4K