On concave functions over spaces of probabilty distributions

Click For Summary

Discussion Overview

The discussion centers on the properties of concave functions in the context of probability distributions, specifically examining the relationship between a concave function H of a probability distribution p(x) and the conditional expectation H[X|Y]. Participants explore the conditions under which the difference H[X] - H[X|Y] remains concave with respect to p(x) while keeping p(y|x) fixed.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested

Main Points Raised

  • Some participants propose that to establish the concavity of H[X] - H[X|Y], one might need to analyze the conditions under which the product of functions remains concave.
  • Others argue that the suggestion to consider products of concave functions does not directly relate to the original problem, which focuses on H as a function of the probability distribution p(x).
  • A participant clarifies that while H[X] is a number, H[X|Y] is a function that must also be concave, raising questions about the implications of this relationship.
  • Another participant suggests that if H[X|Y] can be expressed as an integral involving p(x|y) and p(y), then under certain conditions, this integral could yield a concave function of x.
  • There is a discussion about the separability of joint distributions p(x,y) and q(x,y), with some participants indicating that this separability is crucial for understanding the conditions required for concavity.

Areas of Agreement / Disagreement

Participants express differing views on the relevance of considering products of concave functions, and there is no consensus on the best approach to address the original question regarding the conditions for concavity in H[X] - H[X|Y].

Contextual Notes

Participants note that the definitions and relationships between the functions involved may depend on specific assumptions about the distributions and their properties, which remain unresolved in the discussion.

bayesian
Messages
2
Reaction score
0
Given two (dependent) random variables [itex]X[/itex] and [itex]Y[/itex] with joint PDF [itex]p(x,y)[/itex] [itex]=p(x|y)p(y)[/itex] [itex]=p(y|x)p(x)[/itex], let [itex]H[X][/itex] be real-valued concave function of [itex]p(x)[/itex], and [itex]H[X|Y][/itex] the expectation of [itex]H[/itex] of [itex]p(x|y)[/itex] with respect to [itex]p(y)[/itex].

Examples of possible functions [itex]H[/itex] include the entropy of [itex]X[/itex], or its variance.

The concavity of [itex]H[/itex] implies that [itex]H[X]-H[X|Y]≥0[/itex] (through Jensen's inequality).

Question:

What additional conditions (if any) on [itex]H[/itex] are imposed if we in addtion require that [itex]H[X]-H[X|Y][/itex] should also be concave with respect to [itex]p(x)[/itex], if [itex]p(y|x)[/itex] remains fixed?
 
Physics news on Phys.org
Hey bayesian and welcome to the forums.

If you want to do a rigorous proof on this, then the way I would approach is to show what needs to hold for concavity of h when h = f*g. You can use the definition of a concave function for this, simplify and then collect terms.

We also know that p(x,y) and q(x,y) are separable since if p(y|x) is constant between two distributions, then if we have two different distributions corresponding to your two cases where the first corresponds to p(y) being concave, and the second corresponds to p(x) as well as p(y) being concave then this implies that p(x,y)/p(x) = q(x,y)/q(x) which implies q(x)/p(x) = q(x,y)/p(x,y) which means both p(x,y) and q(x,y) are separable.

So once you show what is needed for concavity to hold you can show what conditions p(x,y) and q(x,y) need and thus what the joint distributions need (or vice-versa) for concavity and hopefully answer your question.

This is only a suggestion, but I think those steps would be more rigorous, although it probably would be complicated.
 
chiro, I do not understand your suggestion, why should I consider products of functions that are concave? I cannot see how such products arise from the problem that I stated.

Just to make things clear: in my problem, H is a function of the probability distribution p(x) (e.g. the variance) and concave with respect to the space of probability distributions.
 
bayesian said:
chiro, I do not understand your suggestion, why should I consider products of functions that are concave? I cannot see how such products arise from the problem that I stated.

Just to make things clear: in my problem, H is a function of the probability distribution p(x) (e.g. the variance) and concave with respect to the space of probability distributions.

By a "function of the probability distribution" you mean that H(X) could be V(X) where X is a random variable? Because if so V(X) is not a function but a number. I don't think yo mean that, but I am not sure what you mean either.
 
bayesian said:
chiro, I do not understand your suggestion, why should I consider products of functions that are concave? I cannot see how such products arise from the problem that I stated.

Just to make things clear: in my problem, H is a function of the probability distribution p(x) (e.g. the variance) and concave with respect to the space of probability distributions.

I'm a little confused like viraltux as well, but if i understand correctly then H[X] will be a number, but H[X|Y] will be a function and this will have to concave. Is this right?

If this is concave and you end up getting H[X|Y] = Z(X) for some Y, then you want to show that this is concave as well. For H[X|Y], we get an integral in the form of p(x|y)p(y)ydy and the result of this will be concave (a function of x).

Now you can show that using your condition, the probability density function corresponding to keeping p(y|x) constant between changes will result in p(x,y) = b(x)c(y) [Separable].

From this you can take b(x) outside the integral (only with respect to y as in the dy) and then this implies that b(x) is concave, which implies H[X|Y] as a function is concave.
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 14 ·
Replies
14
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 12 ·
Replies
12
Views
5K
  • · Replies 2 ·
Replies
2
Views
2K