Expected value on subset of Pareto

  • Context: Graduate 
  • Thread starter Thread starter xag
  • Start date Start date
  • Tags Tags
    Expected value Value
Click For Summary

Discussion Overview

The discussion revolves around calculating the expected value of a subset of samples from a Pareto distribution, specifically focusing on the conditional expectation E[X|X

Discussion Character

  • Exploratory
  • Mathematical reasoning
  • Technical explanation

Main Points Raised

  • Xavier seeks to find the expected value of samples from a Pareto distribution that are below a certain threshold 'a', expressing interest in the parameters of the distribution.
  • Some participants inquire about the definition of conditional expectation and the methods used to calculate it, suggesting a need for clarification on the approach taken.
  • Xavier proposes that the samples of interest may also follow a Pareto distribution and attempts to derive the expected value based on the shape and scale parameters.
  • A participant suggests integrating the probability density function to find the expected value, but later questions the correctness of this approach.
  • Pere provides an alternative method for calculating the conditional distribution function and suggests deriving the conditional density function to find the expected value.
  • Xavier acknowledges a mistake in a previous calculation and seeks confirmation on the integration results provided by Pere.
  • Xavier ultimately finds a formula for the expected value, but it remains unclear if this is universally accepted or verified by others.

Areas of Agreement / Disagreement

Participants express differing views on the methods for calculating the expected value, with some proposing integration of the probability density function and others suggesting alternative approaches. The discussion does not reach a consensus on the best method or the correctness of the derived formulas.

Contextual Notes

There are limitations regarding the assumptions made about the distribution of the samples and the parameters involved. The discussion includes unresolved mathematical steps and varying interpretations of the conditional expectation.

xag
Messages
6
Reaction score
0
Hi -
I have a Pareto distribution X (xm=1, k known)*. High-value samples are filtered out and I want the expected value of the remaining. Namely: E[X|X<a].

*Wikipedia notations

Many thanks in advance.
Xavier
 
Physics news on Phys.org
What is the definition of the conditional expectation you are referring to? How did you try to calculate it? Where did you get stuck?
 
Hi Pere -

What I'm looking for is a mean value.
I found out that the samples of interest belong to a larger set that roughly follow a Pareto distribution, and that among this larger set, the samples of interest are the ones whose values are below a certain number, 'a'.

I can estimate the parameters of the Pareto once for all, and calculate 'a' for each run. But I can't access the samples for each run, so if there was a way to predict it with 'k' (the Pareto shape) and 'a', that would be great.

My first guess is that the samples of interest also follow a Pareto, but I've not shown that. And in this case, could we express its shape K and scale Xm? Then the mean I'm looking for would be K Xm/(K-1).


Xavier
 
Ok. It seems I got all the way to the finish line and stopped just before :

I guess I just had to integrate the probability density function between 1 and a (it's defined between 1 and +oo).
It gives : (a ^(1 - k) - 1) * k / (1 - k)

Can anyone confirm?
 
xag said:
I guess I just had to integrate the probability density function between 1 and a.
Can anyone confirm?

What you did gives you the probability of a sample with value less than a.

Try the following instead: First calculate the conditional distribution function[itex]\mathbb{P}(X\leqx| X \leq a)[/itex].
By definition this is equal to

[tex] \frac{\mathbb{P}(X\leq x \wedge X \leq a)}{\mathbb{P}(X\leq a)}[/tex]

(You already calculated the denominator.)

You can also easily calculate the numerator. For x>a the numerator is [itex]\mathbb{P}(X \leq a)[/itex] which cancels with denominator giving one. For x>a you calculate it in the same way as you did for [itex]\mathbb{P}(X \leq a)[/itex] with a replaced by x. Then you take the derivative with respect to x to find the conditional density function [itex]\rho_a(x)[/itex] (suported on [1,a]), which you then use to calculate the conditional expectation as
[tex]\mathbb{E}\left[X|X\leq a\right]=\int_1^a{x\rho_a(x)dx}[/tex]
A more direct though maybe less obvious formula would be
[tex]\mathbb{E}\left[X|X\leq a\right]=\frac{\int_1^a{x\rho(x)dx}}{\int_1^a{\rho(x)dx}}[/tex],
where [itex]\rho(x)[/itex] is the density function of your original Pareto distribution.
I suggest you try them both and check if they really are the same:smile:
-Pere
 
Last edited:
xag said:
Ok. It seems I got all the way to the finish line and stopped just before :

I guess I just had to integrate the probability density function between 1 and a (it's defined between 1 and +oo).
It gives : (a ^(1 - k) - 1) * k / (1 - k)

Can anyone confirm?

Forget this post: the thing I integrated was x * dF(x) and moreover the result I gave here is wrong.
 
Pere Callahan said:
Try the following instead: First calculate the conditional distribution function[itex]\mathbb{P}(X\leqx| X \leq a)[/itex].
[...]
I suggest you try them both and check if they really are the same
-Pere

Thank you so much.

In both cases I find [tex]\mathbb{E}\left[X|X\leq a\right]= \frac{k}{k - 1} \frac{a ^ k - a}{a ^ k - 1}[/tex]
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
4K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 76 ·
3
Replies
76
Views
7K
  • · Replies 14 ·
Replies
14
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 9 ·
Replies
9
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 9 ·
Replies
9
Views
3K
Replies
4
Views
2K