Random variables: Total probability, Transformations & CDFs

Click For Summary

Discussion Overview

The discussion revolves around the evaluation of the cumulative distribution function (CDF) for a transformed random variable Y derived from a uniformly distributed random variable X. Participants explore the application of the law of total probability in this context and the challenges encountered in obtaining consistent results through different approaches.

Discussion Character

  • Exploratory
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • Daniel presents a transformation of a uniformly distributed random variable X and seeks to evaluate the CDF of Y using the law of total probability.
  • Some participants question the validity of equating certain probabilities, specifically whether P(Y ≤ t | -1 ≤ X ≤ 0) can be expressed as P(X + 1 ≤ t).
  • There is a discussion about the implications of conditional probabilities and how they relate to the events defined by the transformation of Y.
  • Daniel acknowledges a mistake in his reasoning regarding the application of the law of total probability and the intersection of events.
  • Another participant emphasizes the need to express the event Y ≤ t in terms of X and to consider the partitioning of the domain of X when applying the law of total probability.
  • Daniel expresses a desire to find a more analytical method for computing probabilities involving transformations, especially when they are complex.
  • A later reply encourages Daniel not to abandon his goal of finding analytical methods for transformations, suggesting that further exploration could benefit the mathematical community.

Areas of Agreement / Disagreement

Participants do not reach a consensus on the correct application of the law of total probability in this context, and multiple viewpoints regarding the transformation and its implications remain present throughout the discussion.

Contextual Notes

Participants highlight the importance of correctly defining events and their intersections when applying probability laws. There are unresolved questions about the proper handling of the transformation and the resulting probabilities.

danielakkerma
Messages
230
Reaction score
0
Hello All!
A recent problem has stuck with me, and I was hoping you could help me resolve it.
Consider the following premise: Let us assume that X \sim \mathcal{U}(-3,3)
(U is the continuous, uniform distribution).
And let the transformation Y be applied thus:
<br /> Y = \left\{<br /> \begin{align*}<br /> X+1, &amp; &amp; -1 \leq X \leq 0 \\<br /> 1-X, &amp; &amp; 0\leq X \leq 1 \\<br /> 0~~~, &amp; &amp; \rm{otherwise}<br /> \end{align*}<br /> \right.<br />
Then one desires to evaluate F_Y(t), Where F(t) is the cumulative dist. func. for Y.
Obviously, the simplest approach would be to find the expression using elementary means -- for example, by plotting the new domain of Y as a function of X.
However, I attempted to obtain the same result by considering the problem from more general principles, particularly, the law of total probability.
I considered the following statement:
<br /> F_Y(t) = \mathbf{P}(Y \leq t)<br />
By LTP:
<br /> \mathbf{P}(Y \leq t) = \sum_{i} P(Y \leq t ~ \mathbf{|} X \in A_i) \cdot P(X \in A_i)<br />
Where {A} is the set of all the regions on which Y is defined, as a function of X. For example, A_1 = [-1,0],A_2 =[0,1], and so forth.
Thus, I would get:
<br /> \mathbf{P}(Y \leq t) = P(Y \leq t ~ \mathbf{|} -1 \leq X \leq 0) \cdot P(-1 \leq X \leq 0) + P(Y \leq t ~ \mathbf{|} 0 \leq X \leq 1) \cdot P(0 \leq X \leq 1) + \\ + P(Y \leq t ~ \mathbf{|} 1 \leq X \leq 3 \cup -3 \leq X \leq -1) \cdot P(1 \leq X \leq 3 \cup -3 \leq X \leq -1)<br />
Then I observe that: P(Y \leq t ~ \mathbf{|} -1 \leq X \leq 0) is merely P(Y \leq t \cap Y = X+1) = P(X+1 \leq t) = F_X(t-1).
This I then apply to all the conditional probabilities(i.e., separating the values of Y according to the constituent Xs(as shown)) and using the CDF for X.
However, I obtain a completely different(and erroneous!) result here, compared with the direct approach(i.e., graphic, and others).
What went wrong?
Is my approach at all correct(or possible/permissible)?
Thank you very much for your attention,
Daniel
 
Last edited:
Physics news on Phys.org
danielakkerma said:
Then I observe that: P(Y \leq t ~ \mathbf{|} -1 \leq X \leq 0) is merely P(Y \leq t \cap Y = X+1) = P(X+1 \leq t) = F_X(t-1).

Does this amount to claiming that P(A|B) = P(A and B) ?
 
Not exactly...

Thanks for your reply.
I meant to say that that event corresponded to:
<br /> P(Y \leq t ~ | ~ -1 \leq X \leq 0) = P(X+1 \leq t)<br />
Since logically, the two are -- or at least should be -- equivalent(one is only possible with the other in tandem).
Is this reasoning invalid?
What is, then, the probability of that conditional statement?
Thanks again,
Daniel
 
Suppose t = -1/2.

P(X+1 \le -1/2) = P(X \le -3/2) = \frac{ (-3/2) -(-3)}{6} = 3/12 = 1/4

P(Y \le -1/2 | -1 \le X \le 0) = 0
 
You are, of course, correct!

You're obviously right. I can't believe I didn't detect such a boneheaded mistake, sooner; thank you!
I see I should have written that equality, using the LTP, in this manner:
<br /> P(Y \leq t) = \sum_i P(Y \leq t \cap A_i)
Where again A_i form the domains of Y.
I can therefore get, for one of the subtended regions:
<br /> P(Y \leq t \cap -1\leq X\leq 0) = P(X \leq t-1 \cap -1 \leq X \leq 0)=<br /> \left\{<br /> \begin{align*}<br /> F_X(t-1)-F_X(-1), &amp; &amp; 0 \leq t \leq 1 \\<br /> 1~~~~, &amp;&amp; t&gt;1 \\<br /> 0, &amp;&amp; else<br /> \end{align*}<br /> \right.<br />
But, for the other term:
<br /> P(Y \leq t \cap 0\leq X\leq 1) = P(X \geq 1-t \cap 0 \leq X \leq 1)=<br /> \left\{<br /> \begin{align*}<br /> F_X(1)-F_X(1-t), &amp; &amp; 0 \leq t \leq 1 \\<br /> 1~~~~, &amp;&amp; t&gt;1 \\<br /> 0, &amp;&amp; else<br /> \end{align*}<br /> \right.<br />
Here, it is already evident that when summing these two results(as per the LTP), one would obtain that \lim_{t \to \infty} F_Y(t) = 2 and not 1, which is a fundamental property of the CDF, lost here.
How do I correct this discrepancy?
Thanks,
Daniel
 
danielakkerma said:
<br /> P(Y \leq t \cap -1\leq X\leq 0) = P(X \leq t-1 \cap -1 \leq X \leq 0)=<br />

But Y \leq t is not the same event as X \leq t -1. So you can't equate the events Y \leq t \ \cap -1 \leq X \leq 0 and X \leq t-1 \ \cap -1 \leq X \leq 0.

Let Y be a function of the random variable X. Let the sets A_i partition the domain of X.

Then P(Y \leq t) = \sum_{i=1}^n P(Y \le t \ \cap X \in A_i)
= \sum_{i=1}^n P(Y \leq t| X \in A_i) P(X \in A_i).

To compute P(Y \leq t | X \in A_i) you can express the statement that defines Y \leq t as an equivalent statement about X. Then compute P(Y \leq t) as P(Y \leq t \ \cap X \in A_i)/ P(X \in A_i).

For example, in your problem:

\{Y \leq 1/2\} = S = \{-3 \leq X \leq -1\} \cup \{ -1 \leq X \leq -1/2 \} \cup \{ 1/2 \leq X \leq 3 \}

\{Y \leq 1/2\} \cap \{-1 \leq X \leq 0 \} = S \cap \{-1 \leq X \leq 0\} = \{-1 \leq X \leq -1/2\}


The way such examples are usually solved is to express the event Y \le t as an equivalent statement about X being in one of a union of mutually exclusive sets B_i. The sets B_i depend on t. Then the law of total probability is used to compute P(X \in (B_1 \cup B_2\cup...\cup B_n)).

Thinking of Yas a function that maps a set s in the domain of X to a set Y(s) in the domain of Y the way to find P(Y \leq t) is to find the probability of the set Y^{-1}( \{Y \leq t\}) using the distribution of X
 
Now, it's finally clear!

Stephen,
Thanks again for your patient and diligent aid here! it's finally dawned on me(and I'm sorry it has taken so long).
I now see that I should have accounted for the various values Y≤t could take, irrespective of X; and obviously, as you point out, the intersection between Ys and Xs would not -- necessarily -- result in limiting Y itself to any particular domain(as a function of X).
Thinking of Y as a function that maps a set s in the domain of X to a set Y(s) in the domain of Y the way to find P(Y≤t) is to find the probability of the set Y-1({Y≤t}) using the distribution of X
This is what I have been doing hitherto, but I was hoping I could find a more analytical method to compute these sets, especially, when the transformations are not quite so trivial(and may involve multi-valued inverse solutions).
Still, I quite see now where I was mistaken.
Thanks again for all your help!
Daniel
 
danielakkerma said:
but I was hoping I could find a more analytical method to compute these sets, especially, when the transformations are not quite so trivial(and may involve multi-valued inverse solutions).

Don't completely give up on that goal. I don't know what progress can be made, but if you find something, it would be a great service to mathematical humanity. Maybe the cure for the problems of transformations is yet more transformations.
 

Similar threads

  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 0 ·
Replies
0
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K