Understanding Order Statistics and Probability of Random RV Selection

  • Context: Graduate 
  • Thread starter Thread starter EngWiPy
  • Start date Start date
  • Tags Tags
    Statistics
Click For Summary

Discussion Overview

The discussion revolves around the probability of selecting a specific order statistic from a set of independent and identically distributed random variables (RVs). Participants explore the implications of arranging these RVs in ascending order and the mathematical formulation of the probability associated with selecting a particular RV.

Discussion Character

  • Exploratory
  • Technical explanation
  • Mathematical reasoning

Main Points Raised

  • One participant poses a question about the probability of selecting a specific RV from a set of N independent RVs arranged in ascending order.
  • Several participants seek clarification on the definition of the variable X and the necessity of ordering the RVs before selection.
  • Another participant provides a context where ordering is relevant, such as in antenna selection problems, where performance metrics depend on the arrangement of values.
  • Questions arise regarding the interpretation of the order statistics and the specific probability calculations related to selecting the r-th order statistic.
  • A detailed mathematical explanation is provided by a participant, outlining how to derive the cumulative distribution function (CDF) for the r-th order statistic using combinatorial reasoning.
  • Another participant confirms the correctness of the mathematical explanation provided, indicating a shared understanding of the derivation process.

Areas of Agreement / Disagreement

Participants express varying levels of understanding regarding the problem and its mathematical implications. While some agree on the mathematical formulation of the CDF for order statistics, others remain uncertain about the necessity of ordering the RVs and the overall context of the problem.

Contextual Notes

The discussion includes assumptions about the independence and identical distribution of the RVs, as well as the dependence on the definitions of the PDF and CDF. Some mathematical steps remain unresolved, particularly in the context of selecting RVs based on specific criteria.

EngWiPy
Messages
1,361
Reaction score
61
Hello,

Suppose that we have N independent and identically distributed Random Variables (RVs) say [tex]X_1, X_2, \ldots, X_N[/tex]. Suppose then that we arrange these RVs in magnitude and in an ascending order as [tex]X_{(1)}\leq X_{(2)}\leq \dots \leq X_{(N)}[/tex]. Now, the question is: what is the probability that we select at random any of these RVs? For example, mathematically:

[tex]\mbox{Pr}\left(X=X_3\right)=??[/tex]​

Thanks in advance
 
Physics news on Phys.org
How is X defined?
 
Enuma_Elish said:
How is X defined?

Suppose that the Probability Density Function (PDF) and the Cumulative Distribution Function (CDF) of these Random Variables (RVs) are [tex]f_X(x)[/tex] and [tex]F_X(x)[/tex], respectively.
Regards
 
I don't understand the problem. If I am going to select one variable out of N variables, why do I need to put them in "<" order first?
 
Enuma_Elish said:
I don't understand the problem. If I am going to select one variable out of N variables, why do I need to put them in "<" order first?

Dear,
In some problems, such as in antenna selection problem, we need to select the antenna that maximizes the performance. In order to compute this performance, we need to arrange the the Signal-to-Noise ratios (SNRs) in ascending order, and find its PDF and CDF. This is a special case, in other cases we need to select an antenna Whose SNR is above a certain threshold, but not necessarly the maximum.
Regards
 
Are you looking at an individual order stat. (for example, max{X1, ..., XN}, or X(N)), and asking what is the probability that X(N) = X3?
 
Enuma_Elish said:
Are you looking at an individual order stat. (for example, max{X1, ..., XN}, or X(N)), and asking what is the probability that X(N) = X3?

I am sorry, I might not state the problem correctly. Suppose we have [tex]N[/tex] random variables that we arrange in magnitude and ascending order. Now suppose that, using random selection cretirion, we select the [tex]r^{th}[/tex] order statistic, which is the [tex]r^{th}[/tex] smallest RV from the arranged ones. Referring to Order Statistics, the CDF of this selected RV is given by:
[tex]\sum_{i=r}^N {N \choose i} \, F_X^i(x)\,[1-F_X(x)]^{N-i}[/tex]​
I didn't understand the last equation. How did he obtain this?
 
That's a question I can answer. I'll use F(x) as a shorthand for FX(x) and write F(r)(x) for the CDF of the r'th order stat. X(r).

F(r)(x) = Prob{at least r of the X's are less than x}
= Prob{at least r of the X's are < x and the remaining X's are > x}
= Prob{at least r of the X's are < x} times Prob{remaining X's are > x}.

Let's say r = 3. One way that the event {at least r of the X's are < x and remaining X's are > x} could happen is, for example, {X1, X2, X3 < x} and {X4, ..., XN > x}. The corresponding probability is then F(x)3[1 - F(x)]N-3.

But it can also be {X1, X2, X4 < x} and {X3, X5, ..., XN > x}, or {X1, X3, X4 < x} and {X2, X5, ..., XN > x}, etc. Since all X's have the same F, the probability of each of these subcases is F(x)3[1 - F(x)]N-3. In all, there are (N,3) = N!/(3!(N-3)!) subcases such that exactly 3 of the X's are less than x, and the remaining N-3 of the X's are greater than x.

Now, still other ways that at least 3 of the X's can be less than x are: {X1, X2, X3, X4 < x} and {X5, X6, ..., XN > x}, or {X1, X2, X3, X4, X5 < x} and {X6, X7, ..., XN > x}, ..., or {all X's < x}.

In each of the above cases, you have (N,i) ways (subcases) to order the X's, where i is a number between r and N. The probability of each subcase is F(x)i[1 - F(x)]N-i, so the probability of each case is (N,i)F(x)i[1 - F(x)]N-i.

Summing over all possible cases (i > r), we arrive at the stated formula.
 
Last edited:
Enuma_Elish said:
That's a question I can answer. I'll use F(x) as a shorthand for FX(x) and write F(r)(x) for the CDF of the r'th order stat. X(r).

F(r)(x) = Prob{r of the X's are less than x}
= Prob{r of the X's are < x and the remaining X's are > x}
= Prob{r of the X's are < x} times Prob{remaining X's are > x}.

Let's say r = 3. One way that the event {r of the X's are < x and remaining X's are > x} could happen is, for example, {X1, X2, X3 < x} and {X4, ..., XN > x}. The corresponding probability is then F(x)3[1 - F(x)]N-3.

But it can also be {X1, X2, X4 < x} and {X3, X5, ..., XN > x}, or {X1, X3, X4 < x} and {X2, X5, ..., XN > x}, etc. Since all X's have the same F, the probability of each of these subcases is F(x)3[1 - F(x)]N-3. In all, there are (N,3) = N!/(3!(N-3)!) subcases such that exactly 3 of the X's are less than x, and the remaining N-3 of the X's are greater than x.

Now, still other ways that 3 of the X's can be less than x are: {X1, X2, X3, X4 < x} and {X5, X6, ..., XN > x}, or {X1, X2, X3, X4, X5 < x} and {X6, X7, ..., XN > x}, ..., or {all X's < x}.

In each of the above cases, you have (N,i) ways (subcases) to order the X's, where i is a number between r and N. The probability of each subcase is F(x)i[1 - F(x)]N-i, so the probability of each case is (N,i)F(x)i[1 - F(x)]N-i.

Summing over all possible cases (i > r), we arrive at the stated formula.

Yes, that is right.
Thanks
 
  • #10
Good! No problem.
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 35 ·
2
Replies
35
Views
5K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 8 ·
Replies
8
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 4 ·
Replies
4
Views
7K