Understanding Order Statistics and Probability of Random RV Selection

  • Thread starter Thread starter EngWiPy
  • Start date Start date
  • Tags Tags
    Statistics
AI Thread Summary
The discussion revolves around the probability of selecting a specific order statistic from a set of independent and identically distributed random variables (RVs). The participants clarify that arranging these RVs in ascending order is essential for determining probabilities related to their selection, particularly in applications like antenna performance optimization. The formula for the cumulative distribution function (CDF) of the r-th order statistic is derived, emphasizing the need to account for various combinations of RVs being less than or greater than a specific value. The explanation highlights the combinatorial nature of the problem, where different arrangements yield the same probability outcome. Understanding these concepts is crucial for applying order statistics in practical scenarios.
EngWiPy
Messages
1,361
Reaction score
61
Hello,

Suppose that we have N independent and identically distributed Random Variables (RVs) say X_1, X_2, \ldots, X_N. Suppose then that we arrange these RVs in magnitude and in an ascending order as X_{(1)}\leq X_{(2)}\leq \dots \leq X_{(N)}. Now, the question is: what is the probability that we select at random any of these RVs? For example, mathematically:

\mbox{Pr}\left(X=X_3\right)=??​

Thanks in advance
 
Physics news on Phys.org
How is X defined?
 
Enuma_Elish said:
How is X defined?

Suppose that the Probability Density Function (PDF) and the Cumulative Distribution Function (CDF) of these Random Variables (RVs) are f_X(x) and F_X(x), respectively.
Regards
 
I don't understand the problem. If I am going to select one variable out of N variables, why do I need to put them in "<" order first?
 
Enuma_Elish said:
I don't understand the problem. If I am going to select one variable out of N variables, why do I need to put them in "<" order first?

Dear,
In some problems, such as in antenna selection problem, we need to select the antenna that maximizes the performance. In order to compute this performance, we need to arrange the the Signal-to-Noise ratios (SNRs) in ascending order, and find its PDF and CDF. This is a special case, in other cases we need to select an antenna Whose SNR is above a certain threshold, but not necessarly the maximum.
Regards
 
Are you looking at an individual order stat. (for example, max{X1, ..., XN}, or X(N)), and asking what is the probability that X(N) = X3?
 
Enuma_Elish said:
Are you looking at an individual order stat. (for example, max{X1, ..., XN}, or X(N)), and asking what is the probability that X(N) = X3?

I am sorry, I might not state the problem correctly. Suppose we have N random variables that we arrange in magnitude and ascending order. Now suppose that, using random selection cretirion, we select the r^{th} order statistic, which is the r^{th} smallest RV from the arranged ones. Referring to Order Statistics, the CDF of this selected RV is given by:
\sum_{i=r}^N {N \choose i} \, F_X^i(x)\,[1-F_X(x)]^{N-i}​
I didn't understand the last equation. How did he obtain this?
 
That's a question I can answer. I'll use F(x) as a shorthand for FX(x) and write F(r)(x) for the CDF of the r'th order stat. X(r).

F(r)(x) = Prob{at least r of the X's are less than x}
= Prob{at least r of the X's are < x and the remaining X's are > x}
= Prob{at least r of the X's are < x} times Prob{remaining X's are > x}.

Let's say r = 3. One way that the event {at least r of the X's are < x and remaining X's are > x} could happen is, for example, {X1, X2, X3 < x} and {X4, ..., XN > x}. The corresponding probability is then F(x)3[1 - F(x)]N-3.

But it can also be {X1, X2, X4 < x} and {X3, X5, ..., XN > x}, or {X1, X3, X4 < x} and {X2, X5, ..., XN > x}, etc. Since all X's have the same F, the probability of each of these subcases is F(x)3[1 - F(x)]N-3. In all, there are (N,3) = N!/(3!(N-3)!) subcases such that exactly 3 of the X's are less than x, and the remaining N-3 of the X's are greater than x.

Now, still other ways that at least 3 of the X's can be less than x are: {X1, X2, X3, X4 < x} and {X5, X6, ..., XN > x}, or {X1, X2, X3, X4, X5 < x} and {X6, X7, ..., XN > x}, ..., or {all X's < x}.

In each of the above cases, you have (N,i) ways (subcases) to order the X's, where i is a number between r and N. The probability of each subcase is F(x)i[1 - F(x)]N-i, so the probability of each case is (N,i)F(x)i[1 - F(x)]N-i.

Summing over all possible cases (i > r), we arrive at the stated formula.
 
Last edited:
Enuma_Elish said:
That's a question I can answer. I'll use F(x) as a shorthand for FX(x) and write F(r)(x) for the CDF of the r'th order stat. X(r).

F(r)(x) = Prob{r of the X's are less than x}
= Prob{r of the X's are < x and the remaining X's are > x}
= Prob{r of the X's are < x} times Prob{remaining X's are > x}.

Let's say r = 3. One way that the event {r of the X's are < x and remaining X's are > x} could happen is, for example, {X1, X2, X3 < x} and {X4, ..., XN > x}. The corresponding probability is then F(x)3[1 - F(x)]N-3.

But it can also be {X1, X2, X4 < x} and {X3, X5, ..., XN > x}, or {X1, X3, X4 < x} and {X2, X5, ..., XN > x}, etc. Since all X's have the same F, the probability of each of these subcases is F(x)3[1 - F(x)]N-3. In all, there are (N,3) = N!/(3!(N-3)!) subcases such that exactly 3 of the X's are less than x, and the remaining N-3 of the X's are greater than x.

Now, still other ways that 3 of the X's can be less than x are: {X1, X2, X3, X4 < x} and {X5, X6, ..., XN > x}, or {X1, X2, X3, X4, X5 < x} and {X6, X7, ..., XN > x}, ..., or {all X's < x}.

In each of the above cases, you have (N,i) ways (subcases) to order the X's, where i is a number between r and N. The probability of each subcase is F(x)i[1 - F(x)]N-i, so the probability of each case is (N,i)F(x)i[1 - F(x)]N-i.

Summing over all possible cases (i > r), we arrive at the stated formula.

Yes, that is right.
Thanks
 
  • #10
Good! No problem.
 
Back
Top