Probability & Statistics: Order Statistics

Click For Summary
SUMMARY

This discussion focuses on the nuances of order statistics in probability and statistics, specifically addressing the notation differences between x(1) and X(1), where x denotes observed values and X denotes random variables. The conversation clarifies that while specific observed values can be ordered, random variables cannot be directly ordered; instead, they define a distribution governing the order statistic. Additionally, the discussion highlights the distinction between treating random variables as independent and identically distributed (iid) versus their joint density functions, emphasizing the importance of understanding the underlying distributions.

PREREQUISITES
  • Understanding of random variables and their notation
  • Familiarity with order statistics and their definitions
  • Knowledge of probability distributions, particularly iid distributions
  • Basic concepts of joint density functions in probability
NEXT STEPS
  • Study the properties of order statistics in detail
  • Learn about the implications of independence in probability distributions
  • Explore joint density functions and their applications in statistics
  • Investigate the differences between observed values and random variables in statistical analysis
USEFUL FOR

Students and professionals in statistics, data analysts, and researchers who are working with probability distributions and order statistics will benefit from this discussion.

kingwinner
Messages
1,266
Reaction score
0

Homework Statement


Q1) About "order statistics", sometimes it's denoted x(1) and sometimes it's denoted X(1). What is the difference between the two?
Also, for X(1)=min{X1,X2,...,Xn}, it's a random variable. What does it mean to be the minimum of a bunch of random variables? If they are SPECIFIC observed values, then we can order them (e.g. if we have 6,3,8,7, then ordering them gives 3,6,7,8)...that I understand. But if they are random variables, HOW can we order them?



Q2) (more about order statistics)
http://www.geocities.com/asdfasdf23135/stat7.JPG
Here we have n random variables X1,X2...,Xn and we see FX(x) here. Why can we label it just based on one single varaible "x" instead of x1,x2,...,xn? Don't we have to treat them separately as x1,x2,...xn instead of just one "x"? Well, you may say it is because they're identically distributed, so we can just use a single "x" to represent each of x1,x2,...xn. But consider the following case:
Let X1,X2,...,Xn be iid random variables with density f(xi)=xi, 0<xi<sqrt2, then in this case the joint density must be f(x1,x2,...,xn)=x1x2...xn, and is definitely NOT (x1)n
So we've seen two different situations. In the first case, we can say x=x1=x2=...=xn, but not so in the second case. What is going on? Can someone please explain? I am always confused between these two cases. I am confused whenever they say X1,...Xn are iid with COMMON density fX(x). If this is the case, then the JOINT density [fX(x)]n would be a function of only a single variable "x" which doesn't make any sense to me (the joint density should be a function of n variables x1,x2,...,xn)


Homework Equations


Order Statistics

The Attempt at a Solution


As shown above.


Thank you for clearing my doubts! I appreciate your great help!
 
Physics news on Phys.org
x usually denotes an observed value; whereas X denotes a random variable (i.e., a distribution). You cannot order random variables but using them you can derive the distribution that governs the value of the order statistic variable. As for Q2, you are confusing particular values with a distribution. If it makes it easier, you can try thinking of them not as probability distributions but frequency distributions (e.g., body height in a given population of N individuals).
 
For Q1, maybe it would help you if you would recall that a random variable X is a function from a sample space S to the set of real numbers.

If s is an outcome in S, then X(s) is a real number.

To say [tex]X_{(1)}=\min\{X_1,X_2,\dots,X_n\}[/tex], this really means that for each outcome s in S, we have [tex]X_{(1)}(s)=\min\{X_1(s),X_2(s),\dots,X_n(s)\}[/tex].

For Q2, you are right that you can not simply say that the joint f is the product [tex][f_X(x)]^n[/tex]. There is a different reason for this in this example.

First, for emphasis and clarity, use the letter k instead of x, where k is a constant.

Now [tex]\{X_{(n)} \le k\}[/tex] is shorthand for [tex]\{ s\in S:X_{(n)}(s) \le k\}[/tex]. But [tex]X_{(n)}(s)=\max\{X_1(s),X_2(s),\dots,X_n(s)\}[/tex], so [tex]X_{(n)}(s) \le k[/tex] if and only [tex]X_i(s)\le k[/tex] for all i from 1 to n.

That is, [tex]\{ s\in S:X_{(n)}(s) \le k\}=\{s\in S:X_1(s)\le k\ \land\ X_2(s)\le k\ \land\ \dots\ \land\ X_n(s)\le k\}=\{s\in S:X_1(s)\le k\}\cap \dots\cap\{s\in S: X_n(s)\le k\}[/tex].

By independence, [tex]P(\{s\in S:X_1(s)\le k\}\cap \dots\cap\{s\in S: X_n(s)\le k\})=P(X_1\le k)P(X_2\le k)\dots P(X_n\le k)[/tex].
 

Similar threads

Replies
3
Views
2K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 7 ·
Replies
7
Views
19K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 16 ·
Replies
16
Views
2K
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
2
Views
1K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 10 ·
Replies
10
Views
1K