Probability & Statistics: Order Statistics

Click For Summary
The discussion focuses on the concept of order statistics, particularly the notation differences between x(1) and X(1), where x typically represents observed values and X denotes random variables. It clarifies that while individual random variables cannot be ordered, one can derive the distribution of the order statistic from them. The confusion arises when discussing joint densities of independent and identically distributed (iid) random variables, where it's important to distinguish between specific values and distributions. The conversation emphasizes that the joint density cannot simply be expressed as the product of individual densities without considering their independence. Understanding these distinctions is crucial for grasping the behavior of order statistics in probability and statistics.
kingwinner
Messages
1,266
Reaction score
0

Homework Statement


Q1) About "order statistics", sometimes it's denoted x(1) and sometimes it's denoted X(1). What is the difference between the two?
Also, for X(1)=min{X1,X2,...,Xn}, it's a random variable. What does it mean to be the minimum of a bunch of random variables? If they are SPECIFIC observed values, then we can order them (e.g. if we have 6,3,8,7, then ordering them gives 3,6,7,8)...that I understand. But if they are random variables, HOW can we order them?



Q2) (more about order statistics)
http://www.geocities.com/asdfasdf23135/stat7.JPG
Here we have n random variables X1,X2...,Xn and we see FX(x) here. Why can we label it just based on one single varaible "x" instead of x1,x2,...,xn? Don't we have to treat them separately as x1,x2,...xn instead of just one "x"? Well, you may say it is because they're identically distributed, so we can just use a single "x" to represent each of x1,x2,...xn. But consider the following case:
Let X1,X2,...,Xn be iid random variables with density f(xi)=xi, 0<xi<sqrt2, then in this case the joint density must be f(x1,x2,...,xn)=x1x2...xn, and is definitely NOT (x1)n
So we've seen two different situations. In the first case, we can say x=x1=x2=...=xn, but not so in the second case. What is going on? Can someone please explain? I am always confused between these two cases. I am confused whenever they say X1,...Xn are iid with COMMON density fX(x). If this is the case, then the JOINT density [fX(x)]n would be a function of only a single variable "x" which doesn't make any sense to me (the joint density should be a function of n variables x1,x2,...,xn)


Homework Equations


Order Statistics

The Attempt at a Solution


As shown above.


Thank you for clearing my doubts! I appreciate your great help!
 
Physics news on Phys.org
x usually denotes an observed value; whereas X denotes a random variable (i.e., a distribution). You cannot order random variables but using them you can derive the distribution that governs the value of the order statistic variable. As for Q2, you are confusing particular values with a distribution. If it makes it easier, you can try thinking of them not as probability distributions but frequency distributions (e.g., body height in a given population of N individuals).
 
For Q1, maybe it would help you if you would recall that a random variable X is a function from a sample space S to the set of real numbers.

If s is an outcome in S, then X(s) is a real number.

To say X_{(1)}=\min\{X_1,X_2,\dots,X_n\}, this really means that for each outcome s in S, we have X_{(1)}(s)=\min\{X_1(s),X_2(s),\dots,X_n(s)\}.

For Q2, you are right that you can not simply say that the joint f is the product [f_X(x)]^n. There is a different reason for this in this example.

First, for emphasis and clarity, use the letter k instead of x, where k is a constant.

Now \{X_{(n)} \le k\} is shorthand for \{ s\in S:X_{(n)}(s) \le k\}. But X_{(n)}(s)=\max\{X_1(s),X_2(s),\dots,X_n(s)\}, so X_{(n)}(s) \le k if and only X_i(s)\le k for all i from 1 to n.

That is, \{ s\in S:X_{(n)}(s) \le k\}=\{s\in S:X_1(s)\le k\ \land\ X_2(s)\le k\ \land\ \dots\ \land\ X_n(s)\le k\}=\{s\in S:X_1(s)\le k\}\cap \dots\cap\{s\in S: X_n(s)\le k\}.

By independence, P(\{s\in S:X_1(s)\le k\}\cap \dots\cap\{s\in S: X_n(s)\le k\})=P(X_1\le k)P(X_2\le k)\dots P(X_n\le k).
 
Question: A clock's minute hand has length 4 and its hour hand has length 3. What is the distance between the tips at the moment when it is increasing most rapidly?(Putnam Exam Question) Answer: Making assumption that both the hands moves at constant angular velocities, the answer is ## \sqrt{7} .## But don't you think this assumption is somewhat doubtful and wrong?

Similar threads

Replies
3
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 7 ·
Replies
7
Views
18K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 16 ·
Replies
16
Views
2K
Replies
1
Views
1K
  • · Replies 1 ·
Replies
1
Views
2K
Replies
2
Views
1K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 10 ·
Replies
10
Views
1K