Difficulty with distribution functions

Click For Summary

Discussion Overview

The discussion revolves around the concept of random variables and distribution functions in probability theory. Participants explore the transition from set-based probability to the use of random variables, addressing definitions, misconceptions, and the implications of terminology.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant expresses difficulty in understanding the transition from probability with sets to random variables and distribution functions, seeking clarification.
  • Another participant explains that a random variable is a function mapping outcomes from a sample space to real numbers, emphasizing that the underlying set and probability measure remain integral to the concept.
  • Some participants argue that the term "random variable" is misleading, asserting that it is a deterministic function rather than a variable that is random.
  • There is a discussion about the nature of random variables, with some asserting that the term "random" is vague and undefined in probability theory, while others challenge the accuracy of definitions provided in earlier posts.
  • A participant raises a point about the classification of random variables, questioning whether a specific example should be considered continuous or mixed based on its probability distribution.
  • Another participant mentions the Radon-Nikodym derivative and its relation to the absolute continuity of distribution functions, introducing a measure-theoretic perspective.

Areas of Agreement / Disagreement

Participants express differing views on the definitions and implications of random variables and distribution functions. There is no consensus on the terminology used or the classification of certain types of random variables, indicating ongoing debate and unresolved questions.

Contextual Notes

Some participants note limitations in the definitions and examples provided, suggesting that certain assumptions or contexts may not be fully addressed. The discussion reflects a range of interpretations and understandings of foundational concepts in probability theory.

matthew9
Messages
15
Reaction score
0
My undergrad probability theory course just got to random variables and distribution functions. Up until this point, the material was very straightforward and I could understand what was being done, but I feel that I am just not seeing the jump between probability with sets and probability with random variables/distribution functions. From what I understand, the RV will replace the set allowing us to have a sort of shortcut in the calculations, but after that I am not sure. Any help would be greatly appreciated!
 
Physics news on Phys.org
The set is always there. It is usually called the "sample space" and denoted by ##\Omega##. The other elements of what is called a "probability space" are the collection of events, often denoted by ##\mathcal{A}##, and the probability measure, often denoted by ##\mathbb{P}##. In the simplest of cases (your course probably started there), the collection ##\mathcal{A}## is just the power set of ##\Omega## (i.e. the collection of all subsets of ##\Omega##).

In this simplest case, a random variable is nothing but a function ##X : \Omega \to \mathbb{R}## (in which case ##X## is called a "continuous random variable") or a function ##X : \Omega \to \mathbb{Z}## (in which case ##X## is called a "discrete random variable"). You can now obtain events by taking inverse images of sets in the codomain of ##X##, say ##\mathbb{R}##. Indeed, for any ##B \in \mathbb{R}## the inverse image ##X^{-1}(B)## is a subset of ##\Omega## or, equivalently, an element of ##\mathcal{A}## and henceforth an event.

This way you can define a function ##F : \mathbb{R} \to [0,1]## by putting
$$
F(x) := \mathbb{P}[X^{-1}((-\infty,x])] = \mathbb{P}(\{\omega \in \Omega\,:\, X(\omega) \le x\})
$$
for all ##x \in \mathbb{R}##. This function is conventionally called the "distribution function" of the random variable ##X##. As you can see, the underlying set ##\Omega##, its collection of events ##\mathcal{A}## and the probability measure ##\mathbb{P}## are all still very much present, but now in addition there is the random variable ##X## with its distribution function ##F##.
 
A random variable is neither. It's a function, not a variable. And it is deterministic, not random.
 
EnumaElish said:
A random variable is neither. It's a function, not a variable. And it is deterministic, not random.
Undoubtedly this will clarify a lot to the OP.
 
  • Like
Likes   Reactions: EnumaElish
I think you explained it well in your reply. I'm just saying it's a total misnomer. Don't go looking for a variable that is random. Look for a deterministic function.
 
matthew9 said:
From what I understand, the RV will replace the set allowing us to have a sort of shortcut in the calculations,

It isn't accurate to think that a random variable "replaces" "the set".

In most general scenario for a probability model, there is a set of elements Omega. We think of an single element in that set as a "point" or an "outcome" that cannot be further subdivided into in other points or outcomes. There is a "probability measure" mu defined on Omega that tells us the probability of certain subsets of Omega. It may not tell us the probability of each possible subset of Omega, but it tells us the probability of some subsets of Omega, including the probability of entire set and the probability of the null set. In most practical examples, mu can tell us the probability of a given single outcome.

The probability measure mu is an abstract concept. It doesn't specify any formula or procedure for find the probability of a given subset of Omega. Abstractly we can only think of mu as being specified by a perhaps infinite list of subsets and their associated probabilities.

When we have a probability model that uses a "random variable" , the random variable has a distribution function. The distribution function allows us to use a computational procedure to find the probability of certain subsets of Omega. The distribution function doesn't "replace" Omega. If you want to say the distribution function "replaces" something, you could say that it replaces the abstract implementation of mu as a big list with an implementation of mu by specific computations.
 
EnumaElish said:
I think you explained it well in your reply. I'm just saying it's a total misnomer. Don't go looking for a variable that is random. Look for a deterministic function.
Your absolutely right that the term random variable is a misnomer and is a function. But saying it isn't random is a bit misleading because random is not a term in probability theory and in fact is an undefined vague idea about reality. Same goes for deterministic.

Also Krylov's bit of sarcasm is uncalled for since his reply is very likely over the head of the OP. For an intro to probability Ω should be finite with proscribed probabilities. This is fact completely general (see Nelson's "Radical Elementary Probability Theory"),
 
  • Like
Likes   Reactions: EnumaElish
Zafa Pi said:
Also Krylov's bit of sarcasm is uncalled for since his reply is very likely over the head of the OP.
Oh dear, do you think that is why he left after starting the thread and did not return so far?
 
  • Like
Likes   Reactions: Heinera
Krylov said:
Oh dear, do you think that is why he left after starting the thread and did not return so far?
Look at post #1, what do you think. <<Mentor note: comment removed.>>
And BTW you said "In this simplest case, a random variable is nothing but a function X:Ω→R (in which case X is called a "continuous random variable")" which is false. A continuous r.v. is one that gives points probability 0.
 
Last edited by a moderator:
  • #10
<<Mentor note: reply to deleted comment removed>>
Zafa Pi said:
And BTW you said "In this simplest case, a random variable is nothing but a function X:Ω→R (in which case X is called a "continuous random variable")" which is false. A continuous r.v. is one that gives points probability 0.
No this terminology is not false, see here, where this is explicitly stated as the second sentence in the second paragraph. Note that I was not talking about an absolutely continuous random variable.
 
Last edited by a moderator:
  • #11
Zafa Pi said:
Look at post #1, what do you think.
The OP has not been back to since before any of the replies in this thread were posted so clearly anything the replies state have nothing to do with it.
 
  • Like
Likes   Reactions: jim mcnamara, EnumaElish and S.G. Janssens
  • #12
Krylov said:
<<Mentor note: reply to deleted comment removed>>

No this terminology is not false, see here, where this is explicitly stated as the second sentence in the second paragraph. Note that I was not talking about an absolutely continuous random variable.
Two points: If one writes X: Ω→R that allows X to be constant, i.e. take on one value, whereas the wiki article requires the the range to be uncountable. However I really don't think this is an important issue The main thing is that I think the wiki article got it wrong. If the range of X is {0} U [1,2] where the probability of 0 is 1/2 and the probability of [1,2] is 1/2, but otherwise uniform, would you call X continuous in spite of it having uncountable range? Most sources don't call it continuous.
When I googled "continuous random variable probability" I found my definition. The idea behind a continuous r.v. is that the PDF is continuous (need not be absolutely continuous) which implies a point has probability 0.
 
  • #13
Zafa Pi said:
If the range of X is {0} U [1,2] where the probability of 0 is 1/2 and the probability of [1,2] is 1/2, but otherwise uniform, would you call X continuous in spite of it having uncountable range? Most sources don't call it continuous.

It's a mixed random variable.

When I googled "continuous random variable probability" I found my definition. The idea behind a continuous r.v. is that the PDF is continuous (need not be absolutely continuous) which implies a point has probability 0.

The existence of the Radon-Nikodym derivative requires that the distribution function to be absolutely continuous, this is a classic result in measure theory. Hence the term "absolutely continuous random variable".
For an example of a continuous random variable with no pdf, take the Cantor distribution.
 

Similar threads

  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 30 ·
2
Replies
30
Views
5K
Replies
4
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 11 ·
Replies
11
Views
4K
  • · Replies 6 ·
Replies
6
Views
5K