Meaning of PDFs in the context of statistics

  • Context: Undergrad 
  • Thread starter Thread starter Cole A.
  • Start date Start date
  • Tags Tags
    Statistics
Click For Summary

Discussion Overview

The discussion revolves around the interpretation and meaning of probability density functions (PDFs) and probability mass functions (PMFs) in the context of statistics, particularly regarding fixed quantities like the height of a building. Participants explore the implications of using these functions to describe values that are currently unknown but are fixed in reality.

Discussion Character

  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • One participant notes that in probability theory, PDFs and PMFs describe random variables, which are inherently unpredictable, using the example of coin flips.
  • Another participant argues that the unpredictability of a specific measurement does not negate the usefulness of probability functions, as they can describe the expected distribution of repeated measurements.
  • A later reply expresses appreciation for the clarification that reporting a PDF indicates the expected outcomes of repeated measurements, rather than asserting a fixed value.
  • One participant questions the appropriateness of using probability distributions for a single building's height, suggesting that such usage is more aligned with Bayesian statistics rather than frequentist approaches.
  • Another participant humorously critiques the implications of a normal distribution with a mean height, pointing out that it suggests non-zero probabilities for implausible heights, like a basement or a tunnel.

Areas of Agreement / Disagreement

Participants express differing views on the appropriateness of using PDFs to describe fixed quantities. While some find value in the predictive nature of these functions, others challenge their applicability to singular, absolute measurements.

Contextual Notes

The discussion highlights the distinction between frequentist and Bayesian interpretations of probability distributions, particularly in relation to fixed quantities and measurement errors.

Cole A.
Messages
12
Reaction score
0
This is a really basic point that I am getting held up on.

In probability theory, we use PDFs and PMFs to describe random variables. (The term "random variable" carries a certain connotation of "unpredictable variations when repeated.") For example, the number of heads we will see in 50 flips of a coin is a random variable. This number will change from one set of 50 flips to the next. And the changes cannot be predicted beforehand. The random variable could be described by a Bin(50, p) PMF. We could even set p = 0.5 if the coin was fair.

But in statistics, PDFs and PMFs still seem to be used, to describe any quantity that we do not know for sure --- even when there isn't a scope of repetition associated with that quantity, i.e. the quantity has a real, fixed, unchanging value, just presently unknown to us. This is what is confusing me: If someone looks at a building and says that its height in feet is described by N(100, 50), and another claims that its height is described by Unif(0, 200), what are they saying exactly? The building's height is an absolute, fixed, unchanging number. What meaning does a PDF possibly have in this context?
 
Physics news on Phys.org
We use the probability functions because, even though the result of a particular event is not predictable, the pattern of the unpredictability is, itself, predictable. You should have seen this already. It is very useful, for example, to realize that the sum of two dice is most likely to be 7, and cannot be more than 12 or less than 2. There are fortunes being made by being able to predict just how often each possible number will show up in many rolls.

In the case of a physical object's height, it is an open matter as to whether or not it has one certain and absolute height at a particular time or not as there is no way of telling.

All we can do is measure it to finite precision with our equipment. When someone reports a PDF for a distribution of possible heights, they are telling you that the outcome of many repeated measurements of the height will return values that conform with that distribution.

If someone says "I will build you a building with a height distributed as follows..." then they are telling you how well they can build to the specification. This is why we have margins for error and error analysis.
 
Simon Bridge said:
When someone reports a PDF for a distribution of possible heights, they are telling you that the outcome of many repeated measurements of the height will return values that conform with that distribution.

This clarified things immensely for me. Thank you.
 
Cole A. said:
If someone looks at a building and says that its height in feet is described by N(100, 50), and another claims that its height is described by Unif(0, 200), what are they saying exactly? The building's height is an absolute, fixed, unchanging number. What meaning does a PDF possibly have in this context?

Unless you are studying Bayesian statistics, you don't find such statements in a statistics text.
In "frequentist" statistics, the kind normally studied in introductory courses, you would not find the height of one building described by a probability distribution. You might find the height of a randomly selected building from a population of buildings described by a distribution. You might find the measured height of a single building described by a probability distribution if the measurement has a random error.

If you are studying Bayesian statistics, you might use a probability distribution for the height of one building. The distribution can be regarded as stating a "belief" about the height or you can pretend that when the building was built, it's height was selected at random from a population of possible heights that might have occurred.
 
Hmmm... and N(100,50) feet seems a little odd - that would be a normal distribution with a mean at 100' and a variance of 250' ... suggesting there is a non-zero probability that the "building" is actually a basement. (Quite aside from that the tails of the normal distribution don't hit zero so there's a faint chance that the building is actually a space-elevator or a tunnel through the Earth.)
 

Similar threads

  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
1K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 16 ·
Replies
16
Views
2K
  • · Replies 28 ·
Replies
28
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 12 ·
Replies
12
Views
2K