Relation between variables and distributions in statistics

Click For Summary
SUMMARY

The discussion clarifies the relationship between variables and distributions in statistics, specifically distinguishing between descriptive statistics and inferential statistics. A variable in descriptive statistics represents a measurable characteristic, while a random variable encompasses all possible outcomes of an experiment, described by a probability distribution. Descriptive statistics analyze data after measurements, whereas inferential statistics predict outcomes before measurements are taken. Understanding these differences is crucial for effectively applying statistical methods in research and analysis.

PREREQUISITES
  • Understanding of descriptive statistics and their applications.
  • Familiarity with inferential statistics concepts.
  • Knowledge of random variables and probability distributions.
  • Basic statistical measurement techniques.
NEXT STEPS
  • Study the differences between descriptive and inferential statistics in-depth.
  • Learn about probability distributions and their applications in inferential statistics.
  • Explore the concept of random variables and their significance in statistical modeling.
  • Investigate the use of frequency tables and other descriptive methods in data analysis.
USEFUL FOR

Statisticians, data analysts, researchers, and students seeking to deepen their understanding of the foundational concepts in statistics, particularly the interplay between descriptive and inferential statistics.

Mr Davis 97
Messages
1,461
Reaction score
44
I am a little confused about how variables are related to distributions as one moves from descriptive statistics to inferential statistics. I know that a variable in descriptive statistics is some measurable characteristic of some phenomenon, and its distribution is some description (table or graph) of how the values of this variable vary. This seems fairly comprehensible. But then I was introduced to the concept of a random variable, and its associated probability distribution. My main question, what is the difference between descriptive statistical variables and random variables, and what is the difference between a the distribution of a regular variable and a probability distribution of a random variable? They seem like analogues, but I am just not seeing the "big picture" in terms of what I am doing in statistics with these random variables, distributions, and probability distributions. If anybody could give me a clear description of how I should be thinking about all of this, it would be greatly appreciated.
 
Physics news on Phys.org
The big difference between descriptive and inferential statistics is time. I mean this: descriptive statistics happens after all the measurements are made, inferential statistics happens before all the measurements are made. As such, descriptive statistics just describe the system, while inferential statistics tries to predict the system.

So a variable in descriptive statistics is pretty logical: it is some quantity that has been measured and that we have certain measurements for. Random variables are a lot harder since the measurement has not yet been made. Again, random variables are certain quantities. But now we must prepare ourselves for all possible outcomes of the experiment! So a random variable measures all possible outcomes of a measurement and the probability distribution gives the probabilities for these outcomes. The idea is that we then do an experiment and get certain outcomes. These outcomes can be described with descriptive statistics and we hope that the distribution (in the descriptive sense) agrees with the probability distribution.
 
micromass said:
The big difference between descriptive and inferential statistics is time. I mean this: descriptive statistics happens after all the measurements are made, inferential statistics happens before all the measurements are made. As such, descriptive statistics just describe the system, while inferential statistics tries to predict the system.

So a variable in descriptive statistics is pretty logical: it is some quantity that has been measured and that we have certain measurements for. Random variables are a lot harder since the measurement has not yet been made. Again, random variables are certain quantities. But now we must prepare ourselves for all possible outcomes of the experiment! So a random variable measures all possible outcomes of a measurement and the probability distribution gives the probabilities for these outcomes. The idea is that we then do an experiment and get certain outcomes. These outcomes can be described with descriptive statistics and we hope that the distribution (in the descriptive sense) agrees with the probability distribution.

Okay, I see. So would it be correct to say something along the lines of: Inferential statistics uses random variables and their associated probability distributions in order to theoretically idealize a certain experiment in terms of outcomes and the distribution of those outcomes? Also, another question: why do we only describe a the distribution of a random variable with a probability distribution? Why are there not other ways that are analogous to descriptive statistics, such as a frequency table?
 

Similar threads

  • · Replies 30 ·
2
Replies
30
Views
5K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 7 ·
Replies
7
Views
3K