Relation between variables and distributions in statistics

Click For Summary
Descriptive statistics involve measurable characteristics of a phenomenon and summarize data after measurements are taken, while inferential statistics utilize random variables to predict outcomes before measurements occur. A variable in descriptive statistics is straightforward, representing actual measured quantities, whereas a random variable encompasses all possible outcomes of an experiment, along with their associated probabilities. The probability distribution of a random variable provides a theoretical framework for understanding these outcomes, contrasting with the descriptive distribution that summarizes observed data. Inferential statistics relies on random variables and their probability distributions to model experiments and predict results. The discussion highlights the need for clear distinctions between these statistical concepts to grasp their applications effectively.
Mr Davis 97
Messages
1,461
Reaction score
44
I am a little confused about how variables are related to distributions as one moves from descriptive statistics to inferential statistics. I know that a variable in descriptive statistics is some measurable characteristic of some phenomenon, and its distribution is some description (table or graph) of how the values of this variable vary. This seems fairly comprehensible. But then I was introduced to the concept of a random variable, and its associated probability distribution. My main question, what is the difference between descriptive statistical variables and random variables, and what is the difference between a the distribution of a regular variable and a probability distribution of a random variable? They seem like analogues, but I am just not seeing the "big picture" in terms of what I am doing in statistics with these random variables, distributions, and probability distributions. If anybody could give me a clear description of how I should be thinking about all of this, it would be greatly appreciated.
 
Mathematics news on Phys.org
The big difference between descriptive and inferential statistics is time. I mean this: descriptive statistics happens after all the measurements are made, inferential statistics happens before all the measurements are made. As such, descriptive statistics just describe the system, while inferential statistics tries to predict the system.

So a variable in descriptive statistics is pretty logical: it is some quantity that has been measured and that we have certain measurements for. Random variables are a lot harder since the measurement has not yet been made. Again, random variables are certain quantities. But now we must prepare ourselves for all possible outcomes of the experiment! So a random variable measures all possible outcomes of a measurement and the probability distribution gives the probabilities for these outcomes. The idea is that we then do an experiment and get certain outcomes. These outcomes can be described with descriptive statistics and we hope that the distribution (in the descriptive sense) agrees with the probability distribution.
 
micromass said:
The big difference between descriptive and inferential statistics is time. I mean this: descriptive statistics happens after all the measurements are made, inferential statistics happens before all the measurements are made. As such, descriptive statistics just describe the system, while inferential statistics tries to predict the system.

So a variable in descriptive statistics is pretty logical: it is some quantity that has been measured and that we have certain measurements for. Random variables are a lot harder since the measurement has not yet been made. Again, random variables are certain quantities. But now we must prepare ourselves for all possible outcomes of the experiment! So a random variable measures all possible outcomes of a measurement and the probability distribution gives the probabilities for these outcomes. The idea is that we then do an experiment and get certain outcomes. These outcomes can be described with descriptive statistics and we hope that the distribution (in the descriptive sense) agrees with the probability distribution.

Okay, I see. So would it be correct to say something along the lines of: Inferential statistics uses random variables and their associated probability distributions in order to theoretically idealize a certain experiment in terms of outcomes and the distribution of those outcomes? Also, another question: why do we only describe a the distribution of a random variable with a probability distribution? Why are there not other ways that are analogous to descriptive statistics, such as a frequency table?
 
Here is a little puzzle from the book 100 Geometric Games by Pierre Berloquin. The side of a small square is one meter long and the side of a larger square one and a half meters long. One vertex of the large square is at the center of the small square. The side of the large square cuts two sides of the small square into one- third parts and two-thirds parts. What is the area where the squares overlap?

Similar threads

  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 30 ·
2
Replies
30
Views
4K
  • · Replies 6 ·
Replies
6
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 24 ·
Replies
24
Views
4K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 16 ·
Replies
16
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K