Random choosing of objects from a Normal distribution

In summary, when dealing with probabilities and normal distributions, there is no direct proof that any given sample will be normally distributed. However, there is a proof that the means of repeated samples from any distribution will be normally distributed at the limit, which can be used to estimate the distribution of the sampling error. Additionally, if samples are taken from the same distribution, the histogram of those samples will converge to the underlying distribution as the sample size approaches a large number or infinity. This is known as the Strong Law of Large Numbers.
  • #1
omoplata
327
2
Let's say I have a very large number of objects with some property which is Normally distributed. If I choose a subset of these objects randomly, will those objects have the property Normally distributed too?

If the answer is yes, can it be proven?

Thanks
 
Last edited:
Physics news on Phys.org
  • #2
omoplata said:
Let's say I have a very large number objects with some property which is Normally distributed. If I choose a subset of these objects randomly, will those objects have the property Normally distributed too?

If the answer is yes, can it be proven?

Thanks

Sure, approximately with sampling error taken into account. It seems to me that its by definition.
 
  • #3
Thanks for the answer.

But I don't understand. Could you write down the logical flow of arriving at that answer, so I can understand it better?

Is it like this?

The initial sample is normally distributed. If we choose objects from it randomly, that means we show no preference one way or the other. So we get a new sample with the same mean and the standard deviation as the larger sample.
 
  • #4
omoplata said:
Thanks for the answer.

But I don't understand. Could you write down the logical flow of arriving at that answer, so I can understand it better?

Is it like this?

The initial sample is normally distributed. If we choose objects from it randomly, that means we show no preference one way or the other. So we get a new sample with the same mean and the standard deviation as the larger sample.

Because we are dealing with probabilities, there is no direct proof that any given sample from a normal distribution will be normally distributed. However there is a proof that the means of repeated samples from any distribution will be normally distributed at the limit.

http://www.swarthmore.edu/NatSci/peverso1/Stat 111/CLT.pdf
 
Last edited:
  • #5
SW VandeCarr said:
Because we are dealing with probabilities, there is no direct proof that any given sample from a normal distribution will be normally distributed. However there is a proof that the means of repeated samples from any distribution will be normally distributed at the limit.

http://www.swarthmore.edu/NatSci/peverso1/Stat 111/CLT.pdf

OK. I don't have enough background to understand that proof, but I can use it. Thanks.
 
  • #6
omoplata said:
Thanks for the answer.

But I don't understand. Could you write down the logical flow of arriving at that answer, so I can understand it better?

Is it like this?

The initial sample is normally distributed. If we choose objects from it randomly, that means we show no preference one way or the other. So we get a new sample with the same mean and the standard deviation as the larger sample.


Yes that is right. Except that by chance the mean and stddev are going to be somewhat different. This is known as sampling error, and gets smaller as the sample grows larger. Statistics is largely about figuring out the distribution of the sampling error, so you know how large of a sample to take.
 
  • #7
omoplata said:
Thanks for the answer.

But I don't understand. Could you write down the logical flow of arriving at that answer, so I can understand it better?

Is it like this?

The initial sample is normally distributed. If we choose objects from it randomly, that means we show no preference one way or the other. So we get a new sample with the same mean and the standard deviation as the larger sample.

One subtlety to watch out for is if you don't know the mean of the space. Then you have to estimate the mean from the sample, which distorts things a bit. This means that you get something called a Student's t distribution.
 
  • #8
omoplata said:
If I choose a subset of these objects randomly, will those objects have the property Normally distributed too?

You won't get a clear answer until you ask a clear question. What do you mean by the objects having the property normally distributed"?

If the random variables denoting the objects are [itex] X_1, X_2,...X_n [/itex], are you asking about the distribution of the single random variable [itex] \frac{( X_1 + X_2 + ...X_n)}{n} [/itex]? Or are you asking a question about the random vector [itex] (X_1,X_2,...X_n) [/itex] (in which case you would be asking about whether their joint distribution is a multivariate normal). Or are you asking whether if we histogram the individual measurements [itex] {X_1,X_2,...X_n} [/itex] that the histogram would resemble the normal distribution?
 
  • #9
Stephen Tashi said:
You won't get a clear answer until you ask a clear question. What do you mean by the objects having the property normally distributed"?

If the random variables denoting the objects are [itex] X_1, X_2,...X_n [/itex], are you asking about the distribution of the single random variable [itex] \frac{( X_1 + X_2 + ...X_n)}{n} [/itex]? Or are you asking a question about the random vector [itex] (X_1,X_2,...X_n) [/itex] (in which case you would be asking about whether their joint distribution is a multivariate normal). Or are you asking whether if we histogram the individual measurements [itex] {X_1,X_2,...X_n} [/itex] that the histogram would resemble the normal distribution?

Yeah, I meant to ask if we histogram the individual measurements [itex] {X_1,X_2,...X_n} [/itex] would the histogram resemble the normal distribution?
 
  • #10
omoplata said:
Yeah, I meant to ask if we histogram the individual measurements [itex] {X_1,X_2,...X_n} [/itex] would the histogram resemble the normal distribution?

If those samples came from the same distribution, then yes the histogram of those samples as the sample size approached a large number (or infinity) should converge to the underlying distribution (if they all come from the same distribution).

One way of thinking about this intuitively is to think of the Strong Law of Large numbers in terms of the frequencies (probabilities) of each element of the domain.

Basically the idea is that the observed frequencies end up converging to the expected frequencies for every element of the domain of the random variable, and as this happens the distribution of the sample converges to the distribution of the underlying distribution if every sample indeed comes from the underlying distribution.
 
  • #11
Thanks to everyone who replied.
 

What is a Normal distribution?

A Normal distribution is a type of probability distribution that is often used to model real-world data. It is characterized by a bell-shaped curve and is symmetric around its mean. The majority of the data falls within one standard deviation of the mean, and the tails of the curve extend infinitely in both directions.

How are objects randomly chosen from a Normal distribution?

In order to choose objects randomly from a Normal distribution, a random number generator is used. The generator generates a set of numbers that follow a Normal distribution, and these numbers are then used to select the objects.

What is the purpose of randomly choosing objects from a Normal distribution?

The purpose of randomly choosing objects from a Normal distribution is to mimic the natural variability of real-world data. This allows scientists to make inferences and predictions about a population based on a sample, as well as to test hypotheses and make statistical conclusions.

What factors can affect the results of randomly choosing objects from a Normal distribution?

The results of randomly choosing objects from a Normal distribution can be affected by the sample size, the mean and standard deviation of the distribution, and the method used to generate the random numbers. It is important to carefully consider these factors in order to accurately represent the population being studied.

What are some examples of situations where randomly choosing objects from a Normal distribution is used?

Randomly choosing objects from a Normal distribution is commonly used in various scientific fields, such as psychology, economics, and biology. It can be used to study the distribution of IQ scores, the heights of individuals in a population, or the amount of time it takes for a plant species to grow. It is also used in quality control and process improvement, where the goal is to achieve a Normal distribution of data for optimal performance.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
30
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
441
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
471
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
341
  • Set Theory, Logic, Probability, Statistics
Replies
3
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
483
  • Set Theory, Logic, Probability, Statistics
Replies
3
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
Back
Top