How Does Sampling Strategy Impact Measurement Accuracy in Statistics?

In summary, when measuring a physical variable, it is not necessary to collect multiple samples as the same amount of data can be obtained from a single large sample. The rule in statistics is to add the variances of independent random variables, resulting in a standard deviation of S/sqrt(N) for the mean value. The Central Limit Theorem can also be applied to describe the behavior of the sample mean, and the S/sqrt(N) formula is derived from this. The concept of a sampling distribution is used to make definitive statements about the probability of a mean value deviating from the actual physical value.
  • #1
fog37
1,568
108
Hello Forum,

I am taking a lab and we are learning about measurement and uncertainty. Suppose we have to measure the length L of an object. Once the data has been collected we can calculate the mean (average) and the standard deviation s. The resulting measurement would be expressed as [ mean +- s/sqrt(N)] unit where N is the number of collected measurements.

In statistics, there is the practice to collect multiple samples and obtain statistics from the sampling distribution. But this approach does not seem to apply in in the context of measuring a physical variable. Why?
Would it be better, from a statistical standpoint, to collect a single sample of N=100 measurements or M=10 samples each containing N=10 measurements? On both cases the total number of measurements is 100...

Thanks,
fog37
 
Physics news on Phys.org
  • #2
I'm not 100% sure I understand what you're asking but here goes...

In statistics it may be helpful to consider multiple samples to better understand the sampling distribution but as you say the N=100 once vs Ten cases of N=10 are equivalent amounts of data. The rule in statistics is that when we add independent random variables we add their variances (square of the standard deviation). Thus when we average (1/N*sum) we get for the composite standard deviation the [itex]S_{mean} = \frac{S}{\sqrt{N}}[/itex].

Consider then your N=100 vs Ten N=10 cases. Assuming the sample standard deviations for each sample are about equal and close to the pop. stdev we would average the averages to get:
[tex] S_{Mean} = \frac{ \frac{S}{ \sqrt{10}} }{\sqrt{10}} = \frac{S}{\sqrt{10}\cdot \sqrt{10}}= \frac{S}{\sqrt{100}}[/tex]
so the standard deviation calculation is the same.
[There's additional stat analysis we could do with the variations in the sample standard deviations but it should all balance out as your intuition would suggest since fundamentally what it's telling you is correct, this partitioning of the sample doesn't change the information it contains.]

As you move deeper into the probability theory for the sampling distribution you'll note the Central Limit Theorem applies an you can describe the shape of the distribution for the sample mean you use for your measurement. (that it has the normal distribution with its bell shaped density curve).

What may be confusing you is that to talk about the sampling distribution of your mean measurement, one must speak of how it behaves under many samplings. One is abstracting the random experiment of measuring the system one level higher to the random experiment of measuring N systems and calculating a mean value. To describe this second random experiment we need a (meta)sample i.e. many N-measurement samples. Once we consider this then we can describe the aggregate behavior of such samples and thus make definitive statements about the probability that your mean value deviates a certain amount from the actual physical value it seeks to measure.

This is where the S/sqrt(N) formula comes from... in particular:
[tex] \sigma_{\overline{X}} = \frac{1}{\sqrt{N}} \sigma_X[/tex]
where [itex]\overline{X}[/itex] is the mean value of N independent random variables each with standard deviation [itex]\sigma_X[/itex].

One is here describing how the mean value behaves over many samples in terms of how the original variable behaves over a single sample.
 

Related to How Does Sampling Strategy Impact Measurement Accuracy in Statistics?

What is measurement and statistics?

Measurement and statistics is a branch of science that deals with the collection, analysis, and interpretation of data in order to make informed decisions and draw conclusions about a particular phenomenon or population.

Why is measurement important in scientific research?

Measurement is important in scientific research because it allows us to quantify and compare various aspects of a phenomenon or population. It provides us with a standardized way to collect and analyze data, making our findings more reliable and valid.

What are the different types of measurement scales?

There are four main types of measurement scales: nominal, ordinal, interval, and ratio. Nominal scales categorize data without any numerical value, ordinal scales rank data in a specific order, interval scales have equal intervals between values but no true zero point, and ratio scales have equal intervals and a true zero point.

How do statistics help in understanding data?

Statistics help in understanding data by providing us with tools and techniques to organize, summarize, and analyze large amounts of data. This allows us to identify patterns, trends, and relationships within the data, leading to a deeper understanding of the phenomenon being studied.

What is the difference between descriptive and inferential statistics?

Descriptive statistics are used to summarize and describe a set of data, while inferential statistics are used to make predictions or generalizations about a larger population based on a sample of data. In other words, descriptive statistics describe what the data shows, while inferential statistics make inferences about what the data implies.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
13
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
648
  • Set Theory, Logic, Probability, Statistics
Replies
21
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
12
Views
2K
  • Set Theory, Logic, Probability, Statistics
2
Replies
42
Views
3K
  • Set Theory, Logic, Probability, Statistics
Replies
18
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
24
Views
3K
  • Set Theory, Logic, Probability, Statistics
Replies
3
Views
1K
Back
Top