Calculating variance of a single statistic from different collections of data

In summary: You want to combine readings from both balls into one distribution? Did you mean you want a single normal distribution of the errors the machine makes?No, you want to create a normal distribution of the machine's errors.
  • #1
spaceWatch
4
0
Let's assume there's a magical ball that bounces once every .5 seconds, and that there was a specific machine designed to calculate the rate of the ball's bounce (with error). After performing numerous readings of the ball's bounce using this machine, you have a collection of values describing the rate at which the ball bounces for this particular ball.

Now, say you have another magical ball that bounces once every .25 seconds, and used the same machine from before to read the rate at which this other ball bounces. After getting some data, you now have data on the bounce-per-.5-second ball and the bounce-per-.25-second ball.

Question: How would one be able to relate the accuracy and precision of this machine between the readings of the two balls?

Question from different perspective: Could you relate the standard deviation from readings from the first ball to readings from the second?

Question from another perspective: Would it be possible, after collecting readings from the first ball and the second ball, to estimate the readings present in a third magic ball which bounced at a different rate?

Question from yet another perspective: How would you analyze the machine's precision off a collection of balls with different bounce frequencies?

Any feedback would be greatly appreciated. Preferably, if this type of statistical analysis exists, I'd like to know the name. I'm having immense trouble finding this type of analysis using google.

_
 
Last edited:
Physics news on Phys.org
  • #2
spaceWatch said:
After performing numerous readings of the ball's bounce using this machine, you have a collection of values describing the rate at which the ball bounces for this particular ball.

It will be important to clarify whether the periodic motion in this problem really has any impact on the statistical analysis. In other words, is there any difference in analyzing these measurements vs analyzing two sets of measurements with a scale, one set on a 0.5 kg mass and the other on a 0.25 kg mass?

You say the machine measures the "rate" at which the ball bounces and not something like the times when the ball lands. If I imagine a reall machine to do this job it would have an "integration time". Over a certain time interval, it would detect the ball interrupting light to a photo diode. It would record the time's when this happened. Then it would produce a rate based on average time between these events, using all the events it recorded in the integration time interval. If it measured a ball bouncing every 0.25 seconds, it would have more measurements to average than with the slower ball. So the final output of the measurement as a rate might have different precision for balls bouncing at different rates.

Are we to assume the machine is just a "black box" that has the same precison regardless of the rate of bounce of the balls?
 
  • #3
On an added note, how would you create a single normal distribution of the machine's readings when the machine reads different balls?

EDIT: And yes, it's best to assume the machine as a black box with the same precision no matter how long it's been looking at a ball.
 
  • #4
spaceWatch said:
how would you create a single normal distribution of the machine's readings when the machine reads different balls?

Why would you want to combine readings from both balls into one distribution? Did you mean you want a single normal distribution of the errors the machine makes?

If this is a real life problem, it would be best if you described that problem rather than attempt to abstract the mathematics from it on your own. If this is hypothetical problem, then we can discuss it that way. If the machine is being treated as a black box, it would simpler to forget about the bouncing balls and just think about a machine that measures weights, or heights etc.
 
  • #5
The overall goal is to create a normal distribution which can be used to visually describe the accuracy and precision of something, when the 'something' is used on a multitude of different objects.

Granted, the ball-rate-measuring-machine was an example. One could use anything that can measure anything.
 
  • #6
Assume the variance of the errors of the device doesn't depend on the true magnitude of what it is measuring. ( For example, assume the variability of the error a scale makes doesn't increase with the weight of the thing measured - as if would if the scale had a constant variabiliy in its percentage error rather than a constant variability in error)

Assume that the mean error of the device is 0.

Given that you know the correct value of the thing being measured, you can compute the error made on each measurement. You can combine all these observations of errors into one data set and estimate the mean and variance of the normal distribution that fits it. Is that what you want?
 
  • #7
Percentage error, in some cases, could be huge. If said scale was inaccurate enough to measure an object whose weight is similar to its error, percent error could be over 1.

I suppose it would depend on the things being weighed. If the error depended on the object being measured, it would be percentage error. If not, it would be a flat plus-or-minus. Both normal distributions would have a high-point at 0. I suppose one would need to look at actual results of said hypothetical data in order to look at how best to group it.

Thanks!
 
  • #8
Yes, you'd have to look at actual data or know the physics involved to determine if the variability of the error depends on the magnitude of the thing being measued. So the problem also includes estimating a formula for this dependence.
 

Related to Calculating variance of a single statistic from different collections of data

1. What is variance and why is it important in statistics?

Variance is a statistical measure of how spread out a set of data is from its mean. It is important because it provides information about the variability and diversity of the data, which can help in making predictions and drawing conclusions.

2. How do you calculate the variance of a single statistic from different collections of data?

The formula for calculating the variance of a single statistic from different collections of data is:
Var(X) = (1/n) * ∑(x - x̅)^2
where n is the total number of data points, x is each individual data point, and x̅ is the mean of the data set.

3. Can you explain the concept of degrees of freedom in relation to calculating variance?

Degrees of freedom in statistics refers to the number of independent pieces of information that are available for calculating a statistic. In the context of calculating variance, degrees of freedom refers to the number of data points that are free to vary in a data set. It is calculated by subtracting 1 from the total number of data points.

4. What is the difference between sample variance and population variance?

Sample variance is calculated using a sample of data from a larger population, while population variance is calculated using data from the entire population. Sample variance is an estimate of the true population variance and is used when it is not possible to calculate the variance of the entire population.

5. How is variance used in hypothesis testing and statistical analysis?

Variance is used in hypothesis testing to determine if there is a significant difference between two sets of data. It is also used in statistical analysis to measure the accuracy of a statistical model and to compare the variability of different data sets.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
729
  • Set Theory, Logic, Probability, Statistics
Replies
6
Views
1K
  • Electromagnetism
2
Replies
48
Views
4K
  • Set Theory, Logic, Probability, Statistics
Replies
17
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
1K
  • Advanced Physics Homework Help
Replies
11
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
3
Views
1K
  • Precalculus Mathematics Homework Help
Replies
3
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
2K
  • Special and General Relativity
Replies
11
Views
2K
Back
Top