Register to reply 
Margin of error in a tdistribution 
Share this thread: 
#1
Nov911, 07:52 AM

P: 83

This is not a homework problem.
I am working on an experiment and I need to know how many samples (n) I need to achieve a margin of error (e) below 2%. Looking through a statistics text book they provide a calculation for e using zdistributions, but not tdistributions. Replacing variables I concluded that e = (t_{a/2}S)^{2}/[itex]\sqrt{}n[/itex] where t_{a/2} is the upper bound, S is the sample standard deviation. Is is correct? Also, if so, is the value of e given as a percentage? Lastly, from some preliminary tests (2 tests), the closer the initial tests results are to each other the smaller the error value (obviously). But I am concerned that a sample size of two is simply too small to definitively conclude that I am safely within my desired margin or error. I need to conduct tests under a variety of conditions and the final number of tests performed may run into the hundreds, if not thousands, so it is vital that I do not perform more tests for any particular set of conditions that absolutely necessary. Any advice in this regard would be greatly appreciated. Thanks in advance. 


#2
Nov911, 11:05 AM

Sci Advisor
P: 3,300

What is your definition of "margin of error"?
Do you want to say something like "the estimated value, based on the sample, is 28.93 and there is a 95% chance that the true value is within plus or minus 1.30 of this value"? Then give up, if you are using ordinary ("frequentist") statistics. It won't tell you that. You can get something called a "confidence" interval from frequentist statistics. It doesn't tell you the probability that the true value (of whatever your are estimating) is in a specific numerical interval. You didn't say what your are estimating and what estimator you are using. Once those are specified, you can estimate the standard deviation of the estimator. Some people call an interval that is plus or minus two (or three or four) standard deviations around the estimated value, the "margin of error". Is that what you mean? 


#3
Nov911, 01:30 PM

P: 83

I am trying to find the coefficient of friction, and I plan to use the sample mean as the estimator. 


#4
Nov911, 05:05 PM

Sci Advisor
P: 3,300

Margin of error in a tdistribution
However, if you are thinking about publishing a report, consider that there are some areas of engineering and science where frequentist statistics is traditional. Frequentist statistics emphasizes "confidence intervals" for estimators. A confidence interval approach can make a statement like "When we base our estimate on 50 samples, there is a 95% probability the the true value will be within plus or minus 1.3 of our estimate." This statement is similar to what you want, but it cannot be applied to a particular estimate, such as 28.93. Laymen often incorrectly apply it to read like the statement you want. Bayesian and frequentist statistical methods often use substantially the same formulae. There is a distinct difference in the problems they are solving with these formulae. 


#5
Nov911, 06:50 PM

P: 83

Thank you for your help. It is greatly appreciated.



#6
Nov1411, 02:07 PM

HW Helper
P: 1,372

""When we base our estimate on 50 samples, there is a 95% probability the the true value will be within plus or minus 1.3 of our estimate."
Except it is not stated that way, as this makes it seem the true value is the quantity that is random. "When we repeat this process a large number of times, and create a confidence interval each time, 95% of those intervals will fall around the true value" is the appropriate interpretation. Notice that this does not attach any information to a specific instance of an interval. 


#7
Nov1511, 04:46 PM

HW Helper
P: 1,372

A final comment: "Replacing variables I concluded that e = (ta/2S)2/√n where ta/2 is the upper bound, S is the sample standard deviation.
Is is correct? Also, if so, is the value of e given as a percentage?" won't work. In order to know which tvalue to use in this formula you need to select a number of degrees of freedom: as soon as you do that you've selected a sample size. The original formula uses z because (say for 95% confidence) a single z value suffices for every sample size. The downside: you must assume normality (as you do even for a tinterval) 


#8
Nov1511, 07:21 PM

P: 83

I did some research and one study suggested taking a couple samples, finding the minimum number of samples for a zdistribution, then use that value of n in determining the degrees of freedom for the tdistribution. Then I solve for the new n. with each new sample everything is recalulated, S, n for zdis and n for the tdis.



#9
Nov1611, 04:27 AM

P: 259

The problem is that with n=2 the sample standard deviation is likely to be quite an inaccurate estimate of the standard deviation of the population. So you have an additional uncertainty. I don't know whether in your many runs you can assume that the population standard deviation is close to the same in each run. If you can then you can use a pooled sample standard deviation and your problems are over. If you can't and your n for each possible population standard deviation is low then you need to use the t distribution instead of z. 


#10
Nov1611, 01:20 PM

Sci Advisor
P: 3,300

We really should get straight what your goal is. If you are writing a formal report about the coefficient of friction and the audience expects certain statistical techniques and would be made uncomfortable by others, the obvious course of action is to use the techniques they expect. If you are doing a student project on the coefficient of friction and getting distracted by a fascination with statistics, then you have to decide how much time you can devote to learning statistical techniques before the report is due. To me, a more intresting application of probability to the coefficient of friction would be in stochastic modeling of the coefficient of friction, but might be too big a digression from your task, whatever that task is. 


#11
Nov1611, 08:45 PM

P: 83

This is a school project. The statistical analysis is simply an attempt to show that some thought went into the sample size and it was not chosen randomly.



Register to reply 
Related Discussions  
Basis for Margin of Error in Opinion Polls?  Set Theory, Logic, Probability, Statistics  9  
Standard Errors and Margin of Error  Set Theory, Logic, Probability, Statistics  6  
Source for the margin of error of curvature  Cosmology  3  
Margin of Error.  Introductory Physics Homework  0  
Finding error margin when one term is 0  General Math  5 