Experiments are the arbiters of theoretical assertions, and the proper experiment depends on the assertion. If the assertion is essentially "adding an offset to a model of a proportionality is always better", then it depends on what you mean by "better." For me "better" means "more accurate" since accuracy is key in testing the specific hypothesis and in determining the constant of proportionality.
Vanadium 50 said:
It has been suggested to add extra terms and if they come out zero, they don't hurt anything. One (of several) problem with this is "where do you stop?" You can always add terms and the best fit will always be when the number of parameters equals the number of data points. These "fits" tend to be unphysical and wiggly.
Extra terms do hurt. Even when they are statistically not different from zero, the cases above demonstrate that allowing them increases the errors of the parameters from the known good values.
Extra terms are also unnecessary for testing hypotheses relating to proportionality. Before least square fitting was invented, there was hundreds of years of sound scientific practice testing proportions without a constant term. That's how Kepler tested his third law: T^2 = k a^3. That's how Galileo tested his law of falling bodies: d = k t^2. That's how Robert Boyle tested Boyle's law: PV = k. Least squares fitting provides much easier ability, but not the necessity of constant terms.
If there are significant resolution issues in either time or position, adding terms can remedy the experimental limitations with more advanced analysis. Using a video camera to test Galileo's law of falling bodies (and determine g), the time resolution is 1/30 or 1/60 of a second, and the position resolution might be 1720 pixels across the vertical field of view. The time resolution is the big limitation since it is much harder to drop an object at the exact moment a frame is captured. Adding constant and linear terms to the equation saves the day, both allowing testing of Galileo's law of falling bodies AND determination of g within 1-2% (the accuracy is now limited by the imperfect linear mapping of pixels to position.) But there is something of a happy accident here in that adding a constant and linear term don't change the value of the coefficient of the quadratic term. This is not generally the case.
As I mentioned in the original speed of sound article, there is no need for a least-squares fit. The historical procedure for testing a hypothesis of proportionality is adequate. One can compute a speed of sound simply as V = d/t for each data point. If one has a number of data points, one can then compute a mean and standard error for comparison with the speed of sound predicted for the measured temperature. Values computed this historical way tend to be within the standard error of those predicted from the temperature and obtained from a least squares fit without a vertical shift. Adding the vertical shift, while the shift is statistically not different from zero, it pulls the slope further from the proportionality constant obtained using the historical method (d/t) and further from the predicted speed of sound.
Before selecting an analysis approach for students, my habit is to run a pilot experiment or two along with the analysis. Of course, I try common variations on the analysis which is how I learned that the constant and linear terms save the day for a video experiment on Galileo's law of falling bodies, but that the constant term reduces accuracy on the speed of sound experiment. For some physics classes, it might make sense to try a linear fit with the offset as an additional test on the proportionality hypothesis, d = Vt. If one has done a good experiment (sufficiently small systematic errors), an offset different from zero (in the sense of statistical significance) would cast doubt on the hypothesis. But this experiment is not hard to do well. A simple tape measure determines the distance to the wall to much better than 1%, the separation from the firecracker to the wall is over 1000 times the microphone offset, and the firecracker produces very high frequencies so that diffraction does not significantly harm the speed of sound determination. Wind has the greatest potential to introduce systematic errors, but there are a lot of still mornings to avoid that.