I What are the recommended tests for comparing two sets of data?

VVS2000 · Feb 26, 2022

So I have two columns of data, One containing experimental values and the other having expected values. So I read that chi-squared test and Anova Tests can be used to compare two set of data. My main aim is to quantitatively know how different these two sets of data are, so are these two tests enough or is there any other tests that you would suggest?

Dale · Feb 26, 2022

The standard measure of distance would be the sum of square residuals. However, I am not sure what you are asking has real meaning as stated.

As I understand your statement, you don’t have two sets of data, you have one set of data and a model. Any set of data can plausibly come from any model given sufficiently uncertain measurements. So just asking about the sum of square residuals doesn’t say much.

If your model has any free parameters then you can meaningfully ask which parameters minimize the residuals. Or if you have two competing models you can ask which model minimizes the residuals. Or if you can predict the expected uncertainty in your measurements, then you can determine how likely the data is under the model.

FactChecker · Feb 26, 2022

Asking "how different" your data is from the expected values of a model is a vague question.
Are you asking how likely (probability) such data might fit the model like that (or worse)? The Chi-squared goodness of fit test is well suited for that.
Are you asking for some numerical measure of how great the differences are? The sum-squared-errors total is well suited for that.

VVS2000 · Feb 28, 2022

Dale said:

The standard measure of distance would be the sum of square residuals. However, I am not sure what you are asking has real meaning as stated.

As I understand your statement, you don’t have two sets of data, you have one set of data and a model. Any set of data can plausibly come from any model given sufficiently uncertain measurements. So just asking about the sum of square residuals doesn’t say much.

If your model has any free parameters then you can meaningfully ask which parameters minimize the residuals. Or if you have two competing models you can ask which model minimizes the residuals. Or if you can predict the expected uncertainty in your measurements, then you can determine how likely the data is under the model.

ok sorry for not being clear. I have two sets of data. One column contains Observed or experimental focal length of a lens at different heights from the axis. The Other column contains the expected or theoretical value of focal length. so yeah I want to quantitatively know different these two sets of data are given the uncertainty and errors in the observed data

Dale · Feb 28, 2022

VVS2000 said:

The Other column contains the expected or theoretical value of focal length

Does this column have uncertainty associated with it?

VVS2000 · Mar 1, 2022

Dale said:

Does this column have uncertainty associated with it?

yes, but all values have the same uncertainty

FactChecker · Mar 1, 2022

Two possibilities are Chi-Squared goodness of fit, and linear regression. The first one is very general. It requires that the data be combined into categories. The second one requires that the model of the variable of interest is a linear function of the other variables.

I What are the recommended tests for comparing two sets of data?

Similar threads

B A Little Probability Puzzle

I Need help solving this Existence Algorithm for truth

I A variant of the Monty Hall problem

I What Are the Axioms of Fuzzy Logic and How Do They Extend Boolean Algebra?

I Please Explain (actually explain) The Monty Hall Problem

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect

Insights What Exactly is Dirac’s Delta Function? - Insight

Insights Relativator (Circular Slide-Rule): Simulated with Desmos - Insight

Insights Fixing Things Which Can Go Wrong With Complex Numbers