Statistical Test for X-Y Data with Error Bars

  • Context: Graduate 
  • Thread starter Thread starter natski
  • Start date Start date
  • Tags Tags
    Statistical Test
Click For Summary
SUMMARY

The discussion focuses on selecting a statistical test for analyzing x-y data with associated error bars. The user seeks a method to determine the plausibility of observed scatter around the y=0 line, considering individual one-sigma errors. Natski recommends calculating the line of best fit, defining a null hypothesis based on this model, and using the chi-square statistic to derive a p-value. This approach allows for a rigorous evaluation of the data's distribution against the proposed model.

PREREQUISITES
  • Understanding of statistical hypothesis testing
  • Familiarity with chi-square statistics
  • Knowledge of regression analysis and line of best fit
  • Ability to calculate p-values from statistical data
NEXT STEPS
  • Learn how to perform chi-square tests in Python using SciPy
  • Study regression analysis techniques in R, focusing on linear models
  • Explore advanced statistical methods for handling error bars in data
  • Investigate the implications of p-values in hypothesis testing
USEFUL FOR

Researchers, data analysts, and statisticians who are analyzing experimental data with error measurements and seeking to apply rigorous statistical methods for hypothesis testing.

natski
Messages
262
Reaction score
2
Hi all,

I am hoping someone can recommend a useful statistical test. I have a set of data on an x-y plot which varies about the y=0 line in a seemingly random way. Each data point has a y error bar, which appears to be, in general, smaller than the standard deviation of the data.

I would like to apply a rigorous statistical test to calculate the probability that for the given individual one-sigma errors on each data point, the observed scatter about the y=0 line is plausible, or whether there is evidence for some kind of oscillation pattern?

A null hypothesis test was my first thought but this does not take into account the individual errors bars on the data points. Also I have noticed that many mathematicians don't hold the the null-hypothesis in much respect, nor does it offer a probability of the data points being randomly spread.

Thanks for any advise.

Natski
 
Physics news on Phys.org
It sounds like you're looking for a p-value, which is associated with null hypothesis testing. I would first use a computer and calculate the line of best fit through the data points. Let's call this line y = f(x). Then, your null hypothesis can be: "The equation which best models this phenomenon (or whatever it is) is f(x)." Your alternative hypothesis can be: "No. It's not."

Then for all the data points (X1,Y1)... (Xn,Yn), calculate ((Yi - f(Xi))^2)/(f(Xi)) for all i = 1, 2, 3...n. Add the numbers from these calculations up and that would give you a chi-square statistic. Using a table or calculator, you can find the p-value corresponding to this score. The p-value, or the probability of observing this data given the equation y = f(x) is the true model for this phenomenon.

Hope this helps.
 

Similar threads

  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 30 ·
2
Replies
30
Views
5K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 5 ·
Replies
5
Views
6K
Replies
28
Views
4K
  • · Replies 7 ·
Replies
7
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 8 ·
Replies
8
Views
7K