Statistical Test for X-Y Data with Error Bars

  • Thread starter Thread starter natski
  • Start date Start date
  • Tags Tags
    Statistical Test
Click For Summary
A statistical test is needed to analyze x-y data that varies around the y=0 line, with each point having smaller y error bars than the standard deviation. The goal is to determine if the observed scatter is plausible or indicates an oscillation pattern. A null hypothesis test is suggested, but it does not account for individual error bars. Instead, calculating a line of best fit and using a chi-square statistic can provide a p-value to assess the fit of the model. This approach helps evaluate whether the data aligns with the proposed model or suggests a different underlying pattern.
natski
Messages
262
Reaction score
2
Hi all,

I am hoping someone can recommend a useful statistical test. I have a set of data on an x-y plot which varies about the y=0 line in a seemingly random way. Each data point has a y error bar, which appears to be, in general, smaller than the standard deviation of the data.

I would like to apply a rigorous statistical test to calculate the probability that for the given individual one-sigma errors on each data point, the observed scatter about the y=0 line is plausible, or whether there is evidence for some kind of oscillation pattern?

A null hypothesis test was my first thought but this does not take into account the individual errors bars on the data points. Also I have noticed that many mathematicians don't hold the the null-hypothesis in much respect, nor does it offer a probability of the data points being randomly spread.

Thanks for any advise.

Natski
 
Physics news on Phys.org
It sounds like you're looking for a p-value, which is associated with null hypothesis testing. I would first use a computer and calculate the line of best fit through the data points. Let's call this line y = f(x). Then, your null hypothesis can be: "The equation which best models this phenomenon (or whatever it is) is f(x)." Your alternative hypothesis can be: "No. It's not."

Then for all the data points (X1,Y1)... (Xn,Yn), calculate ((Yi - f(Xi))^2)/(f(Xi)) for all i = 1, 2, 3...n. Add the numbers from these calculations up and that would give you a chi-square statistic. Using a table or calculator, you can find the p-value corresponding to this score. The p-value, or the probability of observing this data given the equation y = f(x) is the true model for this phenomenon.

Hope this helps.
 
The standard _A " operator" maps a Null Hypothesis Ho into a decision set { Do not reject:=1 and reject :=0}. In this sense ( HA)_A , makes no sense. Since H0, HA aren't exhaustive, can we find an alternative operator, _A' , so that ( H_A)_A' makes sense? Isn't Pearson Neyman related to this? Hope I'm making sense. Edit: I was motivated by a superficial similarity of the idea with double transposition of matrices M, with ## (M^{T})^{T}=M##, and just wanted to see if it made sense to talk...

Similar threads

  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 30 ·
2
Replies
30
Views
4K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 5 ·
Replies
5
Views
6K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 16 ·
Replies
16
Views
2K
Replies
28
Views
4K