Chi squared test for data with error

Click For Summary

Discussion Overview

The discussion revolves around the application of the chi-squared test for data that includes asymmetric error margins. Participants explore how to handle different types of error in the context of statistical testing, particularly when the data does not conform to standard normal distributions.

Discussion Character

  • Exploratory
  • Technical explanation
  • Debate/contested
  • Mathematical reasoning

Main Points Raised

  • One participant inquires about how to perform a chi-squared test when data has asymmetric errors, specifically asking whether to use the upper or lower error margin.
  • Another participant seeks clarification on the notation used for the error margins, suggesting that the range could be interpreted as [1.9, 2.9].
  • Concerns are raised about the appropriateness of using the chi-squared test for non-normal distributions, with a suggestion that transformations may be necessary to evaluate the test statistic.
  • Some participants discuss the implications of transforming data to fit a normal distribution, questioning whether this approach is valid and how it might affect the integrity of the original data.
  • There is a suggestion that if data is suspected to follow a lognormal distribution, applying a logarithmic transformation could yield a normal distribution suitable for testing.

Areas of Agreement / Disagreement

Participants express uncertainty regarding the correct approach to handle asymmetric errors in chi-squared testing. There is no consensus on whether transforming the data to achieve normality is appropriate, and differing opinions on the implications of such transformations are evident.

Contextual Notes

The discussion highlights limitations in understanding the implications of using different error margins and the potential need for transformations in statistical analysis. There are unresolved questions about the nature of the data and the appropriateness of the chi-squared test under varying conditions.

shadishacker
Messages
28
Reaction score
0
Hi everyone.

I am totally new to statistics so my question may or may not be simple!
I know that for the data fitting we can do a chi squared test like:
\begin{equation} \chi^2 = \Sigma \frac{(f_{data}-f_{model})^2}{(error_{data})^2}\end{equation}

So I have been doing this for a while, but now I have some data with different error, let's say like:
\begin{equation} f_i = 2 ^{+0.9}_{-0.1}\end{equation}
How should I do the chi squared test for this?! What should I consider as the error? 0.9 ? 0.1?
 
Physics news on Phys.org
Notation in (2) is unfamiliar. Do you mean range = [1.9, 2.9]?
 
EnumaElish said:
Notation in (2) is unfamiliar. Do you mean range = [1.9, 2.9]?
yes, it means it can go from 1.9 to 2.9.
But until now, I have used Chi squared test only for normal distributions, which are for instance:
\begin{equation}f_i = 2_{0.1}^{0.1}\end{equation}
i.e. error in both sides are the same.
 
Hey shadishacker.

The chi-square test you are thinking of is regression based and I'm wondering why you can't transform the variance if it isn't in some standard form.

Usually doing transformations on random variables to get evaluate a test statistic is common and the most used one is standardizing a Normal distribution where you have Z = (X - mu)/sigma.

A similar transformation can be done to get it in the normal chi-square form and inferences based on this transformation can be made.
 
shadishacker said:
yes, it means it can go from 1.9 to 2.9.
But until now, I have used Chi squared test only for normal distributions, which are for instance:
\begin{equation}f_i = 2_{0.1}^{0.1}\end{equation}
i.e. error in both sides are the same.
I'm still uncomfortable with the notation. What is the significance of "2"? Is it the mean or the median or the mode? In the case of [1.9, 2.9] isn't it possible to re-center the distribution so as to make it symmetric?
 
I think it means that the mean is 2. and as the distribution is normal, the \begin{equation} \mu^2=0.1\end{equation}
However if the distribution is not normal, then \begin{equation} \mu^2\end{equation} would be different from left and right side of the mean.
 
chiro said:
Hey shadishacker.

The chi-square test you are thinking of is regression based and I'm wondering why you can't transform the variance if it isn't in some standard form.

Usually doing transformations on random variables to get evaluate a test statistic is common and the most used one is standardizing a Normal distribution where you have Z = (X - mu)/sigma.

A similar transformation can be done to get it in the normal chi-square form and inferences based on this transformation can be made.
Dear Chiro,

So you mean I can change the shape of the distribution to a nomal one?
but is it a right thing to do?
I mean if there are observational points, then doesn't this change the data completely?!
 
shadishacker said:
Dear Chiro,

So you mean I can change the shape of the distribution to a nomal one?
but is it a right thing to do?
I mean if there are observational points, then doesn't this change the data completely?!
No it does not change the data. Suppose there is a test for determining if a sample is from a normal distribution. Suppose there isn't a test for determining if a sample is from a lognormal distribution. If the data are suspected to be lognormal, what are we going to do? Well, we can "log the data" so as to turn them into data distributed normally. Then apply the normality test. That's possible because the "log" of lognormal is normal. Chiro is suggesting a similar transformation.
 

Similar threads

  • · Replies 7 ·
Replies
7
Views
3K
  • · Replies 11 ·
Replies
11
Views
3K
  • · Replies 23 ·
Replies
23
Views
3K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 6 ·
Replies
6
Views
2K
  • · Replies 8 ·
Replies
8
Views
2K
  • · Replies 5 ·
Replies
5
Views
5K
  • · Replies 4 ·
Replies
4
Views
4K
  • · Replies 8 ·
Replies
8
Views
2K