HiI am testing that data are normally distributed by using

In summary, the individual has conducted a chi-square test to determine if their data is normally distributed but is getting a high chi coefficient of 173, which is concerning given that the data fits the 3sigma test and has equal median and mode. They are seeking advice on what they might be doing wrong and what other test might be better suited for their approach. They mention the Central Chi Square test and having a limited number of degrees of freedom (309) for 310 data points, and also mention that other tests such as Kolmogorov and AD are consistent with normality. They are unsure if they are aggregating the data in a way that could impact their results.
  • #1
Mark J.
81
0
Hi
I am testing that data are normally distributed by using chi-square test.
I get a very high chi coefficient about 173.
This is very strange because data fits at 3sigma test and median mode are equal so I am almost sure that is normal distribution.
What I am doing wrong?
Which test do you advice for a better approach?
Regards
 
Physics news on Phys.org
  • #2


Mark J. said:
Hi
I am testing that data are normally distributed by using chi-square test.
I get a very high chi coefficient about 173.
This is very strange because data fits at 3sigma test and median mode are equal so I am almost sure that is normal distribution.
What I am doing wrong?
Which test do you advice for a better approach?
Regards

The Central Chi Square will approach the normal distribution as the number of degrees of freedom gets large. For over 50 df, the distribution is effectively normal.
 
  • #3


The problem is that I don't have so much degrees of freedom to get this value.
The other tests Kolmogorov or AD work just fine.
Maybe I am miscalculating degrees of freedom or 310 data are an issue?
Please help
 
  • #4


Mark J. said:
The problem is that I don't have so much degrees of freedom to get this value.
The other tests Kolmogorov or AD work just fine.
Maybe I am miscalculating degrees of freedom or 310 data are an issue?
Please help

Are you saying you have 310 data points? If they are independent observations that would mean you have 309 df. In this case, you certainly should be able to assume a normal distribution with the central chi square. When you say the other tests of normality work fine, do you mean they are consistent with normality? If so, you should be fine. However, I think you might be aggregating data in some way, and how you do this could be important in interpreting the data..
 
  • #5


Hello,

There are a few things to consider when testing for normal distribution using the chi-square test. First, it is important to have a large enough sample size for the test to be accurate. If your sample size is too small, the chi-square test may not be the most appropriate method for testing normality.

Additionally, the chi-square test assumes that the data is continuous and independent. If your data is not continuous or if there are dependencies among the data points, the results of the chi-square test may not be reliable.

Furthermore, the chi-square test is typically used to test for differences between observed and expected frequencies, rather than testing for normality. It may be more appropriate to use a test specifically designed for testing normality, such as the Kolmogorov-Smirnov test or the Shapiro-Wilk test.

In general, it is important to consider the assumptions and limitations of any statistical test before interpreting the results. It may also be helpful to consult with a statistician or seek out additional resources for guidance on selecting the most appropriate test for your data. Best of luck with your analysis.
 

What does it mean to test if data are normally distributed?

Testing if data are normally distributed means determining whether the data follows a normal distribution, which is a bell-shaped curve with a symmetric distribution around the mean. This is important in many statistical analyses as it affects the validity of certain assumptions and conclusions.

Why is it important to test for normal distribution?

Testing for normal distribution is important because many statistical tests and models assume that the data follows a normal distribution. If the data is not normally distributed, these tests and models may not be valid, leading to incorrect conclusions or interpretations.

How is normal distribution tested?

Normal distribution can be tested in several ways, such as visually inspecting a histogram or a Q-Q plot, calculating skewness and kurtosis statistics, or using formal statistical tests like the Kolmogorov-Smirnov test or the Shapiro-Wilk test.

What are the assumptions of normal distribution testing?

The assumptions of normal distribution testing include a large sample size, independence of observations, and a continuous variable. It is also assumed that the data is not significantly skewed or has extreme outliers.

What if my data is not normally distributed?

If your data is not normally distributed, there are several options to consider. You can try transforming the data to achieve a more normal distribution, using non-parametric tests that do not require normality assumptions, or choosing a different statistical method that is appropriate for non-normal data.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
6
Views
823
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
817
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
730
  • Set Theory, Logic, Probability, Statistics
Replies
30
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
472
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
923
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
3K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
3
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
1K
Back
Top