Solving Overdetermined Problems: χ² Distribution Requirements

Niles · Sep 8, 2011

Hi

I'm not sure this is the right place to post, but I'll go ahead. My book says that if I am dealing with an overdetermined problem with m data points and n parameters (so m > n), then my observed chi-square χ²_obs follows a χ² distribution with m-n degrees of freedom if the data points are normally distributed.

I thought that the number of degrees of freedom was always m-n, regardless of what distribution my data follows. Am I right, or is the book's statement correct?

Stephen Tashi · Sep 9, 2011

Niles said:

Am I right, or is the book's statement correct?

I think no one has answered this because you haven't given a clear statement of what the book said. For example, what kind of parameters is the book talking about? Means? Covariances? Any old parameter? What kind of data are the "data points"?

Do you have a source or link that supports your own opinion that the random variables need not be normally distributed?

Dickfore · Sep 9, 2011

Niles said:

Hi

I'm not sure this is the right place to post, but I'll go ahead. My book says that if I am dealing with an overdetermined problem with m data points and n parameters (so m > n), then my observed chi-square χ²_obs follows a χ² distribution with m-n degrees of freedom if the data points are normally distributed.

I thought that the number of degrees of freedom was always m-n, regardless of what distribution my data follows. Am I right, or is the book's statement correct?

The chi-squared distribution has an essential parameter called the number of degrees of freedom. So, the bolded and red text in your quote is all part of the name.

Niles · Sep 9, 2011

By "parameters" I mean parameters used to make a fit to the data. And data points are physically measured data, which is why I believe the book is so keen on always dealing with normally distributed data (cf. Central Limit Theorem).

I have no source for my statement. In fact, I believe I might be wrong. But I still think it is an interesting question: if I am dealing with data that isn't Gaussian, how would I go about making a goodness-of-fit estimate, considering I can't use χ²?

Thanks.

Dickfore · Sep 9, 2011

Niles said:

By "parameters" I mean parameters used to make a fit to the data.

And by parameters I meant coefficients that characterize the probability density function, just like the expectation a and the standard deviation [itex]\sigma[/itex] in the normal distribution [itex]\mathcal{N}(a, \sigma)[/itex], or the endpoints a and b in the uniform distribution [itex]\mathcal{U}(a, b)[/itex] or the parameter [itex]\lambda[/itex] in the Poisson distribution [itex]\mathcal{P}(\lambda)[/itex].

jambaugh · Sep 9, 2011

The "if the data points are normally distributed" part may be justified by invoking the Central Limit Theorem. If the data points are sums or averages of many random variables, then one may assume they are "close to" normally distributed, and thus the statistic is "close to" chi-squared.
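As a rough numerical illustration of this point, here is a small simulation sketch. It is not from the book under discussion; the straight-line model, the sample sizes, the unit noise variance, and the function names `fit_line_chi2` and `simulate` are all assumptions chosen for the example. It fits a line (n = 2 parameters) to m = 10 points many times and compares the observed chi-square under Gaussian noise with the same statistic under uniform noise of equal variance.

```python
import random
import statistics


def fit_line_chi2(x, y, sigma):
    """Least-squares fit of y = a + b*x; return the chi-square of the residuals."""
    m = len(x)
    xbar = sum(x) / m
    ybar = sum(y) / m
    sxx = sum((xi - xbar) ** 2 for xi in x)
    sxy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
    b = sxy / sxx            # fitted slope
    a = ybar - b * xbar      # fitted intercept
    return sum(((yi - a - b * xi) / sigma) ** 2 for xi, yi in zip(x, y))


def simulate(noise, m=10, trials=20000, seed=0):
    """Return (mean, variance) of the chi-square statistic over many fits."""
    rng = random.Random(seed)
    x = [float(i) for i in range(m)]
    chi2 = []
    for _ in range(trials):
        # Hypothetical "true" line 1 + 0.5*x plus zero-mean noise of variance 1.
        y = [1.0 + 0.5 * xi + noise(rng) for xi in x]
        chi2.append(fit_line_chi2(x, y, sigma=1.0))
    return statistics.fmean(chi2), statistics.variance(chi2)


def gauss_noise(rng):
    return rng.gauss(0.0, 1.0)


def uniform_noise(rng):
    # Uniform on [-sqrt(3), sqrt(3)] has mean 0 and variance 1.
    return rng.uniform(-3 ** 0.5, 3 ** 0.5)


mean_g, var_g = simulate(gauss_noise)
mean_u, var_u = simulate(uniform_noise)
print("Gaussian noise: mean %.2f, variance %.2f" % (mean_g, var_g))
print("Uniform noise:  mean %.2f, variance %.2f" % (mean_u, var_u))
```

A chi-squared distribution with k degrees of freedom has mean k and variance 2k, so with m - n = 8 the Gaussian run should land near 8 and 16. Under these assumptions the uniform run should also have a mean near 8, but a noticeably smaller variance, which suggests that m-n still sets the expected value while the chi-squared shape itself depends on normality.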

(BTW: one should say "regardless of" or "irrespective of" or even "irregarding" but not "irregardless".)

Niles · Sep 9, 2011

For the fact that the data has to be normally distributed, see http://people.richland.edu/james/lecture/m170/ch12-fit.html

"[...] This goes back to the requirement that the data be normally distributed."

Niles · Sep 9, 2011

jambaugh said:

(BTW: one should say "regardless of" or "irrespective of" or even "irregarding" but not "irregardless".)

I did not know that, thank you (http://en.wikipedia.org/wiki/Irregardless).

Dickfore · Sep 9, 2011

Cochran's theorem (http://en.wikipedia.org/wiki/Cochran%27s_theorem) gives the precise conditions under which the distribution is chi-square and what the number of degrees of freedom is.
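For the least-squares setting asked about in this thread, the result can be sketched as follows. This assumes a model that is linear in the parameters, with independent errors of known, equal variance; the symbols A, θ, ε and H are notation introduced here for illustration, not taken from the book.

[tex]
y = A\theta + \varepsilon, \qquad \varepsilon \sim \mathcal{N}(0, \sigma^2 I_m), \qquad A \in \mathbb{R}^{m \times n},\ \operatorname{rank} A = n
[/tex]

[tex]
\hat\theta = (A^T A)^{-1} A^T y, \qquad H = A (A^T A)^{-1} A^T
[/tex]

[tex]
\chi^2_{\text{obs}} = \frac{\lVert y - A\hat\theta \rVert^2}{\sigma^2} = \frac{\varepsilon^T (I_m - H)\, \varepsilon}{\sigma^2} \sim \chi^2_{m-n}
[/tex]

The second equality holds because [itex](I_m - H)A = 0[/itex], so the residual depends only on ε. The matrix [itex]I_m - H[/itex] is a symmetric projection of rank m-n, which is where the m-n comes from; normality of ε is what makes the quadratic form exactly chi-square. Without normality, the expectation of the statistic is still m-n under these assumptions, but its distribution is in general not χ².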
