When is a Linear Model not Good despite r^2 close to 1?

In summary, the thread discusses how linear models in least-squares regression can be inadequate even when r and r^2 are close to 1. The distribution of the residuals and other checks, such as the lack-of-fit sum-of-squares F-test and inference for the regression slope, help determine whether a linear model suits a data set. Adjusted R^2 and Akaike Information Criterion (AIC) values are also important statistics to consider; if the adjusted R^2 is low, other models, such as quadratic or cubic ones, may need to be considered. A recommendation is made for "Eureqa", a program developed at Cornell, to assist with finding the best fit for a data set.
  • #1
Bacle
Hi, All:
I was reading about cases in which linear models in least-squares regression were found to be ineffective, despite values of r and r^2 being close to 1 (obviously, the two go together).
I think the issue has to do with the residuals: instead of looking like random, normally distributed noise, their distribution shows a distinct pattern, e.g., a residual plot that traces a parabola, a cubic, etc.
I am curious whether anyone knows of examples and/or results in this respect, and what other checks can be made to see whether a linear model makes sense for a data set. The checks I know of are the lack-of-fit sum-of-squares F-test and inference for regression (with H0: slope = 0).

Thanks.
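A minimal sketch (in Python, not from the original post) of the situation described above: a straight line fitted to mildly curved data gives r^2 very close to 1, yet the residuals trace a parabola rather than random noise. The data here are simulated purely for illustration.
[code]
# Minimal sketch: fit a straight line to mildly curved data and inspect r^2
# and the residual pattern.  Data are simulated for illustration only.
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0, 10, 50)
y = 2.0 * x + 0.15 * x**2 + rng.normal(scale=0.3, size=x.size)  # mild curvature

slope, intercept = np.polyfit(x, y, 1)       # ordinary least-squares line
residuals = y - (intercept + slope * x)

ss_res = np.sum(residuals**2)
ss_tot = np.sum((y - y.mean())**2)
print(f"r^2 = {1 - ss_res / ss_tot:.4f}")    # typically > 0.99 here

# Mean residual in the left, middle, and right thirds of the x-range:
# positive at both ends and negative in the middle -- a parabolic pattern
# that a well-specified linear model should not show.
thirds = np.array_split(residuals, 3)
print([round(float(t.mean()), 3) for t in thirds])
[/code]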
 
  • #2
Another way: suppose there is overfitting, or not enough data points for the number of dimensions. If you have 100 data points but are fitting a model with 100 different dimensions, it doesn't matter how good your correlation is.
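A short sketch (Python, simulated data, not from the post) of this overfitting point: with as many free parameters as observations, least squares can reproduce pure noise almost exactly, so a near-perfect in-sample r^2 carries no information.
[code]
# Sketch of the overfitting point: 100 observations, 100 random regressors,
# response is pure noise with no relation to the regressors.
import numpy as np

rng = np.random.default_rng(1)
n = 100
X = rng.normal(size=(n, n))     # as many free parameters as data points
y = rng.normal(size=n)          # pure noise

beta, *_ = np.linalg.lstsq(X, y, rcond=None)
fitted = X @ beta
r2 = 1 - np.sum((y - fitted)**2) / np.sum((y - y.mean())**2)
print(f"in-sample r^2 = {r2:.6f}")   # essentially 1.0 despite zero real signal
[/code]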
 
  • #3
A high [itex] R^{2} [/itex] is not the only important statistic to check. I prefer the adjusted [itex] R^{2} [/itex], because adding more parameters tends to inflate the ordinary [itex] R^{2} [/itex].
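For reference, a small sketch of the adjusted R^2 being recommended here; the function name and the example numbers are illustrative only.
[code]
# Adjusted R^2 = 1 - (1 - R^2) * (n - 1) / (n - p - 1), where n is the number
# of observations and p the number of regressors (excluding the intercept).
def adjusted_r_squared(r_squared, n, p):
    return 1.0 - (1.0 - r_squared) * (n - 1) / (n - p - 1)

# The same raw R^2 = 0.95 looks far less impressive when it took 20 regressors
# on only 30 observations to achieve it.
print(adjusted_r_squared(0.95, n=100, p=2))   # about 0.949
print(adjusted_r_squared(0.95, n=30, p=20))   # about 0.839
[/code]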
 
  • #4
Thanks, Pyrrhus:

What do I do then if the adjusted R^2 is low? Do I start considering linear models in two or more variables, or do I consider quadratic, cubic, etc. models?
 
  • #5
You could try adding squared terms and interaction terms, but if the R-squared is still low, it might just be that the regressors don't do a good job of explaining the dependent variable.
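A sketch of this suggestion using the statsmodels formula interface; the DataFrame df and the column names y, x1, x2 are hypothetical placeholders, and the simulated data exist only to make the snippet runnable.
[code]
# Hypothetical example: compare a plain linear fit with one that adds a
# squared term and an interaction term, using adjusted R^2 to judge whether
# the extra terms earn their keep.  Column names y, x1, x2 are placeholders.
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(2)
df = pd.DataFrame({"x1": rng.normal(size=200), "x2": rng.normal(size=200)})
df["y"] = 1 + 2 * df["x1"] + 0.5 * df["x1"]**2 + df["x1"] * df["x2"] \
          + rng.normal(size=200)

plain     = smf.ols("y ~ x1 + x2", data=df).fit()
augmented = smf.ols("y ~ x1 + x2 + I(x1**2) + x1:x2", data=df).fit()

# If adjusted R^2 stays low even with the extra terms, the regressors may
# simply not explain the dependent variable well.
print(plain.rsquared_adj, augmented.rsquared_adj)
[/code]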
 
  • #6
Try this incredible free tool, Eureqa (http://creativemachines.cornell.edu/eureqa), developed at Cornell. I've used it in my own research, rating fits by adjusted R^2 and Akaike Information Criterion (AIC) values.
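Eureqa is a stand-alone tool, but the same idea of ranking candidate fits by adjusted R^2 and AIC can be sketched with statsmodels; the candidate models and simulated data below are illustrative only (lower AIC indicates a better fit-versus-complexity trade-off).
[code]
# Rank candidate polynomial fits by adjusted R^2 and AIC (lower AIC is better).
# The candidate set and simulated data are illustrative only.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(3)
x = np.linspace(0, 10, 80)
y = 1 + 0.5 * x + 0.2 * x**2 + rng.normal(scale=1.0, size=x.size)

candidates = {
    "linear":    sm.add_constant(np.column_stack([x])),
    "quadratic": sm.add_constant(np.column_stack([x, x**2])),
    "cubic":     sm.add_constant(np.column_stack([x, x**2, x**3])),
}
for name, X in candidates.items():
    fit = sm.OLS(y, X).fit()
    print(f"{name:10s} adj R^2 = {fit.rsquared_adj:.4f}  AIC = {fit.aic:.1f}")
[/code]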
 
  • #7
Excellent, thanks!
 

1. What is a linear model and what does r^2 represent?

A linear model is a statistical method that describes the relationship between a dependent variable and one or more independent variables with a straight-line (linear) equation. The r^2 value, also known as the coefficient of determination, represents the proportion of the variation in the dependent variable that is explained by the independent variable(s).
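In symbols, using the standard definition (residual sum of squares over total sum of squares):

[tex] R^{2} = 1 - \frac{\sum_i \left( y_i - \hat{y}_i \right)^2}{\sum_i \left( y_i - \bar{y} \right)^2} [/tex]

so R^2 = 1 means the fitted values reproduce the observations exactly, and R^2 = 0 means the model does no better than predicting the mean.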

2. Can a linear model have a high r^2 value but still not be a good fit?

Yes, a linear model can have a high r^2 value but still not be a good fit. This can occur when the data does not follow a linear trend and the model is unable to accurately capture the relationship between the variables.

3. What are some reasons for a linear model to not be good despite a high r^2 value?

There are several reasons why a linear model may not be a good fit despite a high r^2 value. These include: outliers or influential data points, non-linear relationships between variables, and omitted variables that are important in explaining the variation in the data.

4. How can I determine if a linear model is not a good fit despite a high r^2 value?

To determine if a linear model is not a good fit despite a high r^2 value, you can examine the residual plots and check for patterns or heteroscedasticity (unequal variance). You can also perform hypothesis tests, such as the F-test, to assess the overall significance of the model.
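A sketch of these checks (a residual-versus-fitted plot, the overall F-test, and a Breusch-Pagan test for unequal variance) using statsmodels and matplotlib; the simulated data and variable names are illustrative only.
[code]
# Residual diagnostics for a fitted OLS model: a residual-versus-fitted plot,
# the overall F-test, and a Breusch-Pagan test for heteroscedasticity.
# The simulated data (variance growing with x) are illustrative only.
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.diagnostic import het_breuschpagan
import matplotlib.pyplot as plt

rng = np.random.default_rng(4)
x = np.linspace(1, 10, 100)
y = 3 + 2 * x + rng.normal(scale=0.5 * x)       # error variance grows with x
X = sm.add_constant(x)
fit = sm.OLS(y, X).fit()

plt.scatter(fit.fittedvalues, fit.resid)        # a funnel shape suggests unequal variance
plt.axhline(0.0)
plt.xlabel("fitted values")
plt.ylabel("residuals")
plt.show()

print(f"overall F-test p-value:  {fit.f_pvalue:.4g}")
lm_stat, lm_pvalue, f_stat, f_pvalue = het_breuschpagan(fit.resid, X)
print(f"Breusch-Pagan p-value:   {lm_pvalue:.4g}")   # small value => heteroscedasticity
[/code]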

5. Is it possible to improve a linear model that has a high r^2 value but is not a good fit?

Yes, it is possible to improve a linear model that has a high r^2 value but is not a good fit. This can be done by transforming the data, adding additional variables, or considering alternative models, such as polynomial regression or non-linear regression. It is important to assess the assumptions of the model and make adjustments accordingly.
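A sketch of those improvement routes (a log transformation of the response and a quadratic term) compared side by side with statsmodels; the data-generating process here is purely illustrative.
[code]
# Compare a plain linear fit, a log-transformed response, and a quadratic fit.
# The exponential data-generating process is illustrative only.  Note that
# R^2 computed on a transformed response is not directly comparable to R^2 on
# the original scale, so residual plots remain the more reliable check.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(5)
x = np.linspace(1, 10, 120)
y = np.exp(0.4 * x) * rng.lognormal(sigma=0.1, size=x.size)   # exponential growth

linear    = sm.OLS(y,         sm.add_constant(x)).fit()
log_model = sm.OLS(np.log(y), sm.add_constant(x)).fit()
quadratic = sm.OLS(y,         sm.add_constant(np.column_stack([x, x**2]))).fit()

for name, fit in [("linear", linear), ("log(y) ~ x", log_model),
                  ("quadratic", quadratic)]:
    print(f"{name:12s} adj R^2 = {fit.rsquared_adj:.3f}")
[/code]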
