Multiple linear regression: partial F-test
Old Jun18-09, 10:00 PM                  #1
kingwinner

Multiple linear regression: partial F-test

"Suppose that in a MULTIPLE linear regression analysis, it is of interest to compare a model with 3 independent variables to a model with the same response variable and these same 3 independent variables plus 2 additional independent variables.
As more predictors are added to the model, the coefficient of multiple determination (R^2) will increase, so the model with 5 predictor variables will have a higher R^2.
The partial F-test for the coefficients of the 2 additional predictor variables (H_0: β_4=β_5=0) is equivalent to testing that the increase in R^2 is statistically significant."


I don't understand the last sentence (the one about the partial F-test). Why are they equivalent?

Thanks for explaining!
Old Jun19-09, 09:16 AM                  #2
statdad

Re: Multiple linear regression: partial F-test

Mathematically, $R^2$ will increase (strictly speaking, it cannot decrease) whether or not the new variables contribute to the model. Because of this, the question in practice is whether the larger $R^2$ is due simply to the math (this corresponds to $H_0 \colon \beta_4 = \beta_5 = 0$) or whether the increase is due to at least one of the two coefficients being non-zero (this is the alternative hypothesis). If $H_0$ is rejected, we conclude that at least one coefficient is non-zero, and also that the increase in $R^2$ is due to something other than mere chance.
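A small simulation can illustrate the first point. The sketch below is only an illustration and not part of the original reply: the data, the coefficients, and the helper names (`solve_linear_system`, `r_squared`) are all made up, and it fits ordinary least squares through the normal equations in plain Python so that it runs without third-party packages. The response depends on x1, x2, x3 only, while x4 and x5 are pure noise, yet the 5-predictor fit should still report an R^2 at least as large as the 3-predictor fit.

```python
import random


def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix, so row operations act on both sides at once.
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x


def r_squared(predictors, y):
    """Fit least squares with an intercept and return R^2 = 1 - SSE/SST."""
    rows = [[1.0] + list(p) for p in predictors]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    beta = solve_linear_system(xtx, xty)
    fitted = [sum(b * v for b, v in zip(beta, r)) for r in rows]
    y_bar = sum(y) / len(y)
    sse = sum((yi - fi) ** 2 for yi, fi in zip(y, fitted))
    sst = sum((yi - y_bar) ** 2 for yi in y)
    return 1.0 - sse / sst


random.seed(0)
n = 40
x = [[random.gauss(0, 1) for _ in range(5)] for _ in range(n)]
# Hypothetical data: y depends on x1..x3 only; x4 and x5 are pure noise.
y = [1.0 + 2.0 * r[0] - 1.0 * r[1] + 0.5 * r[2] + random.gauss(0, 1) for r in x]

r2_reduced = r_squared([r[:3] for r in x], y)
r2_full = r_squared(x, y)
print(f"R^2 with 3 predictors: {r2_reduced:.4f}")
print(f"R^2 with 5 predictors: {r2_full:.4f}")
```

The size of the gap between the two printed values depends on the random seed; the partial F-test is what decides whether a gap of that size is larger than noise alone would typically produce.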

Does this help, or were you looking for a mathematical explanation?
Old Jun19-09, 03:08 PM       Last edited by kingwinner; Jun19-09 at 04:01 PM..            #3
kingwinner

Re: Multiple linear regression: partial F-test

Do you have a mathematical explanation for that?

The statement claims that the test of H_0: β_4 = β_5 = 0 is equivalent to testing that the increase in R^2 is statistically significant. What would be the equivalent null and alternative hypotheses in terms of R^2?

Thanks!
Old Jun19-09, 05:02 PM                  #4
statdad

Re: Multiple linear regression: partial F-test

Suppose you have a total of five variables (since you reference $\beta_4, \beta_5$).

We want to test

\begin{align*}
H_0 \colon & \beta_4 = \beta_5 = 0 \\
H_a \colon & \text{At least one of } \beta_4, \beta_5 \ne 0
\end{align*}

The test begins with the fitting of a full and a reduced model:

$$Y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 + \beta_4 x_4 + \beta_5 x_5 \tag{Full}$$

$$Y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \beta_3 x_3 \tag{Reduced}$$

Denote the sum of squares for error in the full model by $SSE(F) = SSE(x_1, x_2, x_3, x_4, x_5)$, and the sum of squares for error in the reduced model by $SSE(R) = SSE(x_1, x_2, x_3)$.

Since we use more variables in the full model than in the reduced model, we will see $SSE(F) \le SSE(R)$ (in practice the inequality is strict). The test statistic for the above hypotheses is

$$F = \frac{SSE(R) - SSE(F)}{(n-4) - (n-6)} \div \frac{SSE(F)}{n-6}$$

In the old days (to be read as "when statdad was in school") the numerator of this statistic was written as

$$SSE(R) - SSE(F) = SSE(X_1, X_2, X_3) - SSE(X_1, X_2, X_3, X_4, X_5) = SSR(X_4, X_5 \mid X_1, X_2, X_3)$$

Think of the last notation ("sum of squares Reduction") as denoting the reduction in variation from adding $x_4, x_5$ to a model that already contains the other three variables. The test is done by comparing $F$ to the appropriate tables, here those of the F distribution with 2 and $n-6$ degrees of freedom.
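As a rough sketch of the arithmetic in that statistic, the function below computes the partial F from the two error sums of squares. The function name `partial_f_from_sse`, its parameter names, and the numbers passed to it are hypothetical, chosen only to make the calculation easy to follow; they are not from any real data set.

```python
def partial_f_from_sse(sse_reduced, sse_full, n, k_reduced, k_full):
    """Partial F statistic for nested models, from their error sums of squares.

    k_reduced and k_full count predictors, not including the intercept.
    """
    df_reduced = n - (k_reduced + 1)  # n - 4 when the reduced model has 3 predictors
    df_full = n - (k_full + 1)        # n - 6 when the full model has 5 predictors
    numerator = (sse_reduced - sse_full) / (df_reduced - df_full)
    denominator = sse_full / df_full
    return numerator / denominator


# Hypothetical values: n = 30 observations, SSE(R) = 120, SSE(F) = 90.
f_stat = partial_f_from_sse(sse_reduced=120.0, sse_full=90.0, n=30, k_reduced=3, k_full=5)
print(f_stat)  # (30 / 2) / (90 / 24) = 15 / 3.75 = 4.0
```

Instead of printed tables, the upper-tail probability can be obtained from software; for instance, if SciPy is installed, `scipy.stats.f.sf(f_stat, 2, 24)` returns the p-value for these degrees of freedom.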

How is this related to $R^2$? It isn't, directly; it is related to something called a coefficient of partial determination. The first bit of notation is this:

$$r^2_{Y45.123}$$

In the subscript the numbers to the left of the "." are the dependent variable and the "number label" of the variables being added to the model, while the numbers to the right of the "." are the "number labels" of the variables originally in the model. The coefficient of partial determination is calculated as

$$r^2_{Y45.123} = \frac{SSR(X_4, X_5 \mid X_1, X_2, X_3)}{SSE(X_1, X_2, X_3)}$$

Technically, this measures the proportional reduction in the error sum of squares that results when we move from the model with 3 variables to the model with all 5 variables.

When the F-test referred to above is significant ($H_0$ is rejected), this coefficient of partial determination indicates a significant change in $R^2$.
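The connection to $R^2$ can be written out explicitly. As a sketch, let $SST$ denote the total sum of squares (a symbol introduced here, not used earlier in this post), and write $R^2_F$ and $R^2_R$ for the $R^2$ of the full and reduced models. Both models have the same response, so they share the same $SST$, and $SSE = SST\,(1 - R^2)$ in each. Then

$$r^2_{Y45.123} = \frac{SSE(R) - SSE(F)}{SSE(R)} = \frac{SST\,(1 - R^2_R) - SST\,(1 - R^2_F)}{SST\,(1 - R^2_R)} = \frac{R^2_F - R^2_R}{1 - R^2_R}$$

so the coefficient of partial determination is the increase in $R^2$ expressed as a fraction of the variation the reduced model left unexplained.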

Hope this helped.
Old Jun20-09, 07:56 AM                  #5
kingwinner

Re: Multiple linear regression: partial F-test

Thanks!

R^2 = regression SS/total SS

F = [(R^2_full - R^2_reduced) / (5 - 3)] / [(1 - R^2_full) / (n - 5 - 1)],
where R^2_full is the R^2 with 5 independent variables and R^2_reduced is the R^2 with 3 independent variables.
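A minimal sketch of this form of the statistic, for checking the arithmetic by hand. The function name `partial_f_from_r2` and all of the numbers are hypothetical: a total SS of 200 with error sums of squares of 120 (3 predictors) and 90 (5 predictors) on n = 30 observations.

```python
def partial_f_from_r2(r2_full, r2_reduced, n, k_full, k_reduced):
    """Partial F statistic written in terms of the two R^2 values.

    k_full and k_reduced count predictors, not including the intercept.
    """
    numerator = (r2_full - r2_reduced) / (k_full - k_reduced)
    denominator = (1.0 - r2_full) / (n - k_full - 1)
    return numerator / denominator


# Hypothetical sums of squares, converted to R^2 = 1 - SSE / total SS.
sst = 200.0
r2_reduced = 1.0 - 120.0 / sst  # about 0.40
r2_full = 1.0 - 90.0 / sst      # about 0.55
f_stat = partial_f_from_r2(r2_full, r2_reduced, n=30, k_full=5, k_reduced=3)
print(round(f_stat, 6))  # 4.0
```

With these numbers the SSE form, [(120 - 90) / 2] / [90 / 24], also works out to 4, which is what one would expect if the two forms are the same statistic.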

Based on this form of the F statistic, can we say that the partial F-test for the coefficients of the 2 additional predictor variables (H_0: β_4=β_5=0) is equivalent to testing that the increase in R^2 is statistically significant?
Old Jun20-09, 08:21 AM                  #6
statdad

Re: Multiple linear regression: partial F-test

Yes - good job.
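For completeness, here is a sketch of why the two forms agree. Write $SST$ for the total SS (the same for both models, since the response is the same), so that $SSE = SST\,(1 - R^2)$ for each model, with $R^2_F$ and $R^2_R$ denoting the full and reduced values. Substituting into the SSE form of the statistic:

$$F = \frac{[SSE(R) - SSE(F)]/2}{SSE(F)/(n-6)} = \frac{SST\,(R^2_F - R^2_R)/2}{SST\,(1 - R^2_F)/(n-6)} = \frac{(R^2_F - R^2_R)/(5-3)}{(1 - R^2_F)/(n-5-1)}$$

The $SST$ factors cancel, so the partial F statistic depends on the data only through the two $R^2$ values. It is zero exactly when $R^2$ does not increase, and it grows as the increase $R^2_F - R^2_R$ grows relative to the unexplained fraction $1 - R^2_F$, which is the sense in which testing $H_0 \colon \beta_4 = \beta_5 = 0$ and testing the increase in $R^2$ are the same test.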