Simple regression: not including the intercept term

  • Thread starter Thread starter 939
  • Start date Start date
  • Tags Tags
    Regression Term
Click For Summary
In simple regression, omitting the intercept term (α) can lead to biased estimates of the slope coefficient (β) unless specific conditions are met. If the true model includes an intercept, fitting a no-intercept model will generally yield biased results. The least-squares method can still provide unbiased estimates for β when the intercept is included, even if α equals zero. Additionally, using a no-intercept model can invalidate the traditional R² statistic, complicating the analysis further. Overall, excluding the intercept is typically not advisable due to these biases and statistical limitations.
939
Messages
110
Reaction score
2

Homework Statement



The simple regression model is y = α + βx + u, where u is the error term. If you don't include α, when is β unbiased?

Homework Equations


y = α + βx + u

The Attempt at a Solution



Not including α doesn't affect whether β is unbiased because α is a constant.
 
  • Like
Likes Ray Vickson
Physics news on Phys.org
If the true model is ##y = \beta x + \epsilon##, you get an unbiased estimate of ##\beta## by using the least-squares method on the model ##\hat{y} = a + b x##---including the intercept! The point is that ##a, b## are both unbiased for the true model ##y = \alpha+\beta x + \epsilon##, and this is true even if it happens that ##\alpha = 0##. Therefore, my guess would be that the estimated obtained from the no-intercept fit ##\hat{y} = bx## would be biased. After all, the two estimates of ##\beta## would be given by different formulas in the ##(x_i, y_i)## data points, and one of the formulas gives an unbiased result.
 
  • Like
Likes 939
If \alpha \ne 0 but you fit the "no intercept" model then the estimate of the slope will be biased. To see this begin with
<br /> E(b) = E[\left( X&#039;X \right)^{-1}X&#039; y] = E[\left( X&#039;X \right)^{-1}X&#039; \left(\alpha + X \beta + \epsilon \right)]<br />

and work through the right side. You'll be able to see the only two conditions where the estimate of the slope won't be biased. Essentially - it's biased because you're fitting an incorrect model: fitting no intercept when one exists.

Regression without the intercept is rarely a good idea, for this reason AND for the fact that it means the traditional R^2 statistic is rendered useless (there are other issues as well).
 
  • Like
Likes 939
Question: A clock's minute hand has length 4 and its hour hand has length 3. What is the distance between the tips at the moment when it is increasing most rapidly?(Putnam Exam Question) Answer: Making assumption that both the hands moves at constant angular velocities, the answer is ## \sqrt{7} .## But don't you think this assumption is somewhat doubtful and wrong?

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 2 ·
Replies
2
Views
5K
Replies
1
Views
1K
Replies
4
Views
1K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
1K