Linear regression and bivariate normal, is there a relationship?

CantorSet · Aug 27, 2011

Hi everyone,

This is not a homework question. I just want to understand an aspect of linear regression better. The book "Applied Linear Models" by Kutchner et al, states that a linear regression model is of the form

[tex]Y_i = B_0 + B_1 X_i + \epsilon_i[/tex]

where
[itex]Y_i[/itex] is the value of the response variable in the ith trial
[itex]B_0, B_1[/itex] are parameters
[itex]X_i[/itex] is a known constant
[itex]\epsilon_i[/itex] is a random variable, normally distributed.
Therefore, [itex]Y_i[/itex] is also a random variable, normally distributed but [itex]X_i[/itex] is a constant.

This confused me a bit because I always associated linear regression with the bivariate normal distribution. That is, the underlying assumption of linear regression is the data [itex]\{(x_1,y_1), (x_2,y_2),...,(x_n,y_x) \}[/itex] is sampled from a bivariate normal distribution. In which case, both X and Y are random variables. But in the formulation above, X is a known constant, while [itex]\epsilon[/itex] and therefore [itex]Y[/itex] are the random variables.

So in summary, what is the connection (if any) is between linear regression as formulated by Kutner and the bivariate normal.

Stephen Tashi · Aug 27, 2011

CantorSet said:

the underlying assumption of linear regression is the data [itex]\{(x_1,y_1), (x_2,y_2),...,(x_n,y_x) \}[/itex] is sampled from a bivariate normal distribution. In which case, both X and Y are random variables.

I've never seen a treatment of regression that made that assumption. Are you confusing linear regession with some sort of "total least squares" regression?
http://en.wikipedia.org/wiki/Total_least_squares

CantorSet · Aug 27, 2011

Stephen Tashi said:

I've never seen a treatment of regression that made that assumption. Are you confusing linear regession with some sort of "total least squares" regression?
http://en.wikipedia.org/wiki/Total_least_squares

Thanks for responding, Stephen.

Yea, that was my own confusion for making that assumption. Thanks for clearing that up.

By the way, total least squares is just a generalization of linear regression in that the curve you're fitting the data points to can be polynomials with degrees higher than 1, right? Or is there more to total least squares?

Stephen Tashi · Aug 27, 2011

Total least squares treats both X and Y as random variables.

Linear regression and bivariate normal, is there a relationship?

Discussion Overview

Discussion Character

Main Points Raised

Areas of Agreement / Disagreement

Contextual Notes

Similar threads

Graduate Hypothesis testing: Defining H0, HA hypotheses so that ( H_A)_A' makes sense

Undergrad My basic understanding of set theory

Undergrad The problem of points

Graduate Expected numbers of cards of a last color remaining

Undergrad How does axiom of foundation prevent infinite sequence of elements?

Insights Revisiting the Velocity-Time Function

Insights Remote Operated Gate Control System

Insights AI Enriched Problem Solving

Insights Thinking Outside The Box Versus Knowing What’s In The Box

Insights Why Entangled Photon-Polarization Qubits Violate Bell’s Inequality

Insights Quantum Entanglement is a Kinematic Fact, not a Dynamical Effect