- #1
CantorSet
- 44
- 0
Hi everyone,
This is not a homework question. I just want to understand an aspect of linear regression better. The book "Applied Linear Models" by Kutchner et al, states that a linear regression model is of the form
[tex] Y_i = B_0 + B_1 X_i + \epsilon_i [/tex]
where
[itex] Y_i [/itex] is the value of the response variable in the ith trial
[itex] B_0, B_1 [/itex] are parameters
[itex] X_i [/itex] is a known constant
[itex] \epsilon_i [/itex] is a random variable, normally distributed.
Therefore, [itex]Y_i [/itex] is also a random variable, normally distributed but [itex]X_i [/itex] is a constant.
This confused me a bit because I always associated linear regression with the bivariate normal distribution. That is, the underlying assumption of linear regression is the data [itex]\{(x_1,y_1), (x_2,y_2),...,(x_n,y_x) \} [/itex] is sampled from a bivariate normal distribution. In which case, both X and Y are random variables. But in the formulation above, X is a known constant, while [itex]\epsilon[/itex] and therefore [itex]Y[/itex] are the random variables.
So in summary, what is the connection (if any) is between linear regression as formulated by Kutner and the bivariate normal.
This is not a homework question. I just want to understand an aspect of linear regression better. The book "Applied Linear Models" by Kutchner et al, states that a linear regression model is of the form
[tex] Y_i = B_0 + B_1 X_i + \epsilon_i [/tex]
where
[itex] Y_i [/itex] is the value of the response variable in the ith trial
[itex] B_0, B_1 [/itex] are parameters
[itex] X_i [/itex] is a known constant
[itex] \epsilon_i [/itex] is a random variable, normally distributed.
Therefore, [itex]Y_i [/itex] is also a random variable, normally distributed but [itex]X_i [/itex] is a constant.
This confused me a bit because I always associated linear regression with the bivariate normal distribution. That is, the underlying assumption of linear regression is the data [itex]\{(x_1,y_1), (x_2,y_2),...,(x_n,y_x) \} [/itex] is sampled from a bivariate normal distribution. In which case, both X and Y are random variables. But in the formulation above, X is a known constant, while [itex]\epsilon[/itex] and therefore [itex]Y[/itex] are the random variables.
So in summary, what is the connection (if any) is between linear regression as formulated by Kutner and the bivariate normal.