Dixanadu
- 250
- 2
Homework Statement
Hi guys,
so the problem is as follows:
A set of n independent measurements y_{i}, i=1...n are treated as Gaussian, each with standard deviations \sigma_{i}. Each measurement corresponds to a value of a control variable x_{i}. The expectation value of y is given by
f(x;\alpha,\beta)=\alpha x +\beta x^{2}.
1) Find the log-likelihood function for the parameters \alpha,\beta.
2) Show that the least-squares estimators for \alpha,\beta can be found from the solution of a system of equations as follows:
\begin{pmatrix}<br /> a & b \\<br /> c & d <br /> \end{pmatrix}<br /> \left( \begin{array}{c}<br /> \alpha \\<br /> \beta <br /> \end{array}\right) = <br /> \left( \begin{array}{c}<br /> e \\<br /> f <br /> \end{array} \right)
and find the quantities a,b,c,d,e and f as functions of x_{i}, y_{i}, \sigma_{i}.
Homework Equations
[/B]
least squares estimators are
\chi^{2}(\alpha,\beta)=\sum_{i=1}^{n}\frac{1}{\sigma_{i}^{2}}(y_{i}-f(x_{i};\alpha,\beta)^{2})
if the measurements are not independent, then given the covariance matrix V, the least squares estimators are given by
\chi^{2}(\vec{\theta})\sum_{i,j=1}^{N}(y_{i}-f(x_{i};\vec{\theta}))(V^{-1})_{ij}(y_{j}-f(x_{j};\vec{\theta}))
where the \vec{\theta} is the vector of parameters we wish to estimate.
The Attempt at a Solution
[/B]
Right so I'm pretty sure I've solved the first part:
1)
2)
This is where I get stuck. To find the least squares estimators from the chi-squared thing, I have to put it in matrix form, differentiate, set it equal to 0 and solve the resulting system of equations. So in matrix form, since our measurements are all independent, we have
\chi^{2}(\alpha,\beta)=(\vec{y}-A\vec{\theta})^{2}(V^{-1})_{ij}
where A_{ij} is given by f(x_{i};\vec{\theta})=\sum_{j=1}^{m}a_{j}(x_{i})\theta_{j}=\sum_{j=1}^{m}A_{ij}\theta_{j}
However, in our case, we already have this quantity because
\sum_{j=1}^{m}a_{j}(x_{i})\theta_{j}=\alpha x +\beta x^{2}
aaaand this is my problem - I have no idea how to extract the A_{ij} matrix out of this, and even more confusing is: how is it square? if the i index runs from 1...n (the measurements) and j runs from 1,2 (the number of parameters) then how am I supposed to cast this into the square matrix equation above?
Anyway, I did differentiate the chi-squared thing and set it equal to 0, which gives me
A\vec{\theta}=\vec{y}
Which fits the system of equations provided that A is square...I don't see how this works...please help!