Relation within Gauss-Newton method for minimization

SUMMARY

The Gauss-Newton method is used to minimize the sum of squared residuals in nonlinear regression models of the form $Y_i=f(z_i,\theta)+\epsilon_i$. The update of the parameter $\theta$ from iteration $t$ to $t+1$ is $\theta^{(t+1)}=\theta^{(t)}+[(A^{(t)})^TA^{(t)}]^{-1}(A^{(t)})^Tx^{(t)}$, where $A^{(t)}$ is the Jacobian matrix whose $i$-th row is $f'(z_i,\theta^{(t)})^T$ and $x^{(t)}$ is the residual vector with $i$-th entry $Y_i-f(z_i,\theta^{(t)})$. This formula follows from minimizing the linearized residuals in the nonlinear least-squares objective.
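As a concrete illustration, one Gauss-Newton iteration can be sketched in code. The model $f(z,\theta)=\theta_0 e^{\theta_1 z}$ and all parameter values below are a hypothetical example, not taken from the thread:

```python
import numpy as np

# Hypothetical model: f(z, theta) = theta_0 * exp(theta_1 * z)
def f(z, theta):
    return theta[0] * np.exp(theta[1] * z)

def jacobian(z, theta):
    # Row i is f'(z_i, theta)^T: partial derivatives w.r.t. theta_0 and theta_1
    return np.column_stack([np.exp(theta[1] * z),
                            theta[0] * z * np.exp(theta[1] * z)])

def gauss_newton(z, y, theta, n_iter=20):
    for _ in range(n_iter):
        A = jacobian(z, theta)   # A^{(t)}
        x = y - f(z, theta)      # residual vector x^{(t)}
        # theta^{(t+1)} = theta^{(t)} + (A^T A)^{-1} A^T x,
        # computed by solving the normal equations rather than inverting A^T A
        theta = theta + np.linalg.solve(A.T @ A, A.T @ x)
    return theta

# Noiseless data from true parameters (2.0, -0.5); starting nearby,
# the iteration should recover them
z = np.linspace(0.0, 4.0, 25)
y = f(z, np.array([2.0, -0.5]))
theta_hat = gauss_newton(z, y, np.array([1.5, -0.4]))
```

On noiseless data with a reasonable starting point the iterates converge rapidly to the true parameters; with a poor starting point or ill-conditioned $A^{(t)}$, Gauss-Newton can diverge, which is one motivation for the Levenberg-Marquardt damping mentioned below.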

PREREQUISITES
  • Understanding of nonlinear regression models
  • Familiarity with the Gauss-Newton optimization algorithm
  • Knowledge of matrix calculus and Jacobian matrices
  • Proficiency in deriving and manipulating equations related to least squares
NEXT STEPS
  • Study the derivation of the Gauss-Newton update formula in detail
  • Learn about Jacobian matrices and their role in optimization
  • Explore nonlinear regression techniques and their applications
  • Investigate alternative optimization methods such as Levenberg-Marquardt
USEFUL FOR

Statisticians, data scientists, and researchers involved in nonlinear regression analysis and optimization techniques will benefit from this discussion.

i_a_n
Suppose we study model fit for a nonlinear regression model $Y_i=f(z_i,\theta)+\epsilon_i$, $i=1,...,n$. In the Gauss-Newton method, the update of the parameter $\theta$ from step $t$ to $t+1$ minimizes the sum of squares $\sum_{i=1}^{n}[Y_i-f(z_i,\theta^{(t)})-(\theta-\theta^{(t)})^Tf'(z_i,\theta^{(t)})]^2$. Can we prove (part 1) that the update is given by $\theta^{(t+1)}=\theta^{(t)}+[(A^{(t)})^TA^{(t)}]^{-1}(A^{(t)})^Tx^{(t)}$, (part 2) where $A^{(t)}$ is the matrix whose $i$-th row is $f'(z_i,\theta^{(t)})^T$, and $x^{(t)}$ is the column vector whose $i$-th entry is $Y_i-f(z_i,\theta^{(t)})$?
Any solution or hints? How to derive those relationships?

Thanks in advance!
 
ianchenmu said:
If we study model fit on a nonlinear regression model $Y_i=f(z_i,\theta)+\epsilon_i$ ... Any solution or hints? How to derive those relationships?

How are those iterations (or updates) defined in the Gauss-Newton method?
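A sketch of the derivation, writing $\delta=\theta-\theta^{(t)}$ and using the notation from the question: the objective to be minimized is

$$S(\delta)=\sum_{i=1}^{n}\bigl[Y_i-f(z_i,\theta^{(t)})-\delta^T f'(z_i,\theta^{(t)})\bigr]^2=\|x^{(t)}-A^{(t)}\delta\|^2,$$

since the $i$-th entry of $x^{(t)}$ is $Y_i-f(z_i,\theta^{(t)})$ and the $i$-th row of $A^{(t)}$ is $f'(z_i,\theta^{(t)})^T$, so $(A^{(t)}\delta)_i=\delta^T f'(z_i,\theta^{(t)})$. This is an ordinary linear least-squares problem in $\delta$; setting the gradient to zero gives the normal equations

$$(A^{(t)})^TA^{(t)}\,\delta=(A^{(t)})^Tx^{(t)}.$$

When $(A^{(t)})^TA^{(t)}$ is invertible, $\delta=[(A^{(t)})^TA^{(t)}]^{-1}(A^{(t)})^Tx^{(t)}$, and hence $\theta^{(t+1)}=\theta^{(t)}+[(A^{(t)})^TA^{(t)}]^{-1}(A^{(t)})^Tx^{(t)}$, as claimed.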
 
