rayge
- 25
- 0
Homework Statement
f(x) is the function we want to minimize. Beyond being real-valued, there are no other conditions on it. (I'm surprised it's not at least continuous, but the book doesn't say that's a condition.) We choose the next x^k through the relation x^k = x^{k-1} + \alpha_{k}d^k. We assume d^k is a descent direction. That is, for small positive \alpha, f(x^{k-1} + \alpha d^k) is decreasing.
Here's the lemma we want to prove:
When x^k is constructed using the optimal \alpha, we have \nabla f(x^k) \cdot d^k = 0
The Attempt at a Solution
It's suggested in the book that we should differentiate the function f(x^{k-1} + \alpha d^k) with respect to \alpha. My problem is that I don't know how to differentiate a function that isn't defined. My first guess went something like this, but I don't see how I'm any closer to a solution.
\frac{\partial}{\partial \alpha} f(x^{k-1} + \alpha d^k) = \frac{\partial}{\partial \alpha} f(x^k)
f'(x^{k-1} + \alpha d^k)d^k = f'(x^k)(0)
f'(x^{k-1} + \alpha d^k)d^k = 0
I'm really not looking for an answer, but if someone could point me to where I could learn about taking differentials of undefined functions that would be helpful. I'm guessing that somehow I can extract a gradient out of this, and a dot product, but I'm feeling pretty confused.