I'm just learning this theory and the maths is really trivial but the theory is slightly confusing me. I understand that if we have some function z=f(x,y) and we graph this on a three dimensional set of axis we will have some surface, we can then extend this by creating level curves in the x, y plane for differing values of z. We then set up a function g(x,y)=k such that k is a constant, we call this our constraint curve and it looks something as shown in the link I have provided. http://en.wikipedia.org/wiki/File:LagrangeMultipliers2D.svg I understand how the theory that the maximum/minimum value will occur when the grad of function f and the grad of function g are scalar multiples of each other. But imagine for a second that the constraint curve was a straight line and it went straight through the centre of the circle created by the level curve of the function f. In that case the maximum value would be at the centre of this level curve, but to my knowledge at no time were the gradients parallel. Or imagine a sloping flat surface sloping up. If the constraint curve started off running up the slope then for a second ran tangent to one of the level curves and then ran at some angle up this surface and stopped, then the point at which the gradients were scalar multiples only in the middle of the curves, so the supposed maximum/minimum occured at a random place. If you could clarify this it would be extremely helpful thanks!