The implicit function theorem in 2 vbls is pretty easy to understand. how about an explanation instead of a real proof?
Technically the statement is that if f(x,y) is a function of two variables with continuous partial derivatives at a point (a,b), and if the y partial at (a,b) is non zero, then there is a rectangular neighborhood of (a,b) in which the locus of points where f(x,y) = f(a,b) is a curve in the (x,y) plane that looks like the graph of a differentiable function of x, say y = g(x).
What this hypothesis means, is that the function f(x,y) divides the (x,y) plane near (a,b) up into differentiable curves where the value of f is constant, and that the curve that passes through (a,b), i.e. the curve of points where f(x,y) has the same value it has at (a,b), is a smooth curve.
Moreover since the y partial of f at (a,b) is not zero, this means the tangent line to this "level curve" is not vertical. (The gradient vector is the vector perpendicular to this curve, and the y partial not being zero means the gradient does have a y component, so the perpendicular vector, tangent to the curve has an x component, thus it is not vertical.)
OK, now we see that the hypothesis of the theorem says that the curve of points where f(x,y) has the same value it has at (a,b), is a smooth curve whose tangent vector is not vertical at (a,b). But that just means this curve is not vertical either, i.e. it passes the vertical line test near that point, i.e. is locally the graph of a function.
Easier?: consider a linear function of two variables f(x,y) = rx+sy+c such that s is not zero. Then since s is not zero, the line defined by setting this expression equal to c, is not vertical, hence is the graph of a function, since we can divide by s and get y as a function of x.
Now if f is any differentiable function, then near (a,b), f is approximated well by the linear function (df/dx)(a,b) (x-a) + (df/dy)(a,b) (y-b) + f(a,b), i.e. by a linear function rx+sy+c, as above where r is the x derivative and s is the y derivative of f.
The hypothesis of the implicit function theorem is that the line defined by setting this linear approximation equal to f(a,b) is not vertical. Since that line is tangent to the curve obtained by setting f(x,y) = f(a,b), then that curve is not vertical either. That's it.
In general the implicit function theorem just says if in the linear approximation to f, at a point p, you can solve for some of the variables in terms of the others, then that is also true for the original f, at least near the given point.
well maybe I am proving the subject is hard, but a picture would show it isn't in a second.