| Thread Closed |
inverse function theorem for surface mappings |
Share Thread | Thread Tools |
| Jun8-10, 11:44 PM | #1 |
|
|
inverse function theorem for surface mappings
Hi,
I have a limited background in differential geometry. I have a problem involving a surface mapping (from R2 to R3) which does not have a square Jacobian. I understand that for a mapping of preserved dimensionality I can compute a matrix inverse which will allow me to map tangent vector coordinates from image to domain or vice-versa. i.e. S = (x(u,v), y(u,v), z(u,v)) I have come across some sites online that seem to cavalierly stick a column [0,0,1] and then invert, and/or utilize a matrix pseudoinverse to define an inverse for this mapping. My question is what the heck are they doing and why? Do these pseudoinverse methods have any properties that make them useful? Perhaps my understanding is limited because I don't understand what the inverted tangent vectors mean in the domain space? Sorry for my lack of background and thanks for any help :) |
| Jun11-10, 09:55 PM | #2 |
|
|
If the Jacobian has maximal rank, you can certainly define an inverse if you restrict to the range.
|
| Jun12-10, 12:09 PM | #3 |
|
|
Sorry, I don't understand what you mean by restricting its range. How would I do that? Are you saying I need restrict consideration to a location where I know there is a one-to-one mapping?
And am I assured that by knowing that all of the corresponding 2x2 matrices are non-singular? |
| Jun12-10, 02:01 PM | #4 |
|
|
inverse function theorem for surface mappings |
| Jun13-10, 11:04 PM | #5 |
|
|
Thanks I think I see what you are saying and I think I can see why I am confused, could you enlighten me?
Using the surface mapping [tex] s(u,v) := (x(u,v), y(u,v), z(u,v)) [/tex] I get a Jacobian that looks like [tex] \[ \left( \begin{array}{ccc} x_u & x_v\\ y_u & y_v \\ z_u & z_v \end{array} \right)\] [/tex] Now, the vectors [tex] (x_u, y_u, z_u)[/tex] and [tex](x_v, y_v, z_v) [/tex] are the columns of the Jacobian for this mapping, but they are themselves vectors in the image of this mapping, and form a basis for the tangent space of the image (is that correct?). Then there is nothing to keep me from defining a 2 x 3 psuedoinverse matrix. If you restrict yourself to operating on linear combinations of the [tex] (x_u, y_u, z_u) [/tex] and [tex](x_v, y_v, z_v) [/tex], vectors in the image are mapped back to vectors in the domain of the mapping. My question then, is there any special significance to the inverse mapping of the vectors that defined my Jacobian? Hope that made sense. |
| Jul26-10, 11:29 AM | #6 |
|
|
The mapping [tex]s(u,v) := (x(u,v), y(u,v), z(u,v))[/tex] defines a two dimensional surface in the three dimensional space x,y,z space. The functions u and v are coordinates on this surface, and you can characterize the coordinates by drawing constant surfaces of u and v. There are a couple of concepts that are more clear if we do not use a 2D mapping, so I will add a coordinate w to the list. The mapping [tex]s(u,v,w) := (x(u,v,w), y(u,v,w), z(u,v,w))[/tex] defines new coordinate mapping of the x,y,z space. The functions u, v, and w are the new coordinates, and you can characterize the coordinates by drawing constant surfaces of u, v, and w. For example, the coordinate surfaces of u are drawn by holding u constant and drawing [tex] s(u=c,v,w) [/tex]. We then increment u by [tex]\Delta u[/tex] and draw [tex] s(u=c + \Delta u, v, w) [/tex]. We continue this process incrementing u by constant amounts and drawing [tex] s(u=c+n * \Delta u, v, w) [/tex]. We then repeat the process by holding v constant rather than u. We repeat the process again for w as well. This is the typical way we characterize a coordinate system such as cylindrical or spherical coordinates. When you define a set of coordinates, you also define a dual set of coordinate lines. These lines are parameterized lines, and the parameter for a coordinate line is the associated coordinate function. The coordinate lines are visibly when the coordinate system is drawn. The coordinate line for u falls along intersecting constant surfaces of v and w, and as noted the parameterization of the coordinate line for u is defined by coordinate u. Similarly the coordinate line for v falls along the intersection of the constant surfaces of u and w and the coordinate line parameter is the coordinate function v. We intuitively recognize the coordinate lines when we look at some coordinate system that has been drawn, because these fall at the intersection of the coordinate surfaces and the parameterization is visible by the associated coordinate function. The coordinate lines are easy to calculate from [tex] s(u,v,w) := (x(u,v,w), y(u,v,w), z(u,v,w)) [/tex]. For the u coordinate lines, you simply hold v and w constant and plot [tex] s(u,v=c1,w=c2) [/tex]. There is a separate coordinate line for each value of v and w. For the v coordinate lines, you hold u and w constant and plot [tex] s(u=c1, v, w=c2) [/tex]. The same pattern is followed for w. However, it is often easier to work with the differential of the coordinate functions and the coordinate lines. The differential of the coordinate functions are sometimes called differential forms, and sometimes covectors. Sometimes these are represented as row vectors, but since the chain rule is the rule for transformation in differential geometry, it is often more convenient to represent these in differential form notation [tex] du, dv, dw [/tex]. Just as u, v, and w form a new coordinate basis, [tex] du, dv, dw [/tex] form an alternative basis to [tex] dx, dy, dz [/tex] for the covector space. This basis is a vector basis that characterizes the rate of change of functions (it can also characterize things that are not functions, but that will not be discussed here). The conversion from the old basis to the new basis is given by: [tex] dx = \frac{\partial x}{\partial u} du + \frac{\partial x}{\partial v} dv + \frac{\partial x}{\partial w} dw = x_u du + x_v dv + x_w dw [/tex] [tex] dy = \frac{\partial y}{\partial u} du + \frac{\partial y}{\partial v} dv + \frac{\partial y}{\partial w} dw = y_u du + y_v dv + y_w dw [/tex] [tex] dz = \frac{\partial z}{\partial u} du + \frac{\partial z}{\partial v} dv + \frac{\partial z}{\partial w} dw = z_u du + z_v dv + z_w dw [/tex] Note that the notation is convenient since the chain rule (or rule fro transformations) is naturally expressed in this form. The vector basis for u, v, and w can be represented by column vectors. However in differential geometry it is more convenient to use the notation [tex]\frac{\partial}{\partial u}, \frac{\partial}{\partial v}, \frac{\partial}{\partial w}[/tex]. These can be calculated from [tex] s(u,v,w) := (x(u,v,w), y(u,v,w), z(u,v,w)) [/tex], by the chain rule formula: [tex]\frac{\partial}{\partial u} = \frac{\partial x}{\partial u} \frac{\partial}{\partial x} + \frac{\partial y}{\partial u} \frac{\partial}{\partial y} + \frac{\partial z}{\partial u} \frac{\partial}{\partial z} = x_u \frac{\partial}{\partial x} + y_u \frac{\partial}{\partial y} + z_u \frac{\partial}{\partial z} [/tex] [tex]\frac{\partial}{\partial v} = \frac{\partial x}{\partial v} \frac{\partial}{\partial x} + \frac{\partial y}{\partial v} \frac{\partial}{\partial y} + \frac{\partial z}{\partial v} \frac{\partial}{\partial z} = x_v \frac{\partial}{\partial x} + y_v \frac{\partial}{\partial y} + z_v \frac{\partial}{\partial z}[/tex] [tex]\frac{\partial}{\partial w} = \frac{\partial x}{\partial w} \frac{\partial}{\partial x} + \frac{\partial y}{\partial w} \frac{\partial}{\partial y} + \frac{\partial z}{\partial w} \frac{\partial}{\partial z} = x_w \frac{\partial}{\partial x} + y_w \frac{\partial}{\partial y} + z_w \frac{\partial}{\partial z}[/tex] The Jacobian for this system is given by: [tex] \[ \left( \begin{array}{ccc} x_u & x_v & x_w\\ y_u & y_v & y_w \\ z_u & z_v & z_w \end{array} \right)\] [/tex] Note that the columns of the Jacobian are not just new basis vectors, but the are the basis vectors [tex]\frac{\partial}{\partial u}, \frac{\partial}{\partial v}, \frac{\partial}{\partial w}[/tex], associated with the new coordinate system u, v, w. The rows of the Jacobian are dx, dy, and dz transformed to the new basis du, dv, dw. Multiplying a row vector (field) / covector (field) / one form by the Jacobian converts it from the dx, dy, dz basis to the du, dv, dw basis - it is just an expression of the chain rule. Multiplying the Jacobian by a vector (field) converts it from the [tex]\frac{\partial}{\partial u}, \frac{\partial}{\partial v}, \frac{\partial}{\partial w}[/tex] basis to the [tex]\frac{\partial}{\partial x}, \frac{\partial}{\parital y}, \frac{\partial}{\partial z}[/tex] basis - again just an expression of the chain rule. The Jacobian is a useful tool when writing vectors and covectors in matrix notation. When writing things in the more modern notation that is natural for the chain rule, the transformation rule is typically more obvious and the Jacobian does not need to be represented explicitly. Note that the coordinate vectors [tex]\frac{\partial}{\partial u}, \frac{\partial}{\partial v}, \frac{\partial}{\partial w}[/tex] are not unit vectors which are usually represented by using an 'e' form like [tex] e_1 , e_2 , e_3 [/tex]. Unit vectors have a length 1 with respect a metric defined on the space. Coordinate vectors need no metric to be defined and are parameterized by the associated coordinate function. |
| Jul28-10, 02:12 PM | #7 |
|
|
llarsen, thank you for your amazing post. After reviewing I have a couple of questions, and I entreat you to be brief because you've already been much more thorough than I could have hoped.
1. I assume that (d/dx, d/dy, d/dz) are just the same old vector basis (e1,e2,e3) in Euclidian R3, right? This is confusing to me because (dx,dy,dz) are the differentials along coordinate curves (as you describe), so which ones transform covariantly and which contravariantly? The (d/dx, d/dy, d/dz) is an object built along coordinate lines and so those basis vectors transform covariantly, correct? 2. I think this next question gets to my lack of ability to always distinguish between vectors and components of vectors. Let's say I wanted to transform a vector from basis (d/dx, d/dy, d/dz) to (d/du, d/dv, d/dw). If I multiply your 3x3 Jacobian by a 3x1 column vector representing the components of that vector in R3, the matrix multiplication gives me a transformation that looks like a one-form (ie. dx/du, dx/dv, dx/dw) instead of one with (dx/du, dy/du, dz/du), yet I know it is the Jacobian operating on a column vector is the one that will map a tangent vector in a coordinate transformation. Is this due to the fact that I am operating on components of the vector, which transform like (are?) covectors? And hence, if I were to map a basis vector that transforms covariantly, I would premultiply it as a row vector to get the appropriate transformation? Thanks for any help! Feel free to point me to useful references online but I've failed to find books/etc that have helped me get to the concepts I'm having trouble with. |
| Jul28-10, 04:00 PM | #8 |
|
|
[tex]\[ \left( \begin{array}{ccc} \frac{dx}{dt} x_u + \frac{dy}{dt} x_v + \frac{dz}{dt} x_w\\ \frac{dx}{dt} y_u + \frac{dy}{dt} y_v + \frac{dz}{dt} y_w\\ \frac{dx}{dt} z_u + \frac{dy}{dt} z_v + \frac{dz}{dt} z_w\end{array} \right)\][/tex] Vectors and covectors are geometric objects that come in a package. You don't operate on each component one at a time. I should note that I prefer writing vector fields using the notation [itex] V = f_1 \frac{\partial}{\partial x} + f_2 \frac{\partial}{\partial y} + f_3 \frac{\partial}{\partial z} [/itex] rather than using matrix notation, because the chain rule transformation becomes much more transparent. There are a few books from William Burke that I found useful. His books are great for visualizing concepts from differential geometry, but not so good if you are interested in rigor. A website for him is found here. He has a lot of great insights and is worth looking at. "The Geometry of Physics" by Theodore Frankel is about the level of rigor I like. It is reasonably approachable, but has fewer graphical depictions of concepts than I would like. Covers vectors and differential forms pretty well. |
| Jul28-10, 04:05 PM | #9 |
|
|
Is 7thSon an Orson Scott Card reference?
|
| Jul28-10, 04:14 PM | #10 |
|
|
Thanks a million for your help again, I will look through your response later today |
| Jul29-10, 03:52 PM | #11 |
|
|
I made a mistake in my post above. I claimed that when you transform from (d/dx, d/dy, d/dz) to (d/du, d/dv, d/dw) by multiplying the vector (dx/dt, dy/dt, dz/dt) transposed by the jacobian to get:
[tex] \[ \left( \begin{array}{ccc} \frac{dx}{dt} x_u + \frac{dy}{dt} x_v + \frac{dz}{dt} x_w\\ \frac{dx}{dt} y_u + \frac{dy}{dt} y_v + \frac{dz}{dt} y_w\\ \frac{dx}{dt} z_u + \frac{dy}{dt} z_v + \frac{dz}{dt} z_w\end{array} \right)\] [/tex] This isn't correct, and you can tell this since it doesn't agree with the chain rule. For example, examine the first term in the vector: [tex]\frac{dx}{dt} x_u + \frac{dy}{dt} x_v + \frac{dz}{dt} x_w = \frac{dx}{dt} \frac{\partial x}{\partial u} + \frac{dy}{dt}\frac{\partial x}{\partial v} + \frac{dz}{dt} \frac{\partial x}{\partial w}[/tex] The chain rule suggests that for the first component we should really have: [tex]\frac{dx}{dt} u_x + \frac{dy}{dt} u_y + \frac{dz}{dt} u_z = \frac{dx}{dt} \frac{\partial u}{\partial x} + \frac{dy}{dt}\frac{\partial u}{\partial y} + \frac{dz}{dt} \frac{\partial u}{\partial z}[/tex] What this is telling us is that we really need the inverse jacobian to transform from transform from (d/dx, d/dy, d/dz) to (d/du, d/dv, d/dw): [tex] \[ \left( \begin{array}{ccc} u_x & u_y & u_z\\ v_x & v_y & v_z \\ w_x & w_y & w_z \end{array} \right)\] [/tex] The jacobian is used to transform from (d/du, d/dv, d/dw) to (d/dx, d/dy, d/dz). |
| Jul29-10, 04:54 PM | #12 |
|
|
Just a followup that seems simple/dumb by comparison:
Do we need any additional conditions to get a complex version of the inverse function theorem.?. I am thinking of the corollary that inverse image f^-1(a) is a submanifold. If we started out with a complex manifold M and the same conditions on the Jacobian of f, does it follow that the inverse image f^-1(w) ( w in C, complexes) is a complex submanifold.? Thanks. s |
| Jul30-10, 12:21 AM | #13 |
|
|
The cases I deal with only required real coordinates. Since I don't have to deal with complex change of coordinates, I seldom take time to think about the modifications needed for complex systems, so I won't try to address this question for you. Sometimes the theorems need little modification to deal with the complex case, but there are sometimes subtleties you have to be careful of, and complex differential geometry is not my forte.
|
| Jul30-10, 01:13 AM | #14 |
|
|
One of the things I was trying to convey in my discussions above is that the chain rule tends to lie at the heart of differential geometry. One of the problems with matrix and tensor notation is that it obscures the chain rule. I realized that my discussion above may be difficult to follow because I jump between notations for describing vector fields and tensors. If you are not familiar with the different notations and how they are connected my comments may be confusing. Understanding the different notations and their connection can help in understanding the concepts in differential geometry.
For example, a vector field is a basic object in differential geometry. A vector field indicates direction and magnitude at each point where the vector field is defined. One form a vector field can be represented in is as a set of first order ordinary differential equations (ODEs) such as the following: [tex] \begin{array}{ccc} \frac{dx_1}{dt} = f_1(X) \\ \frac{dx_2}{dt} = f_2(X) \\ ... \\ \frac{dx_n}{dt} = f_n(X) \end{array} [/tex] where [tex]X[/tex] represents dependence on coordinates [tex](x_1 , x_2 , ..., x_n)[/tex]. A vector field can also be represented as a row vector like so: [tex] \[ \left( \begin{array}{ccc} f_1(X) \\ f_2(X) \\ ... \\ f_n(X) \end{array} \right)\] [/tex] Note that from the ODE definition, you can replace the column vector representation above with: [tex] \[ \left( \begin{array}{ccc} \frac{dx_1}{dt} \\ \frac{dx_1}{dt} \\ ... \\ \frac{dx_n}{dt} \end{array} \right)\] [/tex] It is useful to write [itex] \frac{dx_i}{dt} [/itex] in place of [tex]f_i[/tex] sometimes because the chain rule becomes more explicit and it is easier to tell when you get transformations correct. But it can also be mysterious if you are not aware of the connection to vector fields and ODEs. You will notice that I wrote the column vector in this form in a previous post assuming this concept was understood. Another form to write a vector field in is in the tensor notation [tex] f_i [/tex] where expansion of i is understood. This can also be written as [itex]\frac{dx_i}{dt}[/itex] where the connection to the ODE form is understood. Another form to write a vector field in is the vector field notation: [tex] V = f_1(X) \frac{\partial}{\partial x_1} + f_2(X) \frac{\partial}{\partial x_1} + ... + f_n(X) \frac{\partial}{\partial x_n} = \frac{dx_1}{dt} \frac{\partial}{\partial x_1} + \frac{dx_2}{dt} \frac{\partial}{\partial x_2} + ... + \frac{dx_n}{dt} \frac{\partial}{\partial x_n} [/tex] Note that the form after the second equal sign is particularly representative of the chain rule. The terms [itex]\frac{\partial}{\partial x_i}[/itex] represent coordinate vectors as described in a post above. However, when applying the chain rule, they act as standard partial derivatives. So if we want to transform from coordinate system X to coordinate system Y, we use the chain rule formula: [tex] V = \frac{dx_i}{dt} \frac{\partial y_j}{\partial x_i} \frac {\partial}{\partial y_j} [/tex] The jacobian is built into this transformation, but since the notation is natural for expressing the chain rule, it is not necessary to write out the jacobian. It is also harder to get the transformation wrong in this format. Anyway, transformations typically reduce down to using the chain rule, so understanding how to represent the opertion in terms of the chain rule is quite useful. |
| Jul30-10, 05:43 PM | #15 |
|
|
Excllent, llarsen; would you suggest any particular way of going about understanding
the chain rule.? |
| Jul31-10, 09:46 PM | #16 |
|
|
llarsen, I think I'm almost there. There is one more thing that started to confuse me at the wikipedia page after I thought I understood everything. I believe the following is correct.
For the change of coordinates (u,v,w) --> (x,y,z), with Jacobian defined with terms like dx/du: 1. Vectors components in (u,v,w) basis can be found by using the inverse Jacobian to operate on the components of that vector in (x,y,z) basis 2. One can find components of covectors in (u,v,w) basis by multiplying the covector's components in (x,y,z) basis by Jacobian i.e. f_new = f * J 3. The basis vectors (d/du, d/dv, d/dw) are just the columns in the "forward" Jacobian, each with the appropriate coordinate vector appended (d/dx, d/dy, or d/dz) 4. If there was ever a reason to find (d/dx, d/dy, or d/dz) in terms of (d/du, d/dv, d/dz), it would come from the columns of the inverse Jacobian. Lastly, the wikipedia page defines a matrix "A" which is the linear combination converting the basis vectors in the domain to basis vectors in the mapping's image. The rules of this "A" matrix and the Jacobian seem to be opposite and I suspect they are in fact inverses, is this true? It's kind of hard for me to decipher because I am used to looking at a coordinate chart mapping (u,v,w) ---> (x,y,z), and normally we have the coordinates of something in (x,y,z) and want to find the corresponding components of the objects in the domain of the mapping. Since two things are going on, I just ended up confusing myself. |
| Jul31-10, 10:08 PM | #17 |
|
|
Also, are you comfortable with thermodynamics and the total differential? I think sometimes a concrete example of a physical system using the chain rule can clarify things. |
| Thread Closed |
| Thread Tools | |
Similar Threads for: inverse function theorem for surface mappings
|
||||
| Thread | Forum | Replies | ||
| Inverse Function Theorem | Calculus & Beyond Homework | 0 | ||
| Inverse Function Theorem | Calculus & Beyond Homework | 3 | ||
| Inverse Function Theorem in R | Calculus | 2 | ||
| Inverse Function Theorem | Calculus & Beyond Homework | 4 | ||
| inverse function theorem | Calculus | 16 | ||