Definition of Partial Derivative: Correct Form?

  • Thread starter Cyrus
  • Start date
  • Tags
    Definition
In summary: But more importantly, I think the important thing is to be able to convert between the two. So, if you are comfortable with that, then you know how to convert between the two and it shouldn't really matter.In summary, the correct definition for \frac {\partial f}{\partial \bar{x} } is as a column vector, but some sources may use a row vector. It is important to be able to convert between the two representations.
  • #1
Cyrus
3,238
16
Which one is the correct definition? I have seen both being used in texts and it makes a world of difference which one is used when doing proofs.


[tex] \frac {\partial f}{\partial \bar{x} } = [ \frac {\partial f}{\partial x_1 }, \frac {\partial f}{\partial x_2 }, \frac {\partial f}{\partial x_n }] [/tex]

OR:

[tex] \frac {\partial f}{\partial \bar{x} } = \left[\begin{array}{c} \frac {\partial f}{\partial x_1 } \\ \\ \frac {\partial f}{\partial x_2 }\\ \\ \frac {\partial f}{\partial x_n }\end{array}\right] [/tex]

My book and wiki have the first definition, but I have seen some people use the second.
 
Physics news on Phys.org
  • #2
They look like the same thing to me.
 
  • #3
One is a column vector and one is a row vector.
 
  • #4
What's the context here? Your expression looks very like grad(f).
 
  • #5
cristo said:
What's the context here? Your expression looks very like grad(f).

It's very nearly the same thing as writing grad.

f(r+dr)=grad[f(r)].dr
=dr df/dr
=dr. I df/dr

So grad[f(r)]=I df/dr

(I'm too lazy to latex the above, but all the r, dr are vectors)
 
  • #6
The notation [itex]df/d\vec x[/itex] is just a non-standard way to write [itex]\nabla f[/itex].

The standard representation of a displacement vector is as a column vector. With this representation, a displacement vector transforms from frame A to frame B as

[tex]\vec x_B = T_{A\to B} \, \vec x_A[/tex]

The gradient of a scalar field transforms by a different set of rules. The gradient lives in the dual space of the displacement vector field. The gradient should thus be represented as a row vector. With the gradient expressed as a row vector, the gradient of a function transforms according to

[tex]\nabla f_B(x) = (\nabla f_A(x)) \, T_{A\to B}[/tex]
 
Last edited:
  • #7
D H said:
The notation [itex]df/d\vec x[/itex] is just a non-standard way to write [itex]\nabla f[/itex].
That's what I thought; although the bar in the original post confused me!

The standard representation of a displacement vector is as a column vector. With this representation, a displacement vector transforms from frame A to frame B as

[tex]\vec x_B = T_{A\to B} \, \vec x_A[/tex]

The gradient of a scalar field transforms by a different set of rules. The gradient lives in the dual space of the displacement vector field. The gradient should thus be represented as a row vector. With the gradient expressed as a row vector, the gradient of a function transforms according to

[tex]\nabla f_B(x) = (\nabla f_A(x)) \, T_{A\to B}[/tex]

Of course, some people gloss over this and write all "vectors" as row vectors!
 
  • #8
The bar means its a scalar differentiated W.R.T a vector.

Also, It never dawned on me to think of it as the gradient, but in fact that's exactly what it's doing (only if x is spatial coordinates). In that case, the correct definition is definition 2.

But I am asking which one is the correct definition? Why would you need context to answer my question?
 
Last edited:
  • #9
According to post #6 by DH

The gradient should thus be represented as a row vector.
 
  • #10
cyrusabdollahi said:
Which one is the correct definition? I have seen both being used in texts and it makes a world of difference which one is used when doing proofs.


[tex] \frac {\partial f}{\partial \bar{x} } = [ \frac {\partial f}{\partial x_1 }, \frac {\partial f}{\partial x_2 }, \frac {\partial f}{\partial x_n }] [/tex]

OR:

[tex] \frac {\partial f}{\partial \bar{x} } = \left[\begin{array}{c} \frac {\partial f}{\partial x_1 } \\ \\ \frac {\partial f}{\partial x_2 }\\ \\ \frac {\partial f}{\partial x_n }\end{array}\right] [/tex]

My book and wiki have the first definition, but I have seen some people use the second.
The components of the first is a vector and as such is is layed out using a row matrix, which is the correct thing to do here. The second has components of a vector but is a column matrix of a 1-form which is the wrong thing to do. Only 1-forms are represented with the variables in the denuminator using subscripts and a change of sign for the spatial components.

Well ... it sounds good anyway. :biggrin: Does anyone agree? :confused:

Pete
 
  • #11
Mentz114 said:
According to post #6 by DH

Incorrect, the gradient is a column vector because my spatial coordinates is a column vector in R^n. So the gradient also has to be a column vector in R^n.
 
Last edited:
  • #12
cyrusabdollahi said:
Which one is the correct definition?
For better or worse -- the right answer to this question is "the one in the book." (Or "the one the instructor gave", or whatever is appropriate)

Also, for better or worse (worse, IMHO), it is common to formalize the mathematics so that no distinction is made between the two.


But if you are going to make a distinction (which I certainly advise) -- then [itex]\partial f / \partial \vec{x}[/itex] is going to be the opposite of what you use to denote [itex]\vec{x}[/itex]. The most common convention is that [itex]\vec{x}[/itex] is a column, thus [itex]\partial f / \partial \vec{x}[/itex] is a row.



And, for maximum confusion, the notation [itex]\nabla f[/itex] is often used both to denote the gradient and to denote [itex]\partial f / \partial \vec{x}[/itex]. :frown: (e.g. see Spivak's Differential Geometry)
 
Last edited:
  • #13
Thats my problem. The book uses one way, my notes from class, which is from a different book, uses another way.

Here is a paper that basically says the same thing I've been saying in the footnote. But I wanted a second opinion from you guys.

http://www.colorado.edu/engineering/CAS/courses.d/IFEM.d/IFEM.AppD.d/IFEM.AppD.pdf

Also note, the Jacobian is defined as the Transpose in my book of what's in the link above.
 
  • #14
cyrusabdollahi said:
Thats my problem. The book uses one way, my notes from class, which is from a different book, uses another way.

Here is a paper that basically says the same thing I've been saying in the footnote. But I wanted a second opinion from you guys.

http://www.colorado.edu/engineering/CAS/courses.d/IFEM.d/IFEM.AppD.d/IFEM.AppD.pdf

Also note, the Jacobian is defined as the Transpose in my book of what's in the link above.
Ye gads! That's (IMHO) a very poor conventional choice -- so much that my first instinct would be to call it wrong! I would be a thousand times happier if x and y were row vectors, rather than the column vectors they were defined to be. (Or keep them as columns, but transpose everything else in that appendix)
 
  • #15
Well, in Dynamics it makes more sense to define things as column vectors and not row vectors.
 
  • #16
I still maintain that

[itex]\nabla f(\mathbf{r})=\mathbf{I}\frac{\partial f(\mathbf{r})}{\partial \mathbf{r}}[/itex]

Grad and df/dr are not completely equivalent.
 
  • #17
I don't understand what your talking about christian. Can you explain what your saying in more detail?
 
  • #18
[itex]f(\mathbf{r}+d\mathbf{r})=f(\mathbf{r})+\nabla f(\mathbf{r}).d\mathbf{r} [/itex]

Also

[itex]f(\mathbf{r}+d\mathbf{r})=f(\mathbf{r})+\frac{\partial}{\partial \mathbf{r}} f(\mathbf{r})d\mathbf{r}
= f(\mathbf{r})+\frac{\partial}{\partial \mathbf{r}} f(\mathbf{r})\mathbf{I}.d\mathbf{r}[/itex]

So

[itex]\nabla f(\mathbf{r})=\mathbf{I}\frac{\partial f(\mathbf{r})}{\partial \mathbf{r}}[/itex]
 
Last edited:
  • #19
cyrusabdollahi said:
Incorrect, the gradient is a column vector because my spatial coordinates is a column vector in R^n. So the gradient also has to be a column vector in R^n.

If you represent spatial coordinates as a column vector (WHO DOES THIS GARBAGE?), you should represent the gradient as a row vector.

Another way to look at this is to view displacement vectors and the gradient of a scalar field as tensors. Displacement vectors are contravariant vectors, while gradients are covariant. They live in spaces dual to each other. It is best to represent the two vectors in different forms.
 
  • #20
In fact, it is the standard in dynamics D_H. The configuration space will typically be a column in R^n.
 
  • #21
Christian, you implicitly chose a convention for [itex]df/d\vec r[/itex] by doing what you did and then used that choice to justify what you did.

It is better to write this as

[tex]f(\vec r+d\vec r) =
f(\vec r) + d\vec r \cdot \frac{\partial}{\partial \vec r}f(\vec r)[/tex]

and then it doesn't matter whether you represent spatial vectors as column vectors or as row vectors.
 
Last edited:
  • #22
cyrusabdollahi said:
In fact, it is the standard in dynamics D_H. The configuration space will typically be a column in R^n.

I'm an orientationally challenged idiot.

Read my post #6. I had it right there, where I said displacement vectors are typically represented as column vectors (and that gradients should be represented as row vectors).

Think of displacement vectors (and velocity vectors and acceleration vectors) as living in one space (e.g., column vectors); the gradient must necessarily live in the dual space (e.g., row vectors).

Suppose some units-challenged engineer gave you x, y, and z spatial coordinates in meters, meters, and furlongs. You naturally transform those goofy meters into furlongs and do all of your work in the furlongs/fortnight system. When you report your results back, you have to transform back to the mixed system. Different transformation rules apply for the displacement vectors and gradients your report. Displacement vectors transform as contravariant tensors, gradients as covariant tensors.
 
  • #23
D H said:
Christian, you implicitly chose a convention for [itex]df/d\vec r[/itex] by doing what you did and then used that choice to justify what you did.

It is better to write this as

[tex]f(\vec r+d\vec r) =
f(\vec r) + d\vec r \cdot \frac{\partial}{\partial \vec r}f(\vec r)[/tex]

and then it doesn't matter whether you represent spatial vectors as column vectors or as row vectors.

Of course.

But my convention makes much more sense.
 
  • #24
You naturally transform those goofy meters into
furlongs and do all of your work in the furlongs/fortnight system.
Aha - now I understand. A furlong is the length of a row (furrow) of beans. So we should use row vectors for position. I always work in roods per hour, so I don't have this problem.
 
  • #25
christianjb said:
Of course.

But my convention makes much more sense.


Your convention involved dividing by a vector. That doesn't make much sense at all.
 
  • #26
D H said:
Your convention involved dividing by a vector. That doesn't make much sense at all.

Hey- if it gives the right answer at the end of the manipulations then who cares?

You never end up with an answer that involves the division of a vector. It's just a matter of book-keeping.

Personally I don't like the convention that df/dr = grad f. It's better for the derivative to obey f(r+dr)-f(r)=dr df/dr, which this convention doesn't.
 
  • #27
The convention that [itex]df/d\vec r[/itex] is a [itex]1\times n[/itex] row vector and [itex]d\vec r[/itex] is a [itex]n\times1[/itex] column vector does obey the convention [itex]f(\vec r+d\vec r) = f(\vec r) + df/d\vec r*d\vec r[/itex]. The product of a 1xn vector and a nx1 vector is a scalar.
 
  • #28
D H said:
The convention that [itex]df/d\vec r[/itex] is a [itex]1\times n[/itex] row vector and [itex]d\vec r[/itex] is a [itex]n\times1[/itex] column vector does obey the convention [itex]f(\vec r+d\vec r) = f(\vec r) + df/d\vec r*d\vec r[/itex]. The product of a 1xn vector and a nx1 vector is a scalar.

The inner product, yes.
 
  • #29
I wrote that as a matrix product, not the inner product.
 
  • #30
Now I'm confused.

"The product of a 1xn vector and a nx1 vector is a scalar"

That's the inner product right?
 
  • #31
The inner product of two column vectors [itex]\mathbf a[/itex] and [itex]\mathbf b[/itex], [itex]\mathbf a \cdot \mathbf b[/itex], written in matrix form is [itex]\mathbf a^T * \mathbf b[/itex].
 
Last edited:

1. What is the correct form of a partial derivative?

The correct form of a partial derivative is written as ∂f/∂x, where f is the function and x is the variable with respect to which the derivative is being taken.

2. How is a partial derivative different from a regular derivative?

A partial derivative is taken with respect to one variable while holding all other variables constant, whereas a regular derivative is taken with respect to one variable in a single-variable function.

3. Can a partial derivative be taken more than once?

Yes, a partial derivative can be taken multiple times with respect to different variables in a multivariable function.

4. What does a partial derivative represent?

A partial derivative represents the rate of change of a multivariable function with respect to one of its variables, while holding all other variables constant.

5. How is a partial derivative calculated?

A partial derivative is calculated by taking the limit of the difference quotient as the change in the variable approaches zero.

Similar threads

  • Other Physics Topics
Replies
1
Views
2K
Replies
3
Views
1K
Replies
22
Views
365
  • Special and General Relativity
Replies
4
Views
1K
  • Advanced Physics Homework Help
Replies
2
Views
881
Replies
1
Views
814
Replies
5
Views
273
  • Electromagnetism
Replies
2
Views
803
  • Calculus and Beyond Homework Help
Replies
6
Views
505
  • Differential Geometry
Replies
9
Views
356
Back
Top