Directional derivative - conventions

In summary, the "directional derivative" in vector analysis, which measures the differential rate of increase of a scalar function in a vector direction, is conventionally defined starting from the vector gradient (the 'del' operator). This definition involves decomposing the vector into coordinate axes and finding the derivatives of the scalar function with respect to those axes. However, it can also be defined directly as the limit of the difference quotient, without the use of coordinates and the gradient. This definition is more fundamental and eliminates ambiguity, but the conventionally defined directional derivative is still widely used and accepted.
  • #1
rachmaninoff
"directional derivative" - conventions

Warning: nitpicking notational issues ahead! :eek:

So the "directional derivative" in vector analysis - the differential rate of increase of a scalar function φ in a vector direction - is by convention* defined starting from the vector gradient (the 'del' operator): [tex]\nabla \phi \cdot \hat{u}=
\vec{\nabla} \phi(\vec{r}) | _{r_0} \cdot \hat{u}[/tex]
(the directional derivative of φ in the u-direction).

So it's not really fundamental - it's defined in terms of del-φ, which is 'more fundamental' I guess. Big deal.

*"by convention" meaning, in every one of a dozen reference books I looked through...

For example, in cartesian coordinates, a directional derivative (at r) might look like:

[tex]\vec{\nabla} \phi_r \cdot \hat{u} = \left(\hat{i}\frac{\partial{\phi}}{\partial{x}}| _r + \hat{j} \frac{\partial{\phi}}{\partial{y}} | _r + \hat{k}\frac{\partial{\phi}}{\partial{z}} | _r \right) \cdot \left(u_x \hat{i} + u_y \hat{j} + u_z \hat{k} \right)[/tex]
[tex]=\frac{\partial{\phi}}{\partial{x}}u_x +\frac{\partial{\phi}}{\partial{y}}u_y + \frac{\partial{\phi}}{\partial{z}}u_z [/tex]

So, the vector u is decomposed into some coordinate axes, and the derivatives of the scalar function are found with respect to those three axes, and things are scalar-multiplied and added together. Your definition of the directional derivatives goes through a coordinate system, of your preference - through the definitions of the del operator.

But why not define this directly, in the usual manner for defining a derivative:

[tex]\lim_{\lambda \rightarrow 0^+} \frac{\phi (\vec{r} + \lambda \hat{u} ) - \phi (\vec{r})}{\lambda}[/tex]

Which is the same thing, without those coordinate decompositions,

[tex]d \phi = \frac{\partial{\phi}}{\partial{x}}dx + \frac{\partial{\phi}}{\partial{y}}dy + \frac{\partial{\phi}}{\partial{z}}dz[/tex]

[tex]\mbox{and, } d \vec{u} = \hat{i} d(u_x) + \hat{j}d(u_y) + \hat{k}d(u_z)??[/tex]

Because [itex]d \phi[/itex] looks well-defined to me. And so does [itex]d \vec{u} = \hat{u} d \lambda[/itex]. So then, why don't we just say, without ambiguity,


[tex]\frac{d \phi}{d \vec{u}} \right| _r \equiv \lim_{\lambda \rightarrow 0^+} \frac{\phi (\vec{r} + \lambda \hat{u} ) - \phi (\vec{r})}{\lambda}[/tex]

(with the normalized direction [tex]\hat{u} = \vec{u} / \| \vec{u} \| [/tex] ) ?!

So as to eliminate any reference to the coordinate system?

I mean, when we're taking the usual partial derivatives, with respect to say the "x" axis, that's just a 'directional' derivative in the [itex]\hat{x}[/itex] direction. So, can we not generalize this to any (normalized) vector direction, and so define the derivative of a scalar function with respect to a vector?

That's my question - is this notation rigorous enough?
 
Last edited by a moderator:
Physics news on Phys.org
  • #2
Just to clarify, by "notation", I mean the idea of writing

[tex]\frac{d \phi}{d \vec{u}}[/tex]
 
  • #3
As far as I know, the actual definition of the directional derivative is given as, for a unit vector u:

[tex]
(\nabla_{\vec{u}} f)(\vec{a}) = \lim_{h \rightarrow 0}
\frac{f(\vec{a} + h \vec{u}) - f(\vec{a})}{h}
[/tex]

So that it may actually defined even if the gradient is not.
 
  • #4
I don't know if this helps, but... (2 variables):

Let u=<a,b> so that the line going through u is given by the following:

[tex]x=x_0+at[/tex]
[tex]y=y_0+bt[/tex]

[tex]\frac{\partial f}{\partial t}=\frac{\partial f}{\partial x}\frac{dx}{dt}+\frac{\partial f}{\partial y}\frac{dy}{dt}[/tex]

Now from above, it is clear that [itex]\frac{dx}{dt}=a;\quad\frac{dy}{dt}=b[/itex].

Substituting results in the following:

[tex]\frac{\partial f}{\partial t}=\frac{\partial f}{\partial x}\frac{dx}{dt}+\frac{\partial f}{\partial y}\frac{dy}{dt}=\frac{\partial f}{\partial x}a+\frac{\partial f}{\partial y}b=\nabla f\cdot \vec{u}[/tex]
 
  • #5
Apmcavoy, a bit more exactly...

Not [tex]\frac {\partial f}{\partial t}[/tex] but [tex]\frac {d f}{d t},[/tex] because in your example the only independent variable is [tex] t.[/tex]

:smile:
 
  • #6
Hurkyl said:
As far as I know, the actual definition of the directional derivative is given as, for a unit vector u:

[tex]
(\nabla_{\vec{u}} f)(\vec{a}) = \lim_{h \rightarrow 0}
\frac{f(\vec{a} + h \vec{u}) - f(\vec{a})}{h}
[/tex]

So that it may actually defined even if the gradient is not.

Can you direct me to a reference which develops this?
 
  • #7
Advanced Calculus (Buck) defines it exactly as I have, except he uses the notation D&beta; for the directional derivative in the direction &beta;.

But frankly, I would expect just about any calculus textbook that ever deals with vectors to have this definition in it.
 
  • #8
I've never seen a vector calculus book worth its salt lacking Hurkyl's definition of the directional derivitave.
In my opinion, it wouldn't be worth its salt if it lacked a definition like that..
 
  • #9
I agree with Hurkyl, the directional derivative can intrinsicly be defined using the limit-definition, which seems fundamental enough. It's done like that in my Analysis course as well.

If you accept that as definition, it's not so hard to prove that the directional derivative can be calculated using the dot product with the gradient, so it follows as an equivalent way of calculating it.

This definition is given on Wikipedia: Directional Derivative as well.
 
  • #10
I prefer the notation
[tex](u\cdot\nabla) f[/tex]
for several reasons
first less confusing, if parenthesis are omited we do not confuse it with the gradient of an inner product, nor become confused should u depend on spatial variables. It is also less confusing should we take a directional derivative of a vector, or a higher order directional derivative.
[tex]D_u^n f =(u\cdot\nabla)^n f[/tex]
I do not know what you mean by the definition of gradient has reference to the coordinate system. It need not. In any case various operators such as divergence, curl ect. should not be seen as functions of the del operator. Those sugestive notations should only remind us of a way the limit process the operator represents can be realized. i.e.
[tex]T(\nabla,a)=\lim_{V\rightarrow 0}\frac{1}{V}\oint T(n,a)ds[/tex]
where T(nabla,a) is some combination of nabla and a,
and the integral is over the surface of a simple closed curve having volume V, and n is an outwardly pointed unit normal vector.
thus
[tex](u\cdot\nabla)a=\lim_{V\rightarrow 0}\frac{1}{V}\oint (u\cdot n)a \ ds[/tex]
[tex](u\cdot\nabla)\times a=\lim_{V\rightarrow 0}\frac{1}{V}\oint (u\cdot n)\times a \ ds[/tex]
and so forth
We can chose a particular curve to get a differential expression such as a cube.
of course we could aways use
[tex](a\times\nabla)\times b+a\nabla\cdot b=a\times(\nabla\times b)+(a\cdot\nabla)b[/tex]
 

1. What is a directional derivative?

A directional derivative is a mathematical concept in multivariable calculus that measures the rate of change of a function in a particular direction at a specific point. It is represented by the symbol ∇ f(x,y), where ∇ is the gradient operator and f(x,y) is the function.

2. How is the directional derivative calculated?

The directional derivative is calculated by taking the dot product of the gradient of the function and a unit vector in the desired direction. Mathematically, it can be expressed as Du f(x,y) = ∇ f(x,y) ⋅ u, where u is the unit vector.

3. Are there any conventions for the direction of the unit vector?

Yes, there are two common conventions for the direction of the unit vector. The first convention is to use the direction of the positive x-axis as the reference direction, while the second convention is to use the direction of the positive y-axis. It is important to specify which convention is being used in order to avoid confusion.

4. What is the significance of the directional derivative?

The directional derivative is important because it allows us to determine the slope of a function in a specific direction. This is useful in many applications, such as optimization and modeling physical systems.

5. Can the directional derivative be negative?

Yes, the directional derivative can be negative. This occurs when the function is decreasing in the specified direction, and the slope is negative. Similarly, the directional derivative can also be positive if the function is increasing in the specified direction, and the slope is positive.

Similar threads

Replies
1
Views
323
Replies
1
Views
2K
Replies
1
Views
1K
  • Calculus
Replies
4
Views
2K
Replies
2
Views
1K
Replies
3
Views
1K
  • Advanced Physics Homework Help
Replies
3
Views
366
Replies
3
Views
1K
Replies
13
Views
1K
Back
Top