Dismiss Notice
Join Physics Forums Today!
The friendliest, high quality science and math community on the planet! Everyone who loves science is here!

Delta, derivative, differential etc.

  1. Feb 2, 2009 #1
    I am still very confused about the differences between all the d's and delta's used to represent infinitesimal elements and/or derivatives and never know when and where to use what:
    - [tex]du[/tex]
    - [tex]\partial u[/tex]
    - [tex]\delta u[/tex]

    For instance what can be simplified exactly in the chain rule and also what to use in an integral.

    Thanks for your help!
  2. jcsd
  3. Feb 3, 2009 #2


    User Avatar
    Gold Member

    I think you are referring to [tex]\Delta x[/tex], [tex] dx[/tex] and [tex]\partial x[/tex]

    [tex]\Delta x[/tex] simply means some change in x

    [tex] dx[/tex] is an infinitesimal change in x

    [tex]\partial x[/tex] is also an infinitesimal change however most commonly used when referring to some other dependent variable.

    for instance if y and t are both dependent variables in some function [tex] g(y,t)[/tex]. For example [tex] g(y,t) = t + yt^{2}[/tex] then [tex] \frac{\partial g}{\partial t} = 1+ 2yt[/tex] and [tex] \frac{\partial g}{\partial y} = t^{2} [/tex]

    Notice that one is held constant while the other is taken the derivative of

    Also I am not sure wat you mean by "what can be simplified exactly in the chain rule" and as for the integral only [tex] dx[/tex] and [tex]\partial x[/tex] of course. It will do you good on finding out exactly what you are doing when you are integrating and deriving. Integrating is the summation of infinitesimally small sums for example.
    Last edited: Feb 3, 2009
  4. Feb 3, 2009 #3


    User Avatar
    Staff Emeritus
    Science Advisor
    Gold Member

    If [itex]f:\mathbb R\rightarrow\mathbb R[/itex], then [itex]df:\mathbb R^2\rightarrow\mathbb R[/itex] is defined by


    Sometimes the h is written as "dx", and df(x,h) as "dy", so that the expression above becomes


    Note that h=dx doesn't have to be "small" for this to be well-defined. It does however have to be small for f'(x)dx to be a good approximation of f(x+dx)-f(x) since

    [tex]f(x+h)=f(x)+hf'(x)+\frac 1 2 h^2f''(x)+\mathcal O(h^3)[/tex]

    when [itex]h\rightarrow 0[/itex].

    [itex]\Delta x[/itex] is the same as h=dx, i.e. it's just a number. [itex]\Delta y[/itex] or [itex]\Delta f[/itex] is the actual change of the value of the function f, i.e.

    [tex]\Delta f=f(x+h)-f(x)[/tex]

    so [itex]\Delta x=dx[/itex], but [itex]\Delta y\approx dy[/itex] with [itex]\Delta y-dy\rightarrow 0[/itex] when [itex]dx\rightarrow 0[/itex].

    The lowercase delta is used in the calculus of variations. See e.g. the Wikipedia definition of a functional derivative. I'm not sure if it's used outside of that context.

    I don't think I have ever seen the expression [itex]\partial x[/itex] defined, or even used other than as a part of the expression for a partial derivative:

    [tex]D_if(x)=f_{,i}(x)=\frac{\partial}{\partial x}f(x)=\frac{\partial f(x)}{\partial x}[/tex]

    The expression [itex]\partial_\mu[/itex] is however used in differential geometry, as a short version of [itex]\partial/\partial x^\mu[/itex], a partial derivative operator constructed from some coordinate system [itex]x:U\rightarrow\mathbb R^n[/itex], where U is an open subset of the manifold.

    [tex]\partial_\mu|_p f=\frac{\partial}{\partial x^\mu}\bigg|_p f=(f\circ x^{-1})_{,\mu}(x(p))[/tex]

    There are few things in the physics literature that I find as confusing as the use of the word "infinitesimal". It took me some time to realize that when the word "infinitesimal" is used (in a physics book), the relevant variables aren't infinitesimals at all, and do not even need to be "small" in any sense of the word. The only thing that the word "infinitesimal" represents in a physics book is that the equation that follows the word "infinitesimal" has been Taylor expanded around 0 and only the terms up to some finite order in the variables have been kept. Physicists apparently use the word "infinitesimal" only to avoid having to use the "big O" notation.
    Last edited: Feb 3, 2009
  5. Feb 3, 2009 #4


    User Avatar
    Staff Emeritus
    Science Advisor
    Gold Member

    It should be clear from the definition of df above why the chain rule needs to be proved even though it looks so "obvious" when you use the notation


    to represent the more precise

    [tex](f\circ g)'(x)=f'(g(x))g'(x)[/tex]

    Let's try to express this using differentials (expressions like "df").

    [tex]\frac{d(f\circ g)(x,h)}{h}=\frac{df(g(x),k)}{k}\frac{dg(x,l)}{l}[/tex]

    It looks a bit less obvious now, doesn't it? Now choose h=l=dx, and k=dg(x,dx) so that the denominator of the first factor on the right cancels the numerator of the second factor.

    [tex]\frac{d(f\circ g)(x,dx)}{dx}=\frac{df(g(x),dg(x,dx))}{dg(x,dx)}\frac{dg(x,dx)}{dx}=\frac{df(g(x),dg(x,dx))}{dx}[/tex]

    [tex]d(f\circ g)(x,dx)=df(g(x),dg(x,dx))[/tex]

    We have now canceled the "dg", but what we ended up with isn't a trivial equality (which would have proved the chain rule). (The "df" on the left isn't the same thing as the "df" on the right). Instead we got an equality that isn't at all obvious. We can verify that it's equivalent to a statement of the chain rule by using the definition of the differentials again, but that just brings us back to the start.

    [tex]d(f\circ g)(x,dx)=(f\circ g)'(x)dx[/tex]

    Last edited: Feb 3, 2009
Share this great discussion with others via Reddit, Google+, Twitter, or Facebook