Dirac Delta Function: Explanation & Usage

dreamLord · Jun 20, 2013

I know this probably belongs in one of the math sections, but I did not quite know where to put it, so I put it in here since I am studying Electrodynamics from Griffiths, and in the first chapter he talks about the Dirac delta function.

From what I've gathered, the Dirac delta function is 0 for x[itex]\neq[/itex]0, and ∞ for x = 0.

Now he assumes any function f(x), and says that the product f(x)*[itex]\delta[/itex](x) = 0 for x[itex]\neq[/itex]0. Fine, got that.

Now he goes on to say that the above statement can also be written as f(0)*[itex]\delta[/itex](x) = 0. My question is - we could also have written it as f(29.5)*[itex]\delta[/itex](x) = 0 for x[itex]\neq[/itex]0, right? So then why did we choose f(0)?

vanhees71 · Jun 20, 2013

First of all it is very important to understand that [itex]\delta[/itex] is not a function but a distribution. It is defined as a linear form on an appropriate space of functions, e.g., the infinitely many times differentiable functions with compact support or rapidly falling functions (Schwartz space). It is defined as
[tex]\int_{\mathbb{R}} \mathrm{d} x f(x) \delta(x)=f(0).[/tex]
This is not 0.

Sometimes you can simplify equations by formally setting [itex]f(x) \delta(x)=f(0) \delta(x)[/itex]. Strictly speaking that's not correct, because you cannot integrate the [itex]\delta[/itex] distribution against a constant function: a constant does not belong to the test-function space on which the [itex]\delta[/itex] distribution is defined.

dreamLord · Jun 21, 2013

I'm afraid your first few lines were completely lost on me! Is there any way you can dumb it down a bit?

Also, the equation that you wrote ; is this the definition of the Dirac Delta function? Or the fact that it is 0 when x is not zero, and infinity when x is 0. Which one defines it? Or are they the same thing.

DimReg · Jun 21, 2013

Griffiths focuses on f(0)δ(x) because f(0) is the only value of f that matters with the Dirac delta. So basically f(0)δ(x) will behave the same way as f(x)δ(x).

This is best understood under an integral sign, which is the only place the Dirac delta function is precisely defined. You have (edit: you can take these two properties as the definition, but the exact mathematical definition is a bit more complicated):

[itex] \int \delta (x) dx = 1[/itex] and [itex] \int f(x) \delta (x) dx = f(0) [/itex]

So you can write [itex] \int f(0) \delta (x) dx = f(0) \int \delta(x)dx = f(0)*1 = f(0) [/itex] which is the same as for f(x)δ(x)

On the other hand, [itex] \int f(1) \delta(x) dx = f(1)\int \delta(x)dx = f(1) [/itex], which in general is not equal to f(0), so f(1)δ(x) does not behave like f(x)δ(x).
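To see these integrals numerically, here is a rough sketch that replaces δ(x) by a very narrow normalized Gaussian. The names `narrow_peak` and `integrate`, the width 1e-3 and the choice f = cos are assumptions made only for this illustration; none of them come from Griffiths.

```python
import math

def narrow_peak(x, width=1e-3):
    """Normalized Gaussian of small width, used here as a stand-in for delta(x)."""
    return math.exp(-0.5 * (x / width) ** 2) / (width * math.sqrt(2.0 * math.pi))

def integrate(g, a=-0.05, b=0.05, n=20001):
    """Composite trapezoid rule on [a, b] using n sample points."""
    h = (b - a) / (n - 1)
    total = 0.5 * (g(a) + g(b))
    for i in range(1, n - 1):
        total += g(a + i * h)
    return total * h

f = math.cos  # hypothetical smooth function with f(0) = 1

with_fx = integrate(lambda x: f(x) * narrow_peak(x))    # integral of f(x) * delta(x)
with_f0 = integrate(lambda x: f(0.0) * narrow_peak(x))  # integral of f(0) * delta(x)
with_f1 = integrate(lambda x: f(1.0) * narrow_peak(x))  # integral of f(1) * delta(x)

print(with_fx, with_f0, with_f1)
```

The first two numbers should agree closely with f(0), while the third comes out as f(1), which is a different value for this f.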

Jano L. · Jun 21, 2013

The vanhees71 definition is right. The property "δ=0 for x ≠ 0, δ=∞ for x = 0" is just an intuitive description of a sharply peaked function, which is a valid picture of δ only in some situations. For example, it is valid for the charge density distribution of a point-like charged particle. However, when solving for the Green function of the Schroedinger equation, such a description of δ is incorrect, while the integral property above is valid.

DimReg · Jun 21, 2013

I don't get the impression that the OP is comfortable talking about distributions or Green's functions. If he were, I doubt he would be having trouble with the Dirac delta function.

vanhees71 · Jun 21, 2013

dreamLord said:

I'm afraid your first few lines were completely lost on me! Is there any way you can dumb it down a bit?

Also, the equation that you wrote ; is this the definition of the Dirac Delta function? Or the fact that it is 0 when x is not zero, and infinity when x is 0. Which one defines it? Or are they the same thing.

The point is that many introductory physics books confuse their readers with imprecise definitions of what a distribution is. Griffiths seems to be another example. I don't know his E&M book very well, apart from discussions here in the forum.

Objects like the Dirac [itex]\delta[/itex] are so-called distributions. They are defined as mappings from a function space (containing a certain set of functions, called test functions) to the (real or complex) numbers. They can only be defined in a manner that makes sense under an integral, where they are multiplied with a test function, and for the Dirac [itex]\delta[/itex] distribution this definition reads
[tex]\int_{\mathbb{R}} \mathrm{d} x \delta(x) f(x)=f(0).[/tex]
It's the value of the test function at the argument 0.

It is quite obvious that [itex]\delta(x)[/itex] cannot be a function in the usual sense, because you won't find any function with the property given above. However, you can define the [itex]\delta[/itex] distribution as a kind of limit, the so-called weak limit. The idea is to define functions which are sharply peaked around 0 with the integral normalized to 1. The simplest example is the "box function",
[tex]\delta_{\epsilon}(x)=\begin{cases}
1/(2 \epsilon) & \text{for} \quad x \in (-\epsilon,\epsilon) \\
0 & \text{elsewhere}.
\end{cases}[/tex]
The test functions should have "nice properties" to make things convenient. They should still form a vector space of functions, i.e., with two functions also their sum and the product with a constant should belong to this function space. A very convenient choice is Schwartz's space of rapidly falling smooth functions, i.e., they are arbitrarily many times differentiable and fall off at infinity faster than any inverse power of x.

Now we check the integral
[tex]I_{\epsilon}=\int_{\mathbb{R}} \mathrm{d} x \delta_{\epsilon}(x) f(x) = \frac{1}{2 \epsilon} \int_{-\epsilon}^{\epsilon} \mathrm{d} x f(x).[/tex]
Now according to the mean-value theorem for integrals of continuous functions, there is a value [itex]\xi \in [-\epsilon,\epsilon][/itex] such that
[tex]I_{\epsilon}=f(\xi).[/tex]
Now, since [itex]f[/itex] is continuous and we let [itex]\epsilon \rightarrow 0^+[/itex], you get
[tex]\lim_{\epsilon \rightarrow 0^+} I_{\epsilon}=f(0).[/tex]
This means that in the sense of a weak limit you may write
[tex]\lim_{\epsilon \rightarrow 0^+} \delta_{\epsilon}(x)=\delta(x).[/tex]
"Weak limit" means that you have to follow the above given procedure: You first have to take an integral with a test function and then take the limit. This is the crucial point.

If you now look at what happens in this example, Griffiths' sloppy definition makes some sense, but one has to keep in mind the proper meaning in the sense given above. Our functions [itex]\delta_{\epsilon}[/itex] are concentrated around 0 and become larger the smaller [itex]\epsilon[/itex] gets, when taking the limit [itex]\epsilon \rightarrow 0^+[/itex]. At the same time the interval where the function is different from 0 shrinks, and the construction is such that the total area under the graph (which here is a rectangle) stays constant at 1 for all [itex]\epsilon[/itex]. In this sense you may characterize the [itex]\delta[/itex] distribution as cited by Griffiths. To avoid confusion, however, it's mandatory to learn about the proper definition of distributions (also known as "generalized functions").

The best book for the physicist I know of is

M. J. Lighthill, Introduction to Fourier Analysis and Generalised Functions, Cambridge University Press (1959)

That it treats the distributions together with Fourier series and Fourier integrals is no disadvantage since this you'll need anyway when studying electrodynamics.

Fredrik · Jun 21, 2013

dreamLord said:

I know this probably belongs in one of the math sections, but I did not quite know where to put it,

Topology & analysis is the right place for it. I'm moving it there. Edit: I also changed "Diract" to "Dirac" in the title.

lurflurf · Jun 21, 2013

Yes, Griffiths' explanation is horrible. Here is the idea without technicalities.

In finite calculus we define the delta function so that

$$a_0=\sum_{k=-\infty}^\infty \delta_k a_k$$

That is a handy thing to do; it lets us write function evaluation as a sum.
We would like to do the same thing in infinitesimal calculus:

$$\mathop{f}(0)=\int_{-\infty}^\infty \! \mathop{\delta}(x) \, \mathop{f}(x) \,\mathop{dx}$$
we ignore that the delta function does not exist as a function

now we adopt as equality f=g if
$$\int_{-\infty}^\infty \! (\mathop{f}(x) - \mathop{g}(x)) \,\mathop{dx}=0$$

in this sense
$$\mathop{\delta}(x) \, \mathop{f}(x)=\mathop{\delta}(x) \, \mathop{f}(0)$$
since clearly
$$\int_{-\infty}^\infty \! (\mathop{\delta}(x) \, \mathop{f}(x) - \mathop{\delta}(x) \, \mathop{f}(0)) \,\mathop{dx}=\int_{-\infty}^\infty \! ( \mathop{\delta}(x) \, ( \mathop{f}(x) - \mathop{f}(0))) \,\mathop{dx}=( \mathop{f}(0) -\mathop{f}(0) )=0$$

For some purposes we probably want to adopt as equality f=g if
$$\int_{a}^b \! (\mathop{f}(x) - \mathop{g}(x)) \,\mathop{dx}=0$$
for all a and b

dreamLord · Jun 21, 2013

Things are becoming a little clearer now, though I am still fairly lost. Thank you for the amazing posts, vanhees, DimReg, Jano and lurflurf. I will need to read this thread a couple more times before I am ready to frame my doubts regarding your posts.

lurflurf · Jun 21, 2013

dreamLord said:

Now he goes on to say that the above statement can also be written as f(0)*[itex]\delta[/itex](x) = 0. My question is - we could also have written it as f(29.5)*[itex]\delta[/itex](x) = 0 for x[itex]\neq[/itex]0, right? So then why did we choose f(0)?

Do you know about the Riemann–Stieltjes integral?
By convention the spike is at x=0. Since the purpose of δ(x) is to evaluate f(x) near x=0, it does not care what f does away from zero, much like

$$\lim_{x \rightarrow 0} \mathop{f}(x)$$

dreamLord · Jun 21, 2013

No lurflurf, I do not know what that integral is.

By the way, an immediate question regarding your post (#9) ; how did you proceed in the second last step? That is :
∫(δ(x)(f(x)−f(0)))dx=(f(0)−f(0))=0

Thanks for telling me the purpose of the delta function - I did not understand why Griffiths brought it up in the first place!

DimReg · Jun 21, 2013

dreamLord said:

No lurflurf, I do not know what that integral is.

By the way, an immediate question regarding your post (#9) ; how did you proceed in the second last step? That is :
∫(δ(x)(f(x)−f(0)))dx=(f(0)−f(0))=0

Thanks for telling me the purpose of the delta function - I did not understand why Griffiths brought it up in the first place!

I showed the algebraic steps required in my first reply. Basically, f(0) is a constant, and integrals are linear, so:

[itex] \int(\delta(x)(f(x) - f(0)))dx = \int \delta(x) f(x)dx - \int \delta(x) f(0) dx = \int \delta(x) f(x) dx - f(0) \int \delta(x) dx = f(0) - f(0) [/itex]

Where in the last step I used ∫f(x)δ(x)dx = f(0) for the first term and ∫δ(x)dx = 1 for the second term

Fredrik · Jun 21, 2013

One thing that I think should be mentioned is that when ##\delta## is defined as a function that takes test functions to numbers, the definition can be written as ##\delta(f)=f(0)## for all test functions f. The notation ##\delta(f)## is far more natural than ##\int \delta(x)f(x)dx##. The reason that the latter is used must be that distributions were invented to make sense of expressions like ##\int \delta(x)f(x)dx##, which were already used in non-rigorous calculations.

So ##\int\delta(x)f(x)dx## isn't an integral of the product of a distribution and a function. It's just a notation that means ##\delta(f)##.

For each real number x, define ##\delta_x## by ##\delta_x(f)=f(x)## for all test functions f. Define the notation ##\int f(x)\delta(x-y)dx## to mean ##\delta_y(f)##. This ensures that ##\int f(x)\delta(x-y)dx=f(y)##.
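For anyone who finds code easier to read than notation, here is a minimal sketch of these definitions. The names `delta_at`, `delta` and the example function `f` are my own choices for illustration; the point is only that a distribution takes a whole function as input and returns a number.

```python
import math

def delta_at(y):
    """Return the functional delta_y, which maps a function f to the number f(y)."""
    def functional(f):
        return f(y)
    return functional

delta = delta_at(0.0)  # the ordinary Dirac delta is the special case y = 0

def f(x):
    # Hypothetical test function, chosen only for illustration.
    return (1.0 + x) * math.exp(-x * x)

print(delta(f))          # the value f(0)
print(delta_at(2.0)(f))  # the value f(2)
```

Note that no integral is ever computed here: the integral notation is just another way of writing the call `delta_at(y)(f)`.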

dreamLord · Jun 21, 2013

Thanks DimReg, I understand the step now.

Fredrik ; so does that mean that if I take f(x) = 2x - 5, then δ(f) = f(0) = -5 ?
Also, in your last 2 lines, why did you change your definition from δ(f) = f(0) to δ(f) = f(x)?

By the way, thanks for moving the thread to the correct section and also for fixing the typo!

dreamLord · Jun 21, 2013

vanhees71 said:

Now, since [itex]f[/itex] is continuous and we let [itex]\epsilon \rightarrow 0^+[/itex], you get
[tex]\lim_{\epsilon \rightarrow 0^+} I_{\epsilon}=f(0).[/tex]
This means that in the sense of a weak limit you may write
[tex]\lim_{\epsilon \rightarrow 0^+} \delta_{\epsilon}(x)=\delta(x).[/tex]
"Weak limit" means that you have to follow the above given procedure: You first have to take an integral with a test function and then take the limit. This is the crucial point.

You lost me in this specific paragraph. Why is epsilon approaching 0 from the + side? And if it is, how does the next equation follow?

Fredrik · Jun 21, 2013

dreamLord said:

Fredrik ; so does that mean that if I take f(x) = 2x - 5, then δ(f) = f(0) = -5 ?

Yes.

dreamLord said:

Also, in your last 2 lines, why did you change your definition from δ(f) = f(0) to δ(f) = f(x)?

I didn't, I defined infinitely many new distributions, one for each real number. Only one of them (##\delta_0##) is equal to ##\delta##.

dreamLord · Jun 21, 2013

So it is also true that δ(f) = f(1) = -3 ? If I take f(x) = 2x - 5.

Fredrik · Jun 21, 2013

dreamLord said:

So it is also true that δ(f) = f(1) = -3 ? If I take f(x) = 2x - 5.

No, by my definitions ##\delta(f)=\delta_0(f)=f(0)=-5##, but ##\delta_1(f)=f(1)=-3##.

I don't know if anyone else uses this notation by the way. I just think it's a good way to make sense of expressions of the form ##\int f(x)\delta(x-y)dx## where y is a real number.

Jolb · Jun 21, 2013

I have never seen Fredrik's notation, and I can't really make any sense of it. The Dirac delta is never equal to anything besides 0 or infinity. In fact I often use the identity
[tex] \delta(f(x))=\sum_{\{\tilde{x}|f(\tilde{x})=0\}}\frac{\delta(x-\tilde{x})}{\left| \left.\frac{df}{dx}\right|_{\tilde{x}} \right|} [/tex]

If you're having trouble reading that, it just says that to find the Dirac delta whose argument is a function, you find all the zeros of the function and then form a sum of Dirac deltas, one located at each zero, each divided by the absolute value of the function's derivative at that zero. (This assumes the derivative is nonzero at every zero.)

That's actually a rigorous statement that follows from the most common definition of the dirac delta function:
[tex]
\delta(x):=\lim_{\alpha\rightarrow\infty}\sqrt{\frac{\alpha}{\pi}}e^{-\alpha x^2}
[/tex]

This definition works better than the limit of rectangular functions since you can find the derivative with this one.
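As a rough numerical check of the identity, one can plug the Gaussian above with a large but finite α into an integral against some smooth function. Everything specific in this sketch is an assumption chosen for illustration: α = 10^4, the integration range, f(x) = x² − 1 (zeros at ±1, where |df/dx| = 2) and the function `g` being sampled.

```python
import math

def nascent_delta(u, alpha=1.0e4):
    """Gaussian approximation sqrt(alpha/pi) * exp(-alpha * u^2) to delta(u)."""
    return math.sqrt(alpha / math.pi) * math.exp(-alpha * u * u)

def integrate(func, a=-3.0, b=3.0, n=120001):
    """Composite trapezoid rule on [a, b] using n sample points."""
    step = (b - a) / (n - 1)
    total = 0.5 * (func(a) + func(b))
    for i in range(1, n - 1):
        total += func(a + i * step)
    return total * step

def f(x):
    return x * x - 1.0  # zeros at x = -1 and x = +1, with |df/dx| = 2 at both

def g(x):
    # Hypothetical smooth function that the delta gets integrated against.
    return (2.0 + x) * math.exp(-x * x)

lhs = integrate(lambda x: nascent_delta(f(x)) * g(x))  # integral of delta(f(x)) g(x)
rhs = g(1.0) / 2.0 + g(-1.0) / 2.0                     # sum over zeros of g / |df/dx|

print(lhs, rhs)
```

For finite α the two numbers agree only approximately; the agreement should improve as α grows, provided the integration grid stays fine enough to resolve the peaks.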

dreamLord · Jun 21, 2013

I can't quite understand what delta-not and delta-one are, Fredrik (apologies, I can't use LaTex currently). Can you explain what they stand for?

Jolb ; why do we need to find the zeroes of the function? I thought the delta function was valid for all x?

I have never encountered such a vague and confusing topic in maths so far - which probably means I haven't done much, but either way, I am thoroughly confused. I'm not even sure I know why we need the delta function.

Jolb · Jun 21, 2013

The reason you need to find the zeros of the function in the argument of the Dirac delta is because the Dirac delta only "fires" when its argument is zero. Whenever the Dirac delta's argument is nonzero, the Dirac delta is equal to zero, and does nothing interesting. When its argument is zero, the Dirac delta does interesting things.

To explain this and the OP in a dumbed-down way, there's a great mnemonic to help with this. Sometimes people call the Dirac delta the "sampling function." If you have any function f(x) and you want to somehow pull out its value at a point x', you can get it by "sampling" it with the Dirac delta:

f(x') = ∫δ(x-x') f(x) dx

dreamLord · Jun 21, 2013

By argument equaling zero, you mean the function that is multiplied with it - like f(x), should be 0 right? If that is the case, then why do we have expressions like the one in post #2 by vanhees? How are they relevant? Under the integral, we don't have f(x) = 0, which it ought to be for the Dirac function to be 'interesting' as you put it.

WannabeNewton · Jun 21, 2013

dreamLord said:

I'm not even sure I know why we need the delta function.

Most physics books at the undergrad level will thoroughly butcher the definition and unfortunately the rigorous formulation requires some advanced mathematics (distribution theory). For now, can you at least see the physical motivations for it? Recall Griffiths' motivation, which is the apparent vanishing divergence of the Coulomb field at all points in space even when there is a localized point charge which should technically contribute to the divergence via Gauss's law.

dreamLord · Jun 21, 2013

Yes, I understood how the divergence was vanishing everywhere except at r = 0. Does that mean that the divergence of Electric Field is a Dirac Delta function?

Also, Wannabe, aren't you an undergrad ? How are you so goddamn knowledgeable!

Jolb · Jun 21, 2013

dreamLord said:

By argument equaling zero, you mean the function that is multiplied with it - like f(x), should be 0 right? If that is the case, then why do we have expressions like the one in post #2 by vanhees? How are they relevant? Under the integral, we don't have f(x) = 0, which it ought to be for the Dirac function to be 'interesting' as you put it.

No... the "argument" of a function is what you stick into it, not what you multiply it with.

So if we have the expression
f(p)
then f is the function and p is its argument.

So δ(f(x)) is the Dirac delta with the argument f(x). This is completely different from things like δ(x)f(x). The latter is what appears in the sampling equation f(x') = ∫δ(x-x') f(x) dx.

dreamLord · Jun 21, 2013

Sorry, I thought δ(f(x)) = ∫f(x)δ(x)dx - which I see makes no sense.

WannabeNewton · Jun 21, 2013

dreamLord said:

Yes, I understood how the divergence was vanishing everywhere except at r = 0. Does that mean that the divergence of Electric Field is a Dirac Delta function?

Yes, at least for the case I mentioned above. Can you see why intuitively? Recall that ##\nabla \cdot E = \frac{\rho}{\epsilon_0}##. Now for the Coulomb field the source is just a single point charge ##Q## at, say, ##r = 0##. How in the world are we going to represent the charge density of this thing? It's localized to a point! Well, what we want to do is somehow find a mathematical quantity that represents nothing at every point in space except one, and at this one point there is a sudden spike representing the presence of that point charge.
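If it helps, here is a quick numerical sketch of that picture. The charge value, the sample point and the grid sizes are arbitrary assumptions for illustration. It estimates the divergence of the Coulomb field at a point away from the origin, and the flux of the field through spheres of two different radii.

```python
import math

EPS0 = 8.8541878128e-12  # vacuum permittivity in F/m
Q = 1.0e-9               # hypothetical point charge in C, sitting at the origin

def e_field(x, y, z):
    """Coulomb field of the point charge Q at position (x, y, z)."""
    r = math.sqrt(x * x + y * y + z * z)
    k = Q / (4.0 * math.pi * EPS0 * r ** 3)
    return (k * x, k * y, k * z)

def divergence(x, y, z, h=1e-4):
    """Central-difference estimate of div E at a point away from the origin."""
    d_ex = (e_field(x + h, y, z)[0] - e_field(x - h, y, z)[0]) / (2.0 * h)
    d_ey = (e_field(x, y + h, z)[1] - e_field(x, y - h, z)[1]) / (2.0 * h)
    d_ez = (e_field(x, y, z + h)[2] - e_field(x, y, z - h)[2]) / (2.0 * h)
    return d_ex + d_ey + d_ez

def flux_through_sphere(radius, n_theta=100, n_phi=200):
    """Midpoint-rule estimate of the outward flux of E through a sphere."""
    d_theta = math.pi / n_theta
    d_phi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_theta):
        theta = (i + 0.5) * d_theta
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            # Outward unit normal at this patch of the sphere.
            nx = math.sin(theta) * math.cos(phi)
            ny = math.sin(theta) * math.sin(phi)
            nz = math.cos(theta)
            ex, ey, ez = e_field(radius * nx, radius * ny, radius * nz)
            area = radius ** 2 * math.sin(theta) * d_theta * d_phi
            total += (ex * nx + ey * ny + ez * nz) * area
    return total

print(divergence(0.3, 0.4, 0.5))                    # close to zero
print(flux_through_sphere(0.5), flux_through_sphere(2.0), Q / EPS0)
```

The divergence estimate away from the origin is essentially zero, yet the flux comes out close to Q/ε0 no matter how small or large the sphere is. Everything that contributes to the flux therefore has to sit at the single point r = 0, which is the behaviour the delta function is meant to capture.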

This might help as well: https://www.physicsforums.com/showthread.php?t=695129&highlight=current+wire

For now I would personally just focus on how the dirac delta function is used in electromagnetism (because it's used quite a lot) and what it tries to model physically in terms of charge and current distributions. There's no need to make things any more complicated at this level by going into all the rigorous mathematics behind this.

dreamLord said:

Also, Wannabe, aren't you an undergrad ?

Yessir.

dreamLord · Jun 21, 2013

Yes Wannabe, I understood that part. Post #2 in that link was helpful. I will re-read this thread and Griffiths once again tomorrow, and make what I can of it. Thank you guys.

the_wolfman · Jun 21, 2013

dreamLord said:

I have never encountered such a vague and confusing topic in maths so far - which probably means I haven't done much, but either way, I am thoroughly confused. I'm not even sure I know why we need the delta function.

In E&M we often use something called a Green's function to solve tricky PDEs. This is a very high-level technique that you don't learn about until graduate-level E&M. The delta function is central to this technique, and Griffiths is introducing it now to give you some exposure to delta functions and hopefully spare you some pain when you take Jackson E&M.

I'm going to outline the technique below to illustrate why delta functions are important. When learning advanced math I find it instructive to study physical problems where the math is applicable. If it's above you, don't worry about it.

A common problem in electrostatics is to solve for the potential [itex] \phi(\vec x) [/itex] (and thus the Electric Field) of a given charge distribution [itex] \rho(\vec x) [/itex].

This problem amounts to solving Poisson's equation:
[itex]\nabla^2 \phi(\vec x) = -\frac{ \rho(\vec x)}{\epsilon} [/itex] subject to certain boundary conditions.

One trick to solve this equation is to use Green's functions.

We start by solving a modified equation:
[itex]\nabla^2 G(\vec x, \vec x') = \delta(\vec x - \vec x') [/itex]

Here G is a Green's function and [itex] \vec x' [/itex] is a dummy variable.

Now we can use G to solve the original equation by noting:
[itex]\int d^3x' \nabla^2 G(\vec x, \vec x') \frac{ \rho(\vec x')}{\epsilon} = \int d^3x' \delta(\vec x - \vec x')\frac{ \rho(\vec x')}{\epsilon} [/itex]

Using the properties of the delta function this becomes:
[itex]\int d^3x' \nabla^2 G(\vec x, \vec x') \frac{ \rho(\vec x')}{\epsilon} =\frac{ \rho(\vec x)}{\epsilon} [/itex]
Using Poisson's equation we can then equate:
[itex]\int d^3x' \nabla^2 G(\vec x, \vec x') \frac{ \rho(\vec x')}{\epsilon} = -\nabla^2 \phi(\vec x) [/itex].
And finally we pull [itex]\nabla^2[/itex] outside the integral because it acts on x, not on the integration variable x':
[itex]\nabla^2 \int d^3x' G(\vec x, \vec x') \frac{ \rho(\vec x')}{\epsilon} = -\nabla^2 \phi(\vec x) [/itex].
Or, provided both sides satisfy the same boundary conditions,
[itex] -\int d^3x' G(\vec x, \vec x') \frac{ \rho(\vec x')}{\epsilon} = \phi(\vec x) [/itex].

What we have done is split up the task into two steps.
1) Solve [itex]\nabla^2 G(\vec x, \vec x') = \delta(\vec x - \vec x') [/itex] for G.
2) After solving for G we integrate [itex] -\int d^3x' G(\vec x, \vec x') \frac{ \rho(\vec x')}{\epsilon} [/itex] giving us [itex] \phi(\vec x) [/itex].

This works because it's often easier to solve the PDE for G than it is to solve for [itex] \phi(\vec x) [/itex].

Also once we solve the PDE for G for a given geometry and boundary conditions we can use the same G to solve for [itex] \phi(\vec x) [/itex] for a bunch of different charge distributions. This saves us a lot of work, because integrating a function is many times easier than solving a PDE.
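Here is a toy one-dimensional sketch of the same two-step recipe, not the real 3D electrostatics problem. It assumes the model equation u''(x) = s(x) on [0, 1] with u(0) = u(1) = 0, where s stands for whatever appears on the right-hand side; the function names and the two sources are made up for illustration. For this toy problem the Green's function satisfying G'' = δ(x − x') with G = 0 at both ends is known in closed form.

```python
import math

def green(x, xp):
    """Green's function of d^2/dx^2 on [0, 1] with G = 0 at x = 0 and x = 1."""
    return x * (xp - 1.0) if x <= xp else xp * (x - 1.0)

def solve(source, x, n=2000):
    """u(x) = integral over x' of G(x, x') * source(x'), by the midpoint rule."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        xp = (i + 0.5) * h
        total += green(x, xp) * source(xp)
    return total * h

def constant_source(xp):
    return 2.0  # exact solution of u'' = 2 with these boundaries: u = x^2 - x

def sine_source(xp):
    return -math.pi ** 2 * math.sin(math.pi * xp)  # exact solution: u = sin(pi x)

# Step 1 (finding G) is done once; step 2 is just an integral per source.
print(solve(constant_source, 0.3), 0.3 ** 2 - 0.3)
print(solve(sine_source, 0.3), math.sin(math.pi * 0.3))
```

The same `green` is reused for both sources, which is the labor-saving point made above: only the integral changes when the source changes.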

Fredrik · Jun 21, 2013

dreamLord said:

I can't quite understand what delta-not and delta-one are, Fredrik (apologies, I can't use LaTex currently). Can you explain what they stand for?

I defined them in post #14. Not sure what more I can say, unless you explain what issues you're having with the definition.

dreamLord said:

I have never encountered such a vague and confusing topic in maths so far

Nothing is butchered quite as badly by mediocre physics texts as the Dirac delta and tensors. Not sure which is worse. I remember being a lot more frustrated about the tensors actually, so maybe that's worse.

dreamLord · Jun 21, 2013

That usage of the Dirac function is quite beautiful wolfman, I like it very much. Thanks!

Fredrik · Jun 21, 2013

Jolb said:

I have never seen Fredrik's notation, and I can't really make any sense of it. The Dirac delta is never equal to anything besides 0 or infinity.

The second sentence here suggests that you're thinking of ##\delta## as a function that takes numbers as input. It's not defined that way in any rigorous treatments. It's defined either as a distribution (a function that takes "nice enough" functions to real numbers) or as a measure (a function that takes subsets of ℝ to non-negative extended real numbers that we can think of as the "sizes" of those sets).

WannabeNewton · Jun 21, 2013

Fredrik said:

I remember being a lot more frustrated about the tensors actually, so maybe that's worse.

Lol you should see how tensors are defined in Melvin Schwartz' book on Electromagnetism. You will cry. It was some definition in terms of rotations that I had never even seen before, and I first skimmed this book after having done Wald for some time so I was like what the hell is this?!

Jolb · Jun 21, 2013

Fredrik said:

The second sentence here suggests that you're thinking of ##\delta## as a function that takes numbers as input. It's not defined that way in any rigorous treatments. It's defined either as a distribution (a function that takes "nice enough" functions to real numbers) or as a measure (a function that takes subsets of ℝ to non-negative extended real numbers that we can think of as the "sizes" of those sets).

You're right, but this technicality couldn't be any more irrelevant. I gave a definition of the Dirac delta as a limit of functions, and in almost any expression with a Dirac delta distribution, you could approximate the expression to arbitrary accuracy using a function by picking a big enough [itex]\alpha[/itex] in the definition I gave. The subtleties of what you're talking about won't matter at all when the OP is using Griffiths; in fact Griffiths himself says you can treat it like a function in his book.

On the other hand, your definitions are very confusing because you use the notation δ(f) to mean something completely different from δ(f(x)), which is highly nonstandard (at least for people at the level of Griffiths. Maybe it makes sense if you've taken distribution theory, but I'm sure the OP hasn't.) That's what caused the OP to get confused:

Sorry, I thought δ(f(x)) = ∫f(x)δ(x)dx - which I see makes no sense.
