Dirac Delta Function: Explanation & Usage

Summary
The Dirac Delta function is a distribution, not a conventional function, defined primarily through its integral properties, specifically that integrating it against a test function yields the value of that function at zero. The discussion clarifies that while it is often described as being zero everywhere except at zero, this characterization is an intuitive simplification rather than a precise definition. The choice of f(0) in expressions involving the Dirac Delta function is crucial because it reflects the function's behavior at the point of interest, while values away from zero do not contribute to the integral. Misunderstandings often arise from imprecise definitions in introductory texts, which can confuse students about the nature of distributions. Understanding the Dirac Delta function requires familiarity with its integral properties and the concept of distributions in mathematical analysis.
dreamLord
I know this probably belongs in one of the math sections, but I did not quite know where to put it, so I put it here, since I am studying electrodynamics from Griffiths, and in the first chapter he talks about the Dirac delta function.

From what I've gathered, the Dirac delta function is 0 for ##x \neq 0## and ##\infty## for ##x = 0##.

Now he assumes any function ##f(x)##, and says that the product ##f(x)\delta(x) = 0## for ##x \neq 0##. Fine, got that.

Now he goes on to say that the above statement can also be written as ##f(0)\delta(x) = 0##. My question is: we could also have written it as ##f(29.5)\delta(x) = 0## for ##x \neq 0##, right? So then why did we choose ##f(0)##?
 
First of all it is very important to understand that ##\delta## is not a function but a distribution. It is defined as a linear form on an appropriate space of functions, e.g., the infinitely many times differentiable functions with compact support, or the rapidly falling functions (Schwartz space). It is defined by
$$\int_{\mathbb{R}} \mathrm{d}x\, f(x) \delta(x)=f(0).$$
This is not 0.

Sometimes you can simplify equations by the formal setting ##f(x) \delta(x)=f(0) \delta(x)##. Strictly speaking that's not correct, because you cannot integrate the ##\delta## distribution against a constant test function: the constant function does not belong to the test-function space on which the ##\delta## distribution is defined.
 
I'm afraid your first few lines were completely lost on me! Is there any way you can dumb it down a bit?

Also, the equation that you wrote: is this the definition of the Dirac delta function? Or is it the fact that it is 0 when x is not zero and infinity when x is 0? Which one defines it? Or are they the same thing?
 
Griffiths focuses on f(0)δ(x) because f(0) is the only value of f that matters with the Dirac delta. So basically f(0)δ(x) will behave the same way as f(x)δ(x).

This can best be understood under an integral sign, which is the only place the Dirac delta function is precisely defined. You have (edit: you can take these two properties as the definition, but the exact mathematical definition is a bit more complicated):

$$\int \delta(x)\, dx = 1 \qquad \text{and} \qquad \int f(x) \delta(x)\, dx = f(0)$$

So you can write
$$\int f(0) \delta(x)\, dx = f(0) \int \delta(x)\, dx = f(0) \cdot 1 = f(0),$$
which is the same result as for f(x)δ(x).

On the other hand,
$$\int f(1) \delta(x)\, dx = f(1)\int \delta(x)\, dx = f(1),$$
which is not the correct value ##f(0)##, so f(1)δ(x) does not behave like f(x)δ(x).
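To see this concretely, here is a quick numeric sanity check, just a sketch of my own (it assumes Python with numpy, and stands in for δ with a narrow normalized Gaussian): integrating f against the approximate delta reproduces f(0), and f(29.5) never enters.

```python
# Sketch: approximate delta(x) by a narrow normalized Gaussian and check
# that integrating f(x) * delta_eps(x) picks out f(0).
import numpy as np

eps = 1e-3                               # width of the approximate delta
x = np.linspace(-1.0, 1.0, 200001)
dx = x[1] - x[0]
delta_eps = np.exp(-x**2 / (2 * eps**2)) / (eps * np.sqrt(2 * np.pi))

f = lambda t: np.cos(t) + t**3           # any smooth test function

print(np.sum(f(x) * delta_eps) * dx)     # ~ f(0) = 1.0
print(np.sum(delta_eps) * dx)            # ~ 1, the normalization
```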
 
The vanhees71 definition is right. The property "δ = 0 for x ≠ 0, δ = ∞ for x = 0" is just an intuitive description of a sharply peaked function, which is a valid picture of δ only in some situations. For example, it is valid for the charge density distribution of a point-like charged particle. However, when solving for the Green's function of the Schrödinger equation, such a description of δ is incorrect, while the integral property above remains valid.
 
I don't get the impression that the OP is comfortable talking about distributions or Green's functions. If he were, I doubt he would be having trouble with the Dirac delta function.
 
dreamLord said:
I'm afraid your first few lines were completely lost on me! Is there any way you can dumb it down a bit?

Also, the equation that you wrote: is this the definition of the Dirac delta function? Or is it the fact that it is 0 when x is not zero and infinity when x is 0? Which one defines it? Or are they the same thing?
The point is that many introductory physics books confuse their readers with imprecise definitions of what a distribution is. Griffiths seems to be another example. I don't know his E&M book very well, apart from discussions here in the forum.

Objects like the Dirac ##\delta## are so-called distributions. They are defined as mappings from a function space (containing a certain set of functions, called test functions) to the (real or complex) numbers. They can only be defined in a manner that makes sense under an integral, where they are multiplied with a test function, and for the Dirac ##\delta## distribution this definition reads
$$\int_{\mathbb{R}} \mathrm{d}x\, \delta(x) f(x)=f(0).$$
It's the value of the test function at the argument 0.

It is quite obvious that ##\delta(x)## cannot be a function in the usual sense, because you won't find any function with the property given above. However, you can define the ##\delta## distribution as a kind of limit, the so-called weak limit. The idea is to define functions which are sharply peaked around 0, with the integral normalized to 1. The simplest example is the "box function",
$$\delta_{\epsilon}(x)=\begin{cases} 1/(2\epsilon) & \text{for} \quad x \in (-\epsilon,\epsilon), \\ 0 & \text{elsewhere}. \end{cases}$$
The test functions should have "nice properties" to make things convenient. They should still form a vector space of functions, i.e., together with any two functions, their sum and any constant multiple should also belong to this function space. A very convenient choice is Schwartz's space of rapidly falling smooth functions, i.e., functions that are arbitrarily many times differentiable and fall off at infinity faster than any polynomial.

Now we check the integral
$$I_{\epsilon}=\int_{\mathbb{R}} \mathrm{d}x\, \delta_{\epsilon}(x) f(x) = \frac{1}{2 \epsilon} \int_{-\epsilon}^{\epsilon} \mathrm{d}x\, f(x).$$
According to the mean-value theorem for integrals of continuous functions, there is a value ##\xi \in [-\epsilon,\epsilon]## such that
$$I_{\epsilon}=f(\xi).$$
Now, since ##f## is continuous, letting ##\epsilon \rightarrow 0^+## squeezes ##\xi## to 0, and you get
$$\lim_{\epsilon \rightarrow 0^+} I_{\epsilon}=f(0).$$
This means that in the sense of a weak limit you may write
$$\lim_{\epsilon \rightarrow 0^+} \delta_{\epsilon}(x)=\delta(x).$$
"Weak limit" means that you have to follow the above given procedure: You first have to take an integral with a test function and then take the limit. This is the crucial point.

If you now look at what happens in this example, Griffiths' sloppy definition makes some sense, but one has to keep in mind the proper meaning in the sense given above. Obviously our functions ##\delta_{\epsilon}## are concentrated around 0 and become larger the smaller ##\epsilon## gets as we take the limit ##\epsilon \rightarrow 0^+##. At the same time the interval where our function is different from 0 shrinks, and the construction is such that the total area under the graph (which here is a rectangle) stays constant at 1 for all ##\epsilon##. In this sense you may characterize the ##\delta## distribution as Griffiths does. To avoid confusion, however, it's mandatory to learn the proper definition of distributions (also known as "generalized functions").
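For what it's worth, the weak limit can also be watched numerically. The following sketch (my own illustration, assuming Python with numpy; the function name I_eps is mine) evaluates ##I_\epsilon## for the box function at shrinking ##\epsilon## and shows it approaching ##f(0)##:

```python
# Sketch: I_eps = 1/(2 eps) * integral of f over (-eps, eps), i.e. the
# box function delta_eps integrated against a test function f.
import numpy as np

def I_eps(f, eps, n=100001):
    x = np.linspace(-eps, eps, n)
    return np.sum(f(x)) * (x[1] - x[0]) / (2 * eps)

f = lambda t: np.exp(-t**2) * np.cos(3 * t)   # a rapidly falling test function

for eps in [1.0, 0.1, 0.01, 0.001]:
    print(eps, I_eps(f, eps))                 # tends to f(0) = 1 as eps -> 0+
```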

The best book for physicists that I know of is

M. J. Lighthill, Introduction to Fourier Analysis and Generalised Functions, Cambridge University Press (1959)

That it treats distributions together with Fourier series and Fourier integrals is no disadvantage, since you'll need those anyway when studying electrodynamics.
 
dreamLord said:
I know this probably belongs in one of the math sections, but I did not quite know where to put it,
Topology & analysis is the right place for it. I'm moving it there. Edit: I also changed "Diract" to "Dirac" in the title.
 
Yes, Griffiths' explanation is horrible; here is the idea without technicalities.

In finite calculus we define the delta (the Kronecker delta) so that

$$a_0=\sum_{k=-\infty}^\infty \delta_k a_k$$

That's a handy thing to do; it lets us write function evaluation as a sum.
We would like to do the same thing in infinitesimal calculus,

$$f(0)=\int_{-\infty}^\infty \delta(x)\, f(x)\, dx,$$

where we ignore the fact that the delta function does not exist as an ordinary function.

Now we adopt as equality ##f=g## if
$$\int_{-\infty}^\infty \left( f(x) - g(x) \right) dx=0$$

In this sense
$$\delta(x)\, f(x)=\delta(x)\, f(0),$$
since clearly
$$\int_{-\infty}^\infty \left( \delta(x) f(x) - \delta(x) f(0) \right) dx=\int_{-\infty}^\infty \delta(x) \left( f(x) - f(0) \right) dx= f(0) - f(0) =0$$

For some purposes we probably want to adopt the stronger notion of equality ##f=g## if
$$\int_{a}^b \left( f(x) - g(x) \right) dx=0$$
for all ##a## and ##b##.
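Here is the finite-calculus half of that analogy in code (a sketch, assuming Python; the sequence values are made up): the Kronecker delta picks a single term out of a sum exactly the way the Dirac delta picks ##f(0)## out of an integral.

```python
# Sketch: the Kronecker delta turns sequence evaluation into a sum,
# a_0 = sum_k delta_k * a_k.
a = {-2: 7.0, -1: 3.5, 0: -5.0, 1: 2.0, 2: 9.9}  # a few terms of a sequence

def kronecker(k):
    return 1 if k == 0 else 0

print(sum(kronecker(k) * a_k for k, a_k in a.items()))  # -5.0, i.e. a_0
```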
 
  • #10
Things are becoming a little clearer now, though I am still fairly lost. Thank you for the amazing posts, vanhees, DimReg, Jano and lurflurf. I will need to read this thread a couple more times before I am ready to frame my doubts regarding your posts.
 
  • #11
dreamLord said:
Now he goes on to say that the above statement can also be written as ##f(0)\delta(x) = 0##. My question is: we could also have written it as ##f(29.5)\delta(x) = 0## for ##x \neq 0##, right? So then why did we choose ##f(0)##?

Do you know about the Riemann–Stieltjes integral?
By convention the spike is at x = 0. Since the purpose of δ(x) is to evaluate f(x) near x = 0, it does not care what f does away from zero, much like

$$\lim_{x \rightarrow 0} f(x)$$
 
  • #12
No lurflurf, I do not know what that integral is.

By the way, an immediate question regarding your post (#9): how did you proceed in the second-to-last step? That is:
∫(δ(x)(f(x)−f(0)))dx=(f(0)−f(0))=0

Thanks for telling me the purpose of the delta function - I did not understand why Griffiths brought it up in the first place!
 
  • #13
dreamLord said:
No lurflurf, I do not know what that integral is.

By the way, an immediate question regarding your post (#9): how did you proceed in the second-to-last step? That is:
∫(δ(x)(f(x)−f(0)))dx=(f(0)−f(0))=0

Thanks for telling me the purpose of the delta function - I did not understand why Griffiths brought it up in the first place!

I showed the algebraic steps required in my first reply. Basically, f(0) is a constant, and integrals are linear, so:

$$\int \delta(x)\left(f(x) - f(0)\right) dx = \int \delta(x) f(x)\, dx - \int \delta(x) f(0)\, dx = \int \delta(x) f(x)\, dx - f(0) \int \delta(x)\, dx = f(0) - f(0) = 0$$

In the last step I used ∫f(x)δ(x)dx = f(0) for the first term and ∫δ(x)dx = 1 for the second term.
 
  • #14
One thing that I think should be mentioned is that when ##\delta## is defined as a function that takes test functions to numbers, the definition can be written as ##\delta(f)=f(0)## for all test functions f. The notation ##\delta(f)## is far more natural than ##\int \delta(x)f(x)dx##. The reason that the latter is used must be that distributions were invented to make sense of expressions like ##\int \delta(x)f(x)dx##, which were already used in non-rigorous calculations.

So ##\int\delta(x)f(x)dx## isn't an integral of the product of a distribution and a function. It's just a notation that means ##\delta(f)##.

For each real number x, define ##\delta_x## by ##\delta_x(f)=f(x)## for all test functions f. Define the notation ##\int f(x)\delta(x-y)dx## to mean ##\delta_y(f)##. This ensures that ##\int f(x)\delta(x-y)dx=f(y)##.
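One way to internalize this point of view is to code it up. Below is a toy sketch (my own illustration in Python, not any standard library): a distribution is literally a callable that eats a test function and returns a number, and ##\delta_y## returns ##f(y)##.

```python
# Sketch: a distribution as a map from test functions to numbers.
# The notation "integral of f(x) delta(x - y) dx" just means delta_y(f).
def make_delta(y):
    def delta_y(f):
        return f(y)
    return delta_y

delta = make_delta(0.0)       # delta_0 is the Dirac delta itself

f = lambda x: 2 * x - 5       # a sample test function
print(delta(f))               # f(0) = -5
print(make_delta(1.0)(f))     # delta_1(f) = f(1) = -3
```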
 
  • #15
Thanks DimReg, I understand the step now.

Fredrik: so does that mean that if I take f(x) = 2x - 5, then δ(f) = f(0) = -5?
Also, in your last 2 lines, why did you change your definition from δ(f) = f(0) to δ(f) = f(x)?

By the way, thanks for moving the thread to the correct section and also for fixing the typo!
 
  • #16
vanhees71 said:
Now, since ##f## is continuous, letting ##\epsilon \rightarrow 0^+## squeezes ##\xi## to 0, and you get
$$\lim_{\epsilon \rightarrow 0^+} I_{\epsilon}=f(0).$$
This means that in the sense of a weak limit you may write
$$\lim_{\epsilon \rightarrow 0^+} \delta_{\epsilon}(x)=\delta(x).$$
"Weak limit" means that you have to follow the above given procedure: You first have to take an integral with a test function and then take the limit. This is the crucial point.

You lost me in this specific paragraph. Why is epsilon approaching 0 from the + side? And if it is, how does the next equation follow?
 
  • #17
dreamLord said:
Fredrik: so does that mean that if I take f(x) = 2x - 5, then δ(f) = f(0) = -5?
Yes.

dreamLord said:
Also, in your last 2 lines, why did you change your definition from δ(f) = f(0) to δ(f) = f(x)?
I didn't; I defined infinitely many new distributions, one for each real number. Only one of them (##\delta_0##) is equal to ##\delta##.
 
  • #18
So is it also true that δ(f) = f(1) = -3, if I take f(x) = 2x - 5?
 
  • #19
dreamLord said:
So is it also true that δ(f) = f(1) = -3, if I take f(x) = 2x - 5?
No, by my definitions ##\delta(f)=\delta_0(f)=f(0)=-5##, but ##\delta_1(f)=f(1)=-3##.

I don't know if anyone else uses this notation by the way. I just think it's a good way to make sense of expressions of the form ##\int f(x)\delta(x-y)dx## where y is a real number.
 
  • #20
I have never seen Fredrik's notation, and I can't really make any sense of it. The Dirac delta is never equal to anything besides 0 or infinity. In fact I often use the identity
$$\delta(f(x))=\sum_{\{\tilde{x}\,|\,f(\tilde{x})=0\}}\frac{\delta(x-\tilde{x})}{\left|\frac{df}{dx}\Big|_{\tilde{x}}\right|}$$

If you're having trouble reading that, it just says that to evaluate the Dirac delta when the argument is itself a function, you find all the zeros of that function, then form a sum of Dirac deltas, one located at each zero, each divided by the absolute value of the function's derivative at that zero.

That's actually a rigorous statement that follows from the most common definition of the Dirac delta function:
$$\delta(x):=\lim_{\alpha\rightarrow\infty}\sqrt{\frac{\alpha}{\pi}}\, e^{-\alpha x^2}$$

This definition works better than the limit of rectangular functions, since you can take the derivative of this one.
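As a sanity check, the composition rule can be tested numerically with that Gaussian representation at a large but finite ##\alpha## (a sketch assuming Python with numpy; the choices ##f(x)=x^2-1## and ##g(x)=x+3## are mine). For ##f(x)=x^2-1## the zeros are ##\pm 1## with ##|f'(\pm 1)|=2##, so ##\int \delta(f(x))\, g(x)\, dx## should come out to ##(g(1)+g(-1))/2 = 3##.

```python
# Sketch: check delta(f(x)) = sum over zeros x0 of delta(x - x0)/|f'(x0)|
# using the Gaussian representation with a large, finite alpha.
import numpy as np

alpha = 1e6
x = np.linspace(-3.0, 3.0, 600001)
dx = x[1] - x[0]

f = x**2 - 1                                # zeros at x = +1 and x = -1
g = x + 3

delta_of_f = np.sqrt(alpha / np.pi) * np.exp(-alpha * f**2)

print(np.sum(delta_of_f * g) * dx)          # ~ 3.0
print(((1 + 3) + (-1 + 3)) / 2)             # 3.0 from the zero-sum formula
```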
 
  • #21
I can't quite understand what delta-nought and delta-one are, Fredrik (apologies, I can't use LaTeX currently). Can you explain what they stand for?

Jolb: why do we need to find the zeroes of the function? I thought the delta function was valid for all x?

I have never encountered such a vague and confusing topic in maths so far - which probably means I haven't done much, but either way, I am thoroughly confused. I'm not even sure I know why we need the delta function.
 
  • #22
The reason you need to find the zeros of the function in the argument of the Dirac delta is that the Dirac delta only "fires" when its argument is zero. Whenever the Dirac delta's argument is nonzero, the Dirac delta is equal to zero and does nothing interesting. When its argument is zero, the Dirac delta does interesting things.

To explain this and the OP's question in a dumbed-down way, there's a great mnemonic. Sometimes people call the Dirac delta the "sampling function." If you have any function f(x) and you want to somehow pull out its value at a point x', you can get it by "sampling" it with the Dirac delta:

f(x') = ∫δ(x-x') f(x) dx
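In code the sampling picture looks like this (again a sketch of my own, assuming Python with numpy and standing in for δ with a narrow Gaussian): shifting the spike to x' samples f there.

```python
# Sketch: "sample" f at x' by integrating it against a narrow
# approximate delta centered at x'.
import numpy as np

eps = 1e-3
x = np.linspace(-2.0, 2.0, 400001)
dx = x[1] - x[0]

def delta_approx(u):
    return np.exp(-u**2 / (2 * eps**2)) / (eps * np.sqrt(2 * np.pi))

f = lambda t: np.sin(t) + 2.0
x_prime = 0.7

print(np.sum(delta_approx(x - x_prime) * f(x)) * dx)  # ~ f(0.7)
print(f(x_prime))
```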
 
  • #23
By the argument equaling zero, you mean that the function that is multiplied with it, like f(x), should be 0, right? If that is the case, then why do we have expressions like the one in post #2 by vanhees? How are they relevant? Under the integral we don't have f(x) = 0, which it ought to be for the Dirac function to be 'interesting', as you put it.
 
  • #24
dreamLord said:
I'm not even sure I know why we need the delta function.
Most physics books at the undergrad level will thoroughly butcher the definition, and unfortunately the rigorous formulation requires some advanced mathematics (distribution theory). For now, can you at least see the physical motivation for it? Recall Griffiths' motivation: the divergence of the Coulomb field apparently vanishes at all points in space, even though there is a localized point charge which should contribute to the divergence via Gauss's law.
 
  • #25
Yes, I understood how the divergence was vanishing everywhere except at r = 0. Does that mean that the divergence of the electric field is a Dirac delta function?

Also, Wannabe, aren't you an undergrad? How are you so goddamn knowledgeable!
 
  • #26
dreamLord said:
By the argument equaling zero, you mean that the function that is multiplied with it, like f(x), should be 0, right? If that is the case, then why do we have expressions like the one in post #2 by vanhees? How are they relevant? Under the integral we don't have f(x) = 0, which it ought to be for the Dirac function to be 'interesting', as you put it.

No... the "argument" of a function is what you stick into it, not what you multiply it with.

So if we have the expression
f(p)
then f is the function and p is its argument.

So δ(f(x)) is the Dirac delta with the argument f(x). This is completely different from things like δ(x)f(x). The latter is what appears in the sampling equation f(x') = ∫δ(x-x') f(x) dx.
 
  • #27
Sorry, I thought δ(f(x)) = ∫f(x)δ(x)dx - which I see makes no sense.
 
  • #28
dreamLord said:
Yes, I understood how the divergence was vanishing everywhere except at r = 0. Does that mean that the divergence of the electric field is a Dirac delta function?
Yes, at least for the case I mentioned above. Can you see why, intuitively? Recall that ##\nabla \cdot \vec E = \frac{\rho}{\epsilon_0}##. Now for the Coulomb field the source is just a single point charge ##Q## at, say, ##r = 0##. How in the world are we going to represent the charge density of this thing? It's localized to a point! What we want is a mathematical quantity that represents nothing at every point in space except one, with a sudden spike at that one point to represent the presence of the point charge.

This might help as well: https://www.physicsforums.com/showthread.php?t=695129&highlight=current+wire

For now I would personally just focus on how the Dirac delta function is used in electromagnetism (because it's used quite a lot) and what it tries to model physically in terms of charge and current distributions. There's no need to make things any more complicated at this level by going into all the rigorous mathematics behind it.

dreamLord said:
Also, Wannabe, aren't you an undergrad?
Yessir.
 
  • #29
Yes Wannabe, I understood that part. Post #2 in that link was helpful. I will re-read this thread and Griffiths once again tomorrow, and make what I can of it. Thank you guys.
 
  • #30
dreamLord said:
I have never encountered such a vague and confusing topic in maths so far - which probably means I haven't done much, but either way, I am thoroughly confused. I'm not even sure I know why we need the delta function.

In E&M we often use something called a Green's function to solve tricky PDEs. This is a fairly advanced technique that you don't learn about until graduate-level E&M. The delta function is central to this technique, and Griffiths is introducing it now to give you some exposure and hopefully spare you some pain when you take Jackson E&M.

I'm going to outline the technique below to illustrate why delta functions are important. When learning advanced math I find it instructive to study physical problems where the math is applicable. If it's above you, don't worry about it.

A common problem in electrostatics is to solve for the potential ##\phi(\vec x)## (and thus the electric field) of a given charge distribution ##\rho(\vec x)##.

This amounts to solving Poisson's equation,
$$\nabla^2 \phi(\vec x) = -\frac{\rho(\vec x)}{\epsilon},$$
subject to certain boundary conditions.

One trick to solve this equation is to use Green's functions.

We start by solving a modified equation:
$$\nabla^2 G(\vec x, \vec x') = \delta(\vec x - \vec x')$$

Here ##G## is a Green's function and ##\vec x'## is a dummy variable.

Now we can use ##G## to solve the original equation by noting:
$$-\int d^3x'\, \nabla^2 G(\vec x, \vec x')\, \frac{\rho(\vec x')}{\epsilon} = -\int d^3x'\, \delta(\vec x - \vec x')\, \frac{\rho(\vec x')}{\epsilon}$$

Using the properties of the delta function this becomes:
$$-\int d^3x'\, \nabla^2 G(\vec x, \vec x')\, \frac{\rho(\vec x')}{\epsilon} = -\frac{\rho(\vec x)}{\epsilon}$$
Using Poisson's equation we can then equate:
$$-\int d^3x'\, \nabla^2 G(\vec x, \vec x')\, \frac{\rho(\vec x')}{\epsilon} = \nabla^2 \phi(\vec x).$$
And finally we pull ##\nabla^2## outside the integral, because it acts on ##\vec x## while the integration is over ##\vec x'##:
$$\nabla^2 \left(-\int d^3x'\, G(\vec x, \vec x')\, \frac{\rho(\vec x')}{\epsilon}\right) = \nabla^2 \phi(\vec x),$$
or
$$\phi(\vec x) = -\int d^3x'\, G(\vec x, \vec x')\, \frac{\rho(\vec x')}{\epsilon}.$$


What we have done is split the task into two steps.
1) Solve ##\nabla^2 G(\vec x, \vec x') = \delta(\vec x - \vec x')## for ##G##.
2) After solving for ##G##, do the integral ##-\int d^3x'\, G(\vec x, \vec x')\, \frac{\rho(\vec x')}{\epsilon}##, which gives us ##\phi(\vec x)##.

This works because it's often easier to solve the PDE for ##G## than it is to solve for ##\phi(\vec x)## directly.

Also, once we have solved the PDE for ##G## for a given geometry and boundary conditions, we can use the same ##G## to solve for ##\phi(\vec x)## for a bunch of different charge distributions. This saves us a lot of work, because integrating a function is many times easier than solving a PDE.
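To make the two-step recipe concrete, here is a one-dimensional toy version (my own sketch in Python with numpy, with signs and constants simplified to ##u''(x)=\rho(x)## on ##[0,1]## with ##u(0)=u(1)=0##; the Green's function for that problem is ##G(x,x') = \min(x,x')(\max(x,x')-1)##). Step 1 is done analytically, and step 2 is just a numerical integral:

```python
# Sketch: solve u''(x) = rho(x), u(0) = u(1) = 0, via the Green's function
# G(x, x') = min(x, x') * (max(x, x') - 1), which satisfies G'' = delta(x - x').
import numpy as np

def G(x, xp):
    return np.minimum(x, xp) * (np.maximum(x, xp) - 1.0)

rho = lambda t: np.sin(np.pi * t)                 # chosen source term
exact = lambda t: -np.sin(np.pi * t) / np.pi**2   # known solution, for comparison

xp = np.linspace(0.0, 1.0, 20001)
dxp = xp[1] - xp[0]

for x in [0.25, 0.5, 0.75]:
    u = np.sum(G(x, xp) * rho(xp)) * dxp          # step 2: integrate G * rho
    print(x, u, exact(x))                         # the two columns agree
```

The same Green's function then solves the problem for any other source you plug in, which is exactly the labor-saving point made above.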
 
