samalkhaiat said:
3) Given a generalized function f \in \mathcal{D}^{\prime}(\mathbb{R}^{n}) and a good function a \in C^{\infty}(\mathbb{R}^{n}), how do you know that the Leibniz rule holds for the differentiation of the product af? You don’t. But, if you use the definition (1), you can show that \partial^{m}\left( af \right) = \sum_{k \leq m} \begin{pmatrix} k \\ m \end{pmatrix} \ \partial^{k}a \ \partial^{m - k}f .
To properly answer this, first let's set up the proper definitions so everything checks out latter.
Let V be a vector space over the scalar field F which can be either \mathbb{R} or \mathbb{C}. Given two elements of V such as u, v \in V be elements of V, then we define the inner product as map
$$(.,.): V \times V \to F$$
$$ u, v \to (u, v) \in F$$
such that it satisfies the following properties:
1. Conjugate symmetry:
$$(u, v) = (u, v)^{*}$$
where the notation means that given z \in \mathbb{C}; then z^{*} is just the complex-conjugate of z.
2. Linearity in the first argument:
$$\alpha, \beta \in F; \quad u, v, b \in V; \quad (\alpha u + \beta v, b) = \alpha(u, b) + \beta(v, b)$$
3. Positive definite:
$$(x, x) \ge 0 \quad \forall x \in V \setminus \{0\}$$
Now, let \mathcal{D}(\mathbb{R}^n) be the vector space over the field \mathbb{R} (if it was over the field \mathbb{C}, then we'd say \mathcal{D}(\mathbb{C}^n)) such that its elements are "nice" functions \varphi :\mathbb{R}^n \to \mathbb{R}. We say that a function is "nice" if:
- It's smooth
- It has compact support
And finally, given f, g \in \mathcal{D}(\mathbb{R}^n), we take the our inner product to be:
$$(f, g) := \displaystyle\int d^n x \bigg( f(x) g(x) \bigg)$$
With all this in place, we can finally define a "distribution". If V = \mathcal{D}(\mathbb{R}^n) and V^{*} is its dual space, then T_f \in V^{*} is a linear form such that if f, \phi \in V are nice functions, then \quad T_f[\phi] = (f, \phi) \in \mathbb{R}. We call T_f a "distribution", but usually we abuse the notation by calling f itself the distribution. Because of that, it is desirable to define the derivative of a distribution such that \partial (T_f) = T_{\partial f}. Using integration by parts and the fact that the nice functions have compact support:
$$(\partial f, \phi) = \displaystyle\int d^n x \bigg( \partial f(x) \phi(x) \bigg) = \cancel{\bigg[ f(x) \phi(x) \bigg]_{-\infty}^{\infty}} - \displaystyle\int d^n x \bigg( f(x) \partial \phi(x) \bigg) = -(f, \partial \phi)$$
$$\implies T_{\partial f}[\phi] = -T_{f}[\partial \phi]$$
From now on we'll abuse the notation too and refer to the distribution T_f just as f.
Now, before proving the statement, I would like to point out that there's an error in the binomial number that you've written there, the m and k are swapped. I think the correct formula is:
$$\partial^{m}\left( af \right) = \sum_{k \leq m} \begin{pmatrix} m \\ k \end{pmatrix} \ \partial^{k}a \ \partial^{m - k}f = \sum_{k = 0}^m \begin{pmatrix} m \\ k \end{pmatrix} \ \partial^{k}a \ \partial^{m - k}f$$
based on what I see on wikipedia
here.
Anyway, let's begin with the proof:
We'll first prove that the distribution f satisfies the Leibniz rule as usual \partial(af) = (\partial a) f + a(\partial f) . Then the proof for the given formula for a function a and a functional f is the same as it would be for two functions.
Let f(x), \varphi(x) \in V be nice functions and f be a distribution. Also let \varphi(x) = a(x) \phi(x) where a, \phi \in V. We'll understand that when we write f(x) we refer to the function and when we write f or f[.] we refer to the distribution.
$$f[\varphi] = (f(x), \phi(x)) = \displaystyle\int d^n x \bigg( f(x) \varphi(x) \bigg) = \displaystyle\int d^n x \bigg( a(x) f(x) \phi(x) \bigg) = $$
$$ = \displaystyle\int d^n x \bigg( [a(x) f(x)] \phi(x) \bigg) = (af)[\phi] \equiv g[\phi]$$
where we have defined the distribution g as af and the function g(x) := a(x)f(x). Now
$$(\partial f, \varphi) = \displaystyle\int d^n x \bigg( \partial f(x) \varphi(x) \bigg) = -\displaystyle\int d^n x \bigg( f(x) \partial\{ a(x) \phi(x) \} \bigg) = $$
$$= -\displaystyle\int d^n x \bigg( \partial a(x) f(x) \phi(x) \bigg) -\displaystyle\int d^n x \bigg( g(x) \partial \phi(x) \bigg) = -(\partial a(x) f(x), \phi(x)) -(g(x), \partial \phi(x))$$
$$(\partial f, \varphi) = \displaystyle\int d^n x \bigg( \partial f(x) \varphi(x) \bigg) = \displaystyle\int d^n x \bigg( [a(x) \partial f(x)] \phi(x) \bigg) = (a(x)\partial f(x), \phi(x))$$
$$(a\partial f)[\phi] = -(\partial af)[\phi] - g[\partial \phi] = -(\partial af)[\phi] + (\partial g)[\phi] \implies$$
$$\implies \bigg( \partial(af) \bigg) = \bigg( (\partial a) f \bigg) + \bigg(a(\partial f) \bigg)$$
and so we've proven that distributions obey the Leibniz rule.
Q.E.D.
And so, because distributions behave the same way under differentiation as functions do, you can copy-paste the proof for the formula
$$\partial^{m}\left( af \right) = \sum_{k \leq m} \begin{pmatrix} m \\ k \end{pmatrix} \ \partial^{k}a \ \partial^{m - k}f = \sum_{k = 0}^m \begin{pmatrix} m \\ k \end{pmatrix} \ \partial^{k}a \ \partial^{m - k}f$$
for a and f being functions (which is trivial) into the one where they are distributions; thus proving the formula for distributions as well.
You can check that proof
here.