Of course, it's not enough, rigorously. I said above that it was not mathematically well-defined. You are over-analyzing it. I'm thinking like a physicist, here. I am a mathematician, but I can enter physicist mode and work with things that aren't mathematically well-defined. If we insisted on everything making perfect mathematical sense, then we wouldn't have quantum field theory! In fact, the situation with the delta function was precisely the same until Schwarz came along and was able to formulate it rigorously.
Let's suppose we define a function, I'm going to call it ϕ for now, from the real numbers to the extended reals where
ϕ(x)=0 for all non-zero x, and
ϕ(0)=∞.
This is completely valid. Now let's calculate ∫∞−∞ϕ(x)dx with Riemann sums. ϕ(x∗i) is going to be 0 or infinity. So the Riemann sum (if it existed) can never be equal to 1 and we break the sifting property, which is the most important property of Dirac delta! The sifting property is also needed in the definition itself.
I was well aware of that and that is why I fired away at the poor guy with my distributions.
The thing is, you can get away with non-rigor if you want. But it'll break the intuition you spent years establishing, and at some point you will start asking questions. And OP is at that point. There are self-contained books on this topic, some much easier to read than others, but it looks like OP has already left. Oh well.
I disagree that it will break the intuition that you spent years establishing. Why? Will it fly away? Nope. It will still be there. Is the intuition wrong? Not really, provided you are careful enough. It's just not quite precise. It suffices just to know that you are taking some liberties with rigor. Part of what I tried to convey earlier is that you can get functions that come arbitrarily close to doing what the delta function does. That's part of the intuition. So, there can be more rigor in a non-rigorous approach than you seem to presume.
Edit: At the end of the day, the confusion arises because we write
∫Rf(x)δ(x)dx
If we had chosen a different notation that didn't look like the integral, and reduces to the integral for ordinary functions, nobody would have a problem. Sometimes I think bra-ket needs more love.
Well, since I know something about Hilbert spaces, I sort of have an inner product in the back of my mind. But if you think of it as a weak limit, the integral doesn't seem like such a stretch. At the end of the day, the confusion only arises because people just don't think deeply enough. And that's different from rigorously enough.
Back when I was an electrical engineering student, there were delta functions all over the place. I didn't know anything about distributions. I knew that it didn't quite make mathematical sense, but I always had that intuition that the delta function was some sort of limit. Which it is. A weak limit. Turns out, there's nothing wrong with the way I was thinking of it, and it could be made perfectly precise and rigorous.
You can't ask engineers and probably most physicists to learn distributions. Forget it. They won't be interested. And they can get by just fine without it.