Nathan Warford said:
This is a step in the right direction. I get the feeling that part of the confusion comes from the fact that QFT and SM are used for different purposes. QFT is used to explain what gives rise to particles, SM is used to explain how particles interact. That's the sense that I get, at least. Correct me if I'm wrong.
Not quite. QFT is the general mathematical framework, in which we formulate physical models/theories. It is highly flexible and finds application not only in high energy physics, but also in condensed matter theory and statistical mechanics.
The standard model is a particular(!) quantum field theory, which describes the fundamental interactions (electromagnetic, weak and strong interactions) and the fundamental particle-fields that couple to them. The best way to think about particles is indeed in terms of excitations of the basic fields.
Maybe as a really simple example from condensed matter physics:
Think about a solid crystal, ie. a huge number of atoms arranged on a lattice, held together via springs. Now laying a coordinate system above this crystal, we might assign some number φ(x, y, z) to each spring, describing their compression/extension from the equilibrium position. Until now, the possible coordinates (x,y,z) are discrete (we had a lattice), but, if we zoom out far enough, we can choose to forget about this discreteness and just think about φ as being a continuous function (a "field"). Now, in an equilibrium situation, the field value should just vanish everywhere, φ = 0. But if i act on my crystal with a force on a few atoms, eg. i displace a single atom from its equilibrium position, pressure waves will travel through it. In the φ-language, it means that φ(x,y,z,t) is now some wave-like function of time. These would be the excitations of this "spring field". It is most useful to write φ(x,y,z,t) now as a sum of planar waves, ie. Fourier decomposition. These planar waves are in some sense the most elemenatry excitations of φ, as they have a definite momentum (or wave-vector). In the particle language, we call these excitations phonons, or more generally quasi-particles (as they are not excitations of some fundamental field).
In this example, it is obvious, that these phonons are not pointlike at all. But the creation of them might look point-like (the displacement of a single atom was point-like in the continuum limit, but if we zoom back in, we see that it's not really point-like at all). So the interaction of the phonon-field with "sources" can be point-like, but for the phonons themself, the term doesn't really make sense.
Going back to particle physics: The fundamental fields are, as far as we know, not just an emergend property of an underlying crystal of some sort, but the concepts stay the same. The basic excitations of the electron field, ie. the electrons, are no more point-like than the phonons were. But the interaction with another field (eg. the electromagnetic field/photon field) can be point-like (replace the displacement of a single atom in the above example with the interaction between the fields). So that's what people mostly mean when they speak about point-like particles: Interactions between fields are point-like, and there is no "zooming in" like in the crystal/example, that would resolve this point-likeness (At least to the experimental precision that we have achieved so far).