I think you still avoid the relevant problem. A photon certainly carries energy. If the photon is detected, all of the energy arrives exactly at the position of the detector. One may naively assume that the energy travels to the detector in a localized manner. Then the appearance of the double slit pattern is puzzling. One may also naively assume that the energy is somehow delocalized within the wave. In that case at the time of the detection, all the other positions of the wavefront must be "updated" instantaneously in order to ensure that the photon gets detected only once at the position of the detector. In this scenario the double slit pattern is trivial, but you run into problems with relativity and have non-locality as this "update" would happen faster than the speed of light. Of course you may also assume that this property is simply undefined between emission and detection. However, not everybody considers this as a satisfying answer.