Then why does the rule single out measurements?
There is no special interaction associated with measurements in classical theory. Look, try to formulate the Born rule without mentioning "measurement" or a macroscopic/microscopic distinction. It is not possible. In contrast, the rules for classical mechanics can be formulated without mentioning measurement. That doesn't mean that there are no measurements in classical mechanics, but that measurements are a derived concept, not a primitive concept.
We could try to make measurements a derived concept in QM, as well.
What is a measurement? Well, a first cut at this is that a measurement is an interaction between one system (the system to be measured) and a second system (the measuring device) so that the quantity to be measured causes a macroscopic change in the measuring device. Roughly speaking, we have:
- A system being measured that has some operator O with eigenvalues o_i.
- A measuring device, which has macroscopic states "S_{ready}" (for the state in which it has not yet measured anything) and "S_i" (for the state of having measured value o_i; each macroscopic state will correspond to a huge number of microscopic states.)
- The device is metastable when in the "ready" state, meaning that a small perturbation away from the ready state will lead to an irreversible, entropy-increasing transition to one of the states S_i.
- The interaction between system and measuring device is such that if the system is in a state having eigenvalue o_i, then the device is overwhelmingly more likely to end up in the state S_i than any other macroscopic state S_j. (There might be other macroscopic states representing a failed or ambiguous measurement, but I'll ignore those for simplicity.)
Note: There is a bit of circularity here, in that to really make sense of the Born rule, I have to define what a measuring device is, and to define a measuring device, I have to refer to probability (which transitions are much more likely than which others), which in quantum mechanics has to involve the Born rule. What we can do, though, is treat the fact that the device makes a transition to state S_i when it interacts with a system in a pure state with eigenvalue o_i as initially being a matter of empirical observation, or it can be derivable from classical or semiclassical physics.
Then the Born rule implies that if the system to be measured is initially in a superposition of states of the form: \sum_i c_i |\psi_i\rangle where |\psi_i\rangle is an eigenstate of O with eigenvalue o_i, then the measuring device will, upon interacting with the system, make a transition to one of the macroscopic states S_i with a probability given by |c_i|^2.
Now, at this point, I think we can see where the talk about "measurement" in quantum mechanics is something of a red herring. Presumably, the fact that a pure eigenstate |\psi_i\rangle of the system triggers the measuring device to go into macroscopic state S_i is in principle derivable from applying Schrodinger's equation to the composite system, if we know the interaction Hamiltonian. But because of the linearity of quantum evolution, one would expect that if the initial state of the system were a superposition \sum_i c_i |\psi_i\rangle, then the final state of the measuring device would be a superposition of different macroscopic states, as well. (Okay, decoherence will actually prevent the device from being in a superposition, but the combination system + device + environment would be in a superposition.) So the Born rule for measurements can be re-expressed in what I think is an equivalent form that doesn't involve measurements at all:
If a composite system is in a state of the form |\Psi\rangle = \sum_i c_i |\Psi_i\rangle, where for i \neq j, |\Psi_i\rangle and |\Psi_j\rangle represent macroscopically distinguishable states, then that means that the system is in exactly one of the states |\Psi_i\rangle, with probability |c_i|^2
Note the difference with the usual statement of the Born rule: I'm not saying that the system will be measured to have some eigenvalue \lambda_i with probability |c_i|^2, because that would lead to an infinite regress. You would need a microscopic system, a measuring device to measure the state of the microscopic system, a second measuring device to measure the state of the first measuring device, etc. The infinite regress must stop at some point where something simply HAS some value, not "is measured to have some value".
But with this reformulation of Born's rule, which I'm pretty sure is equivalent to the original, you can see that the macroscopic/microscopic distinction has to be there. You can't apply the rule without the restriction that i \neq j implies |\Psi_i\rangle is macroscopically distinguishable from |\Psi_j\rangle. To see this, take the simple case of a single spin-1/2 particle, where we only consider spin degrees of freedom. A general state can be written as |\psi\rangle = \alpha |u\rangle + \beta |d\rangle. Does that mean that the particle is "really" in the state |u\rangle or state |d\rangle, we just don't know which? No, it doesn't mean that. The state |\psi\rangle is a different state from either |u\rangle or |d\rangle, and it has observably different behavior.
If you eliminate "measurement" as a primitive, then you can't apply the Born rule without making a macroscopic/microscopic distinction.