rede96 said:
just because we can't find a classical equivalent doesn't mean the properties of the entangled particles aren't set at pair production.
What does it mean for "the properties of the particles to be set at pair production"? This is one of the key questions (to my mind
the key question) that Bell's analysis illuminated. He gave a very simple mathematical definition of what this means: it means that the expression for the probability of a given pair of measurement results (one for each particle) must
factorize.
In other words, if we have two particles ##a## and ##b##, and two measuring devices ##A## and ##B##, then the most general expression for the probability of a given pair of results ##a, b## is (using sloppy notation, but hopefully the meaning is clear)
$$
P(a, b) = F(A, B, \lambda)
$$
where the ##F## on the RHS is some function, and its arguments are: ##A##, the settings of measuring device ##A##; ##B##, the settings of measuring device ##B##; and ##\lambda##, which is just a general expression for any "hidden variables" that might affect the result. Any "properties of particles set at pair production" would be included in ##\lambda##.
Now, what Bell proposed was that, if the properties of the particles are "set at pair production", then the expression for ##P(a, b)## should factorize into two functions, as follows:
$$
P(a, b) = F_a(A, \lambda) F_b(B, \lambda)
$$
This seems natural enough: the measurement of particle ##a## should not "care" what the settings of measurement device ##B## are, and vice versa. But Bell proved that
no function of this form can reproduce the predictions of quantum mechanics. In other words, any function that can reproduce the predictions of QM
must be of the first form above,
not the second, factorized form.
The beauty of this is that it replaces a vague set of ordinary language words that are hard to interpret (your expression was "the properties of the particles are set at pair production", but other words like "locality" are also used) with a precise mathematical criterion that is easy to check. So perhaps it will help if you focus on the question: what are the implications of the mathematical statements I made above? What is QM telling us when it tells us we can't make the right predictions if we try to factorize the probability ##P(a, b)## for a given set of joint measurement results?