ThomasT said:
One of my contentions is that Bell's theorem doesn't actually make a locality assumption. If you think it does, then point out where you think it is in his formulation.
If Bell's locality condition isn't, in reality, a locality condition, then Bell's theorem doesn't contradict locality.
Bell uses TWO assumptions to be able to write:
P(A,B,lambda) = P(A,lambda) P(B,lambda).
The first assumption is locality. Now, locality has a slightly different definition depending on whether we have to do with a deterministic theory or with a stochastic theory. In a deterministic theory, the definition is simple: the time evolution of an ontological physical quantity at a space(time) point is entirely determined by the values of the ontological physical quantities defined in a close neighbourhood of said space(time) point.
This by itself already assumes that we have postulated ontological physical quantities, and that they are fields over space(time). Indeed, it doesn't make sense to talk about locality about physical quantities who are not attached to a point in space(time). It also assumes that we have given the full list of ontological (observable or non-observable) quantities.
In practice, this comes down to requiring that the time evolution of all ontological physical quantities is given by a set of partial differential equations.
Relativity requires on top of that, "an upper limit of propagation speed", which comes down requiring that the Green's functions of the partial differential equations vanish outside of the light cone.
This is locality for deterministic theories.
Things become a bit more difficult for stochastic theories. In a stochastic theory, physical quantities are not determined uniquely by the "current state", only their *probabilities* are determined by the "current ontological state". The thing is that probabilities are not physical quantities, because they depend on the conditions one imposes. Well, here one requires the following for locality. We still assume that there are ontological physical quantities associated to each point in space(time).
The conditional probability for an ontological physical quantity at point P to evolve into one or another value, given all the values of the ontological physical quantities within a neighbourhood of point P, remains unchanged when one adds extra conditions concerning the physical values of remote, or past, events.
If that's the case, then the stochastic theory is said to be local.
Let us understand this definition. Assume that we are at point P, at instant t0, and we look at a physical ontological quantity X. At t0+dt, X can take on certain values. Now, if we don't know anything about the physical situation, then we can say for instance that these potential values of X are distributed according to a certain distribution (say, uniform). One would think that "the more we know", the more "refined" our probabilities for X at P and at t0 + dt will be. For instance, the probability to have X0, knowing that at t and P, we had another physical quantity Y = Y0, will be different than if we didn't know Y to be equal to Y0. And if we know about Z = Z0 at P and t0, then that changes again our probabilities for X at t0 + dt. And if we know about Z = Z1 at another point, Q, then this still changes our probability of X at t0+dt.
But IF WE TAKE INTO ACCOUNT all the ontological physical quantities in a neighbourhood of P, at time t0, which we call collectively ALL0, then we find a certain probability P(X0|ALL0) to have X = X0 at t0 + dt, and this is "all the useful information we need and that will tell us something about X0". So if now we ADD another condition:
P(X0 | ALL0 AND STUFF) and "STUFF" is a condition on an ontological physical variable somewhere else, or in the past, then:
P(X0 | ALL0) = P(X0 | ALL0 AND STUFF)
In other words, knowing something extra won't change anything to the probability distribution of X anymore. The neighbourhood of P, and all ontological physical variables, specified everything there was to know.
If that's the case, we call our stochastic theory "local". Notice - and that is very important - that if our stochastic theory is actually a deterministic theory, then both definitions of locality coincide. The only difference is that the probability values will be 1 or 0.
Bell needs this definition to be able to write that P(A,lambda) is not dependent on B (the choice at Bob's). But note the "lambda": it stands for "all the ontological physical variables that are present at Alice". Lambda contains actually a bit more (the part sent to Bob), but we know that ONCE we have the "local" part, that normally, P won't change anymore.
So it is in writing P(A,lambda) (choice at Alice: local quantity, and variables dependent on the incoming particle, whatever they are), and not P(A,B,lambda), Bell uses locality of a stochastic theory to find the probability of having "up" with choice A.
We can write a similar thing Q(B,lambda) at Bob: the probability for Bob to find "up".
The second thing he needs, is that the probability to find, say, (up,up) (written: R(A,B,lambda) ) is now given by the product of the probability of "up" at Alice and the probability of "up" at Bob.
R(A,B,lambda) = P(A,lambda) x P(B,lambda).
HERE, we use the assumption of stochastic independence of our FULLY DETERMINED probabilities. Note that we don't write: R(A,B) = P(A) x P(B). No, we use lambda: for a given (unknowable in practice, but assumed to be given in theory) fully determined ontological state. This is the assumption of no superdeterminism.
Point is: with R(A,B,lambda), we can't do anything because we don't know lambda. So we will have to weight over lambda.
We use again locality in assuming that there is a P(lambda), a certain probability distribution of the ontological physical quantities sent out by the source, which doesn't depend on the choices A and B.
And we use again no superdeterminism when we apply:
integral over lambda of R(A,B,lambda) x P(lambda) to obtain the probability to have "up,up" without any lambda condition.