ThomasT said:
Bell's theorem would have us expecting that the intensity of light transmitted via crossed polarizers should vary as a linear function of the angular difference between the polarizers.
Not at all. Bell's theorem states that a set of correlations between measurements of which the correlations are due to a common origin, must satisfy certain arithmetic relationships.
Bell's theorem doesn't state anything about photons, polarizations, optics, quantum mechanics or whatever. It simply states something about the possible correlations that can be caused by a common cause. It could have been stated even if there were never any quantum mechanics. Only, with classical physics, it would have sounded as almost trivial.
In other words, it is a property of a set of correlations, wherever they come from, if they are assumed to come from a common cause. Bell's theorem is hence something that applies to sets of correlations. It can be formulated more generally, but in its simplest form, applied to a specific set of correlations, it goes like this:
Suppose that we can ask 6 questions about something. The questions are called A, B, C, D, E and F. To each question, one can have an answer "yes", or "no".
As I said, Bell's theorem is more general than this, but this can be a specific application of it.
We assume that we have a series of such objets, and we are going to consider that to each of the potential questions, each object has a potential answer.
Now, consider that we pick one question from the set {A,B,C}, and one question from the set {D,E,F}. That means that we have two questions, and hence two answers.
There are 9 possible sets of questions:
(A,D)
(A,E)
(A,F)
(B,D)
(B,E)
(B,F)
(C,D)
(C,E)
(C,F)
Let us call a generic set of questions: (X,Y) (here, X stands for A, B or C, and Y stands for D,E or F).
We are going to look at the statistics of the answers to the question (X,Y) for our series of objets, and we want to determine:
the average fraction of time that we have "yes,yes",
the average fraction of time that we have "yes,no"
the average fraction of time that we have "no,yes"
the average fraction of time that we have "no,no"
We take it that that average is the "population average".
In fact, we even look at a simpler statistic: the average fraction of time that we have the SAME answer (yesyes, or nono). We call this: the correlation of (X,Y) of the population.
We write it as: C(X,Y). If C(X,Y) = 1, that means that the answers to the questions X and Y are ALL of the kind: yesyes, or nono. Never we find a yesno or a noyes.
If C(X,Y) = 0, then it's the opposite: we never find yesyes, or nono.
If C(X,Y) = 0.5, that means that we have as many "equal" as "unequal" answers.
We have 9 possible combinations (X,Y), and hence we have 9 different values C(X,Y):
we have a C(A,D), we have a C(A,E), etc...
Suppose now that C(A,D) = 1, C(B,E) = 1, and C(C,F) = 1.
That means that each time we ask A, and we have yes, then if we measure D, we also have yes, and each time we ask A and we have no, then if we ask D, we have no.
So the answer to the question A is the same as the answer to the question D.
Same for B and E, and same for C and F.
So in a way, you can say that D is the same question as A, and E is the same question as B.
We can hence consider the six remaining combinations, and rewrite A for D etc...
C(A,B)
C(A,C)
C(B,A)
C(B,C)
C(C,A)
C(C,B)
We also suppose a symmetrical situation: C(A,B) = C(B,A).
We now have 3 numbers left: C(A,B), C(B,C) and C(A,C).
Well, Bell's theorem asserts: C(A,B) + C(B,C) + C(A,C) >= 1.
Another way of stating this is that if you look at it as:
C(A,B) + C(B,C) > = 1 - C(A,C)
It means that you cannot find a property B which is sufficiently anti-correlated with as well A as C, without A and C being somehow correlated. In other words, there's a lower limit to which 3 properties can be anti-correlated amongst themselves. It's almost trivial if you think about it: if A is the opposite of B, and B is the opposite of C, then A cannot be the opposite of C.
Let's indeed take A is not B, and B is not C: then C(A,B) = 0, and C(B,C) = 0. We then find that 0 + 0 must be larger than 1 - C(A,C), hence C(A,C) = 1. A must be the same as C.
Let's take A and B uncorrelated, and B and C uncorrelated. That means that C(A,B) = 0.5 and C(B,C) = 0.5. In this case, there's no requirement on C(A,C) which can go from 0 to 1.
So this is a "trivial" property of the correlations of the answers to the questions A, B and C one can ask about something.
Well, it is this which is violated in quantum mechanics.
C(X,Y) is given by cos(theta_x - theta_y) ^2, and so we have:
cos^2(th_xy) + cos^2(th_yz) + cos^2(th_xy + th_yz) is not always > 1.
Indeed, take angles 1 rad, 1 rad and 2 rad:
The sum is 0.757...