Axiomatization of quantum mechanics and physics in general ?

  • #51
microsansfil said:
This is unknown from mathematics. A door must be open or closed.

bhobba said:
That's wrong - simple as that.

You have read somewhere that mathematics is formal logic and for some reason don't seem to understand math in practice is not done that way.

Can I suggest that both of you are right, and talking about different things? It is true that there is nowhere in the world written an axiomatization of quantum mechanics in formal language. I don't even know whether Kolmogorov's axioms for probability have been written in formal language. However, I don't think anyone doubts that if one wanted to, Kolmogorov has been precise enough that his axioms can be translated into a formal statements. Similarly, although the proof of Fermat's last theorem was certainly not formal, I don't think the experts doubt that it could be rewritten using Peano's axioms if they wanted to. If Peano's were for some strange reason not enough, I think everyone would be very surprised if they couldn't do it in ZFC.

So do we believe that there is an axiomatization of quantum mechanics that is precise enough that we believe a formalization of it exists in principle? My guess is that it should, after all it doesn't seem much more than linear algebra and Kolmogorov's axioms, both of which we do believe can be formalized if we wished. Or would others disagree? For simplicity, one could take finite dimensional quantum mechanics, and maybe Hardy's axioms for specificity - is there any doubt that Hardy's axioms can be formalized?

Of course the question above would not answer which physical operations we describe in natural language would correspond to the mathematical operations.

However, as far as I can tell, Piron was not that much interested in the formalization of quantum mechanics. He was more interested in reasonable axioms - very much as Hardy. After all, if one were just interested in formalization we can just postulate the Hilbert space and the Born rule straightaway. The point of Piron's derivation is to try to make the Hilbert space seem natural or reasonable. Similarly, the point of Gleason's is that if one considers non-contextuality natural, then the Born rule is implied.
 
Last edited:
Physics news on Phys.org
  • #52
atyy said:
Can I suggest that both of you are right, and talking about different things? It is true that there is nowhere in the world written an axiomatization of quantum mechanics in formal language. I don't even know whether Kolmogorov's axioms for probability have been written in formal language. However, I don't think anyone doubts that if one wanted to, Kolmogorov has been precise enough that his axioms can be translated into a formal statements.

There may be a communication gap here.

No one doubts that the informal language used in practice will produce results in any way different to if formal logic such as found in Russell's Principa was used.

But that is not the sense I got - I got he was claiming its all done by formal logic - which isn't true.

OK let's move on from that.

atyy said:
Yes, it seems that atomicity is one of Piron's axioms. It seems that one gets it "for free" when one uses Kolmogorov's axioms for probability.

What's going on is this.

One takes Pirons axioms, and I assume he deliberately has them in a form he can apply his famous theorem, and shows the 'atoms' of his approach map to the yes-no projection operators on a Hilbert space.

One then invokes Gleason to show the only measure that can be defined on those projection operators is via the Born Rule.

Also note Gleason only works for dimension 3 or greater - dimension 2 is an issue if you are being rigorously exact.

But to proceed from that one needs to make further assumptions and introduce concepts like independent observations, show the measure defined by Gleason obeys the Kolmogorov axioms etc.

Thanks
Bill
 
Last edited:
  • #53
bhobba said:
Also note Gleason only works for dimension 3 or greater - dimension 2 is an issue if you are being rigorously exact.

Well, we could just use Busch's theorem. Anyway, intuitively I think that Busch's theorem is just saying that there's a Naimark extension for dimension 2, and we can apply Gleason's to the Naimark extension. And the Naimark extension is just the formalization of being able to place the Heisenberg cut in more than one place:)

Ok, that's silly, since it assumes Copenagen is intuitive (I confess it is:)
 
  • #54
atyy said:
Similarly, although the proof of Fermat's last theorem was certainly not formal, I don't think the experts doubt that it could be rewritten using Peano's axioms if they wanted to. If Peano's were for some strange reason not enough, I think everyone would be very surprised if they couldn't do it in ZFC.

OK, googling suggests I was too hasty there. Here's a very interesting blog post, with interesting comments too: http://blog.computationalcomplexity.org/2014/01/fermats-last-theorem-and-large.html.
 
  • #55
bhobba said:
What's going on is this.

One takes Pirons axioms, and I assume he deliberately has them in a form he can apply his famous theorem, and shows the 'atoms' of his approach map to the yes-no projection operators on a Hilbert space.

One then invokes Gleason to show the only measure that can be defined on those projection operators is via the Born Rule.

Also note Gleason only works for dimension 3 or greater - dimension 2 is an issue if you are being rigorously exact.

But to proceed from that one needs to make further assumptions and introduce concepts like independent observations, show the measure defined by Gleason obeys the Kolmogorov axioms etc.

Thanks
Bill

OK, if I understand Piron tries to have some natural axioms from which one can get (close to) the Hilbert space. Then if one believes non-contextuality is natural, one can also get the Born Rule by Gleason's.

What is a bit opaque to me is - in Hardy's derivation, where is it that non-contextuality enters?
 
  • #56
microsansfil said:
Another interesting paper Quantum Logic as Classical Logic : http://arxiv.org/pdf/1406.3526v2.pdf

And This http://indigo.uic.edu/bitstream/handle/10027/10195/DeJonghe_Richard.pdf?sequence=2which is the inverse : Rebuilding Mathematics on a Quantum Logical Foundation :smile:

All that stuff is well known.

It simply completes a line of reasoning started by Von-Neumann that dates back to the early days of QM.

Its the basis of the Geometric approach, the reference I have being Varadarajan that I have been studying on and off for a while now.

Formally its what is called a logic - which is what Piron's axioms define. One can define observables, states, even probability measures on those states, all sorts of things in a logic - see Chapter 3 of Varadarajan.

The key idea is to show the particular logic is equivalent to the projection operators on a Hilbert space - and by choosing the axioms of your QM logic carefully one can invoke Pirons Theorem or similar to prove that equivalence.

Then one invokes Gleason to derive the Born rule for those probability measures.

All very beautiful and mathematically satisfying.

BUT - and here is the clanger - one must introduce other axioms to arrive at QM eg something to address the dimension limitation of Gleason. Another is for filtering type observation to show the resultant state is an eigenvector of the observable you need to assume continuity. There are undoubtedly others as well.

This approach is extremely beautiful and alluring - which is why I in fits and starts keep studying it - but falls short of the aim of fully axiomatising QM.

Thanks
Bill
 
  • #57
What!? http://www.mth.kcl.ac.uk/~streater/piron.html
 
Last edited by a moderator:
  • #58
atyy said:
What is a bit opaque to me is - in Hardy's derivation, where is it that non-contextuality enters?

Well spotted.

He takes probabilities as his fundamental thing and shows via his axioms, QM, as a probability model, is what results.

However we come to the nasty little issue of applying it. Are the observables defined in that probability model the only things in the theory? That's where non-contextuality comes into it - hidden variables can be contextual.

Its the same with Piron of course. But that isn't my main concern - one defines observables etc and assumes, just like if you apply Hardy, you run into the issue of exactly how good a model it is.

Personally I give both Hardy and Piron a pass on that. The treatment of both define the theory pretty clearly - its simply how good a model is it.

Again well spotted - it never even occurred to me.

Thanks
Bill
 
  • #59
bhobba said:
Personally I give both Hardy and Piron a pass on that. The treatment of both define the theory pretty clearly - its simply how good a model is it.

But maybe they don't need a pass? Piron only tried to derive the Hilbert space, not the Born rule, so he doesn't obviously need contextuality. And yes, to get from Piron to the Born rule via Gleason, we understand nowadays that we need non-contextuality.

But does Hardy fail to mention the assumption of non-contextuality? Or is it in there, and just in a more natural or "reasonable" way, as he intends?
 
  • #60
atyy said:
OK, if I understand Piron tries to have some natural axioms from which one can get (close to) the Hilbert space.

By careful choice of the axioms you get exactly a Hilbert space - but some bits are not as 'natural' as one would like.

Soler's Theorem is a bit more of an advance in the natural department:
http://golem.ph.utexas.edu/category/2010/12/solers_theorem.html
http://arxiv.org/pdf/math/9504224v1.pdf
http://arxiv.org/pdf/quant-ph/0105107v1.pdf

But, while so tantalisingly close, still isn't quite there yet.

John Baez discussed it in some of his finds articles - if I remember correctly that is. You can almost hear him weep - if only - it would be just so beautiful if it was. I think its the natural reaction of those with a mathematical bent to this stuff (and of course I am one).

Thanks
Bill
 
  • #61
atyy said:
What!? http://www.mth.kcl.ac.uk/~streater/piron.html

Amusing.

I have been refreshing my memory on this stuff and came across:
http://arxiv.org/pdf/0811.2516.pdf

Added Later:
Whoops - posted the wrong paper - now fixed

It seems I was remiss in assuming Pirons axioms led to the Hilbert space formalism - there are 5 - not three - and they do not rule out quaternion Hilbert spaces.

'Starting from the set L of all operational propositions of a physical entity and introducing five axioms on L he proved that L is isomorphic to the set of closed subspaces L(V ) of a generalized Hilbert space V whenever these five axioms are satisfied [6]'

[6] Piron, C. (1964), Axiomatique quantique

Which is of course the paper Patrick has posted in French.

One must go to the theorem of Soler to do that and evoke a sixth plane transitivity axiom.

But that is neither here nor there really - Piron ESSENTIALLY does it.

Its just that 'essentially' isn't quite the same as true in formal logic.

Thanks
Bill
 
Last edited by a moderator:
  • #62
bhobba said:
By careful choice of the axioms you get exactly a Hilbert space - but some bits are not as 'natural' as one would like.

Soler's Theorem is a bit more of an advance in the natural department:
http://golem.ph.utexas.edu/category/2010/12/solers_theorem.html
http://arxiv.org/pdf/math/9504224v1.pdf
http://arxiv.org/pdf/quant-ph/0105107v1.pdf

But, while so tantalisingly close, still isn't quite there yet.

John Baez discussed it in some of his finds articles - if I remember correctly that is. You can almost hear him weep - if only - it would be just so beautiful if it was. I think its the natural reaction of those with a mathematical bent to this stuff (and of course I am one).

So the Piron-Soler sort of reasoning leads to infinite dimensional Hilbert spaces?

OTOH, the Hardy and Chirinell et al approaches lead to finite dimensional Hilbert spaces?
 
  • #63
atyy said:
So the Piron-Soler sort of reasoning leads to infinite dimensional Hilbert spaces?

Yes - and of course finite as well.

atyy said:
OTOH, the Hardy and Chirinell et al approaches lead to finite dimensional Hilbert spaces?

Yes - but for me that's not a worry - I simply generalise via Rigged Hilbert Spaces.

Thanks
Bill
 
Last edited:
  • #64
atyy said:
Well, we could just use Busch's theorem.

Ahhhh. But do the axioms of Piron map to a POVM. His theorem shows they map to projection operators, or equivalently subspaces (which is the same thing) but POVM's are another matter.

Thanks
Bill
 
  • #65
atyy said:
OK, googling suggests I was too hasty there. Here's a very interesting blog post, with interesting comments too: http://blog.computationalcomplexity.org/2014/01/fermats-last-theorem-and-large.html.

Interesting.

On the surface it doesn't seem to contradict what you said. Nor do I reasonably expect it to - Russell tried it with the Principa - I don't think anyone wants to repeat that tome.

I have to go and get some lunch will put on my thinking hat about it when I return.

Thanks
Bill
 
Last edited:
  • #66
bhobba said:
Ahhhh. But do the axioms of Piron map to a POVM. His theorem shows they map to projection operators, or equivalently subspaces (which is the same thing) but POVM's are another matter.

Hmmm, how about doing Piron + Gleason's in 3d, then defining 2d QM as resulting from projective measurements in 3d or higher? Basically this is adding an axiom that says 2d QM is defined by having a Naimark extension, ie. we need at least 3d, so Gleason's will apply.

Physically, I think this is saying we can move the Heisenberg cut outwards.
 
  • #67
atyy said:
Can I suggest that both of you are right, and talking about different things?
Perhaps, however it is not what there are more interesting.

I did not know there was so much work on the topic quantum logic. Express the foundations of quantum mechanics in the language of the logic of mathematics.

Here "A New Approach to Quantum Logic".

The message of the book is of interest to a broad audience consisting of logicians, mathematicians, philosophers of science, researchers in Artifficial Intelligence and last but not least physicists. These communities, however, strongly differ in their scientific backgrounds. Normally, a physicist has no training in mathematical logic, and a logician is by no means expected to master the Hilbert space formalism of quantum mechanics.
This fact constitutes a major problem in any attempt to present the topic of quantum logic in a way accessible to the broad audience to which, in principle, it is of interest.

This seem open a new perspective to quantum mechanics or then be a deadlocked.

What about relativistic quantum mechanics logic ?

Patrick
 
Last edited:
  • #68
microsansfil said:
What about relativistic quantum mechanics logic ?

Axiomatic QFT (which is relativistic QM) is a whole new ball game.

It's mathematically way above my current level with tomes of VERY deep mathematics supporting it.

Its not based on Hilbert Spaces like standard QM, but draws heavily on Rigged Hilbert Spaces and distribution theory:
http://www.mathematik.uni-muenchen.de/~schotten/LNP-cft-pdf/08_978-3-540-68625-5_Ch08_23-08-08.pdf

The standard reference is Gelfand and Vilenkin - Generalized Functions. I have studied it and even with my math background its - how to put it - challenging - meaning very non trivial.

BTW its the correct formalism for QM as well - but in axiomatic QFT its unavoidable. And that's just to start with - QFT scales rather 'inspiring heights'.

Thanks
Bill
 
Last edited:
  • #69
atyy said:
Physically, I think this is saying we can move the Heisenberg cut outwards.

One could use Neumark's theorem to show in lower dimensions resolutions of the identity looks like POVM's and its very reasonable to assume probabilities etc are not altered, but reasonable, and formally provable are two different things.

Thanks
Bill
 
  • #70
microsansfil said:
This seem open a new perspective to quantum mechanics or then be a deadlocked.

Its well known - the reference by Varadarajan details it pretty well.

It is, as far as foundations is concerned, as I have said previously, our most penetrating formalism.

Pirons axioms, and even better with Solers theorem, ESSENTIALLY implies the QM formalism.

The issue is in that word - essentially - eg you need extra assumptions of a seemingly ad-hoc variety to rigorously make it work.

But above all its - HARD.

Thanks
Bill
 
  • #71
microsansfil said:
What about relativistic quantum mechanics logic ?
The difference between non-relativistic QM and special relativistic QM is just a choice between the Galilean group and the Poincaré group. You postulate that there's a homomorphism from one of these groups into the group of automorphisms of the lattice. I you choose the former group, the result is non-relativistic QM. If you choose the latter, the result is special relativistic QM.

bhobba said:
Axiomatic QFT (which is relativistic QM) is a whole new ball game.
I view relativistic quantum field theories as theories defined within the framework of special relativistic QM (as defined above). I believe that there are also non-relativistic QFTs, but I have never studied one.

The axioms of (axiomatic) QFT are supposed to define what a quantum field theory is, so yes, it's definitely a very different game.
 
  • #72
Fredrik said:
I view relativistic quantum field theories as theories defined within the framework of special relativistic QM (as defined above). I believe that there are also non-relativistic QFTs, but I have never studied one.

I have off and on been studying QFT over the years from various books such as Zee and others.

Mathematically - yea - I got it. But physically it didn't gel.

Then I came across thew following book recently released:
https://www.amazon.com/dp/019969933X/?tag=pfamazon01-20

The Kindle price was pretty good so I took a punt.

Really good at explaining what it means.

Its not a well known fact, bu5t still true, that QM can be reformulated as a the3ory of creation and anhilation operators:
http://math.bu.edu/people/mak/Styer Am J Phys 2002.pdf

See interpretation F.

What that book does is explain that view step by step then shows how it applies to QFT so you immediately know what the formalism means.

Thanks
Bill
 
Last edited by a moderator:
  • #73
atyy said:
But does Hardy fail to mention the assumption of non-contextuality? Or is it in there, and just in a more natural or "reasonable" way, as he intends?
What does it even mean for an inherently probabilistic theory to be (non-)contextual?
 
  • #74
kith said:
What does it even mean for an inherently probabilistic theory to be (non-)contextual?

My understanding about non-contextuality in the context of Gleason's theorem comes from the comments by Peres in http://books.google.com/books?id=IjCNzbJYONIC&dq=peres+quantum&source=gbs_navlinks_s.

If understand Peres correctly, he says Gleason's theorem assumes that if Pu and Pv are orthogonal projectors, then <Pu + Pv> = <Pu> + <Pv>. However, there isn't a unique way to write P = Pu + Pv = Px + Py, so the assumption is that <Pu> + <Pv> = <Px> + <Py>. The assumption is non-trivial since measurements of Pu and Pv usually require different experimental setups from those that measure Px and Py.

Hardy doesn't even assume Hilbert spaces or anything, but he does derive the Born rule. Most people believe his derivation is correct and complete, so it's most likely that he has not left out an assumption such as non-contextuality. Rather, he has other axioms which do the work, and I'm wondering which of his axioms do that, and whether one can understand them as equivalent to non-contextuality in Gleason's.
 
  • #75
Fredrik said:
The difference between non-relativistic QM and special relativistic QM is just a choice between the Galilean group and the Poincaré group. You postulate that there's a homomorphism from one of these groups into the group of automorphisms of the lattice. I you choose the former group, the result is non-relativistic QM. If you choose the latter, the result is special relativistic QM.
"A principle of modern mathematics holds in this lesson: when you are dealing with an entity S with a measure of structure, try to determine its group of automorphisms, the group of transformations of its components that preserve the structural relations. You can expect to gain a deep understanding of the constitution of S in this way. "Hermann Wey

http://en.wikipedia.org/wiki/Representation_theory_of_the_Poincaré_group

Does this mathematical logic view could be useful in the search for unification of General relativity and Quantum mechanics ?

Patrick
 
  • #76
microsansfil said:
"A principle of modern mathematics holds in this lesson: when you are dealing with an entity S with a measure of structure, try to determine its group of automorphisms, the group of transformations of its components that preserve the structural relations. You can expect to gain a deep understanding of the constitution of S in this way. "Hermann Wey

http://en.wikipedia.org/wiki/Representation_theory_of_the_Poincaré_group

Does this mathematical logic view could be useful in the search for unification of General relativity and Quantum mechanics ?
I know almost nothing about that. I think that loop quantum gravity is an attempt to develop "general relativistic quantum mechanics" in a way that's similar to what I was talking about, but I don't really know.
 
  • #77
atyy said:
If understand Peres correctly, he says Gleason's theorem assumes that if Pu and Pv are orthogonal projectors, then <Pu + Pv> = <Pu> + <Pv>. However, there isn't a unique way to write P = Pu + Pv = Px + Py, so the assumption is that <Pu> + <Pv> = <Px> + <Py>. The assumption is non-trivial since measurements of Pu and Pv usually require different experimental setups from those that measure Px and Py.
So the assumption is called non-contextuality because for a single observable Pu+Pv, I can use all kinds of different experimental setups -where different setups correspond to different bases of the eigenspace of Pu+Pv- to measure it's expectation value?

I don't have access to Peres at the moment and an immediate follow-up question is how does this relate to Bohmian mechanics and it's contextuality? By what mathematical elements are observables even represented in Bohmian mechanics?

atyy said:
Hardy doesn't even assume Hilbert spaces or anything, but he does derive the Born rule. Most people believe his derivation is correct and complete, so it's most likely that he has not left out an assumption such as non-contextuality. Rather, he has other axioms which do the work, and I'm wondering which of his axioms do that, and whether one can understand them as equivalent to non-contextuality in Gleason's.
I haven't seen his derivation but since the first four axioms also apply to classical probability theory, it certainly has to do with the fifth. I would also be interested in seeing how the Hilbert space formalism and Hardy's formulation are related exactly.
 
  • #78
kith said:
So the assumption is called non-contextuality because for a single observable Pu+Pv, I can use all kinds of different experimental setups -where different setups correspond to different bases of the eigenspace of Pu+Pv- to measure it's expectation value?

I don't have access to Peres at the moment and an immediate follow-up question is how does this relate to Bohmian mechanics and it's contextuality? By what mathematical elements are observables even represented in Bohmian mechanics?

I don't understand this issue very well. As I understand it, the non-contextuality in Gleason's theorem seems to have nothing to do with hidden variables, since it is just about measures on states in Hilbert space.

However, by some corollary of Gleason's, there is apparently a link between the non-contextuality there and in hidden variable theories. It is discussed by in this link given by bhobba to an article by Bell http://fy.chalmers.se/~delsing/QI/Bell-RMP-66.pdf, and also in this proof by Busch of a Gleason-like theorem, but pertaining to POVMs instead of POMs http://arxiv.org/abs/quant-ph/9909073. Maybe bhobba or Fredrik can explain in more detail here?

kith said:
I haven't seen his derivation but since the first four axioms also apply to classical probability theory, it certainly has to do with the fifth. I would also be interested in seeing how the Hilbert space formalism and Hardy's formulation are related exactly.

Here's Hardy's first derivation http://arxiv.org/abs/quant-ph/0101012. The fifth axiom just says that there's a continuous reversible transformation between pure states. Isn't that met by classical mechanics in phase space?
 
  • #79
Just a quick note:
atyy said:
The fifth axiom just says that there's a continuous reversible transformation between pure states. Isn't that met by classical mechanics in phase space?
Hardy's states are probability vectors (p1,...,pn). For pure states, all but one of the pi are zero so the pure states correspond to the elements of the n-dimensional standard basis and there's no continuous transformation between them.

However, I am not sure if Hardy's formulation of QM works for infinite dimensional systems like a particle in space.
 
Last edited:
  • Like
Likes 1 person
  • #80
kith said:
Just a quick note:

Hardy's states are probability vectors (p1,...,pn). For pure states, all but one of the pi are zero so the pure states correspond to the elements of the n-dimensional standard basis and there's no continuous transformation between them.

However, I am not sure if Hardy's formulation of QM works for infinite dimensional systems like a particle in space.

Perhaps that is why Hardy's derivation doesn't work for continuous variables. For discrete variables, it is obvious that classically, a particle is either in one box or the other, whereas in the quantum case, it can be in a superposition of being in both boxes. As I understand it, the Chiribella et al derivation is also only of finite dimensional quantum mechanics.

On the other hand the Mackey-Piron-Soler approach, according to bhobba, gets finite and infinite dimensional quantum mechanics. The other "reformulation" I know that can get infinite dimensional QM is the Leifer-Spekkens http://arxiv.org/abs/1107.5849. That paper only deals with the finite dimensional case, but in the comments here he says the extension to the infinite dimensional should be straightforward http://mattleifer.info/2011/08/01/the-choi-jamiolkowski-isomorphism-youre-doing-it-wrong/.
 
  • #81
kith said:
What does it even mean for an inherently probabilistic theory to be (non-)contextual?

Non contextuality means the probability measure does not depend on what resolution of the identity a projection operator is part of. Its a very natural condition to impose mathematically because its simply expressing basis independence - you would not expect a measure to depend of your basis - after all that is a pretty basic property of vector spaces - the important geometric stuff like length or angle doesn't depend on basis. But physically it has profound implications because a resolution of the identity corresponds to an actual measurement apparatus.

Its the key ingredient in Gleason's proof - there are others such as the strong superposition principle - but that is the key one.

In the geometric approach to QM you start with a logistic then show its observables (one can define things like observables, states, even probability measures on observables, etc in a logistic - see Chapter 3 of Varadarajan) are isomorphic to a Hilbert space. One then uses Gleason to show that probability measure is the Born rule.

The issue though is when you model something using it. Are all the things that determine how the system behaves observables? There may be hidden variables and they may be contextual. In BM for example the pilot wave is explicitly contextual:
http://philsci-archive.pitt.edu/3026/1/bohm.pdf

Don't necessarily agree with that link saying BM is not using hidden variables - but non contextual ones. The pilot wave is hidden - you can't ever directly observe it.

But of course Dymystifyer is the expert on BM around here - not me.

Thanks
Bill
 
Last edited:
  • Like
Likes 1 person
  • #82
atyy said:
As I understand it, the non-contextuality in Gleason's theorem seems to have nothing to do with hidden variables, since it is just about measures on states in Hilbert space.

You are correct - it doesn't SEEM to. But if there are contextual variables, hidden or otherwise, then Gleason's breaks down.

Gleason is basically a stronger version of Kochen-Speker - in fact Kochen-Speker is a simple corollary to Gleason.

Thanks
Bill
 
  • #83
kith said:
However, I am not sure if Hardy's formulation of QM works for infinite dimensional systems like a particle in space.

Personally in discussing foundational issues I stick to finite vector spaces.

I view the infinite dimensional case via the Rigged Hilbert Space formalism where the dual is simply introduced for mathematical convenience. Here the dual I am referring to is the dual to the space of all sequences of finite length. In the weak convergence of that space any linear functional is the limit of a sequence of the space the functionals are defined on.

Thanks
Bill
 
  • #84
atyy said:
I don't understand this issue very well. As I understand it, the non-contextuality in Gleason's theorem seems to have nothing to do with hidden variables, since it is just about measures on states in Hilbert space.
Well, let's look at KS first. I am not sure if I am on the right track but my current understanding is this (following Peres).

KS says that if we have a Hilbert space of dimension N ≥ 3, we cannot find a function v which consistently assigns a probability of 0 and 1 to all projectors acting on this Hilbert space. Namely, if we combine a projector with N-1 other commuting projectors with \sum_i P_i=1 we cannot have \sum_i v(P_i)=1 for all choices of the N-1 projectors.

Let's say we have an observable Q with possible experimental outcomes a, b and c and associated projectors P_a, P_b and P_c (these commute and sum to the identity). In order to assign a true but hidden value to our quantity we could assign a probability v of 0 or 1 to each of the projectors. Let's say that the true value of Q is b, so v(P_a)=v(P_c)=0 and v(P_b)=1.

KS now implies that there exists a similar observable R where the probabilities either don't sum to one or are inconsistent with previously assigned probabilities. The situation could be something like this: Q and R share the projector P_a but instead of P_b and P_c, R is associated with P_{\beta} and P_{\gamma}. Consistency with other observables Q&#039;, Q&#039;&#039;, ... forces us to either choose v(P_{\beta}) = v(P_{\gamma}) = 0 or v(P_a)=1. So either the sum of probabilities for R is equal to 0 or the sum for Q is equal to 2. Both options imply that v isn't a probability in the first place, so this kind of hidden variable assignment is ruled out.

The problem goes away if the probability associated with P_a depends on the context, i.e when we allows different probabilities v(P_a; P_b,P_c)=0 and v(P_a; P_{\beta}, P_{\gamma})=1 for different observables Q and R.
 
  • #85
atyy said:
Perhaps that is why Hardy's derivation doesn't work for continuous variables.
Has it really been shown that it doesn't work or did he simply restrict his discussion to the finite / countably infinite case?
 
  • #86
bhobba said:
Gleason is basically a stronger version of Kochen-Speker - in fact Kochen-Speker is a simple corollary to Gleason.
I haven't followed the logic of the proofs in detail but I tend to agree. KS says that the non-contextual probability distribution cannot look a certain way while Gleason says how exactly it has to look.
 
  • #87
kith said:
Well, let's look at KS first. I am not sure if I am on the right track but my current understanding is this (following Peres).

KS says that if we have a Hilbert space of dimension N ≥ 3, we cannot find a function v which consistently assigns a probability of 0 and 1 to all projectors acting on this Hilbert space. Namely, if we combine a projector with N-1 other commuting projectors with \sum_i P_i=1 we cannot have \sum_i v(P_i)=1 for all choices of the N-1 projectors.

Let's say we have an observable Q with possible experimental outcomes a, b and c and associated projectors P_a, P_b and P_c (these commute and sum to the identity). In order to assign a true but hidden value to our quantity we could assign a probability v of 0 or 1 to each of the projectors. Let's say that the true value of Q is b, so v(P_a)=v(P_c)=0 and v(P_b)=1.

KS now implies that there exists a similar observable R where the probabilities either don't sum to one or are inconsistent with previously assigned probabilities. The situation could be something like this: Q and R share the projector P_a but instead of P_b and P_c, R is associated with P_{\beta} and P_{\gamma}. Consistency with other observables Q&#039;, Q&#039;&#039;, ... forces us to either choose v(P_{\beta}) = v(P_{\gamma}) = 0 or v(P_a)=1. So either the sum of probabilities for R is equal to 0 or the sum for Q is equal to 2. Both options imply that v isn't a probability in the first place, so this kind of hidden variable assignment is ruled out.

The problem goes away if the probability associated with P_a depends on the context, i.e when we allows different probabilities v(P_a; P_b,P_c)=0 and v(P_a; P_{\beta}, P_{\gamma})=1 for different observables Q and R.

Thanks! I found this explanation by Spekkens which seems to match what you wrote http://arxiv.org/abs/quant-ph/0406166v3: "Traditionally, a noncontextual hidden variable model of quantum theory is one wherein the measurement outcome that occurs for a particular set of values of the hidden variables depends only on the Hermitian operator associated with the measurement and not on which Hermitian operators are measured simultaneously with it. For instance, suppose A,B and C are Hermitian operators such that A and B commute, A and C commute, but B and C do not commute. Then the assumption of noncontextuality is that the value predicted to occur in a measurement of A does not depend on whether B or C was measured simultaneously. The Bell-Kochen-Specker theorem shows that a hidden variable model of quantum theory that is noncontextual in this sense is impossible for Hilbert spaces of dimension three or greater."

Intuitively, this seems like a generalization of the idea that canonically conjugate observables like momentum and position do not have simultaneous existence, since it depends on non-commuting observables, even if only indirectly.
 
  • #88
kith said:
Has it really been shown that it doesn't work or did he simply restrict his discussion to the finite / countably infinite case?

Originally I thought that his method did not work for the infinite dimensional case, because axiom five which is the distinction between finite dimensional classical and quantum theories, clearly holds for classical continuous variables. However, according to his discussion of the issue in section 9 of http://arxiv.org/abs/quant-ph/0101012, his axioms still rule out classical continuous variables, because of axiom 3: "A system whose state is constrained to belong to an M dimensional subspace (i.e. have support on only M of a set of N possible distinguishable states) behaves like a system of dimension M."

So I guess it is unknown whether his axioms work or not for the continuous case.
 
  • Like
Likes 1 person
  • #89
atyy said:
However, according to his discussion of the issue in section 9 of http://arxiv.org/abs/quant-ph/0101012, his axioms still rule out classical continuous variables, because of axiom 3: "A system whose state is constrained to belong to an M dimensional subspace (i.e. have support on only M of a set of N possible distinguishable states) behaves like a system of dimension M."
This sounds a bit unphysical to me. Such a finite-dimensional subspace of phase space is a set of measure zero (loosely speaking a collection of delta functions). It seems strange to say that classical statistical mechanics violates this axiom because such sets of measure zero are not considered physical there anyway. So I would say that this axiom is irrelevant in classical statistical mechanics because there are no physical subspaces with a smaller dimension than the whole phase space.

Another interesting thought in section 9 is that superpositions smooth out the discontinuities of a possibly quantized space which may reduce the discomfort associated with this notion.
 
Last edited:
  • #90
kith said:
This sounds a bit unphysical to me. Such a finite-dimensional subspace of phase space is a set of measure zero (loosely speaking a collection of delta functions). It seems strange to say that classical statistical mechanics violates this axiom because such sets of measure zero are not considered physical there anyway. So I would say that this axiom is irrelevant in classical statistical mechanics because there are no physical subspaces with a smaller dimension than the whole phase space.

Another interesting thought in section 9 is that superpositions smooth out the discontinuities of a possibly quantized space which may reduce the discomfort associated with this notion.

Should there be a natural distinction between classical and quantum mechanics for continuous variables? For position and momentum, there's Bohmian mechanics which is a classical way of viewing quantum mechanics. There's also Montina's arument http://arxiv.org/abs/0711.4770 that hidden variables for a finite dimensional quantum system must be continuous, if the dynamics are Markovian.

So I tend to think of quantum mechanics as a very good effective theory, because the true underlying variables are usually much more inconvenient.
 
  • #91
Interesting discussion. Although you guys haven't mentioned Bell's theorem but think it is relevant to the issue here. Specifically to demonstrate violation of the CHSH, it is often written that

S = E(a, b) − E(a, b′) + E(a′, b) + E(a′ b′) ≤ 2

E(a,b) = -E(a, b′) = E(a′, b) = E(a′ b′) = 1/√2
∴ S = 2√2 > 2 → Violation.

However, there is an ambiguity:
Possibility 1: All 4 terms are observables on single system. This is actually the assumption used in the derivation. In this case, although the a measurement commutes with the b measurement, E(a,b) does not commute with the E(a',b) and requires quite a different experimental arrangement to measure and it won't be proper to just add the add the separate individual terms from separate systems (the von Neuman error).
Possibility 2: Each term is an observable of a different but similarly prepared system. This allows S to be the linear combination of individual results but because of different degrees of freedom, the derivation of the inequality becomes problematic.

It therefore seems non-contextuality is relevant to the issue of hidden variables, both von Neuman's approach and Bell's. To measure each term, E(a,b) for example you post select a set of particle pairs using coincidence at (a,b) settings. Then to measure E(a, b') you have two possibilities. You could post select within the first set, all those pairs for which there is also (a,b') coincidence, but this is non-trivial since the b measurement does not commute with b' measurement, but will not necessarily give you the same result as if you post select a completely different set of particle pairs with coincidence at (a,b').

Did Bell make the same mistake as von Neuman then? It looks like it.
 
  • #92
billschnieder said:
Did Bell make the same mistake as von Neuman then? It looks like it.

No, that is not correct. The Bell derivation is just classical probability, see http://arxiv.org/abs/1208.4119 (particularly Fig. 19, 25-27).
 
  • #93
atyy said:
No, that is not correct. The Bell derivation is just classical probability, see http://arxiv.org/abs/1208.4119 (particularly Fig. 19, 25-27).
I'm not talking about the derivation. I'm talking about the demonstration of QM violation of the inequality. The part where expectations are linearly combined.

For example, how do you show that QM violates Bell's inequality. It is this calculation I'm talking about.
 
  • #94
billschnieder said:
For example, how do you show that QM violates Bell's inequality. It is this calculation I'm talking about.

Its a basic calculation following from the principles of QM eg:
http://en.wikipedia.org/wiki/Bell's...re_violated_by_quantum_mechanical_predictions

Of relation to this thread where QM is developed using the geometrical approach based on quantum logic the reason is the different logic of QM.

Thanks
Bill
 
  • #95
bhobba said:
Its a basic calculation following from the principles of QM eg:
http://en.wikipedia.org/wiki/Bell's...re_violated_by_quantum_mechanical_predictions

Of relation to this thread where QM is developed using the geometrical approach based on quantum logic the reason is the different logic of QM.

Thanks
Bill

Thanks for the link. I'm talking specifically about the expression after the "so that" in the section you quoted above. Filling in the part they left out, we get something like the following:

Starting from
##E_\psi(x,z) = \langle\psi|(\sigma_L\cdot x)(\sigma_R\cdot z)|\psi\rangle = -x\cdot{z} ##

For expectation values for a single system, that expression from wikipedia becomes something like:
##S^{\psi} = |\langle\psi|(\sigma_L\cdot{a})(\sigma_R\cdot{b}) - (\sigma_L\cdot{a})(\sigma_R\cdot{b'}) + (\sigma_L\cdot{a'})(\sigma_R\cdot{b}) + (\sigma_L\cdot{a'})(\sigma_R\cdot{b'})|\psi\rangle|##
Which has no solution.
However, the reason many people think it works is because they think it is equivalent to
expectation values for 4 independent similarly prepared systems which is:
##S^{1234} = |\langle\psi_1|(\sigma_L\cdot{a})(\sigma_R\cdot{b})|\psi_1\rangle - \langle\psi_2|(\sigma_L\cdot{a})(\sigma_R\cdot{b'})|\psi_2\rangle + \langle\psi_3|(\sigma_L\cdot{a'})(\sigma_R\cdot{b})|\psi_3\rangle + \langle\psi_4|(\sigma_L\cdot{a'})(\sigma_R\cdot{b'})|\psi_4\rangle|\psi \rangle|##
 
Last edited:
  • #96
billschnieder said:
Thanks for the link. I'm talking specifically about the expression after the "so that" in the section you quoted above.

Cant follow your concern - looks like a trivial substitution to me.

Thanks
Bill
 
  • #97
bhobba said:
Cant follow your concern - looks like a trivial substitution to me.

Thanks
Bill

Which of the two expressions I gave above is the correct representation of this "trivial" substitution?
##S^{\psi} = |\langle\psi|(\sigma_L\cdot{a})(\sigma_R\cdot{b})|\psi\rangle - \langle\psi|(\sigma_L\cdot{a})(\sigma_R\cdot{b'})|\psi\rangle + \langle\psi|(\sigma_L\cdot{a'})(\sigma_R\cdot{b})|\psi\rangle + \langle\psi|(\sigma_L\cdot{a'})(\sigma_R\cdot{b'})|\psi\rangle|##

##S^{1234} = |\langle\psi_1|(\sigma_L\cdot{a})(\sigma_R\cdot{b})|\psi_1\rangle - \langle\psi_2|(\sigma_L\cdot{a})(\sigma_R\cdot{b'})|\psi_2\rangle + \langle\psi_3|(\sigma_L\cdot{a'})(\sigma_R\cdot{b})|\psi_3\rangle + \langle\psi_4|(\sigma_L\cdot{a'})(\sigma_R\cdot{b'})|\psi_4\rangle|##

Note that in ##S^\psi## the 4 spin correlation observables do not commute so their linear combination is not an observable. It is a questionable substitution.
 
  • #98
billschnieder said:
Which of the two expressions I gave above is the correct representation of this "trivial" substitution?

You mentioned after 'so that'.

Just before that we have some rather easy to show identities that are 1/root 2 or -1/root 2.

Substitute them into the equation after and you have 2 root 2 > 2.

The equations you wrote down don't bear any relation to it - at least as far as I can see.

You will have to provide a LOT more detail of exactly what you are getting at.

Thanks
Bill
 
Last edited:
  • #99
billschnieder said:
Note that in ##S^\psi## the 4 spin correlation observables do not commute so their linear combination is not an observable. It is a questionable substitution.

bhobba said:
You will have to provide a LOT more detail of exactly what you are getting at.

I don't see any problem either, and I don't really understand billschneider's concern. However, I have a guess that what billschenider is saying is that the quantum expression T = <ψ|A+B|ψ> can be interpreted in two ways. First we can treat O = A + B as a single observable, and say that T = <ψ|O|ψ> should be measured by a single apparatus that measures O. However, we can also treat A and B as separate observables and measure <ψ|A|ψ> and <ψ|B|ψ> separately, then add them up to get T. The quantum formalism says that both physically different procedures yield the same value of T.

As I understand it, the Bell inequality as a derivation based on classical probability assumes the second interpretation: each term is a measurement with a different physical setup, and we add the results up.

However, it is interesting to consider the first interpretation, from the quantum point of view. Do A and B have to commute in order for O to exist as a quantum observable that can be measured by a single physical setup? I would say no. An example is the energy of the simple harmonic oscillator E = p2 + x2.
 
  • #100
bhobba said:
Just before that we have some rather easy to show identities that are 1/root 2 or -1/root 2.

Substitute them into the equation after and you have 2 root 2 > 2.


Ok, let me start by referring to some arguments from Bell's paper which you mentioned earlier
bhobba said:
Bell said:
Consider now the proof of von Neumann that dispersion free states, and so hidden variables are impossible. His essential assumption is: Any real linear combination of any two Hermitian operators represents an observable, and the same linear combination of expectation values is the expectation value of the combination. This is true for quantum mechanical states; it is required by von Neumann of the hypothetical dispersion free states also.
...
The essential assumption can be criticized as follows. At first sight, the required additivity of expectation values seems very reasonable, and is rather the non-additivity of allowed values (eigenvalues) which requires explanation. Of course the explanation is well known: A measurement of a sum of noncommuting observables cannot be made by combining trivially the results of separate observations on the two[individual] terms -- it requires a quite distict experiment. ... But this explanation of the non-additivity of allowed values also establishes the nontriviality of the additivity of expectation values

Now looking at the "so that" expression from Wikipedia, as you said yourself, it is a trivial addition of expectation values. I'm saying according to Bell's own argument against von Neumann, the addition of expectation values in QM is non-trivial, especially for non-commuting observables. So the claim that the the "so that" expression gives you 2√2, is suspect.
 
Back
Top