Yes, I think your suspicion is right. But it's not just that the proof is hard. If that had been the only problem, the books could at least have stated the theorem without proof, and then proceeded to derive Kochen-Specker as a corollary. But to understand the statement of the theorem, you need to understand terms like "probability measure" and "lattice", and understand why the set of closed subspaces forms a lattice. Physics students don't know these things.
That thesis looks interesting. I might read it later. I have only studied the statement of the theorem, not its proof. I was a bit inspired by your post last night and decided to take a quick look at the POVM-based proof.
http://arxiv.org/abs/quant-ph/9909073. (Another member at the forum mentioned it to me a few weeks ago). It's really short, but I didn't try to understand it right away, because I didn't understand what he meant by "effects". I got the impression that the book he coauthored, "Operational quantum physics", would be a good place to read about those things, so I may have to read a chapter or so of that book first. (I know almost nothing about POVMs. I recently started a thread where I asked for suggestions about what I should read to understand them, and got a lot of suggestions, but I haven't studied any of them yet).
Have you studied this proof? Since I have only had a quick look at it, I obviously don't understand it, but one thing bothers me about it. Gleason's theorem finds all the probability measures on the lattice of closed subspaces. The first thing Busch does is to make the domain larger, so apparently we're now trying to find all probability measures on a larger lattice (it's still a lattice, right?), the lattice of all "effects". So do we really end up with Gleason's theorem, or is this just a
similar theorem?