Carpe Physicum said:
Would you mind giving a little more detail on how the concept of observable gets included in the formalism, just a high level for a non-math person (though I get the basic terminology to a degree). Mucho appreciated.
Sorry - despite what you may have read not all concepts can be explained in English.
For this you need linear algebra:
http://quantum.phys.cmu.edu/CQT/chaps/cqt03.pdf
Linear Algebra is a standard course in virtually all mathematically related areas - Physics, Mathematics, Actuarial science, Finance, Economics, Econometrics, Statistics, Weather Forecasting and probably many others I forgot to mention. It really should, like calculus, be taught at HS, but due to our math phobic age educators think other things more important. But again that is the topic of another thread.
I will explain it using that.
Suppose you have an observation that can have yi (i = 1 to n) outcomes where each yi is a real number.
Pick any orthonormal basis in an n dimensional complex vector space |bi>. Then one can form an operator O = ∑ yi |bi><bi|. This encodes the possible outcomes of the observation
The yi are known as eigenvalues and the |bi> as their corresponding eigenvectors. Thus for any observation can find an operator that has eigenvalues the same as the observation.
I will not go into it because it involves a mathematical theorem (considered difficult) called Gleason's theorem. Gleason was one of those mathematicians not well known but who was in fact a quiet giant. This theorem was tough - but he was fired with the desire to solve it and he famously did. Just out of interest I will post a biography of Gleason:
https://www.ams.org/notices/200910/rtx091001236p.pdf
The theorem says just based on the definition of O I gave (and something called non-contextuality I will not go into - but its a very reasonable assumption considering that we are using vector spaces whose properties should not depend on the basis chosen) that you can calculate the average of the possible outcomes of the observation. It is Average (O) = Trace (OS) where S is something that pops out of the theorem and technically is known as a positive operater of unit trace.
By definition S is called the state of the system - but as presented here it, like probabilities is just something we assign to the thing being observed to aid in calculating that average of the observation.
This is what makes it more than just pure math - we are talking about things we observe. This is also the theories weakness. Presumably this theory can explain the macro world around us. But it is a theory about observations in that world. How can a theory that assumes such in the first place explain it? It seems hopeless, but believe it or not, without going into the details, a lot of progress has been made in doing that. Some issues remain but research is ongoing. We have various interpretations all having a different view. These may or may not be true - but all illuminate what the theory implies. I feel confident we will eventually arrive at a complete solution to this problem - we are almost there.
Thanks
Bill