entropy1 said:
I reached chapter 8 after going for a second peruse of the book until that chapter. I still can't picture what is mixed state really is, and what it has to do with decoherence, an before going on the wrong track, I thought I might need some indications of which direction to go in what I picture in my mind.
I don't want to go into the discussions about interpretion again. I consider them usually pretty useless, but I hope I can clarify what a "mixed state" describes from the minimal-interpertation point of view, which is, in my opinion, the only part of "interpretation" that is necessary to relate the QT formalism to observations of and doing experiments with real-world systems.
A pure state is represented by a vector ##|\Psi \rangle## in Hilbert space (more accurately, it's the corresponding ray in Hilbert space, but that's not important in this context). An observable ##A## is represented by a self-adjoint operator ##\hat{A}## in that Hilbert space. For simplicity let's assume that it's non-degenerate, i.e., for each eigenvalue or ##\hat{A}## the corresponding eigenspace is one-dimensional. Then the probability to measure the value ##a##, which necessarily is an einvalue of ##\hat{A}##, of the observable ##A## is
$$P(a)=|\langle a|\Psi \rangle|^2.$$
The properties of a system cannot be specified more accurately than by preparing it in a such described pure state. However, there is no other physical meaning of state kets than this probabilistic meaning, and thus the pure state in QT refers to ensembles of equally prepared systems to be in that state.
Now we assume that the system is not prepared in a pure state, but it's known that with probability ##p_j## it's described by the state ##|\Psi_j \rangle##. Tnis implies ##p_j \in [0,1]## and \sum_j p_j=1. This defines a "mixed state". Now the probability to find the value ##a## when measuring observable ##A##, provided it's prepared in state ##|\Psi_j \rangle## is given by the conditional probability
$$P(a|\Psi_j)=|\langle a|\Psi_j \rangle|^2,$$
and thus, due to Baye's theorem to find the value ##a## given the mixed state is
$$P(a)=\sum_j p_j P(a|\Psi_j)=\sum_j p_j |\langle a|\Psi_j \rangle|^2.$$
Now we rewrite this a bit to get a more crisp definition of how to describe such mixed states:
$$P(a)=\sum_j p_j \langle a|\Psi_j \rangle \langle \Psi_j|a \rangle= \langle a | \left (\sum_j p_j |\Psi_j \rangle \langle \Psi_j| \right)|a \rangle=:\langle a|\hat{\rho} a \rangle.$$
The operator
$$\hat{\rho}=\sum_j p_j |\Psi_j \rangle \langle \Psi_j|$$
is called the statistical operator of the system.
Of course, you can now also describe a pure state in terms of a statistical operator. It's just the "mixture", where the system is with probability 1 prepared in the pure state described by ##|\Psi \rangle##, and thus the pure state is represented by the statistical operator
$$\hat{\rho}_{\Psi} = |\Psi \rangle \langle \Psi|.$$