PeterDonis said:
I think this is a misuse of the term "superposition". That term never means that "the system is either in this state or that state, with such-and-such probability". That is a "mixture".
The question is whether a superposition (defined as I just have) of macroscopically distinguishable states is even possible. The MWI says it is; a collapse interpretation says it isn't (collapse always removes all but one term in the superposition before that happens).
I couldn't agree more!
First of all you have to tell superposition of which vectors. Usually one takes an observable and decomposes the pure state, in terms of a normalized vector
$$|\Psi \rangle=\sum_a \Psi_a |a \rangle,$$
where ##|a \rangle## is a complete orthonormalized set of eigenvectors of the representing self-adjoint operator of the quantity measured. The state is then given by the statistical operator
$$\hat{\rho}=|\Psi \rangle \langle \Psi|.$$
Then it's of course wrong to say "the system is in a state where the observable ##A## takes all the possible values ##a## at the same time" (to make it clear again this sentence is WRONG, no matter how often it is repeated even in real textbooks, not only in popular writings!). The right thing to say is that for the quantum system prepared in this state the probability to find the value ##a## when you measure the observable ##A## is given by ##P_a=|\Psi_a|^2=\langle a|\hat{\rho}|a \rangle## (here for simplicity I assume the case that ##\hat{A}## is non-degenerate, i.e., I assume that all eigenspaces are one-dimensional).
Consequently this implies that the system has a determined value of the observable ##A## being ##a## if and only if ##P_a=1## and ##P_{a'}=0## for all ##a' \neq a##. This implies that ##|\Psi \rangle=|a \rangle## and the state is ##\hat{\rho}=|a \rangle \langle a|##.
If this is not the case, the observable ##A##'s value is indetermined, and the probability to find any of the possible values ##a## is ##P_a##. There's no other meaning (within the minimal statistical interpretation) than this, and as far as I know it's the meaning which is testable in the lab on doing measurements of ##A## on ensembles of equally prepared systems, using the usual statistical analysis to test probabilistic predictions.
Also the final statement is completely correct. If you say it's in any case in a state where ##A## has a determined value, but it's not known which value but you know there are probabilities ##P_a## for each value, then the correct association of a state, if no other information is given, is
$$\hat{\rho}'=\sum_a P_a |a \rangle \langle a|.$$
The thought-experimental realization is that Alice prepares an ensemble of systems providing Bob with the corresponding single systems. This means that Alice prepares each single member of the ensemble in a state described by the statistical operator ##\hat{P}_a =|a \rangle \langle a|## and she sends a fraction ##P_a## of single systems for each determined value ##A## to Bob. Of course ##\hat{\rho}' \neq \hat{\rho}=|\Psi \rangle \langle \Psi|##. Although Bob cannot distinguish the two states by just measuring ##A##, the states are different, and you can in principle find the difference by more fancy observations (see the excellent chapter on "state preparation and determination" in Ballentine's textbook).