He didn't talk about it in terms of decoherence, but the mathematics of how mixed states arise from pure states by considering subsystems was described in Everett's original paper on Many Worlds (which is not the name that he used---that was Bryce Dewitt).
Suppose that you have a composite system described by a wave function |\Psi\rangle = \sum_{\alpha, j} C_{\alpha, j} |\alpha\rangle |j\rangle, where |\alpha\rangle is a complete set of states for the first subsystem, and |j\rangle is a complete set of states for the second subsystem. You can think of |\alpha \rangle as describing the system of interest--maybe an electron--while |j\rangle describes everything else in the universe. Now K be some operator that only affects the first component. That means that its affect on the composite state |\alpha\rangle|j\rangle is this:
K |\alpha\rangle |j \rangle = \sum_{\alpha'} K_{\alpha' \alpha} |\alpha\rangle |j\rangle
Since operators correspond to observables, K represents an observable of the first subsystem alone. Now, let's compute the expectation value of K in the composite state |\Psi\rangle:
\langle \Psi|K|\Psi \rangle = \sum_{\alpha, \alpha', j, j'} C^*_{\alpha', j'} C_{\alpha, j} \langle \alpha' | \langle j' | K | j \rangle |\alpha\rangle
= \sum_{\alpha, \alpha', j, j'} C^*_{\alpha', j'} C_{\alpha, j} K_{\alpha' \alpha} \delta_{j j'}
= \sum_{\alpha, \alpha', j} C^*_{\alpha', j} C_{\alpha, j} K_{\alpha' \alpha}
Now, if we define \rho_{\alpha \alpha'} to be: \sum_j C^*_{\alpha', j} C_{\alpha, j}, then we have:
\langle \Psi|K|\Psi\rangle = \sum_{\alpha, \alpha'} \rho_{\alpha \alpha'} K_{\alpha' \alpha} \equiv Tr(\rho K)
So for measurements only involving the first subsystem, the density matrix \rho is all that we need for computing expectation values. It's a mixed state, in general.