Philip Koeck said:
Are you saying that the heat bath has low occupancy?
If the small system is just a part of the whole large system, then I would expect it to also have low occupancy.
There is no need to assume anything about the small system along those lines.
That would mean the Boltzmann distribution is only valid in the low occupancy case.
The Boltzmann distribution describes the environment (heat bath), not the small system.
Then it's still strange that the partition function constructed from Boltzmann factors can be used to derive the FD and BE distribution.
The reasoning doesn't distinguish between bosons or fermions or distinguishable particles. Those distinctions go into computing ##W(N,E)##, the number of states with a given energy and number of particles. For distinguishable particles, swapping two particles results in a different state,while for bosons and fermions, it doesn't.
The following argument is semiclassical, in that it uses discrete energy levels and indistinguishability, but does not use operators as in
@vanhees71.
What you do need is for the number of states for two subsystems to be multiplicative. Assume that you have one large system, the heat bath, with ##W_B(N,E)## giving the number of states for a given number of particles and total energy.
You have another system, the system of interest, with its own function ##W_1(N_1, \varepsilon)##. Then the assumption is that the number of states ##W(N,E)## for the combined system is related to these two in the following way:
##W(N,E) = \sum_\varepsilon \sum_{N_1} W_1(N_1,\varepsilon) W_B(N-N_1, E - \varepsilon)##
Now, you assume that ##W_B## is huge compared with ##W_1##, so most of the particles and most of the energy will be with the heat bath, rather than the system of interest. Under this assumption, we can approximate: ##W_B(N-N_1, E - \varepsilon) \approx W_B(N,E) e^{-\beta (\varepsilon - \mu N_1)}##, where ##\beta = \frac{\partial S_B}{\partial E}/k## and ##\beta \mu = \frac{\partial S_B}{\partial N}/k##.
Then we have:
##W(N,E) = W_B(N,E) \sum_\varepsilon \sum_{N_1} e^{- \beta(\varepsilon - \mu N_1)} W_1(N_1,\varepsilon) ##
At this point, no assumption whatsoever is being made about the nature of the small system, other than the fact that it is much smaller than the heat bath. To go further and see where the statistics come into play, let's assume that the small system contains a number of identical particles that will be approximated as non-interacting. Then the many-particle states can be obtained from the single-particle states. Assume that a single particle has a discrete set of states, with energies ##e_1, e_2, ...##. Then the multiparticle state is specified by simply giving ##n_1, n_2, ...## where ##n_j## is the number of particles in state ##j##. The indistinguishability comes into play by the fact that we don't consider two states to be different if you swap two particles, because that leaves the numbers ##n_j## unchanged. For Fermions, each ##n_j## can have value 0 or 1. For Bosons, ##n_j## can be any nonnegative integer.
Given these assumptions, we can compute ##W_1(N_1, \varepsilon)## as follows:
##W_1(N_1, \varepsilon) = \sum_{n_1, n_2, n_3, ...} \Delta(\varepsilon - n_1 e_1 - n_2 e_2 - ...) \Delta(N_1 - n_1 - n_2 - ...)##
where ##\Delta(X)## is a function that is equal to 1 if ##X=0## and is equal to 0 otherwise. (That's actually the Kronecker delta, but I don't want to get the notion mixed up with the Diract delta function). In other words, ##W_1(N_1, \varepsilon)## is just counting the number of states with the correct number of particles and total energy. In terms of this expression, we can now rewrite our general expression above:
##W(N,E) = W_B(N,E) \sum_\varepsilon \sum_{N_1} \sum_{n_1, n_2, ...} e^{- \beta(\varepsilon - \mu N_1)} \Delta(E - n_1 e_1 - n_2 e_2 - ...) \Delta(N_1 - n_1 - n_2 - ...)##
Because of the presence of the ##\Delta##s, the quantity being summed is zero unless ##\varepsilon = n_1 e_1 + n_2 e_2 + ...## and ##N_1 = n_1 + n_2 + ...##. So we can replace ##\varepsilon## and ##N_1## in the exponential by those sums without changing the value of the whole expression:
##W(N,E) = W_B(N,E) \sum_\varepsilon \sum_{N_1} \sum_{n_1, n_2, ...} e^{- \beta(n_1 e_1 + n_2 e_2 + ... - \mu n_1 - \mu n_2 - ...)} \Delta(E - n_1 e_1 - n_2 e_2 - ...) \Delta(N_1 - n_1 - n_2 - ...)##
Now, since the exponential no longer involves ##\varepsilon## or ##N_1##, we can reorder the sums (assuming everything is absolutely convergent, anyway):
##W(N,E) = W_B(N,E) \sum_{n_1, n_2, ...} e^{- \beta(n_1 e_1 + n_2 e_2 - \mu n_1 - \mu n_2 - ...)} \sum_\varepsilon \sum_{N_1} \Delta(E - n_1 e_1 - n_2 e_2 - ...) \Delta(N_1 - n_1 - n_2 - ...)##
The innermost sum is just ##\sum_\varepsilon \sum_{N_1} \Delta(E - n_1 e_1 - n_2 e_2 - ...) \Delta(N_1 - n_1 - n_2 - ...)##. The only nonzero term is the one where ##N_1 = n_1 + ...## and ##\varepsilon = n_1 e_1 + n_2 e_2 + ...##. There is only one value making this true, so the sum just yields 1. So we get a huge simplification:
##W(N,E) = W_B(N,E) \sum_{n_1, n_2, ...} e^{- \beta(n_1 e_1 + n_2 e_2 + ... - \mu n_1 - \mu n_2 - ...)} ##
We can write this in a more tidy way:
##\sum_{n_1, n_2, ...} e^{-\beta(n_1 e_1 + n_2 e_2 + ... - \mu n_1 - \mu n_2 ..)} = \Pi_j \ [ \sum_{n} e^{-\beta n (e_j - \mu)}]##
So we just get: ##W(N,E) = W_B(N,E) \Pi_j Z_j(\beta,\mu)##
where ##Z_j(\beta, \mu) = \sum_n e^{-\beta n (e_j - \mu)}##. As mentioned earlier, for Fermions, ##n = 0## or ##n=1##. So ##Z_j(\beta,\mu) = 1 + e^{-\beta(e_j - \mu)}##. For Bosons, ##n= 0, 1,2,...## so the sum gives ##Z_j(\beta,\mu) = 1 + e^{-\beta(e_j - \mu)} + e^{-2\beta (e_j - \mu)} + ... = \frac{1}{1-e^{-\beta(e_j - \mu)}}##
Taking the natural log of both sides gives:
##S(N, E) = S_B(N,E) + \sum_j log(Z_j(\beta, \mu))##
This shows an amazing, or maybe not so amazing, result, which is that if the particles are non-interacting, then each single-particle state ##j## makes its own independent contribution to the total entropy, ##log(Z_j(\beta, \mu))##.