Reif's Derivation of Boltzman Probability Distribution in Thermodynamics

  • Thread starter Thread starter quasar987
  • Start date Start date
  • Tags Tags
    Thermodynamics
Click For Summary

Homework Help Overview

The discussion revolves around the derivation of the Boltzmann probability distribution as presented by Reif in thermodynamics. The original poster expresses skepticism regarding the mathematical manipulations involved, particularly in the context of an isolated system composed of two subsystems in thermal contact, where one subsystem has significantly more degrees of freedom than the other. The poster raises concerns about the assumptions made regarding energy states and the justification for using a Taylor expansion in the derivation.

Discussion Character

  • Conceptual clarification, Assumption checking, Mathematical reasoning

Approaches and Questions Raised

  • Participants explore the validity of assumptions regarding energy states, particularly the claim that certain energy states are unlikely. They question the justification for dropping higher-order terms in the Taylor expansion and discuss the implications of energy units on the derivation. There is also a focus on the probability distribution's behavior for large systems and how it relates to the mean energy value.

Discussion Status

The discussion is ongoing, with participants actively questioning and clarifying various aspects of the derivation. Some have offered insights into the nature of probability distributions in large systems, while others continue to challenge the assumptions and mathematical justifications presented in the original derivation.

Contextual Notes

Participants note that the assumptions about the relative sizes of the subsystems and their energy states are critical to the discussion. There is an emphasis on the need for clarity regarding the implications of energy units and the conditions under which the Taylor expansion is valid.

quasar987
Science Advisor
Homework Helper
Gold Member
Messages
4,796
Reaction score
32
I find the mathematical manipulation of thermodynamics are often dubious and this one is doubly so. It is the derivation of the Boltzmann probability distribution made by Reif (Section 6.2).

He considers an isolated system [itex]A^{(0)}[/itex] of energy [itex]E^{(0)}[/itex] in thermodynamical equilibrium composed of two system A and A' in thermal contact, where A' has much more degrees of freedom that A (A'>>A). He also supposes that the energy of interaction of A and A' are negligible so that [itex]E^{(0)}=E+E'[/itex].

These initial conditions set, he tries to find the probability of finding system A in state r of corresponding energy [itex]E_r[/itex]. It follows from the postulate of equiprobability that the probability of finding A in state r is just the ratio of all the possible states of [itex]A^{(0)}[/itex] given that A is in state r to the ratio of all possible states of [itex]A^{(0)}[/itex]*:

[tex]P_r=\frac{\Omega'(E^{(0)}-E_r)}{\Omega^{(0)}}[/tex]

He then says, let us expand [itex]\Omega'(E^{(0)}-E_r)[/itex] in a Taylor series about [itex]E^{(0)}[/itex]. But he actually expands the log of that in order to get a faster convergence:

[tex]\ln(\Omega'(E^{(0)}-E_r))\approx \ln(\Omega'(E^{(0)}))-\left[\frac{\partial \Omega'}{\partial E'} \right]_{E^{(0)}}E_r[/tex]

And he justifies the dropping of the higher order terms by the fact that [itex]E_r <<E^{(0)}[/itex].

But I have three problems with this justification. (problem #3 is closely linked to #2)

1) [itex]E_r=E^{(0)}[/itex], E'=0 is a possible state! And in principle, we do not know just how unlikely it is until we've effectively found the formula for [itex]P_r[/itex]!

2) It seems to me that [itex]E_r <<E^{(0)}[/itex] justifies nothing! We want [itex]E_r<<1[/itex] so that the higher the power n, the smaller [itex](E_r)^n[/itex] is.

3) And wheter [itex]E_r[/itex] is <1 actually depends on the units of energy used! In, say, a relativistic taylor expansion, the argument is v/c which is dimensionless, so the problem of dimensionality does not arise. But in this case?! :confused: I would really appreciate it if someone could nullify these two objections for me.

Many thanks!*Reif uses [itex]\Omega'(E')[/itex] to denote the cardinality of the set of all states for which the energy of A' is E' and [itex]\Omega^{(0)}[/itex] to denote the cardinality of the set of all possible states of the system [itex]A^{(0)}[/itex] as a whole.
 
Last edited:
Physics news on Phys.org
Concerning your point (2) and (3).

Its is Er/E° that must be << 1 , not Er.
There is no problem in the dimensions, because the dimension of dW/dE * dEr is right.
There is small mistake in your formula for the Taylor expansion of ln(W).

For point (1), for large enough system, the probability distribution of the total energy of the (sub-) system is a very sharp peak around the mean value. You should try to calculate that sometimes: try a perfect gas and the M-B distribution. Also, have a look back at statistics: how does the sum of many (gaussian) distribution shape as a function of the number of items in the sum? Nice games, with generating functions, etc ...
 
lalbatros said:
Concerning your point (2) and (3).

Its is Er/E° that must be << 1 , not Er.

My point exactly.

I miswrote the expansion; I forgot an "ln". So if I redo the expansion, expliciting every step, I want to expand [itex]\ln(\Omega'(E'))[/itex] about [itex]E^{(0)}[/itex], so

[tex]\ln(\Omega'(E'))=\ln(\Omega'(E^{(0)}))+\left[ \frac{\partial \ln(\Omega'(E'))}{\partial E'} \right]_{E^{(0)}}(E'-E^{(0)})+...[/tex]

But [itex]E'-E^{(0)}=-E_r[/itex], so it becomes

[tex]\ln(\Omega'(E^{(0)}-E_r))=\ln(\Omega'(E^{(0)}))-\left[ \frac{\partial \ln(\Omega'(E'))}{\partial E'} \right]_{E^{(0)}}E_r+...[/tex]

As you can see, it is E_r and not Er/E° that needs to be <<1 in order for the approximation to be justified, but this 1) is not true for all r, 2) depends on the units in which we measure the energy.


lalbatros said:
For point (1), for large enough system, the probability distribution of the total energy of the (sub-) system is a very sharp peak around the mean value.
I am aware of this result. But if we restrict our study to the states r of energies near the average value, how can we later on assert that the distribution holds for all r?
 
The point is that E0 is waaaay larger than Er. So you're still very much in the neighbourhood of E0 which justifies the expansion. It's always the relative magnitude that is important.

If you like you can write [tex]E^0-E_r=E^0(1-\frac{E_r}{E_0})[/tex].
 
But
[tex]\frac{\partial \ln(\Omega'(E'))}{\partial E'} = 1/k_{B}T[/tex]

Right? Which is basically just the energy of the reservoir, so the units work, and [itex]E_r << E^{(0)}[/itex] by supposition, so any higher terms are going to cancel.
 
Galileo said:
The point is that E0 is waaaay larger than Er.
Not necessarily. We're looking for the probability of r where r is any accesible state. An accesible state is any state that is compatible with the macroscopic state. In particular, Er=E0 and E'=0 is an accesible state. It is unlikely because there are not many ways in which this can be realized, and the probability that it be realized is the ratio of this number of alll the accesible states. But we don't know that yet. That's what we're supposed to find out!
Galileo said:
If you like you can write [tex]E^0-E_r=E^0(1-\frac{E_r}{E_0})[/tex].

What does this change in the context of the expansion above?
 
Last edited:
Tomsk said:
But
[tex]\frac{\partial \ln(\Omega'(E'))}{\partial E'} = 1/k_{B}T[/tex]

Right? Which is basically just the energy of the reservoir, so the units work, and [itex]E_r << E^{(0)}[/itex] by supposition, so any higher terms are going to cancel.

It has never been hypothesized that E_r << E^{(0)}. But if we suppose that it is, in order to make the approximation valid, i.e. if we choose to only consider this particular band of energies, then how can we say that the answer we are going to find is valid for all r even for those for which Er=E0?

And I am not saying that the units don't "work" in the sense that ln(W) doesn not turn out dimensionless. I am saying that since the Taylor expansion is in powers of E_r, if we measure the energy in units such that E_r is gigantic, the dropping of the terms is unjustified.

(or are you saying that k_B will be consequently adjusted so as to maintain the product Er/kT the same in any unit system? This is confusing.. :/)
 
Last edited:
The probability of the smaller system A being in a state [itex]E_r[/itex] is proportional to [itex]\Omega(E')\Omega(E_r)[/itex], as for every [itex]\Omega(E')[/itex], the small system can be in anyone of [itex]\Omega(E_r)[/itex]. But [itex]\Omega(E_r)=1[/itex] because the system is so small it only has one state, Er. So if you want ANY state up to E' ~ Er, then you need to Taylor expand [itex]\Omega(E')\Omega(E_r)[/itex].
 
[itex]E_r=E^{(0)}[/itex] is not a possible state if the number of degrees of freedom of A' >> A, which is what has been assumed. It follows that [itex]E_r << E^{(0)}[/itex]
 
  • #10
Tomsk said:
[itex]E_r=E^{(0)}[/itex] is not a possible state if the number of degrees of freedom of A' >> A, which is what has been assumed. It follows that [itex]E_r << E^{(0)}[/itex]

I was imagining a large lattice and a small latice. Even though the large latice has more atoms, it is possible that at some instant, they all be at rest and the energy be all in the small latice.
 
  • #11
quasar987 said:
I am saying that since the Taylor expansion is in powers of E_r, if we measure the energy in units such that E_r is gigantic, the dropping of the terms is unjustified.

(or are you saying that k_B will be consequently adjusted so as to maintain the product Er/kT the same in any unit system? This is confusing.. :/)

If we use units that give Er to be very big, then E' will be even bigger. kT is the energy E' I think, so it becomes negligible for higher powers. I think that follows, not entirely sure. I think you'd get something like Er^2/E'^2, which is pretty small if Er<<E'
 
  • #12
quasar987 said:
I was imagining a large lattice and a small latice. Even though the large latice has more atoms, it is possible that at some instant, they all be at rest and the energy be all in the small latice.

Doesn't that seem to be very improbable though? There are lots more ways of arranging the energy so that it is spread around the larger lattice than the number of way all of that energy can be arranged in the small lattice. A small lattice is only going to keep a large amount of energy for a very small time, it wouldn't be stable with it. I see your point, but I think this approximation is really for when the energy of the system stays well below the energy of the reservoir. Since this is a much more likely observation, I think this approximation is valid.
 
  • #13
The assumption is that [tex]\Omega(E')[/tex] is a steeply rising function of E' and [tex]\Omega(E_r)[/tex] is a steeply rising function of Er.
In fact, they are essentially power functions with the power of the order of the number of degrees of freedom (see section 2.5).
If you then look at section 3.3, it is treated why the probability
[tex]P(E) = C \Omega(E)\Omega'(E^0 - E)[/tex] is such a sharp function.

Once you have this, this justifies the appoximations of working in a very small neighbourhood of the maximum.

As concerning to your "dimensional" argument, it is not necessary that in a series a_1 x + a_2 x^2 + ... a_n x^n...
x be smaller than 1 to have quick convergence. It depends also on the n-dependence of a_n.

Clearly, if you change energy units, you will change the numerical value of x, but also of all coefficients in the expansion, and the a_n coefficient will change with the unit conversion factor ^ n.
 
  • #14
I don't have Reif's book, but in any thermobook I've read they assume Er<<E0.

You noted yourself that ln(W) doesn't depend on which units you use (to whatever order). If you measure in eV Er will be much bigger, but derivatives wrt E will drop in the same proportion giving the same entropy ln(W). So the problem is not the units, but whether the entropy S=ln(W) doesn't change much around S(E) when Er is small compared to E.

I think it can be shown, maybe from the convexity of S. Also, in the end it is only the relative magnitudes of [tex]\Omega(E^0-Er)[/tex] that will play a role, since it will be normalized such that the probabilities add to 1. I'm a bit fried at the moment but I`ll give it some thought.
 
  • #15
You could also write

ln(W) = ln(W°) + dln(W°)/dln(E°) * Er /E°

This shows you that Er/E° << 1 is the right condition.
And of course the coefficient in the taylor expansion which is

dln(W°)/dln(E°)

has no dimension.
 
  • #16
vanesch said:
As concerning to your "dimensional" argument, it is not necessary that in a series a_1 x + a_2 x^2 + ... a_n x^n...
x be smaller than 1 to have quick convergence. It depends also on the n-dependence of a_n.
And the logarithm is such a function? Do you have numbers? (ex: precise to k% regardless of the value of x when keeping only the linear term)


lalbatros said:
You could also write

ln(W) = ln(W°) + dln(W°)/dln(E°) * Er /E°

Eeeh? Would you mind explaining what you did here?
 
  • #17
Here's what I think is a pretty convincing reasoning using bits and pieces from all the people who gave their opinion on this:

[itex]\Omega'(E')\propto E^{f_A'}[/itex] (where f_A' is the degree of freedom of system A', which is large by hypothesis). Now,

[tex]\frac{\partial \ln(\Omega'(E'))}{\partial E'}\propto \frac{\partial f\ln(E')}{\partial E'}=\frac{f}{E'}[/tex]

So

[tex]\left[ \frac{\partial \ln(\Omega'(E'))}{\partial E'} \right]_{E^{(0)}}E_r\propto f\frac{E_r}{E^{(0)}}[/tex]

In general,

[tex]\frac{1}{n!}\left[ \frac{\partial^n \ln(\Omega'(E'))}{\partial E'^n} \right]_{E^{(0)}}E_r\propto (-1)^{n-1}\frac{1}{n}f\left(\frac{E_r}{E^{(0)}}\right)^n[/tex]

So at least now we have a power series in E_r/E°, which is always lesser or equal to unity.
 
Last edited:

Similar threads

  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 1 ·
Replies
1
Views
3K
Replies
9
Views
2K
Replies
4
Views
1K
  • · Replies 14 ·
Replies
14
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 5 ·
Replies
5
Views
3K
  • · Replies 2 ·
Replies
2
Views
3K