I realize the Insight is intended as an overview of statistical mechanics as opposed to an introduction to it. However, I'll comment on the logical structure, since I've never found an introduction where it is presented clearly, and perhaps someday you'll write a textbook.
NFuller said:
My point was that since the subsystems are identical, if you relabel the points, one could not tell which points have been relabeled or which points have been moved. This allows for the a priori probability distribution ##p_{i}=\omega_{i}/\Omega_E##.
I don't see that the subsystems are "identical".
My interpretation: apparently the underlying probability space we are talking about has outcomes of the form "pick a subsystem". If we assume each subsystem has an equal probability of being selected, then it follows that the probability of the event "we select a subsystem in state ##i##" is ##\omega_i/\Omega_E##.
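As a sanity check on that reading, here is a minimal sketch with made-up occupation numbers (nothing here comes from the Insight): if each subsystem really is equally likely to be picked, the empirical frequency of "the selected subsystem is in state ##i##" settles near ##\omega_i/\Omega_E##.

```python
import random
from collections import Counter

# Hypothetical occupation numbers: omega[i] subsystems are in state i (toy values).
omega = [3, 5, 2]
Omega_E = sum(omega)  # total number of subsystems

# Label each subsystem by the index of the state it occupies.
subsystems = [i for i, w in enumerate(omega) for _ in range(w)]

# Outcome of the underlying probability space: "pick a subsystem" uniformly at random.
trials = 100_000
counts = Counter(random.choice(subsystems) for _ in range(trials))

for i, w in enumerate(omega):
    print(f"state {i}: empirical {counts[i] / trials:.3f}   omega_i/Omega_E = {w / Omega_E:.3f}")
```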
The conceptual difficulty in applying this to a particular physical system (e.g. gases) is that we are dealing with a finite set of points in phase space, and nothing is said about how these points are selected. If the goal is to model what happens in experiments, then we should imitate how an experiment randomly encounters a subsystem. For example, if we are modeling the event "a randomly selected person's last name begins with 'X'" by randomly selecting a person from a set of people, then we need a set of people whose last names are representative of the entire population.
Are we assuming that any "bias" in the finite set of subsystems is supposed to be overcome by the fact that we attain a representative sample as the number of subsystems under consideration approaches infinity? For a finite population, we would eventually achieve a representative sample because the sample would be the whole population. However, I don't see why taking more and more points in a (continuous) phase space necessarily guarantees a representative sample.
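To illustrate why more points alone may not settle the issue, here is a toy sketch (the setup is entirely my own assumption, not something in the Insight): the "true" phase space is ##[0,1)## with uniform measure, but the points are generated by a biased procedure, and the fraction of them landing in ##[0,0.5)## converges to the wrong value no matter how many points we take.

```python
import random

def biased_point():
    # A biased way of generating phase-space points on [0, 1):
    # the square root of a uniform variable has density 2x, so it over-samples the right half.
    return random.random() ** 0.5

# Under the uniform measure, the region [0, 0.5) has probability 0.5.
for n in (10**3, 10**5, 10**6):
    frac = sum(biased_point() < 0.5 for _ in range(n)) / n
    print(f"n = {n:>8}: fraction in [0, 0.5) = {frac:.3f}   (uniform measure gives 0.500)")
```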
NFuller said:
I defined equilibrium as a maximum entropy state.
Did you define equilibrium?
You wrote:
At equilibrium, the probability to be in a given state is expected to be constant in time.
Do we take that as the definition of equilibrium?
Justifying that the maximum entropy distribution gives the probabilities that would occur in a random measurement of a system at equilibrium seems to require an argument involving the limiting probabilities of a random process.
NFuller said:
The system may fluctuate between different microstates but these microstates must be consistent with a fixed macroscopic variables, i.e. pressure, volume, energy, etc. There is nothing which would lead us to believe that one microstate is favored over another.
Suppose Bob partitions ##\Omega_E## into 10 states ##b_1,b_2,...b_{10}## and Alice partitions ##\Omega_E## into 10 states ##a_1,a_2,...a_{10}## in a different manner. Setting ##p_{b_1} = \omega_{b_1}/\Omega_E = 1/10 ## might give a different model for a random measurement than setting ## p_{a_1} = \omega_{a_1}/\Omega_E = 1/10##. For example, Alice might have chosen to make her ##a_1## a proper subset of Bob's ##b_1##.
So something about the condition "these microstates must be consistent with a fixed macroscopic variables" needs to be invoked to prevent ambiguity. I don't see how this is done.
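To make the ambiguity in the Bob/Alice example concrete: since Alice's ##a_1## is a proper subset of Bob's ##b_1##, additivity forces ##P(b_1) = P(a_1) + P(b_1 \setminus a_1)## in any single probability model, so Bob's assignment ##P(b_1) = 1/10## and Alice's assignment ##P(a_1) = 1/10## can coexist only if the nonempty region ##b_1 \setminus a_1## carries zero probability. Nothing in the counting rule, as stated, tells us whether that is so.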
-----
Using Stirling's approximation, this can be written as ##S=k\left[\Omega_E\ln\Omega_E-\Omega_E-\sum_{i=1}^{l}\omega_i\ln\omega_i+\sum_{i=1}^{l}\omega_i\right]##
It's worth reminding readers that the error in Stirling's approximation for ##N!## approaches infinity as ##N## approaches infinity. So finding the limit of a function of factorials cannot, in general, be done by replacing each factorial with its Stirling approximation. For the specific case of multinomial coefficients, the replacement works. (If we encounter a situation where we are letting the number of factorials in the denominator of a multinomial coefficient approach infinity, I'm not sure what happens. For example, if we need to take a limit as we partition ##\Omega_E## into more and more ##\omega_i##, it might take some fancy analysis.)
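As a quick numerical check of both points, here is a sketch using made-up occupation fractions (the numbers are mine, not from the Insight): the absolute error of ##\ln N! \approx N\ln N - N## keeps growing with ##N##, yet the relative error in the Stirling-based expression for the log of a multinomial coefficient shrinks as ##\Omega_E## grows with the fractions ##\omega_i/\Omega_E## held fixed.

```python
import math

def ln_factorial(n):
    """Exact ln(n!) via the log-gamma function."""
    return math.lgamma(n + 1)

def stirling(n):
    """Crude Stirling form ln(n!) ~ n*ln(n) - n, as used in the Insight."""
    return n * math.log(n) - n if n > 0 else 0.0

# Assumed toy fractions for a 4-state partition; omega_i = fraction * Omega_E.
fractions = [0.1, 0.2, 0.3, 0.4]

for Omega_E in (10**2, 10**4, 10**6):
    omegas = [round(f * Omega_E) for f in fractions]
    # ln W for the multinomial coefficient W = Omega_E! / (omega_1! * ... * omega_l!)
    exact = ln_factorial(Omega_E) - sum(ln_factorial(w) for w in omegas)
    approx = stirling(Omega_E) - sum(stirling(w) for w in omegas)
    print(f"Omega_E = {Omega_E:>9}:  "
          f"abs. error in ln(Omega_E!) = {ln_factorial(Omega_E) - stirling(Omega_E):6.2f},  "
          f"rel. error in ln W = {(exact - approx) / exact:.2e}")
```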