Lectures on statistical mechanics

Allan N. Kaufman; Bruce I. Cohen; Alain J. Brizard

doi:10.1017/S0022377825000042

Lectures on statistical mechanics

Part of: JPP Lecture Notes

Published online by Cambridge University Press: 20 June 2025

and

Allan N. Kaufman: Affiliation:
Physics Department, University of California, Berkeley, CA, USA
Bruce I. Cohen*: Affiliation:
Physics Division, Lawrence Livermore National Laboratory, CA, USA
Alain J. Brizard: Affiliation:
Physics Department, St. Michael’s College, Burlington, VT, USA
*: Corresponding author: Bruce I. Cohen, bruceicohen@gmail.com

Article contents

Abstract
Foreword
Equilibrium statistical mechanics
Nonequilibrium statistical mechanics
Footnotes
References

Rights & Permissions

Abstract

Presented here is a transcription of the lecture notes from Professor Allan N. Kaufman’s graduate statistical mechanics course Physics 212A and 212B at the University of California Berkeley from the 1972–1973 academic year. 212A addressed equilibrium statistical mechanics with topics: fundamentals (micro-canonical and sub-canonical ensembles, adiabatic law and action conservation, fluctuations, pressure, and virial theorem), classical fluids and other systems (equation of state, deviations from ideality, virial coefficients and van der Waals potential, canonical ensemble and partition function, quasistatic evolution, grand-canonical ensemble and partition function, chemical potential, simple model of a phase transition, quantum virial expansion, numerical simulation of equations of state, and phase transition), chemical equilibrium (systems with multiple species and chemical reactions, law of mass action, Saha equation, chemical equilibrium including ionization and excited states), and long-range interactions (including Coulomb, dipole, and gravitational interactions, Debye–Hückel theory, and shielding). 212B addressed nonequilibrium statistical mechanics with topics: fundamentals (definitions: realizations, moments, characteristic function, and discrete variables), Brownian motion (Langevin equation, fluctuation–dissipation theorem, spatial diffusion, Boltzmann’s H-theorem), Liouville and Klimontovich equations, Landau equation (derivation, elaboration, and H-theorem, and irreversibility), Markov processes and Fokker–Planck equation (derivations of the Fokker–Planck equation and a master equation), linear response and transport theory (linear Boltzmann equation, linear response theory of Kubo and Mori, relation of entropy production to electrical conductivity, transport relations and coefficients, normal mode solutions of the transport equations, sketch of a generalized Langevin equation method for transport theory), and an introduction to nonequilibrium quantum statistical mechanics.

Keywords

plasma dynamics plasma nonlinear phenomena

Information

Type: Lecture Notes
Information: Journal of Plasma Physics , Volume 91 , Issue 3 , June 2025 , E87

DOI: https://doi.org/10.1017/S0022377825000042 [Opens in a new window]

NASA ADS Abstract Service [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2025. Published by Cambridge University Press

Allan N. Kaufman, 1927–2022

Foreword

Allan Kaufman (1927–2022) grew up in the Hyde Park neighborhood of Chicago not far from the University of Chicago. Allan attended the University of Chicago for both his undergraduate and doctoral degrees in physics. Allan’s doctoral thesis advisor was Murph Goldberger who was relatively new to the faculty at Chicago and just five years older than Allan. Allan presented a theoretical thesis on a strong-coupling theory of meson–nucleon scattering. Allan published an autobiographical article entitled ‘A half-century in plasma physics’ in A.N. Kaufman, Journal of Physics: Conference Series 169 (2009) 012002.

Allan worked at Lawrence Livermore Laboratory from June 1953–1963. While at Livermore Laboratory he taught the one-year graduate course in electricity and magnetism in 1959–1963 at UC Berkeley. In 1963 he first taught the first semester of the graduate course in Theoretical Plasma Physics 242 at Berkeley. He taught the plasma theory course at UCLA in the 1964–1965 school year while on leave from Livermore before joining the faculty at UC Berkeley in the 1965 school year. Allan frequently taught the graduate plasma theory course and the graduate statistical mechanics course Physics 212A and B until his retirement from teaching in 1998.

These lecture notes were from Kaufman’s graduate statistical mechanics course in the 1972–1973 academic year. The notes follow the chronological order of the lectures. The equations and derivations are as Kaufman presented, and the text is a reconstruction of Kaufman’s discussion and commentary. Equation numbers were added to facilitate the exposition of the derivations. Although the material is 50 years old, the mathematical rigor and elegance of Kaufman’s treatment of the subject matter should still be useful to students interested in learning the fundamentals of statistical mechanics. A few of the equations that are important results and conclusions in the analysis are labeled as “Theorems” to draw attention to them: these are not necessarily formal theorems in the mathematical sense but are consistent with terminology in physics textbooks. Editor’s Notes, Editor’s Addendum, and Reviewer’s Comments have been inserted with the goal of providing additional useful material, updates, and references. In this regard we are very much indebted to the three reviewers of these lecture notes who were very energetic and whose suggestions have added considerable value, which deserves attribution and recognition. In particular, we thank Dominique Escande and Martin Lemoine for their many valuable comments in reviewing the manuscript. These lecture notes are intended as a resource.

The focus of Kaufman’s research at Berkeley was plasma physics. Although these lecture notes address the general subject of statistical mechanics, there is a definite emphasis on plasma physics in the examples and applications. Statistical mechanics is foundational for plasma physics. Examples of specific material in these lecture notes addressing plasma physics topics are as follows: Hamiltonian theory for a kinetic plasma with Coulomb forces, rigorous derivation of the pressure and virial expansion, partition function and statistics for an unmagnetized plasma in thermal equilibrium with electromagnetic waves, the Bohr–Van Leeuwen theorem in an equilibrium plasma, derivation of the Poisson–Boltzmann equation for a Coulomb model of a plasma in thermal equilibrium, analysis of Debye shielding and quasineutrality conditions, derivation of the Maxwell–Boltzmann equilibrium distribution, Hamiltonian theory of a nonequilibrium plasma with electromagnetic fields, Langevin equation model of Brownian motion in a plasma with Coulomb forces, the fluctuation–dissipation theorem, Boltzmann’s H-theorem, derivations of Liouville, Klimontovich, Vlasov, Landau, Boltzmann, and Fokker–Planck equations in a plasma, linear response theory and derivation of transport equations and coefficients, collisions, and conductivities (electrical and thermal) in a plasma.

Bruce Cohen joined Kaufman’s research group during the 1971–1972 academic year and received his PhD in 1975. These lecture notes were word-processed in 2023 after Allan’s death in December of 2022. Allan encouraged Cohen to word-process his notes on plasma theory and statistical mechanics so that they could be shared. In 2019 Kaufman and Cohen published Cohen’s transcription of Kaufman’s lecture notes from the graduate plasma physics course at Berkeley, Physics 242 (Kaufman & Cohen Reference Kaufman and Cohen2019).

Alain Brizard worked at Berkeley as a post-doctoral researcher from 1989–1992 with Kaufman, from 1992–1994 with Ken Fowler, and the summers of 1995–2000 with Kaufman and Jonathan Wurtele. Alain was a research collaborator with Kaufman for three decades. Brizard published many papers with Kaufman and the book Ray Tracing and Beyond , with E.R. Tracy, A.S. Richardson, and Kaufman, Cambridge University Press (2014). Brizard reviewed Kaufman & Cohen (Reference Kaufman and Cohen2019) and suggested valuable improvements before its publication.

Professor Kaufman’s work on these lecture notes was performed while he was employed as a Professor of Physics at the University of California Berkeley. Professor Kaufman’s separate research activity was funded in part by the United States Department of Energy. Bruce Cohen’s work on these lecture notes was pro bono. Cohen’s separate research activity has been funded at the Lawrence Livermore National Laboratory by the United States Department of Energy. Alain Brizard is a Professor of Physics at St. Michael’s College in Vermont, and his separate research activity in the present and past has been supported by the United States National Science Foundation and the Department of Energy. Brizard’s work on these lecture notes was not funded under his research grants.

Bruce I. Cohen

Alain J. Brizard

1. Equilibrium statistical mechanics

[Editor’s Note: In the first lecture of Physics 212A Kaufman discussed the syllabus and schedule for the lectures. Kaufman used CGS units with some customizations throughout his notes, e.g., Boltzmann’s constant is set to unity. There was no textbook for the course. Some of the references for his lectures included L. D. Landau and E. M. Lifshitz, Statistical Physics (Landau & Lifshitz Reference Landau and LIfshitz1969); R. C. Tolman, The Principles of Statistical Mechanics (Tolman Reference Tolman1938); R. Kubo, Statistical Mechanics (Kubo Reference Kubo1965); J. O. Hirschfelder, C. F. Curtiss, and R. B. Bird, Molecular Theory of Gases and Liquids (Hirschfelder, Curtiss & Bird Reference Hirschfelder, Curtiss and Bird1954); F. Reif, Fundamentals of Statistical and Thermal Physics (Reif Reference Reif1965); H. B. Callen, Thermodynamics and an Introduction to Thermostatics (Callen Reference Callen1960); and R. Becker, Theory of Heat (Becker Reference Becker1967).]

[Reviewer Dominique Escande’s Comment: Section 2.4 of Sator, Pavloff & Couëdel (Reference Sator, Pavloff and Couëdel2023) provides a series of useful references in order to go further into the foundations of statistical mechanics.]

1.1. Fundamentals

Statistical mechanics provides a mathematical framework for bridging the gap between microscopic laws to macroscopic descriptions. Statistical mechanics is confronted with a set of dichotomies: equilibrium versus nonequilibrium; a range of degrees of freedom from few (∼10) to many (∼10 ${}^{3}$ ), to very many (∼10 ${}^{24}$ ), to a denumerable infinity, to an uncountable infinity; classical versus quantum; relativistic versus nonrelativistic; closed versus open systems; inert versus chemically reactive; and many levels of description, e.g., exact, kinetic f(x , v , t), fluid n(x , t), v (x ,t), T(x , t). Statistical mechanics is home to the fundamental laws of thermodynamics: (0) $A = B = C$ transitivity; (1) conservation of energy; (2) the change of entropy is nonnegative $\Delta S\ge 0$ ; (3) entropy ${S} \rightarrow 0$ as temperature ${T} \rightarrow 0$ . Statistical mechanics distinguishes between extensive and intensive properties of matter, i.e., properties that are either volume dependent or independent, respectively.

1.1.1. Postulate of equal probabilities

Definition: A macroscopic state is described by a set of partial information. A microscopic state can be described by a set of either classical information or quantum information that is a complete set of detailed information at the finest level including boundary and initial conditions.

Example: Consider $N$ coupled harmonic oscillators with Hamiltonian H given by

(1.1)

\begin{equation} H=\sum\limits ^N_{i=1}{{{\tfrac{1}{2}}}}\!\left (p^2_i+{\omega }^2_iq^2_i\right )+\lambda \sum\limits _{ijk}{c_{ijk}q_i}q_jq_k \end{equation}

with parameters $N, \lambda , \{c_{ijk}\},\{{\omega }_i\}$ . In a finite-sized box, the energy eigenstates for an uncoupled system of harmonic oscillators are discretized and representative of quantum systems. Because H in this example has no explicit time dependence, H is a constant of the motion; and the macrostate can be characterized by its energy without knowledge of the initial conditions. There is only partial information available in this example.

How does one relate microscopic information to a macroscopic description?

Postulate (Fundamental Postulate – R.C. Tolman) All microstates consistent with the given partial information (macrostate) are equally probable.

Definition: $\varGamma \!\left (E_0\right )\equiv$ number of microstates with $E\lt E_0$

At this point we drop the classical picture for a little while for pedagogic reasons, chiefly because counting and summing microstates over a discretized phase space resolves certain mathematical measure complications encountered in classical systems.

Definition: In the discretized quantum picture the probability of one given microstate is

(1.2)

\begin{equation} \textrm {probability} \equiv w_n=\left \{ \begin{array}{l@{\quad}l} \frac {1}{\varGamma \!\left (E_0\right )},& {E}_n\lt E_0, \\[4pt] 0,& {E}_n\gt E_0. \end{array} \right . \end{equation}

In the limit of a very large number $\varGamma (E_0)$ we employ the correspondence principle, and in the volume $\Omega$ of allowed phase space one obtains

(1.3)

\begin{equation} \varGamma \!\left (\Omega \right )=\mathop {\textrm {lim}}_{h\to 0}\frac {\int\nolimits _{\Omega }{\mathrm{d}q\mathrm{d}p}}{h^n}. \end{equation}

For large $\varGamma$ there are many accessible microstates, and the probability of a given microstate with energy ${E}_n$ is a relatively smooth function of energy because the granularity of the energy is such fine scale.

1.1.2. Example: $N$ uncoupled oscillators

Example: N uncoupled simple harmonic oscillators. Let ${N} = 3$ and use a canonical transformation to action-angle variables:

(1.4)

\begin{equation} \left . \begin{array}{c} q_i\equiv \sqrt {\frac {2J_i}{{\omega }_i}}{\sin {\theta }_i} \\[5pt] p_i\equiv \sqrt {2J_i{\omega }_i}{\cos {\theta }_i} \end{array} \right \}\to H=\sum\limits ^3_{i=1}{{\omega }_i}J_i. \end{equation}

Further simplify by requiring by requiring ${\omega }_1={\omega }_2={\omega }_3={\omega }_0$ . We note that $\{{\theta }_1,{\theta }_2,{\theta }_3\}$ are ignorable in H; hence, the actions $J_i$ are constants of the motion; and the energy is given by $E={\omega }_0\!\left (J_1+J_2+J_3\right )$ . The volume occupied by the system of three oscillators in phase space is a triangular solid in J-space, with vertices $\{J_0,0,0\}$ , $\{0,J_0,0\}$ , and $\left \{0,0,J_0\right \}$ where $J_0=E_0/{\omega }_0$ and a rectangular solid in $\theta$ -space spanning $[0,2\pi ]$ in each of the three $\theta$ ${}_{ }$ coordinates. The product volume in the $\{J,\theta \}$ phase space divided by h ${}^{3}$ yields the number of states:

(1.5)

\begin{equation} \varGamma \!\left (E_0\right )=\frac {\textrm{volume}}{h^3}=\frac {{(2\pi )}^3}{h^3}{{\frac{1}{2}}}{\bigg({{\frac {E_0}{{\omega }_0}}}\bigg)}^2\frac {1}{3}\frac {E_0}{{\omega }_0}={{\frac {1}{3!}}}{\bigg(\frac {E_0}{{\hslash \omega }_0}\bigg)}^3{.} \end{equation}

In (1.5) we are assuming that typically the number of states is large, i.e., $E_0\gg {\hslash \omega }_0N$ . For the general case of N oscillators,

(1.6)

\begin{equation} \varGamma \!\left (E_0,3\right )\to \varGamma \!\left (E_0,N\right )=\frac {1}{N!}{\bigg(\frac {E_0}{{\hslash \omega }_0}\bigg)}^N{.} \end{equation}

If we allow each of N oscillators to have any one of M possible energy states, where

(1.7)

\begin{equation} M\equiv \frac {E-{{\tfrac{1}{2}}}{\hslash \omega }_0}{{\hslash \omega }_0}, \end{equation}

then Kubo (Reference Kubo1965, p. 38) shows that the number of distinguishable states is given by

(1.8)

\begin{equation} \varGamma \!\left (E,{\omega }_0\right )=\sum\limits _M{\frac {\!\left (N+M-1\right )!}{\!\left (N-1\right )!M!}.} \end{equation}

Example: To illustrate (1.8) consider $N = 3$ oscillators and $M = 4$ energy levels, for which there are 15 states {(4,0,0), (3,1,0), (3,0,1), (2,2,0), (2,0,2), (2,1,1), (1,3,0), (1,0,3), (1,2,1), (1,1,2), (0,4,0), (0,0,4), (0,3,1), (0,1,3), (0,2,2)} in agreement with $(N + M - 1)$ !/[ $(N - 1)$ ! $M$ !] = 6!/(2! 4!) = 15.

Definition: The specific oscillator energy is ${\mathcal E}\equiv {E}/{N}$ .

Returning to (1.6) and using the definition of the specific oscillator energy one obtains

(1.9)

\begin{equation} \varGamma \!\left (E_0,N\right )=\frac {1}{N!}{\bigg(\frac {E_0}{{\hslash \omega }_0}\bigg)}^N=\frac {N^N}{N!}{\bigg(\frac {{{\mathcal E}}_0}{{\hslash \omega }_0}\bigg)}^N\approx \frac {e^N}{\sqrt {2\pi N}}{\bigg(\frac {{{\mathcal E}}_0}{{\hslash \omega }_0}\bigg)}^N{,} \end{equation}

where in (1.9) we have made use of Stirling’s approximation $N!\approx \ \sqrt {2\pi N}\,{\!\left (N/e\right )}^N$ for large N. We identify ${{{\mathcal E}}_0}/{{\hslash \omega }_0}$ as a basic quantum number. We note that if the basic quantum number is O(10) and N ∼ 10 ${}^{10{-}20}$ , $\varGamma$ is rather large.

Definition: Take the natural logarithm of the number of states and introduce the concept of entropy. If all states in $\varGamma$ are equally probable, then define the entropy as

(1.10)

\begin{equation} S\!\left (E_0,N\right )\equiv {\text{ ln } \varGamma }\to N\ \textrm {ln}\bigg(\frac {{{\mathcal E}}_0}{{\hslash \omega }_0}\bigg)+N-\frac{1}{2}{\text{ ln}\!\left (2\pi N\right )}\approx N{\text{ ln} \bigg(\frac {{{\mathcal E}}_0}{{\hslash \omega }_0}\bigg)}+N \end{equation}

for large N. Thus, S ∼ O(N).

Definition: Introduce the specific entropy ${\mathcal S}\equiv S/N$ . Hence,

(1.11)

\begin{equation} {\mathcal S}={\text{ ln} \bigg(\frac {{{\mathcal E}}_0}{{\hslash \omega }_0}\bigg)}+1, \end{equation}

which has no N dependence (‘normal dependence’) and is a number of order unity.

[Editor’s Note: Prof. Kaufman remarked at this point that Problem 2-33 in Kubo (Reference Kubo1965) addressing the correspondence principle was interesting and not at all obvious.]

Example: Consider an ensemble of N atoms or molecules with a harmonic oscillator Hamiltonian. We will derive the specific entropy for an ideal gas.

The model Hamiltonian for an ideal gas of atoms or molecules is given by

(1.12)

\begin{equation} H=\sum\limits ^{3N}_{i=1}{\bigg[\frac {p^2_i}{2m}+\Phi \bigg],} \end{equation}

where each atom or molecule has three degrees of freedom in its motion, and we consider a cube with volume $V=L^3$ and we assume the potential energy $\Phi =0.$ The magnitudes of the momentum components are constrained by the total energy for each oscillator: $p^2_1+p^2_2+p^2_3\lt {(\sqrt {2mE})}^2$ . The phase-space volume is the product of the cubic volume V and the spherical volume $({4\pi }/{3}){(\sqrt {2mE})}^3$ , and the number of states for three degrees of freedom per oscillator scales as

(1.13)

\begin{equation} {\varGamma }_{f=3}\sim \frac {\dfrac {4\pi }{3}{\big(\sqrt {2mE}\big)}^3L^3}{h^3}. \end{equation}

In (1.13) we note the quantum discretization. To derive the number of states for N atoms or molecules we begin with the volume of a 3N-dimensional sphere is given by

(1.14)

\begin{equation} V_{3N}\!\left (R\right )={\pi}^{\!\frac{3N}{2}}R^{3N}/\varGamma \bigg(\frac {3N}{2}+1\bigg), \end{equation}

where $\varGamma \!\left (z\right )$ denotes the gamma function. Note that since $\varGamma \!\left (n+1\right )=n!$ for any nonnegative integer, the shorthand notation $\varGamma[({3N}/{2})+1]=\left ({3N}/{2}\right )!$ is used. Now we introduce the dimensionless ‘phase-space’ radius

(1.15)

\begin{equation} R=\frac {pL}{h}=\sqrt {2mE}L/h, \end{equation}

where the volume in configuration space is $V=L^3$ . We divide $V_{3N}(R)$ by N! to eliminate permutations of indistinguishable states to obtain

(1.16)

\begin{equation} \varGamma \!\left (E;V,N\right )=\frac {V_{3N}\!\left (R\right )}{N!}=\frac {{\!\left (2\pi mE\right )}^{\frac{3N}{2}}V^N}{h^{3N}\bigg(\dfrac {3N}{2}\bigg)!N!}, \end{equation}

where E is the total energy for N particles with three degrees of freedom each. At this point we introduce a few definitions to facilitate reducing equation (1.16) to a more recognizable form.

Definition: The specific energy is ${\mathcal E}=E/N.$ The particle density is then $n\equiv N/V$ .

With these definitions, use of Stirling’s approximation to remove the factorials, $\varGamma \!\left (z+1\right )\cong \sqrt {2\pi z}{\left ({z}/{e}\right )}^z,$ for ${z}\gg 1$ , and with ${N} \gg 1$ we obtain the following result from (1.16)

(1.17)

\begin{equation} \varGamma \approx {\left (\frac {4\pi }{3}\frac {m{\mathcal E}}{{\overline {p}}^2}\right )}^{\frac {3N}{2}}{\left (\frac {1}{n{\Lambda}^3}\right )}^N\frac {e^{\frac {5N}{2}}}{\sqrt {6}\pi N}={\left (\frac {1}{n{\Lambda}^3}\right )}^N\frac {e^{\frac {5N}{2}}}{\sqrt {6}\pi N}, \end{equation}

where the average momentum per particle is $\overline {p}\equiv \sqrt {({4\pi }/{3})m{\mathcal E}}$ and the thermal de Broglie wavelength is $\Lambda \equiv {h}/{\overline {p}}$ . Here $n{\Lambda}^3$ is the number of particles in a de Broglie cube, which must be a small number to justify a classical description. From (1.17) we calculate the entropy and recover the specific entropy of an ideal gas:

(1.18)

\begin{align} S&=\text{ ln } \varGamma \to {\mathcal S}=\frac {S}{N}\nonumber\\[-2pt] &={\frac {{\text{ ln } \varGamma }}{N} = \frac {5}{2}} -{\text{ ln}(n{\Lambda}^3)} -\frac {{\text{ ln } \textrm {N}}}{N}-\frac {{\text{ ln } \pi }}{N}-\frac{1}{2}\frac {{\text{ ln } 6}}{N}\ \approx \frac {5}{2}-{\text{ ln}(n{\Lambda}^3)} \end{align}

1.1.3. Microcanonical ensemble

Next we introduce the concepts of subcanonical and microcanonical ensembles

Definitions: An ensemble of states for energies $E_{N}\lt E$ is a subcanonical ensemble, and we denote the number of states by ${\varGamma }_{\sigma }$ . The ensemble of states for energies $E-\delta E\lt E_n\lt E+\delta E$ is defined as a microcanonical ensemble, and its number of states is denoted ${\varGamma }_{\mu }$ .

Physical subcanonical ensembles have monotonically increasing ${\varGamma }_{\sigma }$ as functions of increasing energy E. We can evaluate ${\varGamma }_{\mu }$ as follows using (1.18):

(1.19)

\begin{align} {\varGamma }_{\mu }\!\left (E,\delta E\right )&={\varGamma }_{\sigma }\!\left (E\right )-{\varGamma }_{\sigma }\!\left (E-\delta E\right )={\varGamma }_{\sigma }\!\left (E\right )\left [1-\frac {{\varGamma }_{\sigma }\!\left (E-\delta E\right )}{{\varGamma }_{\sigma }\!\left (E\right )}\right ]\nonumber\\&={\varGamma }_{\sigma }\!\left (E\right )\left [1-\frac {e^{S(E-\delta E)}}{e^{S(E)}}\right ] \approx {\varGamma }_{\sigma }\!\left (E\right )\left [1-\frac {e^{S\left (E\right )-\delta E\frac {\textrm{d}S}{\textrm{d}E}+\frac{1}{2}{\left (\delta E\right )}^2\frac {\textrm{d}^2S}{\textrm{d}E^2}+\dots }}{e^{S\left (E\right )}}\right ]\nonumber\\ &\approx {\varGamma }_{\sigma }\!\left (E\right )\left [1-e^{-\beta \delta E+\frac{1}{2}{\left (\delta E\right )}^2\frac {\textrm{d}\beta }{\textrm{d}E}}\right ], \end{align}

where $\beta \equiv ({\textrm{d}S})/({\textrm{d}E})\equiv {1}/{T}$ . Note that the specific energy ${E}/{N}\sim O\!\left (T\right ),$ and hence the last term in the exponential on the right-hand side of (1.19) is small compared with the $\beta \delta E$ term given the constraint $T\ll \delta E\ll E$ , so that ${\varGamma }_{\mu }\!\left (E,\delta E\right )\sim {\varGamma }_{\sigma }\!\left (E\right )\!\left (1-e^{-\beta \delta E}\right )$ . Furthermore, $e^{-\beta \delta E}$ is exponentially small and, hence, ${\varGamma }_{\mu }\sim {\varGamma }_{\sigma }$ . The interpretation of this is that the number of states is a sharply increasing function of energy such that the volume of the hypersphere is dominated by the volume of the bounding annular shell, i.e., for $V\sim R^N\to {\delta V}/{V}\sim N({\delta R}/{R})$ , N $\gg$ 1 and ${\delta R}/{R}\ll 1,$ but $N({\delta R}/{R})=O(1)$ . For the conditions ${\mathcal E}\ll \delta E\ll E \to {1}/{N}\ll {\delta E}/{E}\ll 1$ , the system remains on the hypersurface that can be parametrized in terms of the actions and fills it. The angle space is filled as well.

Now consider the classical entropy after Taylor-series expanding,

(1.20)

\begin{equation} \,{\varGamma }_{\mu }\!\left (E,\delta E\right )={\varGamma }_{\sigma }\!\left (E\right )-{\varGamma }_{\sigma }\!\left (E-\delta E\right )\approx \delta E\frac {\textrm{d}{\varGamma }_{\sigma }}{\textrm{d}E}+O({\delta E}^2) \end{equation}

after Taylor-series expanding. As compared with (1.19), ${\varGamma }_{\mu }\propto \delta E$ as $\delta E\to 0$ rather than ${\varGamma }_{\sigma }$ . The entropy is the logarithm of ${\varGamma }_{\mu }$

(1.21)

\begin{equation} S_{\mu }={\text{ ln } {\varGamma }_{\mu }={\text{ ln } \delta E+{\text{ ln } \frac {\textrm{d}{\varGamma }_{\sigma }}{\textrm{d}E}}}}. \end{equation}

The first term on the right-hand side of (1.21) is a fixed additive term and small compared with the second term which is the natural logarithm of the density of states and is very large.

The classical microcanonical entropy is to good approximation

(1.22)

\begin{equation} S_{\mu ,\textrm{class}}={\text{ ln } \frac {\textrm{d}{\varGamma }_{\sigma }}{\textrm{d}E}}. \end{equation}

Example: Calculate $S_{\mu ,\textrm{class}}$ for the harmonic oscillator model of the ideal gas and compare it with the quantum entropy expression. We anticipate that if the two expressions are different, it is only due to constants. The classical microcanonical entropy is given in (1.21), while the quantum entropy is given by

(1.23)

\begin{equation} S_{qm}\equiv {\text{ ln } \varGamma } \to \varGamma =e^{S_{qm}} \to \frac {\textrm{d}\varGamma }{\textrm{d}E}=e^{S_{qm}}\frac {\textrm{d}S_{qm}}{\textrm{d}E}\equiv \beta e^{S_{qm}} \end{equation}

Using ${\textrm{d}\varGamma }/{\textrm{d}E}=\beta e^{S_{qm}}$ in the last term in (1.21),

(1.24)

\begin{equation} S_{\mu ,\textrm{class}}={\text{ ln } \delta E}+{\text{ ln } \beta e^{S_{qm}}={{\text{ ln } \delta E}+\text{ ln } \beta +S_{qm}}}=O(1)+O(1)+O(N)\approx S_{qm_{_{_{}}}} \end{equation}

We note that we should introduce h ${}^{N}$ in the denominator in the expressions for $\varGamma$ to give the correct dimensionless units for the phase-space normalized volume. However, this results in no change in the final formulas due to taking the logarithm of a product expands into the sum of logarithms; and the $S_{qm}\approx$ O(N) term remains dominant.

Next consider the phase space of a system with many degrees of freedom whose trajectory in phase space is constrained by a Hamiltonian. Define a subdomain in this phase space as a shell with thickness $\delta \ell$ and volume defined by $\textrm{d}p\textrm{d}q=\textrm{d}A\delta \ell$ , and the thickness of the shell is parametrized by a variation in the total energy:

(1.25)

\begin{equation} H\!\left (p,q\right )=E-\delta E,\quad \frac {\delta E}{\delta \ell }= |{\nabla H}|,\quad \delta \ell =\frac {\delta E} {|{\nabla H(p,q)}|}, \end{equation}

which varies as a function of p and q in phase space. If the probability of the system occupying a given subdomain in phase space is proportional to the volume of the subdomain, then

(1.26)

\begin{equation} \textrm {Probability}\propto \mathrm{d}A\ \delta \ell =\mathrm{d}A\frac {\delta E}{|{\nabla H(p,q)}|}. \end{equation}

Theorem (Boltzmann’s Ergodic Hypothesis): The orbit of the system of microstates will completely fill the volume of the accessible phase space given the initial data constraining the degrees of freedom. Over time any subdomain will be occupied for a time proportional to the subdomain’s volume, i.e., all accessible microstates are equally probable over a long period of time.

As stated here the Ergodic Hypothesis has the difficulty that the orbit of the system is a one-dimensional manifold embedded within the energy surface and has a different measure than that of the energy surface. Thus, the orbit of the system cannot fill the energy surface in a strict sense. Hence, there is a need for refining the Ergodic Hypothesis as follows.

Theorem (Quasi-ergodic Hypothesis due to G. D. Birkhoff (Reference Birkhoff1931)): Every finite region on the energy surface is accessible.

Theorem (Ergodic): A system spends equal times in equal volumes (except for a set of measure zero pathological initial conditions).

Corollary: For any integrable function of the phase-space coordinates $f(p,q)$ , the time average of $f(p,q)$ is equal to its space average almost everywhere. This is a very important consequence of the ergodic theorem.

1.1.4. Nonequilibrium macrostates

We next take up the examination of nonequilibrium macrostates. Consider the simple example of a domain composed of two adjacent contiguous subdomains I and II occupied by an ideal gas with numbers of particles and energies {N ${}_{\textrm{I}}$ , E ${}_{\textrm{I}}$ } and {N ${}_{\textrm{II}}$ , E ${}_{\textrm{II}}$ }. We further assume that the ideal gas is described by the same harmonic oscillator Hamiltonian introduced in (1.12). After an invisible membrane is removed we allow transfer of energy between the two subsystems, but no losses to the exterior world. Thus,

(1.27)

\begin{equation} E_{\text{I}}+E_{\textrm{II}}=\text{const}\equiv E. \end{equation}

The accessible number of microstates for the combined system before the membrane is removed is given by

(1.28)

\begin{equation} \varGamma \!\left (E_{\textrm{I}},E_{\textrm{II}}\right )={\varGamma }_{\textrm{I}}\!\left (E_{\textrm{I}}\right ){\varGamma }_{\textrm{II}}\!\left (E_{\textrm{II}}\right ), \end{equation}

from which follows that the entropy is given by

(1.29)

\begin{equation} S\!\left (E_{\textrm{I}},E_{\textrm{II}}\right )={\text{ ln } \varGamma \!\left (E_{\textrm{I}},E_{\textrm{II}}\right )={\text{ ln } {\varGamma }_{\textrm{I}}\!\left (E_{\textrm{I}}\right )+{\text{ ln } {\varGamma }_{\textrm{I}}\!\left (E_{\textrm{II}}\right )}}} = S_{\textrm{I}}\!\left (E_{\textrm{I}}\right )+S_{\textrm{II}}\!\left (E_{\textrm{II}}\right ). \end{equation}

After the membrane is removed, a constraint on the number of states is removed; and the final number of states can exceed the initial number of states:

(1.30)

\begin{equation} {\varGamma }_{\text{init}}\lt {\varGamma }_{\textrm{final}}\!\left (E\right )=\int\nolimits ^E_0{\varGamma (}E_{\textrm{I}}, E_{\textrm{II}}=E-E_{\textrm{I}})\textrm{d}E_{\textrm{I}}. \end{equation}

Think of the integral in (1.30) as the sum over the number of possible energy states. Clearly the initial and final entropies satisfy $S_{\textrm{init}}\lt S_{\textrm{final}}$ . The probability that the subsystem I has energy E ${}_{\textrm{I} }$ subject to the constraint that the total system energy is E, is given by the relative fraction of states in system I having energy $E_{\textrm{I}}$ and system II having energy $E_{\textrm{II}}$ , which given by the product of the respective numbers of microstates, divided by the total number of states having energy ${E=E}_{\textrm{I}}+E_{\textrm{II}}$ :

(1.31)

\begin{equation} \rho \!\left (E_{\textrm{I}}| E\right )=\frac {\varGamma (E_{\textrm{I}}, E_{\textrm{II}})}{\varGamma (E)}=\frac {e^{S\left (E_{\textrm{I}},E_{\textrm{II}}\right )=S_{\textrm{I}}\!\left (E_{\textrm{I}}\right )+S_{\textrm{II}}\!\left (E_{\textrm{II}}\right )}}{e^{S(E)}}\propto e^{S_{\textrm{I}}\!\left (E_{\textrm{I}}\right )}e^{S_{\textrm{II}}\!\left (E_{\textrm{II}}=E-E_{\textrm{I}}\right )}. \end{equation}

The exponential $e^{S_{\textrm{I}}\!\left (E_{\textrm{I}}\right )}$ in (1.31) is a monotonically increasing function of E ${}_{\textrm{I}}$ , whereas the exponential $e^{S_{\textrm{II}}\!\left (E_{\textrm{II}} = E-E_{\textrm{I}}\right )}$ is a monotonically decreasing function of E ${}_{\textrm{I}}$ . Hence, the probability $\rho$ has a sharp peak at some value E ${}_{\textrm{I}}$ = E ${}_{\textrm{I}*}$ satisfying

(1.32)

\begin{equation} \frac {\partial \rho }{\partial E_{\textrm{I}}}\propto \frac {\partial S_{\textrm{I}}}{\partial E_{\textrm{I}}}+\frac {\partial S_{\textrm{II}}}{\partial E_{\textrm{II}}}\!\left (-1\right )=0. \end{equation}

Using our earlier introduced definitions of $\beta$ and T, $\beta \equiv ({\textrm{d}S})/({\textrm{d}E})\equiv {1}/{T}$ , (1.32) yields the following relation:

(1.33)

\begin{equation} {\beta }_{\textrm{I}}\!\left (E_{\textrm{I}}=E_{\textrm{I}*}\right )={\beta }_{\textrm{II}}\!\left (E_{\textrm{II}}=E-E_{\textrm{I}*}\right ), \end{equation}

at $E_{\textrm{I}}=E_{\textrm{I}*}$ where $\rho$ achieves a sharp maximum, i.e., its equilibrium; and $T_{\textrm{I}}=T_{\textrm{II}}$ .

We have not yet specified what systems I and II are composed of. For example, recall the expression for the specific entropy of the system comprised of harmonic oscillators in (1.11),

(1.34)

\begin{equation} {\mathcal S}=\textrm {ln}\!\left (\frac {{\mathcal E}}{{\hslash \omega }_0}\right )+1, \end{equation}

and the expression for the specific entropy of an ideal gas given in (1.18)

(1.35)

\begin{equation} {{\mathcal S} =\frac {5}{2}-{\text{ln}\!\left (n{\Lambda}^3\right ),}} \end{equation}

where the thermal de Broglie wavelength is $\Lambda \equiv {h}/{\overline {p}}$ and $\overline {p}\equiv \sqrt {({4\pi }/{3})m{\mathcal E}}$ were introduced earlier preceding (1.17). For systems composed of an ideal gas each of the three degrees of freedom per atom has $({1}/{2})T$ energy, then ${\mathcal E}=({3}/{2})T$ is the specific entropy. Moreover,

(1.36)

\begin{equation} {\beta }_{\textrm{I}}=\frac {1}{T_{\textrm{I}*}}=\frac {N_{\textrm{I}}}{\frac {2}{3}E_{\textrm{I}*}}={\beta }_{\textrm{II}}=\frac {1}{T_{\textrm{II}}}=\frac {N_{\textrm{II}}}{\frac {2}{3}{(E-E}_{\textrm{I}*})}, \end{equation}

which determines ${\ E}_{\textrm{I}*}$ . For a system composed of one-dimensional harmonic oscillators, the specific potential energy and kinetic energy each have $({1}/{2})T$ energy; and thus the specific energy for each oscillator is ${\mathcal E}=T$ .

We next consider fluctuations ${\delta E}_{\textrm{I}}$ in ${\ E}_{\textrm{I}}$ away from its equilibrium value ${E}_{\textrm{I}*}$ . We examine the formal Taylor-series expansions of S ${}_{\textrm{I} }$ and S ${}_{\textrm{II}}$ with respect to deviations ${\delta E}_{\textrm{I}}$ from ${E}_{\textrm{I}*}$ :

(1.37)

\begin{align} \;\;\!\!\!\!\!\!\!\!S_{\textrm{I}}\!\left (E_{\textrm{I}}\right )&=S\!\left (E_{\textrm{I}*}\right )+{\delta E}_{\textrm{I}}{\left (\frac {\textrm{d}S_{\textrm{I}}}{\textrm{d}E_{\textrm{I}}}\right )}_{E_{\textrm{I}*}}+\frac{1}{2}{\left ({\delta E}_{\textrm{I}}\right )}^2{\left (\frac {\textrm{d}{\beta }_{\textrm{I}}}{\textrm{d}E_{\textrm{I}}}\right )}_{E_{\textrm{I}*}}+\dots ,\qquad\qquad \end{align}

(1.38)

\begin{align} S_{\textrm{II}}\!\left (E_{\textrm{II}}\right )&=S\!\left ({E-E}_{\textrm{I}*}\right )+{\delta E}_{\textrm{II}}{\left (\frac {\textrm{d}S_{\textrm{II}}}{\textrm{d}E_{\textrm{II}}}\right )}_{E_{\textrm{II}}}+\frac{1}{2}{\left ({\delta E}_{\textrm{II}}\right )}^2{\left (\frac {\mathrm{d}{\beta }_{\textrm{II}}}{\textrm{d}E_{\textrm{II}}}\right )}_{E_{\textrm{II}}}+\dots \nonumber \\[4pt] &= S\!\left (E_{\textrm{II}}={E-E}_{\textrm{I}*}\right )\ -\,{\delta E}_{\textrm{I}}{\left (\frac {\textrm{d}S_{\textrm{II}}}{\textrm{d}E_{\textrm{II}}}\right )}_{E_{\textrm{II}}}+\frac{1}{2}{\left ({\delta E}_{\textrm{II}}\right )}^2{\left (\frac {\mathrm{d}{\beta }_{\textrm{II}}}{\textrm{d}E_{\textrm{II}}}\right )}_{{E-E}_{\textrm{I}*}}+\dots .\qquad \end{align}

Use of (1.31) for the probability and (1.33), (1.37), and (1.38) yields

(1.39)

\begin{align} \rho \!\left ({\delta E}_{\textrm{I}}\right )&=e^{\frac{1}{2}{\left ({\delta E}_{\textrm{I}}\right )}^2{\left (\frac {\textrm{d}{\beta }_{\textrm{I}}}{\textrm{d}E_{\textrm{I}}}+\frac {\textrm{d}{\beta }_{\textrm{II}}}{\textrm{d}E_{\textrm{II}}}\right )}_{E_{\textrm{I}*}}} \sim e^{-\frac{1}{2}{\left ({\beta \delta E}_{\textrm{I}}\right )}^2O\left (\frac {1}{N_{\textrm{I}}}+\frac {1}{N_{\textrm{II}}}\right )}\nonumber \\[4pt]&\sim e^{-\ \frac {{\beta }^2}{2}{\left ({\delta E}_{\textrm{I}}\right )}^2\left (\frac {1}{{C_{\textrm{I}}N}_{\textrm{I}}}+\frac {1}{C_{\textrm{II}}N_{\textrm{II}}}\right )}\equiv e^{-\ \frac {{\!\left ({\delta E}_{\textrm{I}}\right )}^2}{2{\sigma }^2}}. \end{align}

The probability has a sharp peak around its most probable (equilibrium) value at ${E}_{\textrm{I}*}.$

Definition: C appears in (1.39) and is the derivative of the specific energy with respect to the temperature and depends explicitly on the degrees of freedom,

(1.40)

\begin{equation} C\equiv \frac {\mathrm{d}{\mathcal E}}{\textrm{d}T}=\left \{ \begin{array}{l} \frac {3}{2}\ \ \textrm {ideal gas,} \\ 1\ \ \textrm {harmonic oscillator.}\ \end{array} \right . \end{equation}

One can read off the standard deviation $\sigma$ around the peak of the probability distribution $\rho \!\left ({\delta E}_{\textrm{I}}\right )$ from the right-hand side of (1.39)

(1.41)

\begin{equation} \sigma =T{\left (\frac {1}{{C_{\textrm{I}}N}_{\textrm{I}}}+\frac {1}{C_{\textrm{II}}N_{\textrm{II}}}\right )}^{-{1}/{2}}\sim T\sqrt {N}. \end{equation}

We note that for $N_{\textrm{I}}\sim N_{\textrm{II}},$ (1.41) determines that $\sigma \sim T\sqrt {N}$ and ${\sigma }/{E_{\textrm{I}}}\sim {1}/{\sqrt {N}}\ll 1$ . In the limit that $N_{\textrm{I}}{\ll N}_{\textrm{II}}$ , e.g., a heat bath, then $\sigma =\sqrt {{C_{\textrm{I}}N}_{\textrm{I}}}T$ and ${\sigma }/{E_{\textrm{I}}}\sim {1}/{\sqrt {N_{\textrm{I}}}}\ll 1$ .

Let’s compare S(E) with S(E $(E^*_{\textrm{I}},E^*_{\textrm{II}})$ :

(1.42)

\begin{equation} \varGamma \!\left (E\right )=\int\nolimits ^E_0{\textrm{d}E_{\textrm{I}}\varGamma \!\left (E_{\textrm{I}},E_{\textrm{II}}\right )=}\int\nolimits ^E_0{\textrm{d}E_{\textrm{I}}\varGamma \!\left (E^*_{\textrm{I}},E^*_{\textrm{II}}\right )e^{-\ \frac {{\!\left ({\delta E}_{\textrm{I}}\right )}^2}{2{\sigma }^2}}\approx \varGamma \!\left (E^*_{\textrm{I}},E^*_{\textrm{II}}\right )\sqrt {2\pi {\sigma }^2}} \end{equation}

aside from units. Using the relation $S={\text{ln}\,\varGamma}$ , (1.42) leads to

(1.43)

\begin{equation} S\!\left (E\right )=S\!\left (E^*_{\textrm{I}},E^*_{\textrm{II}}\right )+\tfrac{1}{2}{\text{ ln}\!\left (2\pi {\sigma }^2\right )}\approx S\!\left (E^*_{\textrm{I}},E^*_{\textrm{II}}\right ) \end{equation}

because $S\!\left (E\right ),S\!\left (E^*_{\textrm{I}},E^*_{\textrm{II}}\right )\sim O(N)$ $\gg$ $({1}/{2}){\text{ ln}(2\pi {\sigma }^2)}\sim O({\text{ln } N)}$ . The conclusion is that the entropy is somewhat invariant relative to the system constraints involved.

Example: Consider $N_{\textrm{II}}\gg N_{\textrm{I}}$ and $E_{\textrm{II}}\gg E_{\textrm{I}}$ , a heat bath if you will. Then

(1.44)

\begin{equation} \varGamma \!\left (E_{\textrm{I}},E\right )={\varGamma }_{\textrm{I}}\!\left (E_{\textrm{I}}\right ){\varGamma }_{\textrm{II}}\!\left ({E-E}_{\textrm{I}}\right )={\varGamma }_{\textrm{I}}\!\left (E_{\textrm{I}}\right )e^{S_{\textrm{II}}\left (E\right )-E_{\textrm{I}}\frac {\textrm{d}S_{\textrm{II}}}{\textrm{d}E_{\textrm{II}}}+\frac{1}{2}E^2_{\textrm{I}}\frac {\textrm{d}^2S_{\textrm{II}}}{\textrm{d}E^2_{\textrm{II}}}+\dots . } \end{equation}

We note that ${\textrm{d}^2S_{\textrm{II}}}/{\textrm{d}E^2_{\textrm{II}}}={\textrm{d}\beta }/{\textrm{d}E_{\textrm{II}}}\sim -{N^2_{\textrm{I}}}/{N_{\textrm{II}}}$ in (1.44), and we further impose that $N^2_{\textrm{I}}\ll N_{\textrm{II}}$ so that this term is small. Hence, (1.44) becomes

(1.45)

\begin{equation} \varGamma \!\left (E_{\textrm{I}},E\right )\cong {\varGamma }_{\textrm{II}}\!\left (E\right )e^{-{\beta }_{\textrm{II}}\!\left (E_{\textrm{I}}-T_{\textrm{II}}S_{\textrm{I}}(E_{\textrm{I}}\right ))}\equiv {\varGamma }_{\textrm{II}}\!\left (E\right )e^{-{\beta }_{\textrm{II}}F_{\textrm{I}}(E_{\textrm{I}},T_{\textrm{II}})}, \end{equation}

where we have introduced the definition of the free energy $F_{\textrm{I}}(E_{\textrm{I}},T_{\textrm{II}})\equiv \ E_{\textrm{I}}-T_{\textrm{II}}S_{\textrm{I}}(E_{\textrm{I}})$ , so called because this is the energy available to do work. We recall that for an ideal gas $T_{\textrm{I}}=({2}/{3})({E_{\textrm{I}}}/{N_{\textrm{I}}})$

The probability is proportional to $\varGamma \!\left (E_{\textrm{I}},E\right )$ , i.e.,

(1.46)

\begin{equation} \rho (E_{\textrm{I}})\propto e^{-{\beta }_{\textrm{II}}F_{\textrm{I}}(E_{\textrm{I}},T_{\textrm{II}})}. \end{equation}

The peak of the probability distribution determines the most probable value of E ${}_{\textrm{I}}$ which corresponds to the minimum of the free energy F ${}_{\textrm{I}}$ . Minimizing the free energy of the microstate is equivalent to maximizing the total system entropy.

Example: The free energy of an ideal gas is given by

(1.47)

\begin{equation} F_{\textrm{I}}\!\left (E_{\textrm{I}},T_{\textrm{II}}\right )\equiv \ E_{\textrm{I}}-T_{\textrm{II}}S_{\textrm{I}}\!\left (E_{\textrm{I}}\right )=N_{\textrm{I}}\frac {3}{2}T_{\textrm{I}}-T_{\textrm{II}}N_{\textrm{I}}\left [\frac {5}{2}-{\text{ln}\!\left (n_{\textrm{I}}{\Lambda}^3_{\textrm{I}}(T_{\textrm{I}})\right )}\right ]. \end{equation}

Exercise: Show that the most probable temperature is $T^*_{\textrm{I}}=T_{\textrm{II}}$ .

Next we turn to the calculation of the probability of a quantum microstate. Recall the expression given in (1.31). The probability of a microstate n in subsystem I in contact with subsystem II is constructed as follows. First, we observe based on (1.31)

(1.48)

\begin{equation} w^I_n\propto {\varGamma }_{\textrm{II}}\!\left (E_{\textrm{II}}=E-E^I_n\right )=e^{S_{\textrm{II}}(E_{\textrm{II}})\!\left (E-E^I_n\right )}=e^{S_{\textrm{II}}(E_{\textrm{II}})E}e^{-{\beta }_{\textrm{II}}E^I_n+\dots } \end{equation}

and we note that the $e^{S_{\textrm{II}}(E_{\textrm{II}})E}$ is just a probability constant that will cancel out after division. Dividing the right-hand side of (1.48) by the sum of ${\varGamma }_{\textrm{II}}$ over all n yields the probability

(1.49)

\begin{equation} w^I_n=\frac {e^{-{\beta }_{\textrm{II}}E^I_n}}{Z};\quad Z\equiv Z_{\textrm{I}}({\beta }_{\textrm{II}})\equiv \sum\limits _n{e^{-{\beta }_{\textrm{II}}E^I_n}}, \end{equation}

where Z constitutes the Gibbs canonical ensemble, i.e., the statistical ensemble of possible states in equilibrium with a heat bath at fixed temperature.

1.1.5. Adiabatic law and action conservation

Consider the slow evolution of a system, i.e., an adiabatic change. We refer to Kubo’s book for the ideas here.

Example: Assume a slowly varying Hamiltonian for a harmonic oscillator system with N = 1 modeled by

(1.50)

\begin{equation} H\!\left (p,q;t\right )=\tfrac{1}{2}p^2+\tfrac{1}{2}{\omega }^2_0(t)q^2 \end{equation}

and we assume ${\textrm{d}{\omega }_0}/{\textrm{d}t}\ll {\omega }^2_0$ . Energy is not conserved here because the Hamiltonian is time dependent due to ${\omega }_0\!\left (t\right ).$ The elliptical orbit of the system in the (p,q) phase space evolves, but the area of the ellipse is conserved, i.e., there is an adiabatic invariant. The time derivative of the Hamiltonian can be calculated from

(1.51)

\begin{equation} \frac {\textrm{d}H}{\textrm{d}t}=\frac {\partial H}{\partial t}={\omega }_0{\dot {\omega }}_0q^2 \end{equation}

From (1.51) we calculate the time-integrated change $\Delta H$ from (1.51):

(1.52)

\begin{equation} \Delta H=\int\nolimits {\textrm{d}t\dot {H}=\int\nolimits {\textrm{d}t\frac {{\dot {\omega }}_0}{{\omega }_0}}}{\omega }^2_0q^2. \end{equation}

For purposes of calculating $\Delta H$ over time durations long compared with the oscillation period, we can assume that ${{\dot {\omega }}_0}/{{\omega }_0}$ is approximately constant over the oscillation period; and we can average ${\omega }^2_0q^2$ over the oscillation period. Noting that $({1}/{2})\langle {\omega }^2_0q^2\rangle =({1}/{2})\langle H\rangle$ , we conclude that

(1.53)

\begin{equation} \Delta H\approx \int\nolimits {\textrm{d}t\frac {{\dot {\omega }}_0}{{\omega }_0}}\langle H\rangle \quad \textrm {and}\quad \frac {\langle \dot {H}\rangle }{\langle H\rangle }\approx \frac {{\dot {\omega }}_0}{{\omega }_0}. \end{equation}

Definition: Introduce the action

(1.54)

\begin{equation} J\equiv \frac {1}{2\pi }\int\nolimits {p\mathrm{d}q=\frac {\langle H\rangle }{{\omega }_0}}. \end{equation}

Exercise: Calculate the time derivative of J in (1.54) and use (1.53) to deduce

(1.55)

\begin{equation} \frac {\dot {J}}{J}=\frac {\dot {\langle H\rangle }}{\langle H\rangle }-\frac {{\dot {\omega }}_0}{{\omega }_0}\approx 0. \end{equation}

Hence, the action is ‘conserved.’

Example: $N \gt 1$ and generalize the time dependence of the Hamiltonian: $H\!\left (p,q;\lambda \!\left (t\right )\right )$ where $\lambda \to \lambda +\Delta \lambda $ in a time interval $\Delta t$ that is large, i.e., $\lambda$ is assumed to change at a very slow rate compared with ${\omega }_0$ :

(1.56)

\begin{align} H&=H(p,q;\lambda \!\left (t\right )),\ \ \ \ \frac {\mathrm{d}\textrm {ln}\ \lambda }{\textrm{d}t}\ll \frac {{\dot {\omega }}_0}{{\omega }_0}\to \nonumber \\[4pt] \Delta H&\equiv \int\nolimits ^{\Delta t}_0{\dot {H}\textrm{d}t=\int\nolimits ^{\Delta t}_0{\frac {\partial H}{\partial t}\textrm{d}t=}}\int\nolimits ^{\Delta t}_0{\frac {\partial H}{\partial \lambda }\frac {\textrm{d}\lambda }{\textrm{d}t}\textrm{d}t\approx \dot {\lambda }}\int\nolimits ^{\Delta t}_0{\frac {\partial H}{\partial \lambda }\textrm{d}t\equiv }\dot {\lambda }\Delta t{\left\langle \frac {\partial H}{\partial \lambda }\right\rangle }_t\nonumber \\[4pt] &=\dot {\lambda }\Delta t{\left\langle \frac {\partial H}{\partial \lambda }\right\rangle }_{{\varGamma }_{\mu }(E)}, \end{align}

where the time average has been replaced by an average over the energy surface in phase-space ${\varGamma }_{\!\mu }\!\left (E\right ).$

Hence, $\Delta H=\Delta \lambda {\langle {\partial H}/{\partial \lambda }\rangle }_E$ or ${\Delta E}/{\Delta \lambda }={\langle {\partial H}/{\partial \lambda }\rangle }_E$ where the energy E characterizes the microcanonical ensemble.

1.1.6. Subcanonical ensemble

The number of states for a subcanonical ensemble (all states with energies less than a particular energy E) is given by

(1.57)

\begin{equation} \varGamma (E,\lambda )\equiv \int\nolimits {\textrm{d}p\textrm{d}q \theta \!\left (E-H\!\left (p,q;\lambda \right )\right )},\quad \textrm{where}\quad \theta \equiv \left \{ \begin{array}{c} 1\ \ \ H\le E, \\[4pt] 0\ \ \ H\gt E. \end{array} \right . \end{equation}

Consider a small change in $\varGamma$ due to a small change in the parameter $\lambda$ and accompanying a change in E:

(1.58)

\begin{align} \triangle \varGamma &={\frac {\partial \varGamma }{\partial E}\bigg\vert }_{\lambda }\triangle E+{\frac {\partial \varGamma }{\partial \lambda }\bigg \vert }_E\triangle \lambda =\triangle \lambda \left \{{\frac {\partial \varGamma }{\partial E}}\bigg\vert _{\lambda }{\left \langle \frac {\partial H}{\partial \lambda }\right \rangle }_E+{\frac {\partial \varGamma }{\partial \lambda }\bigg\vert }_E\right \}\nonumber \\[4pt] &=\triangle \lambda \left \{{\int\nolimits {\textrm{d}p\textrm{d}q\delta (E-H)}\left \langle \frac {\partial H}{\partial \lambda }\right \rangle }_E+\int\nolimits {\textrm{d}p\textrm{d}q}\delta (E-H)\!\left (-{\left \langle \frac {\partial H}{\partial \lambda }\right \rangle }_E\right )\right \}=0,\;\; \end{align}

where we have made use of $({\textrm{d}\theta (x)})/({\textrm{d}x})=\delta (x)$ and $\delta \!\left (x\right )$ is the Dirac $\delta$ -function. Thus, the two terms cancel on the right-hand side of (1.58); and $\Delta \varGamma =0$ under a slow change in the parameter $\lambda$ . In practice, ${\textrm{d}\textrm {ln}\lambda }/{\textrm{d}t}$ is required to be much smaller than the rate of change of anything else in the system.

Corollary: Given ${\Delta E}/{\Delta \lambda }={\langle {\partial H}/{\partial \lambda }\rangle }_E$ for adiabatic changes, then $\Delta \varGamma =0$ , and in consequence $\Delta S=0$ . (Adiabatic Law: entropy is conserved.)

Example: For an ideal gas $\varGamma (E,V)\sim V^NE^{3N/2}$ and an adiabatic change in V, then $E\sim V^{-2/3}$ in order that $\Delta \varGamma = \textrm {0.}$ Hence, the pressure is $P\sim E/V\sim 1/V^{5/3}$ , i.e., $PV^{5/3}=$ const, the usual adiabatic law for an ideal gas.

1.1.7. Pressure and the virial theorem

We next introduce the concept of a generalized force $\left \{{\Lambda}_i\right \}$ to go with parameters $\lambda =\left \{{\lambda }_i\right \}$ . Consider an vector array of parameter values $\lambda$ and the Hamiltonian $H(p,q;\lambda )$ .

Example: Particles in their own electric field and in an externally applied electric field with electric potential $\phi _0$ have the Hamiltonian:

(1.59)

\begin{equation} H=\sum\limits _i{\frac {p^2_i}{2m_i}+\sum\limits _{i\lt j}{\frac {e_ie_j}{r_{ij}}}+\sum\limits _i{{e_i\phi }_0({{\boldsymbol{r}}}_i)}}, \end{equation}

where e ${}_{i}$ are the particle charges, m ${}_{i }$ are the particle charges, p ${}_{i}$ are the momenta, and r ${}_{ij}$ are the distances between the i and j particles. We note that

(1.60)

\begin{equation} \sum\limits _i{{e_i\phi }_0({{\boldsymbol{r}}}_i)}=\int\nolimits {\textrm{d}^3\textit {x}\ \rho ({\boldsymbol{x}},{\{{\boldsymbol{r}}}_i\})\phi _0({\boldsymbol{x}})} \end{equation}

and define the charge density as

(1.61)

\begin{equation} \rho \!\left ({\boldsymbol{x}},{{\boldsymbol{r}}}_i\right )=\sum\limits _i{e_i\delta (}{\boldsymbol{x}}-{{\boldsymbol{r}}}_i). \end{equation}

We can choose $\lambda$ to be whatever attribute of the Hamiltonian is of interest, e.g., $\boldsymbol\lambda=\left \{e_i\right \}$ or $\left \{{{\boldsymbol{r}}}_i\right \}$ or other.

Definition: The generalized force is

(1.62)

\begin{equation} {\boldsymbol {\Lambda} }(p,q;{\boldsymbol {\lambda }}{\mathbf )}\equiv \frac {\partial H(p,q;\boldsymbol\lambda)}{\partial\lambda} \quad\textrm{and}\quad {\boldsymbol {\Lambda} }(E;{\boldsymbol {\lambda }}{\mathbf )}\equiv {\langle {\boldsymbol {\Lambda} }(p,q;{\boldsymbol {\lambda }}{\mathbf )}\rangle }_{E,{\boldsymbol {\lambda } }} \end{equation}

Example: The functional derivative $({\partial H})/({\partial \phi _0(x)})= \rho \!\left ({\boldsymbol{x}},\{{{\boldsymbol{r}}}_i\}\right )$ (Goldstein Reference Goldstein1950; Schiff Reference Schiff1968).

In (1.62) ${\boldsymbol {\Lambda} }\!\left (E;{\boldsymbol {\lambda }}\right )$ is the thermodynamic generalized force, and the averaging brackets indicate an average over the accessible phase space for a given energy E. Then using the Adiabatic Law:

(1.63)

\begin{equation} {\boldsymbol {\Lambda} }(E;\lambda )=\frac {\partial H(p,q;\lambda )}{\partial \lambda } =\frac {\partial E(S,\lambda )}{\partial \lambda }\bigg \vert _S{.} \end{equation}

Example: The macroscopic charge density averaged over the phase space constrained by constant energy E and fixed entropy is

(1.64)

\begin{equation} \langle \rho \rangle \!\left (x\right )={\left .\frac {\partial E(S,\phi _0({\boldsymbol{x}}))}{\partial \phi _0(\textit{x})}\right .}\bigg \vert_S{.} \end{equation}

We next introduce the concept of pressure. Let $\lambda =V$ where V is the volume. Then using (1.63) the pressure P is

(1.65)

\begin{equation} P\!\left (p,q;V\right )\equiv -\Lambda =-\frac {\partial H(p,q;V)}{\partial V}. \end{equation}

Exercise: Take the model Hamiltonian for an ideal gas or a charged particle plasma and show the consistency of (1.65) with the elementary definition $P\equiv F/\textrm {area}$ where F is the macroscopic force. We note from (1.63) and (1.65) that $P=-({\partial E(S,V)})/({\partial V})\big \vert _S=-({\partial E\!\left (S,V\right )})/({\textrm {area}\partial \ell })\big \vert _S$ and one can identify the force from $-({\partial E\!\left (S,V\right )})/({\partial \ell })\big \vert _S$ noting that $\textrm{d}V={\textbf{area}} \cdot {\textbf{d}}\boldsymbol\ell$ . It is also helpful to note $T\equiv (\partial E(S,V))/({\partial S})\big \vert _V$ and generally $\textrm{d}E\!\left (S,\lambda \right )=T\mathrm{d}S+{\boldsymbol {\Lambda} }\cdot \textrm{d}{\boldsymbol {\lambda}}$ At constant entropy, $\textrm{d}E\!\left (\lambda \right )\vert_S={\boldsymbol {\Lambda} }\cdot \textrm{d}{\boldsymbol {\lambda } }=-P\mathrm{d}V=-P{{\textbf{area}}} \cdot {\textbf{d}}\boldsymbol\ell =-{\boldsymbol{F}}\cdot {\textbf{d}}\boldsymbol\ell =-\textrm{d}E$ . Hence, the pressure at constant entropy is the force divided by the area. We will return to consideration of the pressure subsequently.

[Editor’s Note: Kaufman made the cryptic remark that this exercise is not trivial and alluded to the Ergodic Theorem (§ 1.1.3) without further explanation.]

Next we introduce the concept of heat. Again consider a physical system composed of two subsystems I and II. The composite Hamiltonian is $H=H_{\textrm{I}}+\ H_{\textrm{II}}+H_{\textrm{int}}$ where $H_{\textrm{int}}$ is the interaction Hamiltonian. The energy gained or lost by subsystem I is then

(1.66)

\begin{equation} \Delta E_{\textrm{I}}=\int\nolimits {\textrm{d}t}{\dot {H}}_{\textrm{I}}=\int\nolimits {\textrm{d}t}\left \{H_{\textrm{I}},H\right \}=\int\nolimits {\textrm{d}t}\left \{H_{\textrm{I}},H_{\textrm{int}}\right \}\equiv Q_{\textrm{I}}. \end{equation}

The Poisson bracket is $\left \{A\!\left (p,q\right ),B(p,q)\right \}=-\Sigma _i(({\partial A})/({\partial p_i})({\partial B})/({\partial q_i})- ({\partial A})/({\partial q_i}) ({\partial B})/({\partial p_i})),$ and $Q_{\textrm{I}}$ is the heat transfer. The total time derivative of any quantity can be shown to be

(1.67)

\begin{equation} \dot {A}=\frac {\partial A}{\partial t}+\frac {\partial A}{\partial p}\dot {p}+\frac {\partial A}{\partial q}\dot {q}=\frac {\partial A}{\partial t}+\left \{A,H\right \}. \end{equation}

Theorem: If thermal equilibrium is maintained during heat input and if $\delta V=\delta \lambda =0$ , then

(1.68)

\begin{equation} Q=\delta E=T\delta S \to \delta S=Q/T \end{equation}

and for a slow variation of $\lambda$ in the neighborhood of thermal equilibrium:

(1.69)

\begin{equation} \Delta E=Q+W=Q+R, \end{equation}

where W or R equals the work done on the subsystem and Q is the heat or thermal input energy. If R = –PdV for small dV, then from dE(S,V) = TdS – PdV we realize that Q = TdS whether or not work is being done on or by the subsystem. If thermal equilibrium is not maintained, then internal processes will drive the system toward equilibrium with $\Delta S\gt 0$ and $\Delta S\ge Q/T$ where $\Delta S$ is the sum of internal and external heat input. We realize that (1.69) is quite general, and $\Delta S=(\Delta E-R)/T$ is generally true.

Theorem: The change in time of $T\textrm{d}S\ge \textrm{d}E+P\textrm{d}V,$ and there is equality if the system is in thermal equilibrium.

We return to consideration of the pressure. Consider a surface enveloping a volume and a differential surface area element $\textrm{d}^2{\boldsymbol \sigma }$ with the vector oriented outward and normal to the surface. The sum of forces on a “wall” at the surface of the volume is

(1.70)

\begin{equation} \sum\limits _i{{{\boldsymbol{f}}}_{\textit {i,w}}}=P\textrm{d}^2{\boldsymbol \sigma } \end{equation}

and as a consequence of Newton’s third law the force of the wall back on the volume is $\Sigma _i{{{\boldsymbol{f}}}_{\!\textit{w,i}}}=-P\textrm{d}^2{\boldsymbol \sigma }$ .

From (1.70), Newton’s law, and the divergence theorem

(1.71)

\begin{equation} \sum\limits ^N_{i=1}{{{\boldsymbol{f}}}_{\textit {w,i}}}\cdot {{\boldsymbol{r}}}_i=-\oint {P{{\boldsymbol{r}}\cdot\textrm{d}}^2{\boldsymbol \sigma }}=-\int\nolimits _V{\textrm{d}^3r}\nabla \cdot \!\left (P{\boldsymbol{r}}\right ) \end{equation}

Here the force of the wall back on the system is balanced by the pressure of the particles back on the wall. If we assume the system is in equilibrium then we can also assume that the pressure is uniform and pull P outside the integral in (1.71). Hence,

(1.72)

\begin{equation} \sum\limits ^N_{i=1}{{{\boldsymbol{f}}}_{\!\textit {w,i}}}\cdot {{\boldsymbol{r}}}_i=-\int\nolimits _V{\textrm{d}^3r}\nabla \cdot \!\left (P{\boldsymbol{r}}\right )=-P\int\nolimits _V{\textrm{d}^3r}\nabla \cdot \!\left ({\boldsymbol{r}}\right )=-3PV. \end{equation}

The last relation in (1.72) is the so-called ‘virial’ of the wall. Now consider Newton’s third law including forces on the particles on one another and of the wall on the particles:

(1.73)

\begin{equation} \sum\limits _i{{{\boldsymbol{r}}}_i\cdot m_i{\dot {{{{\boldsymbol v}}}}}_i=\sum\limits _{i\ne j}{\sum\limits _j{{{\boldsymbol{f}}}_{j,i}\cdot }}}{{\boldsymbol{r}}}_i+\sum\limits _i{{{\boldsymbol{f}}}_{w,i}}\cdot {{\boldsymbol{r}}}_i. \end{equation}

Using (1.72) to replace the last term in (1.73) we obtain the following.

(1.74)

\begin{equation} P= -\frac {1}{3V}\left \{\sum\limits _i{{{\boldsymbol{r}}}_i\cdot m_i{\dot {{{{\boldsymbol v}}}}}_i}-\sum\limits _{i\ne j}{\sum\limits _j{{{\boldsymbol{f}}}_{j,i}\cdot {{\boldsymbol{r}}}_i}}\right \} \end{equation}

which can be further manipulated and simplified using

(1.75)

\begin{equation} \sum\limits _i{{{\boldsymbol{r}}}_i\cdot m_i{\dot {{{{\boldsymbol v}}}}}_i= \frac {\mathrm{d}}{\textrm{d}t}\sum\limits _i{{{\boldsymbol{r}}}_i\cdot m_i{{{{\boldsymbol v}}}}_i-\sum\limits _i{m_i{{{{v}}}}^2_i}}}\equiv \frac {\mathrm{d}}{\textrm{d}t}A-2K, \end{equation}

where K is the total kinetic energy and $A\ \equiv \ \sum\limits _i{{{\boldsymbol{r}}}_i\cdot m_i{{{{\boldsymbol v}}}}_i}$ has units of action, and

(1.76)

\begin{align} \sum\limits _{i\ne j}{\sum\limits _j{{{\boldsymbol{f}}}_{j,i}\cdot {{\boldsymbol{r}}}_i}}&=-\sum\limits _{i\ne j}{\sum\limits _j{{{\boldsymbol{f}}}_{i,j}\cdot {{\boldsymbol{r}}}_i}}=-\sum\limits _{i\ne j}{\sum\limits _j{{{\boldsymbol{f}}}_{j,i}\cdot {{\boldsymbol{r}}}_j}}\nonumber\\[4pt]&=\tfrac{1}{2}\sum\limits _{i\ne j}{\sum\limits _j{{{\boldsymbol{f}}}_{j,i}\cdot ({{\boldsymbol{r}}}_i-}}{{\boldsymbol{r}}}_j) \end{align}

to obtain

(1.77)

\begin{equation} P= \frac {1}{3V}\left \{2K-\frac {\mathrm{d}}{\textrm{d}t}A+\frac{1}{2}\sum\limits _{i\ne j}{\sum\limits _j{{{\boldsymbol{f}}}_{j,i}\cdot ({{\boldsymbol{r}}}_i-}}{{\boldsymbol{r}}}_j)\right \}. \end{equation}

Corollary: The phase-space average $\left\langle {\textrm{d}A}/{\textrm{d}t}\right\rangle =0.$

Proof: ${A} = {(q,p)}$ , then $\langle {\mathrm{d}A}/{\textrm{d}t}\rangle \equiv \smallint {\textrm{d}\varGamma \rho \!\left (q,p\right )}({\mathrm{d}}/{\textrm{d}t})A(q,p)$ where the integral is over the phase-space volume and $\rho $ is the phase-space probability density; and we can generalize to $A(q,p;t)$ . We use

(1.78)

\begin{equation} \frac {\textrm{d}A}{\textrm{d}t}=\frac {\partial A}{\partial t}+\dot {p}\frac {\partial A}{\partial p}+\dot {q}\frac {\partial A}{\partial q}=\frac {\partial A}{\partial t}+\left \{A,H\right \} \end{equation}

and note

(1.79)

\begin{equation} \bigg\langle \frac {\mathrm{d}A}{\textrm{d}t}\bigg\rangle \equiv \frac {\mathrm{d}}{\textrm{d}t}\int\nolimits {\textrm{d}{\boldsymbol \varGamma }\rho \!\left (q,p\right )}A\!\left (q,p\right )-\int\nolimits {\frac {\mathrm{d}}{\textrm{d}t}\!\left (\textrm{d}{\boldsymbol \varGamma }\rho \!\left (q,p\right )\right )}A\!\left (q,p\right ) \end{equation}

as the volume element in phase space may have time dependence. However, we note that ${\mathrm{d}\rho }/{\textrm{d}t}=0$ as a consequence of Liouville’s theorem, and $({\mathrm{d}}/{\textrm{d}t}){\boldsymbol \varGamma }=0$ due to conservation of probability volume (which is not independent of Liouville’s theorem). Hence, $\langle {\mathrm{d}A}/{\textrm{d}t}\rangle ={\mathrm{d}}/{\textrm{d}t}\langle A\rangle$ . Finally, at equilibrium with no explicit time dependence, ${\partial A}/{\partial t}=0$ and then ${\mathrm{d}}/{\textrm{d}t}\langle A\rangle =0\ .$ We can now calculate the phase-space average of (1.77) at equilibrium which becomes

(1.80)

\begin{equation} {\langle} P{\rangle} = \frac {1}{3V}\left \{2\langle K\rangle +\frac{1}{2}\left\langle \sum\limits _{i\ne j}{\sum\limits _j{{{\boldsymbol{f}}}_{j,i}\cdot ({{\boldsymbol{r}}}_i-}}{{\boldsymbol{r}}}_j)\right\rangle \right \}. \end{equation}

Example: $\langle K\rangle =({3}/{2})NT$ which is valid for an ideal or nonideal gas (with interactions), and (1.80) becomes

(1.81)

\begin{equation} \langle P\rangle =nT+\frac {1}{6V}\left\langle \sum\limits _{i\ne j}{\sum\limits _j{{{\boldsymbol{f}}}_{j,i}\cdot ({{\boldsymbol{r}}}_i-}}{{\boldsymbol{r}}}_j)\right\rangle , \end{equation}

where $n=N/V.$

1.2. Classical fluids and other systems

1.2.1. Equation of state and deviations from ideality

Postulate: Consider a general force law of particle j on particle i represented by

(1.82)

\begin{equation} {{\boldsymbol{f}}}_{ij}=-{\hat {{\boldsymbol{r}}}}_{ij}\frac {\partial }{\partial r_i}\phi ({{\boldsymbol{r}}}_{ij}), {{\boldsymbol{r}}}_{ij}\equiv {{\boldsymbol{r}}}_i-{{\boldsymbol{r}}}_j{,} \end{equation}

where $\left ({2}/{3}\right )\langle {K}/{V}\rangle =\left ({N}/{V}\right )T\equiv nT$ . We justify (1.82) based on Newton’s third law and symmetry. Then the equilibrium pressure deduced from (1.81) is

(1.83)

\begin{equation} \langle P\rangle =nT-\frac {1}{6V}\left\langle \sum\limits _{i\ne j}{{\sum\limits _{j}}{\boldsymbol{r}}_{ij}\cdot }{\hat {{\boldsymbol{r}}}}_{ij}\frac {\textrm{d}\phi }{\textrm{d}r_{ij}}\right\rangle = nT-\frac {1}{6V}\sum\limits _{i\ne j}{\sum\limits _j \left\langle {{\boldsymbol{r}}}_{ij}\cdot {\hat {{\boldsymbol{r}}}}_{ij}\frac {\textrm{d}\phi }{\textrm{d}r_{ij}}\right\rangle },\ \end{equation}

commuting the sum over N(N−1)∼N ${}^{2 }$ pairs of interacting particles with the averaging bracket. Hence,

(1.84)

\begin{equation} P= nT-\frac {N^2}{6V}\langle r_{12}\phi^{\prime} (r_{12})\rangle . \end{equation}

The nT term is the kinetic pressure and the second term in (1.84) is the interaction pressure. The average in the interaction pressure is

(1.85)

\begin{equation} \langle r_{12}\phi^{\prime} (r_{12})\rangle \equiv \int\nolimits {\textrm{d}^3r_{12}\rho (}{{\boldsymbol{r}}}_{ij}\to r_{12})\ r_{12}\phi^{\prime} (r_{12}) \end{equation}

due to isotropy in the probability density and because the interaction force depends only on the scalar separation distance. The probability density can be represented as

(1.86)

\begin{equation} \rho \!\left (r_{12}\right )=\frac {g(r_{12})}{V}, \end{equation}

where $g(r_{12})$ is the pair correlation function such that $g(\infty )\to 1$ and $\smallint {g\ \textrm {dvol}=V}$ . Then (1.84) becomes

(1.87)

\begin{equation} P= nT-\frac {N^2}{6V}\langle r_{12}\phi^{\prime} \!\left (r_{12}\right )\rangle =nT-\frac {n^2}{6}\int\nolimits {4\pi r^2_{12}\textrm{d}r_{12}g(r_{12}){r_{12}\phi }^{\prime} (r_{12})}. \end{equation}

If we formulate the energy of a particle (atom or molecule) from first principles by summing the kinetic energy and the potential energy due to interactions over the volume, one obtains

(1.88)

\begin{equation} {\mathcal E}= \frac {3}{2}T+\frac {n}{2}\int\nolimits {\textrm{d}^3r_{12}g(r_{12})\phi (r_{12})}. \end{equation}

We can make some qualitative remarks regarding the dependencies of the pair correlation function g and the interaction potential $\phi$ on $r_{12}$ so that the integral in (1.88) remains well behaved, and the results for P and $\mathcal E$ are physical. Given the constraints g $(\infty )\to 1$ and $\smallint {g\ \textrm {dvol}=V}$ , $\phi (r_{12})$ must fall off faster than $1/r^3_{12}$ as $r_{12}\to \infty$ . For $r_{12}\to 0$ , $g(r_{12})\phi (r_{12})$ cannot diverge as fast as $1/r^3_{12}$ . As a result, excluded are the Coulomb and gravitational potentials $\sim 1/r$ and the dipole–dipole interaction potential $\sim 1/r^3$ .

1.2.2. Virial coefficients and van der Waals potential

Consider a dilute gas with only pair interactions and $g(r_{12})\sim e^{-\beta \phi (r_{12})}$ . Particle 1 interacts with particle 2, and the rest of system acts as a heat bath. At high densities when triplet or higher-order interactions become important, there are corrections to this correlation function. As $r_{12}\to \infty \textrm {,}$ $\phi \to 0$ and $g(\infty )\to 1.$ We can substitute this into (1.87) for P and (1.88) for ${\mathcal E}$ :

(1.89)

\begin{align} P\!\left (n,T\right )&=nT-\frac {2\pi }{3}n^2\int\nolimits ^{\infty }_0{s^2\textrm{d}s\ e^{-\beta \phi \left (s\right )}s\phi^{\prime} \!\left (s\right )+O(n^3)}\nonumber \\[4pt] &= nT+\tfrac{1}{2}n^2T\int\nolimits ^{\infty }_0{\textrm{d}^3{\boldsymbol{s}}(1-e^{-\beta \phi (s)})+O(n^3)}, \end{align}

where $s\equiv r_{12}$ and $T=1/\beta$ , and we have integrated by parts. We can represent the result in the standard form

(1.90)

\begin{equation} \frac {P(n,T)}{T}=n+n^2b_2\!\left (T\right )+n^3b_3\!\left (T\right )+\dots , \end{equation}

where $b_{\ell }\!\left (T\right )$ are ‘virial’ coefficients. In this ‘classical’ example, the second virial coefficient is

(1.91)

\begin{equation} b_2\!\left (T\right )=\tfrac{1}{2}\int\nolimits {\textrm{d}^3{\boldsymbol{s}}(1-e^{-\beta \phi (s)}).} \end{equation}

The second virial coefficient gives information on the interaction potential of the two particles.

Figure 1. Model van der Waals + hard sphere potential.

Example: Van der Waals force + hard sphere – Consider the schematic for the electric potential shown in figure 1.

The repulsive force for r $\lt$ 2r ${}_{0}$ is represented as a hard sphere where 2r ${}_{0}$ is the minimum distance between two hard-sphere centers with r ${}_{0}$ the hard-sphere radius. For this model (1.91) yields

(1.92)

\begin{equation} b_2\!\left (T\right )=\frac{1}{2}\frac {4\pi }{3}{(2r_0)}^3+\frac{1}{2}\beta \int\nolimits ^{\infty }_{2r_0}{\textrm{d}^3s}\ \phi \!\left (s\right )=4V_0-\frac {\alpha }{T}, \end{equation}

where $V_0\equiv ({4\pi }/{3})r^3_0\gt 0$ due to repulsion, $-2\alpha =\smallint\nolimits ^{\infty }_{2r_0}{\textrm{d}^3s}\ \phi \!\left (s\right )$ which is attractive and assumed small, and $1-e^{-\beta \phi \left (s\right )}\approx \beta \phi (s)$ inside the integral. To be consistent with the expansion in (1.89) and (1.90) we require that $nV_0\ll 1.$

Exercise: (i) Show that ${\mathcal E}=({3}/{2})T-n\alpha$ and there is no contribution from the $V_0$ constant term. (ii) Show from $T\textrm{d}{\mathcal S}=\textrm{d}{\mathcal E}+P\textrm{d}{\mathcal V}$ where ${\mathcal V}\equiv {1}/{n}$ that ${\mathcal S}=({5}/{2})-{\text{ln } n{\Lambda}^3-4\!\left ({V_0}/{{\mathcal V}}\right )}$ , where $\Lambda ={h}/{\sqrt {2\pi mT}}$ (iii) Convert to standard van der Waals form:

(1.93)

\begin{equation} \!\left (P+\frac {\alpha }{{{\mathcal V}}^2}\right )\!\left ({\mathcal V}-4V_0\right )=T \end{equation}

1.2.3. Canonical ensemble and the partition function

Any subsystem, micro or macro, in contact with a heat bath at T has the attributes as described in (1.49) and parametrized by number N, volume V, and temperature T. The ensemble of such states is a canonical ensemble. The probability $w_{n}$ and partition function Z are

(1.94)

\begin{equation} w_n=\frac {e^{-\beta E_n}}{Z}; \quad Z\equiv \sum\limits _n{e^{-{\beta }E_n}} \end{equation}

Given the set of probabilities $\left \{w_n\right \}$ let us find $S\!\left \{w_n\right \}$ .

Example: Let ${n} = 1,2,3$ and ${E} = E_{1}$ , ${E}_{2}$ , ${E}_{3}$ , and make $M(M\to \infty )$ measurements. As a matter of definition what we mean by ${w}_i$ is that ${n} = i$ occurs ${w}_{i}$ M times. The number of states for a given number of measurements M is

(1.95)

\begin{equation} {\varGamma }_M=\frac {M!}{\prod\limits _n{\!\left (Mw_n\right )}!}=\frac {M!}{\!\left (w_1M\right )!\!\left (w_2M\right )!\!\left (w_3M\right )!} \end{equation}

and the corresponding entropy is

(1.96)

\begin{align} S_M\equiv \text{ ln } {\varGamma }_M&=M\text{ ln } M-M-\sum\limits _n\left [Mw_n{\text{ ln}(Mw_n})-Mw_n\right ]\nonumber\\&=-M\sum\limits _n{w_n{\text{ ln}(w_n)}}, \end{align}

where we have used $\Sigma _n{w_n=1}$ . From (1.96) we define the entropy associated with making a single measurement on the ensemble of three states in equilibrium with a heat bath:

(1.97)

\begin{equation} S\equiv {\mathop {\lim }_{M\to \infty } \!\left (\frac {S_M}{M}\right )= }-\sum\limits _n{w_n{\text{ ln}(w_n})}. \end{equation}

Example: Suppose all $\varGamma$ states are accessible with equal probability, such that

(1.98)

\begin{equation} w_n=\frac {1}{\varGamma }\quad \textrm {and}\quad S=-\sum\limits ^{\varGamma }_{n=1}{\frac {1}{\varGamma }{\text{ ln}\frac {1}{\varGamma }={\text{ ln } \varGamma }}}. \end{equation}

More generally, the entropy for the canonical ensemble using (1.49) is

(1.99)

\begin{equation} S_{can}=-\sum\limits _n{\frac {1}{Z}e^{-\beta E_n}{\text{ ln}\frac {1}{Z}e^{-\beta E_n}= }}{\text{ ln } Z+\beta \sum\limits _n{w_nE_n={\text{ ln } Z+\beta \langle E\rangle }}}. \end{equation}

For a canonical macroensemble we can convert the sum in the partition function into an integral:

(1.100)

\begin{equation} Z\!\left (\beta \right )=\int\nolimits {\textrm{d}\varGamma e^{-\beta E}=\int\nolimits {\textrm{d}E\frac {\textrm{d}\varGamma }{\textrm{d}E}}} e^{-\beta E}. \end{equation}

Recall that $S_{\mu }\sim S_{\sigma }={\text{ ln } \varGamma }$ from which it follows that $\varGamma =e^{S_{\sigma }}$ and $({\textrm{d}\varGamma }/{\textrm{d}E})=e^{S_{\sigma }}({\textrm{d}S_{\sigma }})/({\textrm{d}E})=\beta e^{S_{\sigma }}$ ; then we obtain

(1.101)

\begin{equation} Z\!\left (\beta \right )= \beta \int\nolimits {\mathrm{d}Ee^{\left (-\beta E+S_{\sigma }\right )}=\beta \int\nolimits {\mathrm{d}Ee^{-\beta \left (E-TS_{\sigma }(E)\right )}}}= \beta \int\nolimits {\mathrm{d}Ee^{-\beta F(E,T)}}, \end{equation}

where F is the free energy and T is the temperature of the heat bath independent of the energy of the system. That energy for which F is a minimum will maximize the partition function. We require the most probable energy E ${}^{ *}$ (T) is that which determines $({\partial F(E\cdot T)})/({\partial E})=0$ . Then we expand the free energy around E ${}^{ *}$ :

(1.102)

\begin{equation} F\!\left (E,T\right )=F\!\left (E^*,T\right )+\frac{1}{2}\delta E^2\frac {{\partial }^2F}{\partial E^2}+\dots . \end{equation}

From (1.101) and (1.102):

(1.103)

\begin{align} &Z\!\left (\beta \right )\approx \beta e^{-F\left (E^*,T\right )}\int\nolimits ^{\infty }_{-\infty }{\mathrm{d}\delta E}e^{-\beta \frac{1}{2}\delta E^2\frac {{\partial }^2F}{\partial E^2}} =\beta e^{-F\left (E^*,T\right )}\sqrt {2\pi }{\sigma }_E\equiv \beta e^{-F\left (E^*,T\right )}\sqrt {2\pi T/F^{\prime\prime}}, \end{align}

where ${\sigma }_E=\sqrt {T/F^{\prime\prime}},$ and from (1.99) and (1.103)

(1.104)

\begin{align} S_{can}&={\text{ ln } Z+\beta \langle E\rangle }={\text{ ln } \beta }-\beta F\!\left (E^*,T\right )+\tfrac {1}{2}{\text{ ln}(2\pi T/F^{\prime\prime})+}\beta \langle E\rangle \nonumber \\[4pt] &\approx -\beta F\!\left (E^*,T\right )+\ \beta \langle E\rangle =-\beta \!\left (E^*-TS_{\sigma }\!\left (E^*\right )\right )+\beta \langle E\rangle , \end{align}

where ${\text{ ln } \beta =O\!\left (1\right ),\ \ \ \beta F\!\left (E^*,T\right )=O\!\left (N\right ),\ \textrm {and}\ }({1}/{2}){\text{ ln}(2\pi T/F^{\prime\prime})=O(1).}$ Note that $F\sim O\!\left (E\right )\sim O\!\left (N\right ),\ \ F^{\prime\prime}\sim O(1/E)\sim O(1/N)$ and ${\text{ ln}\!\left (N\right )}\sim O(1)$ to justify $({1}/{2}){\text{ ln}(2\pi T/F^{\prime\prime})=O(1).}$

One rearranges terms in (1.104) to obtain

(1.105)

\begin{equation} S_{can}=S_{\sigma }\!\left (E^*\right )+\beta \!\left (\langle E\rangle -E^*\right ). \end{equation}

For the canonical ensemble one can calculate $\langle E\rangle$ as a function of $\beta$

(1.106)

\begin{equation} \langle E\rangle =\sum\limits _n{w_nE_n=\frac {1}{Z}\sum\limits _n{e^{-\beta E_n}E_n=-}}\frac {1}{Z}\frac {\partial Z}{\partial \beta }=-\frac {\partial {\text{ ln } Z}}{\partial \beta }. \end{equation}

We note that $F\!\left (E^*,T\right )=F(T)$ because fluctuations about E ${}^{*}$ are small. To O(N) from (1.104)

(1.107)

\begin{equation} {\text{ ln } Z}=-\beta F\!\left (E^*,T\right )=-\beta F\!\left (T\right )\quad \text{and}\quad F\!\left (T\right )=-T{\text{ ln } Z(\beta )}. \end{equation}

For ${\text{ ln } Z(\beta )}\gt 1,$ then $F\!\left (T\right )\lt 0$ . From (1.106) and (1.107) we deduce

(1.108)

\begin{equation} \langle E\rangle =-\frac {\partial {\text{ ln } Z}}{\partial \beta }=\frac {\partial \beta F}{\partial \beta } \quad \textrm{and}\quad \langle E\rangle (T)=E^*(T) \;\textrm{to}\; \textit {O}(\textit {N}). \end{equation}

It also follows that

(1.109)

\begin{equation} S_{\text{can}}\!\left (T\right )={\text{ ln } Z-}\beta \frac {\partial {\text{ ln } Z}}{\partial \beta }=-\frac {\partial F(T)}{\partial T}, \textrm {and hence}\; \textrm{d}F=-S\textrm{d}T. \end{equation}

1.2.4. Quasistatic evolution

We next identify a parameter $\lambda $ in the system in contact with a heat bath and consider slow changes of the parameter:

(1.110)

\begin{equation} w_n(\lambda )=\frac {1}{Z}\sum\limits _n{e^{-\beta E_n(\lambda )}}\quad \mathrm{and}\quad Z\!\left (\beta ,\lambda \right )=\sum\limits _n{e^{-{\beta }E_n(\lambda )}}. \end{equation}

For a slow and small change of $\lambda$ the total entropy does not change:

(1.111)

\begin{equation} \Delta (S_{\mathrm{system}}+S_{\mathrm{heat\;bath}})=0\quad\mathrm{and}\quad \Delta (S_{\mathrm{system}})=-\Delta (S_{\mathrm{heat\;bath}})=-{\left .\frac {\textrm{d}S}{\textrm{d}E}\right .}\bigg \vert _{\textrm{bath}}\Delta E_{\textrm{bath}}. \end{equation}

Using ${({\textrm{d}S})/({\textrm{d}E})}\vert _{\textrm{bath}}={1}/{T_{\textrm{bath}}}$ and $\Delta E_{\textrm{bath}}=-Q$ , where Q is the energy/heat input into the system, then

(1.112)

\begin{equation} \Delta S_{\mathrm{system}}=\frac {Q}{T}. \end{equation}

The system energy $E(S,\lambda )$ accrues a small change due to $\Delta \lambda$

(1.113)

\begin{equation} \Delta E=T\Delta S+\Lambda \Delta \lambda =Q+\Lambda \Delta \lambda . \end{equation}

Definition: Let $R\equiv \Delta E-Q=\Lambda \Delta \lambda$ where $\Lambda =(\partial E)/(\partial \lambda) \vert _S.$

From the definition of the free energy

(1.114)

\begin{align} F\equiv E-TS \to \Delta F&=\Delta E-T\Delta S-S\Delta T= T\Delta S+\Lambda \Delta \lambda -T\Delta S-S\Delta T\nonumber\\[4pt] &=\Lambda \Delta \lambda =R,\ \end{align}

because $\Delta T=0$ due to the contact with the heat bath. We can now make some general remarks regarding systems that are either thermally isolated or in contact with a heat bath (table 1). We note that if there are no changes in the system parameters, a system in contact with a heat bath experiences no change in S and E; and the heat bath is superfluous.

1.2.5. Mode counting, classical versus quantum systems

Example: N identifiable microsystems, e.g., N weakly interacting harmonic oscillators

(1.115)

\begin{equation} Z_N\!\left (\beta \right )=\sum\limits _{n=\left \{n_i\right \}}{e^{-\beta E_n}\ \ \ \ \textrm {and}\ \ \ \ E_{\left \{n_i\right \}}=}\sum\limits ^N_{n_i=1}{{{\mathcal E}}^{(i)}_{n_i}}, \end{equation}

where $n_i$ can be thought of as the quantum numbers for the energy levels. The partition function becomes

(1.116)

\begin{equation} Z=\sum\limits _{n_1}{\sum\limits _{n_2}{\dots \sum\limits _{n_N}{e^{-\beta \left ({{\mathcal E}}^{(1)}_{n_1}+{{\mathcal E}}^{(2)}_{n_2}+\dots +{{\mathcal E}}^{(N)}_{n_N}\right )}=\prod\limits^N_{i=1}{\sum\limits _{n_i}{e^{-\beta {{\mathcal E}}^{(i)}_{n_i}}=\prod\limits^N_{i=1}{Z_i}}}}}}. \end{equation}

Hence, ${\text{ ln } Z=\Sigma _i{{\textrm {ln}\ Z}_i}}.$ In the special case where all N subsystems have the same properties, $Z={(Z_1)}^N$ and $\text{ ln } Z=N{\text{ ln } Z_1}$ .

Example: A classical gas or fluid consisting of N indistinguishable particles

(1.117)

\begin{equation} Z_N(\beta ,V)\equiv \int\nolimits _V{\frac {\textrm{d}^{3N}p\textrm{d}^{3N}q}{h^{3N}N!}e^{-\beta H(p,q)}}, \end{equation}

(1.118)

\begin{equation} H\!\left (p,q\right )\to \sum\limits ^N_{i=1}{\frac {p^2_i}{2m}+\Phi (\left \{r_i\right \})}. \end{equation}

Using (1.118) in (1.117) one obtains

(1.119)

\begin{equation} Z_N\!\left (\beta ,V\right )=\frac {1}{N!}{\left [V\int\nolimits {\frac {\textrm{d}^3p}{h^3}e^{-\beta \frac {p^2}{2m}}}\right ]}^N\left [\int\nolimits {\frac {\textrm{d}^3r^{(N)}}{V^N}e^{-\beta \Phi (\left \{r\right \})}}\right ] \equiv \frac {1}{N!}{\left (\frac {V}{{\Lambda}^3\!\left (\beta \right )}\right )}^NQ_N(\beta ), \end{equation}

where $Q_N\!\left (\beta \right )\equiv [(\smallint {\textrm{d}q^N}/{V^N})e^{-\beta \Phi (\{r\})}]$ is the configurational partition function independent of volume.

Table 1. Adiabatically evolving systems.

For an ideal gas $\Phi (\{r\})\to 0$ and $Q_N\!\left (\beta \right )=1$ ; hence, $Z_N\!\left (\beta ,V\right )=$ $({1}/{N!}){\left ({V}/{{\Lambda}^3\!\left (\beta \right )}\right )}^N.$

Exercise: Show the specific free energy is given by $f={F}/{N}=T\!\left ({\text{ln } n{\Lambda}^3-1}\right )$ , ${\mathcal E}=({3}/{2})T$ , and ${\mathcal S}=({5}/{2})-{\text{ ln } n{\Lambda}^3}$ . (Recall (1.47) and (1.92), and the exercise following (1.92) in the limit $\Phi \to 0$ .)

Example: Single harmonic oscillator ( $\ell )$ with quantized energy levels

Energy levels:

(1.120)

\begin{equation} E^{\ell }_n=\hslash {\omega }_{\ell }n+E_0,\ \ E_0=\tfrac{1}{2}\hslash {\omega }_{\ell }, \end{equation}

(1.121)

\begin{equation} Z_{\ell }\!\left (\beta \right )=\sum\limits ^{\infty }_{n=0}{e^{-\beta \hslash \omega_{\ell }n}=1+e^{-x}+e^{-2x}+\dots =\frac {1}{1-e^{-x}}=\frac {1}{1-e^{-\beta \hslash {\omega }_{\ell }}}}. \end{equation}

We could compare the energy spectrum for the quantum harmonic oscillator in (1.120) to a few continuous medium systems.

1. Vibrating string: $\lambda ={2L}/{\ell },\ \ \ell =1,2,3,\dots .$
2. Drumhead.
3. Water waves, e.g., one-dimensional gravity waves in a narrow channel, three-dimensional ocean waves (surface gravity waves, internal waves, etc.).
4. Electromagnetic waves, e.g., free space ( $\omega =kc)$ , wave-guide modes, cavity modes.
5. Plasma waves, e.g., electromagnetic waves ( ${\omega }_k=\sqrt {k^2c^2+{\omega }^2_p}),$ longitudinal waves, etc.
6. Fluid sound waves: ${\omega }_k=kc_s, c_s=\sqrt {{\gamma P}/{\rho }}$ .
7. Waves in a solid: longitudinal sound wave ${\omega }_k=kc_{\ell }$ , transverse shear wave ${\omega }_k=kc_t$ .

In order to calculate the partition function and the statistical properties of any of these systems, one must properly count the distinct modes. Here are two illustrative examples.

(a) One-dimensional standing waves with nodes at x = 0 and x = L: $U\!\left (x,t\right )=$ $A\,\textrm {sin}(kx){\sin(\omega t)},\, \omega \gt 0,\, {\lambda }/{2}={L}/{\ell },\, k={2\pi }/{\lambda }$ and $\Sigma ^{\infty }_{\ell =1}{\to}\smallint\nolimits ^{\infty }_0\mathrm{d}\ell =$ $({L}/{\pi})\smallint\nolimits ^{\infty }_0{\textrm{d}k}$ .
(b) One-dimensional traveling waves with periodic boundary conditions at x = 0 and x = L: $U\!\left (x,t\right )=A{\sin \!\left (kx-\omega t\right )},\ \omega \gt 0,\ \lambda ={L}/{\ell },\ k={2\pi }/{\lambda }$ and $\Sigma ^{\infty }_{\ell =-\infty }{\to }\smallint\nolimits ^{\infty }_{-\infty }{\mathrm{d}\ell =({L}/{2\pi })}\smallint\nolimits ^{\infty }_{-\infty }{\textrm{d}k}$ . If the traveling wave spectrum is symmetric with respect to positive and negative k, then $({L}/{2\pi })\smallint\nolimits ^{\infty }_{-\infty }{\textrm{d}k}\to ({L}/{\pi})\smallint\nolimits ^{\infty }_0{\textrm{d}k}$ .

In three spatial dimensions $\Sigma _{\text{modes}}{\to {(2/\pi)}^3}\smallint {\textrm{d}^3k}$ .

Example: Classical noninteracting oscillators with $H_{\ell }\!\left (J_{\ell }\right )=J_{\ell }{\omega }_{\ell }$

(1.122)

\begin{equation} Z_{\ell }\!\left (\beta \right )=\int\nolimits {\frac {\textrm{d}p\textrm{d}q}{h}}e^{-\beta H(p,q)}=\frac {2\pi }{h}\int\nolimits ^{\infty }_0{\textrm{d}J_{\ell }e^{-\beta J_{\ell }{\omega }_{\ell }}=\frac {2\pi }{h{\beta \omega }_{\ell }}}=\frac {T}{\hslash {\omega }_{\ell }}. \end{equation}

We note that the result (1.121) for the partition function for the quantized harmonic oscillator recovers the classical limit in (1.122) in the limit $\hslash {\omega }_{\ell }\ll T$ :

(1.123)

\begin{equation} Z_{\ell }{(\beta )}_{\textrm{quantum}}={\mathop {\lim }_{\hslash {\omega }_{\ell }\ll T} \ \frac {1}{1-e^{-\beta \hslash {\omega }_{\ell }}}\to }\frac {T}{\hslash {\omega }_{\ell }}. \end{equation}

From (1.123) we can calculate the average energy per mode:

(1.124)

\begin{equation} \langle E_{\ell }\rangle \!\left (T\right )=-\frac {\partial {\text{ ln } Z}}{\partial \beta }=\frac {\hslash {\omega }_{\ell }}{e^{\beta \hslash {\omega }_{\ell }}-1}, \end{equation}

which yields $\langle E_{\ell }\rangle \!\left (T\right )\to T$ in the classical limit and recovers the Rayleigh–Jeans classical result. For black-body radiation, each of the infinite number of modes has energy T in the classical limit which leads to an infinite total energy when summing over all of the modes, i.e., the ultraviolet catastrophe!

1.2.6. Electromagnetic modes and interaction of particles

Consider electromagnetic waves in an unmagnetized plasma. The dispersion relation for transverse waves in an unmagnetized plasma is

(1.125)

\begin{equation} {\omega }^2_k=k^2c^2+{\omega }^2_p \end{equation}

and the total average wave energy summing over modes is

(1.126)

\begin{equation} W=\sum\limits _{\ell }{\frac {\hslash {\omega }_{\ell }}{e^{\beta \hslash {\omega }_{\ell }}-1}=2V}\int\nolimits {\frac {\textrm{d}^3k}{{(2\pi )}^3}\frac {\hslash {\omega }_{\ell }}{e^{\beta \hslash {\omega }_{\ell }}-1}}. \end{equation}

where the factor 2 in front of the integral in (1.126) takes into account the sum over right and left circularly polarized waves. The energy density derived from (1.126) is

(1.127)

\begin{equation} \frac {W}{V}=2\int\nolimits ^{\infty }_0{\frac {4\pi k^2\mathrm{d}k}{{(2\pi )}^3}}\frac {\hslash {\omega }_{\ell }}{e^{\beta \hslash {\omega }_{\ell }}-1}=\textrm {fn}(T,{\omega }_p,\ \hslash c). \end{equation}

Definition: The Wien wavelength and its inverse $k_w$ are defined by $\overline {\lambda }={1}/{k_w}={\hslash c}/{T}$ .

In the limit that the plasma density vanishes ${\omega }_p\to 0$ , then the right-hand side of (1.127) becomes

(1.128)

\begin{equation} \frac {W}{V}=\frac {{\pi}^2}{15}\frac {T}{{\overline {\lambda }}^3}=4\frac {\sigma T^4}{c},\quad \sigma =\frac {{\pi}^2}{60}\frac {c}{{\!\left (\hslash c\right )}^3}. \end{equation}

(1.128) is the Stefan–Boltzmann law.

We note that Jackson (Reference Jackson1975) showed that the wave energy density is related to the spatially averaged magnetic field energy density ${\langle B^2\rangle }/{8\pi }$ by the relation

(1.129)

\begin{equation} \frac {W}{V}=\frac {\langle B^2\rangle }{4\pi }\frac {1}{\epsilon }, \end{equation}

where $\epsilon$ is the longitudinal plasma dielectric function; $\epsilon =({k^2c^2}/{{\omega }^2})=1-({{\omega }^2_p}/{{\omega }^2})$ in a cold plasma. For a wave packet the energy flux density is the product of the wave energy density and the group velocity ${{{{v}}}}_g=({\textrm{d}\omega }/{\textrm{d}k})=({kc^2}/{\omega })$ .

Consider electromagnetic traveling waves in a system with a finite volume and periodic boundary conditions with magnetic field represented by

(1.130)

\begin{equation} {\boldsymbol{B}}\!\left ({\boldsymbol{x}},t\right )=\sqrt {2}\sum\limits _{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}{B_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}\hat {{\boldsymbol{e}}}\;{\sin }({\boldsymbol{k}}\cdot {\boldsymbol{x}}-{\omega }_{{\boldsymbol{k}}}t+{\alpha }_{{\boldsymbol{k}}})}, \end{equation}

where the $B_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}$ are real, and the average energies per mode are given by (1.124) which yields $\langle E_{\ell }\rangle \!\left (T\right )\to T$ for ${\hslash \omega }_{\ell }\ll T$ . Equation (1.130) might model waves emitted by Bremsstrahlung. If we calculate the ensemble or spatial average of ${\vert {\boldsymbol{B}}\vert }^2\!\left ({\boldsymbol{x}},t\right )$ we eliminate phases so that $B_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}$ is real:

(1.131)

\begin{equation} \langle {\vert {\boldsymbol{B}}\vert }^2\!\left ({\boldsymbol{x}},t\right )\rangle =\sum\limits _{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}{{B_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}}^2}. \end{equation}

From (1.129) the wave energy density is then

(1.132)

\begin{equation} \frac {W_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}}{V}=\frac {\langle {\vert {\boldsymbol{B}}\vert }^2\!\left ({\boldsymbol{x}},t\right )\rangle }{4\pi }\frac {{\omega }^2_k}{k^2c^2} \to \langle {\vert \boldsymbol{B}}\vert ^2\!\left ({\boldsymbol{x}},t\right )\rangle =\sum\limits _{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}{\frac {4\pi k^2c^2}{{\omega }^2_k}\frac {\langle {W}_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}\rangle }{V}}. \end{equation}

With the assumption of thermal equilibrium and doing statistical averages (1.126) and (1.132) yield

(1.133)

\begin{align} \frac {\langle B^2\rangle }{8\pi }&=\frac {1}{2V}\sum\limits _{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}{\frac {k^2c^2}{{\omega }^2_k}}\frac {\hslash {\omega }_k}{e^{\beta \hslash {\omega }_k}-1}=\int\nolimits {\frac {\textrm{d}^3k}{{(2\pi )}^3}}\frac {k^2c^2}{{\omega }^2_k}\frac {\hslash {\omega }_k}{e^{\beta \hslash {\omega }_k}-1}\nonumber\\[4pt]&=\frac {T^4}{2{\pi}^2{(\hslash c)}^3}\int\nolimits ^{\infty }_{\alpha }{\textrm{d}x\frac {{(x^2-{\alpha }^2)}^{3/2}}{e^x-1}}, \end{align}

where ${\omega }^2_k=k^2c^2+{\omega }^2_p,$ ${k^2c^2}/{{\omega }^2_k}=1$ in vacuum, $x={\hslash {\omega }_k}/{T}$ , and $\alpha ={\hslash {\omega }_p}/{T}$ . For $\alpha ={\hslash {\omega }_p}/{T}\ll 1$ the last integral on the right-hand side of (1.133) yields ${{\pi}^4}/{15}$ which is the result for classical black-body radiation (1.128). For $\alpha ={\hslash {\omega }_p}/{T}\gg 1$ the integral yields $3\sqrt ({{\pi}/{2}}){\alpha }^{3/2}e^{-\alpha }$ which implies that the magnetic energy is exponentially small for $T\to 0$ accompanying a coalescence of the photons in the ground state as the entropy likewise goes to zero (Nernst theorem).

Exercise: For a d-dimensional medium supporting normal modes with ${\omega }_k\sim k^p$ with $p\gt 0$ , e.g., ${p}= 1/2$ for water waves, ${p} = 1$ for sound waves, and ${p }= 2$ for a de Broglie matter wave, find the specific heat C ∼ T ${}^{ q}$ . The specific heat capacity is defined as $C=T{\partial {\mathcal S}}/{\partial T}$ and recall that $\mathcal S$ is the specific entropy. Use (1.99) to evaluate the entropy in terms of the partition function and the examples in § 1.2.5 as a template to calculate the partition function.

We now extend the analysis to an electromagnetic plasma with applied fields. Consider a set of charged particles interacting with a given external field, e.g., $\left \{\phi _0({\boldsymbol{x}},t\right \},{{\boldsymbol{A}}}_0({\boldsymbol{x}},t)\}$ with Lagrangian given by (Jackson Reference Jackson1975, ch. 12)

(1.134)

\begin{align} L\left \{{{\boldsymbol{r}}}_i,{{{{\boldsymbol v}}}}_i;\phi _0,{{\boldsymbol{A}}}_0\right \}&=\sum\limits ^N_{i=1}{\frac{1}{2}m_i{{{{v}}}}^2_i}-\sum\limits ^N_{i=1}{e_i}\phi _0\!\left ({{\boldsymbol{r}}}_i,t\right )+\sum\limits ^N_{i=1}{\frac {e_i}{c}{{{{\boldsymbol v}}}}_i\cdot }{{\boldsymbol{A}}}_0\!\left ({{\boldsymbol{r}}}_i,t\right )\nonumber\\[4pt]&\quad -\sum\limits _{i\lt j}{\frac {e_ie_j}{r_{ij}}}. \end{align}

The equations of motion determined by the Euler–Lagrange equations are

(1.135)

\begin{equation} m_i{\dot {{{{\boldsymbol v}}}}}_i=e_i\bigg[{{\boldsymbol{E}}}_0\!\left ({{\boldsymbol{r}}}_i,t\right )+\frac {1}{c}{{{{\boldsymbol v}}}}_i\times {{\boldsymbol{B}}}_0\!\left ({{\boldsymbol{r}}}_i,t\right )\bigg]+e_i\sum\limits _j{e_j\bigg(\!-{\nabla }_i\frac {1}{r_{ij}}\bigg)}. \end{equation}

We can further expand the expressions in (1.134) and (1.135) to include an internal electromagnetic field (also incorporating retarded time). To add a radiation field to the Lagrangian we posit (via guesswork or covariance arguments, $\{{\boldsymbol{E}}\cdot {\boldsymbol{B}}, E^2-B^2\}$ ) (Galloway & Kim Reference Galloway and Kim1971)

(1.136)

\begin{equation} L\{{{\boldsymbol{r}}}_i,{{{{\boldsymbol v}}}}_i;{{\boldsymbol{A}},\dot {{\boldsymbol{A}};}\phi }_0,{{\boldsymbol{A}}}_0 \}=L\left \{{{\boldsymbol{r}}}_i,{{{{\boldsymbol v}}}}_i;\phi _0,{{\boldsymbol{A}}}_0\right \}+\int\nolimits {\frac {\textrm{d}^3{\boldsymbol{x}}}{8\pi }}{\!\left (E^2_{\mathrm{rad}}-B^2_{\mathrm{rad}}\right )} + \dots , \end{equation}

where ${{\boldsymbol{E}}}_{\mathrm{rad}}\!\left ({\boldsymbol{x}},t\right )=-({1}/{c})\dot {{\boldsymbol{A}}}({\boldsymbol{x}},t)$ and ${{\boldsymbol{B}}}_{\mathrm{rad}}\!\left ({\boldsymbol{x}},t\right )=-\nabla \times {\boldsymbol{A}}\!\left ({\boldsymbol{x}},t\right )$ in Coulomb (transverse) gauge, $\nabla \cdot {\boldsymbol{A}}=0$ . After a little bit of guesswork and checking, the complete Lagrangian including radiation fields for a plasma in the presence of applied and internal electromagnetic fields is

(1.137)

\begin{align} L\{{{\boldsymbol{r}}}_i,{{{{\boldsymbol v}}}}_i;{{\boldsymbol{A}},\dot{\boldsymbol{A}};\;\phi }_0,{{\boldsymbol{A}}}_0\}&=L\left \{{{\boldsymbol{r}}}_i,{{{{\boldsymbol v}}}}_i;\phi _0,{{\boldsymbol{A}}}_0\right \}+\int\nolimits {{\frac {\textrm{d}^3\textit{x}}{8\pi }}}\!\left (\frac {1}{c^2}{\vert \dot {{\boldsymbol{A}}}({\boldsymbol{x}})\vert }^2-{\vert \nabla \times {\boldsymbol{A}}({\boldsymbol{x}})\vert }^2\right )\nonumber \\[4pt]&\quad +\frac {1}{c}\int\nolimits {\textrm{d}^3\textit{x}\,{\boldsymbol{j}}({\boldsymbol{x}},\left \{{{\boldsymbol{r}}}_i,{{{{\boldsymbol v}}}}_i\right \})\cdot {\boldsymbol{A}}},\qquad \end{align}

where

(1.138)

\begin{equation} {\boldsymbol{j}}\!\left ({\boldsymbol{x}},\left \{{{\boldsymbol{r}}}_i,{{{{\boldsymbol v}}}}_i\right \}\right )\equiv \sum\limits _i{e_i}{{{{\boldsymbol v}}}}_i\delta ({\boldsymbol{x}}-{{\boldsymbol{r}}}_{{\boldsymbol{i}}}). \end{equation}

The current j can be decomposed into a sum of longitudinal (curl free) and transverse (divergence free) terms ${\boldsymbol{j}}={{\boldsymbol{j}}}^{\ell }+{{\boldsymbol{j}}}^t$ . Only the transverse part contributes to (1.138) as a consequence of the following result:

(1.139)

\begin{equation} \int\nolimits {\textrm{d}^3\textit{x}\nabla \varphi \cdot {\boldsymbol{A}}=0=}-\int\nolimits {\textrm{d}^3\textit{x}\varphi \nabla \cdot {\boldsymbol{A}}=}0\quad \textrm {where}\quad \nabla \cdot {\boldsymbol{A}}=0. \end{equation}

We decompose j into longitudinal and transverse pieces by calculating $\nabla \cdot {\boldsymbol{j}}$ and $\nabla \times {\boldsymbol{j}}$ , and then inverting scalar and vector Poisson equations.

Exercise: Work out the Euler–Lagrange equations for the particles using (1.137) to find

(1.140)

\begin{align} {m}_i{\dot {{{{\boldsymbol v}}}}}_i&=e_i\bigg [{{\boldsymbol{E}}}_0\!\left ({{\boldsymbol{r}}}_i,t\right )+\frac {1}{c}{{{{\boldsymbol v}}}}_i\times {{\boldsymbol{B}}}_0\!\left ({{\boldsymbol{r}}}_i,t\right )\bigg ]+e_i\sum\limits _j{e_j\!\left (-{\nabla }_i\frac {1}{r_{ij}}\right )}\nonumber \\[4pt] &\quad +e_i\bigg [{{\boldsymbol{E}}}_{\mathrm{rad}}\!\left ({{\boldsymbol{r}}}_i,t\right )+\frac {1}{c}{{{{\boldsymbol v}}}}_i\times {{\boldsymbol{B}}}_{\mathrm{rad}}\!\left ({{\boldsymbol{r}}}_i,t\right )\bigg ]. \end{align}

Definition: Define the functional derivatives (introducing bars over the partial signs $\overline {\boldsymbol\partial }$ )

(1.141)

\begin{equation} {\boldsymbol \Pi }\equiv \frac {\overline {\boldsymbol\partial }{\boldsymbol{L}}}{\overline {\boldsymbol\partial }\dot {{\boldsymbol{A}}}{\mathbf (}{\boldsymbol{x}}{\mathbf )}}\equiv \frac {1}{4\pi c^2}\dot {{\boldsymbol{A}}}\!\left ({\boldsymbol{x}}\right )=-\frac {1}{4\pi }{{\boldsymbol{E}}}_{\mathrm{rad}}\quad \textrm {and}\quad \dot {{\boldsymbol \Pi }}\!\left ({\boldsymbol{x}}\right )\equiv \frac {\overline {\boldsymbol\partial }{\boldsymbol{L}}}{\overline {\boldsymbol\partial }{\boldsymbol{A}}\!\left ({\boldsymbol{x}}\right )}=-\frac {1}{4\pi c}{\dot {{\boldsymbol{E}}}}_{\mathrm{rad}}\!\left ({\boldsymbol{x}}\right )\!. \end{equation}

These are used in recovering Maxwell’s equations from the Euler–Lagrange equations applied to (1.137). For example, from the term $\smallint {\textrm{d}^3\textit {x}}{\vert \nabla \times {\boldsymbol{A}}\vert }^2$ one forms $2\smallint {\textrm{d}^3\textit {x}}{\nabla \times {\boldsymbol{A}}\cdot \nabla \times \delta {\boldsymbol{A}}}\to 2\smallint {\textrm{d}^3\textit{x}}\;\delta {\boldsymbol{A}}\cdot {\nabla \times \!\left (\nabla \times {\boldsymbol{A}}\right )=}2\smallint {\textrm{d}^3\textit {x}}\;\delta {\boldsymbol{A}}\cdot ({4\pi }/{c}){{\boldsymbol{j}}}$ from which $\dot {{\boldsymbol \Pi }}\!\left ({\boldsymbol{x}}\right )=-({1}/{4\pi c}){\dot {{\boldsymbol{E}}}}_{\mathrm{rad}}\!\left ({\boldsymbol{x}}\right )=-({1}/{4\pi })\nabla \times {{\boldsymbol{B}}}_{{\textrm{rad}}}{\mathbf +}({{1}}/{{{{c}}}}){{\boldsymbol{j}}}^{\textit {t}}$ . In summary, the Lagrangian in (1.137) recovers the correct Maxwell equations:

(1.142)

\begin{eqnarray} {{\boldsymbol{E}}}_{\mathrm{rad}}=-\frac {1}{c}{\dot {{\boldsymbol{A}}}}_{\mathrm{rad}}, \nabla \times {{\boldsymbol{B}}}_{\mathrm{rad}}-\frac {1}{c}{\dot {{\boldsymbol{E}}}}_{\mathrm{rad}}=\frac {4\pi }{c}{{\boldsymbol{j}}}^t\quad \textrm{and}\qquad\quad \nonumber \\[4pt] \nabla \cdot \!\left (\nabla \times {{\boldsymbol{B}}}-\frac {1}{c}{\dot {{\boldsymbol{E}}}}=\frac {4\pi }{c}{{\boldsymbol{j}}}\right ) \to 0= {\dot {{\boldsymbol{E}}}}+4\pi {{\boldsymbol{j}}}\to \nabla \cdot {\boldsymbol{E}}=4\pi \rho , \end{eqnarray}

where we have made use of charge continuity: $\dot {\rho }+\nabla \cdot {\boldsymbol{j}}=0$ .

Definition: The canonical momentum in an electromagnetic field is defined by

(1.143)

\begin{equation} {{\boldsymbol{p}}}_i\equiv \frac {\partial L}{\partial {{{{\boldsymbol v}}}}_{{\boldsymbol{i}}}}=m_i{{{{\boldsymbol v}}}}_i+\frac {e_i}{c}\left [{{\boldsymbol{A}}}_0\!\left ({{\boldsymbol{r}}}_i,t\right )+{\boldsymbol{A}}\!\left ({{\boldsymbol{r}}}_i,t\right )\right ] \end{equation}

and as noted in (1.141)

(1.144)

\begin{equation} {\boldsymbol \Pi }\!\left ({\boldsymbol{x}}\right )\equiv \frac {\partial L}{\partial {\boldsymbol{A}}\!\left ({\boldsymbol{x}}\right )}=\frac {1}{4\pi c^2}\dot {{\boldsymbol{A}}}\!\left ({\boldsymbol{x}}\right )=-\frac {1}{4\pi c}{\boldsymbol{E}}\!\left ({\boldsymbol{x}}\right ). \end{equation}

Recalling the definitions in (1.141), the Hamiltonian implied by the Lagrangian in (1.137) is

(1.145)

\begin{align} H&=\sum\limits _i{{{\boldsymbol{p}}}_i\cdot }{{{{\boldsymbol v}}}}_i+\int\nolimits {\textrm{d}^3\textit {x}{\boldsymbol \Pi }\!\left ({\boldsymbol{x}}\right )}\cdot \dot {{\boldsymbol{A}}}\!\left ({\boldsymbol{x}}\right )-L\nonumber\\[4pt] &= \sum\limits _i{\frac{1}{2}}m_i{{{{v}}}}^2_i+\sum\limits _{i\lt j}{\frac {e_ie_j}{r_{ij}}}+\sum\limits _i{e_i}\phi _0\!\left ({{\boldsymbol{r}}}_i,t\right )+\int\nolimits {\frac {\textrm{d}^3{\boldsymbol{x}}}{8\pi }\ \!\left (E^2_{\mathrm{rad}}+B^2_{\mathrm{rad}}\right )}\nonumber \\[4pt]&=K+C+R, \end{align}

where $K\equiv$ $\Sigma _i({{1}/{2}})m_i{{{{v}}}}^2_i,\, C\equiv \Sigma _{i\lt j}({ {e_ie_j}/{r_{ij}}})+\Sigma _i{e_i}\phi _0({{\boldsymbol{r}}}_i,t), R\equiv \smallint \textrm{d}^3 \boldsymbol{x} (E_{\textrm{rad}}^2 + B_{\textrm{rad}}^2)/$ $(8\pi).$ We note that there is no magnetic interaction energy in the Hamiltonian.

Exercise: Calculate $\dot {p}=-{\partial H}/{\partial q}$ , $\dot {q}={\partial H}/{\partial p}$ , and recover Maxwell’s equations. The generalized momentum and Maxwell’s equation in (1.141), (1.142), and (1.143) have been calculated already from the Lagrangian in (1.137).

The classical partition function for an electromagnetic plasma with applied field is given by

(1.146)

\begin{equation} Z(\beta ,{{{v}}};N_s;\phi _0,{\textrm {A}}_0)\equiv \int\nolimits {{\prod }_{\textit{x}}\frac {\textrm{d}{\boldsymbol \Pi }\!\left ({\boldsymbol{x}}\right )\textrm{d}{\boldsymbol{A}}({\boldsymbol{x}})}{h}\frac {1}{\prod\limits _s{N_s!}}\prod\limits ^N_{i=1}{\int\nolimits {\frac {\textrm{d}^3{{\boldsymbol{p}}}_i\textrm{d}^3{{\boldsymbol{r}}}_i}{h^3}}}}e^{-\beta H}. \end{equation}

It is convenient to transform coordinates from $\!\left ({{\boldsymbol{p}}}_i,{{\boldsymbol{r}}}_i,{\boldsymbol{A}},\dot {{\boldsymbol{A}}}\right )\to (m_i{{{{\boldsymbol v}}}}_i,{{\boldsymbol{r}}}_i,{\boldsymbol{A}},\dot {{\boldsymbol{A}}})$ . Some coordinate transformations will require the introduction of a noncanonical transformation. In general, the new volume element is related to the old volume element by a Jacobian factor, which is one for a canonical transformation and must be evaluated for a noncanonical transformation. The Jacobian matrix and its determinant are generally

(1.147)

\begin{equation} {{\boldsymbol{J}}}_{ij}\equiv \frac {\partial x_{\textrm{new},i}}{\partial x_{\textrm{old},j}}\quad \textrm {and}\quad J\equiv \text{det}\left [{{\boldsymbol{J}}}_{ij}\right ]. \end{equation}

In this case ${{\boldsymbol{p}}}_i\to m_i{{{{\boldsymbol v}}}}_i$ simply. The partition function can be recast as

(1.148)

\begin{align} Z&=\left [V^N\frac {\int {\prod\nolimits_i{\textrm{d}}^3{{{{\boldsymbol v}}}}_im_ie^{-\beta K}}}{h^{3N}N_s!}\right ]\left [\int \prod\limits_i\frac {\textrm{d}^3{{\boldsymbol{r}}}_i}{V}e^{-\beta \phi }\right ]\left [\int {\prod\limits_{{\boldsymbol{x}}}\frac {\textrm{d}{\boldsymbol{A}}\textrm{d}{\boldsymbol \Pi }}{h}e^{-\beta R}}\right ]\nonumber \\&\equiv Z_{\mathrm{kinetic}}(\beta ,V)Z_{\mathrm{config}}(\beta ,\phi )Z_{\mathrm{rad}}(V,\beta ). \end{align}

Note that the dependence on A ${}_{0}$ has vanished. The kinetic component of the partition function becomes (for an ideal gas)

(1.149)

\begin{equation} Z_{\mathrm{kinetic}}(\beta ,V)=\prod\limits_s{\frac {1}{N_s!}{\left [\frac {V}{{\Lambda}^3_s(\beta )}\right ]}^{N_s}}. \end{equation}

The configuration component of the partition function becomes

(1.150)

\begin{equation} Z_{\mathrm{config}.}\!\left (\beta ,\phi \right )=\int{\prod\limits_i}\frac {\textrm{d}^3{{\boldsymbol{r}}}_i}{V}e^{-\beta \left [\sum\limits _i{e_i}\phi _0 \left ({{\boldsymbol{r}}}_i,t\right )+\sum\limits _{i\lt j}{\frac {e_ie_j}{r_{ij}}}\right ]} \end{equation}

Note that ${e_ie_j}/{r_{ij}}$ becomes divergent as $r_{ij}\to 0$ , which could affect attracting charge pairs, and requires a cutoff at the quantum limit.

Consider a Fourier series representation of the electromagnetic vector potential:

(1.151)

\begin{equation} {\boldsymbol{A}}\!\left ({\boldsymbol{x}},t\right )=\sum\limits _{{\boldsymbol{k}}}{\sum\limits _{\hat {{\boldsymbol{e}}}}{{{\boldsymbol{A}}}_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}{\textrm {(}t\textrm {)}e}^{i{\boldsymbol{k}}\cdot {\boldsymbol{x}}}}}, \end{equation}

where x takes on a continuum of values, the sum over k is denumerably infinite, $\hat{{\boldsymbol{e}}}$ are the two polarization states orthogonal to $\hat {{\boldsymbol{k}}}$ , and we assume that ${{\boldsymbol{A}}}_{-{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}={{{\boldsymbol{A}}}}^*_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}$ so that the sum over k is over a half-space (which we denote by $\Sigma _{{\boldsymbol{k}}}{'})$ . The radiation component of the Lagrangian in (1.137) becomes

(1.152)

\begin{equation} L_{\mathrm{rad}}=\sum\limits_{{\boldsymbol{k}}}'{\sum\limits_{\hat {{\boldsymbol{e}}}}{\frac {V}{4\pi c^2}\left \{{\vert {\;\,\dot{{\kern-3pt\boldsymbol{A}}}}_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}\vert }^2-k^2c^2{\vert {{\boldsymbol{A}}}_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}\vert }^2\right \}}}. \end{equation}

Given (1.152), the following two functional derivative expressions are independent:

(1.153)

\begin{equation} \Pi_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}\equiv \frac {\partial L_{\mathrm{rad}}}{\partial {\;\;\dot{{\kern-5pt\boldsymbol{A}}}}_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}}=\frac {V}{4\pi c^2}{\dot {\;\;{\kern-5pt\boldsymbol{A}}}}^*_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}, \Pi^*_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}\equiv \frac {\partial L_{\mathrm{rad}}}{\partial {\dot {\;\;{\kern-5pt\boldsymbol{A}}}}^*_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}}=\frac {V}{4\pi c^2}{\dot {\;\;{\kern-5pt\boldsymbol{A}}}}_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}. \end{equation}

The radiation component of the Hamiltonian becomes

(1.154)

\begin{equation} H_{\mathrm{rad}}=R=\sum\limits^{\prime} _{{\boldsymbol{k}}\widehat {,{\boldsymbol{e}}}}{\!\left (\Pi \dot {A}+{\Pi }^*{\;\dot{\!A}}^*\right )}-L_{\mathrm{rad}}=\sum\limits^{\prime} _{{\boldsymbol{k}}\widehat {,{\boldsymbol{e}}}}{\frac {4\pi c^2}{V}{\left\vert \Pi \right\vert }^2+\frac {Vk^2}{4\pi }{\vert A\vert }^2}. \end{equation}

We introduce the definition $A\equiv (a+ib)\sqrt {{2\pi c^2}/{V}}$ and recast (1.154) as

(1.155)

\begin{equation} H_{\mathrm{rad}}=\sum\limits^{\prime} _{{\boldsymbol{k}}}{\sum\limits_{\hat {{\boldsymbol{e}}}}{\left \{\frac{1}{2}{\dot {a}}^2_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}+\frac {k^2c^2}{2}a^2_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}\right \}+\sum\limits^{\prime} _{{\boldsymbol{k}}}{\sum\limits _{\hat {{\boldsymbol{e}}}}{\left \{\frac{1}{2}{\dot {b}}^2_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}+\frac {k^2c^2}{2}b^2_{{\boldsymbol{k}},\hat {{\boldsymbol{e}}}}\right \}}}}}. \end{equation}

We are on our way to calculating the expectation of the statistically averaged Hamiltonian. Recall from (1.148) that $Z_{\textrm{classical}}=Z_{\textrm{kin}}Z_{\textrm{conf}}Z_{\textrm{rad}}$ and $e^{-\beta H}=e^{-\beta K}e^{-\beta C}e^{{-\beta}{R}}$ based on (1.145). We note that there are two harmonic oscillators in $H_{\textrm{rad}}$ . Similar to (1.126) but accounting for the two harmonic oscillators in the radiation contribution to the Hamiltonian one obtains

(1.156)

\begin{equation} \langle H_{{\boldsymbol{k}}\hat {{\boldsymbol{e}}}}\rangle =\frac {2\hslash kc}{e^{\beta \hslash kc}-1}\to 2T\quad \textrm{for}\quad \hslash kc\ll T \end{equation}

and

(1.157)

\begin{equation} \langle H_{\mathrm{rad}}\rangle =\sum\limits^{\prime} _{{\boldsymbol{k}}\widehat {,{\boldsymbol{e}}}}{\langle H_{{\boldsymbol{k}}\hat {{\boldsymbol{e}}}}\rangle }=\sum\limits^{\prime} _{{\boldsymbol{k}}\hat {{\boldsymbol{e}}}}{\frac {2\hslash kc}{e^{\beta \hslash kc}-1}=\sum\limits _{{\boldsymbol{k}}\hat {{\boldsymbol{e}}}}{\frac {2\hslash kc}{e^{\beta \hslash kc}-1}}}, \end{equation}

which again recovers the black-body radiation formula. We conclude that the radiation in the volume is independent of the particles except to the extent that they influence the dispersion relation for the electromagnetic normal modes. For example, in a plasma

(1.158a)

\begin{equation} {\bigg\langle \frac {B^2}{8\pi }\bigg\rangle }_{{\omega }_k,k,\hat {{\boldsymbol{e}}}}=\frac {T}{2}\frac {k^2c^2}{k^2c^2+{\omega }^2_p} \quad \textrm{(in a plasma)} \end{equation}

and in a vacuum

(1.158b)

\begin{equation} {\bigg\langle \frac {B^2}{8\pi }\bigg\rangle }_{{\omega }_k,k,\hat {{\boldsymbol{e}}}}=\frac {T}{2}\quad \textrm{(in a vacuum)} \end{equation}

Theorem: In a classical system in thermal equilibrium, statistical mechanics and classical mechanics imply that the average magnetization in response to a finite applied magnetic field B ${}_{0}$ vanishes, $\langle$ M $\rangle$ = 0 (Bohr–Van Leeuwen).

Definition: ${\boldsymbol{j}}({\boldsymbol{x}}\!\left ({{\boldsymbol{r}}}_i,{{{{\boldsymbol v}}}}_i\right ))\equiv \Sigma _i{e_i{{{{\boldsymbol v}}}}_i\delta ({\boldsymbol{x}}-}{{\boldsymbol{r}}}_i).$

To illustrate the Bohr–Van Leeuwen theorem consider the term in the Lagrangian in (1.134) that depends on A ${}_{0.}$

(1.159)

\begin{equation} L=\dots +\frac {1}{c}\int\nolimits {\textrm{d}^3x}{{\boldsymbol{A}}}_0\cdot {\boldsymbol{j}}\!\left ({\boldsymbol{x}}\right )+\dots , \end{equation}

from which follows using the functional derivatives introduced in (1.141)

(1.160)

\begin{equation} \frac {1}{c}{\boldsymbol{j}}\!\left ({\boldsymbol{x}}\right )=\frac {\overline {\partial }L}{\overline {\partial }{{\boldsymbol{A}}}_0}=-\frac {\overline {\partial }H}{\overline {\partial }{{\boldsymbol{A}}}_0}\bigg \vert _{{\boldsymbol{p}},{\boldsymbol{q}}}. \end{equation}

Corollary: For small changes $\Delta \lambda$ in $L(q,\dot {q};\lambda )$ and $H(q,\dot {q};\lambda )$

(1.161)

\begin{equation} \delta L\vert _{q,\dot {q}}=-\delta {H} \vert _{p,q}. \end{equation}

Using $Z=\smallint {\textrm{d}\varGamma e^{-\beta H}}$ we note

(1.162)

\begin{equation} \frac {1}{Z}\frac {\overline {\partial }Z}{\overline {\partial }{{\boldsymbol{A}}}_0}=\frac {\overline {\partial }{\text{ ln } Z(\beta ,v,\phi _0)}}{\overline {\partial }{{\boldsymbol{A}}}_0}\equiv 0\ \ \end{equation}

and using (1.148) $Z=Z_{\mathrm{kinetic}}(\beta ,V)Z_{\mathrm{config}}(\beta ,\phi )Z_{\mathrm{rad}}(V,\beta )$ , none of which components Z ${}_{i}$ depend on ${{\boldsymbol{A}}}_0,$ and (1.162)

(1.163)

\begin{equation} \frac {1}{Z}\frac {\overline {\partial }Z}{\overline {\partial }{{\boldsymbol{A}}}_0}=-\frac {\int\nolimits {\textrm{d}\varGamma e^{-\beta H}}\frac {\overline {\partial }H}{\overline {\partial }{{\boldsymbol{A}}}_0}}{Z} \to \bigg\langle \frac {\overline {\partial }H}{\overline {\partial }{{\boldsymbol{A}}}_0}\bigg\rangle =0. \end{equation}

Hence, from (1.160)

(1.164)

\begin{equation} \bigg\langle \frac {1}{c}{\boldsymbol{j}}\!\left ({\boldsymbol{x}}\right )\bigg\rangle =\bigg\langle \frac {\overline {\partial }L}{\overline {\partial }{{\boldsymbol{A}}}_0}\bigg\rangle =-\bigg\langle {\left .\frac {\overline {\partial }H}{\overline {\partial }{{\boldsymbol{A}}}_0}\right .}\bigg \vert _{{\boldsymbol{p}},{\boldsymbol{q}}}\bigg\rangle =0 \end{equation}

and

(1.165)

\begin{equation} \langle {\boldsymbol{j}}\rangle =c\nabla \times \langle {\boldsymbol{M}}\rangle \!\left ({\boldsymbol{x}}\right )\to \langle {\boldsymbol{j}}\rangle =0\to \langle {\boldsymbol{M}}\rangle =0. \end{equation}

Thus, the averaged equilibrium current and magnetization are zero in a classical system. Systems governed by quantum mechanics do not have to obey the Bohr–Van Leeuwen theorem, e.g., superconductors, permanent magnets with permanent magnetic dipole moment, etc.

Exercise: In a system with a uniform constant applied magnetic field B ${}_{0}$ and ${{\boldsymbol{A}}}_0=({1}/{2}){{\boldsymbol{B}}}_0 \times {\boldsymbol{x}}$ with $L_{\textrm{int}}={\boldsymbol \mu }\cdot {{\boldsymbol{B}}}_0$ and ${\boldsymbol \mu }\equiv (2c)^{-1}\smallint {\textrm{d}^3\textit{x}}{\boldsymbol{x}}\times {\boldsymbol{j}}\!\left ({\boldsymbol{x}}\right ),$ show that $\langle {\boldsymbol \mu }\rangle =0=\langle {\partial L}/{\partial {{\boldsymbol{B}}}_0}\rangle$ analogous to the arguments and results in (1.160)–(1.165).

[Editor’s Note: The Bohr–Van Leeuwen theorem continues to attract attention and a rich literature exists. Some of the proofs of the Bohr–Van Leeuwen theorem reveal subtleties associated with boundary conditions for finite domains.]

1.2.7. Grand canonical ensemble, grand partition function, and chemical potential

We next turn to consideration of grand canonical ensembles. A grand canonical ensemble is a macroscopic ensemble of states that is in equilibrium with a reservoir. We assume that the system I is in contact with reservoir system II, and the volume of system I is fixed. System I may exchange particles and heat with system II, but the total energy $E=E_{\textrm{I}}+E_{\textrm{II}}$ and total number of particles $N_s=N^I_s+N^{II}_s$ are fixed. Typically system II is much larger than system I.

From (1.44) and (1.45), we note the probability of system I being in a particular microstate n with energy $E_{\textrm{I}}$ and number of particles $\textrm{N}_{s}^{I}$ is

(1.166)

\begin{align} w^I_{\left \{N^I_s\right \},n}\propto {\varGamma }_{\textrm{II}}\big[E_{\textrm{II}}&=E-E_{\textrm{I}}(n,\{N^I_s\}'), {N}^{II}_s=N_s-N^I_s\big]=e^{S_{\textrm{II}}(\dots\kern-0.2pt)}\nonumber \\[4pt]&=e^{S_{\textrm{II}}\!\left (E,N_s\right )-E_{\textrm{I}}{\frac {\partial S_{\textrm{II}}}{\partial E_{\textrm{I}}}\big\vert }_E-\sum\limits _s{N^I_s{\frac {\partial S_{\textrm{II}}}{\partial N^{II}_s}\big\vert }_{N_s}+\dots }}. \end{align}

Consider the microcanonical entropy $S\!\left (E,N_s;\lambda \right )$ and its properties $\beta \equiv {{\partial S}/{\partial E}\vert }_{N,\lambda }$ and ${{\boldsymbol \gamma }}_s\equiv {{\partial S}/{\partial {{\boldsymbol{N}}}_s}\vert }_{E,N_{s'},\lambda }$ where ${{\boldsymbol{N}}}_s$ is the vector of particle numbers whose component index s denotes the species. We note that ${{\boldsymbol \gamma }}_s=O\!\left (1\right ).$ For example, the entropy for an ideal gas is

(1.167)

\begin{align} S\!\left (E,N_s;\lambda \right )&=\sum\limits _s{N_s\left [\frac {5}{2}-{\text{ ln } \frac {N_s}{V}{\left (\frac {h}{\sqrt {\frac {4\pi }{3}m_s\frac {E}{N_s}}}\right )}^3}\right ]=\sum\limits _s{N_s\left [\frac {5}{2}-{\text{ ln } n_s{{\Lambda}_s}^3}\right ]}}, \nonumber\\[4pt] N&=\sum\limits _s{N_s}. \end{align}

We next normalize the right-hand side of (1.166) to finish evaluating the probability of being in the microstate n at thermal equilibrium:

(1.168)

\begin{equation} w^I_{\left \{N^I_s\right \},n}=\frac {e^{-{\beta }_{\textrm{II}}E_{\textrm{I}}-{{\boldsymbol \gamma }}_{\textrm{II}}\cdot {{\boldsymbol{N}}}_{\textrm{I}}}}{\sum\limits _{\left \{{{\boldsymbol{N}}}^I_s\right \}}{\sum\limits _n{e^{-{\beta }_{\textrm{II}}E_{\textrm{I}}-{{\boldsymbol \gamma }}_{\textrm{II}}\cdot {{\boldsymbol{N}}}_{\textrm{I}}}}}}. \end{equation}

The denominator in (1.168) is identified as the grand partition function

(1.169)

\begin{equation} \mathbb{Z}\textrm {(}{\beta }_{\textrm{II}},{\gamma }_{\textrm{II}}.\lambda )\equiv \ \sum\limits _{\left \{{{\boldsymbol{N}}}^I_s\right \}}{\sum\limits _n{e^{-{\beta }_{\textrm{II}}E_{\textrm{I}}(n,{{\boldsymbol{N}}}_{\textrm{I}})-{{\boldsymbol \gamma }}_{\textrm{II}}\cdot {{\boldsymbol{N}}}_{\textrm{I}}}}}\equiv \sum\limits _{\left \{{{\boldsymbol{N}}}^I_s\right \}}{e^{-{{\boldsymbol \gamma }}_{\textrm{II}}\cdot {{\boldsymbol{N}}}_{\textrm{I}}}{\textrm {Z}}_{\textrm{I}}({{\boldsymbol{N}}}_{\textrm{I}},}{\beta }_{\textrm{II}},\lambda ), \end{equation}

where

(1.170)

\begin{equation} {\textrm {Z}}_{\textrm{I}}({{\boldsymbol{N}}}_{\textrm{I}},\,{\beta }_{\textrm{II}},\lambda )\equiv \sum\limits _n{e^{-{\beta }_{\textrm{II}}E_{\textrm{I}}(n,{{\boldsymbol{N}}}_{\textrm{I}})}} \end{equation}

is the ‘petite’ partition function.

A small change in the entropy satisfies the difference equation

(1.171)

\begin{equation} \textrm{d}S=\beta \textrm{d}E+{{\boldsymbol \gamma }}_s\cdot \textrm{d}{{\boldsymbol{N}}}_s \to \textrm{d}E=T\textrm{d}S-T{{\boldsymbol \gamma }}_s\cdot \textrm{d}{{\boldsymbol{N}}}_s. \end{equation}

Definition: ${{\boldsymbol \mu }}_s\equiv -T{{\boldsymbol \gamma }}_s$ is a set of chemical potentials. Equivalently, ${{\boldsymbol \mu }}_s\equiv {\partial E(S,{{\boldsymbol{N}}}_s,\lambda )}/{\partial {{\boldsymbol{N}}}_s}$ .

Recalling the definition of ${{\boldsymbol \gamma }}_s$ , we can express the chemical potential as follows:

(1.172)

\begin{equation} {\mu }_s=T{\text{ ln } n_s{\Lambda}^3_s}. \end{equation}

Keep in mind that N ${}_{s}$ is variable.

Example: The grand partition function for an ideal gas (of identical particles) is

(1.173)

\begin{equation} \mathbb{Z} = \sum\limits _N{e^{-\gamma N}\frac {{(Z_1)}^N}{N!}\approx \sum\limits ^{\infty }_{N=0}{\frac {{\!\left (Z_1e^{-\gamma }\right )}^N}{N!}}}= e^{Z_1e^{-\gamma }}, \end{equation}

where Z ${}_{1}$ is the one-particle partition function as defined after (1.116), Z ${}_{1}$ $\equiv$ $\Sigma _{n_i}{e^{-\beta {{\mathcal E}}_{n_i}}}\to {V_{\textrm{I}}}/{{\Lambda}^3}$ using (1.119). We note that (1.170) and (1.171) yield the simple identities

(1.174)

\begin{equation} \langle {{\boldsymbol{N}}}_{\textrm{I}}\rangle =-\frac {\partial {\text{ ln } \mathbb {Z}\textrm {(}{\beta }_{\textrm{II}},{{\boldsymbol \gamma }}_{\textrm{II}})}}{\partial {{\boldsymbol \gamma }}_{\textrm{II}}}\quad \textrm {and}\quad \langle E_{\textrm{I}}\rangle =-\frac {\partial {\text{ ln } \mathbb {Z}\textrm {(}{\beta }_{\textrm{II}},{{\boldsymbol \gamma }}_{\textrm{II}})}}{\partial {\beta }_{\textrm{II}}}. \end{equation}

Example: In an ideal gas $\langle N_{\textrm{I}}\rangle =-({\partial {\text{ ln } \textrm {Z}\!\left ({\beta }_{\textrm{II}},{{\boldsymbol \gamma }}_{\textrm{II}}\right )}}/{\partial {{\boldsymbol \gamma }}_{\textrm{II}}})=Z_1e^{-\gamma }=$ ${V_{\textrm{I}}e^{-{\gamma }_{\textrm{II}}}}/{{\Lambda}^3{(\beta }_{\textrm{II}})}=(V_{\textrm{I}} n_{\textrm{II}} \Lambda^3(\beta_{\textrm{II}})/ \Lambda^3(\beta_{\textrm{II}})) = n_{\textrm{II}}V_{\textrm{I}}$ and the number densities $n_{\textrm{II}}=n_{\textrm{I}}$ using (1.171), (1.172), and the definitions, and assuming that the bath and system I share the same mix of species and can freely exchange particles without disturbing the physics.

Exercise: Show that $\langle {\left (\delta N_{\textrm{I}}\right )}^2\rangle \sim {{\partial }^2{\text{ ln } \textrm {Z}}}/{\partial {\gamma }^2_{\textrm{II}}}$ is small if system I is macroscopic, i.e., $N_{\textrm{I}}\sim O\!\left ({10}^{23}\right ).$

Consider a macroscopic system I whose probability is given by

(1.175)

\begin{equation} w_{{\{N}^I\}}\equiv \sum\limits _n{w_{n,\{N^I\}}=\frac {1}{\mathbb {Z}}e^{-{\boldsymbol \gamma }\cdot {\boldsymbol{N}}}Z\!\left (\beta ,{\boldsymbol{N}}\right )=}\frac {1}{\mathbb {Z}}e^{-\beta (F-{\boldsymbol \mu }\cdot {\boldsymbol{N}})}\equiv \frac {1}{\mathbb { Z}}e^{-\beta \Omega }, \end{equation}

where F is the Helmholtz free energy, ${\boldsymbol \mu }=-T{\boldsymbol \gamma }$ , and the grand potential is defined as

(1.176)

\begin{equation} \Omega \equiv \Omega \!\left (\beta ,{\boldsymbol \mu };{\boldsymbol{N}}\right )\equiv F\!\left (\beta ,{\boldsymbol{N}}\right )-{\boldsymbol \mu }\cdot {\boldsymbol{N}}. \end{equation}

Recall that $\boldsymbol{N}$ is the vector representing the set of occupation numbers in different states.

The probability of having a given set of occupation numbers is

(1.177)

\begin{equation} w_{{\boldsymbol{N}}}=\frac {1}{\textrm {Z}\textrm {(}\beta \textrm {,}\mu \textrm {)}}e^{-\beta \Omega \textrm {(}\beta \textrm {,}\mu ;{\boldsymbol{N}}{\mathbf )}}. \end{equation}

The maximum of the probability $w_{{\boldsymbol{N}}}$ with respect to $\boldsymbol{N}$ occurs at ${{\boldsymbol{N}}}^*$ , i.e., $\langle {\boldsymbol{N}}\rangle ={{\boldsymbol{N}}}^*$ . Given (1.177), the maximum of $w_{{\boldsymbol{N}}}$ corresponds to minimizing $\Omega$ with respect to ${\boldsymbol{N}}.$ At the most probable set of occupation numbers ${{\boldsymbol{N}}}^*$ the grand potential satisfies the relation

(1.178)

\begin{equation} \Omega \cong -\frac {1}{\beta }{\text{ln }\textrm {Z}} \end{equation}

plus a constant of smaller order.

Using the relations $F\equiv E-TS,$ $\Omega =F-{\boldsymbol \mu }\cdot {\boldsymbol{N}}$ , (1.171), and (1.172), we have

(1.179)

\begin{equation} \textrm{d}E=T\textrm{d}S+{\boldsymbol \mu }\cdot \textrm{d}{\boldsymbol{N}}+\Lambda \textrm{d}\lambda \quad \textrm {and}\quad \textrm{d}F=-S\textrm{d}T+{\boldsymbol \mu }\cdot \textrm{d}{\boldsymbol{N}}+\Lambda \textrm{d}\lambda \, \end{equation}

where, for example, we might choose $\lambda =V$ and $\Lambda =-P$ from (1.65); and we arrive at

(1.180)

\begin{equation} \textrm{d}\Omega =-S\textrm{d}T-{\boldsymbol{N}}\cdot \textrm{d}{\boldsymbol \mu }+\Lambda \textrm{d}\lambda \quad \textrm {and}\quad {\boldsymbol{N}}=-\frac {\partial \Omega }{\partial {\boldsymbol \mu }} =-\frac {\partial \textrm {ln}\mathbb {Z}}{\partial {\boldsymbol \gamma }} \to \langle {\boldsymbol{N}}\rangle ={{\boldsymbol{N}}}^{{\mathbf *}} \end{equation}

for a macroscopic system. From (1.180) with $\Lambda =-P$ and $\lambda =V$

(1.181)

\begin{equation} \textrm{d}\Omega =-S\textrm{d}T-{\boldsymbol{N}}\cdot \textrm{d}{\boldsymbol \mu }-P\textrm{d}V \to P=-\frac {\partial \Omega }{\partial V}\!\left (T,{\boldsymbol \mu },V\right )=-\frac {\Omega }{V}. \end{equation}

In deriving $P=-{\Omega }/{V}$ we argue that $\Omega$ is extensive (should scale with volume), while T and ${\boldsymbol \mu }$ are intensive. Hence, $\Omega =V\ \textrm {fn}(T,{\boldsymbol \mu }\textrm {)}$ and

(1.182)

\begin{equation} -PV=\Omega = F-{\boldsymbol \mu }\cdot {\boldsymbol{N}}. \end{equation}

Example: Grand partition function and grand potential for an ideal classical gas

(1.183)

\begin{equation} \Omega \!\left (T,\mu ,V\right )=-T{\text{ ln } \mathbb {Z} = -\frac {VT}{{\Lambda}^3(\beta )}e^{\beta \mu }}, \end{equation}

(1.184)

\begin{equation} P=-\frac {\Omega }{V}= \frac {T}{{\Lambda}^3(\beta )}e^{\beta \mu },\quad N=\frac {V}{{\Lambda}^3(\beta )}e^{\beta \mu },\quad \frac {P}{N}=\frac {T}{V} \to P=nT. \end{equation}

Theorem (Gibbs–Duhem): Take the differential of (1.182), substitute (1.181), and divide through by V to obtain

(1.185)

\begin{equation} \textrm{d}P=\frac {S}{V}\textrm{d}T+{\boldsymbol{n}}\cdot \textrm{d}{\boldsymbol \mu } \end{equation}

and P is determined as a function of T and ${\boldsymbol \mu },$ or T and ${\boldsymbol{n}}\equiv {\boldsymbol{N}}{\mathbf /}V$ , i.e., the equation of state,

(1.186)

\begin{equation} P\!\left (T,{\boldsymbol \mu }\right )\textrm {:}{\boldsymbol \ \ \ \ }{\boldsymbol{n}}\!\left (\textrm {T,}{\boldsymbol \mu }\right ) = \frac {\partial P}{\partial {\boldsymbol \mu }}\!\left (T,{\boldsymbol \mu }\right )\to P(T,{\boldsymbol{n}}). \end{equation}

We next consider a few interesting examples.

Example: Quantum ideal gas.

Consider a subsystem consisting of a single-particle quantum state k for a simple noninteracting electron gas. The energy of a single electron is

(1.187)

\begin{equation} {{\mathcal E}}_k=\left \{ \begin{array}{c@{\quad}l} \frac {p^2}{2m} & (\textrm {non-relativistic}), \\[4pt] \sqrt {p^2c^2+m^2c^4} & (\textrm {relativistic).} \end{array} \right . \end{equation}

Note: If the particles are photons (bosons) instead, one must be careful because they are not conserved. Ions, molecules, fermions, molecules, etc., are conserved if noninteracting (no ionization, no recombination, no chemistry). Including a magnetic field B and spin, the subsystem energy is

(1.188)

\begin{equation} E=\sum\limits _k{{{\mathcal E}}_kN_k\pm \hat {\mu }B}, \end{equation}

where $N_k$ is an occupation number ( $N_k=0,1$ for fermions due to the Pauli principle; and $N_k=0,1,2,\ldots ,\infty$ for bosons); and $\hat {\mu }$ is the magnetic moment associated with the spin. The probability of the macroscopic state k with occupation number $N_k$ is then

(1.189)

\begin{equation} w_{N_k}=\frac {e^{-\gamma N_k-\beta {{\mathcal E}}_kN_k}}{{\textrm {Z}}_k}\equiv \frac {e^{-\beta ({{\mathcal E}}_k-{{\mu }_s})N_k}}{\sum\nolimits ^{1\ or\ \infty }_{N_k=0}{e^{-\beta ({{\mathcal E}}_k-{{\mu }_s})N_k}}}, \end{equation}

where the sum in the denominator for the grand partition function is just two terms for fermions and a geometric series for bosons.

Example: Bosons.

In order that ${\textrm {Z}}_k$ converges, ${{\mathcal E}}_k\gt {\mu }$ for all k. We also note that ${{\mathcal E}}_0=0$ , which implies that ${\mu }\lt 0.$ Hence, $w_{N_k}\propto e^{-\beta ({{\mathcal E}}_k+\vert \mu \vert )N_k}$ , i.e., $w_{N_k}$ is a monotonic and exponentially decreasing function of $N_k$ . The most probable state is the state $N_k=0.$ From (1.189) one concludes

(1.190)

\begin{equation} \langle N_k\rangle \equiv \sum\limits ^{\infty }_{N_k=0}{w_{N_k}}N_k=\frac {1}{e^{\beta ({{\mathcal E}}_k+|{\mu }|)}-1}. \end{equation}

Here $\langle N_k\rangle$ is a monotonically decreasing function of ${{\mathcal E}}_k,$ and Einstein condensation can occur when $\langle N_k\rangle$ becomes macroscopically large which is possible at ${{\mathcal E}}_k=0$ .

Example: Fermions.

Because of the Pauli principle the occupation number $N_k$ = 0 or 1 for fermions. The probability $w_{N_k}$ is proportional to

(1.191)

\begin{equation} w_{N_k}\propto e^{-\beta ({{\mathcal E}}_k-{\mu })N_k}. \end{equation}

For ${\mu }\gt 0$ the argument of the exponential is positive for ${{\mathcal E}}_k\lt {\mu }_s$ and is negative for ${{\mathcal E}}_k\gt {\mu }$ . $w_{N_k}$ takes on just two values as a function of $N_k$ , at $N_k$ = 0 and 1. The argument of the exponential vanishes for ${{\mathcal E}}_k={\mu }$ the Fermi level. Figure 2 plots $\langle N_k\rangle =\Sigma ^1_{N_k=0}{w_{N_k}}N_k$ as a function of energy ${{\mathcal E}}_k$ (Fermi–Dirac distribution function).

Figure 2. Fermi–Dirac distribution function $\langle$ N ${}_{k}$ $\rangle$ =n( ${{\mathcal E}})$ (Riebesell Reference Riebesell2022).

We define ${{\mathcal E}}^{\prime} \equiv {{\mathcal E}}_k-{\mu }$ and the partition function ${\textrm {Z}}_k$ is then

(1.192)

\begin{equation} {\textrm {Z}}_k={\!\left (1+\sigma e^{-\beta {{\mathcal E}}^{\prime} }\right )}^{\sigma } \sigma \equiv \left \{ \begin{array}{l@{\quad}l} +1 & (\textrm {fermions}), \\[4pt] -1 & (\textrm {bosons}). \end{array} \right . \end{equation}

Then

(1.193)

\begin{equation} {\text{ ln } {\mathbb {Z}}_k}=\sigma {\text{ ln}(1+\sigma e^{-\beta {{\mathcal E}}^{\prime} })}\quad \textrm {and}\quad \mathbb {Z} = \prod\limits _k{{\mathbb {Z}}_k}\to {\text{ ln } \mathbb {Z} = \sum\limits _k{{\text{ ln } {\mathbb {Z}}_k}}} \end{equation}

We recall the analysis leading to (1.182) and obtain

(1.194)

\begin{equation} P=-\frac {\Omega }{V}=\frac {T}{V}{\text{ ln } \mathbb {Z} = \frac {T}{V}\sigma \sum\limits _k{{\text{ ln}\!\left (1+\sigma e^{-\beta ({{\mathcal E}}_k-{\mu })}\right )}}}. \end{equation}

We introduce the de Broglie wavenumber k and evaluate ${{\mathcal E}}_k={{\left (\hslash k\right )}^2}/({2m})$ . In (1.194) we replace $\Sigma _k{ \to gV\smallint {({\textrm{d}^3{\boldsymbol{k}}})/{{(2\pi )}^3}}}$ where the factor $g=2{\mathcal S}+1$ and the spin factor is ${\mathcal S}={1}/{2}$ or an integer. Then (1.194) becomes using $\text{ ln}\!\left (1+x\right )=\Sigma ^{\infty }_{\ell =1}{{(-1)}^{\ell -1}{x^{\ell }}/{\ell }}$

(1.195)

\begin{align} P&={\sigma gT\int\nolimits {\frac {\textrm{d}^3k}{{(2\pi )}^3}}\textrm {ln}\!\left (1+\sigma e^{-\beta ({{\mathcal E}}_k-{\mu })}\right )}\nonumber \\[4pt] &= \sigma gT\sum\limits ^{\infty }_{\ell =1}{\frac {{\!\left (-1\right )}^{\ell -\textrm {1}}}{\ell }}{\sigma }^{\ell }\int\nolimits {\frac {\textrm{d}^3{\boldsymbol{k}}}{{\!\left (2\pi \right )}^3}}e^{-\beta \ell \left ({{\mathcal E}}_k-{\mu }\right )}\nonumber \\[4pt] &= gT\sum\limits ^{\infty }_{\ell =1}{\frac {{\!\left (-\sigma \right )}^{\ell -\textrm {1}}}{\ell }}\int\nolimits {\frac {\textrm{d}^3{\boldsymbol{k}}}{{\!\left (2\pi \right )}^3}}e^{-\beta \ell \left ({{\mathcal E}}_k-{\mu }\right )}. \end{align}

Recalling the definition $\Lambda ={h}/{\sqrt {2\pi mT}}$ and introducing the dimensionless fugacity or absolute activity $\xi \equiv e^{\beta {\mu }}$ , (1.195) leads to

(1.196)

\begin{equation} P=\frac {gT}{{\Lambda}^{\textrm{d}}(\beta )}\sum\limits ^{\infty }_{\ell =1}{\frac {{\!\left (-\sigma \right )}^{\ell -\textrm {1}}}{{\ell }^{1+\frac {\textrm{d}}{2}}}}{\xi }^{\ell } \to P(T,\xi )=\frac {gT}{{\Lambda}^3(\beta )}\left [\xi -\frac {\sigma {\xi }^2}{2^{5/2}}+\frac {{\xi }^3}{3^{5/2}}+\dots \right ], \end{equation}

where the dimensionality d has been set to d = 3. We recall (1.186) which determines the particle density $n={\partial P}/{\partial {\mu }_s}$

(1.197)

\begin{align} n\!\left (T,\xi \right )&=\frac {\partial P\!\left (T,{\mu }\right )}{\partial {\mu }}=\frac {\beta \xi \partial P\!\left (T,\xi \right )}{\partial \xi }=\frac {g}{{\Lambda}^{\text{d}}\!\left (\beta \right )}\left [\xi \!\left (1-\frac {\sigma }{2^{\frac {\textrm{d}}{2}}}\xi +\dots \right )\right ]\nonumber \\[4pt]&=\frac {g}{{\Lambda}^{\text{d}}(\beta )}\sum\limits ^{\infty }_{\ell =1}{\frac {{\!\left (-\sigma \right )}^{\ell -1}}{{\ell }^{\text{d}/2}}{\xi }^{\ell }}. \end{align}

We note that the convergence of the expressions in (1.196) and (1.197) for P and $n$ requires that the absolute activity $\xi \equiv e^{\beta {\mu }}\lt 1$ , i.e., ${\mu }\lt 0$ . (1.197) can be inverted and solved iteratively for

(1.198)

\begin{equation} \xi \!\left (T,n\right )=\frac {n{\Lambda}^{\text{d}}(\beta )}{g}\!\left (1+\frac {\sigma }{2^{\text{d}/2}}\frac {n{\Lambda}^{\text{d}}(\beta )}{g}+\dots \right )\!. \end{equation}

The value of $n{\Lambda}^3$ yields a measure of how quantum mechanical the gas is: $n{\Lambda}^3\ll 1$ is the classical limit.

Using the definition $\xi \equiv e^{\beta {\mu }}$ and (1.198), one obtains

(1.199)

\begin{equation} \mu =T{\text{ ln}\!\left (\frac {n{\Lambda}^{\text{d}}(\beta )}{g}\right )}+ \textrm{corrections} \end{equation}

and from (1.196) and (1.198)

(1.200)

\begin{equation} P\!\left (n,T\right )=nT\!\left (1+\frac {\sigma }{2^{1+\text{d}/2}}\frac {n{\Lambda}^{\text{d}}}{g}+O{(n{\Lambda}^{\text{d}})}^2\right ) \end{equation}

where 3 is replaced by d.

The influence of $\sigma$ on the pressure is clear: $\sigma =+1$ for fermions has a repulsive effect, while $\sigma =-1$ for bosons has an attractive effect (symmetric wave function).

Example: Bose gas.

Consider a gas of bosons, $\sigma =-1.$ The pressure relation (1.196) becomes

(1.201)

\begin{equation} P(T,\xi )=\frac {gT}{{\Lambda}^{\text{d}}(\beta )}\sum\limits ^{\infty }_{\ell =1}{\frac {1}{{\ell }^{1+\text{d}/2}}{\xi }^{\ell }} \end{equation}

and the density relation (1.197) becomes

(1.202)

\begin{equation} n\!\left (T,\xi \right )=\frac {g}{{\Lambda}^{\text{d}}(\beta )}\sum\limits ^{\infty }_{\ell =1}{\frac {1}{{\ell }^{{\text{d}}/2}}{\xi }^{\ell }}. \end{equation}

The expression

(1.203)

\begin{equation} \frac {{\Lambda}^{\text{d}}(\beta )}{g}n\!\left (T,\xi \right )=\sum\limits ^{\infty }_{\ell =1}{\frac {1}{{\ell }^{{\text{d}}/2}}{\xi }^{\ell }} \end{equation}

is a monotonic increasing function of $\xi$ on [0,1] and takes on larger values for d = 2 than for d = 3 (it diverges at $\xi =1$ for d = 2). The limiting value of $\xi$ is $\xi =1,$ and the right-hand side of (1.203) becomes the Riemann zeta function ${R}(x={d}/{2})$ where $R (x )=\Sigma ^{\infty }_{\ell =1}{{1}/{{\ell }^x}}$ . A few values of ${R}(x\big)$ are given in the following list in order of increasing x: ${R}\big({1}/{2})=\infty, {R} (1)=\infty, {R}({3}/{2})=2.612, {R} (2 )={{\pi}^2}/{6}=1.645, {R} (4 )={{\pi}^4}/{90}=1.082, {R} (10 )=1.001.\ \textrm {Here}\;R$ is a monotonic decreasing function of its argument and asymptotes to unity for large argument. Thus, for d = 3, ${ (n{\Lambda}^3 )}_{\text{max}}=2.612g, g=2{\mathcal S}+1$ . Because $\xi$ is less than one, the value of ${ (n{\Lambda}^3 )}_{\text{max}}$ is actually less than $2.612g$ . Recall the discussion accompanying (1.190) that $\langle$ $N_k\rangle$ is a monotonically decreasing function of energy ${{\mathcal E}}_k$ and that a condensate can occur in the ground state. From (1.193) the partition function satisfies

(1.204)

\begin{equation} {\text{ ln } {\textrm {Z}}_0}=-{\text{ ln}(1-\xi )} \quad \textrm {and}\quad \text{ln } \textrm {Z} = \sum\limits _k{\sigma {\text{ ln}(1+\sigma e^{-\beta {{\mathcal E}}^{\prime} }).}} \end{equation}

Hence,

(1.205)

\begin{equation} \langle N_0\rangle =-\frac{\partial {\text{ ln } {\textrm {Z}}_0}}{\partial \gamma }=\xi \frac {\partial {\text{ ln } {\textrm{Z}}_0}}{\partial \xi }=\frac {\xi }{1-\xi } \to {\mathop {\lim }_{\xi \to 1} \frac {\xi }{1-\xi }=\frac {1}{1-\xi }} \end{equation}

and this is volume independent. Thus, $\langle N_0\rangle \to \infty$ as $\xi \to 1,$ and $\xi =1-1/\langle N_0\rangle$ . To illustrate the onset of the Bose–Einstein condensation, set $\xi$ to its limiting value $\xi$ =1 in (1.203), set d = 3, use $R\!\left ({3}/{2}\right )=2.612$ , and evaluate $\Lambda$ , $g=1,$ and n in a specific experiment for helium II to obtain

(1.206)

\begin{equation} T_0\!\left (n\right )=3.31\frac {{\hslash }^2}{m}n^{2/3}\; \to \; T_0\ \!\left (\textrm {theory}\right )=3.13^{\circ}\;\textrm{K}, \end{equation}

where $2\pi /{R\!\left ({3}/{2}\right )}^{{2}/{3}}=3.3128\dots$ . This compares with an experimental result of 2.19 $^{\circ}$ K.

[Editor’s Note: No reference to a specific experiment was given. The boiling point of He is 4.2 K, and He II becomes a superfluid at approximately 2.17 $^{\circ}$ K at 1 atmosphere pressure.]

There is a problem with applying the grand canonical ensemble to the description of the Bose–Einstein condensate. Consider (1.201) and (1.202) for the pressure P and the density n in the limits g = 1 and d = 3:

(1.207)

\begin{equation} P(T,\xi )=\frac {T}{{\Lambda}^3(\beta )}\sum\limits ^{\infty }_{\ell =1}{\frac {1}{{\ell }^{5/2}}{\xi }^{\ell }},\quad n\!\left (T,\xi \right )=\frac {1}{{\Lambda}^3(\beta )}\sum\limits ^{\infty }_{\ell =1}{\frac {1}{{\ell }^{3/2}}{\xi }^{\ell }}, \end{equation}

where $\xi =e^{-\gamma }=e^{\beta \mu }\lt 1$ because $\mu \lt 0$ . In the limit that $\xi \to 0 \xi =n{\Lambda}^3$ is the number of particles in a de Broglie cube. However, from the expression for the number density in (1.207), as $\xi \to 1$

(1.208)

\begin{equation} n{\Lambda}^3={\Lambda}^3\frac {\langle N\rangle }{V}=2.612\to \langle N\rangle =2.612\frac {V}{{\Lambda}^3}, \end{equation}

while in contrast (1.205) asserts $\langle N_0\rangle ={1}/({1-\xi })$ which diverges as $\xi \to 1$ and is volume independent, while $\langle N\rangle$ scales with V. Thus, there is a problem here. The difficulty is that in using the grand canonical ensemble, any particular energy state uses all the other states (systems) as the bath at a given temperature. However, when the particular state is the ground state, the bath is not so large in comparison with the number of states occupying the ground state at conditions such that $\xi \to 1$ are approached. The model of the grand canonical ensemble falls apart here for Bose statistics.

A solution for bosons is to use the Gibbs ensemble $Z(\beta ,V_{\textrm{I}},N_{\textrm{I}})$ in which the system (I) is in contact with a heat bath allowing exchange of energy but in which particle exchange is prohibited. In § 9.6 of Reif (Reference Reif1965) the authors present presented an analysis of Bose–Einstein statistics. There is a clever use of a Lagrange multiplier there. They show N ${}_{0}$ to be proportional to V, and the ground state is shown to support large fluctuations. Landau & Lifshitz (Reference Landau and LIfshitz1969) is another good reference on the Bose gas.

Figure 3. Phase diagram for Bose–Einstein condensate, density versus temperature.

A plot of the density versus temperature where the Bose–Einstein condensate onsets based on (1.206) is shown in figure 3. A schematic of the relative fraction in the ground state for the corrected theory is shown in figure 4, ${\langle N_0\rangle }/{N}\;\textrm {versus}\; T$ . Note that in the corrected theory both ${N}_{0}$ and N scale with volume. Regarding the pressure, the ground state has no energy; only the excited states contribute. Equation (1.207) gives the correct pressure. Figure 5 presents a schematic for the pressure versus density for various temperatures. For $n{\Lambda}^3\ll 1$ the system satisfies the ideal gas relation $P\sim nT$ while for $n{\Lambda}^3\gt 2.612$ the system begins to fill the ground state.

Figure 4. Schematic of ${\langle N_0\rangle }/{N}\,\textrm {versus}\ T$ for the bose condensate.

Figure 5. Schematic: P versus n for various temperatures.

Figure 6 presents a schematic of the pressure P versus the volume V = N/n for various isotherms. The critical pressure for a given volume above which volume there is no condensate scales as $P_c\sim V^{3/5}.$

Figure 6. Schematic: P versus V for various temperatures.

Example: Bosons in which the number N is not conserved, e.g., excitations and photons. In such situations N is not conserved, $N_k=0, 1, \dots ,\infty$ and $\mu =0.$ Calculate the properties using the grand canonical ensemble but with $\mu =0$ (should agree with canonical ensemble). For the special case of photons in vacuum with ${\omega }_k=kc,$ ${{\mathcal E}}_k=\hslash {\omega }_k=\hslash kc$ , summing over right-hand and left-hand circularly polarized waves in three dimensions, we can calculate the grand potential $\Omega$ from (1.183) and (1.203):

(1.209)

\begin{equation} \Omega =-VT\ {R}\!\left (4\right )\frac {1}{{\pi}^2}\frac {1}{{\lambda }^3_w},\ \ \ \ \ \,{\lambda }_w\equiv \frac {\hslash c}{T}, \end{equation}

where ${R}\!\left (4\right )= {{\pi}^4}/{90}$ .

Exercise: Verify (1.209).

Example: Ideal Fermi gas. Consider an electron gas with $g=2{\mathcal S}+1=2$ and $\sigma =1.$ The pressure from (1.195) is

(1.210)

\begin{equation} P(T, \mu )={\textrm {} 2T\int\nolimits {\frac {\textrm{d}^3\textit{k}}{{(2\pi )}^3}}\textrm {ln}\!\left (1+\sigma e^{-\beta ({{\mathcal E}}_k-{\mu })}\right )}. \end{equation}

For ${{\mathcal E}}_k\gt \ \mu :\ e^{-\beta ({{\mathcal E}}_k-\mu )}\ll 1$ in the limit that $T\to 0$ ( $\beta \to \infty$ ), so that $\text{ ln}\!\left (1+0\right )=0$ ; and the electrons are completely degenerate. Note that for ${{\mathcal E}}_k\equiv {p^2}/{2m}={{{\hslash }^2k}^2}/{2m}\lt \mu \ \textrm {:}\ \ e^{\beta \left ({\mu -{\mathcal E}}_k\right )}\gg 1$ and ${\text{ ln } e^{\beta (\mu -{{\mathcal E}}_k)}=\beta \!\left (\mu -{{\mathcal E}}_k\right )}\ \textrm{in }$ (1.210). The pressure receives finite contributions only for ${{\mathcal E}}_k\lt \mu$ when $T\to 0$ :

(1.211)

\begin{equation} P(0, \mu )={2\int\nolimits ^{k_{\text{max}}\ ({{\mathcal E}}_k\lt \mu )}_0{\frac {\textrm{d}^3\textit{k}}{{(2\pi )}^3}}(\mu -{{\mathcal E}}_k)}. \end{equation}

The density satisfies

(1.212)

\begin{equation} n\!\left (T=0,\mu \right )=\frac {\partial P}{\partial \mu }=2\int\nolimits ^{k_{\text{max}}\ ({{\mathcal E}}_k\lt \mu )}_0{\frac {\textrm{d}^3\textit{k}}{{(2\pi )}^3}}=2\frac {\frac {4\pi }{3}k^3_f}{{(2\pi )}^3}=\frac {8\pi }{3}{\left (\frac {p_F}{h}\right )}^3\equiv \frac {8\pi }{3}{{\Lambda}}^{-3}_F, \end{equation}

where we define the Fermi level

(1.213)

\begin{equation} \mu ={{\mathcal E}}_F=\frac {p^2_F}{2m}=\frac {{\hslash }^2k^2_F}{2m}. \end{equation}

From (1.212) and (1.213)

(1.214)

\begin{equation} k^3_F=\frac {3}{8\pi }\frac {N_0}{V}{(2\pi )}^3\Longrightarrow \,{\Lambda}_F=\frac {h}{p_F}\quad \textrm {and}\quad n{\Lambda}^3_F=\frac {8\pi }{3}. \end{equation}

Hence, as $T\to 0$

(1.215)

\begin{equation} E=\frac {3}{5}{{\mathcal E}}_FN \quad \textrm {and}\quad P=\frac {2}{3}\frac {\frac {3}{5}{{\mathcal E}}_FN}{V}=\frac {2}{5}{n{\mathcal E}}_F \end{equation}

in the nonrelativistic limit. Beware that ${{\mathcal E}}_F$ depends on n via (1.213) and (1.214).

Example: A nonideal gas (‘real gas’) in the classical and quantum limits

Consider a simple one-species gas with partition function in the classical limit. From (1.119)

(1.216)

\begin{equation} Z(\beta ,N,V)\equiv \sum\limits _n{e^{-\beta E_n(V,N)}=\frac {1}{N!}{\left (\frac {V}{{\Lambda}^3}\right )}^NQ_N(\beta ,V)}, \end{equation}

where $Q_N(\beta ,V)\equiv \ \smallint {{\left ({\textrm{d}^3{\boldsymbol{r}}}/{V}\right )}^N}e^{-\beta {\Phi \textrm {(}r_i)}}$ is the configurational partition function and $\Phi$ is the interaction potential. The grand partition function is then

(1.217)

\begin{equation} \mathbb {Z}\!\left (\gamma ,\beta ,V\right )=\sum\limits _N{e^{-\gamma N}}Z(\beta ,N,V). \end{equation}

Substituting (1.216) into (1.217)

(1.218)

\begin{align} \mathbb {Z}\!\left (\gamma ,\beta ,V\right )&=\sum\limits ^{\infty }_{N=0}{\frac {1}{N!}{\left (\frac {Ve^{-\gamma }}{{\Lambda}^3}\right )}^N}Q_N\!\left (\beta ,V\right ) =\sum\limits ^{\infty }_{N=0}{\frac {1}{N!}{\left (Vz\right )}^N}Q_N\!\left (\beta ,V\right )\nonumber\\[4pt] &=1+VzQ_1+\tfrac{1}{2}V^2z^2Q_2+\dots , \end{align}

where $z\equiv {\xi }/{{\Lambda}^3}={e^{-\gamma }}/{{\Lambda}^3}$ is the activity in density units. Note that in the Boltzmann gas limit $\xi \to n{\Lambda}^3.$ In (1.218), $Q_1=1$ (no self-interaction); and

(1.219)

\begin{equation} Q_2=\int\nolimits {\frac {\textrm{d}^3{\textit {r}}_1\textrm{d}^3{\textit {r}}_2}{V^2}}e^{-\beta {\Phi }_{12}}=\int\nolimits {\frac {\textrm{d}^3{\textit{r}}_1\textrm{d}^3{\textit {r}}_2}{V^2}}e^{-\beta \phi (r_{12})}. \end{equation}

For large numbers of particles the thermodynamics is independent of the particular ensemble. However, for finite systems, e.g., N = 100, the grand canonical ensemble is invalid; N is not much larger than ln(N). Consider applications of using the grand canonical ensemble (1.217)–(1.219). For $z={e^{-\gamma }}/{{\Lambda}^3}\to n$ as $n\to 0$ with $\gamma ={\partial S}/{\partial N}(E,V,N)$ , we have (1.218) for $\textrm {Z}\!\left (\gamma ,\beta ,V\right )$ with

(1.220)

\begin{equation} {Q}_N\!\left (T,V\right )=\int\nolimits {\frac {\textrm{d}^{3N}r_i}{V^N}}e^{-\beta {\Phi \left (r_i\right )}}\gt 0. \end{equation}

Consider (1.219) in more detail:

(1.221)

\begin{equation} Q_2=\int\nolimits {\frac {\textrm{d}^3{\textit {r}}_1\textrm{d}^3{\textit {r}}_2}{V^2}}e^{-\beta \phi (r_{12})}=\int\nolimits {\frac {\textrm{d}^3{\textit {r}}_1}{V}}{\left \{\int\nolimits {\frac {\textrm{d}^3\textit {s}}{V}}\left [e^{-\beta \phi (s)}-1\right ]+1\right \}=1-\frac {2b_2(T)}{V}}, \end{equation}

where $b_2\!\left (T\right )\equiv -({1}/{2})\smallint {\textrm{d}^3\textrm {s}}\left [e^{-\beta \phi (s)}-1\right ]$ is defined in (1.91). We note that the term ${2b_2(T)}/{V}=O\!\left ({1}/{V}\right ).$ Then

(1.222)

\begin{equation} \textrm {Z}\!\left (\gamma ,\beta ,V\right )=1+Vz+\tfrac{1}{2}V^2z^2Q_2+O\!\left (N^{*3}\right )+\dots +O{\left (N^*\right )}^{N^*}, \end{equation}

where $Vz={O(N}^*)$ , $({1}/{2})V^2z^2Q_2=O(N^{*2})$ , and so on. The terms increase successively until the $N^*$ term, after which the terms in the series fall off, and the series converges. The expansion in (1.222) is not as useless as one might think because the series is monotonic increasing in z, the activity. As a function of z, $\textrm {Z}\!\left (\gamma ,\beta ,V\right )$ increases from unity at z = 0; and ln $\textrm {Z}$ is greater than 0 and increases with z. There are some assumptions to keep in mind. For example, one requires that $\Phi \gt -\infty$ in order that Q ${}_{N }$ not go to $\infty ,$ which excludes point masses and point charges. For the hard core + van der Waals potential diagrammed in figure 1, there is a minimum volume for the hard-sphere particle $({4\pi}/{3}) r^3_0$ and a maximum number of particles in the volume: $N_{\text{max}}\sim {V}/{({4\pi}/{3}) r^3_0}$ For N $\gt$ N ${}_{\text{max}}$ $Q_N\to 0.$ In these circumstances we can terminate the series in (1.222) at N = $N_{\text{max}}$ . The grand partition function is analytic and finite. From (1.194)

(1.223)

\begin{equation} P=\frac {1}{\beta V}{\text{ ln } \mathbb {Z}(z,T,V)} \end{equation}

and the physical pressure independent of volume satisfies

(1.224)

\begin{equation} P{(z,T)}_{\mathrm{physical}}={\mathop {\lim }_{V\to \infty } \frac {1}{\beta V}{\text{ ln } \textrm {Z}(z,T,V)}}. \end{equation}

Figure 7. Schematic: $\beta P=P/T$ versus n equation of state and phase diagram.

Here $\beta P$ is a nonnegative and a monotonic increasing function of z. There is a possibility of a discontinuity in the slope of $\beta P$ with respect to z. Using (1.197)

(1.225)

\begin{equation} n\!\left (z,T, V\right )=\frac {\partial P\!\left (\mu ,T,V\right )}{\partial {\mu }}=z\frac {\partial}{\partial z}\left [\beta P(z,T,V\right ]=z\frac {\partial}{\partial z}\frac {{\text{ ln}\!\left (1+Vz+\frac{1}{2}V^2z^2Q_2\right )}}{V}. \end{equation}

The density n is positive and so is ${\partial n}/{\partial z}\gt 0$ . Where $\beta P$ has a discontinuity in its slope at $z=z_T$ , for $V\to \infty$ , a jump discontinuity can develop in the density n. The equation of state for $P$ as a function of density n can then exhibit a phase transition at $z_T$ . Figure 7 depicts a schematic of an equation of state and a phase diagram. Phase I might represent a gas, while phase II might represent a liquid or a solid. In constructing the equation of state and phase diagram in figure 7, we note that $0\lt \ \langle {(\delta N)}^2\rangle \equiv \langle N^2\rangle -{\langle N\rangle }^2$ and, as a consequence of (1.225),

(1.226)

\begin{equation} N-\langle N\rangle =\frac {{\partial }^2{\text{ ln } \textrm {Z}}}{{\partial }^2{\gamma }^2}=Vz\frac {\partial n}{\partial z}=T\langle N\rangle \frac {\partial n}{\partial P}\gt 0\ \ \Longrightarrow \ \ \frac {\partial n}{\partial P}\gt 0\ \textrm {and}\ \frac {\textrm{d}P}{\textrm{d}n}\ge 0 \end{equation}

using ${T}\gt 0$ , $\langle N\rangle \gt 0,$ and ${\partial n}/{\partial z}\gt 0$ . Hence, P either increases with respect to increasing n or has flat intervals.

There was fundamental work on phase transitions in physical systems in the thermodynamic limit based on the properties of small, model systems by Lee and Yang (Lee & Yang Reference Lee and Yang1952; Yang & Lee Reference Yang and Lee1952). The theory revolves around complex zeros of the partition function in finite-sized systems which may permit the possibility of phase transitions.

1.2.8. Systems with external fields

We next analyze a system in the presence of an external field. Consider a system with a downward-directed (with respect to z) gravitational field with gravitational acceleration g. The gravitational potential for a particle in energy state a with occupation number N ${}_{a}$ in subsystem II at z ${}_{\textrm{II}}$ with mass species s is juxtaposed with a subsystem I at z ${}_{\textrm{I}}$ above it is given by

(1.227)

\begin{equation} {\psi }^s_a=m_sgz_a \end{equation}

and the subsystem total energy summed over the internal energy and the energy in the external field is

(1.228)

\begin{equation} E_a=E^{\text{int}}_a+N_a{\psi }^s_a. \end{equation}

The subsystem II and bath I are assumed to satisfy the conservation laws:

(1.229)

\begin{equation} N=N_{\textrm{I}}+N_{\textrm{II}}, E=E_{\textrm{I}}+E_{\textrm{II}}. \end{equation}

We maximize the total entropy $S=S_{\textrm{I}}+S_{\textrm{II}}$ and deduce the relations:

(1.230)

\begin{equation} 0=\textrm{d}S=\sum\limits _a{\textrm{d}S_a=\sum\limits _a{\frac {\partial S_a}{\partial E^{\text{int}}_a}\textrm{d}E^{\text{int}}_a+\frac {\partial S_a}{\partial N_a}\textrm{d}N_a}}, \end{equation}

with

(1.231)

\begin{equation} \gamma \equiv {\frac {\partial S(E,N)}{\partial N}}\bigg \vert _E \quad \textrm {and}\quad {\beta }_a\equiv {\frac {\partial S(E,N)}{\partial E_a}}\bigg \vert _{N_a} \end{equation}

and $T_{\textrm{I}}=T_{\textrm{II}}.$ The dependence of the entropy $S(E,N)$ on the internal energy in II can be expressed in terms of the internal energy E ${}_{\textrm{int}}$ and the occupation number N as $S\!\left (E,N\right )=S^0\!\left (E_{\textrm{int}},N\right )=S^0\!\left (E-N\psi ,N\right )$ from which follows

(1.232)

\begin{align} -\beta \mu \equiv \gamma &={\frac {\partial S\!\left (E,N\right )}{\partial N}}\bigg \vert _E={\frac {\partial S^0}{\partial N}}\bigg \vert _{E_{\textrm{int}}}-\psi {\frac {\partial S^0}{\partial E_{\textrm{int}}}}\bigg \vert _N={\gamma }^0-\beta \psi =-\beta \!\left ({\mu }_0+\psi \right )\nonumber\\[4pt] & \Longrightarrow \ \mu ={\mu }_0+\psi , \end{align}

i.e., the potential energy of the subsystem is the sum of the internal chemical potential and the external potential energy. Subsystems I and II are contiguous and in equilibrium with one another, and they share a common temperature. Thus, (1.232) applies to both; and $\gamma$ and, hence, $\mu$ are continuous:

(1.233)

\begin{equation} {\mu }_{\textrm{I}}={\mu }_{\textrm{II}}={\mu }^I_0+{\psi }^I={\mu }^{II}_0+{\psi }^{II}. \end{equation}

For the case of an ideal gas,

(1.234)

\begin{equation} {\mu }_0=T\textrm {ln}\!\left (n{\Lambda}^3\right ) \textrm{and}\; T\textrm {ln}\!\left (n_{\textrm{I}}{\Lambda}^3\right )+mgz_{\textrm{I}}=T\textrm{ln}\!\left (n_{\textrm{II}}{\Lambda}^3\right )+mgz_{\textrm{II}}, \end{equation}

from which follows the result for an isothermal system:

(1.235)

\begin{equation} \frac {n_{\textrm{I}}}{n_{\textrm{II}}}=e^{-\beta mg(z_{\textrm{I}}-z_{\textrm{II}})}. \end{equation}

If there is no net force on the isothermal system, then

(1.236)

\begin{equation} \nabla \mu =0\ \longrightarrow \ \nabla {\mu }_0=-\nabla \psi \to \frac {T\nabla n}{n}=mg\ \Longrightarrow \ n\!\left (z\right )=n(0)e^{-\beta mgz}. \end{equation}

We can further elaborate the results in (1.234)–(1.236) using the Gibbs–Duhem relation (1.186)

(1.237)

\begin{equation} {\boldsymbol{n}}\!\left (\textrm {T,}\,{\boldsymbol \mu }\right )=\frac {\partial P}{\partial {\boldsymbol \mu }}\!\left (T,{\boldsymbol \mu }\right )\to P(T,{\boldsymbol \mu }). \end{equation}

In the absence of the external field,

(1.238)

\begin{equation} P\!\left (T,\mu \right ) \to P^0\!\left (T,{\mu }_0\right )=P^0\!\left (T,\mu -\psi\right ) \Longrightarrow \ \nabla P=\frac {\partial P^0}{\partial {\mu }_0}(-\nabla \psi ). \end{equation}

For example, $-\nabla \psi =m{\boldsymbol{g}}$ and then $\Delta P=-nmg\Delta z$ .

1.2.9. Particle interactions: hard disks, pair and triplet correlations

We next return to consideration of correlated particles, the virial expansion, and (1.218) and (1.219):

(1.239)

\begin{align} \textrm {Z}\!\left (\gamma ,\beta ,V\right )&=\sum\limits ^{\infty }_{N=0}{\frac {1}{N!}{\left (Vz\right )}^N}Q_N\!\left (\beta ,V\right )\nonumber \\[4pt] &=1+Vz+\tfrac{1}{2}V^2z^2Q_2+\frac {1}{6}V^3z^3Q_3+\dots ,\nonumber \\[4pt] {Q}_N\!\left (T,V\right )&=\int\nolimits {\frac {\textrm{d}^{3N}r_i}{V^N}}e^{-\beta {\Phi \left (r_i\right )}}, \end{align}

(1.240)

\begin{equation} Q_2=\int\nolimits {\frac {\textrm{d}^3r_1\textrm{d}^3r_2}{V^2}}e^{-\beta {\Phi }_{12}}=1-\frac {2b_2(T)}{V}, \end{equation}

for the case of studying the transition from a fluid to a solid, and the formation of a close-packed structure. Berni Alder was a pioneer in molecular dynamics simulations and exploited hard-disk models in the interaction potentials. Consider a working model for $Q_3$ which captures the interaction of three particles:

(1.241)

\begin{equation} Q_3=\int\nolimits {\frac {\textrm{d}^3{\textit {r}}_1\textrm{d}^3{\textit {r}}_2\textrm{d}^3{\textit {r}}_3}{V^3}}e^{-\beta {\textrm{[}\phi }_{12}+\phi _{13}+\phi _{23}]}=\int\nolimits {\frac {\textrm{d}^3r_1\textrm{d}^3{\textit {r}}_2\textrm{d}^3{\textit {r}}_3}{V^3}}e^{-\beta \phi (r_{12})}e^{-\beta {\phi \textrm {(}r_{13})}}e^{-\beta {\phi \textrm {(}r_{23})}}. \end{equation}

Definition: Introduce $f_{ij},$ $e^{-\beta \phi _{ij}}=1+f_{ij}$ where $\mathop {\lim }\limits_{r_{ij}\to \infty } f_{ij}\to 0$ . Then

(1.242)

\begin{align} \!\left (1+f_{12}\right )\!\left (1+f_{23}\right )\!\left (1+f_{13}\right )&=1+f_{12}+f_{23}+f_{13}+f_{12}f_{23}+f_{12}f_{13}\nonumber\\[4pt] &\quad +{\ f}_{23}f_{13}+ f_{12}f_{23}f_{13}. \end{align}

Now start performing the integrals in (1.241), e.g., $({V}/{V^3})\smallint \textrm{d}^3r_1\textrm{d}^3r_2f_{12}=$ $({V^2}/{V^3})\smallint {\textrm{d}^3r_{21}f_{12}}$ ; all of the terms are like this. Hence,

(1.243)

\begin{align} Q_3&=1+\frac {3}{V}\int\nolimits {\textrm{d}^3r_{21}f_{12}}+\frac {3}{V^2}\int\nolimits {\textrm{d}^3r_{21}{\int\nolimits {\textrm{d}^3r_{31}f_{12}}f_{13}}}\nonumber\\[4pt]&+\frac {1}{V^3}\int\nolimits {\textrm{d}^3r_{23}}\int\nolimits {\textrm{d}^3r_{21}{\int\nolimits {\textrm{d}^3r_{31}f_{12}}f_{13}f_{23}}}. \end{align}

The second through fourth terms on the right-hand side of (1.243) correspond pictorially to $-,\wedge, {\Delta }$ , i.e., two-particle interactions among two vertices (particles) or three vertices (particles). Here $Q_4$ has additional terms like $\boxtimes$ and other ways to connect four vertices depicting two-particle interactions among four vertices (particles). From (1.218) we can employ a Taylor-series expansion to calculate

(1.244)

\begin{align} \text{ ln } \mathbb {Z}&=Vz+\frac{1}{2}V^2z^2\left [Q_2-1\right ]+\frac {1}{6}V^3z^3\left [Q_3-3Q_2+2\right ]\dots\nonumber\\[4pt] &= Vz+\frac{1}{2}V^2z^2\bigg[\frac {1}{V}(-)\bigg]+\frac {1}{6}V^3z^3\bigg[\frac {1}{V^2}(3\wedge {+\ \Delta })\bigg]+\dots \ \end{align}

in terms of the pictograms (Hill Reference Hill1960). Hence,

(1.245)

\begin{equation} \frac {{\text{ ln } \mathbb {Z}}}{V}={z+}\frac{1}{2}z^2(-)+\frac {1}{6}z^3{(3\wedge {+\ \Delta })+}\dots =\beta P. \end{equation}

Thus, the pressure is volume independent. We can rewrite (1.245) in the following form:

(1.246)

\begin{align} \beta P\!\left (z,T\right )&=\sum\limits ^{\infty }_{j=1}{z^{\,j}C_j(T)},\nonumber\\[4pt] C_1&=1,\nonumber \\[4pt] C_2&=\tfrac{1}{2}\!\left (-\right ),\nonumber\\[4pt] C_3&=\tfrac{1}{2}\!\left (\Lambda \right )+\frac {1}{6}\!\left (\Delta \right )=\tfrac{1}{2}{(-)}^2+\frac {1}{6}\!\left (\Delta \right ),\nonumber\\[4pt] C_4&=\dots . \end{align}

From (1.225)

(1.247)

\begin{equation} n\!\left (z,T\right )=z{\partial }/{\partial z}\!\left (\beta P\right )=\sum\limits ^{\infty }_{j=1}{{jz}^jC_j(T)}. \end{equation}

Hence,

(1.248)

\begin{equation} n=z+2z^2C_2+3z^3C_3+\dots \end{equation}

and

(1.249)

\begin{equation} \beta P=z+z^2C_2+z^3C_3+\dots \end{equation}

from which it follows that

(1.250)

\begin{equation} z=n-2n^2C_2+n^3\!\left (8{C_2}^2-3C_3\right )+\dots . \end{equation}

Substituting z from (1.250) into (1.249) and collecting terms in a power series in n, a virial expansion for $\beta P$ can be obtained:

(1.251)

\begin{equation} \beta P\!\left (n,T\right )=\sum\limits ^{\infty }_{j=1}{n^{\,j}b_j(T)}, \end{equation}

where $b_1=1, b_2=-C_2=-({1}/{2})\!\left (-\right ), b_3=4{C_2}^2-2C_3={\!\left (-\right )}^2-{\!\left (-\right )}^2-({1}/{3})\!\left (\Delta \right )=-({1}/{3})\!\left (\Delta \right )$ , and $b_4$ contains irreducible forms involving ( $\boxtimes,$ …). We discussed the evaluation of $b_2$ earlier for the van der Waals model (1.92). For $b_3$ we have

(1.252)

\begin{equation} b_3=-\frac {1}{3}\!\left (\Delta \right )=-\frac {1}{3}\int\nolimits {\textrm{d}^3r_{32}}\int\nolimits {\textrm{d}^3r_{21}{\int\nolimits {\textrm{d}^3r_{31}f_{12}}f_{13}f_{23}}}. \end{equation}

Example: Assume a hard-sphere model

(1.253)

\begin{align} f_{12} = \left\{ \begin{array}{l@{\quad}l} -1,& r_{12}\le a, \\[4pt]0, & r_{12} \gt a, \end{array} \right. \end{align}

which corresponds to $\phi _{12}=\infty$ for $r_{12}\le a$ and $\phi _{12}=0$ for $r_{12}\gt a$ , where a is the diameter of the hard sphere or disk. We note that this model is highly simplified and artificial. Actual configurations of three hard spheres generally cannot satisfy $r_{ij}\le a$ for all three interacting pairs except for one orientation in which the three hard sphere centers correspond to vertices of an equilateral triangle. Configurations of four hard spheres cannot satisfy $r_{ij}\le a$ for all possible pairs. Nevertheless, use of the hard-sphere model in (1.253) and ignoring the reality that certain interacting pairs can never satisfy $r_{ij}\le a$ allow us to push through the calculations determining the virial coefficients (Frisch Reference Frisch1964).

The evaluation of the virial coefficients in two dimensions leads to

(1.254)

\begin{equation} b_1=1,\ \ b_2=-C_2=-\tfrac{1}{2}\!\left (-\right )=\tfrac{1}{2}\pi a^2,\ \ b_3=0.782(b_2)^2, b_4=0.5327(b_2)^3. \end{equation}

Then using (1.251) it follows that

(1.255)

\begin{equation} \beta P=n+b_2n^2+0.782{b_2}^2n^3+0.533{b_2}^3n^4+O(n^5). \end{equation}

We note that $({1}/{2})\pi a^2$ is the area of two hard disks, and $n\pi a^2/2$ is the average number of particles within two disks in two dimensions. The maximum two-dimensional density for close-packed hard disks is $n_{\text{max}}=2/a^2$ from which $n\pi a^2/2=\pi n/n_{\text{max}}$ and (1.255) becomes

(1.256)

\begin{equation} \frac {P}{nT}=1+\frac {n}{n_{\text{max}}}+{\pi}^20.782{\left (\frac {n}{n_{\text{max}}}\right )}^2+{\pi}^30.533{\left (\frac {n}{n_{\text{max}}}\right )}^3+\dots , \end{equation}

where P/nT is an increasing function of n/n ${}_{\text{max}}$ , and increases faster as n/n ${}_{\text{max}}$ increases in this example; P/nT diverges at a value of n/n ${}_{\text{max}}$ equal to the radius of convergence which is less than or equal to unity. However, the virial coefficients are not necessarily positive in the nonfluid region; and the P/T versus n/n ${}_{\text{max}}$ plot can have a flat region as in figure 7 where there is a phase transition. In the flat region the local number density is the ratio of the sum of the number of particles or molecules in the two phases divided by the sum of the volumes of the two phases. Recall the arguments accompanying (1.224)–(1.226) regarding the positivity of both P/T and ${\textrm{d}P}/{\textrm{d}n}$ for the grand canonical ensemble. By including more physics in $\phi _{12}$ , e.g., attractive forces, P/T versus n can acquire more structure.

1.2.10. Simple model of a phase transition

Example: Hard-disk interaction in a two-dimensional periodic domain (particles leaving the domain re-enter symmetrically). Alder and collaborators computed the pressure using the virial expansion. In two dimensions and taking the time average to simplify

(1.257)

\begin{equation} P=\frac {\langle K\rangle }{V}+\frac {1}{2V}\bigg\langle \sum\limits _{\mathrm{collisions}}{\left ({{\boldsymbol{r}}}_i-{{\boldsymbol{r}}}_j\right )\cdot {{\boldsymbol{f}}}^{\,i}_j}\bigg\rangle . \end{equation}

The collisional interaction can be evaluated as ${{\boldsymbol{f}}}^{\,i}_j=({\textrm{d}}/{\textrm{d}t}){m{{{{\boldsymbol v}}}}_i\vert }_{\text{collision}}={\Delta m{{{{\boldsymbol v}}}}_i}/{\Delta t}$ and

(1.258)

\begin{equation} \frac {1}{\Delta t}\sum\limits ^{in\ \Delta t}_{\mathrm{coll}}{r_{ij}\langle }\vert \Delta m{{{{\boldsymbol v}}}}_i\vert \rangle ={\nu }_{\mathrm{coll}}\frac {N}{2}a\langle \vert \Delta m{{{{\boldsymbol v}}}}_i\vert \rangle , \end{equation}

where $r_{ij}=a$ for hard disks and ${\nu }_{\mathrm{coll}}$ is the single-particle collision rate. We define an effective temperature although there is no heat bath, $K\equiv NT$ . Hence,

(1.259)

\begin{equation} P=nT+\frac {na}{4}{\nu }_{\mathrm{coll}}\langle \vert \Delta m{{{{\boldsymbol v}}}}_i\vert \rangle \end{equation}

or using ${\nu }_{\mathrm{coll}}\equiv {\sqrt {\langle v^2_i\rangle }}/{\ell }$ where $\ell$ is the collisional mean-free-path

(1.260)

\begin{equation} \frac {P}{nT}=1+\frac {a{\nu }_{\mathrm{coll}}}{4}\frac {\langle \vert \Delta m{{{\boldsymbol v}}}_i\vert \rangle }{\langle \frac{1}{2}mv^2_i\rangle }\equiv 1+\frac {a}{\ell }\frac {\langle {{\boldsymbol \vert }\Delta {{{\boldsymbol v}}}}_i\vert \rangle }{2\sqrt {\langle v^2_i\rangle }}. \end{equation}

The right-hand side of (1.260) has no explicit temperature dependence, only geometry and density dependence, at least for hard disks.

Hence, P/T versus n is a universal curve independent of isotherm for hard-sphere interactions.

We note that in a dilute medium the collisional mean-free-path scales as $\ell \sim 1/n\sigma \sim 1/na^2$ where $\sigma$ is the collision cross-section. In dense regimes the collisions are pretty much head on. As $n\to n_{\text{max}}$ , then $\ell \to 0$ in crystalline structures.

In 1957 Alder and Wainwright published the results of numerical Monte Carlo calculations based on a hard-sphere model for the interaction potential leading to equation-of-state results (Alder & Wainwright Reference Alder and Wainwright1957; Wood & Jacobson Reference Wood and Jacobson1957). Figure 8 taken from Wood & Jacobson (Reference Wood and Jacobson1957) shows Alder and Wainwright’s equation of state results for systems with 108 and 32 molecules, in which PV ${}_{0}$ /T is plotted versus the normalized volume V/V ${}_{0}$ , which is proportional to the number density n, and V ${}_{0}$ is the close-packed volume for the system. Alder and Wainwright’s results show an overlapping region wherein two distinct pressure states can coexist for the same volume. The system supports the possibility of a spontaneous transition. In the overlap region, i.e., the two-phase regime, there is a great amount of instability with large fluctuations.

Figure 8. Monte Carlo equation of state results from Wood and Jacobson (1957) showing their results and those of Alder & Wainwright (Reference Alder and Wainwright1957) (solid line for 108 molecules; + for 32 molecules).

If we were to plot P/T versus n for Alder and Wainwright’s results in figure 8, P/T would grow from 0 at n = 0 in the fluid region for increasing n until it reaches a critical density n ${}_{c}$ , above which P/T might continue to increase if the system remains a fluid, but the system may instead jump to a crystalline solid branch at a lower value of P/T whose P/T then increases with n. Note that we earlier proved ${\textrm{d}P}/{\textrm{d}n}\gt 0$ for the grand canonical ensemble, whereas the Alder and Wainwright systems do not satisfy this constraint because their systems are microcanonical ensembles with a relatively small number of particles/molecules rather than an infinite number.

Theories of the liquid state are difficult. Liquids are hard-sphere systems with a weak interaction between particles/molecules considered as a perturbation.

(1.261)

\begin{equation} \frac {P}{T}=\frac {\partial S(E,V,N)}{\partial V}=-n^2\frac {\partial {\mathcal S}\!\left ({\mathcal E},n\right )}{\partial n}\to \frac {P}{nT}=1+h(n). \end{equation}

From (1.167) and (1.261) one obtains

(1.262)

\begin{equation} {\mathcal S}\!\left ({\mathcal E},n\right )=\frac {5}{2}-{\text{ ln } n{\Lambda}^3\!\left ({\mathcal E}\right )-\int\nolimits ^n_0{\frac {\textrm{d}n^{\prime} }{n^{\prime} }h(n').}} \end{equation}

Why is the entropy contribution negative as n increases? As n increases for N fixed, V decreases and configuration space is limited so $\varGamma$ decreases and the entropy must decrease.

Consider a solid and its own vapor with strong enough forces such that even at $P\to 0$ there is a solid. Assume there is a solid surrounded by a gas. Further assume that the energy of the solid can be represented by

(1.263)

\begin{equation} E_{\mathrm{sol}}=-N_{\mathrm{sol}}{{\mathcal E}}_b+f(T)\!\left (\textrm {neglible term}\right )+g(V)\!\left (\textrm {neglible term}\right ), \frac {1}{V}\frac {\textrm{d}V}{\textrm{d}P}P_0\ll 1. \end{equation}

The specific energy of the solid is ${{\mathcal E}}_{\mathrm{sol}}=-{{\mathcal E}}_b$ , the binding energy. Fixed constants are

(1.264)

\begin{equation} N=N_{\mathrm{gas}}+N_{\mathrm{sol}} \quad\textrm{and}\quad V=V_{\mathrm{gas}}+V_{\mathrm{sol}}, \;\textrm{where}\; V_{\mathrm{sol}}=N_{\mathrm{sol}}{{\mathcal V}}_0, \end{equation}

where ${{\mathcal V}}_0$ is the volume per solid molecule. At equilibrium

(1.265)

\begin{equation} {\mu }_{\mathrm{sol}}={\mu }_{\mathrm{gas}} \to {\mu }_{\mathrm{sol}}=\frac {\partial E_{\mathrm{sol}}}{\partial N_{\mathrm{sol}}}=-{{\mathcal E}}_b={\mu }_{\mathrm{gas}}=T\ \textrm {ln}\ n_{\mathrm{gas}}{\Lambda}^3\!\left (T\right ) \Longrightarrow \ n_{\mathrm{gas}}{\Lambda}^3=e^{-{{\mathcal E}}_b/T}. \end{equation}

Equation (1.265) is a nice formula for the vapor pressure. The classical limit in (1.265) is $n_{\mathrm{gas}}{\Lambda}^3\ll 1$ , which implies that for ${{\mathcal E}}_b\sim 1$ eV, $T\ll$ 1 eV = 12,000 $^{\circ}{} \textrm{K}$ .

Consider increasing N ${}_{\mathrm{sol}}$ and concomitantly $V_{\mathrm{sol}}=N_{\mathrm{sol}}{{\mathcal V}}_0$ , while holding T fixed. The volume $V_{\mathrm{gas}}$ must decrease for the total volume $V$ fixed. In this case, $N_{\mathrm{gas}}{\Lambda}^3=V_{\mathrm{gas}}e^{-{{\mathcal E}}_b/T}$ decreases with decreasing $V_{\mathrm{gas}}$ . Here $n_{\mathrm{gas}}=N_{\mathrm{gas}}/V_{\mathrm{gas}}$ is just a function of temperature and remains fixed. Hence, $P_{\mathrm{gas}}=n_{\mathrm{gas}}T=({T}/{{\Lambda}^3})e^{-{{\mathcal E}}_b/T}$ remains fixed, i.e., the vapor pressure remains constant as we increase N ${}_{\mathrm{sol}}$ .

We next include volume dependence in (1.263) so that $V_{\mathrm{sol}}$ is allowed to vary around its optimum equilibrium value:

(1.266)

\begin{equation} E_{\mathrm{sol}}=-N_{\mathrm{sol}}{{\mathcal E}}_b+\alpha {{\mathcal E}}_b\frac {N_{\mathrm{sol}}}{2}\frac {{\!\left ({{\mathcal V}}_s-{{\mathcal V}}_0\right )}^2}{{{\mathcal V}}^2_0}, \end{equation}

where ${N_{\mathrm{sol}}}/{2}$ represents the number of interacting pairs and ${{\mathcal V}}_s=V_{\mathrm{sol}}/N_{\mathrm{sol}}$ . From (1.266) is follows that

(1.267)

\begin{equation} P_{\mathrm{sol}}=-{\frac {\partial E_{\mathrm{sol}}}{\partial V_{\mathrm{sol}}}\bigg\vert }_{N_{\mathrm{sol}}}=-\alpha {{\mathcal E}}_b\frac {{\!\left ({{\mathcal V}}_s-{{\mathcal V}}_0\right )}}{{{\mathcal V}}^2_0}, \end{equation}

which has the form of a Hooke’s force law. We can divide through by $N_{\mathrm{sol}}$ in (1.266) to obtain the specific energy per solid molecule. We note that the system has the following attributes:

(1.268)

\begin{equation} P_{\mathrm{sol}}=P_{\mathrm{gas}},\quad {\mu }_{\mathrm{sol}}={\mu }_{\mathrm{gas}},\quad V=V_{\mathrm{sol}}+V_{\mathrm{gas}},\quad N=N_{\mathrm{sol}}+N_{\mathrm{gas}},\quad S=S_{\mathrm{sol}}+S_{\mathrm{gas}}.\ \end{equation}

The total entropy is the sum of the gas and solid entropies:

(1.269)

\begin{align} S&=S_{\mathrm{sol}}\!\left (E_{\mathrm{sol}},V_{\mathrm{sol}},N_{\mathrm{sol}}\right )+S_{\mathrm{gas}}\!\left (E_{\mathrm{gas}},V_{\mathrm{gas}},N_{\mathrm{gas}}\right )\nonumber\\[4pt]& = N_{\mathrm{sol}}{{\mathcal S}}_{\mathrm{sol}}\!\left ({{\mathcal E}}_{\mathrm{sol}},n_{\mathrm{sol}}\right ) + N_{\mathrm{gas}}{{\mathcal S}}_{\mathrm{gas}}\!\left ({{\mathcal E}}_{\mathrm{gas}},n_{\mathrm{gas}}\right ). \end{align}

Given that the solid and gas are in equilibrium with one another at the same temperature, then $P_{\mathrm{sol}}=P_{\mathrm{gas}}$ and ${\mu }_{\mathrm{sol}}={\mu }_{\mathrm{gas}}$ in (1.268). From ${\mu }_{\mathrm{sol}}={{\partial E_{\mathrm{sol}}}/{\partial N_{\mathrm{sol}}}\vert }_{S_{\mathrm{sol}},V_{\mathrm{sol}}}=-{{\mathcal E}}_b+({1}/{2})\alpha {{\mathcal E}}_b{{\left ({{\mathcal V}}_s-{{\mathcal V}}_0\right )}^2}/{{{\mathcal V}}^2_0}$ , ${\mu }_{\mathrm{gas}}=T\ \textrm {ln}\ n_{\mathrm{gas}}{\Lambda}^3\!\left (T\right )$ , ${\mu }_{\mathrm{sol}}={\mu }_{\mathrm{gas}}$ , $P_{\mathrm{gas}}=n_{\mathrm{gas}}T=({T}/{{\Lambda}^3})e^{-{{\mathcal E}}_b/T}=P_{\mathrm{sol}}=-\alpha {{\mathcal E}}_b{{\left ({{\mathcal V}}_s-{{\mathcal V}}_0\right )}}/{{{\mathcal V}}^2_0}$ , it then follows that

(1.270)

\begin{equation} \frac {{\!\left ({{\mathcal V}}_s-{{\mathcal V}}_0\right )}}{{{\mathcal V}}_0}=-\frac {1}{\alpha }\frac {T}{{{\mathcal E}}_b}\frac {v_0}{{\Lambda}^3} e^{-{{\mathcal E}}_b/T}, \end{equation}

where ${1}/{\alpha }\sim \ O\!\left (1\right ),\ \ {T}/{{{\mathcal E}}_b}\lt O\!\left (1\right ),$ ${v_0}/{{\Lambda}^3}$ $\ \sim O\!\left (1\right ),\ \textrm {and }e^{-{{\mathcal E}}_b/T}\ll 1$ .

From (1.270) we conclude that only negligible deviations (compression or expansion) of ${{\mathcal V}}_s$ from ${{\mathcal V}}_0$ are allowed. From (1.268) and (1.270) we can also derive

(1.271)

\begin{equation} N_{\mathrm{gas}}\cong \frac {V-N_{\mathrm{sol}}{{\mathcal V}}_0}{{\Lambda}^3}e^{-{{\mathcal E}}_b/T}. \end{equation}

Figure 9 presents of schematic diagram for the P versus V relation for the gas–solid system with N and T fixed. For small values of V greater than the minimum value Nv ${}_{0 }$ the system is a solid and the pressure is largest. As V increases, P decreases while ${N_{\mathrm{sol}}}/{N}\sim 1$ for a while, until ${N_{\mathrm{sol}}}/{N}$ begins to decrease and ${N_{\mathrm{gas}}}/{N}$ increases. Both gas and solid phases occupy the flattish intermediate region of the P versus V relation. At the largest values of V there is only the gas phase. Not shown in figure 9 are trajectories followed from either end of P versus V where we progress along curves in the complete absence of the other phase, namely, beginning at largest V gas only and beginning at smaller V solid phase only. Along these curves we can have metastable states which require either nonuniformities, e.g., for the supersaturated vapor to collect and precipitate upon, or over a long time cavities will appear (when the solid is subjected to too little pressure) in which there is vapor.

Figure 9. Schematic for P versus V phase diagram for the gas–solid system.

1.2.11. Quantum virial expansion

Consider hydrogen atoms (no ionization) and H ${}_{2 }$ formation. In the quantum picture the grand canonical partition function is

(1.272a)

\begin{align} \textrm {Z}\!\left (\beta ,\gamma ,V\right )&=\sum\limits ^{\infty }_{N=0}{e^{-\gamma N}Z_N}\!\left (\beta ,V\right )=\sum\limits ^{\infty }_{N=0}e^{-\gamma N}\sum\limits _n e^{-\beta E_n(V)}\nonumber\\[4pt]&=1+e^{-\gamma }Z_1+e^{-2\gamma }Z_2+\dots , \end{align}

where $\ \mu \equiv -T\gamma$ and introducing the internal energy ${\,{\mathcal E}}^{\text{int}}_k$

(1.272b)

\begin{align} Z_1&=\sum\limits _k{e^{-\beta {{\mathcal E}}_k}=}\sum\limits _k{e^{-\beta \left (\frac {{\hslash }^2k^2}{2m}+{{\,{\mathcal E}}^{\text{int}}_k}\right )}=}\!\left (\sum\limits _k{e^{-\beta \frac {{\hslash }^2k^2}{2m}}}\right )\!\left (\sum\limits _n{e^{-\beta {\,{\mathcal E}}^{\text{int}}_n}}\right )\nonumber\\[4pt]&=\frac {V}{{\Lambda}^3}Z^{\text{int}}_1(\beta ). \end{align}

Recalling the analysis in (1.192)–(1.194), the statistical weight factor associated with the possible quantum states now including angular momentum and spin quantum numbers is $g_0=(2S+1)(2I+1)$ where S = 1/2 and I = 1/2, then $Z^{\text{int}}_1(\beta )$ = $\Sigma _n{e^{-\beta {\,{\mathcal E}}^{\text{int}}_n}}$ = 4 $e^{-\beta {(-I}_H)}$ where $I_H=13.6$ eV, and we ignore excited states of hydrogen to be consistent with the assumption of no ionization. Recalling the virial expansion in (1.218) and (1.219) we have

(1.273)

\begin{equation} \mathbb{Z}\!\left (\gamma ,\beta ,V\right )=\sum\limits ^{\infty }_{N=0}{\frac {1}{N!}{\!\left (Vz\right )}^N}Q_N\!\left (\beta ,V\right ). \end{equation}

Define

(1.274)

\begin{equation} Z_N\equiv \frac {{\!\left (Z_1\right )}^N}{N!}Q_N \end{equation}

and $Q_N$ are quantum virial coefficients:

(1.275)

\begin{equation} Q_0=1,\quad Q_1=1,\quad \ Q_2=\frac {2Z_2}{Z^2_1},\ \dots \end{equation}

Analogous to (1.234), $\mu =T[{\text{ln } z{\Lambda}^3-{\text{ ln } g_0}}]-I_H$ where $g_0=\left (2S+1\right )\!\left (2I+1\right )$ $=O\!\left (1\right )$ compared with $z{\Lambda}^3$ . Note that z as in (1.250) is equal to n to lowest order. Analogous to (1.251)

(1.276)

\begin{equation} \beta P\!\left (n,T\right )=\sum\limits ^{\infty }_{j=1}{n^jb_j(T)},\ \ b_1=1,\ \ b_2\!\left (T\right )=-\frac {V}{2}\left [Q_2\!\left (T,V\right )-1\right ],\ \dots . \end{equation}

Example: For an ideal gas we employ classical counting for the possible states and conclude that $z_{2}=z_{1}^{2}/2$ and $b_{2}=0$ .

Example: For quantum systems we carefully count the possible states. Consider bosons with no internal structure, e.g., the helium atom with two states: $n=(k_1,k_2).$ Then with $E_n={{\mathcal E}}_{k_1}+{{\mathcal E}}_{k_2}$

(1.277)

\begin{align} Z_2&=\sum\limits _n{e^{-E_n}=\left (\sum\limits _{k_1,k_2,k_1\lt k_2}{+\sum\limits _{k_1=k_2}{}}\right )e^{-\beta \left ({{\mathcal E}}_{k_1}+{{\mathcal E}}_{k_2}\right )}}\nonumber\\[4pt]&=\left (\tfrac{1}{2}\sum\limits _{k_1,k_2}{+\tfrac{1}{2}\sum\limits _{k_1=k_2}{}}\right )e^{-\beta \left ({{\mathcal E}}_{k_1}+{{\mathcal E}}_{k_2}\right )}\nonumber\\[4pt]&= \tfrac{1}{2}\left (Z^2_1\!\left (\beta \right )+\sum\limits _{k_1}{e^{-2\beta {{\mathcal E}}_k}}\right )=\tfrac{1}{2}\left (Z^2_1\!\left (\beta \right )+Z_1\!\left (2\beta \right )\right ), \end{align}

where, from (1.119) and (1.274)–(1.276)

(1.278)

\begin{equation} Z_1\!\left (\beta \right )=V/{\Lambda}^3(\beta ) \quad \textrm{and}\quad b_2\!\left (T\right )=\frac {-V}{2Z^2_1\!\left (\beta \right )}Z_1\!\left (2\beta \right )=-\frac {{\Lambda}^3(\beta )}{4\sqrt {2}}. \end{equation}

Example: For Fermions we exclude the state with $k_1=k_2$ and derive

(1.279)

\begin{equation} b_2\!\left (T\right )=\frac {{\Lambda}^3(\beta )}{4\sqrt {2}}, \end{equation}

because Fermions have repulsive interactions.

1.2.12. Numerical simulation of equations of state and phase transitions, Berni Alder, and molecular dynamics

Berni Alder’s work was mentioned earlier in §§ 1.2.9 and 1.2.10. Alder made seminal contributions to molecular dynamics and was a pioneer in demonstrating molecular dynamics as a viable approach to studying the statistical mechanics of many-body interacting systems (Alder Reference Alder1972, Reference Alder1973). By employing Monte Carlo methods in the numerical integration of the Newtonian equations of motion for ensembles of particles, i.e., molecular dynamics simulations, a numerical scheme for solving the Liouville equations was devised; and the partition function and its derivatives were obtained. This approach was necessarily limited in the number of particles and, hence, generated a microcanonical ensemble. The application of Monte Carlo integration methods is not straightforward and is good only when the quasi-ergodic hypothesis is valid. How does one evaluate (1.220) using Monte Carlo integration methods? We have

(1.280)

\begin{equation} {Q}_N\!\left (T,V\right )=\int\nolimits {\frac {\textrm{d}^{3N}{\textit {r}}_i}{V^N}}e^{-\beta {\Phi \left ({{\boldsymbol{r}}}_1,{{\boldsymbol{r}}}_{\textrm {2}},\dots \right )}}. \end{equation}

In discussing the challenges attendant in numerically calculating Q ${}_{N}$ and the partition function there are several points to make.

1. The numerical integration of Q ${}_{N}$ involves a multidimensional integration with a certain number of Monte Carlo sampling points per dimension. If there are l points per dimension and 3N dimensions, then there could be as many as l ${}^{3}$ ${}^{N }$ evaluations of the integrand. The dimensionality could be a number of order ∼ 10 ${}^{23}$ . Furthermore, Q can be sharply peaked necessitating more numerical resolution locally. The curse of dimensionality is a formidable aspect in evaluating Q ${}_{N}$ .

[Editor’s Note: In recent years researchers dealing with uncertainty quantification and machine learning have introduced systematic approaches to sample a multidimensional space in an optimally efficient fashion to mitigate the “curse of dimensionality” problem. Furthermore, in the 50 years since these lectures computing power has grown enormously; and molecular dynamics simulations have been able to evolve to address systems that are orders of magnitude larger and more complex.]

Numerical methods have more success in the case of the virial expansion of the equation of state where the dimensionality and complexity of the successive virial coefficients grows in a very limited fashion, e.g., (1.255) and (1.256).
2. For instantaneous phase-space pictures one can calculate the potential energy by randomly placing particles in a box and computing $V_N=\Sigma {V_{ij}(r_{ij})},$ and this is used in $e^{-V_N/kT}$ . Problems emerge at high density where particles get so close together that the interaction potential $V_{ij}\to \infty$ and $e^{-V_N/kT}\to 0$ . In practice, it is unlikely that random loading of particles one by one will get to the critical densities where divergences may occur. So-called quiet and quasirandom loading algorithms have been developed. The loading problem becomes a function of the loading history.
3. What succeeds is ‘importance sampling’ in which a modified distribution is sampled instead of the actual distribution in order to reduce the variance in the sampling process. One generates a Markov chain numerically to obtain results for a canonical ensemble (in reality a microcanonical ensemble).

Example: Consider hard spheres for which $\phi =0$ or $\infty$ (not accessible). Load N hard spheres in a defined volume V for a given temperature T. Randomly displace one sphere in the list. If this results in no overlap with another hard sphere, then this is a successful new configuration and weight it by $e^{-V_N/kT}$ . If, instead, this results in overlapping another hard sphere, then return the sphere to its original position and count the old configuration in the sum over states weighted by $e^{-V_N/kT}$ . Continue through the list. This kind of method works in practice, but in a certain sense it is theoretically incapable of coming up with all configurations for hard spheres. The simulation examples shown in figure 8 illustrate phase transition phenomena for relatively small ensembles of simulation test particles. For insufficient numbers of particles it becomes difficult to distinguish the distinct phases, and the relative size of the statistical fluctuations becomes problematic.

1.2.13. Example: structureless particles with an interaction potential

Here we consider structureless particles with an interaction potential represented in the virial expansion (§§ 1.2.2 and 1.2.9) by

(1.281)

\begin{equation} b_2\!\left (T\right )=-\frac {V}{Z^2_1}\!\left (Z_2-\frac{1}{2}Z^2_1\right ), \end{equation}

where

(1.282)

\begin{equation} Z_1=\frac {V}{{\Lambda}^3(\beta ,m)}, Z_2=Z^{c.m.}_2Z^{\text{rel}}_2, Z^{c.m.}_2=\frac {V}{{\Lambda}^3(\beta ,2m)}=\frac {V}{{\Lambda}^3(\beta ,m)}2^{3/2}. \end{equation}

We introduce $Z^{\text{rel}}_2=(Z^{\text{rel}}_2-Z^{\text{rel}(0)}_2)+Z^{\text{rel}(0)}_2$ where the terms in the parentheses are the contribution due to the interaction. Then $b_2=b^{(0)}_2+b^{\text{int}}_2$ and

(1.283)

\begin{align} b^{\text{int}}_2\!\left (T\right )&=-\frac {V^2}{{\left (\frac {V}{{\Lambda}^3}\right )}^2{\Lambda}^3}2^{\frac {3}{2}}\Big({Z}^{\text{rel}}_2-{Z}^{\text{rel}\!\left (0\right )}_2\Big)\nonumber \\[4pt]&=-2^{\frac {3}{2}}{\Lambda}^3(\beta ,m)\left [\sum\limits _k{e^{-\beta {{\mathcal E}}_k}-\sum\limits _{k^0}{e^{-\beta {{\mathcal E}}_{k^0}}}}\right ]. \end{align}

The first sum on the right-hand side of (1.283) is

(1.284)

\begin{equation} \sum\limits _k e^{-\beta {{\mathcal E}}_k}=\sum\limits _{k,\mathrm{bound}}e^{+\beta {\vert {\mathcal E}}_k\vert }+\sum\limits _{k,\textrm{free}}{e^{-\beta {{\mathcal E}}_{k}}} \end{equation}

where ${{\mathcal E}}_k=({1}/{2}){{\hslash }^2k^2}/{\left ({m}/{2}\right )}$ (note the reduced mass m/2); and the second sum in (1.283) is just over free (or ‘scattered’) states ( ${{\mathcal E}}_{k^0}\gt 0)$ . At large distances the asymptotic wave function for free states (positive energy) with angular momentum $\ell$ is (Landau & Lifshitz Reference Landau and LIfshitz1969; § 77):

(1.285)

\begin{align} &{\psi }_{\ell }\sim \frac {1}{r}\textrm {sin}\!\left (kr-\frac {\ell \pi }{2}+{\delta }_{\ell }\!\left (k\right )\right ),\nonumber\\[4pt] &\quad k=\frac {p}{\hslash };\,{\delta }_{\ell }\!\left (k\right )=0\; \textrm{if no interaction, } \neq 0\; \textrm{with interaction}. \end{align}

Returning to (1.283) and (1.284), in the limit of large volume

(1.286)

\begin{equation} \sum\limits _{k,\textrm{free}}{e^{-\beta {{\mathcal E}}_k}-\sum\limits _{k^0,\textrm{free}}{e^{-\beta {{\mathcal E}}_{k^0}}\to \frac {1}{\pi}\sum\limits ^{\infty }_{\ell =0}{(2\ell +1)\int\nolimits ^{\infty }_0{\textrm{d}k\frac {\textrm{d}{\delta }_{\ell }}{\textrm{d}k}e^{-\beta\hslash k^2/2m}}}}} \end{equation}

and (1.283) yields, after integrating by parts,

(1.287)

\begin{align} b^{\text{int}}_2\!\left (T\right ) &=-{\Lambda}^3\bigg(\beta ,\frac {m}{2}\bigg)\sum\limits ^{\infty }_{\ell =0}{(2\ell +1)}\left [\sum\limits _{{{{\mathcal E}}}^{\ell }_k\lt 0}{e^{+\beta \vert {{\mathcal E}}^{\ell }_k\vert }-\frac {1}{\pi}{\delta }_{\ell }\!\left (0\right )+\frac {\beta }{\pi}\int\nolimits ^{\infty }_0{\text{d}{\mathcal E}{\delta }_{\ell }\!\left ({\mathcal E}\right )e^{-\beta {\mathcal E}}}}\right ]\nonumber\\[5pt]&=-{\Lambda}^3\bigg(\beta ,\frac {m}{2}\bigg)\sum\limits ^{\infty }_{\ell =0}{(2\ell +1)}\bigg[\sum\limits _{{\mathcal E}^{\ell }_k\lt 0}\left[e^{+\beta {{\mathcal E}}^{\ell }_k\vert }-1\right]+\frac {\beta }{\pi}\int\nolimits ^{\infty }_0\text{d}{\mathcal E}{\delta }_{\ell }({\mathcal E})e^{-\beta {\mathcal E}}\bigg], \end{align}

using Levinson’s theorem from quantum scattering theory: for k = 0 ${\delta }_{\ell }\!\left (0\right )=\pi \times$ number of bound states.

[Editor’s Note: § 77 of Landau & Lifshitz (Reference Landau and LIfshitz1969) gives a more detailed explanation of the analysis leading to the results in this section. Kaufman’s lectures in this section are not self-contained, depend on other sources, and are more of an overview.]

Example: Bosons. Bosons have symmetric wave functions, and the angular momentum quantum number $\ell$ is even. Note that Fermions have antisymmetric wave functions. If we assume that the bosons have repulsive interactions then there are no bound states, and the first sum inside the square bracket in (1.287) vanishes. Furthermore, from quantum mechanics ${\delta }_{\ell }\lt 0$ , which then leads directly to $b^{\text{int}}_2\!\left (T\right )\gt 0$ ; and the pressure has increased due to the interaction.

Example: Bose gas with hard-sphere repulsive interaction. From the quantum mechanical treatment for the phase shift ${\delta }_{\ell }$ in Schiff (Reference Schiff1968, § 19), there is a treatment for a spherically symmetric interaction potential in the special limit of a hard-sphere interaction, Schiff’s (19.20), which in the low-energy limit simplifies to Schiff’s (19.21). As $T\to 0$ only the $\ell =0$ quantum number significantly contributes, in which limit

(1.288)

\begin{equation} {\delta }_0\!\left (ka\right )=-ka \quad\mathrm{and}\quad b^{\text{int}}_2\!\left (T\to 0\right )={\Lambda}^2\!\left (\beta ,\frac {m}{2}\right )a, \end{equation}

where $r_{ij}=a$ defines the distance between the hard-sphere centers. The pressure increases because b ${}_{2}$ $\gt$ 0. The pressure at low temperatures for the Bose gas with hard-sphere interaction then follows from (1.200), (1.276), and (1.288), and recalling $b_2=b^{(0)}_2+b^{\text{int}}_2$ :

(1.289)

\begin{equation} \frac {P}{nT}=1-\frac {n{\Lambda}^3\!\left (\frac {m}{2}\right )}{16}+n{\Lambda}^2a+O(n^2). \end{equation}

Example: Bose gas with an attractive interaction potential. In this case, bound states are possible and $b^{\text{int}}_2\lt 0$ , so the pressure is reduced. The term involving $\Sigma _{{{{\mathcal E}}}^{\ell }_k\lt 0}{{[e}^{+\beta \vert {{{\mathcal E}}}^{\ell }_k\vert }-1]}$ in (1.287) contributes a net negative contribution to $b^{\text{int}}_2,$ and from quantum mechanics ${\delta }_{\ell }\gt 0$ so the second term in the bracket in (1.287) also contributes a net negative contribution. As $T\to 0\ (\beta \to \infty )$ just a single bound state and only $\ell =0$ are important. Hence, at low temperatures

(1.290)

\begin{equation} b^{\text{int}}_2\!\left (T\to 0\right )=-{\Lambda}^3e^{\beta {{\mathcal E}}_b} \;\textrm{and}\; \frac {P}{nT}=1-ne^{\beta {{\mathcal E}}_b}{\Lambda}^3\!\left (\frac {m}{2}\right ), \end{equation}

where ${{\mathcal E}}_b$ is the disassociation energy, i.e., the binding energy of the molecule.

1.3. Chemical equilibrium

1.3.1. Systems composed of multiple species allowing for chemical reactions

Here we analyze systems with multiple species which interact with one another through a chemical reaction. Some examples of simple systems are

(1.291)

\begin{align} &\mathrm{H}_2\ \leftrightarrow \mathrm{H}+\mathrm{H}\nonumber\\[4pt] &\mathrm{H}\ \ \leftrightarrow \ \ \mathrm{p}^++\ \mathrm{e}^- \nonumber\\[4pt] &2\mathrm{H}_2\mathrm{O}\ \ \leftrightarrow \ \ 2\mathrm{H}_2+\ \mathrm{O}_2\nonumber\\[4pt] &\mathrm{H}^++\ \ \mathrm{Cl}^{-}\leftrightarrow \ \ \mathrm{HCl} \end{align}

As a matter of notation, reactions such as $\mathrm{H}^++\ \mathrm{Cl}^{-}\leftrightarrow \ \mathrm{HCl}$ can be represented generically as $A+B\leftrightarrow \ \ C\equiv AB$ . We assume that a system I supporting a reaction like those in (1.291) is in contact with system II which acts as a heat bath such that

(1.292)

\begin{equation} E=E^I+E^{II}=\textrm {constant} \end{equation}

and the chemical reaction dictates the following conservation laws

(1.293)

\begin{equation} N_A+N_C=N_a=\textrm {constant,}\; N_B+N_c=N_b=\textrm {constant,} \end{equation}

where N ${}_{s}$ is the number of atoms or molecules in the combined system. From the point of view of a grand canonical ensemble, the probability $\rho$ of the system I with $\left \{N^I_A,N^I_B,N^I_C\right \}$ is

(1.294)

\begin{align} &\rho \!\left(N^I_A,N^I_B,N^I_C\right)\sim \sum\limits _{E^I}{{\varGamma }_{\textrm{I}}\!\left (N^I_A,N^I_B,N^I_C,E^I\right )}\nonumber\\[4pt] &\times {\varGamma }_{\textrm{II}}\!\left (N^{II}_a=N_a-\left [N^I_A+N^I_C\right ],N^{II}_B=N_b-\left [N^I_B+N^I_C\right ],E^{II}=E-E^I\right ) \end{align}

and

(1.295)

\begin{equation} {\varGamma }_{\textrm{I}}=e^{S_{\textrm{I}}}\quad \textrm{and}\quad {\varGamma }_{\textrm{II}}=e^{S_{\textrm{II}}}. \end{equation}

We assume that system I is a small perturbation with respect to system II, which allows us to evaluate

(1.296)

\begin{align} S_{\textrm{II}}&=S_{\textrm{II}}\!\left (N_a, N_b,E\right )-E^I\!\left (\frac {\partial S_{\textrm{II}}}{\partial E_{\textrm{II}}}\right )-\left [N^I_A+N^I_C\right ]\!\left (\frac {\partial S_{\textrm{II}}}{\partial N^{II}_a}\right )-\left [N^I_B+N^I_C\right ]\!\left (\frac {\partial S_{\textrm{II}}}{\partial N^{II}_b}\right )\nonumber\\[4pt]&= S_{\textrm{II}}\!\left (N_a, N_b,E\right )-{\beta }_{\textrm{II}}E^I-{\gamma }^{II}_a\left [N^I_A+N^I_C\right ]-{\gamma }^{II}_b\left [N^I_B+N^I_C\right ]. \end{align}

We then use (1.295) and (1.296) to express the probability in (1.294) as

(1.297)

\begin{align} &\rho \!\left(N^I_A,N^I_B,N^I_C\right)\sim \sum\limits _{E^I}{e^{-{\beta }_{\textrm{II}}\!\left (E^I-TS^I\right )}e^{-{{\gamma }_{\ }}^{II}_a\!\left (N^I_A+N^I_C\right )-{{\gamma }}^{II}_b\!\left (N^I_B+N^I_C\right )}}\nonumber\\[4pt] &\quad =\sum\limits _{{E^I}}{e^{-{\beta }\left (E^I-TS^I\right )}e^{-{\gamma }_a\left (N^I_A+N^I_C\right )-{\gamma }_b\left (N^I_B+N^I_C\right )}}, \end{align}

where $\beta ,\,{\gamma }_a, and \,{\gamma }_b$ are determined in (1.296) from partial derivatives on S ${}_{\textrm{II}}$ . We note that $F^I=E^I-TS^I$ . We next introduce the definitions:

Definition:

(1.298)

\begin{equation} {\mu }_A\equiv -T{\gamma }_a,\,{\ \ \mu }_B\equiv -T{\gamma }_b,\quad {\mu }_c\equiv -T({\gamma }_a+{\gamma }_b) \end{equation}

We note that the definitions in (1.298) dictate ${\mu }_c={\mu }_A+{\mu }_B$ . The probability in (1.297) can then be rewritten as

(1.299)

\begin{equation} \rho\! \left(N^I_A,N^I_B,N^I_C\right)\sim \ \sum\limits _{E^I}{e^{-{\beta }F^I+\beta {\boldsymbol \mu }\cdot {\boldsymbol{N}}}= \sum\limits _{{E^I}}{e^{-{\beta }(E^I-TS\!\left (E^I,N\right ))+\beta {\boldsymbol \mu }\cdot {\boldsymbol{N}}}}}, \end{equation}

where ${\boldsymbol \mu }\equiv \{{\mu }_A,{\mu }_B,{\mu }_C\}$ and ${\boldsymbol{N}}\equiv \{N_A,N_B,N_C\}$ .

Lemma: From the formalism in § 1.2.7 and the expression in (1.299) we deduce

(1.300)

\begin{equation} \langle {\boldsymbol{N}}\rangle =T\frac {\partial {\text{ ln } \mathbb {Z}}}{\partial {\boldsymbol \mu }}, \end{equation}

where $\mathbb {Z}$ is the grand canonical partition function:

(1.301)

\begin{align} \mathbb {Z}\!\left ({\boldsymbol \mu },\beta ,V\right )&\equiv \sum\limits _{\{N_s\}}{e^{\beta \sum\limits _s{{\mu }_sN_s}}Z\!\left ({\boldsymbol{N}},\beta ,V\right )=\sum\limits _{\{N_s\}}{e^{\beta \sum\limits _s{{\mu }_sN_s}}\prod\limits _s{Z_s(N_s,\beta ,V)}}}\nonumber\\[4pt]&=\prod\limits _s{{\sum\limits _{\{N_s\}}{e^{\beta {\mu }_sN_s}}Z_s}(N_s,\beta ,V)}=\prod\limits _s{{\mathbb {Z}}_s\!\left (\beta ,{\mu }_s,V\right )} \end{align}

extending expressions in § 1.2.7 to multiple species, using an ideal-gas approximation for $Z\!\left ({\boldsymbol{N}},\beta ,V\right )=\prod\limits _s{Z_s(N_s,\beta ,V)}$ , and recalling ${\mu }_c={\mu }_A+{\mu }_B.$ From (1.301) it follows that

(1.302)

\begin{equation} \text{ ln } \mathbb {Z} =\sum\limits _s{\text{ ln } {\mathbb {Z}}_s\!\left ({\mu }_s,\beta ,V\right )}. \end{equation}

We recall from (1.174) that $\langle N_s\rangle =T({\partial {\text{ ln } \mathbb {Z}}})/{\partial {{\boldsymbol \mu }}_{{\boldsymbol{s}}}}=T({\partial {\text{ ln } {\mathbb {Z}}_s}})/{\partial {{\boldsymbol \mu }}_{{\boldsymbol{s}}}}$ and observe that the existence of the chemical reactions is only felt through ${\mu }_c={\mu }_A+{\mu }_B.$

1.3.2. The law of mass action

From (1.302) and (1.223) we evaluate the pressure

(1.303)

\begin{equation} P\equiv \frac {T}{V}{\text{ ln } \mathbb {Z}}=\sum\limits _s{P_s} \end{equation}

and the partial pressure is

(1.304)

\begin{align} P_s\equiv \frac {T}{V}{\text{ ln } {\mathbb {Z}}_s}&=\frac {T}{V}{\text{ ln } \sum\limits ^{\infty }_{N_s=0}{e^{\beta {\mu }_sN_s}\frac {{\!\left (Z^s_1\right )}^{N^s}}{N_s!}=\frac {T}{V}}}{\text{ ln } {\exp \!\left (e^{\beta {\mu }_s}Z^s_1\right )}}\nonumber\\[4pt]&=\frac {T}{V}\ e^{\beta {\mu }_s}\frac {V}{{\Lambda}^3}Z^{\prime} _s = \frac {T}{{\Lambda}^3}\ e^{\beta {\mu }_s}Z^{\prime} _s, \end{align}

where $Z^s_1=({V}/{{\Lambda}^3})Z^{\prime} _s$ and $Z^{\prime} _s$ is the partition function for the internal states and the sum over $N_s$ in (1.304) is recognized as the exponential. We recall (1.174) and the Gibbs–Duhem relations (1.185) and (1.186) which when used in conjunction with (1.304) yield

(1.305)

\begin{equation} \frac {\partial P_s}{\partial {\mu }_s}=\frac {T}{V}\frac {\partial {\text{ ln } {\textrm {Z}}_s}}{\partial {\mu }_s}=\langle n_s\rangle \end{equation}

and note that $P_s=\langle n_s\rangle T$ . The statistical average $\langle n_s\rangle$ removes statistical fluctuations in the number density. From (1.304) and (1.305) we obtain $\langle n_s\rangle {\Lambda}^3=e^{\beta {\mu }_s}Z^{\prime} _s$ and then with $\langle n_s\rangle \approx n_s$ , ignoring fluctuations and taking the logarithm,

(1.306)

\begin{equation} {\mu }_s=T\big[{\text{ln } \big(n_s{\Lambda}^3\big)-{\text{ ln } Z^{\prime} _s}}\big]. \end{equation}

From ${\mu }_c={\mu }_A+{\mu }_B$ and (1.306) one derives

(1.307)

\begin{equation} {\text{ ln } n_A{{\Lambda}}^3_A}+{\text{ ln } n_B{{\Lambda}}^3_B-{\text{ ln } n_C{{\Lambda}}^3_C={\text{ ln } Z^{\prime} _A+{\text{ ln } Z^{\prime} _B-{\text{ ln } Z^{\prime} _C}}}}} \end{equation}

(1.308)

\begin{equation} \frac {n_A{{\Lambda}}^3_An_B{{\Lambda}}^3_B}{n_C{{\Lambda}}^3_C}=\frac {Z^{\prime} _AZ^{\prime} _B}{Z^{\prime} _C} \end{equation}

for an ideal gas. It is straightforward to include stochiometric coefficients: ${{\nu }_c\mu }_c={\nu }_A{\mu }_A+{{\nu }_B\mu }_B$ , and then (1.308) becomes

(1.309)

\begin{equation} \frac {{\!\left (n_A{{\Lambda}}^3_A\right )}^{{\nu }_A}{\!\left (n_B{{\Lambda}}^3_B\right )}^{{\nu }_B}}{(n_C \Lambda_C^3)^{\nu_C}}=\frac {{\!\left (Z^{\prime} _A\right )}^{{\nu }_A}{\!\left (Z^{\prime} _B\right )}^{{\nu }_B}}{{\!\left (Z^{\prime} _C\right )}^{{\nu }_C}}. \end{equation}

We can group all the temperature dependence in (1.308) on the right-hand side to obtain

(1.310)

\begin{equation} \frac {n_An_B}{n_C}=\frac {{{\Lambda}}^3_C}{{{\Lambda}}^3_A{{\Lambda}}^3_B}\frac {Z^{\prime} _AZ^{\prime} _B}{Z^{\prime} _C}(T). \end{equation}

From $P_s=n_sT$ ignoring fluctuations and (1.310) one then obtains

(1.311)

\begin{equation} \frac {P_AP_B}{P_C}=T\frac {{{\Lambda}}^3_C}{{{\Lambda}}^3_A{{\Lambda}}^3_B}\frac {Z^{\prime} _AZ^{\prime} _B}{Z^{\prime} _C}(T)\equiv K_p(T), \end{equation}

which is (104.3) of Landau & Lifshitz (Reference Landau and LIfshitz1969) and is called the law of mass action which is a formula employed in chemistry. With $c_s\equiv P_s/P$ then (1.311) is equivalent to ${c_Bc_B}/{c_C}=PK_p\!\left (T\right )\equiv K_c\!\left (P,T\right ).$

From the point of view of a closed isolated system with energy E and volume V, and constraint (1.293), we can maximize the entropy S with respect to $\!\left (N_A,N_B,N_c,E,V\right )$ and produce a general derivation of the most probable partition function independent of the assumption of an ideal gas. Then

(1.312)

\begin{equation} 0=\delta S=\delta N_A\frac {\partial S}{\partial N_A}+\delta N_B\frac {\partial S}{\partial N_B}+\delta N_C\frac {\partial S}{\partial N_C}, \end{equation}

with E and V fixed so they do not appear. With ${\gamma }_s\equiv {\partial S}/{\partial N_s}$ and $\delta N_A= \delta N_B=-\ \delta N_C$ , it follows that ${\gamma }_C={\gamma }_A+{\gamma }_B$ . From the definition, ${\mu }_s=-T{\gamma }_s$ we then recover ${\mu }_c={\mu }_A+{\mu }_B.$

1.3.3. Derivation of the Saha equation

Consider a system composed of hydrogen, protons, and electrons, allowing for ionization:

(1.313)

\begin{equation} H\ \leftrightarrow \ p^++\ e^{-}, {\mu }_H={\mu }_p+{\mu }_e,\ \ Z^{\prime} _p=2, \;\textrm{and}\; Z^{\prime} _e=2, \end{equation}

where the internal partition functions capture the two possible spin states (up and down).

From (1.308) assuming negligible excitation of the hydrogen atoms, $T\ll I\sim 13.6\;\textrm {eV,}$

(1.314)

\begin{equation} \frac {n_p{{\Lambda}}^3_pn_e{{\Lambda}}^3_e}{n_H{{\Lambda}}^3_H}=\frac {Z^{\prime} _pZ^{\prime} _e}{Z^{\prime} _H} \end{equation}

where $Z^{\prime} _H=4e^{-\beta E_0}=4e^{\beta I}$ and $E_0=-I$ . We further assume that there is no electron/nuclear spin interaction and simplify (1.314) using $n_p\sim n_e, m_p\sim m_H,$ and ${{\Lambda}}^3_p\sim {{\Lambda}}^3_H$ to obtain the Saha equation:

(1.315)

\begin{equation} \frac {n^2_e{{\Lambda}}^3_e}{n_H}=\frac {Z^{\prime} _pZ^{\prime} _e}{Z^{\prime} _H}=\frac {2\times 2}{4e^{\beta I}}=e^{-\beta I}. \end{equation}

Definition: The degree of ionization is

(1.316)

\begin{equation} f\equiv \frac {[H^+]}{\left [H^+\right ]+[H]}=\frac {n_e}{n_e+n_H}\equiv \frac {n_e}{n_0}. \end{equation}

The total density is defined $n=n_H+n_e+n_p=n_0+n_e=n_0(1+f)$ and P = nT. We use these definitions and divide (1.315) by $n_0=n/(1+f)$ to obtain the following.

From Landau & Lifshitz (Reference Landau and LIfshitz1969, equation (106.5)),

(1.317)

\begin{equation} \frac {f^2}{1-f^2}=\frac {1}{n{\Lambda}^3_e}e^{-\beta I} \to f^2\!\left (n,T\right )=\frac {1}{1+n{\Lambda}^3_ee^{\beta I}}. \end{equation}

Definition: We define the ionization temperature $T_{\textrm{I}}$ by setting $n{\Lambda}^3_e=e^{-\beta I}\ll 1$ so that $I\gg T_{\textrm{I}}$ and

(1.318)

\begin{equation} f(n,T_{\textrm{I}})\equiv \frac {1}{\sqrt {2}}. \end{equation}

From the ionization temperature relation

(1.319)

\begin{equation} {\text{ ln } \frac {1}{n{\Lambda}^3_e}=\frac {I}{T_{\textrm{I}}}}\;\textrm{or}\; T_{\textrm{I}}\!\left (n\right )=\frac {1}{{\text{ ln } \frac {1}{n{\Lambda}^3_e(T_{\textrm{I}})}}} \end{equation}

and

(1.320)

\begin{equation} {\text{ ln } \frac {1}{n{\Lambda}^3_e}=49.3-2.3\left [{\textrm {log}}_{\textrm {10}}n_{{cc}^{-1}}-\frac {3}{2}{\textrm{log}}_{\textrm {10}}T_{eV}\right ].} \end{equation}

Example: For the interstellar gas, $n\sim 1\ \textrm {cm}^{-3},$ $T_{\textrm{I}}={13.6\,\textrm{eV}}/(49.3+ 3.5\text{ ln} (T_{\textrm{I}}\sim {1}/ {4}))=({13.6}/{47.2})\,\textrm {eV}=0.29\,\textrm {eV}.$

Example: For atmospheric densities, $n\sim {10}^{19}\ \textrm {cm}^{-3}$ , $T_{\textrm{I}}=({13.6}/{6.6})\,\textrm{eV}\sim 2\,\textrm {eV}.$

Note: The recombination temperature for hydrogen is $1000^{\circ}-2000^{\circ\textrm{K}}\sim 0.1-0.2\ \,\textrm {eV}$ , which sets a lower limit for a physically realistic value of $T_{\textrm{I}}.$

Equation (1.317) can be expressed using (1.319)

(1.321)

\begin{equation} f\!\left (n,T\right )=\frac {1}{\sqrt {1+{\left (\frac {T_{\textrm{I}}(n)}{T}\right )}^{3/2}e^{T_{\textrm{I}}(n)/T}}}. \end{equation}

The fractional ionization in (1.321) is plotted in figure 10 as a function of ${T/T}_{\textrm{I}}(n)$ ; we note that most of the change in f occurs in the range $1/2\le \,{T/T}_{\textrm{I}}\le 3/2$ .

Figure 10. Fractional ionization versus ${T/T}_{\textrm{I}}(n)$ based on (1.321).

1.3.4. Chemical equilibrium including ionization and excited states

We now extend the analysis in § 1.3.3 to include excitation of the hydrogen atom as a first correction. The internal partition function becomes

(1.322)

\begin{equation} Z^{\prime} _H=\sum\limits ^{\infty }_{n=1}{n^2e^{\frac {\beta I}{n^2}}}, \end{equation}

where n is the principal quantum number and the multiplier $n^2$ account for degeneracy ignoring spin degeneracy. We note that the maximum atomic radius must scale as $R_{\text{max}}\sim {1}/{{n_0}^{1/3}}$ where n ${}_{0}$ is the density of atoms, while the atomic radius for an atom in the nth excited state scales as $R\approx n^2a_0$ where $a_0$ is the Bohr radius. Setting $R\approx R_{\text{max}}$ we deduce the maximum quantum number allowed

(1.323)

\begin{equation} n_{\text{max}}=\sqrt {\frac {R_{\text{max}}}{a_0}}=\sqrt {\frac {1}{n^{1/3}_0a_0}}=\frac {1}{{\!\left (n_0a^3_0\right )}^{1/6}}\gg 1. \end{equation}

Returning to (1.322)

(1.324)

\begin{equation} Z^{\prime} _H=e^{\beta I}+\sum\limits ^{\infty }_{n=2}{n^2e^{\frac {\beta I}{n^2}}}\sim e^{\beta I}+\int\nolimits ^{n_{\text{max}}}_0{n^2\text{d}n=}e^{\beta I}+\frac {1}{3}n^3_{\text{max}}=e^{\beta I}+\frac {1}{3}\frac {1}{{\!\left (n_0a^3_0\right )}^{1/2}}. \end{equation}

The claim is that although ${1}/{{\!\left (n_0a^3_0\right )}^{1/2}}$ is large compared with unity, it is small compared with $e^{\beta I}$ which scales as $e^{\beta I}\sim {1}/{n_0{\Lambda}^3_e}$ based on (1.315). Hence, the excited state’s contribution to the internal partition function is negligible.

1.4. Long-range interactions

Long-range interactions are important in neutral and nonneutral fluids and gases. Some examples of interactions are Coulomb interactions affecting both nonneutral and neutral systems. Dipole interactions are important when there is an applied electric field. Gravitational interactions are significant in neutral systems at long scales. Some of the topics addressed in this section include self-consistent fields, spatial nonuniformity, quasineutrality, Debye shielding, and a virial theorem.

1.4.1. Classical treatment of interactions: Coulomb, dipole, etc.

We postulate a classical treatment for a system in contact with a heat bath. Then the probability function will have the form

(1.325)

\begin{equation} \rho \!\left (p,q\right )\sim e^{-\beta (K+\Phi )}\sim e^{-\beta K}e^{-\beta \Phi }\sim {\rho }_p(p){\rho }_q(q) \end{equation}

for a canonical ensemble with prescribed temperature T. The configuration space probability function obeys

(1.326)

\begin{equation} {\rho }_q\!\left ({{\boldsymbol{r}}}^{\!\left (N\right )}\right )=\frac {e^{-\beta \Phi }}{Q},\quad Q\equiv \int\nolimits {\textrm{d}^Nre^{-\beta \Phi }}, \end{equation}

where N rolls up the dimensionality of the system and the number of elements. For the Coulomb interaction we have

(1.327)

\begin{equation} \Phi =\sum\limits _{i\lt j}{\frac {e_ie_j}{r_{ij}}+\sum\limits _i{e_i\phi _0({{\boldsymbol{r}}}_i)}}, \end{equation}

where $\Phi$ is the total electrostatic potential; and the electrostatic electric field satisfies ${\boldsymbol{E}}=-\nabla \Phi$ . Gauss’ law relates the electric field to the charge density.

The one-particle density is defined by

(1.328)

\begin{equation} n_1\!\left ({\boldsymbol{x}}\vert {{\boldsymbol{r}}}_i\right )\equiv \delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_1\right ),\quad \int\nolimits {\textrm{d}^3\textit{x}\ n_1\!\left ({\boldsymbol{x}}\vert {{\boldsymbol{r}}}_1\right )=1}. \end{equation}

The ensemble average of n ${}_{1}$ given a probability density function as in (1.325) and (1.326) is

(1.329)

\begin{equation} \langle n_1\rangle \!\left ({\boldsymbol{x}}\right )=\int\nolimits {\textrm{d}^Nr}\rho \!\left ({{\boldsymbol{r}}}^N\right )\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_1\right )=\sum\limits ^N_{i=1}{\langle n_i\rangle ({\boldsymbol{x}})}. \end{equation}

We note that $\langle n_1\rangle \!\left ({\boldsymbol{x}}\right )={1}/{V}$ if $\phi \ne \phi ({\boldsymbol{x}})$ , i.e., if $\phi$ has no spatial dependence. If we introduce more than one species s, we have

(1.330)

\begin{equation} n^s\!\left ({\boldsymbol{x}}\right )=\sum\limits ^{N_s}_{i=1}{\langle n_{i,s}\rangle ({\boldsymbol{x}})}. \end{equation}

We also note that as a matter of economy in notation

(1.331)

\begin{equation} \rho \!\left ({{\boldsymbol{r}}}_1,\,{{\boldsymbol{r}}}_2,\ \dots , {{\boldsymbol{r}}}_N\right )\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_1\right )=\rho ({{\boldsymbol{r}}}^N)\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_1\right ) \end{equation}

and

(1.332)

\begin{equation} \langle n_1\rangle \!\left ({\boldsymbol{x}}\right )=\int\nolimits {\textrm{d}^N{\boldsymbol{r}}}\rho \!\left ({{\boldsymbol{r}}}^N\right )\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_1\right )\equiv \ \rho \!\left ({{\boldsymbol{r}}}_1={\boldsymbol{x}}\right ). \end{equation}

Consider the spatial gradient of the number density in (1.330)

(1.333)

\begin{align} \frac {\partial }{\partial {\boldsymbol{x}}}n^s\!\left ({\boldsymbol{x}}\right )\equiv \frac {\partial }{\partial {\boldsymbol{x}}}\sum\limits ^{N_s}_{i=1}{\langle n_{i,s}\rangle \!\left ({\boldsymbol{x}}\right )}&=\sum\limits ^{N_s}_{i=1}{\int\nolimits {\textrm{d}^Nr}\rho \!\left ({{\boldsymbol{r}}}^N\right )\frac {\partial }{\partial {\boldsymbol{x}}}\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i\right )}\nonumber \\[4pt] &=-\sum\limits ^{N_s}_{i=1}{\int\nolimits {\textrm{d}^Nr}\rho \!\left ({{\boldsymbol{r}}}^N\right )\frac {\partial }{\partial {{\boldsymbol{r}}}_i}\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i\right )}\nonumber \\[4pt] &=\sum\limits ^{N_s}_{i=1}{\int\nolimits {\textrm{d}^Nr}\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_1\right )}\frac {\partial }{\partial {{\boldsymbol{r}}}_{{\boldsymbol{i}}}}\rho \!\left ({{\boldsymbol{r}}}^N\right )\nonumber\\[4pt] &=-\beta \sum\limits ^{N_s}_{i=1}{\int\nolimits {\textrm{d}^Nr}\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i\right )}\frac {\partial \Phi }{\partial {{\boldsymbol{r}}}_i}\frac {1}{Q}e^{-\Phi }\nonumber \\[4pt] &=- \beta \sum\limits ^{N_s}_{i=1}{\bigg\langle \delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i\right )\frac {\partial \Phi }{\partial {{\boldsymbol{r}}}_i}\bigg\rangle }, \end{align}

integrating by parts and identifying the averaging process along the way. We recall (1.327) and take its gradient

(1.334)

\begin{equation} \frac {\partial \Phi }{\partial {{\boldsymbol{r}}}_{{\boldsymbol{i}}}}=\sum\limits _{j\ne i}{\frac {\partial }{\partial {{\boldsymbol{r}}}_{{\boldsymbol{i}}}}\frac {e_ie_j}{r_{ij}}+\sum\limits _j{\frac {\partial }{\partial {{\boldsymbol{r}}}_i}{e_j\phi }_0({{\boldsymbol{r}}}_i)}}\equiv \sum\limits _{j\ne i}{\frac {\partial }{\partial {{\boldsymbol{r}}}_i}{\Phi }_{ij}+\sum\limits _j{\frac {\partial }{\partial {{\boldsymbol{r}}}_i}{\Phi }_j({{\boldsymbol{r}}}_i)}}. \end{equation}

We return to (1.134) to obtain

(1.335)

\begin{align} \frac {\partial }{\partial {\boldsymbol{x}}}n^s\!\left ({\boldsymbol{x}}\right ) &=-\beta \left [n_s\!\left ({\boldsymbol{x}}\right )\frac {\partial {\Phi }_s}{\partial {\boldsymbol{x}}}+\sum\limits ^s_i{\sum\limits _{j\ne i}{\left\langle n_i({\boldsymbol{x}})\frac {\partial }{\partial {\boldsymbol{x}}}\int\nolimits {\textrm{d}^3\textit{x}'{\Phi }_{ss^{\prime} }\delta ({{\boldsymbol{x}}}^{\prime} -{{\boldsymbol{r}}}_j)}\right\rangle }}\right ]\nonumber \\[4pt] &=-\beta \left [n_s\!\left ({\boldsymbol{x}}\right )\frac {\partial {\Phi }_s}{\partial {\boldsymbol{x}}}+\sum\limits _{s^{\prime} }{\sum\limits ^s_i{\sum\limits ^{s'}_{j\ne i}{\int\nolimits {\textrm{d}^3x^{\prime} \langle n_i\!\left ({\boldsymbol{x}}\right )n_j\!\left ({{\boldsymbol{x}}}^{\prime} \right )\rangle \ \frac {\partial }{\partial {\boldsymbol{x}}}}}}}{\Phi }_{ss^{\prime} }({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} )\right ] \end{align}

and

(1.336)

\begin{align} \langle n_i\!\left ({\boldsymbol{x}}\right )n_j\!\left ({{\boldsymbol{x}}}^{\prime} \right )\rangle &\equiv \int\nolimits {\textrm{d}^Nr\rho \!\left ({{\boldsymbol{r}}}^N\right )}\ \delta \!\left ({{\boldsymbol{x}}}-{{\boldsymbol{r}}}_i\right )\delta \!\left ({{\boldsymbol{x}}}^{\prime} -{{\boldsymbol{r}}}_j\right )\equiv \rho \!\left ({{\boldsymbol{r}}}_i={\boldsymbol{x}},{{\boldsymbol{r}}}_j={{\boldsymbol{x}}}^{\prime} \right )\nonumber \\[4pt] &= \rho \!\left ({{\boldsymbol{r}}}_i={\boldsymbol{x}})\rho ({{\boldsymbol{r}}}_j={{\boldsymbol{x}}}^{\prime} \right )=\langle n_i\rangle \!\left ({\boldsymbol{x}}\right )\langle n_j\rangle \!\left ({\boldsymbol{x}}{\mathbf '}\right )\left [1+g_{ij}({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} )\right ], \end{align}

where the correlation function is $g_{ij}\ll 1$ for dilute gases and/or for ${{\Phi }_{ij}}/{T}\ll 1$ . We shall neglect $g_{ij}$ presently and proceed. We will check on $g_{ij}$ later.

We return to (1.335) and note that $\Sigma ^{N_{s^{\prime} }}_{j\ne i}=\Sigma ^{N_{s^{\prime} }}_j{-\Sigma _{j=i}{= }O\!\left (N_s\right )-O(1)\approx \Sigma ^{N_{s^{\prime} }}_j}.$ Furthermore, we set $\Sigma ^{N_s}_i{\langle n_i\rangle \!\left ({\boldsymbol{x}}\right )}=n_s\!\left ({\boldsymbol{x}}\right )$ and $\Sigma ^{N_s'}_i{\langle n_i\rangle \!\left ({\boldsymbol{x}}{\mathbf '}\right )}=n_{s'}\!\left ({\boldsymbol{x}}{\mathbf '}\right ).$ Hence, (1.335) becomes

(1.337)

\begin{equation} \frac {\partial }{\partial {\boldsymbol{x}}}n^s\!\left ({\boldsymbol{x}}\right )=-\beta n_s\!\left ({\boldsymbol{x}}\right )\frac {\partial }{\partial {\boldsymbol{x}}}\left [{\Phi }_s+\int\nolimits {\textrm{d}^3{{\boldsymbol{x}}}^{\prime} \sum\limits _{s^{\prime} }{n_{s'}\!\left ({\boldsymbol{x}}{\mathbf '}\right )}}{\Phi }_{ss^{\prime} }({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} )\right ]. \end{equation}

We introduce the total self-consistent electrostatic potential $\phi ({\boldsymbol{x}})$ neglecting correlations,

(1.338)

\begin{equation} \phi \!\left ({\boldsymbol{x}}\right )\equiv \phi ^0\!\left ({\boldsymbol{x}}\right )+\int\nolimits {\textrm{d}^3{\textit{x}}^{\prime} \sum\limits _{s^{\prime} }{\frac {{e_{s'}n}_{s'}\!\left ({\boldsymbol{x}}{\mathbf '}\right )}{\vert \boldsymbol{x}-{\boldsymbol{x}}'\vert} }}. \end{equation}

Equation (1.337) then becomes

(1.339)

\begin{equation} \frac {\partial }{\partial {\boldsymbol{x}}}{\ n}_s=-\beta e_sn_s\!\left ({\boldsymbol{x}}\right )\frac {\partial }{\partial {\boldsymbol{x}}}\ \partial \!\left ({\boldsymbol{x}}\right ) \end{equation}

(1.340)

\begin{equation} \frac {\partial }{\partial {\boldsymbol{x}}}{\text{ ln } {\ n}_s=-\beta {e_s}\frac {\partial }{\partial {\boldsymbol{x}}}\ \phi \!\left ({\boldsymbol{x}}\right )} \to n_s\!\left ({\boldsymbol{x}}\right )=n_s(0)e^{-\beta e_s\phi \left ({\boldsymbol{x}}\right )}\!. \end{equation}

Poisson–Boltzmann equation: We apply the Laplacian to (1.338) and derive Poisson’s equation having identified $\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{x}}}^{\prime} \right )$ from ${\nabla }^2({1}/{\vert \boldsymbol{x}}-{\boldsymbol{x}}'\vert)$ inside the volume integral with respect to ${{\boldsymbol{x}}}^{\prime}$ :

(1.341)

\begin{equation} {\nabla }^2\phi =-4\pi \left [{\rho }^0\!\left ({\boldsymbol{x}}\right )+\sum\limits _s{{e_sn}_s(0)e^{-\beta e_s\phi \left ({\boldsymbol{x}}\right )}}\right ]. \end{equation}

Equation (1.341) is a single quasilinear partial differential equation for the electric potential.

1.4.2. Example of an electron gas and Coulomb interaction

Consider an electron gas with $\phi ^0\!\left ({\boldsymbol{x}}\right )=0$ . Equation (1.341) becomes

(1.342)

\begin{equation} {\nabla }^2\phi =-4\pi {en}_0e^{-\beta e\phi \left ({\boldsymbol{x}}\right )}. \end{equation}

Define $\psi \equiv \ \beta e\phi \!\left ({\boldsymbol{x}}\right )={e\phi }/{T}$ , from which (1.342) becomes

(1.343)

\begin{equation} {\nabla }^2\psi =-K^2e^{-\psi }, \end{equation}

where $K^2\equiv 4\pi \beta {e^2n}_0=1/{\lambda }^2_{\textrm{Debye}}$ . Now consider a one-dimensional limit of (1.343) and solve by standard methods:

(1.344)

\begin{equation} \frac {\textrm{d}^2}{\textrm{d}x^2}\psi =-K^2e^{-\psi } \to \psi \!\left (x\right )=-2{\text{ ln } \left [{\textrm {s}\textrm {ec} \!\left (\frac {Kx}{\sqrt {2}}\right )}\right ]} \end{equation}

and

(1.345)

\begin{equation} n\!\left (x\right )=n(0){\textrm {sec}}^{\textrm {2}}\!\left (\frac {Kx}{\sqrt {2}}\right ). \end{equation}

We note that the number of electrons N, assumed large, can be determined from the integral of (1.345) over a domain defined by [ $-$ a,a]:

(1.346)

\begin{equation} N=\int\nolimits ^a_{-a}{\textrm{d}x\ n\!\left (x\right )=2n\!\left (x=0\right )\frac {\sqrt {2}}{K}{\tan \!\left (\frac {Ka}{\sqrt {2}}\right )}}. \end{equation}

Assume that the argument of the tan( ) in (1.346) approaches $\pi /2$ so that N is large, but not too close. From

(1.347)

\begin{equation} \frac {Ka}{\sqrt {2}}\approx \frac {\pi}{2} \to n_0\approx \frac {\pi}{8}\frac {T}{e^2}\frac {1}{a^2} \to {10}^8\,{\textrm {cm}}^{-3} \end{equation}

for $T\sim 10\ \,\textrm {eV}$ and $a\sim 1\ \ \textrm {cm}.$ This estimate is independent of any specific value of N. Given (1.345) which diverges at argument value ${\pi}/{2}$ , as more particles are added they tend to end up at the edges, which is a consequence of the electron–electron repulsion. This is very different from a neutral system. Suppose N = 10 ${}^{12}$ instead of infinity. Then from (1.346) and (1.344) with a = 1 cm,

(1.348)

\begin{align} \frac {Ka}{\sqrt {2}}\sim {\textrm {tan}}^{-1}\!\left (\frac {N}{n_0a}\right )&={\textrm{tan}}^{-1}\!\left (\frac {N}{{10}^8}\right )\to \psi \!\left (a\right )\nonumber \\[4pt] &=-2{\text{ ln } \left [{\sec \!\left (\frac {Ka}{\sqrt {2}}\right )}\sim {\tan \!\left (\frac {Ka}{\sqrt {2}}\right )=\frac {N}{n_0a}}\right ]}\nonumber \\[4pt]&=-2{\text{ ln}(N}{10}^{-8})\approx -20. \end{align}

Hence, $\vert e\phi \vert \!\left (a\right )\approx 20T=200\ \,\textrm {eV}$ for the parameters of this example.

1.4.3. Example of a stellar cluster and gravitational interaction

Consider a system of charge-neutral, finite-mass elements interacting through gravitational forces. Assume that the elements share equal masses. Introduce a gravitational potential $\psi (x)$ such that

(1.349)

\begin{align} n\!\left (x\right )&=n_0e^{-\beta m_1\psi \left (x\right )}, \;{\nabla }^2\psi =4\pi Gm_1n\!\left (x\right )={4\pi Gm_1n}_0e^{-\beta m_1\psi \left (x\right )}. \end{align}

Introduce $K^2\equiv ({4\pi Gm^2_1n_0})/({T})$ and $\Psi \equiv ({m_1\psi })/({T})$ so that (1.349) becomes

(1.350)

\begin{equation} {\nabla }^2\Psi = K^2e^{-\Psi }. \end{equation}

It can be shown that (1.350) has the solution

(1.351)

\begin{align} \Psi \!\left (\textit{x}\right )&=\textrm {2 ln}\;{\cosh \!\left (\frac {Kx}{\sqrt {2}}\right ) \to \sqrt {2}Kx\ }\textrm{for large}\; \textit {x},\nonumber \\[4pt] n\!\left (x\right )&=n_0{\ \textrm {sech}}^2\!\left (\frac {Kx}{\sqrt {2}}\right ) \to 4n_0e^{-\frac {2Kx}{\sqrt {2}}}\ \textrm{for large}\ x. \end{align}

In three dimensions with spherical symmetry the gravitational potential has the asymptotic limit for large r, $\psi \to -{GM_{\text{tot}}}/{r}$ and $n(r)\to n_0e^{-{\beta m_1GM_{\text{tot}}}/{r}}$ , so that $n(\infty )\to n_0$ which is a contradiction ( $n(\infty )$ needs to vanish). In three dimensions with spherical symmetry, one cannot satisfy the equations of thermal equilibrium. Hence, clusters are constantly losing particles; and, similarly, terrestrial atmospheres are constantly losing particles.

1.4.4. Example with ions and electrons, and Coulomb interaction

Consider a plasma with electrons and singly charged ions (or perhaps positrons). Gauss’ law leads to Poisson’s equation:

(1.352)

\begin{equation} {\nabla }^2\phi =-4\pi e\left [n_i\!\left ({\boldsymbol{x}}\right )-n_e\!\left ({\boldsymbol{x}}\right )\right ] \to n_e\!\left ({\boldsymbol{x}}\right )=n_i\!\left ({\boldsymbol{x}}\right )+\frac {1}{4\pi e}{\nabla }^2\phi . \end{equation}

Equation (1.352) becomes a statement characterizing quasineutrality if $({1}/{4\pi e}){\nabla }^2\phi \ll n_i$ . In thermal equilibrium

(1.353)

\begin{equation} n_e\!\left ({\boldsymbol{x}}\right )=n^0_ee^{\beta e\phi }. \end{equation}

Suppose L ${}_{n}$ is the scale length for the density gradient. Then using $\beta e\phi ={\text{ ln } {{\textrm {} n_e}}/{n^0_e},}$ (1.352) leads to

(1.354)

\begin{equation} n_e\!\left ({\boldsymbol{x}}\right )=n_i\!\left ({\boldsymbol{x}}\right )+\frac {1}{4\pi \beta e^2}\frac {1}{L^2_{n_i}}=n_i\!\left ({\boldsymbol{x}}\right )\left [1+\frac {1}{4\pi \beta n_ie^2}\frac {1}{L^2_{n_i}}\right ]. \end{equation}

Definition: ${\lambda }_D\equiv \sqrt {{T}/{4\pi ne^2}}$ is the Debye length. Hence, ${\lambda }^2_D={1}/{4\pi \beta e^2n}.$

Equation (1.354) then becomes

(1.355)

\begin{equation} n_e\!\left ({\boldsymbol{x}}\right )=n_i\!\left ({\boldsymbol{x}}\right )\left [1+O\!\left (\frac {{\lambda }^2_D}{L^2_{n_i}}\right )\right ]. \end{equation}

The value of ${\lambda }_D$ is millimeters in many laboratory plasmas and meters in many space plasmas. Quasineutrality corresponds to $L^2_{n_i}\gg \,{\lambda }^2_D.$

1.4.5. Example with ions and electrons, and Coulomb and gravitational interactions

Next we consider a simple one-dimensional system composed of two species (ions and electrons) with Coulomb and gravitational interactions. In thermal equilibrium we have

(1.356)

\begin{equation} n_e\!\left ({\boldsymbol{x}}\right )=n^0_ee^{-\beta [-e\phi +m_egz]}, n_i\!\left ({\boldsymbol{x}}\right )=n^0_ie^{-\beta [e\phi +m_igz]}\ \end{equation}

and Poisson’s equation (1.352) is unmodified. Again $n_e\approx n_i$ . At z = 0 define $\phi =0$ ; then $n^0_e=n^0_i$ . As a consequence of quasineutrality and (1.356)

(1.357)

\begin{equation} e\phi +m_igz=-e\phi +m_egz. \end{equation}

We can then solve (1.357) for $\phi$ and take its gradient to determine the electric field:

(1.358)

\begin{equation} e\phi =-\tfrac{1}{2}\!\left (m_i-m_e\right )gz \end{equation}

and

(1.359)

\begin{equation} e{\boldsymbol{E}}=\tfrac{1}{2}\!\left (m_i-m_e\right ){\boldsymbol{g}}. \end{equation}

The equilibrium number densities are then

(1.360)

\begin{equation} n_e\!\left ({\boldsymbol{x}}\right )\approx n_i\!\left ({\boldsymbol{x}}\right )=n^0_ie^{-\frac{1}{2}\beta \left (m_i+m_e\right )gz}. \end{equation}

1.4.6. Example with ions and electrons, and Coulomb interaction with correlations: Debye–H $\ddot {\text{u}}$ ckel theory and shielding

Here we assume a plasma in thermal equilibrium with multiple charge species and an imposed test-particle distribution. Poisson’s equation becomes

(1.361)

\begin{equation} {\nabla }^2\phi =-4\pi \left [\sum\limits _s{e_sn_s\!\left ({\boldsymbol{x}}\right )+{\rho }^0_e({\boldsymbol{x}})}\right ], \end{equation}

where

(1.362)

\begin{equation} n_s\!\left ({\boldsymbol{x}}\right )=n^0_se^{-\beta e_s\phi ({\boldsymbol{x}})},\quad \phi \to \phi ^{(0)}+\delta \phi. \end{equation}

The linearly perturbed Poisson equation and charge density satisfy

(1.363)

\begin{equation} {\nabla }^2\delta \phi =-4\pi \left [\sum\limits _s{e_s{\delta n}_s\!\left ({\boldsymbol{x}}\right )+{\delta \rho }^0_e({\boldsymbol{x}})}\right ] \end{equation}

and

(1.364)

\begin{equation} {\delta n}_s\!\left ({\boldsymbol{x}}\right )={-n}^0_s({\boldsymbol{x}}){\beta e_s\delta \phi ({\boldsymbol{x}})}. \end{equation}

Substituting (1.364) in (1.363) one obtains

(1.365)

\begin{equation} {\nabla }^2\delta \phi =\left [4\pi \beta \sum\limits _s{e^2_s}n^0_s\!\left ({\boldsymbol{x}}\right )\right ]\delta \phi \!\left ({\boldsymbol{x}}\right )-4\pi {\delta \rho }^0_e\!\left ({\boldsymbol{x}}\right ) =\\[4pt] \frac {1}{{\lambda }^2_D}\ \delta \phi \!\left ({\boldsymbol{x}}\right )-4\pi e_0\delta \!\left ({\boldsymbol{x}}\right ), \end{equation}

where the Debye length ${\lambda }_D$ is defined here by

(1.366)

\begin{equation} {\lambda }_D\equiv \sqrt {\frac {T}{4\pi \sum\limits _s{e^2_s}n^0_s\!\left ({\boldsymbol{x}}\right )}}. \end{equation}

The solution of (1.365) with boundary conditions $\delta \phi \!\left (r\to \infty \right )=0$ and regularity at $r=0$ is

(1.367)

\begin{equation} \delta \phi \!\left ({\boldsymbol{x}}\right )=\frac {e_0}{r}e^{-\!\left (r/{\lambda }_D\right )} \end{equation}

and with the assumption of quasineutrality in the unperturbed equilibrium charge densities

(1.368)

\begin{equation} \delta n_e=\beta e\delta \phi n^0_e\gt 0\, \, \, \textrm {and}\, \, \, \delta n_i=-\beta e\delta \phi n^0_e\lt 0. \end{equation}

Thus, there is a slight excess of electrons around the test charge at x = 0 and a slight deficiency in the ions. Both perturbations in the charge densities decay spatially on the scale of the Debye length. This is a manifestation of plasma shielding of the test charge.

We next construct the conditional probability

(1.369)

\begin{align} {\rho }_{ij}\!\left ({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} \right )&\equiv {\rho }_i\!\left ({\boldsymbol{x}}\right ){\rho }_j\!\left ({{\boldsymbol{x}}}^{\prime} ;i\ at\,{\boldsymbol{x}}\right )=\langle n_i\rangle \!\left ({\boldsymbol{x}}\right )\langle n_j\rangle \!\left ({{\boldsymbol{x}}}^{\prime} ;i\ \textrm {at}\,{\boldsymbol{x}}\right )\nonumber \\[4pt] &=\langle n_i\rangle \!\left ({\boldsymbol{x}}\right )\langle n_j\rangle ({{\boldsymbol{x}}}^{\prime} )\left [1+\textrm {correction due to}\ i\ \textrm {at}\,{\boldsymbol{x}}\right ], \end{align}

where the correction is due to the perturbation associated with $\delta n_e$ :

(1.370)

\begin{align} \langle n_j\rangle \!\left ({{\boldsymbol{x}}}^{\prime} ;i\ \textrm {at}\,{\boldsymbol{x}}\right )&=\langle n_j\rangle \!\left ({{\boldsymbol{x}}}^{\prime} \right )\left [1+\!\left (-\beta \right )e_j\delta \phi \!\left ({{\boldsymbol{x}}}^{\prime} \right )\right ]\nonumber\\[4pt] &=\langle n_j\rangle \!\left ({{\boldsymbol{x}}}^{\prime} \right )\left [1+\!\left (-\beta \right )e_j\frac {e_i}{\vert \boldsymbol{x}-{\boldsymbol{x}}'\vert} e^{-\frac {\vert \boldsymbol{x}-{\boldsymbol{x}}'\vert }{{\lambda }_D}}\right ]. \end{align}

The conditional probability including the correlation function correction is then obtained from (1.369) and (1.370):

(1.371)

\begin{equation} {\rho }_{ij}\!\left ({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} \right )\equiv {\rho }_i\!\left ({\boldsymbol{x}}\right ){\rho }_j\!\left ({{\boldsymbol{x}}}^{\prime} \right )\left [1+\!\left (-\beta \frac {{e_je}_i}{\vert \boldsymbol{x}-{\boldsymbol{x}}'\vert} e^{-\frac {\vert \boldsymbol{x}-{\boldsymbol{x}}'\vert }{{\lambda }_D}}\right )\right ]. \end{equation}

Is the correlation $g_{ij}\!\left ({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} \right )\equiv -\beta ({e_je_i}/{\vert x-x'\vert }) e^{-{\vert \boldsymbol{x}-{\boldsymbol{x}}'\vert}/{{\lambda }_D}} \ll 1?$ We require for the consistency of the theory for the self-consistent field solution that at least the effect of $g_{ij}$ , if not $g_{ij}$ itself, is small. If ${\lambda }_D$ is very small, then long-range 1/r ${}^{2}$ forces are much reduced, and only short-range effects survive. Thus, electrically charged particles and their systems are uncorrelated in space if ${\lambda }_D$ is small; and all sorts of quantities such as the entropy and energies are additive.

Consider the internal energy:

(1.372)

\begin{align} \langle U\rangle &=\frac {3}{2}NT+\left\langle \sum\limits _{i\lt j}{\frac {e_ie_j}{r_{ij}}}\right\rangle \nonumber\\[4pt]&=\frac {3}{2}NT+\left\langle \sum\limits _{i\lt j}e_ie_j\int\nolimits \textrm{d}^3\textit{x}\int\nolimits \textrm{d}^3{\textit{x}}^{\prime} \frac {\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{x}}}_i\right )\delta \!\left ({\boldsymbol{x}}{\mathbf '}-{{\boldsymbol{x}}}_j\right )}{\vert \boldsymbol{x}-{{\boldsymbol{x}}}^{\prime} \vert }\right\rangle \nonumber \\[4pt] &=\frac {3}{2}NT+\frac{1}{2}\int\nolimits {\textrm{d}^3\textit{x}\int\nolimits {\textrm{d}^3{\textit{x}}^{\prime} \frac {\rho \!\left ({\boldsymbol{x}}\right ){\rho }_j\!\left ({{\boldsymbol{x}}}^{\prime} \right )}{\vert \boldsymbol{x}-{{\boldsymbol{x}}}^{\prime} \vert} +{\langle U_{\textrm {correlation}}\rangle }}}\nonumber\\[4pt]&= \frac {3}{2}NT+\int\nolimits {\textrm{d}^3\textit{x}}\frac {{\vert \boldsymbol{E}}({\boldsymbol{x}})\vert ^2}{8\pi }-\dfrac{1}{2}\int\nolimits {\textrm{d}^3\textit{x}}\sum\limits _s{n_s({\boldsymbol{x}})\frac {e^2_s}{{\lambda }_D}}, \end{align}

where the second term is the field energy due to the macro field associated with not having exact charge neutrality. Where does this result for $\langle U_{\text {correlation}}\rangle$ come from? One can expand the expression for $\delta \phi$

(1.373)

\begin{equation} \delta \phi =\frac {e_i}{r}e^{-\frac {r}{{\lambda }_D}}\approx \frac {e_i}{r}\!\left (1-\frac {r}{{\lambda }_D}\right )+O(r). \end{equation}

Thus, the correction to the electric potential due to the shielding cloud is $\delta \phi _{\text{cloud}} = -{e_i}/{{\lambda }_D}+O\!\left (r\right )$ . The associated energy is $e_i\delta \phi _{\text{cloud}}$ (0) = energy of the test particle interacting with the cloud $=-({e^2_i}/{{\lambda }_D})+O(r)\vert _{r=0}$ . Hence,

(1.374)

\begin{equation} U_{\text {correlation}}=\frac{1}{2}\int\nolimits {\textrm{d}^3\textit{x}}\sum\limits _s{n_s(x)\frac {(-e^2_s)}{{\lambda }_D}}. \end{equation}

Note that the factors of 1/2 that appear in the intermediate expression in (1.373) and in $U_{\text{correlation}}$ arise from counting only the unique pairs from the double sums over particles.

The condition for weak correlations on which the validity of the Debye–H $\ddot {\textrm {u}}$ ckel theory depends is

(1.375)

\begin{equation} \frac{1}{2}\frac {Ne^2}{{\lambda }_D}\ll \frac {3}{2}NT \to \frac {e^2}{{\lambda }_D}\ll T. \end{equation}

Define the radius $R_T\equiv {e^2}/{T}$ . For $T\sim 10$ eV, . The condition for weak correlations can be rewritten as

(1.376)

\begin{equation} nR^3_T\ll 1, \end{equation}

i.e., if there are many particles within the strong interaction distance $R_T$ the correlations are not weak. For $T\sim 10$ eV and $n\ll {10}^{24}\textrm {c}{\textrm {m}}^{-3}$ the interactions are weakly correlated, i.e., at anything less than solid densities the interactions are weakly correlated. We note that the ratio of $U_{\textrm{correlation}}$ to the total kinetic energy $({3}/{2})NT$ scales as

(1.377)

\begin{equation} \frac {U_{\text {correlation}}}{\frac {3}{2}NT}\ \sim \ \sqrt {nR^3_T}. \end{equation}

Now that we know $\langle$ U $\rangle$ in (1.373), define it simply as U, the total internal energy; and we can derive the other thermodynamic quantities from the relations

(1.378)

\begin{align} &U=-\frac {\partial {\text{ ln } \textrm {Z}}}{\partial \beta },\quad F=-T\text{ ln } \textrm {Z}\ ,\nonumber\\[4pt]& {\mathcal S}={\text{ ln } \textrm {Z}+\beta U,\quad {\mu }_s=}{\left .\frac {\partial F}{\partial N_s}\right .}\bigg \vert _{V,T}=T{\text{ ln } n_s{\Lambda}^3{{+\frac{1}{2}}}\epsilon _s+e_s\phi }, \end{align}

where $\epsilon _s\equiv -{e^2_s}/{{\lambda }_D}$ is due to the interaction of the particle with its own shielding cloud and the intrinsic electrochemical potential is identified as

(1.379)

\begin{equation} {\mu }^0_s\equiv \ T{\text{ ln } n_s{\Lambda}^3{{+\tfrac{1}{2}\epsilon _s}}}. \end{equation}

We note as a warning that if we violate the validity condition in (1.375) and (1.376) we can obtain a large correlation energy, but it is still short range in its effect.

For chemical equilibrium in a nonuniform medium one requires

(1.380)

\begin{equation} 0=\nabla {\mu }_s=\nabla {\mu }^0_s+\nabla e_s\phi =\sum\limits _{s'}\frac {\partial {\mu }^0_s(n_{s^{\prime} })}{\partial n_{s'}}\nabla n_{s'}+e_s\nabla \phi ,\quad {\boldsymbol{E}}=-\nabla \phi , \end{equation}

from which

(1.381)

\begin{equation} {\boldsymbol{E}} = \frac {1}{e_s}\sum\limits _{s'}{\frac {\partial {\mu }^0_s(n_{s^{\prime} })}{\partial n_{s'}}\nabla n_{s'}}. \end{equation}

Example: Correlations are negligible in an ideal gas, $T\text{ ln } n_s{\Lambda}^3+(1/2)\epsilon_s\approx T {\text{ln } n_s{\Lambda}^3}$ and

(1.382)

\begin{equation} {\boldsymbol{E}}\approx \frac {1}{e_s}\sum\limits _{s^{\prime} }{\frac {\partial T{\text{ ln } n_s{\Lambda}^3\ }}{\partial n_{s^{\prime} }}\nabla n_{s^{\prime} }}\approx \frac {1}{e_s}\sum\limits _{s^{\prime} }{\frac {T}{n_{s^{\prime} }}\nabla n_{s^{\prime} }} \to n_s\sim e^{-\beta e_s\phi }. \end{equation}

All this has been classical.

Example: Suppose we have a degenerate electron gas. Recall equations (1.212)–(1.215) in § 1.2.7 from which we have

(1.383)

\begin{equation} n{\Lambda}^3_f=\frac {8\pi }{3},\quad {\Lambda}_f\equiv \frac {h}{p_f},\quad \frac {p^2_f}{2m}=\mu \end{equation}

if there is no external electric potential. The effect of an external field is

(1.384)

\begin{equation} e_1\phi +\frac {p^2_f}{2m}=\mu =e_1\phi +{\mu }_0, {\Lambda}_f\equiv \frac {h}{\sqrt {2m(\mu -e_1\phi )}},\quad n\frac {h^3}{{\!\left (2m(\mu -e_1\phi )\right )}^{\frac {3}{2}}}=\frac {8\pi }{3}. \end{equation}

Equation (1.384) gives the lowest-order number density as a function of $\phi$ . Performing an analysis similar to that in § 1.4.6 involving the solution of the linearized Poisson equation we recover shielding but with

(1.385)

\begin{equation} {\lambda }_D=\sqrt {\frac {\frac {2}{3}{\mu }_0}{4\pi n_0e^2}} \end{equation}

and the condition that the correlations are weak is

(1.386)

\begin{equation} U_{\text{corr}}\ll K=\frac {3}{5}N{\mu }_0 \quad \textrm {or} \quad n{\left (\frac {e^2}{{\mu }_0}\right )}^3\ll 1. \end{equation}

For energies corresponding to 10 eV, the weak correlation condition (1.387) fails for $n\sim {10}^{24}\,\textrm {cm}^{-3},$ i.e., for solid-state conditions. With considerations similar to those leading to (1.380) and (1.381) we can deduce the equilibrium electric field in a nonuniform metal by setting $\nabla {\mu }=0$ given (1.384).

[Reviewer Dominique Escande’s Comments: By treating n as a continuous function, the analysis implicitly assumes that there are many particles in the shielding cloud: so it is for weakly coupled plasmas only. In reality the theory works for a number of particles larger than 40 (see https://www.scielo.org.mx/scielo.php?script=sci_arttext&pid=S1870-35422017000100063).

– A Vlasov calculation of shielding can be performed for a weakly perturbative test charge moving very slowly. Shielding does not exist for a fast particle. See, for instance, (Nicholson Reference Nicholson1983, § 9.2).
– A derivation of shielding avoiding the assumption of Boltzmann equilibrium is provided in Meyer-Vernet (Reference Meyer-Vernet1993). It relies upon Gauss’ theorem and the Coulomb deflection of particles.
– The shielded Coulomb potential is a basic example of a renormalized potential. See McComb (Reference McComb2004, § 3.2).
– In order to go further on long-range interacting systems, Campa et al. (Reference Campa, A.Dauxois and Ruffo2014) is a useful reference.]

2. Nonequilibrium statistical mechanics

[Editor’s Note: Physics 212B addressed nonequilibrium statistical mechanics. As in Physics 212A there was no textbook, and use was made of many of the same references.]

2.1. Fundamentals

2.1.1. Definitions of a realization, moments, characteristic function, and discrete variables

As a vehicle to introduce a number of fundamental concepts and definitions we consider a system in which a large particle with mass M is immersed in a collection of smaller particles with mass m that constitutes a gas or fluid. The Hamiltonian for such a system is

(2.1)

\begin{equation} H=\tfrac{1}{2}MV^2+\sum\limits _i{\tfrac{1}{2}m{{{{v}}}}^2_i\ +\sum\limits _i{\Phi (\vert {{\boldsymbol{r}}}_i-{\boldsymbol{R}}\vert )+\ \sum\limits _{i\lt j}{\phi (r_{ij})}}}. \end{equation}

The large particle has velocity V and position R , while the smaller particles have velocities v ${}_{i}$ and positions r ${}_{i}$ . We posit that the motion of the large particle consists of fast variations due to fluid particles colliding with the large particle and slow variations due to the net diffusion of the large particle. An example of this is the motion of pollen grains as in the famous observations of Brown (1827) that subsequently was named Brownian motion. In this system, the kinetic energy of the large particle satisfies

(2.2)

\begin{equation} \tfrac{1}{2}M\left\langle V^2_x+V^2_y+V^2_z\right\rangle =\tfrac {3}{2}T \end{equation}

and $\langle V^2_x\rangle =T/M$ .

Definition: A realization of this system is a single instance of the system with defined initial conditions.

A realization of a random process has a specific initial condition and time history. For a sufficiently long time we can compute the time average ${\langle V^2_x\rangle }_t=T/M$ . Now consider an ensemble of identical systems possessing different initial conditions within the domain of the accessible phase space of the system. Then we can compute the ensemble average ${\langle V^2_x\rangle }_{{\text{ensemble}}}(t)$ at a specified time t, averaging over the different initial conditions. We expect the same result $T/M$ if the ensemble average over initial conditions is equal to the time average of a single realization of the system. That is to say we expect the ergodic hypothesis to be valid: the dynamics should spend equal times in equal volumes of phase space for a random process. Furthermore, we expect that every degree of freedom of a weakly coupled system should have T/2 energy associated with it as a consequence of ergodicity.

However, the results of experiments indicate that ergodicity is not always occurring in systems that we might think are random. An example of this is found in the numerical integration of the equations of motion of a chain of nonlinear oscillators with Lenard-Jones interactions between nearest neighbors reported in Galgani & Scott (Reference Galgani and Scott1972). Instead of an equipartition scaling of the time-average kinetic energy as in the one-dimensional version of (2.2), the numerical integration exhibited a Planck-like scaling for the mean energy levels:

(2.3)

\begin{equation} {\overline {E}}_n\sim \frac {1}{e^{\alpha {\omega }_n}-1} \end{equation}

where $\alpha \equiv \beta \hslash .$ We return to the introduction of fundamental concepts that we will make use of in the course of the subsequent discussion.

Definition: Let x be a random variable, whose probability $\rho \!\left (x\right )$ is normalized on the domain of x:

(2.4)

\begin{equation} \int\nolimits {\textrm{d}x\ \rho \!\left (x\right )=1}. \end{equation}

The average of any function f of x is defined as

(2.5)

\begin{equation} \langle f(x)\rangle =\int\nolimits {\textrm{d}x\ \rho \!\left (x\right )f(x)}. \end{equation}

There is a one-to-one relation between f and x such that

(2.6)

\begin{equation} \rho \!\left (f\right )\mathrm{d}f=\rho \!\left (x\right )\textrm{d}x,\rho \!\left (f\right )=\rho \!\left (x\right ){\bigg\vert \frac {\mathrm{d}f}{\textrm{d}x}\bigg\vert }^{-1},\quad \int\nolimits {\mathrm{d}f\ \rho \!\left (f\right )=1.} \end{equation}

Example: Suppose $K=({1}/{2})MV^2$ , then

(2.7)

\begin{equation} \rho \!\left (V\right )\sim e^{-\beta {{\frac{1}{2}}}MV^2} \to \rho (K)\sim \frac {e^{-\beta }}{MV}\sim \frac {e^{-\beta K}}{\sqrt {K}}. \end{equation}

If there is absolute certainty that the random variable x has the value x ${}_{0}$ , then $\rho \!\left (x\right )=\delta \!\left (x-x_0\right ).$ If, instead, we have knowledge of the relative probabilities that x is equal to a set of discrete values, then

(2.8)

\begin{equation} \rho \!\left (x\right )=\sum\limits _i{{\rho }_i\delta \!\left (x-x_i\right ),\quad \sum\limits _i{{\rho }_i=1.}} \end{equation}

Definition: Moments of a distribution of random variables are defined by

(2.9)

\begin{equation} \langle x^{\ell }\rangle \equiv \int\nolimits {\text{d}xx^{\ell }\rho (x)}. \end{equation}

The mean value of x corresponds to $\ell =1.$

Definition: Fluctuations are defined by $\delta x\equiv x-\langle x\rangle$ and $\langle \delta x\rangle \equiv 0$ . The standard deviation $\sigma$ is the square root of the variance defined by ${\sigma }^2\equiv \langle {\left (\delta x\right )}^2\rangle =\langle x^2\rangle -{\langle x\rangle }^2$ .

Definition: The characteristic function is defined by

(2.10)

\begin{equation} Z_x(k)\equiv \int\nolimits {\mathrm{d}xe^{-ikx}\rho (x)}. \end{equation}

It then follows

(2.11)

\begin{equation} Z_x\!\left (k=0\right )=1,\quad e^{-ikx}=\sum\limits ^{\infty }_{\ell =0}{\frac {{\!\left (-ikx\right )}^{\ell }}{\ell !}},\quad Z_x\!\left (k\right )=\sum\limits ^{\infty }_{\ell =0}{\frac {{\!\left (-ik\right )}^{\ell }}{\ell !}}\langle x^{\ell }\rangle \end{equation}

and

(2.12)

\begin{equation} {\frac {\mathrm{d}^{\ell }Z_x}{\textrm{d}k^{\ell }}\bigg\vert }_{k=0}=\int\nolimits {\textrm{d}x{e^{-ikx}\vert }_{k=0}{\left (-ix\right )}^{\ell }\rho \!\left (x\right )={(\!-i)}^{\ell }\langle x^{\ell }\rangle }. \end{equation}

We conclude that the probability function of the random variable and the moments of the random variable completely determine one another:

(2.13)

\begin{equation} \rho \!\left (x\right )\ \ \rightleftharpoons \ \ \{\langle x^{\ell }\rangle ,\ \ \ell =0,\ 1,\ 2,\ \dots \}. \end{equation}

The inverse transform of (2.10) yields

(2.14)

\begin{equation} \rho \!\left (x\right )=\int\nolimits {\frac {\textrm{d}k}{2\pi }e^{ikx}}Z_x(k). \end{equation}

Central limit theorem: A statement of the central limit theorem is that when the sample size of the discrete random variables is large enough, the probability distribution tends toward a Gaussian:

(2.15)

\begin{equation} \rho \!\left (x\right )=\frac {1}{\sqrt {2\pi {\sigma }^2}}e^{-\frac {{\left [x-\langle x\rangle \right ]}^2}{2{\sigma }^2}}. \end{equation}

The characteristic function is obtained from (2.10) and (2.15):

(2.16)

\begin{equation} Z_x\!\left (k\right )=e^{-ik\langle x\rangle -\frac{1}{2}k^2{\sigma }^2} \to {\text{ln } Z_x\!\left (k\right )= }-ik\langle x\rangle -\tfrac{1}{2}k^2{\sigma }^2. \end{equation}

Taking ${\text{ ln } Z_x\!\left (k\right )}$ has separated $\langle$ x $\rangle$ from its shape with respect to its mean value.

More generally, $\text{ ln } Z_x\!\left (k\right )$ is determined by a series expansion of (2.10)

(2.17)

\begin{equation} {\text{ ln } Z_x\!\left (k\right )= }-ik\langle x\rangle -\tfrac{1}{2}k^2{\sigma }^2+\frac {{\!\left (-i\right )}^3}{3!}k^3{\chi }_3+\dots +\frac {{\!\left (-i\right )}^n}{n!}k^n{\chi }_n \end{equation}

where ${\chi }_n\equiv K_n$ are cumulants of the probability distribution and were described by Danish astronomer T. N. Thiele as semi-invariants more than a century ago. Two examples of the cumulants are the skewness and the kurtosis:

(2.18)

\begin{equation} \textrm {skewness:}\quad \frac {K_3}{{\sigma }^3}\quad K_3=\langle x^3\rangle -3\langle x^2\rangle \langle x\rangle +2{\langle x\rangle }^3=\langle {\delta x}^3\rangle , \end{equation}

(2.19)

\begin{equation} \textrm {kurtosis: }\quad \frac {K_4}{{\sigma }^4}\quad K_4= \langle {\delta x}^4\rangle -3{\sigma }^4.\qquad\qquad\qquad\qquad \end{equation}

Definition: Given two random variables x and y, the probability $\rho (x,y)$ is defined by

(2.20)

\begin{equation} \rho (x,y)\equiv \rho (x\vert y)\rho (y)\equiv \rho (y\vert x)\rho (x), \end{equation}

where $\rho (x|y)$ is the conditional probability of x given y; and

(2.21)

\begin{equation} \rho \!\left (x\right )=\int\nolimits {\textrm{d}y\ \rho \!\left (x,y\right )=\int\nolimits {\textrm{d}y\ \rho (y)\rho \!\left (x|y\right )}}. \end{equation}

Definition: Given the function $f(x,y)$ the probability function $\rho (f)$ is

(2.22)

\begin{equation} \rho \!\left (f\right )=\int\nolimits {\textrm{d}x\textrm{d}y\ \rho (x,y)\delta (f-f\!\left (x,y\right ))}. \end{equation}

Definition: x and y are statistically independent if $\rho \!\left (x,y\right )=\rho \!\left (x\right )\rho \!\left (y\right )\!.$

If x and y are statistically independent, then

(2.23)

\begin{equation} \rho \!\left (x|y\right )=\rho \!\left (x\right ). \end{equation}

Definition: The correlation of x and y can be inferred from $\langle \delta x\delta y\rangle \equiv \langle xy\rangle -\langle x\rangle \langle y\rangle$ using the definitions $\delta x\equiv x-\langle x\rangle ,\ \ \delta y\equiv y-\langle y\rangle$ . If x and y are independent then $\langle \delta x\delta y\rangle =0$ , but not the converse. If $\langle \delta x\delta y\rangle \ne 0$ , then x and y are dependent.

Definition: A set of N random variables can be represented by

(2.24)

\begin{equation} \left \{x_i\right \}\equiv {\boldsymbol{x}},\quad i=1, 2, \dots , N. \end{equation}

The probability distribution $\rho ({\boldsymbol{x}})$ satisfies the normalization condition $\smallint {\textrm{d}^N\textit{x}\ \rho \!\left ({\boldsymbol{x}}\right )=1}$ . If the set of random variables is statistically independent, then $\rho \!\left ({\boldsymbol{x}}\right )=\prod\nolimits ^N_{i=1}{{\rho }_i(x_i)}$ The generalization of (2.10) to a N-dimensional vector of random variables is

(2.25)

\begin{equation} Z_x({\boldsymbol{k}})\equiv \int\nolimits {\textrm{d}^N\textit{x}e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{x}}}\rho ({\boldsymbol{x}})} \end{equation}

and (2.14) generalizes to

(2.26)

\begin{equation} \rho \!\left ({\boldsymbol{x}}\right )=\int\nolimits {\frac {\textrm{d}^N\textit{k}}{2\pi }e^{i{\boldsymbol{k}}\cdot {\boldsymbol{x}}}}Z_x({\boldsymbol{k}}). \end{equation}

The generalization of (2.17) is

(2.27)

\begin{equation} {\text{ ln } Z_x\!\left ({\boldsymbol{k}}\right )= }-i{\boldsymbol{k}}\cdot \langle {\boldsymbol{x}}\rangle -\tfrac{1}{2}{\boldsymbol{k}}\cdot {{\boldsymbol \sigma }}^2\cdot {\boldsymbol{k}}+O({\boldsymbol{kkk}}). \end{equation}

Example: For a Gaussian probability distribution (2.26) becomes

(2.28)

\begin{equation} \rho \!\left ({\boldsymbol{x}}\right )=\frac {1}{{\!\left (2\pi \right )}^{N/2}\sqrt {{|{\boldsymbol \sigma }|}^2}}e^{-\frac{1}{2}{\boldsymbol{x}}\cdot {{\boldsymbol \sigma }}^{-2}\cdot {\boldsymbol{x}}}. \end{equation}

2.1.2. Derivation of the central limit theorem

Assume the set $\left \{x_i\right \}$ is statistically independent. Define $x\equiv \Sigma _i{x_i.\ }$ Then

(2.29)

\begin{equation} \rho (x)\equiv \int\nolimits {\textrm{d}^N\textit{x}\ \rho \!\left (\left \{x_i\right \}\right )\delta \!\left (x-\sum\limits _i{x_i}\right )=}\int\nolimits {\textrm{d}^N\textit{x}\prod\limits _i{{\rho }_i(x_i)}\ \delta \!\left (x-\sum\limits _i{x_i}\right )} \end{equation}

and

(2.30)

\begin{align} {Z}_x\!\left (k\right )\equiv \int\nolimits {\textrm{d}x e^{-ikx}\rho \!\left (x\right )}&=\int\nolimits {\textrm{d}^N\textit{x}\prod\limits _i{{\rho }_i\!\left (x_i\right )}}e^{-ik\sum\limits _{{\boldsymbol{i}}}{x_{{\boldsymbol{i}}}}}\nonumber \\[4pt]&=\prod\limits _i{\int\nolimits {\textrm{d}x_i{\rho }_i\!\left (x_i\right )}}e^{-ikx_i}=\prod\limits _i{Z_i\!\left (k\right ), } \end{align}

then ${\text{ ln } Z_x\!\left (\textit{k}\right )= }\sum\limits _i{{{{\text{ ln } Z_i\!\left (\textit{k}\right)}}}}$ .

Example: For the special case wherein ${\rho }_i={\rho }_j$ , then ${\text{ ln } Z_x\!\left (\textit {k}\right )= }\Sigma _i\, \text {ln }Z_i(\textit {k}) \to$ $ N{\text{ ln } Z_1 {(\textit{k})}}$ . As a matter of definition, $\langle x\rangle =\Sigma _i{\langle x_i\rangle \to N}\langle x_1\rangle$ , ${\sigma }^2_x=\Sigma _i{{\sigma }^2_i}\to N{\sigma }^2_1$ , and ${\sigma }_x\to \sqrt {N}{\sigma }_1$ . As a consequence of these relations ${{\sigma }_x}/{\langle x\rangle }=(1/\sqrt{(N)})\sigma_{1}/{\langle x_1\rangle }$ and

(2.31)

\begin{equation} {\text{ ln } Z_x\!\left (\textit{k}\right )= }-i\textit {k}\langle \textit {x}\rangle -\tfrac{1}{2}{\textit {k}}^2{{\sigma }}^2_x+\dots +\frac {{\!\left (-i\right )}^n}{n!}k^nK_n(x) \end{equation}

and $K_n\!\left (x\right )=NK_n(x_1)$ where $K_n$ are Thiele’s cumulants. Note that the inner products ${\boldsymbol{k}}\cdot \langle {\boldsymbol{x}}\rangle$ and ${\boldsymbol{k}}\cdot {{\boldsymbol \sigma }}^2\cdot {\boldsymbol{k}}$ in (2.27) and use of $\langle x\rangle =\Sigma _i{\langle x_i\rangle }$ and ${\sigma }^2_x=\Sigma _i{{\sigma }^2_i}$ have led to (2.31).

Hence, as $N\to \infty$ the distribution function becomes Gaussian:

(2.32)

\begin{equation} {\text{ ln } Z_x\!\left ({k}\right )=N{\text{ ln } Z_1{(k)}}=-ikN\langle x_1\rangle -\tfrac{1}{2}}k^2N{\sigma }^2_1+\dots +\frac {{\!\left (-i\right )}^n}{n!}k^nNK_n\!\left (x_1\right ). \end{equation}

Using (2.26) and (2.32) we obtain

(2.33)

\begin{align} \rho \!\left ({x}\right )&=\int\nolimits {\frac {\textrm{d}k}{2\pi }e^{i{kx}}}Z_x\!\left ({k}\right )=\int\nolimits {\frac {\textrm{d}k}{2\pi }e^{i{kx}}}{\exp \left \{-i{k}\langle {x}\rangle -\frac{1}{2}{{k}}^2{{\sigma }}^2_x+\dots \right \}}\nonumber \\[4pt] &=\int\nolimits {\frac {\textrm{d}k}{2\pi }e^{i{k}\left ({x}-\langle x\rangle \right )-\frac{1}{2}{{k}}^2N{\sigma }^2_1+\dots +\frac {{\!\left (-i\right )}^n}{n!}k^nNK_n\left (x_1\right )}}. \end{align}

We evaluate (2.33) in the limit of large N. Assume that the largest contributions to the integral over k derive from $k\sim N^{1/2}{\sigma }^{-1}_1$ so that $({1}/{2}){{k}}^2N{\sigma }^2_1\sim O(1)$ . What then happens to the general term?

(2.34)

\begin{equation} \frac {{\!\left (-i\right )}^n}{n!}k^nNK_n\!\left (x_1\right )\to \frac {{\!\left (-i\right )}^n}{n!}{\frac {K_n}{{\sigma }^n_1}N}^{1-{{\frac {n}{2}}}}\to \frac {{\!\left (-i\right )}^n}{n!}N^{1-{{\frac {n}{2}}}}, \end{equation}

with $K_n\sim O({\sigma }^n_1)$ . Hence, for $n\gt 2$ , $N^{1-{{{n}/{2}}}}\to 0$ as $N\to \infty$ ; and the general term for $n\gt 2$ is vanishingly small as $N\to \infty$ . Then for large N the probability distribution tends toward a Gaussian:

(2.35)

\begin{equation} \rho \!\left ({x}\right )=\int\nolimits {\frac {\textrm{d}k}{2\pi }e^{i{k}\left ({x}-\langle x\rangle \right )-\frac{1}{2}{{k}}^2N{\sigma }^2_1}=\frac {1}{\sqrt {2\pi N{\sigma }^2_1}}e^{-\frac{1}{2}\frac {{\left [x-\langle x\rangle \right ]}^2}{N{\sigma }^2_1}}}. \end{equation}

Example: Consider the non-Gaussian probability distribution $\rho \!\left (x_1\right )\sim x^{m-1}_1e^{-x_1}$ $x_1\ge 0$ For this distribution $\langle x_1\rangle =m$ and ${\sigma }_1=\sqrt {m}$ Based on $\rho \!\left (x_1\right ),$ $\rho \!\left (x\right )=$ $x^{Nm-1}e^{-x}$ exactly, which derives from considering the sums of gamma-distributed variables and is not a trivial result; and $\langle x\rangle =Nm$ and $\,{\sigma }_x=\sqrt {Nm}$ .

Exercise: Take the limit as $N\to \infty$ for $\rho \!\left (x\right )=x^{Nm-1}e^{-x}$ and recover $\rho \!\left (x\right )\sim $ $e^{-{{\left [x-\langle x\rangle \right ]}^2}/{2N{\sigma }^2_x}}$ to verify the central limit theorem.

Example: Consider the non-Gaussian probability distribution $\rho \!\left (x_1\right )\sim {1}/({x^2_1+1})$ for positive and negative $x_1$ Then $\langle x_1\rangle$ = 0 and ${\sigma }^2_1=\langle x^2_1\rangle \sim \smallint\nolimits ^{\infty }_{-\infty }{\textrm{d}x({x^2}/({1+x^2}))=\infty }$ . Moreover, $\langle x^{2n}_1\rangle =\infty$ . The characteristic function is the Fourier transform of the Lorentzian in this example and is not analytic: $Z(k)\sim e^{-\vert k\vert }$ . The central limit theorem is invalid for this probability distribution because the moments are infinite.

2.1.3. Random processes, spectral density, and correlation function

Consider a random process for a particle velocity as a function of time $V(t)$ with probability distribution $\rho \!\left (V_{t_1}, V_{t_2}, V_{t_3}, \dots , V_{t_N}\right )$ . If one knows $\rho ,$ one can calculate all of the moments of $\left \{V_{t_i}\right \}$ . For example,

(2.36)

\begin{equation} \langle V_{t_1}V^2_{t_2}V_{t_3}\rangle =\int\nolimits {\textrm{d}^NV_t}V_{t_1}V^2_{t_2}V_{t_3}\rho (V_{t_1}, V_{t_2}, V_{t_3}, \dots , V_{t_N}). \end{equation}

Definition (Stationary random process): If $\rho (V_{{t+\tau }_1}, V_{{t+\tau }_2}, V_{{t+\tau }_3}, \dots , V_{{t+\tau }_N})$ is independent of t, then the system or process is stationary.

Definition (Ergodicity): In order for a system or process to be ergodic, it must be stationary; and the time average of any moment must equal the ensemble average of the same moment, e.g.,

(2.37)

\begin{equation} {\langle V_{{t+\tau }_1}V^2_{{t+\tau }_2}V_{{t+\tau }_3}\rangle }_t={\langle V_{{t+\tau }_1}V^2_{{t+\tau }_2}V_{{t+\tau }_3}\rangle }_{{\text{ensemble}}}. \end{equation}

The systems comprising the ensemble on the right-hand side of (2.37) are not prepared necessarily the same uniquely, but are macroscopically identical. The systems must be stationary in order to calculate the time average in (2.37) sensibly.

Definition (Spectral density): Assume that ${\langle V\rangle }_{{\text{ensemble}}}\!\left (t\right )=0$ or ${\langle V(t)\rangle }_{\text{time}}=0$ (equivalent if ergodic and stationary). The spectrum is determined by

(2.38)

\begin{equation} V(\omega )\equiv \int\nolimits {\textrm{d}t\ e^{i\omega t}V(t)}. \end{equation}

The two-time correlation is defined by

(2.39)

\begin{equation} C(\tau )\equiv {\langle V\!\left (t\right )V(t-\tau )\rangle }_{t\ \text{or}\ {\text{ensemble}}}. \end{equation}

The spectral density is defined by

(2.40)

\begin{equation} S(\omega )\equiv \int\nolimits {\textrm{d}\tau \ e^{i\omega \tau }C(\tau )}. \end{equation}

That $S\!\left (\omega \right )$ is the power spectral density of the $V\!\left (t\right )$ field is a consequence of the convolution theorem. Consider a process $V(t)$ that satisfies the stationary and ergodic assumptions. Then the Fourier transform satisfies

(2.41)

\begin{equation} V\!\left (\omega \right )=\int\nolimits ^{\infty }_{-\infty }{\textrm{d}t\ V\!\left (t\right )e^{i\omega t},\quad \omega }= \textrm{real}, \end{equation}

with reality condition

(2.42)

\begin{equation} V\!\left (-\omega \right )=V^*\!\left (\omega \right ). \end{equation}

We note that $\langle V(\omega )\rangle \to 0$ due to symmetry. We want to calculate $\langle {\vert V(\omega )\vert }^2\rangle$ , but to evaluate this will lead to the auto correlation:

(2.43)

\begin{equation} \langle V\!\left (t_1\right )V\!\left (t_2\right )\rangle \to {\langle V\!\left (t_1\right )V\!\left (t_1+\tau \right )\rangle }_{{\text{ensemble}}}=C(\tau)={\langle V(t)V(t+\tau )\rangle }_t. \end{equation}

Then $C(\tau)={\langle V(t)V(t-\tau )\rangle }_t=C(-\tau )$ , i.e., $C(\tau)$ is an even function. Furthermore,

(2.44)

\begin{equation} C\!\left (0\right )={\langle V\!\left (t\right )V\!\left (t\right )\rangle }_{{\text{ensemble}}}={\langle V\!\left (t\right )V\!\left (t\right )\rangle }_t=\frac {T}{M},\quad C(\tau \to \infty )\to 0, \end{equation}

where T in (2.44) is the temperature if V is the velocity.

Definition: The normalized correlation function is

(2.45)

\begin{equation} R(\tau )\equiv C(\tau )/C(0). \end{equation}

Example: The normalized correlation function for a Lorentz model looks like

(2.46)

\begin{equation} R(\tau)=e^{-\nu \vert \tau \vert }{\cos {\omega }_0\tau }. \end{equation}

As a consequence of (2.40) and since $C(\tau )$ is even

(2.47)

\begin{equation} S\!\left (\omega \right )\equiv \int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau \ e^{i\omega \tau }C(\tau)}\quad \textrm{and} \quad C(\tau)=\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\ e^{-i\omega \tau }\!\left (\omega \right ).} \end{equation}

Then $S\!\left (\omega \right )=S\!\left (-\omega \right )=S^*\!\left (\omega \right )$ , i.e., $S\!\left (\omega \right )$ is an even function and is real also:

(2.48)

\begin{equation} \langle V^2\rangle =S\!\left (0\right )=\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\ S\!\left (\omega \right )}. \end{equation}

Definition: It is convenient to introduce subscripts

(2.49)

\begin{equation} C_{\chi }\equiv {\langle \chi \!\left (t\right )\chi \!\left (t-\tau \right )\rangle }_{{\text{ensemble}}}={\langle \chi \!\left (t\right )\chi \!\left (t-\tau \right )\rangle }_t \end{equation}

to define the correlation function and spectral density for the general case.

Consider

(2.50)

\begin{align} \langle V(\omega )V^*(\omega ')\rangle &\equiv \int\nolimits ^{\infty }_{-\infty }{\textrm{d}t\ e^{i\omega t}\int\nolimits ^{\infty }_{-\infty }{\textrm{d}t'{\ e}^{-i{\omega }^{\prime} t^{\prime} }\langle V\!\left (t\right )V(t^{\prime})\rangle }}\nonumber \\[4pt]&= \int\nolimits ^{\infty }_{-\infty }{\textrm{d}t}\ e^{i(\omega -{\omega }^{\prime} )t}\int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau e^{i{\omega }^{\prime} \tau }}\ C(\tau)=2\pi \delta \!\left (\omega -{\omega }^{\prime} \right )S({\omega }^{\prime} ), \end{align}

using $t^{\prime} =t-\tau$ . Equation (2.50) tells us that there are no correlations between Fourier components of the random field at different frequencies.

Lemma: Given that $\langle V(\omega )V^*(\omega )\rangle =\langle {\vert V(\omega )\vert }^2\rangle =2\pi \delta \!\left (0\right )S(\omega )$ we conclude that $S\!\left (\omega \right )\ge 0.$

Now we replace the time integrals in (2.50) with the limiting forms

(2.51)

\begin{equation} \int\nolimits ^{\infty }_{-\infty }{\textrm{d}t}\to {\mathop {\lim }_{T\to \infty } \int\nolimits ^{T/2}_{-T/2}{\textrm{d}t},} \end{equation}

(where now T represents a time interval) so that

(2.52)

\begin{equation} V\!\left (\omega \right )={\mathop {\lim }_{T\to \infty } \int\nolimits ^{T/2}_{-T/2}{\textrm{d}t}}\ V\!\left (t\right )e^{i\omega t}\ \quad \textrm {and}\quad 2\pi \delta \!\left (0\right )S(\omega )\to 2\pi TS(\omega ) \quad \textrm {for}\quad \omega ={\omega }^{\prime} \end{equation}

and we obtain the result of the Wiener–Khinchin theorem:

(2.53)

\begin{equation} {\mathop {\lim }_{T\to \infty } \frac {\langle {\vert V_T(\omega )\vert }^2\rangle }{T}}=S(\omega ). \end{equation}

Example: Return to consideration of the Lorentz model correlation function

(2.54)

\begin{equation} R(\tau)=e^{-\nu \vert \tau \vert }{\cos {\omega }_0\tau \to S\!\left (\omega \right )= \sum\limits _{\pm }{\frac {\nu }{{\nu }^2+{\!\left (\omega \pm {\omega }_0\right )}^2}}}, \end{equation}

where $\nu$ is the inverse correlation time.

Example: Exponential correlation function

(2.55)

\begin{equation} R(\tau)=e^{-\nu \vert \tau \vert }\to S\!\left (\omega \right )= \frac {2\nu }{{\nu }^2+{\omega }^2}. \end{equation}

2.2. Brownian motion

We return to the consideration of random processes and Brownian motion.

2.2.1. Langevin equation

The equation of motion for Brownian motion can be cast in the general form

(2.56)

\begin{equation} M\dot {V}=F\!\left (t\right )\equiv \langle F\rangle \!\left (t\right )+\delta F(t). \end{equation}

Given V(t) for a Brownian particle, we expect a viscous force for the mean force on the particle, i.e., a viscous drag force:

(2.57)

\begin{align} &\langle F\rangle \!\left (V\right )=\langle F\rangle \!\left (0\right )+V{\frac {\textrm{d}\langle F\rangle }{\textrm{d}V}\bigg\vert }_{V=0}+\dfrac{1}{2}V^2{\frac {\textrm{d}^2\langle F\rangle }{\textrm{d}V^2}\bigg\vert }_{V=0}=V{\frac {\textrm{d}\langle F\rangle }{\textrm{d}V}\bigg\vert }_{V=0}+\dfrac{1}{2}V^2{\frac {\textrm{d}^2\langle F\rangle }{\textrm{d}V^2}\bigg\vert }_{V=0},\nonumber\\[4pt] &\end{align}

where $\langle F\rangle \!\left (0\right )=0$ as we assume that there is no external nonzero fields influencing the particle at rest. Hence, to lowest order

(2.58)

\begin{equation} M\dot {V}=F\!\left (t\right )=-\gamma V+\ \delta F(t). \end{equation}

From hydrodynamics, Stokes’ law gives

(2.59)

\begin{equation} \gamma =6\pi \eta R, \end{equation}

where $\eta$ is the specific viscosity and R is the radius of the Brownian particle if it is a spherical object.

Definition: Equation (2.58) can be rewritten in the form of a Langevin equation:

(2.60)

\begin{equation} \!\left (M\frac {\textrm{d}}{\textrm{d}t}+\gamma \right )V\!\left (t\right )=\delta F(t)\; \textrm{or}\; \left (-i\omega M+\gamma \right )V\!\left (\omega \right )=\delta F(\omega ). \end{equation}

Hence,

(2.61)

\begin{equation} V\!\left (\omega \right )=\frac {\delta F(\omega )}{-i\omega M+\gamma } \to \langle\vert V\!\left (\omega \right )\vert ^2\rangle =\frac {\langle \vert\delta F\!\left (\omega \right )\vert ^2\rangle }{\vert-i\omega M+\gamma \vert ^2} \to S_V\!\left (\omega \right )=\frac {S_F(\omega )}{{\omega }^2M^2+{\gamma }^2}. \end{equation}

We assume that the Brownian particle has a much larger mass M than the particles in the surrounding fluid. This leads to much lower characteristic frequencies in $S_V\!\left (\omega \right )$ than in $S_F\!\left (\omega \right )$ . The fluid forces give rise to very rapid fluctuations in F:

(2.62)

\begin{equation} {\omega }_{\delta F}\sim \frac {\sqrt {\langle V^2\rangle }}{a}\sim \frac {\sqrt {T/m}}{a}\equiv {\nu }_{\delta F},\quad R_F(\tau )\sim e^{-{\nu }_{\delta F}\vert \tau\vert }, \end{equation}

while the response of the Brownian particle velocity is

(2.63)

\begin{equation} S_V\!\left (\omega \right )=\frac {\frac {1}{M^2}S_F(\omega )}{{\omega }^2+{{\nu }_V}^2},\quad \,{\nu }_V\equiv \frac {\gamma }{M},\quad R_V(\tau )\sim e^{-{\nu }_V\tau }. \end{equation}

Because M is large, ${\nu }_V\ll \,{\nu }_{\delta F}$ ; and the power spectrum $S_V\!\left (\omega \right )$ decays with respect to frequency at much lower frequency values than does $S_F\!\left (\omega \right )$ . We can estimate ${\nu }_F\sim n{\langle V^2\rangle }^{{1}/{2}}{\pi a}^2$ where a is approximately the atomic radius of the fluid particle and n is the fluid density. We assume that the mass density of the Brownian and fluid particles are the same. The specific viscosity is $\eta \sim \rho \ell {\langle V^2\rangle }^{{1}/{2}}\ \textrm {where}\ \ell \sim ({1}/({n\pi a^2})),$ and $M=({4\pi }/{3})\rho R^3$ . Hence, ${\nu }_V \sim ({6\pi \eta R})/{M} \sim ({6\pi \rho \ell {\langle V^2\rangle }^{{1}/{2}}R})/{M} \sim 6\pi \rho (1/(n\pi a^2)){\langle V^2\rangle }^{{1}/{2}}R/(({4\pi }/{3})\rho R^3)$ . Note that $\eta \sim ({{m\langle V^2\rangle }^{{1}/{2}}})/{a^2}$ is independent of the density. For a gas, ${\langle V^2\rangle }^{1/2}\sim \sqrt {T/M}$ , and $\eta \sim {10}^{-3}$ in cgs units, whereas for water $\eta \sim {10}^{-2}$ in cgs units; so we can introduce a fudge factor O(1–10) in the viscosity. Finally, our estimates of ${\nu }_F$ and ${\nu }_V$ lead to

(2.64)

\begin{equation} \frac {{\nu }_F}{{\nu }_V}=\frac {1}{6\pi }\frac {4\pi }{3}\rho R^3\frac {n{\langle V^2\rangle }^{1/2}{\pi a}^2}{\rho \frac {1}{n{\pi a}^2}{\langle V^2\rangle }^{1/2}R}=\frac {1}{6\pi }\frac {4\pi }{3}{{{\pi}^2\!\left (na^3\right )}^2}\frac {R^2}{a^2}\sim O(1){{\left (na^3\right )}^2}\frac {R^2}{a^2} \gg 1 . \end{equation}

If $R\sim {10}^{-5}\;\textrm {cm}$ and $a\sim {10}^{-8}\;\textrm {cm}$ there is significant margin for satisfying the inequality in (2.64). Note that the origin of the R ${}^{2}$ factor in the numerator of (2.65) is the 1/M in the ${\nu }_V$ expression.

2.2.2. Fluctuation–dissipation theorem

We return to consideration of the spectral densities $S_F\!\left (\omega \right )$ and $S_V\!\left (\omega \right )$ in (2.63):

(2.65)

\begin{equation} S_F\!\left (\omega \right )\sim \frac {1}{{\omega }^2+{\nu }^2_F},\quad \omega \sim {\nu }_V\ll \,{\nu }_F \end{equation}

for frequencies $\omega$ relevant to the velocity response. Then ${{\omega }^2}/{{\nu }^2_F}\sim {10}^{-12}$ or ${10}^{-8}$ and $S_F\!\left (\omega \ll {\nu }_F\right )=S_F\!\left (0\right )$ , and from (2.64)

(2.66)

\begin{equation} S_V\!\left (\omega \right )\cong \frac {\frac {1}{M^2}S_F(0)}{{\omega }^2+{{\nu }_V}^2}. \end{equation}

From (2.40) the inverse Fourier transform of $S_V\!\left (\omega \right )$ yields the correlation function:

(2.67)

\begin{equation} C_V(\tau)=\frac {S_F(0)}{M^22{\nu }_V}e^{-{\nu }_V\vert \tau\vert }. \end{equation}

However, we know from the fundamental definition of $C_V(\tau)$ in (2.39) that $C_V\!\left (0\right )=\langle V^2\rangle =T/M$ . Hence,

(2.68)

\begin{equation} C_V(\tau)=\frac {T}{M}e^{-{\nu }_V\vert \tau \vert }\quad \textrm{and} \quad S_F\!\left (\omega \right )=2\gamma T, \end{equation}

good for $\omega \ll \,{\nu }_F$ . Now consider the integral of $S_F\!\left (\omega \right )$ over a frequency interval $\left [-\Delta \omega ,\Delta \omega \right ]$ where $\Delta \omega \ll {\nu }_F.$

The fluctuation–dissipation relation is

(2.69)

\begin{equation} {\langle {\left (\delta F\right )}^2\rangle }_{\Delta \omega }=\int\nolimits ^{\Delta \omega }_{-\Delta \omega }{\frac {\textrm{d}\omega }{2\pi }}S_F\!\left (\omega \right )=2\frac {\Delta \omega }{2\pi }2\gamma T=4\frac {\Delta \omega }{2\pi }\gamma T. \end{equation}

This is the fluctuation–dissipation theorem or Nyquist theorem due to Einstein in his work on Brownian motion.

2.2.3. Spatial diffusion and diffusivity

If the velocity field in Brownian motion is a random process, the particle displacement inherits randomness from the velocity.

Definition: Let x be the position of the particle and V its velocity.

For a specified time interval $\Delta t$ there is an accrued displacement:

(2.70)

\begin{equation} \Delta x=\int\nolimits ^{t+\Delta t}_t{\textrm{d}t^{\prime} V(t^{\prime} )}. \end{equation}

The ensemble-averaged displacement inherits its value from the ensemble-averaged velocity:

(2.71)

\begin{equation} \langle \Delta x\rangle =\int\nolimits ^{t+\Delta t}_t{\textrm{d}t^{\prime} \langle V(t^{\prime} )\rangle =0} \end{equation}

if $\langle V(t^{\prime} )\rangle =0$ . We can then calculate the ensemble-averaged variance of the displacement:

(2.72)

\begin{eqnarray} \langle {\left (\Delta x\right )}^2\rangle &&=\int\nolimits ^{t+\Delta t}_t{\textrm{d}t^{\prime} \left \{\int\nolimits {\textrm{d}t^{\prime}\ \langle V(t^{\prime})V(t^{\prime})\rangle }\right \}}\nonumber \\[4pt] && =\int\nolimits ^{t+\Delta t}_t{\textrm{d}t^{\prime} \left \{{\int\nolimits {\textrm{d}t^{\prime}}C}_V(\vert t^{\prime} -t^{\prime}\vert )\right \}=2\frac {\langle V^2\rangle }{{\nu }^2_V}\left [{\nu }_V\Delta t-1+e^{-{\nu }_V\Delta t}\right ]},\quad \end{eqnarray}

where (2.68) is used to evaluate $C_V(\vert t^{\prime} -t^{\prime}\vert )=\langle V^2\rangle e^{-{\nu }_V\vert {{t}}^{\prime}{-t^{\prime}\vert }}$ The variance has the two limiting values:

(2.73)

\begin{equation} \langle {\left (\Delta x\right )}^2\rangle =\left \{ \begin{array}{c} \langle V^2\rangle {(\Delta t)}^2\quad \,{\nu }_V\Delta t\ll 1,\ \\[4pt] 2\langle V^2\rangle \dfrac {\Delta t}{{\nu }_V}\quad { \nu }_V\Delta \textit {t}\gg 1. \end{array} \right . \end{equation}

For short times the variance in the displacement grows quadratically in time as if the velocity is constant, whereas for long times the variance in the displacement grows linearly in time apropos of a diffusion process!

Definition: The diffusivity is defined by

(2.74)

\begin{equation} D\equiv {\mathop {\lim }_{{\nu }_V\Delta t\to \infty } \frac {\langle {\left (\Delta x\right )}^2\rangle }{2\Delta t}}. \end{equation}

The diffusivity is then

(2.75)

\begin{equation} D={\mathop {\lim }_{{\nu }_V\Delta t\to \infty } \frac {\langle {\left (\Delta x\right )}^2\rangle }{2\Delta t}}=\frac {\langle V^2\rangle }{{\nu }_V}\ = \frac {\ T}{M}\frac {M}{\gamma }=\frac {T}{\gamma }. \end{equation}

This result allows us to define and evaluate the steady velocity response to a steady external force, i.e., the mobility

(2.76)

\begin{equation} \langle M\dot {V}\rangle +\langle \gamma V\rangle =\langle \delta F\rangle +\langle F^{\text{ext}}\rangle \to \langle \gamma V\rangle = \langle F^{\text{ext}}\rangle =F^{\text{ext}}. \end{equation}

Definition: The mobility $\mu$ is then defined as

(2.77)

\begin{equation} \mu \equiv \frac {\langle V\rangle }{F^{\text{ext}}}=\frac {1}{\gamma }. \end{equation}

Using (2.75) and (2.77) we arrive at the Einstein relation:

(2.78)

\begin{equation} D=\mu T \end{equation}

The diffusivity, which characterizes random spatial diffusion, is related directly to the mobility, i.e., the response to an external steady force, and the temperature.

We return to the consideration of (2.73) in more detail:

(2.79)

\begin{eqnarray} \langle {\left (\Delta x\right )}^2\rangle =\int\nolimits ^{t_0+\Delta t}_{t_0}{\textrm{d}t^{\prime} \left \{\int\nolimits ^{t_0+\Delta t}_{t_0}{\textrm{d}t^{\prime}\ \langle V(t^{\prime})V(t^{\prime})\rangle }\right \}} \nonumber \\[4pt] =\int\nolimits ^{t_0+\Delta t}_{t_0}{\textrm{d}t^{\prime} }\left \{\int\nolimits {\textrm{d}\tau \ \langle V(t^{\prime})V(t^{\prime} -\tau )\rangle }\right \}. \end{eqnarray}

Note that the limits of integration in the second integral in (2.79) are implied based on the original limits of the double integral which corresponded to the boundaries of a rectangle in the t ^′ and t ^′′ domain, $t_0$ to $t_0+\Delta t$ in each direction. We recognize that $C_V(\tau)=\langle V(t^{\prime})V(t^{\prime} -\tau )\rangle$ from (2.49) and $C_V(\vert \tau \vert )=\langle V^2\rangle e^{-{\nu }_V\vert \tau \vert }$ . The correlation function $C_V(\vert \tau \vert )$ falls off sharply over a time ${\tau }_c\sim O(1){\nu }^{-1}_V$ . We assume that $\Delta t\gg {\tau }_c$ The direct implication of the sharp fall off of $C_V(\vert \tau \vert )$ is that the dominant contributions to the double integrals in (2.72) and (2.79) are over a narrow region surrounding the diagonal $t^{\prime} =t^{\prime}$ in the original rectangle in the t ^′ and t ^′′ domain. This allows us to extend the limits of integration in the $\smallint {\textrm{d}\tau }$ integral in (2.79) to [ $-\infty ,\infty ]$ with no loss of precision. Evaluation of (2.79) is then straightforward:

(2.80)

\begin{equation} \int\nolimits ^{t_0+\Delta t}_{t_0}{\textrm{d}t^{\prime} }\left \{\int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau \ C_V(\tau )}\right \}=\Delta t\left \{\int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau \ C_V(\tau )}\right \}=2\Delta t\left \{\int\nolimits ^{\infty }_0{\textrm{d}\tau \ C_V(\tau )}\right \}. \end{equation}

From (2.75) and (2.80)

(2.81)

\begin{equation} D_x\equiv \frac {\langle {\left (\Delta x\right )}^2\rangle }{2\Delta t}= \int\nolimits ^{\infty }_0{\textrm{d}\tau \ C_V(\tau)=\frac {\langle V^2\rangle }{{\nu }_V}} \end{equation}

and we recover the result in (2.75).

Example: Using the evaluation of ${\nu }_V$ and the specific viscosity $\eta$ in the last section we have

(2.82)

\begin{equation} D_x=\frac {T}{6\pi \eta R}. \end{equation}

Assume T = 20–25 ${}^\circ \textrm {C}$ , $R\sim {10}^{-5}\,\textrm {cm},\ \ \textrm {and}\ a\sim {10}^{-8}\,\textrm{cm}$ , then $D_x\sim 2\times {10}^{-8}\,{{\textrm {cm}}^2}/{\textit{s}}$ . Hence,

(2.83)

\begin{equation} \langle {\left (\Delta x\right )}^2\rangle =2D_x\Delta t\sim 4\times {10}^{-8}\Delta t\,{\textrm {cm}}^2\quad \textrm {and}\quad {\sigma }_x\sim 2\times {10}^{-4}\sqrt {\Delta t(\textrm {sec})}\ \textrm {cm}. \end{equation}

Because of the 1/R dependence in D ${}_{x}$ , smaller particles diffuse considerably faster.

Next we consider the probability of a particle having a displacement x ${}_{t}$ relative to a reference or initial displacement x ${}_{0}$ , and we appeal to the central limit theorem:

(2.84)

\begin{equation} \rho \!\left (x_t\vert x_0\right )=\frac {e^{-\frac {{\!\left (x_t-x_0\right )}^2}{4Dt}}}{\sqrt {4\pi Dt}}. \end{equation}

Now multiply both sides of (2.84) by the probability $\rho (x_0)$ . Then

(2.85)

\begin{equation} \rho \!\left (x_t,x_0\right )\equiv \rho \!\left (x_0\right )\rho \!\left (x_t\vert x_0\right )=\frac {e^{-\frac {{\!\left (x_t-x_0\right )}^2}{4Dt}}}{\sqrt {4\pi Dt}}\rho \!\left (x_0\right ). \end{equation}

These probabilities are not statistically independent. The displacement at time t is very dependent on the displacement at t = 0. The probability of the displacement $\rho \!\left (x_t\right )$ without specifying $x_0$ is given by the integral over $x_0$ :

(2.86)

\begin{equation} \rho \!\left (x_t\right )=\int\nolimits {\textrm{d}x_0\rho (x_t,x_0)\equiv \rho (x;t)} \end{equation}

and $\rho \!\left (x_0\right )=\rho (x;0)$ .

Then

(2.87)

\begin{equation} \rho \!\left (x;t\right )=\int\nolimits {\textrm{d}x'}\ \frac {e^{-\frac {{(x-x')}^2}{4Dt}}}{\sqrt {4\pi Dt}}\rho (x^{\prime} ;0)\quad \textrm {or}\quad \rho \!\left (x;t+\tau \right )=\int\nolimits {\textrm{d}x'}\ \frac {e^{-\frac {{\!\left (x-x'\right )}^2}{4Dt}}}{\sqrt {4\pi \textrm{D}\tau }}\rho \!\left (x^{\prime} ;t\right ). \end{equation}

The constructed solution $\rho \!\left (x;t\right )$ satisfies the diffusion equation:

(2.88)

\begin{equation} \frac {\partial }{\partial t}\rho \!\left (x;t\right )=D\frac {{\partial }^2\rho }{\partial x^2}. \end{equation}

This all carries over to three dimensions: ${\partial}/{\partial t}(\rho \!\left ({\boldsymbol{x}};t\right ))=D{\nabla }^2\rho$ We note that going from one to three dimensions in the diffusion equation comes with extending the boundary conditions from one to three dimensions, alters the Green’s functions for integral solutions of the diffusion equation, and changes how theorems are proven.

2.2.4. Boltzmann’s $\text{H}$ -theorem

In this section we consider example calculations derived from (2.88) which lead us to Boltzmann’s H-theorem.

Examples:

1. Given the initial condition $\rho \!\left (x,0\right )={\rho }_0\!\left (1+\epsilon \,{\sin kx}\right )$ the solution to the partial differential equation in (2.88) is straightforward: $\rho \!\left (x,t\right )={\rho }_0\!\left (1+\epsilon {e^{-k^2Dt}\sin kx}\right )$ for t $\gt$ 0.
2. Consider the integrated form of (2.88) with reflecting wall boundary conditions ${\partial \rho }/{\partial x}=0$ :
(2.89) \begin{equation} 0=\frac {\textrm{d}}{\textrm{d}t}\int\nolimits {\textrm{d}x\ \rho \!\left (x,t\right )=\int\nolimits {\textrm{d}x\ \frac {\partial \rho }{\partial t}=\int\nolimits {\textrm{d}x}}}D\frac {{\partial }^2\rho }{\partial x^2}=D\frac {\partial \rho }{\partial x}\bigg\vert _{{\text{boundary}}}. \end{equation}
Now solve the boundary value problem given the initial conditions in the first example and defining a relation between k and the length of the box L, e.g., $k=2\pi /L$ . Given that the definition of the flux is $\varGamma =-D\nabla n$ and the boundary conditions, there should be no flux across the bounding surfaces.
3. H-theorem (Boltzmann): Introduce the entropy
(2.90) \begin{equation} S\!\left (t\right )=-\int\nolimits {\textrm{d}^3\textit{x}\ \rho \!\left ({\boldsymbol{x}};t\right ){\text{ ln } \rho ({\boldsymbol{x}};t)}}. \end{equation}
Then from its time derivative
(2.91) \begin{align} \frac {\textrm{d}S}{\textrm{d}t}&= -\int\nolimits {\textrm{d}^3\textit{x}\ \left [\frac {\partial \rho }{\partial t}{\text{ ln } \rho +\ \frac {\partial \rho }{\partial t}}\right ]}=-\int\nolimits {\textrm{d}^3\textit{x}}\frac {\partial \rho }{\partial t}{\text{ ln } \rho -\ \frac {\textrm{d}}{\textrm{d}t}\int\nolimits {\textrm{d}^3\textit{x}\ \rho }}\nonumber \\[4pt] &=-\int\nolimits {\textrm{d}^3\textit{x}}\frac {\partial \rho }{\partial t}{\text{ ln } \rho }=-\int\nolimits {\textrm{d}^3\textit{x}}{\text{ ln } \rho \ D{\nabla }^2\rho} = \int\nolimits {\textrm{d}^3\textit{x}}{\frac {D}{\rho }\nabla \rho \cdot \nabla \rho}\nonumber\\[4pt] &= \int\nolimits {\textrm{d}^3\textit{x}}{\frac {D}{\rho } {\vert \nabla \rho \vert }^2\ge 0} \end{align}
and we note that surface terms in the integration by parts vanish since the flux across bounding surfaces is assumed to be zero, $\rho$ is nonnegative, and the volume integral of $\rho$ is conserved with the zero flux boundary conditions. Thus, only if $\rho$ is perfectly flat will S stop growing; and there is irreversible growth until the entropy achieves a maximum.
4. S has an upper bound if $\rho$ = constant corresponding to uniformity, which is the asymptotic limit corresponding to $\nabla \rho =0$ yielding ${\textrm{d}S}/{\textrm{d}t}=0.$

Note: Boltzmann (Reference Boltzmann1872) used the Boltzmann equation rather than the diffusion equation used here to derive the H-theorem and obtained the Boltzmann distribution as an asymptotic state.

2.3. Liouville and Klimontovich equations

In this section kinetic equations are introduced to describe the evolution in phase space of a deterministic system when initial conditions might not be precisely known.

2.3.1. Liouville equation

We postulate a vector function ${\boldsymbol \varGamma }\!\left (t\right )$ that describes the state of a system of particles, i.e., its momenta $\left \{{{\boldsymbol{p}}}_i\right \}$ and positions $\left \{{{\boldsymbol{q}}}_i\right \}$ , and assume further that there exists a Hamiltonian $H\!\left ({\boldsymbol{p}},{\boldsymbol{q}};t\right )$ that determines the evolution of the system:

(2.92)

\begin{equation} {\dot {q}}_i=\frac {\partial H}{\partial p_i},\quad {\dot {p}}_i=-\frac {\partial H}{\partial q_i}.\ \end{equation}

Example: Charged particle motion in electromagnetic fields is described by the equations

(2.93)

\begin{equation} \left \{{\boldsymbol{r}},{{{\boldsymbol v}}}\right \}:\quad \dot {{\boldsymbol{r}}}={{{\boldsymbol v}}}\quad \dot {{{{\boldsymbol v}}}}= \frac {e}{m}\left [{\boldsymbol{E}}\!\left ({\boldsymbol{r}},t\right )+\frac {1}{c}{{{\boldsymbol v}}}(t)\times {\boldsymbol{B}}({\boldsymbol{r}},t)\right ]. \end{equation}

Example: Viscous system

(2.94)

\begin{equation} \left \{{\boldsymbol{r}},{{{\boldsymbol v}}}\right \}:\quad \dot {{\boldsymbol{r}}}={{{\boldsymbol v}}}\quad \dot {{{{\boldsymbol v}}}}= -\frac {\gamma }{m}{{{\boldsymbol v}}}(t). \end{equation}

Here ${\boldsymbol \varGamma }\!\left (t\right )$ has 2f dimensions, where f is the number of degrees of freedom. The domain of ${\boldsymbol \varGamma }\!\left (t\right )$ is sometimes called phase space. The evolution of ${\boldsymbol \varGamma }\!\left (t\right )$ is formally expressed as

(2.95)

\begin{equation} \frac {\textrm{d}}{\textrm{d}t}{\boldsymbol \varGamma }\!\left (t\right )= \dot {{\boldsymbol \varGamma }}\!\left ({\boldsymbol \varGamma },t\right ). \end{equation}

Given ${\boldsymbol \varGamma }({{\boldsymbol \varGamma }}_{{\mathbf 0}})\to {{\boldsymbol \varGamma }}_t\equiv {\boldsymbol \varGamma }\!\left (t\vert {{\boldsymbol \varGamma }}_{{\mathbf 0}}\right )$ is determined. Keep in mind that $\boldsymbol \varGamma$ represents the phase-space-independent variable $\left \{{\boldsymbol{r}},{{{\boldsymbol v}}}\right \}$ . For three spatial dimensions, ${{\boldsymbol \varGamma }}$ – space has 6f dimensions. For a specified fixed time t, ${\boldsymbol \varGamma}\to \ \delta ({\boldsymbol{x}}-{{\boldsymbol{x}}}_i,{\boldsymbol{p}}-{{\boldsymbol{p}}}_i)$ in probability. Hence,

(2.96a)

\begin{equation} \rho \!\left ({\boldsymbol\varGamma};\textit{t}\vert {{\boldsymbol \varGamma }}_0\right )=\delta \!\left ({\boldsymbol \varGamma }-{\boldsymbol \varGamma }\!\left (\textit {t}\vert {{\boldsymbol \varGamma }}_0\right )\right ), \end{equation}

(2.96b)

\begin{equation} \rho \!\left ({\boldsymbol \varGamma};\textit{t}\vert {{\boldsymbol \varGamma }}_0\right )\rho \!\left ({{\boldsymbol \varGamma }}_0\right )=\rho \!\left ({{\boldsymbol \varGamma }}_t,{{\boldsymbol \varGamma }}_0\right )\quad \rho ({\boldsymbol\varGamma};\textit{t})\equiv \int\nolimits {\textrm{d}{\varGamma }_0{\boldsymbol {\Lambda}}\rho \!\left ({{\boldsymbol \varGamma }}_t,{{\boldsymbol \varGamma }}_0\right )}. \end{equation}

How does $\rho ({\boldsymbol \varGamma };\textrm {t)}$ evolve with time t? This is the fundamental question of nonequilibrium statistical mechanics.

For a state function $A({\boldsymbol \varGamma }\textrm {)}$ ,

(2.97)

\begin{equation} \langle A\rangle (t)\equiv \int\nolimits {\textrm{d}\varGamma \;A{\mathbf (}{\boldsymbol \varGamma }{\mathbf )}\rho \!\left ({\boldsymbol\varGamma};t\right )}, \frac {\textrm{d}}{\textrm{d}t}\langle A\rangle =\int\nolimits {\textrm{d}\varGamma \;A({\boldsymbol \varGamma })\frac {\textrm{d}\rho }{\textrm{d}t}\!\left ({\boldsymbol\varGamma};t\right )}. \end{equation}

Thus, we evaluate

(2.98)

\begin{align} \frac {\partial}{\partial t}\ \rho \!\left ({\boldsymbol\varGamma};t\vert {{\boldsymbol \varGamma }}_0\right )&=\frac {\partial}{\partial {\boldsymbol{x}}}\delta \!\left ({\boldsymbol{x}}\right )\cdot \frac {\partial {\boldsymbol{x}}}{\partial t}=-\frac {\partial}{\partial {\boldsymbol \varGamma }}\delta \!\left ({\boldsymbol{x}}\right )\cdot \ \dot {{\boldsymbol \varGamma }}({\boldsymbol \varGamma },t{\mathbf )} \frac {\partial}{\partial {\boldsymbol \varGamma }}\delta \!\left ({\boldsymbol{x}}\right )=\frac {\partial}{\partial {\boldsymbol{X}}}\delta ({\boldsymbol{x}})\cdot \frac {\partial {\boldsymbol{X}}}{\partial {\boldsymbol \varGamma }}\nonumber\\[4pt]&=\frac {\partial}{\partial {\boldsymbol{X}}}\delta ({\boldsymbol{x}})\cdot {\boldsymbol{I}}, \end{align}

where ${\boldsymbol{X}}\equiv {\boldsymbol \varGamma }-{\boldsymbol \varGamma }\!\left (t\vert {{\boldsymbol \varGamma }}_0\right )$ , $\partial \delta(x) /\partial X=\partial \delta(X)/\partial{\boldsymbol\varGamma}$ , and ${\partial {\boldsymbol{X}}}/{\partial t}=-\dot {{\boldsymbol \varGamma }}({\boldsymbol \varGamma },t{\mathbf )}$ . Hence,

(2.99)

\begin{equation} \frac {\partial}{\partial t}\ \rho \!\left ({\boldsymbol\varGamma};t\vert {{\boldsymbol \varGamma }}_0\right )=-\frac {\partial}{\partial t}{\boldsymbol \varGamma }\!\left (t\vert {{\boldsymbol \varGamma }}_0\right )\cdot \frac {\partial}{\partial {\boldsymbol \varGamma }}\rho \!\left ({\boldsymbol\varGamma};t\vert {{\boldsymbol \varGamma }}_0\right )=-\frac {\partial}{\partial {\boldsymbol \varGamma }}\cdot \!\left (\frac {\partial}{\partial t}{\boldsymbol \varGamma }\!\left (t\vert {{\boldsymbol \varGamma }}_0\right )\rho \!\left ({\boldsymbol\varGamma};t\vert {{\boldsymbol \varGamma }}_0\right )\right ). \end{equation}

Note that ${\boldsymbol \varGamma }\!\left (t\vert {{\boldsymbol \varGamma }}_0\right )$ is not a function of $\boldsymbol \varGamma$ and ${\boldsymbol \varGamma }\!\left (t\vert {{\boldsymbol \varGamma }}_0\right )\ne {\boldsymbol \varGamma }$ . However, $({\partial }/{\partial t}){\boldsymbol \varGamma }\!\left (t\vert {{\boldsymbol \varGamma }}_0\right )=\dot {{\boldsymbol \varGamma }}\!\left ({\boldsymbol \varGamma }\!\left (t\vert {{\boldsymbol \varGamma }}_0\right ),t\right )$ ; and from (2.96a ) $\rho \!\left ({\boldsymbol\varGamma};t\vert {{\boldsymbol \varGamma }}_0\right )=\delta \!\left ({\boldsymbol \varGamma }-{\boldsymbol \varGamma }\!\left (t\vert {{\boldsymbol \varGamma }}_0\right )\right )$ ; hence, (2.99) becomes

(2.100)

\begin{equation} \frac {\partial}{\partial t}\ \rho \!\left ({\boldsymbol\varGamma};t\vert {{\boldsymbol \varGamma }}_0\right )=-\frac {\partial}{\partial {\boldsymbol \varGamma }}\cdot \!\left (\frac {\partial {\boldsymbol \varGamma }\!\left ({\boldsymbol \varGamma },t\right )}{\partial t}\rho \right )=-\frac {\partial}{\partial {\boldsymbol \varGamma }}\cdot \!\left (\dot {{\boldsymbol \varGamma }}{\mathbf (}{\boldsymbol \varGamma },t\textrm {)}\rho \!\left ({\boldsymbol\varGamma};t\vert {{\boldsymbol \varGamma }}_0\right )\right ). \end{equation}

Example: Consider a one-dimensional, field-free, viscous system to illustrate (2.100)

(2.101)

\begin{equation} \frac {\textrm{d}}{\textrm{d}t}V=-\gamma V,\quad \frac {\partial}{\partial t}\ \rho \!\left (V;t\vert V_0\right )=-\frac {\partial}{\partial V}\cdot \!\left (-\gamma V\rho \right ), \end{equation}

where ${\boldsymbol \varGamma }\to {\boldsymbol{V}}\to V$ in one dimension.

Now we return to (2.99) and integrate over the ${{\boldsymbol \varGamma }}_0$ domain:

(2.102)

\begin{align} \frac {\partial}{\partial t}\ \rho \!\left ({\boldsymbol\varGamma};t\right )&=\int\nolimits {\textrm{d}{\varGamma }_0\rho \!\left ({{\boldsymbol \varGamma }}_0\right )\textrm {}\frac {\partial}{\partial t}\ \rho \!\left ({\boldsymbol\varGamma};t\vert {{\boldsymbol \varGamma }}_0\right )=}-\int\nolimits {\textrm{d}{\varGamma }_0\rho \!\left ({{\boldsymbol \varGamma }}_0\right )}\frac {\partial}{\partial {\boldsymbol \varGamma }}\cdot \!\left (\dot {{\boldsymbol \varGamma }}\!\left ({\boldsymbol \varGamma },t\right )\rho \!\left ({\boldsymbol\varGamma};t\vert {{\boldsymbol \varGamma }}_0\right )\right )\nonumber \\[4pt] &=-\frac {\partial}{\partial {\boldsymbol \varGamma }}\cdot \int\nolimits {\textrm{d}{\varGamma }_0\rho \!\left ({{\boldsymbol \varGamma }}_0\right )}\dot {{\boldsymbol \varGamma }}\!\left ({\boldsymbol \varGamma },t\right )\rho \!\left ({\boldsymbol\varGamma};t\vert {{\boldsymbol \varGamma }}_0\right )=-\frac {\partial}{\partial {\boldsymbol \varGamma }}\cdot \!\left (\dot {{\boldsymbol \varGamma }}\!\left ({\boldsymbol \varGamma },t\right )\rho ({\boldsymbol\varGamma};t)\right ), \end{align}

which is the continuity equation for the probability in phase space.

Example: Return to the one-dimensional, field-free, viscous system in (2.101). We guess a solution

(2.103)

\begin{equation} \rho \!\left (V;t\right )=\frac {1}{\sqrt {2\pi {\sigma }^2(t)}}e^{-\ \frac {V^2}{2{\sigma }^2(t)}} \quad \textrm{and} \quad \sigma \!\left (t\right )={\sigma }_0e^{-\gamma t}. \end{equation}

Exercise: Check whether (2.103) is a solution of (2.101) and (2.102).

We return to (2.102) and expand the right-hand side of the continuity equation:

(2.104)

\begin{align} \textrm {}\frac {\partial}{\partial t}\ \rho \!\left ({\boldsymbol\varGamma};t\right )&=-\rho \frac {\partial}{\partial {\boldsymbol \varGamma }}\cdot \dot {{\boldsymbol \varGamma }}\!\left ({\boldsymbol \varGamma },t\right )-\dot {{\boldsymbol \varGamma }}\cdot \frac {\partial}{\partial {\boldsymbol \varGamma }}\rho \!\left ({\boldsymbol\varGamma};t\right )\nonumber\\[4pt]&\quad \to \!\left (\frac {\partial}{\partial t}+\dot {{\boldsymbol \varGamma }}\cdot \frac {\partial}{\partial {\boldsymbol \varGamma }}\right )\rho \!\left ({\boldsymbol\varGamma};t\right ) =-\rho \frac {\partial}{\partial {\boldsymbol \varGamma }}\cdot \dot {{\boldsymbol \varGamma }}\!\left ({\boldsymbol \varGamma },t\right ). \end{align}

We define the convective derivative ${D}/{Dt}\equiv ({\partial}/{\partial t})+\dot {{\boldsymbol \varGamma }}\cdot ({\partial}/{\partial {\boldsymbol \varGamma }})$ and (2.104) becomes

(2.105)

\begin{equation} \frac {D}{Dt}\ \rho \!\left ({\boldsymbol\varGamma};t\right )=-\rho \frac {\partial}{\partial {\boldsymbol \varGamma }}\cdot \dot {{\boldsymbol \varGamma }}\!\left ({\boldsymbol \varGamma },t\right ). \end{equation}

Example: With $\dot {V}=-\gamma V$ , then $({D}/{Dt})\ \rho =\gamma V$ and $\rho$ increases following the orbit.

For a Hamiltonian system $({\partial}/{\partial {\boldsymbol \varGamma }})\cdot \dot {{\boldsymbol \varGamma }}\!\left ({\boldsymbol \varGamma },t\right )=0$

Proof: We have

(2.106)

\begin{equation} \frac {\partial}{\partial {\boldsymbol \varGamma }}\cdot \dot {{\boldsymbol \varGamma }}\!\left ({\boldsymbol \varGamma },t\right )=\sum\limits _i{\frac {\partial}{\partial q_i}{\dot {q}}_i+\frac {\partial}{\partial p_i}{\dot {p}}_i=}\sum\limits _i{\frac {\partial}{\partial q_i}\frac {\partial H}{\partial p_i}-\frac {\partial}{\partial p_i}\frac {\partial H}{\partial q_i}\equiv 0}. \end{equation}

Thus, for a Hamiltonian system, $({D}/{Dt})\ \rho \!\left ({\boldsymbol\varGamma};t\right )=0$ ; or, more formally,

(2.107)

\begin{equation} \frac {\partial}{\partial t}\rho \!\left ({\boldsymbol\varGamma};t\right )+\left \{\rho ,H\right \}=0, \end{equation}

where $\left \{\rho ,H\right \}$ is the Poisson bracket. Equation (2.107) is a statement of Liouville’s theorem.

Example: Consider $\dot {{\boldsymbol{V}}}=({e}/{mc}){\boldsymbol{V}}\times {\boldsymbol{B}}\!\left ({\boldsymbol{r}},t\right )\to {\dot {V}}_i=({e}/{mc}){\varepsilon }_{ijk}V_jB_k$ We note that $\Sigma _i({\partial {\dot {x}}_i}/{\partial x_i})+$ ${\partial {\dot {V}}_i}/{\partial V_i}=\dots =0.$ With this force law the system is not a Hamiltonian system; nevertheless, the divergence of the flow field is zero. Thus, the flow can be incompressible independent of whether the system has a Hamiltonian or not.

2.3.2. Klimontovich phase space and distribution function

Consider the velocity (or momentum) and position phase space for N particles. For three spatial dimensions and three velocity dimensions, this is a six-dimensional by N points phase space.

Definition: This phase space corresponds to the Klimontovich space $\equiv \mu$ and the corresponding phase-space density distribution function is the Klimontovich distribution.

The equations of motion (assumed nonrelativistic) are

(2.108a)

\begin{equation} \frac {\mathrm{d}{{\boldsymbol{x}}}_i}{\textrm{d}t}={{{{\boldsymbol v}}}}_i, \end{equation}

(2.108b)

\begin{equation} \frac {\mathrm{d}{{{{\boldsymbol v}}}}_i}{\textrm{d}t}=\frac {1}{m_i}\left [f^{\text{ext}}_i\!\left ({{\boldsymbol{x}}}_i,{{{{\boldsymbol v}}}}_i,t\right )+\sum\limits _{j(\ne i)}{f_{i,j}}\right ]. \end{equation}

For ${\xi }_i$ defined as the vector defining the i ${}^{th}$ particle in the six-dimensional $\mu$ space, (2.108b ) can be written as

(2.109)

\begin{equation} \frac {\mathrm{d}{{\boldsymbol \xi }}_i(t)}{\textrm{d}t}={\dot {{\boldsymbol \xi }}}^{\text{ext}}_i ({{\boldsymbol \xi }}_i,t)+\sum\limits ^N_{j(\ne i)}{{\dot {\xi }}^{in}({{\boldsymbol \xi }}_i,{{\boldsymbol \xi }}_j)}. \end{equation}

Definition: Let $F\equiv$ density,

(2.110)

\begin{equation} F\equiv \sum\limits ^N_{i=1}{\delta ({\boldsymbol \xi }-{{\boldsymbol \xi }}_i(t))}. \end{equation}

With the use of the density, the second term on the right-hand side of (2.109) can be expressed as

(2.111)

\begin{equation} \sum\limits ^N_{j(\ne i)}{{\dot {{\boldsymbol \xi }}}^{in}({{\boldsymbol \xi }}_i,{{\boldsymbol \xi }}_j)}\to \int\nolimits {\mathrm{d}\xi 'F({{\boldsymbol \xi }}^{\prime} ){\dot {{\boldsymbol \xi }}}^{in}({{\boldsymbol \xi }}_i,{\boldsymbol \xi }')}, \end{equation}

(2.112)

\begin{equation} \frac {\mathrm{d}{{\boldsymbol \xi }}(t)}{\textrm{d}t}={\dot {{\boldsymbol \xi }}}^{\text{ext}}\!\left ({{\boldsymbol \xi }},t\right )+\int\nolimits {\textrm{d}\xi 'F({{\boldsymbol \xi }}^{\prime} ){\dot {{\boldsymbol \xi }}}^{in}({{\boldsymbol \xi }},{\boldsymbol \xi }')}={\dot {{\boldsymbol \xi }}}\!\left ({{\boldsymbol \xi }},t,F\right ). \end{equation}

We can take the partial derivative of (2.110) with respect to time

(2.113)

\begin{align} \frac {\partial F\!\left ({{\boldsymbol \xi }},t\right )}{\partial t}&=-\sum\limits _i{\frac {\textrm{d}{\boldsymbol \xi }{\mathbf (}{\boldsymbol{t}}{\mathbf )}}{\textrm{d}t}}\cdot \frac {\partial}{\partial {\boldsymbol \xi }}\; \delta ({\boldsymbol \xi }-{{\boldsymbol \xi }}_i(t))\nonumber \\[4pt] &=-\frac {\partial}{\partial {\boldsymbol \xi }}\cdot \sum\limits _i{\frac {\textrm{d}{\boldsymbol \xi }}{\textrm{d}t}}{{{\mathbf (}{\boldsymbol \xi }}_i},t,F{\mathbf )}\delta ({\boldsymbol \xi }-{{\boldsymbol \xi }}_i (t))\nonumber \\[4pt] &=-\frac {\partial}{\partial {\boldsymbol \xi }}\cdot {\bigg\{{{\frac {\textrm{d}{\boldsymbol \xi }}{\textrm{d}t}\textrm {(}}{\boldsymbol \xi }}},t,F{\mathbf )}\sum\limits _i{}\delta ({\boldsymbol \xi }-{{\boldsymbol \xi }}_i(t))\bigg\}\nonumber \\[4pt] &=-\frac {\partial}{\partial {\boldsymbol \xi }}\cdot \left \{{{{\dot {{\boldsymbol \xi }}\textrm {(}}{\boldsymbol \xi }}},t,F{\mathbf )}F({\boldsymbol \xi };t\textrm {)}\right \}. \end{align}

Equation (2.113) is a nonlinear partial differential equation in seven variables, the Klimontovich equation.

Lemma 1: Usually (unless the physical system is pathological)

(2.114)

\begin{equation} \frac {\partial}{\partial {\boldsymbol \xi }}\cdot \dot {{\boldsymbol \xi }}=0. \end{equation}

Lemma 2: Following from the previous lemma (if true) and (2.113),

(2.115)

\begin{equation} \!\left (\frac {\partial}{\partial t}+\dot {{\boldsymbol \xi }}\cdot \frac {\partial}{\partial {\boldsymbol \xi }}\right )F=0. \end{equation}

which is a Liouville-type equation.

Definition: We introduce the ensemble average of F over initial conditions for ${{\boldsymbol \xi }}_i$ :

(2.116)

\begin{equation} f_1\!\left ({\boldsymbol \xi },\textit {t}\right )\equiv \langle F\rangle \!\left ({\boldsymbol \xi },t\right ) =\int\nolimits {\textrm{d}{\varGamma }_0\rho ({{\boldsymbol \varGamma }}_0)}\sum\limits _i{\delta ({\boldsymbol \xi }-{{\boldsymbol \xi }}_i(t\vert {{\boldsymbol \varGamma }}_0))} \end{equation}

The mean value of the Klimontovich equation (2.113) is then

(2.117)

\begin{align} \frac {\partial}{\partial t}f_1\!\left ({\boldsymbol \xi }\textrm {,t}\right )&=-\frac {\partial}{\partial {\boldsymbol \xi }}\cdot \langle {\dot {{\boldsymbol \xi }}}F\rangle\nonumber\\[4pt]& = -\frac {\partial}{\partial {\boldsymbol \xi }}\cdot \left \{{{\dot {{\boldsymbol \xi }}}}^{\text{ext}}\!\left ({\boldsymbol \xi },t\right )f_1\!\left ({\boldsymbol \xi };t\right )\right \}-\frac {\partial}{\partial {\boldsymbol \xi }}\cdot \int\nolimits {\textrm{d}{\boldsymbol \xi }'}{{\dot {{\boldsymbol \xi }}}}^{in}\!\left ({\boldsymbol \xi },{\boldsymbol \xi }{\mathbf '}\right )\langle F\!\left ({{\boldsymbol \xi }}^{\prime} ,{\boldsymbol{t}}\right )F{\mathbf (}{\boldsymbol \xi },{\boldsymbol{t}}{\mathbf )}\rangle . \end{align}

Definition: In (2.117) we have introduced the two-position correlation function

(2.118)

\begin{align} \langle F({{\boldsymbol \xi }}^{\prime} ,{\boldsymbol{t}} )F({\boldsymbol \xi },{\boldsymbol{t}})\rangle &\equiv \sum\limits _i{\sum\limits _j{\langle \delta ({\boldsymbol \xi }-{{\boldsymbol \xi }}_{{\boldsymbol{i}}})\delta ({{\boldsymbol \xi }}^{\prime} -{{\boldsymbol \xi }}_{{\boldsymbol{j}}})\rangle }} \nonumber \\[4pt]&= \delta ({\boldsymbol \xi }-{\boldsymbol \xi }{\mathbf '})f_1({\boldsymbol \xi }\textit {,t})+\sum\limits _{i\ne j}{{\langle {F_i({\boldsymbol \xi };t)F}_j({{\boldsymbol \xi }}^{\prime} ;t)\rangle }} \end{align}

and $F_i\!\left ({\boldsymbol \xi };t\right )\equiv \ \delta \!\left ({\boldsymbol \xi }-{{\boldsymbol \xi }}_{{\boldsymbol{i}}}\right )$ . The first term on the right-hand side of (2.118) is zero to preclude self-forces.

[Editor’s Note: A slightly cleaner notation in which the double sum has $i\ne j$ to exclude self-forces could have been employed.]

Now expand the ensemble-average bracket after introducing $\delta F_i=F_i-\langle F_i\rangle$ :

(2.119)

\begin{equation} \langle F_iF_j\rangle =\langle (\langle F_i\rangle +\delta F_i)(\langle F_j\rangle +\delta F_j )\rangle =\langle F_i\rangle \langle F_j\rangle +\langle \delta F_i\delta F_j\rangle \end{equation}

using the identity $\langle \delta F_i\rangle =0.$ We note that $F_i$ is a single term out of the N terms in the sum over i leading to $f_1$ ; hence, $F_i\to O({1}/{N})\ f_1\!\left ({\boldsymbol \xi }\right )$ and $F_j\to O({1}/{N})\ f_1\!\left ({\boldsymbol \xi }{\mathbf '}\right )$ ; and the sum $\Sigma _{i\ne j}{{ \to N(N-1)}}$ pairs. These arguments lead to

(2.120)

\begin{equation} \textrm {}\langle F\!({{\boldsymbol \xi }}^{\prime} )F\!({\boldsymbol \xi })\rangle = \delta ({\boldsymbol \xi }-{\boldsymbol \xi }{\mathbf '})f_1({\boldsymbol \xi })+\bigg(1-\frac {1}{N}\bigg)f_1 ({\boldsymbol \xi })f_1 ({\boldsymbol \xi }{\mathbf '})+\sum\limits _{i\ne j}{{\langle {{\delta F}_i ({\boldsymbol \xi })\delta F}_j({{\boldsymbol \xi }}^{\prime} )\rangle }}. \end{equation}

N is large; so we drop the 1/N term in (2.120). We identify the last term

(2.121)

\begin{equation} \ h_2\equiv \sum\limits _{i\ne j}{{\langle {{\delta F}_{\!i} ({\boldsymbol \xi })\delta F}_{\!j} ({{\boldsymbol \xi }}^{\prime})\rangle }} \end{equation}

as the two-particle correlation which accounts for the forces between particles. We also drop the first term on the right side of (2.120) $\delta \!\left ({\boldsymbol \xi }-{\boldsymbol \xi }{\mathbf '}\right )f_1\!\left ({\boldsymbol \xi }\right )$ because the contribution to ${\dot {{\boldsymbol \xi }}}^{in}$ from this term must be zero because self-forces are disallowed. If we assume the lemma (2.114) is valid and can commute $({\partial }/{\partial {\boldsymbol \xi }})\cdot {{\dot {{\boldsymbol \xi }}}}$ leading to (2.115), then (2.117) becomes

(2.122)

\begin{eqnarray} \left [\frac {\partial}{\partial t}+{\dot {{\boldsymbol \xi }}}^{\text{ext}}\!\left ({\boldsymbol \xi },t\right )\cdot \frac {\partial}{\partial {\boldsymbol \xi }}+\int\nolimits {\textrm{d}{\xi }^{\prime} }{{\dot {{\boldsymbol \xi }}}}^{in}\!\left ({\boldsymbol \xi },{{\boldsymbol \xi }}^{\prime} \right )f_1({{\boldsymbol \xi }}^{\prime} )\cdot \frac {\partial}{\partial {\boldsymbol \xi }}\right ]f_1\!\left ({\boldsymbol \xi };t\right )\nonumber \\[4pt] =-\frac {\partial}{\partial {\boldsymbol \xi }}\cdot \int\nolimits {\textrm{d}{\xi }^{\prime} }{{\dot {{\boldsymbol \xi }}}}^{in}\!\left ({\boldsymbol \xi },{{\boldsymbol \xi }}^{\prime} \right )h_2\!\left ({\boldsymbol \xi },{{\boldsymbol \xi }}^{\prime} ;t\right ). \end{eqnarray}

We note that the second term on the left-hand side of (2.122) derives from the external force(s), and the third term contains the mean internal force. Equation (2.122) is not a closed equation determining the evolution of the distribution $f_1\!\left ({\boldsymbol \xi };t\right )$ because h ${}_{2}$ the two-particle correlation appears. The next step will be to get an equation for h ${}_{2}$ , but this will drag in h ${}_{3}$ , etc., that is, the Bogoliubov–Born–Green–Kirkwood–Yvon (BBGKY) hierarchy emerges.

If the correlations vanish, the right-hand side of (2.122) is zero and the Vlasov equation results:

(2.123)

\begin{equation} \left [\frac {\partial}{\partial t}+{\dot {{\boldsymbol \xi }}}^{\text{ext}}\!\left ({\boldsymbol \xi },t\right )\cdot \frac {\partial}{\partial {\boldsymbol \xi }}+\int\nolimits {\textrm{d}{\xi }^{\prime} }{{\dot {{\boldsymbol \xi }}}}^{in}\!\left ({\boldsymbol \xi },{{\boldsymbol \xi }}^{\prime} \right )f_1({{\boldsymbol \xi }}^{\prime} )\cdot \frac {\partial}{\partial {\boldsymbol \xi }}\right ]f_1\!\left ({\boldsymbol \xi };t\right )=0. \end{equation}

Definition: Introduce $f_2\!\left ({\boldsymbol \xi },{{\boldsymbol \xi }}^{\prime} \right )$ defined by

(2.124)

\begin{equation} f_2\!\left ({\boldsymbol \xi },{{\boldsymbol \xi }}^{\prime} \right )\equiv \sum\limits _{i\ne j}{\langle F_i({\boldsymbol \xi }\textrm{)}F_{{\boldsymbol{j}}}({\boldsymbol \xi }{\mathbf '}\textrm {)}\rangle }, \end{equation}

Lemma: Given (2.119) and the definition of h ${}_{2}$ in (2.121) then

(2.125)

\begin{equation} f_2\!\left ({\boldsymbol \xi },{{\boldsymbol \xi }}^{\prime} \right )=f_1\!\left ({\boldsymbol \xi }\right )f_1\!\left ({{\boldsymbol \xi }}^{\prime} \right )+h_2\!\left ({\boldsymbol \xi },{{\boldsymbol \xi }}^{\prime} \right ). \end{equation}

From (2.125) we note that

(2.126)

\begin{equation} h_2\equiv \langle FF\rangle -f_1f_1 \to \frac {\partial h_2}{\partial t}=\bigg\langle \frac {\partial F}{\partial t}F\bigg\rangle +\bigg\langle F\frac {\partial F}{\partial t}\bigg\rangle -\frac {\partial f_1}{\partial t}f_1-f_1\frac {\partial f_1}{\partial t}. \end{equation}

2.4. Landau equation

In the previous section we derived (2.123) as the collisionless limit of (2.122). Progress can be made evaluating the right-hand side of (2.122) in particular limits. If the gas is sufficiently dilute, then the Boltzmann parameter is small, $na^3_0\ll 1$ . A simple representation of the collision operator can be derived if the particle interaction potential is small, ${\phi }/{T}\ll 1$ , leading to the Landau equation. Another limit useful for plasmas is obtained when $\!\left (e^2/{\lambda }_D\right )/T$ $\ll$ 1, and the Lenard–Balescu–Guernsey equation can be derived.

2.4.1. Derivation of the Landau equation

To derive the Landau equation we assume ${\phi }/{T}\ll 1$ and consider a plasma which is weakly coupled, i.e., the interaction potential and collisions are weak. In this limit, $h_2\sim \varepsilon \sim \phi _{ij}$ ; and $h_3\sim \textrm{force}\ \times \ h_2\sim {\varepsilon }^2$ , which is higher order in $\varepsilon$ and will be discarded. This truncates the hierarchy and allows the set of equations to be closed. Recall from the previous section, the most severe truncation of the hierarchy arises when $h_2$ and all higher interaction terms are discarded, in which limit the Vlasov equation (2.123) is obtained. From (2.126) we have

(2.127)

\begin{eqnarray} \left [\frac {\partial}{\partial t}+{{{\boldsymbol v}}}\cdot \frac {\partial}{\partial {\boldsymbol{r}}}+{{{{\boldsymbol v}}}}^{\prime} \cdot \frac {\partial}{\partial {{\boldsymbol{r}}}^{\prime} }+{\dot {{\boldsymbol{p}}}}^{\text{ext}}\!\left ({\boldsymbol{p}}\right )\cdot \frac {\partial}{\partial {\boldsymbol{p}}}+{\dot {{\boldsymbol{p}}}}^{\text{ext}}\!\left ({\boldsymbol{p}}'\right )\cdot \frac {\partial}{\partial {\boldsymbol{p}}'}\right ]h_2\!\left ({\boldsymbol{r}},{{\boldsymbol{r}}}^{\prime} ,{\boldsymbol{p}},{{\boldsymbol{p}}}^{\prime} ;t\right )\nonumber \\[4pt]={\boldsymbol{f}}({\boldsymbol{r}},{{\boldsymbol{r}}}^{\prime} )\cdot \!\left (\frac {\partial}{\partial {{\boldsymbol{p}}}^{\prime} }-\frac {\partial}{\partial {\boldsymbol{p}}}\right )f_1({\boldsymbol{r}},{\boldsymbol{p}};t)f_1({\boldsymbol{r}}',{\boldsymbol{p}}';t).\quad \end{eqnarray}

In (2.127) ${\boldsymbol{f}}({\boldsymbol{r}},{{\boldsymbol{r}}}^{\prime} )$ is the electric field force.

Next consider a system in which there is no external forces and to further simplify we assume that system is uniform so that $f\to f({\boldsymbol{p}};t)$ . With these simplifications and defining ${\boldsymbol{s}}={\boldsymbol{r}}-{\boldsymbol{r}}'$ (2.127) becomes

(2.128)

\begin{equation} \left [\frac {\partial}{\partial t}+({{{\boldsymbol v}}}-{{{{\boldsymbol v}}}}^{\prime} )\cdot \frac {\partial}{\partial {\boldsymbol{s}}}\right ]h_2\!\left ({\boldsymbol{s}},{\boldsymbol{p}},{{\boldsymbol{p}}}^{\prime} ;t\right )=-\frac {\partial \phi ({\boldsymbol{s}})}{\partial {\boldsymbol{s}}}\cdot \!\left (\frac {\partial}{\partial {{\boldsymbol{p}}}^{\prime} }-\frac {\partial}{\partial {\boldsymbol{p}}}\right )f_1({\boldsymbol{p}};t)f_1({\boldsymbol{p}}';t). \end{equation}

Definition: The Fourier transform of $g\!\left ({\boldsymbol{s}}\right )$ is $g\!\left ({\boldsymbol{k}}\right )=\smallint {\textrm{d}^3}\textrm {s}\ g({\boldsymbol{s}})e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{s}}}.$

The Fourier transform of (2.128) is then

(2.129)

\begin{equation} \left [\frac {\partial}{\partial t}+({{{\boldsymbol v}}}-{{{{\boldsymbol v}}}}^{\prime} )\cdot i{\boldsymbol{k}}\right ]h_2\!\left ({\boldsymbol{k}},{\boldsymbol{p}},{{\boldsymbol{p}}}^{\prime} ;t\right )=-i{\boldsymbol{k}}\phi ({\boldsymbol{k}})\cdot \!\left (\frac {\partial}{\partial {{\boldsymbol{p}}}^{\prime} }-\frac {\partial}{\partial {\boldsymbol{p}}}\right )f_1({\boldsymbol{p}};t)f_1({\boldsymbol{p}}';t) \end{equation}

Equation (2.129) is a quasilinear differential equation that is first order in time. The corresponding limit of (2.122) for $f_1$ is

(2.130)

\begin{equation} \frac {\partial}{\partial t}f_1\!\left ({\boldsymbol{p}};t\right )=-\frac {\partial}{\partial {\boldsymbol{p}}}\cdot \int\nolimits {\textrm{d}^3\textit {r}'\textrm{d}^3{\boldsymbol{p}}'\!\left (-\frac {\partial \phi }{\partial {\boldsymbol{r}}}({\boldsymbol{s}})h_2\!\left ({\boldsymbol{s}},{\boldsymbol{p}},{{\boldsymbol{p}}}^{\prime} ;t\right )\right )}. \end{equation}

Fourier transforming (2.130) and using the convolution theorem, (2.130) becomes

(2.131)

\begin{equation} \frac {\partial}{\partial t}f_1\!\left ({\boldsymbol{p}};t\right )=\frac {\partial}{\partial {\boldsymbol{p}}}\cdot \int\nolimits {\frac {\textrm{d}^3\textit{k}}{{\!\left (2\pi \right )}^3}\textrm{d}^3\textit{p}'\!\left ({\left [i{\boldsymbol{k}}\phi \!\left ({\boldsymbol{k}}\right )\right ]}^*h_2\!\left ({\boldsymbol{k}},{\boldsymbol{p}},{{\boldsymbol{p}}}^{\prime} ;t\right )\right )}. \end{equation}

We note that for $({\partial}/{\partial t})f_1\to -({\partial}/{\partial t})f_1$ and ${\boldsymbol{p}}\to -{\boldsymbol{p}}$ in (2.131) the sign changes on both the left- and right-hand sides of (2.131); hence, (2.131) appears to be fully time reversible. Thus far, there is no ‘H-theorem’ in (2.127)–(2.131).

With respect to the reversibility or irreversibility of (2.131) consider the following:

(2.132)

\begin{equation} \!\left (\frac {\partial}{\partial t}+\alpha \right )h_2=g\!\left (t\right ) \to \alpha =0\quad \frac {\partial}{\partial t}h\!\left (t\right )=g\!\left (t\right )\ \ \ \end{equation}

which has solutions

(2.133a)

\begin{equation} h\!\left (t\right )=\int\nolimits ^t_{-\infty }{\textrm{d}t^{\prime} g(t^{\prime})+h\!\left (-\infty \right )} \end{equation}

or based on future values

(2.133b)

\begin{equation} h\!\left (t\right )=h\!\left (\infty \right )-\int\nolimits ^{\infty }_t{\textrm{d}t^{\prime} g(t^{\prime})}. \end{equation}

The solution in (2.133b ) is disturbing because of causality considerations $.$ It is irreversibility that provides a philosophical basis for not solving history problems backwards. Macroscopic variables lead to equations that do not tolerate ‘backward’ or ‘final-value’ problems. However, the basic equations (2.127)–(2.131) at the moment are fully time reversible. More work is needed to derive an irreversible kinetic equation.

We return to the solutions of (2.128) and consider first the homogeneous equation in the nonrelativistic limit. With the masses $m=m^{\prime} \equiv 1\ \textrm {and}\,{\boldsymbol{w}}\equiv {{{\boldsymbol v}}}-{{{{\boldsymbol v}}}}^{\prime} ,$ the solution to the homogeneous limit (right-hand side equals zero) of (2.128) is simply

(2.134)

\begin{equation} h_2\!\left ({\boldsymbol{k}},{\boldsymbol{p}};{{\boldsymbol{p}}}^{\prime} ;t\right )=h_2\!\left ({\boldsymbol{k}},{\boldsymbol{p}};{{\boldsymbol{p}}}^{\prime} ;0\right )e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{w}}t}. \end{equation}

The particular solution of (2.128) is

(2.135)

\begin{align} h_2\!\left ({\boldsymbol{k}},{\boldsymbol{p}};{{\boldsymbol{p}}}^{\prime} ;t\right )&=h_2\!\left ({\boldsymbol{k}},{\boldsymbol{p}};{{\boldsymbol{p}}}^{\prime} ;0\right )e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{w}}t}\nonumber \\[4pt] &\quad +\int\nolimits ^t_0{\textrm{d}t^{\prime} e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{w}}\left (t-t^{\prime} \right )}\phi \!\left ({\boldsymbol{k}}\right )i{\boldsymbol{k}}\cdot \!\left (\frac {\partial}{\partial {\boldsymbol{p}}}-\frac {\partial}{\partial {\boldsymbol{p}}'}\right )\!\left (f_1({\boldsymbol{p}};t')f_1({\boldsymbol{p}}';t)\right )}. \end{align}

The solution in (2.135) is the causal solution to the initial-value problem. The interaction embodied in $\phi$ has been identified as the cause of the correlation.

Definition: $\tau \equiv t-t'$ .

Equation (2.135) becomes

(2.136)

\begin{align} h_2\!\left ({\boldsymbol{k}},{\boldsymbol{p}};{{\boldsymbol{p}}}^{\prime} ;t\right )&=h_2\!\left ({\boldsymbol{k}},{\boldsymbol{p}};{{\boldsymbol{p}}}^{\prime} ;0\right )e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{w}}t}\nonumber\\[4pt]&\quad +\int\nolimits ^t_0{\textrm{d}\tau e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{w}}{\boldsymbol \tau }}\phi \!\left ({\boldsymbol{k}}\right )i{\boldsymbol{k}}\cdot \!\left (\frac {\partial}{\partial {\boldsymbol{p}}}-\frac {\partial}{\partial {\boldsymbol{p}}'}\right )\!\left (f_1({\boldsymbol{p}},\textit {t}-\tau )f_1({{\boldsymbol{p}}}^{\prime} ,t)\right )}. \end{align}

We note that $\tau \gt 0$ is always causal. With (2.136) used for h ${}_{2}$ (2.131) becomes

(2.137)

\begin{equation} \frac {\partial}{\partial t}f_1\!\left ({\boldsymbol{p}};t\right )=\frac {\partial}{\partial {\boldsymbol{p}}}\cdot \int\nolimits {\frac {\textrm{d}^3\textit{k}}{{\!\left (2\pi \right )}^3}\textrm{d}^3{{\boldsymbol{p}}}^{\prime} (-i{\boldsymbol{k}})\phi ^*{({\boldsymbol{k}}{\mathbf )}h}_2\!\left ({\boldsymbol{k}},{\boldsymbol{p}},{{\boldsymbol{p}}}^{\prime} ;0\right )e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{w}}t}+\dots } \end{equation}

We wish to show that the effect of initial correlations (the first term for h ${}_{2}$ in (2.136) used in (2.137)) falls off rapidly and only the second term in (2.136), i.e., recent collisions, persists.

Lemma (Riemann–Lebesgue): The Fourier transform of an L ${}^{1}$ function vanishes at infinity. The Fourier transform of a smooth function falls off rapidly in transform space, e.g., the transform of h ${}_{2}$ falls off for large ${\boldsymbol{w}}t.$

For a system at or near thermal equilibrium $h_2\to \beta \ \phi \!\left ({\boldsymbol{k}}\right )f_1({\boldsymbol{p}},0)f_1({{\boldsymbol{p}}}^{\prime} ,0)$ . Thus, we need only show

(2.138)

\begin{equation} \frac {\partial}{\partial {\boldsymbol{w}}t}\ \left \{\int\nolimits {\frac {\textrm{d}^3\textit{k}}{{\!\left (2\pi \right )}^3}{\big\vert \phi (k)\big\vert }^2}e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{w}}t}\right \} \to 0\, \textrm {rapidly}. \end{equation}

Example: For $\phi \!\left (s\right )\sim e^{(-{s^2}/{2a^2})} \to {\vert \phi (k)\vert }^2\sim e^{-k^2a^2}$ and $\smallint \textrm{d}^3\textit{k}\ e^{-({k^2a^2}/{2}) -i\textit{k}\cdot {\boldsymbol{w}}t} \to$ $e^{-({{\vert {\boldsymbol{w}}{\boldsymbol{t}}\vert }^2}/{4a^2})}$ . From this last expression we conclude that the characteristic time t in which the correlation falls off is set by $t\sim a/\vert {\boldsymbol{w}}\vert \sim a/\overline {{{{v}}}}$ where $a$ is the range of the interaction, which is typically a small microscopic distance. Hence, the initial correlations disappear rapidly. For particles with ${\boldsymbol{w}} \sim 0,$ co-traveling with the test particle, their contribution to $f_1\!\left ({{{\boldsymbol v}}}\right )$ is small. Furthermore, three-particle correlations will destroy this special case.

We conclude that the contribution to h ${}_{2}$ from the initial correlation, the first term on the right-hand side of (2.135) is subdominant to the second term in contributing to the right-hand side of (2.137); hence, (2.137) becomes

(2.139)

\begin{align} &\frac {\partial}{\partial t}f_1\!\left ({\boldsymbol{p}};t\right )\nonumber\\[4pt]&=\frac {\partial}{\partial {\boldsymbol{p}}}\cdot \int\nolimits {\textrm{d}^3{\textit{p}}^{\prime} \int\nolimits ^t_0{\textrm{d}\tau }{\int\nolimits {\frac {\textrm{d}^3\textit{k}}{{\!\left (2\pi \right )}^3}}{\boldsymbol{k}}{\vert \phi (k)\vert }^{2}{e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{w}}\tau }}}}{\boldsymbol{k}}\cdot \!\left (\frac {\partial}{\partial {\boldsymbol{p}}}-\frac {\partial}{\partial {\boldsymbol{p}}'}\right )\!\left (f_1({\boldsymbol{p}};t-\tau )f_1({{\boldsymbol{p}}}^{\prime} ,t)\right ). \end{align}

The integrand in (2.139) falls off for $\tau \gg a/\overline {{{{v}}}}$ , which allows us to extend the integral $\smallint\nolimits ^t_0{\textrm{d}\tau }\to \smallint\nolimits ^{\infty }_0{\textrm{d}\tau }.$ Equation (2.139) is a closed kinetic equation for $f_1.$ It is irreversible and depends only on earlier times. Equation (2.139) with the time integral extended to $\infty$ is

(2.140)

\begin{align}& \frac {\partial}{\partial t}f_1\!\left ({\boldsymbol{p}};t\right )\nonumber\\[4pt]&=\frac {\partial}{\partial {\boldsymbol{p}}}\cdot \int\nolimits {\textrm{d}^3{\textit{p}}^{\prime} \int\nolimits ^{\infty }_0{\textrm{d}\tau }{\int\nolimits {\frac {\textrm{d}^3\textit{k}}{{\!\left (2\pi \right )}^3}}{\boldsymbol{k}}{\vert \phi (k)\vert }^{2}{e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{w}}\tau }}}}{\boldsymbol{k}}\cdot \!\left (\frac {\partial}{\partial {\boldsymbol{p}}}-\frac {\partial}{\partial {\boldsymbol{p}}'}\right )\!\left (f_1({\boldsymbol{p}};t-\tau )f_1({{\boldsymbol{p}}}^{\prime} ,t)\right ). \end{align}

We argue that only present times matter in $f_1f_1$ and $f_1$ can be expanded in time: $f_1\!\left ({\boldsymbol{p}};t-\tau \right )=f_1\!\left ({\boldsymbol{p}};t\right )-\tau ({\partial f_1}/{\partial t})({\boldsymbol{p}};t$ ) subject to $\tau \lt {\tau }_{\mathrm{coll}}\sim a/\overline {{{{v}}}}$ which is a small collision duration. We note that ${\partial f_1}/{\partial t}\sim {f_1}/{{\tau }_f}$ where ${\tau }_f$ is a very large time corresponding to the time between collisions, ${\tau }_f\sim \ell /\overline {{{{v}}}}$ . Once again there are two time scales present and ${{\tau }_{\mathrm{coll}}}/{{\tau }_f}\ll 1$ . Hence, $f_1({\boldsymbol{p}};t-\tau )f_1({{\boldsymbol{p}}}^{\prime} ,t-\tau )\to f_1({\boldsymbol{p}};t)f_1({{\boldsymbol{p}}}^{\prime} ,t).$

Lemma: We have $\smallint \textrm{d}\tau \ e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{w}}\tau }=\pi \delta \!\left ({\boldsymbol{k}}\cdot {\boldsymbol{w}}\right )+{P}/({i{\boldsymbol{k}}\cdot {\boldsymbol{w}}})$ , where $P$ denotes the principal value. We use this lemma to evaluate the right-hand side of (2.140). We note that the second term in the lemma is an odd function of k and will never lead to a contribution.

We arrive at the Landau equation

(2.141)

\begin{equation} \frac {\partial}{\partial t}f_1\!\left ({\boldsymbol{p}};t\right )=\frac {\partial}{\partial {\boldsymbol{p}}}\cdot \int\nolimits {\textrm{d}^3{{\boldsymbol{p}}}^{\prime} \int\nolimits {\frac {\textrm{d}^3\textit{k}}{{\!\left (2\pi \right )}^3}}{\boldsymbol{k}}{{\boldsymbol{k}}\vert \phi \!\left (k\right )\vert }^2}\pi \delta ({\boldsymbol{k}}\cdot {\boldsymbol{w}}{\mathbf )}\cdot \!\left (\frac {\partial}{\partial {\boldsymbol{p}}}-\frac {\partial}{\partial {\boldsymbol{p}}'}\right )\!\left (f_1({\boldsymbol{p}};t)f_1({{\boldsymbol{p}}}^{\prime} ,t)\right ). \end{equation}

Let us recap the assumptions in arriving at the Landau equation. The simplifying assumptions are spatial uniformity, no external forces, and a single species. The essential assumptions are weak coupling ( $\phi$ $\ll$ T) leading to two disparate time scales (fast collision time and slower relaxation time), and the kinetic equation describes an initial-value problem.

2.4.2. Elaboration of the Landau equation and derivation of an H-theorem

In § 2.2.3 we analyzed the continuity equation for the phase-space density:

(2.142)

\begin{equation} \frac {\partial f}{\partial t}\!\left ({\boldsymbol{p}};t\right )=-\frac {\partial \textrm {}}{\partial {\boldsymbol{p}}}\cdot \tilde {{\boldsymbol \varGamma }}\textrm {(}{\boldsymbol{p}};t\textrm {)}, \end{equation}

where $\tilde {{\boldsymbol \varGamma }}$ is the particle flux. The ordinary number density is

(2.143)

\begin{equation} n(t)\equiv \int\nolimits {\textrm{d}^3\textit{p}f({\boldsymbol{p}};t)}. \end{equation}

Conservation of particles in phase space derives from the time derivative of (2.143) and the application of boundary conditions:

(2.144)

\begin{equation} \frac {\textrm{d}n}{\textrm{d}t}=\int\nolimits {\textrm{d}^3\textit{p}\frac {\partial f}{\partial t}({\boldsymbol{p}};t)}=-\int\nolimits {\textrm{d}^3\textit{p}}\frac {\partial \textrm {}}{\partial {\boldsymbol{p}}}\cdot \tilde {{\boldsymbol \varGamma }}\!\left ({\boldsymbol{p}};t\right )=-\oint {\textrm{d}{\boldsymbol \sigma }\cdot }\tilde {{\boldsymbol \varGamma }}\!\left ({\boldsymbol{p}};t\right )=0 \end{equation}

assuming ${\tilde {{\boldsymbol \varGamma }}\vert }_{\delta {\boldsymbol \sigma }=\infty }=0$ . In this case the flux has been assumed to vanish on the system boundaries. If instead ${\tilde {{\boldsymbol \varGamma }}\vert }_{\delta {\boldsymbol \sigma }}\ne 0$ , then there is a flow into or out of the region bounded by $\delta {\boldsymbol \sigma }.$ The continuity equation (2.142) is a statement of continuous flow in momentum space. In the Boltzmann collision equation there are large-angle scatters ( $\equiv$ strong interactions), in which case there is no continuous flow in momentum space. In a collision a particle will disappear from one position in momentum space and appear instantaneously elsewhere (unless the collision process is time-resolved on a microscopic time scale). From § 2.4.1

(2.145)

\begin{equation} \tilde {{\boldsymbol \varGamma }}\!\left ({\boldsymbol{p}};t\right )={\int\nolimits {\textrm{d}^3\textit{p}^{\prime}}} {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )\cdot \!\left (\frac {\partial}{\partial {{\boldsymbol{p}}}^{\prime} }-\frac {\partial}{\partial {\boldsymbol{p}}}\right )f\!\left ({\boldsymbol{p}};t\right )f\!\left ({{\boldsymbol{p}}}^{\prime} ;t\right ),\quad \,{\boldsymbol{w}}\equiv {{{\boldsymbol v}}}-{{{\boldsymbol v}}}', \end{equation}

(2.146)

\begin{equation} {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )=\int\nolimits {\frac {\textrm{d}^3\textit{k}}{{\!\left (2\pi \right )}^3}{\vert \phi (\textit{k})\vert }^2{\boldsymbol{kk}}\pi \delta ({\boldsymbol{k}}\cdot {\boldsymbol{w}})}.\quad\qquad\qquad\qquad\qquad\qquad \end{equation}

In (2.146) $\phi \!\left (\textit{k}\right )$ is the Fourier transform of the interaction potential. Note: An alternative derivation using the Born approximation and the Fermi golden rule yields exactly the same results.

Lemma: ${\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )$ is manifestly symmetric, ${\boldsymbol{w}}\cdot {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )=0,$ and is positive-semi-definite

(2.147)

\begin{equation} ( \equiv {\boldsymbol{a}}\cdot {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )\cdot {\boldsymbol{a}}\ge 0). \end{equation}

We now derive an explicit expression for ${\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )$ . We begin with the decomposition ${\boldsymbol{k}}={\boldsymbol{k}}\cdot \hat {{\boldsymbol{w}}}\,\hat {{\boldsymbol{w}}}+{\boldsymbol{k}}\cdot (\overset{\leftrightarrow}{\textrm{I}}-\hat {{\boldsymbol{w}}}\,\hat {{\boldsymbol{w}}})$ . By choosing $\hat {{\boldsymbol{w}}} =\hat {{\boldsymbol{z}}}$ and using spherical coordinates $(k,\theta ,\phi )$ , then ${\boldsymbol{k}}\!\left (k,\theta ,\phi \right )\cdot {\boldsymbol{w}}=kw\;{\cos \theta },$ so that $\delta \!\left (\boldsymbol{k}\cdot{\boldsymbol{w}}\right )=\delta ({\cos \theta })/(kw),$ and ${\boldsymbol{k}}\!\left (k,\theta =\pi /2,\phi \right )=k\hat {{\boldsymbol \rho }},$ where $\hat {{\boldsymbol \rho }}={\cos \phi \hat {{\boldsymbol{x}}}+{\sin \phi \hat {{\boldsymbol{y}}}.}}$ Equation (2.146), therefore, becomes

(2.148)

\begin{align} \boldsymbol{Q}\!\left ({\boldsymbol{w}}\right )&=\int\nolimits {\frac {\mathrm{d}^3\textit{k}}{{\!\left (2\pi \right )}^3}{\Big\vert \tilde {\phi }\!\left (\textit{k}\right )\Big\vert }^2{\boldsymbol{kk}}\pi \delta \!\left ({\boldsymbol{k}}\cdot {\boldsymbol{w}}\right )}\nonumber \\[4pt]&=\int\nolimits ^{\infty }_0{\frac {k^2\mathrm{d}k}{{(2\pi )}^3}{\vert \tilde {\phi }(\textit{k})\vert }^2\frac {k^2}{kw}\int\nolimits ^{\pi}_0{\pi \delta ({\cos \theta ){\sin \theta \ \mathrm{d}\theta \int\nolimits ^{2\pi }_0{\hat {{\boldsymbol \rho }}\;\hat {{\boldsymbol \rho }}}\mathrm{d}\phi }}}}\nonumber \\[4pt]&= \left [\frac {1}{8\pi w}\int\nolimits ^{\infty }_0{k^3}{\big\vert \tilde {\phi }\!\left (\textit{k}\right )\big\vert }^2\mathrm{d}k\right ](\hat {{\boldsymbol{x}}}\hat {{\boldsymbol{x}}}{\mathbf +}\hat {{\boldsymbol{y}}}\hat {{\boldsymbol{y}}}{\mathbf )}\equiv \ \frac {Q}{w}\!\left (\overset{\leftrightarrow}{\textrm{I}}-\hat {{\boldsymbol{w}}}\,\hat {{\boldsymbol{w}}}\right ), \end{align}

where $\smallint\nolimits ^{2\pi }_0{\hat {{\boldsymbol \rho }}\;\hat {{\boldsymbol \rho }}}\;\text{d}\phi =\pi (\hat {{\boldsymbol{x}}}\;\hat {{\boldsymbol{x}}}{\mathbf +}\hat {{\boldsymbol{y}}}\;\hat {{\boldsymbol{y}}}{\mathbf )}$ .

An aside on the initial-value versus final-value problem: Consider the initial-value problem for a nonsingular f (t $\gt$ 0) given h ${}_{2}$ (t = 0). We note that we might instead wish to solve a final-value problem for f (t $\langle$ 0) given h ${}_{2}$ (t = 0). Then the only change would be that for the initial-value problem ${\boldsymbol \varGamma }\!\left ({\boldsymbol{p}};t\right )=\smallint {\textrm{d}^3\textit{p}'}\dots$ versus ${\boldsymbol \varGamma }\!\left ({\boldsymbol{p}};t\right )=-\smallint {\textrm{d}^3{\textit{p}}^{\prime} \dots }$ for the final-value problem.

At this point we have laid the groundwork for deriving an H-theorem for the Landau equation. Consider the expression for the entropy:

(2.149)

\begin{equation} S\!\left (t\right )\equiv -\int\nolimits {\textrm{d}^3\textit{p}\ f\!\left ({\boldsymbol{p}};t\right ){\text{ ln } f\!\left ({\boldsymbol{p}};t\right )}}. \end{equation}

Using (2.142) and (2.145), the time derivative of (2.149) is

(2.150)

\begin{align} \frac {\textrm{d}S}{\textrm{d}t}&=-\int\nolimits {\textrm{d}^3\textit{p}\ \left [\frac {\partial f}{\partial t}{\text{ ln } f+\ \frac {\partial f}{\partial t}}\right ]}-\int\nolimits {\textrm{d}^3\textit{p}\ \frac {\partial f}{\partial t}{\text{ ln } f}} =\int\nolimits {\textrm{d}^3\textit{p}\,{\text{ ln } f\frac {\partial \textrm {}}{\partial {\boldsymbol{p}}}\cdot \tilde {{\boldsymbol \varGamma }}}\nonumber}\\[4pt]&=-\int\nolimits {\textrm{d}^3\textit{p}\,{\frac {1}{f} \frac {\partial f\textrm {}}{\partial {\boldsymbol{p}}}\cdot \tilde {{\boldsymbol \varGamma }}}\!\left ({\boldsymbol{p}};t\right )}\nonumber \\[4pt]& =-\frac{1}{2}\int\nolimits {\textrm{d}^3\textit{p}}\int\nolimits \textrm{d}^3{\textit{p}}^{\prime} \left [\frac {1}{f\!\left ({\boldsymbol{p}}\right )}\frac {\partial \textrm {}f\!\left ({\boldsymbol{p}}\right )}{\partial {\boldsymbol{p}}}-\frac {1}{f\!\left ({{\boldsymbol{p}}}^{\prime} \right )}\frac {\partial \textrm {}f\!\left ({{\boldsymbol{p}}}^{\prime} \right )}{\partial {{\boldsymbol{p}}}^{\prime} }\right ]\nonumber \\[4pt]&\quad \cdot {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )\cdot \!\left (\frac {\partial}{\partial {{\boldsymbol{p}}}^{\prime} }-\frac {\partial}{\partial {\boldsymbol{p}}}\right )f\!\left ({\boldsymbol{p}};t\right )f\!\left ({{\boldsymbol{p}}}^{\prime} ;t\right )\nonumber \\[4pt]& =\frac{1}{2}\int\nolimits {\textrm{d}^3\textit{p}}\int\nolimits \textrm{d}^3{\textit{p}}^{\prime}\frac {1}{f\!\left ({\boldsymbol{p}}\right )f\!\left ({{\boldsymbol{p}}}^{\prime} \right )}\left [\!\left (\frac {\partial \textrm {}}{\partial {{\boldsymbol{p}}}^{\textrm{'}}}-\frac {\partial \textrm {}}{\partial {\boldsymbol{p}}}\right )f\!\left ({\boldsymbol{p}}\right )f\!\left ({{\boldsymbol{p}}}^{\prime} \right )\right ]\nonumber \\[4pt]&\quad \cdot {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )\cdot \left [\!\left (\frac {\partial}{\partial {{\boldsymbol{p}}}^{\prime} }-\frac {\partial}{\partial {\boldsymbol{p}}}\right )f\!\left ({\boldsymbol{p}};t\right )f\!\left ({{\boldsymbol{p}}}^{\prime} ;t\right )\right ]\nonumber \\[4pt]&=\frac{1}{2}\int\nolimits {\textrm{d}^3\textit{p}}\int\nolimits {\textrm{d}^3{\textit{p}}^{\prime} \frac {1}{f\!\left ({\boldsymbol{p}}\right )f\!\left ({{\boldsymbol{p}}}^{\prime} \right )}{\boldsymbol{a}}\cdot {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )\cdot }{\boldsymbol{a}} \ge 0, \end{align}

where

(2.151)

\begin{equation} {\boldsymbol{a}}\equiv \!\left (\frac {\partial}{\partial {{\boldsymbol{p}}}^{\prime} }-\frac {\partial}{\partial {\boldsymbol{p}}}\right )f\!\left ({\boldsymbol{p}};t\right )f\!\left ({{\boldsymbol{p}}}^{\prime} ;t\right ). \end{equation}

Equation (2.150) demonstrates the H-theorem for the Landau equation.

Theorem: We have

(2.152)

\begin{equation} \frac {\textrm{d}S}{\textrm{d}t}=0 \quad \textrm{iff}\quad {\boldsymbol{a}}={\boldsymbol{w}}g({\boldsymbol{p}},{{\boldsymbol{p}}}^{\prime} ) \end{equation}

where g is any smooth function of p and p^′.

Now instead of a as defined in (2.151) consider

(2.153)

\begin{equation} {\boldsymbol{a}}{\mathbf '}\equiv \!\left (\frac {\partial}{\partial {{\boldsymbol{p}}}^{\prime} }-\frac {\partial}{\partial {\boldsymbol{p}}}\right ){\text{ ln } \left [f\!\left ({\boldsymbol{p}};t\right )f\!\left ({{\boldsymbol{p}}}^{\prime} ;t\right )\right ]}, \end{equation}

thus a ^′ can be recast in the same form a ^′= ${\boldsymbol{w}}g({\boldsymbol{p}},{{\boldsymbol{p}}}^{\prime} )$ as in (2.152), i.e.,

(2.154)

\begin{equation} {{\boldsymbol{a}}}^{\prime} =({{{\boldsymbol v}}}-{{{\boldsymbol v}}}')g=\frac {\partial{\text{ln}f\!\left ({{\boldsymbol{p}}}^{\prime} ;t\right )}}{\partial {{\boldsymbol{p}}}^{\prime} }-\frac {\partial{\text{ln}f\!\left ({{\boldsymbol{p}}};t\right )}}{\partial {{\boldsymbol{p}}}}, \end{equation}

with which ${\textrm{d}S}/{\textrm{d}t}=0$ .

Theorem: The only solution of (2.152) has the form

(2.155)

\begin{equation} {\text{ ln } f\!\left ({\boldsymbol{p}}\right )=C_1+{{\boldsymbol{C}}}_2\cdot {\boldsymbol{p}}+C_3p^2}. \end{equation}

We can assign ${{\boldsymbol{C}}}_2$ = u , a mean drift of the velocity distribution, and invert (2.155) to obtain the solution for the velocity distribution $f$ that satisfies ${\textrm{d}S}/{\textrm{d}t}=0$ , i.e., the equilibrium distribution:

(2.156)

\begin{equation} f\!\left ({\boldsymbol{p}}\right )=\frac {n}{\sqrt {2\pi mT}}e^{-\beta \frac{1}{2}m{\left ({{{\boldsymbol v}}}-{\boldsymbol{u}}\right )}^2} \end{equation}

Equation (2.156) is the formula for a drifting Maxwellian distribution. We have $({\textrm{d}S}/{\textrm{d}t})\gt 0$ if f is not a Maxwellian, and f will relax to an asymptotic equilibrium that is a Maxwellian.

Definition: Define the kinetic energy

(2.157)

\begin{equation} \textrm {K}\!\left (t\right )\equiv \int\nolimits {\textrm{d}^3\textit{p}\ \frac {p^2}{2m}f({\boldsymbol{p}};t)}, \end{equation}

since the interaction energy is higher order.

Exercise: Using the Landau equation (2.141) show that the kinetic energy is conserved, i.e., $({\textrm{d}K}\!/{\textrm{d}t})=0.$

Definition: Define the momentum moment of f

(2.158)

\begin{equation} {\boldsymbol{g}}\!\left (t\right )\equiv \int\nolimits {\textrm{d}^3\textit{p}\,{\boldsymbol{p}}f({\boldsymbol{p}};t)}. \end{equation}

Exercise: Using the Landau equation show that $({\textrm{d}{\boldsymbol{g}}}/{\textrm{d}t})=0.$

2.4.3. Irreversibility

Here we present a discussion and precise definition of irreversibility.

Reversibility :

Definition (Reversibility): If ${{{\boldsymbol v}}} \to -{{{\boldsymbol v}}}$ and $t \to -t$ without changing the physics except for merely duplicating the trajectory of the process $\chi \!\left (t\right ) \to \chi \!\left (-t\right )$ , ending up with the initial conditions defines a reversible process.

From the perspective of the BBGKY hierarchy consider $\left (\left ({\boldsymbol{p}};t\right ),\, h_2({\boldsymbol{s}},{\boldsymbol{p}},{{\boldsymbol{p}}}^{\prime} ;t)\right )$ defined at t = 0. Solve the equations in § 2.2.1 to obtain $\!\left (f_1,h_2\right )$ at ${t = t}_{1 }\gt 0$ . Now instead introduce

(2.159)

\begin{equation} {\tilde {h}}_2({\boldsymbol{s}},{\boldsymbol{p}},{{\boldsymbol{p}}}^{\prime} ;t_1)\equiv h_2({\boldsymbol{s}},-{\boldsymbol{p}},-{{\boldsymbol{p}}}^{\prime} ;t_1) \end{equation}

and solve for $(f_1,{\tilde {h}}_2)$ for t $\gt$ t ${}_{1}$ up to $t\to t_2=2t_1$ which yields

(2.160)

\begin{equation} {\tilde {h}}_2({\boldsymbol{s}},{\boldsymbol{p}},{{\boldsymbol{p}}}^{\prime} ;t_2)\equiv h_2({\boldsymbol{s}},-{\boldsymbol{p}},-{{\boldsymbol{p}}}^{\prime} ;0)\quad \textrm {and}\quad f_1\!\left (t_2\right )=f_1\!\left (0\right ) \end{equation}

if the system is reversible. We have assumed weak coupling and no time ordering.

Alternatively we could integrate the kinetic equations forward in time, use the solutions at $t_1$ for initial conditions, then integrate backward in time to recover the initial conditions at t = 0 once again, if the system is reversible.

Irreversibility: Given $\!\left (f_1\!\left ({\boldsymbol{p}};t\right ), h_2({\boldsymbol{s}},{\boldsymbol{p}},{{\boldsymbol{p}}}^{\prime} ;t)\right )$ defined at t = 0, solve the kinetic equations for $\!\left (f_1,h_2\right )$ at t $\gt$ 0. Given the solutions, calculate the entropy (2.149): $S\!\left (t\right )\equiv -\smallint {\textrm{d}^3\textit{p}\ f\!\left ({\boldsymbol{p}};t\right ){\text{ ln } f\!\left ({\boldsymbol{p}};t\right )}}.$ The system is irreversible if for any t ${}_{1}$ , $S\!\left (t\right )$ is asymmetric about t ${}_{1}$ , i.e., $S\!\left (t\right )$ is growing for increasing t. In making these arguments, there must arise a distinction between the microscopic evolution of the system which includes fluctuations and the macroscopic evolution as dictated by a kinetic equation such as the Landau equation in which ensemble averages have smoothed over the microscopic fluctuations. The Landau macroscopic evolutionary equation has a discontinuity in slope at t = 0. Of course this is not a problem because the Landau equation only applies for t $\gt$ 0.

Example: Consider the simple one-dimensional diffusion equation as an example of an irreversible process:

(2.161)

\begin{equation} \frac {\partial}{\partial t}\rho \!\left (x;t\right )=D\frac {{\partial }^{2}}{\partial x^2}\rho \!\left (x;t\right ). \end{equation}

If $t\to -t$ , the left-hand side of (2.161) changes sign but the right-hand side does not. Given $\rho \!\left (x;0\right )$ we can find $\rho \!\left (x;t\right )$ for $t \gtrless 0$ by separating variables and Fourier analyzing:

(2.162a)

\begin{equation} \rho \!\left (x;t\right )\equiv \int\nolimits {\frac {\textrm{d}k}{2\pi }e^{ikx}\rho \!\left (k;t\right )}, \end{equation}

(2.162b)

\begin{equation} \frac {\partial}{\partial t}\rho \!\left (k;t\right )=-Dk^2\rho \!\left (k;t\right ), \end{equation}

(2.162c)

\begin{equation} \rho \!\left (k;t\right )=\rho \!\left (k;0\right )e^{-Dk^2t}, \end{equation}

(2.162d)

\begin{equation} \rho \!\left (x;t\right )=\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}k}{2\pi }}\rho \!\left (k;0\right )e^{ikx-Dk^2t}. \end{equation}

We see in (2.162c ) and (2.162d ) that the solution for $\rho$ decays for t $\gt$ 0 and blows up for $t\lt 0$ . Moreover, the integral over k in (2.162d ) does not exist for $t \lt 0$ as $k\to \pm \infty$ because the integral diverges; so there is no solution for $\rho$ for $t \lt 0$ .

Example: Suppose the initial condition for $\rho$ in the preceding example is $\rho \!\left (x;0\right )={1}/({\sqrt {2\pi {\sigma }^2}})e^{-({x^2}/{2{\sigma }^2})}$ . Then the solution of (2.162d ) is given by

(2.163)

\begin{equation} \rho \!\left (x;t\right )=\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}k}{2\pi }}\rho \!\left (k;0\right )e^{ikx-\frac{1}{2}{\sigma }^2k^2-Dk^2t}. \end{equation}

We observe that the integral in (2.163) converges as long as $-Dt\lt ({1}/{2}){\sigma }^2$ , i.e., there is a nonsingular solution for $\rho \!\left (x;t\right )\ \textrm {for a finite interval of negative times}$ . At $t=-({{\sigma }^2}/({2D}))$ we have a $\delta$ -function solution for $\rho .$ In terms of a Green’s function we find that

(2.164)

\begin{equation} \rho \!\left (x;t\right )=\int\nolimits {\textrm{d}x'\rho (x^{\prime} ;0)\frac {e^{-\frac {{\!\left (x-x'\right )}^2}{4DT}}}{\sqrt {4\pi Dt}}}. \end{equation}

For ${t} \lt 0$ , the $\sqrt {4\pi Dt}$ in the denominator is imaginary; and the exponential in the numerator blows up with large $\vert {x - x'}\vert$ . However, the integral in (2.164) may still converge for a finite interval of negative times if $\rho (x^{\prime} ;0)$ falls off with ${\vert x}^{\prime} \vert$ fast enough. That $\sqrt {4\pi Dt}$ is imaginary is not fatal for obtaining a solution for negative times in the interval where the integral in (2.164) converges.

Exercise: For the Gaussian initial condition used in obtaining (2.163) show that the Green’s function method in (2.164) can recover the same solution as in (2.163).

2.5. Markov processes and the Fokker–Planck equation

Definition: A Markov process has no memory.

[Editor’s Note: The definition in Wikipedia is “A Markov chain or Markov process is a stochastic model describing a sequence of possible events in which the probability of each event depends only on the state attained in the previous event.”]

Processes, random, stochastic, or otherwise, fall into a few categories. There are Markov and non-Markov processes. Within Markov processes there are continuous and discontinuous processes. An example of a continuous Markov process is Brownian motion with Gaussian statistics. Large-angle collisions described by the Boltzmann equation fall into the discontinuous Markov process category. The Landau equation can describe a continuous Markov process with non-Gaussian statistics. There are examples of generalized Brownian motion that are non-Markov processes. Processes can also be characterized as ergodic, stationary, Gaussian, and so on.

Suppose there is a random process with probability distribution:

(2.165)

\begin{equation} \rho \!\left (x_1,x_2,\dots ,x_n\right )=\rho \!\left (x_1,x_2,\dots ,x_n\right )\rho \!\left (x_n\vert x_{n-1},x_{n-2},\dots \right ), \end{equation}

where $x\!\left (t_i\right )\equiv x_i$ measured at successive times. A Markov process corresponds to the condition

(2.166)

\begin{equation} \rho \!\left (x_n\vert x_{n-1},x_{n-2},\dots \right )=\rho \!\left (x_n,x_{n-1}\right ). \end{equation}

A classical random walk is an example of a Markov process.

Definition: Define ${\Delta }_n\equiv x_n-x_{n-1}$ .

We note $\rho \!\left ({\Delta }_n\vert x_{n-1},x_{n-2},\dots \right )=\rho \!\left ({\Delta }_n\vert x_{n-1}\right )$ also defines a Markov process.

Consider

(2.167)

\begin{equation} \rho \!\left (x_n, x_{n-1},\dots ,x_0\right )= \rho \!\left (x_n\vert x_{n-1}\right )\rho \!\left (x_{n-1}\vert x_{n-2}\right )\dots \rho \!\left (x_3\vert x_2\right )\rho \!\left (x_2\vert x_1\right )\rho \!\left (x_1\vert x_0\right )\rho \!\left (x_0\right ). \end{equation}

Divide both sides of (2.167) by $\rho \!\left (x_0\right )$ to obtain

(2.168)

\begin{equation} \rho \!\left (x_n, x_{n-1},\dots \vert x_0\right )= \rho \!\left (x_n\vert x_{n-1}\right )\rho \!\left (x_{n-1}\vert x_{n-2}\right )\dots \rho \!\left (x_3\vert x_2\right )\rho \!\left (x_2\vert x_1\right )\rho \!\left (x_1\vert x_0\right ) \end{equation}

The Chapman–Kolmogorov equation is:

(2.169)

\begin{equation} \rho \!\left (x_2,x_1\vert x_0\right )= \rho \!\left (x_2\vert x_1\right )\rho \!\left (x_1\vert x_0\right ). \end{equation}

Equation (2.168) is the Chapman–Kolmogorov equation for any three times. We can integrate (2.169) $\smallint {\textrm{d}x_1}$ to obtain

(2.170)

\begin{equation} \rho \!\left (x_2\vert x_0\right )= \int\nolimits {\textrm{d}x_1}\rho \!\left (x_2\vert x_1\right )\rho \!\left (x_1\vert x_0\right ) \end{equation}

which is true only for Markov processes. More generally, $x_0$ would appear in $\rho \!\left (x_2,x_1\vert x_0\right ).$

Lemma: We have

(2.171)

\begin{equation} {\langle x_2\rangle }_{\vert x_0}\equiv \int\nolimits {\textrm{d}x_1}{\langle x_2\rangle }_{\vert x_1}\rho \!\left (x_1\vert x_0\right ) \end{equation}

Lemma: We recall the definition of the normalized correlation function $R(\tau )$ from (2.45) and use (2.170) and (2.171) to obtain

(2.172)

\begin{equation} {\langle x_n\rangle }_{\vert x_m}=x_m\ R\!\left (\vert t_n-t_m\vert \right ) \end{equation}

to represent the average value of $\langle$ x $\rangle$ at time $t_n$ following the precise value x ${}_{m}$ at $t_m$ , and

(2.173)

\begin{equation} x_0\ R\!\left (\vert t_2-t_0\vert \right )=\int\nolimits {\textrm{d}x_1}x_1\ R\!\left (\vert t_2-t_1\vert \right )\ \rho \!\left (x_1\vert x_0\right )=R\!\left (\vert t_2-t_1\vert \right ){\ \langle x_1\rangle }_{\vert x_0}. \end{equation}

For a stationary Gaussian process the correlation function can be built up in multiplicative pieces

(2.174)

\begin{equation} R\!\left (t_2-t_0\right )=R\!\left (t_2-t_1\right )R\!\left (t_1-t_0\right ). \end{equation}

For a stationary Gaussian Markov process the correlation function has the form

(2.175)

\begin{equation} R(\tau)=e^{-\lambda \vert \tau \vert } \end{equation}

to be consistent with (2.174), and with Gaussian statistics and stationarity.

2.5.1. Expansion of the Chapman–Kolmogorov equation to derive the Fokker–Planck equation

Consider a continuous Markov process. Initial conditions become implicit, and we change the notation. For the probability distribution as a function of x at time $t_n,$ $\rho \!\left (x;t_n\right )$ given $\rho \!\left (x;t_0\right )=\delta (x-x_0)$ is

(2.176)

\begin{equation} \rho \!\left (x_n;t_n\right )=\int\nolimits {\textrm{d}x_{n-1}\rho (x_n\vert x_{n-1})}\rho \!\left (x_{n-1};t_{n-1}\right )=\int\nolimits {\textrm{d}x'\rho (x\vert x')}\rho \!\left (x';t_{n-1}\right ). \end{equation}

Definition: We introduce the transition probability

(2.177)

\begin{equation} \psi \!\left (\Delta x\vert x-\Delta x;t-\Delta t,\Delta t\right )=\rho (x\leftarrow x^{\prime} ;t-\Delta t,\Delta t). \end{equation}

Then

(2.178)

\begin{align} \rho \!\left (x;t\right )&=\int\nolimits {\textrm{d}x^{\prime} \rho \!\left (x\leftarrow x^{\prime} ;t-\Delta t,\Delta t\right )}\rho \!\left (x^{\prime} ;t-\Delta t\right )\nonumber \\[4pt] &=\int\nolimits {\text{d}(\Delta x)\psi (\Delta x\vert x-\Delta x;t-\Delta t,\Delta t)}\rho \!\left (x-\Delta x;t-\Delta t\right )\!, \end{align}

where $\Delta x=x-x^{\prime} .$ Note that t is discrete and x is continuous. Assume $\Delta x$ , the step size, is small so we can Taylor-series expand:

(2.179)

\begin{equation} \rho \!\left (x;t\right )=\int\nolimits {\text{d}(\Delta x)}\sum\limits ^{\infty }_{\ell =0}{\frac {{\!\left (-\Delta x\right )}^{\ell }}{\ell !}\frac {{\partial }^{\ell }}{\partial x^{\ell }}}\ \psi \!\left (\Delta x\vert x;\;t-\Delta t,\Delta t\right )\rho \!\left (x;t-\Delta t\right )\!. \end{equation}

Now we change the notation by shifting t, $t\to t+\Delta t$ , so that

(2.180)

\begin{align} \rho \!\left (x;t+\Delta t\right )&=\int\nolimits {\text{d}(\Delta x)\ \psi \!\left (\Delta x\vert x-\Delta x;t,\Delta t\right )}\rho \!\left (x-\Delta x;t\right )\nonumber \\[4pt] &=\int\nolimits {\text{d}(\Delta x)}\sum\limits ^{\infty }_{\ell =0}{\frac {{\!\left (-\Delta x\right )}^{\ell }}{\ell !}\frac {{\partial }^{\ell }}{\partial x^{\ell }}}\ \psi \!\left (\Delta x\vert x;t,\Delta t\right )\ \rho \!\left (x;t\right )\!. \end{align}

If the series expansion in (2.180) is uniformly convergent, then we can commute the integration and series summation to obtain

(2.181)

\begin{align} \rho \!\left (x;t+\Delta t\right )&=\sum\limits ^{\infty }_{\ell =0}{\frac {{\!\left (-1\right )}^{\ell }}{\ell !}\frac {{\partial }^{\ell }}{\partial x^{\ell }}}\int\nolimits {\text{d}(\Delta x)\,{\!\left (\Delta x\right )}^{\ell }}\ \psi \!\left (\Delta x\vert x;t,\Delta t\right )\ \rho \!\left (x;t\right )\nonumber \\[4pt] &=\sum\limits ^{\infty }_{\ell =0}{\frac {{\!\left (-1\right )}^{\ell }}{\ell !}\frac {{\partial }^{\ell }}{\partial x^{\ell }}}\langle {\left (\Delta x\right )}^{\ell }\rangle (x;t,\Delta t)\ \rho \!\left (x;t\right ).\quad \end{align}

Next we subtract $\rho \!\left (x;t\right )$ , which is just the $\ell =0$ term on the right-hand side, from both sides of (2.181) and then divide both sides of the resulting equation by $\Delta t$ to obtain

(2.182)

\begin{align} \frac {\partial}{\partial t}\ \rho \!\left (x;t\right )&=\sum\limits ^{\infty }_{\ell =1}{{\!\left (-1\right )}^{\ell }\frac {{\partial }^{\ell }}{\partial x^{\ell }}}\left [{\mathop {\lim }_{\Delta t\to 0} \frac {\langle {\left (\Delta x\right )}^{\ell }\rangle (x;t,\Delta t)}{\ell !\ \Delta t}}\rho \!\left (x;t\right )\right ]\nonumber \\[4pt] & \equiv \sum\limits ^{\infty }_{\ell =1}{{\!\left (-1\right )}^{\ell }\frac {{\partial }^{\ell }}{\partial x^{\ell }}}\left [D^{\!\left (\ell \right )}(x,t)\rho \!\left (x;t\right )\right ]\nonumber \\[4pt] & = -\frac {{\partial }}{\partial x}\left [D^{\!\left (1\right )}\!\left (x,t\right )\rho \!\left (x;t\right )\right ]+\frac {{\partial }^2}{\partial {x^2}}\left [D^{\!\left (2\right )}\!\left (x,t\right )\rho \!\left (x;t\right )\right ]\nonumber\\[4pt] &\quad +\sum\limits ^{\infty }_{\ell =3}{{\!\left (-1\right )}^{\ell }\frac {{\partial }^{\ell }}{\partial x^{\ell }}}\left [D^{\!\left (\ell \right )}(x,t)\rho \!\left (x;t\right )\right ], \end{align}

(2.183)

\begin{equation} D^{\!\left (\ell \right )}(x,t)\equiv \,{\mathop {\lim }_{\Delta t\to 0} \frac {\langle {\left (\Delta x\right )}^{\ell }\rangle (x;t,\Delta t)}{\ell !\ \Delta t}}. \end{equation}

Equation (2.182) is the generalized Fokker–Planck equation. It is useful if we can truncate the equation after the first two terms on the right-hand side: $D^{(1)}, D^{(2)}\ne 0; D^{\!\left (\ell \right )}\equiv 0, \ell \gt 2.$ We then rewrite the truncated version of (2.182) in the conventional form.

Fokker–Planck Equation:

(2.184)

\begin{equation} \frac {\partial}{\partial t}\ \rho \!\left (x;t\right )=-\frac {{\partial }}{\partial x}\left [\frac {\langle \Delta x\rangle }{\Delta t}\rho \!\left (x;t\right )\right ]+\frac {{\partial }^2}{\partial {x^2}}\left [D\!\left (x,t\right )\rho \!\left (x;t\right )\right ] \end{equation}

The first term on the right-hand side of (2.184) is defined as the dynamic friction. In the following two examples we show how the Langevin equation model for Brownian motion and the Landau equation can lead to the Fokker–Planck equation.

[Reviewer Dominique Escande’s Comments: The passage to the derivative in (2.182) can be further discussed. See, for instance, Ryskin (Reference Ryskin1997) The generalized Fokker–Planck equation (2.182) is called Kramers–Moyal expansion, or van Kampen’s system-size expansion (see the corresponding Wikipedia articles for original references). Equation (2.184) is not a theorem by Fokker–Planck, but by Ryskin (Reference Ryskin1997). The Pawula theorem (Pawula Reference Pawula1967) might be referenced here, since it shows that the only truncation of the expansion, which ensures solutions to be physically meaningful (e.g., positive everywhere) is that to the second order.]

Example: Brownian motion. Consider the Langevin equation for a particle with unit mass (M = 1):

(2.185)

\begin{equation} \dot {{{{v}}}}=-\gamma {{{v}}}+\delta F. \end{equation}

Here the velocity ${{{v}}}(t)$ is the random variable of interest in the Langevin equation. Integrate in time over $\Delta t\ll {\gamma }^{-1}$ but $\Delta t\gg {\tau }_{\delta F}$ the characteristic time for fluctuations in the forces. Here ${\nu }_V\equiv \gamma$ using our previously introduced notation:

(2.186)

\begin{equation} \Delta {{{v}}}=-\gamma {{{v}}}\Delta t+\int\nolimits ^{\Delta t}_0{\textrm{d}t\ \delta F(t)}. \end{equation}

Taking the ensemble average over fluctuations in $\delta F$ , (2.186) becomes

(2.187)

\begin{equation} \langle \Delta {{{v}}}\rangle =-\gamma {{{v}}}\Delta t+\int\nolimits ^{\Delta t}_0{\textrm{d}t\ \langle \delta F\rangle (t)} \end{equation}

assuming there is no correlation between the velocity and the fluctuating force, i.e., $\langle \delta F\rangle =0.$ Hence,

(2.188)

\begin{equation} \langle \Delta {{{v}}}\rangle =-\gamma {{{v}}}\Delta t \end{equation}

and

(2.189)

\begin{equation} \frac {\partial}{\partial t}\ \rho \!\left ({{{v}}};t\right )=-\frac {{\partial }}{\partial {{{v}}}}\left [-\gamma {{{v}}}\rho \!\left (x;t\right )\right ]+\frac {{\partial }^2}{\partial {v^2}}\left [D\!\left ({{{v}}},t\right )\rho \!\left ({{{v}}};t\right )\right ]. \end{equation}

Now calculate the ensemble average of the square of (2.186):

(2.190)

\begin{equation} \langle {\left (\Delta {{{v}}}\right )}^2\rangle ={\gamma }^2{{{{v}}}}^2{\Delta t}^2+\int\nolimits ^{\Delta t}_0{\int\nolimits {\textrm{d}t}\textrm{d}t^{\prime} \langle \delta F\!\left (t\right )\delta F(t^{\prime})\rangle -2\gamma {{{v}}}\Delta t\int\nolimits ^{\Delta t}_0{\textrm{d}t\langle \delta F\rangle (t)}} \end{equation}

but $\langle \delta F\rangle =0$ and $\langle \delta F\!\left (t\right )\delta F(t^{\prime})\rangle =C_F(\vert t-t^{\prime} \vert )$ . With $\Delta t\gg {\tau }_{\delta F}$ , (2.190) becomes

(2.191)

\begin{equation} \langle {\left (\Delta {{{v}}}\right )}^2\rangle ={\gamma }^2{{{{v}}}}^2{\Delta t}^2+\Delta t\int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau \ C_F(\tau )}. \end{equation}

We divide (2.191) by $2\Delta t$ and take the limit as $\Delta t\to 0$ to obtain

(2.192)

\begin{equation} {\mathop {\lim }_{\Delta t\to 0\textrm {}} \frac {\langle {\left (\Delta {{{v}}}\right )}^2\rangle }{\textrm {}2\Delta t}= }\frac {{\gamma }^2{{{{v}}}}^2{\Delta t}}{2}+\frac{1}{2}\int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau \ C_F(\tau )}. \end{equation}

At this point we recall that $\Delta t\ll {\gamma }^{-1}$ and $\Delta t\gg {\tau }_{\delta F}$ which allows us to argue that the first term on the right-hand side of (2.192) is small compared with the second term and is negligible.

From (2.192) we conclude that

(2.193)

\begin{equation} {D_{{{{v}}}}\equiv \mathop {\lim }_{\Delta t\to 0\textrm {}} \frac {\langle {\left (\Delta {{{v}}}\right )}^2\rangle }{\textrm {}2\Delta t} }=\frac{1}{2}\int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau \ C_F(\tau )}. \end{equation}

From expressions we derived in § 2.2.1 for Brownian motion,

(2.194)

\begin{equation} D_{{{{v}}}}\equiv \tfrac{1}{2}\int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau \ C_F(\tau )}=S\!\left (\omega =0\right )=\gamma T, \end{equation}

which recovers the Einstein relation. Note that $\Delta t\to 0$ only on the slow time scale.

[Reviewer Dominique Escande’s Comment: Ryskin (Reference Ryskin1997) can be referenced with respect to $\Delta t\to 0$ only on the slow time scale.]

We can now identify terms in the Fokker–Planck equation (2.189):

(2.195)

\begin{equation} \frac {\partial}{\partial t}\ \rho \!\left ({{{v}}};t\right )=-\frac {{\partial }}{\partial {{{{v}}}}}\left [-\gamma {{{v}}}\rho \right ]+\frac {{\partial }^2}{\partial {{{{{v}}}}^2}}\left [\gamma T\rho \right ]=\gamma \frac {{\partial }}{\partial {{{{v}}}}}\!\left ({{{v}}}\rho +T\frac {{\partial }\rho }{\partial {{{{v}}}}}\right ). \end{equation}

This is a universal Fokker–Planck equation, a property of any one-dimensional Gaussian Markov process.

Example: Landau equation. In this example $x\to {\boldsymbol{p}}$ (t) and $\rho \to f$ . The Fokker–Planck equation is

(2.196)

\begin{equation} \frac {\partial}{\partial t}\ f\!\left ({\boldsymbol{p}};t\right )=-\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \left [{\mathop {\lim }_{\Delta t\to 0} \frac {\langle \Delta {\boldsymbol{p}}\rangle }{\Delta t}}f\!\left ({\boldsymbol{p}};t\right )-\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \!\left ({\boldsymbol{D}}\!\left ({\boldsymbol{p}},t\right )f\!\left ({\boldsymbol{p}};t\right )\right )\right ], \end{equation}

where

(2.197)

\begin{equation} {\boldsymbol{D}}\!\left ({\boldsymbol{p}},t\right )={\mathop {\lim }_{\Delta t\to 0} \frac {\langle \Delta {\boldsymbol{p}}\Delta {\boldsymbol{p}}\rangle }{2\Delta t}}. \end{equation}

The Landau equation asserts

(2.198)

\begin{equation} \frac {\partial}{\partial t}\ f\!\left ({\boldsymbol{p}};t\right )=-\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \tilde {{\boldsymbol \varGamma }}({\boldsymbol{p}};t). \end{equation}

Can we show that the right-hand side of (2.196) is equal to $-({{\partial }}/{\partial {{\boldsymbol{p}}}})\cdot \tilde {{\boldsymbol \varGamma }}({\boldsymbol{p}};t)$ ? In § 2.4.1 we derived (2.145)

(2.199)

\begin{equation} \tilde {{\boldsymbol \varGamma }}\!\left ({\boldsymbol{p}};t\right )={\int\nolimits {\textrm{d}^3\textit{p}^{\prime}}} {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )\cdot \!\left (\frac {\partial}{\partial {{\boldsymbol{p}}}^{\prime} }-\frac {\partial}{\partial {\boldsymbol{p}}}\right )f\!\left ({\boldsymbol{p}};t\right )f\!\left ({{\boldsymbol{p}}}^{\prime} ;t\right ),\quad {\boldsymbol{w}}\equiv {{{\boldsymbol v}}}-{{{\boldsymbol v}}}'.\end{equation}

Inside the square bracket in (2.196) the two terms can be expressed as

(2.200)

\begin{equation} {\mathop {\lim }_{\Delta t\to 0} \frac {\langle \Delta {\boldsymbol{p}}\rangle }{\Delta t}}f+f\frac {\partial}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{D}}-\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \!\left ({\boldsymbol{D}}f\right )-f\frac {\partial}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{D}}. \end{equation}

The first two terms in (2.200) can be identified with the ${\partial}/{\partial {{\boldsymbol{p}}}^{\prime} }$ on the right-hand side of (2.145) and the third and fourth terms in (2.200) can be identified with ${\partial}/{\partial {{\boldsymbol{p}}}}$ :

(2.201)

\begin{equation} {\boldsymbol{D}}\!\left ({\boldsymbol{p}},t\right )=\int\nolimits {\textrm{d}^3\textit{p}'}{\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )f\!\left ({{\boldsymbol{p}}}^{\prime} ;t\right )\!. \end{equation}

Definition: We have

(2.202)

\begin{equation} {\mathop {\lim }_{\Delta t\to 0} \frac {\langle \Delta {\boldsymbol{p}}\rangle }{\Delta t}}\equiv \langle {\boldsymbol{F}}\rangle \!\left ({\boldsymbol{p}};t\right )=2\frac {\partial}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{D}}({\boldsymbol{p}};t). \end{equation}

With the use of (2.200), (2.145), (2.201), and (2.202) the Fokker–Planck equation (2.196) is recovered. The Landau equation is a particular Fokker–Planck equation for a Markov process. On an appropriate time scale the transition probability $\psi \!\left (\Delta {\boldsymbol{p}}\vert {\boldsymbol{p}};t,\Delta t\right )$ has no dependence of the jump $\Delta {\boldsymbol{p}}$ on the past history. Weak coupling has been assumed, and the effects of large-angle collisions are neglected. The Boltzmann equation can accommodate large-angle scatters.

[Reviewer Dominique Escande’s Comment: The consequences of large-angle collisions are overlooked. The latter may have a large effect: see Shoub (Reference Shoub1987).]

To summarize, the Landau equation written in the form of the Fokker–Planck equation is

(2.203)

\begin{equation} \frac {\partial}{\partial t}\ f\!\left ({\boldsymbol{p}};t\right )=\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \left [-\langle {\boldsymbol{F}}\rangle \!\left ({\boldsymbol{p}};t\right )f+\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \!\left ({\boldsymbol{D}}\!\left ({\boldsymbol{p}},t\right )f\right )\right ], \end{equation}

where from (2.201),

(2.204)

For ${{{v}}}\ll \overline {{{{v}}}},\; {\boldsymbol{D}}\!\left ({\boldsymbol{p}},t\right )\to \boldsymbol{D}{\boldsymbol{I}}$ , i.e., the diffusion is isotropic in certain situations. The friction or drag term is

(2.205)

\begin{equation} \langle {\boldsymbol{F}}\rangle \!\left ({\boldsymbol{p}};t\right )=2\frac {\partial}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{D}}({\boldsymbol{p}};t). \end{equation}

From § 2.4.1

(2.206)

\begin{equation} {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )=\frac {Q}{w}({\boldsymbol{I}}-\hat {{\boldsymbol{w}}}\,\hat {{\boldsymbol{w}}}\textrm {)}, \hat {{\boldsymbol{w}}}\cdot {\boldsymbol{Q}}=0, \end{equation}

where from (2.148)

(2.207)

\begin{equation} Q= \frac {1}{8\pi }\int\nolimits {\textbf{d}k\ k^3{\vert \tilde {\phi }\!\left (k\right )\!\vert }^2}, {\boldsymbol{a}}\cdot {\boldsymbol{Q}}\cdot {\boldsymbol{a}}=\frac {Q}{w}(a^2-{\!\left ({\boldsymbol{a}}\cdot \hat {{\boldsymbol{w}}}\right )}^2)\ge 0. \end{equation}

Example: Consider $\phi \!\left (r\right )=\pm \phi _0e^{-({r^2}/{2a^2})}$ attractive or repulsive. Calculate the Fourier transform to obtain $\tilde {\phi }(k)=\pm {(\sqrt {2\pi }a)}^3\phi _0e^{-(k^2a^2/2)}$ , from which

(2.208)

\begin{align} Q &= \frac {1}{8\pi }\int\nolimits {\textrm{d}k\ k^3{\vert \tilde {\phi }\!\left (k\right )\!\vert }^2}=\frac {{(2\pi )}^3a^6}{8\pi }\phi ^2_0\int\nolimits ^{\infty }_0{k^3e^{-k^2a^2}\textrm{d}k}\nonumber \\[4pt] &={\pi}^2\phi ^2_0a^2\!\left (\frac{1}{2}\int\nolimits ^{\infty }_0{\textrm{d}t\ te^{-t}}\right )=\frac {{\pi}^2}{2}\phi ^2_0a^2. \end{align}

We can now find an expression for the Fokker–Planck equation:

(2.209)

\begin{align} \frac {\partial}{\partial t}\ f&=\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \left [-{\boldsymbol{F}}f+\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \!\left ({\boldsymbol{D}}f\right )\right ]=\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \left [-\frac{1}{2}{\boldsymbol{F}}f+{\boldsymbol{D}}\cdot \frac {{\partial }}{\partial {{\boldsymbol{p}}}}f\right ]\nonumber \\[4pt] &= -\frac{1}{2}\!\left (\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{F}}\right )f+{\boldsymbol{D}}{\mathbf :}\frac {{\partial }^{{\mathbf 2}}{\boldsymbol{f}}}{\partial {{\boldsymbol{p}}}\partial {{\boldsymbol{p}}}}=-\left [\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \!\left (\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{D}}\right )\right ]f+{\boldsymbol{D}}{\mathbf :}\frac {{\partial }^{{\mathbf 2}}{\boldsymbol{f}}}{\partial {{\boldsymbol{p}}}\partial {{\boldsymbol{p}}}},\quad \end{align}

where we have used the relation (valid for a single-species plasma)

(2.210)

\begin{equation} {\boldsymbol{F}}\!\left ({\boldsymbol{p}}\right )\equiv \int\nolimits {\textrm{d}^3p^{\prime} \left [\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )-\frac {{\partial }}{\partial {{\boldsymbol{p}}{\mathbf '}}}\cdot {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )\right ]f\!\left ({\boldsymbol{p}}'\right )=2}\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{D}}\!\left ({\boldsymbol{p}}\right ) \end{equation}

between the friction vector and the momentum diffusion (dyadic) tensor. Since (2.161) yields the definition

(2.211)

\begin{equation} {\boldsymbol{D}}\!\left ({\boldsymbol{p}}\right )=\int\nolimits {\textrm{d}^3\textit{p}'}{\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )f\!\left ({{\boldsymbol{p}}}^{\prime} \right )=Q\int\nolimits {\textrm{d}^3\textit{p}'}\!\left (\frac {\overset{\leftrightarrow}{\textrm{I}}}{w}-\frac {\hat {{\boldsymbol{w}}}\,\hat {{\boldsymbol{w}}}}{w^3}\right )f\!\left ({{\boldsymbol{p}}}^{\prime} \right ), \end{equation}

we obtain

(2.212)

\begin{equation} -\left [\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \!\left (\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{D}}\right )\right ]=-\int\nolimits {\textrm{d}^3\textit{p}'\left [\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \!\left (\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )\right )\right ]}f\!\left ({{\boldsymbol{p}}}^{\prime} \right ). \end{equation}

Next, using (2.148) and w = v -v ’, we first calculate (note that Q is a constant)

(2.213)

\begin{equation} \frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )=\frac {1}{m}\frac {{\partial }}{\partial {{{{\boldsymbol v}}}}}\cdot {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )=\frac {Q}{m}\frac {{\partial }}{\partial {{{{\boldsymbol v}}}}}\cdot \!\left (\frac {\overset{\leftrightarrow}{\textrm{I}}}{w}-\frac {\hat {{\boldsymbol{w}}}\,\hat {{\boldsymbol{w}}}}{w^3}\right )=-\frac {2Q}{m}\frac {\hat {{\boldsymbol{w}}}}{w^3}=\frac {2Q}{m}\frac {{\partial }}{\partial {{{{\boldsymbol v}}}}}\!\left (\frac {1}{w}\right ), \end{equation}

so that we obtain the friction vector from (2.205):

(2.214)

\begin{equation} {\boldsymbol{F}}=2\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{D}} =\frac {{\mathbf 2}}{{\boldsymbol{m}}}\int\nolimits {\textrm{d}^3{\textit{p}}^{\prime} \left [\frac {{\partial }}{\partial {{{\boldsymbol v}}}}\cdot {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )\right ]}f\!\left ({{\boldsymbol{p}}}^{\prime} \right )=-\frac {2Q}{m}\int\nolimits {\textrm{d}^3{\textit{p}}^{\prime} \left [\frac {{{{\boldsymbol v}}}-{{{\boldsymbol v}}}'}{{\vert {{\boldsymbol v}}}-{{{\boldsymbol v}}}'\vert ^3}\right ]}f\!\left ({{\boldsymbol{p}}}^{\prime} \right ) \end{equation}

which naturally satisfies Newton’s Third Law. Lastly, using the definition of the Dirac delta function, ${\nabla }^2{\vert {\boldsymbol{r}}-{\boldsymbol{r}}'\vert }^{-1}\equiv -4\pi {\delta }^3({\boldsymbol{r}}-{\boldsymbol{r}}{\mathbf '})$ , we find

(2.215)

\begin{equation} \frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \!\left (\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{Q}}\!\left ({\boldsymbol{w}}\right )\!\right )=\frac {2Q}{m^2}\frac {{\partial }}{\partial {{{\boldsymbol v}}}}\cdot \frac {{\partial }}{\partial {{{\boldsymbol v}}}}\!\left ({\vert {{{\boldsymbol v}}}-{{{{\boldsymbol v}}}}^{\prime} \vert }^{-1}\right )\equiv -\frac {8\pi Q}{m^2}{\delta }^3\!\left ({{{\boldsymbol v}}}-{{{{\boldsymbol v}}}}^{\prime} \right )=-8\pi Qm{\delta }^3\!\left ({\boldsymbol{p}}-{{\boldsymbol{p}}}^{\prime} \right )\!. \end{equation}

Hence, we conclude

(2.216)

\begin{equation} -\left [\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \!\left (\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot {\boldsymbol{D}}\right )\right ]=-\int\nolimits {\textrm{d}^3{\textit{p}}^{\prime} \left [-8\pi Qm{\delta }^3\!\left ({\boldsymbol{p}}-{{\boldsymbol{p}}}^{\prime} \right )\right ]}f\!\left ({{\boldsymbol{p}}}^{\prime} \right )=8\pi Qmf\!\left ({\boldsymbol{p}}\right ). \end{equation}

Example: Despite the long-range interactions in a plasma, assume the plasma is sufficiently tenuous so that weak coupling prevails. Debye shielding affects the interactions:

(2.217)

\begin{equation} \phi \!\left (s\right )=\left \{ \begin{array}{c} \pm \dfrac {e^2}{s_0}\ \ \ \ \ \ \ \ \ \ (s\le s_0), \\[4pt] \pm \dfrac {e^2}{s}e^{-\frac {s}{{\lambda }_D}}\ \ \ \ \ (s\gt s_0), \end{array} \right .\ \ \ \ \ \ {\lambda }_D=\sqrt {\frac {T}{4\pi n_ee^2}}, \end{equation}

where s ${}_{0}$ determines a cutoff of the potential at short distances such that $({e^2}/{s_0})\lesssim T$ and $\Lambda \equiv {{\lambda }_D}/{({e^2}/{T})}\gg$ 1 to be consistent with the weak coupling assumption. In this example, $Q=2\pi e^4{\text{ ln } \Lambda }$ ; ${\text{ ln } \Lambda }=$ 3–15. At distances ${s\gt \lambda }_D$ the plasma screens the potential, and the potential decays exponentially in ${s/\lambda }_D$ . Hence, ${\tau }_{\text{relax}}\sim 1/Q\sim 1/\phi ^2_0.$ Q only affects the time scale for the relaxation of the distribution function, not the form of the solution.

Example: Multispecies Landau equation. Having two or more species only appears in the interaction potential. The Fokker–Planck equation becomes

(2.218)

\begin{equation} \frac {\partial}{\partial t}\ f^s\!\left ({\boldsymbol{p}};t\right )=\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \left [-{\langle {\boldsymbol{F}}\rangle }^s\!\left ({\boldsymbol{p}};t\right )f^s+\frac {{\partial }}{\partial {{\boldsymbol{p}}}}\cdot \!\left ({{\boldsymbol{D}}}^{{\boldsymbol{s}}}\!\left ({\boldsymbol{p}},t\right )\ f^s\right )\right ], \end{equation}

where

(2.219)

\begin{equation} {{\boldsymbol{D}}}^{{\boldsymbol{s}}}\!\left ({\boldsymbol{p}},t\right )=\sum\limits _{s'}{\int\nolimits {\textrm{d}^3\textit{p}'}{{\boldsymbol{Q}}}^{{\boldsymbol{ss}}{\mathbf '}}\!\left ({\boldsymbol{w}}\right )f^{s'}\!\left ({\boldsymbol{p}},t\right )\textrm {, }{\boldsymbol{w}} = {{{\boldsymbol v}}}-{{{{\boldsymbol v}}}}^{\prime} = \frac {{\boldsymbol{p}}}{m_s}-\frac {{\boldsymbol{p}}'}{m_{s'}}} \end{equation}

and

(2.220)

\begin{equation} {\langle {\boldsymbol{F}}\rangle }^s\!\left ({\boldsymbol{p}};t\right )=2\frac {\partial}{\partial {{\boldsymbol{p}}}}\cdot {{\boldsymbol{D}}}^{{\boldsymbol{s}}}({\boldsymbol{p}};t), \end{equation}

where

(2.221)

\begin{equation} {{\boldsymbol{Q}}}^{{\boldsymbol{ss}}}\left ({\boldsymbol{w}}\right )=\frac {Q^{ss'}}{w}\!\left ({\boldsymbol{I}}-\hat {{\boldsymbol{w}}}\,\hat {{\boldsymbol{w}}}\right ). \end{equation}

2.5.2. Discontinuous Markov process and derivation of a master equation

We reconsider the Chapman–Kolmogorov equation in the context of discrete steps in x:

(2.222)

\begin{equation} \rho \!\left (x_n\vert x_0\right )=\int\nolimits {\textrm{d}x_{n-1}}\rho \!\left (x_n\vert x_{n-1}\right )\rho \!\left (x_{n-1}x_0\right ), \end{equation}

which we recast in the form

(2.223)

\begin{equation} \rho \!\left (x;t\vert x_0\right )=\int\nolimits {\textrm{d}x'}\rho \!\left (x;t\vert x^{\prime} ,t'\right )\rho \!\left (x^{\prime} ;t' \vert x_0\right ), \end{equation}

where x is discrete with index m and $t\to t+\Delta t$ . The Chapman–Kolmogorov equation becomes

(2.224)

\begin{equation} {\rho }_m\!\left (t+\Delta t\right )=\sum\limits _{m'}{{\rho }_{m\leftarrow m^{\prime} }(t,\Delta t){\rho }_{m^{\prime} }(t)} \end{equation}

and we have suppressed the initial condition $x_0$ in the notation. Define the transition probability ${\psi }_{mm'}\equiv {\rho }_{m\leftarrow m^{\prime} }(t,\Delta t)$ , $\Sigma _m{{\psi }_{mm'}}=1$ . Using the identity ${\rho }_m\!\left (t\right )=\Sigma _{m'}{{\psi }_{mm'}}{\rho }_{m'}\!\left (t\right )$ and (2.224)

(2.225)

\begin{align} {\rho }_m\!\left (t+\Delta t\right )-{\rho }_m\!\left (t\right )&=\sum\limits _{m^{\prime} }{\left [{\psi }^{\!\left (\Delta t\right )}_{mm^{\prime} }{\rho }_{m^{\prime} }\!\left (t\right )-{\psi }^{\!\left (\Delta t\right )}_{m^{\prime} m}{\rho }_m\!\left (t\right )\right ]}\nonumber \\[4pt] &=\sum\limits _{m^{\prime} \ne m}{\left [{\psi }^{\!\left (\Delta t\right )}_{mm^{\prime} }{\rho }_{m^{\prime} }\!\left (t\right )-{\psi }^{\!\left (\Delta t\right )}_{m^{\prime} m}{\rho }_m\!\left (t\right )\right ]} \end{align}

since the m = m ^′ term cancels. We next divide (2.225) by $\Delta t$ and take the limit $\Delta t\to 0$ :

(2.226)

\begin{align} {\mathop {\lim }_{\Delta t\to 0} \frac {{\rho }_m\!\left (t+\Delta t\right )-{\rho }_m\!\left (t\right )}{\Delta t}}&=\sum\limits _{m^{\prime} \ne m}{\left [\frac {{\psi }^{\!\left (\Delta t\right )}_{mm^{\prime} }}{\Delta t}{\rho }_{m^{\prime} }\!\left (t\right )-\frac {{\psi }^{\!\left (\Delta t\right )}_{m^{\prime} m}}{\Delta t}{\rho }_m\!\left (t\right )\right ]} \to \nonumber \\[4pt] \frac {\partial}{\partial t}{\rho }_m\!\left (t\right )&=\sum\limits _{m^{\prime} \ne m}{\left [a_{mm'}{\rho }_{m^{\prime} }\!\left (t\right )-a_{m'm}{\rho }_m\!\left (t\right )\right ]}\nonumber\\[4pt] &=\sum\limits _{m^{\prime} }{\left [a_{mm'}{\rho }_{m^{\prime} }\!\left (t\right )-a_{m'm}{\rho }_m\!\left (t\right )\right ]}, \end{align}

where $a_{mm'}\equiv {\mathop {\lim }_{\Delta t\to 0} ({{\psi }^{\!\left (\Delta t\right )}_{mm^{\prime} }}/{\Delta t})}$ the transition probability per unit time which is nonnegative. Thus, the rate of change of probability for a discrete state m is just a function of the present time, due to $\Delta t\ll {\tau }_{\text{evolution}}$ The process becomes explicitly Markovian. The master equation (2.226) has not been derived from first principles: the derivation has used the Chapman–Kolmogorov equation.

Definition: A master equation as in (2.226) is a set of first-order differential equations describing the time evolution of the probability of a system to occupy each one of a discrete set of states with regard to a continuous time variable t. Pauli, Tolman, and Van Hove are among those credited with presenting master equations.

Rate equation for probability:

(2.227)

\begin{equation} \frac {\partial}{\partial t}{\rho }_m\!\left (t\right )=\sum\limits _{m^{\prime} }{\left [a_{mm'}{\rho }_{m^{\prime} }\!\left (t\right )-a_{m'm}{\rho }_m\!\left (t\right )\right ]}. \end{equation}

Using the lemmas $\Sigma _m{{\rho }_m\!\left (t\right )=1}$ and $(\text{d}/{\textrm{d}t})\Sigma _m{{\rho }_m\!\left (t\right )=0}$ , and the definition for the entropy $S\!\left (t\right )=-\Sigma _m{{\rho }_m\!\left (t\right ){\text{ ln } {\rho }_m\!\left (t\right )}}$ , we calculate the time derivative of the entropy:

(2.228)

\begin{equation} \frac {\textrm{d}}{\textrm{d}t}S=\frac{1}{2}\sum\limits _{mm'}{\left ({\text{ln}\; {\rho }_m-{\text{ ln } {\rho }_{m'}}}\right )\!\left (a_{m'm}{\rho }_m-a_{mm'}{\rho }_{m'}\right )}. \end{equation}

At this point we must assume something about $a_{m'm}$ versus $a_{mm'}$ .

Postulate: Assume detailed balance (à la Boltzmann), but not microscopic reversibility:

(2.229)

\begin{equation} a_{m'm}=a_{mm'}. \end{equation}

Detailed balance means that the probability of the process has the same probability as does the inverse process. In some instances detailed balance does not hold, but states can be grouped into superstates where detailed balance does hold.

Using (2.229), (2.228) yields an ‘H-theorem’:

(2.230)

\begin{equation} \frac {\textrm{d}}{\textrm{d}t}S=\frac{1}{2}\sum\limits _{mm'}{a_{m'm}\!\left ({\text{ln } {\rho }_m-{\text{ ln } {\rho }_{m'}}}\right )\!\left ({\rho }_m-{\rho }_{m'}\right )}\ge 0. \end{equation}

Theorem: We have $(\text{d}/{\textrm{d}t})S=0$ if and only if ${\tilde {n}\tilde {n}}_m={\rho }_{m'}$ , i.e., all states are equally probable, which defines equilibrium. The concept of equal a priori probabilities is synonymous with equilibrium. Moreover, as $t\to \infty$ , $(\text{d}/{\textrm{d}t})S\to 0\,$ and ${\rho }_m\to {\rho }_{m'}$ .

2.6. Linear response theory, linear Boltzmann equation, and transport theory

2.6.1. Evolution of velocity angle probability distribution due to scattering

Consider a system of scatterers, e.g., neutrons being scattered by point scatterers or light being scattered. For specificity, consider a neutron in a uniform system of scatterers (at least statistically uniform).

Definition: Define a scattering direction $\boldsymbol \Omega$ , $\smallint \text{d}{\boldsymbol \Omega } =\textrm{4}\pi \textrm {,}$ and the velocity direction probability function $\rho ({\boldsymbol \Omega };\textit {t}{\mathbf )}$ . We assume that the magnitude of the velocity |v |is unaffected by the scattering off recoilless particles.

(2.231)

\begin{equation} \frac {\partial}{\partial t}\rho \!\left ({\boldsymbol \Omega };\textit {t}\right ) =\int\nolimits ^{{\boldsymbol {\Lambda}}}_{{\mathbf 4}{\boldsymbol \pi }}\text{d}\Omega {\mathbf '}\nu \textrm {(}{\boldsymbol \Omega }\leftrightarrow {{\boldsymbol \Omega }}^{\prime} \textrm {)}\left [\rho \!\left ({\boldsymbol \Omega }{\mathbf '};\textit {t}\right )-\rho ({\boldsymbol \Omega };\textit {t}{\mathbf )}\right ], \end{equation}

where $\nu$ is the probability per unit time for scattering from $\boldsymbol \Omega$ to ${\boldsymbol \Omega }{\mathbf '}$ or the reverse ${\boldsymbol \Omega }{\mathbf '}$ to $\boldsymbol \Omega$ : $\nu \sim n_0{{{v}}}\sigma \!\left (\Theta \right )$ where $\sigma$ is the differential cross-section through the angle $\Theta \textrm {}$ between scattering directions $\boldsymbol \Omega$ and ${\boldsymbol \Omega }{\mathbf '}$ , $n_0$ is the number density of scatterers, and v is the relative velocity between the neutron and the scatterer.

The differential cross-section can be decomposed into a series expansion separating its angular dependence from its dependence on speed:

(2.232)

\begin{equation} \sigma \!\left (\Theta \right )=\sum\limits ^{\infty }_{\ell =0}{P_{\ell }({\cos \Theta )\frac {2\ell +1}{4\pi }{\sigma }_{\ell }({{{v}}})}}, \end{equation}

where $P_{\ell }$ are Legendre polynomials. We decompose (2.231) into spherical harmonics and solve the linear equation to obtain

(2.233)

\begin{equation} \rho \!\left ({\boldsymbol \Omega };\textit {t}\right ) =\sum\limits_{\boldsymbol\ell ,{\boldsymbol{m}}}{Y^m_{\ell }}\textrm {(}{\boldsymbol \Omega }\textrm {)}{\rho }^m_{\ell }(t=0)e^{-\!\left ({\nu }_0-{\nu }_{\ell }\right )t}, \end{equation}

where ${\nu }_{\ell }\equiv n_0{{{v}}}{\sigma }_{\ell }$ . In going from (2.232) to (2.233) it is useful to employ the addition theorem for spherical harmonics:

(2.234)

\begin{equation} P_{\ell }{\textrm {(cos} \gamma )=\frac {4\pi }{2\ell +1}\sum\limits ^{m=\ell }_{m=-\ell }{Y^m_{\ell }(}}{\theta }_1, \phi _1)Y^{m*}_{\ell }({\theta }_2, \phi _2), \end{equation}

where ${\cos \gamma ={\cos {\theta }_1{\cos {\theta }_2+{\sin {\theta }_1}{\sin {\theta }_2}}}}\;{\cos{(\phi }_1-\phi _2}).$ We note that ${1}/{{\sigma }_{\ell \ne 0}}\gt {1}/{{\sigma }_0}$ and the $\ell =0$ term in right-hand side of (2.233) does not vanish as $t\to \infty$ .

Suppose that the scattering is isotropic, i.e., $\sigma \!\left (\Theta \right )={\sigma }_0$ and ${\sigma }_{\ell \ne 0}=0.$ In this case (2.233) becomes

(2.235)

\begin{equation} \rho \!\left ({\boldsymbol \Omega };\textit {t}\right ) =\sum\limits _{\boldsymbol\ell \ne 0,m}{Y^m_{\ell }}\!\left ({\boldsymbol \Omega }\right ){\rho }^m_{\ell }\!\left (t=0\right )e^{-{\nu }_0t}+{\rho }^0_0(t=0) \end{equation}

Thus, any initial anisotropy will decay away exponentially. The physical mechanism is that random scattering causes a loss of order and structure. These results require ${\sigma }_{\ell \ne 0}\lt {\sigma }_0$ to be physical; otherwise any initial anisotropy would grow. Equation (2.233) can be rewritten as

(2.236)

\begin{equation} \rho \!\left ({\boldsymbol \Omega };\textit {t}\right ) =\frac {\textrm {1}}{\textrm {4}\pi }\textrm {+}\sum\limits ^{\infty }_{\ell =1}{\sum\limits ^{m=\ell }_{m=-\ell }{Y^m_{\ell }}\!\left ({\boldsymbol \Omega }\right ){\rho }^m_{\ell }\!\left (0\right )e^{-{(\nu }_0-{\nu }_{\ell })t}} \end{equation}

with ${\nu }_{\ell }\lt {\nu }_0$ for all $\ell \gt 0$ , ${\nu }_{\ell }=n_0{{{v}}}{\sigma }_{\ell }$ and $\sigma \!\left (\Theta \right )$ given in (2.232).

2.6.2. Linear response fundamentals

Before discussing the analysis of the linear Boltzmann equation we first introduce some necessary definitions and properties associated with linear response functions. Consider the linear response $J(t)$ of a system due to an external agency $F(t)$ .

Postulate: Assume linearity, causality, and stationarity.

Given the postulated assumptions, quite generally one can write

(2.237)

\begin{equation} J\!\left (t\right )=\int\nolimits ^t_{-\infty }{\textrm{d}t^{\prime} R\!\left (t,t^{\prime} \right )F(t^{\prime})=\int\nolimits ^{\infty }_0{\textrm{d}\tau R\!\left (\tau =t-t'\right )F(t-\tau )}}. \end{equation}

Definition: The response or transfer function R satisfies

(2.238)

\begin{equation} R(\tau)=\left \{ \begin{array}{c} R(\tau)\, \, \, \, \, \, \, \tau \gt 0, \\[4pt] 0\, \, \, \, \, \, \, \, \, \, \, \, \, \tau \lt 0. \end{array} \right . \end{equation}

With (2.238), the integral in (2.237) can be extended:

(2.239)

\begin{equation} J\!\left (t\right )=\int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau R(\tau)F(t-\tau )}. \end{equation}

Definition: Define the Fourier transform $g\!\left (\omega \right )=\smallint\nolimits ^{\infty }_{-\infty }{\textrm{d}t\ g\!\left (t\right )e^{i\omega t}}.$

We use the convolution theorem and (2.239) to obtain

(2.240)

\begin{equation} J\!\left (\omega \right )=R\!\left (\omega \right )F(\omega ), \end{equation}

where $F\!\left (\omega \right )=\smallint\nolimits ^{\infty }_{-\infty }{\textrm{d}t\ F\!\left (t\right )e^{i\omega t}}$ with ${F(t)\vert }_{-\infty }=0$ (F is turned on at a finite time) and ${{F(t)\vert }^{\infty }}=\textrm {finite}$ ; $R\!\left (\omega \right )=\smallint\nolimits ^{\infty }_0{\textrm{d}t\ R\!\left (t\right )e^{i\omega t}}$ ; and $J\!\left (\omega \right )=\smallint\nolimits ^{\infty }_{-\infty }{\textrm{d}t\ J\!\left (t\right )e^{i\omega t}}$ = $\smallint\nolimits ^{\infty }_{-\infty }{\textrm{d}t\ J\!\left (t\right )e^{i(\Omega +i\gamma )t},}\ \gamma \gt 0$ for convergence.

We note that ( $\tau$ ) is a causal function and is analytic in the upper half of the complex $\omega$ plane, which follows from $R(\tau)=0$ for $\tau \lt 0$ ; and $R\!\left (-\omega \right )=R^*(\omega )$ for real $\omega$ .

Example 1: $R(\tau)=e^{-\nu \tau }$ $\to R\!\left (\omega \right )=1/(\nu-i\omega)$ which has a simple pole at $\omega =-i\nu .$

Example 2: $R(\tau)={\sin {\omega }_0\tau \to R\!\left (\omega \right )={{\omega }_0}/(\omega_0^2-\omega^2) \left (\omega \ne {\omega }_0\right ).}$

Example 3: $R(\tau$ )= $\smallint\nolimits ^{\infty }_{-\infty }\text{d}{{{v}}}e^{-\nu \!\left ({{{v}}}\right )\tau }g\!\left ({{{v}}}\right ), 0\le {\nu }_0\lt \nu\! \left ({{{v}}}\right )\lt {\nu }_1, \to R\!\left (\omega \right )=\smallint\nolimits ^{\infty }_{-\infty }\text{d}{{{v}}}({g\!\left ({{{v}}}\right )}/$ $(\nu(v)-i\omega)$ ) There is a branch cut for –i ${\nu }_0\lt \omega \lt {-i\nu }_1.$

If we evaluate $R\!\left (\omega \right )$ on the real $\omega$ axis we note that $R\!\left (\omega \right )=R^{\prime} \!\left (\omega \right )+iR^{\prime}(\omega )$ where $R^{\prime} \!\left (\omega \right )$ is an even function of $\omega$ and $iR^{\prime}(\omega )$ is an odd function. The Kramers–Kronig relations assert that $R^{\prime} \!\left (\omega \right )\ \textrm {and}\ \ R^{\prime}(\omega )$ are Hilbert transforms of one another.

Kramers–Kronig relations: $R^{\prime} \!\left (\omega \right )$ and R $"(\omega )$ satisfy

(2.241)

\begin{equation} R^{\prime} \!\left (\omega \right )=\frac {1}{\pi}p.v.\int\nolimits ^{\infty }_{-\infty }{\mathrm{d}\xi \frac {R^{\prime}(\xi )}{\xi -\omega , }}\ \ \ \ \ \ \ \ \ \ R^{\prime}\!\left (\omega \right )=-\frac {1}{\pi}p.v.\int\nolimits ^{\infty }_{-\infty }{\mathrm{d}\xi \frac {R'(\xi )}{\xi -\omega }}, \end{equation}

where $p.v.\smallint\nolimits ^{\infty }_{-\infty }{\textrm{d}\xi }$ is the Cauchy principal-value integral.

[Editor’s Addendum: The Kramers–Kronig relations are derived as an application of the Cauchy residue formula for a function $f\!\left (z\right )\equiv u\!\left (z\right )+iv(z)$ that is analytic in the upper half-plane ( $Im\ z \ge 0)$ . Under this assumption, the contour integral $\oint _C{f\!\left (z\right )\textrm{d}z/(z-\zeta )=0}$ vanishes for any real variable $\zeta$ along a closed contour C that is composed of four segments: two segments along the real axis (from –R to $\zeta -\epsilon$ and $\zeta +\epsilon$ to +R), a semicircular segment in the clockwise direction (from $\zeta -\epsilon$ to $\zeta +\epsilon$ ), and a semi-circular segment in the counter-clockwise direction (from +R to –R). In the limits R $\to \infty$ and $\epsilon \to 0$ , we therefore obtain

(2.242)

\begin{equation} 0=p.v.\int\nolimits ^{\infty }_{-\infty }{\textrm{d}x\ \frac {u\!\left (x\right )+iv(x)}{x-\zeta }}\ -i\pi \left [u\!\left (\zeta \right )+iv(\zeta )\right ], \end{equation}

where

(2.243)

\begin{equation} p.v.\int\nolimits ^{\infty }_{-\infty }{\textrm{d}x\ \frac {f\!\left (x\right )}{(x-\zeta )}\ \equiv \,{\mathop {\lim }_{\epsilon \to 0} \left [\int\nolimits ^{\infty }_{\zeta +\epsilon }{\textrm{d}x\ \frac {f\!\left (x\right )}{(x-\zeta )}\ +\ \int\nolimits ^{\zeta -\epsilon }_{-\infty }{\textrm{d}x\ \frac {f\!\left (x\right )}{(x-\zeta )}}}\right ]}} \end{equation}

denotes the Cauchy principal-value integral. Hence, we obtain the dual relations

(2.244)

\begin{align} u\!\left (\zeta \right )&= \frac {1}{\pi}\ p.v.\int\nolimits ^{\infty }_{-\infty }{\textrm{d}x\ \frac {v(x)}{x-\zeta }\ \equiv H[v](\zeta )}\quad \textrm {and}\quad v\!\left (\zeta \right )\nonumber\\[4pt]&= -\frac {1}{\pi}\ p.v.\int\nolimits ^{\infty }_{-\infty }{\textrm{d}x\ \frac {u(x)}{x-\zeta }\ \equiv -H[u](\zeta )}, \end{align}

which are expressed in terms of the Hilbert transform $H\left [f\right ]\!\left (\zeta \right )\ \equiv ({1}/{\pi})p.v.\ \smallint\nolimits ^{\infty }_{-\infty }\textrm{d}x $ ${f(x)/(x-\zeta ).}$ We note that these relations are completely general for the real and imaginary parts of an analytic function in the upper half-complex plane.]

Example: For atomic spectra, absorption can occur at a particular frequency:

(2.245)

\begin{equation} R^{\prime}\!\left (\omega \right )=\delta (\omega -{\omega }_0)-\delta (\omega +{\omega }_0),\quad R^{\prime} \!\left (\omega \right )=\frac {2}{\pi}\frac {{\omega }_0}{{\omega }^2_0-{\omega }^2}, \omega \ne {\omega }_0. \end{equation}

Thus, dissipation implies dispersion; and dispersion implies absorption.

Example: $R(\tau)={\sin ({\omega }_0\tau ) \to R\!\left (\omega \right )= }{{\omega }_0}/(\omega_0^2-\omega^2)$ which is wrong. The correct result is

(2.246)

\begin{equation} R\!\left (\omega \right )=\frac {{\omega }_0}{{\omega }^2_0-{\omega }^2}+i\frac {\pi}{2}\!\left (\delta (\omega -{\omega }_0)-\delta (\omega +{\omega }_0)\right ), \end{equation}

which we should have caught when we performed the Cauchy integral carefully. Thus, the Kramers–Kronig relations provide a valuable check.

Exercise: Include damping in a model response function R(t). Use Kramers–Kronig to determine R ^′′.

[Editor’s Solution: Consider the model response function which includes damping

(2.247)

\begin{equation} R\!\left (t\right )= \ A\;{\exp \!\left (-\nu t\right )}{\sin }(\Omega t+\alpha ), \end{equation}

$that\ is\, a\ solution\ (for\ t\gt 0)\ of\ the\ damped\ oscillator\ equation\ \ddot {R}(t)+2\nu \dot {R}(t)+$ ${\omega }^2_0R(t)=0,$ where ${\Omega }^2\equiv \,{\omega }^2_0-{\nu }^2\gt 0$ and the constants (A, $\alpha )$ are determined from the initial conditions R(0) and $\dot {R}\!\left (0\right )$ . The Fourier transform ${\mathcal R}\!\left (\omega \right )\equiv \smallint\nolimits ^{\infty }_0{\textrm{d}tR\!\left (t\right ){\exp \!\left (i\omega t\right )}}$ is expressed as a complex-valued function

(2.248)

\begin{equation} {\mathcal R}\!\left (\omega \right )= \frac {A}{2}\ \left [\frac {\textrm {exp}(i\alpha )}{(\omega +\Omega +i\nu )}-\frac {\textrm {exp}(-i\alpha )}{(\omega -\Omega +i\nu )}\right ]\equiv {{\mathcal R}}^{\prime} \!\left (\omega \right )+i{\mathcal R}"(\omega ), \end{equation}

which has poles in the lower half-complex $\omega$ plane at $\omega = \pm \Omega -i\nu$ , with ${{\mathcal R}}^{\prime} \!\left (\omega \right )\equiv \textrm {Re}[{\mathcal R}](\omega )$ and ${{\mathcal R}"}\!\left (\omega \right )\equiv \textrm {Im}[{\mathcal R}](\omega )$ for real $\omega$ . Hence, the real and imaginary parts of ${\mathcal R}\!\left (\omega \right )$ are guaranteed to be related by the Kramers–Kronig relations (2.241)

(2.249)

\begin{equation} R^{\prime} \!\left (\omega \right )=\frac {1}{\pi}p.v.\int\nolimits ^{\infty }_{-\infty }{\textrm{d}\xi \frac {R^{\prime\prime}(\xi )}{\xi -\omega }},\quad R^{\prime\prime}\!\left (\omega \right )=-\frac {1}{\pi}p.v.\int\nolimits ^{\infty }_{-\infty }{\textrm{d}\xi \frac {R'(\xi )}{\xi -\omega }}, \end{equation}

which hold for arbitrary constants (A, $\alpha ).$ For example, we consider the case (A, $\alpha )=\left (1,0\right ),$ which yields the double Lorentzian distributions (centered at $\pm {\omega }_0$ )

(2.250)

\begin{equation} {{\mathcal R}}^{\prime} \!\left (\omega \right )= \frac {\Omega ({\omega }^2_0-{\omega }^2)}{{\!\left ({\omega }^2-{\omega }^2_0\right )}^2+4{\nu }^2{\omega }^2} \end{equation}

(2.251)

\begin{equation} {\mathcal R}"\!\left (\omega \right )= \frac {\textrm {2}\Omega \omega \nu }{{\!\left ({\omega }^2-{\omega }^2_0\right )}^2+4{\nu }^2{\omega }^2} \end{equation}

where ${{\mathcal R}}^{\prime} \!\left (\omega \right )$ and ${\mathcal R}"\!\left (\omega \right )$ are even and odd functions of $\omega$ , respectively. At resonance $\omega =\pm {\omega }_0$ (for $\nu \ne 0)$ , we find ${{\mathcal R}}^{\prime} \!\left (\pm {\omega }_0\right )=0$ and ${\mathcal R}^{\prime}(\pm {\omega }_0)=\pm \Omega /(2\nu {\omega }_0$ ). Lastly, in the limit $\nu \to 0$ , we find

(2.252)

\begin{equation} R\!\left (\omega \right )=\frac {{\omega }_0}{{\omega }^2_0-{\omega }^2}+i\frac {\pi}{2}\!\left (\delta (\omega -{\omega }_0)-\delta (\omega +{\omega }_0)\right ) \end{equation}

which is given by (2.246).

2.6.3. Linear Boltzmann equation

Consider a system comprised of an electron gas with electron charge e, immersed in a gas of neutrals in which the electrons scatter. If there is an externally applied electric field E (t), the current in response to the electric field is

(2.253)

\begin{equation} {\boldsymbol{j}}\!\left (t\right )=\int\nolimits ^{\infty }_0{\textrm{d}\tau \overset{\leftrightarrow}{\sigma}(\tau )\cdot {\boldsymbol{E}}(t-\tau )} \end{equation}

and

(2.254)

\begin{equation} {\boldsymbol{j}}\!\left (\omega \right )=\overset{\leftrightarrow}{\sigma}(\omega )\cdot {\boldsymbol{E}}(\omega ) \end{equation}

We use the linear Boltzmann equation to describe the scattering of the electrons by the surrounding neutrals.

Postulate: Assume conditions such that ${\nu }_{e,\textrm{neutrals}}\gg {\nu }_{ee}$ with Debye shielding, i.e., the electron–neutral collisions are dominant.

Before linearization the scattering equation for the electron velocity probability distribution is

(2.255)

\begin{equation} \frac {\partial}{\partial t}\rho \!\left ({{{\boldsymbol v}}};\textit {t}\right ){\mathbf +}\dot {{{{\boldsymbol v}}}}\cdot \frac {\partial }{\partial {{{\boldsymbol v}}}}\rho \!\left ({{{\boldsymbol v}}};\textit {t}\right ) =\int\nolimits_{{\mathbf 4}{\boldsymbol \pi }}{\mathrm{d}}\Omega {\mathbf '}\nu \textrm {(}{\boldsymbol \Theta },{{{\boldsymbol v}}}\textrm {)}\left [\rho \!\left ({{{\boldsymbol v}}}{\mathbf '};\textit {t}\right )-\rho ({{{\boldsymbol v}}};\textit {t}{\mathbf )}\right ], \end{equation}

where $\dot {{{{\boldsymbol v}}}}=({e}/{m}){\boldsymbol{E}}$ and $\nu \sim n_0\sigma \!\left (\Theta ,{{{v}}}\right ){{{v}}}$ which depends on the particular neutral atoms. Equation (2.255) is not linear in $\rho ,\,{\boldsymbol{E}},$ etc. To justify linearization we require that $\boldsymbol{E}$ is weak and produces a small perturbation in $\rho$ :

(2.256)

\begin{equation} \rho \!\left ({{{\boldsymbol v}}};\textit {t}\right ) ={\rho }^{(0)}\!\left ({{{v}}}\right ){\mathbf +}\delta \rho \!\left ({{{\boldsymbol v}}};\textit {t}\right ), \delta \rho \ll {\rho }^{(0)}, \end{equation}

where ${\rho }^{(0)}\!\left ({{{v}}}\right )$ is isotropic and only depends on the electron speed, e.g., a Maxwellian. To first order (2.255) becomes

(2.257)

\begin{equation} \frac {\partial}{\partial t}\delta \rho \!\left ({{{\boldsymbol v}}};\textit {t}\right ){\mathbf +}\frac {e}{m}{}{\boldsymbol{E}}\cdot \frac {\partial}{\partial {{{\boldsymbol v}}}}{\rho }^0\!\left ({{{\boldsymbol v}}};\textit{t}\right ) =\int\nolimits _{4\pi }{\mathrm{d}}{\boldsymbol \Omega }{\mathbf '}\nu \textrm {(}{\boldsymbol \Theta },{{{\boldsymbol v}}}\textrm{)}\left [\delta \rho \!\left ({{{\boldsymbol v}}}{\mathbf '};\textit {t}\right )-\delta \rho ({{{\boldsymbol v}}};\textit {t}{\mathbf )}\right ]. \end{equation}

Equation (2.257) is a linear Boltzmann equation. Now solve the linear integrodifferential equation.

Definition: Introduce the notation

(2.258)

\begin{equation} \int\nolimits _{4\pi }\text{d}{{\boldsymbol \Omega }}^{\prime} \nu \!\left ({\boldsymbol \Theta },{{{\boldsymbol v}}}\right )\left [\delta \rho \!\left ({{{{\boldsymbol v}}}}^{\prime} ;\textit {t}\right )-\delta \rho \!\left ({{{\boldsymbol v}}};\textit {t}\right )\right ]\equiv -\tilde {\nu }\delta \rho ({{{\boldsymbol v}}};\textrm {t)}, \end{equation}

where $\tilde {\nu }$ is a positive-definite operator operating on $\delta \rho$ inside the integral on the right-hand side of (2.257). The operator $\tilde {\nu }$ is elaborated on in the rest of this section and, in particular, (2.266)–(2.269).

With the use of (2.258), (2.257) becomes

(2.259)

\begin{align} &-i\omega \delta \rho \!\left ({{{\boldsymbol v}}};\omega \right )+\frac {e}{m}{\boldsymbol{E}}\!\left (\omega \right )\cdot \frac {\partial {\rho }^{\!\left (0\right )}}{\partial {{{\boldsymbol v}}}}=-\tilde {\nu }\delta \rho \!\left ({{{\boldsymbol v}}};\omega \right ) \nonumber\\[4pt]&\qquad\qquad \to \!\left (\omega +i\tilde {\nu }\right )\delta \rho \!\left ({{{\boldsymbol v}}};\omega \right ) =-i\frac {e}{m}{\boldsymbol{E}}\!\left (\omega \right )\cdot \frac {\partial {\rho }^{\!\left (0\right )}}{\partial {{{\boldsymbol v}}}}. \end{align}

The current is related to the electron charge density and fluid velocity:

(2.260)

\begin{equation} {\boldsymbol{j}}\!\left (t\right )\equiv n_ee\langle {{{\boldsymbol v}}}\rangle \!\left (t\right )=en_e\int\nolimits {\textrm{d}^3{{{v}}}{{{\boldsymbol v}}}\!\left ({\rho }^{(0)}\!\left ({{{v}}}\right ){\mathbf +}\delta \rho \!\left ({{{\boldsymbol v}}};\textit{t}\right )\right )}, \end{equation}

where $\rho$ is normalized to unity, $n_e$ is the unperturbed electron density normalization factor, and only the $\delta \rho$ term on the right-hand side leads to a finite current. Using (2.259) and (2.260) the Fourier-transformed linearized current density is

(2.261)

\begin{equation} {\boldsymbol{j}}\!\left (\omega \right )=\left [\frac {n_ee^2}{m}(-i)\int\nolimits {\textrm{d}^3{{{v}}}}\frac {{{{\boldsymbol v}}}}{(\omega +i\tilde {\nu })}\frac {\partial {\rho }^{(0)}}{\partial {{{\boldsymbol v}}}}\right ]\cdot {\boldsymbol{E}}(\omega )=\sigma (\omega ){\boldsymbol{E}}(\omega ). \end{equation}

The quantity inside the square bracket in (2.261) is the conductivity tensor ${\boldsymbol \sigma }=\sigma \overset{\leftrightarrow}{\textrm{I}}$ which is isotropic due to the assumed isotropy of ${\rho }^{(0)}$ and has positive-definite eigenvalues:

(2.262)

\begin{equation} {\boldsymbol \sigma }\!\left (\omega \right )=-\left [\frac {n_ee^2}{m}\int\nolimits {\textrm{d}^3{{{v}}}}{{{\boldsymbol v}}}\frac {1}{(\tilde {\nu }-i\omega )}{{{\boldsymbol v}}}\frac {\partial {\rho }^{(0)}}{{{{v}}}\partial {{{v}}}}\right ]. \end{equation}

This is a universal result.

For $\nu \!\left ({{{v}}};\Theta \right )\equiv n_0{{{v}}}\sigma \!\left ({{{v}}};\Theta \right )$ (here $\sigma$ is the scattering cross-section, not to be confused with other definitions of $\sigma$ ) the linear Boltzmann equation derived from (2.231) is

(2.263)

\begin{equation} \tilde {\nu }f\textrm {(}{\boldsymbol \Omega };t\textrm {)} =\int\nolimits {\textrm{d}^2}\Omega {\mathbf '}\nu \textrm{(v};{\boldsymbol \Theta }\textrm {)}\left [f\!\left ({\boldsymbol \Omega };\textit {t}\right )-f({\boldsymbol \Omega }{\mathbf '};\textit{t}{\mathbf )}\right ]. \end{equation}

From the Landau and Boltzmann equations one can write a generic kinetic equation for the electron velocity distribution in a spatially uniform medium:

(2.264)

\begin{equation} \frac {\partial}{\partial t}f^e\!\left ({{{\boldsymbol v}}};t\right )+\frac {e}{m}{\boldsymbol{E}}\!\left (t\right )\cdot \frac {\partial}{\partial {{{\boldsymbol v}}}}f^e\!\left ({{{v}}};t\right )=C^{ee}\!\left (f^e,f^e\right )+C^{ei}\!\left (f^e,f^i\right ), \end{equation}

where $f^e\!\left ({{{\boldsymbol v}}};t\right )= f^{e(0)}\!\left ({{{\boldsymbol v}}}\right )+{\delta f}^e\!\left ({{{\boldsymbol v}}};t\right ).$ In (2.264) $f^e\!\left ({{{\boldsymbol v}}};t\right )=n^e\rho \!\left ({{{\boldsymbol v}}};t\right )$ , and the right-hand side can be linearized

(2.265)

\begin{equation} C^{ee}\!\left (f^e,f^e\right )+C^{ei}\!\left (f^e,f^i\right )=-{\tilde {\nu }}^{ee}\delta f^e-{\tilde {\nu }}^{ei}\delta f^e\equiv -\tilde {\nu }\delta f^e \end{equation}

so that (2.264) has the same form as the kinetic equation (2.257) after linearization.

The eigenfunctions of $\tilde {\nu }$ are $Y^m_{\ell }\textrm {(}{\boldsymbol \Omega }\textrm {)}$ :

(2.266)

\begin{equation} \nu \!\left ({{{v}}};\Theta \right )=\sum\limits ^{\infty }_{\ell =0}{P_{\ell }({\cos \Theta )\frac {2\ell +1}{4\pi }{\nu }_{\ell }\!\left ({{{v}}}\right ),\ \ \ \ \ \ \ \ \ }}\qquad\qquad\qquad\qquad \end{equation}

(2.267)

\begin{equation}\qquad\qquad {\nu }_{\ell }\!\left ({{{v}}}\right )\equiv \int\nolimits {\textrm{d}^2\Omega P_{\ell }({\cos \Theta )}}\ \nu \!\left ({{{v}}};\Theta \right )\le \int\nolimits {\textrm{d}^2\Omega }\nu \!\left ({{{v}}};\Theta \right )={\nu }_0\!\left ({{{v}}}\right ), \end{equation}

where ${\nu }_0\!\left ({{{v}}}\right )={\nu }\!\left ({{{v}}}\right )$ is the “total” rate, ${\nu }_0\!\left ({{{v}}}\right )=n_0\sigma \!\left ({{{v}}}\right )\textrm {v,}$ and we note that ${\nu }_{\ell }\gt 0.$ Using (2.263), (2.266), (2.267), and the addition theorem mentioned after (2.233), it can be shown that

(2.268)

\begin{equation} \tilde {\nu }Y^m_{\ell }\!\left ({\boldsymbol \Omega }\right )=({\nu }_0-{\nu }_{\ell })Y^m_{\ell }({\boldsymbol \Omega }\textrm {)}, \end{equation}

where on the left-hand side of (2.268) $\tilde {\nu }$ operates on $Y^m_{\ell }$ . Thus, ${\nu }_0-{\nu }_{\ell }$ is the positive eigenvalue of $\tilde {\nu }$ operating on the eigenvector $Y^m_{\ell }.$ Hence, any function of the operator F( $\tilde {\nu })$ operating on $Y^m_{\ell },\ F\textrm {(}\tilde {\nu })Y^m_{\ell}$ will yield $F\textrm {(}{\nu }_0-{\nu }_{\ell })Y^m_{\ell }$ . To illustrate the operator $\tilde {\nu },$ for $\ell =1$ these relations imply

(2.269)

\begin{equation} {\nu }_0-{\nu }_1\equiv \int\nolimits {\textrm{d}^2\Omega }\nu ({{{v}}};\Theta )(1-{\cos \theta )= }\int\nolimits {\textrm{d}^2\Omega {\ n}_0}{{{v}}}\sigma ({{{v}}};\Theta )(1-{\cos \theta )}, \end{equation}

where $\sigma \!\left ({{{v}}};\Theta \right )$ is the differential cross-section and ${\sigma }_{tr}({{{v}}})=\smallint {\textrm{d}^2\Omega }\sigma ({{{v}}};\Theta )$ $(1-\cos \theta )={(\nu }_0-{\nu }_1)/n_0{{{v}}}$ is the transport cross-section.

2.6.4. Collision models and conductivity

We can now apply (2.269) to the conductivity tensor in (2.262) for the $\ell =1,$ $m=\pm 1,0$ terms.

In the expression for the conductivity ${\!\left (\tilde {\nu }-i\omega \right )}^{-1}\to {\!\left ({\nu }_0-{\nu }_1-i\omega \right )}^{-1}$ and using (2.262) and (2.269) we obtain

(2.270)

\begin{equation} {\boldsymbol \sigma }\!\left (\omega \right )=\sigma \overset{\leftrightarrow}{\textrm{I}} =-\frac {n_ee^2}{m}\int\nolimits {\textrm{d}^3{{{v}}}}{{{\boldsymbol v}}}\frac {1}{({\nu }_0-{\nu }_1-i\omega )}{{{\boldsymbol v}}}\frac {\partial {\rho }^{(0)}}{{{{v}}}\partial {{{v}}}}. \end{equation}

With ${\rho }^{(0)}\sim e^{\beta mv^2/2}$ (2.270) becomes

(2.271)

\begin{equation} {\boldsymbol \sigma }\!\left (\omega \right )=\sigma \!\left (\omega \right )\overset{\leftrightarrow}{\textrm{I}}=\beta n_ee^2\int\nolimits {\textrm{d}^3{{{v}}}}{\boldsymbol{vv}}\frac {1}{({\nu }_{tr}-i\omega )}{\rho }^{(0)}. \end{equation}

Using the definition of the average over the velocity distribution and tr $\overset{\leftrightarrow}{\textrm{I}}$ =3, (2.271) becomes

(2.272)

\begin{equation} \sigma \!\left (\omega \right )=\frac {1}{3}\beta n_ee^2{\langle {{{{v}}}}^2{\!\left ({\nu }_{tr}({{{v}}})-i\omega \right )}^{-1}\rangle }_0. \end{equation}

Example: Consider a single electron in the presence of an atom. The atom feels the electric field of the electron ${\boldsymbol{E}}=({e}/{r^2})\hat {{\boldsymbol{r}}}$ which induces a dipole moment in the atom, ${\boldsymbol \Pi }=\alpha {\boldsymbol{E}}$ . The interaction energy of the atom’s dipole with the electron is $e\phi =e{\boldsymbol \Pi }\cdot \nabla {{\mathbf 1}}/{{\boldsymbol{r}}}\to \alpha ({e^2}/{r^4})$ . A model for the interaction of the electron with the dipole might be $\dot {{{{\boldsymbol v}}}}=-(\partial / \partial \textrm{r})(-(\alpha e^2/m)/r^4)=-{\nabla }_re\phi /m$ , with $\alpha \sim O\!\left (\textrm {volume}\right ).$ The interaction of an electron with an ion is $\dot {{{{\boldsymbol v}}}}=-({\partial}/{\partial {\boldsymbol{r}}})(-{({Ze^2}/{m})}/{r})$ . In general, the interaction of the electron with scattering center can be modeled as

(2.273)

\begin{equation} \dot {{{{\boldsymbol v}}}}=-\frac {\partial}{\partial {\boldsymbol{r}}}\!\left (\frac {a_s}{r^s}\right ), \end{equation}

where s = 4 for the dipole interaction and s = 1 for the interaction with the ion. The quantity a ${}_{s}$ has units $a_s\sim ({\text{velocity}\cdot L/{\text{time}}})L^s$ . The transport cross-section has units of L ${}^{2 }$ with functional dependence

(2.274)

\begin{equation} {\sigma }^{\!\left (s\right )}_{tr}\!\left ({{{v}}},a_s\right )=\frac {C_s{\vert a_s\vert }^q}{{{{{v}}}}^n}. \end{equation}

Hence, from (2.274) $L^2=({1}/{{\textrm {velocity}}^n}){\left ({\textrm {velocity}}^2L^s\right )}^q$ ; and 2q = n and 2 = sq from which we conclude that q = 2/s and n = 4/s:

(2.275)

\begin{equation} {\sigma }^{\!\left (s\right )}_{tr}\!\left ({{{v}}},a_s\right )=\frac {C_s{\vert a_s\vert }^{2/s}}{{{{{v}}}}^{4/s}}. \end{equation}

Example: For s = 1 and interaction of electrons with an ion,

(2.276)

\begin{equation} {\sigma }^{\!\left (1\right )}_{tr}=\frac {C_1}{v^4}{\left (\frac {Ze^2}{m}\right )}^2=C_1\frac {Z^2e^4}{m^2v^4}, \end{equation}

which we recognize as C ${}_{1 }$ multiplied by the Rutherford cross-section for the Coulomb interaction.

Exercise: Derive the expression for ${\sigma }_{tr}$ for a Debye-shielded electron–ion interaction based on the analysis in § 1.4.6

Example: Electron interaction with a neutral atom.

From the literature we find the transport cross-section

(2.277)

\begin{equation} {\sigma }_{tr}\!\left ({{{v}}}\right )=\frac {c_4}{{{{v}}}}{\left (\frac {\alpha e^2}{m}\right )}^{\frac{1}{2}}, \ c_4=1.1052, \end{equation}

and $\alpha$ is the polarizability. From (2.277) the scattering rate is

(2.278)

\begin{equation} {\nu }_{tr}\!\left ({{{v}}}\right )=n_0{\sigma }_{tr}\!\left ({{{v}}}\right ){{{v}}}=n_0c_4{\left (\frac {\alpha e^2}{m}\right )}^{\frac{1}{2}}, \end{equation}

which is independent of speed. From (2.272) we solve for the conductivity

(2.279)

\begin{equation} \sigma \!\left (\omega \right )=\frac {n_ee^2/m}{{\nu }_{tr}-i\omega }=\frac {{\omega }^2_{pe}}{4\pi ({\nu }_{tr}-i\omega )} \end{equation}

and there is a pole at $\omega =-i{\nu }_{tr}$ . After Fourier transforming,

(2.280)

\begin{equation} \sigma (\tau)=\frac {n_ee^2}{m}e^{-{\nu }_{tr}\tau }, {\boldsymbol{j}}\!\left (t\right )=\int\nolimits ^{\infty }_0{\textrm{d}\tau }\sigma (\tau){\boldsymbol{E}}(t-\tau ). \end{equation}

Exercise: Sketch ${\sigma }^{\prime} \!\left (\omega \right )=\textrm {Re}$ $({{\omega }^2_{pe}}/(4\pi (\nu_{tr}-i\omega)))$ and $\sigma^{\prime\prime}\!\left (\omega \right )=\textrm {Im}\ ({{\omega }^2_{pe}}/(4\pi (\nu_{tr}-i\omega)))$ using the estimate

(2.281)

\begin{align} \sigma \!\left (\omega =0\right )&=\frac {n_e}{n_0}{\left (\frac {e^2}{m}\right )}^{1/2}\frac {1}{{\alpha }^{1/2}c_4}\sim {10}^{16}\frac {n_e}{n_0}\,{\textrm {s}}^{-1},\, \, \, \, n_0\sim {10}^{19}\,\textrm {cm}^{-3},\nonumber\\[4pt] &\qquad\qquad {\nu }_{tr}\sim n_0{10}^{-8}\,{\textrm {cm}}^3\,{\textrm {s}}^{-1}\sim {10}^{11}\,{\textrm {s}}^{-1}. \end{align}

Using Maxwell’s equation (electrostatic limit) $({4\pi {\boldsymbol{j}}}/{c})+({1}/{c})({\partial {\boldsymbol{E}}}/{\partial t})=0$ and ${\boldsymbol{j}}\!\left (\omega \right )=\overset{\leftrightarrow}{\sigma}(\omega )\cdot {\boldsymbol{E}}\!\left (\omega \right )$ , we obtain the dispersion relation for electron plasma waves with collisional damping:

(2.282a)

\begin{equation} \omega \!\left (\omega +i{\nu }_{tr}\right )=\frac {4\pi n_ee^2}{m}\equiv {\omega }^2_{pe},\qquad\qquad\;\, \end{equation}

(2.282b)

\begin{equation} \qquad\quad \omega ={\omega }^{\prime} +i\omega^{\prime\prime}\sim {\omega }_{pe}-i\frac {{\nu }_{tr}}{2}. \end{equation}

The generalization of (2.280) for $\sigma (\tau)$ when ${\nu }_{tr}({{{v}}})$ is a function of v is

(2.283)

\begin{equation} \sigma (\tau)=\frac {n_ee^2}{m}{\frac {\langle e^{-{\nu }_{tr}({{{v}}})\tau }{{{{v}}}}^2\rangle }{\langle {{{{v}}}}^2\rangle }}. \end{equation}

Example: Useful formulas for a linear Boltzmann model of a plasma include

(2.284)

\begin{equation} {\sigma }^{\prime} \!\left (\omega =0\right )=\frac {8}{\sqrt {\pi}}\frac {n_ee^2}{m\overline {\nu }},\; \textrm{where}\;\,\,\overline {\nu }\equiv \frac {n_0}{{\!\left (2mT^3\right )}^{1/3}}Q \quad\textrm{and}\quad Q\cong 2\pi e^4{\text{ ln } \Lambda }, \end{equation}

where ${\sigma }^{\prime} \!\left (\omega =0\right )$ is the dc plasma conductivity. The linear Boltzmann model for plasma collisions is equivalent to a Lorentz model. A linear Landau equation plasma model yields ${\sigma '}_\textrm{Landau}\!\left (\omega =0\right )=1.98{\sigma '}_{\text{linear}\ \text{Boltz}.}\!\left (\omega =0\right )$ .

2.6.5. Linear response theory and Kubo formulae

Green (Reference Green1954) and Kubo (Reference Kubo1957) derived relations that give exact mathematical expressions for transport coefficients in terms of integrals of time correlation functions. These relations lead to a powerful fluctuation–dissipation theorem.

We posit a system that can be described with a Hamiltonian and obeys the Liouville equation. We further assume that the system possesses a small parameter that allows one to expand the Hamiltonian and the probability distribution to first order in the time-dependent perturbations:

(2.285)

\begin{equation} H=H_0({\boldsymbol \varGamma })+\delta H\!\left ({\boldsymbol\varGamma};t\right ),\quad {\boldsymbol \varGamma }=\left ({{\boldsymbol{p}}}_i,{{\boldsymbol{q}}}_{{\boldsymbol{i}}}\right ). \end{equation}

The system being Hamiltonian can be represented by the Liouville equation. For

(2.286)

\begin{equation} \rho \!\left ({\boldsymbol\varGamma};t\right )={\rho }_0({\boldsymbol \varGamma })\textrm {+}\delta \rho \textrm{(}{\boldsymbol\varGamma};t), \end{equation}

the Liouville equation is

(2.287)

\begin{equation} \frac {\textrm{d}\rho }{\textrm{d}t}=\frac {\partial \rho }{\partial t}+\left \{\rho ,H\right \}=0, \end{equation}

where {f,g} is the Poisson bracket. The equilibrium distribution ${\rho }_0$ is time independent and satisfies

(2.288)

\begin{equation} \left \{{\rho }_0,H_0\right \}=0 \end{equation}

and the ergodic hypothesis applies:

(2.289)

\begin{equation} {\rho }_0\!\left (H_0\right )=\frac {e^{-\beta H_0(\varGamma )}}{Z}. \end{equation}

The linearized version of (2.287) is

(2.290)

\begin{equation} \frac {\mathrm{d}\delta \rho }{\textrm{d}t}=\frac {\partial \delta \rho }{\partial t}+\left \{\delta \rho ,H_0\right \}+\left \{{\rho }_0,\delta H\right \}=0. \end{equation}

Definitions: We make the following definitions:

(2.291a)

\begin{equation} {{\mathcal L}}_0\ =\left \{\ ,H_0\right \}\equiv {\dot {q}}^{\!\left (0\right )}\frac {\partial}{\partial q}+{\dot {p}}^{\!\left (0\right )}\frac {\partial}{\partial p}, \end{equation}

(2.291b)

\begin{equation} \delta {{\mathcal L}(t)}\ \equiv \left \{\ ,{\delta H(t)}\right \}\equiv {\delta \dot {q}}\frac {\partial}{\partial q}+\delta {\dot {p}}\frac {\partial}{\partial p}. \end{equation}

Equation (2.290) can be integrated by introducing an integrating factor as

(2.292)

\begin{equation} e^{-t{{\mathcal L}}_0}\frac {\textrm{d}}{\textrm{d}t}\!\left (e^{t{{\mathcal L}}_0}\delta \rho \!\left (t\right )\right )=-\ \delta {{\mathcal L}(t){\rho }_0}, \end{equation}

which can be integrated from $-\infty$ to t:

(2.293)

\begin{equation} \delta \rho (t^{\prime}){e^{t'{{\mathcal L}}_0}\vert }^{t^{\prime} =t}_{t^{\prime} =-\infty }=\delta \rho \!\left (t\right )e^{\ t{{\mathcal L}}_0}=-\int\nolimits ^t_{-\infty }{\textrm{d}t'e^{-t'{{\mathcal L}}_0}}\delta {{\mathcal L}(t^{\prime}){\rho }_0}\quad \end{equation}

(2.294)

\begin{equation} \quad=-\int\nolimits ^{\infty }_0{\textrm{d}\tau e^{-(t-\tau {){\mathcal L}}_0}}\delta {{\mathcal L}(t-\tau ){\rho }_0},\;\; \end{equation}

where the last expression is obtained with the substitution $t^{\prime} =t-\tau$ . Hence, the solution for $\delta \rho \!\left (t\right )$ is given by

(2.295)

\begin{equation} \delta \rho \!\left (\varGamma ,t\right )=-\int\nolimits ^{\infty }_0{\textrm{d}\tau e^{-\tau {{\mathcal L}}_0}}\delta {{\mathcal L}(t-\tau ){\rho }_0({\boldsymbol \varGamma })}. \end{equation}

Lemma: Using (2.291a ), Hamilton’s equations of motion and (2.289),

(2.296)

\begin{equation} \delta {{\mathcal L}(t)}{\rho }_0=-\beta {\rho }_0\delta {{\mathcal L}\!\left (t\right )H_0=-\beta {\rho }_0}\left \{H_0,{\delta H\!\left (t\right )}\right \}=\beta {\rho }_0\left \{{\delta H\!\left (t\right ),H}_0\right \}=\beta {\rho }_0{{\mathcal L}}_0\ \delta H\!\left (t\right ). \end{equation}

Hence, $\delta {{\mathcal L}\!\left (t\right )}{\rho }_0= \beta {\rho }_0{{\mathcal L}}_0\ \delta H\!\left (t\right ).$

Given (2.296), $\delta {{\mathcal L}\!\left (t-\tau \right )}{\rho }_0({\boldsymbol \varGamma })= \beta {\rho }_0{({\boldsymbol \varGamma }){\mathcal L}}_0\ \delta H\!\left (t-\tau \right )$ , (2.295) is equivalent to

(2.297)

\begin{equation} \delta \rho \!\left ({\boldsymbol \varGamma },t\right )=-\beta {\rho }_0{({\boldsymbol \varGamma })}\int\nolimits ^{\infty }_0{\textrm{d}\tau e^{-\tau {{\mathcal L}}_0}}{{{\mathcal L}}_0\delta H\!\left (t-\tau \right )}. \end{equation}

We assume that the linear perturbation to the Hamiltonian can be expressed as a sum over terms that can be decomposed into products of spatial and temporal phase factors:

(2.298)

\begin{equation} \delta H\!\left ({\boldsymbol \varGamma },t\right )\equiv -\sum\limits _{\mu }{A_{\mu }({\boldsymbol \varGamma })\delta F_{\mu }(t)}. \end{equation}

Example: Electrons subject to externally imposed fields

(2.299)

\begin{equation} \delta H=\int\nolimits {\textrm{d}^3x\left \{\frac {1}{c}{\boldsymbol{j}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )\cdot \delta {{\boldsymbol{A}}}^{\text{ext}}\!\left ({\boldsymbol{x}},t\right )-{\rho }_{\text{elec}}({\boldsymbol{x}},t)\delta \phi ^{\text{ext}}({\boldsymbol{x}},t)\right \}}. \end{equation}

Here $\boldsymbol{x}$ denotes the field position in configuration space, distinct from $\boldsymbol \varGamma$ which is the phase space of all the particle positions and momenta. This is the classical perturbed Hamiltonian for electromagnetic forces. Here

(2.300)

\begin{equation} {\rho }_{\text{elec}}=\sum\limits _i{e_i\delta ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i)},\quad \boldsymbol{j}=\sum\limits _i{e_i{{{{\boldsymbol v}}}}_i\delta ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i)}. \end{equation}

Using (2.298) in (2.297) the perturbed probability distribution becomes

(2.301)

\begin{equation} \delta \rho \!\left ({\boldsymbol \varGamma },t\right )=\beta {\rho }_0{({\boldsymbol \varGamma })}\sum\limits _{\mu }{\int\nolimits ^{\infty }_0{\textrm{d}\tau \delta F_{\mu }(t-\tau )e^{-\tau {{\mathcal L}}_0}}{{{\mathcal L}}_0A_{\mu }({\boldsymbol \varGamma })}}. \end{equation}

Since $A_{\mu }(\varGamma \textrm {)}$ has no explicit time dependence, its total time derivative along the zero order particle trajectory is given by

(2.302)

\begin{equation} \frac {\textrm{d}^{\!\left (0\right )}A_{\mu }(\varGamma \textrm {)}}{\textrm{d}t}=\frac {\partial A_{\mu }}{\partial q}{\dot {q}}^{(0)}+\frac {\partial A_{\mu }}{\partial p}{\dot {p}}^{(0)}=\left \{A_{\mu },H_0\right \}\equiv {\;\dot{\!A}}_{\mu }({\boldsymbol \varGamma }). \end{equation}

Example: $A_{\mu }={{\boldsymbol{r}}}_i$ , ${\dot {{\boldsymbol{r}}}}_i=\left \{{{\boldsymbol{r}}}_i,H_0\right \}\equiv {{{{\boldsymbol v}}}}_i$ .

We introduce

(2.303)

\begin{equation} B\!\left ({\boldsymbol \varGamma },t\right )\equiv e^{t{{\mathcal L}}_0}B({\boldsymbol \varGamma })=\sum\limits ^{\infty }_{n=0}{\frac {{\!\left (t{{\mathcal L}}_0\right )}^n}{n!}B({\boldsymbol \varGamma })}. \end{equation}

Example: $B\!\left ({\boldsymbol \varGamma },t\right )={{\boldsymbol{r}}}_i$ Hence, ${{\boldsymbol{r}}}_i(t)$ is the position ${{\boldsymbol{r}}}_i(t)$ at time t given ${{\boldsymbol{r}}}_i(0)$ at t = 0.

Corollary: $e^{-\tau {{\mathcal L}}_0}{{\mathcal L}}_0A_{\mu }({\boldsymbol \varGamma }) = e^{-\tau {{\mathcal L}}_0}{\;\dot{\!A}}_{\mu }$ ( ${\boldsymbol \varGamma })$ = ${\;\dot{\!A}}_{\mu }$ $({\boldsymbol \varGamma },-\tau )$ and thus (2.301) becomes

(2.304)

\begin{equation} \delta \rho \!\left ({\boldsymbol \varGamma },t\right )=\beta {\rho }_0{({\boldsymbol \varGamma })}\sum\limits _{\mu }{\int\nolimits ^{\infty }_0{\textrm{d}\tau \delta F_{\mu }(t-\tau )}}{\;\dot{\!A}}_{\mu }({\boldsymbol \varGamma },-\tau ). \end{equation}

Consider a set $\left \{B_{\nu }({\boldsymbol \varGamma })\right \}$ from which we calculate an average value at a given time:

(2.305)

\begin{equation} \langle B_{\nu }\rangle (t)\equiv \int\nolimits {\textrm{d}\varGamma B_{\nu }({\boldsymbol \varGamma })\rho \!\left ({\boldsymbol \varGamma },t\right )=\int\nolimits {\textrm{d}\varGamma B_{\nu }({\boldsymbol \varGamma })({\rho }_0+\delta \rho )}} \end{equation}

and

(2.306)

\begin{eqnarray} \delta \langle B_{\nu }\rangle (t)&&\equiv \int\nolimits {\textrm{d}\varGamma B_{\nu }({\boldsymbol \varGamma })\delta \rho \!\left ({\boldsymbol \varGamma },t\right )}\nonumber \\[4pt]&&=\beta \sum\limits _{\mu }{\int\nolimits ^{\infty }_0{\textrm{d}\tau \delta F_{\mu }(t-\tau )}}\int\nolimits {\textrm{d}\varGamma }{{\rho }_0({\boldsymbol \varGamma })B_{\nu }({\boldsymbol \varGamma })\dot {A}}_{\mu }(\varGamma ,-\tau )\nonumber \\[4pt]&&=\beta \sum\limits _{\mu }{\int\nolimits ^{\infty }_0{\textrm{d}\tau \delta F_{\mu }\!\left (t-\tau \right )}}{\langle {B_{\nu }\dot {A}}_{\mu }\!\left (-\tau \right )\rangle }_0\nonumber \\[4pt]&&={\beta \sum\limits _{\mu }{\int\nolimits ^{\infty }_0{\textrm{d}\tau \delta F_{\mu }\!\left (t-\tau \right )}}\langle {B_{\nu }\!\left (\textit {t}\right )\dot {A}}_{\mu }\!\left (t-\tau \right )\rangle }{}_0. \end{eqnarray}

Here ${\langle {B_{\nu }\dot {A}}_{\mu }\!\left (-\tau \right )\rangle }_0$ is a correlation function and can be shown to be stationary ${\langle {B_{\nu }\dot {A}}_{\mu }\!\left (-\tau \right )\rangle }_0=$ ${\langle {B_{\nu }\!\left (\textit{t}\right )\dot {A}}_{\mu }\!\left (t-\tau \right )\rangle }_0$ , which is a ‘cross-correlation function.’

Definition: We make the following definition:

(2.307)

\begin{equation} C^{B\dot {A}}_{\nu \mu }(\tau )\equiv {\langle {B_{\nu }\!\left (\textit {t}\right )\dot {A}}_{\mu }\!\left (t-\tau \right )\rangle }_0. \end{equation}

Hence,

(2.308)

\begin{equation} \delta \langle B_{\nu }\rangle (t)\equiv {\beta \sum\limits _{\mu }{\int\nolimits ^{\infty }_0{\textrm{d}\tau C^{B\dot {A}}_{\nu \mu }(\tau )\delta F_{\mu }\!\left (t-\tau \right )}}}. \end{equation}

The integrals over time in (2.304), (2.306), (2.307), and (2.308) follow the same convention as in (2.295) with respect to the limits of the integrations given the actual initial conditions.

Equation (2.308) gives the response of the system at thermal equilibrium to an external field based just on the unperturbed system. This is a fluctuation response theorem describing the linear response of a system in thermal equilibrium to a small-amplitude external (nonequilibrium) field.

Because (2.308) is a convolution, we can Fourier analyze and use the convolution theorem to obtain

(2.309)

\begin{equation} \delta \langle B_{\nu }\rangle (\omega )\equiv {\beta \sum\limits _{\mu }{C^{B\dot {A}}_{\nu \mu }(\omega )\delta F_{\mu }\!\left (\omega \right )}}, \end{equation}

where

(2.310)

\begin{equation} C^{B\dot {A}}_{\nu \mu }\!\left (\omega \right )\equiv \int\nolimits ^{\infty }_0{\textrm{d}\tau e^{i\omega \tau }C^{B\dot {A}}_{\nu \mu }(\tau )} \end{equation}

is complex and satisfies the Kramers–Kronig relations. However,

(2.311)

\begin{equation} S\!\left (\omega \right )\equiv \int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau e^{i\omega \tau }C(\tau )} \end{equation}

is real, positive, and an even function of $\omega$ ; and

(2.312)

\begin{align} C\!\left (\omega \right )&=\int\nolimits ^{\infty }_0{\textrm{d}\tau e^{i\omega \tau }}\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega '}{2\pi }e^{-i\omega '\tau }}\nonumber\\[4pt] S\!\left (\omega '\right )&=\int\nolimits ^{\infty }_{-\infty }{\frac {\mathrm{d}{\omega }^{\prime} }{2\pi }{S\!\left ({\omega }^{\prime} \right )\int\nolimits ^{\infty }_0{\textrm{d}\tau e^{i(\omega -{\omega }^{\prime} )\tau }}}}\nonumber \\[4pt]&= \frac{1}{2}S\!\left (\omega \right )-\frac {i}{2\pi }\oint {\mathrm{d}\xi \frac {S(\xi )}{\xi -\omega }}, \end{align}

using $({1}/2){\pi}\smallint\nolimits ^{\infty }_0{\textrm{d}\tau e^{i(\omega -i{\omega }^{\prime} )\tau }}=({1}/{2})\delta \!\left (\omega -{\omega }^{\prime} \right )-(i/2\pi) (P/(\omega' -\omega))$ . Thus, $2\textrm {Re}\ C\!\left (\omega \right )=S\!\left (\omega \right )$ and $C'(\omega )=\textrm {Im}\ C(\omega )=-({1}/{\pi})\oint {\textrm{d}\xi {C'(\xi )}/({\xi -\omega' })}$ , which verifies the Kramers–Kronig relations.

The generalization of the linear response of the system at thermal equilibrium to an external field in three dimensions is

(2.313)

\begin{equation} \delta H=-{\boldsymbol{A}}\cdot \delta {\boldsymbol{F}}, \end{equation}

(2.314)

\begin{equation} \delta \langle {{\boldsymbol{B}}}_{\nu }\rangle (t)\equiv {\beta \int\nolimits ^{\infty }_0{\textrm{d}\tau \,{\boldsymbol{C}}(\tau )\cdot \delta {{\boldsymbol{F}}}\!\left (t-\tau \right )}}, \end{equation}

(2.315)

\begin{equation} \langle {\boldsymbol{B}}\rangle \!\left (\omega \right )={\beta {\boldsymbol{C}}(\omega )\cdot {\boldsymbol{F}}(\omega )}, \end{equation}

(2.316)

\begin{equation} {\boldsymbol{C}}\!\left (\omega \right )\equiv \int\nolimits ^{\infty }_0{\textrm{d}\tau e^{i\omega \tau }{\boldsymbol{C}}(\tau )}, \end{equation}

(2.317)

\begin{equation} 2{{\boldsymbol{C}}}^{\prime} \!\left (\omega \right )={\boldsymbol{S}}\!\left (\omega \right )=\int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau e^{i\omega \tau }{\boldsymbol{C}}(\tau )}, \end{equation}

where ${\boldsymbol{S}}\!\left (\omega \right )$ is Hermitian (real, symmetric) and positive-definite, and ${\boldsymbol{C}}(\omega )$ is complex and non-Hermitian.

Hence,

(2.318)

\begin{equation} {({\boldsymbol{C}}')}_{\mu \nu }\equiv \frac {{{\boldsymbol{C}}}_{\mu \nu }+{{\boldsymbol{C}}}^*_{\nu \mu }}{2},\quad {\left ({\boldsymbol{C}}^{\prime\prime}\right )}_{\mu \nu }\equiv \frac {{{\boldsymbol{C}}}_{\mu \nu }-{\textrm {C}}^*_{\nu \mu }}{2i}. \end{equation}

The real part is Hermitian and the imaginary part is anti-Hermitian.

Exercise: Prove (2.318) using ${\boldsymbol{C}}\textrm {(}\tau \textrm {)}$ is real.

Example: Consider Brownian motion with the perturbed Hamiltonian

(2.319)

\begin{equation} \delta H=e\delta \phi \!\left (X,t\right )=-Xe\delta E(t). \end{equation}

Here $A({\boldsymbol \varGamma })=-X$ and $\delta F=e\delta E(t)$ to touch base with (2.313), and $B\equiv {{{v}}}$ to connect with (2.314) and (2.315). Hence,

(2.320)

\begin{equation} \langle {{{v}}}\rangle \!\left (\omega \right )={\beta C^{{{{v}}}}(\omega )e\delta E(\omega )} \end{equation}

We use knowledge of the fluctuations $\delta E(\omega )$ to obtain the response $\langle {{{v}}}\rangle \!\left (\omega \right )$ , or vice versa. Recall from § 2.2 that

(2.321)

\begin{equation} C^{{{{v}}}}(\tau)=\langle {{{{v}}}}^2\rangle e^{-\nu \vert \tau \vert }, \ \nu =\frac {\gamma }{M} \to C^{{{{v}}}}\!\left (\omega \right )=\frac {T/M}{\nu -i\omega }. \end{equation}

From (2.320) and (2.321) it follows that

(2.322)

\begin{equation} \langle {{{v}}}\rangle \!\left (\omega \right )=\frac {e\delta E(\omega )}{\gamma -iM\omega }. \end{equation}

This agrees with the Langevin equation (2.60) suitably ensemble averaged:

(2.323)

\begin{equation} \!\left (\!M\frac {\textrm{d}}{\textrm{d}t}+\gamma\! \right )\!\langle V\rangle \!\left (t\right )=\langle \delta F\rangle \!\left (t\right )=e\ \delta E(t)\textrm{ or }\! \left (-i\omega M+\gamma \right )\langle V\rangle \!\left (\omega \right )=\langle \delta F\rangle (\omega ) =e\delta E(\omega ). \end{equation}

Example: Consider the current response to an external electric field turned on from zero.

We choose the gauge ${\boldsymbol{E}}=-\boldsymbol\nabla \phi$ . The perturbed Hamiltonian is

(2.324)

\begin{equation} \delta H\!\left ({\boldsymbol \varGamma },t\right )=\int\nolimits {\textrm{d}^3\textit{x}\,{\rho }^{\text{elec}}({\boldsymbol{x}}\vert {\boldsymbol \varGamma })\delta \phi ^{\text{ext}}({\boldsymbol{x}},t)}. \end{equation}

Relative to our previous notation for the linear response:

(2.325)

\begin{equation} \mu \to x^{\prime} \quad \rho \to A_{\mu }\quad \delta \phi ^{\text{ext}}\to -\delta F\quad B_{\mu }\to {\boldsymbol{j}}\!\left ({{\boldsymbol{x}}}^{\prime} \right )\quad B_{\nu }\to {\boldsymbol{j}}\!\left ({\boldsymbol{x}}\right ) \end{equation}

Then using earlier results

(2.326)

\begin{equation} \delta \langle {\boldsymbol{j}}\rangle \!\left (x,t\right )=-\beta \int\nolimits {\textrm{d}^3\textit{x}'\int\nolimits ^{\infty }_0{\textrm{d}\tau C^{{\boldsymbol{j}}\dot {\rho }}_{{\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} }(\tau )\delta \phi ^{\text{ext}}({{\boldsymbol{x}}}^{\prime} ,t-\tau }}), \end{equation}

where

(2.327)

\begin{equation} C^{{\boldsymbol{j}}\dot {\rho }}_{{\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} }(\tau)={\langle {\boldsymbol{j}}({\boldsymbol{x}}\vert {\boldsymbol \varGamma },t){\dot {\rho }}^{\text{elec}}({{\boldsymbol{x}}}^{\prime} \vert {\boldsymbol \varGamma },t-\tau )\rangle }_0, \end{equation}

(2.328)

\begin{equation} {\dot {\rho }}^{\text{elec}}\!\left ({{\boldsymbol{x}}}^{\prime} \vert {\boldsymbol \varGamma },t-\tau \right )=-\frac {\partial}{\partial {{\boldsymbol{x}}}^{\prime} }\cdot {\boldsymbol{j}}({{\boldsymbol{x}}}^{\prime} \vert {\boldsymbol \varGamma },t-\tau ). \end{equation}

The ensemble average of (2.326) for the internal current removes the fluctuations from the current response. The internal current is the current carried by the charged particles within the system in response to fields but not including externally imposed currents in wires, say.

We use (2.328) and (2.327) in (2.326) and integrate with respect to $\textrm{d}^3{{\boldsymbol{x}}}^{\prime}$ by parts so that ${\partial}/{\partial {{\boldsymbol{x}}}^{\prime} }\cdot$ operates on $\delta \phi ^{\text{ext}}$ with a sign change to obtain

(2.329)

\begin{equation} \delta \langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{x}},t\right )=\int\nolimits {\textrm{d}^3\textit{x}'\int\nolimits ^{\infty }_0{\textrm{d}\tau {{\boldsymbol \sigma }}^{\text{ext}}(\tau )\cdot {{\boldsymbol{E}}}^{\text{ext}}({{\boldsymbol{x}}}^{\prime} ,t-\tau }}), \end{equation}

where ${{\boldsymbol \sigma }}^{\text{ext}}$ is the response tensor

(2.330)

\begin{equation} {{\boldsymbol \sigma }}^{\text{ext}}\!\left ({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} ,\tau \right )=\beta {\langle {\boldsymbol{j}}({\boldsymbol{x}}\vert {\boldsymbol \varGamma },t){\boldsymbol{j}}({{\boldsymbol{x}}}^{\prime} \vert {\boldsymbol \varGamma },t-\tau )\rangle }_0. \end{equation}

Equation (2.329) is nonlocal and involves a two-point correlation function for the fluctuating current density ${\boldsymbol{j}}.$

Discussion of special cases:

– If ${{\boldsymbol \sigma }}^{\text{ext}}\!\left ({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} ,\tau \right )={{\boldsymbol \sigma }}^{\text{ext}}\!\left ({\boldsymbol{x}}-{{\boldsymbol{x}}}^{\prime} ,\tau \right )$ ; perhaps this is good only for a uniform infinite medium based on a translational invariance argument.

– If ${{\boldsymbol \sigma }}^{\text{ext}}\!\left ({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} ,\tau \right )={{\boldsymbol \sigma }}^{\text{ext}}\!\left ({\boldsymbol \vert }{\boldsymbol{x}}-{{\boldsymbol{x}}}^{\prime} {\boldsymbol \vert },\tau \right )$ , i.e., an isotropic conductivity, then

(2.331)

\begin{equation} {{\boldsymbol \sigma }}^{\text{ext}}={\sigma }_1\!\left (s,\tau \right ){\boldsymbol{I}}+{\sigma }_2\!\left (s,\tau \right )\hat {{\boldsymbol{s}}}\,\hat {{\boldsymbol{s}}},\ \,{\boldsymbol{s}}={\boldsymbol{x}}-{\boldsymbol{x}}'. \end{equation}

– If ${{\boldsymbol \sigma }}^{\text{ext}}\!\left ({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} ,\tau \right )$ is rotationally symmetric about a preferred direction, e.g., with respect to an applied magnetic field, the equation ${\boldsymbol{J}}\!\left (\omega \right )={\boldsymbol \sigma }(\omega )\cdot {\boldsymbol{E}}(\omega )$ can be written as

(2.332)

\begin{equation} {\boldsymbol{J}}\!\left (\omega \right )={\boldsymbol \sigma }\!\left (\omega \right )\cdot {\boldsymbol{E}}\!\left (\omega \right )={\sigma }_{\parallel }{\!\left (\omega \right ){\boldsymbol{E}}}_{\parallel }\!\left (\omega \right )+{\sigma }_{\bot }\!\left (\omega \right ){{\boldsymbol{E}}}_{\bot }\!\left (\omega \right )+{\sigma }_{\wedge }{\!\left (\omega \right ){\boldsymbol{E}}}_{\wedge }\!\left (\omega \right ). \end{equation}

where ${{\boldsymbol{E}}}_{\parallel }=({\boldsymbol{E}}\cdot \hat {{\boldsymbol{b}}})\hat {{\boldsymbol{b}}}$ , ${{\boldsymbol{E}}}_{\bot }={\boldsymbol{E}}-{{\boldsymbol{E}}}_{\parallel },$ and ${{\boldsymbol{E}}}_{\wedge }={\boldsymbol{E}}\times \hat {{\boldsymbol{b}}}$ .

Example: The most general external electromagnetic field can be expressed as

(2.333)

\begin{equation} {{\boldsymbol{E}}}^{\text{ext}}=-\nabla \phi ^{\text{ext}}-\frac {1}{c}\frac {\partial {{\boldsymbol{A}}}^{\text{ext}}}{\partial t}. \end{equation}

Example: The inclusion of thermal fluctuations in a system with nontrivial boundary conditions is considered in Landau & Lifshitz (Reference Landau and Lifshitz1963). This is a very difficult calculation.

Note that the linear response calculations presented here insist that the unperturbed system is in thermal equilibrium before the external field is turned on, and the theory is linear.

Example: Consider the linear response of the current in a uniform medium. Let ${\boldsymbol{s}}\equiv {\boldsymbol{x}}-{\boldsymbol{x}}{\mathbf '}$ ,

(2.334)

\begin{equation} \delta \langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{x}},t\right )=\int\nolimits {\textrm{d}^3\textit{x}'\int\nolimits ^{\infty }_0{\textrm{d}\tau {{\boldsymbol \sigma }}^{\text{ext}}({\boldsymbol{s}},\tau )\cdot {{\boldsymbol{E}}}^{\text{ext}}({\boldsymbol{x}}-{\boldsymbol{s}},t-\tau }}), \end{equation}

(2.335)

\begin{equation} \delta \langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{k}},\omega \right )={{\boldsymbol \sigma }}^{\text{ext}}({\boldsymbol{k}},\omega )\,{\cdot {\boldsymbol{E}}}^{\text{ext}}({\boldsymbol{k}},\omega ), \end{equation}

(2.336)

\begin{equation} g({\boldsymbol{k}},\omega )\equiv \int\nolimits ^{\infty }_{-\infty }{\textrm{d}t\int\nolimits {\textrm{d}^3\textit{x}\ g({\boldsymbol{x}},t)e^{i\omega t-i{\boldsymbol{k}}\cdot {\boldsymbol{x}}}}}, \end{equation}

(2.337)

\begin{equation} {{\boldsymbol \sigma }}^{\text{ext}}({\boldsymbol{k}},\omega )\equiv \int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau \int\nolimits {\textrm{d}^3\textrm {s}\,{{\boldsymbol \sigma }}^{\text{ext}}({\boldsymbol{s}},\tau )e^{i\omega \tau -i{\boldsymbol{k}}\cdot {\boldsymbol{s}}}}}, \end{equation}

and $\delta \langle {\boldsymbol{j}}\rangle \to \langle {\boldsymbol{j}}\rangle$ subsequently in the notation. Now what about the conventional conductivity?

Definition: We make the following definition:

(2.338)

\begin{equation} \langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{k}},\omega \right )={{\boldsymbol \sigma }}({\boldsymbol{k}},\omega )\,{\cdot {\boldsymbol{E}}}^{\text{tot}}({\boldsymbol{k}},\omega )\; \textrm{and}\; {{\boldsymbol{E}}}^{\text{tot}}={{\boldsymbol{E}}}^{\text{ext}}+\ \langle {{\boldsymbol{E}}}^{\text{int}}\rangle . \end{equation}

This includes the internal electric field with the fluctuation field averaged out. In the limit that ${\boldsymbol{k}}\to 0$ , $\langle {\boldsymbol{j}}\rangle \!\left (\omega \right )={{\boldsymbol \sigma }}(\omega )\,{\cdot {\boldsymbol{E}}}^{\text{tot}}(\omega )$ for $\lambda \gg d$ usually. Maxwell’s equations tell us

(2.339)

\begin{equation} \nabla \times \langle {{\boldsymbol{B}}}^{\text{int}}\rangle -\frac {1}{c}\left\langle \frac {\partial {{\boldsymbol{E}}}^{\text{int}}}{\partial t}\right\rangle =\frac {4\pi }{c}\langle {{\boldsymbol{j}}}^{\text{int}}\rangle \to i{\boldsymbol{k}}\times {{\boldsymbol{B}}}^{\text{int}}+\frac {i\omega }{c}{{\boldsymbol{E}}}^{\text{int}}=\frac {4\pi }{c}{{\boldsymbol{j}}}^{\text{int}}, \end{equation}

(2.340)

\begin{equation} \nabla \times {\textrm {E}}^{\text{int}}=-\frac {1}{c}\frac {\partial {{\boldsymbol{B}}}^{\text{int}}}{\partial t} \to {\boldsymbol{k}}\times {{\boldsymbol{E}}}^{\text{int}}=\frac {\omega }{c}{{\boldsymbol{B}}}^{\text{int}}. \end{equation}

The Maxwell equations being linear, one can use the superposition principle and decompose.

Definition: Define ${{\boldsymbol{I}}}^{\prime}\equiv {\boldsymbol{I}}-(k^2c^2/\omega^2)({\boldsymbol{I}}-\hat {{\boldsymbol{k}}}\,\hat {{\boldsymbol{k}}}).$

Then given (2.335) and (2.338), and recognizing that $\delta \langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{k}},\omega \right )$ and $\langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{k}},\omega \right )$ are equal to the internal current

(2.341)

\begin{equation} \frac {i\omega }{c}{{\boldsymbol{I}}}^{\prime} \cdot {{\boldsymbol{E}}}^{\text{int}}=\frac {4\pi }{c}{{\boldsymbol{j}}}^{\text{int}} \to \frac {i\omega }{4\pi }{{\boldsymbol{I}}}^{\prime} \cdot \!\left ({{\boldsymbol{E}}}^{\text{tot}}-{{\boldsymbol{E}}}^{\text{ext}}\right )={\boldsymbol \sigma }\cdot {{\boldsymbol{E}}}^{\text{tot}}={{\boldsymbol \sigma }}^{\text{ext}}\cdot {{\boldsymbol{E}}}^{\text{ext}}, \end{equation}

where ${{\boldsymbol \sigma }}^{\text{ext}}$ is the Kubo conductivity given by the fluctuations and

(2.342)

\begin{equation} \frac {i\omega }{c}{{\boldsymbol{I}}}^{\prime} \cdot {{\boldsymbol{E}}}^{\text{ext}}=\bigg({\boldsymbol \sigma }-\frac {i\omega }{4\pi }{{\boldsymbol{I}}}^{\prime} \bigg)\cdot {{\boldsymbol{E}}}^{\text{tot}}. \end{equation}

Then using $\nabla \times \langle {{\boldsymbol{B}}}^{\text{tot}}\rangle -({1}/{c})\langle {\partial {{\boldsymbol{E}}}^{\text{tot}}}/{\partial t}\rangle =({4\pi }/{c})\langle {{\boldsymbol{j}}}^{\text{tot}}={{\boldsymbol{j}}}^{\text{int}}+{{\boldsymbol{j}}}^{\text{ext}}\rangle$ and (2.341)

(2.343)

\begin{equation} i{\boldsymbol{k}}\times {{\boldsymbol{B}}}^{\text{tot}}+\frac {i\omega }{c}\!\left ({\boldsymbol{I}}-\frac {4\pi }{i\omega }{\boldsymbol \sigma }\right )\cdot {{\boldsymbol{E}}}^{\text{tot}}=\frac {4\pi }{c}{{\boldsymbol{j}}}^{\text{ext}}. \end{equation}

We note that the dielectric function is

(2.344)

\begin{equation} {{\boldsymbol \varepsilon }}\equiv \!\left ({\boldsymbol{I}}-\frac {4\pi }{i\omega }{\boldsymbol \sigma }\right ) \end{equation}

and (2.341) can be rewritten as

(2.345)

\begin{equation} {{\boldsymbol \sigma }}^{\text{ext}}\cdot {{\boldsymbol{E}}}^{\text{ext}}={\boldsymbol \sigma }\cdot {{\boldsymbol{E}}}^{\text{tot}}={\boldsymbol \sigma }{\!\left ({\boldsymbol \sigma }-\frac {i\omega }{4\pi }{{\boldsymbol{I}}}^{\prime} \right )}^{-1}\cdot \!\left (-\frac {i\omega }{4\pi }\right ){\boldsymbol{I}}'\cdot {{\boldsymbol{E}}}^{\text{ext}}. \end{equation}

Solving for ${{\boldsymbol \sigma }}^{\text{ext}}$ in (2.345) one obtains the relation between the Kubo ${{\boldsymbol \sigma }}^{\text{ext}}$ and the conventional conductivity $\boldsymbol \sigma$

(2.346)

\begin{equation} {{\boldsymbol \sigma }}^{\text{ext}}={\boldsymbol \sigma }{\cdot \!\left ({\boldsymbol \varepsilon }-\frac {k^2c^2}{{\omega }^2}\!\left ({\boldsymbol{I}}-\hat {{\boldsymbol{k}}}\,\hat {{\boldsymbol{k}}}\right )\right )}^{-1}\cdot {\boldsymbol{I}}{\mathbf '}. \end{equation}

Example: For an isotropic uniform medium we can separate longitudinal and transverse components of the conductivity tensor:

(2.347)

\begin{equation} {\boldsymbol \sigma }\!\left ({\boldsymbol{k}},\omega \right )={\sigma }^{\ell }\!\left ({\boldsymbol{k}},\omega \right )\hat {{\boldsymbol{k}}}\,\hat {{\boldsymbol{k}}}+{\sigma }^t({\boldsymbol{k}},\omega )({\boldsymbol{I}}-\hat {{\boldsymbol{k}}}\,\hat {{\boldsymbol{k}}}). \end{equation}

From (2.344) ${{{\boldsymbol \varepsilon }}^{\ell ,t}}=\left ({\boldsymbol{I}}-({4\pi }/{i\omega }){{\boldsymbol \sigma }}^{\ell ,t}\right )$ which is used on the right-hand side of (2.346) to obtain

(2.348)

\begin{align} &{\sigma }^{\ell }_{\text{ext}}\!\left (k,\omega \right )=\frac {i\omega }{4\pi }\!\left (\frac {1}{{\varepsilon }^{\ell }(k,\omega )}-1\right ),{\sigma }^t_{\text{ext}}\!\left (k,\omega \right )=\frac {i\omega }{4\pi }\!\left (1-\frac {k^2c^2}{{\omega }^2}\right )\!\left (\frac {1-\frac {k^2c^2}{{\omega }^2}}{{\varepsilon }^{tr}-\ \frac {k^2c^2}{{\omega }^2}}-1\right ).\nonumber\\[4pt]& \end{align}

It follows that

(2.349)

\begin{align} {\textrm {Re}\ \sigma }^{\ell }_{\text{ext}}\!\left (k,\omega \right )&=-\frac {\omega }{4\pi }\textrm {Im}\ \frac {1}{{\varepsilon }^{\ell }(k,\omega )} \;\textrm{and}\; \nonumber \\[4pt]{\textrm {Re}\ \sigma }^t_{\text{ext}}\!\left (k,\omega \right ) &=-\frac {\omega }{4\pi }\!\left (1-\frac {k^2c^2}{{\omega }^2}\right )\textrm {Im}\ \frac {1}{{\varepsilon }^t-\ \frac {k^2c^2}{{\omega }^2}}. \end{align}

We recall from (2.327) and (2.330) that

(2.350)

\begin{equation} {{\boldsymbol \sigma }}^{\text{ext}}\!\left ({\boldsymbol{s}},\tau \right )\equiv \beta {\langle {\boldsymbol{j}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma },t\right ){\boldsymbol{j}}\!\left ({\boldsymbol{x}}-{\boldsymbol{s}}\vert{\boldsymbol \varGamma },t-\tau \right )\rangle }_0\equiv {\beta {\boldsymbol{C}}}^j\!\left ({\boldsymbol{s}},\tau \right ),\ \tau \gt 0, \end{equation}

where the ‘0’ subscript denotes thermal equilibrium. We take the Fourier transforms of the longitudinal part of (2.350) to obtain

(2.351)

\begin{equation} {{{\boldsymbol \sigma }}}^{\text{ext}}_{\ell }\!\left ({\boldsymbol{k}},\omega \right )\equiv \beta {{\boldsymbol{C}}}^j_{\ell }\!\left ({\boldsymbol{k}},\omega \right ). \end{equation}

Next take the real Hermitian part of (2.351)

(2.352)

\begin{equation} {\mathcal H}e\,{{{\boldsymbol \sigma }}}^{\text{ext}}_{\ell }\!\left ({\boldsymbol{k}},\omega \right )\equiv \beta \,{\mathcal H}e\ C^j_{\ell }\!\left ({\boldsymbol{k}},\omega \right )=\frac {\beta }{2}\ S^j_{\ell }({\boldsymbol{k}},\omega ). \end{equation}

From (2.349) and (2.352)

(2.353)

\begin{equation} {\langle jj\rangle }^{\ell }_{k,\omega }=2T\!\left (-\frac {\omega }{4\pi }\textrm {Im}\ \frac {1}{{\varepsilon }^{\ell }(k,\omega )}\right ). \end{equation}

Charge conservation asserts that $\dot {\rho }=-\nabla \cdot {\boldsymbol{j}} \to -i\omega \rho =-i{\boldsymbol{k}}\cdot {\boldsymbol{j}}=-ikj^{\ell }$ , which we use in conjunction with (2.353) to obtain the following result.

The fluctuation–dissipation relation obtained (Kubo Reference Kubo1966) is

(2.354)

\begin{align}& {\left (\frac {\omega }{k}\right )}^2{\langle \rho \rho \rangle }_{k\omega }={\langle jj\rangle }^{\ell }_{k,\omega }=2T\!\left (-\frac {\omega }{4\pi }\textrm {Im}\ \frac {1}{{\varepsilon }^{\ell }\!\left (k,\omega \right )}\right ) \nonumber\\[4pt]&\qquad \qquad \to S^{{\rho }_{el.}}\!\left (k,\omega \right )\equiv {\langle \rho \rho \rangle }_{k\omega } =-\frac {T}{2\pi }\frac {k^2}{\omega }\textrm {Im}\ \frac {1}{{\varepsilon }^{\ell }(k,\omega )}. \end{align}

Example: Scattering of radio-frequency waves off the ionosphere. To understand the scattering experiments people calculated the conductivity and then inferred the $S^{{\rho }_{el.}}\!\left (k,\omega \right ).$

Example: In the very long wavelength limit, we claim that for $k \to 0, {\sigma }^t={\sigma }^{\ell }=\sigma$ from (2.348) and (2.349). Using $\varepsilon =1-{4\pi }/{i\omega }\sigma$ and $\sigma \!\left (\omega \right )=\sigma = (n_e e^2/m) (1/(\nu_{tr}-i\omega))$ from (2.354) one obtains

(2.355)

\begin{equation} S^{{\rho }_{el.}}(k,\omega )=\frac {{\omega }^2_p}{2\pi }Tk^2\frac {{\nu }_{tr}}{{\nu }^2_{tr}+{\omega }^2}\frac {1}{\vert\omega - \frac {{\omega }^2_p}{\omega +i{\nu }_{tr}}\vert ^2}. \end{equation}

We note that $S^{{\rho }_{el.}}\!\left (k\to 0,\omega =0\right )=(T/2\pi)k^2(\nu_{tr}/\omega_p^2)$ and $\sigma \!\left (\omega =0\right )={{\omega }^2_p}/(4\pi\nu_{tr})\equiv {1}/{\eta (\omega =0)}$ ; hence, $S^{{\rho }_{el.}}\!\left (k\to 0,\omega \approx 0\right )=({1}/{8{\pi}^2})Tk^2\eta \!\left (\omega =0\right )\!,\,\text{which}$ is the Johnson–Nyquist noise spectrum result. The resistivity at $\omega =0$ has been introduced here as the inverse of $\sigma \!\left (\omega =0\right ).$ As a function of frequency $\omega ,$ $S^{{\rho }_{el.}}$ has a peak at ${\omega }_{p}$ with a width of order ${\nu }_{tr}$ with the implicit assumption ${\nu }_{tr}\ll {\omega }_{p}.$

There are symmetry conditions pertinent to the two-time correlation function (recall the relations introduced in (2.316), (2.317), and (2.318)). Associated with stationarity in time we have

(2.356)

\begin{align} &C_{\mu \nu }(\tau)\equiv \langle A_{\mu }\!\left (t\right )A_{\nu }\!\left (t-\tau \right )\rangle =\langle A_{\mu }\!\left (t+\tau \right )A_{\nu }\!\left (t\right )\rangle =\langle A_{\nu }\!\left (t\right )A_{\mu }\!\left (t+\tau \right )\rangle {=C}_{\nu \mu }\!\left (-\tau \right )\nonumber\\[4pt]& \end{align}

and

(2.357)

\begin{align} S_{\mu \nu }(\omega )&\equiv \int\nolimits ^{\infty }_{-\infty }{\textrm{d}\tau e^{i\omega \tau }}C_{\mu \nu }(\tau)\nonumber\\[4pt] &= \int\nolimits ^{\infty }_{-\infty }{\mathrm{d}{\tau }^{'e^{-i\omega \tau }}}C_{\mu \nu }\!\left (-{\tau }^{\prime} \right ),\ \ \ \ \ \tau =-{\tau }^{\prime} ,\nonumber\\[4pt] &=S_{\nu \mu }\!\left (-\omega \right )=S^*_{\nu \mu }\!\left (\omega \right ), \end{align}

due to the reality of $C_{\mu \nu }(\tau )$ . From (2.357) we deduce that $S_{\mu \nu }(\omega )$ is Hermitian.

Now we assume time reversibility, and we will prove that $C_{\mu \nu }$ is symmetric. Consider the phase space $\varGamma (r_i,{{{{v}}}}_i)$ which becomes $\tilde {\varGamma }(r_i,-{{{{v}}}}_i)$ under time reversal. Assume a model Hamiltonian with unit mass, m = 1, and no magnetic effects:

(2.358)

\begin{equation} H={\sum\limits _i{{{{{\boldsymbol v}}}}^2_i}}+\phi \!\left ({{\boldsymbol{r}}}_i-{{\boldsymbol{r}}}\right )+e_i\phi _{\xi }\!\left ({{\boldsymbol{r}}}_i\right ). \end{equation}

Assume further that ${\rho }_0\!\left (H\right ):{\rho }_0({\boldsymbol \varGamma })={\rho }_0(\tilde {{\boldsymbol \varGamma }})$ . Given the assumptions, we have

(2.359)

\begin{equation} A_{\mu }({\boldsymbol \varGamma })=\pm A_{\mu }(\tilde {{\boldsymbol \varGamma }}),\quad A_{\mu }({\boldsymbol \varGamma }\textrm {,}\tau )=\pm A_{\mu }(\tilde {{\boldsymbol \varGamma }},-\tau ). \end{equation}

For example, ${\boldsymbol{j}}\!\left ({\boldsymbol{x}}\right )=\Sigma _i{{{{{\boldsymbol v}}}}_i\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_{{\boldsymbol{i}}}\right )\to -\Sigma _i{{{{{\boldsymbol v}}}}_i\delta ({\boldsymbol{x}}-{{\boldsymbol{r}}}_{\textrm {i}})}}$ under time reversal. Under time reversal the correlation function becomes

(2.360)

\begin{align} C_{\mu \nu }(\tau)&\equiv \int\nolimits {\textrm{d}\varGamma {\rho }_0({\boldsymbol \varGamma })A_{\mu }\!\left ({\boldsymbol \varGamma },t\right )A_{\nu }\!\left ({\boldsymbol \varGamma },t-\tau \right )}\nonumber\\[4pt]&=\int\nolimits {d\tilde {\varGamma }{\rho }_0(\tilde{{\boldsymbol \varGamma }})A_{\mu }(\tilde {{\boldsymbol \varGamma }},-t)A_{\nu }(\tilde {{\boldsymbol \varGamma }},-t+\tau)}. \end{align}

We note that with ${{\boldsymbol{r}}}_i({{\boldsymbol{r}}}_i,{{{{\boldsymbol v}}}}_i,\tau )={{\boldsymbol{r}}}_i+{{{{\boldsymbol v}}}}_i\tau \to {{\boldsymbol{r}}}_i({{\boldsymbol{r}}}_i,{-{{{\boldsymbol v}}}}_i,-\tau )={{\boldsymbol{r}}}_i+(-{{{{\boldsymbol v}}}}_i)(-\tau )$ under time reversal.

From (2.356) and (2.360) we conclude the proof of the symmetry of $C_{\mu \nu }(\tau)$

(2.361)

\begin{equation} C_{\mu \nu }(\tau)=C_{\mu \nu }\!\left (-\tau \right )\quad \textrm{and}\quad C_{\mu \nu }(\tau)=C_{\nu \mu }(\tau). \end{equation}

Thus, $C_{\mu \nu }(\tau)$ is symmetric as is $S_{\mu \nu }(\omega )$ ; and both are real. Hence, both $C_{\mu \nu }(\tau)$ and $S_{\mu \nu }(\omega )$ are Hermitian.

We now return to the electrical conductivity. Recall from (2.329) that

(2.362)

\begin{equation} \delta \langle {\boldsymbol{j}}\rangle ({\boldsymbol{x}},t)=\int\nolimits {\textrm{d}^3\textit{x}'\int\nolimits ^{\infty }_0{\textrm{d}\tau {{\boldsymbol \sigma }}^{\text{ext}}({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} ;\tau )\cdot {{\boldsymbol{E}}}^{\text{ext}}({{\boldsymbol{x}}}^{\prime} ,t-\tau }}). \end{equation}

Using the symmetry relations for ${{\boldsymbol \sigma }}^{\text{ext}}$ we have

(2.363)

\begin{equation} {\sigma }^{\text{ext}}_{ij}\!\left ({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} ;\tau \right )={\sigma }^{\text{ext}}_{ji}\!\left ({\boldsymbol{x}}',{\boldsymbol{x}};\tau \right ). \end{equation}

In (2.363) the $\!\left ({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} \right )$ dependence is $\!\left ({\boldsymbol{x}}-{{\boldsymbol{x}}}^{\prime} \right )$ . For no magnetic field $B_0=0$ , (2.363) leads to

(2.364)

\begin{equation} {\sigma }^{\text{ext}}_{ij}({\boldsymbol{k}};\omega )={\sigma }^{\text{ext}}_{ji}\!\left (-{\boldsymbol{k}};\omega \right ). \end{equation}

With a magnetic field, Onsager & Fuoss (Reference Onsager and Fuoss1932), making no assumption about isotropy, showed that

(2.365)

\begin{equation} {\sigma }_{ij}\!\left ({\boldsymbol{k}},\omega ;{{\boldsymbol{B}}}_0\right )={\sigma }_{ij}\!\left (-{\boldsymbol{k}},\omega ;{-{\boldsymbol{B}}}_0\right ) \end{equation}

if ${{\boldsymbol{B}}}_0\to -{{\boldsymbol{B}}}_0$ under time reversal. Equation (2.365) was discovered experimentally around 1900.

For future use, keep in mind that $C_{\mu \nu }(\tau)$ and $S_{\mu \nu }(\omega )$ are real and symmetric, but $C_{\mu \nu }\!\left (\omega \right )$ is not real.

2.6.6. Relation of entropy production to electrical conductivity

In this section we derive a relation between the entropy production and the electrical conductivity tensor. First consider a general form of the system Hamiltonian:

(2.366)

\begin{equation} H\!\left ({\boldsymbol \varGamma },t\right )=H_0({\boldsymbol \varGamma })+\delta H({\boldsymbol \varGamma },t), \end{equation}

from which we deduce

(2.367)

\begin{equation} \dot {H}=\frac {\partial}{\partial t}\delta H\!\left ({\boldsymbol \varGamma },t\right )=-\sum\limits _{\mu }{A_{\mu }({\boldsymbol \varGamma })\frac {\textrm{d}}{\textrm{d}t}F_{\mu }(t)}, \end{equation}

using the notation in (2.298) to separate phase factor components. We form the ensemble average and perform the time integral of (2.367):

(2.368)

\begin{align} \Delta \langle H\rangle &=\int\nolimits ^{\infty }_{-\infty }{\textrm{d}t\ \langle \dot {H}\!\left (t\right )\rangle }\nonumber \\[4pt] &=-\sum\limits _{\mu }{\int\nolimits ^{\infty }_{-\infty }{\textrm{d}t\ \langle A_{\mu }({\boldsymbol \varGamma })\rangle }}\frac {\textrm{d}}{\textrm{d}t}F_{\mu }\!\left (t\right )\nonumber \\[4pt] &=-\sum\limits _{\mu }{\int\nolimits ^{\infty }_{-\infty }{\textrm{d}t\ \langle A_{\mu }\rangle }}(t)\frac {\textrm{d}}{\textrm{d}t}F_{\mu }\!\left (t\right )\nonumber \\[4pt] &=\sum\limits _{\mu }{\int\nolimits ^{\infty }_{-\infty }{\textrm{d}t{\ F}_{\mu }\!\left (t\right )\frac {\textrm{d}}{\textrm{d}t}\langle A\rangle }}(t)\nonumber \\[4pt] &=\sum\limits _{\mu }{\int\nolimits ^{\infty }_{-\infty }{\textrm{d}t{\ F}_{\mu }\!\left (t\right )\langle {\;\dot{\!A}}_{\mu }\rangle }}(t), \end{align}

integrating by parts with vanishing contributions at the endpoints $t=$ $\pm \infty$ and using earlier results. Equation (2.368) can be expressed in alternative form using Parseval’s theorem:

(2.369)

\begin{align} \Delta \langle H\rangle &=\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\sum\limits _{\mu }{{F^*_{\mu }}}\!\left (\omega \right )\langle {\;\dot{\!A}}_{\mu }\rangle }\!\left (\omega \right )\nonumber \\[4pt]&=\beta \int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\sum\limits _{\mu }{{F^*_{\mu }}}\!\left (\omega \right )}\sum\limits _{\nu }{C^{\dot {A}\dot {A}}_{\mu \nu }F_{\nu }(\omega )}\nonumber \\[4pt] &= \beta \int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }{{\boldsymbol{F}}}^*\!\left (\omega \right )}\cdot {{\boldsymbol{C}}}^{\;\dot{\!A}}(\omega )\cdot {\boldsymbol{F}}(\omega )\nonumber \\[4pt] &= \beta \int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }{{\boldsymbol{F}}}\!\left (\omega \right )}\cdot {{\boldsymbol{C}}}^{\dot {A}*}\!\left (\omega \right )\cdot {{\boldsymbol{F}}}^{{\mathbf *}}(\omega )\nonumber \\[4pt] &= \beta \int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }{{{\boldsymbol{F}}}^*}\!\left (\omega \right )}\cdot {\tilde {{\boldsymbol{C}}}}^{\dot {A}*}\!\left (\omega \right )\cdot {\boldsymbol{F}}(\omega )\nonumber \\[4pt] &= \beta \int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }{{{\boldsymbol{F}}}^*}\!\left (\omega \right )}\cdot {{\boldsymbol{C}}}^{\dot {A}T}\!\left (\omega \right )\cdot {\boldsymbol{F}}(\omega ). \end{align}

We comment that Parseval’s theorem involves integrals over the infinite domains in both time and frequency. Hence, from the different forms of the right-hand side of (2.369) it follows that

(2.370)

\begin{align} \Delta \langle H\rangle &= \beta \int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }{{{\boldsymbol{F}}}^*}\!\left (\omega \right )}\cdot {{\boldsymbol{C}}}^{{\mathcal H}e}_{\;\dot{\!A}}\!\left (\omega \right )\cdot {\boldsymbol{F}}\!\left (\omega \right )\nonumber \\[4pt] &=\frac {\beta }{2} \int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }{{{\boldsymbol{F}}}^*}\!\left (\omega \right )}\cdot {{\boldsymbol{S}}}^{\;\dot{\!A}}\!\left (\omega \right )\cdot {\boldsymbol{F}}\!\left (\omega \right )\nonumber \\[4pt] &= \frac {\beta }{2}\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }{{{\boldsymbol{F}}}^*}\!\left (\omega \right )}\cdot {{\omega }^2{\boldsymbol{S}}}^A\!\left (\omega \right )\cdot {\boldsymbol{F}}\!\left (\omega \right ), \end{align}

where ${{\boldsymbol{C}}}^{{\mathcal H}e}_{\;\dot{\!A}}$ denotes the Hermitian part of the tensor and the Wiener–Khinchin theorem (2.53) has been used.

Definition: The entropy is given by $\Delta S\equiv \beta \Delta \langle H\rangle$ from (1.112).

For the specific example of the electrical conductivity (2.370) becomes

(2.371)

\begin{align} \Delta \langle H\rangle &=\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\int\nolimits {\textrm{d}^3\textit{x}\ \langle {\dot {\rho }}_{\text{elec}}\rangle \!\left ({\boldsymbol{x}},\omega \right )\phi ^{ex*}\!\left ({\boldsymbol{x}},\omega \right )}}\nonumber \\[4pt] &=-\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\int\nolimits {\textrm{d}^3\textit{x}\nabla \cdot \langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{x}},\omega \right )\phi ^{ex*}\!\left ({\boldsymbol{x}},\omega \right )}}\nonumber \\[4pt] &=\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\int\nolimits {\textrm{d}^3\textit{x}\langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{x}},\omega \right )\cdot {{\boldsymbol{E}}}^{ext*}\!\left ({\boldsymbol{x}},\omega \right )}}, \end{align}

using charge continuity and integrating by parts. We use

(2.372)

\begin{equation} \langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{x}},\omega \right )\cdot {{\boldsymbol{E}}}^{\text{ext}}\!\left ({\boldsymbol{x}},\omega \right )=\int\nolimits {\textrm{d}^3\textit{x}{\mathbf '}{\boldsymbol \sigma }\cdot \langle {{\boldsymbol{E}}}^{\text{tot}}\rangle \!\left ({\boldsymbol{x}}{\mathbf '},\omega \right )\cdot {\langle {{\boldsymbol{E}}}^{\text{tot}}-{{\boldsymbol{E}}}^{\text{int}}\rangle }^*\!\left ({\boldsymbol{x}},\omega \right )} \end{equation}

to express (2.371) as

(2.373)

\begin{align} \Delta \langle H\rangle &=\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\int\nolimits {\textrm{d}^3\textit{x}\;\langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{x}},\omega \right )\cdot {{\boldsymbol{E}}}^{ext*}\!\left ({\boldsymbol{x}},\omega \right )}}\nonumber \\[4pt]&=\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\int\nolimits {\textrm{d}^3\textit{x}}\int\nolimits {\textrm{d}^3{\textit{x}}^{\prime} \langle {{\boldsymbol{E}}}^{\text{tot}}\rangle \!\left ({\boldsymbol{x}},\omega \right )\cdot {\boldsymbol \sigma }\!\left ({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} ,\omega \right )\cdot {\langle {{\boldsymbol{E}}}^{\text{tot}}\rangle }^*}\!\left ({{\boldsymbol{x}}}^{\prime} ,\omega \right )}\nonumber \\[4pt]&\quad -\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\int\nolimits {\textrm{d}^3\textit{x}\;\langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{x}},\omega \right )\cdot {{\langle {{\boldsymbol{E}}}^{\text{int}}\rangle }^*}\!\left ({\boldsymbol{x}},\omega \right )}}. \end{align}

However, conservation of energy derived for Maxwell’s equations using Parseval’s theorem for the $\smallint {dvol}{\boldsymbol{j}}\cdot {\boldsymbol{E}}$ term yields

(2.374)

\begin{align} &\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\int\nolimits {\textrm{d}^3\textit{x}\;\langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{x}},\omega \right )\cdot {{\langle {{\boldsymbol{E}}}^{\text{int}}\rangle }^*}\!\left ({\boldsymbol{x}},\omega \right )}}\nonumber \\[4pt]&\quad =\int\nolimits ^{\infty }_{-\infty }{\textrm{d}t}\int\nolimits {\textrm{d}^3\textit{x}\;\ \frac {\partial}{\partial t}\!\left (\frac {E^2+B^2}{8\pi }\right )-\nabla \cdot \frac {c}{4\pi }\!\left ({\boldsymbol{E}}\times {\boldsymbol{B}}\right )=0}, \end{align}

with no sources and suitable boundary conditions, so that (2.373) becomes

(2.375)

\begin{eqnarray} \Delta \langle H\rangle &&=\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\int\nolimits {\textrm{d}^3\textit{x}}\int\nolimits {\textrm{d}^3{\textit{x}}^{\prime} \langle {{\boldsymbol{E}}}^{\text{tot}}\rangle \!\left ({\boldsymbol{x}},\omega \right )\cdot {\boldsymbol \sigma }\!\left ({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} ,\omega \right )\cdot {\langle {{\boldsymbol{E}}}^{\text{tot}}\rangle }^*}\!\left ({{\boldsymbol{x}}}^{\prime} ,\omega \right )}\nonumber \\[4pt]&&=\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\int\nolimits {\textrm{d}^3\textit{x}}\,{\langle {{\boldsymbol{E}}}^{\text{tot}}\rangle }\!\left ({\boldsymbol{x}},\omega \right )\cdot {\boldsymbol \sigma }\!\left ({\boldsymbol{x}},\omega \right )\cdot {\langle {{\boldsymbol{E}}}^{\text{tot}}\rangle }^*\!\left ({\boldsymbol{x}},\omega \right )}. \end{eqnarray}

If the media is isotropic, then can be (2.375) simplified as

(2.376)

\begin{align} \Delta \langle H\rangle &=\int\nolimits ^{\infty }_{-\infty }\frac {\textrm{d}\omega }{2\pi }\int\nolimits {\textrm{d}^3\textit{x}}\,\sigma \!\left ({\boldsymbol{x}},\omega \right ){\vert {{\boldsymbol{E}}}^{\text{tot}}\!\left ({\boldsymbol{x}},\omega \right )\vert }^2\nonumber\\[4pt]&=\int\nolimits ^{\infty }_{-\infty }{\frac {\textrm{d}\omega }{2\pi }\int\nolimits {\textrm{d}^3\textit{x}}\;{\textrm {Re}\;\sigma \!\left ({\boldsymbol{x}},\omega \right ){\vert \langle {{\boldsymbol{E}}}^{\text{tot}}\rangle \!\left ({\boldsymbol{x}},\omega \right )\vert }^2}} \end{align}

because there is no contribution to the integral from Im $\sigma \!\left ({\boldsymbol{x}},\omega \right )$ . We note that

(2.377)

\begin{equation} Re\;\sigma \!\left ({\boldsymbol{x}},\omega \right ){\vert \langle {{\boldsymbol{E}}}^{\text{tot}}\rangle \!\left ({\boldsymbol{x}},\omega \right )\vert }^2 \to \eta {\left ({\boldsymbol{x}},\omega \right )\vert \langle {\boldsymbol{j}}\rangle \!\left ({\boldsymbol{x}},\omega \right )\vert }^2, \end{equation}

which is just the $i^2R$ resistive heating source for the entropy production ( $\Delta S\equiv \beta \Delta \langle H\rangle$ ).

2.6.7. Transport relations and coefficients

What if there is a temperature gradient $\nabla T$ ? Temperature gradients tend to be accompanied by a heat flow (Newton, Maxwell, etc.): ${\boldsymbol{Q}}=-{\textrm {K}}\cdot \nabla T$ , where $\textrm {K}$ is the heat-flow tensor, i.e., the thermal conductivity (Mori Reference Mori1965; Kubo Reference Kubo1966). In the presence of a gradient in flow velocity there can arise a viscous stress: ${\boldsymbol \Pi }=-\mu \nabla {\boldsymbol{u}}$ , where $\mu$ is the viscosity coefficient in this equation. In the presence of a density gradient there can arise a particle-flux density as a result of diffusion, ${\boldsymbol \varGamma }=-{\boldsymbol{D}}\cdot \nabla n$ ; or, more generally, a diffusive flux driven by the gradient in the chemical potential, ${\boldsymbol \varGamma }=-{\boldsymbol{D}}\cdot \nabla \mu$ , where $\mu$ is the chemical potential. The current density is another example of diffusive transport: ${\boldsymbol{j}}=-{\boldsymbol \sigma }\cdot \nabla \phi ={\boldsymbol \sigma }\cdot {\boldsymbol{E}}$ .

Definition: Thermodynamic forces ${\boldsymbol{E}},$ $\nabla {\boldsymbol{u}}{\mathbf ,}\nabla n, \nabla T, \nabla \mu$ all give rise to thermodynamic fluxes.

Example: Onsager showed ${\boldsymbol{Q}}=-{\textrm {K}}\cdot \nabla T$ $\Longleftrightarrow \,{\boldsymbol{j}}={\boldsymbol \sigma }\cdot {\boldsymbol{E}}$ , i.e., in appropriate units these are the same relation.

In this section we present a few derivations of transport equations. An example of a model system that leads to kinetic transport equations is the derivation due to Chapman and Enskog (1911) (see also Reif (Reference Reif1965) and Liboff (Reference Liboff1969)). We derive kinetic transport equations from the linear Boltzmann equation. We also start from the Liouville equation and identify a small parameter, e.g., the magnitude of the gradient relative to the inverse of a characteristic length in the system, to facilitate the derivation of kinetic transport equations; Mori (Reference Mori1965) used this approach.

Consider the linear Boltzmann equation for the probability density:

(2.378)

\begin{equation} \frac {\partial}{\partial t}\rho \!\left ({\boldsymbol{r}},{{{\boldsymbol v}}},t\right )+{{{\boldsymbol v}}}\cdot \frac {\partial}{\partial {\boldsymbol{r}}}\rho \!\left ({\boldsymbol{r}},{{{\boldsymbol v}}},t\right )=-\tilde {\nu }\rho \!\left ({\boldsymbol{r}},{{{\boldsymbol v}}},t\right ), \end{equation}

where ${{{\boldsymbol v}}}={{{v}}}{\boldsymbol \Omega }$ where ${\boldsymbol \Omega }\;\textrm {is a unit vector defining the direction of }{{{\boldsymbol v}}}.$ We define the $\tilde {\nu }$ operator:

(2.379)

\begin{equation} \tilde {\nu }f\!\left ({\boldsymbol \Omega }\right )\equiv \nu f\!\left ({\boldsymbol \Omega }\right )-\int\nolimits {\textrm{d}^3\Omega 'f({{\boldsymbol \Omega }}^{\prime} )\nu ({\boldsymbol \Theta } = ({\boldsymbol \Omega };{{\boldsymbol \Omega }}^{\prime})\textrm {)}}. \end{equation}

For elastic scattering of the particles the speed v remains constant. We drop ${{\boldsymbol v}}$ as an independent variable for constant speed and use $\boldsymbol \Omega$ instead. We Fourier transform

(2.380)

\begin{equation} \int\nolimits {\textrm{d}^3\textit {r}\ e^{-i{\boldsymbol{k}}\cdot {\boldsymbol{r}}}\rho \!\left ({\boldsymbol{r}},{{{\boldsymbol v}}};t\right )=\rho ({\boldsymbol{k}},{{{\boldsymbol v}}};t)} \end{equation}

to obtain

(2.381)

\begin{equation} \textrm {}\frac {\partial}{\partial t}\rho \!\left ({\boldsymbol{k}},{{{\boldsymbol v}}},t\right )+i{\boldsymbol{k}}\cdot {{{\boldsymbol v}}}\ \rho \!\left ({\boldsymbol{k}},{{{\boldsymbol v}}},t\right )=-\tilde {\nu }\rho \!\left ({\boldsymbol{k}},{{{\boldsymbol v}}},t\right ). \end{equation}

Representing the time dependence as $\rho \!\left ({\boldsymbol{k}},{{{\boldsymbol v}}},t\right )=e^{-i\omega t}{\rho }_{\omega }\!\left ({\boldsymbol{k}},{{{\boldsymbol v}}}\right )$ we get

(2.382)

\begin{equation} \textrm {}i\!\left (-\omega +{\boldsymbol{k}}\cdot {{{\boldsymbol v}}}\right )\,{\rho }_{\omega ,k}({\boldsymbol \Omega }\textrm{)}=-\tilde {\nu }{\rho }_{\omega ,k}({\boldsymbol \Omega }\textrm {)}, \end{equation}

with ${{{\boldsymbol v}}}={{{v}}}{\boldsymbol \Omega }$ . With use of (2.379), (2.382) becomes an eigenfunction–eigenvalue problem in time where $\omega$ is complex:

(2.383)

\begin{equation} \qquad\qquad i\!\left (-\omega +{\boldsymbol{k}}\cdot {{{\boldsymbol v}}}\right )\,{\rho }_{\omega ,k}\!\left ({\boldsymbol \Omega }\right )=-\ \nu {\rho }_{\omega ,k}\!\left ({\boldsymbol \Omega }\right )+\int\nolimits {\textrm{d}^3\Omega '{\rho }_{\omega ,k}({{\boldsymbol \Omega }}^{\prime} )\nu ({\boldsymbol \Theta }\textrm {)}}. \end{equation}

Next we simplify by assuming that the scattering is isotropic to obtain $\nu \!\left ({\boldsymbol \Theta }\right ) = \nu \textrm {/4}\pi$ so that

(2.384)

\begin{equation} \int\nolimits {\textrm{d}^3\Omega '{\rho }_{\omega ,k}({{\boldsymbol \Omega }}^{\prime} )\nu ({\boldsymbol \Theta }\textrm {)}}=\int\nolimits {\frac {\textrm{d}\Omega^{\prime}}{4\pi }{\nu \rho }_{\omega ,k}({{\boldsymbol \Omega }}^{\prime} )}=\nu \overline {\rho }. \end{equation}

Hence, (2.383) becomes

(2.385)

\begin{equation} \textrm {}i\!\left (-\omega +{\boldsymbol{k}}\cdot {{{\boldsymbol v}}}\right )\,{\rho }_{\omega ,k}({\boldsymbol \Omega }\textrm{)}=\nu (\overline {\rho }-{\rho }_{\omega ,k}({\boldsymbol \Omega }\textrm {)}). \end{equation}

With ${\boldsymbol{k}}\cdot {{{\boldsymbol v}}} =\textit{k}v\;\textrm{cos}\theta$ , (2.385) is

(2.386a)

\begin{equation} -i\!\left (\omega +i\nu -\textit{k}v\;\textrm {cos}\theta \right )\,{\rho }_{\omega ,k}(\theta ,\phi \textrm {)}=\nu \overline {\rho }. \end{equation}

(2.386b)

\begin{equation} {\rho }_{\omega ,k}(\theta ,\phi \textrm {)}=\frac {i\nu \overline {\rho }}{\omega +i\nu -\textit{k}v\;\textrm{cos}\theta }. \end{equation}

Let $\mu \equiv {\cos \theta }$ and integrate (2.386b ) over $\theta$ and $\phi$ to obtain the average $\overline {\rho }$ :

(2.387)

\begin{equation} \overline {\rho }=i\nu \overline {\rho }\int\nolimits ^1_{-1}{\text{d}\mu \ \frac {1}{\omega +i\nu -\textit{k}{{v}}\mu }}\frac {2\pi }{4\pi }=\frac {i\nu \overline {\rho }}{2}\int\nolimits ^1_{-1}{\text{d}\mu \ \frac {1}{\omega +i\nu -\textit{k}{{v}}\mu }} \end{equation}

We obtain a dispersion relation from (2.387) by dividing out $\overline {\rho }$ from both sides:

(2.388)

\begin{equation} 1=\frac {i\nu }{2}\int\nolimits ^1_{-1}{\text{d}\mu \ \frac {1}{\omega +i\nu -\textit{k}{{v}}\mu }}. \end{equation}

The denominator vanishes at $\omega =-i\nu +\textit{k}{{v}}\mu$ . In the complex $\omega$ plane there are branch points at $\omega =-i\nu \pm \textit{k}{{v}}$ and a branch cut between. We do the integral in (2.388) carefully to obtain the result

(2.389)

\begin{equation} {\text{ ln}\!\left (\frac {k{{{v}}}-(\omega +i\nu )}{-\textit{k}{{v}}-(\omega +i\nu )}\right )}=2i\!\left (\frac {k{{{v}}}}{\nu }\right )\equiv 2ik\ell, \end{equation}

which yields

(2.390)

\begin{equation} \omega =-i\nu \!\left (1-k\ell {\cot k\ell }\right ),\ \ \ k\ell \lt \pi /2, \end{equation}

where $\ell \equiv {{{v}}}/\nu$ is the mean-free path. Equation (2.390) is valid if and only if $k\ell \lt \pi /2$ and there is no solution of (2.388) for $k\ell \gt \pi /2$ . Thus, there is the one solution (2.390) for the dispersion relation given by (2.388). We note that $k\ell {\cot k\ell }$ equals 1 for $k\ell =0$ , decreases as $1-{\!\left (k\ell \right )}^2/3$ for small $k\ell$ , and equals 0 at $k\ell =\pi /2$ . Using (2.390) in (2.386), we obtain

(2.391)

\begin{equation} {\rho }_{\omega ,k}(\theta ,\phi \textrm {)}=\frac {const}{{\cot k\ell }+\textrm {i cos}\theta }, \end{equation}

where the constant in the numerator is a normalization constant. Equation (2.391) reflects that there is only one eigenvalue given in (2.390) as all the other possible eigenvalues are on the branch cut.

We return to (2.386b ) to consider the eigenvalues on the branch cut, i.e., the contributions to the probability density from the frequencies on the branch cut. (2.386b ) has the form $\rho \!\left (y\right )=a/y$ if $y\ne 0.$ If $y=0$ is possible, then we must include $\lambda \delta \!\left (y\right ),$ i.e.,

(2.392)

\begin{equation} \rho \!\left (y\right )={\mathcal P}\!\left (\frac {a}{y}\right )+\lambda \delta (y), \end{equation}

where the first term is the principal value and the second involves a $\delta$ -function. We are reminded of Van Kampen’s analysis of the linearized Vlasov equation in which he obtained singular eigenfunctions.

Equation (2.386b ) becomes

(2.393)

\begin{equation} {\rho }_{\omega ,k}\!\left (\theta ,\phi \right )={\mathcal P}\!\left (\frac {i\nu \overline {\rho }}{\omega +i\nu -\textit{k}v\;\textrm{cos}\theta }\right )+\lambda (\phi )\delta (\omega +i\nu -\textit{k}v\;\textrm{cos }\theta \textrm {)}={\rho }_{\mu \omega }({\boldsymbol \Omega }). \end{equation}

The relation (2.387) becomes

(2.394)

\begin{align} \overline {\rho }&=\int\nolimits ^{2\pi }_0{\frac {\textrm{d}\phi }{2\pi }\lambda \!\left (\phi \right )\int\nolimits ^1_{-1}{\frac {\textrm{d}\mu }{2}}\delta \!\left (\omega +i\nu -\textit{k}{{v}}\mu \right )\textrm {+}}\frac {i\nu \overline {\rho }}{2}\oint {\frac {\textrm{d}\mu }{\omega +i\nu -\textit{k}{{v}}\mu }}\nonumber \\[4pt] &= \frac {\overline {\lambda }}{2\textit{k}{{v}}}+\frac {i\nu \overline {\rho }}{2}\frac {1}{\textit{k}{{v}}}{\text{ ln } \frac {-1-{\mu }_{\omega }}{1-{\mu }_{\omega }},}\; {\mu }_{\omega }\equiv \ \frac {\omega +i\nu }{k{{{v}}}},\ \ -1\lt {\mu }_{\omega }\lt 1. \end{align}

Note that if we substitute ${\mu }_{\omega }\equiv ({\omega +i\nu })/{k{{{v}}}}={\cot \!\left (k\ell \right )}$ consistent with the dispersion relation in (2.390), then the solution to (2.394) dictates that $\overline {\lambda }=0$ ; and (2.390) is recovered.

Let ${\rho }_0\!\left ({\boldsymbol{k}},{{{\boldsymbol v}}}\right )\equiv {1}/(\textrm{cot}\,k\ell + i\mu)$ for the $\omega$ = 0 mode, where $\mu ={\cos \theta }$ and $\ell \equiv {{{v}}}/\nu$ . We transform (2.393) back from the $\omega$ domain to the time domain separating out the $\omega$ = 0 mode contribution:

(2.395)

\begin{equation} \rho \!\left ({\boldsymbol{k}},{{{\boldsymbol v}}};t\right )=e^{-\nu (1-k\ell {\cot k\ell )t}}\frac {a_0({\boldsymbol{k}},{{{v}}})}{{\cot k\ell +i\mu }}+\int\nolimits ^1_{-1}{\text{d}{\mu }_{\omega }e^{-\nu t}e^{-ik{{{v}}}{\mu }_{\omega }t}{\rho }^{{\boldsymbol{k}},{{{\boldsymbol v}}}}_{\mu\omega}({\boldsymbol \Omega })a_{\mu \omega }({\boldsymbol{k}},{{{\boldsymbol v}}})}, \end{equation}

where $a_0({\boldsymbol{k}},{{{v}}}$ ) and $a_{\mu \omega }({\boldsymbol{k}},{{{\boldsymbol v}}}$ ) are Fourier coefficients associated with a normalization constant and initial conditions. (2.395) is the solution of the linearized Boltzmann equation for the probability distribution in phase space. We note that the first term in (2.395) dominates at long time because $1-k\ell \cot k\ell \lt 1$ , so it does not drop off as rapidly as the second term.

For long times tv/ $\ell \gg 1$ , the dominant solution for the probability distribution is then

(2.396)

\begin{equation} \rho \!\left ({\boldsymbol{k}},{{{\boldsymbol v}}};t\right )=e^{-\nu (1-k\ell {\cot k\ell )t}}\ \frac {a_0({\boldsymbol{k}},{{{\boldsymbol v}}})}{{\cot k\ell +i\mu }}. \end{equation}

Next we Fourier transform $\rho \!\left ({\boldsymbol{k}},{{{\boldsymbol v}}};t\right )$ back to $\rho \!\left ({\boldsymbol{r}},{{{\boldsymbol v}}};t\right )$ :

(2.397)

\begin{eqnarray} \rho \!\left ({\boldsymbol{r}},{{{\boldsymbol v}}};t\right )=\int\nolimits _{k\ell \lt \frac {\pi}{2}}{\frac {\text{d}^{\textrm {3}}\textit{k}}{{\!\left (2\pi \right )}^3}e^{i{\boldsymbol{k}}\cdot {\boldsymbol{r}}}}e^{-\nu (1-k\ell {\cot k\ell )t}}\ \frac {a_0\!\left ({\boldsymbol{k}},{{{\boldsymbol v}}}\right )}{{\cot k\ell +i\mu }}\nonumber \\[4pt]\approx \int\nolimits _{k\ell \lt \pi /2}{\frac {\textrm{d}^3\textit{k}}{{\!\left (2\pi \right )}^3}e^{i{\boldsymbol{k}}\cdot {\boldsymbol{r}}}}e^{-\frac {1}{3}\ell {{{v}}}k^2t}\ \frac {a_0({\boldsymbol{k}},{{{v}}})}{{\frac {1}{k\ell } +i\mu }}, \end{eqnarray}

where we have used $k\ell {\cot k\ell }$ $\approx 1-{\!\left (k\ell \right )}^2/3$ for $k\ell \ll 1$ , $a_0\!\left ({\boldsymbol{k}},{{{\boldsymbol v}}}\right )\to a_0({\boldsymbol{k}},{{{v}}})$ cannot depend on the direction of v , v is constant in time, and tv/ $\ell \gg 1.$ We next make use of

(2.398)

\begin{equation} \frac {1}{{\frac {1}{k\ell } +i\mu }}=\frac {k\ell }{1+i\mu k\ell }\approx k\ell \!\left (1-i\frac {{\boldsymbol{k}}\cdot {{{\boldsymbol v}}}}{\nu }\right ) \end{equation}

in (2.397) to obtain

(2.399)

\begin{equation} \rho \!\left ({\boldsymbol{r}},{{{\boldsymbol v}}};t\right )\approx \int\nolimits _{k\ell \lt \pi /2}{\frac {\textrm{d}^3\textit{k}}{{\!\left (2\pi \right )}^3}e^{i{\boldsymbol{k}}\cdot {\boldsymbol{r}}}}e^{-\frac {1}{3}\ell {{{v}}}k^2t}k\ell \!\left (1-i\frac {{\boldsymbol{k}}\cdot {{{\boldsymbol v}}}}{\nu }\right )a_0({\boldsymbol{k}},{{{v}}}). \end{equation}

Now we calculate the average flux density:

(2.400)

\begin{equation} \tilde {{\boldsymbol \varGamma }}\!\left ({\boldsymbol{r}};t\right )=\rho \!\left ({\boldsymbol{r}};t\right )\langle {{{\boldsymbol v}}}\rangle \!\left (r,t\right )=\int\nolimits {\textrm{d}^3{{{v}}}\ \rho \!\left ({\boldsymbol{r}},{{{\boldsymbol v}}};t\right ){{{\boldsymbol v}}}}, \end{equation}

where we note that v = v $\boldsymbol \Omega$ and v is constant. If we take the time derivative of (2.399), we easily obtain

(2.401)

\begin{equation} \frac {\partial \rho }{\partial t}=\frac {\partial}{\partial t}\left [\int\nolimits {\frac {\textrm{d}^3k}{{(2\pi )}^3}e^{i{\boldsymbol{k}}\cdot {\boldsymbol{r}}}}e^{-Dk^2t}\!\left (\dots \right )\right ]=-D\int\nolimits {\frac {\textrm{d}^3k}{{\!\left (2\pi \right )}^3}e^{i{\boldsymbol{k}}\cdot {\boldsymbol{r}}}}e^{-Dk^2t}\!\left (\dots \right )=D{\nabla }^2\rho \end{equation}

in which we identify D = $\ell \textrm {v/3}$ from inside the exponential in (2.399), and $\rho \!\left ({\boldsymbol{r}};t\right )=\int\nolimits {\textrm{d}^3{{{\boldsymbol v}}}\ \rho \!\left ({\boldsymbol{r}},{{{\boldsymbol v}}};t\right )}$ . Given that ${\partial \rho }/{\partial t}=-\nabla \cdot \tilde {{\boldsymbol \varGamma }} =D{\nabla }^2\rho$ , it follows that

(2.402)

\begin{equation} \tilde {{\boldsymbol \varGamma }}=-D\nabla \ \rho \!\left ({\boldsymbol{r}};t\right ). \end{equation}

No knowledge of (…) in (2.401) is required to obtain (2.402).

Example: Anisotropic scattering. In the limit of small $k\ell$ the eigenfunction can be shown to have the same form as the isotropic scattering limit. The other terms that are not retained are different in the anisotropic case, but it does not matter. In the anisotropic scattering case

(2.403)

\begin{equation} \nu \to {\nu }_{tr}\equiv \int\nolimits {\frac {\textrm{d}\Omega }{4\pi }\ \nu \!\left (\Theta \right )\!\left (1-{\cos \theta }\right )=n_0{\sigma }_{tr}{{{v}}}}. \end{equation}

We can generalize the analysis leading to (2.400) and (2.401) to all transport phenomena. Consider any density field: ${\mathcal A}({\boldsymbol{x}}\vert {\boldsymbol \varGamma })$ where $\mathcal A$ is the momentum, energy, current, etc., scalar or vector field and $\boldsymbol\varGamma$ is the total system ‘phase.’

Example: Let $K\equiv \Sigma _i K_i, K_i\equiv {1}/{2}m_i{{{{v}}}}^2_i=p^2_i/2m_i$ . Then the kinetic energy density is

(2.404)

\begin{equation} K({\boldsymbol{x}}\vert {\boldsymbol \varGamma })\equiv \sum\limits _i{K_i\delta ({\boldsymbol{x}}-{{\boldsymbol{x}}}_i)} \end{equation}

and the number density is

(2.405)

\begin{equation} n({\boldsymbol{x}}\vert {\boldsymbol \varGamma })\equiv \sum\limits _i{\delta ({\boldsymbol{x}}-{{\boldsymbol{x}}}_i)}. \end{equation}

We assert that the general conservation law for any density field ${\mathcal A}({\boldsymbol{x}}\vert {\boldsymbol \varGamma })$ is

(2.406)

\begin{equation} \dot {{\mathcal A}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )=-\frac {\partial}{\partial x}\cdot {{\boldsymbol \varGamma }}^{{\mathcal A}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )+{\sum\limits {}}^{{\mathcal A}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right ), \end{equation}

where the first term on the right-hand side is minus the divergence of the flux associated with the density field $\mathcal A$ and the second term is a source/sink term (if finite). Remember from (2.107) that

(2.407)

\begin{equation} \frac {\partial}{\partial t}\rho \!\left ({\boldsymbol\varGamma};t\right )+\left \{\rho ,H\right \}=0, \end{equation}

involving the Poisson bracket of $\rho$ with H the Hamiltonian, as a consequence of Liouville’s theorem in the absence of explicit sources and sinks.

Lemma: Any function A( ${\boldsymbol \varGamma }\textrm {)}$ satisfies

(2.408)

\begin{align} \langle \dot {A}\rangle \equiv \int\nolimits {\textrm{d}\varGamma \ \rho \!\left ({\boldsymbol \varGamma },t\right )\left \{A,H\right \}}&=-\int\nolimits {\textrm{d}\varGamma \ A({\boldsymbol \varGamma })\left \{\rho ,H\right \}=\int\nolimits {\textrm{d}\varGamma \ A({\boldsymbol \varGamma })\frac {\partial}{\partial t}\rho \!\left ({\boldsymbol \varGamma },t\right )}}\nonumber \\[4pt] &=\frac {\textrm{d}}{\textrm{d}t}\int\nolimits {\textrm{d}\varGamma \ A({\boldsymbol \varGamma })\rho \!\left ({\boldsymbol \varGamma },t\right )=\frac {\textrm{d}}{\textrm{d}t}\langle A\rangle }, \end{align}

using (2.107) and integrating by parts.

Hence, from (2.406) and (2.408), the volume integral of the ensemble average of (2.406) is

(2.409)

\begin{equation} \frac {\textrm{d}}{\textrm{d}t}\int\nolimits {\textrm{d}^3x\ \langle {\mathcal A}\rangle }({\boldsymbol{x}}\textrm {,t)}=-\oint {\textrm{d}{\boldsymbol \sigma }\cdot }\langle {{\boldsymbol \varGamma }}^{{\mathcal A}}\rangle ({\boldsymbol{x}}\textrm {,t)}+\int\nolimits {\textrm{d}^3x}\left\langle {\sum\limits {}}^{{\mathcal A}}\right\rangle ({\boldsymbol{x}}\textrm {,t)}. \end{equation}

Example: Returning to our example of the kinetic energy density (2.404), (2.406) becomes

(2.410)

\begin{equation} \dot {K}\!\left (x{\boldsymbol \varGamma }\right )=-\nabla \cdot \sum\limits _i{{{{{\boldsymbol v}}}}_i}\tfrac{1}{2}m_i{{{{v}}}}^2_i\ \delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{x}}}_i\right )+\sum\limits _i{{{{{\boldsymbol v}}}}_i\cdot {{\boldsymbol{f}}}^{\,i}\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{x}}}_i\right )}, \end{equation}

where the second term on the right-hand side of (2.409) is the power due to the work done on the particle by the force ${{\boldsymbol{f}}}^{\,i}$ on it. In the absence of external forces, the force on particle i is just the sum over j of the force of particle j on particle i:

(2.411)

\begin{equation} {{\boldsymbol{f}}}^{\,i}=\sum\limits _{j\ne i}{{{\boldsymbol{f}}}^{\,i}_j=-\sum\limits _{j\ne i}{\frac {\partial}{\partial r_j}\phi (r_{ij})}}, \end{equation}

where $\phi (r_{ij})$ is a potential energy. The first term on the right-hand side of (2.409) is just minus the divergence of the kinetic energy flux.

In the potential introduced in (2.411) where does the potential energy reside, at what location? If the potential energy resides in the particles, then an arbitrary designation is necessary. For example, is the potential energy shared equally by the two particles that define $r_{ij}$ or perhaps can the potential energy be assigned to the midpoint of the two particle locations along $r_{ij}?$ The latter implies

(2.412)

\begin{equation} {{\boldsymbol{r}}}_{i\ or\ j}\equiv {{\boldsymbol{R}}}_{ij}\pm \tfrac{1}{2}{{\boldsymbol{r}}}_{ij}, \end{equation}

which defines ${{\boldsymbol{R}}}_{ij}$ the midpoint between particles i and j. We take the midpoint and write

(2.413)

\begin{equation} \Phi ({\boldsymbol{x}}|{\boldsymbol \varGamma })\equiv \sum\limits _{i\lt j}{\phi (}{{\boldsymbol{r}}}_{ij})\delta ({\boldsymbol{x}}-{{\boldsymbol{R}}}_{ij}) \end{equation}

and

(2.414)

\begin{equation} \dot {\Phi }\!\left ({\boldsymbol{x}}{\boldsymbol \varGamma }\right )\equiv -\nabla \cdot \sum\limits _{i\lt j}{{{\boldsymbol{V}}}_{ij}}\phi ({{\boldsymbol{r}}}_{ij})\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{R}}}_{ij}\right )+\sum\limits _{i\lt j}{\dot {\phi }(}{{\boldsymbol{r}}}_{ij})\delta ({\boldsymbol{x}}-{{\boldsymbol{R}}}_{ij}), \end{equation}

where ${{\boldsymbol{V}}}_{ij}={\dot {{\boldsymbol{r}}}}_{ij}.$ In the absence of any explicit source or sink of particle energy, we expect relations of the form

(2.415)

\begin{equation} {\mathcal E}\!\left ({\boldsymbol{x}}|{\boldsymbol \varGamma }\right )\equiv K+\Phi , \dot {{\mathcal E}}\!\left ({\boldsymbol{x}}|{\boldsymbol \varGamma }\right )\equiv -\nabla \cdot \{\overline {{\boldsymbol{V}}}\left [K+\Phi \right ]\}. \end{equation}

We flesh out (2.415) in the following. In the analysis we symmetrize

(2.416)

\begin{equation} \sum\limits _{ij}{{{\boldsymbol{V}}}_{ij}}{{\boldsymbol{f}}}^{\,i}_j\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i\right )\to \tfrac{1}{2}\sum\limits _{ij}{{\left [{{\boldsymbol{V}}}_i\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i\right )+{{\boldsymbol{V}}}_j\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_j\right )\right ]{{\boldsymbol{f}}}^{\,i}_j}} \end{equation}

and

(2.417a)

\begin{equation} \delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i\right )\approx \delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{R}}}_{ij}\right )-\tfrac{1}{2}{{\boldsymbol{r}}}_{ij}\nabla \cdot \delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{R}}}_{ij}\right )+\dots \ \ \end{equation}

(2.417b)

\begin{equation} \delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_j\right )\approx \delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{R}}}_{ij}\right )+\tfrac{1}{2}{{\boldsymbol{r}}}_{ij}\nabla \cdot \delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{R}}}_{ij}\right )+\dots \end{equation}

(Irving & Kirkwood Reference Irving and Kirkwood1950).

We use (2.410)–(2.417a ) to obtain, after cancellations,

(2.418)

\begin{equation} \dot {{\mathcal E}}=-\nabla \cdot \left [{{\boldsymbol \varGamma }}_K+{{\boldsymbol \varGamma }}_{\Phi }+\sum\limits _{i\lt j}{{{{\boldsymbol{r}}}_{ij}{\boldsymbol{V}}}_{ij}\cdot }{{\boldsymbol{f}}}^{\,i}_j\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{R}}}_{ij}\right )\right ], \end{equation}

where ${{\boldsymbol \varGamma }}_K$ is the kinetic energy flux density [first term on the right-hand side of (2.410)], ${{\boldsymbol \varGamma }}_{\Phi }$ is the potential energy flux density of the moving midpoint [first term on the right-hand side of (2.414)], and the third term on the right-hand side of (2.418) is the work done on the moving midpoint.

Definition: Equation (2.418) is in the form

(2.419)

\begin{equation} \dot {{\mathcal E}}=-\nabla \cdot {\boldsymbol{S}}({\boldsymbol{x}}\vert {\boldsymbol \varGamma }), \end{equation}

where $\boldsymbol{S}$ is the total energy flux density given by the sum of the three terms on the right-hand side of (2.418) and is analogous to the Poynting vector in electromagnetic theory.

Example: Temperature evolution. Given $T\!\left ({\boldsymbol{x}}\right )$ at t = 0, find $({\partial}/{\partial t})T\!\left ({\boldsymbol{x}},t\right ).$ Recall that the temperature is related to the entropy by the relation $\beta =({\partial {\mathcal S}}/{\partial E})$ which can be used to measure T in a very small volume. We note that matter may not flow, but energy can and will. The probability distribution function in phase space consistent with the definition of a local temperature can be expressed as

(2.420)

\begin{equation} \rho ({\boldsymbol \varGamma })\vert _{t=0}\sim e^{-\beta E}\to \frac {1}{Z}e^{-\int\nolimits {\textrm{d}^3{\bf x}\beta \left ({\boldsymbol{x}}\right ){\mathcal E}\!\left ({\bf x}\vert \varGamma \right )}},\quad Z\equiv \int\nolimits {\textrm{d}\varGamma }e^{-\int\nolimits {\textrm{d}^3{\bf x}\beta \!\left ({\bf x}\right ){\mathcal E}\!\left ({\bf x}\vert {\boldsymbol \varGamma }\right )}}. \end{equation}

Following Mori

(2.421)

\begin{equation} Z(t)\equiv e^{-\int\nolimits {\textrm{d}^3\textit{x}\beta \!\left ({\bf x},{\boldsymbol{t}}\right )[\langle {\mathcal E}\rangle \!\left ({\bf x},t\right )-T\!\left ({\bf x},t\right ){\mathcal S}\!\left ({\bf x},t\right )]}}. \end{equation}

Here $\langle {\mathcal E}\rangle ({\boldsymbol{x}},t)$ is not really a function of time. We expand $\rho \!\left ({\boldsymbol \varGamma },t\right )$ around a local thermal equilibrium for which the entropy is a maximum with respect to the internal energy

(2.422)

\begin{equation} \rho \!\left ({\boldsymbol \varGamma },t\right )={\rho }_0\!\left ({\boldsymbol \varGamma },t\right )+\delta \rho \!\left ({\boldsymbol \varGamma },t\right ),\ \ \ \ \ \,{\rho }_0\!\left ({\boldsymbol \varGamma },t\right )=\frac {1}{Z(t)}e^{-\int\nolimits {\textrm{d}^3\textit{x}\beta \left ({\bf x},t\right ){\mathcal E}\left ({\bf x}\vert {\boldsymbol \varGamma }\right ).}}\ \end{equation}

We use the Liouville equation ${\textrm{d}\rho }/{\textrm{d}t}=0,$ then using (2.419) and (2.420), integrate by parts, and use the chain rule ${\textrm{d}}/{\textrm{d}t}$ applied to ${\rho }_0\!\left ({\boldsymbol \varGamma },t\right )$ to obtain

(2.423)

\begin{eqnarray} \frac {\textrm{d}\delta \rho }{\textrm{d}t}=-\frac {\textrm{d}{\rho }_0}{\textrm{d}t}&&={\rho }_0\int\nolimits {\textrm{d}^3\textit{x}\beta \left ({\boldsymbol{x}},t\right )\dot {{\mathcal E}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )}=-{\rho }_0\int\nolimits {\textrm{d}^3\textit{x}\beta \left ({\boldsymbol{x}},t\right )\nabla \cdot {\boldsymbol{S}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )}\nonumber \\[4pt] && ={\rho }_0\int\nolimits {\textrm{d}^3\textit{x}{\boldsymbol{S}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )\cdot \nabla \beta \!\left ({\boldsymbol{x}},t\right )}. \end{eqnarray}

We integrate (2.423) with respect to time noting that only $\boldsymbol{S}$ varies rapidly with time so one can set everything else to its value at $t = 0$ :

(2.424)

\begin{equation} \delta \rho \!\left ({\boldsymbol \varGamma },t\right )\approx {\rho }_0\int\nolimits \textrm{d}^3\textit{x}\int\nolimits ^t_0{\textrm{d}\tau }{\boldsymbol{S}}\!\left ({\boldsymbol{x}}|{\boldsymbol \varGamma }\textrm {,}\tau \right )\cdot \nabla \beta \!\left ({\boldsymbol{x}},0\right ). \end{equation}

Going back to (2.419) we can take the ensemble average $\langle \dot {{\mathcal E}}\rangle \textrm {(}{\boldsymbol{x}}\textrm {,t)}=-\nabla \cdot \langle {\boldsymbol{S}}\rangle \textrm {(}{\boldsymbol{x}}\textrm {,t)}$ , but the average energy flux density is

(2.425)

\begin{align} {\boldsymbol{Q}}&\equiv \langle {\boldsymbol{S}}\rangle \!\left ({\boldsymbol{x}},t\right )\equiv \int\nolimits {\textrm{d}\boldsymbol\varGamma \rho \!\left ({\boldsymbol \varGamma },t\right ){\boldsymbol{S}}\!\left ({\boldsymbol{x}},{\boldsymbol \varGamma }\right )=}\int\nolimits {\textrm{d}\boldsymbol\varGamma \left [{\rho }_0\!\left ({\boldsymbol \varGamma },t\right )+\delta \rho \!\left ({\boldsymbol \varGamma },t\right )\right ]{\boldsymbol{S}}\!\left ({\boldsymbol{x}},{\boldsymbol \varGamma }\right )}\nonumber \\[4pt] &=\int\nolimits {\textrm{d}\boldsymbol\varGamma \delta \rho \!\left ({\boldsymbol \varGamma },t\right ){\boldsymbol{S}}\!\left ({\boldsymbol{x}},{\boldsymbol \varGamma }\right )=}\int\nolimits {\textrm{d}^3\textit{x}\int\nolimits ^t_0{\textrm{d}\tau \int\nolimits {\textrm{d}{\boldsymbol \varGamma }{\rho }_0({\boldsymbol \varGamma }){\boldsymbol{S}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right ){\boldsymbol{S}}\!\left ({{\boldsymbol{x}}}^{\prime} {\boldsymbol \varGamma },\tau \right )\cdot \nabla \beta \!\left ({{\boldsymbol{x}}}^{\prime} ,{\boldsymbol{x}}\right )}}}\nonumber \\[4pt]&= \int\nolimits {\textrm{d}^3\textit{x}'\int\nolimits ^t_0{\textrm{d}\tau {{\boldsymbol{C}}}^S({\boldsymbol{x}},{{\boldsymbol{x}}}^{\prime} ;\tau )\cdot (-{\beta }^2\nabla T\!\left ({\boldsymbol{x}},t\right ))}}, \end{align}

where we have used that S is odd in $\boldsymbol \varGamma$ while ${\rho }_0$ is even, have assumed that $t\gg {1}/{{\nu }_{\mathrm{coll}}},$ have let ${\boldsymbol{x}}\to {\boldsymbol{x}}'$ so no macroscopic correlations for large |x -x ^′|, and used the analysis and notation in § 2.6.5 The result in (2.425) can be rewritten as (Mori Reference Mori1965; Kubo Reference Kubo1966)

(2.426a)

\begin{equation} {\boldsymbol{Q}}=-{\boldsymbol{K}}\cdot \nabla T \end{equation}

with the thermal conduction $\boldsymbol{K}$ defined by (Mori Reference Mori1965)

(2.426b)

\begin{equation} {\boldsymbol{K}}{\mathbf (}{\boldsymbol{x}},t\textrm {)}\equiv {\beta }^2\int\nolimits {\textrm{d}^3\textrm {s}'\int\nolimits ^{\infty }_0{\textrm{d}\tau {{\boldsymbol{C}}}^S(\vert {{\boldsymbol{x}}}^{\prime} -{\boldsymbol{x}}\vert ;{{\boldsymbol{x}}};t)}}e^{-i{\boldsymbol{k}}\cdot {{\boldsymbol{s}}}^{\prime} +i\omega \tau } = {\beta }^2{{\boldsymbol{C}}}^S\!\left ({\boldsymbol{x}},\omega =0,{\boldsymbol{k}}=0;t\right )\!, \end{equation}

where ${{\boldsymbol{s}}}^{\prime} ={{\boldsymbol{x}}}^{\prime} -{\boldsymbol{x}}$ , and $k\equiv \omega \equiv 0$ in the complex exponential.

[Editor’s Note: The inclusion of the complex exponential in (2.426b ) is artificial, because with $k\equiv \omega \equiv 0$ the complex exponential is equal to unity. The presence of the complex exponential foreshadows the Fourier transform introduced subsequently in (2.428b ) and (2.429).]

From time reversibility, there is an Onsager symmetry: $\boldsymbol{K}$ is symmetric, $K_{xy} = K_{yx}$ . To recap, the results in (2.425) and (2.426a )

(2.427)

\begin{equation} {\boldsymbol{Q}}\equiv \langle {\boldsymbol{S}}\rangle \!\left ({\boldsymbol{x}},t\right )=-{\boldsymbol{K}}{\mathbf (}{\boldsymbol{x}},t\textrm {)}\cdot \nabla T({\boldsymbol{x}},t) \end{equation}

and

(2.428)

\begin{equation} {\boldsymbol{K}}\!\left ({\boldsymbol{x}},t\right )={\beta }^2\int\nolimits ^{\infty }_0{\textrm{d}\tau \int\nolimits {\textrm{d}^3s\langle {\boldsymbol{S}}\!\left ({\boldsymbol{x}},t\right ){\boldsymbol{S}}({\boldsymbol{x}}-\boldsymbol{s};t-\tau )\rangle }}. \end{equation}

due to Mori. Here ${\boldsymbol{S}}\!\left ({\boldsymbol{x}},t\right )$ is the microscopic heat flux in the absence of $\nabla T$ , whereas $\langle {\boldsymbol{S}}\rangle \!\left ({\boldsymbol{x}},t\right )$ is the macroscopic heat flux. The correlation ${{\boldsymbol{C}}}^S$ does not fall off exponentially; it only obeys a power law, and convergence is marginally obtained. The process is not Markov which puts the Onsager approach in trouble.

[Editor’s Note: Professor Kaufman’s remarks about the fall off of ${{\boldsymbol{C}}}^S$ were not explained nor was a reference provided. However, these remarks have no bearing on the subsequent analysis.]

Suppose we insert the exponential phase factor $e^{i\omega \tau -i{\boldsymbol{k}}\cdot s}$ inside the two integrals on the right-hand side of (2.428) which then yields the Fourier transform ${\boldsymbol{K}}{\mathbf (}{\boldsymbol{k}},\omega )$ and

(2.429)

\begin{equation} \langle {\boldsymbol{S}}\rangle ({\boldsymbol{k}},\omega )=-{\boldsymbol{K}}{\mathbf (}{\boldsymbol{k}},\omega \textrm {)}\cdot (\nabla T)({\boldsymbol{k}},\omega ), \end{equation}

which is good for a stationary, uniform medium. From (2.427) and (2.419)

(2.430)

\begin{eqnarray} \langle {\boldsymbol{S}}\rangle \!\left ({\boldsymbol{x}},t\right )=-{\boldsymbol{K}}\cdot \nabla T\!\left ({\boldsymbol{x}},t\right ),\quad \langle \dot {{\mathcal E}}\rangle \!\left ({\boldsymbol{x}}\textrm {,t}\right )=-\nabla \cdot \langle {\boldsymbol{S}}\rangle \!\left ({\boldsymbol{x}}\textrm {,t}\right )\ \nonumber \\[4pt] \to \frac {\partial \langle {\mathcal E}\rangle }{\partial t}\!\left ({\boldsymbol{x}}\textrm {,t}\right ) = \nabla \cdot \!\left ({\boldsymbol{K}}\cdot \nabla T\right )=\frac {\partial \langle {\mathcal E}\rangle }{\partial T}\vert _V\frac {\partial T}{\partial t}\!\left ({\boldsymbol{x}},t\right )=K{\nabla }^2T, \end{eqnarray}

assuming $\nabla K=0$ and isotropy.

Definition: $C_V\equiv {\partial \langle {\mathcal E}\rangle }/{\partial T}\vert _V$ and $D_T\equiv {K}/{C_V}\sim \ \ell \overline {{{{v}}}}$ (Reif Reference Reif1965; Liboff Reference Liboff1969).

From the definitions of the heat capacity $C_V$ and $D_T$ , and (2.430), we obtain the diffusion equation for the temperature:

(2.431)

\begin{equation} \frac {\partial T}{\partial t}=D_T{\nabla }^2T. \end{equation}

Example: Momentum density evolution. Consider the evolution of the momentum density in a one-species system with point particles and central forces. We define the momentum density as

(2.432)

\begin{equation} {\boldsymbol{g}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )\equiv \sum\limits _i{{{\boldsymbol{p}}}_i\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i\right )},\ \ \,{{\boldsymbol{p}}}_i\equiv m{{{{\boldsymbol v}}}}_i\, \end{equation}

with evolution equation

(2.433)

\begin{equation} \dot {{\boldsymbol{g}}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )=-\nabla \cdot {{\boldsymbol \Pi }}^K+{\boldsymbol{F}}, \end{equation}

where the force density F is defined by the sum of forces on the particles

(2.434)

\begin{equation} {\boldsymbol{F}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )\equiv \sum\limits _i{{{\boldsymbol{f}}}^{\,i}}\ \delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i\right ) \end{equation}

and the momentum flux density ${{\boldsymbol \Pi }}^K$ is defined by

(2.435)

\begin{equation} {{\boldsymbol \Pi }}^K\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )=\sum\limits _i{{{{{\boldsymbol v}}}}_i}{{\boldsymbol{p}}}_i\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i\right ). \end{equation}

The force F can be related to an interaction stress tensor ${{\boldsymbol \Pi }}^F$ using (2.434):

(2.436a)

\begin{equation} {\boldsymbol{F}}\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )=-\nabla \cdot {{\boldsymbol \Pi }}^F, \end{equation}

(2.436b)

\begin{equation} {{\boldsymbol \Pi }}^F\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )=\sum\limits _{i,j}{{{\boldsymbol{r}}}_{ij}{{\boldsymbol{f}}}^{\,i}_j}\tfrac{1}{2}\ \delta ({\boldsymbol{x}}-{{\boldsymbol{r}}}_{ij}). \end{equation}

We define a total stress tensor as follows

(2.437)

\begin{equation} {{\boldsymbol \Pi }}\equiv \,{{\boldsymbol \Pi }}^K+{{\boldsymbol \Pi }}^F{.} \end{equation}

The evolution equation (2.433) then becomes

(2.438)

\begin{equation} \dot {{\boldsymbol{g}}}=-\nabla \cdot {\boldsymbol \Pi } \end{equation}

and from (2.419) we have $\dot {{\mathcal E}}=-\nabla \cdot {\boldsymbol{S}}\textrm {(}{\boldsymbol{x}}\vert {\boldsymbol \varGamma }$ ). In addition to the momentum and energy relations we also include conservation of particle number density as expressed in the continuity equation:

(2.439a)

\begin{equation} \dot {\textrm {n}}=-\nabla \cdot \tilde {{\boldsymbol \varGamma }}, \end{equation}

where the flux density $\tilde {{\boldsymbol \varGamma }}$ is defined by

(2.439b)

\begin{equation} \tilde {{\boldsymbol \varGamma }}\!\left ({\boldsymbol{x}},t\right )=\sum\limits _i{{{{{\boldsymbol v}}}}_i}\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i\right ). \end{equation}

From (2.439a ) and the definitions it follows that the fluid flow velocity is given by

(2.440)

\begin{equation} {\boldsymbol{u}}\!\left ({\boldsymbol{x}},t\right )\equiv \frac {\langle \tilde {{\boldsymbol \varGamma }}\rangle \!\left ({\boldsymbol{x}},t\right )}{\langle n\rangle \!\left ({\boldsymbol{x}},t\right )} \end{equation}

and the continuity equation for the macroscopic fluid quantities can then be expressed as

(2.441)

\begin{equation} \frac {\partial \langle n\rangle }{\partial t}\!\left ({\boldsymbol{x}},t\right )=-\nabla \cdot \langle \tilde {{\boldsymbol \varGamma }}\rangle =-\nabla \cdot \!\left (\langle n\rangle {\boldsymbol{u}}\right ). \end{equation}

From (2.432)–(2.441) the fluid equation of motion is obtained:

(2.442)

\begin{equation} m\langle n\rangle \!\left (\frac {\partial}{\partial t}+{\boldsymbol{u}}\cdot \nabla \right ){\boldsymbol{u}}\!\left ({\boldsymbol{x}},t\right )=-\nabla \cdot {\boldsymbol{P}}, \end{equation}

where the pressure tensor ${\boldsymbol{P}}$ is

(2.443)

\begin{equation} {\boldsymbol{P}}=\left\langle {\boldsymbol \Pi }\right\rangle -m\langle n\rangle {\boldsymbol{u}u}={{\boldsymbol{P}}}^K+{{\boldsymbol{P}}}^F \end{equation}

with the kinetic stress tensor ${{\boldsymbol{P}}}^K$

(2.444)

\begin{equation} {{\boldsymbol{P}}}^K\equiv \sum\limits _i{m({{{{\boldsymbol v}}}}_i-{{\boldsymbol{u}}}_i)({{{{\boldsymbol v}}}}_i-{{\boldsymbol{u}}}_i)}\delta \!\left ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i\right ) \end{equation}

and the interaction stress tensor ${{\boldsymbol{P}}}^F$

(2.445)

\begin{equation} {{\boldsymbol{P}}}^F=\left\langle {{\boldsymbol \Pi }}^{{\boldsymbol{F}}}\right\rangle . \end{equation}

Exercise: Fill in the steps in deriving (2.443)–(2.445).

In thermal equilibrium for an isotropic medium, the pressure tensor P can be simplified, $\boldsymbol{P}=\textrm{P}{\boldsymbol{I}}$ :

(2.446)

\begin{equation} {{\boldsymbol{P}}}^K=\langle n\rangle T{\boldsymbol{I}},\quad {{\boldsymbol{P}}}^F=-\frac {2\pi }{3}{\langle n\rangle }^2\int\nolimits ^{\infty }_0{s^3\text{d}s}\frac {\textrm{d}\phi }{\text{d}s}g_2\!\left (s\right ){\boldsymbol{I}}, \end{equation}

where $g_2\!\left (s\right )$ is the two-particle correlation function (see § 1.2.1).

In a nonequilibrium system one has

(2.447)

\begin{equation} {\boldsymbol{P}} ={\boldsymbol{I}}\textrm {P}\!\left (\langle n\rangle \!\left ({\boldsymbol{x}},t\right ),T\!\left ({\boldsymbol{x}},t\right )\right )+{{\boldsymbol{P}}}^{\text{visc}}({\boldsymbol{x}},t), \end{equation}

if one can define a local temperature. In the nonequilibrium system (2.442) becomes

(2.448)

Using group theory or Mori’s approach, one can show for an isotropic fluid:

(2.449)

\begin{equation} {{\boldsymbol{P}}}^{\text{visc}}=-\zeta {\boldsymbol{I}}\nabla \cdot {\boldsymbol{u}}-\mu \!\left (\boldsymbol\nabla {\boldsymbol{u}}\right )', \end{equation}

where $\zeta$ is the bulk viscosity, $\mu$ or $\eta$ is the shear viscosity, and we define

(2.450)

\begin{equation} {\!\left (\boldsymbol\nabla {\boldsymbol{u}}\right )}^{\prime} \equiv \nabla {\boldsymbol{u}}+{\boldsymbol{u}}\mathop {\nabla }^{\leftharpoonup }-\frac {2}{3}\nabla \cdot {\boldsymbol{u}}\equiv \textrm {2}{\textrm {(}\nabla {\boldsymbol{u}})}^{\text{shear}} \end{equation}

associated with the shear and note that ${\!\left (\nabla {\boldsymbol{u}}\right )}^{\prime}$ is traceless. The shear stress is associated with a change in shape due to a change in volume or a rotation. The bulk stress is associated with a change volume without a change in shape.

[Editor’s Note: We assume that Professor Kaufman deemed the detailed consideration of the viscous stress tensor was more appropriate for lectures on fluid mechanics and did not have the time to take it up in detail here. A good reference for the viscous stress tensor is Landau & Lifshitz (Reference Landau and Lifshitz1987).]

Example: ${\!\left (\nabla {\boldsymbol{u}}\right )}^{\prime}$ in Cartesian two dimensions is ${\!\left (\nabla {\boldsymbol{u}}\right )}^{\prime} =({\partial}/{\partial x})u_y\hat {{\boldsymbol{y}}}+({\partial}/{\partial y})u_x\hat {{\boldsymbol{x}}}$ .

Now we pause and make some order of magnitude estimates of the transport coefficients in which we relate them to other quantities:

thermal conduction $K\sim ({\langle {{{\boldsymbol v}}}\rangle }/{{\sigma }_{\mathrm{coll}}})\sim nD$ where the diffusion coefficient $D\sim \ell \langle {{{\boldsymbol v}}}\rangle$ ,

shear viscosity $\mu \sim ({P}/{\nu })\sim ({T}/{\sigma \langle {{{\boldsymbol v}}}\rangle })\sim ({m\langle {{{\boldsymbol v}}}\rangle }/{\sigma })\sim mK$ ,

bulk viscosity $\zeta \sim \left \{ \begin{array}{l@{\quad}l} 0& \!\!\textrm {for a dilute gas of particles,}\\[4pt] {P}/{{\nu }_{\text{relax}}}& \!\!\textrm {for a dense gas or liquid with internal degrees of}\\[4pt] & \!\!\textrm{freedom.} \end{array} \right .$

Note: The bulk viscosity coefficient for a dense gas or liquid depends on the degrees of freedom (Landau & Lifshitz Reference Landau and Lifshitz1987).

We return to (2.442) rewritten as

(2.451)

\begin{equation} m\langle n\rangle \frac {D}{Dt}{\boldsymbol{u}}\!\left ({\boldsymbol{x}},t\right )=-\nabla P\!\left (n,T\right )-\nabla \cdot \left [-\zeta \!\left (n,T\right ){\boldsymbol{I}}\nabla \cdot {\boldsymbol{u}}-2\mu (n,T){\left (\nabla {\boldsymbol{u}}\right )}^{\text{shear}}\right ], \end{equation}

where ${D}/{Dt}\equiv {\partial}/{\partial t}+{\boldsymbol{u}}\cdot \nabla$ . From $\dot {{\mathcal E}}=-\nabla \cdot {\boldsymbol{S}}\textrm {(}{\boldsymbol{x}}\vert {\boldsymbol \varGamma }$ ), (2.419), and the definition of the specific internal energy ${{\mathcal U}}_m\equiv (\text{energy}/\text{mass})$ ,

(2.452)

\begin{equation} m\langle n\rangle {{\mathcal U}}_m\equiv \langle {\mathcal E}\rangle -\tfrac{1}{2}m\langle n\rangle {\vert {\boldsymbol{u}}\vert }^2, \end{equation}

which is just the energy in the moving frame. The internal heat flow is the heat flow in the moving frame of the flow, i.e., the internal heat flow is the heat flow in the laboratory frame with flow terms subtracted off:

(2.453)

\begin{equation} {{\boldsymbol{Q}}}^{\boldsymbol{int}}=\langle {\boldsymbol{S}}\rangle -{\boldsymbol{u}}\langle {\mathcal E}\rangle -{\boldsymbol{P}}\cdot {\boldsymbol{u}}=-K\nabla T. \end{equation}

It then follows that

(2.454)

\begin{equation} m\langle n\rangle \frac {D}{Dt}{{\mathcal U}}_m=-\nabla \cdot {{\boldsymbol{Q}}}^{{\boldsymbol{i}nt}}-{\boldsymbol{P}}:\nabla {\boldsymbol{u}} \end{equation}

and

(2.455)

\begin{equation} -{\boldsymbol{P}}:\nabla {\boldsymbol{u}}=-P\nabla \cdot {\boldsymbol{u}}+\zeta {\left (\nabla \cdot {\boldsymbol{u}}\right )}^2+2\mu {\left (\nabla {\boldsymbol{u}}\right )}^{\text{shear}}:{\!\left (\nabla {\boldsymbol{u}}\right )}^{\text{shear}}, \end{equation}

where the first term on the right-hand side of (2.455) is the adiabatic cooling or heating, the second term is the heating due to bulk viscosity, and the third term is the heating due to shear viscosity (’:’ denotes the sum of squares of all components, $\nabla u_{ij}\nabla u_{ij}).$

The specific entropy is ${{\mathcal S}}_m\!\left ({{\mathcal U}}_m,V_m\right )$ where $V_m\equiv {1}/{\rho }=({1}/{m\langle n\rangle })$ satisfies an equation

(2.456)

\begin{equation} \text{d}{{\mathcal S}}_m=\beta \text{d}{{\mathcal U}}_m+\beta P\textrm{d}V_m \end{equation}

from which it follows that

(2.457)

\begin{equation} \frac {D}{Dt}{{\mathcal S}}_m=\beta \frac {D}{Dt}{{\mathcal U}}_m-\frac {\beta P}{m{\langle n\rangle }^2}\frac {D}{Dt}\langle n\rangle \end{equation}

and with use of (2.453) and (2.454),

(2.458)

\begin{equation} \frac {D}{Dt}{{\mathcal S}}_m=-\beta \nabla \cdot {\boldsymbol{Q}}-\beta {{\boldsymbol{P}}}^{\text{visc}}:\nabla {\boldsymbol{u}}, \end{equation}

where A :B = A ${}_{ij}$ B ${}_{ij}$ . Defining the entropy density ${{\mathcal S}}_V\equiv m\langle n\rangle {{\mathcal S}}_m$ one can show

(2.459)

\begin{equation} \frac {\partial}{\partial t}{{\mathcal S}}_V\!\left ({\boldsymbol{x}},t\right )=-\nabla \cdot \!\left ({\boldsymbol{u}}{{\mathcal S}}_V+\beta {\boldsymbol{Q}}\right )+{\dot {{\mathcal S}}}_V, \end{equation}

where

(2.460)

\begin{equation} {{\boldsymbol \varGamma }}^{{\mathcal S}}\equiv {\boldsymbol{u}}{{\mathcal S}}_V+\beta {\boldsymbol{Q}}\quad \textrm{and}\quad {\dot {{\mathcal S}}}_V={\beta }^2K{\!\left (\nabla T\right )}^2+\beta \zeta {\!\left (\nabla \cdot \boldsymbol{u}\right )}^2+2\beta \mu {\!\left ({\vert \nabla \boldsymbol{u}\vert }^{\text{shear}}\right )}^2{.} \end{equation}

To be consistent with the second law of thermodynamics the source term ${\dot {{\mathcal S}}}_V\ge 0$ ; thus, $\zeta , \mu ,$ and K are all nonnegative. Necessarily $\beta \gt 0$ in this classical theory.

2.6.8. Normal mode solutions of the transport equations

We next analyze the linear normal modes supported by the transport equations. For this purpose we assume that the system is uniform and isotropic in space stretching to infinity. We assume infinitesimal-amplitude perturbations and examine both microscopic and macroscopic modes. Perturbed amplitudes will all have the form $A\!\left (x,t\right )=\tilde {A}e^{i{\boldsymbol{k}}\cdot {\boldsymbol{x}}-i\omega t)}$ .

Definitions: Polarizations. Longitudinal modes have $\boldsymbol{k}$ and velocity perturbation $\boldsymbol{u}$ parallel to one another. Moreover, a purely longitudinal mode is irrotational (curl free) and has a divergence. A compressional wave is a longitudinal wave and has a finite density perturbation. A transverse wave has $\boldsymbol{k}$ and velocity perturbation $\boldsymbol{u}$ perpendicular to one another. A purely transverse wave has a finite curl and is divergence free. In a uniform, isotropic medium there is no coupling between longitudinal and transverse waves.

In the macroscopic theory all equations are for mean values (ensemble averages have removed random fluctuations). An example of a simple linear equations set is as follows.

(2.461)

\begin{equation} {\boldsymbol{g}}=mn_0{\boldsymbol{u}}={\rho }_0{\boldsymbol{u}},\quad {{\boldsymbol{u}}}_0=0\quad \frac {\partial {\boldsymbol{g}}}{\partial t}=-\nabla \cdot {\boldsymbol \delta }{\boldsymbol \Pi },\quad {\boldsymbol \Pi }={\boldsymbol\Pi}_{0}+{\boldsymbol \delta }{\boldsymbol \Pi }. \end{equation}

After Fourier analyzing in time and space, (2.461) becomes

(2.462)

\begin{equation} -i\omega {\boldsymbol{g}}=-i{\boldsymbol{k}}\cdot {\boldsymbol \delta }{\boldsymbol \Pi }{\boldsymbol \ }\to {\boldsymbol \ }\omega {\rho }_0{\boldsymbol{u}}={\boldsymbol{k}}\cdot {\boldsymbol \delta }{\boldsymbol \Pi }\to {\boldsymbol \ }\omega {\rho }_0{\boldsymbol{u}} ={\boldsymbol{k}}\cdot {\boldsymbol \delta }{\boldsymbol{P}}, \end{equation}

using ${\boldsymbol{P}}={\boldsymbol \Pi }-\rho {\boldsymbol{uu}}\approx P{\boldsymbol{I}}+{{\boldsymbol{P}}}^{\text{visc}}$ because $\rho {\boldsymbol{uu}}$ is higher order.

Example: Shear mode. A shear mode is a transverse wave, for which ${\boldsymbol{k}}\cdot$ all of the perturbed quantities in (2.462) vanish. However, ${\boldsymbol{k}}\times$ on (2.462) yields

(2.463)

\begin{equation} \omega {\rho }_0{\boldsymbol{k}} \times {\boldsymbol{u}} ={\boldsymbol{k}}{\boldsymbol \times }{\boldsymbol \delta }{{\boldsymbol{P}}}^{\text{visc}}\cdot {\boldsymbol{k}}. \end{equation}

However, from (2.449) $\delta {{\boldsymbol{P}}}^{\text{visc}}=-\zeta {\boldsymbol{I}}\nabla \cdot {\boldsymbol{u}}-\mu \!\left (\nabla {\boldsymbol{u}}\right )'$ . The bulk viscosity term does not contribute because ${\boldsymbol{k}}{ \times }{\boldsymbol{k}}=0$ , which leaves the shear viscosity term:

(2.464)

\begin{equation} \omega {\rho }_0{\boldsymbol{k}} \times {\boldsymbol{u}} =-i{\mu }_0{\boldsymbol{k}} \times {\boldsymbol{u}}k^2{.} \end{equation}

Hence, the dispersion relation for the shear wave is

(2.465)

\begin{equation} \omega =-i\frac {{\mu }_0}{{\rho }_0}k^2=-iD_{sh}k^2{,} \end{equation}

where we have introduced $D_{sh}=({{\mu }_0}/{{\rho }_0})$ which is called the kinematic viscosity and has units of spatial diffusivity. Thus, the shear mode just decays. The vorticity $\nabla \times {\boldsymbol{u}} ={\boldsymbol \Omega }$ is a shear mode which just decays in a liquid: $({\partial {\boldsymbol \Omega }}/{\partial t})=D_{sh}{\nabla }^2{\boldsymbol \Omega }$ (Helmholtz). In a solid, vorticity may propagate. Note that the transport coefficients here have been assumed to be frequency independent. The correction for shear viscosity that has dependence on frequency and wavenumber is ${\mu }_0\!\left ({\boldsymbol{k}},\omega \right )\to {\mu }_0'\!\left ({\boldsymbol{k}},\omega \right )+i{\mu }^{\prime\prime}_0({\boldsymbol{k}},\omega )$ , which will allow $Re\ \omega \ne 0$ ; and then the shear mode may be able to propagate as well as just damp out.

Example: Compressional wave – To analyze the compressional wave we take the dot product of (2.462) with $\boldsymbol{k}$

(2.466)

\begin{align} \omega {\rho }_0{\boldsymbol{u}}\cdot {\boldsymbol{k}}&={\boldsymbol{k}}\cdot {\boldsymbol \delta }{\boldsymbol \prod }\cdot {\boldsymbol{k}} =k^2\delta P+{\boldsymbol{k}}\cdot \bigg[-\zeta {\boldsymbol{I}}i{\boldsymbol{k}}\cdot {\boldsymbol{u}}-\mu \bigg(i{\boldsymbol{ku}}+i{\boldsymbol{uk}}-\frac {2}{3}{\boldsymbol{I}}i{\boldsymbol{ku}}\bigg)\bigg]\cdot {\boldsymbol{k}}\nonumber\\[4pt]&=k^2\!\left (\delta P-i{\boldsymbol{k}}\cdot {\boldsymbol{u}}\!\left(\zeta +\tfrac {4}{3}\mu \right)\right ), \end{align}

which has the solution

(2.467)

\begin{equation} {\boldsymbol{k}}\cdot {\boldsymbol{u}} =\frac {k^2\delta P}{\omega {\rho }_0+ik^2(\zeta +\frac {4}{3}\mu )}. \end{equation}

Definition: Let $D_{\zeta }\equiv \frac {(\zeta +4\mu/3)}{{\rho }_0}$ .

Hence,

(2.468)

\begin{equation} {\boldsymbol{k}}\cdot {\boldsymbol{u}}\omega {\rho }_0 =\frac {k^2\delta P}{\textrm {1}+i\frac {k^2}{\omega }D_{\zeta }}. \end{equation}

From the linearized continuity equation to lowest order:

(2.469)

\begin{equation} -\frac {\partial \rho }{\partial t}=\nabla \cdot \!\left (\rho {\boldsymbol{u}}\right )=\rho \nabla \cdot {\boldsymbol{u}}+{\boldsymbol{u}}\cdot \nabla \rho ={\rho }_0\nabla \cdot {\boldsymbol{u}}\to {\boldsymbol \ }i\omega \delta \rho ={\rho }_0i{\boldsymbol{k}}\cdot {\boldsymbol{u}}. \end{equation}

Combining (2.468) and (2.469,)

(2.470)

\begin{equation} {\boldsymbol{k}}\cdot {\boldsymbol{u}} =\frac {\omega \delta \rho }{{\rho }_0} =\frac {\delta P}{\omega {\rho }_0}\frac {k^2}{\textrm {1}+i\frac {k^2}{\omega }D_{\zeta }} \to \frac {\delta P}{\delta \rho }=\frac {{\omega }^2}{k^2}\!\left (\textrm {1}+i\frac {k^2}{\omega }D_{\zeta }\right ). \end{equation}

From the entropy equation (2.458) $({D}/{Dt}){{\mathcal S}}_m=-\beta \nabla \cdot {\boldsymbol{Q}}-\beta {{\boldsymbol{P}}}^{\text{visc}}:\nabla {\boldsymbol{u}}$ and from (2.453) ${\boldsymbol{Q}}=-K\nabla T$ . We linearize and keep only lowest-order terms:

(2.471)

\begin{equation} {\rho }_0\!\left (-i\omega \right )\delta {{\mathcal S}}_m=-{\beta }_0i{\boldsymbol{k}}\cdot \!\left (-K\right )i{\boldsymbol{k}}\delta T=-k^2K\frac {\delta T}{T_0}, \end{equation}

from which we have

(2.472)

\begin{equation} \delta {{\mathcal S}}_m=\frac {k^2K}{i\omega {\rho }_0}\frac {\delta T}{T_0}=\left (\frac {k^2D_T}{i\omega }\right )\frac {C_V\delta T}{T_0}, \end{equation}

where we introduce the definition $D_T\equiv ({K}/{C_V{\rho }_0}).$ Next we supplement (2.471) with the basic thermodynamic relation

(2.473)

\begin{equation} {{\mathcal S}}_m\!\left (\rho ,T\right ) \to \delta {{\mathcal S}}_m={\frac {\partial {{\mathcal S}}_m}{\partial \rho }\bigg\vert }_T\delta \rho +{\frac {\partial {{\mathcal S}}_m}{\partial T}\bigg\vert }_{\rho }\delta T, \end{equation}

where ${{\mathcal S}}_m\equiv -\partial F(\rho ,T)/\partial T$ and $\textrm{d}F\!\left (\rho ,T\right )=-{{\mathcal S}}_m\text{d}T-P\textrm{d}\rho /{\rho }^2$ , i.e., $P\!\left (\rho ,T\right )\equiv -{\rho }^2\partial F\!\left (\rho ,T\right )/\partial \rho$ . In addition, the specific heat at constant volume is defined as $C_V\equiv T{\!\left (\partial {{\mathcal S}}_m/\partial T\right )}_{\rho }$ so that (2.473) yields

(2.474)

\begin{equation} \delta {{\mathcal S}}_m\!\left (\rho ,T\right )={\frac {\partial {{\mathcal S}}_m}{\partial \rho }\bigg\vert }_T\delta \rho +\frac {C_V\delta T}{T_0} \end{equation}

and by equating expressions for $\delta {{\mathcal S}}_m$ we obtain

(2.475)

\begin{equation} {\frac {\partial {{\mathcal S}}_m}{\partial \rho }\bigg\vert }_T\delta \rho =-\!\left (1-\frac {k^2D_T}{i\omega }\right )\frac {C_V\delta T}{T_0}, \end{equation}

which implies

(2.476)

\begin{equation} \delta {{\mathcal S}}_m=-\frac {k^2D_T}{i\omega }{\!\left (1-\frac {k^2D_T}{i\omega }\right )}^{-1}{\frac {\partial {{\mathcal S}}_m}{\partial \rho }\bigg\vert }_T\delta \rho . \end{equation}

From the equation of state for $P\!\left ({{\mathcal S}}_m,\rho \right )$

(2.477)

\begin{equation} \delta P={\frac {\partial P}{\partial {{\mathcal S}}_m}\bigg\vert }_{\rho }\delta {{\mathcal S}}_m+{\frac {\partial P}{\partial \rho }\bigg\vert }_{{{\mathcal S}}_m}\delta \rho , \end{equation}

which now yields

(2.478)

\begin{equation} \frac {\delta P}{\delta \rho }=C^2_s\left [1-{\frac {\partial \rho }{\partial P}\bigg\vert }_{{{\mathcal S}}_m}{\frac {\partial P}{\partial {{\mathcal S}}_m}\bigg\vert }_{\rho }{\frac {\partial {{\mathcal S}}_m}{\partial \rho }\bigg\vert }_T\frac {k^2}{i\omega }D_T{\!\left (1-\frac {k^2}{i\omega }D_T\right )}^{-1}\right ], \end{equation}

where we have introduced the square of the sound speed $C^2_s\equiv {{\partial P}/{\partial \rho }\vert }_{{{\mathcal S}}_m}.$ Lastly, using the Maxwell identity (A.4) shown in the Appendix (Editor’s Addendum), (2.478) becomes

(2.479)

\begin{equation} \frac {\delta P}{\delta \rho }=C^2_s\bigg [1+\bigg(1-\frac {1}{\gamma }\bigg)\frac {k^2}{i\omega }D_T{\bigg (1-\frac {k^2}{i\omega }D_T\bigg)}^{-1}\bigg], \end{equation}

where $\gamma \equiv C_p/C_V$ .

Equating the expressions for ${\delta P}/{\delta \rho }$ in (2.470) and (2.479) one obtains the dispersion relation for compressional waves:

(2.480)

\begin{equation} \frac {{\omega }^2}{k^2}\!\left (\textrm {1}+i\frac {k^2}{\omega }D_{\zeta }\right )=C^2\bigg[1+\bigg (1-\frac {1}{\gamma }\bigg )\frac {k^2}{i\omega }D_T{\bigg(1-\frac {k^2}{i\omega }D_T\bigg)}^{-1}\bigg]. \end{equation}

We solve (2.480) analytically in two simple limits.

Example: High-frequency sound waves in which $\omega \gg k^2D_{\text{sound}}$ , i.e., the wave oscillation period is short compared with the characteristic diffusion time for the sound wave:

(2.481)

\begin{equation} \frac {\omega }{k}=C-\frac {i}{2}kD_{\text{sound}}, \end{equation}

where $D_{\text{sound}}\equiv D_{\zeta }+\big(1-{1}/{\gamma }\big )D_T$ . The $-({i}/{2})kD_{\text{sound}}$ term in (2.481) sets the temporal or spatial damping rate. The expansion of (2.480) leading to the solution in (2.481) is valid if the wavelength is long compared to the mean free path; the wave attenuation rate must be weak.

Example: Low-frequency thermal mode $\omega \ll kC.$ We solve (2.480) by successive approximations to obtain

(2.482)

\begin{equation} \omega =\frac {-ik^2D_T}{\gamma }. \end{equation}

Exercise: Show that the sound wave is isentropic. Show that the thermal wave is approximately isobaric. Show that the shear mode is isochoric.

2.6.9. Generalized Langevin method for transport relations: a sketch

Here we present a sketch of a generalized Langevin method for obtaining the transport relations. References for the formalism are found in Landau & Lifshitz (Reference Landau and LIfshitz1969, §§ 121–124 and 126–127), and Landau & Lifshitz (Reference Landau and Lifshitz1987, ch. XVII).

We begin with $\dot {{\boldsymbol{g}}}=-\nabla \cdot {\boldsymbol \Pi },$ (2.438), which we expand as follows:

(2.483)

\begin{align} \frac {\partial {\boldsymbol{g}}}{\partial t}&=-\nabla \cdot {\boldsymbol \Pi }\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )\nonumber\\[4pt]&=-\nabla \cdot \left [\left\langle {\boldsymbol \Pi }\!\left ({\boldsymbol{x}}\vert {\boldsymbol{t}}\right )\right\rangle +\delta {\boldsymbol \Pi }\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )\right ]=-\nabla \cdot \left [{\left\langle {\boldsymbol \Pi }\right\rangle }_0+\delta \left\langle {\boldsymbol \Pi }\right\rangle +\delta {\boldsymbol \Pi }\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )\right ]. \end{align}

Just to simplify the analysis we replace $\nabla \cdot {\boldsymbol \Pi }$ with $\nabla P$ and pretend this is valid, i.e.,

(2.484)

\begin{equation} \frac {\partial {\boldsymbol{g}}}{\partial t}=-\nabla \!\left (\langle P\rangle +\delta P\right ). \end{equation}

Suppose for a linear sound wave ${\boldsymbol{g}}={\rho }_0{\boldsymbol{u}}$ with $\nabla \times {\boldsymbol{u}}=0$ and ${\boldsymbol{u}}=-\nabla \tilde {\phi }$ , where $\tilde {\phi }$ is the fluid velocity scalar potential function, then

(2.485)

\begin{equation} {\rho }_0\frac {\partial \tilde {\phi }}{\partial t}=\delta \langle P\rangle +\delta P={\frac {\textrm{d}P}{\text{d}\rho }\bigg\vert }_{{{\mathcal S}}_m}\delta \langle \rho \rangle +\delta P, \end{equation}

where $\delta \langle P\rangle$ is that part of $\langle P\rangle$ that varies in space and time. We Fourier analyze and obtain

(2.486)

\begin{equation} -i\omega {\rho }_0\phi _{{\boldsymbol{k}}\omega }=C^2{\langle \rho \rangle }_{{\boldsymbol{k}}\omega }+{\delta P}_{{\boldsymbol{k}}\omega }. \end{equation}

From continuity

(2.487)

\begin{equation} -\frac {\partial \langle \rho \rangle }{\partial t}={\rho }_0\nabla \cdot {\boldsymbol{u}} \to i\omega {\langle \rho \rangle }_{{\boldsymbol{k}}\omega }={\rho }_0k^2{\tilde {\phi }}_{{\boldsymbol{k}}\omega }, \end{equation}

we solve for ${\tilde {\phi }}_{{\boldsymbol{k}}\omega }$ and reduce (2.486) to

(2.488)

\begin{equation} {\langle \rho \rangle }_{{\boldsymbol{k}}\omega }=\frac {k^2{\delta P}_{{\boldsymbol{k}}\omega }}{{\omega }^2-k^2C^2}. \end{equation}

In terms of spectral densities (2.488) leads to

(2.489)

\begin{equation} S^{\rho }\!\left ({\boldsymbol{k}},\omega \right )=\frac {k^4S^p\!\left ({\boldsymbol{k}},\omega \right )}{{\!\left ({\omega }^2-k^2C^2\right )}^2}. \end{equation}

In (2.489) $\pm kC$ are eigenfrequencies ${\omega }_{{\boldsymbol{k}}}$ for the linear modes. We claim that with dissipation included, the right-hand side of (2.489) can be generalized to the form

(2.490)

\begin{equation} \sum\limits _{\alpha }\frac {const}{{\vert \omega -{\omega }^{\alpha }_{{\boldsymbol{k}}}\vert ^2}} \end{equation}

for complex ${\omega }^{\alpha }_{{\boldsymbol{k}}}$ . This analysis and expressions obtained are the analog of the Langevin method used for Brownian motion in §§ 2.2.1 and 2.2.2. We can argue using Rayleigh–Jeans that the density fluctuations have an energy spectrum kT per mode and evaluate the constant in (2.490) indirectly.

In the Kubo/Mori approach (§ 2.6.5) we used a Hamiltonian

(2.491)

\begin{equation} H=H_0+\int\nolimits {\textrm{d}^3\textit{x}\ n\!\left ({\boldsymbol{x}}\vert {\boldsymbol \varGamma }\right )\,{\tilde {\phi }}^{\text{ext}}\!\left ({\boldsymbol{x}},t\right )=}H_0+\int\nolimits {\textrm{d}^3\textit{x}\ \sum\limits _i{\delta ({\boldsymbol{x}}-{{\boldsymbol{r}}}_i)}\,{\tilde {\phi }}^{\text{ext}}\!\left ({\boldsymbol{x}},t\right )} \end{equation}

and obtained a kinetic equation for the response

(2.492)

\begin{equation} \delta \langle n\rangle \!\left ({\boldsymbol{x}},t\right )=\int\nolimits {\textrm{d}^3\textit{x}'}\int\nolimits {\textrm{d}t'}G{\tilde {\phi }}^{\text{ext}}\!\left ({\boldsymbol{x}}{\mathbf '},t'\right ), \end{equation}

where $G$ is a Green’s function. We then obtained a fluctuation–dissipation theorem relating $G$ to the spectral density $S^{\delta n}$ having derived $G$ from just the linearized hydrodynamic equations (linearized with respect to the external field strength).

2.7. Nonequilibrium quantum statistical mechanics

References for this discussion of nonequilibrium quantum statistical mechanics is Tolman (Reference Tolman1938); Kubo (Reference Kubo1965, ch. 2), de Boer & Uhlenbeck (Reference de Boer and Uhlenbeck1962); Cohen (Reference Cohen1962); and de Groot & Suttorp (Reference dE Groot and Suttorp1972). We begin by introducing a representation of the state function $\vert \psi \!\left (t\right )\rangle$ in terms of a decomposition in terms of basis functions that are assumed to be a complete set

(2.493)

\begin{equation} \vert \psi \!\left (t\right )\rangle =\sum\limits _n{c_n\!\left (t\right )\vert {\psi }_n\rangle }, \sum\limits _n{{\vert c_n\vert }^2}=1. \end{equation}

Definition: Consider any Hermitian operator A

(2.494)

\begin{equation} \langle A\rangle \equiv \langle \psi \vert a\vert \psi \rangle =\sum\limits _{m,n}{c^*_mc_n\langle {\psi }_m\vert a\vert {\psi }_n\rangle }\equiv \sum\limits _{m,n}{c^*_mc_nA_{mn}}. \end{equation}

Definition: The ensemble average of $\langle A\rangle$ , i.e., the quantum statistical average, is defined as

(2.495)

\begin{equation} \langle \langle A\rangle \rangle \equiv \overline {\langle \psi \vert a\vert \psi \rangle }=\sum\limits _{m,n}{\overline {c^*_m(t)c_n(t)}}A_{mn}. \end{equation}

Definition: The density matrix is defined by

(2.496)

\begin{equation} {\rho }_{mn}(t)\equiv \ \overline {c^*_m(t)c_n(t)}. \end{equation}

We note ${\rho }_{mn}(t)$ has the properties that it is Hermitian, positive-definite, and Tr( ${\rho }_{mn})=1$ :

(2.497)

\begin{equation} \langle \langle A\rangle \rangle =\sum\limits _{m,n}{{\rho }_{mn}(t)A_{mn}} \quad \textrm{and}\quad \langle A\rangle \!\left (t\right )=\textrm{Tr}\!\left (\rho (t)A\right ). \end{equation}

How $\rho (t)$ varies in time is determined by the Hamiltonian H(t):

(2.498)

\begin{equation} \frac {\partial \rho }{\partial t}=-\frac {1}{i\hslash }[\rho ,H]. \end{equation}

Equation (2.498) is the quantum mechanical analog of the Liouville equation. Remember that for A a time-independent operator $\dot {{\boldsymbol{A}}}=-({1}/({i\hslash }))[{\boldsymbol{A}},H$ ]; and if A is time dependent, then we include ${\partial {\boldsymbol{A}}}/{\partial t}$ additively.

It follows that

(2.499)

\begin{equation} \frac {\textrm{d}}{\textrm{d}t}\langle A\rangle \!\left (t\right )=\textrm {Tr}\!\left (\frac {\partial \rho }{\partial t}A\right )= \textrm {Tr}\!\left (\rho \dot {A}\right )=\langle \dot {A}\rangle . \end{equation}

Correspondence due to Wigner and Weyl:

(2.500)

\begin{equation} H\!\left (P,Q\right );\ \ \left [P,Q\right ]=\frac {\hslash }{i}, \end{equation}

for one degree of freedom. In (2.500), P and Q are operators. The phase-space coordinates are denoted by p and q. We next define the Weyl transform |A(P,Q)|but first note that

(2.501)

\begin{equation} \langle q' \vert a\vert q''\rangle \equiv \int\nolimits \text{d}q'''\delta (q'''-q')A\!\left (\frac {\hslash }{i}\frac {\partial}{\partial q'''},q'''\right )\delta (q'''-q'') \end{equation}

for A in the q representation.

Definition: The Weyl transform is defined

(2.502)

\begin{equation} a\!\left (p,q\right )=\int\nolimits {\text{d}s}e^{\frac {i}{\hslash }ps}\langle q-\tfrac{1}{2}s\vert A \left (P,Q\right )\vert q+\tfrac{1}{2}s\rangle \end{equation}

and its inverse (Wigner transform) is

(2.503)

\begin{equation} A(P,Q)\equiv \int\nolimits {\text{d}p\int\nolimits {\text{d}q\ \delta \!\left (q-Q\right )\delta \!\left (p-P\right )e^{\frac {\hslash }{2i}\frac {{\partial }^2}{\partial p\partial q}}\ a(p,q)}} \end{equation}

with the following prescriptions on the correspondence of independent variables:

(2.504a)

\begin{equation} p\leftrightarrow P, \end{equation}

(2.504b)

\begin{equation} pq\leftrightarrow \tfrac{1}{2}\!\left (PQ+QP\right ), \end{equation}

(2.504c)

\begin{equation} p^2q^2\leftrightarrow \frac {1}{4}\!\left (P^2Q^2+Q^2P^2+2PQ^2P\right ), \end{equation}

(2.504d)

\begin{equation} \frac {2}{\hslash }\left [{\sin \frac {\hslash }{2}}\left (\frac {{\partial }a}{\partial q}\frac {{\partial }b}{\partial p}-\frac {{\partial }b}{\partial q}\frac {{\partial }a}{\partial p}\right )\right ]\leftrightarrow \frac {1}{i\hslash }\left [A,B\right ], \end{equation}

where ${{\partial }a}/{\partial q}$ is the partial derivative with respect to q operating on a. We note that as $\hslash \to 0$ the left-hand side of (2.504d ) recovers the Poisson bracket $\left \{a,b\right \}$ .

Definition: The Wigner function is defined as the Weyl transform of $\rho (t)$ , i.e.,

(2.505)

\begin{equation} \textrm {Weyl transform of}\;\rho \!\left (t\right )\ \textrm {density matrix} \to {\rho }^{\text{Wigner}}(p,q;t). \end{equation}

The Wigner function is like a density in phase space:

(2.506)

\begin{equation} \rho \!\left (q;t\right )=\int\nolimits {\text{d}p}{\rho }^{\text{Wigner}}(p,q;t). \end{equation}

Here ${\rho }^{\text{Wigner}}(p,q;t)$ is real and normed, but can be negative. $\rho \!\left (q;t\right )$ is the correct quantum mechanical probability distribution with respect to q. Similarly,

(2.507)

\begin{equation} \rho \!\left (p;t\right )=\int\nolimits {\text{d}q}{\rho }^{\text{Wigner}}(p,q;t) \end{equation}

is the correct quantum mechanical probability distribution with respect to p. Furthermore, it can be show that ${\rho }^{\text{Wigner}}$ is bounded:

(2.508)

\begin{equation} \vert {\rho }^{\text{Wigner}}(p,q;t)\vert \le \frac {2}{h} \end{equation}

and using the Wigner function

(2.509)

\begin{equation} {\sigma }_p{\sigma }_q\ge \frac {h}{2}. \end{equation}

The equation of evolution for the Wigner function using the Weyl transformation is

(2.510)

\begin{equation} \frac {\partial}{\partial t}{\rho }^W\!\left (p,q;t\right )=\frac {2}{\hslash }\left [{\sin \frac {\hslash }{2}}\left (\frac {{\partial }^{\,{\mathcal H}}}{\partial q}\frac {{\partial }^{{\rho }^W}}{\partial p}-\frac {{\partial }^{{\rho }^W}}{\partial q}\frac {{\partial }^{{\mathcal H}}}{\partial p}\right )\right ]{\mathcal H}(p,q;t){\rho }^W\!\left (p,q;t\right ), \end{equation}

where the superscripts on the partial derivatives give guidance on what functions the partial derivatives operate in the expression that follows, $\mathcal H$ is the Weyl transform of the quantum mechanical Hamiltonian H, and only leading terms have been retained, which means only slow variations in ${\mathcal H}\!\left (p,q;t\right ){\rho }^W\!\left (p,q;t\right )$ are kept. In the limit $\hslash \to 0$ (2.510) becomes $({\partial}/{\partial t}){\rho }^W=-\left \{{\rho }^W,{\mathcal H}\right \}$ . Equation (2.510) is a Liouville equation that allows us to do everything on the Wigner function that we did on the classical probability distribution in § 2 of these lecture notes: all of the methods go through.

[Editors’ Note: This was an elegant conclusion to Kaufman’s graduate statistical mechanics lectures.]

Acknowledgements

Editor Alex Schekochihin thanks the referees for their advice in evaluating this article.

Editor’s Addendum: Appendix – Thermodynamic potentials, maxwell relations, and identities

A.1. Thermodynamic potentials

Classical thermodynamics is expressed in terms of four variables: Pressure P and Volume V as one conjugate pair, Temperature T and Entropy S as a second conjugate pair. Much of the thermodynamic analysis is based on which pair of thermodynamic variables are considered independent, and which pair of thermodynamic variables are considered dependent. Each pair of independent thermodynamic variables $(X,Y)$ is associated with a thermodynamic potential $\Psi (X, Y),$ with its defining differential relation $\textrm{d}\Psi (X, Y) \equiv {(\partial \Psi /\partial X)}_Y\ \text{d}X +{(\partial \Psi /\partial Y)}_X\ \textrm{d}y$ , as follows

(A.1)

\begin{align} &\textrm {Internal Energy}\quad U\!\left (S,V\right ) \to\nonumber\\[4pt] &\qquad \text{d}U=T\!\left (S,V\right )\textrm{d}S-P\!\left (S,V\right )\textrm{d}V, \end{align}

(A.2)

\begin{align} &\textrm{Helmholtz Free Energy }F\!\left (T,V\right )\equiv U-ST{ }\to \nonumber\\[4pt]&\qquad \textrm{d}F(T,V)=-S\!\left (T,V\right )\textrm{d}T-P\!\left (T,V\right )\textrm{d}V \end{align}

(A.3)

\begin{align} & \textrm{Enthalpy }H\!\left (S,P\right )\equiv U+PV\to\nonumber\\[4pt]&\qquad \textrm{d}H(S,P)= T\!\left (S,P\right )\textrm{d}S+V\!\left (S,P\right )\textrm{d}P \end{align}

(A.4)

\begin{align}&\textrm{Gibbs free energy }G\!\left (T,P\right )=U+PV-ST\to \nonumber\\[4pt]&\qquad \textrm{d}G\!\left (T,P\right )=-S\!\left (T,P\right )\textrm{d}T+V\!\left (T,P\right )\textrm{d}P, \end{align}

where the thermodynamic potentials $(U, F, H, G)$ are related by Legendre transformation associated with the substitution $S \to T,$ or $V \to P$ , or both. Hence, the thermodynamic variables $(S,T; P,V)$ can be seen to be either independent variables or dependent functions.

A.2. Maxwell relations

Because of the symmetry of partial derivatives ${{\partial }^2\Psi (X,Y)}/\partial X\partial Y\equiv$ $ {{\partial }^2\Psi (X,Y)}/{\partial Y\partial X}$ , we naturally arrive at the Maxwell relations

(A.5)

\begin{equation} \,{\left (\frac {\partial T}{\partial V}\right )}_S=\frac {{\partial }^2U}{\partial V\partial S}=-{\left (\frac {\partial P}{\partial S}\right )}_V, \end{equation}

(A.6)

\begin{equation} {\left (\frac {\partial S}{\partial V}\right )}_T=-\frac {{\partial }^2F}{\partial V\partial T}={\left (\frac {\partial P}{\partial T}\right )}_V, \end{equation}

(A.7)

\begin{equation} {\left (\frac {\partial T}{\partial P}\right )}_S=\frac {{\partial }^2H}{\partial P\partial S}={\left (\frac {\partial V}{\partial S}\right )}_P, \end{equation}

(A.8)

\begin{equation} {\left (\frac {\partial S}{\partial P}\right )}_T=-\frac {{\partial }^2G}{\partial P\partial T}=-{\left (\frac {\partial V}{\partial T}\right )}_P, \end{equation}

where ${(\partial X/\partial Y)}_Z$ denotes the partial derivative of $X(Y, Z)$ with respect to Y at constant Z.

A.3. Maxwell identities

The Maxwell relations lead to the following identity involving three thermodynamic variables $(X,Y,Z)$ :

(A.9)

\begin{equation} {\left (\frac {\partial X}{\partial Y}\right )}_Z{\left (\frac {\partial Y}{\partial Z}\right )}_X{\left (\frac {\partial Z}{\partial X}\right )}_Y= -1. \end{equation}

For example, consider the identity

(A.10)

\begin{equation} {\left (\frac {\partial V}{\partial P}\right )}_S{\left (\frac {\partial P}{\partial S}\right )}_V{\left (\frac {\partial S}{\partial V}\right )}_P= -1, \end{equation}

which can be proved from the Maxwell relations as follows. First we use the Maxwell relation ${\!\left ({\partial P}/{\partial S}\right )}_V=-{\left ({\partial T}/{\partial V}\right )}_S$ so that

(A.11)

\begin{equation} {\left (\frac {\partial V}{\partial P}\right )}_S{\left (\frac {\partial P}{\partial S}\right )}_V= -\,{\left (\frac {\partial V}{\partial P}\right )}_S{\left (\frac {\partial T}{\partial V}\right )}_S=-{\left (\frac {\partial T}{\partial P}\right )}_S, \end{equation}

which makes use of the identity ${\ (\partial X/\partial Y)}_{\xi }\,{(\partial Y/\partial Z)}_{\xi }\equiv {\ (\partial X/\partial Z)}_{\xi }$ . Next, we use the Maxwell relation ${(\partial S/\partial V)}_P=\,{(\partial P/\partial T)}_S$ , and we obtain

(A.12)

\begin{equation} -\,{\left (\frac {\partial T}{\partial P}\right )}_S{\left (\frac {\partial S}{\partial V}\right )}_S=-{\left (\frac {\partial T}{\partial P}\right )}_S{\left (\frac {\partial P}{\partial T}\right )}_S\equiv -1 \end{equation}

which makes use of the identity ${\ (\partial X/\partial Y)}_{\xi }\,{(\partial Y/\partial X)}_{\xi }\ \equiv \ 1$ .

A.4. Equations (2.478)–(2.479)

In (2.478), we find the triple product (returning to $V={\rho }^{-1}$ )

(A.13)

\begin{align} {\left (\frac {\partial \rho }{\partial P}\right )}_S{\left (\frac {\partial P}{\partial S}\right )}_{\rho }{\left (\frac {\partial S}{\partial \rho }\right )}_T&= \left [{\left (\frac {\partial \rho }{\partial P}\right )}_S{\left (\frac {\partial P}{\partial S}\right )}_{\rho }{\left (\frac {\partial S}{\partial \rho }\right )}_P\right ]{{\left (\frac {\partial \rho }{\partial S}\right )}_P\!\left (\frac {\partial S}{\partial \rho }\right )}_T\nonumber\\[4pt]&=-{\left (\frac {\partial \rho }{\partial S}\right )}_P{\left (\frac {\partial S}{\partial \rho }\right )}_T{,} \end{align}

where we use a Maxwell identity for $(\rho, P, S)$ with the identity ${(\partial S/\partial \rho )}_P\,{(\partial \rho /\partial S)}_P\equiv \ 1$ . Next we use the partial derivative identity

(A.14)

\begin{equation} {\left (\frac {\partial S}{\partial \rho }\right )}_T={\left (\frac {\partial S[\rho ,P\!\left (T,\rho \right )]}{\partial \rho }\right )}_T={\left (\frac {\partial S}{\partial \rho }\right )}_P+{\left (\frac {\partial S}{\partial P}\right )}_{\rho }{\left (\frac {\partial P}{\partial \rho }\right )}_T, \end{equation}

so that (A.14) becomes

(A.15)

\begin{align} -{\left (\frac {\partial \rho }{\partial S}\right )}_P{\left (\frac {\partial S}{\partial \rho }\right )}_T&=-{\left (\frac {\partial \rho }{\partial S}\right )}_P\left [{\left (\frac {\partial S}{\partial \rho }\right )}_P+{\left (\frac {\partial S}{\partial P}\right )}_{\rho }{\left (\frac {\partial P}{\partial \rho }\right )}_T\right ]\nonumber \\[4pt] &=-1-{\left (\frac {\partial \rho }{\partial S}\right )}_P{\left (\frac {\partial S}{\partial P}\right )}_{\rho }{\left (\frac {\partial P}{\partial \rho }\right )}_T, \end{align}

where we have used the identity ${(\partial S/\partial \rho )}_P\,{(\partial \rho /\partial S)}_P\equiv \ 1$ again. We now introduce the specific heat capacities at constant volume $C_V\equiv T{(\partial S/\partial T)}_{\rho }$ and constant pressure $C_P\equiv T{(\partial S/\partial T)}_P$ , so that we obtain

(A.16)

\begin{equation} {\left (\frac {\partial \rho }{\partial S}\right )}_P{\left (\frac {\partial S}{\partial P}\right )}_{\rho }=\frac {\ C_V}{\ C_P}{\left (\frac {\partial \rho }{\partial T}\right )}_P{\left (\frac {\partial T}{\partial P}\right )}_{\rho }\equiv \frac {1}{\gamma }{\left (\frac {\partial \rho }{\partial T}\right )}_P{\left (\frac {\partial T}{\partial P}\right )}_{\rho }, \end{equation}

where $\gamma \equiv C_P/C_V$ denotes the ratio of specific heat capacities and (A.15) becomes

(A.17)

\begin{equation} -1-{\left (\frac {\partial \rho }{\partial S}\right )}_P{\left (\frac {\partial S}{\partial P}\right )}_{\rho }{\left (\frac {\partial P}{\partial \rho }\right )}_T=-1-\frac {1}{\gamma }{\left (\frac {\partial \rho }{\partial T}\right )}_P{\left (\frac {\partial T}{\partial P}\right )}_{\rho }{\left (\frac {\partial P}{\partial \rho }\right )}_T. \end{equation}

Lastly, we use the Maxwell identity for $(\rho , T, P),$ so that (A.1)–(A.3) are combined to yield

(A.18)

\begin{equation} {\left (\frac {\partial \rho }{\partial P}\right )}_S{\left (\frac {\partial P}{\partial S}\right )}_{\rho }{\left (\frac {\partial S}{\partial \rho }\right )}_T= -\!\left (1-\frac {1}{\gamma }\right ) \end{equation}

which is now inserted into (2.478) to obtain (2.479).

Footnotes

†

Deceased.

‡

Semi-retired.

Transcribed and edited.

References

Alder, B.J. 1972 Numerical experiments in statistical mechanics. Comput. Phys. Commun. 3, 86.CrossRef Google Scholar

Alder, B.J. 1973 Computer dynamics, Ann. Rev. Phys. Chem. 24, 325.CrossRef Google Scholar

Alder, B.J. & Wainwright, T.E. 1957 Phase transition for a hard sphere system. J. Chem. Phys. 27, 1208.CrossRef Google Scholar

Becker, R. 1967 Theory of Heat. Springer.CrossRef Google Scholar

Birkhoff, G.D. 1931 Proof of the ergodic theorem. Proc. Natl. Acad. Sci. 17, 656.CrossRef Google Scholar PubMed

Boltzmann, L. 1872 Weitere studien über das Wärmegleichgewicht unter gasmolekülen. WA I, Wiener Berichte 66, 275–370.Google Scholar

Callen, H.B. 1960 Thermodynamics and an Introduction to Thermostatics, 1st edn. Wiley.Google Scholar

Campa, Fanelli, T., A.Dauxois, D. & Ruffo, S. 2014 Physics of Long-Range Interacting Systems. Oxford.CrossRef Google Scholar

Cohen, E.G. 1962 Fundamental Problems in Statistical Mechanics. North Holland.CrossRef Google Scholar

de Boer, J. & Uhlenbeck, G.E. 1962 Interscience Division, Vol. I. John Wiley & Sons.Google Scholar

dE Groot, S.R. & Suttorp, L.G. 1972 Foundations of Electrodynamics. North Holland.Google Scholar

Frisch, H.L. 1964 The equation of state of the classical hard sphere fluid. Adv. Chem. Phys. VI, 239–289.Google Scholar

Galgani, L. & Scott, A. 1972 Planck-like distributions in classical nonlinear mechanics. Phys. Rev. Lett. 28 (18), 1173–1176.CrossRef Google Scholar

Galloway, J.J. & Kim, M. 1971 Lagrangian approach to non-linear wave interactions in a warm plasma. J. Plasma Phys. 6 (1), 53–72.CrossRef Google Scholar

Goldstein, H. 1950 Classical Mechanics, 1st edn. Addison-Wesley.Google Scholar

Green, M.S. 1954 Markoff random processes and the statistical mechanics of time-dependent phenomena. II. Irreversible processes in fluids. J. Chem. Phys. 22 (3), 398–413.CrossRef Google Scholar

Hill, T.L. 1960 An Introduction to Statistical Thermodynamics. Addison-Wesley.Google Scholar

Hirschfelder, J.O., Curtiss, C.F. & Bird, R.B. 1954 Molecular Theory of Gases and Liquids. Wiley.Google Scholar

Irving, J. & Kirkwood, J.G. 1950 The statistical mechanical theory of transport processes. IV. The Equations of Hydrodynamics. J. Chem. Phys. 18 (6), 817–829.CrossRef Google Scholar

Jackson, J.D. 1975 Classical Electrodynamics, 2nd edn. Wiley.Google Scholar

Kaufman, A.N. & Cohen, B.I. 2019 Theoretical plasma physics. J. Plasma Phys. 85 (6), 1–236.CrossRef Google Scholar

Kubo, R. 1957 Statistical-mechanical theory of irreversible processes. I. General theory and simple applications to magnetic and conduction problems. J. Phys. Soc. Jpn. 12 (6), 570–586, 12 (quertion mark): 570.CrossRef Google Scholar

Kubo, R. 1965 Statistical Mechanics. Elsevier.Google Scholar

Kubo, R. 1966 The fluctuation-dissipation theorem. Rep. Prog. Phys. 29 (1), 255–284.CrossRef Google Scholar

Landau, LD. & Lifshitz, E.M. 1963 Electrodynamics of Continuous Media. Pergamon Press.Google Scholar

Landau, LD. & LIfshitz, E.M. 1969 Statistical Physics, 2nd edn. Pergamon Press.Google Scholar

Landau, L.D. & Lifshitz, E.M. 1987 Fluid Mechanics, 2nd edn. Pergamon Press.Google Scholar

Lee, T.D. & Yang, C.N. 1952 Statistical theory of equations of state and phase transitions. II. Lattice gas and Ising model. Phys. Rev. 87 (3) 410–419.CrossRef Google Scholar

Liboff, R.L. 1969 Introduction to Kinetic Theory. Wiley.Google Scholar

Meyer-Vernet, N. 1993 Aspects of Debye shielding. Am. J. Phys. 61 (3), 249–257.CrossRef Google Scholar

McComb, W.D. 2004 Renormalization Methods: A Guide for Beginners. Oxford University Press.Google Scholar

Mori, H. 1965 Transport, collective motion, and Brownian motion. Prog. Theor. Phys. 33 (3) 423–455.CrossRef Google Scholar

Nicholson, D.R. 1983 Introduction to Plasma Theory. Wiley.Google Scholar

Onsager, L. & Fuoss, R.M. 1932 Irreversible processes in electrolytes. Diffusion, conductance and viscous flow in arbitrary mixtures of strong electrolytes. J. Phys. Chem. 36 (11), 2689–2778.CrossRef Google Scholar

Pawula, R.F. 1967 Approximation of linear Boltzmann equation by Fokker-Planck equation. Phys. Rev. 162 (1), 186–188.CrossRef Google Scholar

Reif, F. 1965 Fundamentals of Statistical and Thermal Physics. McGraw-Hill.Google Scholar

Riebesell, J. 2022 Solid-state physics, tikz.netlify.app. on-line graphic.Google Scholar

Ryskin, G. 1997 Simple procedure for correcting equations of evolution: application to Markov processes. Phys. Rev. E 56 (5), 5123–5127.CrossRef Google Scholar

Sator, N., Pavloff, N. & Couëdel, L. 2023 Statistical Physics. CRC Press.CrossRef Google Scholar

Schiff, L.I. 1968 Quantum Mechanics, 3rd edn. McGraw-Hill.Google Scholar

Shoub, E.C. 1987 Failure of the Fokker-Planck approximation to the Boltzmann integral for (1/r) potentials. Phys. Fluids 30 (5), 1340–1352.CrossRef Google Scholar

Tolman, R.C. 1938 The Principles of Statistical Mechanics. Clarendon Press.Google Scholar

Wood, W.W. & Jacobson, J.D. 1957 Preliminary results from a recalculation of the Monte Carlo equation of state of hard spheres. J. Chem. Phys. 27 (5), 1207–1208.CrossRef Google Scholar

Yang, C.N. & Lee, T.D. 1952 Statistical theory of equations of state and phase transitions. I. Theory of condensation. Phys. Rev. 87 (3), 404–409.CrossRef Google Scholar

Figure 1. Model van der Waals + hard sphere potential.

Table 1. Adiabatically evolving systems.

Figure 2. Fermi–Dirac distribution function $\langle$N${}_{k}$$\rangle$=n(${{\mathcal E}})$ (Riebesell 2022).

Figure 3. Phase diagram for Bose–Einstein condensate, density versus temperature.

Figure 4. Schematic of ${\langle N_0\rangle }/{N}\,\textrm {versus}\ T$ for the bose condensate.

Figure 5. Schematic: P versus n for various temperatures.

Figure 6. Schematic: P versus V for various temperatures.

Figure 7. Schematic: $\beta P=P/T$ versus n equation of state and phase diagram.

Figure 8. Monte Carlo equation of state results from Wood and Jacobson (1957) showing their results and those of Alder & Wainwright (1957) (solid line for 108 molecules; + for 32 molecules).

Figure 9. Schematic for P versus V phase diagram for the gas–solid system.

Figure 10. Fractional ionization versus ${T/T}_{\textrm{I}}(n)$ based on (1.321).