Molecular Physical Chemistry: José J.C. Teixeira-Dias
Molecular Physical Chemistry: José J.C. Teixeira-Dias
Molecular Physical Chemistry: José J.C. Teixeira-Dias
Teixeira-Dias
Molecular
Physical
Chemistry
A Computer-based Approach using
Mathematica® and Gaussian
Molecular Physical Chemistry
José J.C. Teixeira-Dias
Molecular Physical
Chemistry
A Computer-based Approach using
Mathematica® and Gaussian
123
José J.C. Teixeira-Dias
University of Aveiro
Aveiro, Baixo Vouga
Portugal
v
Contents
1 Thermodynamics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Ideal Gas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Kinetic Model of Gases . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.2.1 Pressure and Temperature . . . . . . . . . . . . . . . . . . . . 4
1.2.2 Distribution of Velocities . . . . . . . . . . . . . . . . . . . . . 6
1.2.3 Mean Free Path . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.3 Van der Waals Equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
1.4 Mathematical Tools . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
1.4.1 Exact Differential . . . . . . . . . . . . . . . . . . . . . . . . . . 17
1.4.2 Fundamental Theorem of Calculus . . . . . . . . . . . . . . 20
1.4.3 Line Integral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
1.5 Thermodynamic Systems. . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
1.6 Heat and Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
1.6.1 Mechanical Work . . . . . . . . . . . . . . . . . . . . . . . . . . 25
1.7 First Law . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
1.7.1 Heat Capacities. . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
1.7.2 Calorimeter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
1.7.3 Standard States . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
1.8 Reversible Heat Engine. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
1.8.1 Carnot’s Heat Engine . . . . . . . . . . . . . . . . . . . . . . . 33
1.8.2 Absolute Temperature . . . . . . . . . . . . . . . . . . . . . . . 33
1.9 Entropy and the Second Law . . . . . . . . . . . . . . . . . . . . . . . . . 35
1.10 Irreversible Processes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
1.10.1 Heat Flow . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
1.10.2 Gas Expansion . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
1.10.3 Diffusion of Matter . . . . . . . . . . . . . . . . . . . . . . . . . 40
1.10.4 Chemical Reaction . . . . . . . . . . . . . . . . . . . . . . . . . 41
1.11 Chemical Potential . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
1.11.1 Gibbs–Duhem Equation . . . . . . . . . . . . . . . . . . . . . . 43
1.11.2 Ideal Gas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
1.11.3 Real Gases. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
1.11.4 Liquid Solutions . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
1.11.5 Pure Liquids and Solids. . . . . . . . . . . . . . . . . . . . . . 47
vii
viii Contents
2 Chemical Kinetics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
2.1 Rate of a Chemical Reaction . . . . . . . . . . . . . . . . . . . . . . . . . 84
2.2 Experimental Rate Equation. . . . . . . . . . . . . . . . . . . . . . . . . . 86
2.2.1 First-Order Reactions . . . . . . . . . . . . . . . . . . . . . . . 87
2.2.2 Second-Order Reactions. . . . . . . . . . . . . . . . . . . . . . 88
2.2.3 Zeroth-Order Reactions . . . . . . . . . . . . . . . . . . . . . . 89
2.3 Effect of Temperature Change . . . . . . . . . . . . . . . . . . . . . . . . 89
2.4 Elementary Reactions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
2.5 Complex Reactions. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
2.6 Extremely Fast Reactions . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
2.6.1 Neutralization Reaction in Water . . . . . . . . . . . . . . . 96
2.7 Chemical Oscillations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
2.7.1 Brusselator. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97
Notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102
Mathematica Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106
Glossary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Further Reading. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 186
Mathematica Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 190
Glossary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 200
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 203
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 205
Further Reading. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 205
6 Crystals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 287
6.1 Packing Disks and Spheres . . . . . . . . . . . . . . . . . . . . . . . . . . 287
6.1.1 Disks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 287
6.1.2 Spheres . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 289
6.1.3 Hexagonal Close Packing . . . . . . . . . . . . . . . . . . . . 291
6.1.4 Cubic Close Packing . . . . . . . . . . . . . . . . . . . . . . . . 292
6.1.5 Packing Densities . . . . . . . . . . . . . . . . . . . . . . . . . . 292
6.1.6 Occupying Interstices . . . . . . . . . . . . . . . . . . . . . . . 295
6.2 Translation Symmetries . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 297
6.2.1 2D Bravais Lattices. . . . . . . . . . . . . . . . . . . . . . . . . 297
6.2.2 3D Bravais Lattices. . . . . . . . . . . . . . . . . . . . . . . . . 301
6.3 Crystal Structures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 304
6.3.1 Metals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 304
6.3.2 Lattice Energy . . . . . . . . . . . . . . . . . . . . . . . . . . . . 304
6.3.3 Cesium Chloride and Sodium Chloride . . . . . . . . . . . 305
6.3.4 Diamond and Zinc Blende . . . . . . . . . . . . . . . . . . . . 306
6.4 X-Ray Diffraction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 308
6.5 Electrons in Solids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 310
6.6 Semiconductors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 312
Mathematica Codes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 315
Glossary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 327
Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 328
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 329
Further Reading. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 329
7 Water . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 331
7.1 Molecular Geometry . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 332
7.2 Enthalpy of Formation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 333
7.3 Atomic Charges . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 333
7.4 Dipole Moment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 334
7.4.1 Electric Multipoles . . . . . . . . . . . . . . . . . . . . . . . . . 334
7.4.2 Point Dipole. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 336
7.4.3 Electric Field Streamlines. . . . . . . . . . . . . . . . . . . . . 337
7.4.4 H2O Dipole and Quadrupole . . . . . . . . . . . . . . . . . . 338
7.5 Molecular Orbitals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 339
7.5.1 Natural Bond Orbitals . . . . . . . . . . . . . . . . . . . . . . . 341
7.6 Molecular Vibrations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 342
7.7 Intermolecular Interactions. . . . . . . . . . . . . . . . . . . . . . . . . . . 344
7.7.1 Electrostatic Interaction . . . . . . . . . . . . . . . . . . . . . . 344
7.7.2 Induction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 346
7.7.3 Dispersion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 347
7.8 Hydrogen Bond . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 349
7.8.1 The Water Dimer . . . . . . . . . . . . . . . . . . . . . . . . . . 351
xii Contents
Gaussian (www.gaussian.com)
The computational quantum chemistry calculations whose results are presented in
this book were carried out with the Gaussian 09 series of programs.
Authors for Gaussian 09, Revision A.02: M.J. Frisch, G.W. Trucks, H.B.
Schlegel, G.E. Scuseria, M.A. Robb, J.R. Cheeseman, G. Scalmani, V. Barone, B.
Mennucci, G.A. Petersson, H. Nakatsuji, M. Caricato, X. Li, H.P. Hratchian, A.F.
Izmaylov, J. Bloino, G. Zheng, J.L. Sonnenberg, M. Hada, M. Ehara, K. Toyota, R.
Fukuda, J. Hasegawa, M. Ishida, T. Nakajima, Y. Honda, O. Kitao, H. Nakai,
T. Vreven, J.A. Montgomery, Jr., J.E. Peralta, F. Ogliaro, M. Bearpark, J.J. Heyd,
E. Brothers, K.N. Kudin, V.N. Staroverov, R. Kobayashi, J. Normand, K.
Raghavachari, A. Rendell, J.C. Burant, S.S. Iyengar, J. Tomasi, M. Cossi, N. Rega,
J.M. Millam, M. Klene, J.E. Knox, J.B. Cross, V. Bakken, C. Adamo, J. Jaramillo,
R. Gomperts, R.E. Stratmann, O. Yazyev, A.J. Austin, R. Cammi, C. Pomelli, J.W.
Ochterski, R.L. Martin, K. Morokuma, V.G. Zakrzewski, G.A. Voth, P. Salvador, J.
J. Dannenberg, S. Dapprich, A.D. Daniels, Ö. Farkas, J.B. Foresman, J.V. Ortiz,
J. Cioslowski, and D.J. Fox, Gaussian, Inc., Wallingford CT, 2009.
GaussView (www.gaussian.com)
GaussView is the graphical interface to Gaussian.
Authors for GaussView, Version 5.0.9: Roy Dennington, Todd Keith, and John
Millam, Semichem Inc., Shawnee Mission, KS, 2009.
Natural bond orbitals
A natural bond orbitals analysis of the electron population is included in Gaussian
09.
Reference: NBO Version 3.1, E.D. Glendening, A.E. Reed, J.E. Carpenter,
and F. Weinhold.
Mathematica (www.wolfram.com)
Author of Mathematica: Wolfram Research Inc. Champaign, IL, 2010.
Wolfram Alpha (www.wolframalpha.com)
This is a computational knowledge-based engine that provides answers to questions
raised through the Internet in free-form input of natural language (English).
xiii
xiv Credits
Abstract
The first three sections of this chapter include the ideal gas equation, the kinetic
model of gases, and the Van der Waals equation. They aim at building a unified
background for thermodynamics. Having in mind those students who might need
some refreshment of the mathematical concepts much used in thermodynamics,
such as exact and inexact differentials, the fundamental theorem of calculus, and
line integrals, we introduce a short section on these mathematical tools, followed
by sections on thermodynamic systems, heat and work, and the first law. In order
to formally arrive at the state function entropy, we show that an arbitrary
reversible cycle can be covered to the required precision by a mesh of
infinitesimal Carnot cycles and arrive at the Clausius inequality for an arbitrary
irreversible cycle. Following Prigogine, a distinction is then established between
entropy transfer across system boundaries and entropy produced inside a system
due to irreversible processes that drive the system to equilibrium. These are
illustrated with heat flow, gas expansion, matter diffusion, and a chemical
reaction in the gas phase. Next, we derive the Gibbs–Duhem equation and obtain
the chemical potential for the ideal gas, real gases, liquid solutions, pure liquids,
and solids, before presenting sections on the Gibbs energy, chemical equilib-
rium, the Gibbs phase rule, and the Helmholtz energy. The last three sections of
this chapter, on surface tension, membrane potential, and the electrochemical
cell, are important applications of the first and second laws of thermodynamics.
At the end of the chapter we present several Mathematica codes (Maxwell
Distribution of Molecular Speeds, Critical Point for Van der Waals Fluid,
3D Plot of the Van der Waals Equation in Reduced Variables, Absolute and
Reduced Temperatures for the Van der Waals Carbon Dioxide, Isothermal and
Adiabatic Transformations on an Ideal Gas Surface, Efficiency of the Carnot
Heat Engine, Gibbs Energy, and Affinity of a Chemical Reaction) that are
interconnected with the main text and contain detailed explanations of important
In, 1660, Robert Boyle (1627–1691) showed that at a defined temperature T, the
volume V of a gas is inversely proportional to its pressure p, that is,
V ¼ f1 ðTÞ=p ð1:1Þ
where f1(T) is a function of temperature (see Note §1). Edmé Mariotte [1620(?)–
1684] independently confirmed this functional dependence between volume and
pressure at constant temperature. In the last quarter of the eighteenth century,
Jacques Charles (1746–1823) studied the dependence between volume and tem-
perature, having found that the volume is proportional to the temperature at constant
pressure,
n ¼ N=NA ð1:3Þ
Combining (1.1), (1.2) and the Avogadro hypothesis (gas volume proportional to
the amount of gas), one concludes that f1(T) is given by
where R is called the gas constant (R = 8.3144 J mol−1 K−1; see Appendix).
Substitution of (1.4) in (1.1) leads to
pV ¼ nRT ð1:5Þ
1.1 Ideal Gas 3
The gas constant is related to the Boltzmann constant (kB = 1.3806 10−23
J K−1; see Appendix) by
R ¼ kB N A ð1:6Þ
p V ¼ N kB T ð1:7Þ
p ¼ qN kB T ð1:8Þ
qN ¼ N=V ð1:9Þ
Equations (1.5), (1.7), and (1.8) are equivalent forms of the ideal gas equation of
state.
A gas can be studied at the macroscopic and molecular levels. The macroscopic
study requires the measurement of variables such as pressure, volume, temperature,
and amount of gas, interrelated by an empirical, relationship involving these vari-
ables (the equation of state), as we have just shown for the ideal gas. In turn, at the
molecular level, the study of gases requires a theoretical model that allows for the
understanding of the dynamical behavior of molecules under the model assump-
tions and the evaluation of macroscopic intensive variables such as pressure and
temperature.
It was Daniel Bernoulli (1700–1782) who first concluded that the pressure of a
gas results from collisions of gas particles with the container walls. At that time, no
one knew about molecules or could predict the order of magnitude of the speed of
gas particles.
The kinetic model of gases explains macroscopic gas properties such as pressure
and temperature from the random motion of molecules and was mostly developed
during the nineteenth century, with main contributions by James Prescott Joule
4 1 Thermodynamics
Among the molecules that move toward the container wall at the right in Fig. 1.2,
those that collide with the wall within the time interval Dt cannot be at a distance
from the wall greater than vxDt, where vx > 0. The volume occupied by such
molecules is vxDtA, where A is the area of the wall, and the number of such
molecules is given by
vx DtAqN =2 ð1:10Þ
1.2 Kinetic Model of Gases 5
where qN is the number density given by (1.9). The factor of 1/2 in (1.10) accounts
for the fact that one-half of the molecules move toward each of the walls, right and
left. In each collision with the wall at the right, the x component of the velocity
changes sign. Hence, the variation of linear momentum on collision with the wall is
given by mvx − (−mvx) = 2mvx (Fig. 1.3), and the total variation of linear
momentum is 2mvx times the number of molecules (1.10), that is,
According to Newton’s equation, the force is equal to the time derivative of the
linear momentum, which is given approximately by the ratio of (1.11) to Dt,
since the time interval is very short. Therefore, the pressure (force per unit area) on
the right wall is given by (1.12) divided by A, that is,
p ¼ mv2x qN ð1:13Þ
v2 ¼ 3v2x ð1:15Þ
p ¼ mv2 qN =3 ð1:16Þ
However, not all molecular velocities have the same absolute value (the same
speed), and consequently the same square speed. In the ensemble average of square
speeds, the average is taken over all N gas molecules in the container,
2 2
v ¼ v1 þ v22 þ þ v2N =N ð1:17Þ
In velocity space, the Cartesian axes correspond to the velocity components vx, vy,
and vz. The probability of finding the velocity vector v = (vx, vy, vz) in the domain
(vx, vx + dvx), (vy, vy + dvy), (vz, vz + dvz) (Fig. 1.4) is given by
where the function P(v) is the probability density for the distribution of molecular
velocities, first evaluated by James Clerk Maxwell for a gas in thermodynamic
equilibrium and given by
1.2 Kinetic Model of Gases 7
h i
PðvÞ / exp m v2x þ v2y þ v2z =ð2kB T Þ ð1:21Þ
where the symbol / means that the proportionality constant of the above expression
has been omitted and the absolute value of the exponent of (1.21) is the molecular
kinetic energy m(v2x + v2y + v2z )/2 divided by kBT. The proportionality constant of
(1.21) is given by
Z1 Z1 Z1 h i 3=2
m
1= exp m v2x þ v2y þ v2z =ð2kB T Þ dvx dvy dvz ¼
2pkB T
1 1 1
ð1:22Þ
and is said to be normalized, that is, the integral of the probability density over all
velocity space is equal to 1. The density P(v) is called the Maxwell velocity
distribution. Expression (1.23) can be easily factored as the product of the vx, vy,
and vz components of the velocity vector v, that is,
where
1=2
m
Pðvx Þ ¼ exp mv2x =ð2kB T Þ ð1:25Þ
2p kB T
Since P(vx) depends on v2x , both vx and −vx give the same value to the function,
which is symmetric about zero, that is, its mean value is zero. Figure 1.5 shows
P(vx) for molecular oxygen, at 100 and 300 K.
8 1 Thermodynamics
and the geometric place for the tips of velocity vectors with the same absolute value
is the volume between two concentric spheres with radii v and v + dv, which is
given by the product of the area of the spherical surface 4pv2 and dv. Thus, using
(1.23) and (1.14), we can write
3=2
m
f ðvÞdv ¼ PðvÞ4pv2 dv ¼ 4p exp mv2 =ð2kB T Þ v2 dv ð1:27Þ
2pkB T
where f(v) is the distribution of molecular speeds [the word speed is used to
express the absolute value of the velocity vector, and the integration of (1.27) is now
between zero and infinity]. Figure 1.6 shows the distributions of molecular speeds
for oxygen, at 100 and 300 K (see Mathematica code M1). It can be seen that
these distributions are no longer symmetric, due to the factor v2, which strongly
weights larger values of molecular speeds, thus widening the higher wing of the
distributions.
[f(v) has a single positive stationary point that is the curve maximum]. The positive
solution for (1.28) is
rffiffiffiffiffiffiffiffiffiffiffi
2kB T
vp ¼ ð1:29Þ
m
Z1 rffiffiffiffiffiffiffiffiffiffiffi
8kB T
vm ¼ f ðvÞvdv ¼ ð1:30Þ
pm
0
Consider the mutual diffusion of two gases, oxygen and nitrogen, at a defined
pressure and temperature, after the wall separating them has been removed
(Fig. 1.7a). When the equilibrium is reached, the gas mixture has the same con-
centration everywhere in the system. In a container with a few liters of volume, the
equilibrium state may be reached in approximately 1 h. This result may look sur-
prising, since molecular speeds are on the order of hundreds of meters per second.
While each molecule has a large number of collisions, not all of them drive the
molecule away from the starting point, since some collisions drive the molecule
backward.
10 1 Thermodynamics
(b)
N2
(a)
before some time later ...
O2 N2
O2
Fig. 1.7 a Diffusion of two gases at the same pressure and temperature. b Test molecule of O2
diffusing through N2. Note that the colors used to distinguish the gases are fictitious: both O2 and
N2 do not absorb in the visible spectrum
Let us follow an oxygen molecule on its path through nitrogen gas (Fig. 1.7b).
To simplify matters, let us assume that the nitrogen molecules are fixed. Through
what distance does the oxygen molecule move before it first collides with a nitrogen
molecule? How long does it move before its second/third collision? The average
distance between collisions is called the mean free path.
In this thought experiment, the test molecule O2 and the molecules N2 are
represented by spheres of the same diameter d. The cross section for collision is the
area of a circle of radius d (pd2) (Fig. 1.8a). Between consecutive collisions, this
circle spans a cylinder whose height is equal to the distance traveled by the test
molecule (Fig. 1.8b). The volume of the broken cylinder is called the interaction
volume, and it contains all the nitrogen molecules that collide with the test molecule
within a certain time interval Dt.
(b)
(a)
Fig. 1.8 a Cross section for collision between spheres of equal radius. b All molecules colliding
with the test molecule have their centers of mass within the broken cylinder
1.2 Kinetic Model of Gases 11
A reasonable estimate k for the mean free path can be obtained by dividing the
distance d traveled by the test molecule by the number of collisions Nc, that is, the
number of N2 molecules within the interaction volume (= volume of the broken
cylinder),
k ¼ d=Nc ð1:32Þ
Nc is given by
Nc ¼ Vint qN ð1:33Þ
where qN is the number density of molecular nitrogen and the interaction volume
Vint is
d 2 1
k¼ pd qN ð1:35Þ
pd2 dqN
This result is based on the less realistic assumption of the nitrogen molecules
remaining fixed while the test molecule travels among them. However, the distance
traveled by the test molecule [d in the numerator of (1.35)] and the length of the
interaction cylinder [d in the denominator of (1.35)] are distinct quantities, which
we name dn and dd, respectively. In fact, the distance traveled by the test molecule
dn is equal to the rms speed times the corresponding time interval,
dn ¼ vrms Dt ð1:36Þ
whereas dd, the real length of the interaction cylinder, is equal to the rms speed of
the test molecule relative to the speed of the colliding nitrogen molecule,
Hence, substitution of (1.36) in (1.35) and of (1.38) and (1.37) in (1.35) leads to
the following expression for the mean free path:
Considering the assumptions of the kinetic model of gases, this expression can
give only an order of magnitude for the mean free path. According to (1.39), the
mean free path is inversely proportional to the cross section for collisions and the gas
number density. In fact, increases of the cross section and of the gas number density
are expected to lead to decreases in the mean free path. Note that the mean free path
is independent of the distance traveled by the test particle [see (1.39)], since this
distance was chosen in an arbitrary way. Considering the approximate nature of
(1.39), one can deal with d as a parameter whose value can be chosen so that a
realistic value is obtained for the collision cross section. In particular, d = 0.3 nm
(= 3 Å) allows one to obtain a reasonable estimate for the mean free path.
The quantity qN can be obtained from the gas pressure and temperature using
(1.8), assuming ideal gas behavior. Alternatively, in order to relate qN to the
molecular diameter d, one can take the ratio of the liquid and gas densities and
equate this ratio to the inverse ratio of the molecular volumes,
ql m=Vl Vg
¼ ¼ ð1:40Þ
qg m=Vg Vl
since a molecule has the same mass m in the liquid and gas phases. For nitrogen, the
liquid density is 800 kg m−3 and the gas density is 1.2 kg m−3. Therefore, the ratio
of densities is approximately equal to 670 ( 800/1.2). In the liquid, the volume of
one molecule is approximately given by d3 (volume of a cube containing one
nitrogen molecule). Hence, applying (1.40) to nitrogen, we can write
Vg
670 ) Vg 670d3 ð1:41Þ
d3
If the definition of number density [see (1.9)] is applied to one molecule in the
gas and (1.41) is used, we obtain
1
qN ¼ Vg ð670d3 Þ1 ð1:42Þ
Hence, the mean free path is about 150 times the collision cross-sectional radius
or the particle diameter (see Fig. 1.8a). The average distance between molecules in
the gas is given approximately by
1=3
670d 3 10d ð1:44Þ
that is, is about 15 times smaller than the mean free path.
1.2 Kinetic Model of Gases 13
Vm b ð1:45Þ
where Vm is the ideal gas molar volume and b accounts for the nonzero volume of
molecules per mole of gas, that is, the excluded molar volume. The second mod-
ification of the ideal gas equation introduced by Van der Waals accounts for the
pressure reduction due to attractive forces between particles. Attraction between
particles reduces their velocities, thus decreasing the number of collisions per time
unit on a unit area of wall. Considering the RT/(Vm − b) pressure being reduced by
dp, we can write
RT
p¼ dp ð1:46Þ
Vm b
Van der Waals took dp inversely proportional to the square of the volume,
a
dp ¼ ð1:47Þ
Vm2
At a specified temperature, the (p, Vm) curve is an isotherm. Figure 1.9 shows
isotherms for carbon dioxide, from 274 to 344 K, in steps of 10 K. As the tem-
perature of the isotherm increases, the maximum and minimum of the isotherms
gradually become closer, until they merge at the critical temperature. In the
critical isotherm, pressure and molar volume at which maximum and minimum
coincide correspond to an inflection called a critical point. This point is mathe-
matically defined by the following equalities:
@p @2p
¼0 ¼0 ð1:49Þ
@Vm T @Vm2 T
Solving this system of equations gives the critical volume Vmc and the critical
temperature Tc. The corresponding pressure is the critical pressure pc (see Mathe-
matica code M2). The critical values depend on the van der Waals constants as
follows:
8a a
Vmc ¼ 3b Tc ¼ pc ¼ ð1:50Þ
27bR 27b2
showing that the Van der Waals equation is cubic in Vm. When the linear and
quadratic terms cancel, the three roots coincide, meaning that the maximum and
minimum of the isotherm converge to the critical point. For T > Tc, the isotherms
are single-valued, and as the temperature increases, they gradually lose the
inflection point and become similar to the ideal gas isotherms (Fig. 1.9) (E5).
Figure 1.10 presents a Van der Waals isotherm (continuous solid curve) for
carbon dioxide at 270 K. Let us start at E′ in the gas and progress as the volume
decreases. The point at which the gas starts to condense is E. When the liquid is in
equilibrium with the gas, a progressive volume reduction causes more gas to
condense, while the pressure is kept constant during the liquid–gas equilibrium. The
dashed straight line AE represents the real liquid–gas equilibrium, whereas the
oscillating part of the Van der Waals curve that includes a maximum and a mini-
mum is not real, and so the points along the continuous solid curve ABCDE cannot
be experimentally observed. Maxwell determined the horizontal part AE and found
that the areas a (=ABCA) and b (=CDEC) are equal (Maxwell construction).
Progressing along the straight line EA as the volume decreases due to gradual
condensation of the gas, the point A is attained where all the gas has been con-
densed to the liquid. Thus, the steep curve AA′ corresponds to the liquid. The
steepness of this part of the curve is due to the reduced compressibility of the liquid.
Fig. 1.10 Van der Waals isotherm at 270 K, for carbon dioxide. A′A corresponds to the liquid,
the dashed straight line ACE corresponds to the liquid–gas equilibrium, and EE′ corresponds to
the gas. Graph obtained with Mathematica
16 1 Thermodynamics
We now consider the critical values (1.50) and use them as units of volume,
temperature, and pressure to obtain the reduced variables
If the values of Vm, T, and p taken from (1.52) are substituted in (1.48), one
obtains the Van der Waals equation in reduced variables
3 1 8
pr þ 2
Vmr ¼ Tr ð1:53Þ
Vmr 3 3
(see Mathematica code M3). This is a universal equation in the sense that it does
not include Van der Waals constants specific to any particular fluid. In fact,
comparison of (1.48) with (1.53) shows that a, b, and the gas constant R correspond
to 3, 1/3, and 8/3, respectively, that is, these values of a, b, and R lead to unit values
of the critical variables. Figure 1.11 shows the Van der Waals isotherms in reduced
variables with the reduced temperatures corresponding to the absolute temperatures
of the isotherms for carbon dioxide in Fig. 1.9 (see Mathematica code M4, E6).
In the following sections of the present chapter, most of the expressions use dif-
ferentials of thermodynamic functions and deal with inexact differentials and line
integrals. So the present section has in mind those students who might need some
refreshment of mathematical concepts such as exact and inexact differentials, the
fundamental theorem of calculus, and line integrals. Those who are aware of these
mathematical concepts might skip this section.
1.4 Mathematical Tools 17
f ðx þ DxÞ f ðxÞ
f 0 ðxÞ ¼ lim ð1:54Þ
Dx!0 Dx
The slope of the tangent line at x (Fig. 1.12) is the trigonometric tangent of the
angle h
sinðhÞ
tanðhÞ ¼ ð1:55Þ
cosðhÞ
where h is the angle formed by the tangent line and the positive x-axis. Figure 1.12
shows a curve and its tangent at x, and illustrates the difference between
and
dy ¼ y0 ðxÞDx ð1:57Þ
Inspection of Fig. 1.12 also shows that the difference between (1.56) and (1.57),
Dy − dy, goes to zero as Dx ! 0. The question is whether Dy − dy approaches
zero faster or slower than Dx. To answer this question, we need to take the limit of
(Dy − dy)/Dx as Dx ! 0, which is given by
Dy dy Dy
lim ¼ lim y0 ðxÞ ¼ y0 ðxÞ y0 ðxÞ ¼ 0 ð1:58Þ
Dx!0 Dx Dx!0 Dx
where eDx goes to zero faster than Dx (for example, as Dx2). Substitution of (1.57)
in (1.59) gives
and so
Dy y0 ðxÞ Dx ð1:61Þ
(note the approximation). We now replace Dx and Dy by finite yet arbitrarily small
variations dx and dy and write
dy ¼ y0 ðxÞdx ð1:62Þ
where dy is the differential of y. Equation (1.62) can be rewritten using the fol-
lowing equivalent forms:
dy df
dy ¼ dx df ¼ dx ð1:63Þ
dx dx
Note that dy/dx or the equivalent df/dx represents a limit [see (1.54)], not a
quotient of differentials dy and dx.
For a function f(x, y) of two independent variables, the partial derivative of
f(x, y) with respect to x is given by
@f f ðx þ Dx; yÞ f ðx; yÞ
fx ¼ ¼ lim ð1:64Þ
@x Dx!0 Dx
assuming that this limit exists. Note that in evaluating the rate of change of f(x, y)
with respect to x, the variable y is held constant. For this reason, we sometimes
write (∂f/∂x)y emphasizing that y is kept constant. The partial derivative of f(x, y)
with respect to y is given by
@f @f
df ¼ dx þ dy ð1:66Þ
@x @y
where df is the total differential of f. For two independent variables x and y, (1.66)
is the equivalent of the second equality of (1.63) for one independent variable.
We now consider the differential
and ask whether this is an exact differential, that is, whether there is a function
f(x, y) such that
@f @f
df ¼ dx þ dy ¼ gðx; yÞdx þ hðx; yÞdy ð1:68Þ
@x y @y x
that is,
@f @f
¼ gðx; yÞ ¼ hðx; yÞ ð1:69Þ
@x y @y x
since dx and dy are arbitrary variations. Note that the first-order derivatives of
f(x, y) are related to each other by
@f @f
@ @
@x @2f @y @2f
¼ ¼ ¼ ð1:70Þ
@y @y@x @x @x@y
that is, the mixed partial second derivatives are equal. Substitution of (1.69) in
(1.70) leads to
@gðx; yÞ @hðx; yÞ
¼ ð1:71Þ
@y @x
Recall now that the integral of a continuous function f(x) in the interval [a,b] is
given by
Zb X
f ðxÞdx ¼ lim f ðxi ÞDxi ð1:72Þ
jLj!0
i
b
where [a,b] is divided into n subintervals Dxi and |L| represents the width of the
largest subinterval (Fig. 1.13).
Since f(xi) times Dxi is the area of the rectangle of height f(xi) and width Dxi, the
summation on the right-hand side of (1.72) corresponds to the area of all the
rectangles, and its limit is the area under the curve in the interval [a,b] (Fig. 1.13). If
the upper limit b is replaced by the variable x defined within the interval [a,b], one
obtains the indefinite integral
Zx
F ð xÞ ¼ f ðuÞdu ð1:73Þ
a
i.e., F(x) is the antiderivative of f(x). In order to understand this important result,
known as the fundamental theorem of calculus, we apply the definition of the
derivative of F(x) and use (1.73), obtaining
Fig. 1.13 The integral of the function in the interval [0.5,3.0] is equal to the area under the curve.
Figure obtained with Mathematica
1.4 Mathematical Tools 21
xZþ Dx Zx xZþ Dx
The last equality of the first line above shows that f(n) is the mean value of f(x) in
the interval Dx. When applied to a general interval [a,b], this equality expresses the
so-called mean value theorem for integrals. In turn, comparison of the second and
third equalities above allows one to conclude that f(n) ! f(x) as Dx ! 0.
Without loss of generality, we can assume that the parameter t is the time
variable. The particle is driven by a force F over the path defined by the position
vector r(t). When the particle undergoes the displacement dr during the time
interval dt, the infinitesimal work dW done by F on the particle is given by the
scalar product F.dr and the integral of dW over the curve C is
Z Z Ztf
dr
W¼ dW ¼ F:dr ¼ F: dt ð1:76Þ
dt
c ti
where ti and tf are the limits of integration (the initial and final values of t), and r(ti)
and r(tf) denote the initial and final positions of the particle on the curve C. The last
equality of (1.76) results from
dr
dr ¼ dt ð1:77Þ
dt
[see (1.63)]. From the last member of (1.76), the curve can be seen as consisting of
a finite number of infinitesimal segments, each of which has a tangent given by dr/
dt defined at each point of the curve. The direction of the tangent varies continu-
ously with t. Since the integral (1.76) depends on the initial and final points of the
path and on the path itself, it is called a line integral or path integral. The
path-dependence of (1.76) is reflected by the scalar product F . dr/dt, since both
F and dr/dt are path-dependent functions. Here dW is an inexact differential.
22 1 Thermodynamics
ZB
Df ¼ df ¼ f ðBÞ f ðAÞ ð1:78Þ
A
If the integration of an exact differential is done over a cycle (Fig. 1.14), then the
initial and final points of integration coincide, and
I
df ¼ 0 ð1:79Þ
Where
Z Z Z
df ¼ df ¼ df ð1:81Þ
A!Bð1Þ B!Að2Þ A!Bð2Þ
The equality between the first and the last members of (1.81) allows one to
conclude that the integral of an exact differential is path-independent, because the
paths 1 and 2 that link A and B were arbitrarily chosen. Then it can be concluded
that (1.79) is a necessary and sufficient condition for df to be an exact differential
[necessary condition: if df is an exact differential, then (1.79) is valid; sufficient
condition: if (1.79) is valid, then df is an exact differential].
Every thermodynamic study begins with a clear distinction between the object of
study, the system, and everything else, the exterior. Thus, the system is absolutely
central to every thermodynamic study. One can say that the system is the “pro-
tagonist” in thermodynamics. It may be a gas, two metal blocks in contact, a liquid
and its gas in equilibrium, a solute in a solution, a chemical reaction. Besides the
system composition, a thermodynamic study requires the knowledge of its
boundaries, since interactions may occur between the system and its surroundings.
On the whole, system and exterior form the universe of the transformation.
Systems can be classified into three types, isolated, closed, and open, according
to the way they interact with the exterior. An isolated system does not exchange
matter or energy with the exterior (Fig. 1.15). The exterior of an isolated system is
irrelevant, and it may be considered nonexistent. Thus, an isolated system coincides
with the universe of the transformation. For this reason, isolated systems have
considerable theoretical importance, in particular in the formulation of the first and
second laws of thermodynamics, since these laws become simpler and more
comprehensive when they are referred to isolated systems. However, the complete
and effective isolation of a system is difficult to realize, since it requires walls that
behave as perfect insulators. A closed system does not exchange matter with its
surroundings, and an open system exchanges matter and energy with its sur-
roundings (Fig. 1.15).
When the variables needed to characterize a system take defined values, the
system is said to exist in a thermodynamic state, and the variables are called state
variables. Pressure p, temperature T, volume V, and amount of substance nk (the
index k distinguishes different substances) are state variables used in describing
equilibrium states. Variables such as volume and amount of substance provide
information about the size of a system and for that reason are called extensive
variables. On the other hand, variables such as pressure and temperature are local
variables. As such, they do not provide information on the size of the system and
are called intensive variables. The ratio of two extensive variables that have the
same size-dependence is an intensive variable. For example, the ratio of mass to
volume is density, an intensive variable.
Fig. 1.15 Systems can be classified according to the way in which they interact with the exterior
24 1 Thermodynamics
The intensive variables temperature and pressure are associated with two funda-
mental types of equilibrium, namely thermal equilibrium and mechanical equilib-
rium. Consider an isolated system formed by two blocks of the same metal, at
temperatures T1 and T2, with T1 > T2 (= T1 − dT) (Fig. 1.16). Heat dQ flows
spontaneously from the part of the system at temperature T1 to the part of the
system at temperature T2, until both parts reach the same temperature. One says that
the system has then attained thermal equilibrium. It is assumed that small vari-
ations in volume (part 1 reduces its volume as it cools; part 2 increases its volume
as it warms) can be easily accommodated by the system, since the heat transferred is
Fig. 1.17 At constant temperature, a frictionless wall moves from higher to lower pressures
small. Note that a system in thermal equilibrium has the same temperature in all
points.
We now consider a closed system at constant temperature formed by a gas in two
compartments with pressures p1 and p2, and p1 > p2 (= p1 − dp). A frictionless wall
separates these compartments and moves spontaneously from higher to lower
pressures (Fig. 1.17). When the pressures become equal, we say that the system has
attained mechanical equilibrium. A system in mechanical equilibrium has the
same pressure at all points.
(b)
(a)
Fig. 1.18 a The system does work on the exterior during an isothermal expansion. b Apart from
its sign, the mechanical work is equal to the area under the curve p(V)
26 1 Thermodynamics
change produced on the system, that is, the work done on expansion is negative. In
turn, the work done on compression is positive, just like the corresponding energy
change of the system (on compression, the energy of the system increases). For an
infinitesimal quantity of work, one has
F
dW ¼ Fdx ¼ Adx ¼ pdV ð1:83Þ
A
where F is the force exerted on the moving wall or piston and A is the area of the
piston. Integration of (1.83) for the whole expansion leads to
ZVf
W ¼ pdV ð1:84Þ
Vi
where Vi and Vf are the initial and final volumes, respectively. The absolute value of
the work is the area under the curve p(V) (Fig. 1.18b). Work is a path-dependent
quantity, or process-dependent quantity, not a state function (E8). Mathematically,
dW is an inexact differential.
dUuniv ¼ 0 ð1:85Þ
For an isolated system, the exterior is irrelevant, because an isolated system does
not exchange matter or energy with its surroundings. Therefore,
dUisol ¼ 0 ð1:86Þ
In a closed system, the system can exchange energy with the surroundings in two
ways, through heat flow and through work. Representing by dQ the heat exchanged
between the system and its surroundings during the time interval dt and by dW the
1.7 First Law 27
work exchanged between the system and its surroundings during the same time
interval, then
dU ¼ dQ þ dW ð1:87Þ
The dQ and dW terms in (1.87) are positive if they correspond to heat and work
transferred from the exterior to the system (in this case, the system gains energy),
and negative if they correspond to heat or work transferred from the system to the
exterior (in this case, the system loses energy). This convention of signs for dQ and
dW is consistent with the system being the “protagonist” in thermodynamics.
Substitution of (1.83) in (1.87) leads to
dU ¼ dQ pdV ð1:89Þ
dQV ¼ dU ð1:90Þ
where dQV represents the exchanged heat at constant volume (Fig. 1.19). At
constant pressure,
where the added term (Vdp)p does not affect the result, because at constant pressure,
dp = 0, and so Vdp = 0. If we define the state variable enthalpy (en- = within;
thalpein = to heat),
H U þ pV ð1:92Þ
dQp ¼ dH ð1:93Þ
Note that U is a state function, both p and V are state variables, and the product
and sum of two state variables is a state function. This reasoning allows to conclude
that U + pV is a state function.
Expressions (1.90) and (1.93) can be used to define isochoric heat capacity (heat
capacity at constant volume, isochoric from Greek isos “equal” + choros
“space”),
dQV @U
CV ¼ ¼ ð1:94Þ
dT @T V
and isobaric heat capacity (heat capacity at constant pressure, isobaric from Greek
isos “equal” + baros “weight”),
dQp @H
Cp ¼ ¼ ð1:95Þ
dT @T p
c ¼ C=m ð1:96Þ
where c is the specific heat capacity and m is the mass, usually expressed in grams.
Thus, the specific heat capacity is the heat capacity for 1 g of substance. Whenever
it is possible and appropriate to deal with an amount of substance, then
Cm ¼ C=n ð1:97Þ
where Cm is the molar heat capacity and n is the amount of substance. Figure 1.20
shows the molar heat capacities at constant pressure (1 bar = 100,000 Pa), for two
important gases in the atmosphere, water and carbon dioxide, as functions of
temperature. At ambient temperature and pressure of 1 atm, the heat capacity for
liquid water approximately doubles the heat capacity for gaseous water.
1.7 First Law 29
Fig. 1.20 Molar heat capacities at constant pressure, for water and carbon dioxide in the gaseous
states, as functions of temperature. Values taken from the Handbook of Chemistry and Physics
2011, and graphs obtained with Mathematica
1.7.2 Calorimeter
where Ccal is the heat capacity of the calorimeter. In order to determine dQV from
a measured dT, Ccal has to be known beforehand. To this end, one chooses to
(a) (b)
Fig. 1.21 Schematic representation of calorimeters for determining the heat of a saturated
solution (at left) and the heat of combustion of a compound (at right)
30 1 Thermodynamics
The thermodynamic properties of a system depend on the nature and physical state
(gas, liquid or pure crystalline solid) of its component substances, as well as on the
physical variables such as temperature, pressure, and, in the case of solutions,
concentrations of solutes (partial pressures for gaseous mixtures). Thus, tabulated
data on a system, besides specifying its temperature, rely on a standard pressure p0
and, in the case of solutions, on a standard molality m0 or a standard molarity c0 for
each solute. Without such standard values, tabulated thermodynamic data could be
of little practical use, due to the dispersion of physical conditions. The most
common choice of tabulating data includes a reference temperature equal to
298.15 K, the pressure p0 = 1 bar (= 100,000 Pa), and, in the case of solutions, the
solute molality m0 = 1 mol of solute per kg of solvent, or the solute molarity
c0 = 1 mol of solute per dm3 of solution. The standard state of a pure substance at
a specified temperature is its most stable state (gas, liquid, or pure crystalline solid)
at the standard pressure p0 = 1 bar.
In the gaseous state, the standard state is the hypothetical state in which the gas
has ideal behavior at the standard pressure p0. This means that a real gas at 1 bar of
pressure is not in its standard state. Deviations from ideal behavior result from
intermolecular interactions that depend on the gas, its pressure, and temperature,
and so would not lead to a definable standard state.
The standard state of a pure liquid or a pure solid is its state at the standard
pressure p0. Unlike gases, liquids and solids have reduced compressibility, since
their volumes are little affected by pressure variation.
The standard state of a solute is its hypothetical state in a solution with ideal
behavior, at the standard concentration m0 or c0, and the standard pressure p0. Ideal
behavior corresponds to the absence of solute–solute intermolecular interactions,
implying that the solute vapor pressure is proportional to the mole fraction of the
solute in solution (Henry’s law). At standard concentration m0 or c0 and standard
1.7 First Law 31
pressure p0, the solute in a real solution is not in the standard state, since the
deviations from ideal behavior are solution-specific, that is, do not lead to a
definable standard state.
Absolute energies cannot be determined. Only energy changes can be measured
or evaluated. This applies to the energy U, the enthalpy H, and other thermody-
namic state functions later considered in this chapter.
The enthalpies of formation of all elemental substances in their standard states
are zero, at all temperatures. Hence, at a specified temperature, the standard molar
enthalpy of formation Hfo of a compound is the difference between its molar
enthalpy in its standard state and the enthalpies of its constituent elements in their
standard states. In addition, at a specified temperature, the standard reaction en-
thalpy Hro is the difference between the stoichiometric sum of the enthalpies of
formation of the products and the stoichiometric sum of the enthalpies of formation
of the reactants, with reactants and products being in their standard states (E11).
Tabulated data of standard thermodynamic properties at T = 298.15 K usually
include data for a large number of compounds (see Handbook of Chemistry and
Physics, 2011).
Can heat be converted into work with 100 % efficiency? All existing experimental
knowledge gives a negative answer to this question. Even in idealized conditions of
maximum efficiency, a fraction of the heat is necessarily released to a heat sink
without being converted into work. In contrast, work can be converted into heat
with 100 % theoretical efficiency. In order to understand the consequences of this
asymmetry of nature, we consider the heat engine (a device that converts heat into
work) functioning under idealized conditions of maximum efficiency.
The first thermodynamic studies on the heat engine were strongly impelled by
the great impact of the steam engine on charcoal extraction, transportation, agri-
culture, and industry during the so-called industrial revolution, from the second half
of the eighteenth century to the first half of the nineteenth. In particular, the studies
carried out by Sadi Carnot (1796–1832) led to an understanding of the conditions
for maximum efficiency of the heat engine.
Under idealized conditions of maximum efficiency, the essential workings of the
heat engine are schematically shown in Fig. 1.22. During a complete cycle, the heat
engine absorbs heat Qh from the heat source (hot reservoir) at temperature Th,
converts part of Qh into work W, and discards the remaining heat Qc (= Qh − W) to
the heat sink (cold reservoir) at temperature Tc (note that Qh, W, and Qc represent
absolute values). After performing these actions, the heat engine returns to its initial
state to complete a cycle. Hence, the variations of all state variables in the heat
engine (our system) are zero. The temperatures of the heat source and heat sink are
not affected by the release or capture of the heats Qh and Qc, respectively. Hence,
32 1 Thermodynamics
the heat source and the heat sink are assumed to be infinite heat reservoirs, that is to
say, to have infinite heat capacities. The efficiency of the heat engine is given by
W
g¼ ð1:99Þ
Qh
Qh Qc ¼ W ð1:100Þ
Qc
g¼1 ð1:101Þ
Qh
Thus, the heat engine efficiency is a positive quantity smaller than 1. Maximum
efficiency corresponds to the minimum value of Qc/Qh, that is, to the minimum
value of Qc for the same value Qh. The maximum efficiency of the heat engine
cannot depend on the engine materials or engine gears, since otherwise, the effi-
ciency could in principle be surpassed by an improved choice of these items. This
reasoning leads to the conclusion that the maximum efficiency of a heat engine can
depend only on the temperatures of the heat source and heat sink, sh and sc,
Qc
gmax ¼ 1 ¼ 1 f ðsc ; sh Þ ð1:102Þ
Qh
Carnot’s theorem. Note that under the conditions of maximum efficiency, volume
changes associated with mechanical work cannot produce thermal gradients, for
otherwise, additional heat would be released to the heat sink, and maximum effi-
ciency would not be attained. Hence, the heat engine with maximum efficiency
takes unlimited time to accomplish its operations. Only then can the heat engine
1.8 Reversible Heat Engine 33
(a) (b)
Fig. 1.23 A Carnot engine cycle is formed by an isothermal expansion (AB), an adiabatic expansion
(BC), an isothermal compression (CD), and an adiabatic compression (DA). a p = Rh/V surface
with isothermal and adiabatic curves. b Surface (a) projected in the plane (p, V, 100) seen from the
point (p, V, h) = (0, 10, 200). Ideal gas surface with isotherms and adiabatic curves obtained with
Mathematica
follow a reversible path and attain both thermal and mechanical equilibrium at each
point in the path, thus being a reversible heat engine.
Carnot’s heat engine consists of an ideal gas that operates over a reversible cycle
formed by steps A ! B (isothermal expansion), B ! C (adiabatic expansion; see
§3), C ! D (isothermal compression), and D ! A (adiabatic compression)
(Fig. 1.23) (see Mathematica code M5). The ideal gas temperatures for the heat
source and the heat sink are represented by hh and hc, respectively.
It can be shown (see §4) that the Carnot engine’s efficiency depends only on the
temperatures of the heat source and heat sink according to the expression
Qc hc
gCarnot ¼ 1 ¼1 ð1:103Þ
Qh hh
where hh and hc are the temperatures given by the ideal gas empirical scale of
temperatures, (E12).
As shown before, the heat engine’s efficiency is a positive quantity less than 1 [see
(1.101)]. For a reversible heat engine, the efficiency depends only on the
34 1 Thermodynamics
temperatures of the hot and cold reservoirs. These two conclusions are combined in
(1.102), which, in turn, leads to
Qc
¼ f ðsc ; sh Þ ð1:104Þ
Qh
Qc Q0 0 0 f ðsc ; s0 Þ gðsc Þ
¼ f ðs c ; s Þf ðs ; s h Þ ¼ ¼ ð1:105Þ
Q0 Qh f ðsh ; s0 Þ gsh
gðsÞ ¼ c s T ð1:106Þ
From (1.104) and (1.105), it can be concluded that the functions f and g can have
only positive values. Thus, the identity of (1.106) defines a scale of absolute
temperatures. Substitution of (1.106) in (1.105) and then in (1.102) leads to
Qc Tc
gmax ¼ 1 ¼1 ð1:107Þ
Qh Th
Qh Qc
þ ¼0 ð1:108Þ
Th Tc
Comparison of (1.107) with (1.103) allows on to conclude that the scales of the
ideal gas temperature and absolute temperature coincide.
Consider now a heat engine that operates in a cycle between temperatures Th and
Tc and absorbs Qh from the heat source but produces less work than the corre-
sponding Carnot engine (Wirr < W), since volume changes during production of
mechanical work are associated with thermal gradients. These volume changes lead
c [ Qc Þ. For this irreversible
to irreversible heat discarded to the heat sink ðQirr
cycle,
Qh Qirr
þ c
\0 ð1:109Þ
Th Tc
The equality for a Carnot cycle (1.108) can be generalized to an arbitrary reversible
cycle. To this end, consider a reversible cycle and the enclosed area occupied by a
mesh of Carnot cycles in which the temperatures of the isotherms differ by DT. This
mesh comprises two types of Carnot cycle (Fig. 1.25): those that are completely
inside the arbitrary cycle and share their isothermal and adiabatic transformations
with four neighboring Carnot cycles, and those that are over the cyclic path and
have fewer than four neighboring Carnot cycles. Transformations that are shared by
two contiguous Carnot cycles are carried out in opposite directions in both cycles
and cancel out. The isothermal and adiabatic transformations that are not canceled
describe a zigzag path over the periphery of the arbitrary cycle (Fig. 1.25). In the
limit of infinitesimal Carnot cycles (the difference between temperatures of con-
secutive isotherms tends to zero, DT ! 0, and consecutive adiabatics gradually get
closer), the zigzag path becomes indistinguishable from an arbitrary reversible
cycle. Hence, the generalization of (1.108) leads to
36 1 Thermodynamics
Fig. 1.25 An arbitrary reversible cycle can be covered to the required precision by a mesh of
infinitesimal Carnot cycles. Carnot cycles are carried out in the same direction (in this figure, they
are performed in the clockwise direction). Figure obtained with Mathematica
I
Qh Qc dQrev
þ ¼0 ) ¼0 ð1:110Þ
Th Tc T
Unlike Qh and Qc, which represent absolute values, dQrev includes a positive or
negative sign for absorbed or released heat, respectively. The last equality of
(1.110) shows that dQrev/T is an exact differential [see (1.79)], that is, dQrev/T is the
differential of a state function, which was named by Rudolf Clausius (1822–1888)
entropy (en “inside” + from Greek tropé “transformation”) and is usually rep-
resented by a capital S:
I I
dQrev dQrev
¼0 dS ¼ 0 dS ¼ ð1:111Þ
T T
where the inequalities hold for irreversible processes. An irreversible process occurs
in a specific direction, not in its reverse, and evolves to equilibrium. Examples of
irreversible processes are the flow of heat from a higher to a lower temperature, the
motion of a piston from a higher to a lower pressure, the melting of an ice cube at
ambient temperature, the fall of a weight from a higher to a lower altitude.
The last inequality of (1.113), called Clausius’s inequality, shows that dS is
greater than dQ/T when the latter corresponds to an irreversible exchange of heat
between system and surroundings. Hence, for an irreversible process, the difference
between dS and dQ/T is always greater than zero,
dQ
dS [0 ð1:114Þ
T
According to Ilya Prigogine (1917–2003, Nobel Prize in chemistry in 1977), a
distinction should be made between two terms included in the entropy change
dS (Fig. 1.26) (see Kondepudi and Prigogine, Further Reading). The first, deS, is
the transfer of entropy across system boundaries due to energy and matter ex-
changes with the surroundings. In principle, these energy and matter exchanges can
be performed reversibly [deS = dQrev/T; see the last equality of (1.111)], and the
corresponding entropy change deS is either positive or negative (no thermodynamic
law can be formulated on this term). In turn, the second term, diS, is the entropy
produced within the system by irreversible processes that drive the system toward
equilibrium. This term is always positive. The following expressions summarize
these considerations:
dQ
dS ¼ de S þ di S de S ¼ di S [ 0 ð1:115Þ
T
The statement according to which an irreversible process within the system has
always a positive entropy change (diS > 0) is one of the formulations of the second
law of thermodynamics (E20). While the equilibrium is a time-independent state,
an irreversible process has time as an associated external variable, and the arrow of
time points toward equilibrium. Therefore, for an irreversible process,
Fig. 1.26 Irreversible processes that occur inside the system are associated with an increase of
entropy
38 1 Thermodynamics
di S
[0 ð1:116Þ
dt
We begin with an isolated system formed by two blocks of the same metal at two
different temperatures T1 and T2 (Fig. 1.27). An isolated system does not exchange
energy or matter with the surroundings, and so dV = 0, dU = 0, and deS = 0 (E15).
For an infinitesimal heat dQ (dQ is an absolute value) passing from the metal block
at higher temperature to the metal block at lower temperature, we can write
dQ dQ 1 1
di S ¼ þ ¼ dQ [ 0 ð1:117Þ
T1 T2 T2 T1
since the entropy is an additive function, and for the rate of entropy production,
Fig. 1.27 In an isolated system, heat flows irreversibly from a higher to a lower temperature
1.10 Irreversible Processes 39
di S 1 1 dQ
¼ [0 ð1:118Þ
dt T2 T1 dT
with diS being produced in time dt. As we know from experience, heat flows
irreversibly from higher to lower temperature, and this process corresponds to
diS greater than zero.
Consider now a closed system at a defined temperature (dT = 0), with fixed boundaries
(dV = 0) and a frictionless wall separating a gas at pressures p1, on one side of the
piston, and p2, on the other side of the piston (p1 > p2) (Fig. 1.28). The changes in
volume of parts 1 and 2 of the system have the same absolute value, and so
The system, consisting of a gas with a frictionless wall separating two parts at
different pressures, exchanges heat with a thermostat in its surroundings, to
maintain a constant temperature. The frictionless wall moves from higher to lower
pressure, subject to the pressure difference p1 − p2 (p1 > p2). Then,
Tde S ¼ dU ð1:120Þ
Therefore,
di S p1 p2 dVa
¼ [0 ð1:122Þ
dt T dt
The situation depicted in Fig. 1.29 represents a closed system consisting of two
solutions with the same solute and same solvent, at constant temperature, separated
by a semipermeable membrane that hypothetically allows the passage of the solute
but not the solvent. It is further assumed that the parts of the system have the same
volume, but the amount of solute is greater in part 1 than in part 2. Considering that
the system has fixed boundaries (dV = 0), the changes in the amounts of solute are
associated with irreversible diffusion of matter through the semipermeable mem-
brane. The equation for dU is
where
@U @U
l1 ¼ l2 ¼ ð1:124Þ
@n1 S;n2 @n2 S;n1
are called the chemical potentials for the solute in parts 1 and 2, and
where dn1 is negative (the amount of solute in part 1 decreases), dn2 is positive (the
amount of solute in part 2 increases) and dn is the extent of solute spontaneously
diffused through the semipermeable membrane in the time interval dt. From
(1.124), we can conclude that the chemical potential is an intensive variable. The
dn1 and dn2 terms account for changes in energy U due to solute transfer from part
1 to part 2. Expression (1.123) shows that U = U (S, n1, n2), since the volume of the
whole system is kept constant (V is not a variable in this experiment).
Making use of the first equality in (1.115), Eq. (1.123) can be split in two
equations, one referring to the exchange of energy across the system boundaries
(the whole system is kept at constant temperature),
Fig. 1.29 At constant temperature, matter diffuses through the membrane from higher to lower
chemical potential
1.10 Irreversible Processes 41
dU ¼ Tde S ð1:126Þ
the other being due to irreversible changes that occur inside the system, namely, in
the amounts of the solute in parts 1 and 2, dn1 and dn2, and in the corresponding
entropy change diS,
Note that addition of (1.126) and (1.127) gives back (1.123). Substitution of
(1.125) in (1.127) leads to
1 l l2
di S ¼ ðl1 dn1 þ l2 dn2 Þ ¼ 1 dn [ 0 ð1:128Þ
T T
and
di S l1 l2 dn
¼ [0 ð1:129Þ
dt T dt
Therefore, diffusion of matter occurs from the higher chemical potential l1 to
the lower chemical potential l2. In the irreversible process of matter flow, l1
decreases, whereas l2 increases until equality between the chemical potentials l1
and l2 is attained at equilibrium.
Consider now a closed system with the following chemical reaction at a defined
temperature T and pressure p,
where the identity defines the state variable n, called the extent of the chemical
reaction. The stoichiometric coefficients for the reactants are multiplied by −1
because the infinitesimal changes in the corresponding amounts of the reactants, dnA
and dnB, are negative. Equation (1.131) can be rewritten in the more general form
dnk ¼ vk dn ð1:132Þ
42 1 Thermodynamics
where mk takes the values −a, −b, c, d, which are called stoichiometric numbers.
Combining the first law (1.87) and the second law (1.115) and generalizing from
(1.127), we can write
1
di S ¼ ðlA dnA þ lB dnB þ lC dnC þ lD dnD Þ
T
1
¼ ðalA blB þ clC þ dlD Þdn ð1:135Þ
T
Theophile De Donder (1872–1957) defined the state variable called affinity of
the chemical reaction as the stoichiometric sum of the chemical potentials of the
reactants minus the stoichiometric sum of the chemical potentials of the products,
A
di S ¼ dn [ 0 ð1:137Þ
T
Considering now that dn refers to the time interval dt, we can write
di S A dn
¼ [0 ð1:138Þ
dt T dt
where dn/dt is the rate of the chemical reaction. As the chemical reaction proceeds
to equilibrium, where dn/dt = 0, the chemical affinity decreases to zero, a value
attained at equilibrium.
which simplifies to
@U @U X @U
Sþ Vþ nk ¼ U
@ðkSÞ @ðkVÞ k
@ðknk Þ
@U @U X @U
Sþ Vþ nk ¼ U ð1:142Þ
@S @V k
@nk
X
U ¼ TS pV þ lk nk ð1:143Þ
k
This equality, known as the Gibbs–Duhem equation, shows that the intensive
variables T, p, and lk are not independent variables. For a system with one com-
ponent, the Gibbs–Duhem equation takes the form
dl ¼ Sm dT þ Vm dp ð1:146Þ
where Sm and Vm represent the molar entropy and molar volume, respectively. At a
specified temperature (dT = 0), integration of (1.146) gives
Zl Zp
dl ¼ Vm dp ð1:147Þ
l0 p0
where the lower limits are the chemical potential and pressure of a reference state
(usually, p0 = 1 bar), and the upper limits are the chemical potential and pressure as
variables (this equation is an equality between indefinite integrals). After integration
of the first member, we obtain
Zp
lðp; TÞ ¼ l ðTÞ þ
0
Vm ðp; TÞdp ð1:148Þ
p0
We will now proceed by applying this equation to ideal and real gases, liquid
solutions, pure liquids, and solids.
Considering an ideal gas mixture, the chemical potential for the k component is
given by
pk
lk ðpk ; TÞ ¼ l0k ðTÞ þ RT ln ð1:150Þ
p0
pk ¼ x k p ð1:151Þ
with xk the molar fraction for the k component and p the total pressure of the
gaseous mixture. Substitution of (1.151) in (1.150) gives
p
lk ðp; T; xk Þ ¼ l0k ðTÞ þ RT ln 0 þ RT ln xk ¼ l0k ðp; TÞ þ RT ln xk ð1:152Þ
p
Let us now consider the chemical potential for a real gas and its deviation from the
corresponding ideal gas that defines the reference state. In order to compare molar
volumes of real and ideal gases, we take the compressibility factor Z defined by
RT
Vm;real ¼ ZVm;ideal ¼ Z ð1:153Þ
p
This definition shows that Z = 1 for the ideal gas. Subtracting Vm,ideal from each
member of the above equation leads to
Z1
Vm;real Vm;ideal ¼ ðZ 1ÞVm;ideal ¼ RT ð1:154Þ
p
Making use of (1.148) for the real and ideal gases and noting that the standard
state is defined by the ideal gas, we can write
Zp
lreal ðp; TÞ ¼ l0ideal ðTÞ þ Vm;real ðp; TÞdp
p0
ð1:155Þ
Zp
lideal ðp; TÞ ¼ l0ideal ðTÞ þ Vm;ideal ðp; TÞdp
p0
46 1 Thermodynamics
Now we take the difference between these equations and use (1.154) to obtain
Zp
Z1
lreal ðp; TÞ ¼ lideal ðp; TÞ þ RT dp ð1:156Þ
p
p0
G.N. Lewis (1875–1946) defined a quantity with pressure units called fugacity
f that gives a pressure-dependence similar to that of the ideal gas,
f
lreal ðp; TÞ ¼ lideal ðp; TÞ þ RT ln ð1:157Þ
p
[compare with (1.149)]. Fugacity is a corrected pressure that takes into account
deviations from the ideal gas’s behavior due to the presence of intermolecular
interactions. When the pressure approaches zero, a real gas acquires ideal gas
behavior. Hence,
f
limp!0 ¼1 ð1:158Þ
p
where xk
1 and the dependence on pressure has been ignored, since liquids, and
condensed phases (liquids and solids) in general, have little sensitivity to pressure.
For nonideal solutions, Lewis introduced the concept of activity to keep the same
form of (1.159), and wrote
ak ¼ c k x k ð1:161Þ
and ck is called the activity coefficient (ck = 1 implies ak = xk, that is, ideal
behavior).
1.11 Chemical Potential 47
ZT Zp
lðp; TÞ ¼ lðp ; 0Þ
0
Sm ðTÞdT þ Vm dp ð1:162Þ
0 p0
Solids and liquids are little affected by pressure changes. Hence, the molar
volume can be considered approximately a constant. Then,
where
ZT
l0 ðTÞ lðp0 ; 0Þ Sm ðTÞdT ð1:164Þ
0
Making use of (1.160) for one component and of (1.163), we can write
For solids and liquids, l0(T) is several orders of magnitude greater than
VmDp. For instance, for liquid water and Dp = 1 bar (= 100000 Pa), VmDp = 1.8
J mol−1 and l0(T) = −237.1 kJ mol−1 (chemical potential value taken from
Handbook of Chemistry and Physics, 2011). Hence, within a good approximation,
the activity of pure solids and liquids is equal to 1 (ln a 0).
At constant temperature and pressure, the state function that evolves to a minimum
at equilibrium is the Gibbs energy, defined by the following identity:
G U þ pV TS ð1:166Þ
Differentiating G and using (1.89) and (1.115) for a closed system, we obtain
Fig. 1.30 At equilibrium, the Gibbs energy attains a minimum, and the reaction affinity is zero.
Graphs obtained with Mathematica
This result shows that at constant temperature and pressure, the Gibbs energy
evolves through an irreversible process to a minimum at equilibrium (diS > 0, and
so dGT,p is negative; at the minimum, the entropy is a maximum). When the
irreversible process is a chemical reaction,
X
dGT;p ¼ Tdi S ¼ lk dnk ¼ Adn ð1:169Þ
k
where A is the chemical reaction affinity [see (1.132) and (1.136)]. Making use of
(1.63) in (1.169), we can write
@G @G
¼ lk ¼ A ð1:170Þ
@nk T;p;nj ðj6¼kÞ @n T;p
The second of the above equations shows that the partial derivative of the Gibbs
energy with respect to the extent of the reaction, at constant temperature and
constant pressure, is the negative of the chemical reaction affinity. At equilibrium,
the Gibbs energy attains a minimum, and the chemical reaction affinity is zero
(Fig. 1.30).
The first of the above equations shows that the partial derivative of the Gibbs
energy with respect to the amount of substance of the k component, at constant
temperature, constant pressure, and fixed amounts of other substances in the
reaction mixture, is equal to the chemical potential of the k component of the
1.12 Gibbs Energy 49
chemical reaction mixture. While this is the formal relationship between chemical
potential and Gibbs energy (the chemical potential is the partial molar Gibbs
energy), we can proceed to find a more pragmatic expression between chemical
potential and Gibbs energy as experimentally measurable physical quantities.
Gibbs energy is an extensive state variable. In order to understand the rela-
tionship between standard Gibbs energies of formation and standard chemical
potentials, we substitute (1.143), obtained during the derivation of the Gibbs–
Duhem equation, into (1.166), the definition of Gibbs energy, leading to
X
G¼ lk nk ð1:171Þ
k
G
l ¼ Gm ¼ ð1:172Þ
n
thus concluding that the chemical potential is a molar Gibbs energy, Gm (chemical
potential and molar Gibbs energy have the same units, energy per mole). Therefore,
G is an extensive state function proportional to n, with the proportionality constant
being the chemical potential. However, in order to determine Gibbs energies
experimentally, zeros for the Gibbs energies and chemical potentials need to be
established. As mentioned before, the standard state of a pure substance at a
specified temperature is its most stable state (gas, liquid, or pure crystalline solid) at
the standard pressure p0 = 1 bar. At a specified temperature, the standard molar
Gibbs energy of formation DG0f of a particular compound is the difference
between the molar Gibbs energy of this compound in its standard state and the
Gibbs energies of its constituent elements in their standard states. Since atoms of
elemental substances do not transmute in chemical reactions (only nuclear reactions
outside the thermodynamic realm can lead to transmutation of chemical elements),
all elemental substances in their standard states are assumed to have a standard
chemical potential and a standard molar Gibbs energy of formation equal to zero at
all temperatures,
where a generically denotes elemental substances. Having established the zeros for
the standard chemical potentials and the standard molar Gibbs energies of formation
of compounds, we can now conclude that at a specified temperature, the standard
chemical potential of substance k is equal to the standard Gibbs energy of formation
of the same substance,
Combining the definition of Gibbs energy [see (1.166) and (1.92)] with (1.175),
we can write
@G
G ¼ H þT ð1:176Þ
@T p
that is,
1
@ TG H
¼ 2 ð1:178Þ
@T T
where both reactants and products are gases. Breaking down the Gibbs energy of
the chemical reaction into the Gibbs energies of formation of products and reac-
tants, and using the relationship between Gibbs energy of formation and chemical
potential, we can write
acC adD
v¼ ð1:183Þ
aaA abB
When approaching chemical equilibrium, ΔGr tends to zero, and the reaction
quotient tends to the equilibrium constant,
DGr ! 0 v ! Ke ð1:186Þ
where
acC;eq adD;eq
Ke ¼ ð1:187Þ
aaA;eq abB;eq
52 1 Thermodynamics
(E16; see Mathematica code M7). From (1.160) and (1.184), it can be concluded
that the equilibrium constant is a function of temperature only. This important result
is at the origin of the expression “equilibrium constant.”
there are no irreversible processes in the system, and the entropy production is zero
[see (1.169)],
di S lliq ðp; TÞ lgas ðp; TÞ dn
¼ ¼0 ð1:190Þ
dt T dt
Hence,
p
l0liq ðTÞ ¼ l0gas ðTÞ þ RT ln 0 ð1:191Þ
p
[see (1.149) and (1.163)], where p is the vapor pressure of the pure liquid. After
rearrangement, we can write
p l0liq ðTÞ l0gas ðTÞ
ln 0 ¼ ð1:192Þ
p RT
(E17). This equation allows us to conclude that the vapor pressure is a function of
temperature, a functional dependence that corresponds to the p–T curve of the
liquid–gas equilibrium in the phase diagram. Equilibria between any two phases of
one-component systems also lead to p-T curves of phase diagrams. Note that these
curves relate two intensive variables, namely p and T.
Besides pressure and temperature, solutions have additional intensive variables
related to composition. Let us consider a system at equilibrium containing C com-
ponents in each of its P phases: how many independent intensive variables or
degrees of freedom does such a system have?
If composition variables are expressed by mole fractions, then the sum of all
mole fractions in each phase equals 1. In particular, for phase a,
X
C
ðaÞ
xk ¼ 1 ð1:193Þ
k¼1
A 1 ¼ 0 A2 ¼ 0 . . . AR ¼ 0 ð1:196Þ
f ¼ ðC RÞ P þ 2 ¼ Ci P þ 2 ð1:197Þ
At constant temperature and volume, the state function that evolves to a minimum
at equilibrium is the Helmholtz energy, defined by the following identity:
F U TS ð1:198Þ
where the inequality results from the second law for irreversible processes (diS >
0), and the equality corresponds to equilibrium (diS = 0). Therefore, in a closed
system at constant temperature and volume, the Helmholtz energy evolves through
an irreversible process to a minimum at equilibrium (dFT,V is negative). When the
irreversible process is a chemical reaction,
X
dFT;V ¼ Tdi S ¼ lk dnk ¼ Adn ð1:201Þ
k
1.15 Helmholtz Energy 55
where A is the chemical reaction affinity [see (1.132) and (1.136)]. Making use of
(1.63) in (1.201), we can write
@F @F
¼ lk ¼ A ð1:202Þ
@nk T;V;nj ðj6¼kÞ @n T;V
The second of the above equations shows that the partial derivative of the
Helmholtz energy with respect to the extent of the reaction, at constant temperature
and volume, is the negative of the chemical reaction affinity. At equilibrium, the
Helmholtz energy attains a minimum, and the chemical reaction affinity is zero.
Consider now a system formed by a small liquid droplet in the air of a box, at
constant temperature and volume (Fig. 1.32). Since the gravitational attraction on
the droplet is negligible, the droplet takes an approximately spherical shape. It is
found that the diameter of the droplet is smaller than that of a sphere with the same
mass in the liquid bulk. At constant temperature and volume, the liquid droplet
reduces its diameter as the contracting force on its surface due to the surface
tension is balanced by an increase in the inside pressure (Fig. 1.32).
Let p′ and V′ be the pressure and air volume in the box, and p″ and V″ the
pressure and volume of the droplet. Then, using (1.87) and the second equation of
(1.115), we can write
The last term of this expression is the work required to increase the interfacial
area by dR, and the proportionality coefficient c is called surface tension, whose
units are energy per area or force per length. Substitution of (1.203) in the first
equality of (1.199) leads to
Fig. 1.32 At constant temperature and volume, a suspended liquid droplet reduces its diameter in
an irreversible process that ends when the contracting force of the surface is balanced by the
increase in the inside pressure. In this irreversible process, the Helmholtz energy evolves to a
minimum
56 1 Thermodynamics
where
@F
c¼ ð1:205Þ
@R T;V 0 ;V 00
that is, the surface tension is the Helmholtz energy per unit of interfacial area, at
constant T, V′, and V″. At 25 °C, water has a surface tension equal to 71.99 mJ m−2,
the surface tension of mercury is 485.48 mJ m−2, and most organic compounds
have, at temperatures below their boiling points, surface tensions between 15 and
50 mJ m−2 (values taken from Handbook of Chemistry and Physics, 2011).
The contraction of the liquid droplet in air is an irreversible process that makes
the system evolve to equilibrium in a time interval dt. Introducing constant tem-
perature (dT = 0) and constant overall volume (dV′ + dV″ = 0) in (1.204) and
making use of (1.200), we obtain
di S dV 00 dR
T ¼ ðp00 p0 Þ þc ð1:206Þ
dt dt dt
where V″ is the volume of the liquid droplet (V″ = 4pr3/3, dV″ = 4pr2dr) and R is its
surface area (R = 4pr2, dR = 8prdr). Substitution of these values in (1.206) leads to
di S
dr
T ¼ ðp00 p0 Þ4pr 2 þ c8pr ð1:207Þ
dt dt
During the evolution to equilibrium, the expression in square brackets tends to zero
and impels dr/dt to tend to zero, in the same way as a temperature gradient is the cause
for the corresponding irreversible heat flow or a concentration gradient is the cause for
an irreversible flow of matter. Therefore, at equilibrium, both the expression in square
brackets and dr/dt are zero. Hence, the excess pressure in the liquid droplet can be
expressed in terms of the surface tension and the droplet radius,
When the lower end of a vertical capillary tube is dipped into water, water rises in the
capillary tube as a manifestation of surface tension. Unlike the liquid droplet in air,
where there was a single interfacial surface, the liquid–air interface, the capillary tube
dipped into water has two interfaces involving water, namely the water–air and
water–glass interfaces. Water wets the glass, and so water rises in the capillary tube and
the meniscus is concave.
1.16 Surface Tension 57
The intersection between the liquid–air interface and the liquid–solid interface is
a circumference of a circle whose radius is equal to the internal radius of the
capillary tube. The angle at which the liquid is in contact with the capillary tube is
called the contact angle and is denoted by h (Fig. 1.33a). If the liquid wets the
glass, then this angle is smaller than p/2 (cosh > 0) and the height of liquid inside
the capillary tube is denoted by h (h > 0) (Fig. 1.33a). Since temperature is kept
constant and the total volume of liquid is also constant, the thermodynamic state
function that evolves to a minimum (equilibrium) under these conditions is the
Helmholtz energy [see the equality in (1.200)]. In the irreversible process leading to
equilibrium, the liquid inside the capillary tube gradually rises, and the pressure qgh
times the volume of the rising column of liquid inside the capillary tube (in units,
pressure volume = energy) is positive and increases. In turn, the surface tension
c cosh times the area of the interior wall of the capillary tube (in units, surface
tension area = energy) is negative and decreases as h increases. Hence, the
minimum of the Helmholtz energy results from a balance between these two
opposing quantities and is given by
Zh
DFðhÞ ¼ ðc cos hÞ2prh þ ðqhgÞpr 2 dh
0 ð1:209Þ
2 2
qgpr h
¼ ðc cos hÞ2prh þ
2
(a) (b)
Fig. 1.33 a For a liquid that wets the glass, one observes capillary rise, and the meniscus is
concave. The contact angle h is less than p/2 and cosh > 0. At equal heights throughout the liquid,
pressure is equal. b For a contact angle h greater than p/2, cosh < 0, and the liquid is depressed in
the capillary tube. This is the case of mercury, for example
58 1 Thermodynamics
@DFðhÞ 2c cos h
¼ ðc cos hÞ2pr þ qgpr 2 h ¼ 0 ) h ¼ ð1:210Þ
@h qgr
If the liquid does not wet the solid surface and has a high surface tension like that
of mercury, then the contact angle is greater than p/2 (cosh < 0), the liquid is
depressed inside the capillary tube, and the height of liquid inside the capillary tube
is negative (h < 0) (Fig. 1.33b). Note that the terms of (1.209) keep the same signs
as when there is capillary rise.
Fig. 1.34 Irreversible transport of potassium ions through a porous membrane destroys electrical
neutrality in the liquid–membrane and membrane–liquid interfaces
1.17 Membrane Potential 59
where n represents the extent of potassium ions passing through the membrane. For
each dni, besides the chemical potential associated with mass transfer, there is an
additional contribution of electric energy (= charge electric potential) associated
with the charge transfer. Representing by z the amount of transported ion charge
and by F the Faraday constant, equal to the charge of 1 mole of elementary charges
(F = eNA = 96485.3365 C mol−1, see Appendix), we can write, at constant volume
and temperature (dV = 0, dT = 0),
Making use of the first equality of (1.115), Eq. (1.212) can be split into two
equations, one for the exchange of energy across system boundaries (the whole
system is kept at constant temperature),
dU ¼ Tde S ð1:213Þ
Note that addition of (1.213) and (1.214) gives (1.212). Substitution of (1.211) in
(1.214) and rearrangement of the resulting equation leads to
Tdi S ¼ ð~
l1 l e
~2 Þdn ¼ Adn ð1:216Þ
e
l i ¼ li þ zF/i ð1:217Þ
60 1 Thermodynamics
e¼e
A l1 e
l 2 ¼ ðl1 l2 Þ þ zFð/1 /2 Þ ð1:218Þ
(see Kondepudi, Further Reading). The transport of potassium ions through the
membrane is an irreversible process. Hence, diS > 0 and from (1.216) we can write
l e
di S ðe l 2 Þ dn Ae dn
¼ 1 ¼ ð1:219Þ
dt T dt T dt
where dt is the time interval that corresponds to the entropy production diS.
At equilibrium, diS/dt = 0, and both the electrochemical affinity à and the rate of
the irreversible process dn/dt are zero (dn/dt = 0 is a consequence of à being zero).
Then, at equilibrium, the electrochemical potentials are equal,
e
l1 e
l2 ¼ 0 ð1:220Þ
1
/2 /1 ¼ ðl l2 Þ ð1:221Þ
zF 1
where the difference between electrical potentials is called the membrane potential
(a positive quantity). Substitution of (1.160) in (1.221) leads to
RT a1
/2 /1 ¼ ln ð1:222Þ
zF a2
Fig. 1.35 The selective irreversible transport of calcium ions through the membrane is a result of
their coordination by a ligand embedded in the membrane
1.18 Electrochemical Cell 61
Just try to submerge a copper wire into an aqueous solution of zinc sulfate (col-
orless). Nothing happens! However, if a zinc plate is immersed in a copper sulfate
solution (blue), the solution gradually becomes colorless, the zinc plate becomes
progressively eroded, and copper is deposited over it. This experiment can be
described by the following chemical equation,
di S Ae dn
¼ ð1:227Þ
dt T dt
dn I
¼ ð1:228Þ
dt nF
I is the electric current intensity, that is, the charge transported per unit of time, and
n is the number of electrons involved in the corresponding oxidation–reduction
reaction. In dealing with a galvanic cell, we use a procedure similar to the one
1.18 Electrochemical Cell 63
e
l e ¼ le F/ ð1:230Þ
where / is the electric potential, and the minus sign is due to the negative charge of
the electron. The electrochemical affinities of the anode and cathode half-reactions
are given by
where the superscripts a and c stand for anode and cathode, respectively. Since the
oxidation–reduction reaction is the sum of the oxidation and reduction
half-reactions, the electrochemical affinity of a galvanic cell à is also the sum of the
electrochemical affinities of the anode and cathode reactions,
X þ Yn þ ! Xn þ þ Y
ð1:232Þ
e¼A
A e c ¼ ðlX þ lYn þ Þ ðlXn þ þ lY Þ nFð/c /a Þ
ea þ A
This equation should be compared with (1.218), which refers to the electro-
chemical affinity of an ion-selective membrane. In the latter, the irreversible process
is a physical process consisting in the transfer of an ion through the semipermeable
membrane, whereas in the electrochemical cell, the irreversible process is an oxi-
dation–reduction reaction of a galvanic cell.
When à = 0, then dn/dt = 0 [see (1.227)], and the current intensity is zero [see
(1.228)]. Hence,
1
De ¼ /c /a ¼ ½ðl þ lYn þ Þ ðlXn þ þ lY Þ ð1:233Þ
nF X
64 1 Thermodynamics
lk ¼ l0k þ RT ln ak ð1:234Þ
Substitution of this expression in (1.233), with k taking the values X, Yn+, Xn+,
and Y, leads to the Nernst equation
RT a n þ aY
De ¼ De0 ln X ð1:235Þ
nF aX aY n þ
where
1
0
De0 ¼ lX þ l0Yn þ l0Xn þ þ l0Y ð1:236Þ
nF
is the standard cell potential. When chemical equilibrium is reached, the reaction
quotient, that is, the expression within parentheses in (1.235), becomes equal to the
equilibrium constant, and De = 0. Then we can write
RT
De0 ¼ ln Ke ð1:237Þ
nF
At 298.15 K,
For n = 1 and De0 = 0.5 V, log Ke 8.5, that is, Ke 108.5 3 108.
If the Nernst equation (1.235) is rewritten so that the cathode and anode
potentials are separate terms, one obtains
RT aY RT aX
De ¼ e0;c ln e0;a ln ð1:239Þ
nF aY n þ nF aX n þ
RT aX
Xn þ þ ne ! X e = e0 ln ð1:240Þ
nF aXn þ
Notes
§1. Pressure is defined as force per unit area. The SI unit of pressure is the pascal,
1 Pa = 1 N m−2. A column of liquid of uniform density q and height h exerts a
pressure qhg, where g represents the acceleration due to gravity; 1 mmHg is the
pressure exerted by a column of mercury of height 1 mm when g = 9.80665 m s−2 and
q(Hg) = 13.5951 g cm−3. 1 atm = 760 mmHg = 101325 Pa, 1 bar = 100,000 Pa.
Temperature is usually expressed in kelvin (K), degrees Celsius (°C), or degrees
Fahrenheit (°F). The Celsius and Fahrenheit scales are empirical, and use the
melting and boiling temperatures of water at 1 atm of external pressure as refer-
ences. For the Celsius scale, 0 °C corresponds to the normal melting point of water,
and 100 °C corresponds to the normal boiling point of water. In the Fahrenheit
scale, 32 °F corresponds to the normal melting point of water, and 212 °F corre-
sponds to the normal boiling point of water. The Kelvin scale is defined using the
second law of thermodynamics, namely, the efficiency of a reversible heat engine or
Carnot engine. The triple point of water is exactly 273.16 K (0.01 °C) and
611.73 Pa. The Kelvin and Celsius scales are related by T/K = T/°C + 273.15.
§2. Let v1 and v2 be the velocities of the test molecule O2 and of one target molecule
N2. The relative velocity is equal to v1 − v2. The absolute value of this vector is
given by
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
ðv1 v2 Þ ðv1 v2 Þ ¼ v21 þ v22 2v1 v2
The molecular motions are random, and so the velocities of molecules 1 and 2
are uncorrelated, that is,
h v1 v2 i ¼ 0
Hence,
pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
pffiffiffipffiffiffiffiffiffiffiffi pffiffiffi
hðv1 v2 Þ ðv1 v2 Þi ¼ v21 þ v22 2hv1 v2 i ¼ 2 hv2 i ¼ 2vrms
TV c1 ¼ constant
pV c ¼ constant
where c is defined by
c ¼ Cmp =CmV
and Cmp and CmV are the isobaric and isochoric molar heat capacities. For a
monatomic ideal gas, c = 5/3.
§4. In the following derivation, we use the definition of mechanical work given by
(1.84) and the expression for the first law that results from integration of both
members of (1.187), DU = Q + W. We assume 1 mole of ideal gas, so we deal with
molar volumes. A Carnot cycle consists of an ideal gas undergoing the following
four steps:
Notes 67
In this step, the gas absorbs QAB from the heat source at temperature hh and uses
this heat to do the work WAB on the surroundings. Note that the energy of an ideal
gas depends only on its temperature [see (1.19)]. Hence, for an isothermal trans-
formation, the energy change is zero, and so from the first law, QAB + WAB = 0:
ZB ZB
1
WAB ¼ pdV ¼ Rhh dV ¼ Rhh lnðVB =VA Þ
V
A A
For an expansion, VB > VA, and so WAB is a negative quantity (work done by the
gas on the surroundings).
pB VBc ¼ pC VCc
ZC ZC
pV c pC VCc VC1c pB VBc VB1c pC V C pB V B
WBC ¼ pdV ¼ dV ¼ þ ¼ þ
Vc 1c 1c 1c 1c
B B
Rðhc hh Þ Rðhh hc Þ
¼ ¼
1c c1
In this step, the work WCD done on the system during the gas compression is
given by
ZD ZD
1
WCD ¼ pdV ¼ Rhc dV ¼ Rhc lnðVD =VC Þ
V
C C
68 1 Thermodynamics
Being a compression, VD < VC, and WCD is a positive quantity. For an isothermal
transformation in an ideal gas, the energy change is zero, and so QCD + WCD = 0
(QCD corresponds to released heat, and is thus a negative quantity).
pD VDc ¼ pA VAc
ZA ZA
pV c pA VAc VA1c pD VDc VD1c
WDA ¼ pdV ¼ dV ¼ þ
Vc 1c 1c
D D
p A VA pD VD Rðhh hc Þ
¼ þ ¼
1c 1c c1
From the results obtained for WBC and WDA, we conclude that WBC + WDA = 0.
The work done on the Carnot reversible cycle is equal to the sum of the works
done in the four steps of the cycle. Using the above results (QAB + WAB = 0,
QBC = 0, QCD + WCD = 0, QDA = 0, and WBC + WDA = 0), we can write
where W, Qh, and Qc represent absolute values. The equations for the isothermal
transformations (pAVA = pBVB and pCVC = pDVD) and adiabatic transformations
(pBVcB = pCVcC and pDVcD = pAVcA) can be simultaneously solved, provided VB/
VA = VC/VD. This equality justifies the last equation above. Therefore, the Carnot
engine efficiency is given by
Qc hc
gCarnot ¼ 1 ¼1
Qh hh
Mathematica Codes 69
Mathematica Codes
This Mathematica code evaluates the normalization factor of f(v) in (1.27) and
plots this probability function for a gas of oxygen molecules at 100 and 300 K. The
normalization factor is calculated as the inverse of the definite integral of f(v) in
the range {0,∞} and requires the use of the Mathematica command
Integrate[f(v),{v,0,Infinity},Assumptions!{T>0,m>0,kB>0}]
where v is the integration variable, whose range is between zero and infinity, because v
represents a molecular speed, that is, an absolute value of the velocity vector, and the
option Assumptions indicates that the symbolic quantities T,m, and kB are positive.
They are assigned specific values in lines of code 3, 4, and 5, and line of code 6 outputs
the value of the norm for the oxygen molecule, at an unspecified temperature T.
The Mathematica command
Plot[f[v]/.T !{100,300},{v,0,1200}]
generates plots of functions f(v) in the range {0,1200}, where T is replaced by
100 and 300 (/. stands for the ReplaceAll Mathematica command). This is the
point where the variable T is assigned specific values, namely, 100 and 300. The
physical quantities m, v, and T, and the constants avogadro and kB are in SI
70 1 Thermodynamics
units. If you wish to run the above code for different molecular masses or different
temperatures and do not want to transfer universal constants and mass values from
one run to the next, you should use Clear[avogadro,kB,m] as the first line of
the code. This Mathematica function clears values and definitions of the specified
symbols, making it possible to evaluate the norm on the second line of code using
symbolic constants and variables.
Suggestion: Use the Graphics primitive Text to add the temperatures
“100 K” and “300 K” next to the corresponding curves in the above plot. If nec-
essary, consult Wolfram Documentation in the Help menu of Mathematica with the
words Graphics and Text.
This code shows how to use Mathematica to determine the critical values of
volume, temperature, and pressure for a Van der Waals fluid. The code solves the
system of Eqs. (1.49) and confirms the critical values presented in (1.50). The first
line of code defines the p[V,T] function for the Van der Waals equation, and the
Mathematica Codes 71
second line of code solves the system of Eqs. (1.49) to determine the critical values
of volume and temperature. The expression D[p[V,T],V] gives the partial
derivative of p with respect to V at constant T, and D[p[V,T],{V,2}] gives the
second-order partial derivative of p with respect to V at constant T. The solutions of
the system of equations are assigned to sol, which is a list of a list of length 2 (see
the first line of output; in Mathematica, a list is denoted by {…}). The second line
of output shows that sol[[1]] stands for the first part of sol, that is, a list of the
critical volume and critical temperature. The Print command of the fourth line of
code is used to print the string of characters inside quotation marks (“Critical
Volume”), and the next line of code uses ReplaceAll (/.) to replace V by the
first part of sol, that is, by sol[[1]]. The last line of code solves for the values
of a, b, and R that lead to critical values of volume, temperature, and pressure equal
to 1, thus to the Van der Waals equation in reduced variables.
This Mathematica code plots the Van der Waals equation in reduced variables
[see (1.53)] using the Mathematica command
Plot3D[f[V,T],{V,0.34,3},{T,0.34,3}]
where f[V,T] corresponds to the reduced pressure as a function of the reduced
volume V and reduced temperature T, in the above indicated ranges. Note that a
function of two independent variables is represented by a surface. The Van der
Waals isotherms of Fig. 1.11 are cross sections of this surface by planes that
correspond to defined temperature values. The option Lighting specifies the
simulated lighting used to color the plotted surface. Lighting!“Neutral”
simulates white light.
Suggestion: Run the above Mathematica code line leaving out the options
PlotStyle and Lighting.
72 1 Thermodynamics
This Mathematica code evaluates the critical temperature for carbon dioxide (see the
values of a and b in the Table for Van der Waals constants) and calculates reduced
temperatures corresponding to absolute temperatures from 274 to 344 K in steps of 10.
This Mathematica code plots the ideal gas surface with meshes for two
isothermal and four adiabatic transformations. The option for Plot3D that speci-
fies the number of isothermal and adiabatic transformations is
Mesh!{2,4}
and the option for the corresponding functions is
MeshFunctions!{#1&,#1*#2^(2/3)&}
where #1& is the pure function for an isothermal transformation (#1 is the first
argument of f[T,V]) and #1*#2^(2/3) is the pure function for an adiabatic
transformation (#2 is the second argument of f[T,V]). The adiabatic transfor-
mation is given by TVc −1 = constant, where c is equal to 5/3 for a monatomic ideal
gas (see §3).
Mathematica Codes 73
This Mathematica code calculates the efficiencies of Carnot heat engines [see
(1.107)] when the temperature of the hot reservoir is fixed at 1001 K and the
temperatures of the cold reservoir are 1, 10, 100, and 1000 K. The efficiencies of
Carnot engines are presented as percentages with three significant digits using the
Mathematica command SetPrecision. This code illustrates the use of Table to
obtain two lists of length 4: the list aa of four cold reservoir temperatures and the list
bb composed of the corresponding Carnot efficiencies. These two lists are interleaved
by the use of the Mathematica command Riffle. The command Partition par-
titions the list cc into nonoverlapping lists of length 2 composed of the temperature of
the cold reservoir Tc and the maximum Carnot’s engine efficiency η. Note that the
above code can be compressed into the following single line of code:
Partition[Riffle[Table[10^i,{i,0,3}],
SetPrecision[Table[(1.-10^i/1001)*100,{i,0,3}],3]],2]
The following code uses ListPlot to plot the list of points (Tc, η), where Tc
varies from 1 to 1000 in steps of 1, and η is indicated as a percentage. This plot
shows a linear dependence of η versus Tc, with η decreasing with the increase of Tc.
74 1 Thermodynamics
Consider a hypothetical chemical reaction R(g) ! P(g), where R and P are ideal
gases and the equilibrium constant is assumed to be 3.0. Taking the Gibbs energy
and the affinity as functions of the extent of the chemical reaction n [see (1.131) and
(1.132)] whose range is [0,1], the partial derivative of the Gibbs energy with respect
to n, at constant temperature and pressure, is equal to the negative of affinity of the
chemical reaction [see second equality of (1.170)].
The following Mathematica code shows how the Gibbs energy of the above
chemical reaction and the corresponding affinity evolve to equilibrium. In order to
obtain the Gibbs energy as a function of n, we write
X
G ¼ G0R þ DG
where G0R is the standard Gibbs energy of the reactant R (in the code, we assume
G0R ¼ 0) and DG is obtained by substituting (1.188) into (1.185), that is, using
v
DG ¼ RT ln
Keq
n
v¼
1n
i
n¼
1000
where i varies in the range [1,1000]. Therefore, each step of n is equal to 0.001, and
the command ListPlot[data] plots 1000 points. The Mathematica command
Fit finds a least squares fit to data as a linear combination of functions in the list
{1,x,x^2,x^3,x^4}, where x is real and corresponds to i. The value of x at
equilibrium is obtained by the code lines
sol=Solve[bestfit’[x]==0,x][[3]];
eq=x/.sol;
Solving bestfit’[x]==0 leads to three roots, the first two being complex
numbers, the third being real and equal to 750.181. The third root, denoted by sol,
is selected using [[3]] immediately after the Solve command. The value of x at
equilibrium is assigned to eq. Plots of the Gibbs energy G and affinity A are
restricted to the neighborhood of equilibrium.
Mathematica Codes 75
Suggestion: Write a Mathematica code to obtain the Gibbs energy and the
affinity as functions of the extent of the chemical reaction A + B ! 2C and assume
Keq = 100.
Glossary
Exercises
@U @U
dQ ¼ dT þ p þ dV
@T V @V T
@U @V
Cp CV ¼ p þ
@V T @T p
E10. The heat capacity of iron is 25.09 J mol−1 K−1. Calculate the entropy change
for 4 mol of iron heated from 273 K to 373 K, assuming that the heat capacity for
iron is constant in this temperature interval.
E11. Evaluate the standard reaction enthalpy for C3H8(g) + 5O2(g) ! 3CO2(g) +
4H2O(l), at 298.15 K ½DHfo ½C3 H8 ðgÞ ¼ 104:7 kJ mol1 ; DHof ½CO2 ðgÞ ¼
393:5 kJ mol1 ; DHof ½H2 OðlÞ ¼ 285:8 kJ mol1 .
E12. Rudolf Clausius expressed the second law in the following way: No process is
possible whose sole result is the transfer of heat from a body of lower temperature
to a body of higher temperature. In turn, Lord Kelvin used the following statement:
No process is possible in which the sole result is the absorption of heat from a
reservoir and its complete conversion into work. Show that Clausius’s and Kelvin’s
statements correspond to equivalent heat engines.
82 1 Thermodynamics
References
Angrist SW (1968) Perpetual motion machines. Sci Am
Further Reading
Cheung C-K, Keough GE, Gross RH, Landraitis C (2009) Getting started with mathematica, third
edn. Wiley, New Jersey
Kondepudi D (2008) Introduction to modern thermodynamics. Wiley, New Jersey
Kondepudi D, Prigogine I (1998) Modern thermodynamics: from heat engines to dissipative
structures. Wiley, New Jersey
Wolfram S (2015) An elementary introduction to the Wolfram Language. Wolfram Media, Inc.
Available online
Chemical Kinetics
2
Abstract
Chemical kinetics is the branch of chemistry that measures rates of chemical
reactions, studies the factors that influence them, designs and prepares new
catalysts, and interprets the results at the molecular level. The independent
variable of chemical kinetics, from the chemical reaction starting moment when
the reactants are mixed to its final moment when equilibrium is reached, is time,
a variable introduced by the second law of thermodynamics for irreversible
processes. The first study of the rate of a chemical reaction is credited to Ludwig
Wilhelmy in 1850 for the decomposition of sucrose (table sugar) into glucose
and fructose, in acid medium. Wilhelmy found that the rate of this chemical
reaction is proportional to the existing amount of sucrose at each instant in the
course of the chemical reaction. This chapter begins with sections on the rate of a
chemical reaction, the experimental rate equation, and the effect of temperature
change. We then consider elementary reactions, complex reactions, and
extremely fast reactions. Most chemical reactions function like one-way streets:
the concentrations of reactants decrease, those of reaction intermediates increase
at first and decrease later, and the concentrations of products increase. However,
for a few reactions far from equilibrium, the concentrations of some intermediate
species oscillate, increasing and decreasing repeatedly. These reactions are
illustrated with the Brusselator, a model chemical oscillator developed in the
Brussels thermodynamic school founded by Prigogine. At the end of this
chapter, the student can find two notes on matrix diagonalization and systems of
first-order linear differential equations, useful for understanding the mathemat-
ical treatment given to the Brusselator, two Mathematica codes (First-Order
Chemical Reaction, Brusselator) with references to expressions in the main text,
detailed explanations for new commands and suggestions for the student to
follow, a glossary that explains important scientific terms, and a list of exercises,
whose complete answers can be found at the end of the book.
A!B ð2:1Þ
where B represents the colorless form of the dye (the bleach is omitted because its
concentration is approximately constant during the experiment). The following
table gives the concentration of A as a function of time at 1-min intervals.
t/min 0 1 2 3 4 5 6 7 8 9 10
[A]/10−5 M 5.00 3.57 2.49 1.82 1.26 0.89 0.61 0.45 0.30 0.23 0.16
The graph of [A] as a function of t is called a kinetic reaction profile for the
chemical reaction (2.1) and is presented in Fig. 2.1 (see Mathematica code M1).
Once the fitting curve is known (an exponential, as we will soon find out), it is
possible to obtain the derivative at each instant in the considered time range.
Since A is the reactant, [A] is a decreasing function of time, its first derivative is
negative, and the reaction rate is given by −d[A]/dt. The reaction rate defined
using B is given by d[B]/dt, that is, d[B]/dt = −d[A]/dt (E1).
A þ B ! 2C ð2:2Þ
where dnA, dnB, and dnC represent changes in the molar quantities of A, B, and C.
Both dnA and dnB are negative, because A and B are reactants, dnC is positive (C is
the single product) and is twice −dnA or −dnB [see (2.2)]. From (2.3), we can write
aA þ bB ! cC þ dD ð2:5Þ
we can write
where n is called the extent of the chemical reaction that specifies the variable
composition along the course of the chemical reaction (n is a state variable). If the
chemical reaction (2.5) is carried out at constant volume, then
where the square brackets stand for molarity. Each member of (2.7) defines the rate
of the chemical reaction that can be obtained at any specified instant during the
course of the chemical reaction (E2). Note that (2.7) assumes a time-independent
stoichiometry, that is, the reaction stoichiometry is assumed to be valid at every
instant during the reaction. This assumption is not always true. If the chemical
reaction has intermediate species with lifetimes that enable their detection, then the
stoichiometry is not time-independent, and the rate of the reaction cannot be defined
in a unique way. In these cases, we are really talking about a sequence of chemical
reactions, and n is a state variable for each chemical reaction in the sequence.
86 2 Chemical Kinetics
Experimental studies show that the rates of many chemical reactions depend on the
concentrations of the reactants according to the following equation:
where A and B are reactants, the exponents x and y are called partial orders of A
and B, and k is called the rate constant. The overall order of the reaction is equal
to x + y. Since k, x, y, and the concentrations in the second member of (2.8) are
experimentally determined quantities, (2.8) is called the experimental rate equa-
tion. Partial orders x and y usually take the values 1, 2, or 0 (when the rate equation
does not depend on the concentration of a particular reactant, its partial order with
respect to that reactant is zero). Since the reaction rate depends on temperature, the
experimental rate equation is always determined at a specified temperature.
Most chemical reactions have more than one reactant. When the partial order of
one particular reactant is to be experimentally determined, conditions should be set
to guarantee that other reactants do not interfere, and their concentrations will not
vary significantly. For instance, considering a chemical reaction with reactants A
and B, the experimental determination of the partial order of reactant A requires that
[B] be kept approximately constant. Under these conditions, (2.8) takes the form
0
v ¼ k ½Ax ð2:9Þ
where k′ = k[B]y constant, because [B] does not significantly change in the
course of the reaction. Equation (2.9) is called the pseudo rate equation, and k′
represents the pseudo rate constant. The partial order x acts like the overall
pseudo-order for the chemical reaction. The isolation and initial rate methods allow
separation of the concentration variables of a rate equation in order to determine the
partial order of a particular reactant.
In the experimental determination of the partial order of A by the isolation
method, the concentration of B should greatly exceed the initial concentration of A.
If A and B are in the stoichiometric proportion 1:1 and the initial concentrations of
A and B are in the ratio 1:100 (for example, [A]0 = 0.100 M and [B]0 = 10.0 M),
then when 99 % of A has reacted, the change in the concentration of B is only 1 %.
The initial rate method takes the rate equation at the initial instant (t = 0),
The rate equation for a first-order chemical reaction with a single reactant (or a
pseudo first-order chemical reaction) is given by
Solving this differential equation consists in finding the function f(t) = [A] that
satisfies (2.11) and the initial condition f(0) = [A]0, where [A]0 is the initial con-
centration of A. While it is easy to conclude that the function that satisfies (2.11) is
an exponential [the derivative of expðktÞ is k expðktÞ], we solve (2.11) in a
more general way that can be applied to second-order reactions. We begin by
separating variables [A] and t in each member,
Now we obtain indefinite integrals of both members. The integration limits for
the first member are [A]0 and [A], and for the second member, are the corre-
sponding values of time, that is, 0 and t. We obtain
ln ½A=½A0 ¼ kt ð2:13Þ
that is, the graph of ln[A] as a function of t gives a straight line whose intercept is ln
[A]0 and whose slope is equal to −k. If we now consider the time interval t = t1/2
such that [A] is half its initial value, [A] = [A]0/2, then substitution of these
equalities in (2.13) leads to
where the time interval t1/2 is called the half-life. Contrary to what the name might
suggest, the half-life is not half of the time to reaction completion. Equality (2.15),
valid for first-order chemical reactions, shows that the half-life does not depend on
[A]0. This is an important result that applies only to first-order chemical reactions.
Hence, when the half-life of a chemical reaction is independent of the initial con-
centration, we conclude that the chemical reaction has first-order kinetics (Fig. 2.2).
Equality (2.13) can be rewritten in the following equivalent way,
which shows that [A] has exponential decay with time. Spontaneous decays of
radioactive atomic nuclei are first-order processes. Adapting the above equalities to
88 2 Chemical Kinetics
Fig. 2.2 The half-life of a first-order reaction does not depend on [A]0. For the kinetic profile
illustrated above, the half-life is approximately equal to 2 min. Figure obtained with Mathematica
The rate equation for a second-order chemical reaction with a single reactant (or a
pseudo second-order chemical reaction) is given by
1 1
¼ þ kt ð2:19Þ
½A ½A0
Hence, the graph of 1/[A] as a function of t gives a straight line whose intercept
is 1/[A]0 and whose slope is equal to k. Substitution of [A] by [A]0/2 and of t by t1/2
in (2.19) gives the half-life for a second-order reaction:
2.2 Experimental Rate Equation 89
1
t1=2 ¼ ð2:20Þ
k½A0
Zeroth order with respect to a particular reactant means that the chemical reaction
rate does not depend on the concentration of that reactant. This can occur when the
reactant is not involved in the slowest step of the reaction (the rate-limiting or
rate-determining step), or when the reaction rate depends on the adsorption of a
particular reactant by a fully covered catalyst surface. Since the surface is saturated,
the reaction rate does not depend on the concentration of the adsorbed reactant.
The rate equation of a zeroth-order chemical reaction for a single reactant is
given by
Thus, the graph of [A] as a function of t gives a straight line whose intercept is
[A]0 and whose slope is equal to −k. Substitution of [A] by [A]0/2 and of t by t1/2 in
(2.22) leads to
Ea
kðTÞ ¼ A exp ð2:24Þ
RT
Ea
lnkðTÞ ¼ lnA ð2:25Þ
RT
thus showing that ln kðTÞ depends linearly on 1/T. The intercept of the resulting
straight line is ln A and the slope is equal to −Ea/R. The derivative of (2.25) is given
by
dlnkðTÞ Ea
¼ ð2:26Þ
dT RT 2
This equality suggests a way for determining the activation energy from
experimental data on k(T), provided the chemical reaction rate constant follows an
Arrhenius dependence.
Above the troposphere, the stratosphere ranges in altitude from about 11–50 km,
with temperature varying approximately from −60 to −2 °C. The ozone layer is
mainly found in the lower layer of the stratosphere, up to an altitude of approxi-
mately 30 km. As an example of the Arrhenius dependence, consider the
second-order stratospheric chemical reaction
N þ O2 ! NO þ O ð2:27Þ
for which A = 1.5 10−11 cm3 molecule−1 s−1 and Ea/R = 3600 K (values taken
from Handbook of Chemistry and Physics, 2011). The Arrhenius
temperature-dependence of the rate constant of the above chemical reaction is
illustrated in Fig. 2.3.
The Arrhenius activation energy has a strong influence on the rate equation. To
illustrate this point, the table shows approximate values of the Arrhenius expo-
nential function for three hypothetical values of the activation energy (10, 50, and
100 kJ mol−1), at two different temperatures (300 and 600 K). At 300 K, the
Arrhenius exponential decreases by a factor of order 10−7 when the activation
energy changes from 10 to 50 kJ mol−1, and by a factor of order 10−9 when the
activation energy changes from 50 to 100 kJ mol−1. In turn, at 600 K, the same
increments in the activation energy lead to exponential decreases by factors of order
10−4. In conclusion, the decrease in the activation energy due to the eventual use of
catalysts and the increase in temperature lead to appreciable increases in the
chemical reaction rates that present the Arrhenius dependence. Note that the catalyst
reduces the activation energy of the chemical reaction, thus leading to an increase in
the reaction rate. The decrease in the activation energy is the result of a change of
mechanism without altering the initial and final states of the reaction (the reactants
and products of the overall chemical reaction).
In the graph of ln k as a function of 103/T [see (2.25)], the extrapolation needed to
obtain the intercept may lead to a large uncertainty in the determination of the A-
factor, since the data points are usually within a short range of 103/T. For instance, the
data points of Fig. 2.3 range from 103/T = 2.85 K−1 (T 351 K) to 103/T = 5.0 K−1
(T = 200 K). At 1000 K, the value of 103/T is 1 K−1, still far from the origin.
improbable that four molecules collide at the same instant with the energy and
orientation required for a chemical reaction. Note that the term molecularity applies
only to elementary reactions or to individual steps of complex reactions.
Consider a reaction of the following type:
v ¼ k½RX]½Nu ð2:30Þ
This reaction has second-order kinetics and obeys the following experimental
rate equation:
This reaction mechanism has a single step with a single transition state, [HO…
CH3…Br]−. The transition state involves both reactant species, and the nucleophilic
substitution is bimolecular, that is, the mechanism is SN2.
When the mechanism of a chemical reaction consists of more than one step, the reaction
is said to be a complex reaction. Experimental evidence for the existence of one
reaction intermediate leads to the conclusion that the reaction mechanism is formed by
at least two steps. Sometimes, the lifetime of a reaction intermediate enables its isolation
and characterization. However, reaction intermediates usually are reactive species with
2.5 Complex Reactions 93
very low concentrations, thus being difficult to detect. The physical methods generally
used in the detection of short-lived reaction intermediates are spectroscopic methods,
which involve an interaction with electromagnetic radiation.
The rate equation of an elementary reaction can be written once the reaction
stoichiometry is known. Hence, when the rate equation does not reflect the reaction
stoichiometry, we have a complex reaction. For example, the reaction between
hypochlorite and iodide ions in aqueous solution,
The presence of the concentration of the hydroxide ion in the rate equation and
its absence from the stoichiometric Eq. (2.33) leads to the conclusion that the
chemical reaction (2.33) has more than one step in its mechanism; that is, it is a
complex reaction.
The reaction of 2-bromo-2-methylpropane, (CH3)3CBr, with sodium hydroxide,
in methanol at 25 °C,
This rate equation shows that the slowest reaction step involves a single reactant
species, namely, (CH3)3CBr. Having the chemical reaction (2.35) in mind, we can
infer from (2.36) that the first step of (2.35) consists in the dissociation of
(CH3)3CBr in the (CH3)3C+ carbocation and the bromide ion,
This is a fast step, because the hydroxide ion has a stronger nucleophilic char-
acter than the bromide ion. Addition of steps (2.37) and (2.38) brings back the
original overall reaction (2.35). The transition state involves a single reactant
molecule, (CH3)3CBr, and the overall reaction (2.35) represents a unimolecular
nucleophilic substitution, thus being an example of an SN1 mechanism.
94 2 Chemical Kinetics
The difference in the mechanisms of reactions (2.31) and (2.35) lies in the
different stabilities of the involved carbocations, CH3+ and (CH3)3C+. CH3+ is a
stronger electrophile than (CH3)3C+, because the same positive charge is distributed
over a larger carbocation in (CH3)3C+ as compared with CH3+. Hence, reaction
(2.31) does not have a slow initial step for the dissociation of CH3Br into CH3+ and
Br−, since CH3+ is quite a strong electrophile for that to occur.
Most reactions of ions in aqueous solution are extremely fast, reaching equilibrium
in times of order 10−10 and 10−12 s. Such reactions cannot be studied using con-
ventional methods that depend on the mixture of reactants, since the diffusion times
(times for migration of reactant molecules until they collide with each other) are
orders of magnitude greater than reaction times. Techniques can be used that apply
an almost instantaneous perturbation, after which the system momentarily leaves
the equilibrium state (concentrations of reactants and products are changed for a
moment) and returns to the equilibrium state, since the duration of the perturbation
is not significant when compared with the reaction half-life. The process of
returning to the equilibrium state is called relaxation. The applied perturbation can
be a shock wave, a pulse of electromagnetic radiation that produces a photo-
chemical reversible reaction (flash photolysis), a sudden temperature jump (T-
jump), or a sudden pressure jump associated with sound absorption in gaseous
systems (Eigen 1954). The study of extremely fast reactions using brief energy
pulses led to the award of the 1967 Nobel Prize in chemistry to Manfred Eigen,
Ronald Norrish, and George Porter.
In order to proceed with the study of an extremely fast chemical reaction, it is
necessary to know the rate equation for the reaction in the forward and reverse
directions. Consider the reaction in aqueous solution
AþBC ð2:39Þ
where A and B are positive and negative ions and C is the chemical species
resulting from the combination of those ions. Assume that the forward reaction is of
first order in A and in B and that the reverse reaction is of first order in C, and
represent by k! and k← the rate constants in the forward and reverse reactions.
When a brief perturbation, for instance a temperature jump, is applied to the
reaction at equilibrium, the A, B, and C concentrations will change with respect to
their equilibrium values according to the following equalities, which preserve the
reaction stoichiometry:
dd
¼ k! ½A½B k ½C ð2:43Þ
dt
Substitution of (2.40) in (2.43) leads to
dd
¼ a bd þ vd2 ð2:44Þ
dt
where
a ¼ k! ½Aeq ½Beq k ½Ceq b ¼ k! ½Aeq þ ½Beq þ k v ¼ k! ð2:45Þ
k! ½Ceq
Keq ¼ ¼ ð2:46Þ
k ½Aeq ½Beq
dd
bd ð2:47Þ
dt
This result shows that the perturbation d follows first-order kinetics,
where
b ¼ s1 ð2:49Þ
96 2 Chemical Kinetics
(the units of b are the inverse of time, s−1) (E4). The rate constants k! and k← can
be experimentally determined if two values of b, say b1 and b2, have been previ-
ously determined corresponding to two sets of [A]eq and [B]eq values (set1 and set2).
where H+(aq) stands for the symbolic representation of a protonated water molecule H3O+
or a protonated group of hydrogen-bonded water molecules as H5O2+, H7O3+, H9O4+, ….
The rate constant for the combination of H+ and OH− is equal to k! = 1.4 1011
dm3 mol−1 s−1, that is, the neutralization is an extremely fast reaction, whereas the rate
constant for the reverse reaction (ionization of H2O) is equal to k← = 2.5 10−5 s−1
(Eigen 1967). At equilibrium, making use of (2.46), we can write
where [H2O] 55.6 mol dm−3. Hence, the constant for the ionic product of water
at 25 °C is given by
k
½H þ eq ½OH eq ¼ ½H2 O ¼ 1:0 1014 ð2:52Þ
k!
2.7.1 Brusselator
In 1968, Prigogine and Lefever developed a model that shows how a chemical
reaction, far from equilibrium, can pass from a stationary point to an oscillatory
state (Prigogine 1968). This chemical oscillator, often called the Brusselator as a
reminder of the Brussels thermodynamic school founded by Prigogine, consists of
the following mechanism:
k1
A ! X
k2
B þ X ! Y þ D
ð2:53Þ
k3
2X þ Y ! 3X
k4
X ! E
where the inverse reactions are assumed to have negligible rate constants, and the
overall chemical reaction is
In order to simplify the notation, A, B, X, and Y stand for [A], [B], [X], and [Y],
respectively. Note that an increase of X, in elementary reactions 1 and 3, is followed
by its decrease, in chemical reactions 2 and 4. A similar oscillatory behavior can be
assigned to Y that decreases in chemical reaction 3, where Y is a reactant, and
increases in reaction 2, where Y is a product.
The rate equations for the reaction intermediates X and Y in (2.53) are given by
the following set of nonlinear coupled differential equations:
dX dY
¼ k1 A k2 BX þ k3 X 2 Y k4 X ¼ k2 BX k3 X 2 Y ð2:55Þ
dt dt
98 2 Chemical Kinetics
Fig. 2.4 In a flow reactor, the constant supply of A and B keeps the concentrations of these
reactants in the reaction mixture fixed, and the constant removal of D and E maintains the system
far from these reactants equilibrium
where the effects of diffusion of X and Y have been ignored, since we assume that
chemical reactions (2.53) occur in a homogeneous medium, and so there is spatial
uniformity of concentrations of all involved chemical species. In solving this set of
nonlinear differential equations, the initial conditions are such that the corre-
sponding physical system (the reaction mixture) is kept far from equilibrium: A and
B have fixed values, while D and E are constantly removed from the reaction
mixture to prevent reverse chemical reactions from occurring (Fig 2.4). Solving
(2.55) for its stationary-point solutions (dX/dt = 0 and dY/dt = 0) leads to
k1 k2 k4 B
Xs ¼ A Ys ¼ ð2:56Þ
k4 k1 k3 A
dX dY
¼ Z 1 ðX; YÞ ¼ Z2 ðX; YÞ ð2:57Þ
dt dt
The time-dependent perturbations x(t) and y(t) are added to the stationary-point
coordinates Xs and Ys,
Therefore,
dX dx dY dy
¼ ¼ ð2:59Þ
dt dt dt dt
since Xs and Ys are constants of the experiment. The Taylor expansion of Z1 and Z2
in (2.57) about the stationary point leads to
2.7 Chemical Oscillations 99
@Z1 @Z1
Z1 ðXs þ x; Ys þ yÞ ¼ Z1 ðXs ; Ys Þ þ xþ yþ
@X s @Y s
ð2:60Þ
@Z2 @Z2
Z2 ðXs þ x; Ys þ yÞ ¼ Z2 ðXs ; Ys Þ þ xþ yþ
@X s @Y s
Since x(t) and y(t) represent small perturbations, only the linear terms in (2.60)
are retained (quadratic and higher-order terms are ignored). Hence, making use of
(2.57) and (2.59), and noting that Z1(Xs, Ys) and Z2(Xs, Ys) are zero by definition of
the stationary point, we conclude that
dx @Z1 @Z1 dy @Z2 @Z2
¼ xþ y ¼ xþ y ð2:61Þ
dt @X s @Y s dt @X s @Y s
or in vector notation,
dx
¼ Kx ð2:62Þ
dt
where
" @Z #
@Z1
@Z
@X s @Z
1
@Y s x
K¼ x¼ ð2:63Þ
2
@X s
2
@Y s
y
with K being the Jacobian matrix. The Jacobian matrix elements quantify the
variation of the Z1 and Z2 rates with respect to changes in X and Y at the stationary
point.
For the Brusselator, the Jacobian matrix is given by
k2 B k4 k3 Xs2
K¼ ð2:64Þ
k2 B k3 Xs2
where Xs is given by the first equality of (2.56). The characteristic equation (see
§1) is
that is,
k2 B k4 k k3 Xs2
¼ 0 k2 ðk2 B k4 k3 Xs2 Þk þ k3 k4 Xs2 ¼ 0
k2 B k3 Xs2 k
ð2:66Þ
100 2 Chemical Kinetics
p 1
k ¼ ðp2 4qÞ1=2 ð2:67Þ
2 2
where
(E5).
Consider now the following set of parameters k1 = k2 = k3 = k4 = 1.0 and the
initial condition A = 1.0, leading to Xs = 1.0, Ys = B [see (2.56)], p = B − 2, q = 1
[see (2.68)], and p2 − 4q = B2 − 4B. For B = 2, we have p = 0, and the eigen-
values of the Jacobian matrix are pure imaginary. For B = 1.5 or 2.5, we have
p2 − 4q = (±0.5)2 − 4 < 0, and the eigenvalues have a nonzero imaginary
part. The value B = 1.5 leads to p < 0 and to a complex conjugate pair with a
negative real part, whereas B = 2.5 gives p > 0 and a positive real part. We now
return to the initial conditions A = 1.0 and B = 2. From (2.62) and (2.64), we can
write
x_ ðtÞ 1 1 xðtÞ
¼ ð2:69Þ
y_ ðtÞ 2 1 yðtÞ
The eigenvalues of this Jacobian matrix are ±i, and the corresponding eigen-
vectors are (−1 − i, 2)T and (−1 + i, 2)T. Then the general solution to the above
system of differential equations (see §2) is
xðtÞ 1 i it 1 þ i it
¼ c1 e þ c2 e ð2:70Þ
yðtÞ 2 2
This equality shows that the x(t) and y(t) functions are linear combinations of
cosine and sine functions. We now define U and V variables by U X − Xs and
V Y − Ys, so that the stationary point is at (Us, Vs) = (0, 0) instead of at (Xs, Ys).
Figure 2.5 considers the same set of parameters (k1 = k2 = k3 = k4 = 1.0) and the
initial condition A = 1.0, with B equal to 1.5, 2.0, and 2.5, and shows U and V as
functions of time in the first plot and V(t) as a function of U(t) in the second plot
(phase trajectory). For B = 1.5, the eigenvalues of the Jacobian matrix are a
2.7 Chemical Oscillations 101
Fig. 2.5 Oscillations and phase trajectories for the Brusselator. Figures obtained with
Mathematica
complex conjugate pair with a negative real part, and U and V quickly decay to zero
as the phase trajectories rapidly spiral into the origin. For B = 2.0, the eigenvalues
of Jacobian matrix are pure imaginary, and the trajectories spiral asymptotically to
the origin as t ! ∞ (see Mathematica code M2). In turn, when B = 2.5, the
eigenvalues of the Jacobian matrix are a complex conjugate pair with a positive real
part, and the trajectories quickly approach a stable limit cycle as t ! ∞, with
U(t) and V(t) indefinitely maintaining the same amplitudes and same shapes (the
system is said to have reached permanent oscillations) (E6).
102 2 Chemical Kinetics
Notes
Au ¼ w ð2:72Þ
This equality represents a linear transformation within the same vector space,
which is said to preserve the vector length, since u and w have the same length. One
especial and very important case of (2.72) quite frequent in quantum mechanics
occurs when the vector w is equal to a scalar k times the column vector u, that is,
Au ¼ ku ð2:73Þ
where the scalar k and the vector u are to be determined (they are not known
beforehand). Certainly, the trivial solution u = 0 is of no practical interest! Values
of k and nontrivial vectors u that satisfy (2.73) are called eigenvalues and eigen-
vectors, respectively. Equality (2.73) can be rewritten as
ðA kI Þu ¼ 0 ð2:74Þ
Using (2.74) for each of the eigenvalues k1, k2, and k3, we can write
0 10 1
j j j k1 0 0
AU ¼ðAu1 ; Au2 ; Au3 Þ ¼ ðk1 u1 ; k2 u2 ; k3 u3 Þ@ u1 u2 u3 A@ 0 k2 0A
j j j 0 0 k3
¼ UD
ð2:82Þ
AU ¼ UD ð2:83Þ
104 2 Chemical Kinetics
If the eigenvectors are linearly independent, then U is nonsingular, that is, its
determinant is different from zero. Therefore, U−1 exists, and we can multiply
(2.83) from the left by U−1 and obtain
U 1 AU ¼ D ð2:84Þ
Two n n matrices A and B are called similar if B = X−1AX for some non-
singular and therefore invertible matrix X. The transformation A ! X−1AX is a
similarity transformation. Similar matrices represent the same transformation
(operator) in two different bases of the same vector space related by the matrix
X. Equality (2.84) shows a particular kind of a similarity transformation, whose
result is a diagonal matrix. For that reason, (2.84) represents the diagonalization of
the matrix A. The similarity transformation that diagonalizes the matrix A linearly
combines the basis vectors, so that the resulting matrix becomes diagonal in the
new basis.
We now illustrate the concepts of similarity transformation and similar matrices
using a three-dimensional vector space with Cartesian coordinates and the basis
vectors (xyz)T and (rst)T. Consider the 3 3 matrix A that converts (xyz)T into (x′y′
z′)T,
0 0 1 0 1
x x
@ y 0 A ¼ A@ y A ð2:85Þ
0
z z
Matrix X relates these bases: by applying the matrix X to (rst)T and (r′s′t′)T, we
obtain (xyz)T and (x′y′z′)T, respectively,
0 1 0 1 0 0 1 0 01
x r x r
@ y A ¼ X@ s A @ y A ¼ X @ s0 A
0
ð2:87Þ
0 0
z t z t
The change of basis can be carried in the opposite direction, from (xyz)T and (x′y′z
′) to (rst)T and (r′s′t′)T, meaning that X−1 exists, that is, X is nonsingular (its deter-
T
B ¼ X 1 AX ð2:89Þ
X 1 ¼ X T ð2:90Þ
Consider the system of two simultaneous first-order linear equations that corre-
sponds to (2.62),
x_ ¼ Kx ð2:91Þ
where u and k are to be determined. Substitution of (2.92) into (2.91) and can-
cellation of ekt from both sides gives the eigenvalue equation
Ku ¼ ku ð2:93Þ
Since (2.91) is a system of linear differential equations, if x1(t) and x2(t) are
solutions, then
xð t Þ ¼ c 1 x1 ð t Þ þ c 2 x 2 ð t Þ ð2:94Þ
is also a solution. Therefore, from (2.92) and (2.94) we conclude that the general
solution for (2.91) is given by
where u1 and u2 are eigenvectors and k1 and k2 are the corresponding eigenvalues
(see McQuarrie, Sect. 11.6, p. 563).
106 2 Chemical Kinetics
Mathematica Codes
5:01042 e0:344655t
The function f[t] is plotted in the last line of code, and the data points are
rendered over the plotted curve by the Mathematica command Epilog, whose
main action is
Map[Point,data]
This Mathematica function Map applies Point to each element of data.
Suggestion: Complete the code to evaluate the half-life to a precision of four
decimal places.
Mathematica Codes 107
M2. Brusselator
q = 1; see (2.67) and (2.68)], and the trajectories spiral asymptotically to the origin
as t ! ∞, as can be concluded by inspection of the phase trajectory shown above.
Suggestion: Using the Mathematica function Table, write a Mathematica code
for solving the equations of the Brusselator for k1 = k2 = k3 = k4 = 1.0, A = 1.0,
and three values of B, namely, B = 1.5, 2.0, and 2.5.
Glossary
Exercises
E1. Considering the experimental data that led to Fig. 2.1, use Mathematica to
calculate the rate constant for the first-order chemical reaction and the time
elapsed until the concentration of A is reduced to 5 % of its initial value.
E2. Derive (2.7).
E3. For a second-order chemical reaction and the following rate constant values,
use Mathematica for determining the Arrhenius activation energy.
(a) Write the set of differential equations for the concentrations of X and Y as
functions of time and determine the stationary point.
(b) Obtain the Jacobian matrix at the stationary point. Assume k1 = k2 = k3 =
k4 = 1.0 and A = 1.0, and determine the Jacobian matrix in terms of B.
(c) Use Mathematica for determining the eigenvalues of the Jacobian matrix for
B = 2.0.
k1
A þ X ! 2X
k2
X þ Y ! 2Y
k3
Y ! B
(a) Write the set of differential equations for the concentrations of X and Y as
functions of time and determine the stationary point.
(b) Determine the Jacobian matrix at the stationary point. Assume k1 = k2 =
k3 = 1 and A = 1.0 and determine the Jacobian matrix for these values.
(c) Use Mathematica for determining the eigenvalues of the Jacobian matrix.
(d) Use Mathematica for solving the kinetic equations, plot U(t), V(t), and the
phase trajectory.
References 111
References
Bray WC (1921) A periodic reaction in homogeneous solution and its relation to catalysis. J Am
Chem Soc 43:1262–1267
Eigen M (1954) Methods for investigation of ionic reactions in aqueous solutions with half-times
as short as 10−9 s. Application to neutralization and hydrolysis reactions Discuss Faraday Soc
17:194–205
Eigen M (1967) Immeasurably fast reactions, Nobel Lecture, December 11
Lotka AJ (1920) Undamped oscillations derived from the law of mass action. J Am Chem Soc
42:1595–1599
Prigogine I, Lefever R (1968) Symmetry breaking instabilities in dissipative systems II. J Chem
Phys 48:1695–1700
Winfree AT (1984) The prehistory of the Belousov-Zhabotinsky oscillator. J Chem Educ 61:661–
663
Further Reading
Abstract
The equation of motion of macroscopic particles was proposed by Isaac Newton
toward the end of the seventeenth century. At the beginning of the twentieth
century, several scientific advances led to the conclusion that classical mechanics
was unable to correctly describe the mechanical behavior of subatomic particles.
In particular, the electron, with its small mass of about 1 over 2 103 the proton
mass, exhibits both particlelike and wavelike properties. This chapter aims at
providing the student with the important background theory on quantum
chemistry. To this end, it contains sections on operators, the harmonic oscillator,
spherical coordinates, angular momentum, the hydrogen atom, the antisymmetry
principle, the variational method, Born–Oppenheimer approximation, the
Hartree–Fock method, and density functional theory, followed by sections on
perturbation theory, time-dependent perturbation theory, absorption and emis-
sion of radiation and Raman scattering, and a final section on molecular
calculations that comprises computational methods and standard basis sets. At
the end of this chapter, the student finds two notes, one on the particle in a
one-dimensional box and the other on the two-particle rigid rotor, that
complement the main text without diverting the student’s attention from the
sequence of the chapter sections. Next, several Mathematica codes (Wave
Equation, Helmholtz Equation, Harmonic Oscillator, Spherical Harmonics,
Determinants, Systems of Homogeneous Linear Equations, Normalization
Constants for Slater-Type Orbitals, Functional Derivative, STO versus a
Gaussian Function at the Origin, Fitting Gaussian Functions to a 1s Hydrogen
Orbital, Product of Gaussian Functions) are interconnected with the main text,
along with detailed explanations of new Mathematica commands. Some of these
have suggestions for the student to follow. Finally, the student will find a
glossary of important scientific and technical terms and a list of exercises, whose
complete answers are presented in a section at the end of the book.
3.1 Operators
An operator is a rule that transforms a given function into another function. For
example, the derivative of f(x) results from applying the operator d/dx to f(x),
0 d
f ðxÞ ¼ f ðxÞ ð3:1Þ
dx
^ and B
Two operators A ^ are said to be equal if
^ ¼ Bf
Af ^ ð3:2Þ
where the operator on the right is first applied to f, and the operator on the left is
applied to the function in parentheses. The composition of operators is associative:
^ B
A ^ ¼ A
^C ^B ^
^ C ð3:4Þ
For two commuting operators, their commutator is equal to the null operator,
and the order of application of both operators is irrelevant. The operator d/dx and a
constant c commute,
d d d d d
cf ðxÞ ¼ c f ðxÞ ) ;c ¼ cc ¼0 ð3:6Þ
dx dx dx dx dx
^ ½f ðxÞ þ gðxÞ ¼ Af
A ^ ðxÞ þ AgðxÞ
^ ^ ½c f ðxÞ ¼ c A
A ^ f ðxÞ ð3:8Þ
3.1 Operators 115
When the result of applying a linear operator to a function f is a constant times the
same function,
^ ¼ af
Af ð3:9Þ
(E1, E2).
p2x
ð3:11Þ
2m
where px is the linear momentum along the x-axis, whose corresponding quantum
mechanical operator is given by
h @
^px ¼ ð3:12Þ
i @x
(E3, E4).
The time-independent Schrödinger equation is given by the eigenvalue equation
^ k ¼ Ek wk
Hw ð3:13Þ
where the operator is called the Hamiltonian (the energy operator), wk is the
eigenfunction, and Ek is the corresponding energy eigenvalue. For a system con-
sisting of one particle in one dimension, the Hamiltonian is given by
^ ¼ h d þ VðxÞ
2 2
H ð3:14Þ
2m dx2
116 3 The Schrödinger Equation
where the first term is the kinetic energy operator (E5), the second term is the
potential energy operator, and
h
h ¼ ð3:15Þ
2p
Multiplication of (3.13) by the complex conjugate of wk followed by integration
over all coordinate space leads to
Zþ 1 Zþ 1
^ k dx
wk Hw ¼ Ek wk wk dx ¼ Ek ð3:16Þ
1 1
where
Zþ 1 Zþ 1
wk wk dx ¼ jwk j2 dx ¼ 1 ð3:17Þ
1 1
The square of the absolute value of the wave function, |wk(x)|2, is the probability
density for finding the particle at state wk and point x, and the integral of |wk(x)|2
over the entire domain of x is 1, since wk is normalized. The physical meaning of
|wk|2 as a probability density implies that wk is single-valued and continuous, has
continuous first derivatives, and is quadratically integrable, that is, the integral of
|wk|2 is finite and so can be normalized. A function that meets these requirements is
called a well-behaved function.
The Schrödinger equation cannot be derived any more than the Newton equation of
classical mechanics can be derived. In fact, these equations cannot be derived from
first principles, since they express first principles for the mechanical behavior of
macroscopic bodies and subatomic particles. In spite of that, we can imagine the line
of thought followed by Schrödinger to arrive at his famous and less intuitive equation.
To this end, the classical wave equation is combined with the De Broglie expression
that connects linear momentum, a particle characteristic, with wavelength, a wave
feature. We begin by writing the following one-dimensional wave equation,
@ 2 uðx; tÞ 1 @ 2 uðx; tÞ
¼ ð3:18Þ
@x2 v2 @t2
where u is any twice differentiable function, and v represents the speed of the
propagating wave. The Mathematica code M1 solves this differential equation,
showing that its general solution is a linear combination of any twice differentiable
functions of arguments x − vt and x + vt. Functions with these arguments are called
waves and propagate to the right or forward direction (argument x − vt) and to the
left or backward direction (argument x + vt) (E6).
A propagating wave conveys two kinds of periodicity: space periodicity in the
x direction with wavelength k as its characteristic parameter, and time periodicity with
3.1 Operators 117
d2 wðxÞ
þ k2 wðxÞ ¼ 0 ð3:20Þ
dx2
p2x
E¼ þ VðxÞ ð3:21Þ
2m
kpx ¼ h ð3:22Þ
h2 d2 wð xÞ
þ V ð xÞwð xÞ ¼ Ewð xÞ ð3:23Þ
2m dx2
Consider now an arbitrary well-behaved function f and a set g1, g2, …, gi,… of
well-behaved functions that obey the same boundary conditions of f and satisfy the
equality
X
f ¼ c i gi ð3:24Þ
i
118 3 The Schrödinger Equation
where the ci are coefficients. The set of functions g1, g2, …, gi, … is said to be a
complete set of functions and can be used to expand any arbitrary function that
obeys the same boundary conditions of the functions gi.
Now consider a linear operator  that represents the physical quantity A. When
the system is described by the state function w, the average value of  is given by
Z
h Ai ¼ ^
w Awds ð3:25Þ
where the integral is over the full range of coordinates of the wave function. The
average value of a physical quantity is a real number, so
h Ai ¼ h A i ð3:26Þ
and
Z Z
^
w Awds ¼ ^
w Aw ds ð3:27Þ
A linear operator that satisfies this equality for all well-behaved functions is
called a Hermitian operator. Replacing w by one eigenfunction of the Hermitian
operator, one easily concludes that the eigenvalues of a Hermitian operator are real
numbers. The above definition of Hermitian operator can be shown to be equivalent
to the following,
Z Z
^
f Agds ¼ ^ ds
g Af ð3:28Þ
^ k ¼ ak gk
Ag ð3:29Þ
ak ¼ ak ð3:32Þ
that is, the eigenvalue ak is real. Therefore, the eigenvalues of a Hermitian operator
are real numbers.
Theorem 2 Two linear operators with a common complete set of eigenfunctions
commute.
If two linear operators have a common complete set of eigenfunctions g1, g2, …,
gi, …,
^ i ¼ ai gi
Ag ^ i ¼ bi gi
Bg ð3:33Þ
where ai and bi are eigenvalues, then they commute, i.e., their commutator is the
null operator, and consequently,
A^B
^B
^A^ f ¼0 ð3:34Þ
^ and
In order to demonstrate this theorem, we consider two Hermitian operators A
^ and write the eigenvalue equation for A,
B ^
^ i ¼ ai gi
Ag ð3:35Þ
^ i ¼ bi gi
Bg ð3:38Þ
^ ¼ af
Af ^ ¼ bg
Ag ð3:39Þ
^ that
Consider two independent eigenfunctions of the Hermitian operator A
correspond to the same eigenvalue (degenerate eigenvalue),
^ ¼ sf
Af ^ ¼ sg
Ag ð3:41Þ
If we now assume that f and g are not orthogonal, we can take f and g + cf (note
that g + cf is also an eigenfunction of  with the same eigenvalue s) and choose
c such that these two independent eigenfunctions become orthogonal, i.e.,
Z R
f gds
f ðg þ cf Þds ¼ 0 ) c ¼ R ð3:42Þ
f f ds
A notation introduced by Dirac, very useful in quantum mechanics, uses the fol-
lowing symbolism:
Z Z
f Agds ¼ f
A
^ ^
g f gds ¼ hf jgi ð3:43Þ
where the integrals are over the full range of coordinates of the wave functions. We
have
Z Z
f gds ¼ fg ds ð3:44Þ
so
The atoms of a molecule oscillate about their equilibrium positions. For an isolated
molecule (a molecule in the gas phase at low pressure), suitable spectroscopic
methods show that these oscillations have well-defined frequencies. In order to
understand the vibrations of molecules, we begin by separating the translations and
rotations. For a diatomic molecule AB, the motions of atoms A and B involve six
122 3 The Schrödinger Equation
Cartesian coordinates, three for each atom. Let us consider two systems of refer-
ence, one fixed in the laboratory, the other locked to the molecule, with one of its
axes coincident with the internuclear axis (Fig. 3.1). The motions of the atoms can
now be separated into the center of mass motion described by three translations,
corresponding to the components of the vector R(t) along the xlab, ylab, and zlab axes,
two rotations, around two orthogonal axes perpendicular to the internuclear axis
(for a linear molecule, rotation around the internuclear axis does not lead to
physically distinguishable configurations and so is not considered), and one
vibration, described by the variation in the internuclear distance between atoms A
and B (Fig. 3.1).
A molecule with N atoms needs 3N Cartesian coordinates as functions of time
(three Cartesian coordinates per atom) to fully describe the motions of its atoms.
Then, 3N coordinates include three coordinates for the center of mass motion
(molecular translations), three coordinates for the rotations of a nonlinear molecule
around three orthogonal axes, or two coordinates for the rotations of a linear
molecule, and 3N − 6 or 3N − 5 coordinates for the vibrations of a nonlinear or a
linear molecule, respectively.
2 2
1 drA 1 drB
T ¼ mA þ mB ð3:46Þ
2 dt 2 dt
If we now consider the molecule AB with its center of mass defined by the
vector R, then we can write
RM ¼ rA mA þ rB mB ð3:47Þ
where
M ¼ mA þ mB ð3:48Þ
r ¼ rB rA ð3:49Þ
1 1 1
þ ð3:51Þ
l mA mB
(E12, E13).
F ¼ kx ð3:52Þ
where x(t) = s(t) − se, with se representing the equilibrium position. The propor-
tionality constant k is called a force constant. Use of the Newton equation in (3.52)
gives
d2 x
m ¼ kx ð3:53Þ
dt2
124 3 The Schrödinger Equation
Fig. 3.2 (Left) A single mass linked to a rigid wall by a spring. (Right) Two masses linked by a
spring (R is the center of mass of the system)
x ¼ A sinðxt þ bÞ ð3:54Þ
We now write
dV
F¼ ð3:56Þ
dx
dV
¼ kx ð3:57Þ
dx
1
V ¼ kx2 ð3:58Þ
2
on the assumption that the integration constant is zero (the potential energy always
has an arbitrary additive constant, so we can take it as zero). The Taylor expansion
of the potential energy around the potential energy minimum, Vmin, is given by
dV 1 d2 V
V ¼ Vmin þ þ x2 þ ð3:59Þ
dx min 2 dx2 min
The first term can be chosen to be zero, and the second term is zero because the
minimum is a stationary point, and the slope at a stationary point is zero. Therefore,
we can write, after taking (3.58) and (3.59) into account,
d2 V
¼k ð3:60Þ
dx2 min
that is, the curvature of the potential energy curve in the minimum is equal to the
harmonic oscillator force constant. Terms of order higher than x2 account for an-
harmonicity and are not included in the harmonic oscillator model.
The one-dimensional Schrödinger equation for the harmonic oscillator [(3.23)
with the potential energy given by (3.58)] is
h2 d2 wv ð xÞ 1 2
þ kx wv ð xÞ ¼ Ev wv ð xÞ ð3:61Þ
2m dx2 2
wv ¼ Nv ez =2 Hv ðzÞ
2
ð3:62Þ
(see Mathematica code for obtaining these normalization factors in M3), and Hv are
the polynomial factors known as Hermite polynomials (E15, E16), defined by
2 dv z2
Hv ðzÞ ð1Þv ez e ð3:64Þ
dzv
H0 ¼ 1
H1 ¼ 2z
H2 ¼ 4z2 2
H3 ¼ 8z3 12z
H4 ¼ 16z4 48z2 þ 12 ð3:66Þ
Figure 3.3 shows the squares of the harmonic oscillator eigenfunctions (3.62)
(probability density curves), for v from 0 to 5 (see Mathematica code in M3). The
number of nodes in each eigenfunction equals the value of the vibrational quantum
number.
If we now substitute back z by x using z = √ax, we obtain the first four harmonic
oscillator normalized wave functions expressed in the original x variable:
a1=4
eax =2
2
w0 ¼
p
a1=4 1 pffiffiffi
pffiffiffi eax =2 2 ax
2
w1 ¼
p 2
a1=4 1
pffiffiffi eax =2 4ax2 2
2
w2 ¼
p 8
a1=4 1 h pffiffiffi pffiffiffi i
pffiffiffiffiffi eax =2 8 a x3 12 ax
2 3
w3 ¼ ð3:67Þ
p 48
Fig. 3.3 Probability density curves for the harmonic oscillator as functions of z. This figure was
obtained with Mathematica
3.2 Harmonic Oscillator 127
pffiffiffiffiffiffiffiffiffiffiffiffi
where the normalized factors are (a/p)1/4 ½1= ð2v v!Þ (see Mathematica code in M3
for verifying these normalizing factors and confirming that these wave functions are
orthogonal).
The existence of boundary conditions for the wave functions (these must go to
zero as x goes to infinity) implies that Ev = (v + 1/2)hx, where v = 0, 1, 2, …, that
is, the energies of the vibrational levels depend linearly on the vibrational quantum
number v, and the separation between consecutive energy levels is hx. At 0 K, the
harmonic oscillator quantum number is zero (v = 0); the harmonic oscillator energy
is called the zero-point energy and is equal to hx=2. For a molecule as a collection
of harmonic oscillators, the total zero-point energy is the sum of all zero-point
energies.
The single vibrational mode of a diatomic molecule is the stretching of its inter-
nuclear distance. An empirical representation of the potential energy function for a
diatomic molecule, more realistic than the harmonic oscillator, is the so-called
Morse potential energy function
h i2
UðRÞ ¼ D 1 ebðRRe Þ ð3:68Þ
ðR Re Þ2 00 ðR Re Þ3 000
UðRÞ ¼ UðRe Þ þ ðR Re ÞU 0 ðRe Þ þ U ðRe Þ þ U ðRe Þ þ
2! 3!
ð3:69Þ
where
2
dU dU
UðRe Þ ¼ 0 ¼0 ¼k
dR Re dR2 Re
rffiffiffiffiffiffi
d3 U k d4 U 7k2
¼ 3k ¼ ð3:70Þ
dR3 Re 2D 4
dR Re 2D
The first nonzero term is the harmonic term. Cubic, quartic, and following terms
account for the Morse potential anharmonicity.
128 3 The Schrödinger Equation
Fig. 3.4 Morse function fitted to the points obtained by scanning the bond distance of F2. The
parabola represents the harmonic oscillator potential energy curve. This figure was obtained with
Mathematica
Figure 3.4 shows the potential energy curves for the harmonic oscillator
(parabola) and the Morse function fitted to the points in black obtained by deter-
mining the potential energy of the system of two fluorine atoms at predefined
internuclear distances (scan of the bond distance for F2, Gaussian 09 B3LYP/
cc-pVTZ calculation; for now, the student should simply consider this a reliable and
accurate calculation for its purpose, leaving the details of the level of calculation
and the basis of functions to be understood later on, in Sect. 3.15). The minimum of
the parabola was set to be coincident with the minimum of the potential energy
curve for F2. Both curves (Morse function and parabola) have the same curvature at
their minima. It can be seen that the harmonic oscillator deviates increasingly from
the calculated potential energy curve for F2 as the amplitude of vibration increases.
Large-amplitude vibrations such as the stretching vibrations involving light atoms
as in H2, CH, NH, OH, FH, F2, have important anharmonic corrections. Unlike the
harmonic oscillator curve, the Morse potential function can be well fitted to the
points of the potential energy curve of F2.
The potential energy function for the electron of a hydrogen atom is a function of
the distance from the electron at point (x, y, z) to the origin at the nucleus, and thus a
spherically symmetric function. One says that the hydrogen atom is an example of a
central force problem. The set of coordinates adequate to a central force problem
are spherical coordinates, (r, h, /), where r is the distance from the origin to the
point (x, y, z) (0 r < ∞), h is the angle between the positive z-axis and the
r vector to the point (0 h p), and / is the angle between the positive x-axis
3.3 Spherical Coordinates 129
with the projection of the r vector in the xy-plane (0 / < 2p) (Fig. 3.5). Right
triangle trigonometry can easily lead to the following equalities:
In classical mechanics, the motion of one particle around a center of force can be
characterized by two kinds of momenta, linear and angular momenta. Let r be the
vector from the center of force to the particle position,
r ¼ ix þ jy þ kz ð3:78Þ
dr dx dy dz
v ¼ i þj þk ð3:79Þ
dt dt dt dt
The classical definitions of the linear and angular momenta are given by
p mv ð3:80Þ
and
Lrp ð3:81Þ
i.e.,
i j k
L ¼
x y z
ð3:82Þ
px py pz
(E19).
The quantum-mechanical operators of the linear momentum [see (3.12)] are given
by
@ @ @
^px ¼ ih ^py ¼ ih ^pz ¼ ih ð3:84Þ
@x @y @z
L ^L
^2 ¼ L ^¼L
^2x þ L
^2y þ L
^2z ð3:87Þ
^z ¼ ih @
L ð3:90Þ
@/
where the last expression suggests a separation of the variables h and / in the
eigenfunctions. The one-electron eigenvalue equations and quantum numbers for
the square and the z-component of the angular momentum operators are given by
are called spherical harmonics and Pm ‘ are the associated Legendre polynomials
(note that in Y‘m and Pm
‘ , m is a superscript, not an exponent) (see §2). Concise and
useful definitions of associated Legendre polynomials, implemented in Mathe-
matica as LegendreP½‘; m; cos h, and spherical harmonics, implemented in
Mathematica as SphericalHarmonicY[‘,m,h,/], can be obtained by
querying Wolfram Alpha at www.wolframalpha.com. The interested reader can find
3.4 Angular Momentum 133
Fig. 3.7 Polar plots of associated Legendre polynomials P00 , P01 , and P02 , and three-dimensional
plots of their corresponding spherical harmonics, obtained with Mathematica
1
P00 ðcos hÞ ¼ 1 P01 ðcos hÞ ¼ cos h P02 ðcos hÞ ¼ ð3 cos2 h 1Þ
2
3.4.2 Spin
The yellow color imparted to a flame by sodium atoms results from emission from
the electronic configuration 1s22s22p63p1 to the ground state configuration
1s22s22p63s1. While this spectral feature of atomic sodium atoms is usually called
the sodium D line, it is in fact associated to two closely spaced spectral lines. In
1925, Uhlenbeck and Goudsmit explained this fine structure by proposing that the
electron has an intrinsic angular momentum called the spin angular momentum,
or more simply, spin, with two possible values of its projection along a preferred
axis such as the axis of an applied magnetic field (Fig. 3.8). Later, in 1928, Dirac
developed the relativistic quantum mechanics of the electron, where the electron
spin arises naturally. In nonrelativistic quantum mechanics, the spin must be con-
sidered an additional hypothesis. While the orbital angular momentum we have
134 3 The Schrödinger Equation
where the +1/2 and −1/2 values of ms correspond to a and b spin eigenfunctions
that form a complete set of orthonormal functions, i.e., normalized and mutually
orthogonal functions,
Z Z Z Z
a adr ¼ b bdr ¼ 1 a bdr ¼ b adr ¼ 0 ð3:94Þ
Since the spin is a quantum concept without classical equivalent, (3.94) should
be seen as merely expressing formal equations.
^ ¼ h r2 þ VðrÞ
2
H ð3:95Þ
2l
3.5 Hydrogen Atom 135
where
@2 @2 @2
r2 þ þ ð3:96Þ
@x2 @y2 @z2
is the Laplacian operator and l is the reduced mass of the electron–nucleus system
that satisfies the following equality:
1 1 1
þ ð3:97Þ
l me mN
For the lightest atom protium, the mass of the nucleus is about 2 103 times the
mass of the electron. Therefore, the second term on the right-hand side of (3.97) is
negligible, and l is approximately equal to the electron mass,
l me ð3:98Þ
e2
VðrÞ ¼ ð3:99Þ
4pe0 r
@2 2@ 1 ^2
r2 ¼ þ 2 2L ð3:100Þ
@r 2 r @r r h
Therefore, the Hamiltonian for the hydrogen atom commutes with the square of
the angular momentum operator [see (3.89)] and with the z-component of the
angular momentum operator [see (3.90)], and we can write
^ L
½H; ^2 ¼ 0 ^ L
½H; ^z ¼ 0 ð3:101Þ
with the radial factors and spherical harmonics being labeled by the n, ‘, and
m quantum numbers. Alternatively, ‘ can be substituted by a letter, where the first
letters of the words of spectroscopic origin sharp, principal, diffuse, and funda-
mental stand for ‘ = 0, 1, 2, and 3, respectively. After the letter f, the alphabetical
order is followed, but j is omitted, and s and p are skipped because they have
already been used for ‘ = 0 and 1,
‘ 0 1 2 3 4 ...
s p d f g ...
Boundary conditions for the radial factors imply the introduction of the quantum
number n. For n 3 and the hydrogen atom (Z = 1) wave functions in atomic
units (unit of length is a0 = 0.52917721092 10−10 m; see Appendix), the radial
factors are given by
R1s ¼ 2er
1 1
R2s ¼ pffiffiffi ð2 rÞer=2 R2p ¼ pffiffiffi rer=2
2 2 2 6
2 2 r=3 4 4
R3s ¼ pffiffiffi ð27 18r þ 2r Þe R3p ¼ pffiffiffi ð6 rÞrer=3 R3d ¼ pffiffiffiffiffi r2 er=3
81 3 81 6 81 30
ð3:103Þ
(E25, E26).
The probability of finding the electron with spherical coordinates lying in the
ranges r to r + dr, h to h + dh, and / to / + d/ is given by
2
jwn‘m ðr; h; /Þj2 r 2 sin hdrdhd/ ¼ R2n‘ ðrÞr 2 dr
Y‘m ðh; /Þ
sin hdhd/ ð3:104Þ
Z2p Zp
m
Y ðh; /Þ
2 sin hdhd/ ¼ 1 ð3:106Þ
‘
0 0
The jRn‘ ðrÞj2 r 2 are called radial distribution functions (Fig. 3.9).
3.6 Antisymmetry Principle 137
Fig. 3.9 Radial distribution functions for the 1s, 2s, and 2p hydrogen orbitals. Compare 2s and
2p and note the 2s lobes separated by a node. This graph was obtained with Mathematica
^ 12 Wðs1 ; s2 Þ ¼ Wðs2 ; s1 Þ
P ð3:107Þ
Applying this operator twice has no net effect, since the initial configuration is
restored,
P ^ 12 P
^ 212 Wðs1 ; s2 Þ ¼ P ^ 12 Wðs2 ; s1 Þ ¼ Wðs1 ; s2 Þ
^ 12 Wðs1 ; s2 Þ ¼ P ð3:108Þ
^ 12 Wðs1 ; s2 Þ ¼ Wðs1 ; s2 Þ
P ð3:109Þ
138 3 The Schrödinger Equation
and the wave function is said to be antisymmetric with respect to the interchange
of coordinates of two electrons including the spin coordinates (antisymmetry
principle). If s2 = s1, then
and consequently,
Therefore, two electrons with the same spin have zero probability of being found
at the same point. Since the wave function is a well-behaved function and so is
continuous, the probability of finding two electrons with the same spin in the same
region of space should be small. This consequence of the antisymmetry principle is
called the Pauli exclusion principle.
How to build a many-electron wave function W from one-electron functions, so
that W obeys the antisymmetry requirement? First of all, we take each one-electron
function as being obtained by the product of a spatial factor and one of the spin
functions a or b. For electron 1, we can write
w ð1Þ wb ð1Þ wc ð1Þ
a
W¼ pffiffiffiffi
wa ð2Þ wb ð2Þ wc ð2Þ
ð3:117Þ
3!
w ð3Þ w ð3Þ wc ð3Þ
a b
wa ð2Þ wb ð2Þ wc ð2Þ
wa ð1Þ wb ð1Þ wc ð1Þ
w ð1Þ w ð1Þ w ð1Þ
¼
w ð2Þ w ð2Þ w ð2Þ
ð3:118Þ
a b c
a b c
w ð3Þ w ð3Þ w ð3Þ
w ð3Þ w ð3Þ w ð3Þ
a b c a b c
[note that normalizing constants have been omitted in (3.118)]. In addition, two
equal columns make the Slater determinant equal to zero, since
wa ð1Þ wa ð1Þ wc ð1Þ
w ð2Þ wa ð2Þ wc ð2Þ
¼ 0 ð3:119Þ
a
w ð3Þ wa ð3Þ wc ð3Þ
and the corresponding probability density becomes zero. Therefore, no two elec-
trons of an atom or a molecule can have the same spin-orbital. This conclusion
shows that the Pauli exclusion principle is a corollary of the antisymmetric behavior
of the wave function.
If columns 1 and 3 of the Slater determinant in (3.117) are substituted by
normalized orthogonal combinations of these columns, the resulting determinant
remains the same or simply changes its sign (the corresponding probability density
is not altered),
140 3 The Schrödinger Equation
w ð1Þ þ w ð1Þ
a pffiffi c w ð1Þ wa ð1Þw pffiffi c
ð1Þ
b
wa ð1Þ wb ð1Þ wc ð1Þ
w ð2Þ þ w ð2Þ
2 2
a pffiffi c wb ð2Þ wa ð2Þw pffiffi c
ð2Þ
¼
w ð2Þ w ð2Þ wc ð2Þ
ð3:120Þ
2 2
a b
wa ð3Þ þ wc ð3Þ wa ð3Þwc ð3Þ
w ð3Þ w ð3Þ w ð3Þ
pffiffi w ð3Þ pffiffi
a b c
2 b 2
wa ðs1 Þ wb ðs1 Þ
1
Wðs1 ; s2 Þ ¼ pffiffiffiffi
¼ pffiffiffi ½wa ðs1 Þwb ðs2 Þ wa ðs2 Þwb ðs1 Þ ð3:121Þ
2! wa ðs2 Þ wb ðs2 Þ
2
ðS¼0;MS ¼0Þ 1
Wðs1 ; s2 Þ ¼ pffiffiffiffi
ð3:122Þ
1
¼ /a ðr1 Þ/a ðr2 Þ pffiffiffi ½að1Þbð2Þ að2Þbð1Þ
2
Electrons 1 and 2 have antiparallel spins, so the sum Ms = ms1 + ms2 is zero.
Therefore, the total spin is zero (S = 0), and the number of distinct values of MS, or
electron-spin multiplicity (2S + 1), is equal to 1 (the state is called a singlet).
Consider now an open shell configuration with electrons 1 and 2 in different
orbitals, both with spins a. The corresponding Slater determinant is the product of
an antisymmetric product of spatial factors and the simple product of spin
functions:
1
/a ðr1 Það1Þ /b ðr1 Það1Þ
ð3:123Þ
1
¼ pffiffiffi ½/a ðr1 Þ/b ðr2 Þ /a ðr2 Þ/b ðr1 Það1Það2Þ
2
1
/a ðr1 Þbð1Þ /b ðr1 Þbð1Þ
ð3:125Þ
1 1
¼ pffiffiffi ½/a ðr1 Þ/b ðr2 Þ /a ðr2 Þ/b ðr1 Þ pffiffiffi ½að1Þbð2Þ þ bð1Það2Þ
2 2
Solving the Schrödinger equation for atoms or molecules with more than one
electron requires the use of approximate methods, the most powerful of these being
based on the variational theorem. Consider a system with a Hamiltonian whose
lowest-energy eigenvalue is E0, and a well-behaved function / that satisfies the
boundary conditions of the system. According to the variational theorem,
R
^
/ H/ds
R
E0 ð3:126Þ
/ /ds
^ k ¼ Ek wk
Hw ð3:127Þ
and form a complete orthonormal set of functions that can be used to expand /,
X
/¼ ak wk ð3:128Þ
k
To prove (3.126), we substitute (3.128) into (3.126) and use (3.127) to obtain
R P P P
^
/ H/ds k aj ak Ek djk k a k a k Ek
W R ¼ P P ¼ P
j
E0 ð3:129Þ
/ /ds j k aj ak djk k ak ak
142 3 The Schrödinger Equation
where djk is the Kronecker delta (djk = 0, for j 6¼ k, and djk = 1, for j = k) and the
inequality is justified because E0 is the lowest-energy eigenvalue.
If / is a linear combination of n linearly independent functions that satisfy the
boundary conditions of the system,
X
n
/¼ cj fj ð3:130Þ
j¼1
the set of functions fj is said to be a set of basis functions with the coefficients
being the variation parameters. Assuming real basis functions and real coefficients,
then
R P P
^
/ H/ds k cj ck Hjk
W¼ R ¼P P
j
ð3:131Þ
/ /ds j k cj ck Sjk
where
Z Z
Sjk ¼ fj fk ds Hjk ¼ ^ k ds
fj Hf ð3:132Þ
@W
¼0 ð3:133Þ
@cj
which is equivalent to
@K @D
W ¼0 ð3:134Þ
@cj @cj
where
XX XX
K¼ cj ck Hjk D¼ cj ck Sjk ð3:135Þ
j k j k
(E27). Illustration of the above equations with a set of two basis functions leads to
3.7 Variational Method 143
@K @D
@c1 ¼ 2c1 H11 þ 2c2 H12 @c1 ¼ 2c1 S11 þ 2c2 S12
@K @D ð3:136Þ
@c2 ¼ 2c1 H12 þ 2c2 H22 @c2 ¼ 2c1 S12 þ 2c2 S22
(E28, E29, E30). A nontrivial solution for this system of equations (the trivial
solution c1 = c2 = 0 corresponds to a null / everywhere) requires a vanishing
determinant of the coefficients, i.e.,
H11 WS11 H12 WS12
¼0 ð3:138Þ
H12 WS12 H22 WS22
where a and b refer to nuclei, i and j to electrons. The first and second terms on the
right side of (3.139) are operators for the kinetic energies of the nuclei and elec-
trons, and the last three terms are potential energy operators for the nucleus–nucleus
repulsions, electron–nucleus attractions, and electron–electron repulsions. Benzene,
a typical medium-size molecule, has 12 nuclei and 42 electrons corresponding to 36
nuclear coordinates and 126 electron coordinates. Solving the Schrödinger equation
for this molecule is a formidable task. However, nuclei are much heavier than
electrons: one proton mass is about 1.8 103 times the electron mass, and one 12C
nucleus mass is about 2.2 104 times the electron mass. This suggests an
approximation whereby electron and nuclear coordinates are dealt with separately,
with the electrons carrying out their motions while the nuclei are fixed. The Born–
Oppenheimer approximation deals with the separation between electron and
nuclear coordinates in two steps. In the first step, the kinetic energy of the nuclei is
assumed to be zero and the Schrödinger equation
^e þ V
ðH ^NN Þwe ðsi ; Ra Þ ¼ UðRa Þwe ðsi ; Ra Þ ð3:140Þ
is solved, where
^ e ¼ T^e þ V
H ^Ne þ V
^ee ð3:141Þ
^ N wN ðRa Þ ¼ EN wN ðRa Þ
H ð3:142Þ
where
^ N ¼ T^N þ UðRa Þ
H ð3:143Þ
0 2
^NN ¼ ZZ e
V ð3:144Þ
4pe0 R
3.8 Born–Oppenheimer Approximation 145
Fig. 3.10 A U(R) function for the ground electronic state of F2 obtained by scanning the
internuclear distance (Gaussian 09 B3LYP/cc-pVTZ calculation). The inset shows the zero-point
energy level marked with 0 and the first, second, and third stretching energy levels marked with 1,
2, and 3, respectively. This figure was obtained with Mathematica
where Z and Z′ are atomic numbers and R is the internuclear distance. The potential
energy curve U(R) is a function of R, and the difference between the energy at
infinite internuclear distance and the energy at the minimum Re is called the
equilibrium dissociation energy,
harmonic behavior, that is, the 2–1 and 3–2 transitions were made equal to the
harmonic frequency (anharmonicity was ignored). All these calculations were
carried out using the system of programs Gaussian 09, with the B3LYP method of
calculation and the cc-pVTZ basis set (the student can leave the technical details of
this calculation to Sect. 3.15).
where f and g are the products of radial factors by spherical harmonics, the former
being the variation functions. Zero superscripts indicate that the one-electron wave
functions are initial guesses, a concept that will soon be understood, and 1 and 2
inside parentheses stand for spherical coordinates (r1, h1, /1) and (r2, h2, /2) of
electrons 1 and 2. The average repulsion potential “felt” by electron 1 is obtained by
integrating the density of electron 2 over all its positions (integral over
dv2 ¼ r22 sin h2 dr2 dh2 d/2 ). In atomic units (see Appendix), this average potential is
given by
ð0Þ
2
ð0Þ g ð2Þ
V1 ð1Þ ¼ dv2 ð3:147Þ
r12
(E31). The above potential is a function of the coordinates of electron 1 only, since
the integration over dv2 removes the coordinates of electron 2, thus replacing
3.9 Hartree–Fock Method 147
ð0Þ
2
ð0Þ f ð1Þ
V2 ð2Þ ¼ dv1 ð3:149Þ
r12
and
R 2p R p ð0Þ
ð0Þ 0 V2 ð2Þ sin h2 dh2 d/2
V2 ðr2 Þ ¼ 0
R 2p R p ð3:150Þ
0 0 sin h2 dh2 d/2
where the second terms inside square brackets are the electron–nucleus attractions
(Z is the atomic number), and r1 and r2 are the distances from electrons 1 and 2 to
the atomic nucleus.
Note that the Hamiltonians for the above equations depend on one-electron wave
functions. In particular, V(0) (0) (0) (0)
1 (r1) depends on g (2), and V2 (r2) depends on f (1).
(1) (1)
Orbitals f (1) and g (2), obtained by solving these equations, are used to evaluate
V(1) (1)
1 and V2 , and these potential operators are introduced back into the one-electron
Schrödinger equations to obtain f(2)(1) and g(2)(2). This iterative procedure con-
tinues until no significant change is found in the one-electron wave functions and
corresponding energies, from one iteration to the next. Then, one says that
self-consistency has been attained. Energy eigenvalues e1 and e2 include the 1–2
and 2–1 repulsions, respectively. These repulsions are the same, so the sum e1 + e2
contains the electron–electron repulsion twice. To obtain the total energy, the
148 3 The Schrödinger Equation
repulsion between electron densities |f(1)|2 and |g(2)|2 has to be subtracted from the
sum e1 + e2,
ZZ
jf ð1Þj2 jgð2Þj2
E ¼ e1 þ e2 dv1 dv2 ¼ e1 þ e2 J12 ð3:153Þ
r12
X
n=2 n=2 X
X n=2
EHF ¼ 2 ei ð2Jij Kij Þ þ VNN ð3:154Þ
i¼1 i¼1 j¼1
(E34, E35). While each Coulomb integral represents the repulsion between two
orbital densities, each exchange integral can be physically associated with the
interaction between two overlap densities and results from antisymmetry in the
wave function. Integrations over electron coordinates in (3.155) and (3.156) have
the effect of dealing with Coulombic and exchange electron–electron interactions,
not as real instantaneous repulsions, but as averages over the spatial coordinates of
the electrons. This is the price the Hartree–Fock method pays for dealing with
one-electron functions (orbitals) and, at the same time, having to calculate integrals
involving the electron–electron distance rij.
where n, ‘, and m are quantum numbers, f is the orbital exponent, and the nor-
malization constant is given by
ð2fÞn þ 1=2
Nnf ¼ ð3:158Þ
½ð2nÞ!1=2
When the Hartree–Fock SCF procedure is carried out for a closed shell electronic
configuration, that is, for a nondegenerate ground state (electron-spin multiplicity
2S + 1 = 1), the wave function is given by a single Slater determinant of
spin-orbitals. The orthonormal orbitals that minimize the Hartree–Fock energy
(3.154) satisfy the Hartree–Fock equations,
^
Fð1Þ/i ð1Þ ¼ ei /i ð1Þ ð3:159Þ
1 X Za X
n=2
^
Fð1Þ ¼ r21 þ 2^Jj ð1Þ K
^ j ð1Þ ð3:160Þ
2 a
r1a j¼1
where the subscript a runs over the nuclei, and the Coulomb and exchange op-
erators are defined by the following equalities:
Z
/j ð2Þ/j ð2Þ
^Jj ð1Þ/i ð1Þ ¼ /i ð1Þ dv2 ð3:161Þ
r12
and
Z
/j ð2Þ/i ð2Þ
^ j ð1Þ/i ð1Þ ¼ /j ð1Þ
K dv2 ð3:162Þ
r12
X
b
/i ¼ vt cti ð3:163Þ
t¼1
where b is the total number of basis functions. Substitution of (3.163) in the Fock
equations (3.159) and multiplication by the complex conjugate of vs followed by
integration leads to
X
b
ðFst ei Sst Þcti ¼ 0 s ¼ 1; 2; . . .; b ð3:164Þ
t¼1
where
Z Z
Fst ¼ ^ t dv
vs Fv Sst ¼ vs vt dv ð3:165Þ
This equation leads to a polynomial whose unknowns are the orbital energies ei.
Solving the equations that result from the substitution of each determined ei in
(3.164) and normalizing /i gives the coefficients cti, where t = 1, 2, …, b. The
F matrix elements are obtained from the Fock operator, which in turn depends on
the /i orbitals. Therefore, the Hartree–Fock–Roothaan equations are solved by an
iterative process that ends when self-consistency is reached.
Rearranging (3.164) leads to
X
b X
b
Fst cti ¼ Sst cti ei s ¼ 1; 2; . . .; b ð3:167Þ
t¼1 t¼1
which suggests the use of matrix formalism. In fact, the first member shows that
each row of the F matrix (a fixed s value, with t varying from 1 to b) multiplies a
column of the matrix of coefficients (with a defined i, with t varying from 1 to b).
Repeating this procedure for all values of s and subsequently all values of i gives
the product of the matrices F and C. In the second member of (3.167), the matrix
3.9 Hartree–Fock Method 151
S replaces the matrix F of the first member and each column of the matrix of
coefficients C is multiplied by the corresponding orbital energy value. In matrix
formalism, Eq. (3.167) are expressed as
FC ¼ SCe ð3:168Þ
A Hartree–Fock SCF wave function takes the interactions between electrons only in
an average way, whereas interelectronic repulsions are in reality instantaneous.
Thus, the difference between the exact nonrelativistic energy and the Hartree–Fock
energy is defined as the correlation energy
where the integrals extend to the spatial coordinates of electrons 2 to n and to the
spin coordinates of electrons 1 to n. The electrons are indistinguishable, so the
probability for finding any electron, not just electron 1, in the volume dv, is n times
(3.171). Therefore, the ground state electron probability density is
Z Z
2
q0 ðrÞ ¼ n ... jW0 ðs1 ; . . .; sn Þj dv2 . . .dvn dr1 . . .drn ð3:172Þ
We now consider the pure electronic Hamiltonian (3.141) and the electronic
Schrödinger equation (3.140), within the Born–Oppenheimer approximation
(Sect. 3.8). The electronic Schrödinger equation is solved for fixed positions of the
nuclei, since nucleus–nucleus repulsions simply add a constant to the electronic
Hamiltonian in the electronic Schrödinger equation (3.140). The set of nuclei in
their fixed positions in the molecule, that is, the nuclear framework of the molecule,
originates on each electron i the potential given by
3.10 Density Functional Theory 153
X Za
vðri Þ ¼ ð3:174Þ
a
rai
X
n
^Ne ¼
V vðri Þ ð3:175Þ
i¼1
Since the electrons are indistinguishable and v(r) commutes with W0 [v(r) is
simultaneously an operator and a function, and as such, commutes with another
function], we can write
Z Z !
X
n
. . . W0 vðri Þ W0 dv1 . . .dvn dr1 . . .drn
Z Z
i¼1 Z
¼ vðrÞn 2
. . . jW0 j dv2 . . .dvn dr1 . . .drn dv ð3:176Þ
Zb Zb Zb
2 1=2
1=2
J¼ ds ¼ dx þ dy
2
¼ 1 þ y02 dx ð3:178Þ
a a a
154 3 The Schrödinger Equation
where dy = y′dx. This integral sets a correspondence between its value J, a real
number, and the derivative of each plane curve connecting the given points a and b,
the function y′. The integral J is a functional of y′, and this is represented by J[y′].
We are interested in finding the curve with the shortest length. Of course, this is a
trivial question, since the shortest path between two points is a straight line.
However, solving this problem in a systematic way requires evaluation of functional
derivatives, an important question in density functional theory. To this end, let us
consider the more general functional
Zb
J½y ¼ Fðx; y; y0 Þdx ð3:179Þ
a
so that h(a) = h(b) = 0 (a and b are fixed endpoints, so the variations at these points
are zero). The corresponding increment for the functional (3.179) is
Zb
DJ ¼ J½y þ h J½y ¼ ½Fðx; y þ h; y0 þ h0 Þ Fðx; y; y0 Þdx ð3:181Þ
a
Zb
DJ ¼ Fy h þ Fy0 h0 dx þ ð3:182Þ
a
where Fy and Fy′ represent partial derivatives of F with respect to y and y′, and the dots
represent terms of order higher than 1, relative to h and h′. The differential of J[y] is
Zb
dJ ¼ Fy h þ Fy0 h0 dx ð3:183Þ
a
(note the use of the symbol d to represent a functional differential and compare (3.183)
with (3.179)). Integration by parts of the second term inside square brackets leads to
Zb Zb Zb
b d
dJ ¼ 0
Fy h þ Fy0 h dx ¼ Fy hdx þ Fy0 h
a
Fy0 hdx
dx
a a a
Zb
d
¼ Fy Fy0 hdx ð3:184Þ
dx
a
3.10 Density Functional Theory 155
since the second term is zero because h(a) = h(b) = 0. At a stationary point,
Zb
d
dJ ¼ Fy Fy0 hðxÞdx ¼ 0 ð3:185Þ
dx
a
d
Fy F y0 ¼ 0 ð3:186Þ
dx
This equation, called the Euler–Lagrange differential equation, is central to
the calculus of variations. The necessary condition for J[y] to have an extremum
(maximum or minimum) is that y(x) satisfy the Euler–Lagrange equation (see
Gelfand and Fomin, Further Reading, Chap. 1, Sect. 4).
Returning now to the example of the curve in a plane with the shortest length
between two points a and b and applying the Euler–Lagrange equation, we obtain
Fy = 0, Fy’ = y′/√(1 + y′2), dFy′/dx = 0, and y′ = constant, that is, y(x) = Ax + B, as
anticipated.
Since h(x) is an arbitrarily small variation in y(x) [see (3.180)], we can represent
h(x) by dy and rewrite the last equality of (3.184) as
Zb
d
dJ ¼ Fy Fy0 dydx ð3:187Þ
dx
a
dJ d
¼ F y F y0 ð3:188Þ
dy dx
(a proof of this statement can be found in Gelfand and Fomin, Further Reading,
Chap. 1, Sect. 7). Hence, the Euler–Lagrange equation (3.186) can be formulated
as expressing that the functional derivative at a stationary point is zero, a result that
is also applicable to an ordinary derivative of a function at a stationary point.
We now consider (3.177) for a trial electron density q and, using (3.179), write
F(r, q) = v(r)q(r) and apply (3.188). Since F does not explicitly depend on q′, its
derivative with respect to q′ is zero. In turn, the derivative of F with respect to q is
v(r). Hence, using (3.188), we conclude that the functional derivative of VNe with
respect to q is equal to the external potential v(r),
dVNe ½qðrÞ
¼ vðrÞ ð3:189Þ
dqðrÞ
156 3 The Schrödinger Equation
This result connects the external potential v(r) with the average nuclei–electrons
attraction potential VNe. The Mathematica code M8 calculates several functional
derivatives and confirms (3.189).
where E0, Te, VNe, and Vee represent ground state averages of the Hamiltonian,
electron kinetic energy operator, electron–nucleus attractions, and electron–electron
repulsions, respectively. Since E0 is a functional of the electron density q0 and VNe
is a known functional of q0 [see (3.177)], we conclude that Te and Vee are also
functionals of q0 and write
Z
E0 ½q0 ¼ Te ½q0 þ vðrÞq0 ðrÞdv þ Vee ½q0 ð3:191Þ
3.10 Density Functional Theory 157
Note that the Hohenberg–Kohn theorem does not tell us how to calculate E0
from q0, since Te and Vee are unknown functionals of q0.
For a trial electron density q(r) that satisfies
Z
qðrÞdv ¼ n ð3:192Þ
and q(r)
0 for all r, Hohenberg and Kohn also proved that
Z
E½q ¼ Te ½q þ vðrÞqðrÞdv þ Vee ½q
E0 ½q0 ð3:193Þ
that is, the true ground state electron density minimizes the energy functional. This
variational theorem shows that no trial electron density can give a lower ground state
energy than the exact ground state electron density. Note that the validity of this theorem
requires the use of the electronic Hamiltonian, as mentioned above [see (3.190)].
Let us now recall the first Hohenberg–Kohn theorem, which tells us that the ground
state electron probability density q0 uniquely determines the ground state molecular
energy and all other ground state molecular properties. However, this theorem does
not tell us how to calculate E0 from q0, since Te and Vee in (3.191) are unknown
functionals of q0, nor does it provide us with a method for evaluating q0 without
having first to obtain the ground state wave function.
In 1965, Kohn and Sham considered a fictitious reference system (we label this
system by the subscript s) formed by n noninteracting electrons, with each electron
being subject to the effective potential vs(ri), which was chosen so that the ground
state electron density of the reference system qs is equal to the exact ground state
electron density of the real molecule, qs = q0 (Kohn and Sham 1965). Note that the
electron probability density is a function of 3 variables and is obtained from a wave
function of 4n variables by integration over 4n-3 variables [see (3.172)]. The
connection between this noninteracting system and the target molecule is provided
by the first Hohenberg–Kohn theorem, since the ground state electron density of the
reference system qs, being equal to q0, uniquely determines the ground state
molecular energy and all other ground state molecular properties of our real target
system of interacting electrons. The noninteracting electrons of the fictitious system
enable us to define one-electron wave functions (spin-orbitals) and form a Slater
determinant of the lowest-energy Kohn–Sham spin-orbitals,
ws;0 ¼
vKS KS KS
1 v2 . . .vn k ¼ hi a
vKS KS
k þ 1 ¼ hi b
vKS KS
ð3:194Þ
where k varies from 1 to n, i varies from 1 to n/2 (for n electrons, the closed shell
configuration has n/2 occupied orbitals), and each spin-orbital vk is the product of a
Kohn–Sham orbital hi and the spin function a or b. Note that the Slater
158 3 The Schrödinger Equation
pffiffiffiffi
determinant includes the normalizing factor 1= n!. Just as the orthonormal orbitals
that minimize the Hartree–Fock energy satisfy the Fock equations, the Kohn–Sham
orthonormal orbitals that minimize the ground state energy satisfy the Kohn–Sham
eigenvalue equations
X
n=2
KS
2
q0 ¼ qs ¼
h
ð3:197Þ
i
i¼1
(E38).
We now recall Eq. (3.191), which represents the first Hohenberg–Kohn theorem,
Z
E0 ½q0 ¼ Te ½q0 þ vðrÞq0 ðrÞdv þ Vee ½q0 ð3:198Þ
with Te and Vee unknown functionals of q0. The kinetic energy operator is a sum of
one-electron kinetic energy operators. Hence, for the noninteracting system of Kohn
and Sham, it can be shown that the kinetic energy is given by
Z Z " #
X
n
Te;s ½q0 ¼ ... ws;0 T^e ðiÞ ws;0 dv1 . . .dvn dr1 . . .drn
n Z
i¼1
X
¼ ðhKS ^
i Þ ð1ÞTe ð1Þ hi ð1Þdv1
KS
ð3:199Þ
i¼1
(E39). Note that this kinetic energy is not equal to the kinetic energy of the real
interacting system, even if the noninteracting and interacting systems share the
same electron density. Therefore, we can write
where DTe is the difference between the average kinetic energy of the molecule and
the average kinetic energy of the reference system. We now consider the third term
on the right-hand side of (3.198) and write
ZZ
1 q0 ðr1 Þq0 ðr2 Þ
Vee ½q0 ¼ dv1 dv2 þ DVee ½q0 ð3:201Þ
2 r12
3.10 Density Functional Theory 159
where the second term of the right-hand side of this equation is the difference
between the average electron–electron repulsion energy of the molecule and the
electrostatic electron–electron repulsion obtained from the ground state electron
density q0. In order to understand the 1/2 factor of the first term of (3.201), we need
to go back to (3.172) and find that each q0 factor involves a multiplication by n, that
is, it runs over n electron labels that are different in both q0 factors, as required by
the operator 1/rij (i 6¼ j). For example, for electrons with labels 1, 2, and 3, the
six-dimensional integration of (3.201) applies to a sum over electron pairs 1–2, 1–3,
2–1, 2–3, 3–1, and 3–2. Hence, each electron–electron pair is repeated, and the
factor 1/2 corrects for that double counting. Considering now the physical meaning
of the electron charge density in atomic units −q0, the products −q0(r1)dv1 and
−q0(r2)dv2 in the six-dimensional integral of (3.201) represent electronic charges in
the volume elements dv1 and dv2. Therefore, on the right hand-side of (3.201), the
first term represents the electrostatic electron–electron repulsion energy.
Substitution of the result of (3.199) in (3.200) and subsequent substitution of the
resulting Eq. (3.200) and of (3.201) in (3.198) leads to
n Z
X
E0 ½q0 ¼ ðhKS ^
i Þ ð1ÞTe ð1Þhi ð1Þdv1
KS
i¼1
Z ZZ ð3:202Þ
1 q0 ðr1 Þq0 ðr2 Þ
þ vðrÞq0 ðr Þdv þ dv1 dv2 þ Exc ½q0
2 r12
where the delta terms DTe and DVee are grouped in the so-called
exchange-correlation energy functional defined by
The energy components of Exc[q0] are the kinetic correlation energy, which is
the DTe difference between the kinetic energy for the real molecule and the kinetic
energy of the reference system, the exchange energy, which results from the
antisymmetric characteristic of the wave function [see (3.194)], the Coulombic
correlation energy, which accounts for the influence of electron correlation on the
electrostatic electron–electron repulsions described by the first term of the
right-hand side of (3.201), and the self-interaction correction, which results from
the fact that in the electrostatic electron–electron repulsion energy, the electron
density q0(r1) is different from zero in dv1, and so electron 1 erroneously interacts
with itself, the same thing happening with q0(r2) and dv2.
Since the exact form of the exchange-correlation energy functional is unknown,
one needs to resort to approximate functionals to describe it. Having arrived at the
expression for the ground state energy functional [see (3.202)], we can now be a bit
more explicit about the external potential operator of (3.196) for the one-electron
Kohn–Sham Hamiltonian
160 3 The Schrödinger Equation
X Za Z
^ 1 qðr2 Þ
hKS ð1Þ ¼ T^e ðr1 Þ þ vs ðr1 Þ ¼ r21 þ dv2 þ vxc ð1Þ ð3:204Þ
2 a
r1a r12
[see (3.177)]. This equation results from (3.205), since we defined vxc(r) from
Exc[q0], not the other way round.
3.10.6 Overview
Let us now summarize the main steps in the derivation of the density functional
theory equations. The most important statement lies in the first Hohenberg–Kohn
theorem, which says that the ground state electron probability density q0 uniquely
determines the Hamiltonian operator, the ground state molecular energy, and all
other ground state molecular properties.
Within the Born–Oppenheimer approximation, the electronic Hamiltonian is the
sum of operators for the kinetic energy of electrons, potential energy of the elec-
tron–nucleus attractions, and potential energy of electron–electron repulsions.
Based on the first Hohenberg–Kohn theorem, we conclude that the electronic
energy and its component terms are functionals of q0. However, this theorem does
not tell us how to calculate E0 from q0, since Te and Vee are unknown functionals of
q0.
The Kohn–Sham method overcomes this difficulty by considering a fictitious
reference system formed by n noninteracting electrons, with each electron subject to
an unknown effective potential, which in turn is supposed to originate the exact
ground state electron density for the real molecule.
The fictitious reference system of noninteracting electrons leads to one-electron
functions (i.e., spin-orbitals), thus enabling the calculation of the average value of
the kinetic operator for the corresponding Slater determinant. Since this kinetic
energy is not equal to the kinetic energy of the real interacting system, one needs to
determine the difference DTe between the average kinetic energy of the real
molecule and the average kinetic energy of the reference system.
In turn, the average electron–electron repulsion energy is another unknown
functional. However, since the electrostatic electron–electron repulsion obtained
from the ground state electron density q0 can in principle be calculated, the
3.10 Density Functional Theory 161
difference DVee between the average electron–electron repulsion energy of the real
molecule and the electrostatic electron–electron repulsion of the fictitious nonin-
teracting system remains to be calculated.
Now the delta terms DTe and DVee are grouped in the so-called
exchange-correlation energy functional, whose exact form is unknown. For this
reason, in the various DFT methods of calculation, approximate functionals are
used to describe the exchange-correlation energy functional.
One practical question remains to be answered: how to obtain the electron
density of the target molecule? The answer to this question is provided by the
variational Hohenberg–Kohn theorem, which shows that the true ground state
electron density minimizes the energy functional obtained using the exact electronic
Hamiltonian. This theorem suggests a way of improving the trial electron density,
obtained as the sum of the squares of the absolute values of the spatial Kohn–Sham
orbitals [see (3.197)], by varying the electron density, so as to minimize the energy
functional. Like the one-electron Hartree–Fock equations (3.159), the one-electron
Kohn–Sham equations (3.195) also have to be solved iteratively, since the
one-electron Kohn–Sham Hamiltonian depends on all one-electron hi functions [see
(3.197) and (3.204)].
Note the parallelism between the Hartree–Fock and Kohn–Sham methods. Both
assume fictitious systems in order to define one-electron functions (orbitals).
However, in the Hartree–Fock method, each electron is subject to the average
Coulomb repulsion of the remaining electrons, to which is added the exchange
potential that results from the antisymmetry requirement. In turn, in the Kohn–
Sham method, a fictitious reference system is formed by n noninteracting electrons,
with each electron being subject to the effective potential vs(ri) so chosen that the
ground state electron density of the reference system qs is equal to the exact ground
state electron density of the real molecule, qs = q0. The Kohn–Sham equa-
tions (3.195) and (3.196) correspond to the Hartree–Fock equations (3.159) and
(3.160). Both Hartree–Fock and Kohn–Sham methods can be expressed by matrix
formalisms, whereby Hartree–Fock orbitals are replaced by Kohn–Sham orbitals,
and the Fock matrix is replaced by a Kohn–Sham matrix.
After the variational method, the major method for approximately solving the
Schrödinger equation is perturbation theory, applicable to a system B whose
Schrödinger equation cannot be solved but can be obtained from the Schrödinger
equation for system A by the introduction of a small perturbation. A hydrogen atom
in the presence of a small uniform electric field illustrates the use of perturbation
theory. In this section, both the unperturbed Hamiltonian (system A) and the per-
turbation are time-independent operators.
162 3 The Schrödinger Equation
^ k ¼ Ek wk
Hw ð3:207Þ
ð0Þ
where wk represents the wave function for the unperturbed nondegenerate level
ð0Þ ^ differs slightly from H^ 0 by the perturbation
with energy Ek . Suppose that H
^,
operator H 0
^ ¼H
H ^0
^0 þH ð3:209Þ
^ ¼H
H ^ 0 þ kH
^0 ð3:210Þ
where k varies between 0, for the unperturbed system, and 1, for the perturbed
system when the perturbation is fully “switched on.” We now consider a nonde-
ð0Þ ð0Þ
generate eigenfunction w0 and corresponding eigenvalue E0 of the unperturbed
Hamiltonian H^ 0,
and the Schrödinger equation for the system with the perturbation fully “switched
on,”
^ 0 ¼ E0 w0
Hw ð3:212Þ
Substitution of (3.210) in (3.212) shows that both the eigenfunction and the
eigenvalue depend on k,
where s represents the space and spin coordinates. We can now expand in Taylor
series the eigenfunction and the eigenvalue in powers of k, obtaining
3.11 Perturbation Theory 163
@w0 k2 @ 2 w 0 ð0Þ ð1Þ ð2Þ
w0 ¼ ðw0 Þk¼0 þ k þ þ ¼ w0 þ kw0 þ k2 w0 þ
@k k¼0 2! @k k¼0
2
ð3:214Þ
@E0 k2 @ 2 E 0 ð0Þ ð1Þ ð2Þ
E0 ¼ ðE0 Þk¼0 þ k þ þ . . . ¼ E0 þ kE0 þ k2 E0 þ
@k k¼0 2! @k2 k¼0
ð3:215Þ
where
ðkÞ 1 @ k w0 ðkÞ 1 @ k E0
w0 E0 ð3:216Þ
k! @kk k¼0 k! @kk k¼0
We assume that
D
E
ð0Þ
w0
w0 ¼ 1 ð3:217Þ
k1 : H^ 0 E0ð0Þ w0ð1Þ þ H^ 0 E0ð1Þ w0ð0Þ ¼ 0 ð3:221Þ
k2 : ^ 0 E0ð0Þ w0ð2Þ þ H
H ^ 0 E0ð1Þ w0ð1Þ E0ð2Þ wð00Þ ¼ 0 ð3:222Þ
D E D E
ð0Þ
^ 0
ð0Þ ð0Þ
^ 0
ð1Þ
E0 ¼ w0
H E0 ¼ w0
H
ð1Þ ð2Þ
w0 w0 ð3:223Þ
ð1Þ
We now expand the unknown function w0 in the complete set of orthonormal
eigenfunctions of the unperturbed Hamiltonian,
X XD
E
ð1Þ ð0Þ ð0Þ
ð1Þ ð0Þ
w0 ¼ ak w k ¼ wk
w0 wk ð3:224Þ
k6¼0 k6¼0
since the unperturbed and perturbed systems are assumed to have the same
ð1Þ ð0Þ
boundary conditions. The expansion coefficient a0 is zero because w0 and w0 are
orthogonal [see (3.218)]. Multiplying (3.221) on the left by the complex conjugate
ð0Þ
of wk and integrating over all space gives
D
E D E
ð0Þ
ð1Þ ð0Þ
^ 0
ð0Þ
¼ wk
H
ð0Þ ð0Þ
Ek E0 wk
w0 w0 )
|fflfflfflfflfflfflfflffl{zfflfflfflfflfflfflfflffl}
ak D E ð3:225Þ
ð0Þ
^ 0
ð0Þ
D
E w H w
ð0Þ
ð1Þ k 0
ak ¼ wk
w0 ¼ ð0Þ ð0Þ
Ek E0
Hence,
D E
ð0Þ
^ 0
ð0Þ
ð1Þ
X wk
H w0
ð0Þ
w0 ¼ ð0Þ ð0Þ
wk ð3:226Þ
k6¼0 Ek E0
ð0Þ ð0Þ ^ 0
ð0Þ
ð2Þ
X w0
H wk wk H w0
E0 ¼ ð0Þ ð0Þ
ð3:227Þ
k6¼0 Ek E0
thus enabling us to conclude that the second-order energy can be derived from the
first-order correction to the wave function. Also, it can be shown that the third-order
energy could also be obtained from the first-order wave function, and in general, the
(2n + 1)th energy correction can be derived from the nth-order perturbation wave
function.
Perturbation expressions like (3.226) and (3.227) that involve infinite sums are not
suitable for quantitative evaluations. However, the variational principle can be used
3.11 Perturbation Theory 165
ð0Þ
We now multiply (3.222) on the left by the complex conjugate of w0 , integrate
over all space, and apply the orthogonality relationships of (3.218). Using the
Hermitian characteristics of the Hamiltonian and the perturbation operator gives
D
E
ð1Þ
^ 0 ð1Þ
ð0Þ ð2Þ
w0
H E0
w0 ¼ E0 ð3:229Þ
We now return again to Eq. (3.221), multiply it on the left by the complex
^ 0 , and substitute in
conjugate of /, integrate, use the Hermitian characteristics of H
(3.233), obtaining
D
E D
E D
E
^0 ð0Þ
^0 ð1Þ
ð0Þ ð0Þ
^ 0 ð1Þ
ð2Þ
/
H E0
/ þ /
H E0
w0 þ w0
H E0
/
E0 ð3:234Þ
Note that labels 1, 2, …, d do not necessarily imply that the corresponding states
are the lowest-lying states. The wave functions (3.235) are assumed to be
orthonormal. Every linear combination of them is still an eigenfunction of the
unperturbed Hamiltonian with the same eigenvalue,
X
d
^ 0 E0ð0Þ
H
ð0Þ
ak w0;k ¼ 0 ð3:237Þ
k¼1
Assuming that the perturbation splits the degeneracy of the energy level, there is
a unique normalized eigenfunction w0 with eigenvalue E0 for the total Hamiltonian,
H^ E0 w 0 ¼ 0 ð3:238Þ
ð0Þ
X
d
ð0Þ
lim w0 ¼ /0 ¼ ak w0;k ð3:239Þ
k!0
k¼1
How can we determine the coefficients ak? As previously done for the nonde-
generate energy level, the eigenfunction w0 and the eigenvalue E0 can be expanded
as Taylor series in powers of k, yielding
Substitution of (3.240) and (3.241) in the Schrödinger equation for the perturbed
system (3.238) leads to
3.11 Perturbation Theory 167
h i
^ 0 E0ð0Þ þ kE0ð1Þ þ k2 E0ð2Þ . . .
^ 0 þ kH
H
ð0Þ ð1Þ ð2Þ
/0 þ kw0 þ k2 w0 þ ¼ 0
ð3:242Þ
that is,
X
d
^ 0 E0ð0Þ w0ð1Þ þ H
H ^ 0 E0ð1Þ ð0Þ
ak w0;k ¼ 0 ð3:244Þ
k¼1
ð0Þ
We now multiply both members by the complex conjugate of w0;m , where m is
one of the states of the d-fold degenerate unperturbed level, and integrate over all
space, obtaining
X
d
^ mk
0 ð1Þ
ak H E0 dmk ¼ 0 ð3:245Þ
k¼1
where
D E
ð0Þ
^ 0
ð0Þ
^ mk
H 0
¼ w0;m
H w0;k ð3:246Þ
ð1Þ
This is an equation of degree d in E0 , with d roots that are first-order correc-
ð0Þ
tions to E0 ,
If all roots of (3.247) are distinct, the degeneracy is completely removed to first
order. Assuming distinct roots for (3.247), we introduce each of them in (3.245),
solve this set of equations for a2/a1, a3/a1, …, ad/a1, and find a1 by normalization of
ð1Þ
(3.239). For each root of (3.247), E0;k , the set of coefficients determines the cor-
responding zeroth-order function (3.239).
168 3 The Schrödinger Equation
h @W0 ^ 0 W0
¼H ð3:249Þ
i @t
where W0 is a function of the coordinates s and time t, W0(s, t). If our system is in a
stationary state of energy E0, then we can use the time-independent Schrödinger
equation
^ 0 W0 ¼ E0 W0
H ð3:250Þ
h @W0
¼ E 0 W0 ð3:251Þ
i @t
where A(s) = W0(s, 0). Hence, A(s) can be identified with the eigenfunction of the
time-independent Schrödinger equation for a stationary state w0(s),
^ 0 w0 ðsÞ ¼ E0 w0 ðsÞ
H ð3:253Þ
Note that
since the exponential factor of W0 in (3.254) is a phase factor eia, and as such does
not affect the probability density given by (3.255).
3.12 Time-Dependent Perturbation Theory 169
h @W ^ 0 ^ 0
¼ H þ H ðtÞ W ð3:259Þ
i @t
where W(s, t) is the new state function of the perturbed system. If we specify a value
for time, for example t = t0, then
X
Wðs; tÞ ¼ bk ðt0 Þ expðiEk0 t=hÞw0k ðsÞ ð3:260Þ
k
P " #
@
h bk ðtÞ expðiEk0 t=hÞw0k X
k ^ 0 ^ 0
¼ ðH þ H Þ bk ðtÞ expðiEk t=hÞwk
0 0
i @t k
ð3:262Þ
We now differentiate the first member with respect to time, use the linear
^ 0 and H
behavior of H ^ 0 and apply (3.256)
X dbk ðtÞ
h X
exp iEk0 t h w0k þ bk ðtÞ exp iEk0 t h Ek0 w0k
i k dt k
X 0 0 X 0 0
¼ bk ðtÞ exp iEk t h Ek wk þ
0 ^ wk
bk ðtÞ exp iEk0 t h H
k k
X dbk ðtÞ
h X 0 0
^ wk
exp iEk0 t h w0k ¼ bk ðtÞ exp iEk0 t h H ð3:263Þ
i k dt k
In order to select the time derivative of bn(t) on the left-hand side of this
equation, we multiply both members by w0 n , integrate over the spatial and spin
coordinates, and use
w0n jw0k ¼ dnk ð3:264Þ
obtaining
dbn ðtÞ iX
^ 0 jw0k
¼ bk ðtÞ expðixnk tÞ w0n jH
dt h k
where
En0 Ek0
xnk ¼
h
As with time-independent perturbation theory [see (3.210)], we introduce now
the scaling parameter k, rewrite the above equation as
dbn ðtÞ iX
^ 0 jw0k
¼ bk ðtÞ expðixnk tÞ w0n jkH ð3:265Þ
dt h k
and expand the coefficients bk(t) in powers of k (see Ballentine, Further Reading,
Sect. 12.5),
3.12 Time-Dependent Perturbation Theory 171
i X ð0Þ
ð1Þ
dbn ðtÞ
¼ ^ 0 jw0k
bk expðixnk tÞ w0n jH ð3:267Þ
dt h k
i X ð1Þ
ð2Þ
dbn ðtÞ
¼ ^ 0 jw0k
bk ðtÞ expðixnk tÞ w0n jH
dt h k
The perturbation is switched on in the time interval 0 < t < s, during which the
coefficients of (3.261) become variable, and so there are no stationary states. For
times t
s, the perturbation vanishes, and the coefficients retain the values they
had at time s, that is, bk(t > s) = bk(s). The probability of having a nondegenerate
state with energy Ef0 is given by |bf(s)|2 [see (3.261)]. In order to obtain this
probability, we begin by using (3.267) and write
ð1Þ
dbf ðtÞ i X ð0Þ D
E
¼ bk expðixfi tÞ w0f
H ^ 0
w0k
dt h k
i D
E
¼ expðixfi tÞ w0f
H
^ 0
w0i ð0Þ
ðbk ¼ dki Þ ð3:269Þ
h
We now integrate both members between t = 0 and t = s, use (3.266) for k = 1,
ð0Þ
and note that bf = 0, obtaining
Zs D
i
0
0E
w0f
H
^
wi expðixfi tÞdt
ð1Þ
bf ðsÞ bf ðsÞ ¼ ð3:270Þ
h
0
172 3 The Schrödinger Equation
Finally, we take the square of the absolute value of both members to calculate
the probability of having the state with energy Ef0 :
s
2
Z
2 1
D 0
0
0 E
bf ðsÞ
wf H^ wi expðixfi tÞdt
ð3:271Þ
2
h
2p 2p
x¼ ¼ 2pv k¼ ¼ 2p~v ð3:273Þ
s k
where x is the angular frequency (s is the period of the radiation and m its fre-
quency), and k is the angular wavenumber (k is the wavelength, its inverse being
the wavenumber). The electric field (the electric force per unit charge) interacts with
charges in the molecule, electrons, and nuclei. The force on charge q is
F ¼ E x q ¼ dV=dx, where V is the potential energy. Integration of the latter
equation gives V ¼ E x qx. Therefore, for a distribution of charges and an x-
polarized electric field, the perturbation is given by
X X
^ 0 ðtÞ ¼ E x
H qa xa ¼ E 0 sinðxt kzÞ qa x a ð3:274Þ
a a
where qa represent the charges and xa their positions along x. Substitution of (3.274)
in (3.270) gives
Zs *
+
X
iE 0
bf ðsÞ w0f
qa xa
w0i sinðxt kzÞ expðixfi Þdt ð3:275Þ
h
a
0
3.13 Absorption and Emission of Radiation 173
Note that the wavelength of ultraviolet radiation is of order 102 nm, and visible
and infrared radiations have larger wavelengths. Hence, the term kz in the argument
of the sine function is small, since k is proportional to the inverse of the wavelength
[see (3.273)]. In addition, the order of magnitude of the largest dimension of a
typical molecule is 1 nm, and so w0i and w0f are vanishingly small for distances
greater than 1 nm. Hence, using the so-called long wavelength approximation, we
can replace (3.275) by the simpler expression
Zs *
+
X
iE 0
bf ðsÞ w0f
qa xa
w0i sinðxtÞ expðixfi tÞdt ð3:276Þ
h
a
iE 0
bf ðsÞ w0f
qa xa
w0i sinðxtÞ expðixfi tÞdt ð3:277Þ
h
a
since w0i and w0f are time-independent wave functions and the dipole moment
operator does not explicitly depend on time. Substitution of
expðixtÞ expðixtÞ
sinðxtÞ ¼ ð3:278Þ
2i
in (3.277) gives
*
+ Zs
E0
X
0
bf ðsÞ wf
qa xa
w0i exp½iðxfi þ xÞt exp½iðxfi xÞt dt
2
h
a
0
ð3:279Þ
In turn, use of
Zs
expðasÞ 1
expðatÞdt ¼ ð3:280Þ
a
0
leads to
*
+
E0
X
exp½iðxfi þ xÞs 1 exp½iðxfi xÞs 1
0
bf ðsÞ 0
wf
q x
w
2
hi
a a a
i xfi þ x xfi x
ð3:281Þ
174 3 The Schrödinger Equation
The first and second terms of the expression in braces lead to 0/0 indeterminate
forms for x = −xfi and x = xfi, respectively, which can be solved by taking the
following limits:
exp½iðxfi xÞs 1
lim ¼ is ð3:282Þ
xfi x!0 xfi x
(E41). These limits can be easily obtained by the quotient of derivatives of the
numerator and denominator, according to l’Hôpital’s rule.
Considering the square of the absolute value of the expression in braces in
(3.281), Fig. 3.11 plots the resulting band shape as a function of x in a neigh-
borhood of xfi. The maximum at x = xfi corresponds to an ascending transition
from i to f that leads to absorption of energy from the electromagnetic radiation. In
turn, the maximum at x = −xfi corresponds to a downward transition from f to
i that leads to stimulated emission of electromagnetic radiation, a radiation process
found in lasers.
The probability of the spectroscopic transition from i to f is given by the square
of the absolute value of bf [see (3.281)]. If the radiation has additional electric field
components along y and z, then we have to add the squares of the absolute values of
(3.281) for the corresponding y and z terms and separate the square of the band
shape factor in (3.281) (see Fig. 3.11), being left with
D E
2
D
E
2
D
E
2
0
^x jw0i
þ
w0f
l
wf jl ^y
w0i
þ
w0f
l
^z
w0i
ð3:283Þ
Fig. 3.11 Band shapes for emission (x = −xfi) and absorption (x = xfi) as functions of x. The
units are arbitrary. Figure obtained with Mathematica
3.13 Absorption and Emission of Radiation 175
Both absorption and stimulated emission of radiation are induced by radiation and
have transition probabilities proportional to the spatial density of radiation, d(x).
The probability of absorption per unit of radiation density and unit of population of
the initial state is called for historical reasons the Einstein B coefficient for
absorption and is equal to the Einstein coefficient for stimulated emission, thus
being represented by the same letter. The coefficient for absorption and stimulated
emission is proportional to the square of the transition dipole moment (3.283).
Let us consider two nondegenerate quantum states a and b, where b is the
excited state. The equality between Einstein coefficients for absorption and stim-
ulated emission implies the occurrence of transitions that are independent of the
radiation density, thus being associated with spontaneous emission. If spontaneous
emission did not exist, state b would acquire the same population as state a, and
then it would not be possible to observe the transition between these states spec-
troscopically. The probability of spontaneous emission per unit of population of
the initial state is called for historical reasons the Einstein A coefficient.
At equilibrium, the increase in the population of state b is balanced with the
decrease of this population, and we can write
where qa and qb are the populations of the a and b states, the first term is the
probability for absorption, the second term stands for the probability of stimulated
emission, and the last term accounts for the probability of spontaneous emission. At
equilibrium and a specified temperature, qa and qb are related by the Boltzmann
distribution, which is given by
We now need to know the radiation density d. Using the spatial density of
radiation in equilibrium with the blackbody at a defined temperature, we write,
according to Planck’s law,
where the symbol / stands for “proportional to.” Substitution of (3.287) in (3.286)
gives
176 3 The Schrödinger Equation
expðhxab =kTÞ 1
A / Bx3 ð3:288Þ
expðhx=kTÞ 1
As mentioned before [see (3.281) and Fig. 3.11], B takes nonnegligible values in
the resonance condition, that is, when x = xab. Hence, (3.288) can be replaced by
A / Bx3 ð3:289Þ
This result shows that spontaneous emission is increasingly important for high
frequencies and justifies the difficulties in obtaining laser radiation in the ultraviolet
region.
In 1928, the Indian physicist C.V. Raman (1888–1970) experimentally showed that
a small fraction of a monochromatic beam of visible light is scattered by a sample
with change of frequency (inelastic light scattering) and that the observed shifts in
frequency depend on the chemical structure of the molecules responsible for the
scattering. For this discovery, Raman was awarded the 1930 Nobel Prize in physics.
Light scattering involves incident and scattered photons. Most of the latter are
scattered with the same frequency as the incident photons (elastic or Rayleigh
scattering, E42), and a small fraction of them are scattered at a different frequency
(inelastic or Raman scattering). In Rayleigh scattering, the energies of the photon
and molecule are separately conserved, whereas in Raman scattering, energy con-
servation is observed for the whole radiation–molecule system, that is, frequency
changes in the radiation and molecule are of equal magnitude but opposite signs. In
Raman scattering, Stokes transitions are those in which the scattered photons lose
energy to the molecule, whereas anti-Stokes transitions are those in which the
scattered photons gain energy from the molecule. Figure 3.12 presents the Rayleigh
and Raman spectra of carbon tetrachloride (liquid) according to the IUPAC
(International Union of Pure and Applied Chemistry) recommendations, where the
abscissa is a linear scale occupied by the Raman shifts presented in cm−1 and
decreasing from left to right (the frequency of the scattered light increases from left
to right, as it should), and the ordinate is linear and proportional to the intensity.
The intensity of Raman scattering is several orders of magnitude lower than the
intensity of Rayleigh scattering. The distribution of Stokes and corresponding
anti-Stokes lines on each side of the Rayleigh line is symmetric in frequency, but
Stokes lines are more intense than the corresponding anti-Stokes lines (Fig. 3.12).
For this reason, a Raman spectrum usually records only the Stokes side of the
spectrum.
3.14 Raman Scattering 177
Fig. 3.12 Rayleigh and Raman spectra of carbon tetrachloride (liquid) presented according to
IUPAC recommendations. Details of band shapes and band intensities should be considered
schematic. This figure was obtained using Mathematica with each band simulated by a linear
combination of Lorentzian and Gaussian functions
lind ¼ aE ð3:290Þ
where the induced dipole moment and the electric field are represented by column
vectors in the (xyz)T basis, and the polarizability tensor is represented by a 3 3
symmetric matrix
0 1
axx axy axz
a ¼ @ ayx ayy ayz A ð3:291Þ
azx azy azz
(E43). In the following equalities of this subsection, we assume that the polariz-
ability is spherical (axx = ayy = azz = a and off-diagonal elements equal to zero) to
make the equations simpler without unnecessary subscripts. The electric field of a
plane-polarized electromagnetic radiation varies in time with the frequency of the
radiation m0 and is given by
178 3 The Schrödinger Equation
E ¼ E 0 cosð2pv0 tÞ ð3:292Þ
In turn, the polarizability changes with the vibrational coordinate involved in the
Raman transition, and its expansion in a Taylor series gives
@a
a ¼ a0 þ Qþ ð3:293Þ
@Q 0
Q ¼ A cosð2pvtÞ ð3:294Þ
Using simple trigonometry [cos (a + b) = cosa cosb − sina sinb], one obtains
1 @a
lind ¼ a0 E 0 cosð2pv0 tÞ þ AE 0 fcos½2pðv0 þ vÞt þ cos½2pðv0 vÞtg
2 @Q 0
ð3:296Þ
This expression shows that the frequency m0 corresponds to the Rayleigh scat-
tering and involves the static polarizability a0, whereas the Stokes and anti-Stokes
Raman frequencies m0 − m and m0 + m involve the polarizability derivative with
respect to the vibrational coordinate (∂a/∂Q)0 (E44). A Raman active vibration
requires
@a
6¼ 0 ð3:297Þ
@Q 0
with the Raman intensity being proportional to the square of the polarizability
derivative with respect to the vibrational coordinate.
X n o
lðq1Þ ¼ aqr fi E r0 exp i x0 xfi
fi
r¼x;y;z
where er0 is the electric field amplitude along r (r can be x, y, or z) (see Long,
Further Reading, Sect. 4.2). Separation of the electric field [see (3.290)] in this
expression leads to the transition polarizability, whose qr component between the
initial and final states i and f is given by
0D
1
1 X @ wf
^
0
lq
w0r w0r
l
^r
w0i lr
w0r w0r
l
w0f
^ ^q
w0i
aqr ¼ þ A ð3:298Þ
fi h r6¼i;f xri x0 iCr xrf þ x0 þ iCr
where w0i , w0f , and w0r are time-independent unperturbed wave functions for the
initial state i, the final state f, and the intermediate state r (these wave functions are
assumed to be real), l ^q is the dipole moment operator, x0 is the angular frequency
of the incident radiation,
x0 ¼ E0 =h ð3:299Þ
and
xri ¼ xr xi ð3:300Þ
where Cr is the half-width of state r, which is related to the lifetime sr through the
energy–time uncertainty principle, as shown below:
1
DEDt h ) DxDt 1 ) 2Cr sr 1 ) sr ð3:302Þ
2Cr
Each state r is not stationary, that is to say, is not a solution of the time-independent
Schrödinger equation, and so does not have a defined energy, and is therefore called a
virtual state. In turn, the initial and final states i and f are stationary, have defined
energies, i.e., DEi = DEf = 0 (Ci = Cf = 0), and so their lifetimes are infinite (the
physical attributes of states i and f never change). As a result,
Fig. 3.13 Raman scattering processes: a normal Raman scattering, x0 « xri; b preresonance
Raman scattering, x0 ! xri; c discrete resonance Raman scattering, x0 xri; d continuum
resonance Raman scattering, x0 in the range of dissociative continuum levels. The potential energy
curve was obtained with Mathematica using a Morse function to which terms in x−6, x−8, and x−10
were added
3.14 Raman Scattering 181
Nijk;a xi yi zk ear
2
ð3:304Þ
Fig. 3.14 While a 1s STO (black) has a cusp at the nucleus, the s-type Gaussian function (gray)
is continuous in both the function and its derivative. These graphs were obtained with
Mathematica
Fig. 3.15 At the left, the figure shows the radial factors of three s-type Gaussian functions and the
1s orbital of the hydrogen atom. At the right, a linear combination of the Gaussian functions is
fitted to the 1s orbital of the hydrogen atom. The inset shows the cusp of the 1s orbital at the
nucleus, contrasting with the absence of a cusp of the Gaussian function. These graphs were
obtained with Mathematica
184 3 The Schrödinger Equation
Fig. 3.16 Gaussian functions centered at four points are reduced to two Gaussian functions
centered at two points
Fig. 3.17 The product of two Gaussian functions centered at different points is a third Gaussian
function centered on an intermediate point. These graphs were obtained with Mathematica
where
ab aA þ bB
d¼ ðA BÞ2 c ¼ aþb C¼ ð3:309Þ
aþb aþb
Equation (3.308) can be easily proved by showing that the sum of the exponents
of gA and gB is equal to the exponent of gC, that is,
h i
aðx AÞ2 þ bðx BÞ2 cðx CÞ2 þ d ¼ 0 ð3:310Þ
This result is easily proved using Mathematica (see Mathematica code M11).
We now consider some of the more popular standard basis sets. We begin with a
minimal basis set, i.e., a basis set with one Slater-type orbital (STO) per each
inner-shell or valence-shell atomic orbital. The STO-3G basis set is a minimal basis
set, where each STO is represented by a linear combination of three Gaussian
functions. For the carbon atom, this basis set consists of five STOs (one 1s + one
2s + three 2p STOs) and three Gaussian functions for each STO, that is, a total of
15 Gaussian functions.
A double-zeta basis set consists of two STOs with distinct f orbital exponents
for each atomic orbital, irrespective of this being an inner-shell or a valence-shell
orbital. In turn, a split-valence basis set consists of two or more STOs per
valence-shell atomic orbital, and only one STO per inner-shell atomic orbital. The
3-21G basis set is a valence double-zeta basis set consisting of three primitive
Gaussian functions per inner-shell atomic orbital and a split-valence shell with one
contracted Gaussian function as a linear combination of two primitive Gaussian
functions and an additional single primitive Gaussian function. Another example of
a valence double-zeta basis set is given by the 6-31G basis set. The basis set
6-311G is an example of a valence triple-zeta basis set.
Atomic orbitals become distorted and have their centers shifted upon molecule
formation. They are said to be polarized. Polarization is accounted for by adding
STOs with ‘ quantum number greater than the maximum ‘ of the valence shell of
the ground state atom. In the CH4 molecule, the 6-31G** basis set adds a set of five
186 3 The Schrödinger Equation
d-type Cartesian Gaussian functions to the carbon atom (the first star) and a set of
three p-type Cartesian Gaussian functions to each hydrogen atom (the second star).
In turn, the 6-31++G** basis set includes a set of four Gaussian functions, one s-
type and three p-type functions, with very low exponents in the carbon atom (the
first plus sign) and one s-type Gaussian function with very low exponent in each
hydrogen atom (the second plus sign). These functions fall off very slowly as
r increases and are called diffuse functions (E46).
Considering an sp valence shell, the inclusion of orbitals of higher angular
momentum in the basis set should take into account the effect of each additional
orbital in the lowering of the total electronic energy through its contribution to the
correlation energy. For example, the lowering of energy associated with the addi-
tion of a set of d orbitals is approximately of the order of magnitude of one f orbital.
In turn, the addition of a second set of f orbitals is equivalent to the lowering of
energy caused by one g orbital. These facts were considered by Dunning, who
proposed the so-called correlation-consistent (cc) basis sets (Dunning 1989). For
the carbon atom, the cc-pVTZ (correlation-consistent polarized valence triple-zeta)
basis set consists of 4s, 3p, 2d, 1f sets of contracted Gaussian functions
(4 1 + 3 3 + 2 5 + 1 7 = 30 functions). For the hydrogen atom, we
have 3s, 2p, 1d sets of contracted Gaussian functions (3 1 + 2 3 + 1 5 =
14 functions). These basis sets may be augmented with diffuse functions by adding
the prefix aug- (for example, aug-cc-pVTZ).
Computing a thermodynamic energy, such as the atomization energy or the
enthalpy of formation, with chemical accuracy means attaining an accuracy of
1 kcal/mol. However, the largest errors in ab initio thermodynamic calculations
result from basis set truncation. Therefore, to achieve chemical accuracy, the total
energy should be computed from the results of a series of calculations that
extrapolate to the complete basis set limit.
Notes
2 d2 w
h
þ VðxÞwðxÞ ¼ EwðxÞ 0\x\‘: VðxÞ ¼ 0 0
x
‘: VðxÞ ¼ 1
2m dx2
ð3:312Þ
Outside the box, that is, for V(x) = ∞, it is easy to conclude that the wave
function is zero, i.e., the probability of finding the particle outside the box is zero,
since substitution of V(x) = ∞ in the Schrödinger equation leads to
Notes 187
d2 w 1 d2 w
2
¼ 1wðxÞ ) ¼ 1 ) wðxÞ ¼ 0 ð3:313Þ
dx wðxÞ dx2
d2 w
þ k2 wðxÞ ¼ 0 ð3:314Þ
dx2
where
ð2mEÞ1=2
k¼ ð3:315Þ
h
The solution for a differential equation like (3.314) is a linear combination of
sine and cosine functions,
The second boundary condition allows one to determine the energy, which
becomes dependent on the quantum number n. In fact, at x = l, w(l) = 0, B 6¼ 0,
and
" #
ð2mEÞ1=2 ð2mEÞ1=2
sin ‘ ¼0) ‘ ¼ np ð3:318Þ
h h
n2 h2
E¼ n ¼ 1; 2; 3. . . ð3:319Þ
8ml2
Using (3.315) and (3.318), we obtain
np
k¼ ð3:320Þ
‘
B is the normalization constant of the wave functions, which can be easily deter-
mined using Mathematica (see Answer to E8) and gives
188 3 The Schrödinger Equation
1=2
2
B¼ : ð3:321Þ
‘
Therefore,
1=2
2 npx
wn ðxÞ ¼ sin ð3:322Þ
‘ ‘
Consider a two-particle system with the particles held at a fixed distance d from
each other. The energy of the rotor is entirely kinetic energy (rotational energy).
Since the potential energy is zero, the Hamiltonian is
^ ¼ h r2
2
H ð3:323Þ
2l
where
1 1 1
¼ þ ð3:324Þ
l m1 m2
@2 2@ 1 ^2
r2 ¼ þ L ð3:325Þ
@r 2 r @r r 2 h2
[see (3.100)]. Since the rotor is rigid, r is constant (r = d), and therefore we can
write
1 ^2
r2 ¼ L ð3:326Þ
d 2 h2
Substitution of (3.326) in (3.323) leads to
^ ¼ 1 ^2
H L ð3:327Þ
2ld2
^ 1 ^2 m
Hwðh; /Þ ¼ Ewðh; /Þ ) L YJ ðh; /Þ ¼ EJ YJm ðh; /Þ ð3:328Þ
2ld2
Notes 189
1 JðJ þ 1Þh2
JðJ þ 1Þ
h2 YJm ðh; /Þ ¼ EJ YJm ðh; /Þ ) EJ ¼ J ¼ 0; 1; 2; . . .
2ld2 2ld2
ð3:329Þ
Note that the energy levels depend on J only, but the eigenfunctions depend on
J and m [see (3.328) and (3.329)], where mℏ is the z-component of the rigid rotor
angular momentum. This means that each specified J level has a (2J + 1)-degen-
eracy, since m takes values from −J to J corresponding to different orientations of
the rigid rotor angular momentum projection on a preferential direction that is
quantized.
The moment of inertia I of a system of n particles about a specified axis is
defined as
X
n
I mi ri2 ð3:330Þ
i¼1
where mi is the mass of the ith particle and ri is the distance from the particle to the
axis along the perpendicular to the axis. For a two-particle rigid rotor, we have
where
d ¼ r1 þ r2 ð3:332Þ
m1 r1 ¼ m2 r2 ð3:333Þ
From Eqs. (3.324), (3.331), (3.332), and (3.333), it can be concluded that
I ¼ ld 2 ð3:334Þ
Mathematica Codes
This Mathematica code considers the one-dimensional wave equation (3.18) and
uses the Mathematica function DSolve for solving the differential equation for the
function u with independent variables t and x. The symbols @x;x u[t,x] and @t;t u
[t,x] stand for second-order partial derivatives of u[t,x] with respect to x and
t. In general, C[1] and C[2] are default parameters generated by Mathematica,
but in this case. they represent arbitrary functions. In the second line of code, the
first level of sol (a list of one sublist; see the first line of results) is replaced in u
[t,x], thus giving the general solution of the differential equation presented in a
more familiar way (see second line of results).
Suggestion: Insert the option GeneratedParameters!f in the first line of
code. What is the effect of this change?
This Mathematica code solves the so-called Helmholtz equation [see (3.20)].
The notation @x;x D[f[x],{x,2}] represents the second derivative of f[x] with
respect to x; C[1] and C[2] are default generated parameters, which in this case
represent arbitrary coefficients of a linear combination of Cos and Sin functions
(see second line of results).
Consider the one-dimensional Schrödinger equation for the harmonic oscillator and
its eigenfunctions [see (3.62)], which are the product of a Gaussian function and
polynomial factors known as Hermite polynomials. The built-in Mathematica
function HermiteH[n,z] generates Hermite polynomials for nonnegative values
of n. The following line of code gives the first six Hermite polynomials.
Mathematica Codes 191
wn ¼ Nn /n
Z1 Z1
wn dz ¼ Nn
2 2
/2n dz ¼ 1
1 1
1
Nn ¼ sffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
R1 2
/n dz
1
1
pffiffiffiffiffiffiffiffiffi
p1=4 2n n!
The following Mathematica code plots the first six probability densities of the
one-dimensional Schrödinger equation for the harmonic oscillator. The first line of
code defines the function f[n,z], and the symbol := (SetDelayed) considers the
right-hand side of the expression to be the delayed value of the left-hand side. The
right-hand side is evaluated each time the left-hand side appears. The result of this
code is presented as a Table of graphs.
192 3 The Schrödinger Equation
Suggestion: Plot the first six Hermite polynomials in the range {z,-5,5}.
The following code confirms that the wave functions (3.67) are normalized and
mutually orthogonal:
Mathematica Codes 193
The following Mathematica code generates 3D plots in which the absolute value
of the spherical harmonic is plotted as the distance from the origin to the point, for
the angles h and /. The option Lighting!“Neutral” specifies that the sim-
ulated lighting is a source of white light.
M5. Determinants
The following Mathematica code begins by obtaining normalization factors for the
1s function of the hydrogen atom and three s-type Gaussian functions. Being s func-
tions, they depend only on the r coordinate, not on the angular spherical coordinates h
and /, and consequently, integration is over R2(r)r2dr [see (3.105)]. After normal-
ization, the code confirms the normalizing constants. In order to find numerical values
for the Gaussians coefficients that give the best fit to data, the code prepares a list of
lists called data with regularly spaced points from the function f[r] and defines the
variable model. FindFit finds numerical values for the coefficients c1, c2, and
c3 that lead to the best fit of model to data. In the line of code
c1 ¼ c1=:fit½½1
198 3 The Schrödinger Equation
fit[[1]] stands for the first part of fit, which is c1!0.122444, c1/.fit[[1]]
replaces c1 by 0.122444, and c1=c1/.fit[[1]] assigns 0.122444 to c1.
Similar lines of code are repeated for c2 and c3. The resulting values of c1, c2,
and c3 are presented as output. The smallest coefficient (c1) corresponds to the
Gaussian function with the highest exponent, that is, the function that does more
effective fitting near the nucleus:
Mathematica Codes 199
The following line of Mathematica code shows that the product of Gaussian
functions located at two different positions with different exponents gA and gB is a
third Gaussian function gC located at an intermediate point [see (3.308)], by
demonstrating that the sum of the exponents of gA and gB is equal to the exponent of
gC [see (3.311)]. This code illustrates the use of FullSimplify, which simplifies
its argument (the expression to which it applies), and of TrueQ, which yields the
result True or False for the expression to which it applies:
Glossary
Exercises
1 1
E1. Find the eigenvalues and eigenvectors of the matrix A ¼ .
4 2
E2. Use
0 Mathematica 1 to find the eigenvalues and eigenvectors of the matrix
1 0 1
A ¼ @ 1 2 1 A.
2 2 3
E3. Show that ∂/∂x is a linear operator.
E4. Calculate the commutator between ∂/∂x and x.
E5. Relate the kinetic energy operator of the Hamiltonian to the linear
momentum operator.
E6. Let u be any twice differentiable function. Show that u(x − vt) is a solution
of the wave equation and a forward propagating wave with speed v.
E7. Confirm that Acos(kx − xt) is a solution of the wave equation.
E8. Consider a particle in a one-dimensional box. Use Mathematica to deter-
mine the constant B for the wave functions wn(x) = Bsin(npx/‘).
E9. Use Mathematica to plot the wave functions for the particle in a
one-dimensional box for n = 1, 2, 3.
E10. Use Mathematica to solve the equation y″(x) + y′(x) − 6y(x) = 0.
E11. Beginning with (3.27) for the definition of a Hermitian operator, arrive at
(3.28). [Hint: Consider w as the linear combination f + cg, where c is an arbi-
trary complex number.]
E12. Derive (3.50) for a diatomic molecule AB.
E13. Write the equation to obtain the reduced mass of a system of four masses,
mA, mB, mC, and mD.
E14. Show that (3.54) is a solution of the differential Eq. (3.53) of the classical
harmonic oscillator.
E15. Use Mathematica to list the first six Hermite polynomials.
E16. Use Mathematica to find the normalization constants of the first six har-
monic oscillator eigenfunctions assuming that a is equal to 1 and confirm that
the normalization constants satisfy the formula [p1/4√(2nn!)]−1.
E17. Confirm the recurrence equality (3.64) for H4(z), H3(z), and H2(z).
E18. Confirm (3.70).
E19. Derive (3.83).
E20. Derive the first of the equalities (3.86).
E21. Derive the first of the equalities (3.88).
E22. Use Mathematica to obtain polar plots of |sin(nh)|, where n takes integer
values from 1 to 4 and h ranges from 0 to p.
E23. Use Mathematica to write an expression for Y20 (h, 0) and make a polar plot
of this function.
E24. Derive (3.101).
204 3 The Schrödinger Equation
References
Becke AD (1993) Density-functional thermochemistry. III. The role of exact exchange. J Chem
Phys 98:5648–5652
Coulson CA, Neilson AH (1961) Proc. Phys. Soc. London 78:831–837
Dunning TH Jr (1989) Gaussian basis sets for use in correlated molecular calculations. I. The
atoms boron through neon and hydrogen. J Chem Phys 90:1007–1023
Hohenberg P, Kohn W (1964) Inhomogeneous electron gas. Phys Rev 136:B864–B871
Kohn W, Sham LJ (1965) Self-consistent equations including exchange and correlation effects.
Phys Rev 140:A1133–A1138
Lee C, Yang W, Parr RG (1988) Development of the Colle-Salveti correlation-energy formula into
a functional of the electron density. Phys Rev B 37:785–789
Miehlich B, Savin A, Stoll H, Preuss H (1989) Results obtained with the correlation-energy
density functionals of Becke, and Lee, Yang and Parr. Chem Phys Lett 157:200–206
Moller C, Plesset MS (1934) Note on an approximation treatment for many-electron systems. Phys
Rev 46:618–622
Further Reading
Abstract
Symmetry manifests itself in the shapes of things, from macroscopic objects to
molecules and atoms. Within the Born–Oppenheimer approximation, the nuclear
framework of a molecule at equilibrium defines its symmetry, which reflects on
molecular orbitals and molecular vibrations. This chapter begins by defining
symmetry operations and presenting the symmetry elements of several
molecules. We show that the set of symmetry operations of a molecule form a
group and present a dichotomy diagram for finding the point group in a
systematic way. We then represent reflection in a plane, rotation about an axis,
and inversion in a point by 3 3 matrices in the (x, y, z)-basis, using these to
illustrate the concepts of conjugate elements and class. The implications of
symmetry on transition dipole moments and transition polarizabilities lead to
selection rules, which are explained and illustrated. The last section of this
chapter consists of the analysis of symmetry properties of molecular vibrations
in order to identify point groups. At the end of this chapter, the student can find
several Mathematica codes (Classes of Symmetry Operations, Multiplication
Table for the C2v Group, Selection Rule for a Particle in a One-Dimensional
Box, Selection Rule for the Two-Particle Rigid Rotor, Selection Rule for the
Harmonic Oscillator) with references to the main text and detailed explanations
of new commands, a glossary of important scientific terms, and a list of
exercises, whose complete answers can be found after the Appendix.
initial and final configurations are indistinguishable. In this chapter, the objects
acted on by symmetry operations are molecules with their nuclei at equilibrium
positions. Each symmetry operation is carried out by a symmetry element. For a
finite object such as a molecule, a symmetry element can be a plane, an axis, a
point, or a combination of an axis and a plane. Symmetry operations and the
corresponding symmetry elements are represented by the same letters, with sym-
metry operations shown in italics.
The symmetry of the H2O molecule results from the equivalence of its OH
bonds. There are three operations that express this symmetry (Fig. 4.1). The first is
the anticlockwise rotation by 2p/2 radians (180º) around the C2 axis of rotation
(this axis coincides with the z axis; anticlockwise is the direction of an angle in the
plane). The second is the reflection through the xz plane of reflection, rxz (r =
sigma is the Greek letter for s; Spiegel is the German word for mirror). The third
symmetry operation is the reflection in the yz plane of reflection ryz.
Consider now the benzene molecule at equilibrium, with two regular concentric
and parallel hexagons, one formed by the carbon nuclei, the other formed by the
hydrogen nuclei (Fig. 4.2). The most evident symmetry element of the benzene
molecule is the sixfold rotation axis C6, perpendicular to the reflection plane rh
defined by the molecule’s nuclei. The axis C6 coincides with the rotation axes C3
and C2. As the highest-order axis, C6 is the principal axis of the molecule.
Reflection planes that contain C6 are of two types, namely vertical reflection planes
rv, which contain diametrically opposed CH bonds, and diagonal reflection planes
rd, which bisect the angles formed by rv planes. If a Cartesian coordinate system is
set up with its origin at the center of the regular hexagon formed by the carbon
nuclei, the exchange of Cartesian coordinates (x, y, z) by (−x, −y, −z) for any point
in space leads to an indistinguishable molecular configuration (Fig. 4.2). This
symmetry operation is called inversion and is represented by i, with the corre-
sponding symmetry element being a point at the origin of the Cartesian reference
system, called the center of symmetry i. Note that every inversion symmetry
operation can be seen as resulting from a 2p/2 rotation around one C2 axis, followed
by a reflection in rh. Hence, the center of symmetry i is said to be equivalent to a
twofold rotation–reflection axis S2. A symmetry operation around C6 followed by
C2
σxz
z
O O O
σyz
z y
z x
Fig. 4.1 Nontrivial symmetry elements of the H2O molecule
4.1 Symmetry Operations 209
1 3 1 1
C3 C2
B B B B
2 3 1 2 2 3 3 2
Fig. 4.3 C3 and C2 rotations in BF3. Fluorine atoms are numbered in order to show the effect of
the symmetry operation
Fig. 4.4 The fourfold rotation–reflection symmetry operation on the CH4 molecule is a 2p/4
rotation (90º) followed by reflection in the perpendicular plane that contains the carbon nucleus
F F F F
H H Br Br
C C C C
H H H H
Cl Cl Cl Cl
If A and B represent two symmetry operations for the same molecule, the sequential
application of these symmetry operations, A followed by B, defines the product of
symmetry operations, denoted by BA, with A being applied first. Considering the
definition of symmetry operation, we conclude that the product of two symmetry
4.2 Point Groups 211
operations is also a symmetry operation for the same molecule (the product is said
to be an internal mathematical operation). In addition, the action corresponding to
a particular symmetry operation can always be reversed. This means that every
symmetry operation of a particular molecule has an inverse. Note that the product
of a symmetry operation by its inverse is equivalent to “doing nothing,” that is, to
the identity E (from the German word Einheit for unity).
The set of symmetry operations for a particular molecule obeys the following
requirements:
(1) the product of any two elements, including the product of an element and itself,
is a member of the same set of symmetry operations (this is called the closure
requirement);
(2) each symmetry operation has an inverse;
(3) there is an identity operation, which we represent by E;
(4) the product is associative, that is, (AB)C = A(BC) (this property results from
the concept of symmetry operation).
Having these properties, one says that the set of symmetry operations of a
particular molecule belongs to a point group (all symmetry elements have a point
in common which is kept unaltered by any symmetry operation of the group). The
number of elements of a group is called the order of the group. When all products
of symmetry operations are commutative, BA = AB, the group is said to be com-
mutative or abelian. Group theory is a branch of mathematics.
We now consider a systematic way of finding the point group of a molecule. We
begin by identifying groups of molecules with high symmetry, including linear
molecules. These are generally called special groups (Fig. 4.6). The internuclear
axis of linear molecules is a principal axis of infinite order C∞, since no matter
how small the rotation around the principal axis is, a new indistinguishable
molecular configuration is always obtained. There are two groups for linear
molecules, depending on whether they have a center of symmetry (Fig. 4.7). Linear
molecules with no center of symmetry like HCl, HCN, and HCCCl (Fig. 4.7),
belong to the C∞v group, whereas linear molecules with a center of symmetry, like
CO2, HCCH (Fig. 4.7), and H2, belong to the D∞h group. Molecules with the
shape of regular polyhedrons exhibit high symmetry and belong to the groups we
classify as special groups. Regular tetrahedral molecules like CH4, CCl4 (Fig. 4.7),
and P4 belong to the Td group (tetrahedral group). A regular octahedral molecule
like SF6 (Fig. 4.7) belongs to the Oh group (octahedral group).
If a molecule whose group of symmetry is to be identified does not belong to any
of these groups of high symmetry, one should look for the presence of a Cn
symmetry axis. A molecule that does not have a Cn axis of symmetry may belong to
one of the three following groups: Cs, if the molecule, like HOCl (Fig. 4.8), has
only one plane of symmetry; Ci, if the molecule, like CHClF-CHClF in the trans
conformation (Fig. 4.8), has only one center of symmetry; C1, if the molecule, like
CHBrClF (Fig. 4.8), does not have any element of symmetry.
212 4 Molecular Symmetry
If the molecule does not belong to any special group or to any of the three last
groups, it should have at least one Cn symmetry axis (possibly n = 2). Then, one
should ask whether an S2n symmetry element (n 2) collinear with the Cn axis of
symmetry is present. If the answer to this question is no, then one should ask
whether there are nC2 axes of symmetry perpendicular to the Cn axis of symmetry.
Answering no to this question leads us to three groups of symmetry, namely, Cn
(HOOH as an example of the C2 group; Fig. 4.9), Cnh (trans-FHC = CHF as an
4.2 Point Groups 213
Fig. 4.9 Molecules with a principal axis of symmetry Cn but no C2 perpendicular axes
example of the C2h group; Fig. 4.9); and Cnv (NH3 as an example of the C3v group;
Fig. 4.9).
Molecules with nC2 axes of symmetry perpendicular to the Cn axis of symmetry
belong to groups Dn [tris(ethylenediamine)cobalt (III) cation, [Co(en)3]3+ with
en = H2NCH2CH2NH2, as an example of the D3 group; Fig. 4.10], Dnh (ethylene as
an example of the D2h group; Fig. 4.10) and Dnd (allene as an example of the D2d
group; Fig. 4.10).
Fig. 4.10 Molecules with nC2 axes of symmetry perpendicular to the principal axis of symmetry
214 4 Molecular Symmetry
A transformation that converts the vector (x, y, z)T to (x′, y′, z′)T (the superscript
T stands for “transpose,” thus giving the corresponding column vector) is mathe-
matically represented by a matrix whose elements are the coefficients of the linear
combinations of the original coordinates. Such a transformation is linear and applies
within the same vector space. Let us find the representation matrices for symmetry
operations, namely the identity, reflection, inversion, rotation, and rotation–reflec-
tion. Starting with the identity E and the column vector (x, y, z)T, we can write
0 1 0 1 0 10 1
x0 x 1 0 0 x
@ y0 A ¼ E @ y A ¼ @ 0 1 0 A@ y A ð4:1Þ
z0 z 0 0 1 z
since this operation “does nothing,” that is, x′ = x, y′ = y, and z′ = z. One says that
the 3 3 identity matrix represents the identity operation and the column vector
(x, y, z)T is the basis for this representation.
For the reflection through the xz plane, rxz, we write
4.3 Matrix Representations 215
0 1 0 1 0 10 1
x0 x 1 0 0 x
@ y0 A ¼ rxz @ y A ¼ @ 0 1 0 A@ y A ð4:2Þ
z0 z 0 0 1 z
and
Considering now the basis formed by the column vector (x, y, z)T, we write
0 1 0 1 0 10 1
x0 x cos h sin h 0 x
@ y0 A ¼ Cn @ y A ¼ @ sin h cos h 0 A@ y A ð4:7Þ
z0 z 0 0 1 z
θ
α
216 4 Molecular Symmetry
0 E 1 0 C2 1 0 rxz 1 0 ryz 1
1 0 0 1 0 0 1 0 0 1 0 0
@0 ð4:9Þ
1 0A @ 0 1 0A @0 1 0A @ 0 1 0A
0 0 1 0 0 1 0 0 1 0 0 1
We now pass to the C3v point group and consider the group of nontrivial symmetry
operations C3, C23 (= C3C3), rv1, rv2, and rv3. Assuming that C3 coincides with the
z-axis and rv1 uses yz as the plane of reflection (Fig. 4.13), then C3 and C23
are represented by the following 3 3 matrices [h(C3) = 2p/3 radians and
h (C23) = 4p/3 radians, see (4.7)],
0 C3 1 0 C32 1
pffiffi pffiffi
1 23 0 p12ffiffi 23 0
B pffiffi32 C B C ð4:10Þ
@ 12 0A @ 3 1 0A
2 2 2
0 0 1 0 0 1
We now consider a new basis (x′ y′ z′)T obtained from (x y z)T by a counter-
clockwise rotation of 2p/3 radians around the z-axis that is coincident with both z′
and C3 (Fig. 4.13):
4.3 Matrix Representations 217
0 1 0 1
x0 x
@ y0 A ¼ C3 @ y A ð4:12Þ
z0 z
Inspection of Fig. 4.13 shows that the change of basis (4.12) converts rv1 into rv2.
Therefore, rv2 is obtained from rv1 by a similarity transformation B = X−1A X,
where X is C3 [rv1 and rv2 are represented by similar matrices; see (2.89):
Following the same reasoning, rv3 is obtained from rv1 by another similarity
transformation in which X is C23 (the change of basis is due to C23):
1
rv3 ¼ C32 rv1 C32 ð4:14Þ
1 T
Note that C31 ¼ C3T and C32 ¼ C32 , since both (x y z)T and (x′ y′ z′)T are
orthogonal systems of axes [see (4.7)]. Substitution of (4.10) and (4.11) in (4.13)
and (4.14) leads to
Hence, the matrices that represent the symmetry operations of the C3v group in the
basis (x y z)T are [see (4.10), (4.11), and (4.15)]
218 4 Molecular Symmetry
ð4:16Þ
The symmetry operations of the C3v point group (E, C3, C23, rv1, rv2, rv3) can be
grouped in three sets, {E}, {C3, C23}, and {rv1, rv2, rv3}, formed by symmetry
operations having the same characters and related by similarity transformations.
Each of these sets of symmetry operations is called a class, and the symmetry
operations within one specified class are said to be mutually conjugate (E7).
Character tables group symmetry operations in classes. Using matrix representa-
tions of the symmetry operations for the C3v point group in the basis (x y z)T [see
(4.10), (4.11), (4.15)], the first Mathematica code of M1 confirms that {C3, C23},
and {rv1, rv2, rv3} are two classes of the C3v point group, and the second code of
M1 enables one to observe rTv1.C3(h).rv1 “behind the scenes.”
The product of two symmetry operations of a point group is a member of the same
point group. The multiplication table of a point group presents the results of the
products of symmetry operations BA, where A and B are symmetry operations of the
group and A is applied first (A and B are presented as columns and rows of the
multiplication table). Using matrix representations of the symmetry operations of
the C2v point group in the basis (x y z)T [see (4.9)], the Mathematica code M2 builds
the multiplication table of the C2v point group. Inspection of this multiplication
table shows that each symmetry operation of the C2v group coincides with its own
inverse, since the diagonal is occupied by the identity. In addition, the upper and
lower matrix triangles of the multiplication table are equal (the product matrix is
symmetric), meaning that the order of multiplication in a product is irrelevant; that
is, symmetry operations commute, and C2v is an abelian point group. Among the
nontrivial symmetry operations C2, rxz, and ryz, the product of two of them is equal
4.4 Character Tables 219
E C2 rxz ryz
E E C2 rxz ryz
C2 C2 E ryz rxz
rxz rxz ryz E C2
ryz ryz rxz C2 E
Consider now a molecular orbital for H2O. The multiplication table for the C2v
symmetry group can be used to find how a particular molecular orbital transforms
when a symmetry operation is applied to it. Applying the identity operator to a
function does not modify it in any conceivable way, the result being the same as if
the function had been multiplied by 1, E /i = /i. Consider now the diagonal of the
C2v multiplication table: the identity operation can be replaced by the square of any
other symmetry operation, that is,
Therefore, the eigenvalues for the nontrivial symmetry operations of H2O (C2, rxz,
ryz) should be either +1 or −1. Since the product of two of the nontrivial symmetry
operations (C2, rxz, ryz) is equal to the one left out [this can be confirmed by
inspection of the elements (2,3) (second row, third column), (2,4), (3,2), (3,4), (4,2),
and (4,3) of the multiplication table], either all of the corresponding eigenvalues of
the C2v point group have to be equal to +1, or two of them are equal to –1. Each set
of these eigenvalues consistent with the multiplication table of the group is called
an irreducible representation of the C2v group (the word “irreducible” means that
these representations cannot be further decomposed into simpler ones). The table
that contains the characters of the irreducible representations of the group is called
the character table.
The irreducible representations with 1 in the axis of highest order (the so-called
principal axis; for the C2v group, the C2 axis) are identified by A. In turn, those
representations with −1 in the principal axis are identified by B. The subscript 1 is
used when the eigenvalue of the symmetry operation rxz is equal to +1. The
representation with 1 as eigenvalues in all symmetry operations of the group reflects
the maximum symmetry within the group and is called a totally symmetric species.
220 4 Molecular Symmetry
E C2 rxz ryz
A1 1 1 1 1 z x2, y2, z2
A2 1 1 −1 −1 Rz xy
B1 1 −1 1 −1 x, Ry xz
B2 1 −1 −1 1 y, Rx yz
(http://symmetry.jacobs-university.de)
The character table also presents the functions x, y, z, x2, y2, z2, xy, xz, yz, and the
rotations Rx, Ry, and Rz around the x-, y-, and z-axes. The z-coordinate transforms
like the totally symmetric species (A1), since the z-axis coincides with C2
(Fig. 4.14). Inspection of Fig. 4.14 for the orientation of the x- and y-axes shows
that x transforms like B1 and y transforms like B2. The x, y and z functions represent
translations along these axes and so transform like the dipole moment components.
The function xy transforms like A2 (= B1 B2), xz and yz transform like B1
(= B1 A1) and B2 (= B2 A1), respectively, and x2 (B1 B1 = A1),
y2 (B2 B2 = A1), and z2 (A1 A1 = A1) are totally symmetric. Under symmetry
operations, the polarizability matrix elements transform according to the corre-
sponding subscripts (axx transforms as x2, axy transforms as xy, …).
The rotations around the x-, y-, and z-axes are carried out by the corresponding
torques (Fig. 4.15). In particular, by applying the symmetry operations of C2v to the
torque for Rx, we conclude that Rx transforms like the symmetry species B2 (see
Fig. 4.15 and the character table). Similar reasoning applied to the torques for Ry
and Rz leads to their assignments to B1 and A2, respectively.
Inspection of the C2v character table reveals that the irreducible representations
behave like “orthogonal vectors.” In fact, the dot product of two irreducible rep-
resentations is given by
C2
σxz
z
O O O
σ yz
z
z x y
Rz
Ry
y
Rx
where
CT Ci
ai ¼ ð4:23Þ
h
since substitution of (4.22) into (4.23) using (4.20) gives an identity. Applying
(4.23) to C in (4.21) leads to
C ¼ A1 þ B1 þ B2 ð4:24Þ
222 4 Molecular Symmetry
a result that is not at all surprising, since A1, B1, and B2 are the irreducible
representations for the functions x, y, and z that form the basis for C (see the
character table of the C2v point group).
We now consider the character table for the C3v group (http://symmetry.jacobs-
university.de) and note that the irreducible representation of the point group labeled
E is two-dimensional, since it is shown by the character of the identity operation
E and the basis functions grouped in pairs, on the right-hand side of the irreducible
representation E (E8).
The characters of the symmetry operations of the C3v point group in the (x y z)T
basis [see (4.16)] are given by
C ¼ A1 þ E ð4:26Þ
a result that can be easily confirmed by inspection of the character table for the C3v
point group.
Character table of the C3v point group
E 2C3 3rv
A1 1 1 1 z x2 + y2, z2
A2 1 1 −1 Rz
E 2 −1 0 (x, y) (Rx, Ry) (x2–y2, xy) (xz, yz)
(http://symmetry.jacobs-university.de)
A function f(x) is said to be even if f(x) = f(−x). Functions like x2, cos x, exp(−x2),
|x| are even functions. In turn, g(x) is said to be an odd function if g(x) = −g(−x).
Functions like x, x3, sin x are odd functions. Let us now consider the integral of a
function h(x) defined for −a x a. It is easy to show that h(x) can be
decomposed into the sum of the even function f(x) = [ h(x) + h(−x)]/2 and the odd
function g(x) = [ h(x) − h(−x)]/2. If the odd function is integrable, then its integral
over the symmetric interval [−a, a] is identically zero. Thus, we can conclude that
only the even function contributes to the integral of f(x) over the symmetric interval
[−a, a], with this integral being precisely the same as twice the integral over the
interval [0, a]. Generalization of this result enables one to conclude that the integral
of f(s) over the entire configuration space is significant when f(s) is a basis for a
4.5 Selection Rules 223
is the basis for a representation that coincides with or contains the irreducible
representation for the function x.
When
D E
^jw0i ¼ 0
w0f jl ð4:30Þ
0D ED E D E
1
0 0
1 X @ wf jl
0
^x jw0r w0r l
^y wi ^y wr w0r jl
w0f l ^x jw0i
axy ¼ þ A ð4:32Þ
fi h r6¼i;f xri x0 iCr xrf þ x0 þ iCr
This expression suggests that the Raman selection rule depends on the product of
transition dipole moments in the numerators of (4.32),
D E
w0f j xjw0r w0r j yjw0i ð4:33Þ
and involve a summation over all virtual states r. However, since no direct infor-
mation can be obtained about the virtual states r, except perhaps in discrete reso-
nance Raman, where a limited number of them dominate the sum over r, the general
condition for
D 0 0E
axy fi ¼ wf axy wi ð4:34Þ
to be nonzero is that
D E
w0f jxyjw0i ð4:35Þ
We now determine the symmetry species of the vibrations of a molecule and find
the numbers of distinct fundamental frequencies in infrared and Raman spectra. In
small molecules, these numbers may eventually be used to distinguish isomers or
identify geometries corresponding to a known molecular stoichiometry.
We begin by considering the cis-1,2-dichloroethene molecule (Fig. 4.16). This
molecule belongs to the C2v point group, whose character table is
Character table of the C2v point group
E C2 rxz ryz
A1 1 1 1 1 z x2, y2, z2
A2 1 1 −1 −1 Rz xy
B1 1 −1 1 −1 x, Ry xz
B2 1 −1 −1 1 y, Rx yz
(http://symmetry.jacobs-university.de)
4.6 Molecular Vibrations 225
y
Cl x Cl
C C
H H
Fig. 4.16 cis-1,2-Dichloroethene (C2v group) with system of Cartesian axes. The latter should be
centered on each nucleus of the molecule, thus yielding a basis of 18 coordinates
The set of unit vectors directed along the Cartesian axes and centered on each
nucleus of the molecule forms a basis of 18 vectors (= 3 N, where N = 6 is the
number of atoms in the molecule) that can describe all molecular motions, namely
translations, rotations, and vibrations. Therefore, these vectors form the basis for a
reducible representation of the C2v group. The diagonal elements of the matrices
that represent the symmetry operations can be easily obtained without the need for
determining the whole representation matrices. Simple and intuitive rules can be
applied. In particular, when a vector moves its origin under the action of a sym-
metry operation, one should count zero for the diagonal element of the corre-
sponding transformation matrix. In turn, if a vector remains unaltered under the
action of a symmetry operation, one should count 1. When a vector rotates by h
radians, the trace of the corresponding matrix is given by 2cosh + 1 [see (4.7)].
Thus, applying the symmetry operations to the set of the basis vectors (see
Fig. 4.16) gives the following reducible representation:
Inspection of the character table of the C2v group shows that the irreducible rep-
resentations of the group that correspond to rotations Rx, Ry, and Rz are B2, B1, and
A2, and those that correspond to translations x, y, and z are B1, B2, and A1. Thus, we
can write
226 4 Molecular Symmetry
Crot ¼ A2 þ B1 þ B2
ð4:38Þ
Ctrans ¼ A1 þ B1 þ B2
Subtracting (4.38) from (4.37) leads to the reducible representation for the
molecular vibrations
Irreducible representations that transform like x, y, and z (A1, B1, B2) correspond to
infrared active vibrations, and those that transform like x2, y2, z2, xy, xz, and yz (A1,
A2, B1, B2) correspond to Raman active vibrations. Hence,
E C2 i rh
Ag 1 1 1 1 Rz x2, y2, z2, xy
Bg 1 −1 1 −1 Rx, Ry xz, yz
Au 1 1 −1 −1 z
Bu 1 −1 −1 1 x, y
(http://symmetry.jacobs-university.de)
Cl y
x H
C C
H
Cl
Fig. 4.17 trans-1,2-Dichloroethene (C2h group) with system of Cartesian axes. The latter should
be centered on each nucleus of the molecule thus yielding a basis of 18 coordinates
4.6 Molecular Vibrations 227
C2h E C2 i rh
ð4:41Þ
C 18 0 0 6
Using (4.22) and (4.23), we decompose C into the irreducible representations of the
group, obtaining
Crot ¼ Ag þ 2Bg
ð4:43Þ
Ctrans ¼ Au þ 2Bu
We now subtract (4.43) from (4.42) to obtain the reducible representation for the
molecular vibrations
From the character table, we can conclude that the symmetry species of the infrared
active and Raman active vibrations are
The symmetry species for the infrared active vibrations are of the u-type, whereas
the symmetry species for the Raman active vibrations are of the g-type (E9).
Inspection of the character tables leads one to conclude that those groups that
include inversion as a symmetry operation have mutual exclusion of the infrared
and Raman active vibrations, i.e., the infrared active vibrations are Raman inactive;
and conversely, the Raman active vibrations are infrared inactive. Comparison of
(4.40) with (4.45) leads one to conclude that the cis and trans isomers can be
distinguished by the number of infrared active and Raman active fundamental
vibrations.
Consider now a molecule of general formula XY3, with the X atom bonded to
each of the Y atoms. For this molecular connectivity, the molecule can be pyra-
midal (C3v group) or planar (D3h group). As we will find, this question can be
answered by symmetry analysis of vibrations, in particular, by the number of
distinct fundamental frequencies.
228 4 Molecular Symmetry
N
H
H H
A pyramidal XY3 molecule (Fig. 4.18) belongs to the C3v point group, whose
character table is given by
(http://symmetry.jacobs-university.de)
We now consider the set of unit vectors directed along the Cartesian axes and
centered on each nucleus of the XY3 molecule. These form a basis of 12 vectors
that correspond to the following reducible representation:
C ¼ 3A1 þ A2 þ 4E ð4:47Þ
Crot ¼ A2 þ E
ð4:48Þ
Ctrans ¼ A1 þ E
4.6 Molecular Vibrations 229
Hence we have
CIR ¼ 2A1 þ 2E
ð4:50Þ
CR ¼ 2A1 þ 2E
Thus, a pyramidal XY3 molecule (C3v group) has four distinct fundamental fre-
quencies of vibration, all infrared and Raman active. Note that each E species
corresponds to two degenerate vibrational modes and a single distinct frequency.
The planar XY3 molecule (Fig. 4.19) belongs to the D3h group, whose character
table is
(http://symmetry.jacobs-university.de)
The set of unit vectors directed along the Cartesian axes and centered on each
nucleus of the XY3 molecule form a basis of 12 vectors that correspond to the
following reducible representation:
C 3, S 3
Y
X Y
Y
Fig. 4.19 A planar XY3 molecule includes, as symmetry elements, a threefold rotation axis C3
that coincides with a threefold rotation-reflection axis S3, three twofold rotation axes C2 that are
aligned with each of the X–Y bonds, and four planes of symmetry: one horizontal plane and three
vertical planes
230 4 Molecular Symmetry
Hence,
and
In conclusion, a planar XY3 molecule (D3h group) has three distinct fundamental
frequencies of vibration, both in the infrared and Raman spectra. Thus, it is possible
to distinguish the geometries of an XY3 molecule by the number of distinct fun-
damental frequencies of vibration, since this result differs from that obtained for a
pyramidal XY3 molecule [see (4.50)].
In Raman spectroscopy, the incident radiation is a plane-polarized electromag-
netic wave. Analyzing the polarization of the scattered light may provide useful
information on the symmetry of vibrations. In order to carry on this polarization
analysis, we need to interpose a polarization analyzer in the way of the scattered
light, just before this enters the spectrometer. This polarization analyzer acts like a
filter with two perpendicular orientations that allow parallel and perpendicular
polarized scattered light go through. Then, two Raman spectra are recorded: one
records the intensity IVV (the subscripts stand for Vertical, Vertical) the other
records IVH (the subscripts stand for Vertical, Horizontal). The depolarization
ratio of a Raman band is given by
R
IVH ð~mÞd~v
qp ¼ R ð4:56Þ
IVV ð~mÞd~v
4.6 Molecular Vibrations 231
I0
molecule
IVV
IVH
Fig. 4.20 Geometry for observing the depolarization ratio in a Raman spectrum
where the integrals extend over each band of the Raman spectrum. (Fig. 4.20).
Theoretical considerations enable one to conclude that the depolarization ratio of
a Raman band lies in the range
0 qp 3=4 ð4:57Þ
X
X X Y Y
H H
C H C C C
H
H H
H X X
X Z
C3v C2v Cs C1
Fig. 4.21 Tetrahedral molecules belonging to different point groups; X, Y, and Z represent
different halogen atoms
Mathematica Codes
In the following code, the symmetry operations of the C3v group are represented by
3 3 matrices in the orthogonal basis (x y z)T, with the matrices for rv2 and rv3
being obtained by similarity transformations on rv1, with X equal to C3 and C23, and
the concept of class is illustrated. Since (x y z)T is an orthogonal system of axes, the
inverses of matrices are obtained simply by transposing them. The Mathematica
function Transpose transposes a matrix, and If[condition,t,f] gives t if
condition evaluates to True, and f if it evaluates to False.
Mathematica Codes 233
calculate C3(h).rv1,
cos h sin h 1 0 cos h sin h
C3 ðhÞ rv1 ¼ ¼ ð4:59Þ
sin h cos h 0 1 sin h cos h
and rTv1.C3(h).rv1
234 4 Molecular Symmetry
1 0 cos h sin h cos h sin h
ðrv1 ÞT C3 ðhÞ rv1 ¼ ¼
0 1 sin h cos h sin h cos h
ð4:60Þ
and obtain
x0 1 cos h sin h 1 cos h
0 ¼ C 3 ð hÞ ¼ ¼
y0 0 sinh cos h 0 sin h
x 1 cos h sin h 1 cos h
0 ¼ C3 ðhÞ rv1 ¼ ¼
y0 0 sinh cos h 0 sin h
x T 1 cos h sin h 1 cos h
¼ ðrv1 Þ C3 ðhÞ rv1 ¼ ¼
y0 0 sin h cos h 0 sin h
ð4:62Þ
The following code plots (x’,y’)T for C3(h), C3(h).rv1 and rTv1.C3(h).rv1 using
ParametricPlot. This Mathematica function plots the (x’, y’)-coordinates as
functions of a parameter (= external variable).
The first plot shows the arc of a circle of radius 1 (r = 1) drawn in the coun-
terclockwise direction for h from 0 to 2p/3. The second plot shows the same
rotation in the same direction (counterclockwise) after the initial point of the
rotation was reflected on rv1 (see Fig. 4.13). In turn, the third plot shows that the
effect of multiplying on the left by rTv1 causes the curve to reflect by rv1 (note that
rTv1 = rv1, since the matrix representing rv1 is diagonal), and this reflection amounts
to a change of direction from counterclockwise to clockwise. On the whole, these
Mathematica Codes 235
plots enable one to follow the rTv1.C3(h).rv1 orthogonal transformation “behind the
scenes.”
The following Mathematica code builds the multiplication table for the group C2v.
To this end, the symmetry operations are represented by matrices in the basis
(x y z)T [see (4.9)]. Note the way the command If[condition,t,f] is used,
with the field f being used to include another If. For presenting the results, the
function Array builds a 4 4 array called a and uses the command Grid to
present a two-dimensional grid with items of size 2 by 2, where the elements of the
array are included:
The following code calculates transition dipole moment integrals for a charged
particle in a one-dimensional box. The first line defines the length of the box as 1,
the second line presents the general expression for the one-dimensional box wave
functions, the third line shows the values of the quantum numbers for the con-
sidered transitions (the list associated with i is outermost, that is, i changes more
slowly than n), and the fourth line of code calculates the transition dipole moments
for these transitions. The results indicate that for nonzero transition moments, the
quantum number must change by an odd integer, ±1, ±3, …. The last line of code
plots the integrand for the 1!2 transition, so that we can understand why the
nonzero calculated transition dipole moments are negative. The option
236 4 Molecular Symmetry
Filling!Axis specifies that the area under the plotted curve should be filled to
the axis. Note that the factor x of the integrand for the transition dipole moment
weights more high values of x, thus giving rise to a negative area greater than the
positive area and so to a negative integral.
The following code calculates transition dipole moment integrals for the
two-particle rigid rotor, whose eigenfunctions are spherical harmonics (see 3.§2).
The first line shows the quantum numbers for the considered transitions, and the
second line calculates the transition dipole moments and shows that allowed
transitions have DJ = ±1 (J is the rotational quantum number). Since the spherical
harmonics are functions of the angular coordinates, x should be substituted by cosh,
since x = r cosh, and r is a constant. In addition, the integrand should include an
additional sinh factor coming from the volume element, which is given by sinh dh
d/. The last line of code plots the integrand for the 0!1 J-transition. Note that the
command for the plot is SphericalPlot3D, since the integrand is a function of
angular coordinates h and / (Plot3D plots f as a function of x and y):
Mathematica Codes 237
The following code calculates transition dipole moment integrals for the harmonic
oscillator. The first line of code represents the eigenfunctions as functions of z (= √a
x) [see (3.62) and (3.63)], the second line shows the quantum numbers for the
considered transitions, and the third line calculates the corresponding transition
dipole moments. The results indicate that nonzero transition moments correspond to
Dn = ±1, where n is the vibrational quantum number. The following lines of code
plot the eigenfunctions and the integrands of the transition dipole moment integrals
involved in one allowed transition (the 0!1 transition, where the integrand is an
even function) and one forbidden transition (the 0!2 transition, where the inte-
grand is an odd function):
238 4 Molecular Symmetry
Glossary
Exercises
E1. What is the principal axis of symmetry in each of the following molecules:
(a) CHCl3; (b) PF5; (c) SF6; (d) C5H5−. Give the point group of each of the
following molecules: (e) CH2Cl2; (f) BF3; (g) C6H6; (h) CO2; (i) CH4.
E2. Use Mathematica to show that the transpose of the matrix representation of
Cn in the basis (x, y, z)T is equal to its inverse.
E3. Use Mathematica to find the matrix representation for S4 in the basis
(x, y, z)T.
E4. Use Mathematica to show that the transpose of a product of 3 3 matrices
is equal to the product of transposes of matrix factors in reverse order,
(A.B)T = BT.AT.
E5. Use Mathematica to show that the inverse of a product of 3 3 matrices is
equal to the product of the inverses of the matrix factors in reverse order,
(A.B)−1 = B−1.A−1.
E6. Prove that square matrices B and A related by a similarity transformation
(B = X−1AX) have the same sum of their diagonal elements.
E7. Show that
E8. Use Mathematica to find the multiplication table for the C3v group. Is this an
abelian group?
E9. Considering a molecule with a center of symmetry, then the symmetry
species for infrared active vibrations are of u-type, whereas the symmetry
species for Raman active vibrations are of g-type. Explain.
E10. Find the reducible representations for vibrations of tetrahedral molecules
belonging to the point groups C3v, C2v, Cs, and C1.
Exercises 241
Further Reading
Cotton, F.A.: Chemical applications of group theory, 3rd edn. Wiley, New York (1990)
Kettle, S.F.A.: Symmetry and structure (readable group theory for chemists). Wiley, New York
(1995)
Vincent, A.: Molecular symmetry and group theory, 2nd edn. Wiley, New York (2001)
Molecular Structure
5
Abstract
The first two sections of this chapter define electron probability density and
electrostatic potential and illustrate these topics with the water molecule. The
ways of apportioning electrons of a molecule to different regions of space are
performed by population analysis methods. We illustrate the Mulliken popu-
lation analysis method for the methane molecule and the natural bond orbitals
method for methane, ethene, and ethyne molecules. Next, we present a typical
potential energy surface with one first-order saddle point obtained using a
combination of Morse functions and calculate intrinsic reaction coordinates for
the isomerization reaction HCN ! CNH and the symmetric reaction Cl− +
H3CCl ! ClCH3 + Cl−. Potential energy profiles for the rotations around the
C–C bonds of ethane and 1,2-dichloroethane are presented and discussed. In
particular, the potential energy profiles for the staggered conformation of ethane
and the synclinal and antiperiplanar conformations of 1,2-dichloroethane,
combined with vibrational calculations, enable one to estimate the amplitudes of
the corresponding torsional motions. The last section of this chapter comprises
chiral molecules, the Cahn–Ingold–Prelog rules for distinguishing R and
S enantiomers of carvone, the polarimeter, and the way optically active
molecules interact with plane-polarized electromagnetic radiation. At the end of
this chapter, the student can find several Mathematica codes (Natural Bond
Orbitals for CH4, Potential Energy Surface, Right and Left Helices, Optical
Rotation) with detailed explanations of new commands, a glossary of important
scientific terms, and a list of exercises, whose complete answers are given after
the appendix.
Consider a molecule with n electrons. The wave function for the ground state can be
written as
Wðs1 ; . . .; sn Þ ð5:1Þ
where each si represents the set of three spatial and one spin coordinates of electron
i. The probability of finding electron 1 in volume dv1 for any position of the
remaining electrons and any spin of all electrons is given by
Z Z
dpðr1 Þ ¼ ... W0 ðs1 ; . . .; sn ÞW0 ðs1 ; . . .; sn Þdv2 . . .dvn dr1 . . .drn dv1 ð5:2Þ
where r1 = (x1, y1, z1) is the position vector in some reference system of Cartesian
axes, the integral over the spatial coordinates runs from electrons 2 to n, and the
integral over the spin coordinates runs from 1 to n. The electrons are indistin-
guishable (the labeling of electrons is arbitrary), and so the probability of finding
any electron in a volume dv is n times (5.2),
where
Z Z
q0 ðrÞ ¼ n ... W0 ðs1 ; . . .; sn ÞW0 ðs1 ; . . .; sn Þdv2 . . .dvn dr1 . . .drn ð5:4Þ
q0 ¼ q0 ðx; y; zÞ ð5:5Þ
Having four variables (x, y, z, and the function value q0), this function cannot be
plotted as a whole, since the world we live in is a three-dimensional world. In order
to reduce the number of variables by one, we take the set of all points with
coordinates (x, y, z) that satisfy the equation
q0 ðx; y; zÞ ¼ c ð5:6Þ
where c is a constant called an isovalue (from Greek isos “equal”). The above
equation can be solved to obtain the z value as a function of x and y, that is,
Fig. 5.1 Total electron probability density of the H2O molecule represented by density surfaces
with isovalues 0.04000, 0.00400, and 0.00040 (Gaussian 09 B3LYP/cc-pVTZ calculation with
figure presented by GaussView)
Being a function of two variables, f(x, y) represents a surface [recall that a function
of one variable y = g(x) represents a curve], and the surface derived from (5.6) is
called an isodensity surface, because it is the set of points for which q0 is equal to a
specified isovalue.
Figure 5.1 presents three closed surfaces of the electron probability density for
H2O. Note that the surface with density isovalue 0.04000 is inside the surface with
isovalue 0.00400, which, in turn, is totally contained within the surface whose
isovalue is 0.00040. This relationship between surfaces with different isovalues
resembles that of a set of Russian dolls: a smaller doll fits inside a larger one. The
probability of finding any of the 10 H2O electrons within the 0.00040 isodensity
surface should be close to 100 %, and so this isodensity surface can be used to
represent the overall shape of the molecule.
where r12 is the distance between the charges, and e0 is the permittivity of vacuum
(e0 = 8.854187817 10−12 C2 N−1 m−2, see Appendix), and the potential energy
of interaction is
q1 q2
Ep ¼ ð5:9Þ
4pe0 r12
246 5 Molecular Structure
where q is the absolute value of each charge. In turn, at P2, the sum is positive,
meaning that the dominant influence comes from the positive charge,
q qþ q q
P2 ðr2 þ \r2 Þ þ ¼ þ [0
r2 r2 þ r2 r2 þ
Fig. 5.2 The point P1 “feels” a dominant electrostatic influence from the nearer negative charge,
whereas P2 “feels” a dominant electrostatic influence from the nearer positive charge
5.2 Electrostatic Potential 247
Fig. 5.3 The electrostatic potential at point P is equal to the reversible work done by the electric
field in the transport of an infinitesimal test charge from infinity to point P, divided by the test
charge
between two points in space depends only on these points, not on the path followed
between them (the electrostatic force is a conservative force). In order to obtain a
function defined at each point in space that does not depend on the test charge qt,
one divides w(∞ ! P) by qt, thus obtaining the electrostatic potential defined by
wð1 ! PÞ
/ðPÞ ð5:10Þ
qt
ZB
/B /A ¼ EðrÞ dr ð5:11Þ
A
where E is the electric field vector, whose magnitude is the ratio of the electrostatic
force exerted on the test charge qt and this charge, and dr is the infinitesimal
displacement vector along the path (A and B are the lower and upper limits of the
integral). The electric field belongs to the system. When the electric field vector and
the infinitesimal displacement vector have the same direction, E(r) dr > 0, the
system does work on the exterior, thus releasing energy, and so the work is neg-
ative. In turn, when E(r) dr < 0, the exterior does work on the system, and the
work is positive. When A is infinity (the test charge at infinity does not interact with
the charge distribution, and so the electric potential at infinity is zero, /∞ = 0), and
B is the point P, the electrostatic potential /P due to a point charge is given by
ZP Zr
q 1 q
/P /1 ¼ EðrÞ dr¼ dr ¼ ð5:12Þ
4pe0 r2 4pe0 r
1 1
248 5 Molecular Structure
where the charge qi labeled by the subscript i includes not only its magnitude, but
also its sign, and riP stands for the distance between the charge i and the point
P where the electrostatic potential /(P) is evaluated.
Consider now a point on the 0.00040 isodensity surface of H2O (Fig. 5.1). The
electrostatic potential of a molecule is the sum of two main contributions, one
positive, given by the electrostatic potential produced by the nuclei with atomic
numbers Zn, the other negative, given by the electrostatic potential resulting from
the electron charge density defined at each point in space. The electron charge
density is obtained by multiplying the electron probability density q0 by the elec-
tron charge −e. Since the electron charge is −1 in atomic units, the electron charge
density is given in these units by −q0, and the electron charge in the volume
element dv is −q0dv. Hence,
X Zn Z
q0 ðrÞ
/ðPÞ ¼ dv ð5:14Þ
n
rnP reP
Fig. 5.4 Electrostatic potential for the H2O molecule mapped over the total density surface with
isovalue equal to 0.00040 (Gaussian 09 B3LYP/cc-pVTZ calculation with figure presented by
GaussView)
The integral of the ground state electron density of a molecule over all space around
the nuclei [see (5.4)] is equal to the total number of the molecule’s electrons,
Z
q0 ðrÞdv ¼ n ð5:15Þ
The ground state total electronic wave function for a closed-shell configuration is
given by the following Slater determinant of molecular spin-orbitals,
h i
/ /
W0 ¼ ð1; . . .; nÞ ¼ /1 /
1 2 2 . . ./n=2 /n=2 ð5:16Þ
where each column corresponds to a molecular spin-orbital, and the bar over /i
indicates beta spin-orbitals (E2). The spatial factors of the molecular spin-orbitals
are given by linear combinations of Slater-type atomic orbitals
250 5 Molecular Structure
X
b
/i ¼ csi vs ð5:17Þ
s¼1
X
n=2 b X
X b X
n=2
q0 ðrÞ ¼ 2 /j /j ¼ 2 crj csj vr vs ð5:18Þ
j¼1 r¼1 s¼1 j¼1
Z b X
X b X
n=2 b X
X b
q0 ðrÞdv ¼ 2 crj csj Srs ¼ Prs Srs ¼ n ð5:19Þ
r¼1 s¼1 j¼1 r¼1 s¼1
where Prs represents the rs density matrix element and Srs is the corresponding
overlap integral,
X
n=2 Z
Prs ¼ 2 crj csj Srs ¼ vr vs dv ð5:20Þ
j¼1
Both the density and overlap matrices are symmetric matrices. For real coefficients,
X
n=2 X
n=2
Prs ¼ 2 crj csj ¼ 2 crj cTjs ¼ 2Cr CsT ð5:21Þ
j¼1 j¼1
where Cr represents the r-row vector [1 (n/2) vector] that multiplies the transpose
of the s-row vector [(n/2) 1 vector]. The result of this vector product is a number,
the Prs element of the density matrix. In matrix notation, we can write
P ¼ 2CCT ð5:22Þ
b X
X b X
b b X
X b
n¼ Prs Srs ¼ Prr þ 2 Prs Srs ð5:23Þ
r¼1 s¼1 r¼1 r [ s s¼1
where the off-diagonal elements correspond to the matrix lower triangle (the row
number is greater than the column number).
5.3 Mulliken Population Analysis 251
Fig. 5.5 Molecular orbitals for CH4 represented by surfaces with isovalues equal to 0.06 and the
corresponding symmetry species of the Td point group. Lobes with different grades of gray
correspond to different signs. HF/STO-3G Calculation performed by Gaussian 09, and molecular
orbitals presented by GaussView
252 5 Molecular Structure
The first relevant feature of these orbitals is their delocalization over the whole
molecule. In particular, all the doubly occupied orbitals have nonzero coefficients in
each of the four hydrogen atoms’ 1s orbitals. Note that the molecular orbital with the
lowest energy is essentially an atomic orbital of the carbon atom core, since the largest
coefficient refers to the carbon 1s orbital (the 2p orbitals have zero coefficients), and
the coefficients for the hydrogen atoms’ 1s orbitals are close to negligible. The
molecular orbital with the second-lowest energy has its largest coefficient on the
carbon 2s orbital. These two molecular orbitals transform like the A1 irreducible
representation of the Td group, i.e., they are totally symmetric orbitals. The three
molecular orbitals of highest energy are degenerate (they have the same energy
eigenvalue). Each of these molecular orbitals has a C2 axis that is coincident with each
of the x-, y-, and z-axes and transforms like each of these coordinates. In addition, each
of the C2 symmetry axes coincides with an S4 axis. The S4 rotation–reflection sym-
metry operation includes a 2p/4 rotation around the S4 axis followed by a reflection in
the plane perpendicular to the axis. Therefore, application of S4 twice is equivalent to
one C2 symmetry operation. Note that the coefficients of degenerate molecular orbitals
are not unique, since these can always be replaced by linear combinations that are
orthogonal and correspond to the same energy eigenvalue.
The five molecular orbitals of methane are represented in Fig. 5.5 by surfaces
whose isovalues are equal to 0.06. Considering the orientation and the different
signs of the two lobes of each of the triple degenerate molecular orbitals, it can be
concluded that the character of the S4 rotation–reflection symmetry operation is
equal to −1. Hence, these three molecular orbitals transform as the symmetry
species T2.
Density matrix
C 1s C 2s C 2px C 2py C 2pz H 1s1 H 1s2 H 1s3 H 1s4
C 1s 2.06590
C 2s −0.20184 0.79067
C 2px 0.00000 0.00000 0.65429
C 2py 0.00000 0.00000 0.00000 0.65429
C 2pz 0.00000 0.00000 0.00000 0.00000 0.65429
H 1s1 −0.09387 0.22611 0.34412 0.34412 0.34412 0.60826
H 1s2 −0.09387 0.22611 −0.34412 −0.34412 0.34412 −0.11568 0.60826
H 1s3 −0.09387 0.22611 −0.34412 0.34412 −0.34412 −0.11568 −0.11568 0.60826
H 1s4 −0.09387 0.22611 0.34412 −0.34412 −0.34412 −0.11568 −0.11568 −0.11568 0.60826
The density matrix [matrix whose general elements are Prs; see (5.21)] is a
symmetric matrix, and so we present only the lower matrix triangle (E3). In
addition, the elements of the full Mulliken population analysis matrix (see below)
are given by PrsSrs [see (5.19)], since they correspond to the product of the density
5.3 Mulliken Population Analysis 253
matrix elements Prs and the corresponding overlap integrals Srs. These can be
obtained by dividing the Mulliken population matrix elements PrsSrs by the cor-
responding density matrix elements Prs (E4). Note that the diagonal elements for
the density matrix and the full Mulliken population analysis are the same, since the
diagonal overlap integrals are 1, Srr = 1.
Full Mulliken population analysis matrix
C 1s C 2s C 2px C 2py C 2pz H 1s1 H 1s2 H 1s3 H 1s4
C 1s 2.06590
C 2s −0.05013 0.79067
C 2px 0.00000 0.00000 0.65429
C 2py 0.00000 0.00000 0.00000 0.65429
C 2pz 0.00000 0.00000 0.00000 0.00000 0.65429
H 1s1 −0.00594 0.11187 0.09331 0.09331 0.09331 0.60826
H 1s2 −0.00594 0.11187 0.09331 0.09331 0.09331 −0.01993 0.60826
H 1s3 −0.00594 0.11187 0.09331 0.09331 0.09331 −0.01993 −0.01993 0.60826
H 1s4 −0.00594 0.11187 0.09331 0.09331 0.09331 −0.01993 −0.01993 −0.01993 0.60826
Gross orbital populations
C 1s C 2s C 2px C 2py C 2pz H 1s1 H 1s2 H 1s3 H 1s4
1.99201 1.18802 1.02753 1.02753 1.02753 0.93433 0.93433 0.93433 0.93433
The sum of the elements of the Mulliken population matrix over one particular
orbital, that is, over a row or a column, leads to the gross orbital population (see
above), and the sum of the gross orbital populations over all atomic orbitals gives
the total number of electrons [see (5.23)]. In turn, subtracting the sum of gross
orbital populations belonging to a particular atom in the molecule from the total
number of electrons in the isolated atom gives the Mulliken atomic charges (C
−0.262620, H1 0.065670, H2 0.065670, H3 0.065670, H4 0.065670), whose sum
(the total charge of the molecule) is zero. If the matrix elements of the Mulliken
population analysis matrix corresponding to a specified atom are summed, one
obtains the population condensed to that atom (see below) (E5)
Populations condensed to atoms (all electrons)
1C 2H 3H 4H 5H
1 C 4.71918
2 H 0.38586 0.608263
3 H 0.38586 −0.019929 0.608263
4 H 0.38586 −0.019929 −0.019929 0.608263
5 H 0.38586 −0.019929 −0.019929 −0.019929 0.608263
The strong dependence between the Mulliken population analysis results and the
basis set used in the calculation enable us to conclude that this population analysis
method is of reduced practical utility.
254 5 Molecular Structure
The electronic configuration of an isolated carbon atom is 1s2 2s2 2p2. Since all
directions that emerge from the nucleus of an isolated atom are equivalent (the
space around the nucleus is isotropic), the 2p electrons of an isolated carbon atom
are uniformly distributed over the 2px, 2py, and 2pz orbitals, which share the same
electron density. Now consider the carbon atom of a CH4 molecule and four
equivalent independent linear combinations of the 2s, 2px, 2py, and 2pz orbitals
directed to the hydrogen atoms on the vertices of a regular tetrahedron. These
Fig. 5.6 a CH4 inscribed in a cube. b Tetrahedral angle. c Cartesian coordinates for the vertices
of a regular tetrahedron
5.4 Natural Bond Orbitals 255
orbitals can be made orthonormal and are called hybrid atomic orbitals. Having
spherical symmetry, the 2s orbital participates in them with the same coefficient (we
can begin by assigning 1, apart the normalization constant for the whole hybrid
orbital). The subscript in each of the 2px, 2py, 2pz orbitals indicates how these
atomic orbitals transform under a symmetry operation. For example, 2px transforms
like the x coordinate, that is, 2px = x f(r), where r is the distance from the point to
the origin. Therefore, the 2px, 2py, and 2pz orbitals act like unit vectors along the
x-, y-, and z-axes, and their coefficients in the hybrid orbitals are given by the
corresponding Cartesian coordinates of each vertex to which the hybrid orbital is
directed (Fig. 5.6c), apart from the normalization constant.
The hybrid orbital directed to the (1, −1, −1) vertex is given by
where N is the normalization constant. Since the atomic orbitals are orthogonal and
normalized, the normalization constant of the hybrid atomic orbital is given by the
inverse of the square root of the sum of the squared orbital coefficients. In partic-
ular, for (5.24) we can write
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
1
N ¼ 1= ð þ 1Þ2 þ ð þ 1Þ2 þ ð1Þ2 þ ð1Þ2 ¼ ð5:25Þ
2
We now consider an spk hybrid orbital
1 pffiffiffi
hA ¼ pffiffiffiffiffiffiffiffiffiffiffi s þ kpA ð5:26Þ
1þk
1 k
fs ¼ fp ¼ ð5:27Þ
1þk 1þk
are the s and p fractions, and k is the hybridization parameter, given by the ratio
fp/fs (E7, E8, E9). Note that the sum of fs and fp is equal to 1.
Let us now consider the spkA and spkB hybrids directed to atoms A and B,
1 pffiffiffiffiffi 1 pffiffiffiffiffi
hA ¼ pffiffiffiffiffiffiffiffiffiffiffiffiffiffi s þ kA pA hB ¼ pffiffiffiffiffiffiffiffiffiffiffiffiffi s þ kB pB ð5:28Þ
1 þ kA 1 þ kB
256 5 Molecular Structure
(E10). The scalar product of two unit vectors directed to atoms A and B is given by
These unit vectors are aligned with pA and pB orbitals, which in turn behave like
unit vectors whose scalar product is given by the overlap integral of pA and pB.
Hence, we can equate the second members of (5.29) and (5.30) and write
1
pffiffiffiffiffiffiffiffiffiffi ¼ cos x ð5:31Þ
kA kB
1
k¼ ð5:32Þ
cos x
h ¼ cs s þ cx px þ cy py þ cz pz ð5:33Þ
then k is given by
This expression shows that the hybridization parameter k depends on the coeffi-
cients of s and p orbitals and so on the level and basis set of the calculation. In
addition, (5.34) allows one to conclude that the k value of an spk atomic hybrid
orbital cannot be restricted to integer values 3, 2, and 1. The integer values of k are
imposed by symmetry and correspond to four equivalent hybrid orbitals (Td point
group, example CH4), three equivalent hybrid orbitals (D3h point group, example
BF3) and two equivalent hybrid orbitals (D∞h point group, example CO2),
respectively.
5.4 Natural Bond Orbitals 257
Consider now the HF/STO-3G scan of the CH bond length RCH in CH4. Keeping
the regular tetrahedron geometry, all HCH angles are equal to the tetrahedral angle,
and the CH bonds have the same length. With these geometric constraints, there is
only one independent variable, the CH bond length, RCH. For large values of RCH,
the interactions between the carbon atomic orbitals and the hydrogen atomic
orbitals are negligible, and so all orbitals reduce to those of isolated atoms. For RCH
smaller than 2.0 Å (this value may vary slightly according to the calculation level
and the basis set of functions), the p fraction becomes equal to ¾, i.e., sp3 hybrids
are formed. Figure 5.7 shows the occupation of bond orbitals and the total
molecular energy as functions of RCH. At RCH = 1.08 Å, the occupation of the
orbitals attains a maximum, and the total molecular energy reaches a minimum.
Molecular orbitals localized in the bonds that attain the maximum occupation
allowed by the Pauli exclusion principle (n = 2) are called natural bond orbitals
(NBOs). The carbon 1s orbital remains essentially a doubly occupied orbital of the
carbon atom core and is thus called a natural atomic orbital (NAO). Each of the
natural bond orbitals is a linear combination of one sp3 hybrid on the carbon atom
and the 1s orbital on the hydrogen atom to which the sp3 hybrid is directed. In
matrix form, we have
Orbital coefficients
C 1s 1.0000 0.0000 0.0000 0.0000 0.0000
C (sp3)1 0.0000 0.7253 0.0000 0.0000 0.0000
C (sp3)2 0.0000 0.0000 0.7253 0.0000 0.0000
C (sp3)3 0.0000 0.0000 0.0000 0.7253 0.0000
C (sp3)4 0.0000 0.0000 0.0000 0.0000 0.7253
H 1s1 0.0000 0.6885 0.0000 0.0000 0.0000
H 1s2 0.0000 0.0000 0.6885 0.0000 0.0000
H 1s3 0.0000 0.0000 0.0000 0.6885 0.0000
H 1s4 0.0000 0.0000 0.0000 0.0000 0.6885
The density matrix that corresponds to this coefficient matrix can be obtained by
applying (5.22); it is given by
Density matrix
C 1s C (sp3)1 C (sp3)2 C (sp3)3 C (sp3)4 H 1s1 H 1s2 H 1s3 H 1s4
C 1s 2.0000
C (sp3)1 0.0000 1.0521
C (sp3)2 0.0000 0.0000 1.0521
C (sp3)3 0.0000 0.0000 0.0000 1.0521
C (sp3)4 0.0000 0.0000 0.0000 0.0000 1.0521
H 1s1 0.0000 0.9987 0.0000 0.0000 0.0000 0.9481
H 1s2 0.0000 0.0000 0.9987 0.0000 0.0000 0.0000 0.9481
H 1s3 0.0000 0.0000 0.0000 0.9987 0.0000 0.0000 0.0000 0.9481
H 1s4 0.0000 0.0000 0.0000 0.0000 0.9987 0.0000 0.0000 0.0000 0.9481
258 5 Molecular Structure
Fig. 5.7 Occupation of bond orbitals and total molecular energy of CH4, as functions of RCH.
Natural bond orbitals (NBOs) occupation and total molecular energy as functions of RCH, for
regular tetrahedral arrangements of CH4. Results from an HF/STO-3G calculation performed by
Gaussian 09 and graphs obtained with Mathematica
We now turn to the ethene molecule (ethene is the systematic name for ethylene),
H2C=CH2, where each carbon atom is bonded to three neighboring atoms: two
hydrogen atoms and one carbon atom. The total number of valence electrons in the
molecule is 12, four from each carbon atom, one from each hydrogen atom. The
corresponding six electron pairs include four in the C–H bonds and two in the
carbon–carbon double bond.
The natural bond orbitals for the valence electrons of the ethene molecule are
represented by surfaces with isovalue equal to 0.06 and are shown in Fig. 5.9 by
increasing order of energy. The lower row shows a natural bond orbital that has a
nonzero electron density along the internuclear bond axis and results from a
head-on overlap of atomic orbitals on both carbon atoms. It is called a r bond. The
middle row shows the C–H bonds that are equivalent by symmetry and have the
same energy (are degenerate), and the natural bond orbital in the upper row is
formed by two lobes, one above the plane of the nuclei, the other below, thus
exhibiting zero electron density in the internuclear axis and in the plane of all the
5.4 Natural Bond Orbitals 259
Fig. 5.8 Natural bond orbitals for CH4 represented by surfaces with isovalues equal to 0.06, one
per CH electron bond pair. HF/STO-3G Calculation by Gaussian 09, representation of molecule
and natural bond orbitals presented by GaussView
Fig. 5.9 Natural bond orbitals for H2C=CH2 illustrated by surfaces with isovalue equal to 0.06,
one for each electron bond pair. They are ordered by increasing energy: one CC r bond, four
degenerate CH r bonds, and one CC p bond (Gaussian 09 HF/STO-3G calculation; natural bond
orbitals are drawn by GaussView)
nuclei (nodal plane). This bond results mainly from overlap of p orbitals with
parallel axes that are perpendicular to the molecular plane and is called a p bond.
In the ethyne molecule (ethyne is the systematic name for acetylene), HCCH,
each carbon atom is bonded to two neighboring atoms: one carbon atom and one
hydrogen atom. The ethyne molecule is linear. The total number of valence elec-
trons in the ethyne molecule is 10, four from each carbon atom, one from each
hydrogen atom. The corresponding five electron pairs include two electrons pairs
for the carbon–hydrogen bonds and three electron pairs for the carbon–carbon triple
260 5 Molecular Structure
Fig. 5.10 Doubly occupied natural bond orbitals for the ethyne molecule are represented by
surfaces with isovalue 0.06 and are shown by order of increasing energy: one CC r bond, two
degenerate CH r bonds, two CC p bonds. HF/STO-3G Calculation performed by Gaussian 09,
and natural bond orbitals visualized by GaussView
bond (one r and two p bonds). The natural bond orbitals are presented in Fig. 5.10
by order of increasing energy, that is, E(r C–C bond) < E(degenerate CH
bonds) < E(degenerate p CC bonds).
Considering now the CH hybrid orbitals in the methane, ethene, and ethyne
molecules (find details of the calculations in Sect. 5.4.6), a clear trend can be
observed, namely, the increasing s orbital fraction in the spk hybrid orbitals, as we
go from CH4, to H2CCH2 and HCCH (see table below). The greater s fraction of the
hybrid orbitals in the ethyne molecule allows negative charge to be held closer to
the nucleus in the carbon atom, thus resulting in a greater acidic character of the
hydrogen atoms in ethyne.
Carbon atom spk hybrids and the corresponding s orbital fractions
Molecule CH hybrids s fraction CC hybrids s fraction
CH4 sp3 0.25 – –
H2C=CH2 sp2.37 0.30 sp1.45 0.41
HCCH sp1.09 0.48 sp0.91 0.52
5.4 Natural Bond Orbitals 261
Fig. 5.11 Ball and bond representation of the ethane, ethene, and ethyne molecules with
calculated versus experimental (CCCBDB) geometric parameters (bond lengths are in angstroms,
bond angles in degrees, experimental values in parentheses). B3LYP/cc-pVTZ calculations
performed by Gaussian 09 and molecular representations obtained with GaussView
262 5 Molecular Structure
Fig. 5.12 Total density surfaces (isovalue = 0.000400) mapped with the corresponding electro-
static potentials, from left to right, for methane, ethene, and ethyne molecules. B3LYP/cc-pVTZ
calculation performed by Gaussian 09, and representations of molecules and electrostatic
potentials visualized by GaussView
exhibit a strong positive electrostatic potential, in contrast with the hydrogen atoms
of ethene and methane.
A þ BC ! AB þ C ð5:35Þ
The system of three atoms, A, B, and C, can be described by three internal coor-
dinates, namely, the internuclear AB distance rAB, the internuclear BC distance rBC,
and the bond angle \ABC. For a defined angle, for example, \ABC ¼ p radians,
the potential energy difference DU between the potential energy of the interacting
three-atom system (A…B…C) and the sum of the energies of the three isolated
atoms, A, B, and C, is a function of variables rAB and rBC, DU = f(rAB, rBC). This
function describes a surface called a potential energy surface.
From the mathematical point of view, a general potential energy surface can be
characterized by its stationary points, i.e., by the points whose derivative is zero,
5.5 Potential Energy Surfaces 263
Fig. 5.13 Maximum, minimum, and saddle point at (0,0), for the functions −x2 − y2, x2 + y2 and
x2 − y2. Figure obtained with Mathematica
@U
¼ 0 k ¼ 1; 2; . . .; 3N 6 ð5:36Þ
@rk
where rk is each of the 3N − 6 coordinates for a system with N atoms, and each
stationary point can be classified as a maximum, a minimum, or a saddle point
(Fig. 5.13). In a first-order saddle point, the function takes a maximum in one of
the 3N − 6 internal coordinates and minima in the remaining 3N − 7 coordinates.
The potential energy surface of Fig. 5.14 corresponds to a linear triatomic
system XHH and was obtained using Morse functions (see Baggott et al. 1988). It
shows two valleys approximately parallel to the corresponding coordinate axes and
an upper pass between them through one first-order saddle point. The reactant
Fig. 5.14 Typical potential energy surface obtained using a combination of Morse functions.
Right view shows the minimum energy path passing by the first-order saddle point. Figure obtained
with Mathematica
264 5 Molecular Structure
When the reaction coordinate begins at the saddle point and goes down the hill
toward the reactant channel (negative values of the reaction coordinate) and toward
the product channel (positive values of the reaction coordinate), the obtained
coordinate is called intrinsic reaction coordinate. Suggestively, the mathematical
method used in the calculation for obtaining the intrinsic reaction coordinate is the
method of steepest descent.
The linear triatomic molecules HCN (hydrogen cyanide) and HNC (hydrogen
isocyanide) are isomers. Conversion of HCN into HNC implies hydrogen migra-
tion, where the transition state is given by a cyclic triangular structure (Fig. 5.15).
The results presented in this figure show that the HCN ! HNC reaction is
endothermic.
Consider now the general reaction
Fig. 5.15 Potential energy variation as a function of the intrinsic reaction coordinate for the
isomerization reaction HCN ! CNH (Gaussian 09 B3LYP/cc-pVTZ calculation). Figure obtained
with Mathematica
5.5 Potential Energy Surfaces 265
Fig. 5.16 Potential energy variation as a function of the intrinsic reaction coordinate for the
reaction Cl− + H3CCl ! ClCH3 + Cl− (Gaussian 09 B3LYP/aug-cc-pVTZ calculation). Fig-
ure drawn by Mathematica
If the nucleophilic substitution reaction occurs in a single step, the approach of Nu−
is concerted with the withdrawal of X−. In this case, the transition state involves
both reactant species Nu− and RX, the reaction is bimolecular and named SN2, and
is said to occur in a bimolecular concerted step, with its rate equation given by
v ¼ k½RX½Nu ð5:39Þ
5.6.1 Ethane
We now consider the ethane molecule, H3C–CH3, and the rotation of one of its
methyl groups around the CC axis. The distances between the hydrogen atoms of
the distinct methyl groups vary in the course of this methyl rotation (Fig. 5.17a) and
so does the total molecular energy. In order to define the relative position of the
methyl groups, four atoms bonded in sequence H(1)C(2)C(3)H(4) are considered,
and the angle measured in the clockwise direction between the H(1)C(2)C(3) and C
(2)C(3)H(4) planes is called a dihedral angle (Fig. 5.17b). Note that an angle
between two planes is measured in the plane perpendicular to the common edge.
Therefore, the dihedral angle for the ethane molecule is determined in the plane
perpendicular to the CC bond axis (the plane of the paper in Fig. 5.17c). We
consider the dihedral angle to vary in the range between 0° and 360°. A 60°
dihedral angle for the atomic sequence H(1)C(2)C(3)H(4) corresponds to a 300°
dihedral angle for the atomic sequence H(4)C(3)C(2)H(1).
The ethane molecule has one CC bond and six CH bonds. Around each tetra-
hedral center (each carbon atom), there are six bond angles (three CCH angles and
three HCH angles), whose sum covers the whole space, thus being equal to 4p
steradian (a constant). Hence, around each carbon atom there is one bond angle
redundancy, meaning that there are five independent bond angle variables, not six.
One additional coordinate is needed to define the relative position of the methyl
groups, that being one HCCH dihedral angle. Altogether, we have seven bond
lengths, plus 5 + 5 bond angles, plus one dihedral angle, making a total of 18
(= 3N − 6, where N = 8) geometric variables. In the potential energy plot of
Fig. 5.18, the dihedral angle is the independent variable. Changing the dihedral
angle will make the remaining 3N − 7 = 17 variables adjust their values to each
new value of the dihedral angle. One says that the dihedral angle scan is accom-
panied by relaxation of the molecular geometry (it is a relaxed scan).
The plot of the potential energy of the ethane molecule H3C–CH3 as a function of
the dihedral angle HC–CH is called a potential energy profile for the rotation
around the C–C bond (Fig. 5.18). As shown in this figure, for every initial value of
the dihedral angle, the potential energy retakes the same value after a dihedral angle
Fig. 5.17 a In the ethane molecule, methyl group rotation around the CC axis changes the
distance between hydrogen atoms, thus causing the total molecular energy to vary. b Numbering of
atoms involved in the dihedral angle. c The angle between 1-2-3 and 2-3-4 planes is measured in
the plane perpendicular to the CC axis. Representation of molecules visualized by GaussView
5.6 Molecular Conformations 267
Fig. 5.18 Potential energy profile for the rotation of a methyl group in the ethane molecule.
Staggered and eclipsed conformations correspond to stationary points, minimum, and maximum,
respectively (Gaussian 09 B3LYP/cc-pVTZ calculation). Graph drawn with Mathematica
Fig. 5.19 CC bond length of the ethane molecule as a function of the dihedral angle (Gaussian
09 B3LYP/cc-pVTZ calculation; graph obtained with Mathematica)
268 5 Molecular Structure
The relaxation of geometric variables during the scan of the dihedral angle can be
illustrated by the variation of the CC bond length. The observed variation of the CC
bond length with the dihedral angle is in phase with the potential energy profile for
the molecule, since the CC bond is along the threefold rotation axis of the molecule.
The largest increase of the CC bond length attenuates the effect of CH interactions of
distinct methyl groups in the eclipsed conformation (Fig. 5.19). In turn, the smallest
values of the CC bond length correspond to the staggered conformation.
The ethane molecule has 18 normal modes of vibration (N = 8, 3N − 6 = 18).
Among these, the normal mode with the lowest frequency is the torsional mode,
which is the out of phase oscillation of the methyl groups, whose atomic dis-
placement vectors are shown in Fig. 5.20 for the staggered conformation, with the
vectors applied to the hydrogen atoms of the same methyl group pointing in the
same direction, either clockwise or counterclockwise, whereas the vectors applied
to hydrogen atoms of different methyl groups point in opposite directions.
The calculated harmonic frequency for the torsional mode in the staggered con-
formation is 306 cm−1 (Gaussian 09 B3LYP/cc-pVTZ calculation). Figure 5.20
shows the potential energy curve with the minimum at the dihedral angle equal to 60°
(staggered conformation), the zero-point energy level (level 0), and the first torsion
level (level 1), which is 306 cm−1 above the zero-point level. The 0–1 transition
corresponds to the fundamental transition, whereas the 0–2 transition is the first
overtone (not shown). The amplitude of torsion in level 1 can be roughly estimated
from Fig. 5.20 as ±30°. This means that each methyl group spans an arc of about 60°,
and the total amplitude of motion relative to each other methyl group spans an arc of
about 120° (the methyl groups rotate in opposite directions, one clockwise, the other
counterclockwise). Hence, each methyl group probes practically all dihedral angles
corresponding to different conformations around the staggered conformation.
Fig. 5.20 Potential energy curve for the staggered conformation of ethane with harmonic energy
levels for the zero-point energy and the v = 1 energy level of the torsional mode (Gaussian 09
B3LYP/cc-pVTZ calculation; molecule representation obtained with GaussView; graph obtained
with Mathematica)
5.6 Molecular Conformations 269
5.6.2 1,2-Dichloroethane
Fig. 5.21 Potential energy profile for the 1,2-dichloroethane molecule. Chlorine atoms are
represented in dark gray. Synclinal and antiperiplanar conformations correspond to minima,
synperiplanar and anticlinal conformations to maxima (Gaussian 09 B3LYP/cc-pVTZ calcula-
tion; molecule representation obtained with GaussView; graph obtained with Mathematica)
270 5 Molecular Structure
Fig. 5.22 Potential energy curves for the synclinal (left) and antiperiplanar conformations with
the corresponding zero-point energy and v = 1 torsional levels and the displacement vectors for the
corresponding vibrations (Gaussian 09 B3LYP/cc-pVTZ calculation; molecule representation
obtained with GaussView; graph obtained with Mathematica)
Figure 5.22 shows the potential energy curves for the synclinal and antiperi-
planar conformations, the displacement vectors for the corresponding torsional
vibrations at the calculated values of 110 cm−1 for the synclinal conformation and
116 cm−1 for the antiperiplanar conformation, and the zero-point energy and v = 1
levels. The large volumes of the chlorine and carbon atoms hide their small dis-
placement vectors. On each ClH2 set of atoms, the displacement vectors point in the
same direction, either clockwise or counterclockwise, whereas the vectors on dif-
ferent ClH2 sets of atoms point in opposing directions. Note that the carbon atoms
move slightly, so that the molecule’s center of mass is kept still. Looking now at the
amplitudes of the synclinal and antiperiplanar torsional motions in the v = 1 levels,
we find that these can be roughly estimated from Fig. 5.22 as ±12°. The substi-
tution of hydrogen in ethane by chlorine in 1,2-dichloroethane increases the
reduced mass of the torsional oscillator, thus reducing the torsional frequency [see
(3.55)] and, consequently, the amplitude of the torsional vibration. Unlike the
methyl groups in ethane, the ClH2C- groups of 1,2-dichloroethane have small
amplitudes of torsional oscillations around their corresponding equilibrium posi-
tions. Thus, the synclinal and antiperiplanar conformations of 1,2-dichloroethane
correspond to spectroscopically distinguishable conformational isomers, or
conformers.
where
X
U¼ gk exp½ek =ðkB TÞ ð5:42Þ
k
gi exp½Ei =ðRTÞ
fi ¼ ð5:43Þ
U
where
X
U¼ gk exp½Ek =ðRTÞ ð5:44Þ
k
For a system of two levels with degeneracy degrees g1 and g2, we can write
f2 =g2
DE21 ¼ E2 E1 ¼ RT ln ð5:45Þ
f1 =g1
From (5.43) we can obtain the quotient fi/gi that appears in (5.45), for i = 1 and
i = 2 (E11).
Vibrational spectroscopic studies in solution with a nonpolar solvent have to be
carried out in order to obtain reliable experimental values concerning individual
conformers. The first step consists in assigning vibrational bands to distinct con-
formers. This is not a simple task! Usually, it is accomplished with the help of
relatively accurate frequency calculations for the considered conformers and of
additional spectroscopic information that might be gathered. In the vibrational
spectrum, one selects a pair of bands of the same vibrational mode in distinct
conformers that allows a reliable determination of their intensities. The ratio of
intensities of those bands equals the ratio of populations for the corresponding
conformers.
272 5 Molecular Structure
Molecules that are mirror images of one another have the same atoms and the same
bonds, i.e., the same connectivity. Can we necessarily say that they are the same
molecule with identical physical and chemical properties? A test that enables us to
answer this question determines whether molecules that are mirror images of one
other can be superimposed. Superposition is a virtual test, since molecules are
impenetrable objects. Two molecules that are mirror images of each other and
cannot be superimposed have a relationship analogous to a person’s left and right
hands.
Two chlorofluoromethane molecules, CH2ClF, that are mirror images of each
other can be superimposed (Fig. 5.23). Superposition implies a C2 rotation of one
of the molecules followed by its translation to match the positions of the atoms of
the other molecule. The CH2ClF molecules of this pair represent the same com-
pound, with specific physical and chemical properties.
Replacement of the hydrogen atom of each CH2ClF molecule away from the
observer by one bromine atom results in two bromochlorofluoromethane molecules,
CHBrClF (Fig. 5.23). Unlike CH2ClF, the CHBrClF molecules exhibit a new
feature: they are mirror images of each other but cannot be superimposed. No
combined rotation and translation of one CHBrClF molecule of the pair can lead to
superposition of both molecules. In fact, these molecules are to each other as the left
and right hands of a person, and are thus called chiral objects (from Greek kheir,
hand). Being isomers, they are also called enantiomers (from Greek
enantios = opposite).
How are chirality and symmetry related? For a compound like CH2ClF having a
tetrahedral carbon atom as the central atom and not being chiral, we concluded that
a C2 rotation of the molecule followed by a translation leads to superposition with
the other molecule of the object-to-image pair of molecules. In turn, a chiral
molecule does not admit any Sn symmetry operation (Sn = rCn), including a
reflection (r = S1) or an inversion (i = S2).
Fig. 5.23 Left Ball and bond representations of two superposable chlorofluoromethane molecules
with an object-to-image plane mirror relationship to each other. Right Ball and bond
representations of two nonsuperposable bromochlorofluoromethane molecules with an
object-to-image relationship in a plane mirror to each other. Molecular representations obtained
with GaussView
5.7 Chiral Molecules 273
Molecules that are mirror images of each other and cannot be superimposed have
a chiral center. The Cahn–Ingold–Prelog rules [Sidney Cahn (1899–1981, Bri-
tish chemist), Christopher Kelk Ingold (1893–1970, British chemist), and Vladimir
Prelog (1906–1998, Croatian chemist), Nobel prize in chemistry in 1975 for
advances in the stereochemistry of organic molecules and chemical reactions]
assign different priorities to the atoms or groups bonded to a particular chiral center.
These priorities follow the decreasing order of the atomic numbers. The atom or
group with the lowest priority points away from the observer. If there is one
hydrogen atom bonded to the chiral center, then the hydrogen atom is pointing
away from the observer. If the decreasing priority of the atoms or groups bonded to
the chiral center is in the clockwise direction, the chiral center is assigned an
R (from Latin rectus = right) descriptor, whereas for the counterclockwise direc-
tion, the chiral center is assigned an S (from Latin sinister = left) descriptor
(Fig. 5.24).
Fig. 5.26 Left- and right-handed helices having the object-to-image relationship in a plane mirror
are not superimposable. Figure obtained with Mathematica
5.7 Chiral Molecules 275
2p 2p
k¼ ¼ 2p~v x ¼ ¼ 2pv ð5:47Þ
k s
and E z0 is the electric field maximum amplitude, whose square of the absolute value
is the energy carried by the wave.
Fig. 5.27 Plane-polarized electromagnetic radiation propagating along the x-axis. The electric
field oscillates in the vertical plane. Figure obtained with Mathematica
a ¼ ½aTk cd ð5:48Þ
where [a] is called the specific rotation, i.e., the rotation for unit values of c and
d. The temperature and wavelength dependencies of a are included in the specific
rotation [a], measured for specified values of T and k. The yellow sodium D line
near the wavelength 589 nm is commonly used in measurements.
A plane-polarized electric field wave can be decomposed into left- and
right-circularly polarized electric field waves (Fig. 5.29) that, in an optically active
medium, have different refractive indices, thus propagating at different speeds.
Fig. 5.29 A left-circularly polarized wave added to a right-circularly polarized wave gives a
plane-polarized wave. Figure obtained with Mathematica
5.7 Chiral Molecules 277
Fig. 5.30 On traversing an optically active medium, left- and right-circularly polarized waves
change phase difference progressively, and the resulting wave gradually rotates. In a polarimeter,
the views along the sample tube show the rotation angle for increasing lengths of the sample tube.
Figure obtained with Mathematica
Hence, the phase relationship between left- and right-circularly polarized waves
changes progressively, and the resulting wave gradually rotates (Fig. 5.30). The
Mathematica code M4 illustrates this point.
Mathematica Codes
This potential energy surface is modeled for a linear triatomic system XHH as the
sum of three Morse functions [see (3.68)], with one of them representing an
interaction term (see Baggott et al. 1988). The channels linked by a first-order
saddle point correspond to the chemical reaction X + H2 ! XH + H. The disso-
ciation energies for the reactant and product channels are equal to 20 and 50
arbitrary units of energy, and the force constants for the H2 and XH diatomics are
equal to 40 and 80 arbitrary units of energy per square length. The equilibrium point
for each Morse function is defined at 1.2.
The clockwise screwing motion of a right-handed helix moves the helix away from
the observer. From the mathematical point of view, a right-handed helix can be
obtained by combining a circle drawn in the clockwise direction with a perpen-
dicular line segment. In a similar way, a left-handed helix can be seen as the result
of combining a circle drawn in the counterclockwise direction with a perpendicular
line segment. The next two Mathematica codes illustrate the mathematical ways of
forming right- and left-handed helices. They use the ParametricPlot3D
command, which produces a three-dimensional curve in which the x, y, and
z component functions depend on a single parameter (an external variable) that can
be interpreted as the time variable:
280 5 Molecular Structure
Suggestion: Write a Mathematica code to form right- and left-handed helices that
are mirror images of each other.
This Mathematica code simulates a wave approaching the observer and gradually
rotating its plane of polarization as it passes through an optically active medium.
The wave is built by adding left- and right-circularly polarized waves with slightly
different phases:
Suggestion: Change the code so that the incoming wave rotates to the coun-
terclockwise direction.
Glossary
Exercises
References
Mulliken RS (1955) Electronic population analysis on LCAO-MO molecular save functions I.
J Chem Phys 23:1833–1840
Baggott JE, Caldow GL, Mills IM (1988) J Chem Soc Faraday Trans 2(84):1407–1422
El Youssoufi Y, Herman M, Liévin J (1998) The ground electronic state of 1,2-dichloroethane I.
Ab initio investigation of the geometrical, vibrational and torsional structure. Molec Phys
94:461–472
References 285
Further Reading
Levine IN (2013) Quantum chemistry, 7th edn. Pearson Education Inc., Upper Saddle River
Weinhold F, Landis CR (2012) Discovering chemistry with natural bond orbitals. Wiley, Hoboken
Crystals
6
Abstract
We begin this chapter by describing modes for packing disks and spheres, with
particular attention to three-dimensional close-packed modes. Next, we consider
the concept of translation symmetry, central to crystallography, and present two-
and three-dimensional Bravais lattices. We show the distribution of the most
representative structures of metals in the periodic table of the elements and build
the cesium chloride, sodium chloride, diamond, and zinc blende structures, using
their corresponding translation vectors. X-ray diffraction is the most widely used
experimental method for solving the structure of crystals. For this reason, we
give a brief introduction to the subject and explain the optical transform of an
X-ray diffraction experiment. Next, we consider and illustrate the formation of
bands in solids, introduce semiconductors and the Fermi–Dirac distribution of
electrons for metallic conductors. At the end of this chapter, the student can find
several Mathematica codes (Packing of Disks, Hexagon of Disks, Disk Layers,
The Third Dimension, HCP Structure, 2D Square Point Lattice, 2D Hexagonal
Point Lattice, 3D Bravais Lattices, CsCl Structure, NaCl Structure, From Energy
Levels to Bands, Fermi–Dirac Distribution) that enable one to visualize the
crystalline structures considered in this chapter and include detailed explanations
of new commands, a glossary of important scientific terms, and a list of
exercises, whose answers can be found after the Appendix.
6.1.1 Disks
Consider the packing of identical disks in a plane. Assuming disks of radius 1/2, the
distance between centers of tangent disks is 1. In square packing, the (x,y) coordinates
© Springer International Publishing Switzerland 2017 287
J.J.C. Teixeira-Dias, Molecular Physical Chemistry,
DOI 10.1007/978-3-319-41093-7_6
288 6 Crystals
of the disks’ centers describe a square (Fig. 6.1). For hexagonal close packing, the
evaluation of coordinates of the disks’ centers requires simple trigonometry of right
angle triangles, as shown in Fig. 6.1.
Figure 6.2 shows square and hexagonal packing modes of identical disks. The
Mathematica code M1 builds the disk lattices of this figure. In square packing, each
disk is tangent to four neighboring disks, whereas in hexagonal packing, each disk
is tangent to six neighboring disks (Fig. 6.2) whose centers are the vertices of a
regular hexagon. This arrangement reveals the existence of a sixfold rotational
symmetry axis. The Mathematica code M2 illustrates this point and generates the
picture shown in Fig. 6.3.
As shown in Fig. 6.4, the centers of three close-packed disks are the vertices of
an equilateral triangle, whose center is the center of the interstice that can be
determined making use of simple trigonometry for a right triangle.
The Mathematica code M3 considers two close-packed layers of identical disks,
where the second layer is displaced so that its disks’ centers coincide with the
Fig. 6.1 Coordinates of centers of disks of radius 1/2, in square and hexagonal packing modes.
Figure obtained with Mathematica
Fig. 6.2 Square and hexagonal packing modes of identical disks. Figure obtained with
Mathematica
6.1 Packing Disks and Spheres 289
interstices of the first layer (Fig. 6.5). The translation vector that shifts the second
layer of disks with respect to the first layer is shown in Fig. 6.4. The coordinates of
its origin are (1, √3/2), and the coordinates of its tip at the center of the interstice are
(3/2, 2/√3). Hence, the translation vector is given by
pffiffiffi
1 3
t ¼ ^x þ ^y ð6:1Þ
2 6
6.1.2 Spheres
We begin by considering four equivalent and mutually tangent spheres of radius 1/2
whose centers form a regular tetrahedron that can be drawn inside a cube (Fig. 6.6).
The distance between center 4 and the 1-2-3 plane is given by √2/√3, as can be
290 6 Crystals
Fig. 6.5 Close-packed disk layer and a second close-packed layer of disks in gray displaced so
that its disks’ centers coincide with the interstices of the first layer. Figure obtained with
Mathematica
Fig. 6.6 Right triangle trigonometry determines the distance between center 4 and the 1-2-3 plane
shown by simple trigonometry of right triangles (Fig. 6.6). Sphere 4 is nested on the
interstice of spheres 1, 2, and 3. Therefore, √2/√3 is the vertical distance between the
centers of two horizontal close-packed layers of spheres of radius 1/2. The vector
that goes from center 1 to center 4 [= vector (4-c) + vector (c-1), where vector (4-c)
is obtained from Fig. 6.6 and vector (c-1) is given by (6.1)] is
pffiffiffi rffiffiffi
1 3 2
t ¼ ^x þ ^y þ ^z ð6:2Þ
2 6 3
(E1).
Figure 6.7 shows a regular tetrahedron formed by the centers of four mutually
tangent spheres. The Mathematica code M4 uses the threefold rotation symmetry
axis to draw Fig. 6.7 (E2).
6.1 Packing Disks and Spheres 291
Fig. 6.7 Regular tetrahedron formed by the centers of four mutually tangent spheres.
Figure obtained with Mathematica
The close packing of identical spheres on one layer poses no great difficulty: each
sphere is tangent to six neighboring spheres, whose centers form a regular hexagon.
If projected on the plane of the paper, the spheres are seen as tangent disks with
interstices appearing as triangles of curved sides, half of them with an upside
vertex, the other half with a downside vertex (Fig. 6.8). The second layer of spheres
nests on interstices with downside vertices, since these correspond to depressions
on the upper surface of the layer of spheres. Unlike the first layer, where the
downside and upside curved triangles are equivalent sites (rotation of the whole
structure by 180° transforms downside curved triangles into upside curved triangles
and vice versa), the second layer has two kinds of interstices: those that pass
through both layers and those that encounter spheres of the first layer (Fig. 6.8).
Given the option to lay down the third layer, we nest the spheres of this layer on
the second layer interstices that encounter first layer spheres, keeping open the
interstices that pass through all layers. Then the third and first layers coincide on
projection (Fig. 6.8). If successive layers are labeled by capital letters with the first
layer being labeled A, then the second layer is B and the third layer is A again (layer
sequence ABAB… with spheres of alternating layers overlaying one another). This
structure is called hexagonal close-packed (hcp), with the representative polyhe-
dron being a hexagonal prism (Fig. 6.8). The Mathematica code M5 builds
Fig. 6.8.
Fig. 6.8 Hexagonal close packing of rigid spheres with equal radius. Figure obtained with
Mathematica
292 6 Crystals
The number of spheres tangent to any one sphere (number of nearest neighbors)
is called coordination number. The coordination number of the hcp structure is 12.
To obtain this number, we consider layer A and two additional B layers, one above,
the other below, to form the BAB sequence. The central sphere of layer A is tangent
to six spheres on the same equatorial layer and to three spheres on each of the B
layers, above and below the central sphere of layer A.
We now go back to the first two layers of close-packed spheres and nest the sphere
of the third layer on the interstice that passes through B and A (Fig. 6.9). The
resulting sequence of layers ABCABC… leads to the cubic close-packed (ccp)
structure, which is also a face-centered cubic (fcc) structure, since the represen-
tative polyhedron is a face-centered cube. Layers A and B correspond to planes that
pass through face diagonals of three cube faces. The central sphere of the cube top
face is tangent to four spheres on this face, four spheres on the layer below, and four
spheres on the layer above (the latter is not shown in Fig. 6.9) thus having the
coordination number 12.
Spheres cannot occupy 100 % of the volume of a crystal repeating unit. The effi-
ciency of occupation of the three-dimensional space by rigid spheres can be
measured by the packing density, which is a ratio of two volumes, the volume of
all spheres contained in the repeating unit (a parallelepiped; note that the cube is a
particular kind of parallelepiped) over the volume of the repeating unit. In order to
evaluate the packing density, one should take into account the contribution of each
sphere to the repeating unit of the crystal, which depends on the position of the
sphere in the parallelepiped of the repeating unit. Each sphere located on a paral-
lelepiped vertex is shared by eight repeating units (Fig. 6.10). Thus, each repeating
unit contains 1/8 of one sphere. If the sphere occupies the midpoint of one edge,
then each parallelepiped contains 1/4 of the sphere. In turn, if the sphere occupies
the center of a face, it is shared by two parallelepipeds, so 1/2 belongs to each
parallelepiped.
Fig. 6.9 Cubic close packing of spheres with equal radius. Figure obtained with Mathematica
6.1 Packing Disks and Spheres 293
Fig. 6.10 Spheres occupying a vertex, the midpoint of an edge, and the center of a face contribute
to each crystal repeating unit with 1/8, 1/4, and 1/2 of a sphere, respectively
If one layer of the hcp structure (say layer A) is displaced to nest over yet
uncovered interstices of the preceding layer, then it transforms itself into layer C of
the ccp structure, and the ABA sequence of the original hcp structure is converted
into the ABC sequence of the ccp structure. This reasoning suggests that the hcp
and ccp structures have the same coordination number (6.12) and the same packing
density, since they are both close-packed structures differing by displaced layers of
spheres.
In 1611, the astronomer and mathematician Johannes Kepler (1571–1630)
conjectured that it would be impossible to exceed the packing density of the cubic
close-packed structure. In the nineteenth century, Karl Friedrich Gauss (1777–
1855) showed that the hexagonal close-packed structure reached the maximum
packing density for a lattice of identical spheres. Finally, in 1998, Thomas Hales
proved Kepler’s conjecture, thus concluding that both the hexagonal and the cubic
close-packed structures reach the maximum packing density for a regular structure
of identical spheres.
In the cubic close-packed (ccp) structure, the central sphere of a cube face is
tangent to the spheres on the vertices (Fig. 6.11). Therefore, we can write
Fig. 6.11 In the cubic close-packed structure, spheres are tangent along the face diagonals.
Arrangement of spheres obtained with Mathematica
294 6 Crystals
pffiffiffiffiffiffiffiffiffiffiffiffiffiffi pffiffiffi
4r = a2 þ a2 ) a ¼ 2 2r ð6:3Þ
where a is the edge and r is the sphere’s radius. The packing density is given by
4
8 18 þ 6 12 3 pr
3
4 4 pr 3 p
¼ p3ffiffiffi 3 ¼ pffiffiffi 0:74 ð6:4Þ
a3 2 2r 3 2
(E3).
The body-centered cubic (bcc) structure has a coordination number equal to 8,
since the central sphere is tangent to eight spheres on the vertices of the cube
(Fig. 6.12). The right triangle formed by one cube main diagonal, one cube face
diagonal, and one cube edge allows one to determine the cube edge a as a function
of the sphere’s radius r:
qffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi pffiffiffi
4
4r ¼ a2 þ dface 2 ¼ a2 þ 2 a2 ¼ 3 a ) a ¼ pffiffiffi r ð6:5Þ
3
This packing density is smaller than those of the hcp and ccp structures: the bcc
structure is not a close-packed structure.
The simple cubic (sc) structure presents spheres on the cube vertices (Fig. 6.13),
and each sphere is tangent to six spheres on the vertices of a regular octahedron.
The packing density is given by
8 18 43 pr 3 43 pr 3 p
¼ ¼ 0:52 ð6:7Þ
a3 ð2rÞ3 6
Fig. 6.12 In the body-centered cubic structure, spheres are tangent along the cube’s main
diagonals. Arrangement of spheres obtained with Mathematica
6.1 Packing Disks and Spheres 295
Fig. 6.13 Spheres are tangent along the cube edges of the simple cubic structure. Figure obtained
with Mathematica
Fig. 6.14 Close-packed layers of identical spheres have octahedral and tetrahedral interstices.
Figure obtained with Mathematica
296 6 Crystals
Fig. 6.15 Octahedral interstice occupied by one sphere. The diameter of each larger sphere is
arbitrarily equal to 1. Figure obtained with Mathematica
pffiffiffi
rs 2rs 21
¼ ¼ 0:414 ð6:8Þ
r 2r 1
where rs is the radius of the smaller sphere that occupies the octahedral interstice.
Consider now the tetrahedral interstice occupied by a sphere tangent to its
neighboring spheres. From the right triangle of Fig. 6.16, we can write
1
1 ¼ ð1 þ rs Þ sin ða = 2Þ ) rs ¼ 1 ð6:9Þ
sin ða = 2Þ
and consequently,
rs sin ða = 2Þ 1
1
1
¼ ¼ 1 0:225 ð6:10Þ
r 1 sin ½arccos ð1 = 3Þ = 2
where a is the tetrahedral angle, and the radii of the larger spheres are set arbitrarily
equal to 1.
Let us now consider the simple cubic structure and the occupation of the cubic
interstice by a sphere tangent to its eight neighboring spheres. The ratio of the
radius of the smaller sphere and the radius of the larger sphere can be easily
obtained from Fig. 6.17 and is given by
pffiffiffi
rs 2rs 31
¼ ¼ 0:732 ð6:11Þ
r 2r 1
Fig. 6.16 Tetrahedral interstice occupied by one sphere. The marked angle is half of the
tetrahedral angle. The radii of the larger spheres are set arbitrarily equal to 1. Arrangement of
spheres obtained with Mathematica
6.1 Packing Disks and Spheres 297
Fig. 6.17 Simple cubic interstice occupied by one sphere. The diameter of each larger sphere is
arbitrarily equal to 1. Arrangement of spheres obtained with Mathematica
The values of the ratios rs/r and the corresponding coordination numbers are
collected in the following table:
The lower value of each rs/r range has been determined assuming that the
spheres of larger radii preserve their initial structure and the sphere of smaller radius
is tangent to its surrounding spheres. In each rs/r range, values greater than the
lower value force the larger spheres to drift away from each other, no longer being
tangent. At each upper limit in parentheses, the structure abruptly changes, and its
coordination number increases.
Figure 6.18 shows a finite section of an infinite discrete array of points with a
square arrangement. For each point P of this array, it is possible to find a pair of
integers (n1, n2) (positive or negative integers and zero included) such that the
positional vector of a point P is given by
298 6 Crystals
Fig. 6.18 Square point lattice and its p/4 counterclockwise rotation. Figure obtained with
Mathematica
p ¼ n1 v1 þ n2 v2 ð6:12Þ
where v1 and v2 are linearly independent vectors (noncollinear vectors). Each point
on the lattice corresponds to a defined pair of integers (n1, n2). For instance,
P1 = v1 + 2v2 and P2 = −2v1 − v2 correspond to (n1, n2) equal to (1, 2) and (−2,
−1), respectively. The vector Eq. (6.12) generates every point of the 2D square
array. Therefore, all points are equivalent, since every point has the same
arrangement of neighbors. In the case of Fig. 6.18, v1 and v2 are orthogonal and are
given by
v1 ¼ a ^x v2 ¼ a ^
y ð6:13Þ
where the scalar a is the lattice constant, vectors v1 and v2 are called primitive
translation vectors, and the array of points is a two-dimensional Bravais square
lattice. All points that satisfy the following equality cover the gray area:
r ¼ x 1 v1 þ x 2 v2 0 xi 1 ð6:14Þ
When this area is translated by all vectors p, the two-dimensional space is filled
without overlapping itself or leaving voids, and such an area is called a
two-dimensional unit cell. The Mathematica code M6 uses (6.12) and (6.13) to
build a square point lattice like the one of Fig. 6.18 and applies a p/4 counter-
clockwise rotation to the resulting square point lattice.
Both arrays of points of Fig. 6.18 represent the same lattice, since the translation
vectors w1 and w2 result from v1 and v2 by a p/4 counterclockwise rotation,
for h = p/4,
1 0
v1 = v2 = ð6:17Þ
0 1
and
1 1 1 1
w1 = pffiffiffi w2 = pffiffiffi ð6:18Þ
2 1 2 1
1 1
w1 = pffiffiffi ðv1 þ v2 Þ w2 = pffiffiffi ðv2 v1 Þ ð6:19Þ
2 2
where 1/√2 is the normalizing factor (v1 and v2 are normalized vectors).
The choice of primitive translation vectors is not unique. For instance, we can
take v1 and v1 + v2 (Fig. 6.19), since these are linearly independent vectors.
However, v1 and v1 + v2 are not orthogonal, do not have the same length, and so do
not reflect the symmetry of the square lattice. Therefore, they are not a convenient
choice of primitive vectors for a square array.
Consider now a two-dimensional lattice where each point is surrounded by six
points on the vertices of a regular hexagon (Fig. 6.20). The translation vectors of
this hexagonal lattice are given by
pffiffiffi
1 3
v1 ¼ a ^x v2 ¼ a ^x þ y^ ð6:20Þ
2 2
where a is the lattice constant and these vectors correspond to the sides of a unit-cell
parallelogram. The Mathematica code M7 uses (6.12) and (6.20) to build a
hexagonal point lattice like the one of Fig. 6.20 (E4).
In two-dimensional space, there are five Bravais lattices, namely square, rect-
angular, oblique, hexagonal, and centered rectangular lattices (Fig. 6.21).
The black and gray points of Fig. 6.22 correspond to motifs of a distinct nature,
where each black point has four gray points as closest neighbors, and conversely,
each gray point has four black points as closest neighbors. This structure cannot be
described by a single set of primitive translation vectors, since the points marked
black and gray cannot simultaneously correspond to integer values of n1 and n2 in
(6.12). The dark gray vector is given by
v1 þ v 2
t¼ ð6:21Þ
2
When this vector is applied to every point of the black lattice, it generates the
gray lattice and vice versa. One says that the periodic structure of Fig. 6.22 results
from merging two square point lattices. The resulting structure can be considered a
two-dimensional equivalent of the cesium chloride structure. The tri-dimensional
Fig. 6.21 Bravais lattices in two-dimensional space. Lattices and translation vectors obtained
with Mathematica
6.2 Translation Symmetries 301
Fig. 6.22 Two square lattices merge with one another. When the orange translation vector is
applied to every point on the black point lattice, the gray lattice is obtained. Figure obtained with
Mathematica
CsCl structure results from merging two simple cubic lattices, one for cesium ions,
the other for chloride ions (cesium and chloride ions have approximately the same
radius).
p ¼ n1 v1 + n2 v2 þ n3 v3 ð6:22Þ
where v1, v2, and v3 are linearly independent translation vectors (noncollinear
vectors) that span the lattice. The lengths of cell edges (a, b, c) and the angles
between them (a, b, c) characterize the lattice systems.
In three-dimensional space, there are 7 lattice systems and 14 Bravais lattices.
The lattice systems are cubic (a, a, a; a = b = c = 90°), tetragonal (a, a, c; a =
b = c = 90°), orthorhombic (a, b, c; a = b = c = 90°), monoclinic (a, b, c;
a = c = 90°, b 6¼ 90°), triclinic (a, b, c; a, b, c 6¼ 90°), trigonal (a, a, a; a, b,
c 6¼ 90°), and hexagonal (a, a, c; a = b = 90°, c = 120°). The Bravais lattices are
simple cubic, body-centered cubic, face-centered cubic, simple tetragonal,
body-centered tetragonal, simple orthorhombic, base-centered orthorhombic,
body-centered orthorhombic, face-centered orthorhombic, simple monoclinic,
base-centered monoclinic, simple triclinic, simple trigonal, and simple hexagonal
(see Mathematica code M8). To extend group theory considerations from the
molecule (point groups; see Chap. 4) to the crystalline state (space groups),
translations need to be considered.
302 6 Crystals
We now illustrate the translation vectors v1, v2, and v3 for a few crystal struc-
tures. Starting with the simple cubic structure, its translation vectors are given by
v1 ¼ a ^x v2 ¼ a ^y v3 ¼ a ^z ð6:23Þ
as suggested by the symmetry of the structure (Fig. 6.23, E5). Polonium provides a
rare example of the simple cubic structure.
We now consider the body centered cubic structure, whose translation vectors
are given by
a
v1 ¼ a ^x v2 ¼ a ^y v3 ¼ ð^x þ ^y þ ^zÞ ð6:24Þ
2
where v3 is along the cube main diagonal (Fig. 6.24, E6), and the projection of v3
along, ^
x; ^
y, and ^z has length a/2. The alkali metal cesium is an example of the
body-centered cubic structure.
The translation vectors for the face-centered cubic structure are given by
a a a
v1 = (^y þ ^z) v2 ¼ (^x þ ^z) v3 ¼ (^x þ ^y) ð6:25Þ
2 2 2
Fig. 6.23 Translation vectors for the simple cubic structure. Figure obtained with Mathematica
Fig. 6.24 Translation vectors for the body-centered cubic structure. Figure obtained with
Mathematica
6.2 Translation Symmetries 303
Fig. 6.25 Translation vectors for the face-centered cubic structure. Figure obtained with
Mathematica
6.3.1 Metals
Consider now an ionic crystal of general formula XmYn. The energy of formation of
one mole of XmYn from infinitely separated ions,
is called the lattice energy and is invariably negative (energy released by the
system). The lattice energy (electrostatic energy of attraction and repulsion between
ions in one mole of XmYn) plus the energy term associated with the volume change
at constant pressure (the ions Xn+ and Ym− are in the gaseous state) is called the
lattice enthalpy. This energy is usually determined by the Born–Haber cycle, a
multistep thermochemical cycle in which the enthalpy of one of the steps is the
lattice enthalpy due to German scientists, Max Born (1882–1950; Nobel Prize in
physics in 1954 for his research achievements on the statistical interpretation of
quantum mechanics) and Fritz Haber (1868–1934; Nobel prize in chemistry in
1918, for his research on the synthesis of ammonia). When the lattice energy is not
Na Mg Al
K Ca Sc Ti V Cr Mn Fe Co Ni Cu Zn Ga
Rb Sr Y Zr Nb Mo Tc Ru Rh Pd Ag Cd In Sn
Cs Ba Lu Hf Ta W Re Os Ir Pt Au Hg Tl Pb
Fig. 6.27 Distribution of body-centered cubic, cubic close-packed, and hexagonal close-packed
structures on the periodic table of elements
6.3 Crystal Structures 305
Cesium chloride is formed by equal amounts of cesium and chloride ions. Its crystal
structure results from merging two simple cubic lattices, one for the Cs+ ions, the
other for the Cl− ions. Its lattice energy is about −670 kJ mol−1 (Handbook of
Chemistry and Physics, 2011). If one Cl− is at the origin, the translation vectors for
the chloride ion lattice are given by (6.23), and the vector that translates this lattice
to produce the cesium ion lattice is given by
a
t ¼ (^x þ ^y þ ^zÞ ð6:29Þ
2
whose magnitude is equal to half of the main diagonal of a cube with chloride ions
at the vertices. The Mathematica code M9 builds the cesium chloride structure with
the Cs+ and Cl− ions being represented by spheres (Fig. 6.28).
Sodium chloride is formed by equal amounts of sodium and chloride ions. Its
crystal structure results from merging two face-centered cubic lattices, one for Na+
ions, the other for Cl− ions (Fig. 6.29). Its lattice energy is about −767 kJ mol−1
(Handbook of Chemistry and Physics, 2011). If one Cl− is at the origin, the
translation vectors for the chloride ion lattice are given by (6.25), and the vector that
translates this lattice to generate the sodium lattice is given by
a
t ¼ ^z ð6:30Þ
2
The Mathematica code M10 builds the sodium chloride structure, with the Na+
and Cl− ions represented by spheres (Fig. 6.29).
Fig. 6.28 The cesium chloride structure with the Cs+ and Cl− ions represented by spheres (Cs+
light gray spheres, Cl− gray spheres). Figure obtained with Mathematica
306 6 Crystals
Fig. 6.29 The sodium chloride structure with the Na+ and Cl− ions represented by spheres (Na+
light gray spheres, Cl− gray spheres). Figure obtained with Mathematica
In diamond, each carbon atom is covalently bonded to four carbon atoms on the
vertices of regular tetrahedra. The diamond lattice consists of two face-centered
cubic lattices that interpenetrate each other. The translation vectors for one of these
lattices is given by
n1 v1 þ n2 v2 þ n3 v3 ð6:31Þ
where v1, v2, and v3 are given by (6.25), and (n1, n2, n3) are integers (which can be
positive, negative, or zero). The translation vectors for the other lattice of diamond
is given by
n1 v1 þ n2 v2 þ n3 v3 þ t ð6:32Þ
where t is given by
a
t ¼ (^x þ ^y þ ^z) ð6:33Þ
4
The magnitude of t is one-fourth that of the cube’s main diagonal. For the sake
of clarity, the spheres that schematically represent carbon atoms in Fig. 6.30 have a
radius much smaller than the covalent radius of the carbon atom, and darker and
lighter spheres represent carbon atoms belonging to distinct lattices.
Zinc blende and sphalerite are names for a mineral whose composition is mostly
zinc sulfide with a variable percentage of zinc being replaced by iron (II). The iron
presence gives a dark and glossy appearance to zinc blende, similar to that of
galena, an iron (II) sulfide. The likely confusion of zinc blende with galena is at the
origin of the word sphalerite, from the Greek word sphaleron, meaning misleading.
The word blende gives the same idea, but the origin of blende is German (blen-
den = to blind, also to deceive). In this text, zinc blende is used to refer to one of the
6.3 Crystal Structures 307
crystalline forms of zinc sulfide, ZnS, consisting of two merged face-centered cubic
lattices, one for the sulfur atoms, the other for the zinc atoms.
Figure 6.31 shows two distinct main cubes that schematically represent the same
zinc blende crystal. The cube on the right results from the one on the left by shifting
the latter by the vector t. This vector is directed along the diagonal of the cube, and
its length is one-fourth that of the main diagonal [see (6.33) and Fig. 6.31]. The
translation of the atoms of the main cube on the left by the vector t leads to the zinc
blende structure on the right of Fig. 6.31. One of these main cubes (the one on the
left) shows sulfur atoms represented by light gray spheres on the vertices and face
centers and four zinc atoms (gray spheres) totally contained in the repeating unit. In
turn, the cube on the right shows zinc atoms on the vertices and face centers and
four sulfur atoms totally contained in the repeating unit.
Fig. 6.31 Zinc blende structure with an illustration of the vector t in the left-hand drawing:
light-gray spheres represent sulfur atoms, and gray spheres represent zinc atoms. Left- and
right-hand drawings represent two main cubes of the same crystal. Figure obtained with
Mathematica
308 6 Crystals
If we divide each main cube into eight smaller and equal cubes, we conclude that
only four of these are occupied by ZnS4 tetrahedra on the left of Fig. 6.31, and by
SZn4 tetrahedra on the right of this figure. On the whole, each of the main cubes
corresponds to the formula Zn4S4, since there are 8 (1/8) + 6 (1/2) = 4 atoms
of one of the elements (sulfur or zinc) on the vertices and centers of the faces and
four atoms of the other element totally (zinc or sulfur) included in the repeating
unit. If carbon atoms substitute zinc and sulfur atoms and the lattice is correctly
adjusted to make all bond lengths equal, one obtains the diamond structure.
Atoms and monatomic ions have dimensions of order 1 Å (= 10−10 m), and elec-
tromagnetic radiation with wavelength of this order of magnitude consists of
X-rays. In 1912, Max von Laue (1879–1960, Nobel Prize in physics 1914) showed
that crystals diffract X-rays and at the same time, William Henry Bragg and William
Lawrence Bragg, father and son, solved several crystalline structures using X-ray
diffraction and shared the Nobel Prize in physics in 1915. One of the first structures
to be determined was that of sodium chloride. Nowadays, X-ray diffraction is the
most widely used method for solving the structure of crystals, in particular that of
macromolecules of biological importance such as proteins.
After being irradiated with X-ray waves, each atom acts like a secondary source
of X-rays. When an X-ray beam with defined wavelength impinges on a crystal,
multiple X-ray beams with the same wavelength emerge that can leave their imprint
(diffraction pattern) on an X-ray sensitive support (Fig. 6.32). As found by W.H.
and W.L. Bragg, diffracted X-rays beams result from reflection of X-rays by parallel
planes of the same atoms or ions in a crystal (Fig. 6.33). The condition for con-
structive interference requires that the path difference between X-ray waves
reflected by successive planes of atoms be a multiple of the wavelength,
2d sin h = nk ð6:34Þ
where d is the spacing between adjacent planes of atoms, h the angle of incidence,
n the diffraction order, and k the X-ray wavelength.
Each set of parallel planes of the same atoms or ions in the crystal acts like one
three-dimensional diffraction grating. The diffracted X-ray intensity is mathemati-
cally related to the electronic density of the atoms by Fourier transforms. Solving a
diffraction pattern consists in determining the crystalline structure that leads to the
obtained diffraction pattern. This is a complex and time-consuming task that
requires the expertise of an X-ray crystallographer, sophisticated software, and
adequate computational resources.
In order to clarify the relationship between the diffraction pattern and the
crystalline structure on purely qualitative and empirical grounds, a change of scale
can be applied to an X-ray diffraction experiment, with the X-ray beam being
replaced by a laser beam in the visible, and the 3D crystal being substituted by
ordered patterns printed on 2D projection slides. Experiments like this are called
optical transforms of the X-ray diffraction experiment (Lisensky et al. 1991).
Optical transforms correspond to a change of scale by a factor of order 104 on
the X-rays’ wavelength (kX-ray 10−10 m, klaser 10−6 m) and a factor of order
106 on the motif dimensions (an atom diameter 10−10 m, a motif on the slide
10−4 m). The diffraction pattern of an optical transform is obtained on a screen
about 20 m from the slide, which is about 20 cm from the laser. For the slide
perpendicular to the incident laser beam, the condition for constructive interference
(Fraunhofer diffraction, Fig. 6.34) is given by
d sin h = nk ð6:35Þ
At such a distance between slide and screen, the diffraction pattern is about 1°
wide (1° = 0.0175 radians) and sinh h. Considering first-order diffraction
(n = 1), (6.35) can be replaced by
dh k ð6:36Þ
in the optical transform. This expression allows one to correlate changes in the
ordered pattern on the slide (changes in d) with changes produced in the optical
transform (changes in h). For the same k, d and h are inversely related.
The method of optical transforms has been used to show that B-DNA structural
parameters such as the spacing between base pairs and the radius of the vertical
section of the backbone structure can be obtained from the optical transform of the
B-DNA X-ray diffraction pattern (Lucas et al. 1999).
Electrons in atoms and molecules have discrete energies and are said to occupy
defined energy levels, since such can be confirmed by spectroscopic methods.
However, in a solid, the proximity between an atom and its neighbors leads to
significant overlap between electronic wave functions and to electron delocaliza-
tion, resulting in large numbers of energy levels with closely spaced energies,
which in practical terms, form a continuum of energy levels, or band.
Consider a planar hydrocarbon with CC bonds of uniform bond distance and
uniform bond order (1.5) and use the Hückel empirical method to calculate the
energy levels of the system of p electrons as the number of carbon atoms increases
(McWeeny 1979). In this model system, each carbon atom has one electron in one
p orbital perpendicular to the backbone of carbon atoms. The overlap between
adjacent p orbitals leads to p electron delocalization over the entire carbon back-
bone (Fig. 6.35).
In its original form, Hückel’s method was conceived to deal with delocalized p
electron systems and uses two parameters, a and b, which represent the ionization
energy of a carbon atom and the interaction energy between pp orbitals of adjacent
Fig. 6.35 The p electron system of a linear polyene hydrocarbon can inspire a model for a
unidimensional crystal. Figure obtained with Mathematica
6.5 Electrons in Solids 311
carbon atoms, respectively. The parameters a and b are negative quantities. For a
linear polyene, the k molecular orbital energy calculated by Hückel’s method is
given by
p
ek = a + 2b cos k ð6:37Þ
N þ1
where N is the number of carbon atoms. For k = 1 (the lowest energy level) and
N ! ∞,
lim e1 ¼ a þ 2b ð6:38Þ
N!1
lim eN ¼ a 2b ð6:39Þ
N!1
Making use of (6.37), we see that the energy difference between consecutive
levels is given by
pðk þ 1Þ pk
De = ek þ 1 ek ¼ 2b cos cos
N þ1 N þ1
p ð2k þ 1Þp
¼ 4b sin sin ð6:40Þ
2ðN þ 1Þ 2ðN þ 1Þ
(E9).
With a linear polyene with N ! ∞ taken as a model for a unidimensional
crystal, the energy levels are limited by the lowest and highest energies [(6.38) and
(6.39)] that do not depend on k. In addition, from (6.40), it can be concluded that
the energy difference between consecutive levels decreases as N increases. There-
fore, for N ! ∞, the energy difference (6.40) tends to zero, and the energy levels
are replaced by a continuum of levels called a band (Fig. 6.36). The Mathematica
code M11 draws this figure.
Fig. 6.36 Energy levels for the p electron delocalized system of a linear polyene, calculated by
Hückel’s method, as the number of carbon atoms increases from 10 to 100, in steps of 10.
Figure obtained with Mathematica
312 6 Crystals
6.6 Semiconductors
1 dq
a¼ ð6:41Þ
q dT
with units K−1 or °C−1. Typical values for the resistivity of metals are of order of
magnitude 10−8 X m, and they increase with temperature. In particular, the resis-
tivities of iron, beryllium, gold, and copper increase almost linearly with temper-
ature, except at very low temperatures, where the increase is exponential
(Fig. 6.37). Solids with resistivity of this order of magnitude that increase with
temperature are called metallic conductors. With high values of electrical resis-
tivity, we find materials called electrical insulators, such as glasses with electrical
resistivities 1010–1014 X m and polytetrafluoroethylene (PTFE, trademark Teflon)
with resistivities 1022–1024 X m. Materials with intermediate values of electrical
resistivities (10−5–10 X m) are called semiconductors. Semiconductors have the
Fig. 6.37 Resistivity as a function of temperature for iron, beryllium, gold, and copper. Data
values taken from Handbook of Chemistry and Physics, 2011 and figure obtained with
Mathematica
6.6 Semiconductors 313
electric current transported by electrons, as occurs with metals, but unlike metals,
they exhibit an electrical resistivity that decreases exponentially with temperature
when they are extremely pure, and the resistivity may even decrease by several
orders of magnitude when they are doped with trace quantities of an electrical active
material.
The valence band of insulators is fully occupied with valence electrons, and the
band gap between the valence band and the lowest energy unoccupied band is far
greater than the energy of the medium (energy associated with the thermal motion
of atoms, sometimes referred to as thermal energy as its name suggests, it is zero at
T = 0 K). Thus, electrons of the highest energy in the valence band, that is, at the
band top, do not have unoccupied neighboring levels available to be excited.
Materials with this band structure are insulators.
Unlike the valence band of electrical insulators, the valence band of metallic
conductors is not fully occupied, and a temperature increase causes more collisions
between excited electrons and atoms, thus leading to an increased resistivity. The
mean number of electrons in each energy level is given by the Fermi–Dirac
distribution, a variant of the Boltzmann distribution that takes into account the
Pauli exclusion principle. At a defined temperature T, i.e., at thermal equilibrium,
the mean number of electrons in energy level e is given by
1
/ðeÞ ¼ ð6:42Þ
1 þ exp ½ðe lÞ = ðkB TÞ
where kB is the Boltzmann constant and l is such that when e = l, then / = 1/2,
i.e., 50 % of the energy level with energy l is occupied. The inverse function of
(6.42) that gives the energy e as a function of the population of electrons / is
1
e ¼ l þ kB T ln 1 ð6:43Þ
/
This function is plotted in Fig. 6.38 for sodium (l = 3.24 eV), at 100, 1200,
4000, and 10,000 K (see Mathematica code M12, E10).
Semiconductors have a band structure qualitatively similar to that of electrical
insulators, with their valence band being fully occupied. However, unlike insula-
tors, the band gap for semiconductors is small and can be overcome by thermal
excitation of electrons at temperatures greater than a minimum value defined by the
band gap [Fig. 6.39a]. The excitation of electrons from the valence band leaves the
electron holes behind. In the presence of an electric field, excited electrons and
electron holes move in opposite directions. In fact, an electron moves in the
opposite direction to the applied electric field, whereas an electron hole moves in
the direction of the applied electric field, thus behaving like a positive charge. In a
semiconductor, both the electrons of the conduction band and the holes in the
valence band contribute to the electric current.
314 6 Crystals
Fig. 6.38 The Fermi–Dirac distribution for sodium (l = 3.24 eV) at T = 100, 1200, 4000, and
10,000 K shown as energy as a function of electrons population. Graph obtained with
Mathematica
Consider now a silicon crystal and the substitution of a few of its atoms by
arsenic in a mixing proportion of about one arsenic atom per one million atoms of
silicon (doping of silicon with arsenic). Arsenic has one more valence electron than
silicon. Thus, one arsenic atom is equivalent to a positive ion, isoelectronic with a
silicon atom, plus one electron added to the silicon lattice. At such a small mixing
proportion, it is unlikely that the crystalline network of silicon will be significantly
modified or disrupted, an undesirable effect that would cause an abrupt increase in
electrical resistivity. The addition of one electron per arsenic atom to the silicon
lattice results in a small donor band, just below the unoccupied band [Fig. 6.39b],
with an energy gap smaller than the medium thermal energy Therefore, thermal
excitation of electrons to the unoccupied band becomes possible, and the electrical
resistivity decreases with temperature (n-type semiconductor).
If the silicon crystal is doped with boron, whose atoms have one valence electron
fewer than silicon, then each boron atom is equivalent to a negative ion, isoelectronic
with silicon, plus one electron hole added to the silicon lattice that acts like a positive
charge. Formation of electron holes leads to charge separation (electrons and holes
move in opposite directions). Since energy is required to separate charges, the
acceptor band forms just above the fully occupied valence band [Fig. 6.39b].
6.6 Semiconductors 315
Fig. 6.39 Valence bands for semiconductors (schematic). a Valence band for pure semiconductor
at T = 0 K and T > 0 K. b Valence bands for n-type and p-type semiconductors
Therefore, electrons from the valence band are excited to the acceptor band using the
medium thermal energy and combine with holes in the acceptor band, causing a
decrease of electrical resistivity with temperature (p-type semiconductor).
Mathematica Codes
The following Mathematica code builds disk lattices for square packing, whereby
each disk is tangent to four neighboring disks, and hexagonal packing, whereby
each disk is tangent to six neighboring disks. The Mathematica graphics primitive
for a disk is
Disk[{x,y},r]
where x and y are the Cartesian coordinates of the center of a disk with radius r. For a
square lattice of disks with radius 1/2, each of the Cartesian coordinates of the disk
centers varies in one-unit increments, and so the array of coordinates are given by
Table[Disk[{i,j},1/2],{i,3},{j,3}]
where i and j correspond to x and y, respectively. Hence, a fixed x and variable y
correspond to a row of disks, whereas a fixed y and variable x give a column of
316 6 Crystals
disks. Note that the outermost j of the Table function changes more rapidly than
the innermost i. This means that the disks are organized by rows, and the coor-
dinates of the disk centers in the square packing are given by
Table[{i,j},{i,3},{j,3}]
EdgeForm[Black] is a graphics directive that specifies that the edge of the
disks should be drawn in Black. In turn, the graphics directive LightGray
indicates that the disks should be filled in this color.
For a hexagonal close packing columns with odd j have x=i, and columns with
even j have x=i+1/2. In the Mathematica code, we use Boole[EvenQ[j]],
which gives the result 0 if EvenQ[j] is False (j is odd) and produces 1 if
EvenQ[j] is True (j is even). In addition, by inspection of the triangle of disks
in Fig. 6.1, it can be found that y=(√3/2)j. Hence, the hexagonal close packing
of disks can be obtained with
Table[Disk[{i+Boole[EvenQ[j]]/2,Sqrt[3]/2 j},1/2],{i,3},{j,3}]
and the coordinates of disk centers are given by
Table[{i+Boole[EvenQ[j]]/2,Sqrt[3]/2 j},{i,3},{j,3}]
Mathematica Codes 317
The close packing of disks in a plane shows that each disk is tangent to six disks,
whose centers are the vertices of a regular hexagon. The following Mathematica
code presents a hexagonal arrangement of disks around a central disk, by taking
advantage of its sixfold rotational symmetry axis. The Mathematica code that leads
to this arrangement is
Graphics[{EdgeForm[Black],Lightgray,Disk[{0,0},1/2],
Table[{EdgeForm[Black],Lightgray,Disk[{Cos[2pi/6],
Sin[2pi/6],1/2]},{i,0,5}]
{{0,0},Table[{Cos[2pi/6],Sin[2pi/6]},{i,0,5}]
318 6 Crystals
In the picture on the right, this Mathematica code considers two close-packed
layers of identical disks, where the second layer is displaced so that its disk centers
coincide with the interstices of the first layer. The translation vector that shifts the
second layer of disks with respect to the first layer is shown in Fig. 6.4, where the
coordinates of its origin and its tip at the center of the interstice are (1, √3/2) and
(3/2, 2/√3), respectively. Therefore, the translation vector is given by
pffiffiffi
1 3
t ¼ ^x þ ^y
2 6
where x, y, and z are the Cartesian coordinates of the center of a sphere with
radius r. In order to determine the z coordinate of a second horizontal layer of
close-packed spheres, we begin by considering four equivalent and mutually tan-
gent spheres of radius 1/2 whose centers form a regular tetrahedron inside a cube
(see Fig. 6.6). The distance between center 4 and the plane 1-2-3 is given by √2/√3,
as can be shown by simple trigonometry. Sphere 4 is nested on the interstice of
spheres 1, 2, and 3. Therefore, √2/√3 is the vertical distance between two horizontal
close-packed layers of spheres of radius 1/2. The vector that goes from center 1 to
center 4 is equal to the sum of vectors (4-c) and (c-1) and is given by
pffiffiffi rffiffiffi
1 3 2
t ¼ ^x þ ^y þ ^z
2 6 3
The figure below on the right shows a regular tetrahedron formed by four
mutually tangent spheres. The Mathematica line of code that leads to the
arrangement of three spheres on the left takes advantage of the threefold rotational
symmetry axis and is given by
Graphics3D[Table[{Gray,Sphere[{f*Sin[2pi/3],
f*Cos[2pi/3],0.0,1/2]},{i,0,2}]
where
f=1/Sqrt[3]
and
f*Sin[2pi/3]=x and f*Cos[2pi/3]=y
(see Fig. 6). Viewpoint!{0,0,Infinity} specifies a plan view from above,
supposedly from infinite distance.
320 6 Crystals
The following Mathematica code draws a figure for the hexagonal close packing of
disks and rigid spheres with equal radius:
The following Mathematica code uses (6.12) and (6.13) to build a square point
lattice to which is applied a p/4 anticlockwise rotation (RotationMatrix rotates
2D vectors anticlockwise). Each point on the lattice corresponds to a defined pair of
integers {n1,n2}. Both arrays of points represent the same lattice, since the
translation vectors w1 and w2 result from v1 and v2 by a p/4 counterclockwise
rotation [see (6.15)]. The Mathematica command
Mathematica Codes 321
The following Mathematica code builds the cesium chloride structure with the Cs+
and Cl− ions represented by spheres. If one Cl− is at the origin, the translation
vectors for the chloride ion lattice are given by (6.23) (Fig. 6.23) (simple cubic
lattice), and the vector that translates this lattice to produce the cesium ion lattice is
given by (6.29). The code forms two lattices, one for Cs+, the other for Cl−. One of
the most important Mathematica commands in this code is Cases, which gives a
list of elements that match a pattern. The output of Cases is a list of True and
False values. Another important command is Pick, which selects elements of a
list of True and False values. To illustrate the use of these two Mathematica
commands, we consider the following line,
Cases[{{-4.12},{-6.18},{2.06},{6.18}},{x_}!x<5&&x>-5]
whose output is
{True,False,True,False}
and
Pick[{aaa,bbb,ccc,ddd},{True,False,True,False}]
whose output is
{aaa,ccc}
Mathematica Codes 325
Note that the above two lines of code can be condensed into the following single
line:
Pick[{aaa,bbb,ccc,ddd},Cases[{{-4.12},{-6.18},{2.06},
{6.18}},{x_}!x<5&&x>-5]]
The following Mathematica code uses Pick and Cases combined in the same
lines of code to define latticeCs and latticeCl:
This Mathematica code builds the sodium chloride structure with the Na+ and Cl−
ions represented by spheres. This structure results from the interpenetration of two
face-centered cubic structures, one for the chloride ions, the other for the sodium
ions. The translation vectors for the face-centered cubic structure are given by
(6.25). If one Cl− is at the origin, the vector that translates the lattice of Cl− ions to
generate the sodium ion lattice is given by (6.30):
326 6 Crystals
This Mathematica code draws energy levels for the p electron delocalized system of
a linear polyene, calculated by Hückel’s method [see (6.37)], as the number of
carbon atoms increases from 10 to 100, in steps of 10:
Mathematica Codes 327
The following Mathematica code plots the Fermi–Dirac distribution (6.42) for
sodium (l = 3.24 eV) at T = 100, 4000, and 10,000 K, showing the electron
population as a function of energy:
Glossary
Exercises
E1. Change the numerical values of Fig. 6.6 so that the new values refer to
identical spheres of radius r.
Exercises 329
E2. Use Mathematica to draw three tangent disks of radius 1/2 whose centers form
an equilateral triangle and print coordinates of the disks’ centers. Discuss the
results.
E3. Calculate the packing density for the hexagonal close-packed structure.
E4. Consider a two-dimensional structure formed by the merging of two Bravais
point lattices of black and gray points, where each black point is the center of an
equilateral triangle of gray points with the vertex pointing downward and each
gray point is the center of an equilateral triangle of black points with the vertex
pointing upward. Find the translation vector for converting one Bravais lattice
into the other, and use Mathematica to represent the whole two-dimensional
structure (black and gray points).
E5. Use Mathematica to represent a cube of spheres in the sc structure. Assume
r = 1/2 and take the lattice constant a so that the spheres become tangent along
the cube edge.
E6. Use Mathematica to represent a cube of spheres in the bcc structure.
Assume r = 1/2 and take the lattice constant a so that the spheres become
tangent along the cube main diagonal.
E7. Use Mathematica to represent a cube of spheres in the fcc structure. Assume
r = 1/2 and take the lattice constant a so that the obtained structure is cubic
close-packed.
E8. Use Mathematica to represent a hexagonal prism of spheres in the hcp
structure. Assume r = 1/2 and take the lattice constants a and c so that the
obtained structure is close-packed.
E9. Use Mathematica to arrive at the last equality of (6.40).
E10. Consider the Fermi–Dirac distribution function for metallic sodium at
7000 K. Calculate the energy in eV of the level whose occupation fraction is equal
to 0.25.
References
Kapustinskii AF (1956) Lattice energy of ionic crystals. Q Rev Chem Soc 10:283–294
Lisensky GC, Kelly TF, Neu DR, Ellis AB (1991) Simulating diffraction experiments in
introductory courses. J Chem Educ 68:91–96
Lucas AA, Lambin Ph, Mairesse R, Mathot M (1999) Revealing the backbone structure of B-DNA
from laser optical simulations of its X-ray diffraction diagram. J Chem Educ 76:378–383
McWeeny R (ed) (1979) Coulson’s Valence, 3rd edn. Oxford University Press, Oxford
Further Reading
Ashcroft NW, Mermin ND (1976) Solid state physics. Brooks/Cole Publishers Co.
Atkins PW (1998) Physical chemistry, 6th edn. W. H. Freeman and Company
Smart, L. and Gagan, M., editors, The Third Dimension, The Open University, Series The
Molecular World, 2002
Water
7
Abstract
Water determines life on earth to an extent greater than any other substance. We
know it in three physical states: as a liquid, in the oceans, seas, lakes, and rivers;
as a solid, in lakes covered with ice, in glaciers, and in the North and South
Poles; as a vapor, in the Earth’s atmosphere. Despite having a simple molecular
formula, water has unique structural and thermodynamic properties: it is denser
in the liquid than in the solid states (ice floats in liquid water), and it melts when
ice is subject to high pressure. In addition, the specific heat capacity of liquid
water is about four times that of dry air, and its surface tension is about three
times that of ethanol. The computer simulation of water by molecular dynamics
requires large computational resources and is still is a topic of current research.
In this chapter, we consider several properties of the water molecule including its
geometry, enthalpy of formation, atomic charges, dipole moment, molecular
orbitals, molecular vibrations, and hydrogen bond. The remaining sections of
this chapter take the standpoint of thermodynamics and consider normal ice,
liquid water, the phase diagram of water, water as a solvent, simple nonpolar
solutes, ionic solutions, amphipathic molecules, acids and bases, and standard
electrode potentials. The Mathematica codes presented at the end of the chapter
(Representation of the Water Molecule, Electrostatic Potential Contours for a
Dipole, Interactive Manipulation of Charge, Vector Field Streamlines, Luzar’s
Model, Micelle-Monomers Equilibrium, Critical Micelle Concentration, Weak
Acid HA) are accompanied with detailed explanations and provide insight into
the corresponding subjects. Following the Mathematica codes, we present a
glossary of important scientific and technical terms on water and aqueous
solutions and a list of exercises, whose complete answers can be found after the
Appendix.
The experimental values of the OH bond length and the HOH bond angle of a water
molecule are ROH = 0.958 Å and aHOH = 104.4776° (CCCBDB), showing that H2O
is a symmetric nonlinear triatomic molecule (Fig. 7.1; see Mathematica code
M1). The geometry optimization of a water molecule carried out by the system of
programs Gaussian 09 at the B3LYP level of calculation with the basis set cc-pVTZ
gives ROH = 0.9611 Å and aHOH = 104.5296° (Fig. 7.1).
Figure 7.2 shows the energy profiles for the scans of the bond length ROH when
aHOH is 104.53°, and of the bond angle aHOH when ROH is 0.961 Å. Note that the
HOH bond angle differs from the tetrahedral angle (109.47°) by approximately 5°, a
value much smaller than the HOH bond angle difference with respect to the right
angle, which is about 19°.
0.961 Å
(0.958 Å)
O
104.53o
(104.48o)
Fig. 7.1 Ball and bond representation of the H2O molecule with calculated and experimental
(CCCBDB) values of the geometric variables ROH and aHOH (experimental values are shown in
parentheses). Molecular representation obtained with GaussView
Fig. 7.2 At left, scan of the bond length ROH when aHOH is equal to 104.53°. At right, scan of the
bond angle aHOH with ROH equal to 0.961 Å. Note that the energy scales of the above plots are
quite different. Graphs obtained with Mathematica
7.2 Enthalpy of Formation 333
For a gas phase water molecule, the experimental value of the enthalpy of formation
at 298.15 K is −241.81 kJ mol−1 (CCCBDB). Computing a molecular energy with
chemical accuracy means attaining an accuracy of 1 kcal mol−1 (4.2 kJ mol−1).
Since the largest errors in ab initio calculations result from basis set truncation,
achieving chemical accuracy requires a series of calculations that extrapolate to the
complete basis set limit. This type of calculation was carried out by the system of
programs Gaussian 09 using the composite method W1BD (Martin and de Oliveira
1999; Parthiban and Martin 2001), where W stands for Weizmann Institute and BD
for Brueckner Doubles (Brueckner orbitals). Within W1BD, the geometry opti-
mization is carried out by a B3LYP/cc-pVTZ calculation.
The Schrödinger equation is obtained within the Born–Oppenheimer approxi-
mation, and the electron wave function and its corresponding energy Uelec depend
parametrically on the nuclear coordinates. The energy Uelec is evaluated at 0 K. In
order to convert this energy from 0 to 298.15 K, the thermal energy given by the
sum of the translation, rotation, and vibration contributions (Utrans + Urot + Uvib) at
298.15 K needs to be added to Uelec. Gaussian evaluates the thermal energy and the
enthalpy at 298.15 K (the enthalpy H is defined as U + pV, with the product pV
being equal to RT for a mole of an ideal gas). In order to obtain the enthalpy of
formation of H2O(g), the same composite calculation method has to be applied to
H2O, H2, and O2, since the chemical reaction for the formation of one mole of water
in the gas phase is given by H2(g) + (1/2) O2(g) ! H2O(g). The calculated value of
DHof at 298.15 K is −245.23 kJ mol−1, a value that differs from the experimental
value by less than 1 kcal mol−1.
The electrostatic potential for the H2O molecule mapped with rainbow colors over a
total density surface with isovalue equal to 0.00040 (Fig. 5.4) shows that the hy-
drogen atoms of the water molecule are strongly electrophilic and the oxygen atom
is strongly nucleophilic.
The electrostatic potential of a molecule can be used to determine its distribution
of atomic charges. To this end, the molecular electrostatic potential / and the
electrostatic potential η that results from a set of point charges qa located on the
molecule’s nuclei are evaluated on the nodes i of a grid. For a neutral molecule like
H2O the sum of all the atomic charges is zero, and so the charge centered on the
oxygen nucleus is −2 times the charge on each of the hydrogen atoms, qO = −2 qH.
Hence, in atomic units, η is given by
X qa 2qH qH qH
gi ¼ ¼ þ þ ð7:1Þ
a
rai rOi rH1 i rH2 i
334 7 Water
where i represents a grid node. By the least squares optimization procedure, the
atomic charge qH can be determined by minimizing
X
HðqH ; rOi ; rH1 i ; rH2 i Þ ¼ ðgi /i Þ2 ð7:2Þ
i
@H
¼0 ð7:3Þ
@qH
The following table shows the molecular electrostatic potential (MEP) atomic
charges and the Mulliken atomic charges for H2O obtained from B3LYP calcula-
tions with different basis sets. It is shown that the introduction of polarized func-
tions on oxygen and hydrogen atoms significantly improves the results, as can be
expected. In addition, molecular electrostatic atomic charges obtained from large
basis sets with diffuse functions give the same atomic charges. Note that Mulliken
charges vary with the basis sets somewhat erratically.
MEP Mulliken
Method/basis set qO qH qO qH
B3LYP/6-31G −0.897 0.448 −0.718 0.359
B3LYP/6-31G(d,p) −0.737 0.368 −0.610 0.305
B3LYP/6-311G(d,p) −0.749 0.375 −0.474 0.237
B3LYP/cc-pVTZ −0.703 0.352 −0.435 0.218
B3LYP/cc-pVQZ −0.698 0.349 −0.490 0.245
B3LYP/aug-cc-pVTZ −0.680 0.340 −0.354 0.177
B3LYP/aug-cc-pVQZ −0.680 0.340 −0.587 0.294
In this section, we deal with simple charge distributions and their electrostatic
potentials. We begin by considering charge distributions that correspond to pure
electric multipoles (charge, dipole, quadrupole, and octupole) and their corre-
sponding numbers of poles expressed as powers of 2 (Fig. 7.3). Since a charge can
be either positive or negative, the number of poles is given by a power of 2.
The contours for the electrostatic potential of a charge q, in the xy-plane and
atomic units, are given by
7.4 Dipole Moment 335
Fig. 7.3 Electric multipoles (schematic) with the corresponding number of poles expressed as a
power of 2
q
/ðx; yÞ ¼ ð7:4Þ
r
where r represents the distance between the point P with coordinates (x, y, 0) and
the charge located at the origin (0, 0, 0). These contours are concentric circles
whose center is the point charge q (Fig. 7.4), since the electrostatic potential pro-
duced by a point charge depends only on the distance to the charge. All the radial
directions emerging from the charge display the same variation of the electrostatic
potential, that is, they are equivalent directions, and the space around the central
charge is said to be isotropic.
The electrostatic potential of a dipole is
1 1
/ðx; yÞ ¼ q ð7:5Þ
r1;0 r1;0
where q represents the absolute value of the charges (the minus sign in the second
member of this expression accounts for the sign of the corresponding negative
charge), and r−1,0 and r1,0 represent the distances between the point P in the xy-
plane and the negative and positive charges with coordinates (−1, 0, 0) and (1, 0, 0),
respectively (Fig. 7.4; see Mathematica codes M2 and M3; E1, E2, E3).
Fig. 7.4 Contours in the xy-plane for electrostatic potentials produced by one charge, one dipole,
and one quadrupole. Figures obtained with Mathematica
336 7 Water
Substitution of
" 2 #1=2 " 2 #1=2
d d
rþ ¼ x þ y2 r ¼ xþ þ y2 ð7:8Þ
2 2
in (7.7) followed by Taylor expansion of the resulting function of d about the point
d = 0 gives
d2 00 d3
/ðdÞ ¼ /ð0Þ þ d/0 ð0Þ þ / ð0Þ þ /000 ð0Þ þ ð7:9Þ
2! 3!
where
qx qxð15x2 9r 2 Þ
/ð0Þ ¼ 0 /0 ð0Þ ¼ /00 ð0Þ ¼ 0 /000 ð0Þ ¼ ð7:10Þ
r3 4r 7
q d x q d3 xð5x2 3r 2 Þ
/ðdÞ ¼ þ þ Oðd5 Þ ð7:11Þ
r3 8r 7
Since the exchange of positive and negative charges changes the sign of the
electrostatic potential, this is an odd function of d [/(d) = −/(−d)], and conse-
quently, the even powers of d in (7.9) are zero. If d represents an arbitrarily small
separation between the charges of the dipole, then we can approximate /(d) to the
first term of (7.11) (the terms in d3 and O(d5) become negligible compared with the
d term) and write
l ¼ qd ð7:12Þ
where l is the magnitude of the point dipole vector. Note that x = r cosh (E4).
From the definition of electrostatic potential, (see 5.10), one can write
V ¼ qt / ð7:13Þ
where V is the potential energy, qt is the test charge, and / is the electrostatic
potential. The x and y components of the electric force are given by
@V @/ @V @/
Fx ¼ ¼ qt Fy ¼ ¼ qt ð7:14Þ
@x @x @y @y
Fx Fy
Ex ¼ Ey ¼ ð7:15Þ
qt qt
@/ @/
Ex ¼ Ey ¼ ð7:16Þ
@x @y
Note that
@/ ^ @/ ^
iþ j ¼ r/ ð7:17Þ
@x @y
is the gradient vector of the electrostatic potential in the xy-plane, and ∇ represents
the del, or nabla, operator indicative of the vector differential operator.
338 7 Water
is the velocity vector tangent to the contour at (x0, y0) (this point corresponds to
time t0). The dot product of the gradient of the electrostatic potential and the
velocity vector is given by
@/ @x @/ @y d/
r/:v ¼ þ ¼ ¼0 ð7:19Þ
@x x0 ;y0 @t t0 @y x0 ;y0 @t t0 dt t0
This dot product is zero, because the electrostatic potential does not vary on a
contour curve by the definition of contour. Hence, at (x0, y0), the gradient of the
electrostatic potential is perpendicular to the velocity vector, which in turn is tan-
gent to the contour at that point. The gradient vector points in the direction of the
greatest rate of increase of the electrostatic potential. From (7.16) and (7.17), one
concludes that the electric field vector in the xy-plane is
and so, at a defined point of the xy-plane, the electric field vector points in the
direction of the greatest decrease of the electrostatic potential. Figure 7.6 shows a
plot of the electrostatic potential contours and the electric field streamlines for a
dipole (Fig. 7.6, left) and a point dipole (Fig. 7.6, right; see Mathematica code
M4). It can be seen that the streamlines move away from the positive charge (the
source) and converge to the negative charge (the sink), since these are the direc-
tions of the greatest decrease of the electrostatic potential. In the case of the point
dipole, source and sink coincide (the streamlines move away and converge to the
point dipole).
We now consider the atomic charges for the water molecule derived from the
electrostatic potential obtained from a Gaussian 09 B3LYP/aug-cc-pVTZ calcula-
tion (see Sect. 7.3). Since the electrostatic potential is an additive function, it can be
equated to a sum of electrostatic potentials associated with nonnull electric multi-
poles and represented by the contours in the molecular plane. Figure 7.7 shows the
contours of the total electrostatic potential of H2O as an approximate sum of the
contours for the electrostatic potential of the dipole term associated with the vector
7.4 Dipole Moment 339
Fig. 7.6 Contours and streamlines show the direction of the electric field, for a dipole (left) and a
point dipole (right). Figures obtained with Mathematica
sum of the O–H bond dipoles and the electrostatic potential for a quadrupole term
(see E5). These results show that besides the dipole, the H2O molecule has also a
significant contribution from the quadrupole.
In Hartree–Fock and DFT calculations, the wave function for a closed shell
molecule is a Slater determinant of one-electron wave functions (spin-orbitals) that
are eigenfunctions of effective one-electron Hamiltonians called Fock and Kohm–
Fig. 7.7 Contours for the electrostatic potential of the H2O molecule derived from the distribution
of atomic charges (Gaussian09 B3LYP/aug-cc-pVTZ calculation) as an approximate sum of the
contours for a dipole and a quadrupole. Figures obtained with Mathematica
340 7 Water
Sham operators (E6). For a molecule, these one-electron wave functions are called
molecular orbitals and are expressed as linear combinations of basis functions
centered at each atom of the molecule.
C2v character table
E C2 rxz ryz
A1 1 1 1 1 z x2, y2, z2
A2 1 1 −1 −1 Rz xy
B1 1 −1 1 −1 x, Ry xz
B2 1 −1 −1 1 y, Rx yz
(http://symmetry.jacobs-university.de)
The H2O molecule belongs to the C2v symmetry group, whose symmetry ele-
ments are shown in Fig. 7.8. The symmetry species of this group are presented in
the corresponding character table (see above). Upon application of the C2v sym-
metry operations, the H2O molecular orbitals transform as the group symmetry
species. For this reason, the molecular orbitals are labeled by the corresponding
symmetry species. Being one-electron functions, they are labeled by lowercase
letters.
The water molecule has ten electrons that are allocated to the five molecular
orbitals with lower energy (each molecular orbital is occupied by two electrons with
antiparallel spins). The five doubly occupied molecular orbitals of H2O are shown
in Fig. 7.9 by surfaces of defined isovalues, with the corresponding energies in eV,
C2
σ xz
z
O O O
σ yz
z y
z x
Fig. 7.8 Symmetry elements for the H2O molecule. Molecular representations obtained with
GaussView
Fig. 7.9 Molecular orbitals for the H2O molecule represented by surfaces whose isovalue is equal
to 0.02000 (Gaussian 09 B3LYP/cc-pVTZ calculation)
7.5 Molecular Orbitals 341
in parentheses, and a1, b1, b2 name the molecular orbitals by their symmetry types.
The orbitals that have a common symmetry label are numbered by increasing
energy (for instance, 1a1, 2a1, 3a1). The occupied molecular orbital of lowest energy
(1a1) is essentially restricted to the 1s oxygen atom core electrons, which are not
significantly involved in the O–H bonds.
Figure 7.10 shows the natural bond orbitals BD1 and BD2 and the natural atomic
orbitals LP1 and LP2 for the H2O molecule. These natural orbitals have the fol-
lowing approximate expressions:
LP2 pO
LP1 sp0:86
O
ð7:21Þ
BD2 0:85 sp3:33
O!H2 þ 0:52 sH2
BD1 0:85 sp3:33
O!H1 þ 0:52 sH1
The natural bond orbitals BD1 and BD2 are approximate localized in the OH
bonds and are equivalent orbitals by molecular symmetry, since the OH bonds of
H2O are equivalent bonds. In turn, LP1 and LP2 are approximate localized in the
oxygen atom (lone pairs or natural atomic orbitals).
LP 2
LP 1
BD 1 BD 2
Fig. 7.10 Surfaces of isovalue equal to 0.0200 represent the natural bond orbitals BD1 and BD2
and the oxygen atom lone pairs LP1 and LP2 (Gaussian 09 B3LYP/cc-pVTZ calculation)
342 7 Water
O O O
H H H H H H
Fig. 7.11 Normal vibrational modes of H2O with the wavenumbers in cm−1 after scaling by the
factor 0.965 (CCCBDB) and the corresponding displacement vectors (Gaussian 09 B3LYP/
cc-pVTZ calculation). In the normal mode at 3767 cm−1, the displacement vector for the OH
oscillator on the right has its origin on the H atom and is directed toward the O atom, thus being
hidden by the segment that represents the OH bond
7.6 Molecular Vibrations 343
The calculated displacement vectors are shown in the above table. Note that the
x-axis is perpendicular to the molecular plane (the yz-plane) (see Fig. 7.8). Since the
displacement vectors lie in the molecular plane, all the x-components of the dis-
placement vectors are zero. The numbers highlighted in bold correspond to the
largest displacements of the hydrogen atoms, and the small displacements of the
oxygen atom move this atom so that the center of mass of the molecule stays
unshifted (a center of mass motion is a translation, not a vibration).
Displacement vectors and scaled wavenumbers for the calculated normal modes of vibration of
DOH
m/cm−1 3720 2701 1386
x y z x y z x y z
O 0.00 −0.05 0.03 0.00 0.09 −0.07 0.00 0.00 0.09
H 0.00 0.81 −0.59 0.00 0.11 −0.01 0.00 −0.55 −0.70
D 0.00 0.01 0.03 0.00 −0.80 −0.58 0.00 0.28 −0.35
Protium (1H) and deuterium (D = 2H) have the same atomic number Z = 1, and
so both are hydrogen atoms. Within the Born–Oppenheimer approximation, the
H2O and DOH molecules have the same molecular orbitals. However, the vibra-
tional frequencies depend on the reduced masses of the oscillators, and a protium by
deuterium substitution breaks the equivalence of the oxygen–hydrogen oscillators.
The reduced masses of the OH and OD oscillators in unified mass units are
approximately equal to 16 1/(16+1) = 16/17 and 16 2/(16+2) = 32/18,
respectively. Hence,
rffiffiffiffiffiffiffiffi pffiffiffi
xOH lOD
2 ð7:22Þ
xOD lOH
(E7).
The calculated scaled wavenumbers and displacement vectors for the normal
modes of DHO are shown in Fig. 7.12. The normal modes at 3720 and 2701 cm−1
have predominant contributions of the OH and OD stretching vibrations, respec-
tively (see table of atomic displacements). The large variation in the reduced mass
O O O
H D H D H D
Fig. 7.12 Normal vibrational modes of DHO with the wavenumbers in cm−1 after scaling with
the factor 0.965 (CCCBDB), and the corresponding displacement vectors (Gaussian 09 B3LYP/
cc-pVTZ calculation)
344 7 Water
Sodium and fluoride ions have the same number of electrons (10) and the same
closed-shell electron configuration, 1s22s22p6. The potential energy curve for
Na+…F– shows a deep minimum at approximately 2.0 Å that results from a balance
between repulsive and attractive interactions (Fig. 7.13). For interatomic distances
smaller than that of the minimum, the dominant interaction has its origin in the
repulsion and exchange between electrons of different atoms (exchange of electrons
of different atoms requires overlap between the electron densities of both atoms)
and corresponds to a curve that decreases with increasing internuclear distance. In
turn, the leading long-range interaction between the positive and negative charges
of Na+ and F– is electrostatic, mostly of the charge-to-charge type. This interaction
is approximately described by a −1/R curve that represents the attractive wing of the
potential well (Fig. 7.13) and tends asymptotically to zero, thus meaning that the
additive constant in the −1/R fitting is set to zero energy. At R = 6.0 Å, the
potential energy curve is well below zero, and the charge-to-charge electrostatic
7.7 Intermolecular Interactions 345
7.7.2 Induction
The sodium ion is isoelectronic with a neon atom, and both have the same number
of electrons (10) and the same electronic configuration, 1s22s22p6. Gaussian
calculations for the Na+…Ne system were carried out using the Moller–Plesset
perturbation method to fourth order of perturbation (MP4) with the aug-cc-pVTZ
basis set [MP4(sdq)/aug-cc-pVTZ calculation]. The potential energy curve for
Na+…Ne (Fig. 7.14) shows an energy minimum at 2.55 Å whose depth is more
than one order of magnitude smaller than the minimum observed for Na+…F–. The
charge of Na+ induces a dipole in the neon atom with the negative end closer to the
sodium ion. The resulting charge-induced dipole attractive interaction is called
induction.
In general, the interaction between a permanent 2n-pole and an induced 2 m-pole
varies as 1/R2(n+m+1). For a charge-induced dipole interaction, n=0 and m=1, the
distance dependence is given by 1/R4. Induction interactions are clearly nonaddi-
tive, since the multipole moments induced by a third molecule interact, being
subject to mutual change. The weak solubility of oxygen in water can be approx-
imately described by the interaction between the dipole moment of the water
molecule and the induced dipole moment in the diatomic oxygen molecule.
7.7.3 Dispersion
Helium and neon boil at 4.22 K and 27.07 K, with these boiling temperatures
corresponding to RT values approximately equal to 0.04 and 0.2 kJ mol−1. Since
these energies are well below 1 kcal mol−1 (4.2 kJ mol−1), the calculation of the
interaction energy between two rare gas atoms cannot be obtained from the dif-
ference between the total energy of the system of the interacting atoms (EA…A) and
the sum of the individual energies of each atom (2EA), since the errors of these large
numbers are far greater than the resulting energy difference. The interaction
between two rare gas atoms corresponds to a tiny perturbation, and so should be
evaluated as such by a perturbation method. To this end, Gaussian 09 calculations
for the He…He and Ne…Ne systems were carried out using the Moller–Plesset
perturbation method to fourth order of perturbation (MP4) with the aug-cc-pVTZ
basis set [MP4(sdq)/aug-cc-pVTZ calculation] (Aziz and Slaman 1991; Woon
1991). The potential energy curves for He…He and Ne…Ne systems present very
shallow minima (approximately 0.07 and 0.2 kJ mol−1) at interatomic distances
around 3.05 and 3.15 Å, respectively, with the minimum for Ne…Ne being about
nine times deeper than for He…He (see insets in the potential energy curves of
Fig. 7.15).
Isolated atoms are isotropic systems with respect to their nuclei, and all per-
manent electric moments of neutral atoms are null. Therefore, the observed minima
Fig. 7.15 Potential energy curves for He…He and Ne…Ne as functions of the internuclear
distance [Gaussian 09 MP4(sdq)/aug-ccpVTZ calculation]. Curves are fitted to the short-range
repulsive wings of the potential energy curve. Details of the minima and the curves fitted to the
attractive wings of the very shallow potential wells are shown in the insets. Energy zeros were
chosen so that the long-range fitting curves tend to zero as R ! ∞. Figures obtained with
Mathematica
348 7 Water
in the potential energy curves of He…He and Ne…Ne systems cannot derive from
interactions involving permanent electric moments. What is the origin of the
observed minima? Why is the minimum of Ne…Ne deeper than that of He…He?
Electrons try to avoid each other due to instantaneous repulsions. While the
instantaneous asymmetry in the electron distribution of a single atom averages to
zero over time, the asymmetry in the electron distribution of one atom produced by
correlation between electrons of different atoms can be described in each atom by
successive induced electric moments (induced dipole, induced quadrupole, …) and
depends on the internuclear distance. Figure 7.16 schematically illustrates how an
instantaneous dipole in the electron distribution of one atom induces a dipole in the
electron distribution of the other atom. The resulting induced dipole-induced dipole
interaction is attractive, since induced dipoles always reorient to produce attraction.
In general, the interaction between induced 2n- and 2m-poles varies as 1/R2(n+m+1).
An induced dipole-induced dipole interaction (n = 1 and m = 1) corresponds to
1/R6 dependence. The correlation between electrons of different atoms can also
result from the contributions of induced electric moments of order greater than 1,
for instance, induced dipole-induced quadrupole (1/R8) and induced quadrupole-
induced quadrupole (1/R10) interactions. The overall resulting attraction is called
dispersion. Due to correlation between electrons of different atoms, neon atoms
lead to a larger well depth than helium atoms, since the dispersion energy increases
with the number of electrons of the interacting atoms. An example of this is the fact
that chlorine is a gas, bromine a liquid, and iodine a solid.
Since the electron densities of interacting atoms or molecules are polarizable,
dispersion always occurs, that is, it is found in every molecular interaction. Its 1/R6
distance-dependence makes it approximately additive, despite being due to an
interaction between induced electric moments. For small or medium-size mole-
cules, dispersion is a weak interaction, but it becomes significant and sometimes
even decisive when the remaining interactions are null or negligible.
Dispersion interactions between alkane molecules contribute appreciably to
determining the physical state of a hydrocarbon. In fact, the magnitude of the
dispersion interactions increases with the number of electrons in the atoms located
on the molecular surface (for alkane molecules, these are hydrogen atoms). This
means that the magnitude of the dispersion interaction increases with the area of the
molecular surface and the number of hydrogen atoms on it. For example,
2,2-dimethyloctane (a branched decane) has a normal boiling point at 155 °C,
Fig. 7.16 Correlation between electrons of different atoms leads to attraction involving an
induced dipole-induced dipole interaction (schematic)
7.7 Intermolecular Interactions 349
whereas the boiling point of n-decane is 174 °C (the linear alkane has a larger
surface area than the branched alkane).
The boiling points of linear alkanes are plotted in Fig. 7.17 as a function of the
number of hydrogen atoms. Since the general formula for an alkane is CnH2n+2, an
increase of one carbon atom corresponds to an increase of two hydrogen atoms, and
linear alkanes that have four or fewer carbon atoms are gases at ambient temper-
ature. Long hydrocarbon chain molecules with a large number of hydrogen atoms in
the interfacial area between molecules have appreciable dispersion energies. This
can be inferred from their high boiling points: at ambient temperature and normal
pressure, linear alkanes with more than seven carbon atoms are liquids, and linear
alkanes with more than 17 carbon atoms are solids.
Electrostatic, induction, dispersion, and exchange-repulsion interactions are
fundamental interactions. We now consider a specific interaction in which the
hydrogen atom exerts a major role: the hydrogen bond.
Consider one hydrogen fluoride molecule interacting with one water molecule. The
potential energy curve as a function of the H…O distance (Fig. 7.18) presents an
energy minimum in which the hydrogen atom covalently bonded to the fluorine
atom interacts with the oxygen atom of the water molecule. The relaxed scan of the
H…O distance leads to an energy minimum whose depth is approximately
37 kJ mol−1 (Gaussian 09 B3LYP/aug-cc-pVTZ calculation). Acting like a bridge
between two electronegative atoms, fluorine and oxygen, the hydrogen atom
establishes a weak directional interaction with one of the lone pairs in the oxygen
atom. At the minimum, the F–H bond length (0.945 Å) is slightly longer than the
bond length of an isolated hydrogen fluoride molecule (0.924 Å), and the H…O
internuclear distance (1.692 Å) almost doubles the F–H bond length. The F–H…O
interaction is a hydrogen bond and the FH…OH2 complex can be called a
hydrogen bond heterodimer, with the HF and H2O molecules having distinct roles:
HF provides the hydrogen atom (it is called the donor), whereas H2O provides the
lone pair in the oxygen atom (it is called the acceptor).
The hydrogen chloride–water system provides another example of a hydrogen
bond heterodimer, ClH…OH2. This time, the relaxed scan of the H…O distance
leads to a minimum, whose calculated depth is approximately 20 kJ mol−1
(Gaussian 09 B3LYP/aug-cc-pVTZ calculation).
Isodensity surfaces mapped with electrostatic potentials can be used to illustrate
the hydrogen bond formation in the FH…OH2 heterodimer (Fig. 7.19). The elec-
trostatic attractive interaction between the electrophilic region of the hydrogen atom
in the hydrogen fluoride molecule and the nucleophilic region of the oxygen atom
in the water molecule leads to the formation of the hydrogen bond heterodimer. The
complementary functions of the donor and acceptor molecules are well illustrated
by this electrostatic potential color map.
Note that each hydrogen bond heterodimer corresponds to a single closed iso-
density surface, that is, the hydrogen fluoride and the water moieties do not have
distinct isodensity surfaces. This means that the H…O interaction has some partial
covalent character, which leads to the directional properties of the hydrogen bond.
The fact that the expression “hydrogen bond” proposed by M.L. Huggins in 1919
prevailed over “hydrogen bridge” is probably due to the recognition of the covalent
character of the hydrogen bond. In addition, the heterodimer exhibits stronger and
larger electrophilic regions in the hydrogen atoms of the water molecule than in the
isolated water molecule and a stronger and larger nucleophilic region in the fluoride
atom of the heterodimer than in the corresponding isolated hydrogen fluoride
molecule. These features point to the polarization of the FH…OH2 heterodimer
with respect to the individual molecules.
30
20
10
0 RHO /
2 3 4 5 6 7
7.8 Hydrogen Bond 351
-5.000e-2 5.000e-2
H
… O
F H O F
H H
Fig. 7.19 Isodensity surfaces mapped with electrostatic potentials for HF and H2O molecules,
and the hydrogen bond heterodimer FH…OH2 (Gaussian 09 B3LYP/aug-cc-pVTZ calculation)
0.329 0.345
0.329 0.345
Fig. 7.20 Atomic charges fitted to electrostatic potentials of HF, H2O, and FH…OH2 (Gaussian
09 B3LYP/aug-cc-pVTZ calculation)
Atomic charges fitted to electrostatic potentials, before and after hydrogen bond
formation, are shown in Fig. 7.20. It can be seen that the hydrogen atoms of the
water molecule are more positively charged in the hydrogen bond heterodimer than
in the isolated water molecule, and the fluoride atom is more negatively charged in
the hydrogen bond heterodimer than in the isolated hydrogen fluoride molecule.
Thus, the distribution of atomic charges reveals polarization of the donor and
acceptor molecules in the hydrogen bond heterodimer, as we already found by
inspection of the electrostatic potential map. As a result of this polarization, the
hydrogen fluoride and the water moieties are no longer neutral, due to the occur-
rence of charge transfer within the heterodimer, a fact that stresses the partial
covalent character of the hydrogen bond. Only the sum of all the atomic charges in
the whole hydrogen bond heterodimer is zero.
Consider the hydrogen bond water dimer, HOH…OH2. Figure 7.21 shows the
potential energy curve as a function of the H…O distance for the Cs configuration,
where the plane of reflection contains the HOH…O nuclei. The relaxed scan of the
H…O distance leads to an energy minimum, whose depth is approximately
352 7 Water
15
10
0 RHO /
1.5 2.0 2.5 3.0 3.5 4.0
7.8 Hydrogen Bond 353
-5.000e-2 5.000e-2
•••
Fig. 7.22 Isodensity surfaces mapped with electrostatic potentials before and after hydrogen
bond formation for the HOH…OH2 dimer (Gaussian 09 B3LYP/aug-cc-pVTZ calculation)
feature that confirms the covalent character of the hydrogen bond. In particular, the
hydrogen bond stretching at 143 cm−1 (calculated frequency 185 cm−1) is a con-
sequence of the partial covalent character in the O…H interaction.
7.9 Ice Ih
In ice each water molecule has both donor and acceptor hydrogen bond function-
alities, and it is involved in four hydrogen bonds directed to the vertices of a regular
tetrahedron occupied by the nuclei of oxygen atoms of the neighboring water
molecules. In fact, the water molecule at the center of this tetrahedral arrangement
acts as a donor in two hydrogen bonds and as an acceptor in the other two hydrogen
354 7 Water
O"
O"
O"
O"
O"
Fig. 7.24 Pentamer of water molecules in ice Ih. This structure is stable only as part of ice Ih
bonds (Fig. 7.24). Each water molecule at one vertex of the tetrahedral arrangement
also takes the role of a central water molecule in a new tetrahedron not shown in
Fig. 7.24. This tetrahedral pattern leads to the formation of hexagonal channels in
ice Ih (I stands for 1 in roman numerals, and the subscript h stands for hexagonal)
that result from the stacking of hexagonal cycles of hydrogen-bonded water
molecules in the chair configuration (Fig. 7.25). At atmospheric pressure (1 atm =
101.325 kPa), the melting point of ice Ih is 273.15 K.
More than ten different forms of ice are known in distinct p-T regions of the
phase diagram of water. Ice polymorphism (variety of hydrogen bonding archi-
tectures of water molecules in solid phase) results from the flexibility that structures
based on O–H…O hydrogen bond networks have when subject to high pressures
and very low temperatures.
The diversity of solid structures of water is not restricted to ice forms. Under
appropriate conditions of pressure and temperature, small gas molecules like CH4,
NH3, and H2S become trapped inside convex polyhedra formed by
hydrogen-bonded water molecules. A convex polyhedron has no indentations or
holes, that is, every linear segment connecting two points inside a convex poly-
hedron is entirely contained within it. The Swiss mathematician Leonhard Euler
(1707–1783) discovered in 1751 that a convex polyhedron with V vertices, E edges,
and F faces satisfies the equation V – E + F = 2. Oxygen atoms of
hydrogen-bonded water molecules are at the vertices of the polyhedra whose edges
correspond to O–H…O hydrogen bonds The sharing of faces by the convex
polyhedra of hydrogen-bonded water molecules gives rise to solid forms of water
that include small gas molecules and are called gas hydrates. Without the trapped
molecules, the network of hydrogen-bonded water molecules would collapse into
liquid water or a conventional ice form.
Large deposits of methane hydrates are found on the deep ocean floor of con-
tinental edges, at depths of hundreds of meters and pressures of hundreds of bars,
near volcanic fissures from which gases like methane can burst. Methane from
7.9 Ice Ih 355
Fig. 7.25 Left Regular tetrahedron of hydrogen-bonded water molecules in ice seen as a pyramid.
Right Ice fragment showing one hexagonal cycle of hydrogen-bonded water molecules in the chair
configuration. For clarity, some hydrogen atoms covalently bonded to oxygen atoms are omitted
methane hydrate reservoirs doubles the amount of fossil fuel in the world (see Seuss
et al. 1999). However, gas hydrates become unstable at temperatures above 0 °C
and pressures below those found in the submarine deposits where they were
formed.
In the liquid state, the hydrogen-bonded network is dynamic, less ordered than in
ice, and the hydrogen bonds around each water molecule are distorted with little
directionality. In addition, the distribution of hydrogen bonds is strongly dependent
on temperature. According to a simple model due to Luzar (see Luzar et al. 1983;
Teixeira and Luzar 1999), it is assumed that in liquid water, no water molecule can
form more than four hydrogen bonds with adjacent water molecules, that is, there
are five groups of water molecules according to the number of hydrogen bonds each
water molecule can form (4, 3, 2, 1, 0). In order to evaluate the population of each
of these groups of water molecules at a defined temperature, we need to express the
fraction of formed hydrogen bonds as a function of temperature. Neglecting
hydrogen bond cooperativeness, each pair of oxygen atoms of adjacent water
molecules can be in one of two states, one for a formed hydrogen bond, the other
for a broken hydrogen bond. According to this model, the ratio of populations of
these two states (broken over formed hydrogen bonds) is given by
ð1 phb Þ e
¼ a exp ð7:23Þ
phb RT
where phb is the probability of one hydrogen bond 1 − phb is the probability of a
broken hydrogen bond, and a and e are empirical parameters whose values a = 178
356 7 Water
and e = 13.8 kJ mol−1 better describe the experimental results (E10). Rearranging
(7.23), we obtain
1 a exp½e=ðRTÞ
phb ¼ 1 phb ¼ ð7:24Þ
1 þ a exp½e=ðRTÞ 1 þ a exp½e=ðRTÞ
The binomial distribution to obtain the fraction of water molecules in each of the
five groups (k takes the values 4, 3, 2, 1, 0) is given by
4 4!
¼ 0k4 ð7:25Þ
k k!ð4 kÞ!
Figure 7.27 shows the hydrogen bond distributions given by the Luzar model, at
-30 °C (supercooled water), 10 °C, 50 °C and 90 °C, with maxima at k = 4, 3, 2, and
1 hydrogen bonds, respectively. The Mathematica code M5 shows how the dis-
tribution of hydrogen bonds varies with temperature between 273 and 373 K, in
steps of 1 K.
Fig. 7.26 The number of distinct ways for one water molecule to form two hydrogen bonds (solid
lines) is given by 4!/(2! 2!) = 6
- 30 o C 10 o C 50 o C 90 o C
0.5 0.5 0.5 0.5
0.4 0.4 0.4 0.4
0.3 0.3 0.3 0.3
0.2 0.2 0.2 0.2
0.1 0.1 0.1 0.1
0.0 0.0 0.0 0.0
0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4
Fig. 7.27 Distribution of hydrogen bonds in water at defined temperatures, according to the
Luzar model. This figure was obtained with Mathematica
7.10 Liquid Water 357
The main limitations of the Luzar model are due to the fact that the binomial
distribution is restricted to five groups of water molecules (k takes the discrete
values 4, 3, 2, 1, and 0) and to the neglect of cooperativeness in hydrogen bond
formation (phb does not depend on k). These limitations prevent us from extracting
quantitative conclusions from the above results. Nevertheless, the plots show two
important general trends, namely, the hydrogen bond distribution in liquid water is
strongly dependent on temperature, and as temperature increases, the distribution
maximum moves to lower numbers of hydrogen bonds.
The phase diagram of water shows ice–liquid, liquid–vapor and ice–vapor curves
for biphasic equilibria and the triple point at the intersection of these three curves
(Fig. 7.28). The distinguishing feature is the negative slope of the curve for the ice–
liquid equilibrium, which appears to be infinite. Considering an isothermal trans-
formation that crosses this curve and is carried out in the direction of increasing
pressures, from ice to liquid water, it is shown that ice melts when sufficient
pressure is applied to it. Since increase of pressure always leads to increased
compactness, one concludes that liquid water is denser than ice, that is, ice fluc-
tuates on liquid water, and this is because the hexagonal channels of ice Ih produce
a less dense structure than that of liquid water.
In the liquid–vapor equilibrium, the chemical potentials of the liquid and vapor
are equal,
Fig. 7.28 Phase diagram of water for ordinary pressures (Haynes 2011). The triple point is at
T = 273.16 K and p = 0.611657 kPa. The critical point (not shown) occurs at T = 647.10 K and
p = 22064 kPa. This figure was obtained with Mathematica
358 7 Water
where T is the boiling temperature at pressure p. From (7.27) and (1.149) (this
expression gives the chemical potential of an ideal gas, an assumption that will soon
be justified), we can write
p
l0liq ðTÞ ¼ l0gas ðTÞ þ RT ln 0 ð7:28Þ
p
where
At 298.15 K, one has RT 2.5 kJ mol−1, and the difference between the
standard chemical potentials of vapor and liquid water in equilibrium is approxi-
mately 9 kJ mol−1 (2.5 3.5). Considering that the chemical potential is a
partial molar Gibbs energy and the water vapor at such a low pressure
(p = 3.1699 kPa) can be considered an ideal gas, (7.29) provides an estimate of the
Gibbs energy necessary for vaporizing one mole of liquid water at 298.15 K.
The liquid–gas equilibrium curve has a terminal point called the critical point
(not shown in Fig. 7.29) at T = 647.10 K and p = 22,064 kPa. Beyond the tem-
perature of the critical point, the vapor cannot be liquefied by pressure increase. The
critical point can be approached through a transformation that follows the liquid–
Fig. 7.29 Starting with the liquid–vapor equilibrium, an increase of temperature at constant
volume leads the system to the critical point and beyond
7.11 Phase Diagram 359
vapor curve and is carried out at constant volume on a closed system consisting of
liquid water in equilibrium with its vapor. As temperature increases, the pressure
and the density of the vapor increase, while the density of the liquid decreases (the
average density of liquid and vapor inside the sample is constant, since both the
total mass of water and the volume are fixed). When approaching the critical point,
the interfacial surface that distinguishes liquid from vapor gradually disappears, and
an extremely dense vapor is formed called supercritical fluid (Fig. 7.29).
For a solution of one solute, it is possible to group the general types of inter-
molecular interactions as solute–solvent, solvent–solvent, and solute–solute. In a
diluted solution of a nonionic solute, solute–solute interactions can in principle be
neglected, since there is a small probability of two solute molecules being at an
interaction range distance. However, for solutions of ionic compounds, the elec-
trostatic interactions between positively and negatively charged ions may be
important even for diluted solutions, since the potential energy of interaction
between two point charges varies as the long-range function 1/R, where R is the
distance between the charges. For aqueous diluted solutions, the solvent concen-
tration is approximately equal to 55.5 M (1000 g dm−3/18 g mol−1). Considering
a 0.10 M nonionic solution, for each solute molecule there are about 5.5 102
H2O molecules. Solute–water interactions may appreciably affect the hydrogen
bond network and the structure of water in a neighborhood of each solute molecule.
Solubility is the concentration of a saturated solution where the solute in solution
is in equilibrium with the pure solute. The sodium chloride solubility in water at 25
°C is about 6.15 mol of sodium chloride per kilogram of water; that is, a saturated
sodium chloride solution contains nine water molecules for each pair of sodium–
chloride ions. At the same temperature, the solubility of carbon tetrachloride in
water is about 0.0042 mol per kilogram of water, meaning that for each carbon
tetrachloride molecule, there are more than 13,000 water molecules. Mixing carbon
tetrachloride with water leads to distinct phases, and thus one usually says that
carbon tetrachloride and water are immiscible liquids, that is, they do not (signif-
icantly) mix. On a molar basis, sodium chloride is approximately 1500 times more
soluble in water than in carbon tetrachloride.
We are now interested in finding a solvent property that may quantify the solvent
polarity. Being a molecular substance, the solvent is a nonconducting material or
dielectric. When we apply an electric field, charges are slightly displaced from their
equilibrium positions, and, in the liquid phase, the total molecular dipoles (per-
manent plus induced dipoles) tend to become aligned with the applied electric field.
360 7 Water
C ¼ q=V ð7:32Þ
[the SI unit of capacitance is called the farad (F) after Michael Faraday (1791–
1867); 1 F = 1 C V−1]. The capacitance of a parallel-plate capacitor can be related to
its geometry and the electric nature of the material between plates as
C ¼ e A=d ð7:33Þ
where A is the plate area, d is the plate separation, and e is the permittivity of the
material between the plates. The permittivity of vacuum is a universal constant (eo =
8.854187816 10−12 F m−1). Usually, the permittivity of a dielectric is equated to the
product of the permittivity of vacuum by a dimensionless factor called the relative
permittivity er, where
e ¼ e0 er ð7:34Þ
where e is the permittivity of the material in which the charges q1 and q2 are
immersed, and r is the distance between them. Both in (7.33) and (7.35), the
Fig. 7.30 An external electric field induces an electric field inside the dielectric. Inset shows
details of induced dipoles in the molecules of the dielectric
7.12 Water as Solvent 361
Fig. 7.31 Argon and methane solubilities in water given by mole fractions (note the 105 scaling
factor) and the solubility of sodium chloride given as molality, as functions of temperature
(Haynes 2011). This figure was obtained with Mathematica
permittivity can be seen as a scaling factor that measures the polarity of the
medium.
At ambient temperature, the relative permittivity of water is 78.3553. This means
that the potential energy of attraction between sodium and chloride ions in water is
78 times smaller than the potential energy of attraction for the same ions, at the
same distance, in free space. Hence, water is quite efficient for promoting the
dissociation of sodium chloride. In an aqueous sodium chloride solution, the local
electric field created by the Na+ and Cl– ions reorients the neighboring water
molecules around the ions by aligning their dipole moments with the electric fields
generated by the ions. In contrast with water, the relative permittivity of carbon
tetrachloride at ambient temperature is 2.2280. Since electrostatic interactions are
much less important in a solvent of high permittivity than in a solvent of low
permittivity, sodium chloride is very soluble in water but is insoluble in carbon
tetrachloride.
Simple nonpolar solutes like argon and methane have extremely small solubilities
in water that decrease with increasing temperature, in contrast with the large sol-
ubilities of electrolytes like sodium chloride that increase with temperature
(Fig. 7.31). In order to understand these trends, we consider, on the one hand, the
potassium ion–water molecule interactions, which are dominated by electrostatic
interactions between the potassium ion charge and the dipole moment of neigh-
boring water molecules (the electric field produced by the potassium ion aligns the
dipole moments of surrounding water molecules, thus producing a solvated
potassium ion, [K(OH2)x]+), and on the other hand, the absence of charge and any
electric moment in the argon atom, implying the nonexistence of electrostatic
interactions between the argon atom and the water molecules, with these becoming
available to form hydrogen bonds between them (Fig. 7.32). An increase in
362 7 Water
Fig. 7.32 Schematic representation of the solvation in water of a potassium ion and an argon
atom
Consider now a gas like argon or methane in contact with water. A decrease in the
volume of the gas as a result of its dissolution in water, at a defined temperature and
constant pressure of 1 atm (=101,325 Pa), corresponds to an irreversible process
leading to equilibrium (Fig. 7.33). The ratio of the gas volume change DVg to the
volume of the aqueous solution Vsln, at equilibrium,
DVg
cs ¼ ð7:36Þ
Vsln eq
is called the Ostwald coefficient and can be used to measure the gas solubility in
water.
The Ostwald coefficient can also be defined as the ratio of the molar densities of
the nonpolar solute in the liquid to gas phases at equilibrium,
7.13 Simple Nonpolar Solutes 363
!
DVg ns =Vsln qs;sln
cs ¼ ¼ ¼ ð7:37Þ
Vsln eq ns =DVg eq qg
eq
where ns is the amount of the gas solute dissolved in water, and qs,sln is the molar
density of the solute in the liquid phase. At equilibrium, the whole system has two
phases, two independent components, and two degrees of freedom, which are
temperature and pressure. The chemical potentials of the solute in the liquid and in
the gas phases are equal,
ls;sln ¼ lg ð7:38Þ
since the aqueous solution is saturated in the gas. The chemical potential of the gas,
assumed to be ideal (the solute is nonionic and extremely diluted), is
p
lg ¼ l0g ðTÞ þ RTln 0 ð7:39Þ
p
p ¼ qg RT ð7:40Þ
We now consider the chemical potential of the solute in the liquid phase. The
extremely low solubility of a nonpolar solute like argon or methane in water and, as
a result, the random distribution of its molecules in water lead to an ideal solution.
Hence, we can assume unitary activity coefficients and write
364 7 Water
where xs,sln is the solute mole fraction in the liquid phase (the solution). Considering
the extremely low solubility of the nonpolar solute in water, then
where Vsln is the volume of the liquid phase, and nS and qS are the amount of
solvent and the molar density of the solvent, respectively. Substitution of (7.43) in
(7.42) leads to
At equilibrium, the chemical potentials of the nonpolar solute in the liquid and
gas phases are equal, that is,
l0s;sln ðqS ; TÞ þ RTln qs;sln eq ¼ l0g p0 ; T þ RTln qg eq ð7:45Þ
Rearranging this equation and using the last equality of (7.37), we obtain
l0s;sln ðqS ; T Þ l0g p0 ; T ¼ RTlncs ð7:46Þ
where
DGg!sln ðTÞ l0s;sln ðqS ; TÞ l0g p0 ; T ð7:48Þ
We have been dealing with nonpolar solutes of small molecules that are gases when
pure. However, when the pure nonpolar solute is a liquid and its solubility in water
is exceeded, liquid phase separation occurs between water and the liquid nonpolar
solute. Considering a gradual increase of the nonpolar solute, it is inevitable to
conclude that the liquid nonpolar solute phase results from coalescence of solute
cavities (Fig. 7.35). This is a gradual and progressive irreversible process that
begins well before phase separation with the coalescence of solute cavities, with
Fig. 7.34 Thermodynamic functions for the transfer of one mole of the nonpolar solute from the
gas phase to the ideal solution, in the liquid phase, as functions of temperature [Ostwald
coefficients for argon in water were taken from Ben-Naim and Baer (1963), and those for methane
in water were taken from Yaacobi and Ben-Naim (1973)]. This figure was obtained with
Mathematica
366 7 Water
one solute molecule forming larger cavities with two solute molecules, followed by
the formation of cavities with three solute molecules, four solute molecules, and so
on, until large clusters of solute molecules can eventually form a distinct liquid
phase.
A nonpolar solute in water that leads to liquid phase separation is called an oil.
A typical example of an oil is n-decane, a linear alkane with ten carbon atoms per
molecule. The process of coalescence of solute cavities cannot ignore the relevant
role of water in the formation of hydrogen bonds between water molecules on the
surfaces of the solute cavities (hydrogen bonds mainly oriented as tangents to the
cavity surface), and in the important contribution that the high surface tension of
water gives to the reduction of the surface area during coalescence of solute cav-
ities. In fact, the ratio of surface area to volume of a cavity decreases as solute
cavities coalesce (Fig. 7.35). In the absence of the guest molecule, solute cavities
could not exist. They would become unstable and would collapse. The dispersion
interaction between nonpolar solute molecules in the solute cavities is often called
hydrophobic interaction.
Due to its high relative permittivity water can easily dissociate strong electrolytes,
yielding solutions that conduct electricity (ionic solutions). Consider the dissocia-
tion of a strong electrolyte
Fig. 7.35 Coalescence of oil-in-water cavities leads to a decrease in the surface area per oil
molecule and to an increase in dispersion interactions between oil molecules
7.14 Ionic Solutions 367
For each mole of the electrolyte, the mole fractions of the cation and anion [see
(7.51)] are given by
n m
cation: anion: ð7:54Þ
nþm nþm
Applying these mole fractions to (7.53) and adding the resulting equations, we
obtain the chemical potential for the electrolyte, given by
where
Note that the arithmetic mean of the chemical potentials of the cation and anion
corresponds to the geometric mean of mole fractions and activity coefficients, since
these are arguments of a logarithm function. For sodium chloride, m = 1 and n = 1,
and the mean activity coefficient is given by
ns ns ns =1 kg solvent ms
xs ¼ ¼ ¼ ð7:58Þ
ns þ nS nS nS =1 kg solvent mS
where ns and nS are the amounts of solute and solvent and ms and mS are the solute
and solvent molalities. For water as solvent of a diluted solution, mS 55.55 mol
368 7 Water
(kg water)−1. Note that 1/mS 0.018 kg mol−1, the molar mass of water expressed
in kilograms (E11).
We have previously considered the chemical potential for a nonideal solution in
terms of the activity of a solute
where
as ¼ c s x s ð7:60Þ
where
+
+
Fig. 7.36 Schematic representation of motion of ions around a particular cation. Note that anions
are attracted by the cation, whereas cations are repelled. Therefore, anions remain longer than
cations within the ionic atmosphere
where zc and za are the charge numbers of the cation and anion, A = 0.509 for water
(note that this value uses a base-10 logarithm), and I represents the ionic strength
which is given by
1 X 2
I¼ zk mk =m0 ð7:64Þ
2 k
where the summation is over all ions in solution, zk is the charge number of ion k,
mk is its molality, and m0 = 1 mol (kg water)−1 (E12). The following figure presents
the mean activity coefficients of sodium chloride, potassium chloride, and hydrogen
chloride as functions of the solute molality, and shows that only extremely diluted
ionic solutions have activity coefficients equal to 1 (Fig. 7.37).
Fig. 7.37 Mean activity coefficients of sodium chloride, potassium chloride, and hydrogen
chloride as functions of solute molality in aqueous solutions. Data taken from Haynes (2011). This
figure was obtained with Mathematica
370 7 Water
Fig. 7.38 Electrostatic potential mapped on isodensity surface with isovalue 0.00040 for the
decanoate ion [Gaussian 09 B3LYP/6-31G(d,p) calculation]. On the left, the isodensity surface
was intentionally made transparent to visualize the molecular geometry
7.15 Amphipathic Molecules 371
nM1 Mn ð7:65Þ
C0 ¼ C1 þ Cn ð7:66Þ
Fig. 7.39 As an amphipathic solid is gradually added to water, the amphipathic molecules begin
by covering the interfacial liquid–air and liquid–solid surfaces of the beaker, and the surface
tension of the medium decreases. Once the critical micellar concentration is reached, micelles are
formed, and the surface tension shows an approximately horizontal variation (note that the graph is
merely schematic)
372 7 Water
1 ¼ x1 þ xn ð7:67Þ
where x1 is the mole fraction of the monomers in the bulk solvent (=C1/C0) and xn is
the mole fraction of the monomer in the micelles (=Cn/C0). Under ideal conditions
(negligible monomer–micelle interactions), the equilibrium constant for (7.65) is
given by
ðCn =nÞ
Kn ¼ ð7:68Þ
C1n
Cn ¼ xn C0 C1 ¼ ð1 xn ÞC0 ð7:69Þ
in (7.68) gives
xn
Kn ¼ C 1n ð7:70Þ
nð 1 x n Þ n 0
After taking the 1/n root of both members and rearranging, we obtain
x1=n h i
ð1=n1Þ
xn ¼ 1 C 0 n
¼ f C0 ; x1=n
n ð7:71Þ
ðnKn Þ1=n
This equation cannot be algebraically solved with respect to xn, since this
variable appears in both members. Therefore, it needs to be solved numerically by
iteration, where an initial guess of xn is introduced in the second member of (7.71),
and this member of the equation is solved to give an improved value of xn in the
first member of (7.71). This value is again introduced in the second member,
leading to a new and presumably better value of xn. Then, we can write
h i
1=n
xnði þ 1Þ ¼ f C0 ; xnðiÞ ð7:72Þ
For C0 > CMC, sodium decanoate in water forms spherical micelles with a more or
less defined aggregation number. The methyl protons of the decanoate ion in the
bulk solvent and in the micelles “feel” distinct environments that can be probed
using 1H nuclear magnetic resonance. Nuclear magnetic resonance (NMR) spec-
troscopy measures frequencies and intensities of spin transitions of nuclei with
nonnull spin quantum number. In order to observe a nuclear spin transition, a
magnetic field has to be applied to split the otherwise degenerate spin states. The
chemical shift of a nucleus is given by
ðm mref Þ
d¼ 106 ð7:73Þ
mref
where m is the frequency of the considered nucleus and mref is the frequency of a
reference, that for 1H NMR is usually tetramethylsilane, Si(CH3)4. Since the order
of magnitude of the frequency difference m − mref is 10−6 mref, one usually includes
in (7.73) a factor 106, thus expressing the chemical shift in parts per million (ppm).
The chemical shift of a nucleus is independent of the applied magnetic field, since
both m − mref and mref are proportional to the applied magnetic field.
Consider now the single equilibrium (7.65) between the monomers of the
decanoate ion in the bulk (D2O) and the decanoate ion micelles. Our model system
(see Andrade-Dias et al. 2007) consists of two proton states, one for the methyl
group protons of the decanoate ion in the bulk, the other for the methyl group
protons of the decanoate ion in the micelles. Nuclear magnetic resonance cannot
distinguish the chemical shifts of the methyl group protons in both of these
374 7 Water
environments, due to fast exchange in the NMR time scale between the monomer in
the bulk and in the micelles. Hence, the observed chemical shift is the average of
the chemical shifts of the methyl group protons in both of these environments,
weighted by the corresponding mole fractions (fast exchange regime),
where d 01 is the chemical shift for the methyl protons when x1 = 1, i.e., when all the
monomers are in the bulk solvent, and d0n is the chemical shift for the methyl
protons when xn = 1, i.e., when all the monomers are in the micelles Substitution of
(7.67) in (7.74) and rearrangement of the obtained equation leads to
Dd ¼ xn Dd0 ð7:75Þ
where
C0 Dd ¼ Cn Dd0 ð7:77Þ
For C0 < CMC, micelles do not occur and Cn = 0, that is, the first member of
(7.77) is equal to zero. For C0 > CMC, the concentration of the monomers in the
micelles (Cn) increases linearly with C0, since Dd0 is a constant in the experiment.
Figure 7.41 shows C0 Dd versus C0 for sodium decanoate in D2O (D = 2H). It
can be seen that the general pattern predicted by Eq. (7.77) is approximately fol-
lowed. The addition of an ionic compound like sodium fluoride increases the ions
Fig. 7.41 Dd.C0 versus C0/mM for sodium decanoate in deuterated water (left), and with sodium
fluoride ([NaF] = 300 mM) added (right). The intercepts of straight lines with the C0 axis give
approximate values for the critical micelle concentrations (Left, CMC110 mM; Right,
CMC 73 mM). This figure was obtained with Mathematica
7.15 Amphipathic Molecules 375
of opposite charge in the medium, thus reducing the repulsion between the polar
heads of the amphipathic molecules in the micelles and lowering the critical micelle
concentration.
corresponds to the transfer of a hydrogen ion from one HCl molecule that converts
itself into a Cl- ion to one NH3 molecule that produces an ammonium ion NH4+.
This reaction illustrates the concepts of acid and base that were developed by J.N.
Bronsted (1879–1947, Danish) and T.M. Lowry (1874–1936, British) in 1923: the
acid is the chemical species that donates one hydrogen ion; the base is the chemical
species that accepts one hydrogen ion.
Let us substitute ammonia by water. In contrast to (7.78), the reactants now have
different physical states, since pure hydrogen chloride is a gas and water is a liquid.
The greater density of water “tilts” all molecular events on its side. In a 0.10 M
aqueous solution, for each mole of solute there are about 5.5 102 H2O molecules!
Water has the important role of being the solvent, intervening in all chemical
processes that occur in solution. The chemical equation is
where HCl acts as a Bronsted–Lowry acid by donating a hydrogen ion to one H2O
molecule and leaving behind a chloride ion, and H2O acts as a Bronsted–Lowry
base by receiving one hydrogen ion to form one hydronium ion, H3O+ (this ion has
a triangular pyramidal geometry). The symbol aq in curved brackets indicates that
the chemical species to which it applies is solvated by water. Since each hydrogen
chloride molecule is surrounded by water molecules, each hydrogen chloride
molecule does not need to migrate to form an acid–base pair.
For aqueous solutions with concentration not exceeding 1 M, scientific studies
have concluded that an acid–base reaction includes H2O molecules for solvation of
the acid and/or the base, that is, H2O molecules mediate the hydrogen ion transfer
between acid and base. When mediated by one H2O molecule, the hydrogen ion
transfer becomes one order of magnitude slower (1 ps) than the unmediated
transfer. In turn, concentrated aqueous solutions present a bimodal mechanism,
376 7 Water
since acid–base pairs also occur in solution, leading to the direct transfer of a
hydrogen ion between acid and base (see Rini et al. 2003; Mohammed et al. 2005).
The water molecule has both acid and base functionalities. The following chemical
equation describes the autoionization equilibrium of water:
From the thermodynamic point of view, every chemical species requires the
definition of a standard state. However, there are no substances formed by ions with
the same charge, since the extremely high electrostatic repulsion between the ions
would prevent the formation of a stable substance. In aqueous solution, it is
established by definition that at all temperatures,
and as a consequence,
Cp ½H þ ðaqÞ ¼ 0 ð7:82Þ
where Cp is the isobaric heat capacity. Since (7.81) and (7.82) are thermodynam-
ically defined for the hydrogen ion H+, thermodynamics represents the autoion-
ization of water by the chemical equation
T = 298.15 K DH0f /kJ mol−1 DG0f /kJ mol−1 S0/J mol−1 K−1
+
H (aq) 0 0 0
OH–(aq) −230.0 −157.2 −10.9
H2O(L) −285.8 −237.1 70.0
aH þ aOH
Kw ¼ aH þ aOH ð7:85Þ
aH 2 O
where ak is the activity of chemical species k (note that aH2 O 1). At 25 °C, the
constant of autoionization of water is approximately equal to 1.0 10−14. Fig-
ure 7.42 shows the temperature dependence of -logKw, from 0 to 350 °C. The
minimum of this curve occurs at 250 °C, with −logKw = 11.2.
Since H3O+(aq) is equivalent to H+(aq) + H2O(aq) from the stoichiometric point
of view, the activity of the hydronium ion is defined by
aH 3 O þ ¼ aH þ aH 2 O ð7:86Þ
H3 O þ H2 O þ H þ ð7:87Þ
is 1, which, in turn, means that H3O+(aq) and H+(aq) + H2O(aq) are also equivalent
from the thermodynamic point of view. Note that the hydronium species H3O+
exists in water as such and as chemical species resulting from solvation with
different numbers of water molecules, such as H5O2+, H7O3+, and H9O4+. On the
other hand, H+ is a proton that really exists only in the transition state of an acid–
base reaction like (7.78). From (7.86) and aH2 O 1, it can be concluded that the
equilibrium constant for the autoionization of water as expressed by (7.80) is
numerically equal to the equilibrium constant for the autoionization of water
obtained from (7.83).
The hydrogen ion activity is usually expressed by the solution’s pH, defined by
pH log aH þ ð7:92Þ
For very diluted aqueous solutions, we can assume that the activity coefficients are
approximately equal to 1, and the substitution of (7.92) and (7.93) in (7.91) leads to
xHA
pH pKHA log ð7:94Þ
xA
ð1 xA Þ
pH pKHA log ð7:96Þ
xA
1 1
xA xHA ð7:97Þ
1 þ 10ðpKHA pHÞ 1 þ 10ðpHpKHA Þ
Figure 7.43 shows, at left, xHA and xA as functions of pH for pKHA = 4, and xHA
as a function of pH for pKHA = 2, 3, 4, 5, and 6; xHA and xA are S-shaped functions
of pH that sum to 1 and cross at the function value 1/2 when pH = pKHA.
7.16 Acids and Bases 379
Fig. 7.43 xHA and xA as functions of pH for pKHA = 4, at left, and xHA as a function of pH, for
pKHA = 2, 3, 4, 5, and 6, at right. This figure was obtained with Mathematica
where
mA mH þ
Ka ¼ ð7:99Þ
mHA
Fig. 7.44 pH as a function of xA for pKHA = 4, at left, and dxA /dpH as a function of xA with
maximum at xA ¼ 0:5, at right. This figure was obtained with Mathematica
380 7 Water
shares with (7.100) the essential features of an acid–base chemical reaction: both
the boron atom of trimethylborane and the hydrogen ion are electrophiles (the
hydrogen ion is an extremely strong electrophile) that bond to ammonia through the
lone electron pair of nitrogen. Hence, (7.100) is an acid–base reaction and
trimethylborane an acid, despite lacking any ionizable hydrogen atom.
Gilbert Lewis (1875–1946), known for his association with the valence electron
pair structures of molecules, captured the similarity between reactions (7.100) and
(7.101) and proposed an acid–base concept that comprises both of these reactions.
According to Lewis, an acid is a chemical species that contains an electrophilic
center and attracts a nonbonding electron pair from the base (the nucleophile), thus
forming a covalent bond between the electrophilic and the nucleophilic atoms or
ions. Hence, (7.100) is a Lewis acid –base reaction, where B(CH3)3 is the Lewis
acid, NH3 is the Lewis base, and (CH3)3B-NH3 is the Lewis complex.
The elementary process of addition of one hydrogen ion (a Lewis acid) to the
oxygen atom of a water molecule (a Lewis base),
H þ þ H2 O ! H3 O þ ð7:102Þ
7.16 Acids and Bases 381
leads to formation of one hydronium ion (a Lewis complex). This elementary acid–
base reaction is similar to (7.101).
A different example of a Lewis acid–base reaction is given by a carbon dioxide
molecule reacting with the hydroxide ion to form the hydrogenocarbonate ion,
In this example, carbon dioxide is the Lewis acid, the hydroxide ion is the Lewis
base, and the hydrogenocarbonate ion is the Lewis complex.
The solvation of a metal cation can also illustrate a Lewis acid–base reaction in
which the Lewis acid is the metal cation, the coordinating water molecules are the
Lewis bases, and the obtained aquo-complex is the Lewis complex. For instance,
2 þ
Cu2 þ þ 6H2 O ! CuðH2 OÞ6 ð7:104Þ
Lewis and Bronsted–Lowry acid–base reactions are of different types: while the
first is an addition reaction (the Lewis acid reacts with the Lewis base to form the
Lewis complex), the second is a displacement reaction (the hydrogen ion is
transferred from the Bronsted–Lowry acid to the Bronsted-Lowry base). According
to Lewis nomenclature, the Bronsted–Lowry acid is the Lewis complex of a
hydrogen ion (a Lewis acid) with a Lewis base, and a Bronsted–Lowry acid–base
reaction is a chemical reaction of the following general kind:
1
H þ ðaqÞ þ e ! H2 ðgÞ ð7:106Þ
2
382 7 Water
where the molality of the hydrogen ion is given by m(H+) = 1 mol (kg of solvent)−1,
and the pressure of molecular hydrogen by p(H2) = 101.325 kPa (Fig. 7.45). The
thermic coefficient for the standard hydrogen electrode potential, deo/dT, is also
zero by definition, meaning that the above definition of the standard hydrogen
electrode potential applies to all temperatures.
The International Union of Pure and Applied Chemistry (IUPAC) proposed a
method for determination of hydrogen ion activity that uses the Harned cell,
1
H2 ðgÞ þ AgCl(s) ! Ag(s) þ H þ ðaqÞ þ Cl ðaqÞ ð7:107Þ
2
(see Covington 2011; Haynes 2011). This cell involves the silver chloride–silver
electrode
and the hydrogen electrode (7.106) and does not use any liquid junction. The
method measures the potential difference between two Harned cells designed in
such way that the obtained value does not depend on the standard hydrogen
electrode potential, which is zero by definition, since this electrode potential is
canceled in the potential difference. The measured potential difference depends only
on the hydrogen ion activity of the hydrogen chloride aqueous solution in one of the
cells in which HCl is present at a fixed molality [e.g., m = 0.01 mol (kg of
solvent)−1], and the mean activity coefficient is known (e.g., at 298.15 K, c± =
0.904). The method enables one to determine the hydrogen ion activity by the
solution pH [see (7.92)] at different temperatures.
Half-equations (7.106) and
1
H2 OðaqÞ þ e ! H2 ðgÞ þ OH ðaqÞ ð7:109Þ
2
involve the same oxidation states of hydrogen (+1, 0), in acid and alkaline solu-
tions, respectively. Since the activities of H+ and OH− are interdependent by the
ionization constant of water, the above half-equations are equivalent from the
physical point of view, that is, they correspond to the same electrode potential,
whether it is obtained by half-equation (7.106),
1=2
RT aH2
e¼ ln ð7:110Þ
F aH þ
or by half-equation (7.109),
1=2
RT aOH aH2
e ¼ e0H2 O=H2 ln ð7:111Þ
F aH 2 O
RT
e0H2 O=H2 ¼ ln Kw ð7:112Þ
F
Taking into consideration the zero of standard electrode potentials (the hydrogen
standard electrode potential), (7.113) can be replaced by
G0 ¼ nFe0 ð7:114Þ
Let us now consider a metal M with oxidation states Mm+ and Mn+ (n > m).
Being an extensive thermodynamic function, the Gibbs energy is additive, and so
we can write
This result allows us to determine any one of these standard potentials, provided
we know the other two. For instance, if (7.116) is applied to the standard potentials
of iron in acidic solution, we can write
(E14, E15).
Let us consider now manganese and its various oxidation states in acidic solu-
tion: Mn(VII) as MnO4–, Mn(VI) as MnO42–, Mn(IV) as MnO2, Mn3+, Mn2+, and
Mn. The Latimer diagram for this element shows the standard electrode potentials
connecting the various oxidation states of the element (Fig. 7.46).
A graph of ne0 as a function of n is called Frost diagram, where each point
corresponds to the pair of values (n, ne0) and the slope between two consecutive
points m and n is given by
Figure 7.47 shows a Frost diagram for manganese, in acidic aqueous solution.
This diagram can be obtained using the standard electrode potentials of the Latimer
diagram of Fig. 7.46, with the values to be plotted on the y-axis being obtained by
multiplying the number of electrons involved in the reduction half-reaction by the
corresponding standard electrode potential.
By the Latimer diagram, it can be seen that the reduction pair Mn3+/Mn2+ has a
standard reduction potential, e0 = 1.5415 V, greater than that of the pair Mn2+/Mn,
e0 = −1.185 V. Therefore, the oxidation–reduction reaction involving Mn3+, Mn2+,
and Mn in acidic solution is
Fig. 7.46 Latimer diagram for manganese with its various oxidation states in acidic aqueous
solution. The standard electrode potentials are expressed in volts (V) and were extracted from
Haynes (2011), with those inside parentheses having been calculated. This figure was obtained
with Mathematica
7.17 Standard Electrode Potentials 385
Mathematica Codes
The following Mathematica code generates contour plots in the xy-plane for the
electrostatic potential of a distribution of two charges (−1 and 1), located on the y-
axis, at y = 0.5 and y = −0.5. The code begins by defining the function f[q,p,
r], which gives the electrostatic potential of a static distribution of electric charges
[see (5.13)]. The Mathematica command Lighter[Gray,i] gives the fraction i
of Gray (i=0 corresponds to Gray and i=1 to White), and the Mathematica
command ContourPlot generates the contours of a function in the xy-plane,
which is the electrostatic potential of a dipole:
Mathematica Codes 387
-
- - -
This Mathematica code uses Manipulate to control the absolute value of the
charge in the electrostatic potential of a dipole:
3
- 0.1
2
- 0.4
- 0.2
- 0.3
0 0
-1 0.3
0.4 0.2
-2
0.1
-3
-3 -2 -1 0 1 2 3
Suggestion: Run the code and observe the contour changes as the absolute value
of the charge is increased.
1 1
/ðx; yÞ ¼
r þ r
Mathematica Codes 389
where
h i1=2 h i1=2
r þ ¼ ðx 1Þ2 þ y2 r ¼ ðx þ 1Þ2 þ y2
1 1 1 1
/ðx; yÞ ¼ ¼h i1=2 h i1=2
r þ r
ðx 1Þ2 þ y2 ðx þ 1Þ2 þ y2
@/ x1 xþ1
Ex ¼ ¼h i3=2 h i3=2
@x
ðx 1Þ2 þ y2 ðx þ 1Þ2 þ y2
@/ y y
Ey ¼ ¼h i3=2 h i3=2
@y
ðx 1Þ2 þ y2 ðx þ 1Þ2 þ y2
dx dx
/ðdÞ ¼ ¼
r 3 ðx2 þ y2 Þ3=2
and
@/ 3dx2 d @/ 3dxy
Ex ¼ ¼ Ey ¼ ¼
@x ðx2 þ y2 Þ5=2 ðx2 þ y2 Þ3=2 @y ðx2 þ y2 Þ5=2
The above equations are used in the following Mathematica code, where we
have set d = 0.01. The command StreamPlot plots the vector field as a function
of x and y. Observe the difference between the contours for the dipole and the point
dipole:
390 7 Water
4 4
2 2
0 0
-2 -2
-4 -4
-4 -2 0 2 4 -4 -2 0 2 4
Suggestion: Run the above code with different values for d, for example 0.1 and
0.5, and observe the differences.
Assume that in liquid water, there are five types of H2O molecules according to the
number of hydrogen bonds each molecule can form, namely, 0, 1, 2, 3, or 4. This
Mathematica code uses (7.26) to calculate the distribution of hydrogen bonds in
water according to Luzar’s model as temperature varies between 273 and 373 K, in
steps of 1
K. The Mathematica function Binomial[n,k] gives the binomial
n 4
coefficient , and the following line of code calculates for k = 0, 1, 2, 3, 4:
k k
Mathematica Codes 391
35
30
25
20
15
10
0 1 2 3 4
Suggestion: Run the above Mathematica code and find how the hydrogen bond
network changes with temperature.
The following Mathematica code shows how to solve (7.71) by the iteration process
given by (7.72). The independent variable represented in the code by x is the total
concentration of monomers C0, and # is the mole fraction of monomers in the
micelles xn that appears in both members of (7.72).
In the code below, the Mathematica function Nest applies three times (three
iterations) to 1-(x*10^-3)^(1./n-1)*(#/(n*10^(2 n)))^(1/n)&. In
this expression [see (7.71)], x in millimolar units is multiplied by 10−3 to convert
into molar units, and the equilibrium constant Kn is made equal to 102n. The result
of the third iteration is given by iteration[[3]]:
392 7 Water
The following Mathematica code shows how to obtain an estimate for the critical
micelle concentration (CMC 21 mM) through the intersection of tangents to the
xn-versus-C0 curve, at C0 = 10 mM and C0 = 800 mM. The intersection of these
tangents is the point at which h[x] and g[x] (the derivatives of f[x]) are equal.
In order to solve the equation that expresses this equality, the code uses FindRoot
to search a numerical root starting from x=50:
Mathematica Codes 393
The way in which the solution of a weak acid HA resists a change of pH can be
determined by evaluating dxA =dpH. The following Mathematica code considers
the pH as a function of xA [see (7.96)] for pKHA = 4, and evaluates dxA =dpH
by taking the inverse of the derivative of pH = f(xA ), as a function of xA , at
right. Note that the range of xA values is [0.001, 0.999], since 0 and 1 would give
rise to singularities. It is shown that the solution attains maximum buffer capacity at
xA = 0.5, when HA and A– have equal mole fractions. The code makes use of
Line for representing the dashed line segments joining a sequence of points:
394 7 Water
The following Mathematica code plots xA and dxA =dpH as functions of pH
[see (7.97)] for pKHA = 4:
Glossary
Exercises
E1. Considering the dipole moment definition, for a static distribution of point
charges, show that the dipole moment vector points to the positive charge.
E2. Show that the dipole moment of a charge distribution whose total charge is
zero is independent of the origin.
E3. Considering the electrostatic potential atomic charges of H2O, qO = −0.680
and qH = 0.340, and the geometric values ROH = 0.9611 Å and aHOH =
104.53°, obtain the molecular dipole moment.
396 7 Water
E4. Use Mathematica to plot the angular functions involved in the d and d3
terms of (7.13). Show that those functions are proportional to the Y01(h,0) and
Y03(h,0) spherical harmonics in the plane.
E5. Use Mathematica to plot the contours of the electrostatic potential of a
quadrupole with the charges (−0.225, −0.225, 0.225, 0.225) at the x,y-coordi-
nates (0., 0.1176), (0., −1.05922), (−1.0, −0.48), and (1., −0.48), respectively.
Compare these contours with those of the quadrupole term for H2O (Fig. 7.7).
E6. At CCCBDB (cccbdb.nist.gov), find the energy of H2O at equilibrium (no
zero-point correction) given by calculations obtained with the B3LYP method
and the STO-3G, 3-21G, 6-31G, 6-311G*, 6-311G**, 6-311+G(3df,2p), 6-311
+G(3df,2pd), cc-pVQZ, aug-cc-pVQZ basis sets. Use Mathematica to plot these
data with the energy in eV. Comment on the obtained graph.
E7. Derive (7.22).
E8. Specify the symmetry species of the normal modes of H2O and DHO.
E9. Chlorine has two natural isotopes, 35Cl and 37Cl, whose natural abundances
are in the approximate ratio of 3:1. The breathing mode (symmetric stretching of
the carbon tetrachloride molecule can be observed in the Raman spectrum of this
liquid. Give an estimate of the relative Raman intensities of the distinct isotopic
lines of the breathing mode of the carbon tetrachloride molecule and the fre-
quency difference in cm−1 between the lines corresponding to the isotopic
combinations (35, 35, 35, 35) and (35, 35, 35, 37). The breathing vibration of
the (35, 35, 35, 35) isotopic combination occurs at approximately 459 cm−1
(CCCBDB).
E10. Consider phb as a function of e, for T = 298.15 K, and assume a = 178
kJ mol−1. Use Mathematica to plot the function phb(e) and its first derivative
with respect to e, and determine the inflection point of phb(e).
E11. Consider a dilute solution of one particular nonpolar solute and convert
mole fraction into molarity. What is the value of the conversion factor when the
solvent is water?
E12. Calculate the mean activity coefficient for a 0.001 molal aqueous solution
of CaCl2.
E13. An oxidizing agent that converts H2O into O2 and a reducing agent that
reduces H2O to H2 are not thermodynamically stable in aqueous solution.
Derive the potentials of the O2/H2O and H2O/H2 reductions as functions of pH
at constant temperature (T = 298.15 K). Assume that pO2 = pH2 = p0 = 1 atm.
Making use of Mathematica, plot the obtained functions and fill with gray the
region of thermodynamic stability for water.
E14. Consider MnO2, Mn3+, and Mn2+ in acidic aqueous solution. Evaluate
e0(MnO2/Mn3+), knowing that e0(MnO2/Mn2+) = 1.224 V and e0(Mn3+/Mn2+)
= 1.5415 V.
E15. Consider MnO4–, MnO42–, and MnO2 in acidic aqueous solution. Evaluate
e0(MnO42–/MnO2), knowing that e0(MnO4–/MnO42–) = 0.558 V and e0(MnO4–/
MnO2) = 1.679 V.
E16. Consider e0(Fe3+/Fe2+) = 0.771 V and e0(Fe2+/Fe) = -0.447 V and an
acidic aqueous solution consisting of Fe(III), Fe(II), and Fe(0). Use
Exercises 397
Mathematica to obtain the Frost diagram for iron in acidic aqueous solution.
How are Fe3+, Fe2+, and Fe(s) related, by disproportionation or compropor-
tionation? Explain.
E17. Consider e0(Cu2+/Cu+) = 0.153 V and e0(Cu+/Cu) = 0.521 V and an acidic
aqueous solution consisting of Cu(II), Cu(I), and Cu(0). Use Mathematica to
obtain the Frost diagram for copper. How are Cu2+, Cu+, and Cu(s) related, by
disproportionation or comproportionation? Explain.
E18. Consider the electrochemical concentration cell Ag|Ag+||Ag+|Ag, and
represent by aa and ac the silver ion activities at the anode and cathode,
respectively. In which electrode, anode or cathode, is the activity of the silver
ion greater? Why? Write the Nernst equation for this cell.
References
Martin JML, de Oliveira G (1999) J Chem Phys 111:1843
Parthiban S, Martin JML (2001) J Chem Phys 114:6014
Aziz RA, Slaman MJ (1991) An examination of ab initio results for the helium potential energy
curve. J Chem Phys 94:8047–8053
Woon DE (1991) Benchmark calculations with correlated molecular wave functions. V. The
determination of accurate ab initio intermolecular potentials for He2, Ne2, and Ar2. J Chem
Phys 100:2838–2850
Seuss E, Bohrmann G, Greinert J, Lausch E (1999) Flammable ice. Sci Am
Luzar A, Svetina S, Zeks B (1983) The contribution of hydrogen bonds to the surface tension of
water. Chem Phys Lett 96:485–490
Teixeira J, Luzar A (1999) Physics of liquid water. Structure and dynamics. In: Bellissent-Funel
MC (ed) Hydration processes in biology, NATO ASI series, Series A: Life sciences, vol 305.
Kluwer, Dordrecht, pp 35–65
Haynes WM (Editor-in-Chief) (2011) Handbook of chemistry and physics on CD-ROM, version
2011. CRC Press, Taylor and Francis Group
Ben-Naim A, Baer S (1963) Method for measuring solubilities of slightly soluble gases in liquids.
Trans. Faraday Soc. 59:2735–2738
Yaacobi M, Ben-Naim A (1973) Hydrophobic interaction in water-ethanol mixtures. J Solution
Chem 2:425–443
Vargaftik NB, Volkov BN, Voljak LD (1983) International tables of the surface tension of water.
J Phys Chem Ref Data 12(3):817–820
Berg JC (2010) An introduction to interfaces and colloids—the bridge to nanoscience. World
Scientific Publishing Co. Pte. Ltd
Oh SG, Shah DO (1993) The effect of micellar lifetime on the rate of solubilization and detergency
in sodium dodecyl sulfate solutions. J Amer Oil Chemists Soc 70:673–678
Andrade-Dias C, Lima S, Teixeira-Dias JJC (2007) From simple amphiphilic to surfactant
behavior: analysis of 1H NMR chemical shift variations. J Colloid Interface Sci 316:31–36
Kapustinskii AF (1956) Lattice energy of ionic crystals. Q Rev Chem Soc 10:283–294
Rini M, Magnes B-Z, Pines E, Nibbering ETJ (2003) Real-time observation of bimodal proton
transfer in acid-base pairs in water. Science 301:349–352
Mohammed OF, Pines D, Dreyer J, Pines E, Nibbering ETJ (2005) Sequential proton transfer
through water bridges in acid-base reactions. Science 310:83–86
Bandura AV, Lvov SN (2006) The ionization constant of water over wide ranges of temperature
and density. J Phys Chem Ref Data 35(1):15–30
398 7 Water
Further Reading
Physical Constants
P.J. Mohr, B.N. Taylor, and D.B. Newell (2011), “The 2010 CODATA
Recommended Values of the Fundamental Physical Constants” (Web Version
6.0). This database was developed by J. Baker, M. Douma, and S. Kotochigova.
Available: http://physics.nist.gov/constants. National Institute of Standards and
Technology, Gaithersburg, MD 20899. Alternatively, each of these physical con-
stants can be obtained from the computational knowledge engine Wolfram Alpha at
www.woframalpha.com.
Conversion Factors
Atomic Units
Greek Alphabet
Integrals
Mathematica instruction Output
Integrate[1/x,x] Log[x]
Integrate[x^n,x] x^(1+n)/(1+n)
Integrate[Sin[x],x] -Cos[x]
Integrate[Cos[x],x] Sin[x]
Integrate[Tan[x],x] -Log[Cos[x]]
Integrate[Exp[-x^2],x] 1/2 Sqrt[p] Erf[x]
Integrate[Exp[-b x^2],{x,0,Infinity}, Sqrt[p]/(2 Sqrt[b])
Assumptions!b>0]
Chapter 1—Thermodynamics
E1. Mathematica Code and Results
E7.
4x2 y3 dx þ 3x3 y2 dy ¼ gðx; yÞdx þ hðx; yÞdy
@gðx; yÞ @hðx; yÞ
¼ 12x2 y2 ¼ 9x2 y2
@y @x
@gðx; yÞ @hðx; yÞ
6¼
@y @x
Therefore, 4x2y3dx + 3x3y2dy is an inexact differential.
E8.
@p @V
pdV þ Vdp ¼ dðpVÞ ¼1 ¼1
@p @V
Therefore, pdV +Vdp is an exact differential.
406 Answers to Exercises
E9.
dU ¼ dQ þ dW
@U @U
dU ¼ dT þ dV dW ¼ pdV
@T V @V T
@U @U @U @U
dT þ dV ¼ dQ pdV dQ ¼ dT þ p þ dV
@T V @V T @T V @V T
@U @V
dQp ¼ Cp dTp CV ¼ dVp ¼ dTp
@T V @T p
@U @U
dQp ¼ dTp þ p þ dVp
@T V @V
T
@U @V
Cp CV ¼ p þ
@V T @T p
E10.
dQ CdT
dS ¼ dS ¼
T T
Z373 Z373
CdT dT 373
DS ¼ ¼C ¼ 4 25:09 ln ¼ 31:3 J K1
T T 273
273 273
E11.
E12.
E13.
V ¼ V0 þ V0 at
t ¼ 0 C ) V ¼ V0
@V 1 @V
¼ V0 a ) a ¼
@t p V0 @t p
E14.
E15.
E16.
dp DSm DHm
Sm;a dT þ Vm;a dp ¼ Sm;b dT þ Vm;b dp ) ¼ ¼
dT DVm TDVm
dp DHm DHm DHm p dp DHm
¼ ) dT
dT TðVm;g Vm;‘ Þ TVm;g RT 2 p RT 2
E17.
E18.
dU ¼ Tde S pdV dU ¼ 0; dV ¼ 0; de S ¼ 0 di S 0
Therefore, S attains a maximum at the end of the irreversible process, when
equilibrium is reached.
E19.
RT aX
X nþ þ ne ! X e ¼ e0 ln
nF aXnþ
1 nþ 1 RT aX 1=n 0 RT aX
X þ e ! X e ¼ e0 ln ¼e ln
n n F aX nþ nF aX nþ
E20.
RT
2Agþ þ Cu ! 2Ag þ Cu2þ De0 ¼ lnKe logKe 16:9 nDe0
nF
Agþ þ e ! Ag e0Agþ = Ag ¼ 0:7996 V
Cu2þ þ 2e ! Cu e0Cu2þ =Cu ¼ 0:3419 V
0
De0 ¼ 0:7996 0:3419 ¼ 0:4577 V Ke 1016:9 nDe
¼ 1016:920:4577 ¼ 1015:5 ¼ 3:2 1015
Standard electrode potential values taken from Handbook of Chemistry and
Physics, 2011.
410 Answers to Exercises
ln0:05 ln0:05
½A ¼ ½A0 ekt 0:05½A0 ¼ ½A0 ekt t¼ ¼ ¼ 8:7 min
k 0:345
E2.
aA þ bB ! cC þ dD
dnA dnB dnC dnD
¼ ¼ ¼ ¼ dn
a b c d
½A ¼ nA =V ½B ¼ nB =V ½C ¼ nC =V ½D ¼ nD =V
d½A 1 dnA d½B 1 dnB d½C 1 dnC d½D 1 dnD
¼ ¼ ¼ ¼
dt V dt dt V dt dt V dt dt V dt
1 d½A 1 d½B 1 d½C 1 d½D 1 dn
¼ ¼ ¼ ¼
a dt b dt c dt d dt V dt
E4.
A þ BC
[A] ¼ [A]eq d [B] ¼ [B]eq d [C] ¼ [C]eq þ d
dd
¼ k! [A][B] k [C] ¼ k! [A]eq d [B]eq d k [C]eq þ d
dt
dd
E5.
(a)
dX dY
¼ k1 A k2 BX þ k3 X 2 Y k4 X ¼ Z1 ðX; YÞ ¼ k2 BX k3 X 2 Y ¼ Z2 ðX; YÞ
dt dt
(
0 ¼ k1 A k2 BXs þ k3 Xs2 Ys k4 Xs Xs ¼ kk14A
0 ¼ k2 BXs k3 Xs2 Ys Ys ¼ kk21 kk43 BA
(b)
@Z @Z !
@Z
1
@X s @Z
1
@Y s k2 B k4 þ 2k3 Xs Ys k3 Xs2
¼
2
@X s
2
@Y s
k2 B 2k3 Xs Ys k3 Xs2
k2 B k4 k3 Xs2 B1 1
¼ ¼
k2 B k3 Xs2 B 1
E6.
(a)
dX dY
¼ k1 AX k2 XY ¼ Z1 (X; Y) ¼ k2 XY k3 Y ¼ Z2 (X; Y)
dt ( dt
0 ¼ k1 AXs k2 Xs Ys Xs ¼ kk32
0 ¼ k2 Xs Ys k3 Ys Ys ¼ kk1 A
2
(b)
@Z @Z !
@Z
1
@X s @Z
1
@Y s k1 A k2 Y s k2 Xs 0 k3 0 1
¼ ¼ ¼
2
@X s @Y
2 k2 Ys k2 Xs k3 k1 A 0 1 0
s
E1.
Ax ¼ kx (A kI)x ¼ 0 det(A kI) = jA kI j ¼ 0
where I is the identity matrix (all its elements are equal to zero except those along
the main diagonal, which equal one), k represents the eigenvalues, and x the
corresponding eigenvectors.
1 1 1 k 1
A¼ ¼ 0 k2 k 6 ¼ 0 k1 ¼ 3 k2 ¼ 2
4 2 4 2 k
1 1 x1 3x1 a
k1 ¼ 3 ¼ x2 ¼ 4x1 eigenvector
4 2 x2 3x2 4a
1 1 x1 2x1 b
k2 ¼ 2 ¼ x2 ¼ x1 eigenvector
4 2 x2 2x2 b
E3.
@ @f @g @ @f
ðf þ gÞ ¼ þ ðcf Þ ¼ c
@x @x @x @x @x
E4.
@ @ @f @f @f @
; x f ¼ ðxf Þ x ¼ f þ x x ¼ f ) ;x ¼ 1
@x @x @x @x @x @x
414 Answers to Exercises
E5.
h @ p2x 1 h @ 2 h2 @ 2
^px ¼ ¼ ¼
i @x 2m 2m i @x 2m @x2
E6.
@s @s
uðx vtÞ ¼ uðsÞ s ¼ x vt ¼1 ¼ v
@x @t
@u @u @s @u @ 2 u @ 2 u
¼ ¼ ¼
@x @s @x @s @x2 @s2
@u @u @s @u @ 2 u @ v @u
@s 2@ u
2
@2u 1 @2u
¼ ¼ v ¼ v ¼ v ¼
@t @s @t @s @t2 @s @s2 @x2 v2 @t2
If the argument x of the function u(x) is changed by x −a, where a is a positive
number, then the function u(x − a) is displaced to the right of u(x) by a. If we let
a =vt, where v is a positive number and t is time, then the function displacement is
continuous and increases with time.
E7.
@s @s
uðsÞ ¼ Acos(kx xtÞ s = kx xt ¼k ¼ x
@x @t
@u @u @2u 2@ u
2
¼ k ¼ kAsin(kx xtÞ ¼ k ¼ k2 Acos(kx xtÞ
@x @s @x2 @s2
@u @u @2u 2@ u
2
¼ ðxÞ ¼ xAsin(kx xtÞ ¼ x ¼ x2 Acos(kx xtÞ
@t @s @t2 @s2
1 @2u 1 @2u 1 2 @2u 1 @2u
¼ 2 2 k ¼ 2 x2 2
k @x
2 2 x @t k 2 @s 2 x @s
E11.
Z Z
b
w Awds ¼ b
w Aw ds w ¼ f þ cg
Z Z h i
b b þ cgÞ ds
ðf þ cgÞ Aðf þ cgÞds ¼ ðf þ cgÞ Aðf
Z Z Z Z
b ds þ c c g Agds
f Af b b ds þ c f Agds
þ c g Af b
Z
Z
Z
Z
b
¼ f Af ds þ c c g Ag ds þ cb b b
f Ag ds þ c g Af ds
Since  is Hermitian, the first terms on the left- and right-hand sides of this equation
are equal, and so are the second terms. Therefore,
Z Z
Z
Z
c b ds
g Af b
f Ag ds ¼ c b
g Af b
ds f Agds
E12.
RM ¼ rA mA þ rB mB RM ¼ rA mA þ rB mB
rmA ¼ rA mA þ rB mA rmB ¼ rA mB rB mB
mA mB
rB ¼ R þ r rA ¼ R r
M M
2 2 2
1 drA 1 drB 1 dR 1 dr 2
mA þ mB ¼ M þ l
2 dt 2 dt 2 dt 2 dt
E13.
1 1 1 1 1
¼ þ þ þ
l mA mB mC mD
E14.
dx d2 x
x ¼ A sinðxt + bÞ ¼ xA cosðxt + bÞ 2
¼ x2 A sinðxt + bÞ
rffiffiffiffi dt dt
k d2 x
x= k ¼ mx2 m 2 ¼ mx2 A sinðxt + bÞ = kx
m dt
E17.
Hnþ1 ðzÞ ¼ 2zHn ðzÞ 2nHn1 ðzÞ n¼3 H4 ðzÞ ¼ 2zH3 ðzÞ 2 3H2 ðzÞ
H2 ðzÞ ¼4z 2
2
H3 ðzÞ ¼ 8z 12z H4 ðzÞ ¼ 16z4 48z2 þ 12
3
16z4 48z2 + 12 ¼ 2z 8z3 12z 6 4z2 2 ¼ 16z4 48z2 þ 12
E18.
2
UðRÞ ¼ D 1 ebðRRe Þ
e Þ ¼ 0
UðR
dU ¼ 2Db ebðRRe Þ e2bðRRe Þ
dU ¼0
dR dR qffiffiffiffiffi
2 R¼Re
d2 U ¼ 2Db2 ebðRRe Þ þ 2e2bðRRe Þ
dU ¼ 2Db2
¼ k b ¼ k
dR 2 dR 2
qffiffiffiffiffi
2D
3 R¼Re
d3 U ¼ 2Db3 ebðRRe Þ 4e2bðRRe Þ
dU ¼ 6Db3 ¼ 3k 2D k
dR 3
d4R R¼Re
3
dR 4 dR 4R¼Re 2D
418 Answers to Exercises
E19.
i j k
L¼ x y z
px py pz
y z z x x y
L¼Lx i + Ly j + Lz k ¼ i þ j + k
p y p p
z z px px p
y
Lx ¼ y pz z py Ly ¼ z px x pz Lz ¼ x py y px
E20.
Lx b
b L y f ¼ y^pz z^py ðz^px x^pz Þf ¼ y^pz ðz^px Þ yx^p2z z2 ^py ^px þ zx^py ^pz f
h
¼ y^px þ yz^pz ^px yx^p2z z2 ^py ^px þ zx^py ^pz f
i
b
Ly b
L x f ¼ ðz^ pz Þ y^pz z^py f ¼ zy^px ^pz z2 ^
px x^ px ^py xy^p2z þ x^pz z^py f
h
^ ^ ^
¼ zy^px pz z px py xy^pz þ x^py þ xz^pz py f
2 2
^
i
h h h i
b
Lx b
Ly b L x f ¼ y^px x^py f ¼ b
Ly b L z f ¼ ih b
Lzf b
Lx; b
L y ¼ ih b
Lz
i i
Answers to Exercises 419
E21.
h i h i h i h i
b
L2; b
Lx ¼ bL 2x þ b
L 2y þ b
L 2z ; b
Lx ¼ bL 2y ; b
Lx þ bL 2z ; b
Lx
by b
¼L Ly Lbx Lbx L by þ b
by L Lz b
Lz b
Lx Lbx b
Lz b
Lz
¼ bLy b
Ly bLx b
Ly bLx b
Ly þ b Ly b
Lx b
Ly bLx b
Ly b
Ly
þ b Lz b
Lz b
Lx b
Lz b
Lx bLz þ b Lz bLx b Lx b
Lz b Lz b
Lz
h i h i h i h i
¼b
Ly b Ly; b
Lx þ b Ly; bLx b Lz b
Ly þ b Lz; b
Lx þ b Lx b
Lz; b Lz
Ly b
¼ ih b Lz b
L z ih b Lz b
L y þ ih b Ly b
L y þ ih b Lz
¼0
When the parameter n increases, more points are sampled, and the function
changes more rapidly.
E24.
b @2 @ 1 @2 b @
L 2 ¼
h2 þ coth þ L z ¼ ih
@h2 @h sin2 h @/2 @/
b ¼
H
h2 2 @2 2 @
r þ VðrÞ r2 ¼ 2 þ
1 b2
L
2l @r r @r r 2 h2
h i h2
h2
h i
b; b
H L 2 ¼ r2 þ VðrÞ; b
L 2 ¼ r2 ; b L 2 þ VðrÞ; b
L2 ¼ 0 þ 0 ¼ 0
2l 2l
h i h2
h2 2 b
h i
b b
H ; L z ¼ r þ VðrÞ; L z ¼ r ; L z þ VðrÞ; b
2 b Lz ¼ 0 þ 0 ¼ 0
2l 2l
E25.
The square of the radial factor is a probability density, thus having units of inverse
of volume, that is, a−3
0 , where a0 = 0.52917721092 10
−10
m is the atomic unit
of length. Therefore, all radial factors come with the factor a−3/2 . In SI units,
3=2
0
E27.
R
/ Hb / ds K Xn
W½/ ¼ R ¼ /¼ cj fj
/ / ds D j¼1
K @K @D
@W @ D @cj K @cj
@K @D
@cj W @cj @K @D
D
¼0 ¼ ¼ ¼0 W ¼0
@cj @cj D2 D @cj @cj
Answers to Exercises 421
0 1 0 1 0 1
a b c r s t arþbuþcy asþbvþcz atþcwþbx
@d e fA @u v x A ¼ @ drþeuþfy dsþevþfz dtþfwþex A
g h i y z w grþhuþiy gsþhvþiz gtþiwþhx
E30.
ðH11 WS11 Þ ðH12 WS12 Þ ðH11 WS11 Þ ðH12 WS12 Þ
ðH WS Þ ðH WS Þ ¼ 0 ðH WS Þ ¼ ðH WS Þ
12 12 22 22 12 12 22 22
ðH11 WS11 Þc1 þ ðH12 WS12 Þc2 ¼ 0
ðH12 WS12 Þc1 þ ðH22 WS22 Þc2 ¼ 0
8 8
> ðH11 WS11 Þ c1 þ c2 ¼ 0 >
> > c2 ¼ ðH11 WS11 Þ
< ðH WS Þ <c ðH12 WS12 Þ
12 12 1
>
> ðH WS12 Þ > ðH WS12 Þ
c1 þ c2 ¼ 0 >
c
: 12 : 2 ¼ 12
ðH22 WS22 Þ c1 ðH22 WS22 Þ
E31.
n Z 2
X
n X gj
V1 ¼ V1j ¼ dvj
j¼2 j¼2
r1j
422 Answers to Exercises
E32.
Because equation (3.154) applies to molecules. A molecule has more than one
nucleus, and VNN represents the nucleus–nucleus repulsion energy. This energy
term does not exist for atoms, because they have one nucleus.
E33.
When applied to atoms, (3.154) does not include the energy term VNN. Note that the
sums in (3.154) are over doubly occupied orbitals. From the definitions of Coulomb
and exchange integrals, (3.155) and (3.156), and the fact that electron labels 1 and 2
are dummy variables, it can be inferred that Jij =Jji andKij =Kji. In addition, Kii =Jii.
Therefore, applying (3.154) to the 1s22s2 electron configuration leads to
E34.
ZZ
wi ð1Þwj ð1Þwj ð2Þwi ð2Þ
Kij ¼ ds1 ds2
r12
If wi ¼ /i a and wj ¼ /j b
ZZ
wi ð1Þwj ð1Þwj ð2Þwi ð2Þ
Kij ¼ ds1 ds2
r12
ZZ Z Z
/i ð1Þ/j ð1Þ/j ð2Þ/i ð2Þ
¼ dv1 dv2 a ð1Þbð1Þdr1 b ð2Það2Þdr2 ¼ 0
r12
due to integration over the spin variables.
E35.
1 1
Jij ¼ /i ð1Þ/j ð2Þ /i ð1Þ/j ð2Þ Kij ¼ /i ð1Þ/j ð2Þ /j ð1Þ/i ð2Þ
r12 r12
Answers to Exercises 423
E37.
n1
r ð2fÞnþ1=2 1
Nn;f efr=a0 where Nn;f ¼
a0 ½ð2nÞ!1=2 a03=2
In atomic units, a0 = 1, and the radial factor of a Slater-type orbital is given by
ð2fÞnþ1=2
Nn;f r n1 efr where Nn;f ¼
½ð2nÞ!1=2
424 Answers to Exercises
E38.
pffiffiffi
W ¼ /(1)/ð2Þ½a(1)b(2) a(2)bð1Þ/ 2
Z
q0 ð1Þ ¼ W W dv2 dr1 dr2
Z Z
1
¼ / ð1Þ/ ð2Þ/ð1Þ/ð2Þdv2 ½a ð1Þb ð2Þ
2
a ð2Þb ð1Þ½að1Þbð2Þ að2Þbð1Þdr1 dr2
1
¼ / ð1Þ/ð1Þ ð1 0 0 þ 1Þ ¼ j/j2
2
E39.
pffiffiffi
W ¼/(1)/(2)[a(1)b(2) a(2)b(1)]/ 2
Z h i Z
b b b ð1Þ/ð1Þdv1
W T ð1Þ þ T ð2Þ Wdv1 dv2 dr1 dr2 ¼ 2 / ð1Þ T
E40.
^0 ¼ r 1 @2 2@
2
1 ^2 ^2 ^ 0 ¼ z
H r2 ¼ þ 2L L Y‘m ¼ ‘ð‘ þ 1ÞY‘m H
2 r @r 2 r @r r
1 1
R1s ¼ 2er Y0;0 ¼ pffiffiffi 1s ¼ pffiffiffi er 2pz ðSTOÞ ¼ R2f Y1;0
2 p p
! rffiffiffi !
5=2
2f 1 3 1
¼ pffiffiffi refr cos h ¼ pffiffiffi f5=2 refr cos h
3 2 p p
Z1 Zp Z2p
f5=2 ðf þ 1Þr 4
z ¼ r cos h h1sjzj2pz i ¼ e r dr 2
cos h sin hdh d/
p
0 0 0
Z1 pffiffiffiffiffi
4 32 f5
¼ f5=2 eðf þ 1Þr r4 dr ¼
3 ð1 þ fÞ5
0
r2 1 r2 1
1s 1s ¼ 0:5
R2f R2f
2 r 2 r
Z1
2f5 1
¼ e2fr ðf2 r2 4fr þ 2rÞdr ¼ ð2 3fÞf3
3 6
0
0 0
¼ h1sjzj1si ¼ 0 J ¼ R2f H
^ R2f 1sH
^ 1s
ð1Þ
E0
ð2Þ
þ 2h1sjzj2pz i E0 minJðf ¼ 0:844Þ
Answers to Exercises 425
E41.
eiss 1 dðeiss 1Þ=ds
s ¼ xnm
x lim ¼ ¼ is
s!0 s ds=ds
E42.
The lowest and highest wavelengths of the visible region are approximately equal to
400 and 800 nm. The 400 nm wavelength is 24 = 16 times more intensely scattered
than the wavelength at 800 nm. This explains the color of the sky.
426 Answers to Exercises
E44.
The intensity of the Stokes line is proportional to the population of level a in the
vibrational transition a !b, whereas the intensity of the corresponding anti-Stokes
line is proportional to the population of the excited level b in the vibrational
transition b !a. Hence, using the Boltzmann distribution and m4-dependence of the
Stokes and anti-Stokes lines, we can write
E46.
For C2H6 at 298 K, EMP2/6-311G*− EHF/6-311G* −177.7 kcal mol−1, whereas for the
chemical reaction C2H4(g) + H2(g) ! C2H6(g) at 298 K, EMP2/6-311G*− EHF/6-311G*
1.7 kcal mol−1. The MP2 correlation correction for the ethane molecule
corresponds to 0.36 % of the Hartree–Fock energy and is about the energy of two
C–H bonds. For the chemical reaction, the MP2 correlation correction is less than
1 % of the absolute value of the corresponding value for the molecule. This finding
suggests that the correlation energy is mainly due to the electrons of each electron
pair. Now, since the number of electron bond pairs is conserved for reactants and
products, the correlation correction is appreciably lower than for a molecule. In
addition, the correlation correction for a reaction is not necessarily negative, since it
is the correlation correction for the products minus the correlation correction for the
reactants.
E1.
(a) C3 (b) C3 (c) C4 (d) C5 (e) C2v (f) D3h (g) D6h (h) D∞h (i) Td
428 Answers to Exercises
E6.
X XX XX X
Bkk ¼ X 1 ki Aij Xjk ¼ Aij Xjk X 1 ki ¼ Aij X X 1 ji
k X
k i;j X k i;j i;j
¼ Aij dji ¼ Aii
i;j i
ðQEDÞ
E7.
(a)
A ¼ X 1 AX ) XAX 1 ¼ A
(b)
B ¼ X 1 AX ) XBX 1 ¼ A
(c)
A ¼ X 1 BX A ¼ Y 1 CY
Y 1 CY ¼ X 1 BX C ¼ YX 1 BXY 1
1
C ¼ XY 1 B XY 1 C ¼ Z 1 BZ
E9.
Infrared active vibrations transform like x, y, and z, and for a point group with a center
of inversion, these functions are of plus–minus- or u-type. In turn, Raman active
vibrations transform like the quadratic functions x2, y2, z2, xy, xz, and yz, and for a
point group with a center of inversion, these functions are of plus–plus- or g-type.
E10.
E1.
X Zn Z
q0 ðrÞ
/ðPÞ ¼ dv
n
rnP reP
The first term of this expression is the electrostatic potential, which results from
the discrete distribution of nuclei of the molecule. The second term results from the
continuous distribution of electronic charge, where q0 ðrÞdv is the electronic charge
on the volume element dv centered at P.
E2.
1 1
W0 ð1; 2Þ ¼ pffiffiffi j/1 að1Þ/1 bð2Þj ¼ pffiffiffi /1 ð1Þ/1 ð2Þ½að1Þbð2Þ að2Þbð1Þ
2 2
/1 ¼ MO /1 ¼ c11 v1 þ c21 v2 v1 ; v2 ¼ AOs v1 ¼ 1s1 v2 ¼ 1s2
Z
q0 ðrÞ ¼ 2 W0 ð1; 2ÞW0 ð1; 2Þdv2 dr1 dr2 q0 ðrÞ ¼ 2/1 /1 ¼ 2c211 v21 þ 2c221 v22 þ 4c11 c21 v1 v2
Z Z
q0 ðrÞdv ¼ 2c211 þ 2c221 þ 4c11 c21 S12 ¼ 2 S12 ¼ v1 v2 dv
1
c211 þ c221 þ 2c11 c21 S12 ¼ 1 c211 ¼ c221 jc11 j ¼ jc21 j ¼ pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
2ð1 þ S12 Þ
Answers to Exercises 433
E4.
Full Mulliken population matrix element between 1s2 and 1s3: −0.01993.
Density matrix element between 1s2 and 1s3: −0.11568.
Overlap matrix element between 1s2 and 1s3: −0.01993/−0.11568 = 0.172286.
E5.
E6.
pffiffiffi
df2 ¼ a2 + a2 df ¼ 2 a
2 2
dc df a 2 pffiffiffi
¼ þ dc ¼ 3 a
2 2 2
rffiffiffi rffiffiffi
df dc a a 2 2
¼ sin sin ¼ a ¼ 2 arcsin a ¼ 109:4712
2 2 2 2 3 3
E7.
hA ¼ 12 ðs þ px þ py þ pz Þ hB ¼ 12 ðs px þ py pz Þ
hC ¼ 12 ðs þ px py pz Þ hD ¼ 12 ðs px py þ pz Þ
SAA ¼ 14 ð1 þ 1 þ 1 þ 1Þ ¼ 1 SAA ¼ SBB ¼ SCC ¼ SDD ¼ 1
SAB ¼ 14 ð1 1 þ 1 1Þ ¼ 0 SAB ¼ SAC ¼ SAD ¼ SBC ¼ SBD ¼ SCD ¼ 0
Answers to Exercises 435
pffiffiffi
1 pffiffiffi 1 pffiffiffi px 3py
hA ¼ pffiffiffi ðs þ 2px Þ hB ¼ pffiffiffi s þ 2
3 3 2 2
pffiffiffi
1 pffiffiffi px 3py
hC ¼ pffiffiffi s þ 2 þ
3 2 2
Z
1 1
SAB ¼ hA hB dv ¼ 1 þ 2 0 ¼ 0 SAB ¼ SAC ¼ SBC ¼ 0
3 2
SAA ¼ SBB ¼ SCC ¼ 1
E9.
1 1
hA ¼ pffiffiffi ðs þ px Þ hB ¼ pffiffiffi ðs px Þ
2 2
Z
1
SAB ¼ hA hB dv ¼ ð1 1Þ ¼ 0 SAA ¼ SBB ¼ 1
2
436 Answers to Exercises
E10.
1
pffiffiffiffiffi 1
pffiffiffiffiffi
hA ¼ pffiffiffiffiffiffiffiffiffiffiffiffiffi s þ kA pA hB ¼ pffiffiffiffiffiffiffiffiffiffiffiffiffi s þ kB pB
1 þ kA 1 þ kB
1
pffiffiffiffiffiffiffiffiffiffi pffiffiffiffiffi pffiffiffiffiffi
ShA hB ¼ pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi Sss þ kA kB SpA pB þ kA SpA s þ kB SspB
ð 1 þ kA Þ ð 1 þ kB Þ
1
pffiffiffiffiffiffiffiffiffiffi pffiffiffiffiffi pffiffiffiffiffi
¼ pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi 1 þ kA kB SpA pB þ kA 0 þ kB 0 ¼ 0
ð 1 þ kA Þ ð 1 þ kB Þ
pffiffiffiffiffiffiffiffiffiffi 1
1 þ kA kB SpA pB ¼ 0 SpA pB ¼ pffiffiffiffiffiffiffiffiffiffi
kA kB
E11.
E12.
The naturally occurring form is the R,R stereoisomer. The R,S stereoisomer is
optically inactive.
Answers to Exercises 437
Chapter 6—Crystals
E1.
Comparing with Fig. 6.6, all distances now have a scaling factor 2r.
E2.
Considering Fig. 6.6, the distance from point c (the intersection of the threefold axis
with the plane of the disks) to the disk centers is equal to d = 1/√3. Hence, x =
d cosh and y = d sinh. Note that when r = 1/2, d = 1.
Mathematica code and results:
438 Answers to Exercises
Inspection of the above (x,y) coordinates shows that disks are drawn in the
counterclockwise direction in the first two graphs and in the clockwise direction in
the last two graphs.
E3.
Consider a hexagonal prism as repeating unit (see Fig. 6.8). Each sphere in a prism
vertex is shared by six repeating units, thus being considered with a factor 1/6 in the
packing density. The centers of the bottom and top of the hexagonal prism are
occupied by one sphere that is shared by two repeating units, thus having the factor
1/2. Note that there are three spheres inside the hexagonal prism counting 1 each of
them. The volume of the spheres in one repeating unit is given by
1 1 1 4
6 + 6 þ 2 þ 3 pr 3 ¼ 8pr 3
6 6 2 3
In turn, the volume of the hexagonal prism can be obtained by adding the volume of
three parallelepipeds whose height is given by 2 2(√2/√3)r (see answer to E1).
The edge of the base parallelogram of each parallelepiped is given by 2r, which
multiplies by r√3. Therefore, the volume of the hexagonal prism is given by
pffiffiffi
pffiffiffi 2 2 pffiffiffi
3 2r r 3 2 pffiffiffi r ¼ 3 8 2r 3
3
The packing density of the hcp structure of identical spheres is obtained by
dividing the volume of the spheres contained within one hexagonal prism as
repeating unit of the hcp structure by the total volume of the hexagonal prism,
8pr 3 p
pffiffiffi ¼ pffiffiffi 0:74
3 8 2r 3 3 2
Answers to Exercises 439
E10.
1 1
/ðeÞ ¼ ) e ¼ l þ kB T ln 1
1 þ exp½ðe lÞ=ðkB TÞ /
1
l ¼ 3:24 eV T ¼ 7000 K ) e ¼ 3:24 þ kB 7000 ln 1 ¼ 3:90 eV:
0:25
Chapter 7—Water
E1.
P
The definition of the dipole moment vector is given by l ¼ k qk rk :
For a pure electric dipole, that is, two electric charges +q and −q separated by a
distance d, the above expression is consistent with the dipole moment vector being
directed from the negative to the positive charge. In fact, if we take the origin of the
dipole moment at the positive charge, the positional vector r+ is zero (qr+ = 0), and
the positional vector that points to the negative charge r− is multiplied by the
negative charge −q, thus giving a dipole moment vector l = qr+ + (−q)r− = −qr−
that points to the positive charge.
E2.
X X X X X
qi r0i ¼ qi (ri + R) ¼ qi r i þ R qi ¼ qi ri
|fflffl{zfflffl}
0
E3.
-
- - -
Answers to Exercises 443
This graph shows that for the same molecule (H2O) and method of calculation
(B3LYP), the increase of the basis set lowers the ground state energy, as stated by
the variational principle (see Sect.3.7).
E7.
sffiffiffi rffiffiffiffiffiffiffiffi rffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi pffiffiffi
k 16 2 16 1 xOH lOD 32 17
x¼ lOD ¼ lOH ¼ 2
l 18 17 xOD lOH 18 16
E8.
H2O belongs to the C2v point group. The antisymmetric stretching transforms as B2;
the symmetric stretching and the bending transform as A1.
DHO belongs to the Cs point group. The normal modes are totally symmetric
(transform as A′) because all displacement vectors are in the molecular plane.
444 Answers to Exercises
E9.
12.8
E11.
ns ns ns = 1 mol dm3 cs
xs ¼ ¼ ¼ ¼ cs VS
ns þ nS nS nS = 1 mol dm3 cS
Answers to Exercises 445
where cs is the solute molarity, cS is the solvent molarity (for water, cS = 55.55 mol
dm−3). Note that the inverse of cS is the molar volume of the solvent in dm3 (for
water, Vm = 0.018 dm3 mol−1).
E12.
X
E13.
An oxidizing agent that converts H2O into O2 and a reducing agent that converts
H2O to H2 are not thermodynamically stable in aqueous solution. Redox couples
that are thermodynamically stable in water lie between the above sloping straight
lines e = e(pH) (region marked gray).
Mathematica code and results:
446 Answers to Exercises
E14.
E15.
E18.
RT RT
ec = lnaAg þ ðcÞ = 0.05916 log aAg þ ðcÞ ea ¼ lnaAg þ ðaÞ = 0.05916 log aAg þ ðaÞ
F F
aAg þ ðcÞ
ec [ ea De = ec ea = 0.05916 log [ 0 ) aAg þ ðcÞ [ aAg þ ðaÞ
aAg þ ðaÞ
Index
A 6-31++G**, 188
Ab initio calculations, 183, 204, 339 6-311G, 187, 210, 340, 402
Absorption of radiation. See Einstein A and B aug-cc-pVTZ, 153, 188, 271, 340, 345,
coefficients 351–353, 356–359
Acidity, relative, 386 cc-pVTZ, 130, 148, 188, 251, 255,
Acids 266–268, 270, 273, 274, 276, 338, 340,
Bronsted–Lowry, 381, 387, 391 346–349
Lewis, 386, 387, 392 STO-3G, 185, 187, 257, 402
Activation energy. See Arrhenius activation Basis set
energy double-zeta, 187
Activity coefficient, 46, 369, 373, 374, 384, polarization, 187, 236, 281, 287, 288
388, 392, 403 split-valence, 187
Addition reaction, 387 triple-zeta, 187
Adiabatic valence double-zeta, 187
compression, 33, 68, 76, 79 valence triple-zeta, 188
expansion, 33, 67, 76, 79 Bernoulli, 3
transformation, 1, 35, 66, 68, 72, 76, 79 Boiling points of linear alkanes, 355
Affinity of chemical reaction, 42, 54, 74, 76, 79 Boltzmann distribution
Angular momentum Boltzmann constant, 3, 177, 277, 317
orbital, 133, 136, 188, 207 degeneracy degree, 168, 277
spin, 135, 143 population fraction, 277
Anharmonicity. See Morse function Bond orbital
Antiderivative, 20 p, 265
Antisymmetric function, 141 r, 265
Antisymmetry principle, 139, 140, 205 Bond orbitals, natural, 264–266, 283, 288, 347
Arrhenius activation energy, 93, 113 Born–Oppenheimer approximation, 145, 154,
Arrhenius A-factor, 92, 111 162, 205, 339, 349
Associated Legendre polynomials, 134, 135 Boyle. See Ideal gas
Avogadro Bracket notation, 144
constant, 2 Bravais lattices
hypothesis, 2 three-dimensional, 305, 333
Axis of rotation, 246 two-dimensional, 302–304
Buffer capacity, 385, 401
B
B3LYP method, 148, 153, 402 C
Basis functions Cahn–Ingold–Prelog rules applied to
3-21G, 187, 402 bromochlorofluoromethane, 216, 278, 279
6-31G, 187, 340, 376, 402 carvone, 279
6-31G**, 187 Calculus, fundamental theorem of, 20, 157
Plane of reflection, 214, 215, 222, 246, 357 Reaction rate. See Rate of chemical reaction
Point groups, 245 Reduced mass, 124, 137, 207
abelian, 217, 225, 245, 247 Representation of point group
commutative, 217 character table, 224, 225
dichotomy diagram, 218 class, 224, 238
multiplication table, 224, 225, 241, 247 in x, y, z-basis, 224, 323
principal axis, 217, 225, 246 in x,y,z-basis, 225, 228
properties, 217, 245 irreducible, 225, 226, 228, 229, 231, 233,
Polarimeter, 282, 283, 288 245, 246
Polar plot, 135 totally symmetric, 225, 226, 229
Population analysis.See Mulliken population Resistivity, electrical
analysis; Natural bond orbitals for insulators, 316, 317
Potential energy surface for metallic conductors, 316
first order saddle point, 269 semiconductors, 316, 317, 319
Pressure, 2–5, 9, 12–16, 23–28, 30, 31, 37, 39, Reversible cycle, 33, 35, 36, 68, 76, 77, 79, 80
41, 42, 44–49, 52, 53, 55–57, 65, 70, Rigid rotor, two-particle, 190, 209, 243
71, 74, 76–83, 96, 123, 308, 355, 360, Rotation–reflection axis, 214–216
363, 364, 368, 369, 382, 388, 389 Rotations, molecular, 146
Prigogine, 37, 77, 81, 99, 111
Process dependent quantity, 26 S
Saddle point
R first- order, 269, 285, 289
Radial distribution function, 138 Schrödinger equation
Raman depolarization ratio, 236 for atoms, 143, 147
Raman scattering, 245, 246 for molecules, 146
anti-Stokes transition, 178 one-dimensional, 117, 119, 127
carbon tetrachloride spectrum, 178, 365, Second law
367, 402 Clausius’s inequality, 83
classical treatment, 125, 179 Secular equation, 105
depolarization ratio, 236, 237 Selection rules
discrete resonance, 182, 183, 230 and symmetry, 230, 231, 233–238, 241,
normal, 182 245, 246
preresonance, 182 for absorption and stimulated emission, 229
quantum-mechanical treatment, 180 for a harmonic oscillator, 229, 244
resonance, 178, 182 for a particle in a one-dimensional box,
Stokes transition, 178 229, 242
transition polarizability, 181, 182 for Raman scattering, 229
virtual state, 181, 182, 230 for two-particle rigid rotor, 191, 229
Rate of chemical reaction Self-consistent field, 150, 153, 205
experimental rate equation, 88, 94, 95, 112 Semiconductor
extent of chemical reaction, 41, 77, 81 doping, 318
first-order, 113 n-type, 319
half-life, 90, 112 p-type, 319
overall order, 88 Semiempirical calculations, 183, 207
partial orders, 88, 112 Set of basis functions, 144, 153, 184
pseudo-order rate constant, 88, 112 Sigma (r) bond, 214
pseudo-order rate equation, 88 Similarity transformation, 106, 107, 223, 224,
rate constant, 88, 90, 92, 111, 113 238, 239
second-order, 90, 97 Similar matrices, 106, 107, 223
zeroth-order, 91 Singlet, 142
Reaction coordinate, 288 Slater determinant, 141, 142, 148, 150, 151,
intrinsic, 270, 271 153, 159, 160, 162, 207, 255, 345
transition state, 94, 270, 271 Slater-type orbital, 150, 184, 187, 199
456 Index