SUPERSTRINGS
AND OTHER
THINGS
A GUIDE
TO
PHYSICS
About the Author
Carlos I Calle received his PhD in theoretical nuclear
physics from Ohio University. He is a senior research
scientist at NASA Kennedy Space Center where he
leads the electromagnetic physics research group. Dr
Calle is currently working on the problem of electrostatic phenomena on planetary surfaces, particularly
on Mars and the Moon, developing instrumentation
for future planetary exploration missions. He is the
principal investigator for the electrostatic studies of
Martian soil and dust and for the electrometer calibration project for the Mars Surveyor mission. He is also
project manager for the study of the electrostatic
properties of lunar soil and dust.
His earlier research work involved the development
of a theoretical model for a microscopic treatment of
particle scattering. He also introduced one-particle
excitation operators in a separable particle-hole
Hamiltonian for the calculation of particle excitations.
As a professor of physics for many years, he taught the
whole range of college physics courses. He has published over eighty scienti®c papers and been invited
to participate in international scienti®c conferences.
He has been the recipient of ten research grants from
NSF, from NASA, and from private foundations.
SUPERSTRINGS
AND OTHER
THINGS
A GUIDE
TO
PHYSICS
CARLOS I CALLE
NASA Kennedy Space Center
Institute of Physics Publishing
Bristol and Philadelphia
# IOP Publishing Ltd 2001
All rights reserved. No part of this publication may be reproduced,
stored in a retrieval system or transmitted in any form or by any means,
electronic, mechanical, photocopying, recording or otherwise, without
the prior permission of the publisher. Multiple copying is permitted in
accordance with the terms of licences issued by the Copyright Licensing
Agency under the terms of its agreement with the Committee of ViceChancellors and Principals.
British Library Cataloguing-in-Publication Data
A catalogue record for this book is available from the British Library.
ISBN 0 7503 0707 2
Library of Congress Cataloging-in-Publication Data are available
Commissioning Editor: Nicki Dennis
Production Editor: Simon Laurenson
Production Control: Sarah Plenty
Cover Design: FreÂdeÂrique Swist
Marketing Executive: Laura Serratrice
Published by Institute of Physics Publishing, wholly owned by The
Institute of Physics, London
Institute of Physics, Dirac House, Temple Back, Bristol BS1 6BE, UK
US Of®ce: Institute of Physics Publishing, The Public Ledger Building,
Suite 1035, 150 South Independence Mall West, Philadelphia, PA 19106,
USA
Typeset by Academic Technical Typesetting, Bristol
Printed in the UK by J W Arrowsmith Ltd, Bristol
To Dr Luz Marina Calle,
Fellow NASA Scientist and Wife
and to our son Daniel
CONTENTS
PREFACE
xv
PART 1: INTRODUCTORY CONCEPTS
1
PHYSICS: THE FUNDAMENTAL SCIENCE
What is physics?
The scienti®c method: learning from our mistakes
Physics and other sciences
Sizes of things: measurement
Fundamental units
Physics and mathematics
Frontiers of physics: Very small numbers
Pioneers of physics: Measuring the circumference of the Earth
3
3
7
9
13
15
18
19
20
PART 2: THE LAWS OF
MECHANICS
2
THE DESCRIPTION OF MOTION
25
Understanding motion
Uniform motion
Average speed
The frontiers of physics: Friction
Instantaneous speed
Velocity: Speed and direction
Vectors
Acceleration
Uniformly accelerated motion
Falling bodies
Pioneers of physics: Galileo's method
The motion of projectiles
25
26
27
29
30
31
31
34
35
37
39
40
vii
SUPERSTRINGS AND OTHER THINGS
3
4
5
6
THE LAWS OF MECHANICS:
NEWTON'S LAWS OF MOTION
43
The concept of force
The ancient idea of motion
The birth of modern science
Pioneers of physics: Galileo's dialog with Aristotle
Galileo formulates the Law of Inertia
Physics in our world: The Leaning Tower of Pisa
Newton's First Law: Law of inertia
Physics in our world: Car seat belt
Newton's Second Law: Law of force
Newton's Third Law: Law of action and reaction
43
44
45
47
48
50
52
54
56
59
ENERGY
62
What is energy?
The concept of work
Units of work and energy
The concept of energy
Pioneers of Physics: James Prescott Joule (1818±1889)
The work-energy theorem
Conservative and nonconservative forces
62
62
66
66
67
74
75
CONSERVATION OF ENERGY AND
MOMENTUM
78
Transformation of energy
The principle of conservation of energy
The energy of mass
Ef®ciency
Pioneers of physics: The physicists' letters
Power
Physics in our world: Automobile ef®ciency
Impulse and momentum
Physics in our world: Air bags
Conservation of momentum
Elastic and inelastic collisions
Cannons and rockets
78
80
81
82
83
85
86
88
90
91
94
95
ROTATION AND THE UNIVERSAL LAW
OF GRAVITATION
98
Rotational motion
The frontiers of physics: CD-ROM drives
Torque and angular momentum
viii
98
101
102
Contents
Physics in our world: Twisting cats
Centripetal acceleration
Satellites
Origins of our view of the universe
Kepler's laws of planetary motion
Newton's law of universal gravitation
The frontiers of physics: Measuring the distance to the Moon
Spacecraft and orbital motion
The frontiers of physics: The Global Positioning Satellite System
106
108
110
110
114
118
124
125
128
PART 3: THE STRUCTURE OF
MATTER
7
8
9
ATOMS: BUILDING BLOCKS OF THE
UNIVERSE
133
The underlying structure of matter
The Atomic Hypothesis
Early concept of the atom
First models of the atom
Waves and quanta
The Bohr model of the atom
Molecules
Physics in our world: Winemaking
133
133
134
136
141
145
147
149
THE HEART OF THE ATOM:
THE NUCLEUS
151
Raw material: Protons and neutrons
Pioneers of physics: Heisenberg's failing grade
The composition of the nucleus
The glue that keeps the nucleus together
Size and shape of the nucleus
Nuclear energy levels
151
152
153
155
159
161
FLUIDS
164
States of matter
Density
Pressure
Buoyancy
Surface tension and capillarity
Fluids in motion
The human cardiovascular system
Physics in our world: Curve balls
164
165
166
176
179
184
186
187
ix
SUPERSTRINGS AND OTHER THINGS
PART 4: THERMODYNAMICS
10 H E A T A N D T E M P E R A T U R E
Heat as a form of energy
Pioneers of physics: Count Rumford
Measuring temperature
Temperature and heat
Physics in our world: Thermography
Heat capacity
Heat of fusion and heat of vaporization
Evaporation and boiling
Physics in our world: Instant ice cream
Humidity
Thermal expansion
The unusual expansion of water
11 T H E L A W S O F T H E R M O D Y N A M I C S
The four laws of thermodynamics
The ideal gas law
Physics in our world: Automobile engines
The zeroth law of thermodynamics
The ®rst law of thermodynamics
The second law of thermodynamics
The third law of thermodynamics
The frontiers of physics: Entropy that organizes?
Entropy and the origin of the Universe
Entropy and the arrow of time
195
195
197
199
201
204
205
207
210
213
213
215
218
221
221
221
224
225
226
229
233
234
235
239
PART 5: ELECTRICITY AND
MAGNETISM
12 E L E C T R I C I T Y
247
Electromagnetism
Electric charge
Coulomb's law
The electric ®eld
The fundamental charge
The frontiers of physics: Electrostatics on Mars
Electric potential
Storing electrical energy
The frontiers of physics: Storing single electrons
Physics in our world: Inkjet printers
x
247
248
250
253
256
258
260
262
264
265
Contents
13 A P P L I E D E L E C T R I C I T Y
Conductors and insulators
Electric current and batteries
Ohm's Law
Physics in our world: Electric cars
The frontiers of physics: Electric dentists
Simple electric circuits
Resistor combinations
Electrical energy and power
Semiconductors
Superconductors
14 E L E C T R O M A G N E T I S M
The discovery of magnets
The magnetic ®eld
Physics in our world: Magneto-optical drives
Electric currents and magnetism
A moving charge in a magnetic ®eld
Particle accelerators
Magnetism of the earth
Physics in our world: Avian magnetic navigation
The source of magnetism
Faraday's law of induction
Motors and generators
Maxwell's equations
Physics in our world: Microwave ovens
267
267
268
270
271
275
275
278
280
281
287
291
291
292
295
295
298
301
304
307
307
309
312
315
317
PART 6: WAVES
15 W A V E M O T I O N
321
The nature of waves
The principle of superposition
Resonance and chaos
The frontiers of physics: Chaos in the brain
Water waves
Seismic waves
16 S O U N D
321
325
333
336
337
339
342
The nature of sound
The speed of sound
Physics in our world: Telephone tones
Intensity of sound waves
xi
342
343
344
346
SUPERSTRINGS AND OTHER THINGS
The ear
The frontiers of physics: Electronic ear implants
The sound of music
Musical instruments
The Doppler effect
Shockwaves
Ultrasound
17 O P T I C S
347
352
352
357
362
365
367
370
Waves of light
Re¯ection of light
Re¯ection from mirrors
Curved mirrors
Refraction of light
The frontiers of physics: Gradient-index lenses
Total internal re¯ection
Optical instruments
The human eye
The frontiers of physics: Arti®cial vision
18 T H E N A T U R E O F L I G H T
The wave nature of light
The speed of light
The electromagnetic spectrum
Color
Spectra: The signature of atoms
Young's experiment
Polarization
Lasers
Physics in our world: Compact disc player
Holography
370
370
374
376
380
389
390
393
399
403
405
405
406
411
413
417
421
425
430
434
435
PART 7: MODERN PHYSICS
19 T H E S P E C I A L T H E O R Y O F R E L A T I V I T Y
Galilean relativity
The Michelson±Morley experiment
Einstein's postulates
Time dilation
The frontiers of physics: Intergalactic travel
Simultaneity
Length contraction
xii
441
441
445
449
453
458
460
462
Contents
Addition of velocities
E mc2
464
465
20 T H E G E N E R A L T H E O R Y O F R E L A T I V I T Y
The principle of equivalence
Warped spacetime continuum
The bending of light
The perihelion of Mercury
The gravitational time dilation
The frontiers of physics: Orbiting clocks
Black holes
The frontiers of physics: Spacetime drag
21 T H E E A R L Y A T O M I C T H E O R Y
The physics of the atom
Black body radiation
The photoelectric effect
The Bohr model of the atom revisited
Physics in our world: Using photons to detect tumors
De Broglie's waves
Quantum mechanics
22 Q U A N T U M M E C H A N I C S
The beginnings of quantum mechanics
The new mechanics of the atom
Wave mechanics
Pioneers of physics: SchroÈdinger's inspired guess
Heisenberg's uncertainty principle
The new physics
The frontiers of physics: Knowledge and certainty
Physics in our world: Electron microscopes
The frontiers of physics: Quantum teleportation
23 N U C L E A R P H Y S I C S
469
469
473
479
483
485
488
489
494
496
496
496
499
503
504
506
509
511
511
511
514
516
517
521
522
523
530
532
Beyond the atom
Radioactivity
Nuclear reactions
Nuclear energy: Fission and fusion
Applications of nuclear physics
Pioneers of physics: Enrico Fermi (1901±1954)
Physics in our world: Proton beams for cancer therapy
xiii
532
532
538
540
546
547
552
SUPERSTRINGS AND OTHER THINGS
24 E L E M E N T A R Y P A R T I C L E S
Antimatter
The fundamental forces
Exchange forces
Pions
Particle classi®cation: Hadrons and leptons
Conservation laws
Strange particles
Quarks
Pioneers of physics: Gell-Mann's quark
Particles with charm
25 S U P E R F O R C E : E I N S T E I N ' S D R E A M
Symmetry
Global and local symmetries
The electroweak uni®cation
The color force
GUTs, the third uni®cation
Supersymmetry and superstrings
The creation of the universe
The ®rst moments of the universe
The frontiers of physics: The cosmic background explorer
553
553
555
557
559
561
563
565
565
568
568
571
571
573
575
581
584
585
588
591
594
Appendix A P O W E R S O F T E N
596
Appendix B T H E E L E M E N T S
599
Appendix C N O B E L P R I Z E W I N N E R S I N
PHYSICS
602
Appendix D P H Y S I C S T I M E - L I N E
608
Glossary
615
Index
632
xiv
PREFACE
As a research scientist at NASA Kennedy Space Center working
on planetary exploration, I am very fortunate to be able to experience ®rst hand the excitement of discovery. As a physicist, it is not
surprising that I ®nd science in general and physics in particular
captivating. I have written this book to try to convey my excitement and fascination with physics to those who are curious
about nature and who would like to get a feeling for the thrills
that scientists experience at the moment of discovery.
The advances in physics that have taken place during the
twentieth century have been astounding. One hundred years
ago, Max Planck and Albert Einstein introduced the concept of
the quantum of energy that made possible the development of
quantum mechanics. This revolutionary theory opened the
doors for the breathtaking pace of innovation and discovery
that we have witnessed during the last ®fty years.
At the beginning of the new century, physics continues its
inexorable pace toward new discoveries. An exciting new
theory might give us the ``theory of everything,'' the uni®cation
of all the forces of nature into one single force which would
reveal to us how the universe began and perhaps how it will end.
Although these exciting new theories are highly mathematical, their conceptual foundations are not dif®cult to understand.
As a college professor for many years, I had the occasion to
teach physics to nonscience students and to give public lectures
on physics topics. In those lectures and presentations, I kept the
mathematics to a minimum and concentrated on the concepts.
The idea for this book grew out of those experiences.
This book is intended for the informed reader who is interested in learning about physics. It is also useful to scientists in
other disciplines and to professionals in non-scienti®c ®elds.
The book takes the reader from the basic introductory concepts
xv
SUPERSTRINGS AND OTHER THINGS
to discussions about the current theories about the structure of
matter, the nature of time, and the beginning of the universe.
Since the book is conceptual, I have kept simple mathematical formulas to a minimum. I have used short, simple algebraic derivations in places where they would serve to illustrate the discovery
process (for example, in describing Newton's incredible beautiful
discovery of the universal law of gravitation). These short forays
into elementary algebra can be skipped without loss of continuity. The reader who completes the book will be rewarded with
a basic understanding of the fundamental concepts of physics
and will have a very good idea of where the frontiers of physics
lie at the present time.
I have divided the book into seven parts. Part 1 starts with
some introductory concepts and sets the stage for our study of
physics. Part 2 presents the science of mechanics and the study
of energy. Part 3 follows with an introduction to the structure
of matter, where we learn the story of the atom and its nucleus.
The book continues with thermodynamics in Part 4, the conceptual development of electricity and magnetism in Part 5, waves
and light (Part 6), and ®nally, in Part 7, with the rest of the
story of modern physics, from the development of quantum
theory and relativity to the present theories of the structure of
matter.
Acknowledgments
I wish to thank ®rst my wife, Dr Luz Marina Calle, a fellow NASA
research scientist and my invaluable support throughout the
many years that writing this book took. She witnessed all the
ups and downs, the dif®culties, setbacks, and the slow progress
in the long project. She read the entire manuscript and offered
many suggestions for clari®cation, especially in the chapters
where, as a physical chemist, she is an expert.
I wish to thank Professor Karen Parshall, of the University of
Virginia, who very carefully and thoroughly read the ®rst draft of
the ®rst four chapters and made many suggestions. I also thank
Professors George H Lenz, Scott D Hyman, Joseph Giammarco,
and Robert L Chase in the physics department at Sweet Briar
College, who read all or part of the manuscript and offered
xvi
Preface
many comments. I am grateful to Karla Faulconer for many of the
illustrations that appear in the book. For their help with different
aspects of the preparation of the manuscript, I am indebted to
Gwen Hudson, Rebecca Harvey, and Rachelle Raphael. I would
especially like to acknowledge the invaluable help of my son
Daniel, now a software engineer at Digital Paper, who read the
entire manuscript, made many important suggestions and was
my early test for the readability of many dif®cult sections.
No book can be written without the important peer review
process. The criticisms, corrections and, sometimes, praise
made the completion of this book possible. Over a dozen
university physics professors reviewed this book during the
different stages of its development. I wish to thank them for
their invaluable advice. The work of two reviewers was particularly important in the development of the book. I appreciate the
comprehensive reviews of Professor Michael J Hones at Villanova
University, who reviewed the manuscript four times, offering
criticism and advice every time. Professor Kirby W Kemper at
Florida State University, reviewed the book several times and
suggested changes, corrections, and better ways to describe or
explain a concept. The book is better because of them.
Finally, I wish to thank Nicki Dennis, Simon Laurenson and
Victoria Le Billon at IOP Publishing, and Graham Saxby, for their
understanding and for their ef®ciency in converting my manuscript into this book.
Carlos I Calle
Kennedy Space Center, Florida
xvii
1
PHYSICS: THE
FUNDAMENTAL
SCIENCE
What is physics?
Physics deals with the way the universe works at the most fundamental level. The same basic laws apply to the motion of a falling
snow¯ake, the eruption of a volcano, the explosion of a distant
star, the ¯ight of a butter¯y or the formation of the early universe.
It is not dif®cult to imagine that, some thirty thousand years
ago, during a cold, dark spring night, a young child, moved perhaps by the pristine beauty of the starry sky, looked at his mother
and, in a language incomprehensible to any of us today, asked
her: ``Mother, who made the world?''
To wonder how things come about is, of course, a universal
human quality. As near as we can tell, human beings have been
preoccupied with the origin and nature of the world for as long
as we have been human. Each of us echoes the words of the
great Austrian physicist Erwin SchroÈdinger, ``I know not whence
I came nor whither I go nor who I am,'' and seeks the answers.
Here lies the excitement that this quest for answers brings to
our minds. Today, scientists have been able to pierce a few of the
veils that cloud the fundamental questions that whisper in our
minds with a new and wonderful way of thinking which is
®rmly anchored in the works of Galileo, Newton, Einstein,
Bohr, SchroÈdinger, Heisenberg, Dirac and many others whom
we shall meet in our incursion into the world of physics.
Physics, then, attempts to describe the way the universe
works at the most basic level. Although it deals with a great
variety of phenomena of nature, physics strives for explanations
with as few laws as possible. Let us, through a few examples,
taste some of the ¯avor of physics.
3
SUPERSTRINGS AND OTHER THINGS
Figure 1.1. The laws of physics apply to a falling snow¯ake (courtesy
W P Wirgin), the explosion of a star or the eruption of a volcano (courtesy
NASA).
We all know that if we drop a sugar cube in water, the sugar
dissolves in the water and as a result the water becomes thicker,
denser; that is, more viscous. We, however, are not likely to pay a
great deal of attention to this well-known phenomenon. One
inquisitive mind did.
One year after graduating from college, the young Albert
Einstein considered the same phenomenon and did, indeed,
pay attention to it. Owing to his rebellious character, Einstein
had been unable to ®nd a university position as he had wanted
and was supporting himself with temporary jobs as tutor or as
a substitute teacher. While substituting for a mathematics teacher
in the Technical School in Winterthur, near Zurich, from May to
July 1901, Einstein started thinking about the sweetened water
4
Physics: The Fundamental Science
problem. ``The idea . . . may well have come to Einstein as he was
having tea,'' writes a former collaborator of Einstein.
Einstein simpli®ed the problem by considering the sugar
molecules to be small hard bodies swimming in a structureless
¯uid. This simpli®cation allowed him to perform calculations
that had been impossible until then and that explained how the
sugar molecules would diffuse in the water, making the liquid
more viscous.
This was not suf®cient for the twenty-two-year-old scientist.
He looked up actual values of viscosities of different solutions of
sugar in water, put these numbers into his theory and obtained
from his equations the size of sugar molecules! He also found
a value for the number of molecules in a certain mass of any
substance (Avogadro's number). With this number, he could
calculate the mass of any atom. Einstein wrote a scienti®c paper
with his theory entitled ``A New Determination of the Sizes of
Molecules.''
Figure 1.2. Albert Einstein.
5
SUPERSTRINGS AND OTHER THINGS
On the heels of this paper, Einstein submitted for publication
another important paper on molecular motion, where he
explained the erratic, zigzag motion of individual particles of
smoke. Again, always seeking the fundamental, Einstein was
able to show that this chaotic motion gives direct evidence of
the existence of molecules and atoms. ``My main aim,'' he
wrote later, ``was to ®nd facts that would guarantee as far as
possible the existence of atoms of de®nite ®nite size.''
Almost a century earlier, Joseph von FraunhoÈfer, an illustrious German physicist, discovered that the apparent continuity of
the sun's spectrum is actually an illusion. This seemingly unrelated discovery was actually the beginning of the long and tortuous road toward the understanding of the atom. The eleventh and
youngest child of a glazier, FraunhoÈfer became apprenticed to a
glass maker at the age of twelve. Three years later, a freak accident turned the young lad's life around; the rickety boarding
house he was living in collapsed and he was the only survivor.
Maximilian I, the elector of Bavaria, rushed to the scene and
took pity of the poor boy. He gave the young man eighteen
ducats. With this small capital, FraunhoÈfer was able to buy
books on optics and a few machines with which he started his
own glass-working shop. While testing high-quality prisms
FraunhoÈfer found that the spectrum formed by sunlight after it
passed through one of his prisms was missing some colors; it
was crossed by numerous minuscule black lines, as in ®gure 1.3
(color plate). FraunhoÈfer, intrigued, continued studying the
phenomenon, measuring the position of several hundred lines.
He placed a prism behind the eyepiece of a telescope and discovered that the dark lines in the spectrum formed by the light from
the stars did not have quite the same pattern as that of sunlight.
He later discovered that looking at the light from a hot gas
through a prism produced a set of bright lines similar to the
pattern of dark lines in the solar spectrum.
Today we know that the gaps in the spectrum that FraunhoÈfer discovered are a manifestation of the interaction between
light and matter. The missing colors in the spectrum are determined by the atoms that make up the body emitting the light.
In the spring of 1925 a twenty-four-year old physicist named
Werner Heisenberg, suffering from severe hay fever, decided to
take a two week vacation on a small island in the North Sea,
6
Physics: The Fundamental Science
away from the ¯owers and the pollen. During the previous year,
Heisenberg had been trying to understand this interaction
between light and matter, looking for a mathematical expression
for the lines in the spectrum. He had decided that the problem of
the relationship between these lines and the atoms could be analyzed in a simple manner by considering the atom as if it were an
oscillating pendulum. In the peace and tranquility of the island,
Heisenberg was able to work out his solution, inventing the
mechanics of the atom. Heisenberg's new theory turned out to
be extremely powerful, reaching beyond the original purpose of
obtaining a mathematical expression for the spectral lines.
In 1984, this idea of thinking about the atom as oscillations
took a new turn. John Schwarz of the California Institute of
Technology and Michael B Green of the University of London
proposed that the fundamental particles that make up the atom
are actually oscillating strings. The different particles that scientists detect are actually different types or modes of oscillation of
these strings, much like the different ways in which a guitar
string vibrates. This clever idea, which was incredibly dif®cult
to implement, produced a theory of enormous beauty and
power which explains and solves many of the dif®culties that
previous theories had encountered. The current version of the
theory, called superstring theory ± which we will study in more
detail in chapter 25 ± promises to unify all of physics and help
us understand the ®rst moments in the life of the universe. Still
far from complete, superstring theory is one of the most active
areas of research in physics at the present time.
In all these cases, the scientists considered a phenomenon of
nature, simpli®ed its description, constructed a theory of its behavior based on the knowledge acquired by other scientists in the
past, and used the new theory not only to explain the phenomenon, but also to predict new phenomena. This is the way physics
is done. This book shows how we can also do physics, and share
in its excitement.
The scienti®c method: learning from our mistakes
In contrast to that of many other professionals, the work of a
scientist is not to produce a ®nished product. No scienti®c
7
SUPERSTRINGS AND OTHER THINGS
theory will ever be a correct, ®nished result. ``There could be no
fairer destiny for any . . . theory,'' wrote Albert Einstein, ``than
that it should point the way to a more comprehensive theory in
which it lives on, as a limiting case.''
Science is distinguished from other human endeavor by its
empirical method, which proceeds from observation or experiment.
The distinguished philosopher of science Karl R Popper said that
the real basis of science is the possibility of empirical disproof.
A scienti®c theory cannot be proved correct. It can, however, be
disproved.
According to the scienti®c method, a scientist formulates a
theory inspired by the existing knowledge. The scientist uses
this new theory to make predictions of the results of future
experiments. If when these experiments are carried out the predictions disagree with the results of the experiments the theory
is disproved; we know it is incorrect. If, however, the results
agree with the forecasts of the theory, it is the task of the scientists
to draw additional predictions from the theory, which can be
tested by future experiments. No test can prove a theory, but
any single test can disprove it.
In the 1950s, a great variety of unpredicted subatomic particles discovered in laboratories around the world left physicists
bewildered. The picture that scientists had of the structure of
matter up to the 1940s ± as we will learn in more detail in chapters
7 and 8 ± was relatively simple and fairly easy to understand:
matter was made of atoms, which were composed of a tiny
nucleus surrounded by a cloud of electrons. The nucleus, in
turn, was made up of two kinds of particles, protons and neutrons. The new particles being discovered did not ®t this simple
scheme. Two theories were formulated to explain their existence.
The ®rst one proposed a ``particle democracy,'' in which no particle was any more fundamental than any other. This theory was
so well received by the scienti®c community in the United Sates
that one of the proponents of the second theory, Murray GellMann of the California Institute of Technology decided to publish
his paper in a European journal where he felt the opposition to his
new ideas would not be so great. Gell-Mann and independently
George Zweig, also of Caltech, proposed that many of the growing number of particles and in particular the proton and the
neutron were actually made up of smaller, indivisible particles
8
Physics: The Fundamental Science
which Gell-Mann called quarks. Different combinations of quarks,
in groups of two or three, were responsible for many of these particles. According to their theory, the growing number of new particles being discovered was not a problem anymore. What
mattered was that the objects of which these particles were
made of were simple and small in number.
Which theory was correct? In 1959 Stanford University built
a large particle accelerator which, among other things, could
determine whether or not quarks existed. Seven years later,
experiments carried out at the Stanford Linear Accelerator
Laboratory, SLAC, allowed physicists to determine the presence
of the quarks inside protons and neutrons. Since then, many
experiments have corroborated the Stanford results; the quark
is accepted today as one of the fundamental constituents of
matter and the ``particle democracy'' theory is no longer viable.
We shall see in the ®nal chapters of this book that these new
theories of matter are far from complete. Nevertheless, the knowledge obtained from these theories has given us not only a better
understanding of the universe we live in but has also produced
the modern technological world based largely on the computer
chip.
We can summarize the scienti®c method by saying that we
can learn from our mistakes. Scienti®c knowledge progresses by
guesses, by conjectures which are controlled by criticism, by
critical tests. These conjectures or guesses may survive the tests;
but they can never be established as true. ``The very refutation
of a theory,'' writes Popper, ``is always a step forward that
takes us nearer to the truth. And this is how we learn from our
mistakes.''
Physics and other sciences
Physicists often become interested in phenomena normally
studied by scientists in other scienti®c disciplines, and apply
their knowledge of physics to these problems with great success.
The recent formulation of the impact theory of mass extinctions is
a good illustration of physicists becoming involved in other scienti®c ®elds and of the way working scientists apply the scienti®c
method to their work.
9
SUPERSTRINGS AND OTHER THINGS
In 1980, the Nobel prize winning physicist Luis Alvarez and
his son Walter, a professor of geology at the University of California at Berkeley, reported in a paper published in the journal
Science that some 65 million years ago a giant meteorite crashed
into the earth and caused the extinction of most species. The
dinosaurs were the most famous casualties. Alvarez and his
collaborators based their theory on their study of the geological
record. Walter Alvarez had told his father that the 1-cm-thick
clay layer that separates the Italian limestone deposits of the
Cretaceous period ± the last period of age of reptiles ± from those
of the Tertiary period ± the ®rst period of the age of mammals,
Figure 1.4. An unorthodox theory of the extinction of the dinosaurs.
(Cartoon by Sydney Harris.)
10
Physics: The Fundamental Science
was laid down during precisely the time when the great majority
of the small swimming animals in the marine waters of that region
had disappeared. What made it even more exciting was the fact
that this time also coincided with the disappearance of the dinosaurs.
The layer of clay ± observed worldwide and known as the
K-T boundary layer ± contains an unusually high concentration
of the element iridium. This element is present in very small
amounts in the earth's crust but is much more abundant in
meteorites. The father and son team thought that by measuring
the amount of iridium present in the clay they could determine
how long the layer had taken to form. They assumed that iridium
could have rained down on the earth from meteoritic dust at a
fairly steady rate during the thousand years that it took to
form. If that were the case, they could measure the amount of
iridium in the clay and in the rocks above the clay (formed
later) and below (formed earlier) and determine the time it had
taken for the iridium to accumulate. To that effect, they enlisted
the help of Frank Asaro and Helen Michel, nuclear chemists at
the Lawrence Berkeley Laboratory. Asaro and Michel showed
that the clay layer contains three hundred times as much iridium
as the layers above and below.
The source of this unusual amount of iridium had to be
extraterrestrial, Luis Alvarez reasoned. Meteorites, which are
Figure 1.5. K-T boundary layer with a high concentration of iridium.
(Courtesy Alessandro Montanari.)
11
SUPERSTRINGS AND OTHER THINGS
extraterrestrial, have fallen on the earth since its formation. If the
iridium came from the meteorites, why this sudden increase in
the meteorite rate during this particular time and why did it
decrease again to normal levels? What was so special about this
particular time in the history of the earth? More importantly,
why did it coincide with the extinction of about 50 percent of
the species in existence then?
The Alvarez team ®rst proposed that the iridium could have
come from the explosion of a supernova near the solar system.
Astrophysicists had proposed that the mass extinctions could
have been caused by such an explosion. Since these tremendous
explosions produce heavy elements Luis Alvarez proposed analyzing the samples taken from the clay for their presence.
Detailed measurements revealed no heavy elements, however,
and the supernova idea had to be abandoned.
While Walter Alvarez returned to Italy to collect more clay
samples, his father worked on theory, inventing ``a new scheme
every week for six weeks and [shooting] them down one by
one,'' as he wrote later. Luis Alvarez then considered the possibility of an asteroid or a comet passing through the atmosphere,
breaking up into dust which would eventually fall to the ground,
like the comet that broke up over Tunguska, Siberia, in 1908.
Calculations showed him that a larger asteroid, of 10 kilometers
in diameter, for example, would not break up into pieces. The
Tunguska comet was smaller.
Alvarez then concluded that some 65 million years ago, a 10kilometer comet or asteroid struck the earth, disintegrated, and
threw dust into the atmosphere. The dust remained in the atmosphere for several years, blocking sunlight, turning day into night,
and preventing photosynthesis, the process by which, in the
presence of light, plants convert water, carbon dioxide, and minerals into oxygen and other compounds. Without plants to eat,
animals starved to death. We see the remnants of dust today as
the global K-T boundary layer between the Cretaceous and
Terciary layers. Alvarez calculated the diameter of the object
from the known concentration of iridium in meteorites and his
group's data on the iridium content of the Italian clay samples.
Other scientists proposed the idea that intense volcanic eruptions could account for the mass extinctions. These scientists
found high levels of iridium in tiny airborne particles released
12
Physics: The Fundamental Science
by the Kilauea volcano in Hawaii and concluded that iridium
from the inner earth can reach the surface. For a few years after
they were proposed, both ideas could be used to explain the
K-T extinctions. However, different predictions could be drawn
from the two competing ideas and scientists scurried to ®nd
new evidence in support of the different predictions. Recent ®ndings, however, appear to con®rm the predictions of the impact
theory.
According to the scienti®c method, however, no theory can
ever be proved correct. One of the theories will eventually be
shown to be incorrect, leaving the remaining theories stronger,
but not proven. ``You will never convince some [scientists] that
an impact killed the dinosaurs unless you ®nd a dinosaur skeleton with a crushed skull and a ring of iridium around the hole,''
joked a scientist at a conference on the subject.
Sizes of things: measurement
Most work in physics depends upon observation and measurement. To describe the phenomena encountered in nature
and to be able to make observations, physicists must agree on a
consistent set of units.
Throughout history, several different systems of units were
developed. It began with the Babylonians and the Egyptians,
thousands of years ago. The earliest recorded unit of measurement, the cubit, based on the length of the arm, appeared in
Egyptian papyrus texts. According to Genesis, Noah's Ark was
300 cubits long (about 150 m). Because the length of the arm
varies from person to person, so did the cubits used among various civilizations. The Egyptians used a short cubit of 0.45 m and a
royal cubit of 0.524 m. The ancient Romans used the mille passus,
1000 double steps by a Roman legionary, which was equal to
5000 Roman feet. In the 15th century, Queen Bess of England
added 280 feet to the mile to make it eight ``furrow-longs'' or
furlongs.
In 1790, Thomas Jefferson proposed a system based on units
of 10 where 10 feet would be a decad, 10 decads a road, 10 roads a
furlong, and 10 furlongs a mile. Congress did not approve
Jefferson's system.
13
SUPERSTRINGS AND OTHER THINGS
Figure 1.6. The meter was originally de®ned as the 1/10 000 000 of the
length of the Earth's meridian from the North Pole to the Equator
In France, however, the French Revolution brought an interest in science and another base 10 system, the metric system, was
born. This system, based on the meter, from the Greek metron,
meaning ``measure'', was more scienti®c. Instead of using
human anatomy, the meter, as approved by the French National
Convention in 1795, was de®ned as 1/10 000 000 of the length of
Earth's meridian between the Equator and the North Pole (®gure
1.6).
Once the meter was de®ned, a unit of volume, the liter, could
be de®ned by cubing a tenth of a meter. From the liter, the
kilogram as a unit of mass was derived. Multiples of 10 provided
larger units indicated by Greek pre®xes, and for smaller units,
Latin pre®xes were used.
Due to the consistency and uniformity of the system and
the easiness of de®ning new units merely by adding a Greek
or a Latin pre®x, the metric system was adopted in Europe in
the 19th century. Today, an expanded version of the system,
SI units, for Le SysteÁme International d'UniteÂs, is used by 95
percent of the world's population and is the of®cial system in
science. In Table 1.1 we list the standard pre®xes used in the SI
system.
Notice in Table 1.1 that for large and small numbers, it is
easier to use scienti®c notation. In the scienti®c notation, numbers
are written as a number between 1 and 10 multiplied by a power
of 10. The radius of the earth, for example, which is 6380 km, can
be written in scienti®c notation as 6:380 103 km. To see why,
14
Physics: The Fundamental Science
Table 1.1. Powers of ten pre®xes
Value
18
10 1 000 000 000 000 000 000
1015 1 000 000 000 000 000
1012 1 000 000 000 000
109 1 000 000 000
106 1 000 000
103 1000
102 100
101 10
10ÿ1 0:1
10ÿ2 0:01
10ÿ3 0:001
10ÿ6 0.000 001
10ÿ9 0.000 000 001
10ÿ12 0.000 000 000 001
10ÿ15 0.000 000 000 000 001
10ÿ18 0.000 000 000 000 000 001
Symbol
Pre®x
E
P
T
G
M
k
h
da
d
c
m
m
n
p
f
a
exa
peta
tera
giga
mega
kilo
hecto
deca
deci
centi
milli
micro
nano
pico
femto
atto
note that we can write the number 1000 as follows:
1000 10 10 10 103 :
The radius of the earth is, then,
6380 km 6:38 1000 km 6:38 103 km:
Fundamental units
All mechanical properties can be expressed in terms of three
fundamental physical quantities: length, mass, and time. The SI
fundamental units are:
Quantity
Fundamental Unit
Symbol
Length
Mass
Time
meter
kilogram
second
m
kg
s
The General Conference on Weights and Measures held in Paris in
1983 de®ned the meter as the distance traveled by light through
space in 1/299 792 458 seconds. Notice that the unit of length is
de®ned with such high precision in terms of the unit of time. This
is possible because the second is known to better than 1 part in 10
trillion. The 1967 General Conference on Weights and Measures
15
SUPERSTRINGS AND OTHER THINGS
16
Figure 1.7. Range of masses, lengths and time intervals found in the universe.
Physics: The Fundamental Science
Figure 1.8. A cesium atomic clock at the National Institute of Standards
and Technology in Washington, DC. (Courtesy National Institute of
Standards and Technology.)
de®ned the second as the duration of 9 192 631 770 periods (durations of one oscillation) of a particular radiation emitted by the
cesium atom. The device that permits this measurement is the
cesium clock, an instrument of such high precision that it would
lose or gain only 3 seconds in one million years (®gure 1.8).
The last fundamental mechanical quantity is mass. Mass is a
measure of the resistance that an object offers to a change in its
condition of motion. For an object at rest with respect to us,
mass is a measure of the amount of matter present in the object.
The standard unit of mass is the standard kilogram, a solid platinum-iridium cylinder carefully preserved at the Bureau of
Weights and Measures in SeÁvres, near Paris. The kilogram is
now derived from the meter, which is derived from the second.
A copy of the standard kilogram, the Prototype Kilogram No
20, is kept at the National Bureau of Standards in Washington,
DC. A high precision balance, especially designed for the
National Bureau of Standards, allows the comparison of the
masses of other bodies within a few parts in a billion.
17
SUPERSTRINGS AND OTHER THINGS
(Cartoon by Sydney Harris.)
The mass of an atom cannot be measured by comparison
with the standard kilogram with such a high degree of precision.
The masses of atoms, however, can be compared with each other
with high accuracy. For this reason, the masses of atoms are given
in atomic mass units (amu). The mass of carbon in these units has
been assigned a value of 12 atomic mass units. In kilograms, an
atomic mass unit is
1 amu 1:660 540 2 10ÿ27 kg:
Physics and mathematics
Physics and mathematics are closely intertwined. Mathematics is
an invention of the human mind inspired by our capacity to deal
18
Physics: The Fundamental Science
Frontiers of physics: Very small numbers
What does a mass like 1:660 540 2 10ÿ27 kg mean? Suppose
that you start with one grain of salt, which has a mass of
about one ten-thousandth of a gram and with a very precise
cutting instrument you divide it into ten equal parts, take
each one of the tenths, divide them into ten new equal
parts, and so on. You will not arrive at single electrons this
way because, as we shall see in chapters 7 and 8, the electron
is one of the several constituents of atoms. Although atoms
can be split, you cannot do it with a cutting instrument.
Suppose, however, that we divide the grain of salt into
the smallest amounts of salt possible, single molecules of
salt. One single molecule of table salt has a mass of about
9 10ÿ23 g. Let's round this number up to 10ÿ22 g. If your
instrument takes one second, say, to take each piece of salt
and divide it into ten equal parts, how long would it take
to end up with individual molecules of salt? The answer is
3 1010 years.
Astrophysicists estimate that the age of the universe is of
the order of 1010 years. It would take our hypothetical instrument roughly the age of the universe to arrive at a single
molecule of salt!
Grains of salt, magni®ed 100 times. (Courtesy V Cummings, NASA.)
with abstract ideas; physics deals with the real material world.
Yet, mathematical concepts invented by mathematicians who
did not foresee their applications outside the abstract world of
mathematics have been applied by physicists to describe natural
19
SUPERSTRINGS AND OTHER THINGS
Pioneers of physics: Measuring the
circumference of the Earth
The meter, as we saw, was de®ned in 1795 as 1/10 000 000 of
the length of the earth's meridian from the Equator to the
North Pole. For that de®nition to make sense, an accurate
knowledge of the Earth's dimensions was needed. That is,
the actual length of the meridian from the Equator to the
North Pole had to be known with good precision. How did
we come to know the Earth's dimensions before the advent
of twentieth century technology?
The dimensions of the Earth have been known since the
time of the ancient Greeks. The Greek astronomer Eratosthenes, who lived in the third century BC in Alexandria
(Egypt), came up with a very clever method for obtaining
the circumference of the Earth. Eratosthenes had heard that
in the city of Syene, an ancient city on the Nile, near
today's Aswan, on the ®rst day of summer, the sun shone
on the bottom of a vertical well at noon. However, in his
native Alexandria, the sun's rays did not fall vertically
down but at an angle of 78 to the vertical. This angle of 78
was about one-®ftieth of a circle and that meant that the
20
Physics: The Fundamental Science
distance between Syene and Alexandria must be one-®ftieth
of the earth's circumference.
During Eratosthene's time, the distance between these
two cities was estimated to be 5000 stadia. So the circumference of the earth was 50 times this distance or 250 000
stadia. Although the exact length of that Greek unit is not
known, we do know that the length of a Greek stadium
varied between 154 and 215 meters. If we use an average
value of 185 m, the result is only about 15% larger than
modern measurements, a remarkable achievement.
phenomena. ``It is a mystery to me,'' wrote the Nobel Prize
winning physicist Sheldon Glashow, ``that the concepts of mathematics (things like the real and complex numbers, the calculus
and group theory), which are purely inventions of the human
imagination, turn out to be essential for the description of the
real world.''
Physicists, on the other hand, have invented powerful mathematical techniques in their search to understand the physical
world. Newton developed the calculus to solve the problem of
the attraction that the earth exerts on all objects on its surface.
Mathematicians later continued the development of calculus
into what it is today.
Mathematics is then the instrument of physics; the only language in which the nature of the world can be understood. None
the less, in this book we are interested in the concepts of physics.
These concepts can usually be described with words and examples. In some instances, however, there is no substitute for the
elegance and conciseness of a simple formula. In these cases,
we shall consider such a formula to see how it explains new concepts and how they can be linked to other concepts already
learned. The reader should always keep in mind that our purpose
is to understand the physical phenomenon, not the mathematics
that describes it.
21
2
THE DESCRIPTION
OF MOTION
Understanding motion
The understanding of motion is fundamental in our comprehension of nature. ``To understand motion is to understand nature,''
wrote Leonardo da Vinci. If we understand how an object moves,
we might be able to discover where it has been and predict where
it will be some time in the future, provided that the present
conditions are maintained. In physics, we are interested in the
description of the motion of the different bodies that we observe,
such as automobiles, airplanes, basketballs, sound waves, electrons, planets, and stars.
To study how objects move, we need to begin by studying
how a simple object moves. An object without moving parts,
such as a ball or a block, is simpler than one with separate
parts because we do not need to worry about the motions of
the parts, and we can concentrate on how the object moves as a
whole. A ball can roll and a block can slide on a surface. Which
one is simpler? It would be easier for us if we did not have to
decide beforehand either the shape of the object or its internal
structure. Physicists simplify the problem by considering the
motion of a point, an ideal object with no size, and therefore no
internal structure and no shape.
We will consider ®rst the motion of a point. However, in
our illustrations and examples we might refer to the motion
of real objects, like cars, baseballs, rockets or people. If we do
not consider the internal structure of the object, and do not
allow it to rotate, this object behaves like a point for our
purposes.
25
SUPERSTRINGS AND OTHER THINGS
Uniform motion
``My purpose is to set forth a very new science dealing with a very
ancient subject,'' wrote Galileo in his Two New Sciences. He continued: ``There is, in nature, perhaps nothing older than motion,
concerning which the books written by philosophers are neither
few nor small; nevertheless I have discovered by experiment
some properties of it which are worth knowing and which have
not hitherto been either observed or demonstrated.''
Galileo, one of the ®rst modern scientists and the ®rst one to
understand the nature of motion, was born in Pisa the same year
that Shakespeare was born in England and three days before
Michelangelo died. The year was 1564. His full name was Galileo
Galilei, following a Tuscan custom of using a variation of the
family name as the ®rst name of the eldest son.
His father, a renowned musician, wanted his son to be a
physician, a far more lucrative profession in those days. Thus,
he entered the University of Pisa to study medicine. Upon hearing a lecture on geometry which encouraged him to study the
work of Archimedes, the young medical student decided that
science and mathematics seemed far more interesting than
medicine. Galileo talked to his father about letting him switch.
Fortunately for the world his father consented.
Galileo became well known throughout Italy for his scienti®c
ability and at the age of 26 was appointed Professor of Mathematics at the University of Pisa. There he dug deeply into fundamental science. He also made some enemies, especially among
the older and more respected professors, who did not like their
opinions and views challenged by the young and tactless Galileo.
Partly because of this, and partly because the Republic of Venice
was, in 1600, the hub of the Mediterranean, which in turn was the
center of the world, Galileo accepted a position as Professor of
Mathematics at Padua, where he began the work in astronomy
that was to bring him immortal fame.
Galileo's work on mechanics was published as Discourses and
Mathematical Demonstrations Concerning Two New Sciences Pertaining to Mechanics and Local Motion, which appeared in 1638. In
the chapter ``De Motu Locali'' or ``Change of Position'', he writes:
The discussion is divided into three parts; the ®rst part deals with
motion which is steady or uniform; the second treats of motion as
26
The Description of Motion
Figure 2.1.
Several positions of a runner running along a straight track.
we ®nd it accelerated in nature; the third deals with the so-called
violent motions and with projectiles.
Galileo then goes on to explain what ``motion which is steady or
uniform'' is:
By steady or uniform motion, I mean one in which the distances
traversed by the moving particle during any equal interval of time,
are themselves equal.
Figure 2.1 is an example of uniform motion; it shows several
positions of an athlete running along a straight 100-m track at a
steady pace. The marks alongside the track show that the
runner moves equal distances of 10 meters in equal intervals of
6 seconds.
Average speed
The runner in ®gure 2.1 travels 10 meters in six seconds or 100
meters in 60 seconds (1 minute). We can say that the runner
travels at 100 meters per minute. Average speed is de®ned as
the total distance traveled divided by the time taken to travel
this distance. If we use the letter d to indicate distance, and the
letter t to indicate time, we can write the average speed, v, as
v
distance traveled d
time taken
t
27
SUPERSTRINGS AND OTHER THINGS
Figure 2.2. Multiple-exposure photograph of a disk of ``dry ice'' moving
on a smooth surface. (Illustration from PROJECT PHYSICS, copyright #
1981 by Holt, Rinehart and Winston, Inc. Reprinted by permission of the
publisher.)
where the bar above the letter v is used to indicate that this is the
average value. The runner of our example travels a distance of
100 meters in one minute. The average speed of the runner,
then, is
v
d 100 m
100 m=min:
t 1 min
Figure 2.2 shows a multiple-exposure photograph of a disk
of solid carbon dioxide (``dry ice'') in uniform motion. The disk,
resting on the polished surface of a table, is given a gentle
push. With the room darkened, the shutter of a camera set on a
tripod is kept open while at equal intervals of time a strobe is
®red. Since the only source of light comes from the strobe ±
which for this experiment was ®red at 0.10 second intervals ±
the ®lm records the position of the disk as it slides on the table.
The meter rule shows that the disk moves 13 cm between ¯ashes.
The disk, then, traverses equal distances of 13 cm in equal intervals of 0.10 s or 130 centimeters in 1.0 second. We can say that
the disk travels at an average speed v 130 centimeters per
second.
The units of speed are units of distance divided by units of
time. Speed can thus be given in miles per hour, kilometers per
hour, meters per second, feet per minute, etc. The SI unit of
speed is the meter per second (m/s).
In both of those cases, the speed did not change. The runner
and the disk were moving at a uniform or constant speed, at least
for the intervals that were considered. However, few motions are
uniform. The most common situation is that of variable speed. If
you drive from your dorm to the movies, you start from rest,
speed up to 30 miles per hour and probably drive at that speed
28
The Description of Motion
The frontiers of physics: Friction
The disk of ``dry ice'' shown in the multiple-exposure photograph of ®gure 2.2 moves equal distances in equal times
because friction between the puck and the smooth surface
is negligible. When two surfaces rub together, the atoms
that make up the two surfaces interact in ways that depend
on the atomic composition of the substances, making them
stick to each other. Although the general mechanism is well
understood, the details of how friction appears are only
now beginning to become clear. Recently, scientists at
Georgia Institute of Technology used an atomic force microscope, which measures the forces between two objects separated by less than 10 nanometers, to examine the tip of a
tiny nickel probe moving on a gold surface.
Friction seen at the atomic level between a nickel tip and a gold
surface. (Scienti®c American.)
When the nickel tip was pulled back slightly after it had
made contact with the surface, a connective ``neck'' of atoms
developed between the two surfaces, a sort of bridge at the
atomic scale. After the tip was pulled far enough, the neck
snapped, leaving the tip covered with gold atoms. Why
would gold atoms move over to the nickel tip instead of the
other way around? Gold, it turns out, requires less energy
to have one of its surface atoms removed than nickel. The
researchers believe that these differences in energy account
for the differences in friction between different substances.
29
SUPERSTRINGS AND OTHER THINGS
for a few minutes until you have to slow down to make a turn or
come to a stop at a traf®c light. The speed at which you drive
changes many times throughout your trip. We can obtain the
average speed of the motion by dividing the total distance traveled by the time it took to cover that distance.
Let's consider a numerical example. A boy takes 15 minutes
to ride his bicycle to his friend's house, which is 2 km away. He
talks to his friend for 20 minutes and then continues towards
his grandparent's home, 4 additional km. He arrives there 25
minutes after leaving his friend's house.
Since the total distance traveled is 6 km (2 km to the friend's
house and 4 to the grandparent's home) and it took the boy a total
of 60 minutes to get there, the average speed is
v
d
6 km
6 km=h:
t 60 min
Notice that we have included the time the boy spends at his
friend's house in our calculation of the total time taken for the trip.
Instantaneous speed
Average speed is useful information. When we are traveling, we
can calculate the average speed for a section of the trip and use it
to estimate how long it will take us to complete the trip, provided
we continue driving under similar conditions. However, in some
cases, we might be interested in obtaining more information.
It would take you about 8 hours to drive from Washington,
DC to Charlotte, North Carolina, a distance of 400 miles. Although
the average speed in this case is 50 mph, you know that at times
you would drive at a higher speed, whereas heavy traf®c or
lower speed limits through certain parts would force you to
drive at a lower speed. Knowing that you can average 50 mph
for this trip does not provide information about how fast you
actually traveled or whether you stopped at all along the way.
The instantaneous speed, when we can obtain it, will give us information about the detail of the trip. Instantaneous speed is the
speed given by a car's speedometer, the speed at a given instant.
If your car speedometer fails before you complete a 60-mile
trip that usually takes you one hour, you cannot be sure that
30
The Description of Motion
you did not break the 65 mph speed limit at any time during your
trip even if it still took you one hour to arrive at your destination.
However, if you time yourself between successive mile posts and
it takes you about one minute to travel one mile, you know that
you are maintaining a speed close to 60 mph. To measure your
speed more accurately you would need to reduce the time intervals to one second, perhaps ± during which you would travel only
about 90 feet ± or, with precision equipment, even to 1/10 second.
Yet, this very small interval of time would still not give you the
instantaneous speed. You would need to reduce that interval to
an instant.
Modern instrumentation allows measurements of the speed
of an object at intervals small enough to provide us with excellent
approximations to the instantaneous speed. Mathematically, it is
possible to obtain the exact value of the instantaneous speed by
the use of calculus, a mathematical technique developed over
300 years ago by Isaac Newton.
Velocity: Speed and direction
In some cases the direction in which we are moving is also important information. A pilot needs to know how fast the wind is blowing and in what direction. The pilot needs to know the wind
velocity. Velocity gives the speed and the direction of motion.
The wind velocity of a 50-mph wind blowing east would
push off course a small airplane heading north at 80 mph. If the
pilot did not correct the airplane's heading it would end up
¯ying in a northeasterly direction.
Velocity, and other quantities that require a magnitude and a
direction, are called vector quantities. In the next section we shall
study some properties of these new quantities. Quantities that do
not require a direction are said to be scalar quantities. Speed is a
scalar quantity.
Vectors
As we have said, vector quantities are those that possess both
magnitude and a direction. We represent vectors as arrows
31
SUPERSTRINGS AND OTHER THINGS
with length proportional to the magnitude and with a direction
indicating the direction of the vector quantity. We use bold face
letters, (v, V) to completely represent vectors and standard letters
(v, V) to indicate their magnitudes.
To illustrate some properties of vectors, let's consider the
following situation. A man wants to buy a paperback he has
heard about recently. He walks 3 km east to his friend's house
and then both walk together to the nearest drugstore, 4 km
from his friend's house and in the same direction (®gure 2.3(a)).
Figure 2.3. (a) A man walks 3 km to his friend's house and then 4 km to
the drugstore. The man is 7 km away from home. If instead he walks 4 km
from the friend's house to the library, the man is only 5 km from home.
(b) The two individual trips of 3 and 4 km are equal to a single trip of
7 km. (c) Walking east 3 km (labeled with vector a) and then north
4 km (vector b) is equivalent to walking across in the direction shown
by vector c.
32
The Description of Motion
Obviously, the total distance traveled by the man is 7 km. Since
the two trips take place in the same direction, the man ®nds
himself 7 km away from home. Graphically, we can illustrate
this situation as in ®gure 2.3(b).
Let's suppose now that the man wants to borrow the book
from the local library instead of buying it at the drugstore
(®gure 2.3(a)). If the library is also 4 km from his friend's house,
but north instead of east, as illustrated in ®gure 2.3(b), the man
is now only 5 km fromphome. Using Pythagoras's Theorem we
get a displacement of
42 32 km 5 km. If he had wanted,
he could have cut across the ®eld straight from his home to the
library, walking only the 5 km.
If vector a represents the 3-km walk east, and b the 4-km
walk north, vector c represents the straight walk across the ®eld
from the house to the library. Vector c is equivalent to the two
vectors a and b together. In other words, walking east for 3 km
to the friend's house and then north for 4 km to the library is
the same as walking 5 km across the ®eld from the house to the
library. We call vector c the resultant of vectors a and b and the
process addition of vectors.
An alternative method, the so-called parallelogram method for
the addition of vectors, consists of placing the vectors to be added
``tail-to-tail'' instead of ``head-to-tail,'' keeping their orientations
®xed. The resultant is obtained by completing the diagonal of
the parallelogram. In ®gure 2.4 we illustrate the addition of a
Figure 2.4.
Addition of two vectors using the parallelogram method.
33
SUPERSTRINGS AND OTHER THINGS
horizontal vector a and a vertical vector b, producing a resultant
vector c. (In this case the parallelogram is a rectangle.)
Just as horizontal vector a and vertical vector b can be
combined to form the resultant vector c, any vector can be
viewed as the resultant of two vectors that are perpendicular to
each other. These new vectors are called the components of the
original vector.
Acceleration
``The properties belonging to uniform motion have been discussed in the preceding section; but accelerated motion remains
to be considered,'' wrote Galileo in the chapter ``De Motu
Locali'' of his Two New Sciences, where he had discussed his
discoveries on uniform motion. He continued:
And ®rst of all it seems desirable to ®nd and explain a de®nition best
®tting natural phenomena. This we readily understand when we
consider the intimate relationship between time and motion . . .
We can illustrate this relationship between time and motion
with the following situation. Laurie and her friend Matthew are
discussing the responsiveness of Laurie's recent investment: a
nice used car she bought with the money she earned last
summer. Laurie says the dealer assured her that the car can
make 0 to 50 miles per hour in less that 10 seconds. Matthew
does not think that a three-year-old subcompact can do that
and suspects that the speedometer might have been altered. To
settle the argument Laurie asks Matthew to ride with her while
she accelerates and to record the speed of the car every second.
Because this is dif®cult to do, they also convince ten of their
friends to line up at regular intervals along a straight and ¯at
section of an infrequently traveled road, to time the car as
Laurie starts accelerating from rest. The ten friends are separated
by intervals of 10 meters each and start their digital timers as soon
as Laurie steps on the gas pedal. Laurie and Matthew collect their
friends' data and tabulate it as follows:
x (m)
t (s)
0
0
10
3.2
20
4.4
30
5.5
40
6.3
50
7.1
34
60
7.7
70
8.4
80
8.9
90
9.5
100
10
The Description of Motion
Since the car is accelerating, the speed is not constant. This can
be seen by studying the data. Laurie covers the ®rst 10 meters in 3.2
seconds, but takes only 1.2 seconds to cover the second interval of
10 meters. The last 10 meters are traveled in half a second!
According to the car's speedometer, Laurie was traveling at
45 miles per hour at the end of the 10 seconds. To check this,
Laurie calculates the average speed during the last interval,
which is equal to the 10 meters traveled (100 m ÿ 90 m) divided
by the 0.5 seconds it took, or
v
10 m
20 m=s
0:5 s
or 45 miles per hour, con®rming Matthew's data, and assuring
Laurie that the speedometer had not been tampered with. Since
45 miles per hour is the average speed during the last 10 meters,
the actual speed at the 100-m mark is slightly higher (48 mph in
this case). However, since this was only one trial, Laurie is satis®ed that the 0 to 50 in about 10 seconds may be possible.
When we say that the car makes ``0 to 50 in 10 seconds'' we
obviously mean that the car accelerates from rest to 50 miles per
hour in 10 seconds. Or in the case of Laurie's car, 45 miles per
hour in ten seconds. Acceleration is the rate at which velocity
changes and is given in units of distance per time squared
(miles per hour per second, for example). Thus, average acceleration is given by
a
v
t
where v is the change in velocity that takes place during the
time interval t. (The Greek capital delta is used in this fashion
to indicate change in some quantity, here the change in velocity.)
The SI unit for acceleration is the meter per second squared
(m=s2 ). Since velocity is a vector quantity, acceleration is also a
vector quantity.
Uniformly accelerated motion
In the previous example, the acceleration of the car is constant. In
other words, the velocity increased at a constant rate. In the Two
35
SUPERSTRINGS AND OTHER THINGS
New Sciences Galileo called motion with constant acceleration
uniformly accelerated motion. He wrote:
[F]or just as uniformity of motion is de®ned by and conceived through
equal times and equal spaces (thus we call a motion uniform when
equal distances are traversed during equal time intervals), so we
may, in a similar manner, through equal time-intervals, conceive
additions of speed as taking place without complication . . .
Hence the de®nition of [uniformly accelerated] motion . . . may be
stated as follows:
A motion is said to be uniformly accelerated when, starting from
rest, it acquires during equal time-intervals, equal increments of
speed.
Uniformly accelerated motion is a special case of accelerated
motion, since acceleration does not have to be always constant.
Here we shall consider only this special case. When the acceleration is constant, a a. Thus, we can write,
a a
v
:
t
Let's consider a numerical example. A driver enters a
straight highway at 30 km/h and accelerates to 55 km/h in 5
seconds. What is the acceleration? Calling the initial and ®nal
velocities vi and vf , respectively, we can calculate the change in
velocity is v vf ÿ vi 55 km/h ÿ 30 km/h 25 km/h. The
driver accelerates at
a
v 25 km=h
5 km=h=s:
t
5s
That is, the driver speeds up at a rate of 5 km/h every second.
If we know the acceleration of an object moving with uniformly accelerated motion and are interested in calculating the
change in the velocity v after certain time t, we can turn our
equation around to obtain
v at:
We can use this equation to compute, for example, the takeoff
speed of a jet airplane. Suppose an airliner increases its speed
by 10 km=h every second; that is, its takeoff acceleration is
10 km=h=s. If the plane takes off 36 seconds after it ®rst began
36
The Description of Motion
accelerating, the takeoff speed can be obtained from the previous
equation by realizing that the change in speed during these 36
seconds is simply the takeoff speed, v, since the initial speed is
0. The takeoff speed of a jet airliner is then,
10 km=h
v at
36 s 360 km=h:
s
Falling bodies
One important example of uniformly accelerated motion is the
vertical motion of an object falling. The acceleration, g, is
caused by the gravitational attraction of the earth upon the
object. The magnitude of g at the earth's surface is 9.8 m/s2 .
This value varies slightly with altitude and latitude and with
the different geological features near the earth's surface. In the
following chapter we shall see how Galileo ®rst calculated this
value. Here we merely use the fact that, neglecting air resistance,
all objects near the surface of the earth fall with an acceleration
g 9:8 m/s2 as an example of motion with constant acceleration.
Since for this case, a g, the last equation should be written as
v gt:
A common situation is the fall of an object from rest. In this
case, in a certain time t, the velocity changes from zero to the
value of the instantaneous velocity, v. The change in velocity
after a time t is the instantaneous velocity, and we can write
v gt:
Thus, after one second, the stone that the boy in ®gure 2.5 drops
from a bridge is moving with a velocity of v 9:8 m=s2 1 s
9.8 m/s towards the water. After 5 seconds, the velocity has
increased to v 9:8 m=s2 5 s 49 m/s. In Table 2.1, we show
the instantaneous speed of an object falling at intervals of one
second.
Table 2.1 illustrates Galileo's de®nition of uniformly accelerated motion as that which acquires equal increments of speed
during equal time intervals. We see that for each time interval
of 1 second, the speed increases by 9.8 m/s. This result is in agreement with our commonsense idea of acceleration. When we say
37
SUPERSTRINGS AND OTHER THINGS
Figure 2.5. After a time t the instantaneous velocity of the stone that the
boy drops from the bridge is v gt.
Table 2.1. Instantaneous speed of an object falling from rest.
Time of Fall (s)
Velocity (m/s)
0
1
2
3
4
5
6
7
8
9
10
0
9.8
19.6
29.4
39.2
49
58.8
68.6
78.4
88.2
98
38
The Description of Motion
Pioneers of physics: Galileo's method
Galileo arrived at the expression for computing the distance
traveled by a falling object guided by his desire to ®nd quantities that he could measure directly. A motion is uniformly
accelerated (he had written) when, starting from rest, it
acquires equal increments of speed during equal time intervals. Measuring increments of speed at equal time intervals
for a falling object was not practicable with the rudimentary
clocks that were available to him.
Galileo had discovered, however, that an object falling
from rest increases its speed uniformly from an initial value
of 0 to a ®nal value vfinal . He realized that for a quantity
that changes uniformly, the average value is halfway
between the initial and ®nal values. Since the initial value
of the speed of an object falling from rest is zero, the average
value of the speed is then 12 vfinal (halfway between 0 and
vfinal ). Using the de®nition of average speed, vav d=t, one
can obtain the distance d traveled in terms of the average
speed and the time; that is d vav t. Thus, the distance
traveled by an object moving with an average speed 12 vfinal
is
d 12 vfinal t:
Galileo could not test this expression directly either, since
he would have had to measure the speed of the falling
object just before it hit the ground, a task that is dif®cult
even today.
The ®nal speed of an object falling from rest is, as we saw
above, vfinal gt. This expression for vfinal can be substituted
into the expression for the distance d:
d 12 gt t 12 gt2 :
This is the expression that Galileo was seeking. It relates the
total distance that the object falls from rest to the total time,
quantities that were easier to measure. The timing devices
that Galileo had at his disposal, however, only allowed him
to test this expression in an indirect way.
39
SUPERSTRINGS AND OTHER THINGS
that an object is accelerating, we seem to imply that the longer the
elapsed time, the faster it goes. However, we could also imply that
the farther the object goes, the faster it goes. In fact, Galileo wrote
that at one time he thought it would be more useful to de®ne uniform acceleration in terms of the increase in speed and distance
traveled.
We already know how to compute the change in speed with
time elapsed. How do we ®nd the distance traveled as the speed
increases? Galileo, guided by a need to ®nd an expression that he
could test with the limited instruments of his time, was able to
show from the de®nitions of average speed and acceleration
that the distance traveled by a falling object that starts from rest is
d 12 gt2 :
Notice that since the acceleration due to gravity, g, is constant, the
distance traveled is proportional to the square of the time
elapsed. The stone that the boy in ®gure 2.5 drops moves 4.9 m
in the ®rst second. In two seconds, twice the time, it moves four
times the distance, or 19.6 meters; and in three seconds it moves
nine times as far, or 44 meters.
The motion of projectiles
The third and last part in which Galileo divided his study of
motion is the motion of projectiles. ``I now propose to set forth
those properties which belong to a body whose motion is compounded of two other motions, namely, one uniform and one
naturally accelerated. This is the kind of motion seen in a
moving projectile.'' Galileo further explains:
Imagine any particle projected along a horizontal plane without friction; then we know, from what has been more fully explained in the preceding pages, that this particle will move along this same plane with a
motion which is uniform and perpetual, provided the plane has no
limits. But if the plane is limited and elevated, then the moving particle,
which we imagine to be a heavy one, will on passing over the edge of the
plane, acquire, in addition to its previous uniform motion, a downward
propensity due to its own weight; so that the resulting motion, which I
call projection, is compounded of one which is uniform and horizontal
and of another which is vertical and naturally accelerated.
40
The Description of Motion
Figure 2.6. Stroboscopic photograph of two balls released simultaneously from the same height. One of the balls is given an initial horizontal velocity and moves off to the side as it falls. The horizontal lines in the
photograph helps us see that both balls hit the ground simultaneously.
(From PSSC Physics Seventh Edition, by Haber-Schaim, Dodge, Gardner,
and Shore. Published by Kendall/Hunt Publishing Company, 1991.)
The strobe photograph in ®gure 2.6 illustrates Galileo's
experiment. Two balls were released simultaneously from the
same height. One ball was simply dropped while the other was
thrown horizontally. The equally spaced horizontal lines in the
photograph show us that both balls keep pace as they fall, accelerating toward the ground at the same rate. This acceleration is
the acceleration due to gravity, g. The initial horizontal velocity
41
SUPERSTRINGS AND OTHER THINGS
given to one of the balls does not affect its vertical motion. Careful
examination of the photograph tells us that the horizontal
distance between positions of the ball that was given an initial
horizontal velocity are all equal. The horizontal component of
the motion is uniform motion with a constant horizontal velocity
component. Therefore, the vertical motion of the ball does not
affect its horizontal motion.
The independence of horizontal and vertical motions allows
us to predict the position and velocity of projectiles at any time
during their ¯ight merely by applying what we have learned
about motion to the two independent components. The horizontal
motion is uniform, at a constant speed vX , and the vertical motion
is uniformly accelerated motion, with a constant acceleration g.
42
3
THE LAWS OF
MECHANICS:
NEWTON'S LAWS OF
MOTION
The concept of force
Central to the laws of mechanics is the concept of force. Our idea
of force is closely related to muscular activity. When we push or
pull on an object, we exert a force on it (®gure 3.1). When we
push a lawn mower across a yard, pull a hand truck loaded
Figure 3.1. The mother pushing the stroller exerts a force on it. There
are other forces in nature.
43
SUPERSTRINGS AND OTHER THINGS
with boxes, push against the arms of a chair to get up from it or
when we turn the ignition key with our index ®nger and thumb
to get the car started we are applying a force. These forces associated with muscular activity are not the only ones that exist in
nature. When you bring a small magnet near a nail, a magnetic
force pulls the nail towards the magnet; and a gravitational force
keeps the moon orbiting around the earth and the earth around
the sun, and keeps us attached to the ground.
The concept of force is directly involved in the formulation of
the laws of motion. The discovery of these laws marks the birth of
our modern understanding of the universe.
The ancient idea of motion
We all know today that, neglecting the very small effect of air
resistance, an object falling towards the ground experiences a
constant acceleration caused by the gravitational attraction of
the earth upon the object, and that all falling objects experience
this acceleration. This was not known before the early 1600s.
Until then, it was believed that heavier objects would fall
towards the ground faster than lighter ones. This idea was
based on the teachings of Aristotle, the greatest scienti®c authority
of antiquity.
Born in the Greek province of Macedonia in the year 384 BC,
Aristotle was raised by a friend of the family, having lost both
parents while still a child. At the age of seventeen he went to
Athens for his advanced education and later joined Plato's Academy, becoming ``the intelligence of the school,'' as Plato himself
called him. It was in Athens, many years later, that Aristotle
founded the Lyceum, a school so named because it was near
the temple to Apollo, also known as Lykaios or Lyceius (apparently because he protected the ¯ocks from wolves (lykoi)). This
was the famous ``peripatetic (walk about) school'' where Aristotle
would sometimes lecture while strolling in the school's garden.
Aristotle's lectures were collected in some 150 volumes, of
which only about 50 have survived. Aristotle's writings remained
largely forgotten until the thirteenth century AD. Throughout the
Middle Ages, Aristotle became one of the most important in¯uences and perhaps the greatest philosopher.
44
The Laws of Mechanics: Newton's Laws of Motion
According to Aristotle, there was a sharp distinction between
heaven and earth, with different sets of natural laws for each
region. The boundary between these two regions was the
sphere of the moon, above which all motion was perpetual, circular, and uniform. This was the region of no-change, the home of
the ñther, where things were eternal and unchanging. In contrast,
in the region below the sphere of the moon, all motion was along
straight lines. Things constantly changed due to the interplay
between hot and cold, dry and moist. The four combinations of
these opposites produced the four ``elements'': Earth, Water, Air,
and Fire. These four elements had their own natural place, and
motion was an attempt to reach that place. Since the earth was
at the center, an object composed mostly of earth, like a rock,
would fall towards the ground, its natural place.
Thus, the fall of an object towards the earth is an example of
natural motion. Moreover, since a heavy object contains more
earth than a light one, it could have a stronger tendency to fall
towards its natural place. According to Aristotle, heavier objects
fall faster than lighter objects. It took nineteen centuries and the
genius of Galileo for this error to be corrected.
The birth of modern science
In his Two New Sciences, Galileo presents his theories in the form
of a dialog among three people: Simplicio, who represents the
views of Aristotle, Salviati, who represents Galileo, and Sagredo,
who represents the intelligent layman. At one point, after discussing whether Aristotle had ever tested by experiment if a
heavier stone would fall to the ground faster than a lighter
stone, Simplicio and Salviati continue:
Salviati [Galileo]: If then we take two bodies whose natural speeds are
different, it is clear that on uniting the two, the more rapid one will be
partly retarded by the slower, and the slower will be somewhat
hastened by the swifter. Do you not agree with me in this opinion?
Simplicio [Aristotle]: You are unquestionably right.
Salviati [Galileo]: But if this is true, and if a large stone moves with a
speed of, say, eight, while a smaller moves with a speed of four, then
when they are united, the system will move with a speed less than
45
SUPERSTRINGS AND OTHER THINGS
eight; but the two stones when tied together make a stone larger than
that which before moved with a speed of eight. Hence the heavier body
moves with less speed than the lighter one; an effect which is contrary
to your supposition. Thus you see now, from your assumption that the
heavier body moves more rapidly than the lighter one, I infer that the
heavier body moves more slowly.
Simplicio [Aristotle]: I am all at sea. . . This is, indeed, quite beyond
my comprehension. . .
Galileo had actually proved theoretically in 1604 that falling
bodies are accelerated towards the ground at a constant rate. An
object falling to the ground experiences what came to be known
as uniformly accelerated motion. In fact, Galileo actually de®ned
uniform acceleration in terms of the behavior of falling bodies.
However, with the clocks and instruments available to him at
the time, he could not directly test whether his theoretical predictions were correct. The legend that he dropped weights from the
leaning tower of Pisa is very likely false.
Galileo realized that there was an indirect way of testing his
theory. If an object is falling to the ground at a slower rate, as
when a ball rolls down a smooth inclined plane, it can be timed
Figure 3.2. Picture painted in 1841 by G Bezzuoli. Galileo (the tall man
in the center, pointing with his right hand at the open book) demonstrates one of his experiments with the ball rolling down an inclined
plane. Changing the angle of inclination of the plane allowed Galileo
to infer that when the angle was 908 (vertical plane) the acceleration of
the ball was also constant. He could not perform this last experiment
because he lacked a timing device accurate enough to time the rapidly
falling ball. (Alinari/Art Resource, NY.)
46
The Laws of Mechanics: Newton's Laws of Motion
with good accuracy. Galileo actually constructed such a plane and
left detailed notes on his experiment. In the Two New Sciences it
is Salviati who describes in great detail how the hundreds of
experiments were performed. By changing the angle of inclination and determining the acceleration of the ball as it rolled
down, Galileo was able to infer that in the limiting case, when
the angle was 908, the acceleration, having been constant for all
the other angles, had to be constant too. And 908 was, of course,
free fall.
Thus, we see how Galileo not only was able to argue against
the Aristotelian approach with his mathematical rationalism, but
in the process he established the modern scienti®c method from
observation to hypothesis. From the mathematical analysis of
Pioneers of physics: Galileo's dialog with
Aristotle
Simplicio: Your discussion is really admirable; yet I do not ®nd
it easy to believe that a bird-shot falls as swiftly as a cannon
ball.
Salviati: Why not say a grain of sand as rapidly as a
grindstone? But Simplicio, I trust you will not follow the
example of many others who divert the discussion from its
main intent and fasten upon some statement of mine which
lacks a hair's-breath of truth and, under this hair, hide the
fault of another which is as big as a ship's cable. Aristotle
says that ``an iron ball of one hundred pounds falling from
a height of one hundred cubits reaches the ground before a
one-pound ball has fallen a single cubit.'' I say that they
arrive at the same time. You ®nd, on making the experiment,
that the larger outstrips the smaller by two ®nger-breaths;
now you would not hide behind these two ®ngers the
ninety-nine cubits of Aristotle, nor would you mention my
small error and at the same time pass over in silence his
very large one.
Galileo Galilei, Dialogs Concerning Two New Sciences, 1638,
translated by Henry Crew and Alfonso de Savio (Macmillan,
New York, 1914; reprinted by Dover, New York), pp. 64±65.
47
SUPERSTRINGS AND OTHER THINGS
the hypothesis, predictions are drawn, and these can in turn be
tested by experimental observation. Galileo was aware of his
having founded the experimental method. In the Two New
Sciences, he writes:
Salviati: . . . we may say the door is now open, for the ®rst time, to a
new method fraught with numerous and wonderful results which in
future years will command the attention of other minds.
Galileo formulates the Law of Inertia
In one of Galileo's experiments, he drops two balls, ``one, say, of
lead, the other of oak, both descending from a height of 150 or 200
braccia [yards].'' ``Experience shows us,'' writes Galileo in The Two
New Sciences, ``that [the] two balls . . . arrive at the earth with very
little difference in speed.'' Galileo shows that the opposition
presented by the air when the object is moving with great
speed is not much larger than when the object is moving at a
lower speed. He further explains:
As to speed, the greater this is, the greater will be the opposition made
to it by the air, which will also impede bodies the more, the less heavy
they are. Thus the falling heavy thing ought to go on accelerating in
the squared ratio of the duration of its motion; yet, however heavy
the movable might be, when it falls through very great heights the
impediment of the air will take away the power of increasing its
speed further, and will reduce it to uniform and equable motion.
And this equilibration will occur more quickly and at lesser heights
as the movable shall be less heavy.
Galileo af®rms here that a body falling through a great distance will be slowed down by increasing air resistance until this
resistance equals the weight of the falling object. At this moment,
the object is prevented from any increase in its speed and will
continue falling with a constant velocity due to the cancellation
of the forces acting on the object. This is contrary to the Aristotelian idea that to keep an object in motion you would need to
supply a force, and that if the motive force is balanced by the
resistance, the speed becomes zero. Thus, according to Galileo,
an object moving at constant speed along a straight line will continue to do so in spite of the fact that it has lost all contact with the
48
The Laws of Mechanics: Newton's Laws of Motion
Figure 3.3. An ice skater moving on perfectly smooth ice at a constant
speed along a straight line would not slow down. Air resistance and
some friction (since even Olympic ice rinks are not perfectly smooth)
actually slows down the ice skater.
source of that motion. This can be interpreted as the tendency of a
body to resist any change in its state of motion, a property that
Newton would later call inertia. In its Two New Sciences, Galileo
writes:
I mentally conceive of some movable projected on a horizontal
plane, all impediments being put aside. Now it is evident from what
has been said elsewhere at greater length that equable [uniform]
motion on this plane would be perpetual if the plane were of in®nite
extent.
Galileo arrived at this important conclusion in a very clever
way. ``Let us suppose,'' he wrote, ``that the descent [of a body]
has been made along the downward sloping plane AB, from,
which the body is de¯ected so as to continue its motion along
the upward sloping plane BC.'' In the same way that a pendulum
bob reaches nearly the same height from where it was released
after one swing, the rolling body in Galileo's experiment reaches
the same height when rolling upwards on plane BC. ``And this is
true whether the inclinations of the planes are the same or
different, as in the case of the planes AB and BD.'' Thus, if the
49
SUPERSTRINGS AND OTHER THINGS
Figure 3.4. Galileo's clever design to prove that, neglecting friction, an
object moving along a horizontal plane will continue moving forever at a
constant velocity.
inclination of the plane on the right in ®gure 3.4 is changed from
C to D to E, the body must roll farther in each case to reach the
same height. When the second plane is horizontal, the rolling
body never reaches its original height and must ``maintain a uniform velocity equal to that which it had acquired at B after fall
from A.''
There is still some disagreement among historians of science
as to whether Galileo believed that a plane of ``in®nite extent''
could actually exist and some historians interpret Galileo's
phrase ``rolling on forever'' as meaning ``staying at a constant
height above the earth'' and not moving along a straight line
through space. It is, however, evident that he saw that a body
could have a uniform, nonaccelerated, constant velocity, and
that Isaac Newton would use Galileo's ideas in the formulation
of his laws of motion.
Physics in our world: The Leaning Tower of Pisa
The leaning Tower of Pisa tilts today at an angle of 321 minutes of arc, making the top of the tower 5.227 meters offcenter. The extent of this lean places a very large stress on
the southern wall of the tower and might actually cause it
to collapse.
The leaning tower has only been straight for a short
period during its 800-year history. Right after its construction
in the twelfth century it tilted north. It straightened up a few
50
The Laws of Mechanics: Newton's Laws of Motion
years later and stood nearly upright for almost a century,
only to start tilting at an alarming rate in a southward direction during the early part of the 14th century. During the
early 1600s, when Galileo lived in Pisa, the tower had already
leaned over 180 minutes of arc.
Construction of the tower began in 1173 in the Piazza dei
Miracoli. The tower or campanile formed part of a grandiose
and spectacular cathedral and baptistery built in white
marble in a blend of Romanesque and Gothic styles. The
soil under the piazza is composed of several layers of clay
and sand. The area under the entire square has been sinking
slowly and unevenly, with a few places where the soil is
sinking faster than in the rest of the area. One of these
places lies exactly below the tower.
Since 1911, when regular monitoring of the tower began,
the top of the tower has moved about 1.2 mm per year.
Although several efforts to stop the tilting have been undertaken, none has succeeded. In 1990, the Italian government
formed an international commission of experts to study
methods to stabilize the tower. In 1992, the ®rst level of the
tower was strapped with steel bands to prevent the walls
from breaking. In 1993 the commission proposed placing
750 tons of lead bricks on the northern side of the concrete
ring that encircles the tower. These counterweights not only
stopped the rate of the incline for nine months, but actually
straightened it about 2.5 cm.
In 1995, the lead bricks were replaced by a second ring
anchored with steel cables to one of the deep clay layers.
Attempts at consolidating the layers of clay under the
tower are underway. One technique involves extracting
small quantities of water from the clay under the north
side of the tower so that the drier clay compresses slightly,
thus sinking the northern side of the tower. Other techniques under consideration involve extracting small
amounts of clay from the northern side to accomplish similar
results.
51
SUPERSTRINGS AND OTHER THINGS
Newton's First Law: Law of inertia
Isaac Newton was born in Woolsthorpe, in the county of Lincolnshire, on Christmas Day 1642, the same year that Galileo died.
His father, Isaac, had died 3 months before he was born. Of his
early education little is known, except that he attended two
small schools in villages near Woolsthorpe. At the age of
twelve, Newton enrolled in King's School at Grantham, a few
miles north of Woolsthorpe, where his intellectual interests
were awakened. He began building ingenious mechanical toys
and clocks and became interested in space and time and in the
motion of objects.
In 1660, Newton entered Trinity College and ®ve years later
he took his Bachelor of Arts degree. That year the great plague
broke out in London, and Newton went to his mother's farm in
Woolsthorpe when the university closed in June 1665. He did
not return to Cambridge until April 22, 1667. During the two
years that Newton remained at his mother's farm, his mathematical genius ¯ourished, and his studies reached a climax in
October 1666. This period of creativity is usually known as the
annus mirabilis, the year of wonders, telescoped for commemorative purposes into the single year of 1666. During that period he
performed experiments to investigate the nature of light, completing his theory of colors in the winter of 1666. He began his
observations of comets, tracking them for nights on end until
he fell ill from exhaustion, as he told his biographer. During
that same period, Newton developed the main ideas of what he
called his theory of ¯uxions, which we know today as calculus.
It was also during that glorious period of creativity that
Newton laid the foundations of his celestial mechanics with his
discovery of the law of gravity. ``All this was in the two plague
years of 1665 &1666,'' Newton wrote, ``for in those days I was
in the prime of my age for invention, & minded Mathematics &
Philosophy [i.e. physics] more than at any time since.''
It is not clear why Newton, after having conceived the idea of
universal gravitation, did not attempt to publish anything about
The Gregorian calendar was not in use in England at the time. On the Continent,
however, where the Gregorian calendar had been in use since 1582, Newton's date
of birth was ten days later: January 4, 1643. In addition, in England, which used the
Old Style calendar, the new year started on March 25 rather than on January 1.
52
The Laws of Mechanics: Newton's Laws of Motion
Figure 3.5. The physics of the Principia still guides our space vehicles.
(Courtesy NASA.)
it for almost twenty years. When he ®nally did, at the instigation
of one of his closest friends, the scientist Edmond Halley, he
published what many consider the greatest scienti®c treatise
ever written, the Philosophiae Naturalis Principia Mathematica. The
physics of the Principia still guides our communication satellites,
our space shuttles, and the spacecraft that we send to study the
solar system (®gure 3.5).
The Principia is divided into three books or parts. In Book
One, The Motion of Bodies, Newton develops the physics of
moving bodies. This ®rst part is preceded by ``axioms, or laws
of motion'':
LAW I: EVERY
BODY CONTINUES IN ITS STATE OF REST, OR OF UNIFORM
MOTION IN A RIGHT LINE, UNLESS IT IS COMPELLED TO CHANGE THAT
STATE BY FORCES IMPRESSED UPON IT.
''Projectiles continue in their motions,'' explained Newton after
stating his ®rst law, ``so far as they are not retarded by the resistance of the air, or impelled downwards by the force of gravity.''
Therefore, as long as there are no forces pushing or pulling on an
object, it will not change its state of motion. This state of motion
53
SUPERSTRINGS AND OTHER THINGS
could be the absence of motion in relation to an observer, that is, a
body at rest. Newton is stating very clearly in his ®rst law what
Galileo hinted at, that an object moving along a straight line
with a constant velocity (uniform motion) will continue moving
along that line with the same velocity unless a force is applied
to the object so as to change its velocity.
Physics in our world: Car seat belt
A car seat belt moves freely when it is pulled and locks when
the car suddenly stops. The mechanism that makes a seat belt
work is based on the concept of inertia.
The belt wraps around a belt shaft. Attached to the shaft,
a toothed plate rotates freely during normal operation. When
the car suddenly stops, however, the heavier end of an elongated clutch moves forward because of its inertia, engaging
the toothed plate with its own inner teeth. The clutch itself
is prevented from rotating further by a small pawl which
engages and locks a ratchet, also attached to the belt shaft.
The pawl is attached ®rmly to the car body.
Regardless of how fast you pull on the belt, you will not
be able to lock it, since the clutch is not engaged. When the
driver suddenly slams on the brakes or the car collides,
there is a rapid decrease in the car's speed. You, whatever
your have loose in your car, and, more importantly, the elongated clutch, continue moving with the velocity that the car
had. The objects will probably fall on the ¯oor, but you will
be prevented from hitting the windshield when the moving
clutch engages the toothed wheel.
54
The Laws of Mechanics: Newton's Laws of Motion
The tendency of objects to maintain their state of motion (rest
or motion with a constant velocity) is called the principle of
inertia. Inertia is the resistance that all objects present to any
attempt to change their state of motion. The inertia of an object
is related to its mass. A massive object has a high inertia, which
means it is hard to move or to stop once you get it moving.
Light objects can be moved about more easily. It is inertia that
projects you out of your seat when you are in a car and it stops
suddenly, and inertia is responsible for your dif®culty in getting
your car moving on an icy road. It is inertia that makes you hold
on to a rotating merry-go-round so that you are not thrown off,
and bursts a ¯ywheel that is spinning too fast. Newton's ®rst
law is also called the law of inertia.
We have spoken of forces pushing and pulling on objects to
change their states of motion. These forces caused the objects to
move. However, we all know that we can push on a heavy
piece of furniture, like a dresser for example, and fail to move
it. In this case the force applied is balanced by the frictional
force between the dresser and the ¯oor. We would say that the
net force acting on the dresser is zero. Forces are vector quantities
and this knowledge helps us understand why we can have
several forces applied to an object and still have a net or resultant
force equal to zero. When the two teams participating in a tug-ofwar contest are unable to drag each other across the center line
(®gure 3.6), the net force on the rope is zero, even though the individual team members, each of different strengths, pull with different forces. The sum of all the individual forces pulling on the
rope is zero and the rope does not move. In ®gure 3.6, we have
Figure 3.6. Equilibrium: The net force on the rope is zero if neither team
is able to drag the other. Although the forces acting on the rope are all
different, the net force is equal to zero.
55
SUPERSTRINGS AND OTHER THINGS
the three different forces F1 , F2 , and F3 exerted by each one of the
three women acting on the left side of the rope, and the three
additional forces, F01 , F02 , and F03 , exerted by the men, all
different, and differing from the ®rst ones, acting on the right
side of the rope. In this vector diagram, the lengths of the
vector forces are proportional to their magnitudes. Since the
total length of the vectors on the left is equal to the total length
of the three vectors acting on the right, the vector sum of these
six forces is equal to zero. Thus the net force acting on the rope
is zero and we say that the rope is in equilibrium.
Newton's ®rst law says that, if the net force acting on an object
is zero, the object will not change its state of motion. Thus, if the
object is in equilibrium, its velocity remains constant. A constant
velocity means that the velocity can take up any value that does
not change, including zero, the case of an object at rest. This fact
means that ``motion with a constant velocity'' and ``a state of
rest'' are equivalent, according to Newton's ®rst law. On close
examination, a velocity is always given in reference to some
point. If you are traveling on an airplane, the book you might be
reading is at rest with respect to you, but is traveling, along with
the other passengers and the entire contents of the plane, at the
cruising speed of the plane, about 1000 km/h with respect to the
ground. A ¯ight attendant walking up the aisle at 2 km/h with
respect to the plane would be moving at 1002 km/h with respect
to the ground. Whether an object is at rest or moving with a
constant velocity depends on the frame of reference to which it
is referred. These special reference frames, in which the law of
inertia is valid, are called inertial frames of reference. Thus, motion
is entirely relative, an idea that is central to Newton's laws and
was recognized by Newton himself.
Newton's Second Law: Law of force
Newton's ®rst law tells us what happens to an object in the absence
of a net force. What happens if the net force acting on an object is
not zero? According to the ®rst law, the object will not move with a
constant velocity. This means, of course, that the object will experience an acceleration in the direction of the applied net force.
Newton explained it in Book One of the Principia:
56
The Laws of Mechanics: Newton's Laws of Motion
LAW II: THE CHANGE OF MOTION IS PROPORTIONAL TO THE MOTIVE FORCE
IMPRESSED; AND IS MADE IN THE DIRECTION OF THE RIGHT LINE IN WHICH
THAT FORCE IS IMPRESSED.
According to Newton's Second Law, when an instantaneous force
acts on a body, as when a baseball bat strikes a ball, a change in
the body's motion takes place which is proportional to the
``force impressed.'' If the force acts on the body continuously
rather than instantaneously, as when we push on an object for
some time, an acceleration is produced that is proportional to
the applied force.
As an illustration of the second law, suppose that your sports
car has stalled and you decide to push it (®gure 3.7(a)). If your car
has a mass m of 1000 kg, you would apply a force F for 10 seconds
to accelerate it from rest to a speed of 5 km/h. When two of your
friends decide to join you and help you push the car (®gure 3.7(b)),
a larger force of about 3F applied to the car of mass m for the same
10 seconds would bring the car to a speed of 15 km/h, producing
an acceleration three times larger and perhaps getting your car
started. Thus, for a constant mass, we can write
F is proportional to a or F / a:
When your 2000-kg wagon stalls you know that you have to
push much harder, with a larger force, to accelerate it from rest to
5 km/h in 10 seconds (®gure 3.8). In fact, you have to apply a
force twice as large as you did for your sports car to produce
the same acceleration, since the mass of the wagon is twice as
Figure 3.7. (a) An applied force F on the car of mass m, produces an
acceleration a as the car increases its speed from zero to a ®nal v
during a time t. (b) A force of 3F produces an acceleration 3a, thus
increasing the car's speed from zero to a ®nal 3v during the same time t.
57
SUPERSTRINGS AND OTHER THINGS
Figure 3.8. A larger mass M requires a larger force F to produce a given
acceleration.
large. We should write then, for a ®xed acceleration,
F / m:
In the ®rst case, for a constant mass (the sports car), the applied
force is proportional to the acceleration produced. In the second
case, for a given acceleration (0 to 5 km/h), the applied force is
proportional to the mass of the object, larger for the larger mass
of the wagon and smaller for the smaller mass of the sports car.
We can combine these two proportionality expressions into a
single equation:
F ma
which is the mathematical expression for Newton's Second Law.
In converting from a proportionality to an equality, a constant is
usually introduced. In this case, however, if SI units are used, the
constant is unity. Notice that both F and a are vector quantities.
We can restate the second law as follows:
THE ACCELERATION OF AN OBJECT IS DIRECTLY PROPORTIONAL TO THE NET
FORCE ACTING ON IT AND INVERSELY PROPORTIONAL TO ITS MASS.
The SI unit of force is the newton (N), de®ned as the force that
produces an acceleration of 1 m/s2 when acting on a 1-kg mass.
From this de®nition we can write:
1 N 1 kg m=s2
In Newton's second law, it is the net force, F, acting on an
object of mass m that produces an acceleration a.
58
The Laws of Mechanics: Newton's Laws of Motion
Newton's Third Law: Law of action and reaction
Newton's second law allows us to calculate the average force
required to accelerate a certain sports car from 0 to 60 mph in
9.9 seconds, for example, by determining the car's mass and multiplying it times the acceleration, which we would of course
obtain from the initial and ®nal speeds, and the time taken to
accelerate the car between those two speeds. However, where
does this force come from? What is the source for this force?
The car's engine, which would perhaps come to mind as the
answer, only makes the wheels spin; it does not make the car
go! Moreover, the spinning wheels act on the pavement, not on
the car. Newton gave us the answer to this problem in the form
of his third law:
LAW III: TO EVERY ACTION THERE IS ALWAYS AN EQUAL REACTION: OR, THE
MUTUAL ACTIONS OF TWO BODIES UPON EACH OTHER ARE ALWAYS EQUAL,
AND DIRECTED TO CONTRARY PARTS.
''Whatever draws or presses another is as much drawn or pressed
by that other,'' he explained. Thus, the car's wheels push on the
pavement with a force produced by the engine. This push or
action is matched by a force from the pavement on the wheels,
which make the car go. These two equal forces, the force of the
wheels on the pavement and the force or reaction of the pavement
on the wheels act, as we can see, on different bodies, the pavement
and the wheels.
Newton's third law tells us that forces always come in pairs,
acting on different bodies. A single isolated force acting on a body
without another equal force acting somewhere else cannot exist.
According to the third law, the force that keeps the moon
orbiting around the earth, FEM , is equal and opposite to the
force that attracts the earth towards the moon, FME , (®gure 3.9).
The force of attraction between the earth and any object is
Figure 3.9. The earth and the moon attract each other with equal and
opposite forces.
59
SUPERSTRINGS AND OTHER THINGS
Figure 3.10. The earth pulls on the falling ball with a force that we call
the ball's weight. The ball pulls on the earth with an equal and opposite
force.
called the weight of the object, w. If we drop a ball near the surface
of the earth and neglect air resistance, the force of attraction
between the earth and the ball makes the ball fall towards the
center of the earth with an acceleration of 9.8 m/s2 , which is, as
we learned in chapter 2, the acceleration due to gravity, g.
Newton's second law tells us that this force due to gravity or
weight (w), is given by
w mg
where m is the object's mass. The third law tells us that the object
pulls on the earth with an equal and opposite force. Thus, the ball
pulls on the earth with a force equal in magnitude to its weight
(®gure 3.10). Newton's second law also explains why the acceleration at which the earth ``falls'' towards the ball is negligibly
small, since the force exerted by the ball on the earth is equal in
magnitude to the ball's weight. Therefore,
F w ME a mg:
Since ME is much greater than m, a has to be much smaller than g.
60
The Laws of Mechanics: Newton's Laws of Motion
Figure 3.11. The skaters pull on each other's arms with equal and
opposite forces.
Any two objects interacting with each other obey Newton's
third law. The two skaters of ®gure 3.11 pull on each other's
arms with equal and opposite forces acting on two different
bodies, namely the two skaters.
61
4
ENERGY
What is energy?
Energy is one of the more important concepts in science. It
appears in many forms, including mechanical energy, thermal
energy, electromagnetic energy, chemical energy, and nuclear
energy. Wherever and whenever anything happens, like the
explosion of a distant sun or the falling of a golden leaf from a
tree in autumn, a change in some form of energy is involved. In
spite of our familiarity with the concept of energy, few of us
can de®ne it properly. What is energy? Can we measure it? Can
we touch it?
Energy is an abstract concept introduced by physicists in
order to better understand how nature operates. Because it is an
abstract idea, we cannot form a concrete picture of it in our
minds, and we ®nd it very dif®cult to de®ne it in simple terms.
But we can perhaps understand what it can do. Energy is the
ability to do work. Therefore, before we can fully understand
what this de®nition of energy means we need to know what we
mean in physics by work, a word that we use in our everyday
language.
The concept of work
Work involves an effort directed toward the production of something. In physics, what we understand for work differs somewhat
from our everyday meaning of the word, and we must be careful
to distinguish between the two meanings. A few examples should
illustrate what work means in physics. Suppose your car's battery
62
SUPERSTRINGS AND OTHER THINGS
Figure 4.2. (Top) Pulling a moderately heavy suitcase. Pulling the strap
in an almost vertical direction makes it dif®cult to move the suitcase.
(Bottom) To pull effectively on an extremely heavy suitcase the traveler
must lean forward and pull in a horizontal direction.
We can say that work is a measure of the productivity of a force.
For the simple case of a constant force acting on an object along
the direction of motion of the object, as in ®gures 4.2(c) and
4.3), the work done on the object is the product of the force and
the distance the object moves, or
Work force distance
or
W F d:
When the applied force is not along the direction of motion,
the force can be resolved into two components, one parallel to the
direction of motion and the other perpendicular to it. As we saw
from our discussion, only the parallel component does work.
64
Energy
(a)
(b)
Figure 4.1. (a) Work is done when the force applied by the man pushing
makes the car move some distance. (b) The man pushing the bulldozer
will not be able to move it. In this case, he performs no work on the
bulldozer.
dies and you decide to push the car to the gas station 500 feet along
the road. When you push the car you exert a force on it, even if the
car does not move. However, if the car begins to move as you push
it, then you are doing work on it. The man pushing the stalled car
(®gure 4.1(a)) performs work only if he is able to move it through
some distance. If he attempts to push a bulldozer, he probably will
not be able to move it. In this case the work done is zero, even if the
man gets very tired (®gure 4.1(b)).
Suppose now that a traveler is pulling a heavy suitcase in an
airport (®gure 4.2(a)). The suitcase rolls along the horizontal ¯oor
as the traveler pulls on the strap. We know from our own experience that if the direction in which the strap is pulled is too close to
the vertical (®gure 4.2(b)), the traveler's effort to move the suitcase is not as effective. If the suitcase is extremely heavy, the
traveler must lean forward and pull on the strap in an almost
horizontal direction. In fact, the horizontal direction makes the
force most effective in this case (although this position is probably
uncomfortable for the traveler; as seen in ®gure 4.2(c)). If the
traveler were to pull on the suitcase in a direction perpendicular
to the direction of motion, her effort would be wasted; the force
that she would apply in this case would do nothing for the
motion of the suitcase along the ¯oor.
If the traveler pulls with the same force in all cases described
in the previous situation, the work done on the suitcase will be
maximum when the applied force is in the direction of motion
of the suitcase, zero when the force is at right angles to the direction of motion, and an intermediate value when the force is along
any other direction. The closer the applied force gets to the direction of motion, the more effective it becomes in producing work.
63
Energy
Figure 4.3. The force exerted by the horse is in the same direction as that
of the resultant motion. In this case, the work done by the horse is equal to
the product of the magnitude of the force and the distance traveled.
It is interesting to notice that if a person walks on a horizontal
surface at a steady pace while carrying a suitcase, the force that he
exerts on the suitcase to prevent it from falling to the ground does
not produce work on the suitcase. However, if the suitcase is
heavy and the person has to walk a long distance, he begins to
sweat. Why should he sweat if he is doing no work? Even if he
is standing still while holding the suitcase, he will get tired. The
fact is that in this case there actually is motion inside the man's
arm and work is being done on the muscle ®bers. While the
man is holding the suitcase, nerve impulses are continuously
reaching the muscles in his arm. When these nerve impulses
reach a muscle ®ber, the ®ber lurches for an instant and then
relaxes. At any one time, large numbers of ®bers are tightening
up while the rest are relaxing. Since we were analyzing the
motion of the suitcase held by the man, a situation external to
the activity of the muscle ®bers inside the man's arm, we concluded correctly that no work was done on the suitcase.
65
SUPERSTRINGS AND OTHER THINGS
Units of work and energy
The SI unit of energy (and work, since energy is the ability to do
work, and thus must have the same units) is the joule ( J). This unit
is named in honor of the English physicist James Prescott Joule
(1818±1889) whose work clari®ed the concepts of work and
energy. A joule combines the units of force and distance:
1 J 1 Nm
We can illustrate this unit with two examples. It requires about
one joule of work to lift a baseball from the ground to your
chest, whereas it takes about ten joules of energy to pick up an
average physics textbook from the bottom shelf of a bookcase
and stand up to read it.
When dealing with the energies of atoms or electrons,
though, the joule is too large a unit. For these purposes, another
unit, the electron volt (eV), is used. The conversion factor between
electron volts and joules is
1 eV 1:602 10ÿ19 J:
A frequently-used multiple of the eV is the MeV which equals one
million eV.
The concept of energy
The concept of work is very useful in understanding the concept
of energy. As we stated at the beginning of the chapter, energy is
the capacity to do work; that is, energy allows us to perform tasks,
to do work. We can also think of energy as the result of doing
work. It is the chemical energy stored in the man's body in
®gure 4.4 what enables him to do the work on the car as he
pushes it, converting chemical energy into energy of motion of
the car and into heat (thermal energy) as the tires rub against
the pavement.
This idea of energy as something stored that can do work was
called ®rst vis viva (Latin for ``living force'') by the German philosopher Gottfried W. Leibniz (1646±1716), because he thought that
only living things could have the capacity to do work. The English
scientist Thomas Young (1773±1829) realized that inanimate
66
Energy
Pioneers of Physics: James Prescott Joule
(1818±1889)
The second son of a wealthy brewer, James Joule had a good
early education. As a young man, he was taught by the
renowned chemist John Dalton and showed a talent for
science. At 19, he did several experiments investigating the
nature of electromagnets, which resulted in a published paper.
Born in 1818 near Manchester, England, Joule developed
an early interest for the machines in his father's large brewery. This interest made him pro®cient at designing experiments and building the machines required to run them. He
soon developed an almost fanatical zeal for accurate measurements. Such was his dedication that he even took time
during his honeymoon to design a special thermometer
with which to measure the temperature difference at the
top and at the bottom of a waterfall he and his bride visited.
When Joule was 15, his father became ill and retired.
Although the young James had to spend time running the
brewery, he continued his scienti®c endeavors. At 22, he calculated the amount of energy produced by an electric current
and went on to spend the next 10 years devising experiments
to measure energy in every conceivable way.
The initial report of his experiments was met with skepticism and even rejection. The Royal Society did not accept
his original paper and Joule was forced to present his results
at a public lecture. His report was ®nally published in the
Manchester newspaper at the instigation of his brother,
who was the paper's music critic.
Eventually, his work caught the attention of other scientists and Joule gained the recognition he deserved. He was
elected to the Royal Society in 1850 and years later became president of the British Association for the Advancement of
Science. In 1854, his wife died after only six years of marriage
and Joule, deeply distressed, retreated to his work. In 1875, he
began to have ®nancial dif®culties and Queen Victoria granted
him a pension. Toward the end of his life, he became concerned and disturbed about the applications of his work to
warfare. He died in 1889 at the age of 71 from a long illness.
67
SUPERSTRINGS AND OTHER THINGS
Figure 4.4. Energy is the capacity to do work. The chemical energy in
the man's body enables him to do work on the car.
objects, like the wind, can do work by moving a windmill or a
ship, for example. He proposed the name energy, a name he
fashioned from Greek words meaning ``work within,'' for this
work stored in bodies.
Of the various types of energy listed at the beginning of the
chapter, we will only consider mechanical energy for the
moment. Later in the book, most of the other kinds of energy
will be studied in some detail. An object may have mechanical
energy by virtue of its state of motion, its location in space or
its internal structure.
Energy of motion
As we have seen, work involves forces and motion. An object in
motion has the capacity to do work: running water can turn a
millstone, a gusty wind sets a windmill in motion and drives a
sailing ship, and a truck ramming into the rear of a small car at
a traf®c light will surely move it some distance!
Thus, an object in motion has energy. It is the motion of the
object that causes it to contain energy. Still water does not turn a
millstone and still air does not drive a sailing ship. We call the
energy of an object in motion kinetic energy. The word ``kinetic''
was ®rst introduced by the English physicist Lord Kelvin in
1856 and comes from a Greek word that means ``motion.'' The
amount of kinetic energy that an object has depends on its mass
and on its speed. Thus, a large truck traveling at the same speed
68
Energy
as a small sports car would have more kinetic energy due to its
larger mass. Likewise, a runner would have more kinetic energy
than a person of similar weight walking along the same path,
due to the runner's greater speed.
Kinetic energy ± the energy that an object has by virtue of its
motion ± is thus proportional to the mass of the object and to its
speed. It is equal to one-half the product of the mass, m, and
the square of the speed, v2 :
Kinetic Energy KE 12 mv2 :
Energy of position
A snowball at rest at the top of a hill has no kinetic energy since it
is not moving. However, it is potentially capable of doing work on
a snowman at the bottom of the cliff, if it is set into motion by the
boy (®gure 4.5). This type of energy, which we call (gravitational)
potential energy is due to the object's separation from the earth. It is
called ``potential'' because energy has been stored for later use,
and it is called gravitational because the gravitational force of
attraction of the earth does work on the object as it falls towards
the ground. Notice that unless the boy pushes it, the snowball
will not reach the ground below where it can harm the snowman. If the snowball is never pushed, it will never become
separated from the earth, remaining on the ground at the top of
the cliff. When it is pushed, it acquires a separation from the
earth equal to the height of the cliff. What we call ground, then,
is actually the lowest position the object can reach in a particular
situation. This lowest position or ground is the reference level from
which the position of the object is measured. We are free to
choose an arbitrary reference level that better suits our particular
situation.
The gravitational potential energy depends also on the
object's mass. Thus, a tree falling on a house after a storm does
much more damage (more work!) than a walnut falling from a
standing tree on the roof of the same house. If an object's
height changes, so does its potential energy because the distance
to the ground increases. If you lift a box full of books from the
ground and place it on a chair, the box acquires potential
energy because of its height with respect to the ground. This
69
SUPERSTRINGS AND OTHER THINGS
Figure 4.5. The snowball has gravitational potential energy by virtue of
its position with respect to the ground. If pushed off the cliff, its potential
energy will allow it to do work on the snowman below.
potential energy comes from the work done by your muscles in
lifting the box. If you now decide to place the box on the table,
the potential energy of the box increases by an amount proportional to the increase in height. The increase in potential energy
results from the additional work that you have to do to lift the
box from the chair up to the table.
We can summarize the previous discussion as follows:
Gravitational potential energy is the energy that an object has by
virtue of its separation from the earth's surface. Potential energy is
proportional to the mass of the object and to the height above an
arbitrary reference level.
It is not dif®cult to obtain the exact expression for the gravitational potential energy. Consider the boy lifting the baseball
of mass m in ®gure 4.6. The potential energy of the ball before
70
Energy
Figure 4.6. As the boy slowly lifts the ball, the ball gains gravitational
potential energy.
the boy picks it up is zero, if we chose the ¯oor as our reference
level. As the boy slowly lifts the ball, the potential energy
increases until the ball reaches the table. The work done by the
boy in lifting the ball slowly, without acceleration, is equal to
the force applied by the boy times the distance traveled by the
ball, which is equal to the height of the table, h. The magnitude
of the force applied by the boy equals the ball's weight, mg. The
force is in the same direction to that of the motion. Therefore,
W Fh mgh:
Thus, W mgh is the work done by the boy on the ball. When the
ball is resting on the table, it has no speed, and therefore no
kinetic energy. It, however, has stored energy by virtue of being
71
SUPERSTRINGS AND OTHER THINGS
Figure 4.7. Different kinds of potential energy. (a) The gravitational
potential energy of the weights increases as the weights are lifted. (b)
The magnetic potential energy increases as the two magnets are brought
closer together with their north poles facing. (c) The elastic potential
energy of the spring increases as it is compressed.
at a height h above the ¯oor; in other words, by being at a distance
h above the reference level. If the boy decides to push the ball off
the table, it would, of course, fall down and gain kinetic energy,
which would have come from the stored energy when it was
resting on the table. This stored energy is what we call potential
energy. We can write for potential energy, then,
PEgrav mgh:
There are other kinds of potential energy (®gure 4.7). When
we push two magnets together with their north poles facing,
there is an increase in magnetic potential energy. If you compress
a spring by holding it in between the palms of your hands and
pushing, the elastic potential energy of the spring increases.
In all these cases, the potential energy is stored in the entire
system of interacting bodies. After we bring the two magnets
together, for example, we could hold either one of the two
magnets in place and let the other move away. Since either
magnet can be released and allowed to move away from the
other magnet, making use of the available potential energy, this
magnetic potential energy must reside in the system of the two
interacting magnets. When we lift a baseball up to a certain
height and then release it so that it falls towards the earth, it
seems as if the potential energy belongs only to the baseball.
72
Energy
However, if we could devise a method to secure the baseball in
space with respect to the sun, the earth would ``fall'' towards
the baseball. The gravitational potential energy in this case
resides in the earth-baseball system .
Elastic potential energy
The stretched spring in ®gure 4.8 has stored energy. We call this
energy elastic potential energy. If we pull on the spring with a
force F, the increase in length x is proportional to the stretching
force (as long as the spring is not stretched too much). We
can write for the force exerted by the person pulling on the
spring:
F kx
where k is called the force constant of the spring. This force law is
known as Hooke's Law. By Newton's Third Law, the spring
exerts an equal and opposite force on the person pulling, or
F ÿkx:
The applied force increases from 0, when the spring is not
stretched, to kx, when the spring has been stretched a distance
x. Thus the average force is the sum of these two values divided
Figure 4.8. The spring is stretched from its equilibrium position by a
force that is proportional to the displacement x.
73
SUPERSTRINGS AND OTHER THINGS
by two, or
0 kx 1 kx:
F
2
2
The work done in stretching the spring from the equilibrium
position out to a distance x is the product of this average force
and the displacement x:
F
W Fx
1
2 kxx
12 kx2 :
This work is converted into elastic potential energy in the spring:
PEelastic 12 kx2 :
The work-energy theorem
When you throw a bowling ball you do work on it as you push the
ball through some distance. The bowling ball gains speed and its
kinetic energy increases. After the ball leaves your hand, it travels
along the lane and hits the pins, pushing them down, thereby
doing work on them. The kinetic energy that the ball acquired
came from the work done on it. The work that the ball does on
the pins comes at the expense of some of its kinetic energy: the
ball slows down after it hits the pins. In this case, work is being
converted into kinetic energy (and some of that kinetic energy
is being converted back into work).
We had observed when discussing gravitational energy that
the work done by the boy of ®gure 4.6 in lifting the ball up to a
certain height h was W mgh. The potential energy acquired
by the ball when lifted to this height comes from the work
that the boy does on it. If the boy lifts the ball without accelerating it, all the work done on the ball is converted into potential
energy.
In general, work can be converted into both, kinetic and
potential energy. This statement is what we call the work-energy
theorem. When a basketball player shoots a basket, the work
done on the ball is changed into an increase in the kinetic
energy of the ball, as it accelerates in the player's hands, and
into potential energy as the ball gains height.
74
Energy
Conservative and nonconservative forces
When we lift an object of mass m, initially at rest on the ground,
slowly up to a height h, the work that we do on the object is
W mgh:
If the object is a book with a mass of 1 kg and we lift it to a
height of 0.5 m, the work done on the book would be W
1 kg 9:8 m=s2 0:5 m 4:9 J. Now, suppose that we pick up
the same book from a shelf that is 1 meter above the ¯oor and
place it on another shelf 1.5 meters high (see ®gure 4.9). The
change in height would be h 1:5 m ÿ 1 m 0:5 m and the
work done by us in moving the book is the same 4.9 J. This tells
us that the work done against the force of gravity on the book
of mass m depends only on the difference in heights as we lift it
from 0.5 m to 1 m above the ground; that is, the work depends
Figure 4.9. Three different paths that can be followed to move a book
from a shelf 1 m high to a second shelf 1.5 m high. The work done against
gravity is the same in all three cases.
75
SUPERSTRINGS AND OTHER THINGS
on the initial and ®nal heights. Motion perpendicular to the direction of the force of gravity contributes nothing to the work done
on the book. The work done does not depend on the path through
which the book moved between the two points. We could move
it following a straight path between the two end points (path 1
in ®gure 4.9) or we could move it to the side ®rst, then up, then
to the other side so that it ends at the same end point (path 2);
we could, in fact, move the book following any trajectory that
begins and ends at the same points (as in path 3); the work
done will always be the same. A force with the property to
produce work that is independent of the path, such as the gravitational force, is called a conservative force.
If, on the other hand, you reach for a book that lies on your
desk at the other end of where you are and slide it towards you
following a straight path (®gure 4.10), the work done against the
force of friction that exists between the book and the surface of
Figure 4.10. The work done against the force of friction is greater when
the woman slides the book along the longer path 2 than along the straight
path 1. The force of friction is a nonconservative force.
76
Energy
the desk is less than if you decide to slide the book between the
same initial and ®nal points following some other path longer
than the previous straight path. In this case, the work done against
the force of friction does depend on the path taken. We call these
forces, like the force of friction, nonconservative forces.
77
5
CONSERVATION OF
ENERGY AND
MOMENTUM
Transformation of energy
As we have learnt in the previous chapter, the amount of work
done on an object equals the energy transformed from one form
to another. The chemical energy in our bodies is transformed
into potential energy as we lift a baseball, and this energy is
further transformed into kinetic energy as we drop the ball and
it falls with increasing speed towards the ground. Energy, then,
can be transformed from one form to another. Before the boulder
in ®gure 5.1 starts rolling down the cliff its energy is potential. As
the boulder falls, its potential energy is continuously converted
into kinetic energy. When it reaches the ground below, the
boulder has no potential energy.
We can better understand how energy transforms from one
type to another with the example of a girl on a swing (®gure
5.2). When the swing is at H it momentarily stops to reverse direction, so its kinetic energy is zero there, but its potential energy is
maximum because it has reached its maximum height. At point L
the situation is reversed: the swing ¯ies pass that point at its
maximum speed (maximum kinetic energy) but its potential
energy is zero because it cannot fall any further. This maximum
kinetic energy begins to decrease as the swing moves away
from the lowest position L. At the same time, the potential
energy increases as the swing approaches H0 (which is at the
same height as H) where it momentarily stops. At this point,
the kinetic energy is again zero.
Notice that we have chosen the lowest position of the swing
(point L) as the point at which the potential energy is zero.
78
Conservation of Energy and Momentum
Figure 5.1. The potential energy of the boulder will be converted into
kinetic energy when the boulder rolls down the cliff.
However, we could have chosen the ground as the point of
zero potential energy. In this case, the potential energy at point
L has a minimum value which is not zero. Since the swing
never reaches the ground, the potential energy decreases from
its maximum value at H to its minimum at L, while the kinetic
energy changes from zero at H to its maximum value at L. It is
then the difference in the potential energy at the heighest point
and at the lowest point that is important. As we stated in the
previous chapter, the reference level at which the potential
energy is zero is arbitrary.
In most cases, it is more convenient to use the lowest position
that the object under consideration can take as the reference
point; that is the point of zero potential energy.
79
SUPERSTRINGS AND OTHER THINGS
Figure 5.2. The girl on the swing sways from a maximum height H to a
lowest point L. The potential energy at H is transformed into kinetic
energy, which reaches a maximum value at L.
The principle of conservation of energy
The example of the girl on the swing described in the previous
section is, of course, an ideal situation. We know from experience
that the swing will not reach the same height unless the girl
``pumps.'' In the ideal situation, however, assuming that there
is no friction, the girl on the swing will continue oscillating
between points H and H 0 , forever exchanging kinetic energy
and potential energy and vice versa. As one form of energy
decreases, the other increases with the total amount remaining
constant. This constant value is called the total mechanical energy
(E) of the system. Clearly, at any point, the sum of the two
energies has to be the same. We can express this relation as kinetic
energy potential energy total mechanical energy constant.
This, the principle of conservation of mechanical energy, states
that the sum of the kinetic energy (KE) and potential energy
(PE) of an isolated system remains constant. An isolated system
is one that experiences no external forces and into or out of
80
Conservation of Energy and Momentum
which there is no ¯ow of energy. In the case of our idealized
swing ± that is, one which neither interacts with the air nor has
friction in the supports ± the isolated system consists of the
swing, the girl, and the earth.
In the ideal situations described so far, the forces acting on
the system are conservative forces. In general, however, both
conservative and nonconservative forces (like friction) act on a
system. In reality, as we have stated before, when the girl on
the swing of the earlier example swings back and forth, she
pushes the air aside, losing some of the energy. Even if we replace
the girl with a mannequin, enclose the entire swing in a big
container and evacuate the air, the swing will still not reach the
same height at every swing. The metal hooks holding the swing
rub against the bar, however well lubricated they might be,
producing heat, which is a form of energy, in the same way that
rubbing your hands together when you are cold makes your
hands warmer. This heat or thermal energy is dissipated into the
environment, producing an energy loss. As the girl swings,
mechanical energy is continuously transformed into thermal
energy and this results in a decrease in height with each swing.
This rubbing of parts of the system against each other is what
we call friction. Likewise, when the child playing in a park
slides down a slide some of her kinetic energy is converted into
heat or thermal energy. These situations do not violate the
principle of conservation of mechanical energy as stated above,
however, since thermal energy is not a form of mechanical
energy.
Could we include nonconservative forces and still have
conservation of energy? If we consider all forms of energy in a
system, we can expand the principle of conservation of mechanical energy to a more general principle of conservation of energy,
which can be stated as follows:
Energy is neither created nor destroyed; it only changes from one kind
to another.
The energy of mass
In 1905, Einstein extended the principle of conservation of energy
still further to include mass. In a beautiful paper written when he
81
SUPERSTRINGS AND OTHER THINGS
was 26 years old ± the ®fth scienti®c paper that he published that
year ± Einstein deduced that mass and energy were equivalent.
The famous formula E mc2 gives the energy equivalent of a
mass m; c is the speed of light.
According to Einstein, the mass of an object is a form of
energy. Conversely, energy is a form of mass. For example, the
combination of one pound of hydrogen with four pounds of
oxygen to form water releases enough energy to run a hair
dryer for about 10 hours. If we had an extremely precise balance,
we would discover that the mass of the water formed is less than
the total mass of the oxygen and hydrogen used by about one part
per billion. The mass loss is exactly equivalent to the energy
released in the process. A more dramatic example ± as we shall
see in Chapter 23 ± is the release of energy in nuclear reactions.
If the same amount of hydrogen were to be used in a nuclear
reaction, we could obtain about ten million times more energy.
As in the chemical process, the end product would weigh less
than the original material. The difference in mass is converted
into energy.
Einstein's extension of the principle of conservation of
energy is a profound generalization. In everyday life, however,
the limited principle of conservation of energy is suf®cient. A
¯ying bird has more energy than a bird standing on the branch
of a tree, but the increase in the bird's mass due to its greater
energy when ¯ying is so small that it cannot be measured by
any experiment.
Ef®ciency
The principle of conservation of energy tells us that energy can
change its form but it never disappears. In any process that
involves friction, however, we lose control of some energy; it dissipates into the environment. The moving parts of an automobile
engine, for example, require a lubricant to minimize friction.
When we fail to add oil to the engine, it overheats and the heat
produced by friction dissipates into the environment. Even the
most ef®cient engine cannot regain energy lost through friction.
According to the principle of conservation of energy, if there
are any energy losses, the input energy must equal the output
82
Conservation of Energy and Momentum
Pioneers of physics: The physicists' letters
``I have heard rumors that you are on the war path and wanting to upset Conservation of Energy, both microscopically
and macroscopically. I will wait and see before expressing
an opinion, but I always feel `there are more things in
heaven and Earth than are dreamed of in our Philosophy.' ''
Thus wrote the New Zealand-born physicist Ernest Rutherford in a letter to the great Danish physicist Niels Bohr in
November 1929.
Rutherford's letter was referring to a discovery that had
been made a few years before concerning the behavior of
some subatomic particles. It had been observed that the
kinetic energies of the particles emitted in certain radioactive
processes were not in accordance with the laws governing the
motion of subatomic particles. It appeared as if some energy
was not accounted for in spite of the best efforts of the
experimental physicists. Bohr, a towering ®gure in twentieth
century physics (whom we will meet in chapter 7), began
to doubt the validity of the principle of conservation of
energy in these processes and the letter from Rutherford
(whom we will also meet in chapter 7) showed his great
concern.
In February of that same year the German theoretical
physicist Wolfgang Pauli had written in a letter to his
friend the physicist Oscar Klein that ``with his consideration
about a violation of the energy law, Bohr is on a completely
wrong track.'' By December of the following year Pauli
would write a letter to the physicists attending a conference
in TuÈbingen that began: ``Dear radioactive ladies and gentlemen, I have come upon a desperate way out regarding the
[problem of the energy violation]. To wit, the possibility
that there could exist . . . neutral particles which I shall call
neutrons. The [energy violation problem] would then
become understandable from the assumption that . . . a neutron is emitted along with the [other particles] in such a
way that the sum of the energies . . . is constant.''
``For the time being,'' Pauli continued in his letter, ``I
dare not publish anything about this idea and address
83
SUPERSTRINGS AND OTHER THINGS
myself con®dentially ®rst to you, dear radioactive ones, with
the question how it would be with the experimental proof of
such a neutron . . .'' He closes his letter with the following: ``I
admit that my way out may not seem very probable a priori
since one would probably have seen the neutrons a long
time ago if they exist. But only he who dares wins.''
Pauli won. His ``neutron,'' rechristened the neutrino, for
``little neutral one'' by the Italian physicist Enrico Fermi,
was discovered twenty-®ve years later.
Physicists have great faith in the principle of conservation of energy. Whenever a new phenomenon seems to
violate this principle, physicists invariably look for some
hidden object or particle that could account for the missing
energy rather than accept a violation of this principle. In
addition to the neutrino, many other particles have been discovered this way.
work plus the energy losses. Of course, the smaller the energy
losses are, the greater the output work becomes. Thus, if we minimize the energy losses, the output work increases, approaching
the input energy. Because a real machine always has energy
losses, the output work is always less than the input energy.
The ratio of the output work to the input energy can give us a
way of comparing how ef®cient different machines are. We
de®ne the ef®ciency of a machine by the relationship
efficiency
work or energy out
work or energy in
or, in symbols,
"
WOUT
< 1:
EIN
", the Greek letter epsilon, is the standard symbol for ef®ciency.
Ef®ciency is often expressed as a percentage by multiplying
it by 100. Table 5.1 shows the ef®ciency of some energy conversion devices.
84
Conservation of Energy and Momentum
Table 5.1 Ef®ciencies of some energy conversion devices.
Ef®ciency, " (%)
Incandescent Lamp
Fluorescent Lamp
Automobile Engine (gasoline)
Automobile Engine (diesel)
Home Oil Furnace
Electric Motor
Electric Generator
4
20
25
35
65
95
99
Power
The Great Pyramid at Giza in Egypt was constructed by thousands of Egyptian peasants, who worked for years transporting,
preparing, and laying about 2 300 000 blocks of an average
weight of 2 12 tons. If a similar project were to be undertaken
today using the same materials but modern equipment and techniques, the task could be accomplished in a much shorter time but
the total work done would be the same, since the same 2 300 000
blocks would have to be lifted to the same heights. What we call
power, however, has changed by using different methods of
construction that allow the project to be completed in less time.
Power is the rate at which work is done. If we use P to indicate
power, we can write
P
W
t
and since energy is the ability to do work, power can also be
expressed in terms of the energy used per unit time, or
P
E
:
t
It was James Watt (1736±1819), a Scottish engineer, who
during his pioneering experiments to improve the steam engine
®rst attempted to describe the power of an engine by comparing
it with the power exerted by an average horse. He determined
that a horse could lift a 550-pound weight slightly less than
four feet in 4 seconds. This led him to assume that an average
horse could lift 550 pounds up to a height of one foot in one
85
SUPERSTRINGS AND OTHER THINGS
Physics in our world: Automobile ef®ciency
Today, a ¯eet of 800 million vehicles travel the world's roads.
These vehicles consume half of the world's oil and produce
carbon monoxide and other gases that are harmful to the
global environment. Most of the industrialized nations have
implemented policies to reduce automobile emissions and
conserve energy.
86
Conservation of Energy and Momentum
Most energy-saving procedures involve new engine and
body designs to make them more ef®cient. These designs
have reduced the average consumption of gasoline in the
U.S. during the last 15 years by about half.
Energy losses occur in the engine itself. Friction among
the different engine components converts some of the available energy into heat, which escapes into the environment.
Additional energy losses take place in the transmission,
where friction removes some of the energy. Finally, friction
between the tires and the pavement and the air resistance
that appears as the vehicle moves (aerodynamic drag) produce additional losses in ef®ciency.
Manufacturers in recent years have experimented with
engine designs that reduce energy losses. Some of these
engines involve new lightweight materials and new computerized fuel injection designs. One company has built an experimental automobile with a computer-controlled system that
shuts down the engine when the car is decelerating and
restarts it when the accelerator is pressed.
To improve the ef®ciency of the transmission, where the
goal is to keep the engine under the highest load required
without wasting power, new computerized gear-changing
mechanisms that optimize engine performance are being considered. Adding more gears also improves ef®ciency. For this
reason a belt drive, which is equivalent to an in®nite number
of gears, increases transmission ef®ciency.
Finally, reducing the vehicle's weight with plastics, aluminum, and high-grade steel reduces the rolling resistance of
the tires. Aerodynamic styling also improves the appearance
of the new automobile designs and at the same time reduces
drag.
second. He de®ned this rate of performing work as 1 horsepower
(hp). Thus
1 hp 550 ft lb=s:
The SI unit of power is the watt (W), named in honor of James
Watt. 1 watt is 1 joule per second, that is, 1 W 1 J=s. This unit is
87
SUPERSTRINGS AND OTHER THINGS
the same unit as we encounter in the description of electrical
appliances. A 1500-watt electric heater uses 1500 watts of
power, which means that it uses 1500 joules of electrical energy
every second. A 1000-watt hair drier uses 1000 joules of electrical
energy per second. The conversion factor from horsepower to
watts is
1 hp 746 W:
There is a unit of energy that is derived from the expression of
energy in terms of power. From our de®nition of power, we
can write for energy
E Pt:
If we operate a 1000-watt electric heater for 1 hour, the total
energy used is 1 kilowatt-hour:
E 1000 W 1 h 1 kW h:
This unit may be familiar to the reader, as it is used by power
companies in their monthly statements to indicate the amount
of electrical energy used.
We can calculate the power required to move an object at a
constant speed v against a constant force F, as when pushing it
up a ramp, by expressing the work done as
W Fd
where d is the displacement of the object caused by the applied
force F. Since d=t is the speed v, power is then
P
W Fd
Fv:
t
t
Impulse and momentum
You are freewheeling down a small hill on your bicycle and when
the road starts going uphill, you decide that you are too tired to
pedal, so you coast. As we all know from experience, the bicycle
does not stop moving the very moment one stops pedaling. It
keeps going, moving against the force of friction that exists
between the tires and the pavement and the much greater friction
from the resistance of the air. The force that overcame the force of
88
Conservation of Energy and Momentum
friction is no longer there. However, the bicycle keeps going
because it has momentum. In fact, on ¯at ground, if we were to
remove all frictional forces, the bicycle would keep moving
forever. Newton's ®rst law tells us that this is the case: unless
there is a force, an object continues moving in the same direction
and with the same speed. As we shall see, the concept of momentum helps us quantify what we already know from Newton's ®rst
law.
Let us return to the example with the bicycle. Suppose now
that a 50-kg boy is riding his bicycle on ¯at ground. As soon as
the boy reaches a speed of 20 km/h, he stops pedaling and
coasts the rest of the way. After he returns, his friend, who
weighs 60 kg, decides to see how far she is able to coast using
her friend's bicycle. When she reaches the same speed of
20 km/h she also stops pedaling, and she coasts for a longer distance, as we would have been able to predict even without being
present at this experiment. The girl, by virtue of her larger mass,
has a larger momentum.
In a second phase of this experiment, the boy asks his
brother, who is a year older but weighs the same 50 kg, to see
how far he can coast. His brother is stronger, however, and is
able to reach a speed of 30 km/h before he stops pedaling.
Again, we would agree that the brother will coast for a longer distance, simply because he is moving with a greater speed before
starting to coast. For this second experiment, we can say that
the brother has a larger momentum.
Momentum depends on both of these quantities. It is proportional to the product of the mass of the object and the object's
speed. If we use the letter p to indicate momentum, we can write,
p mv:
The units of momentum are the units of mass times the units of
speed.
Momentum is a vector quantity. In vector form, it is proportional to the velocity of the body and is de®ned as
p mv:
Since momentum is a vector quantity, an object could be moving
with a constant speed and still change its momentum due to the
fact that velocity, being a vector, can change direction even if its
89
SUPERSTRINGS AND OTHER THINGS
magnitude (the speed) remains constant. An object moving in
circles at a constant rate is one case where the speed is constant
but there is a change in momentum.
If we push a lawn mower of mass m across a yard, the lawn
mower accelerates. Since momentum is the product of mass and
velocity, a changing velocity implies a changing momentum.
Thus, if we apply a force to an object for any length of time, the
momentum of the object changes. Push a stalled car for a few
Physics in our world: Air bags
Air bag systems that protect automobile drivers and, in
newer automobiles, the passengers too, are probably the
most effective of the automatic or passive safety devices
currently used in automobiles. When a sudden decrease in
speed is detected by an electronic sensor an electric contact
is closed which causes a small explosive to rapidly release
nitrogen gas into the bag. (Nitrogen comprises 78% of the
air we breathe.)
Air bags are designed to pop out only in a frontal crash
that is equivalent to hitting a wall at 12 miles per hour. At that
speed, the force that the wall exerts on the automobile to stop
it would cause severe injury or death to the driver in only
1=8 s. Since the air bag pops out in just 1=25 s, the driver
hits the cushion of nitrogen gas in the air bag. Because the
bag deforms, the driver continues moving into the bag until
®nally stopped.
Although the change in momentum is the same since the
car is stopped completely, the time during which the force
acts on the driver is longer. The change in momentum
equals the impulse, which in turn is the product of the net
force F times the time t during which the force acts. Since
this product Ft is the same with or without air bag (it
equals the change in momentum), a larger value of t implies
a smaller value of the force F. This smaller force should not
injure the driver.
After the impact, the nitrogen escapes through the fabric
of the bag and the bag rapidly de¯ates to avoid suffocating
the driver.
90
Conservation of Energy and Momentum
seconds and the momentum of the car changes by a small
amount. Push with the same force for one whole minute, and
the car's momentum increases by a greater amount. The product
of the applied force and the time during which the force is
applied, F t, is called impulse, I, and this impulse is equal to
the change in momentum of the body. In symbols:
I Ft p:
The units of impulse are units of force times units of time. The SI
unit of impulse is the Newton second (N s). According to our
previous equation, this is also the unit of momentum.
It is because impulse is equal to the change in momentum
that we can see why it is advantageous to bend our knees to
absorb the shock when jumping (or falling!) from a certain
height. In this particular case the change in momentum is ®xed;
we arrive at the ground with a momentum that is determined
by the height from where we jumped and we must stop. The
momentum must change from the value just before hitting the
ground to zero. How fast this change takes place determines
the magnitude of the force that we must exert with our legs.
The longer we take to stop by ¯exing our knees, the smaller the
average force.
Conservation of momentum
When we look at the world around us, we notice that all objects
that are moving eventually stop, that even the most modern
machines cannot run forever. Skylab, the ®rst space laboratory,
came down to earth in fragments after circumnavigating the
Earth more than 31 000 times. Even motion in the heavens is
not forever. Stars explode and disappear, together with their
possible planetary systems. In a few billion years, the time will
come for our own Sun to brighten until it swells and engulfs
Mercury and Venus, disturbing the motions of the remaining
planets.
An object set in motion with respect to an observer will eventually stop owing to its interaction with other objects in the
universe. When objects interact, however, there is still a quantity,
related to the motion of the objects, that does not change. This
91
SUPERSTRINGS AND OTHER THINGS
Figure 5.3. Two equal carts approach each other with the same speed v.
After the collision they recede from each other at the same speed, if we
neglect friction.
quantity of motion is the total momentum of the interacting bodies.
As an illustration of this, let's consider two identical carts with
well-greased wheels moving towards each other at the same
speed (®gure 5.3). If the cart bumpers were to be ®tted with perfectly elastic springs, you would notice that after the collision
takes place, the two carts would be moving in opposite directions
at the same speed. In fact if you were to measure the speeds of
each cart before and after the collision with an instrument, you
would ®nd that they are almost exactly equal, the friction in the
wheels accounting for the small difference. If the friction is
reduced, so is the difference in the speeds before and after. Considering the velocities to include also the direction of motion of
the carts, we can easily see that the sum of the velocity v1 of
one cart and the velocity v2 of the other cart does not change
with the collision of the carts. We can then say that, neglecting
friction, velocity is a conserved quantity for this particular case.
In a second part of this experiment, we add weights to one of
the carts so as to double its mass and set them in motion so that
again, the two carts approach each other with the same speeds
(®gure 5.4). In this case, however, the two carts do not maintain
their speeds after the collision. Rather, the light cart bounces
back with a larger speed than that of the cart carrying the big
mass. Certainly, velocity is not a conserved quantity for this
more general case. The different masses of the carts prevented
the vector sum of the velocities from remaining unchanged.
Could momentum be conserved, then? Multiplying the velocity
92
Conservation of Energy and Momentum
Figure 5.4. Adding a weight to cart 2 to increase its mass allows us to
see that the velocity is not the conserved quantity. We need to take
into account the different masses. It is the total momentum of the two
carts that is conserved.
times its mass we obtain the momentum of each cart and adding
them together, we discover that the total momentum of the carts is
conserved. Representing the momenta of the two carts after the
collision with primes, we have
p1 p2 p01 p02
or
pbefore pafter :
This is a very important law in physics. It is called the principle of
conservation of momentum, and can be stated as follows: If no net
external force acts on a system, the total momentum of the system is
conserved. This law is general and universal, which means that
it applies to any system of bodies anywhere in the universe on
which no external forces are acting. It holds regardless of the
type of force that the bodies exert on each other.
The principle of conservation of momentum was clearly
stated ®rst by the Dutch physicist Christiaan Huyghens, one of
the most gifted of Newton's contemporaries. Employing kinematical analyses and the methods of ancient geometers rather
than the modern analytical methods of dynamics, (at the time
known only to Newton), Huyghens extended the work of
John Wallis and Robert Boyle who, responding to a challenge
by the Royal Society to investigate the behavior of colliding
bodies, suggested in 1668 that the product mv was conserved in
collisions.
93
SUPERSTRINGS AND OTHER THINGS
Elastic and inelastic collisions
You might have seen displayed in novelty shops a small device
consisting of ®ve or six shiny steel balls attached by means of
two threads to two parallel, horizontal rods (see ®gure 5.5). The
device is often called Newton's cradle. When one of the two end
balls is raised up to a certain height and allowed to swing
down to collide with the next ball, which is at rest, the momentum
of the ®rst ball is transmitted through all the other stationary balls
to the last one, which swings up to nearly the same height. This
motion continues for several swings.
In 1666, Christiaan Huyghens saw a demonstration before the
recently founded Royal Society of London of a similar experiment,
Figure 5.5. After the swinging ball collides with the ®rst stationary ball,
the momentum of the swinging ball is transmitted to this stationary ball
which collides with the next, until ®nally the last ball swings up to nearly
the same height. Conservation of momentum alone does not explain
completely the motion of these balls.
94
Conservation of Energy and Momentum
with two balls. For the two following years, the members of the
Royal Society argued as to why the struck ball would swing up
to nearly the same height while the ®rst ball would stop completely after the collision. According to the principle of conservation of momentum, the momentum of the swinging ball before the
collision must equal the momentum of all the moving balls after
the collision, without the need to specify how the balls must move.
In 1668, Huyghens explained to the Royal Society that there
was another conservation principle involved in this process. In
addition to momentum, the product mv2 was also a conserved
quantity. In other words, according to Huyghens, the sum of
the products mv2 for all the moving objects before the collision
must equal the sum of the products mv2 after the collision. This
product (divided by 2) is what we call today kinetic energy.
Thus, in addition to momentum, kinetic energy is also conserved
in this type of collision. Collisions where kinetic energy is also a
conserved quantity are called elastic collisions. Collisions involving objects that do not lose their shape or heat up in any way
are elastic collisions. The perfectly elastic bumpers are restored
to their original shapes immediately after the collision, and two
``perfectly hard'' spheres would collide without any distortion
of their shapes. Of course ``perfectly elastic'' springs and ``perfectly hard'' balls do not exist, which is why the second ball in
the Royal Society demonstration rose to nearly but not exactly
the same height as that from which the ®rst ball was released.
Collisions where the kinetic energy is not conserved are
called inelastic collisions. The collision of two automobiles is an
example of an inelastic collision. All types of collision conserve
momentum, however.
Cannons and rockets
Let's apply the law of conservation of momentum to the ®ring of
a cannon and a rocket. Consider ®rst the cannon of ®gure 5.6,
where we have called M the mass of the cannon and m the
mass of the cannonball. Obviously, as the cannon and cannonball
are at rest before ®ring, the total momentum of our system,
cannon plus cannonball, is zero. Conservation of momentum
tells us that the total momentum of this system must remain
95
SUPERSTRINGS AND OTHER THINGS
Figure 5.6. After ®ring, the cannonball moves to the right with a velocity
V and the cannon recoils with a smaller velocity v, due to its larger mass.
That is why you feel a kick on ®ring a gun.
zero after the cannonball is ®red. After ®ring, the cannonball
acquires a velocity V and a momentum pb mV and the
cannon recoils with a momentum pc Mv. The total momentum
after ®ring is equal to zero, i.e.
pb pc 0:
If we call the direction in which the cannonball moves positive,
the magnitude of its momentum is mV, whereas the magnitude
of the cannonball's momentum, moving in the opposite (negative) direction would be M ÿv. We can use our expressions for
momentum above to write for the magnitude of the momentum
of each of the two components of this system:
mV M ÿv 0
or
mV Mv:
This relation tells us how the total momentum remains zero after
®ring. The large mass of the cannon moves with a small velocity
in the opposite direction to the motion of the cannonball which,
because of its smaller mass, moves with a large velocity. The
momentum of the cannon and of the cannonball are equal in
magnitude and opposite in direction.
Rockets work on the same basic principle. Burning expanding gases, which are produced at high pressures in a combustion
chamber, escape at large velocities through a constricted nozzle
propelling the rocket in the opposite direction (®gure 5.7). As
was the case with the cannon, the magnitude of the momentum
of the gas exiting through the rear of the rocket must equal the
96
Conservation of Energy and Momentum
(Courtesy NASA).
momentum of the rocket in the opposite direction. In symbols:
Mrocket vrocket mgas Vgas :
This expression is actually an approximation since we have
neglected the loss in mass due to the gases ejected. It would be
a good approximation for a rocket out in space, away from the
gravitational pull of the earth, as it undergoes a small change of
course, for example. In this case, the loss of mass due to the
ejection of gases is very small compared to the mass of the
rocket itself.
Figure 5.7. As the gases are ejected through the nozzle with a momentum mV, the rocket moves in the opposite direction with a momentum
Mv.
97
6
ROTATION AND THE
UNIVERSAL LAW OF
GRAVITATION
Rotational motion
Our lives are spent moving in circles. The Earth spins on its axis
once every day and revolves around the Sun once a year (®gure
6.1(a)). The Solar System revolves, together with billions of
other stars, around the center of the Milky Way Galaxy which
forms part of a small conglomerate of galaxies called the Local
Group; this also rotates in space (®gure 6.1(b)). Up to this point,
we have con®ned our study of motion to motion in one dimension in which the entire object is displaced from one point to
another following a straight path, without changing the orientation in space of the object. This type of motion is called translational motion. In this chapter we will extend our study to
motion in a circle, which will help us understand Newton's law
of universal gravitation.
In studying circular motion we should distinguish between
rotation and revolution. A turntable rotates or spins around its
axis and the Earth revolves around the Sun. The rotating turntable does not travel anywhere, it merely spins in place. On the
other hand, as the Earth revolves around the Sun its spatial location changes while it rotates on its axis. The motion of the turntable is an example of pure rotational motion, whereas that of the
Earth spinning and revolving combines rotational and translational motion.
Consider the motion of a merry-go-round in an amusement
park and assume that it is rotating at a constant rate. In ®gure
6.2, we have depicted the merry-go-round as seen from above
and rotating counter-clockwise. A child riding a horse on the
rotating platform at a distance r from the center would describe
98
Rotation and the Universal Law of Gravitation
(a)
(b)
(c)
Figure 6.1. The Earth (a) spins around its axis and revolves around the
Sun (b). Our solar system resides in one of the spiral arms of the Milky
Way Galaxy. The Galaxy itself is spinning around its center. (c) This is
a photograph of the spiral galaxy NGC-6946, believed to have a similar
structure to that of the Milky Way. (Courtesy NASA.)
99
SUPERSTRINGS AND OTHER THINGS
Figure 6.2. A merry-go-round rotating counter-clockwise as seen from
above.
a circular path. As you have probably experienced, the farther
away you are from the center of the merry-go-round, the faster
you move. In other words, the speed increases in proportion to
your distance from the center. This phenomenon can be dramatically observed in ice skating shows. As the skaters begin forming
a spinning line, the ones near the center of rotation barely move,
whereas the skaters at the two ends of the line have to work hard
to keep up their speeds and maintain the line straight.
As the merry-go-round of our example rotates in relation to
the ground, the line joining the center with the place where the
child is riding sweeps out an angle (the Greek letter theta) in
the time t that it takes the child to travel the length s (®gure
6.2). The rate at which this angle increases with respect to time
as the merry-go-round rotates is the same regardless of where
the child rides. We call this rate of change the angular velocity of
the merry-go-round. It is standard practice to use Greek symbols
to represent angular quantities, and we use a lowercase omega
(!) for angular velocity. If it took 18 seconds for the merry-goround to complete a turn, we could say that its angular velocity
was 208/s. The Earth, on the other hand, rotates around its axis
at the rate of 158/h (3608/24h).
As we have just seen, the speed v at each point on the
platform depends on the distance to the center of rotation,
whereas we can say that the entire merry-go-round rotates with
100
Rotation and the Universal Law of Gravitation
the same angular velocity !. Angular velocity, then, is a more
useful quantity for rotational motion. The units of angular velocity that are commonly used are radians per second. The
conversion from degrees to radians is 3608 2 radians. (In this
book, however, we will use degrees per unit time for angular
velocity.) We can relate the angular velocity of the rotating
platform to the linear speed of a particle on the platform with
the expression v r!, with ! in radians.
The frontiers of physics: CD-ROM drives
Computer CD-ROM drives transfer data at different rates,
depending on the technology used by the manufacturer.
The ®rst generation of CD-ROM drives transferred data at a
constant rate as the head moved from the inner to the outer
tracks of the disc. To achieve this constant transfer rate, the
linear speed at which the tracks moved past the head was
kept constant, regardless of the position of the head. That
meant that the disc spun faster when the head was near the
center and slower when farther away. Thus the angular velocity was not constant; it decreased with increasing radius.
This technology is called constant linear velocity (CLV),
although it is the linear speed, not the linear velocity, that
remained constant.
Newer CD-ROM drives, reaching transfer rates up to 52
times that of the original drives, maintain their angular velocity constant, like hard drives and magneto-optical drives.
This technology, appropriately called constant angular velocity (CAV), implies a variable transfer rate, since the linear
speed at which the tracks move relative to the head increases
with increasing distance from the center. That means that a
52 CD-ROM drive actually has transfer rates ranging from
as low as 22 for the information on the inner tracks to a maximum of 52 for the outer tracks.
A third technology, called partial constant angular velocity (PCAV), is a combination of the two technologies. In this
case, the angular velocity is maintained constant for a section
of the inner tracks but is decreased for sections farther away
from the center.
101
SUPERSTRINGS AND OTHER THINGS
When the rotating platform completes one revolution, the
angle becomes equal to 3608. It is customary to call the time
taken in completing one revolution the period, T, of the motion.
For one revolution the distance traveled is the length of the
circumference or 2r. The speed can thus be written as
v
2r
:
T
Torque and angular momentum
Suppose you need to loosen up a tight bolt and are using a
medium size adjustable wrench without success. You know
from your past experience that you probably will have better
luck if you switch to a larger wrench (®gure 6.3). Although you
might apply the same force with either wrench, the larger one
will have a longer handle, allowing you to apply the force at a
larger distance from the wrench. The same force is more effective
in rotating the stuck bolt if you apply it at a larger distance from
the bolt.
To push open a heavy metal door in a building, the farther
away from the hinges you push the easier it is to open it. If you
push from the wrong end, you might still be able to open it but,
as you know, it will be much more dif®cult.
From these examples we can see that the ability to rotate an
object depends on the applied force and the point of application of
the force. In ®gure 6.3, the wrench rotates around the center of the
bolt, O, and the force is applied at point P. The distance from the
center of rotation O and the point, P, where the force is applied is
Figure 6.3. A larger wrench facilitates loosening up a tight bolt because
of its longer handle.
102
Rotation and the Universal Law of Gravitation
Figure 6.4. A force F applied at different distances from the rotating
center of a disk. Any position other than A (B and C for example) will
result in rotation.
called the lever arm, r. We call torque, (the Greek letter tau), the
quantity that depends on both force and lever arm. Torque is
de®ned as the product of the lever arm and the applied force:
torque lever arm force:
rF:
Thus, to be able to start an object rotating you need to apply a
torque to it. The disk in ®gure 6.4 will start rotating only if the
force is applied to a position other than A, such as B or C. At
position A the force will not produce any rotation; at this point
the torque is zero because the lever arm is zero.
In chapter 5 we learned that if the net force on an object is
zero, the momentum of the object is constant. For a rotating
object, however, torque is what causes the object to change its
state of rotation. In other words, if the object is not rotating, a
torque is required to set it into rotation; if it is rotating at a
constant rate, a torque is needed to change its rate of rotation to
a faster or slower rate; and, ®nally, a torque is needed to stop a
rotating object. In this sense, torque is the rotational analogue
of force, since in linear motion force is what causes an object to
change its state of motion. If the net torque acting on an object
is zero, we say that the angular momentum of the object is constant.
Angular momentum is a measure of the rotation of an object; it is
103
SUPERSTRINGS AND OTHER THINGS
the rotational analogue of linear momentum, which, as we
remember, is a measure of the linear motion of an object.
Since angular momentum is the rotational analogue of
momentum, we can express it in terms of analogous quantities.
We expressed momentum as the product of mass times velocity
or p mv. Mass is the measure of the inertia of a body, the resistance to a change in the state of motion of the body. Angular
momentum can be expressed as the product of the rotational
inertia of the body and the angular velocity. Rotational inertia
or, more commonly, moment of inertia, I, measures the resistance
of a body to a change in its state of rotation; that is, the dif®culty
in starting or stopping a rotating object or changing its present
rate of rotation. The moment of inertia of a body will in general
change if its axis of rotation is changed. It is easier to spin a
baton around its long axis than around an axis perpendicular to
its center, because in the latter situation, most of the mass of the
baton is not as close to the axis of rotation. The moment of inertia
of a body changes if the mass distribution with respect of the axis
104
Rotation and the Universal Law of Gravitation
Figure 6.5. When the spinning skater draws in her arms, the moment of
inertia of her body decreases and the angular velocity increases, keeping
the angular momentum constant.
of rotation changes, as when an ice skater spinning with her arms
extended, draws them closer to her body (®gure 6.5). With her
arms extended, the moment of inertia of the skater is larger
because more of her mass is located farther away from the axis
of rotation.
Using the letter L for angular momentum we can express it as:
L I!:
We can think of angular momentum as the tendency of a rotating
object to keep rotating because of its inertia.
The angular momentum of an object, as we have mentioned,
is conserved or remains constant if the net external torque on the
object is zero. This is a very important statement. It is called the
105
SUPERSTRINGS AND OTHER THINGS
Physics in our world: Twisting cats
One of the most spectacular maneuvers that a domestic animal is
capable of is done by a cat. As
many children know, when the
cat is dropped upside down
from a height of about 1 m, it is
able to twist itself into the right
position and land on its paws,
unharmed. Not only is this feat
dazzling, it also seems to violate
the law of conservation of
angular momentum. A more
careful look at what the cat does
in about one-third of a second
helps us realize it is actually
the conservation of angular
momentum that allows the cat
to turn.
Since the cat is dropped with
no rotation to its body, the law of
conservation of angular momentum tells us that, unless an external torque were to act on the cat,
it should fall without any rotation. Only two forces act on the
cat as it falls; gravity and friction
with the air. The gravitational
force acts on the entire body at
once and provides no torque. As
Newton discovered, the gravitational force of attraction of the
Earth upon any body can be
considered as if it were acting
on the center of mass of the
body, and thus provides no
torque. Although the frictional
force due to the air does provide
106
(# Gerard Lacz/Natural
History Photographic
Agency.)
Rotation and the Universal Law of Gravitation
a torque, it is too small to cause the twist. How can the cat
spin once released, then?
A careful look at the strobe photographs provides the
clue. The second photograph shows the cat with its front
paws brought in, closer to its body and its rear paws
extended. This position has the effect of reducing the
moment of inertia of the front of its body and increasing
the moment of inertia of the lower part of the body. Because
the body is bent in the middle, the front and rear parts of the
body rotate about different axes. This second photograph
also shows that the cat has twisted its upper body towards
us, a 908 rotation, while twisting its lower body away from
us. Because the lower body has a greater moment of inertia,
a smaller rotation (in this case of about 108) is equivalent to
the larger rotation of the upper body. Since the two rotations
are in opposite directions, the angular momentum of the front
and rear cancel giving a zero net angular momentum.
In the third photograph, the situation is reversed. The
front legs are extended, increasing the moment of inertia of
the front, while the hind legs are pulled in, decreasing the
moment of inertia of the rear. The rear end is now twisted
by a large angle, which causes a small twist in the opposite
direction in the front. The last three photographs show the
cat, with zero angular momentum, on its way to a safe landing.
law of conservation of angular momentum, and joins the other conservation laws that we have learned so far; the law of conservation of
energy and the law of conservation of momentum. We can see
the results of angular momentum conservation in an ice skater
spinning on her toes (®gure 6.5). If she starts spinning with her
arms extended and then draws them closer to her torso, the
moment of inertia of her body decreases. Because friction is
very small in this situation, the external torque on the skater is
zero and angular momentum is conserved. Since angular
momentum is the product of the moment of inertia and the angular velocity, a decrease in the moment of inertia of the skater must
be compensated for by an increase in her angular velocity, which
we observe in her faster spinning rate.
107
SUPERSTRINGS AND OTHER THINGS
Centripetal acceleration
Isaac Newton was one of the ®rst scientists to recognize the
importance of circular motion. Newton was able to show that
an object moving in circles needs an unbalanced force to maintain
this circular motion. This unbalanced force produces an acceleration on the object which we call centripetal acceleration.
If we twirl a ball on a string, the ball will keep moving in a
circle for as long as we keep the tension on the string. If we let
the string loose or the string snaps, the ball will go off in a straight
line and will continue to move in the direction the velocity vector
had at the moment the string was cut (neglecting gravity). The
string then exerts a force on the ball which acts towards the
center of the circle and prevents the ball from moving along a
straight line. This force is called centripetal force, literally ``the
force that seeks the center,'' from the Latin centripetus.
The centripetal force causes the ball to accelerate towards the
center of the circle. If we increase the speed of the twirling ball,
we feel a greater pull on our hand. On the other hand, if we
keep the ball moving at the same speed but increase the length
of the string, the pull on our hand decreases. We conclude that
the centripetal acceleration increases with increasing speed (it
actually increases with the square of the speed) and decreases
with increasing radius. Therefore, an object moving with a constant speed v in a circular path of radius r (®gure 6.6) has an
acceleration directed toward the center of the circle called centripetal acceleration, ac , which is directly proportional to the square
of the speed and inversely proportional to the radius:
v2
:
r
The centripetal force causing the ball to accelerate towards
the center of the circle is given by Newton's second law. If the
mass of the ball is m, the centripetal force is Fc mac , or
ac
mv2
:
r
This equation gives the force required to maintain an object
moving about a ®xed center with a constant speed. For example,
when a car takes a curve, the friction between the pavement and
the tires provides this force. If the pavement is wet or is covered
Fc
108
Rotation and the Universal Law of Gravitation
Figure 6.6. The centripetal acceleration of an object moving in a circular
path of radius r with a constant speed v is ac v2 =r.
with snow or ice, the friction force might not be large enough and
the car is unable to negotiate the curve. On the other hand, even
on dry pavement, if the speed of the car is too large, the centripetal force required to keep the car moving in a circular path,
being proportional to the square of the speed, could become too
large and the car might skid. In older race tracks, where race
cars take curves at high speeds, the tracks are sometimes
banked steeply so that the horizontal component of the force
that the track exerts on the car (FR ), perpendicular to the
banked track, is equal to the centripetal force (®gure 6.7).
Figure 6.7. In a banked race track, the horizontal component of the reaction force FR between track and car, which is perpendicular to the track,
is equal to the centripetal force Fc .
109
SUPERSTRINGS AND OTHER THINGS
Satellites
On July 26, 1963, the United States launched Syncom 2, the ®rst
communications satellite in geosynchronous orbit (i.e., it stays
above the same spot on Earth) at an average height of
35 725 km. Syncom 2 completes an orbit in the same time that
the Earth completes a revolution, and because it revolves
around the Earth at the same rate as the Earth rotates, the satellite
moves with uniform circular motion. With this information
we can calculate the orbital speed of this satellite by dividing
the distance traveled by the time taken. The distance traveled in
the 24-hour time interval or period is the length of the circumference, 2r, with r being the radius of the orbit, which is the
distance between the center of the Earth and the satellite;
that is, r radius of the Earth height of satellite
6367 km 35 725 km 42 092 km, since the radius of the Earth
is known to be equal to 6367 km. We can calculate the linear
speed of Syncom 2 as follows:
v
2r 2 3:14 42 092 103
m=s
T
24 60 60
which gives us a linear speed of 3061 m/s or 11 020 km/h.
As we have learned, a centripetal force must exist that causes
the satellite to orbit around the Earth in an almost circular orbit.
The centripetal acceleration produced by this force is ac v2 =r
3061 m=s2 = 42 092 103 m 0:22 m=s2 . What is the source
of this centripetal force? It is again due to the genius of Isaac
Newton that we are able to understand that the force that keeps
a satellite orbiting around the Earth is also the force that keeps
the Moon around the Earth, the Earth around the Sun and all the
planets of the Solar System moving around the Sun; the force
that we call gravity. The discovery of the universality of this
force, one of the four forces known to us today, ranks as one of
the major achievements of the human mind. We shall occupy ourselves with the discovery of this force in the following sections.
Origins of our view of the universe
To place Newton's discovery of the law of gravitation into historical perspective, we must brie¯y go back to the beginning, namely
110
Rotation and the Universal Law of Gravitation
with the ancient Babylonians; in particular, the people of the
Second Babylonian Empire, who lived in Mesopotamia, the
region occupied today by Iraq. By 3000 BC the Babylonians had
begun a systematic study of the sky out of their need to know
the best harvesting times, since theirs was an agrarian culture.
Modern astronomy can trace its roots back to their early discoveries. The Babylonians made deities out of the Sun, Moon, and
the ®ve visible planets, Mercury, Venus, Mars, Jupiter, and
Saturn; and this drove them to follow closely their motions
across the sky. Their knowledge of the paths of the Sun and
Moon was used in setting up a calendar. They also observed
that the planets, unlike the Sun and Moon, did not follow
simple paths across the sky, but would stop their eastward
motion now and again, retracing part of their path, then stopping
once more, before resuming their eastward motions.
Some of the knowledge acquired by the Babylonians reached
the Greeks. From 600 BC to 400 AD, ancient science reached its
highest peak with the Greek culture. The Greeks, unlike their predecessors, attempted to ®nd rational explanations to the observed
phenomena rather than accepting them as works of the gods.
Thales of Miletus, born in 640 BC, one of the ®rst known Greek
thinkers, is unanimously acclaimed as the founder of Greek
philosophy. Thales attempted to interpret the changing world
in terms of physical processes.
Pythagoras, born in Samos in 560 BC, introduced the idea that
the Earth was spherical and held that the Earth was at the center
of the Universe. This view was cast into a more complete theory
by Aristotle, who thought that the Earth was immovable and
®xed at the center of the whirling heavens. Although this geocentric or Earth-centered model, as it came to be known, was
later expanded into an extremely complicated system by Ptolemy
and accepted by nearly everybody for 18 centuries, certain
thinkers, notably Aristarchus of Samos, held opposing views.
Aristarchus, who lived between 310 and 230 BC, held that the
Sun was ®xed at the center of the universe, and that the Earth
revolved around the Sun in a circular orbit. He also held that
the Earth rotated on its axis as it revolved and that this axis
was inclined with respect to the plane of the orbit.
There was one major obstacle that prevented the acceptance
of Aristarchus' model: the Earth seemed motionless. How could it
111
SUPERSTRINGS AND OTHER THINGS
Figure 6.8. Apparent shift in the position of a star as the Earth rotates
around the Sun.
rotate around the Sun if no motion was detected? Moreover, if the
Earth rotated around the Sun, there should have been an apparent shift in the position of the stars as the Earth moves (®gure
6.8). This apparent shift in the position of the stars or parallax
was never observed and the Sun-centered or heliocentric model
of Aristarchus was abandoned. We know today that, because
the stars are so remote, none has a parallax that can be observed
with the unaided eye. The ®rst observation of the parallax of a
star was made in 1838 by the German astronomer Friedrich
Bessel, with the aid of a telescope.
The geocentric model developed by Claudius Ptolemy in the
second century AD placed the Earth in the middle of the rotating
heavens. To explain the seemingly complicated motion of the
planets in the sky that results, as we know today, from the combination of their own motions and Earth's motion around the Sun
(®gure 6.9(a)), Ptolemy had the planets moving in small circles
call epicycles which in turn would rotate around the Earth, following circular orbits called deferents. The combination of the motion
of both these circles produced a cycloid pattern of the planet's
motions. However, the center of the great circle, the deferent,
112
Rotation and the Universal Law of Gravitation
Figure 6.9. (a) Actual positions of the Earth and another planet as both
rotate around the Sun. On the right of the diagram we see the apparent
path of the planet in the sky as seen from the moving Earth. (b) Ptolemy's
system of the world.
was not at the Earth's center. Instead, it was situated a short distance from it, at a different point for each planet. Moreover, the
epicycle moved with a uniform rate not around the center of
the large circle or deferent but around another point in space
called the equant (®gure 6.9(b)). As more and more data on the
planetary orbits were obtained, it became necessary to add
more epicycles, giving epicycles moving on epicycles, with the
planet moving on the last and smallest one. Eventually Ptolemy
had a system of 40 epicyles that reproduced with fairly good
accuracy the observations of his day. His system of the world
113
SUPERSTRINGS AND OTHER THINGS
was published in The Mathematical Collection, a treatise of 13
volumes that was passed on to the Arabs after the destruction
of the famous library of Alexandria, and became known as
al Magiste (The Greatest). The book was introduced in Europe
by the Arabs where, known as the Almagest, it was famous for
over a thousand years.
Kepler's laws of planetary motion
From Aristotle and Ptolemy until Copernicus, thirteen centuries
later, no advance was made in man's understanding of motion
in the heavens. Mikolaj Kopernigk (or Nicolaus Copernicus, his
Latin name), born on February 19, 1473 in the little town of
Torun in Poland, was the revolutionary who brought change to
ideas that were ®rmly rooted in people's minds for more than a
thousand years. He attended ®rst the University of Cracow and
later the University of Bologna in Italy where he became interested in astronomy.
The intellectual atmosphere that existed then in Italy facilitated the criticism of established ideas. Not only was the Ptolemaic
system of the world cumbersome and inelegant, but recent astronomical data no longer supported the model regardless of the
number of epicycles used. Copernicus especially disliked the
idea of the equant, which had been introduced by Ptolemy in an
effort to keep uniform circular motion in the description of the heavens. Copernicus realized that the motion was uniform about the
equant and circular about the center of the deferent, two completely different points. It occurred to Copernicus that everything
could be much simpler if the Sun were at the center of the universe
rather than the Earth. In his model the Earth and the ®ve known
planets, Mercury, Venus, Mars, Jupiter and Saturn, would move
around the Sun with uniform circular motion.
The heliocentric model of Copernicus was considered heretical because it removed the Earth from the center of Creation and
for this reason Copernicus delayed publication of his theory until
very late in his life. At the urging of some of his friends, Copernicus ®nally authorized the publication of his theory in a book. De
Revolutionibus (On Revolutions) was published in 1543, a few
weeks before Copernicus died at age 70.
114
Rotation and the Universal Law of Gravitation
The revolution that Copernicus started was not completely
accepted until the early 1600s when Johannes Kepler, who had
joined the superb observatory of the Danish astronomer Tycho
Brahe as an assistant at the age of 30, made use of very precise
data on the planet Mars to prove that the orbit of this planet
was not a circle but an ellipse. Circular orbits, which were the
basis for all the epicycles and deferents in the pre-Copernican
theories, were not after all the actual paths of the planets.
Kepler was born at Weil der Stadt, in the wine country of
WuÈrttemberg in southern Germany, on December 25, 1571, and
attended the University of TuÈbingen where he graduated in
1588, having been converted to the Copernican theory by one of
his professors. When he joined Tycho's observatory, Kepler had
already published a book, Mysterium Cosmographicum (Cosmic
Mystery), where he proposed, unsuccessfully, that there were
six planets because there are only ®ve perfect solids. It was in
search of better data to perfect his theory that he went to work
in Tycho's observatory.
It took Kepler six years and thousands of pages of calculations to wrest the secret of the orbit of Mars. During his ``battle
with Mars,'' as he called his work, Kepler realized that the velocity of the planet should be related to its distance from the
Sun because it is the Sun that provides the driving force. Because
the orbit was not circular but elliptical, the distance from the Sun
varied. An ellipse, we might recall, is a very speci®c curve that
can be obtained by placing two tacks some distance apart and
loosely threading a loop of string through them. By moving a
pencil in such a way as to keep the string taut we can draw an
ellipse (®gure 6.10). The position of each tack is called a focus.
Since the string is not elastic, the sum of the distances from any
point on the ellipse to the two foci is always the same. Kepler
placed the Sun at a focus of the ellipse. His calculations showed
him that a planet moved faster when it was nearer the Sun and
more slowly when it was farther away from the Sun.
There are only ®ve regular solids: the triangular pyramid (tetrahedron), whose
faces are four equilateral triangles; the cube, which has six squares as faces; the
octahedron, with eight equilateral triangles; the dodecahedron, with 12 regular
pentagons; and the icosahedron, with twenty equilateral triangles as faces. Any
regular solid may be inscribed in (have its vertices on) or circumscribed about
(have its faces tangent to) a sphere.
115
SUPERSTRINGS AND OTHER THINGS
Figure 6.10 An ellipse.
Kepler set out to calculate just how the speed of the planet
changes as it orbits the Sun. After many tedious and long calculations he found that the areas swept out in equal times by the line
joining the Sun and the planet were equal. In uniform circular
motion, equal angles are covered in equal times, so that it takes
twice as long to go around one half of the circumference as it
does to go any one quarter. For elliptical orbits, when the
planet is close to the Sun the area swept out by the line from
the Sun to the planet as it moves from A to B (®gure 6.11) is smaller
than the corresponding area formed when the planet moves the
same length, from C to D, at a farther distance from the Sun.
Figure 6.11. Kepler's Second Law, Law of Areas. The planet moves
from A to B in the same time interval t that it takes for it to move
from C to E. The area swept by the line between the Sun and the
planet in moving from A to B is the same as the area swept out in
moving from C to E.
116
Rotation and the Universal Law of Gravitation
Kepler found that since the planet moves at a slower speed when
it is farther from the Sun, the length of the orbit covered in a given
time was smaller when the planet was farther from the Sun. If it
took a certain time t to move from A to B, the planet would
move from C to E in the same time t. However, the area
swept out in moving from A to B was the same as the area
swept out in moving from C to E. Kepler published his two discoveries in a second book that he entitled Astronomia Nova (New
Astronomy) in 1609. These two discoveries, known today as
Kepler's First and Second Laws, lie at the foundation of modern
astronomy.
Kepler did not rest on his discoveries of these laws. Convinced that God the Creator was also a musician, he set out to
discover a relationship between the distance of a planet to the
Sun and its velocity that was similar to the musical ratios. After
many efforts, he was forced to admit that ``God the Creator did
not wish to introduce harmonic proportions into the durations
of the planetary years.'' His effort was not in vain, however, for
Kepler was able to ®nd a mathematical relationship between
the planets' distances from the Sun and their periods of
revolution. He found that ``the periodic times of any two
planets are in the sesquilateral ratio to their mean distances,''
that is, the squares of the periods of any two planets (the time
taken for a complete revolution around the Sun) are proportional
to the cubes of their mean distances from the Sun. The mean
distance is also the semi-major axis of the ellipse. If T is the
period and r the mean distance or semi-major axis, Kepler's
Third Law is
T12 r13
:
T22 r23
The indices 1 and 2 indicate the period and average distance for
each one of the two planets. This remarkable discovery, known
today as Kepler's Third Law or the harmonic law, was contained
in his monumental work Harmonici Mundi (Harmonies of the
World), published in 1619.
We can state Kepler's three laws as follows:
1 Law of orbits: Each planet moves around the Sun in an elliptical
orbit, with the Sun at one focus.
117
SUPERSTRINGS AND OTHER THINGS
2 Law of areas: A planet moves around the Sun at a rate such that
the line from the Sun to the planet sweeps out equal areas in
equal intervals of time.
3 Harmonic law or law of periods: The squares of the periods of any
two planets are proportional to the cubes of their average distances from the Sun.
Newton's law of universal gravitation
Kepler had deduced that a force or anima motrix was needed to
keep the planets orbiting around the Sun and that this force
came from the Sun. Galileo, who was a contemporary of Kepler,
was trying to understand motion and had arrived at the conclusion
that an object moving at a constant speed along a straight line will
maintain that state of motion when there is no net force acting on
the object. This idea was further developed by Newton into what
became his ®rst law of motion, the law of inertia, as we saw in
chapter 3. In his second law, Newton af®rms that the only way
to change the motion of an object is to apply a net force. As we
learned earlier, to keep an object moving in circles we need to
apply an unbalanced force to the object. This force, which we
called the centripetal force, acts toward the center of the circle.
It was clear to Newton that to keep a planet orbiting around
the Sun, a force, directed exactly toward the Sun, was required.
What was the origin of this force? Newton hypothesized that
there is a universal force of attraction between all bodies everywhere in the universe. By analyzing Kepler's third law, Newton
was able to show that this force was inversely proportional to
the square of the distance between the bodies. This was one of
the problems that Newton considered while he was at his
mother's farm in Woolsthorpe during the great plague years. It
is not known whether or not Newton solved the entire problem
then. Newton's biographer and contemporary, William Stuckey,
who drew upon conversations with Newton and interviews
with Newton's friends, writes that on April 15, 1726, ``after
dinner, the weather being warm, we went into the garden and
drank thea, under the shade of some appletrees, only he and
myself. Amidst other discourse, he told me, he was just in the
same situation, as when formerly, the notion of gravitation
118
Rotation and the Universal Law of Gravitation
came into his mind. It was occasion'd by the fall of an apple, as he
sat in a contemplative mood.''
What is clear is that Newton did not publish anything on
what ranks as one of the greatest discoveries in the history of
science for almost twenty years. And then only because two
scientists of great renown, Edmond Halley and Robert Hooke,
had also arrived at the mathematical form for the force of gravity
but had not been able to prove it. By January 1684, Halley had
concluded that the force keeping the planets on their orbits
``decreased in the proportion of the squares of the distances
reciprocally.'' When Halley told Hooke at the Royal Society of
his conclusion, Hooke boasted that he had arrived at exactly the
same conclusion and had a mathematical proof of it but did not ±
and most probably could not ± produce it. Halley then decided to
consult with his dear friend Newton. Abraham DeMoivre, a
member of the Royal Society and friend of Newton and Halley,
wrote of that visit:
After they had been some time together, the Dr. [Halley] asked him
what he thought the curve would be that would be described by the
planets supposing the force of attraction towards the Sun to be
reciprocal to the square of their distance from it. Sir Isaac replied
immediately that it would be an ellipis. The Doctor, struck with joy
and amazement, asked him how he knew it. Why, saith he, I have
calculated it.
What was Newton's method to calculate the force of gravity?
It is not known in detail the technique that Newton used in developing his theory. However, in a book entitled A View of Sir Isaac
Newton's Philosophy published in London in 1728, Henry Pemberton relates this account:
As he sat alone in a garden, he fell into a speculation on the power of
gravity: that as this power is not found sensibly diminished at the
remotest distance from the center of the Earth, to which we can
rise, neither at the tops of the loftiest buildings, nor even on the
summits of the highest mountains; it appeared to him reasonable to
conclude, that this power must extend much farther than was usually
thought; why not as high as the Moon, said he to himself ? and if so,
her motion must be in¯uenced by it; perhaps she is retained in her
orbit thereby. However, though the power of gravity is not sensibly
weakened in the little change of distance, at which we can place our
selves from the center of the Earth; yet it is very possible, that so
119
SUPERSTRINGS AND OTHER THINGS
high as the Moon this power may differ much in strength from what it
is here. To make an estimate, what might be the degree of this diminution, he considered with himself, that if the Moon be retained in her
orbit by the force of gravity, no doubt the primary planets are carried
round the Sun by the like power. And by comparing the periods of the
several planets with their distances from the Sun, he found, that if any
power like gravity held them in their courses, its strength must
decrease in the duplicate proportion of the increase of distance.
Newton himself mentions in a memorandum of about 1714 that he
``began to think of gravity extending to the orb of the Moon, and
having found out how to estimate the force which [a] globe revolving within a sphere presses the surface of the sphere, from
Kepler's rule of the periodical times of the planets . . . I deduced
that the forces which keep the planets in their orbs must [be] reciprocally as the squares of their distances from the centers . . .''
We can use these accounts to reconstruct Newton's technique. Consider a planet of mass m revolving around the Sun
(®gure 6.12). If the planet moves with a speed v and orbits the
Sun in a near circular orbit of radius r, the centripetal force keeping the planet in its orbit is
Fc
mv2
:
r
If the period of the planet is T, then the speed can be written
as the distance traveled in one period, which is the length of
the circumference or 2r, divided by the period, or v 2r=T.
Since we need the square of the speed in our expression for the
Figure 6.12. A planet revolving around the Sun in an orbit of radius r.
120
Rotation and the Universal Law of Gravitation
centripetal force, that is, v2 42 r2 =T 2 , we obtain after replacing
for v2 in our expression for Fc :
Fc
mv2 m 42 r2
:
r
r T2
Multiplying numerator and denominator by r and rearranging,
we write
3
42 m
r
Fc 2
:
T2
r
We notice that the expression in parenthesis, r3 =T 2 , is a relationship between the square of the period and the cube of the distance
between the planet and the Sun. Kepler's third law tells us that
this ratio is constant. If we call this ratio K, the centripetal force is
42 m
K
r2
and if we multiply both numerator and denominator by the mass
of the Sun, it becomes
2
4 K Ms m
Fc
:
Ms
r2
Fc
It is here where the genius of Newton made the incredible leap.
Although this derivation was done for the case of a planet of
mass m orbiting around the Sun, Newton said that this same
expression would apply to the force between the Earth and the
Moon, between the Earth and an apple falling from a tree, and
between any two objects in the universe, by considering the
expression in parenthesis to be a universal constant. This is Newton's law of Universal Gravitation. Any two objects of mass M
and m, separated by a distance r, anywhere in the universe, will
attract each other with a force given by
Mn
FG 2 :
r
The value for the universal constant G was obtained in 1798 by
Henry Cavendish by measuring the gravitational attraction
between two pairs of lead spheres. The currently accepted
value in SI units is
G 6:67 10ÿ11 N m2 =kg2 :
121
SUPERSTRINGS AND OTHER THINGS
Newton's law of universal gravitation means not only that
the Earth pulls on the Moon with a gravitational force that
keeps it in orbit or that the Sun exerts a force of attraction on
the Earth and all the other planets in the Solar System, but that
this force is the same force that is responsible for making an
apple fall from a tree. This universal law, which explains how
the universe works, says that you pull the Sun towards you
with a force that can be calculated. And when we think about
this, we begin to understand the scope of the mind of Newton.
Because, as the distinguished historian of science I. Bernard
Cohen writes, ``[t]here is no mathematics ± whether algebra,
geometry, or the calculus ± to justify this bold step. One can say
of it only that it is one of those triumphs that humble ordinary
men in the presence of genius.''
Newton actually tested his great discovery. He realized that
the Moon is continuously falling toward the Earth. We can understand this ``falling'' of the Moon if we recognize that if we were to
turn off the gravitational force, by Newton's ®rst law, the Moon
would immediately go off in a straight line along the tangent. It
is the gravitational force of attraction that makes it fall from the
straight line into its circular orbit (®gure 6.13). Newton knew
that the distance from the center of the Earth to the center of
Figure 6.13. Acceleration of the Moon toward the Earth. By Newton's
®rst law, if the gravitational force of attraction between the Earth and
the Moon were to be eliminated, the Moon would go off along a straight
line with a speed v. The gravitational force pulls the Moon toward the
Earth, making it ``fall'' a distance h into the circular orbit.
122
Rotation and the Universal Law of Gravitation
the Moon was about 60 times the radius of the Earth. Since his
gravitational law says that the force varies with the distance as
1=r2 , the acceleration due to the gravitational attraction should
be 1/602 the acceleration due to gravity on the surface of the
Earth which in Newton's days was already known to be
9.80 m/s2 . According to this, the Moon accelerates toward the
2
Earth at the rate of 1=3600 9:80 m=s2 2:72 mm=s .
To check his result, Newton calculated the centripetal acceleration of the Moon knowing that its period was 27.32 days. As
we have seen, the centripetal acceleration can be written as
ac
v2 42 r
2 :
r
T
Newton obtained for this acceleration the value of 2.74 mm/s2 , in
excellent agreement with the value predicted by his theory.
Some historians believe that Newton actually carried out a
calculation like this during his annus mirabilis (year of wonders),
as a young man of 23, although, due to incorrect data on the distance to the Moon, his ®rst attempt was off by a large amount.
Only when, six years later, new measurements of the size of the
Earth yielded a more accurate value for the distance to the
Moon, could Newton make the calculation again with the agreement that we have just seen.
Newton's universal law of gravitation allows us to de®ne the
weight of an object more precisely than before as the net gravitational force exerted on the object by all bodies in the universe. Near
the surface of the Earth, the gravitational force of attraction of
the Earth is much larger than that of all other bodies in the universe and we can neglect their effect on the object; the weight
of an object near the surface of the Earth is then equal to the
gravitational force of attraction of the Earth.
The weight of an object of mass m near the surface of the
Earth is then
w Fg G
ME m
:
rE2
As we learned in chapter 3, Galileo discovered that the weight of
an object is proportional to its mass:
w mg
123
SUPERSTRINGS AND OTHER THINGS
The frontiers of physics: Measuring the distance
to the Moon
To check his universal law, Newton estimated that the distance to the Moon was about 60 times the radius of the
Earth. The method to calculate this distance was actually
developed by the ancient Greeks. Around 280 BC, the Greek
astronomer Aristarchus developed a procedure to measure
the relative distances to the Sun and the Moon. He realized
that during the Moon's ®rst and last quarter phases, the
Sun, the Moon, and the Earth formed a right triangle. He
then carefully measured the angle between the Moon and
the Sun, obtaining a value of 878. With this information he
estimated that the Sun was about 20 times farther away
than the Moon.
Aristarchus also obtained estimates for the relative sizes
of the Earth, the Moon, and the Sun. He measured the time it
takes the Moon to move through the Earth's shadow during a
lunar eclipse and from that value he calculated that the
Earth's diameter was three times greater than the Moon's
diameter. To calculate the Sun's diameter, he used the fact
that the apparent sizes of the Moon and the Sun, as seen
from the Earth, are nearly the same. Since he had calculated
that the Sun was 20 times farther away than the Moon, he
reasoned that the Sun's actual diameter was 20 times that
of the Moon.
All that was needed to determine the actual distances
and diameters was measuring the radius of the Earth. Some
eighty years later, the astronomer Eratosthenes devised a
clever method to do just that. Aristarchus' measurements of
the Sun's size and of its distance were off due to the dif®culty
in determining the exact moment when the Moon was at ®rst
124
Rotation and the Universal Law of Gravitation
or last quarter phase which introduced a large uncertainty in
his measurement of the angles. However, his calculations of
the size of the Moon and its distance to the Earth were incredibly close to modern measurements.
Today, we can measure the distance to the Moon with a
precision of 1 centimeter using lasers. From 1969 until 1971,
the astronauts of the Apollo missions placed laser re¯ectors
on the surface of the Moon. Lunar laser ranging stations on
Earth send laser pulses to these re¯ectors and detect the
re¯ected light 2.6 seconds later. Since the speed of light is
known with a high degree of precision, an extremely accurate
determination of the distance to the Moon and its variation as
the Moon orbits the Earth is possible.
where g is the acceleration due to gravity. Therefore, from the last
two expressions, we obtain
gG
ME
rE2
where we can see that the acceleration due to gravity is the same
for all bodies at the surface of the Earth.
Spacecraft and orbital motion
Newton, as we have said before, realized that the Moon is falling
toward the Earth. Any satellite in orbit around the Earth is a free
falling object. We can illustrate this in another way by using a
diagram that appeared in Newton's Principia (®gure 6.14). If we
throw a stone horizontally from the tall mountain, it would fall
to the ground following a path like the one that ends at point
D. If we throw the stone harder (perhaps with the aid of some
mechanical device), it would still fall to the ground hitting at
point E, farther away, due to the fact that the stone was thrown
with a higher speed and also because the Earth curves out from
under it. At successively higher speeds the stone would hit the
ground at points F and G, until ®nally, if thrown fast enough,
its curvature would match that of the Earth, and the stone
125
SUPERSTRINGS AND OTHER THINGS
Figure 6.14. A projectile shot from a high mountain follows different
curved paths. ``[T]he greater the velocity . . . with which it is projected,
the farther it goes before it falls to the Earth. We may therefor suppose
the velocity to be so increased . . . til at last, exceeding the limits of the
Earth, it should pass into space without touching it.'' (Newton, System
of the World.)
would fall to the ground without ever reaching it. We would have
placed the stone in orbit around the Earth.
A satellite in orbit around the Earth is, then, in free fall; it is
continuously being accelerated toward the center of the Earth.
What is the value of this acceleration? We know that the force
acting on the satellite is the gravitational force. Therefore, the
satellite falls with the acceleration due to gravity. However,
this acceleration due to gravity does not have the value of
9.80 m/s2 , but the value of the acceleration due to gravity at
that particular distance from the center of the Earth. Remember
that 9.80 m/s2 is the acceleration due to gravity at the surface of
the Earth, which is at a distance from the center of the Earth
equal to the Earth's radius.
126
Rotation and the Universal Law of Gravitation
Figure 6.15. The gravitational force acts on astronaut Wendy Lawrence
(at work on the ¯ight deck of the Space Shuttle Discovery) in such a way
that she is kept moving in circles around the Earth. (Courtesy NASA.)
When the space shuttle is in orbit around the Earth, the
shuttle, the astronauts and all the objects inside it appear weightless, since the shuttle in orbit is in free fall. As we all have seen
many times on the television images broadcast from space,
when an astronaut lets go of a pen, it continues going around
the Earth at the same speed as the rest of the shuttle and appears
to be ¯oating inside the spacecraft. Since the gravitational force is
proportional to the mass, the magnitude of the gravitational force
keeping the astronaut in orbit is larger than the gravitational force
keeping the pen, with a smaller mass, in orbit. The gravitational
force acts on the astronaut and the pen in the exact right proportion to keep them moving in circles together.
In chapter 3 we said that mass was the measure of the inertia
of a body. By Newton's second law, a net force acting on an object
is proportional to the acceleration of the body and to the body's
resistance to acceleration or its inertial mass. In this chapter, however, mass has been used as a quantity that is proportional to the
gravitational force. We might call this mass gravitational mass. In
1922, an experiment performed in Budapest by Roland von
EoÈtvoÈs showed that the gravitational force acts in exactly the
right proportions on objects of different masses. In other words,
within the accuracy of his experiment, the gravitational and
inertial masses were the same. A recent reanalysis of the data
taken by EoÈtvoÈs seemed to indicate a slight discrepancy implying
that gravity does not act in the exact required proportions. If this
were to be the case, an object with a larger gravitational mass
would fall to the ground with a greater acceleration. Some
127
SUPERSTRINGS AND OTHER THINGS
The frontiers of physics: The Global Positioning
Satellite System
In orbit at an altitude of about 20 000 km, an armada of 21
satellites broadcasting a continuous signal would potentially
allow people to determine their location with millimeter precision. Although the satellites that form the US Department of
Defense's Global Positioning Satellite System (GPS) were
placed in orbit in the mid 1970s for military purposes, provisions were also made for civilian use.
The satellites broadcast high-frequency radio waves continuously. The signal propagates in all directions with the
same speed (the speed of light). Since the speed of light is
known to a high degree of accuracy, one would only need
to know how long it took the signal to arrive at one's location
to determine the distance to a particular satellite. The
designers of the GPS system borrowed a technique used by
astronomers to determine distances to other planets. This
technique involves sending coded signals at predetermined
known times. Since the code is known, comparing the
signal from the satellite with the known coded signal shifted
in time to various intervals allows the receiver to determine
how long the signal has been traveling.
Knowing the distance to one satellite is not enough to
determine one's location in space. If you were to determine
that one of the satellites is 25 000 km away, you could be anywhere on a large sphere 25 000 km in radius centered on the
satellite. Determining the distance to a second satellite
would narrow your possible position to a circle (the intersection between the two spheres around each satellite). A third
satellite will reduce the possible positions to 2 points (the
intersection of this third sphere with the previous circle)
and a fourth satellite will narrow this down to one unique
point.
Since the GPS was intended for military purposes, the
signals were originally encrypted to prevent potential adversaries from using the system to determine the position of their
troops. The designers wanted to allow for limited civilian use
and released slightly incorrect information about the timing
128
Rotation and the Universal Law of Gravitation
of the signals. The designers estimated that this error would
allow other parties to determine their position at best within
100 m. Soon, however, scientists were able to devise methods
to circumvent the clock errors and managed to increase the
accuracy to within a few millimeters. The intentional degrading of the signal has been discontinued, increasing the resolution of satellite navigation devices to about 20 m. Millimeter
accuracy is available with the corrections still being provided
by the US Coast Guard and by commercial providers.
scientists proposed the existence of a new force, the ®fth force, that
would explain the new ®ndings. This ®fth force is related to the
total number of protons and neutrons in the nuclei of different
substances. Recent experiments at Los Alamos National Laboratory and at the Joint Institute for Laboratory Astrophysics in
Colorado have cast some doubts about the existence of such a
®fth force. More sophisticated experiments are currently underway in an attempt to clarify the situation. We should point out,
however, that these discrepancies are extremely small.
129
7
ATOMS: BUILDING
BLOCKS OF THE
UNIVERSE
The underlying structure of matter
The question of the nature of matter is perhaps as old as humanity. Twenty-®ve centuries ago, Thales of Miletus ± the idealized
Greek thinker credited by Aristotle as the founder of European
philosophy ± is believed to have posed that question. Clearly,
discovering the hidden structure of nature constitutes one of
the most basic reasons for our exploration of the world. Up to
this point, we have occupied ourselves with the behavior of
matter. We now turn our attention to the underlying structure
of matter.
The Atomic Hypothesis
If some global catastrophe were to destroy civilization and you
were one of the few survivors, what would be the best single
statement that you could communicate to the next generations?
The late Nobel Prizewinning physicist Richard P Feynman had
a good candidate for this statement: the atomic hypothesis. Feynman states this hypothesis as follows: ``All things are made up of
atoms ± little particles that move around in perpetual motion, attracting
each other when they are a little distance apart, but repelling upon being
squeezed into one another.''
Why is this statement so important? Because it is the basis for
modern physics, the physics that has made possible the computer
chip, television, modern communications and electronics.
Modern physics began with the con®rmation of the atomic
hypothesis at the end of the nineteenth century.
133
SUPERSTRINGS AND OTHER THINGS
Early concept of the atom
The Greek philosopher Leucippus (believed to have been born in
Miletus about 490 BC) is sometimes credited as the originator of
the atomic principle. However, it is his student Democritus, a
Greek natural philosopher, who was responsible for its introduction into Greek thought.
Democritus said that matter was made up of small, indivisible particles, which he called ``atoms'', from the Greek word
for ``indivisible.'' As the scienti®c method (as we understand it
today) did not exist in the time of Democritus, it did not occur
to anyone to check ideas against experiment. People simply
chose among these different ideas on the basis of taste and personal preference. And the views of Democritus were not widely
accepted by most philosophers of his time.
The atomic concept lay dormant until the nineteenth century,
when it was revived by the work of the English scientist John
Dalton. Dalton proposed an atomic theory in which he stated
that matter is composed of atoms, which he considered to be
indivisible particles; that the atoms of a given element are identical; and that atoms are arranged, in chemical reactions, to form
molecules.
By this time physicists had some understanding of electricity
and had identi®ed what the American physicist and statesman
Benjamin Franklin called positive charge and negative charge.
Experiments had shown that two positive charges repel each
other, two negative charges also repel each other, but that a positive charge and a negative charge attract each other (®gure 7.1).
By studying how electrical currents decompose water and other
chemical compounds, the English physicist Michael Faraday
Figure 7.1. Like charges (all positive or all negative) repel each other,
whereas unlike charges (a positive and a negative) attract each other.
134
Atoms: Building Blocks of the Universe
Figure 7.2. When a battery is connected across the two electrodes inside
an evacuated glass tube or cathode ray tube, a green glow appears in the
glass.
realized that electricity exists in discrete units or particles. In 1891
these particles of electricity were named electrons.
These two discoveries would eventually help explain a seemingly unrelated phenomenon that had baf¯ed scientists for
years. During the last decade of the nineteenth century physicists
tried to understand the nature of the strange and fascinating glow
that appeared when an electric current passed through a wire
inside a glass tube from which air had been evacuated. Two
metal discs, or electrodes, sealed into the ends of a long glass
tube, were connected to a high-voltage battery (®gure 7.2).
When the battery was connected, a green glow became visible
in the glass. Sometimes, investigators would place ¯uorescent
materials inside the tube which would also glow when the electrical current was present. Since the rays seemed to emanate from
the electrode connected to the negative end of the battery
which was called the cathode, these rays were known as cathode
rays, and the evacuated glass tube as a cathode ray tube. These
tubes were the forerunners of today's TV picture tubes. The
modern computer monitor is still called a CRT, an abbreviation
for cathode ray tube.
Sir Joseph John Thomson was the director of the Cavendish
Laboratory in Cambridge, England. With a staff of twenty, ``J J''
had designed experiments with cathode ray tubes for more
than a decade, attempting to decipher the nature of the glow.
135
SUPERSTRINGS AND OTHER THINGS
Obviously, something, was passing out of the cathode, through the
vacuum, and colliding with the positive electrode. On Friday,
April 29, 1897, Thomson announced that the glowing beam was
not made up of light waves, as most physicists held. Rather,
cathode rays were particles (corpuscles, as they were called then)
carrying negative electric charge. These corpuscles were ¯ying
off the negatively charged cathode into the positive anode. Thomson was able to show that these corpuscles were electrons and
was also able to determine the ratio of their mass to the electric
charge. From this ratio he concluded that the mass of the electron
was about two thousand times smaller than the mass of the
lightest atom (hydrogen).
Thomson's discovery had tremendous implications. From
Democritus to Dalton, atoms were thought to be the most basic
constituents of matter. Yet Thomson had discovered something
two thousand times smaller than an atom. From his experiments
he was forced to conclude that atoms were not indivisible; they
were made up of electrons. Electrons, he said, were the basic
components of the atom.
First models of the atom
The idea that matter is made up of atoms was beginning to be
accepted by the turn of the twentieth century. After Thomson discovered the electron in 1897 he and other physicists began to
realize that the atom must be a complex structure. Thomson himself put forward a model for this complex atom. It has been called
the ``plum pudding'' model because it represented the atom as a
homogeneous sphere of positive electric charge in which the
negatively charged electrons were embedded (®gure 7.3).
Things moved rapidly during the last decade of the nineteenth century. In 1895 Wilhelm Conrad RoÈntgen in Germany
discovered a highly penetrating and invisible radiation that he
called X-rays. A few weeks after RoÈntgen announced his discovery before the Physical Medical Society in WuÈrzburg, Antoine
Henri Becquerel in Paris discovered radioactivity by experimenting with uranium crystals. Working in Becquerel's laboratory,
Pierre and Marie Curie discovered two other radioactive elements, which they named polonium and radium.
136
Atoms: Building Blocks of the Universe
Figure 7.3. Thomson's model of the atom: a homogeneous sphere of
positive charge with negative electrons embedded in it.
However, it was Ernest Rutherford, a New Zealander who
studied under J J Thomson at England's Cavendish Laboratory,
who showed that the rays given off by radium were of at least
two different kinds. By placing thin sheets of aluminum in the
path of the rays he stopped one kind of radiation, which he
called alpha rays. The other kind, which penetrated thicker
sheets, was named beta rays. A third type of radiation, gamma
rays, was discovered a year later. The Curies and Henri Becquerel
showed in 1900 that beta rays have negative electric charge. In
1907 Rutherford proved that the alpha particles were helium
atoms with the electrons removed. Thus they had to be positively
charged.
In 1908 Rutherford was awarded the Nobel Prize for his
investigations on radioactivity. Most scientists do their most
important work before they are honored with the Nobel Prize.
This was not the case with Rutherford. In that very same year
of 1908, Rutherford and his assistant Hans Geiger were looking
for a way to detect alpha rays and discovered that a screen
coated with zinc sul®de would ¯ash where an alpha particle
struck the screen. Thus, such a screen could serve as a detector
of alpha particles. Rutherford decided to use his newly discovered detectors to investigate the structure of atoms.
This presented a problem, though. Rutherford knew that
the atoms were too small to be seen. How could he investigate
137
SUPERSTRINGS AND OTHER THINGS
Figure 7.4. Although the water surface in a deep well might not be visible, we can ®nd out how deep the well is by listening to the sound of a
stone as it hits the water. Similarly, Rutherford's experiment allowed him
to probe the atom, too small to be seen directly.
something that he could not even see? By ®ring a projectile at an
unseen object and analyzing the projectile's behavior as it
emerges, information about the nature of the interaction with
the unseen object can be obtained, much in the same way as the
sound of a stone hitting the surface of the water in a deep well
can yield information about its depth (®gure 7.4). This is more
or less what Rutherford decided to do. He set up an experiment
to shoot alpha particles at a sheet of gold foil with a zinc sul®de
screen placed at some distance from the gold foil (®gure 7.5).
His collaborator Hans Geiger, who did most of the counting,
observed that most of the alpha particles underwent very small
de¯ections of only one degree or less. This was of course in accordance with Thomson's plum pudding model of the atom. A fast
alpha particle ± with positive electric charge ± would pass
through the electrons almost unde¯ected. The positive charge
in Thomson's model was spread uniformly over the entire
volume of the atom and thus was not concentrated, like that of
the alpha particle. As soon as the alpha particle penetrated the
atom, the positive charge of the atom would perhaps slow
down the particle's motion but, since this charge was not localized at one point, it would not de¯ect the particle's trajectory.
Rutherford, however, decided to ask a young undergraduate
138
Atoms: Building Blocks of the Universe
Figure 7.5. Schematic representation of Rutherford's experimental set
up. The large angle de¯ections of the scattered particles forced Rutherford to conclude that the atom must have a positively charged nucleus.
student named Ernest Marsden to look for alpha particles at
angles up to 458.
Marsden did ®nd a few ¯ashes at 458. Encouraged by this,
he decided to swing the telescope to larger angles, even past
908 and found ¯ashes from the front side of the experiment.
Rutherford was astonished. The de¯ections at such large
angles forced Rutherford to conclude that the atom must have a
positively charged nucleus. Some 20 years later he would
comment, ``It was quite the most incredible event that has ever
happened to me in my life. It was almost as incredible as if you
®red a 15-inch shell at a piece of tissue paper and it came
back and hit you. On consideration, I realized that this scattering
backward must be the result of a single collision, and when I
made the calculations I saw that it was impossible to get anything of that order of magnitude unless you took a system in
which the greater part of the atom was concentrated in a
minute nucleus.''
In 1911 Rutherford published his new model of the atom in
which the positive electric charge is concentrated in a core that
carries almost the entire mass of the atom, and the negatively
charged electrons are uniformly distributed around the nucleus
(®gure 7.6). Rutherford concluded that more than 99.9% of the
mass of the atom was concentrated in the nucleus, which
occupies about one part in 1014 of the volume of the atom.
139
SUPERSTRINGS AND OTHER THINGS
Figure 7.6. Rutherford's nuclear model of the atom. The positively
charged nucleus is surrounded by a uniform spherical distribution of
negative electrons.
The Japanese physicist Hantaro Nagaoka, one of Japan's ®rst
physicists, had proposed a model of the atom with a ring of
electrons in orbit about a small nucleus. His conclusion did not
come from experimental evidence, like Rutherford's. There is
no evidence that Rutherford knew of Nagaoka's model.
Rutherford's nuclear model explained the observational evidence very well. There was a major problem, however. Physicists
knew that if an electrically charged particle orbits around a
center, it must radiate energy. The electron then must lose some
of its energy with this radiation and should spiral down into
the nucleus with the resulting collapse taking less than a microsecond. If this were really the case, atoms would not be stable
and we would not be here to ponder these questions.
In 1913, the 28-year-old Danish physicist Niels Bohr, who
had worked with Rutherford in England, proposed a solution
to this problem, which involved a remarkable property of light
that had been discovered only a few years before. Until then,
light had been considered to be a wave phenomenon. This discovery would result in the awarding of the Nobel prize to two of the
most famous scientists that ever lived: Albert Einstein and Max
Planck. To understand the importance of this discovery, we
need to become more familiar with the language of waves. (A
more detailed treatment of waves is undertaken in chapter 15.)
140
Atoms: Building Blocks of the Universe
Waves and quanta
We are all familiar with water waves and waves on a string
(®gure 7.7). The ripples that form on the surface of the water
after a child throws a pebble in the water are one of the most
common and familiar examples of what we call wave motion.
The rapid up and down motion of a guitar string after a skilled
player plucks it, displaces the air molecules around it, forcing
them to collide with other nearby air molecules. The ®rst molecules bounce back after the collisions only to collide with the
returning string again, which sends them back toward the
second set of molecules, themselves returning from other collisions. Very quickly, these molecular collisions spread out in all
directions from the vibrating guitar string. Eventually, they
reach our ears where, after traveling through the ear canal, they
set the eardrum into similar oscillations. Hair cells a few thousandths of a millimeter thick begin then to move in response to
these oscillations. Their motion is transformed into electrical
impulses that travel to our brain at about 100 miles per hour.
There, in a process that physics cannot yet explain, we feel a
sensation that we describe as pleasure.
Figure 7.7. Ripples formed on the surface of the water are one of the
most familiar examples of wave motion.
141
SUPERSTRINGS AND OTHER THINGS
These are examples of mechanical waves, and we know from
experience that these waves propagate through some medium
(water, the string, the air, the ear membrane or the hair cells in
our examples). Although we cannot see sound waves, we know
that they need some medium to travel through. The elasticity of
the medium allows a disturbance created at some point in
space to propagate through the medium. As a disturbance
propagates, energy is transmitted through the medium without
transferring matter. A wave, then, is a mechanism for the transmission of energy which does not require the actual translocation
of matter. We can convince ourselves that this is the case by observing a piece of driftwood ¯oating in the water. As the waves pass
by, we will see the wood rise and fall. The driftwood does not
travel with the wave. The water moves up and down with the
wood, while the wave travels from one point to another.
Light, on the other hand, is a wave that does not require a
medium to propagate. The main evidence for the wave nature
of light lies with the phenomenon of interference. If you drop
two pebbles into a pond you will notice that at certain places
the crests from the two waves meet and reinforce each other,
while at other places, where a crest meets a trough, the two
waves cancel each other out (®gure 7.8). When two crests or
two troughs meet, the waves interfere constructively, and when
a crest meets a trough, the waves interfere destructively. Combining two wave motions, then, can result in nothing. Two particles
cannot annihilate each other like that.
We can draw an important conclusion from our brief look at
wave phenomena. Waves behave very differently from particles.
Particles travel in straight lines when free of external forces and
they do not show interference. Waves do.
Although some Greek philosophers believed that light was
composed of particles that travel in straight lines (even Isaac
Newton in the seventeenth century favored this idea), by the
end of the nineteenth century there was overwhelming evidence
that light was a wave phenomenon. As has been known since
around 1803 when the English scientist Thomas Young demonstrated it in a series of landmark experiments, light shows the
phenomenon of interference, characteristic of waves. Young
also showed experimentally that different wavelengths of light
correspond to the different colors that we perceive.
142
Atoms: Building Blocks of the Universe
Figure 7.8. Interference of two wave motions. The waves are produced
by two vibrating rods in a water tank. The circular waves combine to
produce a pattern of alternating regions of moving and still water.
(From PSSC Physics Seventh Edition, by Haber-Schaim, Dodge, Gardner,
and Shore. Published by Kendall/Hunt Publishing Company, 1991.)
The colors observed in soap bubbles are an example of interference with light, produced when the light waves are re¯ected
from the front and back of the thin ®lm of the bubble and meet.
When white light, which consists of light waves of all the colors
or frequencies, shines on the bubble, certain frequencies interfere
destructively. The colors corresponding to these frequencies disappear from the re¯ected light. Since the re¯ected light lacks
several frequencies, it appears colored.
Although by the turn of the present century it seemed very
clear that light was a wave, this idea was going to be seriously
challenged, as we are about to learn.
In 1900, the German physicist Max Planck was attempting
to explain the relationship between the temperature of a hot
object and the color of the radiation emitted. This problem had
been in the minds of physicists for some time because the
theory did not ®t the experimental data for radiation emitted in
the ultraviolet. The assumption had always been that the light
143
SUPERSTRINGS AND OTHER THINGS
was emitted from the hot body in a continuous manner. After
many failed attempts, Planck reluctantly incorporated into his
theory the assumption that this radiation was being emitted not
continuously but in discrete amounts. His equation now matched
the experimental data perfectly. According to Planck's new (and,
as it turned out, revolutionary) assumption, the energy emitted
by a hot object could only have certain discrete values, as if it
were carried out in little bundles or packets. Planck called these
bundles of energy quanta. The discreteness of the energy emitted
was expressed through a constant h, now called Planck's constant, which has a value of 6:63 10ÿ34 J s.
Planck's discovery was an ``act of desperation,'' as he would
call it later, without physical basis and done for the purpose of
explaining that particular experimental fact. He would attempt
in vain to ®nd an explanation for his radical assumption within
classical physics for the rest of his life.
In 1905, the young Albert Einstein, unable to obtain a post as
professor of physics because of his rebellious character, was
working as a clerk at a patent of®ce in Berne. The job, he said
later, was not very demanding and left him plenty of time to
work on physics. One of the problems that he considered was
the way in which electrons were ejected from certain metals
when light was shining on them. Although the phenomenon
had been known since 1887, physicists were puzzled by the fact
that the kinetic energies of the released electrons did not seem
to be at all related to the brightness of the light shining on the
metal. Energetic electrons were released from metals even
under extremely weak light.
Einstein realized that the puzzle arose from the assumption
that light was a wave and that a gentle wave (a weak light
beam) could not propel an electron at great speeds. Building on
Planck's discovery, Einstein proposed that light does not only
behave as a wave, as the interference phenomena showed, but
also as a bundle of energy packets that he called light quanta or
photons. A weak beam of light of a certain color is composed
not of weaker bundles of energy but of fewer bundles of the
same strength as those of an intense beam of light of the same
color. Each one of the (few) quanta that make up the weak light
beam is as capable of releasing a fast electron as the quanta that
make up the stronger beam.
144
Atoms: Building Blocks of the Universe
Einstein generalized Planck's theory. He assumed that the
quanta of energy that Planck had introduced were also characteristic of light rather than a special property related only to a single
phenomenon. Light is not simply emitted in bundles of energy;
light is made up of these bundles.
The Bohr model of the atom
We are now ready to understand the basic ideas behind Niels
Bohr's model of the atom. Bohr's model was similar to Rutherford's planetary model, but with some important differences.
Bohr postulated that the electrons moved around the nucleus in
certain speci®c orbits, which he called stationary orbits. Since electrons are negatively charged, they are attracted to the positive
nucleus. This attractive electric force provides the centripetal
acceleration of the electron as it orbits the nucleus. In these
orbits the energy of the electrons (the sum of kinetic and electric
potential energy) remained constant; thus the electrons did not
radiate energy. Only when the electrons moved from one stationary orbit to another orbit closer to the nucleus, decreasing their
potential energy, did they radiate energy. For an electron to
jump to a higher orbit it had to absorb energy. Therefore, when
the electrons moved from one stationary orbit, or state, to another,
they absorbed or emitted energy.
Bohr used Planck's formula to compute the allowed stationary orbits of electrons in orbit; he called this procedure his ``quantization condition.'' He postulated that only certain values of the
angular momentum, L, of the orbiting electron were allowed. The
lowest value of the angular momentum was simply Planck's constant h divided by 2. Higher values of L were multiples of this
minimum value, namely
L
nh
2
where n was a positive integer. Thus, the angular momentum of
the electron was ``quantized''; that is, it took only certain discrete
values. For each allowed value of L, a corresponding value of the
energy of the electron was calculated. In Bohr's theory, each
allowed orbit corresponds to an energy level: the higher the
145
SUPERSTRINGS AND OTHER THINGS
Figure 7.9. A marble on a staircase has quantized values of its potential
energy since it can only reside on a particular step and cannot ¯oat in
between steps. In contrast, a marble on a ramp can acquire any intermediate value of the potential energy as it rolls from top to bottom of
the ramp.
orbit, the higher the energy level. Since each one of the electrons
in an atom resides in one of the allowed orbits where it has a
de®nite energy, the total energy of the atom must have a speci®c
value. Thus, the atom's total energy is quantized. An analogy
using a system from mechanics could help us visualize this concept. A marble on a staircase also has quantized values of its
potential energy (®gure 7.9). It can only increase or decrease its
energy in ®xed amounts which are determined by the height of
each step. A marble on a ramp, on the other hand, can have
any intermediate value of the potential energy as it rolls up or
down the ramp.
The quantization or discreteness of the energy levels in an
atom guarantees that the light absorbed or emitted by an atom
when one of its electrons undergoes a transition between orbits
must be discrete also. Bohr postulated that when an electron
jumped from one energy level to another, a photon was emitted
or absorbed. The energy of the emitted or absorbed photon must
equal the energy difference between the two energy levels.
Figure 7.10 shows an energy level diagram with several possible
transitions. Since each energy level corresponds to an allowed
orbit with a speci®c value of the angular momentum, we use the
positive integer n to indicate the different energy levels. The
energy level with the lowest level, the ground state, has the value
n 1; higher energy levels have values of n equal to 2, 3, and so
on. The number n is called the principal quantum number.
146
Atoms: Building Blocks of the Universe
Figure 7.10. Simpli®ed energy level diagram for an atom where several
possible transitions are shown.
Bohr calculated the allowed transitions between energy
levels for the hydrogen atom and found that they agreed exactly
with the experimental measurements. Soon, however, it became
clear that his theory did not work well for other, more complicated
atoms. There were other problems, too. No explanation was given
for the mechanism of absorption and emission of energy when an
electron jumped orbits nor for the assumption that the electron did
not radiate when in an stationary orbit. Nevertheless, Bohr's
theory was the ®rst attempt at explaining the structure of the
atom and as such, it opened the road for a more complete and
successful theory, quantum mechanics. We shall study the fundamental concepts of this theory in Part 7 of the text.
Molecules
Although Bohr's atomic theory does not work for atoms other
than hydrogen (and therefore is not completely satisfactory), it
is still used to calculate the atomic properties of hydrogen and
as the starting point for approximate calculations for other
atoms. The Bohr model has also been used to obtain an estimate
of the way in which atoms interact with each other. More sophisticated calculations based on quantum mechanics have then been
undertaken to obtain a more accurate result.
Atoms of one particular element bond to other atoms in very
speci®c ways, which are governed by the properties of the
147
SUPERSTRINGS AND OTHER THINGS
electrons of those atoms and their arrangement around their
nuclei. Two or more atoms can combine to form a molecule. The
combination of two hydrogen atoms (H) and one of oxygen (O)
form the water molecule (H2 O). Two atoms of carbon (C), three
of hydrogen, and one of chlorine (Cl) form the vinyl chloride
molecule, a carcinogenic gas which is manufactured in great
quantities because it can be used to form polyvinyl chloride
(PVC), a very useful plastics material. Twenty carbon atoms,
twenty-eight hydrogen atoms and one oxygen form the retinal
molecule, the forerunner of the rhodopsin and iodopsin molecules which absorb the light that reaches the rods and cones,
the photoreceptor cells in the retina of your eye.
We can think of a molecule as a stable con®guration of
several nuclei and electrons. As such, this view is an extension
of the concept of the atom. The atoms that form the molecule
lose their identities and the molecule becomes the new building
block of matter. The bonding of the atoms that form a molecule
is due to the sharing of electrons by the atoms. A shared electron
pair forms a molecular bond. This type of bonding is called a
covalent bond.
Not all combinations of atoms end up as molecules. Some
atoms combine to form a compound. A compound can be considered as formed by the association of several atoms in such a
way as to maintain the identities of the individual atoms without
forming a separate unit. An ionic bond is such a case. In an ionic
bond, the bonding is due to the electrical attraction between oppositely charged ions. Recall that atoms are electrically neutral and
are composed of a positive nucleus at the center and negative electrons around this nucleus. Since electrons occupy the outermost
parts of the atom, they can be removed from or added to atoms
to form ions. A positive ion is an atom lacking one or more electrons, whereas a negative ion is formed when one or more
electrons are added to a neutral atom. The two ions do not exist
as an isolated unit but as a cell of many such pairs. The ionic bonding between a sodium ion (Na ) and a chloride ion (Clÿ ) in table
salt is an example of an ionic compound. In this case, if we remove
an electron from Na to form the positive ion of sodium (Na ) and
attach that electron to the neutral chlorine atom to form a Clÿ ion,
the two ions, one positive and the other one negative, attract each
other. Moving this electron from one atom to the other requires a
148
Atoms: Building Blocks of the Universe
certain amount of energy. In this particular case, the total energy
required is 1.3 eV. However, the energy of attraction between
the two ions is greater than 1.3 eV. It is more energetically favorable for the ions to attract each other than to bring the electron
back and form neutral atoms again.
Physics in our world: Winemaking
You may have noticed that wine labels say ``Contains Sul®tes.'' Wines actually contain sulfur dioxide, a molecule
formed by the combination of one atom of sulfur and two
of oxygen. Although the amount of sulfur dioxide present
naturally in wines is no more than about 10 to 20 parts per
million, a 1988 law requires winemakers to include the statement on their labels if their wines contain more than 10 parts
per million. This law is intended to protect some people who
suffer from asthma and who might be sensitive to it (about
®ve percent of asthmatics are sensitive to sul®tes).
The sulfur dioxide molecule plays an important role in
the winemaking process. Wine is made from fermented
grapes. After the ripe grapes are crushed to release their
juice, yeasts (one-celled organisms that exist on the grape
skins) convert the sugar in the juice into alcohol, a process
known as fermentation. When all the sugar has been converted into alcohol, the grape juice has become wine. The
enormous variety of wines that are produced in the world
comes not only from the different types of grape but also
from the kind of container used to ferment the wine (oak or
stainless steel), the temperature inside this container and
even its size, and how long the wine is stored afterwards.
During the fermentation process, sulfur dioxide is produced naturally in the grape juice. Because it is produced
only in very small quantities, winemakers also add it to
their wines. Sulfur dioxide prevents the growth of bacteria
that would sour the wine into vinegar. Sulfur dioxide also
prevents the growth of wild yeasts that would continue the
fermentation of sweet wines after they are bottled. Finally,
sulfur dioxide is an antioxidant; that is, it prevents oxygen
from combining with the wine.
149
SUPERSTRINGS AND OTHER THINGS
Figure 7.11. Graphic representation of the NaCl ionic compound. Each
Na ion is surrounded by 6 Clÿ ions and each Clÿ ion is surrounded by
6 Na ions. Since no particular Na ion is attached exclusively to a
particular Clÿ ion, there is no unique NaCl molecule.
Although sodium atoms bind to chlorine atoms in the way
just described, molecules are not formed. Each sodium ion is surrounded by six chloride ions and each chloride ion is surrounded
by six sodium ions. Since there is no exclusive union between any
two given ions, there is no unique NaCl molecule (®gure 7.11).
150
8
THE HEART OF THE
ATOM: THE NUCLEUS
Raw material: Protons and neutrons
Werner Heisenberg, a 29-year-old professor of physics at the
University of Leipzig, told Bohr in 1931 that he had given up
concerning himself with fundamental questions, ``which are too
dif®cult for me.'' Heisenberg had in mind his failure to explain
the physics of the atomic nucleus with the quantum mechanics
that he had invented only ®ve years earlier. In the Christmas
issue of the Berliner Tageblatt, he wrote that progress on fundamental questions such as the quantum mechanics of the nucleus
would have to wait for new discoveries about that small piece of
matter at the heart of the atom. ``Whether indeed the year 1932
will lead us to such knowledge is quite doubtful.''
Heisenberg was mistaken. Six months later he came up with
the ®rst quantum mechanics of the nucleus, a theory that became
the basis for the nuclear physics of today. Rutherford's scattering
experiment had given physicists the ®rst clues about the size,
charge, and mass of the nucleus of the atom: The nucleus is
about 10ÿ15 meters across, is positively charged, and contains
most of the mass of the atom. The atom itself, on the other
hand, is about 10 000 times larger and is electrically neutral. If
the atom were expanded to the size of an auditorium, the nucleus
would be represented by a pea at the center.
As soon as the existence of the nucleus was demonstrated,
physicists started thinking about its composition. Rutherford
had identi®ed the basic positive charge, which had been observed
as early as 1886 by Eugen Goldstein in Germany, as nuclei of
hydrogen. He proposed the name proton for this particle. It was
found that the proton was 1836 times more massive than the
151
SUPERSTRINGS AND OTHER THINGS
electron, but contained a positive electric charge of the same magnitude as the negative electric charge of the electron. For the lightest element, hydrogen, things were relatively simple: its nucleus
was a single proton with an electron orbiting around it, making
the atom neutral. Furthermore, the ratio of the mass of the hydrogen nucleus to that of the orbiting electron was consistent with the
ratio of the mass of the proton to the mass of the electron.
Things were not working well for the other atoms. Although
it was known that the helium atom had two electrons and thus
two protons in its nucleus, its mass was not simply twice the
Pioneers of physics: Heisenberg's failing grade
Perhaps the most important early twentieth century physicist
after Einstein and Bohr, Werner Heisenberg rose meteorically
to the top of his profession. He published his ®rst scienti®c
paper at the age of 20, invented quantum mechanics at
the age of 24, attained a full professorship in physics at the
age of 25, and the Nobel Prize at the age of 32. Yet, this
brilliant scientist almost failed to pass his ®nal oral doctoral
examination.
Because of his talent and his successes in theoretical
physics, Heisenberg completed his dissertation research in
a very short time. In Germany at the time, students entering
the university could pursue studies leading toward a
diploma ± equivalent to an American masters degree ± or
toward a doctorate. The latter required being admitted by a
professor into a research program with no speci®c course
requirements. Heisenberg decided on the doctorate and
took courses in mathematics and theoretical physics but
neglected laboratory courses. Professor Wilhelm Wien was
in charge of the physics laboratory and was also on Heisenberg's ®nal exam committee. When Heisenberg could not
explain in detail the operation of an electric battery during
the ®nal oral nor answer several questions about microscopes
and telescopes, Wien decided to give him a failing grade
for the dissertation. Fortunately for Heisenberg, the other
professors on the committee gave him better grades and he
squeezed through with a pass grade.
152
The Heart of the Atom: The Nucleus
mass of the hydrogen atom. Thus, the nucleus of the helium atom,
which Rutherford had identi®ed as being an alpha particle, and
which was known to have a positive electric charge equivalent
to the negative charge of two electrons, had to contain something
more than just two protons.
In 1920 Rutherford predicted the existence of a particle with
no electric charge but possessing the same mass as that of the
proton. William D Harkings in the United States, and Orme
Masson in Australia also predicted its existence at about the
same time. Harkings proposed the name neutron for this new
subatomic particle. Twelve years later, James Chadwick, a
former collaborator of Rutherford's, announced the discovery of
neutrons. Chadwick received the 1935 Nobel prize in physics
for this discovery.
After the discovery of the neutron was announced, Heisenberg realized that this neutral particle provided the key to the
physics of the nucleus. In a now classic three-part paper entitled
``On the composition of atomic nuclei,'' Heisenberg proposed
that atomic nuclei were composed of two types of particles,
protons and neutrons. Subsequent experiments veri®ed that
this scheme was indeed correct. These two nuclear particles are
collectively known as nucleons.
The composition of the nucleus
The present view of the nucleus is essentially the one proposed by
Heisenberg; that is, the nucleus is composed of protons and
neutrons. Protons have an electric charge of 1:6 10ÿ19 C,
where the C stands for coulomb, the SI unit of electric charge.
The mass of one proton is 1:673 10ÿ27 kg, slightly smaller than
the mass of a neutron, which is 1:675 10ÿ27 kg. Neutrons, as
we said earlier, have no electric charge.
Since protons and electrons have equal and opposite charges
and the atom is electrically neutral, the number of protons in the
nucleus must be equal to the number of orbiting electrons in the
atom. Oxygen, for example, has eight protons in its nucleus and
eight orbiting electrons.
The number of protons in the atom determines the atomic
number, Z. The atomic number of oxygen is thus 8. The total
153
SUPERSTRINGS AND OTHER THINGS
number of nucleons ± that is, the total number of neutrons and
protons ± is called the atomic mass number, A, since it gives the
approximate total mass of the nucleus. The atom of oxygen,
with 8 protons and 8 neutrons, has an atomic mass number of
16. The number of neutrons is given by the neutron number, N.
Thus
A Z N:
The different nuclei are designated by the same symbols used to
name the corresponding chemical elements. So we would use H
for the nucleus of hydrogen, O for oxygen, and so on. The different types of nuclei are referred to as nuclides. A complete description of a nuclide is given by specifying its symbol together with
the atomic and mass numbers, in the form
A
ZE
where E is the chemical symbol for the element. The neutron
number need not be speci®ed since N A ÿ Z. For example,
the nucleus of oxygen with eight protons and eight neutrons is
written as 168 O because its atomic number is 8 and its mass
number is 16. The nucleus of helium, with two protons and two
neutrons, is written as 42 He.
The number of protons in the nucleus or the atomic number
determines the element. The nucleus of carbon, for example, contains 6 protons. The number of neutrons can vary, however, and
we might ®nd carbon nuclei containing different number of
nucleons, like 126 C, 136 C, and 146 C, with 6, 7, and 8 neutrons respectively. Nuclei with the same number of protons but a different
number of neutrons are called isotopes. The nucleus of the most
common isotope of hydrogen, as we have seen, consists of a
single proton. Another less common isotope of hydrogen, deuterium or heavy hydrogen, has a nucleus with one proton and one
neutron (®gure 8.1). Water made from deuterium is called heavy
water. While chemically indistinguishable from regular water,
heavy water has different physical properties. For example, it
boils at a slightly higher temperature than regular water.
The isotope 126 C of carbon is used to de®ne a new unit of mass
for atomic and nuclear calculations. Because the mass of the
proton and the mass of the neutron are slightly different and
much larger than the mass of the electron, this new unit of
154
The Heart of the Atom: The Nucleus
Figure 8.1. The nucleus of the most common isotope of hydrogen
consists of one proton, represented by a small sphere with a (for its
positive electric charge). Deuterium, another hydrogen isotope, contains
one proton and one neutron.
mass was de®ned to take this into account. The atomic mass unit
(amu) is de®ned as one-twelfth of the mass of the isotope 126 C.
Because the mass of the atom is used, this de®nition includes
the mass of the 6 electrons in this isotope. As we saw in
chapter 1, the conversion factor between atomic mass units and
kilograms is
1 amu 1:660 566 10ÿ27 kg:
The following table lists the masses of the proton, neutron and
electron in amu and kg.
Particle
Mass (amu)
Mass (kg)
proton
neutron
electron
1.007 276
1.008 665
5:486 10ÿ4
1:6726 10ÿ27
1:6750 10ÿ27
9:1094 10ÿ31
The glue that keeps the nucleus together
There is an apparent problem with what we have just said about
the composition of the nucleus. As we saw in Chapter 3, positive
charges repel each other (®gure 8.2). Since the nucleus contains
only positively charged protons and uncharged neutrons, then
the protons should repel each other and the nucleus should ¯y
apart. In fact, the electrical repulsion between two protons in
the nucleus is about 100 billion times greater than the electrical
Figure 8.2. Electrostatic repulsion between two protons.
155
SUPERSTRINGS AND OTHER THINGS
Figure 8.3. The attractive nuclear force between two protons separated
by a distance of 2 10ÿ15 m is about 100 times greater than the repulsive
electric force.
attraction between the negative electrons and the positive
nucleus. Furthermore, the neutrons do not help. They have no
charge and there is no reason why they should remain together.
Yet the nucleus does not ¯y apart. What keeps it together?
There has to be an attractive force, stronger than the electrical
repulsion between two protons when they are close together,
that acts not only on the charged protons but also on the
uncharged neutrons and keeps them together. This new force is
called the strong nuclear force (®gure 8.3). At a distance of
2 10ÿ15 m the attractive strong nuclear force between two
protons is about 100 times greater than their electrical repulsion.
The nuclear force is charge independent; that is, it acts with the
same strength on neutrons as it does on protons (®gure 8.4). It is
also a short range force; that is, it becomes active at very short
distances, over a range of 10ÿ14 to 10ÿ15 meter. If the nucleons
are separated farther, the force approaches zero. Because of the
short range nature of the nuclear force, a nucleon in the nucleus
interacts only with its nearest neighbors. In a nucleus with
more than about 30 nucleons, the interaction of a nucleon with
its immediate neighbors is not affected by the total number of
nucleons present in the nucleus. The nuclear force thus becomes
saturated.
The electrical force, on the other hand, has an in®nite range,
so a proton feels the electrical repulsion of all the other protons in
the nucleus. If a particular nucleus has too many protons, the
electrical repulsion on any one proton from all the other protons
present may exceed the nuclear attraction thereby making the
nucleus unstable. This type of nucleus is called radioactive and
156
The Heart of the Atom: The Nucleus
Figure 8.4. The nuclear force acts with the same strength on protons or
neutrons. We say that the nuclear force is charge independent.
is the same kind of nucleus as was used by Rutherford in his
experiments.
Because the nuclear force binds the nucleons together in the
nucleus, work must be done to separate them. This is analogous
to the case of two magnets sticking together with their opposite
poles facing; work must be done to separate them (®gure 8.5).
Conversely, when you bring the magnets together with the opposite poles facing, they pull your hands closer; the magnetic force
does work on you. The energy required to move your hands
comes from the magnetic potential energy stored in the two magnets. Similarly, when nucleons are assembled together to form a
nucleus, energy is released. Thus, the total energy of the nucleus
(bound system) is less than the total energy of its separated
nucleons. This energy difference is called the binding energy and
is equal to the work needed to separate the nucleus into its component nucleons.
From Einstein's special theory of relativity, it is known that if
the energy of the free nucleons is more than the total energy of the
nucleus, then the mass of the nucleons must be greater than the
157
SUPERSTRINGS AND OTHER THINGS
Figure 8.5. You must do work to separate two magnets with their
opposite poles facing. If you stop pulling them apart, the magnets will
move closer together, pulling your hands along and doing work on
you. The energy required to move your hands is stored in the two
separated magnets.
mass of the nucleus formed from the same nucleons. As we shall
see in Chapter 19, the difference in mass is converted into energy
when the nucleons are brought together, according to Einstein's
formula E mc2 , where c is the speed of light, equal to
2:997 925 108 m/s.
Einstein's formula allows us to ®nd the energy equivalent
in MeV of any given mass. The energy equivalent of 1 amu can
be calculated as follows (recall that 1 amu is equal to
1:660 566 10ÿ27 kg):
E mc2 1:660 566 10ÿ27 kg 2:997 925 108 m=s2
14:9244 10ÿ11 kg m2 =s2 :
Since we are using SI units throughout, the units of energy should
come out in joules (as we saw in Chapter 4, 1 kg m2 =s2 1 J).
Remembering that 1 eV 1:602 10ÿ19 J, or 1 MeV 106 eV
1:602 10ÿ13 J, we have
E mc2 14:9244 10ÿ11 J
1 MeV
931:5 MeV:
1:602 10ÿ13 J
Figure 8.6 illustrates the fact that the mass of the nucleus and
consequently the mass of the atom (42 He in the ®gure) must be
smaller than the total mass of the component particles (two
protons, two neutrons and two electrons). The ``missing mass''
or mass defect is the source of the energy that keeps the atom
together.
158
The Heart of the Atom: The Nucleus
Figure 8.6. The 42 He atom has less mass than its component particles.
To get an idea of how strongly the nucleons are bound in the
nucleus it is useful to calculate the binding energy per nucleon,
obtained by dividing the total binding energy by the mass
number A. For 42 He, the total binding energy is 28.3 MeV. Since
the mass number (the total number of nucleons) is 4, the binding
energy per nucleon is 7.1 MeV. If we calculate the binding energy
per nucleon for different nuclei and make a graph of the values
obtained against mass number, we ®nd that it increases with
mass number, from 1.1 MeV for deuterium (an isotope of hydrogen) to about 8.0 MeV for oxygen-16. For nuclei with mass
number greater than about 16, the binding energy per nucleon
remains nearly the same, increasing slightly from A 18 to
A 50 where it peaks at 8.7 MeV. Beyond A 50 it begins to
decline to about 7.5 MeV for A 250 (®gure 8.7). The peak at
around A 50 (the elements near iron in the middle of the
periodic table), means that more energy is required to remove
a nucleon from these elements; they are, consequently, more
stable.
Size and shape of the nucleus
Most nuclei have a nearly spherical shape with a diameter of
about 10ÿ15 m. Nuclear sizes were originally determined by
means of scattering experiments similar to Rutherford's; fast
alpha particles were shot at a nucleus and detected at the other
159
SUPERSTRINGS AND OTHER THINGS
Figure 8.7. Binding energy per nucleon versus atomic mass number.
Nuclei near the middle of the periodic table (mass number between 50
and 60) are more stable than elements at either end of the table, since
the binding energies per nucleon for these middle elements are maximum. They are, therefore, more tightly bound.
side. As in Rutherford's experiments, observing the distribution
of the alpha particles after they interacted with the nucleus
yielded information about the size of the target nucleus.
Additional information about the nucleus was obtained
when fast electrons replaced the alpha particles as projectiles.
Electrons offer the advantage that they do not feel the nuclear
force and therefore are only affected by the electrical force of
attraction to the protons in the nucleus.
Neutrons have also been used to measure nuclear dimensions. Since neutrons have no electric charge, they feel only the
nuclear force.
All these experiments have yielded information about the
dimensions of the nucleus by measuring different parameters.
From the experimental evidence obtained we know that the
volume of the nucleus is proportional to the mass number A.
Since the mass number is the total number of protons and
neutrons in the nucleus, this result means that the nucleons are
packed in more or less the same way in all nuclei. We can understand this concept better with an example. Suppose two children
are given several ping-pong balls and some plasticine and told to
build a large sphere using the plasticine as glue (®gure 8.8). If one
160
The Heart of the Atom: The Nucleus
Figure 8.8. The two children build spheres with ping-pong balls stuck
together with plasticine. The girl uses up 30 balls to build up her sphere
while the boy uses 60. If they use the same amount of plasticine to stick
each ball to the others, the balls will be equally packed. In this case, the
60-ball sphere will have twice the volume as the 30-ball one. Nucleons in
nuclei are also equally packed and their volumes are proportional to the
number of nucleons in the nucleus (its mass number).
child uses up 30 balls to make her sphere and the second child
uses 60 balls to make a sphere that occupies twice the volume,
we can de®nitely say that the two children packed the balls the
same distance apart; that is, they used more or less the same
amount of plasticine to stick each ball to the others. If, however,
the second child ended up with a sphere occupying a volume
of only one and a half times, we conclude that he packed his
ping-pong balls closer together. Nucleons in nuclei are equally
packed.
Nuclear energy levels
Neutrons and protons in the nucleus, like electrons in the atom,
must obey the rules of quantum mechanics. These rules specify
that nucleons (protons and neutrons) can move only in certain
161
SUPERSTRINGS AND OTHER THINGS
Figure 8.9. Nuclear energy level diagram. The nucleon energies are
distributed in groups or shells, separated by gaps. N.B. Nucleons with
higher energies are not necessarily orbiting farther away: the diagram
refers only to the energy values.
162
The Heart of the Atom: The Nucleus
allowed orbits. These orbits are speci®ed by four labels or quantum numbers.
Similarly to the case of the electrons orbiting in the atom, the
quantum number n determines the energy of the orbiting
nucleon. This number can only take whole-number values; that
is, 1, 2, 3, 4, etc. The higher the number, the higher is the value
of the energy. For most purposes, we can say that a higher
value of the energy means that the nucleon travels at a higher
speed in its orbit within the nucleus.
The second quantum number, the angular momentum quantum number, determines the shape of the orbit. Higher values of
the angular momentum represent more circular orbits whereas
lower values represent more elliptical ones. The third and
fourth quantum numbers determine the direction in which the
nucleon orbits and the orientation of this orbit in space.
Using these quantum numbers, the possible energies of the
allowed orbits can be determined. As shown in ®gure 8.9, these
energies are distributed in groups or shells. As is the case in
most cases governed by the laws of quantum mechanics, we
must be careful not to assign too literal a meaning to these
shells. All nucleon orbits have nearly the same average radius.
The shells refer only to the energy values, not to the orbital sizes.
163
9
FLUIDS
States of matter
The atoms that make up all matter in the universe arrange themselves in different ways to form rocks, water, galaxies, trees or
people. Different atoms interact in different ways to form a
substance. When the interacting atoms of a substance are not
moving about too much, they occupy more or less ®xed positions
and arrange themselves in a geometric pattern which minimizes
the interaction energy. This pattern is repeated throughout the
substance. When this happens the substance is called a crystal
(®gure 9.1).
The atoms of a substance can also arrange themselves in
other energy con®gurations which are not ordered and do not
show a repetitive pattern, even when the atoms are not moving
very much. In this case, the substance is noncrystalline or
amorphous. Crystals and amorphous materials are solids. The
Figure 9.1. A representation of a crystal of sodium chloride.
164
Fluids
forces that bind the atoms together in a solid are strong enough
for the solid to maintain its shape.
When the binding forces are weaker, the atoms or molecules
do not occupy ®xed positions and move at random. These substances are called ¯uids. Liquids and gases are ¯uids. In a liquid
the binding forces between the molecules are strong enough to
make the liquid stay together but weak enough to allow it to
¯ow. The molecules of a gas, on the other hand, are in chaotic
random motion and interact only during very short times. Since
the molecules of a gas are not bound together, the gas expands
to ®ll any container.
Density
If you have ever held a small bottle ®lled with mercury, you probably remember the strange impression of holding such a heavy
liquid. Actually, mercury is not, strictly, ``heavier'' than water
nor any other liquid. You can certainly have a container ®lled
with water that weighs as much as your small bottle with
mercury (®gure 9.2). The only difference is that the bottle with
water has to be much larger and contain much more water.
Mercury is denser than water. If you take the same volumes of
water and mercury, the mercury weighs much more; 13.6 times
more, to be exact. This means that the mass of mercury is 13.6
times greater than the corresponding mass of water that occupies
the same volume. If you want to have the same mass of both
mercury and water, the amount of water required will occupy a
volume 13.6 times as large as the volume of mercury.
Thus if two substances have the same volume, the substance
with the greater mass will have a greater density. That is, density
is directly proportional to mass. Conversely, if you have two
different substances with the same mass, the denser substance
occupies a smaller volume. That is, density is inversely proportional to volume. The density of a substance is the mass per unit
volume of the substance, or
mass
density
:
volume
The Greek letter (rho) is generally used to represent density.
If we call the mass m and the volume V, we can write this
165
SUPERSTRINGS AND OTHER THINGS
Figure 9.2. The bottle of water in the man's left hand has a volume 13.6
times as large as the small bottle with mercury of the same weight in his
right hand.
expression in symbols as
m
:
V
Since the gram was originally de®ned as the mass of 1 cubic
centimeter of water, the density of water is equal to 1 gram/
cm3 or 1000 kg/m3 . 1000 kg is a metric tonne. The densities of
other common substances are shown in Table 9.1.
Pressure
Fluids can exert forces on other bodies. Since a ¯uid is in contact
with the surface of the container, the force that the ¯uid exerts on
the container is not localized at any one particular point and
instead distributes itself over the entire surface. Consider, for
166
Fluids
Table 9.1. Densities of some common substances
Substance
Density (kg/m3 )
Substance
Density (kg/m3 )
Hydrogen
helium
air
oxygen
wood (oak)
steam (1008C)
Oil
Ice
Water (48C)
0.090
0.1758
1.29
1.4
600±900
600
900
920
1000
human blood
sea water
bone
iron
copper
silver
lead
mercury
gold
1060
1300
1700±2000
7800
8930
10 500
11 300
13 600
19 300
At sea level atmospheric pressure and at 08C.
example, a glass of water. Each molecule of water moves, on average, at speeds of over a thousand kilometers per hour and collides
several billion times a second with other molecules and with the
sides of the glass. Each collision with the surface of the glass
exerts a tiny force. The sum of the billions of collisions that take
place each second, each exerting a small force on a very small
area of the glass, results in the total force that the water exerts
on the surface of the glass. We can see that it is more convenient
to consider the force per unit area of the surface, which we call the
pressure, P:
pressure
force
area
or P
F
:
A
If we stop for a moment and consider a couple of examples,
we might be able to visualize the concept of pressure better. If you
hold a nail between your thumb and index ®nger, the sharp tip of
the nail will push into your skin much more readily than the blunt
head (®gure 9.3(a)). You feel that the pressure of the tip is greater,
because the force is applied to a much smaller area. The smaller
the area to which the force is applied, the greater the pressure.
Conversely, the pressure is smaller if the area increases. This is
the principle behind snowshoes (®gure 9.3(b)). By increasing
the surface area in contact with the snow, the pressure that the
person's body exerts is decreased. When somebody falls through
thin ice in a frozen lake, rescue personnel approach the scene
crawling, to minimize the pressure that their bodies exert on
the ice.
167
SUPERSTRINGS AND OTHER THINGS
Figure 9.3. (a) Since the same force is applied to a smaller area, the
sharp end of the nail will push into your skin much more readily than
the head. (b) By increasing the area in contact with the snow, the pressure
is reduced and the boy does not sink in the snow.
The SI unit of pressure is newtons per square meter (N/m2 ),
which is called the pascal (Pa), in honor of the French scientist
Blaise Pascal (1623±1662) who did pioneering work with ¯uids.
Atmospheric pressure
Galileo in his Two New Sciences (1638) wrote that a lift-type pump
cannot pump water from a well deeper than about 10 meters.
These pumps were used widely at the time to obtain drinking
water but nobody knew exactly why they worked or why there
was a limit to the depth of the well.
It was a student of Galileo, Evangelista Torricelli, who solved
the problem. By studying the results of his own experiments and
those of his contemporaries Guericke, Pascal, and Boyle, Torricelli was able to explain that a lift pump works because of the
pressure of the air. By pumping, the pressure at the top of the
pipe is reduced. The pressure of the air on the water surface
below pushes the water up the pipe.
The air exerts pressure because the blanket of air surrounding the earth has weight. The force exerted on a 1 square meter
area on the surface of the earth at sea level by the column of air
above is 101 300 N. The pressure of the atmosphere at sea level
is thus 101:3 103 N/m2 or 101.3 kPa. This is normal atmospheric
168
Fluids
Figure 9.4. A column of water about 10 meters high exerts the same
pressure as a column of air above the surface of the earth.
pressure, called 1 atmosphere (atm). Thus,
1 atm 101:3 kPa:
A column of water of about 10 meters high with a cross
section of 1 m2 also weighs 101.3 kN. Consequently, this column
of water exerts the same pressure as the air above the surface of
the earth (®gure 9.4). Therefore, if a good lift-pump can reduce
the pressure inside the pipe to almost zero, the atmospheric
pressure can force water up to some 10 meters. It will not be
able to raise water up to 11 meters, for example, since this
requires a pressure that is larger than the atmospheric pressure
at sea level.
169
SUPERSTRINGS AND OTHER THINGS
Figure 9.5. The atmospheric pressure on the water in the bowl is larger
than the pressure of the water in the glass.
For the same reason, if you invert a glass of water into a bowl
with water, the water does not ¯ow out of the glass into the bowl.
The atmosphere pushes down on the water in the bowl with a
force larger than the force exerted by the water in the glass
(®gure 9.5).
Torricelli performed his experiments on air pressure not with
water but with mercury. Mercury, having a density of 13 600 kg/
m3 or 13.6 times the density of water, requires a column that is 13.6
times smaller than water to match the atmospheric pressure. A
glass tube ®lled with mercury inverted into a pool of mercury is
supported by the atmospheric pressure on the pool (®gure 9.6).
At sea level, the air pressure supports a column of mercury
760 mm high. At higher altitudes, the air pressure is less, since
there is less air above, and the mercury column that can be
supported at these higher altitudes is less than 760 mm. Because
it can be used to measure atmospheric pressure, Torricelli's invention is called a barometer, from the Greek baros, meaning weight.
The pressure of air that supports a column of mercury 1 mm
high is called a torr, in honor of Torricelli. At sea level the atmospheric pressure has an average value of 101.3 kPa or 760 mm of
mercury or 760 torr. This pressure is also called 1 standard atmosphere (1 atm).
Knowing the pressure of the air, we can calculate the mass of
air above us. Assume that our bodies, when standing up, have a
cross-sectional area of about 0.70 m2 . The weight of the air above
is w F PA 101:3 103 N/m2 0.70 m2 ) or 71 103 N.
Now, this weight is mg, and the value of the acceleration due to
gravity, g, is still 9.8 m/s2 at altitudes of a few tens of kilometers.
170
Fluids
Figure 9.6. Torricelli's barometer consists of a glass tube on a pool of
mercury. The air above the mercury level in the tube has been evacuated
(although mercury vapor is present). The air pressure at sea level is the
same as the pressure of a column of mercury 760 mm high.
Therefore,
m
w 71 103 N
7200 kg:
g
9:8 m=s2
Certainly, we cannot support a mass of 7200 kg over our
shoulders. Why are we not crushed by this enormous weight?
The reason is that the pressure of the air acts in all directions,
not just downward. Every square centimeter of our bodies is subject to this pressure. The air pushes down and up and sideways
171
SUPERSTRINGS AND OTHER THINGS
on us. And since there is air in our lungs, it pushes from the
inside, too. All these forces balance out and we are not crushed
by the atmospheric pressure.
The air inside airliner cabins is kept at a pressure no less than
the pressure of the air at an altitude of 2500 m above sea level. In
the United States, aircraft that have accumulated more than
55 000 landings are barred from ¯ying above 7500 m until they
pass a stringent inspection. At an altitude of 8000 m, if the
fuselage structure has been weakened by cracks, the difference
between the inside and outside pressure can cause a rupture. In
April 1988 an older airplane with more than 90 000 landings
tore open while ¯ying over Hawaii at an altitude of 8000 m.
Much of the upper part of the fuselage behind the cockpit was
ripped away, causing the death of a ¯ight attendant who was
swept out of the cabin.
Pressure in a liquid
When you dive into the deep part of a swimming pool, you feel
the pressure in your ears. This pressure increases as you descend
to greater depths in a lake or the ocean. The pressure buildup is
due to the weight of the water above you.
In ®gure 9.7 the woman is swimming at a depth h (1 meter,
for example). Consider the column of water pushing against a
Figure 9.7. A swimmer at a depth h feels a pressure due to the water
equal to gh.
172
Fluids
patch of skin on her back, 10 cm on its side. The column of water
has a volume V Ah so that its mass is m V Ah. The
pressure of the column of water on the patch A is
P
F mg
Ahg
A
A
A
or
P gh:
Notice that the size of the patch is not important since the value of
the area A does not appear in our expression for pressure. The
pressure on the swimmer is proportional to the depth at which
she swims. This is because the density of the liquid is constant
no matter what the depth. Liquids are incompressible, since the
electrons of the atoms and molecules, already very close to each
other, resist getting any closer. Because the water molecules
move at random in all possible directions, they collide with the
swimmer's body from all sides, exerting pressure equally in all
directions.
The pressure (due to the liquid) on the woman, swimming
1 meter below the surface, would be P (1000 kg)(9.8 m/s2 )
(1 m) 9:8 kPa. The pressure on a ®sh, swimming in fresh
water 1 meter below the surface would also be 9.8 kPa.
Since there usually is atmospheric pressure acting on the surface of a liquid, the total pressure is the sum of the atmospheric
pressure, Pa , and the pressure of the liquid:
PT Pa gh:
The fact that the pressure in a liquid depends only on the distance
below the surface means that the pressure does not depend on the
shape of the container.
If we apply some external pressure to a liquid, by means of a
piston, for example, as shown in ®gure 9.8, the pressure is transmitted undiminished to all points of the liquid. This was discovered
by the French scientist and philosopher Blaise Pascal in 1651
and is known as Pascal's principle.
In ®gure 9.8, the pressure exerted by the force F on the lid at
the left side of the container is transmitted to all the points in the
liquid and to the walls of the container. The pressure is transmitted also to the piston on the right, and if we assume that
173
SUPERSTRINGS AND OTHER THINGS
Figure 9.8. An external pressure applied to a liquid is transmitted undiminished to all points in the liquid. This is called Pascal's principle.
this piston can move without friction, it will rise as the piston on
the left is pushed down.
Pascal's principle is valid for any ¯uid in equilibrium. (For
gases, it must be modi®ed to take into account the change in
volume when the pressure changes.) For liquids, Pascal's principle is a consequence of the incompressibility of the liquids.
The molecules of a liquid are nearly as close to each other as
they are in a solid, so there is not much space left between
them. Any pressure applied at some point in a liquid is transmitted to every point of the liquid as each molecule feels the pressure of the molecule behind. Because, as was mentioned earlier,
the molecules move at random in all directions, the collisions
take place in all possible directions, which results in the pressure
being transmitted equally in all directions. The swimmer feels the
pressure in her eardrums no matter what the orientation of her
head is in the water.
The hydraulic lift illustrated in ®gure 9.9 is an application of
Pascal's principle. A force f applied to the piston of area a results
in an external pressure P f =a. This pressure is transmitted to all
the points in the liquid in all directions; in particular to the
platform holding the car. If the section of the platform in contact
with the liquid has an area A, this pressure will be also equal to
the force lifting the car, F, divided by the area A. Because this
area is much larger, the force lifting the car has to be larger to
keep the same ratio. The hydraulic lift acts as a force magni®er,
174
Fluids
Figure 9.9. Hydraulic lift. The pressure P f =a exerted by the piston of
area a is transmitted to all points in the liquid. The pressure P F=A on
the larger area A, must be the same as the pressure on the smaller area on
the left. To maintain the same ratio, the force F acting on the larger
platform must be larger. This larger force can balance a large weight
(like that of a car) resting on the platform.
so that a small force f on one side produces a large force F on the
other that is able to lift the car. To compensate, the piston moves
much farther than the platform, a distance A=a as great.
The brake system of a car, illustrated schematically in ®gure
9.10, is another application of Pascal's principle. As you step on
Figure 9.10. Brake system of a car.
175
SUPERSTRINGS AND OTHER THINGS
the brake pedal, the increase in pressure is transmitted through
the brake ¯uid, which causes movable pistons to push the
brake shoes or pads against the drums or discs. Friction between
these surfaces slows the car.
Buoyancy
In the third century BC, the Syracusan king Hieron II asked his
relative Archimedes, who happened to be the greatest scientist
of the time, to determine whether a gold crown the king had commissioned had more than the allowed mixture of silver in it, and
to do this without destroying the crown.
Archimedes was famous for the invention of ingenious
mechanical devices. He is supposed to have constructed war
machines that for three years held the invading Romans at bay
in their siege of the city. Cranes mounted on the cliffs above the
sea would hold heavy stones that pounded on the soldiers that
ventured toward the walls. Similar cranes would be used to lift
and overturn the landing ships. There is a story that he constructed a large mirror to re¯ect the sun's heat on the Roman
ships and set them on ®re.
Much of this was undoubtedly exaggerated, especially since
we know about it only indirectly, from the writings of the later
Greeks. Archimedes himself set no value on these devices,
regarded them as beneath the dignity of pure science and published only his scienti®c work.
The importance of his scienti®c work is probably best
exempli®ed by his method for the calculation of the number .
He calculated the perimeters of polygons inscribed inside and
outside a circle and obtained bounds for that ranged from 3 10
71
to 3 17. His method was unsurpassed until Newton invented
integral calculus, eighteen hundred years later. It has been
asserted that Archimedes's method truly anticipated the integral
calculus.
After the king's request, Archimedes was perplexed as to
how to go about determining the gold content of the crown without destroying it. As he stepped into his bath one day he noticed
that the water over¯owed and it occurred to him, or so the story
goes, that the amount of water that over¯owed must have the
176
Fluids
same volume as the part of his body under the water. By placing
an amount of gold of the same weight as the crown under water,
the correct volume of a gold crown could be determined. If the
crown had any silver, it would displace more water than the
same weight of gold, because silver is less dense than gold. He
was allegedly so excited by this discovery that he ran through
the streets of Syracuse to the palace without any clothes on shouting ``eureka, eureka'' (``I've got it! I've got it!''). Archimedes had
proved that the crown had more than the required amount of
silver in it and king Hieron had the cheating goldsmith executed.
From this incident, Archimedes worked out his principle of
buoyancy, known today as Archimedes' Principle, which can be
stated as follows:
Archimedes' principle: An object partially or completely submerged
in a ¯uid is buoyed up by a force equal to the weight of the ¯uid
displaced by the object.
We all have experienced buoyancy. It is relatively easy to lift a
rock underwater. It requires a much greater effort to lift the
same rock on dry land. If you have ever attempted to teach someone to swim, you probably noticed how easy it was to hold that
person up while in the water. In all these cases, the water exerts
an upward force on the object. The source of this upward force
is the difference in pressure between the top and and the
bottom of the submerged object. Since pressure increases with
depth, the pressure at the top of the object is smaller that the pressure at the bottom. This pressure difference results in an upward
force on the submerged object.
To illustrate Archimedes' Principle, let's consider a small
aluminum cube immersed in water, as shown in ®gure 9.11.
Imagine now that, somehow, the solid cube is removed and the
cavity left behind is ®lled with more ¯uid. Clearly, this added
¯uid, undistinguishable from the ¯uid surrounding it, is in equilibrium and experiences pressures and forces that depend on the
distance below the surface. Since the entire ¯uid remains in equilibrium, the extra ¯uid must be subject to the same upward net
force as the solid cube it replaced. Moreover, this buoyant force
must be just suf®cient to balance the weight of the extra ¯uid.
We can see that if we now replace the added ¯uid with a lead
cube of the same dimensions as the original aluminum cube,
177
SUPERSTRINGS AND OTHER THINGS
Figure 9.11. The buoyant force acting on the submerged cube results
from the difference in pressure at the top and at the bottom. This buoyant
force is equal to the weight of the ¯uid displaced by the cube and does
not depend on the density of the cube. This is Archimedes' Principle.
this new heavier cube will be subject to the same buoyant force as
the ®rst cube and as the ¯uid itself. The force that buoys up the
cubes is equal to the force that buoys up the ¯uid that each
cube displaces, and this force is equal to the weight of the displaced ¯uid.
Clearly, for a given weight, the larger the amount of water it
displaces when submerged in water the greater the buoyant force
would be. This is the reason why ships, made of steel, which is
much denser than water, can ¯oat. If we were to take the
amount of steel used in building a ship and reshape it into a
big solid cube, it would sink when immersed in water (see
®gure 9.12). The amount of water displaced by the solid cube of
steel is not large enough to provide a buoyant force that could
balance the weight of the cube. When this steel is in the shape
of a ship, that is, a large hollow bowl, the same weight of steel
now displaces a larger volume of water which has a larger
weight. By Archimedes' principle, this larger weight of the
178
Fluids
Figure 9.12. A big mass of steel in the shape of a cube sinks because the
volume of water it displaces is not large enough. The same mass, in the
shape of a ship, displaces more water, and the ship ¯oats.
water displaced produces a larger buoyant force on the ship,
enough to keep it a¯oat.
Surface tension and capillarity
Why is a liquid drop spherical? How can a spider walk on the
surface of a pond? Why is it possible to carefully place a razor
blade on the surface of the water in a glass and have it ¯oat?
These phenomena result from a property of liquids called surface
tension. To understand how this works, let's consider a water
molecule in the interior of a glass of water (®gure 9.13). This
water molecule is surrounded in all directions by other water
molecules, so that all the attractive forces from these molecules
cancel out. A molecule on the surface of the water, however,
has other water molecules only to the sides and below it
exerting attractive forces. Therefore, this molecule is pulled in
toward the interior of the liquid by these unbalanced forces.
The result is that the surface of the liquid becomes compressed,
and this acts to make the surface area of the liquid as small as
possible.
We can now answer the questions posed above. A liquid
drop is spherical because a sphere has the smallest surface area
for a given volume. When you place a razor blade or a needle
carefully on the surface of the water, the molecules of water on
the surface are depressed slightly and the neighboring molecules
exert upward restoring forces on them, supporting the razor
blade. The surface of the liquid acts like a stretched elastic membrane. This is why it is possible for spiders and small insects to
walk on water.
179
SUPERSTRINGS AND OTHER THINGS
Figure 9.13. A molecule in the interior of the water feels attractive
forces from all directions, whereas a molecule on the surface feels
forces that point to the interior of the liquid, since there are no molecules
above it. These molecules on the surface resist being pulled apart so that
the surface of the liquid acts as an elastic membrane, able to support
small objects.
Surface tension can also help us understand the action of a
detergent in cleaning. The surface tension of water is about
0.07 N/m. As shown in Table 9.2, water has a large surface
tension. When you add detergent to water, the surface tension
is reduced. The water then penetrates more easily into the
fabric, dissolving dirt.
We can think of surface tension as the energy per unit area of
the surface. In equilibrium the surface of a liquid would have the
lowest possible energy, which is why liquids minimize their
Table 9.2 Surface tension of some substances in contact
with air at room temperature.
Substance (N/m)
Surface tension (N/m)
Benzene
Mercury
Water
Soapy water
0.0289
0.465
0.0728
0.025
180
Fluids
surface area. The units of surface tension are thus units of energy
divided by units of area, or J/m2 . Since 1 J 1 Nm, we can see
that these units can be written as Nm=m2 N=m; that is, force
divided by length. These are the units for surface tension used
in Table 9.2. Thus, surface tension can also be de®ned in terms
of force per unit length rather than energy per unit area. Experimentally, surface tension is determined by measuring the force
required to pull a thin wire in the form of a ring out of the surface
of the liquid until the surface stretches to the point of breaking
(®gure 9.14).
Figure 9.14. Measuring the force required to stretch the surface of the
liquid to the point of breaking allows experimenters to determine the
surface tension of the liquid.
181
SUPERSTRINGS AND OTHER THINGS
Figure 9.15. The curved surface observed on the surface of a liquid in a
container is what we call a meniscus. The cohesive forces in water are
smaller than the adhesive forces between water and glass. This produces
a positive meniscus. The cohesive forces for mercury, on the other hand,
are larger than the adhesive forces between mercury and glass, and this
produces a negative meniscus.
If you look closely at the surface of the water in a glass, you
would notice that near the glass, the surface curves up and the
water seems to travel up the surface of the glass a small
amount (®gure 9.15). This is what we call a meniscus. The forces
of attraction acting between the molecules of a liquid are called
cohesive forces. In addition to these forces, the molecules also
experience adhesive forces that tend to bind the molecules to the
walls of the container. In the case of water and glass, the adhesive
forces are greater than the cohesive forces between the water
molecules so that the molecules of water close to the glass experience a larger force toward the molecules of glass than toward the
other water molecules and they are pulled up a small distance
away from the surface of the water. The cohesive forces in
mercury, on the other hand, are greater than the adhesive
forces between mercury and glass, and a mercury atom near
the surface of the glass experiences a larger force toward the
other mercury atoms, with the result that its surface curves downward, away from the glass. A meniscus that curves upward is
called positive and one that curves downward, like the surface
of mercury in a glass container, is called negative.
If we insert a narrow tube in a bowl with water (®gure 9.16),
the water will rise in the tube to a higher level than the water in
182
Fluids
Figure 9.16. A liquid rises in a narrow tube because of a phenomenon
known as capillarity.
the bowl. This phenomenon is known as capillarity, a word that
means ``hair-like.'' The water in the capillary tube will rise until
the adhesive forces pulling the liquid up are balanced by the
weight of the liquid in the tube. Therefore, the narrower the
tube, the higher the water will rise. If we dip the edge of a
cotton towel in water, the water will rise through the fabric
®bers because the adhesive forces between cotton and water are
larger than the cohesive forces in water. On the other hand,
water will not spread through the fabric of a wool sweater with
the edge dipped in water, because the adhesive forces between
water and wool are smaller than the cohesive forces in water.
Unlike cotton, wool ®bers are covered with tiny scales. When
wool is in contact with water, the ®ber's scales rub against each
other, pulling the ®bers together. The scaly surface of the ®bers
tends to repel liquids. For this reason, wool fabrics are better
for wet weather and cotton is a better material for towels.
Cotton is also more comfortable to wear in hot humid weather
because it absorbs excess perspiration through capillary action.
Capillarity is also responsible for dampness in basements, as
water travels through the narrow cavities in the cinder blocks
used to build the basement walls. To prevent this, the exterior
walls must be coated with some waterproo®ng compound.
Because of capillarity, soil can hold rain water in the narrow
spaces between tightly packed soil particles, and plants growing
in this soil can make use of this water in photosynthesis, the
183
SUPERSTRINGS AND OTHER THINGS
process by which green plants convert water and carbon dioxide
into oxygen and various organic compounds in the presence of
light.
Fluids in motion
When you put a nozzle on your garden hose, the water comes out
at a higher speed and reaches farther. The explanation for this
phenomenon is due to an eighteenth century Swiss mathematician named Daniel Bernoulli, although he did not set out to
explain the behavior of water in a garden hose. Bernoulli's principle
explains also why an airplane ¯ies and why you can throw a
curve with a baseball.
Suppose that you connect two different hoses and turn the
water on. Let's assume that the two hoses have different diameters and that the connection between them allows for a
smooth transition from one diameter to the other, as shown in
®gure 9.17. We have indicated the direction of the ¯ow of water
with lines called streamlines. Since liquids are incompressible,
the amount of ¯uid passing through area A on the left during a
certain interval of time t is exactly the same as the amount of
¯uid passing through area a on the right during the same time
t, and this means that the two volumes, VL and VR are the
same. Since the areas are different, the volume of water passing
through A moves through a length l of hose which is smaller
than the length L along which the volume of water through
area a must move. Therefore, the velocity of the water on the
right, moving along the longer path L, must be greater that that
Figure 9.17. Two hoses of different diameters connected in such a way
as to have a smooth transition from one diameter to the other.
184
Fluids
on the left, which moves along the shorter path l, in order for the
same amount of water to pass through both the narrow and the
wide hoses in the same time. We can now understand why a
nozzle on a garden hose shoots the water farther. The nozzle
has a cross sectional area that is smaller than the cross sectional
area of the hose, and the speed of the water through the nozzle
is greater.
If the velocity of the water is greater on the right, through the
narrow section, than on the left, through the wide section of hose,
work has to be done on the ¯uid to account for the increase in
kinetic energy from left to right. This work is due to the forces
exerted by the ¯uid on the cross sectional areas in both sections.
From the work-energy theorem, the net work is equal to the
change in kinetic energy. Since the kinetic energy increases as
the water ¯ows from left to right, the work on the left, narrow
section must be greater than that on the right, wide section.
This implies that the pressure on the ¯uid at the left, wide section
is greater than the pressure on the ¯uid on the right, narrow
section.
Bernoulli's Principle tells us that the pressure is greater when the
velocity of the ¯uid is less, and, conversely, that the pressure is
less when the velocity of the ¯uid is greater.
Armed with this beautiful discovery, we can understand
what keeps an airplane in the air. Figure 9.18 shows a cross
section of an airplane wing. The top surface of the wing is
curved and the lower surface is ¯at so that the air rushing by
the upper surface has a longer distance to cover than the air
Figure 9.18. Cross section of an airplane wing. The air passing by the
upper surface has a longer distance to cover, so it must move at a
higher speed than the air passing by the lower surface. The pressure
on the underside is greater, where the velocity is smaller, thus providing
a net upward force on the wing.
185
SUPERSTRINGS AND OTHER THINGS
Figure 9.19. The pressure from the bottom, where the air is still is larger
than the pressure from above, where the air is moving as you blow. This
net pressure upwards lifts the paper.
passing by the lower surface. Therefore the velocity of the air
above the wing is greater than the velocity of the air below the
wing. Bernoulli's principle tells us that the pressure is greatest
where the velocity is least, so that the pressure on the wing
from below is greater than the pressure on the wing from above.
The pressure difference provides a net upward force on the wing.
It is not very dif®cult for you to experience Bernoulli's principle ®rst hand. If you pick up a sheet of paper and blow across
the upper surface, as shown in ®gure 9.19, the velocity of the
air above will be greater than the velocity of the air below,
which will be near zero. The larger pressure from below pushes
the paper up.
The human cardiovascular system
The human cardiovascular system is a closed tubular system
in which a ¯uid, the blood, ¯ows through arteries, veins and
186
Fluids
Physics in our world: Curve balls
To throw a curve ball, a baseball pitcher gives a spin to the
ball. The spinning ball drags some of the air around in the
direction of the spin. The speed of air on one side of the ball
becomes greater than on the other side. According to Bernoulli's principle, the side of the ball where the air moves faster
has a lower pressure than the other side. The difference in
pressure causes the ball to curve.
We can also understand the behavior of the curve ball
by studying the way the air ¯ows relative to the ball.
When a ball is thrown through the air at high speed, the
air behind the ball moves in an irregular, turbulent way.
The streamlines curve symmetrically around the ball.
When the ball is spinning, the air is dragged around, causing
the region of turbulence to move slightly in the direction of
the spin. The asymmetry of the region of turbulence produces a crowding of the streamlines on one side. The pressure is then lower where the streamlines are crowded.
This pressure difference produces a net force that de¯ects
the ball.
187
SUPERSTRINGS AND OTHER THINGS
(a) As seen from above, the air moves faster on one side (where the
streamlines are closer together) and the pressure is lower there. The
ball moves in the direction of the net force. (b) The ball spinning
counterclockwise as seen from above curves to the left of the pitcher.
(c) A nonspinning ball has symmetric streamlines and the area of
turbulence is behind it. (d) In a spinning ball, the area of turbulence
is dragged to one side. The ball curves in the direction of the net
force exerted by the air.
Bastard wing
An airplane ¯ying at very low speeds generates little lift and
can stall, since the difference in pressures above and below the
wing comes from the difference in wind velocity above and
below. Birds, on the other hand, usually manage to ¯y at
such low speeds. Their secret lies in a special structure
called a bastard wing or alula. This structure consists of a
few feathers attached to the ®rst digit or thumb of the wing.
When birds slow down at takeoff or when landing, they tilt
the wing and use the alula to separate its feathers from the
188
Fluids
rest of the wing, creating a slot that prevents the air from
breaking away from the upper surface and distributes more
air ¯ow to the top of the wing. The improved difference
between the air velocities above and below the wing increases
the pressure difference enough to balance the bird's weight.
capillaries to and from all parts of the body. The total length of the
vessels in the human body through which blood is transported
reaches many miles.
The heart is the muscular pump that propels the blood into
and out of the vessels. An average adult human heart is about
13 by 9 by 6 cm and weighs about 300 g. During a lifetime of 70
years, by beating some 3 billion times, the heart pumps nearly
250 million liters of blood, which would ®ll a football stadium
almost knee deep. The heart consists of four chambers ± two
atria, left and right; and two ventricles, left and right (®gure
9.20). The atria are the receiving chambers for blood from the
body which is pumped into the ventricles; and the ventricles
pump blood into the lungs and to the rest of the body. Four
valves control the ¯ow of blood in the heart: the tricuspid valve,
the mitral valve, the pulmonic valve, and the aortic valve.
Blood coming from the head through a large vein or superior
vena cava and from the arms, liver, body muscles, kidney and legs
through a second large vein, the inferior vena cava, enters the right
atrium, located in the right upper part of the heart. A third vein,
the coronary sinus, drains blood from the heart itself into the right
atrium. All this blood, which has been depleted of oxygen, is
pumped into the right ventricle through the tricuspid valve. As
the right ventricle contracts, blood is carried into the pulmonary
189
SUPERSTRINGS AND OTHER THINGS
Figure 9.20. The human heart consists of four chambers ± left and right
atria and left and right ventricles.
artery to the lungs where the blood is oxygenated. The pulmonary valve prevents this blood from reentering the heart.
The slightly smaller left atrium receives oxygenated blood
from the lungs through four pulmonary veins and pumps it
into the thick-walled left ventricle through the mitral valve. At
the same time that the right ventricle pumps blood into the
pulmonary artery, the left ventricle contracts and forces blood
out through the aorta into the arteries of the body.
The system that assures the pumping of the blood in and out
of the heart at the right times is controled by a special structure
called the sino-auricular node, which acts as the pacemaker of the
heart. The heart beats some 70 to 80 times per minute, contracting
and expanding. During the period of relaxation of the heart or
diastole, the pressure is typically 80 mmHg (mercury), while at
the peak of its cycle, during the contraction period or systole,
the pressure is about 120 mmHg.
This blood pressure can be measured with a sphygmomanometer, a device that uses a U-tube of mercury to read the pressure
190
Fluids
Figure 9.21.
A sphygmomanometer used to measure blood pressure.
during the relaxation and contraction periods of the heart (®gure
9.21). As we saw earlier, pressure in a liquid increases with depth.
When a person is lying down, the pressure throughout the major
arteries of the body does not change more than 2 mmHg, this
small change being due to some resistance to the ¯ow of blood
in the arteries. Standing up, however, the difference in pressure
between the heart and the foot is about 100 mmHg. Because the
upper arm is at about the same level as the heart, the pressure
there is close to the pressure at the heart. For this reason, the
sphygmomanometer measures the pressure with an in¯atable
cuff wrapped around the upper arm. A column of mercury or a
spring scale is connected by rubber tubing to the cuff and a
stethoscope is used to listen for noises in the artery. The cuff is
in¯ated until the ¯ow of blood through the brachial artery is
stopped by the pressure in the cuff. Air is gradually allowed to
escape from the cuff until the pressure in the cuff matches the
peak pressure in the heart (systolic). At this point, blood begins
to ¯ow through the artery only during the systolic pressure
part of the cycle. Since the brachial artery is partially opened,
the area is small and the equation of continuity tells us that the
191
SUPERSTRINGS AND OTHER THINGS
velocity should be high. This makes the ¯ow of blood noisy and
easy to pick up with the stethoscope. As the cuff continues de¯ating, the pressure in the sack becomes lower than the systolic
pressure but still higher than the diastolic pressure. Blood still
does not ¯ow during the diastolic pressure part of the cycle.
When the pressure in the cuff matches the diastolic pressure,
the artery remains open during the entire cycle. By reading the
values of the cuff pressure at the two points, when the pulse is
®rst heard and when the sound of blood through the artery is
continuous, the systolic and diastolic pressures can be measured.
The normal systolic pressure of a healthy adult varies from about
110 to about 140 mmHg and the diastolic from 60 to 90 mmHg.
192
10
HEAT AND
TEMPERATURE
Heat as a form of energy
When you push a book that is lying on a polished table, part of the
work that you do on the book is converted into kinetic energy and
the book begins to move with a certain velocity (®gure 10.1). After
you release it, however, the book does not continue moving with
the same velocity; it slows down and eventually stops. What
happens to the initial kinetic energy of the book? Since there is
no change in the potential energy, as the book remains on the
table, the mechanical energy is not conserved. As we pointed
out in chapter 5, friction has taken away some of the energy.
Although it might be dif®cult to verify without sophisticated
equipment, the book and the table are warmer than before.
From our experience, however, we know that friction causes
objects to become warmer. When the weather is cold, for
example, we rub our hands together to warm them up; the
wheels of a motorcycle become hot during braking due to rubbing with the brake pads; and the tires get warmer due to friction
(®gure 10.2).
Heat, then, seems to be a form of energy. This was not known
until the middle of the nineteenth century. Before then, heat was
thought to be a ¯uid that was transmitted from hot bodies to cold
bodies. This caloric ¯uid (as the French chemist Lavoisier called it)
was believed to be a conserved quantity, since it had been
observed that when equal parts of hot water and cold water
were mixed, the result was warm water at a temperature that
was exactly the mean of the initial temperatures. The caloric
¯uid was thought to have been transmitted from the hot water
to the cold water without any losses.
195
SUPERSTRINGS AND OTHER THINGS
Figure 10.1. If we push a book lying on a table, the book slows down
and stops. What happens to the kinetic energy of the book?
Figure 10.2. Friction between the wheels and the brake pads makes the
wheels hot. The tires also get warm after the motorcycle has been driven
due to friction with the pavement.
196
Heat and Temperature
The problem with the idea of the caloric was that it could not
explain how heat was produced by friction. The correct interpretation of heat began with the experiments of an American-born
scientist who took up British citizenship, held a German title of
nobility, and led a strange and colorful life. Benjamin Thompson,
Count Rumford, was born in Woburn, Massachusetts, in 1753, a
couple of miles from the birthplace ± some 50 years before ± of
his namesake Benjamin Franklin.
While Count Rumford was supervising the boring of a
cannon in Munich in 1798 he noticed that the iron became hot
enough to glow and had to be cooled with water. Rumford
decided to study this phenomenon and performed many experiments in which he immersed the hot metal in water to measure
the rate at which the temperature rose. He was able to determine
that the amount of heat generated by friction while the boring
tool drilled the iron was large enough to melt the metal had it
not been cooled. He realized that the amount of heat that could
be generated was not only not constant, as the caloric theory
Pioneers of physics: Count Rumford
The son of a farmer, Benjamin Thompson started as an
apprentice to a storekeeper. At the age of nineteen he married
a wealthy and older widow from the town of Rumford (the
present Concord), New Hampshire. When the Revolutionary
War broke out, Thompson's sympathies were with the
English Crown and he served the King by spying on his
countrymen.
When the British left Boston, Thompson was forced to
¯ee to London, leaving his wife and daughter behind. After
the war ended, he remained in England and was knighted
by King George III, but was later accused of being a spy for
the French and of accepting bribes. In 1783 the king allowed
him to enter the Bavarian civil service and Thompson left for
Germany where he worked in the court of Elector Karl Theodor of Bavaria as an administrator. He served the Elector well
and was rewarded with the title of Count of the Holy Roman
Empire in 1790. Thompson chose the name of his wife's
hometown for his title and became Count von Rumford.
197
SUPERSTRINGS AND OTHER THINGS
proposed, but unlimited as long as work was done. In his report
to the Royal Society entitled ``An Inquiry Concerning the Source
of Heat Which is Excited By Friction,'' Rumford wrote:
And, in reasoning on this subject, we must not forget to consider that
most remarkable circumstance, that the source of the Heat generated
by friction, in these experiments, appeared evidently to be inexhaustible . . . [I]t appears to me to be extremely dif®cult, if not quite
impossible, to form any distinct idea of anything capable of being
excited and communicated in the manner the Heat was excited and
communicated in these experiments, except it be MOTION.
Rumford's brilliant ± and correct ± conclusion was that the
motion of the drill was being converted into heat and that heat
was a form of motion. His calculation of the ratio of heat to
work was very close to the accepted values today.
In spite of Rumford's insights, the caloric theory continued as
the leading theory for some forty years. Starting in the late 1830s,
James Prescott Joule repeated some of Rumford's experiments
many times, enhancing and re®ning them. In one of his experiments, he measured the heat generated by an electric current
and compared it with the mechanical energy required to run the
simple electric generator that produced the current. In another
experiment he measured the heat generated when water was
forced through pipes and compared it with the work required to
maintain the ¯ow of water through the pipes. He also designed
experiments that compared the work required to compress a gas
contained in a bottle, which was in turn immersed in water,
with the amount of heat gained by the water.
Joule's most famous experiment, however, involved the
design of an apparatus in which a brass paddle wheel immersed
in water was turned by the descension of weights suspended by
string from pulleys (®gure 10.3). The potential energy lost by the
weights as they fell was compared with the heat gained by the
water.
In his 1890 report to the Royal Society entitled ``On the
Mechanical Equivalent of Heat,'' Joule concluded from this
experiment ``[t]hat the quantity of heat produced by the friction
of bodies, whether solid or liquid, is always proportional to the
quantity of [energy] expended.'' Heat, then, as Joule was able to
establish, is a form of energy, contrary to the caloric theory.
198
Heat and Temperature
Figure 10.3.
heat.
Joule's apparatus to measure the mechanical equivalent of
Measuring temperature
Temperature is a familiar concept to us as a measure of the hotness or coldness of an object and for this reason we have used
the concept in the previous section. The device with which we
measure temperature, the thermometer, is also familiar to us.
A thermometer measures variations of some physical property that changes with temperature, such as the volume of a
liquid or a gas. The idea of representing hot and cold with numbers dates from antiquity. In the second century AD the Greek
physician Galen proposed a temperature scale based on ice and
boiling water. Arab and Latin physicians actually developed a
numeric scale from 0 to 4 to represent coldness and hotness but
lacked instruments to measure and relied only on their senses.
In the seventeenth century, Galileo invented a thermometer that
consisted of a glass tube ®lled with air connected to a thinner
tube marked with divisions and immersed in a vessel of colored
water. When it was cold, the air in the tube contracted and the
199
SUPERSTRINGS AND OTHER THINGS
colored water rose in the glass tube. When it was warm, the air
expanded and the water moved down in the thin tube. Galileo's
thermometer was not very accurate, however, because it did not
take into account changes in atmospheric pressure.
In 1657, some of Galileo's disciples improved upon this
design and developed thermometers in sealed tubes to avoid
changes due to the atmospheric pressure. These thermometers
were based on the expansion and contraction of alcohol with
changes in temperature and became popular throughout
Europe. Alcohol, however, boiled at a low temperature, so high
temperatures could not be measured.
In 1714 the German±Dutch physicist Gabriel Fahrenheit, who
had emigrated to Amsterdam after his parents died in Germany,
decided to substitute mercury for alcohol and for the ®rst time
extreme temperatures, well above the boiling point of water and
below the freezing point of water, could be measured. Thirteen
years before, Newton had suggested that the temperature scale
should be zero for the freezing point of water and that of the
human body, 12. Fahrenheit, however, did not want to use negative numbers for the temperatures of the cold winter days and
added salt to the water to obtain the lowest freezing point that
he was capable of in his laboratory and called it the zero temperature of his scale. Instead of dividing the difference between this
freezing point and the temperature of the human body in twelve
equal parts as Newton had suggested, he decided to divide this
interval into eight times twelve, or ninety-six, to obtain a larger
number of divisions in his scale. In 1724, Fahrenheit adjusted
this scale so that the boiling point of water would come out to
be exactly 180 degrees above the freezing point of water alone,
which was 32 in his scale. With this correction, the human body
temperature was found to be 98.48 and the boiling point of
water, 2128.
The Swedish astronomer Anders Celsius proposed in 1742 a
new scale in which the freezing point of water was 1008 and the
boiling point of water at sea level, 08. The following year, he
reversed his scale, making the freezing point 08 and the boiling
point 1008. Since there were 100 degrees between the freezing
and boiling temperatures for water, the scale was known for
many years as the centigrade or ``hundred step'' scale. Today,
we call it the Celsius scale.
200
Heat and Temperature
On the Fahrenheit scale there are 180 degrees between the
freezing and boiling points of water whereas in the Celsius
scale there are 100. Thus, each Celsius degree is larger than each
Fahrenheit degree by a factor of 180=100 or 9=5. Since the freezing
point of water is 328 in the Fahrenheit scale and 08 in the Celsius
scale, we can convert any Celsius temperature to its corresponding value in Fahrenheit by multiplying the Celsius temperature
by the factor 9=5 and adding 32:
TF 95 TC 32:
The Fahrenheit to Celsius conversion can be obtained from this
expression by simply solving for TC :
TC 59 TF ÿ 32:
In 1848 the English physicist William Thomson, later Lord
Kelvin, devised what is now known as the Kelvin scale, Kelvin
was able to show that if a gas was cooled down to 2738 below
08C, the random motion of its molecules would be minimum.
Therefore, there could be no temperature below this value. He
then proposed a scale that would start at this absolute zero and
would increase in intervals that were the same as the Celsius
intervals.
The Kelvin scale differs from the Celsius scale only in the
choice of zero temperature. To convert from Celsius degrees to
kelvins, as the intervals in the absolute scale are known, we add
273.15 (a more precise value). Using T for the temperature in
kelvins we have
T TC 273:15:
In ®gure 10.4 we compare the three different scales of temperature that we have considered here. Notice that the intervals
between any two temperatures are the same only in the Kelvin
and Celsius scales.
Temperature and heat
What exactly does temperature measure? When we measure the
temperature of a body, we are actually obtaining information
about the average kinetic energy of the atoms and molecules
201
SUPERSTRINGS AND OTHER THINGS
Figure 10.4. A comparison of the Kelvin, Celsius and Fahrenheit
temperature scales.
that make up the body. When the body is warm, there is more
molecular motion, and temperature is the measure of the kinetic
energy of this motion. Temperature is directly related to the
average kinetic energy of the atoms and molecules of a body. At
higher temperatures, the kinetic energy is greater and this infers
that the average molecular speed is greater.
As Joule and Rumford established, heat is a form of energy.
What is then the difference between heat and temperature? To
answer this question, we need to consider the behavior of the
atoms and molecules of a substance. These, as we have seen,
are in constant motion. In gases and liquids, they move about
at random. In a gas composed of single atoms such as helium,
in addition to this kinetic energy due to random motion, the
atoms possess kinetic energy due to the spinning of the atoms
themselves. However, this rotational kinetic energy is very small
compared with the translational kinetic energy.
In gases like nitrogen and oxygen, composed of diatomic molecules, that is, molecules formed by two atoms bound together
(®gure 10.5), the molecules can also rotate. The moment of inertia
of these diatomic molecules is much larger, however, since the
masses of the individual atoms in each molecule are farther
202
Heat and Temperature
Figure 10.5. An oxygen molecule consists of two oxygen atoms bound
together by spring-like forces. The molecule can rotate around an axis
perpendicular to the line joining the atoms. The atoms can also vibrate
back and forth.
apart from the axis of rotation, and the kinetic energy due to this
rotation is greater. The two nitrogen atoms in the nitrogen molecule can also vibrate back and forth, and this vibrational motion
also contributes to the total kinetic energy of the molecule.
In solids, the situation is more complicated since the molecules, ions or atoms that make the solid are not free to drift
around. These particles have no random translational kinetic
energy. There are, however, vibrational kinetic and potential
energies, as these particles vibrate back and forth around more
or less ®xed positions.
The sum of all the random energies of the atoms and molecules in a substance constitutes the thermal energy of that substance. The thermal energy transferred from a warm object to a
cooler object as a result of the temperature difference between
the two is what we call heat energy. Heat, then, is the ¯ow of
thermal energy, and for this reason, it would not make sense to
speak of the amount of heat that a substance has. We can, however, speak of the amount of thermal energy in a substance and
the amount of heat transferred from one substance to another.
On the other hand, the temperature of a gas or a liquid
depends only on the random translational kinetic energy per
203
SUPERSTRINGS AND OTHER THINGS
Physics in our world: Thermography
As we shall see in chapter 23, all objects emit radiation. This
radiation can be used to determine temperature. At low to
intermediate temperatures most of the radiation is in the
infrared region. When the temperature increases, the intensity
of the radiation emitted increases considerably. An infrared
camera can detect this radiation, producing electronic signals
that can be displayed on a television monitor or photographed
on special ®lm. The image produced is called a thermogram,
and the technique thermography (see color plate).
The intensity of the infrared radiation emitted by the
human body depends on several factors, including state of
health. Certain regions of the body have a larger blood
supply than others and therefore emit more radiation. However bone tumors, bone infections such as osteomyelitis,
arthritic conditions, and muscle and tendon diseases resulting from in¯ammation emit more radiation than normal.
On the other hand, areas where blood ¯ow is reduced emit
less radiation than normal, permitting the detection of
blockages of the arteries and disorders where blood circulation is below normal, such as atherosclerosis. Thermography
has shown itself to be a very important technique for the
detection and diagnosis of these and other disorders. A
typical thermogram is shown in the color plate.
Thermography, however, is no longer recommended by
the American College of Radiology for detecting breast
cancer, because benign cysts also produce increased radiation. The x-ray process called mammography remains the
recommended procedure for early detection of breast
cancer, although other techniques for screening are currently
being explored.
molecule. (In solids, since there is no translational kinetic energy,
the temperature depends on the vibrational energy). The temperature of a substance, then, does not depend on the number of
atoms and molecules in the substance nor on the kinds of atoms
that form the substance. If you ®ll a bowl with water from a
swimming pool, the number of water molecules in the pool is,
204
Heat and Temperature
of course, much greater than in the bowl, and the total sum of the
translational, vibrational, and rotational kinetic energies of the
water molecules in the pool is much greater than the corresponding energies of the water in the bowl. Therefore, the thermal
energy of the water in the swimming pool is much greater than
the thermal energy of the water in the bowl. However, the temperatures of the water in the swimming pool and in the bowl
are the same, since we ®lled the bowl with water from the pool
and temperature is the measure of the average kinetic energy
per molecule.
Since heat is a form of energy, the SI unit is the joule. During
the times of the caloric theory, a unit of heat called the calorie (cal),
was introduced. The calorie, still in use today, was de®ned as the
amount of heat required to raise the temperature of 1 gram of
water by 18C. We de®ne the calorie today in terms of joules:
1 cal 4:186 J:
(The nutritionists' calorie is actually 1000 calories or a kilocalorie,
as the unit is de®ned in physics. Because of its widespread use
regarding the energy value of foods, to avoid confusion it is sometimes distinguished by writing it with a capital letter C as Calorie,
Cal.)
Heat capacity
What happens when we heat a substance? When we heat up some
water to prepare ourselves a cup of instant coffee, where does the
thermal energy transferred to the water ± the heat ± go? When we
place a TV dinner in the oven and heat it, why is the skin of the
potato warm but the inside usually very hot? Some foods seem
to store more thermal energy than others when placed together
in the oven.
The capacities for storage of thermal energy for different
substances are different. When heat ¯ows into a substance, this
thermal energy goes into the different forms of kinetic energy;
translational, rotational or vibrational. If the substance is a
monatomic gas like helium, most of the energy goes into translational kinetic energy. For diatomic gases like nitrogen and
oxygen, the energy is shared among the different forms of kinetic
205
SUPERSTRINGS AND OTHER THINGS
energy. If the substance is a liquid or a solid, the interactions
between the molecules give rise to other ways for this thermal
energy to go. Only the increase in the translational kinetic
energy, however, gives rise to an increase in temperature.
Adding the same amount of heat to two different substances,
such as water for your coffee and soup for your dinner, produces
different increases in the temperatures of the two substances, as
the amount of heat that goes into translational kinetic energy is
different for the two substances.
It takes longer to warm up water for two cups of coffee than
for just one cup. The reason is, of course, that heating two cups of
water requires more heat than heating just one. A cup of water
has a volume of 250 ml and a mass of 250 g. Since one calorie of
heat is required to raise the temperature of water 18C, it takes
250 calories for every degree Celsius that we want to raise the
temperature of one cup of water, and 500 calories for two cups.
The greater the mass, the greater the amount of heat required
to raise the temperature of a substance.
Thus the heat required to warm up a substance is proportional both to the mass of the substance and to the change in temperature. The heat required to increase the temperature of a mass
m of a substance an amount T is called the heat capacity, C, of the
substance. A more useful quantity, since it has the same value
regardless of the mass of the substance, is the speci®c heat capacity,
c, which is the heat capacity per unit mass of a substance. The
speci®c heat capacity, then, is the heat required to raise the temperature of a unit mass of a substance by one degree. If Q is the amount of
heat required, m the mass of the substance, and T the change in
temperature, the speci®c heat capacity is
c
Q
:
mT
The SI unit of speci®c heat capacity is the joule per kilogram per
Kelvin (J/kgK), but kcal/kg 8C are also commonly used. The
speci®c heat capacities of several substances are listed in Table
10.1. Notice that the speci®c heat capacity of water is exactly
1.000 kcal/kg8C. This is because the calorie was de®ned as the
heat required to raise the temperature of 1 g of water by 18C. To
raise the temperature of one kilogram of water by 18C requires
one kilocalorie.
206
Heat and Temperature
Table 10.1 Speci®c heat capacities of some substances
Substance
Water
Human body
Ice
Steam (1008C)
Wood
Aluminum
Speci®c heat
Substance
(kcal/kg 8C)
J/kg K
1.000
0.83
0.50
0.46
0.42
0.21
4184
3470
2090
1925
1750
880
Glass
Steel
Copper
Mercury
Gold
Lead
Speci®c heat
(kcal/kg 8C)
J/kg K
0.15
0.107
0.093
0.033
0.032
0.031
630
490
390
138
134
130
Heat of fusion and heat of vaporization
When heat is added to a substance, its temperature usually
increases. There are, however, some special situations where
the temperature stays constant as heat is added. If you add heat
to an ice cube in a closed container at 08C and monitor its temperature, you would notice that the mixture of ice and water
resulting from the melting of the ice remains at 08C until all the
ice has melted. It seems as if the water-ice mixture is able to
absorb the heat without any change in temperature. The ice, however, is melting; that is, it is changing its phase from solid to liquid
and this requires energy.
A similar phenomenon takes place when water changes into
steam at 1008C. If you start with tap water at 208C and add heat by
placing the container with water on a stove, for example, the
temperature of the water will steadily increase until it reaches
1008C. The temperature will then remain at 1008C until all the
water has turned into steam. The temperature increase will
resume after that.
The change of a liquid into solid is called fusion, and the
change of a liquid into its gas phase is called vaporization. The
amount of heat required for a substance to change phase depends
on the type of substance, its mass, and whether the phase change is
from solid to liquid or from liquid to gas. For water, 80 kcal of heat
are needed to melt one kilogram of ice. Ice, like most solids, is a
crystal and, as we saw at the beginning of chapter 9, the molecules
in a crystal occupy ®xed positions and are held in those positions
by strong intermolecular forces. When the ice melts, work must be
207
SUPERSTRINGS AND OTHER THINGS
done against these forces which means that energy must be
supplied to the solid. As more heat energy is supplied, the
water molecules are able to break loose from their neighbors
and the crystal structure is broken. The molecules that formed
the crystal and occupied ®xed positions while in the solid phase
are now free to move about; the water is now in the liquid phase
(®gure 10.6). Since any other substance would have a different
molecular structure and different binding forces between its molecules, the energy required for the molecules of this substance to
break loose from their neighbors is different.
Figure 10.6. The heat that is added to the ice goes into increasing the
potential energy of the water molecules in the crystal structure. When
these molecules gain enough energy, they break loose from their neighbors and are free to move. This is the liquid phase of water.
208
Heat and Temperature
The heat energy that is supplied to the solid goes into increasing the potential energy of the molecules of the solid. Since temperature is a measure of the random kinetic energies of the
molecules, the temperature remains ®xed while the solid melts.
Although the molecules of a liquid are free of the strong binding forces that keep them in more or less ®xed positions while in
the solid phase, they are still bound by weaker intermolecular
forces that allow them to move around. These weaker bonds
must also be broken when the liquid vaporizes. In the gaseous
phase, however, the molecules are separated from each other by
considerable distances and this separation requires energy. In
the transition from solid to liquid, the molecules remain more or
less at the same distances and it is only the breaking of the crystal
structure that requires energy. The heat required to vaporize a
substance in the liquid phase is therefore larger than that required
to melt it. In the case of water, for example, the heat needed to
vaporize one kilogram of water is 540 kcal, almost seven times
the heat required to melt one kilogram of ice.
When a gas condenses into its liquid phase, energy is
released. This amount of energy is the same amount that was
required to vaporize the liquid. Water, then, releases 540 kcal
for every kilogram of steam that condenses. Similarly, when 1
kilogram of liquid water solidi®es, 80 kcal of energy are released.
The heat absorbed or released by one kilogram of a substance
during a phase transition is called the latent heat, L. If the transition is from solid to liquid or vice versa, this latent heat is
called latent heat of fusion, Lf . If the transition involves the liquid
and gas phases of a substance, it is called the latent heat of vaporization, Lv . Thus, for water,
Lf 80 kcal=kg 335 kJ=kg
Lv 540 kcal=kg 2259 kJ=kg:
Since 80 kcal of heat are needed to melt 1 kg of ice, 160 kcal
would be needed to melt 2 kg, and 800 kcal to melt 10 kg. If we
have m kg of ice, we would need 80 m kcal of heat to melt them:
Heat to melt a mass m:
Q mLf :
Similarly, to vaporize a mass m of a liquid requires an amount of
heat Q mLv .
Heat to vaporize a mass m:
209
Q mLv :
SUPERSTRINGS AND OTHER THINGS
Evaporation and boiling
When a glass of water is left overnight on the kitchen counter, for
example, we notice that the level of the water in the glass drops a
certain amount. Some of the water has evaporated. This happens
because some of the molecules of water have escaped and became
gaseous molecules. As we have seen, the molecules of a liquid are
in continuous random motion and possess random kinetic energies. Some molecules, then, would move with greater velocities
than the average and if they happen to be near the surface of
the liquid, moving in an upward direction, they might have
enough energy to escape. This escape is what we call evaporation.
Since the molecules that escape are, on average, the ones with
greater kinetic energies, the average value of the kinetic energy
of the molecules that remain in the liquid is reduced. Because
temperature is the measure of the average kinetic energy per
molecule of the liquid, the temperature of the liquid drops as
the liquid evaporates (®gure 10.7).
Evaporation, then, is a cooling process. This explains why
you feel cool as soon as you come out of a swimming pool; as
Figure 10.7. As the fastest molecules leave the surface of the water, the
temperature of the remaining liquid drops.
210
Heat and Temperature
the water on your skin evaporates, the remaining water, which is
in contact with your skin, is left at a lower temperature. You may
also have noticed that when a nurse rubs alcohol on your skin, it
feels cool. This is because alcohol evaporates very rapidly.
Another liquid that evaporates fast is gasoline, and if you ever
accidentally spilled some of it on your hand while pumping gas
at a self-service gas station, you might have noticed that it also
leaves your skin cooler. And the reason why a fan offers some
relief on a hot summer day is because it blows air around your
body, speeding the evaporation of perspiration.
When the temperature of the liquid is high enough, close to
the boiling point, evaporation is increased substantially, since a
greater number of molecules possess enough energy to escape
from the liquid. At these high temperatures, evaporation can
also take place inside the liquid, away from the surface, where
bubbles of gas form. When these bubbles of gas begin to appear
in the interior of the liquid, boiling starts.
For the bubbles of gas to form, the molecules inside the
bubble must have kinetic energies large enough for the pressure
from inside the bubble to match the pressure from the liquid on
the bubble. The pressure that the liquid exerts on the bubble is
the atmospheric pressure plus the pressure due to the liquid at
that depth. In most cases the pressure due to the liquid is very
small compared to the atmospheric pressure and we can say
that boiling begins when the pressure of the gas is equal to the
atmospheric pressure (®gure 10.8). Because the gas in the bubbles
is less dense than the liquid surrounding them, they are buoyed
to the surface of the liquid.
As we have seen, water boils at 1008C. This, however, is only
true when the atmospheric pressure is 1 atm. At different atmospheric pressures, the boiling point of water ± or of any other
liquid ± changes. Since boiling begins when the pressure of the
gas in the bubbles that form in the interior of the liquid is equal
to the atmospheric pressure, the molecules of the liquid require
a greater kinetic energy to be able to form bubbles that can sustain
an increased atmospheric pressure. Water, then, boils at a higher
temperature when the atmospheric pressure increases, and conversely, its boiling point decreases when the atmospheric pressure decreases. If you go camping in the mountains, water boils
at a temperature lower than 1008C; since the rate at which
211
SUPERSTRINGS AND OTHER THINGS
Figure 10.8. When the temperature of the liquid is high enough (1008C
for water), evaporation takes place even inside the liquid and bubbles
begin to appear. When the pressure of the gas in the bubbles equals
the atmospheric pressure, boiling begins.
Figure 10.9. A pressure cooker con®nes the steam until the pressure
under the sealed lid reaches 2 atm. Foods cook faster because of the
higher temperatures.
212
Heat and Temperature
Physics in our world: Instant ice cream
The traditional method for making ice cream involves churning a mixture of milk, eggs, sugar, and ¯avorings while it is
chilled slowly. The churning process prevents the formation
of large crystals, thus producing a smooth texture, and also
traps air inside the mixture, which accounts for the ¯uf®ness
and lightness of good ice cream.
Physicist Peter Barham of the University of Bristol in the
United Kingdom has developed a technique to make ice
cream in two minutes. Instead of chilling the mixture
slowly, as in the traditional method, Barham chills it instantaneously. He pours liquid nitrogen on the mixture. Since
nitrogen lique®es at temperatures below ÿ1968 C. the mixture cools so fast that very few crystals develop. When the
liquid nitrogen comes into contact with the much warmer
mixture, it begins to boil, creating bubbles in the mixture.
foods cook in boiling water depends on the temperature of the
water, it takes longer to boil an egg. In Cocoa Beach, Florida,
water boils at 1008C and it takes about 10 minutes after the
water reaches the boiling point to hard-boil an egg. On the highest mountain in the Eastern United States, Mt. Mitchell, North
Carolina, elevation 6600 feet, water boils at 938C; it takes almost
twice as long to hard-boil an egg there.
A pressure cooker con®nes the steam under the sealed lid
until the pressure is nearly 2 atmospheres (®gure 10.9). At this
pressure, water boils at 1208C and foods cook faster at this
higher temperature.
Humidity
Due to the evaporation of water from lakes, ponds, rivers, and the
sea, the air contains some water vapor. If we place a lid on a pan
with water so that the evaporated gaseous water molecules
cannot escape (®gure 10.10), an equilibrium state will eventually
be reached where the number of molecules leaving the surface
213
SUPERSTRINGS AND OTHER THINGS
Figure 10.10. The number of molecules leaving the surface of the water
is equal to the number of molecules reentering the liquid. The air under
the lid is said to be saturated.
of the water equals the number of water molecules that return to
it after bouncing off the lid and walls of the pan. When this happens, the air above the water becomes saturated with vapor, in
other words, the air holds the maximum amount of water
vapor that it can at that temperature.
The amount of water vapor that the air can hold depends on
the temperature of both the water and the air. As we discussed in
the previous section, an increase in the water temperature results
in greater evaporation, and this results in more water vapor in the
air. If the air temperature increases, the average kinetic energy of
the air molecules increases, and this means that more collisions
among the different molecules present in air take place. The
water molecules in air collide with greater speeds and have less
likelihood of sticking to each other, forming small water droplets
that can ``rain'' down on the water surface. Thus, the amount of
water vapor required for saturation increases with temperature.
Humidity is a measure of the amount of water vapor present
in the air at any given time. Absolute humidity, AH, is the actual
amount of water vapor that the air contains; that is, the total
mass of water vapor in the air per unit volume, generally given
214
Heat and Temperature
Table 10.2 Humidity at saturation for different air temperatures
Temperature
(8C)
Water vapor/m3 air
(g/m3 )
Temperature
(8C)
Water vapor/m3 air
(g/m3 )
ÿ8
ÿ4
0
4
8
12
2.74
3.66
4.84
6.33
8.21
10.57
16
20
24
28
32
36
13.50
17.12
21.54
26.93
33.45
41.82
in g/m3 . Weather forecasters, however, give humidity in percentages; this is actually a relative humidity, RH, which is the ratio of
the mass of water vapor per unit volume of air (the absolute
humidity) to the mass per unit volume of water vapor required
to saturate the air. In Table 10.2, we list the values of this humidity
at saturation, HS, for different temperatures. Therefore, if we
know the absolute humidity at any given time, the relative
humidity can be calculated as follows:
RH
AH
:
HS
On a nice spring day, the temperature might reach 208C
during the day. If the relative humidity is 50% (or 0.50), and the
absolute humidity remains more or less the same throughout
the evening, the air will become saturated just before the temperature drops to 88C. At this temperature, drops begin to condense
from the vapor. If the temperature is above the freezing point of
water, 08C, dew forms; if it is below 08C, frost forms. We call the
temperature at which the air starts to saturate the dew point.
Thermal expansion
Most objects expand when the temperature is increased and contract when the temperature is decreased. As we saw in an earlier
section, some thermometers are based on this phenomenon. Concrete highways and bridges must be built with gaps to take into
account the expansion when temperatures rise. Large bridges
may have expansion joints as wide as 40 to 60 cm to allow for
expansion.
215
SUPERSTRINGS AND OTHER THINGS
Temperature, as we have seen, is a measure of the average
kinetic energy of the molecules of a body. As the temperature of
a body increases, the average kinetic energy of its molecules also
increases. This increase in kinetic energy means that the molecules
move through larger distances, requiring more room to do so. For
this reason, a substance expands when its temperature increases.
A 10-m steel beam used in the construction of a bridge
reduces in length to 9.996 m when the temperature drops to
ÿ108C. When the temperature rises to 358C during a hot
summer day its length increases to 10.001 m. Although the
increase in length amounts to only 5 mm for the 10 m beam, a
large bridge with a total length of 1000 m would need 100 of
these beams and the total increase in length between winter
and summer would be 50 cm.
We can see in the above example that the increase in length of
the steel beam due to the increase in temperature is proportional
to the length of the beam. A 10 m beam increases only 5 mm when
the temperature changes from ÿ108C to 358C, whereas a 1000 m
beam (or 100 10 m beams placed one after the other) would
increase its length by 50 cm. This thermal expansion is also proportional to the temperature change. Finally, if the beams were
made of aluminum (not practical for bridges) the thermal expansion would be twice as large. Thus, the increase in length depends
on the original length of the beam, the temperature change, and
the kind of material used in its construction.
We call L0 the initial length of the beam when the temperature is ÿ108C and L its increase in length as the temperature
rises to 358C in the summer. This increase in length L is proportional to the initial length L0 (10 m in our example), the change in
temperature T, and a property that depends on the material
used (steel in our example) called the coef®cient of linear expansion,
(the Greek letter alpha). We can express this as
L Lo T:
In Table 10.3 we list the coef®cients of linear expansion for some
common substances. This coef®cient, which has units of inverse
degrees Celsius (1/8C), measures the fractional change in length
with a given change in temperature for a substance.
The fact that different substances expand differently with a
given temperature change, as illustrated by the different values
216
Heat and Temperature
Table 10.3 Coef®cients of thermal expansion of some substances
Substance
Rubber
Ice
Lead
Aluminum
Silver
Brass
Coef®cient (1/8C)
ÿ6
80 10
51 10ÿ6
30 10ÿ6
24 10ÿ6
20 10ÿ6
19 10ÿ6
Substance
Coef®cient (1/8C)
Copper
Iron (Steel)
Concrete
Glass (ordinary)
Glass (Pyrex)
Carbon (Graphite)
Carbon (Diamond)
17 10ÿ6
12 10ÿ6
10 10ÿ6
9 10ÿ6
3:2 10ÿ6
8 10ÿ6
1:2 10ÿ6
of the expansion coef®cients, has been used in the construction of
thermostats. A thermostat is an electrical switch that is activated
by changes in temperature, and is used to control electrical appliances that depend on temperature changes, like air conditioners,
heaters, and toasters. A simple thermostat can be constructed
with two strips of two different metals bonded together as
illustrated in ®gure 10.11(a). If one of the strips of metal is brass
and the other aluminum, when the temperature increases, the
aluminum strip expands more than the brass strip, since aluminum has a larger thermal coef®cient. Because the two strips are
joined together, the result is a bending of the two strips so that
the aluminum side is on the outside and the brass side on the
inside.
As shown in ®gure 10.11(b), this simple thermostat can be
used to control the operation of any appliance that works when
the temperature falls below a certain value, like a heater, or an
oven. If we reverse the strip so that the aluminum is toward the
Figure 10.11. (a) A bimetallic strip made with brass (shown as shaded)
and aluminum would bend upward so that the aluminum is on the outside since it expands more. (b) If this bimetallic strip is used as a simple
thermostat, it can control a heater, an oven, or any other device that needs
to operate when the temperature falls below a certain value.
217
SUPERSTRINGS AND OTHER THINGS
Figure 10.12. A bimetallic strip shaped like a coil is a common type of
thermostat. When the temperature changes, the coil opens or closes the
circuit.
top part of the ®gure, the strip would bend downward when the
temperature increases above a certain value. In this case, it can be
used to control an air conditioner, for example.
A common type of thermostat uses a bimetallic strip shaped
like a coil (®gure 10.12). With changes in temperature, the different rate of expansion of the two metals in the strip subjects the coil
to stresses that open or close the contact.
The unusual expansion of water
Water is the important exception to the rule that substances
expand when the temperature increases. Although this also
applies to water in its three phases at most temperatures, between
08C and 48C water actually contracts when the temperature
increases. This is due to the structure of the water molecule.
Each oxygen atom in a water molecule can bind itself to two
additional hydrogen atoms from other water molecules and
each one of its two hydrogen atoms can in turn bind themselves
to one oxygen in an adjacent water molecule. This means that
each water molecule can participate in 4 bonds with other
water molecules, as shown in ®gure 10.13(a). In liquid water,
about 80% of these bonds are completed, whereas in ice all
218
Heat and Temperature
Figure 10.13. (a) The four bonds of liquid water: each oxygen atom can
bind itself to two hydrogen atoms in neighboring molecules and each one
of the two hydrogen atoms in a water molecule can bind to oxygen atoms
from two nearby molecules. (b) The open structure of ice.
water molecules are bonded to other water molecules to form the
crystal structure of the solid (®gure 10.13(b)).
The crystals in ice have what is described as an open structure
because of the large open, unoccupied spaces between the molecules. Most other solids may have up to 12 molecules joined
together as nearest neighbors and their structures are not open
like that of ice. The structure of liquid water is less open than
that of ice because the number of bonds in the liquid decreases
with increasing temperature, and this makes the water molecules
at temperatures near 08C more tightly packed than they are in ice.
As more bonds are broken when the temperature of the water
increases from 08C to 48C, the molecules are able to occupy
more of these open spaces, so that the density of water increases
reaching its maximum value of 1000 kg/m3 at 48C. At temperatures higher than 48C, the larger kinetic energies of the water
molecules require that more room be available for the increased
motion of the molecules, and water begins to expand with
increases in temperatures, as all other substances do.
Life as we know it on the earth exists in part because of these
two unusual phenomena; the lower density of ice as compared
with water, and the contraction of water as the temperature
increases from 08C to 48C. When the temperature drops in the
winter, the surface water of lakes and ponds cools down to 48C,
becoming denser. The warmer, less dense water beneath is
219
SUPERSTRINGS AND OTHER THINGS
Figure 10.14. The lower density ice ¯oats, insulating the water beneath
and slowing further freezing. Aquatic life then survives in many lakes
and ponds.
buoyed up to the surface, where it gets chilled to 48C, becoming in
turn denser. The cool descending water brings oxygen down with
it. This process continues, keeping the water deep below the surface at 48C, until the atmospheric temperature drops further and
ice forms on the surface of the water. Since this lower density ice
¯oats, lakes and ponds freeze from the top down, and this layer of
ice insulates the water below, slowing further freezing. Fish and
other forms of aquatic life survive the winter in the slightly
warmer and oxygenated water below the ice (®gure 10.14).
220
11
THE LAWS OF
THERMODYNAMICS
The four laws of thermodynamics
Why does time seem to ¯ow in only one direction? Can the ¯ow
of time be reversed? The directionality of time is still a puzzle
because all the laws of physics except one are applicable if time
were to be reversed. As we shall see in this chapter, the second
law of thermodynamics is the exception. The ¯ow of time seems
to arise from the second law.
There are four laws of thermodynamics: the Second Law was
discovered ®rst; the First was the second; the Third was the third,
but it probably is not a law of thermodynamics after all; and the
Zeroth law was an afterthought. We shall occupy ourselves in this
chapter with the study of these laws.
The ideal gas law
The study of thermodynamics is intimately connected with the
study of the behavior of gases. The reason is that gases, being
much simpler, are better understood than liquids and solids.
An ideal gas is any gas in which the cohesive forces between molecules are negligible and the collisions between molecules are
perfectly elastic; that is, both momentum and kinetic energy are
conserved. Many real gases behave as ideal gases at temperatures
well above their boiling points and at low pressures.
The English scientist Robert Boyle, the fourteenth child of the
Earl of Cork, was an infant prodigy. At the age of eight he spoke
Greek and Latin and at fourteen traveled to Italy to study the
works of Galileo. He returned to England in 1645 to ®nd his
221
SUPERSTRINGS AND OTHER THINGS
Figure 11.1. (a) A container ®lled with a gas kept at the same temperature by placing it on a temperature-controlled hot plate. The pressure is
changed by adding sand on the movable piston. When the pressure is
increased, the volume decreases. (b) A ®xed amount of sand on top
of the frictionless lid maintains the gas in the container at a constant
pressure. Increasing the temperature of the gas increases its volume in
a linear way.
father dead and himself wealthy. In 1654, he became a member of
the ``Invisible College,'' which later became the Royal Society,
where he met Newton, Halley, and Hooke.
In 1662, while experimenting with gases, he was able to show
that if a ®xed amount of a gas was kept at a constant temperature,
the pressure and the volume of the gas follow a simple mathematical relationship. Boyle discovered that gases were compressible,
so that when the pressure was increased (as when the piston in
®gure 11.1(a) is pushed down by the extra sand) the volume of
the gas decreased. Boyle further found that if the container was
placed on a hotplate kept at a constant temperature, the increase
in pressure was matched by the decrease in volume. This meant
that if the pressure was doubled, the volume halved; if the pressure was tripled, the volume decreased to exactly one third. Boyle
222
The Laws of Thermodynamics
expressed this relationship between pressure and volume as
PV constant
at constant temperature:
This expression is known today as Boyle's law. Several years after
Boyle's experiments, it was found that the constant in Boyle's law
was the same for all gases.
Throughout the eighteenth century, many scientists investigated the expansion of gases when heat was added, but their
results lacked consistency and no conclusion regarding the dependence among volume, pressure and temperature was reached. In
1804, the French chemist Joseph Louis Gay-Lussac was able to
show that if the pressure of the gas was kept constant (as illustrated in ®gure 11.1(b) with the constant weight of the sand on
the frictionless lid), the change in volume was proportional to
the change in temperature. He investigated this relationship
between temperature and volume with air, hydrogen, oxygen,
nitrogen, nitrous oxide, ammonia, carbon dioxide, hydrogen
chloride, and sulfur dioxide, and found that it held consistently.
Since the volume and the temperature of a gas at constant
pressure are directly proportional, a plot of volume versus temperature for different gases should give us straight lines for
each gas. Gay-Lussac found that if these lines were extrapolated,
they all cross the temperature axis at exactly the same point
(®gure 11.2(a)). This point is the absolute zero of temperature,
0 K or ÿ273.168C. This was the basis for the introduction of the
absolute or Kelvin scale of temperature.
Figure 11.2. (a) A plot of volume versus temperature for different gases
yields straight lines. When extrapolated, these lines intersect the temperature axis at ÿ2738C. (b) A plot of volume versus absolute temperature for an ideal gas is a straight line through the origin.
223
SUPERSTRINGS AND OTHER THINGS
The absolute zero is the minimum temperature attainable
because at this temperature the volume of the gas would be
zero, as we can see in the graph of ®gure 11.2(b). A plot of
volume versus absolute temperature for an ideal gas yields a
straight line that passes through the origin, as shown in ®gure
Physics in our world: Automobile engines
The gasoline engine used in automobiles is a heat engine
which generates the input heat from the combustion of gasoline inside the engine. For this reason, gasoline engines are
called internal combustion engines.
An automobile's gasoline engine consists of the cylinder
head, the cylinder block, and the crankcase. The cylinder
head has two sets of valves, intake and exhaust. When the
intake valves are opened, a mixture of air and gasoline
enters the cylinders. When the exhaust valves are open, the
burned gases are expelled from the cylinders. The valves
are opened and closed by the camshaft, a system of cams on
a rotating shaft, while the moving pistons turn a shaft, the
crankshaft, to which they are connected. The camshaft and
the crankshaft are interconnected by a drive belt or chain so
that as the pistons move, turning the crankshaft, the camshaft
is also turned, opening and closing the valves.
Most automobiles have a four stroke cycle engine. In
the intake stroke the downward motion of the piston draws
fuel into the cylinder. The volume of the cylinder increases
from a minimum volume Vmin to a maximum volume
Vmax rVmin , where r is the compression ratio. For modern
automobiles, the compression ratio is about 8. In the compression stroke the intake valve closes as the piston reaches the end
of the downstroke, and the piston compresses the air±fuel
mixture to Vmin . In the power stroke, an electric spark from
the spark plug ignites the gases, increasing their temperature
and pressure. The heated gases expand back to Vmax , pushing
the piston and doing work on the crankshaft. Finally, in the
exhaust stroke, the exhaust valve opens, and the piston
moves upward, pushing the burned gases out of the cylinder.
The cylinder is now ready for the next cycle.
224
The Laws of Thermodynamics
11.2(b), since T in Kelvin is zero for V 0. Therefore,
V constant T
at constant P:
This is Gay-Lussac's law, also known as Charles' law, because the
French physicist Jacques Alexandre Charles had independently
made the same discovery a few years earlier but had failed to
publish it.
We can combine Boyle's law and Gay-Lussac's law into one
single expression,
PV constant T
which is known as the ideal gas law.
We can extract a third relationship from the ideal gas law.
When the volume of the gas remains constant, the pressure is
proportional to the temperature,
P constant T
at constant V:
We should keep in mind that the temperature, T, in all the gas
laws is in kelvins.
The zeroth law of thermodynamics
Although we have not labeled them as such, we have already
studied the zeroth and ®rst laws of thermodynamics in the previous chapters. The zeroth law deals with bodies in thermal equilibrium. If we place two objects that are at different temperatures
in contact with each other and wait a suf®cient length of time, the
two objects will reach the same temperature; they are in thermal
equilibrium with each other.
How do we check that the two objects have reached thermal
equilibrium? We need to use a third object, a thermometer for
example, to verify that the two objects are in thermal equilibrium.
When we use a thermometer to measure the temperature of an
object, we bring it into contact with the object and wait some
time before reading the temperature. What we have done is to
wait until the object and the thermometer reach thermal equilibrium; that is, until the thermometer is at the same temperature
as the object. If we now use the same thermometer to measure
the temperature of a second object, and after waiting a long
225
SUPERSTRINGS AND OTHER THINGS
enough time to make sure that the thermometer has reached
thermal equilibrium with this second body, the reading of the
thermometer is the same as when it was in thermal equilibrium
with the ®rst object, we can say that the two objects are in thermal
equilibrium with each other. In other words,
if two objects are each in thermal equilibrium with a third object [the
thermometer], they are in thermal equilibrium with each other
This seemingly obvious statement is the zeroth law of thermodynamics.
The ®rst law of thermodynamics
The ®rst law of thermodynamics is a generalization of the principle
of conservation of energy to include thermal energy. In chapter 10,
we used the term thermal energy to describe the sum of all the
random kinetic energies of the atoms and molecules in a substance.
As we learned earlier in the book, atoms in a system have binding
energies, the nuclei of these atoms also have binding energies, and
the molecules have energy in the chemical bonds. These energies
stored in the molecules, atoms, and nuclei are different forms of
potential energy. The total energy of a system ± a gas, for example ±
includes all forms of energy, thermal and potential, and is called
the internal energy, U, of the system.
If a system is isolated, that is, if it does not exchange energy
with its surroundings, the total energy must remain constant.
This is the principle of conservation of energy, familiar to us
from chapters 4 and 5. It is also the ®rst law of thermodynamics;
we can state it very precisely as follows:
In an isolated system, the total internal energy remains constant,
although it can change from one kind to another.
A system can interact with its surroundings in two ways:
First, when it does work or work is done on it and second, when
heat is exchanged with the surroundings. To understand how this
comes about, consider the gas in the well-insulated container of
®gure 11.3. The air-tight frictionless lid can slide up or down
and we have added enough sand on top of it to balance the pressure exerted by the gas.
226
The Laws of Thermodynamics
Figure 11.3. A gas in a well-insulated container. The pressure of the lid,
sand and the atmospheric pressure balance the pressure of the gas.
We now remove the insulation from the bottom of the container and place it on a stove so that an amount of heat Q is
added to the gas while at the same time we add enough sand
so that the lid remains in place (®gure 11.4(a)). This means that
the gas does not do any work on the surroundings and all the
heat added goes to increase the internal energy of the gas. That is,
Q U:
Figure 11.4. (a) Adding enough sand to the lid keeps the volume of the
gas constant. In this case, since no work is done by or to the gas, the heat
added goes into increasing the internal energy of the system. (b) Removing sand from the lid lets the gas expand. The head added goes into
doing work. (c) If the amount of sand is not altered in any way, the
heat added increases the internal energy of the gas and the gas does
work on the surroundings. This is the ®rst law of thermodynamics.
227
SUPERSTRINGS AND OTHER THINGS
In this process, the volume of the gas remains constant, and we
call it an isochoric (equal volume) process. When the mixture of
air and gasoline ignites in a car's engine, the volume of the mixture remains unchanged and the heat from the ignition increases
the internal energy. This process is close to an isochoric process.
Suppose that instead of adding sand to the lid to keep the gas
at a constant volume, we slowly remove enough sand so that
the gas is allowed to expand (®gure 11.4(b)). In this case all the
heat added goes into doing work on the surroundings, and the
internal energy of the gas remains the same; that is,
Q W:
Recall that the temperature of a gas depends only on the random
kinetic energy per molecule. In an ideal gas there are no intermolecular interactions, so the potential energy of the molecules
is zero. Therefore, the internal energy of an ideal gas depends
only on the temperature of the gas. Since the internal energy of
the gas in the situation depicted in ®gure 11.4(b) does not
change, there is no change in the temperature of the gas. We
call this process an isothermal process.
We can see that if we do not alter the amount of sand on the
lid and simply add heat to the gas, the gas will increase its internal energy and do work on the surroundings (®gure 11.4(c)). In
other words, if an amount Q of heat ¯ows into the gas resulting
in an increase in its internal energy U, and the gas does an
amount of work on the surroundings, then
Q U W :
What this expression tells us is that
work done
increase in internal
heat added
:
by system
energy of system
to system
This is the principle of conservation of energy, which we call here
the ®rst law of thermodynamics. In this case, the external pressure that the atmosphere, the sand and the lid exert on the gas
is constant. This pressure must be matched by the expanding
gas; thus, the gas exerts a constant pressure on the surroundings.
This is an isobaric (equal pressure) process.
The ®rst law of thermodynamics shows that if a system
undergoes a volume and pressure change, the internal energy
228
The Laws of Thermodynamics
of the system changes by an amount that is given by Q ÿ W . In
chapter 5 we learned that the work done in moving an object in
a gravitational ®eld does not depend on the path through
which the object is taken between the initial and ®nal points.
The gravitational force is a conservative force. In thermodynamics, when the system changes from some initial state to
some ®nal state, the quantity Q ÿ W is found experimentally to
depend only on the initial and ®nal coordinates and not on the
path taken. This quantity Q ÿ W is the change in the internal
energy of the system,
U Q ÿ W :
The second law of thermodynamics
The second law of thermodynamics was made famous several
years ago by the English novelist and physicist C P Snow in his
well known essay ``The Two Cultures,'' where he suggested
that some understanding of it should be expected of every
educated person. His choice was a fortunate one because the
second law is one of the most important laws in all of science.
This, however, does not make it dif®cult to understand.
The German physicist Rudolf Gottlieb, known today by the
name of Clausius, stated it more than a hundred years ago as
follows: ``Heat does not pass spontaneously from cold to hot.'' This
statement of the second law is known as the Clausius statement.
It would not violate the principle of conservation of energy if
heat were to pass spontaneously from cold to hot. During a warm
summer day a puddle could spontaneously release heat to the
surroundings, cooling down and solidifying into ice, without
violating the conservation of energy principle. It would not
violate the principle of conservation of energy either if water
waves were to converge on a stone at the bottom of a pond and
propel it out of the water into the hands of a child standing
nearby. These events are never observed. Many times we have
seen blocks of ice melting into puddles and children throwing
stones into ponds, causing ripples that spread away from the
stone but never the reverse processes (®gure 11.5).
The fact that these events are always observed taking place in
one direction and not in the other is related to the direction of time.
229
SUPERSTRINGS AND OTHER THINGS
Figure 11.5. A child throwing a stone into a pond is a common sight
during warm days. The reverse situation, in which water waves converge
onto a stone lying at the bottom of a pond and propel it out of the water
into the child's hands, is never seen.
People grow older, not younger; stars are formed from rotating
clouds of hydrogen gas, begin their thermonuclear processes
out of which heavier elements are produced and energy is
released, expand and contract after millions of years, and ®nally
explode as supernovas or become black holes; the reverse
sequence of events is never observed (®gure 11.6).
There is a trend in nature towards a greater degree of disorder. It is not just your room that is hard to keep organized;
the entire universe keeps getting more disorganized. In 1865,
Clausius introduced the term entropy, from a Greek word that
means transformation, as a measure of the disorder of a system.
(The conventional symbol for entropy is S.) In terms of entropy,
the second law can be stated as follows:
The entropy of the universe never decreases; all natural changes take
place in the direction of increasing entropy.
There is only one way that all the pieces of a jigsaw puzzle
can be organized to make a picture but many incorrect ways in
which they can be put together. It is, therefore, extremely unlikely
that by throwing the pieces together they will fall in the correct
230
The Laws of Thermodynamics
Figure 11.6. An exploding supernova discovered and photographed in
January 1987 in the Magellanic Cloud, a satellite galaxy of the Milky Way.
These gigantic explosions represent the end of the life cycle for certain
stars. They also represent the beginning, as carbon, oxygen, silicon,
iron, and other heavy elements that were produced in the old star are
spewed out in the explosion. New stars are born out of the matter of
the explosion mixed with the surrounding gas in the galaxies. Our Sun
and Planet Earth contain the ashes of early supernova explosions.
(Courtesy NASA, Space Telescope Science Institute, Hubble Heritage
Team.)
231
SUPERSTRINGS AND OTHER THINGS
order. When you accidentally drop a stack of papers and
hurriedly pick them up, they become out of order. There is one
correct order for the papers and many incorrect ones. There are
only a few ways of placing the things in your room that are
esthetically pleasing to you and very many ways that are not.
Your room is therefore more likely to be in an esthetically displeasing order.
When you open a soft drink bottle, some of the carbon dioxide
that had been dissolved under pressure in the liquid and is mixed
with air above the liquid leaves the bottle and diffuses into the
atmosphere. The reverse of this process, where the diffused gas
collects itself and enters the bottle so that you can replace the
cap and restore the previous order, is never observed. The propane gas in a tank of a recreational vehicle diffuses into the air
after you open the valve. The original situation, with the propane
gas in the tank separated from the air outside, is more ordered
than the latter, where the propane is mixed with the air. Entropy
increases in each one of these normal processes.
We should note that nothing drives the molecules of a gas
into a state of greater disorder; there is no special force behind
this phenomenon. The diffusion of a gas is purposeless and is
only the result of the random motion of its molecules.
When we let a gas expand, as when we let the air out of a tire,
its temperature decreases. As the gas expands, it does work on
the surroundings and this means a reduction in the kinetic
energy of the gas. The air leaking out of a tire does work by pushing the outside air in the vicinity of the valve through molecular
collisions. The energy of the gas jostles out into the environment.
This diffusion of energy out into the environment explains
Clausius's statement of the second law. The tungsten atoms in
the ®lament of a light bulb are vibrating rapidly when the light
is on. The vibrating atoms near the surface of the ®lament collide
with nearby air molecules, which in turn collide with other air
molecules farther out. The energy of the ®lament is thus diffused
into the cooler environment. Heat passes from the hot ®lament to
the colder air and never the other way around.
When we add heat to a substance, its molecular motion
increases and this results in more disorder. Adding heat to a substance, then, increases its entropy and the more heat we add, the
greater the increase. A given amount of heat that is added to a
232
The Laws of Thermodynamics
Figure 11.7. (Cartoon by Sydney Harris.)
substance is more effective in producing disorder if the substance
is cold than if it is already hot. A misplaced item stands out more
in a well organized room; you would hardly notice the same
misplaced item in a messy room (®gure 11.7). We can see that
the change in entropy of a substance is directly proportional to
the amount of heat added and inversely proportional to the temperature. If a system absorbs an amount of heat Q at an absolute
temperature T, the change in entropy is
Q
:
T
The units of entropy are joules per kelvin, J/K.
S
The third law of thermodynamics
Absolute zero is the lowest temperature possible. The German
physical chemist Harmann Walther Nernst (1864±1941) proposed
233
SUPERSTRINGS AND OTHER THINGS
The frontiers of physics: Entropy that organizes?
By mixing tiny polystyrene spheres of two different sizes,
physicists have been able to turn the tables on entropy and
use it to line up the spheres in an organized pattern. Arjun
G Yodh of the University of Pennsylvania and his research
group placed a drop of salt water containing a mixture of
these spheres between two microscope slides. They used
many 0.08 micrometer polystyrene spheres and mixed in a
few 0.46 micrometer ones and etched a small groove in the
slide.
Because of their random motion, the small spheres in the
salt water collided with themselves and with the fewer larger
spheres. The collisions produced a state of maximum entropy,
in which the small spheres spread out in disordered state. The
random motion of the larger spheres also tended toward an
increase in entropy. However, the much larger number of
smaller spheres obstructed the motion of the larger spheres.
Larger spheres reaching the edge of the groove, for example,
would not be allowed to enter it; the jostling motion of the
smaller spheres pushed them back. Even when the experimenters placed the large spheres in the groove, they were
driven out by the smaller particles. Many of the larger spheres
get trapped at the edge of the straight groove, forming a row.
The scientists measured the cumulative force keeping
the larger particles from entering the channel to be 40 femtonewtons (10ÿ15 N). This was the ®rst measurement of the
``entropic interaction.''
in 1907 that the absolute zero temperature cannot be reached. In
an experiment, the temperature of a system can in principle be
reduced from the previous temperature obtained, even if by a
very small amount, as has been done recently, achieving temperatures of the order of 10ÿ7 K (one ten-millionth of a kelvin). What
Nernst proposed was that although it is possible to get closer and
closer to absolute zero, to actually get there requires an in®nite
number of steps. This, of course, makes the absolute zero temperature unattainable. This statement is known as the third law
of thermodynamics.
234
The Laws of Thermodynamics
Third Law of Thermodynamics: It is impossible to reach the absolute
zero temperature in a ®nite number of steps of a process.
Nernst was awarded the 1920 Nobel Prize in Chemistry for this
discovery.
Some scientists think that this statement is actually an extension of the second law and not a separate law. Whether a separate
law or only an extension of the second law, the statement that
absolute zero cannot be reached is universally accepted.
Entropy and the origin of the Universe
The fact that all natural processes take place in the direction of
increasing entropy means, as we have seen, that the entropy of
the entire Universe is constantly increasing. Nature's tendency
towards greater disorder diminishes the amount of energy available to do work. Think, for example, of a bouncing ball. With each
bounce, the mechanical energy of the ball decreases and the ball
reaches a lower height every time, until, ®nally, it lies still on the
ground. While the ball is falling, although there is still random
molecular motion, all its molecules also possess the same translational motion towards the ground, producing an ordered
state. When the ball hits the ground, there is a slight increase in
the random motion of the molecules of both ground and ball as
a result of the collision and this increases the thermal energy of
the ball (and the ground). There is, therefore, less mechanical
energy for the second bounce. When the ball loses all of its
mechanical energy to thermal energy through the repeated collisions and bounces no more, the ordered translational motion of
its molecules has been lost to a random disordered motion that
has produced an increase in thermal energy. As the ordered
state disappears, the entropy of the system increases.
If we think of the bouncing ball as a very simple machine to
do work (using it to catapult a small stone, for example, as illustrated in ®gure 11.8), the energy that is available to perform
this work decreases after every bounce and disappears altogether
when the ball lies still on the ground. The total energy of the
system (ball and ground) remains constant; the mechanical
energy has been transformed into thermal energy, no longer
available to do work.
235
SUPERSTRINGS AND OTHER THINGS
Figure 11.8. A bouncing ball can be used to catapult a small stone. The
energy available to do this work decreases with every bounce as it
dissipates into heat.
In all processes that involve exchange of heat, there is an
increase in entropy and a decrease in the energy available to do
work. The entropy of the universe must eventually reach a maximum value when everything is in a state of perfect disorder and
the total energy of the universe is distributed uniformly. This
led scientists to predict the ``heat death'' of the universe when
processes are no longer possible.
We now know that the ``heat death'' of the universe has
already happened. Even though, as we have said, the entropy
of the universe is continuously increasing, the total entropy of
the universe is essentially constant. The reason is that almost all
of the entropy of the universe is in the radiation of photons and
not in matter ± by a factor of 400 million to one ± and the entropy
of the radiation has already reached its peak.
The universe consists of matter in the form of galaxies and
radiation in the form of photons. The galaxies are conglomerates
of billions of stars: our own galaxy, the Milky Way, is estimated to
contain 100 billion stars. From observational evidence, astronomers calculate that the universe contains some 100 billion
galaxies each separated from the others by a few million lightyears. (One light-year is the distance light travels in a year,
nearly ten trillion kilometers.) In all these galaxies, stars are constantly forming, evolving, exploding and collapsing. All these
processes increase the total entropy of the universe. However,
the total increase in entropy due to all the changes that have
taken place since the beginning of the universe amounts to only
one ten-thousandth of the entropy of the radiation.
236
The Laws of Thermodynamics
When the universe was formed some 13 billion years ago, it
was all radiation. It is a triumph of modern physics that we are
today able to trace the history of the universe beginning at a hundredth of a decillionth (10ÿ35 ) of a second after the very moment
of creation, an interval of time so small that we are incapable of
imagining it. We will be studying some of the details of this
wonderful theory in chapter 25. We can, however, outline the
main ideas here, so that we can understand how the total entropy
of the universe can be considered to be almost constant.
The Big Bang theory of the origin of the universe was proposed by the Russian±American physicist George Gamow
towards the end of the 1940s to explain a strange phenomenon
that had been discovered some twenty years earlier by the American astronomer Edwin Hubble. Hubble had undertaken a study
of 100 galaxies with the 100-inch telescope at Mount Wilson and
had made a startling discovery. He noticed that all these galaxies
were invariably moving away from us. For a while it looked as if
Ptolemy was right after all; the earth seemed to be at the center of
the universe.
Soon Hubble realized that this was not the case. By comparing the velocity of recession with the known distances to the
galaxies, he discovered that the more distant the galaxy, the
faster it receded. The velocities that he measured were not
minor. Galaxies in the constellation Ursa Major, for example,
were found to be receding at a rate of 42 000 km/s or one-seventh
the speed of light. Hubble showed that the velocities of recession
of galaxies were related to their distances by a simple expression,
now known as Hubble's law. This law states that
v Hd
where v is the velocity of recession of a galaxy, d is the distance to
the galaxy, and H is Hubble's constant. This constant has a value
22 km/s per million light-years. In other words, a galaxy one
million light years from us moves away at a speed of 22 km/s.
Wendy L Freedman of the Carnegie Observatories in Pasadena,
CA, has recently calculated this value from an analysis of data collected during the past eight years by the Hubble Space Telescope.
A second group led by Alan Sandage at Carnegie has calculated a
slightly lower number for the Hubble constant, at 18.5 km/s per
light-year. This discovery led Hubble to realize that although
237
SUPERSTRINGS AND OTHER THINGS
Figure 11.9. The raisin bread analogy of the expansion of the universe.
As the loaf of bread is baked, the expanding dough carries the raisins
away from each other. Raisins A, B, and C move away from raisin MW
as the dough rises. MW can be any raisin in the loaf.
every one of the distant galaxies that he studied was receding from
us, the earth was not at a special place in the universe, but rather
that every galaxy was moving away from every other galaxy.
Hubble's discovery means that the universe is expanding.
Imagine baking a loaf of raisin bread; as the dough rises the
raisins move apart from each other (®gure 11.9). If we select
one particular raisin, we would see that all the other raisins
move away from it and the farther a raisin is from the one we
selected, the faster it moves away. We did not pick a special
raisin; it works the same with any one raisin as they all move
away from each other. The expanding dough carries the raisins
along so that the distance between them increases. The expanding
space carries the galaxies along so that from our own galaxy we
see the other galaxies moving away from us. Intelligent beings
on a planet in a distant galaxy would see our galaxy and all the
other galaxies moving away from theirs.
An expanding universe means that at earlier times the
galaxies were closer together and this suggests a beginning for
everything. This is what motivated George Gamow to propose
the Big Bang theory of the origin of the universe.
As we said earlier, the universe consists of matter and radiation. The average density of matter in the universe has been
estimated to be about one nuclear particle per cubic meter. The
average density of radiation in the universe today is about 400
238
The Laws of Thermodynamics
million photons per cubic meter. Since the entropy of a system is
proportional to the number of particles, we can see that the
entropy of the universe is mostly in the photons, by a factor of
400 million.
Since almost all the entropy of the universe is in the photons,
the increase in entropy that takes place when processes occur in
the planetary systems and galaxies of the universe, although
numbered in the billions, can add very little to the entropy that
already exists in the photons. The ``heat death'' of the universe,
then, effectively happened shortly after the big bang, with the
creation of all the photons.
Entropy and the arrow of time
The concept of time is intimately linked to the concept of entropy.
Our perception of time arises from the accumulation of information in our brains; we remember the past. Events, changes, and
processes need to occur so that experiences can be sensed and
stored in our memories. As we have seen, changes in the universe
take place only when the entropy of the entire universe increases.
The accumulation of information in our brains takes place only
when these changes take place; that is, when the entropy of the
universe increases. Nature has used this increase in entropy to
give direction to the arrow of time.
Except for the second law of thermodynamics, all the laws of
physics remain unchanged if the direction of the arrow of time is
reversed; they are time-reversible. The laws of mechanics, for
example, are time-reversible. If we make a movie of an oscillating
pendulum, a perfectly elastic bouncing ball or any other purely
mechanical process (where no exchange of heat takes place)
and then run the movie backwards, it is impossible for us to tell
the difference. The pendulum oscillates back and forth and the
ball bounces the same as when the movie is run forward.
Since the laws of mechanics and electromagnetism do not
depend on the direction of time, the elastic collision of two molecules in a gas appears equally possible if the ¯ow of time were to
be reversed, as illustrated in ®gure 11.10. However, when we
consider all the molecules in the gas, the situation is no longer
time-reversible. If the gas is originally con®ned to the left half
239
SUPERSTRINGS AND OTHER THINGS
Figure 11.10. The collision of two molecules is time-reversible. We
could not tell the difference if a video of this collision were to be
shown in reverse.
of a container by means of a partition, as seen in ®gure 11.11, and
a small hole is made on the partition, the gas will diffuse until it
is uniformly distributed throughout both sides of the container.
If a video of this process were to be shown in reverse, we
would immediately recognize that this is the wrong direction.
Although the motion of any individual molecule of the gas is
time-reversible, the behavior of the whole gas is not.
Lord Kelvin recognized this reversibility paradox in the 1870s.
Kelvin himself and later the German physicist Ludwig Boltzmann (1844±1906) realized that it was the statistical nature of
the second law that explained the paradox. There are many
Figure 11.11. The diffusion of a gas through an opening in the partition
is not a time-reversible process. We could immediately tell if a video of
this process were shown in reverse.
240
The Laws of Thermodynamics
Figure 11.12. After a long enough time, the interacting molecules will
®nd themselves occupying the original positions. This is known as
Poincare recurrence. In a real system, it will actually take many times
the age of the universe for this to happen.
more disordered arrangements of a system than ordered ones,
and for this reason the disordered arrangements are much
more likely to occur. Given enough time, even the very few
ordered arrangements can occur, although for real situations,
enough time means a time longer than the age of the universe.
This last statement, on the recurrence of a mechanical
system, was proposed in 1889 by the French scientist Henri PoincareÂ, and is known as Poincare recurrence. Consider, for example, a
gas enclosed in a sealed container (®gure 11.12). If we concentrate
our attention on a single molecule, we would ®nd it undergoing
collisions with other molecules and with the walls of the container. Suppose, for simplicity, that there are only three molecules
in this ``gas.'' If we wait a long enough time, we will eventually
see the three molecules occupying the same positions that they
occupied at the beginning, when we ®rst started observing
them. The time it takes for this to happen is known as PoincareÂ
cycle time. For an actual macroscopic system, the Poincare cycle
time turns out to be many millions of times the age of the universe. No wonder we never see it happening!
The Russian±Belgian physical chemist Ilya Prigogine, who
won the Nobel prize in 1977 for his work in irreversible thermodynamics, postulates that the Poincare recurrence does not
happen because very small changes made to a system can drastically change its future behavior, preventing it from ever reaching
the same initial state. The following example will help us clarify
his hypothesis. Suppose somebody traces a large circle on the
ground and asks you to follow it as closely as possible on a
bicycle, so that, after you complete the circle, you end up at the
241
SUPERSTRINGS AND OTHER THINGS
same position from where you started. If you are an average
bicycle rider, you can easily correct small deviations from the
circular path, and it would not be very dif®cult for you to accomplish the task. Suppose, however, that, as you are about to
complete your circle, you momentarily lose your balance and
deviate from the path, not ending at the same spot. You could
continue riding, completing a second loop, and perhaps a third,
until you make it to the right spot. If you wait a long enough
time and complete enough loops, your ®nal position is indistinguishable from the initial position.
Now, suppose that the game becomes more challenging,
and the circle is traced on the edge of a very tall and narrow
circular wall (®gure 11.13). Here you would have to be extremely
cautious because even a tiny deviation from the circle would
cause you to fall off the edge, preventing you from ever completing the circle. In this case, a small deviation would completely
change the system so that the same initial state could not be
reached.
Figure 11.13. A cyclist riding on the edge of a very tall and narrow
circular wall.
242
The Laws of Thermodynamics
Which one of these opposing views is the correct one? This is
a question that has not yet been resolved. We can see that thermodynamics is a part of physics that is still being developed and that
it has implications that reach into the heart of our understanding
of the world.
243
12
ELECTRICITY
Electromagnetism
Amber is a beautiful stone that has been used since prehistoric
times to make jewelry and ornamental carved objects. A fossil
tree resin from pine and other softwood trees, amber was also
of interest to the ancient Greeks who called it "o (elektron).
In the seventh century BC, Thales of Miletus observed that when
amber was rubbed vigorously with cloth, it attracted small bits
of straw, feathers or seeds. Other materials that show this property were discovered in the centuries that followed.
Lodestone is another mineral with unusual properties.
Known also as magnetite or magnetic iron ore, lodestone is an
iron oxide mineral that attracts iron. Known for this property as
far back as 500 BC, lodestone turns to a north±south direction
when ¯oating in a liquid or suspended from a string. The
Roman poet Lucretius advanced a theory about the cause of
magnetism in his poem On the Nature of the Universe.
A detailed study of these properties only began with the work
of William Gilbert (1544±1603) in England. Gilbert received a medical degree from Cambridge and established himself as a physician of renown, becoming president of the College of Physicians
and later court physician to Queen Elizabeth I. He became interested in the work on magnets of the French scholar and engineer
Petrus Peregrinus de Maricourt and started performing very careful experiments to determine the nature of magnetism. His experiments eventually led him to investigate also the properties of
amber and to realizing that its attraction was different from
magnetism which involved only iron. In his book De Magnete,
published in 1600, he not only presented a systematic discussion
247
SUPERSTRINGS AND OTHER THINGS
on magnetism but also gave a discussion on the force of attraction
caused by friction in amber. He coined the word electric for
``bodies that attract in the same way as amber.''
Electricity and magnetism developed as two different sciences
until the early nineteenth century when the Danish physicist Hans
Christian Oersted observed that there was a connection between
them. This connection was developed further by, among others,
the English scientist Michael Faraday. It was, however, the Scottish
physicist James Clerk Maxwell who brought together electricity
and magnetism in a complete form by the formulation of his
theory of electromagnetism in the form of the four equations that
bear his name. We shall return to the fascinating story of the
development of electromagnetism in chapter 14.
Electric charge
For twenty-four centuries, from Thales to Gilbert, the attractive
properties of amber and of ``bodies that attract in the same way
as amber'' were known. In 1733, the French chemist Charles
FrancËois de Cisterney du Fay performed a series of experiments
in which he touched a gold foil with a glass rod that had been
electri®ed by rubbing it with silk (®gure 12.1). Before it was
touched by the glass, the foil was attracted towards the rod but
after it was touched, the gold foil was repelled away from it. Moreover, contrary to his expectation, the foil was attracted towards an
amber rod that had been rubbed with wool. A gold foil that was
touched ®rst by the electri®ed amber rod and was repelled by it,
was then attracted towards the glass rod.
Du Fay supposed that there were two kinds of electri®cation
and he called the type produced by the glass vitreous (from the
Latin word for ``glass'') and the type obtained with the amber
rod, resinous. Du Fay generalized his ®ndings by stating that
bodies with the same type of electri®cation repel each other, whereas
bodies with different type of electri®cation attract each other.
In 1747, Benjamin Franklin, the great statesman, inventor,
writer, and the ®rst American physicist, conducted experiments
that showed that one type of electri®cation could be neutralized
by the other type. This indicated to him that the two types of
electricity were not just different; they were opposites. He further
248
Electricity
Figure 12.1. (a) A glass rod that has been rubbed attracts a gold leaf. (b)
When the gold leaf is touched with the glass rod, the rod and the leaf
repel each other. (c) A gold leaf charged with the glass rod is attracted
by a charged amber rod.
proposed that all objects possessed a normal amount of electricity
some of which was transferred to another body by rubbing. When
this transfer of electricity took effect, the ®rst body had a de®ciency on its normal amount of electricity, and this could be
indicated by a negative sign, while the body that received the electricity ended up with an excess, which could be indicated by a
positive sign. Since there was no difference in the behavior of
the two ``types'' of electricity, Franklin had no way of knowing
which one was positive and which one was negative. He
arbitrarily decided that rubbing glass with a silk cloth transferred
electricity to the rod and, therefore, was positive, while rubbing
amber with wool made it lose electricity, and was negative
(®gure 12.2). We now use the term electric charge and speak of
``positive electric charge'' and ``negative electric charge.''
Franklin had a 50% chance of being right with his convention
of signs. He lost. We now know that electrons are the carriers of
electric charge when the rods are rubbed and, in Franklin's sign
249
SUPERSTRINGS AND OTHER THINGS
Figure 12.2. A glass rod that has been rubbed with silk acquires a
positive charge. An amber rod that has been rubbed with wool becomes
negatively charged.
convention, they have negative charge. In the case of the glass
rod, when it is rubbed with silk, electrons actually leave the rod
and join the silk molecules. Some of the glass molecules near
the surface of the rod are left lacking a negatively charged electron and become positive ions; that is, the silicon and oxygen
atoms that make up the glass molecules have one electron
fewer than the corresponding number of protons in their nuclei
and the molecule is left with a net positive charge. The silk, on
the other hand, acquires extra electrons in the rubbing process
and becomes negatively charged. In a typical experiment, about
one billion electrons get transferred in the rubbing process.
Franklin had assumed that positive charges were transferred.
Although we still use Franklin's convention of signs, we need
to be aware of the correct interpretation.
Coulomb's law
What is the nature of the attractive force between a positive and a
negative electric charge or the repulsive force between two positive or two negative charges? The English chemist Joseph Priestley, in a brilliant insight, provided the answer. Priestley had been
asked by his friend Franklin to investigate a phenomenon that
Franklin had encountered. About 1775, Franklin noticed that
small charged cork balls were not affected when hung from a
thread close to the inner surface of a charged metal can, although
250
Electricity
Figure 12.3. The net gravitational force acting on the object of mass m
inside a shell is the sum of the gravitational forces exerted on the
object by all the parts of the shell. All these forces balance out and the
net force is zero.
they were attracted by the can when placed near the outside surface. Priestley correctly realized that a similar situation occurs in
mechanics. If an object of mass m were to be placed inside a
hollow planet, the net gravitational force acting on the object
would be zero (®gure 12.3). Since the gravitational force is an
inverse square law, that is, it is proportional to the inverse of
the square of the distance, Priestley proposed that the force
between two electric charges also varies as the inverse of the
square of the distance between the charges.
Charles Augustin Coulomb was born to wealthy parents in
1736 in AngouleÃme, France. He studied mathematics and science
and became a military engineer, serving in the West Indies until
1776, when he returned to Paris. Because of his precarious health
and his desire for a quiet life, he retired to the town of Blois
where, while the French Revolution started, he dedicated himself
to scienti®c experimentation. In 1777, Coulomb invented a torsion
balance to measure the force between electrically charged objects.
(Shortly thereafter, Cavendish used a similar torsion balance to
``weigh the earth;'' that is, to determine the value of G.) A small
charged sphere was attached to a horizontal, insulating rod
which was suspended by a thin silver wire (®gure 12.4). Close
to this sphere, he placed a second charged sphere. As the spheres
were attracted to or repelled from each other (depending on
whether the charges were opposite or the same), the wire twisted.
251
SUPERSTRINGS AND OTHER THINGS
Figure 12.4. Coulomb's torsion balance. The electrical force between
the suspended sphere and the ®xed sphere can be determined from
the twisting of the wire.
The twisting angle allowed Coulomb to determine that the force
between the spheres for different separations was proportional to
the inverse of the square of the separation, as Priestley had proposed. If r is the distance between the centers of the spheres,
the electrical force F between them is given by
1
F / 2:
r
Coulomb also showed that the electric force also depends on
the magnitude of the charges. Although at the time there was no
method for measuring the amount of electric charge on an object,
Coulomb ingeniously ®gured out a way of comparing charges.
Bringing together two identical spheres, one charged and the
other uncharged, he found that the original charge was distributed in equal parts between the two spheres, so that each sphere
held one half of the original charge. Bringing other uncharged
spheres into contact with one of the charged ones, he could produce fractions of one fourth, one eighth, and so on, of the original
charge. This allowed Coulomb to establish that the electric force
between two charged objects is also proportional to the product
of the magnitudes of the charges.
The results of Coulomb's experiments on the forces exerted
by one charged object on another can be summarized in what
we now call Coulomb's Law:
The force exerted by one charged object on another varies inversely as
the square of the distance separating the objects and is proportional to
252
Electricity
Figure 12.5. The electric force exerted by one charge on another is along
the line joining the charges. (a) Charges of opposite signs attract each
other. (b), (c) charges of the same sign repel each other.
the product of the magnitude of the charges. The force is along the line
joining the charges and is attractive if the charges have opposite signs
and repulsive if they have the same sign [see ®gure 12.5].
If we call q1 and q2 the magnitudes of the two charges, and r the
distance between their centers, we can state Coulomb's law in a
single equation:
q q
F k 12 2
r
where k is known as Coulomb's constant. The SI unit of charge is
the coulomb (C) and the value of Coulomb's constant is
k 9 109 N m2 =C2 :
In practical situations charges as large as one coulomb are very
seldom encountered. The force that two objects each holding a
charge of 1 C would exert on each other when they are separated
by a distance of 1 m is 9 109 N, which is about equivalent to the
weight of one million tons. Typical charges produced by rubbing
small objects are of the order of nanocoulombs (nC) to microcoulombs (mC).
The most fundamental unit of charge is the charge of one
electron or one proton. The fundamental charge, e, has a value
e 1:602 10ÿ19 C:
The electric ®eld
How does the electric force between two charged objects separated by a distance r propagate from one object to the other? In
253
SUPERSTRINGS AND OTHER THINGS
the nineteenth century the English physicist Michael Faraday
introduced the concept of ®eld as an intuitive way of looking at
the electrical interaction between charges. Although, as we shall
see in the ®nal chapters of the book, there are other ways of looking at this interaction, the concept of ®eld is still a very powerful
and useful one.
There are situations in everyday life that we can use to illustrate the concept of ®eld. A line of people waiting for tickets for a
rock concert is an example. When the tickets run out, the word
spreads out very quickly. Some people hear it directly from the
person at the ticket window, others are told by the people who
heard it ®rst, these people tell others, and still others guess it
from the movement of the crowd or from people's disappointed
expressions. Knowledge about the lack of tickets spreads out
through the crowd without a need for everyone in line to speak
directly to the person at the window. The region around the
ticket of®ce where the people interested in obtaining tickets are
located constitutes a ``®eld.''
In physics, ®eld is used to specify a quantity for all points in a
particular region of space. The temperature distribution of the
water in a pond is an example. The property of the space
around the earth where any object experiences its gravitational
attraction constitutes the gravitational ®eld of the earth. Similarly,
the electric ®eld describes the property of the space around an
electrically charged object. The presence of a charged body at a
particular point distorts the space around it in such a way that
any other charged body placed in this space feels a force that is
given by Coulomb's law.
Figure 12.6 shows a positive charge q at some point in space.
If we place a small positive test charge q0 at some other nearby
point, the charge q will exert a force on this test charge which
points away from the location of q and along the line joining
them. The electric ®eld strength E is the Coulomb force divided
by the magnitude of the test charge q0 ,
E
F
:
q0
The electric ®eld strength is a vector quantity since it has magnitude and direction and is proportional to the Coulomb force,
which is a vector quantity. The direction of the electric ®eld
254
Electricity
Figure 12.6. The electric ®eld strength E at some point P near a positive
charge q is determined by placing a small positive charge q0 at point P.
The value of E is the magnitude of the coulomb force felt by the test
charge divided by the magnitude of the test charge.
vector is the direction of the force on a positive test charge. The
units of E are newtons per coulomb, N/C.
By placing the test charge at several points in the vicinity of
the charge q, we can map the electric ®eld around this charge.
Figure 12.7 shows the electric ®elds due to a positive and a negative charge. Since the test charge is always positive, the electric
®eld around the negative charge points inward. The electric
®eld line con®guration for a pair of charges is a superposition
of the ®eld lines for two single charges, as shown in ®gure
12.8(a). Figure 12.8(b) shows the electric ®elds for two unlike
charges and for two like charges. Notice, also, the ®eld lines in
between the two parallel metal plates holding opposite charges
(®gure 12.9). In this case, the ®eld is constant throughout the
region enclosed by the plates, as evidenced by the parallel,
equally spaced ®eld lines. Figure 12.10 is a photograph of two
Figure 12.7. Electric ®elds around a positive and around a negative
charge.
255
SUPERSTRINGS AND OTHER THINGS
Figure 12.8. The electric ®eld due to a pair of charges is a superposition
of the ®elds for two single charges. (a) The ®eld at points 1 and 2 is the
resultant of the ®eld due to the positive charge, a vector pointing away
from the positive charge, and the ®eld due to the negative charge, a
vector pointing toward the negative charge. (b) Electric ®elds due to
two unlike charges (left) and two like charges (right).
Figure 12.9.
Electric ®eld due to two oppositely charged metal plates.
rods carrying equal and opposite charges. The electric ®elds are
made visible by seeds ¯oating in an insulated liquid.
The fundamental charge
In 1891, a young American student who had just completed
his undergraduate education with a major in Greek at Oberlin
College, was approached by the school with an interesting
proposition. Due to a shortage of scientists in the United States,
256
Electricity
Figure 12.10. Electric ®elds produced by two rods carrying equal and
opposite charges. The patterns are made visible by grass seed ¯oating
in an insulating liquid. (From PSSC Physics Seventh Edition, by HaberSchaim, Dodge, Gardner, and Shore. Published by Kendall/Hunt
Publishing Company, 1991.)
the school had been unable to hire a quali®ed physics instructor
and asked the recent graduate if he would accept the challenge.
Robert A Millikan not only accepted the offer but fell in love
with the subject and took his master's degree while he taught
introductory physics at the school. He went on to obtain the
®rst PhD degree in physics that Columbia University ever
awarded. After working as a postdoctoral fellow under Max
Planck in Germany he returned to the United States and accepted
a position at the University of Chicago in 1910.
In 1897, J J Thomson had succeeded in determining the
charge-to-mass ratio of the electron, in what is now considered
to be one of the landmark experiments in the history of
science. After his determination of this ratio, Thomson and his
collaborators J S Townsend and H A Wilson attempted to
determine the charge e of the electron in a series of experiments
in which an ionized gas was bubbled through water to form a
cloud. By determining the mass and total electric charge of the
cloud and estimating the number of ions in the cloud, they
obtained a value for the magnitude of the charge of the electron
of about 1:0 10ÿ16 C. However, the method used in the
estimation of the number of ions in the cloud could not be veri®ed.
257
SUPERSTRINGS AND OTHER THINGS
The frontiers of physics: Electrostatics on Mars
Mars, our neighbor in the Solar System, has been the subject
of human interest since early times. Today, the possibility
that there is water under its surface that could harbor living
organisms has made the exploration of Mars a matter of enormous relevance to us. NASA and the European Space Agency
have aggressive plans to explore the planet.
Scientists at NASA are attempting to solve the many
problems that current and future missions to the planet
may face. The Viking Lander missions of the 1970s together
with the more recent Path®nder mission have shown that
large areas of the surface of Mars are covered by ®ne particles
in a fairly homogeneous, thin layer. These dusty conditions
combined with the frequent dust devils and occasional
large dust storms, as well as with the extremely low absolute
humidity near the surface, create an environment conducive
to electrostatic charge buildup. The surfaces of landers,
rovers or equipment may acquire an electric charge when
placed in contact with the dust particles in the soil or suspended in the atmosphere. These dust particles, which can
themselves acquire an electric charge as they collide with
each other when blown by the wind, get attached to some
of these surfaces by electrostatic forces. The result could be
clogged ®lters, inef®cient dust covered solar cells or thermal
radiators, and obscure viewports. In addition, the voltages
generated as the surfaces charge up could produce electrostatic discharges that could be hazardous to equipment and
astronauts in future missions.
To assess these problems, NASA developed the Mars
Environmental Compatibility Assessment (MECA) Electrometer, an instrument that can measure the electrostatic
charge generated when different materials are rubbed.
Designed by scientists at NASA's Jet Propulsion Laboratory
(JPL) and NASA Kennedy Space Center (KSC) for a future
lander mission to Mars, this instrument is currently being
calibrated by teams led by Dr Martin Buehler at JPL and by
the author at KSC. The MECA Electrometer contains eight
sensors. Six of the sensors measure the charge developed by
258
Electricity
contact or friction on six insulating materials: Fiberglass/
Epoxy, Lexan, Te¯on, Rulon J, and Lucite. These materials
were chosen for their wide range at which they acquire
charge. There is also an ion gauge that will measure the ion
concentration in the martian atmosphere, a bare electrometer
to measure atmospheric electric ®elds, and a thermometer.
The instrument will be placed on the heel of the lander
robot arm. During operation, the electrometer will be rubbed
against the Martian soil. Although some of the materials
selected for the instrument are commonly used in space, the
principal idea was to generate data that can be replicated in
the laboratory. Future, more advanced instruments are
being developed that could characterize not only the soil
and dust on Mars but also new materials that can be used in
exploration missions.
The MECA Electrometer with the row of 5 electrostatic sensors, the
bare electrometer (top left), and the ion gauge (top right). There is
also a temperature sensor inside the housing.
259
SUPERSTRINGS AND OTHER THINGS
Figure 12.11. Millikan's oil drop experiment. The weight of the oil drop,
mg, is balanced by the electric force acting on the charged drop, qE.
In 1909, Millikan began a series of experiments in which
charged oil drops were balanced in midair for several hours by
a constant electric ®eld obtained with two parallel plates charged
with opposite charges (®gure 12.11). An oil drop of mass m would
be attracted towards the ground with a force F mg, which is the
drop's weight. When the drop holding a charge q is placed in a
region of a constant electric ®eld, an electric force Fel qE in
the upward direction is exerted on the drop, which could be
made to balance the weight of the drop by applying the appropriate ®eld. That is,
qE mg:
By varying the electric ®eld between the plates, the oil drops
could be moved up or down. If the charge on a drop changed
during the observation, its velocity was observed to change.
This allowed Millikan to conclude that the electric charge occurred
always in multiples of an elementary unit which was the magnitude of
the charge of the electron, e. He obtained a value for the charge
of the electron with an accuracy of one part in a 1000. In 1923,
Millikan was awarded the Nobel Prize in physics for these
experiments.
Electric potential
In our study of energy, we found that the concept of potential
energy was very useful, in particular when we used it to describe
the behavior of objects in the vicinity of the gravitational pull of
the earth. As we learned in chapter 4, the gravitational force is
260
Electricity
Figure 12.12. The work done in moving the charge q0 from point 1 to
point 2 is independent of the path taken. We say that the electric force
is a conservative force.
a conservative force, which means that the work done in moving
an object from one point to another in a gravitational ®eld
depends only on the initial and ®nal points and not on the path
through which the object moves.
An electric charge q0 placed in the electric ®eld of another
charge q feels a force F that is proportional to the product of the
magnitudes of the two charges and inversely proportional to
the square of their separation r1 . When the charge q0 is brought
in closer to the charge q, at a distance r2 , the only thing that
changes in the expression for the force between the two charges
is the distance between them. The path taken in moving the
charge q0 from the ®rst position to the second does not matter
(®gure 12.12). Like the gravitational force, the electric force is
also a conservative force.
If the charges q and q0 are both of the same kind, we would
need to do work to bring q0 closer to q. When we do that, the total
energy of the particle carrying the charge q0 increases by an
amount equal to the work done. This increase in total energy
appears as an increase in electric potential energy, PE. When the
positive charge q0 is moved away from the charge q, there is a
decrease in electric potential energy, since the two positive charges
repel each other. On the other hand, if the two charges are
oppositely charged, as when a positive charge q0 is placed in
the electric ®eld of a negative charge q or a negative charge q0 is
placed in the electric ®eld of a positive charge q, moving q0
closer to q would result in a decrease in electric potential and
261
SUPERSTRINGS AND OTHER THINGS
moving them apart would result in an increase in electric potential, since the two charges attract each other.
The magnitude of the change in electric potential energy
depends on the magnitude of the charge q0 . It is convenient,
then, to have a quantity that does not depend on this charge
and for this reason we de®ne the electric potential difference, V, as
the change in electric potential energy of a charge q0 divided by the
magnitude of that charge; that is,
V
PE
:
q0
The SI unit of potential difference is the volt (V). From the de®nition, 1 volt equals 1 joule divided by 1 coulomb, or
1V
1J
:
1C
The volt was named in honor of Count Alessandro Volta, a professor of physics at the University of Pavia, Italy, who invented
the electric battery. We sometimes refer to potential difference
as voltage, a term derived from the name of the unit. From the
de®nition of potential difference we can see that the electron volt
is a unit of energy. A particle with a charge equal to that of the
electron which is moved between two points in an electric ®eld
so that their potential difference is 1 volt will change its potential
energy by 1 electron volt:
PE qV 1 eV 1:6 10ÿ19 CV 1:6 10ÿ19 J:
Storing electrical energy
As we have seen, we must do work to move an electric charge in
an electric ®eld and this changes the potential energy of the
charge. Could we store this potential energy for later use? In
1746, the Dutch physicist Pieter van Musschenbroeck who was
professor of physics at the University of Leyden attempted for
the ®rst time to do just that. He suspended a metal jar ®lled
with water from insulating silk threads and led a brass wire
from a charged cylinder into the water. A student who was assisting Musschenbroeck with the experiment happened to touch the
brass wire and became the ®rst person to receive an arti®cially
262
Electricity
Figure 12.13. When a small amount of charge is transferred between
two uncharged metal plates, a small potential difference in created.
This device is a parallel plate capacitor.
produced electric shock. Musschenbroeck realized that he had
accidentally discovered a way of storing charge. News of the
experiment spread rapidly and soon ``Leyden jars'' were being
built and improved upon in many other laboratories. Today, a
device that has the capacity of storing electrical energy, like the
Leyden jar, is called a capacitor.
We can store electrical energy by creating an electric ®eld.
One way of creating an electric ®eld is with two uncharged
metal plates separated by a distance d. When some small
amount of charge is transferred from one plate to the other, a
small potential difference appears (®gure 12.13). If we continue
the process of transferring charge, we end up with a potential
difference V between the plates and an electric ®eld E in this
region. It takes work to separate the charges and create the ®eld
and this work becomes the potential energy that is stored. This
particular device is called a parallel plate capacitor.
Capacitors can have different shapes. The amount of charge
that can be stored in a capacitor at a given potential depends on
its physical characteristics. Volta introduced the expression ``electrical capacity,'' in analogy with heat capacity, to indicate the
storage capacity of these devices. Today we call this concept capacitance. The smaller the voltage needed to store a given charge, the
greater is the capacitance.
Many capacitors have a nonconducting material between the
charged plates. This increases the capacitance. If the insulating
material is air, the increase is very small; about 6 parts per
10 000. For other materials the increase can be much greater.
Glass, for example, increases the capacitance by a factor of
between 5 and 10. The increase in the capacitance is due to a
reduction in the potential difference between the plates (since
263
SUPERSTRINGS AND OTHER THINGS
capacitance is inversely proportional to the potential difference).
When the plates are charged, the charges in the nonconducting
material between the plates reorient themselves, so that the positive charges point toward the negatively charged plate and the
negative charges toward the positive plate. This orientation sets
up an electric ®eld in the nonconductor that is in the opposite
direction to the electric ®eld between the plates. When the ®eld
is reduced, the potential difference is also reduced, thus increasing the capacitance.
An animal cell is a living example of a capacitor. The membranes of cells are generally composed of lipid or fat molecules
and protein molecules oriented so that the inner part of the membrane containing lipid and cholesterol molecules is sandwiched
between layers containing protein molecules. The membrane
The frontiers of physics: Storing single electrons
A new device, developed by Mark W Keller and his collaborators at the National Institute of Standards in Boulder,
Colorado, allows these researchers to individually count
and store millions of electrons, one by one, into a specially
developed capacitor. With their electron pump, as the
device is called, individual electrons are transferred into a
capacitor for storage. The pump consists of an array of six
microscopic bullet-shaped regions of aluminum, separated
by small walls of aluminum oxide. Electrons are allowed to
pass through the aluminum islands only when an electric
current is applied to the islands. Timed electrical pulses
push the electrons from island to island until they reach the
capacitor.
The researchers have used their pump to count millions
of electrons, missing only one in 70 million. The main application for the pump, however, is not to count electrons, but to
increase the accuracy in the determination of capacitance. By
knowing exactly how many electrons are stored in a capacitor
(which gives the total charge stored) and measuring the
voltage, Keller and his collaborators can calculate the capacitance with an accuracy not possible previously.
264
Electricity
Physics in our world: Inkjet printers
An inkjet printer uses a print head that shoots ink at the paper.
A resistor inside the print head heats a thin layer of ink which
expands into a vapor bubble. The expansion forces the ink
through a small nozzle which causes it to break up in droplets
a tenth of a millimeter in diameter. The print head shoots
about 6000 droplets every second at speeds of about 15 m/s
®rst through a charging device and then through the charged
plates of a capacitor. In places where the paper is to be left
blank, an electric change is deposited on the droplets. As the
charged droplets pass through the electric ®eld that exists
inside the capacitor, they are de¯ected away from the paper,
back to the ink reservoir. When the ink is to hit the paper,
the charging device is turned off and the ink droplets are
left uncharged. The neutral droplets ¯y unde¯ected through
the capacitor and hit the paper.
Color inkjet printers have four ink reservoirs containing
black, cyan, magenta, and yellow ink, the standard colors
used in commercial printing. By mixing these four colors,
all other colors can be obtained.
265
SUPERSTRINGS AND OTHER THINGS
wall of a cell separates two regions that contain potassium ions
(positive) and chloride ions (negative). Thus, a cell is like a very
small capacitor where the positive and negative charges are
separated by the nonconducting membrane wall. The potential
difference across a typical cell is of the order of 100 mV.
266
13
APPLIED
ELECTRICITY
Conductors and insulators
As we have seen, rubbing a glass or an amber rod with a piece of
cloth produces an electric charge on the rod (®gure 13.1(a)). If the
rod is made of metal, no charge develops (®gure 13.1(b)).
However, touching a metal rod with a charged object will cause
the metal to become charged (®gure 13.1(c)). The reason for this
behavior is that the atoms in a metal have some electrons that
are not tightly bound to their nuclei and are free to move about.
Any excess charge readily moves in conductors. In metals, then,
electric charges move or ¯ow through the material. We say that
metals are good conductors of electricity. In glass, amber or other
materials like them, on the other hand, electrons are not free to
move; they are bound to individual molecules or atoms. Any
excess charge placed on them remains (unless they are touched
by some other object). These materials are called insulators.
Plastics, wood, and rubber are examples of good insulators.
Pure water is also an insulator. Tap water, however, contains
salts that form ions which can move through the liquid, making
it a good conductor.
There are some materials, called semiconductors, that are
intermediate between conductors and insulators. Modern electronics has developed due to the discovery of the properties of
these materials. The electrical conductivity of semiconductors
can be enhanced by the addition of traces of other elements
with a slightly different electronic structure. As we shall see,
these impurities provide an additional electron or the lack of
one, which results in a negative or positive charge that can
move around.
267
SUPERSTRINGS AND OTHER THINGS
Figure 13.1. (a) Rubbing an amber rod with silk produces an electric
charge on the rod. (b) No charge is produced on the metal rod. (c) Touching a metal rod with a charged amber rod causes the metal to become
charged.
Electric current and batteries
During the second half of the eighteenth century, a ¯ow of electric
charge could be produced only by discharging a Leyden jar. In
1800, Alessandro Volta, a professor of physics at the University
of Pavia, Italy, discovered that a stack of discs of silver and
zinc, interspersed with wet pasteboard and held with an insulating handle, produced a separation of charge and a potential
difference between the two metals. When the ®rst and last discs
were connected by a conductor, Volta obtained a ¯ow of electric
charge or current through the conductor with the same properties
as the current obtained by discharging a Leyden jar, but with the
important difference that the ¯ow was more or less continuous.
Volta called his device a battery (®gure 13.2).
268
Applied Electricity
Figure 13.2. Alessandro Volta reported his battery to the Royal Society
as, ``an assemblage of a number of good conductors of different sorts . . .
30, 40, 60 pieces or more of copper, or better of silver, each in contact with
a piece of tin, or what is much better, of zinc, and an equal number of
layers of water . . . or pieces of cardboard or leather . . . well soaked.''
(# Bettman/CORBIS.)
An electric current exists whenever there is a net ¯ow of
charge. A potential difference is needed for a ¯ow of charge to
exist, in much the same way that a stone would not fall to the
ground unless it is at some height above the ground; that is,
unless there is a gravitational potential difference with respect
to the ground. In a television tube, for example, electrons are
accelerated due to a potential difference. The electron beam that
strikes the phosphor-coated screen and produces the image
constitutes an electric current. An electric current can exist in a
conductor provided a potential difference exists between two
points in the conductor. When the ends of a metal wire, for
example, are connected to a battery, an electric ®eld is created
inside the wire which acts on the electrons that are free to move
in the metal, moving them, thus producing an electric current.
If during a time t an amount of charge q ¯ows past a particular
269
SUPERSTRINGS AND OTHER THINGS
Figure 13.3. The conventional direction of the ¯ow of current in a conductor is opposite to the direction of motion of the electrons, the carriers
of the current.
point in a conductor, the electric current i is given as
electric current
charge that flows
time
or
q
i :
t
Electric current is the rate at which charge ¯ows in a conductor.
The unit of current is the ampere (A), named in honor of the
French physicist Andre Marie AmpeÁre, and is equal to one
coulomb per second. It is a fundamental SI unit.
Maintaining an electric current requires the maintenance of a
potential difference between two points. Volta's battery did just
that; it provided electrical energy so that a potential difference
could be maintained between the two metals (called electrodes).
Before a good understanding of the atomic nature of matter
was achieved, it was believed that the charge carriers that moved
in a conductor were the positive charges and the direction of electric current was chosen to be that of the positive charges. Today,
we know that it is electrons that move but the convention of the
direction of current has not been changed (®gure 13.3). We can
still apply this convention if we understand that the view of the
negatively charged electrons moving in one direction with
respect to the stationary positive ions is equivalent to the positive
ions moving in the opposite direction with respect to the negative
electrons.
Ohm's Law
In a metal, the atoms are arranged in a crystal lattice with a large
number of electrons that are free to move around in the metal.
270
Applied Electricity
Physics in our world: Electric cars
The concept of automobiles powered by an electric motor that
is run by batteries is not new; the essential battery technology
was developed towards the end of the nineteenth century
and by 1900 many electric cars were being manufactured.
Due to the weight of the large batteries required and the
need to recharge them at fairly short intervals, electric cars
were heavy and slow to operate. The development of lighter
materials and recent advances in battery technology have
made possible feasible electric cars. General Motors' EV1,
the ®rst commercial electric car, uses twenty-six 12-volt
lead±acid batteries that take 3 hours to charge and give the
car a range of 70 to 90 miles.
Gasoline-powered automobiles use a lead±acid battery
with cells which have lead peroxide and metallic lead as electrodes in a sulfuric acid solution. Since the total surface area of
the plates is large, the battery can generate large currents for
short periods of time, as required when starting the engine.
In an electric car, however, the source of power comes not
from the combustion of gasoline but directly from the battery.
Standard lead-acid batteries are not suitable for the continued
supply of energy needed to run the automobile. The gasrecombinant lead±acid batteries in the EV1 hold the liquid
acid in special ®brous bags located between the lead plates,
and have an all-up weight of 1175 pounds.
50 µm Teflon support
with 2 mm carbon
current collector
20 µm TFPT anode
1 mm gel electrolyte
20 µm polymer cathode
50 µm Teflon support
with 2 mm carbon
current collector
271
SUPERSTRINGS AND OTHER THINGS
Several kinds of advanced batteries are being developed.
The most innovative ones involve the use of polymer electrodes and electrolytes (the conductive medium that electrically connects the electrodes). These all-plastic batteries can
be rolled up and molded into small cavities in the car's
body. Although very safe, researchers still need to increase
the electrical conductivity of the polymer before these batteries can be made commercially viable.
In addition to batteries, other technologies are being
considered for electric vehicles. Among the replacements
are devices to store large amounts of electric charge and
which can discharge quickly; ¯ywheels, which store energy
in a spinning rotor; and fuel cells, which convert chemical
energy into electricity.
These electrons, called conduction or valence electrons, are in
continuous motion like gas molecules, bouncing off the lattice
ions in such a way that, in the absence of an electric ®eld, their
average velocity is zero. Bumper cars at an amusement park
move fairly fast between collisions; they, however, do not go
anywhere very much (®gure 13.4). Unlike the bumper cars, electrons do not collide with each other but with the atoms in the
lattice. In metals, about one electron per atom is used in the conduction process and the atoms in the lattice are actually positive
ions, since they have contributed one of their electrons to this
process.
When a potential difference is applied across the metal, an
electric ®eld E appears in the metal and a force of magnitude eE
acts on the electrons. A conduction electron is then accelerated
to large speeds in the direction of this force. Before long, however,
the electron collides with an ion in the lattice, bounces off, and is
accelerated again in the same direction only to collide again with
another ion. The net result is that conduction electrons move
along the wire at very small drift velocities, of the order of
1 mm/s, in spite of the large velocities acquired in between collisions (®gure 13.5(a)). We could illustrate the effect of this
applied potential with the bumper cars example if we imagine
a surface where the cars move to become tilted (®gure 13.5(b)).
272
Applied Electricity
Figure 13.4. Bumper cars at an amusement park collide continuously
with each other and do not go anywhere.
Although the cars still collide with each other and move in all
directions, the sloped surface makes them move in that direction
somewhat more often than in the others. There is a drift velocity
towards the lower end of the surface. We should be careful
with this analogy, however. A single electron does not necessarily
move from one end of the conductor to the other. It is energy and
momentum that is transmitted through the conductor.
A metal, then, offers some resistance to the ¯ow of current
through it. When we apply a potential difference V to the two
ends of a metal wire, a current i appears in the wire. This current
increases as the applied potential difference increases. In our
analogy with the bumper cars, raising the slope increases the
¯ow of cars towards the lower part, thus increasing the ``current.''
The magnitude of the current in the wire depends on several
factors. One is the type of metal used; different lattice con®gurations interact in different ways with the conduction electrons. The size of the wire also affects the amount of current. A
thin wire presents a greater resistance to the ¯ow of current
than a thicker wire, and a longer wire also presents a greater
resistance.
273
SUPERSTRINGS AND OTHER THINGS
Figure 13.5. (a) The drift velocity of the conduction electrons in a metal
is very small compared with the velocities of the electrons between
collisions with the lattice ions. (b) A gravitational potential provided
by the inclined surface makes the cars drift towards the lower part.
We can see that, although we speak of the conduction electrons as free to move, they do not really move freely inside the
conductor. For this reason, the presence of the external force eE
does not accelerate the electrons in a way that makes the current
increase continuously. Rather, the electrons promptly reach a
steady-state situation so that current and voltage are related by
a very simple relationship, as the German physicist Georg
Wilhelm Ohm (1787±1854) discovered. The current i ¯owing
through a conductor is directly proportional to the voltage V
that exists between the two ends of the conductor, that is
i/V
or
i constant V:
274
Applied Electricity
The frontiers of physics: Electric dentists
According to recent statistics, almost 85% of all 17-year-olds
have already had several cavities. The reason for its prevalence is the dif®culty in detecting cavities early on. Tooth
decay starts in the enamel coating of the tooth where foods
are fermented by bacteria, producing acids that erode the
mineral in the tooth enamel. By the time this demineralization becomes detectable, it may be too late for ¯uoride treatments to have any healing effect.
A method recently discovered by researchers at the
universities of Dundee and St Andrews in Scotland, and the
University of Nijmegen in the Netherlands makes use of
the change in electrical resistance of the eroded regions in
the tooth enamel. These regions are ®lled with ¯uids that
have smaller resistances than enamel. Measuring the resistance at different places in the tooth reveals the presence of
cavities. The researchers used the technique on extracted
teeth with perfect accuracy. The next step in the researchers'
schedule is to obtain funds to make this promising method
available clinically.
This simple expression, known as Ohm's law, is usually written as
V
or
V iR
R
where R is the resistance of the conductor and has the units of ohms
( ) (Greek capital omega). From this expression, we can see that
1 equals 1 V/1 A.
Although it bears the name of a law, Ohm's law is not a fundamental law of nature like Newton's law of universal gravitation.
Rather, it is a result of experimental observations and is valid
only for certain materials within a limited temperature range.
i
Simple electric circuits
The electric circuit in a ¯ashlight is one of the simplest we can
study. It consists of one or more batteries, a metal conductor, a
275
SUPERSTRINGS AND OTHER THINGS
Figure 13.6. (a) A simple circuit in a ¯ashlight. (b) Representation of
this circuit with a single battery, a conducting wire, a switch and a
light bulb. (c) Hydraulic analogue with a pump, pipe, a valve, and a
paddle wheel. Water is pumped to a high potential (tank) from where
it falls, transforming potential energy into kinetic energy. Some of this
energy is used to drive the paddle wheel. This low potential water is
returned to the pump where the circuit is completed.
switch, and a light bulb (®gure 13.6(a)). A diagram of this circuit
is shown in ®gure 13.6(b). A hydraulic analog of this circuit
appears in ®gure 13.6(c). Water ¯owing from the tank (high
potential) falls through the pipe where it moves a paddle
wheel, thus performing work. Low pressure water is then
pumped up to the tank. Water ¯ows in this circuit only when
the valve is open. Similarly, in our electric circuit, the battery
``pumps'' up the bulb terminals to a high potential. Work is
performed as the current ¯ows through the narrow ®lament of
the bulb, producing light and heat. Low potential current returns
276
Applied Electricity
to the battery where it is ``pumped'' up again. Current ¯ows in
the circuit only when the switch is closed.
Again, we must be careful with our analogy. In the hydraulic
case, the water molecules actually move around in the pipes. In
the electric circuit, the electric ®eld that exists in the conductor
acts on the conduction electrons which then move. The ions
remain ®xed in the lattice.
The light bulb in our simple circuit dissipates energy (as light
and heat) by means of its resistance. Electric heaters, toasters,
electric stoves and ovens are other examples of devices that dissipate energy by means of their resistance. As electrical energy
dissipates, electric potential drops. There are situations where it
is desirable to lower this electric potential or voltage in a circuit.
For these cases, a resistor is used. A resistor is merely a device that
dissipates electrical energy; it can be a strong narrow wire offering a greater resistance to the ¯ow of current than the wires used
in the circuit. Usually, however, it is made of ceramic materials
with a low conductance.
To simplify the graphic representation of electric circuits,
certain standard symbols are used. The internationally accepted
symbols are:
Table 13.1. Some standard electrical symbols.
277
SUPERSTRINGS AND OTHER THINGS
Figure 13.7.
A schematic diagram of an electric circuit.
A schematic diagram of an electric circuit consisting of a 1.5 V
battery, a 10 resistor and a switch is shown in ®gure 13.7. It is
customary to include the values of the different components.
Resistor combinations
There are two ways of connecting items such as light bulbs into an
electrical circuit: series and parallel. In a parallel connected circuit
such as household lighting, every lamp is connected separately to
the input line, and any one of them can be turned on or off without affecting the others. Series connection have the lamps ``strung
out,'' so that if one of the lamps fails they all go out. Christmas
tree lights are connected this way. We shall study these two
types of circuits brie¯y.
In a series circuit, like the Christmas tree light bulbs, current
passes through each one of the light bulbs one after the other. In
®gure 13.8 we have a series combination of two light bulbs of
resistances R1 and R2 . The total resistance in the circuit is the
sum of the resistances of the two bulbs; that is, Rs R1 R2 .
This last expression means that if we have two light bulbs with
resistances of 5 and 8 connected to a 1.5-V battery, we could
replace them with a single light bulb having a resistance of
13 , without changing the amount of current ¯owing through
the circuit.
From the law of conservation of energy, the potential or
voltage delivered by the battery in ®gure 13.8 must equal the
potential drop through the ®rst light bulb plus the potential
drop through the second. This is analogous to the situation
shown in ®gure 13.9. The potential energy gained by the girl as
278
Applied Electricity
Figure 13.8. (a) Two light bulbs connected in series to a battery. (b)
Schematic diagram of this circuit. The resistances of the two bulbs are
shown as R1 and R2 , and the potential drops as V1 and V2 , respectively.
The voltage supplied by the battery is shown as V.
she climbed to the top of the double slide equals the sum of the
potential energies lost in the ®rst and the second slopes. Similarly,
the potential increase delivered by the battery should equal the
sum of the potential drops through the two light bulbs.
In a parallel connection, the current splits into one or more
branches. In ®gure 13.10, two light bulbs of resistances R1 and
Figure 13.9. The child sliding down the double slide loses potential
energy in the two sections of the slide which is equal to the potential
energy gained when she climbed to the top. This is analogous to the
potential differences in the circuit of ®gure 13.8.
279
SUPERSTRINGS AND OTHER THINGS
Figure 13.10. (a) Two light bulbs connected in parallel. (b) Schematic
diagram of this circuit.
R2 are connected in parallel, forming two branches. The current i
coming from the battery splits into i1 , which ¯ows through R1 ,
and i2 , ¯owing through R2 . Since these are the only two possible
paths for the current, the current from the battery must equal the
sum of the individual currents ¯owing through each one of the
branches. That is, i i1 i2 .
We can see that both light bulbs are connected directly to the
battery by means of conducting wires. This means that each light
bulb draws current from the battery independently of the presence
of the other light bulb. We could, in principle, connect as many
light bulbs as we wanted without affecting the ®rst light bulb.
The only limitation is the ability of the battery to provide electrical energy to many light bulbs. In a parallel connection, the total
resistance of the circuit decreases as more branches are added,
since there are more paths for the current to ¯ow.
Electrical energy and power
An electric current can exist in a region of space that has been
evacuated, for example the electron beam in a television tube.
In particle accelerators, like SLAC, the two mile electron accelerator at Stanford University, electrons are accelerated along the two
mile tube and emerge at one end traveling at speeds close to the
speed of light to strike various targets. Scientists study these
collisions to gain a better understanding of the sub-microscopic
world.
When current exists in a conductor, however, the collisions of
the electrons with the ions in the crystal lattice transfer energy to
280
Applied Electricity
these ions and this results in an increase in the internal energy of
the material. As we know, an increase in internal energy means
an increase in thermal energy and, therefore, an increase in
temperature. The ¯ow of current in a conductor produces heat.
The amount of heat produced by an electric current ¯owing
in a conductor depends on the magnitude of the current and on
the resistance of the conductor. The ®lament of a light bulb, for
example, is made of tungsten, a metal that melts at 33878C.
When enough current ¯ows through it, the metal heats up to
around 26008C, radiating energy as heat and visible light. Current
¯owing through the heating element of a toaster, a metal coil, produces heat at a lower temperature, which is used to toast the
slices of bread. When this same amount of current ¯ows through
the toaster cord, a copper wire of much greater cross sectional
area and therefore much lower resistance, no appreciable heat
is produced.
When an electric current i ¯ows through a light bulb, an
amount of charge q ¯ows through it in a certain time interval.
The rate at which the battery performs work in moving this
charge is called electric power. Electric power is equal to voltage
times current, or
P Vi:
By Ohm's law, V iR; therefore, electric power can be expressed
in terms of current and resistance as
P Vi i2 R:
The SI unit of power, remember, is the watt, W, which is joules
per second. Since P Vi, then 1 W 1 V 1 A:
James Joule, who developed an experiment to measure the
mechanical equivalent of heat, also developed a way of measuring the heat dissipated by an electric current and determined
that it was proportional to the square of the current in the conductor. For this reason, the expression P i2 R is known as Joule's law.
Semiconductors
Electric current can ¯ow in a vacuum, through a conductor such
as a metal wire, and through semiconductors. As we said early in
281
SUPERSTRINGS AND OTHER THINGS
Figure 13.11. Energy band structure of a solid. In insulators, the energy
gap is large. In semiconductors, it is small.
the chapter, semiconductors are materials with electrical conductivities that are intermediate between those of conductors and
insulators.
The properties of a solid depend not only on its constituent
atoms but also on the way these atoms are stacked together.
When two identical atoms are brought together, for example,
each one of their energy levels split in two, with the separation
between the pairs of levels depending on the distance between
the atoms. If four atoms are brought closer together, their
energy levels split into four. In a solid, there are billions of
atoms very close together, and each one of their energy levels
splits into billions of very closely packed levels. In one gram of
sodium there are about 1022 atoms of sodium and each one of
the energy levels in sodium splits into 1022 levels so close together
that they can be considered to be a single energy band. The bands
from all the different energy levels are separated by a forbidden
energy gap, in which no electron can exist (®gure 13.11). In good
conductors, the outermost occupied energy band, called the
valence band, is not completely ®lled. In sodium, a good conductor, this valence band is only half-®lled so that an electron in
one gram of sodium has about 5 1021 different possible, allowed
energy states. In nonconductors, on the other hand, the valence
band is completely ®lled and there are no available levels for
an electron to move into. The energy gap between the valence
band and the next band, called conduction band, is large. When
the energy gap is small, the solid is a semiconductor. The conduction band is where the electrons that conduct electricity reside.
282
Applied Electricity
Silicon and germanium are the most common semiconductor
materials used in electronics. Pure semiconductors, however, are
of no great practical importance. In contrast to a metal, where
almost every atom contributes one or two electrons that conduct
electricity, in a pure semiconductor, only one atom in a billion
contributes an electron. When small amounts of impurities, at
the level of one in a million, are added to a semiconductor in a
process called doping, the conduction properties of semiconductors can be enhanced. Silicon and germanium have four valence
electrons which almost ®ll the valence band. An ``impurity''
atom with ®ve valence electrons (such as arsenic or phosphorus)
contributes an extra electron which does not bond with the surrounding silicon atoms (®gure 13.12(a)). This additional electron,
being loosely bound, can easily jump up to the conduction band
Figure 13.12. (a) In an n-type semiconductor, the loosely bound extra
electron from the impurity atom contributes to the conductivity. (b) In
a p-type semiconductor, the missing electron leaves a hole in the electron
structure. When a nearby electron moves in to ®ll the hole, it leaves a hole
where it was, in essence moving the hole in the opposite direction.
283
SUPERSTRINGS AND OTHER THINGS
when it gains a small amount of energy, thus contributing to the
conductivity. Semiconductors doped with these donor atoms are
called n-type semiconductors because the charge carriers are the
negatively charged electrons.
When a piece of silicon is doped with impurity atoms with
only three valence electrons, like boron or aluminum, there is
an electron de®ciency which leaves the lattice lacking one bond
(®gure 13.12(b)). This electron de®ciency constitutes what has
been called a hole. In this case, there is a tendency to capture electrons from the valence band, in effect moving a nearby electron to
that site and transferring the hole to the location previously occupied by the electron. Since the hole represents an absence of a
negative electron and moves in the opposite direction to the
electrons, these holes can be considered positively charged. Semiconductors doped in this way are called p-type semiconductors
because the charge carriers are the positively charged holes.
Transistors, diodes, and other solid state electronic devices
can be manufactured by joining different types of semiconductors. The simplest solid state device is a p±n junction diode,
which allows current to pass in only one direction. When an ntype semiconductor and a p-type semiconductor are brought
together, some electrons from the n region drift into the p
region and some holes from the p region drift into the n region.
This migration of electrons and holes is due to the unequal concentration of charges in the two regions. The electrons that drift
into the p region move into the holes near the boundary, neutralizing the free charge carriers. However, because each region is
electrically neutral (since the electrons and holes come from
neutral impurity atoms), this diffusion of charges near the boundary creates a layer of positive charges in the n region and a layer of
negative charges in the p region. A barrier region is then formed
where an electric ®eld appears (®gure 13.13). This electric ®eld
prevents any further diffusion of charges between the two
regions since an electron from the n region that makes it into
the barrier will be repelled by the layer of electrons in the p
region; likewise, a hole that moves into the barrier region will
be repelled by the layer of holes in the n region.
Suppose we connect a battery to a p±n junction with the positive terminal connected to the p side and the negative to the n side
as in ®gure 13.14(a). Electrons coming from the battery recombine
284
Applied Electricity
Figure 13.13. The migration of electrons in the p±n junction creates a
barrier region that prevents further diffusion.
with the holes of the barrier layer in the n region, neutralizing
them. This recombination has the effect of lowering the potential
difference that was set up by the electric ®eld in the barrier
region. Electrons can now move from the n side into the p side
to complete the circuit thus allowing the ¯ow of current. The
junction is said to be forward biased.
When the polarity of the battery is reversed so that the positive terminal is connected to the n side and the negative to the p
side, electrons are pulled from the n region and holes from the
p region. When electrons or holes are drawn from near the barrier
region, the charge of opposite sign is enhanced thus increasing
the potential difference across the barrier. An increase in this
potential further prevents the ¯ow of charge across the junction
and no current ¯ows through it. The junction is now connected
in reverse bias (®gure 13.14(b)).
A p±n junction acts like a discriminating switch, preventing
the ¯ow of current in one direction and allowing it in the other
Figure 13.14. (a) Current ¯ows through the junction when it is forward
biased. (b) When the junction is reverse biased, no current ¯ows through it.
285
SUPERSTRINGS AND OTHER THINGS
Figure 13.15. A p±n±p transistor. A small change in the voltage to the
base produces a large change in the collector.
direction. Such a device, called a diode, is able to rectify an alternating current; that is, to change it into direct current since it allows
current to pass in only one direction when the voltage is applied.
In 1947 three American physicists working at the Bell
Telephone Research Laboratories, John Bardeen, Walter Brattain
and William Shockley, invented the transistor. Their ®rst transistor, a point-contact design, consisted of a wedge of semiconductor
of about 3 cm on each side. This device was followed in 1951 by
the more reliable p±n±p transistor with a thin layer of n-type semiconductor sandwiched between two thicker layers of p-type
semiconductor.
The three regions in a p±n±p transistor are called the emitter,
base, and collector. Suppose that we connect a p±n±p transistor to
two batteries, as shown in ®gure 13.15, so that the emitter±base
p±n junction on the left is forward biased while the base±collector
junction on the right is reverse biased. The positive terminal connected to the emitter side on the left pushes holes in the p region
toward the emitter±base junction. Since this junction is forward
biased, it acts as a diode, allowing the ¯ow of holes from emitter
to base. Once in the n region, these holes would come under the
in¯uence of the second battery, moving into the collector due to
the attraction of the negative terminal of this battery.
Not all the holes that enter the base region travel through to
the collector. About 2% of them combine with the free electrons
in this region, thus producing a small base current. Because the
base is only about one micrometer in thickness, most of the
holes (the remaining 98%) pass through, forming the collector
current. The result of an increase in the base potential is a large
increase in collector current and a small increase in base current.
286
Applied Electricity
If we were to connect the voltage coming out of a compact disc
player in series with the battery on the emitter side and a loudspeaker in series with the battery on the collector side, any small
change in the input voltage of the CD player would produce a
large change in the current ¯owing through the loudspeaker,
thus amplifying the input voltage. A transistor is useful because
it can amplify a small signal into a larger one.
Superconductors
As we have learned, the resistance to the ¯ow of current is caused
by electrons being scattered by interactions with the vibrating
atoms in the metal lattice. When the temperature decreases, the
vibrations decrease and therefore the resistance decreases. However, we would not expect the resistance to drop to zero but
rather to reach a minimum constant value at a certain low temperature. Below this temperature, an additional decrease in the
vibrations of the atoms in the lattice would not be expected to
appreciably affect the ¯ow of current. The presence of defects
in the lattice and the fact that the vibrations never stop completely
regardless of how close we get to the absolute zero would seem to
indicate that the resistance should never reach zero.
In 1911, the Dutch physicist Kamerlingh Onnes discovered
that, below a certain critical temperature, the electrical resistance
of certain metals vanished completely, the metals becoming
superconductors. He had recently accomplished the liquefaction
of helium and was measuring the electrical resistance of metals
at the newly achieved low temperatures (helium boils at 4.2 K
at a pressure of one atmosphere) when he discovered that
below 4 K the resistance of mercury dropped to zero. Soon he
found that other metals showed the same property at these
very low temperatures.
Superconductivity seems to contradict what we have learned
of the behavior of matter. How can superconductivity be understood? In 1935, the brothers Heinz and Fritz London were carrying out experiments in superconductivity in Oxford when they
realized that the observed effects could be explained, at least
partially, if the conduction electrons moved as a unit, as if they
were linked together like the cars of a train. Although this idea
287
SUPERSTRINGS AND OTHER THINGS
Figure 13.16. When the ions of the lattice are drawn towards the path of
the electrons, a positive region is created that attracts other electrons.
explained why the electrons could move throughout the crystal
lattice without being stopped, it also presented problems; electrons are negatively charged and repel each other. It was dif®cult
to understand how they could remain together.
During the early 1950s, the English physicists David Bohm
and Herbert FroÈhlich advanced a theory that could explain the
problem of the electron repulsion. The key is in the ions of the
lattice. A metal contributes an average of one electron per atom
to the conduction process. These conduction electrons move
about freely, leaving behind the positive ions that form the lattice.
According to Bohm and FroÈhlich, when the electrons in a superconductor pass through the lattice ions, the negative charge of the
electron attracts the positive ions, drawing them closer together.
Since the ions are much more massive than the electrons, they
take a little longer to separate back to their original positions,
thus creating a slightly greater concentration of positive charge
that attracts other electrons which might be following a similar
path (®gure 13.16). Eventually, the ions push away from each
other, due to their mutual electrical repulsion, moving past the
initial position. This gives rise to a vibration, called a phonon, in
the crystal lattice.
In 1956, Leon N Cooper, at the University of Illinois, proposed that the interaction between the vibrating lattice and the
electrons does not just unify the electric charge but creates pairs
of electrons that behave as a single particle. In the physics department at the University of Illinois there was at the time a shortage
of space and Cooper had to share an of®ce with John Bardeen.
John R Schrieffer was Bardeen's graduate student and the three
decided to extend Cooper's work on Cooper pairs, as the electron
pairs came to be known, to the entire lattice. After spending a
great deal of time on the problem, Schrieffer felt he was getting
288
Applied Electricity
nowhere with it and was thinking of changing his thesis research.
At about this time, Bardeen had to travel to Stockholm to receive
the Nobel prize for his work on the invention of the transistor and
asked Schrieffer to work on the problem for one more month. In
this month Schrieffer realized that the two electrons in a Cooper
pair had opposite velocities that added up to a net momentum of
zero. This allowed him to express the problem in a more manageable form. His result became the basis for a more complete theory
of superconductivity, later known as BCS theory. The three
received the Nobel prize in physics in 1972 for this theory,
making Bardeen the only person ever to win two Nobel prizes
in the same subject.
According to BCS theory, when the temperature falls below
the critical temperature, the electrons act collectively as they
interact with the lattice ions, forming Cooper pairs. The energy
of these superconducting electrons is lower than when they act
individually, at higher temperatures. There is an energy gap
between the superconducting state and the normal, nonsuperconducting state. At very low temperatures, below the critical
temperature, there is not enough energy available to excite the
electrons from the lower superconducting state to the higher nonsuperconducting state. The conduction electrons therefore
remain in this superconducting state. Scientists have been able
to maintain steady currents in superconducting rings for several
years with no measurable reduction.
Until early 1986, superconductivity required extremely low
temperatures obtainable only by cooling the materials with
expensive liquid helium. By 1973, however, scientists discovered
by accident that certain oxides of some rare earth elements, which
are normally insulators, could become superconductors. In January 1986, J Georg Bednorz and K Alex MuÈller, of the IBM Zurich
Research Laboratory in Switzerland, found that a compound of
barium, lanthanum (a rare earth), copper and oxygen became
superconductor at 35 K. Shortly after the results were published,
the American physicist Paul C W Chu, of the University of
Houston and his collaborator, professor Mau-Kuen Wu of the
University of Alabama, began a series of experiments with similar compounds using different rare earth elements, achieving
superconductivity at the incredible temperature of 98 K. This
means that the new superconductors can be cooled to these
289
SUPERSTRINGS AND OTHER THINGS
Figure 13.17. Superconducting wire developed at Argonne's National
Laboratory. (Courtesy Argonne National Laboratory.)
levels with liquid nitrogen, which lique®es at 77 K and is inexpensive. MuÈller and Bednorz won the 1987 Nobel Prize in physics for
their discovery.
Currently, researchers have achieved superconductivity at
temperatures as high as 135 K and are developing new materials,
such as superconducting wires and ®lms (®gure 13.17). These
new materials would eventually lead to extremely ef®cient electric motors, generators, and transmission lines, which could
lead to new ways of storing energy.
The original BCS theory not only did not predict superconductivity at these higher temperatures but it actually fails to
explain it at temperatures above 40 K. Theoreticians are hard at
work trying to modify the theory, or to come up with a new
one, so that high temperature superconductivity can be understood.
290
14
ELECTROMAGNETISM
The discovery of magnets
''At this point, I will set out to explain what law of nature causes
iron to be attracted by that stone which the Greeks call from its
place of origin magnet, because it occurs in the territory of the
Magnesians.'' Thus wrote the Roman poet Lucretius in his book
De Natura Rerum (''On the Nature of the Universe'') published
in 55 BC. Written in the form of a long poem, his book barely survived, having been lost throughout the Middle Ages. In 1497, a
surviving manuscript was discovered and a treasure of world
literature was saved for posterity. His poem continues:
Men look upon this stone as miraculous. They are amazed to see it
form a chain of little rings hanging from it. Sometimes you may see
as many as ®ve or more in pendent succession swaying in the light
puffs of air; one hangs from another, clinging to it underneath, and
one derives from another the cohesive force of the stone. Such is the
permeative power of this force.
In matters of this sort it is necessary to establish a number of facts
before you can offer an explanation of them. This may mean approaching the problem by a very roundabout route. For this reason I beg you to
lend me your ears and your mind with particular attentiveness.
Lucretius then proceeds to explain that magnets attract iron
because they emanate a stream of atoms that pushes away the air
between the magnet and the iron, producing a vacuum that the
atoms of iron promptly move to ®ll. This process is aided by
the air behind the iron which pushes it from behind towards
the void.
In the thirteenth century the French scholar Petrus de
Maricourt, known as Peregrinus (''the Pilgrim''), made what
291
SUPERSTRINGS AND OTHER THINGS
were probably the ®rst experiments that have any bearing on
modern ideas of magnetism. Peregrinus, an engineer in the
army of Louis IX, started trying to design a motor that would
keep a planetarium rotating for some time and thought of using
magnets to accomplish the task. In 1269, while in Italy during
the siege of Lucera, Peregrinus described in a letter, Epistola de
Magnete, the results of his investigations with magnets. He
explained how he was able to determine the north and south
poles of a magnet and his discovery that like poles repel each
other while unlike poles attract. In an ingenious experiment,
Peregrinus took a natural magnet which he shaped like a
sphere and marked the directions taken up by a magnetic
needle placed near the surface. All these directions ``will run
together in two points just as the meridian circles of the world
run together in two opposite poles of the world.'' He interpreted
the behavior of the magnetic needle as pointing to the pole of the
celestial sphere,
Peregrinus further wrote that magnetic poles could not be
isolated, because every time a magnet was split in half, two complete magnets, with north and south poles, were formed. In a
second part of this work, Peregrinus presented a detailed work
on the magnetic compass and described an improved ¯oating
compass encircled by a graduated scale.
The modern treatment of magnetism started in 1600 when
William Gilbert published his monumental treatise De Magnete.
Gilbert explained why the compass needle lines up in a north±
south direction: the earth itself is a magnet. He demonstrated
his theory with a globular lodestone similar to the one that
Peregrinus had used and which he called terrela. He laid a magnetic needle at different places on the surface of his spherical
magnet and found that the needle acted just like a compass
needle, pointing toward the north±south direction of the spherical
lodestone.
The magnetic ®eld
We learned in chapter 12 that the electric ®eld describes the property of the space around an electrically charged object. Similarly,
we can say that the magnetic ®eld describes the property of the
292
Electromagnetism
Figure 14.1. The magnetic ®eld around a magnet, shown here with iron
®lings spread on a ¯at surface around a magnet. (From PSSC Physics
Seventh Edition, by Haber-Schaim, Dodge, Gardner, and Shore. Published by Kendall/Hunt Publishing Company, 1991.)
space around a magnet. In ®gure 14.1, we can see the magnetic
®eld lines around a permanent magnet, which resemble the electric ®eld lines around two unlike charges, as seen in ®gure 12.10.
Since magnetic poles may exist only in pairs, we do not see
straight magnetic ®eld lines as was the case with the electric
®eld lines around a single charge shown in ®gure 12.7. In 1931,
the English physicist P A M Dirac postulated the existence of
magnetic monopoles to round off the symmetry between electricity
and magnetism. A magnetic monopole, if it exists, would be a
single north or south pole ¯ying free. Recent theories of particle
physics and cosmology suggest that magnetic monopoles existed
during the early universe. On February 14, 1982 Blas Cabrera of
Stanford University recorded the passage of what appeared to
be a magnetic monopole. Although the searches continue in
many laboratories, due to the failure to detect a second monopole
293
SUPERSTRINGS AND OTHER THINGS
Figure 14.2. Forces between two magnets. (a) If the magnets are brought
together with like poles facing each other, they repel. (b) If they are
brought closer with unlike poles facing, the magnets attract each other.
in the years since, Cabrera later said that the event he reported
might not have been a real one.
The presence of a magnet in a particular place distorts the
space around it in such a way that any other magnetized body
in this region feels a magnetic force. As Peregrinus discovered,
when two magnets with their north poles facing each other are
brought close together, they repel each other. If they are brought
closer with their south poles facing, the magnets also repel each
other. However, when the north pole of one magnet faces the
south pole of the other magnet, the two magnets attract each
other (®gure 14.2). The magnetic poles of magnets exert forces
on each other. About two hundred years ago, the English scientist
John Michell found that the force between magnetic poles obeys
the inverse-square law. The magnetic force F between them is
inversely proportional to the square of the distance r between
the two poles:
F/
1
:
r2
In 1785, Charles Coulomb experimentally measured the magnetic
force between two poles using a torsion balance similar to the one
he used to determine the nature of the electrical force between
two charges, and con®rmed Michell's inverse-square law for
magnetic poles. In addition to being proportional to the square
of their distances, the magnetic force between two poles is
directly proportional to their product, or
F/
qm q0m
r2
where qm and q0m are the strengths of the two interacting magnetic
poles.
294
Electromagnetism
Physics in our world: Magneto-optical drives
A fairly common computer storage device, the magnetooptical drive uses light from a laser to heat a tiny spot on a
magnetic disk to change its magnetic polarity. These disks,
encased in small portable cartridges, come in two sizes,
13 cm and 9 cm in diameter, and are inserted into the drive
mechanism through a slot.
The process of writing information to the magneto-optical
disk is similar to that of writing to any digital magnetic storage
device, such as hard disks, ¯oppy disks, or digital audio
cassettes. This process requires setting the polarity of different
spots on the disk. A negative polarity is interpreted by the
computer as a 0 and a positive polarity as a 1.
Before the polarity is set, a laser beam ®rst heats a small
spot on the rapidly spinning disk to a temperature of about
1508C. At this temperature, called the Curie temperature,
the atoms in the material of the disk have magnetic dipoles
which are oriented at random. When an external magnetic
®eld is present, such as that supplied by a small electromagnet located under the disk, the magnetic dipoles align along
the direction of the external ®eld. The total magnetic ®eld
of all the atomic dipoles in the spot produces its magnetization. The electromagnet switches polarity to write 0s or 1s
(negative and positive polarity).
Electric currents and magnetism
If, as Peregrinus said, magnetic poles cannot be isolated, what is
then the meaning of the magnetic pole strength qm ? Although, as
we said above, magnetic monopoles could exist, they cannot
be obtained by splitting a magnet as splitting a magnet in two
produces two complete magnets (®gure 14.3). Therefore, if a
magnet cannot be considered to be made up of two separate magnetic monopoles, what is then the source of magnetism? The
answer has to do with an accidental discovery that the Danish
physicist Hans Christian Oersted (1771±1851) made in 1819.
Because many properties of magnetic behavior resemble electrical behavior, scientists had long suspected that there might be a
295
SUPERSTRINGS AND OTHER THINGS
Figure 14.3. Cutting a magnet in two produces two smaller but complete magnets. This process can continue down to atomic dimensions
without ever isolating a single magnetic pole.
connection between electricity and magnetism and had
attempted to measure the effects of electric currents on magnetic
compasses. During a physics lecture, Oersted was trying to show
that an electric current ¯owing through a wire lying on a table did
not de¯ect a compass needle. After he placed the compass on the
table at various locations near the wire with the needle always
pointing north, Oersted picked up the compass and held it
above the wire. The compass needle twitched and pointed in a
direction perpendicular to the wire, and when he reversed the current, the needle swung and pointed in the opposite direction,
always perpendicular to the wire. Clearly, a force was acting on
the compass needle.
Oersted's results were the ®rst ever found in which the force
was not in the same direction as the line connecting the sources of
the force. It had apparently not occurred to anybody to look for a
force that was not parallel to the direction of the ¯ow of current.
However, a similar discovery had been reported in the August 3,
1802 issue of the Gazetta di Trentino by the Italian jurist Gian
Domenico Romagnosi, but had been ignored. Oersted wrote a
pamphlet in Latin, as was customary in those days for scienti®c
papers, where he described his discovery, and sent the paper
off to many scienti®c societies. A translation of his paper
appeared in 1820 in the Annals of Philosophy and before the end
of the year, the French scientist Andre Marie AmpeÁre had
296
Electromagnetism
Figure 14.4. A wire carrying a current surrounded by small magnets.
The magnets form a circle around the wire.
extended Oersted's work and had concluded that all magnetism
was due to small electric currents. AmpeÁre gave mathematical
form to Oersted's discovery. His formulation is known as
AmpeÁre's law. These discoveries were the ®rst steps towards a
complete understanding of the close relationship between electricity and magnetism.
To examine Oersted's discovery more closely, consider a
straight segment of wire carrying a current i where several
small magnets or compasses allow us to observe the direction
of the magnetic ®eld in the vicinity of the wire. If we place the
magnets around the wire and turn off the current, all the compasses point north; as soon as the current ¯ows through the
wire, the magnetic needles of the compasses point in such a
way as to form a circle whose center is at the wire and whose
plane is perpendicular to the wire (®gure 14.4). When the compasses surrounding the wire are placed farther away from the
wire, the magnetic needles again form a circle centered at the
wire. According to AmpeÁre's law, the electric current ¯owing in
the wire produces a circular magnetic ®eld around the wire.
The magnetic ®eld lines due to a current follow concentric
circles that surround the wire. The direction of the ®eld can be
found with the right-hand rule: if we grasp the wire with the
right hand so that the thumb points in the direction of the current,
the curled ®ngers indicate the direction of the magnetic ®eld
(®gure 14.5). We can see that the direction of the magnetic ®eld
is perpendicular to that of the current.
The simplest application of AmpeÁre's law involves the calculation of the magnetic ®eld due to a current i ¯owing through a
long straight wire. The magnetic ®eld B at a distance r from the
297
SUPERSTRINGS AND OTHER THINGS
Figure 14.5. The right-hand rule to determine the direction of the magnetic ®eld around a current-carrying wire. If the right thumb points in the
direction of the current, the curled ®ngers indicate the direction of
the magnetic ®eld.
wire is given by
Bk
i
r
where k is a constant. The SI unit of magnetic ®eld is the tesla (T)
which is a relatively large unit. For this reason, the gauss, equal to
one ten thousandth of a tesla, is also de®ned. 1 gauss 10ÿ4 T.
In the expression for the magnetic ®eld B, the constant k is
equal to 2 10ÿ7 T m=A.
A moving charge in a magnetic ®eld
An electric current, as we know, is the rate at which charge ¯ows.
A moving electric charge creates a magnetic ®eld, even if it is a
single charge instead of a current. This magnetic ®eld exerts a
force on a magnet. Therefore, a magnetic ®eld must in turn
exert a force on a moving charge. Two electrically charged objects
at rest with respect to each other exert a force on each other that is
given by Coulomb's law. However, if the charges are moving in
relation to each other, the situation gets more complicated,
since a moving charge creates a magnetic ®eld which in turn
exerts a force on the other charge.
What is the nature of the force exerted by a magnetic ®eld on
a moving charge? The answer to this question can be found
through experiment. If we place an electric charge at rest in a
region near a large magnet, where the magnetic ®eld is fairly uniform, we ®nd that if this region is free of electric ®elds the charge
experiences no net force. If, however, the charge is moving with
respect to the magnetic ®eld, a net force acts on the charge. This
298
Electromagnetism
Figure 14.6. A charge q moving with a velocity v in a magnetic ®eld of
strength B is subject to a force F. This force varies from zero, when the
velocity is parallel to the ®eld, to a maximum value, when the velocity
is perpendicular to the direction of the magnetic ®eld.
depends on the magnitude of the charge, its velocity, and the
strength of the magnetic ®eld. If we call the strength of the
magnetic ®eld B, the force experienced by the charge q moving
with a velocity v is proportional to the product qvB; that is,
F / qvB:
However, when the particle is moving in a direction parallel to
the magnetic ®eld, this force is zero. If the particle moves in a
direction that is perpendicular to the ®eld, the force is maximum.
Motion along a direction other than parallel or perpendicular
produces a force that falls in between this maximum value and
zero (®gure 14.6). The force, then, is proportional to the component of the velocity along a direction perpendicular to the
magnetic ®eld, or
F / qvperp B:
As shown in ®gure 14.6, the force is always at right angles to
both the velocity and the direction of the magnetic ®eld. If the
strength of the magnetic ®eld is given in teslas, the expression
for the magnetic force on the moving charge can be written as
F qvperp B:
For our discussions, we will consider only the simpler case where
the velocity is at right angles to the direction of the magnetic ®eld.
299
SUPERSTRINGS AND OTHER THINGS
Figure 14.7. A proton moving with a velocity v perpendicular to the
magnetic ®eld experiences a force F which makes it move in a circle
with a constant speed.
In this case, the force can be written as
F qvB:
If a charged particle, such as a proton, enters a region where
there is a uniform magnetic ®eld, and the particle moves with a
velocity v perpendicular to the direction of the magnetic ®eld, a
magnetic force F, perpendicular to both the velocity and the direction of the magnetic ®eld, acts on the particle (®gure 14.7). Since
the force is perpendicular to the velocity, it does no work on the
particle and its kinetic energy remains constant. Therefore, the
particle's speed remains constant. This magnetic force, however,
accelerates the particle in a direction that is perpendicular to the
velocity, producing a centripetal acceleration that makes the
particle move in a circle with a constant speed.
In ®gure 14.8 an electron leaves a curved track in a liquid
hydrogen bubble chamber as it moves through a uniform magnetic
300
Electromagnetism
Figure 14.8. An electron track in a bubble chamber. The track is curved
as the electron moves through a uniform magnetic ®eld pointing into
the page. The electron is slowed down by the liquid hydrogen, making
the path a spiral rather that a circle. (Courtesy Brookhaven National
Laboratory.)
®eld pointing into the page. Although electrons cannot be seen,
their tracks are made visible brie¯y in a bubble chamber as thin
lines of boiling liquid. In a hydrogen bubble chamber, liquid hydrogen is heated under pressure to a temperature slightly below its
boiling point. When a charged particle moves through the liquid,
it ionizes some of the hydrogen molecules. If the pressure is suddenly decreased, the liquid starts boiling preferentially around
the ion paths. In ®gure 14.8 the radius of curvature decreases
because the electron is slowed down by the liquid hydrogen.
Particle accelerators
The circular motion of a charged particle moving in a direction
that is perpendicular to a magnetic ®eld was used in the early
301
SUPERSTRINGS AND OTHER THINGS
1930s by physicist Ernest Orlando Lawrence of the University
of California at Berkeley to accelerate protons to very high
energies.
The desire to accelerate particles originated with Rutherford,
who had shown in 1919 that the nitrogen nucleus could be disintegrated by alpha particles emitted during the radioactive
decays of radium and thorium. In this nuclear process the nitrogen nucleus is transformed into oxygen by the collision with the
alpha particle. This was the ®rst arti®cially produced nuclear
reaction; for the ®rst time humans had modi®ed the structure
of an atomic nucleus. However, the particles obtained as products
of radioactive decays had energies of only a few MeV. The need
for higher energies led scientists to invent machines that could
accelerate these particles. These machines, known as particle
accelerators, started in 1932 with J D Cockcroft and E T S
Walton at the Cavendish Laboratory of Cambridge University
and their voltage multiplier, which was developed after a suggestion by Rutherford. Accelerating protons to energies of about
400 keV, they were able to split lithium atoms into two alpha
particles.
In 1928, Rolf WinderoÈe in Germany had used an alternating
high voltage to accelerate ions of sodium and potassium to twice
the applied voltage. Ernest Lawrence was a twenty-seven-year
old professor of physics at Berkeley when WinderoÈe published
his results. The son of an educator, Lawrence had wanted to be
a physician. A physics professor at the University of Minnesota
sparked his interest in physics and Lawrence decided instead to
attend Yale University, obtaining his PhD in physics in 1925.
He joined the physics faculty at Yale and in 1928 moved to Berkeley where he founded the world-famous Radiation Laboratory.
He remained in charge of the laboratory until his death in 1958
at the age of ®fty-seven. Shortly after moving to Berkeley, while
browsing through several journals in the physics library,
Lawrence found WinderoÈe's paper in the Archiv fuÈr Elektrotechnik.
It occurred to Lawrence that a magnetic ®eld could steer the electrons into a circular path so that the acceleration stage could be
repeated. In the summer of 1930 his graduate student M Stanley
Livingston started the design and construction of the ®rst
magnetic resonance accelerator or cyclotron, as part of his
experimental research for a doctorate in physics. On January 2,
302
Electromagnetism
Figure 14.9. In a cyclotron, two metal dees lie in between the poles of a
big magnet. A source of ions in the middle of the assembly injects
charged particles into the dees, which are connected to an electrical
oscillator. As the particles are accelerated in the space between the
dees, they spiral outward and gain energy.
1931, their ®rst cyclotron produced ions of 80 keV energy using a
borrowed magnet with a ®eld of 13 000 gauss.
The operation of the cyclotron is based on the fact that
the number of revolutions per second made by the particle
(which gives us the angular velocity) does not depend on the
particle's speed nor on its radius of rotation. In a cyclotron, this
angular velocity depends only on the charge and mass of the
particle being accelerated, and on the magnetic ®eld of the
cyclotron.
Figure 14.9 shows a schematic drawing of a cyclotron. A
source of particles lies in between two hollow metal dees which
are connected to an electrical oscillator producing an alternating
potential. If a proton, for example, is injected into the cyclotron, it
will be accelerated towards the dee that happens to be negative at
that time. Inside the dee, the proton is shielded from the electric
®eld but not from the magnetic ®eld of the external magnet which
forces the proton to move in a semicircle. As the proton completes
a semicircle, it returns to the other dee, which turns negative as
soon as the particle enters. The potential difference between the
two dees causes the proton to increase its speed each time it
crosses the space between the dees. Since the number of revolutions that the proton makes per second is independent of the
velocity or the radius of curvature, the proton will change dees
at equal intervals of time and the polarity of the dees is set to
change accordingly. In a modern cyclotron, the particle may
303
SUPERSTRINGS AND OTHER THINGS
make 100 revolutions before leaving the dees with an energy of
several MeVs.
Owing to engineering limitations in the construction of very
large magnets, cyclotrons have reached their limit, although they
remain extremely useful for the range of energies under 40 MeV
for protons. An accelerator that does not require a single large
magnet to bend the path of the charged particles is the synchrotron. This type of machine uses a magnet in the form of a ring
and the particles follow a single circular path instead of a
spiral, as in the cyclotron. The Fermi National Accelerator
Laboratory or Fermilab, in Batavia, Illinois, houses the largest
synchrotron in the world. Recently, the laboratory has produced
energies of the order of the TeV, or one trillion electron-volts, and
for this reason, the accelerator is called the Tevatron.
Magnetism of the earth
As Gilbert wrote in De Magnete, the earth is a magnet. The earth
has a large, dense core about 2400 kilometers in diameter, composed of heavy atoms believed to be mostly iron. Study of the
seismic waves generated by earthquakes has revealed that the
inner part of the earth's core is probably solid while the outer
core is liquid. Very slow motions of the core, of the order of 1
millimeter per year, are believed to generate the magnetic ®eld
of the earth. At the earth's surface, the earth's magnetic ®eld
strength is about 5 10ÿ5 T or about 0.5 gauss.
Magnetism is due to the motion of electric charges. In the
earth, the slow motions in the metallic core produce electric currents in the hot, electrically conductive material. These currents
¯ow upward and are in turn carried around by the earth's fast
rotation, producing the magnetic ®eld. This process, however,
is not totally understood, and scientists were surprised to ®nd a
magnetic ®eld in Mercury, which rotates too slowly to produce
even a weak magnetic ®eld. The source of Mercury's magnetic
®eld is still a problem to be resolved.
The earth's magnetic ®eld extends out into the surrounding
space where it interacts with the solar wind, the ¯ow of ionized
atoms and electrons that constantly streams away from the
atmosphere of the sun at speeds of several hundred kilometers
304
Electromagnetism
Figure 14.10. The earth's magnetic ®eld acts as a shield against the
supersonic particles that stream away from the sun.
per second. The magnetic ®eld acts as a shield against the solar
wind, slowing the supersonic particles to subsonic speeds, forming a shock wave similar to the bow wave formed by a ship
moving through water (®gure 14.10). This magnetic ®eld also
protects the earth from most of the highly energetic cosmic ray
particles originating in the interstellar medium.
The magnetic ®eld of the earth traps some matter from the
solar wind, creating the earth's magnetosphere, which is the
region around the earth where the magnetic ®eld plays a dominant part in controlling the physical processes taking place
there. The magnetosphere was discovered in 1958 with the ®rst
American arti®cial satellite, the Explorer 1 which carried instruments built by James van Allen, a professor of physics at the
University of Iowa. Subsequently, other planets were found to
possess magnetospheres. The strongest planetary magnetosphere
is that of Jupiter, which completely envelops the innermost Jovian
satellites.
If a charged particle ± proton or electron ± from the solar
wind enters the earth's magnetosphere with a velocity at right
angles to the magnetic ®eld, the resulting motion of the particle
will be a circle. If the particle enters the magnetosphere with a
velocity component parallel to the magnetic ®eld, it will move
along the ®eld (because of its parallel component) and also
along a circle (due to its perpendicular component), so that the
resulting motion is a spiral path (®gure 14.11). Because the magnetic ®eld of the earth is stronger near the poles and weaker at the
305
SUPERSTRINGS AND OTHER THINGS
Figure 14.11. A charged particle entering earth's magnetosphere with a
velocity component parallel to the magnetic ®eld lines moves in a spiral
path.
center, a particle trapped in this ®eld will spiral around the ®eld
lines, oscillating back and forth, as in a magnetic bottle. The regions
of space where electrons and protons from the solar wind become
trapped, oscillating back and forth, are called Van Allen belts. The
inner Van Allen belt, which extends between altitudes of 2000
and about 5000 km, contains mostly protons. The outer Van
Allen belt, extending from about 13 000 to about 19 000 km, contains mostly electrons.
Solar ¯ares are sudden gigantic explosions that take place in
the atmosphere of the sun. In addition to emitting enormous
amounts of energy at all wavelengths, ¯ares also eject highly
ionized matter. When this ionized matter reaches the earth, it
interacts with the magnetic ®eld producing magnetic storms and
dumping into the atmosphere some of the particles that had
been trapped in the Van Allen belts. When these high speed electrons and protons collide with gases in the upper atmosphere,
atoms of nitrogen and oxygen absorb the ultraviolet radiation
emitted. This results in the emission of visible light (a phenomenon called ¯uorescence), producing the beautiful Northern Lights,
also known as the aurora borealis, or the Southern Lights (aurora
australis).
306
Electromagnetism
Physics in our world: Avian magnetic navigation
Migratory birds have a built-in compass: they possess
magnetite crystals near their nostrils, which allow them to
detect the orientation of the magnetic ®eld of the earth and
to use it to navigate. Scientists have discovered that these
birds also navigate using the positions of the stars and that
in some cases they must use the magnetic and the celestial
cues together to orient themselves correctly.
Peter Weindler and his research group at the J W Goethe
University in Frankfurt studied garden warblers which breed
in central Europe and migrate to Africa in the winter. In their
¯ight to Africa, the birds take a detour west to avoid climbing
over the Alps, ¯ying instead southwest to the Iberian peninsula and then turning southeast to Africa. Weindler's team
raised a group of warbler chicks in captivity. During the
summer before their ®rst migration, they placed the birds
inside cages with arti®cial light and a rotating dome with
holes that simulated the night sky. They also isolated a
group of these birds from the earth's magnetic ®eld. When
the time came for the birds' normal migration, all the warblers
were placed in large cages with only celestial cues. The birds
that were raised isolated from the earth's magnetic ®eld
headed south. The others oriented themselves correctly, in a
southwest direction. Later in the season, when the birds
should have turned southeast, both groups failed to do so.
They had not been exposed to a simulation of the fall sky.
The scientists concluded that migratory birds need both
magnetic and celestial information to orient themselves. The
night sky provides only the general direction; south in the fall
and north in the spring. The birds use the direction of the
earth's magnetic ®eld to deviate from this general orientation.
The source of magnetism
Oersted's discovery that an electric current produces a magnetic
®eld led AmpeÁre to conclude in 1820 that all magnetism is due to
small electric currents. Just what kind of small electric current is
the cause of magnetism in a permanent magnet, for example?
307
SUPERSTRINGS AND OTHER THINGS
Figure 14.12. (a) A charged particle moving in circles produces an
electric current. This loop of current generates a magnetic ®eld. (b) A
spinning charged sphere also produces a magnetic ®eld.
In the Bohr model of the atom, an electron moves around its
nucleus following a circular orbit. This motion is equivalent to a
very small loop of electric current which, as we know, generates a
small magnetic ®eld (®gure 14.12(a)). In addition to this motion,
each electron also rotates about its own axis and this rotation is
again equivalent to a circular electric current that produces its
own magnetic ®eld (®gure 14.12(b)). The magnetic ®eld of the
electron is that of a dipole; that is, it looks like the magnetic ®eld
of a magnet, with north and south poles, rather than the magnetic
®eld of a monopole. Although, the model of the atom with electrons as small spinning spheres that rotate around the nucleus
is no longer considered adequate, the magnetic ®elds produced
are what we would expect if the electron were spinning on its
axis and revolving around the nucleus.
Although everything contains electrons, not everything is
magnetic because in most substances the magnetic ®elds of the
electrons combine with each other and cancel out. In iron and a
few other substances, these magnetic ®elds do not completely
cancel out, and large collections of atoms align themselves, forming magnetic domains throughout the material. A piece of iron
contains many of these magnetic domains, each formed by the
alignment of perhaps millions of atoms. In a normal piece of
308
Electromagnetism
iron, the domains are randomly oriented; unless many of these
domains are aligned, the material will not be magnetic. However,
the presence of a strong magnetic ®eld in the vicinity may be
enough to align many domains in the direction of this external
®eld. You can magnetize the blade of your knife by rubbing it
several times in the same direction with a large magnet. If the
magnet is very strong, placing the knife on the magnet may be
suf®cient to align enough of its domains for it to remain magnetized for a long time.
Faraday's law of induction
We have seen how an electric current generates a magnetic ®eld.
This suggests that perhaps a magnetic ®eld could generate an
electric current. Is this true?
The ®rst person to answer this question was the English
scientist Michael Faraday. One of ten sons of a blacksmith,
Michael Faraday lacked a formal education. ``My education was
of the most ordinary description,'' he wrote, ``consisting of little
more than the rudiments of reading, writing and arithmetic at a
common-day school.'' He was born in 1791 in Newington,
England, and at the age of 14 became a bookbinder's apprentice
in the shop of Mr G Riebau. This seems to have been a fortunate
turn of events, because that job put him in contact with books,
which the young boy decided to read. ``There was plenty of
books there and I read them,'' he later said. His master not only
allowed him to read the books that were commissioned for
binding, but also encouraged him to read the books in his
personal library and to attend cultural events and lectures. One
day, while binding a volume of the Encyclopaedia Britannica,
Faraday encountered a 127-page article on electricity and
became fascinated by it. The reading of this article stimulated
his interest for science and prompted him to save enough
money to buy the necessary parts with which to construct his
®rst scienti®c apparatus.
In 1810, Faraday was introduced to the City Philosophical
Society which had been formed two years earlier and which
met every Wednesday to discuss a paper on some topic of science
presented by one of the members. At these lectures, Faraday
309
SUPERSTRINGS AND OTHER THINGS
would usually get himself a front seat to take notes which he
would later rewrite and expand at home. Later, when a series
of four lectures was delivered by Sir Humphrey Davy, the English chemist and Superintendent of the Royal Institution, Faraday
was given a ticket to attend them. He took copious notes which he
bound and later sent to Davy along with a petition for a job at the
Institution. Although Davy was ¯attered, he could not offer
Faraday a position at that time, but later, when one of Davy's
employees at the Institution was involved in a ®ght and was
dismissed, Faraday was hired as an assistant in the laboratory.
Faraday was now on the road to become one of the greatest
experimental scientists who ever lived.
In 1825, twelve years after his appointment as assistant, Faraday became director of the laboratory and in 1833, professor of
chemistry at the Royal Institution. It was at the Royal Institution
that Faraday became intrigued by Oersted's experiments showing that an electric current created a magnetic ®eld and wanted
to see it if was possible for a magnetic ®eld to produce an electric
current. With this in mind, he ``had an iron ring made . . . [w]ound
many coils of copper wire round one half, the coils being separated by twine and calico. . . By trial with a trough each was
insulated from the other. Will call this side of the ring [which
was connected to a battery] A.'' On the other side, side B, a
second copper wire was wound (®gure 14.13). Faraday expected
Figure 14.13. (a) Faraday's induction experiment. At the instant the
battery was connected or disconnected so that a current would ¯ow
through coil A, a current was detected for a brief time in coil B. (b)
Faraday's induction ring.
310
Electromagnetism
that the magnetic ®eld produced by coil A, which would magnetize the iron ring, would in turn produce a current in the coil of
copper wire wrapped around the other side of the ring, independent of the other coil. It did not happen that way. Instead, only at
the instant the battery was either connected or disconnected, a
current appeared in coil B.
It was not the presence of the magnetic ®eld that produced a
current in coil B, but the change in this magnetic ®eld that
occurred at the moment the battery was connected or disconnected. As soon as the battery was connected, the current rapidly
increased from zero to a steady value and while that increase in
current took place, the magnetic ®eld produced by this current
also increased. Similarly, when the battery was disconnected,
the current immediately decreased to zero, producing a sudden
decrease in the magnetic ®eld. It was this increase or decrease
in the magnetic ®eld that induced a current in coil B. For this
current to appear in the second coil, a voltage must exist there.
Thus, we can say that a changing magnetic ®eld induces a voltage
in the second coil which in turn produces a current. This phenomenon is called electromagnetic induction.
Faraday performed further experiments and showed that the
iron ring was not needed. Adding a switch to coil A, a current was
induced in coil B whenever the switch was opened or closed.
Since it was the change in the magnetic ®eld generated by the
increase or decrease in the current that induced the current in
the second coil, Faraday experimented with moving magnets
and was able to induce electric currents in coils just by quickly
moving a magnet into or out of the coil (®gure 14.14).
A coil is not necessary for electromagnetic induction to take
place. A single loop of a wire or an electric circuit would work. A
voltage is induced in a single loop of wire by moving a magnet in
the vicinity of the loop. The faster we move the magnet, that is,
the faster the magnetic ®eld in the region of the loop changes,
the greater the induced voltage. We can say that when the rate
at which the magnetic ®eld changes through the circuit is large,
the induced voltage is large. In more general terms, the induced
voltage in the circuit is proportional to the rate of change of the magnetic
®eld. This statement is known as Faraday's law of induction.
In 1822, Faraday wrote in his notebook: ``Convert magnetism
into electricity.'' By 1831, he had accomplished his goal.
311
SUPERSTRINGS AND OTHER THINGS
Figure 14.14. Moving a magnet into or out of a coil induces a current in
the coil.
Motors and generators
Faraday showed that a change in a magnetic ®eld produces a current in a coil. The magnetic ®eld threading through a spinning
coil is variable, even if the ®eld itself is constant (®gure 14.15).
Therefore, a current would be induced in a coil rotating in a
region where there is a magnetic ®eld. If you attach a handle to
this coil and turn it, a current is generated in the coil for as long
as you keep turning the coil. This current can be used to light a
bulb, for example (®gure 14.16). This is a simple one-loop generator, a device to convert mechanical energy into electrical energy.
A paddle wheel turned by a waterfall or by a river can provide
Figure 14.15. The magnetic ®eld through a rotating coil changes from
zero, when the plane of the coil lies parallel to the magnetic ®eld, to a
maximum value, when the plane of the coil is perpendicular to the ®eld.
312
Electromagnetism
Figure 14.16. A loop of conducting wire with a handle generates electricity when the loop is turned in a region where there is a magnetic ®eld.
Metal brushes maintain contact with the circuit as the loop rotates. This is
a simple electric generator which can provide the current for a light bulb.
the mechanical energy for the generator. Steam moving through a
turbine can also supply the mechanical energy to turn the
generator and produce electrical energy.
Suppose now that we remove the handle and place the coil
inside a magnetic ®eld. If instead of a light bulb we connect a battery so that a current ¯ows in the coil, a torque is exerted on the
loop. The reason for this is that since currents are charges in
motion, and a charge in motion in a magnetic ®eld experiences
a magnetic force, an electric current placed in a magnetic ®eld
also experiences a magnetic force. This force is perpendicular to
the direction of the magnetic ®eld and to the direction of the
¯ow of current through the conductor.
Consider the conducting loop of ®gure 14.17, where the magnetic ®eld lines in the region between the two external magnets
¯ow from left to right. A battery is connected to the loop through
a pair of metal brushes that allow two conductors in the arms of
the loop to keep contact while they slide past each other. These
conductors form what is called the commutator because, as we
shall see, they change the ¯ow of current through the loop as
the loop rotates.
We can now see how the loop rotates. When the brushes are
aligned with the commutator segments, current ¯ows into the left
313
SUPERSTRINGS AND OTHER THINGS
Figure 14.17. A simple motor. (a) The wire loop in a magnetic ®eld is
connected at one end to two semicircular conductors called the
commutator. Current from the battery ¯ows into the left (gray) side of
the loop through the metal brush and out the other (black) side. The
torque produced spins the loop counterclockwise. (b) Each brush is in
contact with both commutators. Current bypasses the loop. Inertia
keeps the loop rotating. (c) The black side of the loop is now on the
left. Current ¯ows into this side now and out of the gray side. The current
through the loop changes direction but the loop keeps rotating clockwise.
314
Electromagnetism
side of the loop (shown in gray in ®gure 14.17(a)) and out of the
right side. The force acting on the left side is perpendicular to the
direction of the magnetic ®eld, in this case to the right, and to
the direction of the ¯ow of current. The force on the right side
of the loop is in the opposite direction, because the direction of
the magnetic ®eld is the same as before, that is, to the right, and
the current is moving in the opposite direction through the
loop. These two forces, which depend on the value of the current
and the magnitude of the electric ®eld, have the same magnitude,
since the current ¯owing though the loop has the same magnitude in either direction. The two equal and opposite forces,
acting at the two opposite sides of the loop, exert a torque on it
which results in rotation. As the loop rotates to a position
where each brush is in contact with both brushes, the current
bypasses the loop, which continues to rotate due to its inertia
(®gure 14.17(b)). As the loop rotates beyond this middle position,
current again ¯ows through the loop but in the opposite direction
(it ¯ows into what was the right side, now located on the left,
and out of the other) assuring that the force exerted on whichever
side of the loop happens to be on the left does not change (®gure
14.17(c)). The loop keeps rotating in the same direction. If a shaft
and gears are attached to the loop, its rotational motion can be
transmitted to a wheel which could in turn be attached to a
cart. This device is a simple electric motor, which transforms electrical energy into mechanical energy, the reverse of a generator.
Instead of just one loop, an actual electric motor has many
loops wrapped around an iron core or armature. In addition to
increasing the torque, having many loops improves the operation, since the torque on a single loop changes continuously
from zero to a maximum value, while in many consecutive
loops it would stay about constant.
Maxwell's equations
James Clerk Maxwell was the nineteenth century scientist who
probably made the greatest contribution to modern physics.
Maxwell was born of a wealthy family in Edinburgh on June
13, 1831. Early on he showed great mathematical ability and at
the age of fourteen wrote a paper on geometry, ``The Theory of
315
SUPERSTRINGS AND OTHER THINGS
Rolling Curves,'' which was accepted for reading before the
Royal Society of Edinburgh. Not counting this and other short
papers written while at school, Maxwell wrote over a hundred
scienti®c papers in his 48 short years of life.
Faraday had introduced the concept of lines of force as a
pictorial representation of electrical and magnetic forces. While
still an undergraduate at Cambridge, Maxwell tried to give mathematical form to the physical notions of Faraday. Shortly after
graduating from Cambridge, he wrote a paper entitled ``On Faraday's Lines of Force,'' the ®rst in a series of papers on this subject,
which was read before the Cambridge Philosophical Society.
Seven years later, in a second paper entitled ``Physical Lines of
Force,'' Maxwell devised a model that illustrated Faraday's law
of induction. When Maxwell applied his model, he realized
that, in addition to Faraday's discovery that a changing magnetic
®eld would produce an electric force, the model suggested that a
changing electric ®eld would produce a magnetic force. To take
this into account Maxwell decided to modify AmpeÁre's law.
Maxwell eventually developed a theory of electromagnetism
which embraced everything that was previously known about
electricity and magnetism. His four equations, known today as
Maxwell's Equations, not only summarize all the work of Coulomb,
Oersted, AmpeÁre, Faraday and others but also extended the
relationships and symmetries between electricity and magnetism.
Whereas the ®rst three equations could be considered as restatements of previous work, his fourth equation, the extension of
AmpeÁre's law, was the key to the puzzle, the stroke of genius
which uni®ed electricity and magnetism into one single theory
of electromagnetism.
Maxwell's ®rst equation is a consequence of Coulomb's law
and gives a relationship between an electric charge and the
electric ®eld it produces. The second equation describes how
magnetic ®eld lines always form closed loops; that is, they do
not start or stop anywhere. In this respect, they are essentially
different from electric ®eld lines which start on positive charges
and end on negative charges. For this reason, we do not observe
magnetic monopoles, the magnetic equivalent of electric charges.
The third equation is Faraday's law which says that a changing
magnetic ®eld creates an electric ®eld. Maxwell's fourth equation
is, as we have said, an extension of AmpeÁre's law, and states that
316
Electromagnetism
Physics in our world: Microwave ovens
Microwave ovens are commonplace in our homes today.
However, few of us know how they work. The physics we
have learned so far should allow us to understand the basic
principles of their operation.
A microwave oven is actually a small broadcasting
station. A magnetron tube emits high frequency electromagnetic waves (microwaves). These waves are ``piped''
along a waveguide in the oven. A metal stirrer then directs
the waves throughout all areas of the oven.
Microwave ovens designed for homes have a frequency
of 2450 MHz, which corresponds to a wavelength of 122 mm.
These electromagnetic waves are re¯ected by metals but
transmitted by paper, glass, and some plastics. They are
absorbed by water and sugar. As the microwaves penetrate
the foods, they cause water molecules to vibrate at depths
of about 5 cm, producing heat mainly as a result of the disruption of the intermolecular bonds of the water molecules.
Water molecules consist of an atom of oxygen and
two atoms of hydrogen arranged so that, although the
entire molecule is electrically neutral, the oxygen side of the
molecule is negative, while the side with the hydrogen
atoms is positive. The electric ®eld transmitted by the
microwave produces a torque on the water molecule. Since
the ®eld is oscillating, the applied torque sets the molecule
into vibration. Because a water molecule in a substance
such as food is bound to other molecules, retarding
``frictional'' forces are produced, which appear as heat in
the substance.
Since the walls inside the oven are made of metal, microwaves are re¯ected by them. Standing waves (like the ones
produced on a taut string) may be set up inside the oven.
The places of maximum vibration correspond to hot spots,
while the places where no vibrations of the electromagnetic
®eld take place correspond to cold spots. To avoid the
uneven cooking that these hot and cold spots would produce,
foods are placed on a turntable.
317
SUPERSTRINGS AND OTHER THINGS
Figure 14.18. An electromagnetic wave consists of changing magnetic
and electric ®elds at right angles.
a changing electric ®eld creates a magnetic ®eld. A magnetic ®eld
then can be created by an electric current or by a changing electric
®eld.
According to Maxwell's third equation, a changing magnetic
®eld creates an electric ®eld which is also changing. The fourth
equation tells us that this changing electric ®eld would in turn
create a magnetic ®eld, which changes with time. This changing
magnetic ®eld in turn creates a changing electric ®eld which
creates a changing magnetic ®eld, and so on. By combining his
four equations into a single expression, Maxwell was able to
show that even in regions where there were no electric charges
or magnets, once the process started, the changing magnetic
and electric ®elds would continue to propagate. This propagation
of electric and magnetic ®elds through space is what we call
electromagnetic waves (Figure 14.18).
From his equations, Maxwell was able to determine that the
speed of propagation of the waves was 2:88 108 meters per
second, almost exactly equal to the speed of light, which had
been measured in his time to be 3:11 108 meters per second.
He suggested that electromagnetic waves did exist and were
observed as light. ``The agreement of the results seems to shew
that light . . . is an electromagnetic disturbance,'' he wrote in a
paper published in 1868. Nine years after Maxwell's death,
Heinrich Hertz (1857±1894) generated electromagnetic waves in
his laboratory.
318
15
WAVE MOTION
The nature of waves
Wave motion exists everywhere. When a bird sings in the forest,
sound waves propagate away from the bird's throat in all directions, striking the ear membranes of the other animals nearby. A
leaf falling from a tree on the surface of a pond produces ripples
that spread outward through the surface of the serene water.
Light reaching the surface of the earth from the sun propagates
through empty space as an electromagnetic wave. And as we
learned in chapter 7, even ordinary matter exhibits wave behavior.
In the examples above, sound and water waves require a
medium to propagate. The bird's song could not be heard without the air through which the sound waves could travel, and
the ripples spreading on the still pond are certainly impossible
if the pond does not exist. An astronaut on the moon cannot
attract his companion's attention by clapping his gloved hands
or banging on his metal backpack since there is no atmosphere
that can serve as a medium. The astronaut, however, can speak
into the microphone in his helmet and the astronauts around
him, the astronauts in Moonbase, and Mission Control on earth
can all hear him. The electromagnetic waves generated by his
radio transmitter can propagate through the vacuum of space;
they do not need a medium although they can travel through
certain media. Waves that require a medium to propagate are
called mechanical waves. In this chapter we will study mechanical
waves only, as they can be visualized.
What exactly is a mechanical wave? In chapter 7 we learned
that the elasticity of a medium allows a disturbance created at
some point in space to propagate through the medium without
321
SUPERSTRINGS AND OTHER THINGS
Figure 15.1 (a) Transverse wave. A pulse traveling to the right displaces
the string in a direction at right angles to the direction in which the pulse
travels. (b) A longitudinal wave produced by pushing and pulling on one
end of a Slinky. (c) A torsional wave produced by repeatedly twisting
one end of a coil back and forth.
transmitting matter. Thus, when a taut string is plucked, the
hump or pulse produced travels down the string; but the material
of the string does not travel along with the pulse, it only vibrates
up and down (or sideways or at an angle, depending on the direction in which we pluck it). If matter does not travel, what is it that
gets transmitted with the wave? It is energy and momentum.
When the ripple produced by a pebble dropped in the water
reaches a ¯oating leaf, it lifts it; the energy required to lift the
leaf is transmitted by the wave and comes from the interaction
between the falling pebble and the water.
When a pulse travels along a taut string, the string vibrates in
a direction that is perpendicular to the direction in which the pulse
moves (®gure 15.1(a)). We call this wave a transverse wave. Pushing
the end of a spring coil or ``Slinky'' back and forth produces a
wave that travels through the coil. When the end is pushed, the
®rst turn of the coil gets closer to the second turn, deforming the
322
Wave Motion
coil and forcing the second turn to move away from the approaching ®rst turn. This results in the second turn approaching the third
and the third moving towards the fourth, and so on. If the end is
pulled back after it is pushed, the second turn moves back towards
the ®rst, the third towards the second, the fourth towards the
third, and so on. If the end is pushed and pulled several times,
the overall effect is seen as moving variations of the spacing
between the turns; regions where the turns are closer together
followed by regions where the turns are farther away from each
other (®gure 15.1(b)). This wave is called a longitudinal wave, as
the medium oscillates in the same direction as the wave. If we
now hold the Slinky from one end so that it hangs loose in a
vertical position and twist the end back and forth, a torsional
wave will start to propagate down through the coil (®gure 15.1(c)).
Properties of waves
When the source of a particular wave motion acts continuously
rather than momentarily, a train of waves is formed instead of
a single pulse. If, in addition, the source of waves acts in a
cyclic way, repeating the same stages over and over like a vibration, the wave so produced is called a periodic wave. If we were to
take a snapshot of a periodic wave as it moved through a
medium, it would look like ®gure 15.2. Although the plot looks
like the snapshot of a transverse wave traveling along a string
or cord, it can also represent a longitudinal wave. In the example
with the Slinky, the regions where the turns are closer together
would be represented by the crests and the places where the
turns are separated, by the troughs.
Figure 15.2. Graphical representation of a wave.
323
SUPERSTRINGS AND OTHER THINGS
The distance between two adjacent peaks, between two
adjacent troughs or between any two identical points of a periodic
wave is called the wavelength, (the Greek letter lambda), of the
wave. If we observe one particular point in space and count the
number of wavelengths that pass that point during some time
interval, say one second, we would know how frequently the
wave moves through that point. The number so obtained is
called the frequency f of the wave motion. Frequency, then, is
the number of wavelengths that pass a particular point per
second and is given in cycles per second or hertz (Hz):
1 Hz 1 cycle=s:
The frequency of a light wave determines perceived hue and, as
we shall see in the next chapter, the frequency of a sound wave
determines its perceived pitch.
Clearly, if two waves are traveling at the same speed, the
wave with the shorter wavelength will have the higher frequency
(®gure 15.3). Frequency and wavelength are inversely proportional for waves traveling at the same speed. A wave of twice
Figure 15.3. The observer counts more wavelengths for the lower wave
than for the higher one. The frequency (the number of wavelengths that
pass a point every second) is higher for the lower wave.
324
Wave Motion
the wavelength, for instance, would have half the frequency of
another wave traveling at the same speed.
In chapter 6 we learned that for a rotating object, the time
taken to complete one revolution was called the period, T. In
wave motion, period is the time (in seconds) for one complete
cycle; it is therefore the inverse of the frequency, which is the
number of cycles per second:
1
T :
f
Another important parameter in describing a wave motion is the
amplitude (A) which is the maximum displacement of the medium
from its equilibrium position.
The velocity (v) of a wave moving through a medium is
determined by the medium. Velocity is equal to the displacement
divided by the time interval during which the displacement
occurred. Figure 15.4 shows successive snapshots of a wave traveling to the right. As one complete wavelength passes through
a certain point the wave executes one cycle and this takes a
time exactly equal to one period; thus,
1
v f
since T
:
T
f
Diffraction
One interesting property of waves is that they spread out when
passing through a hole of dimensions similar to those of the
wavelength. Figure 15.5 is a photograph of water waves advancing toward an obstacle with a small opening. The waves pass
through the hole, propagate beyond the obstacle, and spread
out around the edge of the hole. This phenomenon is called
diffraction. A group of swimmers in the sea creates no breaks in
the waves rippling onto the beach, as the waves ¯ow around
small obstacles. This is also a form of diffraction.
The principle of superposition
When two particles encounter each other, they collide. When two
waves encounter each other, however, they pass through each
325
SUPERSTRINGS AND OTHER THINGS
Figure 15.4. As a transverse wave travels along a string, a small section
of the string oscillates up and down, moving in a direction that is perpendicular to the direction in which the wave propagates. It takes a time
equal to one period for one wavelength to move past that section.
Figure 15.5. Plane waves passing through a barrier with a small hole.
The waves spread out beyond the barrier. This is known as diffraction.
(From PSSC Physics Seventh Edition, by Haber-Schaim, Dodge, Gardner,
and Shore. Published by Kendall/Hunt Publishing Company, 1991.)
326
Wave Motion
other unmodi®ed. You can hear the sound of a trumpet even while
a violin is being played in the same room. The two sound waves
reach your ears independently, as if they were the only ones
present in the room. If you are listening to the radio while the telephone rings, you will hear the ring with the same pitch and the
same loudness as if the radio was off. The same is true for water
waves, waves on a string, or any other kind of wave motion.
Where the two waves overlap, their amplitudes add up
algebraically; at some places the waves add up to produce
larger amplitudes and at other places they combine to produce
smaller amplitudes. If, for example, two waves traveling on a
string meet at one point where the amplitude of one wave is
5.0 cm and the amplitude of the other is ÿ3.0 cm or 3.0 cm
below the equilibrium position, the actual displacement of the
string from the equilibrium position at that point would be
2.0 cm (®gure 15.6). Each wave contributes to the rope's displacement regardless of the presence of the other. This property is
known as the principle of superposition or the phenomenon of
Figure 15.6. Superposition of two wave pulses (A and B) at a point. The
waves interact destructively but then pass through each other unchanged.
327
SUPERSTRINGS AND OTHER THINGS
interference. When the displacement is enhanced, the waves are
said to undergo constructive interference, and when the displacement is diminished, the waves undergo destructive interference.
However, this effect is signi®cant only when the waves have
the same frequency.
Constructive and destructive interference
Two waves of equal amplitudes and frequencies moving in the
same direction through the same region of space interfere constructively if the crests of one wave match the crests of the other.
This results in a wave with an amplitude equal to twice the amplitude of either wave alone (®gure 15.7(a)). If the crest of one wave
meets the trough of the other, there is destructive interference
and the waves cancel each other out at that point (®gure 15.7(b)).
Figure 15.8(a) illustrates what happens when ripples from
two nearby sources overlap. If two similar pebbles are dropped
Figure 15.7. (a) When the crests from the two overlapping waves
coincide, there is constructive interference; (b) when the crest and the
trough coincide, there is destructive interference and the waves cancel
each other out.
328
Wave Motion
Figure 15.8. (Left) Interference pattern produced by two sources vibrating simultaneously. (Right) Interference patterns for a different wavelength. (From PSSC Physics Seventh Edition, by Haber-Schaim, Dodge,
Gardner, and Shore. Published by Kendall/Hunt Publishing Company,
1991.)
close to each other on a serene pond, circular ripples start to
spread out from the place where each pebble hit the water and
quickly reach the same area on the surface of the water. There
will be some regions where the ripples will interfere destructively, canceling each other out, and some other regions where
the interference is constructive, and the amplitude is increased.
Figure 15.8(a) is actually a photograph of two trains of waves produced by the vibration of two rods in contact with the water and
connected to the same motor. The rods vibrate synchronously
and produce ripples that have the same amplitude and wavelength and that oscillate in step. They are said to be in phase.
Waves with the same form and wavelength oscillating in phase
are called coherent waves. The regions of constructive interference
form lines that radiate from the middle of the two sources of
waves and which are interspaced by lines formed by the regions
of destructive interference. Here the waves are out of step and are
said to be in antiphase.
This interference pattern of regularly spaced regions of
maxima and minima depends on the separation of the sources
and the wavelength of the two disturbances and is characteristic
of all wave motions. Although it is a simple matter to measure
the wavelength of waves traveling through a rope or string or of
water waves, sound waves and electromagnetic waves are not
visible and the wavelength cannot be measured by observation.
329
SUPERSTRINGS AND OTHER THINGS
Figure 15.9. Two coherent sources of waves interfere constructively at
P and destructively at Q.
However, if the locations of maxima and minima can be
determined, then simple geometry allows us to ®nd the wavelength. In the photograph of ®gure 15.8(b), we see the different
interference patterns produced by ripples of different wavelength
from that of 15.8(a). The separation between the sources and the
positioning of the screen is the same for both cases. We can see
that the spreading is more pronounced for longer wavelengths
(®gure 15.8(a)).
Consider two coherent sources of waves separated by a distance d arranged as in ®gure 15.9. If a maximum is found at some
point P at a distance y above the x axis (assuming that the distance
L is much greater than the separation d between the sources), the
wavelength is given by a simple geometrical expression involving the distances y, L, and d:
yd
:
L
Standing waves
Two identical wave motions of equal amplitude and wavelength
traveling in opposite directions, towards each other, interfere in a
330
Wave Motion
Figure 15.10. Two waves moving in opposite directions, shown at
different times.
way that depends on the location of the crests and troughs of each
wave at different times (®gure 15.10). Regardless of where the
individual waves are, though, there are certain points called
nodes that do not move at all. The points in between the nodes
vibrate but the wave does not move in either direction; the
nodes remain at the same locations and the overall pattern is
stationary. These ®xed waves are called stationary or standing
waves. It can be shown that the spacing between nodes is equal
to half the wavelength of the traveling waves.
Standing waves also occur on a stretched string ®xed at both
ends, such as in a guitar or a piano. Consider the waves generated
when a guitar string is plucked. Many waves are produced which
interfere with each other but only the ones with the end points
®xed are sustained. These waves combine to produce an overall
pattern which does not travel in any direction; the string vibrates
up and down only.
One way to observe stationary waves is to tie one end of a
string to a post, for example, and, holding the other end so as
to keep the string taut, produce a train of waves by repeatedly
shaking the free end of the string with our hand. When the
waves reach the ®xed end, they are re¯ected, and interfere with
the outgoing waves that we are producing from the free end to
form a standing wave pattern. When a taut string is plucked or
struck, standing waves are produced and the string vibrates in
different ways, called modes of vibration. In the fundamental mode
331
SUPERSTRINGS AND OTHER THINGS
Figure 15.11. Standing waves on a string ®xed at both ends. The
fundamental mode of vibration has only two nodes at the ®xed ends;
the second and third harmonics have one and two additional nodes
respectively.
or ®rst harmonic, the center of the string vibrates and the only
nodes are the two ®xed ends. Another mode of vibration has, in
addition to the ®xed ends, a third node in the middle of the
string; this is called the second harmonic or ®rst overtone. Third,
fourth and other harmonics (second, third and higher overtones)
can be obtained with two, three, and more nodes in between the
®xed ends of the string (®gure 15.11).
There are certain wavelengths for which standing waves
appear in a stretched string such as a piano string or the string
of any stringed instrument. Since the string is ®xed at both
ends, these two points, as we have seen, are always nodes. Examining the different modes of oscillation illustrated in ®gure 15.11,
we can see that if the string has a length L, the fundamental mode
of oscillation takes place when one-half of a wavelength forms in
between the two ®xed ends; that is, in the fundamental mode,
one-half of a wavelength equals the length of the string. The
wavelength of the second harmonic is equal to the length of
the string. For the third harmonic, we see that one and a half
wavelengths equal the length of the string, and for the third,
two complete wavelengths are formed in the string. Thus, the
wavelengths of the ®rst four modes of oscillation are:
L
2
fundamental
2
L
2
second harmonic
332
Wave Motion
3
L
2
third harmonic
4
L
:
2
fourth harmonic
Here we have written the wavelength of the second harmonic as
2=2 which is just ; and that of the fourth harmonic as 4=2,
which is equal to 2, so as to express all wavelengths as multiples
of =2. We ®nd other standing wave modes where L is equal to an
integral number of wavelengths or L n=2 where n 1, 2, 3 . . .
etc. The wavelength n of the nth mode of oscillation is then
n
2L
:
n
The frequency of any mode of oscillation can be found from
the formula v f . For the nth mode, the frequency is
nv
:
fn
2L
Thus, when n 1, the fundamental frequency is v=2L; when n 2,
we obtain the frequency for the second harmonic, which is v=L;
and similarly for any other mode of oscillation. The frequencies
obtained from this expression are called the natural frequencies
of the string.
Resonance and chaos
When we ®x one end of a string and, keeping it stretched, repeatedly move the free end up and down slowly with one hand, standing waves are usually not seen right away. However, if we start
increasing the frequency with which we move the free end of
the string, pretty soon we reach the fundamental frequency of
vibration of the string and we observe the fundamental mode.
Further increasing the frequency destroys the standing wave
pattern for a while until the second mode of oscillation of the
string or ®rst overtone is reached.
As we slowly increase the frequency with which we move
the free end of the string, we reach other modes of oscillation.
What we are doing with our rapid up and down motions is to
pump energy to the string; the motion of our hand is the source
of energy. We notice that it is easier to pump energy when the
333
SUPERSTRINGS AND OTHER THINGS
string is oscillating in any one of its natural frequencies. A similar
situation occurs when we push a child on a swing. If the pushes
are in step with the back and forth motion of the swing, it is easier
to maintain the oscillations. But if the pushes are not in step with
the swing, the child's motion becomes disorganized, the oscillations of the swing lose amplitude, and it is more dif®cult to
push it. The string responds better to this in¯ux of energy when
the frequency of the driving force (our hand) matches one of
the natural frequencies of oscillation of the string. This phenomenon is known as resonance.
On November 7, 1940, some four months after it had been
inaugurated, the Tacoma Narrows Bridge collapsed after standing
waves were produced by gusting winds. If the standing waves
had remained stable, the bridge might not have collapsed. However, as scientists have recently discovered, a stable system may
become unstable if a very small variation is introduced into its
initial conditions. Even in the absence of any external random
forcing, some physical systems can show regular periodic motions
or apparently random motion, and the difference between the two
resides in the initial conditions. A long, ¯exible metal beam
clamped to a vibrating support would usually show a standing
wave pattern; however, if started from a slightly different position, the standing wave pattern rapidly develops a chaotic
motion which increases the amplitude of the oscillation (®gure
15.12). Depending on the elasticity of the beam, this chaotic condition may lead to a breaking of the beam. Instead of producing
stable standing waves on the Tacoma Narrows Bridge that day,
the gusting winds could have produced this chaotic motion on
the bridge, causing its collapse.
The chaotic motion we are referring to here is not completely
uncontrolled, helter-skelter, random motion. It is, rather, a somewhat controlled, seemingly random motion that has been discovered in the past few years. The fundamental cause of chaos is
sensitivity to initial conditions. Take, for example, a dripping
faucet. A slow ¯ow rate produces a rhythmic, periodic drip; the
time interval between drops is always the same. A small increase
in the ¯ow rate may still produce a periodic drip. Further increase
of the ¯ow rate eventually produces what appears to be random,
turbulent motion; the time interval between drops seems to vary
randomly, without any periodicity or structure. To investigate
334
Wave Motion
Figure 15.12. (a) Standing waves develop in a long metal beam attached
to a vibrating clamp. (b) Depending on the initial conditions, chaotic
motion can develop which may cause the beam to break.
this phenomenon, the American physicist Robert S Shaw decided
to plot the interval between drops 1 and 2 versus the interval
between drops 2 and 3. When the dripping was periodic, all
points fell in the same place, one on top of the other. When the
¯ow was turbulent, the ®rst points plotted seemed to fall all
over the graph, without any pattern. As enough points accumulated, a pattern emerged; the points formed a constrained
shape. Out of disorganized, turbulent behavior, an eerie order
lurked; an orderly disorder, the order behind chaos (®gure
15.13 (color plate)).
This chaos appears not only in physical systems but also in
living organisms. Chaotic activity is apparently responsible for
certain mechanisms in the brain involved in learning. There is
experimental evidence that chaotic activity switches on and off
in the olfactory system of mammals with different chaotic
responses for familiar and unfamiliar smells. Ventricular ®brillation, an irregular and uncontrolled contraction of the muscle
®bers of the ventricles in the heart's lower chambers, is a condition
that brings death within minutes. The heart is a mechanical
pump and, like the dripping faucet, the condition of ventricular
335
SUPERSTRINGS AND OTHER THINGS
The frontiers of physics: Chaos in the brain
When we see the face of a friend, hear the voice of a famous
TV personality or recognize the smell of cherry pie, we know
almost instantly that what we see, hear or smell is familiar to
us. How do our brains process the complex and varied information perceived by our senses in such a short time with such
accuracy? The answer seems to be with the existence of chaos
in the brain.
When a person smells an odorant, for example, molecules carrying the odor are picked up by a few neuron cell
receptors in the nasal cavities. The receptor cells that capture
the molecules send out pulses which propagate to the olfactory center in the forebrain where the signals are analyzed
and transmitted to the olfactory cortex. From there the signals
are sent to other parts of the brain where they are combined
with the signals from other senses and a perception is
produced.
How does the brain separate one scent from all the others?
How does it learn to recognize familiar scents? It seems that
chaos is the property that makes perception possible.
Researchers at the University of California at Berkeley
attached an array of 60 electrodes 0.5 mm apart to the olfactory bulbar surface of trained rabbits and recorded electroencephalogram (EEG) tracings as the animals sniffed. At
®rst sight, these EEG tracings look irregular and unpredictable. The EEGs oscillate, rising and falling continuously,
but when a familiar smell is detected all the waves from the
array acquire some regularity, oscillating at about 40 Hz.
When the differences between consecutive waves were
plotted during perception of a familiar scent and during
rest, the underlying order of a chaotic system was observed
in the graphs.
The graphs suggested to the researchers that an act of
perception consists of an explosive jump from one chaotic
set of oscillations to another. They think that the olfactory
bulb and cortex maintain many chaotic sets, one for each
familiar scent. When a new odorant becomes familiar,
another set is added to the collection.
336
Wave Motion
How does the brain shift from the irregular EEGs to the
more regular oscillating one of a familiar scent, sound or
sight? More recent experiments, in which the brain neurons
of the visual system of anesthetized cats were tagged with a
dye that ¯uoresces when the neuron ®res, show that rather
than averaging the input received by all the neurons, as
had been believed, the brain actually processes all the signals.
Moreover, the experiments suggest that neurons respond
simultaneously to the same source rather than to many different ones.
How can the brain sort out what is important amidst all
this noise? Physicist William L Ditto of Georgia Institute of
Technology thinks that this noise actually helps the brain
detect weak signals. This is the idea behind the phenomenon
known as stochastic resonance, in which the addition of noise
enhances the detection of normally undetected signals.
Chaos in the brain, the researchers think, may be the
main property that makes the brain different from a computer, even at the current stages of arti®cial intelligence.
Future programmers will probably have to put chaos into
their programs if they want their machines to achieve even
a minimal level of recognition.
®brillation may have an underlying pattern of regularity. If heart
®brillation is indeed chaotic, it might be possible to predict when it
is going to appear and perhaps prevent it.
Water waves
Water waves are an example of a fairly complicated wave phenomenon that can be understood with what we have learned
about wave motion this far. As we have seen, it is energy that is
transmitted in a wave, not matter. The particles of matter that
make up the medium in which the wave travels vibrate around
®xed positions but do not travel along with the wave. In the
case of water, the particles of water move in circles, upward on
the front edge and downward on the back edge (®gure 15.14).
337
SUPERSTRINGS AND OTHER THINGS
Figure 15.14. The particles of water move in circles when the wave
passes.
After a disturbance is produced in water, several forces begin
to act on the water. Suppose we throw a big rock or brick in the
water of a lake. This makes the water next to the point of
impact rise above the normal level (®gure 15.15). As when an
object is thrown up into the air, the weight of the rising molecules
of water slows them down until they ®nally stop and begin to fall.
Accelerating downwards, they reach the equilibrium level with a
maximum speed and continue moving past this level pushing the
other molecules away and creating a depression in the water. The
pressure of the water itself produces a buoyant force that slows
the falling water and, after stopping it, pushes it back up.
Figure 15.15. Two forces act on the water when a disturbance is produced; the weight of the water acting downward, and the pressure of
the water itself acting upward, which produces a buoyant force FB .
338
Wave Motion
Figure 15.16. Ripples are propagated by forces that arise from surface
tension.
Again the water overshoots and rises above the undisturbed
level, and another cycle of this oscillation starts. Since water is
incompressible, when the ®rst depression is formed, the water
particles next to the depression are pushed up, and a second oscillating pulse is formed. In this way the disturbance propagates
away from the place where the rock was thrown.
In the case of very small waves or ripples, the role of the force
of gravity is not as important as that of surface tension. When the
water rises above the equilibrium level, the curved surface
becomes slightly larger than the ¯at undisturbed surface so that
the average distance between molecules increases. The forces
acting between the water molecules on the surface of the disturbance pull the surface to the ¯at undisturbed level and the
same overshooting effect takes place, producing oscillations
(®gure 15.16).
Seismic waves
Earthquakes are sudden disturbances within the Earth that produce waves (called seismic waves) that travel around the world.
When an earthquake occurs, several kinds of waves propagate
away from the epicenter at different speeds. The fastest waves,
which arrive at a detection site ®rst and consequently are called
P (or primary) waves, are longitudinal and travel through the
Earth. Deep in the earth, the P waves travel at a speed of
about 15 km/s, whereas near the surface their speed is about
5.5 km/s. The slower S (or secondary) waves, which also travel
through the earth, are transverse waves; their speeds are usually
7
12 the speed of the P waves. The slowest waves are the L waves,
which are surface waves with amplitudes that die out fairly
rapidly with depth.
339
SUPERSTRINGS AND OTHER THINGS
Figure 15.17. Waves from earthquakes on the other side of the earth,
which are changed when passing through the core, are different from
the waves from earthquakes that take place nearby.
Solid regions in the earth transmit both P and S waves, while
¯uid regions transmit only P waves. Observation and detection of
these different kinds of wave have been used to obtain information about the structure of the interior of the earth. In 1906,
from analyses of P wave data, the British scientist R D Oldham
demonstrated that the earth has a core. By observing the difference in the waves from earthquakes on the other side of the
earth and those from earthquakes that had taken place at closer
locations, he concluded that the waves from the other side of
the earth had been altered by passing through the core (®gure
15.17). Further studies showed that the interior of the earth has
two other layers besides the core: the mantle and the crust
(®gure 15.18). The core appears to be composed mainly of iron,
with small amounts of sulfur and oxygen; it extends 1218 km
from the center of the earth. The mantle, which has a thickness
of about 2900 km, consists of rocks made of iron and magnesium
combined with a silicate material. It becomes ¯uid under high
340
Wave Motion
Figure 15.18. Structure of the interior of the Earth. Detection of seismic
waves has yielded information about the composition of the Earth.
constant pressure; however, sudden changes in pressure turn it
brittle.
Surrounding the mantle is the crust, a relatively thin layer of
lower density rock; its thickness varies from a few kilometers
under the ocean, to about 50 kilometers in the continents. The
lower part of the crust, together with the upper mantle, constitutes the lithosphere, a rigid layer formed by large thin plates. These
plates, consisting of lower density granitic rocks, ¯oat on the
slightly denser plastic mantle, carrying the continents along
with them. The surface of the earth is continuously changing
and the continental plates move with velocities of a few centimeters per year.
341
16
SOUND
The nature of sound
Sound in¯uences all of our activities. The human ear is capable
of detecting waves carrying energies as low as 10ÿ18 J. For comparison, the energy required to lift a paper clip up to a height
of only one millimeter is one trillion times as great. This remarkably sensitive organ allows us to detect the immense variety of
sounds in which we are constantly immersed; the snarling
and barking of dogs, the mewing of cats, the chirping and
singing of birds, the screaming of sea gulls, the crowing of
roosters, the whispers of trees, the patter of rain, the thunder
of storms, the ticking of clocks, the whir of machines, the crying
of babies, the voices of singers or the melody and harmony of
music.
What is this phenomenon that we call sound? Galileo said
that sound is a wave motion ``produced by the vibration of a
sonorous body.'' Some 2200 years earlier, the Greek philosopher
Pythagoras had recognized that the pitch of a musical sound
depends on the frequency of vibration of the object that produces
the sound. Aristotle, in the fourth century BC, realized that the
transmission of sound through the air is related to the motion
of the air. The Roman architect and engineer Vitruvius, in the
®rst century BC, was probably the ®rst person to understand the
concept of waves. He wrote that the propagation of sound in
air was analogous to the motion of ripples on the surface of the
water. In the eighteenth century, Robert Boyle showed that
sound waves need a medium to travel through by pumping air
out of a jar containing a bell, which could not be heard without
the air.
342
Sound
Figure 16.1. After a drumhead is struck, higher and lower density
regions of air propagate away from the drum.
We know today that sound is a mechanical longitudinal wave
that propagates through a medium with frequencies that range
from a fraction of a hertz to several megahertz. These mechanical
waves are pressure waves produced by the vibrations of the molecules that make up the medium. After we strike a drumhead, it
starts its return toward the position of equilibrium but, like an
oscillating pendulum it overshoots, and continues moving past
this position, pushing the air molecules in its path. This creates
a region of higher density in the air. As the drumhead moves in
the opposite direction, again toward the equilibrium position, a
region of lower density air is produced. With each vibration of
the drumhead, regions of higher density air followed by regions
of lower density air are created. In the regions of higher density,
compressed air pushes on additional air, transferring momentum,
and a wave propagates through space (®gure 16.1).
Since the human ear can detect sounds between 20 and
20,000 Hz, sound of frequencies within this range are called
audio frequencies. Sound waves with frequencies below 20 Hz are
called infrasonic, and those with frequencies above 20,000 Hz are
called ultrasonic.
The speed of sound
The speed of propagation of sound through a medium is a consequence of the properties of that particular medium. In a gas, the
molecules are separated from each other by comparatively large
343
SUPERSTRINGS AND OTHER THINGS
Physics in our world: Telephone tones
When you press a button to dial a number with a touch-tone
telephone, you activate electric circuits that produce tones.
In the USA, pressing the number 1 button, for example,
generates a tone of frequency 697 Hz together with a tone of
frequency 1209 Hz. The number 6 button generates a 770 Hz
tone and a 1477 Hz tone. The sound produced by these
tones is transmitted as an electromagnetic signal to the central
telephone switching of®ce where it is interpreted by electronic
circuits in the switching network.
Each signal arriving at
the central switching of®ce
has been combined or multiplexed together with many
other signals. In the past,
this
multiplexing
was
achieved by assigning each
telephone signal to a different frequency band. Today,
multiplexing is performed
by converting telephone signals to digital form, received
by a large dedicated computer at the switching of®ce.
If the number called is
located at the same central
of®ce (the same exchange,
meaning that the telephone
numbers share the same
initial digits), the connection
is completed there. If the number called has a different
exchange, the signal is sent to a second central of®ce. For
long distance calls, the call is sent to a series of interchange
of®ces until the desired exchange is reached.
Several methods have been developed over the years
to transmit telephone signals. The oldest digital system still
in use today is a system in which multiplexed signals
travel through copper wires that carry 24 telephone signals
344
Sound
simultaneously. The very high frequency radio system,
which carries signals across the United States in the microwave region, bounces signals across microwave towers
located every 26 miles. More recently, with optical ®ber (see
next chapter), the telephone signals are encoded in the
near-visible region of the electromagnetic spectrum. A
single optical ®ber can carry 30 000 simultaneous telephone
signals.
distances, and the time between collisions is longer than in liquids
or solids, where the molecules, being much closer together, interact through intermolecular forces rather than through collisions.
Since a sound wave propagates through molecular interactions,
we would expect sound to travel at a lower speed in gases.
Table 16.1, lists the speed of sound in various substances.
Since molecular motion increases with temperature, we also
expect the speed of sound to be directly proportional to the temperature of the medium through which it propagates. For sound
waves moving in air, if we know the speed v0 at 08C, the speed is
approximately
v v0 0:61 m=s 331 0:61 m=s
where is the temperature in degrees Celsius. This means that the
speed of sound increases about 0.61 m/s for every one-degree rise
in temperature.
Table 16.1. The speed of sound in some gases, liquids and solids.
Substance
Speed in m/s
Gases
Air at 08C
Air at 208C
Air at 1008C
Hydrogen at 08C
Helium at 08C
Oxygen at 08C
Nitrogen at 08C
331
343
366
1286
965
317
334
345
Substance
Speed in m/s
Liquids at 258C
Fresh water
Seawater
1493
1533
Solids
Aluminum
Iron
Lead
Rubber
Granite
5100
5130
2700
1800
6000
SUPERSTRINGS AND OTHER THINGS
Table 16.1 tells us that the speed of sound in a lighter gas, like
hydrogen, is higher than the speed of sound in a heavier gas, like
oxygen. Recall from chapter 10 that the temperature of a substance depends only on the random kinetic energy per molecule,
not on the number of atoms and molecules in the substance nor
on the kinds of atoms that form the substance. At the same temperature, then, molecules of hydrogen and oxygen have the same
average kinetic energy. Since the kinetic energy of a molecule is
2
1
2 mv , at the same temperature, a hydrogen molecule, with a
mass m of 2 amu, has a greater speed v than a heavier oxygen
molecule of mass m equal to 32 amu.
Intensity of sound waves
The sensation of loudness, although subjective, is related to an
objective property, namely the intensity of the sound wave
being perceived. Intensity is the rate at which the wave transports
energy per unit of area; that is, the power delivered by the wave
through a unit of area, or
P
I :
A
Since the units of power are watts, the units of intensity are watts
per meter squared (W/m2 ).
The human ear can detect sounds with intensities as low as
10ÿ12 W/m2 and as high as 1 W/m2 . This is an enormous range
of intensity, a factor of 1012 or one trillion from the lowest to
the highest. Although the sound intensity of normal conversation
is about 10 000 times greater than that of a whisper, we would
perhaps say that normal conversation is only about 3 or 4 times
as loud as a whisper. To better represent the response of the
human ear to changes in sound intensity, we use the term sound
level, expressed in units called bels (in honor of Alexander
Graham Bell) or more commonly decibels, dB (1 dB is 0.1 bel).
On this new scale, 0 dB corresponds to the lowest intensity that
we can detect, and 120 to the maximum we can endure. Thus,
when the sound intensity changes by a factor of 10, we add 10
to the sound level in dB. We can see the relation between
sound intensity and sound level in Table 16.2. Sound levels for
several sounds are listed in Table 16.3.
346
Sound
Table 16.2. Comparison between sound intensity and sound level.
Sound level (dB)
0
10
20
30
40
50
60
70
80
90
100
110
120
Sound intensity (W/m2 )
ÿ12
Relative intensity
100 1
101 10
102 100
103 1 000
104 10 000
105 100 000
106 1 000 000
107 10 000 000
108 100 000 000
109 1 000 000 000
1010 10 000 000 000
1011 100 000 000 000
1012 1 000 000 000 000
10
10ÿ11
10ÿ10
10ÿ9
10ÿ8
10ÿ7
10ÿ6
10ÿ5
10ÿ4
10ÿ3
10ÿ2
10ÿ1
1
The ear
We learned in the previous section that the human ear can detect
sounds with an intensity of 10ÿ12 W/m2 . This corresponds to a
pressure of 3 10ÿ5 Pa above the atmospheric pressure, which
is only 6 times greater than the pressure ¯uctuations produced
by the random motion of the air molecules. If the human ear
were a little more sensitive, we could hear individual molecules
colliding with the eardrum!
The human ear, like the ear of other mammals, is divided into
three regions: the outer ear, the middle ear, and the inner ear
(®gure 16.2) The outer ear consists of the auricle or pinna and the
Table 16.3. Some sound levels.
Sound
Sound level Sound
(dB)
Sound level
(dB)
Barely audible
Rustle of leaves
Whisper at 1 m
Quiet room
Library
Average classroom
Normal conversation (1 m)
Busy street
0
10
20
30
40
50
60
70
80
90
100
110
120
120
140
150
Average factory
Niagara Falls
Power mower
Rock concert (outdoors)
Rock concert (indoors)
Threshold of pain
Jet plane at 30 m
Jet takeoff
347
SUPERSTRINGS AND OTHER THINGS
Figure 16.2. (a) The human ear. (b) Schematic diagram of the ear.
2.5-cm long auditory canal, which connects the auricle with the
eardrum or tympanum. This has the form of a ¯attened cone
some 9 mm in diameter, stretched across the tympanic annulus, a
small bone in the shape of a ring. Sound waves enter the ear
canal through the auricle and set the eardrum into vibration.
Behind the eardrum is the middle ear which consists of a
narrow, air-®lled cavity where three small bones, the hammer,
the anvil, and the stirrup, transmit the vibrations of the eardrum
to the oval window of the inner ear. Leading downward from
this cavity is the Eustachian tube, which connects the ear to the
348
Sound
Figure 16.3. The inner ear.
nasopharynx, the region behind the nasal passages. The purpose
of this tube, about 45 mm in length, is to equalize the pressure on
both sides of the eardrum.
The inner ear is contained in a bony structure known as the
labyrinth with two openings to the middle ear; the oval window,
connected to the stirrup, and the round window, covered by a thin
membrane. This labyrinth contains three semicircular canals and
a spiral tube about 30 mm in length called the cochlea, a word
derived from the Greek world for snail (®gure 16.3). It is in the
cochlea where the vibrations of the sound waves are transformed
into the electrical impulses the nerves transmit to the brain. The
cochlea is divided internally by three longitudinal sections: the
vestibular canal, which ends at the oval window; the tympanic
canal, which ends at the round window; and the cochlear duct,
which lies in the middle of the other two and also ends at the
round window.
A sound wave coming into the inner ear through the oval
window travels down the vestibular canal and back along the
tympanic canal. The membrane that separates these two canals
contains the organ of Corti, a complex arrangement of some
30 000 nerve endings (®gure 16.4). The organ of Corti is covered
by some ®ve thousand sensory cells ± called hair cells due to
their hairlike appearance ± which are stimulated when the membrane is set into vibration by a sound wave. The nerve ®bers
349
SUPERSTRINGS AND OTHER THINGS
Figure 16.4. Cross-section of the cochlea showing the location of the
organ of Corti and the vestibular and tympanic membranes.
connected to these hair cells transmit this information to the brain
in the form of electrical impulses which the brain interprets as
sound.
Hair cells are about ®ve thousandths of a millimeter thick
and as fragile as a cobweb. Sound waves that carry too much
energy can damage them. A few minutes exposed to a sound
level above 110 dB can cause permanent damage; above this
``threshold of pain'', exposure for any length of time is dangerous. Each hair cell can respond to motions as small as 100 picometers (10 billionths of a centimeter), a distance similar to the
diameter of one of the larger atoms. The oscillatory pressure of
the sound wave moves the hair cells and this motion is transformed into an electrical impulse at the cell which changes the
electrical potential that exists between the interior and exterior
of the cell, caused by different concentrations of sodium and
potassium ions inside and outside the cell. When the hair cell is
at rest, there is an exchange of ions through open channels in
the membrane, and an equilibrium situation is reached. When
the cell is moved, the number of open channels changes and
the equilibrium is disturbed; this changes the electric potential
of the cell from about 60 mV to about 40 mV. This change in the
potential difference triggers an electrical signal that travels to
the brain.
350
Sound
Hair cells are also the sensory receptors for the sense of
balance which allows humans to walk upright. In addition to
the cochlea, ®ve other sensory organs contain hair cells; the
three semicircular canals, the utricle and the saccule. The hair
cells of the utricle and the saccule are located on two thin, ¯at
sheets, one of which is positioned vertically and the other horizontally. Close to these organs is the otolithic membrane which
contains hundreds of thousands of small crystals, the otoconia,
which make this membrane denser than the ¯at sheets that contain the utricle and the saccule. When the head is accelerated,
the otocondria lags behind due to its greater inertia, and this
causes the hair cells of the utricle or the saccule to be displaced
in the opposite direction. Displacement of the hair cells produces
potential difference changes, which are communicated to the
brain as electrical signals.
A similar mechanism works in the semicircular canals. In
this case, the hair cells are moved when an angular acceleration
takes place. The canals are ®lled with a ¯uid, the endolymph,
which lags behind the sides of the canal when the head is rotated.
This ¯uid exerts a pressure on the membrane where the hair cells
are located, producing the displacement that causes the potential
changes in the cells. As the three canals, the superior canal, the
horizontal canal, and the inferior canal, lie along perpendicular
directions, they can detect angular acceleration along three
perpendicular axes in space (®gure 16.5).
Figure 16.5. The labyrinth showing the three semicircular canals, the
saccule and the utricle.
351
SUPERSTRINGS AND OTHER THINGS
The frontiers of physics: Electronic ear implants
People with hearing loss caused by damage to the hair cells
which cover the organ of Corti can now receive cochlear
implants that replace the function of the hair cells with electrodes. These electrodes, in the form of a very thin wire less
than a millimeter in diameter, are inserted through the ear
canal into the cochlea to stimulate the auditory nerves electronically, allowing the deaf person to perceive sound.
Although the ®rst implants contained a single electrode,
more recent devices use multichannel electrodes with several
receptors for different frequencies. A sound processor worn
in a shirt pocket encodes the sound and transmits it as
radio waves to the electrodes.
Researchers are currently attempting to determine the
proper position in the cochlea to place the electrodes and
the range of frequencies to which each electrode should
respond. High frequencies stimulate the outer spiral of the
cochlea whereas low frequencies must penetrate deep inside.
The original single channel electrodes (which were
approved by the Food and Drug Administration but are no
longer available) gave most patients only the awareness of
sound. The multichannel devices based on the selective
frequency bands now in development promise to give
speech recognition to the patient
The sound of music
What is the difference between music and noise? We might say
that music sounds pleasant and noise generally unpleasant. But
what makes a sound pleasant to our ears? Pythagoras, in the
352
Sound
sixth century BC, discovered that when two similar stretched
strings of different lengths are sounded together, the sound
thus produced is pleasant if the lengths of the strings are in the
ratio of two small integers.
Pythagoras was probably born in the year 560 BC in Samos,
one of the Aegean islands. Little is known of his early years in
Samos. In 529 BC, after traveling through Egypt and the East on
advice from his mentor Thales, Pythagoras settled in the Greek
colony of Crotona, in southern Italy, where he began lecturing
in mathematics and philosophy. People from the upper classes
attended his lectures. One of the most attentive was Theano,
the beautiful daughter of his host, whom Pythagoras later
married, and who, in order to attend the lectures, had to defy
an ordinance which forbade women to be present at public
functions. Theano later wrote a biography of her husband, now
unfortunately lost.
Pythagoras discovered that if a stretched string vibrates as a
whole ± we would say that it vibrates in the fundamental mode ±
the sound produced by a second string that has half the length of
the ®rst one would be harmonious with the sound of the ®rst
string (®gure 16.6(a)). Strings with lengths equal to one third,
one fourth, and so on, would also be harmonious with the ®rst
string. Pythagoras also realized that two stretched strings of
Figure 16.6. (a) The sound produced by a string of a certain length
vibrating in the fundamental mode is harmonious with the sound of a
second string that has half the length, also vibrating in the fundamental
mode. (b) Two equal strings also produce harmonious sounds if one
vibrates in the fundamental and the other in two, three or more equal
parts (overtones).
353
SUPERSTRINGS AND OTHER THINGS
equal length would produce pleasant sounds if one is vibrating as
a whole and the other is made to vibrate in two, three, four or
more equal parts (®gure 16.6(b)). These modes of vibration are
what we called in chapter 15 the overtones of the fundamental,
and together with the fundamental form a harmonic series.
Thus, Pythagoras found that the chords that sound pleasant to
the ear correspond to the normal modes of vibration of a string;
that is, those frequencies which form a harmonic series.
Although we have seen that the sounds with frequencies
from a harmonic series are pleasant to the ear, we have not said
why this is so. One characteristic of the sound of music is that
the sound waves show some periodicity; we say that music has
sustained notes. The sound waves of noise, on the contrary, are
disorganized and do not show periodicity. Musicians refer to a
musical tone in terms of loudness, pitch and quality or timbre. We
studied loudness earlier in the chapter; here we will consider
pitch and quality.
The sensation of pitch is related to the frequency of sound
waves; high-pitched sounds have high frequencies and lowpitched sounds have low frequencies. The human ear, as we
know, can detect sounds with frequencies between 20 and
20 000 Hz; thus, sound waves outside this range have no pitch,
since pitch is a subjective sensation.
The different modes of vibration of a stretched string of
length L are related through their frequencies fn by the expression
fn
nv
2L
where, as we saw in chapter 15, n refers to the mode of oscillation
and v is the velocity of propagation of the waves in the string.
The fundamental mode or ®rst harmonic has n 1, the second
harmonic has n 2, and so on. Since the frequency of vibration
of the string is related to its length, we can restate Pythagoras's
discovery by saying that a sound of two strings vibrating simultaneously is pleasant if the ratio of their frequencies is the ratio of
two small integers. If the ratio is 2 to 1, the sounds are one octave
apart. In musical terms, octaves are equal intervals of pitch.
Descriptions of the pitch relationships that exist in music are
called musical scales. Since many relationships can be established,
the number of possible musical scales is very large, and many
354
Sound
Table 16.4. Diatonic C-major scale
Note
Letter
Frequency (Hz)
Frequency ratio
Interval
do
re
mi
fa
sol
la
ti
do
C
D
E
F
G
A
B
C
264
297
330
352
396
440
495
528
9/8
10/9
16/15
9/8
10/9
9/8
16/15
Whole
Whole
Half
Whole
Whole
Whole
Half
have been used throughout history. Generally, the more advanced
civilizations have developed complex scales. The most common
scales are the pentatonic scales, which are based on ®ve notes,
and the heptatonic scales, based on seven notes. Western music is
based on one heptatonic scale, the diatonic scale, with the familiar
notes ``do-re-mi-fa-sol-la-ti-do.'' The notes in this scale are named
by using a LETTER (C, D, E, F, G, A, B), and each has a particular
frequency. The scale itself has ®ve whole steps (W ) and two half
steps (H) with two permutations or modes, major and minor. The
major scale uses the interval sequence W-W-H-W-W-W-H, while
the minor scale has the sequence W-H-W-W-H-W-W. A half step
has a frequency ratio of 16/15 and a whole step has frequency
ratios of 9/8 or 10/9. Table 16.4 shows the diatonic C-major scale.
In Table 16.4, the C with a frequency of 264 Hz is known as
the middle C because in written music this note occurs midway
between the treble and bass clefs. (Clef is the sign placed at the
beginning of the musical staff to determine the position of the
notes. The musical staff is the horizontal set of lines on which
music is written.) The ratio between the next C, the C above
middle C, and middle C is 528 Hz/264 Hz 2/1. These two
notes are therefore an octave apart. We notice that if we label
middle C the ®rst note, the C above middle C would be the
eighth note and octave means ``eight'' in Latin. The ratio between
the frequency of each A and the preceding A or between F and the
preceding F is always 2; and this is valid for any other note. We
can then say that the A above the A with 440 Hz has a frequency
of 880 Hz and the C below middle C has a frequency of 132 Hz.
In addition to loudness and pitch, a musical tone has quality.
The same note when played on a violin sounds different from
355
SUPERSTRINGS AND OTHER THINGS
Figure 16.7. A stretched string vibrating in several overtones. The top
diagram shows the ®rst two harmonics and the bottom diagram the
®rst three harmonics. On the right is the resultant waveform for each
case.
when it is played on a piano or sung by a soprano. The difference
is in what is called the harmonic content of the wave; that is, the
number of harmonics present in the wave. We know from the
superposition principle that standing waves of different frequencies can exist in the same medium at the same time (®gure 16.7).
For example, a guitar string usually oscillates in many overtones
when it is plucked. A piano string when struck also oscillates in
many overtones, but the relative amplitudes of these overtones
are different from the ones in the guitar string. The relative
oscillation energies of the different instruments determine the
quality of sound they produce. Since each instrument has its
own balance of overtones, a clarinet sounds different from a
¯ute or from a soprano voice when sounding the same note
(®gure 16.8). The quality of the sound that we produce with
our voice is also what allows us to discriminate between the
different vowel sounds. When we speak, the vocal chords vibrate
in the fundamental and several overtones and these resonate in
the mouth. These resonant frequencies are called formants.
When we change the shape of the mouth as we pronounce the
different vowels, some harmonics are stressed more than others
and a different set of formants appears, producing the different
sounds.
356
Sound
Figure 16.8. Sound waves for different instruments sounding the same
note.
Musical instruments
Musical instruments consist of a source of sound and a resonator
which ampli®es the sound and enhances certain harmonics.
Guitars and violins use strings to generate sounds and a sounding
box as resonator (®gure 16.9). The vibrations of the strings are
passed to the sounding box by means of the bridge. The sounding
box has its own modes of vibration which resonate with the vibrations of the strings, amplifying only certain frequencies. We saw
in chapter 15 how standing waves develop on a stretched string
which make the string vibrate in several different ways or
modes with frequencies given by
nv
fn
:
2L
The frequency of the fundamental determines the pitch of the
note. Since all the strings in a guitar or a violin have the same
length, the only way to change the fundamental is by changing
the speed of propagation of the waves on the string. How can
this speed be changed? It turns out that the speed of propagation
of waves on a stretched string depends on two factors: the tension
357
SUPERSTRINGS AND OTHER THINGS
Figure 16.9. Guitars and violins have a sounding box or chamber that
resonates with the vibrations of the strings.
in the string and the string's density. If the tensions of the different strings in a guitar are all the same, the equation above tells us
that strings of different densities vibrate with different fundamental frequencies.
Pianos have strings of different lengths and the different
fundamental frequencies are achieved by varying, in addition
to the length, the tension and the density of the strings. To
withstand the tremendous tension of the strings, pianos have a
cast-iron frame. The resonator in a piano is the sounding board, a
¯at piece of wood as big as the piano itself (®gure 16.10). When
a key is depressed, a hammer, which is linked to the far end of
the key, strikes the string and sets it into vibration. The vibrations
of the strings are transmitted to the sounding board via the bridge
over which the strings are stretched. A felt damper, controlled by
358
Sound
Figure 16.10. Schematic diagram of the hammer, damper, string, bridge
and sounding board in a piano.
one of the two or three pedals present on pianos, can be lifted to
allow the strings to vibrate freely or lowered to stop the vibration.
In a wind instrument such as the oboe, the musician blows
into the mouthpiece producing eddies which set the reeds into
vibration. This causes standing waves of the air within the tube.
In other instruments such as brasses, the musician's lip vibrates
as he or she blows air into the mouthpiece; these vibrations also
produce standing waves in the column of air in the instrument.
In each case, the frequency of the note produced depends on
the length of the column of air, whether the tube is open or
closed, and whether it is tapered (conical) or cylindrical.
The standing waves established on the strings of stringed
instruments have nodes at both ends (since they are clamped at
the ends). The fundamental frequency, with nodes only at the
ends, determines the pitch. The quality or characteristic sound
depends on the harmonics present. The situation is similar with
an air column; the closed end of the tube or pipe, such as an
organ pipe, where the air is not free to move, is a displacement
node. However, at the open end of the tube the air is free to
move and a displacement antinode or point of maximum displacement appears there as shown in ®gure 16.11. Since the distance
between a node and the next antinode is one quarter of the wavelength when the air column is vibrating in the fundamental mode,
then the wavelength of the fundamental must be 4 times the length
L of the tube, or 1 4L. We can obtain the frequency of the fundamental by making use of the expression relating frequency, wave
speed and wavelength obtained earlier in the chapter; that is,
v f or f v=. Thus, the frequency of the fundamental is
f1 v=1 v=4L where v is the speed of sound. The next harmonic
359
SUPERSTRINGS AND OTHER THINGS
Figure 16.11. Standing waves in an organ pipe closed at one end. We
use the graph of a wave to indicate variations in the density of the air
in the pipe caused by the vibrations. The nodes are points of higher density and minimum displacement and the antinodes or points of maxima
are regions of lower density and maximum displacement.
contains one additional node between the open and closed ends,
as shown in ®gure 16.11. Since the open end must be an antinode
and the distance to the previous node is one quarter , the length L
of the tube must be equal to three quarters the wavelength of this
harmonic so that the wavelength is
3 43 L;
f3
v
v
3v
:
3 43 L 4L
This frequency is three times the frequency of the fundamental;
thus this harmonic is the third harmonic, which is the reason for
the subindex 3 in the expressions above. Examining ®gure
360
Sound
16.11, we can see that the next harmonic has two nodes in
addition to the node at the end. One and one-quarter wavelengths
equal the length of the tube; thus, the wavelength of this
harmonic is 5 4L=5 and the frequency is f5 5v=4L or ®ve
times the frequency of the fundamental. Hence, this is the ®fth
harmonic. There are no second or fourth harmonics. Actually,
there are no even harmonics in a tube that is closed at one end;
only odd harmonics are present.
In an open organ pipe both ends are antinodes, and the
fundamental occurs when there is a node in the middle of the
pipe (®gure 16.12). In this case, the wavelength is equal to twice
the length of the pipe; that is, 2L. The frequency of the fundamental is then f v=2L. The second harmonic has two nodes
between the ends so that the wavelength is equal to the length
of the pipe; the frequency is then f2 v=L 2f1 . Examining
®gure 16.12, we can see that the third harmonic has a wavelength
Figure 16.12. Fundamental and second and third harmonics in an open
organ pipe.
361
SUPERSTRINGS AND OTHER THINGS
3 2L=3 and a frequency f3 3f1 . As opposed to the closed pipe,
in an open organ pipe all harmonics are present. This also applies
to open wind instruments such as ¯utes, and to reed instruments
with a conical bore, such as saxophones.
When an organ is played, several harmonics are present at
once. The harmonic content of a pipe depends on the ratio of
length to diameter, whether the pipe is closed or open, cylindrical
or conical, and to some extent on the material from which the pipe
is made, as well as the sound-generating edge (®pple or reed).
The harmonic content determines the quality or timbre of the
pipe.
The Doppler effect
You have no doubt noticed the sudden drop in pitch from an
automobile horn as the car speeds past you. We know that the
pitch does not change for the driver and that it is only because
the car is moving that we hear a change in pitch. The same phenomenon can be heard if you drive past a car blowing its horn; the
pitch is higher when you approach the car and lower when you
are leaving it behind. When a source of sound and a listener are
in motion relative to one another the pitch of the sound is not
the same as when the source and listener are stationary with
respect to each other. This phenomenon is known as the Doppler
effect after the Austrian physicist Christian Doppler (1803±1853),
who ®rst explained its properties.
Suppose the horn of an automobile emits a sound of a particular frequency f and wavelength ( v=f ) in all directions
(where v is the speed of sound in air). If this automobile is standing
by the side of the road and we approach it with a speed v0 , the
speed of the sound waves emitted by the car appears to be
increased to v0 v v0 . As we move toward the car emitting
the sound, we encounter more wavelengths per second. In this
case, the frequency that we perceive is
f0
v0 v v0
:
Although we are approaching the car, we still measure the same
distance between two consecutive crests as if we were standing
362
Sound
by the car because the car emitting the sound is at rest with
respect to the air through which the sound propagates and our
motion does not affect it. Since the distance between two consecutive crests is the wavelength, this means that the wavelength
remains the same v=f , and our expression for the frequency
of the sound we hear as we approach the stationary car with a
speed v0 becomes
f0
v v0
v v0
f
:
v=f
v
Let us examine this expression a little more closely. Since v0
is positive, the numerator is greater than the denominator which
makes the frequency that we detect greater than the frequency
perceived by a listener standing by the side of the car. A higher
frequency means a higher pitch. Notice that if we decide to
stop by the car with the stuck horn, then v0 0 and f 0 f ; that
is, the frequency we hear is the frequency heard by a listener at
rest with the source of sound, which is what we have become
by stopping. If we resume our trip and drive away from the
car, then we must consider v0 to be negative. In this case, the
numerator becomes smaller than the denominator and f 0 is
smaller than f , which implies a lower pitch. Thus, the perceived
frequency is higher than the frequency of the source when the listener approaches the source and lower when the listener moves
away from the source. Summarizing, the frequency perceived
by a listener moving with a velocity v0 with respect to a stationary
source of sound is
v v0
Listener approaching the source
f0 f
v
v ÿ v0
Listener moving away from the source.
f0 f
v
A different situation occurs when the source of sound
approaches or moves away from the listener. In these two cases
the wavelength changes because the source of sound is in
motion with respect to the medium through which the sound
propagates. The pitch we hear from the siren of an approaching
®re truck is higher than the pitch heard by the driver of the ®re
truck (®gure 16.13). As the ®re truck moves in our direction, it
gets closer to the waves that were emitted earlier and the sound
363
SUPERSTRINGS AND OTHER THINGS
Figure 16.13. (a) The listeners hear the same frequency when the ®re
truck sounding the siren is at rest. (b) When the ®re truck is moving,
the listener behind it hears a lower pitch than the listener in front of
the ®re truck.
waves that reach us get closer together. The siren oscillates at a
constant frequency even when it is moving. At the beginning of
each oscillation the ®re truck has moved a little closer to us
than where it was when the previous oscillation began. This
makes the distance between oscillations or wavelength decrease.
Since the velocity of sound remains unchanged, then the frequency has to increase. On the other hand, when the ®re truck
is moving away from us, the siren is a little farther away at
each oscillation and the wavelength is stretched out (®gure
16.13(b)).
Suppose the siren emits a pure tone of frequency f and wavelength v=f , where again v is the speed of sound in air. If the
®re truck is moving with a speed vs in the direction of the listener,
the wavelength detected by the listener is shortened by vs =f
during one period of oscillation of the wave. The wavelength
detected by a listener in front of the ®re truck is then
364
Sound
0 v ÿ vs =f and the frequency is
f0
v
v
f
:
0
v ÿ vs
When the ®re truck is moving away from the listener, vs has
opposite sign.
Summarizing, when a source of sound moves with a speed
vs toward or away from a stationary listener, the frequency
perceived is
v
f0 f
Source approaching listener
v ÿ vs
v
f0 f
Source moving away from listener.
v vs
The Doppler effect also applies to other kind of waves. It even
applies to electromagnetic waves, although the mathematical
expressions are somewhat different. Police radar works by
measuring the Doppler shift in frequency of electromagnetic
waves emitted by the radar transmitter, as they are re¯ected
from moving cars. Certain motion-sensitive sound detectors
activate some large department store or supermarket doors.
Astronomers determine the motion and velocity of distant
galaxies by measuring the Doppler shifts of the light they emit.
The discovery of Doppler shifts in the spectra of galaxies in the
1920s enabled Hubble to discover the expansion of the universe.
Shockwaves
When the source of sound moves with a speed greater than the
speed of sound in the medium in which it is moving, a shockwave
is produced. As we have seen, when the source of sound moves at
a speed lower than the speed of sound, the crests of the waves get
closer together in front of the moving source and farther apart
behind it. If the source of sound moves exactly at the speed of
sound, the waves pile up in front. The speed of sound in a particular medium depends on the elasticity of the medium; that is, on
how fast and how much the molecules that make up the medium
vibrate as the wave propagates. When the source of sound moves
through the medium, the elastic properties of the molecules allow
365
SUPERSTRINGS AND OTHER THINGS
Figure 16.14. Shock wave formed by a source of sound moving at
supersonic speeds. (Courtesy R J Gomez, NASA Johnson Space Center.)
them to move away from the source. When an airplane moves at
the speed of sound, the air molecules cannot move away fast
enough and pile up in front of the plane in one large crest. If
the airplane moves faster than the speed of sound, it outruns
the sound waves and these pile up along the sides, producing a
large crest which extends in a cone behind the plane. This large
crest, formed by the constructive interference of a large number
of individual crests, is the shockwave (®gure 16.14).
When an airplane ¯ies overhead at supersonic speeds, the
shockwave is heard as a very loud crack, or sonic boom. This
carries an enormous amount of energy which can shatter windows and produce sound levels that might reach harmful levels.
For this reason, supersonic speeds like that achieved by the
Anglo-French Concorde have been banned over the continental
United States.
An airplane traveling at the speed of sound is said to be
¯ying at Mach 1, after Ernst Mach (1838±1916), an Austrian
physicist and philosopher of science who was the ®rst to investigate the change in air¯ow around an object as it reached the
speed of sound. Mach 2 is used when the airplane is ¯ying at
366
Sound
Figure 16.15. Bow wave formed by a duck swimming at a speed greater
than the speed of propagation of water waves.
twice the speed of sound, and so on. A sonic boom is not
produced only as the airplane crosses the ``sound barrier.'' The
shockwave follows the airplane and the sonic boom is heard
after the plane has passed overhead. A duck moving in the
water at a speed greater than the speed of propagation of surface
waves on the water also produces a kind of shockwave, called a
bow wave (®gure 16.15).
Ultrasound
When sound waves strike the surface of a solid object that is large
compared with the wavelength of the sound waves, they are
re¯ected. If the object is smaller than the wavelength of the
waves impinging on it, the waves simply travel around the
object almost undisturbed. A small piece of wood ¯oating on
the water does not disturb the waves produced by a speedboat;
another boat does. With sound waves, when the size of the
object is at least several times greater than the wavelength of the
waves, the sound waves are re¯ected by the object; this is
the familiar echo. The sound of an automobile horn with a frequency of 300 Hz has a wavelength in air equal to v=f
342 m=s= 300 Hz 1:14 m; thus, objects several meters across
are large enough to re¯ect sound waves of this frequency. Objects
only a few centimeters across re¯ect only sound waves of very
367
SUPERSTRINGS AND OTHER THINGS
high frequencies, beyond the audible range of a human ear. These
sound waves are known as ultrasonic.
Our knowledge of the speed of sound in a particular
medium, say air, allows us to determine the distance to a
source of sound if we know the precise time when the sound
was produced. Because light travels at 3:0 108 m/s, we see the
lightning almost instantaneously but hear the thunder a few
seconds later. It takes light 10 millionths of a second to travel a
distance of 3 km whereas sound waves take 9 seconds to travel
the same distance. If we clap our hands and 1 second later hear
the echo from a cliff in front of us, we know that the cliff is
some 170 m away, since x vt 342 m=s 1 s 342 m and the
sound waves traverse this distance twice. Ships and submarines
use this technique in underwater range®nding or sonar (sound
navigation and ranging). A transmitter sends out a short-wavelength pulse of ultrasonic frequency through the water and a
receiver detects the re¯ected pulse. From knowledge of the time
it takes for the pulse to travel to the obstacle, the distance to the
obstacle can be determined.
Bats also emit ultrasonic pulses of a few milliseconds duration which, after re¯ecting from obstacles and prey up to about
4 m away, are detected by their large ears. Bats, then, can ¯y
and hunt in complete darkness and can apparently locate their
prey by distinguishing the times of arrival of the echo at the
two ears. The repetition rate of the ultrasonic pulses varies from
about 10 pulses per second when the obstacle or prey is farther
away to some 200 pulses per second when the prey is closer.
The range of frequencies of the sound emitted by bats goes
from 30 kHz to nearly 120 kHz. Moths, a favorite prey of bats,
seem to be able to hear those frequencies and drop quickly to
the ground when they detect them. They also have another protection against their predators; their furry bodies are good
sound absorbers and consequently poor sound re¯ectors.
Taking a cue from nature, physicists have developed a
medical diagnostic and treatment tool using ultrasonic waves.
An ultrasonic pulse is produced by applying a high-frequency
alternating voltage to both sides of a quartz crystal which then
vibrates, emitting ultrasonic waves. (Old phonographic needles
use the same principle: the vibrations of the needle as it rests
on the groove are transmitted to a crystal which translates them
368
Sound
into an alternating voltage; this voltage, after it is ampli®ed,
drives the speakers.) These crystals are called piezoelectric and
this technique of transforming electrical energy into mechanical
energy is called the piezoelectric effect. After the ultrasonic pulse
is re¯ected from the boundaries between different organs of the
body, it is detected and analyzed. A two-dimensional image
can be formed when multiple echoes are combined according to
the time delay and intensity of the signal received. The image
obtained, called a B-scan, is displayed on a monitor.
369
17
OPTICS
Waves of light
Light, as we learned in chapter 14, propagates through space as
an electromagnetic wave. Maxwell's third equation tells us that
a changing magnetic ®eld produces a changing electric ®eld
which, according to Maxwell's fourth equation, creates in turn
a changing magnetic ®eld. As Maxwell was able to show, once
the process starts, the changing magnetic and electric ®elds
continue to propagate as a wave. Although we also learned in
chapter 7 that light also behaves as a particle, the wave theory
of light is adequate for many purposes. We shall occupy ourselves with the study of the wave nature of light in the next
chapter. Here, we turn to two phenomena that do not depend
on whether we think of light as particles or as waves and are
the basis for many optical effects; the laws of re¯ection and
refraction.
Re¯ection of light
Since very early, people have known that light travels in a straight
line. Observations of solar and lunar eclipses led the Greek
thinkers to realize that light travels in straight lines at very high
speed. The shadow cast by the moon on the earth during a solar
eclipse or by the earth on the moon during a lunar eclipse
shows that light travels in straight lines over long distances
(®gure 17.1). It is often convenient to use straight lines or rays to
represent pictorially this rectilinear motion of light. This representation will be useful in understanding some of the basic properties
370
Optics
Figure 17.1. (a) The shadow cast by the moon during a solar eclipse or
(b) by the earth on the moon during a lunar eclipse, shows that light
travels in straight lines over long distances.
of light, in particular the laws of re¯ection and refraction and the
operation of many optical devices.
When light falls upon an object, the oscillating electric ®eld of
the incoming light sets the electrons of the atoms in the object into
vibration. According to Maxwell's equations, a vibrating electric
charge sets up an electric ®eld of varying strength which simultaneously creates a time-varying magnetic ®eld. Such ®elds, as we
know, take on a life of their own; they form the electromagnetic
waves that travel to space. Thus, the electrons of the atoms or
molecules of the object, in particular, the outer electrons which
are less tightly bound, absorb the light's energy and start vibrating, becoming emitters of electromagnetic radiation in the process.
This electromagnetic radiation, re-emitted after it is absorbed by
the electrons, is the light we see when we look at the object.
371
SUPERSTRINGS AND OTHER THINGS
The law of re¯ection describes the way in which light is
re¯ected from the surface of an object and says that the angle at
which a light ray returns from a surface, or angle of re¯ection, is
equal to the angle at which the ray strikes the same surface, or
372
Optics
Figure 17.2. The angle that a re¯ected ray makes with the normal to a
surface or angle of re¯ection is equal to the angle that the incident ray
makes with the normal or angle of incidence. The incident and re¯ected
rays lie in the plane that contains the normal to the surface. This is the law
of re¯ection.
angle of incidence. As illustrated in ®gure 17.2, these angles are
measured from the normal or perpendicular to the surface. The
incident ray, the re¯ected ray and the normal are all in the
same plane. When light from a distant source falls upon an
object, the incident rays of light are all nearly parallel. If the surface of the object is smooth, like a mirror or polished metal, the
re¯ected rays are also parallel. This is called specular re¯ection,
from the Latin word for mirror (®gure 17.3(a)). If the surface is
rough, like a wall or the pages of this book, the re¯ected rays
are not parallel to each other; they are re¯ected in all directions
from the surface. This is called diffuse re¯ection. Diffuse re¯ection
is what allows us to see the objects around us; light from the sun or
a lamp strikes the surface of the object re¯ecting in all directions.
The re¯ected rays, being re¯ected from the object in our direction,
strike our eyes and we see the object. When we move to another
location, other re¯ected rays strike our eyes. The same object can
be seen by many people or by ourselves when we shift our position because there are re¯ected rays in all directions coming from
the object's irregular surface (®gure 17.3(b)). When we look at a
mirror, on the other hand, all the incident rays from a distant
373
SUPERSTRINGS AND OTHER THINGS
Figure 17.3. (a) The re¯ection from a smooth surface is called specular.
(b) Re¯ection from rough surfaces is called diffuse.
source are re¯ected in the same direction and we see the source of
light not the mirror. If the rays incident on a mirror are coming
from the diffuse re¯ection off a rough surface, like our face, we
see that surface.
Re¯ection from mirrors
When you look at yourself in a mirror, what you see is the image
or likeness of your face. It looks just like you except that your
right eye is still on the right of the image but the image is looking
back at you and so it is the image's left eye. We are so used to
looking at things in mirrors that we no longer pay any attention
to this apparent right±left reversal. The driver we see through
374
Optics
Figure 17.4. The image you see of your face appears reversed. Studying
the ray diagrams you can see how this reversal takes place. It is in fact not
reversed right to left, but front to rear.
the rear-view mirror of our car looks ®ne apparently driving at
the other side of the car.
Imagine that you are looking at your raised right hand in the
bathroom mirror (®gure 17.4). When you think about it, it looks as
if the image is saluting with the left hand. Consider one of the
many rays of light that are re¯ected by your right thumb, one
that is going to enter your eye after re¯ecting from the mirror.
This ray appears to come from a point behind the mirror at a
distance from the mirror equal to the distance from your thumb
to the mirror. Now, consider another similar ray coming from
your right pinky. Although this ray also enters your eye after
bouncing off the mirror, it strikes the mirror at a point to the
right of the ray coming from the thumb. This ray appears to
come from a point behind the mirror to the right of the thumb
and so it looks like a left hand instead of a right hand.
Suppose now that you are looking at the re¯ection of a liquid
soap dispenser in the mirror (®gure 17.5). Again, the light rays
appear to come from a point far behind the mirror as the dispenser is in front of the mirror. We call the point behind the mirror
from where a particular ray appears to come from, the image, I,
and the point on the soap dispenser where the ray actually originates, the object, O. Labeling the distance from the object to the
mirror o for object distance and the distance from the image to
375
SUPERSTRINGS AND OTHER THINGS
Figure 17.5. The distance from the object to the o mirror is the same as
the distance from the image i to the mirror.
the mirror, i for image distance, we can write that for plane mirrors
(to distinguish them from curved mirrors, to be considered later),
object distance o image distance i:
Curved mirrors
Mirrors, as we know, do not have to be ¯at. Make-up mirrors or
shaving mirrors, which are used to magnify the image of one's
face, are spherical mirrors. Exterior rear-view mirrors used on
the right side of some automobiles, mirrors used in some shops
for security, and corner mirrors used at intersections in some
small towns with narrow streets are also curved mirrors, here
used to increase the viewing angle.
How do these mirrors magnify the image or increase the
viewing angle? Let's look at a few light rays coming toward a
curved mirror from certain speci®c directions. In ®gure 17.6(a)
we show a spherical mirror in which the interior surface is the
re¯ecting surface. This type of curved mirror is called concave.
The line joining the center of curvature C with the center of the
spherical segment is called the optical axis. Consider ®rst a ray
that is parallel to the optical axis. This ray strikes the mirror at
point P1 and is re¯ected back so that the angle of re¯ection
equals the angle of incidence. Our parallel ray is re¯ected back
376
Optics
Figure 17.6. (a) Parallel rays are re¯ected by a concave mirror through
the focal point. (b) A ray that passes through the focal point is re¯ected
parallel to the optical axis.
and crosses the optical axis at point F. This point, which lies
midway between the center of curvature and the mirror, is
called the focal point of the mirror. Any other ray parallel to the
optical axis, like the one striking the mirror at P2 , is re¯ected
back through the focal point.
We can then say that any ray that strikes a concave mirror
parallel to the optical axis and fairly close to it is re¯ected through
the focal point. If we now reverse the direction of the re¯ected
and incident rays in the previous discussion, we can see that an
incident ray that passes through the focal point of a concave
mirror is re¯ected parallel to the optical axis (®gure 17.6(b)).
We can summarize our discussion on concave mirrors in two
simple rules:
. A ray that strikes a concave mirror parallel to the optical axis is
re¯ected through the focal point.
. A ray that passes through the focal point of a concave mirror is
re¯ected parallel to the optical axis.
With these two rules we can see how a curved mirror magni®es or
increases the viewing angle. We can determine the location of the
image of a simple object, such as an arrow, in front of a concave
mirror (®gure 17.7(a)). Since the tail of the arrow coincides with
the optical axis, we only need to ®nd the location of one point
on the object, the tip of the arrow, for instance, as we shall see
shortly. We can draw several rays from this point so that they
re¯ect from the mirror. Our task is simpli®ed a great deal if we
use simple rays that obey our two rules above. Ray 1 is parallel
377
SUPERSTRINGS AND OTHER THINGS
Figure 17.7. (a) Ray diagram to determine the image of an arrow. (b) An
object placed inside the focal point of a concave mirror forms a virtual
image behind the mirror.
to the optical axis. By the ®rst rule, ray 1 is re¯ected through the
focal point F while ray 2 passes through the focal point and is
re¯ected parallel to the optical axis. As we can see in ®gure
17.7(a), these two rays cross at one point. If we were to draw
other rays from the tip of the arrow, they would also cross at
this same point. When we look into the mirror, the rays of light
appear to come from this last point and so this point is the
image of the tip of the arrow. Notice that this point lies below the
optical axis. We could select other points and repeat the procedure
until we reconstruct most of the arrow. If we do this, we would
notice that the images of these points lie above I 0 , closer to the optical axis. Finally, the image of the tail of the arrow would lie right
on the axis. The image of the arrow lies below the optical axis and
is inverted. Notice also that in this case the image is also smaller
than the object. Since our object has a simple symmetry, there is
378
Optics
no need to select other points; knowing the location of the image of
the tip of the arrow allows us to reconstruct the entire image.
We might ask why the image of the arrow in ®gure 17.8(a)
ended up upside down and smaller while the image of our face
in a make-up or shaving mirror is right side up and larger.
Next time you pick up one of these mirrors, place it at arms
Figure 17.8. (a) Convex mirror positioned by a roadside at a blind
corner. (b) Ray diagram for a convex mirror.
379
SUPERSTRINGS AND OTHER THINGS
length rather than up close. Now, your image and the images of
all the other objects in the room and the walls of the room itself
are upside down and in front of the mirror, not behind it. It is
only up close that your face appears larger and right side up.
Why? What is so special about your face? Aside from the fact
that your face is really special, it is the location of the object in
relation to the mirror that determines the size and appearance
of the image. If the object lies between the mirror and the focal
point, the image will be right side up, behind the mirror, and
larger. This is why your face looks larger in a shaving mirror.
They are designed so that the focal point lies slightly beyond
where you would normally place the mirror to look at yourself
in it. The rest of the objects in the room lie beyond the focal
point and thus have smaller, inverted images.
The ray diagram in ®gure 17.7(b) shows the image of an object
placed inside the focal length of a concave mirror. Notice that this
image is not only larger and right side up, as we have just
discussed, but also forms behind the mirror. Images such as
this, in which light rays do not actually pass through the image,
are called virtual images. The name suggests that there might be
other kinds of images, perhaps not as ``ghostly.'' Indeed there
are; the images formed by objects placed beyond the focal point
of a concave mirror, as described above, are real images because
light rays actually pass through them. Real images can be detected
by a photographic ®lm, for example, or formed on a projection
screen placed at the location of the image. Virtual images
cannot. Images formed by plane (¯at) mirrors are also virtual.
A curved mirror with the re¯ecting surface on the outside is
called a convex mirror (®gure 17.8). The two rules outlined above
for concave mirrors also work for convex mirrors. Notice that the
focal point is behind the mirror. Convex mirrors are common in
tight, narrow intersections because they produce a larger ®eld
of view. For this reason, they are use widely as automobile
rear-view mirrors, particularly on the near side of the vehicle.
Refraction of light
As we all have noticed at one time or another, a pool of water
appears to be shallower than it really is and a spoon seems to
380
Optics
Figure 17.9. (a) A spoon in a glass of water seems to bend when viewed
from above. When viewed from the side it appears broken. (b) The rays of
light bend as light crosses the boundary between two transparent media.
bend or even break when we use it to stir a glass of lemonade
(®gure 17.9(a)). The reason for these seemingly strange phenomena is the fact that light travels at different speeds through different transparent media because the electromagnetic waves interact
with the matter of the medium in which they travel. The speed of
light in vacuum is about 300 000 km/s but decreases to about
225 000 km/s in water and 200 000 km/s in glass. The speed of
light in air is about 299 900 km/s, nearly the same as in
vacuum. The oscillating electric ®eld of the electromagnetic
wave sets the electrons of the atoms and molecules that make
up the medium through which light is propagating into vibration.
The vibrating electrons emit electromagnetic radiation of the
same frequency. It is this constant absorption and re-emission
of electromagnetic energy that gives the appearance that light
has slowed down, although it is still traveling between interactions with the electrons at 300 000 km/s as it does in vacuum,
where there are no electrons to interact with.
A ray of light from a point on a spoon under the water travels
along the water in a straight line at 225 000 km/s until it encounters the boundary between water and air (®gure 17.9(b)). In air,
it also travels in a straight line but at a greater speed, and this
381
SUPERSTRINGS AND OTHER THINGS
results in the bending of the light ray at the boundary which our
brains interpret as a bent spoon rather than a bent ray of light.
We can analyze the phenomenon, as we are doing here, and
understand it, but our eyes still see a bent spoon. This bending
of light rays due to the different speeds of light in different
media is called refraction. As is the case with re¯ection, the ray incident upon the boundary between the two media is called incident
ray and the ray entering the new medium is called the refracted ray.
The angle made by the incident ray and the normal to the boundary is called the angle of incidence and the angle formed by the
refracted ray and the normal, the angle of refraction.
How do these changes in the speed of light result in the
bending of light rays? An analogy could help us understand
this phenomenon. Imagine soldiers marching in formation on a
grassy ®eld. When the soldiers come to an adjacent muddy
®eld, they are slowed down by the mud even if they keep in
step, as they have to take shorter steps. If they enter the muddy
®eld at an angle, as shown in ®gure 17.10(a), the columns closer
to the edge of the grassy ®eld enter the mud ®rst so that as
each row comes to the boundary some soldiers in that row will
be in the mud and the rest in the grass. The soldiers marching
in the mud get slightly behind so that the next row, still on the
grass, gets closer to them and the formation swivels. When the
soldiers are all in the muddy ®eld, they will ®nd themselves
marching in a new direction, one closer to the normal to the
boundary and the rows of soldiers will be closer to each other.
For light passing from air into water, we can replace the columns
of soldiers by lines drawn one wavelength apart, called wavefronts
(®gure 17.10(b)). As with the soldiers, the part of each wavefront
that has entered the water is slowed down in relation to the part
that is still in the air so that, as light enters the water, the wavefronts are closer together and move in a direction closer to the
normal. Since the distance between two consecutive wavefronts
is one wavelength, we can see that the wavelength in water is
shorter. The frequency, however, remains the same (in our
analogy, the soldiers keep the same step). Remembering that
the speed of a wave is related to its wavelength and its frequency
through the expression
v f
382
Optics
Figure 17.10. (a) Soldiers marching in formation slow down and change
direction when they enter a muddy ®eld. (b) Light rays slow down as
they pass from air to water and this causes the rays to bend or refract.
we can see that if the speed decreases, the wavelength also
decreases in the same proportion, since the frequency remains
constant. (The atoms absorb and re-emit the light at the same
frequency.)
The path of a light ray through a refracting surface is reversible; that is, if the direction of a light ray is reversed, it will
retrace the original path. If we reverse the direction of the rays
in ®gure 17.10(b), the incident ray becomes the ray in the water
383
SUPERSTRINGS AND OTHER THINGS
Table 17.1. Indices of Refraction
Substance
n
Substance
n
Vacuum
Air
Ice
Water
Acrylic plastic
1.000 00
1.000 29
1.31
1.333
1.51
Quartz
Crown glass
Flint glass
Zirconium
Diamond
1.54
1.59
1.75
1.96
2.42
and the refracted ray is the ray in air. In this case, the angle of
refraction is greater than the angle of incidence. This is the case
of the spoon in the glass of water (®gure 17.9).
The refraction of light was known to Ptolemy, who wrote a
book on optics in which he discussed the relationship between
the incident and refracted rays. It was, however, the Dutch scientist Willebrord van Roijen Snell (1580±1626) who ®rst discovered
the correct relationship between the angle of incidence and the
angle of refraction and for this reason this relationship is
known as Snell's law. Snell's law says that the angle of refraction is
in a constant relationship to the angle of incidence. The constant in
Snell's law depends on the nature of the two media. If the incident
ray is in vacuum or in air (where the speed of light is almost the
same as in vacuum), the constant gives the index of refraction, n, of
the substance: that is, the ratio of the speed of light in vacuum to
that in the substance, or
c
n
v
where c is the speed of light in vacuum, equal to 300 000 km/s,
and v is the speed of light in the particular medium. The speed
of light in water, for example, is 225 000 km/s. The index of
refraction of water is then n 300 000 km=s= 225 000 km=s
1.333. Table 17.1 gives some values of indexes of refraction for
different substances.
Lenses
We are all familiar with the use of lenses as magni®ers in telescopes, microscopes, binoculars and cameras. In a lens, light is
focused by refraction. A lens is a piece of transparent material
in which the two refracting surfaces are curved. In most cases,
384
Optics
Figure 17.11. (a) Parallel rays incident on a convex glass surface converge at one point. (b) Parallel rays incident on a concave glass surface
diverge as if they had originated at a single point.
the surfaces are segments of spheres. When a light ray enters a
transparent medium through a curved surface, the direction of
the refracted ray depends on the orientation of the surface.
Parallel rays incident on a spherical surface are refracted so that
they converge to one point, as seen in ®gure 17.11(a). This type
of surface is called convex. Parallel rays refracted by a concave
surface are refracted so that they diverge and appear as if they
had originated from a single point (®gure 17.11(b)).
An incident ray falling on a lens with two convex surfaces so
that it crosses the ®rst surface at some point other than the center
or the edge of the lens, is refracted toward the normal to the surface at that point. The ray falls on the second spherical surface,
passing from glass to air. Since glass has an index of refraction
greater than that of air, the ray is refracted at this second surface
away from the normal (®gure 17.12). Two rays emanating from
the same point on an object placed near a lens fall on the lens at
different points and are refracted at slightly different angles.
These rays will converge to one point on the other side of the
lens. Lenses with spherical surfaces will not, in general, bring
other rays emanating from the same point on the object to exactly
the same point and the images formed by these lenses are not
sharp, a phenomenon known as spherical aberration that designers
of optical instruments must avoid. However, if the rays passing
through the lens do not make angles greater than 108 with the
lens axis, which is the central line perpendicular to the lens,
385
SUPERSTRINGS AND OTHER THINGS
Figure 17.12. A ray falling on a convex surface of a lens is refracted
toward the normal. At the second surface, the ray is refracted away
from the normal.
sharp images can be obtained with spherical surfaces. Our discussion will be limited to these kinds of rays. Our discussion
will also be limited to thin lenses, in which the thickness of the
lens is very small compared with the distances to the objects
and images from the lens.
All rays parallel to the lens axis are refracted so that they converge to one point on the axis (®gure 17.13). This point is called
the focal point of the lens and the distance from the center of the
lens to the focal point is known as the focal length. Camera
lenses are identi®ed by their focal lengths. A 50 mm lens is one
with a focal length of 50 mm and is usually the normal lens for a
35 mm camera (one that uses ®lm 35 mm wide). A 28 mm lens
or a 35 mm lens is considered a ``wide angle'' lens, and a
135 mm a medium long focus. The focal length of a particular
lens depends on the index of refraction of the lens material and
Figure 17.13.
of the lens.
All rays parallel to the lens axis converge to the focal point
386
Optics
on the curvatures of the two surfaces. As most of us have realized
at one time or another, it does not matter which side of a simple
magni®er we use to look at an object; the image we see is magni®ed the same. We conclude that there must be a focal point on
both sides of a lens and, since the image formed is the same
regardless of which side faces the object, the two focal points
must be equidistant from the lens, even if the curvatures of the
two sides are different.
From our previous discussion, we can deduce simple rules
that will aid us in determining the location of the image formed
by a lens.
. A ray that strikes a lens parallel to the lens axis is refracted to pass
through the focal point.
. A ray that passes through the focal point is refracted to emerge parallel
to the lens axis.
. A ray that strikes the lens at the center is undeviated.
A lens that converges rays to a single point is called a converging lens. A lens that diverges rays so that they appear to come
from a single point on the side of the incident rays is called a diverging lens (®gure 17.14). Our three rules apply to both types of
Figure 17.14. (a) A converging lens bends rays to a single point, while a
diverging lens (b) bends rays such that they appear to diverge from a
single point on the side of the incident rays.
387
SUPERSTRINGS AND OTHER THINGS
Figure 17.15. Ray diagram used in the location of the image formed by
a converging lens.
lenses. Optometrists refer to converging lenses as positive lenses
and to diverging lenses as negative lenses.
Figure 17.15 shows the formation of the image of an arrow
placed on the axis of a converging lens. Again, as with the
mirror, we select several of the rays coming from the tip of the
arrow. We are interested in particular in the three rays outlined
above: ray 1, parallel to the axis; ray 2, through the front focus;
and ray 3, passing through the lens at the center (called the principal ray). These three rays converge to a point after being
refracted by the lens and then diverge, so that they appear to a
viewer to have originated at that point.
Lenses are used in telescopes, binoculars and other optical
instruments to magnify images. The size of the image formed by
a particular lens may be larger or smaller than and even equal to
the size of the object, depending on the location of the object in relation to the lens. The magni®cation, M, is the ratio of the size of the image
to the size of the object. The magni®cation can be found by comparing
the triangle formed by the object, the lens axis, and the unde¯ected
ray passing through the center of the lens, with the triangle formed
by this ray, the image, and the lens axis (®gure 17.16). These two
Figure 17.16. The triangles O0 OV and I0 IV are similar triangles. The
ratio I 0 I=O0 O must equal the ratio OV=IV. Thus, image size/object
size image distance/object distance.
388
Optics
The frontiers of physics: Gradient-index lenses
A lens with a spherical surface does not bring other rays from
the same point on the object into exactly the same point, as we
saw earlier in the chapter. This ``spherical aberration'' produces fuzzy images. Lens designers usually correct for this
and other optical aberrations by grinding the lens into an
aspherical shape or by adding more lenses to their design.
Both solutions have their own problems. Adding more
lenses increases the cost, weight, and size of the design. It
also makes glare more dif®cult to control. Aspherical lenses
are small and light but they are much more expensive to
produce.
The light rays passing through a spherical lens can be
brought into focus at exactly the same point regardless of
where they cross the lens if the index of refraction is variable.
This design would more closely resemble the way light
crosses the lens in human and insect eyes. Although lens
designers have attempted for many years to produce such a
lens, they found themselves faced with calculations that
required sophisticated computer programs.
Today's fast supercomputers have made the task possible. Several companies are now producing gradient-index
lenses (GRINs) in which the index of refraction changes
radially, from center to edge, or axially, from front to back.
A GRIN can focus light with an optically ¯at surface or a
simple spherical shape.
Computers can make the calculations feasible, but the
process to produce the actual lenses is still complicated. A
common technique is the ion-exchange process in which an
ion of a heavy element in the glass is replaced by an ion of a
lighter element. The areas where the lighter element is located
have a lower index of refraction. With this process, it can take
more than a month to have enough ions replaced in the glass.
Other companies are working on similar but faster processes.
GRIN lenses with a diameter of one millimeter are
currently being used in photocopiers, facsimiles, and laser
printers. These lenses, shaped like tiny rods and with focal
lengths of 15 to 30 millimeters, are arrayed in bundles of
389
SUPERSTRINGS AND OTHER THINGS
several hundred and are used as part of the scanning element
in these devices.
Scientists in France are developing GRIN lenses by creating links between the components of hydrophilic polymers.
These links are produced by shining light on the components.
The longer the light shines, the greater the number of links
and the higher the density of the polymer. Since the index
of refraction of the polymer is determined by its density,
the researchers can create a gradient index lens made of
this polymer, which is used to make contact lenses. Although
the technique is not available commercially yet, the scientists
hope that this will be feasible soon.
triangles are similar triangles and we can write a relationship
between the sides forming the right angles that gives us the ratio
of image size to object size:
M
image size image distance
:
object size object distance
An image that is ®ve times the distance from the lens as the object
will be ®ve times as large as the object.
Total internal re¯ection
When light passes from a medium with a high index of refraction,
such as water, to one with a low index of refraction, like air, the
angle of refraction is larger than the angle of incidence. Consider
the rays coming from a point on a submerged object (®gure
17.17). A person outside the water would be able to see the rays
from the four ®rst locations shown. However, as the angle of incidence is increased, the rays are refracted more and more until, at
a certain critical angle, the refracted ray merely skims along the
surface. For larger angles of incidence, the ray cannot escape
and the light is completely re¯ected, remaining in the water.
This phenomenon is called total internal re¯ection.
A prism can re¯ect light through a right angle or even back
parallel to its initial direction (®gure 17.18). An ordinary mirror
390
Optics
Figure 17.17. As the angle of incidence increases, the angle of refraction
also increases until, at the critical angle, the refracted ray just skims the
surface. At greater angles, the light remains inside the water. This is
called total internal re¯ection.
can also re¯ect light back to its original direction or at right angles
to the initial ray. A mirror, however, even a highly polished one,
does not re¯ect all the light incident upon it. A newly-silvered
mirror might re¯ect up to 99% of incident light but, unless it is
Figure 17.18. A prism re¯ects light at right angles to the original ray
and can even return it parallel to the original direction.
391
SUPERSTRINGS AND OTHER THINGS
given a special protective coating, after a few days its re¯ectivity
drops to about 93% due to oxidation. Total internal re¯ection
such as with a prism is, as it says, total; no loss of light occurs.
For this reason, designers of optical equipment such as binoculars
use prisms where possible, rather than mirrors.
Fiber optics
The phenomenon of total internal re¯ection, with no loss of light,
is the basis for the important area called ®ber optics. In a thin transparent ®ber, light is ``piped'' through a series of total internal
re¯ections. If the ®ber is thin enough, the angle of incidence is
always greater than the critical angle and the light is transmitted
even if the ®ber is bent in any direction as long as sharp corners or
kinks are avoided (®gure 17.19). To prevent damage and light
losses from abrasion, the ®ber is clad with transparent material
having a low refractive index.
Optical ®bers can be used to transmit images of objects that
are otherwise inaccessible, like the interior of the human body.
However, the light rays from different parts of an object would
get scrambled by the multiple re¯ections taking place in a
large-diameter ®ber. For this reason, individual ®ne ®bers, each
transmitting information from a very small part of an object,
are collected into a bundle with a lens at the end to form an
image on the ®ber bundle. The ®ber bundle is surrounded by a
jacket of ®bers that transmit light from a light source to provide
Figure 17.19. A light beam can be transmitted through a transparent
®ber by total internal re¯ection, even if the ®ber is bent.
392
Optics
illumination of the object. At the receiving end, the image can be
viewed directly by means of a lens system or displayed on a television monitor. A colon ®berscope is an example of an instrument
that uses ®ber optics to examine the colon without the need for
surgery. Medical instruments that use ®ber optics allow
physicians to examine knee joints, the bladder, the vocal folds,
the stomach, a fetus in the uterus, and even heart valves and
major blood vessels.
Fibers that work by internal re¯ection are called multimode
®bers. If the diameter of the ®ber is reduced to a few micrometers
the mode of propagation changes. The light is no longer re¯ected
internally, but travels down the ®ber in what is in effect a single
ray. This type of ®ber is called a single-mode ®ber. By using infrared
laser light (which has a single frequency) and special glass, the
light can travel 100 km or more before a repeater ampli®er is
needed, and because of its very high frequency, a single ®ber
can carry thousands of telephone conversations, several television
signals, and computer data, simultaneously.
Optical instruments
Optical instruments, from a simple magni®er to the space telescope, enhance our sense of vision and allow us to study nature
from a different perspective. Since the sixteenth century when
the telescope and the microscope were invented (reportedly in
the Netherlands), we have been able to look farther into the universe in both directions: into the very small world of the particles
that are the stuff of matter, and out toward the far reaches of the
universe, in the realm of the galaxies. In this last section of this
chapter we will look at some examples of optical instruments.
The camera
Although modern cameras can be fairly complicated instruments,
the basic optical principles behind their construction are fairly
simple.
A pinhole camera consists of a closed box with a single pinhole
through which light can enter, and uses no lenses or mirrors.
Light rays from an object in front of the pinhole enter the box,
393
SUPERSTRINGS AND OTHER THINGS
Figure 17.20. A pinhole camera produces an image on the wall opposite
the hole by restricting the rays from each point on the object into the
light-tight box.
producing an inverted image on the wall opposite the hole (®gure
17.20). Although it is possible to produce good photographs with
this simple device, there is no provision to control the amount of
light that strikes the ®lm other than by enlarging the hole, and this
produces a blurred image because every point on the object forms
an image that is a disk the size of the pinhole. To let enough light
through the hole so that the ®lm is fully exposed, long exposure
times are required. A pinhole camera is unsuitable for moving
objects.
The simplest modern camera uses a single converging lens in
front of a larger hole; this allows a great deal more light into the
box (®gure 17.21). Light rays from a distant object enter the
camera parallel to the lens axis and are focused at the focal
point of the lens. Rays from a nearby object are focused farther
back. Moving the lens toward or away from the ®lm plane
allows for focusing of objects at different distances from the
camera. The amount of light energy that the ®lm receives can be
controlled by opening or closing the aperture of the hole with an
Figure 17.21. A simple camera consists of a converging lens that can be
moved closer to or away from the ®lm plane, a variable aperture and a
shutter.
394
Optics
iris diaphragm, a ring of metal leaves, and by changing the length of
time light is allowed to enter the camera with a shutter, a mechanical device which may be a curtain in front of the ®lm (a focalplane shutter) or a set of hinged leaves in between the lens element
(an intra-lens shutter). Exposure durations range from around
1=8000 s to several minutes.
Camera manufacturers calibrate the diaphragm in units called
f-stops, de®ned as the ratio of the focal length of the lens f , to the
diameter of the aperture (D) allowed by the diaphragm, or
f -stop
f
:
D
Since the amount of light reaching the lens is proportional to the
area of the aperture and thus to the square of the diameter D, changing the exposure
by a factor of 2 corresponds to an increase by a
p
factor of 2 in the diameter. Camera diaphragms
have scales
p
marked with f-stops related by a factor of 2, i.e.
f =1:4; f =2; f =2:8; f =4; f =5:6; f =8; f =11; f =16; f =22:
The diameter of an f/1.4 aperture, for example, is twice the diameter of an f/2.8 aperture and therefore lets in four times as
much light. If a given lens has a maximum aperture of f/2.8,
you could achieve the same results as with an f/1.4 aperture by
increasing the exposure time by a factor of 4, so that if the
camera meter calls for 1/500 second at f/1.4 the exposure time
for f/2.8 should be four times 1/500 s or 1/125 s.This relationship
is called the law of reciprocity. Stated formally, it says that
exposure intensity duration:
The telescope
The telescope was invented at the turn of the seventeenth century,
probably by a lens maker in Holland. In 1609, the Dutch patent
of®ce turned down a patent application for a telescope from
Hans Lipperschey, a Dutch lens maker, because such an instrument was already common knowledge. Word of that invention
had reached Galileo that same year. Within six months, Galileo
had constructed a telescope of his own with a magnifying
power of 32, which he immediately turned to the heavens
395
SUPERSTRINGS AND OTHER THINGS
Figure 17.22. Galileo demonstrates his telescope before the Venetian
Senate in 1609. (Alinari/Art Resource, NY.)
(®gure 17.22). He discovered that the moon had mountains, and
that the sun had spots. Studying the evolution of the sunspots,
Galileo was able to show that the sun rotated on its axis, completing one rotation in twenty-seven days. Galileo also discovered
that the planet Jupiter had four large moons and worked out
their periods of revolution around the planet. These moons, Io,
Europa, Ganymede and Callisto, are known today as the Galilean
satellites of Jupiter. With his telescopes, Galileo also discovered
that Venus had phases like the moon, evidence that Venus
revolved around the sun, shining by re¯ected sunlight. Galileo
published his discoveries in Sidereus Nuncius (``The Starry
Messenger'').
A simple telescope such as Galileo's original telescope has
two lenses, the eyepiece and the objective. In this type of telescope,
known as a Galilean telescope, the eyepiece is a diverging lens
and the objective is a converging lens (®gure 17.23(a)). A modi®cation of this design, introduced by Johannes Kepler shortly after
he obtained one of Galileo's own designs, uses a converging lens
as eyepiece (®gure 17.23(b)). These telescopes, in which lenses are
used to form an image, are known as refractors. Light rays from a
distant object reach the objective essentially parallel to each other.
396
Optics
Figure 17.23. (a) Galilean telescope. Parallel rays from the object are
focused at the focal point of the objective. These rays are intercepted
by the eyepiece where they are refracted parallel. (b) In a Keplerian telescope, the eyepiece is a converging lens and the image is inverted.
These rays are focused at the focal point of the objective and
then diverge, appearing to a viewer to originate there. These
diverging rays are intercepted by the eyepiece. The eyepiece is
placed so that its focal point coincides with the focal point of
the objective; the rays diverging from this point are refracted
by the eyepiece parallel to each other. Since eyepieces are small
lenses with diameters of about 1 cm or less, the viewer looking
into the eyepiece sees parallel rays coming from the image. In
the Galilean telescope, the diverging lens used as eyepiece is
placed between the objective and its focal point (®gure
17.23(a)), whereas in Kepler's design, the converging lens is
placed at the other side (®gure 17.23(b)). Kepler's design produces an inverted image.
In 1663, James Gregory, a Scottish mathematician and astronomer, published the design of a telescope with a concave mirror
as objective. His attempt to have it built ended in failure mainly
because of the dif®culty in making the mirror. In 1668, Isaac
Newton designed and built a similar telescope with a concave
mirror 2.5-cm in diameter as objective (®gure 17.24). Although
the refracting telescope is what most people have in mind when
they hear the word, all of the large research telescopes in the
world are re¯ectors, as these telescopes are called. Light rays
from the distant object arrive at the mirror parallel to each
other, where they are re¯ected back and focused at the focal
point of the mirror. As with refractors, the eyepiece is placed so
397
SUPERSTRINGS AND OTHER THINGS
Figure 17.24. Newton's ®rst telescope. (By permission of the President
and Council of the Royal Society.)
Figure 17.25. Re¯ector telescope arrangements: (a) prime focus, (b)
Newtonian focus, (c) Cassegrain focus.
398
Optics
that its focal point coincides with the focal point of the objective
(®gure 17.25(a)). Notice that the eyepiece blocks some of the
incoming light. For very large telescopes, like the 200-inch Hale
telescope at Mount Palomar (®gure 17.26), this blocking does
not present much of a problem. This arrangement of eyepiece
and objective is called the prime focus. For smaller re¯ectors,
several other arrangements have been introduced since its invention. Newton himself used the arrangement shown in ®gure
17.25(b), which is properly called Newtonian. Light rays re¯ected
from the mirror are intercepted by a small plane mirror placed
diagonally in front of the objective. The light rays are diverted
to the eyepiece on the side. This arrangement is called the Newtonian focus. A popular design both for research telescopes and
amateur use is the Cassegrain system, where the light rays are
intercepted by a small convex mirror that sends the rays back
to the eyepiece through a small circular hole made in the objective
(®gure 17.25(c)). This increases the effective focal length of the
system.
The human eye
The human eye is a marvelous optical instrument, the result of a
remarkable evolutionary process. The eye is sensitive to wavelengths from 350 nm to 750 nm, which correspond to the most
intense wavelengths in sunlight. This is clearly not a coincidence
but the response of evolution to the existing conditions. Had the
range of intensities of sunlight been different, the eye would no
doubt have evolved to be sensitive to a different range of wavelengths.
The eye is a nearly spherical gelatinous mass about 3 centimeters in diameter, slightly ¯attened from front to back, and
surrounded by a tough membrane, the sclera, where the muscles
that control the movements of the eye are attached. Figure 17.27
is a simpli®ed diagram of the human eye. Light enters the eye
through the cornea, a transparent membrane that acts as a converging lens with a focal length of about 2.4 cm. Light rays pass
through the aqueous humor, a transparent liquid that ®lls the
space between the cornea and the lens, another converging lens
with a variable focal length that changes from about 4 cm to
399
SUPERSTRINGS AND OTHER THINGS
Figure 17.26. (a) Detail of the 200-inch Hale telescope at Mount
Palomar. (b) Prime focus of the Hale. (Courtesy Palomar Observatory.)
400
Optics
Figure 17.27. The human eye.
about 5 cm. The lens, a jellylike substance hard at the center and
softer toward the edges, contracts and bulges under the action
of the ciliary muscles that surround it and keep it under tension.
When the ciliary muscles relax, the lens assumes a thicker
shape, decreasing its radius of curvature which in turn decreases
the focal length. Nearby objects are then brought into focus on the
retina. When the eye looks at a distant object, the ciliary muscles
tension and the lens takes on a ¯atter shape, increasing the
radius of curvature and consequently the focal length, and bringing the distant object to a focus on the retina. This process is called
accommodation.
On their way to the lens, light rays pass through the pupil.
The size of the pupil is controlled by the iris, a muscular ring
behind the cornea that acts like the iris diaphragm in a camera
lens, adjusting the amount of light that enters the eye. The iris
is the colored part of the eye.
The inner chamber behind the lens is ®lled with a thin
gelatinous substance called the vitreous body. This inner surface
of the eyeball is covered with the millions of photoreceptor or
light-sensitive cells that form the retina. There are two kinds of
photoreceptor cells, rods and cones. The cones are the colorsensing cells of the retina and the rods, although not sensitive
to color, are about 1000 times more sensitive to light and
are therefore important for low-light vision. There are three
kinds of cone cells, each responding differently to light from a
colored object due to the presence of light-absorbing proteins
401
SUPERSTRINGS AND OTHER THINGS
sensitive to wavelengths in the red, green or blue part of the
spectrum.
Although the eye is capable of detecting light rays with incident angles up to 2108, vision is sharpest in a small central region
about 0.25 mm in diameter called the fovea, which contains only
cones, with an average separation of about 1 mm. As we read a
book or follow a moving object, the eye muscles move the eyeball
so that light rays from the object fall on the fovea. We can see the
rest of the object and our surroundings in front of us, but only
with somewhat reduced resolution.
A normal eye can bring into focus objects at distances
ranging from a few centimeters to in®nity. The closest the eye
can focus is called the near point, and this distance depends on
the elasticity of the lens. This elasticity diminishes with age.
Normally, at age 10 the near point is about 7 cm and this increases
to about 10 cm at age 20, 15 cm at age 30, 25 cm at age 40, 50 cm at
age 50 and over 2 m at age 60. By the late forties or early ®fties, a
person's arms are usually shorter than his near point and a visit to
the optician is recommended.
People who are nearsighted are unable to see distant objects
clearly. This condition, called myopia, is due either to an eye
that is longer than normal or to a cornea that is curved too
much. Rays from a distant object are focused in front of the
retina. Nearby objects are focused on the retina. Nearsightedness
is corrected by a diverging lens (®gure 17.28(a)).
Figure 17.28. (a) Nearsightedness can be corrected with a diverging
lens. (b) Farsightedness is corrected with a converging lens.
402
Optics
The frontiers of physics: Arti®cial vision
Researchers from North Carolina State University, the University of North Carolina at Chapel Hill, and Johns Hopkins
University have been working on the development of an
implantable arti®cial retina component chip that might
restore sight to people suffering from retinal disorders. The
chip, which is only 2 millimeters square and less than 0.02
millimeter thick, is to be implanted on the retinal surface
inside the eye cavity.
The rods and cones in the eyes of people suffering from
retinosis pigmentosa or macular degeneration are defective,
but the ganglion cells that line the retina are relatively
intact. The new retinal chip generates electric currents that
stimulate these cells which then transmit the signal to the
brain, in theory enabling the patient to see. The chip is
powered by an external laser, which is in turn powered by
a tiny battery pack. The laser transmits the images captured
by a miniature video camera to the photosensors in the
chip. The laser-battery pack-video camera assembly can be
mounted on regular eyeglasses.
403
SUPERSTRINGS AND OTHER THINGS
In the farsighted person, the eye is too short or the cornea
insuf®ciently curved. People with this condition, called hyperopia,
can see distant objects clearly but nearby objects appear blurred
because the rays are focused behind the retina. This defect can
be corrected with a converging lens (®gure 17.28(b)).
404
18
THE NATURE OF
LIGHT
The wave nature of light
As we learned in chapter 7, light shows both particle and wave
behavior. We have arrived at this conclusion through the
attempts of twentieth century physicists to understand the
nature of matter. Some of the Greek philosophers believed that
light consisted of particles that traveled in straight lines at high
speeds and stimulated the sense of vision as they entered the
eye. At the end of the ®fteenth century, Leonardo da Vinci
speculated that light was a wave because of the similarity
between the re¯ection of light and the echo of a sound. In the
seventeenth century, the Dutch physicist Christiaan Huyghens
also felt that light was a wave. However, his contemporary
Isaac Newton thought that light was composed of particles.
Newton's ``corpuscular theory'' prevailed for about two hundred
years, although by the eighteenth century many optical effects
had been explained in terms of the properties of waves. In the
early nineteenth century, Thomas Young in England and
Augustin Jean Fresnel in France demonstrated in a series of
landmark experiments that light showed interference and
diffraction, phenomena characteristic of waves. In Scotland,
Maxwell theorized that light was the propagation of oscillating
electric and magnetic ®elds through space: an electromagnetic
wave.
Twentieth century physicists have shown that both views are
valid; that light has characteristics of particles and of waves. In
this chapter, we will study the fundamentals of the wave
theory of light.
405
SUPERSTRINGS AND OTHER THINGS
The speed of light
An electric charge that is changing gives rise to a changing magnetic ®eld, as we learned in chapter 14. This is Maxwell's fourth
equation. Faraday's Law, which became Maxwell's third equation, tells us that this changing magnetic ®eld in turn creates a
changing electric ®eld. These changing magnetic and electric
®elds are inseparable: the changing magnetic ®eld sustains the
electric ®eld, and the changing electric ®eld sustains the magnetic
®eld. Maxwell realized that these changing ®elds would sustain
one another even in regions where there are no electric charges
to accelerate, as in free space. These mutually sustaining ®elds,
he predicted, propagate through space as an electromagnetic
wave.
Maxwell proceeded to calculate the speed at which these
electromagnetic waves travel through space. He showed that
this speed was equal to the ratio of the electric and magnetic
®eld strengths at any point in space:
c
E
:
B
From AmpeÁre's law, he was able to show that this speed is equal
to the square root of the ratio of the electric and magnetic force
constants:
s
kE
c
:
kM
When Maxwell substituted the then known values of these two
constants he found that the speed of propagation of electromagnetic waves was the same as the experimentally determined
values of the speed of light. He wrote,
The velocity of transverse undulations in our hypothetical medium,
calculated from the electromagnetic experiments of M M Kohlrauch
and Weber, agrees so exactly with the velocity of light calculated
from the optical experiments of M Fizeau, that we can scarcely
avoid the inference that light consists in the transverse undulations
of the same medium . . .
406
The Nature of Light
Inserting up-to-date values of kE and kM into Maxwell's
expression for the speed of electromagnetic waves, we ®nd
s
9 109 Nm2 =C2
3 108 m=s:
c
1 10ÿ7 Tm=A
This is the speed of light in empty space, one of the fundamental
constants in nature. With the values known at the time for the two
constants, Maxwell obtained 2:88 108 m/s.
In his Two New Sciences, Galileo describes an experiment by
which two persons on distant hills ¯ashing lanterns can measure
the speed of light. Although he concluded that the speed of light,
contrary to what we might think from everyday experiences, is not
in®nite, he was unable to obtain a value for it. His contemporary
Descartes asserted that the speed of light must be in®nite. The
Danish astronomer Olaus RoÈmer experimentally showed that
the speed of light is ®nite in 1676 by carefully measuring the
times at which the Galilean satellites of Jupiter emerged from
the shadow of the planet. The French astronomer Cassini had
accurately measured the times of revolution of Jupiter's four
known satellites so that the precise moment when a satellite was
to be eclipsed by Jupiter could be calculated. Before the Academy
of Sciences in Paris, RoÈmer announced that one of Jupiter's satellites, which he had calculated should be eclipsed by Jupiter on
November 9, 1676 at 5:25:45, was going to be exactly ten minutes
late. Careful measurements by the skeptical astronomers of the
Royal Observatory con®rmed that the eclipse of the satellite had
occurred at 5:35:45, in accordance with RoÈmer's prediction. A
few weeks later RoÈmer explained to the Academy that when the
earth was closer to Jupiter in its orbit, the satellites went behind
Jupiter earlier than when the earth was farther away from Jupiter.
He correctly deduced that the reason for this discrepancy was
that the speed of light was ®nite. When the earth was farthest
away from Jupiter, the eclipses were delayed because it took
light 22 minutes to cross the earth's orbit (®gure 18.1). RoÈmer's
friend, Christiaan Huyghens, used his data and his estimate of
the diameter of the earth's orbit to obtain the ®rst calculation of
the speed of light. He obtained a value (in modern units) of
227,000 km/s, about 24% lower than modern values. The main
reason for the error was that RoÈmer's measurement of 22 minutes
407
SUPERSTRINGS AND OTHER THINGS
Figure 18.1. RoÈmer's method to determine the speed of light. The
Galilean satellites of Jupiter take longer to move behind the planet
when the earth is moving away from Jupiter because the light re¯ected
by the satellite takes longer to reach the earth.
for light to cross the earth's orbit is too large; it is actually 16
minutes approximately.
In 1729, the English astronomer James Bradley, during an
attempt to detect stellar parallax, discovered the phenomenon
of stellar aberration, which allowed him to obtain a second
method of estimating the speed of light. At the time of publication
of Newton's Principia, the motion of the Earth was generally
accepted but had not been experimentally measured. For about
150 years after the Principia, a number of scientists tried to
detect stellar parallaxes with the aid of telescopes. Stellar parallax, as we might recall from chapter 6, is the apparent shift in
the position of a star due to the revolution of the Earth around
the Sun. With this purpose in mind, Bradley mounted a vertical
telescope in his chimney and measured the positions of the
stars at different times of the year. Bradley did detect a tiny displacement through the year of 20.5 seconds of arc in either direction for every star. Since stars are at different distances from the
Earth, their parallaxes should all be different. Therefore, this displacement could not be due to stellar parallax. What was it, then?
For about a year Bradley struggled with the problem. One day in
1728, while riding a boat on the Thames River, he noticed that the
wind vane on the mast shifted direction whenever the boat put
408
The Nature of Light
about. He realized immediately that the apparent shift in the
position of every star by 20.5 seconds of arc was due to the velocity of the Earth in its orbital motion about the Sun. This is the
same phenomenon that makes us tilt our umbrellas at an angle
when we walk in the rain, even if the rain falls vertically. To
observe starlight from the moving Earth, we must angle the telescope very slightly in the direction in which the Earth is moving
which makes the star appear in a slightly different position at
different times of the year (®gure 18.2(a)). This phenomenon is
known as stellar aberration. Bradley realized that he could determine the speed of light from the angle of de¯ection and the
velocity of the Earth in its orbit (®gure 18.2(b)). He obtained a
value of 304 000 km/s.
Figure 18.2. (a) The apparent shift in the position of a star as a result of
the earth's orbital motion is known a stellar aberration. (b) By measuring
the angle at which a telescope must be tilted the speed of light can be
computed.
409
SUPERSTRINGS AND OTHER THINGS
Figure 18.3.
Fizeau's experiment to measure the speed of light.
One hundred and twenty years later, in 1849, the French
physicist Armand Hippolyte Louis Fizeau re®ned Galileo's
method of ¯ashing lights to measure the speed of light. Galileo's
method of ¯ashing lights from two adjacent hills was unsuccessful because, as we know today, the time it takes light to travel
between the hills is very much shorter than the human re¯ex
time to open and close the shutters on the lanterns. Fizeau
decided to use a rapidly turning toothed wheel on one hilltop
in Paris and a mirror on a hill 5.2 miles away. A source of light
behind the wheel sent a beam of light between two adjacent
teeth of the wheel that was re¯ected back along the same path
by the mirror (®gure 18.3). If the wheel spun fast enough, the
re¯ected light would pass through the next gap between the
teeth. Knowing the angular frequency of rotation of the wheel
in this case provided the time required for the light beam to
travel the distance of 10.4 miles. Fizeau obtained a value for the
speed of light of 313 300 km/s.
Shortly after this, another French scientist, Jean Bernard LeÂon
Foucault, improved upon his friend Fizeau's method by replacing
the toothed wheel by a rotating mirror. He used his new method
to measure the speed of light in water and showed that it was
less than in air. Foucault presented these results as his doctoral
thesis. The American physicist A A Michelson, between 1878
and 1930, used a similar technique to measure the speed of
light with great accuracy. In 1882, he reported a value of
299 853 km/s which remained the best available until his 1926
value of 299 796 4 km/s, which in turn remained unchallenged
until 1950.
410
The Nature of Light
The electromagnetic spectrum
Maxwell made two predictions from his theory. The ®rst prediction was that there should be electromagnetic waves of many
different frequencies all propagating through space at the speed
of light. The second prediction was that electromagnetic waves
exert a pressure on any surface that re¯ects or absorbs them.
Unfortunately, Maxwell did not live to see his predictions
veri®ed, for he died of cancer at the early age of 48.
Eight years after Maxwell's death, the German physicist
Heinrich Hertz used a spark-gap apparatus in which electric
charge was made to oscillate back and forth, generating electromagnetic waves for the ®rst time, thus con®rming the ®rst of
Maxwell's predictions. Hertz studied physics at the University
of Berlin, where he obtained his PhD in 1880 at the age of 23. In
1883 Hertz devoted himself to the study of Maxwell's theory of
electromagnetism and two years later started his landmark
experiments. He ®rst made an induction coil that terminated at
both ends in small metal balls separated by a small gap (®gure
18.4). The current in the primary coil A, which was connected
to a battery, could be started and stopped by means of a switch.
Opening and closing the switch produced a rapidly changing
magnetic ®eld in the iron core which, according to Faraday's
law, induces a changing current in coil B. If the voltage was
high enough, a spark would jump between the two balls. When
an adjacent wire was bent so that its ends were also separated
by a small gap, a spark would also jump between its two ends.
Figure 18.4. Hertz's experiment to produce and detect electromagnetic
waves.
411
SUPERSTRINGS AND OTHER THINGS
Hertz realized that each spark was actually a series of many
sparks jumping back and forth between the two ends of the wire,
setting changing electric and magnetic ®elds in the gap which,
according to Maxwell's theory, propagated through space as an
electromagnetic wave. When this wave reached the gap in the
other wire, the electric ®eld oscillating there produced sparks
jumping back and forth as in the induction coil and this was
seen as a single large spark. This second gap was a detector of
electromagnetic waves.
By 1888 Hertz had measured the speed of the electromagnetic waves that he had generated with his spark-gap
apparatus, obtaining a value equal to the speed of light, just as
Maxwell had predicted.
Maxwell's second prediction, that electromagnetic waves
exert pressure on any surface, was con®rmed in 1899 by the Russian physicist P N Lebedev using very light mirrors in a vacuum.
The electromagnetic waves generated by Hertz had a wavelength of about 1 m and were what we now call radio waves. The
wavelengths of visible light had been measured during the ®rst
years of the nineteenth century, before the development of
Maxwell's theory. Electromagnetic waves cover a wide range of
wavelengths. The full range of the electromagnetic spectrum is
shown in ®gure 18.5.
Figure 18.5.
The electromagnetic spectrum.
412
The Nature of Light
Color
Electromagnetic waves with wavelengths from about 4 10ÿ7 m
to about 7 10ÿ7 m constitute the visible part of the electromagnetic spectrum. We classify the different regions of the visible
part of the spectrum by the names of colors. Our perception of
color, however, is different from physical color; that is, from the
in®nite possible number of different electromagnetic waves
with frequencies between 400 nm and 700 nm. These limits, by
the way, are somewhat vague. The limits of the visible part of
the spectrum are not well-de®ned, since the sensitivity of the
eye at the two ends does not stop abruptly. At 430 nm and at
690 nm, for example, the eye sensitivity has fallen to about 1%
of its maximum value. If the light intensity is high enough we
can see at wavelengths beyond these limits.
The present understanding of physical color stems from
the beautiful experiments of Isaac Newton. It was already
known in Newton's time that a beam of light passed through
a prism produced a splash of the colors of the rainbow; violet,
blue, green, yellow, orange and red. This phenomenon was
commonplace even at the time of Aristotle. It was Newton,
however, who provided the correct explanation for the phenomenon. In his now famous letter to Henry Oldenburg, the ®rst
secretary of the Royal Society, of February 6, 1672, Newton
wrote:
. . . in the beginning of the Year 1666 (at which time I applied myself
to the grinding of optic glasses of other ®gures than spherical) I procured me a Triangular glass-Prisme, to try therewith the celebrated
Phaenomena of Colours. And in order theroto having darkened
my chamber, and made a small hole in my window-shuts, to let in
a convenient quantity of the Suns light, I placed my Prisme at its
entrance, that it might be thereby refracted to the opposite wall. It
was at ®rst very pleasing divertisement, to view the vivid and intense
colours produced thereby; but after a while applying my self to consider them more circumspectly, I became surprised to see them in
an oblong form; which according to the received laws of Refraction,
I expected should have been circular.
And I saw . . . that the light, tending to [one] end of the Image,
did suffer a Refraction considerably greater than the light tending to
the other. And so the true cause of the length of that Image was
detected to be no other, then that Light consists of Rays differently
413
SUPERSTRINGS AND OTHER THINGS
refrangible, which, according to their degrees of refrangibility,
transmitted towards divers[e] parts of the wall.
This letter was printed in the Philosophical Transactions of the
Royal Society for February 19, 1671±72, and became Newton's
®rst published scienti®c paper. Before Newton provided the
``true cause of the length of the image'' and of the splash of
colors produced by the prism, the accepted idea was that white
light is darkened more at the thick end of the prism, so it becomes
blue; is darkened less where the glass is not as thick, so it becomes
green; and ®nally is darkened the least where it is closest to the
thin end of the prism, so it becomes red. But this ``explanation''
did not explain what Newton immediately noticed, that the
beam coming from a circular hole and thus having a circular
cross section, would produce an oblong beam. Although this
phenomenon had most certainly been noticed before Newton, it
took the mind of the genius to stop and ask why this was so;
the mind of a young man of twenty-three. Seeking the answer,
Newton changed the size of the hole, the location of the prism,
and the place where the beam hit the prism. The spectrum ± a
name he introduced for such a pattern of colors ± did not change.
To test his idea that light was not modi®ed by passing it
through a prism, that it is physically separated into the different
colors, Newton placed a prism near the hole in a shutter of his
darkened room. Placing a second prism a few yards from the
®rst, he noticed that blue light, on passing through the second
prism, was refracted more than red light, as was the case in the
®rst prism. But, more important, these colors were not affected
by the second prism: ``the purely Red rays refracted by the
second Prisme made no other colours but Red & the purely
blew ones no other colours but blew ones.''
The crucial experiment that would leave no doubt in Newton's mind that his ideas were correct, came a few years later.
In his letter to Oldenburg he wrote:
The gradual removal of these suspicions at length led me to the
Experimentum Crucis, which was this: I took two boards [BC, DE
in ®gure 18.6], and placed one of them close behind the Prism at the
window, so that the light might pass through a small hole, made in it
for that purpose, and fall on the other board, which I placed at about
12 foot distance, having ®rst made a small hole in it also, for some of
414
The Nature of Light
Figure 18.6. The experimentum crucis. From Newton's diagram in the
Lectiones opticae.
that Incident light to pass through. Then I placed another Prisme
behind this second board, so that the light, trajected through both
the boards, might pass through that also, and be again refracted
before it arrived at the wall. This done, I took much as to make the
several parts or the Image, cast on the second board, successively
pass through the hole in it, that I might observe to what places on
the wall the second Prisme would refract them. And I saw by the variation of those places, that the light, tending to that end of the Image,
towards which the refraction of the ®rst Prisme was made, did in the
second Prisme suffer a Refraction considerably greater than the light
tending to the other end. And so the true cause of the length of that
Image was detected to be no other, than that Light consists of Rays
differently refrangible, which, without any respect to a difference
in their incidence, were, according to their degrees of refrangibility,
transmitted toward divers[e] parts of the wall.
Newton demonstrated with his experimentum crucis that light consists of rays differently refracted which are transmitted to different
parts of the wall. ``When any one sort of Rays hath been well
parted from those of other kinds,'' he further wrote, ``it hath afterwards obstinately retained its colour, notwithstanding my utmost
endeavours to change it.'' Thus, if light were modi®ed by the
prism, the second prism would also modify light, producing
additional colors. Newton proved that once the colors were separated in the ®rst prism by refraction, no further separation was
possible. This was the crucial experiment.
Newton did not stop there, however. He reversed the process
to prove that the colors of the spectrum could be recombined into
white light. This he did ®rst by adding a third prism in such a way
415
SUPERSTRINGS AND OTHER THINGS
as to have their spectra overlapping. They combined to form
white. In 1669, the year Newton assumed the Lucasian Professorship, he repeated the experiment with a single prism and a converging lens. The spreading beam of colors struck the lens
which he had placed a meter and a half from the prism, converging into a small patch of white light on the other side of the
lens. The spectrum reappeared beyond this point, as the light
rays diverged. In a letter to the Royal Society, shortly after he
was elected a Fellow in 1672, Newton wrote
I have refracted it with Prismes, and re¯ected with it Bodies which in
Day-light were of other colours; I have intercepted it with the coloured
®lm of Air interceding two compressed plates of glass; transmitted it
through coloured Mediums, and through Mediums irradiated with
other sorts of Rays, and diversly terminated it; and yet could never
produce any new colour out of it.
But the most surprising, and wonderful composition was that of
Whiteness. There is no one sort of Rays which alone can exhibit this.
Tis ever compounded, and to its composition are requisite all the aforesaid primary Colours, mixed in due proportion. I have often with
Admiration beheld, that all the Colours of the Prisme being made to
converge, and thereby to be again mixed, reproduced light, intirily
and perfectly white.
Hence therefore it comes to pass, that Whiteness is the usual
colour of Light; for, Light is a confused aggregate of Rays indued
with all sorts of Colours, as they are promiscuously darted from the
various parts of luminous bodies.
We have learned with Newton that a beam of natural white
light is composed of pure colors, which cannot be broken down
any further. The color of these pure beams is a fundamental property. Pure colors are then the simple components from which
light is made. How many pure colors are there? Since a beam of
white natural light can be refracted through an in®nite number
of angles, and each one of these refractions corresponds to a
pure color, the number of possible pure colors is in®nite.
As we saw in chapter 17, the human eye contains three kinds
of cone cells which are sensitive to colored light due to the presence of three types of light-absorbing molecules that change
shape when they interact with light of wavelengths in the red,
green or blue region of the spectrum. Thus, all the sensory
colors can be reproduced by the appropriate combination of
416
The Nature of Light
these three pure colors. Of the in®nite possible physical colors
that may enter the eye, we sense only three different things.
Many different combinations of physical colors may give rise to
the same sensation, and we interpret this to be the same color.
We can begin to see the limitations of the human eye. If the eye
had more types of light-absorbing molecules, we would be able
to extract more visual information from nature. We have overcome this limitation, in part, by designing and building instruments that enhance our vision.
Spectra: The signature of atoms
Light passed through a prism, Newton taught us, is separated
into its component colors. In analogy with the seven notes in
music, Newton claimed the spectrum of white natural light contained seven colors ± red, orange, yellow, green, blue, indigo,
violet. He saw the spectrum as a continuous band of colors. In
1752, the Scottish physicist Thomas Melvill observed that the
spectrum of a colorless alcohol ¯ame to which a volatile substance was added was not a continuous band of colors but
rather a discrete set of colored lines. In 1802, the English scientist
William Wollaston observed the solar spectrum with the aid of a
small telescope and discovered that dark lines crossed the otherwise continuous band of colors. He thought that these lines were
the boundaries between the different colors and did not pursue
the matter.
In 1814, the German physicist and optician Joseph von
FraunhoÈfer, while testing prisms made with a special glass he
was studying, observed Wollaston's dark lines. He found 576
dark lines in the solar spectrum and measured the positions of
the 324 most prominent ones, assigning letters of the alphabet
from A to K to the more conspicuous lines. We still refer to
some of the lines of the solar spectrum by the letters used by
FraunhoÈfer.
Later, FraunhoÈfer passed the light from a star through a prism
by placing it at the focal point of a telescope. He observed that the
star's spectrum also showed dark lines, but these were at different
locations from those in the solar spectrum. He noticed that the
position of many of these lines corresponded to the colored lines
417
SUPERSTRINGS AND OTHER THINGS
Figure 18.7. Kirchhoff's experiment. (a) A dark-line spectrum is
observed by analyzing a beam of white light after it has passed through
a gas. (b) A bright-line spectrum is seen when the prism is placed to the
side of the container with the glowing gas.
of the discrete ¯ame spectrum discovered by Melvill. He suspected that there might be a connection but his early death at
the age of 39 prevented him from ®nding it. It was left for the
German physicist Gustav Kirchhoff, some 50 years later, to ®nd
the connection. Kirchhoff was able to actually produce dark
lines in his laboratory by passing a beam of white light through
a glass container with gas and analyzing this light with a prism
(®gure 18.7(a)). Kirchhoff then placed the prism to the side of
the glass container to observe the glowing gas from a direction
perpendicular to the beam of incident white light (®gure
18.7(b)). He saw a set of colored lines, like the one seen by Melvill.
More interesting was his discovery that the positions of the dark
lines corresponded to the positions of the bright colored lines in
the spectrum seen on the side. Different gases produced different
sets of dark and bright lines. Kirchhoff correctly interpreted this
phenomenon by saying that the gas absorbed certain wavelengths
418
The Nature of Light
from the incoming white light. These absorbed wavelengths
appeared as dark lines in the front spectrum.
The spectrum crisscrossed with dark lines, like the solar
spectrum, came to be known as a dark-line or absorption spectrum,
whereas the spectrum of bright colored lines is called a bright-line
or emission spectrum. Kirchhoff found that each particular compound when in gaseous form produces its own characteristic
spectrum. The emission spectrum of sodium shows two bright
yellow lines very close to each other and this spectrum is different
from the beautiful set of red, orange and yellow lines of neon or
from the red and blue lines of atomic hydrogen. Each element has
its own unique spectrum, the signature of its atoms. The presence
of a particular set of dark or bright lines characteristic of a certain
element ± its emission or absorption spectrum ± is evidence of the
existence of that element. Thus the spectrum of the Sun tells us of
the existence of certain elements in the atmosphere of the Sun and
the spectra of stars give us information about their composition.
The most profound conclusion that has been drawn from this
study is that the Sun and stars are made of the same stuff as
the Earth.
In 1859 Kirchhoff provided a simple explanation of these
phenomena. This explanation has been formulated as Kirchhoff's
three laws of spectral analysis:
. A heated solid or liquid emits light of all wavelengths producing a continuous spectrum.
. A thin luminous gas emits an emission spectrum.
. White natural light passing through a thin gas produces an
absorption spectrum.
It was, however, Bohr's theory of the atom that provided a
more complete explanation for these phenomena. In 1913 Niels
Bohr explained that the electrons in an atom occupy certain
allowed orbits about the nucleus. As we learned in chapter 7,
Bohr's theory says that when an electron drops from a higher
allowed orbit to a lower one, thereby losing energy, the atom containing the electron emits light which carries off the lost energy.
Since there are only certain allowed orbits, only certain electronic
transitions between those orbits take place. Thus the discrete
spectrum of hydrogen, for example, with its vivid red, bluegreen and blue lines, can be explained. When a beam of white
419
SUPERSTRINGS AND OTHER THINGS
light shines on a container with thin or rare®ed hydrogen gas, a
number of the photons that make up the beam of light will
have energies that exactly match the energy difference between
the lowest orbit or ground state and a particular higher orbit or
excited state of the hydrogen atom. Some of these photons are
absorbed by hydrogen atoms in the gas leaving them in excited
states. These absorbed photons are therefore removed from the
beam of white light and when we analyze this beam with a
prism, the frequencies corresponding to those photons will not
be there; these are the dark lines in the absorption spectrum.
The billions of hydrogen atoms left in excited states are unstable
and eventually their electrons make a transition to a lower energy
state, emitting photons with exactly the same frequencies as the
photons absorbed. These emissions are what we see as one of
the characteristic hydrogen lines. A red line appears when the
electron makes a transition from the third orbit to the second; a
blue-green line when there is a transition from the fourth to the
second orbit.
Notice that we have so far considered a thin or rare®ed gas.
This is because in a thin gas, the atoms making transitions to
lower energy states do not interact with each other. When the
gas is denser, however, some atoms begin to interact with each
other while in the process of emitting light. This interaction
slightly changes the energy levels of the colliding atoms. Whereas
in the thin gas, where the atoms do not interact while making a
transition, the energy levels of all the atoms are exactly the same
and all the transitions between any two particular levels have
the same energy and consequently the same frequency (recall
that E hf ), in the denser gas some of these transitions are slightly
different in energy and therefore in frequency. Because this
difference is small, the lines end up very close to each other and
the overall effect is the thickening of the line. In a denser gas,
the collisions increase and the lines become thicker. If the denser
gas is also at a high temperature, as in the interior of the Sun, its
atoms are moving with higher speeds. In this case, in addition
to the increase in the number of collisions due to the higher kinetic
energies of the atoms, there is a widening of the spectral lines due
to the Doppler effect. The frequency of an emitting atom that
happens to be moving away from us will be decreased slightly,
whereas that of an atom moving in our direction will be increased.
420
The Nature of Light
In the interior of the Sun, both effects widen the lines so much that
they actually run into each other, thus producing a continuous
spectrum. It is only when these photons reach the outer cooler
layers of the sun that they interact with the atoms there producing
the dark lines characteristic of the Sun's spectrum (®gure 18.8
(color plate)).
In a solid, the energy levels of the atoms that make up the solid
are modi®ed into energy bands due to the proximity of billions of
other atoms (chapter 13). For this reason a solid has a continuous
spectrum. In the case of a crystal, where there are some constant
energy levels, the spectrum will have some features.
The Bohr model of the atom works very well for the hydrogen-like atoms; that is, for the hydrogen atom and for other
ionized atoms with one electron in their orbits. It does not work
well for multi-electron atoms. It also raises many new questions
for which it does not provide an answer. What causes an electron
to jump down to a lower energy state, emitting light in the process? Why are there allowed orbits? In what direction is light
emitted when an electron makes a transition to a lower energy
state? These questions worried Einstein and other physicists.
Bohr himself, in an attempt to explain the behavior of the atom
with the classical theory of electromagnetism, wrote a paper in
which he proposed to abandon the conservation of energy and
momentum ± the sacrosanct laws of physics ± for atomic processes. Although at the time of the publication of this paper
(1924) there was no evidence that these laws were valid at the
atomic level, it soon came. In 1925, Arthur H Compton and A W
Simon proved that energy and momentum were conserved at the
level of atoms. In July 1925 Bohr wrote: ``One must be prepared
for the fact that the required generalization of the classical electrodynamical theory demands a profound revolution in the concepts
on which the description of nature has until now been founded.''
As we shall see in Chapter 21, Bohr did not have to wait long for
this ``profound revolution'' to take place.
Young's experiment
The phenomena of absorption and emission of light by the electrons of atoms in a thin gas, which produce the spectra, are an
421
SUPERSTRINGS AND OTHER THINGS
example of the particle behavior of light; the photons being
absorbed and emitted behave as particles of light. As we know,
light also exhibits wave behavior. Although Newton thought
that light was composed of particles, he did not reject the wave
theory of light and even suggested that the different colors had
different wavelengths. What Newton could not accept was the
idea advanced by Huyghens and Hooke that light was a spherical
pressure wave propagating through a medium.
Newton had two objections to this idea. The ®rst was that a
wave did not always travel in straight lines; waves spread out in
all directions and even bend around corners. Light, on the other
hand, travels in straight lines as evidenced by the sharp shadows
cast by a sunlit object. The second objection was related to the
phenomenon of polarization.
The answer to Newton's ®rst objection had to wait for
Thomas Young in the early nineteenth century. The son of a
Quaker banker, Young was a child prodigy who could read
¯uently by age 2 and by age 4 had twice read the Bible. At 14 he
knew eight languages. Young studied medicine, graduating
from the University of GoÈttingen in 1796 at the age of 23. Although
he practiced medicine throughout his life, he was not a very good
physician because of his poor bedside manner and probably
because he was more interested in being a scientist rather than a
physician. Between 1801 and 1803 he was a lecturer on science
at the Royal Institution in London. He made important contributions in mechanics, acoustics and optics. A constant that characterizes the elongation of a solid substance under tension is still
called Young's modulus. He made considerable discoveries regarding surface tension, capillarity and the tides. While still in medical
school, he discovered the accommodation of the lens of the eye.
Soon after obtaining his degree he discovered that ocular astigmatism was due to irregularities in the curvature of the cornea.
Young moved from his research on the eye to the nature of
light. A century and a half before Young, the Italian physicist
Francesco Maria Grimaldi had passed a beam of light through
two small apertures, one behind the other. He found that the
band of light on a surface behind the second aperture was slightly
wider than the width of the ®rst aperture. He realized that the
beam of light had bent slightly and called the phenomenon diffraction. However, he could not explain why the band of light showed
422
The Nature of Light
colored streaks at the edges. His discoveries appeared in a book
published posthumously, which, for the most part, did not
receive much attention. Newton was aware of Grimaldi's experiments and attributed the bending of the beam of light to interactions between the particles of light in the beam and the edge
of the slit. In 1802, Young designed a similar experiment. He
passed a beam of sunlight through a pinhole punched in a
screen. Light spreading out from this hole passed through two
other pinholes punched side by side in a second screen. Light
from these two holes fell on a third screen where an interference
pattern of alternating bright and dark regions or fringes appeared.
This situation is identical to the experiment described in Chapter
15 for two coherent sources of waves. As we might recall, two
waves are coherent if they have the same wavelength and a constant phase relationship. As a rule, light is incoherent because the
crests and troughs of the individual waves are in random relationship to one another. In Young's experiment, coherence is
assured because the light reaching the two pinholes originates
from a single source.
In subsequent experiments, Young replaced the two pinholes
in the second screen by two narrow parallel slits a few millimeters
apart (®gure 18.9). In this case, the interference fringes are alternating dark and bright parallel bands. By measuring the distance
between consecutive bright lines, Young calculated the wavelength of the light. He wrote:
Figure 18.9. Young's double-slit experiment.
423
SUPERSTRINGS AND OTHER THINGS
From a comparison of various experiments, it appears that the breath
of the undulations [or wavelength] constituting the extreme red light
must be supposed to be, in air, about one 36 thousandth of an inch,
and those of the extreme violet about one 60 thousandth; the means
of the whole spectrum, with respect to the intensity of light, being
one 45 thousandth.
Young found that the wavelength of light was much smaller
than Newton had thought. The longest wavelength in the visible
spectrum, that of red light, is less than one-thousandth of a millimeter. When light falls on regular-sized objects, it will appear to
travel in a straight line. Thus, the shadows of regular-sized
objects bathed in sunlight are sharp. The shadows of pinholesize objects, as Grimaldi and Young saw, are not sharp.
Young's work on the wave theory of light was not well
received in England mainly because it went against the particle
model proposed by Newton, even after Young pointed out that
Newton himself had made several statements in support of the
wave theory. Young's theory had to wait until 1818, when two
French scientists, Augustin Fresnel and Dominique Arago, proposed a wave theory of their own with a thorough mathematical
basis. By 1850, the wave theory of light was widely accepted
throughout the scienti®c community.
In Young's experiment, the two slits through which light
from a single source passes are separated by a distance d
(®gure 18.9). The screen where the interference phenomenon is
observed is at a distance L, which, in real situations is much
greater than the distance between the slits. A bright region
appears on the screen at a distance y from the center C. In Chapter
15, we obtained a simple expression to calculate the wavelength
of two coherent sources of waves interfering with each other,
in terms of the distance between the sources (in this case, the
distance between the slits), the distance L to the screen, and the
distance y to a particular bright region. The expression is
yd
nL
where n refers to the order in which the bright regions appear to
the side of the central region; the ®rst maximum at either side of
this central region is the n 1, the next one at either side is the
n 2 maximum, and so on.
424
The Nature of Light
Polarization
The second of Newton's objections to the wave theory of light
concerned the phenomenon of polarization. In 1669, the Danish
scientist Erasmus Bartholin noticed that objects seen through a
special crystal he had recently received from Iceland (now
called Iceland feldspar or calcite) were doubled and that when
the crystal was rotated, one image remained stationary while
the other rotated along with the crystal. Assuming that the light
traveling through the crystal was split into two beams he called
the beam that formed the stationary image the ordinary beam
and that for the moving image, the extraordinary beam. Huyghens
investigated this phenomenon and discovered that the two beams
were further split into four beams by a second crystal unless this
crystal was oriented exactly the same way or at 1808 with the ®rst,
in which case no splitting occurred.
In 1717, Newton also considered this phenomenon and concluded that the beam of light is made up of particles that have two
different ``sides'' and therefore look different when viewed from
different directions. These particles are sorted out in the calcite
according to the orientation they had when they entered the
glass, thus producing the double images. ``By those Experiments
it appears,'' he wrote in his Opticks, ``that the Rays of Light have
different Properties in their different sides.'' To Newton, the
particle theory explained the phenomenon of double refraction
in Iceland spar crystal and that led him to reject the wave
theory. ``To me, at least, this seems inexplicable, if Light be
nothing else than Pression or Motion propagated [as a wave]''
he wrote.
The dif®culty that Newton had with the wave theory of light
failing to explain double refraction arose from his assumption
that a wave of light had to be longitudinal, like a sound wave,
rather than transverse. At the beginning of the 19th century,
Young in England and Fresnel in France showed that light
waves were transverse.
We know today that light is an electromagnetic wave; that is,
oscillations in space and time of electric and magnetic ®elds that
are perpendicular to each other and to the direction of propagation of the wave. It is the transverse nature of electromagnetic
waves what explains polarization. If we shake a taut rope up
425
SUPERSTRINGS AND OTHER THINGS
Figure 18.10. Vertically and horizontally polarized waves on a rope.
and down, producing a train of waves, the rope vibrates in a
vertical plane. In this case, the waves on the rope are polarized
in a vertical plane. Shaking the rope horizontally produces horizontally-polarized waves (®gure 18.10). The convention for direction of polarization of a wave of light is the direction in which the
electric ®eld oscillates. Normally, light is unpolarized because it
is emitted in extremely short, randomly polarized bursts with a
duration of the order of nanoseconds. Since a beam of light consists of an immense number of waves, each randomly polarized,
the net result is no polarization (®gure 18.11).
In 1928 Edwin P Land, while a physics student at Harvard,
invented a transparent plastic material he called Polaroid. In addition to calcite, scientists had produced a synthetic crystalline
material, sulfate of iodoquinine or herapathite, which absorbed
light vibrating in the direction of orientation of the needleshaped crystals. Since the herapathite crystals were so fragile,
Land decided to embed them in a sheet of plastic. When the
plastic was stretched, the crystals lined up like venetian blinds
(®gure 18.12). Later, polymeric molecules composed mainly of
Figure 18.11. Several randomly-polarized waves in an unpolarized
beam.
426
The Nature of Light
Figure 18.12. (a) A sheet of Polaroid in which herapathite crystals are
embedded. Light polarized at right angles to these chains of crystals is
transmitted. (b) Direction in which light is transmitted.
long chains of iodine atoms replaced the herapathite crystals.
Light polarized in a direction parallel to the orientation of the
molecules in the Polaroid is completely absorbed because this
incident light sets up vibrations in the molecules, losing most of
its energy. Light polarized in a direction at right angles to the
Polaroid emerges unchanged. If light is polarized at any other
angle, it is partially absorbed by the molecules and therefore
partially transmitted (®gure 18.13).
In a beam of unpolarized light, the electric ®elds of the many
incoming rays are vibrating in all directions. Each electric ®eld
vector can be decomposed into a component that is parallel to,
say, the orientation of the molecules in a Polaroid placed in
front of the beam, and a component perpendicular to these molecules (®gure 18.14). The parallel components are all absorbed, as
we know, and the perpendicular components all pass through.
Thus, when a beam of unpolarized light passes through a polarizer, half the light is absorbed and the other half is transmitted
with the electric ®elds vibrating in one direction; that is, the
light emerges polarized.
427
SUPERSTRINGS AND OTHER THINGS
Figure 18.13. (a) Light polarized parallel to the orientation of the
Polaroid is completely transmitted. (b) Light polarized in a direction at
right angles to the Polaroid is completely absorbed. (c) Light polarized
at any other angle is partially absorbed.
When a polarized beam of light strikes a Polaroid with its
transmission direction parallel to the polarization of the beam,
the beam, as we have just discussed, will completely pass
through. If the transmission direction of the Polaroid is perpendicular to the polarization of the beam, no light is transmitted. What
Figure 18.14. An unpolarized beam incident upon a Polaroid. Each
electric ®eld vector can be decomposed into a component perpendicular
to the direction of the molecules in the Polaroid and another that is
parallel to this direction. Only the perpendicular components are
transmitted.
428
The Nature of Light
happens when the Polaroid is at an oblique angle to the polarization of the beam? Since each electric ®eld vector, vibrating in the
polarization direction, can be decomposed into a component
parallel to the transmission direction of the Polaroid and a component that is perpendicular, part of the beam is transmitted.
When light falls upon an object, we learned in chapter 17, the
oscillating electric ®eld of the incoming light sets the electrons of
the atoms in the object into vibration. As these electrons absorb
the light's energy, they start vibrating in a direction perpendicular to the incident light, becoming emitters of electromagnetic
radiation in the process. This electromagnetic radiation, reemitted after it is absorbed by the electrons, is the light we see
when we look at the object. The electromagnetic radiation is reemitted most strongly in a direction perpendicular to the direction of vibration of the electrons. Thus, light re¯ected by a
smooth nonmetallic surface is partially polarized, with the direction of polarization parallel to the surface. The light from the
morning sun re¯ected from the surface of a lake is partially polarized horizontally. Polarizing sunglasses, with their polarizing
axis vertical, greatly reduce the glare of this re¯ected light. Photographers sometimes use a polarizing ®lter on the camera lens to
reduce glare.
When light enters a calcite crystal, it becomes polarized in a
very special way. Calcite and other similar crystals like tourmaline (called noncubic because of their particular lattice con®guration), as well as certain stressed plastics such as cellophane, are
birefringent, that is, they split a narrow beam of light into two
beams which are polarized in mutually perpendicular directions.
These two beams are the ordinary and extraordinary beams that
Bartholin discovered. This interesting phenomenon is due to the
particular atomic structure of these crystals. The ordinary and
extraordinary light rays travel at different speeds through the
calcite crystal because of the different ways light interacts with
the atoms of the crystal. There is one particular orientation
along which both rays travel at the same speed and, consequently, there is no separation of the incident ray into two rays.
This direction is known as the optic axis of the crystal. A light
ray that enters the calcite crystal along a direction other than
the optical axis splits into two rays, the ordinary and extraordinary rays. The ordinary ray is polarized perpendicularly to the
429
SUPERSTRINGS AND OTHER THINGS
optic axis and the extraordinary ray is polarized in the same
direction as the optic axis. Rotating the crystals makes the extraordinary ray rotate around the ordinary ray.
Lasers
The light our senses perceive consists of a great number of
photons with many different frequencies, oscillating in random
phases. This re¯ects the chaotic nature of the natural processes
that give rise to them. As we know, light is emitted by atoms
that have been excited to higher energy states by collisions with
other atoms or by absorption of photons. Atoms in these high
energy states emit radiation when they decay to their ground
states. This can occur in two ways: a random, spontaneous emission, or a stimulated emission induced by the radiation emitted
by other atoms of the same kind. In natural processes, spontaneous emission is dominant and the emitted light is incoherent.
Spontaneous emission occurs, for example, in ¯uorescent
lamps. The atoms of the gas in the tube ± mercury vapor mixed
with an inert gas ± are continuously excited to high energy
states through collisions with electrons which are accelerated
back and forth between two electrodes at the two ends of the
tube to which an alternating voltage is applied (®gure 18.15).
These excited atoms decay down to their ground states, emitting
Figure 18.15. Electrons accelerated between two electrodes in a ¯uorescent tube collide with the atoms of the gas in the tube, raising them to
higher energy states. As these atoms return to their ground states, they
emit photons in the UV region which in turn excite atoms and molecules
of the phosphor in the walls of the tube to higher states. When these
atoms return to their ground states, they emit light in the visible part
of the spectrum. This is the light we see.
430
The Nature of Light
light in the ultraviolet. If some of the emitted photons have
energy h equal to the energy difference between the ground
state and an excited state in the atoms and molecules of the phosphor which coats the walls of the tube, then these atoms and
molecules may absorb the photons. This absorption leaves these
atoms in excited states. They decay spontaneously in several
small steps, emitting light in the visible part of the spectrum.
Because the atoms decay at random, the emitted photons are
not in phase with each other; the emitted light is not coherent.
In a laser (light ampli®cation by stimulated emission of
radiation), the individual atoms are induced to radiate in phase.
Albert Einstein predicted that if a photon of energy h
E2 ÿ E1 interacts with an atom which is in the excited state E2 ,
the incident photon may stimulate the atom to emit a second
photon which not only has the same energy but also is in phase
with the incident photon. Thus, stimulated emission produces
light that is coherent.
Einstein also showed that the probability of stimulated emission from state E2 to state E1 is the same as the probability for
absorption of a photon of energy E2 ÿ E1 resulting in a transition
from state E1 to state E2 . In an ordinary gas, most atoms are in the
ground state. It is then much more likely that a photon of the right
energy will be absorbed by an atom in the ground state than it is
to cause stimulated emission in an atom in the excited state.
Although it is impossible to predict when a particular atom in
an excited state will decay to a lower state, the average time of
decay for a group of many atoms is of the order of 10ÿ8 seconds.
There are, however, certain excited states called metastable, in
which atoms on average will stay for 10ÿ3 seconds, 100 000
times longer than in an ordinary state. If there are more atoms
in a metastable state than in the ground state, a situation called
population inversion, stimulated emission will be the dominant
process. A common method of achieving population inversion
is by optical pumping, in which intense light of the right energy
is used to excite many atoms to the higher state. Since the probability of absorption is the same as that for emission, the light
that pumps the atoms to higher states can also pump them
down to the ground state. To sidestep this problem, substances
in which electrons can jump down to a third and even a fourth
level are used.
431
SUPERSTRINGS AND OTHER THINGS
Figure 18.16. In a three-level laser, electrons are pumped to the excited
state of energy Ee from where they spontaneously decay to a metastable
state Em or to the ground state. When enough electrons have accumulated in the metastable state, producing a population inversion, a source
of ordinary light stimulates the electrons to decay to the ground state,
starting the lasing process.
A three-level laser, such as the ruby laser, has three levels
that participate in laser processes; the ground state Eo , an ordinary excited state Ee , and a metastable state Em (®gure 18.16).
When a system with a large number of these atoms in the
ground state is irradiated with photons of energy exactly equal
to the energy difference between the ordinary excited state and
the ground state; that is, Ee ÿ Eo , many atoms will be excited to
state Ee by absorption of one of these photons. Once excited to
this level, the atoms spontaneously decay to the metastable
state Em or to the ground state in an average time of 10ÿ8 s. Electrons stay in the metastable state for a much longer time. When a
suf®ciently large number of electrons populate the metastable
state, producing a population inversion, a second source of ordinary light of energy equal to the energy difference between the
metastable state and the ground state stimulates the decay of
many electrons in the metastable state to the ground state, starting the lasing process.
The fundamental principles of the laser were worked out by
the American physicist Charles H Townes. In 1917, Einstein had
recognized the existence of stimulated emission and, as we have
432
The Nature of Light
seen, showed that the probability of an electron in a state E1 of
absorbing a photon of energy E2 ÿ E1 was equal to the probability
of emission of a photon of the same energy by an electron in state
E2 . In 1951, Townes, then a professor of physics at Columbia University, was undertaking the problem of how to generate electromagnetic waves of great intensity in the microwave region. While
sitting on a park bench one morning waiting for a restaurant to
open, Townes realized that certain molecules had energy states
of the right frequency for microwave emission. The problem
was how to excite enough of them, keep them from decaying
until a suf®ciently large number of them were excited, and ®nally
how to stimulate them to decay at once to produce a powerful
enough beam. That morning, while waiting to have breakfast,
Townes had all these problems solved after doing some calculations on the proverbial back of an envelope. By 1953 he and his
graduate students had constructed a working prototype.
Townes called his device a maser, for microwave ampli®cation
by stimulated emission of radiation. Shortly after, in 1958,
Townes and A L Schawlow showed that it was possible to construct a maser producing coherent radiation in the visible
spectrum. Around the same time the Soviet physicists A M Prokhorov and N G Basov independently worked out the principles.
Figure 18.17.
Close-up of the ®rst laser. (Associated Press.)
433
SUPERSTRINGS AND OTHER THINGS
Physics in our world: Compact disc player
A compact disc player is a digital device; that is, the music
has been stored on the disc as binary codes in the form of a
sequence of pits. When the music is recorded, the changes
in sound intensity are transformed by a microphone into
variations in voltage. This electrical signal is sampled 44 100
times a second from each stereo channel. These values of
the voltage are converted into a binary code of on-off
pulses which are encoded as a sequence of pits on the surface
of a plastic disc 12.5 cm in diameter covered with a re¯ecting
layer of aluminum.
In the CD player, a weak laser beam shines on the minute
pits and ¯at re¯ective spaces in the spiral groove engraved on
434
The Nature of Light
the underside of the spinning disc. When the laser beam
shines on the re¯ective aluminum surface, it is re¯ected
back and detected by a photocell where electrons are
released. When the laser beam enters a pit in the track, the
light is not re¯ected. Thus, the digital code of on-off signals
which was encoded when the disc was produced is decoded
in the CD player. As the number of samples per second is so
large, the music reproduced in the player has extremely high
®delity, free of the hiss of magnetic tape and the surface noise
of the old records.
In 1960, the American physicist T H Maiman at Hughes Research
Laboratories in Miami, Florida, constructed the ®rst laser, using a
synthetic ruby cylinder. Townes, Prokhorov and Basov shared the
Nobel prize in 1964 for the discovery of the maser. Schawlow won
the 1981 Nobel prize for his work on lasers.
Holography
Holography, although associated with lasers, was invented by the
Hungarian engineer and physicist Dennis Gabor in 1947 before
Maiman constructed the ®rst laser in 1960. Gabor said that the
idea for holography came to him as he waited for his turn at a
tennis court in England where he was a research engineer. He
wanted to improve the resolution of the electron microscope;
his ®rst paper on the subject, published in Nature in 1948, was
entitled ``A new microscopic principle.'' He was awarded the
1971 Nobel prize in physics for his discovery.
Gabor coined the word holography, from the Greek word
``holos,'' meaning ``the whole,'' because it records the entire
message of light, not just the intensity, as cameras or even our
eyes do. A hologram is produced by the interference of two
light beams, a reference beam coming directly from the light
source, and the beam re¯ected by the object. The re¯ected beam
varies in phase compared with the reference beam because of
the interaction with the object. A stationary interference pattern
is produced where the two beams overlap. A photographic ®lm
435
SUPERSTRINGS AND OTHER THINGS
Figure 18.18. Looking at a hologram that is being illuminated with the
same light source that produced the hologram produces an image that
appears three-dimensional.
placed in this space can record this pattern: this forms the hologram. When the reference beam is passed through the developed
hologram, the interference pattern diffracts the beam in such a
way that the emerging beam is identical with the original object
beam. If we look at the hologram from the side opposite the
light source (®gure 18.18), we see an image of the object. Because
the wave fronts are identical with those generated by the object,
our eyes cannot distinguish between the perceived image and
the real object and we see what appears to be the object itself,
in three dimensions.
Gabor's original experiments permitted holograms of only
very tiny thin objects (he used transparencies as objects). Anything thicker could not work because the coherence length of
his mercury light source was only a fraction of a millimeter.
The advent of the laser in 1960 provided a beam with a coherence
length of several centimeters, and made possible holograms of
three-dimensional objects. Although these holograms had to be
viewed in laser light, in 1964 the Russian physicist Yu N Denisyuk developed a technique to make holograms that were viewable in white light. Instead of positioning the reference beam and
the object on the same side of the hologram, Denisyuk moved the
reference beam to the opposite side (®gure 18.19). The interference pattern then formed inside the photographic ®lm as sheets
like the pages of a book. When developed, the ®lm acted as an
interference ®lter, which, when illuminated with white light,
436
The Nature of Light
Figure 18.19. Denisyuk's con®guration for a white-light-viewable
hologram.
rejected all inappropriate wavelengths and gave single-color
three-dimensional images.
Denisyuk's work, published originally in the Soviet Union,
was translated into English but was unnoticed by American
scientists. In 1965, Nile Hartman at Batelle Memorial Institute
in Columbus, Ohio, independently discovered a similar technique to make a hologram that could be viewed in white light. In
1969, Stephen Benton of the Polaroid Corporation developed
the rainbow hologram, which produces an image in colors of
the spectrum. These holograms look three-dimensional only
side to side, not up and down. The security holograms now
embossed on some credit cards are of this type. When turned
on its side, this type of hologram loses its three-dimensional
appearance.
437
19
THE SPECIAL
THEORY OF
RELATIVITY
Galilean relativity
The theory of relativity is usually associated with the name
Einstein. It might therefore come as a surprise that the concept
of relativity did not originate with Einstein. The honor belongs
to Galileo. In his Two New Sciences, Galileo discussed the problem
of the behavior of falling bodies on a moving Earth, arriving at
the conclusion that they fall exactly as they would appear to do
if the Earth were not moving. Galileo argued that you cannot
tell whether the Earth is moving or at rest by watching an
object fall.
In the Two New Sciences, Galileo discussed the simpler
problem of uniform linear motion. According to Galileo, if we
are in a ship moving along a straight line with constant speed
and drop a ball from the crow's nest, the ball will fall straight
down, hitting the deck at the foot of the mast, not the water.
But an observer on shore will thus see the ball falling down
following a curved path and not a straight line (®gure 19.1).
If we drop a ball inside a closed room, the ball will fall
straight down whether the room is in our house, a cabin in a
cruise ship or the closed bathroom of an airplane. In fact, no
experiment can be performed inside a closed room that will
reveal to us whether or not the room is at rest or moving along
a straight line at constant speed. If we are in the cabin of a ship
that is moving steadily and drop a ball from the middle of the
ceiling, it will drop on the middle of the ¯oor, just as if we
were doing the experiment in our room at home. Motion in a
straight line at constant speed, according to Galileo, has no
discoverable effects. The only way we can tell whether or not
441
SUPERSTRINGS AND OTHER THINGS
Figure 19.1. (a) An observer in a moving ship will see a ball falling
straight down, as if the ship were not moving. (b) A person on shore
will see the ball move along a curved path.
we are moving is by looking out of the window to determine
whether there is relative motion between us and the Earth. Sitting
in an airplane we have just boarded we are sometimes fooled into
thinking it is moving when the engines are running and another
airplane is taxying nearby. Unless we catch a glimpse of a building or are able to see the ground from where we sit, the vibrations
from the engine and the motion of the other airplane make it
impossible to decide who is moving. A similar experience
occurs when, tired during a long car trip, we may ®nd ourselves
sitting low in the back seat when our driver gets stuck in a traf®c
jam. Seeing only the tops of the other automobiles on the other
lanes, we might think we are ®nally moving, only to discover
that the cars traveling in the opposite direction were the only
ones moving.
In a sense, then, all motion is relative. In everyday situations
we refer motion to the Earth. Although we would say that
when we are sitting in our room reading we are not moving, a
442
The Special Theory of Relativity
Figure 19.2. The plane and all its contents are traveling at 800 km/h
relative to the ground. The book that the passenger is reading and the
passenger herself are at rest relative to the plane.
hypothetical observer traveling through the solar system would
af®rm that our room is actually rotating along with the Earth
and revolving around the Sun. The description of motion
depends on the particular reference frame to which we refer it. If
we are on an airplane that is traveling at a steady 800 km/h, we
would consider the book we are reading to be at rest with respect
to us, the other passengers, and the plane (®gure 19.2). Of course,
the book is moving along with us, the other passengers, and the
plane at 800 km/h relative to the ground. Which view is the
correct one? Both are. The book is at rest in the reference frame
of the plane and moving at 800 km/h in the reference frame of
the ground. While the plane is moving steadily, your coffee will
not spill and your pen will not roll off the fold-out table. The
law of inertia holds and, as we have said, nothing, other than
looking out the window, will tell you that you are moving. A
reference frame in which the law of inertia holds is called an
inertial reference frame.
Suppose now that, while a plane is ¯ying with constant
velocity at 800 km/h, a ¯ight attendant walks from the back of
the plane to the front at a steady pace. Assume that she walks at
a speed of 2 km/h (®gure 19.3). This, of course, is her speed in
the reference frame of the plane. If the velocity of the ¯ight attendant were to be measured from the ground, we would ®nd it to be
443
SUPERSTRINGS AND OTHER THINGS
Figure 19.3. While the plane ¯ies at a constant speed of 800 km/h
relative to the ground, a ¯ight attendant walks down the aisle at a
speed of 2 km/h relative to the plane.
802 km/h. This is hardly surprising to us. It is not uncommon to
see people in a hurry walking on escalators. Their velocities
with respect to the building where the escalator is located are
greater than the velocity of someone who merely rides the escalator. We all have seen children walking down an ``up'' escalator so
that they remain stationary in relation to the building. In this case,
their velocities are the negative of the escalator velocity.
If a reference frame S0 is moving with a velocity vF relative to
a second reference frame S, then the velocity v of an object relative
to S is equal to its velocity v0 in frame S0 plus vF ; that is,
v v0 vF :
Returning to our example of the ¯ight attendant, the velocity of
the reference frame S0 of the airplane relative to the frame of
the ground is vF 800 km/h, the ¯ight attendant's velocity relative to the plane is v0 2 km/h, and her velocity relative to the
reference frame S of the ground is v 2 km=h 800 km=h
802 km/h.
444
The Special Theory of Relativity
Since inertial frames of reference move at constant velocities,
the acceleration of an inertial frame is zero. Therefore, the acceleration of an object in one reference frame is the same as in any
other inertial frame. The object we drop from the ceiling of a
cabin in a steadily moving ship not only falls straight down to
the ground relative to the people on the ship, it also accelerates
at 9.8 m/s2 . This is the same acceleration that an observer on
shore would measure for the falling object if this observer could
see it. Thus, not only the law of inertia ± Newton's ®rst law ±
holds for inertial frames of reference but the second law and
the universal law of gravitation as well. In fact all the laws of
mechanics are the same for all observers moving at a constant
velocity relative to each other, as Newton himself recognized.
This statement is implicit in Galileo's own statement that uniform
motion has no discoverable effects and is known today as the
Galilean principle of relativity. We can state this as follows:
The laws of mechanics are the same in all inertial frames of reference.
This principle means that there is no special or absolute reference
frame; all reference frames are equivalent. Thus, there is no
absolute standard of rest; uniform motion has to be referred to
an inertial frame.
A word of caution before we leave this section is in order. We
have implicitly said that the Earth is an inertial reference frame.
This is not exactly true, because the Earth is rotating, so any
point on its surface is always accelerating. However, this acceleration is very small, and the rotational effects of the Earth can be
neglected. The Earth can thus be considered an inertial frame of
reference for our purposes.
The Michelson±Morley experiment
The Galilean principle of relativity was ®ne for mechanics. However, when Maxwell deduced the existence of electromagnetic
waves that travel at the speed of light, scientists began thinking
about the medium through which these waves propagate.
Although Maxwell concluded that electromagnetic waves
would propagate in empty space, the physicists of this period,
being familiar with mechanical waves which require a medium
445
SUPERSTRINGS AND OTHER THINGS
to propagate, naturally assumed that electromagnetic waves also
required a medium, and could not accept the idea of a wave propagating though empty space. On the Earth, light propagated
through air, water, and other transparent media. But light also
came from the Sun and the stars, and the space between the
stars did not appear to be ®lled with any known substance.
Clearly, they reasoned, a transmitting substance existed that
®lled all space; the luminiferous ñther ± or simply the ``ether.''
The ether was a transparent medium thin enough to allow the
motion of the planets but rigid enough to allow for the propagation of light with tremendous speed. Even Maxwell was convinced: ``We have, therefore, some reason to believe,'' he wrote
in 1865, ``from the phenomena of light and heat, that there is an
ñthereal medium ®lling space and permeating bodies, capable
of being set in motion from one part to another, and of communicating that motion to gross matter so as to heat it and affect it in
various ways.''
Albert A Michelson was one of several physicists who tried
to detect the motion of the Earth through the ether. Like all the
others, he failed. Michelson was born in Strelno, Prussia (now
Strzelno, Poland), on December 19, 1852. When he was four, his
parents emigrated to America and settled in San Francisco,
where they went into business. There, the young boy grew up.
As a teenager, Michelson entered the United States Naval Academy at Annapolis where he excelled in science, becoming a physics and chemistry instructor after graduation. In 1878, he began
thinking about better ways to measure the speed of light. Realizing that he required more formal studies in optics, he traveled to
France and Germany for advanced study, as was the custom in
those days. Upon his return to the United States, Michelson
became a professor of physics at the Case School of Applied
Sciences (now known as Case Western Reserve University), and
later, the ®rst head of the physics department at the University
of Chicago. In 1907, Michelson became the ®rst American to
win the Nobel prize in the sciences.
While in Germany working in the laboratory of Hermann
von Helmholtz, Michelson invented an ingenious instrument
with the idea of measuring the Earth's velocity with respect to
the ether. Michelson's interferometer, as the instrument came to be
known, was based on an idea ®rst proposed by Maxwell in
446
The Special Theory of Relativity
1875. As the Earth moves through space, Maxwell reasoned, the
ether that permeates space would create a wind. If we measure
the velocity of light in the direction of motion of the Earth
around the Sun (that is, in the opposite direction to that of the
ether wind) we would obtain a value equal to the speed of light
with respect to the ether minus the speed of the ether. If we measure the speed of light in the opposite direction we would obtain a
value equal to the speed of light with respect to the ether plus the
speed of the ether. If we measure the speed of light in a direction
at right angles to the ether wind we would obtain the actual velocity of light relative to the ether. This situation is similar to the
situation of a swimmer swimming 50 meters, as measured on
shore, ®rst upstream and then the same 50 meters downstream
and later swimming 50 meters back and forth at right angles to
the current. The times for each round trip are going to be different. If we know at what speed the swimmer swims in still waters,
we can deduce the velocity of the current.
Figure 19.4 shows a schematic representation of Michelson's
interferometer. A half-silvered mirror splits a beam of light into
two beams ± one re¯ected and one transmitted ± that travel in
perpendicular directions. The re¯ected beam strikes mirror M2
and is re¯ected back to the half-silvered mirror. The transmitted
beam strikes mirror M1 and is also re¯ected back to the halfsilvered mirror. There the two beams are again partly re¯ected
and partly transmitted and interfere. This interference pattern
can be observed with a telescope.
Michelson's ®rst attempt to detect a difference in the speed of
light in any direction was ``unsuccessful;'' no difference was
detected. He decided that he needed more sensitive equipment.
In 1887, Michelson, now at Case Western Reserve, and his friend
and collaborator, Edward W Morley, a professor of chemistry at
the University, decided to try again. They placed the interferometer on a square slab of stone one and a half meters wide, ¯oating
on mercury to minimize vibrations as the apparatus was rotated to
measure the light from the stars in different directions. When the
apparatus was oriented in the direction of the Earth's motion, a
certain interference pattern was observed due to the difference
in the optical paths of the two perpendicular beams; one parallel
to the Earth's velocity and the other perpendicular to it. As the
interferometer was rotated 908, the beam that was parallel to the
447
SUPERSTRINGS AND OTHER THINGS
Figure 19.4. Schematic diagram of Michelson's interferometer. A narrow
beam of light is split into two beams by a half-silvered mirror. These two
beams travel in perpendicular directions to two mirrors which re¯ect the
beams back to the partially-re¯ected mirror where they are again partly
transmitted and partly re¯ected. The two beams interfere with each
other. This interference pattern can be observed with a telescope.
Earth's velocity was now perpendicular and the one that was
perpendicular became parallel. Thus, a shift in the interference
pattern was expected. Michelson and Morley calculated that this
shift should have been 0.4 fringe; that is, slightly less than half
the width of a fringe. The results were again negative. ``The
actual displacement,'' they reported, ``was certainly less than a
twentieth part of this, and probably less than a fortieth part.''
Michelson was known as a meticulous experimenter and the
negative results of his experiment puzzled other physicists. The
Irish physicist George FitzGerald advanced an explanation in
1882. He proposed that the linear dimensions of all objects are
shortened along the direction of motion of the Earth. Thus, the
arm of the interferometer holding the mirrors placed in the direction of motion of the Earth is shortened. This contraction is exactly
what is needed to compensate for the difference in the velocities of
light in the direction of the ether and perpendicular to it. We do
measure a smaller velocity of light in the direction opposite the
ether wind as compared to the direction perpendicular to it, he
proposed, but the times are the same because the length along
448
The Special Theory of Relativity
the former direction is shorter by the right amount. Because
everything on the Earth, including meter sticks, are contracted
in the direction of the ether wind, we cannot directly measure
this contraction. In 1895, the Dutch physicist Hendrik A Lorentz
independently proposed this contraction and provided an explanation for it in terms of changes in the electromagnetic forces
between the atoms. This contraction, proposed without other
empirical support and only to provide an explanation for the
null results of the Michelson±Morley experiment, is known
today as the Lorentz±FitzGerald contraction.
Einstein's postulates
Albert Einstein was eight years old when Michelson and Morley
were attempting to measure the velocity of the ether wind. He
had been late to begin to talk but was not a poor student in his
early youth, as popular mythology has it. A year before this
famous experiment took place, in 1886, when Einstein was
seven, his mother Pauline wrote to her mother, ``Yesterday
Albert got his marks. Again he is at the top of his class and got
a brilliant record.'' A year later his grandfather wrote, ``Dear
Albert has been back in school a week. I just love that boy,
because you cannot imagine how good and intelligent he has
become.'' At ten, Einstein entered the Luitpold Gymnasium or
secondary school in Munich, a city not far from his native Ulm
in Germany, where he excelled in mathematics and physics, subjects in which he was, ``through self study, far beyond the school
curriculum,'' as he wrote years later. At the age of twelve, he was
given a book on Euclidian geometry which he studied with
enthusiasm. ``The clarity and certainty of its contents made an
indescribable impression on me,'' he later wrote in a small autobiographical essay. Einstein, however, disliked the Gymnasium
and its astringent learning atmosphere, and once remarked that
its teachers were like lieutenants.
When he was ®fteen, his father's business failed and the
family moved to Pavia, Italy. Einstein was left behind to ®nish
his secondary education at the Gymnasium. After six months,
depressed and nervous, he persuaded his family physician to
provide him with a certi®cate stating that owing to nervous
449
SUPERSTRINGS AND OTHER THINGS
disorders he needed the company of his family. Einstein left the
Gymnasium without informing his parents and joined them in
Pavia. He promised his disappointed parents that he would
study on his own to prepare for the entrance examination at the
prestigious Swiss Federal Polytechnic Institute, the ETH, in
Zurich, where his father wanted him to study electrical engineering. Einstein was examined in political and literary history,
German and French, drawing, mathematics, descriptive geometry, biology, chemistry and physics, and was required to write
an essay. He failed. However, because he had done well in mathematics and the sciences, the director of the Polytechnic suggested
that he obtain a diploma at a Swiss secondary school and reapply.
A year later Einstein graduated with near-perfect grades from the
cantonal school in Aarau, in the German-speaking region of
Switzerland. Now armed with his high school diploma, Einstein
was admitted at the ETH without further examination.
Einstein entered ETH on October 26, 1896, at the age of 17, to
study physics. Under six professors ± one of them Hermann Minkowski, who was to participate in giving the theory of relativity
its mathematical formalism ± Einstein studied mathematics.
Three professors taught physics and astronomy. He also added
electives such as gnomic projection, anthropology, geology, banking and stock exchange, politics, and Goethe's philosophy. He
did not attend lectures regularly, however, preferring to spend
his time in the physics laboratory and in the library reading the
original works of Maxwell, Kirchhoff, and Hertz. He relied on
good class notes taken by his friend Marcel Grossmann to cram
for examinations. Einstein particularly disliked the physics
courses taught by Heinrich Weber because he did not present
anything about Maxwell's theory, and Maxwell's theory was
``the most fascinating subject at the time that I was a student,''
as Einstein wrote later. The dislike was mutual, as Weber did
not like Einstein's forthrightness and distrust for authority. In
Europe at the time, a professor was an exalted person, respected
and revered by lesser people.
Einstein graduated from the ETH in August 1900. Three
other students graduated with him and the three immediately
obtained assistantships at the ETH. Einstein was not offered a
position. He then looked for other university positions and was
rejected. In 1901, he wrote, ``From what people tell me, I am not
450
The Special Theory of Relativity
in the good graces of any of my former teachers . . . I would long
ago have found a [position] had not Weber intrigued against me.''
In 1901, still without a job, Einstein returned to physics and
wrote his ®rst scienti®c paper, on intermolecular forces, which
appeared in volume 4 of the prestigious scienti®c journal Annalen
der Physik. He then wrote a research paper on thermodynamics
which he submitted to the University of Zurich to obtain his
doctoral degree. The paper was rejected by Professor Kleiner as
a PhD thesis. In June 1902, with the help of a recommendation
from Marcel Grossmann's father, Einstein ®nally obtained a job
as a clerk in the Swiss Patent Of®ce in Bern. This was not an
academic job but it might have been the best job that Einstein
could have had at that time, because it was undemanding and
left him enough time to think about physics. It was there, at the
patent of®ce, that Einstein developed and published his Special
Theory of Relativity.
1905 was Einstein's annus mirabilis, ranking in the annals of
physics with the other annus mirabilis of 1666 when Newton
went home in Woolsthorpe and explained how the universe
works. That year of 1905 Einstein published a paper which was
``very revolutionary,'' as he wrote to a friend. This paper was
indeed revolutionary; it was the paper that laid the foundation
for quantum theory, with the introduction of the concept of
quanta of energy or photons, and that was eventually to earn
him a Nobel Prize in physics. Less than a month after submitting
this paper for publication, he sent off a second paper which was
to gain him the PhD degree from the University of Zurich,
accepted by the same Professor Kleiner who had rejected his
®rst submission; it was the sugar paper mentioned in chapter 1.
``A New Determination of the Sizes of Molecules,'' he titled it.
Within a month he submitted a third paper explaining the erratic,
zigzag motion of a speck and helped to establish the existence of
atoms. This was called Brownian motion, although Einstein did
not know this. On June 30, 1905, Annalen der Physik, the journal
where Einstein had published all his papers, received the
fourth manuscript from Einstein during that year, titled ``On
the Electrodynamics of Moving Bodies.'' This was the special
relativity paper submitted only ®fteen weeks after his ®rst
paper of that incredible year. But Einstein was not ®nished; he
still had time that year for a ®fth paper which he titled ``Does
451
SUPERSTRINGS AND OTHER THINGS
the Inertia of a Body Depend upon Its Energy Content?'' This was
the paper containing the famous formula E mc2 , which was to
become synonymous with his name. Einstein was twenty-six
that year.
Einstein's 1905 paper on special relativity starts by recognizing that Maxwell's theory of electromagnetism makes a distinction between rest and motion. He gave the example of a
magnet and a wire moving relative to each other. If the magnet
moves and the wire is at rest, the moving magnet generates an
electric ®eld that produces a current in the wire. However, if
the wire is moving and the magnet is at rest, there is no electric
®eld, but the moving conductor experiences a force which sets
up an electric current equal to the current in the former case.
This means that the states of rest and motion can be identi®ed.
Absolute motion does not exist; only relative motion matters.
Galileo had discovered that. Either Galileo was wrong or
Maxwell was wrong. Einstein decided that electromagnetism
had to be reformulated so that the description depends only on
the relative motion. He proceeded to do just that in the second
part of his paper.
Einstein chose not merely to believe that only relative motion
matters; he elevated this idea to the status of a postulate and
called it the Principle of Relativity. Einstein introduced a second
postulate ``. . . that light is always propagated in empty space
with a de®nite velocity c which is independent of the state of
motion of the emitting body.'' The special theory of relativity is
based on these two fundamental postulates. Let us state them
here in Einstein's own words:
Postulate 1 ± The Principle of Relativity: The same laws of
electrodynamics and optics will be valid for all frames of
reference for which the equations of mechanics hold good.
Postulate 2: Light is always propagated in empty space with a
de®nite velocity c, which is independent of the state of motion
of the emitting body.
The ®rst postulate is an extension of the Galilean Principle of
Relativity to cover not only mechanics but also electromagnetism
and optics. Actually, in extending the principle of relativity to
include optics and electromagnetism, Einstein meant to include
all of physics, since mechanics, optics and electromagnetism
452
The Special Theory of Relativity
were all of physics at that time. The Principle of Relativity can
then be stated as follows:
Postulate 1 [alternate version]: The laws of physics are the
same for all observers moving in inertial (non-accelerated)
frames of reference.
The second postulate seems to contradict common sense. It
was Einstein's solution to a puzzle he had struggled with since
he was a young boy of sixteen. What would happen if we could
travel at the speed of light? Newtonian physics does not forbid
us from doing so. Accelerating a spaceship continuously, for
example, would eventually allow it to reach the speed of light
and even any speed beyond it. Traveling at the speed of light,
in the same direction as that of a light beam, we would notice
that the wave pattern of the light beam disappears. Einstein
realized that Maxwell's equations do not allow such a possibility.
Either Maxwell's equations are wrong or traveling at the speed of
light is an impossibility. Einstein decided that Maxwell equations
were correct. The speed of light must be a limiting velocity; no
material body can ever reach it. As such, it must be the same
for all observers in inertial frames of reference.
Time dilation
The constancy of the speed of light for all inertial observers has
important consequences for the way we measure time. Imagine
two identical trains, one moving at a constant velocity v and
the other standing by the railroad station. These two trains are
in inertial frames of reference. Suppose further that the velocity
of the moving train is extremely large, close to the speed of
light. Of course, no train is capable of reaching these velocities
(the spacecraft Galileo achieves velocities of about 0.0001c), but
this is a thought experiment. In one of the cars in the moving
train, an observer has placed a mirror on the ceiling of the car
and a light source on the ¯oor of the car. A second observer in
one of the cars of the stationary train has an identical set up in
her car. Imagine further that the two cars where this equipment
has been installed have large windows that allow each observer
to completely view the other set up but they are unable to see
453
SUPERSTRINGS AND OTHER THINGS
Figure 19.5. (a) Two identical trains with light sources and mirrors in
motion relative to each other. (b) Path seen by observer on the ground.
the ground or any other structure around the trains. For clarity,
we are assuming that we are in the same reference frame as the
parked train; that is, the stationary train is at rest relative to the
ground, where we also are. Both observers agree that the distance
from the mirror to the light source is the same and they label it d
(®gure 19.5(a)).
When the two cars are facing each other, the observers ®re
the light sources. The observer in the moving train measures
the time t0 it takes for the light in his set up to make the round
trip to the mirror and back. Since the distance for the round
trip is 2d, the time is
t0
2d
:
c
The observer in the stationary train, however, measures a different
time t for the light beam in the experiment in the moving train.
Because the train is moving with a velocity v, by the time the
light reaches the mirror, the mirror would have moved a certain
distance (®gure 19.5(b)). According to the stationary observer,
454
The Special Theory of Relativity
if the beam is to hit the mirror, it would have to move at an angle
and therefore travel a longer distance. Since light travels at the
same speed in both reference frames, the beam of light must travel
a longer time in the reference frame of the stationary observer.
The expression for the time measured by the stationary
observer for the round trip of the light beam in the moving
train is
t0
t p t0
1 ÿ v2 =c2
p
where the factor 1= 1 ÿ v2 =c2 ( is the Greek letter gamma).
This expression is not dif®cult to derive using simple algebra (see
Box). Notice that v < c, otherwise the quantity under the square
root sign will be negative, giving us an imaginary number. This
means that the factor is always greater than 1 (it is equal to 1
when v 0), making t > t0 ; that is, the time measured by the
stationary observer is always greater than the time measured
by the moving observer by the factor .
Because we can use the ¯ash-mirror device as a clock by
keeping track of the number of return trips, we can see that time
¯ows more slowly in the moving frame of reference. ``Not so!,''
the observer in the moving train would say. ``I have been watching
the experiment in her train,'' the moving observer says, ``and she
is the one moving. I measure a time that is longer than what she
measures for her own experiment.'' Who is right? Both are right.
That is the whole point of special relativity. The choice of reference
frame does not matter, since there is no preferred frame. Our own
reference frame is at rest for us but, as we have seen before, might
not be for an observer able to see us from space. In fact the socalled moving train might be traveling west at the same speed
as the linear speed due to the Earth's rotation, and for an observer
in space that train might be the stationary one. The only thing we
can af®rm is that the time interval between two events (the
emission of the ¯ash of light and its arrival at the mirror in our
example) as measured by an observer who sees the events occur
in his own reference frame is always smaller than the time interval
between the same events as measured by another inertial
observer. The time interval measured by an observer in his own
reference frame is called the proper time. We can summarize our
455
SUPERSTRINGS AND OTHER THINGS
The stationary observer sees the light beam move along the
sides of an isosceles triangle of base vt, since the train is
moving with speed v. If we divide this triangle into two
right triangles, as shown in the ®gure below, we can apply
the Pythagorean theorem to one of the two right triangles.
According to the second postulate, the speed of the light
beam traveling in the moving train as measured by the
stationary observer is still c. Therefore, if the time measured
by the stationary observer is t, the total distance traveled by
the light beam as it moves up to the mirror and returns is
ct, and the distance for the one-way trip to the mirror is
then ct=2. This is the hypotenuse of our right triangle. Thus,
Right triangles used in the calculation of t.
ct
2
2
vt
2
2
d2
which becomes after multiplying through by 4,
c2 t2 ÿ v2 t2 4d2 :
Solving for t, we obtain
2d
2d
2d=c
t p p p :
2
2
2
2
1 ÿ v2 =c2
c 1 ÿ v =c
c ÿv
But 2d=c t0 . Thus, the time measured by the stationary
observer for the round trip of the light beam is
t0
t p t0 :
1 ÿ v2 =c2
456
The Special Theory of Relativity
discussion by saying that
time in the moving reference frame always ¯ows more slowly.
This phenomenon is called time dilation. It is not an illusion but a
real phenomenon that has been experimentally observed. The
®rst observation involved the muon, an elementary particle
with a mass 207 times that of the electron, which is found in
cosmic rays, a natural source of high-energy particles. The halflife of the muon is 1:52 10ÿ6 s when measured in the laboratory.
Moving at nearly the speed of light, about half of the muons that
reach the upper atmosphere would travel a distance of some
450 m. Yet observations have indicated that half of these muons
actually travel distances of about 3200 m. This phenomenon is
due to time dilation. In the rest frame of the muon, the life-time
is 1:52 10ÿ6 s, which is what we measure for the muons in the
laboratory, when we are also in the same frame of reference.
However, for an observer on the Earth, the muon is traveling at
nearly the speed of light, and the half-life as measured by this
observer is increased by a factor of to 10:8 10ÿ6 s. As seen
by the observer on Earth, the muons travel a greater distance
due to the increased half-life (®gure 19.6).
Figure 19.6. In the muon's reference frame, the half-life is 1.6 ms. At
nearly the speed of light, a muon would travel a distance of about
450 m. For an observer on the ground, this half-life is increased by a
factor of to 10.8 ms, increasing the distance traveled to about 3200 m.
Half of the muons detected at the top of a 3200-m mountain are detected
at the foot of the mountain.
457
SUPERSTRINGS AND OTHER THINGS
The frontiers of physics: Intergalactic travel
Relativistic time dilation might make human intergalactic
travel possible, although not with our present technology.
The distance to the nearest spiral galaxy, M31, the great
galaxy in Andromeda, is about one million light-years. A
spaceship capable of maintaining a constant acceleration of
1 g (9.8 m/s2 ) during the entire trip, will reach M31 in only
28 years, ship time. However, the crew's families and friends
back on Earth ± and perhaps even Earth's civilization as they
knew it before leaving ± will be long gone way before they
reach their destination, since Earth time for their trip will
be several million years.
M31, the great galaxy in Andromeda, at a distance of one million
light years from us. (Courtesy NASA.)
A constant acceleration of 1 g produces an incredible
increase in speed in a relatively short time. The ®gure below
illustrates this rapid increase in speed. We see that it takes
over three years ship time to reach the nearest star, located
at a distance of nearly four light-years, and some 5 years to
reach Epsilon Eridani, a star 10 light-years away from us. It
only takes 21 years, however, to reach the center of our
458
The Special Theory of Relativity
galaxy which is 30 000 light-years away. Seven additional
years, ship time, will bring this accelerating spacecraft to the
vicinity of M31.
The ships and propulsion systems that we have developed for our nascent space exploration are but a shadow of
the mighty ships needed to even consider interstellar travel.
An acceleration of 1 g is achievable now but cannot be maintained over a long period of time. Maintaining it for an entire
trip to another star requires the engineering of a much more
advanced technology.
It is not only muons that experience time dilation, but everything. Biological processes, such as the rate at which cells divide
or the heart beats, are also slowed down. Time dilation implies
the possibility of interstellar travel and even intergalactic travel,
when the technologies of the future are able to provide us with
the vehicles and engines required for acceleration to speeds
close to the speed of light. Our galaxy, the Milky Way, is about
one hundred thousand light-years in diameter. At close to the
459
SUPERSTRINGS AND OTHER THINGS
speed of light, it would take slightly over one hundred thousand
years to travel from one end of the Galaxy to the other. This time,
however, is measured by a stationary observer back on Earth. A
spacecraft of the future would be able to circumnavigate the
entire Galaxy in less than a human lifetime, as measured on the
ship.
Simultaneity
In his 1905 special relativity paper, Einstein writes in one of clearest pieces of scienti®c prose ever written:
If we wish to describe the motion of a material point, we give the
values of its coordinates as functions of time. Now we must bear carefully in mind that a mathematical description of this kind has no physical meaning unless we are quite clear as to what we understand by
``time.'' We have to take into account that all our judgements in which
time plays a part are always judgements of simultaneous events.
But what are simultaneous events? Einstein gives an example
which Leopold Infeld, one of Einstein's collaborators, called
``the simplest sentence I have ever encountered in a scienti®c
paper'':
If I say, for example, ``the train arrives here at 7,'' this means: the
coincidence of the small hand of my watch with the number 7 and
the arrival of the train are simultaneous events.
Einstein, however, explained that although it could be possible to de®ne ``time'' by substituting ``the position of the small
hand of my watch'' for ``time,'' such a de®nition is no longer
satisfactory ``when we have to connect in time series of events
occurring at different places . . .'' In other words, how can we
say that two events, one in Paris and the other one in Los Angeles,
are simultaneous? We could have a friend in Paris and another in
Los Angeles call us over the telephone, we could ``synchronize''
our watches and be able to determine if the two events are
indeed simultaneous. The problem with this approach is that
the telephone conversations are transmitted using electromagnetic waves which travel at the speed of light and although,
according to the second postulate, this speed is the fastest
speed achievable, it is still ®nite. The signals then are going to
460
The Special Theory of Relativity
Figure 19.7. For the passenger in the moving train (left), the two beams
that originate at the same time in the middle of the train reach the two
walls simultaneously. A person standing outside the train will detect
the beam striking the back wall ®rst.
take slightly different times to arrive at our location but since we
know the speed of light, we can compensate for the delay in the
arrival of the signal from the more distant location. Of course, if
we are located midway between New York and Paris, somewhere
in the Atlantic Ocean, the signals take the same time to arrive and
there is no need for compensation.
The problem arises when we consider events in two different
inertial systems. Consider again a train traveling with a constant
velocity (®gure 19.7). On the train, a passenger is standing in the
middle of a car holding two ¯ashlights pointed toward the front
and back walls which he turns on at the same time. In the inertial
reference frame of the train, the two beams move at the speed of
light c and reach the two walls simultaneously. For an observer
standing on the station platform the two beams also move at
the speed c. However, she sees the back wall of the train
moving to meet the traveling light beam and the front wall
moving away from the beam that is heading in the direction in
which the train is moving. According to the observer standing
on the platform, the beam heading toward the back wall strikes
®rst, and the two events are not simultaneous for her. Therefore,
we must conclude that events that are simultaneous in one inertial
frame are not simultaneous for observers in another inertial frame.
This is a consequence of Einstein's second postulate, the constancy of the speed of light in any inertial reference frame.
461
SUPERSTRINGS AND OTHER THINGS
Length contraction
The Lorentz±FitzGerald contraction, introduced in 1882 to
explain the null results of the Michelson±Morley experiment,
turned out to be a consequence of the special theory of relativity.
To see how this length contraction appears, let us return to our
example of the two trains (Figure 19.8). One train is moving
with a constant velocity v and the other is standing by the station.
In the stationary train, a passenger measures the length of her car
to be L0 . This length, measured by the observer in her own reference frame, is called proper length. The proper length is what we
call the characteristic length of the object. A passenger in the
moving train also measures the length of the car in the stationary
train. This observer uses his precise watch to measure the time t0 it
takes the stationary car to pass through a reference mark he has
made on the window of his own car. He calculates the length of
the stationary car to be L vt0 . The passenger in the stationary
Figure 19.8. The passenger in the stationary train measures the length
of her car to be L0 . The passenger in the moving train measures the
length of the stationary train by timing its motion across a mark on his
window.
462
The Special Theory of Relativity
train, which is also observing the experiment, measures the time t
it takes the mark on the moving train to move at a speed v from
the back to the front of her stationary car, and she is able to
corroborate that the length of her car is vt L0 . She knows that,
because moving clocks run slow, the time t0 measured by the
moving observer for the passage of her car past the mark on
the moving train is less that the time t she measures for that
mark to move across her stationary car. The time t she measures,
we already know, is equal to times the time t0 measured by the
moving observer:
t0
t t0 p
1 ÿ v2 =c2
p
so that t0 t 1 ÿ v2 =c2 . The length L of the stationary car measured by the observer in the moving train is
q
L vt0 vt 1 ÿ v2 =c2
and since vt L0 , the proper length measured by the observer in
the stationary train for her own car, the length measured by the
observer in the moving car is
q
L L0 1 ÿ v2 =c2 :
p
Because the factor 1 ÿ v2 =c2 is always less than 1, L is always
smaller than L0 ; that is, the length of any object in motion relative
to an observer is always less than the length of the object measured by an observer at rest relative to the object.
Length contraction is a real effect, not an optical illusion. It
is a direct consequence of the constancy of the speed of light for
all inertial frames of reference and has been experimentally
observed. As with time dilation, reversing the situation produces
exactly the same effect. That is, if the passenger in the moving
train claims that he is not moving and that the stationary train,
together with the station are moving with a speed v, he would
measure a proper length L0 for his own car and, for him, the
length measured by the passenger in the stationary
train,
p
moving relative to him, is shorter by a factor
1 ÿ v2 =c2 .
Again, both points of view are correct since there is no preferred
reference frame.
463
SUPERSTRINGS AND OTHER THINGS
Addition of velocities
In his 1905 paper on special relativity Einstein concluded that the
speed of light must be a limiting velocity for any material body. If
we imagine a spaceship moving at 299 000 km/s relative to us on
the Earth, could it not be possible for the crew on the ship to
launch a reconnaissance ship at a speed of 10 000 km/s in the
same direction? According to Galilean relativity, we should
measure a speed of 299 000 km=s 10 000 km=h 309 000 km=s,
greater than the speed of light of 299 792 km/s. According to
special relativity, it is not possible for the reconnaissance ship
to surpass the speed of light. If this is the case, how do we
calculate the ship's velocity?
In his relativity paper Einstein shows that if an object moves
with a velocity u0 relative to an inertial frame which in turn is
moving with a velocity v with respect to a second inertial frame
(®gure 19.9), the velocity of the object in the second frame is
u
v u0
:
1 vu0 =c2
For our particular example, the ship's velocity is v 299 000 km/s
relative to the Earth and the reconnaissance ship travels at
Figure 19.9. A reconnaissance ship traveling at a speed u0 relative to a
mother ship, which moves at a speed v relative to the inertial frame xy
of the Earth.
464
The Special Theory of Relativity
u0 10 000 km/s relative to the ship. We would measure a velocity
u for the reconnaissance ship of
u
299 000 km=s 10 000 km=s
1 299 000 km=s 10 000 km=s= 299 792 km=s2
299 051 km=s:
Notice that the numerator in Einstein's expression for the
transformation of velocities is the Galilean transformation of
velocities. In special relativity, this simple addition must be
divided by the quantity 1 vu0 =c2 . If the object is moving in the
same direction as the moving reference frame, the quantity in
the denominator is greater than 1 and the velocity u in the stationary frame is smaller than what it would be in Galilean relativity.
We should also notice that when the velocities are small compared with the velocity of light, the denominator becomes
almost 1, and Einstein's transformation of velocities reduces to
the transformation of Galilean relativity.
E mc2
Einstein's ®fth and last paper of the year 1905, ``Does the Inertia
of a Body Depend upon Its Energy Content?,'' is only three pages
long. ``The results of the previous investigation,'' writes Einstein
in this short and beautiful paper, referring to his special theory,
``lead to a very interesting conclusion, which is here to be
deduced.'' What Einstein deduced was that inertial mass and
energy are equivalent.
In his paper Einstein considers an atom undergoing radioactive decay and emitting light. Applying the principles of
conservation of energy and momentum to the decay he showed
that the atom resulting from the decay had to be less massive
than the original atom. Remember that in 1905, Rutherford had
not yet introduced his nuclear model of the atom and radioactivity
had been discovered only some ®ve years before. As we saw in
chapter 8, this difference in mass accounts for the binding
energy of nuclei, but this problem did not exist until 1911 when
Rutherford published his nuclear model. In his paper, Einstein
concluded:
465
SUPERSTRINGS AND OTHER THINGS
Figure 19.10. Bubble chamber photograph of a collision between two
protons in a high-energy particle accelerator. (Courtesy Brookhaven
National Laboratory.)
If a body gives off the energy E in the form of radiation, its
mass diminishes by E=c2 . The fact that the energy withdrawn
from the body becomes energy of radiation evidently makes no difference, so that we are led to the more general conclusion that
The mass of a body is a measure of its energy-content . . .
''If a body gives off the energy E in the form of radiation, its
mass diminishes by E=c2 .'' As Einstein himself later pointed out,
this conclusion has general validity. An object's mass is a form of
energy. Two protons colliding in a high-energy particle accelerator create a spray of new particles, as can be seen in a bubble
chamber photograph (®gure 19.10), as kinetic energy is converted
to mass energy. If a particle of rest mass m0 disintegrates spontaneously, all its inertial mass is converted into energy. Since
466
The Special Theory of Relativity
Figure 19.11. Pair production. A gamma ray photon of suf®cient energy
passes near a nucleus and disappears creating an electron and a positron
whose tracks can be seen in a cloud chamber curving away from each
other due to an external magnetic ®eld. The gamma ray leaves no
track. (Courtesy Brookhaven National Laboratory.)
this inertial mass is m0 E=c2 , the energy released is
E m0 c2 :
This is Einstein's most famous equation.
This mass±energy equivalence also means that an object of
mass m0 can be created out of energy. The phenomenon of pair
production is an example of this process. When a gamma ray
photon passes near an atom, it sometimes disappears, creating
an electron and its anti-particle, the positron, leaving the nearby
atom unchanged (®gure 19.11). The photon's energy has been
converted into mass.
The total energy of a particle of mass m0 moving with a speed
v and having a kinetic energy KE is
E m0 c2 KE:
This total energy is equal to mc2 ; thus,
mc2 m0 c2 KE:
The relativistic mass, m, of an object equals its rest mass m0 when
the object is at rest with respect to an observer; that is, when
the kinetic energy is zero. When the object is in motion relative
467
SUPERSTRINGS AND OTHER THINGS
to the observer, the relativistic mass increases. Einstein expressed
this relativistic mass as
m0
m p
:
1 ÿ v2 =c2
This expression tells us that when the speed of the object
approaches that of light its mass becomes in®nite. This result is,
of course, a consequence of the postulates of special relativity; it
places a limit on the speed of any object. Regardless of the force
applied on the object or the time this force is applied to the
object, it can never reach the speed of light. Mass, as we learned
early on in the book, is a measure of the inertia of a body; that is, a
measure of the resistance that the body presents to a change in its
state of motion by the action of an applied force. As the speed of
the object increases, its resistance to the applied force or inertia
increases, approaching in®nity as the object's speed approaches
that of light.
Mass is not the same as ``quantity of matter.'' The relativistic
increase in mass does not imply an increase in the object's size or
quantity of matter. As the object increases its speed, it does not
become larger or denser; it merely becomes more dif®cult to
move. This increase in the object's inertia is what we understand
by an increase in mass.
468
20
THE GENERAL
THEORY OF
RELATIVITY
The principle of equivalence
Postulate 1 of the special theory, the Principle of Relativity,
states that the laws of physics are the same for all inertial (nonaccelerated) observers. This postulate, as we saw in chapter 19,
is an extension of the Galilean Principle of Relativity that includes
not only mechanics but also electromagnetism and optics; that is,
all of physics at the time. This postulate means that there is no
absolute inertial reference frame and uniform motion has to be
referred to an inertial frame. No experiment performed inside a
closed chamber in an inertial reference frame can reveal to us
whether the chamber is at rest or in motion.
In 1907, while preparing a comprehensive paper on the
special theory of relativity, Einstein asked himself whether the
postulate of relativity could be extended to non-inertial reference
frames. This would mean that accelerated motion is also relative
and therefore impossible to detect from inside. But we all know
that accelerated motion can be detected from inside. If we are in
a smoothly-moving vehicle, we feel no motion; but if the vehicle
lunges, we feel it immediately. We feel ourselves being pressed
against the back of our seat when the vehicle accelerates and
feel the pressure on our shoulders from the sides of the vehicle
when it turns suddenly. Acceleration, Newton taught us, can be
detected and therefore cannot be relative. Only uniform motion
is relative. Einstein would not accept a partial relativity and
kept looking for the way out. ``I was sitting in a chair in the
patent of®ce in Berne, . . . [t]hen there occurred to me . . . the
happiest thought of my life, in the following form,'' he wrote
later in a paper intended for the journal Nature, which owing to
469
SUPERSTRINGS AND OTHER THINGS
its length was never published. He continued:
The gravitational ®eld has only a relative existence in a way similar to
the electric ®eld generated by a magnetoelectric induction. Because
for an observer falling freely from the roof of a house there
exists ± at least in his immediate surroundings ± no gravitational
®eld. Indeed, if the observer drops some bodies then these remain relative to him in a state of rest or of uniform motion, independent of their
peculiar chemical or physical nature (in this consideration the air
resistance is, of course, ignored). The observer therefore has the
right to interpret his state as 'at rest.'
[. . .]
The experimentally known matter independence of the acceleration of
fall is therefore a powerful argument for the fact that the relativity
postulate has to be extended to coordinate systems which, relative
to each other, are in non-uniform motion.
Accelerated motion cannot be distinguished from inside
from the effects of gravity. In another paper published that
same year in which he completed the formulation of the equivalence between mass and energy, Einstein also presented a Gedanken or thought experiment to illustrate the relativity of accelerated
frames of reference. Imagine, he proposed, a laboratory in space.
We could imagine that this laboratory is inside a spaceship, far
from any gravitational body (®gure 20.1(a)). The spaceship
begins to accelerate and the scientists inside the laboratory
obviously feel this acceleration. The scientists perform simple
experiments inside their laboratory to determine the value of
this acceleration. Let's suppose that the scientists ®nd that the
spaceship is accelerating at a rate of 9.8 m/s2 . The scientists in
this laboratory feel the sensation of weight. One of the scientists
holding a ball in her hands decides to release it. The ball is now
a free body, not in contact with the accelerating spaceship and
we may think it is the ¯oor of the spaceship that rushes up to
meet the ball. With respect to the spaceship the ball moves in
the direction the scientists would call ``down,'' accelerating at
9.8 m/s2 . The scientists perform other simple experiments, dropping objects of different masses which fall to the ¯oor at the same
rate and conclude that the spaceship continues accelerating at the
constant rate of 9.8 m/s2 so that everything behaves as if the
laboratory were back on the ground. The following day, the scientists resume their experiments and conclude that the spaceship
470
The General Theory of Relativity
Figure 20.1. (a) No experiment performed inside a laboratory in space
accelerating at 9.8 m/s2 can allow us to distinguish it from an experiment
carried out at a laboratory on the ground on earth, as shown in (b).
continues accelerating at a rate of 9.8 m/s2 . However, when they
look outside, they are surprised to ®nd that, while they slept, the
spaceship had landed on Earth and that they are actually on the
ground (®gure 20.1(b)).
This thought experiment illustrates what Einstein called the
Principle of Equivalence: that it is impossible to distinguish accelerated
motion from the effects of gravity. In this sense, the effects of a uniform constant acceleration are equivalent to the effects of gravity.
Inside the accelerating spaceship the scientists feel real gravity; it
is not merely a simulation of gravity. As Einstein said, the ``gravitational ®eld has only a relative existence;'' it exists while the
acceleration exists. When the ball is dropped inside the laboratory
471
SUPERSTRINGS AND OTHER THINGS
in space, it ceases to be in contact with the accelerating spaceship
and stops accelerating. When we drop a ball while standing on
the ground on Earth, it falls. But while the ball is falling, there
is no gravity; the ball merely ¯oats in space. It is not dif®cult to
imagine the spaceship accelerating up to meet the ¯oating ball,
but to imagine the Earth moving up to make contact with the
ball seems ludicrous. After all, the Earth does not move relative
to the solar system every time an object falls to the ground.
However, relative to the ball, the Earth, the solar system and
the entire universe are accelerating toward it. Gravity is precisely
equivalent to accelerated motion.
Einstein's principle of equivalence extends the relativity
principle to accelerated frames of reference. Because it generalizes the relativity of inertial frames of reference to include
non-inertial frames, Einstein called the new theory the General
Theory of Relativity.
Implicit in the principle of equivalence is the assumption that
all objects fall to the ground with the same acceleration, regardless of their masses. If this were not the case, the effects of gravity
would be distinguishable from those of acceleration. The ¯oor of
the accelerating spaceship in our thought experiment would
make contact simultaneously with two objects of different
masses released at the same distance from the ¯oor. These two
objects would not fall to the ¯oor simultaneously when the spaceship is back on the ground if the acceleration due to gravity were
not the same for all objects. As we saw in chapter 3, Galileo
showed experimentally that all objects fall to the ground with
the same acceleration. Newton used the concept of mass in two
different contexts: as the measure of inertia, the response of an
object to a given force according to Newton's second law,
F ma, and as a measure of the gravitational effect on the
object. One is the inertial mass and the other, the gravitational
mass. The gravitational force acts in exactly the right proportions
on objects of different masses. If the mass is doubled, the gravitational force doubles. Because the inertial mass doubles, the resistance to this double gravitational force also doubles, producing
the same acceleration as before. Newton tacitly made inertial
mass equal to gravitational mass by assuming Galileo's law that
all objects fall with the same acceleration regardless of their
composition or structure.
472
The General Theory of Relativity
Einstein, with his principle of equivalence, made Galileo's
law the foundation of his general theory of relativity. The principle of equivalence can be then stated in the alternative form:
gravitational mass and inertial mass are equivalent and no experiment
can ever distinguish one from the other. Between 1889 and 1922, the
Hungarian physicist Roland von EoÈtvoÈs performed the series of
experiments that we mentioned at the end of chapter 6 which
showed that inertial mass and gravitational mass were the
same to a few parts in a billion. In the 1960s and 1970s R H
Dicke at Princeton University and Vladimir Bragisnky at
Moscow State University performed experiments that showed
that inertial and gravitational masses were equal to 1 part in
100 billion and 1 part in a trillion, respectively. However, in
1988, a reanalysis of EoÈtvoÈs's experiments seemed to indicate a
slight discrepancy between inertial and gravitational mass and
the existence of a ®fth force, with a strength of about one-®ftieth
that of the gravitational force, was postulated. More recent
experiments performed by scientists from the Joint Institute for
Laboratory Astrophysics in Colorado and from the Los Alamos
National Laboratory in New Mexico, cast some doubts about
the existence of a new force or about any discrepancy in the
inertial and gravitational mass. Sophisticated experiments are
currently underway that will someday clear the situation.
Warped spacetime continuum
Sometime after introducing the principle of equivalence, Einstein
realized that the usual understanding of the geometry of space
should be re-examined. We may illustrate this with an example
used by Einstein. Take a disc or wheel spinning around its
center. From elementary geometry, we know that if we measure
the circumference of the disk and divide it by its diameter, we
obtain the value . However, according to special relativity, if
we measure the circumference of the disk while it is rotating,
we obtain a smaller value than if we measure it when the disc is
at rest relative to us. The reason for this discrepancy is that a
line segment along the circumference of the rotating disc, in
motion relative to us, will undergo a length contraction. The
diameter is not changed because it is at right angles to the velocity
473
SUPERSTRINGS AND OTHER THINGS
of any point on the perimeter of the circle. Therefore, the ratio of
the circumference to the diameter of a rotating circle will be less
than . This means that the geometry of space for accelerating
reference frames is no longer the Euclidean geometry we learned
at school, where Newton's laws hold, but a different, curved,
geometry; Riemannian geometry.
In the formulation of the general theory of relativity, Einstein
found it necessary to consider a curved four-dimensional spacetime to describe the gravitational ®eld. The curvature of a threedimensional space is hard for us to imagine. Much harder is, of
course, to visualize a curved four-dimensional spacetime. However, we can start slowly and build up a good understanding of
it. The idea of a four dimensional spacetime in itself is not hard
to understand. In fact, we already use this idea in our daily
lives. Suppose that on 12th October we agree to meet a friend
for lunch at a restaurant downtown in exactly one week. We
®rst need to specify three numbers for the location in space of
this restaurant. We would need to transport ourselves to 5th
Street and 3rd Avenue and take the elevator to the 7th ¯oor
where the restaurant is located. This knowledge, however, is not
enough for us to meet our friend. We need a fourth number. It
will not do us any good to appear at the restaurant on 15th October
at lunch time because our friend will not be there on that day. In
addition to specifying the location in space, the where, we need
to specify the time, the when. We need to appear at the restaurant
at the corner of 5th Street and 3rd Avenue on the 7th ¯oor on 19th
October at noon (®gure 20.2). These four numbers constitute a
four-dimensional description of the event in spacetime.
The mechanics of Newton can very well be expressed using a
four-dimensional spacetime formalism, but we would not gain
much from doing so. The reason is that in Newtonian mechanics,
time is absolute and all we would gain from this description is
that the time measured by one observer is the same as the time
measured by any other observer. In relativity, the time measured
by two observers in motion relative to one another is different, so
that treating time on a more equal footing with the space coordinates is advantageous. If our friend meeting us for lunch is actually traveling on a spacecraft at a great speed, the time interval of
one week between our agreement to have lunch and the lunch
date will be different for our friend. It was Hermann Minkowski,
474
The General Theory of Relativity
Figure 20.2. The description in spacetime of our meeting with a friend
for lunch requires four numbers; three for the spatial location and one for
the time of the event.
one of Einstein's professors at the ETH, who in 1907 developed
the mathematical formalism of a spacetime continuum as the
underlying geometry behind the space and time relationships
in the theory of relativity.
Einstein proposed that the spacetime continuum is curved.
To understand this curvature of spacetime, let's consider ®rst
the curvature of two-dimensional space. In 1880, a noted Shakespearean scholar, Edwin A Abbott, wrote what later became a
classic science-®ction story, Flatland, a fantasy of strange places
inhabited by two-dimensional geometrical ®gures. Flatland tells
the dif®culties of A Square, an inhabitant of this two-dimensional
world, in imagining a third dimension; a direction perpendicular
to the other two that are familiar to him (®gure 20.3). A Dutch
mathematician, Dionys Bruger, wrote in 1960 a sequel to Flatland,
Sphereland, a story of the dif®culties of Hexagon, grandson of
A Square, who also lives in a two-dimensional world. This
time, however, the world is not ¯at but curved. Like Hexagon,
475
SUPERSTRINGS AND OTHER THINGS
Figure 20.3. ``Look yonder, and behold your own Pentagonal house,
and all its inmates,'' Sphere, a visitor from the three-dimensional
world tells A Square on his ®rst trip outside Flatland.
we also ®nd it very dif®cult to imagine a curved four-dimensional
spacetime continuum.
For our analogy, the two-dimensional space shall be curved,
like Sphereland where Hexagon lived. The Spherelanders originally learned Euclidean geometry, which applied very well to their
immediate surroundings. They never noticed that their world
was not ¯at because Sphereland was very large and the curvature
was not noticeable at small scales. Therefore the interior angles of
a triangular parcel of land added to 1808, and squares had their
opposite sides parallel to each other. One day, however, a
group of their scientists decided to line up a large set of very
precise, straight rulers to form a very large triangle. After the
scientists measured the interior angles, they were surprised to
®nd that their sum added to 1908, not 1808 as they expected.
Adding more rulers to form a larger triangle yields still a greater
476
The General Theory of Relativity
Figure 20.4. The people of Sphereland cannot understand why the
interior angles of small triangles always add up to 1808 whereas for
very big triangles, the sum of the angles is larger than 1808. Eventually
they invent Riemannian geometry to describe their own curved world.
value for the sum of the interior angles; they obtain 1978 this time.
A third triangle, smaller than the ®rst gives the value of 1858.
``What is going on?'' they asked themselves. After much thought
and discussion it is discovered that they live in a world that is
curved. This curvature, however, is very dif®cult for them to
comprehend, because it does not take place in any one of the
two dimensions known to them and therefore it is completely
invisible to them. It occurs in a direction perpendicular to their
world, a third spatial dimension (®gure 20.4).
Our own story is very much like that of the Spherelanders,
except that it occurs in three rather that in two dimensions. On
a small scale, we cannot detect any deviations in our universe
from Euclidean geometry. Einstein, however, reasoned that our
universe is curved and that we should detect the effects of its
curvature on a large scale. From his principle of equivalence, he
knew that bodies of different masses and compositions must
fall with the same acceleration in a gravitational ®eld. This phenomenon has nothing to do with the falling bodies themselves
and should be an intrinsic property of space, in the same way
477
SUPERSTRINGS AND OTHER THINGS
that the curved triangles observed by the Spherelanders have
nothing to do with the rulers they use and are a property of
their space.
In general relativity, space is distorted or curved by the presence of a gravitational ®eld; that is, by the presence of a body.
A second body that enters the space in the vicinity of the ®rst
body experiences the distortions in spacetime produced by the
®rst body. Although Einstein arrived at this conclusion in 1907
by an analysis of his principle of equivalence, the full development of a gravitational theory required complicated mathematics
and several more years. However, a simple analogy will help us
visualize this. Imagine a rubber sheet stretched out ¯at; this is
the analog of our three-dimensional space without any masses
in its vicinity (®gure 20.5(a)). A billiard ball placed on this sheet
will make a dip so that a marble placed on the sheet near the
billiard ball will roll toward it (®gure 20.5(b)). If you roll the
marble on the sheet, it will follow a curved path around the dip.
The billiard ball is not pulling the marble; rather it has distorted
the rubber sheet in such a way as to de¯ect the marble. Similarly,
Figure 20.5. (a) A stretched rubber sheet is the analog of the threedimensional space far away from any mass. (b) A billiard ball placed
on it creates a dip that distorts or warps the sheet. A marble placed on
this warped sheet rolls toward the billiard ball.
478
The General Theory of Relativity
the Sun distorts or warps the space around it. In such a warped
space an object merely travels along a geodesic, which is the path
of minimum distance between two points. This path, the
``straightest'' possible path for a planet happens to be an ellipse.
The planet orbits the Sun in an elliptical orbit not because of a
gravitational force of attraction exerted on the planet by the Sun,
as Newton af®rmed, but because the mass of the Sun warps the
space around it, altering its geometry. For Einstein, there is no
``gravitational force;'' gravity is geometry.
The bending of light
Light always moves in straight lines. Since space is curved, a
straight line is the shortest path between two points, or a geodesic.
Thus, in our curved space a straight line is curved. Since the mass
of an object like the Sun curves the space around it, the curvature
of a light ray increases as it passes near the Sun (®gure 20.6).
Let's now return to our example of the accelerating spaceship.
Imagine that in an optics laboratory inside the spaceship a very
small round window has been installed in one of the walls.
When a beam of light from a star enters the small window, it
Figure 20.6. The path of the light from a star follows a geodesic. As it
passes near the sun where the space is distorted, its curvature increases
and the position of the star appears shifted when compared with its
position when the light passes away from the sun.
479
SUPERSTRINGS AND OTHER THINGS
Figure 20.7. A beam of light follows a straight line far away from the
presence of a large mass. When the beam enters the accelerating spaceship, it hits the opposite wall at a point slightly below.
crosses the laboratory and hits the opposite wall at a particular
location. The scientists inside the laboratory reasoned that since
they are far away from any large mass, the space should be
nearly Euclidean and the path of the light beam should be
nearly straight. However, because the spaceship is accelerating,
the beam does not hit the opposite wall at a point in line with
the small window and the star but slightly below (®gure 20.7).
The scientists undertake very precise measurements of this
slight de¯ection of the light beam and conclude that their spaceship is accelerating at 9.8 m/s2 .
Since in the general theory of relativity, gravitational ®elds
are locally equivalent to accelerations, the scientists expect the
light beam to be bent by the same amount in the presence of
the gravitational ®eld of the Earth, which produces the same
acceleration. When the scientists perform the same experiment
480
The General Theory of Relativity
Figure 20.8. The gravitational ®eld of the earth de¯ects the beam of
light by the same amount as the spaceship accelerating at 9.8 m/s2 .
after the spaceship has landed on Earth, they con®rm that the
de¯ection of the beam is exactly the same as when the spaceship
was accelerating in space (®gure 20.8). Einstein pointed out that
this is to be expected from the mass-energy equivalence given
by his E mc2 of the special theory.
Einstein predicted this effect in 1907 but saw no way of
designing an experiment to test it. In 1911, Einstein calculated
that a light beam from a star grazing the Sun would be de¯ected
a very small angle. ``A ray of light going past the Sun would
accordingly undergo de¯ection to the amount of . . . 0.83 seconds
of arc,'' he wrote towards the end of his 1911 paper ``On the In¯uence of Gravitation on the Propagation of Light,'' an intermediate
paper on his general theory. The dif®culty was that it is not
normally possible to see starlight and Sunlight at the same
time. But Einstein found a way. ``As the ®xed stars in the parts
of the sky near the Sun are visible during total eclipses of the
481
SUPERSTRINGS AND OTHER THINGS
Sun,'' he wrote at the end of his paper, ``this consequence of the
theory may be compared with experience. With the planet Jupiter
the displacement to be expected reaches to about 1/100 of the
amount given. It would be a most desirable thing if astronomers
would take up the question here raised. For apart from any theory
there is the question whether it is possible with the equipment at
present available to detect an in¯uence of gravitational ®elds on
the propagation of light.''
In fact, it was possible with the equipment of the day. In 1914
an expedition led by the German astronomer Erwin FinlayFreundlich went to Russia to observe a total eclipse of the Sun
and test Einstein's prediction. The outbreak of war found the
expedition already in Russia; the astronomers became prisoners
of war before they could make any measurements. They were
released after a few weeks, but without their equipment.
Although this outcome may seem unfortunate (it certainly was
unfortunate for the German astronomers), it was probably better
not to have been carried through. Had they been able to make
their measurements, they would probably have found a value
twice as large as what Einstein had predicted in his paper. The
reason is that Einstein was partially wrong in 1911.
It took Einstein four more years to develop the correct
general theory of relativity. Along the way, he had to master complicated mathematics and struggle with wrong leads and dead
ends. At the end, however, he obtained a physical theory of
incredible power and ``majestic simplicity,'' as Banesh Hoffmann,
one of his collaborators, described it. The new theory predicts a
de¯ection in the position of the stars whose light grazed the
Sun by 1.74 seconds of arc, twice his previous prediction. In
1919, Arthur Eddington, a British astronomer and member of
the Royal Society, organized an expedition to the Isle of Principe
in the Gulf of Guinea, off West Africa. In his Space, Time, and
Gravitation, Eddington wrote:
On the day of the eclipse the weather was unfavourable. When totality
began the dark disc of the Moon surrounded by the corona was visible
through cloud, much as the Moon often appears through cloud on a
night when no stars can be seen. . . .
There is a marvellous spectacle above, and, as the photographs
afterwards revealed, a wonderful prominence-¯ame is posed a hundred thousand miles above the surface of the Sun. We have no time
482
The General Theory of Relativity
to snatch a glance at it. We are conscious only of the weird half-light of
the landscape and hush of nature, broken by the calls of the observers,
and beat of the metronome ticking out the 302 seconds of totality. . . .
[Of the sixteen photographs obtained, only] one was found
showing fairly good images of ®ve stars, which were suitable for determination. . . . The results from this plate gave a de®nite displacement,
in good accordance with Einstein's theory and disagreeing with the
Newtonian prediction.
This con®rmation electri®ed the world and made Einstein into a
celebrity. The telegram with the news of the con®rmation was
sent to Einstein by way of Lorentz in Holland who sent it to
Einstein in Berlin. When the telegram arrived, Einstein read it
and passed it on to a student who was in his of®ce at the time.
The student was visibly excited, but Einstein exclaimed, ``I
knew that the theory was correct. Did you doubt it?'' The student
protested and said that she thought this was a very important
con®rmation of his theory. What would Einstein have said, she
asked him, if the results had been negative? ``Then I would
have been sorry for dear God. The theory is correct.''
The perihelion of Mercury
Newtonian mechanics tells us that the planets move in elliptical
orbits around the Sun. This is Kepler's ®rst law. In Newtonian
mechanics, space is ¯at. In general relativity, space is curved
and when an elliptical orbit is made to ®t into this curved
space, it becomes deformed in such a way that the planet does
not move along exactly the same path every time. Figure 20.9
illustrates what happens. Imagine that Newtonian space is represented by a ¯at sheet where the elliptical orbit has been drawn. To
represent the warped space of general relativity, we fold or cut a
wedge out of the paper and rejoin the edges. The elliptical orbit is
now deformed and the planet moves along a slightly shifted
ellipse. One way that astronomers use to express this is to state
that the perihelion ± the point in the orbit closest to the Sun ± precesses; that is, shifts with every revolution around the Sun. For
this reason, this phenomenon is called perihelion precession.
In 1845, the French astronomer Joseph Le Verrier analyzed
the orbit of Mercury and found that it did not close. Its perihelion
483
SUPERSTRINGS AND OTHER THINGS
Figure 20.9. (a) Elliptical orbit in ¯at Newtonian space. (b) Cutting a
wedge out of the paper removes part of the ellipse. (c) Rejoining the
edges to make a curved sheet introduces a deformation in the orbit
which results in precession.
precessed by 574 seconds of arc every century. Although Le
Verrier had taken into account the perturbations to the orbit of
Mercury due to the other planets, especially the closest ones,
Venus, Earth and Mars, and also that of Jupiter, the most massive
of the planets, the total contribution to the precession amounted
to 531 seconds of arc. There were 43 seconds of arc that could
not be explained with Newtonian physics. In November 1915,
while ®nishing his masterpiece, Einstein decided that a good
test for his new theory would be to calculate the orbit of Mercury.
From his computation, Einstein obtained a value for the perihelion precession of 573.98 seconds of arc, almost exactly what
observations had yielded. General relativity accounted for the
anomalous 43 seconds of arc that Newtonian mechanics could
not explain. ``For a few days, I was beside myself with joyous
excitement,'' he would write later.
More accurate experiments performed during the 1970s gave
a value for the anomalous part of Mercury's perihelion precession
of 43:12 0:21 seconds of arc per century, in complete agreement
484
The General Theory of Relativity
with the prediction of general relativity of 42.98 seconds of arc.
Because physics is never taken as gospel and even the masterpiece of such giants as Einstein is questioned, in recent years a
controversy has appeared regarding the in¯uence on Mercury
due to the slightly oblate shape of the Sun. Although the deviation from the spherical shape is only of 12 parts in a million, it
may be enough to change the anomalous precession to 40 seconds
of arc per century. The controversy has not yet been resolved.
The gravitational time dilation
According to special relativity, time in the moving reference
frame always ¯ows more slowly. The moving reference frame,
however, depends on the observer. Time runs more slowly ±
however small the change may be ± for the airplane pilot
according to us on the ground. According to the pilot, it is
we and the ground who are moving and it is our clocks that
run slow.
Einstein showed that, according to general relativity, time
runs more slowly in a gravitational ®eld. Consider again the
laboratory inside the accelerating spaceship. Imagine that there
are two very precise clocks inside the laboratory, one near the
¯oor and the other near the ceiling (®gure 20.10). These clocks
are interfaced to precision instruments that control the rate at
which the clocks oscillate. To compare the clocks, an electromagnetic signal oscillating in step with the clock near the ¯oor
is sent toward the clock near the ceiling. This signal, we know
from special relativity, travels at the speed of light c. Since the
spaceship is accelerating, the clock at the ceiling will move
away from the incoming signal, which is moving at a constant
speed c and the receiving equipment connected to this clock
will detect a lower frequency because of the Doppler effect. The
scientists in charge of this equipment conclude that the clock on
the ¯oor runs more slowly than theirs. To corroborate, they
send a signal in step with their clock in the direction of the
clock at the bottom. Again, due to the ship's acceleration, the
receiving equipment interfaced to the clock on the ¯oor moves
towards the incoming signal, which is then Doppler-shifted
toward a higher frequency. The scientists on the ¯oor agree
485
SUPERSTRINGS AND OTHER THINGS
Figure 20.10. Comparing clocks in an accelerating laboratory in space
and in a laboratory on the ground on Earth.
with the scientists running the experiment near the top; the clock
at the top runs faster.
By the principle of equivalence, when the spaceship is back
on the ground on Earth, the clocks must behave in the same
way; the clock at the top runs faster than the clock at the
bottom of the laboratory. In a gravitational ®eld, then, clocks
run more slowly and, since Einstein taught us that time is what
clocks measure, time runs more slowly. This was the ®rst of
Einstein's great predictions. Einstein also showed us how this
prediction could be tested. Instead of comparing the rates of
clocks in different positions in a gravitational ®eld, we could compare the rates or frequencies of oscillation of the light emitted by
atoms also in different locations. There is no distinction between a
clock and the oscillations of an electromagnetic signal; modern
atomic clocks are based on the emission of light with a constant
frequency. In 1907, Einstein proposed to test his prediction by
comparing the frequencies of the light from atoms on the Sun
with the frequencies of light from the same atoms on the Earth.
Because the gravitational ®eld of the Sun is stronger, a clock
486
The General Theory of Relativity
near the Sun will run more slowly than on Earth and the
frequency of light from the Sun will be lower. Since this effect
is observed as a shift of the spectral lines of the Sun's light
toward lower frequencies ± the red end of the visible spectrum
± it is called the gravitational red shift.
The test of the gravitational red shift using Einstein's method
had to wait until the 1960s and 1970s, when better instruments
allowed scientists to separate this effect from other complicated
effects such as convection currents of gas in the Sun and the
effects of pressure, all of which cause the spectral lines to shift
in different ways. Modern results agreed with Einstein's prediction of about 5 percent shift of the spectral lines. An accurate
test of the gravitational red shift had been done at Harvard in
1960 in an experiment similar to our thought experiment when
the spaceship is back on the ground. The laboratory was the 24meter tall Jefferson Tower of the physics building with a detector
placed at the top of the tower and a source of gamma rays, radioactive cobalt, at the bottom of the building. When the detector
was placed near the source, the gamma rays were absorbed.
With the detector at the top of the building, the gamma rays
were Doppler shifted toward the red due to the small change in
Earth's gravity and the detector did not absorb them. When the
detector was moved downward to produce a blue shift that compensated for the Earth's gravitational red shift, the gamma rays
were absorbed. The Doppler shift measured in this experiment
was extremely small; only three parts in a quadrillion (1015 ).
The gravitational red shift, although very dif®cult to measure, is fairly easy to calculate. If the laboratory in the spaceship
of our thought experiment accelerating at g has a height h then
the electromagnetic signal takes a time t h=c to travel from
the laboratory's ¯oor to the ceiling. During this time, the elevator
has increased its speed by
v gt
gh
:
c
Although we will not do it here, the algebraic steps necessary to
extend the expression for the Doppler effect obtained in chapter
16 to include light are not very dif®cult. In Doppler observations
of light, it is the wavelength rather than the frequency that is
more often measured and thus we replace f by c=. If v < c, the
487
SUPERSTRINGS AND OTHER THINGS
magnitude of the Doppler wavelength shift is
v gh
2:
c c
More direct experiments to detect the gravitational time delay
have also been done using two cesium-beam atomic clocks, one
on the ground and the second one on a jet plane ¯ying overhead.
The frontiers of physics: Orbiting clocks
According to special relativity, the time interval between two
events is different for different inertial observers. Time in the
moving reference frame always ¯ows more slowly. This
phenomenon is known as time dilation. General relativity
predicts that time runs more slowly in a gravitational ®eld.
Special relativity predicts that a clock on an orbiting
satellite would run more slowly than one in its ground tracking station due to the signi®cant velocity of the satellite relative to the ground. However, because the satellite is farther
away from the center of the Earth, the gravitational ®eld is
weaker there and general relativity predicts that the satellite-based clock would run faster. For a satellite with an
orbital radius of at about 1.5 times the Earth radius, the two
effects cancel.
The satellite NAVSTAR 2, a ¯eet of military satellites
which are part of the US Global Positioning System, have
an atomic clock on board and orbit the Earth at an altitude
of about 20 000 km. This orbit is about 4.2 times the Earth's
radius, and the time delays due to its speed relative to the
ground and its position in the weaker gravitational ®eld of
the Earth do not cancel out. The clocks on the NAVSTAR 2
satellites run faster than ground clocks. In a single day, the
NAVSTAR 2 clocks will be 38.5 microseconds faster and the
satellites clocks must be adjusted accordingly. If they were
left alone, navigational errors would average about 11 km
per day. The correction, calculated with both the special
and general relativity predictions, allow the satellites to
provide navigational positioning with an accuracy of a few
meters.
488
The General Theory of Relativity
In an experiment conducted in 1971 by scientists of the US Naval
Observatory and Washington University in St. Louis, the results
agreed with the predictions of general relativity. In 1976, a Scout
D rocket carrying a hydrogen maser clock was launched by
NASA for a two-hour suborbital ¯ight to an altitude of
10 000 km. This clock is based on the transitions between two
energy levels in hydrogen that emit light with frequency of
1420 MHz. After two years of analysis of the data gathered
during this two-hour experiment, Robert Vessot and Martin
Levine of the Smithsonian Astrophysical Observatory at Harvard
and their collaborators at NASA, who had designed the experiment, reported that the frequency shifts observed agreed with
the theoretical predictions of general relativity to a precision of
70 parts per million.
Black holes
In the constellation of Orion there is a large cloud of gas, the
Orion Nebula, located in the ``sword'' of Orion (®gure 20.11).
The Orion Nebula is located within our galaxy, the Milky Way,
at a distance of about 1500 light years. In this large cloud, about
15 light years across, stars are forming right now. The total
mass of the Orion Nebula is suf®cient to form as many as
100 000 stars. The density of this nebula is about 1000 atoms per
cubic centimeter. Within this cloud, regions of particularly high
density fragment themselves into large balls of gas called protostars. Gradually, the protostar shrinks, due to gravitational
forces, becoming denser. As we know, whenever a gas becomes
more dense, its temperature rises. Eventually, the temperature
in the core of the protostar is high enough for nuclear fusion to
begin; the star is born.
A star the size of the Sun will shine for about 10 billion years
before exhausting its fuel. A larger star, one with a mass ®fty
times the mass of the Sun, will spend its fuel much more rapidly;
it will live for only a few million years. After about 10 billion
years, a star like the Sun, or one with a mass smaller than 4
solar masses, lacking hydrogen to fuse into helium and produce
heat in its core, begins to contract because the core no longer
has enough pressure to hold the overlying layers against the
489
SUPERSTRINGS AND OTHER THINGS
Figure 20.11. The Orion Nebula, a birthplace of stars. (Courtesy C R
O'Dell and S K Wong, NASA, Space Telescope Science Institute.)
force of gravity. This contraction heats up the core again which
starts to radiate this energy out toward the outer layers of the
star. This increase in energy goes into expanding these outer
layers and the star swells up. The total energy of the star is
now distributed over a much greater surface area with the
result that the surface temperature decreases. The star becomes
a red giant.
When the temperature of the core is hot enough, nuclear
fusion of helium nuclei into carbon begins. Eventually, all the
helium is used up and the core cools down once more. The red
giant collapses into a white dwarf which will radiate energy for a
490
The General Theory of Relativity
few billion years. Finally, when all its energy is exhausted, the
star ends up its life as a burned-out mass called a black dwarf. It
is believed that the universe is not yet old enough for the ®rst
black dwarfs to have formed.
More massive stars have a more dramatic end. The outer
layers of a star with about 8 solar masses will explode into a supernova. In 1987, the University of Toronto astronomer Ian Shelton,
working at Las Campanas Observatory in Chile, discovered a
supernova in one of the satellite galaxies of the Milky Way; the
Large Magellanic Cloud (®gure 11.6). Although the outer layers
of the star ¯y apart at great speeds, the core of the star, however,
collapses. Due to the enormous pressures, the electrons are
pushed into the nucleus where they combine with protons to produce neutrons and neutrinos. The neutrinos escape and the core
is now an incredibly dense sphere of neutrons. This is called a
neutron star. The density of this star is so great that a neutron
star with the mass of the Sun will be only about 20 km across.
If a neutron star has a mass of about three solar masses, there
is no force that can stop its collapse and the material of the star is
compressed to a state of in®nite density. At this stage, nothing is
left of the star except an intense gravitational ®eld; the star has
become a black hole. As the star collapses, the curvature of
spacetime becomes increasingly pronounced so that the light
rays from the star are de¯ected more and more. At some point
during the collapse, the curvature of spacetime is so severe that
no light can escape and the body appears black (®gure 20.12).
Since nothing can travel faster than light, nothing else can
escape from the collapsed star.
Figure 20.12. Graphic representation of the curvature of spacetime
around a black hole.
491
SUPERSTRINGS AND OTHER THINGS
At what point in the collapse of a star is the curvature of
spacetime severe enough to impede light from escaping? A few
years after Einstein published his general theory of relativity
the German astrophysicist Karl Schwarzschild calculated this
critical size, now called the Schwarzschild radius. We can see
how this radius can be calculated in terms of the concept of
escape velocity. Recall from chapter 4 that if an object of mass m
is raised through a height h, the gravitational potential energy
is increased by PE mgh. This expression is only valid if the
acceleration due to gravity g is assumed constant. This, in fact,
is not so, as we can see from Newton's law of universal gravitation; g diminishes as 1=r2 , where r is the distance from the
center of the Earth (see chapter 6). When h is small compared to
r, the expression mgh for the change in potential energy is a
good approximation. If a force F is used to slowly displace an
object of mass m upwards a distance h without changing its
kinetic energy, this force must be equal and opposite to the
weight Fgrav of the object. Then the increase in potential energy
is equal to the work done on the object, Fh; that is,
PE Fh ÿFgrav h
where Fgrav , the weight of the object, is the gravitational force that
the Earth, mass M, exerts on the object of mass m, or
Mm
Fgrav G 2 :
r
Using the methods of calculus, it can be shown that the gravitational potential energy of an object of mass m at a distance r
from a mass M is
PE ÿG
Mm
:
r
The escape velocity ve of an object of mass m on the surface of the
Earth (at a distance R from the center of the Earth) is the minimum velocity that we must impart to the object so that it escapes
the gravitational grasp of the Earth. It the object barely moves
after it leaves the gravitational ®eld of the Earth, then both its
kinetic energy and potential energy must be zero. Then
Mm
2
1
ÿG
mv
0:
e
2
R
492
The General Theory of Relativity
The escape velocity is then
r
2GM
:
ve
R
Substituting the mass and radius of the Earth, one ®nds that the
escape velocity is 11.2 km/s. For the Moon, the escape velocity
is 2.3 km/s and for Jupiter, the largest planet in the solar
system, 60 km/s. That is, if we throw a ball upwards with an
initial velocity of 11.2 km/s (or about 40 000 km/h), the ball will
leave Earth and coast out toward in®nity (neglecting air resistance). On the Moon, we would need to impart an initial velocity
of only 2.3 km/s to accomplish the same feat.
Since light always travels at speed c, we could turn our
expression for the escape velocity around to ®nd the critical
radius within which the mass M of the star must be contained
so that light does not escape from it; that is, the Schwarzschild
radius. If we take v c in the escape velocity expression,
Schwarzschild radius RS is given by
RS
2GM
:
c2
We can see that the size of the Schwarzschild radius is proportional to the mass of the star. For a star with a mass similar to
that of the Sun, the Schwarzschild radius is about 3 km.
Although we derived this expression from Newton's
mechanics rather than from the general theory of relativity, as
Schwarzschild did, the expression is still correct. The behavior
of objects inside a black hole, however, cannot be explained in
terms of Newton's equations. As illustrated in ®gure 20.12, the
spacetime near a black hole becomes severely distorted. In 1965
the British physicists Stephen Hawking and Roger Penrose
showed that, according to general relativity, within a black hole
the curvature of spacetime becomes in®nite and this means that
the gravitational force becomes in®nite. This in®nite distortion
in spacetime is called a singularity.
The distance from the point of no return or event horizon ± at
the Schwarzschild radius ± to the singularity is not a distance in
space but rather an interval of time. In a sense, then, space and
time are interchanged inside a black hole. The gravitational
®eld inside a black hole changes with time, as detected by a
493
SUPERSTRINGS AND OTHER THINGS
hypothetical observer inside the black hole. From the outside,
however, it takes an in®nite time for the ®eld to change, which
is another way of saying that it does not change at all; the black
hole remains static for an outside observer. This strange phenomenon is due to the gravitational time delay. At the singularity,
time stops. If an astronaut were to travel into a black hole and
report to the space station in orbit around the black hole his location at ten-second intervals, the messages would at ®rst arrive at
The frontiers of physics: Spacetime drag
According to Einstein's general theory of relativity, a spinning object drags spacetime along with it. Until recently,
this prediction of general relativity had not been observed
since this effect is incredibly small near the Earth. Near a
very massive object, like a neutron star or a black hole, the
effect should be detectable. Since we can't travel to the
stars, the problem in this case is how to detect the spacetime
drag from afar. In November of 1997, two teams of astronomers measured the periodic vibrations of the X-rays emitted
by the gas falling into several rapidly spinning neutron
stars and black holes and found evidence of spacetime
dragging.
General relativity predicts that if the gas surrounding a
spinning star forms a disk that lies at an angle to the plane
of rotation, the disk will wobble like a spinning top. This
wobble is the cause of the oscillations observed by the astronomers. Wei Cui, of the Massachusetts Institute of Technology,
studied the X-rays from several candidate black holes using
NASA's Rossi X-ray Timing Explorer satellite and found
greater dragging around the black holes spinning more
rapidly (see color plate).
Luigi Stella of the Astronomical Observatory of Rome
and his colleague, Mario Vietri of the University of Rome,
studying data from the same NASA satellite, found similar
oscillations in the X-rays from several rotating neutron
stars. Both results were reported at the same meeting of the
American Astronomical Society in Colorado.
494
The General Theory of Relativity
ten-second intervals. Just before the astronaut crosses the event
horizon, the astronauts in the space station would notice the
intervals getting slightly longer. Once the astronaut has crossed
the event horizon, the messages take an in®nite time to arrive at
the space station; that is, they never arrive. The astronaut dutifully keeps sending his messages according to his watch, but
the signals cannot cross the event horizon and never reach the
space station. According to the people in the space station, the
astronaut's time has been gravitationally delayed. The signals
have been progressively red-shifted until they appear to have
an in®nite wavelength; the astronaut seems to have been frozen
at the event horizon, taking an in®nite time to fall through. To
the astronaut, time continues at its normal rate and nothing
changes, except the enormous tidal forces that tear his body
apart: this thought experiment would not be possible to carry
through with a live astronaut.
495
21
THE EARLY ATOMIC
THEORY
The physics of the atom
Physics underwent two great revolutions at the turn of the twentieth century, and Albert Einstein was in the middle of both. One,
the theory of relativity, developed almost single-handedly by
Einstein, changed our conception of space and time, as we saw
in the previous two chapters. The other, quantum physics, the
physics of the atom, which started with the work of Max
Planck and Albert Einstein, changed our understanding of the
nature of matter.
We studied the development of the early models of the atom
in chapter 7. In this and the remaining chapters, we see how this
development led to our current understanding of the nature of
matter, starting with the work of the two pioneers, Planck and
Einstein.
Black body radiation
We know from experience that a hot object radiates energy. An
electric stove has an orange glow when it is hot. But even cold
objects radiate energy. In fact, all objects glow, but unless the temperatures are high enough, the glow is invisible. Throughout the
nineteenth century, physicists knew that this type of radiation
was a wave phenomenon, more properly called electromagnetic
radiation. They also knew that the reason why we sometimes
see the glow is because some of the frequencies of this electromagnetic radiation happen to be the same as the frequencies of
light to which our eyes respond.
496
The Early Atomic Theory
Just before the beginning of the twentieth century, physicists
considered the problem of the radiation emitted and absorbed by
objects at different temperatures. When radiation falls on an
object, part is absorbed and the rest is re¯ected. If we place a
brick on a table in the middle of a room and wait a few minutes,
the brick will absorb energy from the surroundings at the same
rate that it re-emits it back to the surroundings. The brick is
in thermal equilibrium with the surroundings. If the brick, in
thermal equilibrium with the surroundings, were to absorb
more energy than it emits, it would become spontaneously
warmer than its surroundings.
A light-colored object re¯ects more of the radiation falling
upon it than a dark-colored object. Since an object in thermal
equilibrium with its surroundings absorbs and emits energy at
the same rate, a dark-colored object, being a good absorber of
energy, is also a good emitter. An imaginary body that absorbs
all radiation incident upon it is called an ideal black body. Ideal
black bodies, then, are perfect absorbers and perfect emitters.
A good approximation to a black body is a hollow block with a
small hole, as shown in ®gure 21.1.
When scientists measured the radiation emitted by this black
body at different wavelengths, they found that the experimental
results could not be explained with what was known at the
time about the nature of radiation. Scientists had proposed that
the light emitted by an object was formed by the continuous
changes in energy of charged particles oscillating within the
matter. When they calculated the energy emitted in a continuous
Figure 21.1. A cavity as an approximation to a black body. Radiation
entering through the hole has a very small chance of leaving the cavity
before it is completely absorbed by the walls of the cavity.
497
SUPERSTRINGS AND OTHER THINGS
Figure 21.2. Black body radiation curve representing the measured
values of the energy radiated at different wavelengths. The broken line
shows the calculations based on what was known at the time. The calculations do not agree with the measurements at ultraviolet wavelengths.
way by these oscillating particles, their results agreed with the
experimental measurements only for very long wavelengths
(®gure 21.2). At short wavelengths their theoretical calculations
predicted that the energy radiated should become larger and
larger (broken line in ®gure 21.2), whereas the experiments
showed that at those wavelengths the energy should approach
a value of zero, an anomaly that became known as the ``ultraviolet catastrophe''.
In 1900, the German physicist Max Planck was able to solve
the problem. He did so by making an assumption which seemed
very strange, and which Planck himself could not understand
initially. He assumed that the energy emitted by the charged
particles in this black body could only have certain discrete
values. The oscillating charged particles lose or gain energy, not
in a continuous way, but in discrete jumps. In other words, the
energy was radiated in little bundles or packets. Furthermore,
the energy of radiation was directly proportional to the frequency
of radiation. Planck called these bundles of energy quanta. His
formula is
E hf
498
The Early Atomic Theory
where h is a constant, called Planck's constant, and f is the frequency. The value of Planck's constant is
h 6:626 10ÿ34 Js 4:136 10ÿ15 eVs:
According to Planck's equation, the energy of each quantum
depends on its frequency of oscillation. Since the value of
Planck's constant h is very small, the quantum idea becomes
important only in systems where the energies are very small, as
is the case with atoms. Planck's equation not only ®ts the
observed black body radiation curves but also settles the ``ultraviolet catastrophe'' argument. At high frequencies, corresponding
to short wavelengths, radiation consists of high-energy quanta,
and only a few oscillators will have that much energy, so only a
few high-energy quanta are emitted. This is the left part of the
radiation curve (®gure 21.2), where the values of the energy
radiated are very small, approaching zero. At low frequencies,
corresponding to long wavelengths, the quanta have low energies
and many oscillators will have these energies, resulting in the
emission of many low-energy quanta, but they each have so
little energy that even added together, the emitted radiation
does not amount to much. This corresponds to the right side of
the radiation curve in ®gure 21.2, where the energies have
small values. It is only in the middle frequencies where there
are plenty of oscillators with middle wavelengths that have
enough energy to emit quanta of moderate size which, when
added together, produce the peak in the radiation curve. Planck's
insight was to realize that different energies were associated with
different wavelengths, instead of assuming that the energy was
equally distributed between different wavelengths, as the previous radiation theories had proposed.
The photoelectric effect
About the same time that Planck was struggling with the problem
of black body radiation, other physicists were trying to understand a seemingly unrelated problem, known as the photoelectric
effect. It had been observed by Heinrich Hertz in 1887 that when
ultraviolet light shone on certain metallic surfaces, electrons were
ejected from those surfaces (®gure 21.3), provided that the frequency
499
SUPERSTRINGS AND OTHER THINGS
Figure 21.3. The photoelectric effect: (a) Dim UV light of high frequency
produces few fast electrons. (b) Intense high frequency UV light produces
many fast electrons. (c) Dim UV light of lower frequency produces few
slow electrons. (d) Intense UV light of lower frequency produces many
slow electrons. (e) Red light (much lower frequency) produces no
electrons.
of the radiation exceeded a critical threshold. The number of electrons
emitted was proportional to the intensity of the light, as was
expected. However, the kinetic energies of the emitted electrons
were not in any way related to the incident light intensity. This
was quite puzzling.
In 1905, Albert Einstein, considered the problem of the
photoelectric effect. Einstein assumed that the quanta of energy
that Planck had introduced to explain the black body radiation
were also characteristic of light rather than a special property
500
The Early Atomic Theory
related only to a single phenomenon. Thus, he assumed that
light also consisted of discrete quanta of energy. When one of
these quanta, or photons as he called them, struck the surface of
the metal, all its energy was transmitted to an electron in the
metal. Thus, an electron could only absorb energy in packets or
bundles of energy, so that a light of greater intensity did not
provide more energy for the electron to absorb, just more
photons per second, and the chances of a single electron getting
hit twice was extremely unlikely. However, a more intense
beam, having more photons, would strike more electrons in the
metal.
An electron would require a minimum amount of energy to
break away from the surface of the metal. This minimal energy
was called the work function (the Greek letter phi). Because
Einstein assumed that Planck's radiation formula was universal,
the energy of the photons was also given by E hf , so that a
minimum value of the energy implied a minimum value of the
frequency. This explained the threshold frequency below which
no electrons could be emitted.
Einstein's theory of the photoelectric effect accounted for all
the puzzling experimental observations. According to Einstein,
when a photon of energy E hf strikes an electron in a metal,
the electron absorbs all the energy of the photon. The energetic
electron then starts to make its way toward the surface of the
metal. From the principle of conservation of energy, we know
that if the energy E gained by the electron is larger than the
work function of the metal, the electron reaches the surface
and escapes. The energy that remains is the kinetic energy of
the ejected electron. That is,
energy gained from the photon energy to leave the metal
kinetic energy,
or
hf energy to free the electron KE:
Since the work function is the minimal energy required to free
the electron, some of the electrons leave the metal with kinetic
energy as great as
KEmax hf ÿ :
501
SUPERSTRINGS AND OTHER THINGS
Table 21.1. Work functions of some elements.
Element
(eV)
Aluminum
Calcium
Cesium
Germanium
Iron
Platinum
Potassium
Sodium
Tungsten
4.20
2.71
1.96
4.61
3.91
5.22
2.24
2.28
4.54
This expression is Einstein's equation for the photoelectric effect.
In Table 21.1 we list the work functions of some elements.
The photoelectric effect has practical applications in the
detection of very weak light. Some sensitive television cameras
make use of photomultiplier tubes in which an incident photon
ejects an electron from a photosensitive area at the front of the
tube. The ejected electron is accelerated by an electric ®eld, then
strikes a metal surface, producing the emission of several more
electrons, which in turn strike additional electrons from further
metal surfaces. The result is a cascade of electrons down the
tube (®gure 21.4). A high-gain photomultiplier tube can transform the arrival of a single photon at its face into the emission
of a billion electrons, which can easily be detected.
If light consists of discrete quanta of energy or photons, then
light has particle properties. But light also exhibits wave properties, as we have seen. Can light be both a particle and a wave?
These two concepts are mutually contradictory. Either light is a
particle or it is a wave. It cannot be both, in much the same
way that a person cannot be big and small at the same time.
Einstein's theory introduced a con¯ict in physics. It took about
twenty years for this con¯ict to be resolved.
In this crucial year of 1905, Einstein published in three major
papers his Special Theory of Relativity, his explanation of a
phenomenon known as Brownian motion, and his explanation
of the photoelectric effect. In light of these discoveries, the year
1905 is known in the scienti®c world as Einstein's annus mirabilis.
Although by the time Einstein received the Nobel Prize he had
502
The Early Atomic Theory
Figure 21.4. In a photomultiplier tube, an incident photon strikes an
electron from a photosensitive area near its face. This electron in turn
strikes other electrons from subsequent positively-charged metal
surfaces, ejecting them. A cascade of electrons is produced.
also published his General Theory of Relativity, perhaps the
greatest intellectual achievement of all time, he received the
Prize for his explanation of the photoelectric effect. It is believed
that the Nobel committee thought at the time that the theory of
relativity was too controversial.
The Bohr model of the atom revisited
As we learned back in chapter 7, the Danish physicist Niels Bohr
used Einstein's idea of the quantum of energy to construct a
model of the atom which explained the observed spectrum of
the hydrogen atom very well. We saw in chapter 18 that the emission and absorption lines of the hydrogen spectrum are produced
by photons emitted or absorbed in the electronic transitions
between Bohr's allowed orbits. The energy of these photons is
given precisely by Planck's formula E hf .
If we call Ei the energy of the electron in the initial state, and
Ef its energy in the ®nal state, then Ei ÿ Ef hf is the change in
503
SUPERSTRINGS AND OTHER THINGS
Physics in our world: Using photons to detect
tumors
Shining a strong light on an object produces a sharp shadow.
This simple fact is the basis for transillumination, a technique
used in breast examinations. A tumor inside a woman's
breast casts a shadow when a strong beam of light is sent
through the soft tissue of the breast. The tissue itself, however, scatters some of the light, making it dif®cult to detect
tumors smaller than about a centimeter across.
Scientists have recently succeeded in developing a technique that might make it possible to detect very small tumors.
When light enters the soft tissue of the breast, some of the
photons travel in a nearly straight line, while the rest
bounce off the molecules of the tissue and follow a more
tortuous path. Since only the straight-line photons produce
a sharp image of the tumor, scientists need to separate the
straight-line photons from the rest.
(Courtesy Image Diagnostic Systems, Inc.)
The straight-line photons reach the other side earlier
than the rest. In other words, the direct photons will arrive
at the detector slightly ahead of the other photons. The time
difference between the two sets of photons is incredibly
small, of the order of a few picoseconds (trillionths of a
504
The Early Atomic Theory
second). Physicists at the University of Michigan have developed a fast gate that opens and closes at the rate required to
separate the two sets of photons when placed in front of the
detector.
Using this new technique, the scientists are able to detect
spots 200 micrometers (thousandths of a millimeter) across
inside samples of human breast tissue 3.5 millimeters thick.
The scientists' goal now is to increase the depth of tissue
and to reduce the size of the spots that they are able to detect.
energy of an electron jumping from level i to level f. We can illustrate the absorption and emission processes with the aid of an
energy-level diagram. Figure 21.5 shows the energy-level diagram for the hydrogen atom where various series of transitions
are indicated by vertical arrows between the levels, named for
the scientists who discovered them.
As we saw in chapter 18, in an energy-level diagram, the
letter n designates the energy level, beginning with n 1, the
ground state, the lowest stationary state for the electron. Since
only changes in potential energy have any meaning, any convenient zero-level for the energy can be chosen. When n 1
Figure 21.5. Energy level diagram for the hydrogen atom. Only the
Balmer series is in the visible range of wavelengths.
505
SUPERSTRINGS AND OTHER THINGS
the electron is so far from the nucleus that it does not feel its
attraction, and we take its interaction energy to be zero. The
energy of any other state, En , is the difference from this n 1
state. As the electrons jump to energy states that are closer to
the nucleus, their energies decrease and, as we have just seen,
they radiate this energy. Thus, the energies of these states are
less than the energy of the n 1 state, which has been assigned
the value of zero. To be less than zero, the energies of these states
must be negative. Bohr showed that the energy, En , of a particular
level n is
En ÿ
EGS
n2
where EGS 2:2 10ÿ18 J 13:6 eV is the lowest energy possible
for an electron in a hydrogen atom, its ground state energy. This
expression allows us to calculate the energy absorbed or emitted
when an electron jumps from one energy level to another, in
terms of the ground state energy.
Bohr's theory agrees with experiment only for the hydrogen
atom, and it does not work well for more complicated atoms.
Nevertheless, it was the ®rst theory that attempted to explain
the structure of the atom. It should be noted, however, that
Bohr constructed his theory to agree with experiment and could
not explain why the electrons did not radiate when they were
in the stationary orbits allowed by his theory, nor how the electrons absorbed or emitted energy when they underwent a transition between orbits. These were ad hoc assumptions. Ten years
later a bright graduate student in Paris was to introduce a new
idea that would become the basis for a completely different
kind of physics that would explain these assumptions.
De Broglie's waves
Louis Victor de Broglie was a graduate student at the University of
Paris looking for a dissertation topic. He was fascinated with the
wave-particle duality that Einstein had introduced and reasoned
that perhaps electrons could also exhibit particle and wave
characteristics. This problem became his dissertation research.
By 1924 de Broglie's dissertation was complete, but his advisor
506
The Early Atomic Theory
Figure 21.6.
De Broglie's standing waves for the allowed orbits.
was apprehensive about granting him his doctorate based on a
highly speculative idea. He decided to show the work to Einstein.
It was only when Einstein accepted the idea as ``sound'' that de
Broglie's degree was granted. In 1929 de Broglie received the
Nobel Prize for his theory.
De Broglie's theory explained the stationary orbits in Bohr's
model of the hydrogen atom by assuming that each electron orbit
was a standing wave. Since the orbits in Bohr's model are circles,
the electron wave must be a circular standing wave that closes
in on itself, as shown in ®gure 21.6.
The frequency and wavelength of electron waves in de
Broglie's theory are
f
E
h
and
h
mv
where m is the mass of the electron and v its velocity. In 1927, the
American physicists C J Davisson and L H Germer observed
interference patterns with electron beams, as if the electron
beam had wave properties (see ®gure 21.7). De Broglie's prediction was thus con®rmed.
The wave nature of electrons is the basis for the electron
microscope, invented in 1932 by Ernst Ruska in Germany. The
resolution of a microscope (that is, its ability to reproduce details
in a magni®ed image) is inversely proportional to the wavelength
of the light used to illuminate the object. Electron wavelengths
are much shorter than those of visible light. Since the energy of
the electrons is proportional to their frequency and inversely
proportional to their wavelength, fast electrons have shorter
wavelengths. ``Illuminating'' an object with electron waves in
an electron microscope, therefore, produces higher magni®cations than ordinary light. Some of the most powerful electron
507
SUPERSTRINGS AND OTHER THINGS
Figure 21.7. Diffraction pattern obtained by passing an electron beam
through a crystal. (From PSSC Physics Seventh Edition, by HaberSchaim, Dodge, Gardner, and Shore. Published by Kendall/Hunt
Publishing Company, 1991.)
Figure 21.8. Scorpion at 180 magni®cation. (Courtesy S McDanels,
NASA Kennedy Space Center.)
508
The Early Atomic Theory
microscopes achieve magni®cations of over 1 000 000 allowing
the observation of objects as small as 0.05 nm. Figure 21.8
shows a scorpion at 180 magni®cation.
Quantum mechanics
After de Broglie introduced his matter-wave hypothesis, things
began to move fast. The major problem that physicists were
trying to solve was that of the wave nature of the electron. In
1926 the Austrian physicist Erwin SchroÈdinger presented an
equation that described the wave behavior of the electron in the
hydrogen atom. Independently of SchroÈdinger, a twenty-fouryear old professor of physics at GoÈttingen by the name of
Werner Heisenberg undertook a purely mathematical treatment
of atomic particles. Heisenberg's approach was later proven
equivalent to SchroÈdinger's wave equation theory.
This new theory became known as quantum mechanics. In the
years that followed, many discussions took place between
Einstein, Bohr, Heisenberg, SchroÈdinger, and most of the great
physicists of the time, about the interpretation of quantum
mechanics, and especially about the meaning of the wave
equation. Soon they agreed that the waves associated with the
atomic particles like the electron did not represent the actual
motion of the electron. They quickly realized that the electron
neither acts like a wave nor was smeared out throughout the
wave. The physicist Max Born suggested instead that these
waves were waves of probability; that is, they represented the
probability of ®nding the electron in some region of space.
What is then the actual motion of the electrons around the
nucleus? Quantum mechanics tells us that this question cannot
be answered because, as Bohr pointed out, the electron is not
simply a particle, in the macroscopic sense. Thus, it is meaningless to ask how it moves from one point to another. The only
thing we can ask now is what is the probability of ®nding the electron at a certain location around the nucleus. After an observation, it is possible to determine that the electron occupied some
position in space. Another observation may locate the electron
at some other point. The path that the electron takes in moving
between those two points cannot be known. In fact, an electron
509
SUPERSTRINGS AND OTHER THINGS
has a well de®ned position only when it is being observed.
Position is not an intrinsic property of an electron.
As we can see, quantum mechanics has weird consequences.
In fact, weird is an adjective that is very often found associated
with quantum mechanics. That doesn't make it wrong, however.
Many real devices, like computer chips for example, are based on
our knowledge of quantum mechanics. Quantum mechanics
gives us a very accurate picture of the atomic world. The only
problem is that this picture is highly mathematical and impossible to translate into concrete images that we can visualize.
Furthermore, although most physicists agree that quantum
mechanics is correct (in the sense that the scienti®c method
allows a theory to be correct), its interpretation is still subject to
animated discussions not only among physicists and other members of the scienti®c community, but also among philosophers.
The best we can say is that quantum mechanics works, but
nobody really understands it.
510
22
QUANTUM
MECHANICS
The beginnings of quantum mechanics
Einstein's 1905 paper on the photoelectric effect, in which he
considered Planck's idea of quanta of energy to be a universal
characteristic of light, marks the beginning of quantum theory.
For over ten years Einstein stood alone on the issue of energy
quantization of light; and although in 1915 experimental con®rmation of his photoelectric effect came from painstaking experiments performed by Millikan, acceptance of the idea that light
was composed of particles was far from universal. Millikan himself wrote in 1915: ``Despite . . . the apparent complete success of
the Einstein equation, the physical theory of which it was
designed to be the symbolic expression is found so untenable
that Einstein himself, I believe, no longer holds to it.'' Einstein
did hold to it. It was only in the mid 1920s when Heisenberg,
SchroÈdinger, Dirac and others developed the ®rst forms of the
quantum theory, that serious discussions concerning the particle
nature of light and the meaning of the new theory began. Discussions about the meaning of quantum theory continue today.
The new mechanics of the atom
In the summer of 1922, Niels Bohr delivered a series of invited
lectures at the University of GoÈttingen in Germany. Attending
these lectures was a 22 year-old graduate student from the University of Munich, Werner Heisenberg. Bohr's lecture left a
profound impression upon the young student. ``I shall never
forget the ®rst lecture,'' Heisenberg wrote later. ``The hall was
511
SUPERSTRINGS AND OTHER THINGS
®lled to capacity. The great Danish physicist . . . stood on the platform, his head slightly inclined, and a friendly but somewhat
embarrassed smile on his lips. Summer light ¯ooded in through
the wide-open windows. Bohr spoke fairly softly, with a slight
Danish accent . . . each of his carefully formulated sentences
revealed a long chain of underlying thoughts, of philosophical
re¯ections, hinted at but never fully expressed. I found this
approach highly exciting; what he said seemed both new and
not quite new at the same time.''
The young student ``dared to make a critical remark'' during
one of the lectures. Bohr immediately recognized the brilliant
mind of the student and invited him after the lecture for a walk
over the Hain Mountain to further discuss his objections. At the
end of their long walk that afternoon Bohr suggested that Heisenberg join his Institute in Copenhagen after graduation. ``You must
pay us a visit in Copenhagen,'' Bohr told him, ``perhaps you
could stay with us for a term, and we might do some physics
together.'' ``My real scienti®c career only began that afternoon,''
wrote Heisenberg later, ``suddenly the future looked full of
hope and new possibilities, which, after seeing Bohr home, I
painted to myself in the most glorious colors all the way back
to my lodgings.''
The Niels Bohr Institute in Copenhagen was a place where
the young brilliant physicists from Europe, America, and the
Soviet Union would gather to study the problems of the atom.
There Heisenberg found the atmosphere he needed to unleash
his profound intellect. He had obtained his PhD in 1924, at the
age of 23, and had immediately joined Bohr at his Institute.
After almost a year with Bohr, Heisenberg returned to GoÈttingen
as an assistant to Max Born, then director of the Physics Institute.
At GoÈttingen, Heisenberg worked on the problem of atomic
spectral lines, trying to ®nd a mathematical expression for the
line intensities of the hydrogen spectrum. This attempt proved
unsuccessful, leading to ``an impenetrable morass of complicated
mathematical equations, with no way out.'' Nevertheless, something useful came out of this. Heisenberg decided that the
problem of the actual orbits of the electrons inside the atom
should be ignored; they have no meaning. Only the frequencies
and amplitudes associated with the spectral lines have any meaning, since they can be observed directly. Rather than the ``morass
512
Quantum Mechanics
of complicated mathematical equations,'' Heisenberg turned to a
much simpler system: a simple pendulum. The oscillations of a
pendulum could be used as a model for the atomic vibrations.
Toward the end of May 1925 a bout of spring hay fever made
Heisenberg so ill that he asked Born for a two weeks' leave of
absence. He went straight for Heligoland, a small rocky island
in the North Sea, near Hamburg, with fresh sea breezes, far
from the pollen of the meadows of the mainland. There Heisenberg had enough time to re¯ect upon his problem. ``A few days
were enough to jettison all the mathematical ballast that invariably encumbers the beginning of such attempts, and to arrive at
a simpler formulation of my problem.'' Lightning had struck. In
a few more days, he had some preliminary calculations which
looked quite promising. What he needed was the proof that his
new formalism presented no contradictions and, in particular,
that it obeyed the principle of conservation of energy. One
evening, he reached a point where the proof could be obtained.
Heisenberg, too excited to stop, worked well into the night. By
three o'clock in the morning, he had invented a new mechanics.
Returning to GoÈttingen, Heisenberg wrote up his paper,
®nishing it in July 1925. In his paper, he described the energy
transitions of the atom as an array of numbers and found rules
that these arrays must obey in order to calculate atomic processes.
Unsure that his paper was ready for publication, he showed it to
Born, who immediately recognized these arrays as matrices, and
the rules that they obeyed as the rules for matrix algebra. Born
sent the paper to Zeitschrift fuÈr Physik. Max Born enlisted the
assistance of his student Pascual Jordan and together they further
developed Heisenberg's theory. Matrices, it turned out, were the
correct tools to describe atomic processes.
The fundamental property of Heisenberg's new mechanics is
that measurable quantities like position and momentum are no
longer represented by numbers but by matrices. And matrices
do not obey the commutative law of multiplication. That is, if
we multiply matrix p by matrix q, the answer we get is different
from what we get when we multiply matrix q by matrix p. In
Newtonian mechanics, if q 3 is the position of a particle and
p 2 its momentum, the product of these two numbers does not
depend on the order in which we multiply them; that is,
p q 2 3 6 and q p 3 2 6. In a paper Born and
513
SUPERSTRINGS AND OTHER THINGS
Jordan published together, they show that the difference between
the two matrix products pq and qp is proportional to Planck's
constant h. In Newtonian physics, this difference is equal to
zero since p q ÿ q p 6 ÿ 6 0. Because our universe is
discrete, not continuous, Planck's constant is not zero and the difference between the two matrix products is not zero. Because h is
very small, we do not notice this discreteness in the macroscopic
world. The discrete nature of our world forces us to forgo the use
of simple numbers to represent quantities like position and
momentum; they must be represented by matrices. That Heisenberg was able to realize this in the few days at Heligoland can
only be explained as the insight of a genius.
In July 1925, Heisenberg gave a lecture at the Cavendish
Laboratory in Cambridge. In the audience was another brilliant
physicist, Paul Dirac, eight months younger than Heisenberg.
Although Heisenberg did not mention his new theory during
the lecture, he did discuss it with a few scientists afterwards
and even left a prepublication copy of his paper with Dirac's
supervisor, who passed it on to the young scientist. Dirac immediately understood the importance of Heisenberg's work and
began working on it himself. A few weeks later, Dirac wrote a
lucid paper where he showed that Heisenberg's new mechanics
was a complete theory that replaced Newtonian mechanics.
Meanwhile, Heisenberg, Born, and Jordan arrived at the same
conclusion using a different path. After only a few months of concentrated work by these four scientists, the ®rst coherent theory of
the atom emerged. It was called matrix mechanics and is one of the
several forms of quantum mechanics, the mechanics of the atom.
Wave mechanics
Einstein in his 1905 paper on the photoelectric effect had theorized
that light was a particle ± the photon ± and in 1909 suggested that a
theory of light should incorporate both the particle and wave
theories of light. In 1923, as we saw in chapter 21, Louis de Broglie
suggested that all matter ± an electron, for example ± should also
display wavelike behavior. De Broglie proceeded to calculate the
wavelength of the electron and to suggest that his prediction could
be con®rmed experimentally if a diffraction pattern of electrons
514
Quantum Mechanics
Figure 22.1. de Broglie's standing waves for circular stationary orbits.
were observed. This work was de Broglie's doctoral thesis. In
1927, Davisson and Germer observed interference patterns in
electron beams.
Erwin SchroÈdinger, a professor of physics at the University of
Zurich, read in one of Einstein's papers of 1925 a positive reference
to de Broglie's work. Einstein, we recall, had commented favorably on de Broglie's ideas when Paul Langevin, de Broglie's
thesis director, sent Einstein a copy of the thesis. According to
de Broglie's ideas, a wave of some sort must be associated with
an electron. When the electron orbits the nucleus in Bohr's
model of the atom, a resonance condition that gives rise to Bohr's stationary states must exist, in a similar fashion to the appearance of
stationary con®gurations produced by standing waves on a string.
For a circular orbit, de Broglie reasoned, an integral number of
wavelengths must coincide exactly with the circumference of a
stationary-state orbit (®gure 22.1). After studying de Broglie's
papers, SchroÈdinger used the mathematics of waves to calculate
the allowed energy levels, obtaining values that did not agree
with the observed patterns of atomic spectra. Not being successful
with his initial calculation, SchroÈdinger put the work aside for
several months. Peter Debye, head of the research group at
Zurich, having heard of de Broglie's work, asked SchroÈdinger to
give a seminar on the subject. The discussion at the seminar stimulated SchroÈdinger to return to his work. In a few weeks during the
Christmas holidays, SchroÈdinger developed a wave equation
which governed the propagation of electron waves.
With his wave equation, SchroÈdinger correctly obtained the
light spectrum of hydrogen. Before direct experimental veri®cation of electron interference, de Broglie's strange idea that the
electron was a wave was quantitatively vindicated by SchroÈdinger. His paper was published in January 1926. Wave mechanics,
515
SUPERSTRINGS AND OTHER THINGS
Pioneers of physics: SchroÈdinger's inspired guess
SchroÈdinger reasoned that if an electron has a wavelike
behavior, then it should be possible to describe this behavior
in space and time by a wave equation. He devised an equation ± a wave equation ± that controls the behavior of the
wave and that speci®es the connection between this behavior
and the behavior of the particle. His starting point was the
principle of conservation of energy. Applying de Broglie's
relation between momentum and wavelength and Planck's
E hf relation, SchroÈdinger was able to guess the mathematical form of the wave equation. We must stress that SchroÈdinger's equation cannot be derived; it was obtained by a
postulate. Its justi®cation is not that it has been deduced
from previous known facts, but that its predictions can be
experimentally veri®ed. ``Where did we get that [equation]
from?'' asked Richard Feynman, ``Nowhere. It is not possible
to derive it from anything you know. It came from the mind
of SchroÈdinger.''
as SchroÈdinger's theory was initially called, was a second
completely general way of formulating the mechanics of the
atom. As Dirac was later on to prove, the two approaches,
matrix mechanics and wave mechanics, are equivalent and both
are generalizations that include Newton's theory as a special case.
What are these waves in SchroÈdinger's theory? The ®rst
answer came from SchroÈdinger himself. Perhaps electrons and
all other subatomic particles are not really particles but waves.
The entire universe is nothing but a great wave phenomenon.
This interpretation was quickly rejected by Max Born because it
would not explain the fact that you could count individual particles with a Geiger counter and see their tracks in a cloud
chamber. Max Born offered a better solution to this puzzle. In
1924, before SchroÈdinger developed his theory, Bohr had written
a paper in which he had attempted to solve the contradiction
between the particle picture and the wave picture introduced
by de Broglie. In that paper Bohr had considered the concept of
probability waves. He interpreted the electromagnetic waves not
as real waves but as waves of probability the intensity of which
516
Quantum Mechanics
determines the probability that the atom is going to undergo a
transition from one energy level to another. Born extended
Bohr's concept and developed a clear mathematical de®nition of
the wave functions in SchroÈdinger's theory which he interpreted
as probability waves. These waves were not three-dimensional
waves like electromagnetic waves, but waves of many dimensions; they were, then, abstract mathematical entities. According
to Bohr's interpretation, the SchroÈdinger wave function of an electron gives the probability of locating an electron in a particular
region of space. More precisely, the square of the wave amplitude
at some point in space is the probability of ®nding the electron
there.
Heisenberg's uncertainty principle
In Heligoland, we recall, Heisenberg had discovered that observable or measurable quantities were to be represented by matrices.
In 1927, Heisenberg showed that if two matrices q and p represented two physical properties of a particle, like position and
momentum or time and energy, which obeyed the noncommutative rule, that is, that the difference between the product pq and
qp was proportional to Planck's constant, then one could not
measure both properties simultaneously with in®nite precision.
This statement is known as Heisenberg's Uncertainty Principle.
To illustrate this principle, let's consider a possible experiment
with electrons. In chapter 18, we discussed the diffraction of light
in Young's double-slit experiment in which a beam of light passed
initially through a single narrow slit and then through a pair of slits
that formed two coherent beams. These beams interfered with each
other, producing an interference pattern on a screen. We can
understand this interference pattern if we think of light as a
wave. In fact, this was the experimental proof given by Young
that light was a wave phenomenon. Light, Einstein told us, also
behaves as a particle. Can we think of light as made up of particles
and still understand Young's double-slit experiment? What if we
use electrons instead of photons?
Consider a source of electrons, like the electron gun in a
regular television tube, a thin metal plate with two very narrow
slits in it, and a phosphor-coated screen that produces a ¯ash
517
SUPERSTRINGS AND OTHER THINGS
Figure 22.2. (a) Double-slit experiment with electrons. (b) Probability
that an electron hits the screen at a particular point when only one slit
is open. (c) Probability when both slits are open.
when an electron collides with it, like a television screen (®gure
22.2(a)). What we observe is that the ¯ashes on the screen indicate
that each electron hits the screen in just one point. If we cover
either one of the slits and let the experiment run for some time,
counting the number of ¯ashes at any given position on the
screen, what we obtain for the probability of arrival of the electron at various locations on the screen is the pattern shown in
®gure 22.2(b). Uncovering both slits gives us the pattern shown
in ®gure 22.2(c) for the probability of arrival. This pattern looks
like the interference fringes obtained in the original Young's
double-slit experiment with waves shown in ®gure 18.9! Figure
22.3 is a photograph of the interference pattern for electrons in
a double-slit experiment. Observing the regions where the probability is minimum with both slits open we notice that the number
of counts with both slits open is smaller than the counts with only
one slit open. This is due to destructive interference in those
regions, a phenomenon characteristic of waves. The electrons
are behaving like waves even though they arrive at the screen
and make a ¯ash, like particles. A wave does not hit the screen
at a single point.
Could it be that the many electrons in the beam are somehow
interacting with each other and ``cooperating'' so that when they
518
Quantum Mechanics
Figure 22.3. Photograph of a double-slit interference experiment with
electrons. (Courtesy Claus Jonsson.)
arrive at the screen, an interference pattern is formed? To investigate this, we could reduce the intensity of the beam until we
have just a single electron traveling at a time. We begin to
notice the ¯ashes on the screen and, as before, we record their
position. At ®rst, no pattern is noticeable. When enough ¯ashes
accumulate, we begin to see the interference pattern again, as if
each electron interferes with itself !
How can an electron interfere with itself? Are the electrons,
somehow, passing through both slits at once and then recombining to form the interference pattern? We could install small detectors near the slits to determine which slit the electron passes
through. The detectors might shine some light on it so that we
can observe the scattered light. The ®rst thing we notice is that
the electrons go through either one of the slits; no funny business
of one electron going through both slits at once is observed.
Everything seems normal, as we would expect particles to
behave. As before, we record the ¯ashes and when enough of
them accumulate, we see that the interference pattern has disappeared. The probability that an electron hits the screen at a
given location is what we would obtain for regular particles.
Observing the electrons destroys the interference pattern. Determining
the position of the electron destroys their ability to interfere.
It might be that the light we are using to observe the electrons
disturbs them and changes their behavior. To get around this
dif®culty, we reduce the intensity of the light shining on the electrons and repeat the experiment. Reducing the intensity of the
light means that we reduce the energy of the photons. As we
519
SUPERSTRINGS AND OTHER THINGS
might recall, the energy of each photon, from Planck's formula, is
given by hf, where f is the frequency of the light. If we have n
photons in each ¯ash, the total energy is E nhf . We can
reduce the intensity of the light shining on the electrons by sending very few photons so that n is small. When we use one photon
for every electron that passes, no interference pattern is observed
and we account for the position of each electron. Reducing the
number of photons even more results in some electrons not
being seen. Not only do we lose information about the location
of each electron, but we begin to observe a faint interference pattern on the screen, produced by the electrons that we failed to see.
We need at least one photon for every electron if we want to know
through which slit they pass.
Reducing the intensity of the light shining on the electrons by
reducing the frequency increases the wavelength of the light
which reduces the resolution of our detector; that is, its ability
to determine the location of the electron. When the frequency is
not too small and the wavelength small enough to let us see
each electron, no interference pattern is produced. It might be
that we are still disturbing the electrons. If we reduce the intensity
of the light so as not to disturb the passing electrons, the wavelength becomes larger than the distance between the slits and
we cannot tell which one the electron went through. And on
top of that, the interference pattern returns!
Our attempt to observe the electrons without disturbing
them has not worked. What Heisenberg's uncertainty principle
tells us is that it cannot work. Our failure is not due to our lack
of imagination, or to lack of better instrumentation; it will never
work. It is an intrinsic limitation of nature. This is the way the universe works. It is not possible for us to determine through which
slit the electron passes on its way to the screen. The only solution
is that the electron goes through both slits at once and carries their
imprint when it arrives at the screen. This concept is impossible
for us to imagine and it is only through mathematics that physicists have been able to encapsulate this phenomenon.
Heisenberg derived a mathematical expression to explain
this intrinsic limitation of nature. If the uncertainty in the measurement of the position of the electron along the vertical direction is
q and the uncertainty in this component of the momentum is
p, Heisenberg's uncertainty principle says that the product of
520
Quantum Mechanics
these two uncertainties must never be smaller than a certain constant, which is equal to Planck's constant h divided by 2; that is,
q p
h
:
2
This constant h=2 is equal to 1:055 10ÿ34 J s and places a lower
limit to the value of the product of the uncertainties. If our instruments allow us to measure the electron's position with an uncertainty of only one-millionth of a meter (10ÿ6 m), for example, the
uncertainty in the momentum must be no smaller than
1:055 10ÿ28 kg m/s so that the product is at least h=2. This
makes the uncertainty in the electron's velocity equal to about
100 m/s. If with better instruments we reduce the uncertainty
of the position to one nanometer, 10ÿ9 m, the uncertainty in the
electron's velocity becomes one thousand times larger. The
momentum has become so uncertain that we will not know
whether, one second later, the electron will still be near the instruments or 100 km away!
We must emphasize here that the concept of uncertainty
applies not to a single measurement but to the measurement of
many electrons. Heisenberg's uncertainty principle is a statement
about a statistical average over many measurements.
As can be seen from the above discussion, the name particle
for electrons is an unfortunate one because it gives the idea that
electrons are like dust particles or any other macroscopic particle.
Electrons and other quantum ``particles'' are really quantum
entities with properties given by the laws of quantum mechanics.
The new physics
What are we to make of all this? As the double-slit experiment
with electrons showed, the act of observation changes the
system being observed. Electrons and other quantum particles
behave like particles and like waves, depending on the particular
observation. If we set up an experiment to detect the wave nature
Since momentum is the product of mass times velocity, and the mass of the
electron is 9:11 10ÿ31 kg. The uncertainty in the velocity is then the uncertainty
in the momentum, 1:055 10ÿ28 kg m/s, divided by is 9:11 10ÿ31 kg, or 116 m/s.
521
SUPERSTRINGS AND OTHER THINGS
The frontiers of physics: Knowledge and
certainty
For all its power and its implications for our knowledge
of nature, the mathematical expression that Heisenberg
developed for his uncertainty principle looks surprisingly
simple. We can illustrate the method that Heisenberg used
to obtain it as follows:
We can call the uncertainty in the position of the electron
along the vertical direction q. The position of the electron
when it hits the screen depends on the component of the
electron's momentum along the vertical axis (q-axis); the
uncertainty in this component of the momentum is p. If
we use light of wavelength to observe each electron, then
the uncertainty in the position is equal to , or q . The
momentum imparted to the particle is at the most equal the
momentum of the photon, which is given by de Broglie's
relation. De Broglie's relation, from chapter 21, is
h=mv h=p, so that the uncertainty in the momentum is
approximately p h=. The product of the two uncertainties q and p is
q p h:
What Heisenberg's uncertainty principle says is that the product of these two uncertainties must always be greater than
Planck's constant h. Heisenberg's more detailed derivation
gives the form of the uncertainty principle as
q p
h
:
2
of the electron, we ®nd wave phenomena such as interference and
diffraction. If we set up an experiment to detect the electron's
particle nature, we detect particles. Which one is the correct interpretation? What is the true nature of the electron? According to
Bohr, it is meaningless to ask what an electron really is. Physics
does not tell us about what is but about what we can describe
regarding the world. Both theoretical pictures, the wave picture
and the particle picture, are equally valid in describing the
electron and provide complementary descriptions of reality.
522
Quantum Mechanics
Physics in our world: Electron microscopes
Soon after it was discovered that high speed electrons had
wavelengths that were much shorter than the wavelengths
of light, physicists showed that magnetic ®elds could act as
lenses by causing electron waves to converge to a focus. In
1931, Max Knoll and Ernst Ruska built the ®rst microscope
that focused electron waves instead of light waves. The electrons were focused with electric or magnetic ®elds.
Diagram of a transmission electron microscope, TEM.
523
SUPERSTRINGS AND OTHER THINGS
In the design of Knoll and Ruska which is based on an
optical microscope and known as the transmission electron
microscope (TEM), a hot tungsten wire emits electrons which
are focused on the specimen by a magnet acting as a condenser ``electron lens.'' An objective lens produces a magni®ed
image which becomes the object for the projector lens. The
image is further magni®ed and projected onto a ¯uorescent
screen or a photographic plate.
A few years after the invention of the TEM, Knoll modi®ed their earlier design so that the focused electron beam
scanned the surface of the specimen. In the scanning electron
microscope (SEM), as his invention is known, electrons from
the surface of the specimen are knocked out by the incident
electrons. Some of these electrons can be detected by an
electrode. As the beam scans the surface, more electrons are
knocked out from sharp edges than from a ¯at surface and
a map of the object can be generated on a CRT.
SEM image of a brass ®tting from a liquid oxygen tanker at launch
pad 39A at the Kennedy Space Center showing dendritic structure.
(Courtesy V Cummings, NASA Kennedy Space Center.)
524
Quantum Mechanics
A modern TEM can produce magni®cations up to
1 000 000 and resolve objects as small as 0.2 nm. A modern
SEM, on the other hand, can produce magni®cations that
range from 15 to about 100 000. The SEM, however,
produces an image with a large depth of ®eld, giving the
impression of a three-dimensional surface relief. Both instruments are based on the extremely short effective de Broglie
wavelengths of high-speed electrons.
The scanning tunneling microscope
The scanning tunneling microscope (STM), an instrument
capable of producing images with atomic resolution, was
invented in 1981 by Gerd Binnig, Heinrich Rohrer, and
collaborators at the IBM Research Laboratory in Zurich,
Switzerland. Their discovery won Binning and Rohrer the
1986 Nobel Prize in Physics.
The operation of the STM is based on the phenomenon of
tunneling. Quantum mechanics tells us that a particle such as
an electron has a ®nite probability of ``tunneling'' through a
potential barrier that classically separates two regions. Electromagnetic theory tells us that when a voltage is applied
between the tip of the instrument and the surface to be
examined, there is a potential barrier between the tip and
the surface. An electron with insuf®cient energy to surmount
the barrier would not be able to move from the tip to the
surface. However, quantum mechanics tells us that since
the electron can be described by a wave function, it has a
®nite probability of crossing or tunneling through the
barrier. This tunneling probability decreases very rapidly
(exponentially) with the width of the potential barrier. As a
consequence, observation of tunneling events is feasible
only for relatively small potential barriers.
In the STM, a sharp metal tip is brought to within a nanometer of the surface, producing an overlap of the electronic
wave functions. When a small voltage is applied, a tunneling
current in the nanoampere range ¯ows between the tip
and the surface. The tip is scanned across the surface using
525
SUPERSTRINGS AND OTHER THINGS
piezoelectric crystals, which expand or contract in response
to an applied voltage. A third crystal maintains the tip at a
constant distance of a few tens of nanometers above the
surface. The potential applied to this crystal determines the
vertical position of the tip and thus gives the topography of
the surface.
An image of graphite with the atoms arranged in an ordered
Ê ) is
array, taken with the Sweet Briar College STM. An aÊngstroÈm (A
0.1 nm.
Heisenberg's uncertainty principle tells us that we cannot
know both the position and the momentum of a particle simultaneously with in®nite precision. The more precision we demand of
our knowledge of position, for example, the less precision we are
allowed in our knowledge of momentum. If we make q very
small, p becomes large in such a way as to keep the product
of the two uncertainties proportional to Planck's constant. We
can choose to measure the position with in®nite precision, but
this means that we completely give up knowing anything about
the particle's momentum; the uncertainty becomes in®nite. Or
we can determine precisely the particle's momentum but in
this case, our knowledge of its position is undetermined; the
particle can be anywhere. Both situations are complementary.
526
Quantum Mechanics
Each quality ± position and momentum ± are complementary
properties of the quantum particle.
An electron occupies a position in space only when an observation is made. Therefore, if, through observation, we determine
that an electron is at a given location at one particular instant, and
at another location at another particular instant, those two locations are the only ones that we can attribute to the electron. We
cannot say what the electron did to move from the ®rst location
to the second, and we cannot speak of the path that the electron
followed between the two locations, because the path does not
exist.
Is this the way the world really is? Is the world this crazy
place where the observer changes reality by the mere act of
observation? Bohr was convinced that this was the case. His
Copenhagen interpretation of quantum mechanics, as the ideas of
Bohr, Heisenberg and Born ± uncertainty, complementarity and
the disturbance of the system by the observer ± have been collectively called, is the interpretation accepted by the great majority
of physicists today. The major dissenter was Albert Einstein. He
never accepted that there could be chance and limitations
imposed on the measurement of observed phenomena in
nature. The uncertainty in the quantum world is not an intrinsic
property of the universe but the result of our incomplete knowledge of it, he reasoned. The weather, for example, is unpredictable not because of any inherent uncertainty but because
climate conditions involve too many parameters and too many
particles which make it impossible with the tools available to us
at the present to compute their behavior with the necessary
detail for accurate prediction. If a large enough computer were
available to take into consideration all these variables, accurate
weather prediction would be possible. The unpredictability and
uncertainty, then, are a result of these hidden variables that we
have not included. In the case of quantum mechanics, the
hidden variables have not been included because we do not
know them yet. According to Einstein, quantum mechanics is
not incorrect; it is simply incomplete. When this deeper level of
hidden variables is discovered, a new correct and deterministic
theory will be developed.
In 1935 Einstein wrote a paper with his colleagues Boris
Podolsky and Nathan Rosen (``EPR'') in which he proposed a
527
SUPERSTRINGS AND OTHER THINGS
Figure 22.4. In the EPR experiment a particle at rest explodes into two
fragments that move away from each other. We can measure the momentum of particle 1 and the position of particle 2. Because of conservation of
momentum, it is possible to deduce the momentum of particle 2.
thought experiment to show that it was possible for a particle to
have both position and momentum at the same time and consequently, to show that quantum mechanics is incomplete. In the
EPR experiment, a particle at rest splits into two fragments, 1
and 2 (®gure 22.4). The momentum of the original particle
before splitting is zero since the particle is at rest. Because
momentum is conserved, the total momentum of particles 1
plus 2 has to be zero also. When the particles are at a great distance from each other, the momentum of particle 1 is measured.
This should not affect particle 2 in any way, since it is very far
away. Heisenberg's uncertainty principle prohibits us from accurately measuring the position of particle 1 but, since the momentum of particle 2 has not been measured, we are free to determine
its position with perfect accuracy. We now know particle 1's
momentum, particle 2's position and the total momentum of
both particles. The momentum of particle 2 is equal to the total
momentum of both particles minus the momentum of particle
1, which we know. Although we did not measure particle 2's
momentum, we are able to compute it with accuracy. Contrary
to what quantum mechanics says, we have accurately determined
the position and the momentum of particle 2. ``I am therefore
inclined to believe,'' Einstein wrote later, ``that the description
of [the] quantum mechanism . . . has to be regarded as an incomplete and indirect description of reality, to be replaced at some
later date by a more complete and direct one.''
Bohr's response to the EPR argument was that the two
particles, once in contact with each other, continue to be part of
the system upon which one makes measurements. Therefore,
Heisenberg's uncertainty principle prohibits us from knowing
both the momentum and the position of particle 2 regardless of
528
Quantum Mechanics
the technique used to measure these quantities. Einstein could
not accept the idea that a measurement made at one location
could affect a second particle at a distant location. There is no limitation as to how far apart the two particles have to be in this
thought experiment. The measurement of the momentum of particle 1 could be made in a laboratory on Earth and the measurement of particle 2's position could be done by a laboratory
on the Moon, when the particle gets there.
Why not resolve the controversy by direct experiment? It
turns out that this kind of experiment is extremely dif®cult to
perform. In 1965, the Irish physicist John Stewart Bell, working
at CERN, the European accelerator center in Geneva, decided to
study the EPR experiment and was able to state and prove a
powerful theorem, known today as Bell's theorem, which deals
with the correlations that could exist between the results of simultaneous measurements done on two separated particles. Bell's
theorem gives the theoretical limits on the correlations between
these simultaneous measurements. The limits on these correlations were given by Bell in the form of an inequality, known
appropriately enough as Bell's inequality. To give a simple illustration of the reasoning behind Bell's inequality, the amount of
money in change that you have in your pocket at a particular
moment cannot be greater than the amount in change plus the
amount that you have in bills of different denominations at that
same moment.
Bell's theorem opened the path for an experiment that would
decide whether or not there are hidden variables in quantum
mechanics. A real EPR experiment. In 1982 Alain Aspect in
Paris performed such an experiment. The results were unequivocal. There are no hidden variables in quantum mechanics. Bohr
was right and Einstein was wrong. The world is as strange as
quantum mechanics has shown us it is. An electron is neither a
particle nor a wave. It is not an entity out there, separated from
the rest of the universe. An electron, a proton, a quantum particle,
an atom, all of these ``particles'' are simply convenient ways that
we have of thinking about what is only a set of mathematical
relations that connect our observations. The universe is not a
collection of objects; it is rather an inseparable net of relations in
which no component, including the observer, has an independent
reality.
529
SUPERSTRINGS AND OTHER THINGS
The frontiers of physics: Quantum teleportation
In Star Trek, when the members of the away team down on
the surface of an alien world need to get out of a sticky situation in a hurry, they contact their ship and ask to be ``beamed
up.'' Until recently, teleportation, the idea of transporting
matter instantaneously from one place to another, remained
in the realm of science ®ction.
Physicists had not paid much attention to teleportation
because it seemed to violate the uncertainty principle of
quantum mechanics, which forbids the simultaneous knowledge of all the information in an object. However, in 1993 a
research team led by Charles H Bennett of the IBM Thomas
J Watson Research Center in Yorktown Heights, NY, proposed a way to use the EPR experiment to transmit a quantum state instantaneously. It has been known for several
decades that there are situations in which an atom emits
two simultaneous photons which travel in opposite directions and remain forever correlated or ``entangled.'' Quantum mechanics tells us that neither photon has a particular
polarization until a measurement is performed on them.
When such a measurement is done on one of the two photons
and its polarization is determined, the second ``entangled''
photon immediately acquires the opposite polarization,
regardless of how far away it is (at the other side of the
laboratory or at the opposite end of the galaxy).
The EPR experiment does not allow the instantaneous
transmission of information, however. The IBM team proposed a theoretical method to record or scan and ``transmit''
part of the information from a particle's quantum state to its
entangled partner. Later, the second particle's state is modi®ed using the scanned information to bring it to the original
state of the ®rst particle. Having been disturbed by the
scanning process, the ®rst particle is no longer in its original
quantum state. The method provided a clever way to instantaneously teleport the quantum state of a particle to a second
particle.
The work of the IBM team was theoretical. In December
1997, researchers at the University of Innsbruck in Austria
530
Quantum Mechanics
reported the results of the ®rst experiment in which a photon
was teleported. A second group led by Francesco De Martini
at the University of Rome ``La Sapienza'' also reported
having teleported photon characteristics.
(Reprinted by permission from International Business Machines
Corporation, copyright 2000.)
Scientists are considering transporting electrons and
even whole atoms and ions. One idea is to transport a fragile
quantum state from a particle with a short life to a stable one.
In this way, information could be stored on trapped ions,
shielded from the environment, a quantum memory for computers. Other ideas involve the use of quantum teleportation
for transmission of information under noisy conditions,
where messages would be degraded or for providing links
between quantum computers. Physicists see that it would
be dif®cult to go beyond those applications to the point
where macroscopic objects could be instantly teleported.
The problem is that once the transfer is made, the receiver's
observation changes the transferred object's state. To interpret the result correctly, information about the process still
needs to be sent to the receiver by conventional means (at
speeds lower than the speed of light).
531
23
NUCLEAR PHYSICS
Beyond the atom
Quantum mechanics provided the mathematical foundation for
our understanding of the properties of atoms. With this foundation in place, progress came quickly. Some physicists used the
new theory to develop a more complete picture of the electronic
properties of atoms, while others went on to apply the theory
to the study of the atomic nucleus.
We studied some of the basic properties of the nucleus in
chapter 8. Here we will examine nuclear transformations, nuclear
energy, and the applications of nuclear physics to modern
technology.
Radioactivity
As we saw in chapter 8, atomic nuclei are composed of protons
and neutrons bound by the strong force. Because the electric
force has in®nite range, diminishing gradually with distance,
the positively charged protons feel the electrical repulsion of
each and every one of the remaining protons in the nucleus.
Since the nuclear force is a short range force, its in¯uence dies
out to almost nothing when the protons are separated by more
than 10ÿ14 m.
If a nucleus contains too many protons, the total electric
repulsion can overwhelm the nuclear attraction and a fragment
of the nucleus (an alpha particle) will ¯y out. This is known as
alpha decay. (Alpha is the Greek letter .) However, if the nucleus
contains a large number of neutrons, they contribute to the
532
Nuclear Physics
nuclear force without being affected by the electric force, as
they are neutral. This is why light nuclei usually have a similar
number of protons and neutrons, like 42 He, 73 Li, 126 C, and 168 O
whereas heavy nuclei require a great deal more neutrons to
counterbalance the effect of the electrical repulsion. For example
197
208
82 Pb contains 126 neutrons and 82 protons, and 79 Au has 118
neutrons and only 79 protons.
There is a limit to the number of neutrons that a nucleus can
have and still be stable. The reason for this has to do with another
type of force, the weak nuclear force, which is also a short-range
force and is responsible for the instability of the neutron. On
the average, a free neutron, outside the nucleus, can only last
for about ®fteen minutes before it disintegrates into a proton,
an electron, and another particle called a neutrino. Inside the
nucleus, however, this disintegration usually does not take
place because of a principle of quantum mechanics, known as
the Pauli exclusion principle, which states that no two protons
can occupy the same quantum state. These quantum states are
similar to the quantum states occupied by the electrons in the
atom. Pauli's principle prevents a nuclear neutron from decaying
into a proton because this new proton would have nowhere to go,
all the available states being already occupied by other protons.
Neutrons also obey the Pauli exclusion principle. Thus, a
nucleus with too many neutrons has to place those neutrons in
higher energy states. In these states the neutrons have enough
energy to place the protons into which they decay in unoccupied
high energy levels. But where this happens the nucleus loses its
identity. In other words, when a neutron inside a nucleus is
able to decay through the weak nuclear force into a proton, an
electron and a neutrino, the nucleus is transmuted into a different
nucleus. Because the nucleus cannot contain free electrons, these
are ejected. (According to Heisenberg's Uncertainty Principle, a
particle with a mass as small as that of the electron and having
the energies observed in beta decay cannot be con®ned to a
volume as small as that occupied by the nucleus.) This phenomenon is called beta decay. (Beta is the Greek letter .) This is the
same phenomenon as that identi®ed by Rutherford at the beginning of this century.
We now discuss these two types of radioactive decay,
together with a third, known as gamma decay, in some detail.
533
SUPERSTRINGS AND OTHER THINGS
(Gamma is the Greek letter . Alpha, beta and gamma are the ®rst
three letters of the Greek alphabet.)
Alpha decay
As we said earlier, if a nucleus contains too many protons it is
electrically unstable and undergoes alpha decay. An example
of a nucleus that alpha-decays is 238
92 U. This decay is illustrated
in ®gure 23.1.
We can write this decay as follows:
238
92 U
4
! 234
90 Th 2 He:
We call the nucleus before the decay the parent nucleus (238
92 U in our
example) and the nucleus produced after the decay, the daughter
nucleus (234
90 Th). Notice that the mass number of the parent nucleus
is 238 and this is equal to the sum of the mass numbers of the
daughter nucleus and the alpha particle (42 He), namely 234 4.
Since the mass number represents the number of nucleons in
the nucleus, this is the same as saying that the number of
nucleons in the parent nucleus must be equal to the total
number of nucleons in the daughter plus those in the alpha
particle. The atomic numbers obey a similar rule. Because the
atomic number represents the number of charges in the nucleus
(number of protons), this second rule says that the number of
charges in the parent nucleus must equal the total number of
Figure 23.1. Alpha decay of
534
238
92 U.
Nuclear Physics
charges in the daughter and the alpha particle. These two rules
hold for every radioactive decay and are based on conservation
laws. These conservation laws are called conservation of nucleon
number and conservation of electric charge.
Beta decay
A neutron-rich nucleus is also unstable and will convert spontaneously into another nucleus with an extra proton through the
weak nuclear force. As we mentioned earlier, this ``beta decay''
happens because of the transformation of the neutron into a
proton, an electron and a neutrino particle. The neutrino was proposed by the physicist Wolfgang Pauli as a solution to a problem
that was observed when the energies of the nuclei involved in
beta decay were measured. It seemed that in beta decay the principle of conservation of energy no longer held. Because physicists
have always been reluctant to accept anything that violates this
well-established principle, Pauli proposed in 1930 the existence
of a small neutral particle that would take care of the observed
discrepancy. The Italian physicist Enrico Fermi named it the
neutrino, Italian for ``little neutral one.'' Twenty years later
Frederick Reines and Clyde L Cowan detected the neutrino at
the US Atomic Energy Commission Savanna River Laboratory
in Georgia.
A typical beta decay event is illustrated in ®gure 23.2. In
symbols, we write this decay as
14
6C
! 147 N ÿ10 e :
Here the symbol represents the neutrino particle ( is the Greek
letter nu). In this case it is an ``antineutrino,'' which is the antiparticle of the neutrino, identi®ed by a bar over the symbol .
An antiparticle is a particle of antimatter: antimatter consists of
atoms with negatively charged nuclei and positive electrons,
called positrons. (Antimatter will be studied in more detail in
chapter 22). Because the electron is not a nucleon, its mass
number is indicated as zero, and as it has the same electric
charge as the proton but opposite sign, its atomic number appears
as ÿ1.
Emission of a positron (a positive electron or antielectron)
together with a neutrino also takes place. An example of this
535
SUPERSTRINGS AND OTHER THINGS
Figure 23.2. Beta decay of 146 C. One of the 8 neutrons in 146 C is transformed into a proton, an electron, and an antineutrino. The electron
and the antineutrino ¯y away. The daughter nucleus, 147 N, has one
more neutron and one fewer proton than the parent nucleus.
type of beta decay is
12
7N
! 126 C 01 e :
Note the 1 charge on this antielectron.
Gamma decay
In many cases the product of an alpha or beta decay process is a
nucleus that is left in an excited state. When a nucleus is in an
excited state, the nucleons can jump to a lower energy state
and, in a process similar to the way atoms emit electromagnetic
radiation, one or several photons are emitted from the nucleus.
The energy of these photons is much higher than that of visible
light. Photons so produced are called gamma rays.
In gamma decay there is no emission or absorption of
nucleons. The only change is in the energy of the nucleus. Therefore, neither Z nor A change and the nucleus remains the same.
We indicate a nucleus in an excited state by placing an asterisk
next to its chemical symbol. One example of gamma decay is
12
6C
! 126 C :
The products of radioactive decays (the alpha and beta
particles and the gamma ray photons) can interact with the
atoms and molecules in living cells or in electronic circuits.
These interactions can lead to certain changes. A few of the
536
Nuclear Physics
changes that may occur in living cells may be bene®cial and, if
they are passed to offspring, these changes help to produce the
process that we call evolution. Most of the changes in the cells
of biological systems, however, are detrimental, causing
damage to the organism. Again, just about every change in an
electronic circuit produces some damage and can lead to serious
disruption of the electronic systems in which they are installed.
Half-life
In a radioactive sample there are vast numbers of nuclei. Every
time a nucleus decays by alpha or beta emission, the number of
radioactive nuclei in the sample decreases by one. The time
required for half the nuclei in a given radioactive sample to
decay is called the half-life, T, of the particular nuclide. In each
subsequent half life, half the remaining nuclei decay; thus after
two half-lives, the remaining radioactive nuclei is one-fourth
the original number. After three half-lives, the original number
is reduced to one-eighth. For a given radioactive nuclear species,
the half-life is the same regardless of the number of nuclei in the
sample. The half-lives of several nuclei are listed in Table 23.1.
Although the half-life of 146 C, say, is known to be 5730 years,
we cannot predict exactly when a particular 146 C nucleus will
decay. The decay of a given nucleus is a random event. Because
a macroscopic sample contains an enormous number of nuclei,
we can use the laws of probability to determine how many
nuclei (without specifying which ones) will decay in a given
length of time. The probability that an unstable nucleus will
decay spontaneously is the same at any instant, regardless of its
past history, and it is the same for all nuclei of the same type.
Table 23.1 Half-lives of selected nuclei
Element
Uranium-238
Uranium-234
Carbon-14
Radium-226
Strontium-90
Half-life
9
4:55 10 years
2:48 105 years
5730 years
1620 years
28.9 years
537
Element
Half-life
Cobalt-60
Thorium-234
Iodine-131
Radon-222
Lead-214
5.3 years
24.1 days
8.1 days
3.8 days
26.8 minutes
SUPERSTRINGS AND OTHER THINGS
Nuclear reactions
All three radioactive decay processes are examples of nuclear
transformations. These are not the only way a nucleus can be
changed into one or more nuclei, however. If a fast moving
nucleon collides with a nucleus, a nuclear transformation can
occur. These nuclear reactions can take place in nature or they
can be arti®cially produced.
The ®rst nuclear reaction produced arti®cially was observed
by Rutherford in 1919. He discovered that when an alpha particle
passed through nitrogen gas, protons were emitted. He concluded correctly that nitrogen nuclei had been transformed into
oxygen nuclei (®gure 23.3).
We can write this nuclear reaction in symbols, as
4
2 He
147 N ! 178 O 11 H;
which is commonly written in the abbreviated form, with p representing the proton, as
14
7N
; p178 O:
Thousands of nuclear reactions have been studied since Rutherford's time, most of them arti®cially produced with the aid of
the particle accelerators invented during the 1930s. The products
of these reactions are in many cases radioactive isotopes that have
value in medicine and related ®elds where they are used as radioactive ``labels'' that show the path taken by a substance through
the body (®gure 23.4).
Nuclear reactions also obey the two conservation rules
introduced in our study of radioactive decay. Applying these
conservation rules, that is, conservation of nucleon number and
conservation of electric charge, to Rutherford's reaction, we can
Figure 23.3. An alpha particle strikes a nitrogen nucleus and is
absorbed. The new nucleus ejects a proton, becoming an oxygen nucleus.
This was the ®rst nuclear reaction to be arti®cially produced.
538
Nuclear Physics
Figure 23.4. Image of the brain of a patient with a migraine is formed by
detecting gamma rays from radioactive ¯ourine-18 that has been injected
into the patient. (Courtesy Department of Nuclear Medicine, Charing
Cross Hospital/Science Photo Library.)
see that the number of nucleons is the same before and after the
reaction takes place. To check this, notice that the sum of the
mass numbers of the nuclei before the collision takes place, He
and N, is 4 14 18, which is equal to the sum of the mass numbers of the ®nal nuclei, 17 for oxygen plus 1 for the ejected proton.
Similarly, the conservation of charge gives 2 for helium plus 7 for
nitrogen which equals the 8 for oxygen plus 1 for the proton.
Using these rules we could, for example, determine the product of the collision of a neutron (10 n) with 27
13 Al, which ejects an
alpha particle. Writing X for the unknown nucleus we have
1
0n
4
27
13 Al ! 2 He X:
539
SUPERSTRINGS AND OTHER THINGS
Conservation of nucleon number gives
1 27 4 Ax
and conservation of charge,
0 13 2 Zx
where Ax and Zx are the mass number and atomic number of our
unknown nucleus, respectively. This tells us that the atomic
number of X is 11, identifying the nucleus as sodium, Na, with
a mass number equal to 24. This particular isotope of sodium,
24
11 Na, is radioactive, and decays with a half-life of 15 hours
through beta emission into 24
12 Mg. This process is used in medicine
to study the way sodium is transported across membranes.
Nuclear energy: Fission and fusion
Nuclear ®ssion
During the summer of 1939, just before the start of World War II,
the German scientists Otto Hahn and Fritz Strassmann published
a paper where they stated that when uranium was bombarded
with neutrons, it produced smaller nuclei which were about
half the size of the original uranium nucleus. One of Hahn's previous collaborators, the physicist Lise Meitner and her nephew
Otto Frisch, both of whom had ¯ed Nazi Germany and were
working at the Nobel Institute in Sweden, immediately realized
that the uranium nucleus actually had split into two fragments.
This new phenomenon was called nuclear ®ssion.
In ®gure 23.5 we have illustrated how nuclear ®ssion works.
An incoming neutron strikes a uranium-235 nucleus and is
Figure 23.5. The steps in the nuclear ®ssion of a uranium nucleus, as
explained in the liquid drop model.
540
Nuclear Physics
momentarily absorbed, turning the target nucleus into uranium236. The absorbed neutron gives the uranium nucleus extra
internal energy which causes it to split. Once the fragments are
separated, the electrical repulsion between the ®ssion fragments
is stronger than the nuclear attraction and the two nuclei ¯y
apart. Since a liquid drop splits into two parts in a similar
fashion, physicists constructed a model based on this phenomenon, the liquid-drop model of ®ssion, to explain how it works. If
enough energy is provided to the drop, it will begin to take on
increasingly larger elongated shapes, vibrating back and forth
until it ®nally splits. In the uranium-235 nucleus, the neutron
provides the energy needed to start the vibrations until the
newly formed uranium-236 splits into ®ssion fragments.
A typical ®ssion reaction is
1
0n
1
94
139
236
235
92 U ! 92 U ! 54 Xe 38 Sr 30 n:
If the neutrons released in this reaction could be used to initiate
further reactions, a self-sustaining chain reaction could take place
(®gure 23.6). Enrico Fermi and other physicists at the University
of Chicago showed that this chain reaction was possible by constructing the ®rst nuclear reactor in 1942.
Figure 23.6. Chain reaction. After the ®rst reaction is initiated by one
neutron, the neutrons released by this reaction can strike uranium
nuclei causing additional ®ssions which in turn produce more neutrons.
The process multiplies very rapidly.
541
SUPERSTRINGS AND OTHER THINGS
Figure 23.7. Binding energy per nucleon plotted versus atomic mass
number. This curve shows that medium nuclei are more tightly bound
than light or heavy nuclei.
In a uranium ®ssion reaction like the one above, the mass of
U-235 plus a neutron is much greater than that of the ®ssion
fragments. As we saw in chapter 8, the curve of binding energy
per nucleon versus mass number (®gure 23.7) shows that the
values of the binding energy for light and for heavy nuclei are
smaller that those for medium nuclei. Therefore, these nuclei are
not as tightly bound as medium-weight nuclei (iron is the most
stable nucleus). Splitting a heavy nucleus, like U-235, to produce
medium-weight nuclei, which are more stable, releases energy.
We can calculate this energy by determining the mass de®ciency.
Before the reaction takes place, the total mass is the mass of the
approaching neutron plus the mass of the U-235 nucleus, or
m 10 n m
235
92 U
1:008 665 amu 235:043 925 amu
236:052 6 amu
and the mass of the ®ssion fragments is
m
139
54 Xe
m
94
38 Sr
3m 10 n 138:918 43 93:915 47
3 1:008 665 amu
235:859 9 amu:
The mass de®ciency is thus 236:052 6 ÿ 235:859 9 amu
0.192 7 amu. That is, the ®ssion fragments have less mass than
542
Nuclear Physics
the U-235 nucleus and the incoming neutron. The energy equivalent of this mass difference is
E 0:192 7 amu
931 MeV
1 amu
179 MeV:
This is the energy released in each ®ssion event. At the nuclear
level, this is an enormous amount of energy. At the macroscopic
level, 179 MeV correspond to about 3 10ÿ11 J, which is a very
tiny amount of energy. In a macroscopic sample of uranium,
however, there are billions of nuclei, so that if a chain reaction
is triggered, a huge amount of energy can be released. This is
the principle behind the nuclear bomb, a nuclear chain reaction
without control.
For a nuclear explosion to occur, there must be enough
®ssionable uranium to sustain the chain reaction. Natural
uranium consists of several isotopes of uranium, the most abundant of which is U-238 (99.3%). Unlike uranium-235, U-238 is not
a good prospect for a chain reaction because when neutrons
collide with it they are either scattered or captured. Since the
natural concentration of U-235 is only 0.7%, it is necessary to
increase it to over 90% in order to obtain a chain reaction. The
minimum mass required to produce a chain reaction is called
the critical mass and it depends not only on the amount of
uranium-235 but on the design of the bomb. An amount of
U-235 as small as 1 kilogram is a signi®cant quantity for a good
design.
A critical mass of U-235 will explode on its own, as any stray
neutron can initiate a chain reaction. For this reason nuclear
bombs are designed so that the uranium cannot form a critical
mass until the precise moment when the bomb is to explode.
Little Boy, the bomb that exploded on Hiroshima on August 6,
1945, was designed so that the uranium was kept in two subcritical
masses at the two ends of a cigar-shaped container (®gure 23.8).
An explosive propelled the two pieces together at a speed of
several kilometers per second to form a critical mass. After the
critical mass is formed the entire explosion takes place in about
one microsecond. 99.9% of the energy is released during the last
tenth of a microsecond of the explosion.
543
SUPERSTRINGS AND OTHER THINGS
Figure 23.8. One design for a nuclear bomb. An explosive propels the
two pieces together to form the critical mass. The bomb explodes in
about one microsecond. This design was used in Little Boy, the bomb
dropped on Hiroshima in 1945.
Core of a nuclear reactor. (Courtesy NASA.)
In a nuclear reactor, a low concentration of U-235, usually
about 3% ensures that most of the neutrons released by one
®ssion reaction are not absorbed by other U-235 nuclei. But
even at this low concentration, the ®ssion rate would eventually
increase. To prevent this increase and to control the rate of reactions, cadmium rods that absorb neutrons are inserted inside the
uranium core (®gure 23.9). In the reactor, the splitting of U-235
544
Nuclear Physics
Figure 23.9.
Schematic diagram of a nuclear reactor.
nuclei in a controlled environment produces energy that is used
to heat water and obtain steam that powers a turbine to produce
electricity.
In the 1970s scientists found evidence of a natural nuclear
reactor that existed almost two billion years ago during the
Precambrian era in Gabon, West Africa. Examining samples of
uranium from a mine, they discovered that a chain reaction had
taken place there for almost a million years with water trapped
in the sandstone absorbing some of the neutrons, like the
cadmium rods in modern reactors.
Nuclear fusion
Another source of nuclear energy comes from nuclear fusion, the
process that fuels the sun and the stars (®gure 23.10 (color
plate)). Nuclear fusion occurs when two light nuclei are fused
together to form a heavier nucleus (®gure 23.11). As we know,
the mass of every stable nucleus is less than the mass of its
protons and neutrons taken together. This is seen better by reexamining the binding energy per nucleon curve of ®gure 23.7.
Since light nuclei are less tightly bound than medium-weight
nuclei, joining light nuclei produces a more massive nucleus
which is more stable. Thus if two light nuclei were to come
together to form a new nucleus, energy would be released.
545
SUPERSTRINGS AND OTHER THINGS
Figure 23.11. Nuclear fusion: a nucleus of deuterium and a tritium
nucleus come together and form He-4 with the release of one neutron
and energy.
Uncontrolled fusion reactions are responsible for the fury of
hydrogen bombs. Physicists, however, have been attempting to
achieve controlled fusion reactions for many years and, although
great advances have been achieved, complete success is not yet
at hand. The fusion of hydrogen could be an almost unlimited
source of energy as hydrogen is so plentiful. The main dif®culty
lies in bringing the hydrogen nuclei close enough for the
attractive nuclear force to overcome the electrical repulsion.
Two techniques have been proposed for a practical fusion reactor:
magnetic con®nement and laser fusion. The magnetic con®nement scheme uses magnets to keep the protons con®ned to a
small doughnut-shaped region. The tokamak reactor is one
example of this technique. The Princeton Tokamak Fusion Test
Reactor has been able to con®ne protons for 16 of a second. The
second technique, the inertial con®nement method, uses lasers to
bring together protons so that fusion can take place. At the
Lawrence Livermore laboratory in California, lasers bombard a
deuterium-tritium pellet with enormous amounts of energy so
that the outer layers of the pellet burn away, causing the rest of
the pellet to implode and compress the fuel to densities high
enough to trigger nuclear fusion.
Applications of nuclear physics
Nuclear reactions have practical applications in archeology,
medicine, and industry. We shall review here some of the most
important applications.
546
Nuclear Physics
Pioneers of physics: Enrico Fermi (1901±1954)
After Lise Meitner and Otto Frisch proved experimentally
that the uranium nucleus did split into two fragments, Niels
Bohr and the American physicist John Wheeler developed
the liquid drop model to explain the newly discovered
phenomenon. One important feature of this model was the
release of neutrons as the uranium nucleus split, which
made a chain reaction possible.
After War World II started in 1939, the possibility that the
Nazis could use this discovery to build a powerful bomb
prompted the United States to start an urgent research program
in nuclear physics. In December 1941, Enrico Fermi was called
to Chicago to direct the ``uranium project.'' On December 2,
1942, Fermi's team achieved the ®rst controlled chain reaction
on an ``atomic pile,'' the ®rst nuclear ®ssion reactor.
Fermi had just been awarded the Nobel Prize in physics
for his work with neutrons as projectiles to produce nuclear
reactions. It had occurred to Fermi to bombard uranium
with neutrons to produce a new arti®cial element of higher
mass number. Although these transuranium elements were
later discovered in nature, they were not known at the time.
To Fermi's consternation, his discoveries were publicized by
the Italian press as a Fascist triumph.
Fermi had received his doctoral degree magna cum laude
from the University of Pisa in 1922, a few months before Mussolini seized power. He did postdoctoral work in Germany
and returned to Italy a few years later to become professor
of physics at the University of Rome. By the time of his trip
to Stockholm to accept the Nobel prize, life had become
very dif®cult for the anti-Fascist Fermi. His wife was Jewish
and Italy had just passed several anti-Jewish laws. The controlled Italian press was also increasingly critical of Fermi's
refusal to wear the Fascist uniform at the Nobel ceremonies
or to give the Fascist salute. The Fermis decided to sail for
the United States from Stockholm.
After the war, Fermi became professor of physics at the
University of Chicago where he taught until his early death at
the age 53 of stomach cancer. The element with atomic mass
100, discovered a year later, was named fermium in his honor.
547
SUPERSTRINGS AND OTHER THINGS
Radioactive dating
The known half-lives of certain radioactive elements can be used
as a clock for determining the age of some objects. This process is
known as radioactive dating. Carbon-14, a radioactive isotope of
carbon with a half-life of 5730 years, is used to date archeological
®ndings.
The upper atmosphere of the earth is continuously being
bombarded by cosmic rays. These are charged particles, electrons
and nuclei of hydrogen and helium together with some heavier
nuclei, moving at speeds close to the speed of light in the interstellar medium. When cosmic rays collide with the atoms in the
upper atmosphere, neutrons are liberated; these in turn collide
with nitrogen atoms to produce carbon-14. The reaction is
1
0n
147 N ! 146 C 11 H:
Even though carbon-14 is constantly being produced in the
earth's atmosphere, only about 1 in 109 carbon atoms are of this
radioactive isotope. All living plants absorb carbon dioxide
from the air, and chemically do not distinguish between radioactive and nonradioactive carbon. Since animals and humans
eat plants, all living things take in carbon-14 in the same proportion. When a plant or animal dies, no new C-14 is ingested, and
the carbon-14 present at death decays, with a half-life of 5730
years.
Carbon-14 decays through beta emission according to the
reaction
14
6C
! 147 N ÿ10 e :
Therefore, when the plant or animal dies, the ratio of C-14 to
ordinary C-12 begins to decrease as the C-14 decays, and from
this decrease the elapsed time since the death took place can be
determined. If, for example, the ratio of C-14 to C-12 in a
wooden artifact unearthed in an archeological site is one-fourth
of the ratio for living trees, we can infer that the tree from
which the artifact was made died two half-lives (11 460 years)
ago.
Carbon-14 dating is used to determine the age of fossils and
archeological artifacts made from organic materials up to 40 000
years (about seven half-lives). Beyond that there is not enough
548
Nuclear Physics
C-14 remaining in the objects for an accurate age determination.
Other isotopes with longer half-lives, such as U-238, with a
half-life of 4:5 109 years, can be used for the determination of
the age of rocks, for example.
Using these radioactive clocks, scientists have been able to
determine that hominids have been around for 3.5 million
years, that life on earth appeared some 3 to 4 billion years ago,
and that the earth itself was formed some four and a half billion
years ago.
Biological effects of radioactivity
Radioactive isotopes are used as ``labels'' that reveal the passage
of a substance through the body (®gure 23.12 (color plate)). They
are also used in radiation treatment, to destroy cancerous
growths from the body where surgery is either inadvisable or
impossible. However, radiation not only destroys malignant
tissue; it also destroys normal living cells by altering or damaging
the structure of key molecules.
The radioactive decay of nuclei produces different kinds of
ionizing radiation with different energies, reaching up to several
million electron-volts per photon. The term ``ionizing radiation''
means that this radiation interacts with matter, stripping electrons
off of atoms, thus forming ions. In addition to the alpha and beta
particles and the gamma rays already discussed (all produced in
nuclear processes), ionizing radiation also includes photons of
lower energies which are produced in atomic processes.
In recent years, radioactive isotopes have been used to trace
chemicals in several reactions as they move through an organism.
If an atom is replaced in a chemical reaction by its radioactive
counterpart, the compound is ``tagged'' and can be traced with
the aid of a detector. Radioactive isotopes can be effectively
detected by means of a scanning device. One such device is the
Positron Emission Tomography (PET) scanner, a machine that
produces cross-sectional views of the body, and requires only
low levels of radiation. Radioactive nuclides that decay through
emission of positrons (antielectrons) are introduced into the
body. One such decay is
12
7N
! 126 C 01 e :
549
SUPERSTRINGS AND OTHER THINGS
As we shall study in more detail in chapter 24, when these positrons encounter regular electrons, they annihilate each other,
emitting gamma rays. These gamma rays are emitted in pairs,
emerging in opposite directions. Gamma ray detectors that recognize photons emitted simultaneously but in opposite directions
are used to triangulate several such events so that the exact position where they were produced can be determined.
As we have said, ionizing radiation, which can be used to
diagnose, treat, and sometimes cure malignant growths in
humans, can also be harmful. Radiation damage in organisms is
due to ionization effects in cells that result in structural changes.
When such damage affects the DNA molecules of the reproductive organs, it is called genetic damage. This may lead to genetic
mutations which, in their great majority, are harmful. Radiation
damage which affects other cells in the body is called somatic
damage.
It is, of course, desirable to protect ourselves from ionizing
radiation. However, it is sometimes necessary to risk some radiation exposure when bene®ts can be derived from it, as in certain
medical applications when other methods are not feasible. It is,
nevertheless, impossible to avoid all radiation exposure because
cosmic rays and radioactive rocks are natural sources of radiation
that deliver a dose every year to each person equivalent to the
dose received from medical X-rays by the average person in
one year.
Uranium is ubiquitous in the earth's crust. The radioactive
isotope uranium-238 alpha-decays into thorium-234, itself a
radioactive isotope which beta-decays into another radioactive
isotope. After several decays, the stable isotope lead-206 is produced and the sequence of nuclides that results is called a decay
series. One of the products of this decay series is radium-226
which alpha decays into radon-222. The radioactive isotope
radon-222 is an inert gas, invisible and odorless, which may
seep into a house to build up a hazardous concentration. When
radon decays, with a half-life of 3.82 days, it emits an alpha
particle of 5.5 MeV energy in the reaction
222
Rn ! 218 Po 4 He
and this is followed by a series of other alpha and beta particles
produced in the subsequent decays of the remaining nuclides
550
Nuclear Physics
Figure 23.13. (a) Commercially-available alpha-track detector to measure radon concentration in homes. (b) Activated-charcoal detector.
in the decay series. The total energy released in these processes is
21.5 MeV. When a person breathes in radon, and other nuclides in
the decay series ± some of which are solid and might lodge in the
lung ± these emit radioactive alpha and beta particles that can
cause biological damage.
Although the average house contains radon at a concentration of about one-fourth what the EPA considers as the threshold
of concern, recent tests have determined that nearly one third of
the residences tested by the EPA exceeded that threshold. Fairly
simple devices can be used to record the concentration of radon
in homes. They fall into two categories: alpha-track devices and
activated-charcoal detectors. Alpha-track devices, developed by
General Electric's Research and Development Center as part of
the lunar exploration program during the 1970s, consist of a
small sheet of polycarbonate plastic kept in a plastic bottle
(®gure 23.13(a)). When an alpha particle strikes the plastic
sheet, it causes a microscopic pockmark which can be identi®ed
by a laboratory with the appropriate equipment. In a typical
test, fewer than 100 pocks are produced in a span of three
months; for this reason, the alpha-track detectors are best for
long-term measurements. For short-term measurements, the
activated-charcoal detectors are better. These are ¯at canisters
of activated-charcoal granules which trap radon gas (®gure
23.13(b)). After a short period of exposure to the air in the
house, they are sent back to a laboratory for analysis.
551
SUPERSTRINGS AND OTHER THINGS
Physics in our world: Proton beams for cancer
therapy
A sophisticated machine built by the Fermi National Accelerator Laboratory to accelerate protons has been installed at
the Loma Linda University Medical Center where it is
being used to treat rare, super®cial skin cancers. The 50-ton
machine strips electrons from hydrogen atoms and accelerates the remaining protons to great speeds.
The patient is partly enclosed in a tube at the center of a
three-story-high, 90-ton rotating gantry, which directs the
proton beam to its target for about a minute. Like the electron
beam in a television set or computer monitor, the proton
beam at Loma Linda scans a thin slice of tumor line by line.
By varying the number of protons in the beam and their
energy, the beam can be distributed directly in successive
layers inside the tumor.
Unlike X-rays, which transmit some of their ionizing
radiation over a few centimeters and are not completely
stopped by the tissue, proton beams are stopped completely
by the tumor, where they deposit most of their energy, leaving
surrounding tissue untouched. The malignant tissue is
destroyed as the beam splits apart DNA molecules. Scientists
at Loma Linda University hope to be able to extend the use
of the machine to treat localized tumors in the lung and
pancreas.
552
24
ELEMENTARY
PARTICLES
Antimatter
Paul Adrien Maurice Dirac was born in Bristol, England, on
August 8, 1902, one of the three children of Charles Dirac, a
Swiss eÂmigreÂ, and Florence Holten. Dirac was brought up to be
bilingual in French and English. ``My father made the rule that I
should only talk to him in French. He thought it would be good
for me to learn French in that way. Since I found that I couldn't
express myself in French, it was better for me to stay silent than
to talk in English. So I became very silent at that time ± that started
very early.'' Dirac remained very reserved throughout his life.
``My father always encouraged me towards mathematics . . . He
did not appreciate the need for social contacts. The result was
that I didn't speak to anybody unless spoken to,'' he was to confess
later. ``I was very much an introvert, and I spent my time thinking
about problems in nature.''
At the age of 16, Dirac entered the University of Bristol
where, afraid that there would be no jobs in mathematics, he
studied engineering, graduating in 1921. Ironically, he could
not ®nd a job with his engineering degree and decided to stay
on to study pure and applied mathematics. In 1923, he enrolled
at Cambridge where he received his PhD in May of 1926, with
a thesis entitled Quantum Mechanics. In 1932, at the age of 30, he
became Lucasian Professor of Mathematics at Cambridge, the
chair once held by Newton.
In 1927 Dirac attended the Solvay Conference, a periodic
meeting of prominent physicists paid for by the wealthy Belgian
industrial chemist Ernest Solvay, where he met Einstein for the
®rst time. During a conversation with Bohr at the conference,
553
SUPERSTRINGS AND OTHER THINGS
Bohr asked Dirac what he was working on. ``I am trying to get a
relativistic theory of the electron,'' answered Dirac. Bohr claimed
that the problem had already been solved. Dirac did not agree.
Dirac knew that quantum mechanics and relativity theory
were both correct. Quantum mechanics, however, could not be
applied to quantum particles moving at relativistic speeds. Since
the wave properties of the electron had already been con®rmed,
Dirac decided to develop a relativistic quantum theory of the electron using the concept of quantum ®eld. According to this approach,
every particle type has its own ®eld. The ®eld of the photon is the
electromagnetic ®eld. The electron also belongs to a ®eld, now
called the Dirac ®eld. These ®elds interact with each other according
to laws expressed in certain quantum ®eld equations.
Dirac's theory ± the ®rst quantum ®eld theory ± is known today
as relativistic quantum ®eld theory. It is Maxwell's electromagnetism
applied to relativistic quantum particles. In his theory, Dirac
generalized the SchroÈdinger equation to include electrons
moving at relativistic speeds with a new equation now called
the Dirac equation. This equation had remarkable and profound
consequences. ``The relativistic wave equation of the electron
[the Dirac equation] ranks among the highest achievements of
twentieth-century science,'' says Abraham Pais of Rockefeller
University.
The Dirac equation predicted the observed properties of the
electron. For example, it predicted that an electron had an intrinsic spin angular momentum. The effects of this spin of the electron had been observed experimentally, although SchroÈdinger's
equation did not account for it.
The most remarkable prediction of the Dirac equation had to
do with its two solutions for the energy of an electron, one positive
and the other negative. What to do with the negative energy solutions? Nobody knew what electrons with negative energy could
be. ``I was reconciled to the fact that the negative energy states
could not be excluded from the mathematical theory,'' wrote
Dirac, ``and so I thought, let us try and ®nd a physical explanation
for them.'' The physical explanation that Dirac eventually gave
was that the negative energy solution described a new kind of
particle with the same mass as the electron but with positive electric
charge ± opposite to that of the electron. He initially called it the
antielectron. That was a bold prediction. ``I didn't see any chance
554
Elementary Particles
of making further progress,'' he wrote later. ``I thought it was
rather sick.'' To his great triumph, antielectrons or positrons, as
they were later called, were discovered four years later in cosmic
rays by Carl D Anderson of the California Institute of Technology.
Physicists soon realized that Dirac's theory predicted the
existence of other antiparticles, in addition to the positron. For
every quantum particle there was an antiparticle. These antiparticles constitute a new kind of matter, antimatter, identical to
ordinary matter except that if the particle has electric charge, its
antiparticle has the opposite charge. When a particle encounters
its antiparticle, they annihilate each other, disappearing in a
burst of photons in a process called pair annihilation (®gure
24.1(a)). The inverse process, in which high energy photons
create a positron-electron pair is also possible (®gure 24.1(b)).
This process is called pair production. Recall that, according to
the special theory of relativity, an electron has a rest energy
m0 c2 . Thus, photons of energy hf equal to at least 2m0 c2 can
produce an electron and a positron.
The fundamental forces
Before we continue delving more deeply in to the structure of
matter, let us summarize what we have learned so far about the
known interactions in nature. All the variety of phenomena in
the universe, from the explosion of a supernova in another
galaxy to the falling of a leaf during an autumn afternoon on
Earth, from the appearance of a dark spot on the atmosphere of
a distant star, the explosion of a volcano on Io or the collapse of
a distant red giant to the whirr of the wings of a hummingbird,
all of these events can be ultimately explained by only four fundamental forces. Everything that happens anywhere in the universe
is ultimately controlled by the operation of these four forces.
Understanding the properties of these forces ± gravity, electromagnetism, and the strong and weak nuclear forces ± is, perhaps,
the most important task in physics today.
. Gravity, the ®rst one of the four forces to be discovered, was
formulated by Newton in his universal law of gravitation.
Gravity controls a wide range of phenomena, from the falling
of an apple or the motions of ocean tides to the expansion of
555
SUPERSTRINGS AND OTHER THINGS
Figure 24.1. (Top) Pair annihilation and (Bottom) pair production.
(Courtesy Brookhaven National Laboratory and Lawrence Berkeley
Laboratory.)
the universe. Newton's universal law of gravitation is a simple
inverse-square law; that is, the gravitational force is a longrange force that decreases in strength in proportion to the
square of the distance between the two interacting bodies.
Newton's theory, however, fails to explain why gravity exists.
556
Elementary Particles
Table 24.1. Properties of the fundamental forces
Force
Strong
Electromagnetic
Weak
Gravitational
Range
ÿ15
10 m
in®nite
10ÿ17 m
in®nite
Relative strength
1
7:3 10ÿ3
10ÿ5
6 10ÿ39
Einstein's general theory of relativity presented gravity not as
a force but as a product of geometry, the geometry of spacetime. In doing so, Einstein placed gravity farther away from
the rest of physics.
. The electromagnetic interaction is described by Maxwell's equations. These equations concisely summarize all known electric
and magnetic phenomena. The electromagnetic force is
responsible for the binding of atoms and molecules; it binds
electrons to nuclei and atoms and binds atoms together into
molecules, holding them together in solids. Like Newton's
universal law of gravity, the electromagnetic force is an
inverse-square force.
. The strong force, one of the two nuclear forces, holds the
nucleons together in a nucleus. It is a short-range force, becoming negligible at distances greater that 10ÿ15 m. The strong
force acts on protons and neutrons but not on electrons, neutrinos or photons. The strong force is 137 times stronger than the
electromagnetic force.
. The weak force is also a short-range nuclear force. It is responsible for radioactive beta-decay processes, such as the transformation within the nucleus of a neutron into a proton or a
proton into a neutron. The weak force controls many of the
reactions that produce energy in the Sun and the stars. The
weak force is of the order of a hundred thousand times
weaker than the strong force.
Table 24.1 summarizes the properties of the fundamental
forces. As we can see, gravity is the weakest of the forces. The
gravitational force of attraction between two electrons, for example, is 1036 times weaker than their electrostatic repulsion.
The gravitational force between large bodies becomes important
because it involves a tremendous number of atoms and because
these atoms are neutral, so that their electrostatic interaction
557
SUPERSTRINGS AND OTHER THINGS
cancels out. The two nuclear forces are short-range and play no
role in the interaction between large bodies at normal distances.
Exchange forces
What is a force? How does a force work? How does an electron
know that there is another electron nearby or the Earth that the
Sun is 150 million km away? How is a force transmitted?
Newton never attempted to explain how his universal law of
gravitation worked. Gravity was accepted as an action at a distance
phenomenon. Faraday introduced the concept of ®eld to explain
how electric and magnetic forces were transmitted. Einstein, in
his general theory of relativity, said that gravity was geometry.
According to Einstein, the presence of an object disturbs the
space-time around it, altering its geometry, so that when a
second object enters this space, it experiences the distortions.
In 1928 Dirac developed his relativistic quantum theory of
the electron by combining Maxwell's electrodynamics, relativity
and quantum mechanics. Dirac's theory, a quantum ®eld
theory, was further developed by Richard Feynman, Sin-Itiro
Tomonaga and Julian Schwinger into a complete theory, known
today as quantum electrodynamics or QED. According to electrodynamics, when an electron accelerates, it radiates energy in
the form of an electromagnetic wave. From quantum mechanics,
we know that this electromagnetic wave consists of photons.
Thus the acceleration of an electron results in the emission of
one or more photons. When an electron approaches another
electron, the mutual electrostatic repulsion that results is interpreted in QED as due to an interaction with the photons emitted
by the accelerating electrons. According to QED, the approaching
electron emits a photon which is then absorbed by the second
electron. We can illustrate this process with the diagram shown
in ®gure 24.2. An electron is represented by a straight line and
the photon they exchange by a wiggly line. These diagrams
were ®rst used by Richard Feynman as symbolic representations
of equations and are known today as Feynman diagrams.
Where do the photons exchanged between two interacting
electrons come from? From nowhere. They are allowed by the
uncertainty principle. The electron creates a photon which is
558
Elementary Particles
Figure 24.2. Feynman diagram of the interaction between two electrons.
The two electrons exchange virtual photons which carry energy and
momentum. This exchange causes the electromagnetic repulsion between
the two electrons.
absorbed by the interacting electron before the change in energy
associated with its creation can be detected. If the uncertainty in
our knowledge of the energy of the electron is E and t the
uncertainty in our knowledge of the time during which the
electron has that energy, Heisenberg's uncertainty principle
states that
h
:
E t
2
This expression means that during the interval t a photon of
energy as large as E is created. If the second electron absorbs
this photon before the time t is up, no experiment can detect
any missing energy. Particles that exist for only the time allowed
by the uncertainty principle are called virtual particles. A virtual
photon exists only for the brief, ¯eeting moment permitted by the
uncertainty principle. Real photons exist forever, provided they
do not interact with other particles.
Quantum ®eld theory provides an answer to the questions
that opened this section. All the forces of nature can be explained
as being the result of the exchange of some virtual particle.
Pions
Is it possible to understand the strong interaction between
nucleons in terms of the exchange of virtual particles? The Japanese physicist Hideki Yukawa made such a suggestion in 1933.
559
SUPERSTRINGS AND OTHER THINGS
``By confronting this dif®cult problem I committed myself to long
days of suffering,'' writes Yukawa in his autobiography. ``Is the
new nuclear force a primary one? . . . It seemed likely that [it]
was a third fundamental force unrelated to gravitation and electromagnetism . . . Perhaps the nuclear force could ®nd expression
as a ®eld . . . If one visualizes the force ®eld as a game of `catch'
between protons and neutrons, the crux of the problem would
be the nature of the `ball' or particle.''
Heisenberg had had a similar idea before and had proposed
that the ``ball'' was the electron. At ®rst, Yukawa thought also that
the electron was the intermediary particle. Soon, he realized that
this was not possible. ``Let me not look for the particle that belongs
to the nuclear force among the known particles . . . If I focus on the
characteristics of the nuclear force ®eld, then the characteristics of
the particle I seek will become apparent,'' he wrote. Then came the
stroke of genius. ``The crucial point came to me in October [1934].
The nuclear force is effective at extremely small distances. My new
insight was that this distance and the mass of the new particle are
inversely related to each other.''
If a virtual particle of mass m can only exist for a time t
allowed by the uncertainty principle and we assume that this
particle can travel at the speed close to c, the maximum range R
of the particle (that is, the maximum distance that the particle
can travel during its existence) is
R ct:
The uncertainty in energy equals the mass energy of the virtual
particle
E mc2 :
The uncertainty principle E t h=2 gives us
h
h
t
:
2 E 2mc2
The maximum range of the virtual particle is
h
R
:
2mc
The maximum range is then inversely proportional to the mass. If
the mass is zero, the range becomes in®nite. A massive virtual
particle has a ®nite range; the more massive the particle is, the
shorter the range. The electromagnetic interaction had an in®nite
560
Elementary Particles
Figure 24.3. Feynman diagram for the strong interaction between two
protons via the exchange of a virtual pion.
range and was mediated by a massless particle, the photon. The
nuclear force, a very short-range force, had to be mediated by a
massive particle. Knowing that nuclear forces occur over less
than 10ÿ14 m Yukawa calculated the mass of the new particle to
be 140 MeV/c2 . Because its mass fell between the mass of the
electron and the mass of the proton, the new particle was called
meson (from the Greek meso, ``middle'').
Three years later, in 1937, a team of American scientists found
a particle with a mass of about 100 MeV/c2 and it was immediately assumed that it was the meson predicted by Yukawa. Experiments carried out in 1945, however, showed that this new particle
interacted weakly with protons and neutrons and consequently
could not be the mediator of the nuclear force. The following
year, the real Yukawa particle, the particle exchanged in the
nuclear interaction, was found in cosmic rays and was named
the meson or pion. (The earlier particle, later called the muon,
turned out to be a completely different beast that participates
only in weak and electromagnetic interactions and has nothing
to do with Yukawa's particle.) There are three kinds of pions:
; ÿ , which is the antiparticle of the ; and 0 , which is its
own antiparticle. Figure 24.3 illustrates the strong interaction
between two protons via the exchange of a virtual pion.
Particle classi®cation: Hadrons and leptons
In the subsequent years, experiments with cosmic rays and with
particle accelerators yielded a plethora of new particles. In
561
SUPERSTRINGS AND OTHER THINGS
addition to the protons, neutrons, pions, muons, neutrinos and
photons, particles with names like sigma, delta, lambda, tau
meson, and kaon, were added to the arsenal of high-energy
physicists. What to make of all these particles?
To make sense of the hundreds of particles that have been
discovered, a classi®cation scheme based on the interaction in
which they participate has been devised. Particles that participate
in the strong interaction, the interaction that holds the nucleus
together, are called hadrons (from the Greek hadros, meaning
strong). Hadrons with half-integer spin in units of h=2 (such as
the proton or the neutron) are called baryons, whereas hadrons
with integer spin (such as the pion) are called mesons. Since all
baryons, except the proton itself, ultimately decay into protons
and other particles, they are all more massive than the proton
(baryon means heavy in Greek).
Particles that do not interact via the strong force are called
leptons, from the Greek leptos, meaning light. The electron, the
neutrino, the muon and the tau particle are all leptons. To the
best of anyone's knowledge, the muon is exactly the same as
the electron except that it is 200 times more massive. The same
can be said for the tau; in this case, it is 3500 times heavier
than the electron. All leptons have spin 12 and all are truly elementary particles, without any internal structure. As far as we
know, leptons are point particles with no dimensions. The
most precise experiments have determined that the electron is
no larger than 10ÿ17 cm in diameter. This is one thousandmillionth the size of an atom and one ten-thousandth the size
of a proton.
In a weak interaction involving the electron, like the decay
of the neutron, one kind of neutrino is involved, the electron
neutrino. In the decay of the pion, however, another kind of
neutrino, the muon neutrino, is involved. When this neutrino
strikes a target, it produces muons and not electrons. There is
a third type of neutrino emitted in conjunction with the tau
particle, the tau neutrino. There is then a grouping of leptons
into pairs or generations, the electron with the electron
neutrino, the muon with the muon neutrino, and the tau particle
with the tau neutrino. Each one of these leptons has its
corresponding antiparticle. This classi®cation is summarized in
Table 24.2.
562
Elementary Particles
Table 24.2. Lepton classi®cation
Generation
First
Second
Third
Charged lepton
ÿ
Electron, e
Muon, ÿ
Tau, ÿ
Neutrino
Charged antilepton
e
Positron, e
Antimuon,
Antitau,
Antineutrino
e
Conservation laws
Interactions between elementary particles take place according to
a set of conservation laws. One such law is the principle of conservation of energy. In any process between particles, the mass energy
of the decay products must equal the mass energy of the original
particles. It was the principle of conservation of energy that led
Wolfgang Pauli in 1932 to predict the existence of the neutrino
from the observation of an apparent violation of this law in
beta-decay processes. The neutrino was found in 1950.
Conservation of electric charge is another very important law. It
says that the sum of all the charges of the original particles must
equal the sum of the charges of the particles produced in the
interaction. For example, in the reaction between two nucleons
that produces pions:
p n ! p p ÿ
electric charge:
10!11ÿ1
the sum of the charges of the original nucleons is 1 and the sum of
the charges of the products is 1 1 ÿ 1 1:
Conservation of energy and conservation of charge ± two
laws we had encountered before ± are, as far as we know, universal laws. We mentioned before that all baryons ®nally decay into
protons. This is a consequence of another important conservation
law, the law of conservation of baryons. In any interaction, the creation of a baryon must be accompanied by the creation of an antibaryon.
If we assign a baryon number B 1 to all baryons, B ÿ1 to all
antibaryons, and B 0 to all other particles, this law can also be
stated as follows:
Law of Conservation of Baryons: In any interaction, the total baryon
number must remain constant.
563
SUPERSTRINGS AND OTHER THINGS
Since the proton is the lightest baryon, all other baryons must
ultimately decay to a proton, which can decay no further without
changing baryon number.
Leptons also obey similar conservation laws. Leptons, as we
recall, participate in weak interactions. A weak decay process is
the decay of the neutron into a proton, an electron and an electron-neutrino. Another weak decay is that of the pion, which
decays into a muon and a muon-neutrino. In all of these processes,
the creation of a lepton always takes place with the creation of an
antilepton. If the leptons (electron, muon and tau) are assigned a
lepton number 1 and the antileptons a lepton number ÿ1, this
principle can be stated as follows:
Principle of Conservation of Leptons: In all interactions between
particles, the total lepton number for each variety (electron, muon,
and tau) must remain constant.
As we indicated before, each charged lepton has its own associated neutrino, so that a pion cannot decay into a muon and an
electron-neutrino. The electron and the electron-neutrino are
then assigned an electronic lepton number Le 1. The positron
and the positron-neutrino are assigned an electronic lepton
number Le ÿ1. All other leptons and in fact all other particles
are assigned an electronic lepton number Le 0. The muon and
the muon-neutrino, on the other hand, are assigned a muonic
lepton number L 1; the antimuon and its associated neutrino
have L ÿ1. Similarly for the tau and the tau-neutrino. Table 24.3
Table 24.3. Lepton quantum numbers
Lepton
Le
L
L
eÿ
e
ÿ
ÿ
e
e
1
1
0
0
0
0
ÿ1
ÿ1
0
0
0
0
0
0
1
1
0
0
0
0
ÿ1
ÿ1
0
0
0
0
0
0
1
1
0
0
0
0
ÿ1
ÿ1
564
Elementary Particles
summarizes the lepton quantum numbers of the six leptons and
their antiparticles.
Strange particles
There are some important differences between hadrons and leptons. Hadrons have a de®nite extension in space whereas leptons
behave like point particles. In addition, there are only six leptons
(and their corresponding six antiparticles), while the hadrons, it is
believed today, are in®nite in number. Moreover, all leptons obey
the same law of conservation of lepton number. For hadrons, only
the baryons have a conservation law; there is no meson number
conservation law.
To complicate matters, new particles were discovered in the
early 1950s. These were produced via the strong interaction,
making them hadrons; but they decayed in strange ways, as if
they were leptons decaying through the weak interaction. These
particles, the K mesons or kaons, and the (Greek capital
lambda) and (Greek capital xi) baryons, were called strange
particles owing to their strange behavior. In 1952, Abraham Pais
of the Institute for Advanced Studies in Princeton introduced
the idea of associated production, according to which, the strange
particles were always produced in pairs.
In 1956 Murray Gell-Mann in the United States and Kazuhiko Nishijima in Japan independently suggested that Pais's associated production phenomenon was the result of the conservation
of a new property, which they called strangeness. It turned out that
strangeness, with a new quantum number S, must be conserved
in strong interactions. The strange particles can only be produced
in pairs of overall zero strangeness.
Quarks
In 1961, Gell-Mann, and independently Yuval Neeman in Israel,
noticed a pattern in the hadrons which was based on a mathematical symmetry which they called SU(3). The mathematical tool
that Gell-Mann and Neeman used to discover this symmetry is
called group theory, a technique that had been formulated by a
twenty-year-old French mathematician, Evariste Galois, the
565
SUPERSTRINGS AND OTHER THINGS
night before his death in 1832. Galois had become involved in a
duel challenge over a woman and, anticipating his death, spent
his last night writing out his ideas on group theory. Galois was
killed but his ideas survived, and Gell-Mann and Neeman used
them in their theory. According to the symmetric group SU(3),
each hadron is a member of a speci®c family of hadrons with 1,
8, 10, and 27 members each. Gell-Mann called this scheme the
eightfold way. The proton and the neutron belong to the same
family of eight hadrons, called the baryon octet. The pion also
belongs to a family with eight members called the meson octet.
The eightfold way was very successful in the prediction of
unknown particles. The most spectacular prediction was that of
the ÿ (Greek capital omega minus) particle, the tenth member
of a family of hadrons called a decuplet. The mass and the quantum properties of the ÿ had been predicted by Gell-Mann in
1962. In December 1963 the ÿ was found by a team of physicists
at Brookhaven National Laboratory with exactly the properties
predicted by the eightfold way.
The questions that scientists were asking then were: Why did
the eightfold way work? Why were the hadrons grouped into
families of 1, 8, 10, and 27 members each? It was again GellMann with yet another physicist working independently,
George Zweig, who provided the answers. Zweig who, like
Gell-Mann, was a professor of physics at Caltech, developed his
ideas while on sabbatical leave at CERN, the European research
center in Geneva. They suggested that the families in the eightfold
way could all be generated from a basic triplet which corresponded to three new fundamental particles that made up all
the hadrons. Hadrons, according to the new theory, were not
elementary particles but were composed of members of a fundamental triplet that Gell-Mann called quarks. A proton, for example,
was supposed to be composed of three quarks. Gell-Mann
borrowed the name quark from a passage in James Joyce's novel
Finnegans Wake that reads:
Three quarks for Muster Mark!
Sure he hasn't got much bark
And sure any he has it's all beside the mark.
The ``three quarks'' may refer to the three children of Mr Mark (or
Mr Finn) who occasionally represent him.
566
Elementary Particles
Table 24.4. Quantum numbers of the original three quarks and their antiparticles
Quark
Charge (e)
Baryon number
Strangeness
Spin
u
d
s
u
d
s
13
ÿ 13
ÿ 13
ÿ 23
13
13
1
3
1
3
1
3
ÿ 13
ÿ 13
ÿ 13
0
0
ÿ1
0
0
1
1
2
1
2
1
2
1
2
1
2
1
2
The three quarks in the original theory are point particles like
the electron and with the same spin 12. According to this theory, all
hadrons are made of combinations of two or three quarks, called
the up or u quark, down or d quark and strange or s quark. The
three varieties of quark are known as ¯avors. The term comes
from the initial whimsical use of the names ``chocolate,''
``vanilla'' and ``strawberry'' for the three quarks. Although
these names did not stick, the term ``¯avor'' for the different
types of quark did.
Quarks have unusual quantum numbers (table 24.4). The
most unusual is the fractional electric charge. The baryon
number is also fractional. All baryons, like the proton and
neutron, are combinations of three quarks, and their antiparticles
are combinations of three antiquarks. With three quarks to a
baryon, the baryon number of a quark has to be 13. Mesons, on
the other hand, are combinations of a quark and an antiquark,
giving a baryon number of zero, as it should be, since they are
not baryons. The rules of SU(3) govern the combination of
quarks into baryons and mesons to produce the correct baryon,
charge and other quantum numbers. According to these rules,
the proton is composed of two u quarks and one d quark, as
shown in ®gure 24.4(a). As shown in table 24.4, the u quarks
Figure 24.4.
(a) Quark structure of the proton and (b) the pion.
567
SUPERSTRINGS AND OTHER THINGS
Pioneers of physics: Gell-Mann's quark
James Joyce's last novel, Finnegans Wake, is a massive, dauntingly obscure work in which technique is more important
than content. The story centers on a Dublin pub owner
named H C Earwicker who dreams throughout the entire
novel. Through his dreams Earwicker reenacts myths and
great historical events in a haphazard progression of re¯ection, complete with paradoxes, repetitions and sudden
changes of focus. The novel is also full of puns, deliberate
misspellings and curious linguistic turns, beginning with
the missing apostrophe in the book's title.
When Gell-Mann was struggling with his eightfold way,
he was trying to explain to a colleague why some of the
conclusions of his calculations seemed crazy because they
implied the existence of fractional electric charge. Thinking
more about it the following day, Gell-Mann began to accept
the strange conclusions. Because these fractional charges
were so peculiar, he used an odd term for them: quork.
Several months later, when he was ready to publish his
idea, he found himself reading some passages in Finnegans
Wake (``you know how you read Finnegans Wake,'' he said in
an interview), when he came across ``Three quarks for
Muster Mark.'' That was it! Three quarks make a proton or a
neutron! Although Joyce's word rhymes with bark, it was
close enough to his quork. Quark it was from then on.
have a charge of 23 and the d quark has a charge of ÿ 13; the total
charge of the proton is then 23 23 ÿ 13 1, in units of
the fundamental charge e, which is the charge of the proton.
The pion is composed of a u quark and an anti-d quark. Its total
charge is 23 13 1.
Particles with charm
In 1973, the theoretical physicist Sheldon Glashow of Harvard
University and his collaborators developed a more complete
theory of quarks based on mathematical symmetries which led
568
Elementary Particles
them to propose the existence of a fourth quark ¯avor which
Glashow named the charm or c quark. The introduction of this
fourth quark was done on a theoretical basis alone, for no hadrons
had been found that required its existence. In November of 1974
two experimental teams, one led by Samuel Ting at Brookhaven
National Laboratory and the other led by Burton Richter at
Stanford Linear Accelerator simultaneously announced their
independent discovery of a new particle that behaved like a
meson but was heavier than the proton. Ting, who is of Chinese
descent, named the particle the J because his name in Chinese is
written with a character similar to this letter. The Stanford group,
however, had named the particle the (Greek capital psi). Since
the discovery was made simultaneously by the two groups, the
particle is known today as the J= . The behavior of this particle
could only be explained in terms of the c quark predicted by Glashow. Some time later, other charged mesons were discovered
that con®rmed the existence of the c quark. Ting and Richter
won the Nobel prize in physics in 1976 for their work.
In the early 1970s, there were four quark ¯avors and four
leptons. In 1975, experiments conducted at Stanford University
led to the discovery of the ®fth lepton, the tau, and this discovery
persuaded physicists to consider the possibility of a ®fth and even
sixth quark. These quarks were named bottom or b quark and top
or t quark. In 1977, Leon Lederman and his group at the Fermi
National Laboratory reported the discovery of a very massive
meson, the , which consists of a b±b pair, thus con®rming the
existence of the b quark. Evidence for the t quark was found at
Fermilab in 1995. Table 24.5 lists the quantum numbers of the
six quarks.
Table 24.5. Quark quantum numbers
Quark
Charge (e)
Baryon number
Strangeness
Charm
Bottom
Top
u
d
s
c
b
t
23
ÿ 13
ÿ 13
23
ÿ 13
23
1
3
1
3
1
3
1
3
1
3
1
3
0
0
ÿ1
0
0
0
0
0
0
1
0
0
0
0
0
0
ÿ1
0
0
0
0
0
0
1
569
SUPERSTRINGS AND OTHER THINGS
Table 24.6. Quark classi®cation
Generation
Quark
First
Second
Third
u
s
b
d
c
t
Just as the six leptons were classi®ed into three generations,
the six quarks are also classi®ed in three generations each
containing two associated quarks or ¯avor doublets (Table 24.6).
The three generations of quarks and leptons are shown in table
24.7.
Table 24.7. Quarks and leptons
Charge
Symbol
Name
23
ÿ 13
0
ÿ1
u
d
e
e
Up
Down
Electron
Electron
neutrino
23
ÿ 13
0
ÿ1
c
s
Charm
Strange
Muon
Muon
neutrino
23
ÿ 13
0
ÿ1
t
b
Top
Bottom
Tau
Tau
neutrino
570
25
SUPERFORCE:
EINSTEIN'S DREAM
Symmetry
Gottfried Wilhelm Leibniz, the seventeenth century German
philosopher and mathematician, formulated the principle of the
identity of the indiscernibles, which states that if it is not possible
to establish a difference between two objects, they are identical.
Since it is not possible to distinguish between two identical
objects, an interchange in their positions has no effect in the
physical state of the two objects or of the system they belong to.
This interchange is an example of what a mathematician calls
symmetry.
When a system remains unchanged after some operation is
performed on it (such as the exchange of two identical particles)
we say that the system is invariant under that particular operation.
Another example of a symmetric operation is the rotation of
crystals and other objects that have symmetry. If we rotate a
snow¯ake through an angle of 608, the new orientation is indistinguishable from the original orientation. Rotating a cube through
an angle of 908 around any one of its three axes gives us back
the original orientation. A sphere can be rotated through any
angle around any axis and the new position is indistinguishable
from the original. We say that the sphere is invariant under any
rotation. Figure 25.1 illustrates these symmetries.
After a symmetric operation, such as the exchange of two
identical electrons in an atom, the mathematical equation that
describes the system must remain invariant. To deal with these
symmetries, physicists use a mathematical technique called
group theory, which we mentioned in connection with GellMann's Eightfold Way. Group theory was developed by Galois
571
SUPERSTRINGS AND OTHER THINGS
Figure 25.1. A snow¯ake is unchanged when rotated 608. A rotation of
908 around an axis keeps the cube unchanged. A sphere can be rotated
through any angle and remain unchanged.
in France in 1831. Towards the end of the nineteenth century, the
Norwegian mathematician Sophus Lie classi®ed all possible
groups of a particular type into seven classes. The Lie group
O(3), for example, describes the symmetry of a billiard ball; the
ball looks exactly the same after it is rotated through any angle.
The connection between symmetry and the laws of physics
stems from the work of the brilliant German mathematician
Amalie Emmy NoÈther. Born in the university town of Erlangen
in 1882, NoÈther attended the all-male University of Erlangen
where her father was a professor of mathematics, after obtaining
special permission to enroll. In 1907 she received her PhD degree
summa cum laude with a dissertation on invariants. After graduation, she taught without salary at a mathematical school and
occasionally substituted for her father at the university. In 1915,
she joined a research team working on Einstein's general relativity at GoÈttingen but was not accepted at the university even with
an unpaid contract. The eminent mathematician David Hilbert,
who supported her appointment and even had her lecture in
his classes, angrily said, ``I do not see that the sex of the candidate
is an argument against her admission as [an instructor]. After all,
we are a university and not a bathing establishment.'' She ®nally
won admission as a university instructor in 1919. By 1930 NoÈther
had become the center of mathematical research at GoÈttingen. In
1933, she ¯ed Nazi Germany for the United States and became a
visiting professor of mathematics at Bryn Mawr College and a
member of the research faculty at the Institute for Advanced
Study in Princeton. After her death in 1935, Einstein wrote of
her work in The New York Times:
572
Superforce: Einstein's Dream
Pure mathematics is, in its way, the poetry of logical ideas. One seeks
the most general ideas of operation which will bring together in simple,
logical and uni®ed form the largest possible circle of formal relationships. In this effort toward logical beauty spiritual formulas are discovered necessary for the deeper penetration into the laws of nature.
The theorem that made NoÈther famous among physicists
was developed for the paper that she presented before the faculty
at GoÈttingen as part of her application for the position, as was
customary in German universities. NoÈther's theorem can be
stated as follows:
For every continuous symmetry in the laws of physics there exists a
corresponding conservation law.
A continuous symmetry is one in which the corresponding transformation can be varied continuously, as is the case with rotation.
The angle of rotation can be continuously changed. This symmetry leads to the conservation of angular momentum.
Global and local symmetries
NoÈther's elegant theorem prompted physicists to re-examine the
conservation laws from the perspective of symmetry. In 1918, the
year that NoÈther introduced her theorem, the physicist Hermann
Weyl was attempting to formulate a theory that would combine
electromagnetism and the general theory of relativity by showing
that both theories were linked to a symmetry of space. He
proposed a theory that remained invariant under arbitrary
space dilations or contractions in which the standards of length
and time changed at every point in space-time. This reminded
him of the gauge steel blocks used by railroad engineers as
standards of length in their determination of the distance
between tracks and so he referred to the invariance in his
theory as gauge invariance.
The gauge invariance in Weyl's theory was a local one because
the standards of length and time changed at every point. In a global
gauge invariance, on the other hand, the symmetric transformation must not change at every point but must be the same everywhere at once. An example of global symmetry is the charge
symmetry of electromagnetism. Suppose that after we carefully
573
SUPERSTRINGS AND OTHER THINGS
Figure 25.2. Changing the sign of all the individual charges produces
no effect on the forces between them.
measure the interactions between all pairs of charges in some
region of space the sign of each individual charge is changed.
Measuring the forces between all pairs again produces exactly
the same result. The forces are unchanged by a change in the
sign of all particles in the region of interest (®gure 25.2).
As another example of global symmetry, suppose that one
evening some phenomenon causes the dimensions of everything
on the earth to be reduced by one half, including the earth itself.
When you wake up the next morning, everything will appear to
be normal. There is no way to determine that a change took place
because everything changed in the same proportion; all the
measuring devices were reduced in the same proportion and
the phenomenon is undetected. Precision measurements of the
diameter of the moon, the orbits of the planets and the frequency
of the light reaching us from the sun and other stars would of
course make possible the detection of this hypothetical phenomenon. Therefore, to be precise, we would have to say that the
change has to take place in the entire universe at once.
Clearly, if only some objects change size, the phenomenon is
easy to detect. How can we then have local symmetries? That is,
how can we produce different changes at different points or to
different objects and still keep the system unchanged? By ®nding
another part of the system that produces a compensating change. If
you are photographing an approaching water skier and want the
size of the image to remain constant in your view®nder, you can
zoom out to compensate. If you are skillful enough, you may
succeed in maintaining the size unchanged. The zooming effect
of your lens compensates for the apparent change in size of the
approaching skier.
Suppose that we measure the electric ®eld at a particular
point due to a stationary electric charge located a certain distance
away. If the entire laboratory is then raised to a potential of, say,
2000 volts, a determination of the electric ®eld should yield the
same value. The reason is that the electric ®eld is determined
only by differences in the electric potential, not by the absolute
574
Superforce: Einstein's Dream
value of the potential. An analogous situation occurs when we
measure the energy required to lift a 10-kg box from the ¯oor
up to a height of one meter. If we ®rst perform this experiment
in a laboratory at sea level and later in a laboratory in the mountains, at an altitude of 2000 meters, the amount of energy will be
the same. The energy required to lift the box depends only on the
difference in height. The symmetries involved here are global symmetries. In the case of the electric ®eld experiment, the entire
laboratory was raised to the same potential. Neither electricity
nor Newtonian gravitation contains local symmetries.
What happens if the electric charge in our experiment is not
static? Maxwell's theory of electromagnetism, we recall, also
deals with moving charges. A charge in motion relative to the
laboratory generates a magnetic ®eld. Just as there is an electric
potential associated with the electric ®eld, a magnetic potential,
associated with the magnetic ®eld, can be de®ned. The interplay
between the two potentials allows us to establish the local symmetry. Any arbitrary local change in the electric potential can be
compensated by a local change in the magnetic potential. This
means that we are free to set our own reference potential at
every point in space in any way we want. Regardless of how
we vary the one potential throughout space, the other potential
can be adjusted to cancel out the differences. A charged particle
moving through this region of space will experience no change
in the electromagnetic force acting on it as a consequence of the
changes in each individual potential.
Seen this way, the electromagnetic ®eld is not merely a force
®eld which happens to exist in nature; it is actually a manifestation of a simple local gauge symmetry. Although Newtonian
gravity exhibits only global symmetry, Einstein's general relativity is a theory based on a local gauge symmetry, as well. We can
say that the force of gravity is also a manifestation of a simple
local gauge symmetry. As we shall see soon, all four forces of
nature can be generated from local gauge symmetries.
The electroweak uni®cation
Maxwell uni®ed electricity and magnetism without any knowledge of group theory, local gauge symmetries or NoÈther's
575
SUPERSTRINGS AND OTHER THINGS
theorem. His only guide was the symmetry observed in electric
and magnetic phenomena and in the equations that described
them. From a modern perspective, we can see that although
neither electricity nor magnetism alone exhibits local symmetry,
the uni®ed theory of electromagnetism does exhibit this more
powerful local gauge symmetry.
Nearly a hundred years later, two physicists at Brookhaven
National Laboratory, C N Yang and Robert L Mills, took the
®rst steps toward the next uni®cation. Yang, the son of a mathematics professor, emigrated to the United States from his native
China in 1945 at the age of 23 and entered the University of
Chicago as a graduate student in physics. At Chicago, Yang
began thinking about a way to generalize Maxwell's theory of
electromagnetism to see if there was some sort of connection
between it and the theory of the weak interaction. Electromagnetism, Yang knew, possessed a kind of symmetry known as U(1).
That is, electromagnetism was invariant under U(1) gauge
transformations.
Werner Heisenberg had shown in 1932 that if a proton is
changed into a neutron everywhere in the universe, the strong
force between nucleons remains unchanged. The reason for this
invariance is that, as we saw in the previous chapter, the strong
force can be thought of as due to the exchange of a meson, and
this exchange takes place regardless of whether the particles are
protons or neutrons. The symmetry in Heisenberg's theory is
known as global SU(2) gauge symmetry. Could it be possible to
construct a theory with local SU(2) symmetry? In 1954, Yang
and Mills succeeded in constructing such a theory. To convert
the global symmetry into a local one, they introduced a set of
new ®elds that described a family of spin-1 particles. Two of
these ®elds were identi®ed with the electric and magnetic
®elds, and they described the photon. The remaining ®elds
described charged photons, one positive and the other negative.
Charged photons had never been observed in nature. However, physicists knew that the weak interaction, like the electromagnetic interaction, was mediated by spin-1 particles. Could
the new charged photons be the mediators of the weak interaction? The fact that they were charged made the particles possible candidates for the positions of mediators. The only problem
was that the weak interaction was known to be short-range,
576
Superforce: Einstein's Dream
requiring a particle with mass, and the new charged photons of
Yang and Mills were massless. The way out of this dif®culty
came from three physicists who were to succeed where others
had failed; in the uni®cation of electromagnetism with one of
the nuclear forces.
Abdus Salam from Imperial College, London, and Sheldon
Glashow from Harvard University were working independently
on gauge theories and became aware of each other's work when
Glashow, just after completing his PhD, gave a lecture on his
work in London. Although the two were later to share the
Nobel prize for their work in the uni®cation of the weak and electromagnetic interactions, the encounter in London did not result
in a collaboration. Salam, who was in the audience when Glashow
presented some of his results in his early attempt at a gauge theory
linking the weak and electromagnetic forces, did not believe that
``this slip of a boy'' was correct. ``Naturally,'' said Salam later, ``I
wanted to show he was wrong, which he was. As a consequence,
I never read anything else by Glashow, which of course turned out
to be a mistake.''
Salam was born and raised in what is now Pakistan and
entered Punjab University with the highest marks ever recorded
in the entrance examination. In 1946 he traveled to England
where he obtained two separate undergraduate degrees in
mathematics and in physics at Cambridge University. After
obtaining his PhD in theoretical physics, Salam returned to
Pakistan to teach at the university. The work that he had done
for his doctoral thesis, however, had made him well known
among physicists and when a position opened at the University
of Edinburgh in 1954, it was immediately offered to him.
In a paper on gauge theories, Salam advanced the idea that
systems that are not symmetrical can be described by symmetrical equations. This idea, now called spontaneous symmetry breaking,
had actually been proposed in another context by Heisenberg in
1928. Heisenberg used the temperature dependence of ferromagnetic materials to illustrate symmetry breaking. A magnet
has a north and a south pole formed by the alignment of millions
of atoms, as we saw in chapter 14. A magnet, then, has a very
speci®c orientation in space; the force that it exerts on an object
made of iron located near its north pole is different from the
force on the same object when the magnet is rotated 908, for
577
SUPERSTRINGS AND OTHER THINGS
Figure 25.3. (a) Iron atoms in a magnet at normal temperatures. Rotating this magnet produces a different magnetic ®eld con®guration. (b)
Heating up the magnet above a certain critical temperature destroys
the alignment of the iron atoms. Rotation of the magnet now produces
no change in the space around it.
example (®gure 25.3(a)). The interaction with the magnet is not
invariant under rotations in three dimensions. If, however, we
heat up the magnet above a certain temperature, the alignment
of the atoms is lost due to the increased random motion caused
by the added energy. Now the magnet can be rotated in any direction without any change in the interaction with the object (®gure
25.3(b)). In the language of groups, we say that the interaction is
invariant under the symmetry group O(3); that is, under rotations
in three dimensions. When the temperature falls below the critical
temperature, this symmetry is broken. Below this temperature,
578
Superforce: Einstein's Dream
the symmetry is hidden. Adding energy to the magnet allows us to
observe this symmetry.
In 1967, Steven Weinberg from Harvard University published
a paper in which he showed that the massless charged photons
predicted by Yang and Mills could be considered to be the
mediators of the weak interaction when spontaneous symmetry
breaking was taken into account. Abdus Salam in London was
to make the same discovery a few months later. To achieve the
spontaneous symmetry breaking, Weinberg and Salam made
use of an idea developed by F Englert and Robert H Brout of the
University of Brussels and by Peter Higgs of the University of
Edinburgh. According to this mechanism, known today as the
Higgs mechanism, a new ®eld ± the Higgs ®eld ± is introduced to
the theory which has its lowest energy state with broken symmetry. We can illustrate the Higgs ®eld with the situation presented
in ®gure 25.4. A top balanced on a smooth surface with its axis
oriented in the vertical direction has an obvious symmetry; that
is, the top remains unchanged when it is rotated around its vertical axis. However, this position is unstable; any small perturbation
will cause the top to topple, losing energy. A rotation around the
vertical axis now does not keep the top unchanged; the symmetry
is broken. The position with broken symmetry has lower energy
than the unstable equilibrium with rotational symmetry. Likewise, the behavior of the Higgs mechanism is such that the state
in which the Higgs ®eld has its lowest energy is one of broken
symmetry.
According to this idea, the Higgs ®eld couples with the Yang±
Mills ®elds and in the process it gives mass to the Yang±Mills
Figure 25.4. (a) Balancing a top on its tip so that its long axis is vertical
produces an unstable equilibrium; the top easily falls. This position has
rotational symmetry. (b) When the top falls, losing energy, the symmetry
is broken.
579
SUPERSTRINGS AND OTHER THINGS
photons. Salam refers to this mechanism in a picturesque way: the
massless Yang±Mills particles ``eat'' the Higgs particles in order to
gain weight, and the eaten particles become ``ghosts.'' Weinberg
and Salam were able to construct a theory of the weak and electromagnetic interactions based on the Yang±Mills theory and the
Higgs mechanism. According to their model, at very high energies
the mediators of the interaction are massless. At low energies, due
to the spontaneous symmetry-breaking Higgs mechanism, these
particles acquire masses. At the outset, the four ®elds in the
Weinberg±Salam model have in®nite range and are therefore
mediated by massless particles. The Higgs mechanism that
causes the system to fall to the lower, stable energy state selects
the broken symmetry solution and gives masses to three of the
four particles. These particles are known today as the W , the
W ÿ and the Z0 . The fourth particle remains massless; this particle
is the photon, mediator of the electromagnetic interaction.
The photon is the mediator of the electric and magnetic
forces. We can thus consider electricity and magnetism to be
two different manifestations of the same force because the mediator of both interactions is the photon. The electromagnetic interaction and the weak interaction are both gauge theories whose
forces are mediated by the exchange of the same family of particles. These interactions appear very different to us because
our observations are done at low energies, and at these energies,
due to the spontaneous symmetry breaking mechanism, the
mediators of the interactions are different. The mediator of the
electromagnetic interaction, the photon, is massless and the interaction has in®nite range. The mediators of the weak interaction,
the W and W ÿ and the Z particles, are very massive and the
interaction is short-range. At extremely high energies, the
theory predicted, the symmetry is restored, all the mediators
become massless photons, and the two forces become one: the
electroweak force.
The electroweak theory of Weinberg and Salam was very
beautiful. Originally formulated in the late 1960s, it was cleaned
up of certain mathematical dif®culties by a very bright graduate
student at the University of Utrecht, Gerald 'T Hooft, in 1971. The
only dif®culty remaining with the theory was that no W or Z
particles had ever been observed. In 1983, an extremely elaborate
experimental set-up conducted by two teams of several hundred
580
Superforce: Einstein's Dream
physicists led by Carlo Rubbia at CERN, the European center for
nuclear research, discovered the W and Z particles and triumphantly con®rmed the predictions of the electroweak theory. In
1979, Weinberg, Salam and Glashow shared the Nobel prize in
physics for the formulation of their theory. In 1984, Carlo
Rubbia and Simon van der Meer, the Dutch physicist who discovered the brilliant experimental technique, shared the Nobel prize
in physics for the detection of the W and Z particles. 'T Hooft won
the prize in 1999.
The color force
The enormous success of the electroweak theory encouraged
physicists to seek further uni®cation of the fundamental forces
of nature. The next obvious step was to try to integrate the
theory of the strong interaction with the electroweak theory. The
strong interaction arises from the interactions between quarks.
Quarks, as we have seen, have fractional electric charge, and the
different combinations of quarks to form hadrons obey the law
of conservation of charge so that, for example, the two u quarks
with charge 23 and the d quark with charge ÿ 13 that form a
proton add up to a total charge of 1, the charge of the proton
in units of the fundamental charge e (the charge of the electron).
The way that these charges add up, however, does not explain
why quarks combine to form the different hadrons observed.
One way to explain this interaction is to imagine that quarks
possess a new kind of charge, in addition to their electric charge.
For mesons, which consist of a quark and an antiquark, this
charge acts in a fashion analogous to the electric charge that
binds the electrons to the nucleus to form an atom; that is, opposite charges attract. For protons or neutrons which consist of
combinations of three quarks, the situation is more complicated
and we need to consider the attraction of three charges. This
new kind of charge is called color charge, although it has nothing
whatsoever to do with the colors of the visible spectrum. The
three color charges are called red, green, and blue, like the primary
colors, because the mathematical rules obeyed by these three
charges are similar to the way the three primary colors combine
to form white. Each quark (u, d, s, c, b, and t) has the three possible
581
SUPERSTRINGS AND OTHER THINGS
Figure 25.5. The combination of a red, a green and a blue quark to form
a proton gives a color-neutral or white particle.
color charges red, green and blue. Quarks can combine in groups
of two or three to form a particle only when the resulting color is
white, in a way similar to the formation of neutral atoms by an
equal number of positively charged protons in the nucleus and
negatively charged orbiting electrons. Thus, a proton, which we
know consists of two u quarks and one d quark has a red u
quark, a green d quark and a blue u quark. Like the primary
colors, the red, green and blue quarks combine to produce a
white or color-neutral proton (®gure 25.5).
What about mesons, which are composed of a quark and an
antiquark? It turns out that antiquarks have the anticolors: cyan,
the anticolor of red; magenta, the anticolor of green; and yellow,
the anticolor of blue. As pigments, these colors absorb the primary
colors and are said to be subtractive colors or complementaries. Cyan
is a greenish-blue color that absorbs red; magenta absorbs green;
and yellow absorbs blue. When we add a subtractive color to its
corresponding primary color we get white. Likewise, when a
quark with red color charge is combined with an antiquark with
cyan color charge, the result is white, as is the case with the pion
(®gure 25.6).
The theory that explains the color interactions between
quarks is called quantum chromodynamics or QCD, and is modeled
after quantum electrodynamics or QED, the most successful
theory ever devised. Although from our discussion above it
might seem that QCD is a fairly straightforward and simple
theory, the equations are much more complicated than in any
other theory and the calculations extremely complicated and
tedious. QCD is a gauge theory and it is this fact what allows
582
Superforce: Einstein's Dream
Figure 25.6. A pion is formed with a red u quark and a cyan d antiquark.
The particle has no color charge, since red and cyan combine to produce
white.
its uni®cation with the electroweak theory. The gauge symmetry
associated with QCD is the invariance in the local transformations of color. Since the invariance is local, the color of one
quark can change while the other quarks remain unmodi®ed.
This change would give color to the particle of which the quark
is a constituent. As with the electroweak theory, the way to
Figure 25.7. (Cartoon by Leo Cullum.)
583
SUPERSTRINGS AND OTHER THINGS
balance this change is with the introduction of new ®elds. The
messenger particles of these ®elds are called gluons, of which
there are eight, one for each compensating ®eld. The gluons
play the same role in quantum chromodynamics as the photon
in electrodynamics and the W , W ÿ and Z0 particles in the
weak interaction; they are the messenger particles that convey
the force.
Gluons carry one color and one anticolor, although not
necessarily of a corresponding pair. A gluon can carry, for example, blue-antigreen or red-antiblue. When a quark changes
color, a colored gluon is emitted which is then absorbed by
another quark. Since the absorbed gluon carries color, the
second quark shifts its color in exactly the right way to compensate for the color change that took place in the ®rst quark, keeping
the hadron formed by these quarks white. This is the gauge
symmetry of QCD. Although the quark colors vary from point
to point, all hadrons remain white due to the continual compensation carried by the gluon ®elds.
GUTs, the third uni®cation
With quantum chromodynamics and the electroweak theory
based on the gauge symmetry principle, the next problem was
whether these two theories could be uni®ed into a single one.
Would it be possible to build a theory treating quarks and leptons
on an equal footing? In 1973 Sheldon Glashow and Howard
Georgi of Harvard University proposed such a theory. In their
Grand Uni®ed Theory or GUT, the electroweak and strong forces
are merely two aspects of the same grand force. Symmetry breaking differentiates the two forces into separate ®elds.
As before, the local gauge symmetry required in the grand
uni®ed theory is compensated by the introduction of new ®elds.
In the GUT of Glashow and Georgi, the ®rst and simplest of the
theories, twenty-four compensating force ®elds are required.
Twelve of the messenger particles for these ®elds are to be identi®ed with the known quanta of the electroweak and strong interactions: the photon, the two Ws and the Z, and the eight gluons
of the strong force. The remaining twelve particles are new and
are collectively called the X particles. These particles, which can
584
Superforce: Einstein's Dream
change leptons into quarks, carry color charge and electric charge
of 13 or 43. The spontaneous symmetry-breaking process converts some of the energy of the twelve remaining X particles
into mass. This mass, of the order of 1015 GeV/c2 , is roughly a
quadrillion times the mass of a proton (about a nanogram), an
enormous mass for an elementary particle.
The conversion of quarks into leptons and viceversa cannot
be seen even at the high energies of the current or planned
particle accelerators. It is only at the tremendous energies of
1015 GeV that this connection could be observed. There is, however, no prospect of achieving these energies in an accelerator.
Heisenberg's uncertainty principle tells us that the kinetic
energy (or the velocity) of a particle is intertwined with the
appropriate distance in space. For energies of the order of
1015 GeV, this distance is about 10ÿ29 cm. This is the distance
that the X particle traverses as a messenger. At those distances,
quarks and leptons become indistinguishable. An electron seen
at that distance is no longer an electron but a fundamental
quark-lepton entity.
Supersymmetry and superstrings
Grand uni®ed theories, as we have just seen, unify the electromagnetic, weak, and strong interactions into one single formalism.
These theories, of which there are several versions, have not yet
passed major experimental tests. However, most physicists agree
that the general ideas and techniques common to all these theories
are probably correct and that only the implementation of the
symmetries to achieve uni®cation is still subject to debate.
What about gravity? Before the two nuclear forces were
known, Einstein started a lifelong and fruitless effort to unify
electromagnetism with gravity. As we have seen, the path to
uni®cation was different and gravity has been the last one to be
considered. Einstein's general theory of relativity, a theory of
gravity, is a gauge theory. It is not, however, a quantum ®eld
theory; and therein lies the main obstacle towards uni®cation
with the other forces.
Physicists have not stood still, however, and they have made
some progress in the last ®fteen years towards the inclusion of
585
SUPERSTRINGS AND OTHER THINGS
gravity in the uni®cation of all forces. One approach has been to
®nd a powerful new gauge symmetry that unites quarks and
leptons with messenger particles. This supersymmetry brings
together all quantum particles, including the messenger particle
of gravity or graviton, as components of a single master super®eld. The gauge particle responsible for supersymmetry is
called the gravitino, with spin 32. The resulting theory became
known as supergravity, a theory that incorporates and extends
Einstein's theory of gravity.
One particular important development physicists discovered
was that the mathematical structure of supergravity theories
could be greatly simpli®ed if more than the four space-time
dimensions are considered. The most popular supergravity
theory, for example, was formulated in eleven space-time dimensions: ten space dimensions and one time dimension. Only four of
the eleven space-time dimensions are observable; the remaining
seven are rolled up to a very small size.
The idea of considering higher dimensions in an attempt at
uni®cation is not new. In 1919, the German mathematician Theodor Kaluza generalized Einstein's gravitational ®eld equations to
a ®ve dimensional space-time. In this representation, the extra
®fth dimension produced a set of equations that turned out to
be Maxwell's equations for the electromagnetic ®eld. Electromagnetism, in the context of Kaluza's theory, is nothing more
than a form of gravity, the gravity of the unseen additional
dimension. In 1926, the Swedish physicist Oscar Klein extended
and cleaned up Kaluza's theory and calculated the radius of the
extra ®fth dimension to be about 10ÿ30 cm. We can visualize the
existence of an extra ®fth dimension that cannot be observed by
considering, for example, a pipeline, like the Alaska pipeline
that brings crude petroleum to the lower forty-eight states.
When seen from a large distance, the pipeline appears to be a
wiggly line, with only one dimension. Of course when we look
at it from a closer distance, the other two dimensions become
apparent and what looked like a point of the line is actually a
circle (®gure 25.8). Although, a nice idea at the time, the
Kaluza±Klein theory turned out to be extremely restrictive and
no one has been able to apply it to the other forces.
The Kaluza±Klein theory was almost forgotten until the
early 1960s when Gabriele Veneziano of CERN proposed a
586
Superforce: Einstein's Dream
Figure 25.8. (a) A pipe line seen from afar looks like a broken line. (b)
When seen from a closer distance, its real dimensions become apparent.
mathematical model to explain the existence of many very shortlived hadrons that had been observed in experiments at CERN
and at other high-energy physics laboratories throughout the
world. Closer investigation of this new mathematical model by
Veneziano and other physicists revealed that this new theory
described the quantized motion of subatomic strings. During the
late 1970s and early 80s, Michael Green of Queen Mary College
in London and John Schwarz of the California Institute of
Technology advanced Veneziano's idea and solved some mathematical inconsistencies. The current version of this theory stems
from the work of Green and Schwarz who showed in 1984 that
superstring theory, as it is called today, had enough symmetry to
ban all mathematical inconsistencies.
According to superstring theory, all elementary particles are
represented by strings no longer than 10ÿ35 m, which make them
some 1020 times smaller than a proton. These strings can vibrate
like a guitar string and each vibrational mode corresponds to a
particle. The frequency of vibration of a string determines the
energy and therefore the mass of the particle. There are open
strings and closed strings. Open strings have endpoints. Quantities
such as electric or color charge are attached to the endpoints of
open strings. These open strings can interact with other open
strings by joining their endpoints to form another string (®gure
25.9). Open strings can also split into two strings. An open
string can become a closed string by joining its endpoints.
Superstring theory, constructed in a ten-dimensional space,
automatically includes gravity, in addition to the other three
forces. According to this theory, the laws of physics are
587
SUPERSTRINGS AND OTHER THINGS
Figure 25.9. (a) Two open strings can combine to form a third string. (b)
An open string can split into two new strings. (c) Closed strings can also
combine and split.
approximations to a much richer theory that operates on an
unimaginably small scale. In recent years, a new theory, called
M-theory, proposes that superstrings are actually the projection
in ten-dimensional space of eleven-dimensional membranes or
p-branes, which are the fundamental constituents of matter.
Although there is a great deal of activity in this area and the
theory promises to unify all of physics without any of the mathematical problems encountered by all the other approaches, there
are still very few experimental predictions to test the validity of
the theory. However, it points to one direction that physics
might take.
The creation of the universe
The recent developments in our understanding of matter have
allowed us to pierce a few of the veils that cloud the age-old question of the origin of the universe. As we said in chapter 11, matter
and energy during the ®rst few moments after the universe began
were under conditions of very high pressure and very high
energy. In our cold universe today, we recognize four different
interactions: the electromagnetic force, the weak nuclear force,
the strong nuclear force, and the gravitational force.
The strength of the electromagnetic force is determined by
the electron. The force between two electrons, however, depends
on how fast they are moving. Each electron is surrounded by a sea
588
Superforce: Einstein's Dream
Figure 25.10. Cloud of virtual electron±positron pairs surrounding an
electron.
of virtual electron±positron pairs that owe their existence to
Heisenberg's uncertainty principle (®gure 25.10). These particles
appear out of nothing and disappear almost immediately after
being created. The only requirement is that they recombine and
annihilate in time to satisfy the uncertainty principle. As we
saw in chapter 22, Heisenberg's uncertainty principle states that
h
:
2
This uncertainty relation means that during the interval t an
electron±positron pair of total energy E can be created out of
nothing; if the pair recombines and annihilates itself before the
time t is up, no experiment can detect any missing energy.
The electron±positron pairs that surround an electron at any
given time are oriented so that the virtual positrons are attracted
toward the electron and the negatively charged virtual electrons
are repelled by it. These particles are continuously appearing
and disappearing, creating a cloud of virtual pairs around the
electron. This cloud shields the electron from other incoming electrons, reducing the strength of their interaction. For an electron
that stays far away, the positive and negative virtual charges
surrounding the ®rst electron will cancel out and the interaction
is the regular Coulomb interaction that we studied in chapter
12. However, if this electron approaches the ®rst electron with
high enough energy, it can penetrate this cloud, leaving some
of the virtual electrons behind. What lies ahead is an electron
Et
589
SUPERSTRINGS AND OTHER THINGS
surrounded by a shield of virtual positrons which reduce the
effect of the repulsion. If the energy of the approaching electron
is high enough, it can penetrate far into the cloud, leaving these
positrons behind and seeing the bare electron, thus feeling the
full strength of the electrical repulsion. The electric force, then,
seems stronger to electrons with high energy or equivalently to
those at high temperatures.
Quarks are also surrounded by a cloud of virtual quark±
antiquark pairs with the color charge polarized in a similar
fashion as with the electrons. There is a screening of the color
charge due to this shielding effect. The situation here is more
complicated, however, because the gluons, carriers of the color
charge, also carry color. There is a second cloud of colored
gluons surrounding the quark which carry color charge of the
same type (®gure 25.11). If, for example, the original quark carries
red color charge, the cloud of virtual quark±antiquark pairs will
carry red and antired color charges arranged so that the antired
is closer to the red quark. The second cloud of virtual gluons
carries red color charge and this produces an antiscreening effect,
opposite to the screening effect of the quark±antiquark pair,
Figure 25.11. A quark is surrounded by a cloud of quark±antiquark
pairs and by an overlapping cloud of virtual gluons that carry the same
type of color charge as the original quark. These two clouds produce
competing effects and the overall result is a cloud with a net color
charge of the same type as that of the quark.
590
Superforce: Einstein's Dream
which weakens the force between interacting quarks. There are two
competing phenomena taking place here. One is the screening of
the virtual gluons which has the same effect as that between interacting electrons. The other is the antiscreening of the gluons just
mentioned. When a high-energy quark approaches another
quark and gets past the screening, the antiscreening of the
gluons makes the interaction weaker, exactly the opposite of
what takes place between interacting electrons. This property of
quarks is called asymptotic freedom, conveying the idea that if the
energy were in®nite, the interaction would become weaker and
weaker until it would ®nally vanish; the quarks would feel no
forces and behave as free particles.
The weak interaction is also mediated by massive particles,
the W , W ÿ and Z particles. A similar effect takes place here.
The weak interaction becomes weaker as the energy of the interaction increases.
In our cold universe, due to symmetry breaking, the four
interactions have different strengths. The strong and weak interactions are stronger than the electromagnetic interaction, and
gravity is the weakest. However, as we have just seen, at very
high energies the electromagnetic interaction becomes stronger
and the strong and weak interactions weaker. At some point
very high in energy these three interactions have equal strength
(®gure 25.12). Theoretical calculations place this energy at about
1015 GeV. These enormous energies only existed for a brief
moment after the creation of the universe. In our laboratories,
we have achieved energies of about 1000 GeV.
At those very high energies, then, all the forces of nature
except gravity were uni®ed. For a few very tiny fractions of a
second only gravity and this uni®ed force existed. At energies
even higher than that, when the universe was less than 10ÿ43
second old, gravity was uni®ed with the other forces and only
one superforce existed.
The ®rst moments of the universe
The picture that modern physics paints of the evolution of the universe is that of complete simplicity from the very moment of
creation up to 10ÿ43 second. This is an incredibly small interval
591
SUPERSTRINGS AND OTHER THINGS
Figure 25.12. Evolution of the forces of nature. Due to symmetry breaking, the forces have very different strengths. In the past, when the
universe was very hot and dense, the electromagnetic, weak, and strong
forces were one and the same.
of time; one that we are incapable of comprehending. Physicists
are making great progress toward the understanding of his
small period in the life of the universe. The main dif®culty lies
with the fact that, during that extremely brief time, the universe
was a microscopic object, subject to the laws of quantum
mechanics, which means that the theory of gravity must be combined with quantum mechanics before we can achieve a complete
understanding. At the present, we lack a theory of quantum
gravity.
At the 10ÿ43 second mark the complete symmetry was broken
and the superforce was split into gravity and the GUT force. This
period in the life of the universe is known as the GUT era. The next
important landmark took place when the universe was 10ÿ35
second old, as the strong color force becomes distinct, and the
grand uni®cation symmetry was broken at an energy of 1015 GeV.
During the next period in the life of the universe only three
forces existed: gravity, the strong color force, and the electroweak
force. At 10ÿ12 second the universe had cooled down and the
592
Superforce: Einstein's Dream
Figure 25.13. The universe has evolved from total symmetry through a
process of symmetry-breaking where the four known forces of nature are
separated.
energies were of the order of 1000 GeV. These conditions are now
possible to reproduce in the laboratory. From now on, the four
forces familiar to us today control everything in the universe.
This symmetry-breaking process is illustrated in ®gure 25.13.
When the universe was about 10ÿ5 second old (one hundredth of a millisecond), the quarks became con®ned into protons
and neutrons. One second after the Big Bang, there were eight
protons for every neutron, the present abundance ratio. One
minute after the instant of creation, the universe had cooled
down due to its expansion, so that nuclei could be formed without immediately being shredded into pieces again. This process
lasted for about seven additional minutes and the universe
behaved like a giant fusion reactor producing helium from the
fusion of hydrogen nuclei. Calculations show that at the end of
that era the ratio of hydrogen to helium was three to one, exactly
the proportion observed today.
593
SUPERSTRINGS AND OTHER THINGS
Nothing much happened for the next 100 000 years. Then the
universe entered a new phase: the age of the atom. Since atoms
are electrically neutral, photons, which interact only with charged
matter, begin to uncouple. Three hundred thousand years after
The frontiers of physics: The cosmic background
explorer
In April 1992 scientists presented in Washington, DC the
results of what the renowned physicist Stephen Hawking
called ``the discovery of the century, if not of all time.'' The
physicists showed the extremely precise measurements
taken with NASA's Cosmic Background Explorer (COBE)
satellite which allowed them to detect the very remnants of
creation.
Instruments aboard the COBE satellite measured small
¯uctuations in the cosmic microwave background radiation.
These ¯uctuations are believed to represent gravitational
ripples that could have seeded galaxy formation. After its
launch in late 1989, COBE's instruments had measured the
spectrum of the microwave background radiation. COBE's
microwave spectrum reproduced perfectly a black body
radiation curve of 2.73 K, in accordance with the basic Big
Bang theory. However, a perfectly smooth radiation curve
would not have allowed the existence of tiny concentrations
of matter that must have been formed in the early universe.
These unevennesses in the early universe would eventually
evolve into the galaxies and clusters of galaxies observed
today.
The COBE more recent measurements showed minuscule variations of 30 millionths of a kelvin from the 2.73 K
background temperature. ``It's the missing link,'' said Berkeley cosmologist Joseph Silk. The very small ¯uctuations
detected, however, do not seem to be enough to account for
the rapid (on a cosmic scale) formation of galaxies observed.
Although scientists are very happy to have ®nally detected
the predicted ¯uctuations, they are hard at work attempting
to devise other mechanisms that could help explain the
formation of the cosmos.
594
Superforce: Einstein's Dream
its birth, the universe became transparent to light. Photons
embarked now on an independent existence and continued to
cool off as the universe expanded. We see them today as the
2.7 K background radiation that permeates the universe.
After the uncoupling of the photons from matter the universe
entered the present phase: the era of structure. After one billion
years, matter began to coalesce into galaxies and gas began to
ascrete into stars. Most of those ®rst-generation stars did not
live for very long and ended their existences in powerful explosions called supernovas (®gure 11.6). These spewed out carbon,
oxygen, silicon and iron. New stars were born out of the matter
of the explosion mixed with the surrounding gas in the galaxies.
Millions of years later, the protostar nebula from which our Sun
and planet Earth were formed contained the ashes of the early
supernova explosions.
595
APPENDIX A
POWERS OF TEN
In physics we deal with quantities that range from the very small
to the immensely large. The distance to the nearest galaxy in kilometers, for example, would require many zeros if we were to
write it in the conventional way. The mass of an electron in kilograms, a very small number, would require writing 26 zeros after
the decimal point. We can avoid these dif®culties if we use
powers of ten to write these very large and very small numbers.
The product
10 10 10 10 10 10
where the factor 10 occurs 6 times, can be written as
106 :
Suppose that we now have the product
10 10 10 10 10 10 10 10 10 109
where the factor 10 occurs 9 times. We can write this last product
as
or
10 10 10 10 10 10 10 10 10
106 103 :
We can see from the above example that
and, in general, that
106 103 109
10m 10n 10 m n
596
Appendix A
To multiply powers of ten, then, we add the exponents.
Suppose that now we want to obtain the result of
106
103
which can be written as
10 10 10 10 10 10
10 10 10 103 :
10 10 10
We can then see that
106
103
103
and, in general,
10m
10 m ÿ n
10n
To divide powers of ten, we subtract the exponents of the numerator and denominator.
If the exponents m and n are the same (equal to 4, for
example), the expression
104
10 4 ÿ 4 100 :
104
Since
104
is equal to 1, then
104
100 1
Another important case occurs when the exponent of the numerator is equal to 0:
100
10 0 ÿ 4 10ÿ4
104
since 100 equals 1, then
1
10ÿ4
104
597
SUPERSTRINGS AND OTHER THINGS
or, in general,
1
10ÿn
10n
For example,
1
10ÿ7
107
1
1012 :
10ÿ12
and
What happens
when the exponent is a fraction? Consider for
1
example 102 (or 100:5 ). If we multiply 10 raised to the one half by
itself we get 1, since we know already that to multiply powers of
ten we add the exponents. That is,
1
1
102 102 1:
1
1
Since 102 multiplied by itself gives 1, 102 must be the square root
of 10, or
p
1
102 10 3:16:
1
If we now multiply 103 by itself three times, we also get 1, since
1
1
1
3 3 3 1:
1
1
1
1
103 103 103 1:
We can see that 103 is the cube root of 10, or
p
1
3
103 10 2:15:
598
APPENDIX B
THE ELEMENTS
Element
Symbol
Actinium
Aluminum
Americium
Antimony
Argon
Arsenic
Astatine
Barium
Berkelium
Beryllium
Bismuth
Boron
Bromine
Cadmium
Calcium
Californium
Carbon
Cerium
Cesium
Chlorine
Chromium
Cobalt
Copper
Curium
Dysprosium
Einsteinium
Erbium
Ac
Al
Am
Sb
Ar
As
At
Ba
Bk
Be
Bi
B
Br
Cd
Ca
Cf
C
Ce
Cs
Cl
Cr
Co
Cu
Cm
Dy
Es
Er
Atomic number (Z)
89
13
95
51
18
33
85
56
97
4
83
5
35
48
20
98
6
58
55
17
24
27
29
96
66
99
68
599
SUPERSTRINGS AND OTHER THINGS
Element
Symbol
Atomic number (Z)
Europium
Fermium
Fluorine
Francium
Gadolinium
Gallium
Germanium
Gold
Hafnium
Helium
Holmium
Hydrogen
Indium
Iodine
Iridium
Iron
Krypton
Lanthanum
Lawrencium
Lead
Lithium
Lutetium
Magnesium
Manganese
Mendelevium
Mercury
Molybdenum
Neodymium
Neon
Neptunium
Nickel
Niobium
Nitrogen
Nobelium
Osmium
Oxygen
Palladium
Phosphorus
Platinum
Eu
Fm
F
Fr
Gd
Ga
Ge
Au
Hf
He
Ho
H
In
I
Ir
Fe
Kr
La
Lw
Pb
Li
Lu
Mg
Mn
Md
Hg
Mo
Nd
Ne
Np
Ni
Nb
N
No
Os
O
Pd
P
Pt
63
100
9
87
64
31
32
79
72
2
67
1
49
53
77
26
36
57
103
82
3
71
12
25
101
80
42
60
10
93
28
41
7
102
76
8
46
15
78
600
Appendix B
Element
Symbol
Plutonium
Polonium
Potassium
Praseodymium
Promethium
Protactinium
Radium
Radon
Rhenium
Rhodium
Rubidium
Ruthenium
Samarium
Scandium
Selenium
Silicon
Silver
Sodium
Strontium
Sulfur
Tantalum
Technetium
Tellurium
Terbium
Thallium
Thorium
Thulium
Tin
Titanium
Tungsten
Uranium
Vanadium
Xenon
Ytterbium
Yttrium
Zinc
Zirconium
Pu
Po
K
Pr
Pm
Pa
Ra
Rn
Re
Rh
Rb
Ru
Sm
Sc
Se
Si
Ag
Na
Sr
S
Ta
Tc
Te
Tb
Tl
Th
Tm
Sn
Ti
W
U
V
Xe
Yb
Y
Zn
Zr
Atomic number (Z)
94
84
19
59
61
91
88
86
75
45
37
44
62
21
34
14
47
11
38
16
73
43
52
65
81
90
69
50
22
74
92
23
54
70
39
30
40
601
APPENDIX C
NOBEL PRIZE
WINNERS IN PHYSICS
1901 Wilhelm Konrad RoÈntgen
1902 Hendrik Antoon Lorentz
Pieter Zeeman
1903 Antoine Henri Becquerel
Pierre Curie
Marie Sklowdowska-Curie
1904 Lord Rayleigh
(John William Strutt)
1905 Philipp Eduard Anton
von Lenard
1906 Joseph John Thomson
1845±1923
1853±1928
1865±1943
1852±1908
1859±1906
1867±1934
1842±1919
For the discovery of x-rays.
For their work on the in¯uence of
magnetism on radiation.
For his discovery of radioactivity.
For their joint research on nuclear
radiation phenomena.
For his research on the densities of
the gases and for his discovery of
argon.
1862±1947 For his work on cathode rays.
1856±1940 For his research on the conduction
of electricity by gases.
1907 Albert Abraham Michelson 1852±1931 For his optical instruments and for
measuring the speed of light.
1908 Gabriel Lippmann
1845±1921 For his method of reproducing
colors photographically based on
interference techniques.
1909 Guglielmo Marconi
1874±1937 For their development of wireless
1850±1918 telegraphy.
Carl Ferdinand Braun
1910 Johannes Diderik van der 1837±1932 For his research on the equation of
Waals
state for gases and liquids.
1911 Wilhelm Wien
1864±1928 For his work on heat radiation.
1912 Nils Gustaf Dalen
1869±1937 For his invention of automatic
regulators for use in lighthouses.
1913 Heike Kamerlingh Onnes 1853±1926 For his work on the properties of
matter at low temperatures and
for liquefying helium.
1914 Max von Laue
1879±1960 For his discovery of the diffraction
of x-rays in crystals.
1915 William Henry Bragg
1862±1942 For their analysis of crystal
William Lawrence Bragg
1890±1971 structure using x-rays.
1917 Charles Glover Barkla
1877±1944 For his study of atoms by x-rays
scattering.
602
Appendix C
1918 Max Planck
1919 Johannes Stark
1920 Charles-Edouard
Guillaume
1921 Albert Einstein
1922 Niels Bohr
1923 Robert Andrews Millikan
1924 Kark Manne Georg
Siegbahn
1925 James Franck
Gustav Hertz
1926 Jean Baptiste Perrin
1927 Arthur Holly Compton
Charles Thomson Rees
Wilson
1928 Owen Willans Richardson
1929 Prince Louis-Victor de
Broglie
1930 Sir Chandrasekhara
Venkata Raman
1932 Werner Heisenberg
1933
1935
1936
1937
1938
1858±1947 For his discovery of quanta of
energy.
1874±1957 For his discovery of the splitting of
spectral lines in electric ®elds
1861±1938 For his discovery of invar, a
nickel-steel alloy.
1879±1955 For his explanation of the
photoelectric effect.
1885±1962 For his model of the atom.
1868±1953 For his measurement of the charge
of the electron and for his
experimental work on the
photoelectric effect.
1888±1979 For his research in x-ray
spectroscopy.
1882±1964 For their research on electron±
1887±1975 atom collisions.
1870±1942 For his work on the discontinuous
structure of matter and for
measuring the size of atoms.
1892±1962 For his discovery of the Compton
effect
1869±1959 For inventing the cloud chamber
which makes visible the paths of
charged particles.
1879±1959 For his discovery of the
thermionic effect.
1892±1987 For his discovery of the wave
nature of electrons.
1888±1970 For his work on light scattering.
1901±1976 For the development of quantum
mechanics.
Erwin SchroÈdinger
1887±1962 For the development of wave
mechanics.
Paul Adrien Maurice Dirac 1902±1984 For the development of relativistic
quantum mechanics.
James Chadwick
1891±1974 For the discovery of the neutron.
Victor Franz Hess
1883±1964 For the discovery of cosmic
radiation.
Carl David Anderson
1904±1984 For the discovery of the positron.
Clinton Joseph Davisson
1881±1958 For their experimental discovery
1892±1975 of the diffraction of electrons by
George Paget Thomson
crystals, con®rming de Broglie's
hypothesis.
Enrico Fermi
1901±1954 For producing new radioactive
elements by means of neutron
irradiation.
603
SUPERSTRINGS AND OTHER THINGS
1939 Ernest Orlando Lawrence
1943 Otto Stern
1901±1958 For the invention of the cyclotron.
1888±1969 For his discovery of the magnetic
moment of the proton.
1944 Isidor Isaac Rabi
1898±1988 For his discovery of the nuclear
magnetic resonance method that
records the magnetic properties of
nuclei.
1945 Wolfgang Pauli
1900±1958 For the discovery of the exclusion
principle.
1946 Percy Williams Bridgman 1882±1961 For his work in the ®eld of highpressure physics.
1947 Sir Edward Victor
1892±1965 For his study of the physics of the
Appleton
upper atmosphere.
1948 Patrick Maynard Stuart
1897±1974 For his discoveries in nuclear
Blackett
physics with cloud-chamber
photographs of cosmic rays.
1949 Hideki Yukawa
1907±1981 For his prediction of the existence
of mesons.
1950 Cecil Frank Powell
1903±1969 For his photographic method of
studying nuclear processes and
his discoveries of new mesons.
1951 Sir John Douglas Cockcroft 1897±1967 For their work on the
1903±1995 transmutation of atomic nuclei
Ernest Thomas Sinton
using a particle accelerator.
Walton
1952 Felix Bloch
1905±1983 For their discovery of nuclear
Edward Mills Purcell
b. 1912
magnetic resonance in liquids are
gases.
1953 Frits Zernike
1888±1966 For his invention of the phasecontrast microscope.
1954 Max Born
1882±1970 For his interpretation of the wave
function as probability.
Walther Bothe
1891±1957 For his coincidence method for
studying subatomic particles.
1955 Willis Eugene Lamb
b. 1913
For his discoveries concerning the
®ne structure of the hydrogen
spectrum.
Polykarp Kusch
1911±1993 For his precision determination of
the magnetic moment of the
electron.
1910±1989 For their development of the
1956 William Shockley
1908±1991 transistor.
John Bardeen
1902±1987
Walter Houser Brattain
1957 Chen Ning Yang
b. 1922
For their prediction that parity is
Tsung Dao Lee
b. 1926
not conserved in beta decay.
1958 Pavel Aleksejevic Cerenkov 1904±1990 For the discovery of Cerenkov
1908±1990 radiation.
Il' ja Michajlovic Frank
Igor' Evgen' evic Tamm
1895±1971 For his interpretation of Cerenkov
radiation.
604
Appendix C
1959 Emilio Gino Segre
Owen Chamberlain
1960 Donald Arthur Glaser
1905±1989 For their discovery of the
b. 1920
antiproton.
b. 1926
For the development of the bubble
chamber.
1961 Robert Hofstadter
1915±1990 For his discovery of the internal
structure of the nucleons.
Rudolf Ludwig MoÈssbauer b. 1929
For his discovery of the
MoÈssbauer effect regarding
recoilless emission of -rays.
1962 Lev Davidovic Landau
1908±1968 For his theoretical work on
the super¯uidity of liquid
helium.
1963 Eugene P Wigner
1902±1995 For his discovery and
application of symmetry
principles to elementary particle
theory.
Maria Goeppert Mayer
1906±1972 For their work concerning the
J Hans D Jensen
1907±1973 shell structure of the nucleus.
b. 1915
1964 Charles H Townes
For the development of masers
Nikolai G Basov
b. 1922
and lasers.
Alexander M Prochorov
b. 1916
1965 Sin-itiro Tomonaga
1906±1979 For their development of quantum
Julian Schwinger
1918±1994 electrodynamics.
Richard P Feynman
1918±1988
1966 Alfred Kastler
1902±1984 For the development of optical
methods for studying energy
levels in atoms.
1967 Hans Albrecht Bethe
b. 1906
For discoveries concerning the
energy production in stars.
1968 Luis W Alvarez
1911±1988 For the discovery of resonance
states of elementary particles.
1969 Murray Gell-Mann
b. 1929
For his theoretical work regarding
the classi®cation of elementary
particles.
1970 Hannes AlveÂn
b. 1908
For discoveries in magnetohydrodynamics.
Louis Neel
b. 1904
For his discoveries concerning
antiferromagnetism and
ferrimagnetism.
1971 Dennis Gabor
1900±1979 For his development of the
principles of holography.
1908±1991 For their development of a theory
1972 John Bardeen
Leon N Cooper
b. 1931
of superconductivity.
J Robert Schrieffer
b. 1931
1973 Leo Esaki
b. 1925
For the discovery of tunneling in
semiconductors.
Ivar Giaever
b. 1929
For the discovery of tunneling in
superconductors
605
SUPERSTRINGS AND OTHER THINGS
1973 Brian D Josephson
b. 1940
For his theoretical work on the
properties of currents through a
tunnel barrier.
1974 Antony Hewish
b. 1924
For the discovery of pulsars.
Sir Martin Ryle
1918±1984 For his work in
radiointerferometry.
For their work on the structure of
b. 1922
1975 Aage Bohr
the atomic nucleus.
b. 1926
Ben Mottelson
1917±1986
James Rainwater
1976 Burton Richter
b. 1931
For their independent discovery
Samuel Chao Chung Ting b. 1936
of J and psi particles.
For their quantum-mechanical
b. 1923
1977 Philip Warren Anderson
study of solids.
b. 1905
Nevill Francis Mott
John Hasbrouck Van Vleck 1899±1980
1978 Peter L Kapitza
1894±1984 For his fundamental work in
low-temperature physics.
Arno A Penzias
b. 1926
For the discovery of cosmic
Robert Woodrow Wilson
b. 1936
microwave background radiation.
For their uni®ed theory of the
b. 1932
1979 Sheldon Lee Glashow
weak and electromagnetic forces.
b. 1926
Abdus Salam
b. 1933
Steven Weinberg
For the discovery of parity
1980 James W Cronin
b. 1931
violations in the decay of neutral
b. 1923
Val L Fitch
K mesons.
1981 Nicolaas Bloemergen
b. 1920
For their development of laser
Arthur Leonard Schawlow 1921±1999 spectroscopy.
Kai M Siegbahn
b. 1918
For the development of highresolution electron spectroscopy.
1982 Kenneth Geddes Wilson
b. 1936
For his work regarding phase
transitions.
1983 Subrehmanyan
1910±1995 For his work on the structure and
Chandrasekhar
evolution of stars.
William A Fowler
1911±1995 For his work on the formation of
the chemical elements.
1984 Carlo Rubbia
b. 1934
For their discovery of the W and Z
Simon van der Meer
b. 1925
particles, the carriers of the weak
interaction.
1985 Klaus von Klitzing
b. 1943
For his discovery of the quantized
Hall effect.
1986 Ernst Ruska
1906±1988 For the invention of the electron
microscope.
Gerd Binnig
b. 1947
For the invention of the scanningHeinrich Rohrer
b. 1933
tunneling electron microscope.
1987 Karl Alex Muller
b. 1927
For their discovery of high
J George Bednorz
b. 1950
temperature superconductors.
For their production of neutrino
b. 1922
1988 Leon Lederman
beams and their discovery of the
b. 1932
Melvin Schwartz
mu neutrino.
b. 1921
Jack Steinberger
606
Appendix C
1989 Norman Ramsay
Hans Dehmelt
Wolfgang Paul
1990 Jerome Friedman
Henry Kendall
Richard Taylor
1991 Pierre-Gilles de Gennes
1992 Georges Charpak
1993 Russel Hulse
Joseph Taylor
1994 Bertram N Brockhouse
Clifford G Shull
1995 Martin L Perl
Frederick Reines
1996 David M Lee
Douglas D Osheroff
Robert C Richardson
1997 Steven Chu
Claude Cohen-Tannoudji
William D Phillips
1998 Robert B Laughlin
Horst L Stomer
Daniel C Tsui
1999 Gerardus 'T Hooft
Martinus J G Veltman
2000 Zhores I Alferov
Herbert Kroemer
Jack S Kilby
b. 1915
For the development of atomic
resonance spectroscopy.
b. 1922
For their development of
1913±1993 techniques for trapping single
atoms.
b. 1930
For experiments that revealed the
1926±1999 existence of quarks.
b. 1929
b. 1932
For his research on
superconductors, polymers and
liquid crystals.
b. 1924
For his development of
elementary particle detectors.
b. 1950
For discovering evidence for
b. 1941
gravitational waves.
b. 1918
For contributions to the
b. 1915
development of neutron scattering
techniques in condensed matter
studies.
b. 1927
For experimental contributions to
b. 1918
lepton physics
b. 1931
For their discovery of
b. 1945
super¯uidity in helium-3.
b. 1937
For the development of methods
b. 1948
to cool and trap atoms with lasers.
b. 1938
b. 1948
b. 1950
For their discovery of a new form
b. 1949
of quantum ®eld with fractionally
b. 1939
charged excitations.
b. 1946
For elucidating the quantum
b. 1931
structure of electroweak
interactions in physics.
b. 1930
For work (which) has laid the
b. 1928
foundations of modern
b. 1923
information technology (IT),
particularly through their
invention of rapid transistors,
laser diodes, and integrated
circuits (chips).
607
APPENDIX D
PHYSICS TIME-LINE
Ancient Greeks
425
BC
280
BC
240
BC
235
BC
130
BC
130
BC
Democritus proposes that all matter is made of
small indivisible particles, which he calls ``atoms.''
Aristarchus of Samos determines the relative
distances of the Sun and the Moon from the Earth.
He also determines the relative sizes of the Sun,
the Moon and the Earth. These considerations lead
him to propose that the Earth revolves around the
Sun.
Archimedes discovers his principle of buoyancy
(Archimedes' Principle).
Eratosthenes develops a method to measure the
circumference of the Earth.
Hipparchus estimates the size of Moon from the
parallax of an eclipse.
Ptolemy develops his theory of the motion of the
heavenly bodies. According to his theory, the Earth
is at the center of the universe and the Sun and
known planets revolve around it.
Pre-Galilean Physics
1269
1514
AD
Petrus de Maricourt conducts experiments with
magnets and magnetic compasses.
Nicolaus Copernicus develops his heliocentric
theory. (He publishes it in 1543, a few days before
his death.)
608
Appendix D
Classical Physics
1592
1600
1604
1609
1609
1613
1619
1621
1638
1651
1662
1665±1666
1668
1672
Galileo Galilei invents the thermometer.
William Gilbert publishes De Magnete which starts
the modern treatment of magnetism. He also shows
that the Earth is a magnet.
Galileo Galilei proves that falling bodies are
accelerated towards the ground at a constant rate.
He also shows that the distance for a falling object
increases as the square of the time.
Johannes Kepler publishes his ®rst and second laws
of planetary motion in a book entitled Astronomia
Nova.
Galileo Galilei builds a telescope after hearing of its
invention.
Galileo Galilei introduces his principle of inertia.
Johannes Kepler publishes his third law of planetary
motion.
Willebrord van Roijen Snell introduces the law of
refraction.
Galileo introduces the concept of the relativity of
motion in his Two New Sciences.
Blaise Pascal shows that pressure applied at one
point in a liquid is transmitted unchanged to all
points in the liquid (Pascal's Principle).
Robert Boyle, while experimenting with gases,
shows that if a ®xed amount of a gas is kept at a
constant temperature, the pressure and the volume
of the gas follow a simple mathematical
relationship.
Isaac Newton begins his work on the motion of
bodies. He also completes his theory of colors,
develops the main ideas of the calculus, and his law
of gravitation.
Isaac Newton designs and builds a re¯ecting
telescope.
Isaac Newton, in a letter to the Royal Society,
describes his experiments explaining the nature of
color. This letter became Newton's ®rst published
scienti®c paper.
609
SUPERSTRINGS AND OTHER THINGS
1676
1714
1738
1742
1747
1766
1777
1787
1798
1798
1800
1802
1802
1804
Robert Hooke proposes his law relating the
elongation of a spring to the force applied to
produce that elongation.
Gabriel Fahrenheit introduces the mercury
thermometer and his new scale of temperature.
Daniel Bernoulli develops the foundations of
hydrodynamics.
Anders Celsius proposes a new temperature scale.
Benjamin Franklin conducts experiments that show
that one type of electri®cation could be neutralized
by the other type. This indicated to him that the two
types of electricity were not just different; they were
opposites and calls one type positive and the other
negative.
Joseph Priestley proposes that the force between
electric charges follows an inverse square law.
Charles de Coulomb invents a torsion balance to
measure the force between electrically charged
objects (Coulomb's law).
Jacques-Alexander Charles discovers the
relationship between the change in volume of a gas
with temperature. He fails to publish his discovery.
Henry Cavendish adapts the torsion balance
invented by Coulomb to measure the gravitational
constant.
Benjamin Thompson, Count Rumford, introduces
the idea that heat is a form of motion.
Alessandro Volta invents the electric battery.
Thomas Young, in a landmark experiment,
demonstrates that light is a wave phenomenon.
Gian Domenico Romagnosi proposes in a
newspaper article that an electric current affects a
magnetic current. His discovery is largely ignored.
Oersted, a better known scientist, was to discover
the same phenomenon in 1819.
Joseph Louis Gay-Lussac, without knowledge of
Charles' work of 1787, discovers the relationship
between the expansion of a gas at constant pressure
and the temperature. This discovery is now known
as Gay-Lussac's law.
610
Appendix D
1808
1814
1819
1820
1820
1822
1827
1831
1838
1842
1843
1846
1848
1850
1851
1868
John Dalton develops his atomic theory.
Joseph von FraunhoÈfer invents the spectroscope and
observes the absorption lines in the Sun's spectrum
two years later.
Hans Christian Oersted discovers that an
electric current de¯ects a magnetic compass.
His discovery, published in a scienti®c journal,
gets noticed.
Andre AmpeÁre gives mathematical form to
Oersted's discovery. In modern language, AmpeÁre's
law states that: an electric current creates a magnetic
®eld.
Biot and Savart propose a force law between an
electric current and a magnetic ®eld.
Andre AmpeÁre shows that two wires carrying
electric currents attract each other.
Georg Ohm shows that current and voltage are
related by a simple relationship, known today as
Ohm's law.
Michael Faraday showed experimentally that a
changing magnetic ®eld produces an electric current
(Faraday's law).
Friedrich Bessel ®rst observes the parallax of a star
with the aid of a telescope.
Christian Doppler proposes his Doppler Effect for
sound and light waves.
James Joule measures the electrical equivalent of
heat.
Gustav Kirchhoff proposes his rules of electrical
circuits (Kirchhoff's laws).
William Thomson, Lord Kelvin, devises what is now
known as the absolute temperature scale or Kelvin
scale.
Rudolf Gottlieb, known as Clausius, states the
second law of thermodynamics.
Armand Fizeau measures the velocity of light in a
moving medium.
James Clerk Maxwell proposes the electromagnetic
nature of light and suggests that electromagnetic
waves exist and are observed as light.
611
SUPERSTRINGS AND OTHER THINGS
1869
1873
1887
Dmitri Mendeleyev proposes his periodic table of
the chemical elements.
Johannes van der Waals develops his theory of
intermolecular forces in ¯uids.
Heinrich Hertz generates electromagnetic waves in
his laboratory.
Modern Physics
1887
1895
1890
1897
1898
1898
1900
1905
1905
1905
1905
1906
1909
1911
1911
1913
1915
Albert Michelson and E W Morley, in a landmark
experiment, demonstrate the absence of the ether,
a substance previously postulated to ®ll all space.
Wilhelm RoÈntgen discovers X-rays.
James Prescott Joule measures the mechanical
equivalent of heat.
J J Thomson determines the charge to mass ratio of
the electron.
Pierre and Marie Curie discover the radioactive
elements radium and polonium.
Ernest Rutherford discovers alpha and beta
radiation.
Max Planck introduces the concept of quanta in
black body radiation and Planck's constant.
Albert Einstein explains Brownian motion.
Albert Einstein explains the photoelectric effect.
Albert Einstein publishes his special theory of
relativity.
Albert Einstein postulates the equivalence of mass
and energy.
Albert Einstein proposes quantum explanation of
the speci®c heat laws for solids.
Robert Millikan measures the charge on the electron.
Heike Kamerlingh Onnes introduces his theory of
superconductivity.
Ernest Rutherford discovers the nucleus of the atom.
Niels Bohr proposes his quantum theory of atomic
orbits.
Albert Einstein publishes his general theory of
relativity.
612
Appendix D
1916
1917
1918
1923
1925
1926
1926
1927
1927
1930
1932
1932
1942
1948
1948
1953
1954
1956
1957
1961
Karl Schwarzschild calculates the critical radius of
curvature of space±time around a collapsing star at
which light cannot escape.
Albert Einstein presents his theory of stimulated
emission, the foundation for the laser.
Emmy NoÈther proposes the mathematical
relationships between symmetry and conservation
laws of physics.
Louis de Broglie predicts the wave nature of particles.
Werner Heisenberg develops matrix mechanics,
the ®rst quantum mechanical theory.
Erwin SchroÈdinger develops wave mechanics,
an alternate quantum mechanical theory.
Werner Heisenberg proposes the uncertainty
principle.
Niels Bohr proposes the principle of
complementarity.
Niels Bohr develops the Copenhagen interpretation
of quantum mechanics.
Ernest Orlando Lawrence and M Stanley Livingston
invent the cyclotron.
James Chadwick identi®es the neutron.
Werner Heisenberg proposes that the nucleus of an
atom is composed of protons and neutrons.
Enrico Fermi obtains the ®rst self sustaining ®ssion
reaction.
Sin-itiro Tomonaga, Julian Schwinger, and
Richard Feynman develop quantum
electrodynamics (QED).
John Bardeen, Walter Brattain, and William
Shockley invent the transistor.
Charles Townes invents the maser.
C N Yang and Robert L Mills propose a non-abelian
gauge theory.
Murray Gell-Mann and Kazuhiko Nishijima
introduce the strangeness quantum number.
John Bardeen, Leon Cooper, and John R Schrieffer
propose their BCS theory of superconductivity.
Sheldon Glashow introduces the neutral intermediate
vector boson of electroweak interactions.
613
SUPERSTRINGS AND OTHER THINGS
1961
1964
1964
1965
1965
1967
1974
1977
1981
1981
1982
1983
1994
2000
Murray Gell-Mann and Yuval Ne'eman
independendently discover the SU(3) octet
symmetry of hadrons.
Peter Higgs, Robert Brout, and F Englert introduce
the Higgs mechanism of symmetry breaking.
Murray Gell-Mann and George Zweig
independently propose the quark theory of hadrons.
John Stewart Bell states and proves a powerful
theorem (Bell's theorem), which gives the
theoretical limits on the correlations between the
results of simultaneous measurements done on two
separated particles. The limits on these correlations
are given by Bell in the form of an inequality.
Arno Penzias and Robert Wilson measure the
cosmic background radiation.
Steven Weinberg and Abdus Salam independently
propose the electroweak uni®cation which is based
on signi®cant contributions by Sheldon Glashow.
The three would later share the Nobel Prize in
physics for their theory.
Howard Georgi and Sheldon Glashow propose the
SU(5) as a Grand Uni®ed Theory and predict decay
of the proton.
A Fermilab team detects the bottom quark.
Michael Green and John Schwarz propose what
becomes known as Type I superstring theory.
Gerd Binnig and Heinrich Rohrer invent the
scanning tunneling microscope.
Alain Aspect performs an experiment that is
considered to con®rm the non-local aspects of
quantum mechanics.
Carlo Rubbia leads a team that detects the W and Z
bosons at CERN.
A Fermilab team detects the top quark.
Tantalizing hints of the existence of the Higgs boson
are seen in experiments with the Large Electron
Positron collider at CERN, the European Laboratory
for Particle Physics.
614
GLOSSARY
Absolute Zero: Temperature at which no thermal energy can
be extracted from an object. It is the minimum temperature
attainable. It is equal to ÿ273.158C.
Acceleration: The rate at which velocity changes. The SI units
of acceleration are m/s2 .
Alpha decay: If a nucleus contains too many protons, it is
unstable and emits an alpha particle, which is a nucleus of
helium-4.
Alpha particle: See alpha decay. A stable nuclear particle that
consists of two protons and two neutrons. An alpha particle
is the nucleus of helium.
Amorphous solid: See Crystal.
AmpeÁre's law: An electric current produces a magnetic ®eld.
Amplitude: The amplitude of an oscillation is the maximum
displacement of the medium from its equilibrium position.
Angular momentum: A measure of the rotation of an object.
It is the tendency of a rotating object to keep rotating because
of its inertia. Angular momentum can be expressed as L I!.
The angular momentum of a body is conserved if the net
external torque acting on the object is zero. This is the law of
conservation of angular momentum.
Angular velocity: The rate of change of angular displacement
with time.
Antimatter: Antimatter particles, called antiparticles, are
identical to ordinary matter except that if the particle has
electric charge, its antiparticle would have the opposite
charge. If a particle has no charge, like the photon, it is its
own antiparticle.
Archimedes' principle: An object partially or completely
submerged in a ¯uid is buoyed up by a force equal to the
weight of the ¯uid displaced by the object.
615
SUPERSTRINGS AND OTHER THINGS
Atomic mass unit, amu: A unit of mass used in the atomic
realm. 1 amu 1:660 540 2 10ÿ27 kg.
Atomic number: The total number of protons in a nucleus.
Average speed: De®ned as the total distance traveled divided
by the time taken to travel this distance. The units of speed
are units of distance divided by units of time. The SI unit is
the meter per second (m/s).
Baryon: A subatomic particle composed of three quarks held
together by the color force.
Beta decay: If a nucleus contains too many neutrons it is
unstable and decays by emitting a beta particle, which is an
electron.
Beta particle: See beta decay. An electron that is emitted from
the nucleus of an atom undergoing beta decay.
Binding energy: The total energy of the nucleus is less than the
total energy of its separated nucleons. This energy difference
is called binding energy.
Black body: An object that absorbs all radiation incident upon
it. It is also a perfect emitter of radiation.
Black hole: If a star has a mass of more than 3 solar masses,
gravitational compression will make the star so dense that
the escape velocity from it becomes greater than the speed of
light. The star contracts to a single point, called a singularity
or ``black hole.''
Boyle's law: See Ideal gas law.
Buoyant force: Upward force exerted by a ¯uid on a ¯oating or
immersed object as a reaction to the force exerted by the
object to displace the ¯uid.
Calorie: The amount of heat required to raise the temperature
of 1 gram of water by 18C. It is equal to 4.186 J.
Capacitor: A device for the storage of electrical energy. It
consists of two oppositely charged metal plates separated by
an insulator.
Centripetal acceleration: An object moving with a constant
speed v in a circular path of radius r has an acceleration
directed toward the center of the circle called centripetal
acceleration. It has a magnitude v2 =r.
Chain reaction: A reaction in which some of the products
initiate further reactions of the same kind allowing the
reaction to become self-sustaining.
616
Glossary
Charles's law: Another name for Gay-Lussac's law (q.v.).
Coherent radiation: Electromagnetic radiation such as is seen
in radio waves and laser beams, where all the radiation is of
a single frequency and all the photons are in phase (in step).
The coherence length is a measure of the quality of the
coherence; the bandwidth is another.
Color charge: A measure of the strength of the strong
interaction.
Compound: If the atoms retain their identities while they attract
each other owing to the mutual attraction of their respective
ions (ionic bond), the atoms are said to form a compound.
Concave mirror: A curved mirror in which the interior surface
is the re¯ecting surface.
Conservative and nonconservative forces: When the work
done by an unbalanced force acting on a body depends only
on the initial and ®nal positions of the body, the force is said
to be a conservative. If the work done depends on the path
taken by the body, the force producing this motion is said to
be nonconservative.
Constructive interference: See interference.
Convex mirror: A curved mirror which has the exterior surface
as the re¯ecting one.
coulomb: SI unit of electric charge. One coulomb is the charge
of 6:25 1018 electrons or an equal number of protons.
Coulomb's law: The force exerted by one charged object on
another varies inversely as the square of the distance
separating the objects and is proportional to the product of
the magnitude of the charges. The force is along the line
joining the charges and is attractive if the charges have
opposite signs and repulsive if they have the same sign. If we
call q1 and q2 the magnitudes of the two charges and r, the
distance between their centers, we can state Coulomb's law
as an equation: F k q1 q2 =r2 . The value of Coulomb's
constant k is 9 109 N m2 =C2 .
Covalent bond: A type of chemical attraction that depends on
the fact that the presence of two electrons in a certain region
of space is energetically advantageous. In a covalent bond,
atoms are bound together by sharing electrons.
Critical angle: When light is passing from a medium of higher
index of refraction to one of low index of refraction, the
617
SUPERSTRINGS AND OTHER THINGS
critical angle is the angle of incidence for which the angle of
refraction is 908 (i.e., the emergent beam travels along the
interface).
Crystal: The forces that bind the atoms together in a solid are
strong enough for the solid to maintain its shape. If the atoms
arrange themselves in a pattern that is repeated through the
substance, the solid is called a crystal. Solids that do not form
these patterns are said to be amorphous.
de Broglie wavelength: The wavelength associated with a
particle, equal to the ratio of Planck's constant to the
momentum of the particle.
decibel: The unit of sound level, a measure of relative sound
levels.
Density: The mass per unit volume of a substance. The SI unit
is the kilogram per cubic meter (kg/m3 ).
Destructive interference: See interference.
Dewpoint: See Humidity.
Diffraction: The spreading out of waves on passing through a
narrow aperture.
Diode: A device that acts like a switch in an electric circuit,
permitting the ¯ow of current in only one direction.
Doppler effect: The change in frequency perceived by a listener
who is in motion relative to a source of sound.
Ef®ciency: The ratio of the useful work performed by a
machine to the total amount of energy required to operate it.
Elastic collision: A collision in which the kinetic energy is
conserved.
Elastic potential energy: In a spring of force constant k
stretched a distance x, the elastic potential energy is 12 kx2 .
Electric charge: A property of subatomic particles that is
responsible for electric and magnetic phenomena. The
fundamental charge is the charge of one electron or one
proton and has a magnitude of 1:602 10ÿ19 C.
Electric current: The rate at which charge ¯ows in a conductor.
If during a time t an amount of charge q ¯ows past a
particular point in a conductor, the electric current is i q=t.
The unit of current is the ampere (A), which is a fundamental
SI unit.
Electric ®eld: Property of space around an electric charge. The
electrostatic force per unit charge.
618
Glossary
Electric potential difference: See Voltage.
Electromagnetic wave: Propagation of oscillating electric and
magnetic ®elds through space.
Electron: A fundamental particle, one of the main constituents
of matter. The electron has an electric charge of
ÿ1:602 10ÿ19 C and a mass of 9:1094 10ÿ31 kg or
5:486 10ÿ4 amu.
Electron-volt (eV): Unit of energy used when dealing with
atoms or electrons. 1 eV 1:602 10ÿ19 J.
Electrostatic force: See Coulomb's law.
Electroweak force: Uni®cation of electromagnetism and the
weak nuclear force. The triplet of massive bosons W , W ÿ ,
and Z, along with the massless photons, are the mediators of
this force.
Energy: The capacity to do work or the result of doing work.
Entropy: From a Greek word that means transformation, entropy
is a measure of the disorder of a system.
Escape velocity: The escape velocity of an object on the surface
of the earth is the minimum velocity that we must impart to
the object so that it escapes the gravitational grasp of the
earth.
Event horizon: See Schwartzchild radius. A sphere around a
black hole with a radius equal to the Schwartzchild radius.
No particle inside this sphere can escape the gravitational
attraction of the black hole.
Faraday's law of induction: The induced voltage in a circuit is
proportional to the rate of change of the magnetic ®eld.
Field: The concept of ®eld is used to specify a quantity for all
points in a particular region of space. The electric ®eld
describes the property of the space around an electrically
charged object. A charged body distorts the space around it in
such a way that any other charged body placed in this space
feels a force that is given by Coulomb's law. The electric ®eld
strength E is the Coulomb force felt by a test charge q0 that is
placed in the ®eld divided by the magnitude of this test
charge q0 , E F=q0 . The direction of the electric ®eld vector is
the direction of the force on a positive test charge.
Fission: See Nuclear ®ssion.
Fluids: Substances in which the binding forces are weaker
than in solids, so that the atoms or molecules do not occupy
619
SUPERSTRINGS AND OTHER THINGS
®xed positions, and move at random. Liquids and gases are
¯uids.
Focal point: The point at which all of the rays gathered by a
lens, curved mirror, or optical instrument pass.
Frame of reference: See reference frame.
Frequency: The number of wave crests that pass a given point
per second. The unit of frequency is the hertz, Hz.
f-stop: The ratio of the focal length of a lens to the diameter of
its aperture.
Fundamental charge: The charge on one electron or one proton,
e 1:6 10ÿ19 C. The electric charge on an charged object
always occurs in integral multiples of the fundamental charge.
Fundamental forces: There are four fundamental forces in
nature: gravitational, electromagnetic, strong, and weak. The
strong force, a short-range force, holds the nucleus together.
The weak force, also a short-range force, is responsible for
radioactive beta-decay processes.
Fundamental units: The fundamental quantities in mechanics
are length, mass, and time. The corresponding fundamental SI
units are the meter, the kilogram and the second. The other
fundamental SI units are the ampere, lumen, kelvin and mole.
Fusion: See Nuclear fusion.
Galilean Principle of Relativity: The laws of mechanics are the
same in all inertial frames of reference. This means that there
is no special or absolute reference frame. Thus, there is no
absolute standard of rest; uniform motion has to be referred
to an inertial frame.
Gamma decay: If a nucleus is left in an excited state after an
alpha or beta process, it will decay to the ground state by
emitting one or more photons, called gamma rays.
Gamma rays: Electromagnetic radiation with frequencies
greater than about 3 1019 Hz and wavelengths smaller than
about 10ÿ11 m.
Gauge symmetry: Physical theories that remain invariant under
changes taking place everywhere in the universe are said to
obey a global gauge symmetry. When the changes are
different at every point in space, the theory is said to obey a
local gauge symmetry. To maintain a local symmetry, in
which different changes take place at different points or to
different objects, a compensating change must take place.
620
Glossary
Gay-Lussac's law: See Ideal gas law.
Gravitational force: See Law of Universal Gravitation.
Gravitational potential energy: The energy that a body
possesses by virtue of its separation from the Earth's surface.
For an object of mass m situated at a height h above the
ground, the gravitational potential energy is PEgrav mgh.
Gravitational red shift: Einstein showed that, according to
general relativity, time runs more slowly in a gravitational
®eld and that celestial objects in a strong gravitational ®eld
would show a spectral shift toward longer wavelengths.
Gravity: See Law of Universal Gravitation.
Hadrons: Particles that participate in the strong interaction.
Hadrons are not fundamental particles; they have a de®nite
extension. Hadrons that decay into a proton and another stable
particle are baryons. The remaining hadrons are the mesons.
Half-life: The time required for half the nuclei in a given
radioactive sample to decay.
Heat: The thermal energy transferred from a warmer object to a
cooler object. The SI unit of heat is the joule; another unit of
heat, de®ned during the times of the caloric theory, is the
calorie, cal, which is the amount of heat required to raise the
temperature of 1 gram of water by 18C. The relation between
calories and joules is 1 cal 4:186 J.
Heat capacity: The heat required to increase the temperature of
a mass m of the substance an amount T. The speci®c heat
capacity, or heat capacity per unit mass, is the heat required to
raise the temperature of a unit mass of a substance by one kelvin.
If Q is the amount of heat required; m, the mass of the
substance, and T the change in temperature, the speci®c
heat capacity c Q=m T. The SI unit of speci®c heat
capacity is the joule per kilogram kelvin ( J/kg K).
Heat of fusion: See latent heat.
Heat of vaporization: See latent heat.
Heisenberg's Uncertainty Principle: It is not possible to
measure the exact position and the exact momentum of a
particle simultaneously.
hertz: The SI unit of frequency. One cycle per second.
Higgs ®eld: See Higgs mechanism. A quantum ®eld with very
special properties that allows it to give mass to the
elementary particles when they interact with this ®eld.
621
SUPERSTRINGS AND OTHER THINGS
Higgs mechanism: See Higgs ®eld. The Higgs mechanism is a
special set of circumstances such that the state in which the
Higgs ®eld has its lowest energy is one of broken symmetry.
Hologram, holography: The recording of an image by
recording the standing-wave pattern caused by the
interaction of two coherent beams of radiation, one having
been modi®ed by interaction with the object.
Humidity: Measure of the amount of water present in the air at
any given time. Absolute humidity (AH) is the total mass of
water vapor present in the air per unit volume, generally
given in g=m3 . Humidity at saturation, HS, is the mass per unit
volume of water vapor required to saturate the air. Relative
humidity, RH, is the ratio of the absolute humidity to the
humidity at saturation: RH AH=HS. The temperature at
which the air saturates is called the dew point.
Ideal gas: Any gas in which the cohesive forces between
molecules are negligible and the collisions between molecules
are perfectly elastic. Collisions between the molecules of an
ideal gas conserve both momentum and kinetic energy. If an
ideal gas is kept at constant temperature, the pressure is
inversely proportional to its volume; that is, the product of
the pressure and the volume is a constant. This is Boyle's
law. The constant is the same for all gases. If the pressure of
an ideal gas is kept constant, a change in volume is
proportional to the change in absolute temperature. This is
Gay-Lussac's law, also known as Charles' law.
Ideal gas law: By combining Boyle's law and Gay-Lussac's law
into one single expression, we obtain the ideal gas law,
PV constant T.
Image: Strictly, optical image. Formed where light rays
intersect or where they appear to have originated from. A real
image is one formed by light rays actually intersecting
whereas a virtual image is an image formed by light rays
which appear to come from a point in space.
Index of refraction: For a transparent substance, the ratio of the
speed of light in vacuum to that in the substance.
Inelastic collision: A collision in which kinetic energy is not
conserved. Momentum is conserved.
Inertia: The tendency of an object to resist any change in its
state of motion.
622
Glossary
Inertial reference frame: A reference frame in which the law of
inertia holds. Inertial reference frames move at constant
velocities.
Infrared radiation: Electromagnetic radiation in the region of
the spectrum between about 4 1014 and 1011 Hz and
wavelengths between about 7:5 10ÿ7 m and 3 10ÿ3 m.
Instantaneous speed: The speed that an object has at any given
instant.
Intensity: The rate at which a wave transports energy.
Interference: Two or more trains of the same frequency sharing
the same space will interfere constructively or destructively,
depending on whether the combined amplitude is greater or
less than the component wave amplitudes.
Internal energy: The internal energy, U, of a system is the sum
of all forms of energy, thermal energy and potential energy.
Thermal energy is the sum of all the random kinetic energies
of the atoms and molecules in a substance and potential energy
is the energy stored in the molecules, atoms, and nuclei of a
substance.
Invariance: If a system remains unchanged after some
operation is performed on it, we say that the system is
invariant under that operation.
Inverse square law: A mathematical expression in which a value
for a quantity varies inversely with the square of the distance.
Ion: An atom or molecule with a net electric charge.
Ionic bond: Bonding due to the electrical attraction between
oppositely charged ions.
Ionizing radiation: This is produced by a particle or a photon
with enough energy to remove an electron from an atom.
Isobaric process: A process that occurs at constant pressure.
In this case, the work done is W P V.
Isotope: One of several forms of an element having the same
number of protons but a different number of neutrons.
Joule ( J): The SI unit of energy or work, equal to a newton
meter (Nm).
Kepler's Laws of Planetary Motion: In the early 1600s, Kepler
discovered the three laws of planetary motion that bear his
name:
Law of orbits: Each planet moves around the sun in an elliptical
orbit, with the sun at one focus.
623
SUPERSTRINGS AND OTHER THINGS
Law of areas: A planet moves around the sun at a rate such that
the line from the sun to the planet sweeps out equal areas in
equal intervals of time.
Harmonic law or law of periods: The squares of the periods of any
two planets are proportional to the cubes of their average
distances from the sun.
Kinetic energy: The energy that an object has by virtue of its
motion. It is equal to one-half the product of the mass m
and the square of the speed v: KE 12 mv2 .
Laser radiation: Laser is an acronym for light ampli®cation by
stimulated emission of radiation. A laser is a device which
produces a narrow beam of single-wavelength, coherent
radiation (q.v.) by stimulated emission of photons.
Latent heat: The heat absorbed or released by one kilogram of a
substance during a phase transition. If the transition is from
solid to liquid or vice versa, it is called latent heat of fusion, Lf .
If the transition involves the liquid and gas phases of a
substance, it is called the latent heat of vaporization, Lv . The
heat required to melt a solid of mass m is given by Q mLf ;
the heat required to vaporize a liquid of mass m is Q mLv .
Laws of re¯ection: The angle of re¯ection is equal to the angle
of incidence. The incident ray, the re¯ected ray and the
normal are all in the same plane.
Law of refraction (Snell's law): The angle of refraction is in a
constant relationship to the angle of incidence, and the
incident and refracted rays are in the same plane as the
normal.
Law of Universal Gravitation: Any two objects of mass M and
m, separated by a distance r, will attract each other with a
force proportional to the product of their masses and
inversely proportional to the square of their distance apart.
The constant of proportionality is the universal constant G,
with a value of 6:67 10ÿ11 N m2 =kg2 .
Laws of thermodynamics: These are numbered zeroth, ®rst,
second, and third:
0 If two objects are each in thermal equilibrium with a third
object, they are in thermal equilibrium with each other.
1 In an isolated system, the total internal energy of a system
remains constant, although it can change from one kind to
another.
624
Glossary
2 Heat does not pass spontaneously from a cold to a hot object.
Another way of stating this is: All natural changes take place
in the direction of increasing entropy.
3 It is impossible to reach absolute zero temperature in a
process with a ®nite number of steps.
Length contraction (relativistic): An observer in motion relative
to an object measures the length of that object along the
direction of motion to be contracted when compared with the
length measured by an observer at rest relative to the object.
Leptons: Particles that interact via the weak force. All leptons
are truly elementary particles, without internal structure.
Leptons are classi®ed in three generations, each containing a
charged lepton and a neutrino.
Lever arm: The perpendicular distance from the center of
rotation to the point of application of a force.
light year: The distance traveled by light in one year. It is equal
to about 9:5 1012 km.
Magnetic ®eld: The property of space around a magnet.
Because magnetic poles exist only in pairs, magnetic ®eld
lines do not start or end anywhere, an essential difference
from electric ®eld lines, which start on positive charges and
end on negative charges.
Magnetosphere: The volume around the Earth that is
in¯uenced by the Earth's magnetic ®eld. It is believed to be
caused by motions of the metallic core which produce electric
currents in the hot, electrically conductive material; these
currents ¯ow upward and are in turn carried around by the
Earth's fast rotation. The magnetosphere traps some matter
from the solar wind (q.v.).
Magni®cation: In a lens or mirror system, the ratio of the size
of the optical image to the size of the object.
Mass number: The total number of protons and neutrons in a
nucleus.
Maxwell's equations: The four equations by which James Clerk
Maxwell described the relationship between electricity and
magnetism and provided a model for the propagation of
electromagnetic radiation. To state them precisely requires a
use of vector calculus.
Meniscus: The curved surface of a liquid in a container
produced by the cohesive forces between the liquid
625
SUPERSTRINGS AND OTHER THINGS
molecules and the adhesive forces between the liquid and the
container.
Meson: Particle with a mass that falls between the mass of the
electron and the mass of the proton. Mesons are
combinations of a quark and an antiquark.
Metastable state: An excited energy state of an atom with a
longer life time than that of regular excited states.
meter: SI unit of length. The meter is de®ned as the distance
traveled by light in 1/299 792 458 of a second.
Microwaves: Electromagnetic radiation in the region of the
spectrum between about 3 108 and 3 1011 Hz and
wavelengths between about 10ÿ4 m and 1 m.
Molecule: A structure formed when atoms combine in such a
way as to share some of their electrons.
Moment of inertia: The moment of inertia of a body measures
its resistance to change in its state of rotation about a given
axis.
Muon: A fundamental particle with a mass about 200 times
that of the electron. A muon is a lepton that decays into an
electron and neutrinos.
Newton: The SI unit of force. One newton is one kilogram
metre per second squared (kgm/s2 ).
Newton's laws of motion:
Newton's First Law: Every body continues in its state of rest or of
uniform motion in a straight line unless it is compelled to
change that state by forces impressed upon it.
Newton's Second Law: The acceleration of an object is directly
proportional to the net force acting on it and inversely
proportional to its mass.
Newton's Third Law: To every action there is always an equal
reaction. The mutual actions of two bodies upon each other are
always equal and directed to contrary parts.
NoÈther's theorem: The theorem that establishes the connection
between symmetry and the laws of physics. It can be stated
as follows: For every continuous symmetry in the laws of
physics there exists a corresponding conservation law.
A continuous symmetry is one in which the corresponding
transformation can be varied continuously, as in a rotation.
Nuclear ®ssion: The event which occurs when a heavy nucleus
splits into two smaller nuclei (called ®ssion fragments).
626
Glossary
Nuclear fusion: This takes place when two light nuclei are
fused together to form a heavier nucleus.
Object and image distances (conjugate foci): The distance
from the object and image respectively from an optical mirror
or lens. For a plane mirror, the object distance is always
equal to the image distance.
Ohm's law: The current ¯owing through a conductor is directly
proportional to the voltage V that exists between the two
ends of the conductor, or i V=R, where R is the resistance
of the conductor.
Pair annihilation: See pair production. When a particle
encounters its antiparticle, they annihilate each other,
disappearing in a burst of photons.
Pair production: See pair annihilation. The inverse process, in
which high energy photons create a positron-electron pair.
Pascal's principle: This states that the pressure applied to a
liquid is transmitted undiminished to all points of the liquid
and to the walls of the container.
Period: The time required to complete one cycle of a periodic
motion. Period is the inverse of frequency (q.v.).
Photoelectric effect: Light of a certain frequency incident on a
substance causes electrons to be emitted from the substance.
Photon: Quantum of light. The photon is the mediator of the
electric and magnetic forces. The character of the force
depends on the polarization of the exchanged photon.
Longitudinal and time-like polarization mediate the
electrostatic force while the magnetic force is mediated by the
other two polarizations. The photon is a boson with spin 1
and zero mass.
Pion: The particle that mediates the strong force. Also called
the -meson.
Polarization: Orientation of the oscillation vector of a wave or
of the rotation axis of a spinning object.
Positron: An antielectron. A fundamental particle with the
same mass as the electron and a positive electric charge of
the same magnitude as that of the electron. A constituent of
antimatter.
Potential energy: The energy that an object has by virtue of its
position in a ®eld.
Power: The rate at which work is done or energy is released.
627
SUPERSTRINGS AND OTHER THINGS
Pressure: Force per unit area. The SI unit of pressure is the pascal
(Pa). One pascal is one newton per square meter (N/m2 ).
Normal atmospheric pressure at sea level is 101.3 kPa or 1 atm.
Principle of equivalence: This states that it is impossible to
distinguish the effects of accelerated motion from the effects
of gravity. It extends the relativity principle to accelerated
frames of reference. The principle of equivalence can be
stated in the alternative form: Gravitational mass and inertial
mass are equivalent and no experiment can distinguish one
from the other.
Proper time: The time interval measured by an observer in his
own reference frame.
Proton: One of the main constituents of matter. The proton has
an electric charge of 1:602 10ÿ19 C and a mass of
1:6726 10ÿ27 kg or 1.007 276 amu.
Quanta: Packets of energy, introduced in 1900 by Max Planck
to explain the behavior of the radiation emitted by a hot
body. Albert Einstein generalized this revolutionary concept
and stated that light behaves both as a wave and as quanta of
energy or photons (q.v.).
Quantum chromodynamics (QCD): The theory that explains
the color interactions between quarks. This theory is modeled
after quantum electrodynamics or QED.
Quarks: Quarks are believed to be truly fundamental particles.
Hadrons are thought to be composed of quarks. There are six
¯avors or varieties of quarks: Up and down, strange and charm,
and bottom and top. Like leptons, quarks are also grouped in
generations. The strong force arises from the interaction
between quarks. Quarks possess a kind of charge called color
charge: red, green and blue. All hadrons are color-neutral or
``white.''
radian: The central angle in a circle subtented by an arc length
equal to the radius of the circle. It is equal to 57.38.
Radio waves: Electromagnetic radiation in the region of the
spectrum smaller than about 3 108 Hz and wavelengths
greater than about 1 m.
Radioactive decay: See radioactivity.
Radioactivity: Emission of several particles such as electrons,
photons, neutrons, neutrinos or positrons, due to the decay of
unstable nuclei.
628
Glossary
Resistance: In a conducting medium, the ratio of the voltage to
the current is a constant, the resistance of the conductor. The
units of resistance are ohms, .
Schwartzchild radius: See event horizon. The radius to which a
given object must be reduced so that its escape velocity
equals the speed of light. It is the event horizon of a black
hole.
Semiconductor: Materials with electrical conductivities that are
intermediate between those of conductors and insulators. The
conductivity of a semiconductor is changed by the addition
of small amounts of impurities to its crystal structure.
Solar wind: The stream of energetic particles emitted by the
Sun.
Sound: A mechanical longitudinal wave that propagates
through a medium with frequencies that range from a
fraction of a hertz to several megahertz. Audible waves are
sound waves with frequencies between 20 and 20 000 Hz.
Sound waves with frequencies below 20 Hz are called
infrasonic waves and those with frequencies above 20 000 Hz
are called ultrasonic waves.
Space-time: A four-dimensional geometry consisting of the
three coordinates of space and one of time.
Speci®c heat: The heat required to increase the temperature of
1 gram of a substance by one degree Celsius.
Speed: See Velocity. The magnitude of the velocity.
Speed of sound: This depends on the elastic and inertial
properties, and temperature, of the transmitting medium.
In air, the speed of sound at 208C is 343 m/s.
Standing waves: In general, the resultant of two identical wave
motions of equal amplitude and wavelength traveling in
opposite directions. The points that do not move are called
nodes. Standing waves can be seen on a stretched string ®xed
at both ends. The natural frequencies of vibration form a
harmonic series. Any two coherent beams of radiation whose
paths cross will generate a standing waveform, and this is
the principle of holography (q.v.).
Strong force: See nuclear force. Holds the nucleons together in a
nucleus. It is a short-range force, becoming negligible at
distances greater that 10ÿ15 m. The strong force acts on
protons and neutrons but not on electrons, neutrinos or
629
SUPERSTRINGS AND OTHER THINGS
photons. The strong force is 137 times stronger than the
electromagnetic force.
Supergravity: Theory that attempts to unify gravity with the
other three forces with the use of a powerful new gauge
symmetry that unites quarks and leptons with messenger
particles called supersymmetry. Supergravity theories are
formulated in more than four space-time dimensions.
Superstring theory: A theory that promises to provide a
uni®ed description of all the forces of nature. According
to this theory, all elementary particles are represented by
strings, open or closed, no more than 10ÿ35 m in dimensions.
Surface tension: The intermolecular forces that act on the
molecules on the surface of a liquid to make the surface of
the liquid as small as possible. These forces are also
responsible for the rising of the liquid in very thin tubes, a
phenomenon known as capillarity.
Symmetry: If something remains unchanged after some
operation is performed on it, it is said to be symmetric under
that operation.
Temperature: A measure of the average random kinetic energy
per molecule of a substance.
Thermal expansion: The proportional change in length or area
or volume when a change in temperature has occurred.
Time dilation: Time in the moving reference frame always
¯ows more slowly than in the stationary reference frame.
Torque: The product of the applied force and the lever arm
length. The ability to rotate an object depends on the applied
torque.
Total internal re¯ection: The re¯ection that occurs when light
is incident from a medium with a high index of refraction to
one with a low index of refraction at angles of incidence
greater than the critical angle (q.v.). The light beam then
obeys the laws of re¯ection (q.v.).
Transistor: A semiconductor device that can act as a current
switch and as an electronic ampli®er in a circuit.
Ultraviolet radiation: Electromagnetic radiation in the region of
the spectrum between about 7:5 1014 and 3 1018 Hz and
wavelengths between about 4 10ÿ7 m and 10ÿ10 m.
Uncertainty principle: See Heisenberg's Uncertainty Principle.
Uniform motion: Motion at a constant velocity.
630
Glossary
Uniformly accelerated motion: Motion with constant
acceleration. One important example of uniformly accelerated
motion is the vertical motion of an object falling towards the
ground due to the gravitational attraction of the Earth.
Another is circular motion at constant speed.
Universal law of gravitation: See Law of universal gravitation.
Vector quantities: Quantities that require both magnitude and
direction for their complete speci®cation, e.g. velocity.
Velocity: The speed and the direction of motion of an object.
Virtual particles: Particles that exist only for the brief moment
allowed by Heisenberg's uncertainty principle.
Visible light: Electromagnetic radiation in the region of the
spectrum between about 4 1014 and 7:5 1014 Hz and
wavelengths between about 7:5 10ÿ7 m and 4 10ÿ7 m.
Voltage: The change in electric potential energy of a charge
divided by the magnitude of that charge.
Wave: A mechanism for the transmission of energy in which the
medium itself does not travel. In particular, electromagnetic
radiation does not need a medium to propagate.
Wavelength: The distance between two identical points of a
periodic wave.
Weak force: See strong force. Responsible for radioactive
-decay processes, such as the transformation within the
nucleus of a neutron into a proton or a proton into a neutron.
The weak force controls many of the reactions that produce
energy in the sun and the stars. The weak force is some
hundred thousand times weaker than the strong force.
Weight: The gravitational force with which the Earth attracts an
object towards its center when the object is on or near the
Earth's surface.
Work function: The minimum amount of energy required to
release electrons from a particular metal.
Work: When a constant force acts on an object along the
direction of motion of the object, the work done on the object
is equal to product of the force and the distance that the
object moves. The SI unit of work is the joule ( J), equal to
1 newton meter (Nm).
X-rays: Electromagnetic radiation in the region of the spectrum
between about 3 1016 and 1021 Hz and wavelengths
between about 10ÿ8 m and 3 10ÿ13 m.
631
INDEX
Abott, Edwin A 475
absolute humidity 214
absolute zero 224, 233±4
absorption spectrum 419
acceleration 34±5, 58, 469, 471, 472
due to gravity 37, 40
accommodation 401
action and reaction, law of 59±61
activated-charcoal detectors 551
addition of velocities 464±8
adhesive forces 182
ñther 45, 446
air 45, 167, 168
air bag systems 90
airplane wing 185
Almagest 114
alpha decay 532, 534±5
alpha particles 137, 138, 160
alpha rays 137
alpha-track devices 551
Alvarez, Luis 10±13
Alvarez, Walter 10±13
amber 247, 268
amorphous materials 164
AmpeÁre, Andre Marie 296±7, 307
AmpeÁre's law 297, 406
amplitude 325, 328
Anderson, Carl D 555
Andromeda 458
angle of incidence 373, 384, 391
angle of re¯ection 372
angle of refraction 384, 391
angular momentum 102±7
angular velocity 100±1
antielectrons 555
antimatter 553±5
antineutrino 535
antinode 359
antiparticles 555
antiphase 329
anvil 348
aperture 395
Apollo missions 125
applied electricity 267±90
aqueous humor 399
Arago, Dominique 424
Archimedes 26, 176
Archimedes' principle 177±8
Aristarchus 111, 112, 124
Aristotle 44±7, 133
arti®cial vision 403
Asaro, Frank 11
associated production 565
astronauts 321
Astronomia Nova 117
astronomy 111, 114, 117
atmosphere (unit) 169
atmospheric pressure 168±72, 211,
212
atomic clock 17
atomic hypothesis 133
atomic mass number 154, 160
atomic mass unit 18, 155
atomic number 153±4, 599±601
atomic theory 496±510
atoms 6, 7, 8, 133±50
Bohr model 145±7, 308, 503±6
early concept 134
632
Index
atoms (continued )
®rst models 136±40
new mechanics 511±14
physics 496
Rutherford's nuclear model 140
Thompson's model 137
atria 190
attractive nuclear force 156
audio frequencies 343
auditory canal 348
aurora borealis 306
automobile ef®ciency 86±7
automobile emissions 86
automobile engines 224
average speed 27±30
avian magnetic navigation 307
Avogadro's number 5
Babylonians 111
Bardeen, John 286
Barham, Peter 213
barometer 170, 171
Bartholin, Erasmus 425
baryon number 563±4
baryon octet 566
baryons 563±4
base 286
Basov, N G 433
bastard wing 188±9
battery 135, 268±71
BCS theory 289, 290
Becquerel, Antione Henri 136
Bednorz, J Georg 289±90
bel (unit) 346
Bell, Alexander Graham 346
Bell, John Stewart 529
Bell's inequality 529
Bell's theorem 529
Benton, Stephen 437
Berliner Tageblatt 151
Bernoulli, Daniel 184
Bernoulli's principle 184±6
Bessel, Friedrich 112
beta decay 533, 535±6
beta rays 137
Big Bang 237, 238, 593
bimetallic strip 217±18
binding energy 157, 542
binding energy per nucleon 159, 160,
542
binding forces 165
bird migration 307
black body radiation 496±9
black dwarf 491
black holes 230, 489±95
blood pressure 190±2
Bohm, David 288
Bohr model 145±7, 308, 503±6
Bohr, Niels 83, 140, 145±7, 419, 421,
503, 511, 527
boiling 210±13
boiling point 211
Boltzmann, Ludwig 240
Born, Max 512, 516
bow wave 367
Boyle, Robert 93, 221±3, 342
Boyle's law 223, 225
Bradley, James 408±9
brain, chaos 336±7
brake system 175
Brattain, Walter 286
bright-line 419
Brout, Robert H 579
Bruger, Dionys 475
B-scan 369
bubble chamber 300±1
Buehler, Martin 258
buoyancy 176±9
Cabrera, Blas 293±4
calcite crystals 429
calculus 31, 52, 122
caloric ¯uid 195±6
calorie (unit) 205
camera 393±4
cancer therapy 552
cannon balls 95±7
capacitance 263±4
capacitor 263±4
capillarity 179±84
car seat belt 54
carbon-14 548±9
carbon monoxide 86
cardiovascular system 186±92
Cassegrain focus 399
Cassini 407
633
SUPERSTRINGS AND OTHER THINGS
cathode 135
cathode ray tube (CRT) 135
cathode rays 135
cats, twisting 106±7
Cavendish, Henry 121, 251
CD-ROM drive 101
Celsius, Anders 200
Celsius scale 200±1
centigrade scale 200±1
centripedal acceleration 108±9
centripedal force 108±9, 121
cesium atom 17
cesium clock 17
Chadwick, James 153
chain reaction 541, 543
chaos 333±7, 430
brain 336±7
chaotic motion 6, 334
charge independence 156, 157
charm 568±9
chemical energy 62, 66, 78
chloride ion 148, 150
Chu, Paul C W 289
ciliary muscles 401
Clausius, Rudolf 230, 232
Clausius statement 229
cochlea 349, 350
cochlear duct 349
cochlear implant 352
Cockcroft, J D 302
coef®cient of linear expansion 216±17
Cohen, I Bernard 122
coherent light 431
coherent waves 329, 330
cohesive forces 182
collector 286
collisions 94±5
color 413±17
color charge 581, 590
color force 581±4
commutator 313, 314
compact disc player 434±5
compound 148
Compton, Arthur H 421
concave mirror 376, 397
condensation 209
conduction band 282, 283
conduction electrons 272, 274, 288, 289
conductors 267, 281
conservation laws 563±5
conservation of electric charge 535,
540, 563
conservation of energy 78±97, 563
principle of 80±2
conservation of mechanical energy
80±2
conservation of momentum 91±3
conservation of nucleon number 535,
540
conservative force 75±7, 261
constant angular velocity (CAV) 101
constant linear velocity (CLV) 101
constructive interference 328±30
converging lens 387±8, 394, 399, 402,
404
convex mirror 379, 380
Cooper, Leon N 288
Cooper pairs 288±9
Copenhagen interpretation of
quantum mechanics 527
Copernicus, Nicolaus 114±18
cornea 399, 404
coronary sinus 189
Cosmic Background Explorer (COBE)
satellite 594
Coulomb, Charles 251, 294
Coulomb force 254
Coulomb interactions 589
coulomb (unit) 253
Coulomb's constant 253
Coulomb's law 250±4, 298, 316
covalent bond 148
Cowan, Clyde L 535
Cretaceous 10, 12
critical angle 390, 391
critical mass 543
crystal structure 208
crystals 164
cubit 13
Cui, Wei 494
Curie, Pierre and Marie 136
curve balls 187±9
curved mirrors 376±80
cyclotron 302±4
Dalton, John 134
634
Index
dark-line 6, 419, 421
Da Vinci, Leonardo 25, 405
Davisson, C J 507
Davy, Sir Humphrey 310
de Broglie, Louis 506±9, 514±15
De Magnete 247±8, 292
De Natura Rerum 291
De Revolutionibus 114
Debye, Peter 515
decad 13
decibel (unit) 346
deferents 112
Democritus 134
DeMoivre, Abraham 119
Denisyuk, Yu N 436
density 165±6, 220
common substances 167
dental cavities 275
Descartes, Rene 407
destructive interference 328±30
deuterium 154
deuterium nucleus 155
dew point 215
diaphragm 395
diastole 190
diastolic pressure 192
diatomic molecules 202
diatonic scale 355
diffraction 325, 422
diffuse re¯ection 373
diffusion 232, 240
diodes 284, 286
dipole 308
Dirac equation 554
Dirac ®eld 554
Dirac, Paul Adrien Maurice 293, 514,
553±5
direction 31
direction of time 229
Discourses and Mathematical
Demonstrations Concerning Two
New Sciences Pertaining to
Mechanics and Local Motion see Two
New Sciences
displacement node 359
Ditto, William D 337
diverging lens 387, 396, 402
donor 284
doping 283
Doppler, Christian 362
Doppler effect 362±5, 420, 487
Doppler shift 485, 487±8
double-slit experiment 518±19, 521
drift velocity 272±4
du Fay, Charles FrancËois de Cisterney
248
ear 346±51
eardrum 348
Earth 45, 59, 60, 98±100, 111±13, 121±7
earthquakes 339
Earth's composition 341
Earth's dimensions 20
Earth's magnetism 304±6
Earth's motion 409, 442, 447
Earth's velocity 446, 447
Earwicker, H C 568
Eddington, Arthur 482
ef®ciency 82±7
eightfold way 566
Einstein, Albert 4±6, 8, 81±2, 140, 144,
145, 421, 431, 432±3, 449±53, 486,
500, 511, 527, 585
general theory of relativity 469±95,
503
mass±energy equivalence 465±8
postulates 452±3, 469
simulataneous events 460
spacetime continuum 473±9
special theory of relativity 157,
441±68, 496, 502
Einstein's formula 158
elastic collisions 94±5
elastic potential energy 72±4
electric car 271±2
electric charge 134, 248±50, 261, 406,
563, 575
electric circuit 275±8
electric current 268±70, 280±1, 295±8
electric ®eld 253±6, 272, 284, 371, 427
electric potential 260±2, 277
electric potential difference 262
electric potential energy 261±2
electric power 280±1
electrical energy 88, 280±1, 369
storage 262±6
635
SUPERSTRINGS AND OTHER THINGS
electrical repulsion 156
electrical symbols 277
electricity 134, 247±66
see also applied electricity
electri®cation 248
electrodes 135
electroencephalogram (EEG) 336±7
electromagnetic energy 381
electromagnetic ®eld 575
electromagnetic force 588
electromagnetic induction 311
electromagnetic interaction 557
electromagnetic radiation 371, 496
electromagnetic spectrum 411±12
electromagnetic waves 370, 406, 407,
411, 413, 445
electromagnetism 247±8, 291±318
electron collisions 430
electron microscopes 523±6
electron migration 284±5
electron neutrino 562
electron pump 264
electron volt 66, 262
electronic ear implants 352
electrons 8, 135, 136, 148, 155, 250,
253, 260, 264, 308, 491
electrostatic forces, Mars 258
electrostatic repulsion 155
electroweak force 580
electroweak uni®cation 575±81
elektron 247
elementary particles 553±70
classi®cation 561±2
elements 599±601
four 45
ellipse 116
emission spectrum 419
emitter 286
empirical method 8
endolymph 351
energy 62±77
concept 62, 66
heat as 195±8
transformation 78±9
energy band structures 282
energy conservation 86±7
energy conversion 84±5
energy gap 282
energy level 145±7
energy level diagram 146, 505
energy losses 82±4, 87
energy of mass 81, 82
energy of motion 68
energy of position 69±73
Englert, F 579
entropy 230, 233±9
and time 239±43
EoÈtvoÈs, Roland von 127, 473
epicycles 112, 113
EPR experiment 527±8
equant 113
equilibrium 55
Eratosthenes 20±1
escape velocity 492±3
European Space Agency 258
eustachian tube 348
evaporation 210±13
exchange forces 558±9
expansion 232
of water 218±20
experimentum crucis 415
exposure 395
extraordinary beam 425
eye 399±404, 422
eyepiece 396
f stops 395
Fahrenheit, Gabriel 200
Fahrenheit scale 200±1
falling bodies 37±40
Faraday, Michael 134±5, 248, 254, 309,
316
Faraday's law of induction 309±11,
316, 406
farsightedness 402, 404
Fermi, Enrico 84, 535, 547
Feynman diagram 558, 559, 561
Feynman, Richard 133, 516, 558
®ber optics 392±3
®fth force 129
Finlay-Freundlich, Erwin 482
Finnegans Wake 566, 568
®re 45
®rst law of thermodynamics 226±9
FitzGerald, George 448
Fizeau, Armand Hippolyte Louis 410
636
Index
¯avor doublets 570
¯uids 164±92
de®nition 165
in motion 184±6
¯uorescence 306
focal length 386, 395
focal point 377
foot (unit) 13
forbidden energy gap 282
force, concept of 43±4
force constant 73
force per unit area 167
forward bias 285
Foucault, Jean Bernard LeÂon 410
fovea 402
Franklin, Benjamin 134, 197, 248±50
FraunhoÈfer, Joseph von 6, 417
frequency 324, 328, 343
Fresnel, Augustin 405, 424
friction 29, 76±7, 82, 196, 198
Frisch, Otto 540
FroÈhlich, Herbert 288
fundamental charge 253, 256±60
fundamental forces 555±8
fundamental frequency 333, 359
fundamental mode 331±2, 354
fundamental units 15±17
furlongs 13
fusion 207±9
Gabor, Dennis 435
Galaxy 458±60
Galen 199
Galilean principle of relativity 441±5,
464, 469
Galilean satellites of Jupiter 396, 407
Galilean telescope 396±7
Galileo 26±7, 34±8, 40±2, 45±6, 118,
123, 168, 199±200, 342, 395±7, 441
dialog with Aristotle 47
Law of Inertia 48±52
Two New Sciences 26, 34±6, 168, 407,
441
Galois, Evariste 565±6
gamma decay 533, 536±7
gamma ray detector 550
gamma rays 137, 539, 550
Gamow, George 238
ganglion cells 403
gas, de®nition 165
gas-recombinant lead±acid batteries
271
gauge invariance 573
gauss 298
Gay-Lussac, Joseph Louis 223
Gay-Lussac's law 225
Geiger counter 516
Geiger, Hans 137, 138
Gell-Mann, Murray 8, 565±8
General Theory of Relativity 469±95,
503
generator 312±15
geocentric model 112
geodesic 479
Georgi, Howard 584
germanium 283
Germer, L H 507
Gilbert, William 247, 292
Glashow, Sheldon 21, 568, 577, 584
global gauge invariance 573
Global Positioning Satellite (GPS)
System 128±9
global SU(2) gauge symmetry 576
gluon 584
Goldstein, Eugen 151
Gottlieb, Rudolf 229
gradient-index lenses (GRINs) 389±90
grand uni®ed theory (GUT) 584±8,
592
gravitational ®eld 254, 470, 481, 485
gravitational force 44, 229, 251, 492,
556
gravitational mass 127, 472
gravitational potential difference 269
gravitational potential energy 69±73
gravitational red shift 487
gravitational time dilation 485±9
gravitino 586
graviton 586
gravity 52, 110, 472, 555±6, 585±6, 592
acceleration due to 40, 41
Green, Michael 7, 587
Gregory, James 397
Grimaldi, Francesco Maria 422
ground state 146, 430
group theory 565, 571
637
SUPERSTRINGS AND OTHER THINGS
guitar 357
hadron 561±2
Hahn, Otto 540
hair cells 349±51, 352
Hale telescope 400
half-life 537, 550
Halley, Edmond 53, 119
hammer 348
Harkings, William D 153
harmonic law 117, 118, 121
Harmonici Mundi 117
harmonics 332±3, 354, 356, 360±2
Hawking, Stephen 493
hearing loss 352
heart 189±92
heat 195±220
as form of energy 195±8
mechanical equivalent of 199
heat capacity 205±6
heat dissipated by electric current 281
heat energy 203
heat of fusion 207±9
heat of vaporization 207±9
heavy hydrogen 154
heavy water 154
Heisenberg, Werner 6±7, 151±3, 509,
511±14, 560, 576
Heisenberg's uncertainty principle
517±22, 528, 533, 559, 585, 589
heliocentric model 112, 114
helium atom 159
Helmholtz, Hermann von 446
heptatonic scale 355
herapathite crystals 426
Hertz, Heinrich 318, 411±12, 499
hidden symmetry 579
hidden variables 527
Higgs ®eld 579
Higgs mechanism 579, 580
Higgs, Peter 579
Hilbert, David 572
hole 283, 284
hologram 435±7
holography 435±7
Hooke, Robert 119, 442
Hooke's Law 73
horizontal canal 351
horizontal velocity 41±2
horsepower 85±7
Hubble, Edwin 237, 365
Hubble Space Telescope 237
Hubble's constant 237
Hubble's law 237
humidity 213±15
at saturation 215
Huyghens, Christiaan 93, 94, 95, 405,
407, 422
hydraulic lift 174±5
hydrogen, atomic properties 147
hydrogen atom 218±19, 420, 505, 506
hydrogen bomb 546
hydrogen nucleus 155
hyperopia 404
ice, open structure 219
ice cream 213
ice crystals 219
ideal black body 497
ideal gas 224
ideal gas law 221±5
image 374, 375
image distance 376
image reversal 375
impact theory of mass extinctions 9
impulse 88±91
impurities 267
incident ray 385
induction, Faraday's law of 309±11,
316, 406
inelastic collisions 94±5
inertial con®nement 546
inertial frames of reference 56, 463
inertial mass 127, 472
inertial reference frame 56, 443±5, 463
Infeld, Leopold 460
inferior canal 351
inferior vena cava 189
infrasonic wave 343
inkjet printer 265
inner ear 349
instantaneous speed 30±1
insulators 267
interaction energy 506
interference 328
interference pattern 329, 518±19
638
Index
Knoll, Max 523
Kopernigk, Mikolaj (Nicolaus
Copernicus) 114±18
K±T boundary layer 11, 12
intergalactic travel 458±60
internal energy 226, 227, 229
inverse-square law 294, 556
inverted image 397
ionic bond 148
ionizing radiation 549
ions 148
iridium 11±13
iris diaphragm 395
iron atoms 578
isochoric process 228
isothermal process 228
isotopes 154
Jefferson, Thomas 13
Joint Institute for Laboratory
Astrophysics 129
Joule, James 66, 67, 198, 199, 281
joule per kilogram per Kelvin 206
joule (unit) 66, 158, 205
Joule's law 281
Joyce, James 566, 568
Jupiter 407
Galilean satellites 396, 407
magnetosphere 305
Kaluza±Klein theory 586
Kaluza's theory 586
Keller, Mark W 264
Kelvin, Lord (William Thompson) 68,
201, 240
Kelvin temperature scale 201, 223
kelvin (unit) 201
Kepler, Johannes 396
Kepler's First Law 117, 483
Kepler's laws of planetary motion
114±18
Kepler's Second Law 116, 117
Kepler's Third Law (harmonic law)
117, 118, 121
kilogram 14, 17
kinetic energy 68±9, 74, 78±81, 95,
196, 201±2, 211, 216, 300, 467, 501
rotational 202, 205
transitional 202, 205
vibrational 205
Kirchhoff, Gustav 418±19
Klein, Oscar 83
labyrinth 349, 351
Land, Edwin P 426
Langevin, Paul 515
Large Magellanic Cloud 491
lasers 430±5
latent heat 209
latent heat of fusion 209
latent heat of vaporization 209
Lavoisier, Antoine 195
law of action and reaction 59±61
law of areas 116, 118
law of conservation of angular
momentum 107
law of conservation of baryons 563±4
law of conservation of energy 278
law of force 56±8
law of inertia 48±52
law of orbits 117
law of universal gravitation 98,
118±25, 556
historical perspective 110±14
Lawrence, Ernest 302
Lawrence, Wendy 127
laws of mechanics 43±61
laws of spectral analysis 419
lead±acid battery 271
leaning Tower of Pisa 50±1
Lebedev, P N 412
Lederman, Leon 569
Leibniz, Gottfried Wilhelm 66, 571
length 15, 16
length contraction 462±3
lens axis 385
lenses 384±90, 396, 399
see also speci®c types of lens
Leonardo da Vinci see Da Vinci,
Leonardo
lepton 561±2, 570, 585
lepton quantum numbers 564±5
Leucippus 134
lever arm 103
Le Verrier, Joseph 483
Leyden jar 263, 268
639
SUPERSTRINGS AND OTHER THINGS
Lie, Sophus 572
light 145, 318, 321, 370, 417
bending 479±83
nature of 405±37
speed of 382, 406±10
wave nature of 405
wave theory of 370, 424
light quanta 144
light rays 389
light waves 142, 370, 405
Lipperschey, Hans 395
liquid, de®nition 165
liquid-drop model of ®ssion 541
liquid helium 289
liquid nitrogen 290
liter 14
local SU(2) symmetry 576
lodestone 247, 292
London, Fritz 287
London, Heinz 287
longitudinal wave 322, 323, 343
Lorentz, Hendrik A 449
Lorentz±FitzGerald contraction 449,
462
Los Alamos National Laboratory 129
loudness 346, 354
Lucretius 247, 291
luminiferous ñther 446
lunar eclipse 371
M-theory 588
Mach 1 366
Mach 2 366
Mach, Ernst 366
macular degeneration 403
Magellanic Cloud 231
magnet
discovery 291±2
rotation 578
magnetic bottle 306
magnetic domains 308
magnetic ®eld 292±4, 297, 302, 303,
311, 312, 406, 578
moving charge in 298±301
magnetic force 44, 294
magnetic monopoles 293
magnetic navigation, avian 307
magnetic poles 294
magnetic potential 575
magnetic potential energy 72
magnetic storms 306
magnetism 247±8, 295±8
earth's 304±6
source of 307±9
magnetite 247
magneto-optical drives 295
magnetosphere 305, 306
magnetron 317
magni®cation 388
Maiman, T H 435
Mars 115
electrostatic forces 258
Mars Environmental Compatibility
Assessment (MECA) Electrometer
258±9
Marsden, Ernest 139
maser 433
mass 15±17, 19, 82, 104, 123, 127, 468,
579
energy of 81
mass defect 158
mass±energy equivalence 465±8
mass extinctions, impact theory 9
Mathematical Collection, The 114
mathematics and physics 18±21
Maximilian I, elector of Bavaria 6
Maxwell, James Clerk 248, 315±18,
405, 407, 411, 412, 445
Maxwell's equations 315±18, 371, 586
Maxwell's fourth equation 370, 406
Maxwell's third equation 370, 406
measurement 13±15
mechanical energy 68, 369
mechanical equivalent of heat 199
mechanical wave 142, 321
mechanics, laws of 23, 43±61
Meitner, Lise 540
Melvill, Thomas 417
meniscus 182
Mercury
magnetic ®eld 304
perihelion 483±5
merry-go-round 98±100
meson 561, 581, 582
Mesopotamis 111
metastable state 431
640
Index
meteorite 10±12
meter 14, 15, 17, 20
Michel, Helen 11
Michell, John 294
Michelson, Albert A 410, 446±9
Michelson interferometer 446±8
Michelson±Morley experiment 445±9,
462
microwave oven 317
middle ear 348
migratory birds 307
Milky Way Galaxy 98
mille passus 13
Millikan, Robert A 257, 260
Mills, Robert L 576, 580
mirrors, re¯ection 374±5
modes of vibration 331±2
molecular motion 6
molecules 5, 6, 147±50
collision 240
moment of inertia 104
momentum 88±91
conservation of 91±3
Moon 122±3, 371, 493
measuring the distance to 124±5
moons 396
motion 25±42
ancient ideas 44±5
of projectiles 40±2
projectiles 40±2
understanding 25
uniform 26±7
uniform linear 441
uniformly accelerated 35±7
motors 312±15
Mount Palomar 399, 400
MuÈller, K Alex 289±90
multimode ®bers 393
muon 457
muon neutrino 562
music 352±6
musical instruments 357±62
musical scales 354±5
Musschenbroeck, Pieter van 262±33
myopia 402
Mysterium Cosmographicum 115
NaCl molecule 150
Nagaoka, Hantaro 140
NASA 258, 594
nasopharynx 349
natural frequency 333
natural motion 45
NAVSTAR 2 488
near point 402
nearsightedness 402
Neeman, Yuval 565
negative electric charge 134, 249±50
negative ion 148
Nernst, Harmann Walther 233, 234
neutrino 84, 535, 562
neutron 8, 9, 83±4, 151±3, 155, 160,
533
neutron star 491
Newton, Isaac 31, 52, 108, 142, 397,
399, 413±17, 422, 425, 555±6
First Law 52, 122, 445
law of universal gravitation 98,
118±25, 556
historical perspective 110±14
laws of motion 43±61
Principia 53, 56, 125
Second Law 56±8, 121, 127
Third Law 59±61
newton second (unit) 91
Newtonian focus 399
Newtonian mechanics 483
Newton's cradle 94
newtons per square meter 168
Nishijima, Kazuhiko 565
Nobel Prize winners in physics
602±7
node 331, 359
noise 352
nonconservative force 75±7, 81
Northern Lights 306
NoÈther, Amalie Emmy 572
NoÈther's theorem 573
n-type semiconductor 283, 284
nuclear bomb 544
nuclear energy 540±6
nuclear energy level 161±3
nuclear energy level diagram 162
nuclear explosion 543
nuclear ®ssion 540
nuclear force 156, 157, 532, 533
641
SUPERSTRINGS AND OTHER THINGS
nuclear fusion 545±6
nuclear physics 532±52
applications 546±51
nuclear reactions 82, 538±40
nuclear reactor 544, 545
nucleon 153, 157, 161, 162
nucleon orbits 163
nucleus 8, 139, 151±63, 532
composition 153±5, 532
size and shape 159±61
nuclide 154
object 375
object distance 375
objective 396, 397
octave 354, 355
Oersted, Hans Christian 248, 295±7
Ohm, Georg Simon 274
ohm (unit) 275
Ohm's law 270±5, 281
oil drop experiment 260
Oldenburg, Henry 413, 414
Oldham, R D 340
Onnes, Kamerlingh 287
open structure 219
optical axis 376
optical ®ber 392±3
optical instruments 393±9
optical pumping 431
optics 370±404
orbital motion 125±9
orbiting clock 488
ordinary beam 425
organ of Corti 349, 350, 352
organ pipe 359±62
Orion Nebula 489
oscillation 332±3, 339, 364
otocondria 351
otoconia 351
otolithic membrane 351
oval window 348
overtone 356
oxygen atom 218±19
oxygen molecule 203
p-branes 588
pair annihilation 555, 556
pair production 467, 555, 556
Pais, Abraham 565
parallax 112
parallel connection 279±80
parallel plate capacitor 263
parallel rays 385
parallelogram method 33
partial constant angular velocity
(PCAV) 101
particle accelerator 9, 301±4, 585
particles 8, 553±70
particles of light 422, 502, 511
Pascal, Blaise 168, 173
Pascal's principle 173±4
Path®nder mission 258
Pauli exclusion principle 533
Pauli, Wolfgang 83±4, 535
Pemberton, Henry 119
Penrose, Roger 493
pentatonic scale 355
Peregrinus de Maricourt, Petrus 247,
291±2, 295
perihelion precession 483
period 325
permanent magnet 307
Philosophiae Naturalis Principia
Mathematica see Principia
photoelectric effect 499±503
photomultiplier tube 502, 503
photon 144, 146, 239, 420, 422, 431,
501, 503±5, 550, 558, 576, 577, 580
physical color 413
physics
and mathematics 18±21
and other sciences 9±13
concepts 21
overview 3±21
time-line 608±14
piano 358±9
piezoelectric crystals 369
piezoelectric effect 369
pi meson see pion
pinhole camera 393±4
pion 559±61, 583
pitch 354, 357, 363, 364
Planck, Max 140, 143±5, 257, 496, 498,
503
Planck's constant 499, 514, 521
Planck's formula 520
642
Index
plane mirror 376
plane wave 326
planetary motion, Kepler's laws of
114±18
p±n junction 285
p±n junction diode 284
p±n±p transistor 286
Podolsky, Boris 527
Poincare cycle time 241
PoincareÂ, Henri 241
Poincare recurrence 241
point-contact design 286
polarization 422, 425±30
Polaroid 427±9
polonium 136
polyvinyl chloride (PVC) 148
Popper, Karl R 8, 9
population inversion 431
positive electric charge 134, 155,
249±50
positive ion 148
positron 535, 555
positron-electron pair 555
Positron Emission Tomography (PET)
549
potential difference 262, 266, 269, 272
potential drop 278±9
potential energy 69±73, 78±80, 146,
208, 209, 226, 279
power 85±8
powers of ten 596±8
pressure 166±76, 185, 223
liquid 172±6
pressure cooker 212, 213
priciple of superposition 325±8
Priestley, Joseph 250±1
Prigogine, Ilya 241
prime focus 399, 400
Princeton Tokamak Fusion Test
Reactor 546
principal quantum number 146
principle of conservation of
mechanical energy 80±2
principle of conservation of
momentum 93
principle of equivalence 469±73
principle of relativity 452±3, 469
prism 391±2, 417
projectiles, motion of 40±2
Prokhorov, A M 433
proper length 462±3
proton 8, 9, 151±3, 155, 300, 466, 491,
533, 582
proton beams 552
Ptolemy, Claudius 112±13
p-type semiconductor 283, 284
Pythagoras 111, 342, 352±4
quality 354±6
quanta 141±5, 144, 498
quantity of motion 92
quantum chromodynamics (QCD)
582, 584
quantum electrodynamics (QED) 558,
582
quantum ®eld 554
theory 554, 559
quantum mechanics 147, 152, 161,
163, 509±31, 527
beginnings 511
Quantum Mechanics 553
quantum numbers 163
quantum physics 496
quantum teleportation 530±1
quark 9, 565±8, 581, 582, 585, 590, 591
quark quantum numbers 569
radiation 143
radiation damage 550
radiation exposure 550
radio waves 412
radioactive dating 548±9
radioactive decay 533, 549
radioactive isotopes 549±1
radioactive nucleus 156±7
radioactivity 136, 137, 532±7
biological effects 549±1
radium 136
radon 551
radon-222 550
rare earth elements 289
recombination 284±5
red giant 490
reference frames 56, 443±5, 457, 463,
485
reference level 69
643
SUPERSTRINGS AND OTHER THINGS
re¯ection 370±80
from mirrors 374±5
re¯ector 397
re¯ector telescope 398
refraction 370, 380±4
indices of 384
see also lenses
Reines, Frederick 535
relative humidity 215
relativistic mass 467±8
relativistic quantum ®eld theory 554
relativity see Galilean principle of
relativity; General Theory of
Relativity; Special Theory of
Relativity
resistance 273, 275, 281
resistor 277
combinations 278±80
resonance 333±7, 515
retina 401, 403
retinosis pigmentosa 403
reverse bias 285
reversibility paradox 240
revolution 98
Richter, Burton 569
right-hand rule 297, 298
rock 341
rockets 95±7
rods and cones 401, 403
Romagnosi, Gian Domenico 296
RoÈmer, Olaus 407
RoÈntgen, Wilhelm Conrad von 136
Rosen, Nathan 527
rotation 98
rotational inertia 104
rotational motion 98±102
rotational symmetry 579
round window 349
Rubbia, Carlo 581
Rumford, Count, (Benjamin
Thompson) 197±8
Ruska, Ernst 507, 523
Rutherford, Ernest 83, 137±40, 151,
153, 157, 538
saccule 351
Salam, Abdus 577, 580
Sandage, Alan 237
satellites 110, 126, 305, 488, 594
saturation 156
scalar quantities 31
scanning electron microscope (SEM)
524±5
scanning tunneling microscope (STM)
525±6
Schawlow, A L 433
Schrieffer, John R 288
SchroÈdinger, Erwin 3, 509, 515, 516
Schwarz, John 7, 587
Schwarzschild, Karl 492
Schwarzschild radius 492, 493
Schwinger, Julian 558
science
distinguishing feature 8
modern 45±8
scienti®c knowledge 9
scienti®c method 7±9
scienti®c theory 8
second law of thermodynamics
229±33
seismic waves 339±41
semicircular canals 351
semiconductors 267, 281±7
n-type 283, 284
p-type 283, 284
series connection 278
Shaw, Robert S 335
shells 163
Shelton, Ian 491
Shockley, William 286
shockwaves 365±7
short range force 156
shutter 395
SI units 14±17, 28, 66, 87, 121, 158,
168, 205, 206, 262, 281, 298
see also speci®c units
Sidereus Nuncius 396
silicon 283, 284
Simon, A W 421
simultaneity 460±1
single-mode ®ber 393
singularity 493
sino-auricular node 190
Snell's law 384
Snow C P 229
sodium chloride 164
644
Index
sodium ion 148
solar eclipse 371
solar ¯ares 306
solar spectrum 6
Solar System 98, 110
solids 164
sound 342±69
nature of 342±3
speed of 343±6, 366, 368
sound intensity 347
sound level 346, 347
sound waves 142, 321, 327, 354
intensity 346
sounding box 357
Southern Lights 306
Space Shuttle Discovery 127
Space, Time, and Gravitation 482
spacecraft 125±9
spacetime continuum 473±9
spacetime drag 494
Special Theory of Relativity 157,
441±68, 496, 502
speci®c heat capacity 206±7
spectra 417±21
spectral lines 7
specular re¯ection 373
speed
and direction 31
average 27±30
instantaneous 30±1
units of 28
spherical aberration 385, 389
spherical lens 389
spherical mirror 376
sphygmomanometer 190±2
spontaneous emission 430
spontaneous symmetry breaking 577,
580
spring coil (or ``Slinky'') 322±3
standard atmosphere 170
standing (or stationary) wave 330±3,
335, 506±9
Stanford Linear Accelerator
Laboratory (SLAC) 9
states of matter 164±5
stationary (or standing) wave 330±3,
335, 506±9
stationary orbits 145
Stella, Luigi 494
stellar aberration 408±9
stellar parallax 408
stimulated emission 432±3
stirrup 348
strange particles 565
Strassmann, Fritz 540
streamlines 184, 187
strong nuclear force 156, 557
structure of matter 133
Stuckey, William 118
subatomic particles 8, 83
sul®tes 149
sulfur dioxide 149
Sun 111±12, 115±18, 121, 409, 479, 487
superconductor 287±90
superforce 571±95
supergravity 586
superior canal 351
superior vena cava 189
supernova 12, 491
explosions 230, 231
superstrings 585±8
theory 7, 587
supersymmetry 585±8
surface tension 179±84, 339
symbols of elements 599±601
symmetry 571±3
global 573±5
local 573±5
symmetry-breaking 593
symmetry group O(3) 578
Syncom 2 110
systole 190
systolic pressure 191±2
Tacoma Narrows Bridge 334
tau neutrino 562
telephone tones 344±5
telescope 395±9
temperature 195±220, 223, 224, 228
temperature change 216±17
temperature measurement 199±201
terrela 292
Tertiary 10, 12
tesla 298
Thales of Miletus 111, 133
``The Two Cultures'' 229
645
SUPERSTRINGS AND OTHER THINGS
thermal energy 81, 203, 205±6, 226
thermal expansion 215±18
thermodynamics, laws of 221±43
thermography 204
thermometer 199
thermostat 217
thin lenses 386
third law of thermodynamics 233±5
Thompson, Benjamin, (Count
Rumford) 197±8
Thompson, William see Kelvin, Lord
Thomson, Sir Joseph John 135±6, 257
'T Hooft, Gerard 580, 581
thorium-234 550
thought experiment 453, 470
timbre 354
time 15, 16
and entropy 239±43
direction of 239±40
time dilation 453±60
gravitational 485±9
time-line, physics 608±14
Tomonaga, Sin-Itiro 558
tooth decay 275
torque 102±7
torr (unit) 170
Torricelli, Evangelista 168, 170
torsion balance 251±2
torsional wave 322
total energy 226
total internal energy 226
total internal re¯ection 390±2
total mechanical energy 80
tourmaline 429
Townes, Charles H 432±3
Townsend J S 257
transformation of energy 78±9
transistor 284, 286
translational motion 98
transmission electron microscope
(TEM) 523
transverse wave 322, 326, 425
tumor detection 504±5
twisting cats 106±7
Two New Sciences 26, 34±6, 168, 407,
441
tympanic annulus 348
tympanic canal 349
tympanic membrane 350
tympanum 348
ultrasonic waves 343
ultrasound 367±9
ultraviolet catastrophe 498
uniform linear motion 53, 441
uniformly accelerated motion 35±7
units 13
see also SI units and speci®c units
universal gravitation 52
Newton's law of 98, 118±25, 556
universe 110±14
creation 588±92
expansion 238
®rst moments 591±5
origin 235±9
uranium ®ssion reaction 542
uranium-235 541±4
uranium-238 550
Ursa Major 237
utricle 351
valence band 282, 284
valence electrons 272
Van Allen belt 306
Van Allen, James 305
van der Meer, Simon 581
vaporization 207±9
vector quantity 31, 89
vectors 31±4
addition 33
components 34
resultant 33
velocity 31, 185, 443
addition 464±8
Veneziano model 587
ventricle 189, 190
ventricular ®brillation 335
vertical motion 42
vestibular canal 349
vestibular membrane 350
vibration 331±2
View of Sir Isaac Newton's Philosophy, A
119
Viking Lander missions 258
violin 357
virtual electron±positron pairs 589
646
Index
virtual image 378
viscosity 5
vitreous body 401
Vitruvius 342
volcanic eruption 12
volt (unit) 262
Volta, Count Alessandro 262±3, 268
voltage 262
volume 223, 224
Wallis, John 93
Walton, E T S 302
water 45, 148, 211
contraction 218
expansion 218±20
water molecules 218±19
water vapor 213±15
water waves 230, 321, 337±9
Watt, James 85, 87
watt (unit) 87±8, 281
wave equation 516
wave mechanics 514±17
wave motion 141, 143, 321±41
wave nature of light 405
wave theory of light 370, 424
waveforms 356
wavelength 324±5, 329, 332±3, 364,
382, 413, 424
waves 141±5
in phase 329
nature of 321±3
overlapping 328
properties of 323±5
weak nuclear force 533, 557
weight 60, 123
Weinberg, Steven 579
Weindler, Peter 307
Weyl, Hermann 573
white dwarf 490
Wien, Wilhelm 152
Wilson H A 257
wind instrument 359
WinderoÈe, Rolf 302
winemaking 149
Wollaston, William 417
work 66, 74, 76, 88
concept 62±5
work-energy theorem 74, 185
work function 501, 502
Wu, Mau-Kuen 289
X particles 584±5
X-rays 550
Yang, C N 576, 580
Young, Thomas 66, 142, 405, 421±4
Young's modulus 422
Yukawa, Hideki 559±60
zeroth law of thermodynamics
225±6
Zweig, George 8, 566
647