Electromagnetism Geometry
Electromagnetism Geometry
电磁与几何
李思 (Si Li)
Tsinghua University
Thanks very much for your support of this book! It is greatly appreciated if you are willing
to help improve it by sending your comments such as typo, mistake or suggestion at your tea
time. You are welcome to submit your comment via either the website
https://www.wjx.cn/vm/eITIr9H.aspx
1
Contents
Preface 5
2
2.4.3 Dielectric Matter . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
2.5 Magnetic Field and Vector Potential . . . . . . . . . . . . . . . . . . . . . . . . . 50
2.5.1 Magnetic Field . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
2.5.2 Interface Condition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
2.6 Magnetic Moment and Magnetic Dipole . . . . . . . . . . . . . . . . . . . . . . . 56
2.6.1 Magnetic Moment . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
2.6.2 Magnetic Dipole Layers . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
2.7 Linking and Magnetic Helicity . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
2.8 Dirac Monopole . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
Chapter 3 Electrodynamics 66
3.1 Force and Energy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
3.1.1 Lorentz Force . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
3.1.2 Electromagnetic Energy . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
3.2 Electromagnetic Induction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
3.2.1 Electromotive Force and Flux Rule . . . . . . . . . . . . . . . . . . . . . . 71
3.2.2 Mutual Inductance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
3.2.3 Self Inductance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
3.2.4 Magnetostatic Energy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 76
3.3 Electromagnetic Wave . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
3.3.1 Wave Equation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
3.3.2 Plane Waves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
3.3.3 Polarization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79
3.3.4 Wave Packets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
3.4 Green’s Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
3.4.1 Wave Equation with Source . . . . . . . . . . . . . . . . . . . . . . . . . . 82
3.4.2 Green’s Function for the Wave Equation . . . . . . . . . . . . . . . . . . . 84
3.4.3 Retarded and Advanced Solutions . . . . . . . . . . . . . . . . . . . . . . 85
3.5 Dipole Radiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
3.5.1 Spherical Wave . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
3.5.2 Electric Dipole Radiation . . . . . . . . . . . . . . . . . . . . . . . . . . . 88
3.6 Moving Point Charge . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
3.7 Scattering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97
3
4.2.1 Vertical Vector Field . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112
4.2.2 Connection 1-form . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
4.2.3 Horizontal Vector . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114
4.2.4 Parallel Transport . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
4.3 Curvature and Chern Class . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118
4.3.1 Curvature 2-form . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118
4.3.2 Holonomy and Curvature . . . . . . . . . . . . . . . . . . . . . . . . . . . 120
4.3.3 Chern Class . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 121
4.4 Local Gauge and Transition . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
4.4.1 Local Gauge 1-form via Trivialization . . . . . . . . . . . . . . . . . . . . 123
4.4.2 Connection via Transition Functions . . . . . . . . . . . . . . . . . . . . . 126
4.5 Gauge Transformation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129
4.5.1 Gauge Transformation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129
4.5.2 Local v.s. Global Gauge Transformations . . . . . . . . . . . . . . . . . . 132
4.6 Maxwell Theory as U (1)-gauge Theory . . . . . . . . . . . . . . . . . . . . . . . . 134
4.6.1 Potential as Gauge 1-form . . . . . . . . . . . . . . . . . . . . . . . . . . . 134
4.6.2 Electromagnetic Field as Curvature . . . . . . . . . . . . . . . . . . . . . 134
4.6.3 Maxwell Action . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135
4.6.4 Gauge Principle and Charge Conservation . . . . . . . . . . . . . . . . . . 136
4.6.5 Magnetic Monopole . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136
4.7 Associated Bundle and Matter Field . . . . . . . . . . . . . . . . . . . . . . . . . 138
4.7.1 Associated Vector Bundle . . . . . . . . . . . . . . . . . . . . . . . . . . . 138
4.7.2 Hermitian Inner Product . . . . . . . . . . . . . . . . . . . . . . . . . . . 139
4.7.3 Covariant Derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140
4.7.4 Matter Wave Function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141
Bibliography 157
4
Preface
In April 2021, Qiuzhen College (求真书院) was newly established at Tsinghua University
under the leadership of Professor Shing-Tung Yau. It homes the distinguished elite mathematics
program in China starting in 2021: the “Yau Mathematical Sciences Leaders Program” (丘成桐
数学科学领军人才培养计划). This program puts strong emphasis on basic sciences related to
mathematics in a broad sense. Though majored in mathematics, students in this program are re-
quired to study fundamental theoretical physics such as classical mechanics, electromagnetism,
quantum mechanics, and statistical mechanics, in order to understand global perspectives of
theoretical sciences. It is an exciting challenge both for students and for instructors.
This preliminary note is written for the course “Electrodynamics” that I lectured at Qi-
uzhen College in the spring semester of 2023. The lecture note consists of two parts. The first
part is to explain key physics ingredients of electromagnetism, such as Maxwell’s equations,
electrostatics, magnetostatics, electromagnetic waves, radiation, scattering, etc. The second
part is of geometric nature, which explains Maxwell theory as U(1)-gauge theory in terms of
fiber bundle theory, as well as its consistency with special relativity. We emphasize on different
faces of concrete examples in order to understand the bridge between physics and mathematics.
This note is in succession to [10] in this series, and assumes basic knowledge on differential
forms and vector fields. Readers can consult [10] for preliminary geometric backgrounds.
I greatly appreciate the “Notes Taker Program” at Qiuzhen College, which has triggered
and supported the production of this note. A preliminary version of this note was carefully
typed by Yang Peng (杨鹏) all the way along the course, and I am extremely grateful to his
great job for Notes Taker. I would also like to thank Zhou Jiawei (周嘉伟)、Dingxu Zhihan (丁
徐祉晗)、Liu Jiuhe (刘九和) and Quan Hanwen (权瀚文) for their help on proofreading of the
first version of this note.
@ Jingzhai (静斋)
June, 2023
5
This is a drawing by my daughter expressing herself with her brother. I found it interest-
ing as it illustrates precisely the essence of electromagnetism on the coupling of electric
and magnetic fields as well as the topological nature of Maxwell’s equations.
6
Chapter 1 Introduction: Maxwell’s
Equations
In the early 1860’s, James Clerk Maxwell took the work of Faraday and many others, and
summarized into four equations that linked the electric field with the magnetic field. Maxwell’s
equations are nowadays accepted as the basis of all modern theories of electromagnetism.
∇·E ~ = ρ/ε0
∇ · B
~ =0
∇×E
⃗
~ = − ∂B
∂t
∇ × B ⃗
~ = µ0 ~j + ε0 ∂ E
∂t
These equations predicted electromagnetic waves travelling at the observed speed of light. This
leads to Maxwell’s speculation that lights are electromagnetic waves, and suddenly brings light,
electricity and magnetism into the same fundamental phenomenon.
Maxwell’s equations in modern geometry take the concise form
dF = 0
d∗ F = J
Here F is a 2-form on spacetime that collects both electric and magnetic fields. J is a 1-form
that represents the electric charge-current. d is the de Rham differential, and d∗ is its adjoint.
This immediately reveals many deep geometric and topological natures of Maxwell theory.
In this chapter, we will review and explain the precise meaning of these equations, preparing
for the journey toward the study of physics and geometry of electromagnetism. The beauty of
this set of equations lies in its the simplicity and nontriviality. For example, it encodes all
together the following fundamental electromagnetic phenomenons
These will be discussed and explored in detail in Chapter 2 and Chapter 3. Geometric and
topological aspects of Maxwell’s equations will be explained in Chapter 4 and Chapter 5.
Our flavor will be geometric, and assume basic knowledge on differential forms. Readers
can consult [10, Chapter 3] in this series for preliminary geometric backgrounds.
7
1.1 Hodge Star
Hodge Star on R3
Let us consider the geometry of R3 with space coordinate (x, y, z). Denote by
∂ ∂ ∂
d3 = dx + dy + dz
∂x ∂y ∂z
for the de Rham differential on R3 for space variables. The Euclidean metric ds2 = dx2 + dy 2 +
dz 2 defines a Hodge star operator on differential forms, which we denote by
∗
1 7−→
3
dx ∧ dy ∧ dz
∗
dx 7−→
3
dy ∧ dz
∗
dy 7−→
3
dz ∧ dx
∗3
dz 7−→ dx ∧ dy
∗3
dx ∧ dy 7−→ dz
∗3
dy ∧ dz 7−→ dx
∗
dz ∧ dx 7−→
3
dy
∗
dx ∧ dy ∧ dz 7−→
3
1
Then ∗3 is defined on all Ω• (R3 ) by C ∞ (R3 )-linear extension over the above basis. For example,
where f, g ∈ C ∞ (R3 ) are functions on R3 . The above ∗3 has the property that
∇2 can be extended to Ωp (R3 ) component-wise with respect to the basis as above. For example,
8
Proposition 1.1.2. d3 d∗3 + d∗3 d3 is related to the Laplacian operator ∇2 by
Integral in R3
Differential p-forms can be integrated on p-dim oriented spaces. Besides, we will also use
the following notations for various integrals in R3 in this note:
~ on a 2-dim surface S ⊂ R3 :
• Surface integral of a vector A
ˆ
d~σ · A
~
S
d~σ · A
~ = dσx Ax + dσy Ay + dσz Az = dydzAx + dzdxAy + dxdyAz .
~ on a 1-dim curve C ⊂ R3 :
• Line integral of a vector A
ˆ
d~r · A
~
C
d~r · A
~ = dxAx + dyAy + dzAz .
´
• We will sometimes write Y f for the integral of a function f over some space Y ⊂ R3 ,
without explicitly expressing the measure. The measure is understood as the canonically
induced one from R3 . When Y is 3-dim, this is the volume integral; when Y is 2-dim, this
is the area integral; when Y is 1-dim, this is the curve integral.
9
Hodge Star on R3,1
Consider now the Minkowski space R3,1 with space-time coordinate (x, y, z, t). We will
simply denote by
∂ ∂ ∂ ∂
d = dx + dy + dz + dt
∂x ∂y ∂z ∂t
for the de Rham differential on R3,1 . The Minkowski metric ds2 = c2 dt2 − dx2 − dy 2 − dz 2
induces a Hodge star operator
Remark 1.1.5. The sign in defining d∗ via ∗ on R3,1 is different from that on R3 , due to dimension
and signature reason.
dd∗ + d∗ d = −□,
10
where □ is the d’Alembert operator
1 ∂2 ∂2 ∂2 ∂2 1 ∂2
□= − − − = − ∇2 .
c2 ∂t2 ∂x2 ∂y 2 ∂z 2 c2 ∂t2
Here □ is defined on Ωp (R3 ) component-wise with respect to the basis as above, similar to ∇2 .
Proof: Exercise.
Remark 1.1.7. Here is a useful formula relating geometric operators in R3 and in R3,1 . Let α
be a p-form in R3,1 containing only form indices dx, dy, dz, and |α| = p is the form degree of α.
Then
∗α = cdt ∧ ∗3 α
∗(α ∧ cdt) = ∗3 α
d∗ α = −d∗3 α
d∗ (α ∧ cdt) = −d∗ α ∧ cdt − (−1)|α| 1 ∂t α
3 c
This set of equations completely describes the dynamics and interactions of electromagnetic
fields. Here
These quantities in general depend on the space (x, y, z) and time t. The other constants in
Maxwell’s equations are: ε0 the permittivity of free space and µ0 the permeability of free space.
They are related to the speed of light c by the following equation
1
c= √ .
ε0 µ 0
~ and
A particle of charge q moving with velocity ~v in the background of electric field E
~ experiences force
magnetic field B
~ + ~v × B).
F~ = q(E ~
11
This is the Lorentz force law. We can talk about force density f~ per volume. Then the Lorentz
force law becomes
~ + ~j × B.
f = ρE ~
Maxwell’s equations, together with the Lorentz force law, form the foundation of classical elec-
tromagnetism.
Maxwell’s equations have a compact form in geometric terms. We collect electric and
magnetic field into a 2-form F on the space-time R3,1 as
∗:E
~ 7−→ −cB ~
~ 7−→ 1 E
∗:B ~
c
It is computed
1
∗d ∗ F = (∂t Ex dx + ∂t Ey dy + ∂t Ez dz) + (∂x Ex + ∂y Ey + ∂z Ez )dt
c2
− (∂x By − ∂y Bx )dz − (∂y Bz − ∂z By )dx − (∂z Bx − ∂x Bz )dy.
Here d∗ = ∗d∗ is the adjoint of d. We have arrived at the geometric form of Maxwell’s equations
dF = 0
d∗ F = J
12
1.3 Conservation Law of Charge
Electric Charge
The words “electric” and “electricity” come from the Greek word for “amber”. Electric
charge is an intrinsic property of matter, and all take value in an integral1 multiple of the
elementary charge
e = 1.602176634 × 10−19 C.
A single proton carries electric charge e and a single electron carries electric charge −e.
Since e is practically small, it is natural to consider continuous objects and define “charge
density” ρ(~r, t) per unit volume. Here
~r = (x, y, z)
qα
There is a similar treatment for line charges (using δ-function supported on the line)
1
Here we do not consider quarks or quasi-particles.
13
Electric Current
The movement of electric charges constitutes the “electric current”. Such quantity is cap-
tured by a space-vector ~j called current density. In general ~j = ~j(~r, t) depends on the position
and may change with time. For any surface S, the surface integral
ˆ
I= d~σ · ~j
S
counts the charge per unit time passing through S.
~j
For example, the current density of a point particle of charge q moving at position ~rα (t) is
Electric charge is conserved in physical processes. Consider the total charge Q contained
in some fixed region V ˆ
Q= d3 r ρ.
V
Its change with time is given by ˆ
dQ ∂ρ
= d3 r .
dt V ∂t
On the other hand, we can compute its change by counting the flow out through its boundary
S = ∂V ˆ ˆ
dQ Gauss s′
=− d~σ · ~j ======= − d3 r ∇ · ~j.
dt S Theorem V
~j
V
14
By comparing the two expressions, we find
ˆ
∂ρ
3
d r + ∇ · j = 0.
~
V ∂t
This holds for any region V , leading to the following local form of conservation law of charge
∂ρ
+ ∇ · ~j = 0
∂t
This equation is also a consistency equation for Maxwell’s equations. To see this, recall
d∗ F = J
where
1 1
J= ρdt − (jx dx + jy dy + jz dz) .
ε0 c2
The equation d∗ F = J is equivalent to
d(∗F ) = ∗J.
d(∗J) = 0.
Explicitly, we have
1
∗J = (ρdx ∧ dy ∧ dz − jx dt ∧ dy ∧ dz − jy dt ∧ dz ∧ dx − jz dt ∧ dx ∧ dy),
cε0
1 ∂ρ
d(∗J) = + ∇ · j dt ∧ dx ∧ dy ∧ dz.
~
cε0 ∂t
Therefore
∂ρ
d(∗J) = 0 ⇐⇒ + ∇ · ~j = 0.
∂t
The conservation law can be also understood via Stokes’ Theorem as follows. Consider the
following region
M = V × [t1 , t2 ] in the spacetime R3,1 .
t t2
V t1
15
The boundary ∂M of M consists of three pieces
Taking the infinitesimal form t2 → t1 (or simply taking the derivative of t2 ), we find
ˆ ˆ
∂ρ
d3 r + d~σ · ~j = 0.
V ∂t S
In Section 4.6.4, we will discuss another interpretation of charge conservation from the
point of view of gauge principle.
~r − ~r ′
q′ q
1 X ′ ~r − ~rα′
F~ = qqα .
4πε0 α |~r − ~rα′ |3
This formula naturally generalizes to the case with electric charge distribution ρ
ˆ
1 ~r − ~r ′
F~ = q d3 r′ ρ(~r ′ ) .
4πε0 |~r − ~r ′ |3
We can recast this formula into the form
F~ = q E
~
16
Here now comes the important idea and concept of field. The Coulomb’s law in the form
F~ = q E
~ is a conceptual shift of picture on the nature of force. A particle experiences a force
determined by the local value of the field at the position of the particle. This is different
from the non-local force over large distance. In particular, fields are physical, and can exist
independently of the presence of charged particles.
Now we look into the expression
ˆ ˆ
1 ′ ~ r − ~r ′ 1 1
~
E(~r ) = 3 ′
d r ρ(~r ) ′ 3 =− d3 r′ ρ(~r ′ )∇ .
4πε0 |~r − ~r | 4πε0 |~r − ~r ′ |
Here ∇ = ∂x∂ ∂
, ∂y ∂
, ∂z is the gradient operator with respect to the position ~r. Observe that
|⃗
1
r−⃗r ′|
is the Green’s function on R3 , or precisely
1
∇2 = −4πδ(~r − ~r ′ ).
|~r − ~r ′ |
Here ∇2 = ∂x2 + ∂y2 + ∂z2 is the Laplacian operator on R3 . It follows that
ˆ ˆ
~ r) = − 1
∇ · E(~ d3 r′ ρ(~r ′ )∇2
1
=
1
d3 r′ ρ(~r ′ )δ(~r − ~r ′ ) = ρ(~r )/ε0 ,
4πε0 |~r − ~r ′ | ε0
i.e.
∇·E
~ = ρ/ε0 .
We have found that Coulomb’s law gives one of the four Maxwell’s equations.
Gauss’s Law
The equation ∇ · E
~ = ρ/ε0 has the interpretation that the electron field is sourced by the
electric charge. To illustrate this, consider some region V with boundary ∂V = S. We consider
the electric flux through S defined by the surface integral
ˆ
d~σ · E.
~
S
~
E
V S
17
V
N S
N S S′
Geometrically, we have
d(∗F ) = ∗J.
d(∗F ) = 0.
18
In 1800, Alessandro Volta invented the Voltaic pile which produces a steady electric cur-
rent. This can be viewed as an early electric battery, and enabled a rapid many discoveries
in chemistry and electromagnetism. The question of a possible interaction between electricity
and magnetism had arisen soon after the invention of Volta’s pile. In 1820, Hans Christian
Ørsted discovered that a compass needle was deflated from magnetic north by a nearby electric
current, confirming the first connection between electricity and magnetism. This phenomenon
can be summarized as
current j =⇒ ~
magnetic field B
Soon after Ørsted’s discovery, André-Marie Ampère found that two parallel wires carrying
electric currents attract or repel each other. Such mutual action was formulated in mathematics
and led to the important principal: Ampère’s law. In 1827, Ampère published his book Memoir
on the Mathematical Theory of Electrodynamic Phenomena, Uniquely Deduced from Experience,
which coined the name “electrodynamics”.
Let us first consider steady solution of Maxwell’s equation, i.e. all things are independent
of time t. The equations governing the magnetic fields in the steady case are
∇ × B~ = µ0 ~j Ampère’s law
∇ · B
~ =0 Gauss’s law
∂ρ
In the steady situation, ∂t = 0 and hence the charge conservation law becomes
∇ · ~j = 0.
We will next give a geometric description of the above two equations and their solutions.
19
~ = (Bx , By , Bz ) and ~j = (jx , jy , jz ) as 1-forms
~ = (Ex , Ey , Ez ), B
Let us recollect the vectors E
on R3
E = Ex dx + Ey dy + Ez dz
B = Bx dx + By dy + Bz dz
j = jx dx + jy dy + jz dz
~ = −d∗ B.
∇·B 3
Here recall d3 deontes the de Rham differential on R3 . The consistency condition d23 B = 0 asks
for d3 ∗3 j = 0, or equivalently ∇ · ~j = 0. This is the steady charge conservation.
From d∗3 B = 0 or equivalently d3 ∗3 B = 0, we can write ∗3 B = d3 A for a 1-form
A = Ax dx + Ay dy + Az dz.
~
~ = ∇ × A.
B
The vector potential is not uniquely specified. Indeed, for any function χ, the shift
A 7−→ A + d3 χ = A′
also satisfies
d3 A′ = d3 (A + d3 χ) = d3 A = ∗3 B
since d23 = 0. Such a change of A is called a gauge transformation, which we will explain in
detail in later part of the note.
In turns out that there is a condition to fix this freedom of choice, by asking A to satisfy
the “Coulomb gauge” condition
Let us now assume A satisfies the Coulomb gauge condition. Let us substitute A into
d3 (∗3 d3 A) = µ0 ∗3 j.
20
Coulomb gauge condition implies
d3 d∗3 A = 0.
Combining the above two and using d3 d∗3 + d∗3 d3 = −∇2 , we find
∇2 A = −µ0 j.
i.e. ˆ
~ r ) = µ0 ~r − ~r ′
B(~ d3 r′ ~j(~r ′ ) × .
4π |~r − ~r ′ |3
This formula is known as Biot-Savart Law.
C1 C2 C1 C2
n(C1 , C2 ) = 0 n(C1 , C2 ) = ±1
One way to define the linking number in the current situation is as follows. We first fill C1
by a disk D1 whose boundary is precisely ∂D1 = C1 . Then the linking number
n(C1 , C2 ) = #(D1 ∩ C2 )
21
C2 n(C1 , C2 ) = ±1
C1
You can similarly fill C2 first by a disk and intersect with C1 : the answer is the same.
There is an integral expression for the linking number due to Gauss, which is closely related
to ingredients of electromagnetism. Suppose C2 carries a steady current I. It creates a magnetic
~ in space, which can be computed via Biot-Savart formula
field B
˛
~ r1 ) = µ0 I ~r1 − ~r2
B(~ d~r2 × .
4π C2 |~r1 − ~r2 |3
D1 I
C1 C2
22
Precisely, this is described by the third Maxwell’s equation
~
∂B
∇×E
~ =− .
∂t
In terms of differential forms as described in Section 1.6, this is written as
∂B
∗3 d3 E = −
∂t
or equivalently
∂
d3 E = − (∗3 B).
∂t
Let us consider a surface S which is bounded by a closed curve C.
~
B
S C
The integral of the magnetic field over the surface S is called the “magnetic flux” through S
ˆ ˆ
Φ= d~σ · B =
~ ∗3 B.
S S
i.e. ˆ ˆ
d
− d~σ · B =
~ d~r · E.
~
dt S C
This is the form when Faraday discovered the law of induction: changing the magnetic flux
through S will produce a current to flow along C.
However, in the dynamical case when all things depend on time, this is not enough. Faraday’s
law says that the change of magnetic field will induce electric field. Parallelly, it is natural to
ask whether the change of electric field will induce magnetic field. This is Maxwell’s addition
to Ampère’s Law, usually called Ampère-Maxwell Law
!
~
~ = µ0 ~j + ε0 ∂ E
∇×B ,
∂t
23
1
or using the relation µ0 ε0 = c2
,
~
1 ∂E
∇×B
~ = µ0~j +
2
.
c ∂t
The extra term is called “displacement current”, though it is not really a current but
an addition to the current. In terms of differential forms, Ampère-Maxwell Law is
1 ∂
d3 B = µ0 ∗3 j + ∗3 E.
c2 ∂t
The displacement current is in fact necessary for the consistency with charge conservation.
In fact, using (d3 )2 = 0
1 ∂ 1 ∂
0 = d3 d3 B = µ0 d3 ∗3 j + 2 d3 ∗3 E = µ0 ∇ · j + 2 ∇ · E dx ∧ dy ∧ dz,
~ ~
c ∂t c ∂t
24
In general when spacetime has nontrivial topology, such A can only be obtained locally and
glued via transformation law. We will discuss this situation in more detail in Section 4.6.
Let us write the 1-form A as
A = −φdt + A
E ∧ dt + ∗3 B = −d3 φ ∧ dt + dt ∧ ∂t A + d3 A.
~ = (Ax , Ay , Az ), this is
In vector notation, let A
E~ = −∇φ − ∂t A~
B~ =∇×A ~
A 7−→ A + dχ
for a function χ = χ(~r, t) on the spacetime. The electric-magnetic field remains the same
These are gauge transformations, and A is called the gauge field. We will study gauge
theory systematically in Chapter 4.
In components, the gauge transformation is
∂χ
φ 7−→ φ −
∂t
A~ 7−→ A
~ + ∇χ
∗d ∗ dA = J.
To solve this equation, we can choose a gauge condition to fix the gauge degree of freedom.
There are usually two commonly used gauge fixing condition:
~ =0
Coulomb gauge : ∇ · A
~ + 1 ∂φ
Lorenz gauge : ∇·A =0
c2 ∂t
We have seen Coulomb gauge in Section 1.6 on the Biot-Savart Law in magnetostatics. Here
we briefly comment on Lorenz gauge which we will discuss later in details.
25
The Lorenz gauge can be written as
Lorenz gauge : d ∗ A = 0.
So
□A = −(dd∗ + d∗ d)A = −J,
or in components,
1 ∂2
∇2 φ − 2 2 φ = −ρ/ε0
c ∂t
2
∇2 A~ − 1 ∂ A ~ = −µ0~j
c2 ∂t2
These are inhomogeneous wave equations that we will study in detail in Section 3.4.
In a region without charge and current, such as vacuum, Maxwell’s equations reduce to
dF = 0
d∗ F = 0
□F = −(dd∗ + d∗ d)F = 0.
In components, we have
~
1 ∂2E
2 2 − ∇2 E ~ =0
c ∂t
2~
1 ∂ B − ∇2 B~ =0
c2 ∂t2
which are the standard form of wave equations traveling at the speed of light c. This led
Maxwell to propose that light and radio waves were propagating electro-magnetic waves. In
1887, Heinrich Hertz demonstrated Maxwell’s electromagnetic waves propagating at the same
speed as light. This placed Maxwell’s theory on a firm foundation.
All electromagnetic waves travel at a fixed speed c in the vaccum, independent of any frame
of reference. This looks controversial comparing to classical Newton mechanics. It was studied
by Hendrik Lorentz who was able to derive the Lorentz transformations that preserve the form
of Maxwell’s equations. This subsequently laid the foundation for Einstein’s special relativity.
We will study this in Chapter 5.
26
Chapter 2 Static Electromagnetism
~ = ~j = 0, ∂ρ
B = 0.
∂t
The only relevant Maxwell’s equations are
∇ · E
~ = ρ/ε0
∇ × E~ =0
~ is stationary: ⃗
∂E
where the electric field E ∂t = 0.
∇2 φ = −ρ/ε0 .
∇2 φ = 0.
27
When we consider electrostatics problem in the space without boundary, say R3 , we can
solve φ (with appropriate decay condition at infinity) in terms of the Coulomb integral
ˆ
1 ρ(~r ′ )
φ(~r ) = d3 r′ .
4πε0 |~r − ~r ′ |
S
V
The study of this problem for Poisson’s and Laplace’s equation is the subject of “Potential
theory”. Let us first look at some examples.
28
2.1.3 Uniform Ball
Consider a ball BR of radius R centered at origin, with uniform charge ρ0 per unit volume.
The charge density function in R3 is
ρ = ρ0 χBR .
ρ0 R
~ = −∇φ = −∂r φ ~r .
E
r
~ without explicitly evaluating the above integral. In fact, consider
This is enough to solve φ, E
the following surface integral over the sphere Sr of radius r centered at the origin
ˆ ˆ ˆ
4πR3
d~σ · ∇φ = d r∇ φ = −
3 2
d3 r ρ0 /ε0 = −ρ0 /ε0 Vol(BR ) = − ρ0 /ε0 = −Q/ε0 ,
Sr BR BR 3
where Q = ρ0 4πR3 /3 is the total charge of the ball. On the other hand, using φ = φ(r),
ˆ
d~σ · ∇φ = ∂r φ Area(Sr ) = 4πr2 ∂r φ.
Sr
~ = −∇φ = Q ~r
E (r > R)
4πε0 r3
which takes the same form as that of a point charge Q.
Inside BR : |~r | < R.
29
We follow the same strategy via spherical symmetry: φ = φ(r). Then
ˆ ˆ
r3
d~σ · ∇φ = d3 r ∇2 φ = −ρ0 /ε0 Vol(Br ) = − 3 Q/ε0 ,
Sr Br R
and ˆ
~ · ∇φ = 4πr2 ∂r φ.
dS
Sr
~ = −∇φ = Q ~r
E (r < R)
4πε0 R3
which grows linearly inside the ball. We can also describe a continuous solution of φ by
Q 3 r2
− r<R
4πε0 2R 2R3
φ(r) =
Q 1 r>R
4πε0 r
φ E
R r R r
We consider a line segment of length 2L with uniform charge λ per unit length, placed
along the z-axis and centered at the origin.
z
y
L
−L
30
It would be convenient to work with cylindrical coordinates
x = ρ cos θ
y = ρ sin θ
z=z
Due to cylindrical symmetry, the scalar potential is a function of (ρ, z) only and given by
ˆ L "p #
λ 1 λ (z − L)2 + ρ2 − (z − L)
φ(ρ, z) = dz ′ p = ln p .
4πε0 −L ρ2 + (z − z ′ )2 4πε0 (z + L)2 + ρ2 − (z + L)
which are the distances from the two segment endpoints to the observation point. They are
related by
2
r+ − r−
2
= 4zL.
z=L r−
0
r+
z = −L
Then
λ r− − (z − L)
φ= ln .
4πε0 r+ − (z + L)
Observe that
r2 − r−
2
(r− + r+ )(r− − r+ ) 1 1
r− −(z−L) = r− +L− + = +r− +L = (r− + r+ ) + L (r− − r+ ) + 1 .
4L 4L 2 2L
Similarly,
1 1
r+ − (z + L) = (r− + r+ ) − L (r− − r+ ) + 1 .
2 2L
It follows that
λ r− + r+ + 2L
φ= ln
4πε0 r− + r+ − 2L
is a function of r− + r+ only. This allows us to draw the surface of equipotential, or equivalently
r− + r+ = constant
31
~
E
φ = const
Here Q = 2λL is the total charge. So when we observe very far away the segment, it looks like
a point charge as expected.
⃝
2 |z| << L, ρ << L.
r+ + r− 1 p p
= (L − z)2 + ρ2 + (L + z)2 + ρ2
2L 2L
1 p p
= (1 − z/L)2 + (ρ/L)2 + (1 + z/L)2 + (ρ/L)2
2
√
Using 1 + x = 1 + 12 x − 18 x2 + O(x3 ), the above is equal to
1 1 1
= 1 + (−2z/L + z 2 /L2 + ρ2 /L2 ) − (−2z/L)2
2 2 8
!
1 1 −3
+ 1 + (2z/L + z /L + ρ /L ) − (−2z/L) + O(L )
2 2 2 2 2
2 8
1
=1 + (ρ/L)2 + O(L−3 ).
2
Therefore
! !
λ
r+ +r−
2L +1 λ 2 + 21 (ρ/L)2 + O(L−3 )
φ= ln r+ +r− = ln 1 −3
4πε0 2L −1 4πε0 2
2 (ρ/L) + O(L )
λ λ
' ln 4 − ln(ρ/L)2
4πε0 4πε0
λ λ
=− ln ρ + ln(2L).
2πε0 2πε0
The second term is a constant, which does not contribute to the electric field. We find
~ = −∇φ ' λ ρ ~
E .
2πε0 ρ2
32
Here the vector ρ
~ has length ρ and points to the radial direction in the xy-plane. Equivalently,
we can view E~ = λ ρ⃗2 as the electric field produced by the infinite line (L → +∞) with
2πε0 ρ
uniform charge per unit length λ.
z-axis
λ
~ = λ ρ ⃗
E 2πε0 ρ2
We consider the electric field produced by surface charge. We first consider the case of
an infinite plane located at z = 0, carrying uniform charge ξ per unit area. By translation
symmetry, we know that the electric field points along the direction of z-axis, only depend on
z, and with opposite directions above and below the plane:
~ = (0, 0, E = E(z)),
E E(−z) = −E(z).
~
E
z=0
33
~
E
S+ z
Y
S0
S− −z
This leads to ˆ ˆ ˆ
d~σ · E
~ − ~ = 1
d~σ · E ξ.
S+ S− ε0 S0
So
ξ ξ
E(z)A − E(−z)A = A =⇒ E(z) = for z > 0.
ε0 2ε0
Equivalently, we can write
ξ z
E(z) = .
2ε0 |z|
Up to a constant, the scalar potential is
ξ
φ = φ(z) = − |z|.
2ε0
~ is NOT continuous across the
There is an important observation here: the electric field E
charge plane
ξ
E(z → 0+ ) − E(z → 0− ) = .
ε0
On the other hand, the scalar potential φ is continuous, though its derivative is not. (φ can
also jump in general, such as in Dipole Layer.)
We discuss some generalization of infinite plane charge. Consider a pair of infinite planes
P+ , P− placed at
P± : z = ±l.
34
~ =0
E
−ξ
z=l P+
~
E
+ξ
z = −l P−
~ =0
E
By the result from the infinite plane and the superposition law, we find
0 |z| > l
~ =
E
(0, 0, ξ/ε ) |z| < l
0
As another example, we can consider a bit more realistic model of the infinite plane by an
infinite slab of thickness 2l with charge density per unit volume ρ, placed at
|z| ≤ l.
z=l
z = −l
We can think about the effect of each slice of infinite plane and use superposition to sum/inte-
~ = (0, 0, E(z)). When |z| > L, the total effect is the same as an infinite
grate them up. Let E
plane charge with ξ = 2lρ. Therefore
ρl
z>l
ε0
E(z) = .
ρl
− z < −l
ε0
Now we consider a point inside the slab, say at z = a where
0 < a < l.
The effect of the slab between a ≤ z ≤ l will cancel that between −l ≤ z ≤ −a.
35
z=l
z=a
z = −a
z = −l
Consider a spherical shell of radius R with uniform surface charge ξ per area, centered at
the origin.
~
E
By spherical symmetry,
~ r ) = A(r) ~r
E(~
36
where Q = 4πR2 ξ is the total charge on the shell. Since
ˆ
Q
d~σ · E
~ = A(r)r Area(Sr ) = A(r) 4πr3 =⇒ A(r) = ,
Sr 4πr3 ε0
i.e.
~ = Q ~r .
E
4πε0 r3
So effectively it feels the same as a point charge Q.
⃝
2 r < R. Again by Gauss’s law and spherical symmetry,
ˆ
d~σ · E
~ = 0 =⇒ A(r) = 0.
Sr
Now we discuss the relation between the surface charge distribution and the discontinuity
of electric field. Consider a surface S with charge density ξ per area. Let n̂ denote the unit
normal vector on the surface. Let E ~ + denote the limit of the electric field along the n̂ side of
the surface, and E ~ − denote the limit on the other side.
~+
E
n̂
~−
E
Then the interface conditions for electric field along the surface charge distribution is
n̂ · (E
~+ −E
~ − ) = ξ/ε0 jump along normal direction
n̂ × (E~ −E~ )=0 continuous along tangent direction
+ −
We have seen several examples of such phenomenon above. To see how such interface conditions
arise, let us first consider a thin cylinder Y across the surface as shown.
~
E
S+
S0
S−
~
E
37
Apply Gauss’s law ˆ ˆ
~ = 1
d~σ · E ξ.
∂Y ε0 S0
When the cylinder is infinite thin, this leads to
ˆ ˆ
1
(n̂ · E
~ + − n̂ · E
~ −) = ξ.
S0 ε0 S0
L
δ
C
R
S
Since ∇ × E
~ = 0, the loop integral
˛ ˆ
d~r · E =
~ d~σ · (∇ × E)
~ = 0.
C R
n̂ × (E
~+ −E
~ − ) = 0.
There are similar interface conditions for magnetic fields across a surface with current. See
Section 2.5.2.
A conductor is a material that contains many free electrons that can move in the mate-
rial but not leave its surface. In “electrostatic” situation, the electrons will move to arrange
themselves to produce zero electric field inside the conductor.
Conductor
ξ
~ =0
E
~ = 0 inside.
E
38
On the other hand, its surface S could have a nontrivial charge density ξ per area due to outside
situation. For example, we can put a conductor inside two parallel plane charge as shown.
+ −
+ − + −
− +
+ − + −
+ −
n̂
Conductor
subject to certain boundary condition in a region V under consideration. There are usually two
types of boundary conditions on S = ∂V .
39
⃝
1 Dirichlet boundary condition: specify the value of φ on the boundary
φ|S = f.
f
S
⃝
2 Neumann boundary condition: specify the value of the normal derivative on the boundary
∂n̂ φ|S = g.
n̂
2.3.1 Uniqueness
∇2 φ = −ρ/ε0
and either γ|S = 0 (Dirichlet case) or ∂n̂ γ|S = 0 (Neumann case). Consider the volume integral
(we omit the volume measure d3 r on V and also the surface measure on S for simplicity)
ˆ ˆ
|∇γ| =
2
∇ · (γ∇γ) − γ∇2 γ
V ˆV ˆ
= d~σ · (γ∇γ) − γ∇2 γ
ˆS ˆ V
= γ∂n̂ γ − γ∇2 γ.
S V
Either Dirichlet or Neumann condition implies γ∂n̂ γ = 0. Together with the Laplace
equation ∇2 γ = 0, it follows that ˆ
|∇γ|2 = 0.
V
This implies ∇γ = 0, hence γ is a constant. For Dirichlet boundary condition, γ = 0 and the
solution is unique. For Neumann boundary condition, the solution is unique up to an additive
constant.
40
2.3.2 Green’s Function
The solution to the Poisson’s equation in a finite region V with either Dirichlet or Neumann
boundary condition can be obtained by means of Green’s function G(~r, ~r ′ ), which satisfies
Here ∇2′ is the Laplacian with respect to ~r ′ . Such a Green’s function can be written as
1
G(~r, ~r ′ ) = + F (~r, ~r ′ )
|~r − ~r ′ |
where F satisfies the Laplace’s equation
∇2 φ = −ρ/ε0 inside V.
which gives an explicit formula of φ in terms of its boundary value φ|S . Thus this formula is
suitable for Dirichlet boundary value problem.
Remark 2.3.1. GD (~r, ~r ′ ) has the symmetry property in general
GD (~r, ~r ′ ) = GD (~r ′ , ~r ).
41
Here A = Area(S) is the total area of the boundary surface S. Such choice of constant is to be
consistent with the equation ∇2′ GN (~r, ~r ′ ) = −4πδ(~r − ~r ′ ). In fact,
ˆ ˆ ˆ ´
3 ′ 2′ ′ ′ ′ 1
−4π = d r ∇ GN (~r, ~r ) = ∂n̂ GN (~r, ~r ) = (−4π)/A = −4π S
V S S A
ˆ
=⇒ A= 1 = Area(S).
S
Using GN (~r, ~r ′ ),
we find
ˆ ˆ ˆ
1 ′ 1 ′ 1
φ(~r ) = 3 ′ ′
d r ρ(~r )GN (~r, ~r ) + ′ ′
∂ φ(~r )GN (~r, ~r ) − φ(~r ′ )∂n̂ ′ GN (~r, ~r ′ )
4πε0 V 4π S n̂ 4π S
ˆ ˆ ˆ
1 1 1
= φ(~r ′ ) + d3 r′ ρ(~r ′ )GN (~r, ~r ′ ) + ∂ ′ φ(~r ′ )GN (~r, ~r ′ ).
A S 4πε0 V 4π S n̂
which gives an explicit formula of φ in terms of its boundary normal derivative ∂n̂ φ|S and the
´
boundary integral S φ(~r ′ ). Thus this formula is suitable for Neumann boundary value problem.
We can also interpret Green’s function G(~r, ~r ′ ) as an electrostatic problem as follows. Let
us consider the following potential function φ⃗r
1
φ⃗r (~r ′ ) := G(~r, ~r ′ ).
4πε0
Then it satisfies the equation
1
∇2′ φ⃗r (~r ′ ) = −
δ(~r − ~r ′ )
ε0
which represents a potential for a unit point charge located at ~r inside V . Let us write
1 1
φ⃗r (~r ′ ) = + ξ⃗r (~r ′ )
4πε0 |~r − ~r ′ |
for some ξ⃗r (~r ′ ). Then ξ⃗r (~r ′ ) satisfies the equation
It represents the potential due to certain external charge distribution outside V chosen in such
a way that the combined potential with the internal point charge at ~r has the specific boundary
condition on ∂V .
Remark 2.3.2. Green’s function on Rn . A strategy to solve the Poisson’s equation on Rn is
to find a fundamental solution of
∇2 Φ(~r ) = −δ(~r ).
42
where ci ’s are constants. We choose appropriate constants such that
1
− ln r n=2
2π
Φ(r) =
1
n≥3
n(n − 2)α(n)rn−2
π n/2
Here α(n) = is the volume of the n-dimensional unit ball, and
Γ(1 + n/2)
(n/2)! n even
Γ(1 + n/2) = √
n!!2−(n+1)/2 π n odd
Then we show ∇2 Φ = −δ(~r ) in the sense of distribution. This means for each smooth
function g : Rn → R with compact support, we have
ˆ
Φ · ∇2 g = −g(0).
Rn
Indeed,
ˆ ˆ ˆ
Φ·∇ g =
2
Φ·∇ g+2
Φ · ∇2 g
Rn B(0,ε) B(0,ε)c
ˆ ˆ ˆ ˆ
Integration
======== Φ · ∇2 g + ∇2 Φ · g − ∂⃗n Φ · g + Φ · ∂⃗n g
by part B(0,ε) B(0,ε)c ∂(B(0,ε)c ) ∂(B(0,ε)c )
| {z´ } | {z } | ´
{z } | {z
´
}
≤c||∇g||L∞ B(0,ε) rΦ =0 ∂(B(0,ε)c ) g ≤||∂⃗n g||L∞ ∂(B(0,ε)c ) Φ
= Vol(∂(B(0,ε)c ))
ε→0
==== 0 + 0 − g(0) + 0 = −g(0).
Here B(0, ε) is the ball of radius ε centered at 0, and B(0, ε)c = Rn − B(0, ε). It follows that
ˆ
1
φ(~r ) = d~r ′ Φ(~r ′ )ρ(~r − ~r ′ )
ε0 Rn
is a solution to the Poisson’s equation
∇2 φ = −ρ/ε0 .
V = {z ≥ 0}.
and assume that S = {z = 0} is a conducting plane (so the scalar potential is constant on S).
43
V
q z0
z=0
−q
−z0
GD (~r, ~r ′ ) = GD (~r ′ , ~r ).
Here we can think about the mirror charge σ(~r ′ ) as an external charge outside V (the F (~r, ~r ′ )
term as discussed in Section 2.3.2).
As another example, we consider a point charge q outside a conducting sphere of radius R.
We set the potential on the conducting sphere to be zero. Finding the potential of this problem
is the same as describing the Dirichlet Green’s function on V = {|~r | ≥ R}, S = {|~r | = R}.
q ~r+
44
~r
R
σR (~r )
It is clear that
q
∇2 φ⃗r+ (~r ) = − δ(~r − ~r+ ) inside V.
4πε0
On the other hand, let n̂ = ~r/|~r | and n̂+ = ~r+ /|~r+ | denote the unit direction of ~r , ~r+ and let
r = |~r |, r+ = |~r+ |. Then
q 1 R
φ⃗r+ (~r ) = − .
4πε0 |~r − ~r+ | |rr+ n̂ − R2 n̂+ |
It is clear from this expression that
It follows that
φ⃗r+ (~r )|⃗r ∈S = φ⃗r (~r+ )|⃗r ∈S = 0.
~r
V
45
In the region r′ << r, we have valid Taylor series expansion
1 1 ′ 1
′ = − ~r · ∇ + ··· .
|~r − ~r | r r
⃗
r
where r̂ = r is the unit vector in the direction of ~r.
Recall the series expansion
X ∞
1
√ = tl Pl (x) for |x| ≤ 1, 0 < t < 1.
1 − 2xt + t2 l=0
Here Pl (x)’s are the so-called Legendre polynomials. Explicitly, they are given by
1 dl 2
Pl (x) = (x − 1)l .
2l l! dxl
The first a few terms are
P0 (x) = 1, P1 (x) = x,
1 1
P2 (x) = (3x2 − 1), P3 (x) = (5x3 − 3x),
2 2
···
46
where ˆ
Q= d3 r′ ρ(~r ′ )
V
is the total electric charge, and ˆ
P~ = d3 r′ ρ(~r ′ )~r ′
V
is the electric dipole moment, and
ˆ
1
Qij = d3 r′ ρ(~r ′ )ri′ rj′
2 V
is the electric quadrupole where we write the vector ~r ′ as ~r ′ = (r1′ , r2′ , r3′ ).
For electrically neutral object (Q = 0), the second term describes the leading electrostatic
potential at far distance
1 P~ · ~r
φ(~r ) ' , r >> 0.
4πε0 r3
Then the leading approximation of the electric field is
Consider two point charges +q and −q at a distance d apart. The total charge is zero. The
dipole moment is
~
P~ = q d.
+q
d~
−q
The “electric dipole” is the limit d → 0 and q → +∞ such that P~ is fixed. In this limit,
the potential is
1 P~ · ~r
φ(~r ) =
4πε0 r3
where higher terms in the multipole expansion vanish in the limit. The electric field is
~ r) = 1 3(r̂ · P~ )r̂ − P~
E(~ .
4πε0 r3
This allows us to draw the electric field of the dipole as
+ P~
d→0
−
dipole electric field
47
2.4.3 Dielectric Matter
~ tot = E
E ~ int 6= 0
~ ext + E inside dielectric.
Each nucleus has charge +q at the center surrounded by a spherical cloud of electrons of charge
−q such that the total effect is neutral. Let us now apply an external electric field, and the
atoms get polarized
−+
~
E
with a dipole moment. This can be approximated by electric dipoles. The polarisation P~ (~r )
is the dipole moment density per unit volume. The potential arising from the polarisation is
given by integrating the dipole potential
ˆ ~ r ′ ) · (~r − ~r ′ ) ˆ
1 3 ′ P (~ 1 3 ′ ~ ′ ′ 1
φ(~r ) = d r = d r P (~r ) · ∇
4πε0 V |~r − ~r ′ |3 4πε0 V |~r − ~r ′ |
ˆ ˆ
1 P~ (~r ′ ) 1 ′ ~ r ′)
3 ′ ∇ · P (~
= d~σ · − d r .
4πε0 ∂V |~r − ~r ′ | 4πε0 V |~r − ~r ′ |
The first surface integral is the potential from the surface charge density
P~ · n̂
This is called the bound charge due to polarisation. Assume there are also some charges
inside the matter that are free to move, which do not arise form polarisation. Let us call this
48
extra charge ρfree . Then total potential consists of that arising from the polarisation and that
produced from the free charges. Thus the total electric field is
Then it obeys
∇·D
~ = ρfree .
~ = ε0 (1 + χe )E
D ~ = εE
~
where ε > ε0 is called the permittivity of the material. εr = ε/ε0 = 1+χe is called the relative
permittivity. Then
∇·E~ = ρfree
ε
takes the same form as that in the vacuum. The polarisation has simply the effect of replacing
ε0 by ε.
Example 2.4.1. Consider a dielectric sphere of radius R, with a point charge Q at the center.
49
implies (as we have seen before)
~ = Q
D ~r (r < R)
4πr3
The electric field is then
E ~ = 1 Q ~r
~ = D/ε (r < R)
εr 4πε0 r3
This behaves like a charge Q/εr placed at the origin. This says that the bound charge gathers
at the origin, screening the original charge.
Outside the matter, we have
~ = Q
E ~r (r > R)
4πε0 r3
which does not depend on the polarization. On the surface, we have
~+ = Q ~− = Q
E n̂, E n̂.
4πε0 R2 4πεR2
n̂
~+
E
~−
E
~+ −E
E ~ − = ξ n̂,
ε0
Q 1
ξ= 1− .
4πR2 εr
The total surface charge is therefore Q − Q/εr . This is precisely the opposite of the bound
charge at the origin, as expected.
50
In this steady situation when things do not depend on time, we have the steady-current condition
∇ · ~j = 0.
∇×B
~ = µ0~j.
~
~ =∇×A
B
A ~ ′=A
~ 7−→ A ~ + ∇χ
for a function χ = χ(~r ). In terms of the vector potential, the Ampère’s law becomes
~ ) = µ0~j.
∇ × (∇ × A
~ + ∇(∇ · A
−∇2 A ~ ) = µ0~j.
Remark 2.5.1. One quick way to see this is to use differential forms. In fact, let us identify the
~ with a 1-form
vector A
A = Ax dx + Ay dy + Az dz.
~ corresponds to the 1-form ∗3 d3 A. Thus the vector ∇ × (∇ × A
The vector ∇ × A ~ ) corresponds
to
∗3 d3 ∗3 dA = d∗3 d3 A.
~ + ∇(∇ · A
which corresponds to the vector −∇2 A ~ ).
51
Due to the gauge degrees of freedom, we can choose some gauge fixing condition to simplify
the problem. A natural choice is the
Coulomb gauge : ~ = 0.
∇·A
~ = f 6= 0. Consider
This can also be achieved for domains in question. In fact, assume ∇ · A
the gauge transformation
~ ′=A
A ~ + ∇χ.
~ ′ = 0 is to solve
Solving the Coulomb gauge condition ∇ · A
∇2 χ = −f
which is the Poisson’s equation. With appropriate boundary condition, the solution exists and
is unique.
~ = 0. Then the
Now let us assume the vector potential satisfies the Coulomb gauge: ∇ · A
Ampère’s law in such gauge becomes
~ = −µ0~j
∇2 A
which becomes the vector version of the Poisson’s equation. Our experience in electrostatics
allows us to write down the solution in the space
ˆ ~j(~r ′ )
~ r ) = µ0
A(~ d3 r′ .
4π |~r − ~r ′ |
We can check that the Coulomb gauge is satisfied when the current ~j is suitably located in some
region V . Then
ˆ ˆ
µ0 1 µ0 1
~ r) =
∇ · A(~ d3 r′ ~j(~r ′ ) · ∇ = − d3 ′~ ′
r j(~
r ) · ∇ ′
4π |~r − ~r ′ | 4π |~r − ~r ′ |
ˆ
µ0 1
= d3 r′ ∇′ · ~j(~r ′ ) = 0.
4π |~r − ~r ′ |
Here we have used integration by part and the steady current condition ∇ · ~j = 0.
As a result, the magnetic field can be written as
ˆ
µ0 ~r − ~r ′
~
B(~r ) = ∇ × A(~r ) =
~ d3 r′ ~j(~r ′ ) × .
4π |~r − ~r ′ |3
52
~j
Therefore we find ˛
d~r · B
~ = µ0 I.
C
Example 2.5.2 (Infinite Straight Wire). Consider an infinite straight wire carrying a steady
current I. Assume the wire is placed along the z-axis. Symmetry leads us to consider cylindrical
polar coordinate
(ρ, θ, z)
p
where ρ = x2 + y 2 . The current is
~j = I δ(x)δ(y) ẑ.
~
B
θ̂
~r − ~r ′
53
We can check this result by considering a loop integral along the circle Cρ of radius ρ in
the xy-plane centered at the origin.
Cρ
˛ ˆ 2π
µ0 I
d~r · B(ρ,
~ θ) = ρdθ = µ0 I
Cρ 0 2πρ
as expected.
Example 2.5.3 (Infinite Plane Current). Consider the surface of xy-plane carrying a constant
~ per unit length. Assume K
surface current density K ~ = kx̂ lies in the x-direction.
y
~
K
Based on our experience from the infinite line wire, and the translation symmetry, we see
~ is oriented along the direction of −ŷ when z > 0, along the direction of ŷ when z < 0,
that B
and its magnitude only depends on z. Let us write
~ = −B(z)ŷ,
B
where B(−z) = −B(z). Now we consider Ampère’s law in the rectangle as illustrated.
C
L
z
y
z
x
−z
˛
d~r · B
~ = B(z)L − B(−z)L = 2B(z)L.
C
The total current through the surface is KL. It follows that 2B(z)L = µ0 KL, i.e.
µ0 K
− ŷ if z > 0
~ =
B 2
µ0 K ŷ if z < 0
2
54
Note that the magnitude of the magnetic field is constant, which is the same situation as in
electrostatics. The magnetic field is not continuous and exhibits a jump across a surface current.
Let
~ ± = limit of the magnetic field along the ±n̂ side of the surface.
B
n̂ × (B
~+−B
~ − ) = µ0 K.
~
A similar argument as in our discussion of electric field interface condition in Section 2.2.3
leads to the following interface condition for magnetic field across a surface with current density
K~ and normal vector n̂.
n̂ · (B
~+−B~ −) = 0 continuous along normal direction
n̂ × (B~+−B~ − ) = µ0 K
~ jump along tangent direction
We leave the details to the reader.
Example. Solenoid
A solenoid consists of a surface current travelling around the cylinder. We consider an
infinite cylinder with surface density in the rotating direction as in the picture.
~
K
Let
(ρ, θ, z)
~ = K θ̂
K
~ = B(ρ)ẑ.
B
55
~ = 0 outside the cylinder.
Outside the cylinder: B(+∞) = 0 =⇒ B
Inside the cylinder: Consider the surface
~
B
˛
d~r · B
~ = B(ρ)L, I = KL =⇒ B(ρ) = µ0 K, ~ = µ0 K ẑ.
B
C
Here r̂ = ⃗rr , and Pl ’s are Legendre polynomials. Plug this into the vector potential, we find
ˆ X∞ ′ l
~ r ) = µ0
A(~ d3 r′
r
Pl (r̂ · r̂ ′ )~j(~r ′ )
4πr V r
l=0
ˆ
µ0 ~j(~r ′ ) 1 ~r · ~r ′
= d3 r′ + ~j(~r ′ ) 3 + ···
|{z} .
4π V | {z r} | {z r } multipole
monopole term dipole term
The monopole term vanishes. For example, let us consider the x-component. Denote
′
~r = (x′ , y ′ , z ′ ), ~j = (jx , jy , jz ) in components. We have
ˆ ˆ
3 ′ ′
d r jx (~r ) = d3 r′ ∇′ · (x′~j ) − x′ ∇′ · ~j = 0
V V
56
Using jx = ∇′ · (x′~j(~r ′ )) as above, we have
ˆ ˆ ˆ ˆ
3 ′ ′ ′ ′ ′
d r (~r · ~r )jx (~r ) = d r (~r · ~r )∇ · (x j ) = − d r ∇ (~r · ~r ) · x j = − d3 r′ (~r · ~j )x′ .
3 ′ ′ ′~ 3 ′ ′ ′~
V V V V
Now using the vector identity ~a × (~b × ~c ) = (~a · ~c )~b − (~a · ~b )~c,
we find
ˆ ˆ h i
1
d3 r′ (~r · ~r ′ )~j(~r ′ ) = d3 r′ (~r · ~r ′ )~j(~r ′ ) − (~r · ~j )~r ′
2 V
V
ˆ
1 3 ′ ′ ′
= d r ~r × j(~r ) × ~r.
~
2 V
~ = µ0 3(r̂ · m)r̂
B
~ −m ~
, r >> 0.
4π r 3
Example 2.6.1 (Magnetic Moment and Angular Momentum). Consider the current generated
by the motion of a number of charged particles with charges qi , masses Mi , positions ~ri and
velocities ~vi = ~r˙i . The current density is
X
~j(~r ) = qi~vi δ(~r − ~ri ).
i
57
The magnetic (dipole) moment is
ˆ
1 1X 1 X qi L
~i
m
~ = d3 r′ ~r ′ × ~j(~r ′ ) = qi~ri × ~vi =
2 2 2 Mi
i i
~ i = Mi~ri × ~vi is the angular momentum of the i-th particle. Assume all the particles in
where L
motion have the same charge/mass radio qi /Mi = q/M . Then
q ~
L m
~ =
2M
~ =P L
is proportional to the total angular momentum L ~
i i.
~i
L
This classical result is very close to the quantum case describing magnet. The quantum
~ leading to an intrinsic
particles (like electron) carry the so-called “spin” angular momentum S,
spin magnetic moment
q ~
m S.
~g =g
2M
The number g is called the g-factor, and contains important information about the quantum
physics.
~n
I
S
Assume the circuit C bounds a surface S. Let ξ~ be an arbitrary constant vector. Then
˛ ˛ ˆ
~ I ~ I ~ I
ξ·m ~C = ξ · (~r × d~r ) = d~r · (ξ × ~r ) = d~σ · ∇ × (ξ~ × ~r )
2 C 2 C 2 S
ˆ h i ˆ
I ~ ~ ~
= d~σ · ξ ∇ · ~r − (ξ · ∇)~r = I d~σ · ξ.
2 S S
58
Since ξ~ is arbitrary, we find ˆ
m
~C =I d~σ
S
which holds for any surface S with boundary C.
Now assume C is planer, say lies on a plane with normal vector ~n oriented as indicated.
Then ˆ
d~σ = A~n
S
where A is the area circumscribed by C. In this case
m
~ C = IA~n.
C I
~ C = IAn̂ = πIR2 n̂. Let us now consider the limit R → 0, I → ∞, while leaving
Here m
a = IA = πIR2 fixed. In the limit,
˛
d~r ′ (~r ′ ) ∼ Rl−1
l
I
C
hence all higher multipoles go to zero. In this way, we obtain a magnetic dipole with magnetic
moment
m
~ = an̂.
n̂
59
2.6.2 Magnetic Dipole Layers
dm
~ = Id~σ .
Question: What is the magnetic field produced by such a magnetic dipole layer?
Let us first consider an intuitive approach. We can think of a magnetic dipole as a small
current loop
The currents inside S will cancel each other, leading to the effect of a total current I circling
along C. As a result, we should expect that the magnetic field should be the same as that
produced by current I circling along the loop C.
Let us now confirm this intuitive result. By the magnetic dipole formula, we have
ˆ ˆ
~ µ0 ~r − ~r ′ µ0 ′ 1
A(~r ) = ~ ×
dm = Id~σ × ∇ .
4π S |~r − ~r ′ |3 4π S |~r − ~r ′ |
Here the position on the surface S is parametrized by ~r ′ . Let us choose again an arbitrary
~ then
constant vector ξ,
ˆ ˆ
~ ~ µ0 I ~ ′ 1 µ0 I ′ ξ~
ξ · A(~r ) = ξ · d~σ × ∇ = d~
σ · ∇ ×
4π S |~r − ~r ′ | 4π S |~r − ~r ′ |
˛ ˛
µ0 I ξ~ ~ · µ0 I d~r ′
= d~r ′ · = ξ .
4π C |~r − ~r ′ | 4π C |~r − ~r ′ |
60
~ hence
This holds for any vector ξ,
˛
~ r ) = µ0 I d~r ′
A(~
4π C |~r − ~r ′ |
~
B ~j
∂V V
∂S S
~
B
C2
I
Consider another circle C1 that links with C2 as before. The total current through S is
C1
C2
S
I
61
n(C1 , C2 )I where n(C1 , C2 ) is the linking number of C1 and C2 . Then
˛
d~r1 · B(~
~ r1 ) = µ0 n(C1 , C2 )I
C1
where A = Ax dx + Ay dy + Az dz. For those who are familiar with geometry, you realize that
this is the same form of abelian Chern-Simons action.
Let us first consider the gauge invariance of the magnetic helicity. Under the gauge trans-
formation
A 7−→ A + dχ,
where n̂ is the normal vector on the surface ∂V . We see that in the case when ∂V is a magnetic
surface (n̂ · B
~ = 0), the magnetic helicity will be invariant under gauge transformations.
Let us use Biot-Savart law to analyze the magnetic helicity. We assume the current ~j lies
inside V and ∂V is a magnetic surface. Then the vector potential is given by
ˆ ~j(~r ′ )
µ0
~
A(~r ) = d3 r′ .
4π V |~r − ~r ′ |
62
2.8 Dirac Monopole
The Gauss’s law of magnetic field can be written as
d3 ∗3 B = 0
∗3 B = d3 A
where A = Ax dx + Ay dy + Az dz. However, when we work with a region V that has nontrivial
topology, we may not be able to find such an A globally, although locally it always exists
(Poincaré’s Lemma). We will systematically study this later in Chapter 4 in terms of fiber
bundle. Here we illustrate some basic feature via the example of Dirac monopole.
The Dirac monopole has magnetic field
~ = g ~r
B
4π r3
where g is the magnetic charge. It satisfies
∇·B
~ = gδ(r)
hence describing a magnetic monopole of charge g at the origin. Although we have not observed
magnetic monopole in the lab yet, this is still an interesting theoretical model. We can compare
the above formula with the point electric charge
~ = q ~r
E .
4πε0 r3
V = R3 − {0}.
We have
g
∗3 B = (xdy ∧ dz + ydz ∧ dx + zdx ∧ dy)
4πr3
which defines a closed 2-form on V . However, there does not exist globally a 1-form A on V
such that
∗3 B = d3 A.
Indeed, assume this equation holds. Consider the surface S of the unit sphere. On one hand,
we have ˆ ˆ ˆ
∗3 B = d~σ · B
~ = d3 r∇ · B
~ = g.
S S
On the other hand, ˆ ˆ
dA = A = 0.
S ∂S
63
This is a contradiction.
In fact, V has the homotopy type of S 2 , and
2
HdR (V ) ' HdR
2
(S 2 ) ' R.
U+ = R3 \ {(0, 0, z) | z ≤ 0}
U− = R3 \ {(0, 0, z) | z ≥ 0}
z≥0
z≤0
U+ U−
It is clear that
U+ ∪ U− = V = R3 − {0}
U ∩ U = R3 − {z-axis}
+ −
(r, θ, ϕ)
r
θ
y
ϕ
We can write
g
A± = ± (1 ∓ cos θ)dϕ.
4π
64
Note that ∗3 B in spherical coordinate is
g
∗3 B = sin θdθdϕ
4π
where sin θdθdϕ is the standard area form on S 2 . It can now be easily checked that
∗3 B = dA± valid on U± .
d(A+ − A− ) = 0
1
HdR (U+ ∩ U− ) = HdR
1
(R3 − {z-axis}) ' HdR
1
(S 1 ) = R.
S1
must be an integer (Theorem 4.3.6). This gives the Dirac quantization condition
ge ∈ 2πZ.
65
Chapter 3 Electrodynamics
In this chapter, we study dynamical aspects of electromagnetism where the electric and
magnetic fields will evolve with time in general. We will see how electromagnetic waves arise
from solving Maxwell’s equations, and how they are produced by accelerating charges.
~ + ~v × B).
F~ = q(E ~
Example 3.1.1 (Force between steady charges). Consider two charge distributions ρ1 and ρ2
in the space.
ρ1 ρ2
F~2e = −F~1e .
In particular, F~1e = F~2e = 0 when ρ1 = ρ2 . Thus the net force of a charge distribution on itself
is zero.
Example 3.1.2 (Force between steady currents). Consider two steady current ~j1 and ~j2 in the
space.
66
~j1 ~j2
F~2m = −F~1m .
Again, F~1m = F~2m = 0 when ~j1 = ~j2 . Thus the net force of a current distribution on itself is
zero.
Comparing with the electric force case, we find that F~1m takes the same form as that of
F~2m , except with an extra minus sign. This sign difference says that parallel currents attract
and anti-parallel currents repel.
attract repel
67
Example 3.1.3 (Helical motion in magnetic field). Consider a uniform constant magnetic field
~ = (0, 0, B)
B
pointing in the z-direction with magnitude B. We consider the motion of a particle of charge
q, mass m. The equation of motion is
m~r¨ = q~r˙ × B.
~
vz = 0
We explore the energy stored in electromagnetic fields. Suppose we have some charge and
~ Here we consider
~ and magnetic field B.
current configuration which produces electric field E
the dynamical case and things will vary with time t. Then we need the full Maxwell’s equations.
Consider the infinitesimal work dW done by the electromagnetic forces on these charges in
the infitinesimal time dt. According to Lorentz force law, the work on a charge q is
F~ · d~r = q E~ + ~r˙ × B
~ · ~r˙ dt = q E~ · ~v dt.
This can be generalized to the case of charge and current densities, by replacing
68
Therefore the total work done in the region V is
ˆ
dW ~ · ~j.
= d3 r E
dt V
Poynting vector : ~= 1E
S ~ ×B
~
µ0
which describes the energy flow. Then
ˆ ˆ
1
d~σ · (E
~ × B) =
~ d~σ · S
~
µ0 ∂V ∂V
~
S
69
gives the expected interpretation: the work done by the electromagnetic force is equal to the
decrease in the total energy in the region less the energy that flowed out of the region. Let
1 ~2 + 1 B~2
u= ε0 E
2 µ0
~ · ~j = − ∂u − ∇ · S.
E ~
∂t
In the region of empty space where no work is done, the above equation becomes
∂u
+∇·S
~ = 0.
∂t
∂ρ
Comparing with the charge conservation ∂t + ∇ · ~j = 0, the above equation can be viewed as
the local form of the conservation of electromagnetic energy.
Example 3.1.4. Consider a uniformly charged spherical shell of total charge Q and radius R.
R
Q
Example 3.1.5 (Electromagnetic wave). Consider the following solution of Maxwell’s equations
in the vacuum:
E~ = cos(~k · ~r − ωt)E
~0 ~k
where k̂ = .
B~ = 1 cos(~k · ~r − ωt)k̂ × E
~0 |~k|
c
~ 0 , ~k, ω are constants and satisfy
Here E
c2~k · ~k = ω 2 , ~k · E
~ 0 = 0.
This describes an electromagnetic wave that we will discuss in detail in Section 3.3. This
wave travels in the direction of ~k, while the electric field and magnetic field are oscillating in
orthogonal directions.
70
~k
k̂
~
B
~
E
~
B
~
E
which simply says that the wave is transporting energy in the speed of light.
Let C be a conducting circuit. The electromotive force (“emf” for short) is the accu-
mulated tangential force per unit charge throughout the circuit. Equivalently, emf is the work
done on a unit charge moving around the whole circuit.
In a circuit context, the emf will drive the charges in the wire and form a current. In many
circumstances, the resulting current I is proportional to the applied emf E, which is called
Ohm’s Law:
E = IR.
71
Note that if the circuit C is static in the space, then ~v is always pointed along the direction of
the circuit, hence d~l · (~v × B
~ ) = 0. In this case, we have
˛
E= d~l · E
~ if C is at rest.
C
In other words, the second term in E comes from the effect of moving circuit. The precise
relationship is described by the “Flux Rule” that we now derive.
Assume C bounds a surface S. The magnetic flux through S is defined to be
ˆ
Φ= d~σ · B.
~
S
~
B
Let us now consider the change of Φ with respect to time. Assume the circuit C is moving with
velocity ~vC (~r ) at each point ~r of C. Let us write
ˆ
Φ= ∗3 B
S
Here ι⃗vC is the contraction with respect to the vector field ~vC . Expanding the notation, we find
(Exercise) ˆ ˆ
ι⃗vC (∗3 B) = − d~l · (~vC × B
~ ).
C C
This can be also understood from the picture: the total charge of the flux is computed by the
“volume” formed by d~l, ~vC , B.
~ We know that the “volume” is computed by the determinant,
hence the expression d~l · (~vC × B
~ ).
~vC ~
B
d~l
72
It follows that ˆ ˆ
dΦ ∂B~
= d~σ · − d~l · (~vC × B) ~
dt Sˆ ∂t C ˆ
Faraday’s
======= − d~σ · (∇ × E)
~ − d~l · (~vC × B)
~
law
ˆS ˆ C
=− ~
dl · E −
~ d~l · (~vC × B).~
C C
Now the velocity ~v differs from ~vC by a tangential vector along C, hence
ˆ ˆ
~
dl · (~vC × B) =
~ d~l · (~v × B).
~
C C
~
B
~ increases
assume B emf induced current and magnetic field
Example 3.2.1 (Jumping Ring). Place a metal ring on top of a solenoidal coil around an iron
core. Let us now switch on the current, which will produce a magnetic field. This will induce
a current in the ring in the opposite direction. Since opposite current repel, the ring will jump
out.
73
ring
solenoid
Consider two loops C1 , C2 of wire at rest. If we run a steady current I1 around C1 , it will
~ 1 and hence a magnetic flux Φ2 through the loop C2 . The flux Φ2 is
produce a magnetic field B
proportional to the current I1
Φ2 = M21 I1
S2 C2
~1
B
C1
I1
Therefore
ˆ ˆ ˛ ˛ ˛
~ 1) = ~ 1 (~r ) = µ0 I1 d~r · d~r ′
Φ2 = d~σ · B
~1 = d~σ · (∇ × A d~r · A .
S2 S2 C2 4π C2 C1 |~r − ~r ′ |
It follows that ˛ ˛
µ0 d~r · d~r ′
M21 =
4π C2 C1 |~r − ~r ′ |
74
which is a purely geometric quantity. This expression is invariant under the switch of C1 and
C2 , hence
call
M21 = M12 ==== M.
As a consequence, if we vary the current in C1 , Faraday’s law of induction says this will
vary the flow through C2 and produce a current in C2 . Isn’t this amazing!
A changing current also induces an emf in the source loop itself. The flux is proportional
to the current
Φ = LI
where L is called the self induction of the loop. The induced emf is then given by
dI
E = −L .
dt
Example 3.2.2 (Inductance of the Solenoid). Consider a solenoid wrapped by a wire which
carries current I and winds n times per unit length. Assume the solenoid has length l and
√
cross-sectional area A, and l >> A. This is approximately the infinite cylinder example with
horizontal surface current density N I as we discussed before. The magnetic field inside the
solenoid is
~ = µ0 N I ẑ.
B
~
B
Φ = (µ0 N IA)N l = µ0 IN 2 Al = µ0 IN 2 V
75
3.2.4 Magnetostatic Energy
Let us try to increase the current of a circuit C to achieve the current I. The work done
in this process can be viewed as the energy stored in the circuit. At time t, the induced emf is
dI
E = −L .
dt
Therefore the rate of electrical work by induced forces is
dW dI
= EI = −LI ,
dt dt
so
1
W = − LI 2 .
2
The total energy stored is U = −W = 12 LI 2 . It is illustrating to compare this with the
mechanical case
Particle Circuit
mv (momentum) LI
1 2 1 2
2 mv (kinetic energy) 2 LI
Let us rewrite this into the form of current density by replacing Id~l → ~j d3 r
ˆ
1 ~
U= d3 r ~j · A
2
In the magnetostatic case, this is
ˆ ˆ ˆ
1 ~ 1 ~ 1
U= d r (∇ × B) · A =
3 ~ d r (∇ × A) · B =
3 ~ ~ ·B
d3 r B ~
2µ0 2µ0 2µ0
~ = 0) that we find previously in Section 3.1.2.
which is the same electromagnetic energy (when E
Example 3.2.3 (Solenoid). We consider the solenoid as described in the previous example.
The energy density of the solenoid is
1 ~ ~ µ0 N 2 I 2
B·B= .
2µ0 2
The total stored energy is
µ0 N 2 I 2
U= V.
2
Comparing with the formula U = 12 LI 2 , we find
L = µ0 N 2 V
76
3.3 Electromagnetic Wave
We study Maxwell’s equations in the absence of sources, which gives rise to solutions
by electromagnetic waves propagating in spacetime. In particular, this leads to Maxwell’s
speculation of light as electromagnetic wave, and brings light, electricity and magnetism into
the same fundamental phenomenon.
F = E ∧ dt + ∗3 B = E ∧ dt + ∗(B ∧ cdt)
= (Ex dx + Ey dy + Ez dz) ∧ dt + (Bx dy ∧ dz + By dz ∧ dx + Bz dx ∧ dy),
dd∗ + d∗ d = −□,
In components, this is
~
1 ∂2E
2 2 − ∇2 E ~ =0
c ∂t
2~
1 ∂ B − ∇2 B ~ =0
c2 ∂t2
which are the vector valued wave equations travelling at the speed of light c.
Since Maxwell’s Equations in the vacuum are linear, it turns out to be convenient to express
solutions via complex valued functions while electromagnetic fields are obtained by taking their
real part. We will assume this implicitly and will not distinguish real and complex valued
solutions when it is clear from the context.
77
3.3.2 Plane Waves
□ϕ = 0 ⇐⇒ ω 2 = c2 k 2 where k = |~k|.
The two cases ω = ±ck corresponding to two waves propagating in opposite directions along ~k.
We will take
ω = ck
in the following discussion. The other case can be equivalently described by the propagating
vector −~k.
To see the precise meaning of ~k and ω, let us first consider standing at a fixed point ~r in the
⃗
space. Then the wave ei(k·⃗r−ωt) at the point ~r will oscillate ω
2π times per unit time. Similarly, if
k
we look at the wave at an instant time t, we find that the wave will oscillate 2π times per unit
length along the direction ~k.
ω k
2π times per unit time 2π times per unit length
ω k
So 2π / 2π = ω/k is the velocity of the wave.
Remark 3.3.1. The ratio ω/k is called the “phase velocity” of the wave. For electromagnetic
waves in the vacuum, the phase velocity is always the speed of light c for waves of any frequency:
ω/k = c. This is called “dispersionless” waves. If we consider waves in dispersive matter, we
will find the phase velocity depending on frequency
ω(k) = v(k) · k.
78
~ 0 are complex valued constant vectors. Physical solutions are their real parts as
~ 0, B
Here E
remarked above. Since E ~ 0 are constants, they solve the wave equation
~ 0, B
□E
~ = □B
~ = 0.
On the other hand, Maxwell’s equations say more than wave equations and will put further
constraints. The divergence free relations
∇·E
~ = 0, ∇·B
~ =0
is equivalent to
~ 0 · ~k = 0,
E ~ 0 · ~k = 0.
B
~ are both perpendicular to the propagating direction ~k. The curl
~ and B
This means that E
relations
~
∂B 1 ∂E~
∇×E
~ =− , ∇×B
~ =
∂t c2 ∂t
is equivalent to
~k × E ~ 0,
~ 0 = ωB ~0 = −ωE
~k × B ~ 0.
c2
~ 0 and
~ 0 is also perpendicular to B
This means that E
~k
~ 0 = k̂ × E
cB ~0 where k̂ = .
k
The above relations can be summarized by the geometric figure
~k
|E
~ 0 | = c|B
~ 0|
~0
cB
~0
E
3.3.3 Polarization
Let us consider electromagnetic waves in the propagating direction ~k. Let us fix two unit
vectors ê1 , ê2 ∈ R3 such that
k̂
ê2
ê1
79
Let us express
~ 0 = E1 ê1 + E2 ê2
E
where we emphasise that E1 , E2 are complex numbers. Then
~ = (E1 ê1 + E2 ê2 ) ei(⃗k·⃗r−ωt) .
E
~ can be obtained from E
The magnetic field B ~ by
Let us write
E1 = A1 eiθ1 , E2 = A2 eiθ2 , Ai ≥ 0.
The behavior of the electric/magnetic field can be described via the following cases:
⃝
1 Linear Polarization: eiθ2 = ±eiθ1 or θ2 − θ1 ∈ Zπ. In this case we have
h i
~ = Re ~uei(⃗k·⃗r−ωt+θ1 ) = ~u cos(~k · ~r − ωt + θ1 )
E
where ~u is the fixed vector ~u = A1 ê1 ± A2 ê2 . The electromagnetic fields are oscillating along
fixed directions. The wave at an instant time looks like
~k
k̂
~
B
~
E
k̂ × ~u
~u
⃝
2 Circular Polarization: E2 = ±iE1 . In this case we have
h i
~ ± = Re (ê1 ± iê2 )A1 ei(⃗k·⃗r−ωt+θ1 ) = A1 cos(~k · ~r − ωt + θ1 )ê1 ∓ sin(~k · ~r − ωt + θ1 )ê2 .
E
~ ± has a constant magnitude, but rotates in a
At a fixed position ~r in the space, the vector E
circle at a frequency ω. E ~ + is rotating counterclockwise, while E
~ − is rotating clockwise, as
shown below
~+
E ~−
E
ê2 ê2
ê1 ê1
80
The wave at an instant time looks like (for the right circular polarization case)
~k
⃝
3 Elliptical Polarization: general E1 and E2 .
h i
⃗
Re (E1 ê1 + E2 ê2 )ei(k·⃗r−ωt) = A1 cos(~k · ~r − ωt + θ1 )ê1 + A2 cos(~k · ~r − ωt + θ2 )ê2 .
ê2 ~ ê2
cB
~
E
ê1 ê1
In the above, we have focused on the case of a plane wave with fixed ~k. Such a wave
⃗
ei(k·⃗r−ωt) spreads out the full space and can not exist in nature. In general, since the wave
equation is linear, a localized wave can be obtained by superposing plane waves (like Fourier
modes) with different wave vectors. A general electromagnetic wave packet is given by
ˆ
1 ⃗
~
E= 3
d3 k ~E(~k )ei(k·⃗r−ωt)
(2π)
ˆ
B~ = 1 d3 1
k k̂ × ~E(~k ) ei(⃗k·⃗r−ωt)
(2π)3 c
or more precisely their real part. The vector function ~E(~k ) plays the role of Fourier coefficients.
In the vacuum which is dispersionless, we always have ω = ck. In dispersive matters, we have
a more complicated dispersion relation between ω and k.
Recall that the energy density of an electromagnetic field is
1 ~ 2 + 1 |Re B|
~ 2 = ε0 |Re E|
u= ε0 |Re E| ~ 2 + |Re cB|
~ 2
2 µ0 2
and the total energy U is
ˆ
ε0
U= d3 r |Re E|
~ 2 + |Re cB|
~ 2 .
2
81
In the vacuum case (ω = ck), we calculate
~ 2 = 1 (E
|Re E| ~ ·E
~ +E~∗ ·E ~ ∗ + 2E ~ ∗ ) = 1 (E
~ ·E ~ ·E ~ +E ~ ·E ~ ∗ ) + c.c.
4 ˆ ˆ 4
1 1 1
= d 3
k d3~ ′ ~ ~
k E(k ) · ~E(~k ′ )ei(⃗k+⃗k′ )·⃗r−ic(k+k′ )t + ~E(~k ) · ~E∗ (~k ′ )ei(⃗k−⃗k′ )·⃗r−ic(k−k′ )t + c.c.
4 (2π)3 (2π)3
Here c.c. means the complex conjugate of the expression before it. Using the identity
ˆ
1
d3 r ei⃗p·⃗r = δ(~
p ),
(2π)3
we find
ˆ ˆ
~ 2=1 1
d3 r |Re E| d3 k ~E(~k ) · ~E∗ (~k ) + ~E(~k ) · ~E(−~k )e−2ickt + c.c.
4 (2π)3
Similarly, the contribution from the magnetic field is
ˆ ˆ h i
~ 2=1 1
d3 r |Re cB| d3
k k̂ × ~E(~k ) · k̂ × ~E∗ (~k ) + k̂ × ~E(~k ) · −k̂ × ~E(−~k ) e−2ickt + c.c.
4 (2π)3
ˆ
1 1 3 ~E(~k ) · ~E∗ (~k ) − ~E(~k ) · ~E(−~k )e−2ickt + c.c.
= d k
4 (2π)3
Adding the above two contributions, we find the following total energy formula for electromag-
netic wave packet in a region of vacuum
ˆ
ε0 1
U= d3 k |~E(~k )|2 .
2 (2π)3
We now consider solving the full Maxwell’s equations with specified distributions of charge
and current:
∇·E ~ = ρ/ε0
∇ · B
~ =0
∇×E
⃗
~ = − ∂B
∂t
∇ × B ⃗
~ = µ0 ~j + ε0 ∂ E
∂t
F = E ∧ dt + ∗3 B, J = ρ/ε0 dt − µ0 j,
82
The consistency condition for d∗ F = J requires
d∗ J = 0.
∂t ρ + ∇ · ~j = 0.
The equation dF = 0 can be solved by introducing the potential 1-form A on R3,1 such
that
F = dA.
Expressing in components,
A = −φdt + A
The choice of A is not unique, and we have the following gauge transformation leaving F
invariant
A 7−→ A + dχ
for any function χ on R3,1 . We need a gauge fixing condition to specify the solution of A as
discussed in Section 1.10. We will focus on the Lorenz gauge here:
1 ∂φ
Lorenz gauge: d∗ A = 0 ⇐⇒ ~ = 0.
+∇·A
c2 ∂t
In terms of the potential 1-form A, the other half of Maxwell’s Equations d∗ F = J becomes
d∗ dA = J.
In Lorenz gauge,
d∗ dA = J Maxwell’s Equations
d∗ A = 0 Lorenz gauge
from which we find
□A = −J.
83
3.4.2 Green’s Function for the Wave Equation
To solve the inhomogeneous wave equation, we follow the same strategy as before by con-
structing the inverse “ □1 ”, which is called the Green’s function for the wave equation.
By definition, the Green’s function G(~r, t; ~r ′ , t′ ) for the wave equation is the wave produced
at (~r ′ , t′ ) by a unit point source at (~r, t). It satisfies
Here □′ = 1 ∂2
c2 ∂t′2
− ∇2′ acts on the variable (~r ′ , t′ ).
Let us first explain how to solve inhomogeneous wave equation using the Green’s function,
and come back to the construction of the Green’s function later. Consider a function ϕ(~r, t)
satisfying the inhomogeneous wave equation
□ϕ = f.
To find the expression of ϕ in terms of f , we apply the Green’s function in the region V ∈ R3
and time interval [t1 , t2 ]:
ˆ t2 ˆ
ϕ(~r, t) = d3 r′ ϕ(~r ′ , t′ )δ(~r − ~r ′ )δ(t − t′ )
dt ′
t1 V
ˆ t2 ˆ
′ ′ 1 ∂2
= dt′ 3 ′
d r ϕ(~r , t ) 2 ′2 − ∇ 2′
G(~r, t; ~r ′ , t′ ).
t1 V c ∂t
For the first term, we use integration by part and find
ˆ t2 ˆ t2 2
′ ∂2 ′ ∂ ∂G ∂ϕ t2
dt ϕ ′2 G = dt ϕ G+ ϕ ′ −G ′ .
t1 ∂t t1 ∂t′2 ∂t ∂t t1
The first term is the superposition of unit point source by the source function f . The
second term is the boundary term in the spatial variables. The third term is the boundary term
in the time variable.
Let us now construct the Green’s function in R3,1 . By symmetry considerations, we seek a
Green’s function of the form
G(~r, t; ~r ′ , t′ ) = G(~r − ~r ′ , t − t′ )
84
and the function G(~r, t) satisfying
1 ∂2
− ∇ G(~r, t) = δ(~r )δ(t).
2
c2 ∂t2
Furthermore, it is natural to assume G(~r, t) is spherically symmetric, and hence G(~r, t) = G(r, t)
only depends on the radius r = |~r | and time t. Then
1 ∂ 2 ∂G ∂ 2 G 2 ∂G 1 ∂2
∇ G= 2
2
r = + = (rG).
r ∂r ∂r ∂r2 r ∂r r ∂r2
We are reduced to solve
1 1 ∂2 ∂2
− (rG) = δ(~r )δ(t).
r c2 ∂t2 ∂r2
At r > 0, we have the one dimensional wave equation
1 ∂2 ∂2 1∂ ∂ 1∂ ∂
− (rG) = − + (rG) = 0
c2 ∂t2 ∂r2 c ∂t ∂r c ∂t ∂r
which has two independent solutions of the form
1
G(∓) (r, t) = g± (t ± r/c)
r
for functions g± to be determined from the behavior at r = 0. Using
2 1
∇ = −4πδ(~r ),
r
we have the following singular behavior at r = 0
1 ∂2
− ∇ G(∓) (r, t) = 4πδ(~r )g± (t ± r/c).
2
c2 ∂t2
Comparing with the equation for Green’s function, we find
1
g± (u) = δ(u).
4π
We conclude that
1
δ(t ± r/c).
G(∓) (~r, t) =
4πr
Therefore we find two solutions of Green’s function
1
G(∓) (~r, t; ~r ′ , t′ ) = δ(t − t′ ± |~r − ~r ′ |/c).
4π|~r − ~r ′ |
Let us plug these Green’s functions into the above expression of ϕ satisfying the inhomo-
geneous wave equation
□ϕ = f.
We assume a spatially localized source and such that the spatial boundary term is absent. Thus
ˆ t2 ˆ
ϕ(~r, t) = dt ′
d3 r′ G(~r, t; ~r ′ , t′ )f (~r ′ , t′ )
t1 V
ˆ ′
1 3 ′
′ ′ ′ ′ ′ ′ ′ ′
t =t2
+ 2 d r ϕ(~r , t )∂t G(~r, t; ~r , t ) − G(~r, t; ~r , t )∂t ϕ(~r , t )
′ ′ .
c V t′ =t1
85
There are two cases:
⃝
1 Retarded Solution: This is to use
1
G(+) (~r, t; ~r ′ , t′ ) = δ(t − t′ − |~r − ~r ′ |/c).
4π|~r − ~r ′ |
Note that in the expression for ϕ(~r, t), we have
t1 ≤ t ≤ t2
so G(+) is only nonzero at the t′ = t1 boundary. The retarded solution is therefore given by
ˆ
1 f (~r ′ , t − |~r − ~r ′ |/c)
ϕret (~r, t) = d3 r′ + ϕin (~r, t)
4π |~r − ~r ′ |
where ϕin (~r, t) collects the boundary term at t′ = t1 .
The term involving the source f has the following interpretation: ϕret (~r, t) is contributed
from source located at ~r ′ at time t − |~r − ~r ′ |/c. Such time delay reflects the propagating wave
signaled at the speed of light c.
Assume t1 is at the time before the source appears. Then at t ≤ t1 , the source integral
does not contribute and we have
So physically, ϕin (~r, t) can be viewed as an “incoming wave” solution of the homogeneous wave
equation at the initial time t1 before the effect of the source. This is also consistent with the
fact that a general solution of the inhomogeneous wave equation is given by the sum of a special
solution (here is the source term) and a solution of the homogeneous wave equation (here is the
incoming wave).
⃝
2 Advanced Solution: This is to use
1
G(−) (~r, t; ~r ′ , t′ ) = δ(t − t′ + |~r − ~r ′ |/c).
4π|~r − ~r ′ |
The discussion is similar and we find the advanced solution
ˆ
1 f (~r ′ , t + |~r − ~r ′ |/c)
ϕadv (~r, t) = d3 r ′ + ϕout (~r, t).
4π |~r − ~r ′ |
The advanced solution at (~r, t) takes the effect of the source located at ~r at a future time
t + |~r − ~r ′ |/c. And ϕout is the “outgoing wave” describing the situation at time t2 after the
effect of the source.
Mathematically, both the retarded and the advanced waves are solutions. Physically, we
will pick the retarded one which reflects the causal structure.
We now use the retarded Green’s function to write down a solution of the inhomogeneous
wave equations
□φ = ρ/ε0
□A
~ = µ ~j 0
86
It reads ˆ
1 ρ(~r ′ , t − |~r − ~r ′ |/c)
φ(~r, t) = d3 r′
4πε0 |~r − ~r ′ |
ˆ ~ ′ ′
~ r, t) = µ0 d3 r′ j (~r , t − |~r − ~r |/c)
A(~
4π |~r − ~r ′ |
Let us check the Lorenz gauge condition
1 ∂ ~ = 0.
φ+∇·A
c2 ∂t
Let us write the above formula via Green’s function
ˆ
1
φ(~r, t) = d3 r′ dt′ G(+) (~r, t; ~r ′ , t′ )ρ(~r ′ , t′ )
ε0 ˆ
~ r, t) = µ0 d3 r′ dt′ G(+) (~r, t; ~r ′ , t′ )~j (~r ′ , t′ )
A(~
We have also learned that Maxwell’s equations admit solutions by electromagnetic waves. We
now explain
accelerating charges =⇒ radiation
Let us first describe waves which are spreading out from some center (say the origin) and
are spherically symmetric. In other words, we consider a solution ϕ of
□ϕ = 0
87
and such that ϕ = ϕ(r, t) where r = |~r |. We have seen that for spherically symmetric functions,
the wave equation becomes
1 1 ∂2 ∂2
− (rϕ) = 0.
r c2 ∂t2 ∂r2
A general retarded solution is given by
f (t − r/c)
ϕ(r, t) = .
4πr
It represents a spherical wave travelling outward from the origin. Strictly speaking, ϕ solves the
wave equation outside the origin but would have singularity at the origin due to the presence
of source. In fact, from the general solution of inhomogeneous wave equation with source S
ˆ
1 S(~r ′ , t − |~r − ~r ′ |/c)
ϕ(r, t) = d3 r′ ,
4π |~r − ~r ′ |
we see that the source is located at the origin given by
so in fact ϕ solves
□ϕ = δ(~r )f (t).
Another thing worth mentioning is that the amplitude of ϕ decays in proportion to 1/r as
the wave propagates. This is consistent with energy conservation. As the wave propagates, the
total energy flux over the sphere of radius r must be the same. The area of the sphere is 4πr2 ,
and the energy density depends on the square of the wave amplitude, so the amplitude of the
wave must decrease as 1/r.
1 1 ~r · ~r ′
= + 3 + ··· for r >> 0.
|~r − ~r ′ | r r
We also assume that the source does not vary too fast: the motion of charges and currents are
non-relativistic, and they do not change very much over the time that it takes light to cross the
r′
region V . This in particular says that the operation c ∂t would produce something very small.
′ r′
r·⃗
Then we can use |~r − ~r | = r − ⃗
r + · · · to expand
′ ′ ′ ~r · ~r ′
f (~r , t − |~r − ~r |/c) = f ~r , t − r/c + + ···
rc
~r · ~r ′
= f (~r ′ , t − r/c) + ∂t f (~r ′ , t − r/c) + · · · .
rc
88
Therefore ϕ is approximated by
ˆ
1 3 ′ ~r · ~r ′ ′ ~r · ~r ′ ′
ϕ(~r, t) ' d r 1 + 2 + ··· f (~r , t − r/c) + ∂t f (~r , t − r/c) + · · ·
4πr r rc
ˆ
1 3 ′ ′ ~r · ~r ′ ′ ~r · ~r ′ ′
= d r f (~r , t − r/c) + ∂t f (~r , t − r/c) + 2 f (~r , t − r/c) + · · · .
4πr rc r
Let us apply this to Maxwell’s equations with charge distribution ρ and current distribution ~j.
The vector potential
ˆ ~j (~r ′ , t − |~r − ~r ′ |/c)
~ r, t) = µ0
A(~ d3 r′
4π |~r − ~r ′ |
has the leading approximation (under the same assumption as above)
ˆ
~ r, t) ' µ0
A(~ d3 r′ ~j (~r ′ , t − r/c).
4πr
Consider the x-component for example, we have
jx (~r ′ , t′ ) = ∇′ · x′ ~j (~r ′ , t′ ) − x′ ∇′ · ~j (~r ′ , t′ ) = ∇′ · x′ ~j (~r ′ , t′ ) + x′ ∂t′ ρ(~r ′ , t′ ).
Therefore ˆ ˆ
d3 r′ jx (~r ′ , t − r/c) = d3 r′ x′ ∂t ρ(~r ′ , t − r/c).
89
Here Q is the total charge. The leading approximation of the electric field is therefore (keeping
only the 1/r-order term)
~
~ = −∇φ − ∂t A
E
r̂ r µ0 ¨
' ¨
· p~ (t − r/c)∇ − p~ (t − r/c)
4πε0 rc c 4πr
µ0 h i
= r̂ · p~¨ r̂ − p~¨ (r̂ · r̂)
4πr
µ0
= r̂ × r̂ × p~¨ (t − r/c) .
4πr
As a consistency check, we can also use Maxwell’s equations. The approximated electric field
~ can be computed via that of B
E ~ by
∂~ ...
~ ' µ0 r̂ × r̂ × p~ (t − r/c) ,
E = c2 ∇ × B
∂t 4πr
so
~ ' µ0 r̂ × r̂ × p~¨ (t − r/c) .
E
4πr
Let us denote this leading electric dipole approximation by
B~ ED = − µ0 r̂ × p~¨ (t − r/c)
4πrc
µ
E~ ED = 0 r̂ × r̂ × p~¨ (t − r/c)
4πr
They are both spherical waves and related by
~ ED = −cr̂ × B
E ~ ED .
r̂
~ ED
E
~ ED
cB
The power radiated from the source can be computed by the Poynting vector
~= 1E
S ~ ' 1E
~ ×B ~ ED × B
~ ED = µ0
|r̂ × p~¨ |2 r̂
µ0 µ0 16π 2 r2 c
which points in the same direction as ~r. We conclude that oscillating dipole is emitting spherical
electromagnetic waves that transporting power radially.
We can also study higher order approximations. For example. the next order are magnetic
dipole and electric quadrupole radiations. We will not discuss them in this note.
90
Let us assume the dipole is oscillating along the z-direction:
p~ = p(t)ẑ.
dipole
θ ~r
Then
µ0
~=
S |p̈|2 sin2 θr̂
16π 2 r2 c
which is largest in the direction perpendicular to the dipole (when θ = π/2) and smallest in the
direction parallel to the dipole (when θ = 0, π).
The total radiated power is computed by
ˆ ˆ 2π ˆ π
µ0 µ0 2
P = d~σ · S =
~
2
|p̈| 2
dφ dθ sin3 θ = |p̈| .
S2 16π c 0 0 6πc
Example 3.5.1. Consider a particle of charge Q oscillating in the z-direction with frequency
ω and amplitude d. The electric dipole moment is
−d
Then
p~¨ = −ω 2 p cos(ωt)ẑ
µ0 p2 ω 4
P (t) = cos2 (ωt).
6πc
The time-averaged power is
ˆ 2π/ω
1 µ0 p2 ω 4
hP i = P (t)dt = .
2π/ω 0 12πc
91
3.6 Moving Point Charge
Let us consider a point particle with charge q moving in the trajectory
~ = position of q at time t.
ξ(t)
~ is general, the point charge will accelerate along the way and radiate electromagnetic
Since ξ(t)
waves. We would like to calculate the electromagnetic fields produced by such a moving point
charge.
The charge and current densities are
ρ(~r, t) = q δ 3 ~r − ξ(t)~
~j (~r, t) = q~v (t) δ 3 ~r − ξ(t)
~
where the sum is over all roots f (xi ) = 0. In fact, this is about change of variable formula:
ˆ X
g(x)δ(f (x))df = g(xi )
xi
and ˆ ˆ
g(x)δ(f (x))df = g(x)|f ′ (x)|δ(f (x))dx.
Comparing the above two expressions, we get the above δ-function relation.
Let us now apply this to δ(f (t′ )) where
~ ′ )| = c(t − t′ ).
|~r − ξ(t
92
~r
c(t − t′ )
t′
ξ~
t
~r
c(t − t′1 )
c(t − t′2 )
t′1
ξ~
t′2 t
We conclude that the speed of the particle ≥ c. Since no charged particle can travel at the speed
of light, this is a contradiction. The existence of retarded time can be achieved by chasing back
along the trajectory for long enough until |~r − ξ(t~ ′ )| = c(t − t′ ) is satisfied since the charged
particle travels relatively smaller than the speed of light.
Let us denote tret for the the retarded time solving the equation
~ ret )|/c = 0
tret − t + |~r − ξ(t
with the understanding that the dependence of tret on (~r, t) is implicit when it is clear from the
context. To simplify notation, let us also write
~
R(t) ~
= ~r − ξ(t), R(t) = |R(t)|,
~ ~
n̂R = R/R, ~ = ~v (t)/c,
β(t) ~
β(t) = |β(t)|.
~
R(t) ~ of the particle to the point ~r of the field.
represents the relative vector from the point ξ(t)
Then
1
δ(f (t′ )) = δ(t′ − tret ).
~
1 − R(tret ) · β(tret )/R(tret )
~
Plugging this into the potential, we find
1 q
φ(~r, t) = .
~ · β~ ret
4πε0 R − R
93
Here [−]ret means the time variable is evaluated at tret . By a similar argument, we find the
vector potential
~ µ0 q~v
A(~r, t) = .
4π R − R~ · β~ ret
The above two expressions are called the “Liénard-Wiechert potentials” for a moving point
charge.
~ = ~v t
ξ(t) with ~v constant.
Geometrically, let C represent the point at time t, and B represent the point at time tret .
D h
θ
B C ~v
tret t
Then
|AB| = |~r − ~v tret |, |BC| = v(t − tret ), where v = |~v |.
Therefore
1
p
q
φ(~r, t) = 4πε0 R 1 − β 2 sin2 θ
~ r, t) = µ0 p ~v
A(~ 4π R 1 − β 2 sin2 θ
~ = ~r −~v t is the relative vector from the present position of the particle to the field point,
Here R
and θ is the angle between R ~ and ~v . Note that for nonrelativistic velocities (v << c),
1 q
φ(~r, t) ' .
4πε0 R
94
We now move on to compute the electromagnetic fields
E ~ = −∇φ − ∂t A~
B ~
~ =∇×A
Since the retarded time tret depends on ~r and t implicitly, we work directly with the expression
ˆ
q δ (t′ − t + R(t′ )/c)
φ(~r, t) = 4πε dt′
0 R(t′ )
ˆ ′ ′ ′
~ r, t) = qµ0 dt′ ~v (t )δ (t − t + R(t )/c)
A(~
4π R(t′ )
Using
′
′ ∂ ′ ′
∇R = R/R
~ = n̂R , ∇δ t − t + R(t )/c = ∇(R/c) − δ t − t + R(t )/c ,
∂t
we find
~ = −∇φ − ∂t A
E ~
ˆ
q ′ ∇R(t )
′
′ ′
∇R(t′ ) ∂ ′ ′
= dt δ t − t + R(t )/c + δ t − t + R(t )/c
4πε0 R(t′ )2 cR(t′ ) ∂t
ˆ
q ∂ ~v (t′ )δ (t′ − t + R(t′ )/c)
− dt′
4πε0 ∂t c2 R(t′ )
" # " #
q ~
R q ∂ R/R ~ − ~v /c
= +
4πε0 R3 g 4πε0 c ∂t Rg
ret ret
" #
q n̂R q ∂ n̂R − β~
= + .
4πε0 R2 g ret 4πε0 ∂t Rgc
ret
⃗ v
where g = 1 − n̂R · β~ = 1 − R·⃗
Rc . To compute t-derivative term, we use
to find
dt 1 ∂R
=1+ (tret ) = g(tret ).
dtret c ∂t
From R ~
~ = ~r − ξ(t), we have
∂R~ ∂R
= −~v = −cβ,~ ~
= −cn̂R · β,
∂t ∂t
! ! !
∂ n̂R 1 ∂R~ ~
R ∂ ~
R 1 ∂ ~
R ∂ ~
R c
= − 3 R ~· = − n̂R · n̂R = n̂R · β~ n̂R − β~
∂t R ∂t R ∂t R ∂t ∂t R
c c
= n̂R × n̂R × β~ = ~ − c gn̂R ,
n̂R − β
R R R
∂Rg ∂ ~ ~
= R−R ~ · β~ = −cn̂R · β~ + cβ 2 − R ~ · dβ = −cn̂R · β~ + cβ 2 − R n̂R · dβ .
∂t ∂t dt dt
This enables us to compute
" # " # " !# " !#
∂ n̂R − β~ dtret ∂ n̂R − β~ 1 ∂ n̂R − β~ 1 ∂ n̂R − β~
= = =
∂t Rgc dt ∂tret Rgc g (tret ) ∂t Rgc g ∂t Rgc
ret ret ret ret
95
and
q n̂R
∂
∂t n̂R − β~ ~
n̂R − β ∂
~ =
E + − 2 3 (Rg)
4πε0 R2 g Rg 2 c R g c ∂t
ret
" !#
q n̂R n̂R − β ~ n̂R dβ/dt n̂R − β~
~ dβ~
= + − 2 − − 2 3 −cn̂R · β~ + cβ − R
2 ~·
4πε0 R2 g R2 g 2 R g Rg 2 c R g c dt
ret
" ! #
n̂R − β~ ~ ~ ~
=
q
g + n̂R · β~ − β 2 + n̂R − β n̂R · dβ − dβ/dt
4πε0 R g 2 3 Rg 3 c dt Rg 2 c
ret
⃗
~ ~ dβ
q n̂R − β (1 − β ) n̂R × n̂R − β × dt
2
= + .
4πε0 R2 g 3 Rg 3 c
ret
That is,
⃗
dβ
~ (1 − β 2 )
n̂R − β n̂R × n̂R − β~ ×
~ = q dt
E + .
4πε0 R2 g 3 Rg 3 c
ret
A similar computation leads to
~ = 1 [n̂R ] × E.
B ~
ret
c
These are called Liénard-Wiechert electric and magnetic fields.
The above expression decomposes the electric and magnetic field into a velocity field and
~ =E
an acceleration field: E ~v +E~ a where
n̂ − ~ (1 − β 2 )
β
E~ q R
v = 4πε
0 R2 g 3
ret
⃗
× − ~ × dβ
~ q R n̂ n̂ R β
Ea =
dt
4πε0 Rg c3
ret
~ v only depends on the velocity β~ and decays as 1/R2 in space. This field and its energy is
E
~ a contains β~˙ which is about the acceleration. It decays as 1/R in
attached to the particle. E
space, and therefore contains surface energy that propagates to infinity. This is the radiation
field which contain energy that radiates from the particle to the space faraway.
~ = ~v t.
Example 3.6.2. We again look at a point charge of constant velocity with ξ(t)
D h
θ
B C ~v
tret t
96
The particle is not accelerating and
n̂ R − ~ (1 − β 2 )
β ~ − Rβ~ (1 − β 2 )
R
~ =E
E ~v = q =
q .
4πε0 R2 g 3 4πε0 R3 g 3
ret ret
~ = (vt, 0, 0).
ξ(t)
~ = (Ex , Ey , Ez )
Then the above formula leads to the explicit expression in components E
q γ(x − vt)
Ex =
4πε0 (γ 2 (x − vt)2 + y 2 + z 2 )3/2
q γy
Ey =
4πε0 (γ 2 (x − vt)2 + y 2 + z 2 )3/2
q γz
Ez =
4πε0 (γ (x − vt) + y 2 + z 2 )3/2
2 2
where γ = √ 1
. This expression has a very suggestive relativistic meaning. Indeed, we
1−v 2 /c2
will show in Example 5.2.5 that this formula can be simply obtained from the Lorentz boost
along x-direction of the fields of a stationary point charge.
3.7 Scattering
We briefly discuss the basic idea behind the scattering of electromagnetic waves. Its physics
contains several steps:
97
Thomson Scattering
~ 0 cos(~k · ~r − ωt) interacting with a free particle of
~ =E
Consider an incoming plane wave E
mass m and charge q. The equation of motion is
m~r¨ = q E~ + ~r˙ × B
~ .
so we can neglect the magnetic Lorentz force. In the non-relativistic limit, we can also assume the
oscillation of the particle is small comparing with the wave length. Under these simplification,
we have
m~r¨ = q E
~ 0 cos ωt
hPrad i µ2 q 4
σ=D E = 0 2.
Sbinc 6πm
Note that the cross-section of Thomson scattering does not depend on the frequency ω. All
wave lengths of light are scattered equally.
Rayleigh Scattering
98
~ 0 cos(~k · ~r − ωt) as
~ =E
We assume linear polarization so α is a constant. For the plane wave E
above, Larmor’s formula gives the time-averaged radiation power
µ0 α2 E20 ω 4
hPrad i = .
12πc
The cross-section is given by
hPrad i µ2 α 2 ω 4
σ=D E= 0
Sbinc 6π
which is stronger for high frequencies or short wave lengths. This explains blue sky in the day
time and red sky at sunset since
λblue < λred .
During the day, we look away from the sun and see light that has scattered by the atmosphere.
At sunset, we look directly at the sun and see light that remains from the scattering.
99
Chapter 4 U (1) Gauge Theory
There are four fundamental interactions in nature: the weak nuclear force, the strong
nuclear force, the electromagnetic force, and the gravity. One landmark of modern physics is
to realise that all these interactions have a common feature in terms of gauge principle.
The notion of gauge transformation and gauge invariance was introduced by Hermann
Weyl in his attempt to unify gravitation and electromagnetism via a geometric framework.
Weyl emphasized the role of gauge invariance as a symmetric principle, and his proposal forms
the foundation of what is now known as gauge theory.
The modern aspect of gauge theory has rich content both in mathematics and physics. We
are not intended to present a full overview of this beautiful theory. Instead, we aim to put
hands on one basic example, the electromagnetism, as a U (1)-gauge theory from the modern
geometric perspective. We assume basic knowledge on the notion of manifolds, and readers
can consult [10, Chapter 3] in this series for preliminary geometric backgrounds. We will give
a self-contained discussion on the geometry of fiber bundles in order to understand the bridge
between the physical content in the first half of this note and the mathematical content that
constitutes the modern framework. We will briefly mention and comment on generalizations,
such as non-abelian gauge theory or Yang-Mills theory, at certain steps along the way.
The notion of fiber bundle describes a family of geometric object (called the fiber) varying
with respect to a parameter space (called the base). Precisely, it consists of the following
geometric data:
100
φα
π −1 (Uα ) Uα × F
π πα
Uα
and this diagram is commutative, i.e. πα ◦ ϕα = π. Here πα : Uα × F → Uα is the projection to
the Uα -factor. This property says that for each x ∈ Uα , ϕα defines a diffeomorphism
∼
ϕα : π −1 (x) → F
so every preimage π −1 (x) of a point x ∈ B is diffeomorphic to F . The total space looks like
π −1 (x) ' F
F F
π π
B x
so π : E → B can be viewed as a family of the fiber manifold F varying over the base B. The
locally trivial condition says that locally it can be parameterized as a trivial family. However,
globally E may not be the same as B × F : nontrivial topological phenomenon can arise.
⃝
2 G is equipped with an action on F
ρ : G × F −→ F
ρ : G −→ Diff(F )
Uαβ
101
By definition, the composition
is a fiberwise diffeomorphism, i.e. for each x ∈ Uαβ , ϕβα maps {x} × F to {x} × F and defines
a diffeomorphism of F under the canonical identification {x} × F ' F . Equivalently, we can
write
ϕβα : Uα ∩ Uβ −→ Diff(F ).
Such ϕβα is called the translation function, which characterizes the difference of identifying
the fiber π −1 (x) ' F under two trivializations. In fact, for x ∈ Uαβ , we have
ϕβα
ϕα ϕβ
ϕα ϕβ F
F
F π −1 (x) F
ϕβα
for some g(x) ∈ G. We say the transformation has the structure captured by the Lie group G,
and this is why G is called the structure group. We will simply write ϕβα (x) ∈ G. Then the
transformation function is required to be expressed as
ϕβα : Uα ∩ Uβ −→ G
with the understanding that its actual transformation on the fiber F is realized via the G-action
ρ
Uα ∩ Uβ −→ G −→ Diff(F ).
This requirement clearly puts further constraint on the trivializations {Uα , ϕα }, and in this case
we say the fiber bundle has structure group G.
We usually denote a fiber bundle by
F E
π
B
and specify the structure group G in the context when we need to.
102
Definition 4.1.1. π : E → B is called a trivial bundle if there is a global trivialization
≃
E B×F
B
so we can identify E as the product B × F .
Theorem 4.1.2. If B is contractible, then any fiber bundle E over B is a trivial bundle.
E
s π i.e. j = π ◦ s.
j
U B
In other words, a section s over U assigns every point x ∈ U an element s(x) of the fiber
π −1 (x). We will mainly consider smooth sections without further specification in this note.
We denote
Γ(U, E) = {sections over U}.
Γ(B, E) = Map(B, F )
s : B −→ E = B × F
b 7−→ (b, f (b))
for f : B → F .
Example 4.1.5 (Möbius strip). Möbius strip is obtained by gluing the two ends of a strip with
a half-twist, as shown below
103
M
S1
S1
is a fiber bundle with fiber I = [0, 1].
By construction, it is clear that global sections of this fiber bundle can be identified with
The Möbius strip is a nontrivial fiber bundle. To see this, assume M ' S 1 × I is trivial,
then we can find two global sections f1 , f2 such that their images in M do not intersect. On
the other hand, as described above, f1 and f2 can be identified with two functions
fi : [0, 1] −→ [0, 1]
104
such that fi (1) = 1 − fi (0). Assume f2 (0) > f1 (0). Then f2 (1) < f1 (1). By Intermediate Value
Theorem, there must be some point t such that
f2 (t) = f1 (t).
π : S 3 −→ S 2
π : S 3 −→ S 2
(z1 , z2 ) 7−→ [z1 , z2 ]
For each point p ∈ S 2 , we have π −1 (p) ' S 1 . For example, consider the north pole [1, 0], then
S1 S3
π
S2
is a fiber bundle with fiber S 1 . This is called the “Hopf fibration”. This is a nontrivial fibration
since S 3 and S 2 × S 1 are topologically different. For example,
π1 (S 3 ) = 1
π (S 2 × S 1 ) = Z
1
i.e. any loop in S 3 can be shrinked continuously to a point, but a loop wrapping along the S 1
factor of S 2 × S 1 can not.
105
4.1.2 Vector Bundle
Definition 4.1.7. A real vector bundle, or simply vector bundle, is a fiber bundle E with
fiber F = Rm and structure group G ⊂ GLm (R). The integer m is called the rank of the vector
bundle, and we denote
rank(E) = m.
Similarly, a complex vector bundle of rank m is a fiber bundle with fiber Cm and structure
group G ⊂ GLm (C).
One important fact about vector bundle is that each fiber π −1 (x) for x ∈ B is a linear
vector space, so we can add elements of π −1 (x) and multiply by a scalar. In fact, let {Uα , ϕα }
be a trivialization of the vector bundle E. Assume x ∈ Uα . Then we can use ϕα to identify
ϕα : π −1 (x) −→ Rm .
λ1 s1 + λ2 s2 := ϕ−1
α (λ1 ϕα (s1 ) + λ2 ϕα (s2 ))
where λi ∈ R, s1 ∈ π −1 (x). It does not depend on the choice of the local trivialization. Let
x ∈ Uβ and
ϕβ : π −1 (x) −→ Rm
be another identification of the fiber π −1 (x) via a different trivialization ϕβ . Then the transition
φ−1
Rm π −1 (x)
α
φβ
g
Rm
i.e.
−1
ϕ−1
α (λ1 ϕα (s1 ) + λ2 ϕα (s2 )) = ϕβ (λ1 ϕβ (s1 ) + λ2 ϕβ (s2 )) .
This implies the linear operation λ1 ϕα (s1 ) + λ2 ϕα (s2 ) is intrinsically defined on the fiber of a
vector bundle, and it does not depend on the choice of the local trivialization.
Similarly, let U ⊂ B be an open subset. Then the space of sections of the vector bundle on
U has a structure of C ∞ (U )-module:
Γ(U, E) is a C ∞ (U ) − module.
For any two sections s1 , s2 ∈ Γ(U, E) and any two functions f1 , f2 ∈ C ∞ (U ), we have
s = f1 s1 + f2 s2 ∈ Γ(U, E)
106
by defining the value of s(x) at each x ∈ U via
s multiply fs
by a function f
π
Definition 4.1.8. Let E → B be a vector bundle of rank m, and U ⊂ B be an open subset. A
set of m sections {s1 , · · · , sm } of E over U is said to be a frame of E over U if
for each x ∈ U.
Example 4.1.9 (Tangent bundle). Let X be a smooth manifold of dim = n. We can define
the tangent bundle which has rank n as follows. Set-theoretically, T X is the union
[
TX = Tp X
p∈X
107
Tp X
Local trivializations of T X can be constructed with the help of local coordinates. Let
x1 , · · ·
, xn be a local coordinate system on an open subset U ⊂ X. Then we have a frame of
T X over U by the local vector fields
∂ ∂
1
,··· , n on U.
∂x ∂x
A section V of T X over U is the same as a vector field on U, which can be expanded via the
frame as
X
n
∂
V = V i (x) , x ∈ U.
∂xi
i=1
V (x)
X
If y 1 , · · · , y n is another choice of local coordinates on U′ , then we have the coordinate
transformation
(x1 , · · · , xn ) 7−→ (y 1 , · · · , y n ) on U ∩ U′ .
It gives rise to a linear transformation of the frame via the chain rule
∂ X ∂y j ∂
n
= .
∂xi ∂xi ∂y j
j=1
∂y j
In particular, the matrix ∂xi
is precisely the transition map between two local trivializations
of T X. We can expand the same vector field V in both coordinates on the intersection U ∩ U′
X
n
∂ i
V = V i (x) in x coordinates
∂xi
i=1
X
n
∂ j
V = Ve j (y) j in y coordinates
∂y
j=1
108
Then their coefficients are related via the transformation rule of the frame by
X
n
∂y j
Ve j (y(x)) = V i (x) .
∂xi
i=1
p Tp S 2
~r ∈ S2
T S 2 is a nontrivial vector bundle: there can not exist a global frame on S 2 . In fact, Hairy Ball
Theorem says that any global vector field on S 2 must vanish at some point, thus can not be
part of a frame everywhere on S 2 .
Definition 4.1.11. A principal G-bundle is a fiber bundle with fiber F = G being a Lie
group G. The structure group is a subgroup of G and its action on F is given by the left
multiplication
G × F (= G) −→ F (= G)
(g, h) 7−→ g · h
G P
π
The fact that G admits both left and right G-action allows us to define a fiberwise right G-action
on P
P × G −→ P
(p, g) 7−→ p · g
109
as follows. Let b ∈ B and ϕα be a local trivialization. Then ϕα gives a diffeomorphism
∼
ϕα : π −1 (b) −
→ G.
p · g := ϕ−1
α (ϕα (p) · g)
under the above identification. This does not depend on the choice of the local trivialization.
Assume we have another trivialization ϕβ which gives a different diffeomorphism
∼
ϕβ : π −1 (b) −
→ G.
ϕαβ (g) = t · g.
t·(−)
G G
φβ φα
π −1 (b)
ϕαβ (ϕβ (p) · g) = t · (ϕβ (p) · g) = (t · ϕβ (p)) · g = ϕαβ (ϕβ (p)) · g = ϕα (p) · g,
which implies
−1
ϕ−1
α (ϕα (p) · g) = ϕβ (ϕβ (p) · g).
This says that the right action p · g is well-defined and does not depend on the choice of local
trivializations.
Note that the right G-action on itself is transitive and free. Therefore each fiber π −1 (b) is
a single G-orbit. The fibration
π : P −→ B
B = P /G
S1 S3
S2
110
can be realized as a principal S 1 -bundle. Let us identify
n o
S 1 = eiθ = {λ ∈ C| | λ| = 1} ,
S 3 = (z1 , z2 ) ∈ C2 | |z1 |2 + |z2 |2 = 1 .
We define a right S 1 -action (Since S 1 is an abelian group, left and right actions are essentially
the same) on S 3
S 3 × S 1 −→ S 3
(z1 , z2 ), e iθ
7−→ (z1 eiθ , z2 eiθ )
S 3 /S 1 ' CP 1 = S 2
To emphasize the role of a group, we will use U (1) for the group of unitary complex numbers
U (1) = {z ∈ C | |z| = 1} .
This is an abelian Lie group. As we will see, electromagnetism is about the geometry of U (1)-
principal bundles, hence usually called abelian gauge theory. In general, Yang-Mills theory
generalizes Maxwell theory to principal G-bundles, hence is about non-abelian gauge theory.
U (1) P
B
Geometrically, this is a family of circles
111
We already know that points of a fiber of P are related to each other via the (right)
U (1)-action. The essential geometric idea underlying the notion of connection is to be able
to transport points of a fiber of P to another fiber along a path in B, so different fibers can
communicate and compare with each other.
B
x0 γ x1
In other words, we need a notion to help lifting a path from the base B to P horizontally.
The U (1)-action on P defines a vector field on P along the fiber direction via its infinitesimal
transformation. Let us parametrize U (1) by
n o
U (1) = eiθ .
Given a point x ∈ P and eiθ ∈ U (1), the transformed point will be simply denoted by
x · eiθ ∈ P . This defines a curve γx
γx : (−ε, ε) −→ P
θ 7−→ x · eiθ
This construction applies to all points of P , and gives rise to a vector field on P , denoted by
Vθ . Explicitly,
d
Vθ (x) := (x · eiθ ) for x ∈ P.
dθ θ=0
The vector field Vθ points along the fiber direction of P , or in formula this means
π∗ (Vθ ) = 0.
112
Vθ
Moreover, if you follow the flow of the vector field Vθ , then it will circle around the fiber and
come back to the start point after time 2π.
Remark 4.2.1. In general for a principal G-bundle P , every element of the Lie algebra g of G
leads to a vector field on P via the infinitesimal right G-action. In this case we have a map
g −→ Vect(P ).
Condition ⃝
2 says that for any eiθ ∈ U (1), let us define a diffeomorphism
f : P −→ P
x 7−→ x · eiθ
where LVθ is the Lie derivative with respect to the vector field Vθ .
Example 4.2.3. Assume P = B × U (1) is trivial. Let xi denote local coordinates on B and
θ denote the angle coordinate on U (1) as above. Then
∂
Vθ = .
∂θ
An arbitrary 1-form A can be expressed in coordinates by
X
A = Aθ (x, θ)dθ + Ai (x, θ)dxi .
i
113
We have
ιV A = Aθ
θ
L A = (∂ A )dθ + P (∂ A )dxi
Vθ θ θ i θ i
For a general U (1)-bundle, we can describe the 1-form A in terms of a local trivialization
φ
π −1 (Uα ) Uα × U (1)
Uα
Then a local coordinate system xi on Uα and the angle coordinate θ on U (1) define a local
coordinate system on π −1 (Uα ) via the map ϕ. In such coordinates, the connection A will again
take the form locally
X
(ϕ−1 )∗ A = dθ + Ai (x)dxi
i
A connection 1-form A allows us to lift a tangent vector from the base manifold to the
total space “Horizontally”.
ιv A = 0.
Hq = {v ∈ Tq P | ιv A = 0} .
Tq P = Hq ⊕ RVθ .
114
Vθ
q Hq
x Tx B
In local coordinates xi , θ from a local trivialization as above, with A expressed as
X
dθ + Ai (x)dxi ,
i
π∗ : Hq −→ Tx B, where x = π(q)
Definition 4.2.5. Let q ∈ P , x = π(q) ∈ B. We define the horizontal lift of a tangent vector
v ∈ Tx B at q to be the tangent vector ve ∈ Hq such that
π∗ (e
v ) = v.
ve
q
x v
115
4.2.4 Parallel Transport
e : [0, 1] −→ E
γ
q0 e
γ
q1
x0 γ x1 B
xi (t) = γ i (t),
e(t) is given by
then γ
e(t) = xi (t) = γ i (t), θ(t)
γ
dθ X dγ i
=− Ai (γ(t)).
dt dt
i
The existence and uniqueness follow from the standard theory in ordinary differential equations.
Definition 4.2.7. Given a curve γ : [0, 1] → B, with x0 = γ(0) and x1 = γ(1), we define the
parallel transport
Tγ : π −1 (x0 ) −→ π −1 (x1 )
by
e(1)
Tγ (q0 ) = γ
116
Proof: The fact that A is U (1)-invariant implies that horizontal vectors are preserved under the
e is a horizontal lift of γ. Then for any g ∈ U (1), the new curve
U (1)-action. Assume γ
eg (t) = γ
γ e(t) · g
eg (0) = γ
γ e(0) · g = q0 · g.
It follows that
Tγ (q0 · g) = γ
eg (1) = γ
e(1) · g = Tγ (q0 ) · g.
eg
γ
q0 e
γ
q1
x0 x1 B
γ
Now we consider the case when γ is a loop, i.e. γ(0) = γ(1) = x0 . Then for any q0 ∈
π −1 (x0 ), we have Tγ (q0 ) ∈ π −1 (x0 ). Therefore
Tγ (q0 ) = q0 · g
Holγ (q0 ).
q0 · g
q0
γ
x0
117
In the U (1)-case which is an abelian group, Holγ (q0 ) does not depend on the choice of q0
(Exercise: show this) and we will simply write Holγ ∈ U (1).
Holγ ∈ U (1)
describing the parallel transport action from the fiber π −1 (x0 ) to itself.
γ
x0 Holγ ∈ U (1)
The nontriviality of such holonomies indicate nontriviality of the U (1)-bundle via certain
twist or curving. We will make this precise in this section. The relevant geometric notion is
called the curvature.
FA = dA.
Since A is a 1-form on the total space P , the curvature FA is firstly defined as a 2-form on
P . However, FA is can be viewed in fact as a 2-form on B, which does not depend on the fiber
direction. To see this, let us choose a local trivialization over an open U ⊂ B
π −1 (U) U × U (1)
U
with local coordinates {xi } on the base U and angle coordinate θ on the fiber U (1). The
connection A will take the form
X
A = dθ + Ai (x)dxi .
i
118
which is clearly a 2-form on the base.
Note that this expression is independent of the choice of local trivialization. In fact, suppose
we have a different local trivialization
π −1 (U)
φ
U × U (1) U × U (1)
∈
∈
e
(x, eiθ ) (x, eiθ )
e
eiθ = eiϕ(x) eiθ
e
for some function φ(x). Then in (x̃, θ)-coordinate,
X X
A = dθ + Ai (x)dxi = dθe + dφ(x) + Ai (x)dxi .
i i
The curvature
!
X X
F = dA = d dθe + dφ(x) + Ai (x)dx i
= (∂i Aj ) dxi ∧ dxj
i i,j
π ∗ : Ω• (B) → Ω• (P )
for a fiber bundle is an injective map, and identifies forms on B as a subspace of forms on
P (such forms are called basic forms in fiber bundle geometry). Then the claim is that the
curvature F lies in the image of this map. This explains the above local calculation.
119
Remark 4.3.2. The curvature takes the simple form F = dA since U (1) is an abelian Lie group.
In general, a connection on a principal G-bundle is a g-valued 1-form A ∈ Ω1 (P, g). The
curvature will take the form
1
F = dA + [A, A]
2
where [−, −] denotes the Lie bracket on the Lie algebra g of G. In the abelian U (1)-case,
[−, −] = 0. In the G-bundle case, such defined curvature form will again descend to the base B.
Now we explain the relation between the holonomy and curvature. Let γ be a loop on B
which bounds a surface Σ. Here γ could be piece-wise smooth, and Σ may not be a disk but
could have non-trivial topology.
γ
Σ
γ = ∂Σ
x0
Σα
For each small Σα , let γα = ∂Σα be its boundary loop. For any path in the interior, it is part
of two different small loops with opposite direction. The corresponding parallel transport will
cancel each other. It follows that
Y
Holγ = Holγα .
α
120
On the other hand, ˆ Xˆ
F = F.
Σ α Σα
for each small Σα . Thus we assume Σ is small enough and lies inside an open U ⊂ B with a
local trivialization
π −1 (U) U × U (1)
U
i
Let x denote local coordinates on U, and θ denote the angle coordinate on the fiber. The
connection 1-form is of the form
X
dθ + α where α= Ai (x)dxi .
i
Let e is parametrized by xi (t), θ(t)
xi (t) parametrize the curve γ in B. Its lifting γ
where θ(t) satisfies the flow equation (Horizontal condition)
dθ X dxi (t)
+ Ai (x(t)) =0
dt dt
i
dF = 0.
It may not be an exact form on B (although it is written as F = dA, but A is not a 1-form
on B but a 1-form on P ). Thus F defines a de Rham cohomology class on B which captures
topological information of the bundle.
Let us first understand how the curvature depends on the choice of the connection. Assume
e
A, A are two connection 1-forms. Let
e−A
α=A
121
which is a 1-form on P . We claim that α is a pull-back of a 1-form form B. In fact, in local
coordinates of a local trivialization, we can write
X
A = dθ + Ai (x)dxi
i
X
e = dθ +
A ei (x)dxi
A
i
e−A = P A
Then α = A ei (x) − Ai (x) dxi which clearly depends only on the base. So we will
i
write
α ∈ Ω1 (B).
Remark 4.3.4. The above computation says that the space of connections on P is an affine space
{connections on P} = A0 + Ω1 (B)
FAe = FA + dα,
[F ] ∈ H 2 (B)
depends only on the bundle P , but not on the choice of the connection.
Definition 4.3.5. The (first) Chern class of the U (1)-principal bundle π : P → B is the de
Rham cohomology class
1
c1 (P ) := F ∈ H 2 (B).
2π
One important property of c1 (P ) is that it is an integral class. More precisely, we have
Proof: We can treat Σ as having the boundary with a trivial constant loop γ. Clearly we have
Holγ = 1. By Theorem 4.3.3
´ ˆ
−i
e Σ F
= Holγ = 1 =⇒ F ∈ 2πZ.
Σ
122
Σ
x0
γ: constant loop mapped to x0
c1 (P ) = 0 ∈ H 2 (B).
A = dθ.
Then F = dA = 0.
then such bundle can not be trivial. Dirac monopole is such an example (see Example 4.4.5).
Let
U (1) P
π
B
be a principal U (1)-bundle. We first give an equivalent description of local trivializations in
terms of local sections.
ϕ : π −1 (U ) → U × U (1) σ ∈ Γ(U, P )
123
Proof: Let σ ∈ Γ(U, P ). Then it defines a diffeomorphism
σ
ϕ S1
1
U
π
ϕ : π −1 (U) −→ U × U (1)
for some Ai (x). To describe the meaning of Ai (x), consider the section σ : U → P . Let
Aσ := σ ∗ A
124
We have
! !
X ∗
X
∗ ∗ ∗
Aσ = σ A = σ ϕ dθ + Ai (x)dx i
= (ϕ ◦ σ) dθ + Ai (x)dx i
.
i i
Since
ϕ ◦ σ : U −→ U × U (1)
x 7−→ (x, 1)
It follows that
X
Aσ = Ai (x)dxi .
i
In other words, the information of Ai (x) in the chosen local trivialization is precisely the pull-
back of A to the base via the corresponding local section.
We next describe how Aσ depends on the choice of local trivializations. Let
e : π −1 (U) → U × U (1)
ϕ
such that
e(x) = σ(x) · eiα(x) ,
σ ∀x ∈ U.
e(x) · e−iα(x) g.
σ(x) · g = σ
π −1 (U)
φ e
φ
e −1
φ◦φ
U × U (1) U × U (1)
∈
∈
Let
Aσ = σ ∗ A = P Ai (x)dxi
i
Aeσ = σ P ei (x)dxi
e∗ A = i A
be local descriptions of A with respect to the corresponding trivialization. By construction,
under the diffeomorphism
e −1
φ◦φ
U × U (1) −−−−→ U × U (1),
we have !
X X
−1 ∗ ei (x)dxi
e◦ϕ
ϕ dθ + A = dθ + Ai (x)dxi .
i i
125
On the other hand, since
e ◦ ϕ−1 : (x, eiθ ) 7−→ x, e−iα(x) eiθ ,
ϕ
we have explicitly
!
∗ X X X
e ◦ ϕ−1
ϕ dθ + ei (x)dxi
A = d (θ − α(x)) + ei (x)dxi = dθ − dα(x) +
A ei (x)dxi .
A
i i i
Thus we find
X X
ei (x)dxi =
A Ai (x)dxi + dα(x).
i i
To summarize, we have proved the following
e = σ · eiα
σ
for some eiα : U → U (1). Then the two local descriptions Aσ = σ ∗ A and Aσe = σ
e∗ A are related
by
Aσe = Aσ + dα.
Definition 4.4.3. Aσ is called the local gauge 1-form of A with respect to the trivialization
σ. The above change of Aσ via different local trivializations is called “local gauge transfor-
mations”.
ϕα : π −1 (Uα ) −→ Uα × U (1)
such that
π −1 (Uαβ )
φα φβ
126
(x, g) = (x, ϕαγ (x)ϕγβ (x)ϕβα (x)g)
∈
Uαβγ × U (1)
φα
π −1 (Uαβγ )
φβ φγ
Here Uαβγ = Uα ∩ Uβ ∩ Uγ .
In fact, any collection of maps
ϕβα : Uα ∩ Uβ −→ U (1)
satisfying condition ⃝
1 and ⃝
2 defines a principal U (1)-bundle P by the quotient
!,
a
P = Uα × U (1) ∼
α
Uα × U (1) Uβ × U (1)
for x ∈ Uα ∩ Uβ . We leave the details to the reader to check that this is indeed a principal
U (1)-bundle.
Now let us reformulate the data of local trivializations {ϕα } and transition functions {ϕαβ }.
By the proposition above, each ϕα corresponds to a local section σα : U → π −1 (U) by
σα (x) = ϕ−1
α (x, 1).
−1
σβ (x) = ϕ−1 −1 −1
α ◦ ϕα ◦ ϕβ (x, 1) = ϕα (x, ϕαβ (x)) = ϕα (x, 1)ϕαβ (x) = σα (x)ϕαβ (x),
i.e. we have
σβ = σα ϕαβ on Uα ∩ Uβ .
127
On the intersection Uα ∩ Uβ , we have
σβ = σα ϕαβ on Uα ∩ Uβ .
Aβ = Aα + dχαβ .
A {Aα = σα∗ A}
Example 4.4.5 (Dirac Monopole). Consider the unit sphere S 2 ⊂ R3 with spherical coordinates
(θ, ϕ).
z
(θ, ϕ)
y
ϕ
We consider a cover of S 2 by
U+ = S 2 − {(0, 0, −1)}
U− = S 2 − {(0, 0, 1)}
U+ U−
128
Each U± is contractible, hence the fiber bundle can be trivialized on U± .
Consider the following 1-form
n
A± = ± (1 ∓ cos θ) dϕ on U± .
2
Here n is a constant to be determined. On U+ ∩ U− ,
A+ − A− = ndϕ.
If we ask {A+ , A− } to define a connection 1-form of some principal U (1)-bundle, then the
transition function on U+ ∩ U− is
U+ ∩ U− −→ U (1)
(θ, ϕ) 7−→ einφ
Since ϕ is defined modulo 2π, this map is well-defined if and only if n ∈ Z is an integer.
In other words, for each integer n ∈ Z, the transition function
eiχ : U+ ∩ U− −→ U (1)
(θ, ϕ) 7−→ einφ
defines a principal U (1)-bundle Pn on S 2 , and the collection {A± = ± n2 (1 ∓ cos θ) dϕ} defines
a connection on Pn . Then the curvature form is
n
F = dA+ (= dA− ) = sin θ dθ ∧ dϕ.
2
It is direct to compute ˆ
1
F = n.
2π S2
Hence the U (1)-bundle Pn is nontrivial for n 6= 0. Moreover, these U (1)-bundle {Pn }’s are all
topologically different since they have different Chern classes (Theorem 4.3.6).
such that
⃝
1 f is fiberwise, i.e. π ◦ f = π
129
f
P P
π π
B
⃝
2 f is U (1)-equivariant, i.e.
f (q · g) = f (q) · g, ∀q ∈ P, g ∈ U (1).
f : π −1 (x) −→ π −1 (x).
Since U (1) is abelian, this is simply given by a right action by an element of U (1). It follows
that such f can be equivalently described by
fe : B −→ U (1)
such that
f (q) = q · fe(x), ∀q ∈ π −1 (x).
f ∗ A ∈ Ω1 (P ).
which says
f∗ Vθ = Vθ .
It follows that
1 ιVθ f ∗ A = f ∗ (ιVθ A) = f ∗ (1) = 1
⃝
2 LVθ f ∗ A = f ∗ (LVθ A) = f ∗ (0) = 0.
⃝
130
We can also describe this explicitly in local coordinates via a local trivializations, with base
coordinates xi and U (1) angle coordinate θ. Then A can be written as
X
A = dθ + Ai (x)dxi .
i
fe : B −→ U (1)
Then
f (x, eiθ ) = (x, eiθ eiϕ(x) ),
so
X X
f ∗ A = d(θ + φ(x)) + Ai (x)dxi = dθ + Ai (x)dxi + dφ(x)
i i
which still takes the form of a connection. This proves the claim that f ∗ A is also a connection.
f ∗ A = A + π ∗ dφ.
Definition 4.5.4. Let Conn(P ) denote the space of connections on P . The natural action
Aut(P ) ↷ Conn(P )
Proposition 4.5.5. The curvature 2-form of the principal U (1)-bundle P is invariant under
gauge transformations.
Ff ∗ A = d(f ∗ A) = dA + d(dφ) = dA = FA .
Remark 4.5.6. In general for principal G-bundle, the curvature 2-form will transform via group
conjugation under gauge transformations. When G = U (1) is abelian, group conjugation is
trivial.
131
Therefore the curvature of U (1)-bundle can be viewed as a map
e = A + π ∗ α. The
In summary, any two different choices of connection differ by a 1-form on B: A
curvatures differ by the exact form dα
FAe = FA + dα.
Gauge transformations lead to change of the connection by α = dφ, hence leaving F invariant.
The formula of local and global gauge transformations look very similar. We clarify these
two notions to avoid possible confusions. In short,
• Local gauge transformation is about the different expression of the same connection via
different choices of local trivializations.
• Global gauge transformation is about the change from one connection to another, so
linking different connections.
However, these two notions are closely related when the bundle P is trivial, where we
have a trivialization over the full base B. Indeed, assume now P is trivial, then we have an
identification (Proposition 4.4.1)
fe : B −→ U (1)
such that
σ2 (x) = σ1 (x)fe(x), ∀x ∈ B.
σ2 = f ◦ σ1 .
f
P P
σ1 σ2
B
In particular, Aut(P ) acts on Γ(B, P ) freely and transitively in the case when P is trivial.
Let A be a connection on a trivial U (1)-bundle P over B. Let σ ∈ Γ(B, P ) be a global
section which gives a trivialization of P . Let f ∈ Aut(P ) be an automorphism of P which
corresponds to a map
fe : B −→ U (1)
x 7−→ eiϕ(x)
132
⃝
1 f gives a (global) gauge transformation
e = f ∗ A = A + π ∗ dφ.
A 7−→ A
⃝
2 f gives a different trivialization by the section
e =f ◦σ
σ ∈ Γ(B, P ).
1 and ⃝
Although the transformation formula in ⃝ 2 look similarly, they are different in nature.
⃝
1 is about gauge transformations, which changes one connection to another connection. ⃝
2 is
about local gauge transformation, where the connection is fixed but expressed on the base with
different trivializations. In the case when P is trivial, we have a (noncanonical) identification
In general, they are different and we hope it will not cause further confusion.
Now we discuss the case when P is nontrivial in general. Let us fix an open cover {Uα } of
B and local trivialization on each Uα by σα ∈ Γ(Uα , P ). Let A be a connection 1-form on P .
Then we can express A locally via the collection {σα } by
{Aα = σα∗ A} .
fe = eiϕ : B −→ U (1).
A 7−→ f ∗ A.
We can express the gauge transformation locally via the fixed trivialization {σα }
(f ∗ A)α = (f ◦ σα )∗ A = Aα + dφ.
Thus a gauge transformation is equivalently described with respect to the fixed trivialization
{σα } by a transformation of collections
f
{Aα } 7−→ {Aα + dφ} , where dφ is the same expression in all α
for fe = eiϕ as above. Again, the gauge transformation preserves the curvature form
f
{Fα = dAα } 7−→ {(f ∗ F )α = dAα + d(dφ) = dAα } .
133
4.6 Maxwell Theory as U (1)-gauge Theory
We are now ready to explain electromagnetism in terms of U (1)-gauge theory.
B = R3,1 .
∼
P −→ R3,1 × U (1).
Since σ will be fixed, we will simply write the gauge 1-form as A = Aσ . In spacetime coordinates
{x, y, z, t} on R3,1 ,
A = At dt + Ax dx + Ay dy + Az dz.
Let eiχ : R3,1 → U (1). It gives a gauge transformation of the connection 1-form, hence the
(local) gauge 1-form, via
A 7−→ A + dχ.
F = dA
collects the components of electric and magnetic fields. It takes the form that we have seen
134
4.6.3 Maxwell Action
Here d∗ = ∗d∗ is the adjoint of d. These equations can be derived from the action principle as
follows.
If we write it as ˆ
SM [A] = L (A) dx ∧ dy ∧ dz ∧ cdt,
R3,1
where L (A) is the Lagrangian density, then a direct computation gives
ε0 ~ ~ 1 ~ ~
~ · ~j .
L = E ·E− B · B + −ρφ + A
2 2µ0
Now we consider the variation of SM [A] under an arbitrary variation of the connection
A −→ A + δA
135
4.6.4 Gauge Principle and Charge Conservation
Gauge principle asks for invariance of physical quantities under gauge transformations. Let
us analyze the Maxwell action under gauge transformations.
Consider a gauge transformation described by
χ can be viewed as a function on R3,1 defined modulo 2π. Since the topology of R3,1 is trivial,
we can actually lift χ to be a single-valued function on R3,1 , and we assume this.
Under the gauge transformation generated by eiχ , the gauge 1-form transforms as
A 7−→ A + dχ
and the curvature F is invariant. It follows that the Maxwell action transforms as
ˆ ˆ
1 1
SM [A + dχ] = SM [A] + ∗J ∧ dχ = SM [A] + d (∗J) ∧ χ.
µ0 R3,1 µ0 R3,1
If we require that SM should be invariant under arbitrary gauge transformations, then we need
d(∗J) = 0.
This is precisely the charge conservation. In other words, charge conservation can be viewed as
a direct consequence of gauge principle. Assume charge conservation, then the Maxwell action
defines a functional
SM : Conn(P )/ Aut(P ) −→ R.
Maxwell’s equations
dF = 0
d(∗F ) = ∗J
is based on the assumption (in known experiments) that magnetic monopole does not exist. In
theory, the full Maxwell’s equations are
dF = ∗Jm
d(∗F ) = ∗J
e
where Je is the electric charge-current 1-form, and Jm is the magnetic charge-current 1-form.
It exhibits the full electromagnetic duality
F ←→ ∗F
Jm ←→ Je
Jm = 0 on R3,1 − M
136
and on the region R3,1 − M we have
dF = 0
on R3,1 − M.
d(∗F ) = ∗J
e
where Rt denotes the real line parameterized by time t. In this case we are led to consider
U (1)-gauge theory on
R3,1 − M = R3 − {0} × Rt .
(r, θ, ϕ)
r
θ
y
ϕ
Then R3,1 is parametrized by (r, θ, ϕ, t) and R3,1 − M corresponds to the locus where r > 0.
Now we choose a covering of R3,1 − M by
U+ = R3 \ {(0, 0, z) | z ≤ 0} × Rt
U− = R3 \ {(0, 0, z) | z ≥ 0} × Rt
z≥0
z≤0
U+ U−
137
Each U± is contractible and so a principal U (1)-bundle can be trivialized on the cover {U+ , U− }.
The local gauge 1-form of the Dirac magnetic monopole is described in the local trivialization
by
n n
A± = (−ydx + xdy) = ± (1 ∓ cos θ)dϕ on U± .
2r(z ± r) 2
On the intersection U+ ∩ U− , we have
A+ − A− = ndϕ on U+ ∩ U−
einφ : U+ ∩ U− −→ U (1).
This transition function defines a principal U (1)-bundle Pn on R3,1 − M (see Example 4.4.5).
The curvature form is
n
F = dA+ (= dA− ) = sin θdθ ∧ dϕ.
2
Let Sr2 denote a sphere of radius r in R3 . Then
ˆ ˆ
1
c1 (Pn ) = F = n ∈ Z.
Sr2 2π Sr2
which describes the magnetic flux over S 2 . It computes the total magnetic charge surrounded
by the sphere Sr2 .
Let us start with a general discussion of the construction of vector bundles associated to a
principal G-bundle. Let
G P
π
B
be a principal G-bundle. Let
ρ : G −→ End(V )
P ×ρ V := P × V / ∼
138
where the equivalence relation ∼ is
(p · g, v) ∼ (p, g · v), ∀p ∈ P, v ∈ V, g ∈ G.
Here p · g is the right g-action on P , and g · v is the left g-action on V with respect to the
representation ρ
g · v := ρ(g)(v).
P ×ρ V 3 (p, v)
πρ
B 3 π(p)
Proof: Let
φ
π −1 (U) U×G
U
be a local trivialization of P over an open U ⊂ B. Then it induces
π −1 (U) ×G V (U × G) ×G V U×V
U
which defines a local trivialization of P ×G V over U with fiber = V . It can be checked that the
transition functions are linear transformations on V , with the help of the representation map
ρ : G → End(V ).
Definition 4.7.2. P ×ρ V is called the associated vector bundle of the principal G-bundle
P with respect to the G-representation ρ on V .
The associated vector bundle is a complex vector bundle of rank = 1, i.e. a complex line bundle.
139
For any point x ∈ B, the fiber πρ−1
n
(x) is a one dimensional complex vector space. We can
define a Hermitian inner product on each finer πρ−1
n
(x) as follows. Let v ∈ πρ−1
n
(x), which is
represented by a pair (p, z) for p ∈ π −1 (x), z ∈ C. Then we define its norm by
|v|2 := |z|2 .
|ψ|2 ∈ C∞ (B).
As we will discuss below, the wave function of particle in quantum mechanics will be a section
of a complex line bundle L as above, and |ψ|2 plays the role of probability distribution of the
quantum particle.
We know that we can take derivatives on functions. We would like to extend such notion
of derivative on sections of vector bundles. We will focus on the case of the complex line bundle
Ln arising from the principal U (1)-bundle P and the U (1)-representation ρn on C as above.
We explain that a connection 1-form A on P will allow us to define derivatives of sections of all
the associated bundles.
Precisely, let s ∈ Γ(B, Ln ) denote a section of Ln on B, and V ∈ Vect(B) be a vector field
on B. We will define a notion of covariant derivative of s with respect to the vector field V ,
denoted by ∇V s, which is again a section ∇V s ∈ Γ(B, Ln ) as follows. Let U ⊂ B be an open
subset, with a local trivialization of P defined by a local section
σ ∈ Γ(U, P ).
Here f (x) is a complex valued function on U. This representation clearly depends on the choice
of σ. Now we define the section ∇V s whose value on U is represented by
∇V s = (σ, ∂V f + in ιV (Aσ )f ) .
Here Aσ = σ ∗ A is the local gauge 1-form with respect to the choice σ. Note that since f is a
function, ∂V f is the usual derivative with respect to V . We can write it in a compact form
140
where df + inAσ f is a 1-form on U.
We check that such ∇V s is well-defined, i.e. does not depend on the choice of σ. Assume
e ∈ Γ(U, P ). Then
we have another section σ
e(x) = σ(x)eiϕ(x) ,
σ x ∈ U,
where
eiϕ(x) : U −→ U (1).
σ (x), fe(x)), which should represent the same element s(x), we find
Now since (σ(x), f (x)) ∼ (e
On the other hand, the local gauge 1-form Aσe is related to Aσ by a local transformation
Aσe = Aσ + dφ.
Indeed, from
dfe + inAσe fe = d e−inϕ f + in (Aσ + dφ) e−inϕ f = e−inϕ (df + inAσ f ) .
We see that
e, ιV dfe + inAσe fe = σ
σ e, e−inϕ ιV (df + inAσ f ) ∼ σee−iϕ , ιV (df + inAσ f )
This shows that ∇V s is well-defined and does not depend on the choice of local trivialization.
In our previous study of electromagnetism, the electric and magnetic fields are of central
role, while the scalar and vector potentials are introduced as an auxiliary object to help solve
Maxwell’s equations. However in quantum mechanics, the situation is completely changed and
the use of gauge potential is essential. We briefly mention some key constructions.
141
As we have discussed before, the Maxwell theory of electromagnetism can be viewed as a
U (1)-gauge theory
connection ←→ potentials
curvature ←→ electromagnetic fields
In quantum mechanics, particles are described by wave functions ψ, where |ψ|2 describes the
probability distribution of the quantum particle. In an electromagnetic background, a particle
of electric charge n will be described by a section
s ∈ Γ(R3,1 , Ln )
of the complex line bundle Ln associated to the principal U (1)-bundle P of Maxwell theory. It
seems unnecessary to introduce U (1)-bundle here since the topology of R3,1 is trivial. However,
when the space-time encodes nontrivial topology, this description will be essential. As we
have seen before, the norm |s|2 is still well-defined, with the physical meaning of probability
amplitude. With respect to a choice of section σ on R3,1 , we can represent the section as
s = (σ, ψ)
ψ −→ einψ
A = σ ∗ A = −φdt + Ax dx + Ay dy + Az dz
∇x ψ = ∂x ψ + inAx ψ.
For the Schrödinger equation governing the wave function ψ, we simply need to replace every
derivative such as ∂x , ∂y , ∂z by ∇x = ∂x + inAx , ∇y = ∂y + inAy , ∇z = ∂z + inAz . For example,
the Hamiltonian H is 2
1 X ∂
H =− + inAi + V (r).
2m ∂xi
i
This explains the basic principle on how gauge field is coupled with matter field.
142
Chapter 5 Electromagnetism and Special
Relativity
The theory of special relativity originates from electromagnetism. The work of Hendrik
Lorentz through his study of electromagnetism plays a fundamental role. In this chapter, we
will study basic aspects of the coherence of electromagnetism with respect to special relativity.
We will see in particular how Lorentz transformations preserve the form of Maxwell’s equations.
xµ = (ct, x, y, z) µ = 0, 1, 2, 3.
where
1 0 0 0
0 −1 0
0
η=
0 0 −1 0
0 0 0 −1
is called the Minkowski metric. The space R3,1 equipped with the Minkowski metric is called
Minkowski spacetime.
In coordinates, if
X
Λ : xµ 7−→ x
eµ = Λ µ ν xν ,
ν
143
then the 4 × 4 matrix Λµ ν gives a Lorentz transformation if it obeys
X
ηµν Λµ ρ Λν σ = ηρσ , ∀ρ, σ = 0, 1, 2, 3
µ,ν
ΛT ηΛ = η, ΛT = transpose of Λ.
Example 5.1.3 (Lorentz Boost). A Lorentz boost along x-direction with velocity v is the linear
transformation
ct − xv/c
cet= p
1 − v 2 /c2
x − vt
xe= p
1 − v 2 /c2
ye = y
ze = z
(ce
t )2 − x
e2 − ye2 − ze2 = (ct)2 − x2 − y 2 − z 2 .
where γ = √ 1
. We can view this as a hyperbolic rotation in the (x0 , x1 )-plane
1−v 2 /c2
cosh α − sinh α 0 0
− sinh α cosh α 0 0
Λ= .
0 0 1 0
0 0 0 1
144
Observe that the relation
ΛT ηΛ = η
implies
det(Λ)2 = 1 =⇒ det(Λ) = ±1.
Any tensor fields transform naturally under Lorentz transformations. For example, let
Λ : R3,1 −→ R3,1 .
P
Let α = µ αµ (x)dx
µ be a 1-form on R3,1 . Then α is transformed via pull-back by Λ as
α 7−→ Λ∗ (α).
Remark 5.1.5. Strictly speaking, our convention of the transformation α → Λ∗ (α) defines a
right action of Lorentz transformation on differential forms. Indeed, we have
We could as well define α → (Λ−1 )∗ (α) for a left action. However, since we will not discuss
composition of Lorentz transformations, it does not really matter. We will keep the above
convention, which is convenient for differential forms, to simplify the presentation. It should be
straight-forward to relate different conventions.
In coordinates, if
Λ : R3,1 −→ R3,1
X
xµ 7−→ x
eµ = Λ µ ν xν
ν
145
Expanding the relation α̃ = Λ∗ (α) which says
X X
x)Λµ ν dxν =
αµ (e eµ (x)dxµ ,
α
µ,ν µ
For simplicity, we will use Einstein summation convention, where repeated upper and lower
indices refer to summation. Then under the Lorentz transform,
xµ 7−→ x
eµ = Λµ ν xν .
α 7−→ α
e=α
eµ (x)dxµ ,
where
eµ (x) = αν (e
α x)Λν µ .
Similarly, for a vector field V = V µ (x) ∂x∂ µ , we define its transformation under Λ by
∂ ∂xν ∂
−1 ν ∂
V 7−→ V µ (e
x) = V µ
(e
x ) = V µ
(e
x) Λ µ ∂xν
.
xµ
∂e xµ ∂xν
∂e
Let us denote transformed vector field of V by Ṽ
∂ ∂
V = V µ (x) 7−→ Ve = Ve µ (x) µ .
∂xµ ∂x
Then comparing the above two expressions, we find the transformation rule as
µ µ
Ve µ (x) = V ν (e
x) Λ−1 ν
= V ν (Λx) Λ−1 ν
.
Remark 5.1.6. In geometric term, this transformation of a vector field is the push-forward
V 7−→ Ve = (Λ−1 )∗ V.
This can be further simplified using the Minkowski metric ηµν . We can use η to turn a
form into a vector field, and vice versa. This is the standard rule to raise or lower tensor indices
via η. For example, we can construct
Vµ = ηµν V ν
αµ = η µν αν
146
is the inverse matrix of η.
1 0 0 0 1 0 0 0
0 −1 0 0 0 −1 0 0
η= , η −1 = .
0 0 −1 0 0 0 −1 0
0 0 0 −1 0 0 0 −1
In general, for an arbitrary tensor field T with components T µ1 ···µk ν1 ···νm , we can raise and
lower their indices by
′ ′ ′ ′
Tµ1 ···µk ν1 ···νm = ηµ1 µ′1 · · · ηµk µ′k η ν1 ν1 · · · η νm νm T µ1 ···µk ν1′ ···νm
′ .
ΛT ηΛ = η, or ηµν Λµ ρ Λν σ = ηρσ .
Warning: Be careful about the position of indices. (Λν µ ) is the inverse matrix of (Λµ ν ).
As a result, we can write the Lorentz transformation of a vector field by
T 7−→ Te
This is the transformation rule for general tensor fields. Note that this is consistent if we raise
or lower some indices of Te (Exercise: Check this).
147
5.1.3 Invariance of Inner Contraction
We can change the type of a tensor field by contracting their indices. For example, consider
a tensor field
∂
T = T µν ⊗ dxν .
∂xµ
Then we can get a scalar (function) by
f (x) = T µ µ (x).
fe(x) = f (e
x).
Since
Teµ µ (x) = T µ ν ′ (e
′ ′ ′ ′
x)Λµ′ µ Λν µ = T µ ν ′ (e
x)δµ′ ν = T µ µ (e
x),
This consistency follows essentially from the invariance of Minkowski metric under Lorentz
transformations.
We first describe how electromagnetic fields change under Lorentz transformations. Let
Λ : R3,1 −→ R3,1
Λ : xµ 7−→ x
eµ = Λµ ν xν .
148
The tensor components Fµν are expressed in matrix form
0 −Ex /c −Ey /c −Ez /c ← µ = 0
E /c B −B
x 0 z y 1
Fµν =
Ey /c −Bz 0 Bx 2
Ez /c By −Bx 0 3
↑
ν=0 1 2 3
This allows us to read off the transformation of electric and magnetic fields via components.
Example 5.2.1 (Lorentz boost). Consider the Lorentz boost Λ in the x-direction
γ −γv/c 0 0
−γv/c 0 0
γ
Λ=
0 0 1 0
0 0 0 1
where γ = √ 1
. Then
1−v 2 /c2
γ −γv/c 0 0 −Ex /c −Ey /c −Ez /c
0 γ −γv/c 0 0
−γv/c 0 0 −By 0 0
γ Ex /c 0 Bz −γv/c γ
Feµν =
0 0
1 0 Ey /c −Bz 0
Bx 0 0 1 0
0 0 0 1 Ez /c By −Bx 0 0 0 0 1
0 −Ex /c −γEy /c − γBz v/c −γEz /c + γBy v/c
Ex /c γEy v/c2 + γBz γEz v/c2 − γBy
0
= .
γEy /c + γBz v/c −γEy v/c2 − γBz 0 Bx
γEz /c − γBy v/c −γEz v/c2 + γBy −Bx 0
It reads in components as
Ee x = Ex
e y = γ (Ey + vBz )
E
Ee z = γ (Ez − vBy )
e x = Bx
B
e y = γ By − vEz /c2
B
B
e z = γ Bz + vEy /c2
where fields on the left hand side are evaluated at point x and fields on the right hand side are
e = Λx.
evaluated at the point x
One important observation is that Lorentz transformation will mix electric and magnetic
fields in general.
149
5.2.2 Transformation of Charge-Current Density
The charge and current densities are organized into a 1-form on R3,1
Under the Lorentz transformation Λ : R3,1 → R3,1 , it transforms J = Jµ dxµ to Je = Jeµ dxµ by
Je = Λ∗ (J)
or in components,
Jeµ (x) = Jρ (e
x)Λρ µ = Jρ (Λx)Λρ µ .
we find
ρe = γ ρ + jx v/c2
ejx = γ (jx + ρv)
ejy = jy
ejz = jz
Again, fields on the left hand side are evaluated at point x and fields on the right hand side are
e = Λx.
evaluated at the point x
To understand the Lorentz transformation of Maxwell’s equations, we need to know how the
Hodge star ? intertwines with the Lorentz transformation. Note that we use the notation ? for
Hodge star in this subsection, in order to distinguish with the ∗-symbol in pull-backs.
Proposition 5.2.3. Let Λ : R3,1 → R3,1 be a proper Lorentz transformation. Then for any
differential form α ∈ Ω• (R3,1 ), we have
In other words, the Hodge star ? commutes with the pull-back via Λ.
150
Proof: It is enough to check on the basis
1, dxµ , dxµ ∧ dxν , dxµ ∧ dxν ∧ dxρ , dx0 ∧ dx1 ∧ dx2 ∧ dx3 .
and
?Λ∗ (1) = ?1 = dx0 ∧ dx1 ∧ dx2 ∧ dx3 .
For a proper Lorentz transformation, det(Λ) = 1. Thus the above two terms are equal.
Since the pull-back Λ∗ commutes with d, and also commutes with the Hodge star ? as in
Proposition 5.2.3, we find
dF = 0 dFe = 0
pull-back
=====⇒
d(?F ) = ?J via Λ d(?Fe) = ?Je
Thus the transformed electromagnetic field and the transformed charge-current density again
satisfy the Maxwell’s equations! In other words, Lorentz transformations will transform solu-
tions of Maxwell’s equations to solutions of Maxwell’s equations.
Another illustrating way to see this is to consider the Maxwell action
ˆ
1 1
SM [A, J] = F ∧ ?F + ?J ∧ A .
µ0 R3,1 2
Under the Lorentz transformation Λ, the gauge 1-form A = Aµ dxµ transforms as
eµ (x)dxµ = Λ∗ (A)
A 7−→ A
eµ (x) = Aρ (e
or in components A x)Λρ µ . Then
ˆ
e e 1 1e e e e
SM [A, J] = F ∧ ?F + ?J ∧ A
µ0 R3,1 2
ˆ
1 1
= Λ∗ F ∧ ?F + ?J ∧ A
µ0 R3,1 2
ˆ
1 1
= F ∧ ?F + ?J ∧ A = SM [A, J].
µ0 R3,1 2
151
Here in the second line, we have used the invariance of integral under orientation-preserving
e is a critical point
diffeomorphisms. It follows that A is a critical point of SM [A, J] if and only if A
e J],
of SM [A, e i.e. Lorentz transformations send solutions of Maxwell’s equations to solutions.
Example 5.2.5. Consider the electromagnetic field of a static point particle of charge q at the
origin
~ = q ~r ~ = 0.
E , B
4πε0 r3
Under the Lorentz boost Λ : R3,1 → R3,1
γ −γv/c 0 0
−γv/c 0 0
γ
Λ=
0 0 1 0
0 0 0 1
with
ct − xv/c
cet= p
1 − v 2 /c2
x − vt
xe= p
1 − v 2 /c2
ye = y
ze = z
is transformed to
ρe = γqδ ~re = γqδ(γ(x − vt))δ(y)δ(z) = qδ(x − vt)δ(y)δ(z)
ejx = qvδ(x − vt)δ(y)δ(z)
ejy = ejz = 0
152
This describes a point charge moving in the x-direction with constant velocity v. We have re-
covered the Liénard-Wiechert electric and magnetic fields for a point charge of constant velocity
described in Example 5.2.5.
Assume the particle has mass m, and the trajectory is described by ~r (t) = (x(t), y(t), z(t)).
Then the equation of motion is described via Newton’s law by
m~r¨ = q E~ + ~r˙ × B
~ .
Let us now work with a relativistic particle of mass m and charge q, whose trajectory in
the spacetime is described by
153
⃝
1 In terms of a parameter σ on γ, the first term is expressed explicitly as
ˆ r
dxµ dxν
−mc dσ ηµν .
dσ dσ
It is clear from the expression that this term does not depend on the choice of parameter σ.
e of the curve γ,
For another parameter σ
r s 2 ˆ r r
ˆ ˆ ˆ
µ
dx dx ν dxµ dxν de
σ de
σ dx µ dxν dxµ dxν
dσ ηµν = dσ ηµν = dσ ηµν = de
σ ηµν .
dσ dσ deσ deσ dσ dσ de
σ de σ de
σ deσ
This also explains why we write the action as
ˆ
p
−mc ηµν dxµ dxν
γ
Here terms in · · · involve higher orders in ~r˙ /c, which become zero in the non-relativistic limit.
It follows that the term ˆ
p
−mc ηµν dxµ dxν
γ
´
becomes the standard kinetic term 1
2m dt ~γ˙ 2 in the non-relativistic limit, up to a constant
which is irrelevant for equation of motion. Note that ηµν dxµ dxν is always nonnegative since
particles can not travel faster than the speed of light.
⃝
2 In the second term,
A = −φdt + Ax dx + Ay dy + Az dz
´
is the gauge 1-form, which can be integrated along a curve γ in R3,1 . This is q γ A.
If we parametrize the curve γ via time t along the curve
then ˆ ˆ ˆ
dx dy dz ~ · ~r˙ .
q A=q dt −φ + Ax + Ay + Az = dt −qφ + q A
γ dt dt dt
154
This coincides with the potential term in the case of non-relativistic charged particle.
The above shows that
ˆ ˆ
p
S[γ] = −mc ηµν dxµ dxν + q A
γ γ
The equation of motion is obtained by asking δS = 0 for an arbitrary variation δxµ . This leads
to
d dxν
Pµ = −qFµν
dσ dσ
where Pµ = ηµν P ν and
dxν
P = mc q
ν
. dσ
ρ dxν
ηρν dx
dσ dσ
Note that P µ is invariant under the change of the parametrization σ, by the same reason as
above. It is precisely the relativistic momentum. In fact, let us choose the time t as the
parameter, then
ẋµ mc m~r˙
P µ = mc p = q ,q
˙
c − ~r
2 2 ˙ ˙
1 − ~r /c
2 2 1 − ~r /c
2 2
P µ Pµ = (P 0 )2 − (P 1 )2 − (P 2 )2 − (P 3 )2 = m2 c2
155
which is the familiar result on relativistic 4-momentum. cP 0 gives the relativistic energy
mc2 1
q = mc2 + m~r˙ 2 + · · ·
2
1 − ~r˙ 2 /c2
r˙
and 12 m~r˙ 2 is the non-relativistic kinetic energy. The non-relativistic limit of √ m⃗
˙
gives m~r˙ ,
r 2 /c2
1−⃗
which is the standard momentum vector.
It is illustrating to unpack the extra equation coming from µ = 0:
dP0 dxν ~ · ~r˙ /c,
= −qF0ν = qE
dt dt
i.e.
d(cP0 ) ~ · ~r˙.
= qE
dt
Since cP0 is the energy, this equation simply tells the change of kinetic energy when work is
done by an electric field.
156
Bibliography
[1] Berger, Mitchell A., and George B. Field. The topological properties of magnetic helicity.
Journal of Fluid Mechanics 147 (1984): 133-148.
[2] Buchwald, Jed Z. Electrodynamics from Thomson and Maxwell to Hertz. The Oxford Hand-
book of the History of Physics. Oxford University Press; 2013.
[3] Butcher, Ginger. Tour of the electromagnetic spectrum. Government Printing Office, 2016.
[4] Eguchi, Tohru, Peter B. Gilkey, and Andrew J. Hanson. Gravitation, gauge theories and
differential geometry. Physics reports 66.6 (1980): 213-393.
[5] Garrity, Thomas A. Electricity and Magnetism for Mathematicians: A Guided Path from
Maxwell’s Equations to Yang-Mills. Cambridge University Press, 2015.
[6] Göckeler, Meinulf, and Thomas Schücker. Differential geometry, gauge theories, and gravity.
Cambridge University Press, 1989.
[7] Griffiths, David J., and Reed Colleger. Introduction to Electrodynamics. Prentice Hall Up-
per Saddle River. New Jersey 7458 (1999).
[8] Jackson, John David. Classical electrodynamics. John Wiley & Sons, 1999.
[9] Jackson, John David, and Lev Borisovich Okun. Historical roots of gauge invariance. Re-
views of Modern Physics 73.3 (2001): 663.
[10] Li, Si. Classical Mechanics and Geometry. International Press of Boston, 2023.
[11] Maxwell, James Clerk. A treatise on electricity and magnetism. Oxford: Clarendon Press,
1873.
[12] Nakahara, Mikio. Geometry, Topology and Physics. CRC press, 2003.
[13] Nayfeh, Munir H., and Morton K. Brussel. Electricity and magnetism. Courier Dover Pub-
lications, 2015.
[14] O’Raifeartaigh, Lochlainn. The dawning of gauge theory. Vol. 106. Princeton University
Press, 1997.
157
[15] Schwinger, Julian, Lester L. Deraad Jr., Kimball A. Milton, Wu-yang Tsai. Classical elec-
trodynamics. Westview Press, 1998.
158