Vector Calculus
Vector Calculus
Lent 2015
These notes are not endorsed by the lecturers, and I have modified them (often
significantly) after lectures. They are nowhere near accurate representations of what
was actually lectured, and in particular, all errors are almost surely mine.
Curves in R3
Parameterised curves and arc length, tangents and normals to curves in R3 , the radius
of curvature. [1]
Integration in R2 and R3
Line integrals. Surface and volume integrals: definitions, examples using Cartesian,
cylindrical and spherical coordinates; change of variables. [4]
Vector operators
Directional derivatives. The gradient of a real-valued function: definition; interpretation
as normal to level surfaces; examples including the use of cylindrical, spherical *and
general orthogonal curvilinear* coordinates.
Divergence, curl and ∇2 in Cartesian coordinates, examples; formulae for these oper-
ators (statement only) in cylindrical, spherical *and general orthogonal curvilinear*
coordinates. Solenoidal fields, irrotational fields and conservative fields; scalar potentials.
Vector derivative identities. [5]
Integration theorems
Divergence theorem, Green’s theorem, Stokes’s theorem, Green’s second theorem:
statements; informal proofs; examples; application to fluid dynamics, and to electro-
magnetism including statement of Maxwell’s equations. [5]
Laplace’s equation
Laplace’s equation in R2 and R3 : uniqueness theorem and maximum principle. Solution
of Poisson’s equation by Gauss’s method (for spherical and cylindrical symmetry) and
as an integral. [4]
Cartesian tensors in R3
Tensor transformation laws, addition, multiplication, contraction, with emphasis on
tensors of second rank. Isotropic second and third rank tensors. Symmetric and
antisymmetric tensors. Revision of principal axes and diagonalization. Quotient
theorem. Examples including inertia and conductivity. [5]
1
Contents IA Vector Calculus
Contents
0 Introduction 4
3 Integration in R2 and R3 17
3.1 Integrals over subsets of R2 . . . . . . . . . . . . . . . . . . . . . 17
3.2 Change of variables for an integral in R2 . . . . . . . . . . . . . . 19
3.3 Generalization to R3 . . . . . . . . . . . . . . . . . . . . . . . . . 21
3.4 Further generalizations . . . . . . . . . . . . . . . . . . . . . . . . 24
7 Integral theorems 38
7.1 Statement and examples . . . . . . . . . . . . . . . . . . . . . . . 38
7.1.1 Green’s theorem (in the plane) . . . . . . . . . . . . . . . 38
7.1.2 Stokes’ theorem . . . . . . . . . . . . . . . . . . . . . . . . 39
7.1.3 Divergence/Gauss theorem . . . . . . . . . . . . . . . . . 40
7.2 Relating and proving integral theorems . . . . . . . . . . . . . . . 41
2
Contents IA Vector Calculus
12 Maxwell’s equations 67
12.1 Laws of electromagnetism . . . . . . . . . . . . . . . . . . . . . . 67
12.2 Static charges and steady currents . . . . . . . . . . . . . . . . . 68
12.3 Electromagnetic waves . . . . . . . . . . . . . . . . . . . . . . . . 69
14 Tensors of rank 2 77
14.1 Decomposition of a second-rank tensor . . . . . . . . . . . . . . . 77
14.2 The inertia tensor . . . . . . . . . . . . . . . . . . . . . . . . . . 78
14.3 Diagonalization of a symmetric second rank tensor . . . . . . . . 80
3
0 Introduction IA Vector Calculus
0 Introduction
In the differential equations class, we learnt how to do calculus in one dimension.
However, (apparently) the world has more than one dimension. We live in a
3 (or 4) dimensional world, and string theorists think that the world has more
than 10 dimensions. It is thus important to know how to do calculus in many
dimensions.
For example, the position of a particle in a three dimensional world can be
d
given by a position vector x. Then by definition, the velocity is given by dt x = ẋ.
This would require us to take the derivative of a vector.
This is not too difficult. We can just differentiate the vector componentwise.
However, we can reverse the problem and get a more complicated one. We can
assign a number to each point in (3D) space, and ask how this number changes
as we move in space. For example, the function might tell us the temperature at
each point in space, and we want to know how the temperature changes with
position.
In the most general case, we will assign a vector to each point in space. For
example, the electric field vector E(x) tells us the direction of the electric field
at each point in space.
On the other side of the story, we also want to do integration in multiple
dimensions. Apart from the obvious “integrating a vector”, we might want to
integrate over surfaces. For example, we can let v(x) be the velocity of some
fluid at each point in space. Then to find the total fluid flow through a surface,
we integrate v over the surface.
In this course, we are mostly going to learn about doing calculus in many
dimensions. In the last few lectures, we are going to learn about Cartesian
tensors, which is a generalization of vectors.
Note that throughout the course (and lecture notes), summation convention
is implied unless otherwise stated.
4
1 Derivatives and coordinates IA Vector Calculus
Vector functions R → Rn
We start with the simple case of vector functions.
Definition (Vector function). A vector function is a function F : R → Rn .
This takes in a number and returns a vector. For example, it can map a time
to the velocity of a particle at that time.
Definition (Derivative of vector function). A vector function F(x) is differen-
tiable if
def
δF = F(x + δx) − F(x) = F0 (x)δx + o(δx)
for some F0 (x). F0 (x) is called the derivative of F(x).
We don’t have anything new and special here, since we might as well have
defined F0 (x) as
dF 1
F0 = = lim [F(x + δx) − F(x)],
dx δx→0 δx
dF = F0 (x) dx.
5
1 Derivatives and coordinates IA Vector Calculus
Proposition.
d df dg
(f g) = g+f
dt dt dt
d dg dh
(g · h) = ·h+g·
dt dt dt
d dg dh
(g × h) = ×h+g×
dt dt dt
Note that the order of multiplication must be retained in the case of the cross
product.
Example. Consider a particle with mass m. It has position r(t), velocity ṙ(t)
and acceleration r̈. Its momentum is p = mṙ(t).
Note that derivatives with respect to t are usually denoted by dots instead
of dashes.
If F(r) is the force on a particle, then Newton’s second law states that
ṗ = mr̈ = F.
L = r × p = mr × ṙ.
If we want to know how the angular momentum changes over time, then
L̇ = mṙ × ṙ + mr × r̈ = mr × r̈ = r × F.
Scalar functions Rn → R
We can also define derivatives for a different kind of function:
Here we have a fancy name “gradient” for the derivative. But we will soon
give up on finding fancy names and just call everything the “derivative”!
Note also that here we genuinely need the new notion of derivative, since
“dividing by δr” makes no sense at all!
6
1 Derivatives and coordinates IA Vector Calculus
The above definition considers the case where δr comes in all directions.
What if we only care about the case where δr is in some particular direction n?
For example, maybe f is the potential of a particle that is confined to move in
one straight line only.
Then taking δr = hn, with n a unit vector,
which gives
7
1 Derivatives and coordinates IA Vector Calculus
Now suppose we have a scalar function f (r) and we want to consider the rate
of change along a path r(u). A change δu produces a change δr = r0 δu + o(δu),
and
δf = ∇f · δr + o(|δr|) = ∇f · r0 (u)δu + o(δu).
This shows that f is differentiable as a function of u and
Theorem (Chain rule). Given a function f (r(u)),
df dr ∂f dxi
= ∇f · = .
du du ∂xi du
Note that if we drop the du, we simply get
∂f
df = ∇f · dr = dxi ,
∂xi
which is what we’ve previously had.
Vector fields Rn → Rm
We are now ready to tackle the general case, which are given the fancy name of
vector fields.
Definition (Vector field). A vector field is a function F : Rn → Rm .
Definition (Derivative of vector field). A vector field F : Rn → Rm is differen-
tiable if
def
δF = F(x + δx) − F(x) = M δx + o(δx)
for some m × n matrix M . M is the derivative of F.
As promised, M does not have a fancy name.
Given an arbitrary function F : Rn → Rm that maps x 7→ y and a choice
of basis, we can write F as a set of m functions yj = Fj (x) such that y =
(y1 , y2 , · · · , ym ). Then
∂Fj
dyj = dxi .
∂xi
and we can write the derivative as
Theorem. The derivative of F is given by
∂yj
Mji = .
∂xi
Note that we could have used this as the definition of the derivative. However,
the original definition is superior because it does not require a selection of
coordinate system.
Definition. A function is smooth if it can be differentiated any number of times.
This requires that all partial derivatives exist and are totally symmetric in i, j
and k (i.e. the differential operator is commutative).
The functions we will consider will be smooth except where things obviously
go wrong (e.g. f (x) = 1/x at x = 0).
8
1 Derivatives and coordinates IA Vector Calculus
We can calculate
∂ρ/∂x1 ∂ρ/∂x2
M (f ) = = M (g)−1 .
∂ϕ/∂x1 ∂ϕ/∂x2
These matrices are known as Jacobian matrices, and their determinants are
known as the Jacobians.
9
1 Derivatives and coordinates IA Vector Calculus
Note that
det M (f ) det M (g) = 1.
x1 = ρ cos ϕ, x2 = ρ sin ϕ.
However, note that ρ and ϕ are not components of a position vector, i.e. they
are not the “coefficients” of basis vectors like r = x1 e1 + x2 e2 are. But we can
associate related basis vectors that point to directions of increasing ρ and ϕ,
obtained by differentiating r with respect to the variables and then normalizing:
e2
eϕ eρ
ρ
ϕ
e1
These are not “usual” basis vectors in the sense that these basis vectors vary
with position and are undefined at the origin. However, they are still very useful
when dealing with systems with rotational symmetry.
In three dimensions, we have cylindrical polars and spherical polars.
10
2 Curves and Line IA Vector Calculus
11
2 Curves and Line IA Vector Calculus
It is often convenient to use the arclength s as the parameter. Then the tangent
vector will always have unit length since the proposition above yields
ds
|r0 (s)| = = 1.
ds
We call ds the scalar line element, which will be used when we consider integrals.
Definition (Scalar line element). The scalar line element of C is ds.
Proposition. ds = ±|r0 (u)|du
We say dr = r0 (u)du is the line element on C. Note that the upper and lower
limits of the integral are the end point and start point respectively, and β is not
necessarily larger than α.
For example, we may be moving a particle from a to b along a curve C
under a force field F. Then we may divide the curve into many small segments
δr. Then for each segment, the force experienced is F(r) and the work done is
F(r) · δr. Then the total work done across the curve is
Z
W = F(r) · dr.
C
Example. Take F(r) = (xey , z 2 , xy) and we want to find the line integral from
a = (0, 0, 0) to b = (1, 1, 1).
b
C2
C1
12
2 Curves and Line IA Vector Calculus
Now we try to integrate along another curve C2 : r(t) = (t, t, t). So r0 (t) =
(1, 1, 1).
Z Z
F · dr = F · r0 (t)dt
C2
Z 1
= tet + 2t2 dt
0
5
= .
3
We see that the line integral depends on the curve C in general, not just a, b.
We can also use the arclength s as the parameter. Since dr = t ds, with t
being the unit tangent vector, we have
Z Z
F · dr = F · t ds.
C C
Definition (Closed curve). A closed curve is a curve with the same start andH
end point. The line integral along a closed curve is (sometimes) written as
and is (sometimes) called the circulation of F around C.
Sometimes we are not that lucky and our curve is not smooth. For example,
the graph of an absolute value function is not smooth. However, often we can
break it apart into many smaller segments, each of which is smooth. Alternatively,
we can write the curve as a sum of smooth curves. We call these piecewise smooth
curves.
Definition (Piecewise smooth curve). A piecewise smooth curve is a curve
C = C1 + C2 + · · · + Cn with all Ci smooth with regular parametrisations. The
line integral over a piecewise smooth C is
Z Z Z Z
F · dr = F · dr + F · dr + · · · + F · dr.
C C1 C2 Cn
13
2 Curves and Line IA Vector Calculus
b
C3
C1
14
2 Curves and Line IA Vector Calculus
We see that if we integrate the first term with respect to x, we obtain x3 y sin z.
We obtain the same thing if we integrate the second and third term. So this is
equal to Z
d(x3 y sin z) = [x3 y sin z]b
a.
C
15
2 Curves and Line IA Vector Calculus
16
3 Integration in R2 and R3 IA Vector Calculus
3 Integration in R2 and R3
3.1 Integrals over subsets of R2
Definition (Surface integral). Let D ⊆ R2 . Let r = (x, y) be in Cartesian
coordinates. We can approximate D by N disjoint subsets of simple shapes, e.g.
triangles, parallelograms. These shapes are labelled by I and have areas δAi .
y
x
P
To integrate a function f over D, we would like to take the sum f (ri )δAi ,
and take the limit as δAi → 0. But we need a condition stronger than simply
δAi → 0. We won’t want the areas to grow into arbitrarily long yet thin strips
whose area decreases to 0. So we say that we find an ` such that each area can
be contained in a disc of diameter `.
Then we take the limit as ` → 0, N → ∞, and the union of the pieces tends
to D. For a function f (r), we define the surface integral as
Z X
f (r) dA = lim f (ri )δAi .
D `→0
I
where ri is some point within each subset Ai . The integral exists if the limit
is well-defined (i.e. the same regardless of what Ai and ri we choose before we
take the limit) and exists.
17
3 Integration in R2 and R3 IA Vector Calculus
δy
y
Y
x
xy
This is useful if we want to integrate over a concave area and we have disconnected
vertical strips.
y
We could also do it the other way round, integrating over y first, and come up
with the result
Z Z Z
f (x, y) dA = f (x, y) dy dx.
D X yx
18
3 Integration in R2 and R3 IA Vector Calculus
While we have rather strict conditions for this theorem, it actually holds in many
more cases, but those situations have to be checked manually.
Definition (Area element). The area element is dA.
Proposition. dA = dx dy in Cartesian coordinates.
Example. We integrate over the triangle bounded by (0, 0), (2, 0) and (0, 1).
We want to integrate the function f (x, y) = x2 y over the area. So
Z Z 1 Z 2−2y
f (xy) dA = x2 y dx dy
D 0 0
1 2−2y
x3
Z
= y dy
0 3 0
8 1
Z
= y(1 − y)3 dy
3 0
2
=
15
We can integrate it the other way round:
Z Z 2 Z 1−x/2
x2 y dA = x2 y dy dx
D 0 0
Z 1−x/2
2
1 2 2
= x y dx
0 2 0
Z 2 2
x x 2
= 1− dx
0 2 2
2
=
15
Since it doesn’t matter whether we integrate x first or y first, if we find it
difficult to integrate one way, we can try doing it the other way and see if it is
easier.
While this integral is tedious in general, there is a special case where it is
substantially easier.
Definition (Separable function). A function f (x, y) is separable if it can be
written as f (x, y) = h(y)g(x).
Proposition. Take separable f (x, y) = h(y)g(x) and D be a rectangle {(x, y) :
a ≤ x ≤ b, c ≤ y ≤ d}. Then
Z b ! Z !
Z d
f (x, y) dx dy = g(x) dx h(y) dy
D a c
19
3 Integration in R2 and R3 IA Vector Calculus
where
∂x ∂x
∂(x, y)
J= = ∂u ∂v
∂(u, v) ∂y ∂y
∂u ∂v
is the Jacobian. In other words,
dx dy = |J| du dv.
Proof. Since we are writing (x(u, v), y(u, v)), we are actually transforming from
(u, v) to (x, y) and not the other way round.
Suppose we start with an area δA0 = δuδv in the (u, v) plane. Then by
Taylors’ theorem, we have
∂x ∂x
δx = x(u + δu, v + δv) − x(u, v) ≈ δu + δv.
∂u ∂v
We have a similar expression for δy and we obtain
∂x ∂x
δx δu
≈ ∂u∂y
∂v
∂y
δy ∂u ∂v
δv
Recall from Vectors and Matrices that the determinant of the matrix is how
much it scales up an area. So the area formed by δx and δy is |J| times the area
formed by δu and δv. Hence
dx dy = |J| du dv.
x = ρ cos ϕ
y = ρ sin ϕ
dA = ρ dρ dϕ.
20
3 Integration in R2 and R3 IA Vector Calculus
Note that in polar coordinates, we are integrating over a rectangle and the
function is separable. So this is equal to
h 2
iR
π/2
= −e−ρ /2 [ϕ]0
0
π 2
= 1 − e−R /2 . (∗)
2
Note that the integral exists as R → ∞.
Now we take the case of x, y → ∞ and consider the original integral.
Z Z ∞ Z ∞
2 2
f dA = e−(x +y )/2 dx dy
D x=0 y=0
Z ∞ Z ∞
−x2 /2 −y 2 /2
= e dx e dy
0 0
π
=
2
p
where the last line is from (*). So each of the two integrals must be π/2, i.e.
Z ∞ r
−x2 /2 π
e dx = .
0 2
3.3 Generalization to R3
We will do exactly the same thing as we just did, but with one more dimension:
Definition (Volume integral). Consider a volume V ⊆ R3 with position vector
r = (x, y, z). We approximate V by N small disjoint subsets of some simple
shape (e.g. cuboids) labelled by I, volume δVI , contained within a solid sphere
of diameter `.
Assume that as ` → 0 and N → ∞, the union of the small subsets tend to
V . Then Z X
f (r) dV = lim f (r∗I )δVI ,
V `→0
I
where r∗I is any chosen point in each small subset.
To evaluate this, we can take δVI = δxδyδz, and take δx → 0, δy → 0 and
δz in some order. For example,
Z Z Z !
f (r) dv = f (x, y, z) dz dx dy.
V D Zxy
So we integrate f (x, y, z) over z at each point (x, y), then take the integral of
that over the area containing all required (x, y).
Alternatively, we can take the area integral first, and have
Z Z Z
f (r) dV = f (x, y, z) dx dy dz.
V z DZ
21
3 Integration in R2 and R3 IA Vector Calculus
where we separated the integral into three parts as in the area integrals.
Note that in the second line, we rewrote the integrals to write the differentials
next to the integral sign. This is simply a different notation that saves us from
writing r = 0 etc. in the limits of the integrals.
This is a useful general result. We understand it as the sum of spherical
shells of thickness δr and volume 4πr2 δr.
If we take f = 1, then we have the familiar result that the volume of a sphere
is 43 πa3 .
Example. Consider a volume within a sphere of radius a with a cylinder of
radius b (b < a) removed. The region is defined as
x2 + y 2 + z 2 ≤ a2
x 2 + y 2 ≥ b2 .
22
3 Integration in R2 and R3 IA Vector Calculus
a
b
b ≤ ρ ≤ a.
So the volume is
Z Z a Z 2π Z √a2 −ρ2
dV = dρ dϕ √ dz ρ
V b 0 − a2 −ρ2
Z a p
= 2π a2 − ρ2 dρ 2ρ
b
a
2
= 2π (a2 − ρ2 )3/2
3 b
4 2 2 3/2
= π(a − b ) .
3
Example. Suppose the density of electric charge is ρ(r) = ρ0 az in a hemisphere
H of radius a, with z ≥ 0. What is the total charge of H?
We use spherical polars. So
π
r ≤ a, 0 ≤ ϕ ≤ 2π, 0≤θ≤ .
2
We have
ρ0
ρ(r) = r cos θ.
a
The total charge Q in H is
Z Z a Z π/2 Z 2π
ρ0
ρ dV = dr r cos θr2 sin θ
dθ dϕ
H 0 0 0 a
ρ0 a 3
Z Z π/2 Z 2π
= r dr sin θ cos θ dθ dϕ
a 0 0 0
a π/2
ρ0 r 4 1 2
= sin θ [ϕ]2π
0
a 4 0 2 0
ρ0 πa3
= .
4
23
3 Integration in R2 and R3 IA Vector Calculus
Vector-valued integrals
R R
We can define V F(r) dV in a similar way to V f (r) dV as the limit of a sum over
small contributions of volume. In practice, we integrate them componentwise. If
F(r) = Fi (r)ei ,
then Z Z
F(r) dV = (Fi (r) dV )ei .
V V
For example, if a mass has density ρ(r), then its mass is
Z
M= ρ(r) dV
V
24
3 Integration in R2 and R3 IA Vector Calculus
25
4 Surfaces and surface integrals IA Vector Calculus
d dr
[f (r(u))] = ∇f · = 0.
du du
dr dr
This means that ∇f is always perpendicular to du . Since du is the tangent to
the curve, ∇f is perpendicular to the tangent. Since this is true for any curve
r(u), ∇f is perpendicular to any tangent of the surface. Therefore
Example.
26
4 Surfaces and surface integrals IA Vector Calculus
Most surfaces we encounter are orientable. For example, for a sphere, we can
declare that the normal should always point outwards. A notable example of a
non-orientable surface is the Möbius strip (or Klein bottle).
For simple cases, we can describe the orientation as “inward” and “outward”.
We can then label the points on the spheres by the angles θ, ϕ, with
r(u, v) = u, r(u, v) = u + v.
The idea is that r has to depend on both u and v, and in “different ways”.
More precisely, when we vary the coordinates (u, v), the point r will change
accordingly. By the chain rule, this is given by
∂r ∂r
δr = δu + δv + o(δu, δv).
∂u ∂v
∂r ∂r
Then δu and ∂v are tangent vectors to curves on S with v and u constant
respectively. What we want is for them to point in different directions.
27
4 Surfaces and surface integrals IA Vector Calculus
However, we are just using u and v as arbitrary labels for points in the surface,
and one unit of area in D does not correspond to one unit of area in S. Instead,
suppose we produce a small rectangle in D by changing u and v by small δu, δv.
In D, this corresponds to a rectangle with vertices (u, v), (u + δu, v), (u, v +
δv), (u + δu, v + δv), and spans an area δuδv. In the surface S, these small
∂r ∂r
changes δu, δv correspond to changes ∂u δu and ∂v δv, and these span a vector
area of
∂r ∂r
δS = × δuδv = n δS.
∂u ∂v
Note that the order of u, v gives the choice of the sign of the unit normal.
The actual area is then given by
∂r ∂r
δS = × δu δv.
∂u ∂v
Then we have
So we find
∂r
= aeθ .
∂θ
28
4 Surfaces and surface integrals IA Vector Calculus
Similarly, we have
∂r
= a sin θeϕ .
∂ϕ
Then
∂r ∂r
× = a2 sin θ er .
∂θ ∂ϕ
So
dS = a2 sin θ dθ dϕ.
Our bounds are 0 ≤ θ ≤ α, 0 ≤ ϕ ≤ 2π.
Then the area is
Z 2π Z α
a2 sin θ dθ dϕ = 2πa2 (1 − cos α).
0 0
However, this doesn’t work. For example, if all the flow is tangential to the
surface, then nothing is really passing through the surface, but |F| is non-zero,
so we get a non-zero integral. Instead, what we should do is to consider the
component of F that is normal to the surface S, i.e. parallel to its normal.
Definition (Surface integral). The surface integral or flux of a vector field F(r)
over S is defined by
Z Z Z
∂r ∂r
F(r) · dS = F(r) · n dS = F(r(u, v)) · × du dv.
S S D ∂u ∂v
Intuitively, this is the total amount of F passing through S. For example, if
F is the electric field, the flux is the amount of electric field passing through a
surface.
R
For a given orientation, the integral F·dS is independent of the parametriza-
tion. Changing orientation is equivalent to changing the sign of n, which is in
turn equivalent to changing the order of u and v in the definition of S, which is
also equivalent to changing the sign of the flux integral.
Example. Consider a sphere of radius a, r(θ, ϕ). Then
∂r ∂r
= aeθ , = a sin θeϕ .
∂θ ∂ϕ
The vector area element is
29
4 Surfaces and surface integrals IA Vector Calculus
u δt
δS
dS = a2 sin θn dϕ dθ,
with
r 1
n= = (x, y, z).
a a
So
1
n·u= (−x2 + z 2 ) = a(− sin2 θ cos2 ϕ + cos2 θ).
a
Therefore
Z Z α Z 2π
u · dS = a3 sin θ[(cos2 θ − 1) cos2 ϕ + cos2 θ] dϕ dθ
S 0 0
Z α
= a3 sin θ[π(cos2 θ − 1) + 2π cos2 θ] dθ
0
Z α
= a3 π(3 cos3 θ − 1) sin θ dθ
0
= πa3 [cosθ − cos3 θ]α
0
= πa3 cos α sin2 α.
What happens when we change parametrization? Let r(u, v) and r(ũ, ṽ) be
two regular parametrizations for the surface. By the chain rule,
∂r ∂r ∂ ũ ∂r ∂ṽ
= +
∂u ∂ ũ ∂u ∂ṽ ∂u
∂r ∂r ∂ ũ ∂r ∂ṽ
= +
∂v ∂ ũ ∂v ∂ṽ ∂v
So
∂r ∂r ∂(ũ, ṽ) ∂r ∂r
× = ×
∂u ∂v ∂(u, v) ∂ ũ ∂ṽ
30
4 Surfaces and surface integrals IA Vector Calculus
where ∂(ũ,ṽ)
∂(u,v) is the Jacobian.
Since
∂(ũ, ṽ)
dũ dṽ = du dv,
∂(u, v)
We recover the formula
∂r ∂r ∂r ∂r
dS = × du dv = × dũ dṽ.
∂u ∂v ∂ ũ ∂ṽ
Similarly, we have
∂r ∂r ∂r ∂r
dS = × du dv = × dũ dṽ.
∂u ∂v ∂ ũ ∂ṽ
provided (u, v) and (ũ, ṽ) have the same orientation.
∂r ∂r
× = (0, 0, J),
∂u ∂v
with J being the Jacobian. Therefore
Z Z Z
∂r ∂r
f (r) dS = f (r(u, v)) × du dv = f (r(u, v))|J| du dv,
S D ∂u ∂v D
∂r ∂r ∂r
δr = δu + δv + δw + o(δu, δv, δw).
∂u ∂v ∂w
Then the cuboid δu, δv, δw in u, v, w space is mapped to a parallelopiped of
volume
∂r ∂r ∂r
δV = δu · δv × δw = |J| δu δv δw.
∂u ∂v ∂w
So dV = |J| du dv dw.
31
5 Geometry of curves and surfaces IA Vector Calculus
n θ
t
r(0)
32
5 Geometry of curves and surfaces IA Vector Calculus
Since we are in 3D, given t(s) and n(s), there is another normal to the curve.
We can add a third normal to generate an orthonormal basis.
Definition (Binormal). The binormal of a curve is b = t × n.
We can define the torsion similar to the curvature, but with the binormal
instead of the tangent.1
33
5 Geometry of curves and surfaces IA Vector Calculus
34
6 Div, Grad, Curl and ∇ IA Vector Calculus
e1 e2 e3
∂Fk ∂ ∂ ∂
∇ × F = εijk ei = ∂x ∂y ∂z
∂xj
Fx Fy Fz
35
6 Div, Grad, Curl and ∇ IA Vector Calculus
Note that Grad and Div can be analogously defined in any dimension n, but
curl is specific to n = 3 because it uses the vector product.
Example. Consider rα with r = |r|. We know that r = xi ei . So r2 = xi xi .
Therefore
∂r
2r = 2xj ,
∂xj
or
∂r xi
= .
∂xi r
So
∂ α ∂r
∇rα = ei (r ) = ei αrα−1 = αrα−2 r.
∂xi ∂xi
Also,
∂xi
∇·r= = 3.
∂xi
and
∂xj
∇ × r = ek εijk = 0.
∂xi
Proposition. We have the following Leibnitz properties:
∇ · (rα r) = (∇rα )r + rα ∇ · r
= (αrα−2 r) · r + rα (3)
= (α + 3)rα
∇ × (rα r) = (∇(rα )) × r + rα (∇ × r)
= αrα−2 r × r
=0
36
6 Div, Grad, Curl and ∇ IA Vector Calculus
Proposition.
∇ × (∇f ) = 0
∇ · (∇ × F) = 0
∂2f
εijk = 0.
∂xi ∂xj
∇ × F = 0 ⇒ F = ∇f
for some f .
Definition (Conservative/irrotational field and scalar potential). If F = ∇f ,
then f is the scalar potential. We say F is conservative or irrotational.
Similarly,
Proposition. If H is defined over all of R3 and ∇ · H = 0, then H = ∇ × A
for some A.
∂2
2
∂2 ∂2
∂
∇2 = ∇ · ∇ = = + + .
∂xi ∂xi ∂x21 ∂x22 ∂x33
This operation is defined on both scalar and vector fields — on a scalar field,
∇2 f = ∇ · (∇f ),
∇2 A = ∇(∇ · A) − ∇ × (∇ × A).
37
7 Integral theorems IA Vector Calculus
7 Integral theorems
7.1 Statement and examples
There are three big integral theorems, known as Green’s theorem, Stoke’s theorem
and Gauss’ theorem. There are all generalizations of the fundamental theorem of
calculus in some sense. In particular, they all say that an n dimensional integral
of a derivative is equivalent to an n − 1 dimensional integral of the original
function.
We will first state all three theorems with some simple applications. In the
next section, we will see that the three integral theorems are so closely related
that it’s easiest to show their equivalence first, and then prove just one of them.
104 4
From example sheet 1, each side gives 105 a .
x
a
Then Green’s theorem follows directly from the fundamental theorem of calculus
in 1D. We first consider the first term of Green’s theorem:
Z Z aZ b
∂P ∂P
− dA = − dy dx
∂y 0 0 ∂y
Z a
= [−P (x, b) + P (x, 0)] dx
Z0
= P dx
C
38
7 Integral theorems IA Vector Calculus
Note that we can convert the 1D integral in the second-to-last line to a line integral
around the curve C, since the P (x, 0) and P (x, b) terms give the horizontal part
of C, and the lack of dy term means that the integral is nil when integrating the
vertical parts.
Similarly, Z Z
∂Q
dA = Q dy.
A ∂x C
Combining them gives Green’s theorem.
Green’s theorem also holds for a bounded region A, where the boundary ∂A
consists of disconnected components (each piecewise smooth, non-intersecting
and closed) with anti-clockwise orientation on the exterior, and clockwise on the
interior boundary, e.g.
The orientation of the curve comes from imagining the surface as:
39
7 Integral theorems IA Vector Calculus
Our boundary ∂C is
So they agree.
S1
S2
V is a solid hemisphere
x2 + y 2 + z 2 ≤ a2 , z ≥ 0,
40
7 Integral theorems IA Vector Calculus
On S1 ,
1
dS = n dS = (x, y, z) dS.
a
Then
1
F · dS = z(z + a) dS = cos θa(cos θ + 1) a2 sin θ dθ dϕ .
a | {z }
dS
Then
Z Z 2π Z π/2
F · dS = a3 dϕ sin θ(cos2 θ + cos θ) dθ
S1 0 0
π/2
3 −1 1
= 2πa cos3 θ − cos2 θ
3 2 0
5
= πa3 .
3
On S2 , dS = n dS = −(0, 0, 1) dS. Then F · dS = −a dS. So
Z
F · dS = −πa3 .
S2
So Z Z
5 2
F · dS + F · dS = − 1 πa3 = πa3 ,
S1 S2 3 3
in accordance with Gauss’ theorem.
Given any P (x, y) and Q(x, y), we can consider the vector field
F = (P, Q, 0),
41
7 Integral theorems IA Vector Calculus
So
∂Q ∂P
∇×F= 0, 0, − .
∂x ∂y
Then the left hand side of Stokes is
Z Z Z
F · dr = F · t ds = P dx + Q dy,
C C C
given Z Z
∂Fv ∂Fu
Fu du + Fv dv = − dA.
∂A A ∂u ∂v
Doing some pattern-matching, we want
F · dr = Fu du + Fv dv
for some Fu and Fv .
By the chain rule, we know that
∂r ∂r
dr = du + dv.
∂u ∂v
So we choose
∂r ∂r
Fu = F · , Fv = F · .
∂u ∂v
This choice matches the left hand sides of the two equations.
To match the right, recall that
∂r ∂r
(∇ × F) · dS = (∇ × F) · × du dv.
∂u ∂v
Therefore, for the right hand sides to match, we want
∂Fv ∂Fu ∂r ∂r
− = (∇ × F) · × . (∗)
∂u ∂v ∂u ∂v
Fortunately, this is true. Unfortunately, the proof involves complicated suffix
notation and summation convention:
∂Fv ∂ ∂r ∂ ∂xi ∂Fi ∂xj ∂xi ∂xi
= F· = Fi = + Fi .
∂u ∂u ∂v ∂u ∂v ∂xj ∂u ∂v ∂u∂v
42
7 Integral theorems IA Vector Calculus
Similarly,
∂Fu ∂ ∂r ∂ ∂xj ∂Fj ∂xi ∂xj ∂xi
= F· = Fj = + Fi .
∂u ∂u ∂u ∂u ∂u ∂xi ∂v ∂u ∂u∂v
So
∂Fv ∂Fu ∂xj ∂xi ∂Fi ∂Fj
− = − .
∂u ∂v ∂u ∂v ∂xj ∂xi
This is the left hand side of (∗).
The right hand side of (∗) is
∂r ∂r ∂Fj ∂xp ∂xq
(∇ × F) · × = εijk εkpq
∂u ∂v ∂xi ∂u ∂v
∂Fj ∂xp ∂xq
= (δip δjq − δiq δjp )
∂xi ∂u ∂v
∂Fj ∂Fi ∂xi ∂xj
= − .
∂xi ∂xj ∂u ∂v
So they match. Therefore, given our choice of Fu and Fv , Green’s theorem
translates to Stokes’ theorem.
Proposition. Greens theorem ⇔ 2D divergence theorem.
Proof. The 2D divergence theorem states that
Z Z
(∇ · G) dA = G · n ds.
A ∂A
and Z Z
∂Q ∂P
(∇ · G) dA = − dA.
A A ∂x ∂y
Now 2D version of Gauss’ theorem says the two LHS are the equal, and Green’s
theorem says the two RHS are equal. So the result follows.
Proposition. 2D divergence theorem.
Z Z
(∇ · G) dA = G · n ds.
A C=∂A
43
7 Integral theorems IA Vector Calculus
Proof. For the sake of simplicity, we assume that G only has a vertical component,
noting that the same proof works for purely horizontal G, and an arbitrary G is
just a linear combination of the two.
Furthermore, we assume that A is a simple, convex shape. A more complicated
shape can be cut into smaller simple regions, and we can apply the simple case
to each of the small regions.
Suppose G = G(x, y)ĵ. Then
∂G
∇·G= .
∂y
Then Z Z Z
∂G
∇ · G dA = dy dx.
A X Yx ∂y
Now we divide A into an upper and lower part, with boundaries C+ = y+ (x)
and C− = y− (x) respectively. Since I cannot draw, A will be pictured as a circle,
but the proof is valid for any simple convex shape.
y
C+
Yx
C−
x
dy
We see that the boundary of Yx at any specific x is given by y− (x) and y+ (x).
Hence by the Fundamental theorem of Calculus,
Z Z y+ (x)
∂G ∂G
dy = dy = G(x, y+ (x)) − G(x, y− (x)).
Yx ∂y y− (x) ∂y
To compute the full area integral, we want to integrate over all x. However, the
divergence theorem talks in terms of ds, not dx. So we need to find some way
to relate ds and dx. If we move a distance δs, the change in x is δs cos θ, where
θ is the angle between the tangent and the horizontal. But θ is also the angle
between the normal and the vertical. So cos θ = n · ĵ. Therefore dx = ĵ · n ds.
In particular, G dx = G ĵ · n ds = G · n ds, since G = G ĵ.
However, at C− , n points downwards, so n · ĵ happens to be negative. So,
actually, at C− , dx = −G · n ds.
44
7 Integral theorems IA Vector Calculus
Again, split S = ∂V into the top and bottom parts S+ and S− (ie the parts
with k̂ · n ≥ 0 and k̂ · n < 0), and parametrize by z+ (x, y) and z− (x, y). Then
the integral becomes
Z Z Z
∇ · F dV = (F (x, y, z+ ) − F (x, y, z− )) dA = F · n dS.
V D S
45
8 Some applications of integral theorems IA Vector Calculus
So
Proposition.
Z
1
n · (∇ × F)(r0 ) = lim F · dr,
diam(A)→0 area(A) ∂A
where the limit is taken over all surfaces A containing r0 with normal n.
These are coordinate-independent definitions of div and curl.
Example. Suppose u is a velocity field of fluid flow. Then
Z
u · dS
S
Then, at r0 ,
V̇
∇ · u = lim ,
V →0 V
the relative rate of change of volume. For example, if u(r) = αr (ie fluid flowing
out of origin), then ∇ · u = 3α, which increases at a constant rate everywhere.
46
8 Some applications of integral theorems IA Vector Calculus
Recall that Z
1
n · ∇ × u = lim u · dr = 2ω,
A→0 πa2 ∂A
ie twice the local angular velocity. For example, if you have a washing machine
rotating at a rate of ω, Then the velocity u = ω × r. Then the curl is
∇ × (ω × r) = 2ω,
∇ × (∇f ) = 0.
Proof. Given F(r) satisfying ∇ × F = 0, let C and C̃ be any two curves from a
to b.
47
8 Some applications of integral theorems IA Vector Calculus
C b
C̃
But ∇ × F = 0. So Z Z
F · dr − F · dr = 0,
C C̃
or Z Z
F · dr = F · dr.
C C̃
R
Proposition. If (ii) C F · dr is independent of C for fixed end points and
orientation, then (i) F = ∇f for some scalar field f .
Proof. We fix a and define f (r) = C F(r0 ) · dr0 for any curve from a to r.
R
So
δf = f (r + δr) − f (r) = F(r) · δr + o(δr).
But the definition of grad is exactly
δf = ∇f · δr + o(δr).
So we have F = ∇f .
Note that these results assume F is defined on the whole of R3 . It also
works of F is defined on a simply connected domain D, ie a subspace of R3
without holes. By definition, this means that any two curves C, C̃ with fixed
end points can be smoothly deformed into one another (alternatively, any loop
can be shrunk into a point).
If we have a smooth transformation from C to C̃, the process sweeps out a
surface bounded by C and C̃. This is required by the proof that (iii) ⇒ (ii).
If D is not simply connected, then we obtain a multi-valued f (r) on D in
general (for the proof (ii) ⇒ (i)). However, we can choose to restrict to a subset
D0 ⊆ D such that f (r) is single-valued on D0 .
48
8 Some applications of integral theorems IA Vector Calculus
Example. Take
−y x
F= , ,0 .
x2 + y 2 x2 + y 2
This obeys ∇ × F = 0, and is defined on D = R3 \ {z-axis}, which is not
simply-connected. We can also write
F = ∇f,
where
y
f = tan−1 .
x
which is multi-valued. If we integrate it about the closed loop x2 + y 2 = 1, z = 0,
i.e. a circle about the z axis, the integral gives 2π, as opposed to the expected 0
for a conservative force. This shows that the simply-connected-domain criterion
is important!
However f can be single-valued if we restrict it to
D0 = R3 − {half-plane x ≥ 0, y = 0},
which is simply-connected. (Draw and check!) Any closed curve we can draw in
this area will have an integral of 0 (the circle mentioned above will no longer be
closed!).
by divergence theorem. So this states that the rate of change of the quantity Q
in V is the flux of the stuff flowing out of the surface. ie Q cannot just disappear
but must smoothly flow out.
In particular, if V is the whole universe (ie R3 ), and j → 0 sufficiently rapidly
as |r| → ∞, then we calculate the total amount of Q in the universe by taking V
to be a solid sphere of radius R, and take the limit as R → ∞. Then the surface
integral → 0, and the equation states that
dQ
= 0,
dt
49
8 Some applications of integral theorems IA Vector Calculus
Example. If ρ(r, t) is the charge density (i.e. ρδV is the amount of charge in
a small volume δV ), then Q(t) is the total charge in V . j(r, t) is the electric
current density. So j · dS is the charge flowing through δS per unit time.
Example. Let j = ρu with u being the velocity field. Then (ρu δt) · δS is equal
to the mass of fluid crossing δS in time δt. So
Z
dQ
= − j · dS
dt S
does indeed imply the conservation of mass. The conservation equation in this
case is
∂ρ
+ ∇ · (ρu) = 0
∂t
For the case where ρ is constant and uniform (i.e. independent of r and t), we
get that ∇ · u = 0. We say that the fluid is incompressible.
50
9 Orthogonal curvilinear coordinates IA Vector Calculus
dr = hu eu du + hv ev dv + hw ew dw,
(ii) In cylindrical polars, r(ρ, ϕ, z) = ρ[cos ϕî + sin ϕĵ] + z k̂. Then hρ = hz = 1,
and
∂r
hϕ = = |(−ρ sin ϕ, ρ sin ϕ, 0)| = ρ.
∂ϕ
The basis vectors eρ , eϕ , ez are as in section 1.
(iii) In spherical polars,
51
9 Orthogonal curvilinear coordinates IA Vector Calculus
52
9 Orthogonal curvilinear coordinates IA Vector Calculus
δv C
u
δu
We then integrate around the curve C. We split the curve C up into 4 parts
(corresponding to the four sides), and take linear approximations by assum-
ing F and h are constant when moving through each horizontal/vertical
segment.
Z
F · dr ≈ Fu (u, v)hu (u, v) δu + Fv (u + δu, v)hv (u + δu, v) δu
C
− Fu (u, v + δv)hu (u, v + δv) δu − Fv (u, v)hv (u, v) δv
∂ ∂
≈ hv Fv − (hu Fu ) δuδv.
∂u ∂v
Divide by the area and take the limit as area → 0, we obtain
Z
1 1 ∂ ∂
lim F · dr = hv Fv − (hu Fu ) .
A→0 A C hu hv ∂u ∂v
So, by the integral definition of divergence,
1 ∂ ∂
ew · ∇ × F = (hv Fv ) − (hu Fu ) ,
hu hv ∂u ∂v
and similarly for other components.
We can find the divergence similarly.
1
Example. Let A = r tan θ2 eϕ in spherical polars. Then
er reθ r sin θeϕ
1 ∂ ∂ ∂ er ∂ θ 1
∇×A= ∂r ∂θ ∂ϕ = sin θ tan = 2 er .
r2 sin θ r2 sin θ ∂θ 2 r
0 0 r sin θ · 1r tan θ2
53
10 Gauss’ Law and Poisson’s equation IA Vector Calculus
This means that if you walk around the place and return to the same position,
the total work done is 0 and you did not gain energy, i.e. gravitational potential
energy is conserved.
Gauss’ law tells us what this gravitational field looks like:
Law (Gauss’ law for gravitation). Given any volume V bounded by closed
surface S, Z
g · dS = −4πGM,
S
where G is Newton’s gravitational constant, and M is the total mass contained
in V .
These equations determine g(r) from a mass distribution.
Example. We can obtain Newton’s law of gravitation from Gauss’ law together
with an assumption about symmetry.
Consider a total mass M distributed with a spherical symmetry about the
origin O, with all the mass contained within some radius r = a. By spherical
symmetry, we have g(r) = g(r)r̂.
Consider Gauss’ law with S being a sphere of radius r = R > a. Then n̂ = r̂.
So Z Z Z
g · dS = g(R)r̂ · r̂ dS = g(R)dS = 4πR2 g(R).
S S
By Gauss’ law, we obtain
4πR2 g(R) = −4πGM.
So
GM
g(R) = −
R2
for R > a.
Therefore the gravitational force on a mass m at r is
GM m
F(r) = − r̂.
r2
If we take the limit as a → 0, we get a point mass M at the origin. Then we
recover Newton’s law of gravitation for point masses.
54
10 Gauss’ Law and Poisson’s equation IA Vector Calculus
R
The condition C
g · dr = 0 for any closed C can be re-written by Stoke’s
theorem as Z
∇ × g · dS = 0,
S
where S is bounded by the closed curve C. This is true for arbitrary S. So
∇ × g = 0.
∇ · g = −4πGρ.
∇2 ϕ = 4πGρ.
55
10 Gauss’ Law and Poisson’s equation IA Vector Calculus
∇ × E = 0.
E
p
r= x2 + y 2
56
10 Gauss’ Law and Poisson’s equation IA Vector Calculus
E(r) = E(r)r̂.
Pick S to be a cylinder of length L and radius r. We know that the end caps do
not contribute to the flux since the field lines are perpendicular to the normal.
Also, the curved surface has area 2πrL. Then by Gauss’ law in integral form,
Z
σL
E · dS = E(r)2πrL = .
S ε0
So
σ
E(r) = r̂.
2πε0 r
Note that the field varies as 1/r, not 1/r2 . Intuitively, this is because we have
one more dimension of “stuff” compared to the point charge, so the field does
not drop as fast.
∇2 ϕ = −ρ,
∇2 ϕ = 0.
∇ϕ = ϕ0 (r)r̂.
57
10 Gauss’ Law and Poisson’s equation IA Vector Calculus
1 1 1 b3
A = − ρ 0 b3 , B= ρ0 a2 + ρ0 .
3 5 3 a
58
10 Gauss’ Law and Poisson’s equation IA Vector Calculus
r
r=a
59
10 Gauss’ Law and Poisson’s equation IA Vector Calculus
So
GM R
g(R) = − .
a3
For R ≥ a, we can simply apply Newton’s law of gravitation.
In general, even if the problem has nothing to do with gravitation or electro-
statics, if we want to solve ∇2 ϕ = −ρ with ρ and ϕ sufficiently symmetric, we
can consider the flux of ∇ϕ out of a surface S = ∂V :
Z Z
∇ϕ · dS = − ρ dV,
S V
60
11 Laplace’s and Poisson’s equations IA Vector Calculus
(i) Ψ = 0 on S; or
∂Ψ
(ii) ∂n = 0 on S.
But
2 2
∇ · (Ψ∇Ψ) = (∇Ψ) · (∇Ψ) + Ψ ∇
| {zΨ} = |(∇Ψ)| .
=0
So Z
|∇Ψ|2 dV = 0.
V
61
11 Laplace’s and Poisson’s equations IA Vector Calculus
We’ve proven uniqueness. How about existence? It turns out it isn’t difficult
to craft a boundary condition in which there are no solutions.
For example, if we have ∇2 ϕ = −ρ on V with the condition ∂ϕ
∂n = g, then by
the divergence theorem,
Z Z
2 ∂ϕ
∇ ϕ dV = dS.
V ∂S ∂n
So if ρ and g don’t satisfy this equation, then we can’t have any solutions.
The theorem can be similarly proved and stated for regions in R2 , R3 , · · · , by
using the definitions of grad, div and the divergence theorem. The result also
extends to unbounded domains. To prove it, we can take a sphere of radius R
and impose the boundary conditions |Ψ(r)| = O(1/R) or | ∂Ψ 2
∂n (r)| = O(1/R ) as
R → ∞. Then we just take the relevant limits to complete the proof.
Similar results also apply to related equations and different kinds of boundary
conditions, eg D or N on different parts of the boundary. But we have to analyse
these case by case and see if the proof still applies.
The proof uses a special case of the result
Proposition (Green’s first identity).
Z Z Z
(u∇v) · dS = (∇u) · (∇v) dV + u∇2 v dV,
S V V
These are sometimes useful, but can be easily deduced from the divergence
theorem when needed.
62
11 Laplace’s and Poisson’s equations IA Vector Calculus
In words, this says that the value at the center of a sphere is the average of
the values on the surface on the sphere.
dS = R2 sin θ dθ dχ.
dS
So R2 is independent of R. Write
Z
1 dS
ϕ̄(R) = ϕ .
4π R2
by divergence theorem. So ϕ̄(R) does not depend on R, and the result follows.
Proof. Suppose that ϕ had a local maximum at a in the interior. Then there is
an ε such that for any r such that 0 < |r − a| < ε, we have ϕ(r) < ϕ(a).
Note that if there is an ε that works, then any smaller ε will work. Pick an ε
sufficiently small such that the region |r − a| < ε lies within V (possible since a
lies in the interior of V ).
Then for any r such that |r − a| = ε, we have ϕ(r) < ϕ(a).
Z
1
ϕ̄(ε) = ϕ(r) dS < ϕ(a),
4πR2 SR
63
11 Laplace’s and Poisson’s equations IA Vector Calculus
2
i.e. ∂x∂i ∂x
ϕ
i
= Hii = 0. But
P Hii is the trace of the Hessian matrix, which is the
sum of eigenvalues. So λi = 0.
Recall that a maximum or minimum occurs when all eigenvalues have the
same sign. This clearly cannot happen if the sum is 0. Therefore we can only
have saddle points.
(note we ignored the case where all λi = 0, where this analysis is inconclusive)
ρ(r0 )
Z
1
ϕ(r) = dV 0
4π V 0 |r − r0 |
64
11 Laplace’s and Poisson’s equations IA Vector Calculus
65
11 Laplace’s and Poisson’s equations IA Vector Calculus
In short, we have
1
∇2 = −4πδ(r − r0 ).
|r − r0 |
Using these, we can verify that the integral solution of Poisson’s equation we
obtained previously is correct:
ρ(r0 )
Z
2 2 1 0
∇ Ψ(r) = ∇ dV
4π V 0 |r − r0 |
Z
1 0 1
= ρ(r )∇ 2
dV 0
4π V 0 |r − r0 |
Z
=− ρ(r0 )δ(r − r0 ) dV 0
V0
= −ρ(r),
as required.
66
12 Maxwell’s equations IA Vector Calculus
12 Maxwell’s equations
12.1 Laws of electromagnetism
Maxwell’s equations are a set of four equations that describe the behaviours
of electromagnetism. Together with the Lorentz force law, these describe all
we know about (classical) electromagnetism. All other results we know are
simply mathematical consequences of these equations. It is thus important to
understand the mathematical properties of these equations.
To begin with, there are two fields that govern electromagnetism, known
as the electric and magnetic field. These are denoted by E(r, t) and B(r, t)
respectively.
To understand electromagnetism, we need to understand how these fields
are formed, and how these fields affect charged particles. The second is rather
straightforward, and is given by the Lorentz force law.
Law (Lorentz force law). A point charge q experiences a force of
F = q(E + ṙ × B).
So
∂ρ
+ ∇ · j = 0.
∂t
67
12 Maxwell’s equations IA Vector Calculus
We can also take the volume integral of the first equation to obtain
Z Z
1 Q
∇ · E dV = ρ dV = .
V ε 0 V ε 0
Electrostatics Magnetostatics
∇ · E = ρ/ε0 ∇·B=0
∇×E=0 ∇ × B = µ0 j
∇2 ϕ = −ρ/ε0 ∇2 A = −µ0 j.
ε0 sets the scale of electrostatic effects, µ0 sets the scale of magnetic effects,
e.g. the Coulomb force e.g. force between two wires with cur-
rents.
68
12 Maxwell’s equations IA Vector Calculus
∂B ∂ ∂2E
∇2 E = ∇(∇ · E) − ∇ × (∇ × E) = ∇ × = (∇ × B) = µ0 ε0 2 .
∂t ∂t ∂ t
Define c = √ 1
µ0 ε0 . Then the equation gives
1 ∂2
∇2 − 2 2 E = 0.
c ∂t
69
13 Tensors and tensor fields IA Vector Calculus
vi0 = Rip vp .
We can think about this from another angle. To define an arbitrary quantity
Aij , we can always just write down 9 numbers and be done with it. Moreover,
we can write down a different set of numbers in a different basis. For example,
we can define Aij = δij in our favorite basis, but Aij = 0 in all other bases. We
can do so because we have the power of the pen.
However, for this Aij to represent something physically meaningful, i.e. an
actual linear transformation, we have to make sure that the components of Aij
transform sensibly under a basis transformation. By “sensibly”, we mean that it
has to follow the transformation rule A0ij = Rip Rjq Apq . For example, the Aij
we defined in the previous paragraph does not transform sensibly. While it is
something we can define and write down, it does not correspond to anything
meaningful.
The things that transform sensibly are known as tensors. For example,
vectors and matrices (that transform according to the usual change-of-basis
rules) are tensors, but that Aij is not.
In general, tensors are allowed to have an arbitrary number of indices. In
order for a quantity Tij···k to be a tensor, we require it to transform according to
0
Tij···k = Rip Rjq · · · Rkr Tpq···r ,
Example.
– A tensor T of rank 0 doesn’t transform under change of basis, and is a
scalar.
70
13 Tensors and tensor fields IA Vector Calculus
Tij···k = ui vj · · · wk
Then linear combinations of such expressions are also tensors, e.g. Tij =
ui vj + ai bj for any u, v, a, b.
(ii) δij and εijk are tensors of rank 2 and 3 respectively — with the special
property that their components are unchanged with respect to the basis
coordinate:
0
δij = Rip Rjq δpq = Rip Rjp = δij ,
since Rip Rjp = (RRT )ij = Iij . Also
71
13 Tensors and tensor fields IA Vector Calculus
T = u ⊗ v ⊗ ··· ⊗ w
by
Tij···k = ui vj · · · wk ,
as defined in the example in the beginning of the chapter.
Definition (Tensor contraction). For a tensor T of rank n with components
Tijp···q , we can contract on the indices i, j to obtain a new tensor of rank n − 2:
Note that we don’t have to always contract on the first two indices. We can
contract any pair we like.
To check that contraction produces a tensor, we take the ranks 2 Tij example.
Contracting, we get Tii ,a rank-0 scalar. We have Tii0 = Rip Riq Tpq = δpq Tpq =
Tpp = Tii , since R is an orthogonal matrix.
If we view Tij as a matrix, then the contraction is simply the trace of
the matrix. So our result above says that the trace is invariant under basis
transformations — as we already know in IA Vectors and Matrices.
Note that our usual matrix product can be formed by first applying a tensor
product to obtain Mij Npq , then contract with δjp to obtain Mij Njq .
Tijp···q = Tjip···q .
It is anti-symmetric if
Tijp···q = −Tjip···q .
Again, a tensor can be symmetric or anti-symmetric in any pair of indices, not
just the first two.
72
13 Tensors and tensor fields IA Vector Calculus
as required.
Definition (Totally symmetric and anti-symmetric tensors). A tensor is totally
(anti-)symmetric if it is (anti-)symmetric in every pair of indices.
Example. δij = δji is totally symmetric, while εijk = −εjik is totally antisym-
metric.
There are totally symmetric tensors of arbitrary rank n. But in R3 ,
– Any totally antisymmetric tensor of rank 3 is λεijk for some scalar λ.
T (a, b, · · · , c) = Tij···k ai bj · · · ck .
To show that tensors are equivalent to multi-linear maps, we have to show the
following:
(i) Defining a map with a tensor makes sense, i.e. the expression Tij···k ai bj · · · ck
is the same regardless of the basis chosen;
73
13 Tensors and tensor fields IA Vector Calculus
Since vp0 = Rpi vi by tensor transformation rules, multiplying both sides by Rpi
gives vi = Rpi vp0 . Substituting in gives
vi···j ai · · · bj = Ti···jp···q ai · · · bj cp · · · dq
74
13 Tensors and tensor fields IA Vector Calculus
We assume that the fields are smooth so they can be differentiated any
number of times
∂ ∂
··· Tij···k ,
∂xp ∂xq
except for where things obviously fail, e.g. for where T is not defined. We now
claim:
Proposition.
∂ ∂
··· T · · · k, (∗)
∂xp ∂xq ij
| {z } | {z
n
}
m
is a tensor of rank n + m.
Proof. To show this, it suffices to show that ∂x∂ p satisfies the tensor transfor-
mation rules for rank 1 tensors (i.e. it is something like a rank 1 tensor). Then
by the exact same argument we used to show that tensor products preserve
tensorness, we can show that the (∗) is a tensor. (we cannot use the result of
tensor products directly, since this is not exactly a product. But the exact same
proof works!)
Since x0i = Riq xq , we have
∂x0i
= Rip .
∂xp
∂xp
(noting that ∂xq = δpq ). Similarly,
∂xq
= Riq .
∂x0i
75
13 Tensors and tensor fields IA Vector Calculus
(mass
R times velocity), where Tij = ρui uj . Then the flux through the surface S
is S Tij nj dS.
It is easy to generalize the divergence theorem from vectors to tensors. We
can then use it to discuss conservation laws for tensor quantities.
Let V be a volume bounded by a surface S = ∂V and Tij···k` be a smooth
tensor field. Then
Theorem (Divergence theorem for tensors).
Z Z
∂
Tij···k` n` dS = (Tij···k` ) dV,
S V ∂x`
76
14 Tensors of rank 2 IA Vector Calculus
14 Tensors of rank 2
14.1 Decomposition of a second-rank tensor
This decomposition might look arbitrary at first sight, but as time goes on, you
will find that it is actually very useful in your future career (at least, the lecturer
claims so).
Any second rank tensor can be written as a sum of its symmetric and
anti-symmetric parts
Tij = Sij + Aij ,
where
1 1
Sij = (Tij + Tji ), Aij = (Tij − Tji ).
2 2
Here Tij has 9 independent components, whereas Sij and Aij have 6 and 3
independent components, since they must be of the form
a d e 0 a b
(Sij ) = d b f , (Aij ) = −a 0 c .
e f c −b −c 0
The symmetric part can be be further reduced to a traceless part plus an isotropic
(i.e. multiple of δij ) part:
1
Sij = Pij + δij Q,
3
where Q = Sii is the trace of Sij and Pij = Pji = Sij − 31 δij Q is traceless. Then
Pij has 5 independent components while Q has 1.
Since the antisymmetric part has 3 independent components, just like a usual
vector, we should be able to write Ai in terms of a single vector. In fact, we can
write the antisymmetric part as
Aij = εijk Bk
for some vector B. To figure out what this B is, we multiply by εij` on both
sides and use some magic algebra to obtain
1 1
Bk = εijk Aij = εijk Tij ,
2 2
where the last equality is from the fact that only antisymmetric parts contribute
to the sum.
Then
0 B3 −B2
(Aij ) = −B3 0 B1
B2 −B1 0
To summarize,
1
Tij = Pij + εijk Bk + δij Q,
3
Tij +Tji
where Bk = 21 εpqj Tpq , Q = Tkk and Pij = Pji = 2 − 13 δij Q.
77
14 Tensors of rank 2 IA Vector Calculus
∂Fi
Example. The derivative of a vector field Fi (r) is a tensor Tij = ∂x j
, a tensor
field. Our decomposition given above has the symmetric traceless piece
1 ∂Fi ∂Fj 1 ∂Fk 1 ∂Fi ∂Fj 1
Pij = + − δij = + − δij ∇ · F,
2 ∂xj ∂xi 3 ∂xk 2 ∂xj ∂xi 3
1 ∂Fi 1
Bk = εijk = − (∇ × F)k .
2 ∂xj 2
and trace
∂Fk
Q= = ∇ · F.
∂xk
Hence a complete description involves a scalar ∇ · F, a vector ∇ × F, and a
symmetric traceless tensor Pij .
Li = Iij ωj ,
where
For a rigid body occupying volume V with mass density ρ(r), we replace the
sum with an integral to obtain
Z
Iij = ρ(r)(xk xk δij − xi xj ) dV.
V
78
14 Tensors of rank 2 IA Vector Calculus
x3
a
2`
x1
x2
x1 = r cos θ
x2 = r sin θ
x3 = x3
dV = r dr dθ dx3
We have
Z
I33 = ρ0 (x21 + x22 ) dV
V
Z a Z 2π Z `
= ρ0 r2 (r dr dθ dx2 )
0 0 −`
4 a
r
= ρ0 · 2π · 2`
4 0
= ε0 π`a4 .
Similarly, we have
Z
I11 = ρ0 (x22 + x23 ) dV
V
Z a Z 2π Z `
= ρ0 (r2 sin2 θ + x23 )r dr dθ dx3
0 0 −`
Z a Z 2π 3 ` !
` x
= ρ0 r r sin θ [x3 ]−` + 3
2 2
dθ dr
0 0 3 −`
Z a Z 2π
2
= ρ0 r r2 sin2 θ2` + `3 dθ dr
0 0 3
Z a Z 2π
2 3 2 2
= ρ0 2πa · ` + 2` r dr sin θ
3 0 0
2
a 2
= ρ0 πa2 ` + `2
2 3
79
14 Tensors of rank 2 IA Vector Calculus
1
L= M a2 ω
2
for rotation about any axis.
and the tensor transformation rule Tij0 = Rip Rjq Tpq becomes
T 0 = RT RT = RT R−1 .
80
15 Invariant and isotropic tensors IA Vector Calculus
(i) There are no isotropic tensors of rank 1, except the zero tensor.
(ii) The most general rank 2 isotropic tensor is Tij = αδij for some scalar α.
(iii) The most general rank 3 isotropic tensor is Tijk = βεijk for some scalar β.
(iv) All isotropic tensors of higher rank are obtained by combining δij and εijk
using tensor products, contractions, and linear combinations.
We will provide a sketch of the proof:
Proof. We analyze conditions for invariance under specific rotations through π
or π/2 about coordinate axes.
81
15 Invariant and isotropic tensors IA Vector Calculus
and
T23 = R2p R3q Tpq = R21 R33 T13 = −T13
So T13 = T23 = 0. Similarly, we have T31 = T32 = 0.
We also have
T11 = R1p R1q Tpq = R12 R12 T22 = T22 .
So T11 = T22 .
By picking a rotation about a different axis, we have T21 = T12 and
T22 = T33 .
Hence Tij = αδij .
(iii) Suppose that Tijk is rank-3 isotropic. Using the rotation by π about the
x3 axis, we have
So T111 = 0. We have similar results for π rotations about other axes and
other choices of indices.
Then we can show that Tijk = 0 unless all i, j, k are distinct.
Now consider
0 1 0
(Rij ) = −1 0 0 ,
0 0 1
a rotation about x3 through π/2. Then
T123 = R1p R2q R3r Tpqr = R12 R21 R33 T213 = −T213 .
So T123 = −T213 . Along with similar results for other indices and axes of
rotation, we find that Tijk is totally antisymmetric, and Tijk = βεijk for
some β.
82
15 Invariant and isotropic tensors IA Vector Calculus
Theorem. Let Z
Tij···k = f (x)xi xj · · · xk dV.
V
The result is particularly useful if (i) and (ii) hold for any rotation R, in
which case Tij···k is isotropic.
Example. Let Z
Tij = xi xj dV,
V
with V being a solid sphere of |r| < a. Our result applies with f = 1, which,
being a constant, is clearly invariant under rotations. Also the solid sphere is
invariant under any rotation. So T must be isotropic. But the only rank 2
isotropic tensor is αδij . Hence we must have
Tij = αδij ,
So
4
Tij = πa5 δij .
15
83
15 Invariant and isotropic tensors IA Vector Calculus
Normally if we are only interested in the i 6= j case, we just claim that Tij = 0
by saying “by symmetry, it is 0”. But now we can do it (more) rigorously!
There is a closely related result for the inertia tensor of a solid sphere of
constant density ρ0 , or of mass M = 43 πa3 ρ0 .
Recall that Z
Iij = ρ0 (xk xk δij − xi xj ) dV.
V
R
We see that Iij is isotropic (since we have just shown that xi xj dV is isotropic,
and xk xk δij is also isotropic). Let Iij = βδij . Then
Z
Iij = ρ0 (xk xk δij − xi xj ) dV
V
Z Z
= ρ0 δij xk xk dV − xi xj dV
V V
= ρ0 (δij Tkk − Tij )
4 5 4 5
= ρ0 πa δij − πa δij
5 15
8
= ρ0 πa5 δij
15
2
= M a2 δij .
5
84