Vector Calculus Allanach Evans
Vector Calculus Allanach Evans
Vector Calculus Allanach Evans
Vector Calculus IA
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
Abstract: These are lecture notes for the Cambridge mathematics tripos Part IA
Vector Calculus course. There are four examples sheets for this course. These notes
are pretty much complete.
Vector Analysis and Cartesian Tensors, Bourne and Kendall 1999 by Nelson
Thomas 30.
Future lecturers: please feel free to use/modify this resource. If you do though, please
append your name to the current author list.
Contents
1 Derivatives and Coordinates 1
1.1 Differentiation Using Vector Notation 1
1.1.1 Vector function of a scalar 1
1.1.2 Scalar function of position; gradient and directional derivatives 2
1.1.3 The chain rule: a particular case 3
1.2 Differentiation Using Coordinate Notation 3
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
3 Integration in 2 and 3 14
3.1 Integrals over subsets of 2 14
3.1.1 Definition as the limit of a sum 14
3.1.2 Evaluation as multiple integrals 15
3.1.3 Comments 16
3.2 Change of Variables for an Integral in 2 17
3.3 Generalisation to 3 19
3.3.1 Definitions 19
3.3.2 Change of Variables in 3 20
3.3.3 Examples 21
3.4 Further Generalisations and Comments 22
3.4.1 Integration in n 22
3.4.2 Change of variables for the case n = 1 23
3.4.3 Vector valued integrals 23
i
4 Surfaces and Surface Integrals 24
4.1 Surfaces and Normals 24
4.2 Parameterised Surfaces and Area 26
4.3 Surface Integrals of Vector Fields 27
4.4 Comparing Line, Surface and Volume Integrals 30
4.4.1 Line and surface integrals and orientations 30
4.4.2 Change of variables in 2 and 3 revisited 30
7 Integral Theorems 36
7.1 Statements and Examples 36
7.1.1 Greens theorem (in the plane) 36
7.1.2 Stokes theorem 38
7.1.3 Divergence, or Gauss theorem 40
7.2 Relating and Proving the Integral Theorems 41
7.2.1 Proving Greens theorem from Stokes theorem or the 2d di-
vergence theorem 41
7.2.2 Proving Greens theorem by Proving the 2d Divergence Theo-
rem 42
7.2.3 Greens theorem Stokes theorem 45
7.2.4 Proving the divergence (or Gauss) theorem in 3d: outline 46
ii
10.3 Poissons Equation and Laplaces Equation 58
12 Maxwells Equations 66
12.1 Laws of Electromagnetism 66
12.2 Static Charges and Steady Currents 67
12.3 Electromagnetic Waves 67
14 Tensors of Rank 2 73
14.1 Decomposition of a Second Rank Tensor 73
14.2 The Inertia Tensor 73
14.3 Diagonalisation of a Symmetric Second Rank Tensor 74
iii
15 Invariant and Isotropic Tensors 75
15.1 Definitions and Classification Results 75
15.2 Application to Invariant Integrals 75
15.3 A Sketch of a Proof of Classification Results for Rank n 3 76
(f F ) = f F + f F , (F G) = F G + F G ,
(F G) = F G + F G .
provided the basis {ei } is independent of u (note the implicit summation convention
over repeated indices).
The definition of the derivative can be abbreviated using differential notation
dF = F (u) du in which oterms are suppressed, compared to the relation above
with small but finite changes.
Example: A point particle of mass m has position r(t) (a function of time t), velocitya
r(t),
acceleration r(t) and momentum p = mr(t).
If F (r) is the force on a particle,
then p = m r = F (r) is Newtons Second Law. We define the angular momentum
about the origin L = r p = mr r L = m
r r + mr r, using Leibniz. Hence
L = r F , or the torque of F about the origin.
a
Note that derivatives with respect to t are conventionally denoted by dots instead of dashes.
1
1.1.2 Scalar function of position; gradient and directional derivatives
A scalar function f (r) is differentiable at r if
1
n f = limh0 [f (r + hn) f (r)] ,
h
the directional derivative of f along n. Thus, the gradient contains information about
how f changes as we move away from r to first order in the displacement.
Let r = xi ei with {ei } orthonormal. Setting n = ei for fixed i gives
1 f
ei f = limh0 [f (r + hei ) f (r)] = .
h xi
Hence the gradient in Cartesian coordinates is
f
f = e. (1.1)
xi i
With this choice of basis and coordinates, the definition of differentiability becomes
f
f = xi + o(x)
xi
as x = xi xi 0. In differential notation, we suppress oterms:
f
df = f dr = dxi .
xi
thus f = (1, 0, 1) at (x, y, z) = (0, 1, 0). The rate of change of f along direction
n is given by the directional derivative n f . So f increases most rapidly for n =
+ 12 (1, 0, 1) and decreases most rapidly for n = 12 (1, 0, 1) at a rate n f = 2.
To first order, there is no change in f if n (1, 0, 1).
2
1.1.3 The chain rule: a particular case
Consider a composition of differentiable functions f (r(u)). A change u produces a
change r = r u + o(u) and
df f dxi
a chain rule. In Cartesian coordinates (setting r = xi ei ), du
= xi du
.
with the ranges of the labels a, i, r understood. If f and g are smooth, then so is
f g, with derivative
M (f g)ra = M (f )ri M (g)ia ,
3
or matrix multiplication
yr yr xi
ua = xi ua
mp m n n p matrix type
Thus, if we compare the functions, we also compose the derivatives (matrices) as
linear maps.
dyr = M (f )ri dxi
combine and compare with dyr = M (f g)ra dua .
dxi = M (g)ia dua
xi
In operator form, u a = u , which holds when acting on any function which
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
a xi
depends on ua through xi .
For example, y = f (xi (u)) 3
dy y dxi
du
= xi du .
11 13 31
Compare with section 1.1.3.
4
x2
e
11
00
1
0
00
11
00
11
0011
11 00 e
00
11 00
11
00
11
00
11
00
11 00
11
00
11
00
11
1
0
00
11
0
e2 1
0
1
0
1
0
1
0
1
0
1
0
1
0
1
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
0
1
11111
00000
1111
0000
0
1
e1
1
0 x1
xi
In general, for invertible smooth maps, as above, detM (g) = det ua
, detM (f ) =
det u
xi
a
are called the Jacobians. They are non-zero, with
xi ua
det det = 1, (no summation convention).
ua xi
x1 = cos , x2 = sin ,
as in Section 1.2.3, see Fig. 1. and are coordinates (or parameters) labelling
points in the plane, but they are not components of a position vector (whereas r =
x1 e1 + x2 e2 is). Nevertheless, we can associate basis vectors with these coordinates:
are unit vectors in the directions of increasing and , respectively, with the other
held fixed. These vectors vary with position and are ill-defined at the origin.
In 3 dimensions, we define the coordinates as in Table 1.
5
cylindrical polars spherical polars
, , z r, ,
x1 = cos x1 = r sin cos
x2 = sin x2 = r sin sin , r = |r|
x3 = z x3 = r cos
Basis vectors
e = cos e1 + sin e2 er = cos ez + sin e
e = sin e1 + cos e2 e = sin e1 + cos e2
ez = e 3 e = sin ez + cos e .
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
1
0
0
1
0 ez
1
0
1
0
1
e
11
00
0
1 z 11
00
0
1
0
er
1
0
1
0
1 r
0
1
0
1
1
0
0
1
0
1 x2
0
1
0
1 e
0
1
0
1
00000000
11111111
00000000
11111111
e x1 e
1
0
Figure 2. Cylindrical polar coordinates and spherical polar coordinates. r = |r|, 0 <
2 and 0 .
6
Examples:
1. r(u) = ui + u2j + u3 k for < u < , a curve without end-points.
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
This is a parameterisation of x2 /4 + y 2 = 1, y 0, z = 3.
r(u) C
r(u) r(u+u)
The derivative r (u) is a vector tangent to the curve at each point, provided it is
not zero. The parameterisation is called regular if r (u) 6= 0. We will assume this
holds except perhaps at isolated points, so that the curve can always be divided into
segments on which the parameterisation is regular.
7
The arc length s measured along C, from some fixed point satisfies
s = |r| + o(|r|) = |r (u)u| + o(u) (2.1)
ds dr
= = |r (u)|. (2.2)
du du
The sign fixes the direction of measuring s, for increasing or decreasing u.
Example: A helix r(u) = (3 cos u, 3 sin u, 4u) r (u) = (3 sin u, 3 cos u, 4).
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
ds
du
= |r (u)| = 32 + 42 = 5. Thus the arc length, measured from r(0) = 3i to
r(u) (u > 0) is s(u) = 5u. Hence, the distance from r(0) = 3i to r(2) = 3i + 8 k is
s(2) = 10.
The parameterisation of a curve can be changed: take u u invertible and
smooth (each is a smooth function of the other). The chain rule implies that
dr dr d u
= /
d
u du du
so the tangent vector changes size, but not direction. Choosing u = s, the arc length,
gives a tangent vector t = dr
ds
of unit length along the direction of increasing s.
ds = |r (u)|du is a scalar line element on C.
8
This can be regarded as the limit of a sum over small displacements corresponding
to dividing the interval for u into small segments.
b b
F F
C C
a a
Each term is
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
dr
F (r) r = F (r(u)) + o(u).
du
The left hand side does not depend upon the parameterisation, and so neither does
Eq. 2.3. This can also be checked by the chain rule:
dr dr
F (r) du = F (r) du.
du d
u
R
Using components, with r = xi ei and F = Fi ei , the line integral is C Fi dxi =
R dxi
Fi du du.
dr = r (u)du is the line element on C.
R R
Examples: F (r) = (xey , z 2 , xy), so C F dr = C xey dx + z 2 dy + xy dz.
b=(1,1,1) u=1
C2
C1
a=(0,0,0) u=0
We define C1 : r(u) = (u, u2 , u3 ) r (u) = (1, 2u, 3u2 ). On C1 , F (r) =
2
(ueu , u6 , u3 ). Hence
Z Z 1 Z 1
2
F dr = F r (u) du = ueu + 2u7 + 3u5 du = e/2 + 1/4.
C1 0 0
For C2 : r(t) = (t, t, t) r (t) = (1, 1, 1) (we are now using t as the independent
variable). On C2 , F (r(t)) = (tet , t2 , t2 ). Hence
Z Z 1 Z 1
t 2 t 1
F dr = (te + 2t ) dt = [te ]0 et dt + 2/3 = 5/3.
C2 0 0
9
H
A closed curve has b = a. The line integral is sometimes then written C
F dr
and is called the circulation of F around C.
2.2.2 Comments
R
1. F dr depends on C in general and not just upon the end points a and b.
C
R R
2. C F dr = F r (u) du for orientation a b. This holds whether < or
> .
b=r()
F
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
a=r()
10
The parameterisations on each Ci can be chosen independently.
Example: (see Section 2.2.1)
b=(1,1,1)
C3
C1
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
a=(0,0,0)
R R
C
F dr = C
xey dx + z 2 dy + xy dz, where C = C1 + C3 is a piecewise smooth
closed curve as shown with C3 = C2 . From our previous results,
I Z Z Z Z
F dr = F dr + F dr = F dr F dr
C C1 C3 C1 C2
= (e/2 + 1/4) 5/3 = e/2 17/12.
It is convenient to extend sums to any sets of piecewise smooth curves, even if
they are not connected. This allows for some useful constructions, for example (these
rely on segments with the opposite orientations cancelling):
C1
C2
I I I
F dr = F dr + F dr.
C C1 C2
or
11
C1 C
C2
Proof: Z Z Z
dr
F dr = f dr = du f
C C du
for any parameterisation of C with a = r() and b = r().
Z
d
= (f (r(u))) du
du
as required.
Note that
When F is a gradient, the line integral depends only on the end points, not on
the curve joining them.
H
When C is a closed curve, C F dr = 0 if F is a gradient.
12
2.4.2 Differentials
It is often convenient to work with differentials F dr = Fi dxi as objects that can
be integrated along curves. Such a differential is called exact if it has the form
f f
df = f dr = x i
dxi . So F = f Fi = x i
Fi dxi = df is exact. To test
Fi Fj
if this holds, we can use the necessary condition xj
= xi
(since both are equal to
2f
for nice functions). We will see later that, locally, this is also sufficient. For
xi xj
an exact differential, Eq. 2.4 is
Z Z
F dr = df = f (b) f (a).
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
C C
d(f + g) = df + dg
r
Consider the position of a point particle moving under F (r) according to Newtons
second law: mr = F (r). The kinetic energy of the particle is
1 d
K(t) = mr 2 K(t) = mr r = F r
2 dt
If the path of a particle is a curve C from a = r() to b = r(), then
Z Z Z
d
K() K() = K(t) dt = F r dt = F dr
dt C
13
i.e. the change in the kinetic energy is equal to the work done by the force.
For a conservative force, F = V , where V (r) is the potential energy (setting
R
f = V in Eq. 2.4), C F dr = V (a) V (b), or the work done is equal to the loss
in potential energy. So, in this case,
i.e. the total energy K + V is conserved (in other words, it is constant during the
motion).
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
3 Integration in 2 and 3
3.1 Integrals over subsets of 2
3.1.1 Definition as the limit of a sum
Let D be a subset of 2 . We use the position vector r = (x, y) with Cartesian
coordinates. Consider approximating D by N small, disjoint subsets of simple shape,
e.g. triangles or parallelograms, labelled by I with area AI : each small enough to
be contained within a disc of diameter l.
x
AI
We assume that as l 0 and N the union of all of the small sets D. For a
R P
function or scalar field f (r), we define D f (r)dA = liml0 I f (rI )AI where rI is
some point within each small set. We assume that the limit exists for well-behaved
functions f (i.e. smooth) and D (for example the interior of a non-intersecting closed
curve) independent of all of the choices we have made before taking the limit l 0
and N .
R
Taking f = 1 yields D dA =area of D. Integrals with f 6= 1 are also known as
area integrals, referring to the region, or domain of integration, D.
14
3.1.2 Evaluation as multiple integrals
Area integrals can be expressed as successive integrals over the coordinates x and
y as follows: choose the small sets in the definition to be rectangles, each of size
AI = xy. Summing over subsets in a narrow horizontal strip with y and y
R
fixed and taking x 0 gives a contribution y xy f (x, y) dx with range xy =
{x : (x, y) D}. Then, summing over all such strips and taking y 0 gives
R R R
D
f (x, y) dA = Y xy f (x, y) dx dy where Y is the range of all y in D:
y
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
y+ y
Y y
x
xy
Alternatively, we can first sum over subsets in a narrow vertical strip with x and
R
x fixed and take y 0 to get x Yx f (x, y) dy with range Yx = {y : (x, y)
R
D}. Then, we sum over all such strips and take x 0 to get D
f (x, y) dA =
R R
X Yx
f (x, y)dy dx, where X is the range of all x in D:
Yx
x x+ x
x
X
We can summarise all of this by the statement that the area element is dA = dx dy
in Cartesian coordinates.
15
Example: f (x, y) = x2 y.
y
1
D
0 2 x
Z Z 1 Z 22y Z 1 22y Z 1
2 x3 8
f (x, y) dA = x y dx dy = y dy = y(1 y)3 dy
D 0 0 0 3 0 3 0
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
2
= .
15
Alternatively,
Z Z Z ! Z 1x/2
2 1x/2 2
2 2 y2
f (x, y) dA = x y dy dx = x dx
D 0 0 0 2 0
Z 2
1 x 2
= x2 (1 )2 dx = .
2 0 2 15
3.1.3 Comments
1. We have adopted a notation with the range of a one dimensional integral given
by a set, which may consist of disconnected intervals. For example, X =
R Rb Rb
[a1 , b1 ] [a2 , b2 ] X f (x)dx = a11 f (x)dx + a22 f (x)dx, and similarly for any
number of intervals. The simple pictures we drew to illustrate the arguments
in Section 3.1.2 involve ranges which are single intervals, but the arguments
hold more generally, for example:
Yx , the union of
D these intervals
0 x
2. The result that we can integrate over x and y in either order is called Fubinis theorem.
It holds if f is a continuous function and D is a compact subset of 2 , and
under some more general conditions too. In some cases, f can diverge, or D
R
can become infinite, and D f dA still exists, but these must be handled on a
case-by-case basis.
Less well behaved examples do exist where a different answer is obtained on
integrating in one order or the other, just like in conditionally convergent series.
16
3. In the special case f (x, y) = g(x)h(y) and D = {(x, y) : a x b, c y d},
R Rb Rd
D
f (x, y) dx dy = a
g(x)dx c
h(y)dy.
z
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
D
x
y v
D D
x u
R R
Then for a function (or scalar field) f , D f (x, y) dx dy = D f (x(u, v), y(u, v))|J| du dv,
where x x
(x, y) u
v
J= = y y
(u, v) u v
is the Jacobian. In the formula, |J| is the modulus of the Jacobian.
R
Proof in outline: write D as the limit of the sum of rectangles of area A = uv
in the (u, v) plane. Each rectangle is mapped approximately to a parallelogram of
area A = |J|uv in the (x, y) plane, since up to oterms,
x x
x u
= u v
y y ,
y u v
v
17
and |J| is the factor by which the areas are related. Summing over parallelograms in
the (x, y) plane gives the integral over D as claimed, with the Jacobian factor giving
the correct area for each parallelogram. The result can be summarised as
(x, y)
dA = du dv.
(u, v)
y
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
x
x = cos (x, y) cos sin
and J = = = dA = d d.
y = sin (, ) sin cos
In this simple case, we see that small changes and produce a small area in the
xy plane of approximate size A = , confirming the Jacobian factor. Now we
2 2 2
apply this to regions D and D for f = e(x +y )/2 = e /2 :
D : x2 + y 2 R2 ; x, y 0 D : 0 R; 0 /2.
y
/2
D D
x
R
Z Z Z R Z /2 h iR
2 /2 2 2
f dA = f d d = e d d = e /2 []02 = (1eR /2 ).
D =0 =0 0 2
For this function (a Gaussian), we can take R and the integral still exists. In
this limit, 0 x, y < so
Z Z Z Z Z
(x2 +y 2 )/2 x2 /2 y 2 /2
f dA = e dx dy = e dx e dy = .
D x=0 y=0 0 0 2
R 2 /2
Hence,
p we derive the well known result for the Gaussian integral 0
ex dx =
/2.
18
3.3 Generalisation to 3
3.3.1 Definitions
Let a volume V 3 and a position vector r = (x, y, z). We approximate V by
N small disjoint subsets of simple shape (for example, cuboids) labelled by I with
volume VI , each contained within a solid sphere of diameter l. We assume that as
l 0 and N , the union of the small subsets tends to V . Then, the integral of
a function (or scalar field) f (r) over V is defined
Z X
f (r) dV = liml0 f (rI )VI ,
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
V I
x A= x y
19
R R R
Alternatively, V f (r) dV = z ( Dz f (x, y, z) dx dy)dz, i.e. take x and y to zero
first to get an area integral over slice Dz , then sum over slices and take z 0:
z
range of all z in V
V
y
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
is the Jacobian.
The justification (an informal proof) is similar to the 2-dimensional case: given
a small cuboid with sides u, v, w respectively, we get an approximate small paral-
lelapiped of volume |J|uvw in (x, y, z) space, and we can use these as small sets
in the definition of the integral, hence the result.
Important special cases are cylindrical polar or spherical polar coordinates, as
in Section 1.3. For cylindrical polar coordinates
which means
(x, y, z)
= dV = d d dz.
(, , z)
20
On the other hand, in spherical polar coordinates,
i.e.
(x, y, z)
= r2 sin dV = r2 sin dr d d.
(r, , )
In summary, for a general coordinate system, the volume element is:
(x, y, z)
dV = du dv dw.
(u, v, w)
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
3.3.3 Examples
Example: Consider the volume within a sphere of radius a with a cylinder of radius
b removed (where b < a). Thus V : x2 + y 2 + z 2 a2 and x2 + y 2 b2 .
y
a 1/2
z=(a 22)
x
In cylindrical polar
p coordinates ,p
, z, dV = d d dz and V : b a,
0 2 and a z a2 2 .
2 2
Z Z b Z 2 Z z=a2 2 Z a p 4 2
dV = d d dz = 2 d 2 a2 2 = (a b2 )3/2
V =a =0 z= a2 2 b 3
21
Example: Consider the density of electric charge (r) = 0 z/a in a hemisphere H
of radius a, with z 0 and 0 a constant. What is the total charge of H?
H
y
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
x
In spherical polar coordinates, r a, 0 2 and 0 /2. Also,
dV = r2 sin dr d d.
The total charge in H is
Z Z a Z /2 Z 2
0
Q= dV = r cos r2 sin dr d d
H r=0 =0 =0 a
Z a Z /2 Z 2 a /2
0 0 r 4 1 2 1 3
= 3
r dr sin cos d d = sin []2
0 = a 0 .
a 0 0 0 a 4 0 2 0 4
Checking the dimensions of our answer, we have a volume times a charge density
(which is correct).
22
3.4.2 Change of variables for the case n = 1
For a single variable n = 1, the Jacobian is just J = dx/du and the general formula
reads Z Z
dx
f (x)dx = f (x(u)) du.
D D du
The modulus of dx/du is correct because of our natural convention about integrating
over subsets D and D .
R Rb
For example, suppose D is the interval a x b so D f (x) dx = a f (x) dx.
Let u = , correspond to x = a, b, respectively. Since x u is invertible and
dx dx
smooth in both directions, du 6= 0. Either < and du > 0, in which case
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
R dx
R dx dx
R dx
R dx
D f | du
| du =
f du
du or > and du
< 0, so D f | du
| du =
f ( du ) du.
Rb R dx
Either way, the formula yields a f (x) dx = f (x(u)) du du, a familiar result.
23
Example: Consider a solid hemisphere H with r a and z 0, with uniform density
R
. The total mass M = H dV = 2a3 /3. If we put R = (X, Y, Z):
Z Z a Z /2 Z 2
1
X = x dV = xr2 sin dr d d
M H M r=0 =0 =0
Z a Z /2 Z 2
3 2
= r dr sin d cos d = 0.
M 0
0 0
Y = 0 in a similar way.
Z a Z /2 Z 2
a4 1
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
3 3a
Z = r dr sin cos d d = 2 = ,
M 0 0 0 M 4 2 8
dr
du
d dr
By the chain rule, du f (r(u)) = f du . This is zero, since the curve is in S and
f does not change along it. At any r on S with f 6= 0, there exist two linearly
independent tangent directions to f . Hence
1. In the neighbourhood of this point, the equation indeed describes a surface (it
has two independent tangent directions).
24
Examples: f (r) = x2 + y 2 + z 2 = c. For c > 0, this describes a sphere of radius c.
f = 2(x, y, z) = 2r.
z
f
y
x
f
y
x
For c = 0, the surface degenerates to a single point 0 in the first case, and to a double
cone in the second (it is singular at the apex 0, where f = 0.
z
c=0
y
x
A surface S can be defined to have a boundary S consisting of a piecewise
smooth closed curve. Defining S as above but with z 0, S is the circle x2 + y 2 = c
and z = 0 in the first two cases. A surface is bounded if it can be contained within
some solid sphere, and unbounded otherwise. A bounded surface with no boundary
is called closed.
At each point on a surface, there is a unit normal n (a normal of unit length) that
is unique up to a sign. The surface is called orientable if there is a consistent choice
of unit normal n which varies smoothly. We shall deal exclusively with orientable
surfaces, which rules out examples like the Mobius band:
25
For an orientable surface, the unit normal is then determined everywhere by
a choice of sign. The choce of normal is called the orientation of the surface. For
simple examples, we can specify the orientation by referring to the outward or inward
normal:
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
r
u
r r
u
and v are tangent vectors to curves on S with v and u constant, respectively.
To define a surface, we need two linearly independent directions at each point, so we
require a regular parameterisation:
r r
6= 0
u v
and this defines a unit normal n on S. The small changes u, v produce a small
parallelogram on S, up to oterms with vector area
r r
S = = S n,
u v
r r
where the S = u v
u v is the scalar area.
n
r
v v
S
r
u u
26
Note that with these conventions, the choice of the unit normal is determined
by the order of the parameters u and v. By summing and taking limits, the area of
S is Z Z
r r
dS = u v du dv,
S D
r r
where dS = u v du dv is the scalar area element.
r(, ) = a cos sin i + sin sin j + cos k = aer .
r
= ae , e = cos (cos i+sin j)sin k and
r
= a sin e , where e = sin i+
cos j. Thus
r r
= a2 sin er , where er = n, the outward normal. The scalar area
element is dS = a2 sin d d. Suppose that S is the region 0 , 0 2.
27
n
F
S
D
u v
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
By writing the integral over D as the limit of a sum over small rectangles uv, the
integral over S becomes the limit of a sum of contributions
n F
F (r) S = F (r) n S
r r
= F (r(u, v)) u v + o-terms.
u v
R
From the geometrical nature of the left hand side, the integral S F dS is independent
of the parameterisation, for a given orientation. It is called the flux of F through S.
r r
dS = ndS = du dv
u v
is the vector area element.
Note that changing the orientation of S is equivalent to changing the sign of the
unit normal n, which is equivalent to changing the order of u and v in the definition
of S, which is also equivalent to changing the sign of any flux integral.
r
Example: For a sphere of radius a we found from r(, ) in section 4.2 that = ae ,
r r r
= a sin e . The vector area element is then dS = ndS = d d =
2
a sin er d d, where n = er = r/a, the outward normal.
The fluid flux through a surface provides a physical example. The velocity field
u(r) of a fluid gives the motion of any small volume of fluid at r.
u(r)
28
We assume that it depends smoothly upon r (note that it will also depend upon t in
general). For any small area S on the surface S, the volume of fluid crossing it in
time t, in the sense given by normal n, is the volume of the cylinder shown:
n u t
S
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
29
The scalar area element is therefore
r r r r
dS = du dv =
d
u d
v
u v u
v
(u,v)
since (u,v) du dv = d
u d
v.
The vector area element is
r r r r
dS = du dv = d
u d
v
u v u
v
provided that (u, v) and (
u, v) correspond to the same orientation on S, since then
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
(
u,
v)
(u,v)
> 0.
30
r
w w r
v v
r
u u
r r r
of volume V = u u v
v w
w = |J|u v w. We deduce that the volume
r r r (x,y,z)
element is dV = |J|du dv dw where J = u
v w = (u,v,w)
. This is an alternative
derivation of the result in section 3.3.
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
Let t = n where the unit vector n(s) is called the principal normal and (s) is
called the curvature (we can make positive by choosing an appropriate direction
for n). These quantities can vary from point to point on the curve.
Suppose that the curve can be Taylor expanded around s = 0:
1
r(s) = r(0) + sr (0) + s2 r (0) + O(s3 )
2
1 2
= r(0) + st|s=0 + s |s=0 n + O(s3 ). (5.1)
2
Compare this with the vector equation for a circle passing through r(0) with radius
a, in the plane defined by t and n, as shown:
a
n
t
r (0)
By comparison with Eq. 5.1, we see that we can match a general curve to a
circle, to second order in arc length, by taking a = 1/, the radius of curvature at
the point s = 0.
31
In practice, for a curve r(u) given in terms of some parameter u, we can calculate
t, s, n, by taking derivatives d/du and then using the Chain Rule to convert the
expression to the derivatives d/ds afterwards.
Given t(s) and n(s) we can extend to an orthonormal basis by defining b(s), the
binormal to be b = t n
n
b
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
The geometry of the curve is encoded in how this basis changes along it. This can
be specified by two scalar functions of arc length: the curvature (s) and the torsion
(s) = b n.
[Non-examinable and not lectured:] The torsion determines what the curve looks
like to third order in its Taylor expansion, and how the curve lifts out of the t, n
plane. Analogously to curvature, one defines a radius of torsion = 1/ . If a curve
is in a plane (and the curvature is non-zero), the torsion is zero. A helix has constant
torsion (and it is positive for a right-handed helix, and negative for a left-handed
one).
n
P
If we move the plane, we also move the curve C produced by the intersection. Let
be the curvature of C at P , defined above. As the plane is rotated about n, we find
a range min max where min and max are the principal curvatures. Then
K = min max is called the Gaussian curvature.
The Theorema Egregium is that K is intrinsic to the surface S, i.e. it can be
expressed in terms of lengths, angles etc. which are measured entirely on the surface.
32
This is the start of intrinsic geometry - embedding a surface in Euclidean space
determines length and angles on it, but then we study only geometric aspects of
this intrinsic structure. Eventually, we do away with this embedding altogether, in
Riemannian geometry and general relativity.
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
We can check this theorem when S is a sphere of radius a, for which geodesics are
great circles. Its easy to see that min = max = 1/a so K = 1/a2 , a constant.
We have a family of geodesic triangles D, as shown, with 1 = , 2 = 3 = /2.
R
Since K is constant over S, D K dA = K area of D = a12 (a2 ) = . Then
1 + 2 + 3 = + , agreeing with the prediction of the theorem. By contrast, the
extrinsic geometry is to do with how surfaces are embedded in the space.
=i +j +k = ( , , ).
x y z x y z
33
Let us use basis vectors that are orthonormal and right-handed: ei ej = ij and
ei ej = ijk ek .
(nabla, or del) is both an operator and a vector. We can therefore apply it to a
vector field F (r) = Fi (r)ei using either the scalar or vector product. The divergence,
or div of F is:
Fi
F = (ei ) (Fj ej ) =
xi xi
and is a scalar field.
The curl of F is defined to be
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
Fj
F = (ei ) (Fj ej ) = ijk e
xi xi k
and is a vector field. It can also be written as
e e e
1 2 3
x1 x2 x3 .
F1 F2 F3
Example: F = xezi + y 2 sin xj + xyz k.
2
F = (xez ) + (y sin x) + (xyz) = ez + 2y sin x + xy.
x y z
2
F = (xyz) (y sin x) i + z
(xe ) (xyz) j +
y z z x
2
(y sin x) (xe ) k
z
x y
= xzi + (xez yz)j + y 2 cos xk.
(f + g) = f + g
(F + G) = F + G
(F + G) = F + G
for any constants and . Grad and div can be analogously defined in any dimension
n, but curl is specific to n = 3 because it uses the vector product.
34
r r
Examples: Consider r = xi ei , r2 = xi xi 2r x i
= 2xi , or xi
= xi /r.
1. r = ei x i (r ) = ei r1 x
r
i
= ei r1 xi /r = r2 r = r1 r
xi
2. r = xi
= ii = 3.
(f g) = (f )g + f (g),
(f F ) = (f ) F + f ( F ) and
(f F ) = (f ) F + f ( F )
are frequently useful. There are also some less commonly used properties too:
(F G) = F ( G) + G ( F ) + (F )G + (G )F (6.1)
(F G) = F ( G) G( F ) + (G )F (F )G
(F G) = ( F ) G F ( G).
Examples:
F
1. (F G) =
xi
(F G)i =
( F G )
xi ijk j k
= kij xji Gk Fj jik G
xi
k
.
Gj
[F ( G)]i = Fj (F )Gi (6.2)
xi
Taking Eq. 6.2 (along with the F G counterpart and substituting into the
right hand side of Eq. 6.1 yields its left hand side, i.e. the formula for
Fj Gj
[(F G)]i = Gj + F j .
xi xi
Using the Leibniz properties, we can extend our results to some previous examples:
Examples:
2. (r r) = (r ) r + r r = r2
r
r = 0.
35
6.3 Second Order Derivatives
If F = , the field is called conservative. If F = 0, a field is called or irrotational.
Two different combinations of grad, div and curl vanish identically: for any scalar
field f , (f ) = 0 and for any vector field A, ( A) = 0. Both of these
results follow from the symmetry of mixed partial derivatives:
2f 2 Ak
kij = 0, kij = 0.
xi xj xi xj
The converse of each result also holds for fields defined on all of 3 :
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
F = 0 F = f
H = 0 H = A,
2 A = ( A) ( A).
7 Integral Theorems
The following closely-related results generalise the Fundamental Theorem of calculus:
that integration is the inverse of differentiation. To generalise to higher dimensions,
we need the right kind of derivative to match the right kind of integral.
36
where A is a bounded region in the (x, y) plane with boundary A = C, a piecewise
smooth, non-intersecting closed curve, traversed anti-clockwise.
Examples:
R R
1. P = x2 y and Q = xy 2 A (y 2 x2 ) dA = C x2 y dx + xy 2 dy. If C is the
parabola y 2 = 4ax and the line x = a, both with 2a y 2a, then each
104 4
integral above gives 105 a (see Examples Sheet I).
y C
A
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
a x
y2=4ax
y
b
C
A
a x
R Rb Ra Rb R
Also, A Qx
dA = 0
dy 0
dx( Q
x
) = 0
[Q(a, x) Q(0, y)] = C
Qdy, since only
the vertical segments contribute. Adding these two integrals gives the result in
the theorem.
Greens theorem also holds for a bounded region A with the boundary A con-
sisting of a number of disconnected components (each being a piecewise smooth, non-
intersecting closed curve) with anti-clockwise orientations on the exterior boundary
and clockwise orientations on the interior boundary. For example,
C1
C2 C3
37
These can be related to the case of a single boundary component using the construc-
tion from section 2.3:
since the horizontal parts cancel in the opposite directions (in the limit that they are
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
S C
n
t
t n
38
Example: Take S to be the section of a sphere of radius a: 0 :
n=e r
S
C
a
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
39
Example: Take S to be the section of the cone z 2 = x2 + y 2 , a z b (where
b > a > 0). In cylindrical polar coordinates r(, ) = ( cos , sin , ), a
b, 0 2.
z
Cb
n
S
Ca
y
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
r r
= (cos , sin , 1) ( sin , cos , 0) = ( cos , sin , )
dS = ( cos , sin , ) d d,
which gives the normal n as shown on the diagram. Taking F = (0, xz, 0) F =
(x, 0, z).
F dS = (2 cos2 + 2 ) d d
Z Z b Z 2
F dS = d d2 (3/2 +
cos
2/2) = (b3 a3 ).
S a 0
Cb Ca
40
where V is a bounded volume with boundary V = S, a piecewise smooth closed
surface with normal n pointing outwards.
Example: Take V to be the solid hemisphere x2 + y 2 + z 2 a2 , z 0. V = S =
S1 (hemisphere) + S2 (disc).
n
S1
S2
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
n
R
F = (0, 0, z + a) F = 1. Then, V
F dV = 2/3a3 , the volume of the
hemisphere.
On S1 : dS = n dS = (x, y, z)/a dS.
C
k
t = (dx/ds, dy/ds, 0) is the unit tangent to C and n = (dy/ds, dx/ds, 0) is the unit
normal to C out of A. n = t k, where k = (0, 0, 1).
41
Given any P (x, y) and Q(x, y), consider the vector field F = (P, Q, 0) F =
(0, 0, Q
x
P
y
). Then,
Z Z
F t ds = P dx + Q dy (7.1)
C C
and Z Z
Q P
( G dA = ( ) dA. (7.2)
A A x y
Thus an application of Stokes theorem for F equates the two left-hand sides of
Eqs. 7.1, 7.2 whereas Greens theorem equates the two right-hand sides. Therefore,
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
and Z Z
Q P
( G) k dA = ( ) dA. (7.4)
A A x y
Again, Greens theorem equates the right hand sides of Eqs. 7.3,7.4 whereas the
two-dimensional version of the divergence theorem for a vector field G in the (x, y)
plane Z Z
G dA = G nds (7.5)
A C=A
relates the left-hand sides:
G
We have chosen this order of integration because G = y
. Depending on x, the
range Yx may be a single interval or a union of intervals:
42
y
C j
n n
C+
Yx
j
A
C
n
x x
I
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
for any x in I.
n j
C+
s
On C+ : n j 0 and dx = n jds
x
C
On C : n j 0 and dx = n jds
x
Hence
Z Z
G
dy dx = (G(x, y+ (x)) G(x, y (x))) dx
I y
ZI Z
= G n ds + G n ds.
C+ C
43
y
C+
A
C
x
X
as shown, we take I = X and A = C+ + C and the integrals over C combine to
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
R
give C G n ds, proving the result.
For regions A of more general shape, either:
1. Divide A into simpler subsets which do have the property above. The result
holds on each subset Ai it holds on A
A4
A1
A5
A2
A3
2. Proceed directly, even if Yx is a sum of intervals and get a sum over all boundary
segments corresponding to x in the range I
C1
C2
C3
I
Z Z XZ
G
dy dx = G n ds.
I Yx y i Ci
44
7.2.3 Greens theorem Stokes theorem
Consider a parameterised surface given by r(u, v), corresponding to a region A in the
u, v plane. Suppose the boundaries are given by A : u(t), v(t) and S : r(u(t), v(t))
for parameter t.
v
n
A
S
S A
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
45
Greens theorem equates the left-hand sides of Eqs. 7.9,7.10 whereas Stokes equates
their right-hand sides.
The conclusion is that the left hand sides being equal (i.e. Greens theorem for
Fi , Fv on A Stokes theorem for F on S).
z
Z Z Z !
F
F dV = dA,
V D Zxy z
where Zxy is the range of the z integration for fixed x, y (in general this will be the
union of disjoint intervals).
z
V
zxy
y
D
(x,y)
x
The boundary surface S = V can be divided into surfaces S on which k n 0.
S is given by z = z (x, y) at the ends of some interval in Zxy .
k n
S+
S From the diagram, A = k n S on S (the signs are given
by the fact that n is always the outward normal). The integral
of F
z
then gives
k
F (x, y, z+ (x, y)) F (x, y, z (x, y))
S R
and the integral of this over D can be written S+ F k n dS +
R R
S as required.
F k n dS, or in total V F dS,
n S
46
8 Some Applications of Integral Theorems
8.1 Integral Expressions of Div and Curl
The divergence theorem for a vector field F applied to a small volume V containing
a point r0 gives Z Z
F dS = F dV ( F )(r0 )V.
V V
r0
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
Similarly, Stokes theorem applied to a small part of a plane with area A and normal
n gives Z Z
F dr = ( F ) n dA (n F )(r0 )A.
A A
n
F
A ro
At r0 ,
R
n F = limA0 A1 A
F dr.
47
Examples: The expressions above allow interpretations of u and u when u is
a velocity field, describing fluid flow.
R
1. Divergence: we know from section 4.3 that S u dS is the rate of fluid crossing
S (i.e. per unit time). Taking V to be the volume occupied by a fixed material
R
quantity of fluid, we therefore have V = S u dS (for a boundary V at some
fixed time interval, tV is the approximate volume that crosses in time t,
as the volume occupied by the fixed material quantity changes). Then, from
above, at r0 , u = limV 0 V /V , the local (i.e. at r0 ) relative rate of change
of volume occupied by a fixed material quantity of fluid.
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
For example, u(r) = r, fluid flows out of the origin. u = 3, i.e. the
volume increases at the same constant relative rate everywhere.
u
t
R R
u dr = A u t ds = (2a) the average of u t, the component tangential
A
to the boundary = 2a2 , where we define = 1/a (the average of u t), as
the local angular velocity around n. Then
1
n u = lima0 (2a2 ) = 2
a2
at r0 , i.e. twice the local rate of rotation. For example, u = r, rigid motion
with constant angular velocity about 0, and ( r) = 2, twice the rate
of rotation.
8.2 Conservative and Irrotational Fields, and Scalar Potentials
Consider three ways to define a vector field F , (A) and (B) are equivalent definitions
of a conservative vector field, and (C) defines an irrotational field:
(A) F = f for some scalar field f (r).
R
(B) C F dr is independent of C, for some given end points and orienta-
tion.
(C) F = 0.
R
(A), (B) and (C) are all equivalent for F (r) on 3 . (A)(B) with C F dr = f (b)
f (a) was shown in section 2.4. (A)(C) was shown in section 6.3 (via f = 0.)
We shall now show that (C)(B) and (B)(A) in 3 . Given F (r) obeying (C)
on 3 , let C and C be any two curves from a b:
48
~ b b
C
S
a C a S
R R R
If S is any surface with boundary S = C C, F dr C F dr = S F dr =
R C
S
F dS by Stokes theorem. Hence, (C)(B).
R
To show (B)(A), we must define f (r). Let us fix a and define f (r) = C F (r )
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
C r+ r
C r
But we have f = f r + o(|r|) from the definition of grad. Since this holds for
any r, we have that F = f and hence (B)(A), as claimed.
If F (r) is defined ans satisfies (C) only on a subset D of 3 , then (B) and (A) will
still hold on D if it is simply connected. By definition, this means that any curves C
and C with fixed end-points can be smoothly deformed into one another in D. Such
a deformation generates a smooth surface S with S = C C, so the argument for
(C)(B) holds as above and (B)(A) still follows.
If D is not simply connected, then the construction above produces a multi-
valued scalar field f (r) on D in general. However, we can always choose to restrict
to a simply connected subset D0 D such that f (r) will be single-valued on D0 .
49
Example: F = (y/(x2 + y 2 ), x/(x2 + y 2 ), 0) is well-defined and obeys F = 0 on
D = 3 {z-axis}, which is not simply connected.
~
C
C
, usual polar
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
angle
j
Q(t)
Q(t) is the total amount of some quantity in V ; (r, t) is the density for this quantity,
with (r, t)V the amount in a small volume V ; the flux of j out of S gives the rate
of quantity leaving V , according to the equation above.
50
Examples:
1. Conservation of electric charge. (r, t) is the charge density, (r, t)V is the
charge in a small volume V . Q(t) is the total charge in V . j(r, t) is the electric
current density, and j S is the charge flowing through S per unit time.
S
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
2. Conservation of mass for a fluid. (r, t) is the mass density and j = u with
u(r, t) being a fluid velocity field. By extension of the discussion of fluid volume
in section 4.3, ut nS is the mass of fluid that crosses S in time t, so
dQ
R
dt
= S
j dS does indeed imply conservation of mass.
n u t
51
(see the Jacobian condition in section 4.4.2). These vectors are tangent to the curves
parameterised by u, v and w when the other two are held fixed.
We say that u, v, w are orthogonal curvilinear coodinates if the tangent vectors
r r r
above are orthogonal. Then, we can set u = hu eu , v = hv ev , w = hw ew , with
hu > 0, hv > 0, hw > 0 and eu , ev , ew forming an orthonormal and right-handed
basis (right-handed meaning eu ev = ew , which may be achieved by ordering the
coordinates appropriately). In these coordinates, the line element is
dr = hu eu du + hv ev dv + hw ew dw
and the positive scale factors hu , hv , hw determine the changes in length along each
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
Examples:
hx = hy = hz = 1, e = i,
1. Cartesian coordinates r(x, y, z) = xi + yj + z k. x
ey = j, ez = k.
h = hz =
2. Cylindrical polar coordinates r(, , z) = (cos i + sin j) + z k,
1, h = . e , e and ez are as in section 1.3.
3. Spherical polar coordinates r(r, , ) = r(cos sin i + sin sin j + cos k),
hr = 1, h = r, h = r sin . er , e , e are as in section 1.3.
Consider a surface with w constant (say), parameterised by u and v. The vector
r r
area element is dS = u v du dv (a general formula) = hu eu hv ev du dv =
hu hv ew du dv, for orthogonal coordinates.
ew ev
eu
52
with
= (f ) dr.
(this is the coordinate independent definition of grad, see section 1.1). From the
expression for dr above, and since the basis vectors are orthonormal,
1 f 1 f 1 f
f = eu + ev + ew
hu u hv v hw w
is the gradient in general orthogonal curvilinear coordinates. To apply this, we need
to know the various h factors.
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
and
1
F = (hv hw Fu ) + (two similar terms) . (9.2)
hu hv hw u
There are a number of ways these can be derived2 :
53
3. Use integral expressions for div and curl as in section 8.1. This approach gives
good geometric insight.
To follow (3) for curl, consider the area defined by changes u and v with w fixed.
C is the small circuit shown,
ew ev u
u+ u
eu v+ v
C
v
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
ev
eu
Z
F dS [hu hv Fw (u, v, w + w) hu hv Fw (u, v, w)] uv + two similar terms
S
(hu hv Fw )uvw + two similar terms.
w
54
Then, dividing by V = hu hv hw uvw and taking the limit gives the formula for
F above.
In this course, we need to know how to apply these formulae in particular coor-
dinate systems.
11111
00000
00000
11111
00000
11111
00000
11111
00000
11111
00000
11111
00000
11111
r=a
S:r=R
55
Consider Gauss Law with S being a sphere of radius r = R > a. The outward
normal is r.
Z Z Z
g dS = g(R)
r rdS = g(R)dS = 4R2 g(R)
S S S
GM
g(r) = r (r > a).
r2
The gravitational force on a mass m at r is therefore F (r) = GM m r/r2 . In
the limit a 0 we get a point mass M at the origin 0 and we recover Newtons Law
of Gravitation for point masses.
For any a, the field and the force for r > a depends only on the total mass M ,
assuming spherical symmetry (and no mass in r > a). This simple behaviour means
that we can identify the gravitational field that keeps the moon in orbit (where the
earth is approximately point-like) with the one that pulls an apple from a tree (where
the earth is certainly not point-like).
R R
The condition C g dr = 0 for any closed C can be re-written S gdS = 0
for any S with S = C by Stokes theorem. Since S is arbitrary, this g = 0.
In our above example with spherical symmetry, g = 0 automatically.
If the mass distribution is not sufficiently symmetrical, it is difficult to solve for
g(r) from Gauss Law in integral form, but it can be re-cast in differential form.
Suppose that the mass M arises from a mass density (r). Then
Z Z Z
g dS = 4GM gdV = 4G (r) dV
S V V
R
with S = V by Gauss theorem V ( g + 4G) dV = 0 for any volume V ,
hence
g = 4G,
Gauss law in differential form. Furthermore, since g = 0, we can introduce the
gravitational potential (r) with g = and then Gauss law becomes
2 = 4G,
Poissons equation. In the previous case with spherical symmetry, we can choose
(r) = GM/r for r > a. It obeys the boundary condition 0 as r .
56
10.2 Laws of Electrostatics
Consider a distribution of electric charges at rest. They produce a force on a charge
a at rest at r which is proportional to q: F = qE(r), where E(r) is the electric field
(a vector field), or force per unit charge.
R
The electric field obeys C E dr = 0 for any closed curve C, i.e. E(r) is conser-
vative. Also, it obeys Gauss Law:
Z
Q
E dS =
S 0
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
for any closed surface S which is the boundary of a volume V , with Q being the total
charge contained within V . 0 is the permittivity of free space, a constant of nature.
Mathematically speaking, this is identical to the gravitational case in section 10.1,
and the laws above can be re-expressed in differential form in the same way: E = 0
and
E =
0
(Gauss Law in differential form), where (r) is the electric charge density. these
can be re-expressed as E = , with (r) being the electrostatic potential and
2 = /0 , Poissons equation.
For example: take a total charge Q distributed with spherical symmetry about
0 and all of the charge being contained within a radius r = a. We solve this using
Gauss Law in integral form, just as in the gravitational case. Thus,
Q r
E(r) =
40 r2
for r > a and
Q 1
(r) = .
40 r
we obtain a point charge at 0 in the limit a 0. from this result for E, we recover
Coulombs Law for the force on another point charge q at r:
qQ r
F = qE = .
40 r2
57
Example:
The electric field due to a line of charge. is the charge per unit
length along the zaxis. We have cylindrical symmetry, but here
we shall use r for the radial coordinate in cylindrical polar coor- n z
58
is the general solution.
A 1 2
(r) = +B 0 r .
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
|r {z } |6 {z }
general solution of Laplaces equation particular integral
For a unique solution to Poissons (or Laplaces) equation, we must specify the
boundary conditions. If is defined in all of 3 , we often require 0 as |r| .
If is a solution on a bounded volume V , then there are two common kinds of
boundary condition on V :
The choice of boundary condition depends on the physical content, for example
specifying /n corresponds to specifying the normal component of g or E. We
can also specify different boundary conditions on different boundary components.
4
Notice the new notation of a derivative with respect to a vector!
59
Example: We seek a spherical symmetric (r) with
2 0 ra
= ,
0 r>a
The physical interpretation is that is the gravitational potential inside and outside
a planet of constant density 0 , radius a and total mass M . We restore gravitational
conventions by replacing 0 4G0 and write the results above in terms of
M = 4/3a3 0 . A and D are fixed to be zero by the boundary conditions at r = 0, .
The matching at r = a implies
1
: B + 4G0 a2 = C/a
6
1
: + 4G0 a = C/a2
3
GM/(2a) ((r/a)2 3) ra
(r) =
GM/r r>a
The gravitational field g = = g(r)er with
GM r/a3 ra
g(r) = (r) = .
GM/r2 r>a
60
Example: An alternative solution is to apply Gauss Law for a flux of g = g(r)er out
of S, a sphere of radius R. R a has already been covered in section 10.1 (the result
is independent of how the mass is distributed, provided there is spherical symmetry).
S
R
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
For R a, we get
Z
g dS = 4R2 g(R) = 4G M (R/a)3 g(R) = GM R/a3 ,
S | {z }
mass in Ra
1. (r) is unique, or
61
Proof: Let 1 (r) and 2 (r) satisfy Poissons equation, each obeying the boundary
conditions (1) or (2). Then (r) = 2 (r) 1 (r) satisfies 2 = 0 on V by linearity
and (D) = 0 on S or (N) /n = n = 0 on S. By the divergence theorem,
R R
V
() dV = S
dS. But
() = + 2 = ||2 .
| {z }
0 on V
So Z Z
2
|| dV = dS = 0
V S n
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
by either (N) or (D). Since ||2 0, the integral can only vanish if || = 0
= 0 = c, a constant, on V . So:
(D) = 0 on S c = 0 and 1 = 2 on V , or
(N) 2 (r) = 1 (r) + c as claimed.
11.1.2 Comments
The result says nothing about existence. Indeed, for 2 = on V with
n
= g on V (Neumann) there can only be a solution if
Z Z Z Z
2
dV = dS dV + g dS = 0, (11.1)
V V n V V
but and g are fixed to begin with, so Eq. 11.1 may not necessarily be true.
The uniqueness result has practical importance: if we can find a solution by any
means, then we know it is the only solution. For example: find a solution with
symmetry (r), or more generally, linear combinations of separable solutions5
= X(x)Y (y) or R(r)Y ().
The theorem can be stated and proved similarly for regions in 2,3,... : it just
uses definitions of grad and div, and the divergence theorem.
62
Similar results apply to related equations and various kinds of boundary con-
ditions, perhaps with D or N being different parts of the boundary:
or
Z Z Z
(uv) dS = u v dV + u2 v dV,
S V V
63
11.2.2 The Maximum (or Minimum) Principle
We say that (r) has a local maximum at a if, for some > 0, (r) < (a) when
0 < |r a| < . The maximum principle states that if a function is harmonic on a
region V , then cannot have a local maximum at an interior point a of V .
Proof: If has a local maximum at a in the interior, we choose so that |ra| <
lies within V (this is possible, since a is an interior point) and then, for any R with
0 < R < , (r) < (a) for |r a| = R (R) < (a). But this contradicts the
mean value property for a harmonic function.
The result is consistent with the usual analysis of stationary points by differenti-
2
ation. Suppose at r = a, = 0 and Hij = xi x
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
j
has eigenvalues i . is harmonic
2 P
2 = xi x
i
= Hii = i i = 0. (Note that the sum of eigenvalues is equal
to the trace of the Hessian matrix H). This implies that there exists at least one
eigenvalue of each sign, and hence the stationary point can only be a saddle point
(i.e. there is no local maximum or local minimum) unless all i = 0, in which case
this is inconclusive. This standard analysis is consistent with, but not as powerful
as, the maximum principle. Everything works analogously for minima instead of
maxima.
Now consider a general distribution of sources (r) with (r )dV being the contri-
bution from any small volume at position r .
(r)dV r
r 0
P R
To sum over all such contributions, we replace by V dV in the formula above,
which suggests that Z
1 (r )
(r) = dV (11.2)
4 V |r r |
64
is the solution to Poissons equation 2 = and is the integral with respect to r
taken over the region V on which (r ) is non-zero ( = 0 outside V ). This is indeed
the solution with boundary conditions |(r)| = O(1/|r|) and |(r)| = O(1/|r|2 ).
For (r ) non-zero but suitably well-behaved as |r | we can also take V = 3 ,
the integral over all space.
Example: Using Eq. 11.2, we can calculate (r), the solution to:
2 0 |r| a
= .
0 |r| > a
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
We fix r and introduce polar coordinates r , , for r with respect to this fixed
R
direction (r). Then, (r) = 4 1 0
V |rr |
dV , where dV = r 2 sin dr d d, V :
r = |r | a and |r r | = (r2 + r 2 2rr cos )1/2 so the integrand is independent
of .
r
Therefore,
Z Z Z
1 a
2
0 r 2 sin
(r) = dr d d
4 0 r 2 + r2 2rr cos
0 0
aZ hp i
2 1
= 0 /2 dr r r 2 + r2 2rr cos .
0 rr =0
hp i
2r for r > r
r 2 + r2 2rr cos = |r + r | |r r | = ,
=0 2r for r < r
so ( Ra 2
00
dr rr = 0 31 a3 1r (r > a)
(r) = R
r r
2 Ra ,
0 0 dr r + r dr r = 0 16 r2 + 21 a2 (r a)
the solution we obtained previously in section 10.3.
65
theorem holds if we take 2 = (ra) where6 the three-dimension delta function
R
is a generalised function (or distribution) satisfying V f (r)(ra) dV = f (a) for any
volume V containing a. Using this, we can verify the integral solutions of Poissons
equations immediately:
Z Z
2 1 2 1
(r) = (r ) dV = (r )(r r )dV = (r),
4 V |r r | V
as required.
12 Maxwells Equations
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
B
E = 0
E+ =0
t
E
B =0 B 0 0 = 0 j,
t
where 0 is the permittivity of free space and 0 is the permeability of free space
(they are both physical constants). Then, the Lorentz Force on a point charge q due
to these fields is
F = q(E + r B).
In principle, this gives a complete description of all classical electromagnetic phe-
nomena.
Conservation of electric charge comes from Maxwells equations:
+ j = 0.
t
Gauss Law for E is unmodified, and now we have a similar law for B too (in
electrostatics), but with zero magnetic charge. In integral form:
Z Z
E dS = Q/0 , B dS = 0.
S S
R
The remaining Maxwells equations also have integral forms, for example C E dr =
R
dtd S B dS for any surface S with a boundary of a closed curve C, from Stokes
theorem.
6
The expression for 2 above makes sense intuitively for a point source concentrated at a.
66
12.2 Static Charges and Steady Currents
If , j, E and B are all independent of time, then E and B are no longer linked, since
Maxwells equations then imply:
Electrostatics Magnetostatics
E = /0 B =0
E =0 B = 0 j
E = and 2 = /0 , the scalar B = A, but the vector potential is
Poisson equation described in section 11.3. ambiguous: A A + produces the same
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
B E 2E
2 E = ( E) ( E) = = ( B) = (0 0 ) = 0 0 2 ,
t t t t t
or
2 1 2 2 1 2
2 2 E = 0, and similarly we can obtain 2 2 B = 0,
c t c t
where c2 = 1/(0 0 ). These are wave equations in three dimensions describing prop-
agation with speed c. We compare with the one dimensional wave equation for a
scalar 2
1 2
f = 0, f = f (x ct)
x2 c2 t2
where f+ describes a right moving wave and f a left moving wave.
So, Maxwells equations predict electromagnetic waves which move with speed
c = 1/ 0 0 2.998 108 ms1 , i.e. the experimentally measured values of 0 and
0 combine to give the experimental value of the speed of light.
Maxwell famously said We can scarcely avoid the inference that light consists of
the transverse modulation of the same medium which is the cause of electric and mag-
netic phenomena. This was the unification of the theories of electricity, magnetism
and light.
67
13 Tensors and Tensor Fields
13.1 Definitions and Examples
13.1.1 Tensor transformation rule
Consider orthonormal right-handed bases {ei } and {ei } in 3 with corresponding
Cartesian coordinates {xi } and {xi }: x = xi ei = xi ei . Any such bases are related
by a rotation: ei = Rip ep , xi = Rip xp where
The components of a vector v transform in a similar way under this change of basis
and coordinates: v = vi ei = vi ei vi = Rip vp .
Tensors are geometric objects which obey a generalised form of this transfor-
mation rule. By definition, a tensor T of rank n has components Tij...k (n indices)
with respect to each basis {ei } or coordinate system {xi } and under a change of
basis/coordinates
Tij...k = Rip Rjq . . . Rkr Tpq...r .
This is the tensor transformation rule.
T of rank 0, i.e. no indices T = T , a scalar
Examples: T of rank 1, Ti = Rip Tp , a vector
T of rank 2, Tij = Rip Rjq Tpq form a matrix.
i.e the correct rule. Any linear combination of such expressions are also tensors,
for example Tij = ui vj + ai bj for any u, v, a, b.
2. ij and ijk are tensors of rank 2 and 3 respectively, with the special property
that their components are unchanged with respect to any basis/coordinate
system: ij = Rip Rjq pq = Rip Rjp = ij which is the tensor transformation
rule with ij = ij and Rip Rjq Rkr pqr = (detR)ijk = ijk which7 is the tensor
transformation rule with ijk = ijk .
7
Note that here, we used a general result from Vectors and Matrices IA, namely
Mip Mjq Mkr pqr = (detM )ijk .
68
13.1.3 Rank 2 tensors and matrices
A linear map between vectors u and v has the general form vi = Mij uj or vi = Mij uj
with respect to different choices of basis/coordinate system. Using the transformation
rule for vectors: vi = Rip vp = Rip (Mpq uq ) = Mij uj = Mij (Rjq uq ). These expressions
are compatible for any u iff Rip Mpq = Mij Rjq or Mik
= Rip Rkq Mpq ; this is the
tensor transformation rule for rank 2. In matrix notation M = RM RT , which is the
standard transformation for the matrix of a linear map under an orthogonal change
of basis (RT = R1 ).
Example: In many substances, an applied electric field E induces an electric current
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
(T + S)ij = Tij + Sij = Rip Rjq Tpq + Rip Rjq Spq = Rip Rjq (Tpq + Spq ) = Rip Rjq (T + S)pq .
This is a tensor (check the transformation rule yourself). The tensor product can be
defined similarly for any number of tensors, for example for n vectors u, v, . . . w we
define T = u v w by Tij...k = ui vj wk as in section 13.1.
13.2.3 Contractions
Given a rank n tensor T with components Tijp...q , we define a new tensor S of rank
n 2 by Sp...q = ij Tijp...q = Tiip...q . This is called contracting on the indices i and
69
j. Contracting on a different pair of indices results in a different tensor, in general.
To check that contraction produces a tensor, we take the example of Tij of rank 2.
Contracting Tii of rank 0 (i.e. a scalar), Tii = Rip Riq Tpq = pq Tpq = Tpp . Higher rank
tensors involve additional indices that are unaffected by the contraction. Note that
with Tij regarded as a 3 by 3 matrix, Tii is the trace. Tii = Tr(T ) = Tr(RT RT ) =
Tr(T ) = Tpp , in matrix language.
index pair). This holds in all coordinate systems if it holds in any one, since
Tklr...s = Rki Rlj Rrp . . . Rsq Tijp...q = Rki Rlj Rrp . . . Rsq Tjip...q = Tlkr...s .
A tensor is called totally symmetric/anti-symmetric if it is symmetric/anti-symmetric
in every pair of indices. For example ij = ji is totally symmetric and ijk is totally
anti-symmetric. There are totally symmetric tensors of arbitrary rank n, but in 3 :
1. Any totally antisymmetric tensor of rank 3 has the form Tijk = ijk for some
scalar .
2. There are no totally antisymmetric tensors of rank n > 3 except (trivially) the
tensors with all components zero.
70
13.3.2 The quotient rule
If Ti . . . j p . . . q is a tensor of rank n + m and up...q is any tensor of rank m, then
| {z } | {z }
n m
This is the quotient rule. We have seen and proved the special case n = m = 1
in section 13.1.3. The general case can be proved in a similar way, checking the
transformation rule directly, but the proliferation of indices makes the argument
look more complicated than it really is.
To streamline things, we shall use the ideas in section 13.3.1. It is sufficient to
take the special form up...q = cp . . . dq for any vectors c, . . . , d; by assumption vi...j =
Ti...jp...q cp . . . dq is a tensor, but then vi...j ai . . . bj = Ti...jp...q ai . . . bj cp . . . dq is a scalar
for any vectors a, . . . , b, c, . . . d. This is enough to ensure the tensor transformation
rule for the components Ti...jp...q , as observed in section 13.3.1 above.
except for where things obviously fail, for example where T is not defined. This is a
new tensor of rank n + m.
x
To verify the transformation rule, consider the derivatives: xi = Rip xp xpi =
xq
Rip , and xq = (R1 )qi xi = Riq xi xi
= Riq . Note that Rip and Riq are constant
matrices. Now, by the chain rule
xq
=
= Riq .
xi xi xq xq
71
transformation rule. The definition of tensor derivatives involves comparing tensors
at different points, but otherwise one just uses linear combinations and then limits.
For example: derivatives of a scalar field
= Rip Rjq Rkr = Rip Rjq Rkr
xi xj xk xp xq xr xp xq xr
which is the required transformation rule (all for Cartesian coordinates related by a
constant R). For just one derivative, we have ()i = x i
, a vector.
Because we can add tensors and take limits (by assumption) the definition of a tensor
R
valued integral is straight forward. For example, V Tij...k (x) dV and the result is a
tensor (it is easiest to think of the integral as the limit of a sum).
For a physical example, recall our discussion of the flux of quantities for a fluid
with velocity field u(x) through a small surface element (assuming a uniform density
)
n
u
vl
v = = ai b j c k Tij...kl ,
xl xl
n v = nl vl = ai bj ck Tij...kl nl , (13.1)
and we obtain the tensor divergence theorem, but contracted with ai bj cl on each
side. Since a, b, , c are arbitrary, they can be cancelled off and the result follows.
72
14 Tensors of Rank 2
14.1 Decomposition of a Second Rank Tensor
Any second rank tensor can be written as a sum of its symmetric and anti-symmetric
parts: Tij = Sij + Aij has 9 independent components (IC), where Sij = 12 (Tij + Tji )
has 6 IC and Aij = 21 (Tij Tji ) has 3 IC. In fact, the symmetric part can be reduced
further: Sij = Pij + 13 ij Q where Pii = 0, or Pij is traceless, and Q = Sii is the trace
of Sij . Pij has 5 IC whereas Q has 1. The antisymmetric part can be re-expressed:
1
Aij = ijk Bk Bk = ijk Aij
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
2
from the double- identities, or
0 B3 B2
(Aij ) = B3 0 B1 .
B2 B1 0
To summarise,
1
Tij = Pij +ijk Bk + ij Q
|{z} |{z} 3 |{z}
symmetric traceless vector scalar
where Bk = 12 ijk Tij , and only the antisymmetric parts of T contribute. Qk = Tkk
where only symmetric parts of T contribute.
Fi
Example: A vector field Fi (r) derivative Tij = xj
, a tensor field. The decomposition
Fj
Fi
given above has a symmetric traceless piece Pij = 21 ( x j
+ xi
) 13 ij Fk
xk
, an anti-
Fi
symmetric piece Aij = ijk Bk with Bk = 21 ijk x j
= 12 ( F )k and a trace Q =
Fk
xk
= F . Hence, a complete description of the derivative of a vector field F
involves a scalar F , a vector F and a symmetric traceless tensor Pij .
73
P
For a rigid body occupying volume V with mass density (r), we replace m
R
V
(r) dV above to obtain
Z
Iij = (r) (xk xk ij xi xj ) dV.
V
R R
For example, I33 = V (r)(x21 + x22 ) dV and I12 = V
(r)x1 x2 dV . Iij is a
symmetric tensor (by inspection).
Example:
x3
Copyright 2016 University of Cambridge. Not to be quoted or reproduced without permission.
The inertia tensor for a cylinder of radius a and height 2l, uni-
form density 0 and mass M = 2a2 l0 . We choose coordinates
x2 x1 = r cos , x2 = r sin , dV = r dr d dx3 , with x3 being the
x1 axis of symmetry.
Then
Z Z a Z 2 Z l
I33 = 0 (x21 + x22 ) dV = 0 dr d dx3 rr2 = 0 la4 ,
V 0 0 l
Z Z a Z 2 Z l
I11 = 0 (x22 + x23 ) dV = 0 d dx3 (r2 sin2 + x23 )r
dr
V 0 0
2 l
1 1 2 a 2
= 0 ( a4 2l + a2 2 l3 ) = 0 a2 l + l2 .
4 2 3 2 3
We get the same result for I22 (which has sin2 cos2 ). Also,
Z Z a Z 2 Z l
I13 = 0 x1 x3 dV = 0 dr d dx3 r2 cos x3 = 0
V 0 0 l
74
This applies to any symmetric tensor. For the special case of the inertia ten-
sor, the eigenvalues are called the principal moments of inertia. As exemplified in
section 14.2, we can often guess the correct principal axes for Iij based on the sym-
metries of the body: with the axes we chose, Iij was found to be diagonal, by direct
calculation.
i.e. each component is unchanged. A tensor T which is invariant under every rotation
is called isotropic (i.e. the same in all directions). As noted in section 13.1.2, ij and
ijk are isotropic tensors. This ensures that the component definitions of the scalar
and vector products
a b = ij ai bj , (a b)i = ijk aj bk
1. There are no isotropic tensors of rank 1 (vector) except the zero vector.
2. The most general rank 2 isotropic tensor is Tij = ij for some scalar .
3. The most general rank 3 isotropic tensor is Tijk = ijk for some scalar .
4. All isotropic tensors of higher rank are obtained by combining ij and ijk using
tensor products, contractions and linear combinations.
Tijkl = ij kl + ik jl + il jk
for some scalar constants , , . There are no other independent combinations (for
example, you might think of ijp klp , but this is equal to ik jl il jk ).
75
with f (x) being a scalar function and V is some volume. Given a rotation Rij ,
consider an active transformation x = xi ei mapped to x = xi ei with xi = Rij xj (NB
there is no change of basis vectors here) and V is mapped to V . Suppose that under
this active transformation
1. f (x) = f (x ) and
2. V = V .
Z
Rip Rjq Rkr Tpq...r = f (x)xi xj xk dV
V
from the definition of T Z
= f (x )xi xj xk dV
V
from (1) Z
= f (x )xi xj xk dV
V
from (2)
= Tij...k
(xi or xi are dummy variables in the integral).
xi
NB with the change of variables xi = Rip xp , xp
= Rip (a constant) and
xi
det( ) = det(Rip ) = 1, i.e. the Jacobian=1 or dV = dV . The result is partic-
xp
ularly useful if (1) and (2) hold for any rotation R, in which case Tij...k is isotropic.
R
Example: Tij = V xi xj dV with V being a solid sphere with |r| a. The result
above applies with f = 1 for any rotation R, so Tij is isotropic Tij = ij , and
R
we need only determine the scalar . But taking the trace, 3 = Tii = V xi xi dV =
Ra 2
0
r 4r2 dr = 54 a5 . Hence Tij = 15 4
a5 ij .
There is a closely related result for the inertia tensor of a sphere of constant density
0 or mass M :
Z
4 2
Iij = 0 (xk xk ij xi xj ) dV = 0 (3 1) a5 ij = M a2 ij ,
V 15 5
which is isotropic.
76
1. Suppose that Ti is rank 1 isotropic. Consider
1 0 0
(Rij ) = 0 1 0 ,
0 0 1
0 10
(Rij ) = 1 0 0 ,
0 01
a rotation through /2 clockwise about the x3 axis: T13 = R1p R3q Tpq =
R12 R33 T23 = (+1)(+1)T23 . T23 = R2p R3q Tpq = R21 R33 T13 = (1)(+1)T13 .
Hence T13 = T23 = 0 and similarly T31 = T32 = 0. T11 = R1p R1q Tpq =
R12 R12 T22 = (+1)(+1)T22 . By applying /2 rotations about the x1 and x2
axes in a similar way, we deduce T11 = T22 = T33 = , say and Tij i 6= j.
Hence Tij = ij .
Acknowledgements
We thank Prof. R. Josza for proof reading.
77