MM 1
MM 1
MM 1
Peter S. Riseborough
June 18, 2018
Contents
1 Mathematics and Physics 5
2 Vector Analysis 6
2.1 Vectors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.2 Scalar Products . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.3 The Gradient . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
2.4 The Divergence . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.5 The Curl . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.6 Successive Applications of ∇ . . . . . . . . . . . . . . . . . . . . 12
2.7 Gauss’s Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
2.8 Stokes’s Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2.9 Non-Orthogonal Coordinate Systems . . . . . . . . . . . . . . . . 16
2.9.1 Curvilinear Coordinate Systems . . . . . . . . . . . . . . . 18
2.9.2 Spherical Polar Coordinates . . . . . . . . . . . . . . . . . 19
2.9.3 The Gradient . . . . . . . . . . . . . . . . . . . . . . . . . 21
2.9.4 The Divergence . . . . . . . . . . . . . . . . . . . . . . . . 21
2.9.5 The Curl . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
2.9.6 Compounding Vector Differential Operators in Curvilin-
ear Coordinates . . . . . . . . . . . . . . . . . . . . . . . . 23
1
4.2.2 Regular Singularities . . . . . . . . . . . . . . . . . . . . . 74
4.3 Linear Dependence . . . . . . . . . . . . . . . . . . . . . . . . . . 86
4.3.1 Linearly Independent Solutions . . . . . . . . . . . . . . . 90
4.3.2 Abel’s Theorem . . . . . . . . . . . . . . . . . . . . . . . . 90
4.3.3 Other Solutions . . . . . . . . . . . . . . . . . . . . . . . . 91
2
9.1 Associated Legendre Functions . . . . . . . . . . . . . . . . . . . 181
9.1.1 The Associated Legendre Equation . . . . . . . . . . . . . 181
9.1.2 Generating Function Expansion . . . . . . . . . . . . . . . 184
9.1.3 Recursion Relations . . . . . . . . . . . . . . . . . . . . . 185
9.1.4 Orthogonality . . . . . . . . . . . . . . . . . . . . . . . . . 188
9.2 Spherical Harmonics . . . . . . . . . . . . . . . . . . . . . . . . . 192
9.2.1 Expansion in Spherical Harmonics . . . . . . . . . . . . . 192
9.2.2 Addition Theorem . . . . . . . . . . . . . . . . . . . . . . 193
3
15 Calculus of Residues 275
15.1 Residue Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . 275
15.2 Jordan’s Lemma . . . . . . . . . . . . . . . . . . . . . . . . . . . 277
15.3 Cauchy’s Principal Value . . . . . . . . . . . . . . . . . . . . . . 278
15.4 Contour Integration . . . . . . . . . . . . . . . . . . . . . . . . . 282
15.5 The Poisson Summation Formula . . . . . . . . . . . . . . . . . . 294
15.6 Kramers-Kronig Relations . . . . . . . . . . . . . . . . . . . . . . 298
15.7 Integral Representations . . . . . . . . . . . . . . . . . . . . . . . 300
4
1 Mathematics and Physics
Physics is a science which relates measurements and measurable quantities to a
few fundamental laws or principles.
If physical laws are to be fundamental, they must be few in number and must
be able to be stated in ways which are independent of any arbitrary choices.
In particular, a physical law must be able to be stated in a way which is in-
dependent of the choice of reference frame in which the measurements are made.
5
2 Vector Analysis
2.1 Vectors
Consider the displacement vector, in a Cartesian coordinate system it can be
expressed as
→
−
r = êx x + êy y + êz z (1)
where êx , êy and êz , are three orthogonal unit vectors, with fixed directions.
The components of the displacement are (x, y, z).
Hence,
→
−
r = ê0x ( x cos θ + y sin θ ) + ê0y ( y cos θ − x sin θ ) + ê0z z 0 (4)
→
−
Any arbitrary vector A can be expressed as
→
−
A = êx Ax + êy Ay + êz Az (6)
where êx , êy and êz , are three orthogonal unit vectors, with fixed directions.
The components of the displacement are (Ax , Ay , Az ). The arbitrary vector
transforms under rotations exactly the same way as the displacement
→
−
A = ê0x ( Ax cos θ + Ay sin θ ) + ê0y ( Ay cos θ − Ax sin θ ) + ê0z A0z (7)
6
2.2 Scalar Products
Although vectors are transformed under rotations, there are quantities asso-
ciated with the vectors that are invariant under rotations. These invariant
quantities include:-
(i) Lengths of vectors.
(ii) Angles between vectors.
These invariant properties can be formulated in terms of the invariance of a
scalar product.
The scalar product transforms exactly the same way as a scalar under rotations,
and is thus a scalar or invariant quantity.
→
− → −
A . B = Ax B x + Ay B y + Az B z
= A0x Bx0 + A0y By0 + A0z Bz0 (9)
The change in the scalar qunatity is written in the form of a scalar product of
the vector displacement →
−
a given by
→
−
a = êx ax + êy ay + êz az (11)
→
− ∂φ ∂φ ∂φ
∇φ = êx + êy + êz (12)
∂x ∂y ∂z
The latter quantity is a vector quantity, as follows from the scalar quantities
φ(→
−r ) and φ(→
−
r +→ −a ) being invariant. Thus, the dot product in the Taylor ex-
→
−
pansion must behave like a scalar. This is the case if ∇φ is a vector, since the
scalar product of the two vectors is a scalar.
7
The gradient operator is defined as
→
− ∂ ∂ ∂
∇ = êx + êy + êz (13)
∂x ∂y ∂z
The gradient operator is an abstraction, and only makes good sense when the
operator acts on a differentiable function.
The gradient specifies the rate of change of a scalar field, and the direction
of the gradient is in the direction of largest change.
in electro-statics. This has the physical meaning that a particle will move (ac-
celerate) from regions of high potential to low potential,. The particle always
accelerates in the direction of the maximum decrease in potential.
→
− q→−r
E = + 3 (16)
r
8
→
−
Consider a vector quantity A of the form
→
−
A = →
−
r f (r) (19)
which is spherically symmetric and directed radially from the origin. The di-
→
−
vergence of A is given by
since it satisfies
→
− →
t .−
ρ = 0 (23)
Therefore, the divergence of the tangential vector field is zero
→
− → −
∇ . t = 0 (24)
→
−
In this example, the vector field t is flowing in closed circles, and the diver-
gence is zero.
→
−
Given a differentiable vector field A , which represents the flow of a quantity,
then the divergence represents the net inflow of the quantity to a volume and,
as such, is a scalar.
9
is equal to the matter that flows into the volume.
There is no source for magnetic induction field, and this shows up in the
Maxwell equation
→
− → −
∇ . B = 0 (29)
The finite magnetic induction field is purely a relativistic effect in that it rep-
resents the electric field produced by compensating charge densities which are
in relative motion.
10
→
−
The curl of a radial vector A = → −
r f (r) is evaluated as
êx êy êz
→
− →
− ∂ ∂ ∂
∇ ∧ A =
∂x ∂y ∂z
= 0
(33)
x f (r) y f (r) z f (r)
→
−
The curl of a tangential vector t given by
→
−
t = ( êx y − êy x ) (34)
is evaluated as
→
− →
−
∇ ∧ t = − 2 êz (35)
The tangential vector represents a rotation about the z-axis in a clockwise (neg-
ative) direction.
11
2.6 Successive Applications of ∇
The gradient operator can be used successively to create higher-order differen-
tials. Frequently found higher-derivatives include the divergence of a gradient
of a scalar φ
→
− → −
∇ . ∇ φ = ∇2 φ (43)
which defines the Laplacian of φ. In Cartesian coordinates one has
2
∂2 ∂2
→
− → − ∂
∇ . ∇ φ = + + φ (44)
∂x2 ∂y 2 ∂z 2
∇2 φ = − 4 π ρ (47)
12
Other useful identities include
→
− →
−
∇ ∧ ( ∇ φ) = 0 (52)
and
→
− →
− →
−
∇ .( ∇ ∧ A ) = 0 (53)
For simplicity, consider the integration volume as a cube with faces oriented
parallel to the x, y and z axes. In this special case, Gauss’s theorem can be
easily proved by expressing the divergence in terms of the Cartesian components,
and integrating the three separate terms. Since each term is of the form of a
derivative with respect to a Cartesian coordinate, the corresponding integral
can be evaluated in terms of the boundary term
Z x+ Z y+ Z z+
∂Ax ∂Ay ∂Az
dx dy dz + +
x− y− z− ∂x ∂y ∂z
Z y+ Z z+ x+ Z x+ Z z+ y+ Z x+ Z y+ z+
= dy dz Ax
+ dx dz Ay
+ dx dy Az (55)
y− z− x− x− z− y− x− y− z−
The last six terms can be identified with the integrations over the six surfaces of
→
−
the cube. It should be noted that the for a fixed direction of A the integration
over the upper and lower surfaces have different signs. If the normal to the
surface is always chosen to be directed outwards, this expression can be written
as an integral over the surface of the cube
Z
→
− → −
d2 S . A (56)
13
→
−
where the surface S bounds the volume V .
————————————————————————————————–
Example:
Using Gauss’s theorem, show that the divergence of the electric field caused
by a point charge q is proportional to a Dirac delta function.
Solution:
→
− q→−
r
E = (58)
r3
and one finds that
→
− → −
∇ . E = 0 (59)
for r 6= 0. Then, since Gauss’s theorem applied to a volume containing the
point charge becomes
Z Z
3→
− →
− → − →
− → −
d r ∇ . E = d2 S . E
− q→ −
Z
→ r
= d2 S . 3
r
= 4πq (60)
————————————————————————————————–
14
2.8 Stokes’s Theorem
Stoke’s theorem relates the surface integral of the curl of a vector to an integral
of the vector around the perimeter of the surface.
Stokes’s theorem can easily be proved by integration of the curl of the vector
over a square, with normal along the z-axis and sides parallel to the Cartesian
axes
Z
2→
− →
− →
−
d S . ∇ ∧ A
Z x+ Z y+
∂Ay ∂Ax
= dx dy êz . êz − (62)
x− y− ∂x ∂y
The scalar product with the directed surface selects out the z-component of the
curl. One integral in each term can be evaluated, yielding
Z y+ x+ Z x+ y+ I
→
−
d→−
= dy Ay − dx Ax = r . A (63)
y− x− x− y−
which is of the form of an integration over the four sides of the square, in which
the loop is traversed in a counterclockwise direction.
and Z
→
−
d→
−
r . A (65)
path2
If these two paths are traversed consecutively, but the second path is traced in
the reverse direction, then one has a loop. If the phase of the wave function at
the origin is unique, up to multiples of 2 π, then the loop integral
I
→
−
d→−r . A (66)
15
must take on multiples of a fundamental value φ0 . Stokes’s theorem leads to
the discovery that magnetic flux must be quantized as
I
→
−
d→
−r . A = n φ0
Z
→
− → −
d2 S . B = n φ 0 (67)
In this prescription, the components have been given as the successive displace-
ments needed to be traversed parallel to the unit vectors to arrive at the point.
This is one way of specifying a vector. The components x1 and x2 are known
as the co-variant components.
Another way of specifying the same vector is given by specifying the com-
ponents x1 , x2 as the displacements along the unit vectors, if the point is given
by the intersection of the perpendiculars subtended from the axes. The compo-
nents x1 and x2 are the contra-variant components.
————————————————————————————————–
16
Example:
Solution:
The relationship between the Cartesian components of the vector and the
covariant components is given by
l cos φ = x1 + x2 cos θ
l sin φ = x2 sin θ (73)
l cos φ = x1
l cos(θ − φ) = x2 (74)
x1 = x1 + x2 cos θ
2
x = x2 + x1 cos θ (75)
17
————————————————————————————————–
Example:
How does the length get expressed in terms of the contra-variant compo-
nents?
Solution:
————————————————————————————————–
which becomes
X X ∂xi ∂xi
2
dl = dqj dqj 0 (83)
0 i
∂qj ∂qj 0
j,j
18
Thus, the metric is found to be
X
j,j 0 ∂xi ∂xi
g = (84)
i
∂qj ∂qj 0
The three unit vectors of the generalized coordinate system are proportional
to
X ∂xi
êqj ∝ êi (85)
i
∂qj
In general, the direction of the unit vectors depends on the values of the set of
three generalized coordinates qj ’s.
The inverse of the metric is also diagonal and has non-zero elements g1i,i . Thus,
in this case, the co-variant and contra-variant components of a vector are simply
related by
1
xi = i,i xi (89)
g
An example of orthogonal curvilinear coordinates is given by the spherical polar
coordinate representation.
19
The unit vectors are denoted by (êr , êθ , êϕ ) and are in the direction of increasing
coordinate. Thus,
∂→−
r
êr =
∂r
= êx sin θ cos ϕ + êy sin θ sin ϕ + êz cos θ (91)
and
∂→−
r
êθ ∝
∂θ
= êx r cos θ cos ϕ + êy r cos θ sin ϕ − êz r sin θ (92)
êθ = êx cos θ cos ϕ + êy cos θ sin ϕ − êz sin θ (93)
which is in the x − y plane. The unit vector êϕ is given by normalizing the
above vector, and is
As can be seen by evaluation of the scalar product, these three unit vectors are
mutually perpendicular. Furthermore, they form a coordinate systems in which
Due to the orthogonality of the unit vectors, the metric is a diagonal matrix
and has the non-zero matrix elements
g r,r = 1
θ,θ
g = r2
g ϕ,ϕ = r2 sin2 θ (97)
20
2.9.3 The Gradient
In curvilinear coordinates, the gradient of a scalar function φ is given by consid-
eration of the infinitesimal increment caused by the change in the independent
variables qj
X ∂φ
dφ = dqj (99)
j
∂qj
d3 →
−
p
r = Πj dqj g j,j
p
= Det g j,j Πj dqj (104)
21
Hence, from Gauss’s theorem, the divergence is given by the the sums of the
scalar product of the vector with the directed surface areas divided by the
volume element. Since the surfaces with normals in the direction of êqi occur in
pairs and are oppositely directed, one finds the divergence as the derivative
→
− → − 1 X ∂ Ai p
∇ . A = p p Det g j,j (106)
Detg j,j i ∂qi g i,i
The components of the curl along the unit directions êqj in the direction of
→
−
increasing qi can be evaluated over the surface areas d2 S with normals êqj .
Then, we have
p
Detg i,i
Z
2→
− →
− →
− →
− →
−
d Sj . ∇ ∧ A = ∇ ∧ A p Πi6=j dqi (109)
j g j,j
22
where we have Taylor expanded the vector field about the center of the in-
finitesimal surface. The lowest-order terms in the expansion stemming from the
opposite sides of the perimeter cancel. Hence, the component of the curl along
the normal to the infinitesimal surface is given by the expression
→
− →
− 1 ∂ p 3,3 ∂ p 2,2
∇ ∧ A = p g A3 − g A2 (112)
1 g 2,2 g 3,3 ∂q2 ∂q3
The expression for the entire curl vector can be expressed as a determinant
p p p
ê1 g 1,1 ê2 g 2,2 ê3 g 3,3
→
− →
− 1 ∂ ∂ ∂
∇ ∧ A = p
∂q1 ∂q2 ∂q3
(113)
Detg j,j
p p p
g 1,1 A1 g 2,2 A2 g 3,3 A3
∂ ∂
x = x + 1 (116)
∂x ∂x
The order of the operators is important. The differential operator acts on every-
thing in front of it, which includes the unit vectors. In Cartesian coordinates,
the directions of the unit vectors are fixed thus,
∂ ∂ ∂
êx = êx = êx = 0 (117)
∂x ∂y ∂z
23
etc. For curvilinear coordinates this is no longer true. For example, in spherical
polar coordinates although the directions of the unit vectors are not determined
by the radial distance
∂ ∂ ∂
êr = êθ = êϕ = 0 (118)
∂r ∂r ∂r
the other derivatives of the unit vectors are not zero, as
∂
êr = êθ
∂θ
∂
êθ = − êr
∂θ
∂
êϕ = 0 (119)
∂θ
and
∂
êr = sin θ êϕ
∂ϕ
∂
êθ = cos θ êϕ
∂ϕ
∂
êϕ = − sin θ êr + cos θ êθ (120)
∂ϕ
Here it is important to note that the differential operator acts on the unit vec-
tors, before the scalar product is evaluated.
Problem:2.1
Problem:2.2
How are the Laplacian of a scalar ∇2 ψ and the square of the angular mo-
−̂ 2
→
mentum L ψ related in spherical polar coordinates?
24
Problem:2.3
− J cos(θ−→ − θ− → → )
R R +−a
→
−
where θ−→ is the angle that the spin at site R subtends to a fixed axis. For
R
positive J, the interaction is minimized if the neighboring spins are all parallel.
In the continuum limit, the interactions can be Taylor expanded in the lattice
constant a. For small a, angle θ can be considered as a continuous field θ(→ −r ),
defined at every point in the two-dimensional space. The energy of the field is
approximately given by
Z
→
− → −
E= − J a 2
d2 →
−
r ∇θ . ∇θ
∇2 θ = 0
(i) Find an expresssion for the Laplacian and the gradient in circular polar
coordinates (r, ϕ).
(ii) Show that
θ(→−
X
r) = rm ( Am cos mϕ + Bm sin mϕ )
m
θ = ln r + C
θ = ϕ + C
are solutions of Laplace’s equation, except at the point r = 0. Since the contin-
uum appozimation is valid for distances greater than a, these may be considered
good solutions, if a cut-off is introduced for distances smaller than a.
(vi) Show that
y − yi
θ(→
−
X
r) = ni tan−1 + θ0
i
x − xi
25
can also be considered at a good solution, except in the vicinity of a finite num-
ber of points (xi , yi ) where the solution is singular.
(vii) Why should the ni be restricted to positive and negative integer values?
26
3 Partial Differential Equations
The dynamics of systems are usually described by one or more partial differential
equations. A partial differential equation is characterized as being an equation
for an unknown function of more than one independent variable, which expresses
a relationship between the partial derivatives of the function with respect to the
various independent variables. Conceptually, a solution may be envisaged as
being obtained by direct integration. Since integration occurs between two lim-
its, the solution of a partial differential equation is not unique unless its value
is given at one of the limits. That is, the solution is not unique unless the
constants of integration are specified. These are usually specified as boundary
conditions or initial conditions.
1 ∂2φ
∇2 φ − = f (→
−
r , t) (123)
c2 ∂t2
where φ(→ −
r , t) describes the wave motion, c is the phase velocity of the wave and
the force density f (→ −r , t) acts as a source for the waves, inside the region of in-
terest. Again appropriate boundary conditions for four-dimensional space-time
(→
−r , t) need to be specified, for the solution to be unique. Since this equation
is a second-order equation with respect to time, it is necessary to specify φ at
two times. Alternatively, one may specify φ(→ −
r , t0 ) at the initial time and its
∂φ(→−r ,t)
derivative ∂t t0 | at the initial time.
∇2 φ = − 4 π ρ (124)
27
which specifies the scalar or electrostatic potential φ(→
−
r ) produced by a charge
→
−
density ρ( r ). The boundaries of the spatial region in which φ is to be deter-
mined may also involve charge densities on the boundary surfaces or they may
be surfaces over which φ is specified. The charge density is to be regarded as a
source for the electric potential φ.
All of the above equations posses the special property that they are linear
partial differential equations. Furthermore, they are all second-order linear par-
tial differential equations since, the highest order derivative that enters is the
second-order derivative.
28
only involve the first power of the unknown function or the first power of a (first
or higher-order) partial derivative of the function. The solution of the partial
differential equation is not unique, unless the boundary conditions are specified.
That is, one may find more than one (linearly independent) solution for the
unknown function, such as φi for i = 1 , 2 , . . . , N . Due to the linearity,
the general solution of the homogeneous equation φ can be expressed as a linear
combination
XN
φ = C i φi (127)
i=1
where the Ci are arbitrary (complex) numbers. The constant Ci may be deter-
mined if appropriate boundary conditions and initial conditions are specified.
This is often referred to as the principle of linear superposition.
Now consider the inhomogeneous equation, that is, the equation in which the
source terms are present. If a particular solution of the inhomogeneous equation
is found as φp , then it can be seen that due to the linearity it is possible to find
a general solution as the sum of the particular solution and the solutions of the
homogeneous equation
XN
φ = φp + Ci φi (128)
i=1
29
as soliton solutions.
For the Korteweg-de Vries equation one can look for soliton solutions which
propagate with velocity c. These are of the form
Substituting this form of the solution into the partial differential equation leads
to
∂φ ∂3φ
(φ − c) + = 0 (132)
∂x ∂x3
which can be integrated to yield
φ2 ∂2φ
− cφ + = κ (133)
2 ∂x2
The constant of integration κ is chosen to be zero, by specifying that φ → 0
when | x − c t | → ∞. On identifying an integrating factor of
∂φ
(134)
∂x
and multiplying the differential equation by the integrating factor, one obtains
∂φ φ2 ∂φ ∂φ ∂ 2 φ
− c φ + = 0 (135)
∂x 2 ∂x ∂x ∂x2
This can be integrated again to yield
2
φ3 φ2
1 ∂φ
− c + = γ (136)
6 2 2 ∂x
The boundary conditions can be used again to find γ = 0. The square root of
the equation can be taken, giving the solution as an integral with z = 3φc
φ(x,t)
Z
√
Z x−ct
3 c dz
√ = c dx0 (137)
z 1 − z
The integral can be evaluated, using the substitution
z = sech2 x (138)
and
dz = − 2 sech2 x tanh x dx (139)
giving
3c
φ(x, t) = (140)
√ x
cosh2 c − c t
2
30
This non-linear solution has a finite spatial extent and propagates with velocity
c, and does not disperse or spread out.
We shall solve the Cauchy problem with the method of characteristics. The
method of characteristics is a powerful method that allows one to reduce any
linear first-order partial differential equation into an ordinary differential equa-
tion. The characteristics are defined to be curves in the (x, t) plane, x(t), which
satisfy the equation
dx(t)
= a(x(t), t) (144)
dt
The solution of this equation defines a family or a set of curves x(t). The
different curves correspond to the different constants of integration, or initial
conditions x(0) = x0 .
The solution φ(x, t), when evaluated on the characteristic x(t), yields φ(x(t), t).
This has the special property that φ has a constant value along the curve x(t).
This can be shown by taking the derivative along the characteristic curve
31
and since the characteristic satisfies
dx(t)
= a(x(t), t) (146)
dt
one has
d ∂φ(x, t) ∂φ(x, t)
φ(x(t), t) = a(x, t) + = 0 (147)
dt ∂x ∂t
as φ(x, t) satisfies the homogeneous linear partial differential equation. Thus,
φ(x, t) is constant along a characteristic. Hence,
φ(x(t), t) = φ(x0 , 0)
= f (x0 ) (148)
This means that if we can determine the characteristic, one can compute the
solution of the Cauchy problem.
————————————————————————————————–
Example: 3.1.1
Solution:
32
The solution φ(x, t) is constant on the curve passing through (x0 , 0), and is
determined from
φ(x, t) = φ(x(t), t) = f (x0 ) (153)
However, on inverting the equation for the characteristic one finds
x0 = x − c t (154)
so one has
φ(x, t) = f ( x − c t ) (155)
which clearly satisfies both the initial condition and the differential equation.
The above solution corresponds to a wave travelling in the positive direction
with speed c.
————————————————————————————————–
Example: 3.1.2
Solution:
one has
φ(x, t) = f ( x exp[ − t ] ) (162)
33
This is the solution that satisfies both the partial differential equation and the
initial condition.
————————————————————————————————–
dφ(x(t), t) ∂φ dx(t) ∂φ
= +
dt ∂t dt ∂x
∂φ ∂φ
= + a(x, t)
∂t ∂x
= b(x, t) (166)
However, the solution can be found by integration along the characteristic curve.
This yields, Z t
φ(x(t), t) = f (x0 ) + dt0 b(x(t0 ), t0 ) (167)
0
On inverting the relation between x(t) and x0 , and substituting the resulting
relation for f (x0 ) in the above equation, one finds the solution.
————————————————————————————————–
Example: 3.1.3
34
where the solution has to satisfy the initial condition φ(x, 0) = f (x).
Solution:
x(t) = x0 + c t (169)
and the solution of the partial differential equation along the characteristic is
given by
Z t
φ(x(t), t) = f (x0 ) + λ dt0 x(t0 )
0
c 2
= f (x0 ) + λ ( x0 t + t ) (170)
2
Since the characteristic can be inverted to yield the initial condition as x0 =
x − c t, one has the solution
ct
φ(x, t) = f ( x − c t ) + λ t x − (171)
2
which completely solves the problem.
————————————————————————————————–
Example: 3.1.4
Solution:
x(t) = x0 + c t (174)
35
which has the solution
t
φ(x(t), t) = φ(x(0), 0) exp[ − ]
τ
t
= f ( x − c t ) exp[ − ] (176)
τ
which is a damped forward travelling wave.
————————————————————————————————–
Example:3.1.5
Solution:
dφ η2 η2
x + 2 φ = 2 (180)
dx x x
for constant η. On multiplying by the integrating factor
η2
exp − (181)
2 x2
and integrating, one finds that the general solution is given by
2
η2
η
φ(x, t(x)) = exp f (η) + exp − (182)
2 x2 2 x2
Hence, on substituting for η, one obtains the solution
2
t
φ(x, t) = exp f (xt) + 1 (183)
2
36
Alternatively, the characteristic curve x(t) could have been specified para-
metrically in terms of the variable ξ
dx
= x (184)
dξ
dt
= −t (185)
dξ
which can be solved independently to yield
x = exp[ξ]
t = η exp[−ξ] (186)
η2
exp − exp[−2ξ] (188)
2
The resulting equation can be solved by integration and substitution of the t = 0
boundary condition.
37
∂ n φ(x,0)
x and t. If φ(x, 0) is given, then all the derivatives ∂xn can be obtained by
∂φ(x,t)
direct differentiation. If the first-order derivative ∂t is given, then the
t=0
n+1
derivatives ∂ ∂t ∂x
φ(x,t)
n can also be obtained by direct differentiation.
t=0
∂ 2 φ(x,t)
These two pieces of initial data allow the second derivative ∂t2 to
t=0
be evaluated, by using the differential equation,
Characteristics.
The differential equation can be solved in the (ξ, η) local coordinate system.
We shall define the solution in the new coordinates to be
Let us assume that the differential equation has boundary conditions in which
38
˜
∂ φ(ξ,η)
the function φ̃(ξ = 0, η) and its derivative ∂ξ
are given. By differentia-
ξ=0
tion with respect to η, one can determine
∂ φ̃(0, η)
∂η
∂ 2 φ̃(0, η)
(194)
∂η 2
and
∂ φ̃(ξ, η)
∂ξ ξ=0
∂ 2 φ̃(ξ, η)
(195)
∂ξ ∂η ξ=0
To obtain all the higher-order derivatives needed for a Taylor series expansion
of the solution, one must express the partial differential equation in terms of
the new variables. Thus, one has
∂φ ∂ φ̃ ∂ξ ∂ φ̃ ∂η
= + (196)
∂t x ∂ξ η ∂t x ∂η ξ ∂t x
and
2 2 2 2
∂ 2 φ̃
2
∂ φ ∂ξ ∂ φ̃ ∂ξ ∂η ∂ φ̃ ∂η
= + 2 +
∂t2 x ∂ξ 2 ∂t x ∂ξ ∂η ∂t x ∂t x ∂η 2 ∂t x
2 2
∂ φ̃ ∂ ξ ∂ φ̃ ∂ η
+ + (197)
∂ξ η ∂t2 x ∂η ξ ∂t2 x
etc. Therefore, the differential equation can be written as
2 2 2
∂ φ̃ ∂ξ ∂ξ ∂ξ ∂ξ
2
A + 2 B + C
∂ξ ∂x ∂x ∂t ∂t
2
∂ φ̃ ∂ξ ∂η ∂ξ ∂η ∂η ∂ξ ∂ξ ∂η
+ 2 A + B + + C
∂ξ ∂η ∂x ∂x ∂x ∂t ∂x ∂t ∂t ∂t
2 2 2
∂ φ̃ ∂η ∂η ∂η ∂η
+ 2
A + 2B + C
∂η ∂x ∂x ∂t ∂t
2 2 2
∂ φ̃ ∂ ξ ∂ ξ ∂ ξ ∂ξ ∂ξ
+ A + 2 B + C + D + E
∂ξ ∂x2 ∂x ∂t ∂t2 ∂x ∂t
2 2 2
∂ φ̃ ∂ η ∂ η ∂ η ∂η ∂η
+ A + 2B + C + D + E
∂η ∂x2 ∂x ∂t ∂t2 ∂x ∂t
+ F φ̃ + G = 0 (198)
39
The ability to solve this equation with the initial data given on ξ = 0, rests
on whether or not the second derivative
2
∂ φ̃
(199)
∂ξ 2
can be determined from the differential equation, since all other quantities are
assumed to be known. The second derivative w.r.t. ξ can be found if its coeffi-
cient is non-zero
2 2
∂ξ ∂ξ ∂ξ ∂ξ
A + 2B + C 6= 0 (200)
∂x ∂x ∂t ∂t
If the above expression never vanishes for any real function ξ(x, t), then the so-
lution can be found. All higher-order derivatives of φ̃ can be found by repeated
differentiation.
The above analysis can be extended to the case in which the coefficients
A, B and C are well-behaved functions of (x, t), since only the second-order
derivatives w.r.t. to (ξ, η) play a role in the analysis.
Hyperbolic Equations.
40
However, the slope of the curve ξ(x, t) = const. is given
∂ξ ∂ξ
dx + dt = 0 (205)
∂x ∂t
or
√
B ∓ B2 − A C
dt
= (207)
dx A
The solution can not be determined from Cauchy initial boundary conditions
specified on these curves. The set of curves are the characteristics of the equa-
tion. In the local coordinate system corresponding to the positive sign, we see
that 2 2
∂ξ ∂ξ ∂ξ ∂ξ
A + 2B + C = 0 (208)
∂x ∂x ∂t ∂t
The other coordinate η(x, t) can be found by choosing the negative sign. On
this family of curves, one also finds
2 2
∂η ∂η ∂η ∂η
A + 2B + C = 0 (209)
∂x ∂x ∂t ∂t
In this special coordinate system, one finds the partial differential equation
simplifies and has the canonical form
∂ 2 φ̃ ∂ φ̃ ∂ φ̃
+ α + β + γ φ̃ + δ = 0 (210)
∂ξ ∂η ∂ξ ∂η
where α, β, γ and δ are functions of ξ and η.
41
which factorizes as
∂ξ 1 ∂ξ
+ = 0 (213)
∂x c ∂t
and
∂η 1 ∂η
− = 0 (214)
∂x c ∂t
which has solutions
ξ = x − ct (215)
and
η = x + ct (216)
In this new coordinate system, the wave equation reduces to
2
∂ φ̃
= 0 (217)
∂ξ ∂η
∂ φ̃
A solution can be found by integrating the equation w.r.t. ξ, yielding ∂η in
∂g(η)
terms of an arbitrary function of integration η, which we will denote by ∂η .
If this is followed by integration of the differential equation w.r.t η, one finds
∂ φ̃
∂ξ in terms of an arbitrary function of integration ξ, which we will denote by
∂f (ξ)
∂ξ . By integrating the resulting two equations again to obtain expressions
for φ̃(ξ, η) and equating the forms of the two solutions, one finds that
————————————————————————————————–
Example:3.2.1
1 ∂2φ ∂2φ
− = 0 (219)
c2 ∂t2 ∂x2
subject to the general boundary conditions
φ(x, 0) = f (x)
∂φ
= g(x) (220)
∂t t=0
42
Solution:
Substituting equations (223) and (224) for ψF (x + ct) and ψB (x − ct) in the
above expression, results in the equation
Z x+ct
1
2 ψF (x−ct) + 2ψB (x+ct) − f (x+ct) − f (x−ct) = dx0 g(x0 ) (228)
c x−ct
43
————————————————————————————————–
Example:3.2.2
1 ∂2φ ∂2φ
− = 0 (230)
c2 ∂t2 ∂x2
subject to the boundary conditions
φ(x, 0) = exp[x/ξ]
∂φ
= c/ξ sin x/ξ (231)
∂t t=0
Solution:
Following from the analysis of the previous example, one finds that the
solution is
————————————————————————————————–
Example:3.2.3
φ(x, 0) = x2 ξ −2
∂φ
= c exp[x/ξ] (234)
∂t t=0
————————————————————————————————–
Parabolic Equations.
44
If the discriminant vanishes in a finite region, B 2 − A C = 0, then it
is clear that neither A nor C can vanish as in this case B must also vanish,
and one would only have an ordinary differential equation. If A 6= 0, and the
coefficient of the second derivative of φ̃ with respect to ξ is factorized as
2
1 ∂ξ ∂ξ
A + B (235)
A ∂x ∂t
corresponding to the double root. Hence, there is only a single family of char-
acteristics ξ defined by
dx A
= (237)
dt B
also vanishes, since the discriminant vanishes. This can be seen by examining
2
the coefficient of ∂ξ∂ ∂η
φ̃
in the differential equation and noticing that
∂ξ ∂ξ
0 = C (A + B )
∂x ∂t
∂ξ ∂ξ
= B(B + C ) (238)
∂x ∂t
∂ 2 φ̃
Thus, all the factors in the coefficient of ∂ξ ∂η vanish. Hence, parabolic equa-
tions can be put in the canonical form
∂ 2 φ̃ ∂ φ̃ ∂ φ̃
+ α + β + γ φ̃ + δ = 0 (239)
∂η 2 ∂ξ ∂η
Elliptic Equations.
45
In the case when B 2 − A C < 0, the partial differential equation is elliptic.
In the case, when the discriminant is negative, there are no real roots and
thus there are no real solutions of the equation. Therefore, there is always a
solution of the partial differential equation, if the initial conditions are given.
The elliptic equation can be put in canonical form. First, the coefficient of
∂ 2 φ̃
(241)
∂ξ ∂η
is chosen to be put equal to zero. This requires that
" # " #
∂ξ ∂η ∂η ∂ξ ∂η ∂η
A +B + B +C = 0 (242)
∂x ∂x ∂t ∂t ∂x ∂t
∂ 2 φ̃
Secondly, the equation is transformed such that the coefficients of ∂ξ 2 and
∂ 2 φ̃ ∂ 2 φ̃
∂η 2 are made equal, while the coefficient of is maintained to be zero. For
∂ξ∂η
convenience, we shall relabel the coordinates found from the first transformation
as (x, t), and the coefficients as A, B = 0 and C. The required transformation is
determined by the condition that the coefficients of the second-order derivatives
are equal
2 2 2 2
∂ξ ∂ξ ∂η ∂η
A + C = A + C (244)
∂x ∂t ∂x ∂t
46
This condition is equivalent to
√ √
∂ξ ∂η
A = C
∂x ∂t
√ √
∂ξ ∂η
C = − A (247)
∂t ∂x
The solution exists if the integral is independent of the path of integration. This
requires "r # "r #
∂ C ∂η ∂ A ∂η
= − (249)
∂t A ∂t ∂x C ∂x
which is just the condition that ξ is an analytic function, so a solution can al-
ways be found.
47
and thus the solution is given by
These are harmonic functions. The real and imaginary part of these solutions
separately satisfy Laplace’s equation.
Cauchy Conditions.
Dirichlet Conditions.
Neumann Conditions.
48
variable.
For example, the wave equation in Cartesian coordinates takes the form
2 2 2 2
∂ φ ∂ φ ∂ φ 1 ∂ φ
+ + − = 0 (255)
∂x2 ∂y 2 ∂z 2 c2 ∂t2
This can be reduced to four one-dimensional differential equations by assuming
that
φ(x, y, z, t) = X(x) Y (y) Z(z) T (t) (256)
which, on substituting into the differential equation leads to
" 2 2 2 #
1 1 ∂2T
1 ∂ X 1 ∂ Y 1 ∂ Z
X(x) Y (y) Z(z) T (t) + + − 2 = 0
X(x) ∂x2 Y (y) ∂y 2 Z(z) ∂z 2 c T ∂t2
(257)
or on diving by φ one has
" 2 2 2 #
1 1 ∂2T
1 ∂ X 1 ∂ Y 1 ∂ Z
+ + = 2 (258)
X(x) ∂x2 Y (y) ∂y 2 Z(z) ∂z 2 c T ∂t2
49
which is the one-dimensional problem
2
∂ X
= L X(x) (264)
∂x2
which involves the two constants of separation K and L. This equation can be
solved by noting that
2 2
1 ∂ Z 1 ∂ Y
− K + L = − (266)
Z(z) ∂z 2 Y (y) ∂y 2
Thus, after solving the one-dimensional problems, we will have found a spe-
cific solution of the form
Since we have a linear partial differential equation, the general solution is of the
form of a linear superposition
X
φ(x, y, z, t) = CK,L,M φK,L,M (x, y, z, t)
K,L,M
X
= CK,L,M TK (t) XL (x) YM (y) ZK−L−M (z) (270)
K,L,M
The constant coefficients CK,L,M are chosen such as to satisfy the boundary
conditions.
50
————————————————————————————————–
Example:3.4.1
Solution:
————————————————————————————————–
Example:3.4.2
51
Solve this equation.
Solution:
∂2Φ
= − m2 Φ(ϕ) (278)
∂ϕ2
m2
1 ∂ ∂Θ
sin θ − Θ(θ) = − l ( l + 1 ) Θ(θ) (279)
sin θ ∂θ ∂θ sin2 θ
h̄2 1 ∂ h̄2 l ( l + 1 )
2 ∂R
− r + R(r) + V (r) R(r) = E R(r) (280)
2 m r2 ∂r ∂r 2 m r2
It is seen that R(r) depends on l but not on m, and is denoted by Rl (r). Thus, in
this case, the solution of the partial differential equation can be reduced to the
solution of three one-dimensional problems described by ordinary differential
equations, which involves the two constants of separation (l, m). The general
solution can be written as the linear superposition
X
Ψ(r, θ, ϕ) = Cl,m Rl (r) Θl,m (θ) Φm (ϕ) (281)
l,m
of solutions with the different possible (l, m) values, and with expansion coeffi-
cients Cl,m .
————————————————————————————————–
Example:3.4.3
52
The solutions of the Dirac equation are also solutions of the Klein-Gordon
equation, but not vice versa. Consider the mass m(z) in the Dirac equation
to be a function of the z-spatial coordinates, where m(z) change sign at z = 0
since it iis given by
n h̄ z
m(z) = tanh (282)
ξ ξ
where n is an integer. With an appropriate boundary condition, one can derive
a Klein-Gordon equation of the form
2
E 2 2 2 2 ∂m
2
+ h̄ ∇ − m(z) c Φ = − h̄ c Φ (283)
c ∂z
Find the solutions of this equation localized near the plane z = 0.
Solution
53
which is exponentially localized around z = 0. The decay length is given by
m0 c
h̄ . Since m = n, the dispersion relation describes massless excitations
————————————————————————————————–
Problem:
The equation indicates that the characteristics curves have tangents given
by →
−a . Hence, since the initial conditions are specified on curves perpendicular
to ξ, one has
ξ = →
−a .→
−
r
η = ( a ∧ êz ) . →
→
− −
r
The characteristics are found as
ξ = ax x + ay y
η = ay x − ax y
As long as
a ay
J = x
ay −ax
6= 0
54
the equation reduces to
∂φ →
−
|→
−
a |2 = →
−
a . ∇φ
∂ξ
= 0
φ(ξ, η) = f (η)
Problem:
55
which is solved as
ξ
g(ξ 0 , η) c ξ0
Z
cξ
φp = exp − → dξ 0 exp
|−
a |2 |→
−
a |2 |→
−a |2
The general solution is given by φh + φp , thus
Z ξ 0
c ξ0
cξ 0 g(ξ , η)
φ(ξ, η) = exp − → f (η) + dξ exp →
|−
a |2 |→
−
a |2 |−a |2
Problem:
56
when y(x) satisfies
dy ay (x, y)
=
dx ax (x, y)
Then, ξ can be chosen in any way such that the Jacobian is non-zero
∂(ξ, η)
J =
∂(x, y)
∂ξ ∂ξ
∂x ∂y
=
∂η ∂η
∂x ∂y
6= 0
φ(0, η) = f (η)
Problem:
φ(0, y) = y
57
Problem:
φ(0, y) = exp[−y 2 ]
Problem:
φ(x, 0) = x
∂φ
The characteristics can be found by requiring that the coefficient of ∂η van-
ishes
∂η ∂η
2xt + = 0
∂x ∂t
with
dx
= 2xt
dt
The lines of constant η satisfy
dx
= 2 t dt
x
and has the solution
x = η exp[t2 ]
η = x exp[−t2 ]
58
which satisfies our requirement that the coefficient vanishes. On choosing ξ = t,
the partial differential equation reduces to the ordinary differential equation
∂φ
= φ
∂ξ
Therefore, one obtains the form of the solution as
Problem:
∂φ
The characteristics can be found by setting the coefficient of ∂η to zero.
This yields
∂η ∂η
φ2 + − 0
∂x ∂t
This slope of the lines with constant η are given by
dx
= φ2
dt
The characteristic η is found as
x = φ2 t + η
or
η = x − φ2 t
59
On setting ξ = t, the partial differential equation reduces to
∂φ
= 0
∂ξ
which has the solution
φ(ξ, η) = f (η)
On imposing the boundary conditions on this form, one obtains an implicit
equation for φ
p
φ(x, t) = x − φ2 t
which can be solved to yield
r
x
φ(x, t) = ±
1 + t
This solution is valid for x > 0 and t > 0.
Problem:
60
then the equation is identically satisfied since
∂ψ ∂ψ
=
∂ξ ∂ξ
The three Lagrange-Charpit equations can separately be integrated w.r.t. ξ.
The first two lead to
t(ξ, η) = ξ + c1 (η)
x(ξ, η) = c ξ + c2 (η)
where c1 (η) and c2 (η) are arbitrary functions of integration. The third equation
becomes
dψ
= dξ
a(ξ, η)
t = ξ
x = ct + η
Problem:
φ(x, x) = x3
61
Problem:
φ(0, y) = y2 − 1
Problem:
∂2φ ∂φ
+ = 0
∂x∂t ∂x
and find a general solution.
∂φx
= − φx
∂t
which has the solution for φx
Problem:
∂2φ ∂φ ∂φ
φ=
∂x∂y ∂x ∂y
62
by using the method of separation of variables. That is, assume the solution is
of the form
Problem: 1
h̄2 m ω2 2
− ∇2 Ψ + r Ψ = EΨ
2m 2
63
4 Ordinary Differential Equations
An ordinary differential equation (ODE) is an equation involving one unknown
function φ of a single variable and it’s derivatives. In physical applications, the
variable is either a position coordinate x or time t. Since the equation involves
a derivative of first or higher order, finding solution requires the equivalent of
at least one integration of the unknown function φ(x). Hence, in order that
the constants of integration be uniquely specified the equation must be sup-
plemented by at least one boundary condition. Generally, as many derivatives
occur in the ODE, many different boundary conditions are also needed. The
number of boundary conditions needed usually correspond to the degree of the
highest-order derivative. The boundary conditions usually consist of specifying
a combination of the unknown function and its derivatives at a particular value
of the variable, x0 .
64
In this case one may identify C = m2 . Hence, the non-linear differential
equation of second-order in the derivative has been reduced to a non-linear dif-
ferential equation only involving the first-order derivative of φ.
This solution represents a static soliton excitation. Solitons of this kind exist in
a system of infinitely many gravitational pendula. In this case, the pendula are
joined rigidly to one wire. The rotation of a pendulum provides a twist in the
supporting wire which transmits a torsional force on the neighboring wire. The
angle that the pendula make with the direction of the gravitation is denoted by
φ. The gravitational force is periodic in the angle φ and is responsible for the
non-linear term. The above solution represents a rotation of successive pendula
over the top. In addition to this, one expects small amplitude oscillations of the
weights around the bottom. Since φ is expected to be small one can linearize
the equation, that is we may be able to write b
sin φ ≈ φ (303)
to obtain
∂2φ
− m2 φ(x) = 0 (304)
∂x2
65
This is a second-order linear differential equation. It can be solved to yield the
solution
φ(x) = A sin m ( x − x0 ) (305)
involving two constants of integration which are provided by two boundary
conditions. This solution represents a snapshot of the ripples of the coupled
pendula. We have assumed that the amplitude of the ripples are such that
A 1.
where the set of functions an (x) and f (x) are known. The largest value of n for
which the function an (x) is non-zero is denoted by N , then the linear differential
equation is of N -th order. If the term f (x) is zero, the equation is homogeneous
Whereas if the term f x) is non-zero, the equation is inhomogeneous.
also satisfies the same linear differential equation. In general, the non-uniqueness
of solutions to an equation is due to the failure to specify appropriate boundary
conditions on φ(x). This non-uniqueness can be utilized to construct solutions
that satisfy the appropriate boundary conditions.
then this φ also satisfies the same inhomogeneous equation. The non-uniqueness
does not hold when appropriate boundary conditions are specified.
66
boundary conditions eliminate the N constants of integration that are found
when integrating the equation N times. Exceptions to this rule may occur if
the coefficient of the highest differential vanish at a point. This, essentially,
reduces the order of the differential equation locally. If the boundary conditions
are not used, one can develop N independent solutions of the N -th order linear
differential equation.
d2 φ dφ
+ P (x) + Q(x) φ = 0 (310)
dx2 dx
If both P (x) and Q(x) are finite at the point x = x0 , the point x0 is an
ordinary point. If either P (x), Q(x) or both diverge at the point x = x0 ,the
point x0 is a singular point. Example of a singular point occurs in quantum
mechanics, for example, the behavior of the radial part of the Hydrogen atom
wave function is governed by the ordinary differential equation
h̄2 1 h̄2 l ( l + 1 )
∂ 2 ∂
− r R(r) + V (r) + R(r) = E R(r)
2 m r2 ∂r ∂r 2 m r2
(311)
where the centrifugal potential diverges as r → 0. This is an example of a
singular point. Physically, what is expected to occur is that the terms V (r) and
E are negligible, near this singular point, and that the radial kinetic energy will
balance the centrifugal potential.
h̄2 1 h̄2 l ( l + 1 )
∂ 2 ∂
− r R(r) + R(r) ≈ 0 (312)
2 m r2 ∂r ∂r 2 m r2
Then, one expects that for r ∼ 0 the radial wave function will be such that
h̄2 ∂ 2 ψ m ω 2 x2
− + ψ = Eψ (314)
2 ∂x2 2
67
when x → ±∞. In this case, one expects that the kinetic energy term should
balance the potential energy term. This is usually handled by introducing the
variable z = x1 .
We shall consider the series expansion of the solution φ about the point x0 ,
in the form X
φ(x) = ( x − x0 )α Cn ( x − x0 )n (315)
n=0
and
∂ 2 φ(x) X
= α ( α − 1 )( x − x0 )α−2 C n ( x − x0 ) n
∂x2 n=0
68
X
+ 2 α ( x − x0 )α−1 n Cn ( x − x0 )n−1
n=1
X
α
+ ( x − x0 ) n ( n − 1 ) Cn ( x − x0 )n−2
n=2
X
α+n−2
= Cn ( x − x0 ) n(n − 1) + 2nα + α(α − 1)
n=0
X
= Cn ( x − x0 )α+n−2 ( n + α ) ( n + α − 1 ) (317)
n=0
etc.
On substituting the series expansions for P (x), Q(x) and φ(x) into the differ-
ential equation, one obtains the equation
X
α+n−2
0 = ( x − x0 ) Cn n + α n + α − 1
n=0
X m=n
X
+ ( x − x0 )α+n−1 Pm Cn−m n − m + α
n=0 m=0
X m=n
X
+ ( x − x0 )α+n Qm Cn−m (319)
n=0 m=0
69
+ ( x − x0 )α−2 C0 α ( α − 1 )
+ ( x − x0 )α−1 α C1 ( α + 1) + P0 C0 (320)
The first equation is known as the indicial equation, and has solutions α = 0 or
α = 1. The indicial equation can be obtained directly be examining only the
lowest-order mono-nomial. Due to the principle of superposition, the coefficient
C0 can be chosen arbitrarily.
For each positive index n, the set of equations determines Cn+2 in terms of
all the lower order expansion coefficients. For example, for n = 0 one has
1
C2 = − (α + 1) P0 C1 + (P1 α + Q0 ) C0 (324)
(α + 2)(α + 1)
For α = 0, the first few terms of the solution can be written as
Q0
φa (x) = C0 ( 1 − ( x − x0 )2 + . . . )
2!
P0
+ C 1 ( x − x0 ) ( 1 − ( x − x0 ) + . . . ) (325)
2!
70
while for α = 1, one has
P0
φb (x) = C 0 ( x − x0 ) ( 1 − ( x − x0 ) + . . . ) (326)
2!
Thus, we have found only two independent solutions of the second-order differ-
ential equation. A general solution can be expressed as a linear combination of
φa (x) with C1 = 0 and φb (x).
————————————————————————————————–
Example:
Solution:
∂2
φ(x) − x φ(x) = 0 (332)
∂x2
71
where φ(x) = φ( z−z
ξ ) = Ψ(z).
0
α(α − 1) = 0 (335)
Or since the first two terms on the left-hand-side are zero, one has
∞
X ∞
X
n ( n − 1 ) Cn xn−2 − Cn xn+1 = 0 (337)
n=2 n=0
or
∞
X
2 C−1 + ( m + 3 ) ( m + 2 ) Cm+3 − Cm xm+1 = 0 (339)
m=0
72
On equating the coefficients of the various powers to zero, one obtains
C−1 = 0 (340)
which is consistent with our assumption that the series expansion starts with
C0 . The higher-order coefficients are related via
Cm
Cm+3 = (341)
( m + 3 ) ( m +2 )
Hence, we have a relation between the terms in which the powers of x are of the
form x3r . Iterating this relation yields
C3r−3
C3r =
(3r)(3r − 1)
C3r−6
= (342)
(3r)(3r − 1)(3r − 3)(3r − 4)
This leads to the unique determination of all the coefficients whose orders are
divisible by three.
73
Because of the denominator of the general term in the series, the series converges
for all values of x. The solutions show oscillations with increasing frequency and
decreasing amplitude for increasingly negative values of x. The solutions are
monotonic for positive values of x.
————————————————————————————————–
and
q−2 q−1
Q(x) = + + q0 + . . . (347)
( x − x0 ) 2 ( x − x0 )
the indicial equation can be found by examining the lowest term in the expansion
C0 ( x − x0 )α . The indicial equation is found as
Example:
d2 R 2 dR l(l + 1)
− − + R = 0 (350)
dr2 r dr r2
which has a regular singular point at r = 0.
Solution:
74
The equation can be solved by applying the Frobenius method by assuming
the solution is of the form
X
R(r) = rα Cn r n (351)
n=0
The indicial equation can be found directly by examining the coefficient of the
lowest-order mononomial C0 rα . Thus, we have
α l(l + 1)
− α ( α − 1 ) C0 rα−2 − 2 C0 rα−1 + C0 rα = 0 (352)
r r2
Therefore, the indicial equation leads to
————————————————————————————————–
Example:
d2 R 2 dR l(l + 1)
− − + R = k2 R (354)
dr2 r dr r2
Solution:
75
This ansatz is substituted into the differential equation, leading to the polyno-
mial equation
∞ ∞
X 2 X
− ( n + α ) ( n + α − 1 ) Cn x(α+n−2) − ( n + α ) Cn xn+α−1)
n=0
x n=0
X ∞
l(l + 1)
+ − 1 Cn x(n+α) = 0
x2 n=0
(357)
Combining the terms of the same power, we have
∞
X ∞
X
( n + α ) ( n + α + 1 ) − l ( l + 1 ) Cn x(α+n−2) = Cn xn
n=0 n=0
∞
X X∞
( n + α − l ) ( n + α + l + 1 ) Cn x(α+n−2) = Cn−2 xn−2
n=0 n=2
(358)
If one were to move all the non-zero terms to one side of the equation, it is
seen that this equation is equivalent to equating a polynomial with zero. A
polynomial is only equal to zero, for all values of a continuous variable x, if the
coefficients are of the polynomial are all identical to zero. That is, the coefficients
of the terms with like powers must be identical. On equating coefficients of the
same power, one finds
( n + α − l ) ( n + α + l + 1 ) Cn = − Cn−2 (359)
However, the general set of linear equations reduces to the recursion relation
( n + α − l ) ( n + α + l + 1 ) Cn = ( − 1 ) Cn−2 (362)
76
This series does not terminate, and so one has
1
Cn = ( − 1 ) Cn−2 (363)
( n + α − l ) ( n +α + l + 1 )
n ( α − l )!! ( α + l + 1 )!!
Cn = ( − 1 ) 2 C0 (364)
( n + α − l )!! ( n + α + l + 1 )!!
( 2 m )!! = 2m m! (366)
( 2 m + 1 )!
( 2 m + 1 )!! = (367)
2m m!
Instead of using the explicit expression for odd n, we shall use the relation
for even n to define the factorial expression for half integer n. That is, we shall
define the half-integer factorial via
m+ 21 1
2 m + 1 !! = 2 m+ ! (368)
2
77
where we have defined the factorial for half-integer n. It can be seen, from the
ratio test, that this series converges as the ratio of successive terms is − m12 , for
large m. The first few terms of the solution are given by
( k r )2
l
Rl (r) = C0 ( k r ) 1 − + ... (370)
2 (2 l + 3 )
This series solution is related to the Bessel function of half-integer order, also
known as the spherical Bessel function.
————————————————————————————————–
Problem:
Show that the series solution for Rl (x) satisfies the recursion relations
2l + 1
Rl+1 (x) + Rl−1 (x) = Rl (x) (371)
x
and
∂Rl (x)
l Rl−1 (x) − ( l + 1 ) Rl+1 (x) = ( 2 l + 1 ) (372)
∂x
Problem:
Verify that
sin x
j0 (x) = (373)
x
and
sin x − x cos x
j1 (x) = (374)
x2
are solutions of the differential equation for l = 0 and l = 1. Show that
on expanding these solutions in powers of x, one recovers (up to an arbitrary
multiplicative constant) the Frobenius series solutions which are non-singular
at x = 0.
Problem:
Using the above recursion relations, find explicit expressions for the solutions
for l = 2 and l = 3.
Homework: 8.5.5
Homework: 8.5.6
78
Homework: 8.5.12
————————————————————————————————–
Example:
∂2φ ∂φ
( 1 − x2 ) − 2x + l(l + 1)φ = 0 (375)
∂x2 ∂x
Solution:
or
∞
X
Cn ( n + α ) ( n + α − 1 ) x(n+α−2) =
n=0
∞
X
= Cn−2 (n + α − 2)(n + α − 1) − l(l + 1) x(n+α−2)
n=2
(378)
Cn ( n + α ) ( n + α − 1 ) = Cn−2 ( n + α − l − 2 ) ( n + α + l − 1 )
(379)
79
The indicial equation is found by setting n = 0, and yields
α(α − 1) = 0 (380)
For α = 1 the solution is odd in x and we find that the coefficients are
related via
(n − l + 1)(n + l)
Cn = Cn−2 (385)
n(n + 1)
For large n, the coefficients are of the same order of magnitude. So by the
ratio test, one expects that the series may diverge when x = ± 1, unless the
coefficients are zero for large n. In fact, this is the case for arbitrary l. However,
when l is an integer, say N , the series truncates,
(N − l + 1)(N + l)
0 = CN +2 = CN (386)
(N + 2)(N + 1)
and the solution is x multiplied by an even polynomial in which the highest
term is of power N = l − 1.
C2 2 C4 4
φl (x) = x C0 1 + x + x + ... (387)
C0 C0
80
Hence, in the case α = 1, the solution is an odd polynomial of order l.
————————————————————————————————–
Problem:
where
l
2 l even
smax = l −2 (389)
2 l odd
and the coefficients are given by
(l − 2s + 1)(l − 2s + 2)
As = − As−1
2s(2l − 2s + 1)
l!
As = ( − 1 )s A0
( l − 2 s )! 2s s! ( 2 l − 2 s + 1 ) . . . ( 2 l − 1 )
l! ( 2 l − 2 s )! 2l l!
As = ( − 1 )s A0 (390)
( l − 2 s )! 2s s! ( 2 l )! 2(l−s) ( l − s )!
( − 1 )s ( l! )2 ( 2 l − 2 s )!
As = A0 (391)
s! ( 2 l )! ( l − 2 s )! ( l − s )!
Problem:
Show that the series solution, for integer l, satisfies the recursion relation
————————————————————————————————–
81
Example:
82
if one defines C−1 = 0. This polynomial is only zero if the coefficients of the
various powers are all equal to zero.
Examining the coefficient of the lowest power, xα−2 , and equating the coef-
ficient of the mononomial to zero, one finds the indicial equation
α 2 C0 = 0 (400)
α2 = 0 (401)
0 = n2 Cn + ( m − n + 1 ) Cn−1 (402)
83
If the series truncates at the N -th order term, one must have CM = 0 for
all M > N . This can only be the case if CN +1 = 0 and CN 6= 0. In this
case, the recursion relation becomes
(N − m)
0 = CN +1 = CN (408)
( N + 1 )2
————————————————————————————————–
Example:
Solution:
The equation can be solved using the Frobenius method with the ansatz
∞
X
φm (x) = xα C n xn (410)
n=0
84
The indicial equation is found from examining the coefficient of lowest non-
zero mononomial xα−2 , where n = 0,
α ( α − 1 ) C0 = 0 (414)
hence, either α = 0 or α = 1.
that is, the series terminates if m = N in which case the series is a polynomial
of order N .
We note that if the series does not truncate the large n behavior of the
coefficients is governed by
2
Cn ∼ Cn−2 (418)
n
so one has
2s
C2s ∼ C0 (419)
s!
and the series would exponentiate at large x.
85
4.3 Linear Dependence
In the Frobenius method we look for a power series solution, by substituting
the power series into the equation. Let us examine only the first N terms in
the series solution. After some re-organization, one finds a polynomial which
is equated to zero. This polynomial equation is solved by insisting that if the
polynomial in x is equal to zero, then all the coefficients of the polynomial are
zero. This condition then leads to the recursion relation etc. If the series con-
verges in the limit N → ∞, one has a solution.
X−1
n=N
kN xN = kn xn (422)
n=0
for any set of kn which contains non-zero values. This is formalized by the
statement that a set of functions φn = xn are linearly independent if the only
solution of X
k n φn = 0 (423)
n
is that kn = 0 for all n. On the other hand, if one has a set of functions φα
such as the set containing all the N-th lowest-order mononomials, xn , and the
86
function φN +1 is expressible as φN +1 = a x2 + b x + c then there are
non-zero or non-trivial solutions of the equation
N
X +1
k α φα = 0 (424)
α=0
k0 = c
k1 = b
k2 = a
kN +1 = 1 (425)
In this case, the set of functions φα are linearly dependent at least one of them
can be expressed as a linear combination of the others.
is that
kα = 0 ∀α (427)
Otherwise, if a non-trivial solution exists the set of functions are linearly de-
pendent.
and
N
X ∂ 2 φα
kα = 0 (430)
α=1
∂x2
87
etc. The set of all these equations must be satisfied if the original equation
is satisfied for a continuous range of x, since the derivatives of the equation
can be found by subtracting the equation at infinitesimally different values of
x ie, x + dx and x, and then dividing by dx. The set of N coefficients kα are
completely determined from N independent equations, so we shall truncate the
set of equations with
N
X ∂ N −1 φα
kα = 0 (431)
α=1
∂xN −1
Since the functions φα and the derivatives are known, this is a set of N linear
equations for N unknowns and can be written as a matrix equation
φ1 φ2 ... ... φN
k1 0
∂φ1 ∂φ2
. . . . . . ∂φN k2 0
∂x ∂x ∂x
... ... ... ... ... . = . (432)
... ... ... ... ... . .
∂ N −1 φ1 ∂ N −1 φ2 N −1
∂x N −1 ∂x N −1 . . . . . . ∂ N −1
∂x
φN kN 0
The matrix has an inverse if it’s determinant is non-zero. In this case, the only
solution consists of the trivial solution kn = 0 for all n. If the determinant is
non-zero, the set of functions φn are linearly independent.
If the determinant is zero, then non-zero solutions exist for kn for some n
and the set of functions φn are linearly dependent.
As examples one can find that the functions sin x and cos x are linearly
independent as
sin x cos x
cos x − sin x = − 1 (435)
88
which is non zero. Although, sin x is related to cos x via
p
sin x = ± ( 1 − cos2 x ) (436)
this relation is non-linear.
It is easy to show that the functions exp x, exp −x and cosh x are linearly
dependent as
exp x
exp −x cosh x
exp x
− exp −x sinh x = 0 (437)
exp x exp −x cosh x
which shows that it is possible to find a linear relationship between the functions
such as
exp x + exp −x = 2 cosh x (438)
The concept of linear dependence and linear independence is common to
the theory of vectors in d-dimensional spaces. A set of vectors φ̂n are linearly
independent if non-zero constants kn can not be found such that
n=d
X
kn φ̂n = 0 (439)
n=1
In such cases, the set of φ̂n can be used as a basis for the d-dimensional space,
→
−
in that an arbitrary vector Φ can be expanded as
n=d
→
− X
Φ = Cn φ̂n (440)
n=1
→
−
That is, the vector Φ and the set of basis vectors are form a linearly depen-
dent set. To see the relation we should identify kd+1 = 1 as the multiplier
→
−
of φ̂d+1 = Φ and kn = Cn as the multipliers for the rest of the φ̂n . Thus,
the kn or Cn can be considered as the components of the vector and the set of
linearly independent φ̂n as providing the basis vectors.
89
4.3.1 Linearly Independent Solutions
For an N -th order linear differential equation one expects that there exist N
linearly independent solutions φn . The general solution is expected to be able
to be written as a linear superposition of the N linearly independent solutions
as
n=N
X
φ = C n φn (442)
n=1
φ1 φ2 . . . . . . φN +1
. . . . . . ∂φ∂x
∂φ1 ∂φ2
N +1
∂x ∂x
W = . . . ... ... ... . . . (444)
... . . . . . . . . . . . .
N
∂ φ1 ∂ N φ2 . . . . . . ∂ φN +1
∂xN ∂xN ∂x
On substituting from the differential equation for the N -th order derivative into
the determinant, one finds the Wronskian is zero as the last row can be ex-
pressed as a linear combination of the (N − 1) higher rows. Hence, the set of
N + 1 solutions are linearly dependent contrary to our initial assumption. Thus,
at most, there are only N linearly independent solutions of the general N -th
order differential equation.
∂2φ ∂φ
+ p(x) + q(x) φ = 0 (445)
∂x2 ∂x
90
then their Wronskian W is defined as a function of x
φ
1 φ2 ∂φ2 ∂φ1
W = ∂φ1 ∂φ2 = φ1 − φ2
∂x ∂x ∂x ∂x
Abel’s theorem states that the x-dependence of the Wronskian W (x) is deter-
mined from the differential equation through p(x). On taking the derivative of
the Wronskian, one finds
∂W ∂ ∂φ2 ∂ ∂φ1
= φ1 − φ2
∂x ∂x ∂x ∂x ∂x
2 2
∂ φ2 ∂ φ1
= φ1 − φ2 (446)
∂x2 ∂x2
On substituting the second-order derivatives found from the differential equation
∂2φ ∂φ
= − p(x) − q(x) φ (447)
∂x2 ∂x
into the expression for the derivative of the Wronskian, one obtains
∂W ∂φ2 ∂φ1
= − φ1 p(x) + φ2 p(x)
∂x ∂x ∂x
= − p(x) W (x) (448)
Thus, if W (a) is non-zero because the solutions are linearly independent, then
as long as p(t) is not complex one finds that W (x) is always non-zero and the
solutions are always linearly independent. For linearly independent solutions,
the Wronskian is determined up to an arbitrary multiplicative constant since the
solutions φ are only determined up to a multiplicative constant. Furthermore,
if p(x) = 0 then the Wronskian is simply a constant.
91
∂φ2
2 φ2 ∂φ1
∂x
= ( φ1 (x) ) − 2
φ1 φ ∂x
1
∂ φ2
= ( φ1 (x) )2 (450)
∂x φ1
Thus,
∂ φ2 W (x)
= (451)
∂x φ1 φ1 (x)2
and so one obtains the second solution of the second-order differential equation
through one integration
Z x
W (t)
φ2 (x) = φ1 (x) dt (452)
φ1 (t)2
∂2φ
+ φ = 0 (453)
∂x2
which if knows one solution φ1 = sin x, then one can find a second solution.
Since p(x) = 0 the Wronskian is just a constant W , hence
Z x
dt W
φ2 (x) = sin x
sin2 t
= W sin x − cot x
= − W cos x (454)
92
5 Stürm Liouville Theory
Consider the Stürm-Liouville eigenvalue equation for the unknown function φ
in the form
∂2φ ∂φ
a2 (x) 2
+ a1 (x) + a0 (x) φ = λ u(x) φ (455)
∂x ∂x
where λ is an unknown number, and all the functions are defined on an interval
between x = a and x = b. The solutions are expected to satisfy boundary
conditions. The functions a0 (x) and u(x) are non zero in the interval (a, b). For
convenience we shall consider the case where u(x) > 0. The case of negative
u(x) can be treated by changing the definition of λ to accommodate the sign
change. The number λ is regarded as unknown and is called the eigenvalue. The
possible values of the eigenvalue are found from the condition that a solution
φ(x) exists which satisfies the boundary conditions. When the solutions φλ (x)
are found to exist the eigenvalues have particular values. The set of eigenvalues
may take on either discrete or continuous values. The function φλ (x) is called
an eigenfunction, and each eigenfunction corresponds to a particular eigenvalue
λ. Sometimes a particular eigenvalue λ may correspond to two or more different
eigenfunctions. In this case, the eigenvalue is said to be degenerate. Any lin-
ear combination of the degenerate eigenfunctions is also an eigenfunction with
the same eigenvalue. The number of linearly independent eigenfunctions corre-
sponding to this eigenvalue is the degeneracy of the eigenvalue.
93
or Z x
a1 (t)
p(x) = p(a) exp dt (460)
a a2 (t)
It is because of this that we required that a2 (t) be non-vanishing in (a, b).
In this case, we see that p(x) = w(x) = 1 and q(x) = 0. For each arbitrary
real value of λ one can find two solutions of the differential equation which do
not satisfy the two boundary conditions
√
f±λ (x) = exp ± λ x (463)
The functions f±λ (x) can be combined to yield functions φλ (x) that satisfies
the boundary condition at x = 0 as
√ √
φλ (x) = exp + λ x − exp − λ x (464)
94
Thus, the values of the eigenvalues λn are determined by the condition that
φn (x) exists and satisfy the boundary conditions.
Problem:
Find the general solutions of the above equation, using the Frobenius method,
without imposing boundary conditions.
If boundary conditions are applied, can the series be used directly to find
the possible set of eigenvalues?
————————————————————————————————–
Example:
∂2φ
− x2 φ = λ φ (470)
∂x2
subject to the boundary conditions that φ(x) vanishes in the limits x → ± ∞.
Solution:
Here p(x) = w(x) = 1 and q(x) = − x2 . The eigenvalue equation can be
solved near the origin by the Frobenius method. The indicial equation is simply
α(α − 1) = 0 (471)
or
X
φ1 = x C n xn (473)
n=0
Furthermore, from the recursion relation one finds that the odd coefficients
vanish and that the even coefficients must satisfy
Thus, the solutions are either even φ0 (−x) = φ0 (x) or odd φ1 (−x) = − φ1 (x)
in x. The Frobenius method can not be expected to converge in the limits
x → ± ∞. Therefore, we need to examine the asymptotic large x behavior.
95
In this case, one can neglect the eigenvalue compared to the x2 term. The
approximate equation is given by
∂2φ
− x2 φ ∼ 0 (475)
∂x2
This equation can be approximately factorized as
∂ ∂
− x + x φ ∼ 0 (476)
∂x ∂x
x2
φ(x) ∼ exp − (478)
2
Thus, we may look for solutions of the forms of power series in increasing
powers of x2 times a decreasing exponential function. That is we either have
the even function
x2
X
φ0 (x) = P2n x2n exp − (479)
n=0
2
Using this form we can determine the power series with coefficients Pn by using
the Frobenius method.
x2
To simplify the solution we shall substitute φ(x) = p(x) exp − 2 into
the equation and after cancelling out the common exponential factor one finds
∂ 2 p(x) ∂p(x)
2
− 2x − p(x) = λ p(x) (481)
∂x ∂x
Then the indicial equation for p(x) is
α(α − 1) = 0 (482)
96
The recursion relation becomes
————————————————————————————————–
In the above two examples, the set of eigenfunctions share the common
property that
Z b
dx φn (x) w(x) φm (x) = 0 (488)
a
where λn 6= λm . This is a general property of a set of eigenfunctions of the
Stürm-Liouville equation. This theorem implies that the successive eigenfunc-
tions change sign an increasing number of times. That is the eigenfunctions can
be classified by their number of nodes.
Problem:
97
obey the equation
Z L
dx φm (x) φn (x) = 0 (490)
0
if λm 6= λn .
————————————————————————————————–
Example:
Solution:
φ1 (x) = sin k x
φ2 (x) = cos k x (492)
Since any linear combinations of these are also eigenfunctions, one finds that
the complex functions
φ01 (x) = exp + i k x
φ02 (x) = exp − i k x (493)
————————————————————————————————–
98
5.2 The Inner Product
The inner product of two functions Ψ(x) and Φ(x) is defined as the weighted
integral
Z b
dx Ψ∗ (x) w(x) Φ(x) (494)
a
Two functions are said to be orthogonal if their inner product is zero. The inner
product of a function with itself is the normalization of the function. If w(x) is
a real, non-zero, function the normalization is a real number. It is customary to
demand that all functions Ψ(x) are normalized to unity. Thus, we insist that
Z
dx Ψ∗ (x) w(x) Ψ(x) = 1 (495)
and noting that the basis vectors form an orthogonal set so that
one has
d
→
− → − X
A . B = An B n (499)
n=1
The inner product of two functions can be evaluated in the same way. First
the functions are expanded in terms of the basis functions φn (x) with compo-
nents An and Bn
Xd
Φ(x) = An φn (x) (500)
n=1
and
d
X
Ψ(x) = Bn φn (x) (501)
n=1
99
Then, if the basis functions form an orthonormal set so that
Z b
dx φ∗n (x) w(x) φm (x) = δn,m (502)
a
100
where the term on the right-hand-side is recognized as the inner product. The
term on the left-hand-side is evaluated by splitting into two parts and integrating
each term by parts. The terms of the form
∂φ∗2 ∂φ1
p(x) (509)
∂x ∂x
cancel identically, leaving only the boundary terms
b b
∂φ∗2 (x) ∗ ∂φ1 (x)
φ1 (x) p(x) − φ2 (x) p(x)
∂x a ∂x a
Z b
= ( λ∗2 − λ1 ) dx φ∗2 (x) w(x) φ1 (x) (510)
a
However, on either using boundary conditions such that the functions vanish on
the boundaries
φ(a) = φ(b) = 0 (511)
or boundary conditions where the derivative vanishes at the boundaries
∂φ ∂φ
= = 0 (512)
∂x a ∂x b
then one finds that
Z b
( λ∗2 − λ1 ) dx φ∗2 (x) w(x) φ1 (x) = 0 (513)
a
This is the central result. It proves the theorem. First if φ2 = φ1 then the
normalization integral is finite and non-zero, so λ∗1 = λ1 . Thus, the eigenvalues
of the Stürm-Liouville equation are real. Using this we have
Z b
( λ2 − λ1 ) dx φ∗2 (x) w(x) φ1 (x) = 0 (514)
a
101
5.4 Orthogonality and Linear Independence
Given a set of mutually orthogonal functions φn (x), one can easily show that
they are linearly independent. For if one has
X
kn φn (x) = 0 (516)
n
then one can take the inner product with any one of them, say φm (x), to find
X Z b
kn dx φ∗m (x) w(x) φn (x) = 0
n a
X
kn δn,m = 0
n
km = 0 (517)
Thus, the only solution of the equation is that all the km are identically zero.
Hence, any set of mutually orthogonal functions are linearly independent.
102
It can be seen that ψ2 (x) is orthogonal to ψ1 (x) by direct substitution in
Z b
dx ψ1∗ (x) w(x) ψ2 (x) = 0 (521)
a
103
Therefore, it is always possible to construct a set of orthonormal basis func-
tions from the set of Sturm-Liouville eigenfunctions. We shall always assume
that the set of eigenfunctions of a Stürm-Liouville equation have been chosen
as an orthonormal set.
104
q
3
Hence, the orthogonal polynomial ψ1 (x) = 2 x and is an odd function of x.
and C2 is found as r
45
C2 = (533)
8
The orthogonalization of the next polynomial is non-trivial. The process
starts with φ3 (x) = x3 , so one finds
Z 1
3 3 4
ψ3 (x) = C3 x − x dt t
2 −1
3
= C3 x3 − x (534)
5
etc.
This set of polynomials, apart from multiplicative constants, are the same
as the set of Legendre polynomials Pn (x)
P0 (x) = 1
P1 (x)= x
3 x2 − 1
P2 (x) =
2
5 x3 − 3 x
P3 (x) =
2
(2n)! xn + . . .
Pn (x) = (535)
2n (n!)2
Problem:
Construct the first four orthogonal polynomials on the interval (−∞, +∞)
with weight factor w(x) = exp[ − x2 ]. Show, by direct substitution in the
105
equation, that these are the same polynomials p(x) that occur in the eigenfunc-
2
tions φ(x) = p(x) exp[ − x2 ] of
∂2φ
− x2 φ = λ φ (536)
∂x2
subject to the boundary conditions that the functions φ(x) vanishes in the limits
x → ± ∞.
Problem:
In general the number of linearly independent basis vectors that form a com-
plete set is equal to the dimension of the vector space. Also, if the basis is not
complete, then the set of vectors that can be expressed as a linear combination
of the incomplete basis set forms a vector space which has dimensions equal to
the number of linearly independent basis vectors in the incomplete set.
106
For the two dimensional space, the set of the two basis vectors span the en-
tire vector space and the basis is said to be complete. However, in a three-
dimensional vector space, the above combination only describes a two dimen-
sional plane in the three-dimensional volume. The two-dimensional basis vectors
do not span the three-dimensional vector space and the set of two basis vectors
is therefore said to be incomplete.
In this case of a three dimensional vector space, the set of the three basis vectors
is complete.
p(x) = p0 + p1 x + p2 x2 (540)
By a general well behaved function, we require that our function satisfy the
same boundary conditions as the basis functions. Also, the inner product of
Ψ(x) with itself must exist. These functions lie in the space C2 , i.e., the space
of square integrable complex functions.
It is easy to show that a function Ψ(x) in this space has a unique expansion
in terms of the basis. That is the expansion coefficients are uniquely determined
107
since by use of the inner product of Ψ(x) with any one of the basis functions,
say φm (x), one can uniquely determine the expansion coefficient Cm .
Z b X Z b
∗
dx φm (x) w(x) Ψ(x) = Cn dx φ∗m (x) w(x) φn (x)
a n a
X
= Cn δn,m
n
= Cm (542)
That is, the expansion coefficient Cm is uniquely determined by the inner prod-
uct of Ψ(x) and φm (x).
then we have Z b
dx φ∗m (x) w(x) δ(x − x0 ) = Cm (546)
a
which is simply evaluated as
Cm = φ∗m (x0 ) w(x0 ) (547)
Therefore, the Dirac delta function has an expansion of the form
X
δ(x − x0 ) = φ∗n (x0 ) w(x0 ) φn (x) (548)
n
Since the delta function is symmetrical in x and x0 one can also write
X
δ(x − x0 ) = φ∗n (x0 ) w(x) φn (x) (549)
n
108
because the left-hand-side is zero unless x = x0 .
109
and the inverse transform
Z +∞
˜ 1
f (k) = dx √ exp − i k x f (x) (557)
−∞ 2π
Completeness of the eigenfunctions of a Stürm-Liouville equation has to be
proved on a case by case basis. The proof usually consists of showing that the
expansion in a series of eigenfunctions
N
X
Cn φn (x) (558)
n=0
one finds that convergence requires that the equality sign in Bessel’s inequality
holds as N → ∞.
110
which is a statement that the squared length of a vector must be greater than
the square of any one component.
111
6 Fourier Transforms
The plane waves are eigenfunctions of the Stürm-Lioville equation
∂ 2 φk
= − k2 φ (563)
∂x2
on the interval (−∞, ∞) has solutions
1
φk (x) = √ exp ikx (564)
2π
The eigenfunctions satisfy the orthogonality relations
Z ∞ Z ∞
dx
dx φ∗k0 (x) φk (x) = exp i ( k − k 0 ) x
−∞ −∞ 2 π
= δ( k − k 0 ) (565)
————————————————————————————————–
Example:
112
Find the frequency-dependent response function given by the Fourier Transform
χ̃(ω). The divergences only occur in the lower-half complex frequency plane.
————————————————————————————————–
113
The Fourier transform of the n-th order derivative can be obtained by re-
peated differentiation by parts and is given by
Z ∞ n
∂ f (x) 1
dx √ exp − i k x
−∞ ∂xn 2π
n ˜
= ( i k ) f (k) (575)
————————————————————————————————–
Example:
and
∂φ(x, t)
= g(x) (578)
∂t
t=0
Fourier transforming the equation with respect to x, one has
Z +∞ 2 Z +∞ 2
1 ∂ φ 1 1 ∂ φ
dx √ exp − i k x 2
= 2
dx √ exp − i k x
−∞ 2π ∂x c −∞ 2π ∂t2
1 ∂ 2 φ̃(k, t)
− k 2 φ̃(k, t) = (579)
c2 ∂t2
and
Z +∞
∂ φ̃(k, t) 1 ∂φ(x, t)
= dx √ exp − ikx
∂t
t=0 −∞ 2π ∂t
t=0
114
Z +∞
1
= dx √ exp − ikx g(x)
−∞ 2π
= g̃(k) (581)
1 ∂ 2 φ̃(k, t)
− k 2 φ̃(k, t) = (582)
c2 ∂t2
subject to the two initial conditions given by
and
∂ φ̃(k, t)
= g̃(k) (584)
∂t
t=0
The second-order ordinary differential equation has the general solution
φ̃(k, t) = A exp i c k t + B exp − i c k t (585)
where A and B are arbitrary constants. The initial conditions determine A and
B through
f˜(k) = A + B (586)
and
g̃(k) = i c k ( A − B ) (587)
Hence, we have determined the constants as
1 ˜ g̃(k)
A = f (k) − i (588)
2 ck
and
1 g̃(k)
B = f˜(k) + i (589)
2 ck
Thus, the Fourier Transform of the solution is given by
" #
1 ˜ g̃(k) ˜ g̃(k)
φ̃(k, t) = f (k) − i exp + i c k t + B f (k) + i exp − i c k t
2 ck ck
(590)
and then the solution is given by the inverse Fourier Transform
Z +∞
1
φ(x, t) = dk φ̃(k, t) √ exp + i k x (591)
−∞ 2π
115
or
Z +∞
1 g̃(k)
1
φ(x, t) = dk f˜(k) − i √
exp i k ( x + c t )
2 −∞ ck 2π
Z+∞
1 g̃(k) 1
+ dk f˜(k) + i √ exp i k ( x − c t )
2 −∞ ck 2π
(592)
The integrals can be evaluated from the definition of the inverse Fourier Trans-
form yielding
Z x+ct
1 1
φ(x, t) = f( x + c t ) + dz g(z)
2 c a
Z x−ct
1 1
+ f( x − c t ) − dz g(z) (593)
2 c a
where the arbitrary constant of integration cancels. In fact, using this cancella-
tion, one can write the solution as
1
φ(x, t) = f( x + c t ) + f( x − c t )
2
Z x+ct
1
+ dz g(z) (594)
2 c x−ct
This is D’Alembert’s solution of the wave equation and corresponds to a super-
position of a backward and forward travelling wave.
————————————————————————————————–
116
Z ∞ Z ∞
1 1
= dk dt g(t) √ exp − i k t f˜(k) √ exp + i k x
−∞ −∞ 2π 2π
Z ∞
1
= dk g̃(k) f˜(k) √ exp + i k x (596)
−∞ 2π
which is the Fourier Transform of the product of Fourier Transforms. Inversely,
the inverse Fourier transform of a convolution is merely the product of Fourier
transforms.
————————————————————————————————–
Example:
where the stimulus occurs at a time t0 that is earlier than the response time.
This expresses causality. The integral over t0 in this relation can be extended
to ∞ as Z ∞
A(t) = dt0 χ(t − t0 ) B(t0 ) (598)
−∞
if we define
χ(t − t0 ) = 0 for t0 > t (599)
Thus, the response has the form of the convolution.
The applied field can be Fourier Transformed into its frequency components,
and also the response can be frequency resolved into the components Ã(ω). The
relation between the frequency components of the response and the applied can
be obtained by Fourier Transforming the linear relation, which yields
Ã(ω) = χ̃(ω) B̃(ω) (600)
Therefore, the response relation simplifies in the frequency domain.
————————————————————————————————–
117
and can be derived using the completeness relation
Z +∞ Z +∞ Z +∞
∗ ˜ 1
dx f (x) g (x) = dx dk f (k) √ exp + i k x g ∗ (x)
−∞ −∞ −∞ 2π
Z +∞ Z +∞ Z +∞
1 1
= dx dk f˜(k) √ exp + i k x dk 0 g̃ ∗ (k 0 ) √ exp − i k 0 x
−∞ −∞ 2π −∞ 2π
Z +∞ Z +∞ Z +∞
dx
= dk f˜(k) dk 0 g̃ ∗ (k 0 ) exp + i ( k − k 0 ) x
−∞ −∞ −∞ 2 π
Z +∞ Z +∞
= dk f˜(k) dk 0 g̃ ∗ (k 0 ) δ( k − k 0 )
−∞ −∞
Z +∞
= dk f˜(k) g̃ ∗ (k) (602)
−∞
118
7 Fourier Series
The Stürm-Liouville equation
∂2φ
+ n2 φ = 0 (603)
∂x2
with eigenvalue − n2 has solutions cos nx and sin nx that can be used to form
an orthonormal set, as
Z 2π
dx sin mx sin nx = π δn,m m = 6 0
0
Z 2π
dx cos mx cos nx = π δn,m m 6= 0
0
Z 2π
dx sin mx sin nx = 0 (604)
0
This set of eigenfunctions generates the finite Fourier series expansion, whereby
any well behaved function on the interval (0, 2π) can be expanded as
∞
1 X 1 1
f (x) = a0 √ + an √ cos nx + bn √ sin nx (607)
2π n=1
π π
119
This leads to an explicit form of the completeness condition, in which we define
the dirac delta function restricted to the interval (0, 2π) to be ∆(x − t) so
1 1 X
∆(x − t) = + ( cos nx cos nt + sin nx sin nt )
2π π n=1
1 1 X
= + cos n ( x − t ) (609)
2π π n=1
It can be shown that the set of Fourier expansion coefficients (an , bn ) in the
expansion of f (x) are the coefficients that minimize the difference
N
a0 X 1
χ(x) = f (x) − √ − √ an cos nx + bn sin nx (610)
2π n=1
π
and
∂I ∂I
= = 0 ∀n ≤ N (612)
∂an ∂bn
This leads to the equations
Z 2π
1
a0 = √ dt f (t)
2π 0
Z 2π
1
an = √ dt f (t) cos nt
π 0
Z 2π
1
bn = √ dt f (t) sin nt (613)
π 0
For these values of the expansion coefficients, the difference χ(x) only vanishes
for almost all values of x if
I = 0 (614)
This condition allows the function to deviate from the Fourier series only at a
set of isolated points that contribute zero to the integral. The condition I = 0
is equivalent to Bessel’s inequality being satisfied, since
Z 2π N
X
2
I = dt f (t) − a20 − a2n + b2n (615)
0 n=1
The quantity a2n + b2n (which is a function of n) is known as the power spectrum
of f (x).
120
The basis functions φn (x) are continuous at every point in the interval (0, 2π)
yet, it is possible to expand a square integrable f (x) with a finite number of
discontinuities as a Fourier series. If f (x) has a discontinuity at x = x0 it can
be proved that the series converges to the value of
1
lim f (x0 + ) + f (x0 − ) (616)
→ 0 2
f (x) = x 0 ≤ x < π
f (x) = x − 2π π < x ≤ 2π (617)
Using integration by parts one finds that the coefficients of the cosine terms are
also zero
Z 2π
1
an = √ dt f (t) cos nt
π 0
Z π Z 2π
1
= √ dt f (t) cos nt + dt f (t) cos nt
π 0 π
Z π Z 2π
1
= √ dt t cos nt + dt ( t − 2 π ) cos nt
π
0 π
1 cos nπ − 1 cos nπ − 1
= √ −
π n2 n2
an = 0 (619)
121
Z π Z 2π
1
= √ dt f (t) sin nt + dt f (t) sin nt
π 0 π
Z π Z 2π
1
= √ dt t sin nt + dt ( t − 2 π ) sin nt
π 0 π
1 1 1 − cos nπ
= √ 2π − +
π n n
√ cos nπ
= −2 π
n
√ 2
bn = π ( − 1 )n (620)
n
and are non-zero. Hence, we have the Fourier series expansion
∞
X ( − 1 )n
f (x) = 2 sin nx (621)
n=1
n
Homework: 14.1.4
Problem:
Expand x2 and x4 in a Fourier series and then evaluate the series at the
point x = π.
1 2 X sin(2n + 1)x
f (x) = + (624)
2 π n=0 2n + 1
122
The Fourier series assigns a value to the function at the discontinuity, which is
the mean from just above and below. Furthermore, the Fourier series truncated
after N terms, also over-estimates the function or overshoots it just before the
discontinuity. This is the Gibbs phenomenon.
or, equivalently
∞
1 1 X
∆(x − t) = + Real exp i n ( x − t ) (626)
2π π n=1
Hence, on defining the sum of the first N terms of the Fourier series for f (x) as
fN (x) one has
Z 2 π
fN (x) = dt f (t) ∆N (x − t)
0
" #
Z 2π 1
1 sin ( N + 2 ) ( x − t)
= dt f (t) x − t )
2π 0 sin ( 2
(628)
123
The Gibbs phenomenon can be demonstrated by the square wave in which case
Z π
fN (x) = dt 1 ∆N (x − t)
0
" #
sin ( N + 12 ) ( x − t )
Z π
1
= dt (629)
2π 0 sin ( x − t ) 2
1
The integral can be evaluated in the limit N → ∞ by writing z = ( N + 2 ) y,
so that
Z ∞ " #
1 sin z
lim fN (π) = dz
N →∞ π 0 z
1
= (632)
2
The over shoot at x = 0 can be estimated by rewriting the partial sum, by
shifting the variable of integration to s = x − t
!
sin ( N + 12 ) s
Z x
1
fN (x) = ds (633)
2 π x−π sin 2s
N+ 1
The integrand is symmetrical in s and has a maximum value of π 2 at s = 0
and first falls to zero at s = ± N π+ 1 . We shall examine the behavior of the
2
series at the discontinuity at x = 0. At x = 0 the integral over s starts with a
small value of the integrand which oscillates about zero and the integrand then
attains its maximum value at the upper limit of integration. However, it is clear
to see that the integral will be greater if the range of integration of s covers both
positive regions around the central maximum. This occurs when x = N π+ 1 .
2
In this case, one estimates the maximum value of the partial sum is given by
Z π1 " #
1
1 N+
2
sin ( N + 2 ) s
fN (x) ∼ ds
max 2 π −π sin 2s
124
Z π1 " #
1
1 1 N+
2
sin ( N + 2 )s
= + ds
2 2π 0 sin 2s
Z π " #
1 1 sin z
∼ + dz (634)
2 π 0 z
where
1
z = (N + )s (635)
2
The second integral is greater than 0.5, its’ value is 0.588. The truth of this in-
equality can be seen by plotting the integrand. The integrand when considered
as function of s is oscillatory, with a constant period ∼ 2π N . As x is increased
in steps of 2π
N , the upper limit of s includes one more cycle of the integrand,
and the upper limit of the z integration increases by a multiple of 2π. Each
successive half cycle yields a contribution of opposite sign and smaller magni-
tude than the previous half cycle. Thus, when taken in pairs, the contribution
from the entire z interval of (0, ∞) [which contributes 0.5] is smaller than the
contribution from the first half cycle. Therefore, the series over shoots at the
discontinuity by 8%.
and any complex function Φ(ϕ) on the interval 2 π > ϕ > 0 can be expanded
as X
Φ(ϕ) = Cm φm (ϕ) (639)
m
125
The complex coefficients Cm are given by
Z 2π
1
Cm = √ dϕ exp − i m ϕ Φ(ϕ) (640)
2π 0
If the function Φ is real, then one must have
∗
C−m = Cm (641)
————————————————————————————————–
Example:
Consider a cylindrical metal sheet of radius a and infinite length that has
been cut across a diameter. The two sheets are almost touching. One half of
the cylinder is kept at a potential φ0 and the other half is kept at a potential
−φ0 . Determine the potential at an arbitrary point (z, r, ϕ) inside the cylinder.
Assume, that the potential φ(z, r, ϕ) inside the cylinder is governed by the
equation
∇2 φ = 0 (642)
or in cylindrical coordinates
∂2φ 1 ∂2φ
1 ∂ ∂φ
2
+ r + = 0 (643)
∂z r ∂r ∂r r2 ∂ϕ2
Solution:
This can be solved by noting that the problem is invariant under transla-
tions along the cylinder’s axis. Thus, the potential is only a function of (r, ϕ)
alone. Furthermore, one can assume that the potential can be expanded in a
discrete Fourier series in ϕ with arbitrary coefficients that depend on r, since
the potential is periodic in ϕ.
X Cm (r)
φ(r, ϕ) = √ exp i m ϕ (644)
m
2π
126
and integrating over ϕ between 0 and 2 π, one finds that the expansion coefficient
Cn (r) satisfies the equation
n2
1 ∂ ∂Cn (r)
r − 2 Cn (r) = 0 (646)
r ∂r ∂r r
Since, the potential must be finite at the center of the cylinder (r = 0) one
can set Bn = 0. Furthermore, from the boundary condition at r = a one has
Then, using the Fourier series expansion of φ(a, ϕ) and the orthogonality con-
dition one has
Z 2π Z π
φ0
√ dϕ exp − i n ϕ − dϕ exp − i n ϕ = An an (650)
2π π 0
4i φ0
A2n+1 = √ a−(2n+1)
2 π 2n + 1
A2n = 0 (652)
which are only non-zero for odd n. Hence, the expansion only contains terms
that have odd n. The potential is given by
∞ 2n+1
φ0 X 8 r
φ(r, ϕ) = − sin n ϕ (653)
2 π n=0 2 n + 1 a
127
————————————————————————————————–
Example:
Solution:
1 ∂2φ
1 ∂ ∂φ
r + 2 = 0 (655)
r ∂r ∂r r ∂θ2
and this can be solved by separation of variables, and then series expansion.
The solutions are sought in the form
On substitution of this ansatz into Laplace’s equation and writing the separation
constant as − µ2 , one obtains the two ordinary differential equations. These
consist of the differential equation for the angular part
∂2Θ
= − µ2 Θ (657)
∂θ2
and the radial equation
µ2
1 ∂ ∂R
r − R = 0 (658)
r ∂r ∂r r2
The angular equation has solutions
Θ(θ) = A sin µ θ
Θ(θ) = B cos µ θ (659)
However, due to the boundary conditions, the only allowable solutions are
where
π
µ = mπ (661)
b
128
and m is any positive integer. That is, the allowable values of µ are given by
µ = mb (662)
R(r) = C rµ + D r−µ
= C rmb + D r− mb
(665)
The above solution has the interesting property that when b > 1, which
means that the angle between the conducting planes is obtuse, then the radial
component of the electric field is given
∞ r
X
mb−1 2b
Er = − m b Cm r sin m b θ (668)
m=1
π
129
which is unbounded for b < 1 as r → 0. Thus, electric fields can be excep-
tionally large and have corner singularities close to the edge on a metal object
with an acute angle, such as lightning rods.
————————————————————————————————–
∇2 Φ(x, y, z) = 0 (670)
φ0 (x, y, z) = 1 (671)
φ1,c (x, y, z) = x
φ1,s (x, y, z) = y
φ1,0 (x, y, z) = z (672)
Although there are six quadratic form there are only five linearly independent
quadratic forms which satisfy Laplace’s equation. These are
φ2,s (x, y, z) = yz
φ2,c (x, y, z) = xz
φ2,2s (x, y, z) = xy
φ2,2c (x, y, z) = x2 − y 2
φ2,0 (x, y, z) = 2 z 2 − x2 − y 2 (673)
φ0 (r, θ, ϕ) = 1 (674)
and
130
The linear functions, for fixed (r, θ), form a set of orthogonal functions of ϕ.
Likewise, for the set of quadratic functions
which again form a set orthogonal functions of ϕ, because the integral over ϕ
vanishes Z 2π
dϕ φ∗2,m (θ, ϕ) φ2,n (θ, ϕ) = 0 (677)
0
if n 6= m. Furthermore, even if the eigenfunctions have the same ϕ dependence,
they are orthogonal to the linear functions with weight factor sin θ as the integral
Z π
dθ φ∗2,m (θ, ϕ) sin θ φ1,m (θ, ϕ) = 0 (678)
0
x2 + y 2 + z 2 = r 2 (679)
————————————————————————————————–
Example:
Find all (7) real linearly cubic expressions that are solutions of Laplace’s
equation, and then express them in terms of spherical polar coordinates.
Solution:
131
By substitution of the form
φ = γ x y z + α1 x3 + α2 y 3 + α3 z 3
+ β1 x y 2 + β 2 x z 2 + β3 y x 2 β4 y z 2 + β5 z x 2 + β6 z y 2
(680)
φ3,2s = xyz
φ3,2c = z ( x2 − y 2 )
φ3,0 = 2 z 3 − 3 z ( x2 + y 2 )
φ3,3s = x3 − 3 x y 2
φ3,3c = y 3 − 3 y x2
φ3,1s = y 3 + y x2 − 4 y z 2
φ3,1c = x3 + x y 2 − 4 x z 2 (681)
————————————————————————————————–
132
8 Bessel Functions
Bessel’s Differential Equation is
t → − t−1 (686)
which implies that yn (x) and y−n (x) are related via
133
8.0.2 Series Expansion
A series expansion ( in x ) for the functions yn (x) can be found by writing
xt x
g(x, t) = exp exp −
2 2t
∞
X (xt) X r ∞ s −s
s x t
= ( − 1 )
r=0
2r r! s=0 2s s!
∞
X xr+s tr−s
= ( − 1 )s
r,s=0
2r+s r! s!
(690)
This series expansion can be compared with the generating function expansion
∞
X
g(x, t) = yn (x) tn (691)
n=−∞
r = n + s (692)
The function yn (x), for positive n, is given by the coefficient of tn and is iden-
tified as a polynomial in x where the various powers correspond to the terms in
the sum over all values of s. The function is found as
∞
X xn+2s
yn (x) = ( −1 )s
s=0
2n+2s s! (n + s)!
∞
xn X x2s
= n
( − 1 )s 2s (693)
2 s=0 2 s! (s + n)!
which is a series expansion in x which vanishes at the origin for n > 0. For neg-
ative n, the properties of the generating function yield yn (x) = ( − 1 )n y−n (x)
which also vanishes at the origin.
134
On using the generating function expansion in the left-hand-side
n=∞ n=∞
x X X
( 1 + t−2 ) yn (x) tn = yn (x) n tn−1 (695)
2 n=−∞ n=−∞
and identifying the powers of tn−1 one finds the recursion relation
2n
yn−1 (x) + yn+1 (x) = yn (x) (696)
x
This recursion relation can be used to determine yn+1 (x) in terms of yn (x) and
yn−1 (x). However, since the errors increase with increasing n it is more efficient
to use the equation for yn (x) and yn+1 (x) to determine yn−1 (x) with lower val-
ues of n.
which leads to
∂yn (x) 1
= yn−1 (x) − yn+1 (x) (698)
∂x 2
n ∂yn (x)
yn (x) + = yn−1 (x) (699)
x ∂x
or
n ∂yn (x)
yn (x) − = yn+1 (x) (700)
x ∂x
respectively, if one eliminates either yn+1 (x) or yn−1 (x).
Compact alternate forms of the above two recursion relations can be found.
For example, starting with
n ∂yn (x)
yn (x) + = yn−1 (x) (701)
x ∂x
one can multiply by xn and find that
∂
x yn (x) = xn yn−1 (x)
n
(702)
∂x
135
and likewise, starting from the other one of the above pair of relations, one can
show that
∂ −n
x yn (x) = − x−n yn+1 (x) (703)
∂x
136
8.0.5 Integral Representation
The series for the Bessel functions can be expressed in terms of an integral. This
can be shown by starting with the generating function expansion
∞
X
g(x, t) = J0 (x) + ( Jn (x) tn + J−n (x) t−n )
n=1
X∞
= J0 (x) + Jn (x) ( tn + ( − 1 )n t−n )
n=1
(710)
where we have separated out the even and odd terms of the series. When
expressed in terms of θ, the generating function reduces to
g(x, exp[ i θ ]) = exp i x sin θ (712)
On splitting the complex form of the generating function expansion into equa-
tions for the real and imaginary parts one finds the two equations
∞
X
cos ( x sin θ ) = J0 (x) + 2 J2n (x) cos 2 n θ
n=1
∞
X
sin ( x sin θ ) = 2 J2n+1 (x) sin ( 2 n + 1 ) θ (713)
n=0
137
Thus, on multiplying the real and imaginary part of the generating function by
either sin nθ or cos nθ and integrating, we have the four equations
Z 2π
1
J2n (x) = dθ cos( x sin θ ) cos 2nθ
π 0
Z 2π
1
J2n+1 (x) = dθ sin( x sin θ ) sin (2n + 1) θ
π 0
Z 2π
1
0 = dθ sin( x sin θ ) cos 2nθ
π 0
Z 2π
1
0 = dθ cos( x sin θ ) sin (2n + 1) θ (715)
π 0
depending on whether n is even or odd. Combining these equations in pairs, for
odd n and even n, one has
Z 2π
1
Jn (x) = dθ cos( x sin θ ) cos nθ + sin( x sin θ ) sin nθ
π 0
Z 2π
1
Jn (x) = dθ cos ( x sin θ − n θ ) (716)
π 0
for non zero n. The above equations are integral representations of the Bessel
functions.
so
∞
X ∞
X ∞
X
Jn (x + y) tn = Jm (x) tm Jl (y) tl (718)
n=−∞ m=−∞ l=−∞
or
∞
X ∞
X ∞
X
n
Jn (x + y) t = Jm (x) Jn−m (y) tn (719)
n=−∞ n=−∞ m=−∞
Hence, on equating the coefficients of tn , one has the Bessel function addition
theorem
∞
X
Jn (x + y) = Jm (x) Jn−m (y) (720)
m=−∞
138
The Bessel functions often occur in problems that involve circular planar
symmetry, like a vibrating drum head. In this case, one describes the location
of a point on the circular surface of the drum by planar polar coordinates (r, ϕ).
Another application occurs in the theory of diffraction.
Example:
This electric field is the sum of the amplitudes of the light originating from
the points (r, ϕ) inside the circular aperture, weighted by the phase from their
differences in optical path length r cos ϕ sin α. The integration over ϕ can be
performed using the integral representation of the Bessel function
Z a
2π
E ∼ dr r 2 π J0 r sin α (722)
0 λ
The integration over r can be performed by using the recursion relations, leading
to
λa 2πa
E ∼ J1 sin α (723)
sin α λ
The intensity of the light, I, arriving at the distant screen is proportional to
| E |2 , so
2
λa 2πa
I ∼ J12 sin α (724)
sin α λ
The angle α lies between 0 and π2 so the factor sin α vanishes at 0. Thus,
the denominator vanishes for light transmitted in the normal direction, but
fortunately so does the Bessel function J1 , as can be seen from the Frobenius
series. Therefore, on using l’Hopital’s rule, the intensity is non-vanishing for
a point directly in front of the aperture, as expected. However, the intensity
falls to zero in specific directions α determined by the vanishing of the Bessel
function J1
2πa
J1 sin α = 0 (725)
λ
139
For a sufficiently large aperture, a λ, the intensity falls to zero at the angles.
The first zero is found at the angle α determined by
2π a
sin α = 3.8137 (726)
λ
For a macroscopic opening a = 0.5 × 10−2 m and λ ∼ 10−7 m one has
Thus, the first circle of zero intensity makes has an extremely small angular
spread. The outer circles are determined by the higher order zeroes of the
Bessel function, and for large n are approximated by
2πa
sin α ∼ n π (728)
λ
which is similar to the Bragg condition. The outer dark rings are hard to ob-
serve as the intensity between the rings is very low.
Example:
h̄2
− ∇2 Ψ = E Ψ (729)
2m
where the allowed value of the energy is denoted by E. The wave function
Ψ(r, θ, z) expressed in terms of cylindrical coordinates satisfies the boundary
conditions
Ψ(a, θ, z) = 0
Ψ(r, θ, 0) = Ψ(r, θ, l) = 0
(730)
Find an expression for the wave functions and the allowed energies.
1 ∂2Ψ ∂2Ψ
1 ∂ ∂Ψ 2mE
r + 2 + = − Ψ (731)
r ∂r ∂r r ∂θ 2 ∂z 2
h̄2
On substituting the ansatz for the eigenfunction
140
into the equation and diving by Ψ one finds
1 ∂2Θ 1 ∂2Z
1 ∂ ∂R 2mE
r + 2 2
= − 2 − (733)
r R ∂r ∂r r Θ ∂θ h̄ Z ∂z 2
Since the z dependence is entirely contained in the right-hand-side, and the
left-hand-side is constant as the independent variable z is changed, we must
have
∂2Z
= − κ2 Z (734)
∂z 2
where − κ2 is an arbitrary constant. This equation has a general solution
Z(z) = A cos κz + B sin κz (735)
Since Ψ must vanish on the two surfaces z = 0 and z = l one determines the
constants as
A = 0
B sin κl = 0 (736)
Hence, we have the allowed solutions
nz π z
Z(z) = B sin (737)
l
where nz is an arbitrary positive integer, and κ = nzl π . On substituting Z(z)
back into the differential equation, and multiplying by r2 one has
1 ∂2Θ n2z π 2
r ∂ ∂R 2 2mE
r + = −r − (738)
R ∂r ∂r Θ ∂θ2 h̄2 l2
From the above equation one can recognize that the θ dependent term must
also be a constant, say − m2 , and therefore satisfies the differential equation
∂ 2 Θ(θ)
= − m2 Θ (739)
∂θ2
which has solutions of the form
Θ(θ) = exp imθ (740)
Since the wave function has a unique value at any point, one must have
Θ(θ + 2 π) = Θ(θ) (741)
which implies that m must be a positive or negative integer.
Finally, we find that the radial function satisfies the equation
n2z π 2
∂ ∂R 2mE
r r − m2 R = − r 2 − R (742)
∂r ∂r h̄2 l2
141
We shall put this equation into a dimensionless form by introducing the variable
x = k r, where k is to be determined. The differential equation of R(x/k)
becomes
x2 n2z π 2
∂ ∂R 2mE
x x − m2 R = − 2 − R (743)
∂x ∂x k h̄2 l2
n2z π 2
2mE
k2 = − (744)
h̄2 l2
Jm (ka) = 0 (747)
Jm (zm,n ) = 0 (749)
Example:
142
Consider the electromagnetic field inside a cylindrical metal cavity of radius
a and length l. Maxwell’s equations are given by
→
−
→
− →
− 1 ∂B
∇ ∧ E = − (752)
c ∂t
and →
−
→
− →
− 1 ∂E
∇ ∧ B = + (753)
c ∂t
Maxwell’s equations can be combined to yield the wave equation governing the
electric field →
−
1 ∂2 E
→
− →
− →
−
∇ ∧ ∇ ∧ E = − 2 (754)
c ∂t2
and since Gauss’s law holds
→
− → −
∇ . E = 0 (755)
one has →
−
→
− 1 ∂2 E
− ∇2 E = − 2 (756)
c ∂t2
On representing the time dependence of the electric field as a wave of frequency
ω via
→
− →− →
− →−
E ( r , t) = E ( r ) Real exp[ i ω t ] (757)
then one finds that the spatial dependence of the field satisfies
→
− − ω2 →
− −
− ∇2 E (→
r ) = 2 E (→
r) (758)
c
In cylindrical coordinates the z component of the electric field satisfies the Lapla-
cian eigenvalue equation
1 ∂ 2 Ez ∂ 2 Ez ω2
1 ∂ ∂Ez
r + 2 2
+ 2
= − 2 Ez (759)
r ∂r ∂r r ∂θ ∂z c
On using the method of separation of variables we assume that the z-component
of the electric field can be written as the product
On substituting this ansatz into the partial differential equation one finds that it
separates into three ordinary differential equations. The z-dependent function
Z(z) satisfies
∂ 2 Z(z)
= − kz2 Z(z) (761)
∂z 2
and Θ(θ) satisfies
∂ 2 Θ(θ)
= − m2 Θ(θ) (762)
∂θ2
143
Using the values of the separation constants, one finds that the Radial function
R(r) has to satisfy
m2
2
1 ∂ ∂R ω 2
r − 2 R = − − kz R (763)
r ∂r ∂r r c2
which is Bessel’s equation. Thus, the general solution of the partial differential
equation has been found to be of the form
→
−
X
Ez ( r ) = Jm (kr r) exp i m θ am,kr sin kz z + bm,kr cos kz z (764)
m,kr
Jm (kr a) = 0 (766)
Hence, the allowed values of kr are given in terms of the zeros of the m-th Bessel
function. The allowed k values are given by
zm,n
kr = (767)
a
where zmn stands for the n-th zero of the m-th Bessel function, i.e. Jm (zm,n ) =
0. The electromagnetic field must also satisfy boundary conditions at the two
ends of the cylinder. The boundary conditions at the ends of the cylinder z = 0
and z = l are satisfied by setting am,kr = 0 and kz = nzl π for integer and
zero values of nz . In this case, the tangential components of the field Er and
Eθ vanish at z = 0 and z = l. This leads to the magnetic induction field
being purely transverse. The allowed values of the frequencies are given by
r
2
zm,n n2 π 2
ω = c 2
+ z2 (768)
a l
8.0.7 Orthonormality
The orthonormality relations for the Bessel functions can be proved by starting
from Bessel’s equation
∂ 2 φν (x) ∂φν (x)
x2 + x + ( x2 − ν 2 ) φν (x) = 0 (769)
∂x2 ∂x
144
Due to physical reasons, we shall write x = k r and use zero boundary condi-
tions at the center and edge of the circular area r = 0 and at r = a. Thus,
we demand that our solution must satisfy
∂2 ν2
1 ∂ 2
φ ν (kr) + φ ν (kr) + k − φν (kr) = 0
∂r2 r ∂r r2
ν2
1 ∂ ∂
r φν (kr) + k2 − 2 φν (kr) = 0 (771)
r ∂r ∂r r
where k 2 can be regarded as the eigenvalue. The eigenvalue, or rather the value
km , is determined by demanding that the solution of Bessel’s equation φν (km r)
satisfies the boundary condition
φν (km a) = 0 (772)
Using the boundary conditions at r = a and noting that the solutions are
finite at r = 0, one finds that the eigenfunctions corresponding to different
eigenvalues are orthogonal, as
Z a
2
( km − kn2 ) dr r φν (kn r) φν (km r) = 0 (775)
0
145
The normalization can be found by setting km = kn + , hence, on using
the boundary condition at r = 0 one has
a
0 0
φν (kn a + a) a φν (kn a) − φν (kn a) a φν (kn a + a)
0
Z a
= 2 dr r φν (kn r) φν (kn r)
0
(776)
Furthermore, since
ν
φν+1 (kn a) = φν (kn a) − φ0ν (kn a) (778)
kn a
one has a
a2
Z
dr r φ2ν (kn r) = φ2ν+1 (kn a) (779)
0 2
146
Example:
1 ∂2 u
∇2 u − = 0 (783)
c2 ∂t2
where c is the velocity of the sound waves on the drumhead. Since the drumhead
is fixed tightly to the circular rim, the boundary conditions is that there is no
vertical displacement so
u(a, θ, t) = 0 (784)
In circular coordinates one has
1 ∂2u 1 ∂2u
1 ∂ ∂u
r + 2 = (785)
r ∂r ∂r r ∂θ2 c2 ∂t2
This partial differential equation can be solved by assuming that the solution
can be expressed as a Fourier series in the time variable. For any fixed time
t, the solution can be considered to be a function of (r, θ). The θ dependence
can be expressed as a series in Legendre polynomials, where the coefficients are
functions of r. The undetermined coefficients which are functions of r can then
be expanded as a Bessel series. The general term in this multiple expansion is
given by the ansatz
u(r, θ, t) = R(r) Θ(θ) exp i ω t (786)
On substituting this ansatz into the equation, cancelling the common time-
dependent terms, the equation reduces to
R(r) ∂ 2 Θ(θ) ω2
1 ∂ ∂R(r)
Θ(θ) r + 2 2
= − 2 R(r) Θ(θ) (787)
r ∂r ∂r r ∂θ c
1 ∂ 2 Θ(θ) ω2
r ∂ ∂R(r)
r + 2
= − 2 r2 (788)
R(r) ∂r ∂r Θ(θ) ∂θ c
This equation can be written such that one side depends on r alone and another
side which depends on θ alone
ω2 1 ∂ 2 Θ(θ)
r ∂ ∂R(r)
r + 2 r2 = − (789)
R(r) ∂r ∂r c Θ(θ) ∂θ2
147
Therefore, the solution can be obtained by the method of separation of variables.
It can be seen that, since the left-hand-side is independent of θ and the right-
hand-side is independent of r, they must be equal to a constant, ( say − m2 ).
Thus, we have
∂ 2 Θ(θ)
= − m2 Θ(θ) (790)
∂θ2
which has solutions of the form
Since, one can identify the point (r, θ) as (r, θ + 2π), then one requires that
which requires that m is an integer. Furthermore, the other side of the equation
is equal to the same constant
ω2
∂ ∂R(r)
r r + 2 r2 R(r) = m2 R(r) (793)
∂r ∂r c
148
terms of a Bessel series.
Example:
∇2 φ = 0 (798)
∂2φ
1 ∂ ∂φ
r + = 0 (799)
r ∂r ∂r ∂z 2
Using the method of separation of variables one seeks solutions of the homoge-
neous equation in the form
On substitution of the ansatz for the solution, one finds that R and Z must
satisfy
1 ∂2Z
1 ∂ ∂R
r = − (801)
r R ∂r ∂r Z ∂z 2
Since the left and right-hand-side are functions of independent variables r and
z, they must be equal to a constant ( say − k 2 ). Then the differential equation
is equivalent to the pair of ordinary differential equations
∂2Z
= k2 Z (802)
∂z 2
1 ∂ ∂R
r + k2 R = 0 (803)
r ∂r ∂r
The equation for Z(z) has the solution
Z(z) = A exp − k z + B exp + k z (804)
but since the potential must vanish as z → ∞ one has B = 0. Likewise, the
radial part of the potential is given by
149
as the potential remains finite at the center of the disk, r = 0. Thus, we
find the general solution of the partial differential equation is of the form of the
linear superposition
Z ∞
φ(r, z) = dk A(k) J0 (kr) exp − k z (806)
0
On interchanging the order of integration and using the continuum form of the
orthonormality relation
Z ∞
1
dr r J0 (kr) J0 (k 0 r) = δ( k − k 0 ) (809)
0 k
one determines the coefficients A(k 0 ), as the integral
Z R
1 0
A(k ) = φ 0 dr r J0 (k 0 r) (810)
k0 0
A(k 0 ) = φ0 R J1 (k 0 R) (812)
150
The potential above the center of the disk ( r = 0 ) is given by the integral
Z ∞
φ(0, z) = φ0 R dk J1 (kR) exp − k z
0
z
= φ0 1 − √ (814)
z 2 + R2
This can be seen by noting the special case of the Bessel function recurrence
relations yields
∂J0 (x)
J1 (x) = − (815)
∂x
This relation can be used in the integration of the expression for the potential.
On integrating by parts with respect to k, one finds the equation
k=∞ Z ∞
φ(0, z) = φ0 − J0 (kR) exp − k z − z dk J0 (kR) exp − k z
k=0 0
Z ∞
φ(0, z) = φ0 1 − z dk J0 (kR) exp − k z
0
(816)
The last integral can be evaluated by using the integral representation of the
Bessel function.
Z ∞ Z ∞ Z π
2 2
dk J0 (kR) exp − k z = dk dϕ cos( kR sin ϕ ) exp − k z
0 0 π 0
Z π Z ∞
2 2
= dϕ dk cos( kR sin ϕ ) exp − k z
π 0 0
Z π
2 2 z
= dϕ 2
π 0 z + R2 sin2 ϕ
1
= √ (817)
z + R2
2
Example:
A cylinder, of length l and radius a, has the top plate held at a potential
φ(l, r) and the bottom plate is held at a potential φ(0, r). The potential between
the plates is given by
∇2 φ = 0 (819)
151
which reduces to
∂2φ
1 ∂ ∂φ
r + = 0 (820)
r ∂r ∂r ∂z 2
From separation of variables one finds the solutions of the homogeneous equation
in the form
φ(z, r) = R(r) Z(z) (821)
On substituting this form into the partial differential equation, one finds that
1 ∂2Z
1 ∂ ∂R
r = − (822)
r R ∂r ∂r Z ∂z 2
∂2Z
= k2 Z (823)
∂z 2
and
1 ∂ ∂R
r + k2 R = 0 (824)
r ∂r ∂r
The equation for Z(z) has a general solution of the form
while the solution of the radial equation which is regular at the center of the
cylinder, r = 0, is given by
The Bessel function of zero-th order occurs since the potential is invariant under
arbitrary rotations about the cylinder’s axis. The boundary condition at the
surface of the cylinder is equivalent to requiring
J0 (ka) = 0 (827)
which yields the allowed values of k, in terms of the zeroes of the zeroth order
Bessel function, kn = zan . The solution can be expressed as
X
φ(z, r) = Ak cosh[ k z ] + Bk sinh[ k z ] J0 (kr) (828)
k
152
The coefficients Ak and Bk can be obtained, by first finding the coefficients
in the Bessel expansion of φ(0, r) and φ(l, r). At the top plate, ( z = l ), we
expand the boundary value as a Bessel series
X
φ(l, r) = φ̃k (l) J0 (kr) (829)
k
where the allowed values of k are given by the zeroes of the Bessel function.
Homework: 11.2.3
Homework: 11.2.9
153
for ν > 0.
Example:
ω2 →
→
− →
− →
− −
∇ ∧ ∇ ∧ E = 2 E (837)
c
→
− → −
and Gauss’s law reduces to ∇ . E = 0 in vacuum with no charges present.
Thus, the z-component of the electric field is governed by
2
ω
∇2 Ez + Ez = 0 (839)
c
On using separation of variables one assumes that the E.M. waves have the form
X
Ez (r, θ, z, t) = Am (k) Rm (r) Θ(θ) Z(z) exp − i ω t (841)
m,k
On solving for the (θ, z) dependence, and using the condition that Ez is 2 π
periodic in θ one has travelling waves of the form
X
Ez (r, θ, z, t) = Cm (k) Rm (r) exp − i m θ exp i ( k z − ω t ) (842)
m,k
154
where the radial function satisfies
m2
2
1 ∂ ∂Rm ω 2
r − 2 Rm + − k Rm = 0 (843)
r ∂r ∂r r c2
The radial equation has a solution for b > r > a given by
and
Rm (b) = Am Jm (αb) + Bm Nm (αb) = 0 (846)
The allowed values of α are determined from the transcendental equation
ω2
= α2 + k 2 (848)
c2
Since k 2 must be positive if the wave is to be transmitted, the minimum fre-
quency is given by ωmin = c α.
The Hankel functions are defined as solutions of Bessel’s equation which are
the superpositions
Hν+ (x) = Jν (x) + i Nν (x) (849)
and
Hν− (x) = Jν (x) − i Nν (x) (850)
The Hankel functions have the asymptotic forms of
±
lim Hν (x) → exp ± i x (851)
x→∞
and since they diverge at the origin has a natural interpretation in terms of
outgoing cylindrical waves or incoming cylindrical waves.
Example:
155
Consider the diffraction of an electromagnetic plane wave E0 with frequency
ω, by a metal wire of radius a. The electric vector is parallel to the cylindrical
axis, and the direction of propagation is perpendicular to the cylindrical axis.
The incident field is
êz E0 exp i ( k r cos θ − ω t ) (852)
Furthermore, as
∞
X
exp i k a cos θ = J0 (ka) + 2 ( i )m Jm (ka) cos m θ (859)
m=1
156
one can determine the coefficients Cm uniquely. In particular
C0 H0+ (ka) = − E0 J0 (ka) (860)
and
Cm H0+ (ka) = − 2 ( i )m E0 Jm (ka) (861)
Thus, the scattered wave is given by
∞
J0 (ka) +
X
m Jm (ka) +
Esc = − E0 H (kr) + 2 (i) Hm (kr) cos m θ
H0+ (ka) 0 m=1
+
Hm (ka)
(862)
157
8.2.1 Recursion Relations
It can be seen that the zeroth order spherical Bessel function j0 (x) is given by
sin x
j0 (x) = (869)
x
The recurrence relations for the spherical Bessel functions are given by
2n + 1
jn−1 (x) + jn+1 (x) = jn (x)
x
n jn−1 (x) − ( n + 1 ) jn+1 (x) = ( 2 n + 1 ) jn0 (x)
(870)
The recurrence relations can be used to evaluate the low order spherical Bessel
functions
sin x cos x
j1 (x) = 2
−
x x
3 − x2 3
j2 (x) = sin x − 2 cos x (871)
x3 x
The general form of the spherical Bessel functions can be obtained from the
recursion relations by combining them into the form
∂ n+1
x jn (x) = xn+1 jn−1 (x)
∂x
∂ −n
x jn (x) = − x−n jn+1 (x) (872)
∂x
for the spherical Bessel function of order n. The asymptotic, large x, behavior
is given by
1 π
jn (x) = sin x − n (874)
x 2
This follows as the asymptotic large x behavior is governed by the terms of
lowest power in x1 . The derivatives present in the Rayleigh formula produce a
faster decay and hence negligible contributions when they act on the powers of
x. Thus the leading term of the large x behavior is determined by the term
where the n derivatives in the Rayleigh formula for j0 (x) all act on the factor
sin x.
158
8.2.2 Orthogonality Relations
The orthogonality relations for the spherical Bessel functions can be expressed
as Z a 2
a3
2 0
dr jn (kr) r jn (k r) = jn+1 (ka) δk,k0 (875)
0 3
where k and k 0 are solutions of the equation
jn (ka) = 0 (876)
Example:
h̄2
− ∇2 Ψ = E Ψ (878)
2m
has an energy eigenfunction Ψ that can be expressed as
The wave function Ψ(r, θ, ϕ) with angular momentum l has a radial wave func-
tion Rl (kr) which satisfies the differential equation
∂R(r) 2 ∂R(r) 2 l(l + 1)
+ = k − R(r) = 0 (880)
∂r2 r ∂r r2
h̄2 k2
where the energy eigenvalue is given by E = 2 m . The radial function has
the general solution
159
The solution which is regular at r = 0 corresponds to the case where Bl = 0.
Example:
Example:
The wave function Ψ, which expresses the state of the particle, satisfies the
energy eigenvalue equation
h̄2
− ∇2 Ψ + V (r) Ψ = E Ψ (886)
2m
The effect of the potential is to demand that the wave function Ψ(r, θ, ϕ) satisfies
the boundary condition
Ψ(a, θ, ϕ) = 0 (887)
160
at r = a. This corresponds to the condition that the particle does not enter
the nucleus. Due to the spherical symmetry of the potential, we have a solution
of the form
Ψl,m (r, θ, ϕ) = Rl (r) Ylm (θ, ϕ) (888)
The radial function Rl (r) is given by
where Al and Bl are arbitrary constants, and the energy eigenvalue is given by
2
k2
E = h̄2 m . The boundary condition at the surface of the spherical nucleus
determines the ratio of the constants
Homework: 11.7.22
Cn = in ( 2 n + 1 ) Pn (cos θ) (894)
161
9 Legendre Polynomials
9.0.4 Generating Function Expansion
The generating function g(x, t) yields an expansion for the Legendre polynomials
Pn (x). The generating function expansion is simply
− 12
2
g(x, t) = 1 − 2xt + t
∞
X
= Pn (x) tn (895)
n=0
This has direct application in electrostatics, where the potential φ(→−r ) due to
→
−
a charge distribution ρ( r ) is given by the solution of Poisson’s equation. The
solution can be expressed in terms of the Green’s function and the charge den-
sity, and the Green’s function can be expanded using the generating function
expansion
→
−
3→ρ( r0 )
−0
Z
φ(→
−
r) = d r →
−
|→
−r − r0 |
Z ∞ →
−
ρ( r0 )
Z π Z 2π
= dr0 r02 dθ sin θ dϕ √
0 0 0 r2 − 2 r r0 cos θ + r02
∞ π Z 2π ∞ n
→
− X 1
Z Z
r<
= dr0 r02 dθ sin θ dϕ ρ( r0 ) Pn (cos θ)
0 0 0 n=0
r> r>
(896)
→
−
where θ is the angle between → −r and r0 , and r< and r> are, respectively, the
0
smaller and large values of (r, r ).
Problem:
Assume that the Legendre polynomials Pl (cos θ) are orthogonal, with weight
factor sin θ.
162
9.0.5 Series Expansion
The generating function can be used to find the series expansion of the Legendre
polynomials. First we expand in powers of 2 x t − t2 , via
1
g(x, t) = √
1 − 2 x t + t2
∞
X (2s)!
= 2s (s!)2
( 2 x t − t2 )s (898)
s=0
2
Hence, the Legendre polynomials have the highest power of xn , and terms which
decrease in powers of x2 . Therefore, for odd n the series only contains odd terms
in x, whereas for even n the series is even in x. The series in decreasing pow-
ers of x−2 terminates when n = 2 r for even n and when n = 2 r + 1 for odd n.
163
∞
X
= Pn (x) tn
n=0
(902)
one obtains
∂ x − t
g(x, t) = 3
∂t ( 1 − 2 x t + t2 ) 2
∞
X
= Pn (x) n tn−1 (903)
n=0
164
This is not very useful.
to give
∂ ∂ ∂
(2n+1) x Pn (x) + Pn (x) = (n+1)Pn+1 (x) + n Pn−1 (x)
∂x ∂x ∂x
(910)
This relation can be used to eliminate the derivative of Pn (x) in the previous
equation, by multiplying by 2 and subtracting ( 2 n + 1 ) times the previous
equation, leading to
∂ ∂
( 2 n + 1 ) Pn (x) = Pn+1 (x) − Pn−1 (x) (911)
∂x ∂x
The above two equations are to be combined. One is first put in the form
n → n − 1 so it becomes
∂ ∂
x Pn−1 (x) + n Pn−1 (x) = Pn (x) (914)
∂x ∂x
and this is combined with x times the relation
∂ ∂
x Pn (x) − n Pn (x) = Pn−1 (x) (915)
∂x ∂x
∂
so as to eliminate ∂x Pn−1 (x). On performing these manipulations one obtains
the recursion relation
∂
( 1 − x2 ) Pn (x) = n Pn−1 (x) − n x Pn (x) (916)
∂x
165
Finally, if one uses the above recursion relation together with
————————————————————————————————–
Example:
Hence,
∂ Pn (cos θ) Pn (cos θ) 2 ∂ Pn (cos θ)
= − ( n + 1 ) cos θ + sin θ
∂z r(n+1) r(n+2) ∂ cos θ r(n+2)
(922)
but one has the recursion relation
∂Pn (cos θ)
sin2 θ = ( n + 1 ) cos θ Pn (cos θ) − ( n + 1 ) P(n+1) (cos θ) (923)
∂ cos θ
Thus, one has proved the identity
∂ Pn (cos θ) Pn+1 (cos θ)
(n+1)
= −(n + 1) (924)
∂z r r(n+2)
166
————————————————————————————————–
On taking the derivative with respect to z, keeping (x, y) fixed, one has
∂ 1 1 − z
p = 3
∂z 1 + x + y2 + z2 − 2 z
2 (1 + x2+ y2 + z2 − 2 z ) 2
( 1 − r cos θ )
= 3 (926)
( 1 + r2 − 2 r cos θ ) 2
Hence, we have
X∞
∂ Pn (cos θ) ( 1 − r cos θ )
(n+1)
= 3 (927)
∂z n=0
r ( 1 + r2 − 2 r cos θ ) 2
which can be written as
X∞
∂ Pn (cos θ) ( 1 − r cos θ )
( 1 + r2 − 2 r cos θ ) =
r(n+1)
p
∂z n=0 ( 1 + r2 − 2 r cos θ )
∞
X Pn (cos θ)
= ( 1 − r cos θ )
n=0
r(n+1)
(928)
On substituting the identity that is being verified, we find
∞ ∞ ∞
X P(n+1) (cos θ) X cos θ Pn (cos θ) X Pn (cos θ)
( 1 + r2 − 2 r cos θ ) ( n + 1 ) = −
n=0
r(n+2) n=0
rn n=0
r(n+1)
(929)
or on changing the index of summation
∞ ∞ ∞
X P(n−1) (cos θ) X P(n+1) (cos θ) X cos θ Pn (cos θ)
n n
+ ( n + 1 ) n
= (2n + 1)
n=0
r n=0
r n=0
rn
(930)
which is satisfied identically, because of the recursion relation
( 2 n + 1 ) cos θ Pn (cos θ) = n P(n−1) (cos θ) + ( n + 1 ) P(n+1) (cos θ)
(931)
167
Thus, we have again verified the identity in question.
∂2 ∂ ∂ ∂
( 1 − x2 ) Pn (x) − 2 x Pn (x) = n Pn−1 (x) − n x Pn (x) − n Pn (x)
∂x2 ∂x ∂x ∂x
(933)
and then using
∂ ∂
x Pn (x) − n Pn (x) = Pn−1 (x) (934)
∂x ∂x
to eliminate the derivative of Pn−1 (x) one obtains Legendre’s equation
∂2 ∂
( 1 − x2 ) Pn (x) − 2 x Pn (x) + n ( n + 1 ) Pn (x) = 0 (935)
∂x2 ∂x
Legendre’s differential equation has singular points at x = ± 1, where the
solution Pn (x) may become infinite. The value of n must be an integer if the
solution is to remain finite. If n is integer the Frobenius series expansion termi-
nates and the solution becomes a polynomial.
α = 0 (937)
(938)
168
which terminates after n terms when n is an integer.
9.0.8 Orthogonality
Legendre’s differential equation in the variable x can be written as
∂ ∂
( 1 − x2 ) Pn (x) + n ( n + 1 ) Pn (x) = 0 (942)
∂x ∂x
which is a Stürm-Liouville equation, with eigenvalue n ( n + 1 ) and weighting
factor unity. The boundary conditions are imposed at x = ± 1, which are
regular singular points as ( 1 − x2 ) = 0 at the ends of the interval.
169
On integrating by parts, and noting that the factor ( 1 − x2 ) vanishes at both
the boundaries, then as long as Pn (±1) is not infinite one has
Z +1
n(n + 1) − m(m + 1) dx Pm (x) Pn (x) = 0 (944)
−1
Thus, the Legendre functions are orthogonal as long as the eigenvalues are not
equal n ( n + 1 ) 6= m ( m + 1 ).
The Legendre functions are defined such that Pn (1) = 1. The normalization
integral can be evaluated from the square of the generating function expansion
n=∞
X 2
( 1 − 2 x t + t2 )−1 = Pn (x) tn (945)
n=0
where we have used the orthogonality of the Legendre functions. The integral
can be evaluated as
n=∞ Z +1
1 1 − 2 t + t2 X
2n
− ln = t dx Pn2 (x) (947)
2t 1 + 2 t + t2 n=0 −1
170
9.0.9 Legendre Expansions
Due to the orthogonality conditions and the completeness of the eigenfunctions
of a Stürm-Liouville equation, any function can be expanded on the interval
(−1, +1) as a Legendre series
n=∞
X
f (x) = Cn Pn (x) (951)
n=0
Hence, we may expand the delta function, on the interval (−1, +1) as
n=∞
X 2n + 1
δ( x − t ) = Pn (x) Pn (t) (954)
n=0
2
————————————————————————————————–
Example:
Find the electrostatic potential for a point charge with charge q, located at a
distance a from the center, inside a uniform conducting spherical shell or radius
R.
− ∇2 φ = 4 π ρ (955)
where ρ is only non-zero at the position of the point charge. Elsewhere, Poisson’s
equation simplifies to
∇2 φ = 0 (956)
171
We shall assume that the polar axis, ( θ = 0 ) , runs through the center and
the charge q. Then the potential is invariant under changes of the azimuthal
angle ϕ, and so φ is independent of ϕ. Laplace’s equation takes the form
1 ∂ 2 ∂φ 1 ∂ ∂φ
r + 2 sin θ = 0 (957)
r2 ∂r ∂r r sin θ ∂θ ∂θ
where Al and Bl are arbitrary constants that are to be determined by the bound-
ary conditions.
Hence, for l 6= 0 one finds the relation between the expansion coefficients
Al = − Bl R−(2l+1) (960)
for r > a. The coefficients Bl can be found from the boundary condition at
the point (r = a, θ = 0). Near this point, the potential should be dominated by
the singular behavior of the point charge
∞
q q X al
lim φ(r, θ) ∼ = (963)
r →a (r − a) r rl
l=0
B0 = q
Bl = q al (964)
172
Thus, we have we have identified the contribution from the point charge q as
∞
q q X al
φq (r, θ) = √ = Pl (cos θ) (965)
r2 + a2 − 2 r a cos θ r rl
l=0
The induced contribution to the potential from the charge on the conducting
surface is found, from the principle of linear superposition, as
∞ l
q X ar
φind (r, θ) = − Pl (cos θ)
R R2
l=0
q 1
= − s
R 2
r a r a
1 − 2 R2 cos θ + R2
qR 1
= − s (966)
a 2
R2 R2
r2 + a − 2r a cos θ
————————————————————————————————–
Example:
∇2 φ = 0 (967)
The electric field is chosen to be oriented along the polar axis, so the problem
has azimuthal symmetry and the potential is independent of ϕ. We shall use the
method of separation of variables and assume that the potential can be found
in the form of a series expansion
X
φ(r, θ) = Cl Rl (r) Θl (θ) (968)
l
173
On writing the separation constant as l ( l + 1 ) one has the radial function
Rl (r) satisfying the equation
∂ ∂R
r2 = l(l + 1)R (970)
∂r ∂r
which has the solution
Rl (r) = Al rl + Bl r−(l+1) (971)
The angular dependence is given by
Θl (θ) = Pl (cos θ) (972)
where l is an integer, for Θl (θ) to be non-singular along the poles.
174
which leads to the potential having the form
a3
B0
φ(r, θ) = A0 + − E0 r P1 (cos θ) 1 − 3 (978)
r r
————————————————————————————————–
Example:
∇2 φ = − 4 π ρ(→
−
r) (979)
For points not in the plane z = 0, the charge density is zero, and so one
has Laplace’s equation
∇2 φ = 0 (980)
Since, the geometry has an axial symmetry, the potential φ is only a function
of (r, θ). In spherical polar coordinates, one has
∂2φ
1 ∂ 2 ∂φ 1 ∂ ∂φ 1
r + sin θ + = 0 (981)
r2 ∂r ∂r r2 sin θ ∂θ ∂θ r2 sin2 θ ∂ϕ2
π
if θ = 2. Due to the axial symmetry, this reduces to
1 ∂ 2 ∂φ 1 ∂ ∂φ
r + 2 sin θ = 0 (982)
r2 ∂r ∂r r sin θ ∂θ ∂θ
Then since one side of the equation is a function of r alone, and the other side
is a function of θ alone, both sides of the equation must be equal to a constant
175
C
∂ 2 ∂R
r = C R(r)
∂r ∂r
∂ ∂Θ
sin θ = − C sin θ Θ(θ)
∂θ ∂θ
(985)
This series is unique and the coefficients An can be determined from the appro-
priate boundary condition.
176
Hence, we obtain the potential φ(r, θ) as an expansion
∞ 2n
q X (2n)! a
φ(r, θ) = ( − 1 )n 2n Pn (cos θ) (994)
r n=0 2 (n!)2 r
————————————————————————————————–
Example:
Since the problem is symmetric under rotations around the z-axis, the po-
tential φ(r, θ) is independent of ϕ. The potential inside the sphere satisfies
Laplace’s equation
∇2 φ = 0 (995)
which in spherical polar coordinates, and using the azimuthal symmetry, be-
comes
1 ∂ 2 ∂φ 1 ∂ ∂φ
r + sin θ = 0 (996)
r2 ∂r ∂r r2 sin θ ∂θ ∂θ
The solution is found by the method of separation of variables
after which it is found that the radial function R(r) must satisfy an eigenvalue
equation
∂ 2 ∂R
r = n(n + 1)R (998)
∂r ∂r
Then, for fixed n we have the solution
Rn (r) = An rn + Bn r−(n+1) (999)
177
Thus, we can form a general solution as a series expansion
∞
X
φ(r, θ) = An rn + Bn r−(n+1) Pn (cos θ) (1002)
n=0
When this integral is evaluated, since the function φ(a, θ) is odd only the Legen-
dre polynomials with odd values of n survive, as P2n (cos θ) is an even function
of cos θ. Then
∞
X (4n + 3) (2n − 1)!!
A2n+1 a2n+1 = φ0 ( − 1 )n+1 (1005)
n=0
(2n + 2)!!
————————————————————————————————–
Example:
178
where θ is the angle between the position and the direction of the momentum.
This form of the expansion is to be expected as both the Bessel function expan-
sion and the plane wave are eigenstates of the Laplacian, with eigenvalue − k 2 .
It can be shown that Cl = il ( 2 l + 1 ).
The recurrence relation can be used to express cos θ Pn (cos θ) entirely in terms
of the Legendre polynomials via
0 m (m + 1)
jm (kr) = jm−1 (kr) − jm+1 (kr) (1012)
2m + 1 2m + 1
one finds
m (m + 1)
i Cm−1 jm−1 (kr) + Cm+1 jm+1 (kr)
2m − 1 2m + 3
m (m + 1)
= Cm jm−1 (kr) − jm+1 (kr)
2m + 1 2m + 1
(1013)
179
The first coefficient can be obtained by examining the limit k → 0 since in
this case only j0 (0) survives, and has a magnitude of unity. Hence,
C0 = 1 (1015)
Cl = il ( 2 l + 1 ) (1016)
Thus, we have found the expansion of the plane wave in terms of the spherical
Bessel functions.
————————————————————————————————–
Example:
180
Z 2π Z π
= dϕ dθ sin θ | f (k, θ) |2
0 0
(1020)
m2
1 ∂ ∂ m
sin θ P (cos θ) + n(n + 1) − Pnm (cos θ) = 0
sin θ ∂θ ∂θ n sin2 θ
(1022)
or with the substitution x = cos θ one has
∂2 m2
2 m m
(1 − x ) P (x) − 2 x Pn (x) + n ( n + 1 ) − Pnm (x) = 0
∂x2 n 1 − x2
(1023)
This also has regular singular points at x ± 1. When m2 = 0, the associated
Legendre differential equation reduces to Legendre’s differential equation.
∂2 ∂
( 1 − x2 ) 2
Pn (x) − 2 x Pn (x) + n ( n + 1 ) Pn (x) = 0 (1025)
∂x ∂x
181
and then differentiating m times. On using the formula
n=m
∂m ∂ m−n ∂n
X
A(x) B(x) = C(m, n) A(x) B(x) (1026)
∂xm n=0
∂xm−n ∂xn
one obtains
∂ 2+m ∂ 1+m
( 1 − x2 ) 2+m
Pn (x) − 2 x ( m + 1 ) Pn (x)
∂x ∂x1+m
∂m
+(n − m)(n + m + 1) Pn (x) = 0
∂xm
(1027)
since ( 1 − x2 ) only has two non-vanishing orders of derivatives.
On setting
∂m m
m
Pn (x) = ( 1 − x2 )− 2 Pnm (x) (1028)
∂x
and then differentiating, one obtains
∂ m+1 m ∂ mx m
Pn (x) = ( 1 − x2 )− 2 Pnm (x) + ( 1 − x2 )− 2 Pnm (x)
∂xm+1 ∂x 1 − x 2
(1029)
On differentiating a second time, one obtains
∂ m+2 m ∂2
Pn (x) = ( 1 − x2 ) − 2 P m (x)
∂xm+2 ∂x2 n
mx m ∂
+ 2 ( 1 − x2 ) − 2 P m (x)
1 − x 2 ∂x n
m ( m + 2 ) x2 m
+ 2 2
( 1 − x2 )− 2 Pnm (x)
(1 − x )
m m
+ 2
( 1 − x2 )− 2 Pnm (x)
(1 − x )
(1030)
These expressions are substituted into the m-th order differential relation, and
m
after cancelling the common factor of ( 1 − x2 )− 2 one has
∂2 m m2
∂ m
( 1 − x2 ) P (x) − 2 x P (x) + n ( n + 1 ) − Pnm (x) = 0
∂x2 n ∂x n 1 − x2
(1031)
Expressing the variable x in terms of the variable appropriate to the polar angle
x = cos θ, the Associated Legendre equation becomes
m2
1 ∂ ∂ m
sin θ Pn (cos θ) + n ( n + 1 ) − Pnm (x) = 0 (1032)
sin θ ∂θ ∂θ sin2 θ
182
This equation reduces to Legendre’s differential equation when m = 0 and the
associated Legendre function of order m equal to zero is the Legendre polyno-
mial.
The Legendre polynomial are non zero for the range of n values such that
n ≥ m ≥ 0, but can also be defined for negative m values in which case
n ≥ m ≥ −n (1033)
and
(n − m)! m
Pn−m (x) = ( − 1 )m P (x) (1034)
(n + m)! n
since the differential equation only involves m2 and does not depend on the sign
of m.
∂2ψ
1 ∂ 2 ∂ψ 1 ∂ ∂ψ 1
r + sin θ + 2 = − k2 ψ
r2 ∂r ∂r r2 sin θ ∂θ ∂θ r2 sin θ ∂ϕ2
(1035)
On using the ansatz for separation of variables
∂2Φ
1 ∂ 2 ∂R 1 ∂ ∂Θ 1
r + sin θ + = − k2
r2 R ∂r ∂r r2 sin θ Θ ∂θ ∂θ r2 sin2 θ Φ ∂ϕ2
(1037)
On multiplying by r2 sin2 θ, one recognizes that since all the ϕ dependence is
contained in the term
1 ∂2Φ
(1038)
Φ ∂ϕ2
this must be a constant. That is, the azimuthal dependence must satisfy
∂2Φ
= − m2 Φ (1039)
∂ϕ2
183
Since, with fixed (r, θ), the values of ϕ and ϕ + 2 π represent the same physical
point, one must have
Φm (ϕ) = Φm (ϕ + 2π) (1041)
which implies that m satisfies the condition
exp i m 2 π = 1 (1042)
and therefore m must be an integer. On substituting the constant back into the
original equation one obtains
m2
1 ∂ 2 ∂R 1 ∂ ∂Θ
r + sin θ − = − k 2 (1043)
r2 R ∂r ∂r r2 sin θ Θ ∂θ ∂θ r2 sin2 θ
This equation can be rewritten as
m2
1 ∂ 2 ∂R 2 2 1 ∂ ∂Θ
r + k r = − sin θ + (1044)
R ∂r ∂r sin θ Θ ∂θ ∂θ sin2 θ
The two sides of the equation depend on the different independent variables
(r, θ), and therefore must be constants, say, l ( l + 1 ). Then it can be seen
that the θ dependence is given by the Θm l (θ) which satisfy the equation
∂Θm m2
1 ∂
sin θ l
+ l(l + 1) − Θm
l (θ) = 0 (1045)
sin θ ∂θ ∂θ sin2 θ
The above equation is related to the equation for the associated Legendre func-
tions. That is, the solution is given by
Θm m
l (θ) = Pl (cos θ) (1046)
184
m
By differentiating the recursion relation m times, and multiplying by ( 1 − x2 ) 2
one obtains
m ∞
(2m)! ( 1 − x2 ) 2 t m X
= Pnm (x) tn (1049)
2m m! ( 1 − 2 x t + t2 )m+ 21 n=0
However, the first m − 1 associated Legendre functions in the sum are zero.
Hence we have
m ∞
(2m)! ( 1 − x2 ) 2 X
= Pnm (x) tn−m
2 m! ( 1 − 2 x t + t2 )m+ 12
m
n=m
2 m ∞
(2m)! (1 − x ) 2 X
m
= Pm+s (x) ts (1050)
2m m! ( 1 − 2 x t + t2 )m+ 12 s=0
∂m ∂ m−1
(2n + 1)x Pn (x) + m ( 2 n + 1 ) Pn (x)
∂xm ∂xm−1
∂m ∂ m
= (n + 1) Pn+1 (x) + n Pn−1 (x) (1052)
∂xm ∂xm
On identifying the associated Legendre polynomial as
∂m m
Pn (x) = ( 1 − x2 )− 2 Pnm (x) (1053)
∂xm
the above relation reduces to
1
( 2 n + 1 ) x Pnm (x) + m ( 2 n + 1 ) ( 1 − x2 ) 2 Pnm−1 (x) = ( n + 1 ) Pn+1
m m
(x) + n Pn−1 (x)
(1054)
185
and differentiating m times, one obtains
∂m ∂ m+1 ∂ m+1
(2n + 1) P n (x) = P n+1 (x) − Pn−1 (x)
∂xm ∂xm+1 ∂xm+1
(1056)
or
1
m+1 m+1
( 2 n + 1 ) ( 1 − x2 ) 2 Pnm (x) = Pn+1 (x) − Pn−1 (x) (1057)
The above recurrence relation for the Associated Legendre functions reduces to
the corresponding recursion relation for the Legendre polynomials when m = 0.
————————————————————————————————–
Example:
Z π0
dθ sin θ Plm+1
0 (cos θ) ( sin θ ) Plm (cos θ)
0
Z π
dθ sin θ Plm−1
0 (cos θ) ( sin θ ) Plm (cos θ) (1059)
0
186
vectors, or three polarizations. The three components of this complex vector
are
r sin θ cos ϕ i ω t
r sin θ sin ϕ i ω t
r cos θ i ω t (1061)
by using the above two recursion relations. As the associated Legendre functions
are expected to be orthogonal, since they are eigenvalues of a Stürm-Liouville
equation, these integrals are only non-zero if l0 ± 1 = l. This condition is called
a selection rule. The vanishing of the integral means that the only transitions
which are allowed are those in which the magnitude of the electrons angular mo-
mentum is not conserved! The resolution of this apparent paradox is that the
electromagnetic field carries a unit of angular momentum. The photon, which
is the quantum particle representing an electromagnetic field has a quantized
angular momentum, or spin of magnitude 1 h̄. Likewise, a similar consideration
of the superscript, m, also yields a selection rule of m0 − m = ± 1 , or 0. The
index m corresponds to the z component of the angular momentum. The m
selection rule corresponds to the statement that the angular momentum of the
photon may be oriented in three directions, or have three polarizations. In the
non-relativistic limit, the three polarizations correspond to the three possible
values of m for which P1m (cos θ) is non-vanishing.
Problem:
Derive a recursion relation which will reduce the last expression to the sums
of integrals which only consist of the product of associated Legendre functions
with index m − 1, and the weighting factor sin θ.
187
9.1.4 Orthogonality
Associated Legendre functions with the same fixed m value are orthogonal, for
different values of n. This can easily be proved from the differential equation
m2
1 ∂ ∂
sin θ Pnm (cos θ) + n ( n + 1 ) − Pnm (cos θ) = 0
sin θ ∂θ ∂θ sin2 θ
(1063)
by multiplying by Pnm0 (cos θ) and subtracting the analogous equation with n
and n0 interchanged. Since m is fixed, the terms involving m2 sin−2 θ identically
cancel, and on multiplying by the weight factor, one is left with
m ∂ ∂ m m ∂ ∂ m
Pn0 (cos θ) sin θ P (cos θ) − Pn (cos θ) sin θ P 0 (cos θ)
∂θ ∂θ n ∂θ ∂θ n
= n0 ( n0 + 1 ) − n ( n + 1 ) Pnm0 (cos θ) sin θ Pnm (cos θ)
(1064)
since the boundary terms vanish identically, if the associated Legendre functions
and their derivatives are finite at the boundaries.
One finds that the associated Legendre functions have the normalization
integrals given by the value
Z π
2 (n + m)!
dθ Pnm0 (cos θ) sin θ Pnm (cos θ) = δn,n0 (1066)
0 2 n + 1 (n − m)!
Problem:
12.5.16
Example:
188
Consider a circular current carrying loop of radius a in the equatorial plane
(θ = π2 ), in which the current is constant and has a value I. The magnetic
→
−
induction field, B is given by the solution of
→
− →
− 4π →
−
∇ ∧ B = j (1067)
c
→
−
where j is the current density. Together with the definition of the vector
potential
→
− →
− →
−
B = ∇ ∧ A (1068)
one has
→
− →
− →
− 4π → −
∇ ∧ ∇ ∧ A = j (1069)
c
→
−
Since, the problem has azimuthal symmetry B should be independent of ϕ
and have no component in the direction êϕ . The vector potential, therefore, is
entirely in the direction êϕ
→
−
A = Aϕ êϕ (1070)
Hence, one finds that the magnetic induction is given by
êr r êθ r sin θ êϕ
→
− 1 ∂ ∂ ∂
B = 2 (1071)
r sin θ ∂r ∂θ ∂ϕ
0 0 r sin θ Aϕ
which is independent of ϕ.
êr r êθ r sin θ êϕ
∂ ∂ ∂
→
− →
− 1
∇ ∧ B = ∂r ∂θ ∂ϕ
2
r sin θ
1 ∂ ∂
r sin θ ∂θ sin θ Aϕ − ∂r r Aϕ 0
" #
1 ∂2
1 ∂ 1 ∂
= − êϕ r Aϕ + 2 sin θ Aϕ
r ∂r2 r ∂θ sin θ ∂θ
(1073)
189
which off the θ = π2 plane becomes
" #
1 ∂2
1 ∂ 1 ∂
r Aϕ + 2 sin θ Aϕ = 0 (1075)
r ∂r2 r ∂θ sin θ ∂θ
The expansion coefficient An = 0 for r > a as the vector potential must fall
to zero as r → ∞.
190
one finds the general expression for the induction field. Since the derivative of
the associated Legendre function can be expressed as
∂ ∂
P 1 (cos θ) = − sin θ P 1 (cos θ) (1085)
∂θ n ∂ cos θ n
and on using the recursion relation
1 ∂ m 1 m+1 1
( 1 − x2 ) 2 P (x) = P (x) − (n+m) (n−m+1) Pnm−1 (x) (1086)
∂x n 2 n 2
and also
2mx
Pnm+1 (x) = √ Pnm (x) + m(m − 1) − n(n + 1) Pnm−1 (x)
1 − x2
(1087)
one has
mx 1 ∂ m
√ Pnm (x) − ( 1 − x2 ) 2 Pn (x) = ( n + m ) ( n − m + 1 ) Pnm−1 (x)
1 − x 2 ∂x
(1088)
On substituting m = 1 one finds that
∞
1 X
Br (r, θ) = An n ( n + 1 ) r−n Pn (cos θ) (1089)
r2 n=0
and
∞
1 X
Bθ (r, θ) = n An r−n Pn1 (cos θ) (1090)
r2 n=0
On substituting θ = 0 one has Pm (1) = 1 and Pn1 (1) = 0, thus the field on
the z-axis takes both the forms
∞
1 X
Br (r, 0) = An n ( n + 1 ) r−n
r2 n=0
2π a2
= I 3 (1091)
c ( a2 + r2 ) 2
From this one finds that, on expanding in powers of r−1 only the terms odd
in r−n and thus only the odd n coefficients (A2n+1 ) are finite and are uniquely
given by
2πI (2n)!
A2n+1 = ( − 1 )n (1092)
c 2(2n+1) n! (n + 1)!
This completes the solution for the vector potential.
191
9.2 Spherical Harmonics
The spherical harmonics are solutions for the angular part of Laplace’s equation.
They are composed of a product of an azimuthal function Φ(φ) and a polar part
Θ(θ) and are normalized to unity.
∂2
Φ(ϕ) = − m2 Φ(ϕ) (1093)
∂ϕ2
which has solutions
1
Φm (ϕ) = √ exp imϕ (1094)
2π
The value of m is integer since the function Φm (ϕ) is periodic in ϕ with period
2 π.
192
as the spherical harmonics form a complete set. Furthermore, the expansion
coefficients can be obtained from
Z π Z 2π
m
Cl = dθ dϕ sin θ Ylm (θ, ϕ)∗ f (θ, ϕ) (1098)
0 0
where
δ( Ω − Ω0 ) = δ( ϕ − ϕ0 ) δ( cos θ − cos θ0 ) (1100)
since this is equal to unity on integrating over all solid angles dΩ = dϕ dθ sin θ
and θ is uniquely specified by the value of cos θ for π > θ > 0.
where γ is the angle between (θ1 , ϕ1 ) and (θ2 , ϕ2 ). This can be proved by
choosing the spherical polar coordinate system such that (θ2 , ϕ2 ) is directed
along the polar axis. In this case θ2 = 0, and so sin θ2 = 0, therefore,
the only non-zero associated Legendre functions and hence non-zero spherical
harmonics, Ylm (θ2 , ϕ2 ), are those corresponding to m = 0. Then as we have
Pl0 (1) = 1, the completeness relation simply takes the form
l=∞
X 1 (2 l + 1 ) 0
δ( Ω1 ) = Pl (cos θ1 ) (1102)
2π 2
l=0
193
The addition theorem states
m=l
4π X
Pl (cos γ) = Ylm (θ1 , ϕ1 )∗ Ylm (θ2 , ϕ2 ) (1104)
2l + 1
m=−l
Example:
where Z
qlm = d3 →
−
r 0 Ylm ∗ (θ0 , ϕ0 ) r0l ρ(→
−
r 0) (1108)
194
and
r Z
3
q10 = − d3 →
−
r 0 r0 cos θ0 ρ(→
−
r 0)
4π
r
3
= − pz (1112)
4π
where the dipole moment is given by
Z
→
−
p = d3 →
−
r0 →
−
r 0 ρ(→
−
r 0) (1113)
where we are introducing the notation which labels the (x, y, z) components
by (x1 , x2 , x3 ), and have introduced a tensor Qi,j win which the components
are labeled by the subscripts corresponding to (xi , xj ). The multi-pole moment
corresponding to l = 2 and m = 1 is given by
r Z
15
d3 →
−
r 0 r0 sin θ0 cos θ0 exp − i ϕ0 ρ(→ −
2
1
q2 = − r 0)
8π
r Z
15 3→−
= − d r 0 0
x − iy 0
z 0 ρ(→
−
r 0)
8π
r
1 15
= − Q1,3 − i Q2,3
3 8π
(1115)
195
in which the quadrupole moment is represented by the traceless tensor
Z
3→
−
Qi,j = d r 0 0 0
3 xi xj − r 02
ρ(→
−
r 0) (1117)
where →
−
r 0 is a position variable associated with the charge distribution.
196
10 Hermite Polynomials
The generating function expansion for the Hermite Polynomials is given by
∞
tn
X
g(x, t) = exp − t2 + 2 x t = Hn (x) (1119)
n=0
n!
and
∂
Hn (x) = 2 n Hn−1 (x) (1121)
∂x
197
This is not of the form of a Stürm-Liouville equation, unless one introduces a
weighting function
w(x) = exp − x2 (1128)
This is found by examining the ratio’s of the first two coefficients in the Stürm-
Liouville form
∂2φ ∂p(x) ∂φ
p(x) + + q(x) φ(x) = λ w(x) φ(x) (1129)
∂x2 ∂x ∂x
in which case one identifies the ratio
∂p(x)
∂x
= −2x (1130)
p(x)
or on integrating
ln p(x) = − x2 (1131)
Since the integrating factor is
2
p(x) = exp − x (1132)
one finds the above weighting factor and the differential equation has the form
∂ ∂
exp − x2 Hn (x) + 2 n exp − x2 Hn (x) = 0 (1133)
∂x ∂x
10.0.5 Orthogonality
The Hermite polynomials form an orthogonal set
Z ∞
2
dx Hn (x) exp − x Hm (x) = 0 (1134)
−∞
198
where we have used the orthogonality relation. However, one also has
Z ∞
dx g(x, s) exp − x2 g(x, t)
−∞
Z ∞
= dx exp − x2 exp − t2 + 2 x t exp − s2 + 2 x s
−∞
Z ∞
= dx exp − x2 + 2 x ( s + t ) exp − t2 − s2
−∞
√
= π exp ( s + t )2 exp − s2 − t2
√
= π exp 2 s t (1136)
Example:
199
and the second-order derivative is
2
∂2ψ ∂2
mω 2 mω 2 mω ∂
= exp − x x − (1 + 2 x )+ φ(x)
∂x2 2 h̄ h̄ h̄ ∂x ∂x2
(1143)
Inserting this last expression into the eigenvalue equation, we find that φ satisfies
the differential equation
∂2φ mω ∂φ m ( 2 E − h̄ ω )
− 2 x = − φ (1144)
∂x2 h̄ ∂x h̄2
This differential equation can be put into Hermite’s form by introducing a di-
mensionless variable r
h̄
z = x (1145)
mω
Thus, we have
∂2φ
∂φ 2E
− 2z + − 1 φ (1146)
∂z 2 ∂z h̄ ω
which has the Hermite polynomial Hn (z) as a solution when
2E
2n = − 1 (1147)
h̄ ω
Therefore, the allowed values of the energy are given by
1
E = h̄ ω ( n + ) (1148)
2
Problem:
and Z ∞
2
dx exp − x x2 Hn (x) Hm (x) (1151)
−∞
200
11 Laguerre Polynomials
The generating function expansion for the Laguerre polynomials is given by
exp − 1 x−t t X∞
g(x, t) = = Ln (x) tn (1152)
1 − t n=0
201
11.0.7 Laguerre’s Differential Equation
The Laguerre polynomials satisfy Laguerres differential equation. The differ-
ential equation for the Laguerre polynomials can be derived by combining the
recursion relations. This is done by differentiating the prior relation
∂2 ∂ ∂ ∂
x 2
Ln (x) + Ln (x) = n Ln (x) − n Ln−1 (x) (1160)
∂x ∂x ∂x ∂x
and subtracting the previous equation one has
∂2 ∂ ∂ ∂
x Ln (x) + ( 1 − x ) Ln (x) = n Ln (x) − n Ln−1 (x) − n Ln (x) + n Ln−1 (x)
∂x2 ∂x ∂x ∂x
(1161)
Finally, the right-hand-side can be identified as − n Ln (x). Thus, we have
Laguerre’s differential equation
∂2 ∂
x 2
Ln (x) + ( 1 − x ) Ln (x) + n Ln (x) = 0 (1162)
∂x ∂x
This equation is not in Stürm-Liouville form, but can be put in the form by
multiplying by exp[ − x ]. Hence, we obtain
∂ ∂
x exp[ − x ] Ln (x) + n exp[ − x ] Ln (x) = 0 (1163)
∂x ∂x
The solution is defined on the interval (0, ∞). Therefore, the Laguerre functions
are orthogonal with weighting factor exp[ − x ].
202
and
∂ p
x L (x) = n Lpn (x) − ( n + p ) Lpn−1 (x) (1167)
∂x n
From these, or by differentiating Laguerre’s differential equation p times, one
finds
∂2 p ∂ p
x Ln (x) + ( p + 1 − x ) L (x) + n Lpn (x) = 0 (1168)
∂x2 ∂x n
The weighting function is found to be
xp exp[ − x ] (1169)
Example:
h̄2 1 ∂ h̄2 l ( l + 1 ) Z e2
2 ∂R
− 2
r + 2
R − R = E R (1172)
2 m r ∂r ∂r 2m r r
where the second term is the centrifugal potential. This equation can be put
in a dimensionless form by defining a characteristic length scale r0 by equating
the centrifugal potential with the Coulomb attraction. (This just corresponds
to the radius of the circular orbit in Classical Mechanics.) The radius is found
as
h̄2
r0 = (1173)
2 m Z e2
Then, introducing a dimensionless radius ρ as the ratio of the radius to that of
the circular orbit
r
ρ = (1174)
r0
203
one sees that the on expressing the Laplacian in dimensionless form the energy
term is given by the following constant
2 m E r02 E h̄2 1
2 = 2 4
= − (1175)
h̄ 2mZ e 4 α2
which must also be dimensionless. The second equality was found by substitut-
ing the expression for r0 . The dimensionless constant must be negative, since
an electron that is bound to the hydrogen atom does not have enough energy
to escape to infinity. The minimum energy required to escape to infinity is zero
energy, since the potential falls to zero at infinity and the minimum of the ki-
netic energy is also zero. That is the minimum energy E is that in which the
electron comes to rest at infinity. Thus, since E is less than this minimum it is
negative. In terms of the variable, ρ and the constant α one has the differential
equation
1 ∂ 2 ∂R(ρ r0 ) 1 l(l + 1) 1
ρ + − − R(ρ r0 ) = 0 (1176)
ρ2 ∂ρ ∂ρ ρ ρ2 4 α2
The form of the solution can be found by examining the equation near ρ →
0. In this case, the centrifugal potential is much larger than the Coulomb
potential so one can neglect the Coulomb potential. Also, for the same reason,
the energy constant 4 1α2 is also negligible compared with the centrifugal barrier.
The equation reduces to
1 ∂ 2 ∂R(ρ r0 ) l(l + 1)
ρ − R(ρ r0 ) = 0 (1177)
ρ2 ∂ρ ∂ρ ρ2
R(r) ∝ ρl (1178)
since the other solution is proportional to ρ−(l+1) which diverges at the origin.
It is not acceptable to have a solution that diverges too badly at the origin.
204
which for large ρ simply becomes
∂ 2 R(ρ r0 ) 1
− R(ρ r0 ) = 0 (1181)
∂ρ2 4 α2
which has the solution
ρ
R(ρ r0 ) ∝ exp − (1182)
2α
as the other solution which diverges exponentially as
ρ
R(ρ r0 ) ∝ exp + (1183)
2α
is physically unacceptable for an electron that is bound to the nucleus at a dis-
tance r0 . Thus, we have found that electrons are bound at a distance of 2 α r0
from the origin.
This motivates one looking for a solution with the right forms at the origin
and at infinity. This form can be expressed as
ρ ρ
R(ρ r0 ) = exp − ρl L( ) (1184)
2α α
To substitute this form into the differential equation, one needs to evaluate the
first and second order derivatives of the form. First note that
∂R(ρ r0 ) ρ l 1 l ρ 1 0 ρ
= exp − ρ − + L( ) + L ( ) (1185)
∂ρ 2α 2α ρ α α α
and then note that
∂ ∂R(ρ r0 ) ρ
ρ2 = exp − ρl+2 ×
∂ρ ∂ρ 2α
1 (l + 1) l(l + 1) ρ
× 2
− + 2
L( )
4α αρ ρ α
1 2l + 2 ρ 1 ρ
+ − 2 + L0 ( ) + 2 L00 ( ) (1186)
α ρα α α α
Thus, one finds that L satisfies the equation
ρ 00 ρ 1 0 (l + 1)
L + ( 2 l + 2 − ) L + 1 − L = 0 (1187)
α2 α α α
or
ρ 00 ρ
L + (2l + 2 − ) L0 + α − (l + 1) L = 0 (1188)
α α
205
(2l+1)
Hence L(x) is the associated Laguerre polynomial Lα − l − 1 (x), when α is an
integer. If the function is to be normalizable, it is necessary for the series to
terminate, and hence α − l − 1 must be an integer. It is usual to set n = α,
and have the condition n ≥ l + 1. The radial wave function is given by
l r (2l+1) r
R(r) ∝ r exp − L(n−l−1) (1189)
2 n r0 n r0
Example:
one has
h̄2 1 ∂ h̄2 l ( l + 1 ) m ω 2 r2
2 ∂R
− r R + R + R = ER
2 m r2 ∂r ∂r 2 m r2 2
(1192)
On writing
l ρ
R(r) = ρ 2 exp − L(ρ) (1193)
2
with a dimensionless variable
mω 2
ρ = r (1194)
h̄
one has
r
∂R(r) m ω l+1 ρ l 1 ∂
= 2 ρ 2 exp − − + L (1195)
∂r h̄ 2 2ρ 2 ∂ρ
∂ 2 R(r)
m ω l+2 ρ l(l − 1) 2l + 1 1
= 4 ρ 2 exp − − +
∂r2 h̄ 2 4 ρ2 4ρ 4
2
2l + 1 ∂ ∂
+ − 1 + L (1196)
2ρ ∂ρ ∂ρ2
206
This leads to the equation
∂2L
2l + 3 ∂L E 2l + 3
−ρ − − ρ = − L (1197)
∂ρ2 2 ∂ρ 2 h̄ ω 4
Problem:
m2
∂ ∂u 1 1
x + Ex + l − − F x2 u = 0 (1199)
∂x ∂x 2 4x 4
where F is a measure of the strength of the electric field. Find the unperturbed
wave function in terms of the Associated Laguerre polynomials.
207
12 Inhomogeneous Equations
12.1 Inhomogeneous Differential Equations
Inhomogeneous linear differential equations can be solved by Green’s function
methods. For example, if one has an equation of the type
∂ ∂φ
p(x) + ( q(x) − λ w(x) ) φ(x) = f (x) (1200)
∂x ∂x
∂G(x, x0 )
∂
p(x) + ( q(x) − λ w(x) ) G(x, x0 ) = δ(x − x0 ) (1202)
∂x ∂x
Since we can write the inhomogeneous term of our original equation in the form
of an integral
Z b
f (x) = dx0 δ(x − x0 ) f (x0 ) (1204)
a
then we can view the original inhomogeneous equation as a linear superposition
of equations with δ function source terms, but in which the various terms are
weighted with a factor of f (x0 ). That is, the solution can be expressed as
Z b
φ(x) = dx0 G(x, x0 ) f (x0 ) (1205)
a
This expression satisfies the equation for φ(x), as can be seen by direct substi-
tution. This means that once the Green’s function has been found, the solution
of the inhomogeneous equation, for any reasonable f (x), can be found by inte-
gration.
208
12.1.1 Eigenfunction Expansion
One method of finding the Green’s function is based on the completeness re-
lation for the eigenvalues of the Stürm-Liouville equation in which the control
parameter λ has been set to zero
X
δ(x − x0 ) = φ∗n (x0 ) w(x) φn (x) (1206)
n
Due to the completeness relation one can find an expansion of the Green’s
function in the form
X
G(x, x0 ) = Gn (x0 ) φn (x) (1207)
n
0
where Gn (x ) is an unknown coefficient. As φn (x) satisfies the same boundary
conditions as G(x, x0 ), this expansion is satisfies the boundary conditions. On
substituting these two expansions in the equation one obtains
X X
( λn − λ ) Gn (x0 ) w(x) φn (x) = φ∗n (x0 ) w(x) φn (x) (1208)
n n
Therefore, if the control parameter is chosen such that λn 6= λ for all n, we have
found that the Green’s function is given in terms of a sum of the eigenfunctions
X φ∗ (x0 ) φn (x)
G(x, x0 ) = n
(1210)
n
λn − λ
209
utilizing the arbitrary constants of integration. In one-dimension, the point
x0 separates the interval into two disjoint regions. The solution of the Green’s
function in the two regions b x > x0 and x0 > x > a coincide with the solution
of the homogeneous equation with the appropriate boundary conditions. Then
in the first region, b > x > x0 , one can have
which satisfies the boundary condition at x = b. In the second region one can
have x0 > x > a
G(x, x0 ) = C2 φ2 (x) (1214)
where φ2 (x) satisfies the boundary condition at x = a. The arbitrary constants
C1 and C2 can be obtained from consideration of the other boundaries of the two
intervals, that is the point x = x0 . The Green’s function must be continuous
at x = x0 , which requires that
which leads to
∂φ1 (x) ∂φ2 (x) 1
C1 − C 2 = (1217)
∂x x=x0 ∂x x=x0 p(x0 )
The above pair of equations for the constants C1 and C2 has a solution if the
Wronskian determinant is non-zero
φ1 (x0 ) φ2 (x0 )
W (φ1 , φ2 ) = ∂φ1 (x0 ) ∂φ2 (x0 )
(1218)
∂x ∂x
210
So the constants C1 and C2 are found as
φ2 (x0 )
C1 = −
C
φ1 (x0 )
C2 = −
C
(1221)
and so the Green’s function is given by
1
G(x, x0 ) = − φ1 (x) φ2 (x0 ) (1222)
C
for b > x > x0 and
1
G(x, x0 ) = − φ1 (x0 ) φ2 (x) (1223)
C
for x0 > x > a . Therefore, the Green’s function is symmetric under the
interchange of x and x0 .
is given by
G(x, t) = −x for 0 ≤ x < t
G(x, t) = −t for t < x ≤ 1 (1227)
211
This is solved by noting that for x 6= x0 the equation for the Green’s
function is given by the solution of
∂ 2 G(x, x0 )
= 0 (1228)
∂x2
This has a general solution
G(x, x0 ) = a x + b (1229)
The Green’s function in the region x0 > x ≥ 0 must satisfy the boundary
condition at x = 0. Thus, the Green’s function has to satisfy the boundary
condition
G< (0, x0 ) = a< 0 + b< = 0 (1230)
Hence, we have b< = 0 so the Green’s function in this region is given by
∂G> (x, x0 )
∂x = 0 (1232)
1
which leads to
∂G> (x, x0 )
∂x = a> = 0 (1233)
1
0 − a< = 1 (1237)
212
Hence, we have
G< (x, x0 ) = −x
0
G> (x, x ) = − x0 (1238)
Example:
The Green’s function can be obtained from knowledge of the solutions of the
homogeneous equation. The general solution of the homogeneous equation is
given by
G(x, x0 ) = a sin x + b cos x (1241)
In the first region ( < ) where x0 > x > 0 the Green’s function has to satisfy
the boundary condition
G< (0, x0 ) = a< sin 0 + b< cos 0 = 0 (1242)
Hence,
G< (x, x0 ) = a< sin x (1243)
0
In the second region ( > ) where 1 > x > x we have
∂G> (x, x0 )
= a> cos 1 − b> sin 1 = 0 (1244)
∂x
x=1
∂x 0 − ∂x 0 = 1
x x
a< cos x0 − a> cos x0 + b> sin x0 = 1 (1246)
213
Eliminating a> , using the relation a> = b> tan 1, one obtains the pair of
equations
Example:
∂G(x, x0 )
∂
x = δ( x − x0 ) (1251)
∂x ∂x
G(x, x0 ) = a ln x + b (1252)
and
G> (1, x0 ) = a> ln 1 + b> = 0 (1254)
214
The continuity condition leads to
G> (x0 , x0 ) = G< (x0 , x0 ) (1255)
or
a> ln x0 = b< (1256)
0
On integrating over a region of infinitesimal width around x , yields
Z x0 + Z x0 +
∂G(x, x0 )
∂
dx x = dx δ( x − x0 )
x0 − ∂x ∂x x0 −
∂G> (x, x0 ) ∂G< (x, x0 )
0
x
∂x 0 − ∂x 0 = 1 (1257)
x x
Thus, a> = 1 and from the continuity condition one has b< = ln x0 . Hence,
we find
G< (x, x0 ) = ln x0
G> (x, x0 ) = ln x (1258)
Example:
Find the series solution for the Green’s function which satisfies the equation
∂ 2 G(x, x0 )
= δ( x − x0 ) (1259)
∂x2
with boundary conditions
G(0, x0 ) = G(1, x0 ) = 0 (1260)
The eigenvalue equation
∂ 2 φ(x)
= λ φ(x) (1261)
∂x2
satisfying the same boundary conditions has eigenfunctions φn (x) given by
√
φn (x) = 2 sin nπx (1262)
and eigenvalue λ = − π 2 n2 . On writing the Green’s function as a series
expansion in terms of a complete set of functions
∞
X
G(x, x0 ) = Gn (x0 ) φn (x) (1263)
n=1
then the expansion coefficients can be found by substitution into the differential
equation
X∞
− π 2 n2 Gn (x0 ) φn (x) = δ( x − x0 ) (1264)
n=1
215
On multiplying by φm (x) and integrating over x we obtain
φm (x0 )
Gm (x0 ) = − (1265)
π 2 m2
Thus, we have
∞
X sin nπx sin nπx0
G(x, x0 ) = − 2 (1266)
n=1
π 2 n2
This can be compared with the expression
G< (x, x0 ) = x ( x0 − 1 )
G> (x, x0 ) = ( x − 1 ) x0 (1267)
————————————————————————————————–
Example:
216
along with the boundary conditions
Thus,
k B cos k ( x0 − l ) − k A cos k x0 = 0 (1277)
0
Continuity of the Green’s function at x = x yields
or
B sin k ( x0 − l ) = A sin k x0 (1279)
This, leads to the determination of A and B as
sin k ( x0 − l )
A =
k sin k l
sin k x0
B = (1280)
k sin k l
Hence, the Green’s function is given by
sin k x< sin k ( x> − l )
G(x, x0 ) = (1281)
k sin k l
which is symmetric in x and x0 . The solution of the forced equation is given by
Z l
u(x) = dx0 G(x, x0 ) f (x0 )
0
Z x
sin k ( x − l )
= dx0 sin k x0 f (x0 )
k sin k l 0
Z l
sin k x
+ dx0 sin k ( x0 − l ) f (x0 ) (1282)
k sin k l x
————————————————————————————————–
217
12.2 Inhomogeneous Partial Differential Equations
The Greens’ function method can also be used for inhomogeneous partial dif-
ferential equations. Examples of inhomogeneous partial differential equations
which often occur in physics are given by
∇2 φ(→−r) = − 4 π ρ(→
−
r)
2 →−
1 ∂ φ( r , t)
∇2 φ(→
−
r , t) − 2 = − 4 π ρ(→
−
r , t)
c ∂t2
→
−
1 ∂φ( r , t)
∇2 φ(→
−r , t) − = − 4 π ρ(→
−
r , t) (1283)
κ ∂t
subject to appropriate boundary conditions. These three equations can be
solved, for arbitrary source terms ρ, from knowledge of the respective Green’s
functions which satisfy the differential equations
∇2 G(→ −r ,→− r 0 ) = δ 3 (→
−
r −→
−
r 0)
1 ∂ 2 G(→
−r , t; →
−r 0 , t0 )
∇2 G(→
−
r , t; →
−
r 0 , t0 ) − 2 = δ 3 (→
−
r −→
−
r 0 ) δ(t − t0 )
c ∂t2
1 ∂G(→ −r , t; →
−r 0 , t0 )
∇2 G(→
−r , t; →
−
r 0 , t0 ) − = δ 3 (→
−
r −→
−
r 0 ) δ(t − t0 )
κ ∂t
(1284)
with the same boundary conditions. In these equations we have introduced a
three-dimensional delta function. These are defined via
Z
d3 →
−
r δ 3 (→
−
r −→
−
r 0 ) f (→
−
r ) = f (→
−
r 0) (1285)
218
12.2.1 The Symmetry of the Green’s Function.
The Green’s function is symmetric in its arguments (→
−
r ,→
−
r 0 ), such that
G(→
−
r , t; →
−
r 0 , t0 ) = G(→
−
r 0 , t0 ; →
−
r , t) (1289)
1 ∂ 2 G(→ −
r , t; →
−
r 0 , t0 )
− = δ 3 (→−
r −→−r 0 ) δ(t − t0 ) (1290)
c2 ∂t2
and the similar equation with a source at (→ − r ”, t”). Multiplying the equation for
G( r , t; r , t ) by G( r , t; r ”, t”) and subtracting it from G(→
→
− →
− 0 0 →
− →
− −r , t; →
−
r 0 , t0 ) times
→
− →
−
the equation for G( r , t; r ”, t”), one obtains
→
− →
− →
− →
−
G(→
−
r , t; →
−
r ”, t”) ∇ . p(→
−
r , t) ∇ G(→
−
r , t; →
−
r 0 , t0 ) − G(→
−
r , t; →
−
r 0 , t0 ) ∇ . p(→
−
r , t) ∇ G(→
−
r , t; →
−
r ”, t”)
1 →
− →
− ∂ 2 G(→
−
r , t; →
−
r 0 , t0 ) 1 →− →
−
2 →
− → −
0 0 ∂ G( r , t; r ”, t”)
− G( r , t; r ”, t”) + G( r , t; r , t )
c2 ∂t2 c2 ∂t2
→
− →
− 3 →
− →
− 0 0 →
− →
− 0 0 3 →− →
−
= G( r , t; r ”, t”) δ ( r − r ) δ(t − t ) − G( r , t; r , t ) δ ( r − r ”) δ(t − t”)
(1291)
On integrating over →
−r and t, one obtains
Z ∞ Z
→
− →
−
dt d2 S G(→−
r , t; →
−
r ”, t”) . p(→−
r , t) ∇ G(→
−
r , t; →
−
r 0 , t0 )
−∞
Z ∞ Z
2→
− →
− →
− 0 0 →
− →
− →
− →
−
− dt d S G( r , t; r , t ) . p( r , t) ∇ G( r , t; r ”, t”)
−∞
∂G(→
−r , t; →
− t=∞
r 0 , t0 )
Z
1 3→
− →
− →
−
− d r G( r , t; r ”, t”)
c2 ∂t
t=−∞
1
Z
∂G(→
−r , t; →
−
r ”, t”)
t=∞
d3 →
−
r G(→ −
r , t; →
−
r 0 , t0 )
+
c2 ∂t
t=−∞
→
− 0 0 →− →
− →
−
= G( r , t ; r ”, t”) − G( r ”, t”; r , t )0 0
(1292)
G(→
−
r 0 , t0 ; →
−
r ”, t”) = G(→
−
r ”, t”; →
−
r 0 , t0 ) (1293)
219
We shall now illustrate the solution of the equation for the Green’s function
of Poisson’s equation and the Wave Equation, in several different geometries.
Poisson’s Equation
————————————————————————————————–
Example:
The Green’s function for Poisson’s equation inside a sphere is given by the
solution of
∇2 G(→−r ,→
−
r 0 ) = δ 3 (→
−
r −→
−r 0) (1294)
In spherical coordinates the delta function can be written as
δ(r − r0 ) δ(θ − θ0 )
δ 3 (→
−
r −→
−
r 0) = δ(ϕ − ϕ0 )
r2 sin θ
∞ m=l
δ(r − r0 ) X X
= Y ∗m 0 0 m
l (θ , ϕ ) Yl (θ, ϕ)
r2
l=0 m=−l
(1295)
where we have used the completeness condition for the spherical harmonics. The
Green’s function can also be expanded in powers of the spherical harmonics
∞ m=l
G(→
−
r ,→
− Gl,m (r, →
−
X X
r 0) = r 0) Y ∗m
l (θ, ϕ) (1296)
l=0 m=−l
On substituting the expansion for the Green’s function into the differential equa-
tion one obtains
X 1 ∂
2 ∂ l(l + 1)
r Gl,m − Gl,m Ylm (θ, ϕ)
r2 ∂r ∂r r2
l,m
δ(r − r0 ) X ∗ m
= Y l (θ, ϕ) Ylm (θ, ϕ) (1297)
r2
l,m
On multiplying by the spherical harmonics, and integrating over the solid angle,
the orthogonality of the spherical harmonics yields
δ(r − r0 ) ∗ m 0 0
1 ∂ 2 ∂ l(l + 1)
r G l,m − G l.m = Y l (θ , ϕ ) (1298)
r2 ∂r ∂r r2 r2
Since the left-hand-side has no explicit dependence on the angle (θ0 , ϕ0 ), one can
separate out the angular dependence
Gl,m (r, →
−
r 0 ) = gl (r, r0 ) Y ∗ m 0 0
l (θ , ϕ ) (1299)
220
and find
δ(r − r0 )
1 ∂ 2∂ l(l + 1)
r gl,m − gl.m = (1300)
r2 ∂r ∂r r2 r2
This is the equation for a one-dimensional Green’s function. For r 6= r0 , the
inhomogeneous term vanishes so one has solutions of the form
(
0
1
A rl + B r(l+1) r < r0
gl (r, r ) = (1301)
l 1
C r + D r(l+1) r > r0
221
Hence, the radial part of the Green’s function is
2l+1
0
1 − ra0 r < r
l 0l
r r
gl (r, r0 ) = (1308)
(2l + 1) a2l+1
2l+1
1 − ar r > r0
where we have used the spherical harmonic addition theorem and γ is the angle
between →
−
r and →−r 0.
In the limit that the boundaries are removed to infinity, a → ∞, the Green’s
function simplifies to
2l+1
1 0
r < r
r0
l 0l
0 r r
gl (r, r ) = −
(2l + 1) 2l+1
1 0
r r > r
l
1 r<
= − (1310)
(2l + 1) r> r<
Thus, the Green’s function in infinite three-dimensional space is given by the
familiar result
∞ l
1 r<
G(→
−
r ,→
−
X
r 0) = − Pl (cos γ)
4 π r> r<
l=0
1
= − p
4π r2 − 2 r r0 cos γ + r02
1
= − (1311)
4π| r − →
→
− −
r0 |
which is the Green’s function for Laplace’s equation.
————————————————————————————————–
222
Example:
The solution of Poisson’s equation in two dimensions, for the potential φ(r)
confined by a conducting ring of radius a can be obtained from the Green’s
function. The Green’s function satisfies the partial differential equation
∂G(→−r ,→
−r 0) 1 ∂ 2 G(→
−
r ,→
−r 0) δ( r − r0 )
1 ∂
r + 2 = δ( θ − θ0 ) (1312)
r ∂r ∂r r ∂θ2 r
On using the completeness relation
∞
1 X
δ( θ − θ0 ) = exp i m ( θ − θ0 ) (1313)
2π m=−∞
one finds that the coefficient Gm must satisfy the ordinary differential equation
∂Gm (r, →
−r 0) m2 δ( r − r0 )
1 ∂ →
− 0 1 0
r − 2 Gm (r, r ) = √ exp − i m θ
r ∂r ∂r r r 2π
(1315)
Introducing the definition
→
− 0 0 1 0
Gm (r, r ) = gm (r, r ) √ exp − i m θ (1316)
2π
∂gm (r, r0 ) m2 δ( r − r0 )
1 ∂
r − 2 gm (r, r0 ) = (1317)
r ∂r ∂r r r
g0 (r, r0 ) = A0 + B0 ln r (1318)
gm (r < r0 ) = Am rm (1320)
223
The boundary condition at r = a leads to a relation between the coefficients
of rm and r−m . The solution close to the edge of the ring has the form
" m #
m
0 r a
gm (r > r ) = Cm − (1321)
a r
g0 (r < r0 ) = A0 (1322)
gm (r0 + , r0 ) = gm (r0 − , r0 )
r0 +
∂gm (r, r0 ) 1
= (1324)
∂r
r 0 − r0
The boundary conditions at the intersection of the two intervals determine the
remaining two coefficients via
" m m #
r0
0m a
Am r = Cm − (1325)
a r0
and " m m #
r0
a
m Cm + − m Am r0m = 1 (1326)
a r0
These two equations can be solved resulting in
0 m
1 r
Cm = (1327)
2m a
and " m m #
a−m r0
a
Am = − (1328)
2m a r0
Hence, the solution is given by
m " m m #
r0
0 1 r a
gm (r < r ) = −
2m a a r0
m " m m #
r0
0 1 r a
gm (r > r ) = − (1329)
2m a a r
224
and
r0
g0 (r < r0 ) = ln
a
r
g0 (r > r0 ) = ln (1330)
a
The Green’s function is given by
∞
1
G(→
−
r ,→
−
X
r 0) = 0
gm (r, r ) exp 0
im(θ − θ )
2π m=−∞
∞
1 0
X
0 0
= g0 (r, r ) + 2 gm (r, r ) cos m ( θ − θ )
2π m=1
(1331)
In the limit that the boundaries are removed to infinity a → ∞ one has
m
0 1 r<
gm (r, r ) = − (1332)
2m r>
so
∞ m
1 r> 1 r<
G(→
−
r ,→
−
X
r 0) = ln − 0
cos m ( θ − θ )
2π a m=1
m r>
r + r02 − 2 r r0 cos ( θ − θ0 )
2
1
= ln
4π a2
1 |→−
r − →−
r0 |
= ln (1333)
2π a
which is the Green’s function for the Laplacian operator in an infinite two-
dimensional space.
————————————————————————————————–
Example:
1 ∂2φ
1
∇2 φ − 2 2
= − 2 f (→ −r , t) (1334)
c ∂t c σ
225
where f is the force per unit area normal to the drumhead and σ is the mass
density. The boundary condition on the displacements is φ(→ −r , t) = 0. We
shall assume that the driving force has a temporal Fourier decomposition
Z ∞
→
− dω →
−
f ( r , t) = √ f ( r , ω) exp − i ω t (1335)
−∞ 2π
Due to the linear nature of the equation one can find the solution φ(→−
r , t) from
→
−
the temporal Fourier transform φ( r , ω) defined via
Z ∞
→
− dω →
−
φ( r , t) = √ φ( r , ω) exp − i ω t (1336)
−∞ 2π
On Fourier transforming the differential equation with respect to t, one has
Z ∞
1 ∂2φ
dt 2
√ ∇ φ − 2 exp + i ω t
−∞ 2π c ∂t2
Z ∞
1 dt →
−
= − 2 √ f ( r , t) exp + i ω t (1337)
c σ −∞ 2π
On integrating the second derivative with respect to time by parts, twice, and
assuming that limt→±∞ φ(→ −r , t) = 0, one obtains
ω2 →
→
− − 1
2
∇ φ( r , ω) + 2 φ( r , ω) = − 2 f (→ −r , ω) (1338)
c c σ
φ(→
−
r 0 , ω)
Z
1
= √ + 2 d2 →
−
r G(→
−
r ,→
−
r 0 ; ω) f (→
−
r , ω)
2π c σ
(1340)
226
Also, using the two dimensional version of Green’s theorem, this is equal to
∂G(→−
r ,→
−r 0 ; ω) ∂φ(→
−
Z
→
− →
− →
− 0 r ; ω)
= dl φ( r , ω) − G( r , r ; ω) (1341)
∂n ∂n
where the integral is over the perimeter of the two dimensional area and the
derivative with respect to n ia taken along the normal to the perimeter. For a
∂ ∂
circular area of radius a, one has dl = a dθ and ∂n = ∂r . Since both terms
vanish on the perimeter of the drumhead, φ( r , ω) = 0 and G(→
→
− −
r ,→
−
r 0 ; ω) = 0,
one has
φ(→−r 0 , ω)
Z
1
√ = − 2 d2 →
−
r G(→
−r ,→
−
r 0 ; ω) f (→
−
r , ω) (1342)
2π c σ
or using the symmetry of the Green’s function
φ(→
−r 0 , ω)
Z
1
√ = − 2 d2 →
−
r G(→−
r 0, →
−
r ; ω) f (→
−
r , ω) (1343)
2π c σ
Thus, the solution of the forced equation can be obtained from the Green’s
function and the forcing function. In particular one has
Z ∞
dω
φ(→
−r , t) = √ φ(→−
r , ω) exp − i ω t
−∞ 2π
Z ∞ Z
1
= − 2 dω d2 →
−
r 0 G(→
−
r ,→
−
r 0 ; ω) f (→
−
r 0 , ω) exp − i ω t
c σ −∞
(1344)
where we have used the completeness relation for the Fourier series to expand
the delta function of the angle.
227
On substituting these into the equation for the Green’s function, multiplying
by the complex conjugate basis function
1
√ exp − i m θ (1347)
2π
and integrating over θ, one obtains
∂Gm (r, →
−r 0) δ(r − r0 )
2
m2
1 ∂ ω 1
r + − 2 Gm (r, →
−
r 0) = exp − imθ 0
r ∂r ∂r c2 r 2π r
(1348)
The inhomogeneous term is zero for both regions r > r0 and r0 > r, so that
the solution is
ω
gm (r, r0 ) = A Jm ( r) r < r0
c
ω ω
gm (r, r0 ) = F Jm ( r) + G Nm ( r) r > r0
c c
(1351)
228
The matching conditions at r = r0 are given by
gm (r0 + , r0 ) = gm (r0 − , r0 )
r0 +
∂gm (r, r0 ) 1 1
= √ (1354)
∂r
r 0 − 2 π r0
Hence, the coefficients A and B satisfy the linear equations
ω ω ω ω ω
A Jm ( r0 ) = B Nm ( a) Jm ( r0 ) − Jm ( a) Nm ( r0 )
c c c c c
1 1 ω ω 0 ω 0 ω 0 ω 0 0 ω 0
√ = B N m ( a) J m ( r ) − J m ( a) N m ( r ) − A J m ( r )
2 π r0 c c c c c c
(1355)
The solution of these equations can be simplified by noting that the Wronskian
of the solutions of the Bessel functions is given by
0 0 2
Jm (z) Nm (z) − Jm (z) Nm (z) = (1356)
πz
The solution can be written as
ω ω 0 ω ω 0
Nm ( c a) Jm ( c r ) − Jm ( c a) Nm ( c r )
1 π
A = − √
2π 2 Jm ( ωc a)
1 π Jm ( ωc r0 )
B = − √ ω (1357)
2 π 2 Jm ( c a)
The radial Green’s function can be written as
π Jm ( ωc r)
1 ω ω 0 ω ω 0
gm (r, r0 ) = − √ ω N m ( a) J m ( r ) − J m ( a) N m ( r ) r0 > r
2 π 2 Jm ( c a) c c c c
ω 0
1 π Jm ( c r ) ω ω ω ω
gm (r, r0 ) = − √ ω N m ( a) J m ( r) − Jm ( a) N m ( r) r0 < r
2 π 2 Jm ( c a) c c c c
(1358)
229
The time-dependent Green’s function is given by
Z ∞
→
− →
− 0 0 dω →
− →
− 0 0
G( r , r ; t − t ) = √ G( r , r ; ω) exp − i ω ( t − t ) (1360)
−∞ 2π
Poisson’s Equation.
————————————————————————————————–
Example:
∇2 G(→
−
r ,→
−
r 0 ) = δ 3 (→
−
r −→
−
r 0) (1361)
Solution:
→ (→
∇2 φ− − → (→
→ φ−
r ) = λ− −
r) (1362)
k k k
and has eigenfunctions
1 →
− −
→ (→
φ− −
r) = 3 exp i k .→
r (1363)
k ( 2 π )2
230
The Green’s function can be expanded as
Z
→
−
G(→−
r ,→
−
r 0) = → (→
d3 k G− − → (→
r 0 ) φ− −
r) (1365)
k k
and the delta function can be expanded using the completeness relation for the
eigenfunctions
Z
→
− ∗ →
δ 3 (→
−
r −→
−r 0) = − (−
d3 k φ→ → (→
r 0 ) φ− −
r) (1366)
k k
Inserting both of these expansions in the equation for the Green’s function we
find the expansion coefficients of the Green’s function satisfy the equation
Z Z
→
− →
− ∗ →
− → (→
d3 k k 2 G− − → (→
r 0 ) φ− −
r) = d3 k φ→− (− → (→
r 0 ) φ− −
r) (1367)
k k k k
Multiplying by φ→∗ →
− →
−
− 0 ( r ) and integrating over r one has the orthogonality re-
k
lation Z
→
− → −0
d3 →
−
r φ→− 0 (→
∗ −
r ) φ−→ (→−
r ) = δ3 ( k − k ) (1368)
k k
one projects out the coefficient G→ − 0 (→
−
r ) so
k
− 0 (→
− k 02 G→ − − 0 (→
∗
r ) = φ→ −
r 0) (1369)
k k
Hence, the Green’s function is given by
∗ →
Z − (−
φ→ r 0 ) φ−→ (→
−r)
3→
− k k
G(→
−
r ,→
−
r 0) = − d k
k2
→
− →
− →
− 0
→
− exp i k . ( r − r )
d3 k
Z
= −
( 2 π )3 k2
(1370)
It is seen that the Green’s function only depends on the difference of the po-
sitions, →
−
r −→ −r 0 , and not on →
−
r and →
−r 0 separately. This is because we have
assumed that space is homogeneous. To simplify further calculations we shall
re-center our coordinates on the source point at →−r 0 . In this case, the Green’s
function only depends on r .→
−
231
2π π ∞ exp i k r cos θ
dk k 2
Z Z Z
= − dϕ d θ sin θ
0 0 0 ( 2 π )3 k2
(1371)
where the polar axis has been chosen along the direction of → −
r . The integral
over ϕ and cos θ can be performed as
Z 2π Z 1 Z ∞
→
− dk
G( r ) = − dϕ d cos θ exp i k r cos θ
0 −1 0 ( 2 π )3
Z ∞ exp i k r − exp − i k r
dk
= −
0 ( 2 π )2 ikr
(1372)
where the integral over the singularity at k = 0 is interpreted as being just the
principal part. That is, the integral is evaluated over two regions (−∞, −ε) and
(ε, ∞), which end symmetrically around the singularity at k = 0. The integral
is evaluated by taking the limit ε → 0. The integration can be evaluated by
Cauchy’s theorem to yield π i times the residue. Hence, we obtain
1
G(→
−
r) = − (1374)
(4πr)
————————————————————————————————–
Example:
232
where in ultra-spherical polar coordinates one has the delta function
0 0
δ(r − r0 ) δ(θd−1 − θd−1 ) δ(θd−2 − θd−2 ) δ(θ2 − θ20 )
δ d (→
−
r −→
−
r 0) = . . . δ(ϕ−ϕ0 )
rd−1 sind−2 θd−1 sind−3 θd−2 sin θ2
(1377)
One can solve for the Green’s function using the eigenfunction expansion method.
The Green’s function is expanded as
→
−
dd k
Z
→
− →
− 0 →
− 0 →
− → −
G( r , r ) = d G−→ ( r ) exp i k . r (1378)
( 2 π )2 k
The delta function can also be expanded using the completeness relation
→
−
dd k
Z
d →
− →
− 0 →
− →
− →
− 0
δ (r − r ) = exp i k . ( r − r ) (1379)
( 2 π )d
These are substituted into the equations of motion, which then gives an equation
involving all the Green’s function expansion coefficients. The orthogonality
relation of the eigenfunctions
dd →
−
Z
r →
− →
− →
−0 →
− → −0
exp i r . ( k − k ) = δd ( k − k ) (1380)
( 2 π )d
233
Thus, we have
∞ exp i k r cos θd−1
dk k d−1 π
Z Z
G(→
−
r) = − dθ d−1 sind−2
θ d−1
0 ( 2 π )d 0 k2
Z π Z π Z 2π
× dθd−2 sind−3 θd−2 . . . dθ2 sin θ2 dϕ
0 0 0
(1385)
The angular integrations which only involve the weight factors can be performed
using the formula
√ Γ( m+1
Z π
2 )
dθ sinm θ = π m+2 (1386)
0 Γ( 2 )
where the Γ function is the generalized factorial function defined by
Z ∞
Γ(z) = dx exp − x xz−1 (1387)
0
One can show that, by integrating by parts with respect to x, the Γ function
satisfies the same recursion relation as the factorial function
and as Z ∞
Γ(1) = dx exp − x = 1 (1389)
0
Γ(n + 1) = n! (1390)
π d−2
√
d−1
Z
d−2 2 2
dθd−1 sin θd−1 exp i k r cos θd−1 = π Γ( ) J d−2 (kr)
0 2 kr 2
(1392)
234
Thus, we find
∞ d−2
dk k d−3
Z
1 2
G(→
−
r) = − d J d−2 (kr)
0 ( 2 π )2 kr 2
d−2 Z ∞
1 2
dk d−4
= − d k 2 J d−2 (kr) (1393)
r 0 (2π) 2 2
Finally, the integral over k can be evaluated with the aid of the formula
∞
2µ Γ( µ+ν+1 )
Z
dk k µ Jν (kr) = 2
(1394)
0 rµ+1 Γ( µ−ν+1
2 )
Γ( d−2
→
− 2 ) 1
G( r ) = − d (1395)
4 π2 rd−2
The d-dimensional Green’s function for the Laplacian operator reduces to the
three-dimensional case previously
√ considered, as can be seen by putting d = 3
and noting that Γ( 12 ) = π.
The Green’s function for the wave equation can be obtained by expansion
in terms of eigenfunctions.
————————————————————————————————–
Example:
The Green’s function for the wave equation, inside a two dimensional area,
satisfies the inhomogeneous equation
1 ∂2
2
∇ − 2 G(→−r ,→
−
r 0 ; t − t0 ) = δ 2 (→
−
r −→
−
r 0 ) δ(t − t0 ) (1396)
c ∂t2
Solution:
On Fourier transforming the equation for the Green’s function with respect
to t according to
Z ∞
→
− →
− 0 dt →
− →
− 0 0 0
G( r , r ; ω) = √ G( r , r ; t − t ) exp + i ω ( t − t ) (1397)
−∞ 2π
235
yields
ω2
1
∇2 + G(→
−
r ,→
−
r 0 ; ω) = √ δ 2 (→
−
r −→
−
r 0) (1398)
c2 2π
The eigenfunctions of the operator satisfy the eigenvalue equation
ω2
2
∇ + 2 φk,m (→
−r ) = λk,m φk,m (→ −
r) (1399)
c
where k, m label the eigenfunctions. The explicit form of the eigenvalue equation
is given by
1 ∂ 2 φk,m ω2
1 ∂ ∂φk,m
r + 2 + φ k,m = λk,m φk,m (1400)
r ∂r ∂r r ∂θ2 c2
The eigenfunctions that satisfy the boundary conditions at the origin and vanish
on a circle of radius a are given by
1
φk,m (r, θ) = Nm Jm (kr) √ exp i m θ (1401)
2π
where the normalization Nm is given by
−1
a 0
Nm = √ Jm (ka) (1402)
2
and the number k is given by the zeroes of the Bessel function
Jm (ka) = 0 (1403)
The eigenvalues are given by
ω2
λk,m = − k2 (1404)
c2
∞ X
1 X Nm2
Jm (kr) Jm (kr0 )
+ √ ω2
cos m ( θ − θ0 )
2π m=1
π c2 − k 2
k
(1405)
236
where we have used the symmetry of the product Jm (x) Jm (y) under the inter-
change of m and − m.
————————————————————————————————–
Example:
237
Z ∞ exp + ik|→
−
r − →
−
r0 |
dk k
=
−∞ ( 2 π )2 |→
−
r − →
−
r0 |
Z ∞ exp − i ω ( t − t0 )
dω
× ω2
(1411)
−∞ 2πi c2 − k2
The integration over ω can be performed using Cauchy’s theorem of contour
integration. The contour has to be closed in the lower-half complex ω plane for
positive t, and in the upper-half complex plane for negative t. There are poles
at ω = ± c k in the lower-half plane. This leads to
Z ∞
c Θ(t − t0 )
dk
G(→−r ,t : →
−
r 0 , t0 ) = − i → exp i k | →
−r − →
−
r 0
| sin c k ( t − t0 )
|−r − → −
r 0 | −∞ ( 2 π )2
c →
− →
− 0 0
= − δ | r − r | − c ( t − t ) (1412)
4π|→ −
r − → −r0 |
————————————————————————————————–
Example:
The Green’s function for the wave equation in an infinite d-dimensional space
can also be obtained using this method. First, the Green’s function equation
is Fourier transformed with respect to time. The frequency dependent Green’s
function is then defined as the solution of the inhomogeneous partial differential
equation
ω2
1
∇2 + 2 G(→−r ,→
−r 0 ; ω) = √ δd ( →
−
r − →−r0 ) (1413)
c 2π
Solution:
The Green’s function can be found by using the completeness relation of the
plane wave eigenfunctions of the Laplace operator. On expanding the Green’s
function in terms of plane waves and also using the representation of the d-
dimensional delta function
→
−
dd k
Z
→
−
δd ( →
−
r − →−
r0 ) = exp i k . ( →
−
r − →
−r 0
) (1414)
( 2 π )d
one finds that the frequency dependent Green’s function can be written as
→
−
Z exp i k . ( →
−
r − → −
r0 )
1 →
−
G(→−r ,→
−
r 0 , ω) = (d+ 1
)
dd k ω2
(1415)
2
(2π) c2 − k
2
238
The time-dependent Green’s function is given by the inverse Fourier Transform
of the frequency dependent Green’s function
→
− →
− →
− 0 0
Z ∞ Z exp i k . ( r − r ) − i ω ( t − t )
1 d→
−
G(→−r ,→
−
r 0 , t−t0 ) = dω d k ω2
( 2 π )(d+1) −∞ c2 − k
2
(1416)
Note that because space time is homogeneous ( we have no boundaries ) the
Green’s function is invariant under translations through time and space.
→
−
The integral over the d-dimensional volume dd k in ultra-spherical polar
coordinates is given by
Z ∞ Z π Z π Z π Z 2π
dk k d−1 dθd−1 sind−2 θd−1 dθd−2 sind−3 θd−2 . . . dθ2 sin θ2 dϕ
0 0 0 0 0
(1417)
The integral over the solid angle involves an integral over the principal polar
angle of the form
d−2
√
d−1
Z
(d−2) 2 2
dθ sin θ exp i x cos θ = π Γ( ) J d−2 (x) (1418)
2 x 2
Γ(n + 1) = n! (1420)
d d−2
= ( 2 π ) 2 x− 2 J d−2 (x) (1422)
2
239
as the Γ function is defined such that Γ(1) = 1.
Also the integral over ω can be performed, using contour integration. Since
we only want the causal part of the Green’s function for physical reasons, the
poles on the real ω-axis at ω = ± c k are displaced into the lower-half complex
plane, giving
Z ∞ exp − i ω t
2πc
dω ω2
= Θ(t) sin c k t (1423)
−∞ c2 − k
2 k
t > t0
1
Θ(t) = (1424)
0 t < t0
Then on defining
s = r2 + ( − i c t + ε )2 (1429)
240
and using the identity
1 ∂ n−1 −1
s−n = ( − 1 )n s (1430)
(n − 1)! ∂sn−1
one has
−n n−1 −1
2 2 1 1 ∂ 2 2
r + (−ict + ε) = − r + (−ict + ε)
(n − 1)! 2 r ∂r
(1431)
The term involving a second delta function has been dropped as it does not
contribute, due to the presence of the Heaviside function. The Green’s function
is retarded, in the sense that the solution at point (→ −r , t) only experiences the
effect of the source at the point (→ −r 0 , t0 ) only if t is later than t0 . Thus, the re-
tarded Green’s function expresses causality. However, it also shows that the an
effect at point (→
−
r , t) only occurs if the signal from the point source at (→ −
r 0 , t0 )
exactly reaches the point (→ −
r , t), and that in between the signal travels with
speed c.
241
13 Complex Analysis
Consider a function of a complex variable z = x + i y. Then the function
f (z) of the complex variable also has a real and imaginary part, so
where u and v are real functions of x and y. The derivative of a complex function
is defined by the limit
df (z) f (z + ∆z) − f (z)
= lim (1434)
dz ∆z→0 ∆z
provided that the limit exists and is independent of the path in which the limit
is taken. For example, if one considers
∆z = ∆x + i ∆y (1435)
Conversely, if the Cauchy Riemann conditions are satisfied then f (z) is con-
tinuous, and the derivative exists. This can be formulated as a theorem.
242
Theorem.
Let u(x, y) and v(x, y) be the real and imaginary parts of a function of a
complex variable f (z), which obey the Cauchy Riemann conditions and also
posses continuous partial derivatives ( with respect to x and y ) at all points in
some region of the complex plane, then f (z) is differentiable in this region.
Proof.
Since u and v have continuous first partial derivatives there exist four num-
bers, 1 , 2 , δ1 , δ2 that can be made arbitrarily small as ∆x and ∆y tend to zero,
such that
∂u ∂u
u(x+∆x, y+∆y) − u(x, y) = ∆x + ∆y + 1 ∆x + δ1 ∆y (1441)
∂x ∂y
and
∂v ∂v
v(x+∆x, y+∆y) − v(x, y) = ∆x + ∆y + 2 ∆x + δ2 ∆y (1442)
∂x ∂y
Multiplying the second of these equations by i and adding them one has
∂u ∂u ∂v ∂v
f (z + ∆z) − f (z) = ∆x + ∆y + i ∆x + i ∆y
∂x ∂y ∂x ∂y
+ 1 ∆x + δ1 ∆y + i 2 ∆x + i δ2 ∆y (1443)
————————————————————————————————–
243
Example:
u(x, y) = x2 − y 2 (1446)
and
v(x, y) = 2 i x y (1447)
Hence,
∂u ∂v
= 2x = (1448)
∂x ∂y
and
∂u ∂v
= −2y = − (1449)
∂y ∂x
However, the function f (z) = (z ∗ )2 does not satisfy the Cauchy Riemann
conditions, but is still continuous.
That is, the function only depends on z = x + i y and not on the complex
conjugate z ∗ = x − i y. It should be noted that not all functions can be Taylor
expanded about an arbitrary point, for example the functions
1
f (z) = (1451)
z
and
1
f (z) = exp − (1452)
z
can not be Taylor expanded around the point z = 0. The partial derivatives
do not exist, and the Cauchy-Riemann conditions do not hold at z = 0. The
radius of convergence of the Taylor expansion about the origin is zero.
Analytic Functions
244
A function f (z) is analytic at the point z0 if it satisfies the Cauchy Riemann
conditions at z0 . If f (z) is analytic at all points in the entire complex z plane
it is defined to be an entire function.
x = x(s)
y = y(s) (1453)
which reduces the contour integration to the sum of two Riemann integrations.
————————————————————————————————–
Example:
x(s) = s
y(s) = s2 (1455)
x(s) = s
y(s) = s (1456)
245
The integral over contour C1 is evaluated as
Z Z 1
dz(s) 2 2
dz f (z) = ds (x − y ) + i2xy
C1 0 ds
Z 1
dx(s) dy(s)
= ds + i ( x2 − y 2 ) + i 2 x y
0 ds ds
Z 1
= ds 1 + i 2 s ( s2 − s4 ) + i 2 s3
0
Z 1
2 4 3 5
= ds (s − 5s ) + i(4s − 2s )
0
s=1
1 3 2 6
s − s5 ) + i ( s4 −
= ( s )
3 6
s=0
2 2
= − + i (1457)
3 3
————————————————————————————————–
Example:
246
complex z plane. The integral can be evaluated as
Z 2π
rn
Z
1
dz z n = dθ exp i ( n + 1 ) θ (1459)
2πi C 2π 0
where we have used the polar form for the complex number
z = r exp i θ (1460)
————————————————————————————————–
Both of these examples yield results which are independent of r, the radius
of the closed circular contour.
247
A contour integral of function, that satisfies the Cauchy Riemann conditions
at every point of a region of the complex plane which contains the function, is
independent of the path of integration. There is an analogy between integration
of analytic functions and conservative forces in Mechanics; the analytic func-
tion plays the role of a conservative force and the integral plays the role of the
potential. Alternatively, there is an analogy between integration of an analytic
function and functions of state in thermodynamics.
dF (z)
= f (z) (1465)
dz
The function F (z) is the primitive of the function f (z). This is the content of
Cauchy’s Integral Theorem.
Stokes’s Proof.
248
The integration consists of two line integrals. Each line integral can be thought
of as a integration of a scalar product of a vector with the vector displacement
around the loop in the (x, y) plane. That is, the integration can be considered
as an integration representing the work performed by a force.
in terms of an integral inside the area of the (x, y) enclosed by the loop or
I Z Z
∂v ∂u
u dx − v dy = − dx dy + (1472)
C ∂x ∂y
The imaginary part can be evaluated in the same way, but this time the
→
−
vector B in the loop integral
I
→
−
d→
−r . B (1473)
C
is identified as
→
−
B = ê1 v + ê2 u (1474)
leading to
I Z Z
∂u ∂v
v dx + u dy = dx dy − (1475)
C ∂x ∂y
where the last integral is evaluated over the area enclosed by the loop.
Since the functions u and v satisfy the Cauchy conditions inside the region
enclosed by the loop, both integrands vanish. Hence, we have Cauchy’s integral
theorem I
dz f (z) = 0 (1476)
C
249
if f (z) is analytic at all points enclosed by the loop.
Then, on reversing the direction of the integration on one segment, the integrals
in the same direction are equal
Z B Z B
dz f (z) = dz f (z) (1478)
A C1 A C2
————————————————————————————————–
Example:
250
around a contour C circling the origin counterclockwise with | z | > 1.
Solution:
The Cauchy Riemann conditions are not satisfied at two point z = and
z = − 1 where the derivative is not defined. The function is analytic at
all points in the surrounding area. The region is multiply connected. Two
cut lines can be chosen linking the singularities to infinity, making the region
simply connected. We shall choose the line running along the positive imagi-
nary axis from the origin and the parallel line running from z = − 1 to infinity.
Cauchy’s theorem can be applied to the contour C and its completion run-
ning along oppositely directed segments on each side of the line cuts extending
to the singularities, and two small circles C1 and C2 of radius r circling the
singularities in a clockwise direction.
since the contour is in a singly connected region in which the function is ana-
lytic. The contributions from the anti-parallel line segments cancel in pairs, as
the function is single-valued on these lines.
z = − 1 + r exp[ i θ ] (1482)
The integral around the contour C2 running clockwise around the origin is
evaluated along the curve
z = r exp[ i θ ] (1484)
251
and θ runs from 0 to − 2 π. The integral is evaluated as
I Z −2π
1 1
dz = lim i dθ r exp[ i θ ]
C2 z ( z + 1 ) r→0 0 r exp[ i θ ] ( 1 + r exp[ i θ ] )
Z −2π
1
= lim i dθ
r→0 0 ( 1 + r exp[ i θ ] )
= −i2π (1485)
————————————————————————————————–
Example:
Solution:
The Cauchy Riemann conditions are not satisfied at two point z = and
z = − 1 where the derivative is not defined. The function is analytic at all
points in the surrounding area. The region is multiply connected. Two cuts
can be chosen linking the singularities to infinity, cutting the region so that it
is simply connected. We shall choose the cut running along the positive imagi-
nary axis from the origin to i ∞ and a parallel cut running from z = − 1 to i ∞.
252
Cauchy’s theorem can be applied to the contour C and its completion run-
ning along oppositely directed segments on each side of the cuts, and two small
circles C1 and C2 of radius r circling the singularities in a clockwise direction.
z = − 1 + r exp[ i θ ] (1491)
The integral around the contour C2 running clockwise around the origin is
evaluated along the curve
z = r exp[ i θ ] (1493)
and θ runs from 0 to − 2 π. The integral is evaluated as
I Z −2π
2z + 1 1 + 2 r exp[ i θ ]
dz = lim i dθ r exp[ θ ]
C2 z(z + 1) r→0 0 r exp[ i θ ] ( 1 + r exp[ i θ ] )
Z −2π
1 + 2 r exp[ i θ ]
= lim i dθ
r→0 0 ( 1 + r exp[ i θ ] )
= −i2π (1494)
253
or I
2z + 1
dz = 4πi (1496)
C z(z + 1)
————————————————————————————————–
The contour integration lies within the simply connected region of analyticity
and, thus, Cauchy’s theorem can be applied. The integral is evaluated as four
segments
I Z z+
f (z) f (z)
0 = dz + dz
C z − z0 z z − z0
I Z0 z0
f (z) f (z)
+ dz + dz (1499)
C” z − z0 z− z − z0
Since f (z) is single-valued, the integral along both sides of the line cancel. Hence
we have
I I
f (z) f (z)
0 = dz + dz (1500)
C z − z0 C” z − z0
254
On reversing the direction of the contour C” around z0 such that C” = − C 0 ,
then I I
f (z) f (z)
dz = dz (1501)
C z − z 0 C 0 z − z0
where both contours are traversed in the same sense of rotation. The contour
C 0 is evaluated over a path of radius around z0 , i.e.
z = z0 + r exp[ i θ ] (1502)
We shall assume that the contour C and hence C 0 both run in a counter clockwise
direction, so that θ runs from 0 to 2 π. Hence, in the limit r → 0 one has
I Z 2π
f (z) f ( z0 + r exp[ i θ ] )
dz = i dθ r exp[ i θ ]
C z − z 0 0 r exp[ iθ ]
Z 2π
= i dθ f (z0 )
0
= 2 π i f (z0 ) (1503)
————————————————————————————————–
Example:
Solution:
255
which can be verified by explicit integration.
Thus, Cauchy’s integral formula avoids any need to explicitly construct and
evaluate the integral.
————————————————————————————————–
13.4 Derivatives
Cauchy’s integral formula may be used to express the derivative of a function
f (z) at a point z0 in a region which is analytic. The point z0 is assumed to be
enclosed by a loop C contained inside the region of analyticity.
and z0 I
1 f (z)
dz = f (z0 ) (1509)
2πi C z − z0
So one has
" #
f (z0 + ∆z) − f (z0 )
I
1 f (z) f (z)
lim = lim dz −
∆z →0 ∆z ∆z → 0 2 π i ∆z C z − z0 − ∆z z − z0
I " #
1 f (z)
= lim dz
∆z → 0 2 π i C ( z − z0 − ∆z) ( z − z0 )
I " #
1 f (z)
= dz (1510)
2πi C ( z − z0 )2
where C encloses z0 , and f (z) is analytic at all points of the region enclosed by C.
256
The higher order derivatives can also be evaluated in the same way. This
procedure leads to
" #
dn f (z0 )
I
n! f (z)
= dz (1512)
dz0n 2πi C ( z − z0 )n+1
which is basically the n-th order derivative of the Cauchy integral formula with
respect to z0 .
Thus, Cauchy’s integral formula proves that if f (z) is analytic then all the
derivatives are also analytic.
Example:
z = z0 + r exp[ i θ ] (1514)
257
Example:
for every closed contour C in the region, then f (z) is analytic in the region.
This is proved by noting that since the integral on every closed contour is
zero, an integral on an open contour (in the region) can only depend upon the
end points Z zf
F (zf ) − F (zi ) = dz f (z) (1521)
zi
258
Furthermore, as Z zf
f (zi )
f (zi ) = dz (1522)
zf − zi zi
one has the identity
zf
F (zf ) − F (zi ) f (z) − f (zi )
Z
− f (zi ) = dz (1523)
zf − zi zi zf − zi
In the limit zf → zi the right-hand-side goes to zero as f (z) is continuous.
Hence, the left-hand-side is equal to zero. Thus,
F (zf ) − F (zi )
lim − f (zi ) = 0 (1524)
zf → zi zf − zi
and
F (zf ) − F (zi ) dF (z)
lim = = f (zi ) (1525)
zf → zi zf − zi dz zi
Since F (z) is analytic, then Cauchy’s integral formula for the derivatives shows
that all the derivatives of F (z) are analytic, in particular the first derivative
f (z) is also analytic.
Cauchy’s Inequality
since only the term proportional to z −1 gives a finite contribution to the integral.
Then I
1 f (z) 2πr
| Cn | = dz n+1 < M (1530)
2 π |z|=r z 2 π rn+1
259
which leads to the inequality
| Cn | r n < M (1531)
Liouville’s Theorem
Liouville’s theorem states that if f (z) is analytic and bounded in the com-
plex plane then f (z) is a constant.
Conversely, if the function deviates from a constant the function must have
a singularity somewhere in the complex plane. In the next section we shall con-
sider properties of more general functions of a complex variable z
260
14 Complex Functions
We shall first examine the properties of a function that is analytic throughout
a simply connected region.
f (z 0 )
I
1
f (z) = dz 0 0 (1534)
2πi C z − z
The contour C has to enclose z and we choose it such that it also encloses z0 .
Then on writing
f (z 0 )
I
1
f (z) = dz 0 0
(1535)
2πi C ( z − z0 ) − ( z − z0 )
where Cauchy’s integral formula for the n-th order derivative has been used. If
f (z) is analytic at z0 the Taylor expansion exists and is convergent for z suffi-
ciently close to z0 .
Schwartz Reflection
261
The Schwartz reflection principle can be proved by using the Taylor expan-
sion about some point x0 on the real axis. Then
X 1 dn f (x0 )
f (z) = ( z − x0 )n (1538)
n=0
n! dxn0
Since f (x) is real on the real axis, then so are all the derivatives. Hence,
X 1 ∗ n
∗ n d f (x0 )
f (z) = ( z − x0 )
n=0
n! dxn0
X 1 dn f (x0 )
= ( z ∗ − x0 )n
n=0
n! dxn0
= f (z0∗ ) (1539)
where the last line is obtained by identifying the Taylor expansion as f (z) with
the replacement z = z ∗ .
Thus we have a region in which the both Taylor series converge, and yield the
function. However, the second expansion defines the function at points which
lie outside the original circle of convergence. Thus, the function f (z) has been
262
expressed as an analytic function in a region that contains points not included
in the original circle. Thus, the analytic properties of the function has been
continued into a different region of the complex plane.
It should be noted that analytic continuation can take many different forms
and is not restricted to series expansion.
Example:
and the radius of convergence is determined by the distance to the nearest sin-
gularity, which is at zs = − 1. The radius of convergence of the expansion
about z0 = 0 is unity. The Taylor expansion converges for all the points inside
the unit circle.
Let us first choose to expand about the new point z2 = + 1. The function
can be expanded about z2 = 1 by writing
1
f (z) = (1542)
2 + (z − 1)
The series is found as
∞
X 1
f (z) = ( − 1 )n ( z − 1 )n (1543)
n=0
2n+1
and has a radius of convergence determined by the distance to the closest sin-
gularity which is at zs = −1, so the radius of convergence is 2. Thus, the
analytic properties of the function been extended to a circle of radius 2. The
new circle of convergence includes the point z = 3. We can now iterate the
process, an define the function over the whole positive real axis.
263
Alternatively, one may wish to examine the function on the negative real
axis, or some other region. In this case, one might choose another point inside
the original circle of convergence, say z2 = i. The expansion about this point
is expanded about z2 = 1 via
1
f (z) = (1544)
1 + i + (z − i)
as
∞
X 1
f (z) = ( − 1 )n ( z − i )n (1545)
n=0
( 1 + i )n+1
which has a radius of convergence determined by the distance to the closest
singularity which is at zs = −1. The series converges within the region
√
| z − i | < | 1 + i | = | zs − i | = 2 (1546)
√
so the radius of convergence is 2. By choosing a suitable series of overlapping
circles, one can analytically continue the function to the negative real axis, but
the circles always have to exclude the point z = − 1.
It should be noted that a function f (z) may have more than one singularity,
in which case the circle of convergence for an expansion about some point is
limited by the closest singularity.
Example:
A function that can not be analytic continued is given by the Taylor expan-
sion about z = 0, of the function
∞
X n
f (z) = 1 + z2
n=0
= 1 + z 2 + z 4 + z 8 + z 16 + . . . (1547)
f (z) = z 2 + f (z 2 ) (1548)
f (z) = z 2 + f (z 2 ) (1550)
264
and the term f (z 2 ) on the right-hand-side is singular when z 2 = 1, one finds
that f (z) is singular at all the roots of z 2 = 1. This process can be iterated to
m
show that f (z) is singular at all the roots of z 2 = 1 for each integer value of
m. There are 2m such singularities for each integer value of m and the singulari-
ties are uniformly distributed on the unit circle. The singularities corresponding
to m → ∞ are separated by infinitesimally small distances. Hence, the ra-
dius of convergence for z values on the unit circle is infinitesimally small. Thus,
it is impossible to analytically continue the function f (z) outside the unit circle.
More generally, the function may be analytic inside a ring shaped area of
inner radius r and outer radius R, where R > r. The the area is not simply
connected. It is possible to introduce a line which cuts the plane making it into
a simply connected region. A particular line can be chosen which runs from
a point on the inner perimeter to a point on the outer perimeter. Since the
area is now simply connected one can apply Cauchy’s integral formula to any
contour in the annular region which does not cross the cut we have chosen. In
particular let us choose a contour that consists of two circular segments which
are concentric, the center of the circles is denoted as the point z0 . The radius
of the circles r1 and r2 are such that R > r1 > r2 > r. The contour is
completed by anti-parallel paths running along opposite sides of the cut.
For any point z inside the singly connected region where f (z) is analytic one
has
f (z 0 ) f (z 0 )
I I
1 0 1
f (z) = dz 0 + dz 0 (1552)
2πi C1 z − z 2πi C2 z0 − z
where the first circular contour is traversed in the counter clockwise direction
and the second contour is traversed in the counterclockwise direction. The con-
tribution from the oppositely directed paths along a segment of the cut has
cancelled, since our function is analytic and single-valued in the entire annular
region. That is the function f (z) has the same value at points immediately
265
adjacent to the cut.
| z 0 − z0 | = r1 > | z − z0 | (1554)
| z − z0 | > | z 0 − z0 | = r2 (1555)
Hence, we have
∞ I
1 X ( z − z0 )n
f (z) = dz 0 f (z 0 )
2 π i n=0 C1 ( z 0 − z0 )n+1
∞ I
1 X ( z 0 − z0 )n
− dz 0 f (z 0 )
2 π i n=0 C2 ( z − z0 )n+1
∞ ∞
X X Bn
= An ( z − z0 )n + (1558)
n= n=0
( z − z0 )n+1
where
f (z 0 )
I
1
An = dz 0 (1559)
2πi C1 ( z0 − z0 )n+1
266
and the contour C2 runs clockwise. Thus, we have the Laurent expansion of the
function
X∞
f (z) = An ( z − z0 )n (1562)
n=−∞
Example:
The function
1
f (z) = (1563)
z(z − 1)
has a Laurent series expansion about z = 0. The simplest method of obtaining
the expansion consists of expressing the function in partial fractions and then
expanding about z = 0
1 1
f (z) = −
z − 1 z
∞
1 X
= − − zn
z n=0
(1564)
Hence, the non zero coefficients are A−1 and An for n ≥ 0 and are all unity.
A−1 = A0 = A1 = . . . = 1 (1565)
The Laurent series differs from the Taylor series due to the occurrence of nega-
tive powers of ( z − z0 ), which makes the series diverge at z = z0 .
Example:
267
As a primitive example, consider the function
1
f (z) = z m (1567)
for integer m. What this equation actually means is that f (z) is described by
the solutions of m
f (z) = z (1568)
The solutions are found by expressing the complex number z in polar form
z = rz exp i θz (1569)
rfm = rz (1571)
m θ f = θz + 2 π n (1573)
268
are continuous in z. Each branch can be considered to define a single-valued
function, as long as two branches do not give the same value at one point. How-
ever, we see that if z traverses a contour wrapping around the origin exactly
once, then θz increases by exactly 2 π, and one branch merges with the next as
the phase of f (z) has increased by 2mπ . In this case, the origin is a branch point
of the function.
If the contour where chosen to cross the branch cut, the function would be
identified with the value on the next branch etc. In this case, one could de-
fine the complex plane to consist of several copies, (in which the phase of z is
advanced by 2 π between otherwise identical points in successive copies). The
function in this enlarged complex plane is single-valued. This manner of ex-
tending the complex plane to different phases, is akin to cutting identical sheets
representing the complex plane, and joining the edge of the cut adjoining the
lower-half complex plane of one sheet to the edge of the cut adjoining the upper-
half complex plane of the next sheet. Thus, one has a spiral of continuously
1
connected copies of the complex plane. The function z m is single-valued on this
spiral sheet, and also repeats after m sheets have been crossed. The m-th sheet
is identified with the zeroth sheet, and this construct is the Riemann surface of
1
z m . The Riemann surface consists of m sheets.
269
!
1 m 2π
= z m r exp i − 1
m + 1 m
(1575)
This integral is non zero if m 6= 1 and is zero when m = 1 for which the func-
tion is single-valued. The multi-valued nature of functions can make Cauchy’s
theorem inapplicable.
Example:
f (z) = ln z (1576)
or in polar coordinates
f (z) = ln r + i θ (1577)
Thus, ln z is infinitely multi-valued. Furthermore, it is the multi-valued nature
of ln z that makes the integral
I
dz
= ln z = 2 π i (1578)
z
when integrating around a contour circling about the origin once.
Example:
The function
1
f (z) = ( z 2 − 1 ) 2 (1579)
can be factorized as
1 1
f (z) = ( z − 1 ) 2 ( z + 1 ) 2 (1580)
The first factor has a branch point at z = 1 and the second factor has a branch
point at z = − 1. The branch cut has to connect the two branch points. For
convenience one can represent
z − 1 = r1 exp i ϕ1 (1581)
and
z + 1 = r2 exp i ϕ2 (1582)
270
The introduction of the branch cut makes the function single-valued. Con-
sider a contour starting at + 1 running down to − 1 in the upper-half complex
plane and returning to + 1 below in the lower-half complex plane just below
the branch cut.
As the contour rotates around the branch point + 1 the phase ϕ1 changes
from 0 to π while ϕ2 = 0. As the contour rounds the branch point − 1, the
phase ϕ2 changes by 2 π. Secondly, as the contour returns to the point + 1 and
circles around it back to the start ϕ1 changes from π to 2 π. In this loop the
total phase changes by 2 π, hence, the function is single-valued.
14.5 Singularities
An point z0 where a function is not analytic or single valued, but is analytical
at all neighboring points is known as an isolated singularity. In this case one
can draw a contour around the singularity on which the function only has no
other singularity within it. The Laurent expansion yields a series which exhibits
isolated singularities.
Poles
Let the highest negative order for which the coefficient is non zero be − n. That
is a−n 6= 0, and the higher order terms a−m are zero for − n > − m. In this
271
case one has a pole of order n. In this case, one has
an an−1
f (z) = n
+ + . . . +a0 + a1 ( z − z0 ) + . . . (1587)
( z − z0 ) ( z − z0 )n−1
A pole of order one is called a simple pole. It should be noted that the function
is single valued on a contour going round a pole, as it does return to its original
value after it has completed the contour, and f (z) varies continuously on the
contour.
————————————————————————————————–
Example:
The function
sin z
f (z) = (1588)
z6
has a pole of order 5 as
∞
X z 2n+1
sin z = ( − 1 )n (1589)
n=0
( 2 n + 1 )!
so
∞
sin z X
n z 2n−5
= ( − 1 ) (1590)
z6 n=0
( 2 n + 1 )!
which leads to a pole of order 5.
Essential Singularities
————————————————————————————————–
Example:
The function
1
f (z) = exp (1591)
z
has an essential singularity at z = 0. The function has the expansion
X 1 1 n
f (z) = (1592)
n=0
n! z
272
which contains terms of arbitrarily high negative orders. The pathological be-
havior of this function is seen by examining how the function varies as z tends
to zero. If z approaches zero from the positive axis then f (z) tends to infinity,
on the other hand if z tends to zero along the negative real axis, then f (z) tends
to zero. In fact by choosing an arbitrary path to the point z = 0 one can have
limz→0 f (z) = b for any complex b. For example, on inverting the equation
one can find solutions
1
z = (1593)
2 π i n + ln b
where n are integers, as n approaches to infinity z approaches zero, while the
function remains equal to b.
Branch Points
A point z0 is a branch point of a function if, when z moves around the point
z0 on a contour of small but non-zero radius, the function does not return to
its original value on completing the contour. It is assumed that f (z) varies
continuously on the contour.
————————————————————————————————–
Example:
The function p
f (z) = z 2 − a2 (1594)
has branch points at z = ± a. The function does not return to the same value
if the contour warps around one pole once, say z = a, but does return to the
same value if the contour encloses both branch points.
Singularities at Infinity
————————————————————————————————–
273
Example:
The function
f (z) = sin z (1597)
has an essential singularity at infinity since
1
g(ξ) = sin
ξ
∞
X ( − 1 )2n+1 1
= (2n+1)
(1598)
n=0
( 2 n + 1 )! ξ
274
15 Calculus of Residues
15.1 Residue Theorem
If a function f (z) which has a Laurent expansion
n=∞
X
f (z) = An ( z − z0 )n (1599)
n=−∞
The residue theorem states that the integral of a function around a contour
C that only encloses a set of isolated singularities, then the integral
I
1 X
dz f (z) = R(zn ) (1601)
2πi C n
————————————————————————————————–
Example:
275
Evaluate the integral Z ∞
Γ
dx (1603)
−∞ x2 + Γ2
Solution:
1
which shows the poles at x = ± i Γ with residues ∓ 2 i.
then
Γ
lim | f (z) | ∼ → 0 (1606)
R → ∞ R2
Hence, the contour at infinity is given by
Z π
Γ
R dθ | f (z) | ∼ π (1607)
0 R
One completing the contour in the upper-half complex plane one encloses
the pole at z = i Γ in a counter clockwise sense and obtains the final result
Z ∞
Γ 2πi
dx 2 2
= 1 = π (1608)
−∞ x + Γ 2i
If the contour had been evaluated in the lower-half plane, the contour would
enclose the pole at z = − i Γ in a clockwise sense, since the residue at this
pole is negative, and the clockwise contour produces another negative sign, we
obtain the same result.
276
15.2 Jordan’s Lemma
Consider the integral on an open curve CR consisting of the semi-circle of radius
R in the upper-half complex plane. The semi-circle is centered on the origin.
Let the function f (z) be a function that tends to zero as | M
z | as | z | → ∞,
then I
1
lim dz exp i k z f (z) → 0 (1609)
R → ∞ 2 π i CR
Proof
277
15.3 Cauchy’s Principal Value
The Principal Value of an integral of a function f (x) which has only one simple
pole at x0 on the real axis, is defined as
Z +∞ Z x0 −ε Z ∞
Pr dx f (x) = lim dx f (x) + dx f (x) (1612)
−∞ ε→0 −∞ x0 +ε
For example, if the function only has simple poles in the upper-half complex
plane and can be evaluated on the semi-circle CR at infinity in the upper-half
complex plane. The principal value integral can be evaluated by integrating over
a contour that runs from − ∞ to ( x0 − ε ) then follows a small semi-circle of
radius ε around the pole at x0 , joins the real axis at ( x0 + ε ) then follows
the real axis to + ∞ and then completes the contour by following a semi-circle
of infinite radius (R → ∞) in the upper-half complex plane.
Then the principal value integral and the integral over the small semi-circle
of radius ε around x0 is equal to 2 π i times the sum of the residues enclosed
by the contour.
For example, if f (z) is analytic in the upper-half complex plane on the semi-
circle at infinity in the upper-half complex plane we have
Z +∞ I
f (x) f (z)
Pr dx + dz = i π f (x0 ) (1613)
−∞ x − x 0 CR z − x0
It does not matter if the semi-circle of radius ε is taken to be in the upper-half
complex plane or the lower-half complex plane.
If the small semi-circle is chosen in the lower-half complex plane the inte-
gral runs counter clockwise and runs half way around the pole, and contributes
π i f (x0 ) to the left-hand-side, but the right-hand-side has the contribution of
2 π f (x0 ) as the pole at x0 is enclosed by the contour.
On the other hand if the contour is closed in the upper-half complex plane,
the semi-circle is in the clockwise direction and contributes − π i f (x0 ) to the
left-hand-side and the right-hand-side is zero, as the pole at x0 is not enclosed
by the contour.
278
————————————————————————————————–
Example:
————————————————————————————————–
This is equivalent to the principal value integral and the small semi-circle,
or Z +∞
f (z)
Pr dx − i π f (x0 ) (1620)
−∞ z − x0
279
Hence, we have
Z +∞ Z +∞
f (z) f (z)
dz = Pr dz − i π f (x0 ) (1621)
−∞ z − x0 + i ε −∞ z − x0
which yields the identity
1 1
lim = Pr − i π δ( z − x0 ) (1622)
ε → 0 z − x0 + i ε z − x0
The truth of this result can also be seen by writing
1 z − x0 ε
= − i (1623)
z − x0 + i ε ( z − x0 ) 2 + ε2 ( z − x0 )2 + ε2
in which the first term on the right not only does not diverge at z = x0 but
is zero. This term cuts off at a distance ε from the pole, and thus corresponds
to the Principal value. The second term on the right is a Lorentzian of width ε
centered at x0 . As the integrated weight is π, this represents the delta function
term in the limit ε → 0.
————————————————————————————————–
Example:
Solution:
280
Hence, for k > 0 one has
Θ(k) = 1 (1627)
On the other hand, when k < 0, the integral must be closed in the lower-half
complex plane. The contour does not enclose the pole and Jordan’s Lemma
shows the contour at infinity yields a vanishing contribution. Hence for k < 0
one has
Θ(k) = 0 (1628)
Thus, Θ(k) is the Heaviside step function.
————————————————————————————————–
Example:
Solution:
First we write
Z ∞ exp itx
dx (1630)
−∞ (x − k)(x + k)
The integrand has two poles, at x = ± k. The integral can be evaluated as a
principal value integral and the contribution on two small semicircles of radius
ε about x = ± k.
For t > 0 the integral can be evaluated by completing the contour in the
upper-half complex plane, in which case no poles are enclosed. The contour at
infinity yields zero by Jordan’s Lemma. Hence, one has
Z ∞ exp i t x exp − i t k exp + i t k
Pr dx − π i − π i = 0
−∞ x2 − k 2 −2k 2k
(1631)
Hence, for t > 0, we have
Z ∞ exp itx
π sin t k
Pr dx = − (1632)
−∞ x2 − k2 k
281
For t < 0 the integral can be evaluated by completing the contour in the
lower-half complex plane, in which case no poles are enclosed. The contour at
infinity yields zero by Jordan’s Lemma. Hence, one has
Z ∞ exp i t x exp − i t k exp + i t k
Pr dx + π i + π i = 0
−∞ x2 − k 2 −2k 2k
(1633)
Example:
Solution:
282
one can write the integral as a contour integral around the unit circle in the
complex plane
I
dz 1
I = −i b
C z a + 2 ( z + z −1 )
I
2
= −i dz 2 + 2 a z + b
C b z
I
2i 1
= − dz 2 a (1638)
b C z + 2 b z + 1
lies inside the unit circle. As can be seen directly from the coefficients of the
quadratic equation, the product of the solutions is equal to unity. Thus, the
second solution has to lie outside the unit circle. In the case a > b, one has
I
2i 1
I = − dz a
p a
( b ) − 1 ) ( z + ab −
p a
b C (z + b + 2 ( b )2 − 1 )
2i 2πi
= − p a
b 2 ( b )2 − 1
2π
= √ (1641)
a − b2
2
————————————————————————————————–
Example:
by contour integration.
283
Solution:
one has
2n
z + z −1
I
dz
I = −i
C z 2
C(2n, n)
= 2π
22n
π (2n)!
= (1644)
22n−1 n! n!
where we have used the binomial expansion and noted the integral only picks
up the term which gives rise to the pole at z = 0. The binomial coefficient of
the term z 0 in the expansion is C(2n, n), which leads to the final result.
————————————————————————————————–
Example:
Solution:
so the integral can be written as a contour integral over the unit circle
I
dz 1
I =
i z a + b ( z − z−1 )
2 i
I
2 1
= dz 2 (1647)
b z + 2 i ab z − 1
The denominator has poles at
s
2
a a
z = −i ± i − 1 (1648)
b b
284
The pole at s
2
a a
z = −i + i − 1 (1649)
b b
is inside the unit circle. The integral can be written as
I
2 1
I = dz s s
b 2 2
z + i ab − i a
b − 1 z + i a
b + i a
b − 1
I " #
1 1 1
= √ dz s − s
i a2 − b2 2 2
a a a a
z + i b − i b − 1 z + i b + i b − 1
2π
= √ (1650)
b2 − a2
The last line of the integral is evaluated by noting that only the pole of the first
term is inside the unit circle.
————————————————————————————————–
Example:
Solution:
one can write the integral as a contour integral around the unit circle in the
complex plane
I
dz 1
I = −i 2
z
C
a + 2b ( z + z −1 )
I
4z
= −i dz 2
C
b z2 + 2 a z + b
285
I
4i z
= − 2 dz 2 (1653)
b C a
z2 + 2 b z + 1
lies inside the unit circle. The other pole is located at the inverse of the position
of the above pole, and therefore lies outside the unit circle. In this case, one has
I
4i z
I = − 2 dz a
p a
( b ) − 1 ) ( z + ab −
p a
b C (z + b + 2 2 ( b )2 − 1 )2
!
8π d z
=
b2 dz ( z + ab + ( ab )2 − 1 )2
p
zα
!
a
p a
+ ( ) 2 − 1 − z
8π b b
=
( z + ab +
p a
b2 ( b )2 − 1 )3
zα
2πa
= 3 (1656)
( a2 − b2 ) 2
where Cauchy’s integral formula for derivatives has been used to evaluate the
contribution from the pole of order 2.
————————————————————————————————–
Example:
The function
2 sin2 ω T
(1657)
π ω2 T
occurs in the derivation of the Fermi-Golden transition rate, in second order
time-dependent perturbation theory. In the limit T → ∞ this becomes a
Dirac delta function expressing energy conservation. Evaluate the integral
Z ∞
sin2 ω T
dω (1658)
−∞ ω2 T
286
Solution:
where we have Taylor expanded the function about r = 0, and the contour
does not enclose the pole. Hence, we have
Z ∞
1 − exp[ + i ω T ] π
dω 2 T
= i (1661)
−∞ 4 ω 4
Likewise, the other term can be evaluated by closing the contour in the lower-
half complex plane
Z ∞
1 − exp[ − i ω T ] π
dω 2 T
= i (1662)
−∞ 4 ω 4
On adding these two contributions one has the final result
Z ∞
sin2 ω T π
dω 2
= (1663)
−∞ ω T 2
which shows that the function has weight unity. Furthermore, the weight is
distributed in a frequency interval of width given by T1 , so for T → ∞ one
obtains the energy conserving delta function
2 sin2 ω T
lim = δ( ω ) (1664)
T →∞ π ω2 T
————————————————————————————————–
287
Example:
Solution;
As the function is analytic at x = 0 the integration along the real axis can
be deformed slightly near the origin in a semi-circle of radius ε in the lower-half
complex plane. This contour does not contribute to the integral in the limit
ε → 0. The integrand can be expressed as
exp[ + 3 i x ] − 3 exp[ + i x ] + 3 exp[ − i x ] − exp[ − 3 i x ]
(1666)
8 i 3 x3
and the contribution to the small semi-circle from the terms with the positive
phases cancel to order ε with those with the negative phases.
The contour can be closed in the upper-half complex plane for the terms with
the positive phases. The contour encloses the pole of order three at x = 0,
while the contour for the term with the negative phases must be closed in the
lower-half complex plane. This contour excludes the pole at x = 0. The end
result is that the integration is determined by the residue at x = 0 which is
evaluated from Cauchy’s formulae for derivatives as
6
(1667)
16 i
Hence, the integral is evaluated as
Z ∞
sin3 x 3π
dx 3
= (1668)
−∞ x 4
————————————————————————————————–
Example:
288
Solution:
The integral is convergent as when x → ∞ the fact that a < 1 makes the
integrand tends to zero. Also, when x → − ∞ the integrand also tends to zero
as a > 0.
The integration is evaluated on a contour which runs along the real axis
between (−R, R) then up to R + 2 π i and then runs anti-parallel to the real
axis from R + 2 π i to − R − 2 π i and then back down to − R. This particular
contour is chosen due to the fact that the integration on the upper line has
1 1
= (1670)
exp[ x + i 2 π ] − 1 exp[ x ] − 1
The two vertical segments vanish as R → ∞ as the integrand vanishes as
exp[ − a R ] or exp[ − ( 1 − a ) R ]. Thus, the contour integral is given by
I Z ∞ Z ∞
exp[ a z ] exp[ a x ] exp[ a x + i 2 π a ]
dz = dx − dx
exp[ z ] + 1 −∞ exp[ x ] + 1 −∞ exp[ x ] + 1
Z ∞
exp[ a x ]
= ( 1 − exp[ i 2 π a ] ) dx (1671)
−∞ exp[ x ] + 1
————————————————————————————————–
Example:
289
for x > − 1. Show that
Z ∞
πx
dt tx sin t = Γ(x + 1) cos
0 2
Z ∞
πx
dt tx cos t = − Γ(x + 1) sin (1676)
0 2
Solution:
Consider the contour integral defined by
I
t
dz z exp − z (1677)
C
where C starts at the origin and runs to infinity, then follows a segment of a
circular path until the positive imaginary axis is reached. The contour is closed
by the segment running down the positive imaginary axis back to the origin.
in the integrand, the integral over the quarter circle of radius R → ∞ vanishes.
Thus,
Z ∞ Z ∞
dx xt exp − x = i dy it y t exp − i y (1680)
0 0
On multiplying by a factor of
πt
exp − i (1682)
2
one obtains
Z ∞
πt
exp − i Γ(t + 1) = i dy y t exp − i y (1683)
2 0
290
and, on taking the real and imaginary parts, one obtains the results
Z ∞
πx
dt tx sin t = Γ(x + 1) cos
2
Z 0∞
πx
dt tx cos t = − Γ(x + 1) sin (1684)
0 2
as was to be shown.
————————————————————————————————–
Example:
for α < 1.
Solution:
The function has a branch point at x = 0. A branch cut can be drawn along
the positive real axis. A contour can be drawn which performs a circle of radius
R at infinity and a counter clockwise contour at radius r and two anti-parallel
segments on opposite sides of the real axis.
The contribution from the circular contour at infinity vanishes as the inte-
grand vanishes as Rα−2 . The contribution from the clockwise contour around
the origin vanishes as rα+1 as r → 0. The contribution from the two anti-
parallel segments do not cancel as the function just below the real axis differs
from the function just above the real axis by the phase exp[ i 2 π α ]. Hence,
the integral around the contour reduces to
Z ∞
zα xα
I
dz 2 = 1 − exp i 2 π α dx 2 (1686)
C z + 1 0 x + 1
291
Hence, one has
∞ !
xα
Z
πα3πα
1 − exp i 2 π α dx 2 = π − exp i
exp i
0 x + 1 2 2
(1688)
and on factoring out exp[ i π α ] from both sides, one has the result
Z ∞ !
xα sin π2α
dx 2 = π
0 x + 1 sin π α
!
π
= (1689)
2 cos π2α
————————————————————————————————–
Example:
Prove that ∞
xα π(1 − α)
Z
dx = (1690)
0 ( 1 + x2 ) 2 4 cos π2α
————————————————————————————————–
Example:
Solution:
292
where we have used the polar representation of the complex number z =
ρ exp[ i θ ].
————————————————————————————————–
Example:
Prove that
∞
ln2 x π2
Z
dx = (1696)
0 1 + x2 8
————————————————————————————————–
Example:
Show that ∞
( nπ )
Z
1
dx = (1697)
0 1 + xn sin ( nπ )
by integrating over a contour composed of the real axis an arc at infinity of
length R ( 2nπ ), and a straight line back to the origin.
Solution:
The segment of the contour at infinity vanishes. The two straight line seg-
ments are evaluated as
Z ∞ Z ∞
1 2π 1
dx − exp i ds (1698)
0 1 + xn n 0 1 + sn
293
where
2π
z = s exp i (1699)
n
along the second segment. The contour integral is equal to the residue of the
pole at
π
z = exp i (1700)
n
which is enclosed by the contour. The residue has the value
1 1 π
R = = − exp i (1701)
n z n−1 n n
Thus, we have
Z ∞
2π 1 2πi π
1 − exp i dx = − exp i (1702)
n 0 1 + xn n n
which on dividing by the prefactor can be re-written as
Z ∞
( nπ )
1
dx = (1703)
0 1 + xn sin ( nπ )
as was to be shown.
has simple poles at z = n for positive and negative integer values of n, with
residues
1
Rn = (1706)
πi
The summation can be written as a contour integration, where the contour
encircles the poles and avoids the singularities of f (z). Thus, the summation
can be expressed as
∞ I
X 1 1
f (n) = dz f (z) (1707)
n=0
2 i C tan πz
294
and the contour can be deformed as convenient. If the contour at infinity van-
ishes, the integration can be evaluated along the imaginary axis.
————————————————————————————————–
Example:
Solution:
One can express the sum as a contour integrations around all the positive
poles of cot π z,
∞ I
X 1 1 1 1
m
= dz m
(1709)
n=0
n 2 i C tan π z z
The function f (z) has no poles except the simple pole at z = 0, so the contour
of integration can be deformed. The integrations can be deformed from C to
C 0 which is an integration along both sides of the positive real axis, running
from 1 to ∞. The small anti-parallel segments cancel, so the infinite number of
circles of C have been joined into the closed contour C 0 . The contour can then
be furthered deformed to an integral parallel to the imaginary axis
1
z = + iy (1710)
2
and a semi-circle at infinity. The contour at infinity vanishes as m ≥ 2. Hence
we have
∞ Z ∞
X 1 i 1
m
= dy tanh π y 1
n=0
n 2 −∞ ( 2 + i y )m
Z ∞ " #
i 1 1
= dy tanh π y −
2 0 ( 12 + i y )m ( 21 − i y )m
Z ∞ " #
1
= dy tanh π y Im 1 (1711)
0 ( 2 − i y )m
295
————————————————————————————————–
Example:
Evaluate
∞
X ( − 1 )n
(1713)
n=−∞
( a + n )2
where a is a non-integer number.
Solution:
in which the contour runs on both sides of the real axis, but does not enclose
the double pole at z = − a. The function csc π z has poles at z = π n and
has residues
1
( − 1 )n (1715)
π
The contour can be deformed to infinity, and an excursion that excludes the
double pole. The excursion circles the double pole in a clockwise direction. The
contribution from the contour at infinity vanishes. Thus, we find that the sum
is equal to a contribution from the clockwise contour around the double pole.
The residue at the double pole z = − a is
————————————————————————————————–
Example:
296
Solution:
Let
∞
X 2x
f (z) = (1719)
n=1
x2 + z 2 π 2
then
∞ I
X 2x 1 X
= dz cot π z f (z) − Res π cot π z f (z)
n=−∞
( x2 2 2
+ n π ) 2πi C
poles of f (z)
(1720)
where C is a closed contour enclosing the real axis and the poles of f (z). The
contour integral vanishes as the contour is deformed to infinity since | f (z) | →
0. Hence, as the poles of f (z) are located at
x
z = ±i (1721)
π
and the residues are coth x, one has
∞
X 2x
= 2 coth x (1722)
n=−∞
( x2 + z2 π2 )
or
∞
1 X 2x
coth x − = 2 + n2 π 2
(1723)
x n=1
x
————————————————————————————————–
297
15.6 Kramers-Kronig Relations
The Kramers-Kronig relation expresses causality. The response of a system can
be expressed in terms of a response function. The response of the system A(t)
only occurs after the system has been perturbed by an applied field B(t0 ) at an
earlier time t0 . The general relation between the response and applied field is
given by linear response theory and involves the convolution
Z ∞
A(t) = dt0 χ(t − t0 ) B(t0 ) (1727)
−∞
so that the integral on the semi-circle at infinity in the upper-half complex plane
is zero. The Cauchy integral formula is given by
I
1 χ(z)
χ(z0 ) = dz (1731)
2πi C z − z0
if z0 is in the upper-half complex plane. Since the semi-circle at infinity vanishes
Z ∞
1 χ(z)
χ(z0 ) = dz (1732)
2 π i −∞ z − z0
On varying z0 from the upper-half complex plane to a value ω0 on the real axis,
the integral becomes the sum of the Principal Value and a contour of radius ε
around the pole. Hence,
Z ∞
Pr χ(z) χ(ω0 )
χ(ω0 ) = dz + (1733)
2 π i −∞ z − ω0 2
298
or Z ∞
Pr χ(z)
χ(ω0 ) = dz (1734)
π i −∞ z − ω0
On taking the real and imaginary part of this equation one has the two equations
Z ∞
Pr Im χ(ω)
Real χ(ω0 ) = dω (1735)
π −∞ ω − ω0
and Z ∞
Pr Real χ(ω)
Im χ(ω0 ) = − dω (1736)
π −∞ ω − ω0
which are the Kramers-Kronig relations.
Symmetry Relations
299
and similarly for the imaginary part, one can show that
Z ∞
Pr Real χ(ω)
Im χ(ω0 ) = − dω
π −∞ ω − ω0
Z ∞
2 ω0 Real χ(ω)
= − dω (1744)
π 0 ω 2 − ω02
This representation of the Kramers-Kronig relation is useful as perhaps only ei-
ther the real or imaginary part of the response function may only be measurable
for positive ω.
300