1 Analysis of Complex Functions
1 Analysis of Complex Functions
1 Analysis of Complex Functions
Definition 1.1 The complex plane C is (a representation in R2 of ) the set {z : |z| <
∞}, and the extended complex plane is C ∪{∞}, where {∞} is the single point at infinity
corresponding to the origin under the transformation z → z −1 .
Definition 1.2 An argument of z is any one of the numbers θ + 2πm, where z = reiθ
and m is an integer. The principal value of arg(z) satisfies −π < arg(z) ≤ π
exists and is unique. We then say that a single-valued function f (z) is (complex) analytic,
regular or holomorphic in D if f 0 (z) exists (and is continuous) at each point in D. Finally,
f (z) is entire if it is holomorphic everywhere in C, or meromorphic if its only singularities
in C are poles (see later).
Corollary 1.1 If f is at least twice differentiable in D, then the real and imaginary
parts are both harmonic functions:
∂2u ∂2u
+ = 0
∂x2 ∂y 2
∂2v ∂2v
+ = 0. (3)
∂x2 ∂y 2
Thus, u and v cannot have maxima or minima in D and so any stationary points must
be saddle points. Hence, their biggest and smallest values are attained only on the
boundary ∂D of D.
As with functions of a real variable, we may consider power series expansions of
complex functions. For now we’ll look at holomorphic functions, but later we’ll extend
our discussion to include functions with singularities. As with real functions, if f (z) =
P∞ n P∞
n=0 an z , then the derivative of this function is given by f 0 (z) = n=1 nan z
n−1
. Each
P∞ n
power series has a radius of convergence, R, such that n=0 an z converges for |z| < R
and diverges for |z| > R.
We may use power series expansions to define complex versions of some of our
favourite real functions:
∞
X zn z2
exp(z) = = 1+z+ +···
n=0 n! 2!
z3 z5
sin(z) = z − + −···
3! 5!
z2 z4
cos(z) = 1 − + −··· . (4)
2! 4!
In all these cases the radius of convergence is R = ∞. Note that, with this definition of
exp(z), the usual result; exp(z1 + z2 ) = exp(z1 ) exp(z2 ) holds.
As an introduction to more complicated functions, consider trying to define the
complex logarithm, log(z). We define w to be a logarithm of z if
z = exp(w) (5)
where we have defined −π < arg(z) ≤ π. With arg(z) restricted to this region, each
choice for the integer k specifies a branch of the logarithm function. The principal value
of log(z) is obtained by setting k = 0.
Note that given some domain D (with 0 ∈
/ D), then once we specify the value of
log(z0 ) for some z0 ∈ D, then log(z) is uniquely defined on D. (This is equivalent to
specifying the branch).
However, it is impossible to define log(z) in this way in any domain which contains
a simple closed curve which encircles the origin, since log(0) is undefined.
For any holomorphic function, f (z), which is many valued (i.e. has several branches),
a point z0 which behaves in the same way as the origin for log(z) is referred to as a branch
point. In such a situation f (z0 ) may or may not be defined.
There exists a power series expansion for the logarithm, given by
∞ n
X
n+1 z z2 z3
log(1 + z) = (−1) =z− + −··· , (8)
n=1 n 2 3
with radius of convergence R = 1. In addition, all points on this circle also converge,
except for z = 1.
Another example of a function with branches is provided by the function w = z a ,
with a ∈ C. This can be seen by
w = z a = exp[a log(z)]
= exp[a(log(r) + iθ)] . (9)
Here we have set z = reiθ . It is the choice of range for θ, for example −π < θ ≤ π, that
specifies the branch. Note that the choice becomes unnecessary if a ∈ Z, because in this
case the choice of a different branch merely changes z a by e2πim = 1. This fits with our
elementary definition of z n .
Example 1.1
ii = exp[i log(i)]
π
= exp i i + i2πk
2
1
= exp − 2k + π . (10)
2
This takes infinitely many distinct real values.
PHY581 Methods of Theoretical Physics I, Fall 2005 4
z-plane w-plane
Example 1.2 w = z p/q , where p/q is a rational number in lowest terms. In this case
we obtain a function with q branches, having z = 0 as a branch point.
√
Consider the special case w = z. Write z = reiθ , 0 ≤ θ < 2π (note a different,
√
more convenient choice of range here). Then w = ρeiφ , with 0 ≤ φ < π, where ρ = + r
and φ = θ/2.
Geometrically, the whole complex plane with the positive real line removed is mapped
to the upper half plane (see below)
√
As we have defined f : w 2 = z, it defines a branch of z with z = 0 as a branch
point. We must have a cut along the positive real line because we cannot possibly have
continuity there since f (4) = 2, but f (4eiθ ) → −2 as θ → 2π from below.
Proof 1.3 Z Z Z
f (z) dz = (udx − vdy) + i (vdx + udy) (14)
C C C
Theorem 1.4 (Cauchy’s Integral Formula) Let f 0 (z) exist inside the open disc B(z0 , r).
Then for each point a with |a − z0 | < r, we have
Z
1 f (z)
f (a) = dz . (16)
2πi ∂B z−a
(here ∂B = {z | |z − z0 | = r}).
PHY581 Methods of Theoretical Physics I, Fall 2005 6
Proof 1.5
Z Z
f (z) f (z)
dz = dz
∂B z−a ∂B(a,ε) z − a
Z Z
f (z) − f (a) f (a)
= lim dz + dz . (17)
ε→ 0 ∂B(a,ε) z−a ∂B(a,ε) z − a
The first term on the right hand side can be made arbitrarily small because the integrand
is bounded and the length of the simple closed curve → 0. As for the second term;
substitute
γ(t) = a + εeit , t ∈ [0, 2π] (18)
as required.
This must seem like a lot of heavy-handed mathematical machinery, but I’d like to
present one more important result before we get down to some examples of how these
theorems are applied.
This power series, if it exists, must be unique, since we can use Cauchy’s integral formula
to show that
1 (n)
f (z0 ) .
cn = (22)
n!
Thus, we have formally shown the existence of a power series that we mentioned
earlier.
PHY581 Methods of Theoretical Physics I, Fall 2005 7
Definition 1.8 A function f (z) is meromorphic on the domain D if all its singularities
in D are isolated and non-essential.
Lemma 1.1 If f (z) is meromorphic, then locally f (z) is expressible as a quotient g(z)/h(z),
where g and h are holomorphic. Conversely, if g and h are holomorphic on a domain
D, and h 6≡ 0, then g(z)/h(z) is meromorphic (perhaps with removable singularities).
Note how the following definition parallels that of a pole and its order:
If no such finite value k exists (i.e. f (i) (z0 ) = 0 ∀ 0 ≤ i < ∞), we say that z0 is a
zero of infinite order. In this case, the assumption of holomorphy implies f (z0 ) ∀ z in
some neighborhood of z0 .
We’ll now generalize our power series discussion to functions with singularities. Recall
the expansion we derived earlier for holomorphic functions, and compare to the following.
If f (z) is holomorphic in the annulus R1 < |z − z0 | < R2 (note, f not required to be
holomorphic at z0 ) then we may express
∞
X
f (z) = an (z − z0 )n (25)
n=−∞
and this series is referred to as a Laurent series for f . This can be decomposed as
∞
X −1
X
f (z) = an (z − z0 )n + an (z − z0 )n (26)
n=0 n=−∞
where the first term is referred to as the subsidiary part and the second term as the
principal part. Both parts are unique, and, as earlier,
Z
f (z)
an = dz (27)
C (z − z0 )n+1
with C a simple closed curve containing z0 in the domain of f .
The Laurent series converges absolutely, and uniformly in any closed subset of the
annulus.
PHY581 Methods of Theoretical Physics I, Fall 2005 9
Definition 1.10 (Residue) The coefficient a−1 of (z − z0 )−1 in the Laurent expansion
is called the residue of f (z) at z = z0 .
The problem with this result is actually to calculate the residues Rj . Two hints:
1 dm−1
Rj = lim m−1 [z m f (z)] (30)
(m − 1)! z→0 dz
Now, the main point of many of the recent results is that they allow us to analyti-
cally evaluate many definite integrals which would seem almost impossible without the
techniques we’ll learn here. Although these methods will apply to complex integrals,
we’ll see that they provide an excellent method for the evaluation of real integrals.
In all the examples that follow, the procedure is the same. We have a definite real
integral to evaluate, and we do this by first making the integral complex and including
the range of integration (e.g. [−a, a], where a frequently tends to infinity), inside some
suitable simple closed curve C in C. With minor adaptation, we suppose that the complex
integrand has no singular points on C. We then apply the residue theorem and arrange
that all contributions to the integral, other than that on the real axis, are vanishingly
small.
Warning: Watch out for branching. Recall that this occurs, for example, when f (x)
contains a factor xα , where α is non-integral, or log(x).
We shall consider various different cases:
Method; write
! !
z + z −1 z − z −1
cos(θ) = , sin(θ) = (33)
2 2i
2. f (z) meromorphic with finite number of poles in the region =(z) > 0 and no poles
on the real axis. Assume that |z 2 f (z)| ≤ M whenever =(z) ≥ 0 and |z| > R, say.
for m > 0.
Therefore
Z Z π
imz iθ iθ iθ
f (z)e dz = f (Re ) exp(imRe )Re dθ
Γ
Z0 π
< ε e−mR sin θ R dθ
0
Z π/2
= 2Rε e−mR sin θ dθ
0
Z π/2
≤ 2Rε e−2mRθ/π dθ
0
πε
= (1 − e−mR )
m
πε
< , (38)
m
as required.
Note, that we can distinguish between real and imaginary parts, and thereby
replace eimz by cos(mz) or sin(mz).
4. We can adapt this general technique to allow for a finite number of singular points
on the real axis. Diagramatically, we replace our original contour by
Lemma 1.3 Suppose that f (z) has a simple pole at z = 0, and we integrate round
a circular arc of radius r between the angles θ1 and θ2 . Then
Z
f (z) dz = residue at 0 × i(θ2 − θ1 ) + εr , (39)
reiθ2 →reiθ2
where εr → 0, as r → 0.
Proof 1.12 Expand f (z) as (residue at 0)/z + (regular function), and use Jor-
dan’s lemma.
as follows:
Cut C along the non-negative real axis. Then z = reiθ is uniquely defined with
0 < r and 0 < θ < 2π. Write the logarithm over this domain as z → log(r) + iθ,
and define z α = eα log z .
With these conventions we have fixed the branches of log(z) and z α with which we
work in this problem.
Choose ε > 0 so small and R > 0 so large that the contour encloses all poles except
the one that may exist at z = 0.
β
R
α
δ r ε
γ
PHY581 Methods of Theoretical Physics I, Fall 2005 13
Then
Z Z R
lim z α f (z) dz = xα f (x) dx
ε→0 αε,r r
Z
= lim z α f (z) dz
ε→0 γε,r
Z R
= −e2πiα xα f (x) dx , (41)
r
because on γε,r we are about to cross over to another branch of the integrand.
Therefore,
Z Z ∞
lim lim z α f (z) dz = (1 − e2πiα ) xα f (x) dx
R→∞ ε→0 α∪β∪δ∪γ 0
X
= 2πi Resj (z α f (z)) . (42)
j
6. With log(z) rather than z α in the integrand, it may be more convenient to replace
the given contour by
7. For some integrands it may be useful to use a large rectangle, for example if a
trigonometric or hyperbolic function appears in the denominator.
−π+ in π+ in
−π π
As with all our examples we choose a contour and extend the integral around the
contour. Here consider I
dz
, (43)
Γ (z 2 + 1)2 (z 2 + 4)
where Γ is defined to be the contour consisting of that part of the real axis from −R to
R (R > 0), and the semicircle, radius R, center 0 in the upper half-plane.
iR
C
-R R
Then
I Z Z
dx +R dx dz
2 2 2
= 2 2 2
+
Γ (z + 1) (z + 4) −R (x + 1) (x + 4) C (z + 1)2 (z 2 + 4)
2
Now let R → ∞. Clearly the second term in the above goes to zero in this limit. We
now evaluate the left hand side using the calculus of residues.
The integrand has a pole of order 2 at z = i, and a pole of order 1 at z = 2i, which
lie within the contour Γ. Now;
" #
d 1
Res(z = i) = lim
z→i dz (z + i)(z + i)(z 2 + 4)
" #
−2 2z
= lim −
z→i (z + i)3 (z 2 + 4) (z + i)2 (z 2 + 4)2
−2 2i
= 3
−
3(2i) 9(2i)2
i
= − . (44)
36
Similarly
" #
1
Res(z = 2i) = lim
z→2i (z + 1)2 (z + 2i)
2
i
= − . (45)
36
Therefore
I
dz i i
2 2 2
= 2πi − −
Γ (z + 1) (z + 4) 36 36
π
= . (46)
9
PHY581 Methods of Theoretical Physics I, Fall 2005 15
So we obtain Z ∞ dx π
= . (47)
−∞ (x2 2 2
+ 1) (x + 4) 9
Finally, since the integrand is an even function of x, this implies
Z ∞ dx π
= . (48)
0 (x2 2 2
+ 1) (x + 4) 18
1.4.2 Example 2
Z ∞ cos(x) dx
, a>0
−∞ x2 + a2
This requires a little more cunning. Consider
I
eiz dz
Γ z 2 + a2
with Γ the same contour as in example 1. We then obtain:
I Z Z Z
eiz dz R cos(x) dx R i sin(x) dx eiz dz
= + +
Γ z 2 + a2 −R x2 + a2 −R x2 + a2 C z 2 + a2
Obviously, the final term goes to zero as R → ∞, thus
I Z Z
eiz dz ∞ cos(x) dx ∞ sin(x) dx
= +i
Γ z 2 + a2 −∞ x2 + a2 −∞ x2 + a2
The integrand on the left hand side has simple poles at z = ±ia. However, since
a > 0 we have chosen the above contour, and hence only require the pole at z = +ia
(enclosed by the contour). So;
" #
eiz
Res(z = ia) = lim
z→ia z + ia
e−a
= . (49)
2ia
Therefore Z Z !
∞ cos(x) dx ∞ sin(x) dx e−a π −a
+i = 2πi = e . (50)
−∞ x2 + a2 −∞ x2 + a2 2ia a
Finally, taking the real parts of both sides we obtain
Z ∞ cos(x) dx π
2 2
= e−a . (51)
−∞ x +a a
PHY581 Methods of Theoretical Physics I, Fall 2005 16
iR
C
C’
-R -r r R
1.4.3 Example 3
Z
x − sin(x) ∞
dx . (52)
−∞ x3
We’ll begin by integrating by parts twice, to put the integral into a form that is more
simple to handle by complex methods.
Z " #∞ Z
∞ x − sin(x) (x − sin(x)) 1 ∞ 1 − cos(x)
dx = − + dx
−∞ x3 2x2 −∞
2 −∞ x2
" #∞ Z
(1 − cos(x)) 1 ∞ sin(x)
= − + dx
2x −∞
2 −∞ x
Z
1 ∞ sin(x)
= dx , (53)
2 −∞ x
since both terms in the square brackets vanish.
Now to use our complex variable machinery. Consider
Z
eiz
dz , (54)
Γ z
where Γ is the contour shown
Now, this integral is zero, by Cauchy’s theorem. We may write it as
Z Z Z Z Z
eiz eiz −r eix eiz R eix
dz = dz + dx + dz + dx . (55)
Γ z C z −R x C0 z r x
Now, as R → ∞ the integral around the large semicircle, C becomes zero. Thus,
Z Z Z
−r eix ∞ eix eiz
dx + dx = − dz . (56)
−∞ x r x C0 z
Now, since (eiz − 1)/z has a removable singularity at the origin, we have
Z Z Z
−r eix ∞ eix 1
dx + dx = − dz
−∞ x r x C0 z
Z 0
1
= − iθ
(ireiθ ) dθ
π re
= πi . (57)
PHY581 Methods of Theoretical Physics I, Fall 2005 17
Letting r → 0, we have
Z
eix ∞
dx = πi . (58)
−∞ x
Taking imaginary parts we finally obtain
Z ∞ sin(x)
dx = π , (59)
−∞ x
so that by our initial integration by parts:
Z ∞ x − sin(x) π
3
dx = . (60)
−∞ x 2
Definition 2.2 If {φn (z)} is an asymptotic sequence, then the asymptotic expansion for
a function f (z) is
∞
X
f (z) ∼ ar φr (z) , (62)
r=0
provided that
n−1
X
f (z) − ar φr (z) = O(φn ) or o(φn−1) , (63)
r=0
as z → z0 . (i.e. the remainder after n terms is smaller than the last included term, or
the same order as the first neglected term)
P
Some important properties of asymptotic expansions are (Here consider f ∼ ar z −r
always.):
is valid for all arg(z) (i.e., doesn’t exhibit Stokes’ phenomenon), then the series is
in fact convergent; i.e.
∞
X
f (z) = ar z −r . (67)
r=0
P∞ n
Proof 2.2 f is single-valued, holomorphic for |z| ≥ a, therefore f (z) = −∞ cn z ,
with I
1 f (z)
cn = dz . (68)
2πi C z n+1
Choose C to be a large circle, radius R. Then
Z
1 2π
iθ 1
|cn | ≤ f (Re ) dθ . (69)
2π 0 Rn
P∞
Now, since f (z) ∼ 0 ar z −r , f → a0 as |z| → ∞. Therefore, we can find M such
that |f | < M for large enough |z|. This implies that
M
|cn | < , (n > 0) . (70)
Rn
But R can be as large as we like, so cn = 0 for n > 0. Also, an = c−n , since
asymptotic expansions are unique (see next property).
as z → ∞. Similarly !
n−1
X
−r
f− ar z z n → an , (72)
0
as z → ∞. Thus, the coefficients {an } are uniquely defined. (Note that the
converse does not hold).
Let’s now see how we might calculate asymptotic expansions for several different
classes of functions.
PHY581 Methods of Theoretical Physics I, Fall 2005 19
It is important to note here that the right hand side is merely the left hand side
expanded and integrated term by term. However, it is the fact that the result is an
asymptotic expansion that is nontrivial. This is because the summation need not con-
verge uniformly in t for all t in the range of integration. Thus, it is not clear that we
can interchange the order of integration and summation.
Under more restrictive circumstances we could just integrate by parts to show this.
However, Watson’s lemma works in more general situations, and a more subtle proof is
required. I won’t give the proof here, although if we have time I may come back and
supply it later.
as x → +∞ (x real).
The rough argument is that the largest contribution comes from the biggest value
of h(u), say h(u0 ), which is exponentially larger than any other contribution. We’ll see
how this works in 2 distinct situations. In both these, Watson’s lemma is crucial to
obtaining the final result.
where τ = u −u0 and we can extend the range of integration to (−∞, ∞) since any
extra contributions are negligible (the dominant contribution comes from τ = 0)
2. h0 (u0 ) 6= 0
In this case we have u0 = b (or a). Now a Taylor expansion about u0 yields
Z 0
xh(u0 ) 0
F (x) ∼ e g(u0) exτ h (u0 ) dτ
−∞
1
∼ exh(u0 ) g(u0) + O(x−2 ) . (78)
xh0 (u 0)
Let’s see immediately how this works by applying what we’ve just learned to an
example that is well-known (the result at least) to some of you.
Consider expanding Γ(x + 1) as x → ∞, for x real. If you know what the Γ-function
is, you’ll know that the answer we hope to get is known as Stirling’s formula, and is
very useful in all types of situations in physics. If you haven’t heard of the Γ-function,
then this will still be a good example of how to use Laplace’s method.
The Γ-function has an integral expression given by
Z ∞
Γ(x + 1) = e−t tx dt . (79)
0
Although this appears to already be in the correct form to apply Laplace’s method to,
we must transform it because the largest value of the exponential occurs at t = 0, where
tx vanishes. Therefore, we’ll write
Z ∞
Γ(x + 1) = e−t+x log(t) dt
0
Z ∞
x+1
= x ex(−u+log(u)) du , (80)
0
where we have made the change of variables t = xu not because it is essential, but
because it makes things neater because the position of the maximum stays at a fixed
point and doesn’t go to infinity as we take the asymptotic limit.
PHY581 Methods of Theoretical Physics I, Fall 2005 21
with s = u − 1. The ∼ here comes from Watson’s lemma. We could have expanded
to higher order but have chosen not to. We can extend the limit of integration since
any contribution from the range (−∞, −1) is subdominant. Thus, to leading order we
obtain
2π 1/2 x+1 −x
Γ(x + 1) ∼ x e , (82)
x
which you may recognize as the leading term in Stirling’s formula.
Proof 2.3 Assume w.l.o.g that q(t) is continuous on [a, b] so that for any given > 0,
the interval [a, b] can be divided into n − 1 subintervals in each of which q(t) varies by
less than 2. Then, ∃{tn } such that a = t0 < t1 < . . . < tn = b, with |q(t) − q(tk )| < ,
for t ∈ [tk−1 , tk ]. Also, q(t) is bounded in [a, b], so ∃Q such that |q(t)| < Q ∀ t ∈ [a, b].
Then n Z n Z
X ti X ti
I(x) = q(ti ) eixt dt + [q(t) − q(ti )]eixt dt . (84)
1 ti−1 1 ti−1
Now, Z
ti eixti − eixti−1 2
ixt
e dt = ≤ , (85)
ti−1 ix x
and Z
ti
ixt
ti−1
[q(t) − q(ti )]e dt
≤ (ti − ti−1 ) . (86)
Definition 3.1 Suppose ai (x), 0 ≤ i ≤ n, are defined and bounded on I (an 6= 0). Then
L : C n (I) → C 0 (I), f → Lf
n
X
Lf (x) = ai (x)(D i f )(x) (90)
i=0
Definition 3.2 Let L be a LDO of order n on I and let f (x) be n-times differentiable
on I. An equation
Ly = f (91)
which exists, since we assume that p(x) is bounded. Then, (yeP )0 = (y 0 + py)eP . Thus,
if y(x) satisfies the homogeneous equation, then (yeP )0 = 0, which implies
Now consider the non-homogeneous equation. Clearly (yeP )0 = reP . Therefore the
general solution is Z x
y(x) = Ce−P (x) + e−P (x) r(u)eP (u) du . (97)
has solution
1 1
Cex − 2
x+ 2
e−x
y(x) = . (99)
x2 + 1
PHY581 Methods of Theoretical Physics I, Fall 2005 24
Once again, here are two sub-problems to solving this equation; determining the Kernel
and the particular integrals. In general both are difficult.
Example 3.2
L[y] ≡ y 00 + 3y 0 + 2y = x2 ex . (101)
⇒ (c2 + 3c + 2)ecx = 0
⇒ c = −1 or − 2 . (102)
Therefore, independent solutions are y1 = e−x and y2 = e−2x , and the CF is a linear
combination of these.
The fastest way to find a PI is to guess one! Guess y = (ax2 + bx + d)ex . Then
routine algebra gives a = 1/6, b = −5/18, d = 19/108. The general solution is therefore
1
y(x) = c1 e−x + c2 e−2x + (18x2 − 30x + 19)ex . (103)
108
Example 3.3
L[y] ≡ y 00 + 2y 0 + y = cos(x) . (104)
Consider first the CFs. Try y = ecx . Yields c = −1 (repeated). So there are not
independent roots here. To find the other linearly independent CF, try y = u(x)e−x .
This then gives
L[y] = u00e−x = 0 , (105)
So u = ax+b. Therefore, two linearly independent solutions are y1 = e−x , and y2 = xe−x .
Now, guess a PI: y = c cos(x) + d sin(x). This gives c = 0 and d = 1/2. Therefore,
the general solution is
1
y(x) = (c1 x + c2 )e−x + sin(x) . (106)
2
PHY581 Methods of Theoretical Physics I, Fall 2005 25
Why did this trick work in the latter example? This is a particular case of reduction of
order: If one solution of a nth order LDE is known, the equation can be converted to
an order n − 1 one. Let’s verify this explicitly when n = 2.
for w.
y 00 + Ay 0 + By = 0 , (110)
as we have seen.
Second, consider
A 0 B
y 00 + y + 2y = 0 . (112)
x x
Now n1 and n2 are roots of n(n − 1) + An + B = 0. Then
(
C1 xn1 + C2 xn2 6 n2
n1 =
y(x) = , (113)
(C1 + C2 ln x)xn1 n1 = n2
In this section, wlog, we will use I = [0, ∞), and for appropriateness of notation, will
use t (time) instead of x as our variable.
The general problem is
If this solution exists it can be shown to be unique. By assumption we can solve the
homogeneous problem (with the same initial condition). We therefore consider the
standard problem
M[y(t)] = f(t) , y(0) = 0 , (115)
since a solution of the homogeneous problem, with the given boundary condition, added
to a solution of this equation is the general solution to the equation.
Let us write the standard problem as
Now, what does (117) mean? Clearly, if t 6= s we can assume that G is a smooth
function of t. In addition, assume that G and Gt are bounded as t → s. Now integrate
(117) from t = s − to s + , > 0 :
Z s+ Z s+
(Gtt + pGt + qG) dt = δ(t − s) dt = 1 . (120)
s− s−
Thus Z s+
s+
[Gt ]s− + (pGt + qG) dt = 1 . (121)
s−
Thus, Gt is not continuous, but has a jump of 1 at t = s. Let’s be a little more formal
about all this.
Definition 3.3 The Green’s function for the initial value problem posed earlier is a
function G(t, s) satisfying
Definition 3.4 If y1 (x) and y2 (x) are linearly independent solutions to a second order
LDE, then the wronskian is
Clearly the first two conditions are satisfied. We need to impose continuity at t = s and
a jump of 1 in Gt . These conditions read
c1 y1 (s) + c2 y2 (s) = 0
c1 ẏ1 (s) + c2 ẏ2 (s) = 1 . (125)
Example 3.4
ÿ + ω 2y = e−t , (128)
with t > 0, y(0) = ẏ(0) = 0. The Green’s function is sin[ω(t − s)]/ω (show this).
Therefore, the solution is
Z t
y(t) = G(t, s)e−s ds
0
Z
1 t
= e−s sin(ω(t − s)) ds
ω 0
( )
1 sin(ωt)
= 2
e−t + − cos(ωt) . (129)
1+ω ω
Example 3.5
t2 ÿ − (t2 + 2t)ẏ + (t + 2)y = f (t) , (130)
with y(0) = ẏ(t) = 0. We have
2 1 2 f
L[y] ≡ ÿ − 1 + ẏ + + 2 y = fˆ ≡ 2 . (131)
t t t t
By inspection, one solution of L[y] = 0 is y = t. Using reduction of order, the second
solution is tet . Set
(
0 t<s
G(t, s) = c1 t+c2 tet−s , (132)
s
t>s
Continuity at t = s implies c1 + c2 = 0. A jump of 1 in Gt implies c2 = 1/s = −c1 .
Therefore
Z
1 t−s t
y(t) = [te − t]fˆ(s) ds
0 s
Z t " t−s #
e −1
= t f (s) ds . (133)
0 s3
PHY581 Methods of Theoretical Physics I, Fall 2005 29
Now set I = [a, b]. Consider a n-th order system. The kernel has dimension n. In
a 2-point bvp, we impose m > 0 conditions at x = a, and n − m > 0 conditions at
x = b, to fix a complementary function. Such a problem may have 0, 1 or infinitely
many solutions.
Example 3.6 Consider y 00 + y = 0, for which the candidate functions are sin x and
cos x. Consider the following possibilities for boundary conditions (bcs).
2. y(0) = y(π) = 0. This has an infinite number of solutions; y(x) = λ sin x, for
arbitrary λ.
Example 3.7 Consider y 00 − y = 0 and consider the following possibilities for boundary
conditions (bcs).
Definition 3.5 Suppose the problem M[y] = 0 with boundary values at x = a and x = b
has no non-trivial solutions. Then a, b are conjugate points.
for example.
x ∈ [a, b], with bcs C1 (y, y 0, a), C2 (y, y 0, b). The Green’s function G(x, ξ) satisfies
PHY581 Methods of Theoretical Physics I, Fall 2005 30
I will state, but not prove, that if the bcs are homogeneous, and a, b are conjugate, then
G exists and is unique.
It can then be shown that the solution of the problem (135) is
Z b
y(x) = G(x, ξ)f (ξ) dξ . (136)
a
Example 3.8
y 00(x) + y(x) = f (x) , (137)
on [0, π], with y(0) = 0, y 0 (π) = 0. It is easy to see that the homogeneous equation has
solutions
4 Transform Calculus
4.1 The Fourier Transform
I’ll assume that you know something about Fourier series. Suppose g(x) is continuous
on −π to π, and that g(±π) = 0. Then
∞
X
g(y) = Cn einy , (141)
−∞
where Z
1 π
Cn = g(y)e−iny dy . (142)
2π −π
PHY581 Methods of Theoretical Physics I, Fall 2005 31
Now consider changing the interval to [−L/2, L/2]. Set y = ωx, ω = 2π/L, g(y) = f (x),
cn = LCn . Then we have
∞
1X
f (x) = cn eiωnx , (143)
L −∞
with Z L/2
cn = f (x)e−iωnx dx . (144)
−L/2
We now take the limit as L → ∞. The sum now samples points increasingly close
˜
together and in the limit becomes an integral. Denoting cn by f(k) we obtain the
relationships
Z
1 ∞ ˜
f (x) = f (k)eikx dk
2π
Z ∞
−∞
˜
f(k) = f (x)e−ikx dx . (146)
−∞
R∞
Definition 4.1 Suppose −∞ |f (x)|dx < ∞. Then f˜(k) defined by the above is the
Fourier Transform, and the expression for f (x) is the inversion formula.
Note that I will use these definitions consistently, however, physicists often switch the
√
signs in the exponents, and make this more symmetric by having a 1/ 2π in front of
each integral. These are just issues of convention.
Lemma 4.1 (Shifting Relations) Suppose f˜(k) exists. Let g(x) = f (x − x0 ), and
h(x) = eiλx f (x), with λ and x0 constant. Then
Lemma 4.2 Suppose f˜(k) exists. Let g(x) = f (ax), with a 6= 0 real. Then
!
1 ˜ k
g̃(k) = f . (148)
|a| a
Proof 4.2
Z ∞
g̃(k) = f (ax)e−ikx dx
−∞
Z ∞ dy
= sign(a) f (y)e−iky/a
!
−∞ a
1 ˜ k
= f . (149)
|a| a
Lemma 4.3 Suppose g(x) = f 0 (x), h̃(k) = df˜/dk = f˜0 (k). Then, assuming all the
integrals converge,
g̃(k) = ik f˜(k)
h(x) = −ixf (x) . (150)
Simple extensions of these results show a general trend that the faster f (x) falls off
as x → ±∞, the smoother f˜(k) is, and vice-versa.
An important point is that Fourier transforms can be used to solve differential equa-
tions with constant coefficients:
Because of linearity,
(ik)2 ỹ + (ikp)ỹ + qỹ = f˜ , (152)
and so Z
1 ∞ f˜(k)eikx
y(x) = dk . (153)
2π −∞ q + ikp − k 2
This is clearly related to the Green’s function approach, and we shall return to it later.
PHY581 Methods of Theoretical Physics I, Fall 2005 33
δ̃(k) = 1
1̃ = 2πδ(k) , (154)
are familiar. For these purposes, both these functions must be regarded as distributions.
Definition 4.2 Let F be a class of “good” functions on (−∞, ∞); for example, C ∞
with exponential decay at ±∞. Then, g(x) is a distribution with respect to F if hf, gi,
defined by Z ∞
hf, gi ≡ f (x)g(x) dx (155)
−∞
Note that a different definition of “good” would lead to a different class of distribu-
tions; but C ∞ is usually required because this implies that the derivative of a distribution
(defined below) is also a distribution.
With the definition above, the space of distributions (which is dual to the space of
test functions) has many of the nice properties of a space of functions (e.g. linearity).
Each distribution is defined by its action on test functions. For example, δ(x) is the
distribution defined by
hδ, f i = f (0) ∀f ∈ F . (156)
hg 0 , f i = −hg, f 0i ∀f ∈ F . (157)
˜
hg̃, f i = hg, fi ∀f ∈ F . (158)
It is straightforward to show that most of the properties of Fourier transforms hold also
for the Fourier transforms of distributions:
PHY581 Methods of Theoretical Physics I, Fall 2005 34
1. The Fourier transform of the delta function. By the above definition hδ̃, f i = hδ, f˜i.
The RHS is
Z ∞ Z ∞ Z ∞
˜ dk = f˜(0) =
δ(k)f(k) f (x) dx ≡ f (k) dk , (159)
−∞ −∞ −∞
Comparing these, which must be equal for all test functions f (x), gives the result
δ̃(k) = 1.
˜ The
2. The Fourier transform of a constant. By the above definition h1̃, f i = h1, fi.
RHS is Z Z
∞ ∞
˜ dk =
1f(k) f˜(k) dk = 2πf (0) , (161)
−∞ −∞
These must be equal for all test functions, so 1̃ = 2πδ(k). This result is consistent
with the Fourier inversion theorem, but the conditions of the theorem do not hold
here.
3. The Fourier transform of H(x) (the Heaviside function). A naive approach gives
the wrong answer. One could argue that since H 0 (x) = δ(x), and, for any f
f˜0 (k) = ik f˜(k), then since δ̃(k) = 1, it follows that ik H̃(k) = 1, which is correct.
However, it does not follow that H̃(k) = 1/ik because when distributions are
allowed, the full solution of the equation 1 = ik H̃(k) should be
1
H̃(k) = + Aδ(k) , (163)
ik
where A is a constant which is not determined by this method. Since H(x) +
H(−x) = 1, the real part of the Fourier transform of H must be the Fourier
transform of 1/2. Therefore A = π.
PHY581 Methods of Theoretical Physics I, Fall 2005 35
Unfortunately, there exists no simple formula relating ffg to f˜ and g̃. Instead, ffg need
not exist. There is however another kind of multiplication which is physically very
important and for which Fourier transforms are easy to evaluate.
R∞ R∞
Definition 4.3 Suppose −∞ |f |2 dx < ∞ and −∞ |g|2 dx < ∞. The convolution of
f (x) and g(x) is
Z ∞
(f ∗ g)(x) = f (y)g(x − y) dy
−∞
Z ∞
= g(u)f (x − u) du
−∞
= (g ∗ f )(x) . (164)
Note that (f ∗δ) = f for all f . Thus, δ is the identity for (∗) considered as multiplication.
Theorem 4.4 (The Convolution Theorem) Suppose f˜, g̃, and f ∗ g exist. Then
fg
∗ g exists and
∗ g = f˜g̃ .
fg (165)
Theorem 4.6 (Rayleigh (1899) - Plancherel (1910)) Suppose complex f (t) is such
R
that f˜(k), and ∞ |f |2 dx both exist. Then
−∞
Z ∞ 1 Z∞ ˜ 2
|f (x)|2 dx = |f (k)| dk . (167)
−∞ 2π −∞
Proof 4.7
Z
1 ∞ ˜ ¯˜
RHS = f (k)f(k) dk
2π −∞
Z ∞ Z ∞ Z ∞
1
= (f (x)e−ikx dx)(f¯(y)eiky dy) dk
2π −∞ −∞ −∞
Z Z Z
1 ∞ ∞ ∞ ¯ ik(y−x)
= f (x)f(y)e dkdxdy
2π −∞ −∞ −∞
Z ∞ Z ∞
= f (x)f¯(y)δ(y − x) dxdy
−∞ −∞
Z ∞
= f (x)f¯(x) dx
−∞
Z ∞
= |f (x)|2 dx . (168)
−∞
PHY581 Methods of Theoretical Physics I, Fall 2005 36
R∞
Theorem 4.8 (Parseval’s Theorem) If f , g, are real, and f˜, g̃ and −∞ f g dx all
exist, then Z ∞ 1 Z∞ ˜
f (x)g(x) dx = f (k)g̃(−k) dk . (169)
−∞ 2π −∞
Proof 4.10 Let x be fixed, and let g(t) = f (t + x). Then, taking Fourier transforms
with respect to t we have
Z Z
1 ∞ 1 ∞
f˜(k)eikx dk = g̃(k) dk
2π −∞ 2π −∞
1
= h1, g̃i
2π
1
= h1̃, gi
2π
1
= h2πδ, gi
2π
= g(0)
= f (x) . (171)
Note that if f is discontinuous at, say, x = x0 , then f˜(k) is continuous and the
inversion integral is also continuous. It can be rigorously shown that
Z
1 ∞ 1
f˜(k)eikx0 dk = [f (x0 + 0) + f (x0 − 0)] . (172)
2π −∞ 2
Here are some examples, for which the Laplace integrals are easy to compute.
1
f (t) = 1 , F (p) = (174)
p
1
f (t) = eat , F (p) = (<(p) > <(a)) (175)
p−a
p
f (t) = cos(ωt) , F (p) = (176)
p + ω2
2
ω
f (t) = sin(ωt) , F (p) = (177)
p + ω2
2
ω
f (t) = sinh(ωt) , F (p) = (178)
p − ω2
2
p
f (t) = cosh(ωt) , F (p) = , (179)
p − ω2
2
Proof 4.12
df
pF (p) = L. + f (0)
dt
Z
df ∞
= f (0) + dt . e−pt(190)
0 dt
As p → ∞ the right hand side becomes f (0) as required, provided f (t) is bounded near
t = 0.
Convolutions are also important for Laplace transforms. Recall that for Laplace
transforms we assume that functions vanish for t < 0. Therefore, in a convolution
R∞
integral h(t) = −∞ f (t − u)g(u) du the integrand is nonzero only for t > u > 0. Thus
we have Z t
h(t) = f (t − u)g(u) du , t>0, (192)
0
h(t) = 0 , t<0. (193)
Proof 4.14
Z ∞ Z ∞
F (p)G(p) = e−ps f (s) ds e−pu g(u) du
0 0
Z ∞ Z ∞
−ps
= e f (s)θ(s) ds e−pu g(u)θ(u) du
−∞ −∞
Z ∞ Z ∞
= du g(u)θ(u) dse−p(s+u) f (s)θ(s)
−∞ −∞
Z ∞ Z∞
= du g(u)θ(u) dte−pt f (t − u)θ(t − u)
−∞ −∞
Z ∞ Z ∞
−pt
= dt e dug(u)f (t − u)θ(t − u)θ(u)
−∞ −∞
Z ∞ Z t
= dt e−ptθ(t) duf (t − u)g(u)
−∞ 0
Z ∞ Z t
= dte−pt du f (t − u)g(u)
Z0∞ 0
= dt e−pt h(t)
0
= H(p) . (194)
PHY581 Methods of Theoretical Physics I, Fall 2005 39
with the boundary conditions J0 (0) = 1, J00 (0) = 0. First rewrite the equation as
Now Laplace transform and use the result L.tn f (t) = (−d/dp)n F (p) to get
d d
− (L.J000 ) + L.J00 − (L.J0 ) = 0 . (197)
dp dp
Next I’ll write K(p) ≡ L.J0 and use our results on the Laplace transforms of derivatives
to get
(p2 + 1)K 0 (p) + pK = 0 . (198)
This is now a simple first order equation that we can solve to give
!−1/2
c 1
K(p) = 1+ 2 , (199)
p p
where c is a constant. We can now use the initial value theorem to fix c via 1 = J0 (0) =
limp→∞ pK(p) = c. So finally,
!−1/2
1 1
K(p) = 1+ 2 . (200)
p p
where
(−1)n (2n)!
αn = . (202)
22n (n!)2
Finally, we can use that the Laplace transform of tn is n!/pn+1 , to invert and get
∞
X αn t2n
J0 (t) =
n=0 (2n)!
n
∞
X − 14 t2
= . (203)
n=0 (n!)2
PHY581 Methods of Theoretical Physics I, Fall 2005 40
In this example, we used a Laplace transform that we know to invert another Laplace
transform. However, for general problems we’ll need a general inversion formula, anal-
ogous to the one that we used for Fourier transforms. Remember that for Fourier
transforms the transform variable was real, therefore the inversion formula was a real
integral. However, as I mentioned, with Laplace transforms, the transform variable is
complex in general. Therefore, we’ll end up needing a complex (contour) integral to
invert and recover f (t).
To get to the appropriate inversion formula, we’ll postulate a form for the integral,
and then show how it can be an inversion. Remember that I said we’d need to require
<(p) > γ for some γ in order for the Laplace transform to converge. With this is mind,
let’s examine integrals of the form
Z γ+i∞
dp ept F (p) , (204)
γ−i∞
where F (p) is the Laplace transform of a function f (t). We would like this integral to
yield f (t). Substituting in for the actual Laplace transform we get
Z γ+i∞ Z ∞
pt
dp e du e−pu f (u)θ(u) . (205)
γ−i∞ −∞
So, we can finally rearrange things to get the Bromwich Inversion Formula for the
Laplace transform: Z
1 γ+i∞
f (t)θ(t) = dp eptF (p) . (208)
2πi γ−i∞
Now, notice that, in deriving this, we have not specified what γ is. For a given F (p),
γ is not known a priori. In fact, γ must be chosen so that the right hand side of the
inversion integral is zero for t < 0 (to match the left hand side).
To do this, start with γ > 0 and close the contour using a semicircle in <(p) > γ > 0
to form a closed contour C. Now, for t < 0, the factor of ept ensures that the contribution
from the integral over the semicircle at infinity vanishes. Thus, for the integral around
PHY581 Methods of Theoretical Physics I, Fall 2005 41
C to yield zero , F (p) must have no singularities inside C. Therefore, all singularities of
F (p) must lie to the left of the line <(p) = γ. This fixes γ.
For t > 0, we close the contour in <(p) < γ. This gives
1 I pt
f (t) = e F (p) dp , t>0. (209)
2πi C
If the only singularities of F (p) are isolated poles, the inversion integral can be
performed by the calculus of residues
X
f (t) = Resi [ept F (p)]
poles
X
= epj t F (pj ) , (210)
j
for poles at p = pj . Suppose the pole of F (p) with largest real part is p = pj . Then
f (t) ∼ epj t as t → ∞, and therefore we require γ > <(pj ).
Let me give some examples of how to use Laplace transforms to solve ordinary
differential equations, in particular initial value problems. Consider
with initial values x(0) = 1, ẋ(0) = 0. Write L.x(t) ≡ X(p). Laplace transforming the
equation, and using our results about the Laplace transforms of derivatives, gives
1
[p2 X(p) − px(0) − ẋ(0)] + 2[pX(p) − x(0)] + X(p) = , , (212)
p+1
which, after a little rearranging, implies that
1 1 1
X(p) = + 2
+ . (213)
p + 1 (p + 1) (p + 1)3
But now, using our example from earlier, we can invert each of these term by term to
obtain
t2 −t
x(t) = e−t + te−t + e . (214)
2
Here’s another example
ÿ + ẏ − 2y = 0 , (215)
What’s interesting about this example is that we use the first boundary condition just
after transforming, to dispose of one of the terms generated by Laplace transforming a
derivative, but use the second boundary term only after inverting the transform.
The next example deals with a pair of coupled first order differential equations.
Consider
ẋ + x + 2y = e2t ,
2ẋ + ẏ − x = 0 , (219)
which yields
x(t) = (2t − 1)e2t + et . (223)
PHY581 Methods of Theoretical Physics I, Fall 2005 43
Similarly we obtain
y(t) = (1 − 3t)e2t − et . (224)
ẍ + ω 2 x = f (t) , (225)
F (p)
X(p) = . (226)
p2+ ω2
and use the convolution theorem to tell us that x(t) = (g ∗ f )(t). This reads
Z t sin ω(t − t0 ) 0
x(t) = dt0 f (t ) , (228)
0 ω
for t > 0. Therefore, g(t) is the Green’s function of the problem.
As a final example for Laplace transforms, consider the diffusion equation
∂2u 1 ∂u
2
= , (229)
∂x k ∂t
in x ≥ 0, t ≥ 0, subject to u(0, t) = f (t), given, u(x, 0) = 0, and u(x, t) → 0, as x → ∞.
This problem could describe, for example, a prescribed heating (f (t)) applied to the
x = 0 end of a semi-infinite rod, initially unheated.
Perform the Laplace transform with respect to t:
Z ∞
U(x, p) = dt e−ptu(x, t) . (230)
0
Note that, evaluating this at x = 0 gives U(0, p) = L.f (t) ≡ F (p). Now, the diffusion
equation, using u(x, 0) = 0, gives
∂2U p
= U , (231)
∂x2 k
which is easily solved to give
q q
U(x, p) = A(p) exp(− p/kx) + B(p) exp( p/kx) . (232)
PHY581 Methods of Theoretical Physics I, Fall 2005 44
Now, since the x-dependence of U and u will be the same, we require U(x, p) → 0, as
q
x → ∞, which gives B(p) = 0. So, we have U(x, p) = A(p) exp(− p/kx). At x = 0, we
have U(0, p) = A(p), and so we write
q
U(x, p) = U(0, p) exp(− p/kx) ,
= F (p)G(p) , (233)
q
with G(p) = exp(− p/kx). Therefore, the convolution theorem tells us that u(x, t) =
(g ∗ f )(x, t), and it remains to find g(x, t). But from an earlier result we know this:
s !
x2 −3/2 x2
g(x, t) = t exp − . (234)
4πk 4kt
I’ve given you a lot of examples using the Laplace transform. During this time you’ve
had some time to get used to the Fourier transform. I’d now like to go back to the Fourier
transform for one example that is of particular physical significance.
Consider the problem of finding the Green’s function that satisfies
!
∂2
− 2 − q 2 G(x, x0 ) = δ(x − x0 ) , (236)
∂x
where q is a fixed, real, positive number, and −∞ < x, x0 < ∞. This Green’s function
describes one-dimensional scattering in quantum mechanics. Set x0 = 0 (w.l.o.g.), and
then Fourier transform. We obtain that
(k 2 − q 2 )G̃(k) = 1 , (237)
with the function we’re looking for given by the inversion formula
Z ∞
G(x) = dk e−ikx G̃(k) . (238)
−∞
Now, we would like to solve for G̃(k). However, G̃(k) = 1/(k 2 − q 2 ) will not do, because
it puts poles on the real k-axis, and this gives problems for the inversion integral.
To proceed, we will apply Feynman’s Rule. This technique is extremely important
in quantum mechanics and quantum field theory. Replace q 2 by q 2 ± i. This enables
PHY581 Methods of Theoretical Physics I, Fall 2005 45
one to define two independent Green’s functions G± (x) by taking the limit → 0 at an
appropriate later stage. Consider
1
G̃+ (k) = ,
k2 − (q 2 + i)
1
= ,
[k − (q 2 + i)1/2 ][k + (q 2 + i)1/2 ]
1 1
= · , (239)
k − q − i k + q + i0
0
where 0 ≡ /(2q 2 ), and lim→0 and lim0 →0 are equivalent. So, we have
Z ∞ 1 1
G+ (x) = dx e−ikx · . (240)
−∞ k − q − i k + q + i0
0
Now, for x > 0, we can evaluate this integral by closing the contour in the upper half
k-plane (=(k) > 0), and for x < 0, we can evaluate this integral by closing the contour
in the lower half k-plane (=(k) < 0). We use the residue theorem, and then take the
limit lim→0 at the very end. The result is
eiqx e−iqx
G+ (x) = πiθ(x) + πiθ(−x) . (241)
q q
One can calculate G− (x) similarly. Given our technique, you should check that these
Green’s functions obey the differential equation.
5 Sturm-Liouville Theory
Let’s begin with an example to get the feel of the kind of problems we’ll tackle with
these techniques.
Consider a uniform string with fixed ends. The displacement of this string obeys the
wave equation
1 ∂2y ∂2y
= , (242)
c2 ∂t2 ∂x2
with boundary conditions y = 0 at x = 0 and at x = l for all time. To start, we separate
variables, making the ansatz y(x, t) = X(x)T (t). This yields
1 T̈ X 00
− = − = const = λ , (243)
c2 T X
say. Vibrations correspond to T ∼ eiωt , so λ = ω 2 /c2 . We seek solutions of
X 00 = −λX , (244)
PHY581 Methods of Theoretical Physics I, Fall 2005 46
L = −DpD + q , (246)
to be solved for y(x) for x ∈ I, subject to the boundary conditions (to be specified),
with w(x) > 0 on I, and λ an eigenvalue parameter. The solution has some general
features
1. ∃ nontrivial solutions which obey the boundary conditions in use iff λ ∈ S, the
spectrum of the problem. S is a monotonic set of discretely distributed real eigen-
values λn , so that λ0 < λ1 < λ2 < · · ·, with λn → ∞ like n2 as n → ∞.
and suitable boundary conditions, for distinct values λ1 and λ2 of the eigenvalue param-
eter. Form the object Z b
dx [y2 × (248) − y1 × (249)] . (250)
a
This yields
Z b Z b
(λ1 − λ2 ) dx w(x)y1 (x)y2 (x) = dx [−(py10 )0 y2 + (py20 )0 y1 ] ,
a a
= [−(py10 )y2 + (py20 )y1 ]ba . (251)
Now, appropriate boundary conditions are those that make this vanish. For then, since
λ1 6= λ2 , we have Z b
dx w(x)y1(x)y2 (x) = 0 . (252)
a
This is the sense in which y1 and y2 are orthogonal with respect to the weight function
w(x).
A good example is given by Legendre’s equation and polynomials. This arises from
the Laplace equation in cylindrical coordinates (r, θ, z). Writing x ≡ cos θ, the Legendre
equation is " #
d d
− (1 − x2 ) P (x) = λP (x) , (253)
dx dx
with I = (−1, 1). Note that this is a Sturm-Liouville problem with w(x) = 1, and
p(x) = (1 − x2 ). The suitable boundary conditions are automatically imposed if P (x) is
finite at x = ±1, since p(x) → 0 at the endpoints. The solutions are a set of polynomials,
the first few of which are
P0 = 1 , λ0 = 0 , (254)
P1 = x , λ1 = 2 , (255)
1
P2 = x2 − , λ2 = 6 . (256)
3
More generally, there exists a unique Pn = xn + · · · with λn = n(n + 1). It can also be
checked that the Pn are orthogonal on I.
PHY581 Methods of Theoretical Physics I, Fall 2005 48
This yields
Z b Z b
∗ ∗
(λ − λ ) dx w(x)y(x)y(x) = dx [−(p(y ∗)0 )0 y + (py 0)0 y ∗] ,
a a
= [−(p(y ∗ )0 )y + (py 0)y ∗ ]ba ,
= 0, (260)
for the suitable boundary conditions. Now, since w(x) > 0 on I, this implies that
Rb
a dx w|y|2 is strictly positive on I. Therefore,
λ∗ = λ . (261)
i.e., λ is real.
and note that “suitably well-behaved” requires that ||f || = (f, f )1/2 exists. Then,
(yn , ym ) = 0 for n 6= m. Further, choose the scale of yn to achieve orthonormality:
Also
X
(ym , f ) = cn (yn , ym ) ,
n
X
= cm δnm ,
n
= cm . (265)
This is to be compared with the formulae for Fourier series. Thus, if we assume that
the yn are known, then for a given f we can obtain the cm via
Z b
cn = dξ w(ξ)yn(ξ)f (ξ) . (266)
a
An important and useful result follows if we substitute this into the expansion for
f (x). We obtain
"
∞ Z
#
X b
f (x) = dξ w(ξ)yn(ξ)f (ξ) yn (x) ,
n=0 a
Z " ∞
#
b X
= dξ w(ξ)yn(ξ)yn (x) f (ξ) . (267)
a n=0
This is a formal completeness relation for the Sturm-Liouville problem. Note that we
can check this if we assume that δ(x − ξ) obeys the boundary conditions for a < ξ < b.
We can then expand the delta function as
∞
X
δ(x − ξ) = cn (ξ)yn (x) . (269)
n=0
where the yn obey suitable boundary conditions on I = [a, b], and w(x) > 0 on I. By
solved I mean that the λn and yn are determined, and (yn , ym ) = δnm has been arranged.
We would like to solve the problem
for y(x), for x ∈ I, subject to the same boundary conditions. Here λ is a fixed real
number, and f (x) is a given function; naturally assumed to obey the same boundary
conditions. To proceed, write
f (x) = w(x)h(x) ,
∞
X
h(x) = hn yn (x) , (273)
n=0
where we can calculate the coefficients hn when f (and hence h) is given, and the yn are
known.
We now posit the expansion
∞
X
y(x) = an yn (x) , (274)
n=0
and seek the unknowns an to complete the specification of the solution. Substituting in
to the problem (272), the left hand side becomes
∞
X ∞
X
an (L − λw)yn(x) = an (λn w − λw)yn (x) ,
n=0 n=0
∞
X
= w(x) an (λn − λ)yn (x) . ,
n=0
Finally, equating these, multiplying both sides by ym (x), and integrating over a < x < b,
we obtain
am (λm − λ) = hm . (276)
Now, if λ ∈
/ S, we have
hn
an = . (277)
λn − λ
PHY581 Methods of Theoretical Physics I, Fall 2005 51
We now have the solution in terms of quantities calculated for the homogeneous
equation. It is instructive to write this in another way. The solution is
∞
X hn
y(x) = yn (x) ,
n=0 λn − λ
∞
"Z #
X 1 b
= yn (x) dξ w(ξ)yn(ξ)h(ξ) ,
n=0 λn − λ a
Z " ∞
#
b X
yn (x)yn (ξ)
= dξ f (ξ) , (278)
a n=0 λn − λ
We can check that this Green’s function behaves as it should, by acting on it with
the left hand side of (272):
∞
X yn (ξ)
[L − λw(x)]G(x, ξ) = (L − λw)yn (x) ,
n=0 λn − λ
X∞
yn (ξ)
= (λn − λ)w(x)yn (x) ,
n=0 λn − λ
∞
X
= yn (ξ)w(x)yn (x) ,
n=0
as expected.
5.6 Self-Adjointness
It has probably not escaped your notice that the Sturm-Liouville problem and its solu-
tions are related to things that you’ve seen in quantum mechanics. Here, we shall see
precisely what this relation is. In particular, we shall compare the terms self-adjoint (in
Sturm-Liouville theory) and Hermitian (in quantum mechanics).
I will begin by considering the case of w(x) = 1. Consider a typical Sturm-Liouville
problem
Ly = λy ,
L = −DpD + q , (281)
PHY581 Methods of Theoretical Physics I, Fall 2005 52
with boundary conditions y(a) = y(b) = 0. In the vector space V of (possibly complex)
functions f , g, obeying the boundary conditions, we will use the inner product
Z b
(f, g) = dx f (x)∗ g(x) . (282)
a
With this definition, note that our self-adjoint operator L is hermitian (for the special
case w = 1). You can check this trivially using the definition of the inner product. Do
this as a brief exercise.
Now turn to the case of w 6= 1. In this course, and in Sturm-Liouville theory in
general, we refer to the operator L as self-adjoint, even when w 6= 1. we also use
Rb
Ly = λwy and (f, g) = a dx wf g, from which real λn and orthogonal yn follow. However,
with this definition, (Lf, g) 6= (f, Lg), because of w. To understand the relationship to
hermiticity, we define
M ≡ −w −1/2 DpDw 1/2 + q , (285)
Therefore, care is needed with terminology, even though either definition leads to real
eigenvalues and orthogonal eigenfunctions.