Bh-Notes-2014 - 15 Imperial College
Bh-Notes-2014 - 15 Imperial College
Bh-Notes-2014 - 15 Imperial College
Fay Dowker
Blackett Laboratory,
Imperial College,
London, SW7 2AZ,
U.K.
Contents
5 Hawking Radiation 57
5.1 Quantum Field Theory in Curved Spacetime 59
5.2 QFT in Rindler Space in 1 + 1 dimensions 62
5.3 QFT in the spacetime of a spherical collapsing star: Hawking Radiation 70
Recommended Reading:
Popular/Historical:
Textbooks:
Lecture Notes:
General:
• S. W. Hawking & R. Penrose, The Nature of Space and Time, Princeton Uni-
versity Press, 1996.
–2–
1 The Schwarzschild Black Hole
The Schwarzschild metric (1916) is a solution to the vacuum Einstein equations
Rµν = 0. It is given by
−1
2 µ ν 2M 2 2M
ds = gµν dx dx = − 1 − dt + 1 − dr2 + r2 dΩ22 , (1.1)
r r
where 0 < r < ∞ is a radial coordinate and dΩ22 = dθ2 + sin2 θdφ2 is the round
metric on the two-sphere.
To study the physics of black holes, we will start with a simple model of spheri-
cally symmetric gravitational collapse: a ball of pressure-free dust that collapses un-
der its own gravity. Since the dust ball is spherically symmetric, Birkhoff’s theorem
tells us that outside of it (where the energy-momentum tensor vanishes), the metric
must be Schwarzschild. Since the metric is continuous, it must be Schwarzschild
on the surface, too. Let us follow the path of a massive particle on the surface of
the dust ball. Assume it follows a radial geodesic, dθ = dφ = 0. Since the pull of
gravity in the r-direction is the same for all particles at a given radius, the particle
will remain at the surface throughout its trajectory.
–3–
The factor dt/dτ can be found from the Euler-Lagrange equations of the action:
∂L d ∂L
− = 0. (1.5)
∂xµ dτ ∂(dxµ /dτ )
The action of the dust particle in the Schwarzschild background takes the form
Z 2 −1 2 !
1 2M dt 2M dR
S= − 1− + 1− − m2 dτ. (1.6)
2m R dτ R dτ
Since it has no explicit dependence on the t-coordinate (it only depends on dt/dτ ),
the Euler-Lagrange equation for the t-coordinate
d ∂L d 2M dt
= −2 1 − =0 (1.7)
dτ ∂(dt/dτ ) dτ R dτ
The constant is equal to the energy per unit rest mass of the particle, as mea-
sured by an inertial observer at r = ∞. To see this, recall that the energy of
a relativistic particle is given by the t-component of the momentum (co-)vector
pµ = (−E, p) = mgµν dxµ /dτ . Then
dt 2M dt
E/m = −p0 /m = g00 = 1− = > 0. (1.9)
dτ R dτ
where we have defined Ṙ ≡ dR/dt. This gives the following equation for Ṙ:
2
2 −2 2M 2M 2
Ṙ = 1− −1+ . (1.11)
R R
–4–
Figure 1. Ṙ2 as a function of R for the massive surface particle.
How much coordinate time t2M does it take the particle to reach R = 2M from
R = Rmax ? Integrating (1.11), we find
Z t2M Z 2M −1 − 12
2M 2M
t2M = dt = − dR 1 − − 1 + 2 = ∞! (1.13)
0 Rmax R R
To convince yourself that this integral is infinite, look at the contribution to the
integral from the region near R = 2M . Let R = 2M + ρ with 0 < ρ 2M and
Taylor expand in ρ:
Z 0 −1 − 12
2M 2M 2
t2M = finite number − dρ 1 − −1+
ρ 2M + ρ 2M + ρ
Z 0 (1.14)
2M 0
= finite number − dρ + O(ρ ) .
ρ ρ
This is infinite since the 1/ρ term in the integral diverges logarithmically at 0.
Hence, as far as the coordinate time t is concerned, the particle never reaches the
Schwarzschid radius.
This result seems physically rather strange. The surface of a collapsing ball of
dust slows down as it approaches R = 2M and never actually reaches R = 2M .
–5–
Figure 2. (dR/dτ )2 as a function of R for the massive surface particle.
However, notice that while t is equal to the proper time along the worldliness of in-
ertial observers in the “rest frame of the star” at r = ∞, it has no physical meaning
in the region of spacetime close to r = 2M relevant to the infalling particle itself. t
is just a coordinate, and coordinates in general relativity are not physical.
So let us instead work with the proper time τ along the world line of the infalling
particle. How does R change as a function of τ ? Using (1.8) and (1.4) we find
2
dR 2 Rmax
= (1 − ) −1 . (1.15)
dτ R
We see that nothing strange at all happens as R → 2M + . The particle (and therefore
the surface of the dust ball) passes smoothly through R = 2M . This is confirmed by
the plot of (dR/dτ )2 against R in Figure 2.
Let us calculate the proper time τ2M that elapses for the particle between R = Rmax
and R = 2M . Using (1.15), we find (exercise)
Z τ2M Z 2M
dτ 2M √
τ2M = dτ = dR = 2
1 − + arcsin , (1.16)
3
0 Rmax dR (1 − 2 ) 2
–6–
The proper time taken to reach r = 0 is also finite:
Z 0
dτ πM
τ0 = dτ = 3 . (1.17)
Rmax dR (1 − 2 ) 2
This means that the entire dust star collapses to the single point r = 0 in a finite
proper time, a first sign that there is a true singularity at r = 0 — a curvature singu-
larity — at which physical (coordinate-independent) quantities such as Rµνρσ Rµνρσ
diverge.
While the calculations above have given us some useful insights, we have been
a bit careless. We worked out the geodesics using the Schwarzschild metric, and
traced them through r = 2M , even though the metric (1.1) breaks down at r = 2M .
Furthermore, for r < 2M , the factor 1 − 2M
r
becomes negative, which makes t
appear like a space-coordinate and r like a time-coordinate. Statements such as “the
star collapses to r = 0 in finite time” then become somewhat suspect. To make our
calculations sound, let us replace the coordinate system (t, r, θ, φ) by a more suitable
set of coordinates.
so that null geodesics obey the simple equation dt2 = dr∗2 . Solving (1.19) and requir-
ing r∗ to be real and increasing with r, we obtain
r − 2M
r∗ = r + 2M ln . (1.20)
2M
–7–
The solutions t ± r∗ = const. correspond to ingoing and outgoing null geodesics.
Let us define a new pair of lightcone coordinates:
u = t − r∗
(1.21)
v = t + r∗ ,
such that lines of constant v correspond to ingoing null geodesics and lines of con-
stant u to outgoing null geodesics.
–8–
�* =�-�
�
� �� ��
-�
-�
Figure 3. Ingoing (dashed red) and outgoing (solid blue) null geodesics in IEF coordinates
for M = 1. The horizon at r = 2M is highlighted in black. Both axes labelled in units of
M.
As you can see in Figure 3, this means that the particle must eventually hit r = 0.
Furthermore, no signal from an event inside the event horizon can ever escape the
black hole to reach an observer at r > 2M .
From Figure 3 we can also get an idea how someone hovering at a fixed radius
r > 2M outside the black hole will perceive the infalling matter as it falls across the
horizon. Imagine a signal being transmitted from the surface of the dust ball at a
constant rate. Due to the bending of the lightcones in the vicinity of the horizon,
the signals reaching the observer will become sparser and sparser as the surface ap-
proaches r = 2M . If we think of these signals as electromagnetic radiation, i.e. light,
then the light is shifted toward the lower (i.e. red) end of the spectrum, and the
signal received by the distant observer becomes redder and redder until it eventually
disappears.
–9–
�* =�+�
�
� �� ��
-�
-�
Figure 4. Ingoing (dashed red) and outgoing (solid blue) null geodesics in the OEF metric
for M = 1. The horizon at r = 2M is highlighted in black.
Note the similarity between (1.22) and (1.23): the only difference is the sign of the
cross-term. However, the physical picture of the spacetime described by the line-
element (1.23) turns out to be very different. To see this, let us consider again
“ingoing” (v = const.) and “outgoing” (u = const.) null geodesics. They are de-
picted in Figure 4, where we have defined the OEF time t∗ = u + r∗ . “Ingoing”
photons emitted at r > 2M or r < 2M never cross r = 2M : they approach r = 2M
and hover the horizon forever. Conversely, all outgoing null geodesics escape to in-
finity. Looking at the lightcones, we see that everything inside r = 2M is ejected.
The interior region r < 2M is therefore called a white hole and r = 2M the white
hole horizon. It is the exact time reversal of a black hole (you can see this by turning
Figure 3 upside down and comparing it with Figure 4).
What happens in the spacetime described by OEF coordinates is clearly very dif-
ferent from what happens in the spacetime described by IEF coordinates. This may
seem contradictory, since both are obtained by coordinate transformations from the
original Schwarzschild spacetime. We will learn how to make sense of this apparent
contradiction by introducing yet another coordinate system.
– 10 –
where r = r(u, v) can be given in terms of u and v via (1.20) and r∗ = (v − u)/2.
This metric has a coordinate singularity at r = 2M , so it is only defined for r 6= 2M .
To remove the singularity, we define a new pair of coordinates U and V in the region
r > 2M by
u v
U = − exp − and V = exp . (1.25)
4M 4M
Note that U < 0 and V > 0 for all values of r. The coordinates (U, V, θ, φ) are called
Kruskal-Szekeres (KS) coordinates. The Schwarzschild metric in KS coordinates is
(exercise):
2 32M 3 r
ds = − exp − dU dV + r2 dΩ22 , (1.26)
r 2M
where r = r(U, V ) is defined in terms of U and V by the implicit equation
U V = − r−2M r
2
2M
exp 2M
.
This metric is well-defined for r = 2M and indeed for all r > 0. Notice that for
r < 2M , we have U V > 0, which is incompatible with (1.25). However we consider
the initial definitions of U and V in terms of the Schwarzschild coordinates for now
as only holding for r > 2M and extend the spacetime to coordinate ranges of U and
V to −∞ < U, V < ∞ with metric (1.26).
Figure 5 is a picture of the spacetime described by the KS metric. Since null lines
are conventionally plotted at 45◦ , we define time and space coordinates T = U + V
and X = T − V , which label the vertical and horizontal axes in the figure. The
U and V axes are then at 45◦ to the T and X axes. At the horizon r = 2M , we
have U V = 0, which means either U = 0 or V = 0. This corresponds to the solid
diagonals. The singularity r = 0 corresponds to the (two branches of the) hyper-
bola described by U V = 1, which is represented by a wavy line (singularities will
always be represented by wavy lines). In general, surfaces of r = const. correspond
to hyperbolae U V = const. with U V < 1, as shown in blue on the diagram. Spatial
sections with t = const. have U/V = const. and |U/V | < 1, which corresponds to
straight lines through the origin with gradient between −1 and 1. Finally, ingoing
and outgoing null geodesics are respectively given by U = const. and V = const.
We can now see the relation between the different coordinate systems. The
original Schwarzschild coordinates (t, r, θ, φ) only cover region I. When we changed
to IEF and OEF coordinates, we extended our spacetime to regions II and III,
respectively. There is a new region that we only see in KS coordinates, region IV .
2
We can write r explicitly as r(U, V ) = 2M [1 + W (−U V /e)] where e is Euler’s constant and W
stands for the Lambert W -function.
– 11 –
Figure 5. Kruskal spacetime. Either each point of this diagram can be thought of as
representing a 2−sphere of radius r(T, X) or the diagram is interpreted as the causal
structure of radial motion at one fixed polar angle {θ, φ}. The black lines at 45◦ correspond
to r = 2M . The singularity r = 0 is represented by a wavy line. Solid blue hyperbolae
correspond to r = const. surfaces and straight dashed lines correspond to t = const.
surfaces.
The geometry of spatial sections is also worth a closer look. Consider a slice
with t = const. ⇐⇒ T /X = const. On Figure 5, it corresponds to a straight line
through the point (T, X) = (0, 0) ⇐⇒ r = 2M , where regions I and IV attach to
each other. In both regions the metric on the slice is given by
−1
2 2M
ds = 1 − dr2 + r2 dΩ22 (1.28)
r
and in both regions r increases from 2M to ∞. For large r, we have 1 − 2M
r
→1
and so the geometry becomes flat Euclidean. As we approach r = 2M , i.e. the centre
– 12 –
Figure 6. The Einstein-Rosen bridge with one spatial dimension suppressed. Each circle
represents a two-sphere in the three-dimensional analogue.
in Figure 5, the metric (1.28) starts to deviate from the Euclidean metric. What is
the geometry near r = 2M ? Since we cannot draw a three-dimensional surface, let
us suppress one angular coordinate by setting θ to its equatorial value θ = π2 . Then
−1 2
the metric is ds2 = 1 − 2M r
dr + r2 dφ2 . It is not hard to show (exercise) that
this is just the metric on a so-called quartic surface x2 + y 2 = (z 2 /8M + 2M )2 em-
bedded in three-dimensional Euclidean space E3 with cartesian coordinates x, y, z.3
The geometry is shown in Figure 6, where two identical copies of the surface have
been attached to each other at the circle r = 2M . One side corresponds to region I
and the other side to region IV . This is known as the Einstein-Rosen bridge, which
is one example of a wormhole. Note that no observer can ever cross the wormhole, as
you can see clearly in the Kruskal diagram (and, further along in the lectures, in the
Penrose diagram for Kruskal space): in order to cross the wormhole from region I to
IV or vice-versa, the trajectory of the observer would have to be spacelike somewhere.
We can also get an intuitive sense for the strange sign change that appears in
the original Schwarzschild metric (1.1) between r > 2M and r < 2M , which makes r
appear like a time coordinate when r < 2M . Indeed, if r = 0 were just a “position in
space”, as one might naively think of it, it would seem that one could simply avoid
it by navigating around it. In Figure 5, we see that for anyone who has fallen across
the horizon, the singularity r = 0 is not a position in space — it becomes a moment
of time, as unavoidable as 9am tomorrow morning.
3
Hint: Recall that the metric onpE3 is ds2 = dx2 + dy 2 + dz 2 . Consider the surface parametrised
by x = r cos φ, y = r sin φ and z = 8M (r − 2M ), show that it satisfies the quartic equation above
and find the metric induced on it. For an illustration see http://wolfr.am/WIRjK3.
– 13 –
Finally, we can justify in retrospect some of the coordinate extensions (analytic
continuations) that we performed rather ad-hoc in the last sections. For example,
consider the extension from region I covered by Schwarzschild coordinates to regions
I + II covered by IEF coordinates. The original coordinate system with time co-
ordinate −∞ < t < ∞ covers region I only. If we think of region I as a physical
spacetime in its own right, then a particle will hit a “boundary” (r = 2M ) in fi-
nite proper time τ2M . In light of this, it seems physically quite reasonable to work
instead with the spacetime I + II obtained by extending the original coordinate
system, where this artificial boundary disappears.
– 14 –
g̃ that is regular on the edges. Then (M, g̃) is a good representation of the original
spacetime (M, g) insofar that it has exactly the same causal structure. It is custom-
ary to add the points at infinity to M to form a new manifold M̃ . The resulting
spacetime (M̃ , g̃) is what is called the conformal compactification of (M, g).
Note that the curvature tensors are in general not preserved under conformal
µ µ
transformations, e.g. R̃νρσ 6= Rνρσ , R̃ 6= R etc. In that sense, the conformally com-
pactified spacetime (M̃ , g̃) is unphysical: it provides a good representation of the
causal structure of the physical spacetime (M, g), but it should otherwise not be
viewed as a picture of what is going on (for example, as noted above, the geodesics
of massive test particles in (M̃ , g̃) do not correspond to geodesics of massive test
particles in (M, g)).
Now, to shrink “infinity” down to a finite coordinate distance, we define a new set
of coordinates via
u = tan ũ
(1.32)
v = tan ṽ.
These coordinates indeed have a finite range − π2 < ũ, ṽ < π2 (the range is open since
points with u, v = tan(± π2 ) = ±∞ are not points in spacetime). The line-element in
terms of ũ and ṽ is
1
ds2 = − dũdṽ. (1.33)
(cos ũ cos ṽ)2
It diverges as u, v → ±∞ ⇐⇒ ũ, ṽ → ± π2 . Define a new metric g̃µν through a
conformal transformation on gµν :
This metric is regular at the points at infinity where either ũ or ṽ are equal to ± π2
and we can now add those points to the spacetime. The resulting spacetime (M̃ , g̃)
is the conformal compactification of (M, g). Both spacetimes are shown in (ũ, ṽ)
coordinates in Figure 7.
– 15 –
Figure 7. Left: Minkowski space (M, g) in (ũ, ṽ) coordinates. The boundaries ũ, ṽ = ± π2
are not part of M and g diverges there. Some timelike and spacelike geodesics of g have been
included: lines with r = const. are represented by dashed curves and lines with t = const.
are represented by solid curves. Right: The Penrose diagram of conformally compactified
Minkowski space (M̃ , g̃), with future/past timelike infinity i± , future/past null infinity J ±
and spacelike infinity i0 . The timelike and spacelike geodesics of Minkowski space are
clearly not all geodesics for (M̃ , g̃), since g̃ is flat in (ũ, ṽ) coordinates.
The two points (ũ, ṽ) = ( π2 , π2 ) and (− π2 , − π2 ) are denoted by i± . All future (past)
directed timelike curves end up at i+ (i− ), so we refer to i+ (i− ) as future (past) time-
like infinity. Future directed null geodesics either end up at ṽ = π2 with constant
|ũ| < π2 or at ũ = π2 with constant |ṽ| < π2 . This set of points is denoted by J +
(“scri-plus”) and referred to as future null infinity. An analogous definition holds
for past null infinity J − (“scri-minus”). Finally, spacelike infinity i0 denotes the set
of endpoints of spacelike geodesics, which corresponds here to (ũ, ṽ) = ( π2 , − π2 ) and
(ũ, ṽ) = (− π2 , π2 ).
where dΩ22 is the round metric on the 2-sphere. Define light-cone coordinates u = t−r
and v = t + r and perform the same transformation (1.32) as above to bring infinity
– 16 –
Figure 8. Left: The Penrose diagram of 3 + 1 dimensional Minkowski space with some
lines of constant r and t. Each point represents a 2−sphere. As the null geodesic shown
passes through r = 0, it emerges on another copy of the Penrose diagram whose points
represent the antipodes on the two-spheres. Right: The conformal compactification drawn
as a portion of the Einstein static universe with the same null geodesic.
to finite coordinate distance. The only difference to the previous analysis is that
since r ≥ 0, we have the additional constraint u ≤ v ⇐⇒ ũ ≤ ṽ. The metric in
(ũ, ṽ, φ, θ) coordinates reads
1
ds2 = − 2 2
2 −4dũdṽ + sin (ṽ − ũ) dΩ2 (1.37)
(2 cos ũ cos ṽ)
and its conformal compactification corresponds to the spacetime (M̃ , g̃) with metric
ds̃2 = (cos ũ cos ṽ)2 ds2 = −4dũdṽ + sin2 (ṽ − ũ) dΩ22 . (1.38)
and extended coordinate ranges −π/2 ≤ ũ, ṽ ≤ π/2. On the Penrose diagram of
1 + 1 dimensional Minkowski space in Figure 7, ũ ≤ ṽ implies that we only include
the part that lies to the right of the vertical line x = 0. The resulting diagram for the
3 + 1 space is shown in Figure 8. Every point on the diagram represents a two-sphere
of radius sin(ṽ − ũ).
There exists a somewhat more illustrative way to picture the conformal com-
pactification. Define T̃ = ṽ + ũ and χ = ṽ − ũ. The coordinate ranges are then
(exercise) −π < T̃ < π and 0 < χ < π and the metric reads
– 17 –
Figure 9. Eternally accelerating observers in Minkowski space. Their worldlines are shown
in blue and labelled by ξ. Events in the shaded region such as the black dot are hidden
to them. The Rindler horizon is the boundary between the shaded and unshaded regions.
Rindler space corresponds to the right wedge outlined by the dashed black null lines. The
straight dotted lines are lines of constant Rindler time η.
The spatial part dχ2 + sin2 χdΩ22 of this metric is just the round metric of a three-
sphere S 3 parametrised by polar coordinates (χ, θ, φ). The spacetime (1.39) therefore
represents a static universe with spherical spatial slices, which corresponds to a fi-
nite portion of the Einstein static universe (ESU) whose topology is R × S 3 . This is
shown in Figure 8, and the Penrose diagram corresponds to that part of the region
wrapped around the cylinder facing out of the page.
– 18 –
Figure 10. The Penrose diagram for 1 + 1 dimensional Rindler space, seen as a portion of
Minkowski space. Some accelerated worldlines (curves of constant ξ) have been drawn for
ξ = 29 , 25 , 23 , 1, 23 , 52 , 29 . Note that all of the worldlines represent observers accelerating in the
postitive x−direction, even though they appear to bend toward the right for ξ > 1. This
distortion is a side effect of the coordinate transformation (u, v) → (ũ, ṽ).
– 19 –
Define a new set of null coordinates via U = tan Ũ and V = tan Ṽ , such that
− π2 < Ũ , Ṽ < π2 . Then the line-element becomes
32M 3
r
2 −2 2 2 2 2
ds = (2 cos Ũ cos Ṽ ) −4 exp − dŨ dṼ + r cos Ũ cos Ṽ dΩ2 (1.42)
r 2M
– 20 –
Figure 11. Left: The Penrose diagram for Kruskal space. The possible trajectory of the
surface of a collapsing star is shown — the region to the left of the curve corresponds to
the interior of the star, where spacetime is not described by the Kruskal metric. Right:
The Penrose diagram for a collapsing star. The curved line represents the surface and
the shaded region corresponds to the interior of the star. The horizon corresponds to the
dashed line.
– 21 –
Figure 12. Left: The 1 + 1 dimensional de Sitter hyperboloid embedded in 2 + 1 dimen-
sional Minkowski space. Some curves of constant θ and t are shown. Right: The Penrose
diagram of de Sitter space. Dotted lines are lines of constant η. The diagonal line is the
de Sitter horizon for a comoving observer at the north pole χ = 0.
To construct the Penrose diagram of de Sitter space we pull out the factor of
a cosh2 (t/a) in the metric (1.47):
2
and define the “conformal time coordinate” η accordingly: dη 2 = dt2 /(a2 cosh2 (t/a)).
Integrating, we obtain η = ±2 tan−1 et/a +c. Choose the upper sign and fix c = − π2 so
that η is monotonically increasing with t and has the symmetric range η ∈ − π2 , π2 .
– 22 –
Figure 13. Left: The conformal compactifiction of de Sitter space viewed as a finite slab
of the Einstein static universe. Right: The Penrose diagram for the negative mass black
hole. The singularity at r = 0 can be seen from J + .
|
term 1 − 2M = (1 + 2|M
r r
) is then always positive and there are no singularities in
the metric except the curvature singularity at r = 0.
We can construct the Penrose diagram in the same manner as in the positive
mass case, by writing the metric in terms of u = t − r∗ and v = t + r∗ coordinates
(with the restriction r ≥ 0 ⇐⇒ v ≥ u) and proceeding with the conformal com-
pactification as usual. The Penrose diagram is shown in Figure 13.
Notice that this singularity is different from the black hole singularity: it can
be seen from J + . Conversely, the singularity inside the black hole is hidden behind
the horizon r = 2M . A singularity that can be seen from J + is known as a naked
singularity. The white hole has a naked singularity too, as it can be seen in Figure 11.
While naked singularities occur quite frequently in solutions to Einsteins equations,
their physical status is debated. Roger Penrose has formulated the cosmic censorship
hypothesis: “Nature abhors a naked singularity”, which encapsulates the expectation
that naked singularities (except for the Big Bang) are unphysical and do not occur
in the real world.
– 23 –
we have observational evidence all appear to be rotating, while the Schwarzschild
solution has zero angular momentum. In this section we review two further, more
general, black hole solutions.
∆ = Q2 − 2M r + r2 = (r − r+ )(r − r− ) (2.4)
p
where r± = M ± M 2 − Q2 . The metric then reads
∆ 2 r2 2
ds2 = − 2
dt + dr + r2 dΩ22 . (2.5)
r ∆
There are three separate cases to look at: Q > M , Q < M and Q = M . Let’s
consider them in turn.
– 24 –
2.1.1 Super-Extremal RN: Q > M
If Q > M then ∆ has no real roots and the metric is regular for all r > 0. There is
a curvature singularity at r = 0. This is the same situation as for the negative mass
black hole, and the Penrose diagram looks exactly the same (Figure 13).
Note that an electron has charge e = 1.6×10−19 C and mass me = 9.1×10−31 kg.
√
In geometrised units this means Q = e/ 4π0 G = 1.4 × 10−29 kg, so Q M . Could
it be that an electron is just a charged black hole? No. The electron is a quantum
mechanical object, whose Compton wavelength λ = h/mc = 2.4 × 10−12 m is much
larger than its Schwarzschild radius rs = 2Gme /c2 = 1.4 × 10−57 m.
– 25 –
Figure 14. The Penrose diagram for the sub-extremal Reissner-Nordström solution.
– 26 –
2.2 Rotating Black Holes
So far we have only discussed solutions with spherical symmetry. We now introduce
the Kerr-Newman solution (1965) to the Einstein-Maxwell equations, which describes
a rotating charged black hole of mass M , charge Q and angular momentum J ≡ M a
(so a is the angular momentum per unit mass). In Boyer-Lindquist coordinates
(t, r, θ, φ), in which the black hole rotates about the polar axis, the metric part of
the solution is given by
∆ − a2 sin2 θ a sin2 θ 2
2 2 Σ 2
r + a2 − ∆ dtdφ
ds = − dt + dr − 2
Σ ∆ Σ
2 2 2 2 2
(2.15)
2 (r + a ) − ∆a sin θ 2 2
+ Σdθ + sin θdφ
Σ
We will investigate the structure of the simple but illustrative special case of a
rotating black hole with zero charge Q = 0. The metric (2.15) then reduces to the
Kerr solution (1963):
2
2 dr 2M r 2
ds = Σ + dθ + (r2 + a2 ) sin2 θdφ2 +
2
a sin2 θdφ − dt − dt2 (2.17)
∆ Σ
– 27 –
Below we will show a coordinate transformation that removes them. It also has a
curvature singularity at
The latter condition implies that the curvature singularity is only there when θ = π2 ,
i.e. when r = 0 is approached along the equator. When approached from any other
angle, there is no singularity at r = 0.
There are again three cases to consider: M < a, M = a and M > a. We will
concentrate on the M > a solution, for which there are two coordinate singularities at
r+ (the “outer” horizon) and r− < r+ (the “inner” horizon). To remove them, we do
a coordinate transformation to ingoing Kerr coordinates (v, r, θ, χ), where v = t + r∗
and r∗ and χ are defined by
r 2 + a2 a
dr∗ = dr and dχ = dφ + dr. (2.20)
∆ ∆
The definition of χ implies that φ = const. does not correspond to χ = const. For
example, in order to stay at χ = const. as you fall in (dr < 0), you need to rotate
too: dφ = −a/∆dr. In terms of ingoing Kerr coordinates the metric becomes
∆ − a2 sin2 θ a sin2 θ 2
2
ds = − dv 2 + 2dvdr − 2 (r + a2 − ∆)dvdχ
Σ Σ
"
2 2 2 2 2
# (2.21)
2 (r + a ) − ∆a sin θ 2 2 2
− 2a sin θdχdr + sin θdχ + Σdθ .
Σ
There are no more factors of ∆ in the numerators and the metric is regular at r+
and r− . The only remaining singularity is the curvature singularity at Σ = 0.
To draw the Penrose diagram is more difficult because the metric is not spheri-
cally symmetric. Since the curvature singularity at r = 0 only appears when θ = π2 ,
the Penrose diagram should look very different for θ 6= π2 and θ = π2 . In order to
represent both cases, it is customary to draw a Penrose diagram that is an amalgam
of the Penrose diagram for an observer falling in from the north pole (θ = 0) and of
that for an observer falling in in the equatorial plane (θ = π2 ) at fixed χ. Notice that
χ = const. means that φ is not constant, so the observer falling in at θ = π2 rotates
about the polar axis.
– 28 –
Figure 15. The Penrose diagram for the sub-extremal Kerr black hole.
Penrose diagram. This leads to the infinitely sequence of spacetime regions we saw
in Figure 14. Up to this point, the analysis is identical for θ = 0 and θ = π2 . The
only difference is that the Penrose diagram for θ = 0 has a curvature singularity at
r = 0, whereas the Penrose diagram for θ = π2 has none. In the amalgam Penrose
diagram for the Kerr spacetime, we indicate this by drawing an interrupted wavy
line at r = 0. The result is shown in Figure 15.
– 29 –
the metric of a ring with radius a. If you travel toward r = 0 from any other angle
than θ = π2 , you will not encounter the singularity. Instead, you will fall through the
interior of the “ring” and emerge in a new region of spacetime.
Close enough to the singularity, where r is small and negative, r < 0 and
|r| < 2M a/(r2 + a2 ), (2.24) is negative and the curve is timelike. Since χ is a
periodic coordinate with χ ≡ χ + 2π, the curve is also closed: it is a CT C.
However, it turns out that region X is unphysical. Much like in the case of
the sub-extremal RN solution of section 2.1.2, the inner horizon at r = r− becomes
a curvature singularity in the presence of the smallest perturbations to the Kerr
metric: at the inner horizon, perturbations are infinitely blueshifted, which leads to
divergences in the curvature scalars.
∂
Notation: to denote partial derivatives we use shorthand notations such as ∂t = ∂t
and ∂µ = ∂x∂ µ as well as X,µ = ∂µ X when applied to a tensor X.
– 30 –
p ∈ γ. This can be expressed as the demand that
d
ξp (f ) = (f ◦ γ(λ)) (3.1)
dλ p
where λ parametrises γ.
When ξ is smooth and everywhere non-zero, the set of integral curves form a
congruence: every point p ∈ M lies on a unique integral curve. Given any congruence
there is an associated one-parameter family of diffeomorphisms from M onto itself,
defined as follows: for each s ∈ R, define hs : M → M , where hs (p) is the point
parameter distance s from p along ξ, i.e. if p = γ(λ0 ) then hs (p) = γ(λ0 + s). These
transformations form an abelian group: the composition law is hs ◦ ht = hs+t , the
identity is h0 and the inverse is (hs )−1 = h−s .
This leads to the concept of the Lie derivative Lξ along the vector field ξ. When
applied to a vector V at point p it is defined as
Vp − (hδλ )∗ Vh−δλ (p)
(Lξ V )p = lim . (3.3)
δλ→0 δλ
Here (hs )∗ denotes the push-forward associated with the group element hs , which
maps a vector defined at p to a vector defined at hs (p). It can be shown (exercise)
that the Lie derivative of a vector is equal to the bracket
(Lξ V )p = [ξ, V ]p
The Lie derivative can be applied to any tensor on M , with appropriate defini-
tions analogous to (3.3). In particular, given a metric tensor g on M , we can take
its Lie derivative. Since the components of g transform covariantly (whereas vector
fields transform contravariantly), the Lie derivative of g involves the pull-back h∗s ,
which maps a covector at hs (p) to a covector at p:
gp − (hδλ )∗ ghδλ (p)
(Lξ g)p = lim . (3.4)
δλ→0 δλ
It can then be shown that
– 31 –
If the metric does not change under the transformation hs we say that the transfor-
mation is an isometry and that the metric has a symmetry. In this case Lξ g = 0,
which implies
∇µ ξν + ∇ν ξµ = 0. (3.6)
This is known as Killing’s equation and a vector ξ that satisfies (3.6) is a Killing
vector. Note that this equation implicitly involves the metric, which is hidden in
∇. Finding the symmetries of a spacetime amounts to finding the vectors that sat-
isfy the Killing equation; this can be done either by inspection or by integrating (3.6).
A useful fact to know when looking for Killing vectors is the following. Given
any vector field ξ we can (locally) find a coordinate system (x1 , x2 , x3 , x4 ) in which
ξ takes the form
∂
ξ= , (3.7)
∂x1
i.e. ξ µ = (1, 0, 0, 0). This implies that
∂gµν
(Lξ g)µν = . (3.8)
∂x1
Hence, if we find a coordinate system in which gµν is independent of one of the
∂
coordinates, say y, then we know that ∂y must be a Killing vector since
∂gµν
(L ∂ g)µν = = 0. (3.9)
∂y ∂y
The converse statement is not true: if gµν depends on all the coordinates, that does
not mean that that g has no Killing vectors.
reveals the two Killing vectors k = ∂t and m = ∂φ . This means that Schwarzschild
spacetime is static and axisymmetric:
– 32 –
Schwarzschild spacetime is not just axisymmetric but spherically symmetric: it
has two more spacelike Killing vectors, which together with ∂φ generate the symme-
tries of a sphere — the SO(3) transformations. The other two Killing vectors cannot
be “read off” from the metric but they can be found by solving Killing’s equation.
∆ − a2 sin2 θ a sin2 θ 2
2 Σ
dt2 + dr2 − 2 r + a2 − ∆ dtdφ
ds = −
Σ ∆ Σ
2
2 2 2 2
(r + a ) − ∆a sin θ
+ Σdθ2 + sin2 θdφ2
Σ
A number of uniqueness theorems, proved between 1967 and 1975, have estab-
lished the remarkable fact that the two-parameter Kerr family (parameters M and
J) is the unique stationary asymptotically flat solution of the vacuum Einstein equa-
tions and that the three-parameter Kerr-Newman family (parameters M , J and Q)
is the unique stationary asymptotically flat black hole solution of the electrovacuum
Einstein-Maxwell equations.
If we deform the curve by a small amount δxµ (λ) and require the action to be
stationary with respect to this variation, we obtain the Euler-Lagrange equations:
δI d µ
µ
= 0 =⇒ ∇(λ) ẋµ ≡ ẋ + Γµρσ ẋρ ẋσ ∝ ẋµ . (3.11)
δx dλ
– 33 –
The dot denotes differentiation with respect to λ. The solutions to (3.11) are the
geodesics in (M, g). If the right hand side vanishes, ∇(λ) ẋµ = 0, the geodesic is said
to be affinely parametrised and λ is an affine parameter. If we set λ = τ then (3.11)
reduces to
d2 xµ ρ
µ dx dx
σ
+ Γ ρσ = 0. (3.12)
dτ 2 dτ dτ
For the purpose of finding geodesics we can cast (3.10) into a more useful form by
introducing an independent function e(λ) (called the auxiliary field or einbein):
1 λB
Z
µ
dλ e−1 (λ)gµν ẋµ ẋν − m2 e(λ) .
I (x , e) = (3.13)
2 λA
To see that this is equivalent to (3.10), notice that from δI/δe = 0 we get
1p 1 dτ
− e−2 gµν ẋµ ẋν − m2 = 0 =⇒ e = −gµν ẋµ ẋν = (3.14)
m m dλ
and from δI/δxµ we get
d µ
ẋ + Γµρσ ẋρ ẋσ = (e−1 ė)ẋµ . (3.15)
dλ
This equation is already the geodesic equation ∇(λ) ẋµ = (e−1 ė)ẋµ and (3.14) re-
lates e to the choice of parameter λ. To turn this into the equation for an affinely
parametrised geodesic, we need to set ė = 0. Then dτ /dλ = const., which implies
that λ = aτ + b.
xµ → xµ + αk µ (3.16)
– 34 –
We see that k being Killing leads to a symmetry of the particle action. Associated
with this symmetry is a quantity (charge) that is conserved along geodesics:
– 35 –
3.2.2 Example 2: Kerr
The vectors k and ` are also Killing vectors for the Kerr metric (2.15,2.17):
2
2 dr 2M r 2
ds = Σ + dθ + (r2 + a2 ) sin2 θdφ2 +
2
a sin2 θdφ − dt − dt2 .
∆ Σ
In Schwarzschild spacetime, you (or a test particle) can travel along integral curves of
k = ∂t anywhere outside the horizon — you will then appear stationary5 to observers
at infinity, since your position in space is not changing. This is possible because g00 is
negative everywhere for r > 2M , so that k 2 = g00 is negative and the integral curves
of k are timelike. It turns out that this is not the case in Kerr spacetime: there
is a region around the outer horizon, called the ergosphere or ergoregion, in which
it is impossible for you (and any test particle) to remain stationary with respect to
observers at infinity — everything rotates. This happens because
2M r
g00 = − 1 − (3.25)
Σ
becomes positive in the region
2M r
> 1 =⇒ ξ(r) ≡ Σ − 2M r = r2 + a2 cos2 θ − 2M r < 0, (3.26)
Σ
part of which lies outside the outer horizon r = r+ when a 6= 0. This is easy to
see by noting that the equation for ξ(r) is a parabola with roots at r̃± = M ±
√ √
M 2 − a2 cos2 θ and r̃+ is bigger than r+ = M + M 2 − a2 for θ 6= 0, π. Hence g00
is positive in the ellipsoidal region r+ < r < r̃+ , which has a maximum extent on the
√
equator θ = π2 where r̃+ = M + M 2 + a2 .
In the ergoregion, orbits of ∂t are not timelike, so you cannot travel along them
and remain stationary with respect to observers at infinity. In order for a curve
xµ = (t, r, θ, φ) to be timelike, its tangent vector uµ = dxµ /dτ must satisfy u2 = −1.
But in the ergoregion, every term in u2 = gµν uµ uν is positive except for gtφ ut uφ ,
which means that uφ = dφ/dτ must be non-zero. Moreover, since ut > 0 for a
future-directed worldline and gtφ < 0, uφ must be positive. Any timelike worldline is
therefore dragged around in the direction of rotation of the black hole. This effect is
an example of frame dragging.
– 36 –
that, once in the ergoregion, it decays into two particles, one of which falls into the
black hole and one of which escapes the ergoregion again. Denote the energy of the
initial particle by E = −kµ pµ and that of the final particles by E1 = −kµ pµ1 and
E2 = −kµ pµ2 , where k is the asymptotically timelike Killing vector. Conservation of
four-momentum pµ = pµ1 + pµ2 implies that E = E1 + E2 . The fact that k is spacelike
in the ergoregion allows you to arrange the decay such that the energy of the particle
that falls into the black hole is negative with respect to you: E1 = kµ pµ1 < 0. To see
this, choose a coordinate system in which k µ = (0, x, 0, 0). Prepare your decay such
that pµ1 = (1, y, 0, 0), where y is small enough for p1 to be timelike and adjusted such
that xy > 0. Then E1 is negative and E2 > E: the particle that reemerges from the
ergoregion has more energy than the particle you sent in.
3.3 Hypersurfaces
Let S(x) be a smooth function of the spacetime coordinates. Consider the family of
hypersurfaces S(x) = const. The normal vector to S(x) = const. is given by
nµ = f (x)g µν ∂ν S, (3.27)
On the spatial slice Σ, h is just the positive definite three-dimensional flat Euclidean
metric. Next, consider a timelike hypersurface Σ defined by S(x) = x1 = const. with
– 37 –
normal nµ = (0, 1, 0, 0). The induced metric on it is hµν = diag(−1, 0, 1, 1), which
carries Lorentzian signature on the subspace of the tangent space spanned by vectors
tangent to Σ.
Claim: For a null hypersurface Σ, the integral curves in Σ of the normal ` are null
geodesics.
If the integral curves of `µ are not affinely parametrised, then we can always
find some function h(x) such that `˜µ = h(x)`µ has affinely parametrised integral
˜ `˜µ = 0. These curves are called the null geodesic generators of the
curves, i.e. `.∇
null hypersurface.
– 38 –
some function f , ξ = f ` on Σ. It follows that ξ satisfies
ξ µ ∇µ ξ ν = f `µ ∇µ (f `ν )
= f `µ `ν ∇µ f + f 2 `µ ∇µ `ν
(3.32)
= f `ν `µ ∇µ f
≡ κξ ν
`µ = f (g vr ∂r S, g rr ∂r S, 0, 0) = f 1, 1 − 2M
r
, 0, 0 = f (1, 0, 0, 0) = f ∂v . (3.34)
– 39 –
The surface gravity κ is evaluated most easily in IEF coordinates. The equa-
tion (3.32) defining κ is a vector equation so we only need to evaluate one of its
components. For ξ = ∂v , the µ = v component of ξ σ ∇σ ξ µ is
Now
1 1 M
Γvvv = g vσ (gvσ,v + gσv,v − gvv,σ ) = g vr (−gvv,r ) = 2 , (3.38)
2 2 r
which, on the horizon, reduces to Γvvv = 1/4M . Substitution into (3.37) then gives
κ = 1/4M .
– 40 –
equilibrium black hole in the presence of the electromagnetic field is therefore fully
characterised by the three numbers M , J and Q.
However, the electromagnetic field is only one among many matter fields in
Nature. Through a series of theorems known as “no hair” theorems it has been es-
tablished that, in general, black holes have no other properties besides mass, angular
momentum and charge (of whichever fields are present). To illustrate the general
spirit of these proofs, we look at a simple example here.
Claim: A static black hole cannot be the source of a real (minimally coupled) scalar
field.
Proof: Let φ be a real scalar field that satisfies the Klein-Gordon equation
∇2 φ − m2 φ = 0. (3.41)
By the word “static” in the claim we mean that scalar field and spacetime (i.e. the
metric field) have settled down to equilibrium, which implies that the scalar field
does not vary in time anymore: “φ̇ = 0”. In covariant language the statement is that
there exists a timelike Killing vector k (= ∂t ) of the metric such that k µ ∇µ φ = 0.
Given k we can foliate the spacetime by t = const. hypersurfaces whose unit normals
n are proportional to k, n ∝ k. The black hole horizon is a Killing horizon of k.
where V ⊂ M is the spacetime region outside of the horizon. From the Klein-Gordon
equation (3.41) it follows immediately that I = 0. On the other hand, using Stokes’
theorem to integrate by parts the first term, we get
√
Z Z
µν 2 2 4
I = (−g ∇µ φ∇ν φ − m φ ) −gd x + φ∇µ φdS µ (3.43)
V ∂V
where ∂V is the boundary of the region V and dS µ is the normal surface element on
∂V .
We now show that the surface integral in (3.43) vanishes identically. There
are four contributions to ∂V : the horizon r = 2M , the surfaces t = ±∞ and the
surface r → ∞ (the “sphere at infinity”). On the horizon and on the t = ±∞
surfaces, the normal surface element is proportional to k, so the integrand vanishes:
∇µ φdS µ ∝ k µ ∇µ φ = 0. We are left with the boundary term at r → ∞. To see that
– 41 –
this vanishes too, note that since the spacetime is static and asymptotically flat, the
metric takes the form
which implies that the field must fall off like φ ∼ 1/r or faster. For a sphere of radius
r the normal surface element takes the form dS ∝ rdr∂r , so the leading contribution
in the integrand is φ∇µ φdS µ ∝ rdrφ∂r φ ∼ dr/r2 . The integral of this vanishes in
the limit r → ∞.
Hence, the first integral in (3.43) must vanish identically. In terms of the induced
metric hµν = gµν − nµ nν on hypersurfaces of t = const. the integrand reads
having dropped the second term due to the fact that n ∝ k and k.∇φ = 0. The
last inequality follows from the fact that h is positive definite (see section 3.3.1).
Therefore, the integrand is always negative or zero and the integral is zero only if
the integrand vanishes everywhere. For m > 0 this implies that φ = 0 everywhere.
For m = 0 it implies that ∇µ φ = 0 =⇒ φ = const. everywhere, which is physically
indistinguishable from φ = 0 everywhere.
– 42 –
Given a d-dimensional spacetime (M, g), a differential form of order k (or k-form) on
M is a totally antisymmetric covariant k-index tensor. By definition, zero-forms are
functions and one-forms are covectors. Given a (local) coordinate chart xµ on M , the
coordinate differentials dx1 , dx2 , . . . , dxd are examples of 1-forms and together they
form a complete basis. A general one-form can therefore be written as F = Fµ dxµ ,
where Fµ are the components of F . A k-form F can be written as
1
F = Fµ µ ...µ dxµ1 ∧ dxµ2 . . . ∧ dxµk , (3.48)
k! 1 2 k
where ∧ is the antisymmetric wedge product: dxα ∧ dxβ = −dxβ ∧ dxα . The exterior
derivative of a k-form F is defined as the (k + 1)-form dF whose components are
The electromagnetic tensor defines the two-form F = 12 Fµν dxµ ∧dxν . In terms of
the four-potential A = Aµ dxµ it is given by F = dA, which implies that
dF = d(dA) = 0. Maxwell’s equations are completely encapsulated in the two
equations
d ? F = 4π ? j
(3.53)
d?j =0
where j = jµ dxµ . The second equation is the continuity equation. The Hodge star ?
exchanges magnetic and electric fields, i.e. the magnetic and electric components of
– 43 –
F are interchanged in ?F . Consider the electric charge in some spacelike region B,
which we define as
Z Z Z
1 1
Q(B) = ?j = d?F = ?F. (3.54)
B 4π B 4π ∂B
The last equality follows from Stoke’s theorem. This charge is conserved, which
can be shown as follows. Take any two spacelike surfaces B1 and B2 . Consider the
cylindrical region V , which is bounded by B1 and B2 and chosen to be large enough
that all the sources (i.e. the support of j) lie inside V . In other words j = 0 on
the cylindrical boundaries and outside of V . An illustrative example is shown in
Figure 16, where V corresponds to the solid cylinder-like region between B1 and B2 .
By virtue of the continuity equation we have:
Z
0= d?j
ZV
= ?j
∂V
Z Z
= ?j − ?j (3.55)
B1 B2
Z Z
1 1
= ?F − ?F
4π ∂B1 4π ∂B2
= Q(B1 ) − Q(B2 ),
Let’s turn to the analogous situation for Kµν . The associated two-form is defined
as K = 21 Kµν dxµ ∧ dxν . How does K show up in Einstein’s equations? First note
that (∇µ ∇ν − ∇ν ∇µ )k σ = R σρµν k ρ . For σ = µ we get ∇µ ∇ν k µ = Rρν k ρ or
∇µ K µ ν = Rρν k ρ , (3.56)
which looks just like ∇µ F µ ν = 4πjν . Einstein’s equations Rµν − 21 Rgµν = 8πGTµν
can be written as
1
Rρν = 8πG Tρν − gρν T . (3.57)
2
Let
1
ζν ≡ Rρν k ρ (3.58)
8πG
and ζ = ζν dxν . Then Einstein’s equations take the form
d ? K = 8πG ? ζ
(3.59)
d ? ζ = 0.
– 44 –
Figure 16. Two spacelike slices with subregions B1 and B2 . V is the spacetime volume
traced out between B1 and B2 . The regions are chosen in such a way that there are no
sources outside of V.
– 45 –
Law Thermodynamics Black Holes
The temperature T is constant The surface gravity κ is constant
0th throughout a system in thermal over the event horizon of a station-
equilibrium ary black hole
1
1st
P
dE = T dS + i µi dNi dM = 8π κdA + ΩH dJ + ΨH dQ
2nd dS ≥ 0 dA ≥ 0
T cannot be reduced to zero by a κ cannot be reduced to zero by a
3rd
finite number of operations finite number of operations
A young graduate student named Jacob Bekenstein disagreed. He noted that the
second law of thermodynamics would be violated if black holes had no entropy, since
one could throw arbitrarily entropic objects into the black hole, thereby lowering
the total entropy of the exterior universe. He claimed that black holes must have an
entropy SBH ∝ A to save the second law of thermodynamics. Bekenstein’s generalised
second law states that
dStotal ≥ 0 (4.1)
where Stotal = Sexternal + SBH . In 1974, it was Hawking who announced that black
holes are hot and radiate just like any hot body with a temperature
~κ
TH = , (4.2)
2πkB
from which it follows that a black hole has an entropy given by
A
SBH = (4.3)
4G~
– 46 –
the Bekenstein-Hawking entropy.
√ Since G~ has units of length squared, we may
define the Planck length lP = G~ so that the Bekenstein-Hawking entropy
1A
SBH = (4.4)
4 lP2
can be interpreted as a quarter of the area of the black hole, counted in units of lP2 .
So, after all, the analogy of table 1 between black hole mechancis and thermody-
namics seems to be more than a formal peculiarity: if one identifies the temperature
with ~κ/2πkB it becomes a physical unification. In the next sections we will discover
the origin of the results discussed here.
• The surface area A of the horizon is defined at time t0 as the area of the
intersection of the hypersurface Σ of constant t = t0 with the horizon H defined
by r = r+ . For the Kerr-Newman metric, the induced line-element on the
intersection of the two hypersurfaces t = const. and r = const. is
(r + a2 )2 − ∆a2 sin2 θ
2
2 µ ν 2
ds = hµν dx dx = Σdθ + sin2 θdφ2 . (4.6)
Σ
√ 2
On H ∩ Σ we have ∆ = 0 and therefore h = (r+ + a2 ) sin θ, so that
Z √ Z 2π Z π
2
A= hdθdφ = dφ dθ(r+ + a2 ) sin θ = 4π(r+ 2
+ a2 ). (4.7)
H∩Σ 0 0
– 47 –
2
• The angular velocity ΩH = a/(r+ + a2 ) was derived in (3.39) above: it is the
non-zero constant in χ = ∂t + ΩH ∂φ that makes χ a Killing vector tangent to
the generators of the horizon.
The second term in the first line was dropped because Aµ vanishes as r → ∞.
We shall assume that the curves are geodesics and that λ is an affine parameter
so that t.∇tµ = 0. The congruence is then called an (affinely parametrised) geodesic
– 48 –
congruence. For a null congruence we have t2 = 0 and for a timelike congruences
we can set t2 = −1. The vectors t and ηa commute because they form a coordinate
basis:
[t, ηa ] = 0 =⇒ tµ ∇µ ηaν − ηaµ ∇µ tν = 0 (4.12)
D
for a = 1, 2, 3. Since tµ ∇µ = Dλ is the covariant derivative in the direction of the
congruence, this says that the rate of change of the connecting vectors along a curve
of the congruence is given by
D ν
η = tµ ∇µ ηaν = ηaµ ∇µ tν ≡ Bµν ηaµ , (4.13)
Dλ a
D
where Dλ := tµ ∇µ .
The tensor Bµν measures the geodesic deviation, i.e. the extent to which the
connecting vector fails to be parallelly transported along curves of the congruence.
When ηa is parallelly transported, Bµν is zero.
– 49 –
spacelike to both ` and n: ηi .n = ηi .` = 0 for i = 1, 2. The projector onto the two-
dimensional spacelike subspace of the tangent space spanned by η1 and η2 is given
by
Pνµ = δνµ + nµ `ν + `µ nν . (4.15)
It is a quick exercise to check that P has the right properties: P2 = P,
Pνµ nν = Pνµ `ν = 0 and Pνµ ηiν = ηiν .
where B̂νµ ≡ Pρµ Bσρ Pνσ is the projection of Bνµ into the two-dimensional subspace. In
the third manipulation we used the fact that `.∇Pνµ = 0, which follows from the
definition (4.15) and the fact that n and ` are parallelly transported.
Any linear transformation on a vector space can be decomposed into three com-
ponents representing stretch, rotation and shear. For B̂νµ this can be done by setting
1
B̂νµ = θPνµ + σ̂νµ + ω̂νµ (4.18)
2
where
and TrB̂ ≡ B̂µµ is the trace of B̂νµ . The three components of B̂νµ contain information
about the geometry of the congruence. For example, it can be shown that ω̂ = 0
implies that the tangent vector field ` is normal to a family of null hypersurfaces (the
converse statement holds too). We will prove this below, but in order to do so, we
need two results first.
– 50 –
We omit the proof — for more detail and references containing proofs see section B.3
in Wald.
using the definition of Pνµ in (4.15). The result then follows immediately due to the
total antisymmetrisation on the indices.
Claim: The tangent field ` is normal to a family of null hypersurfaces if and only if
ω̂ = 0.
Proof: First, assume that ω̂ = 0. Then B̂[µν] = 0, which together with the previous
lemma implies
0 = `[µ B̂νρ] = `[µ Bνρ] = `[µ ∇ν `ρ] (4.21)
and by Frobenius’ theorem ` is hypersurface orthogonal. Conversely, assume that ` is
hypersurface orthogonal. Then it follows from Frobenius’ theorem that
`[µ B̂νρ] = `[µ Bνρ] = 0 and therefore
da
= θa (4.24)
dλ
where θ is the “stretch” term in B̂νµ defined in (4.18). From (4.24) we see that θ
indeed measures the expansion of the geodesics in the congruence: when θ > 0 the
– 51 –
geodesics diverge, and when θ < 0 the geodesics converge. Raychoudhuri’s equation
is a statement about the rate of change of θ:
dθ
= `.∇θ = `.∇B̂µµ = `.∇(Pρµ Bσρ Pµσ ) = `.∇(Bσρ Pρµ Pµσ )
dλ
= `.(Bσρ Pρσ ) (using P 2 = P )
= Pρσ `µ ∇µ Bσρ (using `.∇P = 0 from (4.15))
= Pρσ `µ ∇µ (∇σ `ρ )
= Pρσ `µ (∇µ ∇σ `ρ − ∇σ ∇µ `ρ + ∇σ ∇µ `ρ )
= Pρσ [`µ Rρ νµσ `ν + ∇σ (`µ ∇µ `ρ ) − (∇σ `µ )∇µ `ρ ]
= (δρσ + `σ nρ + nσ `ρ )`µ R[ρ νµσ] `ν − Pρσ (∇σ `µ )∇µ `ρ (using `.∇`ρ = 0)
= −Rµν `µ `ν − Pρσ Bσµ Bµρ (4.25)
= −Rµν `µ `ν − Pρσ Bαρ δµα Bσµ
= −Rµν `µ `ν − Pρσ Bαρ (Pµα − `α nµ − nα `µ )Bσµ
= −Rµν `µ `ν − Pρσ Bαρ Pµα Bσµ (using `µ Bσµ = Bαρ `α = 0)
= −Rµν `µ `ν − Pγσ Pργ Bαρ Pβα Pµβ Bσµ (using P = P 2 twice)
= −Rµν `µ `ν − B̂νµ B̂µν
= −Rµν `µ `ν − 1 1
2
θPνµ + σ̂νµ + ω̂νµ 2
θPµν + σ̂µν + ω̂µν
= −Rµν `µ `ν − 1 2
2
θ + ω̂ 2 − σ̂ 2 .
We also made use of the symmetries of the Riemann tensor. To summarise, the
Raychaudhuri equation for null geodesic congruences is:
dθ 1
= −Rµν `µ `ν − θ2 − σ̂ 2 + ω̂ 2 . (4.26)
dλ 2
So far, this is a purely geometrical statement about null geodesic congruences in a
Lorentzian manifold — it contains no physics. The physics comes from Einstein’s
equations
1
Rµν − Rgµν = 8πGTµν =⇒ Rµν `µ `ν = 8πGTµν `µ `ν , (4.27)
2
from which we obtain
dθ 1
= −8πGTµν `µ `ν − θ2 + ω̂ 2 − σ̂ 2 . (4.28)
dλ 2
The first term in this equation is constrained when matter satisfies the Weak Energy
Condition (WEC):
Tµν tµ tν ≥ 0 for all timelike t, (4.29)
which says that the energy density in the frame of any timelike observer is non-
negative. It then follows by continuity that Tµν `µ `ν ≥ 0 for the null vector `. This
– 52 –
has an important consequence for the expansion parameter θ:
Hence, if the geodesics are converging at any point on the congruence, they must
continue to converge and the expansion goes to −∞. This signals a breakdown of the
congruence and neighbouring geodesics then meet at some point q, called a caustic
or focus.
In the special case of a Killing horizon, the null geodesic generators are integral
curves of the Killing vector, which generates a symmetry of the spacetime. In that
case θ = 0 everywhere (this can be proven by studying B̂µν directly when the surface
is a Killing horizon).
– 53 –
4.5 Causal Structure
Given a spacetime (M, g), a timelike curve in M is a curve γ ∈ M with an everywhere
timelike tangent vector. A causal curve is a curve with a nowhere spacelike tangent
vector.
n Clearly,
o a timelike curve is also causal. If there
n exists a o
future directed
causal curve γ from p to q we say that p is to the causal
timeliken chronological past of q and
p≺q
o
we write p q . Given a subset U ⊂ M we define the
Analogous definitions can be given for the chronological future I + (U ) and the causal
future J + (U ). For any set S, we denote the (topological) closure of S (i.e. S together
with its limit points) by S, the interior of S (i.e. the largest open set contained in
S) by S̊ and the boundary of S by ∂S = S \ S̊.
The following relations between the chronological and causal past are always
true:
I − (U ) = J˚− (U )
(4.34)
I − (U ) = J − (U ) .
By consequence of the equivalence principle, every spacetime looks locally like Minkowski
space: for any point p ∈ M , there exists a so-called convex normal neighbourhood in
which the local causal structure is that of Minkowski space.
– 54 –
Some facts about the causal and chronological relations:
– 55 –
4.6 Hawking’s Area Theorem
The area theorem is a fully dynamical result about black hole spacetimes, which
means that it cannot make use of such concepts as Killing horizons, which rely on
particular spacetime symmetries. Hence we need another way to define the event
horizon of a black hole. A natural definition can be made in terms of the spacetime’s
causal structure: for an asymptotically flat spacetime (M, g), the future horizon H +
is defined as the boundary of the causal past of future null infinity, ∂J − (J + ). To
give an example, the horizon in the the Penrose diagram of the collapsing star in
Figure 11 corresponds to the dashed black line.
Note that, strictly speaking, J + is a set of points that is not part of the original
spacetime manifold (M, g). However, J + is contained in the conformal compactifi-
cation (M̃ , g̃), and since the causal structure is invariant under conformal transfor-
mations, the set of points H + obtained by taking the boundary of the causal past of
J + will be contained in the original spacetime manifold (M, g).
Since the horizon H + is the boundary of the causal past of a set, it is a null
hypersurface and its null geodesic generators cannot leave H + by the theorems in
the previous section. We are now in a position to prove Hawking’s area theorem.
Theorem (Hawking): The area of H + cannot decrease if the weak energy condition
holds and cosmic censorship holds (no singularities on or outside the horizon).
Proof: By the area of H + we mean the area of the intersection of H + with a space-
like hypersurface. Now H + is a null geodesic congruence and if we can show that the
expansion θ satisfies θ ≥ 0 everywhere on H + then the result follows immediately.
Suppose that θ = θ0 < 0 at some p ∈ H + . If the WEC holds, there is a conjugate
point q at finite affine parameter-distance along the geodesic γ through p and all the
points r ∈ γ beyond q are timelike related to p by theorem 9.3.10 in Wald. So γ must
have left H + at q. But that contradicts the Penrose theorem that null generators of
∂J − (J + ) can have no future endpoints on ∂J − (J + ). Hence θ ≥ 0 everywhere on
H + and the theorem follows.
What does the area theorem imply for the black hole spacetimes we have come
across already? For a Schwarzschild black hole, the area is proportional to the mass:
A = 4π(2M )2 = 16πM 2 . Hence, if a physical process starts with a Schwarzschild
– 56 –
black hole and ends with another, the mass cannot decrease. That this relies on the
WEC is clear: the situation would be different if you were able to throw negative
2
energy into the black hole. For a Kerr black hole, A = 4π(r+ + a2 ) where r+ = M +
√
M 2 + a2 . Hence, the mass of a Kerr black hole could in principle decrease in the
course of a physical process, but it would have to be compensated by an increase in
the angular momentum.
5 Hawking Radiation
In this section, we will study the physics of black holes taking into account quantum
mechanical effects. In order to do so, we will need to become acquainted with the
methods of quantum field theory (QFT) in curved spacetime. First, a quick review of
(free, scalar) QFT in flat space. Remember that four-vectors are denoted by standard
letters and (spatial) three-vectors are denoted by bold-face letters, e.g. p stands for
the four-momentum and p stands for the three-momentum.
The equation of motion for a real scalar field φ is the Klein-Gordon equation
φ − m2 φ = 0 (5.1)
can be expanded as
Z
d3 p ap ψp (x) + a∗p ψp∗ (x) ,
φ(x) = (5.3)
where ap and a∗p are the expansion coefficents (the coefficients of ψp and ψp∗ must
be complex conjugates of each other in order for φ(x) to be real). Upon second
quantisation, the coefficients in (5.3) become operators ap and a†p (we denote the
Hermitian conjugate by a dagger and we will omit the hats on operators in most cases
for notational convenience, but from now on the field and its coefficients should be
6
Note that due to the signature of the metric p0 > 0 ⇐⇒ p0 < 0.
– 57 –
thought of as operators). In order that the standard canonical commutation relations
for the field [φ(t, x), φ̇(t, y)] = iδ(x − y) and [φ(t, x), φ(t, y)] = 0 translate to the
simple relations [ap , a†q ] = δ(p − q) and [ap , aq ] = 0 on the operator coefficients, we
normalise the mode functions using the Klein-Gordon inner product
←
→
Z
(f, g) = i d3 xf ∗ ∂t g (5.4)
←→
where f ∗ ∂t g = f ∗ (∂t g) − (∂t f ∗ )g. The required normalisation conditions are then
p 3
∗
(ψp , ψq ) = δ(p − q) and (ψp , ψq ) = 0, fixing Np = 1/ 2p0 (2π) 2 up to an arbitrary
complex phase. The vacuum state is defined by ap |0i = 0 ∀ p.
At first sight it seems that this definition of the vacuum state depends on a
particular choice of frame: we chose an inertial frame with coordinates xµ in which
we imposed the positive-frequency condition p0 > 0. This condition will look dif-
ferent in a new coordinate system x̃µ . However, it turns out that for all inertial
reference frames, i.e. those related to xµ by a Lorentz transformation x̃µ = Λµν xν ,
the positive-frequency condition leads to the same vacuum. To see this, consider
positive-frequency mode functions in a new frame x̃µ :
µ
ψ̃p = Np eipµ x̃ . (5.5)
in terms of these modes and the new vacuum state will be defined by ãp |0̃i = 0 ∀ p.
To prove that the two vacua are the same, |0̃i = |0i, we just need to show that
(The converse implication then follows by symmetry.) It follows from p̃µ = pν Λνµ
that ψ̃p ∝ ψp̃ :
0 12 0 12
1 ipµ x̃µ p̃ 1 ip̃µ xµ p̃
ψ̃p = p 3 e = 0 3 e = ψp̃ . (5.8)
p0
p
2p0 (2π) 2 p 2p̃0 (2π) 2
Furthermore, the mode ψp̃ is a positive-frequency mode because p0 > 0 =⇒ p̃0 > 0.7
Hence, the coefficient in front of the positive-frequency mode ψ̃p with momentum p
7
This is true because by Lorentz transformations we really mean (proper) orthochronous Lorentz
transformations — orthochronous meaning that they preserve the direction of time. To see more
explicitly why this implies that the sign of the time-component of the four-momentum p is pre-
0
served, recall that for a (proper orthochronous) Lorentz transformation, Λ0 = γ = 1/ 1 − |v|2 /c2
and Λ0i = −γvp i /c, where v is the relative speed between the two reference frames. Then
p̃0 = Λ0µ pµ = γ( m2 + |p|2 − v.p/c) ≥ γ|p|(1 − |v| cos θ/c) > 0 as required. We used the fact that
γ is positive, |v| < c and cos θ ≤ 1.
– 58 –
in frame x̃µ will be the same as the coefficient in front of the positive-frequency mode
ψp̃ with momentum p̃ in frame xµ : ãp = ap̃ . By consequence, we have
which proves that |0i = |0̃i. Hence, the vacuum state in Minkowski spacetime is
independent of which inertial frame you use to define it. If the vacuum appears
empty of particles to observers in one inertial frame, it will appear empty of particles
to observers in all inertial frames. This is a direct consequence of the fact that
positive-frequency modes in one inertial frame are linear combinations (in this case
particularly simple ones) of positive-frequency modes in another inertial frame.
• FLRW spacetimes.
• Minkowski space with a point removed. For any infinite spacelike hypersurface,
the removed point will either be to the future or past of it. But then there
will exist past/future inextendible causal curves that end at the removed point
and therefore fail to intersect the surface. This reflects the fact that specifying
initial conditions on the surface is not enough to fix a consistent solution to
the equation of motion on the whole spacetime: anything can happen at the
removed point (it could be a source or a sink to the field) and so additional
boundary conditions need to be specified there.
– 59 –
• Spacetimes with a naked singularity (such as the M < 0 Schwarzschild black
hole). In that case, there will be timelike curves that start and/or end on the
singularity and avoid crossing any candidate Cauchy surface we might think of.
Again this reflects the fact that we don’t know what happens at the singularity
— we would have to specify additional boundary conditions on it in order to
have a well-posed problem.
Assuming that (M, g) admits a Cauchy surface we study a free, real scalar field
on it which satisfies the Klein-Gordon equation
φ − m2 φ = 0 (5.10)
Deep and difficult theorems in the theory of differential equations say that, when
there is a Cauchy surface, we can always find some (highly non-unique) orthonormal
basis of solutions {ψi , ψi∗ } such that
(ψi , ψj ) = +δij
(ψi∗ , ψj∗ ) = −δij (5.12)
(ψi , ψj∗ ) = 0.
Since there exists a plethora of such orthonormal bases, the “vacuum” state obtained
by doing standard second quantisation with one such basis in general has no physical
meaning.
However, if there exists a timelike Killing vector k = ∂t , i.e. we are dealing with
a stationary spacetime, then k commutes with the Klein-Gordon operator − m2
and one can find simultaneous eigenmodes of the two operators. The second quan-
tisation procedure can then be used to define a vacuum with a meaningful physical
– 60 –
interpretation, by designating a special subset of these modes as “positive-frequency
modes”. This special set is found as follows. First, note that k is antihermi-
tian: (f, kg) = −(kf, g). Hence it has imaginary eigenvalues kui = −iωi ui with
ωi ∈ R, and eigenfunctions with distinct values are orthogonal.8 We can therefore
find a basis {ψi } satisfying (5.12) such that kψi = −iωi ψi with ωi > 0 and use
this condition as the generalised “positive-frequency” condition with respect to k
(this of course reduces to the original condition in Minkowski spacetime, since there
kψp = ∂t ψp = ip0 ψp = −ip0 ψp with p0 > 0). If we second quantise with these modes,
then the state defined by ai |0i = 0 ∀ i, where ai are the operator coefficients of the
positive-frequency modes in the expansion of the field operator φ(x), is the ground
state of a physically meaningful Hamiltonian
Z
Ĥ = dS µ T̂µν k ν , (5.13)
t=const.
which can be interpreted as the operator corresponding to the total energy on the
surface t = const. So the state |0i is the minimum-energy state on the surface
t = const. in the sense that it is the minimum-eigenvalue eigenstate of Ĥ. In flat
spacetime this just reduces to Z
Ĥ = d3 xT̂00 . (5.14)
The Hamiltonian Ĥ dictates the time-evolution of all operators through the Heisen-
berg equation of motion:
dÔ h i
= i Ĥ, Ô , (5.15)
dt
i.e. it is the operator that generates time-translations.
– 61 –
cases, the respective expansions of the field are then
X † ∗
X
† ∗
φ(x) = ai f i + ai f i = bi gi + bi gi (5.16)
i i
where the modes have been normalised with respect to the Klein-Gordon inner prod-
uct in each case such that [ai , a†j ] = δij and [bi , b†j ] = δij . Now, since {fi } is a basis
we can also expand any function gi in terms of it:
X
gi = Aij fj + Bij fj∗ . (5.17)
i
The coefficients Aij and Bij are called Bogoliubov coefficients and the transformation
between different bases is called a Bogoliubov transformation. Using the normalisa-
tion conditions (5.12) it can then be shown that the Bogoliubov coefficients satisfy
the following relations: X
Aik A∗jk − Bik Bjk
∗
= δij
k
X (5.18)
Aik Bjk − Bik Ajk = 0.
k
In matrix notation this can be written as AA† − BB† = 1 and ABT − BAT = 0.
We can also relate the different operator coefficients to each other, e.g.
bi = (gi , φ) = j A∗ij aj − Bij∗ a†j .
P
The procedure above defines a vacuum associated with the modes {fi , fi∗ } —
call it the “in-vacuum” — as the state that satisfies ai |0iin = 0 ∀ i. In a stationary
reference frame in region B, i.e. a frame that follows an integral curve of k T , this
vacuum will appear empty. What about region T ? More precisely, what is the
expected number hNi i of particles of species (or momentum) i in the state |0iin , if
evaluated in a stationary reference frame in region T ? It is given by the expectation
value of the number operator Ni = b†i bi (no summation over i), which is (exercise) :
X
hNi i = in h0|b†i bi |0iin = Bij Bij∗ = (BB† )ii (no summation over i). (5.19)
j
If this is non-zero, there is particle production. Another way to say this is that the
in-vacuum is different from the “out-vacuum” defined by bi |0iout = 0 ∀ i. Hence, a
changing spacetime geometry generically causes particle production.
– 62 –
Figure 17. Minkowski space in 1 + 1 dimensions and the left and right Rindler wedges.
Some accelerated worlines (lines of constant ξL and ξR ) have been drawn.
eternally accelerated observers in Minkowski space. We shall see that such observers,
when travelling through the Minkowski vacuum, will feel themselves immersed in a
thermal bath of particles.
– 63 –
Left Rindler Wedge L Right Rindler Wedge R
uL = ηL − ξL uR = ηR − ξR
v L = η L + ξL vR = ηR + ξR
Table 2. Coordinates in the left and right Rindler wedges and their relations to Minkowski
coordinates.
Analogous definitions can be made for coordinates that cover the left Rindler
wedge. The coordinates and line elements for the left and right Rindler wedge are
shown in table 2, where we’ve used subscripts to distinguish between the two wedges.
In this section, Rindler coordinates without subscript always refer to the right wedge.
In fact, the time coordinates ηL = ηR = tanh−1 (t/x) are exactly the same func-
tion of Minkowski coordinates — they just take different ranges in L and R. We can
therefore drop the subscripts on η completely, as long as we remember that η < 0
corresponds to L and η > 0 to R. Lines of η = const. then correspond to lines
through the origin that extend over both L and R. These lines are Cauchy surfaces
for Minkowski spacetime. The vector ∂η is a timelike Killing vector (exercise) in both
L and R, but it is future-pointing in R and past-pointing in L. We now have three
globally hyperbolic manifolds with global future-pointing timelike Killing vectors:
Minkowski space with ∂t , the left Rindler wedge with −∂η , and the right Rindler
– 64 –
wedge with ∂η . In each of these the canonical quantisation procedure outlined in the
preceding section can be applied.
– 65 –
proper time of a Rindler observer is proportional to η. Hence, the vector ∂η is the
natural Killing vector with respect to which a Rindler would measure frequencies
and define a notion of positive frequency.
We need to extend our definition of the Rindler modes because as defined in (5.28)
they are not specified on a Cauchy surface for M . For example, they are only defined
on half of the Cauchy surface η = 0 (for x > 0). This is easily remedied by defining
the global modes gkR as follows:
These modes are defined on an entire Cauchy surface (e.g. η = 0). The set gkR , gkR∗
does not form complete basis for the functions on M yet. For this reason, we also need
to consider the positive-frequency modes in the left wedge L, which are associated
with left-accelerating Rindler observers. These modes are given by
and are positive frequency with respect to −∂η in L (remembering that the future-
pointing timelike Killing vector in L is −∂η ). Taken together, the positive-frequency
modes in L and R form a complete set, and so a field configuration φ(x) in Minkowski
space may be expanded as
X
φ(x) = bk gkR (x) + ck gkL (x) + h.c. (5.32)
k
where h.c. denotes Hermitian conjugate. We can now ask: How many R-particles
are expected to be seen in the Minkowski vaccum — how many particles does the
externally accelerated observer in the right wedge see? In other words, what is
h0M |R Nk |0M i, where R Nk = b†k bk ? To simplify things, we specialise our question to
right-movers, for which k > 0 =⇒ ω = k. We need the Bogoliubov coefficients
Z ∞
R
gω = dω 0 (Aωω0 fω0 + Bωω0 fω∗0 ) (5.33)
0
1
where fω0 = (4πω 0 )− 2 exp (−iω 0 ū). Notice the resemblance of this integral with the
Fourier transform of gωR :
Z ∞
1 0
R
gω (ū) = dω 0 e−iω ū g̃ω (ω 0 ) (5.34)
2π −∞
– 66 –
where Z ∞
0 0
g̃ω (ω ) = dū eiω ū gωR (ū) (5.35)
−∞
(for notational convenience we omit the superscript R on the Fourier transform). Our
strategy is now to massage (5.34) into a form that allows us to write the Bogoliubov
coefficients in (5.33) in terms of the Fourier transform g̃ω . To that end, we split up
the integral in (5.34) as follows:
Z ∞ Z ∞
1 0 −iūω 0 0 1 0
R
gω (ū) = dω e g̃ω (ω ) + dω 0 eiūω g̃ω (−ω 0 ). (5.36)
2π 0 2π 0
We flipped the sign of the integration variable ω 0 → −ω 0 in the second term. Com-
paring with (5.33), we find
r r
ω0 ω0
Aωω0 = g̃ω (ω 0 ) and Bωω0 = g̃ω (−ω 0 ). (5.37)
π π
In order to find the Bogoliubov coefficients (5.37), we don’t actually need to evaluate
the Fourier transforms. It is enough if we can relate g̃ω (ω 0 ) to g̃ω (−ω 0 ), since the
additional condition AA† − BB† = 1 then fixes Aωω0 and Bωω0 completely. To do so
we prove the following claim:
Proof: The integral defining g̃ω uses gωR in terms of ū. Since u = −a−1 ln(−aū), the
right-moving R-modes take the form
The function ln(−z) is multi-valued on the complex plane so we will give it a branch
cut on the positive real axis. Then
Z ∞
0 0
g̃ω (−ω ) = dū e−iω ū gω (ū)
−∞
Z 0 (5.39)
1 −iω 0 ū −i ω ln(−au)
=√ dū e e a .
4πω −∞
Now we deform the path of integration, using the contour shown in Figure 18. The
contour is closed in the upper complex half-plane because the integral over the arc
segment C then vanishes for ω 0 > 0 in the limit as the radius of the arc goes to infinity.
In order to avoid the branch cut we choose γ2 to lie just above the real axis by an
infinitesimal amount i (which we send to zero at the end). Since the integrand has
– 67 –
Figure 18. The contour integral used in the evaluation of gωR (−ω 0 ) with a branch cut of
ln(−z) on the positive real axis.
no poles inside the closed contour (i.e. it is holomorphic there), Cauchy’s integral
theorem implies
I Z Z Z
−iω 0 ū −i ω 0 ω
0 = dū e e a
ln(−au)
= + + dū e−iω ū e−i a ln(−au) . (5.40)
γ1 γ2 C
Strictly speaking, there is another contribution on the right hand side for the integral
from i to 0 (see Figure 18), but this will vanish when we send to zero at the end
so we can forget about it here. Since the integral over the arc C vanishes, it follows
R R
from (5.40) that γ1 = − γ2 , which implies
Z ∞+i
0 1 0 ω
g̃ω (−ω ) = − √ dū e−iω ū e−i a ln(−au)
4πω i
Z −i
1 0 ω
= −√ dū eiω ū e−i a ln(au) (ū ↔ −ū)
4πω −∞−i (5.41)
Z 0
1 0 ω
= −√ dū eiω ū e−i a [ln(−au)−iπ] ( → 0)
4πω −∞
= −e−πω/a g̃ω (ω 0 )
as claimed.
Hence we have Aωω0 = −e−πω/a Bωω0 , which together with |Aωω |2 − |Bωω |2 = 1
implies
1
h0M |R Nω |0M i = (BB† )ωω = |Bωω |2 = 2πω/a . (5.42)
e −1
~a
This corresponds to a Planck spectrum with temperature T = 2πkB
.
– 68 –
In fact, we can show more: the state |0M i is the thermal state with respect to
the Rindler Hamiltonian ĤR , i.e. the Hamiltonian associated with a hypersurface
η = const. in the right Rindler wedge (in other words, ĤR generates time-translations
along ∂η ). Mathematically this statement is
e−β ĤR
TrL (|0M ih0M |) = (5.43)
he−β ĤR i
where β = (kb T )−1 = 2πkb /~a. To show this, it is convenient to introduce so-called
Unruh modes, which are positive frequency with respect to ∂t , i.e. they are linear
combinations of fk -modes. The Unruh mode of frequency k is defined as
1
eπω/2a gkR + e−πω/2a g−k
L
zk = p . (5.44)
2 sinh ωπ/a
In order to form a complete set, they must be complemented by the modes
1
zk0 = p e−πω/2a g−k
R
+ eπω/2a gkL .
(5.45)
2 sinh ωπ/a
The modes zk and zk0 together with their complex conjugates form a complete basis
for functions on Minkowski space. Hence, the field can be equally well expanded in
terms of them: X
φ(x) = dk zk (x) + dk0 zk0 (x) + h.c. (5.46)
k
– 69 –
state has n left-wedge particles. To find the state ρR in R, we need to trace over the
states in L via the “partial trace” TrL . Defining En = nω, we obtain
∞ Y
X e−βEn e−β ĤR
P∞ −βE R n, k R n, k =
as claimed above. This reveals something perhaps surprising about the Minkowski
vacuum: it is not a local quantity at all, i.e. it is not a state of the form
The actual physical meaning of the results above may perhaps seem a bit ob-
scure. We considered the quantisation of a quantum field “appropriate to a Rindler
observer” in Minkowski space, which led us to conclude that a Rindler observer is
immersed in a thermal bath of radiation. How convincing were our arguments for
choosing some appropriate quantisation scheme? How would an eternally accelerated
observer actually measure the radiation? To put the results derived above on more
solid grounds, Bill Unruh constructed a simple model of a detector (nowadays known
as an Unruh-deWitt detector) and showed that an accelerated detector in Minkowski
space behaves as if it were in a thermal bath of particles. This effect is known as the
Unruh effect. Hawking radiation, which we discuss next, is a phenomenon closely
related to this effect.
– 70 –
Figure 19. Penrose diagram of the spherically symmetric collapsing star.
As before, to find out what observers at late times on J + see in the state |0in i
defined by ai |0in i = 0 ∀ i, we will evaluate hNi i = (BB† )ii , for which we need the
Bogoliubov coefficients in the expansion
X
gi = Aij fj + Bij fj∗ . (5.53)
j
– 71 –
If we could find the exact solutions to the Klein-Gordon equation, this would
be a fairly straight-forward task: once the positive-frequency solutions gi on J + are
specified, we can do the analog of a Fourier transform to find their expansion in terms
of fi and fi∗ on J − (such an expansion always exists since {fi , fi∗ } is a complete set
of functions) and read off the Bogoliubov coefficients. However, we will see in a
moment that the Klein-Gordon equation is complicated in Schwarzschild spacetime,
which prevents us from finding analytic solutions. Instead, we will ask: if a given
solution to the Klein-Gordon equation is asymptotically positive-frequency at J + ,
then what is its form at J − ? In other words, we will impose a boundary condition
to the solution at J + and investigate what its corresponding form must be on J − .
This amounts to “tracing back in time” the solution from J + to J − .
The metric of the Schwarschild black hole spacetime with coordinates (t, r∗ , θ, φ)
reads
2 2M
ds = 1 − (−dt2 + dr∗2 ) + r2 dΩ22 . (5.54)
r
We will also use the light-cone coordinates u = t − r∗ and v = t + r∗ below. Using
the handy formula
1 √
= ∇µ ∇µ = √ ∂µ −gg µν ∂ν
(5.55)
−g
we can find the Klein-Gordon equation for the field φ(t, r∗ , θ, φ). Expanding the
solution in spherical harmonics φ(t, r∗ , θ, φ) = χl (r∗ , t)Ylm (θ, φ) we find
where
2M l(l + 1) 2M
Vl (r∗ ) = 1 − + 3 . (5.57)
r r2 r
Set χl (r∗ , t) = e−iωt Rlω (r∗ ) so that
This equation can be cast into a form known as the confluent Heun equation, whose
solutions have been studied to some extent but are complicated. We can gain some
insight into the solutions by looking more closely at the potential Vl (r∗ ). Both near
the horizon H + (r → 2M ⇐⇒ r∗ → −∞) and near J ± (r → ∞ ⇐⇒ r∗ → ∞),
the potential tends to zero — it takes the form of a potential barrier, as shown in
Figure 20. Hence, if we consider how any particular solution to (5.58) evolves in
time, it will be partly transmitted and partly reflected as it comes in from r∗ = ∞.
– 72 –
Figure 20. The potential Vl as a function of r∗ (shown here for l = 1).
Near J ± , the solutions to (5.56) are just plane waves. The equivalent of right-
moving and left-moving for a radial coordinate is outgoing and ingoing (correspond-
ing, respectively, to r∗ increasing or decreasing with time) so we define “early modes”
1 Ylm
flmω+ = √ e−iωu (outgoing)
2πω r
(5.59)
1 Ylm
flmω− = √ e−iωv (ingoing)
2πω r
at J + . We will be interested mainly in ingoing early modes and outgoing late modes,
so we will use the shorthand notation
fω ∼ flmω−
(5.61)
gω ∼ glmω+ .
We need to express gω in terms of fω0 and fω∗0 on J − . First, note that plane waves
such as gω are in fact completely delocalised, i.e. they have support everywhere on
J + . However, using a superposition of such positive frequency plane waves, we can
always construct a localised “wave packet” on J + (for example a Gaussian packet)
and we can choose it to be peaked around some momentum ω0 and around some
coordinate value u0 . Hence, when we speak of the “outgoing mode at late times” we
really mean a linear combination of outgoing modes on J + . Keeping this in mind,
– 73 –
Figure 21. The BLA.
In order to find the expansion of the late mode in terms of early modes, we can
trace the solution back in time. As the wave travels inwards from J + (toward de-
creasing values of r∗ ), it will encounter the potential barrier. One part of the wave,
(r)
call it gω , will be reflected and end up on J − with the same frequency ω. This
will correspond to a term of the form Aωω0 ∝ δ(ω − ω 0 ) in (5.53). The remaining
(t)
part gω of gω will be transmitted through the barrier and will enter the collapsing
matter. In that region, the precise geometry of spacetime is unknown. However,
since we are interested in a packet peaked at late times (i.e. large u0 ) and at some
finite frequency ω0 , we know that the packet will be peaked at a very high frequency
as it enters the collapsing matter due to the gravitational blueshift. This leads to an
important simplification: the packet will obey the geometric optics approximation,
which means that gω takes the form A(x)eiS(x) where A(x) is slowly varying com-
pared to S. Substituting this into the Klein-Gordon equation gives ∇µ S∇µ S = 0,
which means that surfaces of constant phase are null. Given a wave, we can therefore
trace its surfaces of constant phase back in time by following null geodesics.
Consider tracing back the wave along a particular null geodesic γ which starts
off at some u = u0 at J + and hits J − at v = v0 . as in Figure 21. We denote
by γH a null generator of the horizon H + , which has been extended into the past
– 74 –
until it hits J − at some value of v. We may set this value to v = 0 without loss
of generality, since the spacetime is invariant under shifts v → v + c. In that case
we have v0 < 0 for the geodesic γ. Let n be a connecting vector between the two
curves as shown in Figure 21. Fix its normalisation by requiring n.` = −1, where `
is a generator of the Killing horizon H + . Near the horizon, the Kruskal coordinate
U = −e−κu is an affine distance along n and we can use it to measure the distance
between γ and γH (1.25). In order to find the form of the wave at J − , we need to
understand how the affine distance along the connecting vector n will change by the
time γ reaches J − . At J − , the coordinate v is an affine parameter along the null
geodesic integral curves of n. If U0 = 0 (corresponding to u0 = ∞) then the affine
distance is zero at J − . Hence we can expand the affine distance between γ and γH
at J − in powers of U0 : v = cU0 + O(U02 ) for some constant c > 0 (since U, v < 0).
Using u = −κ−1 ln(−U ) = −κ−1 ln(−cv), we can conclude that if a mode takes the
(t)
form gω ∼ e−iωu on J + , the transmitted part gω on J − will take the form
( ω
(t) ei κ ln(−v) for v < 0
gω ∼ (5.62)
0 for v > 0
– 75 –