Mcowling Fa Notes
Mcowling Fa Notes
Mcowling Fa Notes
Throughout, vector spaces are over the field F, which is R or C, unless otherwise stated.
First, we consider bases in a space of continuous functions. This will motivate using
(countably) infinite linear combinations. To interpret these, we need some kind of conver-
gence; a norm on a vector space allows us to do this. We ask when linear maps of normed
vector spaces are continuous, and when two normed vector spaces are “the same”.
Exercise 1.5. Show that every vector space V over an arbitrary field F has a Hamel
basis. What relations, if any, connect the cardinality of the basis and that cardinality of
the vector space. Hint: first consider the collection of linearly independent subsets of V ,
ordered by inclusion.
A Hamel basis of C(I) has large cardinality; and finding one uses the Axiom of Choice.
However, in C(I) we can use Fourier methods to write an arbitrary function f in the form
X
f (t) = at + an e2πint ∀t ∈ I.
n∈Z
The infinite sum may be interpreted using Abel or Cesáro means. So to calculate effectively,
we must consider infinite linear combinations. Infinite sums are interpreted using a limit
process, so we need to know when a sequence converges. This requires a topology. Normed
vector spaces provide this.
N
X
yN = xn ,
n=1
converges;
(d) if xn ∈ V and ∞
P PN
n=1 kxn k < ∞, then the sequence (yN ), given by yN = n=1 xn ,
converges; further, if y denotes its limit, then
∞
X
ky − yN k ≤ kxn k .
n=N +1
Question 1.11. When are two normed vector spaces (V, k·kV ) and (W, k·kW ) “the
same”?
We would like V and W to be the same as vector spaces, and to have to same topology
(i.e., the same convergent sequences) or the same metric (i.e., the same norm). So we
would like to have an isomorphism T : V → W such that T is either a homeomorphism
or an isometry. When we say that T is an isomorphism, we mean a linear (vector space)
isomorphism; when we call T a homeomorphism we mean that T and T −1 are continuous;
and when we say T is an isometry, we mean that
dW (T u, T v) := kT u − T vkW = kT (u − v)kW = ku − vkV =: dV (u, v) ∀u, v ∈ V,
or equivalently, that kT vkW = kvkV for all v ∈ V .
Question 1.12. When is a map T : V → W between normed vector spaces continuous?
Lemma 1.13. Suppose that T : V → W is a linear map between normed vector spaces.
Then the following are equivalent:
(a) T is continuous at one point in V ;
(b) T is continuous at all points in V ;
(c) there is a positive constant C such that
kT vkW ≤ C kvkV ∀v ∈ V.
so T vn → 0. Suppose that (c) does not hold, then for all n ∈ Z+ there exists vn ∈ V such
that kT vn kW > n kvn kV . Clearly vn 6= 0. Now let
1
vn0 = vn .
n kvn kV
Then kvn0 kV = 1/n and vn → 0, but kT vn0 kW > 1 and T vn0 6→ 0.
Corollary 1.14. Suppose that T : V → W is an isomorphism of normed vector
spaces. Then T is a homeomorphism if and only if there exist positive constants C1 and
C2 such that
C1 kvkV ≤ kT vkW ≤ C2 kvkV ∀v ∈ V.
Exercise 1.15. Let (V, k·k) be a normed vector space. A completion of V is a complete
normed vector space W and an isometric isomorphism T from V to a dense subspace T V
of W . Prove that:
(a) at least one completion of V exists;
(b) if W1 and W2 are two completions of V , then W1 and W2 are isometrically isomorphic.
Exercise 1.16. Suppose that V and W are normed linear spaces, and that T : V → W
is a linear isomorphism such that
C1 kvkV ≤ kT vkW ≤ C2 kvkV ∀v ∈ V,
for some positive constants C1 and C2 . Show that W is complete if and only if V is
complete.
Exercise 1.17. Suppose that V and W are normed linear spaces, that V0 and W0 are
dense subspaces of V and W , and that W is complete. Suppose that T0 : V0 → W0 is linear
and that there is a constant C such that kT vkW ≤ C kvkV for all v ∈ V0 .
Show that there is a linear map T : V → W such that T |V0 = T0 and kT vkW ≤ C kvkV
for all v ∈ V . Show also that if T 0 has the same properties as T , then T = T 0 .
Exercise 1.18. Suppose that V and W are complete normed linear spaces, and that
V0 and W0 are dense subspaces of V and W respectively. Suppose that T0 is an isometric
isomorphism from V0 onto W0 .
Using the result of the previous exercise, or otherwise, show that there is an isomorphism
T from V onto W such that T |V0 = T0 and kT vkW = kvkV for all v ∈ V . Show also that
if T 0 has the same properties as T , then T = T 0 .
TOPIC 2
`p spaces
Let S be a set. We will define spaces `p (S), where 1 ≤ p ≤ ∞, and show that they
are complete normed vector spaces. We will determine when they are separable; this is a
necessary condition to have a countable basis.
1. Preliminaries
In the vector space Fn , we define
1/p
Pn p
j=1 |u j | if 1 ≤ p < ∞;
kukp :=
max{|u | : j = 1, . . . , n} if p = ∞.
j
and equality holds if and only if uj |uj |p−1 = vj |vj |q−1 for all j ∈ {1, . . . , n}.
Deduce that, if u, v ∈ Fn , then
X n
uj v̄j ≤ kukp kvkq ,
j=1
(this is known as Hölder’s inequality) and for all u ∈ Fn , there exists v ∈ Fn such that
kvkq = 1 and
Xn
uj v̄j = kukp
j=1
(this is known as the converse of Hölder’s inequality).
5
6 2. `p SPACES
Exercise 2.4. Using the result of the previous exercise, or otherwise, show that
ku + vkp ≤ kukp + kukp ∀u, v ∈ Fn .
Deduce that k·kp is a norm on Fn .
2. Infinite sums
Definition 2.5. Let S be a set, and let F(S) denote the collection of all finite subsets
of S.
(a) Given f : S → [0, ∞), we define
X X X
f= f (s) := sup f (s).
S s∈S S0 ∈F (S) s∈S
0
P
(b) Given f : S → F such that S |f | < ∞, we define
X X X X X X
f= f (s) := a(s) − b(s) + i c(s) − i d(s),
S s∈S s∈S s∈S s∈S s∈S
where
a(s) := max{Re f (s), 0}, b(s) := max{− Re f (s), 0},
c(s) := max{Im f (s), 0}, d(s) := max{− Im f (s), 0},
so that f = a − b + ic − id. Note that 0 ≤ a ≤ |f |, and similarly for b, c and d, so
P
S f is well-defined.
Proof. It is easy to see that `1 (S) is a normed vector space. We show only that it is
closed under addition and that kf + gk1 ≤ kf k1 + kgk1 .
Given f, g ∈ `1 (S), we see that
X X X
|(f + g)(s)| ≤ |f (s)| + |g(s)| ≤ kf k1 + kgk1
s∈S0 s∈S0 s∈S0
Hence
N
X XN
X
fn (s) − f (s)
= sup fn (s) − f (s)
n=1 1 S0 ∈F (S) s∈S
0 n=1
∞
X ∞
X
≤ sup kfn k1 = kfn k1 → 0
S0 ∈F (S) n=N +1 n=N +1
as N → ∞, as required.
Exercise 2.9. Show that `p (S) is a complete normed vector space.
8 2. `p SPACES
Exercise 2.10. State and prove versions of Hölder’s inequality and its converse for
spaces `p (S) and `q (S) when S is an infinite set.
Definition 2.11. Let S be a set. We define cc (S), sometimes written c00 (S), and c0 (S)
as follows:
cc (S) := {f : S → F : supp(f ) is finite},
c0 (S) := {f : S → F : f → 0 at infinity}.
Here supp(f ) denotes the support of f , that is, the set {s ∈ S : f (s) 6= 0}, and f → 0 at
infinity in S means that {s ∈ S : |f (s)| > 1/n} is finite for each n ∈ Z+ . We endow c0 (S)
with the norm k·k∞ .
Exercise 2.12. Show that cc (S) is dense in `p (S) if 1 ≤ p < ∞ or if S is finite and
1 ≤ p ≤ ∞. Show also that cc (S) is dense in c0 (S). Under what conditions on p and S is
`p (S) separable? Under what conditions on S c0 (S) separable?
Exercise 2.13. Show that, if 1 ≤ p ≤ q ≤ ∞, then `1 (S) ⊆ `p (S) ⊆ `q (S) ⊆ `∞ (S),
and kf kq ≤ kf kp whenever f ∈ `p (S). When are the set inclusions strict? When is the
norm inequality an equality or a norm equivalence?
Exercise 2.14. Suppose that 1 ≤ p < r < q ≤ ∞, and that 1/r = (1 − θ)/p + θ/q,
where 0 < θ < 1. If f ∈ `p (S), then f ∈ `r (S) and f ∈ `q (S) by the preceding exercise.
Show that
kf kr ≤ kf k1−θ
p kf kθq .
Exercise 2.15. Find the isometric isomorphisms of `p (S).
Exercise 2.16. Suppose that T is an isometry of R2 with the `p norm, that is,
kT u − T vkp = ku − vkp ∀u, v ∈ R2 .
Is T necessarily linear? If not, is T “nearly linear” in some sense?
TOPIC 3
Hilbert spaces
We define Hilbert spaces, which are the “nicest” infinite-dimensional vector spaces, and
look at some of their properties. We also consider some examples.
Deduce that hu + v, u + vi1/2 ≤ hu, ui1/2 + hv, vi1/2 , and hence show that the associated
norm is a norm.
Lemma 3.3. Let (V, h·, ·i) be an inner product space, and fix v, w ∈ V . If
hu, vi = hu, wi ∀u ∈ V,
then v = w.
Proof. We have to show that if un → u and vn → v, then hun , vn i → hu, vi. This is a
consequence of the following inequalities and limits:
|hun , vn i − hu, vi| = |hun − u, vn − vi + hun − u, vi + hu, vn − vi|
≤ |hun − u, vn − vi| + |hun − u, vi| + |hu, vn − vi|
≤ kun − uk kvn − vk + kun − uk kvk + kuk kvn − vk
→0
as n → ∞.
Examples 3.5. The following are inner product spaces, when equipped with their usual
inner products: Fn ; `2 (S), where S is a set; and L2 (M ), where M is a measure space. The
spaces L2 ([0, 1]) and L2 (R) and `2 (N) and `2 (Z) are particularly useful examples.
We know how the inner products on Rn and Cn behave, but let us check that the inner
product is well-defined on `2 (S). The standard inner product is
X
hf, gi = f (s) ḡ(s);
S
The required properties of h·, ·i may be deduced easily from the corresponding properties
of summation.
The case of L2 (M ) is similar, but requires integration rather than summation.
Question 3.6. Suppose that (V, k·k) is a normed vector space. If we know that V is
an inner product space, can we find the inner product? And if we don’t know whether V
is an inner product space or not, can we decide whether it is?
Lemma 3.7 (Polarisation). If (V, h·, ·i) is an inner product space over R, then
1
hu, vi = ku + vk2 − ku − vk2 ∀u, v ∈ V ;
4
if (V, h·, ·i) is an inner product space over C, then
4
1 X k
2
hu, vi = i
u + ik v
∀u, v ∈ V.
4 k=1
Notice that if V is a vector space over C, then it is also a vector space over R, by
“restriction of scalars”. The lemma above shows that the real part of a complex inner
product on a complex vector space is a real inner product on the corresponding real vector
space. What can we say about the imaginary part of the inner product? And given a real
vector space, when does it arise as a complex vector space with restricted scalars?
Lemma 3.8. Suppose that (V, k·k) is a real normed vector space and that Appolonius’
theorem holds, that is,
ku + vk2 + ku − vk2 = 2 kuk2 + kvk2 ∀u, v ∈ V.
Examples 3.11. The following are Hilbert spaces, when equipped with their usual
inner products: Fn ; `2 (S), where S is a set; and L2 (M ), where M is a measure space.
We showed that `p (S) is complete in Topic 2; in particular, `2 (S) is complete. The case
of L2 (M ) is similar, but requires integration rather than summation; the key fact is that
pointwise limits of measurable functions are measurable (this is not true for continuous or
Riemann integrable functions, which is why we need to use Lebesgue integration).
and so
(wm − wn )
2 = 1 ku − wm k2 + ku − wn k2 −
u − 1 (xm + xn )
2
1
2 2
2
1 1 2 1 1 2
≤ d+ + d+ − d2
2 m 2 n
1 1 1 1 1
=d + + + .
m n 2 m2 n2
Thus the sequence (wn ) is Cauchy, and so it converges, to v say; since S is closed, v ∈ S.
14 3. HILBERT SPACES
Now
1
ku − vk ≤ ku − wn k + kwn − vk ≤ d + + kwn − vk ;
n
by letting n → ∞, we see that ku − vk ≤ d. But ku − vk ≥ d because v ∈ S, and hence
equality holds.
Uniqueness is easy: if v, v 0 ∈ S and ku − vk = ku − v 0 k = d, then, again by Apollonius’
theorem,
u − 1 (v + v 0 )
2 +
1 (v − v 0 )
2 = 1 (ku − vk2 + ku − v 0 k2 ) = d2 ;
2 2 2
2
now
u − 21 (v + v 0 )
≥ d since 12 (v + v 0 ) ∈ S, whence
12 (v − v 0 )
≤ 0 so v = v 0 .
We sometimes call u the best approximant to v in S.
Since subspaces of vector spaces are convex, the last lemma enables us to define the
orthogonal projection onto a not necessarily finite-dimensional closed subspace of a Hilbert
space.
Corollary 3.21. Suppose that S is a closed subspace of a Hilbert space (V, h·, ·i). For
each u ∈ V , write PS (u) for the unique vector in S such that
ku − PS (u)k = min{ku − wk : w ∈ S},
as in Lemma 3.20. Then PS (u) is the unique vector in S such that hu − PS (u), wi = 0 for
all w ∈ S. Further, PS : V → S is an orthogonal projection, that is, PS is linear, PS2 = PS ,
and hPS (u), vi = hu, PS (v)i for all u, v ∈ V .
Proof. Take w ∈ S. By Lemma 3.20,
ku − (PS (u) + λw)k ≥ ku − PS (u)k
for all λ ∈ F, and equality holds if and only if λ = 0. Hence
hu − (PS (u) + λw), u − (PS (u) + λw)i ≥ hu − PS (u), u − PS (u)i ,
that is,
|λ|2 hw, wi − 2 Re(λ hw, u − PS (u)i) ≥ 0
for all λ ∈ F, and equality holds if and only if λ = 0. This implies that hw, u − PS (u)i = 0.
Further, if v1 , v2 ∈ S and hw, u − v1 i = 0 = hw, u − v2 i for all w ∈ S, then it follows
that hw, v1 − v2 i = 0 for all w ∈ S, and hence v1 = v2 . Thus PS (u) is the unique vector in
S such that hu − PS (u), wi = 0 for all w ∈ S.
We have just shown that each u ∈ V may be written uniquely as uS + u⊥ , where uS ∈ S
and u⊥ ⊥ S. Given two vectors u, u0 ∈ S and λ ∈ F, we see that
λu + u0 = λuS + u0S + λu⊥ + u0⊥ ,
and λuS + u0S ∈ S while λu⊥ + u0⊥ ⊥ S. It follows that PS (λu + u0 ) = λPS (u) + PS (u0 ),
that is, PS is linear. The other properties of PS follow similarly.
We observe that if S is finite-dimensional, then the projection defined here coincides
with the projection defined earlier; the key is that v − PS (v) ∈ S ⊥ (see Exercise 3.15).
3. BESSEL’S INEQUALITY AND ORTHONORMAL BASES 15
Exercise 3.22. Suppose that S1 and S2 are two closed convex subsets of a Hilbert
space (V, h·, ·i). What additional hypotheses, if any, are needed to ensure that there exist
points u1 ∈ S1 and u2 ∈ S2 such that
ku1 − u2 k = inf{kv1 − v2 k : v1 ∈ S1 , v2 ∈ S2 }?
Proof. Suppose first that A is finite, and write S for span A. Write v = vS + v⊥ ,
where vS = PS (v) and v⊥ ⊥ S; then
kvk2 = kvS k2 + kv⊥ k2 ≥ kvS k2 .
Further, vS = e∈A hv, ei e and so kvS k2 = e∈A |hv, ei|2 . The result follows in the case
P P
when A is finite.
If A is infinite, then X
|hv, ei|2 ≤ kvk2 ∀v ∈ V
e∈A0
for each finite subset A0 of A. Taking the supremum over finite subsets of A gives the
result in the general case.
Finally we come to our main theorem. Recall that span A denotes the vector space of
all finite linear combinations of elements of a subset A of a vector space V .
Theorem 3.24. Let A be an orthonormal subset of a Hilbert space (V, h·, ·i). The
following conditions are equivalent:
(a) span A is dense in V ;
(b) A⊥ = {0};
(c) A is a maximal orthonormal subset of V ;
(d) Bessel’s inequality (3.1) is always an equality;
(e) the mapping T : V → `2 (A), given by
T (v) = (e 7→ hv, ei),
is an isometric isomorphism.
Proof. We show that (a) and (b) are equivalent. Write S for span A; then Exercise
3.13 shows that
A⊥ = S ⊥ = (S̄)⊥ ,
so if S is dense, then A⊥ = {0}. And if S is not dense, then S̄ is a proper closed subspace
of V ; take v ∈ V \ S̄, and then v − PS̄ v ∈ A⊥ \ {0}.
16 3. HILBERT SPACES
P 3.27. If A is an orthonormal
Exercise subset of a Hilbert space (V, h·, ·i), how can we
2
interpret e∈A λ(e)e, when λ ∈ ` (A)?
3. BESSEL’S INEQUALITY AND ORTHONORMAL BASES 17
2πint
Examples 3.28. The set of functions {t √ 7→ e : n√∈ Z} is orthonormal in L2 (I);
alternatively, we may consider the set {1, 2 cos(2πnt), 2 sin(2πnt) : n ∈ Z+ }. Both
these sets are bases. This may be proved in various ways. Essentially all the proofs use the
fact that C(I) is a dense subspace of L2 (I), but there are several ways to show that the set
of trigonometric polynomials (the span of the complex exponentials, or of the trigonometric
functions) is dense in C(I). These include using Sturm–Liouville theory, which is about
differential equations; using the Stone–Weierstrass theorem, which is about algebras of
functions; and using convolution, which is a technique of harmonic analysis.
Fourier series is a prototype of Hilbert space theory.
The theory of Fourier integrals does not give us a basis for L2 (R), since periodic func-
tions are not square integrable. Various bases for L2 (R) were found in the 1800s, with
names such as Hermite functions or Laguerre functions; these were first constructed using
differential operators. We will consider the Hermite functions later.
Exercise 3.29. Define functions hj : I → R, where j ∈ N, by h0 = 1, and, for
n = 0, 1, 2, . . . and k = 0, 1, . . . , 2n − 1,
(
2n/2 sgn sin(2n πt) if k ≤ 2n t ≤ k + 1
h2 +k (t) =
n
0 otherwise,
where sgn is the signum function: sgn t = 1 if t > 0, sgn t = −1 if t < 0, and sgn 0 = 0.
(a) Sketch the graphs of the functions h1 and h6 .
(b) Show that {hj : j ∈ N} is an orthogonal set in L2 (I) (where I = [0, 1]).
(c) Show that every function f ∈ C(I) may be approximated uniformly by a finite sum of
the form Jj=0 cj hj .
P
(d) Deduce that {hj : j ∈ N} is an orthonormal basis in L2 (I). You may assume that C(I)
is dense in L2 (I).
(This basis is known as the Haar basis, and is used in signal processing.)
Remark 3.30. Spaces L2 (M1 ) and L2 (M2 ) can be “the same” in different ways. We say
that measure spaces M1 and M2 are isomorphic if there is a measure-preserving bijection
ϕ : M1 → M2 . Such a map ϕ induces an isometric isomorphism T : L2 (M2 ) → L2 (M1 ):
T f = f ◦ ϕ.
Measure space isomorphism is a “coarse” equivalence. For example, Rm and Rn are
measure-isomorphic for all m, n ∈ Z+ , and hence the corresponding L2 spaces are iso-
morphic as Hilbert spaces. However, no two of Z, R and I are measure-isomorphic, but
the corresponding Hilbert spaces are isometrically isomorphic. Indeed, there is a unique
isometric isomorphism equivalence class of infinite-dimensional separable Hilbert spaces.
Hence Hilbert space equivalence is an even “coarser” equivalence.
An important research area in modern functional analysis involves studying Hilbert
spaces with additional structure so that, for instance, L2 (Rm ) and L2 (Rn ) with the addi-
tional structure are not isomorphic.
TOPIC 4
Dual spaces
We discuss linear functionals on normed vector spaces, and consider the examples of
the `p spaces. We then prove the Hahn–Banach theorem, one of the pillars of functional
analysis, and look at some applications.
is well-defined and linear, from Topic 2 (in particular, from Hölder’s inequality); further
|T (g)(f )| ≤ kgkq kf kp ∀f ∈ `p (S).
Thus the mapping T is a linear mapping from `q (S) into `p (S)∗ . This mapping is an
isometry (by Hölder’s inequality and its converse); so in particular it is injective.
Conversely, given a continuous linear functional L on `p (S), we may define g : S → F
by the formula
g(s) = L(δs ),
where δs ∈ `p (S) is defined by requiring that δs (t) is equal to 1 if s = t and 0 otherwise. It
is evident that |g(s)| ≤ kLkV ∗ . By considering linear combinations of the δs where s ∈ S0 ,
a finite subset of S, and using the converse of Hölder’s inequality, we see that
X 1/q
|g(s)|q ≤ kLkV ∗
s∈S0
if 1 ≤ q < ∞ or
max |g(s)| ≤ kLkV ∗
s∈S0
if q = ∞, and then taking suprema over such subsets S0 , it follows that
kgkq ≤ kLkV ∗ .
Further, by construction, L(f ) = T (g)(f ) for all f ∈ cc (S). If cc (S) is dense in `p (S), then
it follows that L = T (g); but if cc (S) is not dense in `p (S) (this only happens if p = ∞),
then it is possible that L 6= T (g).
We will see soon that T (`1 (S)) is a proper subset of `∞ (S)∗ when S is infinite. Thus
p
` (S) is reflexive if 1 < p < ∞ or if S is finite, and is not reflexive otherwise.
2. THE HAHN–BANACH THEOREM 21
by taking v equal to u, we see that in fact equality holds, and in particular Te is injective.
But it is not a priori obvious that Te is surjective. However, by identifying H with `2 (A) for
some set A, we can see that the mapping Teu corresponds to T (ū), where T is the mapping
considered in Example 4.7, and hence Te is also surjective.
Exercise 4.9. Suppose that (V, k·kV ) is a Banach space. If V is reflexive, must V ∗ be
reflexive? And if V ∗ is reflexive, must V be reflexive?
Proof. There are several steps in the proof. First we show that, when the scalars are
real, it is always possible to extend a linear functional defined on a subspace to a larger
subspace without increasing the norm. Then we consider the collection of all possible
extensions, and use Zorn’s lemma to deduce the result. An extra trick is needed to deal
with complex scalars.
Step 1: Enlarging the domain. Suppose that V is a normed vector space over R. If W
is a subspace of V and v ∈ V \ W , then W 0 = W + Rv is a larger subspace than W . We
show how to extend a linear functional L on W to a linear functional L0 on W 0 without
increasing the norm. Suppose that
|L(w)| ≤ C kwk ∀w ∈ W ;
we will construct L0 such that |L0 (w0 )| ≤ C kw0 k for all w0 ∈ W 0 .
Every element of W 0 has a unique representation in the form w + λv, where w ∈ W
and λ ∈ R. We may therefore take µ ∈ R, and define
L0 (w + λv) = L(w) + λµ,
22 4. DUAL SPACES
|L0 (w + v)| ≤ C kw + vk ∀w ∈ W.
−C kw + vk ≤ L0 (w + v) ≤ C kw + vk
−C kw + vk ≤ L(w) + µ ≤ C kw + vk
−C kw + vk − L(w) ≤ µ ≤ C kw + vk − L(w)
L(w2 − w1 ) ≤ |L(w2 − w1 )|
≤ C kw2 − w1 k
≤ C k(w2 + v) − (w1 + v)k
≤ C kw2 + vk + C kw1 + vk
for all w1 , w2 ∈ W , which proves the inequality (4.2) that enables us to choose µ.
2. THE HAHN–BANACH THEOREM 23
Step 2: Induction. Now we present the Zorn’s lemma induction argument, still sup-
posing that the scalars are real. Recall that we start with a linear functional L : U → R
satisfying |L(u)| ≤ C kuk for all u ∈ U . We consider the collection of linear mappings
MW : W → R, where W is a subspace of V and U ⊆ W , with the properties that
MW |U = L and
|MW (w)| ≤ C kwk ∀w ∈ W.
0 0
We partially order the set of these mappings by decreeing that MW MW 0 if W ⊆ W
0
and MW 0 |W = MW . Given a chain of such mappings
0 00
MW MW 0 MW 00 . . . ,
Theorem 4.16. Suppose that U is a closed subspace of a Banach space V . The quotient
vector space V /U , endowed with the quotient norm k·kq , defined by
kv + U kq = inf{kv + uk : u ∈ U }
for all v ∈ V , is a Banach space.
Proof. Omitted.
Corollary 4.19. Suppose that U is a closed subspace of a Banach space V . If T ∈ U ⊥ ,
then T determines a continuous linear functional Ṫ on V /U by
Ṫ (v + U ) = T (v) ∀v ∈ V.
The mapping T 7→ Ṫ is an isometric isomorphism from U ⊥ onto (V /U )∗ .
Proof. Omitted.
26 4. DUAL SPACES
(a) Show that (C (I), k·kC k ) is a Banach space, and that the map f 7→ f (j) (t0 ) is a con-
k
We show that in infinite-dimensional vector spaces, closed bounded sets need not be
compact, and we vary the topology to make it easier for a set to be compact.
1. Weak topologies
Suppose that V is a vector space. One way to define a topology on V is to require that
certain functions f on V are continuous. If these functions are F-valued, then we require
that the sets f −1 (J) are open whenever J is an open subset of F. ButSevery open
S subset of
F is a union of open balls (these are intervals when F = R), and f −1 ( α Jα ) = α f −1 (Jα ),
so if we require that f −1 (J) is open whenever J is an open ball, then it will follow that
f −1 (J) is open whenever J is an open subset of F. We use this fact when the functions f
are linear.
Definition 5.1. Suppose that (V, k·k) is a normed vector space, and that S is a subset
of V ∗ . The σ(V, S) topology on V is the topology for which an open base consists of all
the sets
{v ∈ V : L1 (v) ∈ J1 , L2 (v) ∈ J2 , . . . , Ln (v) ∈ Jn },
where n ∈ N , while Lj ∈ S and Jj is an open ball in V for all j ∈ {1, 2, . . . , n}. Equivalently,
a subbase consists of the sets {v ∈ V : L(v) ∈ J, where L ∈ S and J is an open ball in F.
Observe that this definition only requires that V be a vector space; the norm plays no
role, except that we ask that the linear functionals Lj be continuous in the usual (norm)
topology. We could also relax this requirement and consider arbitrary linear functionals
on an arbitrary vector space V ; indeed, the Zariski topology in algebraic geometry is
defined in this way (except that the functions used are polynomials and the open sets J
are complements of finite sets).
Usually, we consider two particular instances of these topologies.
Definition 5.2. Suppose that (V, k·k) is a normed vector space. The weak topology
on V is the σ(V, V ∗ ) topology; if V is a dual space, that is, V = U ∗ for some Banach space
U , then we may view the predual space U as a subspace of the dual space V ∗ using the
injection of U into U ∗∗ considered in Corollary 4.11), and the weak-star topology is the
σ(V, U ) topology on V .
Lemma 5.3. The weak and weak-star topologies are coarser than the norm topology, that
is, every set that is open in the weak or weak-star topology is open in the norm topology.
27
28 5. WEAK TOPOLOGIES AND COMPACTNESS
Proof. It suffices to show that a subbasic set {v ∈ V : L(v) ∈ J}, where L ∈ V ∗ and
J is an open ball in F, is open in the norm topology.
If L(v0 ) ∈ J and J is open in F, then there exists ε ∈ R+ such that BF (L(v0 ), ε) ⊆ J;
now if kv − v0 k < ε/ kLkV ∗ , then
|L(v) − L(v0 )| = |L(v − v0 )| ≤ kLkV ∗ kv − v0 k < ε,
and so
BV (v0 , ε/ kLkV ∗ ) ⊆ {v ∈ V : L(v) ∈ J},
as required.
Lemma 5.4. Suppose that S is a convex subset of V . Then S is weakly closed (closed
in the weak topology) if and only if it is closed in the norm topology.
Sketch of the proof. If S is closed in the weak topology, then it is automatically
closed in the norm topology. So we may assume that S is closed in the norm topology and
show that it is closed in the weak topology.
Given any point v in V \ S, we can use the Hahn–Banach theorem to construct a
continuous linear functional M on V such that M (v) ∈ / L(S) . It then follows that
\
S= {v ∈ V : L(v) ∈ L(S) }.
L∈V ∗
For each L ∈ V ∗ , the set L(S) is closed in F, and the set {v ∈ V : L(v) ∈ L(S) } is closed
in the weak topology; it follows that S is closed.
Exercise 5.5. Suppose that (V, k·k) is a normed vector space and that W is a subspace
of V ∗ . Show that the weak topology σ(V, W ) is the coarsest topology for which all the
linear functionals in W are continuous, in the sense that L−1 (S) is open in V for all open
subsets S of F and all L ∈ W .
Exercise 5.6. Suppose that (V, k·k) is a normed vector space and that E is a subset
of V ∗ . Show that the weak topology σ(V, E) is the same as σ(V, span E). Is this the same
as σ(V, (span E) )?
3. Exercises
Exercise 5.8. Suppose that (V, k·k) is a Banach space. Show that the unit ball of V
is dense in the unit ball of V ∗∗ in the weak-star topology on V ∗∗ (that is, the σ(V ∗∗ , V ∗ )
topology).
Exercise 5.9. Suppose that (V, k·k) is a Banach space. If V is reflexive, is V ∗ reflexive?
If V ∗ is reflexive, is V reflexive?
TOPIC 6
Linear operators
1. Linear operators
Definition 6.1. Suppose that U and V are normed vector spaces. A bounded or
continuous linear operator T : U → V is a linear mapping T from U to V for which there
exists a constant C such that
kT ukV ≤ C kukU ∀u ∈ V. (6.1)
Lemma 6.2. Suppose that T : U → V is a continuous linear operator. Then
sup{kT ukV : u ∈ U, kukU ≤ 1} = min{C : (6.1) holds}. (6.2)
Proof. Clearly, if (6.1) holds, then C ≥ 0, and if C 0 ≥ C, then
kT ukV ≤ C 0 kukU ∀u ∈ V.
Hence the set of C for which (6.1) holds is an interval, of the form [C0 , +∞) or (C0 , +∞),
where C0 ≥ 0. In either case,
kT ukV ≤ (C0 + ε) kukU ∀u ∈ V
+
for all ε ∈ R . Exchanging the order of the quantifiers, and letting ε → 0, we see that
kT ukV ≤ C0 kukU ∀u ∈ V.
This shows that the infimum is attained, and justifies the use of minimum.
Since
kT ukV ≤ C0 kukU ∀u ∈ V,
by specialising to u of norm at most 1 we obtain kT ukV ≤ C0 , and
sup{kT ukV : u ∈ U, kukU ≤ 1} ≤ C0 .
Conversely, for all u ∈ U \ {0},
kT ukV =
T (kuk−1
kuk ≤ sup{kT u0 k : u0 ∈ U, ku0 k ≤ 1} kuk ,
U u) V U V U U
Lemma 6.4. Suppose that (U, k·kU ) and (V, k·kV ) are normed vector spaces. Then the
set B(U, V ) of all continuous linear operators T : U → V , with the norm defined above, is
a normed vector space. If V is a Banach space, so is B(U, V ).
Proof. We omit the proof. It is standard that linear maps between vector spaces form
another vector space, so the crucial issue is whether the norm as defined is a norm, and
when B(U, V ) is complete.
Proof. We omit much of this proof, as it is standard that the adjoint of a linear map
between vector spaces is another linear map; the issue is showing that T ∗ is bounded and
finding its norm. The key is to show that
kT k = sup{|hv, T ui| : u ∈ U, kukU ≤ 1, v ∈ V, kvkV ≤ 1}, (6.3)
which follows from the Cauchy–Schwarz inequality.
Definition 6.12. A linear operator T on a Hilbert space V is self-adjoint if T = T ∗ .
Example 6.13. For a function m ∈ L∞ (M ), define Tm : L2 (M ) → L2 (M ) by
Tm f (t) = m(t) f (t) ∀t ∈ M.
The adjoint of Tm is Tm̄ , and so Tm is self-adjoint if m is real-valued.
If M = I, the unit interval, and m(t) = t for all t ∈ I, then the operator Tm is self-
adjoint. However this operator does not have any eigenvectors or eigenvalues. Indeed, if
Tm f = λf for some function f ∈ L2 (I) and λ ∈ C, then
(m(t) − λ)f (t) = 0 ∀t ∈ I
and so f = 0 almost everywhere. Thus, in contrast to the finite-dimensional case, in
infinite-dimensional spaces, being self-adjoint does not ensure that an operator has eigen-
vectors and eigenvalues. However, Tm is a limit of operators with eigenvalues and eigen-
vectors. Indeed, define the step functions mk : I → R by mk (t) = bktc/k for all t ∈ I,
where k ∈ Z+ . Then Tmk is self-adjoint, and has eigenvalues j/k, where 0 ≤ j < k, and
each eigenspace is infinite-dimensional; further, Tm = limk→∞ Tmk in the operator norm.
It is also worth pointing out that the supremum sup{kTm ukV : u ∈ L2 (I), kuk2 ≤ 1} is
not attained for this example.
Proposition 6.14. Suppose that (V, h·, ·i) is a Hilbert space and that T ∈ B(V, V ) is
self-adjoint. Then
kT k = sup{|hT v, vi| : v ∈ V, kvk ≤ 1}.
= 4λ kT vk2 .
Hence
4λ kT vk2 = hT (T v + λv), (T v + λv)i − hT (T v − λv), (T v − λv)i
≤ MT kT v + λvk2 + MT kT v − λvk2
= MT (hT v + λv, T v + λvi + hT v − λv, T v − λvi)
= MT (2 hT v, T vi + 2λ2 hv, vi)
= 2MT (kT vk2 + λ2 kvk2 ),
and so
2λ kT vk2 ≤ MT (kT vk2 + λ2 kvk2 ).
Take λ = kT vk / kvk; then
2 kT vk3 / kvk ≤ 2MT kT vk2 ,
and so
kT vk ≤ MT kvk ,
as required.
Theorem 6.16. Suppose that (V, h·, ·i) is a Hilbert space and that T ∈ B(V, V ) is
compact and self-adjoint. Then there exist a set N , equal to Z+ or {1, . . . , n} for some
positive integer n or ∅, and an orthonormal set {vn : n ∈ N } of eigenvectors vn with
corresponding nonzero eigenvalues λn , such that |λ1 | ≥ |λ2 | ≥ . . . and λn → 0 as n → ∞
if N = Z+ . Further T = 0 on {vn : n ∈ N }⊥ .
hT v, v1 i = hv, T v1 i = λ1 hv, v1 i = 0,
and so T (V1 ) ⊆ V1 . We apply the proposition to T |V1 , the restriction of T to V1 , and find
v2 ∈ V1 such that kv2 k = 1 and T v2 = λ2 v2 , where |λ2 | = kT |V1 k. Clearly |λ1 | ≥ |λ2 |.
Now we take V2 = {v1 , v2 }⊥ , and observe that if v ∈ V2 , then
hT v, vj i = hv, T vj i = λj hv, vj i = 0
when j ∈ {1, 2}, and so T (V2 ) ⊆ V2 . We apply the proposition to T |V2 , the restriction of T
to V2 , and find v3 ∈ V2 such that kv3 k = 1 and T v3 = λ3 v3 , where |λ3 | = kT |V2 k. Clearly
|λ1 | ≥ |λ2 | ≥ |λ3 |.
Either this process continues inductively then stops after n steps because T |Vn = 0, or
it continues infinitely. In the latter case, the set {λ1 v1 , λ2 v2 , . . . } is contained in a compact
set, and this is only possible if for all ε ∈ R+ , there are only finitely many λn such that
|λn | ≥ ε, that is, λn → 0.
Corollary 6.17. Suppose that (U, h·, ·iU ) and (V, h·, ·iV ) are Hilbert spaces and that
T ∈ B(U, V ) is compact. Then there exist a set N , equal to Z+ or {1, . . . , n} for some
positive integer n or ∅, and orthonormal sets {un : n ∈ N } in U and {vn : n ∈ N } in V ,
such that T un = µn vn for all n ∈ N , where µ1 ≥ µ2 ≥ · · · > 0 and µn → 0 as n → ∞ if
N = Z+ . Further T = 0 on {un : n ∈ N }⊥ .
hvm , vn i = µ−1 −1
m µn hT um , T un i
= µ−1 −1 ∗
m µn hT T um , un i
= µ−1 −1 2
m µn µm hum , un i
(
1 if m = n
=
0 otherwise.
and so the vn also form an orthonormal set.
and a fortiori
[
V = (nT BU (0, 1)) .
n∈N
By the Baire category theorem, there exists n ∈ N such that (nT BU (0, 1)) has nonempty
interior. Since multiplication by nonzero scalars is a homeomorphism, it follows that
(T BU (0, 1)) has nonempty interior, that is,
(T BU (0, 1)) ⊇ BV (v0 , r0 )
for some v0 ∈ V and r0 ∈ R+ . In particular, T BU (0, 1) ∩ BV (v0 , r0 ) is dense in BV (v0 , r0 ).
If v ∈ BV (0, r0 ), then v + v0 ∈ BV (v0 , r0 ), and so there exists a sequence (un )n∈N in
BU (0, 1) such that T un → v + v0 as n → ∞; in particular, there is a sequence (u0n )n∈N such
that T un → v0 . Now un − u0n ∈ BU (0, 2) and T (un − u0n ) → v + v0 − v0 = v as n → ∞,
and thus
(T B(0, 2)) ⊇ BV (0, r0 ).
4. THE OPEN MAPPING THEOREM 37
I claim that, given v ∈ BV (0, r0 ), there exists a sequence (un )n∈N in BU (0, 2) such that
X X X
v= 2−n T un = T (2−n un ) = T 2−n un .
n∈N n∈N n∈N
Since
X
X
X
−n
2−n un
≤
2 un
≤ 21−n = 4,
U
U
n∈N n∈N n∈N
this claim implies that v ∈ T BU (0, 4), and so T BU (0, 4) ⊇ BV (0, r0 ). By linearity, it
follows that
T BU (u0 , r) = T u0 + T BU (0, r) ⊇ T u0 + BV (0, rr0 /4),
and hence that T sends open sets to open sets. Indeed, if u0 is a point in an open set E
in U , then BU (u0 , r) ⊇ E for a suitably small r0 , and so the typical point T u0 in T E lies
inside a ball BV (T u0 , rr0 /4) in V that is a subset of T E.
To prove the claim, we choose the points un ∈ BU (0, 2) recursively. First, take u0 so
that kv − T u0 kV ≤ r0 /2. Once we have chosen u0 , u1 , . . . , uk in BU (0, 2) such that
k
X
v − 2 T un
< 2−k−1 r0 ,
−n
V
n=0
Pk
2k+1 (v − n=0 2−n T un ) ∈ BV (0, r0 ), so we may choose uk+1 ∈ BU (0, 2) such that
k
X
r0
2−n T un ) − T uk+1
< ,
k+1
2 (v −
n=0
V 2
and then
k+1
X
k
X
−n −n −k−1
v − 2 T un
=
v − 2 T un − 2 T uk+1
< 2−k−2 r0 .
n=0 n=0
Once the un have been chosen, passing to the limit as k → ∞ proves the claim, and hence
the theorem.
Corollary 6.20. Suppose that (U, k·kU ) and (V, k·kV ) are Banach spaces, and that
T : U → V is a continuous linear operator. If T is bijective, then T −1 is also continuous.
In the next exercises and examples, given an (integrable) functions g on the unit interval
I, we write ĝ for its Fourier transform, which is a function on Z:
Z
ĝ(n) = g(t)e−2πint dt ∀n ∈ Z.
I
Example 6.23. The continuous linear mapping g 7→ ĝ from L1 (I) to c0 (Z) is injective
(from Fourier series), but not surjective. Indeed, if it were surjective, then by the corollary
to the open mapping theorem, the inverse mapping would be continuous, that is, there
would be a constant C such that
kf k ≤ C
fˆ
1 ∞
∀f ∈ L1 (I).
However, the functions DN studied above show that no such constant can exist, and so the
Fourier transformation g 7→ ĝ cannot be surjective.
Exercise 6.24. By using the open Pmapping theorem, show that there exists a function
ˆ
f ∈ C(I) such that f (0) = f (1) and n∈Z f (n) = ∞.
birjective, since P1 (u, T u) = u for all u ∈ U . Then P1 |G(T ) −1 : U → G(T ) is also continuous,
by the corollary to the open mapping theorem. Since T = P2 P1 |G(T ) −1 , it follows that T is
continuous.
Exercise 6.26. Suppose that m ∈ `∞ (Z), and suppose that for all f ∈ Lp (I), there
exists Tm f ∈ Lp (I) such that (Tm f )b(n) = m(n)fˆ(n) for all n ∈ Z. Show that the mapping
Tm is continuous and linear.
Proof. For any positive real number r, define Er = {v ∈ V : supn∈N kTn vkW ≤ r}.
Then \
Er = {v ∈ V : kTn vkW ≤ r},
n∈N
and so Er is closed, as each set {v ∈ V : kTn vkW ≤ r} is closed. Indeed, if vj ∈ V and
vj → v as j → ∞, then
kTn vkW = lim kTn vj kW ≤ r.
j
S
By definition and hypothesis, V = r∈Z+ Er . By the Baire category theorem, at least
one of the sets Er must not be nowhere dense, that is, at least one Er must contain an
open set, and hence contain an open ball B(v0 , s), where v0 ∈ V and s ∈ R+ .
Exercise 6.28. Suppose that (V, k·kV ) is a Banach space and (W, k·kW ) is a normed
vector space. Suppose also that Tn : V → W is a continuous linear operator for each
n ∈ N, and that lim Tn v exists for all v ∈ V . Prove that
sup kTn kB(V,W ) < ∞.
n∈N
Exercise 6.29.
7. Further problems
Exercise 6.30. Suppose that (U, k·kU ) and (V, k·kV ) are normed vector spaces, and
that T : U → V is a linear map. Show that T T is compact if T is compact. Is the converse
true?
TOPIC 7
In this chapter, we give some conditions for a subspace A of the space C(X) of contin-
uous functions on a topological space X to be dense in C(X). This type of result is used in
approximation theory, as well as to show that the span of an orthogonal set of continuous
functions A is dense in C(X) and so also in L2 (X), and hence to establish that A is a
basis for L2 (X).
We will argue that wδ ∗ f approximates f (as δ → 0) and then, by taking finitely many
terms of the power series expansion of wδ (t − s) in powers of t − s, show that wδ ∗ f (t) may
be approximated by a polynomial in t.
Since f is continuous and compactly supported, it is uniformly continuous, which means
that there exists η ∈ R+ such that |f (t − s) − f (t)| < 14 ε for all t ∈ R provided that |s| < η.
Further, it is not hard to see that
Z
wδ (s) ds = 1
R
Z
wδ (s) ds → 0 + as δ → 0+,
R\[−η,η]
41
42 7. APPROXIMATION IN SPACES OF CONTINUOUS FUNCTIONS
irrespective of the value of η, so in particular for η chosen above, we may take δ ∈ R+ such
that
Z
ε
wδ (s) ds < .
R\[−η,η] 4 kf k∞
for all t ∈ R.
Now we show that wδ ∗ f may be approximated by a polynomial to within ε/2 on I.
This is not possible on all R, since f is zero outside [−1, 2] and any polynomial that is
bounded outside [−1, 2] must be constant.
Observe that
Z
wδ ∗ f (t) = wδ (t − s) f (s) ds
R
Z
1 2
=√ e−(t−s) /2δ f (s) ds
2πδ R
∞
[−(t − s)2 /2δ]n
Z X
1
=√ f (s) ds.
2πδ R n=0 n!
and then
Z X
∞ ∞
1 Z X [−(t − s)2 /2δ]n 1 [−(t − s)2 /2δ]n
√ f (s) ds ≤ √ |f (s)| ds
n! n!
2πδ R 2πδ R n=N +1
n=N +1
∞
|[−(t − s)2 /2δ]n |
Z X
1
≤√ |f (s)| ds
2πδ R n=N +1 n!
∞
[2/δ]n
Z X
1
≤√ |f (s)| ds
2πδ R n=N +1 n!
Z √
1 ε 2πδ
≤√ |f (s)| ds
2πδ R 2 kf k1
= 21 ε
for all t ∈ I, as required.
From complex analysis, much is known about the space H ∞ (C+ ) of bounded holo-
morphic functions on C+ . In particular, suppose that {0} ⊆ Λ ⊆ N. Then there exists
h ∈ H ∞ (C+ ) such that h(n) = 0 for all n ∈ Λ ⊆ N if and only if condition (b) above holds.
This result is key to the proof of the theorem.
On the one hand, if A is not dense in C(I), then by the Hahn–Banach theorem, there
exists a linear functional L on C(I) such that L|A = 0 but L 6= 0. By Weierstrass’ theorem,
there exists n ∈ N such that L(t 7→ tn ) 6= 0, and n ∈
/ Λ by our earlier observation. Hence
hL 6= 0, so condition (b) above does not hold.
On the other hand, if condition (b) above does not hold, then the function that is
constructed using complex analysis turns out to be of the form hL for some linear functional
L. We can certainly set L(t 7→ tn ) = h(n), and so define L(p) for all polynomials p in
C(I). What is needed is to show that there exists a constant C such that |L(p)| ≤ C kpk∞
for all polynomials p; this ensures that we may define L(f ) unambiguously as lim L(pn ) for
any sequence of polynomials pn that converge to f . To do so requires some more complex
analysis and we omit the details.
Proof. If A satisfies the conditions above, so does Ā. Hence we may and shall suppose
that A is also closed, and show that A = C(X).
If F = C, then the real part and the imaginary part of every function in A lie in A,
and if A satisfies the conditions above, the so does AR , the (real) subspace of A of real-
valued functions, provided that we restrict to real scalars. Further, if AR = C(X, R), then
A = C(X, C). Hence we may and shall suppose that F = R.
Step 1: some special polynomials on I. By the argument given just before the statement
of this theorem, it is possible to approximate t 7→ t uniformly on I by (real) polynomials
in even powers of t, and so it is possible to approximate t 7→ |t| uniformly on [−1, 1] by
even polynomials. In other words, for each ε ∈ R+ , there exists an even polynomial pε
3. THE STONE–WEIERSTRASS THEOREM 45
such that
|pε (t) − |t|| < ε ∀t ∈ I.
Step 2: A is closed under taking absolute values. Suppose that f ∈ A and kf k∞ ≤ 1.
Since A is an algebra, pε ◦ f ∈ A. Further,
max{|pε ◦ f (x) − |f (x)|| : x ∈ X} = max{|pε (f (x)) − |f (x)|| : x ∈ X}
≤ max{|pε (t) − |t|| : t ∈ [−1, 1]} < ε.
Thus we may approximate |f | by polynomials in f . Since A is closed, we deduce that
|f | ∈ A. For an arbitrary f ∈ A \ {0}, let c = kf k−1
∞ . We see that cf ∈ A and kcf k∞ = 1,
whence |cf | ∈ A, and so |f | ∈ A.
Step 3: A is closed under taking maxima and minima of two functions. Suppose that
f, g ∈ A. Observe that
1
max{f (x), g(x)} = f (x) + g(x) + |f (x) − g(x)| ∀x ∈ X,
2
and so max{f, g} ∈ A. Similarly, min{f, g} ∈ A.
Step 4: A contains “bump functions” at every point. Suppose that K is a compact
subset of X and p ∈ X \ K. Because A is a vector space containing constants, and A
separates points, for each q ∈ K, there exists a function fq ∈ A such that fq (p) = 1 and
fq (q) = −1; then the set {x ∈ X : fq (x) < 0} is open and contains q. The various sets
{x ∈ X : fq (x) < 0}, where the index q ranges over K, form an open cover of K, and since
K is compact, there exists a finite subcover, indexed by q1 , q2 , . . . , qn , say. Set
bp = max{min{fq1 , fq2 , , fqn , 1}, 0}.
Then bp ∈ A, and bp (p) = 1 while bp (x) = 0 for all x ∈ K. Further, 0 ≤ bp ≤ 1.
Step 5: A = C(X). Take f ∈ C(X) and ε ∈ R+ . We shall approximate f by a function
in A to within ε. Since A is closed, this implies that A = C(X), as required.
Because A contains constants, it suffices to approximate f + λ1, for some real λ. Thus
we may assume that f ≥ 0. We will construct g ∈ A such that f − ε < g < f + ε.
For each p in X, define K = {x ∈ X : f (x) ≤ f (p) − ε}, and consider the bump
function bp constructed in Step 4. If x ∈ K, then f (p)bp (x) = 0 < f (x) + ε, while if x ∈
/ K,
then
f (p)bp (x) ≤ f (p) < f (x) + ε,
by definition of K, and so
f (p)bp (x) < f (x) + ε ∀x ∈ X.
Finally, the function x 7→ f (p)bp (x) − f (x) is continuous and takes the value 0 when x = p,
and so the set {x ∈ X : f (p)bp (x) − f (x) > −ε} is open and contains p. Thus the various
sets {x ∈ X : f (p)bp (x) − f (x) > −ε}, as the index p ranges over X, form an open cover
of X, and since K is compact, there exists a finite subcover, indexed by p1 , p2 , . . . , pm ,
say. Set
g = max{f (p1 )bp1 , f (p2 )bp2 , . . . , f (pm )bpm }.
46 7. APPROXIMATION IN SPACES OF CONTINUOUS FUNCTIONS
Then g ∈ A, and on the one hand, g < f + ε since each f (p)bp < f + ε, while on the other
hand, each x ∈ X belongs to at least one of the sets {x ∈ X : f (pj )bpj (x) − f (x) > −ε},
and hence g(x) ≥ f (pj )bpj (x) > f (x) − ε.
This theorem has many important consequences, many of which were already known
before it was proved, but with a variety of different proofs, so it allows us to present a
unified approach.
Corollary 7.4. The set of trigonometric polynomials t 7→ n∈Z an e2πint (where only
P
finitely many an are nonzero) is dense in the set of continuous 1-periodic functions on R.
Proof. Let T denote the unit circle in C. By the Stone–Weierstrass theorem, the
algebra of all polynomials in the usual coordinates x and y on C is dense in C(T ). However,
every polynomial in x and y may also be written as a polynomial in z and z̄, and z z̄ = 1
on T so all monomials of the form z j z̄ k may be simplified to z j−k or z̄ k−j , depending on
whether j ≥ k or k ≥ j. We conclude that the set of all polynomials of the form n∈Z an z n
P
(where only finitely many an are nonzero) is dense in C(T ).
Now E : t 7→ e2πit is a continuous surjection from R to T , so f 7→ f ◦E maps continuous
functions on T to continuous 1-periodic functions on R. P It is easy to see that this map is
and that the trigonometric polynomial t 7→ n∈Z an e2πint corresponds to the
a bijection, P
polynomial n∈Z an z n on T .
for all trigonometric polynomials p. Since the set of trigonometric polynomials is dense in
C(T ), it follows that
Z
f (t) g(t) dt = 0
I
for all g ∈ C(T ).
From the theory of measure and integration, we know that for each f ∈ L1 (I) and
ε ∈ R+ , there exists f1 ∈ C(T ) such that kf − f1 k1 < ε. It follows that
Z Z Z
f1 (t) g(t) dt = f (t) g(t) dt − [f − f1 ](t) g(t) dt
I
ZI I
It follows that kf1 k1 < ε, and then that kf k1 < 2ε; since ε is arbitrary, we deduce that
f = 0 almost everywhere.
APPENDIX A
You should know the following definitions and facts from Analysis. Suppose that (M, d)
is a metric space. First, for p in M and r in R+ , the open ball B(p, r) is the subset of M
given by
B(p, r) = {q ∈ M : d(q, p) < r}.
We say that a subset S of M is open if S is empty, or if S may be written as a union of
open balls; equivalently, given any point p ∈ S, there exists r ∈ R+ such that B(p, r) ⊆ S.
A subset S of M is closed if its complement M \ S is open.
Any union of open sets is open and any intersection of closed sets is closed. In any
topological space M , the closure of a subset S is the smallest closed subset of M that
contains S; it is the intersection of all closed subsets of M that contain S. A subset S of
M is dense if its closure is M itself.
In metric spaces, topological concepts such as continuity may be expressed using open
sets or sequences. For instance, a function f is continuous provided that f −1 (U ) is open
whenever U is open, or equivalently, provided that f (xn ) → f (x) whenever xn → x.
Similarly, K is a compact set if every cover of K by open sets has a finite subcover, or
equivalently if every sequence in K has a convergent subsequence. To solve problems about
metric spaces, sometimes one point of view is more convenient, and sometimes the other
is better; we can choose which one to use.
Unfortunately, sequences are not always effective in general topological spaces. We give
some examples of sequences “behaving badly”, and explain how to replace “bad” sequences
by “good” nets.
but we need to clarify what we mean by this limit. We could number the points in N × N
using one copy of N, but this would obscure the natural role of the indices M and N .
Example A.3. Let I be the standard interval [0, 1]. Consider the space S = {0, 1}I of
all functions from I to the discrete two-point set {0, 1}. We give S the topology in which
the open sets are the empty set, together with the arbitrary unions of the sets
{f ∈ X : f (t1 ) = v1 , f (t2 ) = v2 , . . . , f (tn ) = vn },
where {t1 , t2 , . . . , tn } is a finite subset of I and each of the values v1 , v2 , . . . , vn is either 0
or 1.
Suppose that g, gn ∈ S and gn → g in S. If t0 ∈ I, then U = {f ∈ X : f (t0 ) = g(t0 )}
is an open set containing g, and since gn → g, then gn ∈ U for sufficiently large n, that is,
gn (t0 ) = g(t0 ). Since t0 is arbitrary, it follows that if gn → g, then gn → g pointwise.
Tychonov’s theorem shows that S is compact.
4. DEFINITION OF A NET 103
However, consider the function hn in S given by taking hn (t) to be the nth entry bn in
the binary expansion
b1 b2 b 3
t= + + + ...
2 4 8
of t; if t admits two different binary expansions, then we take the expansion in which all
bn are 0 when n is sufficiently large. Suppose that the sequence (hn ) has a convergent
subsequence (hnk ). Then hnk (t) converges for all t ∈ I. But there is a real number t whose
nth binary coefficient bn is given by
(
1 if n ∈ {n2 , n4 , n6 , n8 , . . . }
bn =
0 otherwise.
and the sequence (hnk (t)) does not converge.
Thus compactness cannot be described in terms of convergent subsequences.
One way to get around these problems is to use nets.
4. Definition of a net
Definition A.4. A directed set is a set A with a reflexive and transitive order relation
with the additional property that for all finite subsets F of A, there exists µ ∈ A such
that α µ for all α ∈ F . The element µ is called an upper bound for F .
We write either α β or β α.
Example A.5. The set N, with the usual order, is a directed set. The upper bound of
a finite set may be taken to be the maximum element.
Example A.6. The set Z, with the usual order, is a directed set. The upper bound of
a finite set may be taken to be the maximum element.
Example A.7. The set Z, with the opposite order, that is, m n if and only if m ≥ n,
is a directed set. The upper bound of a finite set may be taken to be the minimum element.
Example A.8. The set N2 , with the product ordering, namely, (M, N ) (M 0 , N 0 )
provided that M ≤ M 0 and N ≤ N 0 , is a directed set. The upper bound of a set
{(M1 , N1 ), (M2 , N2 ), . . . , (Mn , Nn )} may be taken to be (max{Mj }, max{Nj }).
Example A.9. Let S be any set. Then the set F of all finite subsets of S ordered by
set inclusion is a directedSset. The upper bound of the finite subsets F1 , F2 , . . . , Fn may be
taken to be their union nj=1 Fj .
We can omit the condition that the sets are finite in the previous example; the power
set of S, ordered by set inclusion, is also a directed set.
Example A.10. Let p be a point in a topological space M . Then the set N of all
open subsets of S that contain p is a directed set, with the ordering given by reverse set
inclusion, that is, U V when U ⊇ T V . The upper bound of the subsets U1 , U2 , . . . , Un
may be taken to be their intersection nj=1 Uj .
104 A. NOTES ON TOPOLOGY AND NETS
Exercise A.11. Is the power set of M , ordered by reverse set inclusion, a directed set?
Definition A.12. A net in a set S is a function from a directed set A into S; we write
xα rather than x(α) and (xα )α∈A rather than x : A → S. We say that a net (xα )α∈A in a
topological space M converges to x in M , and we write xα → x or limα xα = x, if for each
open set U containing x, there exists γ ∈ A (possibly depending on U ) such that xα ∈ U
whenever α γ. We often write (xα ) rather than (xα )α∈A .
Lemma A.13. A subset S of a topological space M is closed if and only if the limit of
every net of elements of S that converges in M is an element of S.
Proof. Suppose that S is closed, and y ∈ / S. Then M \ S is an open set U containing
y. If (xα ) is a convergent net in S, then no xα is in M \ S, so xα 6→ y.
Conversely suppose that S is not closed. If, for each y in M \ S, we could find an open
set Uy such that y ∈ Uy ⊆ M \ S, then we could write
[
M \S = Uy ,
y∈M \S
and M \ S would be open; this is not possible. Consequently, there exists y0 ∈ M \ S such
that every open set U containing y0 meets S. Write N for the collection of open sets U
containing y0 , ordered by reverse set inclusion. For each U in N , choose xU ∈ U ∩ S. Then
(xU )U ∈N is a net in S, and further xU → y0 ∈ / S.
Example A.14. A net indexed by N, with the usual order, is just a sequence. The net
converges precisely when the sequence converges.
Example A.15. A net indexed by N2 , with the product ordering, is a “doubly in-
dexed sequence”, and, for instance, lim(M,N ) f(M,N ) = f provided that we may make f(M,N )
close to f by taking both M and N large. If lim(M,N ) f(M,N ) = f , then it follows that
limk→∞ f(m(k),n(k)) = f for any increasing functions m and n from N to N.
Example A.16. Let F be the collection of all finite subsetsPof N, ordered by inclusion.
For a real or complex sequence
P (a n ) and F ∈ F, define s F = n∈F an . Then lim sF = s
precisely when the series an converges unconditionally to s.
The topology on a space M determines the convergent nets in M , by Definition A.12.
Conversely, if we know the convergent nets in M , then we can determine the topology, for
instance, by using Lemma A.13 to determine the closed sets.
5. Subnets
Definition A.17. Suppose that A and B are directed sets and that (αβ )β∈B is a net
in A such that αβ 0 αβ whenever β 0 β and, for all γ ∈ A, there exists β ∈ B such that
αβ γ. Then (xαβ )β∈B is a subnet of the net (xα )α∈A .
A subsequence is a subnet, but the converse is not true: (1, 2, 2, 3, 3, 3, . . . ) is a
subnet of (1, 2, 3, . . . ) but is not a subsequence.
5. SUBNETS 105
Definition A.18. We say that x is a limit point of the set {xα : α ∈ A} if for all open
sets U containing x, there is γ ∈ A, which may depend on U , such that xγ ∈ U .
We say that x is a limit point of the net (xα )α∈A if for all open sets U containing x, and
all γ ∈ A, there is α ∈ A, which may depend on γ and U , such that α γ and xα ∈ U .
Example A.19. Suppose that an = tan−1 (n) for all n ∈ Z. Then both ±1 are limit
points of the set {aN : n ∈ Z}, but only 1 is a limit point of the net (an )N ∈Z when Z is
given its standard order.
We prove one sample result on subnets.
Lemma A.20. The point x is a limit point of the net (xα )α∈A if and only if there is a
subnet of (xα )α∈A that converges to x.
Proof. Suppose that x is a limit point of a net (xα )α∈A . Given γ ∈ A and an open set
U containing x, take α ∈ A such that α γ and xα ∈ U , and write α as α(γ,U ) . Write N
for the collection of open sets containing x ordered by reverse set inclusion; then A × N ,
with the product ordering, is a also a directed set, and (xα(γ,U ) )(γ,U )∈A×N is a subnet of the
net (xα )α∈A that converges to x.
Conversely, suppose that there is a subnet (xαβ )β∈B of (xα )α∈A that converges to x.
Then for each γ ∈ A and open set U containing x, there exists β1 ∈ B such that xαβ ∈ U
whenever β β1 , and there exists β2 ∈ A such that αβ γ whenever β β2 . Take an
upper bound β for {β1 , β2 }. Then αβ γ and xαβ ∈ U , and so x is a limit point of the
net (xα )α∈A .
Theorem A.21. A subset S of a topological space M is compact if and only if every
net in S has a subnet that converges to a point of S.
Proof. We omit this proof. It uses the Axiom of Choice.
Example A.22. Consider again the space {0, 1}I of {0, 1}-valued functions on the
interval I, and the sequence hn defined in Example A.3. There there is a subnet of this
sequence that converges pointwise on I. It does not seem to be possible to describe this
subnet simply and explicitly.
APPENDIX B
We use without proof some ideas from the theory of measure and integration.
1. Measure theory
A σ-algebra M of subsets of a set S is a collection of subsets of S that contains ∅ and
S, and is closed under taking complements and countable unions and intersections. The
elements of M are called measurable sets.
A (positive) measure µ is a mapping from M to [0, +∞] that satisfies the conditions
that µ(∅) = 0, and
[ X
µ En = µ(En )
n∈N n∈N
whenever En ∈ M for all n ∈ N and the sets En are pairwise disjoint. The total mass of a
positive measure on S is µ(S).
If S is a topological space, then the Borel σ-algebra is the smallest σ-algebra that
contains all open sets. It is a theorem that “the smallest σ-algebra makes sense.
In many examples, such as Rn , there are simple objects, such as rectangles with sides
parallel to the axes, on which we have a natural notion of measure, and then we can extend
this measure to the Borel σ-algebra.
2. Measurable functions
A function f : S → F is measurable if f −1 (B) ∈ M for all balls B in F. If f is complex-
valued, then it is measurable if and only if its real and imaginary parts are measurable,
and the theory is usually developed with a focus on real-valued functions.
There are various theorems about measurable functions. For example, these form an
algebra under pointwise operations, the supremum and infimum of a sequence of measurable
functions is measurable, and the pointwise limit of a pointwise convergent sequence of
measurable functions is measurable. These properties make measurable functions much
more versatile that Riemann integrable functions.
107
108 B. NOTES ON MEASURE AND INTEGRATION
3. Integrals
Given a measure space, that is, a triple (S, M, µ) as above, we may define the integral
of a nonnegative measurable function f : S → [0, +∞] by the formula
Z X X
f dµ = sup{ an µ(En ) : an 1En ≤ f },
S n∈N n∈N
subject to the condition that the four summands on the right hand side are finite. If two
measurable functions differ on a set of measure 0, then they have the same integral.
We define the Lebesgue spaces Lp (S, M, µ), usually abbreviated to Lp (S, µ) or just
Lp (S), to be the spaces of measurable spaces for which
Z 1/p
p
|f | dµ < ∞,
S
and we identify functions that are equal except on a set of measure 0. The expression on
the left hand side of the inequality above is then the Lp -norm of f (or more precisely, of
its equivalence class). The definition of L∞ (S) is similar.
When the measure space arises from a topological space, then it can be shown that
continuous functions with compact support are dense in the spaces Lp (S), except perhaps
when p = ∞.
4. Complex measures
In general, there are many measures defined on the same σ-algebra of subsets of a space
S. A complex measure is a linear combination (with coefficients ±1 and ±i of four positive
measures on S, each of which has finite total mass. These assign a measure to a general
set that is a complex number.
When the measure space arises from a topological space, then it can be shown that
Z
dµ
S
may be fined for all complex measures and all bounded continuous functions f . The Riesz
representation theorem states that the linear functionals on C0 (S) (the space of continuous
4. COMPLEX MEASURES 109
R
functions on S that vanish at infinity) coincide with the maps f 7→ S
dµ, where µ is an
arbitrary complex measure.