Notes On Functional Analysis PDF
Notes On Functional Analysis PDF
IN MATHEMATICS
L =00
n
UIM OOKAGENCY
Notes on
Functional Analysis
Rajendra Bhatia
Indian Statistical Institute
Delhi
HINDUSTAN
BOOK AGENCY
Published in India by
email: info@hindbook.com
http://www.hindbook.com
All export rights for this edition vest exclusively with Hindustan Book
Agency (India). Unauthorized export is a violation of Copyright Law and
is subject to legal action.
ISBN 978-81-85931-89.0
Preface
These notes are a record of a one semester course on Functional Analysis that
I have given a few times to the second year students in the Master of Statistics
program at the Indian Statistical Institute, Delhi.
I first taught this course in 1987 to an exceptionally well prepared batch of five
students, three of whom have gone on to become very successful mathematicians.
Ten years after the course one of them suggested that my lecture notes could be
useful for others. I had just finished writing a book in 1996 and was loathe to begin
another soon afterwards. I decided instead to prepare an almost verbatim record of
what I said in the class the next time I taught the course. This was easier thought
than done. The notes written in parts over three different years of teaching were
finally ready in 2004.
This background should explain the somewhat unusual format of the book. Un-
like the typical text it is not divided into chapters and sections, and it is neither
self-contained nor comprehensive. The division is into lectures each corresponding
to a 90 minutes class room session. Each is broken into small units that are num-
bered.
Prerequisites for this course are a good knowledge of Linear Algebra, Real Anal-
ysis, Lebesgue Integrals, Metric Spaces, and the rudiments of Set Topology. Tradi-
tionally, all these topics are taught before Functional Analysis, and they are used
here without much ado. While all major ideas are explained in full, several smaller
details are left as exercises. In addition there are other exercises of varying difficulty,
and all students are encouraged to do as many of them as they can.
The book can be used by hard working students to learn the basics of Functional
Analysis, and by teachers who may find the division into lectures helpful in planning
vi
their courses. It could also be used for training and refresher courses for Ph.D.
students and college teachers.
The contents of the course are fairly standard; the novelties, if any, lurk in the
details. The course begins with the definition and examples of a Banach space and
ends with the spectral theorem for bounded self-adjoint operators in a Hilbert space.
Concrete examples and connections with classical analysis are emphasized where
possible. Of necessity many interesting topics are left out.
There are two persons to whom I owe special thanks. The course follows, in spirit
but not in detail, the one I took as a student from K. R. Parthasarathy. In addition
I have tried to follow his injunction that each lecture should contain (at least) one
major idea. Ajit Iqbal Singh read the notes with her usual diligence and pointed out
many errors, inconsistencies, gaps and loose statements in the draft version. I am
much obliged for her help. Takashi Sano read parts of the notes and made useful
suggestions. I will be most obliged to alert readers for bringing the remaining errors
to my notice so that a revised edition could be better.
The notes have been set into type by Anil Shukla with competence and care and
I thank him for the effort.
A word about notation
To begin with I talk of real or complex vector spaces. Very soon, no mention is
made of the field. When this happens, assume that the space is complex. Likewise
I start with normed linear spaces and then come to Banach spaces. If no mention is
made of this, assume that X stands for a complete normed linear space.
The symbol E is used for two different purposes. It could mean the closure of
the subset E of a topological space, or the complex conjugate of a subset E of the
complex plane. This is always clear from the context, and there does not seem any
need to discard either of the two common usages.
There are twenty six Lectures in this book. Each of these has small parts with
numbers. These are called Sections. A reference such as "Section m" means the
section numbered m in the same Lecture. Sections in other lectures are referred
to as "Section m in Lecture n". An equation number (m.n) means the equation
numbered n in Lecture m.
Do I contradict myself?
Lecture 2 Dimensionality 11
Banach Spaces
The subject Functional Analysis was created at the beginning of the twentieth
century to provide a unified framework for the study of problems that involve con-
tinuity and linearity. The basic objects of study in this subject are Banach spaces
and linear operators on these spaces.
1. Let X be a vector space over the field F, where F is either the field I of real
numbers or the field C of complex numbers. A norm ff on X is a function
that assigns to each element of X a nonnegative real value, and has the following
properties:
Examples Aplenty
2. The absolute value is a norm on the space IF, and with this IF is a Banach
space.
3. The Euclidean space IFn is the space of n-vectors x = (xi,... , xi,) with the norm
1x112 := (I xI2)h/2.
j=1
4. For each real number p,1 < p < Oc the space is the space IFn with the p- norm
of a vector x = (x1,... , xn) defined as
j=1
IIxIIoo =
ab<
a +-. bq
p q
for all 1 < p < oo. When p = 2, this is the more familiar Cauchy-Schwarz inequality.
(iii) Use (1.2) to prove the Minkowski inequality
p IIxIIy = II=III
6. Why did we restrict ourselves to p > 1? Let 0 < p < 1 and take the same
definition of II ' III as above. Find two vectors x and y in 1F2 for which the triangle
inequality is violated.
lxii '- (
All the spaces in the examples above are finite-dimensional and are Banach spaces
when equipped with the norms we have defined.
8. Let C[0,1] be the space of (real or complex valued) continuous functions on the
interval [O, 1]. Let
9. More generally, let X be any compact metric space, and let C(X) be the space
of (real or complex valued) continuous functions on X. Let
I=suPlf(x)I
sex
4 Notes on Functional Analysis
It is clear that this defines a norm. The completeness of C(X) is proved by a typical
use of epsilonics. This argument is called the e/3 argument.
Let fr-, be a Cauchy sequence in C(X). Then for every e > 0 there exists an
integer N such that for in, n > N and for all x
f(x) - fm(X)I 6.
So, for every x, the sequence f(x) converges to a limit (in 1F) which we may call
f(x). In the inequality above let m -> oo. This gives
i- f(x)J <
for n > N and for all x. In other words, the sequence fn converges uniformly to
f. We now show that f is continuous. Let x be any point in X and let e be any
positive number. Choose N such that fN(z) - f(z) < s/3 for all z E X. Since IN is
continuous at x, there exists 8 such that (- fN(Y)I <6/3 whenever d(x, y) <8.
Hence, if d(x, y) < S, then
10. For each natural number r, let cr [0, 1] be the space of all functions that have
continuous derivatives f (1), f (2),... , I(r) of order upto r. (As usual, the derivatives
are one-sided limits at the endpoints 0 and 1.) Let
r
f:_ sup fj_0
0<t<1
The space CT [0, 1] is a Banach space with this norm. (Recall that if the sequences
fn and fn converge uniformly on [O, 1] to f,g respectively, then f is differentiable
and f' = g.)
11. Now let X be any metric space, not necessarily compact, and let C(X) be the
1. Banach Spaces 5
Sequence Spaces
13. Let c be the subspace of that consists of all convergent sequences. Use an
e/3 argument to show that it is a closed subspace of 4,.
Let c0 be the collection of all sequences converging to 0. This is also a closed
linear subspace of .
We use the symbol c00 to denote the collection of all sequences whose terms are
zero after some stage. This is a linear subspace of but is not closed. The space
co is the completion of coo (the smallest closed space in . that contains coo).
14. For each real number 1 < P < oo, let 4 be the collection of all sequences
x = (xl,x2,...) such that >=i 1<00.
(i) Use the convexity of the function f(t) = t on [0, oo) to show that 4 is a vector
space.
(ii) Note that 4 C co C F.,.
(iii) The inclusions in (ii) are proper. (Consider the sequence with terms xn =
(iv) The space 4 for any 1 < p < oo is not closed in Pte.
(v) For x e ep, define
00
:= (
j=1
IxI)'.
6 Notes on Functional Analysis
Show that this is a norm. Imitate the steps in Example 4. Some modifications are
necessary. The Holder inequality (1.2) is now the statement: if x E 4 and y E t9,
then their termwise product xy is in Ll and the inequality (1.2) holds. With this norm
Qp is a Banach space.
(vi) Let 1 <p <p' <00. If the series E x P converges, then so does E Ix '. Thus
the vector space PP is contained in 4'. Further, for every x E 4 we have
This inequality can be proved as follows. Assume first that lix lip = 1. Then ix < 1
for all j, and hence, 1x3 I" < x3 P This shows that
EIxi' lxii'= 1,
Lebesgue Spaces
15. Let I be the interval [O, 1] with the Lebesgue measure . Let X be the collection
of all bounded measurable functions on I, and for f e X let
Then X is a Banac;h space. (To prove completeness, recall that uniform convergence
of a sequence fis enough to ensure that the limit f is measurable.)
16. Since sets of measure zero are of no consequence, it is more natural to consider
essentially bounded functions rather than bounded ones. Let f be a measurable
function on I. If there exists an M > 0 such that
The collection of all (equivalence classes of) such functions is the space It
is a Banach space with this norm.
17. For 1 <p < oo, let L[0, 1] be the collection of all measurable functions on [0, 1]
for which fo lis finite. Then L[0,1] is a vector space and
IIflIv :- (fe' l
is.a norm on it. To prove this, one uses versions of Holder and Minkowski inequalities
(1.2) and (1.3) in which sums are replaced by integrals.
The completeness of L[0, 1] is standard measure theory. The assertion that
L[0, 1] is complete is called the Riesz-Fischer Theorem. (Warning: There are other
theorems going by the same name.)
18. The interval I can be replaced by a general measure space (X, S, 1u) in which X
is a set, S a a-algebra of subsets of X, and p any measure. The spaces Lp(X, S, p),
1 < p < oc, can then be defined in the same way as above. (It is often necessary to
put some restrictions like a-finiteness to prevent unruly behaviour of different sorts.)
When X = N, and p is the counting measure, we get sequence spaces.
If p(X) is finite, and 1 < p <p' < oc, then the space is a linear subspace of
L. In this case we have
Ill lIP p(X)1/p-I1P If i (1.5)
for all f E Lp'. (This can be seen using the Holder inequality, choosing one of the
functions to be identically l.) This is just the opposite of the behaviour of sequence
spaces in Example 14.
If (X) = oo, no inclusion relations of this kind can be asserted in general.
8 Notes on Functional Analysis
Separable Spaces
A metric space is called separable if it has a subset that is countable and dense
Separable Banach spaces are easier to handle than nonseparable ones. So, it is o
interest to know which spaces are separable.
19. The space C[0, 1] is separable. Polynomials with rational coefficients are dens<
in this space.
20. For 1 < p < oc, the space Cpl is dense in p. Within this space those that havE
rational entries are dense. So the spaces .gyp, 1 < p < oc are separable.
21. The space . is not separable. Consider the set S of sequences whose term;
are 0 or 1. Then S is an uncountable subset of . (It is uncountable because ever
point in the unit interval has a binary decimal expansion and thus corresponds to
unique element of S.) If x, y are any two distinct elements of S, then Ix - y f =1
So the open balls B(x,1/2), with radii 1/2 and centred at points x e S, form ar
uncountable disjoint collection. Any dense set in must have at least one point it
each of these balls, and hence can not be countable.
The subspace c0 of is separable (coo is dense in it) as is the subspace c (consider
sequences whose terms are constant after some stage).
22. For 1 <p < oo, the spaces L[0, 1] are separable. Continuous functions are dens
in each of them. The space L [0, 1] is not. (Consider the characteristic functions o
the intervals [0, t], 0 < t < 1).
23. What about the spaces S, C)? These can not be "smaller" than the space:
(X, S, ). If we put d(E, F) = (EzF), where EOF is the symmetric difference o
the sets E and F, then d(E, F) is a metric on S. It can be proved (with standard bu
elaborate measure theory) that for 1 <p < oc, the space Lp(X, S, c) is separable i
and only if the metric space (S, d) is separable. Further, this condition is satisfied i
and only if the a-algebra S is countably generated. (The statements about am
1. Banach Spaces 9
L[0,1], 1 < p < oo are included in this more general set up.)
More examples
24. A function f on [0, 1] is said to be absolutely continuous if, given > 0, there
exists (5> 0 such that
hf(tf(t)t <
i=1
for every finite disjoint collection of intervals { (ti, ti) } in [0, 1] with >'_ 1 Iti - ti I < b
The Fundamental Theorem of Calculus says that if f is absolutely continuous, then
it is differentiable almost everywhere, its derivative f' is in L1 [0, 11, and f(t) _
fo f/(s)ds + f(0) for all 0 < t < 1. Conversely, if y is any element of L1[0,1], then
the function Gdefined as G(t) = fo g(s)ds is absolutely continuous, and then G' is
equal to g almost everywhere.
For each natural number r, let LPT [0, 1] be the collection of all (r - 1) times
continuously differentiable functions f on [0, 1] with the properties that fir-1> is
absolutely continuous and f(r) belongs to L[0, 1]. For f in this space define
If +IU(') IIP'
Then LT [0, 1], 1 < p < oe is a Banach space. (The proof is standard measure
theory.) These are called Sobolev spaces and are used often in the study of differential
equations.
25. Let D be the unit disk in the complex plane and let X be the collection of all
functions analytic on D and continuous on its closure D. For f in X, let
Then X is a Banach space with this norm. (The uniform limit of analytic functions
is analytic. Use the theorems of Cauchy and Morera. )
10 Notes on Functional Analysis
Caveat
We have now many examples of Banach spaces. We will see some more in the
course. Two remarks must be made here.
There are important and useful spaces in analysis that are vector spaces and have
a natural topology on them that does not arise from any norm. These are topological
vector spaces that are not normed spaces. The spaces of distributions used in the
study of differential equations are examples of such spaces.
All the examples that we gave are not hard to describe and come from familiar
contexts. There are Banach spaces with norms that are defined inductively and are
not easy to describe. These Banach spaces are sources of counterexamples to many
assertions that seem plausible and reasonable. There has been a lot of research on
these exotic Banach spaces in recent decades.
Lecture 2
Dimensionality
Algebraic (Hamel)Basis
holds if and only if a1 = a2 = ... = a,z = o. A (finite) sum like the one in (2.1) is
called a lzTiear combination of x1, ,
Zorn's Lemma
4. Exercises. (i) Use Zorn's Lemma to show that every vector space X has an
algebraic basis. (This is a maximal linearly independent subset of X.)
2. Dimensionality 13
(ii) Show that any two algebraic bases of X have the same cardinality. This is called
the dimension of X, written as dim X.
(iii) If B is an algebraic basis for X then every element of X can be written uniquely
as a linear combination of elements of B.
(iv) Two vector spaces X and Y are isomorphic if and only if dim X = dim Y.
5. The notion of an algebraic basis is not of much use in studying Banach spaces
since it is not related to any topological property. We will see if X is a Banach space,
then either dim X <00 or dim X > c, the cardinality of the continuum. Thus there
is no Banach space whose algebraic dimension is countably infinite.
6. Let {x} be a sequence of elements of a Banach space X. We say that the series
0o N
converges if the sequence sN = xn of its partial sums has a limit in X.
n=1 n=1
8. If {x} is a Schauder basis for a Banach space X, then the collection of all finite
N
sums anxn, in which a,t are scalars with rational real and imaginary parts, is
ri=1
dense in X. So, X is separable. Thus a nonseparable Banach space can not have a
Schauder basis.
For n = 1, 2, ... , let e72 be the vector with all entries zero except an entry 1 in
the nth place. Then {e} is a Schauder basis for each of the spaces p < 00,
and for the space co.
14 Notes on Functional Analysis
9. Is there any obvious Schauder basis for the space C[O,1] of real functions? The
one constructed by Schauder is described below.
as follows. Let f(r) = 0 if j <n, f(r) = 1, and let f n be linear between any two
neighbours among the first n dyadic rationals. Draw the graphs of f, f 4 and 15.
Show that every element g of C[0, 1] has a unique representation g = I ai fi :
n
(iv) draw the graph of >ajfj;
i=1
(v) since the sequence ri is dense in [0, 1], these sums converge uniformly to g, as
n -* oo.
Note that = 1 for all n. Thus we have a normalised basis for C[0, 1].
Equivalence of Norms
11. Let ( (I and II II' be two norms on a vector space X. We say these norms are
equivalent if there exist positive real numbers C and C' such that
12. Let x1,... , x7 be orthonormal vectors in the Euclidean space CT. Then for all
scalars al,... , an ,
The next lemma provides a good working substitute for this. It says that if x1,... , xn
are linearly independent vectors in any Banach space, then the norm of any linear
combination a1 x1 an xn can not be too small.
Proof. Divide both sides of the inequality (2.3) by al I + + Ian I. The problem
reduces to showing that there exists C, such that if lajI = 1, then
anm)
If this were not the case, for each positive integer m there would exist a(im),... ,
with > I = 1 such that
1
liaim) xi+.. .+.a (rn) xnll (2.4)
C
m
This contradicts (2.4) which says that aim) xi + + anm) xn converges to zero as
13. Theorem. Any two norms on a finite dimensional vector space are equivalent.
This is a norm on X. Let II ' II be any other norm. By the Lemma in 12, there exists
a constant C such that
lixil >_ cllxlil.
14. Exercises. (i) Consider the space Cn with the p-norms 1 < p < oc. Given two
indices p and p', find the smallest numbers Cp,p such that
Local Compactness
15. By the Heine-Borel Theorem, the closed unit ball {x : IxIf2 < 1} is a compact
subset of fin. It follows that any closed ball (with any centre and radius) is compact.
A topological space in which every point has a neighbourhood with compact
closure is called locally compact. This property is the next best thing to compactness.
We have seen that all norms on afinite-dimensional space are equivalent. So,
the Heine-Borel Theorem says that all finite-dimensional normed spaces are locally
compact. We will see that no infinite-dimensional space has this property.
dist(x,M) t.
(The distance of a vector x from a subspace M is the number dist (x, M) = inf {IIx -
mfl: m E NI}.)
Proof. Choose any vector u not in M, and let d = dist (u, M). Since M is closed,
d> 0. For each 0 < t < 1, d/t > d. Hence, by the definition of d, there exists
Xo e M such that
d< Mu-xoI < d/t.
''o
If Xt :_ IIu-x II
, then for all x E M
1
Mx-xtII = -
Mu-xoM
II(IuxoM)xu+xoM
1 IL.
III-Tollu-jjl0
18 Notes on Functional Analysis
where xl = lu - xo lix + xo. Note that for each x in M the vector xl is also in M.
So, by the definition of d
IlluxoM > t. d
X = {feC[O,1}:f(O)=O}
i
M = {feX:ff=O}
0
18. Theorem. In any infinite-dimensional normed linear space the closed unit ball
cannot be compact.
Proof. Choose any unit vector x1 in X and let M1 be its linear span. By Riesz's
Lemma, there exists a unit vector x2 such that dist(x2, M1) > 1/2, and hence,
I Ix2 - x 1 i i > 1/2. Let M2 be the linear span of x l and x2. Repeat the argument.
This leads to a sequence xof unit vectors at distance greater than 1/2 from each
other. So, the unit ball is not compact.
Quotient Spaces
1. Let X be a vector space and Al a subspace of it. Say that two elements x and y
of X are equivalent, x y, if x -- y E Al. This is an equivalence relation on X. The
cosec of x under this relation is the set
=x+M:={x+m:mEM}.
Let X be the collection of all these cosecs. If we set
x+y,
written as X/M.
2. Let X be a nonmed linear space and let Al be a closed subspace. Let X = X,M
and define
=dist (x,M) = inf IIx-mI!.
iiiEM
20 Notes on Functional Analysis
IIx+mII.
ti
We will show that if X is complete, then so is X.
4. Theorem. Let X be a Banach space and M a closed subspace of it. Then the
quotient X/M is also a Banach space.
The coset y is a natural candidate for being the limit of the series > xn. Indeed,
N N
n=1 n=1
N
inf
mEMll n=1
x-y-mII
N N
c Ixn-y-rnM
a=t n=t
N
II
(x - m) - yIt.
n=1
3. New Banach Spaces from Old 21
The right hand side goes to zero as N -* oo. This shows xn is summable.
5. Exercises. (i) Let X be the Euclidean space Cn and let M = 1 < k <n.
Show that X/M is isomorphic to the orthogonal complement of Ck.
(ii) Let X = C[0,1] and let M = {f : f(O) = 0}. Show that X/M is isomorphic to the
scalars C. (Identify the scalar z with the constant function with value z everywhere.)
and hence, every bounded linear operator is a continuous map from X into Y. (If
xn -p x in X, then Axn -p Ax in Y)
9. Each m x n matrix gives rise to a bounded linear operator from (C into Cm.
Each element u of Cn gives rise to a linear functional via the map x H u.x, where
u.x is the dot product.
Let X = L[0, 1]. Then the map f --* fo f(t)dt is a bounded linear functional.
10. Let X = L2[0,1]. Let K(x, y) be a measurable function on [0, 1] x [0,1] such that
p1 pi
Jo Jo I
By
y < 00 a.e.x,
f 1 K(x, y).f
0
f 1 i1 I
0 0
<00 a.e.x.
Thus Kf is a well defined bounded measurable function and hence is in L2[0,1]. So,
the map f --> Kf is a linear operator on L2[0,1]. It is easy to see that it is a bounded
3. New Banach Spaces from Old 23
linear operator.
Indeed
IIKfII2 IIll M.
There is nothing special about [0,1] here. It could be replaced by any bounded
or unbounded interval of the real line. The square integrability condition (3.3) is
sufficient, but not necessary, for the operator in (3.4) to be bounded.
11. Let X = C[0,1] and let K(x, y) be a continuous function on [0,1] x [0, 1]. For
f e X, let K f be a new function defined as
f1
(Kf)(x) = K(x, y) f (y)dy. (3.5)
J
Show that f --> K f is a bounded linear operator on X.
The condition that K(x, y) is continuous in (x, y) is sufficient, but not necessary,
to ensure that the operator K is bounded. For example the operator K in (3.5) is
bounded if lim fo K(x, y) 0.
The operators K defined in (3.4) and (3.5) are said to be integral kernel operators
induced by the kernel K(x, y). They are obvious generalisations of operators on finites
dimensional spaces induced by matrices. Many problems in mathematical physics
are solved by formulating them as integral equations. Integral kernel operators are
of great interest in this context.
12. Let X, Y, Z be normed linear spaces, if A E 13(Y, Z) and B E B(X, Y), then
AB E 13(X, Z) and
IIABII MAIl IIBII. (3.6)
The space ,Ci(X, X) is written as 8(X) to save space (and breath). It is a vector
space, and two of its elements can be multiplied. The multiplication behaves nicely
with respect to addition: A(B + C) = AB + AC, and (A + B)C = AC + BC.
13. If X is any normed linear space and Y a Banach space, then 13(X, Y) is a Banach
24 Notes on Functional Analysis
14. Let A be a vector space. We say A is an algebra if there is a rule for multiplication
on A that satisfies four conditions
If A has a norm that is submulticative; i.e., llabll c Ilall libli, then we say that A
is a nonmed algebra. If A is complete in the metric induced by this norm, then A is
called a Banach algebra.
Our discussion above shows that if X is a Banach space, then 13(X) is a Ba-
nach algebra. The study of Banach algebras has been a major theme in Functional
Analysis.
15. Exercise. Let A be an n x n matrix. Let X be the space Cn with the p norm,
where p = 1, or oc. Regard A as a linear operator on X and find an expression for
its norm in each of the two cases.
3. New Banach Spaces from Old 25
16. The space 13(X, 1F) is called the dual space of X. The symbol X* is conventionally
used for it and its elements are called (bounded) linear functionals on X.
17. We should emphasize that we will use the word functional to mean a bounded
linear functional.
If X is any vector space, then the algebraic dual of X is the vector space X'
consisting of all linear functional on X. If X is finite-dimensional and e1,... , en is a
basis for it, then the linear functionals fi,... , fn defined by f1(e3) = 5j constitute a
basis for the dual space X'. Thus X and X' are isomorphic vector spaces.
Let X be an infinite-dimensional vector space with a Hamel basis {e1, e2, ... } .
Now the linear functionals {fi, 12,. . . } defined as above no longer constitute a Hamel
basis for X'. (Consider the linear functional f defined as f(e2fl) _ li f(e2+1) _ 0,
for all n.) In fact the dimension of X' is uncountable. Thus for infinite-dimensional
spaces the concept of algebraic dual is not as useful as in finite dimensions. For
Banach spaces, the dual X" (sometimes called the topological dual) turns out to be
a very useful object.
Two examples
Then Ff is called the Fourier transform of f. F is a bounded linear map from Li (][8)
into L00(II8). Its norm is It is a basic fact in Fourier analysis that J' induces
a bijective map of L2(II8) onto itself and in this case its norm is 1. Note that the
Fourier kernel is not square integrable on 11 8 x R.
K(x,y) e_
on Il8+ x ][8+. The associated integral operator on L2(1[8+) is called the Laplace trans-
form G; i.e.,
(Cf)(x) = e-may f (y)dy. (3.10)
J
Like the Fourier kernel, the kernel (3.9) too is not square integrable. Like the Fourier
transform, the Laplace transform is a bounded operator on L2(II8+) and is bijective.
Let us calculate the norm of this operator on L2(R+).
Let g = Cf. Then
i f 00
Ig(x)12 =
(e_XY/2y_h/4)(e-2y1/4f (y)) dyI2.
Change the order of integration. The integral with respect to x can be evaluated
once again by a change of variables.
00
Jo
hlLf II - (3.14)
lira
The calculation above suggests a candidate. If f(y) = y-1/2, then ,Cf = f. That
seems more than what we need for (3.14). However, the argument is flawed since this
function is not in LZ(][8+). What saves it is the observation that to prove
we do not need (3.14). It is adequate to produce a sequence of functions fn for which
Cfhh
II In II
Exercise. Let 0 < a < b < oo and let 1(y) = y-1/2 when y e [a, 6], and f(y) = 0
outside this interval. Show that for each E > 0 we can choose a and b such that
h112 ? (1 - E)lr 111112. This shows that IIGII =
Lecture 4
The Hahn Banach Theorem (H.B.T.) is called one of the three basic principles of
linear analysis the two others are the Uniform Boundedness Principle and the Open
Mapping Theorem. We will study them in the next three lectures. The H.B.T. has
several versions and several corollaries.
In essence the theorem says that we can extend a linear functional from a sub-
space to all of a vector space without increasing its size.
1. Theorem. Let X be a real vector space and p a sublinear functional on it. Let
Xo be a subspace of X and let fo be a linear functional on Xo such that fo(x) < p(x)
for all x E Xo. Then there exists a linear functional f on X such that f(x) =fo(g)
whenever x, E Xo, and f(x) < p(x) for all x E X.
Proof. The idea is simple. Let x1 be a vector outside X0. We will first extend fo to
4. The Hahn-Banach Theorem 29
the space spanned by Xo and xl, and then use Zorn's Lemma. Let
The representation of each element of X1 in the form x + ax1 is unique. For every
pair x, y in Xo
So,
Let
Then a < b. Choose any number c such that a < c < b. Then for all x E Xo
Let a be any nonnegative real number, and multiply both sides of these inequalities
by a. Then replace ax by x. This gives
then we get a linear functional f' on Xl and f'(Y) < p(y) for all y E X1.
30 Notes on Functional Analysis
Thus we have obtained an extension of fo to Xl. Note this extension is not unique
since it is defined in terms of c, an arbitrary number between a and b. If Xl = X,
we are done. If not, we can repeat the argument above extending Ii to a bigger
subspace of X. Does this process of extending by one dimension at a time eventually
exhaust all of X? We do not know this, and to overcome the difficulty we employ
Zorn's Lemma.
Let F be the collection of all ordered pairs (Y, f) where Y is a subspace of X
that contains Xo, and f is a linear functional on Y that reduces to fo on Xo and is
dominated by p on Y. Define a partial order < on .F by saying that (Y1, fl) < (Y2, f2)
if Y2 is a linear space that contains Yl and 12 = fi on Vi. Let g = {(Ya, fck)}aEA
be a totally ordered subset of 1. Then the pair (Y, g), where Y = UQEA Y and
g(am) =fi(x) for x E Ya, is an element of J' and is an upper bound for G. Therefore,
by Zorn's Lemma, 1' has a maximal element. Let (Y, be this maximal element.
If Y X, then we could extend (Y, by adding one dimension as before. But
then (Y, would not have been maximal. Thus Y = X and if we put f =
f is a linear functional on X with the required properties.
Proof. Regard X as a vector space over Il8 by restricting the scalars to real numbers.
Let go(o) = Re fo(x) for all x e Xo. Then go is a real linear functional on Xo
dominated by the sublinear functional p. So, go can be extended to a real linear
functional g on X dominated by p. Note that
So,
So far we can say only that f is real linear: i.e. f(x + y) = f(x) -}- f(y) and
f(cEx) = a f (x) for a e R. Let a + i/3 be any complex number. Then using (4.1) we
see that
Proof. We will use the versions of H.B.T. proved in 1 and 2. We give the proof for
real spaces and leave the complex case as an exercise.
32 Notes on Functional Analysis
Let p(x) = Clixii This is a sublinear functional. Since fo(x) < p(x) for all
x e Xo, we can find a linear functional f on X that reduces to fo on Xo and such
that 1(x) < p(x) for all x E X.
Since p(-x) = p(x), it follows that f(-x) < p(x); i.e., -f(x) < p(x). So l<
p(x) = CIIxII for all x e X. So the theorem is proved for real spaces.
Proof. Let Xl be the linear span of Xo and xl. Every vector in Xl can be written
uniquely as y = x + axl with x E Xo, a E C. Let fi() = ab. Then fi is a linear
functional on Xl, fl(x1) =band fl(x) = 0 for all x e Xo. If we show h
= 1, the
proposition would follow from the H.B.T.
Let x be any element of Xo and let a 0. Then
5. Exercise. For each nonzero vector x0 in a normed linear space X there exists a
4. The Hahn-Banach Theorem 33
Proof. Choose a countable dense set {f} in X*. For each n, choose xn E X such
that If 1 and f (x1) I > fff. Let D be the collection of all rational linear
combinations of elements of the set {x}. Then D is countable. Its closure D is a
subspace of X. If D X, we can choose a linear functional f on X such that I= 1
and f (x) =0 for all x E D.
Since {f} is dense in X*, there exists a subsequence f.,,, converging to f. Note
that
lItm(sm)I = (fm - f)(Xm)f I-fll
Thus f if -p 0. Since f if - 111ff and f- 1, this is a contradiction.
We will see that i = . So, the converse of the Theorem is not true.
Banach Limits
8. Let be the space of real bounded sequences. A linear functional on this space
is called a Banach, limit if
(i) f(x1x2,...) > 0 if all xn > 0.
(ii) f(x2,x3,...) = f(xi,x2,x3,.
(iii) 1(1,1,1,...) = 1.
We will show that such a linear functional exists.
Consider the subspace c in consisting of all convergent sequences. For an
element x = (Xi, x2,...) of c let fo(s) = lim xn.
This is a linear functional on c. For any x = (xi, x2,...) in define
where the inf is over all choices of positive integers r; k1, ... , kr.
_ (x2,x3,. ..}.
Show that
p(x - Sx) < 0 for all x.
4. The Hahn-Banach Theorem 35
Exercises. (i) A sequence in Q is called almost convergent if all its Banach limits
are equal.
Show that x is almost convergent if
p(x) = -p(-x),
Exercise. Find the Banach limit of the sequence x = (1, 0,1,0,. . .).
The Baire Category Theorem says that a complete metric space cannot be the
union of a countable number of nowhere dense sets. This has several very useful
consequences. One of them is the Uniform Boundedness Principle (U.B.P.) also
called the Banaeh--Steinhaus Theorem.
The U.B.P.
Then
sup sup PA(X) <00.
A
(The hypothesis is that the family {PA} is bounded at each point x; the conclusion
is that it is uniformly bounded over the unit ball of X.)
Since p), are continuous, Cis closed. By the hypothesis X = UnCn. So, by the
Baire Category Theorem, there exists an no such that the set C 0 contains a closed
ball B (xo,r).
Let x be any element of X such that 2 lxii < r. Then the vectors x0 x/2 are
in the ball B (xo, r). Since
x x
x=xo-i-2-(xo-Z)
we have
x x
pa (x) PA (X0 + 2) + pa (xo - 2) < 2no.
If 1 <2r, the proof is over. If this is not the case, choose a positive integer m > 1/2r.
Now if lxii < 1, then ilx/mii <2r, and
2. Corollary. Let X be a Banach space and let {Aa} be a family of bounded linear
operators from X into a normed linear space Y. Suppose for each x E X
sup
n
I<00
(because the terms of the sequence x are zero after some stage). However, 1f1 = n,
and hence
IIfnII = 00.
convergent, hence bounded; i.e., there exists a number K(x) such that
sup
n
I= K(x) <00.
Hence, by the U.B.P., there exists a number K such that
Proof. For each x, the sequence IIAxI is convergent, hence bounded. So, by the
U.B.P. the sequence IIAM is bounded. This is true for IIBII also. Note that
I
- a)x + (a- a)axII
IBM
6. Let X be the Banach space of continuous functions on the interval [-it, it]. The
Fourier coefficients of a function f in X are the numbers
an =
1 f f(t)edt.
27r f
The Fourier series of f is the series
00
a 71eiuzt .
n=-00
One of the basic questions in the study of such series is whether this series converges
at each point t in [-it, it], and if so, is its sum equal to f(t)?
An example to show that this is not always the case was constructed by Du
Bois-Raymond in 1876. The idea was to construct successively worse functions and
take their limit. This is called condensation of singularities and eventually it led to
the discovery of the U.B.P.
Using the U.B.P. it is possible to give a soft proof of the existence of a continuous
function whose Fourier series diverges at some point. A soft proof means that the
messy construction of an explicit example is avoided. Such a proof is given below.
be the partial sum of the series (5.2) at t = 0. For each N, this is a linear functional
on X. We have
ANT - f f(t)DN(t)dt,
where
eint
n=-N
is called the Dirichlet kernel. One can see that
1 sin(N + 1\4
DN (t) _ -
27r sink
and using this
N
, I= oo.
-.r
(5.3)
Note that IIANII f IDN(t)ldt. For a fixed N, let gN(t) = sgn DN(t). This is a
step function and can be approximated by continuous functions of norm 1; i.e., there
exist ,,,,, in X such that IkT= 1 and gN(t) for every t. Hence, by the
Dominated Convergence Theorem
lim AN(gm) _
m- +oo
f 9N(t)Drv(t)dt =f
Thus, IIANII =fI and by (5.3) IAN II is unbounded. Hence by the U.B.P,
there exists an f in X for which Iis unbounded; i.e., the Fourier series of f
diverges at 0.
Exercises.
8. A subset of a metric space is said to be meagre (of first category) if it is the union
of a countable family of nowhere dense sets.
Let X, Y be Banach spaces and let $ be a subset of X3(X, Y). Suppose there
exists a point xo E X such that the set {Axo : A E S} is unbounded. Show that the
5. The Uniform Boundedness Principle 41
set
{xeX:supAx<}
AES
is meagre in X. (Examine the proof of the U.B.P.).
9. For each t in [-7r, 7rJ consider the set of all f in C[-7r, ir] for which the partial
N
sums of its Fourier series aeint
are bounded. Show that this set is meagre in
n=-N
C[-7r, 7r].
10. Show that there exists a continuous function on [-ir, it] whose Fourier series
diverges at each point of a dense set in [-it, it].
Lecture 6
Theorems that tell us that a continuous map is also open under some simple condi-
tions play a very important role in analysis. The open mapping theorem is one such
result.
Proof of the theorem. Let E = A (B(O, 1)), and let F be its closure. The first
step of the proof consists of showing that F contains an open ball By (0, 2E), and the
second step of showing that this implies that E contains the ball By (0, s). We have
observed that this would suffice for proving the theorem.
Since A is a surjective linear map, we have Y = U1 A (B(0, n)) = U 1 n E.
Since the space Y is complete, the Baire category theorem tells us that for some m
the set 7nE = mF has a nonempty interior. Hence F contains some open ball, say
By (yo, 4E). The point yo, being in F, can be expressed as yo = lim Axn , where x-,Z
is a sequence in B(0, 1). The points -xn are also in B(0, 1), and hence -yo is in
F. Thus By (0, 4`) = By (yo, 4E) - yo C F -+ F = 2F, and hence By (0.2E) C F. The
first step is over.
Let y be any point of By (0, Since y e F, there exists a point y, in E such
that IIy - yi. II < s In other words, y - yi is a point of By (0, E) which iii turn is a
subset of 2 F. Repeating the argument, we can find a. point y2 in the set 2 E such that
y - (/11 + y2) is in By (0, E / 2) , a subset of 4 F. Thus we have a sequence y= Ax.
x
where II xrt II < l/211, and II y - (Yl + ' ' + y,l) II < Let x = xn. Then
oc .x x
n=1
lxii <
IL= 1
1/2' -= 2, and y =
n=1 .
y= n=1 Ax n = Ax. We have shown that every
point of By (0, 2E) is in the set 2E. Hence every point of By (0, E) is in E, and this
completes the proof.
3. Remark. The crucial part of the hypothesis is that A is surjective. If the range
ran(A) were always closed, the theorem would be trivial: we would just say if A is
injective then its inverse A-1 from the Banach space ran(A) to X is a bounded linear
44 Notes on Functional Analysis
operator. However, ran(A) is not always closed. For example, let A be the map on
2 that sends the sequence {x} to {x/n}. Then ran(A) contains all sequences {yn}
for which
This example shows that the inverse of a bounded linear operator from a Banach
space onto an incomplete normed linear space need not be bounded. Rephrasing
this example in terms of infinite matrices makes the picture clearer. The operator
A acts on the standard basis {e} as Ae= en/n, and hence it is represented by
the infinite diagonal matrix diag (1,1/2,1/3, ...). Clearly A is injective and IIAII = 1
However, the linear operator A-1 from ran (A) into Q2 corresponds to the diagonal
matrix diag (1,2,3, ...), and is not bounded.
4. If X, Y are vector spaces, then their direct sum X Y is the collection of ordered
pairs (x, y) with x e X, y E Y, and with vector space operations defined as usual. If
X, Y are normed linear spaces, we define
Proof. Let G(A) be the graph of A. It is easy to see that if A is continuous, then
G(A) is closed.
If G(A) is closed, then it is a Banach space (in the space X Y). For an element
(x, Ax,) of G(A), let Pl (x, Ax) _ x, P2 (x, Ax) = Ax. Then Pl, P2 are continuous
linear maps from G(A) into X, Y, respectively. The map Pl is a bijection. So, its
inverse P' is a continuous map from X onto G(A), by the Open Mapping Theorem.
Since A = PZPi 1, A is also continuous.
6. What does this theorem say? Let f be any map from X to Y. To say that f is
continuous means that if xn -* .x in X, then the sequence f(x) converges to a limit
y in Y and y = f(x). The Closed Graph Theorem says that if f is a linear map
between Banach spaces, then to prove its continuity we have to show that if x- x
in X and f(x) -f y in Y, then y = f(x). This makes it easier to check whether a
linear map is continuous.
The assertion of the theorem is not always true if X or Y is not complete. For
example, let Y = C[0,1] and let X be the linear subspace of Y consisting of functions
that have continous derivatives. The derivative map Af = f' is a linear operator
from X into Y. It is not continuous but its graph is closed.
7. Exercise. Let X be a vector space with two norms 11. 1)1 and 11.112, both of which
make it a Banach space. Suppose there exists a constant C such that lix iii < Clix112
for all x. Then there exists a constant D such that f< DlIxiil for all x.
8. Exercise. Let X be a Banach space with a Schauder basis {xn}. Let {an(x)}
be the coefficients of x in this basis; i.e., let x = > an(x)xn. Show that each an is a
bounded linear functional on X.
46 Notes on Functional Analysis
u
[Hint: Consider the space V consisting of all sequences a = (a1 , a2,...) for which
the series a7 x7 converges in X. Define the norm of such a sequence as
71
tl ... to
= fl(t, - t)
z>;
to j .
n
is non zero if t tj. Thus dim c. Since the cardinality of as a set is also c
(why?) if follows that dim = c.
II
iiaxii ianiiixnii iiaii iiXnii = 2iiaii,
the series anxn is convergent and T is a bounded linear map from into X. It
is easy to see that T (a) = 0 if and only if a = 0. So, T is injective. Thus T is an
algebraic isomorphism of onto its range.
13. An isometric isomorphism is a map of one normed linear space onto another
that preserves norms and is a linear isomorphism.
Since i< lxii, Tx E Thus T is a linear map from X into and iiTxii <
I lxii. It remains to show that. iiTxii _ lix ii for all x. Given any x choose a sequence
X,,, in D such that x,n -* x. Then ilxmii -> lix ii and iiTxmii -; iiTxii. But for each
m, IITxmii = Supn i- iixmll. SO iiTxii - iixii.
48 Notes on Functional Analysis
14. The sequence spaces p,1 < p < oo, and co seem more familiar than abstract
Banach spaces since we can "see" sequences. Proposition 13 says every separable
Banach space is (upto an isometric isomorphism) a subspace of 4,. For long func-
tional analysts sought to know whether every infinite dimensional separable Banach
space contains a subspace that is isometrically isomorphic to either co or to some
1 <p < oc. In 1974, B. Tsirelson showed that this is not always so.
Lecture 7
Dual Spaces
The idea of duality, and the associated notion of adjointness, are important in func-
tional analysis. We will identify the spaces X* for some of the standard Banach
spaces.
1. Let f be a linear functional on C. If e1,... , e is the standard basis for Cn, then
the numbers m = f(e3) completely characterise f. The action of I on any element
x = (Xi, x2,... , x7 ) of C' is given by the formula
n
f(x) _
Any vector i = (r1,.. . , ) gives rise to a linear functional f on Cn via this formula.
Thus the vector space dual to Ctm is C itself.
Every linear functional on C is continuous (no matter what norm we choose on
Ca). However, its norm will, of course, depend on the norm we choose for C.
2. Consider the space P,1 < p < oo. We will calculate the norm of a linear
functional f on this space in terms of the vector rj with which f can be identified as
in (7.1).
So,
If II < II11IIq Show that this is an equality by considering the special vector x defined
50 Notes on Functional Analysis
by
0 if 1li = 0
xi. _
I1iN/1i if ri 0.
U(r)I 5 Z, I II IvII
G max UXHi.
So, MI U < r Show that this is an equality by considering the vector :x defined
by
xi =
Ii /rii if IiII
0 otherwise.
(iii) Let p = oo. Once again note that f(x) < ii IxIL. So f U < fShow
that Mill _ llilli.
The conclusion is
(7f) * _ Q for 1 <v < 00.
The dual of
3. The arguments we have given can be pushed to study sequence spaces. A little
extra care is required to handle infinite sums.
Let 1 <p < oc. The standard basis e3 is a Schauder basis for e. For f E , let
= f(e). For each natural number n,
( n
J=1
i
sup lf(x)l C 11f11
Exercise. Show that L1 = L. (The argument is similar to the one used above.)
Thus
L* _ Lq for 1 < p < oc .
4. How about the remaining case p = oo? Since L does not have a Schauder basis
we can not imitate the earlier reasoning. In any case L could not be L1: we have
seen that a nonseparable space can not have a separable dual.
What then is the dual of Lam? We will not calculate it here. There is a very
general theorem called the Riesz Representation Theorem that describes the dual of
the space of bounded continuous functions on a locally compact Hausdorff space X.
(See W. Rudin, Real and Complex Analysis.) When X = N, this space is Lx.
5. Well, if Li is not the dual of Lam, is it the dual of some other space? (Incidentally,
not every Banach space is the dual of another. The space C [O, 1] is not the dual of
any Banach space. We will not prove this fact here.)
Proposition. Ll = co.
Proof. Once again, the standard basis e3 is a Schauder basis for c0. Let i E co
52 Notes on Functional Analysis
Hence r E L1 and lIilI 1 11111. If x is a vector in the space coo and lix Il
then 1< (Since coo is dense in cij, this shows that 11f11 < ft . Hence
11111 _ Ililii. Thus we have an isometric isomorphism between the spaces co and Pl.
6. (i) Show that L1 is the dual of the space c as well. [Hint: Let j = 1, 2, ... ,
be the standard unit vectors and e = (1, 1,1, ...). Then {e, el, e2, ...} is a Schauder
basis for the space c. ]
(ii) Show that the spaces CO and c are not isometrically isomorphic. [Hint: the unit
ball in c has two extreme points (xn - 1 and xn - -1), the unit ball in co has none.]
7. The dual of the space L[0,1] for each 1 < p < oo is L9[0,1]. The proof uses
Holder's inequality for integrals and very standard measure theory arguments.
This statement is true for S, ) with some restrictions on the measure
space (like v-finiteness).
8. Let g be a function on [0, 1]. Let P be a partition of [0, 1] as 0 = to <t1 < <
Let
n
v(g;P) =ig(t) -g(t_i)l,
j=1
and
V (g) = sup v(g; P), (7.3)
where the supremum is taken over all possible partitions P. If V (g) is finite we say
7. Dual Spaces 53
that g is of bounded variation, and then V (g) is called the total variation of g. The
space BV [O, 1] consisting of all such functions is a vector space. Every absolutely
continuous function is in this space. There exists a continuous function that is not of
bounded variation (consider t sin(/t) near zero). A function of bounded variation
need not be continuous (consider characteristic functions of intervals).
If we put II9II = V (g) we get a pseudonorm: all properties of a norm are satisfied
except one; II could be zero without g being zero. Every constant function has
zero total variation; the converse is also true. To get over this we could consider
two functions of bounded variation to be equivalent if their difference is a constant.
The space BV [0, 1] then consists of equivalence classes of functions with respect to
this relation. Alternately, we could modify the definition of llM by putting llM =
g(0) + V(g). In either case we get a normed linear space which is in fact a Banach
space. (Try to prove this.)
g*(f) = ffdg,
where the integral is the Riemann-Stieltjes integral. We have
10. We will show that conversely, every bounded linear functional on C[0,1] arises
in this way.
Let cp E (C[0,1])*. The space C[0,1] is contained in the space B[0,1] consisting
of all bounded functions with the ' norm. By the Hahn-Banach Theorem cp can
be extended to a linear functional on B[0,1] and the extension has the same norm
as p.
Let Xo be the zero function, and for 0 <t < 1 let Xt be the characteristic function
54 Notes on Functional Analysis
of the interval [0, t). Let g(t) = cp(Xt). We will show that g is of bounded variation.
For any partition 0 = to < ti < < tom, = 1,
n n
i=1
(9(t,.) - 9(ti-i)I =
i=1
{g(t) - 9(ti-i)} Sgn [g(t) -
n
= - (xt1)] sgn [g(t) -
2-1
TE
We have here the linear functional cp acting on an element of B[0,1]. The ' norm
of this element is 1 (at any point t only one of the Xti(t) - 1(t) is nonvanishing;
sgnx is a number of modulus 1). Thus v(g;P) < for every partition P, and
hence
V(g) IlIL
Note that g(0) = cp(Xo) = 0 and for each t
Now follow the usual path in constructing integrals. Extend this relation to all step
functions by linearity, and then to all continuous functions by taking limits. We thus
have
(f)=ffdg for all feC[0,1],
and further,
kPM = IIM = V(g).
11. The space BV [0, 1] contains some ill-behaved functions. For example, consider
for 0 < c < 1 the delta function
ap(t) _
1 if t=c
o if tic.
We have V (S) = 2, but f fdS = 0 for all continuous f. In other words
IkII = 0, but 5C = 2.
7. Dual Spaces 55
How does one get over this? One needs to do a little more work than that for
the pseudonorm problem in Section 8.
12. Let g E BV [0, 1] and suppose g(0) = 0. Then there exists a unique function g
in BV [0, 1] satisfying the conditions g(0) = 0, g is right continuous on (0, 1) and
14. The Riesz Representation Theorem. The dual of the space C{0, 1] is
the space BVN[0,1]. Each element g of BVN[0,1] can be identified with a linear
functional g* on C[0,1] by the relation
15. Every real function of bounded variation is the difference of two monotonically
increasing functions.
Let us consider the space CR [0,1] consisting of real continuous functions. We
want to know for what bounded linear functionals cp on this space the function g
associated to it by the Riesz Representation Theorem is monotonically increasing.
(The measure corresponding to a monotonically increasing function is positive; that
corresponding to the difference of two such functions is a signed measure.)
56 Notes on Functional Analysis
16. A function (on any domain) is called positive if it takes nonnegative values; i.e.,
f(x) > 0 for all x. We write this briefly as f > 0.
A linear functional cp on C[0,1] is said to be positive if (f) > 0 whenever f > 0.
(Note cp(f) is a number.)
The study of maps that preserve positivity (in different senses) is an important
topic in analysis.
So,
Thus f'pfI < 'p(1). Since cp(1) < IIlII = this means
A corollary of this is that any extension cp of cp to BR[0,1] obtained via the Hahn-
Banach Theorem is also positive. If not, there would exist an f with 0 < f < 1 such
that (f) <0. Then
'p(f) = ffdg
7. Dual Spaces 57
w(I) = f Id
Exercises
19. For 0 < t < 1 let 't be the linear functional on C[0,1] defined as
Pt(f) = f(t).
Find the function g in BVN[0,1] that corresponds to cot according to the Riesz
Representation Theorem.
20. Show that the space BVN[0,1] is not separable. This is another example of a
situation where a separable Banach space has a nonseparable dual.
Lecture S
Some Applications
fdp11 - fdp as m OG
(In a terminology that we will learn later, this says that the set of probability
measures on [o, l] is weak* compact)
Proof, Let {fj} be a dense set in C [o, l] Since J'fldp.l?,I C IfiII. for all n, the
.
procedure we get a subsequence Paz of psuch that for each j, the sequence f fdp
converges as rn ---pc.
Using an E/3-argument, one can see that for every f in C[0,1] the sequence
fdp, is Cauchv and hence convergent. Let A(f) = lim f fdp7,. Then A is a
linear functional on C [0,1] , A( f ) f < If Ik , and A(1) = 1. By the Riesz Representa-
tion Theorem, there exists a probability measure p such that A(f) = f fdp.
8. Some Applications 59
rnxd,j(i (8.1)
2
The sequence {o1} E is called the Fourier-Stieltjes sequence corresponding to p.
Let N be any natural number, and pick up any complex numbers zp i.
Then
0Cr,s<N---1
` 0 C r, s C N-
2
1T--1
1 - Zrx y
wT d(:r).
G rr
Hence.
ar_s Zr zs ? o.
0Cr,s<N----1
4. The condition (8.2) can be expressed by saying that for all N, the matrices
R
ap a1 a ti
a1 ap a1
a2 a1 a
02 (11
L 011E-1 aN-2 as
tions
(i) a_n =ate, for all n,
(ii) ao > 0,
(iii) Ian < ao for all n.
5. We have seen that the Fourier-Stieltjes sequence associated with a positive finite
measure on [-7r, ii] is a positive definite sequence. One of the basic theorems of
harmonic analysis says that every positive definite sequence arises in this way.
ao=1.
Proof. The inequalities (8.2) are valid for all N, and for all complex numbers
zo,... , zN _ 1. Make a special choice zr = 0 < r < N - 1, where x is any real
number. We have
ar_se2(7'_s)x
0.
0<r,s<N-1
This inequality can be stated in an equivalent form
N-1
(N - I 0.
k=-N+1
For each natural number N, let
N-1
fN(X) (1_)akeikx
k=-N+1
8. Some Applications 61
In particular
ir
1 e-inxdp 1 e-inxd/iN
(x) hm (x)
2ir N-oo 27r -,r
lim
27r I ,r
e-2nx fN (x)dx
lim (1_lnt an
N
an.
Since lani < ao for all n, this series converges in the unit disk D = {z : zI < 1}. For
every z in D we have
2 Re f (z) f(z) + I (z
1_1z12 1 - zz
00 00 00
zmzm
akzk + a-kzk
m=0 k=0 k=1
00 00 00 00
a-kzirtzm+k
akz'n+kzm +
m=0 k=0 m=0 k=1
62 Notes on Functional Analysis
o'o oc 00 00
ar-Szrzs + ar_SzTz-S
s=0 r=s r=0 s=r+ 1
00
-s .
ar_Szrz
r,s=0
This last sum is positive because the sequence {ate,} is positive definite. Thus the
function f defined by (8.4) is a holomorphic map of D into the right half plane
(RHP).
It is a remarkable fact of complex analysis that conversely, if the function f maps
D into the RHP then the coefficients of its power series lead to a positive definite
sequence.
9. What does this theorem say? Let C be the collection of all holomorphic functions
mapping D into the RHP. Constant functions (with values in the RHP) are in C. It
is easy to check that for each t in [-ir, ir] the function
eZt + z
Ht(z) ezt -z
is in C. Positive linear combinations of functions in C are again in C. So are limits
of functions in C. The formula (8.5) says that all functions in C can be obtained
from the family (8.6) by performing these operations. (An integral is a limit of finite
sums.
10. The theorem can be proved using standard complex analysis techniques like
contour integration. See D. Sarason, Notes on Complex Function Theory, TRIM,
8. Some Applications 63
Hindustan Book Agency, p.161-162. The proof we give uses the Hahn-Banach and
the Riesz Representation Theorems.
A trigonometric polynomial is a function
N
g(O) = a -}- an cos n8 + bn sin n8), an, bn E R.
2
n=1
The numbers an, b11 are called the Fourier coefficients of g, and are uniquely deter-
mined by g. The collection of all such functions is a vector space, and is dense in
CR [-ir, 71]. For brevity, we will write
Re f(z) = ao +
n=1
aoao
A(g) = ----- + -
n=i
Note that
Since o Ic7drn is convergent, this shows that the series in (8.8) is uniformly
convergent on [-?r, it]. So, from (8.7) and (8.8), integrating term by term and using
orthogonality of the trigonometric functions, we obtain
It
o
l
[g(9) Re f(re)dO = a2 + 2
_it n=1
64 Notes on Functional Analysis
Hence ir
We then have
A(g) = J g(t)da(t) for all g E
eit-z = 1+
zne_mnt
Hz(t) :_ 1-ze_it
=1+2
n=1
= 1+2z{u(t) -iv(t)}.
n=1
Use (8.9) to get
A(HZ) -CYO + (Ctin -I- ZQn)zn = (lxn -I- 2Qn)z - ZQO = f(z) - i Im f(o).
zi=1 =0
So,
n ezt
Expanding the integrand as the (first) series in (8.10), this gives
da(t)1 +2 e-ineda(t)J zn
f(=) _ [2(c0 - o) + J-
11-n
n=1
ao = 2 J da(t)
n
an = 2 e-Zntda(t).
13. The Riesz-Herglotz Integral Representation plays a central role in the theory of
matrix monotone functions. See R. Bhatia, Matrix Analysis, Chapter V.
Lecture 9
when we say that a sequence fin the space x`[0,1] converges to f, we mean that
fn - f --- o as ii. --- oo; and this is the same as saying fconverges to f uniformly.
There are other notions of convergence that are weaker, and still very useful in anal-
ysis. This is the motivation for studying different topologies on spaces of functions,
and on general Banach spaces.
1. Let S be any set and let (T, U ) he a topological space. Let F be a family of maps
from S into T. The weak topology on S generated by F (or the F-weak topology) is
the weakest. (i.e., the smallest) topology on S for which all f E F are continuous.
2. Examples. 1, Let C[a, b] be the space of all continuous functions on [a, b].
For each x E [a,b] the map E(f) -= f(x) is a map from C [a, b] to C, called the
evaluation map. The weak topology generated by {E . x e [a, h]} is called the
topology of pointwise convergence on C [a, b] .
2. The product topology on 11E or C' is the weak topology generated by the
projection maps i defined as it ( x i , ... , xn ) = x3, 1 j C ii.
9. The Weak Topology 67
3. Now let X be any Banach space and let X* be its dual space. The weak topology
on X generated by X * is called the weak topology on X. For this topology, the sets
where E > 0, k = 1, 2, ... , and fi, 12,. . , fk are in X * , form a neighbourhood base
.
at the point 0. A base at any other point can be obtained from this by a translation.
4. For brevity, members of the weak topology on X are called weakly open sets.
Phrases such as weak neighbourhood, weak closure etc. are used to indicate neigh-
bourhoods and closures in the weak topology.
The topology on X given by its norm is called the norm topology or the strong
topology or the usual topology on X ; the adjective chosen depends on the point of
view to be emphasized at a particular moment.
A sequence xn in X converges to x in the norm /strong/usual topology if
lxn - x l i --* 0. We write this as xn -* x. The sequence xn. converges to x in the
weak topology if and only if f(x) converges to 1(x) for all f e X*. We write this
as xx, and say xconverges weakly to x.
6. Exercise. Show that the norm topology on X is stronger than the weak topology
68 Notes on Functional Analysis
suPif(x)i
suPIF'(.f)i Cf.
Hence by the Uniform Boundedness Principle, there exists a positive number C such
that
suP c,
9. The Weak Topology 69
9. We will use this to show that the weak topology on, 1 <p < oe, can not be
obtained from any metric.
Let en be the standard basis for .gyp, and let
S = {n1/e n = 1, 2. . . }.
This is the collection of all vectors of the form (0, 0,... , nl/(1, 0, ... ), n = 1, 2, ... .
We will show that the set S intersects every weak neighbourhood of 0 in p. If V
is such a neighbourhood, then it contains a basic open set
So, if the set S does not intersect V, then for some j we have e for all n.
This implies that
k
> , for all n.
n1/q
j=1
If y = (Yi, 112,...) is any vector, let us use the notation II for the vector (yi I, Ii I, )
Clearly, if y is in q, then so is y l . For 1 < j < k, each f(j) is in q, and hence so is
their sum f = k 1
f(i) But if the last inequality were true we would have
.
n=1 n=1
n
and that implies f cannot be in Pq. This contradiction shows that S intersects V.
This is true for every weak neighbourhood V of 0. Hence 0 is a weak accumulation
point of the set S.
Now if the weak topology of 4 arose from a metric there should be a sequence
70 Notes on Functional Analysis
Nets
11. We have seen that in a topological space that is not metrisable, sequences
might not be adequate to detect accumulation points. The remedy lies in the intro-
duction of nets. Reasoning with nets is particularly useful in problems of functional
analysis.
A partially ordered set I, with partial order -<, is called a directed set if for all
a, /3 E I, there exists 'y E I such that a - y and ,3 - 'y.
The sets N and III with their usual orders are directed sets. The collection of all
subsets of a given set with set inclusion as the partial order is a directed set. Let
I be the collection of all neighbourhoods of a point x in a topological space X. Say
Nl - N2 if N2 C N1. Then I is a directed set.
12. We say that a net {x}j eventually satisfies a property P, if there exists 'y E I
9. The Weak Topology 71
such that the property P is satisfied by all xa with -y - a. We say that {Xc}EJ
frequently satisfies P, if for each E I, there exists an a such that -y - cr and xa
satisfies the property P.
15. Let {xa}aEI and {y}i3eJ be two nets. We say {x} is a subnet of {y}, if there
exists a function F : I - J such that
(The second condition says that F(a) is eventually larger than each /3 in J.)
18. The Tychonoff Theorem. If {Xa } is any family of compact topological spaces,
then the product topological space rj Xa is compact.
a
19. Warning. All this might suggest that everything is simple. We have to merely
replace the subscript n in xn by cx and pretend nothing else has changed. This is
not so. Here are two of the pitfalls.
(i) A net in a normed space may be convergent without being bounded. (Have
we seen an example already?)
(ii) A sequence may have a convergent subnet without having any convergent
subsequence. (We will soon see an example.)
1. The dual of X* is another Banach space X. This is called the second dual or
the bidual of X. Let J be the map from X into X** that associates with x E X the
element F E X** defined as
4. Let X * be the dual of a Banach space X. The usual topology on X * is the one
generated by its norm. Its weak topology is the weak topology generated by its dual
X**. There is one more topology on X * that is useful. This is the weak topology
on X * generated by the subspace X of X**; i.e., the weakest (smallest) topology on
X * for which every element of X, acting as a linear functional on X *, is continuous.
This is called the weak* topology on X * .
6. The Banach-Alaoglu Theorem. Let X be any Banach space. Then the unit
ball {f E X* : HIM < 1} in the space X* is compact in the weak* topology.
Proof. For each x E X consider the set B_ {z E C: z < xII}. This is a compact
subset of the complex plane. Consider the space
B:= flB
xEX
Thus f is linear. Since f E B, we already know I< HxM. Thus Hill < 1. So
fEB.
7. If X is reflexive, the unit ball of X* is weakly compact. (The weak topology and
the \veak* topology are the same in this case.)
If X = X * (as is the case when X is 2 or L2) then the unit ball of X is weakly
compact.
Recall that the unit ball of any infinite-dimensional space can not be compact in
the strong (usual) topology. This weaker compactness can still be very useful.
It can be proved that a Banach space is reflexive if and only if the weak and the
weak* topologies coincide.
Proof. Let X be the closed unit ball of the dual space X * with the weak* topology.
We have seen that X is compact. Every element x of X can be thought of as a
continuous function on X.
76 Notes on Functional Analysis
Exercises
10. Show that the only linear functionals on X* that are weak* continuous are the
elements of X.
The only linear functionals on X that are weakly continuous are the elements of
X*. (Thus a linear functional on X is weakly continuous if and only if it is strongly
continuous.)
Show that xn w x if and only if {IIx } is bounded and f(x) -> 1(x) for every
element f of a fundamental set in X.
12. Let 1 < p < oo. Show that a sequence {x.} in ,, converges weakly to x
if and only if {IIx I} is bounded and xn converges coordinatewise to x; i.e., if
_ n and x = then for each j the sequence converges
to l as n -j oo.
13. A sequence {fTh} in X * is weak* convergent if and only if {IIfII} is bounded and
{ f(x)} is a Cauchy sequence for each x in some fundamental set in X.
Annihilators
Then S1 is a (closed linear) subspace of X*. This is called the annihilator of S (the
collection of all linear functionals that kill every element of S). If [S] denotes the
closed linear space spanned by the set S, then Sl = [S]l. The notation 51 suggests
orthogonality, and indeed there are several similarities with that notion.
It is easy to see that S1 = {0} if and only if S is a fundamental set in X.
15. Let X/M be the quotient of X by M. The dimension of this space is called the
codimension of M in X. In symbols
Proof. Let X/M be the quotient space, and Q : X -f X/M the quotient map.
The image N = Q(N) is finite-dimensional, and hence closed in X/M. Since Q is
continuous, Q' (N) is closed in X. But Q-1(N) = M + N.
in the sense that either both sides are infinite, or they are finite and equal.
for some v E 11I. Thus the space 111, together with the vectors xl spans X.
Let M be the subspace spanned by M and the vectors xi xj+1,... ,
Then M1, 1 < j < in, are closed subspaces of X containing M. By the Halin-Banach
<
Theorem we can find f in X * such that f(M1) = 0 and f(x) = 1. We thus have
,
This last condition shows that, f are linearly independent. We will see that they
form a basis for M'.
Let f be any element of Al'. For k = 1, 2,... , in, we can write
in
f(XA.) = f(x)f(xk)
j= 1
I= f(x)f.
Thus f, 1 <j < in form a basis for Al'. So dim 1L11 = in.
,
Now suppose codini Al = oo. Choose a vector xl in X not in Al. Let Mi be the
space spanned by Al and x1. Choose a vector x2 in X not in A11. Let. 1112 be the
space spanned by All and x2. Since codini Al = oo, this process can be continued
indefinitely leading to a strictly increasing sequence of closed subspaces
MCM1
The inclusions
X*DM1DMi J...J{0}
10. The Second Dual and the Weak* Topology 79
19. Exercise. If fr,.. , fn are linearly independent elements of X *, then there exist
.
Hubert Spaces
To each vector in the familiar Euclidean space we assign a length, and to each pair
of vectors an angle between them. The first notion has been made abstract in the
definition of a norm, what is missing in the theory so far is an appropriate concept
of angle and the associated notion of orthoyonality. These ideas depend on the in-
troduction of an inner product. Hilbert spaces are special kinds of Banach spaces
whose norms arise from inner products.
They are important for several reasons. Since they have a special structure, more
can be said about them; so we have a richer theory. They are important for applica-
tions: quantum mechanics, signal processing, wavelets. Our intuition works better
with them. Functional analysis leads us to use our geometric intuition to understand
the behaviour of spaces of functions. Typically, to understand convergence of a se-
quence of functions an analyst might draw a picture on paper where the functions
are represented by points; then argue by analogy to extend her reasoning from R2
to a Hilbert space and finally see whether the special nature of Hilbert space can be
dispensed with.
Hilbert space is named after David Hilbert, one of the great mathematicians of
the twentieth century. He used the space 2 in his study of problems concerned with
integral equations. The general theory of Hilbert spaces was developed later by John
von Neumann in 1928 to provide a mathematical framework for quantum mechanics.
Basic notions
elements x, y of X is associated a complex number (x, y), called the inner product of
x and y, that satisfies the following four conditions
(i) (x+y,z) = (x,z) + (y,z) for all x,, y, z E X.
(ii) (ax, y) = a(x, y) for all cx E C.
(iii) (x, y) = (y, x) (the bar denotes complex conjugation).
(iv) (xix) > 0, and (x,x) = 0 if and only if x = 0.
Note that conditions (i), (ii) and (iii) imply that
(v) (x,y+z)=(x,y)+(x,z).
(vi) (x, ay) = a (x,y).
Thus the function (.,.) is linear in the first variable and conjugate linear in the
second.
This convention is followed by almost all functional analysis books, and violated
by almost all physics and matrix theory books. The latter take the inner product to
be conjugate linear in the first variable and linear in the second.
2. A real vector space X is called an inner product space if there is defined a real
function (.,.) on X x X satisfying the properties (i)-(iv). The complex conjugation in
(iii) and (vi) is redundant in this case. We will talk of complex inner product spaces
most of the time. What we say is often true for real inner product spaces. (The
latter parts, involving spectral theory, require the underlying field to be complex.)
3. The space CT' is an inner product space with the usual definition
R
(x,y) =>Xjj.
j=1
(f,9 = f (11.2)
a
11. Hilbert Spaces 83
:_ (x,x)112. (11.3)
5. If the inner product space X with the norm (11.3) (induced by the inner product)
is a complete metric space, we say it is a Hilbert space. An inner product space is
some times called a pre Hilbert space. If it is not complete, then its completion is a
Hilbert space.
We will use the symbol N for a Hilbert space .
(x,y)
j=1
The finiteness of the sum in (11.5) and the integral in (11.6) are consequences of the
Schwarz inequality. The completeness of L2 [a, b] is the Riesz-Fischer theorem.
The space C[a, b] is a dense subspace of L2 [a, b]. It is not a closed subspace.
Exercises
7. The norm in any inner product space satisfies the parallelogram law
9. The norm and the inner product are related by the polarisation identity
4(x, y) = lix + y112 - lix -X112 + ilix + zy112 - 2lix - iyll2. (11.7)
The polarisation identity allows us to recover the inner product from the norm.
10. Suppose X is any normed linear space. From the norm on X we can define a
function (.,.) on X x X by (11.7). Then (x, x) _ lix 112. However, defines an
inner product on X if and only if the norm satisfies the parallelogram law. (A
bit of work is required for the proof of this statement.)
11. Hilbert Spaces 85
11. Let w be a primitive nth root of unity, where n> 2. Show that
n-1
1
(x,y) = - Yw'lx+w'yil2.
Ti'P=o
This is a generalisation of the polarisation identity.
(i) SnS'c{O}.
(ii) S1 is a closed linear subspace of N.
(iii) {O}' = N,N' = {O}.
14. Theorem. Let S be any closed convex subset of N. Then for each x in N there
exists a unique point xo in S such that
Proof. Let d = dist(x, S). Then there exists a sequence yn in S such that Ix-y7111 --
d. By the Appolonius Theorem
Ix - ym
_. 2 (x - (yn yrra)112)
(We have used the convexity of S to conclude 2 (yii + e S.) As 7i, rn - oe, the
left hand side goes to 2d2. This shows {y} is a Cauchy sequence. Since S is closed
xo := hmn yn is in S and
x-xoII =1im:C-y=d.
If there is another point xl in S for which Ix - xl H = d, the sine argument with
the Appolonius Theorem shows that xi = xo.
The theorem says that each point of 'N has a unique best approximant from any
given closed convex set S. This is not true in all Banach spaces. Approximation
problems in Hilbert spaces are generally easier because of this theorem.
15. Especially interesting is the case when S is a closed linear subspace. For each x
in ?N let
where xo is the unique point in S closest to x. Then PS is a well defined map with
range S. If x E S, then PS(T) = x. Thus P,g is idempotent; i.e.,
i.e.,
Re (x-xo,y) =0.
Hence
(x-xo,y) =0.
Thus x - xo is in the subspace S1. Since S fl 51 = {0}, we have a direct sum
decomposition
'/-(= S SI . (11.10)
X = V W,
if V, W are subspaces of X that have only the zero vector in common, and whose
linear span is X. Then every vector x has a unique decomposition x = v + w with
vEV, wEW.
16. Show that the map PS defined by (11.8) is linear, ran PS = S, and ker PS = S'.
(The symbols ran and ker stand for the range and the kernel of a linear operator.)
By the Pythagorean Theorem
IPsII = 1. (11.12)
(The obvious trivial exception is the case S = {0}. We do not explicitly mention
such trivialities.)
88 Notes on Functional Analysis
The map PS is called the orthogonal projection or the orthoprojector onto S. The
space S1 is called the orthogonal complement of the (closed linear) space S. In this
case S11 = S.
17. The notion of direct sum in (11.11) is purely algebraic. If V is a linear subspace
of a vector space X, then we can always find a subspace W such that X is the direct
sum of V and W. (Hint: use a Hamel basis.)
When X is a Banach space it is natural to ask for a decomposition like (11.11)
with the added requirement that both V and W be closed linear spaces.
Let us say that a closed linear subspace V of a Banach space X is a direct
summand if there exists another closed linear subspace W of X such that we have
the decomposition (11.11).
In a Hilbert space every closed linear subspace is a direct summand; we just
choose W = V L . In a general Banach space no obvious choice suggests itself. Indeed,
there may not be any. There is a theorem of Lindenstrauss and Tzafriri that says that
a Banach space in which every closed subspace is a direct summand is isomorphic
to a Hilbert space.
The subspace co in the Banach space is not a direct summand. This was
proved by R.S. Phillips in 1940. A simple proof (that you can read) is given in R.J.
Whitley, Projecting m onto c0, American Mathematical Monthly, 73 (1966) 285-286.
18. Let X be any vector space with a decomposition as in (11.11). We define a linear
map called the projection on V along W by the relation Pv,w (x) = v, where
x=v+w,vEV, wEW. Showthat
Conversely supose we are given an idempotent linear map P of X into itself. Let
ran P = V, ker P = W. Show that we have X = V W, and P = PV,w .
19. Now assume that the space X in Section 18 is a Banach space. If the operator
Pv,w is bounded then V, W must be closed. (The kernel of a continuous map is
closed.)
Show that if V is a direct summand in X, then the projection is a bounded
operator. (Use the Closed Graph Theorem.) Show that Pv,wII > 1.
20. If V is a direct summand in a Banach space X, then there exist infinitely many
subspaces W such that X = V W. (You can see this in R2.) In a Hilbert space,
there is a very special choice W = V 1.
In a Hilbert space by a direct sum decomposition we always mean a decomposition
into a subspace and its orthogonal complement.
We will see later that among projections, orthogonal projections are characterised
by one more condition: selfadjointness.
Self-duality
This can be turned around. Let f be any (nonzero bounded) linear functional on 7-l.
Let S = ker f and let z be any unit vector in S. Note that x - (f(x)/f(z))z is in
90 Notes on Functional Analysis
S. So
2 - f(x) Z. Z) - U,
(x,z) =f(x)
fez)
The fact that I-I and 7(* can be identified via the correspondence y - f',, is
22. The Hahn -Banach Theorem for Hilbert spaces is a simple consequence of the
above representation theorem.
B(x,y) = (x,Ay).
We have lB II _ llAll.
11. Hilbert Spaces 91
24. Earlier on, we had defined the annihilator of any subset S of a Banach space X.
This was a subset S' of X*. When X is a Hilbert space, this set is the same as S-
defined in Section 13.
25. Note that xa converges to x in the weak topology of N if and only if (xa, y) --
(x,y) for ally EN.
Supplementary Exercises
26. Let f be a nonzero bounded linear functional on a Banach space X and let
S = {x e X : f(x) = 1}. Show that S is a closed convex subset of X. Show that.
i
=ES II=II =
Ilf 11
So, if there is no vector x in X for which If II = I /IIxII, then the point 0 has no
best approxirnant from S.
27. Let X = C[0,1] and let Y be its subspace consisting of all functions that vanish
at 0. Let (f) = fo t f(t) dt. Then cp is a bounded linear functional. Find its norm
on X. and on Y. What are the points f in X and in Y for which (f)I/IIf II.
28. Combine Exercises 26 and 27 to show that (the existence part of) Theorem 14
is not always true in all Banach spaces.
29. Let S = {x E X 1,X2 > 0, x1 + x2 = 1}. This is the line segment joining
the points (1,0) and (0.1). Each point of S is at i distance 1 from the point (0,0).
Thus the uniqueness part of Theorem 14 is violated in this Banach space.
02 Notes on Functional Analysis
30. Let V, W be any two subspaces of & not orthogonal to each other. Show that
IPvwII > 1.
31. A function f on 7-1 is called a quadratic form if there exists a sesquilinear form B
on 7-C x 7-1 such that 1(x) = B (x, x). Show that a pointwise limit of quadratic forms
is a quadratic form.
(If B is definite, then it is an inner product and we have proved the inequality in
that case,) Hint : Consider B(x, y) + E%(x, y).
Lecture 1 2
Orthonormal Bases
2. It follows from the Pythagorean Theorem that every orthonormal set is linearly
independent.
3. Let {e, : 1 C i n} be any finite orthonormal set. For each x in 1-i, {x, e } e is the
component of x in the direction of e. One can see that x - (x, e) e is orthogonal
j=1
to each e, and hence to the sum (x, e) e. e. The Pythagorean Tlieoreni then shows
that
fl
(12.1)
3=
This is called Bessel's inequality.
h<E aEJ
94 Notes on Functional Analysis
for every finite subset J of I that contains J0. In this case we write
x= X.
aEI
5. Bessel's Inequality. Let {e}j be any orthonormal set in 7-(. Then for all x
txii. (12.2)
aEI
is countable.
Proof. Let
E = {e: (x,e2> Ixft2/n}.
Then E = U 1 En . By Bessel's inequality the set E7 can have no more than n - 1
elements.
6. Parseval's Equality. Let {ea}Ej be an orthonormal basis in 7-if. Then for each
xElI
x= (x,e)e. (12.4)
aEI
= (x,e2. (12.5)
aEJ
Proof. Given an x, let E be the set given by (12.3). Enumerate its elements as
{el, e2, ...}. For each n, let
n
yn = (x,e)e.
i=1
12. Orthonormal Bases 95
Ifn>m, we have
n
i=m+1
By Bessel's inequality this sum goes to zero as n, m -f oo. So yn is a Cauchy
sequence. Let y be its limit. Note that for all j
n
(x - y,ej) (x, e3) n-
- lim
o0
((x,e)e,e3)
2=1
(x,e) - (x,e) = 0.
If eis any element of the given set {ea}ej outside E, then (x, e8) = 0, and once
again (x - y, e) = 0. Thus x - y is orthogonal to the maximal orthonormal family
{ea}aEl. Hence x = y. Thus
Only countably many terms in this sum are nonzero. (However, this countable set
depends on x.) Further note that
- aElI
n
lim
n-; x
i=1
n
llm Ilx -
n-+oo
(x, e2)ei 2
i=1
0.
7. Let {u, u2, ... } be a finite or countable linearly independent set in 7-(. Then
there exists an orthonormal set {ei, e2,. .} having the same cardinality and the
.
same linear span as the set {u}. This is constructed by the familiar Gram-Schmidt
Process.
96 Notes on Functional Analysis
Proof. A countable orthonormal basis for 7-1 is also a Schauder basis for it. So, if
such a basis exists, 7-1 must be separable.
Conversely, let 7-1 be separable and choose a countable dense set {x} in 7-1. We
can obtain from this a set {un} that is linearly independent and has the same (closed)
linear spare. From this set {u} we get an orthonormal basis by the Gram-Schmidt
process.
9. A linear bijection U between two Hilbert spaces 7-1 and AC is called an isomorphism
if it preserves inner products; i.e.,
Proof. If 7-1 is separable, it has a countable orthonormal basis {en}. Let U(x) _
{ (x, e)}. }. Show that for each x in 7-1 the sequence {(x, is in 2i and U is an
isomorphism.
We will assume from now on that all our Hilbert spaces are separable.
11. Let 7-1 = L2 [-ir, it]. The functions en (t) = 1 eunt, n E Z, form an orthonormal
basis in ?-f. It is easy to see that the family {en } is orthonormal. Its completeness
follows from standard results in Fourier series.
There are other orthonormal bases for ?-C that have been of interest in classical
analysis. In recent years there has been renewed interest in them because of the
recent theory of wavelets.
12. Orthonormal Bases 97
12. Exercises. (i) Let {e} be an orthonormal basis in 7-1. Any orthonormal set
{f} that satisfies
00
Ilen-Snllz<1
is an orthonormal basis. (Hint: If x is orthogonal to {f,} show f< lix 112,
violating Parseval's equality.
(ii) More generally, show that if
00
Iien_fnIi2<oo
7i=1
00
then {f} is an orthonormal basis. (Hints: Choose N such that Iie _ f iI2 <1
7E=N+ 1
Let S be the closed linear span of {fN+1, fN+2,.. }. For 1 < n < N, the vectors
00
9n = en- (en,fm)fm
m=N+1
are in Sl. Show that dim S1 = N. The space S1 is spanned by {gi,... , gnr}
and by {f,... fN}. So, if a vector x is orthogonal to the family {f}, then it is
,
orthogonal to {ei,... , eN }. Use this and Part (i) to show that Parseval's equality
forbids such behaviour.)
13. Metrisability of the unit ball with the weak topology. We have seen that
the weak topology of 2 is not metrisable. However, its restriction to the unit ball is
metrisable.
(i) Let 7-1 be any separable Hilbert space and let {e} be an orthonormal basis
for 7-1. Let B = {x E 'H: IixiI <1}. For x,y E B, let
d(x,y) := i(x-y,en)i.
14. Let 7 = L2 [-1, 1]. Apply the Gram-Schmidt process to the sequence of func-
tions { 1, t, t2,. . . }. The resulting orthogonal functions are
pn(x) _ (t2 _ i)n
dtn
These are called the Legendre polynomials. Show that the family { n + 1/2 Pn}
is an orthonormal basis for 7-1. (For proving the completeness of this system, the
Weierstrass approximation theorem may be useful.)
15. Let 7-(= L2(II8). Apply the Gram-Schmidt process to the family
fn(t) = (_1)ne_t2/2et2 =:
H,i(t)e_t2/2,
n = 0, 1, 2, ... .
dtn
The functions Hn (t) are called Hermite polynomials. Show that the members of
{f(t)} are pairwise orthogonal, and normalise them. Show that the resulting family
is an orthonormal basis for 7-1.
(Hint: To show completeness, we need to show that if
G(z) _ 9(t)e-t2/2eitzdt.
_x
This is an entire function. Use (12.6) to see that G and its derivatives of all orders
vanish at 0. Hence G is zero everywhere. In particular
00
Multiply this equality by e+ZXY, where y is a real number, then integrate with respect
to x from -a to a. This gives
00
g(t)e-t2/2 sin a(t - y)
dt = 0, for all a, y E R.
J
-00
t-y
Conclude that g = 0.)
.fn(t) _ e-t/2Ln(t)
where the value of rk(t) at a discontinuity is taken as the right hand limit. Equiva-
lently, on the dyadic intervals [j/2k+1, (j + 1)/2k+1), 0 < j < 2k+1, rk(t) takes the
value 1 if j is even and -1 if j is odd. The constant function 1 and the functions
rk together are called Rademacher functions. They form an orthonormal family but
not a complete family. (The function cos2irt is orthogonal to all of them.)
This system is included in another family called Walsh functions defined as fol-
lows. Let wo(t) = 1. For n > 1, let
n= nk 2k where nk = 0 or 1
k=0
The functions wn together with the constant function 1 are called the Walsh func-
tions. They are step functions that take the values 1 only. Note that if n = 2k, then
wn = r/. So this family includes the Rademacher functions. In fact it consists of
all finite products of distinct Rademacher functions. Show that the Walsh functions
form an orthonormal basis for 7-(. (Hint: To check orthogonality, observe that if at
least two of the integers k1, k2,... ,kn are distinct, then
1
frki(t)rk2(t) r(t)dt = 0.
0
x
To prove completeness, let f E -l and define F(x) = f f(t)dt. Then F'(x) = f(x)
0
1
18. Gram matrices. Let x1, ... , xn be any vectors in a Hilbert space 71. The
n x n matrix G(x 1, ... , x71) whose i, j entry is (xi, x) is called the Gram matrix of
the given set of vectors. Its determinant is called the Gram determinant.
(i) Every Gram matrix is positive semidefinite; it is positive definite if and only if
the vectors x3 are linearly independent. [Calculate (Gu, u) .]
(ii) Every positive semidefinite matrix is a Gram matrix. [Hint: write aij = (Ae, ej) =
(A1/2e2, A1/2ej) ]
(iii) Let j ,1 <j <n be any positive numbers. Then the matrix whose i, j entry is
00
1 is positive semidefinite. [Hint: A
= Te_(Aii)tdt.]
(iv) Calculate, by induction on n or by some other argument, the determinant of the
matrix in (iii); it has the value
J-
(,\.).)2
(12.7)
1-I (A2+))
1 <i, j <n
19. Let x1, ... , x7z be linearly independent vectors in ?-t, and let M be their linear
12. orthonormal Bases 101
20. We know that the family f(t) = t, n = 0, 1, 2,... is fundamental in L2 [0, 1}.
The following remarkable theorem tells us there is a lot of room here; much smaller
subfamilies of this family are also fundamental.
Mi ntz's Theorem. Let 1 <1 <2 < be any sequence of integers. Then the
family {tnk } is fundamental in L2 [0, 1] if and only if
O 1
n = oo. (12.8)
j=1
Proof. Let Mk be the linear span of the functions tTh1, ... , tnk . The set {tni } would
be fundamental if and only if disc (f, Mk) goes to zero as k -- oc. Since the family
{tm} is fundamental, this is so if and only if for each m dist(tr'2, Mk) goes to zero as
k - oc. By Exercise 19, this is so if and only if
det G(t'n, t'1 , ... , tnk )
lim = 0 for all m. (12.9)
k--goo det G(ti,...
n , tnk)
Note that
i
(ti t3) = [tdt = i + j + 1 1
0
Hence the ratio of the two Gram determinants occurring above can be evaluated
using (12.7). The answer is
k
1 (n3 - m)2 1 k7 (1 - m/n3)2
2m+1 (n+7n+i)2 2m+1 11
j=1
So, the condition (12.9) becomes
k
lim V [log(1 - m) - log(1 + m + 1)J = -oo. (12.10)
n3 n3
102 Notes on Functional Analysis
Since
log(1 + x)
lim =1
x-40 x
the series log(1 + x,t) and x are convergent or divergent simultaneously. Use
this to show that (12.10) is true if and only if (12.8) is.
Linear Operators
Let X, Y be Banach spaces. For a while we will study bounded linear operators from
to Y. These will just be called operators.
Topologies on operators
1. The norm topology. We denote the space of operators from X to Y by 13(X, Y).
This is a Banach space with the norm IIAI sup IAxII. The topology given by
Ilxll=1
this norm is called the usual topology, the norm topology or the uniform operator
topology on ,Ci(X, Y).
F : B (X, Y ) -* Y,
A -- Ax,
13(X, Y) -* C,
A H f(Ax),
where x varies over X and f over Y. If X, Y are Hilbert spaces, then Aa w A if
and only if (Ax,y) - (Ax, y) for all x e X, y e Y.
4. Caution. In Lecture 9, we defined the strong and the weak topologies for any
Banach space. The adjectives strong and weak are now used in a different sense. (The
"strong" topology of the Banach spaces 13(X, Y) is its "usual" topology). For spaces
of operators the words strong and weak will be used in the new sense introduced
here; unless it is stated otherwise.
(ii) Let e1, e2,... be the standard orthonormal basis for 2. Let Pn be the orthog-
onal projection onto the linear span of {ei,... ,e}. Then I - Pn is the orthogonal
projection onto the orthogonal complement of this space. Here P,z - I in the strong
operator topology. But Ill - (1 = 1 for all n. So Pn does not converge to I in the
norm topology.
(iii) The right shift operator S on QZ is defined as follows. Let x = (xi, X2,...) be
any element of 2. Then
i('Snx,y)l C
i=1 i=1
As n -p oo, the last sum goes to zero. So the sequence {S7} converges to zero in the
weak operator topology. However, llSxll _ lix ii for all x and n. So {S} does not
converge to zero in the strong operator topology. Hence it does not converge to any
limit in the strong operator topology, because if it did, then the strong limit would
also be a weak limit, and that can only be zero.
6. The strong operator topology and the weak operator topology are not metrisable.
While convergence of sequences does not reveal all the features of these topologies,
we may still be interested in sequences and their convergence. The Uniform Bound-
edness Principle is the useful tool in these situations.
Exercise. Let {A} be a sequence of operators. Suppose {Anx} converges for each
x. Then there exists an operator A such that An s A.
Proof. The sequence {A} is weakly Cauchy if for each x, y in 7-l the sequence
{ (Ax, y) } is a Cauchy sequence (of complex numbers). Let
B (x, y) = 1
moo
(Ax,y).
A such that
B(x,y) = (Ax,y).
(Ax)(y) - (Ax,y).
By the U.B.P.,
sup l!Anxli < oo for all x.
n
Operator Multiplication
8. Consider the space 8(X). Let A7 ---} A and Bn --} B in the norm topology. Then
show that lIAB7 --- AB II ---+ o. This shows that multiplication of operators is jointly
continuous in the norm topology of 8(X).
9. Let An and B77 be sequences in 8(X) converging in the strong operator topology to
A and B, respectively. Use the U.B.P. to show the sequence {iiAIf} is bounded; and
then show that the product A B71 converges to AB in the strong operator topology.
This argument fails for nets. Hence, it does not follow that multiplication of
operators is jointly continuous in the strong operator topology. In fact, it is not.
13. Linear Operators 107
{A : I
- Ao)xi M < a, 1 < i < n},
(ii) Let {x1,... , x,z, yb... , yn} be a linearly independent set in 7,1 such that
IIy2 - Aoxi M <E for all i. Define an operator A by putting Axe = y2, Ayi = 0 for all
i, and Au = 0 for all u orthogonal to {x1......x, yl , ... , y}. Then A2 = 0. Show
.
(iii) This shows that the set N is dense in 13(7-1) in the strong operator topology.
So, if squaring of operators were a continuous operation, then N would equal 8(7-1).
That, can't be.
Exercise. Here is one more proof of the same fact. Consider the set of all ordered
pairs (M, u) where Al is a finite-dimensional subspace of 7-1 and u a unit vector
orthogonal to M. Define a partial order on this set by saying (M, u) - (N, v) if N
contains M and u. Now define two nets of operators as follows
where xo is a fixed unit vector. Show that both these nets converge to 0 in the strong
operator topology; but their product does not.
10. Let X = 2. We defined the right shift operator S in Section 5(iii). The left shift
is the operator T defined as
Note that for each x, IITxII --> 0. Thus {T} converges to 0 in the strong, and
therefore also in the weak, operator topology. We have seen earlier that {Sn} also
converges to 0 in the weak operator topology. Note that TnS= I for all n. This
example shows that operator multiplication is not continuous (even on sequences) in
the weak operator topology.
11. However operator multiplication is separately continuous in both the strong and
the weak topology; i.e., if a net Aa converges, strongly or weakly to A, then for
each B, Aa B converges to AB in the same sense; and if Ba converges, strongly or
weakly to B, then ABa converges to .AB in the same sense. It is easy to prove these
statements.
2 I- B)enll,
n=1
_ O 1
dw(A, B) 2+I((AB)em,en)L
mn
m,n=1
Show that these are metrics on Xi(N). On each bounded set of 8(N) the topology
given by them is the strong (weak) operator topology.
Inverses
13. Let A E 8(X). If A is bijective, then by the inverse mapping theorem, A-1 is
also in Ci(X). Let G be the collection of all invertible elements of Ci(X). This set is
a multiplicative group. We have (AB)-1 = B-lA-1.
(13.1)
13. Linear Operators 109
This goes to zero as n, m --> oo. So {Sn} is a Cauchy sequence. Hence the series in
(13.1) is convergent. Let T denote its sum. Note that
ASn=Sn-(I-A)Sn=I-(I-A)1
So, by continuity of operator multiplication AT = I. A similar argument shows
TA = I. Hence T = A-1.
(I-A)-1=I+A+A2+ (13.2)
Note that
lI(I-AY'lI
The series (13.1) or (13.2) is called the Neumann Series.
16. The theorem just proved shows that G contains an open neighbourhood of I;
hence it contains an open neighbourhood of each of its points. Thus Q is an open
subset of 8(X).
More precisely, show that if A E g and h- Bhl < 1/h1A-1 , then B E and
I<_ I1-IIA-'IIIIA-BII
This shows that operator inversion is continuous in the norm topology. Thus is a
topological group.
19. This is not true in infinite-dimensional spaces. Let X = Q2 and let S be the right
shift operator. Then S is left-invertible (because TS = I) but not right-invertible
(if it were it would be invertible). We will show that no operator in a ball of radius
one around S is invertible. If MS - All < 1 then
Exercise. The set of right invertible operators (a set that includes G) is not dense
in ,Ci(X). Nor is the set of left invertible operators.
Adjoint Operators
3. Exercise.
(BA)* = A*B*.
(iii) The adjoint of the identity operator on X is the identity operator on X*; i.e.,
I*=I.
4. The conclusion of (i) above is that the map A --> A* from 13(X, Y) to 13(Y*, X*)
is linear; that of (ii) is some times expressed by saying this map is contravariant.
The equation (14.3) says this map is an isometry. It is, in general, not surjective.
5. Example. Let X = Y = where 1 < p < oo. Let S be the right shift operator;
i.e., if x = (x1, X2,. . .), then Sx = (0, xl, x2,. . .). Let T = S*. This is an operator on
14. Adjoint operators 113
4. What is it? Let f E Qq and let g = S* f. The definition (14.1) says g(x) = f(Sx)
for all x in gyp, i.e.,
Thus T is the left shift operator on Qq. It maps (fl, f2...) to (f2, f,. . .).
6. Let 7-1 be a Hilbert space. Recall that 7-1 is isomorphic to If via a conjugate linear
map R that associates to y e 7-1 the linear functional f, defined as f(x) = (x, y) for
all x E 7-i. (See Section 21, Lecture 11.) So, for every A E 8(7-1) its adjoint A* can
be identified with an operator on 7-1. Call this operator At for the time being. We
have At = R_ 1 A* R (as shown in the diagram).
A*
?- * f*
At
7. If 7-1, K are Hilbert spaces and A is a linear operator from 7-1 to K, then A* is a
linear operator from )C to 7-1 defined by (14.4) with x E 7-1, y e 1'C.
(iii) it is surjective.
(vii) I* = I.
Thus the map A H A* has properties very similar to the complex conjugation
z -- z on C. A new feature is the relation (v) arising out of non-commutativity of
operator multiplication.
14. Adjoint Operators 115
Proof. The submultiplicativity of the norm, and the property (14.3) show
IIA*AII IlAII I= I
s I
as well.
10. The property (14.5) is very important. A Banach algebra (see Lecture 3) with
an involution (a star operation A H A*) whose norm satisfies (14.5) is called a
C*-algebra. Study of such algebras is an important area in functional analysis.
Continuity Properties
11. Since IIA* II _ hthe map A --> A* from 13(X) to 13(X*) is continuous in the
usual (norm) topology.
Let T be the left shift operator on 2. Then for every vector x, 1moo I= 0.
So the sequence {T} converges strongly to the zero operator. On the other hand
(Tfl)* = Si', where S is the right shift. We know that {S'2} does not converge
116 Notes on Functional Analysis
strongly. (Section 5, Lecture 13). So, the map A A* is not strongly continuous
on 2.
From the equation (14.4) it is clear that the map A * A* is continuous in the
weak operator topology of B(1-.. This is true, more generally, when 1-1 is replaced
by a reflexive Banach space.
Examples
12. Matrices. Let 1-1 be an n-dimensional Hilbert space and choose an orthonormal
basis for 7-1. Every operator A on 11 has a matrix representation A = [ajj] with
respect to this basis. Show that A* is the operator corresponding to the matrix [a2]
in this basis. This is the usual conjugate transpose of A.
13. Integral Operators. Let K be a square integrable kernel on [0,11 x [0,11 and
let AK be the integral operator induced by it on L2[0,1], i.e.
Let K*(x, y) = K(y, x). Show that the adjoint operator (AK)* is the integral operator
induced by the kernel K*. (Use Fubini's Theorem.)
(Af)(x) =
(A*f)(x) = f f(t)dt.
14. Composition Operators. Let p be a continuous map of [o, l] into itself. This
14. Adjoint Operators 117
Exercises
(i) Let A* be invertible. Then A* is an open map. So the image of the unit ball
{g: II9II < 1} in Y* under this map contains some ball {f: Ill II c} in X.
- suP{I9(Ax)l }T',
lAx II 9E = 1}
= sup{(A*g)(x)f : g E llll = 1}
> sup{If(x)I f E X",IIfII <_ c}
118 Notes on Functional Analysis
= dlxii.
This says that A is bounded below and implies that A is one-to-one and its
range ran A is closed.
(iii) It is easy to see that for any A E 13(X, Y) we have (ran A)1 = ker A*. So if
ker A* _ {0}, then ran A is dense.
The additional structure in a Hilbert space and its self-duality male the adjoint
operation especially interesting. All Hilbert spaces that we consider are over complex
scalars except when we say otherwise.
1. Let 7-1 be a Hilbert space. If (x, y) = o for all y 7-1, then x o. Thus an operator
A on N is the zero operator if and only if {Ax, y} = o for all x, y E 7-1.
Exercise. Let 7-( be a complex Hilbert space and let A E 13(N). Show that A = 0
if (Ax, x) = 0 for all x. (Use polarization.) Find an operator A on II82 for which
(Ax, x) = 0 for all x and If All = 1.
Self-adjoint operators
So, (Ax, x) is real. Conversely if N is a complex Hilbert space and (Ax, x) is real for
all x, then A is self adjoint.
120 Notes on Functional Analysis
sup I
=I
iiyu=1
and hence,
sup I(Ax,y)f = sup IlAxil = I(15.1)
I= sup I(15.2)
1111=1
There are two equations here. Subtract the second of them from the first to get
x by eiex does not change the right hand side. Choose 8 such that
eie (Ax, y) > 0. The inequality above then becomes
Now take suprema over fix ii = hiM = 1 and use (15.1) to get from this f< M,
and hence h= 111
6. Exercise. Find an operator on the space C2 for which the equality (15.2) is not
true.
15. Some Special Operators in Hilbert Space 121
7. If Al and A2 are self-adjoint, then so is aAl + QA2 for any real numbers a, /3.
Thus the collection of all self-adjoint operators on ?-l is a real vector space.
8. If A1, A2 are self-adjoint, then their product A1A2 is self-adjoint if and only if
A1A2 = A2A1.
Positive Operators
9. Let A be aself-adjoint operator. If for all x, (Ax, x) > 0, we say that A is positive
semidefinite. If (Ax, x) > 0 for all nonzero vectors x we say A is positive definite.
For brevity we will call positive semidefinite operators just positive operators; if we
need to emphasize that A is positive definite we will say A is strictly positive.
If A is any operator on a complex Hilbert space, then the condition (Ax, x) > 0
for all x implies that A is self-adjoint. The operator A on R2 defined by the matrix
10. We write A > 0 to mean A is positive. If A > 0 then aA > 0 for all positive
real numbers a. If A, B are self-adjoint, we say A > B if A - B > 0. This defines
a partial order on the collection of self-adjoint operators. If Al > B1 and A2 > B2,
then Al + A2 > B1 + B2.
11. Let A be any operator. Then A*A and AA* are positive.
21
12. Let A, B be operators on represented by matrices A =
1 1
1 1
B= . Then A > B. Is it true that A2 > BZ?
11
122 Notes on Functional Analysis
Normal Operators
13. Ari operator A is said to be normal if A*A = AA*. Self-adjoint operators are a
very special class of normal operators.
The last statement is true for all x if and only if A*A = AA*.
100
The operator A on C3 defined by the matrix A = 001 is not normal but
000
the equality (15.4) is still true for this A.
(15.5)
2 2i
A = B + iC. (15.6)
This is some times called the Cartesian decomposition of A. in analogy with the
decomposition z = x + iy of a complex number. B and C are called the real and
imaginary parts of A.
Unitary operators
U'`U=UU*=I. (15.7)
(i) U is unitary.
(iv) If {e} is an orthonormal basis for 7-1, then {Uen} is also an orthonormal basis.
18. Exercise. Show that the condition (15.8) is equivalent to the condition
19. The properties listed in (iii) in Exercises 17, say that U preserves all the struc-
tures that go into defining a Hilbert space : U is linear, bijective, and preserves
inner products. Thus we can say U is an automorphism of x. If x, 1C are two
Hilbert spaces and if there exists a bijective linear map U from 7-1 to IC that satisfies
(15.8) we say 7-1 and 1C are isomorphic Hilbert spaces.
20. An isometry (on any metric space) is always one-to-one. A linear operator on a
finite-dimensional vector space is one-to-one if and only if it is onto. This is not the
case if the vector space is infinite-dimensional. For example, the right shift operator
S on e2 is one-to-one but not onto while the left shift T is onto but not one-to-one.
A*A = I. (15.10)
If
AA* = I (15.11)
22. Recall our discussion of projections in Lecture 11, Sections 18, 19. A linear
map P on 71 is called a projection if it is idempotent (P2 = P). If S = ran P and
S' = ker P, then 7-1 = S + S', and P is the projection on S along s'. The operator
I - P is also a projection, its range is S' and kernel S. For example, the operator
2
P on Ccorresponding to the matrix P =
11 is idempotent. Its range is the
00
space S = {(x, 0) : x E C}, and its kernel S' _ {(x, -x) : x e C}. A special property
characterises orthogonal projections: those for which S' = S.
(zi, Pz2)
This shows P* = P.
23. When we talk of Hilbert spaces we usually mean an orthogonal projection when
we say a projection. To each closed linear subspace S in 7-1 there corresponds a unique
(orthogonal) projection P and vice versa. There is an intimate connection between
(geometric) properties of subspaces and the (algebraic) properties of projections
corresponding to them.
basis vector e1 in 2. Then .M is invariant under the right shift operator S but not
under its adjoint S*. So .M does not reduce A.
27. Theorem. Let P be the orthogonal projection onto the subspace M of 7.1. Then
.M is invariant under an operator A, if and only if AP = PAP; and .M reduces A if
and only if AP = PA.
We have used the property P* = P at the second step here, and P2 = P at the
third.
Exercises
30. Let Pi P2 he projections. Show that the following conditions are equivalent
a
31. If Pl and P2, are projections, then Pl - P2 is a projection if and only if Pl < Pi.
In this case ran (P1 - PZ) = ran Pi fl (ran P2)1.
32. Show that the Laplace transform operator defined in Section 19 of Lecture 3
is a self-adjoint operator on L2( +
33. The Hilbert-Hankel operator H is the integral kernel operator on L2 (0, oo) defined
as
Hf(x)_ JO f y) dy
o x+y
Show that H = 2, where G is the Laplace transform operator. This shows that
Lecture 16
A large, and the most important, part of operator theory is the study of the spectrum
of an operator. In finite dimensions, this is the set of eigenvalues of A. In infinite
dimensions there are complications that arise from the fact that an operator could
fail to be invertible in different ways. Finding the spectrum is not an easy problem
even in the finite-dimensional case; it is much more difficult in infinite dimensions.
1. Let x(t) be a map from an interval [a, b] of the real line into a Banach space X.
It is obvious how to define continuity of this map. If IIx(t) - x(to)I1 -3 0 as t -j to,
we say x(t) is contiguous at to.
If x(t) is continuous at to, then clearly for each f E X*, the (complex-valued)
function f(x(t)) is continuous at tp. We say that x(t) is weakly continuous at tp if
f(x(t)) is continuous at to for all f E X. (If emphasis is needed we call a continuous
map strongly continuous.)
Strong and weak differentiability can be defined in the same way. If to is a point
in (a, b) we consider the limits
o
lim
x(t0 + h) -x(to)
h
130 Notes on Functional Analysis
and
hl o f(x(to + h) - f(x(to)) fEX.
If the first limit exists, we say x(t) is (strongly) differentiable at to. If the second
limit exists for every f E X*, we say x(t) is weakly differentiable at to. Clearly strong
differentiability implies weak differentiability. The converse is not always true when
X is infinite-dimensional.
two (1/t) = 0.
So from (16.1) we see that .1(t) is weakly differentiable at t = 0, and the weak
derivative is the zero function. If the map 1(t) had a strong derivative at 0, it would
have to be equal to the weak derivative. But for all t 0,
3. Let G be any open connected set of the complex plane and let x(z) be a map
from G into X. If for every point z in G the limit
x(z + h) - x(z)
lim
h-.o h
1 6. The Resolvent and The Spectrum 131
exists we say x(z) is strongly analytic on C. If for every z E C and f E X*, the limit
4. Theorem. Let x(z) be a weakly analytic map from a complex region G into a
Banach space X. Then x(z) is strongly analytic.
Proof. Let f be any element of X*. Then (f ox)(z) = f(x(z)) is an analytic function
on G. Let (f o x)'(z) be its derivative. Let z be any point in G and I' a closed curve
in G with winding number 1 around zo and winding number 0 around any point
outside G. By Cauchy's integral formula
f(x(zo)) =
f(x()
2iri J
h 1f f(x))
(16.2)
2xi Jr ((- zo - h)((- zo)z
Since I' is a compact set and f(x(.)) a continuous functions, the supremum
Sup f(x(())I = cr
SEI'
132 Notes on Functional Analysis
sup
Ilfll<i SEr
sup I= C
is finite. (Think of x(() as linear functionals on X*.) Hence the quantity in (16.2) is
bounded by
Clhi f d(I
27r Jfor
all f with IIfI 1. As h -> 0 this goes to 0, and the convergence is uniform for
f< 1. Hence the limit
x(zo + h) - x(zo)
nl,' o h
exists in X (see (4.2)). Thus x(z) is strongly analytic at zo.
Exercise. The space ,Ci(X) has three topologies that are of interest: norm topology,
strong operator topology, and weak operator topology. Define analyticity of a map
z --> A(z) from a complex G into ,Ci(X) with respect to these topologies. Show that
the three notions of analyticity are equivalent.
Resolvents
6. Let A E 13(X) and let A be any complex number. It is customary to write the
operator A - Al as A - A.
The resolvent set of A is the collection of all complex numbers A for which A - A
is invertible. Note that if (A - A)-1 exists, it is a bounded operator. (The Inverse
Mapping Theorem, Lecture 6.) We write p(A) for the resolvent set of A. The operator
A)-i,
RA(A) _ (A - A E p(A)
If Al > hthen IA/All < 1. Hence I - A/A is invertible. (See Chapter 13,
Theorem 14.) Hence the operator A - a = A(A/A - 1) is also invertible. We have
1 An
(A - A)-1 = ( 1
for Al I> I (16.3)
l
Thus p(A) is a nonempty set.
Proof. A simple algebraic manipulation using the definition of the resolvent shows
that
8. Corollary. The family {RA(A) : A E p(A)} is a commuting family; i.e., any two
elements of this family commute with each other.
9. Theorem. For each A E 13(X) the set p(A) is an open subset of C, and the map
A --> RB(A) is an analytic map from p(A) into ,Ci(X).
Proof. The argument that was used to show that the set of invertible operators is
open in 13(X) can be modified to show p(A) is an open set. Let Ao E p(A). We want
to show that A E p(A) if A is close to Ao. We have the identity
A-A = (A_Ao)[I_(A_Ao)(A_Ao)_1]
134 Notes on Functional Analysis
_ (A-A0) [I-(A-AO)RA0(A)].
i.e., A - aoI < (A)II Thus if A satisfies this inequality, then it belongs to
p(A). Hence p(A) is open. Further, this shows
lim IIRA(A)II = 0.
laHoe
So, by Liouville's Theorem p(A) can not be the entire complex plane. (A bounded
entire function is a constant.)
The Spectrum
11. The complement of the resolvent set in the complex plane is called the spectrum
of A, and is denoted by v(A).
12. If X is afinite-dimensional space, then Q(A) is a finite set. Its elements are the
eigenvalues of A. Every operator on an n-dimensional space has at least one and at
most n eigenvalues.
16. The Resolvent and The Spectrum 135
13. Let S be the right shift operator on gyp, 1 < p < oo. For any complex number A
the equation Sx = Ax, i.e.,
(O,Xl,X2) =
can never be satisfied by any nonzero vector x. So, S does not have any eigenvalue.
At the same time we do know v(S) is not an empty set. So, a point can be in the
spectrum of an operator A without being an eigenvalue. This is because A - A can
be injective without being invertible.
Spectral Radius
This is the radius of the smallest disk centered at the origin that contains the spec-
trum of A. We know that
spr (A) < 11AM. (16.5)
The spectral radius of a nilpotent matrix is 0; so the two sides of (16.5) need not be
equal.
16. Consider the series (16.3) - a power series in 1/A. This series converges when
Al >
and then defines (A - A)-1. It does not converge for at least one point A with
IAI = limllAnlll/n. Hence
Much more interesting is the fact that lim here is actually the limit of the (convergent)
sequence II A Ill/n.
17. The Spectral Radius Formula. For every A E 13(X), the sequence l n
converges, and
lim spr(A). (16.8)
So, if A'2 - An were invertible, then A - A would have a left inverse and a right inverse,
and would therefore be invertible. By contraposition if A - A is not invertible, then
nor is An - A. In other words, if A E Q(A), then A'2 E Q(An). Hence lA'2l < llA7 II;
i.e., Al < Ifor all n. This shows that spr (A) < limIlA'2IV/2 But we have
already obtained the equality (16.7).
This may lead one to believe that the sequence IIA'2ll l/n is monotonically decreasing.
This is, however, not always true. Consider the operator A on the Hilbert space C2
19. Exercise. If A is a normal operator on a Hilbert space. Then spr (A) = lAM.
(Use Lemma 15 of Lecture 15. In afinite-dimensional space prove this using the
spectral theorem for normal operators.) Find an operator A that is not normal but
has spr (A) = I
Since the operator p(A)-A is not invertible, one of the factors A-A3 is not invertible.
Thus A3 E Q(A) and also p(A3) - A = 0. This shows A = p(A3) for some A3 E Q(A).
Hence Q(p(A)) C p(a'(A)).
a(A) =
Ra(A*) for all a E p(A).
o(A*) = a(A).
Rx(A*) for all a E p(A).
Let S be the right shift operator on the space el. Since 1SIJ = 1 the spectrum Q(S)
is contained in the closed unit disk D. We have seen that S has no eigenvalue. The
adjoint of S is the left shift operator T on the space L. If A is any complex number
with A < 1, then the vector xa = (l, A, A2,...) is in L and 7'xa = Ax,. Thus every
point A in the disk D is an eigenvalue of T. This shows also that u(S) = Q(T) = D.
We have seen an example where QP(A) = and another where up(A) = a(A).
Proof. If A is invertible, then it is bounded below, and its range is all of X, not
just dense in X.
5. This simple theorem leads to a useful division of the spectrum into two parts (not
always- disjoint).
is called the approximate point spectrum of A. Its members are called approximate
eigenvalues of A.
The set
acomp(A) :_ {A :ran (A - A) is not dense in X}
ores(A) := vcomp(A)\op(A),
called the residual spectrum of A, is the set of those points in the compression
spectrum that are not eigenvalues. The set
7. We have observed that for every operator A on a Banach space Q(A) = Q(A*).
This equality does not persist for parts of the spectrum.
Proof. Let M be the closure of the space ran (A - A). If A E Qcomp(A), then M is
a proper subspace of X. Hence there exists a nonzero linear functional f on X that
vanishes on M. Write this in the notation (14.2) as
This says that g(x) = 0 for all g e ran (A* - a). If the closure of ran (A* - a) were
the entire space X*, this would mean g(x) = 0 for all g E X*. But the Hahn-Banach
Theorem guarantees the existence of at least one linear functional y that does not
vanish at x. So ran (A* - A) can not be dense. This proves (ii).
o.p(A*) = crcomp(A)
- oaPP(A*) U oaPP(A).
Here the bar denotes complex conjugation operation, (Recall that we identified ?-(
with 7(* and A** with A; in this process linearity was replaced by conjugate linearity.)
The set a(A) consists of eigenvalues-------objects familiar to us; the set Qapp (A) is
a little more complicated but still simpler than the remaining part of the spectrum.
The relations given in Theorem 7 and Exercise 8 are often helpful in studying the
more complicated parts of the spectrum of A in terms of the simpler parts of the
spectrum of A*.
9. Exercise. Let A be any operator on a Banach space. Then rrapp (A) is a closed
set.
Hence under the given conditions Ran (A) is a Cauchy sequence. Let R be the limit
of this sequence. Then
Proof. If A is on the boundary of Q(A), then there exists a sequence {fin} in p(A)
converging to A. So, by Proposition 10, {(A - I I} is an unbounded sequence.
So, it contains a subsequence, again denoted by {A }, such that for every n, there
exists a unit vector xfor which II(A -fin)-1x> n. Let
(A-A)'xn
""
I
Let T be the left shift on el. Then T* = S the right shift on Pte. Since II TII = 1>
we know that Q(T) is contained in the closed unit disk D. From Exercise 16.22 we
know that Q(S) = v(T). Fill in the details in the statements that follow.
(i) If IAI < 1, then XA :_ (1, A,A2, ... ,) is in el and is an eigenvector of T for
eigenvalue A. Thus the interior D is contained in v(T).
(iii) If aJ = 1, then there does not exist any vector x in L1 for which Tx, = ax.
Thus no point on the boundary of D is in ap(T).
(iv) The point spectrum QP(S) is empty. Hence the compression spectrum ocomp(7')
is empty. (Theorem 7.)
(vii) Let jAj = 1. Then u = (A, A2,...) is in L. Let y be any element of P and let
x = (S - A)y. From the relation
j=1
1/2, then
Hence Imn > n/2. But that cannot be true if y E L. So we must have jx -
1/2 for every x e ran (S - a). Hence A E ucomp(S)
13, Exercise. Find the various parts of the spectra of the right and left shift
operators on 1Cp ao.
17. Subdivision of the Spectrum 145
14. Exercise. Let P be a projection operator in any Banach space. What is the
spectrum of P, and what are the various parts of Q(P)?
Hence I - BA is invertible.
(ii) Show that the sets a(AB) and Q(BA) have the same elements with one possible
exception: the point zero.
In Lecture 15 we studied normal operators in Hilbert spaces. For this class the
spectrum is somewhat simpler.
Proof. Let A be any complex number and write A = + iv, where .t and v are real,
If A is self-adjoint, then for every vector x
11(A-A)x112 = ((A-A)x,(A-A)x)
= ((A-A)(A-A)x,x)
18. Spectra of Normal Operators 147
_ II(A - v211x112
>
Exercise. Find a simpler proof for the more special statement that every eigenvalue
of a self-adjoint operator is real.
Diagonal Operators
3. Let f be a separable Hilbert space and let {e} be an orthonormal basis. Let
a = (a1, a2,...) be a bounded sequence of complex numbers. Let Aaen = anen
This gives a linear operator on 7( if we do the obvious: let Aa ( nen) _ > nanen
It is easy to see that Aa is bounded and
a1
Aa = a2
Aa + A,3 = Aa+/
Aa A,Q = Aa ,
Aa = Aa .
148 Notes on Functional Analysis
The sequence 1 = (1, 1,...) is the identity element for the algebra 4,. An element
a is invertible in Q if there exists /3 in P, such that aQ = 1. This happens if and
only if {ate,} is bounded away from zero; i.e., inf IaI > 0. The diagonal operator Aa
is invertible (with inverse Ap) if and only if a is invertible (with inverse ,3).
Proof. It is obvious that each an is an eigenvalue of Aa, and easy to see that there
are no other eigenvalues. Let A be any complex number different from all a. The
operator Aa - A is not invertible if and only if the sequence {an - A} is not bounded
away from zero. This is equivalent to saying that a subsequence an converges to A;
i.e., A is a limit point of the set {a}.
Multiplication operators
6. Let (X, S, ) be a Q-finite measure space. For each cp e L() let M(,c, be the
linear operator on the Hilbert space N =L2() defined as M f = cp f for all f E N.
We have then
II(2It = IIPIIoc,
M + M1, =
M M .
18. Spectra of Normal Operators 149
7. The function 1 that is equal to 1 almost everywhere is an identity for the algebra
L. An element cp of L is invertible if and only if there exists E L such that
cps = 1 a.e. This happens if and only if cp is bounded away from zero; i.e., there
exists b > 0 such that ko(x)I > b a.e. The multiplication operator Mw is invertible
(with inverse Mv,) if and only if cp is invertible (with inverse b).
8. Let cp be a complex measurable function on (X, S, ,.t) The thick range of Sp,
.
for every b > 0. This is the same as saying A E ess ran cp. This proves the first
assertion. Let A E Then there exists a nonzero function f such that
(p(x) - A) f(x) = 0. So (x) _ A for all x where 1(x) 0. Such x constitute a
set of positive measure. So A E trap cp. Conversely, if A E tran cp, then the set
E _ {x: cp(x) _ A} has norizero (possibly infinite) measure. Choose a subset F of
E that has a finite positive measure. Then the characteristic function XF is in L2()
and is an eigenvector of M, for the eigenvalue A. Thus a E Q(M).
10. One of the highlights of Functional Analysis is the Spectral Theorem. This
says that every normal operator A on any Hilbert space is unitarily equivalent
to a multiplication operator; i.e., there exists a measure space (X, S, ,a) a unitary
operator U : ?-iC - L2 (u) and a function cp E L such that A = U* If A is
Hermitian the function cP is real, and if A is positive P is positive.
11. Let Q2(7G) be the space of all doubly infinite sequences {x}_ such that
II oo. The standard basis for this space is the collection {e}_ of
n=-oo
vectors that have all entries zero except an entry 1 in the nth place. The right shift
or the forward shift on this space is the operator Sdefined as Sen = en+l for all n.
Its inverse is the left shift or the backward shift Tdefined as Ten = for all Ti.
The operators S and T are unitary. To distinguish them from the shift operators on
l2 = 12(N) these are called two sided shifts.
If a is bounded away from zero, then T is invertible, and its inverse is the operator
S acting as
Sen = 1 e+1
n for all n.
an+1
This is a weighted forward shift.
13. Exercise. Let T be a weighted backward shift with weight sequence a. Show
that
14. Let T be the weighted backward shift on 12(Z) with weight sequence a in which
a_1 = 0 and an = 1 for all n -1. By Exercise 13 (ii) the spectral radius of T
is 1. For each A with IA) < 1, the vector xa = is an eigenvector of T with
n=o
eigenvalue A. So Q(T) = D, the closed unit disk.
152 Notes on Functional Analysis
Consider another weighted backward shift T' with weights a' in which a 1 = 1
and an = 0 for all n -1. For every real number s, let TE = T + eT'. This
is a weighted backward shift with weight sequence a(e) in which a(s) = e and
an(e) = 1 for all n -1. Thus spr(TE) = 1. If s 0, then Te is invertible, and
by Exercise 13(ii) the spectral radius of TE 1 also is 1. This means that Q(TE) is
contained in the boundary of the disk D.
This example shows something striking. The spectrum of T = To is the unit disk
D. Adding a small operator eT' to T makes the spectrum shrink to the boundary of
D. (The operator eT' has rank 1 and norm . )
15. Thus the map A cr(A) that associates to an operator A its spectrum is a
discontinuous map. Let us make this statement more precise.
Exercise Let (X, d) be any metric space and let E, F be any compact subsets of X.
Let
s(E, F) sup dist (x, F) = sup inf d(x, y),
xEE xEE yEF
and
Show that h(E, F) is a metric on the collection of all compact subsets of X. This
is called the Hausdor, f, distance between E and F. It is the smallest number b such
that any point of E is within distance b of some point of F, and vice versa.
The space 13(7-() is a metric space with its usual norm and the collection of
compact subsets of C is a metric space with the Hausdorff distance. The example in
Section 14 shows that the map A H v(A) between these two spaces is discontinuous
(when 1-( is infinite-dimensional).
16. If a map is not continuous, one looks for some weaker regular behaviour it may
18. Spectra of Normal Operators 153
display. It turns out that the spectrum can shrink drastically with a small change
in the operator (as our example above shows) but it can not expand in this wild a
manner. The appropriate way to describe this is to say that the map A H Q(A)
is upper semicontinuous. By definition, this means that for every open set G that
contains o(A) there exists an e > 0 such that
Proof. Let e = If A - Bff. It suffices to show s(v(A), Q(B)) < e and then invoke
symmetry. For this we have to show that for each A E Q(A) there is a E v(B) such
154 Notes on Functional Analysis
18. When the space 7-( is finite-dimensional the spectrum is continuous on all of
13(7-1). More is true in this case. Let A --> Eig A be the map that assigns to A the
(unordered) n-tuple {A1,... , a,t} whose elements are eigenvalues of A counted with
multiplicities. Then this map is continuous. (The set v(A) gives no information
about multiplicities of the eigenvalues.)
one of the most important and useful theorems of linear algebra is the spectral
theorem. This says that every normal operator on an n-dimensional Hilbert space
7-1 can be diagonalised by a unitary conjugation: there exists a unitary operator U
such that U*AU = A, where A is the diagonal matrix with the eigenvalues of A
on its diagonal. Among other things, this allows us to define functions of a normal
matrix A in a natural way. Let f be any functions on C. If A = diag ( \ 1 , ... ,n }
is a diagonal matrix with A as its diagonal entries, define 1(A) to be the diagonal
matrix diag (f(A,),... , f()), and if A .= UAU*, put f(A) -= U f (11)U*.
If A is a positive operator, then all its eigenvalues are positive. Each of them has
a unique positive square root. Thus A has a unique positive square root, written as
A"2.
The spectral theorem for infinite-dimensional Hilbert spaces will be proved later
irz this course. It says that a normal operator A is unitarily equivalent to a multipli-
cation operator M. If A is positive, then we define A"2 as the operator equivalent
to Z-
156 Notes on Functional Analysis
However, the existence of the square root A1/2 can be proved by more elementary
arguments. Though less transparent, they are useful in other contexts.
Al
Proof. We prove first that An is weakly convergent. For each vector x, the sequence
(Ax, x) is an increasing sequence of real numbers bounded from above by a (x, x, ) .
So the limit f(x) = lim (Anx, x) exists. Being a limit of quadratic forms, this
n- o0
is again a quadratic form; i.e., there exists a sesquilinear form B(x, y) on 7-l such
that 1(x) = B(x, x). (See Exercise 31, Lecture 11). Clearly B is bounded. So, by
the result proved in Section 23 of Lecture 11, there exists aself-adjoint operator
A such that 1(x) _ (Ax, x). This operator A is the weak limit of A. We will
show that, in fact,. An converges strongly to A. There is some simplification, and
no loss of generality, if we assume Al > 0. (Add (to all the A,t.) Then for
n > m we have 0 < Ate, - A,,,, < al. This shows that h
- A,,,,II <a. (Recall that
I
n - Am hi = sup ((An - A,,,)x, x).) Using the Schwarz inequality (19.1) we get for
1111=1
every x
((AnAm)x,(AnAm)x)2
((An - A,n)x, x) ((An - ATre)2x, (An - Am)x
C ( ('4n - Am)x,x )a3hhxhh2.
19. Square Roots and the Polar Decomposition 157
Since An is weakly convergent, the inner product in the last line goes to zero as
n, m - oc. So, the left hand side of this inequality goes to zero. This shows that
for every vector x, fn - Am) x I I goes to zero as n, rn -f oo. Hence An is strongly
convergent; and its strong limit is A.
We remark here that the proof above can be simplified considerably if we assume
that every positive operator has a positive square root: The weak limit A is bigger
than all A, so A - An is positive and hence equal to Pn for some positive P,. For
every x
IIPnxII2 = (Px, x) _ ((A - x)
Proof. We may assume that A < I. (Divide A by IAII.) Consider the sequence X,
defined inductively as
Xn+1 =
I - A+X2
Xp = 0,
2
=s-limI_A+Xn I-A+X2
X =
2 2
where s-lim stands for strong limit. The last equality shows that
A=I-2X+X2=(I-X)2.
158 Notes on Functional Analysis
4. Exercise. For any linear operator A on 7-t let ran A and ker A stand for the range
and the kernel of A. Show that
written as A = P' U' where P' = IA*I, and U' is unitary. Note that IA*l = IAI if and
only if A is normal.
8. Exercise. Use the polar decomposition to prove the singular value decomposition:
every linear operator A on an n-dimensional space can be written as A = USV,
where U and V are unitary and S is diagonal with nonnegative diagonal entries
si ...>sn.
9. Let S be the right shift on the space l2. Then S*S = I, and hence IS, = I. Since
S is not unitary we can not have S = for any unitary operator U. Thus the
polar decomposition theorem for infinite-dimensional spaces has to be different from
Theorem 5. The difference is small.
Partial isometries
10. An operator W on 7-1 is called partial isometry if llWxll _ lix II for every x E
(kerW)'.
The space (ker W)' is called the initial space of W, and ran W is called its final
space. Both these spaces are closed. The map W : (ker W)' - ran W is an isometry
of one Hilbert space onto another.
ran W and its final space is (ker W)'. The operators PZ = W *W and P1 = WW
are the projection operators on the initial and the final spaces of W, respectively.
11. Exercise. Let W be any linear operator on 7-1. Then the following conditions
are equivalent
(iii) WW is a projection.
(v) WW*W=W
(vi) W *W W * = W *.
12. Theorem. Let A be any operator on 7-1. Then there exists a partial isometry W
such that A = W fAf. The initial space of W is (ker A)1 and its final space is ran A.
This decomposition is unique in the following sense: if A = UP, where P is positive
and U is a partial isometry with ker U = ker P, then P = A and U = W.
Proof. Define W ran IAI -> ran A by putting W IAIx = Ax for all x E ?-l. It is
easy to see that W is an isometry. The space ran A is dense in (ker A)1 (Exercise !)
and hence, W extends to an isometry W (ker A)1 -f ran A. Put Wx = 0 for all
x E ker A. This gives a partial isometry on 'H, and A = W AI. To prove uniqueness
note that A*A = PU*UP =PEP, where E is the projection onto the initial space
of E. This space is (ker U)1 = (ker P)1 =ran A. So A*A = P2, and hence P = JAJ,
162 Notes on Functional Analysis
(i) W*A=IAI.
Compact Operators
This is a special class of operators and for several reasons it is good to study them in
some detail at this stage. Their spectral theory is much simpler than that of general
bounded operators, and it is just a little bit more complicated than that of finite-
dimensional operators. Many problems in mathematical physics lead to integral
equations, and the associated integral operators are compact. For this reason these
operators were among the first to be studied, and in fact, this was the forerunner to
the general theory.
A set E is precompact if and oniy if for every > 0, E can be covered by a finite
number of balls of radius E.
The sequence criterion for compactness of metric spaces tells us that A is compact
if and only if for each bounded sequence {x} the sequence {Axe,} has a convergent
subsequence.
164 Notes on Functional Analysis
sequence in X with llflI G 1 for all n, the sequence {Af} has a convergent subse-
quence. For this we use Ascoli's Theorem. Since llAf II
IlAll, the family {Af}
is bounded. We will show that it is equicontinuous. Since K is uniformly contin-
uous, for each e > 0 there exists S > 0 such that whenever xl - I < S we have
I-
Iy) - K(x2i y)l <e for all y. This shows that whenever lxi -x21 <6 we have
f 0
i
lim
xn-*X
K
Corollary. If A E ,Ci(X, Y) and there exists a sequence Ate, E 13oo (X, Y) such that
llA - All -+0, then A E 130 (X,Y) .
6. Exercise. Show that a strong limit of finite-rank operators is not always compact.
We have seen that the space 130(X) is a vector space of 13(X). Theorem 8 says
166 Notes on Functional Analysis
that ,i3o(X) is a two-sided ideal in the algebra 13(X). By Theorem 5 this ideal is
closed.
xn-ix = Axn-+Ax.
S S
(20.1)
x- wx x = Axn -Ax
w
. (20.2)
Proof. Let x- x. x. Then the sequence {jlxnII} is bounded. (Lecture 9, Section 8.) If
Axdoes not converge strongly to Ax, then there exists an e > 0 and a subsequence
{xm} such that IIAxm - AxII > e for all m. Since {xm} is bounded and A compact,
{Ax,,,,} has a convergent subsequence. Suppose y is the limit of this sequences.
Then y is also its weak limit. But by (20.2) we must have y = Ax. This leads to a
contradiction.
20. Compact Operators 167
Exercise. Let A be a compact operator on 7-1 and let {en} be an orthonormal basis.
Then the sequence {Aen} converges to 0.
Proof. Let {xn} be any sequence in 7-1 with IIxdI < 1. If we show {xn} has a weakly
convergent subsequence {Xm},the complete continuity of A would imply that Axm
is (strongly) convergent and hence A is compact. In a compact metric space every
sequence has a convergent subsequence. So, if the unit ball {x: lxii < 1} in 7-1 with
the weak topology were a compact metric space, then {x} would surely have a
convergent subsequence. In Section 13 of Lecture 12 we constructed exactly such a
metric.
12. It can be shown, more generally, that if X is a reflexive Banach space then every
completely continuous operator on it is compact. In some books the terms "compact
operator" and "completely continuous operator" are used interchangeably.
Warning. The condition (20.3) is phrased in terms of sequences. These are enough
to capture everything about the strong topology but not about the weak topology.
If X is given its weak topology and Y its strong topology, then a map A : X - Y is
continuous if for every net Xa converging weakly to x, the net Axa converges strongly
to Ax. It can be shown that the only such linear operators are finite-rank operators.
Proof. Let A E Lao (X, Y). Let {g} be a sequence in Y* with 1. We have to
show that the sequence {A*gn} in X* has a convergent subsequence. Let S be the
unit ball in X. Then A(S) the closure of A(S) is a compact metric space. Regard
168 Notes on Functional Analysis
sup I= sup
yEA(S)
f
C sup
yEA(S)
IgfI I2Jll C
yEA(S)
Thus the family {g} is uniformly bounded in C(A(s)). Next note that for all
Y1, Y2 E Y
14. For Hilbert space operators Theorem 13 can be proved easily using the polar
decomposition.
When 7-1 is a Hilbert space ,Cio (7-l) is a closed, two-sided, *-closed ideal in 8(7-1).
It can be proved (using the spectral theorem) that this is the only ideal in 13 (7-f)
with this property.
15. Theorem. Let 7-1 be a separable Hilbert space. Then 1300 (7-l) is dense in 130 (N).
In other words, every compact operator on ?-( is a norm limit of finite rank operators.
Proof. Let {e} be an orthonormal basis for ?-C. Let Nn be the subspace spanned
by the vectors e1,... , e,. Let P, be the orthogonal projection onto N. Then APn
is a finite-rank operator and
I IAx II > an/2. Since the sequence hR increases to 7-C, the sequence xR converges
weakly to 0. So, if A is compact AxR - 0. Hence aR -* 0. Thus A is the norm limit
of the sequence APR.
16. Is the assertion of Theorem 15 valid for all separable Banach spaces? This
question turns out to be difficult. In 1973, P. Enflo answered it in the negative.
There exists a separable Banach space on which some compact operator is not a
norm limit of finite rank operators. Our proof of Theorem 15 suggests that if X has
a Schauder basis, then t3oo (X) is dense in B0 (X). This is indeed the case. So the
space X in Enflo's example does not have a Schauder basis.
Lecture 2 1
Most of the spectral properties of a compact operator in a Banach space were dis-
covered by F. Riesz, and appeared in a paper in 1918 (several years before Banach's
book). These results were augmented and simplified by the work of Schauder. What
follows is an exposition of these ideas.
Proof. For each n., let S,L = {x E X : <n}. Then A(SS) is precompact. Every
compact metric space is separable. So A (Sn) is separable. Hence so is the countable
union
U A (S) =ran A.
n=1
The Open Mapping Theorem tells us that if ran A is closed, then A is an open map.
So A (Sn) is an open precompact set in ran A. Every point in ran A belongs to some
21. The Spectrum of a Compact Operator 171
4. Corollary. Let A E 13(X) and let A be a nonzero complex number. Then the
space ker (A - A) is finite-dimensional.
Proof. For each linear operator A and complex number A, the space N = ker(A - A)
is closed. It is easy to see that if A 0, then A maps N onto itself. So if A is compact,
then by Theorem 2, N is finite-dimensional.
6. Proposition. Let A E ,Cio(X). Then the point spectrum a(A) is countable and
has only one possible limit point 0.
is finite. If this is not the case, then there exists an E, an infinite set {A} with
An l > and vectors xn such that If x,1 f = 1 and Axn = anxn . The vectors x,2, being
eigenvectors corresponding to distinct eigenvalues of A, are linearly independent.
So for each n, the space Mn spanned by {x1,... , x} is an n-dimensional space.
By Riesz's Lemma, for each n > 1, there exists yn e Mn such that f = 1 and
dist (y, M_') = 1. Since y,t E Mn we can write
This shows that Ayn - An yn is in M_1. 1. For n > m the vector Ayn - Aym has the
form '\nyn - z where z e M_1. Since dist (yn, M_1) = 1, this shows that
(iii) Each nonzero point of a(A) is an eigenvalue of A and has finite multiplicity.
21. The Spectrum of a Compact Operator 173
(ii) Let A be the diagonal operator with diagonal entries 1,0, 1/2,0,1/3, 0, ... .
Then 0 is an eigenvalue of A with infinite multiplicity. Each point 1/n is an
eigenvalue of A with multiplicity one.
(iii) Let A = D the diagonal operator with diagonal entries 1,1/2,1/3, ....Then 0
is not an eigenvalue. The points 1/n are eigenvalues of A and 0 is their limit
point.
If Ax = Ax, then
xn = (n - 1)! An-Ixl for all n.
It is easy to see that A has no eigenvalue. So in this case 0 is the only point
in Q(A), and is not an eigenvalue. Note that the operators in (iii) and (iv) are
174 Notes on Functional Analysis
0 0 1/2 0
0 0 0 1/3
DT=
and SD is the transpose of this matrix. The first matrix has entries (1, 1/2, 1/3,... )
on its first superdiagonal, and the second on its first subdiagonal. If we take
the top left rt n block of either of these matrices, it has zero as an eigenw
value of multiplicity n. One may naively expect that DT and SD have o as an
eigenvalue with infinite multiplicity. This fails, in different ways, in both the
cases.
X = ker(A -
12. We know that A is compact if and only if A* is compact. We know also that
a(A) = Q(A*). In Section 10 we have seen an example where 0 is an eigenvalue of A
but not of A*. The nonzero points in the set Q(A) = Q(A*) can only be eigenvalues
of finite multiplicity for either operator. More is true: each point A 0 has the same
multiplicity as an eigenvalue for A as it has for A*.
Proof. Let m* and m be the numbers on the left and the right hand sides of (21.1).
We show first that m* < n. Let x i , ... , Xm be a basis for the space ker (A - A).
Choose linear functionals fi,... , fm on X such that f2(x3) = bid . (Use the H.B.T. )
If m* > m, there exist m + 1 linearly independent elements gi,... , gm+i in the space
ker (A* - A) C X *. Choose yi,... , ym+1 in X such that gi (yj) _ 6i3. (See Exercise
m
19 in Lecture 10.) For each x e X let Bx = f(x)y. This is a linear operator of
i=1
finite rank, and hence is compact. Note that
if 1 < j < m
(Bx, gj) _ J f fix)
0 if j =m+1.
Since gj E ker (A* - A),
((A+B-A)x,g)=
f(x) if 1-j-m
<< (21.2)
0 if j=m+1.
Thus g,,,+l annihilates ran (A + B - A) .Since g.,,,,+1 (y.,,,+1) = 1, this shows
ran (A + B - A) X.
Hence A E Q(A+B) and since A + B is compact A has to be an eigenvalue. This
is possible only if there exists a nonzero vector x such that (A + B - A) x = 0. If x
is such a vector, then from (21.2) f3(x) = 0 for all 1 < j < m, and hence by the
definition of B we have Bx = 0. So x E ker (A - A) .The vectors x3 are a basis for
this space, and hence
x = a1x1 + ... + amxrn
Using the relations f(x) _ we get from this cad = f(x) = 0 for all 1 < j < m.
But then x = 0. This is a contradiction. Hence we must have m* < m. Applying the
same argument to A* in place of A, we see that m**, the dimension of ker (A** - A)
is bounded as rn** < m* < m. On the other hand, if J is the cannonical embedding
of X in X*, then JA = A** J. Hence ker (A - Al) C ker (A** - Al) and m < m**.
Thus rn = m* .
we see that ker (A* - A) _ [ran (A - A)]'. Since ran (A - A) is closed (Theorem
11), dim [ran (A - A)]1 = codim ran (A - A) by Theorem 16 of Lecture 10.
14. Fredholm Operators. Let A E 13 (X, Y). The quotient space Y/ran A is
called the cokernel of A, and written as coker A. If either ker A or coker A has finite
21. The Spectrum of a Compact operator 177
If ker A and coker A both are finite-dimensional, we say that A is a Fredholm operator.
The index of such an operator is a finite integer.
15. The Fredholm Alternative. From Theorems 9 and 12 we can extract the fol-
lowing statement, a special case of which for certain integral equations was obtained
by Fredholm.
If the alternative (ii) is true, then the homogeneous equation Ax - x = 0 has only a
finite number of linearly independent solutions.
Continuing the analysis of the previous lecture we obtain more information about
compact operators.
1. Let A E ,CC3p(X) and let A ; 0. For brevity let us write N3 for the closed linear
space ker (A - a) , j = 0,1, 2, .... We have a nested chain of subspaces
(22.1)
If this is not the case, then there exists a sequence yj of unit vectors such that
y3 E N3 and dist (y3, 1/2. For n > m
2. Exercise. Let A and A be as above. Let R3 be the closed linear space ran (A-a)i.
We have a decreasing chain of subspaces
(22.3)
Proof. Choose indices p and q, not both zero, satisfying (22.2) and (22.4). Let
n = max(p, q). Let y E ker (A - A)n fl ran (A - A)72. Then there exists x such that
y = (A - a)72x, and (A - A)ny = 0. But then (A - A)2nx = 0; i.e., x e ker (A - A)2n.
Since ker (A - A)2" = ker (A - A)" this means y = 0. Thus the two subspaces on the
right hand side of (22.5) have zero intersection. Let x be any element of X. Then
(A - A)nx is in ran (A - A)n =ran (A - a)2". So there exists a vector y such that
(A - A)nx = (A - A)2y. We have
x = (x - (A - A)72y) + (A - A)72y.
The first summand in this sum is in ker (A - A)n and the second is in ran (A - A)72.
This proves (22.5). It is clear that each of the spaces is invariant under A.
NA = ker (A - A)n,
RA = ran (A - A)n.
Then the restriction of A to NA has a single point spectrum {A} and the restriction
of A to RA has spectrum a (A) \ {A}.
Note that the space NA is the linear span of the spaces ker (A - A)3, j = 1, 2, ... .
Likewise RA is the intersection of the spaces ran (A - A)i, j = 1, 2..... So, the
integer n plays no essential role in the statement of this corollary.
X = NA EBRA
obtained above, let Pa be the projection on Na along RA. This is called the Riesz
Projection of A corresponding to the eigenvalue A. Since A is an isolated point of
a(A) we can find a closed curve I' in the plane with winding number 1 around A and
0 around any other point of a(A). It turns out that PA has a representation
1
PA 2iri f (A-1d(.
Invariant subspaces
the case. A may have no nonzero eigenvalue and then the Riesz theory does not
even tell us whether A has any nontrivial invariant subspaces. Our next theorem
says such a space does exist.
A(D)C U Ti 1(D).
i=1
182 Notes on Functional Analysis
In particular, Ax0 E Tip 1(D) for some 1 < i1 < n. This means that Tit Ax0 E D
and ATi1 Ax0 E A(D). So ATi1 Ax0 E Ti21(D) for some 1 <i2 <n. This means that
Ax0 E D. Continuing this process m times we see that
Tim x0 E D. (22.6)
All the Tip here are from the finite set {T1,... , T7}. Let c = max { IIT2 II: 1 < i < n}.
Then
lITjm...T,A"`II c"`IIA"`II
=I
The operator cA has spectral radius 0. So, by the spectral radius formula I111/rn
converges to 0, and hence (cA)m II converges to 0. Thus
as m -> oo. So from (22.6) the point 0 is in the closure of the set D. This is a
contradiction.
The case of Hilbert space, as in most problems, is simpler. The case of normal
operators is especially simple and interesting. Before Riesz did it for Banach space
operators, Hilbert had made an analysis of the spectrum of compact self-adjoint
integral operators in the space L2. These ideas were extended by E. Schmidt to
general Hilbert spaces -a term that came into existence later.
Let us recall that all our Hilbert spaces are assumed to be separable.
Proof. Most of the work for the proof has already been done. We know that Q(A)
is real, and each nonzero point in Q(A) is an eigenvalue of finite multiplicity. It
is easy to see that eigenvectors corresponding to distinct eigenvalues are mutually
orthogonal. (If Ax =fix, and Ay = y, then (A - )(x,y) _ (fix, y) - (x,,ay) _
(Ax, y) - (x, Ay) = 0.) For each eigenvalue of A choose an orthonormal basis for
the corresponding eigenspace. Let {en} be the collection of all these eigenvectors
for all the eigenvalues. This is an orthonormal set whose closed linear span .M is
invariant under A. Suppose the space ,M1 is nonzero. Since A is self-adjoint ,M1 is
also invariant under A. Let Ao be the restriction of A to .M1. Then Ao is self-adjoint
and compact. If v(Ao) contains a nonzero point A, then A is an eigenvalue of Ao
and hence of A. (Because Ax = Aox =Ate.) Since all eigenvectors of A are in M,
this is not possible. Hence Q(Ao) _ {0}, which means spr (A0) = 0. Since Ao is
self-adjoint, this means lAo II = 0, and hence Ao = 0. Thus for every x e .Ml we
have Ax = Aox = 0, which implies x e M. Hence .M1 = {0} and ,M = 7-1.
184 Notes on Functional Analysis
We have shown that {e} is an orthonormal basis for 7-1 and there exist real
numbers An such that Aen = anen. We have seen earlier (see Sections 7 and 10 of
Lecture 20) that under these circumstances )n converges to 0.
9. With just one change-the ) are complex numbers all assertions of the Hilbert-
Schmidt theorem are valid for compact normal operators. The proof is essentially
the same.
A = 1n en en (22.7)
n
in which en is an orthonormal basis and {A} is a sequence of complex numbers
converging to zero. This is also written as
A anenen. (22.8)
n
Here enen is the orthogonal projection onto the one-dimensional space spanned by
the vector e. The expression 22.8 is called the spectral decomposition of A.
f(A) _ f(An)ene.
This is a bounded operator.
In particular, if A is compact and positive, we can define its positive square root
A1!2 using the spectral decomposition.
10. The spectral theorem shows that every compact normal operator A has a reduc-
ing subspace-a closed subspace M such that M and M L both are invariant under
A.
A = >sn(.,en)fn. (22.9)
n
Proof. The operator A*A is compact and positive. So there exists an orthonormal
set {en} and positive numbers sn such that A*Ae,z = seen. The sn are all the nonzero
eigenvalues of A*A; the operator A*A vanishes on the orthogonal complement of the
space spanned by the {e}. Let In = n (Aen). Then
,fn .fm = 1
5nsm
(Aen Aem = 1e
5n5m
(A*Ae n m = nm,
i.e., the set {fn} consists of orthonormal vectors. Every vector x in 7-( can be
expanded as
x= (x, en)en + y,
n
where y e ker A*A = ker IAI. Using the polar decomposition A = UIAI we see that
Ay = 0. Thus Ax = (x, en)Aen = I Sn(x, en)fn
We may expand the sequence {sn} to include the zero eigenvalues of A*A and
the sets {en} and {f,} to orthonormal bases. The numbers sn are called the singular
values of A. They are the eigenvalues of the operator IAI. It is customary to arrange
sn in decreasing order. We have then an enumeration
Let X be any Banach space and let A be any (bounded linear) operator on it.
Does there exist a (proper closed) subspace Y in X that is invariant under A? This
question is called the Invariant subspace problem and has been of much interest in
functional analysis.
In 1949 von Neumann proved that every compact operator on a Hilbert space
has an invariant subspace. In 1954 Aronszajn and Smith extended this result to all
Banach spaces. For many years after that there was small progress on this problem.
(Sample result: if there exists a polynomial p such that p(A) is compact, then A has
an invariant subspace.) Lomonosov's Theorem announced in 1973 subsumed most
of the results then known, had a simple proof, and seemed to be valid for almost
all operators. (One needs to ensure that A commutes with some nonzero compact
operator.) The proof of Theorem 6 given here is due to H. M. Hilden.
Trace Ideals
Let A be a compact operator on (an infinite-dimensional) Hilbert space '1-1 and let
Proof. For brevity we write sfor sn (A). Let A have the singular value decompo-
sition
A= (23.3)
Let F be any operator with rank F C ri --- 1. Then we may choose a unit vector x in
188 Notes on Functional Analysis
IA-FM ? I? IlAxil I
j=1
s(AB) s(A)IIBM,
s(BA) <
Proof. Let A and F be as in (23.3) and (23.4). Since rank FB <n - 1, we have
from Lemma 1
This proves the first assertion. The second has a similar proof.
The right hand side of this inequality is symmetric in A and B. Hence we have also
s(B)-s(A)<1IA-B.
This proves the assertion.
s(A) <oc.
Then we say that A belongs to Cl, or that A is a trace class operator. In this case
we define HAIIi as
oa
4. Lemma. Let A be a trace class operator. Then for any two orthonormal sets
{x,,,,} and {y} we have
00
(AXrn,y hAlt'. (23.6)
m_1
n=1 n=1
190 Notes on Functional Analysis
Since all the summands are positive, the two sums may be interchanged, and this
double sum is equal to
sn I(xrn,en)H(fn,yrn)I.
n=1 m=1
n=1 m=1
Since en and f n are unit vectors, by Bessel's inequality this expression is bounded
by
sn=llAll,
n-
5. The trace. Let A E Cl and let {Xm} be any orthonormal basis for N. Let
00
tr A = >(AXm,Xm). (23.7)
m=1
Lemma 4 implies that this series converges absolutely and its terms may, therefore,
be rearranged. We show that the sum in (23.7) does not depend on the orthonormal
basis {xm,}.
Theorem. Let A be a trace class operator with singular value decomposition (23.3).
Then for every orthonormal basis {Xm} we have
(23.8)
m=1 n=1
The order of summation can be changed by the argument in the proof of Lemma 4
and we have
00 ao 00
(AXm,Xm) sn (xm,en)(fn,xm)
m.=1 n=1 m=1
00 00
m=1
(<00. (23.10)
for some orthonormal basis, then it is not necessary that A is trace class. (The right
shift S on P2 is an example.) For A to be trace class the series (23.10) must converge
for every orthonormal basis.
A+B = sn(A+B)(.,en)fn
n=1
be the singular value decomposition. Then
Hence
I + l(Ben,.fn)
n=1 n=1 n=1
IIAMi + llBIli.
n1,Sj(An) = Sj(A)
Let e be any positive number. By the diagonal procedure we can obtain a subse-
quence {Ak} such that
and hence A E Cl . Now choose any n and let (A- A) _ > s( A - A)(.,
i=1
f
be the singular value decomposition of the operator An - A. Then for each positive
integer N
N N
>sj(A-A) _
j=1 j=1
N
kIn - Ak)ei , 13)1 (since Ak -> A)
(The last limit exists because { n - Ak IIi} is a Cauchy sequence.) It follows that
- -ql < y IIAn - 1kIIi Taking the limit as n - o0 one sees that IIAn - All1
converges to 0.
8. Theorem. Let A be a trace class operator and B any bounded operator. Then
AB and BA are trace class and
Theorem. Let A be a trace class operator and B any bounded operator. Then
tr AB = tr BA. (23.13)
Proof. We prove this for a special case first. Let U be any unitary operator, and
let {x} be any orthonormal basis. Then the vectors yn = U*xn form another
orthonormal basis. We have
So, the equality (23.13) is true when B is a unitary operator. The general case
follows from this because of the following lemma and the obvious fact that the trace
is a linear functional on Cl.
194 Notes on Functional Analysis
If B is any self-adjoint operator, we may divide it by (TWIT and reduce it to the special
case above. Thus every self-adjoint operator is a linear combination of two unitary
operators. Since every bounded operator is a linear combination of two self-acijoint
operators the lemma is proved.
If b is a real number with Ibi < 1, then b = cos 8 for some B in [0, it]. In this case
the equation (23.14) defines two numbers exp(fiB) whose sum is 2 cos B.
(i) The collection Cl consisting of trace class operators is a 2-sided, *-closed ideal
in 13(7-l).
(v) The formula (23.7) defines a linear functional called trace on Cl. This has the
property tr AB = tr BA.
IlI..XIIIIAM1IIYII (23.15)
23. Trace Ideals
IIUAVII1 = I (23.16)
11. Exercise. Let A be a self-adjoint, trace class operator. Show that tr A is the
sum of the eigenvalues of A each counted as often as its multiplicity.
(This statement is true for all trace class operators, not necessarily self-adjoint.
This is called Lidskii's Theorem and its proof is somewhat intricate.)
Hilbert-Schmidt Operators
is finite for every orthonormal basis {xn} and is independent of the choice of
the orthonormal basis. This gives an inner product on C2.
This is called the Hilbert-Schmidt norm. With this norm C2 is complete. Thus
C2 is a Hilbert space.
196 Notes on Functional Analysis
'bmn(X,y) = com(x)pn(y)
K(x,y)cpm(y)cp dx dy .
J001 1
It follows that
r1 f1
lIKhl2 = trK*K = K(x,y)I2dxdy.
f Jo
Schatten Classes
Let A be a compact operator and let 1 <p < oc. We say A is in the class Cp if
00
s(A) < oo.
n=1
For such an operator we define
00 i/P
IlAII=(s(A))
n=1
23. Trace Ideals 197
I = sl(A) = MAIl.
There is an interesting analogy between the sequence spaces PP and the Schatten
spaces C. This goes as follows. The class of all compact operators is thought of
as being analogous to the space co consisting of sequences that converge to 0. (The
singular values of a compact operator are in the space co.) The space CP, 1 < p < oo,
is thought of being analogous to the space 4. We state without proof some facts.
(i) For 1 <p < oo the space C is the Banach space dual of C4, where P + 9 = 1.
(ii) The space Cl is the Banach space dual of the space ,t3o(X) consisting of all
compact operators.
Exercises. Here are two exercises arising from our discussion of singular values.
14. Let .Fn be the collection of all operators whose rank is at most n. Show that this
is a norm closed subset of 13(7-1).
where Ae3 = ) j e3 . We can express this in other ways. Let > ) > ... > ) j be the
distinct eigenvalues of A and let ml, m2,.. - , mbe their multiplicities. Then there
exists a unitary operator U such that
k
U*AU = (24.2)
3=1
viewpoint and each of which is useful in different ways. We will study some of these
versions.
7(=117(2... =l(n
x=(xi,x2,...)
where x3 E 7-1 and 1 11x3 IIZ <00. The inner product on 7-1 is defined as
(x,y) _ (Xn,yn),
n=1
and this makes 7-iC into a Hilbert space.
(pf1,pf2,. . .).
A very special and simple situation is the case when X is an interval [a, b] and
(t) = t. The induced multiplication operator M, is then called a canonical multipli-
cation operator. For brevity we write this operator as M. One version of the spectral
theorem says that every self-adjoint operator on a Hilbert space is equivalent to a
canonical multiplication operator.
The theorem is proved in two steps. First we consider a special case when A has
a cyclic vector. The proof in this case is an application of the Riesz Representation
Theorem or Helly's Theorem proved in Lectures 7 and 8. We follow arguments
that lead from the finite-dimensional case to the infinite-dimensional one, thereby
reducing the mystery of the proof to some extent.
2. Cyclic spaces and vectors. Let A be any operator on 7-1. Given a vector x let
S be the closure of the linear span of the family {x, Ax, A2x,... }. We say that S is
a cyclic subspace of 7-1 with x as a cyclic vector. If there exists a vector xo such that
the cyclic subspace corresponding to it is the entire space 7.1 we say that A has a
cyclic vector xo in 7-1.
Proof. Let xo be a cyclic vector for A. We may assume IxoM = 1. Using the Gram-
Schmidt procedure obtain from the set {xo, Axo, A2x0, ... } an orthonormal basis
{yo, yl, y2, ...} for 7-1. Let Sn be the subspace apanned by the first n vectors in this
basis, and Pn the orthogonal projection onto Sn . The sequence {Pn} is an increasing
sequence that converges strongly to I. Let An = Pn APn . Then ((I < h and An
converges strongly to A. The operator An annihilates Sn and maps Sn into itself.
Let An be the restriction of An to S.
We apply the known finite-dimensional spectral theorem to the Hermitian oper-
24. The Spectral Theorem -I 201
ator An on the n-dimensional space S. Let Anl > An2 > > 'nk,be the distinct
eigenvalues of A. Then Anj I < IIAnII < hand there exist mutually orthogonal
projections with ranges contained in Sn such that
An = An j P3. (24.4)
j=1
(Pnjx0,x0) (24.5)
7:AnjEE
lim ff dn = ffd.
Since the measure ,un is concentrated at the finite set {A1,... , nkn } we have
kn
f j=1
From the representation (24.4) we see that the right hand side of (24.6) is equal to
(Axo, x0). Since A71-.A,
S
we have
for r = 0,1,2.....
202 Notes on Functional Analysis
For r = 0, 1, 2, ... , let cpr(t) = tr. The collection {r(t)} is a fundamental set
in LZ(X, ) while the collection {ATx0} is a fundamental set in x. Define a map U
between these two sets as follows
(r,s) = ft8d(t).
From (24.7) we have, therefore
In other words
(U(Arxo),U(Asxo)) _ (Kxo,A8x0).
Thus the map U preserves inner products. Since {is a fundamental set in 7-1
we can extend U uniquely to a linear isometry from ?-l into L2(X, ). The range of an
isometry is always closed. In this case the range contains all polynomial functions,
and hence is equal to L2(X,). Thus U is a unitary operator. From the equation
(24.8) defining U we have
4. Proof of Theorem 1. Let x1 be any unit vector in x and let Si be the closed
linear span of the set {Xi, Ax 1, A2 x 1i ... }. If S1 = 7-1, the theorem reduces to the
24. The Spectral Theorem -I 203
{x2, Ax2, A2x2,... }. An application of Zorn's Lemma shows that 1-1 can be written
as a countable direct sum
in which each Sn is a cyclic subspace and the S72 are mutually orthogonal.
Proposition 3 can be applied to each cyclic space Sn to get a measure fin. The
theorem follows from this.
Examples
5. Let 7-l = cn and let A be a Hermitian operator on ?-l with distinct eigenvalues
al, ... , An. Define a probability measure on [-hAll, Iby the rule ({A3}) _
and (E) = 0 for every set that contains none of the points A3. Then the space L2()
is fin. A typical element of this space may be written as 1(A) _ (f(A1),... , f(An)).
The spectral theorem tells us that A is equivalent to the operator that sends 1(A)
to Af(A) = (A1f(A1),... ,Anf(An)).
Note that in this example we could have chosen any probability measure such
n
that
j=1
for which mp > 1, then the eigenvalues .Ae+l, , Ak no longer occur in this listing.
Pick up vectors yl, ... , ye in 7-li such that yj is an eigenvector of A corresponding
to Let N2 be the P-dimensional space spanned by these vectors. Let 2 be
the probability measure on [-11AM, lAM] such that 1/Q for 1 < j < Q.
The space N2 is isomorphic to L2(2), and A restricted to N2 is equivalent to the
canonical multiplication operator on L2(2). This process can be repeated, and we
get measures i, 2i ... , r such that A is equivalent to the canonical multiplication
operator on L2(1) L2(2) L2(T)
(i) The operator A has no cyclic vector. This can be proved as follows. Let sgn(x)
be the function that takes the value 1 when x > 0 and -1 when x < 0. For
any function f in 7-l let g(x) = sgn(x) f (-x). Then
fl g(x)x2 f(x)dx
1
Cl
(-x)x2"f
J =J 1 sgn(x)f (x)dx.
The integrand is an odd function and so the integral is zero. This shows that
g is orthogonal to {f, Af, AZ f, ...}.
(ii) Let Neven and Nodd be subspaces of N consisting of even and odd functions,
respectively. These two spaces are mutually orthogonal and each of them is a
cyclic subspace for A.
24. The Spectral Theorem -I 205
(iii) Define a map U from ?-leven onto L2[0,1] as follows. For cp E 'Heven
(t'/2)
(U)(t) = 1/4 t [0,1). (24.9)
(iv) Use the formula (24.9) to define a map U from -fodd to L2[0,1]. In this case
the inverse of this map is
9. The spectral theorem, another form. One can replace the family {} oc-
curing in Theorem 1 by a single measure. The price to be paid is that the underlying
space [-hAll, fis replaced by a more complicated space. One way of doing this is
as follows. Let Xn = [-IIAII, Ifor all n = 1, 2..... Let X = U1 X, where this
union is understood to mean a union of different copies of the same space. Let be
the probability measure on X defined by the requirement that its restriction to the
nth copy in the union above is the measure Then is a probability measure
on X and the space L2() is isomorphic to the space ElL2(n). The operator A is
206 Notes on Functional Analysis
The measures {jcn} associated with A by the spectral theorem are called spectral
measures. They are measures on the interval X = [-hAil, 1 All]. In the familiar
situation of Examples 5,6 and 7 we saw that these measures are concentrated on
a(A) and vanish on the rest of X. This is, in fact, true always.
10. Exercise. (i) Let Al be the canonical multiplication operator in L2(X, ). Show
that cr(M) = supp. [Hint: If cp is any bounded measurable function, then a(M)
is the essential range of cp. See Lecture 18.]
(ii) Let A be aself-adjoint operator with a cyclic vector and let be a spectral
measure associated with it (as in Proposition 3). Then
supp = v(A).
11. Theorem. Let A be a self-adjoint operator and let {p} be its spectral measures.
Then
a(A) =
{The set on the right hand side is called the support of the family {n} and is written
as supp {n}.)
24. The Spectral Theorem -I 207
We saw that spectral measures associated with A are not unique. This is less
serious than it seems at first.
Two measures and v on X are said to be equivalent if they have the same null
sets; i.e., (E) = 0 a v(E) = 0. If (E) = 0 whenever v(E) = 0 we say v. (
is absolutely continuous with respect to v.)
12. Exercise. (i) If and v are equivalent measures, then the Hilbert spaces L2()
and L2(v) are isomorphic. The operator U defined by
(Uf)(x) =
is a unitary operator from L2() onto L2(v). (dp/dv is the Radon-Nikodym deriva-
tive; it exists whenever << v.
(ii) Let M and M be the canonical multiplication operators on L2() and L2(v).
Then
UMU* = M.
(iii) Conversely, suppose and v are two measures on X, and there exists a unitary
operator U from L2() onto L2(v) such that UMLU* = M. Then and v are
equivalent. [To prove this proceed as follows. Show that M for all k =
0, 1, 2, ....Let cpk(x) = x and let go = U(cpo). Show that U(k) = Since U is
unitary this shows that
It follows that
f(x)dp(x) = ff(x)g(x)dv(x),
J
for all f E L2(). This shows that v.J
This leaves the question of "multiplicities" of the spectral measures: how often
208 Notes on Functional Analysis
does a measure n occur in the sequence of spectral measures. This too has a
natural and pleasing answer, and we leave it at that for now.
Lecture 25
Projection-valued measures
Let (X, S) be a measurable space (any set Xwith a Q-algebra of subsets S). Let
P(N) be the collection of all orthogonal projection operators in a Hilbert space N.
A projection-valued measure on X is a map E --+ P(E) from S into 2(N) with the
following properties:
P(E),
where the series on the right converges in the strong operator topology.
(i) P(q) = 0.
The multiplicative property (iv) is important. Let us indicate its proof. Write
E U F as a disjoint union
EuF= (EnF')u(FnE')u(EnF).
This shows that
The first two terms on the right hand side vanish (as P(F) 1 P(E fl F') and P(E) 1
P(F f1 E').) Since E is a subset of E U F we have P(E)P(E U F) = P(E). So, the
left hand side is equal to P(E)P(F). Similarly the fourth term on the right hand
side is equal to P(E fl F).
Then P2 is a pvm on X in N2. We say that P1 and P2 are unitarily equivalent pvm's.
Then P is a pvm on X in N. We say that P is the direct sum of Pi, P2,... and write
this as P = EBnPn.
Let be a measure on (X, S) and let ?-l be the space L2(). For every f E L2()
and EESlet
(P'2(E)) .f = XE.f,
Then P is a pvm if and only if for every v v is a probability measure on (X, S).
212 Notes on Functional Analysis
Hence
P(EuF)=P(E)+P(F).
Since P(.) is a projection operator, this means that P(E) 1 P(F) whenever E and
F are disjoint.
Now let {En} be a family of mutually disjoint sets. Then {P(E)} is a family of
mutually orthogonal projections. Hence the series > P(En) converges strongly to
a projection. (The sequence of partial sums of this series is an increasing sequence
of projections.) Hence we have
00 00 00
n=1
v =
n=1
(P(En)v,v) - n=1 = pv (U lEn)
= KP(UlEm)v,v).
Integration
s= Z
i=1
in which {E2} are disjoint measurable sets that cover X and ai are complex numbers.
It is natural to define
fs dP = aZP(EZ).
(i) For all simple functions sl and s2, and complex numbers al and a2 we have
(iv) If v is a unit vector in 7-( and the measure associated with P according to
the equation (25.1), then
8. Now let f be any bounded measurable function on X. Then there exists a sequence
of simple functions {sn} such that sue, converges uniformly to f. From the Property
(v) in Exercise 7, we see that for any m and n
where {sn} is any sequence of simple functions converging uniformly to f. The limit
here is the limit in the norm topology of 13(7-1).
9. Exercise. Verify that f fdP defined above satisfies the five properties analogous
to those proved in Exercise 7. (The supremum in Property (v) is now replaced by
the essential supremum of f.)
10. Note that f fdP so defined is a normal operator on x. If f is real, this operator
is self-adjoint and if If (x)) = 1, then it is unitary.
What we have called f fdP is the integral over the entire space X. If E is any
measurable subset of X we put
L fdP = f(xEf)dP.
Other conventions of ordinary integration are adopted in an obvious way.
Suppose fn -> f almost everywhere (with respect to the pvm P). Then the sequence
of operators {f f,dP} converges strongly to the operator f fdP.
Proof. We use the measures v defined in (25.1) to reduce the problem to one about
ordinary measures.
The assumption that fn -> f except possibly on a set E with P(E) = 0 implies
that for every unit vector v, f- f almost everywhere with respect to v. Hence by
the (ordinary) Lebesgue dominated convergence theorem the integral f If- f I2dv
I- ffdP)v112 = fIIn_II2dv.
To say that this goes to 0 for all v is to say that f fdP converges strongly to f f dP.
12. Exercise. Under the hypotheses of the theorem above it is not necessary that
f f7dP converges in norm to f fdP. To see this let X = [0, 1], 7-1 = L2[0,1], and
let P be the canonical pvm. For each n let fn be the characteristic function of the
interval [0,1 - 1/n] .Observe that f fdP is not a Cauchy sequence in 8(7-1).
CfdP I f = (25.2)
J cp dP'` = P(E)
We now have all the machinery to prove another form of the spectral theorem.
Proof. Recall the multiplication operator form of the spectral theorem. This says
that there exist a sequence of probability measures {n} on X and a unitary operator
U from 7-1 onto the space 7-lo _ EEL2(n) such that UAU* = M, the canonical
multiplication operator on 7-[o. By the Corollary in Section 14 M = ft dPp(t), where
Po is the canonical pvm on X in In other words,
UAU* = ft dPo(t).
Let P be the pvm on X in 7-1 defined by
P(.) = U*Po(.)U.
Then, by Exercise 13, we see that the representation (25.3) is valid. (We have used
the variable ) here to show the theorem as a generalisation of the finite-dimensional
expression (24.2).)
It remains to be shown that the pvm P occuring in (25.3) is unique. Suppose Q
is another pvm on X such that
A = f a dQ(a).
x
By the Property (ii) of Exercise 7 and Exercise 9 we have then
)J1dP(\) n=0,1,2,...
= f xAdQ(\),
Hence for all unit vectors v,
Hence
16. Exercise. Let be the pvm associated with A via (25.3). Then the family
commutes with A. [Let f be a characteristic function xF. Then f f7)dP(A) _
P(F) and this commutes with all P(E). Extend this to all f by the familiar routine.]
218 Notes on Functional Analysis
Let P be a pvm on a Hausdorff topological space with its Borel a-algebra. Let
E be the union of all open sets G in X for which P(G) = 0. The set X\E is called
the support of P and is written as supp P.
17. Proposition. Let P be the pvm associated with a self-adjoint operator A via
the spectral theorem. Then
supp P = Q(A). (25.5)
Proof. Suppose ;\ supp P. Then there exists e > 0 such that PA-E, A+E) = 0. Let
v be any unit vector and v the measure defined by (25.1). Then v is concentrated
on the complement of the interval (A - E, A -}- E). Hence It - A > s almost everwhere
with respect to . Since
II(A - A)vIl2 = X It -
this shows that I I(A -A)vII2 > e2. This shows that A - A is bounded below by E. So
A cannot be an approximate eigenvalue of A, and hence cannot be in v(A).
Now suppose A E supp P. Then for every positive integer n, the projection P(A -
n, A 0. Let v be any unit vector in the range of this projection. Then for any
set E contained in the complement of the interval (A - n, A + n) we have vrz (E) = 0.
Hence
1
I(A - 1)vII2 = It - AIZd(t) _ It - 1l2dvn(t) < n2.
I
Li/n
Thus {vn} is a sequence of approximate eigenvectors of A, and hence A E v(A).
This lecture is a quick review of some matters related to the spectral theorem.
Then the spectrum of A is contained in the interval [a, b] and contains the points a
and b.
Proof. It is enough to prove the proposition in the special case when a = 0; i.e.
when the operator A is positive. (Consider the operator A -- a instead of A.) In this
case for every real number ) we have
((A-)x,x) -Ajfxff.
So if A C 0, then A -- ,k is bounded below and hence invertible. Thus cr(A) does not
contain any negative number. Since a - = o, the operator A is not invertible. Hence
220 Notes on Functional Analysis
So a(A) is contained in [a, b]. Since Q(A) is a closed set it contains the point b.
Functions of A
The spectral theorem makes it easy to define a function f(A) of the operator A
corresponding to every bounded measurable function f defined on Q(A).
A= a dP(A) (26.1)
v(A)
We could also have used the first form of the spectral theorem. If A is equivalent
to the multiplication operator Mv,, then f(A) is equivalent to the multiplication
operator Mfo.
If A is a positive operator, a(A) is contained in [0, oo). Every point of this set
has a unique positive square root. So, we get from the prescription (26.2) a unique
positive operator A"2, the square root of A. In the other picture, the function
cp representing A takes only nonnegative values. The operator A"2 is then the
multiplication operator corresponding to the function cp1/2
Let A be aself-adjoint operator and let he the pvm associated with it.
Suppose B is any operator that commutes with P(E) for all measurable sets E. Then
26. The Spectral Theorem -III 221
B commutes with f fdP for all bounded measurable functions f. (Prove this first for
characteristic functions, then for simple functions, and then for all f.) Conversely,
suppose B commutes with A. Then B commutes with all powers An. Let x and y be
any two vectors. Since An = f A"dP(A), we have
An"d(P()x,B*y) (Anx,B*y) (AnBx,y) = fd(P()Bx,y).
_ _ (BA'2x,y) _
Since this is true for all n, we must have
(P(E)x,B*y) _ (P(E)Bx,y),
The essential spectrum Qess(A) consists of those points A for which the range of
the projection P(A -e, A +e) is infinite-dimensional for every e > 0. If for some e > 0,
this range is finite-dimensional we say that a is in Qdisc(A), the discrete spectrum of
A. Thus the spectrum v(A) is decomposed into two disjoint parts, the essential and
the discrete spectrum.
(ii) Qdisc(A) is not always a closed set. (e.g. in the case of a compact operator for
which 0 is not in the spectrum but is a. limit point of the spectrum.)
(iii) A point A is in the set ads(A) if and only if A is an isolated point of Q(A)
and is an eigenvalue of finite multiplicity. Thus A is in Qess(A) if it is either
an eigenvalue of infinite multiplicity or is a limit point of v(A).
Proof. If A e cress (A), then for every n the space ran P A - n , A + n is infinite-
dimensional. Choose an orthonormal sequence {xk k = 1, 2, ... } in this space.
Then
ll(A - A)xk 112 < for all k.
n
(See the proof of Proposition 17 in Lecture 25.) By the diagonal procedure we may
pick up a sequence {xn} such that II(A - A)xnII2 < 1/n2 for n = 1, 2, ... .
If a E Qdisc(A), then for some s > 0 the space ran P(A - e, A + e) is finite-
dimensional. So, if {xn} is any orthonormal sequence, then this space can contain
only finitely many terms of this sequence, say xl, X2,... , xN. For n > N we have,
therefore, II(A - a)x lI2 > e2. Thus (A - A)xn cannot converge to 0.
The following theorem says that adding a compact operator to a bounded self-
adjoint operator does not change its essential spectrum.
I- a)xn II I
- a)xII + I- A)x II,
this shows that (B - -f 0, and hence '\ E Qess(B). Thus cress(A) C oess(B)
By symmetry the reverse inclusion is also true.
One may note here that the spectral theorem for a compact self-adjoint operator
follows from this. (Choose B = 0.)
A= fz dP(z). (26.3)
The multiplication operator form of this theorem says that A is unitarily equiv-
alent to an operator of the form M(,, in some space L2().
Let U be a unitary operator. Then a(U) is contained in the unit circle. We may
identify the unit circle with the interval [-7r, nj as usual.
Let x be any vector in 1-1 and for n E 7L, let
an = (Ux,x).
Then for any sequence of complex numbers z1, z2, ... , we have
L
j,k j,k
x)zjzk
_ .(Uix, UIcx)z7zk
j,k
J
Thus the sequence {an} is a positive-definite sequence. By the Herglotz Theorem
(Lecture 8) there exists a positive measure on [-ir, n] such that
Using the polarisation identity we can express (Unx, y) for any pair of vectors x, y
as a sum of four such terms. This leads to the relation
4
(ILx+y - /2x-y + 2/Lx+iy - ii- x_iy) .
(ii) =
(P(E)x,y) =
From the properties in Exercise 7 it follows that P(E) is self-adjoint and countably
additive. To prove that it is a pvm we need to show that P(E)2 = P(E) for all E.
We prove a stronger statement.
(If f ei"`td(t) = f eintdv(t) for all n, then the measures and v on [-ir, it] are equal.)
Integrate the two sides of (26.8) over the set E. This gives
f XE y) _ (P(E)Ukx, y)
_ (Ukx, P(E)y) (since P(E) is self-adjoint)
_ J eiktd(P(t)x, P(E)y) (from (26.5) and (26.6)).
This is true for all x and y. Hence we have the assertion (26.7).
Thus is a pvm on the unit circle (identified with [-it, it]). The relations
(26.5) and (26.6) show that
U= f eitdP(t), (26.9)
where P is a pvm on the unit circle. The integral exists in the norm topology; the
proof given for self-adjoint operators in Lecture 25 works here too.
Let (X, be a measure space. A bijection T of X such that T and T-' are
measurable is called an automorphism of (X, S). If T-1(E) _ u(E) for all E E S,
then T is called ameasure-preserving map.
10. Exercise. The aim of this exercise is to show that the set of compact operators
Cio(7-l) is the only closed 2-sided (proper) ideal in ,Ci(?-l). Fill in the details.
(1) Let Z be any 2-sided ideal in 8(7-1). Let T E Z and let u, v be any two vectors
such that Tn = v. Let A be any rank-one opearator. Then there exist vectors
x and y such that A = (.,x)y. Let B = (.,x)u and let C be any operator such
that Cv = y. Show that A = CTB. Thus Z contains all rank-one operators,
and hence it contains all operators of finite rank.
(ii) Suppose Z contains a positive operator A that is not compact. Then there exists
an E > 0 such that the range of the projection P(e, oo) is infinite-dimensional.
26. The Spectral Theorem -III 229
(Here P is the pvm associated with A.) Let .M be this range and let V be a
unitary operator from 1-[ onto M. Since A(M) _ we have
IV*AVxII ? IIxII.
(iii) Thus if Z is any proper 2-sided ideal in 13(7-1) then every element of T is a
compact operator and every finite-rank operator is in Z. Since Xio(fl) is the
norm closure of finite-rank operators, if Z is closed, then it is equal to 130(?-l).
Index
A1!2, 155 c, 5
At, 113 4, 5
A*, 111 ffdP, 214
A a9 A, 103 oc-norm, 2
A- A, 194 (x,y),, 82
BV[4,1], 53 codim, 77
C(X), 3 css ran cp, 149
C[0,1], 3 lnd A, 177
c7 [o, 1], 4 kcr, 87
kcr A, 158
L 7 ran, 87
7 ran A, 158
.RA(A), 132 spr (A), 135
S1, 76 supp P, 218
S-1--, 85 supp,a, 206
W(A), 219 Iran cp, 149
X/, 19 tr A, 190
X**, 73 (E), 211
X*, 25 p(E), 212
[8], 77 P(A), 132
13(X, Y), 21 a(A), 134
t3(X), 23 139
adisc A } 222
,
Index 231
(U.B.P.), 36
Walsh functions, 99
weak* compact, 58
weak* continuous, 76
weak* topology, 74
weakly analytic, 131
weakly differentiable, 130
weak operator topology, 103
weak topology, 66, 74, 79
metrisability of unit ball, 97
not metrisable, 69
weighted shift, 150
weight sequence, 151
Weyl's Perturbation Theorem, 223
Young's inequality, 2