An Introduction To Functional Analysis For Science and Engineering
1 Introduction
Physical scientists and engineers are typically well educated in many branches of mathematics. Sets, the
various kinds of numbers, calculus, differential equations, and linear algebra (especially with finite
matrices) form a typical grounding. It is not uncommon in these disciplines to encounter results from
another field of mathematics when we have to work with sets of functions; this is routine in quantum
mechanics, for example, which is mathematically built around the general linear algebra of operators and
sets of eigenfunctions. But that field of mathematics is not itself part of the typical course sequence for such
scientists and engineers. When we need to understand those results more deeply, we therefore have a
problem. Recently, in understanding problems with waves [1], for example, such as meaningfully counting
the number of usable communications channels between sources and receivers, that lack of understanding
has led to substantial confusion and even error1. This “missing” field of mathematics is functional analysis.
Functional analysis is a highly developed field that is well-known to mathematicians. But, possibly because
it is not generally taught to others, its literature is resolutely mathematical, erecting a higher barrier of
1
Indeed, the impetus for writing this review was precisely to give the necessary background for a deeper analysis of such wave problems [1].
incomprehensibility for other mortals. Its texts, like many in mathematics, tend to be dry to the point of
total desiccation. That may suit those who like mathematics for its own sake, and may even be regarded as
a virtue in that discipline. But, for others whose motivation is to understand the mathematics they need for
specific actual problems in the physical world, the lack of any narrative structure (so – where are we going
with this?) and of any sense of purpose in this next definition, lemma or proof (so – do I need this, and if
so why?) can make the field practically inaccessible to many who could otherwise understand it well and use it productively.
This article is a modest attempt to introduce at least an important and useful part of this field of functional
analysis. By a careful selection of topics, by avoiding the temptation of every incidental lemma, and by
relegating major proofs to the end, I hope to construct a narrative that leads us willingly through to
understanding. A few of those proofs do indeed require some deep intellectual effort, especially if one is
not used to certain kinds of mathematical arguments. I have, however, relegated all of these proofs to the
end, so the reader can more easily follow the sequence of useful ideas. With that, this field, or at least the
part I will cover, is, I believe, relatively comprehensible, and even straightforward.
Broadly speaking, functional analysis takes the kinds of results that are simple and even obvious for
concepts such as the convergence of sequences of real numbers, and extends them to other situations; we
can then examine especially the convergence of sequences of different vectors (possibly with large or even
infinite numbers of elements) or continuous functions, and even sequences of matrices or linear operators.
This extension is possible because we only need to generalize a few of the properties we find with real
numbers; then we can establish the necessary techniques for convergence of these more complex vectors,
functions, matrices, and operators. We can then build on those techniques to generate powerful results.
2.1.2 Operators
In normal matrix algebra with finite matrices, we are used to the idea that a matrix can “operate on” (that
is, multiply) a vector (usually considered to be on the right of the matrix) to generate a new vector. A matrix
operating in this way is technically a “linear operator”. In the mathematics we develop here, we generalize
this idea of a linear operator; the reader can, however, be confident that this operator can always in the end
be thought of as a matrix, possibly of infinite dimensions. Importantly, with the same functional analysis
mathematics, this operator could be a linear integral operator (as in a Fourier transform operator, or a
Green’s function, for example), kinds of operators that are common in working with continuous functions.
For this mathematics, we have to define some additional concepts for such operators, especially whether
they are what is known as “compact”. This “compactness” property is essentially what allows us to
approximate infinite systems with appropriate finite ones. Such compactness is trivial for finite matrices
(all finite matrices have this property), but other operators may not be compact.
A particularly important class of operators – “Hilbert-Schmidt” operators – comes up frequently in physical
problems, and we will see these are all compact. They have another important property that allows us to
define “sum rules” for various physical problems. A further important characteristic of operators is whether
they are “Hermitian”. For a matrix, this simply means that, if we transpose the matrix and then take the
complex conjugate of all the elements, we recover the original matrix. For more general operators, we can
give a correspondingly more general definition. This “Hermiticity” appears often in our applications, and
yields further very useful mathematical results.
Operators can lead to very specific functions – the “eigenfunctions” – which are those functions that, when
we operate on them with the operator, lead to the same function at the output, just multiplied by some
“eigenvalue”. This is analogous to the similar behavior for eigenvectors and eigenvalues for matrices, but
these functions can have many further useful properties in our case.
We include an index of all of these definitions at the end (section 13), so the reader can refer back to those as the reader progresses further into the development here.
This article is customized in its order and in the specific material included, and the overall argument
presented here is original in that sense. The underlying mathematical material is, however, standard; nearly
all the mathematical definitions, theorems and proofs below are found in texts in functional analysis. In
particular, I have made extensive use of discussions in Kreyszig [2], Hunter and Nachtergaele [3], and
Hanson and Yakovlev [4], as well as general mathematical reference material2. If some proof closely
follows another treatment, I give that specific source. I have tried to use a consistent notation within this
article, though that notation is necessarily not quite the same as any of these sources.
The construction of the argument here is also original in that it avoids introducing much mathematics that
we do not need; by contrast, the standard treatments in functional analysis texts typically insist on building
up through a progressive and very general set of concepts, many of which are irrelevant here and that make
it more difficult to get to the ideas we do need3. The only actual mathematical innovations in this article are
in some minor but novel notations we introduce that make the mathematics easier to use in some situations4.
The way that I present the material here is also different in structure to the approach of typical mathematics
texts. I emphasize the key points and structure of the overall argument rather than the detailed proofs. I
include all the necessary proofs, but the more involved proofs are included at the end so that those details
do not interrupt the overall logical and narrative flow. If this mathematics is new to you, I recommend
reading through the argument here up to section 11, and returning to that section later for deeper study of
the various proofs. Overall this article is by far the shortest path I know of to understanding this material.
Hopefully, too, this article may help the reader follow these more comprehensive texts [2][3][4] if the reader
needs yet more depth or an alternate treatment.
In this article, in section 3, I will introduce the necessary concepts from the analysis of convergence with
ordinary numbers. Section 4 extends these convergence ideas to functions and vectors. Section 5 introduces
Hilbert spaces, and section 6 continues by introducing key ideas for operators. Section 7 treats the
eigenfunctions and eigenvalues of the most important class of operators for our purposes (compact
Hermitian operators). Section 8 expands the concept of inner products, allowing ones with additional
specific physical meanings, and section 9 gives the extension of this algebra to singular-value
decomposition. After concluding remarks in section 10, various proofs are given in detail in section 11.
After references in section 12, an index of definitions is given in section 13.
2
Kreyszig [2] is a classic introductory text that is more comprehensive and readable than many texts in the field of
functional analysis, but it omits explicit discussion of Hilbert-Schmidt operators (though it does cover much of the
associated mathematics). Hunter and Nachtergaele [3] is an example of a more modern and relatively complete text.
Hanson and Yakovlev [4] is not so complete on the mathematics of functional analysis in itself (though it refers to
additional proofs in other sources), but includes substantial discussion of applications in electromagnetism.
3
As a result, my approach here takes up only about 10% of the corresponding length of such mathematics texts in
getting to the same point. Of course, there is much other good mathematics in those texts, but that other 90% makes
it much harder to understand just the mathematics we do need.
4
In part because we may be working with operators that map from one Hilbert space to another, and those may have
different inner products, we introduce the explicit notation of an “underlying inner product”. We also expand the use
of the Dirac notation. This is a common linear algebra notation in some fields (especially quantum mechanics), but is
not common in mathematics texts. We are also able to make full use of it, through what we call an “algebraic shift”,
which is essentially a shift from algebra using general inner products, whose algebra is more restricted, to one that is
a simple analog of matrix-algebra. Dirac notation can be regarded as just a convenient general notation for the algebra
of complex matrices.
In this section, we introduce the necessary ideas about convergence of sequences of real numbers5. To do so, we should be clear about what a sequence of
real numbers is. We should also formally introduce the ideas of norms and metrics, which are very
straightforward for real numbers. (These ideas can be applied later to other entities, such as vectors,
functions, matrices and operators.) Then we can formally define convergence of a sequence of real numbers
and give some other important definitions. Following that, we can introduce the Bolzano-Weierstrass
theorem (proved in section 11), which is a useful mathematical tool for later proofs, and is an essential
concept for understanding all the ideas of convergence.
5
The arguments here also work without change for complex numbers.
6
This default assumption of infinite length is not always clear or explicit in texts on functional analysis, which can
cause significant confusion in reading them.
7
The set of all natural numbers is usually denoted by the symbol ℕ.
8
The set of all real numbers is usually denoted by the symbol ℝ.
9
For example, multiplication of operators or matrices is not generally commutative, and division of one vector,
function or matrix by another may not have any meaning.
An (abstract) space is a set with some additional axiomatic properties, and this is what we mean
below by the term “space” in a mathematical context. A (mathematical) field is also a specific example of
a space, so we can use the term “space” to cover sets with many different kinds of added attributes.
A norm ‖x‖ expresses a notion of the “length” or “size” of an element x of the set or space, and it has to have four properties for arbitrary elements x and y:
(N1) ‖x‖ ≥ 0
(N2) ‖x‖ = 0 if and only if x = 0
(N3) ‖ax‖ = |a| ‖x‖ where a is any scalar (i.e., a real or complex number)
(N4) ‖x + y‖ ≤ ‖x‖ + ‖y‖ (the triangle inequality for norms)
(1)
For the set or space of real numbers, we choose the norm to be simply the modulus – i.e., the norm of the real number x is ‖x‖ = |x| (and we can make the same choice for the norm of the set or space of complex numbers if needed). A set or space on which we have defined a norm is called a normed space.
A metric d in a set or space expresses a notion of “distance” between two elements of the set or space, and it has to have four properties for arbitrary elements x, y, and z:
(M1) d(x, y) ≥ 0
(M2) d(x, y) = 0 if and only if x = y
(M3) d(x, y) = d(y, x)
(M4) d(x, z) ≤ d(x, y) + d(y, z) (the triangle inequality for metrics)
(2)
For the set or space of real numbers, we choose the metric
d(x, y) = |x − y| (3)
i.e., the modulus of the difference between the two numbers. A set or space on which we have defined a metric is called a metric space. A metric like this one, d(x, y), which obviously follows directly from the definition of the norm, is sometimes called the metric induced by the norm.
10
Note that in the expression x = 0 here, the “0” is the number zero, but in the general case where x may be a vector
rather than a number, the expression x = 0 has to be taken to mean that the “0” refers to the zero vector, a vector of
zero “length” or norm. This ambiguity of notation is unfortunately common in this mathematics.
Formally, a sequence (x_n) is said to converge to a limit x in the set or space if, for every real number ε > 0 (no matter how small), there is a number N (a positive integer) such that
for every n > N, d(x_n, x) < ε (4)
Defined in this way, this idea of convergence can later be applied to other kinds of spaces in which the
elements may not be real numbers but for which we have defined some metric – specifically, the elements
might later be functions, vectors, operators, or matrices. We then call x the limit of ( xn ) and we can write
either lim_{n→∞} x_n = x or the notation x_n → x; both notations are equivalent to saying that (x_n) converges to x or has limit x. (If (x_n) is not convergent, then by definition it is divergent.)
Note that x must be in the set or space if it is to be a limit in this strict sense. It is quite possible for a
sequence to converge to a limit that lies outside a set or space, though it would have to be “just” outside.
For example, if we considered the set or space of real numbers starting from (and including) 0 and
continuing up to, but not including, 1 (a set that could be written as [0,1) in one common mathematical
notation), the sequence ( xn ) with elements xn = 1 − 1 / n , i.e., with elements 0, ½, 2/3, ¾, 4/5, … and so on,
is converging towards 1 as its limit, but 1 is not itself in the set or space. In this case, the sequence, though
convergent, is not converging to a limit in the set or space (even though it is converging to a value that lies
just outside the set or space). Of course, we could easily “close” this space by adding in the element 1; the
resulting set of all these elements and the element 1 could then be called the closure of the original set.
(Note that the closure is not just the additional elements required to close the set; it is all the elements of
the original set plus those required to close it.)
One particularly important property of a set of elements is whether it is bounded. Formally, a set is bounded
if, for any choices of elements x and y in the set, the supremum of the metric d ( x, y ) is finite11. The
supremum of a set of real numbers is the smallest number that is greater than or equal to all the elements of
the set, if such an element exists. The supremum is also referred to as the least upper bound.
Often, the supremum will be the maximum element in the set, but the supremum and the maximum are not
necessarily the same; a set may have a supremum that is not in the set, whereas a maximum, if it exists,
would have to be in the set. For example, the infinite set of elements ½,3/4, 7/8, 15/16, 31/32, and so on,
has a supremum of 1, but the element 1 is not in this set, and it is not clear that there is a maximum element
in this set; for any specific element we choose that is arbitrarily close to 1, there is another element that is
even closer. We can call a sequence ( xn ) a bounded sequence if the corresponding set of points in the
sequence is a bounded set.
In discussing boundedness, we sometimes also need the complementary concept of an infimum, especially
if the numbers in question could be both positive and negative. The infimum of a set of real numbers is the
largest number that is less than or equal to all the elements of the set, if such an element exists. Similarly,
it is not necessarily the same as the minimum element in the set; for example, a minimum may not exist in
a set, as in the set with elements ½, ¼, 1/8, 1/16, and so on, which has an infimum of 0, but may have no
minimum element. (Metrics are always positive or zero, so they naturally have a lower bound of zero, so
with metrics we may not need to deal with the infimum explicitly.)
11
Note, incidentally, that this notion of boundedness only uses the “distance” between any two elements, not the
“value” of the elements themselves. This choice is slightly less restrictive formally because it means we do not need
to know the absolute size of the elements, and it is a sufficient definition for the proofs we need to construct. For the
set of real numbers between 100 and 102, the supremum of this metric would be 2, not 102.
A sequence (x_n) is said to be a Cauchy sequence12 if for every real number ε > 0 (no matter how small) there is a number N (a positive integer or natural number) such that,
for every m, n > N, d(x_m, x_n) < ε (5)
So, once we get past some point in the sequence (specifically after the Nth element), the elements are all
closer to one another than some chosen (positive) “distance” or separation ε, and no matter how small a
distance (i.e., ε) we choose, there is always some N (possibly different for each ε) for which this holds.
The distinction between a convergent sequence as in Eq. (4) and a Cauchy sequence (which is defined by the condition in Eq. (5)) essentially makes no difference for us, because we can prove that
every convergent sequence in a metric space is a Cauchy sequence (6)
See 11.1 “Proof (1) that every convergent sequence in a metric space is a Cauchy sequence” below.
A (metric) space is said to be complete if every Cauchy sequence in the space converges to a limit that is
also an element of the space13. Of course, this is not saying that in a complete metric space every sequence
is a Cauchy sequence, or even that every sequence converges, but it is saying that if every Cauchy sequence
converges in the space, then the space is “complete” (by definition). A complete space therefore has to have
all the limit points of Cauchy sequences as elements of the space, and in that sense it has to be a “closed”
space.
12
Cauchy sequences are also sometimes called fundamental sequences.
13
Unfortunately, the term “complete” is used for more than one different purpose in this field. Here we are explicitly
discussing a “complete space”. Elsewhere, we will discuss a “complete set”, which refers to a set of basis functions
or vectors that can be used in linear combinations to make up any function or vector in a given space.
14
For the notation “…”, we may use this especially when specifying elements of a set or a sequence. When we write a set in the form {x_1, x_2, …, x_n} or a sequence in the form (x_1, x_2, …, x_n), we mean that there is some finite number, n, of elements in the set or sequence, and the “…” indicates we should include all the elements, continuing in the pattern given by the first few (here two) stated. So here we should be including the elements x_3, x_4, x_5 and so on (presuming here that n > 5), which, in the case of the sequence, should be in this order. When we write {x_1, x_2, …} or (x_1, x_2, …), we similarly mean that the set or sequence should continue with the next elements in the obvious pattern, but either we are presuming the set or sequence is infinite (which will be more common) or that it might be either finite (with a number of elements we are not specifying) or infinite.
9
The Bolzano-Weierstrass theorem states that every bounded sequence (of real numbers) has at least one convergent subsequence (7)
It may help in understanding this theorem to look at some extreme cases. First, it is, of course, possible to
construct an infinitely long sequence that does not itself converge. A simple example would be an
“oscillating” sequence, such as (1, 0, 1, 0, …). But this sequence does have two obvious convergent subsequences – explicitly (1, 1, 1, …) and (0, 0, 0, …). Each of these trivially converges, the first to 1, and the second to 0.
A point or value at which an infinite subsequence converges is called an accumulation point. An accumulation point is not necessarily a limit of the original sequence, which, after all, may not even have a limit to which it converges, but it is the limit of at least one subsequence. Obviously, in our oscillating sequence, there are two accumulation points, one being 1 (the limit of the subsequence (1, 1, 1, …)) and the other being 0 (the limit of the subsequence (0, 0, 0, …)). Our original oscillating sequence has many other convergent subsequences (in fact, an infinite number) because it is only necessary that the subsequence eventually converges; the “non-converging” part of it can go on as long as we like provided the subsequence eventually does converge. So, a subsequence (1, 0, 1, 0, 0, 0, …), where all the remaining elements are 0, is also a convergent subsequence.
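As a hedged illustration (the index choices are ours, not from the text), the following Python sketch extracts the two obvious convergent subsequences of the oscillating sequence and so exhibits its two accumulation points.

```python
# Illustrative sketch: the oscillating sequence (1, 0, 1, 0, ...) does not
# converge, but its even- and odd-indexed subsequences do, giving the two
# accumulation points 1 and 0.
sequence = [1 if n % 2 == 0 else 0 for n in range(20)]  # 1, 0, 1, 0, ...

subseq_a = sequence[0::2]   # (1, 1, 1, ...) -> converges to 1
subseq_b = sequence[1::2]   # (0, 0, 0, ...) -> converges to 0

print("accumulation points:", sorted(set(subseq_a) | set(subseq_b)))  # [0, 1]
```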
One other key point to note is that, in constructing a sequence in the first place, we can repeat the same
element of the set as many times as we like (as we have done in constructing the oscillating sequences
above). So, trivially, we can always construct a convergent sequence from any (non-empty) set of real
numbers by just repeating the same number infinitely. This rather trivial kind of sequence is sometimes
used in proofs.
We give one of the standard proofs of this theorem (7) below, in 11.2 “Proof (2) of the Bolzano-Weierstrass theorem”.
our functions can be considered to be vectors. The general mathematics below will be the same for vectors
and functions. (The only substantial difference will come in the precise way we choose to define what we
will call the “inner product” below.) Because of this similarity, we can use the same notation for both.
We can formally define a vector (or function) space (also known as a linear space) as a (non-empty) set containing
vector (or function) elements such as α, β, and γ, and having two algebraic operations: vector addition and
multiplication of vectors by scalars. By scalars here we will mean complex numbers15. For the vectors and
functions we are considering, such additions of vectors and of functions (which are just point-by-point or
element-by-element additions) and multiplications of vectors or functions by scalars are both relatively
obvious, so we defer these formal points to a footnote16.
15
Technically, a vector space can be defined with respect to any specific mathematical “field”, but we will exclusively
be considering the field of complex numbers, of which real numbers are also just a special case.
16
Vector addition is an operation that is both commutative, i.e., α + β = β + α, and associative, i.e., α + (β + γ) = (α + β) + γ. To deal fully with convergence, we require that the space includes a “zero” vector. We could write such a vector as, for example, “zero” to make a distinction in our notation; however, generally both mathematicians and physical scientists are very loose here, and generally just write “0” instead for this vector, on the presumption that no-one will be confused into thinking this is a number rather than a vector. With that dubious convention, we formally have α + zero ≡ α + 0 = α for all vectors α. Also, we require that for every vector α in the space, there is a vector −α such that α + (−α) = 0 (where the “0” on the right is the zero vector, not the number zero). For all vectors α and β and all scalars a and b, multiplication by scalars, usually written in the form aα, is such that we have a(bα) = (ab)α and 1α = α (where the “1” here is the real number 1 – the multiplicative identity element in the field of complex numbers). The usefulness of this is in complicated multiplicative vector expressions (and we will define what we mean by vector multiplications later), where we note that we can move scalars essentially at will through any such products. Note too that the multiplication by scalars is effectively commutative – we can write aα = αa (even though we typically use the first of these notations). We also have the distributive laws a(α + β) = aα + aβ, α(a + b) = αa + αb, and (α + β)a = αa + βa. For the case when we are dealing with functions such as an electric field E(r) that is itself a (geometrical) vector-valued function, the addition of such functions should, of course, be (geometrical) vector addition, but such (geometrical) vector arithmetic operations obey the same formal rules as scalar functions with regard to associativity in addition and distributivity when multiplying by scalars.
\beta = \begin{bmatrix} b_1 \\ b_2 \\ \vdots \end{bmatrix} \equiv |\beta\rangle \quad \text{and} \quad \gamma = \begin{bmatrix} g_1 \\ g_2 \\ \vdots \end{bmatrix} \equiv |\gamma\rangle  (12)
we could have that the vector γ is the result of the multiplication of β by the matrix A, which we could write in any of the four equivalent ways:
\begin{bmatrix} g_1 \\ g_2 \\ \vdots \end{bmatrix} = \begin{bmatrix} a_{11} & a_{12} & \cdots \\ a_{21} & a_{22} & \cdots \\ \vdots & & \end{bmatrix} \begin{bmatrix} b_1 \\ b_2 \\ \vdots \end{bmatrix}  (13)
g_j = \sum_k a_{jk} b_k  (14)
\gamma = A\beta  (15)
|\gamma\rangle = A|\beta\rangle  (16)
where the summation form (14) is the most explicit about the actual details of the process of matrix-vector
multiplication. For matrix-vector situations, we will typically prefer the bra-ket notation (16) over the more
general notation (15).
We could, of course, have written a vector just as a special case of a matrix – a matrix with just one column
– but in our use we have quite different physical meanings for vectors and matrices. Typically, a
mathematical vector (not a geometric vector) will refer to some physical field, such as an acoustic or
electromagnetic field, and a matrix will refer to a mathematical operation we perform on such a physical
field or to some physical process such as one that generates waves from sources. As a result, it is useful for
us to make an explicit distinction between vectors and matrices in our notation.
Because we want to work with complex vectors and functions (i.e., ones with complex-numbered values),
we need a version of complex conjugation that is particularly convenient for the algebra of working with
entire vectors and matrices, and this concept is the Hermitian adjoint17. For a vector (and also for a matrix),
the Hermitian adjoint is formed by “reflecting” the vector or matrix about a “-45°” line (as in the “leading”
diagonal of the matrix, with elements a11 , a22 , ) – the operation known as “taking the transpose” – and
then taking the complex conjugate of all the elements. This operation is usually notated with a superscript
“dagger”, “†”. So, for a matrix we have
A^\dagger \equiv \begin{bmatrix} a_{11} & a_{12} & \cdots \\ a_{21} & a_{22} & \cdots \\ \vdots & & \end{bmatrix}^\dagger \equiv \begin{bmatrix} a_{11}^* & a_{21}^* & \cdots \\ a_{12}^* & a_{22}^* & \cdots \\ \vdots & & \end{bmatrix}  (17)
where the superscript “ ∗ ” denotes the complex conjugate of the number.
The Hermitian adjoint of a (column) vector is a row vector with complex conjugated elements. Because we
use this operation often with vectors in the algebra that follows, it also has its own notation, which is the
“bra” part, ⟨β|, of Dirac’s bra-ket notation. Explicitly,
\langle\beta| \equiv \begin{bmatrix} b_1 \\ b_2 \\ \vdots \end{bmatrix}^\dagger \equiv \begin{bmatrix} b_1^* & b_2^* & \cdots \end{bmatrix} \equiv \left( |\beta\rangle \right)^\dagger  (18)
Note that in general the Hermitian adjoint operation performed twice gets us back to where we started, i.e., (A†)† ≡ A, and (⟨β|)† ≡ ((|β⟩)†)† ≡ |β⟩.
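For readers who like to check such identities numerically, here is a minimal Python/NumPy sketch (the specific matrix and vector values are made up for the illustration) of the Hermitian adjoint of a matrix and of a column vector, and of the fact that taking the adjoint twice returns the original object.

```python
import numpy as np

# Illustrative sketch: Hermitian adjoint = transpose, then complex conjugate.
A = np.array([[1 + 2j, 3 - 1j],
              [0 + 1j, 2 + 0j]])
ket_beta = np.array([[1 + 1j],
                     [2 - 3j]])        # column vector |beta>

A_dagger = A.conj().T                  # the matrix adjoint A†
bra_beta = ket_beta.conj().T           # the row vector <beta|

print(np.allclose(A_dagger.conj().T, A))          # True: (A†)† = A
print(np.allclose(bra_beta.conj().T, ket_beta))   # True: (<beta|)† = |beta>
```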
17
The Hermitian adjoint is also known as the Hermitian conjugate, conjugate transpose or sometimes just the adjoint.
Before giving the formal properties of an inner product, we can give some simple examples.
This particular integral inner product is essentially also a Cartesian inner product in the limit of a sum
tending to an integral, and so we have used the Dirac notation.
An integral like Eq. (21) is also a good example of what can be called a “functional”. A mathematical
definition of a functional is a mapping from a vector or function space to a space of scalars. In other words,
a functional turns a vector or function into a number (just as an operator turns a vector or function into
(usually another) vector or function). We could view the operation in Eq. (21) as a functional acting on the
function β(x) to generate the number ⟨α|β⟩ on the left. In this article, the only functional we need is the
inner product, so from this point on, we will not discuss functionals18 in general any further19.
An even simpler example of a Cartesian inner product is the usual (geometrical) vector dot product for
geometrical vectors a and b, so we could write
18
That we are avoiding any general treatment of functionals might well be considered by some to be almost an
indictable offense in an introduction to functional analysis. Functionals generally have other important uses and were
very important in the development of this field, which largely grew out of the need to solve integral equations.
However, the most important result from the theory of functionals for our purposes is the inner product, and it is
indeed very powerful. Our omission of any more general discussion of functionals is also one of the ways we can keep
this introduction short and to the point.
19
One other important example is a Green’s function equation of the form g(x_2) = ∫ G(x_2; x_1) f(x_1) dx_1; for example, we might have a Green’s function G(x_2; x_1) ∝ exp(ik|x_2 − x_1|)/|x_2 − x_1| for a scalar wave equation, with g(x_2) being
the wave generated at x2 from the source function f ( x1 ) . With x2 viewed as a parameter, then this integral would
be a functional, with g ( x2 ) just being a number. However, we prefer to think of this as an integral operator
∫ G ( x2 ; x1 ) dx1 acting on the function f ( x1 ) to generate the new function g ( x2 ) , so we do not use the “functional”
way of looking at such an equation.
⟨a|b⟩ = a ⋅ b  (22)
One main difference in inner products compared to the geometrical vector dot product is that the general
inner product is designed for complex numbered elements, and that means that the order of the inner product
generally matters (see (IP3) below).
For all vectors α , β and γ in a vector space, and all (complex) scalars a, we define an
inner product (α , β ) , which is a (complex) scalar, through the following properties:
(IP1) (γ, α + β) ≡ (γ, α) + (γ, β)
(IP2) (γ, aα) = a(γ, α) (where aα is the vector or function in which all the values in the vector or function α are multiplied by the (complex) scalar a)
(IP3) (β, α) = (α, β)*
(IP4) (α, α) ≥ 0, with (α, α) = 0 if and only if α = 0 (the zero vector)
(23)
For the specific case of the Cartesian inner product, these criteria can be rewritten as
(IP1) (Cartesian) ⟨γ|α + β⟩ ≡ ⟨γ|(|α⟩ + |β⟩) = ⟨γ|α⟩ + ⟨γ|β⟩
(IP2) (Cartesian) ⟨γ|aα⟩ = a⟨γ|α⟩ (where aα is the vector or function in which all the values in the vector or function α are multiplied by the (complex) scalar a)
(IP3) (Cartesian) ⟨β|α⟩ = ⟨α|β⟩*
(IP4) (Cartesian) ⟨α|α⟩ ≥ 0, with ⟨α|α⟩ = 0 if and only if |α⟩ = 0 (the zero vector)
(24)
We can easily check that all of our examples above of inner products satisfy all these four criteria, either in
the form (23) or the Cartesian form (24).
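Such a check is easy to do numerically in the Cartesian case. The sketch below is an illustration only: it uses NumPy’s vdot, which conjugates its first argument and so matches ⟨α|β⟩, and verifies (IP1)-(IP4), along with the conjugate-factor behavior derived below in Eq. (25), for randomly chosen complex vectors.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative check of (IP1)-(IP4) for the Cartesian inner product
# <alpha|beta> = sum_k alpha_k* beta_k, implemented by np.vdot.
alpha = rng.standard_normal(4) + 1j * rng.standard_normal(4)
beta = rng.standard_normal(4) + 1j * rng.standard_normal(4)
gamma = rng.standard_normal(4) + 1j * rng.standard_normal(4)
a = 0.7 - 1.3j

print(np.isclose(np.vdot(gamma, alpha + beta),
                 np.vdot(gamma, alpha) + np.vdot(gamma, beta)))          # (IP1)
print(np.isclose(np.vdot(gamma, a * alpha), a * np.vdot(gamma, alpha)))  # (IP2)
print(np.isclose(np.vdot(beta, alpha), np.conj(np.vdot(alpha, beta))))   # (IP3)
print(np.vdot(alpha, alpha).real >= 0)                                   # (IP4)
# the "sesquilinear" behavior of Eq. (25): (a*gamma, alpha) = a* (gamma, alpha)
print(np.isclose(np.vdot(a * gamma, alpha), np.conj(a) * np.vdot(gamma, alpha)))
```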
Note that, in addition to (IP2) above, we can write as a consequence of these properties that
20
Note, incidentally, that the order in which we are writing the inner product here is the opposite order from that used
in most mathematics texts. This difference shows up specifically in (IP2); most mathematics texts would write
( aα , γ ) = a (α , γ ) instead. The convention in these mathematics texts is unfortunate because it is the opposite way
round from the order we find in matrix-vector multiplication (and in Dirac notation). The matrix-vector notation allows
a simple associative law without having to change the written order of the elements, whereas this conventional
mathematics notation does not. The Dirac notation follows the matrix-vector ordering (and indeed, Dirac notation is
generally a good notation for complex matrix-vector algebra). At least one modern text [3] recognizes the problems
of this historical choice in mathematics texts, and uses the notation (α , γ ) the other way round, as we do here.
(aγ, α) = (α, aγ)*   by (IP3)
         = [a(α, γ)]*   by (IP2)
         = a*(α, γ)*
         = a*(γ, α)   by (IP3)
(25)
The combination of properties ( γ , aα ) = a ( γ ,α ) and ( aγ ,α ) = a∗ ( γ ,α ) means the inner product is what
is sometimes called sesquilinear. “Sesqui” is the Latin word for “one and a half”, loosely indicating that
the inner product is only “half” linear when the factor is in front of the left vector because we then require
the complex conjugate of the multiplying factor.
where γ ( x ) is a real, positive, and non-zero function of x. This is an inner product in the sense of satisfying
all of the general criteria in (24). Such weighted inner products can occur in physical problems.
An example would be in an inner product that can give the electrostatic energy corresponding to a field
E ( r ) in a dielectric with a scalar, positive, non-zero dielectric constant ε ( r ) . Then the dielectric constant
ε ( r ) (or ε ( r ) / 2 ) would be the weighting function, and we could define the inner product (here for the
specific case of an inner product of a field with itself) as
W = (E, E) = \frac{1}{2} \iiint \varepsilon(\mathbf{r}) \, \mathbf{E}^*(\mathbf{r}) \cdot \mathbf{E}(\mathbf{r}) \, d^3\mathbf{r}  (27)
Even with the presence of the weighting function and with the (physical) vector dot product as the
multiplication, this satisfies all the criteria in (23) above, and so is a valid inner product. It is also an example
of an “energy” inner product, where the inner product of a vector or function with itself gives the energy
W in the system.
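As a hedged numerical illustration of such an energy inner product, the sketch below evaluates a one-dimensional analogue of Eq. (27) on a grid; the field and dielectric profiles here are invented purely for the example.

```python
import numpy as np

# Illustrative sketch (1-D analogue of Eq. (27), with made-up profiles):
# approximate W = (1/2) * integral eps(x) |E(x)|^2 dx by a sum on a grid.
x = np.linspace(0.0, 1.0, 2001)
dx = x[1] - x[0]
eps = 2.0 + np.cos(2 * np.pi * x)           # positive, non-zero weighting function
E = (1.0 + 0.5j) * np.sin(np.pi * x)         # complex "field" profile

W = 0.5 * np.sum(eps * np.conj(E) * E) * dx
print(W.real)                # inner product of a function with itself: real and >= 0
print(abs(W.imag) < 1e-12)   # True
```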
Below, in section 8, we show a further broad category of entities that also are inner products, though we
defer this discussion for the moment until we have introduced Hermitian operators.
which we can think of as being the “length” or the magnitude of the “amplitude” of the vector or function.
(This norm satisfies all the criteria for a norm as in (1).)
As is generally true for norms, they can be used to define a metric. So, for some vector space P in which
these vectors are elements, we can therefore define the metric
21
Since an inner product must satisfy ( β , α ) = (α , β )∗ , which is criterion (IP3) in (23), (α , α ) is guaranteed to be a
real number since it must equal its own complex conjugate.
d_P(α, β) ≡ ‖α − β‖ = √((α − β, α − β))  (29)
A (vector or function) space with an inner product defined on it can be called an inner product space. So,
all inner product spaces are normed spaces (and also, of course, metric spaces).
4.6 Convergence
Our previous mathematical arguments on convergence were nominally written for convergences of
sequences of real numbers. Because we wrote them in terms of a metric, we can, however, now extend
those same arguments and definitions, without change, to vector or function spaces; we just substitute our
new metric as in Eq. (29), and consider the elements in the space to be vectors like α and β instead of
real numbers x and y. We can therefore talk about a sequence of vectors, which we could write as (α n ) ,
and consider convergent sequences, including Cauchy sequences, of vectors, and we can have complete
vector spaces in the same sense as complete spaces of real numbers.
This is a generalization of the idea of the geometrical vector “dot” product, which is similarly used to define
orthogonality in geometrical space. Note here that we are extending that idea to allow for complex vector
“components” and for arbitrary, even possibly infinite, numbers of dimensions.
22
Furthermore, if we want to quantize the fields, it is very desirable to start with classical fields that are orthogonal in
such an energy inner product; then we can separate the Hamiltonian into a sum of Hamiltonians, one for each field
that is orthogonal to all the others, where orthogonality is defined using this inner product, and then quantize those
Hamiltonians separately.
23
In making this step to Hilbert spaces, we have “jumped over” Banach spaces. Banach spaces are complete metric (vector) spaces, but do not necessarily have an inner product. These are typically discussed at length in mathematics texts, but we have no real use for them here, so we omit them. Some of the definitions, theorems, and proofs we use in Hilbert spaces can be executed in the more general Banach spaces, but anything that is true for inner product spaces in general or for Banach spaces is also true for Hilbert spaces, because Hilbert spaces are just special cases of Banach spaces (i.e., with the explicit addition of the inner product). Similarly, a few definitions and results can be constructed in inner product spaces that are not necessarily complete, but Hilbert spaces are again just special cases of these. So here we just give all results for Hilbert spaces without calling out those that also apply to the simpler inner product or Banach spaces.
24
Technically, this assumption that the orthogonal or orthonormal sets of interest to us can be indexed with integers or natural numbers, extending possibly to an infinite number, is an assumption that the set is countable. (A countable
set is simply one whose elements can be put in one-to-one correspondence with members of the set of integers or
natural numbers; countable sets include the set of all rational numbers, but the set of real numbers, for example, is not
itself countable.) From this point on, we are technically presuming our Hilbert spaces are countable in this sense. We
could argue that we can justify such an assumption a posteriori because our resulting mathematics works in modeling
physical problems, which is in the end the only justification for using any mathematical models for the physical world.
For physical problems, such an assumption of countability is common and implicit in constructing basis sets. For
example, in working with plane wave functions in all directions (a set that is not countable if the components of the
direction vectors are taken to be arbitrary real numbers), it is common to imagine a box of finite but large size, and
count the functions that fit within the box, with “hard wall” or periodic boundary conditions at the edges of the box.
This is an ad hoc construction of a countable set of plane wave functions, and it remains countable as the size of the
box is increased towards infinity in the various directions.
(α j ,α k ) = δ jk (33)
where the Kronecker delta δ jk has the properties δ jk = 1 for j = k , but δ jk = 0 otherwise.
An important property of such sets is the notion of linear independence. To discuss this, we first formally
need to introduce the idea of linear dependence and some related terms. A linear combination of vectors β_1, …, β_m of a vector space is an expression of the form d_1β_1 + d_2β_2 + ⋯ + d_mβ_m for some set of scalars {d_1, d_2, …, d_m}.
The idea of whether a set of vectors is linearly independent has to do with whether one (or more) members
of the set can be expressed as linear combinations of the others; if this is possible, then the set is linearly
dependent; if not, the set is linearly independent. Formally, this is decided using the equation
d_1β_1 + d_2β_2 + ⋯ + d_mβ_m = 0. (We presume that there is at least one element here, i.e., m ≥ 1.) If the only set of scalars {d_1, d_2, …, d_m} for which this holds is when they are all zero, then the set of vectors {β_1, …, β_m} is linearly independent. Otherwise, there is always a way of expressing some vector in the set in terms of a linear combination of the others, and the set of vectors is linearly dependent25. Note that an orthogonal or orthonormal set is linearly independent26.
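For finite vectors, this test can be carried out numerically: the only solution of d_1β_1 + ⋯ + d_mβ_m = 0 is d_1 = ⋯ = d_m = 0 exactly when the matrix whose columns are the vectors has full column rank. A hedged sketch, with made-up example vectors:

```python
import numpy as np

# Illustrative sketch: deciding linear (in)dependence via the column rank.
b1 = np.array([1.0, 0.0, 1.0])
b2 = np.array([0.0, 1.0, 1.0])
b3 = np.array([1.0, 1.0, 2.0])    # = b1 + b2, so {b1, b2, b3} is dependent

B_independent = np.column_stack([b1, b2])
B_dependent = np.column_stack([b1, b2, b3])

print(np.linalg.matrix_rank(B_independent) == B_independent.shape[1])  # True
print(np.linalg.matrix_rank(B_dependent) == B_dependent.shape[1])      # False
```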
We can choose to have a set of vectors defined as those vectors γ that can be represented in an orthonormal set {α_1, α_2, …} by the sum
γ = a_1α_1 + a_2α_2 + ⋯ ≡ ∑_j a_j α_j  (34)
We can also call such an expression the expansion of γ in the basis α_j (i.e., in the set {α_1, α_2, …}), and the numbers a_j are called the expansion coefficients.
We can give the corresponding space of all such vectors that can be written using such an expansion the
name27 Gα. This is then the set of all vectors that can be represented using this “basis” set {α_1, α_2, …}. A
set of orthogonal vectors (and, preferably, orthonormal vectors for convenience) that can be used to
represent any vector in a space can be called an (orthogonal or orthonormal) basis for that space. Indeed,
because we have deliberately constructed this set using only linear combinations of this set of orthonormal
vectors {α_1, α_2, …}, this set is automatically a basis for the space Gα. The number of orthogonal or orthonormal functions in the basis – i.e., the number of functions in the set {α_1, α_2, …} – is called the
dimensionality of the basis set and of the corresponding space. Depending on the space, this dimensionality
could be finite or it could be infinite28. The basis set that can be used to construct any function in a given
space is said to span the space.
25
E.g., for non-zero d_m, we could write β_m = −(1/d_m)(d_1β_1 + d_2β_2 + ⋯ + d_{m−1}β_{m−1}). β_m is then being expressed as a linear combination of the other vectors.
26
To prove the linear independence of an orthogonal set formally, consider the orthogonal set of vectors {α_1, …, α_n} and consider the equation d_1α_1 + ⋯ + d_nα_n = 0 (the zero vector), with the d_j being complex numbers. Taking the inner product with any one of the elements α_j leads to (α_j, (d_1α_1 + ⋯ + d_nα_n)) = d_j(α_j, α_j) = 0 (the number zero). Since the elements α_j are by definition non-zero, this implies that every d_j is zero, which means that the set is linearly independent (no vector in this set can be made up from a linear superposition of other vectors in the set).
27
We have not yet proved that this space formed in this way by such vectors is a Hilbert space, though we can always
find such sets of orthogonal functions for a Hilbert space, as is proved later.
28
Indeed, much of our reason for setting up this formal mathematics is because we need to deal with spaces of possibly
infinite dimensionality. If we were only dealing with finite dimensionality, the mathematics can be expressed more
simply, but we need the infinite-dimensional results. The results for finite dimensionalities are then just special cases.
By definition, a basis, because it can represent any function in a given space, is also said to be a complete
set of functions for the space. (Note, incidentally that this is a different use of the word “complete” from
the idea of a complete space as defined above; this potential confusion is unfortunate, but is unavoidable in
practice because of common usage29.)
The coefficient a_j in the expansion Eq. (34) is easily extracted by forming the inner product with α_j, i.e.,
(α_j, γ) = a_j  (35)
Indeed, we can take this to be the defining equation for the expansion coefficients. Note this evaluation of
the coefficients uses whatever we have defined for the inner product of the space; the inner product in the
Hilbert space of interest need not be a Cartesian inner product.
We can now view this set of numbers {a_j} as being the representation of the function γ in the basis {α_1, α_2, …}, and we can choose to write them as a column vector
\gamma = \begin{bmatrix} a_1 \\ a_2 \\ \vdots \end{bmatrix} \equiv |\gamma\rangle  (36)
Now, a key conceptual point is that, since the function γ will typically be an actual physical function –
such as an electromagnetic field, for example – it is the same function no matter how we represent it. There
are many basis sets (in fact, usually an infinite number of possibilities30) that can be used to represent a
given function in a space, but no matter which representation we use, and therefore which specific column
of numbers we are writing in an expression like Eq. (36), the function is the same function. (Indeed, this
could be regarded as one justification why Dirac notation does not include any explicit specification of the
basis – at one level, it makes no difference what the basis is.)
We should note explicitly that the specific form of the inner product is something that goes along with the
Hilbert space and is part of the definition of the space. Indeed, it will be useful to give a name to the inner
product used in the definition of a given Hilbert space; we will call it the underlying inner product of the
Hilbert space31. Whatever basis we decide to use, its orthogonality should be set using this underlying inner
product, and the expansion coefficients should be calculated using this underlying inner product.
29
Mathematics texts sometimes use the terminology total set instead of “complete set”, but this is not common in
physical science and engineering.
30
Any linear combination of the original orthogonal or orthonormal basis sets that results in new orthogonal vectors
can be used as a basis, and there is an infinite number of such possibilities.
31
This terminology of “underlying inner product” is one that I am introducing here. It is not a standard term as far as
I am aware.
Because we can always construct a basis set in a Hilbert space, then we can always construct a mathematical
column vector as in Eq. (36) to represent an arbitrary function in the Hilbert space. This means that, once
we have constructed the expansion coefficients using the underlying inner product in the space, as in Eq.
(35), the subsequent inner products of functions represented with vectors of such expansion coefficients
can always be considered to be in the simple “row-vector times column-vector” Cartesian form as in Eq.
(20). It is this option to change subsequently to such Cartesian inner products that we are calling our
algebraic shift.
To see why this works we can formally consider an inner product of two functions η and µ in a given
Hilbert space. To start, we expand each function in an orthonormal basis set {α_1, α_2, …} for the space, obtaining
η = ∑_k r_k α_k  and  µ = ∑_k t_k α_k  (37)
where
r_k = (α_k, η)  and  t_k = (α_k, µ)  (38)
are inner products formed using the underlying inner product in the space, which might be, for example, a
weighted inner product such as an energy inner product.
Now the inner product of η and µ in this Hilbert space can be written
(\mu, \eta) = \sum_{p,q} t_p^* r_q (\alpha_p, \alpha_q) = \sum_{p,q} t_p^* \delta_{pq} r_q = \sum_p t_p^* r_p \equiv \begin{bmatrix} t_1^* & t_2^* & \cdots \end{bmatrix} \begin{bmatrix} r_1 \\ r_2 \\ \vdots \end{bmatrix}  (39)
So, once we have made the “algebraic shift” of regarding the vectors as being vectors of expansion
coefficients that have been constructed using the underlying inner product in the space, then the subsequent
mathematics of the inner products is simply the Cartesian inner product as in Eq. (20).
So, because there is always an orthonormal basis for any Hilbert space, now we can always write any vector
or function η in a Hilbert space as the “ket” η . We can consider this ket to be the column vector of
numbers
|\eta\rangle = \begin{bmatrix} (\alpha_1, \eta) \\ (\alpha_2, \eta) \\ \vdots \end{bmatrix}  (40)
With the understanding, as in Eq. (18), that we can similarly write the Hermitian adjoint of any such vector
as
\langle\eta| \equiv \left( |\eta\rangle \right)^\dagger = \begin{bmatrix} (\alpha_1, \eta) \\ (\alpha_2, \eta) \\ \vdots \end{bmatrix}^\dagger = \begin{bmatrix} (\alpha_1, \eta)^* & (\alpha_2, \eta)^* & \cdots \end{bmatrix}  (41)
then we can write the inner product of any two vectors µ and η in a given Hilbert space as
(\mu, \eta) \equiv \begin{bmatrix} (\alpha_1, \mu)^* & (\alpha_2, \mu)^* & \cdots \end{bmatrix} \begin{bmatrix} (\alpha_1, \eta) \\ (\alpha_2, \eta) \\ \vdots \end{bmatrix} \equiv \langle\mu| \, |\eta\rangle \equiv \langle\mu|\eta\rangle  (42)
So, the inner product of any two vectors or functions in the space – an inner product that must be formed
using the underlying inner product of the space – can be rewritten as a Cartesian inner product of the two
vectors consisting of the expansion coefficients on a basis, where those expansion coefficients are formed
using the underlying inner product.
Because we have now found a way of writing any inner product as a Cartesian inner product (sitting “above”
the underlying inner product in the expansion coefficients), algebraically we can now “break up” the inner
product into the simple “Cartesian” product of two vectors as in
⟨µ|η⟩ ≡ ⟨µ| |η⟩  (43)
even when the underlying inner product would not necessarily allow us to do this. This “algebraic shift”
then allows us to use the full algebraic power of vector-matrix multiplication, including associative laws
that break up the inner product as in Eq. (43). We will return to this once we have similarly considered
representing operators in a related way.
This algebraic shift also gives us a specific way of seeing vectors as “being” functions: we can write out
any function in the Hilbert space as such a mathematical column vector by performing the expansion using
the underlying inner product.
6 Linear operators
An operator is something that turns one function into another, or, equivalently, generates a second function
starting from a first function. Generally, an operator maps from functions in its domain, a space D, to
functions in its range, a space R. Here, we will consider both the domain and the range to be Hilbert spaces.
They may be the same space or they may be different spaces32.
In our case, we are specifically interested in linear operators33. With a linear operator A, we write the action
of the operator on any vector or function α in its domain D to generate a vector or function γ in its range
R as
γ = Aα (44)
The linear superposition requirement is consistent with the usual definition of scalars and linear operators:
For any two vectors or functions α and β in its domain D, and any scalar c (which here
we allow to be any complex number), an operator is a linear operator if and only if it
satisfies the two properties:
(O1) A (α + β ) = Aα + Aβ
(O2) A ( cα ) = cAα
(45)
In words, the first property, O1, says that we can calculate the effect of the operator on the sum of two
vectors or functions by calculating its effect on the two functions separately and adding the result. The
second property, O2, says that the effect of the operator on c times a vector or function is the same as c
times the effect of the operator on the function.
32
An example physical problem where the domain and range are quite different spaces is where we start with source
functions in one volume that lead to waves in another volume. Not only would the generated functions be in a different
space – actually, even a different physical volume – than the source functions; they could be built from entirely
different physical quantities. The source functions might be current densities, and the resulting waves might be electric
fields. We might therefore have quite different kinds of inner products in these two spaces. Situations like these could,
however, be handled with operators mapping between the spaces. In our mathematics, we can also formally keep track
of just what inner product is being used in each space; the underlying mathematics supports this even if it is not
commonly explicit to have different inner products in different spaces.
33
For example, we are presuming here that any physical wave systems we are considering are linear, with linear
superposition applying to waves and sources.
since ‖A‖_sup is defined as the smallest possible value of c for which Eq. (46) always works. In fact, a relation of this form, Eq. (48), is a requirement for any operator norm, so quite generally for any operator norm ‖A‖ we will require
‖Aα‖ ≤ ‖A‖ ‖α‖  (49)
and such an expression (49) is useful in later proofs. Specifically, we will show this kind of relation also holds for the Hilbert-Schmidt norm that we introduce later.
Note here that the norm ‖Aα‖ is the vector norm as in Eq. (28) (though note formally here that this is the vector norm in the range R, so it would be based on the underlying inner product in the range space). In words, this is saying that this supremum norm for the operator A is the size of the “largest” possible vector we could produce in the range when starting with a unit length vector in the domain. By “largest” here, we mean the supremum (lowest possible upper bound) on the norm of the vector produced in the range.
Note that, with the definition of an operator norm, it becomes possible to consider the convergence not only
of real numbers and vectors, but also of operators, and this will be important below.
34
In words, this notation means “the supremum of the number ‖Aα‖ / ‖α‖ for any possible choice of a non-zero vector or function α in the domain D of the operator A”.
has algebraic uses and helps us further define and extend the Dirac notation as being a particularly useful
notation for Hilbert spaces and linear operators.
Suppose, then, that we have two Hilbert spaces, H1 and H 2 ; these may be the same Hilbert space, but here
we also want to allow for the possibility that they are different. (We will need this when, for example, we
are considering “source” and “receiving” spaces for waves.) We can propose vectors η in H1 and σ and µ
in H2. We will also presume an orthonormal basis {α_1, α_2, …} in H1 and an orthonormal basis {β_1, β_2, …}
in H 2 . Both Hilbert spaces may be infinite dimensional, and so these basis sets may also be infinite.
We presume that a bounded linear operator35 A21 maps from vectors in space H1 to vectors in space H 2 ,
for example, mapping a vector η in H1 to some vector σ in H2
σ = A21η (50)
Quite generally, we could construct the (underlying) inner product between this resulting vector and an
arbitrary vector µ in H 2 . Specifically, we would have
( µ ,σ )2 ≡ ( µ , A21η )2 (51)
Note that this inner product is taken in H 2 (where we remember, as in Eq. (50), that A21η is a vector in
H 2 ) , and we have used the subscript “2” to make this clear. This inner product is in the form of the
underlying inner product in H 2 .
Note again that the forms of the underlying inner products in the two different spaces H1 and H 2 do not
have to be the same; they just both have to be legal inner products. So the inner product in space H1 might
be a non-weighted inner product useful for representing, say, current sources, and that in H 2 might be a
power or energy inner product for waves that could therefore be a weighted inner product. These possible
differences in inner products in the two spaces mean that, for the moment, we have to be careful to
keep track of what space an inner product is in36.
Now it will be useful to define what we will call a matrix element of the operator A21 . In the most general
situation which we are considering, where H1 and H 2 could be different spaces, with different basis sets,
we can define this matrix element, which is generally a complex number, as
a jk = ( β j , A21α k )2 (52)
Again, this inner product is being taken in H 2 , as indicated with the subscript “2”.
Now let us consider an expression of the form Eq. (51) again, but this time we are going to represent each
of the vectors η and µ by expanding them on their corresponding basis sets using the underlying inner
product in each space. So, we have
η = ∑_k r_k α_k  (53)
and
35
Note that the order of the subscripts on the operator A21 here is one that makes sense when we think of an operator operating on a function or vector on the “right”, in space 1, to generate a new vector or function on the “left”, in space 2 (which may be different from space 1). Indeed, for differential operators this “right to left” order is almost always implicit in the notation. Unless we invent a new notation, differential operators only operate to the right. Matrix operators can operate in either direction, but it is more conventional to think of column vectors as being the “usual” notation and row vectors as being an “adjoint” notation, in which case matrix-vector operations are also typically written in this same order.
36
Note that it is generally meaningless to try to form an inner product between a function in one Hilbert space and a
function in another Hilbert space; an inner product is a characteristic of a given Hilbert space, so we only need to put
one subscript on the inner product in Eq. (51) to indicate the space in which it is being taken.
µ = ∑_j t_j β_j  (54)
where
r_k = (α_k, η)_1  (55)
(an inner product formed using the underlying inner product in H1 ) and
t j = ( β j , µ )2 (56)
(an inner product formed using the underlying inner product in H 2 ). Then, we can rewrite Eq. (51) as
(\mu, A_{21}\eta)_2 = \left( \sum_j t_j \beta_j, \; A_{21} \sum_k r_k \alpha_k \right)_2 = \sum_{j,k} t_j^* r_k (\beta_j, A_{21}\alpha_k)_2 = \sum_{j,k} t_j^* a_{jk} r_k  (57)
Now we are in a position to make an “algebraic shift” towards a matrix-vector algebra, written in Dirac
notation. Now we algebraically regard the “bra” vector ⟨µ| as the row vector of expansion coefficients
\langle\mu| \equiv \begin{bmatrix} t_1^* & t_2^* & \cdots \end{bmatrix} \equiv \begin{bmatrix} t_1 \\ t_2 \\ \vdots \end{bmatrix}^\dagger \equiv \left( |\mu\rangle \right)^\dagger  (58)
which is equivalent to the “ket” version
|\mu\rangle \equiv \begin{bmatrix} t_1 \\ t_2 \\ \vdots \end{bmatrix}  (59)
and similarly the “ket” vector |η⟩ is regarded as the column vector of expansion coefficients
|\eta\rangle \equiv \begin{bmatrix} r_1 \\ r_2 \\ \vdots \end{bmatrix}  (60)
Once we are working with these bra and ket vectors, we can also decide to regard an operator A21 in
algebraic expressions with bra and ket vectors as the matrix
A_{21} \equiv \begin{bmatrix} a_{11} & a_{12} & \cdots \\ a_{21} & a_{22} & \cdots \\ \vdots & & \end{bmatrix}  (61)
Then the sum ∑_{j,k} t_j^* a_{jk} r_k can be interpreted as the vector-matrix-vector product
\sum_{j,k} t_j^* a_{jk} r_k \equiv \langle\mu| A_{21} |\eta\rangle  (62)
Explicitly, we note that, quite generally, from Eqs. (57) and (62)
(\mu, A_{21}\eta)_2 = \langle\mu| A_{21} |\eta\rangle \qquad (63)
The actual “underlying” operator A21 operating on a function η in H1 , as in the expression A21η inside
the underlying inner product in H 2 on the left of Eq. (63), is only specified when it is operating “to the
right”37; the expression “ µ A21 ” does not necessarily have any meaning. However, once we have made this
algebraic shift to the matrix-vector Dirac notation, the matrix-vector product ⟨µ|A_21 (which results in a row
vector) is just as meaningful as the product A_21|η⟩ (which results in a column vector). The fact that the
underlying operator A_21 possibly only operates to the right has been "hidden" inside the matrix elements
a_{jk} \equiv (\beta_j, A_{21}\alpha_k)_2 \qquad (64)
We could be criticized here for using the same notation for the matrix version of the operator and for the
underlying linear operator, but there need be no confusion; if we see an expression such as A21η , we are
dealing with the underlying operator, which possibly only operates to the right, but if we see an expression
such as ⟨µ|A_21, A_21|η⟩, or ⟨µ|A_21|η⟩, with the vectors in bra and/or ket form, then we are dealing with
the matrix version of the operator.
In most use of Dirac notation, as, for example, in quantum mechanics, it is much more typical to have the
operators map from a given Hilbert space to itself. Additionally, inner products other than a simple
Cartesian form are unusual in quantum mechanics. Hence much of the subtlety we have been setting up
here, in being careful about what inner product is in what space, and what form of inner product we are
using, is unnecessary in quantum mechanics. Here, however, because we want to get the algebraic benefits
of Dirac or matrix-vector algebra and we may well be operating between different Hilbert spaces with
different inner products in each, we needed to set up this algebra with some care. The good news is that,
with our understanding of how to use the underlying inner products in each space to evaluate expansion
coefficients, as in Eqs. (55) and (56), and matrix elements, as in Eq. (64), we can make this algebraic shift
to matrix-vector or Dirac notation and use their full power even in this more general situation.
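As a purely illustrative aside (not part of the original development), the short Python/NumPy sketch below works through this algebraic shift numerically for small finite-dimensional spaces. The dimensions, the diagonal weights defining the two inner products, and the operator A21 are all arbitrary choices of ours; the script simply checks that the underlying inner product (µ, A21 η)_2 equals the bra-matrix-ket product built from the expansion coefficients and matrix elements of Eqs. (52), (55), and (56).

# Illustrative sketch: the "algebraic shift" with different weighted inner products in H1 and H2.
# All choices (weights, sizes, operator) are arbitrary examples, not taken from the text.
import numpy as np

rng = np.random.default_rng(0)
n1, n2 = 4, 3

# Underlying (weighted) inner products: (x, y)_i = x^dagger W_i y
W1 = np.diag(rng.uniform(0.5, 2.0, n1))      # example weight for H1
W2 = np.diag(rng.uniform(0.5, 2.0, n2))      # example weight for H2
ip1 = lambda x, y: np.conj(x) @ (W1 @ y)
ip2 = lambda x, y: np.conj(x) @ (W2 @ y)

# Orthonormal bases with respect to these inner products (columns of L^{-dagger}, a Cholesky trick)
alpha = np.linalg.inv(np.linalg.cholesky(W1)).conj().T   # columns alpha_k of H1
beta  = np.linalg.inv(np.linalg.cholesky(W2)).conj().T   # columns beta_j of H2

# An arbitrary linear operator from H1 to H2, acting on plain column vectors
A21 = rng.standard_normal((n2, n1)) + 1j * rng.standard_normal((n2, n1))

# Matrix elements a_jk = (beta_j, A21 alpha_k)_2, as in Eq. (52)
a = np.array([[ip2(beta[:, j], A21 @ alpha[:, k]) for k in range(n1)]
              for j in range(n2)])

# Arbitrary vectors and their expansion coefficients, Eqs. (55) and (56)
eta = rng.standard_normal(n1) + 1j * rng.standard_normal(n1)
mu  = rng.standard_normal(n2) + 1j * rng.standard_normal(n2)
r = np.array([ip1(alpha[:, k], eta) for k in range(n1)])
t = np.array([ip2(beta[:, j], mu) for j in range(n2)])

lhs = ip2(mu, A21 @ eta)        # underlying inner product in H2
rhs = np.conj(t) @ a @ r        # "bra-matrix-ket" product, as in Eq. (62)
print(np.allclose(lhs, rhs))    # True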
We can usefully go one step further with Dirac notation here. We can also write the matrix A21 itself in
terms of bra and ket vectors. Again this is standard in other uses of Dirac notation, though, at least for the
moment, we will be explicit about what spaces the vectors are in by using "1" and "2" subscripts on the
vectors. Specifically, we can write
A_{21} \equiv \sum_{j,k} a_{jk}\, |\beta_j\rangle_2\, {}_1\langle\alpha_k| \qquad (65)
Then
{}_2\langle\mu| A_{21} |\eta\rangle_1 \equiv {}_2\langle\mu| \Big( \sum_{j,k} a_{jk}\, |\beta_j\rangle_2\, {}_1\langle\alpha_k| \Big) |\eta\rangle_1
= \sum_p t_p^*\, {}_2\langle\beta_p| \Big( \sum_{j,k} a_{jk}\, |\beta_j\rangle_2\, {}_1\langle\alpha_k| \Big) \sum_q r_q |\alpha_q\rangle_1 \qquad (66)
= \sum_p t_p^* \sum_{j,k} \delta_{pj}\, a_{jk} \sum_q \delta_{kq}\, r_q
= \sum_{j,k} t_j^* a_{jk}\, r_k
which is again the same as the result in the original equation Eq. (57), so this approach for writing matrices
works here also.
Quite generally, a form like |β_j⟩_2 ₁⟨α_k| is an outer product. In contrast to the inner product, which produces
a complex number from the multiplication in “row vector – column vector” order, and which necessarily
only involves vectors in the same Hilbert space, the outer product can be regarded as generating a matrix
from the multiplication in “column vector – row vector” order, and can involve vectors in different Hilbert
spaces. Dropping the additional subscript notation on the vectors, instead of Eq. (65) we will just write
A_{21} \equiv \sum_{j,k} a_{jk}\, |\beta_j\rangle \langle\alpha_k| \qquad (67)
37
For example, derivative operators are usually only defined as operating to the right.
Note that a linear operator like A21 from one Hilbert space to another can be written in such an outer
product form as in Eq. (65) on any desired basis sets for each Hilbert space. Of course, the numbers a jk
will be different depending on the basis sets chosen.
This statement of the operator as a matrix in Dirac notation completes our “algebraic shift”. From this point
on, we use either the notation with functions written as just Greek letters such as α and β with (underlying)
inner products (α, β), or Dirac notation with functions written as kets, such as |α⟩ and |β⟩ (or their
corresponding bra versions ⟨α| and ⟨β|) and (Cartesian) inner products written as ⟨α|β⟩ ≡ ⟨α| |β⟩.
Importantly, because the underlying inner products are always used in constructing the vectors and matrices
in the Dirac notation, the result of any such expression in both notations is the same. So, we can move
between notations depending on convenience, and we will do so below. The ability to use the associative
property of matrix-vector notation (including "breaking up" the inner product as in ⟨α|β⟩ ≡ ⟨α| |β⟩) often
results in considerable algebraic simplification.
Note that B12 is an operator that maps from Hilbert space H 2 to Hilbert space H1 . Note, too, that in this
case, the inner product on the right hand side is performed in H1 ; both B12 µ and η are vectors in H1 . Now,
similarly to Eq. (52), we will write a “matrix element” between the appropriate basis functions, called for
the moment
bkj = (α k ,B12 β j )1 (69)
Now that we have these matrix elements for B12 defined, we can make the algebraic shift to matrix-vector
algebra. We treat B12 as a matrix with matrix elements as in Eq. (69) and we write the vectors of expansion
coefficients for µ and η as in Eqs. (59) and (60), respectively. So, instead of Eq. (68) we can write
\langle\mu| A_{21} |\eta\rangle = \big( B_{12}|\mu\rangle \big)^{\dagger}\, |\eta\rangle = \langle\mu| B_{12}^{\dagger} |\eta\rangle \qquad (70)
where the “†” is the matrix and vector Hermitian adjoint operation, as discussed in section 4.3, and where
we used the known standard result for matrix-vector multiplication that
\big( C|\theta\rangle \big)^{\dagger} \equiv \big( |\theta\rangle \big)^{\dagger} C^{\dagger} = \langle\theta|\, C^{\dagger} \qquad (71)
for a matrix C and a vector θ . Now expanding the vectors on their basis sets on the right hand side of
Eq. (70), we have
\langle\mu| A_{21} |\eta\rangle = \sum_j t_j^*\, \langle\beta_j| B_{12}^{\dagger} \Big( \sum_k r_k |\alpha_k\rangle \Big) \qquad (72)
= \sum_{j,k} t_j^* r_k\, \langle\beta_j| B_{12}^{\dagger} |\alpha_k\rangle
Now, if the matrix B_12 has matrix elements b_{jk} in the jth row and kth column, then the matrix B_{12}^{\dagger} has
matrix elements b_{kj}^* in the jth row and kth column, i.e.,
\langle\beta_j| B_{12}^{\dagger} |\alpha_k\rangle = b_{kj}^* \qquad (73)
38
A confusion that could make it seem that we are assuming what we are trying to prove
From Eq. (66), the left-hand side of Eq. (74) has to equal \sum_{j,k} t_j^* a_{jk} r_k. Hence we have
\sum_{j,k} t_j^* a_{jk}\, r_k = \sum_{j,k} t_j^* b_{kj}^*\, r_k \qquad (75)
However, the vectors or functions η and µ (and hence also the sets of coefficients rk and t j ) are arbitrary
in their Hilbert spaces. So therefore we must have
bkj∗ = a jk (76)
which means that the adjoint operator B_12 is (at least in matrix form) the Hermitian adjoint of the original
operator A_21, that is,
B_{12} \equiv A_{21}^{\dagger} \qquad (77)
so we can write as a defining equation of an adjoint operator
( µ , Aη ) = ( A† µ ,η ) (78)
for any vectors η and µ in the appropriate Hilbert spaces. (Here we have dropped the subscripts for
simplicity of notation.) Note that this expression Eq. (78) can be stated for the general case of the operator
A , not just its matrix representation. Note also that we can see from this matrix form that
( A† )† = A (79)
So, henceforth, we can write the adjoint operator to A21 as simply A†21 , and our adjoint operator is simply
the Hermitian adjoint of the original operator39. Note that we have proved this even for different spaces H1
and H 2 with possibly different inner products in both spaces.
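As a further illustrative aside (with all weights and operators chosen arbitrarily by us), the NumPy sketch below checks this conclusion numerically: when the two spaces carry weighted inner products, the "raw" adjoint acting on plain column vectors is W1^{-1} A21^† W2 (our own explicit construction for this example, not a formula from the text), yet its matrix representation on the orthonormal bases is exactly the Hermitian adjoint of the matrix representation of A21, as in Eqs. (76) and (77).

# Illustrative sketch: the adjoint with respect to weighted inner products, and its matrix form.
import numpy as np

rng = np.random.default_rng(1)
n1, n2 = 4, 3
W1 = np.diag(rng.uniform(0.5, 2.0, n1))
W2 = np.diag(rng.uniform(0.5, 2.0, n2))
ip1 = lambda x, y: np.conj(x) @ (W1 @ y)
ip2 = lambda x, y: np.conj(x) @ (W2 @ y)
alpha = np.linalg.inv(np.linalg.cholesky(W1)).conj().T   # orthonormal basis of H1
beta  = np.linalg.inv(np.linalg.cholesky(W2)).conj().T   # orthonormal basis of H2

A21 = rng.standard_normal((n2, n1)) + 1j * rng.standard_normal((n2, n1))
B12 = np.linalg.inv(W1) @ A21.conj().T @ W2   # adjoint w.r.t. the weighted inner products (example construction)

# Defining property, as in Eq. (78): (mu, A21 eta)_2 = (B12 mu, eta)_1
eta = rng.standard_normal(n1) + 1j * rng.standard_normal(n1)
mu  = rng.standard_normal(n2) + 1j * rng.standard_normal(n2)
print(np.allclose(ip2(mu, A21 @ eta), ip1(B12 @ mu, eta)))   # True

# Matrix elements, Eqs. (52) and (69); the matrices satisfy b = a^dagger, Eq. (77)
a = np.array([[ip2(beta[:, j], A21 @ alpha[:, k]) for k in range(n1)] for j in range(n2)])
b = np.array([[ip1(alpha[:, k], B12 @ beta[:, j]) for j in range(n2)] for k in range(n1)])
print(np.allclose(b, a.conj().T))                            # True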
Now ⟨α_j|γ⟩ is just a complex number, so we can move it within the expression in the sum to obtain
|\gamma\rangle = \sum_j |\alpha_j\rangle \langle\alpha_j|\gamma\rangle = \Big( \sum_j |\alpha_j\rangle\langle\alpha_j| \Big) |\gamma\rangle \qquad (81)
where now we have explicitly split up the inner product into a product of a “bra” and a “ket” vector. Using
the associative properties of matrix-vector multiplications, inserting the parentheses in the expression on
39
Note that, though this adjoint operator A†21 is written with the subscripts in the order, from left to right, “2-1”, it is
an operator that maps from H 2 to H1 ; changing the order here would have created possibly more confusion.
40
We can regard the basis functions themselves as being expanded on a basis (possibly, but not necessarily, a different
basis), using the underlying inner product to calculate the expansion coefficients, just as for any other function.
the far right of Eq. (81), we now have the outer product |α_j⟩⟨α_j| appearing in the sum. In this case, we see
that the effect of the operator
I_{op} = \sum_j |\alpha_j\rangle\langle\alpha_j| \qquad (82)
is that it acts as the identity operator for all vectors |γ⟩ in this space. Note that the identity operator can be
written as such a sum of such outer products41 using any (complete) basis set in the space42, a property that
is algebraically very useful in proofs and other manipulations.
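As a small numerical aside (with an arbitrary, randomly generated orthonormal basis), the sketch below checks this completeness relation, Eq. (82), in a finite-dimensional space.

# Illustrative sketch: sum of outer products over an orthonormal basis gives the identity, Eq. (82).
import numpy as np

rng = np.random.default_rng(2)
n = 5
# A random orthonormal basis: the columns of a unitary matrix from a QR factorization
Q, _ = np.linalg.qr(rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n)))

I_op = sum(np.outer(Q[:, j], Q[:, j].conj()) for j in range(n))   # sum of |alpha_j><alpha_j|
print(np.allclose(I_op, np.eye(n)))                               # True

gamma = rng.standard_normal(n) + 1j * rng.standard_normal(n)
print(np.allclose(I_op @ gamma, gamma))                           # reproduces any vector, as in Eq. (81)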
41
Note therefore that the identity operator Iop can then be considered as a sum of these “outer product” matrices.
42
In general, different spaces have different identity operators, and so, if necessary, we can subscript the identity
operator to indicate what space it operates in.
43
Compactness is a somewhat trivial property in bounded finite dimensional spaces because all bounded finite
dimensional linear operators are compact, as we will prove below.
44
The mathematical definitions, theorems, and proofs on compact operators in this section are based on Kreyszig’s
approach [2], especially theorems 8.1-5, 8.1-4 (a), 2.5-3 (in particular, the proof of the compactness of any closed and
bounded finite dimensional normed space), and 2.4-1, though we have harmonized the notation with our approach and
avoided introducing some concepts that are not required elsewhere in our discussion.
45
In the physics of waves, the properties of compact operators are behind the notion of diffraction limits and limitations
on the number of usable channels in communications, for example.
d(\alpha_j, \alpha_k) \equiv \sqrt{ (\alpha_j - \alpha_k,\, \alpha_j - \alpha_k) } = \sqrt{ (\alpha_j, \alpha_j) + (\alpha_k, \alpha_k) - (\alpha_k, \alpha_j) - (\alpha_j, \alpha_k) } \qquad (85)
= \sqrt{1 + 1 - 0 - 0} = \sqrt{2}
(This can be visualized as the distance between the “tips” of two unit vectors that are at right angles.) So,
we can construct an infinite sequence that is just the basis vectors, each used exactly once, such as the
sequence (α_1, α_2, …). This sequence does not converge, and has no convergent subsequences46; every pair
of elements in the sequence has a "distance" between them of √2. A compact operator operating on that
infinite sequence of different basis vectors will get rid of this problem in the vectors it generates – those
will have some convergent subsequence. So, the compact operator eliminates one troubling aspect of
working with infinite dimensional spaces.
46
The same problem does not arise in finite-dimensional spaces; if we construct an infinitely long sequence made up
from just the finite number of basis vectors in the space, we will have to repeat at least one of the basis vectors an
infinite number of times, which gives us at least one convergent subsequence – the sequence consisting of just that
basis vector repeated an infinite number of times.
47
The reader may already be able to see this informally and intuitively from the above “extreme” example and the
preceding footnote46.
48
This theorem is a somewhat restated version of Theorem 8.1.5 in Kreyszig [2], and we give a version of that proof.
49
For example, essentially all the “Green’s function” operators we encounter in dealing with the physics of waves
generated by sources correspond to Hilbert-Schmidt operators.
Since the result of this sum is finite, we can give it a name and a notation, calling it51 the sum rule limit S,
subscripted if necessary to show it is associated with some specific operator. The square root of this
(necessarily non-negative) sum-rule limit S can be called the Hilbert-Schmidt norm of the operator, i.e.,
\| A \|_{HS} = \sqrt{S} \equiv \sqrt{ \sum_j \| A\alpha_j \|^2 } \qquad (88)
For any arbitrary complete basis sets { α_j } and { β_k } in H_1, starting from this definition, we can prove52
three other equivalent expressions for S, given in the three lines in the equations (89) below
S = \| A \|_{HS}^2 \equiv \sum_j \langle\alpha_j| A^{\dagger}A |\alpha_j\rangle = \sum_k \langle\beta_k| A^{\dagger}A |\beta_k\rangle
= \sum_{j,k} | a_{kj} |^2 \qquad (89)
\equiv \mathrm{Tr}\big( A^{\dagger}A \big) = \mathrm{Tr}\big( A A^{\dagger} \big)
See 11.7 “Proof (7) of equivalent statements of the Hilbert-Schmidt sum rule limit S”. Since all of these
different statements of S are equivalent, proving that any one of these versions is finite on any complete
basis is sufficient to prove an operator A is a Hilbert-Schmidt operator. We can also now explicitly prove
that the required property for any operator norm as given in relation (49) (\| A\alpha \| \leq \| A \|\, \| \alpha \|) also applies for
this Hilbert-Schmidt norm. See 11.8 “Proof (8) of the operator norm inequality for the Hilbert-Schmidt
norm”.
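As an illustrative numerical aside (using an arbitrary complex matrix and a simple Cartesian inner product), the sketch below checks the equivalent expressions for S in Eq. (89) and the norm inequality of relation (178).

# Illustrative sketch: equivalent forms of the Hilbert-Schmidt sum rule limit S, and ||A eta|| <= ||A||_HS ||eta||.
import numpy as np

rng = np.random.default_rng(3)
m, n = 6, 4
A = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))

S_cols  = sum(np.linalg.norm(A[:, j])**2 for j in range(n))   # sum_j ||A alpha_j||^2, as in Eq. (88)
S_elems = np.sum(np.abs(A)**2)                                # sum_{j,k} |a_jk|^2
S_trace = np.trace(A.conj().T @ A).real                       # Tr(A^dagger A), which also equals Tr(A A^dagger)
print(np.allclose([S_cols, S_elems], S_trace))                # True

A_HS = np.sqrt(S_trace)                                       # Hilbert-Schmidt norm
eta = rng.standard_normal(n) + 1j * rng.standard_normal(n)
print(np.linalg.norm(A @ eta) <= A_HS * np.linalg.norm(eta))  # True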
50
These Hilbert spaces can be infinite dimensional.
51
This “sum rule limit” name is one we are creating, and it not standard in the mathematics literature.
52
The Hilbert-Schmidt norm is often also written in integral form. Indeed, once we consider physical operators like
Green’s functions, this is very appropriate. Here, for the purposes of our mathematics we mostly omit that, regarding
it as a special case of forms derived from the infinite sum as in Eq. (89). If we write it out as an integral, we have to
be more specific about the form of the corresponding operator, such as a Green’s function that might be operating on
different kinds of physical spaces (e.g., 1-dimensional or 3-dimensional), and it might have some specific more
sophisticated character, including tensor or dyadic forms. For completeness, though, one specific example, for a scalar
Green's function G(r_2; r_1) giving the scalar wave at position r_2 in volume V_2 in response to a point source at position
r_1 in volume V_1, would be S = \int_{V_2} \int_{V_1} | G(r_2; r_1) |^2 \, d^3r_1\, d^3r_2. See [1] for more discussion of such physical Green's
functions. Indeed, whether a specific operator is a Hilbert-Schmidt one will often be determined by such an integral.
An important point is that, as a result, a very broad class of Green’s function operators, including those in wave
problems, are Hilbert-Schmidt operators. To justify that more fully, we need to consider the physics behind such
operators; situations with finite volumes, and where the response from a finite source is itself finite, are, however,
generally going to correspond to Hilbert-Schmidt operators [1]. It is that finiteness from the physics that allows us to
exploit the mathematics of compact operators, and especially Hilbert-Schmidt ones.
Then we can prove that the vector result Amn µ of operating with Amn on any vector µ in H1 converges to
the vector result Aµ if we take m and n to be sufficiently large. See the 11.10 “Proof (10) of approximation
of Hilbert-Schmidt operators by sufficiently large matrices” below. Hence, Hilbert-Schmidt operators can
always be approximated by sufficiently large finite matrices.
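As an illustrative aside, the following sketch mimics this truncation numerically for an arbitrary example of rapidly decaying matrix elements (our own choice, standing in for a Hilbert-Schmidt operator); the Hilbert-Schmidt norm of the neglected part shrinks as the retained matrix grows, as in Eqs. (193) and (194).

# Illustrative sketch: approximating a Hilbert-Schmidt-style operator by finite matrices.
import numpy as np

def a(j, k):
    # an example of rapidly decaying matrix elements, so that sum |a_jk|^2 converges
    return 1.0 / ((j + 1) * (k + 1)) ** 1.5

N = 400                                  # a large cutoff standing in for "infinity"
J, K = np.meshgrid(np.arange(N), np.arange(N), indexing="ij")
A_full = a(J, K)

for m in (5, 10, 20, 40):
    A_mn = A_full.copy()
    A_mn[m:, :] = 0.0                    # truncate rows beyond m
    A_mn[:, m:] = 0.0                    # and columns beyond m (keeping an m x m block)
    err = np.sqrt(np.sum(np.abs(A_full - A_mn) ** 2))   # ||A - A_mn||_HS
    print(m, err)                        # the error shrinks as m grows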
If we compare this with the definition of the adjoint operator, Eq. (68), we see that this means this operator
is equal to its own adjoint. Equivalently then, in particular if we are considering the matrix representation
of the operator on some basis,
A = A† (92)
and for the matrix elements of the operator
a jk = akj∗ (93)
An equivalent statement would therefore be that this matrix is equal to its own “conjugate transpose”.
So, (A\beta, \beta) = (A\beta, \beta)^*, which therefore requires that (A\beta, \beta) is real, and hence also (\beta, A\beta) is real. So,
quite generally,
(OE1) for a Hermitian operator ( β , Aβ ) is a real number (99)
(OE4) A non-zero eigenvalue of a compact Hermitian operator has finite multiplicity (103)
We prove this below in 11.13 “Proof (13) of finite multiplicity”. In 11.14 “Proof (14) that the eigenvalues
of Hermitian operator on an infinite dimensional space tend to zero”, we also show that
If A is a compact Hermitian operator, we can prove that the supremum norm of A can be rewritten as
\| A \| = \sup_{\|\alpha\|=1} | (\alpha, A\alpha) | \qquad (110)
We prove53 this below in 11.15 “Proof (15) of Hermitian operator supremum norm”.
53
For a similar proof, see [3], pp.198 – 199, Lemma 8.26. Our proof is not identical because we avoided requiring
some prior results used in that proof, proving some parts directly instead, and we avoided some re-use of notation.
This result, Eq. (110), is at the core of the main results we will prove for eigenvectors of Hermitian
operators. Note that, with the vector α also appearing on the left-hand side of the inner product (α , Aα ) ,
this result is saying, effectively, that the “largest” possible vector that can be produced by an operator acting
on a unit-length vector is one that lies in the same or the opposite “direction” compared to the original
vector, for some choice of that vector.
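As a numerical aside (for an arbitrary random Hermitian matrix, i.e., the finite-dimensional case), the sketch below illustrates Eq. (110): the supremum of |(α, Aα)| over unit vectors equals the operator norm, and randomly sampled unit vectors never exceed that bound.

# Illustrative sketch: sup over unit vectors of |(alpha, A alpha)| equals the operator norm for Hermitian A.
import numpy as np

rng = np.random.default_rng(4)
n = 6
M = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
A = (M + M.conj().T) / 2                       # a Hermitian matrix

op_norm = np.linalg.norm(A, 2)                 # sup of ||A alpha|| over unit alpha
eig_max = np.max(np.abs(np.linalg.eigvalsh(A)))
print(np.isclose(op_norm, eig_max))            # True: the norm is the largest |eigenvalue|

samples = []
for _ in range(2000):
    v = rng.standard_normal(n) + 1j * rng.standard_normal(n)
    v /= np.linalg.norm(v)
    samples.append(abs(np.vdot(v, A @ v)))
print(max(samples) <= op_norm + 1e-12)         # True: sampled values stay at or below the bound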
(where we will substitute the vector being operated on for the dot “ ⋅ ” when we use the operator) or, in
Dirac notation
A = \sum_{j=1}^{\infty} r_j\, |\beta_j\rangle \langle\beta_j| \qquad (113)
Here, the eigenvalues rj are whatever ones are associated with the corresponding eigenvector β j . (Note in
both Eqs (112) and (113) that, for the case of degenerate eigenvalues, we presume that we have written an
orthogonal set of eigenvectors for each such degenerate eigenvalue (which we are always free to do) and,
for indexing purposes, for a p-fold degenerate eigenvalue we simply repeat it p times in this sum, once for
each of the corresponding eigenvectors.)
This means, physically, that the eigenfunctions are essentially the “best” functions we can choose if we are
trying to maximize performance in specific ways (such as maximizing power coupling between sources
and the resulting waves), and we could even find them physically just by looking for the best such
performance.
So, we can form the inner product ( β ,α ) ≡ ( β , Aγ ) . Now, from the Hermiticity of A , we know that
( β , Aγ ) = ( Aβ , γ ) , as in Eq. (91), and by (IP3), we know that ( β , Aγ ) = ( Aγ , β )∗ . So, let us define a new
entity, which we could call54 an operator-weighted inner product,
( β , γ )A ≡ ( β , Aγ ) (116)
Hence this new entity, based on a Hermitian operator A , also satisfies the property IP3 of an inner product.
It is straightforward to show that, because A is linear, this entity also satisfies (IP1), as in
(\gamma, \alpha+\beta)_A \equiv \big( \gamma, A(\alpha+\beta) \big) = (\gamma, A\alpha + A\beta) = (\gamma, A\alpha) + (\gamma, A\beta) \qquad (118)
= (\gamma, \alpha)_A + (\gamma, \beta)_A
and (IP2), as in
(γ , aα )A ≡ (γ , Aaα ) = (γ , aAα ) = a (γ , Aα ) ≡ a (γ ,α )A (119)
As for (IP4), we already know that any entity ( β , Aβ ) is a real number, as shown in property (OE1) (Eq.
(99)). However, it is not in general true for a Hermitian operator A that ( β , Aβ ) is positive. So for ( β , γ )A
to be an inner product, we need one further restriction on A , which is that it should be a positive operator,
which by definition55 means that
( β , Aβ ) ≥ 0 (120)
54
As an explicit name, this “operator-weighted inner product” is a term we are creating here as far as we know, though
this idea is known and this name may therefore be implicitly obvious.
55
Note that there is some variation in notation in mathematics texts. Kreyszig [2] uses this definition for a positive
operator, for example, and if the “ ≥ ” sign is replaced by a “>” sign in (120), he would then call the operator positive-
definite. Others, however, such as [5], would give (120) as the definition for a non-negative operator, using “positive
operator” only if the “ ≥ ” sign is replaced by a “>” sign.
So, for any positive (linear) Hermitian operator A , we can construct an (operator-weighted) inner product
of the form given by Eq. (116). (See also [4], p. 168.) The weighted inner product as in Eq. (26) can be
viewed as a special case of this more general inner product56.
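As an illustrative aside, the short sketch below builds such an operator-weighted inner product from an arbitrary positive operator A = B†B and spot-checks the inner-product properties numerically.

# Illustrative sketch: an operator-weighted inner product (beta, gamma)_A = (beta, A gamma) with A = B^dagger B.
import numpy as np

rng = np.random.default_rng(5)
n = 5
B = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
A = B.conj().T @ B                                   # positive Hermitian, as in Eq. (121)

ip_A = lambda x, y: np.vdot(x, A @ y)                # (x, y)_A = (x, A y), as in Eq. (116)

beta  = rng.standard_normal(n) + 1j * rng.standard_normal(n)
gamma = rng.standard_normal(n) + 1j * rng.standard_normal(n)
c = 0.3 - 1.2j

print(np.isclose(ip_A(beta, gamma), np.conj(ip_A(gamma, beta))))       # (IP3) holds
print(np.isclose(ip_A(gamma, c * beta), c * ip_A(gamma, beta)))        # (IP2) holds
print(ip_A(beta, beta).real >= 0, abs(ip_A(beta, beta).imag) < 1e-12)  # (IP4): real and non-negative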
(\beta, A\beta) = (\beta, B^{\dagger}B\beta) = \big( \beta, B^{\dagger}(B\beta) \big) \qquad (122)
But, by the defining property of an adjoint operator, as in Eq. (78), and with the property (79)
( β ,B† (Bβ ) ) = (Bβ ,Bβ ) (123)
which is the inner product of a vector with itself, which is necessarily greater than or equal to zero. So for
an operator as in Eq. (121)
( β , Aβ ) ≥ 0 (124)
hence proving A = B†B is a positive operator. Hence, for any such operator, we could form an operator-
weighted inner product.
We can, however, take an additional step that opens another sub-class of inner products. Specifically, we
could define what we could call57 a transformed inner product. We can regard the operator B as
transforming58 the vector β - after all, B operating on β is just a linear transform acting on β – and we
could write generally
(\beta, \gamma)_{T_B} \equiv (B\beta, B\gamma) \qquad (125)
where our subscript notation “ T B ” indicates this inner product with respect to the transformation B of the
vectors in the inner product. Our proof above, Eqs. (122) to (124) shows that this inner product ( β , γ )TB
also satisfies (IP4).
9 Singular-value decomposition
The idea of singular-value decomposition (SVD), especially for finite matrices, is a well-known
mathematical technique for rewriting matrices. As a general approach to rewriting linear operators, it may
be less well known, but in wave problems [1][6][7] this approach can be particularly useful and physically
meaningful59.
56
A positive weight function can be viewed as just a diagonal operator with real values on the diagonal, which is also
therefore a Hermitian operator
57
This name “transformed inner product” is one we are creating here.
58
Note, incidentally, that, though transforms are often defined with unitary operators (see Eq. (140)), or ones
proportional to unitary operators, as in Fourier transforms, for example, there is no requirement that this operator B is
unitary.
59
In that case, we may want to know the SVD of the Green’s function GSR (which will be a Hilbert-Schmidt operator
and hence compact) for the wave equation of interest when mapping from specific “source” Hilbert space to a specific
“receiving” Hilbert space, for example. The resulting sets of functions will give us the “best” possible sources and
corresponding received waves, all of which will be orthogonal in their respective spaces. These will also correspond
for the set of eigenvectors60 {ψ j } (which we will choose to be normalized) in H S and the corresponding
eigenvalues c j . Then
c j (ψ j ,ψ j ) = (ψ j , A† Aψ j ) ≡ ( Aψ j , Aψ j ) ≥ 0 (127)
because in the last step in Eq. (127) we have an inner product of a vector with itself (see (IP4), Eq. (23)).
So necessarily all the eigenvalues of A^{\dagger}A (and similarly of AA^{\dagger}) are non-negative. So, we can choose to
write these eigenvalues as c_j = | s_j |^2. So, using the expansion of the form Eq. (113) for A^{\dagger}A, we have
A^{\dagger}A = \sum_{j=1}^{\infty} | s_j |^2\, |\psi_j\rangle \langle\psi_j| \qquad (128)
So,
\| A\psi_n \|^2 = \langle\psi_n| A^{\dagger}A |\psi_n\rangle = | s_n |^2 \qquad (129)
Then
\| A\psi_n \| = | s_n | \qquad (130)
So we can construct a set of functions {φn } in H R for all eigenfunctions {ψ j } corresponding to non-zero
eigenvalues, where we define
|\phi_n\rangle = \frac{1}{s_n} A |\psi_n\rangle \qquad (131)
This set of functions is, first, normalized; that is
\langle\phi_n|\phi_n\rangle = \frac{1}{s_n^* s_n} \langle\psi_n| A^{\dagger}A |\psi_n\rangle = \frac{| s_n |^2}{s_n^* s_n} = 1 \qquad (132)
and we have
\langle\phi_m|\phi_n\rangle = \frac{1}{s_m^* s_n} \langle\psi_m| A^{\dagger}A |\psi_n\rangle = \frac{1}{s_m^* s_n} \langle\psi_m| \sum_{j=1}^{\infty} | s_j |^2\, |\psi_j\rangle\langle\psi_j|\, |\psi_n\rangle
= \frac{1}{s_m^* s_n} \sum_{j=1}^{\infty} | s_j |^2\, \langle\psi_m|\psi_j\rangle \langle\psi_j|\psi_n\rangle = \frac{1}{s_m^* s_n} \sum_{j=1}^{\infty} | s_j |^2\, \langle\psi_m|\psi_j\rangle\, \delta_{jn} \qquad (133)
= \frac{s_n^* s_n}{s_m^* s_n} \langle\psi_m|\psi_n\rangle = \frac{| s_n |^2}{s_m^* s_n}\, \delta_{mn} = \delta_{mn}
to the best-coupled and orthogonal channels for communicating with waves between the volumes [6]. The SVD
approach also allows a way to synthesize arbitrary linear optical components [7].
60
Note that these eigenvectors are orthogonal, being eigenvectors of a compact Hermitian operator, and with
appropriately-chosen mutually orthogonal versions of any degenerate eigenvectors.
so this set {φn } is also orthonormal. Now suppose we consider an arbitrary function ψ in H S . Then we can
expand it in the orthonormal set {ψ j } as in Eq. (81) to obtain
|\psi\rangle = \sum_j |\psi_j\rangle \langle\psi_j|\psi\rangle \qquad (134)
A|\psi\rangle = \sum_j s_j\, |\phi_j\rangle \langle\psi_j|\psi\rangle = \Big( \sum_j s_j\, |\phi_j\rangle\langle\psi_j| \Big) |\psi\rangle \qquad (135)
which is the singular value decomposition (often abbreviated to SVD) of the operator A from a space H S
to a possibly different space H R . The numbers s j are called the singular values of the operator A .
Note, first, that we can perform this SVD for any compact operator. Second, this SVD tells us that we can
view any such compact operator A as “connecting” a set of orthogonal functions {ψ j } in H S one-by-one
to a set of orthogonal functions {φ j } in H R , with associated “connection strengths” given by the
corresponding singular values s j in each case.
A^{\dagger}A = \sum_{j,k} s_k^* s_j\, |\psi_k\rangle \langle\phi_k|\phi_j\rangle \langle\psi_j| = \sum_{j,k} \delta_{kj}\, s_k^* s_j\, |\psi_k\rangle\langle\psi_j| \qquad (137)
= \sum_j | s_j |^2\, |\psi_j\rangle\langle\psi_j|
But from Eq. (113), we see that this is just the representation of the operator on a basis of its eigenfunctions,
which are |ψ_j⟩ with eigenvalues | s_j |^2. Explicitly, we can check that these are the eigenfunctions and
eigenvalues of A^{\dagger}A:
A^{\dagger}A |\psi_n\rangle = \sum_j | s_j |^2\, |\psi_j\rangle \langle\psi_j|\psi_n\rangle = \sum_j | s_j |^2\, |\psi_j\rangle\, \delta_{jn} = | s_n |^2\, |\psi_n\rangle \qquad (138)
Similarly, the functions |φ_j⟩ are the eigenfunctions of AA^{\dagger} with the same eigenvalues | s_j |^2. Hence the
singular value decomposition can be established by solving for the eigenfunctions and eigenvalues of A^{\dagger}A
and for the eigenfunctions of AA^{\dagger}.
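As an illustrative aside, the sketch below (for an arbitrary complex matrix) carries out exactly this recipe numerically: it diagonalizes A†A, forms φ_j = Aψ_j / s_j as in Eq. (131), checks the orthonormality of Eq. (133), rebuilds A as a sum of outer products s_j |φ_j⟩⟨ψ_j|, and compares the singular values with a standard SVD routine.

# Illustrative sketch: SVD of an arbitrary matrix built from the eigen-decomposition of A^dagger A.
import numpy as np

rng = np.random.default_rng(6)
m, n = 5, 3                                   # dim(H_R) = m, dim(H_S) = n
A = rng.standard_normal((m, n)) + 1j * rng.standard_normal((m, n))

# Eigen-decomposition of A^dagger A (Hermitian, with non-negative eigenvalues |s_j|^2)
evals, psi = np.linalg.eigh(A.conj().T @ A)
s = np.sqrt(np.clip(evals, 0.0, None))

# phi_j = (1/s_j) A psi_j for the non-zero singular values, as in Eq. (131)
phi = A @ psi / s                             # generically all s_j > 0 for a random matrix
print(np.allclose(phi.conj().T @ phi, np.eye(n)))      # orthonormal, as in Eq. (133)

# Rebuild A as a sum of outer products, the SVD form discussed above
A_rebuilt = sum(s[j] * np.outer(phi[:, j], psi[:, j].conj()) for j in range(n))
print(np.allclose(A_rebuilt, A))                       # True

# Compare with numpy's SVD (which orders the singular values differently)
print(np.allclose(np.sort(np.linalg.svd(A, compute_uv=False)), np.sort(s)))  # True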
where U and V are unitary operators or matrices, that is, operators or matrices for which
U†U = Iop and V† V = Iop (140)
where Iop is the identity operator or matrix. We prove this equivalence below in 11.17 “Proof (17) of the
equivalence of Dirac and matrix forms of SVD”.
Another way of looking at this is that, if we expand |ψ_p⟩ and |φ_q⟩ on some basis { γ_j }, then the elements
of the pth row of U^{\dagger} are the elements of |ψ_p⟩, and the elements of the qth column of V are the elements
of |φ_q⟩.
The SVD is, of course, a standard decomposition for finite matrices. Note here, though, that we are also
rigorously defining the equivalent mathematics for compact operators that may be operating in or between
infinite dimensional spaces.
10 Concluding remarks
This completes our introduction to this mathematics. Obviously, the reader can proceed further, and the
various standard functional analysis texts certainly provide that. Indeed, my hope is that this introduction
can make those texts61 more accessible and hence valuable.
61
Which, mathematicians should understand, are very difficult for ordinary mortals to follow!
n runs over all the natural numbers) on a real “line”, as in Fig. 1(a); all the points xn necessarily lie between
the lower bound, a number xinf corresponding to the infimum of the set of points, and the upper bound, a
number xsup corresponding to the supremum of the set of points. Of course, the number of points we need
to mark on the line is infinite, and for graphic purposes we can only indicate some of these on the graph,
but we understand the actual number of points to be infinite.
Fig. 1. Illustration of the process, starting with a sequence (xn) of points that are marked on the line,
dividing an interval in two progressively, each time retaining an interval that has an infinite number
of points, and hence contains an infinite subsequence of the original sequence (xn).
By definition, because we have an infinitely long sequence, then within the interval I1 , which goes from
the infimum xinf to the supremum xsup , there is an infinite number of points on the line. Now let us divide
that interval in half, with a mid-point xmid1 . Our goal here is to establish a new interval, half as big as the
previous one, and still with an infinite number of points in it. There are now three possibilities: (1) there is
an infinite number of points in the interval between xinf and the mid-point xmid1 but a finite number
between xmid1 and xsup ; (2) there is an infinite number of points in the interval between xmid1 and xsup but
a finite number between xinf and the mid-point xmid1; (3) there are infinite numbers of points between xinf and
the mid-point xmid1 as well as an infinite number of points in the interval between xmid1 and xsup. In the first
case, we now choose a new interval I2 that runs from xinf to the mid-point xmid1 (which is the example
case shown in Fig. 1(b)). In the second case, we instead choose the new interval I 2 to run between xmid1
and xsup . In the third case, it does not matter which of the two new intervals we choose; we just arbitrarily
choose one or the other; our goal is to show there is at least one convergent subsequence, so either one of
these intervals would be suitable (it is not a problem if there are two convergent subsequences). The interval
we are left with contains an (infinitely long) subsequence of the original sequence. (On whatever interval
we end up choosing, we should choose it to include its end points so that we do not end up with a sequence
that converges to a limit that lies “just” outside the interval).
Now we keep repeating this process, as illustrated in Fig. 1(c) and Fig. 1(d) for example successive
intervals, dividing the interval in two each time, choosing a (or the) part with an infinite number of points
within it, and continuing this process. As a result, we end up with an arbitrarily small interval that
nonetheless contains a subsequence with an infinite number of points. Thus we can see we are establishing
a convergent subsequence.
So, formally, after the choice of the jth interval, we have an (infinitely long) subsequence y jm (where m
runs over all the natural numbers) of the original sequence xn . (Note that the elements of y jm are all
elements of the original sequence xn , and are in the same relative order as they were in xn .) The size of
this interval is \Delta y_j = ( x_{sup} - x_{inf} ) / 2^{\,j-1} and all of its elements lie within this range (or on the edge of it).
So, for any ε, no matter how small, there is always some sufficiently large choice of j such that \Delta y_j < \varepsilon.
Then, for our standard metric for real numbers s and t, that is, d(s, t) = | s - t |, we have, for any elements
y_jp and y_jq, where p and q are any members of the set of natural numbers,
d( y_{jp}, y_{jq} ) = | y_{jp} - y_{jq} | < \varepsilon \qquad (141)
If we choose an x that lies in the range between y_jp and y_jq (inclusive of the end points), then we can say
for any ε, no matter how small, there is an x such that
d( y_{jp}, x ) = | y_{jp} - x | < \varepsilon \qquad (142)
Hence, there is a convergent subsequence of the original sequence ( xn ) that approaches arbitrarily closely
to some limit x, and so we formally have proved that for any bounded sequence ( xn ) there is a convergent
subsequence, proving the theorem as required.
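As a purely illustrative aside, the short sketch below mimics this bisection argument on an arbitrary bounded numerical sequence; with only finitely many terms available, keeping the more populous half-interval at each step stands in for "the half that contains an infinite number of points".

# Illustrative sketch: extracting a convergent subsequence from a bounded sequence by bisection.
import numpy as np

rng = np.random.default_rng(7)
x = np.sin(np.arange(10000) * 1.7) + 0.1 * rng.standard_normal(10000)  # an arbitrary bounded sequence
lo, hi = x.min(), x.max()
indices = np.arange(len(x))             # indices of the current (sub)sequence, kept in original order

for _ in range(10):
    mid = 0.5 * (lo + hi)
    left = indices[x[indices] <= mid]   # terms in the lower half-interval
    right = indices[x[indices] > mid]   # terms in the upper half-interval
    if len(left) >= len(right):         # keep a half with "many" terms
        indices, hi = left, mid
    else:
        indices, lo = right, mid

print(len(indices), hi - lo)            # many surviving terms, all within a tiny interval
print(x[indices[:5]])                   # the first few terms of the (nearly) convergent subsequence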
where β 2 is some non-zero vector orthogonal to α1 . To see that β 2 is orthogonal, we can form the inner
product
(\alpha_1, \gamma_2) = (\alpha_1, \gamma_2)(\alpha_1, \alpha_1) + (\alpha_1, \beta_2) = (\alpha_1, \gamma_2) + (\alpha_1, \beta_2) \qquad (145)
so
(α1 , β 2 ) = 0 (146)
proving the orthogonality. Now, therefore, we can form a second element of our basis set using a normalized
version of β_2, specifically
\alpha_2 = \frac{\beta_2}{\sqrt{(\beta_2, \beta_2)}} \qquad (147)
To construct the third element, we choose a γ 3 that cannot already be represented as a linear combination
of α1 and α 2 , leaving an orthogonal vector β3 as in
\gamma_3 = \sum_{j=1}^{2} (\alpha_j, \gamma_3)\, \alpha_j + \beta_3 \qquad (148)
Generally, we can keep going like this, with
\gamma_m = \sum_{j=1}^{m-1} (\alpha_j, \gamma_m)\, \alpha_j + \beta_m \qquad (149)
and choosing
\alpha_m = \frac{\beta_m}{\sqrt{(\beta_m, \beta_m)}} \qquad (150)
In this process, if our basis set is not complete, as proved by the fact that it cannot represent some vector,
then we just add in a normalized version (i.e., α m ) of the orthogonal “remainder” vector β m as the
necessary new element in our basis set.
Of course, if we had a space of finite dimensionality, this process would truncate at some point once we
could no longer find any vector in the space that could not be expressed as a linear combination of the basis
vectors we had found so far, and we would have found our basis set. For an infinite dimensional space, we
can just keep going, and so, inductively, we can create an orthonormal basis set to represent any function
in such a Hilbert space.
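As an illustrative aside, the sketch below implements this Gram-Schmidt construction numerically for an arbitrary set of candidate vectors; candidates that are already representable by the basis built so far are simply skipped, just as in the discussion above.

# Illustrative sketch: Gram-Schmidt orthonormalization, following Eqs. (149) and (150).
import numpy as np

def gram_schmidt(candidates, tol=1e-12):
    """Return an orthonormal set built from the given candidate vectors."""
    basis = []
    for gamma in candidates:
        beta = gamma - sum(np.vdot(a, gamma) * a for a in basis)  # remove components along existing basis
        norm = np.sqrt(np.vdot(beta, beta).real)
        if norm > tol:                    # skip candidates already representable by the basis so far
            basis.append(beta / norm)
    return np.array(basis)

rng = np.random.default_rng(8)
candidates = rng.standard_normal((6, 4)) + 1j * rng.standard_normal((6, 4))  # 6 vectors in C^4
basis = gram_schmidt(candidates)
print(basis.shape)                                        # (4, 4): only 4 independent directions in C^4
print(np.allclose(basis @ basis.conj().T, np.eye(4)))     # the rows are orthonormal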
62
This is one of those proofs that takes some space to write down, but that actually has very little in it.
also precompact. A precompact space is one whose closure is compact. The closure of a compact space,
which is already closed, is the same compact space.)
Before proving this theorem, we can note that, loosely, it is indicating that there are limits to how small a
vector can be if it is made up out of linearly independent vectors that are large. The proof proceeds as
follows.
We can write s = | a_1 | + \cdots + | a_n |, where s ≥ 0 since the modulus of any complex number is greater than or
equal to zero. The only way we can have s = 0 is for all the a_j to be zero, in which case (151) holds for
any c. So to complete the proof, we now consider all the other possibilities, for which necessarily s > 0.
Then we can rewrite (151) as
\| b_1\alpha_1 + \cdots + b_n\alpha_n \| \geq c \qquad (152)
where b_j = a_j / s and, necessarily, \sum_{j=1}^{n} | b_j | = 1. Hence, it is enough now to prove the existence of a c > 0
such that (152) holds for every collection of n scalars b_1, \ldots, b_n (complex numbers) with \sum_{j=1}^{n} | b_j | = 1.
Now we proceed by a reductio ad absurdum proof, starting by assuming that the statement is false, i.e., that
there is a set or sets of such scalars for which c is not greater than zero. To start this argument, we choose
some (infinitely long) sequence ( β m ) of vectors, each of which can be written
\beta_m = b_1^{(m)}\alpha_1 + \cdots + b_n^{(m)}\alpha_n \qquad (153)
with
\sum_{j=1}^{n} | b_j^{(m)} | = 1 \qquad (154)
(the coefficients b_j^{(m)} can be different for each such vector) and we require that this sequence is such that
\| \beta_m \| \to 0 as m \to \infty.
Now we reason in what is sometimes called a "diagonal argument". Since \sum_{j=1}^{n} | b_j^{(m)} | = 1, we know that for
every coefficient b_j^{(m)} in any of the vectors β_m in the sequence (β_m)
| b_j^{(m)} | \leq 1 \qquad (155)
Since we have a sequence of vectors (β_m), we can if we want construct a sequence of the values of the jth
coefficient in each vector. Hence for each chosen j, we have a sequence of coefficients (a sequence of
scalars, not of vectors)
\big( b_j^{(m)} \big) = \big( b_j^{(1)}, b_j^{(2)}, \ldots \big) \qquad (156)
63
This particular proof follows Kreyszig [2], Lemma 2.4-1.
If we imagined that we wrote out all the n coefficients b_1^{(1)}, \ldots, b_n^{(1)} of the first vector β_1 as a horizontal
row, and then wrote the coefficients b_1^{(2)}, \ldots, b_n^{(2)} of the second vector β_2 on a second horizontal row
beneath it, and so on, as in
\begin{matrix} b_1^{(1)} & \cdots & b_n^{(1)} \\ b_1^{(2)} & \cdots & b_n^{(2)} \\ \vdots & & \vdots \end{matrix}
then this sequence \big( b_j^{(m)} \big) = \big( b_j^{(1)}, b_j^{(2)}, \ldots \big) would be one vertical column.
Note that, as usual with sequences, this sequence is infinitely long, and we know from (155) that this is a
bounded sequence. Now we specifically choose the sequence ( b1( m ) ) (i.e., the first column). So, from the
Bolzano-Weierstrass theorem, the sequence ( b1( m ) ) has a convergent subsequence, with some limit b1 .
Now we take the subsequence of vectors that corresponds to those with their first coefficient as this
subsequence of ( b1( m ) ) , with that first coefficient still limiting to b1 .
From that subsequence of vectors, we can choose a “sub-subsequence” (which is just another subsequence)
in which the second coefficient similarly limits to some number b2 . (The existence and convergence of this
(sub)subsequence is similarly guaranteed by the Bolzano-Weierstrass theorem.) We use this argument
progressively a total of n times, with each "column" converging to a corresponding limit b_j, by which time
we are left with a (sub)sequence of vectors that we can call ( γ k ) , a subsequence of the original sequence
( β m ) . The individual vectors in the sequence (γ k ) are of the form
\gamma_k = \sum_{j=1}^{n} g_j^{(k)} \alpha_j \qquad (157)
and we have
\sum_{j=1}^{n} | g_j^{(k)} | = 1 \qquad (158)
because the coefficients g (jk ) , j = 1, , n for each k are just the coefficients b(j m ) , j = 1, , n for some vector
β m in the original sequence of vectors; we have just been choosing a subsequence from that original
sequence, and each γ k is just some β m in the original sequence. This sequence ( γ k ) converges to the vector
\gamma = \sum_{j=1}^{n} b_j \alpha_j \qquad (159)
and, since \sum_{j=1}^{n} | b_j | = 1 in the limit (from Eq. (158)), the b_j cannot all be zero. Since the original vectors {α_1, …, α_n} were by choice linearly independent,
then, with coefficients b j that are not all zero, the vector γ cannot be the zero vector. We have found a
convergent subsequence of the original sequence ( β m ) that does not converge to the zero vector. But this
contradicts the original assumption that we could construct such a sequence of vectors that converges to
zero, as required to allow the non-negative number c to be zero. Hence, by reductio ad absurdum, c > 0 ,
and we have proved the theorem.
where c > 0. Hence the infinitely long sequence of numbers ( b_j^{(m)} ) for some fixed j is bounded, and by the
Bolzano-Weierstrass theorem, it must have an accumulation point g_j. By a similar "diagonal" argument
as in the proof above of (151), we conclude that the infinitely long sequence (β_m) has an infinitely long
subsequence (γ_k) that converges to a vector \gamma = \sum_{j=1}^{n} b_j \alpha_j. Since the (sub)space M is closed, this vector
γ must be in the space M. Hence we have proved that the arbitrary (infinitely long) sequence ( β m ) has a
subsequence that converges in M. Hence M is compact, proving the theorem as in (161).
64
This is one half of the Theorem 2.5-3 in Kreyszig [2], and we follow that proof.
65
This is Theorem 8.1-4(a) in Kreyszig [2], and we give an expanded version of that proof.
dimensionality space, it is therefore compact by (161). To close it, we just had to add the corresponding
limiting vectors, and so the operator A is generating a precompact space when acting on bounded vectors,
and it is therefore compact by the definition (83).
This proof uses a “diagonal argument”, which we introduced first above in the proof 11.5.1 “A theorem on
linear combinations” (151) in 11.5 “Proof (5) of compactness of operators with finite dimensional range”.
In this way, we will show that for any bounded sequence ( β m ) in F, the “image” sequence ( Aβ m ) in H
has a convergent subsequence, and hence by the condition (84) for compactness of an operator, the operator
A is compact.
So, we proceed as follows. Since A1 (the first operator in the sequence ( Am ) ) is compact, then it maps
bounded sequences ( β m ) in F to sequences ( A1β m ) that have a convergent subsequence in H. We notate
that subsequence as ( A1γ 1,m ) for some corresponding sequence ( γ 1,m ) in F that is a subsequence of ( β m )
. Now a sequence that is convergent in a metric is also automatically a Cauchy sequence (see (6) above), a
property we will use later, so instead of just saying that we have a convergent subsequence, we will say the
subsequence is Cauchy. So, the subsequence ( A1γ 1,m ) is Cauchy. Now we can proceed in a “diagonal
argument” fashion. Similarly, since the operator A2 is compact (and indeed all the operators An are
compact by choice) we can find a subsequence of ( γ 1,m ) , which we will call ( γ 2,m ) for which the sequence
( A2γ 2,m ) is Cauchy. Continuing in this fashion, we see that the “diagonal sequence” (ηq ) = (γ q,m ) (where
q is a natural number) is a subsequence of ( β m ) such that, for every n ≤ q , ( Anη q ) is Cauchy. Now, by
choice ( β m ) is bounded, and hence (η q ) is bounded, say, η q ≤ c for some positive real c, for all q.
Having established these Cauchy sequences by this diagonal method, we can now proceed to use the
presumed operator convergence ( \| A_n - A \| \to 0 as n \to \infty ) together with this Cauchy property.
Because \| A_n - A \| \to 0, there is an n = p such that \| A - A_p \| < δ for any positive δ we choose. Specifically,
we will choose to write δ = ε / 3c for some positive number ε. Since ( A_n η_q ) is Cauchy for every q ≥ n,
then there is a (natural number) u ≥ p such that
\| A_p\eta_j - A_p\eta_k \| < \frac{\varepsilon}{3} \quad \text{for all } j, k \geq u. \qquad (165)
Now, suppose we have four vectors µ, κ, ρ, and ζ in a normed vector space. Then we could write
µ −ζ = µ −κ +κ − ρ + ρ −ζ (166)
So, by the triangle inequality for norms (property N4 in (1)), we could write
\| \mu - \zeta \| \leq \| \mu - \kappa \| + \| (\kappa - \rho) + (\rho - \zeta) \| \leq \| \mu - \kappa \| + \| \kappa - \rho \| + \| \rho - \zeta \| \qquad (167)
So, similarly, we can write for j , k ≥ u
\| A\eta_j - A\eta_k \| \leq \| A\eta_j - A_p\eta_j \| + \| A_p\eta_j - A_p\eta_k \| + \| A_p\eta_k - A\eta_k \|
\leq \| A - A_p \|\, \| \eta_j \| + \frac{\varepsilon}{3} + \| A_p - A \|\, \| \eta_k \| \qquad (168)
< \frac{\varepsilon}{3c}\, c + \frac{\varepsilon}{3} + \frac{\varepsilon}{3c}\, c = \varepsilon
This shows that ( Aη q ) is Cauchy and converges since H is complete (being a Hilbert space). Hence, finally,
for an arbitrary bounded sequence ( β m ) in F, the sequence ( Aβ m ) has a convergent subsequence in H,
and hence by the condition (84) for compactness of an operator, the operator A is compact.
in a Hilbert space H_1 with a complete basis {α_1, α_2, …} to generate vectors in a Hilbert space H_2. Note
first that the norm \| A\alpha_j \| is a vector norm in Hilbert space H_2, and so \| A\alpha_j \|^2 can be written as an inner
product in that space. Specifically,
\| A\alpha_j \|^2 \equiv (\gamma_j, \gamma_j) \qquad (169)
(\gamma_j, \gamma_j) = (\gamma_j, A\alpha_j) = \big( |\gamma_j\rangle \big)^{\dagger} A|\alpha_j\rangle = \big( A|\alpha_j\rangle \big)^{\dagger} A|\alpha_j\rangle = \langle\alpha_j| A^{\dagger}A |\alpha_j\rangle \qquad (170)
where |\gamma_j\rangle = A|\alpha_j\rangle.
Hence, the sum-rule limit can be rewritten as
S = \sum_j \langle\alpha_j| A^{\dagger}A |\alpha_j\rangle \equiv \mathrm{Tr}\big( A^{\dagger}A \big) \qquad (171)
where the notation on the right, Tr ( A† A ) , is a shorthand for the trace of the matrix, the trace being the sum
of the diagonal elements of a matrix.
We can now prove three standard equivalences about Eq. (171), all of which are proved by introducing
and/or eliminating versions of the identity operator or matrix for the space (as in Eq. (82)).
First, the trace of any matrix is independent of the (complete) basis used to represent it, so the result S from
Eq. (171) is the same no matter what the complete basis { α j } is. This is a standard result, but we give the
proof here for completeness. We consider a second complete basis { |β_k⟩ } on the space, so we have the
identity operator, which we can write on this basis as I_{op} = \sum_k |\beta_k\rangle\langle\beta_k| or on the { |α_j⟩ } basis as
I_{op} = \sum_j |\alpha_j\rangle\langle\alpha_j|. So starting from the trace of an operator or matrix B expressed on the { |α_j⟩ } basis, we
proceed, introducing I_{op} twice (with different summation indices), moving round complex numbers (inner
products) and eliminating an identity operator, i.e.,
\mathrm{Tr}(B) = \sum_j \langle\alpha_j| B |\alpha_j\rangle = \sum_j \langle\alpha_j| I_{op} B I_{op} |\alpha_j\rangle = \sum_{j,k,p} \langle\alpha_j| \big( |\beta_k\rangle\langle\beta_k| B |\beta_p\rangle\langle\beta_p| \big) |\alpha_j\rangle
= \sum_{j,k,p} \langle\alpha_j|\beta_k\rangle \langle\beta_k| B |\beta_p\rangle \langle\beta_p|\alpha_j\rangle = \sum_{j,k,p} \langle\beta_k| B |\beta_p\rangle \langle\beta_p|\alpha_j\rangle \langle\alpha_j|\beta_k\rangle \qquad (172)
= \sum_{k,p} \langle\beta_k| B |\beta_p\rangle \langle\beta_p| \Big( \sum_j |\alpha_j\rangle\langle\alpha_j| \Big) |\beta_k\rangle = \sum_{k,p} \langle\beta_k| B |\beta_p\rangle \langle\beta_p| I_{op} |\beta_k\rangle
= \sum_{k,p} \langle\beta_k| B |\beta_p\rangle \langle\beta_p|\beta_k\rangle = \sum_{k,p} \langle\beta_k| B |\beta_p\rangle\, \delta_{pk} = \sum_k \langle\beta_k| B |\beta_k\rangle
Hence we have proved that the trace of an operator or matrix is independent of the basis used. Applying
this to the result Eq. (171) allows us therefore to conclude that we get the same answer for S independent
of the (complete) basis used to evaluate it.
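As a small numerical aside (with an arbitrary matrix and an arbitrary second orthonormal basis of our own choosing), the sketch below checks this basis-independence of the trace.

# Illustrative sketch: the trace is the same in any orthonormal basis, as in Eq. (172).
import numpy as np

rng = np.random.default_rng(9)
n = 5
B = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))

# Two orthonormal bases: the standard one, and the columns of a random unitary from QR
U, _ = np.linalg.qr(rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n)))

tr_standard = sum(np.vdot(np.eye(n)[:, j], B @ np.eye(n)[:, j]) for j in range(n))
tr_rotated  = sum(np.vdot(U[:, k], B @ U[:, k]) for k in range(n))
print(np.isclose(tr_standard, tr_rotated), np.isclose(tr_rotated, np.trace(B)))  # True True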
Second, introducing an identity operator I_{op} = \sum_j |\alpha_j\rangle\langle\alpha_j| inside the sum and using associativity of matrix-
vector multiplication, we can show
S = \sum_j \langle\alpha_j| A^{\dagger} I_{op} A |\alpha_j\rangle = \sum_j \langle\alpha_j| A^{\dagger} \Big( \sum_k |\alpha_k\rangle\langle\alpha_k| \Big) A |\alpha_j\rangle
= \sum_{j,k} \langle\alpha_j| A^{\dagger} |\alpha_k\rangle \langle\alpha_k| A |\alpha_j\rangle \qquad (173)
= \sum_{j,k} a_{kj}^*\, a_{kj} = \sum_{j,k} | a_{kj} |^2
which is the sum of the modulus squared of all the matrix elements.
Third, starting from the middle line in Eq. (173),
S = \sum_{j,k} \langle\alpha_j| A^{\dagger} |\alpha_k\rangle \langle\alpha_k| A |\alpha_j\rangle
= \sum_{j,k} \langle\alpha_k| A |\alpha_j\rangle \langle\alpha_j| A^{\dagger} |\alpha_k\rangle = \sum_k \langle\alpha_k| A \Big( \sum_j |\alpha_j\rangle\langle\alpha_j| \Big) A^{\dagger} |\alpha_k\rangle \qquad (174)
= \sum_k \langle\alpha_k| A I_{op} A^{\dagger} |\alpha_k\rangle = \sum_k \langle\alpha_k| A A^{\dagger} |\alpha_k\rangle = \mathrm{Tr}\big( A A^{\dagger} \big)
These three equivalences, Eqs. (172), (173), and (174) are the ones we set out to prove.
11.8 Proof (8) of the operator norm inequality for the Hilbert-Schmidt
norm
We can write an arbitrary function η in H1 on this basis {α1 ,α 2 ,}
\eta = \sum_p h_p\, \alpha_p \qquad (175)
So, with a basis {β_1, β_2, …} in H_2 we can write for an operator A expanded as in Eq. (67)
A|\eta\rangle = \sum_p h_p A|\alpha_p\rangle = \sum_{j,k,p} a_{jk} h_p\, |\beta_j\rangle \langle\alpha_k|\alpha_p\rangle = \sum_{j,k} a_{jk} h_k\, |\beta_j\rangle \qquad (176)
so that
\| A\eta \|^2 = \sum_j \Big( \sum_q a_{jq} h_q \Big)^{\!*} \Big( \sum_k a_{jk} h_k \Big) = \sum_j \Big| \sum_k a_{jk} h_k \Big|^2 \qquad (177)
\leq \sum_j \Big( \sum_k | a_{jk} |^2 \Big) \Big( \sum_m | h_m |^2 \Big) = \sum_{j,k} | a_{jk} |^2 \sum_m | h_m |^2 = \| A \|_{HS}^2\, \| \eta \|^2
(We have used the Cauchy-Schwarz inequality Eq. (208) in going from the second to third line – see 11.12
"Proof (12) of Cauchy-Schwarz inequality" below.) So, finally, we have proved, as required, that, for the
Hilbert-Schmidt norm,
\| A\eta \| \leq \| A \|_{HS}\, \| \eta \| \qquad (178)
First, let us define a vector β_1 that is a normalized version of the vector µ, i.e.
\beta_1 = \frac{\mu}{\| \mu \|} \qquad (180)
Quite generally, then, for the vector norm based on the inner product, it is straightforward that
\| A\mu \| = \| A\beta_1 \|\, \| \mu \|. So now to prove Eq. (179), we need to prove that
\| A\beta_1 \| \leq \| A \|_{HS} \qquad (181)
or equivalently
\| A\beta_1 \|^2 \equiv \langle\beta_1| A^{\dagger}A |\beta_1\rangle \leq \| A \|_{HS}^2 \qquad (182)
Now, we are free to choose β_1 to be the first element of an orthogonal set that forms a basis for this Hilbert
space H of interest. So we have from Eq. (89)
\| A \|_{HS}^2 = \sum_k \langle\beta_k| A^{\dagger}A |\beta_k\rangle \geq \langle\beta_1| A^{\dagger}A |\beta_1\rangle \qquad (183)
because all the elements \langle\beta_k| A^{\dagger}A |\beta_k\rangle in the sum over k are greater than or equal to zero, being inner
products of the vector A|β_k⟩ with itself. Hence
\| A \|_{HS}^2 \geq \langle\beta_1| A^{\dagger}A |\beta_1\rangle \equiv \| A\beta_1 \|^2 \qquad (184)
as in Eq. (65) (which we can always do). Next, we now consider another operator A_n in which we
truncate the sum over one of the indices so that its range has finite dimensionality, that is,
A_n = \sum_{j=1}^{n} \sum_k a_{jk}\, |\beta_j\rangle \langle\alpha_k| \qquad (186)
We note immediately that such an operator is compact, as proved above in 11.5 “Proof (5) of compactness
of operators with finite dimensional range”. Now consider the operator A − An , which, from Eqs. (185) and
(186), we can write as
A - A_n = \sum_{j=n+1}^{\infty} \sum_k a_{jk}\, |\beta_j\rangle \langle\alpha_k| \qquad (187)
\| A - A_n \| \to 0 \text{ as } n \to \infty \qquad (189)
Hence from the theorem (86), since A is then the limit of a sequence of compact operators, A is also
compact. Hence we have proved our result that all Hilbert-Schmidt operators are compact.
For some arbitrary (finite) vector µ in H1 , consider the vector η (in H 2 ) that is the difference between
the vectors Aµ and Amn µ , i.e.,
\eta = A\mu - A_{mn}\mu = ( A - A_{mn} )\, \mu \qquad (191)
Then
\| \eta \| = \| ( A - A_{mn} )\mu \| \leq \| A - A_{mn} \|_{HS}\, \| \mu \| \qquad (192)
where we have used the result Eq. (179) proved above in 11.9 “Proof (9) of compactness of Hilbert-Schmidt
operators”. So, from Eq. (89),
\| A - A_{mn} \|_{HS}^2 = \sum_{j=m+1}^{\infty} \sum_{k=n+1}^{\infty} | a_{jk} |^2 \qquad (193)
\| A - A_{mn} \|_{HS} \to 0 \text{ as } m, n \to \infty \qquad (194)
and so \| \eta \| \to 0 as m and n tend to infinity. Because this difference vector vanishes in this limit, we have
proved that we can approximate any Hilbert-Schmidt operator arbitrarily well by a sufficiently large matrix.
space H_1 to generate functions or vectors in a Hilbert space H_2. Then from Eq. (89) we know \sum_{j,k} | a_{kj} |^2
is bounded (indeed, this boundedness is a necessary and sufficient condition for the corresponding operator
to be Hilbert-Schmidt). For such matrix elements a_{kj} of the operator A, we know from Eq. (76) that
the matrix elements of the operator A^{\dagger} are b_{jk} = a_{kj}^*. Hence \sum_{j,k} | b_{jk} |^2 = \sum_{j,k} | a_{kj} |^2, which is therefore also
bounded, and so A^{\dagger} is also a Hilbert-Schmidt operator (and therefore is also compact).
Given that \sum_{j,k} | a_{kj} |^2 is bounded, then any matrix element a_{kj} is also bounded, so for some sufficiently
large positive real number c
| a_{kj} | \leq c \qquad (195)
Now, by definition, and using Eq. (186) to represent A, we can similarly write
A^{\dagger} = \sum_{j,k} a_{jk}^*\, |\alpha_k\rangle \langle\beta_j| \qquad (196)
and so
A^{\dagger}A = \sum_{j,k,p,q} a_{pq}^*\, |\alpha_q\rangle\, \delta_{pj}\, a_{jk}\, \langle\alpha_k| = \sum_{k,q} \Big( \sum_j a_{jq}^* a_{jk} \Big) |\alpha_q\rangle \langle\alpha_k| \qquad (197)
So,
\| A^{\dagger}A \|_{HS}^2 \equiv \sum_{k,q} \Big| \sum_j a_{jq}^* a_{jk} \Big|^2 \qquad (198)
Now
\Big| \sum_j a_{jq}^* a_{jk} \Big|^2 \leq \Big( \sum_j | a_{jq}^* a_{jk} | \Big)^2 \leq \sum_j | a_{jq} |^2 \sum_p | a_{pk} |^2 \qquad (199)
where we used the Cauchy-Schwarz inequality for the last step. (We prove this inequality below in 11.12
“Proof (12) of Cauchy-Schwarz inequality”.) So
\| A^{\dagger}A \|_{HS}^2 \equiv \sum_{k,q} \Big| \sum_j a_{jq}^* a_{jk} \Big|^2 \leq \sum_{k,q} \Big( \sum_j | a_{jq} |^2 \Big) \Big( \sum_p | a_{pk} |^2 \Big) = \sum_{k,p} \Big( \sum_{q,j} | a_{jq} |^2 \Big) | a_{pk} |^2 \qquad (200)
= \sum_{k,p} \| A \|_{HS}^2\, | a_{pk} |^2 = \| A \|_{HS}^2 \sum_{k,p} | a_{pk} |^2 = \| A \|_{HS}^2\, \| A \|_{HS}^2 = \| A \|_{HS}^4
So
\| A^{\dagger}A \|_{HS} \leq \| A \|_{HS}^2 \qquad (201)
Because A by choice is a Hilbert-Schmidt operator, it has a finite Hilbert-Schmidt norm \| A \|_{HS}, and so
\| A \|_{HS}^2 is also finite. Hence \| A^{\dagger}A \|_{HS} is finite, and so A^{\dagger}A is a Hilbert-Schmidt operator, and hence also is
compact. Since the operator AA^{\dagger} is just the Hermitian adjoint of the operator A^{\dagger}A, it also is a Hilbert-
Schmidt operator and is also compact.
|\beta\rangle = \begin{bmatrix} b_1 \\ b_2 \\ \vdots \end{bmatrix} \quad \text{and} \quad |\gamma\rangle = \begin{bmatrix} g_1 \\ g_2 \\ \vdots \end{bmatrix} \qquad (202)
We presume these vectors are non-zero (the resulting theorem is trivial if either of them is zero). We also
presume these two vectors are not proportional to one another (i.e., they are not in the same "direction") -
again, the resulting theorem is trivially obvious if they are. Now we define a third vector
|\eta\rangle = |\beta\rangle - \frac{\langle\gamma|\beta\rangle}{\langle\gamma|\gamma\rangle}\, |\gamma\rangle \qquad (203)
Here, the notation ⟨γ|β⟩ is just signifying that we are taking the simple Cartesian inner product of these
vectors. For η to be zero, we would require β ∝ γ , which by assumption is not the case, so η is non-
zero.
Now we can write
\langle\gamma|\eta\rangle = \langle\gamma|\beta\rangle - \frac{\langle\gamma|\beta\rangle}{\langle\gamma|\gamma\rangle}\, \langle\gamma|\gamma\rangle = \langle\gamma|\beta\rangle - \langle\gamma|\beta\rangle = 0 \qquad (204)
where we have used the inner product properties (IP1) and (IP2). (Since both γ and η are non-zero, for
such an inner product to be zero, η is necessarily orthogonal to γ .)
Rewriting Eq. (203) gives
|\beta\rangle = |\eta\rangle + \frac{\langle\gamma|\beta\rangle}{\langle\gamma|\gamma\rangle}\, |\gamma\rangle \qquad (205)
so
\| \beta \|^2 \equiv \langle\beta|\beta\rangle = \frac{| \langle\gamma|\beta\rangle |^2}{\langle\gamma|\gamma\rangle^2}\, \langle\gamma|\gamma\rangle + \langle\eta|\eta\rangle + \frac{\langle\gamma|\beta\rangle}{\langle\gamma|\gamma\rangle}\, \langle\eta|\gamma\rangle + \frac{\langle\gamma|\beta\rangle^*}{\langle\gamma|\gamma\rangle}\, \langle\gamma|\eta\rangle \qquad (206)
= \frac{| \langle\gamma|\beta\rangle |^2}{\langle\gamma|\gamma\rangle^2}\, \langle\gamma|\gamma\rangle + \langle\eta|\eta\rangle \geq \frac{| \langle\gamma|\beta\rangle |^2}{\langle\gamma|\gamma\rangle^2}\, \langle\gamma|\gamma\rangle = \frac{| \langle\gamma|\beta\rangle |^2}{\langle\gamma|\gamma\rangle} = \frac{| \langle\gamma|\beta\rangle |^2}{\| \gamma \|^2}
where we used Eq. (204) to eliminate the two right-most terms on the top line. So
| \langle\gamma|\beta\rangle | \leq \| \beta \|\, \| \gamma \| \qquad (207)
Eqs. (207) and (208) are each forms of the Cauchy-Schwarz inequality, which we had set out to prove.
multiplicity is infinite, so {β j } is an infinite set that is mapped, vector by vector, to the set {cβ j } , all of
which vectors have the same finite non-zero norm and all of which are orthogonal. So, A maps the infinite
sequence ( β j ) to the infinite sequence ( cβ j ) . But, because the vectors β j are orthonormal by definition,
the sequence ( cβ_j ) has no convergent subsequence; the metric66 d( cβ_n, cβ_m ) = \sqrt{2}\, | c | for any choice of
two different values of n and m. This contradicts the requirement for a compact operator ((84)) that it should
map any sequence of finite vectors to a sequence with a convergent subsequence. Hence:
For any non-zero eigenvalue c of a compact Hermitian operator, the multiplicity of the
eigenvalue is finite.
(209)
(Note that this is equivalent to a statement u = \sup_{\|\beta\|=1} | (\beta, A\beta) |; the name of the vector being used to find the
supremum is arbitrary, and we will need this flexibility below.)
Using the Cauchy-Schwarz inequality, as in Eq. (207), and noting that \| \alpha \| = 1 by choice, we note, then, that
| (\alpha, A\alpha) | \leq \| A\alpha \|\, \| \alpha \| = \| A\alpha \| \qquad (212)
66
Remember that the distance between the tips of two orthogonal unit vectors is \sqrt{2}.
So,
u = \sup_{\|\alpha\|=1} | (\alpha, A\alpha) | \leq \sup_{\|\alpha\|=1} \| A\alpha \| = \| A \| \qquad (213)
(where the last step on the right is just the definition of \| A \|), completing the first half of the proof.
For the second half of the proof, note first that we can prove, for any vector η in H, and considering all
vectors γ in H with unit norm, i.e., \| \gamma \| = 1,
\| \eta \| = \sup_{\|\gamma\|=1} | (\gamma, \eta) | \qquad (214)
To prove this statement, Eq. (214), note first that, by the Cauchy-Schwarz inequality, Eq. (207),
| (\gamma, \eta) | \leq \| \gamma \|\, \| \eta \| = \| \eta \| \qquad (215)
while, for the particular choice \gamma = \eta / \| \eta \|,
(\gamma, \eta) = (\eta, \eta) / \| \eta \| = \| \eta \| \qquad (216)
since \| \eta \| = \sqrt{(\eta, \eta)} by definition, which shows there is at least one choice of γ for which \| \eta \| = (\gamma, \eta).
Hence, taking this result together with \| \eta \| \geq | (\gamma, \eta) | from (215) proves Eq. (214).
So, with the definition Eq. (109), \| A \| = \sup_{\|\alpha\|=1} \| A\alpha \|, and choosing η = Aα in Eq. (214),
\| A \| = \sup_{\|\alpha\|=1} \| A\alpha \| = \sup_{\|\alpha\|=1} \Big( \sup_{\|\gamma\|=1} | (\gamma, A\alpha) | \Big) \equiv \sup_{\|\alpha\|=1,\, \|\gamma\|=1} | (\gamma, A\alpha) | \qquad (217)
Next we need to derive an inequality for | (\gamma, A\alpha) |. The first step is to prove an algebraic equivalence, and
we start by choosing a vector \mu = \exp(-is)\gamma, where the real number s is chosen so that (\mu, A\alpha) is real. (We
are always free to do this, and such a number s can always be found.) We note that
(\mu, A\alpha) = \big( \exp(-is)\gamma,\, A\alpha \big) = \exp(is)\, (\gamma, A\alpha) = | (\gamma, A\alpha) | \qquad (218)
and
\| \mu \| = \sqrt{(\mu, \mu)} = \sqrt{\big( \exp(-is)\gamma,\, \exp(-is)\gamma \big)} = \sqrt{(\gamma, \gamma)} = \| \gamma \| \qquad (219)
= 2 \big[ (\mu, A\alpha) + (\mu, A\alpha)^* \big] \quad \text{by (IP3)}
= 4\, \mathrm{Re}\, (\mu, A\alpha)
= 4\, (\mu, A\alpha) \quad \text{by the chosen reality of this inner product}
So, from Eqs. (218) and (220)
| (\gamma, A\alpha) |^2 = \frac{1}{16} \Big[ \big( \alpha+\mu, A(\alpha+\mu) \big) - \big( \alpha-\mu, A(\alpha-\mu) \big) \Big]^2 \qquad (221)
by the triangle inequality for complex numbers. Writing a normalized vector \beta = \phi / \| \phi \|, and using the
Cauchy-Schwarz inequality and the definition of u, Eq. (211),
| (\phi, A\phi) | = \| \phi \|^2\, | (\beta, A\beta) | \leq \| \phi \|^2 \sup_{\|\beta\|=1} | (\beta, A\beta) | = \| \phi \|^2\, u \qquad (224)
and similarly
| (\psi, A\psi) | \leq \| \psi \|^2\, u \qquad (225)
So, using these results (224) and (225) in (223) and substituting that result into Eq. (222), we have
| (\gamma, A\alpha) |^2 \leq \frac{u^2}{16} \Big[ \| \phi \|^2 + \| \psi \|^2 \Big]^2
= \frac{u^2}{16} \Big[ (\alpha+\mu, \alpha+\mu) + (\alpha-\mu, \alpha-\mu) \Big]^2 \equiv \frac{u^2}{16} \Big[ \| \alpha+\mu \|^2 + \| \alpha-\mu \|^2 \Big]^2
= \frac{u^2}{16} \Big[ 2(\alpha, \alpha) + 2(\mu, \mu) \Big]^2 \equiv \frac{u^2}{16} \Big[ 2\| \alpha \|^2 + 2\| \mu \|^2 \Big]^2 \qquad (226)
\equiv \frac{u^2}{4} \Big[ \| \alpha \|^2 + \| \mu \|^2 \Big]^2 = \frac{u^2}{4} \Big[ \| \alpha \|^2 + \| \gamma \|^2 \Big]^2 = \frac{u^2}{4} \big[ 1 + 1 \big]^2
= u^2
In the last step we used the fact that both \| \alpha \| and \| \gamma \| are 1, by choice in this proof. (The equivalence
\| \alpha+\mu \|^2 + \| \alpha-\mu \|^2 = 2\| \alpha \|^2 + 2\| \mu \|^2 that is proved in the middle of these steps is sometimes called the
\| A \| \leq u \qquad (228)
Since we have proved both that \| A \| \leq u (Eq. (228)) and that \| A \| \geq u (Eq. (213)), we conclude that
\| A \| = u = \sup_{\|\alpha\|=1} | (\alpha, A\alpha) |, which is the statement, Eq. (110), that we set out to prove.
67
Our proof here is similar to that in [3], theorem 9.16, pp. 225 – 227, though our version is expanded. The overall
structure of this proof is standard, and similar proofs are found in many other sources.
\| A \| = \lim_{n \to \infty} | (\alpha_n, A\alpha_n) | \qquad (229)
Note that for this infinitely long sequence (α m ) , because it is a subsequence of (α n ) , we must still have,
as in Eq. (230)
\lim_{m \to \infty} (\alpha_m, A\alpha_m) = r \qquad (233)
Next, we will prove that γ is an eigenvector of A , with eigenvalue r. First, we note that γ is not the zero
vector because then, from Eq. (230), we would have r = 0, and that cannot be because r = \| A \| or r = -\| A \|
and by presumption \| A \| \neq 0. Now, if and only if γ is an eigenvector with eigenvalue r, then Aγ = rγ (Eq.
(100), (OE1)), so, formally, with the identity operator Iop for the space H,
( A - rI_{op} )\, \gamma = 0 \ \text{(the zero vector)} \qquad (234)
= \lim_{m \to \infty} \| A ( A - rI_{op} )\alpha_m \|^2
\leq \| A \|^2 \lim_{m \to \infty} \big[ ( A\alpha_m, A\alpha_m ) + r^2 (\alpha_m, \alpha_m) - r (\alpha_m, A\alpha_m) - r ( A\alpha_m, \alpha_m ) \big] \qquad (235)
= \| A \|^2 \lim_{m \to \infty} \big[ \| A\alpha_m \|^2 + r^2 \| \alpha_m \|^2 - 2r (\alpha_m, A\alpha_m) \big]
\leq \| A \|^2 \lim_{m \to \infty} \big[ \| A \|^2 \| \alpha_m \|^2 + r^2 \| \alpha_m \|^2 - 2r (\alpha_m, A\alpha_m) \big]
= \| A \|^2 \big[ r^2 + r^2 - 2r^2 \big]
= 0
where we have used \| A\alpha \| \leq \| A \|\, \| \alpha \| (Eq. (49)), (\alpha_m, A\alpha_m) = ( A\alpha_m, \alpha_m ) by Hermiticity, and \| A \|^2 = r^2 from
Hence, γ is indeed an eigenvector, which proves r is an eigenvalue. Note that this argument proves that
one or other or possibly both of A or − A is an eigenvalue of A , but at least one of them is.
(Here in the mathematical notation ( β_j, ⋅ ) means that, when the operator acts on a vector, we substitute that
vector for the dot "⋅" in this inner product expression. This is generally much clearer in Dirac notation,
where we would write Eq. (236) as A_2 = A_1 - \sum_{j=1}^{m_1} r_1\, |\beta_j\rangle \langle\beta_j|, though we will retain the mathematical
notation in this proof.)
Now, A2 acting on any vector on H, only generates vectors that are orthogonal to all the β j , which means
that any eigenvectors of A2 are also orthogonal to these {β j } and the associated eigenvalues must all be
different from r1 . So now we can repeat the process we used with A1 to find now a (largest magnitude)
eigenvalue r_2 of A_2 with an associated set of m_2 orthogonal vectors {β_{m_1+1}, …, β_{m_1+m_2}}. Note that
necessarily | r_2 | = \| A_2 \| \leq | r_1 |; it is possible that, if both +\| A_1 \| and -\| A_1 \| were eigenvalues of A_1, r_2 is now
the "other" one of those, and hence is of equal magnitude to r_1. Otherwise, it must be some (real) number
of smaller magnitude. Hence, we have now found a second set of eigenfunctions {β m1 +1 , , β m1 + m2 } , all
orthogonal to the first set {β1 , , β m1 } and with a different eigenvalue r2 . We proceed similarly, with
A_{n+1} = A_n - \sum_{j=k_n}^{k_n+m_n} r_n\, \beta_j ( \beta_j, \cdot ) \qquad (237)
preceding eigenvectors, and varying θ to give the largest possible value of the inner product (θ , Aθ ) , and
the resulting vector would be the “next” eigenvector, with an associated eigenvalue equal to the resulting
maximized value of (θ , Aθ ) . Though it might be unlikely that we would in practice use such a variational
technique for calculations, this point is conceptually and physically important in establishing eigenvectors
as the ones that maximize the inner product (θ , Aθ ) . We state this formally above as (114).
If we make a notational change to write the eigenvalues with the same index j as used for the eigenvectors,
with the understanding that the eigenvalue rj is whatever one is associated with the eigenvector β j , then
we can concatenate all the expressions of the form of Eq. (237) to give
A_{n+1} = A - \sum_{j=1}^{k_{n+1}} r_j\, \beta_j ( \beta_j, \cdot ) \qquad (240)
and so
\lim_{n \to \infty} \Big\| A - \sum_{j=1}^{k_{n+1}} r_j\, \beta_j ( \beta_j, \cdot ) \Big\| = \lim_{n \to \infty} | r_{n+1} | \qquad (243)
But we know that the eigenvalues of a compact operator must tend to zero as n → ∞ (Eq. (104) (OE5) as
proved above in 11.14 “Proof (14) that the eigenvalues of Hermitian operator on an infinite dimensional
space tend to zero”), and so we have proved that
A = \sum_{j=1}^{\infty} r_j\, \beta_j ( \beta_j, \cdot ) \qquad (244)
where the sum converges in the operator norm. We restate this representation of the operator A above as
(112). This also proves that the set of eigenfunctions of a compact operator are complete for describing the
effect of the operator on any vector. We state this formally above as (111).
If all the eigenvalues are non-zero, then the set will be complete for the Hilbert space H. If not, then we can
extend the set by Gram-Schmidt orthogonalization to complete it.
11.17 Proof (17) of the equivalence of Dirac and matrix forms of SVD
Formally, we can write a matrix that is diagonal on some basis {γ j } as
D_{diag} = \sum_j s_j\, |\gamma_j\rangle \langle\gamma_j| \qquad (245)
where s j are the diagonal elements, and we can define two matrices
U = \sum_p |\psi_p\rangle \langle\gamma_p| \quad \text{and} \quad V = \sum_q |\phi_q\rangle \langle\gamma_q| \qquad (246)
U^{\dagger}U = \Big( \sum_p |\psi_p\rangle \langle\gamma_p| \Big)^{\!\dagger} \Big( \sum_q |\psi_q\rangle \langle\gamma_q| \Big) = \sum_{p,q} |\gamma_p\rangle \langle\psi_p|\psi_q\rangle \langle\gamma_q| \qquad (247)
= \sum_{p,q} |\gamma_p\rangle\, \delta_{pq}\, \langle\gamma_q| = \sum_p |\gamma_p\rangle \langle\gamma_p| = I_{op}
A = V D_{diag} U^{\dagger} = \Big( \sum_q |\phi_q\rangle \langle\gamma_q| \Big) \Big( \sum_j s_j\, |\gamma_j\rangle \langle\gamma_j| \Big) \Big( \sum_p |\gamma_p\rangle \langle\psi_p| \Big) \qquad (248)
= \sum_{q,j,p} |\phi_q\rangle\, \delta_{qj}\, s_j\, \delta_{jp}\, \langle\psi_p| = \sum_j s_j\, |\phi_j\rangle \langle\psi_j|
12 References
[1] D. A. B. Miller, “Waves, modes, communications, and optics,” arXiv:1904.05427 [physics.optics]
[2] E. Kreyszig, Introductory Functional Analysis with Applications (Wiley, 1978)
[3] J. K. Hunter and B. Nachtergaele, Applied Analysis (World Scientific, 2001)
[4] G. W. Hanson and A. B. Yakovlev, Operator Theory for Electromagnetics (Springer, 2002)
[5] D. Porter and D. S. G. Stirling, Integral Equations: A Practical Treatment, from Spectral Theory to Applications
(Cambridge, 1990)
[6] D. A. B. Miller, “Communicating with Waves Between Volumes – Evaluating Orthogonal Spatial Channels and
Limits on Coupling Strengths,” Appl. Opt. 39, 1681–1699 (2000). doi: 10.1364/AO.39.001681
[7] D. A. B. Miller, “Self-configuring universal linear optical component,” Photon. Res. 1, 1-15 (2013) doi:
10.1364/PRJ.1.000001
13 Index of definitions