Advanced Calculus: Lecture Notes For MA 440/540 & 441/541 2015/16
Advanced Calculus: Lecture Notes For MA 440/540 & 441/541 2015/16
Advanced Calculus: Lecture Notes For MA 440/540 & 441/541 2015/16
Rudi Weikard
logHxL
x
1 2 3 4 5
-1
-2
Chapter 4. Continuity 27
4.1. Limits of functions 27
4.2. Continuous functions 28
4.3. The intermediate value theorem and some of its consequences 29
4.4. Uniform convergence and continuity 29
Chapter 5. Differentiation 33
5.1. Derivatives 33
5.2. The mean value theorem and Taylor’s theorem 35
5.3. Uniform convergence and differentiation 36
Chapter 6. Integration 39
6.1. Existence and uniqueness of integrals 39
6.2. Properties of integrals 41
6.3. The fundamental theorem of calculus 42
6.4. Integration of piecewise continuous functions 42
6.5. Uniform convergence and integration 43
(5) During class the instructor has the final decision on determining whether an ar-
gument may stand or not. His verdict may still be challenged after a proof is
“published” (see rule (6)).
(6) If someone other than an author discovers a flaw in a “published” proof, he or she
will get the opportunity to explain the mistake and present a correct proof for a
total of 20 points.
(7) While presenting proofs you may only refer to those axioms and theorems in the
notes which occur before the one you are working on, to published proofs of such
theorems, to the definitions, and to the appendix.
(8) You must give credit where credit is due, i.e., during your presentation you must
declare the points at which you had help and by whom.
(9) It is also possible to report joint work. In such a case 4 points will be earned for
the presentation while the other 6 are evenly distributed among the collaborators.
(10) The successor of a presenter will be chosen as the student with the smallest number
of points among the volunteers taking into account the modification by rules (3),
(11) and (12). A die is rolled, if necessary.
(11) You may volunteer for a particular problem by an e-mail to me. This (in the order
received) establishes priority among volunteers with the same number of points.
(12) For a student who has earned a challenge reward 20 points will be subtracted from
his or her current score for the purpose of determining a successor. At the time
such a student is selected to prove a theorem the challenge reward expires.
(13) Class attendance and participation is required. Absences from class are recorded
in Canvas. After 10 absences from class 10 points will be subtracted from your
class score and the count of absences is set again to zero.
(14) There will be no partial credit except as described above to share credit.
0.3. Hints
• As there is no partial credit, be well prepared to answer whatever questions may
arise.
• There is no need to take notes, final proofs will be “published”. Think along
instead.
• Try to earn points early.
• The index at the end of these notes may lead you to some of the necessary defini-
tions (let me know of any omissions).
• Often a proof will be fairly simple once one realizes that a previous result or a
previous method of proof can be used. Thus constant participation in class, even
when it’s other people’s turn to present, is highly advised! You should go over the
proofs again at home and see if they are still clear to you, possibly rewriting them
with more details.
• Mathematical reasoning takes time. You may expect some frustration — without
it there would never be a sense of accomplishment. Plan to spend a lot of time
thinking about a problem before writing down a final solution.
• Exercises are for private study. Points are only earned for proofs of theorems.
• More difficult problems are marked with a ∗.
• See me if you need help!
0.4. THE LANGUAGE OF MATHEMATICS 3
1http://www.math.uab.edu/~rudi/teaching/algebra.pdf
CHAPTER 1
We will introduce the real numbers R by a series of axioms, namely the field axioms, the
order axioms, and the least upper bound axiom. This means we will be sure of the precise
properties of the real numbers. Of course, you probably already have some intuition as to
what real numbers are, and these axioms are not meant to substitute for that intuition.
However, when writing your proofs, you should make sure that all your statements follow
from these axioms or their consequences proved previously. You will be amazed about the
rich world being created — by you — from these axioms.
Exercise 1. Show that one may define binary operations in the set {0, 1} which turn
it into a field. (This field is called Z2 .)
Theorem 1. The additive identity in R is unique.
Exercise 2. Is −0 an additive identity?
Theorem 2. Every real number x has a unique additive inverse.
The unique additive inverse of x ∈ R is called the negative of x and is denoted by −x.
For simplicity we will usually write x − y in place of x + (−y).
Theorem 3. x + y = x + z if and only if y = z, assuming that x, y, z ∈ R.
Theorem 4. Suppose x, y ∈ R. Then the following two statements hold.
(1) If x + y = x then y = 0.
(2) If x + y = 0 then y = −x.
Theorem 5. −(−x) = x for all x ∈ R.
Theorem 6. There is a unique multiplicative identity in R and every non-zero real
number x has a unique multiplicative inverse.
The unique multiplicative inverse of x ∈ R \ {0} is called the reciprocal of x and is
denoted by x−1 . We will also use the notation x1 and 1/x in place of x−1 and we will usually
write xy or x/y for xy −1 .
Theorem 7. If x, y, z ∈ R and x 6= 0, then the following statements hold.
(1) xy = xz if and only if y = z.
(2) If xy = x then y = 1.
(3) If xy = 1 then y = x−1 .
(4) (x−1 )−1 = x.
Theorem 8. For every x ∈ R we have 0x = 0.
Theorem 9. 0 does not have a reciprocal and neither is it the reciprocal of any number.
Theorem 10. If x and y are non-zero real numbers, then xy 6= 0.
Theorem 11. Let x, y ∈ R. Then
(1) (−1)x = −x,
(2) (−x)y = −(xy) = x(−y), and
(3) (−x)(−y) = xy.
Theorem 12. Let a, b ∈ R and x, y ∈ R \ {0}. Then a/x + b/y = (ay + bx)/(xy).
(O1): −P ∩ P = ∅,
(O2): −P ∪ {0} ∪ P = R, and
(O3): If a, b ∈ P then a + b ∈ P and ab ∈ P.
We mention in passing that the rational numbers also form an ordered field.
Theorem 14. 0 is neither in P nor in −P.
Exercise 4. Show that the field Z2 in Exercise 1 can not be ordered.
Definition 2. The elements in P are called positive and those in −P are called negative.
The non-negative numbers are those in (−P)c = P ∪ {0} while the non-positive numbers are
those in Pc = −P ∪ {0}.
Definition 3. Let a and b be real numbers. We say that a < b or, equivalently, b > a
if b − a ∈ P. We say that a ≤ b or, equivalently, b ≥ a if b − a ∈ P ∪ {0}.
Theorem 15. 1 > 0.
Theorem 16. Suppose a, b ∈ R. Then a ≤ b if and only if a < b or a = b.
Theorem 17. Let x, y ∈ R. Then either x ≤ y or y ≤ x (or both).
Theorem 18. Let x, y ∈ R. If x ≤ y and y ≤ x then x = y.
Theorem 19. For any two real numbers x and y exactly one of the following three
statements is true: x < y, x = y, or x > y.
Theorem 20. Let x, y, z ∈ R. If x < y and y < z, then x < z.
Theorem 21. Let x, y, z ∈ R. Then x + y < x + z if and only if y < z.
Theorem 22. Let x, y, z ∈ R.
(1) If 0 < x and y < z, then xy < xz.
(2) If x < 0 and y < z, then xz < xy.
Theorem 23. If x, y ∈ R and 0 < x < y, then 0 < y −1 < x−1 .
These consequences of the order axioms show that one can represent the real numbers
on the familiar number line. The relationship x < y is to be interpreted as “x lies to the
left of y”. It is often helpful to think along these lines when trying to devise proofs. Here
2 = 1 + 1 and 3 = 2 + 1.
-2 -1 - 12 0 1 1
3 2
1 2 3
Exercise 5. Identify the theorems which guarantee the ordering indicated in this sketch
of the number line.
Theorem 24. If x, y ∈ R and x < y, then there exists a number z ∈ R such that
x < z < y.
This shows that any field satisfying the order axioms will have many elements.
Definition 4. Let a, b be real numbers. Each of the following types of subsets of R
is called a finite interval : (a, b) = {x ∈ R : a < x < b}, [a, b) = {x ∈ R : a ≤ x < b},
(a, b] = {x ∈ R : a < x ≤ b}, and [a, b] = {x ∈ R : a ≤ x ≤ b}. The following sets
8 1. THE REAL NUMBERS
Definition 9 (Factorial). For natural numbers n one defines the number n! (pro-
nounced ‘n factorial’) by
Yn
n! = k.
k=1
In particular, 1! = 1, 2! = 2, and 3! = 2 · 3 = 6. One also defines 0! = 1 (get used to it).
1.3. THE INDUCTION PRINCIPLE 9
Qn
Definition 10 (Powers). If x ∈ R and n ∈ N, then the number k=1 x is called the
n-th power of x. It is denoted by xn . We also define x0 = 1 and, for x 6= 0, x−n = (x−1 )n .
In particular 00 = 1 (get used to that, too).
Theorem 31. For any n ∈ N
n
X n
X
cak = c ak
k=1 k=1
A sum like the one on the right of the previous equation is called a telescoping sum.
Theorem 34. If m and n are natural numbers, then so are n + m and nm.
Theorem 35. The sum of the first n natural numbers is n(n + 1)/2, i.e.,
n
X 1
k= n(n + 1)
2
k=1
for all n ∈ N.
Theorem 36. Let a be real number different from 1 and n a natural number. Then
n
X 1 − an+1
a0 + ak = .
1−a
k=1
In the situation of the previous theorem m/n is called the representation of q in lowest
terms. A negative rational number q also has a representation in lowest terms, namely
−m/n, if m/n is the representation in lowest terms of −q > 0.
Is there are real number whose square is 2? It will turn out that introducing the real
numbers allows to handle this deficiency. In fact, that’s why real numbers were invented.
Exercise 7. Find out what Z1 , Z2 , and Z3 actually are. Then find all functions
ϕ : Z1 → Z1 and all functions ψ : Z2 → Z2 . Which of these are injective and which are
surjective? Also, find all injective functions τ : Z3 → Z3 .
Exercise 8. Think about how counting is related to surjections, injections, and bijec-
tions.
Theorem 45. Suppose X is a non-empty finite set. If there is a bijective function from
X to Zn , then #X = n. Conversely, if #X = n and ϕ : Zn → X is surjective, then it is, in
fact, bijective.
Theorem 47. Suppose X and Y are sets and X ⊂ Y . Then the following two state-
ments are true:
(1) If X is infinite, then Y is infinite.
(2) If Y is finite, then X is finite.
Theorem 67. Suppose a and x are real numbers and c is a positive real number. Then
−|x| ≤ ±x ≤ |x|. Moreover, the interval (a − c, a + c) equals the set {x ∈ R : |x − a| < c}.
Theorem 68 (Triangle inequality). If x, y ∈ R, then
|x + y| ≤ |x| + |y|
and
|x + y| ≥ |x| − |y|.
Definition 19. Let X be a set. A function d : X × X → [0, ∞) is called a metric or
distance function if it has the following properties:
(1) d(x, y) = 0 if and only if x = y.
(2) d(x, y) = d(y, x) for all x, y ∈ X.
(3) d(x, y) ≤ d(x, z) + d(z, y) for all x, y, z ∈ X.
(X, d) is then called a metric space. The number d(x, y) is called the distance between x
and y.
Property (3) is also called triangle inequality.
Theorem 69. The function R × R → [0, ∞) : (x, y) 7→ |x − y| is a metric.
Note that, by Theorem 67, the interval (a − c, a + c) is the set of all real numbers whose
distance from a is less than c.
CHAPTER 2
2.1. Sequences
Definition 20. A sequence of real numbers is a function which maps N to R.
For a sequence x : N → R we will usually write n 7→ xn , where xn is traditionally used
for the value x(n) of the function x at n.
Exercise 10. For the sequence x : n 7→ 1/n and y : n 7→ 1/n2 consider where the
points xn and yn fall on the number line.
We have the impression that these sequences approach (whatever that might mean)
zero. How could we make such a notion precise?
Exercise 11. Show that |1/n − 0| < 1/100 for all n > 100 and |1/n − 0| < 1/1000 for
all n > 1000.
Exercise 12. Intuitively, what number L do the entries of the sequence n 7→ xn =
(n2 + (−1)n )/n2 approach?
Exercise 13. Consider the sequence from the previous exercise and suppose ε = 1/100.
Find an N ∈ R such that |xn − L| < ε for all n > N . What about ε = 1/500? What will
happen for even smaller ε? Note that ε describes the error we allow.
Definition 21. Let x : n 7→ xn be a sequence. We say that x converges to the real
number L if for every positive real number ε there is an N ∈ R such that |xn − L| < ε
whenever n > N ; or, more concisely,
∀ε > 0 : ∃N ∈ R : ∀n > N : |xn − L| < ε.
We say that the sequence x converges or is convergent if there is a real number L such
that x converges to L. If a sequence is not convergent, we say that it diverges or is divergent.
Theorem 70. If c ∈ R, then x : n 7→ xn = c converges to c.
Theorem 71. The sequence n 7→ 1/n converges to 0.
Theorem 72. If the sequence x converges to L1 and also to L2 , then L1 = L2 .
Definition 22. If a sequence x converges to L, then L is called the limit of x and we
write
lim xn = L.
n→∞
Definition 23. We say that a sequence is bounded above (or below) if its range is
bounded above (or below). A sequence is called bounded if it is bounded both above and
below.
Theorem 73. A finite set of real numbers is bounded.
15
16 2. SEQUENCES AND SERIES
Similarly, the limit of the sequence n 7→ inf{xk : k ≥ n} is called the limit inferior of x and
is denoted by lim inf n→∞ xn , i.e.,
lim inf xn = lim inf{xk : k ≥ n}.
n→∞ n→∞
Theorem 90. Let x be the sequence defined by xn = (−1)n + 1/n. Then we have
lim supn→∞ xn = 1 and lim inf n→∞ xn = −1.
2.1. SEQUENCES 17
Theorem 91. Suppose x is a bounded sequence. Then lim inf n→∞ xn ≤ lim supn→∞ xn .
Theorem 92. Let x be a bounded sequence. Then lim supn→∞ −xn = − lim inf n→∞ xn .
Theorem 93. Let x and y be bounded sequences such that xn ≤ yn for all n ∈ N.
Then lim supn→∞ xn ≤ lim supn→∞ yn and lim inf n→∞ xn ≤ lim inf n→∞ yn .
Theorem 94. Let x and y be bounded sequences. Then
lim sup(xn + yn ) ≤ lim sup xn + lim sup yn
n→∞ n→∞ n→∞
and
lim inf (xn + yn ) ≥ lim inf xn + lim inf yn .
n→∞ n→∞ n→∞
Note that, if m ≤ n, only the values f (j) with m ≤ j ≤ n are needed in the above
definition.
The following theorems are useful when dealing with finite sums. In each of them f is
a real-valued function with an appropriate domain.
Pn Pn+p
Theorem 104. Suppose m, n, p ∈ Z. Then k=m f (k) = k=m+p f (k − p).
Pn Pn
Theorem 105. Suppose m, n ∈ Z. Then k=m f (k) = k=m f (m + n − k).
Theorem 106. Suppose k, m, n ∈ Z and k ≤ m ≤ n. Then
n
X m
X n
X
f (j) = f (j) + f (j).
j=k j=k j=m+1
Pm Pn Pn Pm
Theorem 107. If m, n ∈ N ∪ {0}, then k=0 `=0 f (`, k) = `=0 k=0 f (`, k).
Theorem 108. Any permutation is a finite composition of transpositions.
Theorems 105 and 107 are special cases of the following one.
Theorem 109 Pn commutative law). Suppose n ∈ N and π : Zn → Zn is
Pn(The generalized
bijective. Then k=1 f (k) = k=1 f (π(k)). Moreover, P if A is a finite setPwith n elements
n n
and φ and ψ are bijective functions from Zn to A, then k=1 f (φ(k)) = k=1 f (ψ(k)).
This theorem allows for the following definition.
2.3. Series
Definition 30. If x : N → R is a sequence of real numbers the sequence
n
X
s : N → R : n 7→ sn = xk
k=1
P∞
is a called the sequence of partial sums of x or a series. We will denote s by k=1 xk .
Thus a series is actually a sequence. Conversely, in view of Theorem 33, a sequence may
also be cast as a series:
Pgiven a sequence s defining x1 = s1 and xn = sn − sn−1 Pshows that
∞ ∞
s is a series, viz., s = k=1 xk . We emphasize that (in this context) the symbol k=1 xk is
not to be thought of as a real number. It is only a concise notation for a sequence of partial
sums. Indeed, if this sequence ofPpartial sums does not converge, it is not sensible to assign
∞
any real number to the symbol k=1 xk .
2.3. SERIES 19
for any k ∈ N.
20 2. SEQUENCES AND SERIES
Theorem 120 (Comparison Test). Let x, y, and z be sequences P∞ and assume there is
an N ∈ NP such that |xn | ≤ yn ≤ zn for all n ≥ N . If the series n=1 yn converges, then
∞
the series n=1 xn converges, too, and
∞ ∞
X X
x ≤ yn .
n
n=N n=N
P∞ P∞
If the series n=1 yn diverges, then the series n=1 zn diverges, too.
P∞
Theorem 121. If 2 ≤ k ∈ N, then n=1 n−k converges.
Theorem 122 (Raabe’s4 Test). Suppose that there is an N ∈ N and a β > 1 such that
an+1 β
an ≤ 1 − n + 1
P∞
whenever n ≥ N . Then the series n=1 an converges.
Theorem 123 (Ratio Test). Suppose that xn 6= 0 for all n ∈ N and that
xn+1
lim sup
< 1.
n→∞ xn
P∞
Then the series n=1 xn converges.
P∞
Theorem 124. Let a be a fixed real number. Then the series n=0 an /n! converges.
Definition 32. We set
∞
X
exp(a) = an /n!
n=0
and
∞
X
exp(1) = 1/n! = e.
n=0
Theorem 129. Let x ≥ 0 and n ∈ N. Then there exists a unique y ≥ 0 such that
y n = x.
Definition 34. The unique number
√ y given by Theorem 129 is called the (non-negative)
n-th root of x ≥ 0. We write y = n x.
√
Inspired by Theorem 39 the notation y = n x = x1/n is frequently used.
√
Theorem 130. If p > 0, then limn→∞ n p = 1.
√
Theorem 131. limn→∞ n n = 1.
p
P∞(Root Test). Let x be a sequence such that lim supn→∞ |xn | < 1.
n
Theorem 132
Then the series n=1 xn converges.
The following inequality shows that the root test is more precise than the ratio test.
The advantage of the ratio test is that it is usually more easily applicable.
Theorem 133. Suppose a is a bounded sequence of positive numbers. Then
√ an+1
lim sup n an ≤ lim sup
n→∞ n→∞ an
if the right hand side exists.
P∞ P∞
Definition 35. ThePseries n=1 xn is said to converge absolutely if the series n=1 |xn |
∞
converges. If the series n=1 xn converges, but does not converge absolutely, then we say
that it converges conditionally.
P∞
Theorem 134. If the series n=1 xn converges absolutely, then it converges and
X∞ X ∞
xn ≤ |xn |.
n=1 n=1
Definition 36. Let π : N → N be a bijective sequence. Let Px∞be a given sequence and
define a sequence
P∞ y by yn = x πn . Then we say that the series n=1 yn is a rearrangement
of the series n=1 xn .
P∞ P∞
Theorem 135.* P∞Suppose thatP∞ n=1 xn converges absolutely and let n=1 yn be a
rearrangement of n=1 xn . Then n=1 yn converges absolutely, and
∞
X ∞
X
yn = xn .
n=1 n=1
P∞
Let n=1 xn be a convergent series which does not converges absolutely. Then there is
a rearrangement of the series such that the sequence of its partial sums is not bounded above
or below. Moreover, given any real number s, there is a rearrangement which converges to
s. We will not try to prove this statement.
CHAPTER 3
A zoo of functions
Definition 37. Let X be a set and f, g functions from X to R. We define their sum,
difference, and product respectively by (f ± g)(x) = f (x) ± g(x) and (f g)(x) = f (x)g(x)
for all x ∈ X. If g(x) 6= 0 for all x ∈ X we also define the quotient of f and g by
(f /g)(x) = f (x)/g(x).
The composition of functions (g ◦ f )(x) = g(f (x)) is defined in the appendix.
Definition 38. Let S ⊂ R. A function f : S → R is called non-decreasing (or non-
increasing) if f (x) ≤ f (y) (or f (x) ≥ f (y)) whenever x, y ∈ S and x ≤ y. It is called
strictly increasing (or strictly decreasing) if the inequalities are strict. A function is called
monotone if it is non-increasing or non-decreasing. It is called strictly monotone if it is
strictly increasing or strictly decreasing.
Definition 39. If n is a non-negative integer and a0 , a1 , . . ., an are real numbers, then
the function p : R → R defined by
n
X
p(x) = ak xk
k=0
is called a polynomial function or a polynomial for short. The integer n is called the degree
of p if an 6= 0. The zero function is also a polynomial but no degree is assigned to it.
Theorem 136. If p and q are polynomials of degree n and k, respectively, then p + q
and pq are also polynomials. The degree of p + q is the larger of the numbers n and k unless
n = k in which case the degree is at most n. The degree of pq equals n + k.
Theorem 137. A polynomial p of degree n has at most n zeros, i.e., there are at most
n points x such that p(x) = 0.
Definition 40. Let p and q be polynomials and assume q 6= 0, i.e., q is not the zero
polynomial. Let S = {x ∈ R : q(x) 6= 0}. Then, the function r : S → R given by
p(x)
r(x) =
q(x)
is called a rational function.
Every polynomial is a rational function (choose q = 1).
Definition 41. Let J be an interval in R. A function f : J → R is called an algebraic
function if there is a natural number n and polynomials p0 , p1 , ..., pn , not all zero, such
that
p0 (x) + p1 (x)f (x) + · · · + pn (x)f (x)n = 0
for all x ∈ J. A function which is not algebraic is called transcendental.
23
24 3. A ZOO OF FUNCTIONS
P∞ x2k+1 P∞ x2k
Theorem 146. Suppose x > 0. Then the series k=0 (2k+1)! and k=0 (2k)! are con-
vergent.
3. A ZOO OF FUNCTIONS 25
Continuity
27
28 4. CONTINUITY
Theorem 152. Let S, a, f , and g be as in Theorem 151. Suppose that g(x) 6= 0 for all
x ∈ S and that limx→a g(x) 6= 0. Then f /g has a limit at a. In fact,
lim (f /g)(x) = lim f (x)/ lim g(x).
x→a x→a x→a
P∞Definition 52. Let x0 ∈ R and an ∈ R for all non-negative integers n. Then the series
n
n=0 an (x − x 0 ) is called a power series.
P∞
P∞Theorem 178. Suppose that x, t ∈ R and |x| < |t|. If tn converges, then
n=0 anP
n
P ∞ n ∞ n
n=0 an x converges absolutely. If, however, n=0 an x diverges, then n=0 an t diverges
also.
P∞
Definition 53. Let the series S = n=0 an xn and the set
∞
X
A = {x ≥ 0 : |an |xn converges}
n=0
P∞
Theorem 180. Suppose the power series n=0 an xn has radius of convergence R > 0
and that 0 < ε < R. Then the power series converges uniformly on S = {x ∈ R : |x| ≤
R − ε}. If it has infinite radius of convergence and r > 0, then it converges uniformly on
S = {x ∈ R : |x| ≤ r}.
Theorem 181. The power series defining the exponential function or the sine or cosine
functions have infinite radius of convergence. Each of these functions is continuous on R.
CHAPTER 5
Differentiation
Throughout this chapter S denotes a set without isolated points. The most important
case is when S is an interval (finite or infinite; open, closed, or half-closed).
5.1. Derivatives
Definition 54. Let f : S → R be a function, and a ∈ S. We say that f is differentiable
at a if
f (x) − f (a)
lim
x→a x−a
exists. The limit is called the derivative of f at a and is commonly denoted by f 0 (a). If
f : S → R is differentiable at every point a ∈ S, we say that f is differentiable on S.
The expression
f (x) − f (a)
,
x−a
defined for x ∈ S \ {a} is called a difference quotient.
The difference quotient has an obvious geometric interpretation as the slope of a secant
passing through two points on the graph of f . The notation f 0 for the derivative of f was
introduced by Newton1. We will not use Leibniz’s2 notation involving differentials (df /dx).
Theorem 182. Let f : S → R be differentiable at a ∈ S. Then f is continuous at a.
Theorem 183. The constant function and the identity function are differentiable on
R. Their derivatives at any point are 0 and 1, respectively.
Theorem 184. The function h : R → [0, ∞) : x 7→ |x| is differentiable on R \ {0} but
not at zero.
√
Theorem 185. The function f : [0, ∞) → [0, ∞) : x 7→ x is differentiable on (0, ∞)
but not at zero.
Theorem 186. Let f : S → R and g : S → R be functions which are differentiable at
a ∈ S and let c ∈ R be a constant. Then
(1) cf is differentiable at a and (cf )0 (a) = cf 0 (a) and
(2) f ± g are differentiable at a, and (f ± g)0 (a) = f 0 (a) ± g 0 (a).
Theorem 187 (Product or Leibniz rule for derivatives). Let f : S → R and g : S → R
be functions which are differentiable at a ∈ S. Then, f g is differentiable at a and
(f g)0 (a) = f 0 (a)g(a) + f (a)g 0 (a).
33
34 5. DIFFERENTIATION
Note that the rule for the differentiation of the power function with an integer power
works for all n ∈ Z.
Since, in a vicinity of a, the term (x − a)h(x) is generally much smaller than f (a) +
m(x − a), the theorem says that f (x) may be well approximated by f (a) + m(x − a) if and
only if it is differentiable at a. The function x 7→ f (a) + m(x − a) is called the tangent line
of f at the point (a, f (a)).
Theorem 194. The exponential functions has derivatives of all orders on all of R, in
fact exp(n) (x) = exp(x) for all n ∈ N and all x ∈ R.
5.2. THE MEAN VALUE THEOREM AND TAYLOR’S THEOREM 35
Theorem 209.PSuppose that the functions gk : [a, b] → R are differentiable, that the
∞ 0
series of functions P k=0 gk converges uniformly on [a, b] and that, for some x0 ∈ [a, b],
∞ P∞
the numerical series k=0 gk (x0 ) also converges. Then k=0 gk converges uniformly to a
differentiable function on [a, b] and, for each x ∈ [a, b],
∞
!0 ∞
X X
gk (x) = gk0 (x).
k=0 k=0
P∞ n
P∞
Theorem 210. The power series n=0 cn x and n=1 ncn xn−1 have the same radius
of convergence.
Theorem P∞ 211. Let a > 0. Suppose the function f : (−a, a) → R is defined by the
power series n=0 cn xn whose radius of convergence is at least a. Then f has derivatives
of all orders on (−a, a).
Theorem 212. sin and cos have derivatives of all orders on R. In fact, sin0 = cos and
0
cos = − sin.
Definition 57. Suppose f : (a, b) → R has derivatives of all orders at x0 ∈ (a, b).
Consider the power series
∞
X f (n) (x0 )
Tf (x) = (x − x0 )n .
n=0
n!
If there is a positive r such that f (x) = Tf (x) for all x ∈ (x0 − r, x0 + r) then f is called
analytic at x0 and Tf is called the Taylor series of f at x0 .
5.3. UNIFORM CONVERGENCE AND DIFFERENTIATION 37
Analytic functions are the realm of Complex Analysis, perhaps the most beautiful sub-
ject in Mathematics.
Theorem 213. The function x 7→ 1/(1 − x) defined on (−∞, 1) is analytic at zero.
CHAPTER 6
Integration
Throughout this chapter a and b denote two real numbers such that a < b.
and
n
X
U (f, P ) = Mj (xj − xj−1 )
j=1
39
40 6. INTEGRATION
Theorem 224. Uniformly continuous functions on finite closed intervals are Riemann
integrable.
As a matter of fact, the qualifier “uniformly” is not necessary in the previous theorem,
because any function which is continuous on a finite closed interval is uniformly continuous
there. However, to prove this fact one needs the Heine-Borel theorem, which we have
not available. Instead, we will try for a different proof for the integrability of continuous
functions in Theorem 236 following an idea advocated by C. Bennewitz3.
The concept of the Riemann integral has proved to be too weak and too cumbersome for
many applications in differential equations, probability, and elsewhere. It is too weak since
many functions one might want to integrate are not Riemann integrable, e.g., Dirichlet’s
function. It is too cumbersome since it is sometimes difficult to show when limit processes
can be interchanged with integration (despite what we show in Section 6.5). To remedy
this Lebesgue4 introduced a more powerful integral called the Lebesgue integral. This is the
topic of more advanced courses in Real Analysis.
Theorem 234. Suppose f is bounded on [a, b]. Then the functions F± : [a, b] → R : x 7→
I± (f, a, x) are differentiable at any point c where f is continuous in which case F±0 (c) = f (c).
Theorem 235. Suppose f : [a, b] → R is Riemann integrable over [a, b] and F : [a, b] →
R is a piecewise antiderivative of f . Then
Z b
f = F (b) − F (a).
a
The combination of Theorems 234 (for the case where F± coincide) and 235 is known as
the Fundamental Theorem of Calculus. Its essence is that the operations of integration and
differentiation are in some sense inverses of each other. But note that Theorem 234 makes
no statement as to the size of the set where F 0 does not exist while Theorem 235 requires
that set to be finite. This imbalance is a shortcoming of the Riemann integral and is only
overcome by the more powerful Lebesgue integral.
Theorem 236. Bounded, piecewise continuous functions on finite closed intervals have
piecewise antiderivatives and are Riemann integrable.
Theorem 237 (Integration by parts). Suppose f, g : [a, b] → R are bounded and piece-
wise continuous on [a, b] and let F and G, respectively, be piecewise antiderivatives. Then
Z b Z b
Fg + f G = F (b)G(b) − F (a)G(a).
a a
Theorem 241. Suppose that n 7→ gn : [a, b]P → R is a sequence of functions which are
∞
Riemann integrable over [a, b] and that the series n=0 gn converges uniformly to a function
g : [a, b] → R. Then g is Riemann integrable and
Z b ∞ Z b
X
g= gn .
a n=0 a
Special topics
limx→a f 0 (x)/g 0 (x) = L for some real number L. Then g is one-to-one on (a, b) and, for
every ε > 0, there is a point c ∈ (a, b) such that for all x, y ∈ (a, c) we have g(x) 6= 0 and
f (x) − f (y)
L−ε< <L+ε
g(x) − g(y)
as long as x 6= y.
Theorem 249 (L’Hôpital’s1 rule I). Let (a, b) be an open interval (allowing for a = −∞
and b = ∞) and let f, g : (a, b) → R be differentiable functions. Assume g 0 (x) 6= 0
for all x ∈ (a, b), limx→a f (x) = limx→a g(x) = 0, and limx→a f 0 (x)/g 0 (x) = L where
L ∈ R ∪ {−∞, ∞}. Then
f (x)
lim = L.
x→a g(x)
Theorem 250 (L’Hôpital’s rule II). Let (a, b) be an open interval (allowing for a = −∞
and b = ∞) and let f, g : (a, b) → R be differentiable functions. Assume g 0 (x) 6= 0
and g(x) > 0 for all x ∈ (a, b), limx→a g(x) = ∞, and limx→a f 0 (x)/g 0 (x) = L where
L ∈ R ∪ {−∞, ∞}. Then
f (x)
lim = L.
x→a g(x)
Definition 71. The unique zero of cos in (0, 2) is denoted by π/2, i.e., π is twice that
zero.
Theorem 258. π > 0, cos(π/2) = 0, and sin(π/2) = 1.
Theorem 259. sin(x + π/2) = cos(x), sin(x + π) = − sin(x), and sin(x + 2π) = sin(x)
for all x ∈ R.
Theorem 260. sin(x) = 0 if and only if x = mπ with m ∈ Z.
Theorem 261. Suppose sin(x + a) = sin(x) for all x ∈ R. Then a is an integer multiple
of 2π.
Theorem 262. sin is strictly increasing on [−π/2, π/2]. Also, cos is strictly decreasing
on [0, π].
Definition 72. A function f : (a, b) → R is called convex on (a, b), if
f ((1 − t)x + ty) ≤ (1 − t)f (x) + tf (y)
for all x, y ∈ (a, b) and all t ∈ [0, 1]. It is called concave on (a, b), if −f is convex on (a, b).
Note that the graph of the function t 7→ (1 − t)f (x) + tf (y) describes a straight line
segment joining f (x) and f (y) in the coordinate plane. Convexity means that the graph of
a function lies below any such segment.
Theorem 263. f is convex on (a, b) if and only if
f (v) − f (u) f (w) − f (v)
≤
v−u w−v
whenever a < u < v < w < b.
Theorem 264. If f : (a, b) → R is twice differentiable and f 00 ≥ 0 on (a, b), then f is
convex on (a, b).
Theorem 265. The sine function is positive in (0, π) and negative in (π, 2π). It is
strictly increasing in (0, π/2) and (3π/2, 2π) and strictly decreasing in (π/2, 3π/2). Finally,
it is concave in (0, π) and convex in (π, 2π).
This theorem allows to draw a sketch of the sine function on [0, 2π] and by periodicity
on all of R. The graph of the cosine function is simply obtained by shifting the graph of the
sine function.
Theorem 266. tan is defined on R\{(2m+1)π/2 : m ∈ Z} and has range R. Moreover,
tan(x + π) = tan(x) for all x in its domain.
Theorem 267. The tangent function is negative and concave in (−π/2, 0) and positive
and convex in (0, π/2). It is strictly increasing in the entire interval (−π/2, π/2).
Definition 73 (Inverse trigonometric functions). arcsin, arccos, and arctan are the
inverse functions of sin |[−π/2,π/2] , cos |[0,π] , and tan |(−π/2,π/2) , respectively.
Theorem 268. arcsin : [−1, 1] → [−π/2, π/2]√is strictly increasing and continuous. It
is differentiable on (−1, 1). In fact arcsin0 (x) = 1/ 1 − x2 .
Theorem 269. arctan : R → (−π/2, π/2) is strictly increasing and differentiable. In
fact arctan0 (x) = 1/(1 + x2 ).
P∞
Theorem 270. If |x| < 1, then arctan(x) = n=0 (−1)n x2n+1 /(2n + 1).
Theorem 271. π/4 = arctan(1) = arctan(1/2) + arctan(1/3).
505 6115
Theorem 272. 162 ≤π≤ 1944 .
48 7. SPECIAL TOPICS
Note that for constant speed v the length of a curve is v(b − a) (speed times time).
Theorem 276. Suppose two curves are continuously differentiable, equivalent to each
other, and that the function ϕ establishing the equivalence has a continuous derivative.
Then the curves have the same length.
The importance of this observation is that it shows length to be a property of (certain)
geometric objects in the plane rather than of functions with values in the plane.
Theorem 277. The length of the unit circle (its circumference) is 2π.
7.4. SETS OF MEASURE ZERO AND SOME CONSEQUENCES 49
Definition 80. Let f, F : [a, b] → R be two continuous functions with the property
f (x) ≤ F (x). Suppose S = {(x, y) ∈ R2 : a ≤ x ≤ b and f (x) ≤ y ≤ F (x)}. Then the area
of S is given by
Z b
A= (F − f ).
a
Exercise 20. Suppose f (x) = 0 and F (x) = h. What kind of set is S and what is its
area?
Theorem 278. The area of the unit disk is π.
Theorem 286. The interval [0, 1] does not have measure zero.
Theorem 287. The set of real numbers is uncountable.
APPENDIX A
In this section we briefly recall some notation and a few facts from set theory and logic1.
Some of what is said below is intentionally vague lest we should write a book on the matter.
1For more details you may check my lecture notes on Algebra at http://www.math.uab.edu/
~rudi/teaching/algebra.pdf and the books referenced there.
51
52 A. SOME SET THEORY AND LOGIC
Theorem A.1. Let A, B and C be sets. Then the following statements hold:
(1) Commutative laws:
A∪B =B∪A and A ∩ B = B ∩ A.
(2) Associative laws:
(A ∪ B) ∪ C = A ∪ (B ∪ C) and (A ∩ B) ∩ C = A ∩ (B ∩ C).
(3) Distributive laws:
\ \
A∪( X) = (A ∪ X) and
X∈F X∈F
[ [
A∩( X) = (A ∩ X).
X∈F X∈F
The set X \ A which consists of all the elements of X which do not belong to A, i.e.,
x ∈ X \ A if and only if x ∈ X and x 6∈ A. If A is a subset of X, then X \ A is called the
complement of A in X. If it is clear what X is, then the complement is also denoted Ac .
A.3. FUNCTIONS 53
Theorem A.2 (De Morgan’s laws). Let E be a set and suppose that F is a non-empty
collection of subsets of E. Then De Morgan’s laws hold:
!c
[ \
X = X c,
X∈F X∈F
!c
\ [
X = X c.
X∈F X∈F
These formulas can be expressed concisely, if not precisely, as follows: The complement
of a union is the intersection of the complements and the complement of an intersection is
the union of the complements.
A.3. Functions
Let X and Y be sets. An ordered pair is an object of the form (x, y) where x ∈ X and
y ∈ Y . The set of all such ordered pairs (x, y) is called the Cartesian product of X and Y ,
and is written X × Y . A subset f of X × Y is a function, if for each x ∈ X there is a unique
y ∈ Y such that (x, y) ∈ f . As the concept of function is extremely important we repeat
the preceding statement in a more formal way: f ⊂ X × Y is a function if and only if
(i) ∀x ∈ X : ∃y ∈ Y such that (x, y) ∈ f and
(ii) if (x, y1 ) ∈ f and (x, y2 ) ∈ f , then y1 = y2 .
The more conventional notation for functions is y = f (x) instead of (x, y) ∈ f . If f ⊂ X × Y
is a function, we write f : X → Y or, to define it precisely, f : X → Y : x 7→ y = f (x), e.g.,
f : R → [0, 1] : x 7→ 1/(1 + x2 ). We call X the domain of f , Y the codomain or target of f ,
and say that f maps X to Y . The element f (x) ∈ Y is called the image of x under f . The
set of those y ∈ Y for which there is an x ∈ X such that y = f (x) is called the range or
image of f , and will be written as ran(f ). Let f : X → Y . If A ⊂ X, we define the function
f |A : A → Y , called the restriction of f to A, by setting (f |A )(x) = f (x) for every x ∈ A.
We define the image of A under f as the set
f (A) = {y ∈ Y : ∃x ∈ A : f (x) = y} = {f (x) : x ∈ A}.
In other words f (A) = ran(f |A ).
Theorem A.3. Let f : X → Y and A, B ⊂ X. Then f (A ∪ B) = f (A) ∪ f (B) and
f (A ∩ B) ⊂ f (A) ∩ f (B).
Equality does not always hold in the second case. Can you think of an example where
it does not?
If A ⊂ Y , we define the pre-image of A under f , denoted by f −1 (A) ⊂ X, by:
f −1 (A) = {x ∈ X : f (x) ∈ A}.
Theorem A.4. Let f : X → Y and A, B ⊂ Y . Then f −1 (A ∪ B) = f −1 (A) ∪ f −1 (B)
and f −1 (A ∩ B) = f −1 (A) ∩ f −1 (B).
A function f : X → Y is onto (or surjective) if ran(f ) = Y . A function f : X → Y is
one-to-one (or injective) if f (x1 ) = f (x2 ) implies that x1 = x2 . If f is both one-to-one and
onto, we say that f is bijective. A bijective function from a finite set X to itself is called a
permutation of X. A permutation π is called a transposition if π(x) = x for all but at most
two elements x of the domain of π.
Suppose now that f : X → Y is an injective function. Then there exists a unique
function g : f (X) → X such that g(f (x)) = x for all x ∈ X, and f (g(y)) = y for all
54 A. SOME SET THEORY AND LOGIC
y ∈ f (X). The function g is denoted f −1 , and is called the inverse function of f . Note that
notation is abused here in the sense that f −1 is used to describe two different things. Only
the context tells which of the two meanings is used. In any case, caution is advised. These
two different meanings are exemplified in
f −1 ({y}) = {f −1 (y)}
which holds for all y ∈ Y if f : X → Y is bijective.
Given a function f : X → Y and a function g : Y → Z their composition g ◦ f is the
function from X to Z defined by
(g ◦ f )(x) = g(f (x)).
Theorem A.5. The inverse of an injective function is injective. The composition of
injective functions is injective and the composition of surjective functions is surjective.
arccos, 47 uniform, 30
arcsin, 47 convex, 47
arctan, 47 countable, 10
countably infinite, 10
analytic, 36 curve
antiderivative, 42 plane, 48
piecewise, 42
area, 49 de Morgan’s laws, 53
associative law of addition, 5 degree, 23
associative law of multiplication, 5 dense, 12
derivative, 33
bijective, 53 higher order, 34
binary operation, 5 second order, 34
binomial coefficients, 20 differentiable, 33
Bolzano-Weierstrass theorem, 17 twice, 34
bound variable, 51 disjoint, 52
bounded, 11 disk, 48
sequence, 15 unit, 48
bounded above, 11 distance, 13
bounded below, 11 distance function, 13
divergence
Cartesian product, 53 of a sequence, 15
Cauchy sequence, 17 of a series, 19
circle, 48 domain, 53
unit, 48
codomain, 53 element, 52
commutative law of addition, 5 empty set, 52
commutative law of multiplication, 5 Euclidean metric, 48
complement, 52 extremum
complete, 17 local, 35
composition, 54
concave, 47 factorial, 8
continuity formula, 51
uniform, 40 free variable, 51
continuous function, 53
at a, 28 algebraic, 23
on S, 28 cosine, 25
convergence exponential, 24
absolute, 21 logarithmic, 24
conditional, 21 polynomial, 23
of a function, 27 rational, 23
of a sequence, 15 sine, 25
of a series, 19 tangent, 25
pointwise, 29 functions
55
56 INDEX
trigonometric, 25 sequence, 16
Fundamental Theorem of Calculus, 42 non-increasing
function, 23
greatest lower bound, 11 sequence, 16
non-negative, 7
identity, 5 non-positive, 7
additive, 5 number
multiplicative, 5 natural, 8
image rational, 8
of a function, 53 whole, 8
of a point under a function, 53
of a set under a function, 53 one-to-one, 53
inclusion, 52 onto, 53
inductive, 8 ordered pair, 53
infimum, 11
injective, 53 part
integer, 8 negative, 41
intersection, 52 positive, 41
interval partial sums, 18
closed, 8 partition, 39
finite, 7 permutation, 53
infinite, 8 polynomial, 23
open, 8 positive, 7
inverse, 5 power, 24
additive, 5 integer, 9
multiplicative, 5 power series, 30
inverse function, 54 pre-image, 53
isolated point, 27 predecessor, 8
predicate, 51
least upper bound, 11 primitive, 42
length piecewise, 42
of a curve, 48 product, 5
limit, 15
of a function, 27 radius, 48
limit inferior, 16 radius of convergence, 30
limit point, 27 range, 53
limit superior, 16 rational number, 8
linearity, 41 rearrangement, 21
lower bound, 11 reciprocal, 6
lower Riemann integral, 40 refinement, 39
lower Riemann sum, 39 common, 39
lowest terms, 10 restriction, 53
Riemann integrable, 40
maximum, 11
local, 35 sequence, 15, 29
measure zero, 49 sequence of partial sums, 18
metric, 13 series, 18
minimum, 11 alternating harmonic, 19
local, 35 geometric, 19
monotone, 23 harmonic, 19
strictly, 23 set, 52
speed, 48
natural numbers, 8 statement, 51
negation, 51 strictly decreasing
negative number, 7 function, 23
negative of a number, 6 sequence, 16
neighborhood, 45 strictly increasing
non-decreasing function, 23
function, 23 sequence, 16
INDEX 57
subject, 51
subsequence, 17
subset, 52
proper, 52
successor, 8
sum, 5
supremum, 11
surjective, 53
target, 53
Taylor series, 36
telescoping sum, 9
transcendental, 23
transposition, 53
triangle inequality
for a metric, 13
for the absolute value, 13
truth value, 51
uncountable, 10
union, 52
unit circle, 48
unit disk, 48
upper bound, 11
upper Riemann integral, 40
upper Riemann sum, 39
variable, 51
velocity, 48
whole numbers, 8