Calculus of Vector Functions
Calculus of Vector Functions
RICHARD E. WILLIAMSON
RICHARD H. CROWELL
HALE F. TROTTER
Digitized by the Internet Archive
in 2010
http://www.archive.org/details/calculusofvectorOOwill
Calculus
of
Vector Functions
Third Edition
Calculus
of
Vector Functions
RICHARD E. WILLIAMSON
Department of Mathematics
Dartmouth College
RICHARD H. CROWELL
Department of Mathematics
Dartmouth College
HALE F. TROTTER
Department of Mathematics
Princeton University
10 9 8 7
R. E. W.
R. H. C.
H. F. T.
Possible Courses
of Study
We have tried to organize the text so that each section leads naturally
to the next, but it is by no means necessary to include all sections or to
take them up in strictly consecutive order. In particular, it is not necessary
to complete the linear algebra before starting on the calculus. (Students
who have already taken linear algebra can, of course, start with Chapter 3
and use the first two chapters for reference and review.) Everything
essential for Chapters 3 through 7 is contained in the first seven sections
of Chapter (An exception is the use of facts about dimension in the
1.
until the section on change of variable in multiple integrals and the last
three sections on vector analysis, where they are really needed. On the
other hand, determinants can be used in practice for inverting small
(2-by-2 or 3-by-3) matrices or solving small systems of linear equations,
so that some may prefer to take them up earlier and postpone (or omit)
the row-reduction method given in the first section of Chapter 2. At the
cost of avoiding a few higher dimensional exercises later on, one may
even restrict discussion of matrix inversion to the trivial 2-by-2 case.
Other changes of order, such as taking up multiple integration early
in the course, should also cause no difficulty.
The sequences of section numbers listed below are minimal in the sense
that they contain the prerequisite material for later entries, but do not
contain everything that might be desirable in a typical course. Additional
sections can be added to make up a full one-year course suitable for par-
ticular needs. Experience has shown that for ordinary class use two
meetings should be used to cover an average section. An occasional short
Possible Courses of Study
section merits only one meeting, and some longer ones, in order to be
covered entirely, require three.
Ch. 1
Contents
Introduction, 1
1 Vectors, 4
2 Geometric interpretations, 7
3 Matrices, 15
4 Linear functions, 25
5 Dot products, 37
6 Euclidean geometry, 46
7 Determinants, 51
8 Determinant expansions, 67
2 Some applications, 88
9 Eigenvectors, 167
10 Coordinates, 179
8 Differentials, 250
Appendix, 579
1 Introduction, 579
Index, 611
Calculus
of
Vector Functions
Introduction
y = x2 + 3
yields a real number y for any real number x and so defines a function/
from % to %
with (for instance) /(0) 3,/(-2) 7,/(V3) =
6, etc. = =
In this book we are concerned with functions of several variables whose
values may be real numbers or, more generally, may be w-tuples of real
numbers. For example, a pair of formulas
Jl = V^i + x\ -f X3
y2 = xrx 2 + 5x 3
g(l,2,3) = (Vl4,17),
g(3, 2, 1) = (Vl4, 11).
We shall use 51" to stand for the set of all rt-tuples of real numbers.
1
(Jl is thus the same as '.R .) The domain of a function is on which it
the set
is defined, and the range or image is the set of values assumed by the
Introduction
/
31" ft"
ft 2 ft 1 defined by a formula
f(x, y) = z.
context in which they will not be confused with the usual notations for
matrices and numbers. The printing of a word in boldface type indicates
that the word is being defined at that point in the text.
1
SECTION 1
(x t +yu x +y t % ,. . . ,x n +_y„).
(rxj, rx 2 , . . . , rx n ).
x + y = (2, 6, -2, 6)
and
3x = (6, -3,0,9).
Sec. 1 Vectors
1 rx + sx = (r + s)x.
2. rx + ry = r(x + y).
3. r(sx) = (rs)x.
4. x + y = y + x.
5. (x + y) + z = x + (y + z).
6. x + = x.
7. x + (-x) = 0.
in Chapter 2, but for the present "vector" may be taken to mean "element
of :K"" for some n. Numbers arc sometimes called scalars when emphasis
on the distinction between numbers and vectors is wanted. In physics, for
example, mass and energy may be referred to as scalar quantities in
distinction to vector quantities such as velocity or momentum. The term
scalar multiple is synonymous with what we have called numerical
multiple.
The vectors
(1,0,. ..,0)
(0, 1 , 0, . . . , 0)
= (0,
en . . . , 0, 1)
x = xfr + . . . + x„e„.
Because every element of ,'R" can be so simply represented in this way, the
set of vectors {e l5 . . . , e„} is called the natural basis for 3l n . For example,
in 'Ji
3
we have (1 , 2, — 7) = + 2e — 7e The
ei 2 3. entries in
x=(x ,..., x n 1 )
EXERCISES
1. Given x = (3, — 1, 0), y (0, 1, 5), and z = (2, 5, — 1) compute 3x, y -I- z,
3. Show that no choice of numbers a and b can make ax by (3, 0, 0), -'
where x and y are as in Problem 1. For what value(s) of c (if any) can the
equation ax by = (3, 0, c) be satisfied?
I
Sec. 2 Geometric Interpretations
4. Write out proofs for (a) law 3 and (b) law 4 on page 5, giving precise justifica-
tion for each step.
5. Verify that the set C[a, b] of all continuous real-valued functions defined on
the interval a <x<b is a vector space, with addition and numerical
multiplication defined by (/ \- g)(x) = f(x) + g(x) and (rf)(x) = rf{x).
6. Prove that the representation of a vector x in Jl" in terms of the natural basis
is unique. That is, show that if
-*i e i + • • • + x ne n y^ + . . . + yn e n ,
then xk = yk for k = 1 n.
SECTION 2
Geometric representations of Jl 1
as a line, of 5l 2
as a plane, and of .II
3 GEOMETRIC
as 3-dimensional space may be obtained by using coordinates. To represent INTERPRETATIONS
1
'Si one must first specify a point on the line to be called the origin,
as a line,
a unit of distance, and a direction on the line to be called positive. (The
opposite direction is then called negative.) Then a positive number x
corresponds to the point which is a distance x in the positive direction
from the origin. A negative number x corresponds to the point which is a
distance |.y| from the origin in the negative direction. The number zero of
course corresponds to the origin. The number line is most often thought
of as horizontal with the positive direction to the right. With this standard
convention, we obtain the familiar Fig. 1, in which the arrow indicates
the positive direction.
Figure 1
first axis is
p x and whose projection on the second axis is /?.,. The (per-
pendicular) projection of a point/? on a line L is defined as the foot of the
Vectors and Linearity Chap. 1
The conventional choice is to take the first axis horizontal with the
positive direction to the right, and the second axis vertical with the
positive direction upwards. This leads to the usual picture shown in Fig. 2.
gives points p x ,p 2 and/? 3 on the three axes. Then the point/?, correspond-
,
ing to (jtx x 2 x 3 ), is the one whose projections on the three axes are p x p 2
, , , ,
and p 3 .
/>(-3,2)
Figure 2 Figure 3
Figure 4
This rule can be applied to any pair of vectors, whereas the parallelogram
law does not (strictly speaking) apply if the arrows representing u and v
lie in the same straight line.
If we write w for u + v, then v =w— u. Figure 6 (which is simply
Fig. 5 relabeled) illustrates the useful fact that the difference of two vectors
w and u is represented by the arrow from the end of u to the end of w,
appropriately translated.
v(translated) w - u (translated)
Figure 5 Figure 6
ing an has the same direction as the arrow representing u and is a times as
long. For a negative number b, b\\ points in exactly the opposite direction
to u and is \b\ times as long. (See Exercise 6(b).)
The question of whether to represent a vector geometrically as a
10 I 'velars and Linearity Chap. 1
Sec. 2 Geometric Interpretations 11
multiples r(l, 1, 1) to get a line through the origin. Then the set of all
Figure 9
plane, we require that x x and x 2 not lie on the same line. Another way to
state this condition is that no multiple of x x should equal a multiple of
x 2 except for the zero multiples. More generally, we say that a set of
vectors
Xj, x2 , . . . ,x n
c1 x 1 + . . . + cnxn =
12 Vectors and Linearity Chap. 1
for some numbers c lt c n , then all the c's must be zero. If on the
. . . ,
Ci—c +c = 2 3
2ci + c — 3c = 0. 2 3
If we now let cz have any nonzero value, we can then solve for c x and c 2 .
Ci — c2 =— 1
2c i + c2 = 3,
and we solve the two equations. Adding them gives 3c x = 2, or c x = §,
while subtracting the second from two times the first gives — 3c = 2 —5,
or Co = f Thus .
X = C&i + . . . + c„x n ,
3
Example 4. The natural basis for .'K is the set of vectors
then (<?!, c2 , c3 ) = (0, 0, 0). On the other hand, any vector (x, y, z)
3
in !il can be written
(x, y, z) = xe x + ye + 2 ze 3 .
EXERCISES
1. For each pair u, v of vectors in Jl 2 given below, draw the arrows representing
u, v, u -f v, u — v, and u + 2v.
u2 + .vv
2
. [Ans. w = (— f, f).]
(b) Let u lt Vj, and u 2 be as in part (a), but take v 2 = (2, —2). (Note that v 2
is then a numerical multiple of v^) Sketch the lines u l | t\ 1 and
u2 + s\.,. Show algebraically that the lines do not intersect.
5. Show that in :R", the lines represented by sux t u and /v t + v are the same
if and only if both v t and v ()
— u are numerical multiples of u l .
u2 + v
2
14 I ectors and Linearity Chap. 1
OVB and UPA are congruent. From this deduce that OK and UP are
parallel and of equal length. Then OVPU is a parallelogram, and the
parallelogram law follows,
(b) In Figs. 7(a) and 7(b) the points U, V, and W are constructed to have
coordinates as shown. Prove that the triangles OUP, OVQ, and OWR
are similar. Show that the angles HOP, VOQ, and WOR are therefore
equal and that the lengths OK and OW^arc proportional to the length
of Oil as stated in the text.
8. (a) Show if u and v are two distinct vectors, then the vectors ru +
that
(1 —form a line through the points corresponding to u and v.
r)v
n
(b) If x x and x 2 are two vectors in 3i then the set of all vectors rx x +
,
(1 — /)x 2 where ,< t < 1, is the line segment joining \ x and x 2 A set .
5" in R" is convex if, whenever S contains two points, it also contains the
line segment joining them. Prove that the intersection of any collection
of convex sets is convex.
9. Represent the following lines in the form rx x [ x , where / runs over all real
3
(a) The line in J\ parallel to (1, 2, 0) and passing through the point (1,1, 1).
2
(b) The line in 3i joining the points (1, 0) and (0, 1).
3
(c) The line in :R joining the points (1, 0, 0) and (0, 0, 1).
3
10. Represent the following planes in 3l in the form ux 1 + vx 2 I
x„, where
// and v run over all real numbers. Sketch each plane.
(a) The plane parallel to the vectors (1, 1,0) and (0, 1, 1) and passing
through the origin.
(b) The plane parallel to the vectors e t and ea and passing through the point
(0, 0, 1).
(c) The plane passing through the three points (1, 0, 0), (0, 1, 0), and
(0, 0, 1).
11. Determine whether each of the following sets of vectors is linearly dependent
or linearly independent.
12. Determine whether the first vector below is in the set spanned by the set S.
In each case give a geometric interpretation: the first point does (or does
not) lie on the plane (or line) spanned by S.
14. (a) Prove that two nonzero vectors x and y are linearly dependent if and
only if they lie on the same line through the origin,
(b) Prove that three nonzero vectors x, y, and z are linearly dependent if
and only if they all lie on the same plane through the origin.
J>!
= 2x x + 3x 2 — 4x 3
y2 = xl — x2 + 2x 3 ,
fiveexamples above have shapes 3-by-2, 2-by-3, 2-by-2, l-by-3, and 4-by-l.
Note that the number of rows always comes before the number of columns.
The 1-by-/? matrices are called n-dimensional row vectors, and /?-by-l
j a 21 a 22 a 23 a 2i ) or A = {au ) t
x 2 i, x nl , .
matrix with the same shape and ijth entry equal to a u + b u For example
.
2 3\ /l -1\ /3 2^
4/ \0 0/ \0
example
Sec. 3 Matrices 17
(The proofs are just the same as when X, Y, and Z are all in .'ft".) In other
words, according to the definition in Section 2, for any fixed m and n, the
set of in -by- n matrices forms a vector space with the operations of addition
z2 = 5jj + 2y 2 ,
and
yt = 2x t + 3x 2 + 4x 3
yi = X\ -^2 + ^X 3 ,
3 -l\ (2 3 4
and B =
5 2/ \l -1 2
za = 3(2*! + 3x 2 + 4*3) — Ui — x + 2x 2 3)
zx = (3 • 2 - 1 l)x, + (3 •
3 - 1 •
(-1)> 2 + (3 -4 - 1 • 2)x 3
z2 = (5 • 2 + 2 •
1)*! + (5 • 3 +2 • (- l))x 2 + (5 4 + 2 •
2)x3
'3-2-1-1 3 •
3 — 1 -
(— 1) 3 -
4 — 1
-
2\ (5 10 10
C=
X5-2 + 2-1 5-3 + 2-(-l) 5-4 + 2-2/ \12 13 24
A = (a u a 2 , . . . , a k) and B =
are row and column vectors of the same dimension. Then the product AB
is defined to be thenumber a-Jjx + a 2 b 2 + + a k b k Now let A be an . . . .
m-by-k matrix and B be an k-by-n matrix. (It is important that the number
of columns of A be equal to the number of rows of B.) Then each row of
A is a A'-dimensional row vector and each column of B is a ^-dimensional
column vector. We define the matrix product AB as the m-by-n matrix
whose ijth entry is the product (in the sense just defined) of the z'th row of
A and they'th column of B. The product AB always has the same number
of rows as A and the same number of columns as B. For instance, in our
example
'3 -1\ /2 3 4\ / 5 10 10^
12 13 24
the entry in the second row and third column of the result is obtained by the
calculation
You should check that the other entries in the product can be obtained by
the rule stated above. Schematically the entries in a matrix product are
found by the mechanism illustrated below. The process is sometimes
called row-by-column multiplication of matrices.
* * * *
* * *
* * * 3 ***[*]*
* * *
* * * *
3.1 Theorem
1. (A + B)C = AC + BC.
2. A(B + C) = AB + AC.
3. {tA)B = t{AB) = A(tB).
4. A(BC) = (AB)C.
According to the last law, it makes sense to talk of the product of three
matrices and simply write ABC, since the result is independent of how the
factors are grouped. In fact this 3-term associative law implies that the
result of multiplying together any finite sequence of matrices is inde-
pendent of how they are grouped. Not all the laws that hold for multi-
plication of numbers hold for multiplication of matrices. In particular,
the value of a matrix product depends on the order of the factors, and
AB is usually different from BA. It is also possible for the product of two
matrices to be a zero matrix, without either of the factors being zero.
Exercise 5 at the end of this section illustrates these points.
The laws stated above are easily proved by writing out what they mean,
using the definitions of the operations, and then applying the associative,
distributive, and commutative laws of arithmetic. Number 4 is the most
complicated to prove, and we give its proof in full below. The other
proofs are left as exercises.
To prove thatA(BC) = (AB)C, let A, B, and C have respective shapes
p-by-q, q-by-r, and r-by-s. Let U = BC and V = AB. (Then U has
shape q-by-s, and V has shape p-by-r.) We have to show that AU = VC.
The //'th element of AU is (by definition of matrix multiplication) equal
Q T
AU
k=l
ik ll kj-
HIT Ay'th
\
is
1=1
. (/th
r Tin \
sum
l<l<r
l<k<Q
. . . 1
that has l's on its main diagonal and zeros elsewhere is called an identity
matrix. It has the property that
IA = A, BI=B
for any matrices A, B such that the products are defined. Thus it is an
identity element for matrix multiplication just as the number 1 is an identity
for multiplication of numbers. There an n x n identity matrix for every
is
and according
T, 1 - U DC
to the definition this shows that
K
1 2\ /l 2\~ l
=
[1-2
is invertible and that I
3 l) \3 7/ \-3 1
Sec. 3 Matrices 21
Many matrices, on the other hand, are not invertible. No zero matrix can
have an inverse, and several less obvious examples are given in the
exercises.
It is usually not easy to tell whether a large matrix is invertible, and it
can take a lot of work to compute its inverse if it has one. Determinants
can be used to give a formula for the inverse of a matrix (Theorem 8.3),
and a more effective way to compute inverses is given in Section 1 of
Chapter 2. Two-by-two matrices and a few other easy cases are discussed
in Exercises 9 to 13 of this section. The rule for finding the inverse of a
2-by-2 matrix is as follows.
bY 1
-b\
3 2
-
a
=— H d i /
if ad-bc^O.
ad - bc\-
,
\c d! c a
-1
1 3\ (2 -3
,-1 2/ Hi 1
1 3\ /* —\\ /* -*\ / 1 3
-1 2/U \J U \JY
so that we have indeed found the inverse.
Two important properties of invertible matrices are easily proved
directly from the definition. The first one ensures that, no matter how an
inverse to a matrix A is computed, the resulting matrix A~ x is always the
same.
3.3 Theorem
Proof. Suppose there are two matrices, B and C, such that both
AB = BA = / and AC = CA = /.
3.4 Theorem
If A =A X A2 . . . A n and all of A
,
x , . . . , AH are invertible, then A is
Problems 7 and 8 show that matrix operations with diagonal matrices are
particularly simple.
EXERCISES
-2
C =
Sec. 3 Matrices 23
2. Show that for any matrix A and zero matrices of appropriate shapes,
AO =0 and OA = 0.
(a) AX = B + Y. (d) CX + DY = 0.
A(B + C) = AB + AC
for matrix multiplication.
1 2\ 2 6
5. Let U= V= . Compute UV and VU. Are they the
2-4/ \1 3/
same? It is possible for the product of two matrices to be zero without
either factor being zero?
6. Let X
Let D - diag(l,2, 3)
b\
10. (a) Let A = I
J
be a 2-by-2 matrix with ad # be, and let A~ x be given
2 1/' \l 0/ \1 3,
zero.
b\
12. Show that if ad = be, then I I is not invertible.
(a) A = (b) A =
(
4 3^
Ans. (a)
li
(Note that is the only number whose cube is 0. Part (b) of this problem
thus illustrates another difference between the arithmetic of numbers and of
matrices.)
(a) Show that if A = /or -/, then A2 = /, where /is an identity matrix of
any dimension.
(a b\- (\ 0\
(b) Show that I I = I I if a2 + be = 1 ; so the equation
Sec. 4 Linear Functions 25
SECTION 4
The product of an m-by-n matrix and ^-dimensional column vector LINEAR FUNCTIONS
(n-by-l matrix) is an w-dimensional column vector. An w-tuple in %
n
tion such as
fyi
) =
2 3
~4
\yj
~ (
\l -1 2)
jx = 2x + x 3x 2 — 4x 3
y<i = Xi X2 ~r 2X3,
as may be seen by simply writing out the result of the matrix multiplication.
Thus the vector function described by a matrix amounts to multiplication
of a domain vector by the matrix.
Functions given by matrix multiplication can be characterized by some
very simple properties. Note that the definitions given below apply to
functions between any two vector spaces, although we are at present
concerned only with the spaces 31".
C
hold for all vectors x, y in \J, and all numbers r.
4.1 Theorem
g is linear,
= *ig(ei) + • • • + x n g(e„).
By the definition of A we have
g(x)
a 21 x x + . . . + «o„.v„
^/,„ l
.v, + . . . + fl,„, r v,
7
g(x) = /fx.
The proof given above actually shows how to construct the matrix
corresponding to a linear function. This construction is important and
we summarize it as a theorem for emphasis.
Sec. 4 Linear Functions 27
4.2 Theorem
b d]\o)
~ \b)' \b d)\\) \d t
By Theorem 4.2,
2
28 I 'ectors and Linearity Chap. 1
m = Kg
*•> - ("$
(a) (b)
Figure 10
The geometrical effect of/ is illustrated in Fig. 11, in which C is the unit
/*i\ /"i\
= =
image of
W under/ then x x
\uj and .v 2 Ui.z . Hence, if
\W2/
I is in
Theorem 4.2 says that, f/is a linear function from :K" to 3t"\ then the
;
jih column of the matrix of/is/(e ). Because we can write any vector x in
;
x = c^ + . . . + c„e„,
Figure 11
Sec. 4 Linear Functions 29
Since x is an arbitrary vector in the domain of/, the last equation expresses
4.3 Theorem
/(
= "yi + ^2
J
is linear (why?), and Theorem 4.2 says that the matrix of/has as columns
the two vectors
- J, and = y.
/(J) /(J)
For example, if
then
[\
VI
'
1
and indeed the range of/is spanned by the columns of the 3-by-2 matrix.
= ty x + v .
30 Vectors and Linearity Chap. J
4.4 Theorem
is certainly one-to-one. For only one vector is carried into any other
vector by the rotation. Alternatively, the zero vector is the only vector
carried into the zero vector. In algebraic terms, the rotation is pro\ed in
Example 1 to be representable by
73
1
- x — —
n/3
v
Then Theorem 4.4 shows that the one-to-one property of/ is equivalent
to the equations
-'
2
"
1
v/ 3
Sec. 4 Linear Functions 31
having only the solution (x,y) = (0,0). This of course can be verified
directly by solving the equations.
If/ and g are any two functions (not necessarily linear) such that the
range space off is the same as the domain space of g we define the com-
positiong °f to be the function obtained by applying first /and then g.
More explicitly, if x is in the domain of/and f(x) is in the domain ofg,
then g °/(x) is defined as g(f(x)). If /(x) or g(f(x)) is not defined, then
g °/(x) is not defined.
Composition of linear functions lies behind matrix multiplication. In
introducing the concept of matrix multiplication, we considered a function
from .'H 2 to :K 2 given by
*\ = 3j>i - y2
?2 = 5)>i + 2y 2
3 2
and another from :K to .'K given by
yx = 2x + x 3x 2 + 4.y 3
JAj = Xi X2 -\- ^v 3 .
We then computed that the composition of the two functions was given by
the formulas
z, = 5*! + 10x 2 + \0x 3
/ 5 10 10\
of the composite function as the product of the matrices
\12 13 24/
'3 -l\/2 3 4\
5 2/ \ 1 — 1 2,
for the original functions. The following theorem states the important
fact that the composition of linear functions is always given by matrix
multiplication.
4.5 Theorem
B =
M e 2 =&*l)
gi*i)
Figure 12
v5
34 Vectors and Linearity Chap. 1
4.6 Theorem
0/ l-i/
«w>-(_3-(_;
/0 -1\
Therefore ^(x) = I x for all x and
(2\ /0 -l\/2\ / 1\ /0
which agrees with the geometric description off. See Fig. 13.
Linear and affine functions are often called linear and affine transforma-
tions, though we more often use the term function in this book.
Sec. 4 Linear Functions 35
'
(b) What matrix corresponds to reflection in the line through the origin
135 counterclockwise from the horizontal?
(c) Compute the product of the matrices in (a) and (b) and interpret the
result geometrically.
Am. (b) "
')
(
\ - 1 0/
6. (a) Find the 2-by-2 matrix .\f r corresponding to reflection in the line-
through the origin at an angle x from the horizontal. Check your
result against Exercise 5 for a 45 . a 135 . What is M\l
(b) Let -'
be another angle and compute the product M M«. X
Show that this
represents a rotation, and identify the angle o( rotation. When does
MM x p
Mp MS
7. (a) Show that
and (
8. Show that a function /from one vector space to another is affine if and only if
for all numbers rand all x, y in the domain off. [Hint. Consider the function
g(x) f(x) /(0).]
9. Let / be a linear function from a vector space 'i to a vector space 10. Show
that if x, x„ are linearly independent vectors in U and /is one-to-one,
then the vectors /(x,) /'(*„) are linearly independent in 10. [Hint. If
/ is one-to-one, and f(x) 0, then x 0.]
3 3
10. (a) Prove that if /is a linear function from :K to ft and /is one-to-one,
then the image / (I.) of a line /. by f is also a line.
(b) Show by example that, if the linear function of part (a) fails to be one-to-
one, then the image of a line by / mav reduce to a point.
(c) Show that if L ]
and I.
z
are parallel lines and /'is a linear function, then
/(/.]) and I (L,,) are parallel lines, provided that / is one-to-one.
11. Show that the composition of two affine functions from a space into itself
is affine. Suppose f(x) Ax b, ;mxi (\ d. Suppose*/ g)(x)
13. Show that an affine function A is one-to-one if and only if A(x) = A(0)
always implies x = 0.
To allow the full application of vector ideas to Euclidean geometry, we DOT PRODUCTS
must have a means of introducing concepts such as length, angle, and
perpendicularity. We will show in this section that all these concepts can
be defined if we introduce a new operation on vectors. If x (jcx xn) =
and y = (y\, ,yn) are vectors in Jl", we define the dot product or
. . .
x-y =x y +
1 1 . . . +xnyn .
Symmetry x •
y = y x. •
Additivity: (x -j-
y) • z = x z + y • • z.
3
the origin to the point with coordinates i \,. x 2 x3 ). ,
In :K the Pythagorean
theorem gives us a simple formula for the distance (see Fig. 14). Letting
|x| stand for the length of the sector x. we have
|x|- x\ \
x\ \
x\. (I)
Thus we see that the length of the vector x can be expressed in terms of the
dot product as \ x • x |x|. Note that we use the same symbol for the
length of a vector as for the absolute value of a number. Indeed, if we
think of a number as a one-coordinate vector, its length is its absolute
value. In :H" we define the length of a vector by the same formula that
works in 3l 3 |x| : \ x • x.
Next we would like to express the angle between two nonzero vectors
in :H". The usual convention is to take this angle 6 to be in the interval
: tt (see Exercise 1). The solution to the problem is provided by
the following theorem.
x y
Figure 14
5.2 Theorem
x-y
cos
W |y|
Proof. Let us apply the law of cosines to the triangle shown in Fij
|x — y|
2
= |x|
2
+ |y|
2
— 2 |x| |y| cos 6,
(x — y) • (x — y) = x • x -f y •
y — 2 |x| |y| cos 6.
Hence,
2x •
y = 2 |x| |y| cos 6,
yy= (-i) 2
+ i
2
= 2,
and
xy = -1+3 = 2.
Hence
x •
y = 0. (2)
have to write down three conditions, two for the perpendicularity require-
ments, using (2), and a third condition to assure length 2:
Xj -f 2x 2 + 3.v 3 = 0,
X, - X3 = 0,
x\ + x\ + x\ = 4.
These equations have the pair of solutions x = ±(vf, — vf, V§).
If n is a unit vector, that is, a vector of length 1 , then the dot product
n • x is called the coordinate of x in the direction of n. The geometric
interpretation of n • x is shown in Fig. 17. For since cos — (n • x)/|x|,
it follows that n • x is either the length of the perpendicular projection on
the line containing n, or else its negative. The vector (n • x)n is called the
component of x in the direction of n and is sometimes denoted x n .
Figure 17
Figure 18
)
we first find the unit vector in that direction. Since |(1, 1, 1)| = \ 3, the
(F . n)n = (W .(^A) , 2) n
V3 \3 3 3/
Any vector space on which there is defined a product with the properties
5. 1 is an inner product space. Thus %n is an inner product space,
called
and some other examples are given in Problems 8 and 9. Inner products
in spaces other than Jl" are used in this book only in Chapter 2 Section 4
and Chapter 5 Section 5.
In terms of the inner product we can always define the length or
norm of a vector by
|x| = VX • X.
The proofs of the first two are easy and are left for the reader to check.
The proof of the third is harder and will be taken up later, though we
remark here on its geometric significance, illustrated by Fig. 19.
The fact that length has been defined in terms of an inner product
leads to some properties of length that are not derivable from those
already listed in 5.3. First we prove the following.
Figure 19
Proof. We assume first that x and y are unit vectors, that is, that
or
xy < 1.
|x-y|
<1,
|x| |y|
cos 6 = ^^- .
|x| |y|
we get
|x + y|
2
= + y) (x + y) = |x| + 2x
(x •
2 •
y + |y|
2
<|x| + 2|x||y| +
2
= (|x| + |y|
2
|y|)
2
,
fa n a 12
A =
a\ 2 + a\ 2 + . . . =
and a lx a 12 + a 21 a 22 +
1 = 0. We form the matrix
. . .
A 1
, called the transpose of A, by reflecting A across its main diagonal. Thus
'«n 21 • •
a 12 a 22 .
5.5 Theorem
I 1 V3
\ 2 2
and so A~ x =A 1
.
The second of the above two square matrices is obtained from the first
by interchanging rows and columns. When two matrices are so related
we still say that one is the transpose of the other, even if they are not
square. Thus if
1 2 3
A =
4 5 6
then
T 4\
2 5
3 6,
Obviously,
(A 1
)
1
= A for any matrix A.
EXERCISES
1. Show that the natural basis vectors satisfy e,- • e,- = 1, and et
- • e, = if
i *].
2. (a) Find the angle between the vectors (1, 1, 1) and (1, 0, 1).
(b) Find a vector of length 1 perpendicular to both vectors in part (a).
4. Prove that the dot product has the properties listed in 5.1.
6. Find the coordinate of the vector x = (1, —1,2): (a) in the direction of
n = (1/V3, l/v/3, 1/V3) and (b) in the direction of the nonunit vector
(1,1, 3). (c) What is the component of x in the direction of n?
7. (a) Prove that any vector in 31" and n is a unit vector, then x can be
if x is
written as x = y +
z, where y is a multiple of n and z is perpendicular to
n. [Hint. Take y
to be the component of x in the direction of n.]
(b) Show that the vectors y and z of part (a) are uniquely determined. The
vector z so determined is called the component of x perpendicular to n
Sec. 5 Dot Products 45
8. (a) Consider the vector space C[0, 1] consisting of all continuous real-
valued functions defined on the interval <x<
[The sum of/ and
1.
<f,g)=\f(x)g{.x)dx
x * y = Xiji + 2x 2y 2
is an inner product,
(b) With length defined by jx] „.
= (x * x) 1/2 , sketch the set of points
satisfying |x|„. = 1.
10. Show that if |x| = |y| in an inner product space, then x | y is perpendicular
to x - y.
/'cos 9 —sin 6
ysin 6 cos 6
13. (a) Show that if A is a 2-by-2 orthogonal matrix, then \A\\ = |x| for all
vectors x in R2 .
15. Let
2-1
and B
3-5 2
Compute AB, BA, A B\ and B A l l l
.
H>. (a) I or any two matrices A and //. show that it' /)# is defined, so is Z?'/F,
. and that IV A' (Mi)'
(b) Slum that if I in imciliblc, then so is .1'. and that (/f) '
(/J
]
)'.
si ( HON (,
II (I 11)1 \\ In this section we develop some facts and formulas about lines and planes
GEOMETRY in :ii- and 3l 3 . Some of the ideas will reappear in Section 1 of Chapter 3.
2 3
Recall that if x, is a nonzero vector in 5l or :l\ , then the set of all
numerical multiples tx }
is a line L passing through the origin, and any
line parallel to L„ consists of the set of all points /x, f-
x , where x ()
is some
fixed vector. An alternative way to say the same thing is: a line is the
range of a function of the form f{t) tx x„, where x, }
0.
parallel toL, and the points /(a 2 a,) a, make up L itself. For example, |
when I \vc get a, and when / we get a 2 See Fig. 20. Alternatively,
. I .
Figure 20
3
To determine a plane in .'fi , we take two noncollinear vectors x, and x 2
(that is, so that neither is a multiple of the other) and consider all points
ux + t'x 2 where u and v are numbers. These points form a plane P
x
through the origin. A parallel plane will then consist of all points u\ + l
g 3
saying that a plane is the range of a function 'M- > .'H , where g(u, v)
passing through them, they must not lie on a line. (Then the vectors a 3 — a,
of all points
w = iv(a 3 - aO + r(a 2 - a,) + a,.
Sec. 6 Euclidean Geometry 47
Figure 21
p • (x -x = ) (1)
p •
Xi = U l U 2 Vi — UiU 3 V 2 +u 2 u3v x —u 2 u x v3 + u u v — U U V = 0.
3 t 2 3 2 X
In other words, the given plane consists of all points (x, y, z) with co-
ordinates satisfying that equation.
The simplest way to sketch the plane satisfying an equation like (3)
is to pick three points with simple coordinates that satisfy say (0, 0, f), it,
(0, \ , 0), and (— f , 0, 0). Then locate the points relative to coordinate
axes and sketch the plane using the three points as references. We have
purposely chosen in our example points that lie on the coordinate axes.
See Fig. 23.
Figure 23
If p is a nonzero vector in 2
ft and x is a point in ft
2
we can determine ,
This is one of the several forms for the equation of a line in the xv-plane.
The slope of the line is evidently —p /p 2 1
.
6.1 Theorem
But n • n = 1, so we obtain t +n •
y = c, from which the theorem
follows.
Hence the distance is f . Notice that the equation of the plane could also
be written 3x — 4z = —3 and, in normalized form, (f)x — (|)z = — f.
EXERCISES
1. Write a formula in the form/(r) = tin x + x for a line containing x and
parallel to \ v In each case determine whether the point a lies on the line,
that is, whether a is in the range off.
5. Find an equation for the line in :ft 2 that is perpendicular to the line tx x + x„
and passes through the point y 2 .
8. For each of the points and planes or lines listed below, find the distance
from the point to the plane or line.
9. (a) Verify that the cross-product of x x and x 2 [formula (2) in the text] is
perpendicular to x 2 .
(b) Find a representation for the line perpendicular to the plane consisting
of all points w(l, 2, 1) + v( — 1, 0, 1) + (1,1,1) and passing through the
origin.
2x + 3y + z = 1
(b) Do the same as in part (a) for the general equation p • (x —x) = 0,
when p ^0. [Hint. If p ^=0, then p has a nonzero coordinate.]
x = (x • e^e! + (x • e 2 )e 2 + (x • e 3 )e3.
(b) If the vector x in part (a) is a unit vector, that is, a vector u of length 1
show that u • e^ = cos a,, where a, is the angle between u and e,. The
coordinates cos a t
are called the direction cosines of u relative to the
natural basis vectors e,. If x is any nonzero vector, the direction
cosines of x are defined to be the direction cosines of the unit vector
x/|x|.
(c) Find the direction cosines of (1, 2, 1).
(d) Show that parts (a) and (b) generalize to &n .
12. Let u and v be points in ft". Show that the point ^u + ^v is the midpoint of
the line segment joining u and v.
3
13. Let u and v be noncollinear vectors in :R . To find a vector x perpendicular
to both u and v, we solve the equations u • x = and v • x =0, that is,
Sec. 7 Determinants 51
solve
u xx + u 2y + u3z =
(*)
v xx + v 2y + v3z = 0,
(u x v 2 - u^Jx = (u 2 v 3 - u3 v 2)z.
(x, y, z) = {u 2 v 3 - u3 v 2 u 3 v 1
,
- u x v3 u x v 2
,
- u 2 v x ).
Then
-9
5, An
4 2
-5 -6^
a 23 = 0,
-3 4)
=
(4), &la =
det A = a.
= a n det y4 n — a 12 det A l2 + . . .
— (— \)"a ln det A ln .
In words, the formula says that det A is the sum with alternating signs, of
the elements of the row of A, each multiplied by
first the determinant of
its corresponding minor. For this reason the numbers
(_ 1)2+1 det /
) =40
\ 4 2/
The factor (
— 1)'>J associates plus and minus signs with det A n according
to the pattern
/+- + -••
+ - + • •
+ - + • •
Sec. 7 Determinants 53
Example 2.
1 2\
(a) det [
= 1 (4) - 2(3) =4-6= -2.
la b\
(c) det I = ad — be.
®Z\X\ I #22-^2
= '"
2°
equation
(a 22 a n - a 12 a 2l )x l = (a 22 r x - a 12 r 2 ).
is
I
W
the result of replacing the
"J
is the matrix ol coefficients
first column of A by
,-
see in Theorem 8.4, a similar result holds for systems of // linear equations
in /; unknowns, for all values of n.
Since our definition of determinants by 7.1 is inductive, most of the
proofs have the same character. That is, to prove a theorem about
determinants of all square matrices, we verify it for 1 -by- 1 (or in some
cases. 2-by-2) matrices, and also show that if it is true for (// - l)-by-
(// 1) matrices, then it holds for //-by-" matrices. In the proofs, we
give only the argument for going from step // I to step //. The reader
should verify the propositions directly for 1 -by- 1 and 2-by-2 matrices;
the verification is in all cases quite trivial. A, B, and C will always denote
n-by-n matrices. We write a, for the /lh column of a matrix A. If A has //
7.2 Theorem
detB=i(-l) tn />,,detB u .
A-=l
7.3 Corollary
Example 3. Let
1 2 3\ / 1 6 3N
1 2 41, 5=1-164
12/ \ 3 2,
=0+4-3=1
det B= - 6(-2 - 0) +
(1)(12 - 12) 3(-3 + 0)
7.4 Theorem
while
Cij det C = 1;
a Xj det C +
1? fe
1;
- det CXi
= a t j det Au + 6 l3 det - 2? l3 .
Hence
fc+1
detC=i(-l) c u.detC 1 ,.
= det A+ det B.
56 Vectors and Linearity Chap. 1
Example 4. Let
1
Sec. 7 Determinants 57
(Xi bx cx
x2 b2 c2
x3 b3 c3
lb 2 c2
\
lx 2 c2
=x x det — bi det
\
-f c x det
\b 3 cj \x 3 c3 J \x 3 u3
= Xiibfa - b 3c 2 ) - - x c + c^xfo - x b
b x (x 2 c 3 3 2) 3 2)
7.6 Lemma
If B is obtained from A by exchanging two adjacent columns, then
det B = — det A.
Proof. Suppose A and B are the same, except that a ; = b J+1 and
a J+1 =.bj. For k #y or j -f 1, we have b lk Blk = = a lk and det
— detA lk by the inductive hypothesis, so Blk = (— l) k+1 b lk det
— (— l) u det A lk On
fc+1
tf the other hand b Xj = a lj+1 and Bu =
.
a lj+1 detA lj+1 Similarly (-\)> +2 b lj+1 det B 1J+1 = (-l) 3+1fl„ det A
.
v .
Then
det = (1)(8 - 5) - (3)(-4 - 3) + (-2)(l0 - (-12))
A
= 3 + 21 - 44 = -20
det B = (1)(5 - 8) - (-2)(10 - (-12)) + (3)(-4 - 3)
= -3 + 44 - 21 = 20
= -det A
,
1.1 Theorem
by Lemma 7.6 each step changes the sign of the determinant. Since
2k + 1 is an odd number, det B = — det A.
7.8 Theorem
/3 0\
,
Example 6. Multiplication by gives a function Jl 2 -> 51 2 which
J
1 2
• Z \
For another example, consider the function g given by the matrix I
which has determinant 1. Its effect is illustrated in Fig. 24(b). The unit
square is mapped into a parallelogram with the same base and altitude,
so the area remains unchanged. The composition g of multiplies areas
by 6 [since f(S) has 6 times the area of S and g(f(S)) has the same area
as/(5)]. The matrix of g °/is given by the matrix product
Sec. 7 Determinants 59
(0,2)
(0,1)
60 Vectors and Linearity Chap. 1
If A and B are any two square matrices of the same size, then
det (AB) = (det^)(det5).
Proof. Let
L(Xj, . .
.
, x„) — det A det (x ls . . . , xn) — det (Ax Y , . . . , Ax n ),
where x l5 . . . , x„ are vectors in 51". Clearly L is linear as a func-
tion of each vector Xj. Furthermore, L(e, , . . . , e, ) = for any
set {e, , . . . , e, } of natural basis vectors. The reason is that if
Lib,, . .
. , b„) = I.(i><A, . . . , jU»e<)
Hence,
det A det B- det /*J? = det A det (b lt . .
.
, bj
- det (Ab u . . . , Ab n )
= L(bls . . . , b„) = 0.
Note that the proof just given uses only Theorems 7.5, 7.7, and 7.8
and does not use Theorem 7.9. (The point is important because the
product rule is used in the proof in Section 7 of the Appendix, on which
the proof of Theorem 7.9 depends.)
Sec. 7 Determinants 61
The natural unit of area in 3l 2 is given by the unit square with edges
(1,0) and (0, 1), and the natural unit of volume in iR 3 is given by the unit
cube with edges (1 0, 0), (0, 0), (0, 0, 1). In general we take the unit of
, 1 ,
volume in 31" to be that of the cube whose edges are the natural basis
vectors e l5 e„ that form the columns of the n-by-n identity matrix.
. . . ,
tn satisfies the condition < tt < 1. The resulting set of points is called
the parallelepiped determined by its edges x x x n If we choose only , . . . , .
Figure 25
7.11 Theorem
Let a l5 a 2 ,
. . . , a n be n vectors in 31". Then the volume of the
parallelepiped with edges a l5 . . . , a„ is |det (a l5 . . . , a„)|.
the factor |det (a l5 a„)|, and the cube has unit volume, the
. . . ,
r x cos X
r2 cos 0;
Example 7. Let a x , a .. The vectors have
r x sin X
r 2 sin 2
Figure 26
r x cos L r2 cos 2
det /^(cos X sin 2
— sin 6 X cos 2)
/*! sin d 1 r 2 sin 2
= r x r 2 sin (0 2 — X)
Figure 27
with coordinates
«2
64 Vectors and Linearity Chap. I
Figure 28
ordinary volume is very much like the relation between directed distance
on a line and ordinary distance. Indeed, oriented volume may be con-
sidered a generalization of directed distance, and we use the idea in
Chapter 7, Section 7.
EXERCISES
1. Find AB, BA, and the determinants of A, B, AB, and BA when
/l -2
(a) A [Arts, det AB = -14.]
,3 1
'2
| 3
v
4
(a)
66 Vectors and Linearity Chap. 1
3. Show that if D is the diagonal matrix diag (/-,,..., r„), then det D =
/,;_, . . . rn .
6. Apply the product rule to show that, if A is invertible, then det A # and
(det A l
) (det/4)- 1 .
7. Let /4 be an m-by-m matrix and B an n-by-n matrix. Consider the (//; /;)-
IA Ov
by-(/w «) matrix which has A in the upper left corner, B in the
\0 B/
lower right corner, and zeros elsewhere. Show that its determinant is equal
to (det /l)(det B). [Suggestion. First consider the case where one of A or B
is an identity matrix, and derive the general result from the product rule.]
10. Find the volume, and the area of each side, of the parallelepiped with edges
(1, 1, 0), (0, 1, 2), (-3, 5, -1).
[Ans. volume = 17, areas = Vl66> V66, 3.]
13. Find a representation for a line perpendicular to (2, 1, 3) and (0, 2, —1),
and passing through (1, 1, 1).
3
14. Let P be a parallelogram determined by two vectors in :K . Let Px P v and
, ,
A 2 (P Z ).
15. (a) Verify by direct coordinate computation that |u x v|
2
= |u|
2
|v|
2 -
2
(u • v) .
(b) Use the result of part (a) to show that |u x v| = |u| |v| sin 0, where 6 is
(c) Show that |u| |v| sin is the area of the parallelogram with edges u and v.
16. The complex numbers can be extended to the quaternion algebra JC, which
is a four-dimensional vector space with natural basis {1, i,j, k). Thus a
R
(a) Show that the quaternion product of two elements of S is not necessarily
in S.
(b) Define a product on S by first forming the quaternion product and then
replacing its real part by zero. Show that the resulting product is the
same as the cross-product in R3 .
a = fliUj + o2u 2 + fl
3u3
b = b^ + b 2u 2
c = c^.]
SECTION 8
det^=i(-l) m a 1;.det,4 iy ,
where, in general, A i}
; denotes the (« — l)-by-(« — 1) minor corresponding
to a tj . In the present section we prove more general formulas of the same
kind. These formulas, which apply to any n-by-n matrix A, are
Formula 8.1 holds for each integer j between Formula 1 and n, while
8. 1 R holds for each integer Formula 8.
i between 1 and n. (For / = 1 , 1
postpone the proof of the formulas to the end of the section, first showing
some of their consequences.
Example 1. Let
12 3 4\
A = I 5 6 7
\8 9 0/
/3 4\ (2 4\ (2 3
-5 det +6 det - 7 det
\9 0/ \8 0/ \8 9
IS 6\ (2 3\ (2 3
4 det - 7 det + det
\8 9/ \8 9/ \5 6
= (4)(-3) - (7)(-6)
= -12 + 42 = 30.
2= (-iy+^det^,,
t' l
where x 1 , . . . , x n may be any set of n numbers. From 8.1 we see that this
is equal to a certain determinant; in fact it is the expansion by the yth
column of the matrix obtained from A by replacing column with
they'th
(Xj, . . . , -Y,,). Now consider what happens if we x n equal
take xlt . . . ,
8.2 Theorem
For any A7-by-/? matrix A,
i(-D'
!
det Au
:
v/ i\f+; a a
(det /I if k = i
The number (— \)
i+i det Au is called the ijth cofactor of the matrix A.
We shall abbreviate and write A for the matrix with entries a ti
it as a u .
the y'th column of A) and the kth column of A. That is, it is the sum
n
^ a tj a lk . By Theorem 8.2 this is det A if y = k, and otherwise. Hence
«=i
A'A is equal to (det A)I, a numerical multiple of the identity matrix. A
similar calculation using 8.2 shows that AA is also equal to (det A)I. l
8.3 Theorem
12 3 4\
A = j
5 6 7
\8 9 0,
70 Vectors and Linearity Chap. 1
To obtain the matrix of cofactors, insert the factors ( — l) i+i , changing the
sign of every second entry and giving
63
Sec. 8 Determinant Expansions 71
the other hand, A(A~ l \>) = (AA'^b = b. In other words, the equations
have a unique solution, and it is /l _1 b. They'th entry in the column vector
A~ x
\t is the matrix product of the y'th row of A~* and the vector b. If
det A 7^ 0, we may express the elements of A' 1
in terms of cofactors of A
and obtain
for this product. From 8.1 this may be recognized as (det A)' 1 det B U)
where B U) is the result of replacing they'th column of A by b. We have
proved:
det B U)
x< = ,
det A
xx —2x 2 +4x 3 = 1
2x x +3x 2 —x = 3 3.
We have
1 -2
-1 1
2 3
1 1
-1 2
2 3
72 Vectors and Linearity Chap. I
We have not made any assertions in this section about what happens
if the determinant of the coefficient matrix of a system of equations is zero.
It an easy consequence of the product rule that a matrix with zero
is
Since 8.1 withy = 1 is exactly like 7.1 except that it refers to the first
column instead of the first row of a matrix, it is clear that transposing a
matrix (which just exchanges the roles of rows and columns) should not
affect the value of the determinant. The formal statement and proof
follow.
8.5 Theorem
of the matrices. We shall not always bother to write out these corre-
sponding theorems, but shall refer to the "row" version of a numbered
statement by using the same number with an R after it. (We have already
numbered 8.1R and 8.2R to conform to this convention.) For example,
Theorem 7.2R would read: If B is obtained from A by multiplying some
row by the number r, then det B = r det A.
Theorem 8.5 implies that, from any theorem about determinants that
involves columns of matrices, we can derive a corresponding theorem
involving rows instead, by applying the given theorem to the transposes
of the matrices. In particular, 8.1R is a consequence of 8.1 and 8.5. We
shall not bother to write out the row versions of the other statements
but may refer to them by the original statement number with an R after it.
8.6 Theorem
-4
The third column of C is equal to the third column of A plus 2 times the
first column. As in Example 5, Section 7, det A = —20. Then det C =
—20. As a check,
(b) Let
A =
Sec. 8 Determinant Expansions 75
8.7 Lemma
If the first column of the matrix A is e8 , then
3 =1
column of A is e l5 then a u = 1, while for > 1 the first
If the first /'
holds for the case i = 1 For / > 1 we need to use the inductive
.
det ,4 = 2(-l)> +1 a l3 .
det Ar
3 =2
Each minor A Xi has zir_x for its first column. (Removing the top entry
from the vector e f in 31" gives e z _! in 3l n_1 .) By the inductive
hypothesis,
3=2
By Formula 7.1 the right side of this equation is (— l) i+1 det i?,
Proof of 8.1. We first prove 8.1 for the special case j = 1. The first
+ flni det ( e n, a 2 , . . . , aj
= |(-ir a det^ l
il 1.
j=i
76 Vectors and Linearity
Thus we obtain
= i(-l)*"a„det4„
as was to be proved.
EXERCISES
1. Using appropriate cases of 8.1 or 8.1R, express each of the following as a
linear function of the jc's.
(a)
(b)
Sec. 8 Determinant Expansions 77
(a) Ix + 6y = 5
'
6x + 5y = -3.
(b) 2x +y =
3y + z = \
Az + x = 2. M«5. x = --£ 2 S -I
(c) x x + x 2 + x3 + x4 = — 1
x — x2
1 + 2x4 =
->x 1 — X3 == *
x2 — xt = 0.
/"'
78 Vectors and Linearity Chap. 1
(b) Consider tridiagonal matrices in which the entries on the diagonal all
have the value 2 and the entries next to the diagonal all have the value 1.
Let dn be the determinant of an n-by-n matrix of this type. Find a formula
for dn [Suggestion. Start out by seeing what happens for n = 2, 3, 4.]
.
2
Linear Algebra
SECTION 1
a u Xi + . . . + a ln x n = bx
1.1 ...
a mlX l + • • • + a mn X n = t>m
where the a's and fs are given and the ;c's are to be determined. The whole
system can be written in matrix form as Ax = b, where A is the m-by-n
coefficient matrix with entries o i3 b is a column vector in %
,
m and x is a
,
79
80 Linear Algebra Chap. 2
Example 1.
3x + \2y + 9z = 3
2x + 5y + 4z = 4
-x + 3y + 2z = -5.
Multiply the first equation by \, which makes the coefficient of* equal to 1
and gives
x+ Ay + 3z = 1
2x+ 5y + 4z = 4
-x + 3y + 2z - -5.
Add (—2) times the first equation to the second, and replace the second
equation by the result. This makes the coefficient of x in the second
equation equal to and gives
x+ 4y + 3z = 1
- ly - 2z = 2
—x + 3y + 2z = —5.
Add the first equation to the third, and replace the third equation by the
result, to get
x + Ay + 3z = 1
- 3y - 2z = 2
ly + 5z = -4.
x + Ay + 3z = 1
y+&= —§
ly + 5z = -A.
Sec. 1 Linear Equations, Inverse Matrices 81
Add (—4) times the second equation to the first, and (—7) times the
second equation to the third, to get
x ~r ~zZ = ~3—
y + §z = -I
17
3^
—
— 3- 2
x ~r 3^ = "3"
y + |z= -f
z= 2.
Add (—5) times the third equation to the first and (— f) times the third
equation to the second to get
x= 3
y = -2
z= 2.
Clearly, this sytem has just one solution, namely, the column vector
have found a solution for them. This verification of course does not rule
out the theoretical possibility that the original equations might have other
solutions as well. In fact the final system is equivalent to the original
system and has the same set of solutions. The same is true for any pair of
systems where one is obtained from the other by steps such as were used
in this example. Before we can prove this, we must first state exactly
-1
1 |x = I -2
v
1/ \ 2,
which, since the matrix on the left is the identity matrix, simply amounts
to saying that x is equal to the vector on the right.
The theorem that justifies our method of solving linear equations is as
follows.
Sec. 1 Linear Equations, Inverse Matrices 83
1.2 Theorem
-3
Add (— V) times the row to the second row, and then add 3 times the
first
first row to the third row to produce zeros in the second and third entries
of the first column, and obtain
'1 -2 -3\ / 2^
1 5|x= j
-6
,0 -1 -5/ \ 6;
Add 2 times the second row to the first and then add 1 times the second
84 Linear Algebra Chap. 2
1 7\ /-ION
1 5 )x = I -6
,0 0/ \ Oy
x +7z=-10
y + 5z = -6
Ox + Oy + Oz = 0.
The third equation is satisfied for any values of x, y, z. The first two
equations may be rewritten as x — — 10 — Iz and y = —6 — 5z. Thus,
for any value of z,
is a solution, and every solution has this form for some value of z. We
have now described the set of solutions of
and by Theorem 1.2 we know that this is the same as the set of solutions
of the matrix equation we started with.
1 -2 -3\ /2\
\ -2 —^ x= 7
-3 5 4/ \2,
Sec. 1
2. Carrying out
85
ION
2y
/, 7^
Whatever x is, the third row in the product 10 1 5 |x will be zero
— 0—
\0 0/
because the third row of the left factor is Thus no value of x can give
zero.
a column vector with 2 in the third row, and we conclude that the equation
has no solution.
equations, we obtain
x + 7z=-10
y + 5z = -6
Ox + Oy + Oz = 2.
The last equation obviously cannot be satisfied for any values of x, y, z.)
1.3 Theorem
'2 4
A =
We start with
4
-3 -1,
Add —2 times the second row to the first and —1 times the second row
to the third to get
'0 4 8\ /l -2 0\
1
Sec. 1 Linear Equations, Inverse Matrices 87
Multiply the first row by \ and then add 3 times the first row to the third
to get
'0 1 2\ n -\ ON
o I, I 1
Multiply the third row by — 1 and then add —2 times the third row to the
first to get
'0 1 0\
1 I.
1,
v
1 0\ /
oio, 1
.0 1/ \-|
The last matrix on the right is A~ x , as may be verified by multiplying by A.
EXERCISES
1. Solve the following systems of equations:
'
/
88 Linear Algebra Chap. 2
1 2
(a)
5 6
Ans.
SECTION 2
SOME In this section we shall look at some applications of vectors and linear
APPLICATIONS equations. The selection of examples is made so as to avoid technical
complications from the fields of application. Several of our examples
involve the notion of a network, which we define to be a finite collection
of points, or nodes, some of which may be joined to some others by line
segments. It is theoretically unimportant whether a network is visualized
as lying in 2-dimensional space or 3-dimensional space; we choose which-
ever is pictorially more convenient. Some networks are illustrated in
Fig. 1.
Sec. 2 Some Applications 89
Figure 1
(»< - v t)
(1)
where c u is the current flowing from node to nodey, r a is the resistance /'
of the connection between nodes i andy, and vt and vt are the values of the
electrical potential at nodes
andy. (The appropriate units of measure-
/'
Figure 1(a) shows a circuit with four nodes and five segments, with
the resistance of each segment indicated beside it. Suppose an external
power source is connected at nodes 1 to 4 to maintain values v 1 = 12
and v4 = 0. Since node 2 has no external connection, the current flowing
in must balance the current flowing out, so that if signs are taken into
account, the sum of the currents out of node 2 must be zero. Using (1),
we get the equation
which are the reciprocals of the resistances in the lines joining node 2 to
the others. A similar equation will hold at any node that does not have an
external connection. Thus at node 3 we get
+ fy8
-v 2 2.
^(12 — 6.22) = 0.96. The total current flowing from node 1 into the rest
of the network is then 2.33 + 0.96 = 3.29, which must of course be equal
to the current flowing into node 1 from the external source.
-»-f,
(a) (b)
Figure 2
,
then by definition
r =f + 1 f2 +f = 3 (l,2).
But suppose we are given the directions of the three force vectors
and are asked to find constants of proportionality that will produce a
given resultant, say, r = (— 1, — 1). In other words, suppose we want to
find nonnegative numbers clt c 2 c 3 such that
,
Cjfa + c 2 f2 + c 3 f3 = (— 1, — 1).
(A negative value for some one of the c's would reverse the direction of
the corresponding force.) This vector equation is equivalent to the system
of equations we get by substituting the given vectors (1) that determine
the force directions. We find
Ci
IMaMiH-i) (3>
—c +x 4c 2 — 2c 3 = —1,
3c +x 3c 2 — 4c 3 = — 1.
C\+ c2 = 1
with magnitudes
IcAl = fVlO, |c 2 f2 |
= 2, |c 3 f3 |
= 2^5.
the zero vector. We would then have replaced the vector (— 1 , — 1) on the
right side of Equation (2) by (0, 0), and solved the new system in a similar
way.
O l«5
(a) (b)
Figure 3
Pi = £ + (£)/>2
Similarly, because going to a 4 does not occur in the events we are watching,
Pi = (¥)Pi + i\)Pz
Pa = (*)/>•
Pi - (i)/>2 = a
-(£)/>! +/>2-(i)/>3 =
— (h)P2 + Pz =
Sec. 2 Some Applications 93
Pi = f» Pz — t> />3
= t-
It appears that, the nearer we the more likely we are to get to a 5
start to cr
5 ,
(a)
Figure 4
any joint has the effect of closing off the external pipe at that joint.) We
assume that the inflow at any joint must equal the outflow. Thus at the
upper left corner in Fig. 4(a) we find t1 r1 + r 2 while at the lower left = ,
riH
94 Linear Algebra Chap. 2
1 1 o\
-1 0-1 1
0-1 1
0-1 1
The vector equation shows that there is a linear relation between t and r,
/i + r2 =0
— »i — r« + r5 =0
—r 2 + r3 =0
r6 + r< = 0.
A simple check shows that these assignments for r x through r 6 will satisfy the
above system. It follows that there are infinitely many different pipe flows
that will produce zero external flow at each joint. Similarly, if we have
a solution r of the original system (4) for some given vector t on the right
side, then by the superposition principle any of the solutions r can be
added to r to give a new solution r + r; we have
A(t + r) = Ar + At
= +t= t.
I
"
'f(x) dx ^ uf(a - h) -:-
if (a) + wf(a + h)
Ja-h
where it, v, and w are constants. If the formula is to be correct for all
polynomials of degree less than or equal to 2, it must in particular be
correct for the polynomials f (x) = U /i(-v ) = fix) = x 2 Each
x, ar>d .
have
l+
f (x) dx = 2h and fQ (a - h) =f (a) =/„(a + h)
I -A
so u +v+ u' = 2//. Using /^.v) = .r and 2 (.\)
= .y
2
similarly gives the
equations
(a — h)u + av + (a + h)w = 2ah
and
(a 2 - h2 - 2ah)u + a2 v + (a 2 + h2 + 2a/2)tv = 2a 2 /? + f/?
3
.
These equations are easily solved (see Exercise 13) and give the result
u = vv = \h, v = ^h.
We have obtained a rule which is correct for the particular polynomials
f ,fi, and/ Its correctness for any quadratic polynomial follows readily.
2 .
Let us write E(f) for the error committed when the rule is applied to a
general function/, so
a+k
E(f) - \
Ja—h
fW ^ - \hf(a - h) - if (a) - \f{a + h).
It is easy to see that £ is a linear function from the vector space of poly-
nomials to JR.
1
, and of course E(f )
E{f^) E(f2 ) = = = by construction.
If/is any quadratic polynomial, so f(x) px 2 qx = + + r, then/= pf2 +
qf x
-^
f and by linearity
EXERCISES
Figure 1(b) shows an electrical network with the resistance (in ohms) of each
edge marked on Suppose an external power supply maintains node A at
it.
2. The edges and vertices of a 3-dimensional cube form a network with 8 nodes
and 12 edges. Suppose each edge is a wire of resistance ohm, and that two 1
at the other vertices, and the current flowing in the external connections if
(a) the two vertices with external connections are at opposite corners of the
cube
(b) they are at the two ends of an edge
(c) they are at opposite corners of a face of the cube.
3. (a) Suppose that three forces acting at the origin in ft 3 have the same
directions as (1, 0, 0), (1,1, 0), and (1,1, 1). Find magnitudes for the
forces acting in these directions so that the resultant force vector will be
(-1,2,4).
(b) Can any force vector be the resultant of forces acting in directions
specified in part (a)?
5. Ifforcesactin Jl 2 in the directions of (2, 1), (2, 2), and (-3, -1), show that
magnitudes can be assigned so that the system is in equilibrium.
6. Suppose that a random walk traverses the paths shown in Fig. 3(a). What is
7. (a) Suppose that a particle traces a random walk on the paths shown in
Fig. 3(b). Letting pk be the probability of going from b k to b 6 without
going through b b fmd ,
pk for k = 1,2, 3, 4.
(b) How is the result of part (a) modified if b t and the path leading to it are
eliminated altogether?
(c) How is the result of part (a) modified if a new path is introduced between
6 4 and A 6 ?
\0 1 3/
12. (a) Let the external flow vector in Fig. 4(b) be given by t = (1, 1, 2, 4).
Show that there is more than one consistent internal flow vector r, and
find two of them,
(b) Let the external flow vector in Fig. 4(b) be given by t = (1,0, 1, 1).
13. Carry out the solution of the equations for u, v, w given in Example 5
of the text. [Suggestion: begin by subtracting a times the first equation
from the second and a 2 times the first equation from the third.]
a+Sh
f(x) dx = tf(a) + uf{a + h) + vf(a + 2A) + wf(a + 3/0
I
Ja
SECTION 3
In this section we are concerned with formalizing the method of solving THEORY OF LINEAR
linear equations that has been illustrated in the two preceding sections, EQUATIONS
and in proving that it always works. We begin by giving a precise definition
for the "noninterference" property discussed informally after Example 3
3
Linear Algebra Chap. 2
3.1 (a) Every column containing a leading entry is zero except for the
leading entry.
(b) Every leading entry is 1.
1
Sec. 3 Theory of Linear Equations 99
3.3 Theorem
n
way satisfies Ax = b. The /th entry in the product is 2 a ikxk- We
a:=i
and by assumption b is then also zero. If the /th row is not zero,
t
Example 2. Consider
W 1 5 0\
= .12
3 0.
A , bx = ^ ,
b2 =
^0 1,
The third row of A is zero and the third entry of \i x is not; so according to
Theorem 3.3 the equation Ax = b x has no solution. On the other hand,
the third entry of b 2 is zero. The pivotal columns of A are the first, third,
and fifth, with associated rows the second, first, and fourth. The proof
of the theorem shows that, if we construct
100 Linear Algebra Chap. 2
by making the first, third, and fifth entries equal to the second, first, and
fourth entries of b 2 and making the other rows zero, then Ax will be equal
,
The next problem is how to tell whether an equation that has a solution
has more than one. Theorem 3.4 and its corollary show that the question
can be reduced to a special case, and theorem 3.5 deals with this special
case.
3.4 Theorem
Corollary
homogeneous equation.
3.5 Theorem
and
+ s
are solutions of
violated. If the column contains the leading entry r for the /th row,
_1
multiplying the /th row by r will make the leading entry 1. (Since r was
3.6 Theorem
-2
Column 1 does not satisfy 3.1(a), but has a leading entry of in the first 1
1 3 -2 0\
3-2 0\
(1
1
Adding 2 times row 2 to row 1 and (—2) times row 2 to row 3 clears out
the other entries in column 3 to give
1
104 Linear Algebra Chap. 2
3.7 Theorem
3.8 Theorem
We thus see that it is sensible to apply the method used for computing
method fails, then one obtains a reduced matrix with at least one non-
pivotal column and, hence, by Theorem 3.5, a nonzero x such that Ax = 0,
which demonstrates that A is not invertible.
We conclude by observing that every elementary operation, when
applied to a column vector in %
n
is a linear function, called an elementary
,
on the left by some n-by-n elementary matrix. (The precise forms of such
matrices are described in Exercise 7 at the end of this section.) This fact
enables us to prove the next theorem.
3.9 Theorem
f(x) = Ax
can be expressed as a composition of elementary transformations.
EXERCISES
1. Determine which of the following matrices are reduced. For those that are
not, state exactly how they violate the conditions. For those that are
reduced, list the pivotal columns and their associated rows.
1
Linear Algebra Chap. 2
3. Let
5. Show that if a square «-by-« matrix is reduced and has no all-zero row, then
every row and column contains n — 1 zeros and one 1. Hence show that it
(a) Denote by D t
(r) a matrix which is the same as the identity matrix
except for having r in place of 1 in the /th diagonal entry. Show that the
Sec. 3 Theory of Linear Equations 107
matrix product
D (r)M
f
(b) Denote a matrix with 1 for its //th entry and 0's elsewhere by Eu . For
example, for 3-by-3 matrices,
(I + rEvT1 = (/ - rE ),
i} D&T X
= />,(;] ,
and Tj = Tu .
8. Prove that if there are more unknowns than there are equations in a linear
system, then the system has either no solutions or infinitely many.
are any n + 1 numbers, then there is exactly one polynomial of degree < n
such that/?O ) = b ,p(x n ) = b n [Hint. Show that the problem leads
, . . . .
11. A reduced matrix in which the first pivotal column (starting from the left)
is associated with the first row, the second pivotal column is associated with
SECTION 4
VECTOR SPACES, In the earlier parts of the book we have restricted ourselves to vectors in
SUBSPACES, n
'Ji . In this section we consider more general vector spaces, though some
DIMENSION
of the ideas have already been introduced with ii\
2
and Jl 3 as the main
examples.
Recall that a vector x is a linear combination of the vectors xXl . . . , xn
if there are numbers rlt . . . ,r„ such that
x = r1 x 1 + ..'.+ rn \ n .
Example 1. Let
l
= (1,0,0), x2 = (0,1,0), x3 = (0, 0, 1), k = 0,1,1).
Then y = (2, 2, 0) is a linear combination of x 1 and x 2 because it is
s; so y = rx + sx is impossible. 2 3
If a set of vectors lies in a plane through the origin in 3-space, then every
linear combination of them lies in the same plane —
x and y recall that if
are in a plane through the origin, the parallelogram rule makes x + y
a vector in the same plane. Any numerical multiple of x lies in the same
plane because it lies in the line containing x. Any linear combination of
x 1} . . . up by multiplications and additions, and if the vectors
, x„ is built
Xj, x„ lie in a plane, so do all linear combinations of them.
. . . ,
Similarly, if x x x n all lie in one line through the origin (so they
are all multiples of some one vector), any linear combination of them
lies in the same line.
(a) The set of all vectors in %n with first entry equal to is a subspace,
since any linear combination of such vectors will also have a for its first
entry.
.
(b)
C
For any vector space U,
C
U itself is a subspace. The term proper
subspace is often used to refer to those subspaces that are not the whole
space. In any vector space the zero vector forms a subspace all by itself.
(c) For any linear function 17 — > ID, the set JC of vectors x in U
with/(x) = is a subspace of T) called the null space off. If x l5 . . . , x fc
k k
are in JV and x = 2 r t x i> tnen /( x ) = 2 r if( x i) = 0> because / is
linear and all the/(x,) are 0. Hence x is also in JV, and so JV is a subspace
C
of \J. In particular, the set of vectors (x, y, z) in 'A 3 such that
x + 2y + 3z = 0,
or equivalently,
(1
3
is a subspace because it is the null space of the linear function from 'Ji to
Jl defined by the preceding l-by-3 matrix.
the range off means that there are vectors \ lf x k in the domain of / . . . ,
=f(r1x 1 + . . + rk x k
. ).
5 =x
)0
is a subspace because it is the range of the linear function just defined by
the above 3-by-2 matrix.
n
vectors x x x which are in the span of S, that x = £ s a u 3 f° r
, . . . , fc ,
kin \
is,
n
t
3=1
some vectors u_, in S. Then x = ^ rA ^= s^uA = £ ^, where /
;
=
k = i l \ 3 1 / = 3 1
1 rx+ sx = (r + s)x.
2. rx + ry = r(x -f y).
3. r(sx) = (rs)x.
4. x + y = y + x.
5. X + y) + z = x + (y +
( z).
4.1 Theorem
c
Example 3. (a) Let l) consist of all continuous real-valued functions
of a Define/ + g and rf in the obvious way as the functions
real variable.
whose values for any number x are/(x) + g(x) and rf(x), respectively.
(Of course, we are using the theorems that/+ g and //are continuous if
Sec. 4 Vector Spaces, Subspaces, Dimension 111
/ and g are.) It is easy to verify that the laws for a vector space are
satisfied.
C
(b) Let P be the subspace of U consisting of all polynomials, i.e., all
of P.
The description of lines, planes, and ordinary space as 1-, 2-, and 3-
Example 4. (a) The four vectors x = (2, 0, 0), x = (0, —2, 0), 1 2
rx + sx + rx =
x only if r — s = = 0.
2 3 t
for its z'th entry and for all other entries, form a basis for 3\". Verification
that the e, are in fact linearly independent and span Jl" is left to the reader.
112 Linear Algebra Chap. 2
4.2 Theorem
appear on the right side. By substituting the right side for x A in any .
4.3 Theorem
any two vectors y x and y 2 in the space, one must be a multiple of the
Sec. 4 Vector Spaces, Subspaces, Dimension 113
that the y's are dependent, then we will have shown that the state-
ment of the theorem holds for a spanning set of n elements, and the
inductive proof will be complete. Each of the vectors yu y n+1 . . .
,
n + 1 • If the /i + 1 numbers a a are all zero, then the y's all lie in the
space spanned by the /; — vectors x 2
1 x w thus by the inductive , . . . , ;
j =2
But since not all the r's are zero, this implies that y x , . . .
, y n+1 are
dependent, as we wanted to show.
4.4 Theorem
both sets are independent, and both are spanning sets, Theorem 4.3
implies that k < n and n < k.
The dimension of a vector space that has a finite spanning set is the
number of elements in any basis for the space. (The dimension of the
space consisting of the zero vector alone is defined to be 0.) We write
dim (1)) for the dimension of the vector space C U. Note that Theorem 4.2
114 Linear Algebra Chap. 2
guarantees the existence of a basis, and that Theorem 4.4 guarantees that
the dimension does not depend on which basis is taken.
(c) The space P of all polynomials (Example 4(b)) does not have any
finite spanning set. If it did have one with k elements, then the fact that
1, x, x2 , . . .x k are k -\-
, 1 linearly independent elements of P would
contradict Theorem 4.3.
dimensional.
Theorem 4.2 asserts that we can get a basis from a finite spanning set by
deletingsome of its members. The next theorem shows that, in a finite-
dimensional space, we can get a basis from a linearly independent set by
adding vectors to it.
4.5 Theorem
must show that all the r's are 0. If rk+l were not 0, we could write
x A.^ x = — (/'i//'i + i)x 1 — ... — (rk lrk+1 )x k which is impossible because
. ,
4.6 Theorem
4.7 Theorem
u l5 . . . (Theorem
, ur for 17. 4.6 guarantees the possibility of this
construction.) Then dim (JV) = k and dim (17) =k+ r. Let wx =
/(Ux), . . . , w =/(u
r r ).
We claim that wl9 . . . , wr is a basis for W,
which implies that dim (ID) = r and proves the theorem. It is
k r
x = 2 a y + 2 Mi. t i
and then
y = I aj(jd + 2
=
t' l = Z l
WW = + 2 6,-w,,
2 =1
i=l
fljVx + • • • + ak y k — ^i u i — • • — b Tu r = 0. Since v l5 . . . \k u 1, ,
. .
.
u r are linearly independent, this is possible only if all the 6's (and all
the a's) are zero, which shows that the w's are linearly independent.
^+ b,
where C
U is a linear subspace and b is a single vector. The dimension of an
c
affine subspace is of course defined to be the dimension of Vf, and a
1-dimensional affine subspace is usually called aline, while a 2-dimensional
3
affine subspace is called a plane. Examples in ,'K are shown in Fig. 5(b).
Two affine subspaces are parallel if they are translates of one another.
Two of the most important ways of describing subspaces, namely, as
range or null space of a linear function, provide standard ways of describ-
ing lines and planes. The parametric representation of planes discussed in
Sec. 4 Vector Spaces, Subspaces, Dimension 1 17
L(u, v) — ux r + vx 2
A(u, v) = ux 1 + vx 2 + x
L(x - x ) = 0.
Methods for solving such equations are discussed in the earlier sections
of this chapter. If x = (x, y) is 2-dimensional then equation (1) would
take the form
ax + by = c,
a xx + b xy+ cx z = dx
a.,x + b y +
2 c 2z = d2
x — y + 2z = 1
x +y + 3z =
118 Linear Algebra Chap. 2
2x + 5z = 1
x +y + 3z = 0.
Multiplying the first equation by I and subtracting from the second gives
x + \z = J
We can represent the set of all solutions of this pair of equations para-
metrically as an affine subspace by setting z — t. We find
X = + 2"/ 2
/V — — It
2»
— A2
Z = /
EXERCISES
1. Which of the following subsets of Jl 3 are subspaces ? In each case either show
that the subset is a subspace or find some linear combination of elements of
the subset that is not in the subset.
2. Let x x = (1, 2, 3), x2 = (-1, 2, 1), x3 = (1, 1, 1), and x 4 = (1, 1, 0).
5. Part (d) shows that the union of two subspaces is not always
of Exercise 1
a subspace. Show that the union of two subspaces is a subspace if and only
if one of them is contained in the other.
6. Show that the range of a linear function may be a proper subspace of its
range space.
8. For any two subsets A and 3i of a vector space, let .-t — 3i be the set of all
9. Show that if is the only element in the intersection of two subspaces S, 73,
then
dim (S + TJ) = dim (S) + dim (73)
[Hint. Show that a basis for S together with a basis for 13 gives a basis for
s + t;.]
[Hint. Start with a basis for S n 13 and extend it (Theorem 4.6) to a basis
for S and a basis for 13.]
1
1 -3 2
Find a basis for the null space of/, and one for the range of/. Verify that
Theorem 4.7 holds.
12. Describe the solution set of each of the equations or systems of equations
below as an affine subspace, that is, as a translate by a specified vector of a
linear subspace with a specified basis.
(a) x +y = 1.
(b) 2jc + y =1
2x - 3y + z = 2.
(c) x + 2y + 3z = 10
Ax + 5y + 6z = 1
(a) x + 2y
(b)
(c)
SECTION 5
LINEAR FUNCTIONS In Chapter 1 we saw that matrices obey some of the same rules for
addition and multiplication that ordinary numbers do. Furthermore,
we have seen that given a linear function /from :K" to %m , there is an
m-by-/7 matrix A such that/(x) = Ax for all x in J{". Using these facts
we could prove that linear functions from 'A" to %m obey algebraic rules
just as their matrices do. However, it turns out that these same rules
apply to a wider class of linear functions and not just those representable
by matrices. For this reason we shall prove the rules in general form and
then apply them in Section 6 to a systematic analysis of some linear
differential equations.
We
begin by describing the operations of addition, scalar multiplica-
tion,and composition of functions. Let / and g be functions with the
same domain and having the same vector space as range space. Then the
function/ + g is the sum of/ and g defined by
(f+g)(x)=f(x) + g{x)
Sec. 5 Linear Functions 121
r/(x) = r(/(x)).
We have already defined the composition of functions in Chapter 1,
but we repeat the definition here. We require now that the range of/ be
contained in the domain of g. Then g°f is the composition of/ and g
and is defined by
/ g
Example 1. Suppose 5l 2 > % 2
and .'R
2
3t 2 are given by
and
'Q-C-K -3C
x\ _ /2x+y\ 12 \\lx
v+ O= / +
3x + 2y\ /3 2\/x
2x + 2j/ ~\2 2/V
Also, 3/ is given by
'•'CM'C
; x -X
.
5.1 Theorem
(f + g)(x)=f(x) + g(x)
= Ax + Bx = (A + B)x.
The first equality holds by the definition of/+ g, the second by the
relation of matrix to function, and the last by the general rule
AC + BC — (A + B)C for matrix multiplication and addition.
To prove statement 2 we write, similarly,
where the last equality follows from the rule B(AC) = (BA)C for
matrix multiplication.
Statement 3 is just the result of Theorem 4.5, Chapter 1.
'Q-Q- 'C)-t:
;:)-(:) 'Ch:
Then the matrices of/ and g are
2 -1\ /l
and
3 0/ \0 -1
Sec. 5 Linear Functions 123
2 -«_/, 0j_/0 -,
3 0/ \0 -1/ \3 2
^2 -1\/1 0\ /2 1
_
,3 0/\0 -lj \3
5.2 Theorem
(1) (f + g)°h=foh+goh
(2)fo(g + h)=fog + fo h
(3) (rf)og = r(fog)=fo(rg)
(4) ho (gof) = (hog) of.
ho(gof)(x) = h(gof(x))
= Kg(A*))Y
Similarly,
(hog)of(x)=hog(f(x))
= h(g(f(*)))-
124 Linear Algebra Chap. 2
The results of the two computations are the same, so the formulas we
started with must be the same. Since x is arbitrary, h ° (g of) =
(h o g) of.
(0O)
Example 3. Let C (Jl) be the vector space of infinitely often differen-
tiable real-valued functions y of the real variable x. If we let D stand for
differentiation with respect to x, then
Dy=y',
and D is a function with domain C <oo)
(3l). The familiar rules about
differentiation stating that (/ + g)' =/' + g' and (cf)' = cf imply that D
is linear. Because the meaning will always be clear, we can omit the little
D y=y",
2
D*y=y'".
We can even define
D°y =y
for occasional convenience. Combining powers of D by addition and
numerical multiplication leads to formulas like
D -D-
2
2, (D + 1)(Z> - 2),
and applying the rules (1) and (2) of Theorem 5.2 we get
(D + l)(D - 2) =D -D- 2
2.
for everyx in the domain of/. Applying/ to both sides of this equation
gives/°/ _1 (/(x)) =/(x); therefore, we also have
/o/-l(y) = y,
for every y =/(x) in the image of/. We leave as an exercise the proof
that:
Sec. 5 Linear Functions 125
x
5.3 If/ is linear, then so is/ .
Finally, the row reduction method of the first section of this chapter is
an efficient way to find the inverse of an invertible matrix, that is, to find
A -1 such that
A- 1 A = AA^ = 1
/,
f(x) = Ax,
-1
always has an inverse function/ given by
defined by
fx\
g(x)
126 Linear Algebra Chap. 2
)= x, for in C.
w
j
which is not invertible because it isn't even a square matrix. The difficulty
_1 2
is that g is not defined on all of ,'il , but only on the subspace C.
have a name: the rank of a linear function/is the dimension of the range of
/. Thus the linear function
f =
\y) \0 3/\
(2x\
2
has its rank equal to 2 because its range is all of 'J\ . The function
1 0\
\o oj\j
2
has its rank equal to 1 because its range is just the x-axis in Si For a .
useful criterion.
5.6 Theorem
Let/be a linear function with matrix A. Then the rank of/is equal
to the number of linearly independent columns in A.
1 7^
,2 14/
Then subtracting 2 times the first row from the third row gives
1 7\
15.
v
0/
5.7 Theorem
5.8 Theorem
Let %n —>% n
be linear, with square matrix A. Then/ has an
inverse if and only if A is invertible.
ABx =/o/-i(x) = x
EXERCISES
1. Suppose that/ and g are linear functions such that
1\ /1\ /0\ /l
'o-i- A.
y-n -CHI
Sec. 5 Linear Functions 129
(a)/.
(bU-
(c)f + g.
(£)2f-g.
(e)<?o/.
(Ofo(f + g).
Linear Algebra Chap. 2
(0 1 1 2
(a) (b)
U 0) 2 4y
<o o : 2
(c) | 1
I
2
,1 Oy v4
9. Let °U and '10 be finite dimensional vector spaces and let/be a linear function
with domain 1J and with range equal to 10. (Thus dim (1D) = rank (/).)
(a) Show that V contains a largest subspace S such that/, when restricted to
S, becomes one-to-one from StoW. [Hint. Let 91 be the null space of/
with basis n 1 n k Extend this basis to a basis for 1) by adding
, . . . , .
vectors s l5 . . . , s t .]
(b) Show that dim (S) = rank (/), where S is the subspace of part (a).
10. If/ is a function and y an element of the image of/, then the subset S of
is
11. What is the simplest way to find the rank of a diagonal matrix
diag («!,.. .,«„)?
SECTION 6
DIFFERENTIAL In this section we look at some vector space ideas that arise in studying
OPERATORS differential equations. The equations we shall treat will be like the
following:
/ - ly = (1)
y — 7>y = ex (2)
y = ce rx , (5)
because the c will cancel on both sides. In fact, Equation (5) gives the
most general solution to (4), for observe that we can write (5) in the form
e~ rx
v — c.
(e- rxy)' =
or, using the product rule for derivatives,
rewritten. But now we can reverse these steps, supposing that y is some
solution. We start with
y - ry =
rx
and then multiply by e" to get
(e-ry)' = 0.
e~ TX
y = c,
y - 3y = ex ,
£-3x,/ 2e~ 3x v = e~ 2x .
(e
-3Xy)' — e ~2x
for the most general solution. It is easy to verify directly, of course, that
we have indeed found some solutions, one for each value of c. What we
have shown additionally is that any solution must be of the form — \ex +
ce 3x .
(D + 2)y = Dy + 2y
= / + 2y
(D 2 - \)y = D' y -y z
= D(Dy) -y=y"-y.
An is that D acts as a linear function on y;
important observation
the term linear operator sometimes used to avoid possible confusion
is
over the fact that y itself is a function of x (though not necessarily a linear
one). To see that D acts linearly, all we have to do is recall the familiar
Sec. 6 Differential Operators 133
properties of differentiation:
D{)\ + y = Dy, + Dy
s) 2
These two equations express the linearity of D. From the fact that
compositions of linear functions are linear it follows that the operators
D D32
, , and in general Dn are also linear. Because numerical multiplica-
tion is a linear operation and because the sum of linear operations is
D +
2
a, D + aD +
2
b, (D + s)(D +
are all linear operators, with the respective interpretations
+ a)y = y" + ay
(D 2
(D + s){D + t) =D + 2
(s + t)D + st,
Z) a - = (D-
1 \)(D + 1)
(D 2 + aD + %= 0.
134 Linear Algebra Chap. 2
Our method of solution will be to try to factor the operator into factors
of the form (D + s) and (D + /), and then apply the exponential multiplier
method of Examples and 2 repeatedly.
1
(Z)
2
+ 5D + 6)y = 0.
(D 2 + 5D + 6) = (D + 3)(D + 2);
(D + 2)y = u
for the moment, we substitute u into the previous equation and arrive at
(D + 3)w = 0.
Therefore
e 3x u = cx ,
for some constant clf and so
u = c x e~~ 3x .
Recall now that we have temporarily set (D + 2)y = it. We then have
(D + 2)y = c x e~
3x
.
tx
Multiply this last equation by e to get
e^Dy + 2e 2xy = c x e~ x
or
D(e 2xy) = c x e~ x .
e^y = —c x e~ x + c2
or
y = — c e~ 3x + x c 2 e~ 2x .
Sec. 6 Differential Operators 135
Since the constants c x and c 2 are arbitrary anyway, we can change the
sign on the first one to get
y = c t e~ 3x + c^er 21
e
ax
(D + a)y = D(e axy).
6.1 Theorem
(D - r,){D - r2) . . . (D - r n )y = 0,
(D - ri )(D - r 2 )y = 0,
we set
(D - r 2 )y = u.
e~ riX Du — r x e~ T ^ x u =
or
D{e~ rix u) = 0.
e -TiX U _ Ci
Now
(D - r 2 )y = cx e ^x
r
.
136 Linear Algebra Chap. 2
We multiply by e r- r
to get
D{e- r **y) = Cl e
(r >- r * )x
(6)
gives
Cl
e-*z*y = gin-*)*
+ C2
cl
y = e
r
r, — r..
D{e~ rt*y) = ev
Now integration gives
g-rtxy = CiX _|_
Ca
or
y = c 1 xe TiX + c 2 e T2X ,
(D - rjiij = 0.
Substitution of the general solution for u^ into (7) gives a new equa-
tion which we split up by setting
(D-r )...(D- 3 r n )y = u2 .
(D — r 2 )u 2 = «l9
D n + a^D"- + 1
. . . + ax D+ a (8)
y
(n)
+ a n_ iy ^-v + . . . + a iy '
+ a y = 0.
Notice that, to get the polynomial from the equation, we replace y
ik)
by D k
and that the term a D° = a Finding a
y corresponds to a . factor-
ization for the polynomial depends on knowing its roots, for if the
polynomial (8) has roots r u r n then it has the factored form
. . . , ,
(D - ri )(D -r )...(D- 2 r n ).
Z> 3 - 4D + 2
AD,
D(D 2 - AD + 4) = D(D - 2)
2
The roots are and 2, where 2 is a repeated root. Thus the general
solution to the equation is a linear combination of e 0x , e 2x and xe Zx
, ,
Linear Algebra Chap. 2
and so
y = cx + c 2 e 2x + c 3 xe 2*
(D - rO . . . (D - r n )y =
is a vector subspace of C (n)
, because JV is the null space of the linear
operator
L = (D - rx) . . . (D - rn)
6.2 Theorem
(D-r )...(D- y r n )y = 0.
differential equations
{D - rk )y = uk_ x
D(e~ TkXy) =
are of the form
e -r k Xy _ Cf
Example 5. Given
y" + 2y' +y = e 3x ,
140 Linear Algebra Chap. 2
(D + 1)« = e Zx .
Multiplication by e x gives
e x Du + ex u = e ix
or
D(e*w) = e 4x .
or
u = £e 3x + c^ - *.
Since (Z) + l)_y = w, we have
(D + 1)7 = ie
3x
+ c^.
Again multiplying by ex , we get
e x Dy + e xy = le
ix
+ cx
or
Then
or
j = ^e * + 3
c x xe~ x + c 2 e _x .
In the above example the solution breaks naturally into a sum of two
parts, y h and y9 :
yh = c x xe~ x + c 2 e~ x ,
L(y) =
associated with L(y) = f. The function _y„ is called a particular solution of
Uy) =f
because it is just that: a particular solution, though not the most general
one. In fact, we get y p by setting c l = c2 — in the general solution. This
breakup of the solution into two parts is an example of a general fact about
linear functions, a fact that is used in solving systems of linear algebraic
equations. The principle is important enough, and at the same time simple
enough, that we state it here also.
Sec. 6 Differential Operators 141
6.3 Theorem
Proof. Suppose that L(y) =/and that also L(y p ) = f. Then, since
L is linear,
L(y - yp = ) L(y) - L(y p )
The method of Example 5 can always be used to find the most general
solution to the equation L{y) =/of the form
y = Oi7i + a 2y 2
equation.
.
(D + \)
2
y = e 3x
had the general solution
TVe
3x
+ Cyxe- X + c 2 e~ x .
we would not have to start all over again, but would only have to find a
particular solution for
(D + \fy = 1.
(D + \fy = e 3x + e~*
(D + \fy = e~ x .
EXERCISES
1. For each of the following differential equations, find an appropriate
exponential multiplier and then solve by integrating both sides of the
modified equation
e $p(x) dx
d
(
e lv(x) dx 7
y\
dx
Sec. 6 Differential Operators 143
3. Use the result of Problem 2 to find an exponential multiplier for each of the
following differential equations. Then solve the equation.
(b)
£ + xy = x. (d) / + y = 0.
4. Write each of the following differential equations in operator form, e.g.,
(D 2 + 2D t 1)/ = 0, and then factor the operator into factors of order 1.
5. Sketch the graph of each function of x given below. Then find a differential
equation of which each one is a solution, the equation being of the form
y" + ay' + by = 0.
(b) e* + e- x . (d) 2e
2x - 3e
3x
.
x y"2
+ (x 3 + x)y + (x
2
- \)y =
can be written in operator form as
x)y-0.
K)< \(D
y(x) = cx e
tx
+ c 2 e~
tx
y(x) =d 1
cos x + d2 sin x.
(c) Verify directly that cos x and sin x are solutions of v" +y = 0.
(D - ri)...(D -r„)/=0.
By applying the operators (Z) — r,) to both sides of the given Equation (1),
it follows that y must also be a solution of the higher-order homogeneous
equation
(Z> - /i) . . . (D - r n )(D
2
+ aD + b)y = 0. (2)
Since we have a routine for writing down the most general solution yg of
this last equation, we can find a particular solution y p of Equation (1) by
substituting the general solution of (2) into it and seeing what conditions
save duplication, we can first eliminate from yg all terms that already occur
in the homogeneous part, yh of the solution of (1). Linear independence of
,
for y v The
. general solution of (1) is then y = yh + y v Find . the general
solutions of the following differential equations.
x
(c) y" + 2/ + y = e .
yh = C^iO) + C 2 U 2 (x).
(a) Verify that substitution of Equations (4), (6), and (7) of Exercise 11
into
/ I ay' +by =f
produces Equation (8).
(b) Verify that Equations (5) and (8) of Exercise 1 1 have the solution given
by Equation (9).
x(t) and its acceleration is x(t). From physical principles it can be shown
that, theoretically, x satisfies the differential equation
mx + kx = 0,
(a) Show that if the initial displacement is x(0) = and the initial velocity
Vcfn
x(t) = -7- (1 - e- kt ' m ).
k
(b) Show that the displacement x(t) has an upper bound equal to v^mjk.
What is the effect of increasing the viscosity constant or of increasing
the mass of the pellet?
(c) Show that the velocity of the pellet decreases to zero as t increases.
(d) Show that the acceleration of the pellet is negative for t > 0. What is
ay + pz =/(/)
yy + 6z = g(t)
as a special case when a, b, c, and d are all zero. The method of elimination
can be used to solve the differential system as well as the algebraic. To find
y{t) and z(t), operate on both sides of the first equation by (dD + d),and
on the second equation by (bD + ji). Then subtract one equation from the
other. The resulting equation can be solved for y(t) since it does not contain
z. Next substitute the general solution y(t) into one of the equations and
solve that for z(t); to determine possible relations between the constants of
integration, it may be necessary to substitute y(t) and z(t) into the other
given equation. Sometimes simplifications in this procedure can be made.
Sec. 7 Complex Vector Spaces 147
(a) Use the method just described to find the general solution of the system
(£> + \)y + z = 0.
3y + (D - \)z = 0.
(b) Determine the constants in the solution of part (a) so that the initial
(Z> + \)y + Dz =
Dy - (D - \)z = t.
(d) Determine the constants in the solution of part (c) so that the initial
15. Suppose that two 100-gallon tanks of salt solution have concentrations (in
pounds per gallon) of salt y(t) and z(t), respectively. Suppose that solution
is flowing from the v-tank to the z-tank at a rate of 1 gallon per minute, and
from the z-tank to the j-tank at a rate of 4 gallons per minute, and that the
overflow from the j'-tank goes down the drain, while the z-tank is kept full
by the addition of fresh water. We assume that each tank is kept thoroughly
mixed at all times.
(a) Show that y and z satisfy a system of differential equations of the type
discussed in Exercise 14. [Hint. Express Dy and Dz each as a linear
combination of y and z.]
(b) Find the general solution of the system found in part (a), and then
determine the constants in it to be consistent with initial concentrations
j(0) = ^ and z(0) = i,
(c) Draw the graphs of the particular solutions y{t) and z(t) found in part
SECTION 7
In the earlier parts of this chapter we have always understood the vector COMPLEX VECTOR
space operation of numerical multiplication to mean multiplication by a SPACES
real number. However, we can replace real numbers by complex numbers,
and let the definition of vector space and linear function remain otherwise
the same. Then, all the theorems we have proved for real vector spaces
are still true relative to the complex numbers. To prove this, all we have
to do is observe that the only properties of real numbers that we used in
proving theorems about an abstract vector space are properties that are
shared by complex numbers. Theorems involving inner products are
another matter, which we shall discuss at the end of the section. As
motivation for considering complex vector spaces, we shall explain how
the extensionis the key to further development of the study of differential
(a- •
x') i (v y')
IV
Figure 7
y '.l'-
\x + iy\ = v/x
z
+ y
2
X r iy
and corresponds to the length of a vector in 'Ji
2
. (See Fig. 8.)
are related by
y
x + iy = \x <y\ + i
x + iy\ \x + iy\)'
iy\
Because \x + iy\ = V* + y 2 2
, the numbers xj\Jx 2 +y 2
and yjs/x 2 +y 2
r|(cos i sin 8)
Figure 9
number has infinitely many polar angles, each pair differing by an integer
multiple of 277.
Now if z and z' are complex numbers with polar angles 6 and 0', we
can write their product in polar form as follows:
= \z\ \z'\ ((cos 6 cos 6' — sin 6 sin 6') + / (cos 6 sin d' + sin 6 cos 6'))
In the last step we have used the addition formulas for cosine and
sine. The result of the computation is a number in polar form
having absolute value \z\ \z'\ and polar angle d + 6'. We conclude
that the absolute value of a product of complex numbers z and z'
is the product of their absolute values
and that if z and z' have polar angles 0(z) and O(z'), then zz'
has a polar angle 0(z) + d(z'). These facts are illustrated in
Fig. 10. Figure 10
150 Linear Algebra Chap. 2
2
Thus e'° is a complex number with |<"°| \ sin cos- I , and with
polar angle d. Since polar angles are added when complex numbers are
multiplied, we have
1 = a- = 7°.
These last equations are justifications for using the exponential notation;
the function behaves like the real exponential, for which e°e° = e
6+e '
z = \z\e
mz
\
and its conjugate is
2 = \z\e~
mz \
ax ax clx
and
|
(Ma) - iv(x))dx = |
m(x) clx + i fi-(A) dx.
—d e
ix
= —d (cos x +
/ , •
i sin x)
dx dx
= —sin x f i cos x
= /(cos x + / sin x) = ie
tx
.
Sec. 7 Complex Vector Spaces 151
In short, we have
4- e
ix
= ie
ix
.
dx
Similarly
e
ix
dx = - e ix + c,
J i
and compute
7.1 A. e
<a+ifi)x
= (a + ip)e
la+ifi)x
dx
and
7.2 r e <«+«-/»* rf x = —— 1
e
<«-M/»*
+ Cj a + i/5 ^ 0.
J a + //5
(D 2 + aD + b)y =
when the factored operator
(D - ri )(D - r2) = i)
2
- (r, + r 2)D + r^
contains complex numbers rx and r2 . We shall see that the usual tech-
(£»
2
+ \)y = 0,
and factor D + 2
1 to get
(D - i)(D + i)y = 0.
152 Linear Algebra Chap. 2
Then set
(D + i)y = u, (1)
and try to solve
(D - i)u =
for //. As in the real case, we multiply by a factor designed to make the
left side the derivative of a product. The same multiplier rule suggests
that the correct factor is e~ ix so we write
,
er ix {D - 0" =
or, since D(e~ ix
u) = e~ ix
{D — i)u,
D(e~ ix u) = 0.
e~ ix
u = cx or u — cx eix .
(D + i)y = cxe*,
Integrating gives
e
ix
y = \ Cl e 2ix + c2
or
y =~ cx e
ix
+ c 2 e~
ix
.
2i
y = c x e ix + c 2 e~ ix
d +
=— x id*
= —d,
— id 2
c, and Co .
Sec. 7 Complex Vector Spaces 153
c1 x 1 + . . . + c nx n =
implies that all the c's are zero whenever they are chosen from the complex
numbers, then the same implication certainly holds if the c's are only
chosen from the real numbers. However, the converse statement is not
true, as the following example shows.
7.3 Theorem
Theorem 7.3 is not usually applied directly in the above form because
operators such as
P(D) = D + aD + 2
b,
which occur in practice, most often have real numbers for the coefficients
a and b. This implies that, in the factorization
D + aD +
2
b = (D - r{){D - r 2 ),
qTX qTX
Cl e
rx
+ c2 e = c e (a+mx + c e ~
fx
x 2
(x ill)x
e ax sin /3x are easier to interpret geometrically than are the complex
exponentials that gave rise to them. Hence the solutions are often written
using the trigonometric form
y = dx e™ cos j8x + d 2 e*
x
sin /3x.
y" + 2/ + 2y =
has characteristic polynomial
D + 2D +
2
2.
(2>_(_1_/))(2)_(_1 +f))y = 0.
The complex exponential solutions are
e (-l-i)x £ {-l+i)x_
(-i-i)x {
- 1+i)x
Cie _|_ c 2e
or
dx e~ x cos x + d.2 e~
x sin x.
7.4 Theorem
{D n + a n _ x D n -i + . . . +aD+ x a )j = 0,
where the # are real, is a vector space JL of dimension n relative to
fc
l ax l ax
x e cos /5.x, x e sin /?x,
where a and ft
are real.
Proof. We know from Theorem 7.3 that the complex solutions of the
above differential equation are linear combinations of functions of
the form x lerx where either r is real or else there is a companion
,
solution x e
rx
We have seen that any solution, and in particular
l
.
l x l ax
x e" cos fix, x e sin fix. (3)
c + cx x + . . . + c nx
n
— for all x
implies that the polynomial has more than n roots. This is possible only
if all the coefficients are zero.
|cx|
2
= (ex, ex) = c 2 (x, x) = c 2 |x| 2 .
But |cx| 2 and |x| 2 are both real numbers, while in general c 2 the square of a ,
complex number, is not real. To get around this difficulty we require that
(x, y) be conjugate symmetric,
then |z| turns out to have the three properties of length listed in 5.3 of
Section 5, Chapter 1.
P(x) = 2c ke
ik *
k=-n
defined for— n < x < tt with complex coefficients ck . We can define a
complex inner product on 8„ by
Ipl = (P, Pf
2
\l/2
2
\p(x)\ dxj .
real case and is left as Exercise at the end of this section. The differ-
1 1
ence between the real and complex proofs here illustrates the fact that,
because a complex inner product has somewhat different properties from
a real inner product, we cannot expect theorems involving inner products
to extend without change from real to complex vector spaces. Complex
inner products are used in this chapter in the exercises following Sections
8 and 9.
EXERCISES
1. For each of the following complex numbers z, find z and \z\. Then write z
in polar form and find 1/z.
2 + /
(b) -1 + 2/. (d)
i
2. Verify that, for complex numbers zlt z 2 , and z 3 the distributive law ,
z-Sztf^ = {z x z 2 )z z and zx + (z 2 + z 3) = (z 2 + z2 ) + z3 ,
all hold.
4. Prove directly from the definitions of conjugate and absolute values that
z^ = z^ and \z z = x t \
\z x \
|z 2 |.
to k\
fc
e
into two infinite series, and use the result to justify defining e' by
e
tB
= cos 6 + i sin d.
(a) (£>
2
+ 1)7 = 1. (d) y' +y = \.
mx + kx + hx = 0,
(b) Assuming m, h, and k all equal to 1 , find the solution of the differential
equation satisfying x(0) = and x(0) = 3.
(c) If in part (b) we change to k =2, but leave the other conditions the
same, find the corresponding solution to the differential equation.
(d) What is the maximum displacement from the initial position under the
conditions of part (b)? Show that the oscillation tends to zero as t
increases.
(e) Sketch the graph of the displacement function under the conditions of
part (c). What is the maximum displacement and at what time does it
occur?
9. (a) Show that {a cos ex + b sin ex), where a, b, and c are real numbers,
can be written in the form r cos (ex — 0), where r = Va 2 + b 2 and
is an angle such that cos d = a\r and sin Q = bjr.
(b) The result of part (a) is useful because it shows that a linear combination
of cos ex and sin ex has a graph which is the graph of cos ex shifted by a
suitable phase angle, 8, and multiplied by an amplitude, r. Sketch the
graph of
1 1
- cos 2x -i 7= sin 2x
2 V3
by first finding r and an appropriate 0.
ax x
10. Show directly that e cos Px and e' sin fix are linearly independent
relative to the real numbers by using the formulas
gift*
+ e -iP*
i0x _e -ifix
rx TX
together with the fact that e and e are linearly independent relative to the
complex numbers when r j= f.
|<z,w>|<|z||w|
:
so that
(e-
i0
z, w) - |<z,w)|.
12. Show that C", the set of w-tuples of complex numbers, has dimension //
relative to the complex numbers and dimension 2a? relative to the real
numbers.
SECTION 8
ORTHONORMAL BASES If 1) is a vector space with an inner product, and T) has a basis
Ui,U 2 , . . . , u„,
then it is often desirable that the basis be an orthonormal set. This means
that distinct vectors u (
and u ; are perpendicular:
u. -u ; = 0, i^j, (1)
X = CjUj + • • + CjU; + • • • + c nu n ,
X • U; = CiUj • IT, + . . . + Ci U j • Uj + . . . + c n \x n u;
= + • • + c, + . . . + 0,
so that
C, = X • u, (3)
2
Example 1. The vectors (1,0) and (0, 1) in 'J\ form an orthonormal
basis. So does the pair of vectors (l/\'2, l/\/2) and (— 1/V2, 1/V2). In
terms of the latter basis we can write
(x y) '
= c +c
i-Jrji} 'hr2-jl}
s/2 J2i
-
*-*»>(*# fe a
5 -
(1 ' 2) = + 72'^'72)'
7i(vi'^)
Example 2. The set of functions defined for — <x <
tt tt by
n
T(x) = 2 (o n cos kx + foj. sin /ex).
(/g)^ 1 fV(x)g(x)Jx,
77 J-s-
and with respect to this inner product, the above set turns out to be
orthonormal. In fact, computation shows that
o,
- I cos kx cos Ix dx =
— I sin kx sin Ix dx
Linear Algebra Chap. 2
The fact that orthonormal bases are simpler to work with suggests
the following question: Given a basis for a vector space with an inner
product, is there some way to find an orthonormal basis? The answer is
8.1 Theorem
CiUj + . . . + cm + . . . + cnu n = 0.
111 = —
|Xx|
.
Then lu^ = Ix^/lxJ = 1. Now pick another of the x's, say x 2 and form ,
its projection on alt that is, form the vector (x 2 • u 1 )u 1 . If the vectors were
ordinary geometric vectors, the relationship between xlt x 2 , and
ux would be somewhat as shown in Fig. 1 1.
y2 = x2 - (x 2 -u,)uls
y2 • Uj = x2 • ux - (x 2 •
UiHii! • u2)
Figure 11 = x2 • ux — x2 • ux = 0.
Sec. 8 Orthonormal Bases 163
u2
"
= —
|y,l
.
The vector y 2 cannot be zero because, by its definition, that would imply
and u x were linearly dependent.
that x 2
Having found u x and u 2 we choose x 3 and form ,
its projection on the
subspace of T) spanned by u t and u 2 By definition, . this is the vector
p = (x3 • U^Ui + (x 3 • u 2 )u 2 .
y3 = x3 - ( x3 • "i)ui - ( X3 ' u 2 )u 2 .
y3 • ux = (x 3 • ux) - (x 3 • ux) = 0.
Similarly
y3 • u2 = (x 3 • u2) - (x 3 • u2) = 0.
u3 = —y3
|y 3 l
.
Once again y 3 =
would imply linear dependence of x 3 ul5 and u 2 ,
.
But because the subspace spanned by u 2 and u 2 is the same as the one
spanned by x x and x 2 this would imply linear dependence of the x's.
,
y m=x J+1
- (x ;+1 • ujux (X; + l ' U,K,
8.2
;+l
ly^+ii
The vector
(x • ujux + . . .
+ (x •
11,011,
y2 = x - (x2 2
• uju!
= (i,o,-i)- (=±)(-L, — *)
V6/V6 V6 %/*/
= (1, 0, -1) + (|, -i, J) = (*, -i, -|).
Then
2 ~ ~
|y.l V66/36
(7 '-''- 4) -
=vk
Thus the plane P can be represented as all linear combinations
WXj + vx 2
or all linear combinations
su x + /u 2 .
The relationship between the two pairs of vectors is shown in Fig. 13.
Figure 13
Sec. 8 Orthonormal Bases 165
product by
x x
u,(x) =
(x, x r x ax
= y/ix.
Next set
>'
3 (x)
= x
2
— (x
2
, u 1 (x))w 1 (x) - (x
2
, u 2 (x))u 2 (x)
(£5*)^-
Then
» 3 (x) = l\l/2
2 2
'
(x -i) Jx
= V¥(x 2 -i) = 2
Vl(3x -l).
Of course, if we start with two bases for the same vector space "U and
apply the Gram-Schmidt process to them, we will in general get different
8.3 Theorem
= v fc
• u; - = r it j= 1 k- 1,
yk ' U A:
~ rk-
Thus yk = {yk • u^u^.. Since both the u's and the v's have length 1,
we must have
=
.
,
|v**u*l I-
It follows that yk • uk = ± 1 ; so v fc
= ±uk .
Similarly, the pairs {e 1 and {x l5 x 2 } span the xv-plane, and the com- , e2}
plete bases both span all of 3l 3 It follows from Theorem 8.3 that applying .
e,
.•'e,
Figure 14
Sec. 9 Eigenvectors 167
EXERCISES
1. (a) Find a vector (x, y, z) such that the triple of vectors (1, 1, 1), ( — 1, \, \),
and (x, y, z) forms an orthogonal basis for Jl 3 .
2. The vectors (1,1,1) and (1, 2, 1) span a plane Pin R 3 . Find an orthonormal
basis for P by using the Gram-Schmidt process.
3. The three vectors (1 2,, 1), ( -10, 0), and (0, 1,0,2) form a basis for a
, 1 , 1 ,
f{x)g{x) dx,
</".*>
f
find an orthonormal basis for P.,. [Hint. One basis for P2 consists of
{I,*,* 2 }.]
5. Show that the application of the Gram-Schmidt process to the three vectors
Xj, x 2 x3
, in Example 5 of the text gives the triple e l5 — e2 , e3 .
7. Let C[ — 7T, tt] be the vector space of complex-valued functions /(x) defined
for — tt < x < n. Let {f,g) be defined for/and £ in C[ — n, n] by
!\,m = n.
0, m j= n.
(c) Show that the vector subspaces of C[~n, tt] spanned by the following
two sets are the same:
SECTION 9
In this section we shall find a natural way to associate a basis for a vector EIGENVECTORS
t
space 1) with a given linear function /from 1? to \5. Suppose that there
168 Linear Algebra Chap. 2
f(x) = Ax.
Thus
x\ / 1 1 \ I x\ j +y x
1 1\/1\ /l
= 3
4 l/\2/ \2
and that
:>-i-i
2
That is, the vector (1,2) in 'Ji is an eigenvector corresponding to the
eigenvalue and the vector (1 —2) is an eigenvector corresponding to the
3, ,
Before discussing how to find eigenvectors, we shall see why they are
useful.Suppose that 'V is a vector space, /is a linear function from C
U to
HI, and suppose that °0 has a basis {x ]5 x 2 , . . . , x„} consisting of eigen-
vectors of/, that is,
/(x fc ) = 4x ft ,
k= 1,2, ... ,«,
X = CjXx + . . . + c n\ n .
=
Then, using the linearity of/ and the fact that the x's are eigenvectors,
we have
/(x) = Cl /(Xl ) + . . .+ c n f(x n )
= c^Xj + . . . + cn X n x n .
(n \ n
9.1
6=1 / fc=i
where
9.2 /(x,) = 4x fc
, k=\,2,...,n.
zoo U/
Q+ „/( _]
we get
Figure 15 shows the effect of/ on each of the two eigenvectors x x and
x2 . It follows that the image /(x) of any vector x can be constructed by
170 Linear Algebra Chap. 2
MX | • VXj
Figure 15
1. Stretching away from the line through x 2 along the lines parallel to
For this analysis to work, it was essential that the eigenvectors of/ span
all of 3l 2 .
^4 l/\j>
A0 ,
4 l!\y kJ\y
or
(1-/1) 1 x
(1)
4 (i-X)l\y
:
It is clear that if the foregoing 2-by-2 matrix has an inverse, then the only
solutions are and y — 0. Hence we must try to find values of X for
x =
which the matrix fails to have an inverse. This will occur precisely when
/(1-A) 1 \
det = 0,
\ 4 (1 - X)l
and
- (1J0-C
X = 3: -2x +y=
X = - 1 2x + y = 0.
It follows that there are many solutions; but all we need is one for each
eigenvalue. We choose for simplicity
X = 3
\yl \2
X= -1
\yl \-2
though any nonzero numerical multiple of either vector would do.
9.4 (A - VK = 0.
\
Because Equations 9.3 and 9.4 are expressed in terms of the matrix A off,
we sometimes and eigenvalues of the matrix rather
refer to eigenvectors
than the function/. Of course Equation 9.4 is just Equation 9.2 in matrix
form; but Equation 9.2 also applies to linear functions that are not
representable by matrices.
Example 4. This example will show one way in which a linear function
can fail to have eigenvectors that provide a suitable basis for the vector
2
space on which/acts. Consider the linear function/from J{ to :K 2 defined
for fixed 0, < < 2tt, by
x\ /cos — s'mO\/x
/
yl \sin cos 01 \y
This function carries
1 /cos
into
0/ \sin
and
0\ /-sin 6
into
1/ \ cos0
(cos 6 — X —sin \
=
sin 6 cos 6 — XI
or
X2 - 2X cos 6 + 1 = 0.
This quadratic equation for X has only complex conjugate roots of the
form
cos 6 ± i sin 6 = e
±i6
.
2
Since we are looking for nonzero vectors x in III satisfying
f(x) = e
±ie
x,
/(x) •
y = x ./(y)
c
for all x and y in l). In particular, if/is a linear function from 51" to 31",
it has a matrix A = (a i}), and we can write the symmetry equation as
Ax '
y = x • Ay.
<a b'
A=
\b c
a b\ / Vl \ />'A _ lax, + bx 2 \ U)
b c!\xj \yj \bx1 + cxj \y 2 l
= ax^i + b(x 2 y 1 + x 1 y 2) + cx 2 y 2 .
Similarly, we compute
f
xA la b\ly 1 \_(xA(ayx + by %
Since the two dot-products are equal, we conclude that /is symmetric.
9.5 Theorem
C
Let /be a symmetric linear function from HJ to \J, where °\3 is a
vector space with an inner product. If x x and x 2 are eigenvectors of
/corresponding to distinct eigenvalues X x and A 2 then x 2 and x 2 are
,
orthogonal.
and
Xi -/(x 2 ) = xx • (A 2 x 2 ) = A 2 (Xi • x 2 ).
A 1 (x 1 • x2) = X (x x2 1
•
2 ).
T 2\
U\-X) 2 \
det = 0,
\ 2 (1 - X)l
formation /of Example 1, even though they have the same eigenvalues.)
Equation 9.4 for the eigenvectors has two interpretations, depending on
which eigenvalue is used. We have
1 J)0 -C
1= -1:
X= 3: -2x + = 0, 2y
A=-l: 2x+2y = 0.
We find solutions
X= 3: (x,y)=(l,l),
The vectors Xj = (1, 1), x2 = (1, — 1) are clearly orthogonal, and they
We can now take advantage of the fact that ii! and u 2 are eigenvectors
and also that they form an orthonormal set. The latter fact enables us to
express any vector in Jl 2 as a linear combination of u t and u 2 very simply.
From the previous section we have
X = (X • Uj)^ + (x • u 2 )u 2
for any x in 5l
2
. Now using the fact that u x and u 2 are eigenvectors of the
linear function g, we can write
g(\) = (x •
uOsCuO + (x • u 2 )g(u 2 ).
g(x) = 3(x •
u^Ux - (x u 2 )u 2 .
Thus the action of g has a geometric description like that given for fin
Example 2 of this section. The only difference is that the stretching and
reflection takes place along different lines: perpendicular lines in the case
of g, nonperpendicular lines in the case off.
EXERCISES
1. The linear function /from Jl
2
to R2 with matrix
'1 12\
2. Find all
;) n< (::)•
following matrices.
4\ /l 0\
(b)
10/ (d) 2 1
\0 1 2/
4. Show that is an eigenvalue of a linear function /if and only if/ is not
one-to-one.
176 Linear Algebra Chap. 2
f(x) for x in 31. Then the differential operator D 2 acts linearly from C (G0) (^)
to C (00) (3l).
(a) Show that for any real number A the functions cos ?.x and sin Ax are
eigenvectors of D 2
corresponding to the eigenvalue —A 2 .
(b) Let C (x, [0, tt] be the subspace of C (x, (#) consisting of functions/such
=0. Show if D
2
that /(0) =/(tt) that is restricted to acting on
C (x) [0, 77], then its only eigenfunctions are of the form sin kx, corre-
sponding to A = —A: 2 , where k is an integer.
'1 V
\0 1,
9. (a) Find the eigenvalues and a corresponding pair of eigenvectors for the
/from R 2 to 31 2 having matrix
linear function
(b) Show that the eigenvectors of the function /in part (a) form an orthog-
2
onal basis for 3l , and use this fact to give a geometric description of the
2
action of/ on 31 .
(c) Generalize the results you found for (a) and (b) to any linear function/
from 31" to 31" having a diagonal matrix diag (a u a 2 , . . . , c„).
2
10. Find the eigenvalues of the function^ on 3l with matrix
1 2^
,1 1,
2
Show that the corresponding eigenvectors span 3l and describe the action
ofg.
Sec. 9 Eigenvectors 111
11. Let C 2 be the complex vector space of pairs z = (zx z 2) of complex numbers, ,
z • w = z 1 w1 + z2 w2 .
/'cos 6 —sin 6\
v
sin 8 cos 8}
V
X = e
i{
X=e-
(b) Show that if sin 6=0, then every nonzero vector in C 2 is an eigen-
vector of the function fin part (a).
(c) Show that the eigenvectors in part (a) form a basis for C2 .
(d) Show that the eigenvectors in part (a) are orthogonal with respect to the
complex inner product in C2 .
12. Explain in geometric terms why a rotation about the origin in ft 2 through
an angle 8, < 8 < 2tt, has no eigenvectors in ft 2 unless 8 = or 8 = -n.
13. Suppose that 'W is a complex vector space with a complex inner product, so
that <z, w) = (w, z). Then a linear function / from 10 to 10 is called
Hermitian symmetric if </(z), w) = <z,/(w)>, for z, w in 10.
(a) Show that if /'is Hermitian symmetric and has A for a complex eigen-
value, then A must actually be real.
(b) Show that if 10 has complex dimension 2 and / is given by a 2-by-2
complex matrix
y\
(a) Let
(a c
b d
Show that the vector equation
x' = Ax
is the same as the system of first-order differential equations
x = ax + cy
y = bx + dy
'1 2\
v2 1,
find solutions of the vector differential equation x' = As. in the form
x = e^i'c! + e^'cj.
(Z> - r x ){D - r 2 )y =
is equivalent to the system of first-order equations
x = rx x
y' =x + r 2 x,
(e) Show that the characteristic equation of the matrix A found in part (d)
is the same as the characteristic equation of the operator (D — r^) x
(D — r 2 ) as defined in Section 6.
15. (a) Show that the functions cos kx and sin kx are eigenvectors of the
differential operator D2 acting on C (cc) What are the corresponding
.
eigenvalues?
(b) If we restrict the operator D 2
to real-valued functions defined for
—" < x < n, we can define an inner product by
</><?>=
L f(x)g(x)dx.
Show that the eigenvectors cos kx and sin Ix are orthogonal with respect
to this inner product for k, I = 1,2, 3, ... . Then use Theorem 8.1 to
(c) Show that the functions e ikx and e~ lkx are eigenvectors of the differential
operator D2 acting on complex-valued functions. What are the corre-
sponding eigenvalues?
(d) Using the complex inner product
SECTION 10
10.1 Theorem
then
^Vi + . . . + s n\ n = x = w + r n\ n ,
fa - *iK + • • • + (r n - s n )v n = 0.
Since the v's are independent, the numbers rt — st must all be zero,
and so (s x , . . . , s n ) is the same as (r ls . . . , r n ).
\ / \ A (b) In the plane, let Vj be the vector of unit length along the
q ~ *"*
horizontal axis, and v 2 the vector of unit length at an angle 60°
counterclockwise from v x . Let x be the vector of unit length 60°
Figure 16 beyond v 2 shown , in Fig. 16.
By geometry, CB is parallel to OA and OC is parallel to AB
(why?); so OABC is a parallelogram and v = 2 x + Vj. Hence x = —v + 1
v2 and the coordinates of x with respect to this basis are (—1, 1).
known for each vector of a basis, then the function is known completely.
This follows from the fact that any vector x can be expressed as a linear
combination r x \x + • • • + r n\ n of the vectors in a basis. Then/(x) is the
combination ^/(Vi) + . . . + rn
f(\ n) of the vectors /(vj with the same
ID, respectively. The possibility that 1J and 10 are the same space is not
ruled out. The matrix off with respect to the bases V and is defined to W
be the m-by-n matrix whose y'th column is the coordinate vector of/(v ) ;
with respect to the basis W. Thus if we denote by A = (a^) the matrix off
relative to Kand W, then the entries in they'th column of A appear as the
coefficients a tj in the equation
The matrix A contains all the information needed to evaluate f(x) for
any x. For if x is represented as
/(x)=i>,/(v,)
n m
=i
f=l
(i<wW
\j=l I
to be the same bases. We do this in all the examples for which HJ is the
same as ID.
0/ \ 1/
itself, and it may be considered as a linear function from °U to C
U. We have
-1 -1
1
(0 -1 N
1 1
nomial into the 4-tuple (1, 1, 1, 1); it takes the polynomial x into the
1
4-tuple (1, —1,2,3); and it takes x z into (1, 1,4,9). Its matrix with
182 Linear Algebra Chap. 2
10.2 Theorem
Note that if °\J = ft", the coordinate map c for the natural basis is the
identity function c(x) = x.
all of 'Ui, is called an isomorphism, and two spaces such that there exists an
isomorphism between them are said to be isomorphic. We have shown that
coordinate maps are isomorphisms, which proves:
10.3 Theorem
Any statement true for one space (provided it can be formulated entirely
in terms of addition and numerical multiplication) can be carried over by
an isomorphism to give a true corresponding statement for an isomorphic
space. For example, if TJ —^-> ID is an isomorphism, then an equation
n n
y =2
i=l
a xi
i
ls true m ^ if an d only if
f(y) =^
t'=l
a if(*i) is true in ID.
with respect to the bases V and W. To prove the assertion, note that the
10.4 Theorem
(C W o g o Cy
1
) °(c v ofo Cr]) = CW o
(g of) O c'r],
The special case in which 1L, CU, and ID are all the same space with the
same basis is particularly important. If °0 —f-+ T) has matrix A, then
/o/has matrix A 2 f °f °f has
,
matrix A3 , etc.
A) 1 0'
A=10 2
\0 0/
It is easy to compute that
'0 2"
\0 0/
1 2 0\
1 1 3y
3 2
with respect to the natural bases in 'J\ and ll\ . Consider the bases
in 3t 3 and
•) -(;;
in Si 2
. To find the matrix off with respect to (v l5 v 2 v 3 ) , and (w x , w 2 ), we
compute , ,
5w
'(
'Hio) = -
/(v 2 ) = ( 1=8*!- 3w 2 , /(v3) = ( )
= - 4w +
i
3w 2-
'5 8 -4
\0 -3 3
X
Sec. 10 Coordinates 185
c.v( w) = (c x ° cv
1
) ° c v (v/)
The inverse matrix gives the x's in terms of the v's and corresponds to
x
cv o cy . In the example in the preceding paragraph, the matrices giving
V and W in terms of the natural bases in 'Si
2
and Jl 3 are
1 2
and
2 3
-3
1 2
2 3
and W
in terms of Y, and let /be a linear function from to 10 whose ^
matrix with respect to Xand Y is A. Then the matrix of/ with respect to
Kand H^is Q~ X AP. This is most easily seen by working with the coordinate
maps. The matrix for/with respect to Kand corresponds to W
C W o/o Cy
1
= (Cjy O Cy
l
) °(Cy°fo C~x) O (C X O Cy
1
),
QT AP
X
1
'5 8 -4
\0 -3 3
U and 10 are the same space, and X is the same basis as Y, and V
If
C
is same basis as W, then P and Q are the same, and the new matrix for
the
/is P~ l AP. Two matrices A and B are said to be similar if there exists an
"invertible matrix P such that B = PAP' 1
. A few properties of similarity
are presented in the exercises.
-1
-1
Sec. 10 Coordinates 187
'0\
2
2. Find the matrix of a rotation of 45° in Jl with respect to:
V2
(a) The basis x1 = Arts.
1/V2 V2
(b) The basis of unit vectors in the directions of x x and x 2 .
3. Show that the elementary matrices E^ of shape m-by-n form a basis for the
vector space of m-by-« matrices. What is the dimension of the space?
'1 2\
4. Let A Show that each of the following functions from the
1
space of 2-by-2 matrices to itself is linear, and find the matrix of each with
respect to the basis of Exercise 3.
6. Let C
U-^— W be an isomorphism. Show that v l5 . . . , vn form a basis
for <U if and only if/Cv^, . . . ,/(v„) form a basis for IB. This proves that
isomorphic spaces have the same dimension.
7. For any two vector spaces CU, 'IT>, let £("07, *U)) consist of all linear functions
from 1J to ID. For/ and g in C(1J, 'ID) and r a number, define functions
8. (a) Show that C(3l", &m ) is isomorphic to the space of all m-by-n matrices,
(b) Show that if dim (1)) = n and dim (ID) = m, and bases are chosen in
'U and 'ID, then assigning to each/in U('U, 'ID) its matrix with respect to
the given bases gives an isomorphism between £(1), ID) and the space
of m-by-n matrices.
(a) Write the matrix of t with respect to the basis 1 , x, x2 for the space P2 .
11. The trace of a square matrix A is defined to be the sum of its diagonal
/I 2\
elements and is written tr (A). For example tr =1+4=5, and
\3 4/
if / is the n-by-n identity matrix, tr (/) = n.
(a) Show that if A and B are square and have the same size, then tr (AB) =
tr (BA).
(b) Show that if A and B are similar, then tr (A) = tr (B). [Hint. This can
be proved in one line by applying part (a) to the right pair of matrices.]
12. Suppose Jl 3 — /
> Jl 3 is a rotation and \ 1 is a unit vector in the direction of
the axis of the rotation. If v 2 and chosen to make (v 1( v 2 v 3 ) an
v 3 are ,
cos a —sin a
, v3) is
/(ei) = e2 ,
/(e 2 ) = e3 , /(e3 ) = ex
metrical reasoning (what is/o/o/?) and check the result of Exercise 12.
[Ans. 120°.]
14. Show that the matrix
3
represents a rotation of Jl and find its axis and angle. (If x is a unit vector
in the direction of the axis, then Ax = x and so (A — I)\ = 0. To show that
A represents a rotation, find its matrix with respect to an orthonormal basis
that includes x.) [Ans. Axis is (3, 3, —1).]
3
Derivatives
SECTION 1
VECTOR FUNCTIONS The purpose of this section is to make the reader familiar with some
specificexamples of vector functions. These examples will be given by
formulas that can be analyzed in terms of the elementary functions of
one-variable calculus. Wherever possible we will give a pictorial descrip-
tion of the function, something that can often be in more than one done
way. In later parts of the book we shall often return to examples like the
ones here, and so develop more familiarity with them, and with the ways
in which they can be applied.
(x 2 +y + z2
{
x +y +z
has as its domain all of 3l 3 ; but, because x 2 +y +2
z2 > 0, its image
contains only those vectors in JI 2 for which the first coordinate is
f
3x + 4jA
\zj
,Vl - x2 -y
h(x, y) = \
x + y
190
Sec. 1 Vector Functions 191
f(x) = x\
h{x) = sin x.
The domain and range of/ are both equal to 31. According to the usual
interpretation of the square-root symbol, the range of g is the set of non-
negative real numbers, and so also is the domain of g. The domain of/? is
31, while the range of A is the interval —1 <y < 1. To emphasize what
its domain is, the function g might be denoted
g:[0, oo)-^3l,
where [0, oo) stands for the interval < x < oo.
and (*!, . . . , xn )
a b
c d
we usually write
a b\ x
c d!\y
for the value of the function at
or (x, y).
192 Derivatives Chap. 3
f(x) = C/i(x), . . .
,fm (x))
for every x in the domain off and/, is called the Ah coordinate function
of/
/x\ +y + Ix z
f\}\ = Uy+jz + zx
Figure 1
Sec. 1 Vector Functions 193
(x,y,z) = (x,y,f(x,y)).
fix, y) = x2 +y 2
,
the graph obtained by holding either x or y fixed and letting the other
one vary. The result is a curve in a plane parallel either to the yz- or xz-
coordinate plane. Two of these are shown in Fig. 2. We have chosen
Figure 2
''cos ^
/(') = for every t in 31.
sin t
194 Derivatives Chap. 3
angle in radians between the vector/(0 and the positive x-axis. As / runs
through 31, the unit circle is covered infinitely often. It follows that the
range of/ is the whole unit circle. The circle is not, however, the graph of
/ The latter is and
a subset of 3l 3 is a spiral whose axis is the /-axis. See
Fig. 3(b). What we have done is sketched the points of the form
(t, x, y) = (/, cos /, sin that make up the graph of the function.
(cos /, sin t)
(b)
Figure 3
oo < <
t oo.
The graph of/is a subset of Jt 4 so we shall not attempt to draw it. Instead
,
tion, we have drawn in Fig. 4 that part of the range of/that lies above the
first quadrant of the xy-plane.
In drawing Fig. 4 we have labeled the axes in the manner usually
associated with a right-hand orientation. Of course, if we were to inter-
change x and y in this labeling, the picture would look different. A
Sec. 1 Vector Functions 195
Figure 4
similar change in Fig. 5(b) would result in a ramp that spirals down,
turning always to the left instead of to the right. To make it easy to see the
relationship between the pictures, we have always chosen the right-hand
orientation for the axes in the 3-dimensional ones. For a discussion of
orientation see Section 7 of Chapter 1.
< u < 4,
The domain off is the shaded rectangle in Fig. 5(a). To sketch the
range, we proceed as follows. Choose a number a in the interval
< u < 4, and set u = a. Then,
and x 2 +y — 2
a2 . We interpret v both as distance along the z-axis and as
the angle between (x, y, 0) and the x-axis. It follows that the image under
/ of the line segment u = a, for < v < 277- (see Fig. 5(a)) is the spiral
whose projection on the .xy-plane is the circle of radius a and whose axis
is the z-axis (see Fig. 5(b)). Next, choose a number b in the interval
196 Derivatives Chap. 3
V
Sec. 1 Vector Functions 197
explicitly defines the upper half of the circle of radius 4, and the same curve
is defined parametrically by the pair of functions
x(t) = 4 cos /,
< t <
y(t) = 4 sin /,
3
the range of the function 51 > Jl defined by
f{t) = rx x + x 2
00 < < t 00,
Figure 7
(a)
is a circle, x + = y) = 1 we get x + y — 0,
2 2 2 2
y Corresponding
1 .to/(x,
which determines a single point (x,y) = (0, 0). See Fig. 7(b). Level sets
are customarily used on topographical maps to show changes in terrain
elevation at regular intervals.
.
+ y2 + Z2 = lc (1)
x2 +y + 2
z2
f(x,y,z)
x +y + z
+y + z = 2 x2 2 2
(2)
x +y +z =\.
We have seen in Example 8 that x + y + z = 2 2 2 2
implicitly defines a
In fact Pis evidently the plane containing the three points (1,0, 0), (0, 1, 0),
and (0, 0, 1) and shown in Fig. 9. The level set determined byf(x, y, z) =
(2, 1) is then the circle C consisting of all points satisfying both Equations
(2) and so is the intersection of S and P.
We summarize the definitions of explicit, parametric, and implicit
representations. A set S is defined:
%n — > %m .
2. parametrically if 5 is
m
the range in 3i of a function
Jl" H 31™,
5 is a level set of/, that is, for some point k in 'SV S is the set of all x
in the domain of/ such that/(x) = k.
A set S defined in some one of the above three ways will be called a
curve or a surface provided that/satisfies certain smoothness conditions to
be described in the last section of Chapter 4. In the meantime we shall use
/
S
VU>
^> \ Sfc
(a) (b)
Figure 10
/(*.*)=(*.*)
(I, I) translated
\ (1, D«
(1, H
/
Figure 11
EXERCISES
1. Suppose that the temperature at a point (x, y, z) in space is given by
T(x, y, z) — x 2 + y % + z2 . A particle moves so that at time / its location
is given by {x,y, z) =
2 3
(/, t Find the temperature at the point occupied
, t ).
2 cos /
3 sin t
yi\ + x2)
What are the images of horizontal lines in the xy-plane? What are the
coordinate functions of/?
Sec. 1 Vector Functions 201
6. For each of the following linear functions: (i) What is the domain? (ii)
Describe and sketch the range, (iii) Describe and sketch the set implicitly
defined by the equation L(x) = 0. What is this set usually called?
(d) L \y
f\ )
y
\\ ifW <\y\,
(c) g(x, y) = sin x. (f) g(x, y)
10 ifW >\y\.
1\ /l
(e) fit) =
\t
, ,
^ cos u sin v\
(b)
g\ )
= I y |
= I
sin u sin t;
|
,0 <v< tt/2.
(sl) f{x, y) =x + y = 1.
2 2
x y
(c) /(*, j) = (x 2 + / + l)
2
- 4jc
2 = 0.
(e) xyz = 0.
(x - y\ /0)
(g) ,
\y + z] \0
(2x +y + z = 2,
(h)
-z = 3.
xyz \ /0
(i) '
\x+y] \\
11. Suppose that the density per unit of area of a thin film, referred to plane
rectangular coordinates, is given by the formula d(x, y) = x 2 + 2y 2 —
x + 1, for -1 <x
< and -1 < J < 1. Sketch the set of points at
1
(a)/j
)=| for -1 <x <2,y = 0, y = 1.
for x2 + f < 4.
=
( )
for x2 + y2 <
(d)/
y-^r?(!) f° r * 2+/s4 -
x + y]
f
\yl \-*+y
Show that /accomplishes an expansion out from the origin by a factor ^/2
and a rotation through an angle 7r/4.
(x\ ix" —y
\yj \ 2xy
What are the coordinate functions of/? Consider the domain space to be
the xj-plane and the range space to be the wr-plane.
0\ /l
and
0/ \1
(b) Find the image of the region defined by < x, < y, and x 2 + y 2 < 1.
(c) Find the angle between the images of the lines y = and y = (l/^/3)x.
[Ans. tt/3.]
What are the coordinate functions of/? Find the image of the region
bounded by the lines x = y, y = x — S, x = —y, y = 8 — x.
204 Derivatives Chap. 3
SECTION 2
f(t) = tXy + x
3
at time /, where x t and x are points in ft . More generally, a function/
taking values in ft" might be given in the form
fit) = (A(0, • • •
,/„(0),
= itx 1 + x , ty x +j , tz x + z )
fk of/. Thus if
= (t.it),
fit) . . .
,/„(0)
'• * > is defined for an interval a < < t b containing /„, we write
lim(f, t
2
) = (lim t, lim f
2
)
(-2 \(->2 4->2 /
= (2, 4).
o
The function g is continuous for all real / because the coordinate functions
/ and t
2
are continuous.
with g(t) defined on an interval a < < t b. We shall first define the
derivative of g and show how it leads to a definition of tangent line to the
curve y defined as the range of g.
The function g has a derivative g\t) at a point / in the interval (a, b) if
ft— n
If the limit exists for each / in (a, b), then g'(t) determines a new function
3i — > 3\ n , just as in the case n — 1 . The derivative is often written dgjdt.
+ 2
-t 2
hm — +
.. g(t h)~ g(t)
^-^ = hm..
-
\((t
1
h)
ir
*-oft\(f + hf - t
3
206 Derivatives Chap. 3
Example 4. If
COS t
If
h(t) = t
2
then h'(t) =
It is clear from Fig. 13 that the vector g(t + h) — g(t) has a direction
which, as h tends to 0, should tend to what we would like to call the tangent
direction to the curve y at g(t). However, since g is assumed continuous,
and the zero vector that we get as a limit has no direction. This difficulty
is overcome in most examples by dividing by h before letting h tend to zero.
Observe that division by h will not change the direction of g{t -f h) g{t) —
if h is positive; it will reverse h is negative. A glance at Fig. 13 shows
it if
that this reversal is desirable for our purposes. (What would be wrong with
dividing by \h\ ?) The derivative g'(t), if it exists and is not zero, is called the
tangent vector to y at g(t ). Of course, any nonzero multiple of g'(t) is then
called a tangent vector, and the line with direction vector g'{t) and passing
through g{t) is called the tangent line to y at g{t). Thus, if g(t ) is a
particular pointon a curve, the tangent line at g(/ ) will have a parametric
representation of the form
tg'Oo) + g(t )-
The tangent vector g'{t ) is usually pictured with its tail atg(/ ) as in Fig. 13.
Figure 13
Sec. 2 Functions of One Variable 207
For the spiral curve given by f(t) = (cos t, sin /, t), we have
/'(0) = (0, 1, 1). The tangent line to the spiral at (1,0, 0) can be
represented by
Figure 14
and in this case the tangent vector/' (0) has length V2.
One reason for singling out g'(t) for special attention as the tangent
vector, rather than some
is that we often want to consider the
multiple of it,
parameter t andg(/) as representing the path of a point
as a time variable,
moving in 31". Under this interpretation, the Euclidean length \g'(t)\ is the
natural definition for the speed of motion along the path y described by
g(t) as t varies. To justify the use of the term "speed," we observe that, for
small h, the number \g(t + h) — g{i)\\\h\ is close to the average rate of
traversal of y over the interval from / to t + h. In addition, if g'(t) exists,
lim
i g« + /.)-g(.)i = ,
|g ( , )|
a->o \h\
of course, the same as what we have called the tangent vector to y atg(/),
provided v(/) # 0.
The fact that the tangent vector shrinks to zero in this example
as g(t) approaches the origin is a reflection of the fact that, if the
velocity vector varies continuously, the speed must be zero at an
abrupt change in the direction of motion. In this way the para-
metrization describes the physical situation well. However, for the
purpose of assigning a tangent line to the path at the origin, the
given parametrization is not useful.
We list here some useful formulas that hold if/and g each has a vector
derivative on an interval (a, b) and cp is real-valued and differentiate there.
2.3 (<pf)'
= <pf + <p'f, (f>
real-valued.
2-4 (f-g)'=f-g'+f-g.
= ([fi(u)]\ . .
. , [/„(«)]')
— —
Sec. 2 Functions of One Variable 209
If Jl > %n has a derivative 31 > 51", then we can ask for the
derivative of g', which we denote by g". Thus we have %— > 31", though
g" may
be defined at fewer points than g or even g'. We also write d 2gjdt 2
forg", and so on for higher-order derivatives.
Example 8. Let 31 —
> ft 3 describe a path in 'J\ 3 with velocity vector
v(t) and speed v(t) at each point g(t). Then t(r) = (\jv{t))y(t) is a tangent
vector of length 1, provided v(t) ^ 0. In any case, we can write g'(t) =
v(t)t(t). If we assume that g has a derivative, we define the acceleration
vector at g{t) by a(f) = g"(t). The physical significance of a(7) is that if g(t)
describes the motion of a particle of constant mass m, then ma(t) is the
force vector F{t) acting on the particle. If we denote by a{t) the length of
a(f), then a(t) is the magnitude of the acceleration, and ma(t) is the
magnitude of the force acting on the particle.
implies that a = (vt)'. Applying Formula 2.3, we get a = v't + ft'. Thus
if t'(t) = 0, the acceleration vector, and hence the force vector at g(t),
has either the same or else the opposite direction to the motion. On the
other hand, if t'(t) ^ 0, we can define the unit vector n by n(t) = t'(t)l\t'(t)\,
= v't It'l
This equation expresses the acceleration a(/) at each point g{t) in terms of
an orthonormal pair of vectors t(t) and n(t). We have |t| = |n| = 1 by the
The pair t(/), n(7), should be pictured at g(t) as in Fig. 16. The third
unit vector b(/) shown there is defined by b = t x n, and is called the
binormal vector to the path, while n is called the principal normal. Thus any
vector naturally associated with the point g(t) on the path can be written
Figure 16
210 Derivatives Chap. 3
EXERCISES
1. Ifg(t) = (e
l
, t) for all real t, sketch in ft 2 the curve described by^ together
with the tangent vectors ^'(0) and^'(l).
% Let/(0 = (r, t
2
,
3
t ) for <t< 1.
(a) Sketch the curve described by/ in ft 3 and the tangent line at (i, |, £).
(b) Find \f'{t)\.
(c) If /(f)= (f, f 2 r 3 ) for all real t, find all points of the curve described
,
by /at which the tangent vector is parallel to the vector (4, 4, 3). Are
there any points at which the tangent is perpendicular to (4, 4, 3)?
4. Show that the curve describedby g(t) = (sin 2t, 2 sin 2 f, 2 cos t) lies on a
3
sphere centered at the origin in Si Find the length of the velocity vector .
v(f) and show that the projection of this vector into the xj-plane has a
constant length.
' (a) Show that if 31 > A3 is continuous for < /, then there is a unique
path g through a given point ^(0) in :R 3 , having v(f) as its velocity
vector at g(t).
(b) Show that a continuous function Si -^—>- R3 defined for < t, deter-
mines a unique path in H3 , having a(r) as its acceleration vector, pro-
vided the initial point ^(0) and initial velocity v(0) are specified.
8. Show that if /is vector- valued, differentiable, and never zero for a < t < b,
then
(b) |
/| is constant if and only if//' = 0.
x" + ax' + bx =
to be solved for a function x(/) taking values in H" and defined on some
interval. We assume that a and b are constants. Show that if the real
/ .
\(t) = c xe
r
^ + c 2 er **,
10. Prove that (/ x ^)' = (/ x g') + (/' x g), where / and £ take values in
3
3l and are differentiable on an interval.
(ll^ Show that if 31 -^-> 31" has a derivative and ^'(r) = for a < t < b, then
(a) Show that if F is identically zero, then P is constant. This is called the
law of conservation of linear momentum.
(b) Show that L'(t) = N(t), and hence that if is identically zero, then L N
is constant. This is called the law of conservation of angular momentum.
13. Show that if a particle has an acceleration vector a(7) at time / and v(t) =£ 0,
functions ...,/„ of/ are integrable, we can define the integral of/ over
1;
(a) If /(r) = (cos t, sin t) for < < / 77/2, compute /(f) </7.
15. If 31 —-* 31" and 31 -^->- 31" are both integrable over [a, b], show by using
the corresponding properties of integrals of real-valued functions that:
kf(t) dt =k \
f{t) dt, k any real number,
Ja Ja
f'(t)dt=f(b)-f(a).
SECTION 3
ARC LENGTH The definition of length for vectors can be used to define the length of
a parametrized curve y. We assume that y is described by a continuous
function '& —> %
n where the domain of
, g is a closed interval a < t <b.
Thus y is the image of [a, b] under g. Corresponding to any finite set P of
numbers a = t < tx < < tK = b, there are points g{t k ), k = 0,
• • •
. . .
,
Let l(y) denote the least upper bound of the numbers l(P). This will, of
course, be infinite numbers 1{P) is unbounded. If l(y) is finite,
if the set of
then y is said to be rectifiable, and l(y) is called its length. It is clear from
the definition that l(y) depends on the function g that describes y and not
just on y itself. This is reasonable if we want to take into account the fact
that some part of y may be traced more than once byg. In practice this is
very often what is wanted. If it should happen that g is not one-to-one,
then we may write 1(g) instead of l(y) to emphasize the dependence on g.
The length of a path is usually awkward to compute directly from the
definition. However, if y is parametrized by a function g such that the
tangent vector g'{t) varies continuously with /, then l(y) is finite and equal
Sec. 3 Arc Length 213
to J"*
\g'(t)\ dt. In fact, it is enough to assume that g is piecewise smooth,
that is, g' is continuously extendable to the endpoints of finitely many
intervals end form the interval a < t < b. This
which placed end to
allows us to find the length of some curves for which the tangent has an
abrupt change, as in Fig. 17.
Figure 17
3.1 Theorem
8fe) ~ g( '"'
lim
tk-Hk-1 t
k — k_1
t
= g-« M ),
we have
where Z(t k — t
k ^) satisfies
lim Z(t k - t
k _x )
= 0.
tk-tk-i
214 Derivatives Chap. 3
KP)=I\g(tk )-g(t k . 1 )\
by
Example 1. The plane curve defined by g(t) = (t \t\, \t\) for —1 < < / 1
/( g ) =£vs + 1 dr
^
parametrically by the one-to-one function
g(x a 6 -
Figure 18
>=U))'
Sec. 3 Arc Length 215
/(y)=fVl +(f'(x)fdx.
If y is a piecewise smooth curve described by a one-to-one function g(t),
a < t <
we can think of l(y) as representing the total mass of y,
b,
assuming y has a uniform density equal to 1 at each point. More generally,
if/? is a real-valued function defined on y, we can form the integral
p(g(t))\g'(t)\dt,
r
and, if it exists, call it the integral of/? over y. In particular, if/? is a non-
negative function that can be interpreted as the density per unit of length of
a mass distribution over y, the integral becomes the definition of the total
mass m of the distribution.
\
g (t)
- q|
2
= + sin + (t - tt)
cos 2 1
2
1
2
= 1 + (t - tt) 2
.
Also, we have seen that \g'{t)\ = yjl. Hence, the total mass of the distri-
bution is given by
f [i + (t- tOV 2 dt
(l+rVf = (§)V2(37T+77
3
).
^ —Tt
Since \g'{t)\ is positive, the function s(t) is strictly increasing. Thus s(t)
is a one-to-one function from the interval a < t < b to the interval
216 Derivatives Chap. 3
describes the same curve y that g(t) does, but with a new parametrization
in which the variable s, with < s < /(y), represents the length of the
path along y from h(0) to /?(.s). The curve y is then said to be para-
metrized by arc length.
s(f)= Vl + 4u
2
rf«
Jo
= \tsj\ + 4r
2
+ \ log (2r + Vl + 4/
2
)
Uy)
p(h(s))ds.
I"
Jo
and use the fact that the speed v(t) has been defined to be \g'(t)\. Equation
3.3 gives further justification for the definition of speed: the derivative of
arc length with respect to time turns out to equal speed.
fs (*»f«) =
i.
dr..
— (s) = 1
(3)
ds v(t)
t'(t) = v(t)^t(t)
ds
Since v(t) > 0, when we take the length of the vectors on both sides, we
can take v(t) outside getting
The factor
d |t'(0l
x(t) t(0
ds v(t)
The terms in the sum are called the tangential and normal (or centripetal)
components of the acceleration, respectively, and the numerical factors are
sometimes denoted a = v' and a n = v 2 k.
t
the path of motion is a straight line, then the unit tangent vector t is a
constant vector so
Thus k =and we have a(f) = v'(t)i{t), which shows that the acceleration
0,
same direction as the tangent or the opposite direction,
vector has either the
depending on the sign of v Exercise 10 shows that k, the curvature of the
.
EXERCISES
1. Find the length of the following curves.
- (a) (x, v) = (t, log cos f), < / < 1. [Arts, log (sec 1 + tan 1.]
2. (a) Set up the integral for the arc length of the ellipse
(b) Show that the computation of the integral in part (a) can be reduced to
the computation of a standard elliptic integral of the form
v 1 - k2 sin
2
6 dd.
and
<r.
7. (a) Show that for a line given by g(t) = tx x + x , the curvature is identically
zero,
(b) Show that if a curve y, parametrized by arc length and given by a
function f(s), has a tangent at every point and has curvature identically
zero, then y is a straight line.
8. Find the total mass of the spiral given by g(t) = (a cos /, a sin /, bt),
/(y)
(" V\ + (/; Qt)) 2 + (f2 (x)f dx,
where /, and 2
are the coordinate functions of y, assumed continuously
differentiable.
Show that if 0(s, h) is the angle between tO) and t(.s + h), which tends to
zero as h tends to zero, then
d(s, h)
k(s) = lim
11. Show that if a curve is given parametrically by a function g{t), then in terms
of derivatives with respect to t, the curvature at^(/) is
2 2
V\g'{t)\ \g'\t)\ -(g\t)-g"(t)f
3
\gV)\
12. (a) Find the curvature function k(/) of the parabola x(/) = 2
(/, t ) for
— oo < t < oo.
(b) Show that, for a circle of radius r, the curvature is given by <<(f) = 1/r.
13. A piece of wire is coiled in a uniform spiral 3 inches in diameter and 2 feet
long. Find the length of the wire if it contains 6 complete turns.
= g(t)p(g(t))\g\t)\dt.
^|J a
(See Exercise 14 of the previous section for the definition of the vector
integral.)
(a) Find the center of mass of the spiral g(t) = (a cos t, a sin t, bt),
< <
t 2tt, with density at (x, y, z) equal to x 2 + y 2 + z 2 .
(b) Show that if y has uniform density 1 and is parametrized by arc length,
then
1
/M<y>
g(s) ds.
(c) Use the results of Problem 17(b) of the previous section to show that
the center of mass then satisfies
1 fKy)
z < \g{s)\ ds.
=J. ''
SECTION 4
h
LINE INTEGRALS The f{x) dx of a real-valued function of a real variable can be
integral \a
At each point g(t) of y we picture the tangent vector g'(t) as an arrow with
its initial point atg(r). Also at g(t) we locate the arrow describing the vector
Sec. 4 Line Integrals 221
Figure 19
j/(g(tj)-g'(t)dt
over y is
O (1, It, 3t
2
dt = (t
2
+ 2r
D
+ 3f
8
dt = 1.
f
•
) )
Jo
v
' lg'(OI
integrand, and hence the integral will be zero. At the other extreme, for a
given field F, if the speed \g'(t)\ is prescribed at each point of the curve, then
the integrand will bemaximized by choosing a curve y that at each point
has the same direction as the field there. Thus the integrand in the line
integral can be thought of as a measure of the effect of the vector field
along y.
Formula (1) can be generalized to any number of dimensions. Thus if
'Ji
n — > 31", and 'J{ —?-+ "J\" describes for a < t <
b a smooth curve, y,
lying in D, the line integral of F over y is still defined by Formula (1), in
which the dot-product is now formed in 31".
222 Derivatives Chap. 3
jt/2
/.
(—cos t sin t + sin t cos t) dt = 0.
Figure 20 Figure 21
W=(F )(s), t
the constant field F(x k ). That is, near x k we approximate F(x) by the vector
Sec. 4 Line Integrals 223
F(Xk
\
\ g'O k ) (tk+ \- tk)
Figure 22
field that assigns the constant vector F(x k ) to every point. The tangential
F(x k ) t(t k ), where t(t) = g'U)/\g'(t)\.
coordinate of F{x k ) is •
approximately
k W
= {F(x k ).t{t k ))\g'{t k )\{t k+1 -t k )
= F(g{t k ))-g'(t k )(t k+1 -t k ).
which we define to be the work done by the field F in moving the particle
through the domain of F along y.
tinuous and hence that the line integral would exist. However these
conditions are stronger than necessary. It is enough to assume that the
path of integration is piecewise smooth and then that the vector field F is
sufficiently regular so that the integral in Formula (1) exists. Thus the
derivative g may be discontinuous at finitely many points, so that y has
sharp corners as shown in Fig. 23.
Then y has a corner at (0, 1,0). where t = -n\2. Indeed g is not differenti-
able there, and in fact lim g'(t) = (— 1, 0, — 1) and lim g'(t) =
( -*jt/2- «-«r/2+
(—1,0, 1), showing that the direction of the tangent jumps abruptly
at / = 77/2. Nevertheless, the integral of F over y exists. To compute it, the
interval of integration would ordinarily be broken at t = 77-/2. However,
in this particular case F(g(t)) •
g'(t) = — t tt/2 unless t = 77/2, and
224 Derivatives Chap. 3
Figure 23
the line integral is easily seen to have the value zero over the interval
< <t IT.
F(g(t))-g'(t)dt
dx dy
4' F^x, y,z)— + F 2 {x, y, z)-f + F3 (x, y, z)
dz
dt.
•'a dt dt dt
Fx dx +F 2 dy +F 3 dz
and can be still further shortened by writing dx = (dx, dy, dz). Then the
formula becomes
JF dz.
F-dx
d = [(2f)(l) + (~t - r
2
)(-l) + 2
(2t )(2t) + (-r - t)(-2t)] dt
\r *=l
\
= 4.
Sec. 4 Line Integrals 225
p(g(t))\g'(t)\dt. (2)
I
•/ a
The function/* can be thought of as the density per unit of length of a mass
distribution over y, in which case the integral represents the total mass
distributed along y. If we write the line integral of a vector field over y in
the form
I
F(g(t))-^-\g'(t)\dt, (3)
•> a \g(t)\
the relationship between Formulas (2) and (3) becomes clear. The integral
of the vector over y depends on the direction of the tangent to y at
field
each point and not just the length of the tangent as in the case of the
integral of a real-valued function over y. If the curve y is parametrized by
arc length, then the two integrals take the respective forms
ly
p(g(s))ds and F(g(s)) • t(s) ds,
1
J
where t(s) =
is a tangent vector to y of length 1.
{dgjds){s)
It is from the definition of the line integral that, in general, the
clear
value depends on the parametrization of the curve y. The extent to which
the value is independent of parametrization is taken up in the exercises.
EXERCISES
^\
I. Compute the following line integrals.
(d) r
y
(dx + dy), where y 1 is given by (x, y) = (cos t, sin t),0 < < 2-n.
t
. [ dx
— + dy
- , .
x
(f)
Jy (e dx + z dy + sin z dz), where y is given by (x, y, z) = (t, t
2
,
3
t ),
<t < 1.
(g) $ y
F • dx, where F(x, y, z, w) = (x, x, y, xw) and y is given by
(x, y, z, w) = (/, 1, /, /), < t <, 2.
226 Derivatives Chap. 3
2. Let y x be given by (x, y) = (cos /, sin /), < t < tt/2, and y 2 be given by
)( x> y) =- u,u),0<u <\. Compute j^ (fdx + g dy) and y2 (fdx +
(1 J"
Mc) f(x, y) = , , ,
^(x, j)
3. Find the work done in moving a particle along the curve {x, y, z) = (t, 2
t, t ),
x =/(/), 0<r<l,
and if —y is the curve described by
then
F-dx = - F dx.
Hg^))'g'{t)dt =\ F(h(s))-t(s)ds,
Ja JO
where h(s) = g(t(s)) and t(s) = (dh/ds)(s). [Hint. Use the change of
variable theorem for integrals.]
7. Let 31" —F ** Jl n be a vector field and y a curve such that the line integral
3/ 4
2
(a) If the density at the point corresponding to t is t
Sec. 5 Partial Derivatives 227
(b) If the density s units from the origin measured along the curve is {s + 1).
(c) If the density at a point is equal to its distance from the origin measured
in ft 3 .
10. Show that if $ y F-dx and j y G-dx exist, then f y (aF + bG) • dx =
a§ y F'dx + b]yG' dx, where a and b are any constants.
defined on [a, b], and & 31" defined on [b, c], with g{b) = A(6), then
12l Let a function g{t) represent the position of a particle of varying mass m{t)
in Jl 3 at time /. Then the velocity vector of the particle is v(/) =g'(t), and
the force vector acting on the particle at£-(/) is F(g(tj) = [m(t)\(t)]'.
(a) Show that F(g(t)) g'(t) = m'(t)v 2 (t) + m(t)v(t)v' (t) , where v is the
speed of the particle.
(b) Show that if m(t) is constant, then the work done in moving the particle
over its path between times t = a and t = b is w = (m/2)(v 2 (b) — v (a)).
2
SECTION 5
+ — f(x
5.1 —
dx,
(x) = lim /(*!,
«-o
. .
.
, xt t, . . . , x n) lt . . . , xt ,
. . . , x n)
The domain space of df\dx is 31", and the domain of df/dx is the subset
t i
of the domain of/consisting of all x for which the above limit exists. Thus
the domain of dfjdx could conceivably be the empty set. The number
t
—
rlf
(x, y, z) = 2xy + z
2
,
ox
d
-f
(x, y, z) = x2 + 2yz,
dy
&(x,y,z) = yi + 2zx.
oz
— (1,2, 3) = 4 + 6 = 10.
dz
df d sin u cos v
— COS W COS I',
du du
df d sin u cos v
= —sin u sin v,
dv dv
of Itt jA _
"
dsin_u_cos_v 1 7T jA
" -sin
77
— sin
. 77
— = — 1.
.
denoted by d 2fldxj dx t
. This can be repeated indefinitely, provided the
derivatives exist. An alternative notation for higher-order partial deriva-
tives is illustrated below, in which the variable of differentiation is denoted
by a subscript.
*-/
_ J Xi
OX,
Jx Xi
dx 2
'
dx\dx)
2 3
/ a / \ a / _L
iXdxjBxJ dxjdxjdx;
Sec. 5 Partial Derivatives 229
Example 3. Consider/(x, y) = xy — x2 .
d
fx = -[ = y-2x
ox
Jfxy
= — J—
-3-3 = 1
ay ox
Jfxx = —=—
-,
dx*
2
2
^ ax ay
2
z=f(x,y), y = b.
Figure 24
230 Derivatives Chap. 3
g'(a) = df
f (a, b).
ox
h'(b) = of
f-(a,b).
By
satisfying
B d
z =/( fl , b) + (x- a) {- (a, b) + (y - b) {- (a, b) (1)
ox oy
is a plane containing the tangent lines found above. In fact, specifying
y = b and x — a determines the two lines.
f(x,y) = 1 -2x - y 2 2
dn\ v
:(i,i) = -2, Stfitt— i.
ax'
:\2 2/ dy\2 2/
Using Equation (1) to define the tangent plane to the graph of/ at (\, \),
Figure 25
See. 5 Partial Derivatives 231
z = i - 2(x - J) - (y — I)
= -2* - + f jk
We can sketch the tangent plane by drawing the two tangent lines in it
where some
b is positive number. We then say that/ is continuous if, for
every point x in the domain of/,
lim/( Z)=/(x).
|x-z|—
The limit relation means that/(z) can be made arbitrarily close to/(x)
if |x — z|, the distance from x to z, is made small enough. As usual,
the intuitive idea of continuity is that the values of the function /should
not change abruptly, resulting, for example, in breaks in the graph of/.
The graphs shown in Figs. 24 and 25 are those of continuous
functions, while Fig. 26 shows a simple example of the graph of
a discontinuous function. The condition that the domain D off
contain all points z sufficiently near every x in D is expressed by
saying that D is an open set.
5.2 Theorem
G(u)=f(u,y + k)-f(u,y)
on the interval with endpoints x and x + h. We find
Now apply the mean-value theorem again, this time to the function
H(v) =fx (x 1, v) on the interval with endpoints y andy + k. We find
F(h,k) = hkfxy (x 1 ,y l ),
- [f(x + h,y)-f(x,y)]
allows us to follow the same general procedure, this time differ-
entiating with respect to y, then x. We find
Now let both /; and k tend to zero. It follows that the distances
\( Xl - x) 2 + O'x
- v)
2
and V(x 2 - xf + (y 2 - yf
both tend to zero; therefore, by the continuity of/xy and/vx we , get
fxv {x,y) =fvx {x,y). But the point (x, y) was chosen arbitrarily, so
2 2
a / a/
Bx By By Bx
z 3
B g 3 g
2
Bx By Bx Bx By
B% B%
etc.
Bz Bx By Bz Bx By Bz
The last two formulas follow from repeated application of the two-variable
EXERCISES
1. Find —
ox
and —
dy
, where fix,
j
y) is:
j
x+y+1
(c) e .
(0 logx J.
\n y
Ans.
dx x(\n xf
a2 a2
2. Find - —/—
oy dx
and - —/—
ox ay
, where / is
(b)f(x,y,z,w)
J
= „
z* + V
y
w*
(c) f(x,y,z)=x^K
dH(x, y)
4 - Find if f<.x,y) = log (x + y).
dx 2 dy
d2f f d2
5. Show that -^ + —^ = is satisfied by
J
ox oy*
Jxx '
Jyv '
Jzz
= "•
= "" 2,/2
7. If/Ocj, x2 , . . . , x„) l/(xf + xj + ... + 4) (
, show that
JXyX-i "i"
Jx 2 x 2 I
• • • I
Jx nx n ^*
I
234 Derivatives Chap. 3
ay d2
/
dx
9..f
f(*.y>
-
SECTION 6
VECTOR PARTIAL
DERIVATIVES
Sec. 6 Vector Partial Derivatives 235
Example 1. Let
/("> v)
Then
cos v
In Example 1 we can
, restrict the vector variable (w, v) so that v = v
is held fixed. Then f(u, v ) describes a curve as u varies. Similarly
f(u , v) describes another curve as v varies. Such curves are called param-
eter curves in the range of/. We shall assume that the coordinate functions
of/ are continuous functions. According to the definition of Section 2,
the vectors
— (u , v ) and — (u , v )
du ov
are tangent vectors to the two parameter curves obtained by varying u and
v, respectively, at the point where they intersect, namely, f(u v ). An ,
example is shown in Fig. 27, where the two curves are singled out by the
Figure 27
236 Derivatives Chap. 3
relations v =
and u = w that determine them; the varying parameter
v
in case v = and in case u = u it is v.
v is u,
of/ corresponds to
7(1,77/2) = (0, 1.7T/2)
£(
du
W2)=(A f(W2)
dv
=
\
V J
A function Jl 3 — 2
> 3t can sometimes be interpreted as a 2-dimensional
flow as follows. A point (x, y, t) in ft is carried by/into a point (x y') =
3
,
f{x, y, t) in Jl
2
, where (x, y) is to be thought of as the position of a par-
ticle in 'Ji
2
at time t = 0, and/(x,7, /) is to be the position of that same
Sec. 6 Vector Partial Derivatives 237
Figure 28
(x, y,
J
dt
t),
for each fixed t form a vector field with the tail of the arrow f (x, y,
t t)
now located at the point (x',y) —f(x, y, t), initially at (x,y). This
vector field is called the velocity field of the flow at time /. If the velocity
should not be assumed that for a nonsteady flow every particle on a given
trajectory follows the same path as the particle that initiates the trajectory.
x cos t — y sin t
f(x,y,t) =
x sin / + y cos t
cos t —sin t\ Ix
sin t cos // \y
238 Derivatives Chap. 3
/(x, /) = y
has at most one solution x for any given vector y. Of course a function
3l
4 —^->> Jl 3
of the form /(x, /) =f(x,y, z, /) would be used to describe
a 3-dimensional flow. Flows of higher dimension are also important in
theoretical mechanics.
Sec. 6 Vector Partial Derivatives 239
EXERCISES
1. Find formulas for the vector partial derivatives df/dx(x, y) and dfjdy(x,y) if
/x+y
(a) f{x,y) = I x -y
2 2
\x +y
x x
(b) f(x, y) = (e cos y, e sin y).
2., For each of the following functions, find the vector partial derivatives at the
""
indicated point. Sketch the coordinate curves for which these vectors are
tangents at the given point, and sketch the tangent plane to the range of/at
that point.
(b) /(«, v) = (cos u sin v, sin u sin v cos , v) for (w, i>) = (tt/4, 7t/4).
3. Find a parametric representation for the tangent plane of Exercise 2(a) and
also a representation for the line perpendicular to the tangent plane and
passing through the point of tangency.
(x + t\
f(x, y, t) = , for t < 0.
2
\y + t
)
(a) Sketch the trajectories of the flow that start at (x, y) = (0, 0), (0, 1), and
(1,1).
(b) For t = 1 sketch the velocity vectors at the points/(;t, y, 1), with (x, y)
chosen as in part (a).
(c) Show that the flow determined by/ is not steady. [Hint. It is not enough
toshow that dfjdt depends on t. Consider /(0, 0, 1) and dfj 3/(0, 0, 1) as
compared with /(l, 1,0) and 3// 3/(1, 1,0).]
5. Consider the 2-dimensional flow
(a) Sketch the trajectories of the flow that start at (x, y) — (0, 1) and (1, 1).
(b) For / = 1 sketch the velocity vectors at the points f(x, y, 1), with (x,y)
chosen as in part (a).
(c) Solve the equation (x',y') =f(x,y, t) for (x, y) in terms of (x' ,
y'), and
substitute the result into df/dt(x,y, t) to show that the flow determined
by/ is a steady flow.
6. The flow
>x cos / — y sin ^
SECTION 7
LIMITS AND Limits and continuity have been introduced for functions of one variable
CONTINUITY in Section 2 and for real-valued functions in Section 5. Here we shall unify
and extend the definitions and show how to construct continuous vector
functions from continuous real functions. To begin, the definition of
limit is based on the idea of nearness. To say for example that
sin x
lim 1
x->0 x
For example, \x — 3| < 0.4 says that the distance between the number x
and the number 3 is less than 0.4 or, equivalently, that x lies in the interior
of the interval with center 3 and half-length 0.4.
2.6 3.4
ciently close to 0" is translated in terms of inequalities as: For any positive
number e, there exists a positive number b such that if
< \
x - 0| = |jc| < 6,
then
- 1 < e.
is aspherical ball with radius e and center x . For example, if x = (1,2, 1),
V(x - l)
2
+ (y - 2)
2
+ (z - l)
2
< 0.5
Xq = (1,2,1)
Figure 30
there are points in S other than x that are contained in a ball of arbitrarily
small radius with center at x. If, for example, S is the disk defined by
x2 +y < 2
1, together with the single point (2, 0), then the set of limit
points of S consists of S together with the circle x 2 +y = 2
1 . See Fig. 3 1
However, the point (2, 0) is not a limit point of S even though it is a point
of S.
We come now to the definition of limit for a function % n
m
> 3\ m Let — .
lim/(x) = y .
x-x
centered at x whose intersection with the domain of/, except possibly for
Figure 31
x .
Domain space Jl
2
Range space 3V
Figure 32
The statement
lim /(x) = y
X—
is also commonly read "The limit of/(x), as x approaches x , is y
."
The domain of/ is all of 31, and at an arbitrary point t of 31, /has limit
/(/). To see this we use known facts about cos t and sin / and consider
This holds because \'a 2 + b2 < \a\ + \b\. (See Problem 15.) Since, using
the fact that sin t and cos t are continuous,
t->to <-'o
2 2
whenever \t — t \
< 8. Hence, Equation (1) shows that \f{t) — /(/ )l < e
whenever \t — t \
< 8.
2
Example 2. Consider the real-valued function defined in all of 3l
f(x, y) = —+—
-x
2
y
-
2
x x .
There is no limit as (jc, 7) —> (0, 0), for example along the line y = x.
We can write
lim/(x) = 00
X—
to describe what happens.
Example 3. The range space and the domain are the same as in the
preceding example and
x
2 -y 2
/(*> y) = 2 2
•
X* +/
,
There is no limit as (x, y) -> (0, 0). If (x, y) approaches (0, 0) along the
line y — ocjc, we obtain
a = 0.
The functions in the last two examples are both real-valued. The
following theorem shows that the problem of the existence and evaluation
of a limit for any function %n — > 51 m reduces to the same problem for the
coordinate functions. The latter are, of course, real-valued.
7.1 Theorem
lim/(x) =y (2)
X—
if and only if
can be made arbitrarily small if and only if all the absolute values
can be made arbitrarily small. But the equivalence of these last two
statements follows at once from the elementary inequalities
and
l/(x) - y |
< y/m max (|/,(x) - y{ \}.
l<i<m
We leave these as exercises.
\sin f/
Then
limA(l) =
u
but lim/2 (0 does not exist because the coordinate function sin (1/r) has
no limit at t = 0.
7.2 Theorem
sin -
is continuous except at t = 0.
7.3 Theorem
Since L is linear,
< A: |x —x |.
valued function of a single variable, 31 — > 31", is precisely one for which
the coordinate functions/^ ...,/„ are continuous real-valued functions of
a real variable. The latter of course include most of the functions of ordi-
nary calculus, such as x 2 and, for x > 0, log x. These same functions
, sin x,
can be used to construct examples of the continuous coordinate functions
that go to make up a vector-valued function of a vector variable,
x
3i
n — >- 3l
m . For example, the coordinate functions of
,, . /sin xv cos x y\
g, given by
h(x, y) = VI — x2 —y 2
log (x + y),
Sec. 7 Limits and Continuity 247
such that < x < 2 and -1 <y < 1 (cf. Fig. 34(a)). The
points (1,0), (\,\), (1, -1), (2,0) all belong to S. The first two are
interior points and the last two are not. More generally, the interior points
of S are precisely those (x, y) that satisfy the inequalities < x < 2 and
— <y <1 1. The formal definition is as follows: x is an interior point
of a subset S of 31" if there exists a positive real number 6 such that x
belongs to S whenever |x — x < b. |
A subset of 'Ji
n
, all of whose points are interior, is called open. Notice
that according to this definition the whole space 31" is an open set. So also
is the empty subset of 31". Since contains no points, the condition
for openness is vacuously satisfied. An open set containing a particular
point is often called a neighborhood of that point.
Example 7. For any e > and any x in 31", we can show that the set
B€ , of all vectors x such that |x — x |
< e is open. In 3-dimensional space,
for example, B(
is the e-ball pictured in Fig. 30 for e = 0.5. Let x x be an
arbitrary point in B e . Then^ — x |
< e. We must show that every vector
sufficiently close to xx is in B e . Set, as in Fig. 34(b),
i)
0!
© .(1,0) (2,0)
(1,-1)
(a) (b)
Figure 34
248 Derivatives Chap. 3
<6+ (e - <5) = e.
Hence, x is in B and
( , the proof is complete.
Example 8. Let / be a finite set of points in .'Jl". Then the set consisting
of all points in Jl" that are not in / is open. Thus a vector function that is
defined at all points of Jl" except for some finite set has for its domain an
open subset of Jl".
A set S is closed if it contains all its limit points, and the closure of S
is the set S together with its limit points. The boundary of S is the closure
of S with the interior of Thus an interval a < x < b, denoted
S deleted.
[a, b], is and is in fact a closed set in the
called a closed interval
above sense. An open interval a < x < b, denoted (a, b), is on
the other hand an open set. The diagram on the left shows an
open set S, then the closure of S, and then the boundary of S.
EXERCISES
In Exercises 1 and 2 take the domains of the functions to be as large as
possible.
ly + tan x\
jj \ln (x + y)
y
fx\ / x2 + 1
(b) f\
'
y
\y
2
- !
(c )/(^v)=-^-+j.
y, if x ^ 0.
= smx
[
if x = 0.
(e) /(/)
Sec. 7 Limits and Continuity 249
. 1 1
fu
(b)/
(c)f(x,y)
(d,/
C
lu
(e)/
(f) /(x) =-
3. When x = (1, 2), draw the set of all vectors x in Si 2 such that
(a) |x - x |
< 3.
(b) |x - x |
= 3.
(c) |x - x |
< 3.
4. Identify as open, closed, both, or neither, the subset of &2 consisting of all
x2 y
2
a bl
6. (a) Prove that the union of an arbitrary collection of open subsets of 31" is
open.
(b) Prove that the intersection of a. finite collection of open subsets of 31" is
open.
(c) Give an example to show that a nonempty intersection of infinitely many
open subsets of &n may fail to be open.
x-^x
Give an example of a function/and a point x such that/is not continuous
atx and (l)/has a removable discontinuity at x (2) /does not have a ,
removable discontinuity at x .
function 31" — t
> 31" is a translation if there exists a vector y in 31" such
that t(x) =x + y for all x in 31".
11. If/ and g are vector functions with the same domain and same range space,
prove
lim (f(x) + g(x)) = lim /(x) + lim g(x),
X—*X X—*-x x—>x
13. Let S be a closed subset of 31". Prove that the complement of 5 in 31" is open.
14. Converse of Exercise 1 3 : If S is an open subset of 31" , show that the comple-
ment of S in 31" is closed.
17. Show that Theorem 7.3 can also be proved by using Theorems 7.4 and 7.5.
SECTION 8
DIFFERENTIALS Many of the techniques of calculus have as their foundation the idea of
approximating a vector function by a linear function or by an affine
function. Recall that a function ,'ft " -^-+ %m is affine if there exists a linear
We shall see that affine functions form the basis of the differential calculus
of vector functions.
'2 3 0\
1 5/
u = 2x + 3y + 1
v = x + 5z + 2
lim(/(x)-^(x)) = 0. (2)
x-*x
However, Equation (2) may appear to say more than it really does. To
seewhat it really says, we observe that from (1) we get
It follows that Equation (2) is precisely the statement that the vector func-
tion/is continuous at x . This is significant, but it says nothing about L.
Thus, in order for our notion of approximation to distinguish one affine
function from another or to measur, in any way how well A approximates
/,some additional requirement is necessary. A natural condition, and the
one we shall require, is that f(x) — A(x) approach faster than x
approaches x That is, we demand that
.
- /(x - L(x - x
Hm /(*) ) )
= Q
x->x |X — X |
A function
is
—
.
domain.
According to the definition, the domain of a differentiable function is
an open set. It is, however, convenient to extend the definition sufficiently
to speak of a differentiable function /defined on an arbitrary subset S of
the domain space. By such an /we shall mean the restriction to S of a
differentiable function whose domain is open.
Figure 35
f(x)—f(x )-a(x-x
,.
hm
)
= 0.
x-*x g \X X |
This is equivalent to
lin/W -»*•> = a.
A ( x ) =/(*o) + /'(*<>)(* - x ).
Its graph is the tangent line to the graph of/at x , and a typical example is
drawn in Fig. 36. Thus we have seen that the general definition of differ-
entiability for vector functions reduces in dimension 1 to the definition
usually encountered in a one-variable calculus course.
y
,
I&-W <7X X
&<w
C7X 2
... ft
0X„
w N
jK-W |t
dXi dx 2
W ...
^(
dx„
Xo ;
8.1 Theorem
If the function %n — Jl
m is differentiable at x , then the differential
dx /is uniquely determined, and its matrix is the Jacobian matrix of
/ That is, for all vectors y in %n ,
<W(y) =/'(xo)y.
f{x, y, z) =
(|
(x, y, z) & (x, y, z)
| (x, y, z))
Thus the differential of/at (1, 1, tt) is the linear function whose matrix is
<2 e 0\
1 o
2x + 2y 2x + 2y s
\y
2
+ 2xy 2xy + x'
How can one whether or not a vector function is differentiable?
tell
Theorem 8.1 says only that if/ is differentiable, then the differential is
represented by the Jacobian matrix. It does not go the other way. Thus
8.2 Theorem
lim
/(*)-/(*.) -Ux-x )
= Q (3)
x-*xq |X Xq|
enough to prove the theorem for the coordinate functions of/, or,
what is notationally simpler, to prove it under the assumption that/
is real-valued. If
x = (*!, . . . , xn)
and
x = {a x , . . . , a„),
set
yk = (x lt ...,**»%! <*„), k = 0, 1 n,
Sec. 8 Differentials 257
n
df
--2(x k -a k )^(x ).
Hence
si dx,. dx,.
where we have used the triangle inequality and the fact that
\x k — ak \
< \x — x |
for k = 1 , 2, . . . , n.
f(x,y) = VT r
all (x, y) such that x +
y < 1 is the same as in Example 2
2 2
defined for
except that we have removed the boundary of the disk so that the domain
is an open set. The Jacobian matrix is
(/x /»)
Wl - x - 2
y
.2
vi-r-f
V X
The entries are continuous on the open disk, and we conclude, by Theorem
8.2, that/is differentiable there.
m= /'cos
sin/
^
— oo < < / 00.
-sin /
cos /
/-V3/2\ 0^
and
\flj
These vectors, drawn with their tails at their corresponding image points
under/, are shown in Fig. 38. Evidently, for curves at least, the differential
is related to the notion of a tangent vector. The affine function that best
approximates /(/) in some neighborhood of / is the vector function of /
given by
f'(t )(t - t ) +/(/„),
Figure 38
Sec. 8 Differentials 259
explicitly by/, together with the tangent plane to the surface at the point
.f(x,y)-f(x ,y )
v-axis
iJ.^/'(xo)(;:;:)
Figure 39
A (x, y) = f(x , y ) + f (x ,
y ) all (x, y) in 5l
2
.
y -jo
The difference A(x, y) —f(x ,y ) is a good approximation to the incre-
ment f(x,y) — f(x y , ), provided (x, y) is close to (x ,
y ). Figure 39 is
EXERCISES
1. A linear function Hn —L
Rm is defined by a matrix (o„), and
is a vector in S\.
m Construct
. a specific example by choosing m and n, a
matrix (a^), and a vector
3
(b) Explicitly defines a line in Jl .
3
(c) Explicitly defines a plane in Jl .
3
(d) Parametrically defines a line in Si .
What condition must the matrix (a„) satisfy in order to give an example
for (e)?
(x\ = lx" —y
\yl = \ 2xy
Sec. 8 Differentials 261
1/V21
(a) (b) (c) (d)
1/V2J
2
Ans. (c)
2
3. Find the derivative of each of the following functions at the indicated points.
(a)/ x2 + y2 at
\yi \yi \h
(b) g(x, y, z) = xyz at (x, y, z) = (1 , 0, 0).
/sin ^
(c) /(/) = at t = -7 Ans.
cos t
A
(d) /(/) = / at / = 1.
>^.J)= I I
\x 2 + y2
(f) ^
/l
differential of P at (1, 1, 1) Ans.
1
f(x, y) = 4 — x2 — y2 .
bx b2 b3
\c 1 c2 c3
/'(x )x = lim
/(x + fx ) -/(x o)
<-<-o t
x v ^rr7
i \ _ (
\y! \ x +y
(c) f(u, v) = |« + f|.
f(xsin(l/jc), x2 + y2) if x # 0.
(d) h{x,y)
{(0,
2
x +y 2
), if x = 0.
15. Is the function 51" — Jl defined by^(x) = |x| differentiable at every point
of its domain?
17. Show that if /and g are differentiable at x and a is a real number, then
xy
x ^ ±y,
f(*,y)
= r
x = ±y,
has a Jacobian matrix at (0, 0), but that it is not differentiable there.
1
\x" sin -, x ¥=0
fix) =
x =0
is differentiable for all x but is not continuously differentiable at x = 0.
SECTION 9
NEWTON'S METHOD
In this section we treat Newton's method for approximating a solution of
an equation /(x) = 0, where %n —>% n
is a nonlinear function. If/ is
mated by 3, 3.1, 3.14, 3.141, ... . Then we can form the sequence of
264 Derivatives Chap. 3
the vector (V2, v). It is natural to pair the numbers as we have, and not
in some other order, because it so happens that the entries in each pair are
accurate approximations to V2 and 77 to as many decimal places as are
given.
To make the ideas precise we define the limit of a sequence in ^l". Let
n
xls x 2 x 3
, , . . . be an infinite sequence of vectors in 3\ ,
just one vector
corresponding to each positive integer. Suppose there is a vector x in 31"
such that, for any e > 0, there is an integer N for which
|x* - x| < e
whenever k > N. Then we say that the given sequence converges to the
and we write
limit x,
lim x*. = x.
• *i «x 2
• xj
• x4
• x<
Figure 40
the fact that lim xk — v 2 and lim yk = 77 implies that lim (xk ,
yk ) =
(V2, 77).
We turn now
Newton's method for approximating a solution of an
to
equation/(x) =
where/is real-valued and x is a real variable.
0, in the case
We assume that /is continuously differentiable. If the graph off should
happen to be convex as shown in Fig. 41 then it is geometrically clear that ,
the tangent line to the graph at (x ,f(x )) crosses the *-axis at a point x t
Sec. 9 Newton's Method 265
Figure 41
This time we use the tangent line at (x 1 ,f(x 1 )) and call its intersection
with the x-axis x 2 Thus we can generate a sequence of numbers x x u
. ,
x2, .approximating x.
. .
We observe first that the tangent line at (x ,f(x )) has the equation
x-axis, we set y = in the above equation and solve for x x The result is .
=f'(x )( Xl
- x ) +/(*„),
whence
/'Mix! - x Q ) = -f(x ).
If/'(x ) ^ 0,
x, — x = — f(x )
f'(*o)
or
/(*o)
Xi — JVa
/'(*o)
formula to get
fix,)
fix,)
In general, we compute xk+1 by
f(x k )
f'(x k )
266 Derivatives Chap. 3
_ x (4-3)
x k+l — k -
2x k
_ (4 + 3)
2x k
Thus we get x x =\= 1.75. Substituting this value in the above formula for
k = 1 gives x2 = f| *** 1.7321. This approximation to y 3 happens to be
correct to three decimal places, though we have not proved that.
whence
/'(xoXXi - x ) = -/(x ).
_1
Now if/'(x ) has an inverse matrix [/'(x )] , we can apply the inverse to
both sides, getting
Finally
Xi = x - [/'(x )]- 1/(x ).
_1
In this equation [/'(x )] /(x ) is the vector obtained by applying the
inverse of the matrix /'(x ) to the vector /(xq). The vector x 1 is the first
x2 = Xl - [/'(xoryxxo).
(x 2 +f - 2\
/<*.*)-
\X 2 —y — 2
1
2 —
Since we require both x + y
2 2 —
2 = and x j
2
— 1 = 0, it will be
helpful to sketch the curves defined by these two equations. The exact
solutions are represented by the four points of intersection of the circle
x 2 + y 2 — 2 = and the hyperbola x 2 — y 2 — I = shown in Fig. 42.
The choice of an initial approximation depends on which solution we want
to approximate. Looking for the solution in the first quadrant, we try
x = (1, 1). Since Figure 42
(x2 +y 2
-2\
f(x,y)='
r
we have
2x 2)'
f'(x,y) =
2x 2y
and
[/"(*, joi-^ 1
ly- ly
Therefore,
- [/'(*, >0rV(*, y) =
\y/ \y
1.25
0.75
268 Derivatives Chap. 3
4 (1.25) \ _ /1.225
2 (0.75)' +
4(0.75)
2(1. 225)
2
+ 3
4(1.225) I /1.22574 \
x ""
2
2( 1.70833) + 1 / \0.707108/'
4(1.70833)
Similarly we get
1.22474
x
0.707107
and
1.22474
x.
0.707107
As in the previous example, further iteration using only five places after
9.2 x k+1 = x fc
- [/'(x )]-y(x,)
Sec. 9 Newton's Method 269
fx 2
+y 2
-2\ /0\
[X 2_ f _ l
j
-\q]>
WI ^i 'i
-i
i)
SO
'X
x - [/'(xo)ry(x)
\y
'
-2x\ + 4x + 3\
"
4 _ /1.25
2yl + 4y + 1 / \0.75
4
In the next step
'— 2(1.25) 2 + 4(1.25) + 3'
4 /1.21875
j
==
-2(0.75)
2
+ 4(0.75) + 1 / \ .71875
Continuing, we arrive at
'1.22474
0.707107
which agrees with the result obtained in Example 3, though it takes more
steps.
270 Derivatives Chap. 3
Chapter 4.
EXERCISES
1. (a) Sketch the graph of/(x) = $x - x for -2 < x < 2.
(b) Sketch the tangent lines to the graph of/at x = J, x = J, and x = —J.
(c) For each of the three choices for x in part (b), what solution of/(*) =
can the Newton iteration be expected to converge to ?
(d) Discuss the choice x = ^3/9 for an initial approximation to a solution of
fix) = 0.
2. Show that Newton's method will not produce convergence to the unique
solution of ^x = unless the initial choice x = is made. [Hint. Show that
xk = (-2)%.]
3. (a) To approximate the solution of cos* —x = by Newton's method,
show that when x Q is chosen, then for k > 0,
xk sin xk + cos x k
x k+i - l + sin x fc
x2 +y - 1 =
x + y* - 2 =
by following the steps below:
5. Let
'
Ix +y + z
f(x,y,z) = U 2
+/ + 2 2
\ x3 + y3 + z3
and find /'(l, 2, -1). Taking x = (1,2, -1), apply the modified Newton
Formula 9.2 to approximate a solution to f(x,y, z) = (2, 6, 8).
6. Let
2
(H 2 + UV \
U + V
6
J
Noting that ^(1, 1) = (2, 2), use Newton's method 9.1 or its modification
9.2 to approximate a solution to^(H, v) = (1.9, 2.1).
4
Vector Calculus
SECTION 1
/(x + ru)-/(x)
(x) Iim
The domain of df/du is the subset of the domain of/ for which the above
limit exists.
The connection between the derivative with respect to a unit vector
and the differential is provided in the following theorem.
1.1 Theorem
^(x)=/'(x)u
on
272
Sec. 1 Directional Derivatives 273
. J(x+tu)-f(x)-f'(x)(tu) =
l
t-o \tu\
which is equivalent to
lim
1 f(x + l u)-m_ fXx)tt
(-o |u| t
/(x + ,u)-/(x)
|im =/ (xK,
£->0 t
The equation
^(x)=/'(x)u
on
It follows that the directional derivative of/in the direction of u has at the
origin the value dfjdu (0, 0, 0) = f.
Let 'J\
2 —
> 01 be a function whose graph is a surface in 3-dimensional
Euclidean space, and let u be a unit vector in 3l 2 , i.e., |u| = 1. An example
Figure 1
—
df ,
(x), = hm /(x+Mi)-/(x)
,.
.
du <->o t
f(x + tu)-f(x)
is the slope of the line through the points /(x + tu) and/(x). It follows
that the limit, (dfjdu)(x), of the ratio is the slope of the tangent line at
(x,/(x)) to the curve formed by the intersection of the graph off with the
plane that contains x and x + u, and is parallel to the z-axis. This curve is
drawn with a dashed line in the figure. The angle y shown in Fig. 1
o
tan y = — (x).
ou
The situation here is a generalization of that illustrated in Fig. 24 of
Chapter 3, Section 5. If we choose u = (1, 0), the angle y becomes the
angle a in the earlier figure and
9u dx
On the other hand, if we take v = (0, 1), then y is the angle (3 in Figure 24,
and
d\ dy
The mean-value theorem for real-valued functions of a real variable
can be extended to real-valued functions of a vector variable as follows.
1.2 Theorem
Let 51" —
> 31 be differentiate on an open set containing the seg-
ment S joining two vectors x and y in 5l n Then there is a point x .
on S such that
/(y)-/(x)=/'(x )(y-x).
g(t + h g(t)
= f(m(t))(y - - -
l~ x) ± |y x| Z(h(y x)),
h
whence
g'(t) =/'(m(0)(y - x). (1)
= g'C),
276 Vector Calculus Chap. 4
1.3 Theorem
If 31" — > 3i
m is differentiable on a polygonally connected open set
EXERCISES
1. For each of the following functions defined on 3-dimensional Euclidean
space, find the directional derivative in the direction of the unit vector u at
the point x.
(b) h(x, v, z) = xyz, u = (cos a sin sin a sin P, cos /3), x = (1 0, 0).
/S, ,
(c) f(x,y), x = (x, y), and y = (cos a, sin a). (Assume that/is real-valued
and differentiable.)
(11)
W5' V5/'
Arts.
V5 'J
x
(b) f(x,y) = e sin v at x = (1, 0) and in the direction (cos a, sin a).
x+y
(c) f{x,y) = e at x = (1, 1) in the direction of the curve defined by
g{t) = (r
2 3
t ) at g(l) for / increasing.
,
Sec. 2 The Gradient 211
3. Find the absolute value of the directional derivative at (1, 1,0) of the
function /(x, j, z) = x + ye in the direction of the tangent line at^(0) to
2 z
defined by g(t) = {t
2
- t + 2, /, t + 2) at^(0). [Am. -I/V3.]
[Ans. 8/V6.]
(x, y, z) = (u v, u
2
+ v, u). [Ans. 2j V 3.]
7. If the temperature at a point (x, y, z) of a solid ball of radius 3 centered at
is given by T{x, y, z) = yz + zx + xy, find the direction in which T
-^ (0, 0, 0)
8. Show that the mean-value formula of Theorem 1.2 can be written in the form
/(y) -fix) df
(x„),
|y -x| du
x\y\
(*,y)*(0,0)
U (x,y) = (0,0)
has a directional derivative in every direction at (0, 0), but that / is not
differentiable at (0, 0).
SECTION 2
In the first section of Chapter 3 we looked at some examples of vector THE GRADIENT
fields. Many of the most important ones arise as gradient fields, which are
described below.
If/is a differentiable real-valued function, "Ji" -^ %, then the function
V/ defined by
*»-£«-£«)
278 Vector Calculus Chap. 4
is called the gradient off. The gradient is evidently a function from Jl" to
;K", and it is most often pictured as a vector field.
Figure 2
The reason is that (dfjdu)(x) =/'(x)u, and the application of the matrix
/'(x) to u is the same as the dot product V/(x) u. Using Equation 2. 1 we
• ,
can easily prove the following theorem. (This is the origin of the use of the
term gradient, usually meaning incline or slope.)
2.2 Theorem
Let .')(" —^- 'Ji be differentiable in an open set D in 31". Then at each
point x in D for which V/"(x) # 0, the vector V/(x) points in the
Sec. 2 The Gradient 279
^( X )=V/(x).u<|V/(x)||u|
ou
= |V/(x)|
But when u = V/(x)/|V/(x)|, and only then, we have
^(x) = |V/(x)|.
Thus the rate of increase dfjdu{x) is never greater than V/(x)| and |
is
ft -U- 5l 3
is given by f(t) = (t, t
2
, t) and 5l -*-+
3
% by g(x, y, z) =
x cos (y + z), then g(f(t)) = ? cos (t
2
+ /). This defines a new function
from For example, if g is denned in a region D of 5l 3 and/
51 to 51.
2.3 Theorem
no = vs(/(o) •/'(')•
280 Vector Calculus Chap. 4
Proof. By definition,
F(t + h) - F(t)
F'(t) lim
h
s(f(t\ h))-g(f(t))
lini
= Vg(x ) •
(y - x),
h h
The vector x is now some point on the segment joining /(/) and
f{t + h). (Why is x in the domain of/?) Since g was assumed
continuously differentiable, Vg(x) is continuous, and so V^(x )
tends to V«-(/(r))as /; tends to zero. The dot product is continuous,
so F'(t) exists, with
FP7A = ..
lim
„ f
Vg(x
, fit + h)- f(t)
(0 ) • .
= Vg(/(0) •/'(«)•
known only that/(/ ) = (— 1> l)and/'(/ ) = (2, 3), then the composition,
F(t) = g(f(t)), is known only at / = t and F'(t ) cannot be computed by
,
interested in S when it is a curve, and for % 3 -^-> %, the sets S most often
considered are surfaces. Examples are shown in Fig. 3.
Vf (x )
y/>o)
Figure 3
= V/(x ).v.
2.4 V/(x ) 0.
Figure 4
and its tangent contain a common line with direction (1,1, V2), and that
the normal vector to the tangent is perpendicular to that line.
Putting together Theorem 2.2 with 2.4, we see that the direction of
maximum increase of a real-valued differentiable function at a point is
V/(x, y) = (y, x) and Vg(x,y) = (2x, —2y). Hence, for each (x, y) we
have Vf(x,y)-Vg(x,y) = 0. Thus the normal vectors, and hence the
tangents, are perpendicular. It also follows that a tangent to a curve from
one family points in a direction of maximum increase for the defining
function of the other family. The argument fails at (0, 0). See Exercise 6
following this section.
Sec. 2 The Gradient 283
Figure 5
As another example of the use of the gradient, we shall prove the follow-
ing theorem, which is an extension of the familiar formula
ff'(t)dt=f(b)-f{a\ (1)
2.5 Theorem
V/.rfx=/(b)-/(a).
1
In particular, the value of the line integral of a gradient field
over a curve depends only on the endpoints of the curve; thus, in
this case, the notation
V/.rfx=/(b)-/(a)
f
is justified.
284 Vector Calculus Chap. 4
fv/.dx = fV/(G(0)-G'(0<*r
Jy Ja
h
jf(G(t))dt.
I at
But by Equation (1), the fundamental theorem, the last integral is
\{x 2 +y 2
). Then Vf(x,y) = (x,y). If y any continuously differ-
is
entiate curve with initial and final endpoints (Xi, ji), and (x 2 ,y 2 ) then
This is what we would expect formally from the fundamental theorem. If,
on the other hand, we letg(x, y) = xy, we have Vg(x, y) = (y, x), and for
any curve of the kind previously considered we have
\y dx + x dy = x 2y2 — xxyv
Jy
EXERCISES
1. Find the gradient, V/", of each of the following functions at the indicated
points.
2. For the functions in the previous problem find the direction and rate of
maximum increase at the indicated point.
3. (a) The notation grad/is often used for the gradient V/ Show that
—
df
(x) = grad/(x) •
y.
Sec. 2 The Gradient 285
(b) The notation V y /is often used for the derivative df/dy. Show that
V y /(x) = V/(x) •
y.
4. Find, if possible, a normal vector and the tangent plane to each of the
following level curves or surfaces at the indicated points.
(a) Show that VF = ( — Sf/dx, — df/dy, 1), which is never the zero vector.
(b) Find a normal vector and the tangent plane to the graph off(x, y) =
xy + yex at (x,y) = (1, 1).
7. If g(x, y) = ex+v and /'(0) = (1,2), find F'(0), where F(t) =g{f(t)) and
/(0) = (1, -1).
8. Let y be a curve in ft? being traversed at time t = 1 with speed 2 and in the
direction of (1, —1,2). If / = 1 corresponds to the point (1, 1, 1) on y
find the rate of change of the function x + y + xy along y at t = 1.
9. If f(x, y, z) = sin x and F(t) = (cos /, sin r, /), find g'i-n), where g(t) =
10. Let ft —> ft" be differentiable. Let ft." > ft be continuously differen-
tiable and such that the composition g(t) =f(F(tj) exists. If F'(/ ) is
-L r"•
~ fx
-J.
d Xl
' ' ' '
' dxn -
(a) For the case n = 2, show that if the system
¥
— (x, y) = Fx (x, y),
d
—f (x, y) = F2 (x, y)
286 Vector Calculus Chap. 4
f(x,y) = F (x,y)dx
1 Q(;)
J
and
12. (a) Find the function / of Problem 11(b) by direct computation of a line
integral of V/from (0, 0) to (x, y).
(b) Find the function /of Problem 11(c) by direct computation of a line
integral from (0, 0, 0) to (x, y, z).
- **> •
- tn
d Xl ' dxn
dxj dxi
(xy, x 2
) and G(x, y, z) = (y, x, —zx), are not the gradients of real-
valued functions.
~y X
15. Let F(x, Jv) = , A for (jc, y) # (0, 0).
'
(
\x- + y ,
2
x* + y-f
(a) If F
x and F 2
are the coordinate functions of F, show that
dj\ _ 9F2
dy dx
(b) Show that in any region not meeting the v-axis, F is the gradient of
f(x, y) = arctan (y{x), the principal branch.
Sec. 3 The Chain Rule 287
(c) Show that Fis not the gradient of any function/differentiable in a region
completely surrounding the origin —for example, the region defined by
x2 + y 2 > 0.
16. Show that if a force field in a region D of Jl 3 has the form V/ for some
continuously differentiable function :R 3 — >• 31, then the work done in
moving a particle through the field depends only on the level surfaces of/on
which the particle starts and finishes.
(a) Show that the equipotential surfaces are orthogonal to the field.
(b) Find the equipotential surfaces of the field V/(x, y, z) = (x, y, z).
(c) Find the field of which f(x, y, z) = (x 2 + y 2 + z 2 )~ 112 (the Newtonian
potential) is the potential function.
(d) Find the field of which f(x,y) = —\ log (x 2 + y2 ) (the logarithmic
potential) is the potential function.
(e) Show that if/(x) = |x|
2 ~"
(the generalized Newtonian potential in Jl",
n > 3), then V/(x) = (2 - «) |x|-"x.
18. Let 31 — F
> 3l 3 define a smooth curve y for a < < / /?, with y lying in a
rF(t.
V/- </x = V/(F(r)) •
F\t).
dt JF(a)
n Show that
joining x to x for all x in some open set D in 3i y/ = F, where .
SECTION 3
One of the most useful formulas of one-variable calculus is the chain rule, THE CHAIN RULE
used to compute the derivative of the composition of one function with
another:
If two functions /and g are so related that the range space off is the
same domain space of g, we may form the composite function g °f
as the
by first applying/and then g. Thus,
S°/(*) =£(/(*))
for every vector x such that x is in the domain of/and/(x) is in the domain
of g. The domain of g of consists of those vectors x that are carried by/
into the domain of g. An abstract picture of the composition of two
functions is shown in Fig. 6.
Domain g of
(range/) n (domain g)
Figure 6
x = gi(u, v)
y = g (w, ^)-
2
x = #(u),
where x = (x, y), u = (it, v), and g has coordinate functions g Y g 2 Now ,
.
u=f (s,t) 1
v =f 2 (s, t).
Sec. 3 The Chain Rule 289
u =/(s),
or
*=*(/())•
Using the notation g °/for the composition of g and/, we can also write
x = g °/(s).
To determine the derivative of g °/in terms of the derivatives of g and
/, suppose that %n — 3l
m is differentiable at x and that 3l
m -i-y 3l p is
an m-by-n matrix. It follows that the product g'(y )/"( x o) is defined and
is a p-by-n matrix. The chain rule says that this product matrix is the
derivative of g °f at x . Because the matrix product corresponds to
composition of linear functions, the result can be stated in terms of
differentials: the differential of a composition is a composition of
differentials.
= s ...,p-
(g °/)'«) (I
-
The right side of this last equation can be written as a matrix product in
terms of the derivative matrices
g'(/(0)=(^(/(0),...,^(/(0)):
and
\f'Jt)i
,
%m J_^ ftp
s c/wj\
-
/foo
I
and
i^(/(x))^(x). CD
t =i dy k dxj
Vg,(/(x))-^(x)
OX
= a
-^-
ox
)
(x), (2)
,
Sec. 3 The Chain Rule 291
u + v). We find
(v u\ I2x 2y\
-d
i ,)
/'<*•-">
=u _ 2y \
To find (g °f)' {2, 1), we note that/(2, 1) = (5, 3) and compute
'J\
— > 'A is often denoted, in conjunction with the equation y =/(x), by
dy/dx. Similarly, the partial derivatives of a function 3l 3 — > 51 are
commonly written as
—
dw
dx
,
dw
—
By
, and
,
—
dw
dz
vv = xy 2 e x+3z then
,
?W
= 2 x + Zz
y e + xy 2 e x+ Zz^
dx
^ = 2xye^\
By
^ = 3x/e^.
y
dz
.
This notation has the disadvantage that it does not contain specific
reference to the function being differentiated. On the other hand, it is
notationally convenient and is,moreover, the traditional language of
calculus. To illustrate its convenience, suppose that the functions g and/
are given by
dw dg dx dg dy dg dz"
ds dx ds dy ds dz ds
(3)
dw dgdjK dgdy dgdi
dx dx dt dy dt dd dt
the domain space off is 1-dimensional, that is, if/ is a function of one
variable. Consider, for example,
w = g(u,v),
W =f(t)=[ \/2 (0
d{g°f) _ dw
'
dt dt
dw dw\ . dt
and
( Hu dv) "
dv
idt,
Sec. 3 The Chain Rule 293
"f
dw _ idw dw\ I I _ dvv du .d_w dv .
dt \du dv / \ a,-, I du dt dv dt
x = g(s), s=f{t),
the more explicit Formula 3.2 can be written as the famous equation
dx dx ds
(5)
dt ds dt
x= u2 + v3 (u = + t 1
and
y= e uv , [v = el ,
/(*)=[ )
= ()> -co</<co,
fu\ /it
2
+ v
3
\ {x\ ( — co<u<oo.
y
v) \ e uv J \yj' [— oo < y < co.
IH)
294 Vector Calculus Chap. 4
l \
U
The matrix of the differential of g at is
'
dx dx^
du dv\ I 2u 3v
2
uv l
dy dy J
\ve ue
\du dv/
fx\
= (g°f)(t), -co</<co.
yi
Hence, the two derivatives dxjdt and dyjdt are the entries in the Jacobian
matrix that defines the differential of the composite function g of. The
chain rule therefore implies that
That is,
dx
— = dx du + dx
- dv = 2u + 3v ,
_ ,
„ ,
e
t
dt du dt dv dt
(6)
dy
= d_ydu dydv
+ = veUV + ueUV+t
dt du dt dv dt
—
dx
l
(0) =2+ 3 = 5,
dt
dz dx dz dy
. . .
+ + + • •
dx dt dy dt
Sec. 3 The Chain Rule 295
Example 4.
Example 5. Let
( x =f(u, v).
z = xy and
\y = g(u, v).
du dv du dv
Suppose also that /(I, 2) = 2 and g(l, 2) = -2. What is dzjdu(\, 2)?
The chain rule implies that
—
dz
= dz dx
+ dz dy
.
(7)
du dx du By du
When u= and v = 2,
1 we are given that x =/(l, 2) = 2 and y =
g(l,2) = -2. Hence
^(2,-2) =
dx
yU =_ 2 =-2
^(2,-2)
dy
= xU =_2 = 2.
|^(1, 2) = — 2)( — 1) +
( (2)(5) = 12.
du
z = ax 2 + bxy + cy 2 .
dw
(-1) =f\a - b + c)(-2o + 26 - 2c).
dx
The Jacobian matrix, or derivative, of a function /from 31" to 31" is a
square matrix and so has a determinant. This determinant, det/'(x)>
is a real-valued function of x called the Jacobian determinant of/; it plays
a particularly important role in the change-of-variable theorem for
integrals taken up in Chapter 6. At this point we remark on a simple
corollary of the chain rule and the product rule for determinants:
3.3 Theorem
If 31" —/
> 31" is differentiate at x and 31" —g
> 31" is differentiate
at y =/(x ), then the Jacobian determinant of g ° / at x is the
product of the Jacobian determinant of /at x and that of g at y .
If/ is defined by
'x /iCx,, . . . , x n)
/„(*!, • • , „)
3(xi, •
•
or equivalently
d( Xl , . .
Sec. 3 The Chain Rule 297
Example 7. Let
2xy
Then,
o(x, y)
,
det
\2y 2x1
= 4(.x + y%2
x \ fr cos 6
y J \r sin 6
—-
d(w, z)
d(r, 0)
(r , O)
= d(w,
d(x, >)
z)
(x , y
,
)
——
d(x, y)
d(r, 0)
,
(r ,
O)
EXERCISES
1. Given that
xz + xy + 1
2h
2
>' + 2
'3
1
2. Let
298 Vector Calculus Chap. 4
and
x + 2y + r
s\y
x2 —y
3 + 4a3 \
(a) Find the Jacobian matrix of go fat t = a. Ans.
\2a - 1
(b) Find dujdt in terms of the derivatives of x, y, z, and the partial derivatives
of u.
f(t) = J t
2 -
-i:
and
(x ) (x ) = 2,
ex Sy dz
and
F(x, y, z) = x2 + y2 + zz = w.
Given that
df df
-f-
= x2 + 2xy
J
- Jy 2 and -^ = x2 - 2xy
J + 2,
dx dy
6. If w = Vx + y +
2 2
z2 and
find dwjdr and 9>v/30 using the chain rule. Check the result by direct
substitution.
/I
^17 \« sin v] \-Trj2 < v < tt/2,
arctan
(a) Show that if differentiable vector functions/and^- are so related that the
composite function^ o/is an identity function, then the transformation
(df(x) g) ° (dx f) is also an identity function for x in the domain of g of.
(b) How does this exercise apply to the preceding one?
pitfalls. Resolve the following paradox: Let w =f(x, y, z) and z = g(x, y).
By the chain rule
dw dw dx dw By dw dz
dx dx dx dy dx dz dx
Hence,
dw dw dw dz
dx dx dz dx'
and so
dw dz
dz dx
—dz
= 3 and —
dx
= 5.
11. If y = f(x — at) + g{x + at), where a is constant and /and g are twice
differentiable, show that
d2 d2
a2 —y = —z.y
x (Wave equation)
dx' 1
dr
(x\ fr cos d\
)
= j J
, show that
y) \r sin 0/
df df
x
Tx + yyy- nf^y^
14. Show that for a differentiable real-valued function g{x,y),
dg(x, x) dg dg
+
-lx-=Tx (X '
X)
Ty^ x) -
Using the function f(x) = (x, x) show that this result is equivalent to the
statement (g <>/)'(*) = g'(f(x))f'(x).
Apply the equation to the function g(x,y) = xv .
15. (a) If
w =f(x,y, z, t), x=g(u,z,t), and z = h{u, t),
write a formula for dwjdt, where by this symbol is meant the rate of
change of w with respect to /, and where all the interrelations of w, x, z, t
are taken into account,
(b) If
f(x, y, z, i) = 2xy + 3z + t
2
,
g(u, z, t) = ut sin z,
h{u, t) =2u + t,
Sec. 4 Implicitly Defined Functions 301
/x ,(2,l)=/yx (2 )
l) = l, /w (2, 1)=2.
2 -^-*- ft 2
Let Jl be defined by
^(w, i>) = (w + y, «i>).
17. Calculate the Jacobian determinants of the following functions at the points
indicated.
rcos <£
18. Using the functions/and^- in Exercises 17(a) and (b), compute the Jacobian
(0\
determinant of the composite function^ °/at
(
[Ans. -1120.]
It may happen that two vectors are related by a formula that doesn't SECTION 4
express either one directly as a function of the other. For example the IMPLICITLY DEFINED
formula FUNCTIONS
P -~k
302 Vector Calculus Chap. 4
may express the relationship between pressure p on the one hand, and
volume and temperature (v, t) on the other hand, of the gas in some
container. Or the equations
x2 +/ + z2 = 1
x +y + z =
may be interpreted as a relation between the three coordinates of a point
on a certain sphere of radius 1 centered at (0, 0, 0) in Jl 3 . In neither
example do the equations give an explicit formula for any of the coordinates
in terms of others. In this section we study the application of calculus to
such relations.
For any two functions 31 2 — > 31 and 31 — > 31, the equation
F(x,y) = (1)
/i(x) = Vl - x2 ,
-1 <x < 1.
f (x) = -Vl -
2 x2 ,
— <x < 1 1.
Figure 7
Sec. 4 Implicitly Defined Functions 303
F(x, y) =
if F(x,f(x)) = for every x in the domain of/.
x+ y + z- =0 1
(2)
2x +z+2=
determine y and z as functions of x. We get
y = x + 3, z = — 2x — 2.
"MM-"-;
Although Example 1 shows that an implicitly defined function need
not be continuous, we shall be primarily concerned in this section with
functions that are not only continuous but also differentiable. The implicit
function theorem appearing at the end of Section 6 gives conditions for
/ defined by an equation F(x,/(x)) 0.
the existence of a differentiable =
Before discussing this theorem, however, we consider the problem of
finding the differential of/when it is known to exist. Suppose the functions
3l 2 —F > 31 and %— f
> % are differentiable and that
F(x,f(x)) =
for every x in the domain of/. Then the chain rule applied to F(x,f(x))
yields
Fx (x,f(x)) + Fv (x,f(x))f'(x) = 0.
.,
Hence,
= ^,/W)^o.
4.1 /'(*>
-FT^v
Fy(x,f(x))
lf
suppose that x and j are differentiable functions of z, that is, the function
defined implicitly by Equations (3) is of the form (x, y) — f{z). To
compute dxjdz and dyjdz, we apply the chain rule to the given equations
to get
2x^ + 2^ + 2z = 0,
dz dz
dx dy
yz— + xz— + xy = 0.
dz dz
These new equations can be solved for dxjdz and dyjdz. The solution is
which is the matrix/'(z). Notice that the corresponding values for x and y
have to be known to make the formula completely explicit. That is, from
the information given so far, there is no possible way of evaluating
{dxjdz)(\). On the other hand, given the point (x, y, z) = (1, —2, 1), we
have {dxjdz){\) = — 1. The reason is that, just as in Example 1, there is
more than one function/defined implicitly by Equations (3). By specifying
a particular point on its graph, we determine /uniquely in the vicinity of
the point.
Example 4. Consider
xu + yv + zw = 1
x-\-y-\-z-\-u-\-v-\-w = §,
xy + zuv + w — 1
-
— +u — + w — +
dx dy 3z
u
,
z = 0,
dw dw dw
dw dw dw
dx By dz
y— + x-^ + ui>— + 1 =
,
0.
ow ow aw
Then, solving for dx/dw gives
9x uu
2
+ xz + — vv zwt; -XW-D
9w 2
w y + vy + wx — >'\v — mx — uu
2
Similarly, we could solve for dyjdw and dzjdw. To find partials with respect
to u, differentiate the original equations with respect to u and solve for
dx/du,'dyjdu, and dzjdu. Partials with respect to v are found by the same
method.
(Fiiu, v,x,y)
F(u, v, x, y) = I
\F2 (u,v,x,y)
and that the equations
9F, 3F, dx
-+ — dy =
dF,1
-^-
„
0, — + —dx —
dF,
i
dx
+— —=
3F,1dy dF,
l
0,
du dx du dy du dv dy dv dv
d_Fj ,
3F_2 dx dFi dy _ dF 2 dF 2 dx dF 2 dy
du dx du dy du dv dx dv dy dv
306 Vector Calculus Chap. 4
f
d£i 9fi\ /dFi dFA /dx dx}
du dv I / dx dy \ I du dv
dF 2 j \dF 2 dF 2 ]\dy dy
dv I \ dx dy I Xdu dv)
The last matrix on the right is the matrix of the differential of/ at (w, v).
1
'dx dx\ IdFx dFX- /dl\ dF\\
du dv I / dx dy \ I du dv
(6)
dy_ dy J \ d_F\ dFj,
J \ dFj dF\
\du dv/ \dx dy I \du dv
To be able to solve Equation (5) uniquely for the matrix /'(w, v), it is
essential that the inverse matrix that appears in Equation (6) exist. This
implies, in particular, that the number of equations originally given equals
the number of variables implicitly determined or, equivalently, that the
range spaces ofF and /must have the same dimension.
The analog of Equation (6) holds for an arbitrary number of coordinate
functions F and is proved in exactly the same way. We can summarize
t
4.2 Theorem
If 3i
n+m -^> %m and 31" -U 'Ji
m are differentiable, and if y =/(x)
satisfies .F(x, y) = 0, then
/'(x) = -F7\x,f(x))Fx(x,f(x)),
provided Fy has an inverse. The derivative Fy is computed with x
held fixed, and Fx is computed with y held fixed.
xz + yz
and that we choose x = x, y — (y, z). Then
(2xy + z
Sec. 4 Implicitly Defined Functions 307
and
2
(jc x
z x+y
Newton's method provides, in many cases, an effective way of com-
puting an approximate value for an implicitly defined function. Suppose
that/(x) satisfies the = in some neighborhood of
equation F(x,f(x))
x = x . To compute /(x we need to solve the equation F(x y) =
) ,
for y. We apply Newton's method to the function g(y) = F(x y). The ,
Of course, some choice for y has to be made using more detailed in-
formation about F.
yi =h
2
(i) 1
y2 =
i + 2(i) 15
2
(tV) i
y3 =
1 + 2(tb) 255
2
(-2) 4
yi =
1 + 2(-2) 3
y2
M) 2
_ 16
l+2(-i) 15'
2
(-II) _256
^ ~ + 2(-xf) _
1 255
.
(x 2y + z
F(x, y, z) =
'
x +yz 2
x2 1
F{v.z)(x,y,z) =
\lyz y'
so
1 1
Fiv . z)
(i,y,z) =
K
2yz f
Computing the inverse matrix* by the formula
'a 6\-i ( d -b
i
\c d! a <t - be \_ c a
gives
L—
[iWL ^> z)]" = 1--2yz 1 1 / / -1
y \-2_yz 1
1 - 2y'z 4
to the solution.
:
EXERCISES
©if
x 2y + yz = 0, xyz +1=0,
find dxjdz and dyjdz at (x, y, z) = (1, 1, — 1).
dx 1 dy _3
Ans. =
Jz 2 '
rfz 2
x— j + 2k + t; = 0,
dx dy
—
OU
and +1
Oli
at (x, y, w, t>, w) = (1, -1, 1, 1, -1).
dx dy
Ans. — =0, -~-
3«
= 1.
3w
fit)
which satisfies
/"(!) =
-1
Find the tangent line to /at f = 1. ,4/75. /
dF dF
(x -x ) — (x ,
y , z ) + (y yo) iz
(x °' y°> Zo)
dF
+ (z - z ) — (x ,
y , z ) =
is the equation for this tangent plane.
9. The equations
2x+y + 2z + u-v-l =
xy + z — u + 2v — 1 =0
yz + xz + u2 + v =
near (x, y, z, u, v) = (1,1, — 1, 1, 1) define x,j, and z as functions of uandv.
(a) Find the matrix of the differential of the implicitly defined function
fx(u, v)\
K
z(u, v)J
SECTION 5
CURVILINEAR Formulas that occur in mathematics and its applications can often be
COORDINATES simplified bygood descriptions of the quantities to be singled out for special
attention. Since in practice these quantities are usually represented by
vectors whose entries are real-number coordinates, the problem can be
viewed as one of choosing the most useful system of coordinates. Thus
we consider introducing coordinates in 31" different from the natural
coordinates x t
that appear in the designation of a typical point (xl3 . .
.
, x n ).
Specifically, to each point (x l5 . . . , xn ) there will be assigned a new
w-tuple («!, u„). Clearly, if we are to be able to switch back and forth
. . . ,
from one set of coordinates to the other, the assignment described above
must be one-to-one, that is, for each (xlt x n ) there should be just . . . ,
assignment is often made for some specific subregion of 31" rather than
for the whole space. The vector space of new coordinates (ux ,un ) , . . .
C
will be denoted by IL" to avoid confusion with 31", whose points
(x lt . . . , xn ) are being assigned the new coordinates.
denoted by U 2
. The function W—
> % 2
denned by
x
\
312 Vector Calculus Chap. 4
x > 0,
is the inverse of the restriction of Tto the region <r< 00, — tt/2 < 6 <
77-/2. Similarly, the function defined by
A/7T7\
arccott -
x ], y>0,
I
J
is the inverse of the restriction ofrtoO<r< oo,O<0<7r.
We have not defined polar coordinates for the origin of the xy-plane
simply because
^0 cos 0\ /0\
= for all 6,
sin 0/ \0/
and so the one-to-one requirement fails at the origin. This fact causes no
real difficulty. For example, the equation in rectangular coordinates of
the lemniscate,
0c 2 +/) = 2
2(x 2 -j 2
), (2)
r
2
= 2 cos 20, r > 0. (3)
The image under T of the set of pairs (r, 0) that satisfy Equation (3) is
precisely the set of pairs (x,y) that satisfy Equation (2), except for the
origin. We may simply fill in this one point. See Fig. 9.
Sec. 5 Curvilinear Coordinates 313
Figure 9
-axis
-
2tt
1 1 1
<i> -axis
(r,<f>,e)
314 Vector Calculus Chap. 4
(f>
and are sometimes interchanged, particularly in physical applications.
We can compute an explicit expression for the inverse function, which
we denote by T~l by , solving the equations
x = r sin (f>
cos 6,
y = r sin <f>
sin 6,
z = r cos (f),
Vx 2 + /+ z
2
z
arccos x
2
+ y
2
> 0.
Vx 2 + r+
Vx 2 + /
Since the range of arccos (the principal branch) is the interval < d < 77,
Figure 11
Sec. 5 Curvilinear Coordinates 315
x
2
+ / + z2 = 1, x
2
+ v
2
>
z =—
V2
Vx + f + 2
z *, z >
V3x, x> 0,
respectively.
6= -r(r varies)
4
r = 2 (6 varies)
6= — jg-(r vanes)
Figure 12
assumed that for some open subset N in the domain of T, the restriction of
T to N is one-to-one and therefore has an inverse T' 1 . The curvilinear
316 Vector Calculus Chap. 4
= t-A
x l I r cos 6
17 \ r sin
and
f
x\ / r sin </> cos Q\
K
zJ \ rcos<j> /
cos 6 —r sin 6\
(5)
sin r cos B)
and
cos (f>
—rs'mcf) /
respectively. The matrices (5) and (6), and more generally the Jacobian
matrices of differentiable coordinate transformations, have a simple
geometric interpretation. Each column of the Jacobian is obtained by
differentiation of the coordinate functions with respect to a single variable,
while holding the other variables fixed. This means that they'th column of
the matrix represents a tangent vector to the curvilinear coordinate curve
for which they'th coordinate is allowed to vary. That is, let the coordinate
t
transformation be given by lL" —
column of the matrix
:ii". Then they'th
only they'th coordinate of u to vary. Tangent vectors are shown (with their
Sec. 5 Curvilinear Coordinates 317
'2 Vl)
y
318 Vector Calculus Chap. 4
Figure 14
3
Example 3. {Cylindrical coordinates in 'J\ .) The coordinate transfor-
mation is defined by
f
x\ Ir cos 6^
'cos 6 —r sin 6 N
sin 6 r cos d
Figure 15
. .
d(x, y, z)
d(r, 6, z)
result is that the matrix H, whose columns are the rectangular coordinates
of 1/AiCi, l//z„c n is an orthogonal matrix whose inverse is equal to
. . . , ,
its transpose.
h3 = r sin (f>,
so that the matrix H is given by
'sin <f>
cos 6 cos (f>
cos 6 — sin 6 s
_1
// = | cos (f)
cos 6 cos </> sin —sin </> |
-sin cos
c i- c i = gn> i,j=l,...,n.
Since c t
• c; — c,- • cu we have g (j = gH . If the vectors c l5 . . . , c n are
orthogonal, as is often the case in practice, then only the inner products
Cj • c t
= gu will be different from zero. A number of important formulas
can be expressed in curvilinear coordinates entirely in terms of the
functions g i} and without explicit reference to the particular curvilinear
coordinate functions being used.
In the .vy-plane suppose curvilinear coordinates are given by
x\ (x(u, v)
yl \y(u, v)
320 Vector Calculus Chap. 4
whence
gu +
[dul \du)'
dx dx dy dy
gl2 = g 21 =
du dv du dv
v(t)
(x(u(t), v(t))\
f(t) = a < < t b,
y(u(t), v(t))
!
'dx} dx du dx djv
dt du dt dv dt
dt dy dy du dy dv
J \
\dt I \du dt dv dtl
dx dx dy dy} du dv
+ 2
,
du dv du dv_ dt dt
+
[(IHDW
——
du dv
(Mi 2gi2
dt dt ffi
Sec. 5 Curvilinear Coordinates 321
/ » du, du A1/2
,\
1 2'
(7)
\i,i=i dt dt 1
dt
Ir cos 6^
\yl \rsin
The Jacobian matrix is
''cos B —rsind^
Vsin 6 r cos dl
Hence
= 1, fa = gn = 0,
lr{B) cos 0^
A curve f{6) = )
has a tangent vector dfjdB of length
\r(8) sin 6 t
((IM
Example 5. If (r, <f>,
3
6) are spherical coordinates in Jl , the length of
a curve y defined by an equation
r\ /r(/)\
= I <f>(t)
I
, a <t <b,
Wv
can be computed from Equation (7). The derivative of the coordinate
transformation
'x\ Ir sin <f>
cos 6^
y I = I r sin <f>
sin
\ rcos(f>
322 Vector Calculus Chap. 4
is the matrix
sin <f>
sin d r cos (/> sin r sin </S cos 6
The function g i3
- is the inner product of the z'th and the/th column of the
preceding matrix. Hence
(
gll git #13
L #22 £23
< r <-
2
has length
\0/ W
i. r /i
/(A)= Vl+sin 2 frff.
C"'~ - C" n
VI
1
+ sin
2
tdt = yj2\
1
VI - \ sin
2
d dO.
Jo Jo
EXERCISES
1. Sketch the three curves given below in polar coordinates.
3
2. In Si , sketch the curves and surfaces given below in spherical coordinates.
^sin A
< <-,
2' t
t
J
3
5. Let (r, <f>, 6) be spherical coordinates in Si. . The equation
determines a curve in 3l 3 (as well as in the r<f>0-space "U 3 ). Sketch the curve
in H3 . [Suggestion. The curve lies on a sphere.]
6. Prove that the Jacobian matrices (5) and (6) of the polar and spherical
coordinate transformations given in Examples 2 and 3 have inverses.
7. The equations
x = ar sin <£ cos 0\
z = cr cos <j>
2
t
3
determines a curve in Si. . Compute the cartesian components of the tangent
vector to the curve.
\ ,
(x\ I ar cos 6
a > 0, b > 0.
br sin
12. Verify the assertion made in the text that the y'th column of the Jacobian
matrix at u of a coordinate transformation 1L" —T > 31" is a tangent vector
at x = r(u ) to the curvilinear coordinate curve obtained by letting they'th
coordinate of u vary.
13., Show that if/(x, y) = /(/, 6), where (x, y) = (r cos 6, r sin 0), then
a2 / a2 / a2 / i a2 / 1 a/
a* 2 ~ "a^ 2 ~ a/2 ~ 72 ae 2
_r"
7 Tr '
14. (a) Find the formula for the arc length of a curve determined in plane polar
^ coordinates by an equation of the form
jr(t))
a < <b. t
15. An equation
fu\ iu(t)\
a<t<b,
3
determines a curve y on the conical surface in 3l
1
u cos r sin a\
\ (0 <M < 00,
u sin v sin a I
where a is fixed, < a < 77/2. Find the general formula for the arc length
of y.
" dx k dx k
(a) Show that a?,, = > -r- -=— for , /,
/'
= 1, . . . , n.
fc=l 9«£ 9«;
Sec. 6 Inverse and Implicit Function Theorems 325
ij
(b) If (g ) is the matrix inverse to (go), show that
k=i Sx k dx k
SECTION 6
If a function/is thought of as sending vectors x into vectors y in the range INVERSE AND
of/, then it is natural to start with y and ask what vector or vectors x IMPLICIT FUNCTION
are sent by /into y. More particularly, we may ask if there is a function THEOREMS
-1
that reverses the action of/ If there is a function/ with the property
/
_1
(y) = x if and only if /(x) = y,
-1
then/ is called the inverse function of/. It follows that the domain of
/
-1
is the range of/ and that the range of/ -1 is the domain of/ Some
familiar examples of functions and their inverses are
/(*) = *\
326 Vector Calculus Chap. 4
'x\ /4 5\ Ix
y\ = 1 -6 j
,z/ \3 4/\z
It is obvious that any affine function A(x) = L(\ — x ) + y is one-to-one
if and only if the linear function L is one-to-one also. In this example,
and L(x) =
can be computed to be
That this is the correct expression for A' 1 may be checked by substituting
^(x) for y. We get
A-^Aix)) = L-^LCx - x )) + x = x.
Hence
— —
Sec. 6 Inverse and Implicit Function Theorems 327
Obviously this method will enable us to find the inverse of any affine
transformation 31" -^-> %n if the inverse exists.
linear vector functions. Given a function 31" > % n one may ask: (1)
Does it have an inverse? and (2) If it does, what are its properties? In
general it is not easy to answer these questions just by looking at the
function. On the other hand, we do know how to tell whether or not an
affine transformation has an inverse and, what is more, how to compute it
explicitly when it does exist. Furthermore, iff is differentiable at a point
x it can be approximated near that point by an affine transformation A.
,
For this reason, one might conjecture that if the domain of/is restricted
to points close to x then/will have an inverse if A does. In addition, one
,
[f-'Yiyo) = [/'(xo)]-
1
,
The existence of/ -1 is proved in the Appendix. Once the existence has
been established, we can write where 51" >% n is thef~ °/=
x
/,
Figure 16
interval is one-one. For suppose x x and x 2 are any two points in the
interval such that xx < x 2 By . the mean-value theorem it follows that
/(*»)-/(*!)
/'(-Xo),
for some x in the interval xx < x < x2 . Since/'(x ) > 0, and x 2 — xx >
0, we obtain
/(*,) -f(Xl ) > 0.
Thus, /is strictly increasing in the interval a < x < b, and our contention
is proved. It follows that /restricted to this interval has an inverse. The
other conclusions of the inverse function theorem can also be obtained in
a straightforward way for this special case.
x
3
— 2xy 00 < X < 00,
-1
the differential dx /is defined by the Jacobian matrix
'3x 2 -2y 2
-4xy\ (\ 4
1 1
Sec. 6 Inverse and Implicit Function Theorems 329
y =/(Xo) =
= y +/'(x )(x - x ).
= x + [/'(x )]- 1 (y - y ).
lu\
Hence, if we set y = I
\v
u\
+
i
u
+
u — x 4_y + x, v = x +y z
(4x3y + 1 x4 \ 15 1
2
1 3^ /( X>y)=(lil) \1 3
330 Vector Calculus Chap. 4
Since the columns of this matrix are independent, the differential has an
inverse, and according to the inverse function theorem the transformation
has an inverse also, in an open neighborhood of (x,y) = (1, 1). The
inverse transformation would be given by equations of the form
(2,2) jEftJV
'f
au ov 1/5 1
f
\ou
(2,2) ?W
ov
Suppose %n — 3i
n
is a function for which the hypotheses of the
inverse function theorem are satisfied at some point x . It is important to
realize that the theorem does not settle the question of the existence of an
inverse for the whole function/, but only for/ restricted to some open set
containing x For example, the transformation
.
x\ III cos v\
= . , 0<u,
y) \ u sin vj
u u I
The inverse matrix exists for all (it, v) satisfying it > 0. However, the
otherwise unrestricted transformation clearly has no inverse, for the same
image point is obtained whenever v increases by 277. Two corresponding
regions are shown in Fig. 17. If the transformation is restricted so that,
for instance, < v <becomes one-to-one and has an inverse.
2tt, then it
In Section 4 we considered
problem of finding derivatives of an
the
implicitly defined function / under the assumption that / satisfied an
equation F(x,/(x)) = 0, with both / and F differentiable. We saw that
to solve for/'(x ) by matrix methods it was necessary for Fy (x ,/(x )) to
have an inverse. It is natural that the same condition occur in the next
Sec. 6 Inverse and Implicit Function Theorems 331
9_7T
lit
Figure 17
1. F(x o ,y ) = 0.
2. Fy (x , y ) has an inverse.
1 + 3/U = 4^0.
332 Vector Calculus Chap. 4
3/1 1 / x
iU
+ 27 3/1 _ 1 x™ + 27 |
x xV 27 Vx xV 27 '
xyzw — 1
Let x = I and y = I I •
Then
l-z 3 - w2 + 2
F<*o, y) =
\ zw - 1
-3z
2
2w\ (-3 -2
w z /Q=(l)
" \ 1 1
1 -2
1 3
EXERCISES
1. Can a function have two different inverses?
-1
4. Compute /I for the following affine functions:
"
(a) A(x) = Ix + 2.
1 3 H - 1
(b) ^
2 4 v -2
jc - 3
*
/l«.y. /4
7 -4
Usjngthe inverse function theorem, show that the following functions have
*
x
inverses wnen"TestriclelTTo~5rjtfrie'lJ"pen set containing .
(c) y 7x + 6, x = 4.
jr — j'
6. Let/
2xy
-1
(c) If/ is the inverse of/ in a neighborhood of the point /
J,
compute
x
the affine transformation that approximates close to \2)
f~
-3
Ans.
"
7. Find the affine function that best approximates the inverse of the function
l
( \
near the point / Notice that to find the precise inverse would be
W
I.
difficult.
I 3 3 \ /
Ans.
f r sin <f>
cos 8
r sin <f>
sin 6
r cos <f>
T
vl \yl \g( x y)
>
(h(u, v)
--
S
y] \v] \k(u, v)
3 5\ dh
find —
dv
(3,4). [Ans. -5.]
4 7/
—
10. If
11. Let
/h\ /h 2 + h 2 i; + Kb
2
\v) \ U + V
-1
(a) Show that /has an inverse/ in the vicinity of the point
/11.8
-1
(b) Find an approximate value of/ I
/ f{x,y,z)
F{x,y,z) = I g{x,y,z)
14. Although the condition that the differential dx f have an inverse is needed
for the proof of the inverse function theorem, it is perfectly possible for
this condition to fail even though an inverse exists. Verify this fact with the
example /(jc) = x3 .
(X l
- + x 2o sin -, if x + 0.
2
/(*)=
lo, if x = 0.
336 Vector Calculus Chap. 4
Show that rf /is one-to-one but that/ has no inverse in the vicinity of x = 0.
What does the example show about the inverse function theorem?
if
/(So).
(a) If
yls . . .
, yn be a basis for the range of dx f.
Extend to a basis
v i> • • •
, y«, y n+ \, • • , y m for all of &'", and define R m -£->• Jl" by
Show
GH
that {g o /)and dXo (g ° /) = (df(X(j) g)
= («!, . . . , a n ).
° (dx
J) satisfy the condition
of the inverse function theorem.]
23. Prove that under the assumptions of Theorem 6.2 there is only one function
/defined by F(x,f(xj) =
neighborhood of x and satisfying/(x ) = y
in a .
SECTION 7
While explicit, implicit, and parametric representations of surfaces have SURFACES AND
so far been used for illustrative purposes, a precise definition of the term TANGENTS
"surface" has not been given. In this section we shall define a smooth
surface in terms of each of the three representations, give a unified
definition of tangent for each mode of representation, and show how the
three are related. In particular, we shall see that for each of the three
ways of representing a surface —as an image, as a graph, or as a level set —
representation for a tangent is obtained by taking the image, graph, or
level set of the affine approximation to the given function.
An /7-dimensional plane in 3l m is either an ^-dimensional linear sub-
space (that is, the set spanned by n linearly independent vectors) or else an
affine subspace (that is, the translation of a linear subspace by a fixed
vector y ). When n = 1 or n = 2 we get a line or an ordinary plane. A
parametric representation of an /?-dimensional plane in 3i m is obtained by
looking at the range of an affine function %n — >'Ji
m
, where A(x) =
L(x) +
y and L is a linear function defined on 31". To ensure that the
,
'cos v — sin v Q
sin v u cos v
y= u
Z — V,
we obtain
x = — 2z + 77
as the equation that implicitly defines the tangent plane to S at (0, 2, 77/2).
Figure 18
ft'
fx\
7(0= \y I
= I
r \.
/ 1
3| +(t-tM2t
2
\3t /
l
1°'
= t\ 2t \ I — I t\ |
\3d/ \2tl
340 Vector Calculus Chap. 4
Figure 19 Figure 20
Since (1, 2r , 3/q) ^ 0, it follows that the range of A is the tangent line to
the curve at /"(/„). Figure 19 shows the curve and the tangent line to it at
/(I) = (1,1,1)-
7.1 Theorem
f(u, v) — I
u sin u |
, for (w, v) in Jl 2 ,
-
ftp. »)
has continuous entries. The range of/ is a cone shown in Fig. 20. Points
of the cone not at the vertex correspond to values of u ^ 0, and it is easy
to check that for u 7^ the columns of/'(w, v) are linearly independent.
Thus the tangent plane at such a point has the expected dimension,
namely 2. However, at the vertex, u = 0. Therefore, /'(0, v) has only one
nonzero column. Thus any attempt to use the affine approximation to/
to define a tangent at the vertex leads to a 1-dimensional tangent. Indeed,
it seems natural to say that the cone has no tangent at its vertex. However,
the cone satisfies the definition of smooth surface at every other point.
The lack of smoothness at the vertex is not associated with a lack of
differentiability in/, but rather with the failure of the tangent to exist.
the form (x,/(x)), for all x in the domain of/ If/ is differentiable
at x , the affine function
7.2 Theorem
A(x) = L(x) + r
(x\
y/9 - X2 -r
(2,1,2)
Figure 21
Sec. 7 Surfaces and Tangents 343
f
\ x/ 9
-*
_ x 2_ };
2
-y
)
^9 _ x a _ y^,^
= U
\
-iv
2/
The tangent plane to the hemisphere at (2, 1 , 2) is the graph of the approxi-
mating affine function
(- -9C:3
9
= x — - y.1
2 2
2x + + _y 2z = 9.
metrically by a function Jl" —^->- 3l n+m of the form g(x) = (x,/(x)). This
raises the question of whether the sets 5 that can be represented in both
ways have the same tangents and of whether the notion of smooth surface
is the same in both representations. It is clear that g is continuously
\/ ( x o)
and this shows that the graph of the affine approximation to/is the same as
the range of the affine approximation to g. Hence the two definitions of
tangent and of smooth surface are the same where they overlap.
We recall that the null space of a linear function 3i n+m > 3i m is a —
subspace of 3i n+m Then, for a fixed vector x the set of all x such that
. ,
x —
x is in the null space of L is a plane in 3i n+m Clearly, the plane passes .
Consider a function Ji
n+m — > %m and a fixed vector z in %m . Let
5 be the level set defined by the equation F(x) = z . If F is differ-
x = (x ,y ,z ) = (2, 1,2)
(2x 2v 2z ) = (4 2 4).
(4 2 4)1 y- 1 =0.
—F > 3i m
does not assign a tangent to the level set F (x) = z at x = x . This is
similar to the complication that occurs in the parametric theory and that
gave rise to Theorem 7.1. In the present case we need to know that n is
the dimension of the null space of the linear transformation with matrix
F'(x ). Since the dimension of the null space is equal to the dimension of
thedomain minus the dimension of the range, we want the dimension of the
range to be m. (Theorem 4.7, Chapter 2.) Hence
Sec. 7 Surfaces and Tangents 345
7.3 Theorem
Let Jl"+" !
—^'J{'" be differentiable. Then the tangent to the level
set F(\) = z at x exists (and has dimension n) if and only if F'(x )
F'(x, y) = (3x
2
- 3y, 3/ - 3.x).
The one point at which the differential fails to assign a tangent is (x, y) —
(0, 0). There the null space of the differential has dimension 2.
/ 3f _3r_\
3t
fit) = 1 < < t 00
ll + J
^'l
f 1 + ry'
is a parametrization of the part of the curve that lies in the first and second
quadrants, and it assigns the curve a horizontal tangent at the origin.
Interchanging the coordinate functions off gives a parametrization of the
part of the curve in thefirst and fourth quadrants.
If 'Ji
n+ —
m F >- % m is continuously differentiable and its level set F(\) =
has an implicitly determined tangent at a point x , then according to
Theorem 7.3 the m-by-(n +
m) matrix -F'(x ) has m linearly independent
columns. Denote the variables corresponding to these columns by the
vector y and observe that the implicit-function theorem applies. The
conclusion is that, writing x = (v, y), there is a continuously differentiable
function 31™ — > tfl
m satisfying F(v,/(v)) = in some neighborhood of x .
the level set F(x) = can be represented as the graph of the function/. It is
routine to show that we get the same tangent by using the explicit or the
implicit representation, and we leave the computation as Exercise 12.
346 Vector Calculus Chap. 4
EXERCISES
1. Find a parametric representation tx 1 + x 2 for the tangent line to each of the
curves defined parametrically by the following functions at the points
indicated. Sketch the curve and the tangent line.
(b) g(t) = y
2. Find the tangent plane to each of the surfaces defined parametrically by the
following functions at the points indicated. Sketch the surface and the
tangent plane in (a) and (b).
< u <2
(a)/
< v <2 ^/
Ans.
3. Find the tangent plane or tangent line to each of the following explicitly
defined curves and surfaces at the points indicated.
'1
4. Find the tangent line or tangent plane to each of the following implicitly
defined curves and surfaces at the points indicated. Sketch the curve or
surface and the tangent in (b), (d), and (e).
T +^+z
2
(b) l,atx = (l>0>-y)-
Ans. - + V3 z = 2.
2
(
'2
d ) -5 +
/ = l.atUo.J'o) (4--).
15
x* + f
, atx
x +y
Ans.
5. In each of Exercises 1(a), 2(a), 2(b), and 4(b), find a normal to the given
curve or surface at the point indicated.
Ans. 1(a)
6. Each of the following curves and surfaces fails, according to our definitions,
to have a tangent line or plane at the indicated point. Why?
(a) /(/)
(b) g{t)
(c)/
348 Vector Calculus Chap. 4
7. Find all points at which the surface defined parametrically by the function
,2„2
«t + 1
8. Different vector functions can define the same curve or surface. Show that
the functions
Is
1 - 1 2s \
f{t) = I
*" I, -oo < t < 00,
(a) What curve and what surface are parametrically defined by / and g,
respectively?
(b) Show that according to our definition the curve does not have a tangent
line and the surface does not have a tangent plane at (0, 0, 0).
10. Let
fit) = I .),
-oo < r < co.
(a) Show that the curve in Jl 3 defined explicitly by /has a tangent line at
every point.
(b) Show that the curve in &'- defined parametrically by / fails to have a
tangent line at one point.
Sec. 7 Surfaces and Tangents 349
(c) Interpret (a) and (b) geometrically. What is the relation between the
tangent line in (a) and in (b)?
11. Let y = lie in the range of a function &n —F Rm . The surface S defined
implicitly by the equation F(x) = is assumed to have a tangent "B at x .
n+ m
(a) Check that the surface S' in R defined explicitly by F has a tangent
T?' at (x y ).
,
13. Verify that the two parametrically defined tangents at the origin that are
described in Example 6 of the text are horizontal and vertical, respectively.
SECTIOxN 1
EXTREME VALUES The problem of finding the maximum and minimum values of a real-
valued function of several variables is important in many branches of
applied mathematics, as well as in pure mathematics. Familiar examples
are extremes of temperature, speed, or economic profit, each of which
may be a function of more than one variable in a practical problem.
A real-valued function / has an absolute maximum value at x if, for
all x in the domain of/,
f(x) </(x ),
/(x ) </(x).
350
Sec. 1 Extreme Values 351
fi(xo)=tt(yo) = 0.
Since
/2O0) = — (*o, y ),
dy
and so the only extreme value of/ in the interior of the ellipse occurs at
( x o>yo)
= (0, 0). It is obvious from the graph of/, shown in Fig. 1, that
< °>
' V2 '
(1, 0, 0)
Figure 1
this value is a minimum. We next consider the values of/on the boundary
curve itself. The ellipse can be defined parametrically by the function
Thus, the values of/on the ellipse are given as the values of the composition
f° g. Any extreme values of/ on the ellipse will be extreme for/° g. The
.
/ —sin / \
= — I sin 2/.
Extreme values therefore may occur at / = 0, 77/2, tt, and 37r/2. The cor-
responding values of (a\ v) are (1,0), (0, I/V2), (-1, 0), and (0, -1/V2),
and those off are and \, respectively. We see that the absolute
1. \, 1,
f°g that occur at t = tt\2 and 377/2 are not extreme for/ as can be seen
by looking at Fig. 1
The methods used in the preceding example are valid in any number of
dimensions. The next theorem is the principal criterion used in this
1.1 Theorem
^ /(x + t u) - /(x )
—
df
(x ) =/(x )u.
du
Sec. 1 Extreme Values 353
Therefore,
- /(Xo)
0<lim /(Xo +
fU)
= r(x )u,
?->o+ t
n ^ hm /(Xp -
< i-
tu) - f(x )
=/ (xo)(— u) = —/ (x„)u.
,
<-o+ t
(x ) =/'(x )u,
du
and that the derivative v/ith respect to u measures the rate of change of/
in the direction of u. At an extreme point in the interior of the domain of/,
this rate should be zero in every direction. The importance of the theorem
is that of all the interior points x of the domain of/ we need to look for
extreme points only among those for which/'(x) = 0. Points x for which
/'(x) = are called critical points of/.
In practice we are often given a function /that is differentiable on an
open set and want to find the extreme points of/ when it is restricted to
some subset S of the domain of/ example the following two
In the next
remarks are x such that /'(x) =
illustrated: (1) a point is not necessarily
an extreme point for f; (2) / may have an extreme point x when restricted
to a set S without having /''(x) — 0.
F'(t) - 4/ 3 - 3/
2 - 1 = (t - 1)(4/
2
+ + / 1).
a- r- = 1 , z = 2
f
x\ /cosr
y\ = \
sin / 1 , < < t 2tt.
The function / on C takes the value F(t) = cos t -\- sin / + 2. We have
F'(t) = -sin / + cos t, so F'(t) = at / = it/4 and t = 5tt/4. Since
F"(tt/4) < Oand F"(5rrjA) > 0,
Figure 3
cylinder x2 +y — 2
1 =0 and the plane z — 2 = 0. When a specific
parametrization is not readily available, we can still sometimes apply the
method of Lagrange multipliers to be described below. The method
consists in verifying the pure existence of a parametric representation and
then deriving necessary conditions for there to be an extreme point for a
function /when restricted to the parametrized curve or surface.
G 1 (x1 ...,xn) ,
=
G2 (x 1 , ... ,xn) =
Gm (xx ...,xn ) ,
=
n
implicitly define a surface S in Jl , and that at a point x of S the
matrix G"(x has some m columns linearly independent.
If x is
)
A,„G,
*»-8«"vi»)-
In terms of the gradient, the Lagrange condition
x2 + y
2
- 1 = 0, z - 2 = 0. (3)
We write down
(x +y+ z) + ?n (x* +j - 2
1) + Uz - 2).
1 + 2X x x = 0, 1 + lk x y = 0, 1 + A2 = 0.
Sec. 1 Extreme Values 357
x, y, and z, we get
the first of these points and its minimum value, —v 2 + 2, at the other.
Notice that, while the A's are not needed in the final answer, it is necessary
to consider all values of the A's for which the equations can be satisfied.
x -y+ z+ k(x 2 +y + 2
z2 - 1)
V3
A=± T , * = -, = z = T -. 1
The maximum of/ occurs at (1/V3, — 1/\/3, 1/V3). The maximum value
is v 3.
n
2(x k - a*)
2
+ h(xi, . ,x n )
the equations
2(xx - a x) + X -f- (x 1; . .
. , x n) =
2(x„ - a„) + A —- (x ls . . . , xj = 0.
ox„
tPi - «i
(P)\
\p n — aJ
\dx„
Figure 4
We want to minimize the total area 27rx 2 + lirxy of the top, bottom, and
sides. We write
and look for critical points of/ We find that/. = 0,/„ = reduce to
2x +y + Xxy = 0, 2x + Ax2 - 0.
x -\-
y + z — =0 1 and + —z=
x _y
7 + A + ^ = 0, x + A + /M = 0, A — /* = 0.
The only point that satisfies these conditions, together with the condition
that it lie on S, is x = (J, £, \). We have V/(x ) = (£, J, 0), which is
perpendicular to 5. The unit vector u in the direction of V/(x ) is shown
in Fig. 5 with its initial point moved to x .
Figure 5
.
EXERCISES
1. Find the critical points of x 2 + Axy — y 2 — 8x — 6y. [Ans. (2, 1).]
2. Find the points at which the largest and smallest values are attained by the
following functions.
(d) 1/(jc
2
I
y
2
) in the region (x - 2)
2
+ y2 < I.
5. Find the maximum value of the function x(y f- z), given that x2 + y 2 = 1
and xz = 1 [Ans. f .]
6. Find the minimum value of x + y
2
, subject to the condition 2x 2 + y2 = 1.
[Ans. -IIVI]
£(a-, y) - at (x ,
y ).
8. A rectangular box with no top is to have surface area 32 square units. Find
the dimensions that will give it the maximum volume.
G(x) = has a tangent "E of dimension n -mat x and that the function/ ,
fix) = y ix - x,i
2
.
Find the point at which /attains its minimum and find the minimum value.
SECTION 2
Let F(x, y) =
x y, be the Euclidean dot-product of two vectors x and y
• QUADRATIC
in Jl". In addition to having the property F(x, x) > 0, the function F POLYNOMIALS
satisfies
n n
2=1 j=l
l\ /0
1 / 1
i /
In n \ n n
F(x, y) = Ffexfit, 2 y^A/ = 2 2 F e ^ *dXiy ( fi
\ 1 1 = 3=1 2 1
H*> y) = 2 a a x<y^ aa = a n- ( 4)
Q(x) =2 a^xjj, au = aH .
(5)
/a u a 12 ... a ln \ ly x
\a nl . . . a nn \ \yn \
or
F(x, y) = x*Ay = x • Ay.
principal diagonal.
In the matrix notation, (5) becomes
Q(x) = x*Ax,
Q{x) — x Ix
l
= x • x.
= x
2
+ Axy + y
2
,
( Xl x2 x3 x 4) I I = x\ + 2x\ + 3x
2
+ Ax
z=y
Figure 6
Qi(x, y,z) = (x —y — z) 2 = x2 +y + 2
z2 — 2xy — 2xz + 2yz
The polynomial
In the examples just given, we have seen illustrations of the fact that if a
quadratic polynomial can be written, say, in the diagonal form
Q(x, y, z) = a x x2 + a 2y 2 + o3 z 2 ,
Sec. 2 Quadratic Polynomials 365
then Q is positive definite if and only if all the coefficients a t are positive.
Furthermore, if some coefficients are negative or zero, it is possible to
determine regions for which Q is positive or negative. In the following
examples we consider one way in which a polynomial Q{x,y) can be
written in diagonal form.
F((x,y),(x',y'))=(x y)r JT
The general quadratic polynomial is then
Q(x, y) — ax 2
+ 2bxy + cy 2 .
Similarly, if c ^ 0,
2
<20x y) ,
= i c
2
ly + - x\ + (ac - b )x
2 2
.
(7)
u = x + _
y, v = Ox + y,
a
au
2
+ -(ac- b )v
2 2
.
Xl = (l,0), x2 = (~^.l)-
form as
1
u
v
\0
\yi \yi
Q{x) = 1
Figure 7
2.1 Theorem
<2( X ) = i xy \ + ... + xn yl
As a result,
Q{x k ) = Xk , fc=l,...,».
chosen in this way, let cUn_k be the subspace of 'J\ H consisting of all
vectors perpendicular to xl5 x k Let S"-*-1 be the unit sphere in . . . , .
Continue the process until n unit vectors have been chosen in this
way, each perpendicular to those already chosen.
The vectors x l5 x n clearly form an orthonormal basis for
. . . ,
Q'(x)y = 2F(x, y)
for any x and y in Jl". (See Exercise 10.) Hence, at the critical
point x ls
0=/'(x 1 )y = 2F(x 1 ,y)-2;.x 1 .y,
By restricting Q to
c
Un _ 1 the subspace of % n perpendicular to x x , we
t The set of all unit vectors in .ft" is an (n — l)-dimensional surface implicitly
defined by the equation |x| = 1. For this reason, we write the index n-lon 5n_1 .
368 Real Valued Functions Chap. 5
F(x 2 X*)
, = 0, k = 3,. . . , n.
F(Xi, xk) = 0, if i # k.
If an arbitrary vector x is written in terms of the basis x l5 . . . , x„
as x =yx + l 1 . . .
+ y„x„, we obtain
= I,F(xk ,xM=lQ(x*)yl-
This completes the proof.
2.2 Theorem
1=1
Q(x,y) = 3.x
2
+ 2xy + 3y
2
for some A. That is, for some ?., the vector (x, y) must satisfy
(3 - X)x + j =
.v + (3 - % =
in addition to jc
2
+y = 2
1. (A has been replaced by —A.) Nonzero
solutions to these equations will exist only if the columns of the matrix
-;. 1
1 3-
are dependent. Since dependence is equivalent to
;. 1
0,
3-;.
1 = 2: U,,)=( ± i T -L).
A = 4: (*,,)= ± X ± J.).
(
^/ ±
In terms of the new variables we have
Q(x) = Au + 2
2v 2 .
In Fig. 8, level curves of Q are shown in their relation to the new and to the
original basis vectors.
For some purposes it is unnecessary to compute the orthonormal basis
vectors x of Theorem 2.1 provided that the numbers A can be found. For
example, it is clear just from knowing the X k whether Q is positive definite
or not. The following theorem enables us to compute, or estimate, the Xk .
,
2.3 Theorem
Q(x) = x Ax,
l
where Q(x k ) = Xk . Then the numbers X k are the roots of the equation
Theorem 2.1, it is not necessary to know what they are in order to find
the X k The X k can be computed by solving Equation (9). Equation (9)
.
is called the characteristic equation of Q, and the roots X k are called char-
acteristic roots or eigenvalues. The next theorem provides another method
for computing the basis vectors x lt . xn. . , .
2.4 Theorem
(A - 4/)z* = 0. (10)
Then, with respect to this basis, Q has the diagonal form (8).
Sec. 2 Quadratic Polynomials 371
Proof (of Theorem 2.3). Suppose that the orthonormal basis vectors
xx , x„ that diagonalize Q are
. . . ,
»
Yn
A =:
bm
t
x = I and y =
g(x) = (ByyA(By)
= y (B AB)y
t t
= y Ay.
f
A- XI = B-'AB - XI
- B'\A - XI)B.
372 Real- Valued Functions Chap. 5
x = Cz
C AC l
= I I A(z 1 ... z„)
(i4Zi . . . Az„)
(AjZj . . . X n z n ).
l, if i=;\
ZjZj = Zj • Xj =
0, if i#j,
Sec. 2 Quadratic Polynomials 373
we get
'
A,
A,
aac
'0 \
^2 2
or
-A 3 + |A + i = 0.
The characteristic roots are A = 1, — \, —\. So there is an orthonormal
system of coordinates (w, v, w) with respect to which Q has the form
u2 — \v % — \w 2 .
To find the related basis vectors we look for the unit vector solutions of
the equations
-A \
-X
equation simply requires that the two remaining basis vectors lie in the
plane x +y+z= perpendicular to xv Then x 2 and x 3 can be chosen
0,
to be arbitrary perpendicular vectors in that plane, for example,
(1 -1
) and (1L 2).
374 Real- Valued Functions Chap. 5
(0, 0, c)
(0, 0, c)
a' t>- C
Two-sheeted hyperboloid
Figure 9
1
Sec. 2 Quadratic Polynomials 375
V"2
A- 4=
V2
376 Real- Valued Functions Chap. 5
Figure 10
Then
P(x) = F(x, x, . . . , x)
EXERCISES
1. By changing coordinates, write each of the following quadratic poly-
nomials as a sum of squares. In each problem exhibit an orthonormal basis
that does the job, and write the coordinate transformation.
(a) 3x 2
+ 2V2xy + Ay 2 .
Ans. 0(x)
Sec. 2 Quadratic Polynomials 377
(b) 3x 2 + 2V3xy + 5y
2
.
5
4/w. <2(x) = 2/r + 6r 2 ;x! =
(4 4H^)-]
(c) (x J)
:;)0
(d) 2x 2 - 5xy + 2/ - 2xz + 4z 2 - 2yz.
^HJ. Q(x)
2 2
(11 M
V3V2'3V2'3V2/' *3 ~
P
\2 '
2
3 '
_1\
V
(b) Find the maximum of the polynomial in 1(a), restricted to the circle
x2 +f = 3. [Ans. 15.]
[Ans. Neither.]
5. Sketch the level curves Q(x) = 1 and Q(x) = for each of the following
polynomials in 5t2 .
6. Sketch the level surfaces Q{\) = 1 and Q(x) = for the following poly-
3
nomials in Jl .
(a) x2 — xy -f
2
y + z2 . (b) x2 + xy. (c) x 2 — 2xy + y 2 — z2 .
Q(ax) = 2
a Q(x),
for every real number a.
8. Let Q be an arbitrary quadratic polynomial on Jl", and let Fbe its associated
symmetric bilinear function. Prove that
10. Let Q be an arbitrary quadratic polynomial on Jl", and let /"be its associated
symmetric bilinear function, that is, Q(x) = F(x, x). Prove that Q is a
differentiable vector function or, more explicitly, that
12. What follows illustrates the fact that the condition of symmetry on a bilinear
function can be obtained by averaging out the nonsymmetry. Let G be a
real-valued function defined for all pairs of vectors x and y and linear in each
variable (we do not assume symmetry). Show that the function F defined by
13. Let Q be a quadratic polynomial on R". Prove that there exists a basis
(x l7 . . . , x„) for Jl" such that, for any vector x = y1x 1 + . . . + y„x n ,
n
Q(x) = ^ hy\, witn ?-i =0,1, at -1.
Conclude that the above polynomial is positive definite if and only if the
three determinants are positive:
a \f
f b
380 Real- Valued Fund ions Chap. 5
definite, then there is an orthonormal basis for 31" with respect to which Q Y
SECTION 3
TAYLOR EXPANSIONS We begin by reviewing the definition and the simplest properties of the
Taylor expansion for functions of one variable. If f(x) has an Mh
derivative at x , its Taylor expansion of degree N about xQ is the poly-
nomial
v
/(*„) - |j/'(*«)(* - *o) + jj"(x )(x -x )
2
+... + ^f iN \x )(x - x )- .
The relation between / and its Taylor expansion can be expressed con-
veniently by the following integral remainder formula.
3.1 Theorem
/(*)=/(*„) + ^/ (x )(x-x
/
)
+ LY,
- N + RN
• • •
+ ^/ (-x )(.x x ) , (1)
where
/<A)(
Rs = l
-
P(x - A'-y Y, ( -
(0 dt - Vo)
'
(\x - N-Ht.
t)
S \x - x )\
j/ ){x
which is just the last term of the Taylor expansion. The first integral
can be integrated by parts to give
x- 2 LX - 1} -1}
1
f(x - - t) [f «) -f lN (x )] dt = RN_V
(JV-2)!J*
Jx
,
We therefore obtain
If we substitute the preceding equation into (1), we get (1) back again
with N replaced by N— 1. The induction is completed by noticing
that, for N = 1 , Equation (1) is just
is equal to its Taylor expansion of degree N. For if/is of degree N,f is '
common examples. It is only in the first one that we have equality. The
expansions are all about x = 0.
(!+*)» =!(?)*>.
*+*"!'
2
il-x) M ~o\ m-1
e
x
: 1 +-x + -x + ...+— x N 2
,
1!2! N\
log (1 - x): -x - ix
2
- ix - 3
... --x iV
, (2)
N
3 5 2 ^+i
x x
+——
x
sin x: x — '— ... + (—l) fc -
3! 5! (2k + 1)!
F(t)=f(tu + x ,tv+y ),
.
Figure 11
where u and v are held fixed. We attempt to find a Taylor expansion for F
about / = and, therefore, compute the successive derivatives of F at 0.
We find, by the chain rule,
F'(t) = u^-(tu + x , ty + y ) + v
f-
(tv + x , ty + y ),
ox oy
^"'(0) = u — 2
(x , y ) + 2uv —— (x , y ) + v* — l
(x ,
>- ).
dx dx By dy
^"'^.iJ^'^^'P' "** 1
N
where is the binomial coefficient NljkliN — k)\. Now replace u by
becomes, for / = 1,
Sec. 3 Taylor Expansions 383
fix, y) ~f(x ,
>' ) + — (— (x , y )(x - x ) + — (x„, y )(y - y )f
'
1 ! \ox dy
1 /r)
2
+ — T~ Oo, >'o)(x - x f +
(
f
2
2 ——
5.x
r)
2
dy
f
(
+ —
df
2
2 Oo, y )(y - y )
)
)
+ (y - y )
2
fv »(x y , )) (3)
jV
l /iv\
+
/x (0,0)=/,(0,0) = 0,
The graphs of/ and its second-degree Taylor expansion are shown in Fig.
12.
1 + ±(*2 + yl)
Figure 12
384 Real- Valued Functions Chap. 5
{
x
i + yB
dy>
applied to/and evaluated at x = (x ,y ) is by definition the first-degree
polynomial
\
X
YX + y Yy)
has been multiplied out according to the binomial expansion. The
operator is applied to/, and the partial derivatives are then evaluated at
x = (x ,
y ). Notice that x and y are the only variables that appear in
the preceding equation, since
(Xo)
j)a7a7^
is a constant for a fixed x . The Mh-degree Taylor expansion of/at x can
now be written
f(x , Jo) + J7
<4„ fix -x ,y- y + ) ...+ — <l£ fix - x y
,
-y ).
4.i>/(* - 1. y
- 1) = ((x - i)f + (y - Dj-T /
= (x-l) 3
^ (l,l)
dx 3
2
+ 3(x-l) (^-l) -4^(1,1)
2
dx dy*
+ (y-if^{(i,i). 3
dy
Sec. 3 Taylor Expansions 385
A
Example 3. When the polynomial (x 1 + x2 + . . . + x„) is multiplied
out, each term will consist of a constant times a factor of the form
x^x^" 2 Xnn where the nonnegative integers ki satisfy kx
. . . + . . . + kn = N.
The multinomial expansion has the form
N
(x 1 + ... + xn f= k
I
+...+k n =x\k 1
(
. . . kn!
)#...**•.
N
(v/Cj kj )
U, .
. .
. .
. k nl k1 . . . . kn .
This computation will be done later using Taylor's theorem (Theorem 3.2).
= 1 (
U 1
xl"
/f ( fll ,...,fl,).
The function dx /is exactly the same as the differential defined in Section 8
of Chapter 3. For completeness we can also define the 0th differential by
</(x)=/(x ).
Example 4. The second-degree Taylor expansion of e Xl+ • •
Xn
about
x = is
i+
ii(^ + - + -l-> + ii(^ + - +x»4)/
= + ^ (^i +
1 • • • + xn) + j (xi + 2x x x 2 + . . . + x\ + . . . + 2
x n ).
/Oo, >'o)
- — (— (x , y )(x - x ) + — (x , y )(y - y )j
dy
X Xn
f(x ,y ) + (d f)\
_
Ky ) \y yQ,
Let 'Ji"
—
* 'S\ have all derivatives of order iV continuous in a
neighborhood of x Let Tx (x — x ) be the Mh-degree Taylor
.
Then
(/(x) - Tv (x - x
..
hm ))
= 0,
x-*Xq |X Xq|
x y
2
+ x
3
+ / = -1 + ^((x - 1) + My + 1))
+ i (4(x - l)
2
+ 4(x - l)(y -f- 1) - 6(y + l)
2
)
+ ± (6(x - l)
3
+ 6(x - lf(y + 1) + 6(y + l)
3
)
= -l + (x - i) + My + i) + 2(x - i) 2
+ (x-l) + l) + (y+ l) 2
(>'
3
.
e
x+ v = +1 j
( x +^+_ ( x2 _|_ 2x> , _j_
ys)
+ _ > _
f
for a n x and y
It follows that
tends to (0, 0). According to Taylor's theorem, there is only one poly-
nomial of degree two having this property.
Example 7. Let/(x, y) = e
xu
sin (x + y)- Since
e" =1+ xy + - xV + Rx
and
sin (x + y) = (x + y) - j| (x + >0
3
+ K2 ,
we can multiply the expansions together, putting into the remainder all
388 Real- Valued Functions Chap. 5
f(x, y) = e
xv
sin (x + y) = (x + y) + x 2y + xy 2 - ^ (x + y)
3
+ R,
where R/\(x, y)\ 3 tends to zero as (x, y) tends to (0, 0). In other words, we
have found the third-degree Taylor expansion of e xy sin (x + y) about (0, 0).
In standard form the expansion looks like
=
jj
(* + y) + jj (--x + 3
3x y
2
+ 3x/ - /) + R.
(0,0) = 1.
dx dy dx dy
Then
2
e
cosx
=1+ (l - j+ R'(*)) + ^(l - + ^'Wj
|
+ k(i-| + *'(*))
Since /?(l — x 2 /2 + R'(xf) does not even tend to zero as x tends to zero,
we must proceed differently to find a Taylor expansion of eeoiX . We have
+ ^(-| + i?'(x))
= *(l-f) +*(*),
where R"(x)jx 2 tends to zero as x tends to zero. The coefficients can also be
found by direct computation of the derivatives of e cosx .
Sec. 3 Taylor Expansions 389
n
k=l 1 +
1
Xk
no
k=X
-**'+*(**»
2
where R'/Kxx, . . . , x n )\ tends to zero as (x u . . . , x n ) tends to zero.
= N about
Example 10. The Taylor expansion of/(x) {x± + . . . + xn )
xn = is
(xx + . .
+ xj* = h(*7T + • • • + ^rT'
.
N!\ ox l
ox Jo
N dNf
N
V I
»i+...+*»=iv\fe 1 . . . kh
EXERCISES
3
1. Find the third-degree Taylor expansion of (h + v)
4. Find the quadratic terms of the Taylor expansion of Xe x+V about (0, 0)
{x+v)
5. Find the quadratic terms of the Taylor expansion of esin about (0, 0).
.
xi+yi
6. If/(;t, y) = (x 2 +y 2,
)e , use a Taylor expansion of/ to compute
ay
M«5. 0.]
[Ans. 2(x +y + z) 2 .]
(d) A: = 4, x = (0, 0, 0),/(x, >', z) = x3 - 3xv 2 ;
4x/ + 6x 2y3 + ly 5 .
[Ans. 96xy 3 .]
SECTION 4
TAYLOR EXPANSIONS The tangent t>' to the graph of a function J{" > J{ at a point (x ,/(x )) —
AND EXTREME isfound by computing the first-degree Taylor expansion of/about x We .
VALUES now consider the question of whether or not the graph of/ crosses 73 at
(x ./(x )). The possibilities for a function of one variable are shown in
Fig. 13.
Sec. 4 Taylor Expansions and Extreme Values 391
Figure 13
which implies that the graph of/ lies above the tangent 75. Similarly, if
dx f(x — x ) is negative except for x = x the graph of/ lies below 75. ,
/will cross 75 at p .
It follows that if (1) holds for two vectors x x and x 2 not necessarily close to
,
f{x, y) = 2x 2 — xy — 3j
2 — 3x + ly
dljy = 4(x - l)
2
- 2(x - 1)0' - 1) - 6(y - l)
2
.
_ J
Trying x x
— (2, 1) and x 2 = (1, 2), we obtain
</(x 1 -x = 4 ) o.
</(*» - x ) = -6 < 0.
We conclude that the graph of/ crosses the tangent plane 73 at (1, 1,2)
and, consequently, that/has neither a local maximum nor minimum at x .
4.1 Theorem
Let :H" > 'M have all its second partial derivatives continuous
Notice that not all possible cases are covered by parts (a), (b), and (c).
It may happen, for example, that the second differential is zero somewhere
other than at x , but that it still does not change sign.
where
R
0.
x — *o |x — X
Under assumption (a), we must show that
\dlj(x - x ) +R>
in some neighborhood of x , excluding x itself. The homogeneity of
the quadratic polynomial d* /(see Equation (2)) implies that
dx f(x — x )
Ix — x„|
2
\|x - x |/
Since d% /is positive definite, its values for unit vectors are bounded
away from zero by a constant in > 0. (See Exercise 14, Section 2).
Now choose d > so that, for < |x — x < 6, |
\R\ m
<
~ "
|x - x |
2
4
It follows that
\dlj(x - x ) +R >^ |x - x |
2
- j |x - x |
2
>
which, according to Equation what we wanted to show.
(3), is
The proof of part (b) is same as the proof just
practically the
given. To prove part (c), suppose that d^o f(x 1 — x ) > and
^/(x, - x ) < 0. Set
= R
V \dlj(x t
—x + ) |x t -
(4)
|
Xi (0-x |
2
J
Since
R
o,
f-o \x (t) t
- x
itfollows that, for any nonzero / sufficiently small, the left side of
Equation (4) is positive if = and negative if i = 2. In other
/'
1
words, the graph off lies both above and below the tangent "G for
some values of x arbitrarily close to x This completes the proof. .
f{x,y) = (x 2 + />* -*
2 2
394 Real- 1 allied Functions Chap. 5
lias its critical points at (0,0). (0. 1). and (0. —1). This implies that the
tangents at these points are horizontal planes. The second-degree Taylor
expansions at the three points are. respectively,
/(0.0) -
/(0
Sec. 4 Taylor Expansions and Extreme Values 395
Example 4. If/Xx) = x sin 3 x, the first four terms of the Taylor expan-
sion at x — are identically zero, while
4fx =
4
24x .
ax 2 + 2bxy + cy
1
= (x y)
a,
396 Real- Valued Functions Chap. 5
gz = 2x+y = 0,
gy = 2y + x = 0,
gz = 2? = 0;
so the only critical point occurs at (0, 0, 0). Since dig = 2g, we can test g
itself for positive definiteness. We have
a f e
give any information, consider the next highest term of the Taylor expansion
that does give information.
(a)
,
for all .v and all integers k. For this reason it is possible to restrict atten-
tion to those x's lying in some fixed interval of length 2tt, say, — 77 x <
77. We shall compute some examples of Fourier approximations to get
some idea of how they work.
Then
1
x\ cos kx dx, - Pi|x| ,
sin
•
kx dx.
77 J-ir 77 J-n
Clearly, kx has integral zero over [—77, 77], because the integrals
\x\ sin
over [
— 77,
and [0, 77] are negatives of one another. Hence b k =
0] for
k = 1,2,.... On the other hand, the graph of |x| cos kx is symmetric
about the y-axis. For k =£ we integrate by parts, getting
x cos kx dx
x sin kx'
—2
77/c
r sin kx dx
Jo
/
cos kx = (COS k: I)
rk' -k 2
(> - k = 2, 4, 6,
-~Lz,k = 1,3, 5,
77A.'"
When /c = 0, we have
x dx
To summarize.
k = 2, 4, 6, . . .
4
2
fc= 1,3,5,...,
rrk
4 cos 3x — 4 cos Nx
s v (x)
, ,
= — — 4— cos x —
77
2
2 77 77 3" 77 iV
If N is even, we have .v
v (.v) v
A ,(.v). Figure 15 shows how the graphs of
s , Vj, and .v
3 approximate that of |x| on [
— 77, 77].
.
Figure 15
Example 2. Let
1. < x < IT
g(*) =
— <X < IT 0.
k = 0, 1, 2, . .
k = 2, 4, 6, . . .
fc = l, 3, 5
s
,
ivW
.
= 4- sin x +
. , 4 sin 3x
—+ ,
. . .
4 sin
N
Nx
The graphs of s lt s3 , and s5 are shown in Fig. 16, together with that of g{x).
An important question is whether, for specific values of x, a Fourier
approximation s x (x) converges as N— »- oo to/(x), where /is the function
from which the Fourier coefficients are computed. We define the Fourier
series of/ to be the infinite series
- + 2( a fc
cos kx + b k sin /ex), (2)
where o and b k are given by Formula 5.1. Theorem 5.2 below gives some
fc
Figure 16
400 Real- Valued Functions Chap. 5
Indeed, suppose that the graph off is piecewise smooth. This means that
the interval [—77, tt] can be broken into finitely many subintervals, with
endpoints — <tt xx < x2 < . . . < xk < tt, such that / can be extended
continuously from each open interval (xk xk+1 ) to the closed interval ,
(W(*»-) +/<*»+)]•
Here/(x— ) stands for the left-hand limit of/ at x, and/(x+) represents
the right-hand limit. The graph of a smooth function is
typical piecewise
shown in Fig. 17, with the average value indicated by a dot at each jump.
2ir 3tt
Figure 17
5.2 Theorem
Let/be piecewise smooth on (—tt, tt). Then the Fourier series of/
converges at every point x of the interval to (?)[f(x— ) +/(*+)]•
In particular, if/is continuous at x, then the series converges tof(x).
At x — ±tt the series converges to (\)[f(TT—) +/(—"+)].
The proof is given in the Appendix.
1,
* 4 sin (2k + l)x
fc=0 77" 2k + 1
Sec. 5 Fourier Series 401
To be very specific, we can set x = tt/2 and arrive at the alternating series
expansion
y (-0* _* '
=o 2k + 1
fc 4
Theorem 5.2 shows to some extent the reason for choosing the co-
efficients in a trigonometric polynomial according to Formula 5.1. The
reason is that, under favorable circumstances, the resulting sequence of
trigonometric polynomials converges, as the sequence of partial sums of a
Fourier series, to the function f. We shall now explain another reason.
Consider the vector space TS N of trigonometric polynomials of degree N,
can be formed into linear combinations so as to span the vector space ¥> N .
Thus "GN has dimension at least 2-/V — 1 . If we now introduce the inner
product
in "6 jV , we can show that the set of functions (3) is an orthonormal set and,
therefore, is linearly independent according to Theorem 8.1 of Chapter 2.
5.3 Theorem
[1, m= n ^ 0,
(cos km, cos nx) —
[0, m# n,
m = n,
(1,
0, m^ n,
/ V 2
sin
/
0, all
402 Real- Valued Functions Chap. 5
From Theorem 5.3 it follows that the trigonometric functions (3) form an
orthonormal set in 7S N and, hence, by Theorem 8.1 of Chapter 2, an
orthonormal basis. It follows further, from Theorem 8.1 of Chapter 2,
that the coefficients in a basis expansion
N
f(x) = —a = + 2(o* cos kx + b k sin kx)
«.=(4=./(*)\
1/V2 taken out and included with the trigonometric polynomial. Thus
the Fourier coefficient formulas are the correct formulas for computing
coefficients relative to the orthonormal basis (3) and inner product (4).
This fact explains why it is not necessary to recompute the previously
found coefficients if the space 73,v is extended to 'G v+1 by including
cos (N + l)x and sin (N + \)x.
We conclude this section by showing how the use of complex
exponential functions can simplify some Fourier series formulas. Recall
from Section 7 of Chapter 2 that, by definition,
e
±ikx = CQS ^ x j^ sjn £. x
The identities
%(*) = I c k e* x , (6)
k=-N
Sec. 5 Fourier Series 403
where
Cjc k>0,
(7)
k <0.
2
2. (a) With respect to the inner product (f,g) of two continuous functions
defined by
</.#> =" I
fMg(x)dx,
[1, m = n =£
|p, m # n
(\, m =h^0
(sin nx, sin hijc) = I
(0, m # «
c N
fix) = — - 2 ( c fc cos Arx + r/fc sin /be),
3. Using the identities in Equation (5) of the text, express the following trigono-
metric polynomials as polynomials (i.e., linear combinations of powers) in
kx kx
e' and e~' .
(a) 1 — cos x.
(Jo) cos x — 1 sin 2x.
f.
For example, the identity
cos 2 jc = \ + \ cos 2x
2
expresses cos x as a trigonometric polynomial and so provides the Fourier
expansion of cos 2 x. Find the Fourier expansion of each of the following
functions.
(a) sin 2 x.
3
(b) cos x.
(c) sin 2x cos x.
1
£ ——(N +
————
sin 1/2)7/
- + 2. cos ku =
2 jit! 2 sin («/2)
/
[Hint. Sum the identity 2 sin (h/2) cos ku = sin (k + -i)« — sin (k — ^)u
for k from 1 to A'.]
(b) Is the sum on the left the Fourier expansion of the function on the right?
=—
p e~!
ikx
k = ±1, ±2, Then
jv
V to
define c fc f{x) dx, 0, c fc e
fix) = 2 d*r
k=-N
2c = a , 2c k =ak - ib k ,
and
2c_ fc
= ak + ib k for A: = 1,2,3,..
The direct application of Fourier methods to practical problems usually MODIFIED FOURIER
requires some modification of the standard formulation presented in the EXPANSIONS
previous section. In the present section we describe some of these modifi-
cations and calculate some examples.
While the interval [—tt, tt] is a natural one for Fourier expansions
because it is a period interval for the trigonometric functions, it may be that
a function encountered in an application needs to be approximated on some
other interval.
If the function / to be approximated is defined not on the interval
[— tt, but on [—p,p], a suitable change in the computation of the
-n]
Then we can compute the Fourier coefficients of/p by Formula 5.1. The
resulting trigonometric polynomials sN will approximate on [— tt, tt}. ;)
k-TTX
—
kTTX
p <x <p
*©-M(- +
. .
,
k cos b k sin
)
P P
.
ak —— fp {x) cos kx dx — - / 1 — )
I7 /
cos kx dx
TT J-7, TT J-w \
&ttx k7Tx\
cos +
, ,
b k sin
.
2 k=i \ P P J
Example 1. If
p
2 f . knx J
sin dx
pjo p
(0, fc = 2, 4, 6
sin /ex dx = {
4
Jo fe = l,3, 5,
(irk
4 . TTX
j
*^ :
N^.
Figure 18
b
2 C .. 2knx
= fix) cos
.
;
,
b-a ax,
flj. ;
b-aJa
6.2
b
2 2k7TX_,
k —I [
/(*) sin
.
b-a dx.
'.
6, = 2 f x sin •
2/c7rx dx
2k7TX~l1
= 2 — x cos COS 2/C7TX rfx
2^77 2k', I"
COS 2/C7T
1
- — —1/ sin 2-7TX
. sin 477-x , sin 6-7TX
2 7T\
408 Real- Valued Functions Chap. 5
(even) (odd)
Figure 19
the ^-axis and odd if its graph is symmetric with respect to the origin. It
follows that
f(x) sin
6.1
K = -p J-p/O) sin ~P
\
dx = 0. (3)
It follows that an eren fund ion has only cosine terms in its Fourier expansion.
Similarly, if/is an odd periodic function, the product
f(x) cos
Sec. 6 Modified Fourier Expansions 409
1 C
p
) cos —P dx = 0. (4)
Thus an odd function has only sine terms in its Fourier expansion.
The facts in the preceding paragraph are the key to solving the following
problem: given a function /(.x) defined just on the interval < x <p,
find a trigonometric series expansion for/consisting only of cosine terms
or, alternatively, only of sine terms. The trick is to extend the definition
of /from the interval < x <p to all real x in such a way that the
extension is periodic of period 2p and either is even or is odd. We then
compute the Fourier series of the extension. If f e
is an even periodic
extension off, then/, will have only cosine terms in its Fourier series but
will still agree with / on < x < p.
an odd periodic
Similarly, if/ is
extension of/, then/ has only sine terms in its expansion but will agree
with /for < x < p. We illustrate the procedure with two examples.
Figure 20
f{—x) for —2 < x < 0, and then extend periodically, with period 4, to
the whole x-axis. We can use Formula 6.1 to compute the Fourier coeffi-
cients of/,. Since f is even, e
Equation (3) shows that b k = for all k.
Also, Equation (1) allows us to write
ak =~
J
fe(x) cos — dx
ak =
Jo
(1 — .x) cos — 2
dx
—2
A:t7
(1 -x)sin — /C77"X
2
2
f sin
Jo
. /C7TX
dx
() - /c odd,
fc even.
Finally,
a = (1 - x) dx = 0.
Jo
Thus the cosine expansion off on < x < 2 has for its general nonzero
term
—— cos
kirx
k even.
2 2
r k 2
—2 /
cos ttx +
,
cos
—— + —277-x
:
. cos -3-rrx
•)
Example 4. Starting with the same function as in Example 2, f(x) =
1 —x for < x < 2, we compute a sine expansion by considering the odd
periodic extension shown in Fig. 21. We first define f {x) = —f(—x) for
—2 < x < 0, and then extend periodically with period 4. Using Formula
6.1 and Equation (4) we find, as we intended, that a k = for all k. Also,
Figure 21
Sec. 6 Modified Fourier Expansions 411
by Equation (1),
bk =- I /oW Sin ~^
2
-,
/ (x)
. .
sin —
/C77X ,
dx.
/oW = 1 -
412 Real- Valued Functions Chap. 5
(c) Compare the results of (a) and (b) with the complete Fourier expansion of
g(x) = X, —TT<X<TT.
6. Using elementary properties of integrals, prove Equations (1) and (2) of the
text.
2 C*
«* =o. bk =- /(*) sin
SI kx dx,
"Jo
2
a.
I
= -If* g{x) cos kx dx, b' = 0.
77
Jo
SECTION 7
HEAT AND WAVE In this section we show how a Fourier series can be used to solve some
EQUATIONS problems in heat conduction and wave motion. We first find a differential
equation that is satisfied by the physical quantity being studied and then
apply Fourier series to solve the equation.
Suppose we are given a thin wire of uniform density and length p. Let
u{x, t) be the temperature, at time t, at a point x units from one end. Thus
< x < p, and we assume / > 0. We shall assume that the only heat
transfer is along the direction of the wire and that the temperature at the
two ends is held fixed. For this reason we can, without loss of generality,
represent the wire as a straight segment along an .x-axis and picture the
temperature as the graph of a function it = u(x, t). an example of which is
shown in Fig. 22.
The basic physical principle of heat conduction is that heat flow is
proportional to, and in the direction opposite to, the temperature gradient
V;/. Recall that Vj/ is the direction in which the temperature is increasing
most rapidly, so it is reasonable that heat should flow in the opposite
direction, from hotter to colder. Since the medium is -dimensional and is 1
Figure 22
proportional to
-—
du
(*i,
,
s
+
,
—
du
(x
,
2,
.
0- (1)
ox ox
But the rate of change of heat in the segment is also proportional to the
total change in temperature,
x
*du
(x, t) dx, (2)
Jx i dt
the total being taken over the interval. By the fundamental theorem of
calculus, the expression (1) can be written as
X2
—2
d u
- (x, t) dx. (3)
Ixj dx
Hence the two proportional expressions (2) and (3) for rate of change of
can be combined to give
total heat
a
2
Jxi
—
OX
(x, t)dx = \
Jxi
—
Ot
(x, dx,
7.1 a
a
2
—2
B u
dx
2
,
(x, t)
.
= —
du
dt
(x,
,
s
t).
y(x, 0) = h{x).
u(x, t) = G(x)H(t)
ahlxx = u t gives
a 2 G"(x)H(t) = G(x)H'(t),
, G"(x) H'(t)
(5)
G(x) H(t)
For this equation to be satisfied for varying x and /, both sides must be
equal to a constant, which we denote by —X 2 This procedure . is the
origin of the term separation of variables.
Setting both sides of (5) equal to —A 2 gives two equations:
a 2 G" + l G = 0, 2
(6)
H' + X H = 0.2
(7)
Equation (6) has solutions
G(x) = c2 s ni
(
— )*j A: = 1,2, ....
which, because ?.
2
= {k 2 a 2 n 2 )Jp 2 , becomes
,2, 2 2, 2..
H(t) = e~
k {a
" /p u .
The product solution u{x, t) is thus given, except for a constant factor, by
- fc2,AV)
u k (x, t) = e 'sin (—\x, k = 1, 2
,
Zb k u k (x,t)
looks like
-k (a ir Ip )t
(8)
(7>
To satisfy the initial condition w(x, 0) = h(x), we require
for some N. If h(x) can be expressed in this form, we can expect that a
solution to the problem is given by Equation (8). The boundary con-
ditions, u(0, t) = u(p, t) = 0, require that the temperature remain zero
at the ends, and the initial condition u(x, 0) = h(x) specifies the initial
(10)
Figure 23
—
416 Real- Valued Functions Chap. 5
To make Equation (10) represent the Fourier expansion of// on [0, 77], we
extend h to the interval [—77, 77] in such a way that the cosine terms in the
expansion of/; will all be zero, leaving only the sine terms to be computed.
We do this by extending the graph of// symmetrically about the origin, as
shown in Fig. 23. Then
1 ('
/i(x) cos kx dx = 0,
77 J-*
because h{— x) cos k(— x) = — h(x) cos kx; therefore, the integrals over
[—77, 0] and [0, 77] are negatives of one another. To compute bk we use
the fact that //(— x) sin k{— x) = h{x) sin kx, so that the graph of this
function is symmetric about the j-axis. Then
b» =- h(x) sin kx dx
= —2 f h(x) sin kx dx
77 Jo
=a
77
n
Jo
x sin /c.v dx -f —
77 Js-/2
(77 — x) sin fcx dx
= — : sin
Hence,
0. /c even,
k = 1,5,9,
b,=
-7:, fc = 3, 7, 11,
V77/C
4 /sin x 3x 7x
;(— -
sin sin 5x sin
,
Hx)
, .
" +
5
2
~ 7
2
for each x in [0, 77]. Finally, from Equation 8 we expect the solution to the
equation a 2 u xx = u t to be given by
s<
— — e - 3V< sin3x —1 e_r°
2 2
.2 2.
2
sin 7x +
7
On the other hand, by Newton's law, the force equals mass, p As, times
acceleration n(s ). Hence
As
But
a = —
or
(s, and t(s) = —
os
(s, t);
p —
d
dr
x,2
(s,t)
A = F„d —x,
ds
2
2
(s, t).
.
(11)
2 2
dt ds
2 2
dt ds
7.2 ^=
d
dt
2
a>
d
-y
d
y(x,0)=/(x), (12)
y(x, t) = G(x)H(t),
upon which 7.2 becomes
G(x)H"(t) = a 2 G"(x)H(t)
or
if"(Q_ j2
G"(x)
a '
H(t) G(x)
Since the left side is independent of/, both sides are constant; so we write
G"(x)
G(x)
= .
A and
,
—^ =
H"(t)
H(t)
a
2 ,
A.
c, + c2 - and A
CjC^ * + c e- Vxv = 0.
2
Solving for Cj and c2 we find that, for nonzero solutions to exist, we must
have c x = —c 2 and e~
v Xp
= ex
XT>
, or e ^
2 Xv
= ] . Thus, allowing complex
exponents, we conclude that 2\J Ap = 2-nki for some integer k, so that
G(x) = „ J.irkilj>)x
cx e
J.—iiki!p)x
= 2c A sin — x.
P
We write
G k (x) = b k sin — x.
P
since we have determined that A = Tr
2
k 2 \p 2 . Solutions of this equation are
of the form
,,
H,,k (t) = C k cos trka + D„ k sin trka
_,
t
,
.
t ;
y k (x, = A k cos
nka
t + B„ h sin
,
. itka
t
.
sin —
Ttk
x.
P P P
To satisfy the initial conditions (12) and (13), we form finite or infinite
sums of the type
—
irk
x.
P
Thus the initial conditions become formally
The coefficients A k and {rcka\p)Bk are then determined so that they are the
Fourier sine coefficients of/ and g, respectively.
given by
so simple in this case that we can find their Fourier sine coefficients by
inspection.) The solution to 7.2 then takes the form
,
y(x,
N
t) = ,
b cos —
-na
t
.
sin
tt
— x.
P P
Recall that for Equation 7.2 to be physically realistic the vibrations of the
string should be fairly small — in other words, the coefficient b should be
small. In Fig. 25, the graph of our solution is shown with a = 1 , p = tt,
Figure 25
EXERCISES
1. Use the method of separation of variables to solve the 1-dimensional heat
equation a 2 n XJ . ii t ,
subject to each of the following boundary and initial
conditions.
2". Use the method of separation of variables to solve the 1 -dimensional wave
equation a 2 uxx = u tt , subject to each of the following boundary and initial
conditions.
=
(x, < x < -nil }
(b) L(y) = y tt ~ a 2y xx .
"linear" in the sense that,if two functions satisfy it, then so does any
„ d2 u du d2u
r
2
2
V r 1
=
dr dr d6 2
H" - m= 0,
r
2
G" + rG' + W= 0.
(b) Show that H" - IH = has solutions satisfying 7/(0) = H{1tt) if and
only if A = —k 2 where k is an integer, and that the solutions can then be
,
k
(c) Show that r 2 G" t rG' k2G has solutions r and r~ k for k =
0,1,2,..., but that negative exponents are ruled out if u(r, 0)
G(r)H(6) is to be finite for r 0.
then
SECTION 8
= lim !A(x).
-V- ' fr=l
Recall that this means that, for each x in S, there is a number/(x) such
that, given e > 0, there is an integer K sufficiently large that
i a( X )-/(x) < e,
whenever N > K.
x#l,
x = 1.
Then
v
= Hm 2 X * =
1
2**' '
^or — <x<1 1.
.
For real values of x outside the interval (— 1 , 1), the series fails to converge.
00
The trigonometric series ^ (sin kx)/k 2 converges pointwise for all real
x. The reason is that its terms can be compared with those of the convergent
series ^
fc=i
l/^'
2
> by observing that
sin kx
<1, fc = l, 2,
~k 2
An infinite series
2 fk( x ) tnat converges for each x in a set S to a num-
A:=l
The definition just given should be compared carefully with that of point-
wise convergence. Notice that uniform convergence implies pointwise
convergence, but not conversely. Roughly speaking, uniform convergence
of a series of functions defined on a set S means that the series converges
with at least a certain minimum rate for all points in S. A pointwise con-
s N (x)
Figure 26
424 Real- Valued Functions Chap. 5
vergent series may have points at which the convergence is arbitrarily slow.
Figure 26 is a picture of uniform and nonuniform convergence to the
same function/; sN (x) and t y (x) are Nth partial sums of two series.
To determine that a series converges uniformly we have the following.
Let
fc
^
=
fk
l
be a series of real-valued functions defined on a set S. If
00
1 •
\fk( x )\ ^ Pk f° r a 'l x in 5 and for k = 1 , 2, ... ,
00
2. 2/>k converges,
CO
then
fc=i
^ /t converges uniformly to a function /defined on 5.
00
Proof. The comparison test for series shows that ]T /fc(x) converges
= i /*(x).
fc=A +l r
It follows that
fc=l I
*=A'+1 *=A'+1
00
Since
CO
^pk converges, we can, given e > 0, find a K such that
k=l
sin kx 1
k'
and 2
k=l
l/^ 2 converges. However, the power series 2x
k=0
k
, while it converges
[— r, r] by observing that \x
k
\
< r k for x on [— r, r] and that^ f
k
converges
fc=0
8.2 Theorem
lim/^x)
x-»x
provided the series of numbers on the right converges, and the series
on the left converges uniformly on S.
JV iV
<
k=l k=l
426 Real- Valued Functions Chap. 5
i/,(x)-i«,
fc=l k =l 3
Then for x satisfying |x — x„| < <5, the left side of (1) is less than e.
8.3 Corollary
If
A-
^f
= l
k is a uniformly convergent series of continuous functions fk
defined on a set S in %n , then the function / defined by f(\) =
00
^f (\) k is continuous on S.
8.4 Theorem
If the series "£fk converges uniformly on the interval [a, b], and the
00 ,b
1
fc=l Ja
/",(*) dx IAM ./v.
Let e > 0, and choose K so large that if N> A', then <
i \+ 1
e(b — a) l
, for all x in [<?, />]. Then using the fact that, for con-
tinuous g,
we have
2 fk (x)dx < (b - a) •
e •
(6 - a)~ for N> K.
I
Thus the left side of Equation (2) is less than e in absolute value for
N> K, which was to be shown.
8.5 Theorem
functions defined on an interval [a, b]. If ^fk (x) — f(x) for all x in
CO
°°
A °° rlf
|[A(x)-A(a)] =i f/i(r)A
fc=l k=l Ja
2. sin kx
^ ; 4
Clearly the series converges for all real x. Furthermore the series of deriva-
tives of the terms of the given series is
S cos kx
A =l K
This series converges uniformly for all x by the Weierstrass test, because
cos kx
3
< —
~ 3
1
k k
3
and ^
i
C/^' ) converges. Hence, by Theorem 8.5,
d 2-, sin kx _^
~ cos kx
dx A k* ti
ft k*
2
d ^.sinh _ S. sin kx
i 2 *- i 4 ^— ; 2
dx k=i k k=i k
EXERCISES
1. Show that the series ^ .v
l"
converges uniformly for — */ < <
Jt f/ if < rf < 1.
fc=0
* cos kx
2. (a) Show that the trigonometric series 2,
k=1
— — 7^ converges uniformly for
all real x.
(b) Prove that the series of part (a) defines a continuous function for all real
A.
- -r
Ja i; cos kx + b k sin kx
(b) Prove that the uniformly convergent series of part (a) is necessarily the
Fourier series o\' the function it represents. [Hint. Use Theorem 8.4.]
00
U(X, 0=2
k=\
Ck e
~
k~ l
S'n kx
Sec. 9 Orthogonal Functions 429
(b) Show that if u(x, t) in part (a) is defined for / = by a uniformly con-
2
vergent series, then u(x, t) is continuous on the set S in Jl defined by
< t, < x < IT.
(c) Show that the function u(x, t) is infinitely often differentiable with
respect to both x and /, for t > 0.
6. By considering the partial sums of the power series ^x k for —1 < x < 1,
fc=0
show that the series fails to converge uniformly on ( — 1, 1).
00
00 00
(b) Show that the solution w(x, t) of part (a) satisfies the boundary con-
ditions w(0, t) = w{tt, t) — for t > and an initial condition w(x, 0) =
h(x), where h is twice continuously differentiable.
10. Can Theorems 8.4 and 8.5 be proved under the more general assumption
that the functions involved are vector-valued, with values in Jl n ? [Hint.
See Problems 16 and 17 of Chapter 3, Section 2.]
SECTION 9
Some of the properties of Fourier expansions are shared by a large class ORTHOGONAL
of similar expansions in which the functions sin kx and cos kx are replaced FUNCTIONS
by some sequence {<p k } k=l2 ^ of functions, all of which are mutually
orthogonal. The orthogonality is measured in terms of an inner product
on a vector space of functions. For example, if/and g are elements in the
space C[— 7T 77] of continuous functions on the interval [—77-, it], we can
,
430 Real- Valued Functions Chap. 5
k = I
(<Pk , <Pi)
0, k^l.
The sequence{(p k } k=12 ,... is orthonormal with respect to the inner product
given by Equation (1). The term "normal" comes from the fact the
functions have been normalized by requiring \\(p k \\
= (<p k , (p k )
112 = 1. In
the trigonometric case, the normalization is achieved by dividing the sines
\ '-.
and cosines by
To see the importance of orthonormal sequences in general, we con-
sider the following problem: Let (f,g) be an inner product on a vector
space, and let {(p k } k =i, 2 .... t> e a sequence of elements, orthonormal with
respect to the inner product. Using the norm defined by ||/|| = (f,f)
1/2
,
g -J.c k <Fk
isminimized for given g and N. The fact that the sequence <pk is ortho-
normal makes the solution very simple by adding and subtracting —
N
2 (g, Vkf, we get
N N
< 8 - 2 c k(pk ~i,ck (pk , g -2q.7\
k=l k=l
2
= \\g\\ -2j,c k (g,(pk )+Icl
igii —2 <»»%>'
+ X[<g,?>*> -2c 2
fc <g,^> + cf]
2
= \\g\\ -i(g,<pky+i[(g,<pk ) ck t (2)
k= l k=l
But the first two terms in the last expression are independent of the choice
of the ck s, and the last sum is then minimized by taking ck — (g, q> k ). The
numbers g, <j , are called the Fourier coefficients of £ with respect to the
Sec. 9 Orthogonal Functions 431
9.1 Theorem
The important thing about the conclusion of Theorem 9.1 is that the
2 2
lgll > I(g,<P k ) -
2 2 2
o<ngii -i<g,^> +i[<g,^)-c,]
k=l k=l
0<||£||
2
- I&n) 2
k=l
N
Example 1. The approximation of g by a sum ]£ ^-9"^ h as been measured
fc=i
(1) on the space C[ — tt, tt] of continuous functions on [—ir, it]. Given the
orthonormal sequence cp
x (x)
= \\^2tt, (p 2 „(x n ) = (cos nx)l\fn, q> 2n+1 (x n ) =
(sin nx)l\Jir, we try to minimize, for given g in C[— tt, tt], the norm
2AM-1
g - I, CkVk
We have seen that this is done by taking c k = (g, <p k ). But, by the definition
of the inner product,
g(0 dt, fc = 1
J-w J2tt
[" cos nt
<g> <P*>
, N
g(t)—=-dt,
.
k — 2n
J -IT y/TT
C" sin nt
, .
g(t)—^dt, ,
k = 2n + l.
\J-n y/TT
(g. <Pi)<pM = —
ATT J-ir
g(t) dt,
<g> <P2„)<P2n( x )
= - g(t) cos nt dt cos nx,
—1 f*
(g. <Pin+l><P*n+l( x ) g(t) sin nt dt
TT J-7T
Then
fc=l 2 A-=l
where a k and 6 7c
are the trigonometric Fourier coefficients as defined in
Section 5. The square of the norm to be minimized takes the form
A linear transformation with the same domain space and range space is
7"(x)g(x)rfx = dx.
J J7«g"(*)
Integration by parts twice shows that
9.3 Theorem
Lfi — A-ifi, Lf = X f 2 2 2 ,
and
(A, V 2) = <A> ^A> = a,(A,A>.
Because Lis symmetric, <L/i,/2 ) = (/ ls Z/2 >, so hifnfz) = K{fi,fz)
or (A x— X )(f,f =
2 2) 0. Since A x and A 2 are not equal, we must have
(fi,fz) = 0.
(f,g)=j f(x)g(x)dx.
a
=A(p/t)' -A(p/0'
=/ib/2 + p'/J] -A[p/i + p'fi]
make L symmetric.
This is called the Legendre equation of index n, and it is satisfied by the nth
Legendre polynomial defined by
9.5 P n (x) = ——
2
n
n\dx n
(x
2
- l)
n
, n = 0, 1, 2,
2
(x) dx = n =
J> 2n + l
, 0, 1, 2, . . . .
Ic k Pk (x),
k=0
where
ck = —1 — fj [
I
g(x)Pk (x) dx. (5)
fTgW-ic F,(x)
J-i fc=o
fe
dx
EXERCISES
1. (a) Verify that
</><?>= fMg(x)dx
f(x)g(x)w{x) dx
J
define an inner product?
2. Let {9^.} be an orthonormal sequence of functions in C[a, b], and let ^ be
in C[a, b]. Show that if the real-valued function
[Hint. Differentiate the formula for A under the integral sign. This is
•b -|2
(a)
2 J
fc=i
* a
g(x)<p k {x) dx converges.
Sec. 9 Orthogonal Functions 437
coefficients a k
00 00
and b k such that ^
fc=l
( al + tf)
converges, but such that
2
fc=l
o k or 2^
fc=2
d° es not conver ge -
6. Let C[a, b] be the continuous real- valued functions defined on [a, b],
(a) Show that C[a, b], C'[a, b], and C"[a, b] are vector spaces, each con-
tained in the preceding one.
(b) Let Bx [a, b] be the set of functions/contained in C[a, b] and satisfying
a condition of the form
cf(x ) + df'(x ) = 0,
where c and d are constants, not both zero. Show that Bx [a, b] is a
vector subspace of C[a, b].
(c) Show that C"[a, b] r\ Ba [a, b] r» B b [a, b] is a vector subspace of C[a, b].
t l
7. Show that the differential operator d \dx is symmetric with respect to the
inner product
(a)/(-l) =/(l)=0.
(b)/'(l) =/'(-l) =0.
(c) Cl /(-l) + dj\-l) = c 2 /(l) + d2 f\\) = 0,
where c? + df > 0.
8. Verify that the Legendre polynomial P n defined by Formula 9.5 satisfies the
Legendre equation: (1— x 2 )y" — 2xy' + n(n + \)y = 0. [Hint. Let = //
(x 2 — \)
n
. Then (x 2 — \)u = 2nxu. Differentiate both sides (n + 1) times
with respect to x.]
10. (a) By using Formula 9.5 and repeated integration by parts, show that
1 1
f (2nV C
PnK
2
(x)' dx
£ In + 1
11. Prove that Pn , the nth Legendre polynomial, has n distinct roots in the
interval [ — 1, 1]. [Hint. Use Formula 9.5 and Rolle's theorem.]
12. The 3-dimensional Laplace equation in spherical coordinates (r, <p, 6) has
the form
tvt)
or\
9 /
or/
„ du\
- -•
sin
—T
1
<p dq>\
d I
sin V T"
du\
ay)1+ ^~1T
sin <p
1
12
d 2u
^2"
ctr
= °-
(a) Show that, for solutions u(r, q>, 6) = v(r, cp) that are independent of 8,
the equation has the form
d2
or"
1
sin 9? 3
3
-
/
sin <p — 3«\
= 0.
(b) Show that the method of separation of variables applied to the equation
of part (a) leads to the two ordinary differential equations
r
2
G" + 2K7' = kG, '
sin
1
<p
—d
dq>\
I
sin y —
dH\
dtp ]
\ = -W.
r
-d/2)+VA+(i/4) and r
-(i/2)-VA+d/4)_ [Hint Let r = et ]
(d) Show that the equation for H can be put in the form of the Legendre
equation
(1 - x 2)H" - 2xH' + IH = 0.
13. Show that in the case of the orthonormal sequence derived from the Legendre
N N
polynomials, the general expansion ^ {g, <p k )<p k reduces to ^c k P n (x),
k=0 fc=0
where ck is given by Equation 5 of the text.
Sec. 9 Orthogonal Functions 439
14. Show by using Theorem 8.3 of Chapter 2 that the normalized Legendre
polynomials
\ln + 1
are the same as the sequence of polynomials obtained by applying the Gram-
Schmidt process to the functions {1, x, x2 , . . . }. Use the inner product
=
{f,8) ~jjf{x)g(x)dx
|
Multiple Integration
SECTION 1
ITERATED INTEGRALS This chapter is devoted to the study of multiple integrals of functions with
n
domains in 3i . Such integrals occur in many branches of pure and
applied mathematics, with interpretations such as volume, mass, prob-
ability, and flux. In this section we start with iterated integrals because
they are computationally useful and because they provide a natural
transition to the multiple integral from the ordinary definite integral,
f(x) dx,
f
dy
r f(x, y)
ismeant simply the definite integral of the function of one variable obtained
by holding x fixed for example ;
-
x y- l/=2
f
Jo
x
3
/ dy = 3
—-X
_ Jy=0
F(x)=jj(x,y)dy
440
,
F(x) dx =
F(x)dx /(*, y) dy dx.
\
a Ja \_Jc
H /(*, y) dy.
This notation has the advantage of emphasizing which variable goes with
which integral sign, namely, x with j^ and y with Jf.
(x
2
+ y) dy x*y + dx
Jo Jl f
Jo
[(2x
2
+ 2) - (x + 2
dx
r
Jo
J)]
(x
2
+ f)rfx = i + f =i i.
r
Jo
6
Figure 1
442 Multiple Integration Chap. 6
the integral
(x
2
+ y) dy = x
2
+
I
is the area of the shaded cross section. It is customary to interpret the
definite integral of an area-valued function as volume. Thus we can regard
the iterated integral
(x +y)dy = \±
j>r
as the volume of the 3-dimensional region lying below the surface and
above the rectangle < x < 1, 1 < y < 2.
pi pi pi
dy (x
2
+ y) dx = \
+ yx dy
Ji Jo Ji
/2nn2 2
=) i G + y)dy =
y
f 3^2
= (f + 2) - (i + i) = V.
This time
(x
2
+ j;) rfx = J + j;
i:
isthe area of a cross section parallel to the xz-plane. See Fig. 2. The
second integral again gives the volume of the 3-dimensional region lying
below the surface z = x2 +y and above the rectangle 0<x<l,
A
Figure 2
Sec. 1 Iterated Integrals 443
1 <y < 2. It is not surprising, therefore, that the two iterated integrals
of Examples 1 and 2 are equal.
It is important to be able to integrate over subsets of the plane that are
more general than rectangles. In such problems the limits in the first
+
"ft + *z
frfx f (x y) dy *y dx
Jo Jo
- - *
2 2
-j:o
x(l x
2
) + (1 )
dx
L
i r
1 - 2x
2
+ x
4
=
x' + dx
"ft
For each x between and 1, the number y lies between the values
y = and j = 1 — x 2 In other words, the point (x, y) runs along the
.
Figure 3
The integrand f{x,y) = x + y has a graph (see Fig. 4), and the iterated
integral is the volume under the graph and above the region B.
Suppose we are given an iterated integral over a plane region B in
which the integrand is the constant function /defined by f{x,y) = 1, for
all (x, y) in B. The integral may then be interpreted either as the volume
of the slab of unit thickness and with base B or simply as the area of B.
For example,
dx\ dy =
Jo Jo
Figure 4
dx (xj'-f xy~) dy
J-i Jc
and
+ +
Hi: (x^y xy') dx + (x'y xy-) dx
The second integral breaks into two pieces because, for fixed j between
and 1, the integration with respect to x is carried out over two separate
intervals. Computation of the integral is straightforward. We get
m xy
2
rx 3
y x y
2 2
'
y Jo
( v - y
4
) dy
y = x \ \
Figure 5
Sec. 1 Iterated Integrals 445
J-iL 2 3 Jo J-i\2 3 /
.r^ + raUsCfc
J-i 2 J-i 3
3
The functions x i j2 and x |x| /3 are even and odd, respectively. It follows
that
x dx =
J-i 2 J-i 3 Jo
The theorem which states that, under quite general hypotheses, the
value of an iterated integral is independent of the order of integration will
be proved in the next section. This will prove that different orders of
integration in computing volume must lead to the same result.
Example 5.
"i rx rzx+v ri rx
dx\ dy\ (x + y + 2z)dz = \ dx\ (4x
2
+ 6xy + 2y
2
) dy
Jx Jx Jo Jx
ri rx r2x+y
|
dx dy\ dz
Jo Jx Jx
2X+V
dz,
I
446 Multiple Integration Chap. 6
y = x*
Figure 6
is the length of the vertical segment joining the point (x,y, x) to the point
(x,y, 2x + y). For fixed x, the integral
px /»2
dz
JX JX
is the area of a cross section parallel to the jz-plane. Finally, the triply
iterated integral is the volume.
\
dxA dx 2 . . . dx n
Jo Jo Jo
dx 1 = 1
j:
regions < x 2 x 2 < xlt and x x < 1 shown in Fig. 7. For n = 3, we have
,
k
Figure 7 Figure 8
I„ = \
dxA dx 2 . .
.\ dx n
Jo Jo Jo
n rxi rx„- 2
= dx 1 dx 2 ... x„_x dx n _ x
Jo Jo Jo
pi pn-3 x 2
= p dx 1 dx 2 . . .
-t£ rfx„_:
Jo Jo Jo 2
px
= p dx x dx 2 .
Jo Jo 3!
dx x =—
1 o (n - 1)!
EXERCISES
Evaluate the following iterated integrals and sketch the region of integration
for each.
•H (x 2y 2 + xy 3) dy.
M »0 p
x —
(x +
2\ sin
y*) dy.
j dx. [Ans. 1 — cos 2.]
448 Multiple Integration Chap. 6
(£)
J*
dyj sin xdx.
^ [Ans. 0.]
/*1 pv 2
r 1 r^i-xi
73 ^ </y.
Jo Jo
f
—1 /»2a:
/*jt/2 rcosy
9, rfy x sin _y c/;t.
17. Sketch the subset B defined by0<x<l,0<y<x, and write down the
integral over B in each of the two possible orders of f(x,y) = xsinj.
Evaluate both integrals.
18. Sketch the region defined by x > 0, x2 + y 2 < 2, and x 2 + y 2 > 1. Write
down the integral over the region in each of the two possible orders of
f(x,y) = x2 . Evaluate both integrals. [Ans. 3tt/8.]
19. Consider two real-valued functions c(x) and d(x) of a real variable x.
Suppose that, for all x in the interval a < x < b, we have c(x) < d(x).
(a) Make a sketch of two such functions and of the subset B of the xy-
plane consisting of all (x, y) such that a < x < b and c(x) < y < d{x).
(b) Express the area of B as an iterated integral.
(c) Set up the iterated integral of f(x, y) over B.
,
2^ Sketch
t the subset B of &3 , defined by <x< 1, <y < + x,
1 and
< z < 2. Write down the iterated integral with order of integration z,
then y, and then x, of the function f(x, y, z) = x2 + z over the subset 5.
Compute the integral. [Ans. -2 5-.]
^
22. Let /be defined by f(x, y, z) = 1 on the hemisphere bounded by the plane
z = and the surface z = Vl — x2 — j2 . Evaluate an iterated integral of/
in some order over the region. [Ans. 27t/3.]
[ dxS dx 9
, J.v„
Jo Jo
pi p2 p»
</Xi rfx 2 . . . /(x 1; . . . , x„) dx n
Ja 1 Ja 2 Ja n
25. Evaluate
r 1
r 1
r 1
r xi
dxA dx 2 . . . dx n _ x (x x + x 2 ) dx n .
Jo Jo Jo Jo
2p. Prove that
Jo Jo
dxo
r /(x„) dx n =
(« - l)!jo
j:
(x - o n -VW dt -
SECTION 2
Multiple integrals are closely related to the iterated integrals of the pre- MULTIPLE INTEGRALS
ceding section. Suppose we are given a real-valued function defined on a
set B in %n . Our problem is to formulate a definition of the integral of/
over B
analogous to the definition of a one-variable integral, using sums
rather than iterated integrals.
We first consider some simple sets in 3tn . A closed coordinate rectangle
is a subset of Jl" consisting of all points x = (x 1; . . . , x„) that satisfy a set
of inequalities
at <x < b u t
i = 1 n. (1)
If in Formula (1) some of the symbols "<" are replaced by "<," the
resulting set is still called a coordinate rectangle. In particular, if all the
inequalities are of the form a t
< x, < b u the set is open and is called an
450 Multiple Integration Chap. 6
1< x <4
1 < v < 1
Figure 9
content is the same thing as area, and we often write A(R) instead of
V(R) to have the notation remind us of area rather than volume.
A subset B of 3i n is called bounded if there is a real number k such that
|x| < k for all x in B. A finite set of (n — l)-dimensional planes in 31"
2
(lines in !ft
) As
parallel to the coordinate planes will be called a grid.
illustrated in Fig. 10, a grid separates 31" into a finite number of closed,
bounded rectangles R u R r and a finite number of unbounded regions.
. . . ,
y-&\\s
Figure 10
Assertion (b) means that there exists a real number K such that
|/(x)| < K, for all x in B. The multiple integral of/over B will be defined
in terms of the function /#, which is/altered to be zero outside B, that is
{ fix), if x is in B.
/*(*) = .
c .
[0, if x is not in
l B.
Let G be a grid that covers B and has mesh equal to m{G). In each of the
bounded rectangles R t
formed by G, i 1, =
r, choose an arbitrary
. . . ,
i/^on^)
is Riemann sum for/over B. Its value, for given/and B, depends
called a
on G
and x lt , xr If, no matter how we choose grids G with mesh
. . .
Km IfB&MRt)
m(G)->0 (=1
exists and is always the same number, then this limit is the integral of/
over B and is denoted by § B fdV. If the integral exists, / is said to be
integrable over B.
The limit that defines the multiple integral is somewhat different from
the limit of a vector function defined in Chapter 2, Section 2, although the
452 Multiple Integration Chap. 6
m(G)-0i=l JB
means that, for any e > 0, there exists d > such that if G is any grid
that covers B and has mesh less than 5, and S is an arbitrary Riemann sum
for/B formed from G, then
1 fdV
<e.
/(x) dx.
r
Other common notations for the integral of 31" — >- 31 over B are
f(x, y, z) dx dy dz, if n — 3,
i
/ dx x . . . dx n ,
for arbitrary n.
2.1 Theorem
Let/ bedefined and bounded on a bounded set B in 3V, and let the
boundary of B be contained in finitely many smooth sets. If/ is
continuous on B except perhaps on finitely many smooth sets, then
/is integrable over B. The value of fafdV is unchanged by changing
values of/ on any smooth set.
Sec. 2 Multiple Integrals 453
n
By a smooth set in 'J{ is meant the image of a closed bounded set under
a continuously differentiable function —
% m > :i\", m < n. Thus, if n = 2
and m= 1, we may get a smooth curve. A smooth set in Si 1 will be
understood to be just a point. To say that the value of $ B fdVis unchanged
by changing the values of/ on such a set means that /can be assigned
arbitrary values on the set without affecting the existence or the value of
the integral. For instance we can change the integrand on any finite set
of points without changing the integral. This kind of modification is often
convenient for removing discontinuities.
(2x + y) dx dy,
1
where B is the rectangle < x < 1, <y < 2. The existence of the
integral is ensured by Theorem 2.1. For this reason, any sequence of
Riemann sums with mesh tending to zero may be used to evaluate it.
z = 2x +y
R,
(b)
Figure 11
mesh of G n is \jn, and the area of each of the rectangles R u is \jn 2 Setting .
\n nl
454 Multiple Integration Chap. 6
n 2n n 2n I
jj j\ 1
1 / « 2n \
_ j_ An + 2
n 4n
2
+ 2n \
~n 2
^ 2 2 /
in
2
+ 3n
= 3
2
2n
Hence,
I
(2x + y) Jx d^ = lim (4 + -) = 4.
Jjs M-00 \ 2«/
2.2 Theorem
\fdV
Jb
exists, then the two integrals are equal.
Since the argument used to prove Theorem 2.2 applies equally well to
any order of iterated integration, we have as an immediate corollary:
2.3 Theorem
= (4x + dx
i 2)
= [2x
2
+ 2x]] = 4.
x <2, <y < 1, I < z <2, and shown in Fig. 12. Consider/ (x, y, z) =
xyz. Then
/ dV = xyz dx dy dz
JR JR
= [(f)(i)(f) = f •
=>
Figure 12 Figure 13
456 Multiple Integration Chap. 6
fdV = \
dx\ dy\ xyz dz.
B Jo Jo J
f fdV = -
fx dx Jo( ' v(4 - x
2
- y
2
) dy
Jb 2 Jo
»2
- x3 + |x
5
=
r(2x ) rfx |.
V(B) = I
1 dV = I
dV.
Jb Jb
For sets B in Jl 2 we write
, ^(5), for area, instead of V(B). It follows from
the last part of Theorem 2. 1that the content of a continuously differentiable
A>dimensional (k < ri) curve or surface S is zero, for
V(S) = dV =
|
0.
Js
For some sets B, the integral j"# rfF does not exist. If this happens, the
content of B is not defined (see Exercise 21). Notice that for rectangles
R, the content V(R) has been defined twice: first as the product of the
lengths of mutually perpendicular edges and second as an integral. That
the two definitions agree follows immediately from Theorems 2. 1 and 2.2.
A(B) = \
dA = \
dx\ dy = \
/(x) dx.
B Ja Jo Ja
Sec. 2 Multiple Integrals 457
Hence, the above definition of content is consistent with the usual one
one
for the area under the graph of a nonnegative integrable function of
variable. If/ is integrable over B and also nonnegative on B, we could
similarly show that the volume under the graph off and above the set B
is the double integral B \fdA.
equal to
f (x
2
+ y
2
) dx dy = Fdx f '"(x 2 + y
2
) dy
JD J-l J-Vi-x 2
= 2 f (jcV/l - x + 1(1 - x
a 2
)Vl - x
2
) dx
= ; fVVl - x
2
+ 2xVl - x
2
)
dx
3 Jo
4 ItT Tt\
_ ~
77
'
8/ 2 Figure 14
3 \4
Figure 15
458 Multiple Integration Chap. 6
Notice that what we have called the content of a subset of 31™ is more
properly called its n-dimensional content. For example, the square defined
by the inequalities 0<x<2,0<^<2in3l 2
has 2-dimensional content
4,whereas the square defined in 'Ji by < x < 2, < y < 2, z = 0, and
3
which looks the same, has 3-dimensional content 0. Thus the content of a
set depends on the dimension of the containing Euclidean space with
respect to which it is being measured, as well as on the shape of the set
itself. Having already indicated that 2-dimensional content is called area,
If/and g are integrable over B and a and b are any two real numbers,
then af + bg is integrable over B and
1 fdV
>0.
2.6 Theorem
2.7 Theorem
Proof of 2A. Let e > be given, and choose d > so that if S x and
S 2 are two Riemann sums forfB and gB respectively, and whose grids
,
Sec. 2 Multiple Integrals 459
sum
2
for (of
and
+
\b\
bg) D
Hwhose
gdV <
than 6. Then
S = 2(af+bg)B (x )V(R ) l i
= aS^ + bS 2 .
Hence,
a\fdV- b\ gdV
Jb Jb
Proof o$ 2.5. Since all the Riemann sums are nonnegative, the limit
must also be nonnegative.
Proof ot 2.6. This follows immediately from Theorems 2.1 and 2.2.
Proof of 2.7. The existence and the value of the integral § B fdV
depend only on the function fB Similarly, J c /b . dV is defined by
using (fB ) c which is equal tofB
, .
2.8 Theorem
2.9 Theorem
If/ is integrable over each of two disjoint sets B 1 and B2 , then /is
integrable over their union and
The next theorem will show that, for functions / and regions B for
which \sfdV exists, the value of the integral is completely determined by
the properties stated in the four Theorems 2.4-2.7. The theorem is
2.10 Theorem
(a) If /g/and IBg are defined and a and b are real numbers, then
IB (af + bg) is defined and
e = IB f~\Jb fdV,
and choose 6 > so that if S is any Riemann sum forfB whose grid
has mesh less than <5, then
fdV < -
j;B 2
Let G be an arbitrary grid that covers B and has mesh less than d,
and denote the closed bounded rectangles formed by G by R u . . .
,
R r . Set
C=R 1
KJ . . . UR r ,
g = ZfiXHi-
1=1
( \ _ P>
if x is in i^,
Xr,W - 0> otherwise.
|
It follows immediately from properties (c), (d), and (a) that Icg is
j cg = i/«*w
i=l
fdV - / cg <-
2
lBf=IefB<hg- (4)
fdV >IB f.
1
By an entirely analogous argument using the notion of greatest
lower bound instead of least upper bound, we can obtain
fdV <IB f,
1
JB
IBf=jdxjfdy.
(overf?)
Sec. 2 Multiple Integrals 463
e
fdx >0, if/>0.
1
dx = B — a.
f
\
fdx = f[( v , S) ]dx,
Jy Ja
ifa<y<(5<^ and
[y, d] is the interval
y <x < d.
j- r g (x,y)dx = r^(x,y)dx.
dy Ja Ja dy
EXERCISES
1. Make a drawing of the set B and compute ] B fdA, where
Multiple Integration
chaP- 6
464
(d) fix, y) = x 2 - y 2 and B consists of all (x, y) such that < x < 1 and
x 2 - v
2
> 0.
[Ans. JJ
[Ans. 6.]
(b) 1fix, v) = 3x2 + 2y and £ is the rectangle <x <2,0 <y <l.
' [Ans. 10.]
2
"
< = ——
n(n +
2
1)
'
«(« + l)(2n + 1)
2' =
t=i
2
6
n / n \2
3. Find the volume under the graph of/ and above the set B, where
(a) fix, y) = x + y2 and B is the rectangle with corners (1, 1), (1, 3),
B 1
defined by 0<x<1,0<j<1
B2 defined by 1 < x < 2, -1 <J < 1
1
U 2
-v, if x>l.
Compute S Bl vB 2 /(*» J) rfjc d>'- ^ Ans ' ^
7. (a) Prove the Leibnitz rule for differentiating
an integral with respect to a
parameter: If ix,y) is continuous
gy on a rectangle a x b, < <
c <y <d, then
iJW.j)*-J>.**
^ec -
2 Multiple Integrals 465
fy pb
[Hint. Interchange the order of integration in dy\ gv (t,y) dt, and
Jc Ja
then differentiate both sides with respect to y.]
(b) Use part (a) and the chain rule to show that if continuous, and A,
gy is
d r*/ih 2* (y)
1
git,y)dt
»A 2 (tf)
8. Prove (compare 5.2, Chapter 3): Iffx ,f andfxy are continuous on an open
v ,
set, then/XJ ,
=fyx . [Hint. Apply the Leibnitz rule of Exercise 7(a) to the
equation
10. Compute the multiple integral of f(x,y,z, w) = xyzw over the 4-dimen-
sional rectangle
14. The 4-dimensional ball B of radius 1 and with center at the origin is the
subset of ft 4 defined by x\ + x\ + x\ + x\ < 1. Set up an expression for
the volume V(B) as a fourfold iterated integral.
15. Use Theorem 2.8 to show that if /and |/| are integrable over B, then
\$BfdV\<$B \f\dV.
exist.
(a) Show that if &n -^-> 5Lm and #" -^- # m are both integrable over B, then
k -fdV = k •
fdV.
JB JB
17. Use the result of Exercise 16(c) to show that if ft" > Xm is continuous
on a set B, and x is interior to B, then
s^/v /</F=/(x ),
--ml.*"
is called the centroid of R, and the real number
/(z) = I
|z -z\ 2 dVx
JR
is called the moment of inertia of R about z.
19. Prove the analog for multiple integrals of Theorem 8.4 of Chapter 5: If R
has finite volume, and the series ^/a. of continuous functions converges to/
k=l
uniformly on R, then
°°
f c r °° ~
density on R if
pdV = \. (2)
j;
If £ is an experiment with possible outcomes in Rn distributed according
to the density/), then the probability that the outcome lies in a set B in
n
is R
defined by
Pr[E in B]
L pdv -
(k(l -x -y 2 2
),x 2 + y2 < 1
p(x,y) =
x2 +y > 2
1,
(b) If the outcomes of E are distributed according to the density of part (a),
find the probability that E has an x-coordinate bigger than J.
21. Let B be 2
the subset of Jl consisting of allpoints (x, y) such that <y < 1
22. On the rectangle <x < 1 and <y < 1, let/(x, _y)
= 1, if x is rational,
and f(x,y) = 2y, if x is irrational. Show that
Jo Jo
h;
The change-of-variable formula
(e~
xy - 2<r 2* !
for
') </x
1
#
M
-dimensional integrals
(«-*» - 2e- 2 *»)</7.
SECTION
CHANGE OF
3
is
VARIABLE
/(x) rfx = /(^(U))f (U) du. (i)
J*(a) Ja
2
= 2
=
J> x f/x cos u du
3.1 Theorem
( T(R)J
Figure 16
The proof is in the Appendix. Before showing why the formula works,
we give some examples of its application. Notice that the factor that </>'
finding the geometric relationship between the subset R and its image
T(R) for various transformations T.
(1.1)
(2,1) '(3, 1)
Figure 17
x\ fu\ lu + V
det T' =
1
(x + v) dx dy = [(u + ») + v]i du dv
Jp Jr
= rfu I (u + 2t>) dv = 4.
Jo Jo
domain.
u cos v
u sin v
470 Multiple Integration Chap. 6
y
-(1,7) 2
< 'T)
Sec. 3 Change of Variable 471
Figure 19
we get T(R) — B. The corresponding regions are shown in Fig. 19. Since
sjx'
.
(x\
w vj \u + V2
R.. = =
Figure 20
of the boundaries are indicated in the picture. The image of each of the
four line segments that comprise the boundary of R uv is computed as
follows:
u =
(a) If and <v < 1 , then x = — v and y = v
2
, that is, y = x2
and — < x < 0.
1
"1
then
«i — v1 = u2 — v2 ,
Mi + vt — M2 + fa-
Sec. 3 Change of Variable 473
Obviously, if wx — u 2 then v x
, = v2 . Suppose u x < u2 . This implies
< u\ — u\ = v2 — vu
Hence, vx < v2 , whereas v\ < v\. This is impossible if both vx and v 2 are
nonnegative; so the one-to-one-ness of T on R uv is established. The
Jacobian determinant of T is
2m -1
det r= = 4uv + 1.
1 2v
x dx dy = (u
2
— v)(4uv + 1) du dv
jRxv J Ruv
= \
dv\ (4u
3
v — 4uv + u
2
— v) du
Jo Jo
=j\-2i? + $)dv=-l
of a subset by the vector T(u ) leaves its volume unchanged, and the
differential da T, being a linear transformation, changes volume in a
particularly simple way. Indeed, under a linear transformation volumes
get multiplied by a constant factor, and the factor of proportionality is
just the absolute value of the determinant of the transformation. For
instance, suppose T is taken to be a linear transformation, and /is the
constant function 1, that is, f(u) = 1 for all u in \L™. The change-of-
C
3.2 Theorem
\v \u + v
2w 0\ lu
yVI \ 1 1/ \V ::)(:
Near
M T\ il 0\ lu - 1\ /2m
")=r( + r(u )(
,v) \\)
]
\v - \u + v
(1,1)
Figure 22
Sec. 3 Change of Variable 475
figure on the right. The affine approximation A carries R onto the paral-
lelogram outlined with dashes. Notice that the area of the parallelogram is
roughly equal to that of the curved figure. The exact area of the parallelo-
gram is easily computed to be \, twice the area of R. The important point
is that the affine approximation to T doubles the area of the square, while
T itself approximately doubles that area. The magnification factor, 2, is
2
det r = 2.
1 1
To find the exact area of the image T(R), we use the change-of-variable
theorem. Since u is positive on R, the transformation T is one-to-one
there. The inverse function is given explicitly by
y
so Tis one-to-one. Moreover, the Jacobian determinant/ = 2w is positive
on R. Hence,
A(T(R)) = j
dA = { \J\ dA
Jt(r) jr
/•3/2 /*3/2 /*i
= dv\ 2u du = \u
%
Figure 23
/
f
fdV = z( fdV^^mnK)), (2)
JT(R) i JT(Ri) i
2F(W))«2/*WW (3)
fdVHtZifo-nMViR,).
j;
Fig. 24. Since the Jacobian determinant of the polar coordinate trans-
formation
fx\ fr cos 6\ [r\
^T
\yj \r sin 6]
<IL-
SB
Ar
Figure 24
Sec. 3 Change of Variable 411
cos 6 —r sin
J=
sin 6 r cos 6
\dA = \
r dA = \
rdr\ dd
JS JR Jr a JOo
= r ArA0 + KM A0
o
2
3
Example 8. Spherical coordinates are introduced in 'Ji by means of
the transformation
ordinate "cube" C shown in Fig. 25. The spherical ball with center at the
origin and radius is defined in 3l
1
3
by jc 2 + y 2 + z 2 < and is denoted 1
below by Bxyz With respect to polar coordinates, the same ball is defined
.
by the inequalities
Figure 25
V(B XVZ ) = dx dy dz = r
2
sin <f>
dr d<f> dd
JBxvz JB r<t>e
fl fir rzv
= r
2
dr\ sin <j> d<f>\ d6
Jo Jo Jo
= Kl + D27T = 4j
Notice (as in Example 3) that both conditions (a) and (b) of Theorem 3.1
fail to hold on However, except on a subset of Br<f>e having zero
Br<t>9
.
EXERCISES
1. Let
2uv
(a) Sketch the image under T of the square in "M 2 with vertices at (1, 1),
(b) Sketch the image under T'\ I of the square in part (a).
(c) Sketch the translate of the image found in part (b) by the vector
1\ /l
7"
1/ \1
Sec. 3 Change of Variable 479
Verify that this is the image of the square under the affine approxi-
mation to Tat r
\1>
(d) Find the area of the region sketched in (c). [Ans. 2.]
(e) Find the area of the region sketched in (a). [Ans. ^.]
2. Let
(u\ lu COS V
T
y] \vj \ u sin v
(a) Sketch the image under Tof the square S with vertices at (0, 0), (0, tt/2),
(b) Sketch the image under T'\ I of the square S. What is the area of
\ 0/
the image?
(c) Sketch the image of S under the affine approximation to Tat (77/4, 77/4).
3. Let
III COS V
usmv
lu\
Show that T I
J
transforms a rectangle of area A into a region having
area it A.
4. Compute the area of the image of the rectangle in the wt'-plane with vertices
at (0, 0), (0, 1), (2, 0), and (2, 1) under the transformation
lA " sS]
y) \2 i)C)-
5. Consider the transformation T defined by
x\ {u\ lu 2 — v
7
y! W \ 2uv
Let R uv be the quarter of the unit disk lying in the first quadrant, i.e.,
u
2
+ v
2
< 1, u > 0, v > 0.
x = u + v,
y v. Let R uv be the region bounded by (1) */-axis,
(2) t'-axis, and (3) the line u + v = 2.
dxdy
[Ans. 2.]
tRxy Vl
j; + 4x + 4y
x = u, y = v(l + u 2 ),
and let R uv be the rectangular region given by < u < 3 and < v < 2.
d(x,y)
(b) Find
d(u, v)
(a) Find R.
(b) Compute | /e 1 dudv by integrating directly over R, and then by using
the transformation formulas to integrate over D. [Ans. ^p.]
(c) Compute J /e v du dv both directly and by using the change-of-variable
theorem. [Ans. 128.]
u =x
v =y(l + 2x).
1 <y < 3,
find the image region R of D.
(c) Find
\
du dv, v dv du, and u dv du
Jn JR J R
by direct integration, and then by reducing them to integrals over D.
[Ans. 24, 228, 45.]
10. Compute the area bounded by the polar coordinate curves = 0, — n/4,
and r = 2
. [Ans. 7r
5
/(2
10 •
10).]
x 2
v2 z2
2
h—-\
2 2
<
~ 1
a b c
x 2 dx dy dz.
\
JO<z<l
xx = ux
X2 = Ml + U2
X3 = ll
x + W2 + M3
Xn = «1 + »a + • • + «n
16. Cutting a solid of revolution R into thin cylindrical shells, with axis the
same as the axis of revolution, leads intuitively to the following formula
for the volume of the solid:
2nrh(r) dr.
I
Here /;(/) is the thickness of the solid at a distance r from its axis, measured
along a line parallel to the axis. Show that introducing cyli ndrical coordinates
in the integral l R dx dy dz leads to the same formula.
17. (a) Let a ball B of radius a have density p at each of its points equal to the
distance of the point from a fixed diameter. Find the total mass of the
ball.
(b) Let a cylinder of height h and radius a have a density p equal at each
point to the distance of the point from the axis of the cylinder. Find the
total mass of the cylinder.
SECTION 4
IMPROPER The definition of the integral can be extended to functions that are
INTEGRALS unbounded and not necessarily zero outside some bounded set. We shall
first consider some examples.
Figure 26
which x > 1 and y > 1 , it is natural to define § B fdA in such a way that it
can be called the volume under the graph of/. We can approximate this
volume by computing the volume above bounded subrectangles of B.
lying
To be specific, let By be the rectangle with corners at (1, 1) and (TV, N)
and with edges parallel to the edges of B. For > 1 we have N
r fdA=\
N
dx\
dx\
rN
i
—
x y
{
2
dy
-{\:f)-{^
As jV tends to infinity, the rectangles BlX eventually cover every point of B,
and the regions above the By fill out the region under the graph of/.
Then, by definition,
fdA lim
f fdA= 1.
f(x,y) = -log (x 2 +y 2
), < x2 +y < 2
1.
Sec. 4 Improper Integrals 483
r:.>
/u '
>-)=-i°g(* 2 + y
2
)
i
r i /
Figure 27
The graph off is shown in Fig. 27. Since /is unbounded near (0, 0), we
cut out from B a disk centered at (0, 0) and with radius e. Call the part of
B that is left B €
. We have, using polar coordinates,
fin pi
log (x
2
+ 2
dx dy d - (log r)r dr
JBf
y )
1 i
= -2-rr[r log
2
r-\r 2
]\
= it + 27re
2
log e — ttc
2
log (x
2
+ >'
2
) dx dy = lim -
)
log (x
2
+ y
2
) dx dy = tt.
c-0 JB€
Figure 28
(x
2
+ y
2
+ 1)- 1/2 dx dy dz
I
r-2 rin rir
= \
dr\ dd\ (r
2
- l)-
1/2
r
2
sin <f>
d<j>
Jl+e Jo Jo
= 4tt
f (r
2
- l)-
1/2 2
r dr
= 47r[irjr
2
- 1 + I log (r + Vr 2 - l)]
2
lim f (x
2
+/- z
2
+ 1)
1/2
dx dy dz = 4^3 + 2tt log (2 + ^3).
e-0 JB€
Both conditions are satisfied in the three examples considered so far, and
we shall assume that they hold throughout the rest of the section.
In the examples, we have seen that the integral § B fdV can sometimes
Sec. 4 Improper Integrals 485
provided that the limit is finite and is the same for every increasing family
of bounded sets By converging to B. It is assumed that the BN are chosen
so that the ordinary Riemann integrals § Bn J" dV (as defined in Section 2)
exist. The integral thus obtained is called the improper Riemann integral
4.1 Theorem
limf fdV
N JBy
is finite for some particular increasing family of sets By converging
to B. Then \ B fdV is defined and has the same value,
lim fdV,
-V JCy
for every other family {CN} converging to B.
|
fdV <lim| fdV
JCy X J By
for all N. Because IcyfdV increases and is bounded above,
limf fdV
A' JCy
y=\
Figure 29
I fdA = \dx\ y-
me~x dy
JRs Jo Jl/N
Then
J[/*V***-lta(l-^)(2-^)-Z
. .
~a -
N l
- 1/iV
1
i ; ,
a¥- 1,
h/N
j;
\2\ogN, a = l.
_a
1
x dx = l
, for a > 1,
f a - 1
and
x a
dx = 1
, for a < 1
1o I — a
The integral
%1
~1
P
J-i x
fails to exist if we require that its value be independent of the limit process
by which it is computed. Indeed, if we integrate first over the intervals
[— 1, —(5] and [e, 1] with < d < 1 and < e < 1, we get
-6
dx = log |x| + log X
J-l X Je X
d
= log -
1
As e and 6 tend to zero, log (<3/e) can be made to tend to any number by
controlling the limit of the ratio d/e. In particular, if we keep e = d,
the limit is zero.
For a function /having a graph symmetric about some point x it is ,
p.v. -= 0.
J-l X
Figure 30
S
p.v. f fdV = lim f -^rdrdd
JD <r->0 JDe r
2"
C C 1 rlr
= lim sin 6 d0\ —= 0.
€-0 JO Je r
4.2 Theorem
Let /and g have the same infinite discontinuity points. If |/| <g
and J B g dV exists, then so does § B fdV.
Then, because/ + 1/1 > 0, the value of the integral j"^ (/ + l/l) dV
increases as J5 A- increases, and we have
N JBy N JBy
Since the family B N is arbitrary, B J"dV is defined.
jj
10
2
+ /) 1/2 ,
for x < 0.
~
Example 8. Let j{x) = (— \) n l jn for n — 1 < x < n and » = 1,2,
3, . . The graph ofy is shown in Fig. 31 as far out as x = 4. Then
. .
;(x)rfx = 2— y — =io g 2,
71-+00 JO
and we can write
Figure 31
490 Multiple Integration Chap. 6
EXERCISES
1. In each part determine whether the integral is defined or not. If it is defined,
compute its value.
f=° dx
(a)
TF^TT • Ans W2-]
X2 + 1
\- -
dx
(b)
x- - 1
dx
(c)
Jo v 1 — x1 -
dx dy
(d)
(x — y) dx dy
(e)
|
f dx dy dz
'
2
J x*+v*+z*>i (* + y1 + z2)2
f ^/a: dy dz
(g)
"
J x*+v*+z*>i xyz
xyi
x ~ v~ z
(h) e~ dx dy dz, where C the infinite column
I
|
max
is
2. Prove that
~1 -
(a) T(n) = e- x x n dx = (n 1)!
(b) Express J
r e~*(.x:
— j) _1/2 */x </y in terms of r, where T is the region
x >y > 0. [/4«5. 2r(|).]
<^
—-
3. Let 5 be the ball |x| < 1 in A". For what values of a does
,
f:
|
-
exist?
_ Ixl
C
x x dx 1
x dx
C
4. Compute: (a) p. v. (b) p.v.
J —x X~
,
-t I
.
_i 2x - 1
1
f x</.v
5. Compute the values of the function ^-(v) = , taking a principal
J-i v - y -
6. Compute the integral of the function/in Example 7 in the text, and compute
\ B (x 2 + f)-1 '* dx dy. [Ans. |»r, 2w.]
Sec. 4 Improper Integrals 491
7. Show that the integral of the function j in Example 8 of the text depends on
the sequence of sets used to compute the limit. [Suggestion. Take each jB v
8. Let f{x, y) = 2
sin (x 2 + y ) over the quadrant Q defined by x > 0, y > 0.
9. In what sense does each of the following integrals exist? The possibilities
are ordinary Riemann integral, improper integral, conditionally convergent
integral, or none of these.
/""sin* f"
(a) —x j- dx. (c) sin x dx.
Jit Jo
00 1
f sin x f 1
(b) dx. (d) sin-dx.
Jo x Jo x
lim e~ €xf(x) dx = k.
e— 0+ Jo
/*CO (*DO
~ y2
11. (a) Compute e~ x2 dxdy. (Use polar coordinates.)
Jj? 2
12. (a) Show that the area bounded by the graph of y = \\x, the x-axis, and
x
the line = 1 is infinite,
(b) Compute the volume swept out by rotating the region described in
part (a) about the jc-axis.
S B fdA and $c dV
both exist, then they are equal.
15. Show that if the ordinary Riemann integral l B fdV exists, then it exists as an
improper integral (given conditions (1) and (2)) and the two integrals are
equal.
492 Multiple Integration Chap. 6
° 2 [p] = I
|x - M[p]\ 2 p(x) dV.
JR
Show that each of the following functions is a probability density, and
compute its mean and variance if they exist.
SECTION 5
ESTIMATES OF
In many of the examples of this chapter, numerical evaluation of integrals
INTEGRALS
has been made by using the fundamental theorem of calculus to arrive at a
precise answer in terms of some elementary function. In practice, such a
computation is very often not feasible, and then an estimate for the value
of an integral may have to serve instead. The fundamental inequality
used in making estimates is contained in Theorem 2.8. We repeat it here.
sin x
/(*) = '
<x < 1
(1 + x 2) 2
< <
Hence, by 5.1,
1 1
x x dx
< f sin
— dx ,
< f
\
-
Jo((1+x
2 2 2 2
Jo(l+x ) )
dx 1
0.25;
Figure 32 1o(l+x 2 2
) L2(l+x 2 )J<
,
hence
04- + Jo (1
sin x
x
2 2
)
dx < 0.25.
Figure 33
0.73 <cos (
^L^\
J
< 1-
By 5.1 we have
J
0.73 dx dy <
J
cos (^^) dxdy <\ Idx dy.
Evaluating the largest and smallest of these integrals is easy because they
are the volumes of solid rectangles with base 5 and heights 0.73 and 1
The estimates in the two preceding examples are fairly rough because
we have replaced the given integrand /by approximating functions that
differ from / considerably over a relatively large part of the domain of
integration. Of course, the estimates could be improved by choosing
approximating functions that agree with/more closely. However, to be
approximating functions themselves should be easy to
really useful, the
integrate. One way to achieve this is to choose for an approximating
494 Multiple Integration Chap. 6
(1 + x + y)
3
dx dy,
j;
wm\ N 2M/\(2NM)J
(2NM))
To simplify the expression, we can take TV = M; hence the sum takes the
form
N N
wh\ N 2nJ\2N 2
,
Evaluating sums like this for even moderately large values of N is best
done on a computer. We get the following table:
N
Sec. 5 Estimates of Integrals 495
5.2 Theorem
df
(x) < M {,
( = !,...,».
dx
{b t —
a^/Ni in the x -coordinate, where is an integer. Then the
(
N t
V(R) |hM t it
(fdV=2 f /dK.
Then
ZV(R )max\f(x)-f(xk
k )\.
fc=l x in Ri;
l/(x)-/(y)| = |/'(*)(x-y)|
n
df
i=l OXj
1=1
as was to be shown.
496 Multiple Integration Chap. 6
/ = (1 + x + yf dx dy
JR
of Example 3 has integrand f(x, y) = (1 + x + )>)* We have fx (x, y) —
fv(x ,y) = 3(1 + x + y) R < <
2
The . rectangle determined by x 1 and
< y < \ is such that
max \fx (x, y)\ = max |/„(x, y)\ = 3(f)
2
= ¥.
(•r.v) in R Ix.y) in iJ
Thus x M M
= 2 = ~t- If we subdivide into K equal parts along the x-axis
and L equal parts along the j-axis, we have /? x = IjK and h s = 1/2L.
Then a Riemann sum 5^ L based on such a grid will satisfy
IJ-Szrrl <
«<4(i)(7V(i)(?);
8 \X 2L/
\I-SK L \<0.05._
y — u sin v
for < u < 1 and < v < tt/2, we can transform Q into the rectangle R
shown in Fig. 34.
Sec. 5 Estimates of Integrals 497
i
i i i
i
i i i
+-I-I-+-4-
rn~r|i
+ H-I-4-4-
.
i i i
j
I I I I
J
-I
i
— — ——
|
'
i
i
I
i
I-
I
Figure 34
/ = f f(x, y) dx dy,
JQ
the change-of-variable theorem shows that
I = \ (u
2
cos
2
v + 3
u sin v)u
3 2
du dv.
JR
Setting g(u, v) = w4 cos 2 v + u 5 sin 3 v, we find
= |4u
3
cos v
2
+ 5w sin
4 3
v\ < 9
du
and
K
= — 2u |
4
cos v sin v + 3u sin v cos
5 2
u| < 5
dv '
based on a grid with TV equal subdivisions along the w-axis and equal M
subdivisions along the f-axis. Then Theorem 5.1 shows that
<15 13
N M
By taking N= M = 60, we get the error bound
I-* "^eo 6ol -^ 60 ' ITo ^- "•-'•
.
EXERCISES
1. Find rough estimates above and below for the following integrals by using
the largest and smallest values of the integrand on the domain of integration.
(b) e*
2
dx. (e) dx\ dy\ Vx + y + zdz.
Jo Jo Jo
ri /»2
(c)
Pdy P cos xy dy. (f) dx\ dy\
r>i
(x 2 + y2 + z 2 )dz.
Jo Jo Jo Ji J-i
•l /*i
3. For each integral in Problem 2, estimate the error in making the approxi-
mation. Use Theorem 5.1
where M {2)
> 2
\(d 2fldx )(x)\ for / = 1, 2, . . . , n. If the integrals in Problem
2 are estimated using the midpoint rule for 10 equal subdivisions in part (a)
and 10 2 = 100 in part (b), find a bound for the error.
(x + y2 ) dx dy.
I'
SECTION 6
NUMERICAL The estimates of the previous section are rather crude relative to the degree
INTEGRATION of accuracy often required in numerical work. The purpose of this section
is to show how greater accuracy can be obtained. Theoretically, Theorem
Sec. 6 Numerical Integration 499
j\(x) dx = —^
b
\q{d) + 4<?(
£
-p) + q(b) (1)
x = a, xx = —+
(fl fr)
, x2 = .
6.1
"fix) dx m °
- ~a
r
Ja
Figure 35
N even
I I I I I I I
a — .x o x i x2 • • • x r
v _ 2 xN _ i
xn = b
Since
fxz rxt rxy
f(x)dx = \
f{x)dx + \
f(x)dx + ... + ]
f(x)dx,
J*6
h
f/(x) dx * -=-± [/(.v ) + 4/(x0 +/(x 2)]
Ja 6(AT/2)
6(N/2)
+ J"-?"
[/(.x.Y - 2 ) + 4/(x v _ 1 ) + /(.x.v )].
6.2 /(x) dx
^ b ~ a
+ 4/(x x ) + 2/(x 2 ) + + 4/(x A _ 1 ) +/(x A )].
*" [/(*o) . . .
3N
1 -0 VN 2/N lN ~ 1),N
[e° + 4e + 2e + ... + 4e + e
1
].
3N
N
502 Multiple Integration Chap. 6
h
5
M
f f(x)dx--[f(a) +
Ja
4f(c)+f(b)] <
90
where ft = (ft — d)\2, and c = (b + a) 12. First let
+
£(') = P 7« dx - l
- [f(c + + 4/(c) + /(< 0]
Jc-i 3
and
5
ft
2
60r
£ (0= - Mf (c + -/ (c - 0] - tt £ W-
that
+ 3) -f\c -
f"\c r t 3) = 2t zf«\u) (4)
for some /4 between c — and c + t3 t3 . Since < /3 < ft, the number
t4 lies in (a, ft). We also have from (3) and (4)
= {i)
£(/>)
fQ f (U)-
1
2
x
e dx
I"
Jo
We find f
U) (x) = 12e*
2
+ 48xV + 16xV 2 2
; so on [0,1], we have
\f
U) (x)\ < Me + 48e + \6e < 228. Then the error in replacing the
integral by the Simpson approximation using N points is at most
(1 - 0)
5
228 L3
'
4 4
180iV " iV
A 2 2,— Aj_ x — 4, A j =
. . . , and J is even. The points x are evenly 1 ,
;
[dy [f(x,
Jc Ja
y) dx m (
±^ 5J
(A
i^I B&AJi** »)
jK. A-=0 ;=0
6.4 f f{x, y)
J It
dx dy * ^|
yJKk=0j=0
I^A/C*,-, y k ),
where x ; = a -\- j{b — a)jJ and y k =c+ /c(d — c)jK. The products
/ijjBj. are simply the products of the usual Simpson coefficients. For
example, if A3 = 4 and B2 = 2, then ^ 352 = 8.
504 Multiple Integration Chap. 6
Example 3. Consider
(1)
(4)
(1)
Sec. 6 Numerical Integration 505
we apply the one-variable Simpson's rule three times to get the 3-dimen-
sional Simpson rule:
Example 4. To approximate
we apply Formula 6.4 to the solid rectangle 1 < x < 3.6, 1.5 < y < 1.8,
1.1 < z < 2.3. The edges of the rectangle have lengths 2.6, 0.3, and 1.2,
"
II4
_ (3.6- 1)(1.8- 1.5)(2.3- 1.1) ' _ _
= I/lAQlog(x
. ,
z,),
,
Sie.2.8 .
7m nwg
27(16)(2)(8)
. .
,=oA=o ; =o
y
>'
fc
6.6 Theorem
bf — a t
h,=
N<
> /?,M.-.
180 ,fl
E =
Jc
dy
Ja
f(x, z) dy - v
^
9JK
2Aj2B k f(x
j=0 1=0
it y k ),
h — a J
d
C
+ -rflAA f(x»y)dy
iJ j=o Jc
(b-a)(d-c) $ A *
ttt;
yj k
2 a jZ
j=o k-=o
BJ(xj, y k ).
d — cK
+ IAj /(*„ y) dy —r—lBJix^yt)
3J ;=0 J" 3K k=o
(b - a)
5
b - a (d - c)
5
\E\ < Mj dy l* M,
r 180J
4
3J ~o
t
180K 4
J
and y, respectively. Furthermore, it is easy to check that ]£ ^i =
3/. Hence
|£| <{d-c)
180J
v-
4
M. + ib- a)
180K 4
vM 2
- - -
+ (d-
4
(d c)(b a) (b af .. ..
M x
c)
— Mo
180 J 4
K 4
of Example 4, we find
— - (x, y, z)
}
= —ox
ax
4 ^ '
— (x, y, z) = -6y
— -(x, y, z) = — 6z .
fully, because they are to be raised to the fourth power. In fact, we take
508 Multiple Integration Chap. 6
EXERCISES
1. In applying Simpson's rule to the approximation of $%x sin xdx, what is
xi
(a) x* dx. (b) sin x dx. (c) e~ dx.
Ji Jo Jo
3. Use the Simpson's rule with four intervals to estimate each of the following
integrals.
x x
(a) sin x dx, (b) log sin x dx. (c) e dx.
Jo Ji J-i
4. Using a computer, apply Simpson's rule over 100 intervals to estimate each
of the following integrals.
2
.
2 1
-
dx f
23
:™dx
f f
(a) e*dx. (b)
-3-TT- < c>
Jo Jo X3 + I Jn X
5. Using Theorem 6.3, estimate the error in each approximation in Exercise 4.
10. How large should / and K be chosen in Formula 6.4 to ensure that a Simpson
approximation to
dx e*» dy
Jo Jo
gives an error no more than 0.0001 ? Remember that J and K must be even
integers.
Sec. 6 Numerical Integration 509
11. How large should J, K, and L be chosen in Formula 6.5 to ensure that a
Simpson approximation to
gives an error
H
Jo
?
dy
/•1.2
e*
yz
dz
(x
2
+ 2y
2
) dx dy
L
cannot be well approximated directly by Simpson's rule because A is not
rectangular.
(a) Transform the integral by the change-of-variable theorem into one over
a rectangular region.
(b) Approximate the integral found in part (a), accurate to within 0.005.
7
SECTION 1
GREEN'S THEOREM The fundamental theorem of calculus says that if /' is integrable for
a <t <b, then
f\t)dt=f(b)-f{a). (1)
V/-(x).Jx=/(b)-/(a). (2)
The main theorems of the present chapter are also variations on the idea
that an integral of some kind of derivative of a function can be evaluated
by using the values of the function itself on a set of lower dimension. We
begin with the version known as Green's theorem.
Let D be a plane region whose boundary is a single curve y, paramet-
rized by a function g in such a way that, as / increases from a to b, g(t)
traces y once in the counterclockwise direction. An example is shown in
Fig. 1. If Fr and F2 are real-valued functions defined on D, including its
f ( IT
JDXOXy
~ ir)
OX 2!
dx> dx * = lF
Jy
>
dx > +F >
dx *> W
under appropriate smoothness conditions. The requirement that y be
traced counterclockwise is the analog of the fact that, in Equations (1) and
(2), the differences on the right have to be taken in the proper order. The
analogy of Equation (3) with Equations (1) and (2) can be further
strengthened if we think of the integrand {dF^dx^ — {dFijdx 2 ) as a kind
510
Sec. 1 Green's Theorem 511
72
71
74
Figure 1
and F2 (x l5 x 2 ) = x x e Then
x,i
.
—
OXi
(x l5 x 2 ) - —
ox 2
- (x l5 x 2) = e
2
+ e
1
;
so
r
Jl>\OX 1
/^f 2
_ |F,\
OX 2 l
^^=p^p J-l J-l
(eXa
+ eXi)
^
= 4U -
The boundary curve y can be parametrized in four pieces y u i = 1,2,3, 4,
b>
//i
F dx t x -f- F dx 2
2
= — x 2 e Xl dx x + x^* 2 dx 2
dt
dt dt
I
l
e dt = e — -
J-\ e
Similarly, the integrals over the other three sides are also equal to (e — \je),
so
is to be applied, we shall see that this must be done so that the boundary is
traced just once, and in the proper direction.
The importance of the clockwise versus counterclockwise traversal of
the boundary becomes apparent when we observe that, for any line
integral, a reversal of the direction of the path changes the sign of the
integral. Thus, if y is parametrized by g(t) for a < t < b, we can denote
by y~ the curve parametrized by g~(t) = g(a + b — t) for a < t < b.
It is clear that y~ is the same set as y, but is traced in the opposite direction,
that is, from g(b) to g(a) instead of the other way around. Then, since
(£"('))' = —g'(a + b - t), we have
I
F dx • = - f F(g(a + b -t))> g'(a +b— t) dt.
Jy— Ja
dx = —
L F -*—l F
1.1 •
\
• dx.
Green's theorem can be proved most easily for regions D such that y,
the boundary of D, is crossed at most twice by a line parallel to a co-
ordinate axis. Such a region is called simple. Thus a coordinate line inter-
sects the boundary of a simple region either in a line segment or else in at
most two points. In fact, using Equation 1.1, we can extend the theorem
to finite unions of simple regions. A few such are shown in Fig. 2, where
only D x is simple. In D 2 the boundary is shown traced not always counter-
clockwise, but rather with the region always to the left as a point traces the
curve. For bounded regions with a single boundary curve, the two
descriptions of the orientation of the boundary amount to the same thing.
y\
Figure 2
gi(0
a < < t b.
(::) g 2 (0
514 Vector Field Theory Chap. 7
Since
F dx 1
x + F, dx, = Fj dx x + F 2 dx 2 ,
we can work with each of the terms on the right separately. We have
The curve y consists of the graphs of two functions u(x x ) and v(x x ),
perhaps together with one or two vertical segments, as shown in
Fig. 3. On
a vertical segment, g x is constant, so g[ there. On —
the remaining parts of y we apply the change of variable x x = g x (t)
so that, on the top curve, g 2 (t) = u(xx ), while on the bottom,
g 2 (t) = v(x x ). It follows that
F x (x x x 2 )
, dxx = F x
(x x u(Xl))dXl
, + ^(xj, v( Xl))d Xl ,
J
F x (x x , x 2) dXl = -F x
(x x u(Xl))
, + FjKxu v( Xlj)] dXl
I
(x l5 x 2 ) rf.v 2 dx x
dx.
1 z>
— —
dx.
axj ax2 .
F,(xj, x 2 ) dx 2 = —- dx x dx,.
Jv Jd ox x
x2
0'
x2 = u ( x \)
Figure 3
Sec. 1 Green's Theorem 515
k = 1, , K.
. .Applying Green's theorem to each simple region D k
.
we get
tdF1 _d_F1 \
J*
dx 1 dx 2 -
f
Fi dx, + F, dx 2
JD^dx^ dx 2
The sum of integrals over D k is an integral over D; so
jD\dx 1 dxj
+F + + +F
iF x dx x 2 dx 2 . . .
I ^i dxj_ z
dx 2
Figure 5
Fx dxi +F 2 dx 2 + Fi dx 1 +F 2 dx 2 = 0.
i:
Thus, while the parts of the curves yk that make up y contribute to
f^r7! dx x + F 2 dx 2 the other parts cancel, leaving
,
d
2 ~ 1 )dx = +F
(^ \
jDXOXi {
OXJ
l
dx 2 \F 1 dx 1
Jy
2 dx 2 .
The last part of the proof just given extends Green's theorem from
simple regions to one like those shown in Fig. 6. The extension has an
(a) (b)
Figure 6
_h_ dli =
d
(4)
dx x dx 2
F x dx x +F 2 dx 2 = F x
dx x -f F2 dx 2
Fi(x u x 2 ) = '
2
,
'
F 2 (x l5 x 2) =
x{ + x\ A+A
for (*!, x2) ^ (0, 0). Direct computation shows that these functions
satisfy Equation (4). If y is the ellipse shown in Fig. 7 and defined by
F dx x
1 +F 2 dx 2 = 0,
?Uc-
F dx x
x +F 2 dx 2 = F dx x
x +F 2 dx 2
Jy Jo
But on c we have x\ + x\ = 1 , so
F dx +F dx 2 = — x 2 dx x + xx dx
b x x 2
= (sin
2
1 + cos
2
dt = 2-n.
i
It is important to observe that Green's theorem could not have been
applied directly to the entire interior of the ellipse because {dF2 jdx x ) and
(dFJdxo) fail to exist at the origin.
Example 3. The curve y x given by g(t) = (t, t 2), < t < 1, is shown
in Fig. 8. Suppose that F(x x x 2 ) = (F^Xx, x 2 ), F2 (x x x 2 ))is a continuously
, ,
f F dx =
• {'[F^t, t
%
) + F 2 (t, t')(2t)] dt.
Jy 1 Jo
that
— —— -
-
) dx x dx 2 (J
d \ dx x dx 2 J
would imply
518 Vector Field Theory Chap. 7
'ft'
Thus
I F dx
• = F I x (f, 0) dt + I
F2 (l, dt.
Jy 3 JO JO
This may be easier to compute than either of the integrals over y x and y 2 ,
Wcoi
gV)\ \gV)\
i*'(oi/
F-tds.
,, ,
.
curl F(x)
dF,
= -— (x) —
ox,
dF
—
ox 2
x
(x).
F
Figure 9
curl F dA F • t ds,
sometimes called Stokes's theorem. Now interpret Fas a force field in the
plane. The line integral represents the work W{y) done in moving a
particle around y in the counterclockwise direction under the influence of F.
Stokes's theorem says that W(y) is equal to the integral of the curl of F over
D. In particular, if curl Fis identically zero in D, then W{y) = for every
smooth circuit y contained in D, whether y is oriented counterclockwise or
not. For this conclusion to hold, it is of course necessary that curl F be
defined throughout the inside of every circuit in D to which Stokes's
theorem is applied. Conversely, it is possible to show that if W(y) =
for every smooth circuit, then curl Fis identically zero. See Section 2.
Fcan also be interpreted as the velocity field of a fluid flow in D. That
is, the vector field Fat each point of D represents the speed and direction of
the flow at that point. In this case the line integral in Stokes's theorem is
called the circulation of F around and Stokes's theorem says that the
y,
circulation of F along y is the integral of the curl of F over D. Thus to
say that curl Fis identically zero in D is to say that the circulation is zero
around every smooth closed curve with its interior contained in D. A
field Ffor which curl Fis zero is called irrotational for this reason.
Now using the unit normal n(?), we can rewrite Green's theorem
in another way. Instead of applying the fundamental Equation (3) to the
field F=(Fx F2 ), we apply it to the related pair of functions (— 2 Fj).
, ,
F 'n ds.
520 Vector Field Theory Chap. 7
Kg+g)^-
We define a real-valued function div F, called the divergence of F, by
EXERCISES
©1. Use Green's theorem to compute the value of
dxz where y is each of the following closed
jCj ,
the line integral $
paths.
y
x2 dx1 +
(a) The circle given by g(t) = (cos /, sin f), < t < 2n.
(b) The square with corners at (±1, ±1), traced counterclockwise.
(c) The square with corners at (0,0), (1,0), (1,1), and (0,1), traced
counterclockwise.
2. Using some one of the paths y l , -/.>, or y 3 in Example 3 of the text, compute
the line integral J
x 2 dx 1 + x 1 dx 2 .
3. Let 7 be the curve parametrized by g(t) = (2 cos t, 3 sin t), < t < In.
Compute j"
v (2x 1 + x 2 ) dx^ + (a: x + 3x 2 ) dx 2 .
(a) fy
e* cos y dx + e x sin y dy, where y is the triangle with vertices (0, 0),
(1,0), (1, tt/2), traced counterclockwise.
(b) Use the same integrand as in part (a), but change the path to the square
with corners at (0, 0), (1,0), (1, 1), and (0, 1), traced counterclockwise.
(c)
J c (x
2 — y 2 ) dx + (x 2 + y2 ) dy, where c is the circle of radius 1 centered
at (0, 0) and traced clockwise.
Sec. 1 Green's Theorem 521
V/(x), the gradient of /. Show that if F(x) = (F^x), F2 (x)), then the
equation {dFjdxJ — (dFJdx 2 ) = is satisfied in D.
V/.rfx,
1
x (a) for k = k = k an arbitrary positive integer, (d)
0; (b) for 1; (c) for
What interpretation can be given the line integral if k is a negative integer?
F>dx=Ik , A: = 1,2.
ick
Show that if y is any closed smooth path that avoids x x and x 2 then ,
F • dx = n-Ji + n 2 I2 ,
.1
9. (a) Consider a particle moving in a plane vertical to the surface of the earth
and subject to the gravitational field G(x, y) = (0, mg), where m is the mass
of the particle and^ is the acceleration of gravity. Show that as the particle
moves in the plane, the amount of work done is independent of the path
between two points and depends only on the and final points. In initial
(dF1 /dx 2 ) throughout the plane. Show that the same conclusions hold.
10. (a) Let 31 > Jl 2 trace a simple closed curve y in the counterclockwise
direction. Show is given by t(/) =
that, if a unit tangent vector to y
g\t)l\g'{t)\, then the outward pointing unit normal to y at^COis given by
n(0 = (g2(t)l\g'(0\, -g'i(0llg'U)\), where g x and g 2 are the coordinate
functions of^-.
522 Vector Field Theory Chap. 7
F • n ds.
1
[Hint. In the previous formula, replace F2 by F x
and F 1 by -F2 .]
11. Assume that the vector field F= (F1 , F.,) in Exercise 10(c) is a gradient field,
that is, F= V/for some real-valued/. Show that Green's theorem can be
written in the form
vy.n<fc,
J>"-J,
where A/ = (Pf/dx*) + {d 2f/dx 2 ), the Laplacian of/.
2
12. (a) Show that if/ is a continuous real-valued function defined in an open
set D of ft
2
, and fa/fdA = for every circular disk M in D, then /is
identically zero in D. [Hint. Show that if/(x ) # for some x in D,
then there is a disk M centered at x such that |/(x)| > d for some
(5 > 0, and all x in M.]
(b) Use part (a) and Stokes's theorem to show that if curl Fis continuous in
an open set D, and the circulation of F is zero around every smooth
circuit in Z), then Fis irrotational in D, that is, curl Fis identically zero
in D.
(c) Use part (a) and Gauss's theorem to show that if div Fis continuous in D
and $("/) = for every smooth circuit y in D, then Fis incompressible.
-(a) (x + iyf.
(b) <?
SECTION 2
CONSERVATIVE The examples of the previous section show that, under certain conditions,
VECTOR FIELDS it is possible to alter the path of integration in a line integral in the plane
without affecting the value of the integral. Not all line integrals have this
property, but those that do are particularly important, not only for the
computational reasons already illustrated, but also because of their relation
to the gradient. In fact, we have the following theorem, valid in 3l n , which
is a converse to Theorem 2.5 of Chapter 4.
Sec. 2 Conservative Vector Fields 523
2.1 Theorem
F-dx
I
is independent of the piecewise smooth path y from x to x in D,
then the real-valued function defined by
/(x) = f F-dx
Jx
unit vector u and for all real numbers / satisfying |/| < <5,
-t-
/( x + tu) — /(x) = F dx • - \ F dx •
«/x Jx
"x+iu
F -dx
r
= F(x + vu) ' u dv.
dxi t-*o t
i r*
= lim - F(x + ve s ) • e,- dv
t->o t Jo
—— F(x + ue.)
3
• e,- dv
dtJo '
= F(x) • e, = F,(x),
=
W(x u Xo)
f \
Jx,
F- dx.
If the particle follows a particular path given by g(t), then the velocity and
acceleration vectors are v(7) = g'(t), a(t) = g"(t), and we have F(g(t)) =
ma{t), where m is the mass of the particle. Hence,
W(x x 2 )Y ,
= ma(0 • v(r) dt,
ifg(t { ) = x,. But since a(7) = v'(0> and (djdt)v 2 (t) = 2v(r) • v'(/), we have
m C d
=^
2
o
W(x ,x 2 )
1
f [v\t)]dt,
2 J dt tl
= l;
2
(f 2 )- t;
2
(f I )). (1)
^(
The function k(t) = (m/2)v 2 (t) is called the kinetic energy of the particle at
time t.
w(x) =— F dx•
W(x lt x 2 ) = r> • dx
r*2 /*xi
= F dx • - F-dx
Jx n Jx„
u(x 2 ) + «( Xl ). (2)
Sec. 2 Conservative Vector Fields 525
«(x2) + tJ
r
2
v (t 2 ) = m(xj) + y v\td.
In other words, along the path traced by g{t), the sum u(g(t)) + k(t) is a
constant, independent of /, called the total energy of the path. For this
energy which is "conserved" and which gives rise to the term "con-
servative field."
2.2 f dx • = F dx •
where y[x l5 x 2 ] and (3[x l5 x 2 ] are any two piecewise smooth curves in D
having initial point x : and terminal point x 2 An alternative formu- .
2.3 f F dx =
•
details of the proof have already been illustrated in Section 1 and will be
left as an exercise.
We can summarize what we have proved about gradient fields in
Theorem Chapter 4, Theorem
2.5 of 2.1 of the present section, and in the
previous remark, as follows.
2.4 Theorem
f F-dx.
F>dx = 0.
—=F
ox-L
x and -^ = F 2
dx 2
.
-?h ay _3Fi
and
dx 2 dx 1 dx 2 9xj dx 2 dx±
dF\ dF2 .„
=
dx 2 dx 1
throughout D. By the definition of curl F, Equation (3) can be written
curl F — 0. The equation can also be expressed another way: We consider
&- = F» j =1 n.
dxj
dF,_ df
2
_ a
2
/ _dF t
(4)
dx t dx dxj
t
dxj dx dx.
But the functions dFjdxj are the entries in the n-by-« Jacobian matrix
of 51" —
> 51", and Equation (4) expresses the fact that this matrix is
symmetric. Hence
Sec. 2 Conservative Vector Fields 527
2.5 Theorem
defined for all (x, j) 7^ (0, 0). It is easy to check that dFjdy = dF2 /dx.
But there isno continuously differentiable/such that V/"= F. The reason
is that, for x > 0, the function/(x, 7) = arctan (y/x) satisfies V/"= F, but
this /cannot be extended to be a single-valued solution of the equation
in the entire plane with the origin deleted.
Example shows that the nature of the region D on which Fis defined
2
is significant in determining whether F is a gradient field. By making a
special assumption about D we can obtain a partial converse to Theorem
2.5.
2.6 Theorem
Xq^.
Figure 11
which leaves the value of the integral (5) unaltered. Each path can
be described by a sequence of coordinate directions, only one of
which is allowed to vary at a time. (For example, the dotted path in
F -dx F t
dXj +F j dxj.
(BFj
Jd jR s \OX
[d Xi i
dx
OXj/
<p F dx • = I
F dx • — I
F dx
• = 0,
and so the change of path leaves the value of the integral invariant.
Once it has been established that x can be approached along a
path of integration that varies only in an arbitrary coordinate, say
the Arth, we have, as in the proof of Theorem 2.1, the equation
dfjdxk (x) = Fk (x), for all k. Thus V/(x) - F(x) for all x in R.
Sec. 2 Conservative Vector Fields 529
K* y) = (x y* * (0
W
(-f^-2 -rj—)
+ y i x' + yV ' '
' 0) '
y dx x dy
/(*, y) +
Jd.O)
o) x'
.
+r xz +r
where the path of integration any piecewise smooth curve from
is
(1, 0) to (x, y). A polygonal path from (1, 0) to (x, 0) and from
Figure 12
(x, 0) to (x, y) is particularly simple. On the first segment, the
entire integral is zero because y is identically zero, and on the second seg-
ment, with x constant, the integral reduces to
fv xdy
2
= arctan
0-
Jo x + y
2
The most general potential of F in the right half-plane differs from this
one by at most a constant. (Why?) The general solution there of V/= F
is therefore
EXERCISES
1. Consider the approximation to the earth's gravitational field acting on a
particle of mass 1 represented by the vector field F(x,y, z) = (0, 0, —g).
(a) Find for F the potential function u(x, y, z) that is zero when (x, y, z) =
(0, 0, 0).
(b) If a particle of mass 1 has at (0, 0, 0) a velocity vector (v t v 2 v 3 ) with , ,
v3 > 0, and no force but F acts on the particle, find the path of the
particle.
(c) Verify that the sum of potential energy and kinetic energy remains
constant for the path of part (b).
2, (a) Show F and G are gradient fields defined on the same domain D,
that if
then F+ G
and cF are gradient fields, where c is a constant,
(b) Let SJ be the vector space of gradient fields defined on a domain D.
Show that "V has infinite dimension.
530 Vector Field Theory Chap. 7
3. Use Theorem 2.4 or 2.6 to decide whether the following vector fields are
gradient fields.
(a) F(x,y) = (x - y, x + 2
y), for (x, y) in Jl .
(b) G(x, y, z) = (y, z, x), for {x, y, z) in dl 3 .
4. Use Theorem 2.5 to show that the vector fields in Problems 3(a) and 3(b) are
not gradient fields in any open subset of :R 2 or ft 3 , respectively.
5. Show that the vector field of Problem 3(c) is a gradient field in the region
y > of Jl 2 and find an explicit representation for its potential.
8. Consider the vector field F which is the gradient of the Newtonian potential
/(x) = — |
1
xl^ for nonzero x in Jl 3 .
Find the work done in moving a particle
from (1, 1, 1) to (—2, —2, —2) along a smooth curve lying in the domain
ofF.
9. Give a detailed proof of the equivalence of Relations 2.2 and 2.3 of the text.
10. In 3l n , how many paths can there be from x to x of the special kind described
in the proof of Theorem 2.6?
SECTION 3
g(u, v) = I
g 2 (u, v) (1)
\g»(«> v ))
dg dg
(u, v), (u,v)
du dv
du dv
(b)
Figure 13
f{u,v)xf{u,v)
du dv
d
3.1 o(S) = f ^(u,v)x^(u,v) du dv.
Jd du \
I dv
u
3.2 pda = \
p(g(u, v)) f(u,v)x"f(u,v) du dv.
JS JD du dv
g(u, v)=\ v
for
1 <w + v <4; 2 2
-^ (m, u) X ^ (u, u)
3u 3d
Sec. 3 Surface Integrals 533
xi
534 Vector Field Theory Chap. 7
yc
x{
Figure 15
3.3
Jd
f F(g(u, v)) (|i
You
(ii, v) X ^
ov
(u, v)\
J
du dv,
and denote it by
s F dS or j s F • n <ftr.
j" •
It is easy to check that the coordinates of the normal vector are given
by Jacobian determinants as
x
du dv \d(u,v) '
d(u,v) '
d(u, i>)/'
often written
+ ^2
"
du dv
J/j 9(w, f ) d(u,v) d(u, v)
flow per unit of area and time across a piece of smooth surface 5
lying in R. If S is perfectly flat and F is a constant field, then the
flux is equal to where FB is the coordinate of F in the
FD a(S),
direction of a unitnormal to S. Thus, in this case, the flux is equal
to the volume of a tube of fluid illustrated in Fig. 16. Because Fn =
F n, we get the formula
•
0> =F • na(S)
for the flux.
If S is a piece of smooth surface in R, we partition S along Figure 16
coordinate curves of the form u = const, and v = const, and
assume that, within each part of S so formed, the field F is constant.
Approximating 5 by tangent parallelograms spanned by vectors
leads to the picture shown in Fig. 17. The approximate flux across a
typical subdivision Sk of S will have the form
The sum
I% = i F(g(u
fc=i fc=i
k )) •
(f*
\dw
(uA ) x ^
du
(u,))
/
Au A,
/(*("))
Figure 17
536 Vector Field Theory Chap. 7
m(O)->0fc=l JD \OU OV I
F •
ds,
LIS
which is the previously defined integral of F over S. Consequently, we
define the flux of F across S by
= FdS.
j;
We remark that the sign of <1> would change if S were reparametrized so
that the normal vector determined by the parametrization pointed in the
opposite direction.
p, N / Xi £2 £3 \
The flux of F across a sphere S a centered at the origin and of radius a takes
the form
dx 2 dx 3 + x 2 dx 3 dx x + x 3 dx x dx 2 .
a Jsa
f
Xi\ la sin w cos
< w < 77
so that
d(x 2 x 3 )
,
= 2
a sin
.
2
w cos
d(x 3 , xj
a sin 9? sin
B(<p, 6)
d(xx, x 2)
a sin 93 cos 9?.
Sec. 3 Surface Integrals 537
These are the coordinates of a normal vector pointing out from the
sphere. Then
F dS
• = a dO sin
3
9? cos
2
+ sin
3
7? sin
2
6 + sin <p cos
2
95 d<p
Jsa JO Jo
terminal point of one piece is the same as the initial point of the one that
follows it. To integrate a vector field over a piecewise smooth surface,
7
*!
Figure 18
538 Vector Field Theory Chap. 7
\
F dS = I
F-dS + \
F-dS.
JS JSi Js 2
da = du dv,
du dv
does not change when the orientation is reversed by interchanging the roles
of u and v. But in Formula 3.3, the vector surface element,
dS du dv,
\du dv
does change sign when u and v are interchanged. We observe that the
surface element dS can also be written in the form
dS = n do,
du
Sec. 3 Surface Integrals 539
0<«<1,
6 K ' ' I I ' < v < 2.
\
W /
(b) F(x, y, z) = (x 2 0, 0) and
, S is given by
(u cos »\
u sin i'
\
I
<
— « <
— 1,
'
I' 0<t;<277.
3^ Find the total mass of a spherical film having density at each point equal to
the linear distance of the point from a single fixed point on the sphere.
4. Let x = g(u, v), for («, v) in D, and x = /7(s, 0, for (s, in fi, be para-
metrizations for the same piece of smooth surface S in :R 3 . If there is a
one-to-one transformation T, wayscontinuously differentiable both
between D and B, such that the Jacobian determinant of Tis positive, and
such that g(u, v) = h(T(u, v)) for (u, v) in D, then g and h are called
equivalent parametrizations of S.
(a) Show that equivalent parametrizations assign the same surface area to
S. [Hint. Use the change-of-variable theorem.]
(b) Show that equivalent parametrizations assign the same value to the
surface integral of a vector field over S.
independent of a.
7. (a) If &2 f
>
Jl is continuously differentiable on a set D bounded by a
piecewise smooth curve, show that the area of the graph of/ is
o(S) = f Vl + (fx f + 2
(fy ) dxdy.
540 Vector Field Theory Chap. 7
(b) Find the area of the graph of f(x,y) = x 2 + y for < x < 1,
<y < 1. [Ans. Vf + i log Vl + V3).] (
8. Show that if Jl
3 —^> Jl is continuously differentiable and implicitly deter-
mines a piece of smooth surface S on which dG/dz # 0, and which lies over
a region D of the xy-plane, then
Assume
MJSRIH
that just one point of S lies over each point of D.
</.v (/y.
10. For each of the following sets find a parametrization as a piecewise smooth
orientable surface with outward pointing normal.
12. Let F be a continuous fluid flow field and let M be a piecewise smooth
Mobius strip lying in the domain of F. Is it possible to define the flux of
F across Ml
13. Parametrize the set of Problem 10(a) so that it is unoriented, with normals
pointing out on the bottom and in on the sides. Compute the integral of
F(x, y, z) = (x, y, 2z —x — y) over the unoriented surface.
14. Prove that if F and G are continuous vector fields on a piece of smooth
surface S, then
15. (a) Let F be a continuous vector field on a piece of smooth surface S. Show
that
17. The solid angle determined by one nappe of a solid cone e in X3 , with
vertex at the origin, is defined to be the area of the intersection of e with
the unit sphere |x| =1. See Fig. 19.
(a) Show that a suitable reduction of the above definition leads to the usual
definition of the angle between two lines.
~^{b) Compute the solid angle determined by the cone x2 + y2 < 2z 2 , < z. Figure 19
SECTION 4
Figure 20
.'ft
3 — > Ol 3 is a continuously differentiable vector field whose domain
contains S. Then the vector field curl F is defined by
curl F(x)
.
_, .
=
/5F
(
-^-s3
\dx
(x) - BF- 2
3.x,
.
(x),
.
—
BF
a.v
t ,
(x)
.
- BF-3 (x), BF . 3F
-^ (x) - J
.
2
3x x
. .
— l .
(x)J
\
is an open set, then the domain of curl Fis the same set. The vector field
curl F ds • = d) F •
dx. (1)
I J as
Then the border of S consists of three line segments and a spiral curve
shown in Fig. 21 together with the domain D of the parametrization.
.(')
(4)
(3) I
Figure 21
Sec. 4 Stokes's Theorem 543
Yi-
y-i-
Y3-
Yi'
x 3 dx x + *i dx 2 + x 2 dx 3 .
It is easy to see that the integrals over y x yz and y 3 are , , all zero, while over
Yi we get
—r
72
f
F dx
• \ (cos
2
t + sin t — t sin r) rfr
Jy4 Jo
t/2
-.
\\lu
c/m f
I
(sin v — cos v + u) t/f = —
t,.
Jo Jo 4
curl F dS = <f
F •
dx,
(j)
Jbs
f, dx x =f-
Js
—dx,
x
dxy dx, + ^
dx 3
dx 3 dx,. (2)
F, dx 2 = — — - dx 2 dx 3 -j dx x dx 2
Js dx 3 dx t
and
F 3 dx 3 = — — - dx 3 dx, + — dx 2 dx 3
ds '
Js dx x dx.
are similar, and addition of the three equations gives Stokes's for-
mula. To prove the top equation, suppose that h(t) — (ii{t), r(7))is a
counterclockwise-oriented parametrization of b, the boundary of D.
Then g(h(t)) is a piecewise smooth parametrization of the border of
S, which by definition is then positively oriented. Writing glt g,. g3
for the coordinate functions of g, we have
<\> F dx 1
l
= (F1 (g(u,v))jg1(u,v)dt
>>dS J dt
dt
= JVifete o)
du dv. (3)
•'as jD\_du\ dv J dv\ du /
d u\ dv J dv\ du 1
(4)
dx 2 d(u, v) dx 3 d(u, v)
Substitution of this identity into Equation (3) gives Equation (2), thus
completing the proof.
r curl F dS.
i -*-Ls
The value of the line integral is /"around y r and it
called the circulation of ,
integral is, for small enough r, nearly equal to the dot product curl F(x ) n •
,
Figure 23
I
F dS = I
F dS•
+ I
F-dS.
JSiVSs JSi JSi
The piece of common border curve, indicated by a broken line in Fig. 23,
line integrals over d-S^ and BS 2 are added, the integrals over the common
part will cancel, leaving a line integral over the rest of the borders of Sx
and S 2 . It is this remaining part that we call the positively oriented border
of SV U S 2 and denote by d(Si U S 2 ). (Thus
, in general d(S t U S2 ) ^
dSi U dS 2 .) With this understanding we write Stokes's theorem in the
form
curl F dS = i) F • dx,
Js JdS
then the positively oriented "border" of the sphere consists of the half-
circle shown in Fig. in each direction. Thus the half-circle
24 traced once
corresponds to the segments u and u = 2tt in the parameter domain.
=
(What happens to the segments v = and v = 77?) The result is that a
line integral over dSa will be zero, and Stokes's theorem applied to a vector
field F on S a gives
curl F dS = 0.
£Sa
Sec. 4 Stokes's Theorem 547
IT
1
548 Vector Field Theory Chap, 7
J
(a) (b)
Figure 25
the strict converse false. Using Stokes's theorem, we can prove another
is
and y is the border of S. The region between two spheres shown in Fig.
25(a) is simply connected. However, the open ball with a hole punched
through it is not simply connected, because any surface whose border
2
encircles the hole must lie at least partly outside B. In :R , the typical
simply connected region is the inside of a closed curve, while the outside
of such a curve is not simply connected. In Fig. 26(a). the curve y is the
border of the surface consisting of the part of the plane lying inside y.
However, the presence of the hole in Fig. 26(b) prevents a similar con-
struction. More precisely, we shall say that an open set is simply connected
ifevery piecewise smooth closed curve y lying in B is the border of some
piecewise smooth orientable surface S lying in B, and with parameter
domain a disk in Jl 2 . We assume for applications that 5 is parametrized by
(a)
Figure 26
,
4.3 Theorem
F dx • = curl F-dS = 0.
EXERCISES
1. Compute curl F if
(a) F(xlt x 2 x 3 ) = (x 2
,
- x\, xa - x\,x x - x 22 ).
(b) F(x,y,z) = (x,2y,3z).
2. Verify by computing both integrals that Stokes's theorem holds for the
vector field F(x lf x2 x3) =
, (x lr x 2 x 3 ) on
, a hemisphere centered at the
origin in Jl 3 .
'u cos v\
\ 1 < u <2,
f(«, v) = I u sin v I
curl F • n da F-tds,
j;
IS JdS
5. Use the result of Exercise 15 of the previous section and Stokes's theorem
to prove that if F is a continuously differentiable vector field at x , then
lim ,
F-tds = curl F(x ) • n ,
r -+o A(D r )
Figure 28
14. In the open subset B of 3i3 consisting of a 3 with the origin show that
deleted,
a circle centered at the origin is the border of a piecewise smooth surface
lying in B.
SECTION 5
The Gauss theorem (described below) has many applications, some of GAUSS'S THEOREM
which are discussed in Section 6. Like Stokes's theorem, it can be looked
at as an extension of Green's theorem. We begin with a region R in 'Si z
having as boundary a piecewise smooth surface S. Each piece of S will be
parametrized by a continuously differentiable function 3la —L+.313 such
that the normal vector dgjdu x dgjdv points away from R at each point
of S. The boundary surface 5 is then said to have positive orientation, and
we denote the positively oriented boundary of R by dR. To state the
theorem, we consider a vector field F, continuously differentiable on R
and its boundary. We define the divergence of F to be the real-valued
function div F defined on R by
divF(x) = ^(x) +
ox 1 ^ 2
ox.,
(x) + ^
ox 3
3
(x),
where F F x ,
2 , F3 are the coordinate functions of F. Then the Gauss (or
divergence) formula is
I
diwFdV = I
F dS.
JR JdR
The Gauss formula is like Stokes's formula, Green's formula, and the
formula
grad/.rfx=/(b)-/(a),
r
Ja
the origin, and on the inner sphere point toward it, as shown in
Fig. 29. We shall say that dR is positively oriented with respect to
R if normal vectors given by the parametrization of BR point
the
away from R.
We shall prove Gauss's theorem for the case in which R is a
finite union of simple regions, where a simple region in 5l 3 is one
\
div F dV = f F-dS.
Jr hi
f
JR\OX 1
—+—+~
(3f 1dF
)
. dF 2
OX 2
.
OX 3 f
3\ ,
dx 1 dx 2 dx 3
. .
= F dx 2 dx 3 + F 2 dx 3 dx x + F3 dx 3 dx 2
1 .
J BR
a.
We assume first that R is a simple region and prove only the equation
—* dx 1 dx 2 dx 3 = F 2 dx 3 dx lt
JR OX 2 JSR
the proofs for the terms containing F x and F3 being similar. Addition
of the resulting equations will then prove the theorem for simple
regions. Because R is simple, dR consists of the graphs of two
functions, s(x u x3 ) and r(x u x3 ), perhaps together with pieces
parallel to the x 2 -axis as shown in Fig. 30. Let
/gi(u, v)\
x2 = r ( Xy , x3)
Figure 30
f
JiR
F2 dxz dx, = f
Jd
F2 ( gl , g 2 g,)
, ^-^ o(u, V)
rfu ^r, 0)
and, on the sections of dR that are parallel to the x2-axis, the normal
vector to dR is perpendicular to the x 2 -axis. Hence d(g 3 gi)jd(u, ,
v),
g 3 (u, v)
gi(u, v)
+ F2 (x l5 r(xt x 3 ), x 3 )
, dx x dx 3 ,
JRz
JdR
F 2 dx 3 dx y
=
Jli2l.JslX1.r3)
—
OXo
(x l5 x,, x 3 ) </x 2 Jx! rfx 3
— - dXi dx 2 dx 3
r dx 2
.1;
\
F dx dx 3
1 2 +F 2 dx 3 dx x + F3 dx-^ dx 2
.[dR
dFj. . dF 2 ,
dF
,
dx y dx 2 dx 3
R\OXi OX., OX-,
f(x, y, z) = (x 2 + / + z y V2 2
is independent of a.
across a sphere of radius a, centered at the origin,
Using Gauss's theorem we can prove something more general, and with a
minimum of calculation. Thus, let Sj and S 2 be any two piecewise smooth
closed surfaces, one contained in the other, both containing the origin, and
bounding a region R between them; for example, R might be the region
between two spheres. A routine calculation of the gradient shows that
F(x,y, z) = (x 2 + y 2 + z 2 )~ 3/2 (— x, —y, — z), and then that the diver-
gence of this field is zero, i.e., div F= everywhere except at the origin.
In particular, div F= throughout R. Applying Gauss's theorem to R
gives
f
FdS = \ divF dV = 0.
JdR JlR
f F dS =- I
_ F dS +•
I
F dS = 0.
JSR J Si J Si
Sec. 5 Gauss's Theorem 555
Thus the integrals over the outward-oriented surfaces are equal. To find
the actual value, it is enough to compute it for one surface, say a sphere.
The result is —4tt.
— —
V(BT) J Br
I
div FdV = —!—
V(B r) Jbb
I
F • n do,
where n is the outward unit normal. As r tends to zero, the left side tends to
div F(x ). See Problem 17 of Chapter 6, Section 2. On the right side, the
integral is the average rate of flow per unit of volume across the sphere of
radius r centered at x Hence the limit as r tends to zero is, by definition,
.
the rate per unit of volume of flow outward from x Because of its con- .
EXERCISES
1. Compute the divergence of the following vector fields:
(a) F(x Xt x 2 x 3) ,
= (x\, x\, xl).
(b) F(x,y, z) = (sin xy, 0, 0).
(c) F(xu x 2 x3) ,
= (x 2 , x3 , Xj).
2. Prove the following identities for any twice continuously differentiable vector
field For real-valued function/.
A/ = div (grad/),
4. The trace of a square matrix is defined as the sum of the elements on its main
diagonal. If & n — > R n
is a differentiate vector field, we define div F to be
the real-valued function given by
where tr A stands for the trace of A. Show that in the 2- and 3-dimensional
cases this definition agrees with those previously given.
FdS
j;s
over the sphere of radius 1 centered at the origin in ft 3 , and with outward
pointing normal, where F is
(a) F(xt x 2 x 3) = (xf, x\, x§).
, ,
6. Show that for a region R to which Gauss's theorem applies, the volume of R
is given by
i r
V{R) = - atj dx 2 dx 3 + x* dx3 dx x + x z dx x dx2 .
3 JdR
7. (a) Use Gauss's theorem to prove that if F is a continuously differentiable
vector field with zero divergence in a region R, then the integral of F
over dR is zero,
(b) Write an intuitive argument, based on the interpretation of the divergence,
for the assertion in part (a).
jr y* z*
V — + —2 = 1,
a2 b2 c
and let D(x, y, z) be the distance from the origin to the tangent plane to S
at (x, y, z).
-1
(a) Show that if F(x,y, z) = {x\a 2 ,y\b 2 zjc 2 ), then
, F> n = £> , where n is
D' 1 da =
4tt
— (be
( i
—ca—-
ab\
1.
Js 3 \a b c)
curl F(x) = for all x in R, and incompressible if div F(x) = for all x in
R. Assume F continuously differentiable.
SECTION 6
To facilitate the application of the Gauss and Stokes theorems, it is THE OPERATORS V,
helpful to extend the use of the symbol V, called "del," that is used in Vx, AND V>
denoting the gradient field of a real-valued function. In terms of the
natural basis e x , e2 , e 3 for ill
3
, we recall that
V/ (1)
dx t dxo ox.
J_ d_ _d_
(2)
ox 1 dx 2 ax z
VxF =
558 Vector Field Theory Chap. 7
6.2 f
I V F dV =
V-FdV--
• I
f
F-nda
I
JR JdR
rx(/F)=/Vxf+V/xf (7)
V • («F + bG) = a V F + 6 V G • •
(8)
V.(/F)=/V.F-V/.F (9)
V (F
• x G) = (V x F) G - F- (V x • G). (10)
V-(VxF) = 0, (11)
V x (V/) = 0, (12)
V.V/=V% (13)
where V /is
2
the Laplacian of/ defined by
J a 2 a 8 a 2 "
d.Vj ox 2 ox 3
(Equations (11), (12), and (13) are the same as those in Problems 2 and 3
of the previous section, where the alternative symbol A was used for the
Laplace operator.)
The formulas given above can be used to derive many special cases of
the Gauss and Stokes theorems. A particularly important kind arises if the
vector field Fis assumed to be a gradient V/ or a multiple/ V g. If we set
F = V/in Formula 6.2, the result is
I
V-VfdV = \ Vf-ndo. (14)
JR JdR
But by Equation (13), V V/= V 2/ and by Equation
•
2.1 of Chapter 4,
Sec. 6 The Operators V, V x and V-
, 559
d
\T-fdV = [ on
-[da. (15)
Jr Js
This is called Green's first identity. Because of the symmetry in the middle
term, interchange of/ and g and subtraction of the corresponding terms
gives Green's second identity.
6.4 f
( /V 2
g - gV 2/) dV = f (/ & - gA
d
da.
Jr Js\ on on!
V 2
u = h,
( uV udV + 2
( \Vu\
2
dV = ( u — do.
jr Jr Js dn
But the and last terms are zero because V u =
first in R and u = on 2
(f,g) = \ fgdV.
It is easy to check that the integral is indeed an inner product. With respect
to the inner product, it is often important to know when the operator D is
(Df,g) = (f,Dg),
In Equation (16), a x and a 2 are constants, not both zero, and djdn denotes
differentiation with respect to the outward unit normal on 5".
Js \ dn dn/
on
Mg+fa,f = 0,
on
zero, and the boundary condition implies that both/and g are identically
zero on S. But then Equation (17) is still satisfied. Thus we have shown
that V 2 is symmetric on :F Notice that V 2 /is not defined for all/in .!F
.
,
I
Vf.kdV = \
fk-nda.
JR JdR
Since k is constant,
V X FdV = \
n x F da. (19)
R JdR
The proof is left as an exercise.
We conclude the section with a description of the expressions for
gradient, divergence, and curl with respect to an orthogonal curvilinear
coordinate system. Thus we assume that all coordinate curves intersect
at right angles, which means that the natural tangent vectors c 1; c 2 c 3 ,
—
1
Ci, —c —c
1
2,
1
3,
hi h2 h3
562 Vector Field Theory Chap. 7
where h t
= |cf |. We get coordinate functions Fu F2 F3 , satisfying
Rather lengthy computation using the chain rule gives the formulas:
6.5 (V/) o r= - — Cl + - — c + - — c 2 3
«i dUi «2 om 2 h 3 ou 3
6.7 ±1 '/ dh
3 F3 dh t F*
(V x F) ° T
/ll/l 2 ^! .\ du 2 du
u3 1
jdh1 Fx dh 3 F3 \
BhoFi
+
du 3
~ d Ul r
2 F2
•
dKF,
/dh dhjA
+
\ du l du 2 /
In the last formula, the plus sign is chosen if the c's form a right-
handed system, and the minus sign is chosen otherwise. Of course the c's
and the A's may vary from point The formulas are no more
to point.
complicated than they are because we have assumed the c's perpendicular.
As a result, the product h x h 2 h 3 can be interpreted as the volume of the
rectangular box spanned by c l5 c 2 and c 3 In the case of rectangular
, .
3
Example 3. Introducing spherical coordinates in 'Ji , we have from
Chapter 4, Section 5, Example 4, that
1 df
V/(x, y,z) = ^ (r, cp, 0) Cl + \ / (r, cp, 6)c 2 + ,
(r, (p, 0)c 8 .
or r ocp r sin cp 00
Sec. 6 The Operators V,Vx,WV- 563
and
2
dr F (r, cp, 8) d sin cpF2 (r, cp, 6)
V • F(x, y, z) = sin cp
x
+ r
Br dq>
2
V /(x, y, z) = sin cp
—
r sin dr\ dr J
+ sin cp — I sin
df(r,cp,d)\
+,
d f(r, cp,
2
6)
dcp\ dcp dd
EXERCISES
1. Verify the identities (4) through (10) of the text.
F 1
— + F —- + F —
d
d Xl
2
d
dx 2
3
d
dx3
.
V x (k x F) = k(V F) - • (k • V)F,
(d) Use part (c) to show that if F and G are differentiate, then
V x
k x x k kx
4. Prove that
(a)
3
x^O.
\\x\) Ixl
'
(b) v2 =0 x ^°-
(r) '
564 Vector Field Theory Chap. 7
then the flux of the temperature gradient across any smooth closed surface
in R is zero. Use this fact and Equation (15) to show that a steady-state
temperature function that is twice continuously differentiable is harmonic,
i.e., V2 7 = 0. [Hint. Suppose V 2 7"(x ) > 0. Show that V 2 T(x) > in
some ball centered at x .]
7. Show that if V Fis • identically zero in a ball B in 3l 3 , then the vector field G
defined by
(a) If G(x) = JJ
[F(tx) x (tx)] dt, then show that
8. A vector field —
ft 3 defined in a region R is called solenoidal if, in some
Jl 3
= _1
9. Consider the Newtonian potential function 7V(x) and its associated
|x|
gradient field V/V(x). (See Exercise 4.) Show N(x) can be interpreted
that
as the work done in moving a particle from co to x along some smooth
path through the field V./V.
I)r |x - y|
11. (a) Use Equations 6.5 and 6.6 to show that the Laplacian in cylindrical
coordinates has the form
dO 2 dr
1 3/(r)
^ 2f(x,y,z)
dr 2 r dr
,
3"/(r) ,
2 3/(r)
SECTION 7
Having defined the line integral in Chapter 3, we observed that it could be DIFFERENTIAL FORMS
abbreviated
F 1 dx x + F 2 dx 2 + F3 clx 3
j;
in the 3-dimensional case. Our purpose here is to show that the integrand
Fx dx x + +
F2 dx 2 F3 dx 3 has an interpretation which leads to another way
of looking at the line integral. From there we can go naturally to a
definition of surface integral.
We shall denote by dxk the function that assigns to a vector a in :R n its
Figure 31
n
where wx acts on vectors a in 'J\ by
a can be written, if a = (a l9 a 2 a3 ),
,
df
dj(*) = ^~ (x)^! + ^~ (x)a + 2
O.Xi dx 2 <7X 3
= ^
dxj
(x) dxfr) -r ^
dx 2
(x) rfx 2 (a) + ^
<7X 3
(x) rf.v 3 (a).
specially related, as they are by the requirement that V/*= Ffor some F
= F(g(0)-g'(0-
If we write
we see that the right-hand integral is the line integral of the field Fover y.
a < < b, wq can do either of two things. We can use the coefficient
t
Jy Ja
K
= lim 2,a> gllk) (gXt k ))(h.- r _i).
i m(P)->0it=l
fc
It is clear that the two formulas give the same definition of j y a> x , the
integral of t he \-form co x over y.
Next we define a product of 1 -forms which is different from ordinary
pointwise multiplication of functions. We first define the product of the
3
basic 1-forms dx x dx 2 dx3
, , in 'Ji . The product dx x A dx 2 is defined so
3
that it is a function on ordered pairs of vectors in 'Ji . Geometrically,
dx x A dx 2 (a, spanned by the pro-
b) will be the area of the parallelogram
jections of a and b into the x^-plane. The sign of the area is determined
so that if the projections of a and b have the same orientation as the positive
x x and x 2 axes, then the area is positive; it is negative when these orient-
ations are opposite. Such a projection is shown in Fig. 32. Thus, if a =
(a u a 2 a 3 ) and b = (b u b 2 b 3 ), then
, ,
\a 2 b2J
and the determinant automatically gives the area the correct sign. We can
use the basic 1-forms dx x and dx 2 to write the last equation as
Figure 32
dx t
(a) dx t (b)
dx { A dXj (a, b) = det (3)
K
dXj (») dx,(b)
with a similar geometric interpretation for each one. For example, we have
fa 2 b2
dx 2 A dx 3 (a, b) = det
a, b.
7.2 dx t A dx t = 0.
7.3 dx A dxj
t
(b, a) = — dx t
A dx } (a, b).
Sec. 7 Differential Forms 569
If we now ask for the most general linear combination of the functions
dx A (
dXj, it is clear from 7.1 and 7.2 that it can be written in the form
cx dx 2 A dx3 + c2 dx3 A dx x + c3 dx x A dx 2 .
Furthermore, if F=
(F1 F2 F3 ) is a vector , , field in a region D of JI 3 we ,
rx = F {\) i/x
x 2 A dx 3 + F (x) ^x
2 3 A dx x +F 3 (x) Jx x A dx 2
3
of ordered pairs (a, b) of vectors in JI . The function t x is called a differen-
tial 2-form or 2-form.
(2 1\ /3 1\ /l
det , det , det
\3 1/ \1 0/ \2 1
is perpendicular to (1, 2, 3) and to (0, 1,1), and that its length is equal to
the area of the parallelogram P spanned by a and b. Thus T(a, b) =
(2, 1, 5) • (a x b) and is thus the coordinate of (2, 1,5) in the direction
perpendicular to P, multiplied by the area of P. If we interpret (2, 1, 5)
as the constant velocity vector of a fluid flow in space, r(a, b) will be the
total flow across P in one unit of time. Such a flow is shown in Fig. 33.
For a flow F of constant speed and direction across a flat surface S, the
flux is defined to be the normal coordinate F of F, times the area of S,
and so is equal to (F • n)ff(S).
We can summarize what has just been done by pointing out that we
have defined a multiplication, called the exterior product of basic 1-forms
dx x dx 2
, , etc. The resulting products written dx ( A dx} are basic 2-forms
in that every 2-form is a linear combination of them. In Jl 2 there is only
one basic 2-form, dx x A dx 2 while in 'Ji 3 there are three of them. (How
,
Figure 33
were polynomials in the variables dx x dx 2 etc., and then using the rules
, ,
7.1 and 7.2 to simplify. In practice, the wedges are often omitted from the
notation.
(x dx +y 2
dy) A (dx + x dy)
= x dx A dx + y dy A 2
dx + x 2 dx A dy + xy 2 dy A dy
= - y dx A dy + x 2 2
dx A dy +
= (x — y dx A dy.
2 2
)
— x dx A dx -r x dy A dx + x dz A c/x
(ax bx cx
a2 b2 c2
a3 b3 c3
. .
< u) (^(u),...,^(u)).
\du 1 dUj, )
(4)
JS JR XOll! OU v I
Example 5. If
,,
"h
(
I
<!
XOM!
—
dg
" ,
'
—
duj
dg
=F
(7U 2 /
\
I
JT
x
o g
8 (g
—
d^,
2, g 3)
M 2)
+F
, r,
2
o g
9(g 3 g
d(i<j,
,
U 2)
t )
|F 3
o» 9(gl,
<?( M 1>
g2 )
u 2)
572 Vector Field Theory Chap. 7
dx 2 A dx 3 +F 2 dx 3 A dx x + jF 3 dx 1 A dx 2
Js Js
'
F\ ° g r:
9(g2, g3 )
+ ,
r
-F 2 ° g
—
9(g3, gl)
: + ,
F3
„
°
~ Hgl^gz)'
du x du 2
-I
.
:
d(u u u 2 ) 9("i. "2) d( M i> "2).
EXERCISES
1. Find the value of each of the following differential forms acting on the in-
(c) fc(f) = (r
2
,r ,f )
2 2
for -1 <t < 1.
7. Prove that if co and d> are 1 -forms and if y1 and y 2 are curves over which
co and w are integrable, then
CD = I co + I co.
dx f=f(b) -/(a).
i
9. (a) Prove that if co x is a 1-form in a region Z> of 31", then for each fixed
x in D, oj x is a real-valued linear function on all of R n .
(b) Prove the converse to part (a), namely, that if o x is a real-valued linear
n
function defined on all of 31", then a>
x (a) = yc k (x ) dx k (a), for all
3
10. (a) Let r be a real-valued function of pairs (a, b) of vectors in 3l such that
r(a, b) = — r(b, a) and such that t is linear in a and in b. Show that
there is a vector c r in 3l 3 such that r(a, b) = det (a, b, c r ) for all a, b
in 3l 3 . [Hint. Show first that the result holds if a and b are in a basis
for 3l 3 .]
(b) Use part (a) to show that if r(a, b) = — r(b, a), and r is bilinear, then
t is a 2-form in 3l 3 .
11. For an ordered p-tuple (a 1; a 2 , . . . , a,,) of vectors in 31" where/? > 1, define
i=l j>
This equation defines the basic p-forms in 31", of which the generals-forms
are linear combinations.
(a) Compute dx 2 a dx3 a dx^ + 2 ciq a dx 2 a Jx4 (a, b, c), where a = (1,
-1,0, 2), b = (-1, 1, 1, 1), andc = (0, 1,2,0).
(b) Prove that the interchange of adjacent factors in a basic p-iorm changes
the sign of the form.
(c) Prove that a basic p-fovm with a repeated factor is zero.
(d) Prove that the general p-form can be written
«i<...<«p
°> v = dXit A A dx H
J, fit *, • • •
and
wP = 2 Sh u dx Jt A -- Adxs9 >
h<—<i*
define their exterior product co p a oj q by
13. Show that the definition of the integral of a /?-form agrees with that given
for a 1-form when/? = 1.
15. (a) If a»
3
is the 3-form fdx x a dx 2 a */jr
3 in 4l 3 , and ft 3 — > ft 3 is differ-
SECTION 8
THE EXTERIOR The fundamental theorem of calculus states that if (djdx)f is integrable
DERIVATIVE
on [a, b], then
l
d
dx = f{b) - f{a). (1)
Ia dX
The Stokes and Gauss formulas,
I
curl F dS = I
F • t ds (2)
Js JdS
\
div FdV = \
F-nda, (3)
JR JdR
are similar in that they express the integral of a kind of derivative of a
function in terms of the function itself on a set of lower dimension. Using
Sec. 8 The Exterior Derivative 575
dx l dx n
Thus the exterior derivative of /is the particular 1-form that at each point
of the domain of/is equal to what we have earlier called the differential of
/ To continue, if
co
1
=/j dxx + • • •
+/„ dxn
is a 1-form with differentiable coefficients, then in terms of the 1-forms
dfx , . . . , dfn we define
,
If o)\ x iX )
= xxx 2 dx x + (x\ + x\) dx 2 then dw
,
1
is given by
= (x dx + 2 t x x dx 2 ) A dx x + (2x x dx x + 2x 2 dx 2 ) A dx 2
=x t dx x A c/x 2 .
If co {x , ViZ)
= xz dx A dy + j>
2
z c?x A dz, then dco 2 is given by
2
d(xz dx A dy + )> z dx A dz)
= (z <ix + x dz) A dx A dy + (2yz dy + 2
y dz) A dx A dz
= (x — 2_yz) dx A dy A dz.
Using the exterior derivative we can state the general Stokes formula in
the form
8.1
I
dco* = I
0)
P
,
JB JdB
co
1
—F x dx 1 +F 2 dx 2 ,
and then
dco
. (dF
(<3b\
= —— dx + I
Vox!
x ,
1
—
6t\
3^!
dx 2
dx 2
\
\
/
)
A dx x +
tCF 2
/dF,
I
V^X!
— dx x -j
dF 2
dx 2
dx 2
\
1
/
A ^x 2
(4)
(dF 2 dFA
= T--^l
,
dx t Adx 2 .
\ox! dx 2
1
Substitution of dco 1 and co into Equation 8.1 gives
~ 7r) ^ Xl A dx i = Fl rfXl
+ F2 rfx2 -
<5>
f (t~*
Jb\OX OX 2 l f
1 JdB
I— — —- 2
} dx 1 dx 2 = <h Fx dxy +F 2 dx 2 ,
Jd\ox 1 ox 2 I Jy
co
1
—F x dx x +F 2 dx 2 +F 3 dx 3 ,
dco
1
= I
,dx
• — ^ I dx 2 A dx
dx 3 A dx x + ,
M dx x A rfx 2
oxj \ox 3 \ox x ox 2 !
Thus the 2-form dco 1 has as coefficient functions the coordinates of the
vector field curl F where F = (Fx , F2 F3 , ). It is immediate that the general
Stokes formula becomes precisely the Stokes formula of Section 4 if we
make B and dB stand for a piece of smooth surface 5 and its positively
oriented border dS.
The Gauss formula of Section 5 comes from considering a 2-form
co
2
=F x dx 2 A dx 3 +F 2 dx 3 A dx x +F 3 dx x A dx 2 .
Sec. 8 The Exterior Derivative 577
dco* =(Vaxj
d
-fl + ^+M
ox 2
d
vx 3 /
dXl A dx, A dx3 . (7)
field F = (F1? F2 F3 ),
This 3-form has as coefficient the divergence of the ,
and substitution into the general Stokes formula gives the Gauss, or
divergence, formula of Section 5, except that, as with Green's formula, the
volume integral of Gauss's formula is not identical with the integral of a
3-form. See Exercise 10.
"
a differential form with coefficient functions Fu F2 F3 has been described
,
if (»
2 <—> F, then dco 2 <-» div F,
if a)
1 <-> F, then dco 1 <— > curl F
Finally, if oj° <->/, then rfco <-> grad/, where/is real-valued.
EXERCISES
1. Compute dco, where to is
578 Vector Field Theory Chap. 7
2
(b) Show similarly that if oj is a 2-form, there may not exist a 1-form co
1
(c) Interpret parts (a) and (b) in terms of gradient and curl.
8. (a) Let F = (Fl5 . . . , F„) be a vector field in 31", and consider the 1-form
cu
1
= F x i/jfj + . . . + F„ ^ n . Show that the condition Jw 1 = is
10. In the general Stokes formula, Equation 8.1, let co p be a 2-form with con-
tinuously differentiable coefficients in the ball |x| < 1 of R 3 Show
. that it is
SECTION 1
The theorems and techniques of calculus depend on both algebraic and INTRODUCTION
n
topological properties of 'J\ and of functions from Si n to Si" The alge- 1
.
1.1 Theorem
Let 3i n —
> Si be continuous on a closed, bounded subset K of Si n .
1.2 Theorem
579
580 Appendix
1.3 Theorem
Let 'J\
n — > .'Jl be continuous on a closed, bounded subset Koi 3i n .
Proofs of these three theorems, and also of Theorem 1.4, can be found in
I** - x ; |
< e
1.4 Theorem
If Xj, x 2 x 3 ...
, , is a Cauchy sequence of vectors in 31", then the
n
sequence has a limit x in 3i , so that
lim x k — x.
fc->CO
1.6 Homogeneity ax = a x .
if x is a limit point of a set 5" with respect to one norm, then it is a limit
point of S with respect to every norm, and the same goes for the other
definitions referred to above. It follows, in particular, that these
basic limit concepts are not dependent upon a Euclidean inner
product. An example of a norm on !il" different from the Euclidean
norm is the so-called box norm, or maximum norm, defined, for
any x = (x1} . .. , x n ), by
It is easy to check that this is a true equivalence relation, that is, it satisfies
equivalent to || |d.
More important is the fact that equivalent norms result in the same
definitions of limit point, limit, interior point, and differentiability.
Continuity, open set, boundary, and closed set, which are defined in terms
of the preceding concepts, are therefore also independent of a choice
between equivalent norms.
To verify the above contention, let || || l5 and || || 2 be equivalent
norms, and suppose that x is a limit point of 5 with respect to || Id.
X
582 Appendix
Then, for any ex > 0, there exists a point x in S such that < ||x — Xoflx <
ex . Thus, if e 2 > is given arbitrarily, we may set e x = € 2 jK and obtain,
by Inequality (1),
x->x
Then, as we have just proved, x is a limit point of the domain of/ with
respect to both norms. For any e x > 0, there exists d 1 > such that if x is
in the domain off and < ||x — XqIIj < d lf then ||/(x) — y ||i < «i- Let
e2 > be given arbitrarily, set € = ejK, and then choose x <5
2
= d x k. If
< ||x — x < ||
follows by Inequality (1) that
2 (5
2 , it
< II x
k k
Hence,
11/00 - yolU <K
||/(x) -y ||i < K€l = 6„
and we conclude that lim/(x) =y with respect to || || 2 . The
arguments for the definitions of interior point and differentiability
are similar, and we omit the details.
With respect to a given norm on a vector space 17, the e-ball
with center x is the set of all x in 17 such that ||x — x < e. ||
1.8 Theorem
Proof. Let || ||
be an arbitrary norm on 17. Choose a basis {x 1 ,
|x| = y/xt + . . . + xl
for any x = x1 x 1 + . . . + x nx n . We shall show that is equiva-
|| ||
lent to | |, that is, there exist positive real numbers k and K such
Sec. 2 The Chain Rule 583
that k |x| < ||x|| < K |x|, for all x in °0. By the transitivity property
of the equivalence relation between norms, it then follows that any
two norms on T) are equivalent.
For any x = XjXj + + x n \ n we have . . . ,
< (ijx,\i)Ji/, = K M ,
where K= ]T II*; II
> 0. We now prove that k exists. We contend
<l|x < K |x — xn |
< e.
<
||x|| >k\\\, for any x in XS.
SECTION 2
In Chapter 4 we have presented two different versions of the chain rule, THE CHAIN RULE
one in the section on the gradient and another in Section 3 devoted entirely
to the chain rule. Both of these theorems contain assumptions that certain
functions are continuously differentiable. Furthermore, in Theorem 3.1
we assumed that the composition g of was defined on an open set. Neither
of these assumptions need to be made in order for the chain rule formula
to hold. Of course, to conclude that g°f is continuously differentiable,
we need to assume that both/and g have continuous derivative matrices.
The virtue of the next theorem is that it contains a minimum of assump-
tions, and the proof is, in style, very much what would be used to prove
the theorem for real functions of a real variable.
If %n -^ %m is x and %m -
differentiable at %p is differentiable
at/(x ), then g °fis differentiable at x and
(g°fy(x ) = g'(f(xo))f'(Xo).
x y
584 Appendix
But 6' has been chosen just so that, if the last inequality holds, then
/(x) is in the domain of g. Thus any point x in Jl n that satisfies
|x — x < 6 lies in the domain of the composite function g of, and
|
matrix g'(y )/'( x o) satisfies the criterion for being the differential of
g of at x That is, we must prove that if
.
(1)
then
lim Z(x — x ) = 0.
x->x
and
lim Zj(x — x ) = lim Z,(y — y ) = 0.
x— y—
that
(2)
Sec. 3 Arc Length Formula 585
< k |x — x | + |Zi(x — x )| |x — x |,
|x - x |
{|^'(y )[Z 1 (x - x„)]| + [k + |Zt (x - x„)|] |Z2 (/(x) -/(Xo))|}
SECTION 3
To appreciate fully the significance of the connection between arc length ARC LENGTH
and the formula j"^ \f'{t)\ dt that is used to compute it, it is necessary to FORMULA
know that the formula doesn't always work. The reason is that there is a
continuous curve y in til 2 which has length 2, but such that if the integral
formula is applied to it the result is 1. The construction of such an
example is fairly complicated, and showing that the relevant integral has
value 1 is itself nontrivial. For the curve y we can take the graph of the
so-called Cantor function. (See R. P. Boas, A Primer of Real Functions,
John Wiley & Sons, 1960, p. 131, or B. R. Gelbaum and J. M. H. Olm-
stead, Counter-examples in Analysis, Holden-Day, Inc., 1964, p. 97.)
Once the Cantor function is understood, it is fairly easy to show that its
graph has length 2, as defined by the least upper bound of the lengths of
inscribed polygons.
For a piecewise smooth curve y, given by a piecewise continuously
differentiable function 'A — > 'JV for a < t < b, we have the following
1
Theorem
Proof. We show first that l(y) < J** |g'(OI dt, noting that since \g'\ is
lg('*) - g(>A-l)l
Jtk-i
f i g '(oi^-^<iig(?,)-g(^-i)i. (2)
Ja A-=l
This will show that no number smaller than the integral is an upper
bound for the sums on the right. We take as an initial partition P
all the finitely many endpoints of closed intervals on which g' is
u < t
k . Since there are only finitely many intervals we can
[t k _i, t k ],
choose a single positive b that will work for all of them. Now make
a partition P fine enough that max {t k — tk_i) < b, still including
in P all the points of P . On each interval of the new partition we
have \g'(t)\ < \g'(t k )\ + €, by the uniform continuity of g'. Thus,
< r
jtk-i
gv)dt + r (gv )-g'(t))dt
jtk-i
k
where in the last step we have again used the results of Exercises 16
and 17 of Section 2, Chapter 3. Again using the uniform continuity
of g', together with the previous inequality, we get
[g'C*)l (h - t
k_x) < |g(f„) - g(t k _ x )\ + e (tk - y.
Applying this inequality to Equation (3) gives
If e is chosen so that 2e(b — a) < rj, then the desired Inequality (2)
will be satisfied for the partition P constructed above.
SECTION 4
Theorem 5.2 of Chapter 5 asserts that the Fourier series of a piecewise CONVERGENCE OF
smooth function/converges pointwise to the average of the right and left FOURIER SERIES
limits of / at each point. The assumption that / is piecewise smooth
means that the interval [—77, tt] can be broken into finitely many sub-
intervals, on each of which f and f can be extended to be continuous.
At the endpoints of an interval [xk xk+1 ] we require/to be continuous if
,
interval if it is given the values of the right and left derivatives, respectively:
+
(x k ) urn
f(x t + u)-f(xk +)
f --
u-»0+
4.1 Theorem
Let /be piecewise smooth on [—it, n}. Then the Fourier series of/
converges at each point x of the interval to (l)[f(x ) + f(x +)].
—
In particular, if/ is continuous at x, then the series converges to
fix).
*ivW = — fit) dt
+ -I
77" fc=l
cos kx f(t) cos kt dt + sin kx f(t) sin /cf df
N
fit) \ + ][cos/c(f — x) dt.
77 J-n
We now extend /outside the interval [— ir, tt] so that it has period
2t7, and make the change of variable t = x + u. Then the new
interval of integration is [—77 — x, 77 — x]. But since the integrand
has period 2tt, the value of the integral remains unchanged if we
shift back to the interval [—77, 77]. Thus we have
s
,
y (x)
,
= -1 f ' ,,
\
f(x + ,
m) — (N +
n sin \)u
- ,
du.
77 J-!T 2 sin (£)u
limst<x)=G)f(x+), (2)
2V->00
Sec. 5 Proof of the Inverse and Implicit Function Theorems 589
where
+/ n
*n(x) =- 1 [\<
f(x + ,
u)
x — (N +
sin \)u
-du.
.
(3)
77 J0 2 sin (\)u
^v(x) = (1/tt) $°_ n f(x + w) (sin (TV + \)uj2 sin (J)w) </«, and addi-
tion of the two equations will finish the proof.
To prove Equation (2), we observe from Equation (1) that
77 Jo 2 sin (|)u
The proof will becomplete if weshow that this last integral tends to
zero as ./V tends to infinity. But g(u)
[f(x u) —f(x +)]/sin (|)« = +
is piecewise continuous, so the result is a consequence of
lim
rg(u) sin ku du = lim
f
b
g(u) cos ku du = 0.
fe-» oo Ja k-*ccja
g\u)du>f(al+bl).
\
Ja k l
SECTION 5
Hence, the series on the right converges. We conclude that a k and b k
tend to zero as k tends to infinity.
INVERSE AND
IMPLICIT FUNCTION
THEOREMS
We start with a theorem that not only guarantees the existence of an
inverse function, but also proves the convergence of the modified Newton
iteration given in Equation 9.2 of Chapter 3.
5.1 Theorem
/-i/'(xo)]-y"(x)
iir(xo)]-y(x )i<(i-*>-, a)
then the equation
x*+i = x* - [/'(x )]- 1/(x fc )
/,(x')-/,(x) = i|^'(y,)(x;.-x,),
segment joining x and x'. Writing F(x, x') for the matrix
if)
we get
/(x')-/(x) = F(x,x')(x'-x);
hence
n I n \2
\Ay\
In I n n \
V i= l \,=1 3=1 J
- < - =K -
|g(x') g(x)|
fn |x' x| |x' x|. (3)
X fc+ l = g( x k)-
Then
l
x fc+l - x*l = Ig(Xfc) - g( x*-i)l
^A-lx^-x^l <**| x i- x ol- (4)
l
XJfc+l — X m\ ^ Xfc+l — X +
l A:I l
X —X +
fc +
fc-ll • • •
|
x m +l — X m\
< (K k + K"- + 1 m
... + K ) Xl - x | |
rJc—m+l\
1 — K.
<K m r
For m= we get
L
fc+1
— xn < r:
so, forA: =
1,2, 3, ... x lies in r (x ) and, hence, in the subset
, fc
N
of the domain of/ on which the hypotheses of the theorem are
satisfied. Thus each xk is defined and satisfies the Inequality (4).
Returning to (4) we see that
\x k+1 - xj < Km r
implies that x , x x x 2 ...
, , is a Cauchy sequence which necessarily
converges to some vector x satisfying |x — x |
< r. Since
x fc+i = g(*k)
and g is continuous, we have
that is,
Thus
/( x ) = 0.
592 Appendix
|(x' - x) - [/'(x )]
-1
(/(x') ~/(x))| <K\x' - x|.
|x' - x| - |[/'(x )]
-1
(/(x') -/(x))| <K\x'- x|.
Hence
(1 _ K) X |
'
_ X |
< \[f'(x ))^(f(x') -/(x))|. (5)
Let 'J\
n — > 'Ji
n
be a continuously differentiable function such that
/'(x ) has an inverse. Then there is an open set N containing x
such that /(TV) is open and such that/, when restricted to N, has a
-1
continuously differentiable inverse/ . In addition,
if-'Yiyo) = Lf (x )]-\
where y =/(x ).
off at y =/(x).
1
condition for being the derivative
For x' and x in TV, let y' =/(x') and y =/(x). Since / is
differentiable at x,
_1
Applying [/'(x)] to both sides gives
part 2.
_1
Proof, Part 3. / is continuously differentiable. Part 2 shows that
_1
/ is differentiable on f{N) and that
[f-'Yiy) = [/'(x)]- 1 .
5.3 Lemma
There is a neighborhood iV of x and a number M> such that
M\x'-x\ <\f(x')-f(x)\
for all x and x in N.
Let :Jl"
+m — > 'Ji
m be a continuously differentiable function. Suppose
1. JF(x o ,y ) = 0.
2. Fy (x Q , y ) has an inverse.
y
594 Appendix
"i(x, y) = Xi
H (x, y) ^
z *2
Gi(x, y) =
x and G 2 {x, y) = y. Since //(x, y) = (x, F(x, y)), the
function H is the identity on x. The same must therefore be true of
H-1 Hence,
.
Ui = &i ° ti .
We define/ by
/(x) = G H~ 2
x
(x, 0), for every x in N.
Then
//-i(x, 0) = (GiH-^x, 0), G.2 H~Hx, 0)) = (x,/(x)).
The proof makes use of the implicit function theorem, and of the fact PROOF OF
that, for a linear function L defined
dimension of the domain is in 31", the
equal to the dimension of the range plus the dimension of the null
THEOREM
space. We begin by restating the theorem.
Lagrange's Theorem
,xn) =
+ lnfi n
dG x
dXl dx. dx„
(1)
dG m dG m dG,
^dx x
dx 2 dx n
are independent. Then the matrix has an inverse. Write x =
(a u . . . , a n ), and set u = (a u . . . , a m ) and v = (a m ^,«„) . . . ,
3l
n_m —
m defined on a neighborhood vVof v such that /2(v = u
> 3l )
and G(h(y), v) = for all v in N. The function :R"- m -^> 31" defined
by
H(v) = (//(v), v) for all v in N,
is a parametric representation of a part of S containing x = H(v ).
The surface S has a tangent 73 of dimension n — m at x . The reason
is that, first of all, the derivative of H at v is the (n — m)-by-
(n — in) matrix
dh x dh x dh x
dx, dx, dx~,
dh m
Sec. 6 Proof of Lagrange's Theorem 597
Looking at (2) and (3) together, we see that dx f and d G are both
zero on the range of dr H, which set is the tangent "G. Thus the
matrix
~Jf_
dx,
dGj_ d_Gj_
dx, dx„
dG n
LBx, dx.
X m ), and so
dx x dx n
bg 1 ac,
dx 1 dx n
(Ao, K • , AJ = 0. (4)
5G^ 5G,
_3xi 9x,
It cannot happen that A = 0, for then the rows of (1) are dependent,
contradicting the fact that (1) has an inverse. Taking A = 1 (if
—
C7X ; -
(x ) +K —
OXj
(x ) + • • • + Am —2 (x
OX,
) = 0,
for _/
= 1, . . . , m. In other words, (/+ k1 Gl + . . . + A m (7TO )'(x )
= 0. This completes the proof.
SECTION 7
PROOF OF TAYLOR'S The method of proof consists of reducing the problem to the one-variable
THEOREM case an(j tnen making an estimate of the size of the integral formula for the
remainder.
Then
(/(x) - Tv (x - x
lim-
))
= 0, (1)
did. ,
d\k
dt \ OX x OX n /x +ty
- ***[(*£ + •
+ y "i5 (y)
>'„ —
OX n /x +ty
/
<^ /(y)- y
F (fc)
(0) = </(y).
1
(1 - iV
[f
(A
'(0 - F '(0)] rfr.
(TV- 1)! Jo
In terms of/, this is
— 1):
±— f \l - r)
v- 1
[< +ty /(y) - </(y)] Jr.
(TV-1) Jo
< max
0<*<1
yi— + ••• + y„— /
J
"
\ d*i dxJx
= max
0<<<1 k 1 +...+k n =N\ K l K n/
600 Appendix
Then since \y t \
<: |y|, we have
x n
\y\ ...yl
<i,
\
lyl'
and so
|/(x) -T iV (y)|
< (3)
|y|"
N d
N d
N
v / \ f f
2, I , ,
k nJ
I
'
max (x„ + ty) (x )
-...+k„=N V^l • • 0<<<1 dx^.-.dxl" dx\* . . . dx knn
A
|y|
\tyX
= ,.
lim
Pk Qy ) + R(ty
-
)
k
\ty \
I yd* '-oi^iyol*
Sec. 8 Existence of the Riemann Integral 601
However, because all the terms of R have degree greater than k, the last
is zero. But then P (y ) = 0, which is a contradiction.
limit k
SECTION 8
Let/ be defined and bounded on a bounded set B in 31", and let the
boundary of B be contained in finitely many smooth sets. If/ is
continuous on B, except perhaps on finitely many smooth sets, then
/is integrable over B. The value of §B fdV is unchanged by chang-
ing the values off on any smooth set.
We recall that a smooth set in 31" is the image of a closed bounded set
8.1 Theorem
Let 31
m -^> 31" be continuously differentiable. Then for every closed,
bounded subset A' in the domain ofg, there is a constant M such that
\g(y) ~ g(*)\ < M |x - y|
continuous, thatZ(x —
y) is continuous except perhaps when x y. =
If both x and y tend to some point z in the domain of g, then (x, y)
tends to (z, z), and we want to show that Z(x y) tends to zero. —
We apply the mean-value theorem, Theorem 8.2 of Chapter 2, to the
602 Appendix
max 2 d&
l<fc<n ;=1 \OXj
w - & WW-, OXj 1
i
Since the partial derivatives are assumed continuous, and since each
x k tends to z as x and y do, it follows that Z(y — x) tends to zero as
x and y tend to z. This shows that Z(x — y) is continuous on^x K.
Since |Z(x — y)| is continuous, it attains its maximum value
M'; so |Z(y — x)| < M' for all x and y in K. Hence
But
8.2 Theorem
and subdivide the cube into smaller cubes of side length s/N, where N
is an integer bigger than 1. There are N
m of these little cubes. On
each of the little cubes that contain any points of K we have by
Theorem 8.1
-
containing S is at most N m (MsjN) n = {Ms) n jN n m Since n > m, .
As a corollary, we get the fact that a smooth set has zero content.
Now we can prove the existence theorem for integrals stated at the
beginning of the section. Suppose that / and B are as described in
the hypotheses. We
must produce a number which we shall prove is the
Riemann integral of/ over B. Let/B be the function /extended to be zero
outside B. For an arbitrary grid G covering B, let R k be the kth bounded
rectangle of G, and let/, be the infimum of/B on R k Define .
Similarly, define
S(G)=Zfk V(R k ),
k=l
if the Riemann sum is an arbitrary one formed from the grid G. Further-
more, if G' is a grid consisting of a subdivision of the rectangles of a grid
G, we have
S(G) < S(G') < S(G') < S(G).
In particular, if G and G" are two grids, and G' contains all the rectangles
of both of them, then
We define
or
-5(G) <-IBf<-S(G).
This inequality added to (2) gives
.v
in which the Riemann sum has been formed from the grid G.
Now all we have to do is show that S(G) — S(G) can be made arbitrarily
small if the mesh of G is made small enough. Then according to the
definition of the integral, we will have shown that the integral of/over B
exists and is IBf. Let e be a positive number. By Theorem 6.2, we can
cover the boundary of B, the smooth surfaces containing the discontinuity
points of f, and any other smooth surface on which we would like to
disregard the values of/, with finitely many open rectangles R[, R{, . . . ,
of total content less than e. On the part of B not covered by these rec-
tangles, /is continuous; so by Theorem 1.3 there is a <3 > such that/. —
fk < e over any rectangle Rk belonging to a grid with mesh less than b.
By making the mesh still smaller, say, less than b' , we can arrive at a
mesh size such that the rectangles R[, . . . , R',, are always contained in
finitely many rectangles R'i, . . . , R"m of any grid with mesh less than b'
and such that the total content of the latter rectangles is less than 2e.
Suppose that the remaining rectangles of such a grid G are R x Rn , . . . , ,
for any grid of small enough mesh. Since e can be made arbitrarily small,
the proof is complete.
SECTION 9
This section contains a proof of the change-of-variable theorem (Theorem THE CHANGE-OF-
1 n
ft fn,on tJ.r f.
VARIABLE FORMULA
3.1) of Chapter 6.
FOR INTEGRALS
Theorem
Let 51" —
> 51" be a continuously differentiable transformation. Let
R be a set in 31™ having a boundary consisting of finitely many
smooth sets. Suppose that R and its boundary are contained in the
interior of the domain of T and that
1. T is one-to-one on R.
[ fdV = [{foT)\J\dV.
JTIR) JR
If/ should be discontinuous on a smooth set S contained in R, then
the theorem can be applied to R with S deleted. The subsequent inclusion
of S and T(S) in the domains of integration will affect neither integral
since these sets have zero content.
Proofs We first consider the special case in which /is the constant
function 1, and T is linear, then, by Theorem 3.9 of Chapter 2, T
can, except for trivial interchanges of variables, be written as the
product of elementary linear transformations of two types: numer-
ical multiplication of a coordinate,
JR JRk Jh
If we denote by \a\ Ik the set of all numbers of the form \a\ xk where ,
= |
dV.
JM(R)
JR
f |J| dV = [ dV =
JR JRk
[ dVn A dx
Jlk
k
= f dVn_x \ dx k = \ dV.
JRk Jlj+rx J A(R)
In proving the general theorem we shall use the following norm for the
matrix A — (a{j ) of a linear function
l<t<n 3=1
||x|| = max |x ; | ,
l<3'<n
Sec. 9 The Change-of- Variable Formula for Integrals
y in C
K(r(C))<[max||r(y)||]V(C).
I yin C )
|det [T'(x)]-
1
!
V(T(Q) = V^T)' 1
° 7(C))
1
^fraax||[r(x)]- T'(y)||}'
!
nC)
cubes with centers xlf xN and suppose that b is the maximum side-
. . . , ,
Addition gives
n=l ly in Ck I
608 Appendix
As 6 approaches zero, the sum on the right approaches J" c |det T'\ dV
Then the last inequality becomes
Having proved this last inequality, we use it to prove the formula for
more general sets than cubes. We shall assume / > 0. The general case
follows by considering the positive and negative parts of / separately and
adding the resulting formula for each part. Let G be a cubical grid
covering R and having mesh <5. Let C\, . . . , CN G that
be the cubes of
are contained in R. If we let R x be the part of R that is not contained in
any of the cubes Ck then , R = C U x . . . U C Y U Rx Whenever y is .
fc
lA-f dV<2f k \
\J\dV.
k=
;=1 JTlCl:) A-=l JCk
From this it follows that
D = \ fdV-\{foT)\J\dV
JT(R) JR
+1 f
(A-Z-^UIrfK-f (/oT)|J|rfK.
||
r(x) - 7Yy)|| < B\\x - y||, for all x and y in R. (6)
(apply Theorem 1.3 of this Appendix to T(R) together with its boundary),
Sec. 9 The Change-of- Variable Formula for Integrals 609
I
.fix) -fk < e, \
for x in T(Ck ), k=l,...,N.
Then
Again using (6) and, if necessary, decreasing the mesh again, we can get
V(T(R v )) < B n e. Then
M + V(T(R)) + V(R)},
D < e{MB n +
where M a number such that/ < M on T(R) and (f° T)
is \J\ < M on R.
Since e is arbitrary, we must have D < 0, that is,
Jr(«) Jr
If we apply this last inequality to the situation in which T is replaced
by T~ x we get,
_1
\ (fo T) \J\ dV < I
(/o T o T- 1
) |J o T" 1
!
|J |
dF.
(J o T- 1 )^- 1 ) = 1 ; so
T(U) having content at most B"v, where B is the constant of relation (6).
Then, applying the change-of-variable formula to the region R with U
deleted, we get
[ fdV-( (foT)\J\dV
JT<R) JR
Index
611
612 Index
conjugate, 148
exponential, 150 Del, 557
inner product, 156 Dependent vectors, 12, 111
numbers, 147 Derivative, 205, 227, 253
vector space, 147, 153 directional, 272, 273
Component, tangential, normal, 217 Derivative with respect to a vector,
Component of a vector, 40 255
Composition of functions, 31, 121, Determinant, 51, 52
288 expansions, 67
Conditionally convergent integral, Diagonal form, 363
490 Diagonal matrix, 22
Conjugate homogeneity, 157 Differentiable function, 252
Conjugate symmetric, 156 252
Differential,
Connected, polygonally, 276 Ath-order, 385
Conservative field, 522, 524 Differential form, 565, 566, 569
Content, 450, 456 Differential operator, 130
/7-dimensional, 458 Diffusion equation, 413
Continuous differentiability, 259 Dimension, 1 13
Continuous function, 244, 245 Directional derivative, 272, 273
Convergence: Direction cosine, 50
conditional, 490 Discontinuity, removable, 250
Fourier series, 400 Discontinuity point, infinite, 483
pointwise, 422 Divergence of a field, 520, 551
sequence, 264 Divergence theorem, 520
uniform, 423 Domain, space, 1, 2, 199
Convergent family of subsets, 485 Dot product, 37
Convex, 14
Coordinate, 40
Coordinate functions, 180 E
Coordinate map, 182
Coordinate of a vector relative to Echelon form, 107
another vector, 40 Eigenfunction, 433
Index 613
Matrix (cont.) O
elementary, 105, 107
identity, 20 Odd function, 408
inverse, 20 One-to-one function, 30, 127
invertible, 20 Open coordinate rectangle, 450
Jacobian, 255 Open set, 231, 247
orthogonal, 42, 43 Operator, linear, 132, 433
product, 18 differential, 130
symmetric, 363 Orientable surface, 537
transformation, 25 Orientation:
transpose. 43, 44 boundary surface, 551
Maximum: negative, 62
absolute, 350 positive, 62, 537, 551, 552
local,350 surface, 537
norm, 580 Oriented volume, 65
Mean, probabilistic, 492 Origin, 7
Mean-square approximation, 432 Orthogonal functions, 429
Mean-value theorem, 275 Orthogonal matrix, 43
Mesh, 450 Orthogonal vectors, 42
Minimum: Orthonormal sequence, 430
absolute, 350 Orthonormal set, 43, 160
local, 350
Minor, 51
Modification, elementary, 82
Moment of inertia, 466
Multinomial coefficients, 385 Paraboloid, hyperbolic, 364
Multinomial expansion, 385 Parallel, 10, 116
Multiple integral, 449 Parallelepiped, 61
Multiplication, elementary, 81 Parallelogram, law, 8
Multipliers, Lagrange, 355 Parametric definition, 196, 198
Parametrization, arc-length, 216
Partial derivative, 227
N Particular solution, 140
Phase angle, 159
Natural basis, 6 Piecewise smooth curve, 213, 400
Negativedefinite, 377, 390 Piecewise smooth surface, 537
Neighborhood, 247 Pivotal column, 98
Network, 88 Plane, 11, 116, 337
Newtonian potential, 287, 564 through the origin, 1 16
Newton's method, modified, 263 Pointwise convergence, 422
Norm, 41 Poisson equation, 559
box,maximum, 580 Polar angle, 149
Normal component, 217 form, 149
Normal vector, 281 Polar coordinates, 3 1