Function of Many Variables
Function of Many Variables
The concept of “ceteris paribus” is one that is well known even to beginning students of economics. It
is a mental experiment where we consider the effect of a change in one variable on another variable
holding everything else fixed. The concept of partial derivatives delivers exactly these sorts of
statements.
d
f (k , 4) = 1.2k −0.7 .
dk
If k = 1 , then df (1, 4) / dk = 1.2 , i.e. when k = 1 , a one-unit increase in k will increase output by
l =4.
All these values would of course change if l were fixed at some other value:
If l = 9 , then
d d2
f (k ,9) = 6k 0.3 , f (k ,9) = 1.8k −0.7 and −1.26k −1.7 < 0
f (k ,9) =
dk dk 2
For any value of k , an increase in k will result in a larger increase in output than in the previous case
when l = 4 . Additional capital is more productive when l = 9 than when l = 4 , although this
productivity is diminishing at a faster rate. Clearly the effect on y of increasing k depends not only
Instead, we simply treat l as fixed (as in the ceteris paribus mental experiment) and treat the function
as a function of one variable k . Then we can take derivatives in the usual way. We say we are taking
∂y ∂f
= = fk′
= f1′ (k=
,l) 0.6k −0.7 l 0.5 .
∂k ∂k
(I have given four commonly used notations for partial derivatives -- use whichever notation seems
Note the use of “ ∂ ” rather than " d " in the ∂y / ∂k notation. This is merely convention: we use ∂ when
the derivative is a partial derivative (some variable is being held constant) and “ d ” for derivatives of
Our answer to the question “what is the effect on y of increasing k when l is held at l = 4 ?” can then
f1′ (k , 4)
= 0.6
= k −0.7 40.5 1.2k −0.7
In similar fashion, we can ask what the effect is on y of increasing l holding k fixed. We have
∂y ∂f
= = fl′
= f 2′ (k ,=
l) k 0.3l −0.5
∂l ∂l
∂z ∂z
Example 18.2 z = x 3 y 4 . Then = 3 x 2 y 4 , and = 4 x3 y 3 .
∂x ∂y
There is another notation for partial derivatives: given z = f ( x, y ) , I can also write f1′ for f x′ , and f 2′
for f y′ . The '1' in the subscript of f1′ means differentiate with respect to the first variable which appears
in the list of independent variables in f ( x, y ) , i.e., ' x ' . The '2' in the subscript of f 2′ means
differentiate with respect to the first variable which appears in the list of independent variables in
f ( x, y ) , i.e., ' y ' . This is especially useful when we apply the chain rule.
∂2 f ∂ ∂f
2
≡ ≡ f xx′′
∂x ∂x ∂x
The interpretation of these two higher
2
∂ f ∂ ∂f partial derivatives should be obvious.
2
≡ ≡ f yy′′
∂y ∂y ∂y
∂2 f ∂ ∂f
≡ ≡ f yx′′ How does the “rate at which f changes with y ”
∂x∂y ∂x ∂y
change when x changes?
∂2 f ∂2 f
and are called cross-partials.
∂l ∂k ∂k ∂l
∂f ∂f
= Aak a −1l b = Abk a l b −1
∂k ∂l
∂2 f ∂2 f
= Aa (a − 1)k a − 2l b = Ab(b − 1)k a l b − 2
∂k 2 ∂l 2
∂2 f ∂2 f
= Aabk a −1l b −1 = Aabk a −1l b −1
∂l ∂k ∂k ∂l
∂2 f ∂2 f
Notice that = .
∂l ∂k ∂k ∂l
This is not a coincidence. It turns out that the order with which you take cross partials is irrelevant. This
is a result known as Young’s Theorem, which we will not prove in this course.
∂z 1 ∂z x
= ; = − 2;
∂x y ∂y y
∂2 z ∂2 z 2x
= 0; = 3 ;
∂x 2 ∂y 2
y
∂2 z 1 ∂2 z 1
= − 2; = − 2.
∂y∂x y ∂x∂y y
responds to a change in the variable xi , holding all other variables fixed. To answer such questions,
x4
Example 18.5 If f ( x, y, z ) = , then
yz
4 x3 x4 x4
f x′ = ; f y′ = − ; f z′ = − ;
yz y2 z yz 2
For each of the n partial derivatives of the function z = f ( x1 , x2 ,..., xn ) , we can compute n second-order
partial derivatives:
∂ ∂f ∂2 f
=f ij′′ =
∂x j ∂xi ∂x j ∂xi
In other words, for a function of n-variables, we have n 2 second-order partial derivatives. We often
organize these n 2 partial derivatives into the following square array, called the “Hessian Matrix”:
x4
Example 18.6 f ( x, y , z ) =
yz
4 x3 x4 x4
f x′ = ; f y′ = − ; f z′ = − ;
yz y2 z yz 2
12 x 2 4 x3 4 x3
f xx′′ = f xy′′ = − f xz′′ = −
yz y2 z yz 2
4 x3 2 x4 x4
f yx′′ = − f yy′′ = f yz′′ =
y2 z y3 z y2 z2
4 x3 x4 2 x4
f zx′′ = − f zy′′ = f zz′′ =
yz 2 y2 z2 yz 3
We can compute even higher-ordered derivatives. It turns out that the cross-derivatives are the same
regardless of the order in which the differentiation was carried out. We have
This is always true. For any function of n variables z = f ( x1 , x2 ,..., xn ) , we always have
∂2 f ∂2 f
= .
∂xi ∂x j ∂x j ∂xi
f ( x + h, y ) − f ( x , y ) f ( x, y + h ) − f ( x, y )
1. Given f ( x, y ) below, find and . Find
h h
f ( x + h, y ) − f ( x , y ) f ( x, y + h ) − f ( x, y )
(i) lim h→0 and (ii) lim h→0
h h
(Comment: as you do these drills, try to get the intuition that you are, in the case of (i) differentiating
the function with respect to x , treating y as though it were a constant, and in the case of (ii),
) 3x + 2 y ;
(a) f ( x, y=
(b) f ( x, y ) = 5 xy ;
) x2 + y 2 ;
(c) f ( x, y=
(d) f ( x, y ) = 4 x 2 y ;
(e) f ( x, y ) = 3 x 2 ;
(f) f ( x, y ) = y 2 / x ;
f ( x + h, y ) − f ( x , y ) f ( x, y + h ) − f ( x, y )
In each case, find lim h→0 and lim h→0
h h
(b) f11′′ , f 22
′′ , f12′′ , and f 21
′′ . Verify in each case that f12′′ = f 21
′′ .
(b) f11′′ , f 22
′′ , f33
′′ , f12′′ , f 21
′′ , f13′′ , f31
′′ , f 23 ′′ . Verify in each case that f ′′ij = f ′′ji .
′′ and f32
(ii) Find the range of the function when the function is defined over the largest possible
domain;
(a) f ( x, y ) =x 2 + 2 xy + y 2 x, y ) x 3 e − y + y 3
(b) f (=
x2 y3
(c) f ( x, y ) = x 2 ye xy (d) f ( x, y ) =
x+ y
7. Given f ( x, y ) =x 2 + 2 xy + y 2 , find
(a) [ f ( x + h, y ) − f ( x, y )]/ h
(b) [ f ( x, y + h) − f ( x, y )]/ h
8. The volume of a cone of height h and radius (at its base) r is given by
= ( r , h) π r 2 h / 3 .
V f=
How would you interpret the equation ∂V / ∂r =2V / r . Show that the change in the volume
of the cone as you increase its height while holding the base radius constant is proportional to