Convex and Concave Functions
Convex and Concave Functions
SEM-II
University of Delhi
1. Learning outcomes
2. Introduction
3. Derivative Test for Concavity and Convexity
4. Second Derivative and Concavity and Convexity
4.1 Total Differential Method
4.2 Definitions
4.3 Use of Hession Matrix for the determination of Convexity and Concavity
After you have read this chapter, you should be able to:-
2. Introduction
A function f is concave if and only if any pair of distinct point p and R in the domain of f
and 0 1
f ( p (1 – ) R ) f ( p) (1 – ) f ( R )
0 0 1 1
Where p = ( x1 , x2 ) and R = ( x1 , x2 )
The definition can be extended to strict concavity by changing the weak inequality ≥ to
the strict inequality >.
A function f is convex if and only if any pair of distinct points p and R in domain of f and
for 0<θ<1
f ( p (1 – ) R ) f ( p) (1 – ) f ( R )
The right hand side is the height of line segment and the left hand side is the height of
the arc AB.
Figure 1
ƶ = f(x1, x2)
The function f(x,y) is concave (convex) if and only if for any pair of distinct points A and
B on its graph (a-surface) the line segment lies either on or below (above) the surface
except at point A and B. Strict concavity requires the line segment AB lies below the arc
AB. Imagine a dome-shaped surface. The surface of convex function typically be bowl-
shaped. For non-strictly concave and convex function the line segment AB is allowed to
lie on the surface itself, some portion of the surface, or even the entire surface may be
flat rather than curved
Figure 2
In the case of functions of two or more than two variable, it becomes difficult to use
diagrammatic method or algebraic method to determine the concavity or convexity of
function. The functions are such that they require a lot of algebraic manupulation to use
the algebraic formula. A way out is to use the derivatives if the functions is
differentiable.
A differentiable function f(x) = f(x1, x2,..., xn) is concave if and only if for any given point
0 2 0 1 1 1
p = ( x1 , x2 ,....., xn ) and any other point R = ( x1 , x2 ,...., x n ) in convex domain
n
f ( R ) f ( p) f i ( f ) ( R – p)
L 1
Geometrically it means that for a concave function the tangent plane on point p on the
graph of the function lies initially above the graph of the function.
In the case of a convex function graph of the function lies strictly above all the tangent
planes or the hyper planes, except the point of tangency.
Example
2 2
ƶ = x1 x2
The function is convex if for all X = (x1, x2) and Y = (y1, y2)
f f
(where 2 x1 and 2 x2 ) = 2 x1 y1 – 2 x12 2 x2 y2 – 2 x22
x1 x2
( y1 – x1 ) 2 ( y2 – x2 ) 2 ≥ 0
The expression in the brackets will remain positive whatever the value of (x1, x2) and
(y1, y2). This proves that the function is convex
Till now we have discussed about curvature properties of the function by using algebra
or first derivative concavity and convexity of a function is usually discussed using the
second derivative. The second derivative shows how the function represented by the first
derivative changes. In the case of function of one variable we saw that if f''>0 is convex
which means that for f'>0 the function increases more rapidly as x increases while for
f'<0 the function values full less quickly. For f''<0 the function is concave which means
that for f'>0 the function value increases less quickly as x increases while for f'<0 the
function value falls more quickly.
We cannot use the method of determining concavity and convexity for function of two
variables (or n variables). Second partial derivatives cannot be used directly because
there are infinite number of paths that one can take from same point.
2 2
ƶ = x1 x2 – 20 x1 x2
f11 and f22 are positive, the function is not strictly convex in all directions. Cross partial
derivative also plays a role in determining the curvature of the function.
In order to determine the concavity (convexity) of the functions of two variables (this
approach can be extended to n-variables also) we shall use the method of total
differential.
Let y = f(x)
d (dy )
d2y = d(dy) = dx
dx
d [ f '( x) dx]
dx
dx
d (dx )
f ''( x ) dx dx f '( x ) dx
dx
= f''(x) dx2
This is called the second total differential of f(x). Since the term dx2 = (dx)2, it is strictly
positive for any value of dx ≠ 0. It follows that d2y has some sign as f''(x). Therefore the
determination of convexity and concavity which relies on the sign of f''(x) can be
presented using the sign of d2y. A function is convex if f''(x) ≥ 0 and concave if f''(x) ≤ 0
then
The same conditions relating to the sign of d2y to concavity/convexity apply to functions
of n-variables. Here we shall explain this method for two variables.
y = f(x1, x2)
f f
dy dx1 dx2
x1 x2
= f1 dx1 + f2 dx2
(dy) (dy )
d (dy ) d 2 y dx1 dx2
x1 x2
2 2
= f11 dx1 f 21 dx1 dx2 f12 dx1 dx2 f 22 dx2
2 2
= f11 dx 2 f12 dx1 dx2 f 22 dx2 ` (f12 = f21)
The expression makes it clear that d2y depends on cross partial derivative f12 as well as
f11 and f22.
4.2 Definations
Def: A twice continuously differentiable function y = f(x1, x2) is concave if and only if,
d2y is everywhere negative semi definite
Def : A twice continuously differentiable function y = f(x1, x2) is convex if and only if d2y
is everywhere positive semi definite.
If the second order total differential is satisfies the condition d2y ≶ 0 then the function is
strictly concave/convex.
The method of determining the sign of d2y directly can involve a lot of algebraic
manipulation even when the function is function of two variables. In an earlier topic
dealing with maxima and minima we have used quadratic forms and their properties to
determine maxima-minima. Here also we can use the same method to determine the
sign of d2y.
f f12 dx1
[dx1 dx2 ] 11 ...............
f 21 f 22 dx2
We known that f12 = f21 from young's theorem. It follows that 2×2 matrix is symmetric.
This matrix whose elements are second order partial derivatives and cross partial
derivatives is called the Hession matrix and is denoted by H. Hession matrix can be used
to determine the concavity and convexity of the function.
4.3 Use of Hession Matrix for the determination of Convexity and Concavity
Def: For any function y = f(x1, x2, ...., xn) = f(X) where X............. which is twice
diffeerntiable with Hession H, the function f is strictly concave on Rn iff H is negative
definite for all X in Rn, that is
Then
d2y = dxT H dX
The function f is strictly convex on Rn if and only if H is positive definite for all x € Rn,
that is
The Hession H is positive definite if and only if all the leading principal minors of Matrix H
are positive.
For example if y = f(x1, x2, x3) the leading principal minors are
f11 f12
|H1| = |f1|, |H2| = |H3| = |H| = |f11 f12 f13|
f 21 f 22
If |H1|<0, |H2|>0 |H3|>0 the f(x1, x2, x3) is strictly convex (the Hession
is positive definite.
then H is negative definite, and the f(x1, x2, x3) is strictly concave.
H is positive definite on Rn if and only if its leading principle minors are positive |H1|>0,
|H2|>0, .... |Hn| = |H|>0 for X ∈ Rn
H is negative definite on Rn if and only if its leading principle minors alternate in sign
begining with a negative value for |H1|.
>0 n is even
|H1|<0 |H2|>0, ... |Hn| = |H|
<0 n is odd
Note : A leading principle minor of order r of Hession matrix is found by suppressing the
last n–r rows and columns.
Example. In the case of a 3×3 matrix |H1| is found by suppressing 2nd and 3rd rows and
columns.
|H1| = f11
A leading principle minor of order 2 |H2| is found by suppressing the third row and third
column.
So far we have given the conditions for strict concavity and strict convexity. There are
functions which are not strictly concave/convex. They are concave or convex.
Example
A twice differentiable function y = f(x1, x2...xn) is concave if and only if d2y is everywhere
negative semidefinite.
In terms of Hession matrix this means H is negative semi definite on Rn if and only if all
its principle minors alternate in sign begining with negative or zero value for k=1 (HK).
A twice differentiable function y = f(x1, x2, ... xn) is convex if and only if d2y is
everywhere positive semi-definite.
In terms of H this means all its principal minors are positive or zero
Note : Principal minors of order are found by suppressing n-k rows and columns of H.
f11 f12
|HX1| f11, f12 |HX2| =
f 21 f 22
f11 f12
f11 ≤ 0, f22 ≤ 0, ≥0
f 21 f 22
If we choose any two points b and d on the line and connect then by a line. The line
connecting the points b and d also lies on the straight line which we drew in the
beginning. All the point on the line connecting b and d lie on the original line.
Now look at the circle choose any two points on or in the circle and connect them by a
straight line x1x2. This straight line also lies within the circle.
We are now in a position to define the property of straight line and the circle shown
above.
A set is convex if the line joining any two points of the set lies entirely within the set.
The straight line and the circle which includes the area within the circle is an example of
convex set.
Figure 4
In fig. 4, we have drawn the first quadrant of the eucledium space. If we take two points
a1 and a2 in the figure and connect them by a straight line then the entire line lies within
1 2
the quadrant. For example if we look at the point ƶ = a1 a2 the point ƶ lies on the
3 3
straight line connecting the two points a1 and a2. Any point on the straight line
connecting points a1 and point a2 can be expressed as :
Def: A set S is convex if for every pair of points x1 ∈ S and x2 ∈ S, point x̄ = x1 + (1–
)x2is also an element of , for every value of when 0 ≤ ≤ 1.
A set containing only one point is a convex set. Null set is also considered as a convex
set.
Figure 5
The circle is not hallow. All the three figures given above are examples of convex sets.
Figure 6
In these figures there is a feature of reentrance (and also a hole). This is a cause for
non-convexity. To qualify as a convex set, the set of points in the figure must contain no
holes, and its boundary points must be not be reentered anywhere.
A function which gives rise to a hill over its entire domain is a concave function. A
function which gives rise to a valley over its entire domain is convex function.
If the hill (or valley) does not contain any flat surface then the function is suidth be
strictly concave (convex) function. In case a function which give rise to hill (or valley)
and contains flat surface also, is a concave function (convex function).
4.5 Assumption
The domain of the function is a convex set. This assumption is necessary because we
use the combination of x1 and x2 in the domain D to prove whether the f is a concave or
convex function.
In fig. 7, let x ≥ 0 be the domain of the function. This domain is a convex set. If we take
two values of x in the domain x1 and x2. The associated values of the function are f(x1)
and f(x2) connect these two points by a straight line AB. The graph of the function is also
given in the fig1. The graph of f is shown by are AB.
The straight line lies below the arc AB. The value of the function at x̄ between x1 and x2
is f(x̄) = C. This is higher than the point (D) on line AB immediately above the value x̄. It
is clear that f(x̄) > d. This property can be expressed as strict concavity of the function.
c = f(x1 + (1 – )x2)
In simple words if we take only two points on the domain where the domain is a convex
set then convex combination of these points is also in the domain of the function.
4.6 Theorem
c = f(x1 + (1 – )x2) > f(x1) + (1 – ) f(x2) ∈ [0, 1] and x ∈ Dan interval which is a
convex set.
In fig. 8, line AB lies entirely above the graph of the function except at point A & point B.
the f(x) is a convex function. A convex function bends below the line joining points f(x1)
and f(x2) (AB).
It is a concave function if
A linear function is a convex and concave function because it satisfies the conditions of
both convex and concave function.
Figure 9a Figure 9b
Figure9c
For a convex function the inequality is reversed. In fig 9(a) the function is convex and in
fig 9(b) the function is concave. In fig 9(a) look at all the points above the graph of the
function and below the straight line k parallel to x-axis. The set of points satisfy the
above definition and are actually a convex set. Similarly the shaded area in fig b is a
convex set. The function depicted in fig a is a convex function on the convex domain [a,
b].
Now observe fig 9c. There is a tangent at point x0. The tangent line h on any point on a
concave function will lie above the function (except at point f(x0)). For a convex function
the tangent line at any point x0 will lie below the graph of the function.
The right hand side is actually the equation of the tangent at x0 on the function. If we
move slightly away from point x0 on either side of x0 the tangent line at point x0 lies
above the graph of the function.
In the case of convex function (if the function is differentiable) the tangent line at (x0,
f(x0)) will lie below the graph of the function except at point (x0, f(x0))
If the function is differentiable twice then we can use the second derivative to test the
concavity and convexity.
f'(x) > 0 on an interval (a, b) means that the function is increasing on (a, b)
f'(x) < 0 on an interval (a, b) means that the function is decreasing on (a, b)
If f''(x) > 0 it means the slope of the tangent is increasing as we move from left to right
on the graph. In the fig (a) the slope of the tangent is increasing when we move from x1
to x2, where x1 < x2. This happens when the function is convex.
If f''(x) < 0 on (a, b) then tangent becomes flatter when we move to x0 from the left. It
means the slope of the tangent is decreasing as we move from left to right on the graph.
In other words when we move from x1 to x2 where x1 < x2 the slope of the tangent is
decreasing.
We conclude f is strictly concave on interval I if and only if f''(x) < 0 for all x in the
interior of I.
Function f is strictly convex on interval I if and only if f''(x) > 0 for all x on interior of I.
Example
Y X
(y + y ) – (x + x ) ≥ (2x1 2x2)
Y X
= 2x1y1 – 2x + 2x2y2 – 2x
y + y – x – x2 + 2x + 2x – 2x1y1 – 2x2y2 ≥ 0
y + y + x + x – 2x1y1 – 2x2y2 ≥ 0
f11 < 0 f22 > 0 f11 f22 – (f12)2 > 0 = 4 0 > 0 proved
While discussing concave and convex functions we saw that if the function is concave
(convex) there is no need to check the second order condition to determine whether the
function achiever maxima (minima) or not. When we are dealing with problem of
constrained optimization, it is again possible to dispense with the second order condition.
In the case of constrained optimization quasi-concavity of the function obviates need for
second order condition for determining the maxima. In a similar manner quasi-convex
function removes the need for second order condition when we are trying to find out
minimum of the function.
4) Choose two distinct point xi and xj (or x1 and x2) such that xi < xj in the convex
domain of the function.
5) The function f(x) forms an arc between xi and sj such that f (xi) = A and f (xj) =B
In fig. 11(a) point B is higher in height than A. In other words f(xj) > f(xi). The function
is strictly quasi-concave if all other points on are AB are higher in height than point A.
3) The function f(x) forms an arc CD between xi and xj such that f(xi) = c and f(xj) =
D. In fig(b) f(xj) < f(xi). The function is strictly quasi-concave if all other points on
the arc are lower in height than f(xi)
Let f be a function of x. Then for any two distinct points xi and xj in the convex domain of
the function such that xi < xj and 0 < θ < 1, the function is strictly quasi-concave
function if the following inequality is satisfied.
If we replace the strict inequality with weak inequality then the function is quasi-
concave.
The weak inequality implies that there is some horizontal straight line segment also on
the arc AB.
7. Quasi-convex function
Suppose f is a function of x. then for any two points xi and xj and for 0 < θ < 1 the
function is strictly quasi-convex if the following inequality is satisfied.
If we replace the strict inequality with weak inequality the function satisfy the condition
is quasi-concave.
Differentiable functions
2) All concave (convex) functions (strict or non strict) arc quasi-concave (quasi-
convex). But the opposite is not true.
Suppose a function z = f(x1, x2, ........ xn) is twice continuously differentiable. The quasi-
concavity and quasi-convexity of the function can be checked with the help of first and
second partial derivatives of the function arranged as a bordered determinant.
0 ……….
………….
|B| = …………
.. .. ..
. …………
Quasi-convex.
Strictly quasiconvex if
11. Exercises
1) Are the following function quasiconcave ? Which of them are also concave
c) f(x, y) = x2 y3
d) f(x, y) = x y2
2) Which of these function defined on are quasiconvex which are also convex
a) +
b) 3 +4
c) 2x1 + 3x2 –
a) z = – (x + x )
is it a concave function ?
d) x4 + x2 + y2 + y4 – 3x – 8y (Convex)
f) x – y – x2
a) (x1 + x2) /
defined on R2++
12. References
Allen, R.G,D, Mathematical Analysis for Economists, London: Macmillan and Co. Ltd
Knut Sydsaeter and Peter J. Hammond, Mathematics for Economic Analysis, Prentice Hall
Carl P. Simon and Lawrence Blume, Mathematics for Economists, London: W .W. Norton & Co.