Essentials of Advanced Macroeconomic Theory 2012
Essentials of Advanced Macroeconomic Theory 2012
Essentials of Advanced Macroeconomic Theory 2012
Essentials of Advanced
Macroeconomic Theory
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Ola Olsson
Essentials of Advanced
Macroeconomic Theory
First published 2012
by Routledge
2 Park Square, Milton Park, Abingdon, Oxon OX14 4RN
Simultaneously published in the USA and Canada
by Routledge
711 Third Avenue, New York, NY 10017
Routledge is an imprint of the Taylor & Francis Group, an Informa business
c 2012 Ola Olsson
The right of Ola Olsson to be identified as author of this work has been
asserted by him in accordance with the Copyright, Designs and Patent
Act 1988.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
List of figures ix
Preface xi
1 Introduction 1
1.1 The issues 1
1.2 The national accounts identity 2
1.3 Outline 3
PART I
The Long Run 5
2 The Malthusian World 7
2.1 Introduction 7
2.2 The law of diminishing returns 7
2.3 The Malthusian trap 9
2.4 Endogenous fertility 11
2.5 The collapse of the Malthusian link 14
PART II
The Short and Medium Run 51
Downloaded by [Hacettepe University] at 20:58 10 April 2017
7 Financial Crises 58
7.1 Basic assumptions 58
7.2 Banks 59
7.3 A bank run equilibrium 61
7.4 Foreign credit 62
7.5 Short-term debt 63
7.6 Liberalizing international credit markets 64
PART III
Macroeconomic Policy 99
11 IS–MP, Aggregate Demand, and Aggregate Supply 101
11.1 Aggregate expenditure and the multiplier 101
11.2 The IS–MP model 103
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Notes 156
Bibliography 161
Index 164
Downloaded by [Hacettepe University] at 20:58 10 April 2017
List of figures
Downloaded by [Hacettepe University] at 20:58 10 April 2017
are surveyed below do not emerge from a core set of assumptions that are then
extended and applied in different directions, neither is the analysis based on a
small number of key equations as in certain macro textbooks. Despite recent
efforts, macroeconomic theory is still not a coherent body of theory in the same
way as microeconomics or econometrics is. The DGE program was clearly an
attempt to provide such a coherent framework, but recent events have put that
effort into a less favorable light.1
The “workhorse” model for most chapters is, however, the well-known two-
period, representative agent model of consumer optimization with a utility func-
tion given by U = u(c1 ) + βu(c2 ) that is maximized subject to varying constraints.
The majority of all models presented are thus “micro-founded”, which should
make the links to microeconomic theory more easily recognizable. Several of the
chapters start off with a typical Keynesian model, which is then contrasted to
models founded in individual household behavior and characterized by rational
expectations and intertemporal optimization.
A key motivation of this text in comparison with the literature in the field is its
condensed form. As a rule, most advanced textbooks in macroeconomics are about
500–600 pages long, mixing theory with somewhat randomly chosen empirical
applications. This text is intended to be less than half as long as a standard textbook
and to serve more or less as a reference source on modern macro theory. It is my
hope that it will direct impatient readers (like myself) quickly to the main results.
Admittedly, this writing approach might run the risk of alienating readers who rely
more on texts focusing on the intuition behind models. Such readers might want
to gather deeper intuitive insights from other sources, for instance from articles or
more comprehensive macroeconomic textbooks.
A further and important delimitation of this work is that it will not discuss
empirical tests of the theories surveyed. The reason is partly that I want to keep
the text compact, but also that it is my impression that researchers seem to be
somewhat more in agreement about what they think are the most relevant models,
as compared with what they consider to be the most successful empirical tests of
those models. Theory also changes more slowly than the stock of empirical results.
This text should ideally be complemented with selected readings on empirical
motivations and applications of the theories presented.
Preface xiii
The book is intended to be suitable for a master’s course in macro theory, last-
ing for about half a semester. Certain sections or chapters might also serve as an
introduction to macroeconomics for nonspecialized graduate students. Readers are
presumed to be relatively well equipped with calculus and algebra. A fairly strong
background knowledge of both micro and macro theory is taken for granted.
The text has emerged from my experience of teaching advanced macro the-
ory at the University of Gothenburg. Special thanks are due to my former teacher
and most ardent reviewer Wlodek Bursztyn for having provided extensive com-
ments on several previous versions (we still do not agree on certain aspects . . .).
I have also benefited from many valuable discussions with Heather Congdon Fors
and Per Krusell regarding macro theory in general. Oded Galor, Halvor Mehlum,
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Ola Olsson
(ola.olsson@economics.gu.se)
Floda, Sweden
May 2011
1 Introduction
Downloaded by [Hacettepe University] at 20:58 10 April 2017
• What policies are most effective against unemployment, and how should the
government or the central bank fight inflation?
• How can economic growth be increased?
• How should governments stabilize short-run fluctuations and business cycles?
• What is a sustainable level of government debt?
which measure the level at a given point in time. We will return to this latter type
of variables later.
Total GDP can be calculated in three ways, which all should yield the same
result. The most common characterization of GDP is to study it from the user side,
i.e. what total GDP is spent on. This is the expenditure approach to measuring total
GDP and can be described by the equation
Yt = Ct + It + G t + X t − Mt (1.1)
Yt = agriculture + · · · + manufacturing + · · ·
+ professional and business services . . . (1.3)
where the total values of production from all sectors of society are added.
The equation for the user side of GDP in (1.1) is the backbone of macro-
Downloaded by [Hacettepe University] at 20:58 10 April 2017
economic theory from which the subsequent analysis is derived and extended in
numerous ways. It also serves as a introduction to an outline of the exposition
below.
1.3 Outline
The following chapters are organized as follows. We start off by analyzing the
long-run determinants of total GDP, i.e. growth theory. Chapter 2 is devoted to
the Malthusian model of growth, Chapter 3 presents the neoclassical (or Solow)
growth model and its extensions, and Chapter 4 deals with endogenous growth
models where technological progress plays a prominent role. In Chapter 5, we
develop the overlapping generations model, which is long-run in nature and which
is used also in the chapters ahead.
After the long run, we take a look at macroeconomic theory in the short and
medium run. In Chapter 6, we study models on the behavior of total GDP and its
components over the business cycle, i.e. a period of roughly five years. Chapter 7
discusses a recurrent phenomenon in capitalist economies: financial crises and
bank runs. We then move on to analyze specifically the constituent parts of the
expenditure side: consumption (and saving) in Chapter 8, investment and asset
markets in Chapter 9, and one of the key markets for understanding the macro
economy, the labor market, in Chapter 10.
In the third main section of the book, we analyze a broad range of topics related
to macroeconomic policy. We begin by presenting the traditional IS–MP, aggre-
gate supply and aggregate demand frameworks and the refinements suggested by
the rational expectations view and the new Keynesian view in Chapter 11. We then
go on to public finance and fiscal policy (Chapter 12), and inflation and monetary
policy (Chapter 13). Lastly, we discuss international aspects of economic policy
in Chapter 14.
Some basic mathematical results that are used throughout the text are provided
in an appendix.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Part I
2.1 Introduction
In this chapter, we will describe a model of long-run economic growth that was
applicable to all countries in the world up until the industrial revolution and which
still is a highly relevant model for some developing countries. In this “Malthusian
world” there is a strong link between income per capita and population growth, so
that anything that increases aggregate income in a society will soon be neutralized
by an increase in the size of the population. Hence, even despite periods of rapid
technological progress, income per capita will remain at a fairly constant level.
Recent empirical work on historical data has shown that standards of living indeed
appear to have been roughly similar in Assyria around 1500 BC, in Egypt during
Roman times, and in late eighteenth-century England (Clark 2007). This section
is motivated by this stylized fact from economic history.
The main insights behind this model were proposed by Thomas Malthus (1798)
but also critically hinge on the principle of diminishing returns to factors of
production, and on theory that was further developed by David Ricardo. In the
sections below, we will briefly discuss the theory of diminishing returns, the
Malthusian model of long-run stagnation, and reasons for the eventual collapse
of the Malthusian link. We will also show how fertility can be endogenously
determined within a representative household.
Y = AX α L 1−α
Downloaded by [Hacettepe University] at 20:58 10 April 2017
(2.1)
The marginal product is always positive, indicating that if we increase the labor
force in the country by one person, that person will always produce some extra out-
put. However, as we see from (2.2), the marginal product will fall as we increase
the number of workers since L now appears in the denominator. Formally, we can
also show this by taking the second derivative:
∂ 2Y
= −α(1 − α)AX α L −α−1 < 0
∂ L2
Thus, the Cobb–Douglas functional form ensures that Y has a concave relationship
with L.
Note also that we can express total output per worker or per capita y as
α
Y AX α L 1−α X
=y= =A = Ax α (2.3)
L L L
where x is land per capita. Output per capita is one of the most often used indica-
tors of standards of living in a country and is highly correlated with factors such
The Malthusian world 9
as life expectancy, levels of education, the rule of law, and political freedom. We
will refer to it frequently in the pages ahead.
From the expression in (2.3), it is clear that output per capita will fall with a
greater level of population in the Malthusian model. On the one hand, one more
worker means a little bit more output, but it also means another person to share
total production with. The latter negative effect dominates. The first and second
derivatives show us that
∂y ∂2y
= −α AX α L −α−1 < 0, = α(1 + α)AX α L −α−2 > 0
∂L ∂ L2
The central feature is that both Bt and Dt are functions of output per capita
lagged one year, yt−1 . Birth rates increase with yt−1 such that Bt (yt−1 ) > 0
whereas Dt (yt−1 ) < 0. With an increase in output per capita, the supply of food
increases, which allows families to grow. Likewise, the higher food consumption
associated with a larger output at a given size of the population means that people
die to a lesser extent from disease.1
The relationships between births, deaths, and the sizes of population and output
per capita are drawn in Figure 2.1. In the figure, we assume for simplicity that Bt
and Dt are linear functions of y. In the lower graph, we have drawn the negative
convex relationship between y and L as stipulated by (2.3). The main insight from
the figure is that output per capita will tend to converge towards an equilibrium
level given by y ∗ . In the Malthusian model, y ∗ is often referred to as the subsis-
tence level since it is inevitably at a quite low level that is not far above the level
of income that allows people to survive. At this level, population ceases to grow
and L t − L t−1 = Bt (y ∗ ) − Dt (y ∗ ) = 0.
To see that y ∗ is an equilibrium, consider a relatively high level of initial out-
put y 0 . At this level of affluence, many children are born and relatively few people
die of disease. Hence population levels increase, which pushes y to the left in the
figure. The economy comes to rest again at y ∗ where output per capita has fallen
to its subsistence level.
An even more grim situation is of course if the economy instead starts off to the
left of y ∗ . In this situation, people are starving, few children are born, and many
10 The long run
D(y )
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Y/L
L* y = A(X/L)α
L0
Y/L = y
y* y0
people die from disease. The level of the population shrinks, which gradually
causes output per capita to rise. Back at y ∗ , normal times resume.
Consider now a positive technological shock such as the use of the plow or the
introduction of windmills in the Middle Ages. Such a shock will appear as an
increase in A in (2.3) and as an outward shift of the y-curve in the lower panel
of Figure 2.2. This will temporarily cause output per capita to rise to y > y ∗ .
However, this new situation of prosperity will soon lead to a higher birth rate and
a lower death rate, which will cause the population to grow. When population
has grown to L ∗,new > L ∗ , income per capita is back at its old level y ∗ . The only
lasting result of a positive technology shock during the Malthusian era is thus a
larger population.
As was mentioned above, several developing economies in the world that are
dominated by subsistence agriculture are still caught in the Malthusian trap. Con-
sider an extended drought of several years, such as happened in the African Sahel
in 1985. In terms of Figure 2.2, such a shift would be like a negative shock to At
and would cause the y-curve in the lower bar to shift leftwards so that income fell
The Malthusian world 11
B(y )
D(y )
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Y/L
L*,new
L*
y = Anew(X/L)α
y = A(X/L)α
Y/L = y
y* y′
below its subsistence level. If such populations are left to themselves, famine will
set in and people will start dying of starvation. Starvation would, however, cause
income levels per capita to start rising again, according to the Malthusian logic.
Eventually, the economy would return to the old equilibrium level but this time
with a much smaller population.
In the modern era, most Western governments would find such mass starvations
unacceptable and provide emergency aid in the form of food and other relief. Such
an intervention is of course highly commendable from a moral point of view, but
it does not remove the Malthusian trap from the affected economy. If a downward
adjustment of population levels does not take place, the country will be caught
with a too large population that the country itself cannot feed. In this way, many
of the poorest countries become dependent on aid.
function makes two simplifying assumptions. First, we imagine that this single
(hermaphrodite) individual can choose how many children he or she wishes to
have. Second, we imagine that children can come in nondiscrete amounts (i.e. we
allow n t = 1.2 to be a possibility).
The budget constraint for the individual is
ct + ρn t = yt (2.6)
where ρ > 0 is the relative cost of raising children and yt is income per capita as
specified above. This constraint shows that the individual faces a trade-off between
own consumption ct and having children. If the person decides to have children,
the cost per child is fixed at ρ.
The utility function and the budget constraint together define a utility-
maximization problem:
max Ut subject to ct + ρn t = yt
nt
max ln Ut = (1 − β) ln(yt − ρn t ) + β ln n t
nt
βyt L t β AL 1−α Xα
L t+1 = n ∗t L t = = t
= ϕ(A, L t ) (2.7)
ρ ρ
Lt+1
Lt+1 = Lt (45º)
Lt+1 = ϕ (A,Lt)
Lt
Lt*
1
L∗ 1 βA α
= ∗=
X x ρ
At+1 − At = At γ E i
tion. Regions that were generously endowed thus had fast growth and eventually
reached a critical level Ā where the transition to Neolithic agriculture happened
early.5 The transition to agriculture then implied that a certain part of the popula-
tion was freed from producing and formed a new elite of specialists that would
be essential for the subsequent rapid advance of science and technology. The
Neolithic revolution thus created the first instance of endogenous technological
growth, i.e. the purposeful creation of new knowledge through the allocation of
labor resources to a new sector.
In Olsson and Hibbs (2005), the growth rate of technology changed shape to
become
At+1 − At
= g(at L t )
At
where at < 1 is the share of the total labor force L t at time t engaged in endogenous
knowledge creation, and g(at L t ) is a function of at L t such that g (at L t ) > 0. In
Diamond’s (1997) informal account and in Olsson and Hibbs’ model, the very
early creation of this knowledge-producing sector is the key to understanding why
Western Eurasia could start to dominate other continents from AD 1500 despite the
fact that Europe was not richer in per capita terms than any other part of the world.
According to this view, the industrial revolution was just a natural extension to a
development that had its roots in the transition to agriculture.
This theory has been criticized by a direction of research emphasizing the cen-
tral role of economic and political institutions, i.e. the fundamental rules that
societies live by. Acemoglu et al (2005) demonstrate that Western colonialism
played an important role in the accumulation of capital in western coastal Europe.
The inflow of capital strengthened the political power of a merchant class in coun-
tries like Britain and the Netherlands, which in turn led to reforms and stronger
institutional constraints against the executive vis-à-vis the citizens. In Spain, how-
ever, the inflow of capital only led to the enrichment of an elite around the
Crown and fostered rent-seeking behavior rather than production and investment.
The institutional changes in Britain and the Netherlands eventually triggered the
industrial revolution.
16 The long run
2.5.2 Galor and Weil’s unified growth model
The models above are not concerned with the more exact mechanism whereby
the link between population growth and income per capita is broken. Galor and
Weil (2000), however, focus explicitly on the dynamics of this process. One of the
main assumptions in their model is that parents face a basic trade-off between child
quantity (number of children) and child quality (children’s level of education).6 In
this section we present an extremely simplified version of Galor and Weil’s model.
Parents divide their time between work and childrearing. Time spent on one
child in period t is in turn divided between basic childrearing τ and education so
that the child’s level of education in the next period is et+1 . Let the family’s total
number of children in period t be denoted by n t and let the total time available to
Downloaded by [Hacettepe University] at 20:58 10 April 2017
parents be normalized to unity. Hence, the time budget constraint for parents is
n t (τ + et+1 ) ≤ 1
The time budget constraint shows that parents face a trade-off between having
many children (a high n t ) and giving them a good level of future education et+1 .
This is similar to what Becker and Lewis (1973) refer to as the child quantity/child
quality trade-off. If, for instance, n t (τ + et+1 ) = 1 applies, then in order to increase
et+1 we must necessarily decrease n t in a proportional manner.7 In the full version
of the model, Galor and Weil (2000) derive the optimal allocation of time spent
on the two activities and show that it will to a great extent depend on the parents’
preference structure. It is well known, however, that during most of human his-
tory, the trade-off was such that et+1 = 0. A central element of the breaking of
the Malthusian trap was precisely that levels of education started rising and the
number of children born in each family started falling.
So why did families start to substitute child quantity for child quality? In the
model, Galor and Weil derive the result that the optimal level of et+1 is a function
of the growth rate of technological knowledge such that et = e(gt ). The logic is
simply that the greater the advancement of technological knowledge in a society,
the higher will the optimal level of education be. The derivatives are ∂et /∂gt =
eg ≥ 0 and ∂ 2 et /∂gt2 = egg ≤ 0.
Let us also assume that the growth rate of technological knowledge is given by
At+1 − At
= gt+1 = g(et , L t )
At
(a) gt
et+1 = e (gt+1)
g′ gt+1 = g (et,L0 )
g″
Downloaded by [Hacettepe University] at 20:58 10 April 2017
g ′′′
et
0 = e* = g* e″ e′
(b) gt
et+1 = e (gt+1)
g* gt+1 = g (et,L1)
g′
gt+1 = g (et,L0 )
et
0 e′ e*
Figure 2.4 (a) A Malthusian trap in the evolution of technology and education in the
Galor–Weil model. (b) Escape from the Malthusian trap in the Galor–Weil
model due to an increase in population (L 1 > L 0 ).
The basic relationship between the two variables is shown in Figures 2.4a
and 2.4b. In these figures, we have drawn both the et+1 = e(gt+1) and the gt+1 =
g(et , L t ) curves simultaneously. Their slopes are as derived above.8 Note that
below a certain level of technological progress gt , the optimal level of education et
is zero.
Now consider an economy that starts off at a level of education e > 0. At
this level, next year’s technological progress is fairly high at g > 0. The level
of population is L 0 > 0. However, at this level of g, education in the next period
is discouraged and will fall to e < e , as indicated by the arrow. At this lower
18 The long run
level of education, technological progress will slow down and g will fall further to
g < g . Hence, in the economy at hand, there is a negative feedback loop between
education and technology that eventually causes the economy to reach an equilib-
rium where e∗ = g ∗ = 0. Since no time is spent on education, parents will spend
all their nonworking time on having babies and n t will be large. This is the typical
Malthusian trap scenario.
Consider now another economy (or the same economy at a different date) as in
Figure 2.4b where population is L 1 > L 0 so that the g(et , L 0 ) curve has shifted
upward to g(et , L 0 ). If we start off at e and g , education will be strongly encour-
aged by the high level of g, so that e will increase in the next period. The higher
level of education in turn spurs faster technological progress, and so the economy
enters a positive virtuous circle that settles down at equilibrium levels e∗ , g ∗ > 0.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Since parents’ time budget constraint has not expanded, they will substitute child
quantity n t for child quality et . As total production in society starts rising fast
due to technological progress, the combined effect of a lower n t and a higher gt
makes income per capita rise explosively. The economy has escaped from the
Malthusian trap.
3 The Solow Growth Model
Downloaded by [Hacettepe University] at 20:58 10 April 2017
3.1 Introduction
In this third chapter, we will analyze the determinants of wealth and poverty during
the industrial era among nations most of which have managed to escape from
the Malthusian trap. Rather than focusing on short-term phenomena like business
cycles, we will try to understand long-run patterns of development, for example
why some of the richest countries such as Switzerland and Norway have a level of
GDP that is about one hundred times greater than the GDP of some of the poorest
countries such as Niger and Haiti.1
Population growth will play no important role in this framework since it is
assumed to have stabilized at relatively low (exogenous) levels. The key factor
of production is instead physical capital and the key process is that of conver-
gence, showing how economies that initially start off with relatively low levels of
physical capital per worker should grow faster than relatively richer countries.
The neoclassical growth model builds fundamentally on the work of Robert
Solow (1956) and has become one of the most important models in macroeco-
nomic research. We start by deriving the well-known k̇-equation and the most
important implications that follow from it such as convergence. In the final
sections, we present some extensions to the Solow model where we demon-
strate how technological progress and human capital can be included in the basic
framework.
where Yt is total GDP measured from the production side, K (t) is total physical
capital that is a function of time t, and L(t) is the aggregate labor force.3 K (t)
20 The long run
might be thought of as the total stock of factories and machines in a country and
L(t) as the total number of workers. K and L (we will henceforth usually drop
the time (t) notation for ease of exposition) are thus production factors or inputs
in the aggregate production process.
We also make the following more technical assumptions:
The constant-returns assumption means that if we, for instance, double the levels
of K and L simultaneously, we will get a doubling of total output.4 The second
assumption implies that F has a concave relationship with both production factors
and that the marginal product is always positive. This is the same “Ricardian”
assumption of diminishing returns as was used in the Malthusian model.
It is further assumed that we can transform the aggregate production function
in the following way:
F(K , L) K L K K
=F , =F , 1 = f (k), where k = (3.2)
L L L L L
The most often used functional form for the production function in growth
theory is Cobb–Douglas:
3.3 Dynamics
All variables in the model are functions of time, so the next step is to specify their
dynamics or laws of motion. The growth of labor L is in this setting assumed to
The Solow growth model 21
be exogenously given, i.e. not explained by the model:
∂ L(t) L̇
= L̇(t) = n L, where n > 0, implying =n
∂t L
In this expression, n is the (percentage) growth rate of the labor force (or of
population size). Although time derivatives indicate instantaneous changes in the
stocks, we normally think of L̇L , for instance, as the growth rate during a year, as
in the national accounts.5 A typical level for n would thus be 0.01–0.05.
The key dynamic equation in the Solow model is that specifying the rate of
change in the physical capital stock:
Downloaded by [Hacettepe University] at 20:58 10 April 2017
K̇ = sY − δ K (3.5)
In this expression, s > 0 is the fraction of total output Y that is being saved and
δ > 0 is the capital depreciation rate, i.e. the fraction of total physical capital that is
worn down every year. (Typical and often observed levels are s = 0.2 and δ = 0.05.)
The same expression can be restated as sY = K̇ + δ K = I , which tells us that total
savings sY can be used for net investment K̇ (leading to actual increases in the
capital stock) and replacement investments δ K (replacing the capital that has been
worn down), and which together sum to total aggregate investment I .
The Solow growth model implicitly assumes a closed economy without trade
and where there is no government. Hence, the user side of the economy contains
only investment and consumption (compare with the fundamental equation (1.1)
above). Therefore, we can write
Y = K̇ + δ K + C = I + C
The expression in the third line of (3.6) is the central equation of the Solow growth
model.
3.4 Equilibrium
The k̇-equation can also be drawn as in Figure 3.1. The vertical axis simply shows
levels whereas the horizontal axis shows capital per unit of effective labor, k.
22 The long run
f (k)
(δ + n)k
(1 – s )f(k) = c sf(k )
k
k*
Downloaded by [Hacettepe University] at 20:58 10 April 2017
The two most important lines here are the s f (k) and (δ + n)k curves, where it
is important to note that the former is concave since f (k) < 0. s f (k) is some-
times referred to as the actual level of investment and (δ + n)k as the break-even
level of investment. Beyond these curves, however, there is also the f (k) curve.
Note that the vertical distance between the s f (k) and f (k) curves equals c = C/L,
i.e. consumption per unit of labor.
At low levels of k, k̇ > 0, whereas at high levels of k, k̇ < 0. The only stable
equilibrium in Figure 3.1 occurs at k ∗ , which is defined by the point where the
s f (k) and (δ + n)k curves cross. This is also the level of k where k̇ = 0, implying
that K and L grow at a “balanced” rate.6 k ∗ is often also referred to as the steady-
state equilibrium.
3.5 Implications
3.5.1 Cobb–Douglas functional form
If we assume a Cobb–Douglas functional form for the production function as
in (3.4), we will have the following k̇-equation:
k̇ = sk α − (δ + n)k
From this expression, we can also derive the growth rate per worker (by using the
chain rule):
∂(k α )
ẏ αk α−1 k̇ α k̇
= ∂tα = = (3.7)
y k kα k
sα
= sαk α−1 − α(δ + n) = − α(δ + n)
k 1−α
The Solow growth model 23
s′
s
time
t0
Downloaded by [Hacettepe University] at 20:58 10 April 2017
time
t0
Figure 3.2 Impact on growth rate per worker of an increase in the savings rate.
In the short run, the growth rate will depend on the initial level of k since
economies with a low level of k will have k̇ > 0 and will grow faster. However, as
k increases, the economy will gradually approach its steady-state level k ∗ where
k̇ = 0. Furthermore, an increase in the savings rate s that happens in a steady state
will temporarily increase the growth rate since it will turn k̇ positive. The long-run
effect, however, should be zero when k has moved to a new (and higher) equilib-
rium level (see Figure 3.2). Similarly, a sudden increase in population growth will
lead to a period of negative growth, until the economy settles at a lower equilib-
rium level of k and an associated lower level of output per worker. This prediction
of temporary effects on the growth rate but permanent effects on the steady-state
levels of output is among the most central predictions from the model and has
been tested numerous times in the empirical literature.
We can also solve for the steady-state level of k:
k̇ = 0 =⇒ s(k ∗ )α = (δ + n)k ∗ (3.8)
1
(δ + n) δ+n α−1
=⇒ (k ∗ )α−1 = =⇒ k ∗ =
s s
1
∗ s 1−α
=k =
δ+n
Note that we want to carry out the last step in order to have the exponent positive
1
( α−1 < 0). This expression for the steady-state level clearly shows that k ∗ will
increase with the savings rate s, and decrease with the capital depreciation rate δ
24 The long run
and with the population growth rate n. Identical results are found by moving the
curves in Figure 3.1. The steady-state level of output per worker is thus
α
∗ ∗ α s 1−α
y = (k ) =
δ+n
c∗ = (1 − s) f (k ∗ ) = f (k ∗ ) − (n + δ)k ∗ (3.9)
Downloaded by [Hacettepe University] at 20:58 10 April 2017
We know from (3.8) that k ∗ increases with s, but in the expression (3.9) there
is both a positive and a negative effect of k ∗ on c∗ . When we take the partial
derivative, we get
∂c∗ ∂k ∗
= [ f (k ∗ ) − (n + δ)]
∂s ∂s
The sign of this expression will be determined by the sign of the term in square
∗
brackets (know that ∂k ∗
∂s > 0 always holds). Since f (k ) is very large at small
∗
levels of k (see Figure 3.1), we can infer that ∂c
∂s > 0 when k is small, whereas at
∂c∗
greater levels of k it will be the case that ∂s < 0. Hence, there is a level of k ∗ ,
∗
referred to as k ∗,gold , when ∂c
∂s = 0:
∂c∗
= 0 when f (k ∗,gold ) = n + δ (3.10)
∂s
Beyond this level of k, a further capital accumulation will decrease intensive con-
sumption c. The level k ∗,gold is referred to as s gold . The intuition about this golden
rule of capital accumulation is that it is only useful to increase the savings rate up
to a certain level.
3.5.3 Convergence
The model above has some strong predictions about convergence, i.e. that coun-
tries that start off with a lower level of k should experience a higher growth rate
of output per worker. Another way of illustrating the convergence property is
described below.7
If we insert a Cobb–Douglas production function f (k) = k α into the model
above, it was shown that we can write the growth rate of output per worker as:
ẏ
= α(sk α−1 − δ − n)
y
The Solow growth model 25
1
Since YL = k α = y, we can express k as k =yα. Inserting into the growth equation
above yields
ẏ α−1 s
= α(sy α − δ − n) = α 1−α − δ − n (3.11)
y y α
Variants of this expression form the basis of the cross-country empirical studies on
the determinants of economic growth. The key prediction concerning convergence
is that the growth rate of output per worker should decrease with its initial level, y.
In other words, holding all other factors constant, poorer countries should grow
faster than richer ones. A country’s growth rate during the convergence process
Downloaded by [Hacettepe University] at 20:58 10 April 2017
should further increase with its savings rate s, and decrease with the popula-
tion growth rate n and with the capital depreciation rate δ. The growth rate of
rich countries that have reached their steady state will depend on the exogenous
parameter g.
The convergence result suggests that the poorest countries in the world should
experience the highest growth rates. Although we know that many previously poor
countries have experienced very fast growth rates in recent decades – for instance
China, India, and Botswana – other countries have experienced stagnant growth or
even growth collapses. Some countries, such as DR Congo and Zambia, have even
seen their levels of income per capita fall by half. We will return to this issue below.
3.6 Extensions
The simple version of the Solow growth function presented above provides the
basic intuition behind the important convergence property and the central role
played by physical capital accumulation. It abstracts, however, from numerous
factors that are believed to be central for economic growth to occur even in the
medium run. Two of the most important of these factors are technological progress
and human capital accumulation.
Let us further assume that the growth rate of technology is exogenously given by
Ȧt
=g>0
At
26 The long run
The intensive form is now written as κ = K /AL and is referred to as capital per
unit of effective labor. The time derivative of κ is
K̇ KL KA
κ̇ = − Ȧ − L̇
AL (AL)2 (AL)2
sY − δ K Ȧ L̇
= − κ − κ = s fˆ(κ) − κ(δ + g + n)
AL A L
where fˆ(κ) = F(K , AL)/AL. As in the section above, the economy will be in a
s 1
steady-state equilibrium when κ̇ = 0, which happens at a level κ ∗ = δ+g+n 1−α .
The equilibrium level of capital per unit of effective labor thus decreases with the
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Note that when κ = κ ∗ , the term inside the parentheses will be equal to zero. The
equilibrium growth rate of output per capita is then simply g > 0. Hence, rich
countries at their equilibrium growth rates will only grow through technological
progress. In the Solow model, this growth rate is exogenously given and we cannot
say anything very interesting about it. The main aim of endogenous growth theory,
which is the topic of the next chapter, is to derive this growth rate as the result of
intentional human investments in research and development (R&D).
κ̇ = sκ κ α ηβ − κ(δ + g + n)
η̇ = sη κ α ηβ − η(δ + g + n)
The parameters sκ > 0 and sη > 0 reflect the share of total output per unit of
effective labor ỹ = κ α ηβ that is invested in physical capital and human capital,
respectively. A share 1 − sκ − sη of total income is devoted to consumption. sη
might thus be seen as a country’s rate of investment in education. As discussed in
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Chapter 2, this rate was very low in most countries until about 1800. For simplicity,
it is assumed that both stocks have the same depreciation rate δ + g + n.
The steady-state levels κ ∗ and η∗ are found where κ̇ = η̇ = 0. Reaching the
solution involves more algebra than before since we first have to solve for one of
the two steady-state levels and then insert this solution into the expression for the
other. In the first step, we find from κ̇ = 0 that
1
∗ sκ (η∗ )β 1−α
κ =
δ+g+n
1
∗
sκα sη1−α 1−α−β
η =
δ+g+n
Downloaded by [Hacettepe University] at 20:58 10 April 2017
β 1−β α β
∗ ∗ α ∗ β sη sκ 1−α−β sκα sη1−α 1−α−β
y = At (κ ) (η ) = At
δ+g+n δ+g+n
β 1
sηα sκ 1−α−β
= At
(δ + g + n)α+β
Since At is the only factor of production that does not converge to a steady-state
level, the equilibrium growth rate of output per capita will be equal to g, as above.
4 Endogenous Growth Theory
Downloaded by [Hacettepe University] at 20:58 10 April 2017
4.1 Introduction
The Solow growth model has been criticized on two grounds: Firstly, the pre-
diction that all countries should converge to the same long run growth rate
does not appear to have materialized. Secondly, the engine of long-run growth,
technological progress g, is left unexplained by the model.
In this chapter, we will take a closer look at endogenous growth theory, which
attempts to explain how technological progress emerges as a consequence of more
or less active choices by the agents in the economy. In particular, we will ana-
lyze how a separate R&D sector interacts with sectors producing intermediate and
final goods. The presence of intellectual property rights means that intermediate
goods-producing firms might in practice obtain a monopoly on the good they are
producing. The later sections analyze the implications of incorporating assump-
tions of imperfect competition. The main contributors to this literature are Paul
Romer (1986, 1990) and Aghion and Howitt (1992).
Due to the focus on technological progress, the current chapter is mainly rele-
vant for relatively advanced developed economies where R&D plays a key role.
Most countries in the world do not carry out any advanced R&D and only imi-
tate the innovations carried out in Western countries. Section 4.6 contains a model
where a social planner in a country can choose between imitation and innovation.
4.2 AK model
In a famous article, Romer (1986) argues that the tendency towards diminishing
returns, which is a central feature of the Solow growth model, is perhaps exagger-
ated. According to Romer, it also appears as if certain types of policy have a more
lasting impact on growth rates than in the Solow model. The simple solution to
these aspects was offered by Rebelo’s (1991) AK model.
Let us assume that the correct specification of the long-run production func-
tion is
Y (t) = AK (t) (4.1)
where K (t) is now interpreted as a broad aggregate of physical and human capital
and A is a fixed technology parameter. As before, the model is driven by a capital
30 The long run
accumulation function
K̇ = s AK − δ K (4.2)
where s is the savings rate and δ is the rate of physical capital depreciation. Capital
per worker is k(t) = K (t)/L(t) and the growth rate of the work force equals the
growth rate of the population, so that L̇/L = n > 0. The dynamics for capital per
worker is then
∂k K̇ K L̇
= − = s Ak − δk − nk (4.3)
∂t L L L
Downloaded by [Hacettepe University] at 20:58 10 April 2017
ẏ k̇
= =sA−δ−n (4.4)
y k
The key thing to notice in this expression is that any change in policy-related
parameters like s and n will now have permanent effects on the growth rate. For
instance, an increase in savings rate s will lead to a permanently higher growth
rate. This result stands in contrast to predictions from the Solow model (see (3.7),
where an increase in s only had a transitory effect on growth rates). A drawback
is of course that the factors that drive long-run growth are still exogenous to the
model.
Two things are particularly noteworthy about equations (4.5) and (4.6). Firstly,
A is available to the full amount in both sectors. The reason for this assumption
is that ideas are inherently nonrival in character, i.e. one person’s use of an idea
(for instance, the blueprint for a new engine) does not preclude another person’s
simultaneous use of the same idea. Unlike ideas, most factors of production like
capital and labor are rival, for instance in the sense that an actual engine can either
be used in factory 1 or in factory 2, not in both at the same time. Labor is also a
rival production factor since a fraction a is employed in R&D and 1 − a in the
final goods sector.
Secondly, as we will see, the nature of the dynamics in this model will hinge on
the level of the output elasticity of existing knowledge, θ . This parameter should
be thought of as describing how useful older knowledge is for the creation of new
knowledge. θ < 1 implies that there is a kind of “fishing-out” effect in the sense
that there are diminishing returns to the existing stock of technological knowledge
in finding new ideas. We will return to this issue below.
A final assumption in the model is that labor grows at a rate n > 0 as before.
Ȧ Ba γ L(t)γ
= g A (t) = Ba γ L γ Aθ−1 = (4.7)
A A(t)1−θ
As in the Solow model, a balanced, steady-state growth rate will exist when g A (t)
is constant over time, i.e. when ∂g∂tA (t) = ġ(t) = 0. From inspection of (4.7), we
know that this will only happen if L(t)γ grows at the same rate as A(t)1−θ . One
way of finding this level is to start by assuming that ġ A = 0. The time derivative
of the growth rate in (4.7) is most easily found by first taking logs,
ln g A = ln B + γ ln(a L) + (θ − 1) ln A
32 The long run
and then taking the time derivative of the log:1
∂ ln ga ġ A L̇ Ȧ
= = γ + (θ − 1) = γ n + (θ − 1)g A
∂t gA L A
ġ A = [γ n + (θ − 1)g ∗A ]g ∗A = 0 (4.8)
What type of equilibrium we will have will thus crucially depend on the level of θ .
In the case where θ = 1, so that there are no diminishing returns to existing
knowledge, as assumed in the original models of Romer (1990) and Aghion and
Howitt (1992), the expression in the square brackets in (4.8) will not be zero and
the only existing steady state in the current setting is g ∗A = 0. Note also that the
(positive) growth rate of technological knowledge in this case will be g A (t) =
B(a L)γ . An increase in the R&D labor force a L, caused by increases in either the
share a or the size of the total labor force L, will thus increase the growth rate of
technology. There will further never be a stagnation in growth rates.
In the case where θ < 1, on the other hand, (4.8) will be satisfied if
γ n + (θ − 1)g ∗A = 0 =⇒ γ n = (1 − θ )g ∗A
γn
=⇒ g ∗A =
1−θ
A steady-state growth rate thus exists and will not depend on the level of the R&D
labor force. This “semi-endogenous” growth result, due to Jones (1995), is thus
free of the scale effect of R&D labor inherent in Romer (1990) and Aghion and
Howitt (1992). Instead, only the growth rate of the population n will matter in
the long run. Since output per capita equals Y/L = A(1 − a), g ∗A will also be the
growth rate of GDP per capita.
R = P A B(a L)γ Aθ − wa L
where P A is the price of the innovations that the R&D firm produces. For now,
we will take that price as exogenously given. The first-order conditions for profit
maximization give us
∂ F
= −AL + wL = 0
∂a
Downloaded by [Hacettepe University] at 20:58 10 April 2017
∂ R
= γ P A Ba γ −1 L γ Aθ − wL = 0
∂a
In equilibrium, the wage costs of the two firms should thus be the same. The
conditions above imply that
γ P A Ba γ −1 L γ Aθ = AL
Using this term, we can solve for the optimal sectoral allocation a ∗ :
∗ γ −1 A1−θ L 1−γ
a = =⇒
γ PAB
1−θ 1−γ γ −1
1 1
A L γ P A B 1−γ 1
a∗ = =
γ PAB A1−θ L
Given that γ < 1, the allocation of labor to R&D should thus increase with the
price of innovations P A and with R&D productivity B and decrease with L. The
reason for the last observation is that workers face diminishing returns in the R&D
sector, whereas we assume here constant returns in the final goods sector. Since
we assume that L grows with an assumed rate of n > 0, this should imply that the
share of workers in R&D should fall with time. Similarly, if θ < 1 so that there
are diminishing returns to the existing stock of technological knowledge, then a ∗
decreases with A. Note, however, that if θ = 1 as in the steady-state result above,
then a ∗ will be independent of the level of knowledge.
One of the key points with the result above is that a ∗ will be a positive function
of the price of innovations P A . But how is that price determined if ideas are a
nonrival good? Fundamentally, the price of patents will depend on the institutions
in society for intellectual property rights. If such property rights are weak, the
price will be low and there will be weak incentives for people and firms to engage
in R&D. On the positive side, without property rights to ideas, there would be
no monopoly power for users of a particular innovation and anyone could com-
mercialize any existing technological idea. According to the model, however, as
34 The long run
P A goes to zero, a ∗ will also tend towards zero, so that no new ideas would be
produced.
In the next section, we will extend the model and show how the price of inno-
vations is determined in a market with monopolistic intermediate goods producers
who buy patents on innovations from an R&D sector.
mediate goods firms affects the model. We will make the simplifying assumption
that labor is only used in the final goods sector.2
• A final goods sector that employs all L workers as well as intermediate capital
goods as factors of production.
• An intermediate goods sector using no labor but producing intermediate goods
that are used as factors of production (physical capital) in the final goods sector.
The patents for the intermediate goods are bought from the R&D sector and
each intermediate goods producer is a monopolist on the good that they have
acquired a patent on.
• An R&D sector that produces patents for intermediate goods and sells them to
the intermediate goods producers.
Final goods (equivalent to total measured output in the economy) are produced
by a representative firm according to the function
A
Y = L 1−α (X j )α = L 1−α (X 1α + X 2α + · · · + X αA ) (4.9)
j =1
∂Y
= αL 1−α X α−1 for all j (4.10)
∂Xj j
Endogenous growth theory 35
The marginal product of good j is thus independent of the levels of all other
intermediate goods. One might think of the intermediate goods in the model as
major inventions that are neither substitutes for nor complements to each other.
The price of final goods is assumed to be 1.
The representative final goods producer maximizes profits:
A
A
max F = L 1−α (X j )α − Pj X j
X
j =1 j =1
where P j is the price of intermediate good j . The usual first-order conditions yield
Downloaded by [Hacettepe University] at 20:58 10 April 2017
∂ F
= αL 1−α X α−1 − Pj = 0 (4.11)
∂Xj j
Condition (4.11) states the familiar result that firms should acquire capital good j
up to the level where its value marginal product equals its marginal cost, which
is P j . Equation (4.11) can be rewritten so as to obtain an expression for the final
goods sector’s demand for good j :
1
α 1−α
Xj = L (4.12)
Pj
Demand increases linearly with the work force but decreases, as expected, with
the price charged by the intermediate firm.
Each intermediate good is produced by a single intermediate goods producer
who has a patent on the good in question for a period of T years and hence a
monopoly on its production. No labor is used in this sector. The marginal cost
of producing one unit of good j is simply assumed to be equal to 1. The profit
function is therefore given by
Ij = (P j − 1)X j (P j ) (4.13)
∂ Ij
= X j (P j ) + P j X j (P j ) − X j (P j ) = 0
∂ Pj
Pj 1
P j = X j (P j ) (1 − P j ) = − (1 − P j )
X j (P j ) 1−α
Note that we have inserted the derived expression for the price elasticity of demand
on the right-hand side. By rearranging the last equation so as to isolate P j , we
finally obtain the profit-maximizing price for the monopolist and the correspond-
ing level of demand from the final goods sector and profits for the intermediate
goods sector:
1
P j∗ =
α
Since α < 1, we can infer that P j∗ > 1, i.e. the monopolistic intermediate goods
producer will charge a price higher than the marginal cost, which is equal to 1. In
microeconomics textbooks, this phenomenon is known as mark-up pricing.3
When we have solved for P j∗ , the corresponding equilibrium levels of final
sector demand and intermediate sector profits can be derived to be
2 1+α
X j (P j∗ ) = α 1−α L, Ij = (P j∗ − 1)X j (P j∗ ) = (1 − α)α 1−α L (4.14)
2 2
Y ∗ = L 1−α Aα 1−α L = Aα 1−α L 2−α (4.15)
In accordance with intuition, total output of final goods therefore increases with
the number of intermediate goods A, which in this model is an indicator of the
state of technological knowledge.
Finally, then, we can describe the R&D sector that produces patents and sells
them to intermediate goods producers. The patentable invention of one additional
Endogenous growth theory 37
design for a new intermediate good is made using a production function BA(t),
where B is a productivity parameter as before and A is the existing stock of patents
at time t. The output is certain and always results in a new design that can be
patented.4 As before, we assume that knowledge about previous patents is fully
accessible to the R&D firm. A(t) is thus a nonrival production factor since it is
available to the full extent both in the final goods sector and in the R&D sector,
although it is partially excludable since only one intermediate goods producer has
a patent and is allowed to produce the patented good commercially. There is free
entry into the sector.
If the R&D firm chooses to make the invention, this will entail a fixed cost η > 0.
As we shall see, this fixed cost will be very important for the determination of the
price P jA that the R&D firm charges for the invention.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
• First stage: The R&D firm decides on whether to invent a new good or not.
• Second stage: Once the new good has been invented, the R&D firm decides on
what price P jA to charge the intermediate goods producing firm j for the patent.
The intermediate firm produces the good and it is sold to the final goods sector,
where it is used in the production of final goods.
T
1+α
T
V j (0) = Ij β t = (1 − α)α 1−α L βt (4.16)
t=0 t=0
where β ≤ 1 is a time discount factor. Clearly, the lower is β, the lower is the
present value of the firm. If we make the greatly simplifying assumption that β = 1,
1+α
then V j (0) = T (1 − α)α 1−α L.
38 The long run
The R&D firm observes the intermediate goods firm’s value function and will
bid up the price of the patent until P jA = V j (0). A higher price P jA > V j (0) is not
possible; in that case the intermediate goods producer will not buy the patent. The
R&D firm will thus squeeze out all the profits that the intermediate goods firm can
make.
There must also be equilibrium in the capital market. In buying the patent, the
intermediate firm makes an investment of size P jA . During a given time period,
the returns to this investment must not be less than the returns that the firm could
have received by investing in a risk-free asset at an interest rate r > 0. Hence, in
equilibrium it must be the case that rP Aj = Ij .
In a general equilibrium model, there must be equilibria on all markets. In the
R&D sector, finally, if P jA > η, then new R&D firms will enter the sector and
Downloaded by [Hacettepe University] at 20:58 10 April 2017
1+α
Ij
(1 − α)α 1−α L
r= A = (4.17)
Pj η
A(0)
1+α
Ij = A(0)T (1 − α)α 1−α L
j =1
The aggregate market value will increase linearly with the number of existing
inventions A, with the duration of patents T , and with the size of the labor force L.
That market value increases with the number of inventions (and hence of interme-
diate goods producers) is not surprising. The market value increases also with L.
The reason is that the demand for intermediate capital goods in the final goods
sector increases linearly with L. More workers means that the marginal product of
a new machine is high and hence that demand for that machine is high.
T might be thought of here as the strength of intellectual property rights, where
a large T means strong rights.5 What would happen in our model if the strength
of such rights suddenly fell? In the short run, this would cause a disequilibrium,
which would lead to a number of responses. To start with, the market value of
firms V j (0) would fall. This would in turn mean that the price for patents charged
by the R&D sector P jA would be too high for the intermediate goods producers. In
the short run, R&D firms will therefore choose not to produce any new inventions
and exit from the sector. A new equilibrium can only be restored if also the fixed
costs of invention η fall.
Endogenous growth theory 39
A value of T = 0 would be equivalent to saying that intellectual property rights
do not exist. In such a world, any intermediate goods producing firm can use any
invention without paying for it and there would be no monopolies and no mark-up
pricing. This might have positive welfare effects in the short run since it implies
a lower price of intermediate goods (it would fall from 1/α to 1, i.e. the marginal
2
cost of production in (4.13)) and total demand would increase from α 1−α L to L
1
(since α = 1). Total output would also increase to Y = Aα 1−α L 2−α = AL. How-
ever, on the downside, A would cease to grow in our model, since as long there is
some positive cost of invention η > 0, there is no incentive whatsoever to produce
new ideas that cannot be sold. The issue of whether intellectual property rights is
a good or a bad thing thus entails a trade-off in the model as outlined here; the
Downloaded by [Hacettepe University] at 20:58 10 April 2017
absence of property rights removes monopolies and increases welfare in the short
run but decreases the production of inventions in the long run.
N
Y = L 1−α (q λ j X j )α = L 1−α [(q λ1 X 1 )α + (q λ2 X 2 )α + · · · + (q λ N X N )α ]
j =1
(4.18)
Just like before, L is the total labor force and X j is the the quantity used of the
intermediate capital good j . N > 0 is now the fixed number of intermediate goods
in the economy. The key new feature of this production function is the inclusion of
a quality indicator q λ j , where q > 1 and the exponent λ j ≥ 1 is a discrete number
1, 2, 3, . . . reflecting how many times that sector j has undergone innovation. For
simplicity, it is assumed that in each sector, each new innovation improves quality
40 The long run
by a factor of q. Different sectors might have had different innovation intensity so
that, for instance, it is possible that λ j > λ j +1 . As has probably become evident, λ j
is now our indicator of technological progress.
The other key difference is that a new superior innovation in sector j is
assumed to appear after Tλ j ≥ 1 periods. Tλ j is now random and takes on a value
1, 2, 3, . . . with a certain probability distribution.6 When a new innovation of qual-
ity q λ j +1 > q λ j then appears, the existing monopoly ends. The firm is assumed to
be risk-neutral and does not take any extra precautions due to this risk.
Apart from these two new assumptions, the model has the same structure as
in Romer (1990). It can be solved by following the same steps as above. Since
intermediate firms are monopolists, they will charge a mark-up price P j∗ = 1/α.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Through the usual profit-maximization conditions, final sector demand for the
2 αλ j
most recent version of good j can be shown to be X ∗j = Lα 1−α q 1−α , which is the
αλ j
same expression as in (4.14) except that it is multiplied by q 1−α . Note that final
sector demand thus increases with quality. Since the price is the same as before,
this is not surprising.
Let us assume that the R&D firm has come up with a new innovation at time
0 and that the intermediate firm considers whether to buy the patent or not. The
expected discounted value of the intermediate firm at time 0 would then be
E 0 (Tλ j ) E 0 (Tλ j )
1+α αλ j
E 0 (V j ) = Ij β t = (1 − α)Lα 1−α q 1−α βt
t=0 t=0
αλ j
The noteworthy features of this expression are the two terms q 1−α and E 0 (Tλ j ).
Rapid technological progress in this model means that a relatively large number
of innovations happen all the time. On the one hand, this means that at time 0, λ j
should be relatively large, which would imply a relatively high E 0 (V j ). But on the
other hand, rapid technological progress would also mean that the expected dura-
tion of monopoly should be short, i.e. E 0 (Tλ j ) should be small and the expected
value of the firm relatively small. If the latter effect dominates, firms will be dis-
couraged from entering the sector. In this manner, creative destruction does not
have an unambiguously positive effect on the economy.
The current level of technology At is thus a result of a process exploiting last year’s
Downloaded by [Hacettepe University] at 20:58 10 April 2017
level of knowledge in the country At−1 as well as the level at the world technology
frontier, Āt−1 . The two terms are multiplied respectively by effort levels and the
parameters μ, γ > 0 that describe the (time-invariant) usefulness of imitation and
innovation for knowledge growth in the country. Furthermore, At−1 is multiplied
by the level of human capital Ht−1, capturing the general level of education in the
country. This reflects the assumption that innovation is inherently more reliant on
skills than imitation.7
If we divide expression (4.19) by At−1 and subtract At−1 /At−1 = 1 on both
sides, we can write
At − At−1
= gt−1 = eμdt−1 + γ (1 − e)Ht−1 − 1
At−1
where dt−1 = Āt−1/At−1 ≥ 1 is a measure of the country’s distance from the world
technological frontier. If dt−1 is large, imitation is relatively effective.
How should a social planner in a country optimally devote effort to imitation
and innovation? The simple answer is that all effort should be devoted to imitation
if the marginal product of imitation exceeds the marginal product of innovation,
i.e. if μdt−1 > γ Ht−1 , and to innovation if the opposite result holds. The main
insight from this expression is that in countries that are far from the technological
frontier (a high dt−1 ) and where human capital levels are relatively low, the growth
of technological knowledge is optimally driven solely by imitation and is given by
gt−1 = μdt−1 − 1. On the other hand, for countries at the technological frontier
where dt−1 = 1, it will be the case that imitation is not possible, that the optimal
level of imitation effort is e = 0, and that gt−1 = γ Ht−1 − 1. Note also that if the
level of human capital were a choice variable for governments, it would in this
model be pointless to increase, for instance, levels of engineering skills (Ht−1) if
the country is mainly an imitator, since such skills are only useful for innovation.
5 The Overlapping Generations Model
Downloaded by [Hacettepe University] at 20:58 10 April 2017
The growth models presented so far were representations of how macro vari-
ables and profit-maximizing firms are hypothesized to behave in relation to each
other. None of the results were based on an analysis of household behavior over
time, i.e. on the behavior of and choices made by utility-maximizing individu-
als and households who live for more than one period and who care about the
future. The overlapping generations (OLG) model outlined below now introduces
a micro-founded model of economic growth and intertemporal choice that will be
employed as a “workhorse” model in several chapters to follow.
One of the key advantages of the OLG model is that it allows us to derive
an endogenous saving rate. In the Solow model, the saving rate s was simply
exogenously determined.
The OLG model was initially presented by Diamond (1965) and Blanchard
(1985). We will henceforth carry out the analysis in discrete time, i.e. we con-
sider periods t, t + 1, t + 2, . . . , instead of a continuous time framework. As we
shall see, the insights from a discrete time framework are complementary to those
already shown.
u (c)
1
u ′(c)
Downloaded by [Hacettepe University] at 20:58 10 April 2017
1/4
c
c1 c2
Figure 5.1 A strictly concave utility function and its associated marginal utility.
where β ≤ 1 is a time discount factor. The closer β is to unity, the greater the
patience of individuals. If β instead is very low, the individual discounts the future
a lot and thinks consumption when young is more valuable in relative terms. The
utility function satisfies the usual assumptions of a positive but strictly diminishing
marginal utility, so that u (c j,t ) > 0 and u (c j,t ) < 0 at all c j,t > 0. It is further
time-separable since u (c1,t ) is independent of c2,t+1 .
The utility function and its associated marginal utility curve are shown in
Figure 5.1. In order to understand more intuitively how the curves are linked,
consider for instance the point on u(c) where the slope is equal to unity. The level
of c at which this happens is c = c1 so that u (c1 ) = 1. At some higher level of c,
for instance at c = c2 > c1 , we have that u (c2 ) = 1/4 < u (c1 ). It is important to
understand this basic theme of utility functions because it will appear again and
again in this text: as c increases, u(c) increases whereas u (c) falls.
Individuals earn an income y1,t during young age in period t and an income
y2,t+1 when they are old. What is not consumed in the first period can be saved for
old age. The individual will then get an interest rate rt+1 on their savings, which
amount to st = y1,t − c1,t ≥ 0. However, savings may also be negative, such that
st = y1,t − c1,t < 0. In that case individuals borrow for consumption in the first
period at the same interest rate rt+1 .2 Consumption during old age is thus
If we isolate the income terms on the right-hand side, we get what is usually
referred to as the individual’s intertemporal budget constraint:
c2,t+1 y2,t+1
+ c1,t = + y1,t (5.3)
(1 + rt+1 ) (1 + rt+1 )
44 The long run
What this expression says is that the present value of a person’s current and future
consumption must be equal to the present value of his or her lifetime incomes.3
How much should the individual optimally consume in each period? In order to
solve this problem, we can start by inserting (5.2) into the utility function in (5.1)
and take the first-order condition for a maximum:
∂Ut
= u (c1,t ) − βu (c2,t+1 )(1 + rt+1 ) = 0
∂c1,t
u (c1,t
∗ )
This fundamental result will be derived many times in the chapters to come.
It implies that an individual should optimally consume so that the relative
marginal utility of consumption when young, u (c1,t ∗ )/u (c ∗
2,t+1 ), exactly equals
∗
β(1 + rt+1 ), where c j,t denotes an optimal level. What does this imply for the
optimal relative level of consumption c1,t ∗ /c ∗
2,t+1 ?
Consider, for instance, a case where individuals are patient so that β is close to
unity and where rt+1 is relatively high. In that case, it is likely that β(1 + rt+1 ) > 1.
According to (5.4), we must then optimally have chosen consumption levels such
that u (c1,t
∗ )/u (c ∗ ∗ ∗
2,t+1 ) > 1 applies as well. Note that u (c1,t ) > u (c2,t+1 ) must
(due to diminishing marginal utility as illustrated in Figure 5.1) imply that c1,t ∗ <
∗
c2,t+1 . Hence, if individuals are patient and if interest rates are high, consumption
should be greater in old age than during youth, and vice versa.
With a general utility function like that in (5.1), we cannot get any further than
interpreting the optimality condition in (5.4). We are not able get any explicit
solution for either c1,t∗ , c∗
2,t+1 , or the level of saving. If we want to achieve closed-
form solutions, we need to assume a more specific utility function.
∗
c2,t+1
∗ = β(1 + rt+1) (5.5)
c1,t
From this condition, we can achieve closed-form solutions for our intertempo-
∗
ral choice of consumption. Insert c2,t+1 ∗ into the left-hand side
= β(1 + rt+1 )c1,t
The overlapping generations model 45
of (5.2) and solve for ∗
c1,t to find
When we have solved for c1,t , it is of course easy to also solve for c2,t+1 :
∗
we already know from before that c2,t+1 increases with people’s patience β and
with the interest rate rt+1 . However, note that an increase in rt+1 with incomes
∗ . This might be referred to as a substitution effect: the
held constant will lower c1,t
individual substitutes consumption in young age for consumption in old age.
We can also solve for the optimal level of savings:
Not surprisingly, savings increase with y1,t , β and with rt+1 . Note, however, that,
all else being equal, a net increase in y2,t+1 will lead to a decrease in savings.
Here ρ ≥ 0 is a time discount rate and θ is the Arrow–Pratt measure of relative risk
aversion (individuals are risk-averse). The CRRA utility function is particularly
useful since it can encompass several different types of utility, depending on the
value of θ . A θ < 0 means that the individual is risk-loving (since marginal utility
c−θ increases with the level of c), θ = 0 means risk neutrality, whereas θ > 0
implies a risk-averse individual. It can further be shown that if θ = 1, then u(c j,t ) =
ln c j,t .4 The case most often studied will be θ ∈ (0, 1).
We now assume, for simplicity, that individuals earn a wage income when
young, y1,t = wt , and that an old individual has zero income, y2,t+1 = 0. Hence,
young individuals must save some of their labor income when young in order to
consume when old:
st = wt − c1,t (5.7)
46 The long run
When old, individuals consume their savings from the previous period plus the
interest earned from these savings:
The objective function in (5.6) and the constraint in (5.9) define a constrained
Downloaded by [Hacettepe University] at 20:58 10 April 2017
∂ −θ
= c1,t −λ=0
∂c1,t
−θ
∂ c2,t+1 λ
= − =0
∂c2,t+1 1 + ρ (1 + rt+1)
−θ (1+r
−θ c2t+1 t+1 )
Since c1t = λ and 1+ρ = λ, we can write
−θ ∗ 1
−θ
c2,t+1 (1 + rt+1 ) c2,t+1 1 + rt+1 θ
c1t = =⇒ ∗ = = 1 + gc,t (5.10)
1+ρ c1t 1+ρ
In other words, the relative consumption of the individual when in old age can be
described as 1 + gc,t , where
1 1
c2,t+1 − c1t (1 + rt+1 ) θ − (1 + ρ) θ
gc,t = = 1
c1t (1 + ρ) θ
is the growth rate of consumption between t and t + 1. Note that gc,t can be pos-
itive or negative. gc,t will depend positively on the interest rate rt+1 , negatively
on the time discount rate ρ, and negatively on the risk-aversion parameter θ . The
greater is ρ, the greater is the individual’s impatience and hence the lower is their
consumption when old. A θ that approaches 1 further means that the utility func-
tion has a markedly concave curvature and that the marginal utility of consumption
The overlapping generations model 47
diminishes fast (θ = 1 implies that u(c j,t ) = ln c j,t ). Such individuals will also
consume relatively less in the future than if they entertained a θ that is close to 0.
The result in (5.10) is often referred to as the “discrete Ramsey” result.6 It
will play an important role in the sections ahead. Note that the requirement for
a positive growth rate is that rt+1 > ρ. The next step is to derive an expression
for rt+1 .
∗
c2,t+1 (1 + rt+1 )st
∗ = = 1 + gc,t
c1,t (wt − st )
and that
(1 + gc,t )wt wt
st = =
2 + rt+1 + gc,t (1 + rt+1 )/(1 + gc,t ) + 1
wt
st = θ−1 1
(5.11)
(1 + rt+1) θ (1 + ρ) θ +1
Saving will thus be a positive function of wt and of rt+1 (since θ < 1). Both are
unknown at this stage. However, by introducing firms into the analysis, we might
also derive wt and rt+1 . Not surprisingly, saving decreases with the time discount
rate ρ.
5.2.1 Firms
Firms produce according to the standard neoclassical aggregate production
function
Yt
Yt = F(K t , L), where = f (kt ) and Yt = L f (kt )
L
48 The long run
We assume that labor and capital are paid their marginal products:
∂Yt 1
rt = = L f (kt ) = f (kt )
∂ Kt L
∂Yt Kt
wt = = f (kt ) − L f (kt ) 2 = f (kt ) − f (kt )kt
∂L L
1
(k ∗ )α (1 − α) 1 − α 1−α
k∗ = =⇒ k ∗ =
2+ρ 2+ρ
Note that this is the equivalent of the steady-state level of k derived in the Solow
model. The key difference is that ρ enters this expression since we have now
explicitly taken into account an individual’s preferences. k ∗ will decrease with ρ
just like the growth rate of consumption in (5.10) decreases with ρ. Hence, the
greater the individual’s patience, the lower is ρ and the greater is the steady-state
level of the capital stock. It is further easily derived that the equilibrium level of
α
output per capita is y ∗ = (k ∗ )α = ( 1−α
2+ρ )
1−α .
The overlapping generations model 49
5.3 Endogenous growth
A key result from the OLG model is that the optimal intertemporal growth rate of
consumption is given by
1
c2t+1 − c1t ∗ 1 + rt+1 θ
= gc = −1
c1t 1+ρ
In the OLG model, rt+1 was found as in the analysis above. Let us now assume
instead a world that is described by Romer’s R&D model in Section 4.4 with
the only exception that we now also specify household optimizing behavior as in
Section 5.1 of the OLG model. Recall from (4.17) that in a general equilibrium, the
Downloaded by [Hacettepe University] at 20:58 10 April 2017
1+α
Ij = (1 − α)α 1−α L = r P jA = r η
1−α L 1
1 + 1 (1 − α)α 1+α
η θ
gc∗ = −1
1+ρ
The growth rate will increase with the number of workers L. As explained above,
this “scale effect” in the Romer model arises since more workers means a greater
demand for intermediate goods. The growth rate will fall with the cost of inventing
new goods, η. As before, the growth rate decreases with the level of the behavioral
parameters θ and ρ.
Note that the equation above further implies that gc∗ > 0 only if η1 (1 − α)
1+α
α 1−α L > ρ. If people are generally impatient so that ρ is high and if the cost
of invention η is high, then this criterion for positive growth might not be satis-
fied. It further suggests that a certain population size is necessary for R&D to be a
viable path to consumption growth.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Part II
From the long run, we now move to the short and medium run, by which we will
here mean time periods of less than five years. Long-run growth is the average
growth rate over decades, but in the shorter run there can be substantial cycli-
cal variations in GDP, referred to as business cycles. The theory of real business
cycles (RBCs) is associated mainly with Kydland and Prescott (1982) and Long
and Plosser (1983). See also the overview in Rebelo (2005).
Early RBC models were in part inspired by the empirical record showing that
business cycles did not appear to follow any systematic cyclical patterns. There
was also a dissatisfaction with the Keynesian type of explanations emphasizing
market failures in the form of, for instance, wage rigidities. Keynesian theory fur-
ther lacked a micro foundation, i.e. the modeling was not based on the decisions
of optimizing individuals. The RBC models instead proposed a framework based
on individual behavior where the engine of the cyclical behavior was real shocks,
induced by technological change and government spending, for instance, rather
than nominal or monetary effects as emphasized by Keynesian theory. A further
difference from previous modeling was the introduction of leisure in the utility
function of the individual. This novelty made it possible to derive implications for
the intertemporal substitution of labor supply.
ln Yt = α ln K + (1 − α) ln N + (1 − α)(ln lt + ln At ) (6.2)
= + (1 − α)(lnlt + ln At )
where is the exogenous part. The sources of variation in output will therefore
be working hours lt and technology At .
The development of technological knowledge is assumed to follow the follow-
ing process:
Downloaded by [Hacettepe University] at 20:58 10 April 2017
ln At = Ā + gt + Ãt (6.3)
In this expression, Ā is the initial level of technology, g is the trend growth rate of
technological knowledge (as in the Solow model), t is time, and Ãt is a stochastic
shock to the trend. More specifically, the stochastic component is given by
where ρ A ∈ (0, 1) is a parameter indicating the persistence of past shocks for the
current level of technology and t is an error term such that E(t ) = 0 for all t.
A positive technology shock might, for instance, be the sudden appearance of a
new and drastically improved computer program or a breakthrough in transporta-
tion technology. A negative technology shock to trend growth could arise if the
implementation of some existing technology unexpectedly stalled during some
period.
In technical terms, the process described above follows a first-order autore-
gressive process (AR(1)) since Ãt depends on the corresponding level one period
before. Inserting (6.4) into (6.3) gives us
ln At = Ā + gt + ρ A Ãt−1 + t
where K ti is the physical capital held by firm i at time t, L it is the labor employed
Downloaded by [Hacettepe University] at 20:58 10 April 2017
by firm i , At is nonrival technology available to all firms, and rt is the interest rate
on capital.
The first-order condition for profit maximization for labor is
∂it
= (K ti )α A1−α
t (1 − α)(L it )−α − wt = 0
∂ L it
where the left-hand side in the middle expression is the marginal product of labor.2
Rearranging terms gives us an expression for firm i ’ s demand for labor, L i,D :3
1
(1 − α)A1−α α
L i,D
t = Kt
i t
(6.6)
wt
What is immediately evident from this expression is that labor demand will
increase with a positive technology shock that increases At . Analogously, a
negative technology shock will decrease labor demand. Labor demand will also
increase with the firm’s stock of capital K ti since that increases the marginal
product of labor.
RBC models assume a perfectly functioning economy where all markets are
in equilibrium. Hence, an increase in labor demand should generally mean
that aggregate employment (in terms of working hours lt ) should increase. In
the empirical literature, the prediction about a positive relationship between
technology shocks and employment has been extensively discussed.
6.3 Households
Labor supply is determined by the households and involves a trade-off. On the
one hand, more work means more income and higher consumption. On the other
hand, more work means less leisure, which is now assumed to be a part of the
individual’s utility function. The lifetime utility of an individual who lives for two
periods and who receives utility from consumption and leisure is given by the
56 The short and medium run
function
U = U (c1 , c2 , 1 − l1 , 1 − l2 ) (6.7)
= ln c1 + b ln(1 − l1 ) + β[ln c2 + b ln(1 − l2 )]
c2 = (1 + r )(w1l1 − c1 ) + w2l2
c2 w2 l 2
c1 + = w1 l 1 + (6.8)
1+r 1+r
By utilizing (6.7) and (6.8), we can set up the individual’s maximization problem
as a Lagrangian:
In this optimization problem, we let labor supply l1 and l2 be the choice variables.
The first-order conditions are
∂ b
=− + λw1 = 0 (6.9)
∂l1 (1 − l1∗ )
∂ bβ λw2
=− + =0 (6.10)
∂l2 (1 − l2∗ ) 1 + r
By isolating the λs on the right-hand sides in (6.9) and (6.10), we can write
b bβ(1 + r )
λ= =
(1 − l1 )w1 (1 − l2∗ )w2
∗
1 − l1∗ 1 w2
= (6.11)
1 − l2∗ (1 + r )β w1
Equilibrium business cycles 57
What we have achieved in (6.11) is an expression for the utility-maximizing rela-
tive leisure in period 1. Since it is not possible to obtain explicit solutions for
l1∗ and l2∗ , we have to base our analysis on (6.11).4
Let us assume that there will be a known fall in the relative wage w2 /w1 .
Such a fall means that (1 − l1∗ )/(1 − l2∗ ) optimally must fall too, implying that
relative labor supply l1∗ /l2∗ must rise. In other words, a fall in the relative wage
in period 2 means that individuals will work more in period 1, thus causing a
temporary increase in total output in (6.2).
As another example, consider an increase in the real interest rate r . Just as in
the previous case, such an increase means that relative first-period labor supply
l1∗ /l2∗ must increase. The intuition is that a rise r makes first-period labor income
Downloaded by [Hacettepe University] at 20:58 10 April 2017
relatively more valuable (since only first-period consumption can be saved), which
leads to an intertemporal substitution of labor supply towards more work in the
first period. This rational response to a real shock also leads to a boom in the
first-period total income Yt .
As the examples above have indicated, RBC models are often too complex to
be solved analytically. Therefore, scholars in this tradition use advanced methods
of simulation where realistic parameter values are plugged in before a variation in
some real variable is made.
7 Financial Crises
Downloaded by [Hacettepe University] at 20:58 10 April 2017
In real business cycle models of the basic kind as above, all markets work effi-
ciently, including the financial market. According to that logic, there is therefore
no point in modeling a separate financial sector. However, the deep financial crisis
that hit the world economy in autumn 2008 seems not to lend much support to
an efficient financial market hypothesis. A recurring feature of financial crises
rather appears to be bank runs, i.e. that a large number of depositors withdraw
their savings at the same time, potentially causing banks to become insolvent and
collapse.1 In some serious cases, this might even cause a systemic banking crisis
when a country’s whole banking system is close to collapse.2 The macroeconomic
consequences on, for instance, economic growth and fiscal balances are typically
substantial and go beyond normal business cycle downturns (Reinhart and Rogoff
2009a, b).
In this section, we present a model of bank runs from Chang and Velasco (2001),
who in turn build upon Diamond and Dybwig (1983). It is mainly meant to capture
an “emerging market” with an open economy and a fairly advanced banking sector.
Recent events suggest that it might also be used to describe the situation in, for
instance, the United States.
c11−θ c1−θ
U =ρ + (1 − ρ) 2 (7.1)
1−θ 1−θ
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Note that a patient individual who invests all her initial endowment e as well as
the maximum amount she can borrow on the international market f in the risk-free
project will consume e R + f (R − 1) in period 2. Clearly, only individuals who
know that they are impatient will choose to invest in the world market rather than
in the domestic project (since this will give them c1 = e instead of c1 = e(1 − r )).
7.2 Banks
Since individual types are private information, people have an interest in pooling
risk and money. Let us therefore assume that they form a bank together where
they pool all their resources and that aims to maximize the utility of the repre-
sentative individual. The individual can withdraw money for consumption either
in period 1 or in period 2. The total amount (per depositor) that the bank invests
in the investment project is k > 0. Furthermore, the bank can borrow money on
the international capital market. Let d be the amount of “long-term” borrowing
per depositor in period 0 to be repaid in period 2 and b be the equivalent num-
ber in period 1 to be repaid in period 2. Since it is beneficial to borrow abroad
and invest domestically, it will always be the case that d + b = f . The prevalence
of impatient individuals who want to consume in period 1 implies that the bank
needs to liquidate part of the investment project at this time. l < k is the size of
this liquidation.
The timing of events in the model is the following:
What are the socially optimal levels of c1∗ , c2∗ , and l ∗ , i.e. the solutions that
maximize the joint welfare of both patient and impatient individuals? To start with,
recall that early liquidation is associated with a loss, which means that l ∗ = 0 must
be the socially optimal level.4 Hence ρc1∗ = b in period 1, i.e. the bank will satisfy
impatient depositors’ consumption needs only through international borrowing.
In period 2, we will further have that (1 − ρ)c2∗ + f = Rk (recalling that l ∗ = 0).
What we want to do is to rewrite the conditions above into an intertemporal
Downloaded by [Hacettepe University] at 20:58 10 April 2017
(1 − ρ)c2∗ = Rk − f = R(d + e) − f = R( f − b + e) − f
= R( f − ρc1∗ + e) − f
and then rearrange the resulting expression so that c1∗ and c2∗ end up on the left-
hand side:
The term w thus describes the individual’s initial endowment plus the returns to
the investment project and might be interpreted as the economy’s wealth (per indi-
vidual). It also serves as the budget constraint for the individual in its utility
maximization.
In order to maximize the utility function in (7.1) subject to (7.2), we set up the
Lagrangian
c11−θ c1−θ
=ρ + (1 − ρ) 2 + λ[Rw − Rρc1∗ − (1 − ρ)c2∗ ]
1−θ 1−θ
∂
= ρc1−θ − λRρ = 0 (7.3)
∂c1
∂
= (1 − ρ)c2−θ − λ(1 − ρ) = 0 (7.4)
∂c2
The solution to this maximization problem can be found by using the following
procedure. First, by combining (7.3) and (7.4), it can be shown that the equilib-
rium levels of consumption are given by c2∗ /c1∗ = R 1/θ . Inserting the value for c1∗
Financial crises 61
into the budget constraint in (7.2) eventually allows us to express the closed-form
equilibrium levels:5
The key thing to remember from these solutions so far is that the socially opti-
mal situation is that no premature liquidation occurs, i.e. that l ∗ = 0, but this might
still be the actual outcome if too many investors withdraw their money in the first
period. We will analyze this aspect more closely next.
Rk − f (1 − ρ)c2∗
l+ = = = w(1 − α) (7.6)
R R
Imagine now that all depositors come and want to withdraw money in the first
period, perhaps due to a bank panic. The bank pays out c1∗ to each of them, in
accordance with the contract. How long can the bank make such payments?
The answer is that withdrawals (per individual) can be made up to the bank’s
total liquidation value b + rl + . This is simply the maximum amount of cash assets
that the bank can provide in the short term. Should consumption withdrawals
exceed short-term foreign loans b plus the highest attainable (premature) project
liquidation value rl + , the bank becomes insolvent and must close down. The for-
eign loans are then paid back already in period 1, impatient and some patient
62 The short and medium run
individuals get their money back, but a number of patient individuals run the risk
of losing their savings.
More specifically, if all individuals withdraw their money in period 1, the
amount (per individual) will be c1∗ . A bank run equilibrium is said to exist if
If this situation prevails, banks cannot survive a sudden withdrawal of all indi-
vidual savings. z + might therefore be referred to as a measure of the bank’s
illiquidity.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
If we insert the equilibrium values derived above into (7.7), we find that
αw
z + > 0 if c1∗ − (b + rl + ) = − [αw + r w(1 − α)] > 0
ρ
It is clear that wealth w is just a scalar in this expression and will not matter for
determining the sign. Removing w and recalling that α is given by (7.5) enables
us to show that
Clearly, if θ ≥ 1, then a bank run equilibrium will exist, since R > r . In the
often assumed range of θ ∈ (0, 1), (7.8) will not be satisfied and there will not
exist a bank run equilibrium. In other words, the more risk-averse individuals are,
the more likely that a bank run equilibrium will exist. Whether the equilibrium
actually materializes or not depends on individual strategies and on the partic-
ular nature of the strategic interaction between individuals. The analysis above
describes the fundamental characteristics of the economy that need to be in place
for a bank run to be possible.
Rk − d
la = (7.9)
R
Financial crises 63
Since d < f , it will be the case that la > l+ (see (7.6)). The bank’s level of
illiquidity is
d
z a = c1∗ − rl a = c1∗ − r k − (7.10)
R
1
= c1∗ + r d − 1 − re
R
By comparing this level with that in (7.7) and using (7.6) and (7.9), we can deduce
that
Downloaded by [Hacettepe University] at 20:58 10 April 2017
r
z a − z + = b + r (l + − l a ) = b 1 − >0
R
In other words, if foreign creditors will not provide loans in period 1 in case of
a panic, the bank has a higher illiquidity and is more vulnerable than otherwise,
despite the fact that the maximum level of liquidation is higher. This is of course
well in line with intuition.
It is even possible that an international refusal to lend might cause a bank run.
If the international creditors announce in period 1 that b is not available to the
bank, contrary to expectations, depositors will realize that the bank’s vulnera-
bility has increased and will therefore try to withdraw as much as possible in
period 1, potentially causing the bank to collapse. In this sense, it does not matter
if b becomes unavailable through domestic conditions or due to a crisis abroad.
It also shows how foreign financial crises can have serious repercussions on the
domestic financial sector.
Using the same procedure as before and comparing z a and z b , we can easily see
that z b > z a , implying that this type of short-term debt rearrangement increases
the risk of a bank run even further.
64 The short and medium run
7.6 Liberalizing international credit markets
It seems to be an important policy issue to analyze what happens when interna-
tional credit markets are liberalized in the sense that f , the international borrowing
constraint, is loosened so that f rises. Several empirical studies on financial crises
seem to suggest that such major economic upheavals are usually preceded by
financial liberalization.
To start with, we know from (7.2) that the economy’s wealth increases with an
increase in f since foreign credit has a zero interest rate and can earn a return
R > 1 for every dollar invested. From (7.5), we also see that consumption in both
periods will increase. In this sense, a deregulation of the international financial
market would increase social welfare.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
However, if we assume that the bank lives in a world of short-term debt where
b = 0 and d = f , as just described, we can rewrite (7.11) as
αw
z b = c1∗ + f − r k = + f − r ( f + e)
ρ
αw
= + ( f − αw)(1 − r ) − r e
ρ
α f (R − 1) 1
= e+ + f 1−α+ − αe (1 − r ) − r e
ρ R R
where we have substituted in the expression for w to the right on the second line.
What comes out very clearly from this equation is that z b increases with f . In
other words, if debts can be easily canceled in the case of a bank run, an increase in
capital inflows will increase financial fragility and increase the bank’s illiquidity.6
In this sense, a liberalization of international financial markets is rational ex ante
because it increases the socially optimal level of consumption, but is typically
regarded as irrational after a financial crisis since increased capital inflows make
banks more vulnerable to bank runs.
8 Consumption and Saving
Downloaded by [Hacettepe University] at 20:58 10 April 2017
The key new assumption in the permanent income hypothesis (PIH) is that deci-
sions regarding current consumption are only based on permanent income rather
than on the current income:
Ct = YtP (8.2)
The wider implications of this model will be explored in greater detail in the
next section, where a more complete version of the model is presented.
T
U= β t u(ct ) = u(c0 ) + βu(c1 ) + β 2 u(c2 ) + · · · + β T u(cT ) (8.3)
t=0
where ct is individual consumption and where the function has the standard prop-
erties u (ct ) > 0 and u (ct ) < 0. There is thus always a positive marginal utility
of consumption, but marginal utility is declining as consumption increases. The
individual is now assumed to live for T + 1 > 0 years.3 u(ct ) is referred to as
the instantaneous utility function and is stable over time. β ≤ 1 is a time discount
factor, as before.
We assume for simplicity that the individual has a known stream of exogenously
future incomes y0 , y1 , y2 , . . . , yT and that income tax rates are zero. The lifetime
budget constraint (or permanent income) is
T
ct yt
T
= (8.4)
(1 + r )t (1 + r )t
t=0 t=0
Consumption and saving 67
where r ≥ 0 is a time-invariant interest rate. There is further a perfect credit market
so that individuals can borrow money if yt < ct , as long as the lifetime budget
constraint in (8.4) is satisfied.
Although this intertemporal constraint is really just an extension of the two-
period constraint from the OLG model in (5.3), it is worth elaborating a little
exactly how (8.4) arises. Consider, for example, a case where T = 2. In this case,
consumption in the last period must be
That is, the indidual will consume all net savings from previous periods
(y1 − c1 )(1 + r ) + (y0 − c0 )(1 + r )2 plus last period income y2 . If we isolate the
consumption terms on the left-hand side, we can write
2
c2 + c1 (1 + r ) + c0 (1 + r )2 = ct (1 + r )t
t=0
2
= y2 + y1 (1 + r ) + y0(1 + r )2 = yt (1 + r )t
t=0
Normally, we want to rewrite the budget constraint so that it expresses the present
value in period 0, i.e. when the decision about the consumption plan is made.
By dividing both sides by (1 + r )2 , we end up with the expression in (8.4) when
T = 2, which shows that the present value of lifetime consumption must be equal
to the present value of lifetime incomes.
T
T yt
T
ct
= β t u(ct ) + λ − (8.5)
(1 + r ) t (1 + r )t
t=0 t=0 t=0
where λ > 0 is a standard Lagrange multiplier. Note that the choice variables for
this optimization are the levels of consumption for each time period: c0 , c1 , c2 , etc.
68 The short and medium run
The first-order conditions for a maximum are thus
∂
= u (c0 ) − λ = 0 (8.6)
∂c0
∂ λ
= βu (c1 ) − =0
∂c1 1+r
...
∂ λ
= β T u (cT ) − =0
∂cT (1 + r )T
u (ct∗ )
∗ ) = β(1 + r ) for all t ∈ {0, 1, 2, . . . , T − 1}
u (ct+1
8.4 An example
In order to develop the intuition for the PIH result further, let us make the highly
simplifying assumption that β = 1 and that r = 0. In that case, the Euler equa-
tion reduces to u (c0∗ ) = λ = u (c1∗ ) = · · · = u (cT∗ ). In other words, optimally, the
marginal utility of consumption should be the same in all periods. Since u(ct ) is
strictly concave, there can only be one unique level of ct where marginal utility
(the slope of the utility function) is u (ct ) = λ. Hence, the individual will opti-
mally set a level of consumption such that ct∗ = ct+1 ∗ = c ∗ = · · · = c ∗ . This result
t+2 T
of identical optimal levels of consumption throughout life is sometimes referred
to as consumption smoothing.
The fact that the optimal level of consumption is identical throughout an
individual’s lifetime T means that we can reformulate the budget constraint as
T
T
ct = (T + 1)ct∗ = yt
t=0 t=0
Dividing both sides by life length T + 1 gives us the key expression for
consumption in the PIH:
T
t=0 yt
ct∗ = (8.7)
T +1
8.4.1 Saving
In the PIH, savings (positive or negative) are used to smooth the time pattern of
consumption. Current income is yt = ct + st . Hence, the expression for optimal
saving in the PIH is
Downloaded by [Hacettepe University] at 20:58 10 April 2017
T
t=0 yt
st∗ = yt − ct∗ = yt − (8.8)
T +1
where st∗ is positive if current incomes are relatively large and negative if current
income in period t is lower than the optimal level of consumption.4
An important implication of the expression (8.8) is that whereas the level of
consumption is constant over time and is insensitive to the time distribution of
incomes y1 , y2 , . . . , yT , savings are very sensitive to current incomes. To be more
precise, the derivative of st∗ with respect to yt is 1 − 1/(T + 1) > 0 whereas the
derivative of ct∗ with respect to yt is only 1/(T + 1). The sensitivity of savings with
respect to current incomes is larger if T is large. A prediction from these results
would thus be that savings should display a greater degree of variation over life
than consumption levels.
T τ −1
T
cτ∗ = yt − T cτ∗ = yt − τ cτ∗ + yt − (T − τ )cτ∗ (8.9)
t=0 t=0 t=τ
T
= Aτ −1 + yt − (T − τ )cτ∗
t=τ
−1
In this expression, τt=0 yt − τ cτ∗ = Aτ −1 shows accumulated incomes minus total
consumption spendings from t = 0 to τ − 1. If Aτ −1 > 0, the individual has positive
assets at the time of analysis τ , and negative if Aτ −1 < 0.
70 The short and medium run
If we isolate cτ∗ on the left-hand side and then divide both sides by T + 1 − τ ,
we obtain
T
∗ Aτ −1 + t=τ yt
cτ =
T +1−τ
A commonly shown version of this expression is when it is assumed that τ = 1.
The expression then collapses into the very simple form
T
A0 + t=1 yt
c1∗ = (8.10)
T
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Note that, in this case, A0 = y0 − c0∗ , so that it shows accumulated savings from
previous periods. The net level of assets A0 can be positive or negative, depending
on the individual’s incomes during young age.
In some of the expositions below, we will work with versions of an intertempo-
ral maximization problem that builds on the logic of expression (8.10).
ct, yt
yt
ct
Downloaded by [Hacettepe University] at 20:58 10 April 2017
time
1 2 3
Figure 8.1 Example of consumption and income over the life cycle.
assume cmpc = 1/2 and Yt = 10, then Ct ≈ 5. However, in the PIH scenario, an
identical increase in income should increase consumption only by Yt /T . If we
assume a middle-aged person in the Western world at 40 years who can expect to
live for another 40 years, the increase in consumption will be only 10/40 = 0.4.7
The more far-reaching implication of this example is that short-run policy
efforts aimed at boosting consumption are likely to be fairly successful if we
assume a Keynesian consumption function, but they will be very ineffective in
the PIH framework.
T
a 2
E 1 (U ) = E 1 ct − ct (8.11)
2
t=1
72 The short and medium run
E t is an expectations operator indicating that expectations are made at time t.
The expectations operator has the standard properties used in statistical the-
ory.9 Note that the expression (8.11) can be written out in full as E 1 (U ) =
E 1 (c1 − a2 c12 ) + E 1 (c2 − a2 c22 ) + · · · + E 1 (cT − a2 cT2 ). It is further central to under-
stand that expectations about future levels of consumption are all made at time
t = 1. For simplicity, we still assume that the time discount factor is β = 1.
The instantaneous utility function (inside the parentheses) u t (ct ) = ct − a2 ct2
is also different from what we have seen before. a > 0 is a constant parameter.
Marginal utility of ct is u t (ct ) = 1 − act > 0 if ct < 1/a. This means that if
consumption levels exceed 1/a, marginal utility of consumption will actually be
negative. Although this might be an interesting scenario, we will assume levels of
consumption such that ct < 1/a. The second derivative is u t (ct ) = −a < 0, which
Downloaded by [Hacettepe University] at 20:58 10 April 2017
ensures concavity.
The budget constraint is equivalent to the one implied above with the exception
that we have now introduced uncertainty about lifetime incomes. A0 = y0 − c0∗ has
the same interpretation as above and r = 0:
T
T
E 1 (ct ) ≤ A0 + E 1 (yt ) (8.12)
t=1 t=1
In other words, at the time of decision, the individual expects to consume the same
amount at all future dates as he or she currently consumes.
Inserting this result into the budget constraint yields the key result of the
random-walk model:
T
1
c1∗ = A0 + E 1 (yt ) (8.14)
T
t=1
This expression is identical to the one in (8.10) except for the expectations opera-
tor E 1 . The interpretation is also similar to the one above: the individual consumes
every period a fraction 1/T of his or her expected lifetime resources. A noteworthy
feature is that the presence of uncertainty does not really
T seem to affect the indi-
vidual’s consumption choice. The individual values t=1 E 1 (yt ) as equivalent to
T
t=1 y t in (8.7). Hall’s random-walk model is therefore sometimes described as
being characterized by certainty equivalence. We will relax this assumption below.
Consumption and saving 73
8.5.1 Introducing a stochastic component
In order to describe how uncertainty can enter the model, let us imagine that actual
consumption at some date t is given by
In this simple expression, et is a stochastic error term with the property that
E t−1 (et ) = 0. et reflects that there is an element of uncertainty since actual con-
sumption might turn out to be higher (et > 0) or lower (et < 0) that what was
expected at t − 1. Furthermore, we know from (8.13) that E t−1 (ct∗ ) = ct−1
∗ . Hence,
T
T
1
e2 = E 2 (yt ) − E 1 (yt ) (8.16)
T −1
t=2 t=2
In other words, if the error term is different from zero, expectations about the flow
of incomes over the discrete interval from t = 2 to t = T must have changed from
period 1 to period 2.11 Whether aggregate consumption actually follows a random
walk as proposed by Hall (1978) has been a source of numerous empirical studies
in macroeconomics.
u (ct ) = E t u (ct+1 ) =⇒
1 − act = E t (1 − act+1) =⇒ 1 − a E t (ct+1 ) = u [E t (ct+1 )]
A key technical aspect of this expression is that the expected marginal utility
of ct+1 equals the marginal utility of the expected value of ct+1 : E t u (ct+1 ) =
Downloaded by [Hacettepe University] at 20:58 10 April 2017
u [E t (ct+1 )]. This is a feature that follows from the quadratic utility function in
the random-walk model, and the exact meaning of this will be demonstrated below.
One problem with the quadratic utility function is, as mentioned above, that
it allows for a situation where u (ct ) < 0 at ct > 1/a. A negative marginal
utility of consumption at high levels of consumption is problematic in terms
of realism. Note also that in the quadratic case, u (ct ) = 0. If, however, we
assumed u (ct ) > 0, marginal utility would be a negative and convex function
of consumption, as illustrated in Figure 8.2.
If the only new assumption is that u (ct ) > 0 at all ct > 0, then E t u (ct+1 ) must
be higher than before (due to the convexity of u (ct )). At the old level of ct , we
then have that u [E t (ct+1 )] < E t u (ct+1 ), i.e. a violation of the Euler condition.
In order to restore the equilibrium in the Euler equation, there must be an increase
in u [E t (ct+1 )] = u (ct ). This can only come about through a decrease in ct .
A reduction in consumption at time t means an equivalent increase in that period’s
savings. Hence, the new assumption that u (ct ) > 0 leads to precautionary saving.
u ′(ct )
u ′′′(ct ) > 0
u ′′′(ct ) = 0
ct
1/a
Figure 8.2 Precautionary saving and the third derivative of the utility function.
Consumption and saving 75
u ′(c )
Et u′(c )
u ′(c )
Downloaded by [Hacettepe University] at 20:58 10 April 2017
c
c** c* c a
cL′ cL cH c′H
T
1 ct1−θ
U= (8.17)
(1 + ρ)t 1 − θ
t=0
76 The short and medium run
As before, θ ∈ (0, 1) is the relative risk-aversion parameter and ρ ≥ 0 is the time
discount rate.
The individual’s intertemporal budget constraint in the multi-period case is
T
ct yt
T
= (8.18)
(1 + r )t (1 + r )t
t=0 t=0
t
1+r θ
ct∗ = c0∗ (8.19)
1+ρ
One of the more interesting aspects of this expression is that we are not likely to
have consumption smoothing any more. If, for instance, r > ρ (which should be the
case in dynamic, thrifty societies), then consumption will increase exponentially
over time and will not display a random walk.12 Figure 8.4 shows examples of the
optimal consumption paths for three different scenarios: r > ρ, r = ρ, and r < ρ.
As before, the optimal growth rate of consumption is given by
∗ − c∗ 1
ct+1 1+r θ
= gc∗ =
t
−1
ct∗ 1+ρ
c *t
r>ρ
r=ρ
c0
r<ρ
t
t=0 T
Figure 8.4 Optimal consumption paths over time for different levels of r and ρ.
Consumption and saving 77
8.8 Relative consumption
Recent research, inspired by experimental studies, has demonstrated that the indi-
vidual’s utility is determined not only by her own absolute level of consumption
but also by her relative level, compared with some reference point R. The refer-
ence point might, for instance, be the neighbor’s level of consumption – in which
case the individual will have so-called “keeping up with the Joneses” concerns
– or it could be derived from past levels of her own consumption, in which case
“habit formation” plays a role in utility.13
As a simple illustration of these ideas, consider the following utility function
for two periods with properties similar to that in Bowman et al (1999):
Downloaded by [Hacettepe University] at 20:58 10 April 2017
The first assumption means that the individual will always be better off the greater
her own level of consumption is compared to the reference point Rt . The second
assumption simply means that if consumption is at the reference point, utility from
that period will be equal to u(ct ). The third assumption is more far-reaching and
implies that the individual has loss aversion. This means that her gain–loss func-
tion is concave above the reference point and convex below. The deeper meaning
is that individuals are even more sensitive to losses than in the standard concave
utility setting, which is what defines loss aversion.
The reference point might further be endogenous to the decisions made in the
model. It is common to assume an exogenous reference point in the first period R1 .
R2 might, however, depend on c1 :
An α = 0 simply means that the reference point is constant in the two periods.
If α > 0, however, R2 is influenced by c1 . If c1 > R1 , then R2 > R1 and the
individual is subject to habit formation.
A key implication of the prevalence of habit formation is that the intertemporal
consumption choice is no longer time-independent, as in the PIH.14 The individ-
ual’s consumption decision for period 1 will affect utility in period 2 since a very
high level of consumption in period 1 will make the individual “addicted” to a
high level of consumption, which will affect how she values c2 .
78 The short and medium run
In the case where α = 0, so that the reference point is some exogenous factor,
it is usually assumed that the individual will compare her own consumption with
that of some other person, perhaps the average consumption level in the country
or the consumption of neighbors or friends. The exact implications of such rela-
tive consumption comparisons depend on the characteristics of the u(ct ) and v(t)
functions. Research on the impact of reference-dependent utility is currently very
intensive, and it remains to be seen how it will change macroeconomic models of
aggregate consumption.
u (ct )
= β(1 + r ) (8.23)
u (ct+1 )
whereas the same calculation for the the initial period 0 and period 1 gives us
u (c0 )
= γβ(1 + r ) (8.24)
u (c1 )
Note that the only difference from the standard result is the inclusion of γ < 1. In
this sense, preferences are clearly time-inconsistent.
Consumption and saving 79
If we also assume the usual CRRA utility such that u(ct ) = ct1−θ /(1 − θ ), then
1
we get the standard Ramsey result that ct+1 /ct = [β(1 + r )] θ for all t > 1 but that
c1
c0 = 1
(8.25)
[γβ(1 + r )] θ
Since γ < 1, this implies that the individual will consume relatively more in the
first period than if γ = 1, as in the time-consistent, standard case. In each period
of the individual’s remaining life, there will then be a bias present that will make
the individual consume relatively more in the current period. This will not be
sustainable, since the lifetime budget constraint is the same as before, and sharp
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Investments are efforts aimed at increasing a stock of capital, usually physical cap-
ital.1 A country’s investments are closely associated with its savings, as shown in
the neoclassical growth model. In this chapter, we will present the standard neo-
classical model of investment, based on the profit maximization of an individual
firm, and derive central results such as Tobin’s marginal q and the user cost of
capital. We will also discuss the nature of adjustment costs and how their char-
acteristics affect firms’ investment decisions. The account of investment theory
outlined here relies on Branson (1989), Caballero (1997), and Romer (2005).
In Section 9.4 we will briefly consider the housing market and analyze the
determinants of housing demand and equilibrium price levels.
T
t T
1
V (0) = = [Pt F(L t , K t ) − wt L t − PtI It ] (9.1)
(1 + r )t (1 + r )t
t=0 t=0
In this expression, t is total profits in the economy at time t, r is the interest rate,
Pt is the price (index) of firms’ total produced output Yt = F(L t , K t ) (which is a
Investment and asset markets 81
function of labor L t and physical capital K t ), wt is the wage rate, PtI is the price
of the investment good, and It is total gross investment. The production function
F(L t , K t ) is characterized by ∂ F(L t ,K t ) = F > 0 and F > 0. The lifetime of
∂ Lt Lt Kt
the firm is from 0 to T .
Capital accumulates according to
K t+1 = K t + It − δK t = K t (1 − δ) + It (9.2)
T
t
max V (0) = s.t. K t+1 = K t (1 − δ) + It at each t = (0, 1, . . . , T )
L t ,K t ,It (1 + r )t
t=0
(9.3)
T
1
T
= [Pt F(L t , K t ) − wt L t − PtI It ] + λt [It + K t (1 − δ) − K t+1]
(1 + r ) t
t=0 t=0
(9.4)
∂ 1
= (Pt FL t − wt ) = 0 (9.5)
∂ L t (1 + r )t
∂ Pt f K t
= + λt (1 − δ) − λt−1 = 0 (9.6)
∂ K t (1 + r )t
∂ PtI
=− + λt = 0 (9.7)
∂ It (1 + r )t
Note that the λt−1 in (9.6) comes from the fact that K t also appears in the
restriction for period t − 1: λt−1 [It−1 + K t−1 (1 − δ) − K t ].
I
PtI Pt−1
λt = , implying that λt−1 =
(1 + r )t (1 + r )t−1
82 The short and medium run
Substituting this result into (9.6) yields
I
Pt FK t PtI (1 − δ) Pt−1
+ − =0
(1 + r )t (1 + r )t (1 + r )t−1
Pt F
Kt
If we first isolate (1+r)t on the left-hand side and then multiply both sides by
(1 + r ) /Pt , we obtain
t
δ PtI + r Pt−1
I − (P I − P I )
t t−1 Ct
FK t = = (9.8)
Pt Pt
Downloaded by [Hacettepe University] at 20:58 10 April 2017
What we have derived is the real user cost of capital Ct /Pt . The user cost of
capital Ct contains three terms: the depreciation cost of investing in capital δ PtI ,
plus the interest cost of holding capital during t valued at t − 1, r Pt−1
I , minus the
possible increase in the price of the investment good (which an investor benefits
from), PtI − Pt−1
I . The nominal user cost C is divided by the price index P to
t t
yield the real user cost of capital.
The expression in (9.8) implies that firms should optimally make investments
up to the level where the marginal product of capital FK t equals the real user cost
Ct /Pt . This equilibrium level is implicitly given in (9.8) such that
K t∗ = K (Yt , Ct , Pt )
Yt = AK tα L 1−α
t (9.9)
where A is some positive productivity parameter and 0 < α < 1 is the output elas-
ticity of capital. With this function, we have a marginal product of capital equal to
FK t = AαK tα−1 L 1−α
t = αYt /K t . Inserting this value back into (9.8), we get
αYt Ct
=
K t∗ Pt
α Pt Yt
K t∗ =
Ct
In other words, the optimal level of capital increases with the price level Pt and
with total output Yt and decreases with the (nominal) user cost Ct .
Investment and asset markets 83
9.2.3 Tobin’s q
The expression for the real user cost of capital might be rearranged into
Pt FK t + PtI (1 − δ) − Pt−1
I
(1 + r ) = 0
1 Pt FK t + PtI (1 − δ)
=1 (9.10)
1+r I
Pt−1
Downloaded by [Hacettepe University] at 20:58 10 April 2017
The term on the left-hand side is referred to as Tobin’s marginal q after Nobel
Prize laureate James Tobin. It shows the change in the value of the firm at t of a
marginal increase in the capital stock at t − 1, Pt FK t + PtI (1 − δ), divided by the
cost of acquiring that marginal increase Pt−1I , and discounted back to period t − 1,
1/(1 + r ).
The increase in the value of the firm arises from an increase in revenue Pt FK t
and from an increase in the value of its capital PtI (1 − δ). Note that FK t decreases
with K t due to diminishing returns. In equilibrium, the discounted increased value
I . If the firm
of the firm should be equal to the cost of a marginal unit of capital Pt−1
is not in equilibrium so that Tobin’s q is greater (or smaller) than unity, then the
marginal benefit of a small increase (decrease) in the capital stock will exceed the
marginal cost. Thus the capital stock will be expanded until it reaches the optimal
level K ∗ where Tobin’s marginal q is one, as required.
T
1
[Pt F(L t , K t ) − wt L t − PtI It − A(It )] (9.11)
(1 + r )t
t=0
∂ −PtI − A (It )
= + λt = 0 (9.12)
∂ It (1 + r )t
84 The short and medium run
while the other conditions are as before. This further means that
PtI + A (It )
λt = (9.13)
(1 + r )t
The expressions indicate that the introduction of adjustment costs will make
the marginal cost of investment higher than before. This in turn implies that
investments should generally be smaller than without adjustment costs.
where P I − P S is the difference between purchase and the potential sale prices. If
this difference is large, the firm will only buy the unit of capital if it is quite sure
that it is not going to need to sell it in the near future.
A second type of adjustment costs is one where there is a substantial fixed cost
component:
where F > 0 is a fixed cost and where adjustment also has a flexible component
κ (It ) > 0. A fixed cost could, for instance, be that a firm must make a large dis-
crete change in how its goods are produced after the investment. Irreversibilities
and fixed adjustment costs both imply that investments should appear in a dis-
continuous (lumpy) fashion, with long periods of inactivity ended by investment
“spikes”.
A third and quite different kind of adjustment costs are those that are biased
towards small, incremental changes:
a It2
A(It ) = (9.16)
2
over its lifetime is the choice of whether and when to buy a place to live. In
making this decision, the household members need to consider their optimal
mix of consumption of ordinary goods and the utility provided by owning a
house.
If we think of this as a static problem, let us denote the size of the housing
stock as H (perhaps reflecting the number of square meters), which is multi-
plied by p H (the price per unit of housing) in order to get the total value of
the household’s housing stock at a given time, p H H .3 If a house is bought, the
household borrows the whole amount. In each period of time, the household has
to pay r p H H in interest. For simplicity, we assume that no repayments need to
be made during the period under consideration. In addition to this, the household
needs to pay for repair and maintenance an amount δp H H , where δ > 0 is the
depreciation of the housing stock. The total cost of housing in one period is thus
(r + δ) p H H .
The household earns an income y during each period. This income is used for
consumption c and for housing such that
c + (r + δ) p H H ≤ y
U = ln c + η ln H (9.17)
In this function, η > 0 shows the relative importance given to housing. If η > 1,
then the household gets a higher relative marginal utility from housing than from
consumption of other goods.
The household’s choice is the optimum amount of housing to hold. If we insert
the budget constraint into the utility function, we can write
U = ln[y − (r + δ) p H H ] + η ln H
86 The short and medium run
Taking the first-order condition for a maximum, we obtain
∂U (r + δ) p H η
=− + =0
∂H Y − (r + δ) p H H H
This condition can be rearranged to solve for the household’s demand for housing:
ηy
HD= (9.18)
p H (r + δ)(1 + η)
will equal supply, so that H S = H D . This implies in turn that the equilibrium price
of housing is
ηy
p H,∗ =
(r + δ)(1 + η)H S
Hardly surprisingly, the price will decrease with the short-run supply of
houses H S . Clearly, the greater the existing stock of houses, the lower the price.
A more interesting implication of the result above, and one that is constantly rel-
evant for millions of households, is the fact that the equilibrium price of houses
decreases with the interest rate r . An unexpected lowering of the interest rate, per-
haps due to a monetary policy intervention by the central bank, will lead to a shift
in the demand curve outwards and a temporary excess demand for housing. This
will eventually cause house prices to rise to a new and higher level ( p H,∗∗ ), as
illustrated in Figure 9.1.
pH
HS
p H,**
p H,*
H D,1
p H,*** HD
H D,2
HS
Figure 9.1 Equilibrium price of housing following falls in interest rates and in income.
Investment and asset markets 87
Note also the impact of a fall in household income y, perhaps due to an
economy-wide recession. In the short run, prices will typically be slow to adjust
from the initial level p H,∗ and there will be an excess supply of houses that will
be empty. Just like with wages, house prices tend to be sticky downwards. A price
decrease means that household wealth for homeowners decreases and people will
be reluctant to sell houses at a lower price than they paid for them (or for a lower
amount than the household has used as collateral for loans for consumption, as in
the United States). Hence, downward adjustment will be slow. The price should
eventually settle at the new lower equilibrium level p H,∗∗∗ .
Downloaded by [Hacettepe University] at 20:58 10 April 2017
10 Unemployment and the
Labor Market
Downloaded by [Hacettepe University] at 20:58 10 April 2017
LS(w )
w̃
w*
LD(w)
Downloaded by [Hacettepe University] at 20:58 10 April 2017
labor
(efficiency wage theory); firms want to cut wages but are prevented from doing
so by contracts and labor unions (insider–outsider theory); and unemployment
is largely due to worker heterogeneity and the substantial problem of allocating
workers to suitable jobs (search and matching models).
= F(eL) − wL (10.1)
∂
Downloaded by [Hacettepe University] at 20:58 10 April 2017
w̃
e (w̃) = 1 (10.4)
e(w̃)
The term on the left-hand side is the elasticity of effort with respect to the wage.
The wage at which this equality holds is w̃. Whether such a wage can be found
depends to a large extent on the character of the e(w) function. Figure 10.2 shows
one example where the efficiency wage equilibrium is w̃ > 0.
The firm’s labor demand is in turn implicitly given by the expression in (10.3).
U (w, e) = w − e (10.5)
where w is the wage received and e is the effort exerted. Utility is a negative function
of effort and a positive function of the wage. We further assume that individuals can
either exert e = 0 or some positive level e = ē > 0. Regardless of behavior, there
is a probability b per unit of time that the worker loses his or her job. In the case
Unemployment and the labor market 91
e(w )
e(w)
Downloaded by [Hacettepe University] at 20:58 10 April 2017
w
~
w
of shirking, there is an additional probability q > 0 of being fired. Note that q < 1,
which means that manager monitoring is imperfect; even in the case of shirking,
there is a probability 1 − q that the worker will manage to keep his or her job. If
the worker is unemployed, he or she will receive an unemployment benefit of w̄.
Workers are for simplicity assumed to maximize the expected present discounted
value of utility over an infinite life. The interest rate is r > 0.
The key choice for the worker is whether to shirk or not. The lifetime utility
from being an employed shirker is referred to as VES , whereas the lifetime utility
of an employed nonshirker is VEN . The so-called “fundamental asset equation” for
shirkers shows that the interest rate times the “asset” VES should be equal to the
“flow benefit” at a point in time (equal to the wage w) minus the expected change
in the value of the asset due to unemployment (b + q)(VES − VU ):
r VES = w − (b + q)(VES − VU ) (10.6)
A rational individual will not choose to shirk if VEN ≥ VES , a condition that
we will refer to as the “no-shirking condition” (NSC). If we use the expressions
in (10.8), a comparison between the two levels shows that
Downloaded by [Hacettepe University] at 20:58 10 April 2017
VEN ≥ VES =⇒
w − ē + bVU
w + (b + q)V
≥ =⇒
r +b+q r +b
q
w + (b + q)VU ≥ (w − ē + bVU ) 1 + =⇒
r +b
−wq r +b+q
≥ (bVU − ē) − (b + q)VU
r +b r +b
Expression (10.9) shows the wage that needs to be paid for workers not to shirk.
All firms are aware of the NSC and hence set wages such that VEN = VES = VE so
that no shirking is carried out.
Given that VEN = VES = VE , we can equate the right-hand sides of (10.6) and
(10.7) and insert VE :
w − (b + q)(VE − VU ) = w − ē − b(VE − VU ) =⇒
ē
VE − VU =
q
r VU = w̄ + a(VE − VU ) (10.11)
where a > 0 is the probability per unit of time that an unemployed worker finds a
new job. As mentioned above, w̄ > 0 is the level of unemployment benefits.
Inserting (10.11) and the result that VE − VU = qē into (10.10) gives us a closed-
form solution to the wage rate that satisfies the NSC:
Downloaded by [Hacettepe University] at 20:58 10 April 2017
ē
w = w̄ + ē + (a + b + r )
q
This critical wage increases with the required level of effort ē, with the interest
rate r , with unemployment benefits w̄, with the probability of finding a new job a,
and with the probability of becoming unemployed b, and decreases with the prob-
ability that a worker who shirks is caught. The term w̄ might be thought of as an
indicator of the generosity of unemployment benefits. If unemployment benefits
are high, workers have less to lose from being unemployed, and are therefore more
prone to shirk. Hence, firms must pay higher wages. Likewise, if it is easy for an
unemployed worker to find a new job, the expected cost of being unemployed is
small, the worker is more willing to shirk, and a higher wage has to be paid.
What will the level of employment be? In a steady-state equilibrium, the number
of people who become unemployed will be equal to the number of people who
find new jobs. The number of workers who become unemployed at any given time
is Lb, where L is the aggregate number of employed workers. The number of
unemployed workers finding new jobs at any given time is a(N − L). By setting
these equal to one another and solving for a, we get a = NLb
−L . This in turn implies
N −L
that a + b = N −L . Further, N = μ is the unemployment rate in the economy,
Nb
All this is still on the supply side of labor. On the demand side, we will look
at a representative individual firm, just as in the RBC model. Let us imagine
that the firm has a production function Q = F(L) so that the profit function is
= F(L) − wL. Let us further assume that F (N) > ē, i.e. if all potential work-
ers were employed, they would still have a marginal product above the required
effort level. Full employment would therefore be a good idea from the firm’s point
of view.
The usual profit maximization would give us the first-order condition
F (L) = w, as in the RBC model. In equilibrium the marginal product of labor
must therefore be equal to the NSC, i.e. F (L) = e + ( μb + r ) qe . This defines the
94 The short and medium run
equilibrium level of employment in the labor market. The higher the wage required
to keep workers from shirking, the greater the level of unemployment. Workers
who are unemployed would be willing to work for a wage below w̃, but cannot
credibly commit to not shirking at that wage and hence are not employed.
aim for wage levels that are beyond a full employment equilibrium. The theory of
insiders and outsiders in the labor market was pioneered by Lindbeck and Snower
(1986).
To see this more formally, let us assume a representative firm with profits
given by
= AF(L I + L O ) − w I L I − w O L O
∂ 1
= [ Ai F (L I + L O,i ) − Rw I,i ] = 0 (10.12)
Downloaded by [Hacettepe University] at 20:58 10 April 2017
∂ L O,i 2
∂ 1
= [−(L I + RL O,i ) + λU (w I,i )] = 0 (10.13)
∂w I,i 2
From (10.12), we find that the optimal level of outsider employment should sat-
isfy Ai F (L I + L ∗O,i ) = Rw I,i , which is the standard result that marginal product
should equal marginal cost. Comparing the good and the bad business cycle out-
comes, we see that the left-hand side will be larger when Ai = A G . Hence, in order
to restore the first-order condition, F (L I + L ∗O,i ) must fall, which it can only do
if L ∗O,G > L ∗O,B . Thus, firms will employ more outsiders in good times, which
makes sense. Importantly, an increase in union power such that R increases must
be balanced by an increase in F (L I + L ∗O,i ), which it can only do if L ∗O,i falls. In
this highly stylized sense, strong labor unions therefore increase unemployment.
From (10.13), we have another interesting implication. The first-order condi-
tion implies that L I + RL O,i = λU (w∗I,i ) at the optimal insider wage rate w∗I,i .
Recall that if A G prevails, then L ∗O,G will be relatively high, which in turn means
that w∗I,G < w∗I,B (since marginal utility decreases with w∗I,i ). Thus, insider wages
are countercyclical and will be higher in bad times. The intuition is that firms
and insiders reach this decision jointly in order to keep outsiders from getting
employed.
M(U, V ) = ηU β V 1−β
96 The short and medium run
where η > 0 is an efficiency parameter for new job creation, U = μN > M is the
number of unemployed people (the unemployment rate μ times the total size of
the labor force L), and V = v N is the number of vacancies in the economy (v is the
vacancy rate). The number of employed people in the economy is L = N − μN.
It is commonly assumed that the matching function displays constant returns to
scale, i.e. it can be described as Cobb–Douglas. Note that a marginal increase in
the number of unemployed will only increase the number of newly created jobs
by β M/U < 1.
If we divide M through by the total labor force N, we can rewrite the
expression as
M(U, V )
Downloaded by [Hacettepe University] at 20:58 10 April 2017
where q (θ ) = −ηβθ −β−1 < 0. A Poisson process means that in some given short
time interval t, the probability that one vacancy will be filled with one matched
worker is t · q(θ ). This probability thus increases with the unemployment rate μ
and with the length of the time interval t.
The Poisson arrival rate of a match for an unemployed worker is similarly
m(μ, v)
= ηθ 1−β = q(θ )θ (10.14)
μ
This function has some interesting properties. Let us assume that the time interval
under consideration is t = 1. Then (10.14) shows the probability that an unem-
ployed worker will find a job during one time unit. Consider now a small increase
in the unemployment rate μ. In that case, the matching rate per vacancy q(θ )
will increase. However, since there are also more unemployed people around to
compete for the vacancies, the labor market has tightened and the net effect on the
probability of finding a job is negative: ∂(m(μ, v)/μ)∂μ = −η(1 − β)θ 1−β /μ < 0.
As in the Shapiro–Stiglitz model, the total number of jobs lost during a certain
time interval is bL = b(N − μN), where b shows the fraction of the employed who
Unemployment and the labor market 97
lose their jobs. b might be thought of as the probability that an individual worker
succumbs to an exogenous shock during a short time interval t that makes her
lose her job.
In equilibrium, it should be the case that the flow of people into employment
is constant, L̇ = ηU β V 1−β − bL = 0. This also means that the number of jobs
created through matching (ηU β V 1−β ) should be equal to the number of jobs
destroyed (bL). Dividing through by N, we obtain the equilibrium condition
unemployment rate that satisfies (10.15) is also a steady-state level. To see why,
see Figure 10.3, which depicts job creation and job destruction as a function of μ.
The m(μ, v) curve is positive and concave in μ whereas the job destruction curve
is negative and linear.
If we start from a level μ > μ∗ , then we see that more jobs are created through
the matching process than are destroyed. This means that the unemployment rate μ
should decrease. The situation is of course the reverse if μ < μ∗ . Only at the
steady-state level μ = μ∗ will the system be in equilibrium.
The schedule in Figure 10.3 can also be used for analyzing various comparative
statics. Consider, for instance, an increase in the efficiency of matching η, perhaps
as a result of a new government unemployment program. An increase in η will
shift the m(μ, v) curve upwards at any given level of μ. The new steady-state
level is now μ∗∗ < μ∗ . An increase in v would have a similar effect, whereas an
increase in b would imply an increase in μ∗ .
mnew(μ,v)
b
m(μ,v )
b(1– μ)
μ∗∗ μ∗ μ=1 μ
∂B
∂μ∗ −(1 − β)ημ∗,β v −β −(1 − β)η
= − ∂∂vB = ∗,β−1 v 1−β + b
= <0
∂v ∂μ
ηβμ θ (ηβθ 1−β + b)
A negative relationship is certainly well in line with intuition, since more vacan-
Downloaded by [Hacettepe University] at 20:58 10 April 2017
cies should imply a lower rate of unemployment. This can also be deduced from
Figure 10.3, where a higher level of vacancies implies a higher level of match-
ing and a lower equilibrium unemployment rate. The negative association is well
documented empirically for many countries in the world.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Part III
Macroeconomic Policy
11 IS–MP, Aggregate Demand, and
Aggregate Supply
Downloaded by [Hacettepe University] at 20:58 10 April 2017
In this third part of the book, we will now shift our focus in order to analyze the
effects of macroeconomic policy. Most of this chapter will be based on the IS–MP
model of the goods and money markets. This model is not micro-founded since it
is not based on optimizing household behavior. Instead, it follows in the Keynesian
tradition of assuming certain behaviors of variables at the macro level. Only the
specifications of aggregate supply will rely on micro foundations. The analysis in
this chapter is therefore quite different from the analysis in most other chapters.
We start off with the traditional Keynesian framework where we discuss aggre-
gate expenditure and multipliers. We then derive the aggregate demand function
from equilibria in the goods and money markets. We also elaborate on the prope-
rties of the aggregate supply function under varying assumptions of price and
wage stability and provide an overview of the Lucas critique of traditional Keyne-
sian economic policy. After that, we present a new model that introduces financial
intermediation into the standard IS–MP framework. Finally, we also present some
of the main ideas in the so-called new Keynesian paradigm.
Yt = E t = ca + cmpc (1 − τ )Yt + It + G t
1
Yt = (ca + It + G t )
1 − cmpc (1 − τ )
total expenditure, E
ca + It + Gt
45°
Y
Y* Y **
Figure 11.1 The Keynesian cross and the multiplier impact of an increase in government
spending.
IS–MP, aggregate demand, and aggregate supply 103
output levelY ∗∗ > Y ∗ .
Note also that the multiplier effect implies that G t − G t <
∗∗ ∗
Y − Y , i.e. equilibrium output will increase by a larger amount that the initial
increase in G t .
Y = E (Y, r, G)
where we now use a general expenditure function E (Y, r, G). E increases with
output Y (through the consumption function) and with real government pur-
chases G and decreases with the real interest rate r (for instance, since investments
decrease with r ). Should Y exceed total expenditures, actual production will be
unused and inventories will accumulate.
We describe the partial derivatives with the following notation:
dY
dY
= EY +Er
dr
IS dr
IS
dY
Er
=
dr
I S 1 − E Y
Hence, the slope will be steeper the greater the sensitivities of planned expendi-
tures to the real interest rate r and to changes in incomes Y are.
104 Macroeconomic policy
11.2.2 The money market
The money market means the market for the supply and demand of what is
called “high-powered money”, including currency (coins and notes) and reserves
(on highly liquid bank accounts). The supply side of “real balances” is simply
the total nominal stock of money, M, divided by the aggregate price level in
society, P, measured for instance by a consumer price index.
The demand for real balances is determined by the nominal interest rate i , which
we will here express as the real interest rate r plus expected inflation π e so that
i = r + π e , and by the real level of income Y :
M
= L(r + π e , Y ), L r+π e < 0, LY > 0 (11.2)
Downloaded by [Hacettepe University] at 20:58 10 April 2017
The demand for money decreases with the nominal interest rate since the oppor-
tunity cost of holding money increases with the nominal interest rate that people
would receive if they made more investments that are long-term.2 In order to find
the slope of the MP curve in (Y, r ) space, consider an increase in r . If the equi-
librium in (11.2) is to remain in place, the decrease in money demand must be
balanced by an increase in Y . Hence, the MP curve will have a positive slope.
The key monetary policy variable is the real interest rate r , which is set by a
central bank. This interest rate (often referred to as the “repo rate”) is perfectly
transmitted to the rest of the economy. In setting its interest rate, the central bank
takes into consideration the output level Y as well as actual inflation π.3 The
behavioral rule is that central banks increase the real interest rate when output Y
increases or when inflation rises. Hence, r (Y, π) and the derivatives are rY > 0 and
rπ > 0.
Reformulating (11.2) gives us an expression for the level of the nominal money
supply, which will be endogenous to monetary policy:
M = PL(r (Y, π) + π e , Y )
Note the distinction that monetary policy reacts to actual inflation levels π whereas
the public, in this setting, adjusts its money demand through changes in expected
inflation π e .
r
A higher inflation shifts the MP curve
MP
IS IS′
Downloaded by [Hacettepe University] at 20:58 10 April 2017
that satisfies an equilibrium in the goods market must be higher than before. The
MP curve remains unaffected since government expenditures are assumed not to
affect the aggregate price level in the short run. The outward shift in the IS curve
implies a higher level of equilibrium output but also a higher interest rate, r .
dY
dY
dr
= EY + Er
dπ
AD dπ
AD dπ
AD
dr
dY
= rπ + rY
dπ AD dπ
AD
106 Macroeconomic policy
AS
AD
Downloaded by [Hacettepe University] at 20:58 10 April 2017
dY
dY
= EY + Er rπ + rY
dπ
AD dπ
AD dπ
AD
dY
=⇒ (1 − E Y − Er rY ) = Er rπ
dπ
AD
dY
E r rπ rπ
= = 1−E
dπ AD 1 − E Y − Er rY
Er − rY
Y
Since the numerator is positive while the denominator is negative, the expression
as a whole will be negative, as shown in Figure 11.3.
Other changes in the components of the IS and MP curves will also have an
impact on the AD curve. We discussed previously the effect of an increase in gove-
rnment expenditures G. The outward shift in the IS curve to a new equilibrium
level of Y is associated with a simultaneous shift rightwards of the AD curve,
since output increases at a given level of inflation.
= P F(L) − wL
where P is the aggregate price level (equal to the firm’s own price level), F(L)
is the production function with the usual properties F (L) > 0 and F (L) < 0,
and w is the wage rate, which is fixed in the short run. The typical competitive
firm will then hire up to the point when marginal revenue equals marginal cost:
P F (L) = w. Rewriting this equilibrium condition gives
w
Downloaded by [Hacettepe University] at 20:58 10 April 2017
F (L) =
P
To see the link to the aggregate supply (AS) curve, consider an increase in
inflation π. Such an increase will involve a rise in P, which in turn means that the
real wage falls (since the nominal wage w remains constant). In order to preserve
equilibrium, the marginal product on the left-hand side must also fall, which it can
do if L rises to some L 1 > L. If firms hire more labor, they can further produce
more output so that F(L 1 ) > F(L). Hence, there emerges a positive relationship
between inflation and output. This is the reason for the positive slope of the AS
curve in Figure 11.3.
wt = γ Pt−1 (11.3)
wt γ Pt−1
F (L t ) = =
Pt Pt
Pt −Pt−1 Pt Pt−1
Since πt−1 = Pt−1 = Pt−1 − 1, we know that Pt = 1
1+πt−1 . Thus, we can
write
γ
F ( L̃ t ) = (11.4)
1 + πt−1
108 Macroeconomic policy
As mentioned above, an increase in inflation therefore increases the number
of people employed in the next period. If the total labor force is Nt , then unem-
ployment is Nt − L̃ t = μN. A rise in inflation in this way decreases the level
of unemployment. This stable negative relationship between unemployment and
inflation is famously referred to as the Phillips curve (Phillips 1958). The exis-
tence of exploitable Phillips curves has been a field of intense empirical research
in recent decades.
did not appear to be in place. Friedman (1968) and Phelps (1968) argued, for
instance, that a shift in policy towards a higher level of inflation with the aim of
keeping unemployment low could not in the longer run keep wages from increas-
ing. Rational workers would see through the Phillips curve reasoning and would
not accept persistent decreases in their real wage. In terms of our model, this
means that it is not likely that workers would accept a wage-setting rule according
to (11.3).
The implication of this type of reasoning is that in the longer run, there should
be a normal or natural rate of unemployment – sometimes referred to as the
nonaccelerating inflation rate of unemployment (NAIRU) – from which mone-
tary or fiscal policy could not diverge. Short-run deviations might be possible,
as indicated by the short-run Phillips curve (SRPC) in Figure 11.4, but in the
longer run there should not be an exploitable relationship between inflation and
unemployment.
π
LRPC
π*
SRPC
unemployment
NAIRU
ln Y − ln Ȳ = b[ p − E( p)]
LS
LSnew
i new
LD
Downloaded by [Hacettepe University] at 20:58 10 April 2017
LDnew
lending
As above, the IS curve is defined as the combination of the real interest rate
r (= i ) and total income Y , where total income equals total expenditure E(Y, r ),
but let us now assume that we have no government. Total expenditure will thus be
spent on consumption and investment, and savings should equal investment.
Unlike before, we derive the slope of the IS curve through the supply and
demand of household lending. In Figure 11.5, we have drawn household demand
for lending LD as a negative function of the nominal interest rate i . Similarly,
households’ supply of lending (i.e. savings) increases with the nominal inter-
est rate. Without any credit market frictions, supply will equal demand at the
rate i .6
If there is an exogenous increase in income Y , how will this affect the supply
and demand for lending? Firstly, if all households have somewhat more income,
they should be able to increase lending supply. The LS curve therefore shifts down
to the right. The demand for loans should fall as incomes increase, but by a smaller
amount. Hence, the LD curve shifts down to the left. The result is a lower equi-
librium interest rate i new < i and a higher level of lending than before. We have
thereby shown that there must be a negative relationship between Y and i for the
goods market to be in equilibrium.
However, the analysis on the basis of Figure 11.5 assumes no credit market fric-
tions, and there is no need for financial intermediaries since the supply of credit
always meets demand. Let us now introduce a situation where financial interme-
diaries are necessary for effectively channeling credit (and perhaps for pooling
risks, as shown in Chapter 7). In order to finance their activities, the financial
intermediaries (we henceforth refer to them as banks) set a higher interest rate for
borrowing i b than for lending i s , as in Figure 11.6a. The difference is referred to
as the credit spread ω ≥ 0.
112 Macroeconomic policy
(a) i
LS
ib
credit spread ω1
is
LD
Downloaded by [Hacettepe University] at 20:58 10 April 2017
lending
L1
(b) ω
XSnew
XS
ω2
ω1
XD
lending
L2 L1
Figure 11.6 (a) Introducing financial intermediaries and credit supply frictions.
(b) Supply and demand for financial intermediation as a function of the
credit spread.
The supply and demand for financial intermediation is shown in Figure 11.6b
and is a function of the credit spread ω. Clearly, if the credit spread is large, the
interest rate on savings will be low and the demand for lending will also be rel-
atively low. The reverse holds if the credit spread is low. On the other hand, a
large credit spread should increase the number of firms willing to act as banks.
The supply of financial intermediation is a positive function of ω since it is more
profitable to be a bank when the difference between savers’ and borrowers’ inter-
est rates are large. For a given level of ω, financial innovations or productivity
increases would shift the XS schedule to the right whereas, for instance, tighter
IS–MP, aggregate demand, and aggregate supply 113
banking regulations such as a higher capital requirement (the bank’s equity as a
share of total assets including its lending) would shift the XS curve to the left. The
equilibrium credit spread is initially given by ω1 when the two curves intersect.
The level of lending is then L 1 .
Let us now consider the effects of a sudden reassessment of the total value of
the banking system’s assets, i.e. its lending to businesses and individuals.7 A nat-
ural requirement for banks when risks increase would be to try increase the credit
spread ω. For a given level of lending, this would be equivalent to shifting the XS
curve to the left, as in Figure 11.6a to XSnew . The result is an increase in the equi-
librium credit spread to ω2 and a decrease in the amount of lending in the economy
to L 2 .
The increase to ω2 implies that i s must fall and that households’ supply of
Downloaded by [Hacettepe University] at 20:58 10 April 2017
lending to the banking system falls to L 2 . In Figure 11.7, we draw the IS and
MP curves in (i s , Y ) space instead of in (r, Y ) space as above. The MP curve is
given by M/P = L(r (Y, π) + π e , Y ) = L(i s (Y, π), Y ) since we have assumed that
π e = 0. Furthermore, money demand L(i s (Y, π), Y ) is a function of the interest
rate on savings i s rather than of i b . People are more willing to convert their long-
term assets into cash or cashable deposits if the interest rate on savings is low. The
monetary policy rule by the central bank i s (Y, π) is targeted at i s and stipulates
that i s increases with both Y and π as before. If this rule does not change, the MP
curve will not shift as a result of the contraction of the XS curve.
The IS curve will, however, be affected by the change. For a given level of Y , i s
is now lower than before due to the increase in the required credit spread. The IS
curve will therefore shift down, as in Figure 11.7. In equilibrium, there is a con-
traction of GDP. Hence, even without changes in monetary policy, a disturbance
in the credit markets can have real effects via the IS curve.
In terms of the 2008 crisis, the model above seems to suggest that a downward
spiral in credit supply and economic activity ensued after the initial shock. The
is
MP
ISnew IS
nomic reasons for nominal rigidities and such rigidities might matter for the real
economy.
Consider, for instance, the assumption that there is a small cost associated with
a change in prices. In line with Mankiw (1985), we might think of such a cost
as arising from having to print new menus in the case of a restaurant. Let us
assume a market structure with some degree of monopolistic competition so that
there are many price-setting firms that find their optimal production level by set-
ting marginal revenue equal to marginal cost. Actual demand is revealed after
quantities have been set.8 If, for instance, the demand for a product decreases,
a profit-maximizing producer should decrease its price.
However, if there is a menu cost for changing prices, the producer must compare
the loss in profit from keeping the price at the existing level, to the small cost of
changing the price. If the loss is smaller than the menu cost, a rational producer
should not change the price. Hence, in this case, there are good microeconomic
reasons for price rigidity.9
In macro models of the aggregate price level, such price rigidity can be modeled
by assuming that only a fraction of firms adjust their price to the profit-maximizing
level. The remaining firms simply keep the price that they charged last period.
A particularly simple formulation used by, for instance, Gali and Gertler (2007)
is that during some period t, the actual (log) price level pt is determined by an
equation
pt = θ pt−1 + (1 − θ ) pt∗
The parameter θ should here be interpreted as the probability that firms cannot
adjust their price during period t. With probability 1 − θ , firms are able to adjust
their price to the profit-maximizing level. θ can thus be regarded as a measure of
the degree of price rigidity. It follows that each firm is expected to maintain its
price level for an amount of time equal to 1/(1 − θ ) > 0. If θ = 2/3, for instance,
then firms only adjust their prices every third period. The adjusted price level pt∗
will in turn be given by a mark-up over current and future expected marginal cost
since new Keynesian models typically assume that markets are characterized by
imperfect competition.
12 Public Finance and Fiscal Policy
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Dt ≡ (G t − Tt ) + rt Dt
116 Macroeconomic policy
The term in parentheses is often referred to as the primary deficit. Budget deficits
imply that Dt > 0, whereas budget surpluses imply a decreasing stock of debt
Dt < 0. Note that in countries with very large debts (and hence large interest
payments), even a moderate primary surplus (G t − Tt < 0) might not be enough
for the debt to shrink.
Budget deficits are typically financed by governments’ issuing of bonds that are
sold to households and that give their holder an interest rate of rt every year. If the
total debt is bond-financed, Dt will be equivalent to the value of the stock of all
outstanding bonds.
Over the long run, it is often assumed that government revenues and expen-
ditures should add up. A “sustainable” long-term fiscal policy should thus
Downloaded by [Hacettepe University] at 20:58 10 April 2017
satisfy:
∞
∞
Gt Tt
+ D 0 ≤ (12.2)
(1 + rt )t (1 + rt )t
t=1 t=1
where rt is the real interest rate as before. In other words, over an infinite time
horizon, the present value of the flow of future government expenditures, plus
some initial debt level D0 , should not exceed the present value of the flow of all
future tax revenues. We will return to this intertemporal constraint below.
∞
∞ y t − τt
ct
≤ d 0 + (12.3)
(1 + rt )t (1 + rt )t
t=1 t=1
The present value of a lifetime of consumption expenditures should not exceed the
present value of the infinite flow of disposable incomes (labor income yt minus
income taxes τt ) plus the initial stock of bonds d0 . If the the economy is made up
of L identical individuals, we can express the individual’s constraint in (12.3) in
terms of Ct = ct L, D0 = d0 L, Yt = yt L, and Tt = τt L. ∞
t=1 Tt /(1 + rt ) =
t
When
∞
(12.2) holds with equality, we know that
t=1 G t /(1 + rt ) + D0 . Inserting this expression into (12.3) gives us
t
∞
∞ ∞
Gt
Ct Yt
≤ −
(1 + rt )t (1 + rt )t (1 + rt )t
t=1 t=1 t=1
Public finance and fiscal policy 117
In this way, we can express the households’ budget constraint as a function of
the present value of government expenditures. The key thing to notice is that the
time path of taxation does not enter the equation above. The financing of deficits
with bonds or taxes should therefore not matter to households. This is indeed
the Ricardian equivalence result. Only the quantity of government expenditures
should matter for consumption over the long run.
The intuition behind this result is that households do not value bonds as net
wealth. Consider the following example: From having a balanced budget with
no debt at some time t, the government chooses to lower taxes Tt . G t remains
the same as before, but now G t > Tt , so that debt accumulates. The government
finances the deficit by selling bonds to the households. Will this increase in dis-
Downloaded by [Hacettepe University] at 20:58 10 April 2017
The expressions in (12.4) assume that the distortionary costs of taxation are an
increasing and convex function of taxes as a share of GDP, Tt /Yt (the tax quotient),
scaled by the general size of the economy Yt . The z-function provides a shortcut
for the many different distortions that taxes give rise to but provides no information
about their exact sources. No taxes implies zero distortions in this setting: z(0) = 0.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
∞
∞
∞ Gt
Yt Tt Tt
min z subject to = + D0
T0 ,T1 ... (1 + rt ) t Yt (1 + rt ) t (1 + rt )t
t=1 t=1 t=1
As was the case in the PIH, the first-order condition for this problem gives us an
T∗ T∗
Euler equation of the type z ( Ytt ) = z ( Yt+1 ), which in turn implies that optimally
t+1
∗
Tt∗ Tt+1
=
Yt Yt+1
In other words, distortionary costs are minimized when taxes as a share of GDP are
smoothed over time. This is the tax smoothing result, with the same basic intuition
as consumption smoothing.
Extending this model to include uncertainty is straightforward. In the case of a
quadratic z-function (as in Hall’s random-walk model), we will have
∗
Tt∗ T
= E t t+1
Yt Yt+1
Taxes as a share of GDP thus follow a random walk. The result suggests that at all
points in time, the government includes all available information about expected
future incomes and sets the current level of Tt /Yt so that it is expected to be equal
to all future levels of tax quotients. Any change from this pattern must be due to
random new information.
respectively. The instantaneous utility functions u(gtR ) and v(gtL ) have the usual
properties u (gtR ), v (gtL ) > 0 and u (gtR ), v (gtL ) < 0. Note also that u(gtL ) =
v(gtR ) = 0, i.e. if the other party wins the election and provides its preferred public
good, the losing party is assumed to yield no utility. β ≤ 1 is the usual time discount
factor.
The country earns an exogenous flow of per capita incomes during the two
periods equal to y1 , y2 > 0, where we assume y2 ≥ y1 . The income tax rate is
τ < 1, so that the intertemporal flow of government revenue is τ y1 and τ y2 . The
120 Macroeconomic policy
government starts off without any government debt. In the first period, the govern-
ment in power might, however, incur a debt equal to d that must be repaid in full
in period 2. These conditions imply that the budget constraints for party j = R, L
in power are
j
g 1 = τ y1 + d
(12.6)
j
g 2 = τ y2 − d
The party in power during period 1 thus maximizes (12.5) subject to (12.6).
The choice variable that we are interested in is the level of debt chosen in the first
period, d. The optimal level of debt can be found if we insert the expressions for
Downloaded by [Hacettepe University] at 20:58 10 April 2017
(12.6) into the utility functions and then maximize with respect to d.
Let us, for instance, assume that the right-wing party is in power in period 1.
The optimization problem for the government is thus
∂V R
= u (τ y1 + d) − βρu (τ y2 − d) = 0
∂d
With this general utility function, we are as always unable to calculate any explicit
solutions, although it is certainly possible to carry out comparative statics with the
help of implicit differentiation.
In order to derive an explicit solution, let us assume logarithmic utilities so that
u(gtR ) = ln gtR and u(gtL ) = ln gtL . In that case, the first-order condition tells us that
1 βρ
=
τ y1 + d τ y2 − d
By manipulating this expression, we can solve for the optimal level of government
debt:
τ (y2 − βρy1 )
d∗ = (12.7)
1 + βρ
The key insight from this expression is that the optimal level of debt incurred
by an incumbent party in the first period is negatively associated with the party’s
probability of winning the election in period 2. More formally, the derivative is
∂d ∗ (y1 + y2 )βτ
=− <0
∂ρ (1 + βρ)2
Hence, if the right-wing party is the incumbent in period 1, the optimal debt will
be smallest if ρ = 1, i.e. if the incumbent party is 100 percent sure of winning
Public finance and fiscal policy 121
the next election. From (12.7), we can deduce that the debt will then be d ∗ =
τ (y2 − βy1 )/(1 + β) ≥ 0. Note that as long as y2 − βy1 > 0, the party in govern-
ment will always opt for a positive amount of debt. The size of the debt increases
with the growth in income, y2 − y1 . Hence, it is more rational for fast-growing
economies to have government debt than for slow-growing countries.
A straightforward corollary of the result above is that debt will be highest when
the probability of re-election ρ approaches zero. If that is the case, we can see from
the utility function that the party in power will discount the future by a factor βρ
that will be close to zero. Since the party can only spend on the public goods
that it likes in the first period, it will take on as much debt as possible in the first
period. In the limiting case of ρ = 0, we can deduce from (12.7) that the optimal
first-period debt will be τ y2 , i.e. the debt will be so high that the opposing party
Downloaded by [Hacettepe University] at 20:58 10 April 2017
that takes over has to spend all its government revenue on paying back the debt. If,
for example, the right-wing party only cared for military spending, it would spend
τ (y1 + y2 ) on the military in period 1 so that the left-wing party would not be able
to spend anything on health care while in power in period 2.
Usually, the level of government debt is discussed in terms of shares of total
GDP. If y1 = y2 = y, then optimal debt as a share of GDP is simply
d ∗ τ (1 − βρ)
=
y 1 + βρ
This expression highlights two other important factors for countries’ willingness
to accept government debt: the time discount rate β and the tax rate τ . Societies
with more impatient individuals (a low β) are more likely to have high debt than
societies with patient individuals. Similarly, it is only intuitive that high-tax coun-
tries (a high τ ) with a large public sector will have higher debt as a share of GDP
than low-tax countries.
Assume now that whereas first-period revenues are known with certainty,
second-period revenues are uncertain and might take on either a high value y2H
or a low value y2L , where y2H > y2L . The differences in income could, for instance,
be due to high or low world market prices for the goods that the country is selling.
Suppose that the probability of the good outcome is ρ and the probability of the
bad outcome is 1 − ρ. On top of this, actual revenue in period 2 might depend
on government adjustment effort e. Adjustment efforts could, for instance, be an
exchange rate realignment or the abolition of harmful tariffs that serve to improve
the functioning of the economy and increase income. For simplicity, we assume
that every unit of adjustment effort results in one additional unit of output. Taken
together, the expected level of second-period government revenue in period 1 is
therefore E 1 (y2 ) = ρy2H + (1 − ρ)y2L + e whereas the actual levels are
y2 = y2i + e
ally want to avoid episodes of sovereign default, although there are many instances
of such events in history. Since it was founded in 1945, the International Mone-
tary Fund (IMF) has acted as a lender of last resort to heavily indebted countries.
By 2011, Greece was one of the countries that had to turn to the IMF for help to
finance its government debt.
How should an optimal debt contract be designed? Clearly, a profit-maximizing
financial institution would only care about its own expected profit and would not
lend to a country with a debt overhang. Given that the IMF presumably gives some
weight to the indebted country in its utility function, its optimal strategy will be
different.
Let us consider the situation in period 1 and imagine that a liquidity crisis has
emerged so that remaining debt payments exceed expected future government rev-
enue, b(1 + r ) > τ E 1 (y2 ). What are the options for a lender like the IMF? To begin
with, note that even if the debt is unsustainable according to the definition above,
it might still be the case that τ (y2H + e) > b(1 + r ), i.e. the country might be able
to pay back its debt if the good outcome y2H materializes. One strategy might
therefore be to lend the full amount and hope for the good outcome to happen
(or for government effort e to be sufficiently high) so that the IMF recovers all of
its credit. The actual repayment q would then be:
τ y2 if τ y2 < b(1 + r )
q= (12.9)
b(1 + r ) if τ y2 > b(1 + r )
τ yL
q = (1 + r low )b = τ y L =⇒ r low = −1
b
124 Macroeconomic policy
in which case the country’s debt would in fact be sustainable. Actual payment,
regardless of outcome, is then q = τ y L . However, this lower interest could be seen
as a reward for irresponsible behavior and since the IMF would have to make up
for the interest differential to the international interest rate (r − r low ), it would still
be close to a government default.
A third and closely related strategy might be to reschedule or “forgive” some of
the debt already in period 1 to a new level blow < b such that
τ yL
q = (1 + r )blow = τ y L =⇒ blow =
(1 + r )
(b − blow )(1 + r ) and is likely to make it hard for the country to borrow money on
the international capital market in the future.
Why would the IMF ever choose the latter two alternatives when they clearly
yield repayments that are smaller than or equal to the contingent payment
in (12.9)? Note that in the two latter cases, government spending is either g2 =
τ y2 − τ y L = τ (y H + e) > 0 or g2 = τ (y L + e) ≥ 0. By setting a low level of
repayment, the IMF gives governments stronger incentives for making necessary
adjustment efforts e. In the subsection below, we formalize this idea further.
∂ E 1 (U ) ∂ E 1 (g2 )
= u (g2 ) − v (e) = 0
∂e ∂e
Since we know that u (g2 ) > 0 and v (e) > 0 for all e > 0, the existence of a positive
Downloaded by [Hacettepe University] at 20:58 10 April 2017
equilibrium level e∗ > 0 will depend on the term ∂ E 1 (g2 )/∂e. If the IMF does not
forgive any debt, if it lends at the international interest rate r , and hopes for a
positive outcome so that a full repayment can be made, then for all levels such that
τ E 1 (y2 ) < b(1 + r ), the government is expected to earn τ E 1 (y2 ) and has to repay
the same amount. A rise in e increases expected revenue and expected repayments
one for one and hence ∂ E 1 (g2 )/∂e = 0. Governments therefore have no incentive
to make adjustment efforts and the optimal level is e∗ = 0.
If the debt contract is set at the more generous level q = τ y L , then τ E 1 (y2 ) > q
and extra efforts will increase E 1 (g2 ) at all e > 0. This implies that ∂ E 1 (g2 )/∂e = 1
and that the optimal level of effort is given by u (g2 ) = v (e∗ ). In this section, we
have not explicitly modeled the utility function of the IMF, but is seems quite
intuitive that an outcome where e∗ > 0 might be part of its preference structure.
13 Inflation and Monetary Policy
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Inflation is the percentage increase in the aggregate price level (usually measured
by a consumer price index) from one year to the next. Inflation has been observed
throughout history for as long as money has existed. The chapter will focus on
the causes of inflation and analyze how monetary policy affects inflation and the
economy as a whole.
At a general level, we saw in Chapter 11 that inflation will be heavily influ-
enced in the short run by aggregate supply and aggregate demand. An expansion
in demand, for example, always leads to an increase in inflation (unless the AS
curve is horizontal). Negative supply shocks – for instance, due to a rise in the
price of a key production factor like oil – will lead to a supply contraction and an
increase in inflation.
In this chapter we will delve further into mainly the political economy of infla-
tion and the perceived trade-off between inflation and output growth. We will
start off with the classical quantity theory of money and then proceed with the
time-inconsistency model of monetary policy and its potential solutions, originally
associated with Kydland and Prescott (1977). After that, we move on to an exten-
sive discussion of political business cycles and analyze how inflation is affected
by the presence of political parties with different preferences for price stability.
These sections will rely to a great extent on Alesina and Stella (2010). Finally, we
include a treatment on seigniorage.
MV = PY
M
= L(r + π e , Y )
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Rewriting this expression, and assuming for now that Ȳ and r̄ are constants and
that the monetary policy rule does not apply, we get P = M/L(r̄ + π e , Ȳ ), i.e. just
like in the quantity theory, there is a close association between the aggregate price
level and the nominal money stock. Indeed, we will have that actual inflation and
expected inflation are
Ṗ Ṁ
= π = πe =
P M
where Ṗ/P and Ṁ/M are the growth rates (the time derivative divided by the cur-
rent level) of the aggregate price index and the nominal money stock, respectively.
Prices are thus completely flexible.
Let us consider the case when a monetary authority announces that it will
increase the growth rate of money Ṁ/M. This increase in turn increases actual
as well as expected inflation π = π e . Through its positive impact on the nominal
interest rate i = r + π e , the demand for money therefore shrinks immediately.2
In order to balance the fall in money demand, real money supply must also fall,
i.e. P must increase. Hence, the rise in the money growth rates both increases the
inflation rate in the longer run and leads to a one-time discontinuous jump in the
aggregate price level.
Analogously, if monetary authorities reduced money growth, there would be a
decrease in long-run inflation but also an immediate discontinuous fall in the price
level. In order to avoid such a dramatic scenario, central banks should combine the
lowering of long-run money growth rates with a temporary increase in the level of
money supply (Romer 2005).
y = ȳ + b(π − π e ) (13.1)
Downloaded by [Hacettepe University] at 20:58 10 April 2017
where y is actual output (in logarithmic form), ȳ is the natural rate of output (long-
run aggregate supply), π and π e are actual and expected inflation as before, and
b > 0 is a parameter capturing the sensitivity of output to deviations in the inflation
rate. Central banks (or policy-makers more generally) are assumed to be in control
of the actual level of inflation π, whereas the public forms rational expectations
about inflation π e . One might think of these expectations as being manifested in
wage contracts so that there is an element of stickiness in inflationary expectations.
The only way that central banks can increase output from its natural level ȳ is to
surprise the public by setting an inflation rate π > π e .
Let us further assume that central banks are governed in their decision-making
by a (dis)utility or social loss function
1 1
V = (y − y ∗ )2 + a(π − π ∗ )2 (13.2)
2 2
The aim of the central bank is to conduct a monetary policy that ensures that actual
output and inflation are as close as possible to some target levels y ∗ and π ∗ . a > 0
reflects the relative weight given to fighting inflation. Most central banks have
inflation fighting as their primary objective, implying a > 1. Furthermore, central
banks as well as governments will typically want to maintain levels of output
(unemployment) that are higher (lower) than the natural rate, i.e. y ∗ > ȳ. Let us
think of y ∗ as the level of output associated with full employment. We assume, for
simplicity, that the targeted rate of inflation is π ∗ = 0.3
There are at least two main monetary policy strategies that central banks can
follow: either to make binding inflation commitments that are independent of the
public’s expectations or to allow for a discretionary monetary policy that takes the
public’s expectations into account in its decision-making.
If the central bank makes binding commitments, it will not allow itself to adjust
levels of inflation as a response to how the economy performs. Assume that the
central bank makes a commitment that, no matter what, it will maintain inflation
at π = 0. If the public considers this to be a credible commitment, its inflation
expectations will be π e = 0. In this case, the central bank will simply set π = 0,
which is the socially optimal level.
Inflation and monetary policy 129
13.3.2 Discretionary monetary policy
If the central bank follows a discretionary monetary policy, then its governors
allow themselves flexibility by determining the inflation rate after the public has
formed its expectations. An important component is further that central banks rec-
ognize that actual output is given by (13.1). There is thus a possibility of boosting
output by setting an inflation rate above the expected one, as implied by the Lucas
supply function. Combing these assumptions gives us the following minimization
problem for the central bank:
1 1
min V (π) = [ ȳ + b(π − π e ) − y ∗ ]2 + aπ 2 (13.3)
π 2 2
Downloaded by [Hacettepe University] at 20:58 10 April 2017
However, the public does not want to be fooled, and forms rational expecta-
tions about the central bank’s inflation policy. Formally, the objective of the public
might be described as being aimed at minimizing (π e − π)2 . In Kydland and
Prescott’s model, the interaction between the central bank and the public takes
the form of a sequential, complete information game where the public moves first
by forming π e , whereupon the central bank sets π conditional on the observed
level of π e .
As usual in sequential games, we find the solution through backward induction,
i.e. by starting in the second stage.
Second stage: The central bank chooses the π that minimizes (13.3), taking π e
as given.
In order to find the level that minimizes (13.3), we need the first-order
condition4
∂V
= [ ȳ + b(π − π e ) − y ∗ ]b + aπ = 0
∂π
After some manipulations, the combined levels of π and π e that satisfy this first-
order condition give us the central bank’s best response (or reaction) function
b ∗ b2 π e
π r (π e ) = (y − ȳ) + (13.4)
a + b2 a + b2
π
45º
πr
πe
Π EQ
In order to obtain the full solution to the model, we need now to move to the
first stage:
First stage: The public forms inflation expectations π e , taking into account the
known response function of the central bank.
The public knows the rules of the game and realizes that the central bank will
attempt to surprise it by setting an inflation higher than the expected one according
to (13.4). Since the people want to minimize surprises, they will increase their
inflationary expectations until π e = π r . Since the slope of the reaction function
in (π, π e ) space is smaller than unity, there must be a level where this equality
holds. As shown in Figure 13.1, this happens at a level π = π EQ . Inserting π EQ =
π e = π r into (13.4) and solving gives us the subgame perfect Nash equilibrium
level of inflation:
b
π EQ = (y ∗ − ȳ) > 0
a
The central insight from this expression is, once again, that π EQ > 0, i.e., even
though the official inflation target is π ∗ = 0, the fact that discretionary mone-
tary policy is allowed will always give us an inflation rate that is higher than the
socially optimal level. The reason is that a target of π ∗ = 0 is time-inconsistent:
when the public’s inflation expectations are set first, it is always optimal for the
central bank to set π > 0 in the next stage. Hence, although discretionary policy is
more flexible, which sounds like a good thing, it produces inflation rates that are
higher than those that would prevail under binding inflation targets.
Note also that since π EQ = π e in equilibrium, y EQ = ȳ + b(π EQ − π e ) = ȳ, i.e.
the equilibrium level of output will remain at the natural rate of output and below
Inflation and monetary policy 131
the socially optimal level y ∗ . In this sense, the net result of discretionary monetary
policy is only a socially suboptimal level of inflation.
where εt is a random output shock with an expected value E t (εt ) = 0. For simpli-
city, let us also assume that εt can assume the values γ > 0 in good times and −γ
in bad times with a probability distribution that satisfies E t (εt ) = 0.
If we set up the same loss function as before with this new assumption, it will be
1 1
min V (π) = [ ȳ + b(π − π e + εt ) − y ∗ ]2 + aπ 2 (13.6)
π 2 2
From the first-order conditions, we can deduce that the central bank’s reaction
function will now be
b b2 (π e − εt )
π r (π e ) = (y ∗ − ȳ) + (13.7)
a+b 2 a + b2
In the second stage, when the central bank determines the level of inflation by
taking the public’s expectations π e as given, it also has information on the actual
realization of εt . The public, however, will make its expectations before knowing
what actually happens to the economy. Their rational expectation about the shock
is simply E t (εt ) = 0.
When we insert π EQ = π e into the expression above as before, we can solve for
the equilibrium level of inflation under discretion and uncertainty:
b ∗ b2 (π EQ − εt )
π EQ = (y − ȳ) +
a + b2 a + b2
b2 a b b 2 εt
π EQ 1 − = π EQ = (y ∗ − ȳ) −
a+b 2 a+b 2 a+b 2 a + b2
b
π EQ = (y ∗ − ȳ) − b2 εt
a
1 1
V L = (y − y ∗ )2 + a L π 2
2 2
1 ∗ 2 1
V = (y − y ) + a R π 2
R
2 2
respectively, where V L is the loss function of the left-wing party and V R the
equivalent function for the right-wing party. The key difference between the two
Inflation and monetary policy 133
is that a R > a L , i.e. the right-wing party attaches a relatively higher weight to
fighting inflation. Both parties want to achieve full employment, so that y = y ∗ ,
but the left-wing parties are, as will be demonstrated, more willing to pay a price
for this in terms of higher inflation. Compared with the above, let us now set b = 1
for simplicity so that output responds fully to a surprise inflation.
If we minimize the loss functions and take the first-order conditions in the usual
way, we can derive the optimal response functions
y ∗ − ȳ πe
π r,L = + (13.8)
1 + aL 1 + aL
y ∗ − ȳ πe
π r,R = +
Downloaded by [Hacettepe University] at 20:58 10 April 2017
1 + aR 1 + aR
(y ∗ − ȳ) (y ∗ − ȳ)
π EQ,L = > 0, π EQ,R = >0 (13.9)
aL aR
Since a R > a L , it follows that π EQ,R < π EQ,L , i.e. equilibrium inflation will be
lower when the right-wing party is in power. Note that equilibrium output levels
will be y L = ȳ + π EQ,L − π e = ȳ and y R = ȳ + π EQ,R − π e = ȳ.
In nonelection years, this is well known to the public and they adjust their
expectations accordingly. In election years, however, there is an uncertainty about
what party will be in power. In line with the discussion about random output
shocks above, let us imagine that public expectations are made in the first stage
and that the politically controlled central bank then sets the inflation rate after
having observed the election result and in accordance with the winning party’s
preferences.
Let ρ be some objective probability of an election victory for the right-wing
party at the time expectations are formed. The probability of a left-wing victory is
therefore 1 − ρ. The expected inflation is then
(y ∗ − ȳ)[1 + a R − ρ(a R − a L )]
πe = (13.10)
(1 + a R )a L + ρ(a R − a L )
This expression implies that π e will decrease with ρ, the probability of a right-
wing election victory. This is well in line with intuition since the right-wing party
is more inflation-averse than the left-wing party.
134 Macroeconomic policy
We can insert the result in (13.10) into (13.8) to solve for the actual levels of
inflation during an election year t:
EQ,L (y ∗ − ȳ)(1 + a R )
πt =
(1 + a R )a L + ρ(a R − a L )
EQ,R (y ∗ − ȳ)(1 + a L )
πt =
(1 + a R )a L + ρ(a R − a L )
Once again, since a R > a L , it will be the case that πtEQ,L > πtEQ,R . Another inter-
esting result emerges if we compare chosen inflation levels by each party during
a “mid-term” year when there is no election, and during an election year. Let us
Downloaded by [Hacettepe University] at 20:58 10 April 2017
imagine that the mid-term year is t − 1 and that an election happens the next year
EQ,L
at t. Hence, πt−1 is the optimal level of inflation chosen by a left-wing party in
EQ,R
office during a mid-term year and is given by (13.9). Let πt−1 be defined in the
same way. Then it is easily shown that the following inequalities will hold:
EQ,L EQ,L
(i) πt < πt−1 ;
EQ,R EQ,R
(ii) πt > πt−1 ;
EQ,L EQ,R
(iii) πt > πt−1 ;
(iv) πtEQ,R < πt−1
EQ,L
The direction of the sign in (i) informs us that when a left-wing party is in
power and wins the election, inflation will fall during an election year. The reason
is that voters give the right-wing party a probability ρ > 0 of winning and hence
have lower expectations for inflation during the election year. The government-led
central bank can thus set a lower level of inflation than before after a left-wing
victory. For a right-wing party in power, the reverse logic applies in (ii); in an
election year, inflation expectations will rise due to some probability of a left-
wing party win and the economy will thus get a higher level of inflation during an
election year, even when the right-wing party wins.
Inequalities (iii) and (iv) are perhaps less surprising: if a left-wing party wins
the election and takes power from the right-wing party, inflation will increase (iii),
whereas if the reverse happens, inflation will fall (iv).
The actual levels of output under different election winners during an election
year will be
victory by the right-wing party. Hence, the central bank can surprise the electorate
with a high inflation and boost output. In case of a right-wing party win, the actual
level of inflation will be lower than the one anticipated by the public and output
will therefore decline, even if the right-wing party was previously in power. One
year after the election, output levels will be back at ȳ again.
In Figure 13.2, we show one example of such a political business cycle dynamic.
It shows the development of output and inflation when the economy transits from
y election year
time
t –2 t –1 t t +1 t +2
EQ,L
EQ,L
EQ,R
time
t –2 t –1 t t+1 t+2
Figure 13.2 Political business cycle effects over time on output (y) and inflation (π) of
an unexpected (ρ > 1/2) left-wing party election victory over an incumbent
right-wing party.
136 Macroeconomic policy
right-wing party rule to an unexpected election victory by the left. In period
t − 1 and earlier, output and inflation are ȳ and (y ∗ − ȳ)/a R , respectively. When
the election period comes, the probability of a right-wing victory is considered
large, ρ > 1/2, which is reflected in low inflationary expectations and a relatively
low πtEQ,L . Due to this surprise, there is a relatively large boom in output, which
increases by ρη > 0. In period t + 1, the economy has settled down at the natural
rate of output and the higher equilibrium level of inflation.
The key insight from this section is that even if the same party is in power before
and after an election and no fundamentals in the economy have changed, there will
still be a temporary cyclic effect on inflation and output that is fully explained by
the uncertainty surrounding the election itself.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
where i t is the nominal interest rate at time t, πt is the actual inflation rate as
before, π ∗ is the socially optimal inflation rate, rt∗ is the equilibrium real interest
rate, yt is the actual output level (in logs), and ȳ is the natural rate of output or
the level of potential output predicted by a linear trend. aπ and a y are the weights
given to inflation and output, respectively.
In Taylor’s original work, which was used for understanding United States mon-
etary policy, it was assumed that rt∗ = 2 percent, π ∗ = 2, and aπ = a y = 1/2.8 Using
these parameter values, Taylor rule becomes
πt − 2 yt − ȳ
i t = πt + 2 + +
2 2
3πt yt − ȳ
=1+ +
2 2
Inflation and monetary policy 137
Hence, when both output and inflation are at their preferred levels (so that πt − 2 =
yt − ȳ = 0), then i t = 4 percent. If inflation rises by 1 percent above this level, i.e.
πt = 3, while the other values remain constant, the rule says central banks should
raise their nominal interest rate by 1.5 percent to 5.5 percent. Similarly, if output
rose so that it was 1 percent above its long-run trend (yt − ȳ = 1), then the nominal
interest rate should be raised by 0.5 percent. Taylor (1993) showed that this rule
was very close to the actual observed response of the Federal Reserve in the United
States to levels and changes in output and inflation.
13.6 Seigniorage
The time-inconsistency problem above is mainly meant to describe the situation
Downloaded by [Hacettepe University] at 20:58 10 April 2017
M
= L(r + π e , Y )
P
where we assume for simplicity that r and Y are unaffected by the money growth
rate and that π = π e . Also, as in Section 11.2, inflation equals the money growth
rate: π = MṀ
= gm .
Let us assume that the government now becomes involved in a very costly
endeavour for which it cannot easily get financing from the international com-
munity, such as a war.10 The only short-run solution to the financing difficulty
is to order the central bank to print money that the government can use to meet
its new expenditures. The increase in the nominal money stock from this money
printing is Ṁ. If we divide this by the aggregate price level, we get the level of
seigniorage S:
Ṁ Ṁ M M
S= = = gm (13.13)
P M P P
The reformulated expression on the right-hand side explains why seigniorage is
sometimes referred to as an inflation tax on real money balances, where the money
growth rate is the “tax” rate.
In a money market equilibrium, it must be the case that M/P = L(r + π e , Y ).
Inserting this term into (13.13) gives us
S = gm L(r + gm , Y ) (13.14)
138 Macroeconomic policy
S
Downloaded by [Hacettepe University] at 20:58 10 April 2017
gM
The interesting aspect of this expression is that it shows that money printing will
have a negative impact on money demand since it will increase the nominal interest
rate i = r + gm . There is thus a trade-off: on the one hand, printing more money
increases government revenue; on the other hand, it decreases the “tax base” by
decreasing the amount of real money balances held in society. Taking derivatives
on the basis of (13.14) shows formally the positive and negative effects:
∂S
= L(r + gm , Y ) + gm L i (r + gm , Y )
∂gm (+) (−)
Note that the second term approaches zero as gm goes to zero, whereas L(r +
gm , Y ) is positive even if gm = 0. As gm rises, it is plausible that the second nega-
tive term will eventually dominate and make the whole expression negative. In that
case, the relationship between S and gm is shaped like an inverted “U”, as shown
in Figure 13.3. The curve in Figure 13.3 has been referred to as the inflation-tax
Laffer curve.11 A nonbenevolent government should thus create inflation up to the
maximum where ∂g∂ Sm = 0.12 Although this might seem like a cynical type of anal-
ysis, it is a well-known fact that hyperinflations are almost always directly caused
by governments and that they can be curbed relatively easy if there is a political
will to do so.
14 The Open Economy
Downloaded by [Hacettepe University] at 20:58 10 April 2017
The models analyzed so far have all assumed a closed economy, i.e. one where
trade does not occur. In this final chapter, we will analyze the effects of opening the
economy to trade and to capital flows across borders. In reality, total trade (exports
plus imports) often amounts to a substantial proportion of GDP, especially in small
countries.
In the sections below, we start by defining key concepts and relations such as the
current account, the balance of payments, and nominal versus real exchange rates.
We then introduce a representative agent framework with a utility-maximizing
individual who optimizes a consumption stream in a two-period model. The pur-
pose of this exercise is to show that a balanced current account might not be
optimal in all periods. We then discuss more traditional models in the Keynesian
tradition of sticky prices such as the Mundell–Fleming model, and the theory of
exchange rate overshooting. Finally, we also analyze the criteria for an optimal
currency union.
Y =C + I +G + X − M
S = S P + SG = Y − C − G = I + N X (14.1)
In an open economy, total savings might thus be equal to I , but can depend on net
exports.
Equation (14.1) can further be reformulated as
N X = S P − I − (G − T )
likely associated with a current account deficit N X < 0. This scenario, sometimes
referred to as twin deficits, has characterized the world’s largest economy, the
United States, during the last decade.
N X t = Bt+1 − Bt = Yt + rt Bt − Ct − It − G t (14.2)
If N X t > 0, then Bt+1 > Bt and the economy’s stock of foreign assets increases. If
we think of Bt as bonds, then the country is “paid” with foreign bonds to make up
for the difference between exports and imports with the rest of the world. If instead
imports exceed exports, so that N X t < 0, then the country needs to borrow in order
to import more than they export. In this case, the stock of foreign assets decrease.
The existing stock of foreign bonds at time t, Bt , earns an interest return of rt .1
The open economy 141
14.1.3 Exchange rates
The nominal exchange rate e shows the price of a unit of foreign currency in terms
of the home-country currency.2 A rise in the exchange rate thus means that the
foreign currency becomes more expensive, which means a weakening, or depre-
ciation, of the home currency. In an analogous manner, should e fall, that would
imply a strengthening, or appreciation, of the home currency compared with the
specific foreign currency in question.
Let the aggregate price level in the foreign country be P f and the aggregate
price level in the home country P, as before. Then the real exchange rate is
defined as
eP f
Downloaded by [Hacettepe University] at 20:58 10 April 2017
=
P
For example, consider a comparison between the Swedish krona and the euro. The
nominal exchange rate is roughly 10 (1 euro costs 10 kronor), whereas the price
level is perhaps 5 percent higher in the euro zone, on average. In that case, the real
euro/krona exchange rate will be 10 · 1.05 = 10.5.
A rising real exchange rate, i.e. a depreciation, means that foreign goods and
services are becoming more expensive relative to those produced in the home
country. Hence, consumers in the home country will likely substitute more expen-
sive foreign goods for cheaper home-produced goods. This will imply that net
exports rise, i.e. exports from the home country increase and imports decrease.
This relationship is an outcome of a more complicated theoretical exercise called
the Marshall–Lerner condition.3
U = u(c1 ) + βu(c2 )
The current account equation (14.2) and the assumptions just mentioned imply
that
B2 = Y1 − C1 − I1 − G 1 = F(K 1 ) − C1 − (K 2 − K 1 ) − G 1
B3 + K 3 − B2 − K 2 = −B2 − K 2 = F(K 2 ) + r B2 − C2 − G 2
The intuition for the latter expression is simply that second-period consumption
will partly come from having sold off or “eaten” the capital and assets stocks
B2 and K 2 . If we isolate B2 in the equations above and combine them, we can
get an expression that implicitly defines the intertemporal budget constraint for a
representatitive individual in the economy:
C2 + G 2 − K 2 − F(K 2 )
F(K 1 ) − C1 − (K 2 − K 1 ) − G 1 =
(1 + r2 )
F(K 2 ) − G 2 C2 − K 2
F(K 1 ) − G 1 + = C1 + K 2 − K 1 +
(1 + r ) (1 + r )
= u(C1 ) + βu(C2 )
F(K 2 ) − G 2 K 2 − C2
+ λ F(K 1 ) − G 1 + − C1 − K 2 + K 1 +
1+r 1+r
∂
= u (C1∗ ) − λ = 0
∂C1
∂ λ
= βu (C2∗ ) − =0
∂C2 1+r
∗
∂ F (K 2 ) 1
=λ −1+ =0
∂ K2 1+r 1+r
The first two conditions can easily be combined to obtain the usual Euler equation
result that, optimally, u (C1∗ ) = β(1 + r )u (C2∗ ). Rearranging the last condition
Downloaded by [Hacettepe University] at 20:58 10 April 2017
gives the equally familiar result that F (K 2∗ ) = r , i.e. investments should be made
up to the point where the marginal product of capital equals the marginal cost r .
Without further simplications, the first-order conditions cannot be used for find-
ing explicit solutions. Let us therefore assume that β(1 +r ) = 1, which implies that
we get the consumption smoothing result of C1∗ = C2∗ = C ∗ . Inserting this solution
back into the intertemporal budget constraint gives us
N X 1∗ = B2∗ = Y1 − C ∗ − I1 − G 1 (14.4)
(Y1 + K 1 − G 1 )(1 + r ) + F(K 2∗ ) − G 2 − r K 2∗
= Y1 −
2+r
− (K 2∗ − K 1 ) − G 1
Note that we allow for the case that B2 < 0, which would imply a current account
deficit and that the country is a net lender in the first period on the international
market. In the second period, we have ruled out the possibility that the country
ends with either debts or positive assets, so N X 2∗ = B3 − B2∗ = −B2∗ = −N X 1∗ .
Hence, if there is a current account surplus in the first period (B2∗ > 0), there must
an equivalent deficit in the next (and last) period, and vice versa. What we are
analyzing is therefore only the dynamics of the current account, not its “final”
level.
To start with, a current account surplus (B2 > 0) is more likely if initial capital
and income K 1 and Y1 = F(K 1 ) are high. The derivative with respect to K 1 is
∂ N X 1∗ /∂ K 1 = [F (K 1 ) + 1]/(2 + r ) > 0, which is clearly positive. Hence, coun-
tries that are initially rich in capital are likely to “choose” a current account surplus
144 Macroeconomic policy
in the first period, i.e. to export more than they import. The intuition for this sur-
plus in national saving is really the same as in the permanent income model of
consumption: since individuals aim to smoothe consumption, they will save when
incomes are abnormally high and use up the savings when incomes are lower.
How about the pattern of government spending, G 1 and G 2 ? First, note that
if G 1 = G 2 = Ḡ, then ∂ N X 1∗ /∂ Ḡ = 0. The reason is that an increase in Ḡ would
decrease C ∗ by exactly the same amount and the two effects in (14.4) would cancel
each other out. If, however, it was known for instance that the government would
decrease only G 2 while keeping G 1 constant, then we find from the derivative that
∂ N X 1∗ /∂ G 2 = 1/(2 + r ) > 0, i.e. a decrease in G 2 should cause a current account
deficit in the first period. The intuition is that a decrease in G 2 would cause optimal
consumption C ∗ to rise in (14.4) while holding all other variables constant.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
X − M = N X (, Y, Y f )
Y = E(Y, r, G, , Y f )
where and Y f are included since they influence the current account. The IS curve
will actually be flatter in the open economy scenario because national income Y
has both a positive effect through increased consumption and a negative effect
through the impact on imports.
Rather than an MP curve, let us now work with a traditional LM curve. Whereas
an MP curve includes a monetary policy rule r (Y, π) whereby the central bank’s
interest rate reacts to changes in income levels and inflation, we now simply
assume
M
= L(r, Y )
P
The open economy 145
As will be shown below, r might respond to the balance of payments situation.
Monetary policy instead enters as changes in real money supply M/P. In this
model, we are not primarily interested in inflation and will assume π e = 0 for
simplicity so that the nominal interest rate i = r + π e equals the real one, i = r .
We will indeed assume throughout this section the aggregate price level P is sticky
in the short run.
As before, we have that L r < 0 and that L Y > 0. Hence, an increase in Y will
increase money demand. In order to preserve the money market equilibrium, we
must have that r increases so that money demand falls back again to equal M/P.
The LM curve will thus feature a positive association between r and Y .
The balance of payments equation is given by the current account plus the cap-
Downloaded by [Hacettepe University] at 20:58 10 April 2017
ital account. The level of the capital account, CA, depends primarily on the real
interest differential between the home country and the rest of the world r − r f .
There are two main scenarios for how the nominal interest rate differential will
matter. In the first scenario with perfect capital mobility between countries, there
can be no interest rate difference, i.e. r = r f . If there is imperfect capital mobility,
perhaps due to regulations in the international capital market, then r and r f need
not be exactly identical. In that case, the current account will generally be a
function of the interest rate differential C A(r − r f ), where
∂C A(r − r f )
= C Ar−r f > 0
∂(r − r f )
This means that capital will flow into the country if the interest rate domesti-
cally, r , is larger than that abroad, r f .
The current account and capital account equations taken together define the
balance of payments (BOP) curve:
N X (, Y, Y f ) + C A(r − r f ) = 0
In the case of perfect capital mobility, the BOP curve will be horizontal at r = r f .
With imperfect capital mobility, the BOP curve will be upward-sloping. To see
why, consider an increase in Y . Such an increase means that imports rise, which
leads to a fall in N X. In order to preserve the balance of payments “balanced” at
zero, this requires an offsetting rise in the capital account. Such a rise can only
come about through an increase in the domestic interest rate r . When the interest
rate rises, capital will flow into the country to finance to current account deficit.
Another implication is that there will be a positive association between r and Y .
We are now equipped with three curves: the IS curve (with net exports), the LM
curve, and the BOP curve. The curves form a flexible and useful framework for
understanding the effect of fiscal and monetary policy in an open economy setting.
Many different scenarios may be analyzed. For instance, let us consider the
case of perfect capital mobility, fixed exchange rates, and an expansive monetary
policy, as shown in Figure 14.1. The economy is initially in an equilibrium where
all curves cross. An increase in the nominal money stock (with prices staying
146 Macroeconomic policy
r
LM
LM′
IS
rf BOP
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Y
Y0 = Y2 Y1
Figure 14.1 Expansionary monetary policy with fixed exchange rates and perfect
capital mobility.
constant) leads to a shift in the LM curve to the right. This will cause a fall in the
real interest rate below the international level r f . As a result, people will want to
buy foreign bonds and there will be a large capital outflow. Since the exchange
rate is supposed to be fixed, the central bank needs to purchase domestic currency
in order to keep it at its fixed level. This leads to a reduction in nominal money
supply, and the LM curve shifts back again to its initial position Y0 . In other words,
under perfect capital mobility and a fixed exchange rate, monetary policy will be
ineffective.
Let us now consider a quite different scenario with imperfect capital mobility,
fixed exchange rates, and a fiscal expansion. In this case, the BOP curve is upward-
sloping, as shown in Figure 14.2. The expansionary fiscal policy (increase in G)
causes the IS curve to shift to the right. This causes the interest rate to rise, which
in turn implies large capital inflows, i.e. foreign people buy domestic government
bonds. This puts pressure on the exchange rate to appreciate ( falls). In order to
keep the fixed exchange rate, the central bank must increase money supply, which
shifts the LM curve to the right. The end result is an increase in income from Y0
to Y2 , as shown in the figure. Fiscal policy will thus be relatively effective under
these circumstances, but it would have been even more effective if there had been
perfect capital mobility.
r
LM
IS′ LM′
IS
BOP
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Y
Y0 Y1 Y2
Figure 14.2 Expansionary fiscal policy with fixed exchange rates and imperfect capital
mobility.
Dornbusch (1976) shows that when expectations are taken into account under flex-
ible exchange rates, there might be overshooting in the adjustment of the exchange
rate which makes these extra volatile.
The model builds on two key premises. The first is that real interest rate differ-
entials between the home country and the rest of the world should be explained by
the following expression:
rt = r f + E t (et+1 − et ) (14.5)
The home-country nominal interest rate at time t should be equal to the interest
rate abroad r f plus the expected change in the exchange rate between t + 1 and t.
Should we, for instance, observe that rt > r f , then it must be the case that the
home currency is expected to depreciate so that E t (et+1 − et ) > 0.6 Relation (14.5)
shows the equality that must prevail if an investor is going to be indifferent about
investing a certain amount of money either in the home currency or in the foreign
currency and is often referred to as the uncovered interest rate parity.
The other basic premise is the standard expression for equilibrium in the money
market:
M
= L(rt , Yt )
P
The price level P is assumed to adjust only slowly, as in most Keynesian models.
To illustrate the main implications of the model, consider an expansionary
monetary policy from the central bank at time t that increases nominal money
148 Macroeconomic policy
supply M. With prices held constant in the short run, this means that real money
supply increases. The only way that this can be balanced in the short run is through
a fall in the real interest rate rt so that money demand rises.
Let us assume that initially there are no interest differentials between the
two countries. The expansion in nominal money then has two effects. The
lower interest rate in the home country should, according to (14.5), imply that
E t (et+1 − et ) < 0, i.e. the home currency is expected to appreciate. However, as
shown in the Mundell–Fleming framework in Figure 14.1, a monetary expansion
will lead to an outflow of capital due to the lower domestic interest rate and a
depreciation of the currency in the longer run. The only way to reconcile equilib-
ria in the two markets is if the exchange rate initially depreciates by more than
Downloaded by [Hacettepe University] at 20:58 10 April 2017
its long-run equilibrium value. This would allow it to appreciate over time so
that it eventually satisfies the uncovered interest rate parity condition. Therefore
the exchange rate overshoots its long-run equilibrium value and then gradually
appreciates.
It is important to note that this scenario will only occur if the aggregate price
level is sticky. It will not happen if prices immediately adjust to the increase in
money supply.
• Labor mobility and an integrated labor market. If this is not the case, workers
will remain in depressed regions and expanding regions will be constrained by
lack of access to labor.
• Capital mobility and price and wage flexibility.
• A common risk sharing mechanism so that adversely affected regions are sup-
ported by the more fortunate ones. This might, for instance, take the form of
income redistribution of tax revenues.
• The regions included should have similar business cycles. If certain regions
have cycles that are very asymmetric compared with the majority of regions,
the adversely affected regions might have benefited from depreciation but are
unable to carry out any with a joint currency.
The literature in this area has recently been primarily concerned with the euro
area. Recent events during 2010 appear to confirm that certain countries (such as
Greece) clearly do not fit some of Mundell’s requirements very well. In general,
The open economy 149
capital mobility across Europe is fairly good but labor mobility is still not substan-
tial. A common risk sharing mechanism is perhaps about to be created, but it is
doubtful whether the euro countries typically face symmetric shocks. Apart from
the economic criteria cited above, there might of course also be political reasons
why countries choose to form currency unions.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
15 Mathematical Appendix
Downloaded by [Hacettepe University] at 20:58 10 April 2017
15.1 Introduction
This book requires that the student be familiar with certain tools of differential cal-
culus. Fortunately, it is sufficient to limit our analysis to functions of one variable.
The following concepts are discussed:
The material of the appendix is rather standard. More detailed presentations can
be found in most first-year calculus textbooks.
f (x) − f (x 0 )
f (x 0 ) = lim (15.1)
x→x0 x − x0
f (x) = 0 ∀ x ∈ R (15.2)
Mathematical appendix 151
Identity function: f (x) = x
f (x) = 1 ∀ x ∈ R (15.3)
Power: f (x) = x n
The exponent could in principle be any constant n. The derivative of this
function is given by
Notice that the identity function is a special case of a power with n = 1. To verify
the previous rule, we substitute n = 1 in equation (15.4) and we obtain f (x) =
Downloaded by [Hacettepe University] at 20:58 10 April 2017
x 0 = 1.
Exponential function: f (x) = e x
The exponential is probably the most important function in calculus. Its base is
the irrational number e ≈ 2.7. It has the very interesting and unique1 property of
being neutral to differentiation. Namely,
f (x) = e x ∀x ∈R (15.5)
1
f (x) = ∀ x ∈ (0, ∞) (15.6)
x
Take, for example, the function f (x) = 4x 3 . It is easy to see that f (x) =
4 · 3x 3−1 = 12x 2 . Setting c = −1 and combining the two rules (sum of func-
Downloaded by [Hacettepe University] at 20:58 10 April 2017
tions and multiplication with a scalar) entails a similar rule for the difference
f = f 1 − f 2 , namely, f (x) = f 1 (x) − f 2 (x).
Notice that the function is defined only for those x that do not make f 2 equal to 0,
since otherwise the division would not be defined in the first place. An example of
x 2 +1 2 +1) x 2 −2x−1
such a function is f (x) = x−1 , which yields f (x) = 2x(x−1)−(x2 = ,
(x−1) x 2 −2x+1
for every x = 1.
This is a compound function in the sense that h is a function of some other func-
tion g, which is a function of x. That is to say, one could set g(x) = x 2 + 1 and
Mathematical appendix 153
f (y) = e yand by plugging g(x) into y obtain h(x) = f (g(x)) = e g(x) . Since g is
depends on x and f depends on g, it is straightforward that f depends on x.
Formally, consider two functions g(x) and f (g(x)). We define the compound
function as follows:
same time, changing g(x) would lead to a change in f that depends on g. Thus,
the initial change in x has triggered a chain reaction. Through the effect on the
intermediate function, it has caused a change in f . The size of this change would
be equal to the size of the change in g times the size of the change that g has
caused to f . That is,
2 +1
then h (x) = 2xe x
2 +1
For example, if h(x) = e x .
change. In discrete time, these are usually expressed by a percentage, but since
time is continuous we are forced to go to the limit and examine their relative
change in an infinitesimally small time interval.
Consider, for instance, a variable X (t) as a function of time. The instantaneous
relative change is given by the growth rate of X with respect to time. Denoting the
derivative2 at t by Ẋ (t), we can write the growth rate as a fraction ẊX (t)
(t) .
Remember from the previous sections that the derivative of the logarithmic
function is given by d(ln X)
d X = X . Then taking the derivative of ln X (t) with respect
1
d ln X (t) 1 d X (t)
= (15.13)
dt X (t) dt
d ln X (t) Ẋ (t)
= (15.14)
dt X (t)
Thus, the growth rate of X (t) is nothing but the derivative of the logarithmic
function.
Preface
1 Any macro model that, for instance, assumes away unemployment (like many DGE
Downloaded by [Hacettepe University] at 20:58 10 April 2017
models routinely do) seems to have missed something important about how economies
work.
1 Introduction
1 GDP does not include the production of intermediate goods or services, nor production
on the black market. The market value of public services like health care are further
very imprecisely measured in the national accounts.
2 An individual’s purchase of a car for personal use counts as consumption in the national
accounts, although it might intuitively be seen as an investment.
growth model follow a structure that is similar to the equivalent sections in Barro and
Sala-i-Martin (2004).
3 In general, in this type of model, we might view the competitive and monopoly prices
P ∗ ∈ {1, 1/α} as the two extremes in a range of levels of competition.
4 This is clearly a great simplification. In reality, R&D is usually a very risky enterprise
that frequently produces no useful results.
5 In most endogenous growth models, T is assumed to be infinite. In reality, patents
typically last for about 20 years.
6 Aghion and Howitt (1992) assume that the arrival of new innovations follows a Poisson
process. Here we will leave that process undefined.
7 The inclusion of a human capital stock is a slight departure from the model in Acemoglu
et al (2003) but in line with their general argument.
7 Financial Crises
1 A famous recent example of an individual failure of a financing institution is of course
the collapse of Lehman Brothers in September 2008.
2 Recent examples include Sweden in 1991 and Iceland in 2008. With some exceptions,
the financial crisis of 2008 did not turn into a general bank run among traditional banks,
mainly because of various lender insurance schemes supported by governments.
3 One might think of the utility function as describing an individual who is impatient with
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Chapter 15).
15 Mathematical Appendix
1 Actually the whole family of exponential functions f (x) = cex , c ∈ R, has this property.
2 In macroeconomics, it is conventional to use this notation.
Bibliography
Downloaded by [Hacettepe University] at 20:58 10 April 2017
University Press.
Friedman, M. (1968) The role of monetary policy. American Economic Review, 58(March),
1–17.
Gali, J. and M. Gertler (2007) Macroeconomic modeling for monetary policy evaluation.
Journal of Economic Perspectives, 21(4), 25–46.
Galor, O. and D. Weil (2000) Population, technology and growth: From the Malthusian
regime to the demographic transition. American Economic Review, 110, 806–828.
Hall, R.E. (1978) Stochastic implications of the life cycle-permanent income hypothesis.
Journal of Political Economy, 86(6), 971–987.
Hibbs, D.A. (1977) Political parties and macroeconomic policy. American Political Science
Review, 71(4), 1467–1487.
Jones, C.I. (1995) R&D-based models of endogenous growth. Journal of Political Econ-
omy, 103(August), 759–784.
Krugman, P. (1988) Financing vs forgiving a debt overhang. Journal of Development
Economics, 29, 253–268.
Kydland, F. and E. Prescott (1977) Rules rather than discretion: The inconsistency of
optimal plans. Journal of Political Economy, 85(June), 473–492.
Kydland, F. and E. Prescott (1982) Time to build and aggregate fluctuations. Econometrica,
50, 1345–1370.
Laibson, D. (1997) Golden eggs and hyperbolic discounting. Quarterly Journal of Eco-
nomics, 112(2), 443–477.
Lindbeck, A. and D. Snower (1986) Wage setting, unemployment, and insider-outsider
relations. American Economic Review, 76(2), 235–239.
Long, J.B. and C.I. Plosser (1983) Real business cycles. Journal of Political Economy, 91,
39–69.
Lucas, R.E. (1972) Expectations and the neutrality of money. Journal of Economic Theory,
4(April), 103–124.
Lucas, R.E. (1973) Some international evidence of output-inflation tradeoffs. American
Economic Review, 63(3), 326–334.
Malthus, T. (1798) An Essay on the Principle of Population. London.
Mankiw, G. (1985) Small menu costs and large business cycles: A macroeconomic model
of monopoly. Quarterly Journal of Economics, 100(May), 529–539.
Mankiw, G., D. Romer, and D. Weil (1992) A contribution to the empirics of economic
growth. Quarterly Journal of Economics, 107(2), 407–437.
Mortensen, D. and C. Pissarides (1994) Job creation and job destruction in the theory of
unemployment. Review of Economic Studies, 61(3), 397–415.
Mundell, R. (1961) A theory of optimum currency areas. American Economic Review,
51(4), 657–665.
Obstfeld, M. and K. Rogoff (1999) Foundations of International Macroeconomics,
Cambridge, MA: MIT Press.
Bibliography 163
Olsson, O. and D.A. Hibbs (2005) Biogeography and long-run economic development.
European Economic Review, 49(4), 909–938.
Phelps, A.W. (1968) Money-wage dynamics and labor market equilibrium. Journal of
Political Economy, 76(July), 678–711.
Phillips, A.W. (1958) The relationship between unemployment and the rate of change of
money wages in the united kingdom, 1861–1957. Economica, 25(November), 283–299.
Ramsey, F. (1928) A mathematical theory of saving. Economic Journal, 38(152), 543–559.
Rebelo, S. (1991) Long-run policy analysis and long-run growth. Journal of Political
Economy, 96(June), 500–521.
Rebelo, S. (2005) Real business cycle models: Past, present, and future. NBER Working
Paper 11401, NBER.
Reinhart, C. and K.S. Rogoff (2009a) The aftermath of financial crises. American Economic
Review: Papers and Proceedings, 99(2), 466–472.
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Reinhart, C. and K.S. Rogoff (2009b) This Time Is Different? Eight Centuries of Financial
Folly. Princeton, NJ: Princeton University Press.
Rogoff, K. and A. Sibert (1985) The optimal degree of commitment to an intermediate
monetary target. Quarterly Journal of Economics, 100(4), 1169–1189.
Romer, D. (1993) The new Keynesian synthesis. Journal of Economic Perspectives, 7(1),
5–22.
Romer, D. (2005) Advanced Macroeconomics Macroeconomics. Boston: McGraw-Hill.
Romer, P. (1986) Increasing returns and long-run growth. Journal of Political Economy,
94(October), 1002–1037.
Romer, P. (1990) Endogenous technological change. Journal of Political Economy,
98(October), S71–S102.
Romer, P. (1994) The origins of endogenous growth. Journal of Economic Perspectives,
8(Winter), 3–22.
Schumpeter, J. (1934) The Theory of Economic Development (translated from the German
version published in 1912). New York: Oxford University Press.
Shapiro, C. and J.E. Stiglitz (1984) Equilibrium unemployment as a worker discipline
device. American Economic Review, 74(3), 433–444.
Solow, R. (1956) A contribution to the theory of economic growth. Quarterly Journal of
Economics, 70(February), 65–94.
Sørensen, P.B. and H.J. Whitta-Jakobsen (2005) Introducing Advanced Macroeconomics:
Growth and Business Cycles. Maidenhead: McGraw-Hill.
Taylor, J.B. (1993) Discretion versus policy rules in practice. Carnegie-Rochester Confer-
ence Series on Public Policy, 39, 195–214.
Woodford, M. (2010) Financial intermediation and macroeconomic analysis. Journal of
Economic Perspectives, 24(4), 21–44.
Index
Downloaded by [Hacettepe University] at 20:58 10 April 2017
Greece 121, 123, 148 individual consumption 12, 43–4, 46, 55–6,
gross domestic private investment 2 156n2
gross domestic product (GDP) 1, 156n1; individual maximization problem 56–7
and budget deficit 115; calculating individuals 1
2–3; expenditure approach 2–3; and industrial revolution, the 14, 14–5
government debt 115; income side inflation 1, 104, 105; and aggregate
2–3; and the national accounts identity demand 126; and aggregate supply
2–3; production side 2–3; and trade 126; binding inflation commitments
139 128; central bank policy 159n3;
gross national income 160n1 delegation of monetary authority 132;
growth models, fundamental assumption discretionary monetary policy 128,
7–8 129–31, 130; equilibrium 130–1, 130,
growth rate: endogenous 49; of the 131, 133; and monetary policy
logarithmic function 154; of a 126–38; and the money market 127,
multiplicative function 153–4 160n1; and output growth 126; the
Phillips curve 107–8, 109, 128; and
habit formation 77 political business cycles 126, 132–6,
Haiti 19 135; public expectations 129, 130–1,
Hall, R.E. 71–2, 73 131; and the quantity theory of money
Harrod neutrality 157n8 126–7; random output shocks 131–2;
Hibbs, D.A. 14–5, 132 and reputation 132; and seigniorage
high-powered money 104 137–8, 138; subgame perfect Nash
home country, the 140 equilibrium 130; the Taylor rule
household optimization 45–7; basic 136–7; time-inconsistency model of
assumptions 42; intertemporal budget monetary policy 126, 127–32
constraint 43–4; logarithmic utility inflation bias 128, 129
44–5; marginal utility curve 43, 43; inflation tax 137–8; Laffer curve 138, 138
utility functions 42–7, 43 innovations 40–1
households: behavior 42; budgets 1; insider–outsider theory, wage setting 88,
equilibrium business cycles and 55–7 94–5
housing market 85–7, 86 instantaneous utility function 66–7, 72
housing stock 85 intellectual property rights 33, 38–9
Howitt, P. 29, 30, 32, 39–40, 157n6 interest cost 82
human capital 26–8, 41, 157n7 interest rates 1, 56, 75–6; and credit supply
human capital investments 159n1 111–4, 112, 113; differentials 147;
hyperinflation 137, 137–8, 160n9 housing market 85, 86, 86;
international 125, 160n5; IS–MP
Iceland 158n2 model 104; Keynesian investment
ideas 26, 31 function 80; real 57; uncovered
identity function 151 interest rate parity 147
168 Index
intermediate goods sector 34, 34–6, 38–9 production 8; production function
international credit markets, liberalization 89–90; profit maximization 55; R&D
64 32; search and matching framework
international economic policy-making 1–2 89; skills 41; and technological
International Monetary Fund (IMF) 123–4, progress 25–6
125 labor demand 55, 88, 90, 93, 158n3, 159n1
international politics 1–2 labor market 32, 55, 88–98; disequilibrium
intertemporal budget constraint 43–4, 46, 88–9, 89; efficiency wage theory 88,
60, 142, 157n3 89, 89–94, 91; insider–outsider theory
investment demand 159n6 88, 89, 94–5; integrated 148; job
investment irreversibility 84 creation 96, 97; Poisson arrival rate
investment projects 59, 60 96; search and matching framework
investment returns 60 88, 95–8, 97; tightness 96
Downloaded by [Hacettepe University] at 20:58 10 April 2017