dp-intro dynamic programming
January 2009
This note shows informally, using a specific example, how a sequential problem can be represented recursively as a dynamic programming problem. It should help you build intuition for the Bellman equation.
\[
\max_{\{c_t,\, a_{t+1}\}_{t=0}^{\infty}} \; \sum_{t=0}^{\infty} \beta^{t} u(c_t) \tag{1}
\]
\[
\text{s.t.} \quad c_t + a_{t+1} Q_t = y_t + a_t, \qquad a_{t+1} \geq -A_{t+1},
\]
\[
\text{given } a_0 \text{ and } \{y_t\}_{t=0}^{\infty}.
\]
Rewrite problem (1) in terms of assets as control variables and consider a finite-horizon case $t = 0, 1, 2, \ldots, T$:
\[
\max_{\{a_{t+1}\}_{t=0}^{T}} \; \sum_{t=0}^{T} \beta^{t} u(y_t + a_t - a_{t+1} Q_t) \tag{2}
\]
Also, to simplify further discussion, assume the borrowing constraint does not bind.
Solving problem (2) requires finding the sequence $\{a_1, a_2, \ldots, a_{T-1}, a_T, a_{T+1}\}$.
Suppose $\{a_1, a_2, \ldots, a_T\}$ are chosen and $a_{T+1}$ needs to be decided. Then the problem is a one-period optimization with a single control variable $a_{T+1}$:
\[
\max_{a_{T+1}} \; u(y_T + a_T - a_{T+1} Q_T) \tag{3}
\]
The solution to this problem is obvious: $a_{T+1} = 0$, because the return on the asset cannot be enjoyed after death. Note that to solve the problem we only need to know $\{a_T, y_T\}$ and not the whole sequence of assets and endowment. That is, $\{a_T, y_T\}$ are individual state variables in problem (3). Denote by $V^T(a_T, y_T)$, called a value function, the maximum attained by the objective function given $a_T$:
\[
V^T(a_T, y_T) \equiv \max_{a_{T+1}} u(y_T + a_T - a_{T+1} Q_T) = u(y_T + a_T) \tag{4}
\]
and denote the associated optimal decision rule by
\[
a_{T+1} = g^T(a_T, y_T).
\]
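To fix ideas, the following minimal numerical sketch tabulates $V^T$ and $g^T$ on a grid. The log utility and the grid ranges are illustrative assumptions, not part of the note.
\begin{verbatim}
import numpy as np

# Illustrative utility function (log); any increasing, concave u works the same way.
def u(c):
    return np.log(c)

# Illustrative grids for last-period assets a_T and endowment y_T.
a_grid = np.linspace(0.0, 5.0, 101)
y_grid = np.linspace(0.5, 1.5, 11)

# In the last period it is optimal to save nothing, a_{T+1} = g^T(a_T, y_T) = 0,
# so V^T(a_T, y_T) = u(y_T + a_T).
V_T = u(a_grid[:, None] + y_grid[None, :])   # V_T[i, j] = V^T(a_grid[i], y_grid[j])
g_T = np.zeros_like(V_T)                     # terminal decision rule: a_{T+1} = 0
\end{verbatim}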
Next, suppose $\{a_1, a_2, \ldots, a_{T-1}\}$ are chosen, and $\{a_T, a_{T+1}\}$ need to be decided. Then the problem becomes a two-period optimization with two control variables $\{a_T, a_{T+1}\}$ and state variables $\{a_{T-1}, y_{T-1}, y_T\}$:
\[
\max_{\{a_T,\, a_{T+1}\}} \; u(y_{T-1} + a_{T-1} - a_T Q_{T-1}) + \beta\, u(y_T + a_T - a_{T+1} Q_T) \tag{5}
\]
Note that the problem can be solved in two steps. First we can find the optimal decision rule for $a_{T+1}$ given $a_T$, and then find the optimal decision rule for $a_T$ given $a_{T-1}$. That is, we can rewrite problem (5) as follows:
\[
V^{T-1}(a_{T-1}, \hat{y}^{T-1}) \equiv \max_{a_T} \left\{ u(y_{T-1} + a_{T-1} - a_T Q_{T-1}) + \beta \max_{a_{T+1}} u(y_T + a_T - a_{T+1} Q_T) \right\} \tag{6}
\]
where $V^{T-1}(a_{T-1}, \hat{y}^{T-1})$ denotes the maximum welfare attained over two periods given $a_{T-1}$ and the two-period history of income $\hat{y}^{T-1} \equiv \{y_{T-1}, y_T\}$. Using the notation introduced in the previous step (one-period problem) we get
\[
V^{T-1}(a_{T-1}, \hat{y}^{T-1}) = \max_{a_T} \left\{ u(y_{T-1} + a_{T-1} - a_T Q_{T-1}) + \beta V^T(a_T, y_T) \right\}
\]
with the associated optimal decision rule
\[
a_T = g^{T-1}(a_{T-1}, \hat{y}^{T-1}).
\]
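A minimal grid-search sketch of this two-step construction, under illustrative assumptions (log utility, a given income pair $\{y_{T-1}, y_T\}$, a constant price, and an evenly spaced asset grid), might look like this:
\begin{verbatim}
import numpy as np

def u(c):
    return np.log(c)   # illustrative utility

Q_Tm1, beta = 0.96, 0.95              # illustrative price Q_{T-1} and discount factor
y_Tm1, y_T = 1.0, 1.0                 # a given two-period income history {y_{T-1}, y_T}
a_grid = np.linspace(0.0, 5.0, 201)   # grid used for both a_{T-1} and a_T

# Step 1 (inner problem): V^T(a_T, y_T) = u(y_T + a_T), with g^T = 0.
V_T = u(y_T + a_grid)

# Step 2 (outer problem):
# V^{T-1}(a_{T-1}) = max_{a_T} u(y_{T-1} + a_{T-1} - a_T Q_{T-1}) + beta V^T(a_T).
V_Tm1 = np.empty_like(a_grid)
g_Tm1 = np.empty_like(a_grid)         # optimal a_T as a function of a_{T-1}
for i, a_prev in enumerate(a_grid):
    c = y_Tm1 + a_prev - a_grid * Q_Tm1                  # consumption at each candidate a_T
    val = np.where(c > 0, u(np.maximum(c, 1e-12)) + beta * V_T, -np.inf)
    j = int(np.argmax(val))
    V_Tm1[i], g_Tm1[i] = val[j], a_grid[j]
\end{verbatim}
Infeasible choices (those implying $c \le 0$) are simply ruled out by assigning them a value of $-\infty$ before taking the maximum.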
The optimal decision rule $g^{T-1}(a_{T-1})$ is obtained from the following F.O.C.:
\[
u'(y_{T-1} + a_{T-1} - a_T Q_{T-1})\, Q_{T-1} = \beta\, \frac{\partial V^T(a_T)}{\partial a_T} \tag{7}
\]
Since optimal $a_{T+1} = 0$, $V^T(a_T, y_T) = u(y_T + a_T)$, hence the F.O.C. becomes
\[
u'(y_{T-1} + a_{T-1} - a_T Q_{T-1})\, Q_{T-1} = \beta\, u'(y_T + a_T).
\]
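As a quick sanity check, this condition can be reproduced symbolically by differentiating the two-period objective; the sketch below uses sympy with log utility, both purely illustrative choices.
\begin{verbatim}
import sympy as sp

# Symbols for the states, the choice a_T, the price, and the discount factor.
a_prev, a_T, y_prev, y_T, Q, beta = sp.symbols('a_prev a_T y_prev y_T Q beta', positive=True)

# Two-period objective with the terminal choice a_{T+1} = 0 already imposed,
# using u = log purely for illustration:
objective = sp.log(y_prev + a_prev - a_T * Q) + beta * sp.log(y_T + a_T)

# Setting d(objective)/da_T = 0 reproduces
# u'(y_{T-1} + a_{T-1} - a_T Q_{T-1}) Q_{T-1} = beta u'(y_T + a_T) for u = log.
foc = sp.Eq(sp.diff(objective, a_T), 0)
print(foc)
\end{verbatim}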
Now continue solving problem (2) in a similar fashion. That is, suppose $\{a_1, a_2, \ldots, a_{T-2}\}$ are chosen, and $\{a_{T-1}, a_T, a_{T+1}\}$ need to be decided. You should get a value function corresponding to the three-period problem:
\[
V^{T-2}(a_{T-2}, \hat{y}^{T-2}) = \max_{a_{T-1}} \left\{ u(y_{T-2} + a_{T-2} - a_{T-1} Q_{T-2}) + \beta V^{T-1}(a_{T-1}, \hat{y}^{T-1}) \right\}
\]
where $\hat{y}^{T-2} \equiv \{y_{T-2}, y_{T-1}, y_T\}$.
Iterating all the way to period 0, we get a value function that corresponds to the complete problem (2):
\[
V^0(a_0, \hat{y}^0) = \max_{a_1} \left\{ u(y_0 + a_0 - a_1 Q_0) + \beta V^1(a_1, \hat{y}^1) \right\}
\]
where $\hat{y}^0 \equiv \{y_0, y_1, \ldots, y_T\}$.
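The whole backward pass can be sketched in a few lines. To keep the example short, the sketch assumes constant income $y_t = y$ and a constant price $Q_t = Q$ (the note allows arbitrary given sequences); log utility and the grid are also illustrative.
\begin{verbatim}
import numpy as np

def u(c):
    return np.log(c)   # illustrative utility

T, Q, beta, y = 10, 0.96, 0.95, 1.0     # illustrative horizon, constant price and income
a_grid = np.linspace(0.0, 5.0, 201)

V = u(y + a_grid)                       # start from V^T(a_T) = u(y_T + a_T)
policy = [np.zeros_like(a_grid)]        # g^T: a_{T+1} = 0
for t in range(T - 1, -1, -1):          # backward: t = T-1, ..., 0
    c = y + a_grid[:, None] - Q * a_grid[None, :]            # c_t for each (a_t, a_{t+1}) pair
    val = np.where(c > 0, u(np.maximum(c, 1e-12)) + beta * V[None, :], -np.inf)
    V = val.max(axis=1)                                      # V^t(a_t)
    policy.append(a_grid[val.argmax(axis=1)])                # g^t(a_t): optimal a_{t+1}
policy.reverse()                        # policy[t] now gives a_{t+1} as a function of a_t
\end{verbatim}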
Now consider a stationary environment: let the asset price $Q_t = Q$ be constant, and impose some restrictions on the individual endowment process $\{y_t\}_{t=0}^{\infty}$ such that a law of motion for the endowment can be described by $y' = \Gamma(y)$. Then the sequence of value functions $\{V^t(a_t, \hat{y}^t)\}_{t=0}^{T}$ converges to a time-invariant function $V(a, y)$ as $T \longrightarrow \infty$ (infinite horizon), and the sequential problem (1) admits the following recursive representation:
\[
V(a, y) = \max_{a'} \left\{ u(y + a - a' Q) + \beta V(a', y') \right\} \tag{10}
\]
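In practice, the fixed point $V$ can be approximated by iterating on the right-hand side of (10) (value function iteration). The sketch below assumes log utility, a constant price $Q$, and a deterministic law of motion $\Gamma$ that maps a small income grid into itself; all of these are illustrative choices, not part of the note.
\begin{verbatim}
import numpy as np

def u(c):
    return np.log(c)   # illustrative utility

Q, beta = 0.96, 0.95                         # illustrative constant price and discount factor
a_grid = np.linspace(0.0, 5.0, 201)
y_grid = np.array([0.9, 1.0, 1.1])           # small income grid
Gamma = np.array([1, 2, 0])                  # illustrative deterministic law of motion y' = Γ(y)

V = np.zeros((a_grid.size, y_grid.size))     # initial guess V_0 = 0
g = np.zeros_like(V)                         # decision rule a' = g(a, y)
for _ in range(2000):
    V_new = np.empty_like(V)
    for j, y in enumerate(y_grid):
        c = y + a_grid[:, None] - Q * a_grid[None, :]       # c(a, a')
        cont = V[:, Gamma[j]]                               # V(a', y') with y' = Γ(y)
        val = np.where(c > 0, u(np.maximum(c, 1e-12)) + beta * cont[None, :], -np.inf)
        V_new[:, j] = val.max(axis=1)
        g[:, j] = a_grid[val.argmax(axis=1)]
    if np.max(np.abs(V_new - V)) < 1e-8:     # sup-norm stopping rule
        V = V_new
        break
    V = V_new
\end{verbatim}
Each pass applies the Bellman operator on the grid, and iteration stops once successive value functions agree to the stated tolerance in the sup norm.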
The F.O.C. with respect to $a'$ is
\[
u'(y + a - a' Q)\, Q = \beta\, \frac{\partial V(a', y')}{\partial a'} \tag{11}
\]
Using the Envelope theorem, obtain
\[
\frac{\partial V(a, y)}{\partial a} = u'(y + a - a' Q)
\]
so that the F.O.C. can be written as
\[
u'(y + a - a' Q)\, Q = \beta\, u'(y' + a' - a'' Q),
\]
which is the standard consumption Euler equation.
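For completeness, here is a symbolic check of the envelope step in the two-period case, again with log utility as an illustrative assumption: differentiating the maximized two-period value with respect to current assets indeed returns $u'$ of current consumption.
\begin{verbatim}
import sympy as sp

a_prev, y_prev, y_T, Q, beta = sp.symbols('a_prev y_prev y_T Q beta', positive=True)
a_T = sp.symbols('a_T')

# Two-period objective with log utility (illustrative) and a_{T+1} = 0 already imposed.
obj = sp.log(y_prev + a_prev - a_T * Q) + beta * sp.log(y_T + a_T)

g = sp.solve(sp.diff(obj, a_T), a_T)[0]      # optimal a_T from the F.O.C.
V = obj.subs(a_T, g)                         # maximized value V^{T-1} as a function of the states
envelope_gap = sp.simplify(sp.diff(V, a_prev) - 1 / (y_prev + a_prev - g * Q))
print(envelope_gap)                          # 0: dV/da_{T-1} equals u'(c_{T-1}) at the optimum
\end{verbatim}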