Optimal Control and Dynamic Programming
1 Optimal Control
We consider first simple unconstrained optimization problems, and then demonstrate how these may be generalized to handle constrained optimization problems. With the observation that an optimal control problem is a form of constrained optimization problem, variational methods are used to derive an optimal controller, which embodies Pontryagin's Minimum Principle. Subsequently, an alternative approach based on Bellman's Principle of Optimality and dynamic programming is used to derive the Hamilton-Jacobi equation.
Consider first $L : \Re \to \Re$. We want to find
\[ \min_u L(u). \]
Let us assume that $L$ is sufficiently smooth, and consider the Taylor expansion
\[ L(u) = L(u_0) + \frac{dL}{du}\Big|_{u=u_0} (u - u_0) + \frac{1}{2}\frac{d^2 L}{du^2}\Big|_{u=u_0} (u - u_0)^2 + \dots \]
Then we have the necessary condition
\[ \frac{dL}{du}\Big|_{u=u_0} = 0 \]
and the sufficient condition
\[ \frac{d^2 L}{du^2}\Big|_{u=u_0} > 0. \]
Note that these are only conditions for a local minimum. Additional condi-
tions are required to find the global minimum if the function is non-convex.
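As a quick numerical check of these conditions (an illustrative sketch, not part of the derivation; the function below is a hypothetical example), the first and second derivatives can be evaluated symbolically:

import sympy as sp

# Hypothetical example: L(u) = (u - 2)**2 + 1, a smooth convex function.
u = sp.symbols('u')
L = (u - 2)**2 + 1

dL = sp.diff(L, u)        # first derivative
d2L = sp.diff(L, u, 2)    # second derivative

# Necessary condition: dL/du = 0 at the candidate point u0.
candidates = sp.solve(sp.Eq(dL, 0), u)
for u0 in candidates:
    # Sufficient condition: d2L/du2 > 0 at u0 (local minimum).
    print(u0, d2L.subs(u, u0) > 0)   # -> 2 True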
If we have a function of more than one variable, that is $L : \Re^n \to \Re$, we have the following conditions:
\[ \frac{\partial L}{\partial u}\Big|_{u=u_0} = \left\{ \frac{\partial L}{\partial u_1}, \frac{\partial L}{\partial u_2}, \dots, \frac{\partial L}{\partial u_n} \right\}\Big|_{u=u_0} = 0 \]
and
\[ \frac{\partial^2 L}{\partial u^2}\Big|_{u=u_0} = \begin{pmatrix} \frac{\partial^2 L}{\partial u_1^2} & \dots & \frac{\partial^2 L}{\partial u_1 \partial u_n} \\ \vdots & \ddots & \vdots \\ \frac{\partial^2 L}{\partial u_n \partial u_1} & \dots & \frac{\partial^2 L}{\partial u_n^2} \end{pmatrix}\Big|_{u=u_0} > 0. \]
Necessary condition:
\[ \frac{\partial L}{\partial x} = \begin{pmatrix} 12x_1 + 3x_2 - 8 \\ 3x_1 + 4x_2 + 3 \end{pmatrix} = 0 \]
Solving these equations we find $x_0 = \left(\tfrac{41}{39}, -\tfrac{20}{13}\right)$, and when we insert $x_0$ into the Hessian matrix we see that
\[ \frac{\partial^2 L}{\partial x^2}(x_0) = \begin{pmatrix} 12 & 3 \\ 3 & 4 \end{pmatrix}. \]
The resulting matrix is positive definite. We conclude that the point $x_0$ is a minimum.
The plot in Figure 1 shows the function for which we are finding the mini-
mum.
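The stationary point and the definiteness check can also be reproduced numerically; the following sketch (not part of the original notes) uses only the gradient and Hessian entries given in the example above:

import numpy as np

# Gradient of L set to zero:
#   12*x1 + 3*x2 - 8 = 0
#    3*x1 + 4*x2 + 3 = 0
A = np.array([[12.0, 3.0],
              [3.0, 4.0]])
b = np.array([8.0, -3.0])

x0 = np.linalg.solve(A, b)            # stationary point, approx (41/39, -20/13)
print(x0)                             # [ 1.0513 -1.5385]

# The Hessian is constant for this example; check positive definiteness via eigenvalues.
H = A                                 # Hessian equals the coefficient matrix here
eigvals = np.linalg.eigvalsh(H)
print(eigvals, np.all(eigvals > 0))   # all positive -> x0 is a minimum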
Consider now the constrained problem
\[ \min_x L(x) \quad \text{subject to} \quad f(x) = 0, \]
with $L : \Re^n \to \Re$ and $f : \Re^n \to \Re^m$. Then this problem is equivalent to finding the stationary points of the function
\[ H(x, \lambda) = L(x) + \lambda^T f(x), \]
which is called the Hamiltonian of the optimization problem. The coefficients $\lambda \in \Re^m$ are called the Lagrange multipliers of the system.
Proof
Without loss of generality we consider $x \in \Re^2$ and a scalar constraint. The necessary conditions for a minimum are
\[ \frac{\partial H}{\partial (x, \lambda)} = 0, \]
or
\[ \frac{\partial H}{\partial x_1} = \frac{\partial L}{\partial x_1} + \lambda \frac{\partial f}{\partial x_1} = 0 \]
\[ \frac{\partial H}{\partial x_2} = \frac{\partial L}{\partial x_2} + \lambda \frac{\partial f}{\partial x_2} = 0 \]
\[ \frac{\partial H}{\partial \lambda} = f(x_1, x_2) = 0. \]
The third condition is equivalent to the constraint of the original problem being satisfied.
The first two conditions are equivalent to saying that the vectors
\[ \begin{pmatrix} \frac{\partial L}{\partial x_1} \\ \frac{\partial L}{\partial x_2} \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} \frac{\partial f}{\partial x_1} \\ \frac{\partial f}{\partial x_2} \end{pmatrix} \]
are parallel, or collinear. If these vectors are parallel, then the matrix
\[ \begin{pmatrix} \frac{\partial L}{\partial x_1} & \frac{\partial f}{\partial x_1} \\ \frac{\partial L}{\partial x_2} & \frac{\partial f}{\partial x_2} \end{pmatrix} \]
has rank less than 2, which means that the linear system obtained by equating the derivative of the Hamiltonian to zero has a non-trivial solution in $\lambda$.
With the help of a diagram in Figure 2, it is easy to understand that where
we have a minimum or maximum the two gradients (with the red vector
representing the gradient of L and the black vector representing the gradient
of f ) have to be parallel, as otherwise one can increase or decrease the value
of L while satisfying the constraint f (x) = 0.
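The following small sketch (an illustrative problem, not from the notes) applies exactly these stationarity conditions to the hypothetical problem of minimizing $x_1^2 + x_2^2$ subject to $f(x) = x_1 + x_2 - 1 = 0$:

import numpy as np

# Hypothetical example: minimize L(x) = x1^2 + x2^2 subject to f(x) = x1 + x2 - 1 = 0.
# Hamiltonian: H(x, lam) = x1^2 + x2^2 + lam*(x1 + x2 - 1).
# Stationarity (dH/dx1 = dH/dx2 = dH/dlam = 0) gives a linear system:
#   2*x1        + lam = 0
#          2*x2 + lam = 0
#   x1 + x2           = 1
A = np.array([[2.0, 0.0, 1.0],
              [0.0, 2.0, 1.0],
              [1.0, 1.0, 0.0]])
b = np.array([0.0, 0.0, 1.0])

x1, x2, lam = np.linalg.solve(A, b)
print(x1, x2, lam)   # 0.5 0.5 -1.0: the gradients of L and f are parallel at the minimum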
The optimal control problem is to find a control
\[ u : [0, T] \to \Re^m \]
such that the performance index is minimized and the final-state constraint and the system equations are satisfied.
Theorem 2 Solutions of the Optimal Control Problem also solve the following set of differential equations:
State equation:
\[ \dot{x} = H_\lambda = f(x, u) \tag{4} \]
Co-state equation:
\[ -\dot{\lambda} = H_x = \frac{\partial L}{\partial x} + \lambda^T \frac{\partial f}{\partial x} \tag{5} \]
Optimality condition:
\[ 0 = H_u = \frac{\partial L}{\partial u} + \lambda^T \frac{\partial f}{\partial u} \tag{6} \]
State initial condition:
\[ x(0) = x_0 \tag{7} \]
Co-state final condition:
\[ \lambda^T(T) = \left( \phi_x + \nu^T \psi_x \right)\big|_{x=x(T)} \tag{8} \]
Here $\phi$ denotes the terminal cost, $\psi$ the final-state constraint and $\nu$ the corresponding multiplier.
Proof The first variation of the augmented performance index is
\[ \delta J = \left[ \phi_x + \nu^T \psi_x - \lambda^T \right] \delta x \big|_{x=x(T)} + \int_0^T \left[ (H_x + \dot{\lambda}^T)\,\delta x + H_u\,\delta u + (H_\lambda - \dot{x}^T)\,\delta \lambda \right] dt + \psi(x)^T \delta \nu \big|_{x=x(T)}. \]
Now, for the function $u : [0, T] \to \Re^m$ to minimize the cost function, $\delta J$ must be zero for any value of the differentials. Thus, all the expressions multiplying the differentials have to be zero for every $t \in [0, T]$. This observation gives the equations as required.
Remark 1 Just as in the static optimization case, where the zeros of the
derivatives represent candidates to be tested for extremum, the solutions of
the system described in Theorem 2 are to be seen as candidates to be the
optimal solution and their optimality must be tested for each particular case.
In other words, Pontryagin's Minimum Principle delivers necessary, but
not sufficient, conditions for optimality.
for given values of x(t) and λ(t). In other words, Pontryagin's Minimum Principle states that the Hamiltonian is minimized over all admissible u for the optimal values of the state and co-state.
Remark 4 Special attention is needed in the case where H_u is constant with respect to u, i.e. H_u does not depend on u. In this case the solution is found where the constraints on u are active. This sort of solution is called a bang-bang solution.
Example (minimal energy): minimize
\[ J = \int_0^T \tfrac{1}{2} u^2(t)\, dt \]
subject to
\[ \dot{x}_1 = x_2, \quad x_1(0) = x_{10} \]
\[ \dot{x}_2 = u, \quad x_2(0) = x_{20} \]
\[ \psi(x(T)) = x(T) = 0. \]
The equations of Theorem 2 become
\[ \dot{x}_1 = H_{\lambda_1} = x_2; \quad x_1(0) = x_{10} \]
\[ \dot{x}_2 = H_{\lambda_2} = u; \quad x_2(0) = x_{20} \]
\[ -\dot{\lambda}_1 = H_{x_1} = 0; \quad \lambda_1(T) = \nu_1 \]
\[ -\dot{\lambda}_2 = H_{x_2} = \lambda_1; \quad \lambda_2(T) = \nu_2 \]
where
\[ H(x, u, \lambda) = \tfrac{1}{2} u^2 + \lambda_1 x_2 + \lambda_2 u. \]
Now we see that
\[ H_u = u + \lambda_2 = 0. \]
Thus, we can solve the differential equations and see that
\[ \lambda_1(t) = \nu_1 \]
\[ \lambda_2(t) = -\nu_1 (t - T) + \nu_2 \]
\[ u(t) = \nu_1 (t - T) - \nu_2. \]
Placing these linear expressions in the dynamic equations for x and using
the initial conditions x(0) = x0 , we obtain a linear system of equations with
respect to (1 , 2 ), which gives us the final parametrization of the control law
u. Figure 3 shows the result of applying this control law with T = 15.
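The computation of (ν1, ν2) and the resulting trajectory can be sketched numerically as follows; the initial conditions below are assumed values chosen for illustration only, not taken from the notes:

import numpy as np

# Sketch with assumed data: solve for (nu1, nu2) so that x(T) = 0 under
# u(t) = nu1*(t - T) - nu2, then simulate the double integrator.
T = 15.0
x10, x20 = 1.0, 0.0           # assumed initial conditions

# Integrating x2dot = u and x1dot = x2 analytically and imposing x1(T) = x2(T) = 0
# gives the linear system  A @ [nu1, nu2] = b:
A = np.array([[-T**2 / 2.0, -T],
              [-T**3 / 3.0, -T**2 / 2.0]])
b = np.array([-x20, -x10 - x20 * T])
nu1, nu2 = np.linalg.solve(A, b)

# Forward Euler simulation to check that the state reaches the origin at time T.
dt, x1, x2 = 1e-3, x10, x20
for k in range(int(T / dt)):
    t = k * dt
    u = nu1 * (t - T) - nu2    # minimal-energy control law
    x1 += x2 * dt
    x2 += u * dt
print(x1, x2)                  # both close to 0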
Example (minimum time): we now minimize the time needed to reach the origin,
\[ J = \int_0^T 1\, dt = T, \]
subject to the same double-integrator dynamics and the input constraint
\[ -1 \le u(t) \le 1, \quad \forall t \ge 0. \]
Now
\[ H(x, u, \lambda) = 1 + \lambda_1 x_2 + \lambda_2 u. \]
We notice that $H_u = \lambda_2$, i.e. $H_u$ does not depend on u, and thus the extremum is reached at the boundaries, i.e. $u^*(t) = \pm 1$ for all $t \ge 0$. Using the Minimum Principle we see that
\[ u^*(t) = -\operatorname{sign}(\lambda_2(t)). \]
Figure 4 depicts the result of deploying this control. Note that the control law
is discontinuous. Further, we observe that the system now needs less time to
reach the origin than in the previous Minimal Energy example. Of course,
this success is obtained at the cost of deploying more actuator energy.
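The bang-bang character of this law can be illustrated with assumed multiplier values (hypothetical numbers, not from the notes): since λ2(t) is affine in t, the control u*(t) = -sign(λ2(t)) is piecewise constant and switches at most once.

import numpy as np

# Illustration only (assumed multiplier values): lambda2(t) = -nu1*(t - T) + nu2
# is affine in t, so u*(t) = -sign(lambda2(t)) switches at most once.
T = 5.0
nu1, nu2 = 1.0, -2.0             # assumed final co-state values

t = np.linspace(0.0, T, 12)
lam2 = -nu1 * (t - T) + nu2      # co-state lambda2 along the horizon
u = -np.sign(lam2)               # minimum-time control: u* = -sign(lambda2)
print(np.column_stack((t, u)))   # u switches from -1 to +1 where lam2 crosses zero (around t = 3)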
\[ V(x(t), t) = J_2(t) \]
Definition 1 The function V(x, t) is called the value function or the cost to go. It represents the value of the solution of the optimal control problem starting at x at the time t, subject to the system dynamics
\[ \dot{x} = f(x(t), u(t)), \]
and with the boundary condition
\[ V(x, T) = \phi(x(t_f), t_f). \]
It follows that
\[ -\frac{dV}{dt} = \min_u L(x(t), u(t)) \]
\[ -\left( \frac{\partial V}{\partial t} + \frac{\partial V}{\partial x}\frac{dx}{dt} \right) = \min_u L(x(t), u(t)) \]
\[ -\frac{\partial V}{\partial t} = \min_u \left\{ \frac{\partial V}{\partial x} f(x(t), u(t)) + L(x(t), u(t)) \right\} \]
\[ -\frac{\partial V}{\partial t} = \min_u H\!\left(x, \frac{\partial V}{\partial x}, u\right). \]
Definition 2 This equation is known as the Hamilton-Jacobi equation.
This is a partial differential equation in V, with boundary condition $V(x, T) = \phi(x(t_f), t_f)$. Solving this equation gives the solution of the optimal control problem, with the control being a state feedback law given by
\[ u^\star = \arg\min_u H\!\left(x, \frac{\partial V}{\partial x}, u\right). \]
Note that the co-state $\lambda$ introduced in the previous section is then given by $\frac{\partial V}{\partial x}$.
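As a minimal sketch of how the value function can be computed numerically (the scalar system, cost and grid below are assumed for illustration, not taken from the notes), one can apply the dynamic programming recursion V(x, t) = min_u [ L(x, u) Δt + V(x + f(x, u) Δt, t + Δt) ] backward in time on a grid:

import numpy as np

# Assumed illustrative problem: xdot = u, running cost L = x^2 + u^2,
# zero terminal cost, horizon [0, T]; V is computed on a state grid.
T, dt = 1.0, 0.01
xs = np.linspace(-2.0, 2.0, 201)       # state grid on [-a, a] with spacing Delta
us = np.linspace(-3.0, 3.0, 61)        # control grid

V = np.zeros_like(xs)                  # terminal condition V(x, T) = 0
for _ in range(int(T / dt)):           # march backwards from T to 0
    V_new = np.empty_like(V)
    for i, x in enumerate(xs):
        # Bellman recursion over the control grid, with linear interpolation of V.
        x_next = np.clip(x + us * dt, xs[0], xs[-1])
        cost = (x**2 + us**2) * dt + np.interp(x_next, xs, V)
        V_new[i] = cost.min()
    V = V_new

# Value at t = 0 for two sample states.
print(V[np.argmin(np.abs(xs))], V[np.argmin(np.abs(xs - 1.0))])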
When the Hamilton-Jacobi-Bellman equation is to be solved in practical cases, numerical methods seem appropriate. However, the solution of the Hamilton-Jacobi equation by numerical methods is only tractable for low-dimensional systems. For higher-dimensional systems, the number of data points required grows exponentially with the dimension of the system (due to the space and time discretization). For example, for a nonlinear system of dimension n where the value function is to be determined on a hyper-cube $[-a, a]^n$ with a spatial discretization $\Delta$, a total of $\left(\frac{2a}{\Delta}\right)^n$ points will have to be stored. For even modest requirements the amount of data becomes very large and the problem becomes intractable.
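For instance (illustrative numbers only), with a = 1 and Δ = 0.01 the point count (2a/Δ)^n grows as follows:

# Illustrative only: number of grid points (2a/Delta)^n for a = 1, Delta = 0.01.
a, delta = 1.0, 0.01
for n in range(1, 7):
    points = (2 * a / delta) ** n
    print(n, f"{points:.0e}")   # 2e+02, 4e+04, 8e+06, ... exponential growth in n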