Project Report: A Linear Programming Approach For Dynamic System Control With Inequality Constraints
Project Report: A Linear Programming Approach For Dynamic System Control With Inequality Constraints
Project Report: A Linear Programming Approach For Dynamic System Control With Inequality Constraints
1 Introduction
The presence of inequality constraint on the state variables and control effort is a distinc-
tive feature of many practical control problems. In control of mechanical servo systems,
the limitation of the control effort may comes from the physical limitation as the actua-
tor’s maximum ability, and the limitation of state variables may comes from the control
objectives. For example, when controlling the motion of a machine tool, the position
accuracy of the tool should be limited in certain range to avoid damage the part being
cutting. Another example will be when a robot manipulating, the robot arm should be
remained in certain range to avoid collision. Further more, these constraints can change
with time in an unpredictable manner due to the change of the environment, equipment
availability, or a shift in control objectives. Thus control in the presence of constraints
is important and interesting from a practical point of view.
Linear programming (LP) has proved to be a powerful method to solve optimal control
problems for linear systems with linear equality and inequality constraints. Significant
attention has been paid to the classical time-optimal and energy-consumption-optimal
problems. [1] have proposed resolving the time-optimal control problem on-line at each
sample instant to generate a feedback control strategy. [2] gives a comprehensive intro-
duction on the linear programming applying on optimal control.
In this project, a feedback control strategy is presented for multi-variable control
problem with inequality constraints on the states and input control variables. At each
sample instant, a linear programming problem is solved in real-time to determine the
optimal value of the control input with minimize a linear performance index while sat-
isfying the inequality constraints. The potential advantages of this linear programming
based control strategy are:
• It provide a systematic, flexible way for control problems with large numbers of
variables and constraints.
1
• Efficient computational techniques, such as the Simplex method, are available for
solving linear programming problems.
• The system model, constraints and objective function can be easily modified in
real time control if needed, thus it has the potential to deal with some nonlinear
dynamic systems.
On the other hand, this linear programming based optimal control strategy have the
potential disadvantages:
• The system model, performance measuring index, and constraints must be linear
in order to be able to change into a standard linear programming problem.
• Since an LP problem will be solved at every time step, thus the computer re-
quirements must be carefully evaluated. Also, this time of computation limits the
sampling rate of the digital controller, thus will not be suitable for the systems
with fast dynamics.
ẋ = Ax + Bu + Gd (1)
where x is the n × 1 state vector. u is the r × 1 control input vector, where r ≤ n.
d is the vector of the unknown disturbance.
In order to use a digital controller, the system needed to be transformed into a
discrete-time state-space model by means of forward Euler approximation. By keeping
the sample time as a constant dt, the discrete-time model of the system is
α≤ u≤β (3)
and the systems state vector is subject to the constraint of
θ≤x≤λ (4)
where α, β, θ and λ are constant vectors with the proper dimension, and they are
specified accordingly to control requirements and the system constraints.
We want to design a control law for the system to make the state vector x to track a
desired trajectory xd. In the next section, the control policy based on linear programming
is presented in detail.
2
3 LP based control policy
In this section we will use the linear programming approach to determine the control
signal u(k) based on the measurement at the certain sample instant. Here we assume that
all the state variables can be measured. If this is not the case, then a state estimator such
as Kalman filter [3] can be used to generate an estimation of the full state measurement.
In the linear programming based control approach, the dynamic model in Equation 2
and the current measurement are used to predict the state variables in the next sample
step:
where wi is the non-negative weighting factor for the i-th state variable. These
weighting factors are used to penalize prediction errors in the more important state
variables.
We want to use linear programming to minimize the performance index in Equation
8 subject to all the constraints in every time step. In the following of this section, the
major task is to transform this problem into a standard linear programming problem.
The standard linear programming formulation requires the unknown variables to be
non-negative. In order to transform the objective function 8 to a linear form and to
unsure the unknown variables are non-negative, we introduce new variables xi+(k + 1)
and x−i (k + 1), and their definition are:
3
equality constraint is assumed. We can change the inequality constraints in Equation 3
and 4 into equality constraints by introducing new variables.
First let us use a change of variable to make the constraint on control input u in
Equation 3 into single-side inequality. It is convenient in the LP formulation to express
Equation 3 as
0 ≤ ũ(k) ≤ β − α (9)
where
ũ(k) = u(k) − α (10)
Then we introduce slack and surplus variables to convert the inequality constraints
into equality constraints. Since ũ(k) has single side inequality constraints, and x(k)
has the constraints on two sides, thus the one slack variable vector and one surplus
variable vector are introduced for x(k), and only one slack variable is introduced for ũ(k).
By putting all the variables together, in the transformed standard linear programming
problem the unknown variable is
Σ ΣT
z = x+ x− ũ x+
s x−
s us+ (11)
where the subscript s denotes either slack or surplus variable. Based on this unknown
variable defination, the performance index can be written as:
x+
−
x
Σ Σ ũ
J= w w 0 0 0 0 (12)
x+ s
xs−
+
us
and the linear equality constraint in the standard linear programming problem is
x+
I −I −∆ 0 0 0 x− Φx(k) − xd(k) + ∆α
(13)
= θ − xd(k)
II −I
−I 0
0 0 0 0
I −I 0 xũ+ λ − xd(k)
s
β−α
0 0 I 0 0 I x−
s
s +
u
We can write the problem in a more compact form as
4
min J = cT z s.t. Az = b (14)
Thus the control problem is successfully changed into a linear programming problem,
and we can see that the size of the unknown in the linear programming problem is
N = 4n + 2r, and the number of constraints is M = 3n + r. We can also see that the
matrix A is full rank, that is, rank(A) = M . In the next section some implementation
details of the linear programming solver is demonstrated.
4 Linear programming
In this project, the simplex method is used to find the optimal solution for the standard
linear programming problem we formulated in the previous section. The selection of
simplex method over the interior method have two reasons: first, the simplex search is
fast, especially when the problem size is small, and this is a major consideration when
this control algorithm is used for a real-time control when the computation must finish
within one sample time step. Second, the interior point method gives an approximation
of the optimal solution, which may cannot make the best use of the control with satisfying
the constraint on it.
The idea of the simplex method is very simple: since the feasible set of the problem
is a simplex, then simply check the corners can find the optimal point. It moves from
corner to corner, decreasing the value of the objective function. As pointed out in the
textbook, since the number of corners can grow exponentially with the problem size, the
worst case of simplex method could be slow. Luckily in practice the best corner is found
quickly.
Let us write the linear programming problem that we are discussing again as
1. Chooses one variable, called the entering variable, from the nonbasic variables and
adds the corresponding column of the non-basis to the basis.
2. Chooses a variable, called the leaving variable, from the basic variables and removes
the corresponding column from the basis.
5
3. Updates the current solution and the current objective value.
The selection of the entering and the leaving variables is done by solving two linear
systems while maintaining the feasibility of the solution. The simplex method code in
this project followed the examples given in the [4] and [5].
5 Simulation result
In this section, the LP based control is tested with several different system, they are:
(a) a double integrator plant; (b) a N-th order integrator plant, and (c) a two-link
robot arm’s trajectory control. The first two systems are linear system and are used to
test the scalability of the linear programming based control method. The third system
is a nonlinear plant, and when using this proposed control method, a linearization is
performed at every time step based on the present configuration of the robot arm. This
test is to show that this on-line optimization control method is also applicable to nonlinear
system (require the system’s nonlinear dynamics is “smooth” enough so that it can be
linearized at a certain time instant).
d x 0 1 x 0
= + u (16)
dt ẋ 0 0 ẋ 1
A physical interpretation of this system might be: controlling the position of a con-
centrated mass by controlling the force acting on it, that is, F = u, and set the mass of
the block to be unity. Figure 1 shows a diagram of the system.
m
F
We discretized this system with a sampling time of 1ms. In this problem we want
the system to follow the given trajectory of
6
xd 5 × sin(t) (17)
=
x˙d 5 × cos(t)
−6 ≤x1 ≤ 6 (18a)
−50 ≤x2 ≤ 50 (18b)
−umax ≤u ≤ umax (18c)
(18d)
Here we set the limit on the control input to be a variable umax. A simulation of
this plant using the proposed linear programming is performed. First we set the limit
on control effort umax = 10. Since in steady state the needed u is smaller than this, this
constraint basically means there is no constraint on the input of the system. Figure 2
shows the tracking performance and corresponding control input.
x 10
−3 Tracking error of the double integrator plant − a simplest test Control effort of the double integrator plant
Tracking performance of the double integrator plant − a simplest test 10 5
Position error e
6 x
Position x 9.5 4
Desired trajectory xd
4 9 3
8.5 2
poisiton (m)
7.5 0
0
7 −1
−2 6.5 −2
6 −3
−4
5.5 −4
−6 5 −5
0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10
0 2 4 6 8 10 time (s) time (s)
time (s)
Figure 2: Tracking performance of the double integrator plant with umax = 10.
To add some limit to the control input, we then set the limit on control input as
umax = 4.5. Figure 3 shows a tracking performance of the system and the corresponding
control input signal u.
7
Tracking performance of the double integrator plant − a simplest test Tracking error of the double integrator plant − a simplest test Control effort of the double integrator plant
6 0.25 5
Position x Position error ex
Desired trajectory xd
4
0.2 3
2
2
poisiton (m)
0 0
0.1 −1
−2
−2
0.05 −3
−4
−4
−6 0
0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 −5
time (s) time (s) 0 1 2 3 4 5 6 7 8 9 10
time (s)
Figure 3: Tracking performance of the double integrator plant with umax = 4.5.
We can see that in Figure ?? the control input signal is no longer linear, and the
upper and lower limit of the control input signal shows that the linear programming
based method can effectively limit the control effort within the limit.
A simulation of this system is also performed using a very commonly used opti-
mal feedback controller based on a quadratic performance index, often known as linear
quadratic regulator (LQR) control [6]. Figure 4 shows a tracking performance and the
control input signal u. The optimal controller is designed with penalty weight on the
two state variables are 10 and 1 respectively, and the penalty coefficient on the control
input is 0.1. The calculated feedback gain by means of LQR is K = [105.5], and the
corresponding control input is u(k) = − K(x(k) −xd(k)). The simulation result shows
that significant larger control effort is used while the tracking error is worth than the
on-line optimization method. Even when a hard constraint is given to the control out-
put, like the simulation with umax = 4.5, the tracking performance of the LP control is
still better then the feedback control method. However, it need to point out that this
feedback control algorithm takes significantly less real-time computation then the LP
method.
Tracking performance of the double integrator plant − a simplest test Tracking error of the double integrator plant − a simplest test Control effort of the double integrator plant
6 0.8 6
Position x Position error ex
Desired trajectory xd
0.6
4 4
0.4
2 2
control effort − force (N)
0.2
poisiton (m)
poisiton (m)
0 0 0
−0.2
−2 −2
−0.4
−4 −4
−0.6
−6 −0.8 −6
0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10
time (s) time (s) time (s)
Figure 4: Tracking performance of the double integrator plant with LQR control.
8
5.2 N-th order integrator plant control
In order to study the performance of the scalability of this control strategy, a n-th order
integrator system is used to test this control algorithm. The plant’s dynamics equation
is
x(n) = u (19)
Transform this system into a state space form, the the system’s dynamic equation is
x1 0 1 0 ··· 0 x1 0
x2 0 0 1 ··· 0 x2 0
d ..
. = .. . . .. . + .. u (20)
dt
xn−1 0 0 0 ··· 1 xn−1 0
xn 0 0 0 ··· 0 xn 1
Again, we discretize the system with a sampling time of 1 ms, and the control goal
of the system is also to make the variable of interest x1 to track the desired trajectory
xd = 5 ×sin(t).
First we show the tracking performance of this n-th order system. In this test case the
dimension of the linear system is set to be n = 400, thus the dimension of the standard
linear programming problem has a size of N = 4n + 2r = 1602. Setting the constraint of
the state variable to be very loose, and define the limitation on the control input similar
as that of the double integrator system. In the first test case we put the limitation as
umax = 10, which means there is no limitation when the system is in steady state. The
simulation result is presented in Figure 5.
Control effort of the N−th order integrator plant N = 400
Tracking performance of the N−th order integrator plant N = 400 x 10
−3 Tracking error of the N−th order integrator plant N = 400
10
6 6
Position x
Desired trajectory xd
4 8
4
2 6
0 4
control effort − force (N)
−2 2
poisiton (m)
poisiton (m)
0 −4 0
−6 −2
−2
−8 −4
−10 −6
−4
−12 −8
−6 −14 −10
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
time (s) time (s) ti me (s)
Figure 5: Tracking performance of the 400-th order integrator plant with umax = 10.
In this simulation, we can see that the maximum tracking error is 12 × 103, which is
good comparing with a control design with a reasonable feedback gain.
9
Similar to the double integrator system, in order to add some constraint to the system,
we set the limit on control input as umax = 4. The order of the linear dynamic system
is kept as n = 400. Figure 6 shows a tracking performance of the system, where
Tracking performance of the double integrator plant − a simplest test
6 −3
x 10 Tracking error of the N−th order integrator plant N = 400 Control effort of the N−th order integrator plant N = 400
Position x 6
Desired trajectory xd
4
4
4
3
2
2 2
0
−2 1
poisiton (m)
0
−4 0
−6 −1
−2
−8
−2
−10
−4 −3
−12
−4
−6 −14
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
time (s) time (s) time (s)
Figure 6: Tracking performance of the 400-th order integrator plant with umax = 4.
The signal in Figure 6 (c) shows that the LP based control successfully limited the
control variable. The tracking error hasn’t changed much. This experiment proofs the
effectiveness of the LP based control strategy for high-dimension system.
Another test based on this system is the run time of the computation at a single
sample instant. We measured the runtime of the simulation of 10 sample instants and
then take the average to get the runtime of one linear programming problem, and used the
size of the linear dynamic system as n = 50, 100, 150, ..., 1000. The corresponding linear
programming problem will have a size of N = 4n + 2r , and for this test system r = 1.
Figure 7 plots the runtime of one sample instant over the size of the linear programming
problem N, where the blue line shows the simulation time using the Matlab internal
function ‘linprog’, and the green line shows the simulation time using a self-implemented
simplex function.
The simulation shows that the average runtime of the linear programming is approx-
imately linear with the problem size. However, when this linear programming based
control strategy is really used with a actual system, the worst case must be considered
because the computation has to be done with in one time step. Thus the sample rate of
the system might be limited by the computation time. This is a major challenge to this
proposed control strategy, which may make it not suitable for problems with a large size.
10
3
1.5
0.5
0
0 2000 4000 6000 8000 10000
Size of the linear programming problem
Figure 7: Runtime experiment for the linear programming based control problem.
Figure 8: Diagram of the two-link planar robot arm system. θ1 and θ2 are the angles of
the two joints. l1 and l2 are the length of the two linkages. r1 and r2 denote the
distance between the center of mass of the linkage to the corresponding joint. The
control input of the system are torques acting at the two joints.
11
I1 + I2 m2 l1 r2 cosθ2 I2 + m2 l1 r2 cosθ2 θ¨1 −2m2 l1 r2 θ˙2 −m2 l1 r1 sinθ2 θ˙2 θ̇1 u1
+ =
I2 + m2l1r2cosθ2 I2 θ¨2 −m2 l1 r1 sinθ2 θ˙1 θ2 0 ˙ u2
(21)
By define a vector q = [θ1 θ2]T , and u = [u1 u2]T , the system can be written in a
more compact form as
2. Determine the x-y space desired end-effector position (xd, yd) and the corresponding
desired velocity;
3. Use inverse kinematics transformation of the robot model to calculate the desired
trajectory in the joint space [qd q̇d ]T ;
12
2
1.5
0.5
−0.5
−1 −0.5 0 0.5 1 1.5 2
Simulation of this two link robot is performed with a weighting factor to the state
tracking errors to be w = [10 10 1 1]. Figure 10 presents the simulation result of the
tracking performance, tracking error and the control input signals.
−4 Tracking error of the two−link robot
Tracking performance of two−link robot arm 6
x 10
2 Joint1 angle error e1 Control effort of the 2−link robot
Joint1 angle q1 Joint2 angle error e2 1
Joint2 angle q2 Torque on joint 1
1
1.8 Trajectory q1
d 4 Torque on joint 2
2
Trajectory q1 0.5
d
1.6
2 0
1.4
control effort − force (N)
poisiton (rad/s)
poisiton (rad/s)
−0.5
0
1.2
−1
1
−2
−1.5
0.8
−4
0.6 −2
0.4 −6 −2.5
0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10
time (s)
time (s) time (s)
The maximum tracking error during the cycle is 6 × 10−4. This simulation validated
the effectiveness of the LP based control for smooth nonlinear dynamic systems.
13
that have hard inequality constraints on state variables and control input variables. The
control problem is successfully changed into a standard linear programming problem,
and the simplex method is used to find the proper control signal. Several dynamic
systems, either linear or nonlinear, are simulated to test the performance of the LP
based controller. Simulation results demonstrate that when the computation time is
not limiting the sampling rate, the LP based control have the advantages of: (a) better
tracking accuracy, (b) can deal with smooth nonlinear system, (c) can strictly limit the
variables within the constraints.
Further exploration in this topic can be: (a) Sensitivity study of the control perfor-
mance by sweeping the parameters, such as the constraint limits and weighting coeffi-
cients; (b) study the effect of noise and disturbance signals to the system; (c) real-time
implementation of this control strategy on an embedded computer and test the control
on some hardware.
In the process of working on this project, the most challenging part should be changing
the control problem into a linear programming problem, while the most time-consuming
part is the implementation of the simplex method and the three systems for simulation.
I am lucky to have worked on a project based on optimization, thus I can get a chance to
explore more in depth in this material although there is no homework for this chapter.
Significant thanks for this course! All the neither easy nor straightforward problem sets
have gave me great chance for practicing Matlab coding, so that I can work this project
out. Thank you 18.086!
References
[1] L. Zadeh and B. Whalen, “On optimal control and linear programming,” Automatic
Control, IRE Transactions on, vol. 7, no. 4, pp. 45–46, 1962.
[3] R. E. Kalman and R. S. Bucy, “New results in linear filtering and prediction theory,”
Journal of basic engineering, vol. 83, no. 1, pp. 95–108, 1961.
[6] A. E. Bryson, Applied optimal control: optimization, estimation and control. CRC
Press, 1975.
14