Decentralized Robust Control For Multi-Vehicle Navigation
Decentralized Robust Control For Multi-Vehicle Navigation
Decentralized Robust Control For Multi-Vehicle Navigation
TuD08.1
I. I NTRODUCTION
The research effort in multi-agent system relies on the fact
that multiple agents have the possibility to solve problems
more efciently than a single agent. Cooperative control
of multiple vehicles gives rise to signicant theoretical and
technical challenges and has various engineering applications
including cooperative transportation of a large payload, formation ying of unmanned aerial vehicles [12], and multi
agent gaming such as robot soccer or robot rescue [2], [17],
[23].
In this paper, the problem of interest is to control a formation of autonomous mobile robots through a cluttered environment. Here, the vehicles are dynamically decoupled but
have common constraints that make them interact between
each other. Indeed, each robot has to avoid collision with the
other ones. Moreover, some communication links between
some vehicles must be maintained during the movement.
Since the available power onboard the vehicle is limited,
the distance between two vehicles which may exchange information will naturally be constrained. Besides maintaining
the communication, the feasibility of the trajectories implies
the respect of the dynamic constraints, as well as avoiding
obstacles and collisions. In this paper, we will focus solely
on these dynamic and geometric aspects and ignore mobile
networking factors, such as fading, cross talk, and delay,
This work was partially supported by programs Interreg ACOS, TAT T31
and ARCIR Robocoop.
M. Defoort, T. Floquet and W. Perruquetti are with LAGIS UMR CNRS
8146, Ecole Centrale de Lille, BP 48, Cite Scientique, 59 651 VilleneuvedAscq, France
A. Kokosy is with ISEN, 41 bvd Vauban, 59 046 Lille Cedex, France
ISBN 978-3-9524173-8-6
2150
TuD08.1
Z
YQ Q
[Q\Q
<$;,6
Q
;$;,6
Fig. 1.
cos n (t) 0
f (qn (t), un (t)) = sin n (t) 0 un (t)
0
1
un (t) = [vn (t), wn (t)]T .
vn (t) and wn (t) are respectively the linear and angular
velocities. The dynamic model (1) is constrained to take into
account the practical limitations on the vehicle velocities, i.e:
|vn (t)| vn,max
.
|wn (t)| wn,max
(2)
mn {1, . . . , Mn }.
(3)
(1)
2151
TuD08.1
(4)
(5)
qn qn, f in 2 + un 2
C n,com (t).
(6)
A. Conict sets
n=1
where
,
R+ are
T constants, q =
T
T weighting
T
T
T
q1 , . . . , qN
and u = u1 , . . . , uN
are the concatened
optimal trajectory and optimal control input. Note that
L(q , u ) = 0 if and only if qn = qn, f in and un = 0.
In the following, in order to reduce the computational
complexity and communication requirements, a distributed
strategy is stated. One idea is to only include, for each
receding horizon controller, the robots that could have direct
conicts (i.e. may produce collision or may lost the communication). It enables a distributed optimization based on
local information.
(7)
2152
min
tk +Td
un (t,tk ) tk
qn (t,tk ) qn, f in 2 +
un (t,tk ) 2 dt
TuD08.1
subject to
n (t,tk ) =
q
f (
qn (t,tk ),
un (t,tk ))
|
vn (t,tk )| vn,max vn
n (t,tk )| wn,max wn
|w
d(
qn (t,tk ), omn ) + rmn , omn O n (tk1 )
n (tk ,tk ) = qn (tk ,tk1 ),
for all t [tk ,tk + Td ] and where q
+
+
vn R , wn R .
Remark 2: The inclusion of the constants vn and wn in
the constraints of the motion planning generator guarantees
that there is sufcient control authority to track the optimal
planned trajectory (see Section IV).
Given the conict sets C n,collision (tk1 ), C n,com (tk1 ), the
n (t,tk ) and the ones of its neighbors, let
assumed trajectory q
us dene the optimal control problem Pn (tk1 ) associated
with the nth robot which consists of determining the optimal
control input un (t,tk ) and the optimal predicted trajectory
qn (t,tk ) = [xn (t,tk ), yn (t,tk ), n (t,tk )]T which satises all the
constraints (1)-(5):
Problem Pn (tk1 ):
min
tk +T
un (t,tk ) tk
qn (t,tk ) qn, f in 2 + un (t,tk ) 2 dt
qq (t,tk ))
d(qn (t,tk ),
qn (t,tk ))
d(qn (t,tk ),
(9)
(10)
(11)
+ rmn ,
(12)
Rq C n,com (tk1 )
2 +
(13)
min(dn,com , dq,com ) (14)
(15)
2153
TuD08.1
the others.
C. Technique for solving receding horizon control problem
In this part, the optimal control problem Pn (tk ) is solved
using atness properties, B spline parametrization and sequential quadratic programming.
Remark 7: One can easily apply the same technique to
solve problem Pn (tk ).
Using the atness property [9] of system (9), all system
variables can be differentially parameterized by xn , yn and
a nite number of their time derivatives. Indeed, n , vn and
wn can be expressed by xn , yn and their rst and second time
derivatives, i.e.
n
vn
= arctan xyn
n
=
xn2 + y2
n
.
yn xn xn yn
=
xn2 + y2
n
wn
(16)
Once the cost and the constraints are mapped into the at
output space, the optimal trajectory is planned in this space.
Indeed, the optimal control problem Pn (tk ) becomes:
tk +T
tk qn qn, f in 2 dt
2
min
tk +T
yn xn xn yn T
2
2
xn ,yn
xn + yn , 2
dt
+ tk
2
xn + yn
subject to t [tk ,tk + T ]
xn2 + y2
n
yn xn xn yn
x2 + y2
n
vn,max vn
wn,max wn
omn O n (tk1 )
d(qn , omn )
R p C n,collision (tk1 )
p)
d(qn , q
d(qn , qq )
n )
d(qn , q
+ rmn ,
, Rq C n,com (tk1 )
2 +
min(dn,com , dq,com )
For each at outputs, i.e xn and yn , an initial guess trajec (t,t ) and y (t,t )) which respects the terminal
tory (xn,ini
k
k
n,ini
constraints is computed.
Then, in order to transform the optimal trajectory generation problem into a parameter optimization one, a piecewise
polynomial function, B-spline, is adopted to approximate
the trajectory. The B-spline functions are chosen as basis
functions due to their exibility and ease of enforcing
continuity across breakpoints.
The trajectories of the at outputs are dened as:
xn (t,tk ) =
yn (t,tk ) =
(t,t ) + ln a B (t,t ),
xn,ini
j=1 j,n j,3 k
k
n
yn,ini (t,tk ) + lj=1
bj,n B j,3 (t,tk )
(17)
x (t,tk )
cos (t,tk ) 0
y (t,tk ) = sin (t,tk ) 0 v (t,tk )
(18)
w (t,tk )
0
1
(t,tk )
where the optimal predicted velocities v (t,tk ) and w (t,tk )
satisfy:
|v (t,tk )| vmax v
.
|w (t,tk )| wmax w
By directly applying v (t,tk ) and w (t,tk ) obtained in section
III.C, the robot does not follow the reference trajectory
with a good accuracy. It is obvious that the real control
v(t,tk ) and w(t,tk ) rely on the state measurements q(t,tk ).
Furthermore, one can note that there are tracking errors at
time tk , i.e q(tk ,tk )
= q (tk ,tk ). Due to measurement noise
and modeling uncertainties, there are input uncertainties for
v(t,tk ) and w(t,tk ). That is to say, the real equation of the
robot trajectory actually fullls the following differential
equation: t [tk ,tk+1 ],
x(t,t
k)
cos (t,tk ) 0
v(t,tk ) + v
y(t,t
k ) = sin (t,tk ) 0
(19)
w(t,tk ) + w
0
1
(t,tk )
where v and w represent the uncertainties. Due to saturation constraints, it is assumed that:
| v| < v
.
| w| < w
The controls v(t,tk ) and w(t,tk ) must be designed such
that system (19) follows the reference (18) in spite of the
perturbations. In fact, the goal is to asymptotically stabilize
the tracking errors ex (t,tk ) = x (t,tk ) x(t,tk ), ey (t,tk ) =
y (t,tk ) y(t,tk ) and e (t,tk ) = (t,tk ) (t,tk ) to zero
while respecting the following constraints:
2154
| v | vmax
.
| w | wmax
(20)
TuD08.1
Transforming the tracking errors expressed in the inertial
frame to the robot frame, the error coordinates can be
denoted as:
cos
ex
sin 0
e1
e2 = sin cos 0 ey .
e3
e
0
0
1
Accordingly, the tracking-error model is represented by:
v cos e3
e1
1 e2
e2 = v sin e3 + 0 e1 v + v .
w + w
e3
w
0
1
(21)
(22)
uid is the ideal control and uism represents the ISM part
which is designed to be discontinuous in order to reject the
perturbation.
The rst part of the control design is to nd a saturated
control law uid such that the nominal system e = g1 (e) +
g2 (e)uid is globally asymptotically stable in spite of initial
tracking errors (i.e. q(tk ,tk )
= q (tk ,tk )). The chosen control
law uid has been developed by Jiang in [14]. The motivation
for such a choice is that this design takes into account the
actuator bounds. It is described by:
v cos e3 + 3 tanh e1
(23)
uid = w + 1 v e2 sin e3 + tanh e
2
3
1+e2 +e2 e3
1
(24)
[e1 , e3 ]T ,
s0 (e) =
z
= se0 (g1 (e) + g2 (e)uid ),
s0 (e(tk ))
z(tk ) =
(25)
where
(26)
2155
TuD08.1
3
1.2m
1.2m
1.2m .
1.2m
1.2m
2.5
2
1.5
1
0.5
safe
Fig. 2.
0
0
Communication graph
A5
A2
(a)
10
15
1.2
A3
0.8
0.6
dq
,q
dq
,q
dq
,q
dq
,q
1
2
0.4
0.2
2
3
4
5
q ,q
1
0
0
10
15
(b)
4
Fig. 4.
Fig. 3.
Algo.
Comput.
time
Comm.
links
Practise
Feasibility
Guarantee
CEN
LF[16]
DEC1[15]
DEC2
10 sec
with all
vehicles
infeasible
if
N1
YES
0.5 sec
with
neighbors
dependance
on the
leader
YES
5 sec
with
neighbors
high
robustness
properties
NO
0.3 sec
with
neighbors
high
robustness
properties
YES
VI. C ONCLUSIONS
The proposed algorithm, based on receding horizon control, provides a solution to the stabilization problem for a
formation of robots which is subject to constraints (control
bounds, obstacle avoidance and bounded distances between
robots) and uncertainties (measurement uncertainties,. . . ).
The main contribution of this work is the design of a robust
decentralized receding horizon controller guaranteeing the
collision avoidance and the maintenance of communication
links using only local information. Simulations on a formation of ve mobile robots highlighted the good results in term
of computational time to generate a conict free trajectory.
2156
R EFERENCES
[1] R. Brockett, Asymptotic stability and feedback stabilization, in R.
Brockett, R. Millman, and H. Sussmann (eds.), Differential geometric
control theory (Boston, MA: Birkhauser), pp. 181-195, 1983.
TuD08.1
[2] C. Clark, A coordination platform for multi-robot systems, PhD
Thesis, Standford University, 2004.
[3] C. Canudas de Wit and O. Sordalen, Exponential stabilization of mobile robots with nonholonomic constraints, IEEE Trans. on Automatic
Control, 37, No 11, pp. 1791-1797, 1992.
[4] M. Defoort, J. Palos, A. Kokosy, T. Floquet, W. Perruquetti and
D. Boulinguez, Experimental Motion Planning and Control for an
Autonomous Nonholonomic Mobile Robot, Int. Conf. on Robotics
and Automation, Roma, Italy, 2007.
[5] M. C De Gennaro and A. Jadbabaie, Formation control for a cooperative multi-agent system using decentralized navigation functions,
American Control Conference, pp. 14-16, 2006.
[6] D. V. Dimarogonas and K. J. Kyriakopoulos, Formation Control
and Collision Avoidance for Multi-Agent Systems and a Connection
between Formation Infeasibility and Flocking Behavior, IEEE Conf
on Decision and Control, pp. 84-89, 2005.
[7] W. Dunbar and R. M. Murray, Model predictive control of coordinated multi-vehicle formation, IEEE Conf. on Decision and Control,
2002.
[8] W. Dunbar and R. M. Murray, Distributed receding for multi-vehicle
formation stabilization, Automatica, pp. 549-558, 2006.
[9] M. Fliess, J. Levine, Ph. Martin and P. Rouchon, Flatness and defect
of nonlinear systems: introductory theory and examples, Int. J. of
Control, 61, No 6, pp 1327-1361, 1995.
[10] T. Floquet, J. P. Barbot and W. Perruquetti, Higher-order sliding
mode stabilization for a class of nonholonomic perturbed systems,
Automatica, 39, No 6, pp. 1077-1083, 2003.
[11] J. P. Hespanha and A. S. Morse, Stabilization of nonholonomic
integrators via logic-based switching, Automatica, 35, No 3, pp 385393, 1999.
[12] G. Inalhan, D. Stipanovic and C. Tomlin, Decentralized optimization,
with application to multiple aircraft coordination, IEEE Conf. on
Decision and Control, 2001.
[13] Z. P. Jiang and H. Nijmeijer, Tracking control of mobile robots: a
case study in backsteeping, Automatica, 33, No 7, pp. 1393-1399,
1997.
2157