School of Aerospace, Transport and Manufacturing: M.Sc. Thesis
School of Aerospace, Transport and Manufacturing: M.Sc. Thesis
School of Aerospace, Transport and Manufacturing: M.Sc. Thesis
Thesis
Abstract
In this thesis a global analysis of the modern air combat battlefield is made to
study the implementation of fleets of unmanned aerial vehicles acting as a swarm to
confront other aircraft. Previous approaches tackle the application of UAV Swarms to
environments less demanding than air combat as reconnaissance or search and rescue.
Hence a task allocation algorithm is introduced to this context and simulated under
the desired circumstances trying to embrace new combat tactics and procedures.
The main inspiration is the behaviour of insects such as T emnothorax Albipennis to
develop a stochastic policy based algorithm. Simulations are run to test the viability
of this proposal.
2015 - 2016 ii
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
Acknowledgements
First I would like to thank BAE Systems for the opportunity to work in such as
interesting field as the design of artificial intelligence algorithms for fleets of unmanned
vehicles. I would also like to thank my supervisor Dr. Hyo Sang Shin for his help
and supervision during my research. And to finish, I would like to remark my family
and friend’s support during my stay at Cranfield University.
2015 - 2016 iv
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
Contents
Abstract. ii
Acknowledgements. iv
1 Introduction. 1
1.1 Context. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Aims & Objectives. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.3 Contribution. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2 Literature Review. 2
2.1 Definition of Modern Air-Combat Scenario. . . . . . . . . . . . . . . 2
2.1.1 Beyond-Visual-Range Engagements. . . . . . . . . . . . . . . 2
2.1.2 Within-Visual-Range Engagements. . . . . . . . . . . . . . . 2
2.2 Modern Aircraft Evasion Tactics. . . . . . . . . . . . . . . . . . . . . 3
2.2.1 Defense of short-range, infrared guided missiles. . . . . . . . . 4
2.3 Unmanned Aerial Vehicles as Actors in Aerial Combat. . . . . . . . 6
2.3.1 High maneuvering capability. . . . . . . . . . . . . . . . . . . 6
2.3.2 Detecting Equipment. . . . . . . . . . . . . . . . . . . . . . . 8
2.3.3 How key air combat needs affect unmanned aerial vehicles
design and behaviour. . . . . . . . . . . . . . . . . . . . . . . 12
2.3.4 Unmanned Aerial Vehicle issues when considered for the mod-
ern battlefield. . . . . . . . . . . . . . . . . . . . . . . . . . . 15
2.3.5 Summary of Possible Unmanned Aircraft Candidate for an
Air-Combat Scenario. . . . . . . . . . . . . . . . . . . . . . . 16
4 Battlefield Actors. 20
4.1 Engagement Procedure. . . . . . . . . . . . . . . . . . . . . . . . . . 20
4.2 Graphical Representation. . . . . . . . . . . . . . . . . . . . . . . . . 21
6 Task Allocation. 25
6.1 Task Allocation Problem Definition. . . . . . . . . . . . . . . . . . . 25
6.2 Optimized Stochastic Policies for Task Allocation in Swarms of Robots. 25
6.2.1 Definitions and Assumptions. . . . . . . . . . . . . . . . . . . 25
6.2.2 Base model. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
6.2.3 Extended Base Model: Including Quorum Concept. . . . . . 27
6.2.4 Agent Implementation. . . . . . . . . . . . . . . . . . . . . . 28
2015 - 2016 vi
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
7 Simulation Analysis. 29
7.1 First Case: Different simulation duration. . . . . . . . . . . . . . . . 29
7.2 Second Case: Different matrix K values. . . . . . . . . . . . . . . . . 30
7.3 Third Case: Different number of swarm agents. . . . . . . . . . . . . 32
References 35
List of Figures
1 Boyd’s OODA loop. . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2 BAE Taranis, and Eurofighter Typhoon at the back. . . . . . . . . . 7
3 Eurofight Typhoon radar: Euroradar Captor-E. . . . . . . . . . . . . 8
4 Fighter F-22 Raptor and Bomber F-117 Nighthawk shape comparison. 9
5 Radio signal reflection comparative. . . . . . . . . . . . . . . . . . . 10
6 Attacker/Tactical Bomber Lockheed F-117 Nighthawk. . . . . . . . . 10
7 Tactical Bomber Northop Grumman B-2 Spirit. . . . . . . . . . . . . 10
8 Lockheed RQ-170 Sentinel. . . . . . . . . . . . . . . . . . . . . . . . 14
9 Graphic representation of the simulation after 10 seconds. . . . . . . 21
10 Graphic representation of the simulation after 100 seconds. . . . . . 21
11 Decision Making Flow Chart. . . . . . . . . . . . . . . . . . . . . . . 22
12 Potential Fields. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
13 Kinematics. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
14 Strongly connected graph. . . . . . . . . . . . . . . . . . . . . . . . . 26
15 Enemies killed with configuration 1. . . . . . . . . . . . . . . . . . . 29
16 Swarm agents survived with configuration 1. . . . . . . . . . . . . . . 30
17 Enemies killed with configuration 2. . . . . . . . . . . . . . . . . . . 31
18 Swarm agents survived with configuration 2. . . . . . . . . . . . . . . 31
19 Enemies killed with configuration 3. . . . . . . . . . . . . . . . . . . 32
20 Swarm agents survived with configuration 3. . . . . . . . . . . . . . . 33
2015 - 2016 ix
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
List of Tables
1 Aircraft comparison . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
2 Department of Defense Selected Acquisition Reports of some UAV
and manned aircraft models. . . . . . . . . . . . . . . . . . . . . . . 13
3 Agents involved in simulations. . . . . . . . . . . . . . . . . . . . . . 20
4 Agents’ representation. . . . . . . . . . . . . . . . . . . . . . . . . . . 21
5 Simulation 1. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
6 Simulation 2. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
7 Simulation 3. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
2015 - 2016 xi
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
1 Introduction.
1.1 Context.
The development of intelligent unmanned systems to be implemented in aircraft
present great potential in the future in terms of efficiency and cost reduction. Un-
manned air vehicle design tends to specialisation, this means that decision-making
and task-allocation algorithms will need to produce cooperative behaviours in order
to achieve such an improvement in performance. Therefore swarm systems present a
high robustness when confronting the losses of units because the general behaviour
to accomplish the system objective arises from a collective perspective. Moreover
the conduct of a single agent may even have no sense, but as a part of a larger group
produces the desired result.
1.3 Contribution.
This thesis proposes a behaviour model to characterise a swarm of unmanned air
vehicles with attack purposes. The core is a task allocation algorithm based in
stochastic policies. Previous models tackle problems as static scenarios where number
of tasks do not change. The author develops new procedure to affront a dynamic
situation where number of tasks changes along time. Moreover, the development
includes decision-making and basic control features for each unit. The full model
arises from the study of what are the possible needs of today’s UAVs when introduced
in air combat.
2015 - 2016 1
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
2 Literature Review.
2.1 Definition of Modern Air-Combat Scenario.
The aim of this work is to study new behaviour approaches for fleets of UAV when
facing combat scenarios. Nowadays air-to-air fight engagement procedures distinguish
between two main phases, Beyond Visual Range phase and Within Visual Range
engagements. This division of the air fight conflict in two parts attends to the different
needs of each phase. In these procedures, weapons and maneuvers differ considerably
from one to other. Furthermore traditional aircraft based combat superiority fall into
three main features that define combat development. Assuming that pilots have the
maximum level of training possible for the aircraft the are flying, these features are
weapon capability, detection measures, and aircraft maneuvering capability. These
are the main bases of the modern basic fight principles defined by CNATRA [12].
Next, the two main type of air combat engagements will be generally described
to define the context of the scenario proposed by this thesis, which include combat
assumptions and suppositions. Moreover, the comparison between characteristics of
some actual UAV models and modern fighters will lead to the definition of a possible
combat or interception scenario that may include the use of unmanned aircraft.
The process involving a long range attack with guided missiles is divided in three
parts: First step is detection using the radar. Second step is lock up step. Target
tracking is fixed over the objective allowing the last part. Now the pilot is able to
shot the projectile without need of taking care of the trajectory to the enemy aircraft.
The electronic gear over the missile in communication with the main radar of the
airplane predicts the path to follow until the collision.
Despite the obvious benefits of this way of attack in theory, tests and combat
experience have demonstrated that degrading factors as weather, aggressive maneuv-
ering, electronic countermeasures among others; cause hit probability to decrease
substantially [13]. The success values obtained from combat show that WVR shots
are much effective.
2015 - 2016 2
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
The principal danger for pilots when diminishing distance to enemies is obvious.
The less further the enemy is, more probable is that they will spot you with their
radar, or even going into their visual range.
1. Detection: The aircraft radar signature appears in the radar of the counterpart.
3. Missile launching.
The following defensive procedures, defined and well explained at [14]; try to
avoid the attack as early as possible by trying to stay ”invisible” to the enemy radar.
If detected and under missile fire, then the goal is to get away from the missile by
combining maneuvering and obscuring the electronic sensors from the missile that
are tracking the objective. In the next sections the main procedures will be explained.
Aircraft are also equipped with electronic countermeasures (ECM), that include
radar jammers. Since jammers produce a radar signal, they highlight position to
enemy radar. The purpose of this signal is to obstruct the enemy radar when locking
and tracking your aircraft. Due to this, it is usual to avoid using the jammer if
concealing position to the enemy is the priority, but prudent to begin using the
jammer once the aircraft is detected.
2015 - 2016 3
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
radar signal reflected off the target aircraft produced by the launch platform. Missiles
can be also actively guided, using an on-board radar.
Radar warning receivers (RWR), are the main tool to detect radar-guided missiles.
They detect and classify incoming radar signals. When a target of interest is been
tracked, it gets marked the counter-side radar with a pulse signal. This is detected by
RWR informing the pilot of the signal origin distance. Then, if the enemy launches a
passive radar-guided missile, the incoming signal will switch from pulse to continuous
one. This change will be noticed by the RWR informing the pilot of a missile coming,
but it will not give any position since it is the enemy and not the missile the producer
of the signal.
For active radar guided missiles, the missile’s radar itself is detected by the RWR.
The RWR recognizes the radar’s waveform as that of a known kind of enemy missile,
and sounds the missile launch warning. In this case, the RWR can also plot the
estimated location of the missile.
2015 - 2016 4
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
infrared sensors, and/or reducing his own heat signature by powering back his engines.
The jammer is not used during the end-game. The main reason is that the vast
majority of modern radar-guided missiles have a home-on-jam characteristic. If
the missile perceives that its own radar signal is being jammed, it will turn off the
jammed signal; only needing to home in the jammer’s radar signal to pursuit the
target. In this situation, the jammer acts as a guide for the missile.
Guns defense.
Eventually, if the contenders run out of missiles a dogfight may ensue. When an air-
craft becomes defensive in a dogfight, there are two options: transform the defensive
engagement into an offensive one, or try to run away.
Last option available for the pilot is to convert the engagement from defensive to
offensive. In a defensive situation, defensive aircraft nose is facing away the enemy
while and enemy’s nose is directly looking at the counterpart aircraft. Reach an
offensive position involves ”gaining angles” on the enemy, which requires out-turning
him. If the aircraft has better turn performance than the enemy, then it should be
possible to slowly catch up the offensive advantage. It is supposed that the enemy
will try to deny this opportunity by performing the necessarily maneuvers.
2015 - 2016 5
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
2015 - 2016 6
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
When dealing with modern fighters, to decide how to gain the needed nose pointing
capability to outperform adversary’s OODA loop and achieve combat advantage,
three solutions are on the table:
• Thrust Vectoring.
Nowadays there are still not UAV models in service specially developed for combat.
All the principal military aircraft producers are immerse in a technical race to satisfy
this need. Good examples of this programs are the BAE systems Taranis, figure
2, or the Dassault nEUROn. The main mission of these programmes are not the
air superiority, rather their principal goal goes from tactical attacks based in large
range and stealth to surveillance and reconnaissance. The problem of UAVs when
confronting air combat is that despite the advances in control techniques and flight
designs, UAV maneuvering performance is far from traditional fighters.
At table 1, information obtained from [15, 16, 17, 18] Different kind of UAV
models are compared with a modern fighter model and the future substitute of it. It
is almost impossible for civil usage to obtain flight envelopes of this aircraft, since
the are designed for military purpose. Comparing that kind of documents would be
the optimal way to establish a rigorous technical approach. But for the scope of this
thesis the information gathered is enough to conform the big picture.
The large difference in thrust-to-weight ratio between UAVs and fighter is obvious.
Modern air superiority aircraft are designed considering a value of this feature over
one (around 1.35 for the F35A, and 1.6 for the F16C/D). Values below one for
the unmanned vehicles can give a general perspective of their considerable less
maneuvering capability, even if the wing load, aerodynamic geometries and other
aspects are not considered.
2015 - 2016 7
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
Aircraft radars usually have two programs: searching and tracking. At search
mode, the radar produces signals in a zig-zag pattern. When these signals are
reflected by possible targets, an indication appears on the radar display. In this
mode, the possible targets are not tracked down. When the pilot desires to lock up
an aircraft, the radar will switch to track programme. Then the radar aims more
energy on a particular target. Because of it, the pilot obtains more information about
the aimed aircraft, but at reducing possible information about other targets in the
area.
Radar-guided missiles are divided in two types: Active and passive. Active radar
missiles have their own on-board radar. It is one-way signal radar, hence the missile
needs assistance from the launching aircraft until it is close enough to actively lock
the target by itself. However they can be fired with no radar lock. On the other
hand, passive radar missiles need the aircraft signal all time since they know where
is the target by following the reflected signal over it.
2015 - 2016 8
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
Even if an aircraft can only scan out a region corresponding to a cone in front with
the apex situated over the radar, it can detect incoming signal from any direction. A
digital processor listens to distinctive signals from the sources around the aircraft,
and display their azimuth angle.
Figure 4: Fighter F-22 Raptor and Bomber F-117 Nighthawk shape compar-
ison.
High Radar cross-section reductions occur when the most part of the the incoming
radar signals are absorbed instead of been reflected. The best way to reflect them as
low as possible is avoiding conforming orthogonal surfaces. This usually happens at
the tail in traditional aircraft configurations where the vertical and horizontal parts
are set at right angles,figure 5. Stealth designs as the tactical bomber Lockheed
F117 Nighthawk tend to tilt the tail surfaces to avoid as much as possible corner
reflections, figures 6 and 4. Most radical designs, as the Northrop Grumman B-2
Spirit or the BAE systems Taranis, have no tail, figure 7. These kind of shapes
present excellent low drag properties and highly reduce radar aircraft profile. They
try to resemble a so-called flat plate, that is the most efficient profile in terms of
RCS reduction as there would be no angles to reflect radar signals.
2015 - 2016 9
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
2015 - 2016 10
2015 - 2016
Manufacturer General Atomics Northrop Grumman BAE Systems Lockheed Martin Lockheed Martin
Parameters Units MQ-9 Reaper RQ-4 Global Hawk Taranis F35A CTOL F16C/D 50-52 block
Thrust [lbf] 900 [hp] 7,600 6,480 43,000 27,000
Weight [kg] 2,223 14,628 (MTOW) 8,000 (MTOW) 31,750 (MTOW) 16,875 (MOTW)
Wingspan [m] 20.1 39.8 9.94 10.7 9.8
Lenght [m] 11 14.5 11.35 15.7 14.8
Height [m] 3.8 4.7 4 4.38 4.8
Range [km] 1,852 22,780 - 2,200 3,200
Speed [kn] 200 (cruise) 310 High Subsonic Mach 1.6 Mach 2
Ceiling [m] 15,240 18,288 - 15,000 15,240
Payload [kg] 1,701 1,360 - 8,160 7,800
M.Sc. Autonomous Vehicle Control & Dynamics
11
Cranfield University
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
2.3.3 How key air combat needs affect unmanned aerial vehicles
design and behaviour.
Again from the gathered information in table 1, some important facts arise. Actual
attack UAV models as the MQ-9 Reaper are much cheaper in flying costs than
multi-role fighters that develop the same kind of missions in certain scenarios as
air-to-ground strikes. At the moment, the unit cost comparison will be falling from
one side or the other depending on the fighter model studied, type of missions
developed, logistic and personnel cost related to the deployment of these units... For
example, the Lockheed F16 unit cost is much cheaper nowadays as the program is in
production and improvement since 1970. Furthermore it is the most exported fighter
model, (producer statement). Hence the production costs are extensively low.
But, USAF is actually immersed in the substitution of the F16 by the F35, which
is considerably more expensive at all levels nowadays. The reduction of costs is
something that is hardly predictable, more with the appearance of UAVs that are
taking part in a more efficient way of the traditional missions these kind of aircraft
develop.
The principal advantages concerned to the use of unmanned aerial vehicles in the
battlefield are the following:
Costs.
UAV may be much cheaper than fighters. This obviously depends in the model
that are compared, but usually the lack of a human on board makes the design to
avoid certain elements that increases prices. Cockpit, pilot expenses as salary or
formation. These elements are not needed for UAVs. Also as they are remotely
controlled, operators can take turns permitting the extension of mission length. This
is remarkable when watching to the RQ-4 Global Hawk range, table 1.
Probably the most significant factor in UAV programs’ expansion is the cost
advantage this technology present against traditional aircraft and related operations.
UAVs are unmanned, but the operation that involve them still need operators, main-
tenance teams, and large networks of equipment and personnel to guarantee the
intelligence and legal procedures to develop missions. Opponents to UAV use, argue
that this need of crews and the significant larger accident ratio, make unmanned
air vehicles more expensive to governments in the long-term than conventional air
combat gear.
From a global perspective, UAVs are marginally cheaper to buy and operate
than manned aircraft. But there is something that often goes unnoticed, this is
the operational advantage over cost effectiveness. The use of unmanned vehicles
in sensitive foreign operations outweigh the possible menace produced by higher
accident ratios and growing counter-reaction at target environments.In table 2 a
general comparison of some UAV models and manned aircraft purchases obtained
from Selected Acquisition Reports of the USA Department of Defense, shows that
UAV programs are usually less expensive to buy and operate than traditional fighter
aircraft.
2015 - 2016 12
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
(Data is for DoD assets for use in overt missions only. Data current as of December 2011 except for F-15 and F-22 SARs,which
are current as of December 2010.
[1] APUC is the Average Procurement Unit Cost, denoted in millions of dollars.
[2] Cost per Aircraft is the APUC divided by the number of aircraft (if multiple) comprising one unit, denoted in millions
of dollars.
[3] O&S Average Annual Cost per Aircraft is the cost of Operations and Support per Unit, including Unit-level Manpower,
Unit Operations, Maintenance, Sustaining Support, Continuing System Improvements, Indirect Support, and Other, denoted
here in thousands of dollars and adjusted to reflect costs on a per-aircraft basis.
[4] O&S per Aircraft is the preceding value divided by the number of aircraft (if multiple) comprising one unit, denoted in
thousands of dollars.
[5] Base year is the initial year in which acquisition was valued. Variations due to different cost base years should be taken
into consideration.
[6] Cost obtained from the MQ-9 Reaper SAR. The MQ-1 Predator is antecedent to the MQ-9 Reaper.
[7] Cost obtained from F-22 SAR, updated to base year 2010, and is provided per squadron of 18 aircraft. This value
represents the per-aircraft cost, obtained by dividing the total annual cost per squadron of $141.5 million by 18 aircraft,
denoted in thousands of dollars.
[8] Cost obtained from the F-35 SAR. The F-16C/D is antecedent to the F-35.
[9] Cost is provided per squadron of 18 aircraft. This value represents the per-aircraft cost, obtained by dividing the total
annual cost per squadron of $202.6 million by 18 aircraft, denoted in thousands of dollars.
[10] Cost calculated based on estimated average of 300 flight hours per year, denoted in thousands of dollars.
It is important to remark that F-16 and F-15 programs are no longer able to
be purchased. Hence, acquisition cost is irrelevant compared to UAVs. Anyway,
operating costs are still important since they are the main USAF fighter and striker
models at combat zones nowadays.
When discussing staff requirements, traditional aircraft need one pilot and a
weapon operator, or even just a pilot depending on the model. Normal estimations
about UAVs show that the requirements for UAVs like the MQ-1 Predator are around
80 persons to operate one unit from beginning to end.
It is a fact that actually UAVs tend more to crash than fighters. USA Congres-
sional Budget Office states that MQ-9 Predator mishap rate has improved from 28
crashes per 100,000 flying hours to 7.6 crashes. But these values are still far from
the ones presented by the F-15. This fighter model had in 2011 a mishap rate of
2.36 mishaps per 100,000 hours of flight. USAF counts total losses from Predator
2015 - 2016 13
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
crashes in 2011 of $48 million in total, and happening mainly overseas. This amount
is less than the one in the same period for F-16, that was $ 57.3 millions, most of
the crashes at home.
While monetary losses that arise from crashes might be just a small part of the
whole budget, security issues are critical. In 2011, a mishap of a Lockheed RQ-170
Sentinel UAV, figure 8; in Iran showed that this kind of accidents risk to compromise
missions and security since the enemy is able to catch sensitive technological material
that later can be sold to hostile countries.
Thanks to its nature, UAVs are able to go into dangerous regions with no security
issues for the pilots’ lives. Moreover they are not constrained by operators’ shift
schedules or pilot endurance. They gather more intelligence and reconnaissance
data than any manned aircraft, and even attack selected targets. It is really hard
to assess these advantages, or the the negative reaction from counterpart civilians
and foreign countries when UAVs are deployed. There are still no over-all objective
measurements to evaluate strategic advantages and possible inconveniences to the
use of aerial unmanned vehicles in fighters typical missions. This type of information
is the most lacking and where the efforts should go on to better implement UAV
programs.
2015 - 2016 14
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
2.3.4 Unmanned Aerial Vehicle issues when considered for the mod-
ern battlefield.
The advantages of introducing unmanned aerial vehicles to missions assigned tradi-
tionally to manned aircraft might be at same time disadvantages. The reason is the
actual state of the technology that is supposed to achieve the needs of today’s air
combat, air-ground attack, reconnaissance, etc.
Actual UAV control loops does not match human reflex speed reaction and range
of possibilities. The important of this point in combat air is crucial as fast maneuver-
ing and counter act are the key factors to achieve advantage when considering ODDA
loop. Improving this problem with problem with velocity may take several time to be
solved. But nevertheless unmanned aerial vehicles most potential use is to not to be
used in a deterministic way as manned aircraft but more like swarming. This means
sacrificing individual performance but improving the over all result by putting a large
number of individuals working together. Anyway, artificial intelligence algorithms
to achieve swarm behaviours are still in development. Efficient swarming will be
translated in an algorithm that guides the group not the individual, letting each unit
to decide by itself, opposite to central command.
Furthermore, the way actual UAVs are controlled, not fully automated but re-
motely piloted; means a degradation of OODA loop. The lacking of pilot’s situational
2015 - 2016 15
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
awareness makes OODA cycle slower and less efficient. Off-board operators are not
prepared to react to unexpected circumstances as pilots. And there is another issue,
the possibility of communication problems.
• Stealth.
• Detection range.
• Performance.
The proposed model would be somehow something similar to the BAE Systems
Taranis. The size and shape needed should be the small as possible to avoid detection.
Due to the inherent better performance of a manned aircraft because of the faster
OODA loop of the pilot compared to a modern autonomous vehicle, and the much
better dynamics of a fighter compared to UAVs; the best engagement procedure for
an unmanned air vehicle to attack an aircraft would consist in trying to approach
the target as much as possible without been detected and launch a missile or look
for a direct hit with the own vehicle.
2015 - 2016 16
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
Actual UAV models are remote controlled manually or semi automatized by pilots.
Pathfinder Raven, Reaper or Predator are examples of this form of control. They are
similar in size to standard fighters i.e. Eurofighter Typhoon, and cost about ten times
less. But they still act somehow like manned airplanes when missions are defined.
With the appearance of even smaller and cheaper UAVs, new mission approaches
can be designed. Complete autonomous fleets capable of develop missions with no
continued human assistance arise. Hence decision-making responsibility should be
part of each agent composing the system.
This kind of systems should complete their goals, been able at the same time to
recover from faults, and responding predictably to human supervision at any time.
Large fleets of agents make centralized task management and control approaches not
working. The main issue is communication networks. Such a large number of agents
transmitting and receiving information simultaneously would make a centralised
systems very unstable and computing demanding, therefore this approach is not
acceptable.
At this point is where a decentralized design arises. The key is to obtain a desired
group behaviour focused on achieving a common goal, by defining simple individuals
behaviours. The individual stimulus should be just local information like sensor data
and communication with other agents around. Thus the three main domains that
challenge the development of this kind of systems are:
• Task allocation.
• Communication networks.
The simulations produced by this thesis will be mainly focused in the task allocation
aspect, studying approaches that tackle large number of agents and trying to apply
them to the scenario of our interest: Air battlefields. This does not mean that the
other fields will be forgotten, just that the solutions applied will be already proved
methods or model simplifications.
2015 - 2016 17
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
Optimal coordination among the UAV swarm is the key of mission success. The
main reason of this fact is that the distributed nature of the agents. Assigning
them properly lead to the maximum efficiency of the system. Hence, to maintain
flexibility when communications may not be available, the decision-making should be
distributed. The allocation problem obtained from this system configuration becomes
complex, needing trade-offs in the structure of the resolution algorithms.When design-
ing algorithms to solve the allocation problem, compromise between two goals is
needed, the optimality of the solution and the time to obtain that solution. Therefore
task allocation problems are most part NP-Hard, demanding exponential time to
obtain a satisfactory solution.
In the last decades, two main approaches to design decentralized task allocation
frameworks have been developed. One is CBBA algorithm [3], and the other is SCA
algorithm [4]. CBBA framework is based in auction-based decision strategy as a way
for decentralized task allocation, and consensus routine based on local communication
is used as mechanism to resolve conflicts achieving agreement on the winning bid
values. SCA is defined by enabling improvement of the global cost of task alloca-
tions obtained from fast greedy algorithms. It is a Markov Chain Monte Carlo method.
This approaches work well when the number of agents conforming the swarm
is not very large, around 10 or 20 units. But when this number increases, the
communication needs for auction algorithms become unaffordable for networks that
large in size. Furthermore, not only the size is an issue, in real scenarios losses in
communication appear due to the topology of the terrain and other factors. Hence,
new approaches should be designed. In this situation, with groups of hundreds of
agents and limited communications; is when new approaches inspired by insects
behaviours arise. Self-organized behaviour of social insects like ants, can be translated
as agents with limited on-board resources, scalable in number. They conform systems
that are considerably robust to changes in population, and scalable in units and tasks
[5]. This framework will be explained in next section of this thesis, as background
for the model that will be developed.
2015 - 2016 18
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
2015 - 2016 19
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
4 Battlefield Actors.
The strategy adopted is inspired by the UAV model proposed from the author’s
interpretation of the characteristics of a modern air combat scenario. To simplify
the case, the air combat studied would be between a swarm of 30 agents against 10
enemy units. Due to the limitations of the scope of the thesis, the most important
characteristic of the swarm to test is the task allocation algorithm. Hence the enemy
considered would be other unmanned aircraft, with an speed performance and man-
euver capability similar to the one of the swarm agents. Then, to win the battle the
only advantage presented by the swarm is the obvious one, the larger number of units.
The aim is to present UAV with the least quantity of on-board resources. To
conform a system that would be high scalable in number of units forming it and
enemies to attack. It should be robust to changes in this numbers. The reason
is that at combat there would be obviously losses that change system size. These
characteristics are related to decentralized systems. In them, agents change between
simple behaviours modified by external interactions with the rest of elements. To
achieve it, a distributed algorithm using stochastic rules to dispense targets between
all the swarm will be applied.
The enemy units are supposed to have a defense mechanism that consists in a
proximity detection system with a radius of action. When a swarm agent goes inside
this radius, the enemy UAV will fly away from it trying to avoid collision or attack
by increasing as much as possible the distance between the two actor involved in the
engagement. This mechanism is based in potential field algorithm explained later on.
2015 - 2016 20
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
The figures 9 and 10, represents the simulation for 10 and 100 seconds. It is observable
that two enemies have been killed, and around 18 swarm agent have failed during
their attack. In the left bottom corner, the green circle represents the base where
the swarm starts.
2015 - 2016 21
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
Due to the kind of attacking system been designed, there is almost no communic-
ations between agents, and with the exterior of the cluster that they form. So the
decision making scheme is reduced to just the agent and the enemy integrity. Fur-
thermore, in the simulations this algorithm is implemented with the task allocation
one all together.
The algorithm goes as follows. First after initialisation the agent receives from
the reconnaissance UAV the number of units, their initial coordinates, and other
parameters affecting the task allocation algorithm that will be explained later on.
Next the agent proceeds to run the task allocation procedure to attack a target.
When the target is known the swarm unit starts the chase. When the distance is
short enough to detect the state of the enemy, the agent will evaluate the target
integrity. If it is operative, the swarm member will proceed to fly against the enemy
trying to hit it. This is a typical suicide behaviour. If the enemy is already down
because of others’ attack, then the agent runs one more time the task allocation
algorithm to select a target, and continues as before. It is important to note that
if the attack is not successful, it is considered that the agent will be killed as its
attack was suicidal. This means that the swarm UAV has just one opportunity to hit
the target. The probability of hit success is a parameter that will be tested during
simulations.
2015 - 2016 22
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
5.2 Navigation.
An air combat scenario includes two groups of agents fighting each other in the
air. These teams go from one versus one battles to several against several unit
engagements. Hence, because of obvious large space of the battle field and the
relative small size of the units moving in it, the navigation algorithm to choose is easy.
Bug algorithms are discarded because they usually only work in environments with
static obstacles. Exact algorithms like Voronoi diagrams, obtain a mathematically
determined way to get the answer. But they need sensor precision of the obstacles.
Therefore they are above all applied to avoid static ones. The last ones, algorithms
based on grid description are the best option for dynamic obstacles as the need of
sensor precision is not as critical as at other methods.
Hence an algorithm based in potential fields is applied in this thesis for the
agents’ movement. Concept of potential fields are part of a larger family of grid
algorithms. They emulate the behaviour of magnetic fields spreading. When applying
this concept to robot navigation, two main objects arise in the environment: Goal
points and obstacles. The goal point is what attracts the agent to its position. In
counterpart, The obstacles repulse the agent from them.
(q − qgoal )
Fatt (q) = −∆Uatt (q) = ξ
kq − qgoal k
1 1 1 q − qobst
Frep (q) = −∆Urep (q) = ν −
ρ(q, qobst ) ρ0 ρ(q)2 ρ(q)
ξ and ν are the attractive and repulsive factors, ρ(q, qobs ) is the distance from the
agent to the obstacle, and ρ0 is the maximum distance from the obstacle repulsive
field has effect.
2015 - 2016 23
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
Therefore defining the battlefield as a two-dimensional space the total force is:
N
X
F (x, y) = Ftarget (x, y) + Fagenti (x, y);
i
In our case, as the obstacles sizes are relatively small compared to the rest of size
bodies, and all of them are moving, there is no possibility to fall into local minimum
issues that provoke the agents to get stuck between obstacles and not move to the
goal position.
5.3 Control.
The control applied to the swarm agents is a basic simplification of the a rigid body
dynamics. At this simulation, the units have been treated as particles, hence no
rotation has been taken into account. Then, the resulting movement of each element
is just translational. The cause of the movement is the total force produced by
attractive and repulsive fields over each UAV. The kinematics are:
φi = arctan(Fy /Fx )
xi = xi−1 + v × dt × cos φ
yi = yi−1 + v × dt × sin φ
The subscript i represents the iteration. dt is the time step between iterations. Hence
the displacement of each element is the previous position, plus the velocity times the
time step in the direction of the force that produces the movement.
2015 - 2016 24
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
6 Task Allocation.
The problem of assigning subsets of agents from the total swarm in an optimal
proportion to each task is defined as an instance of a single-task, multi-robot task
problem (ST-MR). Also known as the coalition formation problem when conformed
by software agents. Some approaches as market-based ones need large agent commu-
nication and cooperation. Therefore this approaches are often costly. Approaches
that arise from insects’ behaviour, swarms of ants, etc; based in optimized stochastic
policies, that rely in little to no communication between agents is the task allocation
algorithm that will be implemented in this thesis to tackle the issue of assigning
targets to agents.
The constraints between tasks are defined by a directed graph G = (V, E), where
V, the set of M vertices, corresponds to enemies {1, ..., M }, and E, the set of NE
edges, is the corresponding transitions between tasks. Enemies i and j are considered
adjacent and said that i ∼ j, if an agent assigned to enemy i can switch to j. This
relation is defined as (i, j) ∈ V × V, with the set E = {(i, j) ∈ V × V|i ∼ j}. In the
thesis scenario, G model represents the enemy inter-connectivity: V is the set of
M targets, and each edge (i, j) is a one-way path that agents can use from i to j.
The P existing routes from i to j are represented by the various edges (i, j)m where
m = 1, ..., P .
2015 - 2016 25
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
buildings, etc. This permits the swarm elements to travel directly from one target to
another, instead of going through a sequence of intermediate steps before achieving
the desired enemy.
Where i = 1, ..., M . Each transition from i to j has a fraction of agents per time unit
that are going from enemy i to j. Then the model defined by equation 1 specifies
the switch rate of swarm fraction xi (t) as the difference between the income and
outcome of agents at target i. It shows this effect by representing elements as they
change immediately from one enemy to other, not considering the time that each
element needs to transit from one to other task. Because of the constant value of
kij , some agents still travel between tasks when xd is achieved. This feature improve
robustness of the system [9].
2015 - 2016 26
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
dx
= Kx (2)
dt
Where K is a matrix with the following properties:
K ∈ RM ×M
KT 1 = 0
Kij ≤ 0 ∀(i, j) ∈ E
−kij ,
if i 6= j, (i, j) ∈ E
Kij = 0, if i =6 j, (i, j) 6∈ E (3)
P
(i,l)∈E kil , if i = j.
dxi (t) X X
= kji xj (t) − φij (t) (4)
dt
∀|(j,i)∈E ∀|(i,j)∈E
Where φij (t) is the flow of swarm units travelling from target i to j. If quorum of
enemy i is exceeded, transition rate from i to adjacent objectives j can be set to a
multiple of the existing transition rate αkij , with α > 0 satisfying max αkij < min
max . Resulting in:
kij
This analytic switching function acts like a threshold method like the one described
by [10]. In [8], it is demonstrated that the introduction of the quorum concept to
the base model speeds up the allocation process and allows to eliminate wireless
communication procedure that was implemented in [7].
In the simulation, as the swarm units attack as suicide projectiles trying to hit
the assigned target; The quorum concept in this model is equivalent to the number
of impacts each enemy can support.
2015 - 2016 27
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
end
if current time t is less or equal to the minimum ts,j then
Attack the current target
else
Change attack to enemy with minimum ts,j at time t = min ts,j
end
Algorithm 1: Original T.A. algorithm at agent level.
To apply the quorum, agents should be able to estimate the quorum level corres-
ponding to each enemy. The transition of each unit is synchronised with its internal
clock. Moreover as the objective is to adapt this algorithm to our air combat scenario,
the swarm agents will die as well as the enemies. This means that the algorithm will
suffer some modifications to introduce agents’ integrity concept.
2015 - 2016 28
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
7 Simulation Analysis.
7.1 First Case: Different simulation duration.
The killing speed will be measured in this simulation, watching how many targets
have been eliminated in simulations with different duration. Each simulation is
evaluated 5 times. The other parameters like the number of swarm agents, and the
matrix K are constant.
Table 5: Simulation 1.
Figure 15 presents the average value of enemies killed, for four different cases,
varying in each one of them the time duration of the simulation. The results show
that the the killing rate seems to not be affected by the time duration, as it presents
high diversity in results with no correlation related with time. This theory of non
correlation is reinforced when figure 16 is observed. The number of friendly units
dead is similar in average and dispersion values no matter the case.
2015 - 2016 29
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
The results show that that a priori the system is fast enough to kill but the
effectiveness with this configuration is not very high. No more than 5 kills in the best
case no matter the duration of the simulation. This fact shows that the intervals of
time that are considered are not much relevant with the actual configuration of K
values and number of swarm agents. It may be possible that as the algorithm uses
a randomisation function, the uncertainty that arises does not permit to get clear
results with the actual number of simulations (5 times each). A possible solution
can be to increase this number.
Table 6: Simulation 2.
2015 - 2016 30
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
In this experiment, it is clear the correlation between the matrix K and the killing
rate. The cases with K values different from the design value present almost no kills.
The survivability of the swarm is enhanced when K is the design one. Other values
increase the elements dying.
2015 - 2016 31
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
Table 7: Simulation 3.
In figure 19 the killing rate grows as the number of elements conforming the
swarm scales up. The deviation for each case is variant, hence the robustness is still
affected. For example, for a agent number of 40 units, the killing rate is averaged 7
with a deviation of 1 unit. Then, for 60 units. the average value changes to 6, with
one simulation where 9 enemy units died.
2015 - 2016 32
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
2015 - 2016 33
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
• The research made lead to define the characteristics and features to apply in an
unmanned aerial vehicle that will be able to confront other aircraft in the air
battlefield. Moreover, a decision making and a task allocation algorithm were
implemented inspired by swarm behaviours coming from insects’ interactions.
• The results the come from this work, although initially promising it is observed
that further work is needed in optimization of the matrix K to find values that
will boost performance. It is also needed to review the algorithm to make it
more robust against the changes that arise in an air engagement.
• Further improvement of the model introduced in this thesis will make it more
realistic. More advanced dynamics will help to get simulations and results that
will be more reliable. More capabilities to the agents or enemies can be also
implanted, like missiles launch, etc. Furthermore other kind of enemies as SAM
sites can be added to make the scenario even more challenging.
2015 - 2016 34
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
References
[1] N. Correll, D. Rus. Architectures and control of networked robotic systems.
Handbook of Collective Robotics, pp. 81-104, Pan Stanford, Singapore, 2013.
[3] Han-Lim Choi, Luc Brunet, and Jonathan P. How. Consensus-Based De-
centralized Auctions for Robust Task Allocation. IEEE Transactions on Robotics,
Vol. 25, No. 4, 2009.
[8] M. Ani Hsieh, . Halsz, S. Berman, and V. Kumar. Biologically inspired re-
distribution of a swarm of robots among multiple sites. Springer Science+Business
Media, LLC 2008.
[11] Franks, N., Pratt, S. C., Britton, N. F., Mallon, E. B., and Sump-
ter, D. T. Information flow, opinion- polling and collective intelligence in
house-hunting social insects. Philosophical Transactions B: Bio- logical Sciences,
357(1429), 15671584. 2002.
[12] Department of the NAVY, Chief of Naval Air Training Flight Train-
ing Instruction: Basic Fighter Maneuvering & Section Engaged Maneuvering.
CNATRA P-1289, N715. January 2016.
[13] Lt Col Patrick Higby Promise and Reality: Beyond Visual Range (BVR)
Air-To-Air Combat. Air War College (AWC) Electives Program. Air Power
Theory, Doctrine, and Strategy: 1945-Present. Maxwell AFB, AL. March 2005.
2015 - 2016 35
M.Sc. Autonomous Vehicle Control & Dynamics Cranfield University
[14] Robert L. Shaw Fighter Combat: Tactics & Maneuvering. Naval Institute
Press. Annapolis, Maryland. 1985.
[15] www.UAVglobal.com.
[17] Lockheed Martin F-35 Lightning II Program Status and Fast Facts 4Q 2015.
2016.
[18] U.S. Air Force F-35A Lightning II, RQ-4 Global Hawk, F-16 Fighting Falcon
& MQ-9 Reaper fact sheets. Air Combat Command Public Affairs Office. April
2014.
2015 - 2016 36