1. Introduction
With the rapid development of the social economy and urbanization, we are now facing increasing traffic congestion and air pollution problems, which have serious effects on the sustainable development of big cities. To tackle these problems, more and more cities have implemented traffic restriction measures, such as “tail number restriction methods” that limit car owners to driving only on alternate days [
1] and truck restriction policies that restrict freight trucks from entering into specified zones during specified time periods [
2]. These have effectively alleviated the pressures of congestion and pollution; however, inevitably, these measures have significantly affected the business of the logistics industry. In particular, the express delivery sector, which typically relies on the use of trucks to distribute a large number of parcels to customers (the majority of whom reside in urban traffic-restricted zones) has been affected.
In recent years, drones, or unmanned aerial vehicles (UAVs), have shown great potential in last-mile delivery, as they can reduce operational costs, air pollution, and congestion [
3]. During the COVID-19 pandemic, drones appeared as an interesting solution to contactless delivery, especially for urgently needed medical supplies [
4,
5,
6,
7]. For instance, the SF express used cargo drones to transport tons of medical materials to Chinese cities, such as Wuhan and Ordos, when they were shut down in the spring of 2020 due to the outbreak of the pandemic [
8]. Many food service companies also employed drones to deliver food to customers without human-to-human contact [
9]. Because their advantages include high speed, flexibility, and environmental friendliness, drones can be considered an effective method for circumventing ground-based traffic restrictions in express delivery.
The main drawbacks of drones in delivery are the limited payload capacities and flying ranges, which prevent drones from independently performing larger-scale delivery tasks. In contrast, trucks have much larger payload capacities and mileages. Therefore, trucks and drones can cooperate to achieve complementary advantages by using trucks as moving depots from which drones can repeatedly take parcels and swap/recharge batteries to surmount their limitations [
10,
11,
12].
In this paper, we study the problem of using a truck and a drone to cooperatively deliver parcels to a number of customers in a restricted traffic zone. Initially, the truck carries all of the parcels, the drone, and the replaceable batteries for the drone to a location on the outer boundary of the restricted zone. The truck cannot enter the restricted zone. The drone is responsible for delivering the parcels to the customers. Due to the payload and range limitations, the drone has to repeatedly return back to the truck to take parcels and, if needed, swap batteries; meanwhile, the truck can move along the boundary of the restricted zone to improve the delivery efficiency of the drone, as illustrated in
Figure 1. The problem is to determine the outer path of the truck and the inner path of the drone, such that the delivery tasks can be completed as early as possible. To efficiently solve the problem, we propose a hybrid optimization algorithm, which adapts a main population-based metaheuristic to optimize the inner path (delivery sequence) of the drone and employs a sub-procedure to determine the truck–drone intersections based on convex relaxation. We conduct experiments on a set of test instances, the results of which demonstrate that the proposed algorithm exhibits a competitive performance compared to other selected popular optimization algorithms. The main contribution of this paper can be summarized as follows:
We present the problem of cooperative truck–drone path optimization for delivering parcels to customers in restricted traffic zones.
We propose a hybrid metaheuristic and convex relaxation optimization algorithm to efficiently solve the problem.
We validate the effectiveness and efficiency of the proposed method on a variety of test instances.
Figure 1.
Illustration of the cooperative truck–drone delivery.
Figure 1.
Illustration of the cooperative truck–drone delivery.
In the remainder of this paper, we will review the related works in
Section 2, formulate the problem of cooperative truck–drone delivery path optimization in
Section 3, propose the hybrid optimization algorithm in
Section 4, and present the experimental results in
Section 5. Finally, we conclude with discussions in
Section 6.
2. Related Work
The application of drones in last-mile logistics has been practiced by many delivery companies. Amazon introduced its Prime Air drone delivery program in 2013 and conducted the first commercial drone delivery three years later [
13]. Additionally, in 2016, DHL began to use a drone system to deliver consumer goods in the Bavarian community of Reit im Winkl [
14], while UPS cooperated with Zipline (a UAV manufacturer) to deliver blood for lifesaving transfusions in Rwanda [
15]. In China, SF obtained the first operating license for commercial drone deliveries in 2018 and has developed a number of UAV models to tackle issues related to the end delivery of special logistics operations, such as ‘local specialty economy’, medical cold-chain logistics, and emergency aid distribution [
16]. In recent years, many other companies, including Google, FedEx, Russian Post, DPD, AMA, Alibaba, etc., have also reported the use of drones during delivery [
17,
18].
Due to the limitations of load capacity and the travel range of UAVs, a typical solution is to use drones in combination with ground vehicles (trucks), which has attracted recent attention from both the industry and academia. Based on the roles of drones and trucks in delivery, the research works can be divided into three classes. In the first class, drones and trucks have roughly equally important roles. Murray and Chu [
19] presented mixed integer linear programming formulations for two problems, named the flying sidekick traveling salesman problem (FSTSP) and the parallel drone scheduling TSP (PDSTSP); they proposed two simple heuristic algorithms to solve problems of small and medium sizes. Murray and Raj [
20] extended the FSTSP to deal with multiple drones, and they proposed a heuristic method that consists of solving a sequence of three subproblems. Ham [
21] extended the PDSTSP to incorporate multiple trucks, drones, and hubs, where drones can also perform pick-up tasks; they modeled the problem as an unrelated parallel machine scheduling and employed constraint programming to solve it. Agatz et al. [
22] proposed another formulation called the TSP with drone (TSP-D); they developed several route-first-cluster-second heuristics based on local search and dynamic programming. Ulmer and Thomas [
23] studied a same-day delivery problem in which either a drone or a truck could be used to meet the customer demand, and dynamic programming is employed to decide the acceptance or rejection of randomly arriving customer orders. Wu et al. [
24] considered a collaborative truck–drone routing problem for contactless parcel delivery in epidemic areas, in which each customer needs to be served exactly once by either a truck or a drone; they proposed an improved variable neighborhood descent combined with simulated annealing, tabu search, and K-means clustering to efficiently solve the problem. Zheng et al. [
25] studied the problem of cooperatively using drones and police cars to search for and catch escaped criminals; they proposed an evolutionary algorithm for routing the drones and cars to minimize the expected time of capture. The method has been applied to a similar problem of the cooperation of drones and ground working units [
26,
27].
In the second class, drones play the primary role while trucks act as supporting units; e.g., trucks take parcels to a hub from which drones deliver parcels to customers. Carlsson and Song [
28] studied a delivery system in which a drone provides delivery service to customers while making return trips to a truck that is itself moving. Wang and Sheu [
29] proposed the VRP with drones (VRPD) with service hubs to which trucks supply items and from which drones deliver items, and the drones can land only on the hubs; they formulated the problem as mixed integer programming and developed a branch-and-price algorithm for the problem. Luo et al. [
30] formulated a two-echelon cooperated routing problem for a truck and a drone that launches and lands on the truck; they proposed two heuristics: the first constructs a complete tour and then splits it by truck routes, while the second constructs the truck tour and assigns UAV flights to it. Karak and Abdelghany [
31] presented a hybrid vehicle–drone routing problem for pick-up and delivery services, in which vehicles are used only as mobile depots for the drones; they proposed an extended Clarke and Wright algorithm to the problem. Salama and Srinivas [
12] studied another problem of truck–drone cooperation that allows the truck to stop at non-customer locations for drone launch and recovery; they proposed a hybrid simulated annealing and variable neighborhood search algorithm for the problem. Bányai [
11] presented four models, including first-mile/last-mile delivery by e-trucks, first-mile delivery by drones from e-trucks, last-mile delivery by e-trucks, and integrated first-mile/last-mile delivery by drones from e-trucks; they conducted a numerical analysis to show that the last model could lead to a significant reduction in energy consumption and virtual greenhouse gas emissions.
In the third class, trucks play the primary role; i.e., their routes have priorities that the drones have to follow. Savuran and Karakaya [
32] defined a mobile depot VRP which routes a drone deployed on a mobile carrier to visit fixed targets; they proposed a genetic algorithm that has been adapted to satisfy the constraints of depot mobility and range while maximizing the number of targets visited by the UAV. The problem studied by Boysen et al. [
33] fixes a sequence of truck stops and schedules the trips of a drone to determine its take-off and landing stops. These types of problems are categorized by [
34] as the mothership and drone routing problem, which is solved by a set of heuristics embedded with a second-order cone program. In other cases, drones support the trucks in ways such as gathering information for the trucks or providing communication among the trucks [
35]. Interested readers can refer to [
36] for a survey on drone–truck combined operations. Compared with the existing studies in the literature, our study considers the problem of cooperatively planning the truck and drone paths to minimize the service time under the restricted traffic constraint, in which the truck can only move along the boundary of a restricted zone.
3. Problem Description
The problem under consideration is to cooperatively schedule a truck and a drone to provide delivery services to a set of customers in a restricted traffic zone. The input parameters of the problem are shown in
Table 1. At the beginning, the truck carries all cargo and the drone to the initial location (coordinate)
. There are
n customers that need to be served, the location (coordinate) of the
i-th customer is
, and the weight of the cargo to be delivered to the
i-th customer is
(
). The maximum load of the drone is
W (without loss of generality, it is assumed that
for any
i). Let
B denote the outer boundary of the restricted traffic zone; the velocity of the truck along the boundary is
. The maximum distance of the drone (after being fully charged) is
D. For the drone, its velocity changes with its load. The minimum velocity (when it is fully loaded) and the maximum velocity (when it is not loaded) are denoted as
and
, respectively. When the load of the drone is
w, its velocity is calculated as:
This problem is to determine the delivery sequence of customers (i.e., the path of the drone), denoted by , and based on this, the path of the truck consisting of a set of locations (also called intersections) at which the truck receives the returned drone for battery replacement and cargo reloading and then re-sends the drone for subsequent delivery. Assume that the truck carries sufficient batteries for the drone to complete the delivery task, i.e., battery recharging is not required during the task.
The first customer at which the drone needs to return back to the truck to replace the battery and reload cargo satisfies either one of the following conditions:
According to the weight capacity limitation, the drone cannot load the cargo for the next customer:
According to the power and consequent distance limitation, the drone cannot fly to the next customer and then return back to the truck:
where
denotes the optimal intersection of the truck and the drone on the outer boundary
B to minimize the flight distance of the drone from the current customer
to the truck and then to the next customer
. The first intersection
is determined as follows so as to minimize the back-and-forth travel time of the drone:
where
is the velocity of the drone starting from the truck to the next round’s first customer
, which is calculated based on the sum of the weights of customers
according to Equation (
1).
Regardless, the intersection should be within the travel ranges of the truck and the drone. Specifically, if the boundary is convex, the intersection should not exceed the orthogonal projection of the current customer onto the boundary, as illustrated in
Figure 2.
Similarly, for each next
k-th back-and-forth round, the customer
at which the drone needs to return back to the truck satisfies either one of the following conditions:
where
denotes the
k-th intersection (
) of the truck and the drone on the outer boundary
B, which is determined as follows:
where
is the velocity of the drone starting from the truck to the
-th round’s first customer
.
Based on the above conditions, we can obtain all of the customer locations
at which the drone needs to return back. These locations divide the delivery sequence into
K subsequences. Consequently, we can iteratively calculate the velocity
of the drone flying to each customer
as well as the time
at which the drone arrives at
. First,
and
can be directly calculated as
Next, for each
that belongs to the
k-th subsequence (
), but is not the first location of a subsequence,
and
can be calculated based on the previous customer as
For
, which is the first location of the
k-th subsequence (
), i.e.,
,
and
can be calculated as
The objective of the problem is to minimize the time at which the drone serves the last customer
, i.e., the completion time of all delivery tasks, and the problem can be formulated as follows (where
are auxiliary variables for ensuring that
is a permutation of
):
4. A Hybrid Optimization Method for the Problem
In this section, we propose a hybrid metaheuristic optimization method for the problem. As shown in the flowchart in
Figure 3, the method uses a main procedure that evolves a population of candidate solutions (i.e., customer delivery sequences) to search for an optimal (or near-optimal) drone path and employs a sub-procedure to optimize the truck–drone intersections for each main solution
in the population.
4.1. Main Metaheuristic for Optimizing the Drone Path
The main problem for optimizing the drone path is a sequence optimization problem. The method randomly initializes a population of solutions, each of which is a random permutation of the customer indices . The method then evolves the solutions using the WWO metaheuristic, which has demonstrated performance advantages in sequence optimization compared to other popular metaheuristics, such as GA, PSO, and BBO.
The basic principle of WWO is to assign each solution
a wavelength
that is inversely proportional to the solution fitness and make
search in a range proportional to
, such that high-fitness solutions search in small ranges, while low-fitness solutions search in large ranges to balance global and local searches. Initially, all wavelengths are set to 0.5; at each iteration, the wavelength of each solution
is updated according to its objective function value
as follows:
where
is a constant of 1.0026;
and
are the maximum and minimum objective function values in the population, respectively; and
is a very small positive value to avoid division by zero.
At each iteration, each solution performs a propagation operation, which, at each dimension i from 1 to n with a probability of , chooses a random subsequence and reverses it. In this way, the expected number of subsequence reversals on is ; the better the solution, the smaller the , and a small change in the solution is expected.
After propagation, if the updated solution is better than the original one, it replaces the original one in the solution. In particular, if the updated solution is better than any other solution searched before, i.e., it is a new best solution , a breaking operation is performed on to produce a random number (between ) of neighboring solutions, each of which is obtained by randomly swapping two customer positions and in the solution. The best neighbor, if better than , will replace in the population.
Compared to the basic WWO algorithm, here, we remove another operation called refraction, which replaces any solution that stays a predefined number of generations with a new solution moving towards the best-known solution; instead, we employ a population reduction policy from the simplified WWO (SimWWO), which reduces the population size
from an upper limit
to a lower limit
by iteratively removing the current worst solution:
where
and
are the maximum and minimum population sizes, respectively;
t is the current generation number or the number of fitness evaluations (NFEs); and
is the maximum generation number or NFEs of the algorithm.
Algorithm 1 presents the pseudo-code of the SimWWO metaheuristic for optimizing the drone path.
Algorithm 1: Simplified WWO algorithm adapted to optimize the drone path for the problem. |
|
5. Computational Experiments
We construct nine problem instances based on selected regions of the city of Hangzhou in Zhejiang Province, China.
Table 2 gives the basic information about the test instances, including the number of customers, the areas and perimeters of the restricted traffic zones, the average distance between the customers, and the average weight of the demands of the customers. The areas of these regions are most suitable for one truck and one drone to efficiently perform delivery tasks (for larger areas, larger numbers of trucks and drones are expected). For these instances, we set the maximum velocity of the truck as
km/h (according to the average velocity on urban roads in common conditions), the velocities of the drone as
km/h,
km/h, the maximum distance of the drone as
km, and the maximum load of the drone as
kg (according to the technical performance of a typical type of drone for express delivery).
For each test instance, we compare the adapted SimWWO algorithm with the following six metaheuristic algorithms for permutation optimization:
A GA that uses linear order crossover and shift-change mutation [
38].
A BBO algorithm [
39] adapted to this problem using subsequence migration [
40].
A DE algorithm for permutation optimization based on floating-to-integer mapping [
41].
A discrete PSO adapted to this problem using subsequence learning [
42].
Enhanced BBO [
43] that integrates local and global subsequence migration.
Basic WWO [
44] adapted to this problem using subsequence reverse propagation.
For each algorithm, we tune its control parameters to the nine problem instances, which results in the parameter setting, shown in
Table 3, that exhibits the best average performance on the whole test set.
To ensure a fair comparison, we set the termination condition as the maximum number of objective function evaluations (NFEs) that reaches
for all algorithms. We run each algorithm 30 times for each instance.
Table 4 presents the maximum, minimum, median, and standard deviation (std) of the objective values over the 30 runs of each algorithm for each instance, where the best median value on each instance is shown in bold; the last row gives the averaged median values of the nine instances. The best median value among the seven algorithms for each instance is shown in bold. A nonparametric Wilcoxon rank sum test is conducted to compare the results of the algorithms for each instance, and a superscript
before a median value indicates that there is a significant difference between the result of SimWWO and that of the corresponding comparative algorithm (at a confidence level of 95%). Moreover, the box plots in
Figure 4a–i give the median, minimum, maximum, first quartile (25%) and third quartile (75%) of the objective values obtained by the algorithms for instances 1–9, respectively. For illustration,
Figure 5,
Figure 6,
Figure 7,
Figure 8,
Figure 9,
Figure 10,
Figure 11,
Figure 12 and
Figure 13 show the resulting routes obtained by the different algorithms on instances 1–9, respectively.
As we can observe from the results, the adapted SimWWO achieves the best median values among the seven algorithms for all instances. For instance 1, there are five algorithms (except GA and PSO) that obtain the same best median value; however, only SimWWO obtains the best median value in all 30 runs. For instance 2, DE, WWO, and SimWWO obtain the same best median value, and both WWO and SimWWO obtain the best median value in all 30 runs. For the remaining instances 3–9, SimWWO uniquely obtains the best median values among the seven algorithms. According to statistical tests, SimWWO obtains significantly better results than GA, PSO, and BBO for all nine instances and better results than DE, EBO, and WWO for eight instances. From the box plots, we can also observe that the deviation of the resulting objective function value of SimWWO among the 30 runs is the smallest among the comparative algorithms, which validates the robustness of SimWWO. Averaged over the nine instances, the median delivery time of the solution obtained by SimWWO is only 48.96% of that obtained by GA, 65.6% of that obtained by PSO, 77% of that obtained by DE, 80.6% of that obtained by BBO, and 78.45% of that obtained by EBO. In addition,
Figure 14 presents the convergence curves of the algorithms for the nine instances, which shows that DE converges quickly for the first two small-size instances, but the convergence speed of SimWWO is the fastest for the remaining medium- and large-size instances. In summary, the experimental results demonstrate the performance advantages of SimWWO over the other comparative algorithms in solving the considered problem.