1. Introduction
Autonomous underwater vehicles (AUVs), which are small crafts capable of autonomously executing underwater missions, boasting excellent concealment and high safety, have been widely applied in fields such as ocean exploration, environmental monitoring, and underwater rescue [
1]. Tasks such as localization [
2], path planning [
1], trajectory tracking [
3], and detection [
4,
5] are essential for AUVs. Path planning and trajectory tracking are the indispensable components of AUV control systems to perform tasks effectively [
6].
Path planning requires algorithms to possess the ability to predict the future and select points within the environment [
6]. Therefore, compared to commonly used methods such as the A* algorithm [
7,
8], the dynamic window approach [
9], and theartificial potential field method [
10,
11], model predictive control (MPC), with its receding horizon feature and predictive characteristics, exhibits excellent compatibility with path planning [
12].
An advanced approach to MPC was formulated in [
3], emphasizing the synergy between localized planning and path tracking for autonomous vehicles. This framework introduced an enhanced MPC method, leveraging refined particle swarm optimization techniques, to tackle both planning and tracking challenges in an integrated manner. The work reported in [
13] devised a path planning strategy for intelligent vehicles to avoid collisions with dynamic pedestrians, incorporating an attention-guided long short-term memory network, a modified social force model, and a systematic exploration of MPC. This methodology specifically addresses the scenario of pedestrian–vehicle interactions at unsignaled intersections, emphasizing dynamic conflict resolution. The research in [
14] focused on addressing the challenge of robust collision-free formation navigation for a set of multiple unmanned surface vehicles, considering these vehicles as underactuated nonlinear systems operating under both state and input constraints. The objective was to devise a solution for these constrained unmanned surface vehicles to navigate safely while maintaining their formation.
It can be seen that MPC has a good ability to deal with dynamic obstacles. Moreover, the multi-horizon prediction into the future enables MPC to generate more executable paths [
15]. However, MPC encounters the issue of high computational cost, which is unfavorable for online solutions [
15]. To address the issue of computational burden, an event-triggered mechanism is often combined with MPC to solve problems. In [
16], an EMPC strategy incorporating an adaptive APF was devised for autonomous electric vehicles, enabling both obstacle evasion and precise trajectory tracking. In [
17], an EMPC approach was formulated to accomplish trajectory following and obstacle circumvention for a wheeled mobile robot, taking into account input constraints and external disturbances.
The complete AUV motion planning generally consists of path planning and trajectory tracking [
18]. Common methods used to handle trajectory tracking include PID control [
19,
20], sliding mode control [
21,
22], backstepping control [
23,
24], and so on. In contrast to the majority of conventional control approaches, the distinctive strength of MPC resides in its inherent ability to systematically incorporate and manage system constraints directly within the controller’s design phase, which renders MPC an exceptionally robust platform for tackling diverse control engineering challenges [
25].
In [
25], an innovative LMPC framework was introduced for AUVs, leveraging computational resources through online optimization to enhance trajectory tracking performance. By incorporating a nonlinear backstepping tracking control law, the study formulated contraction constraints within the LMPC problem, theoretically ensuring closed-loop stability and optimizing the AUV’s trajectory execution. The work reported in [
26] formulated a robust MPC approach tailored for AUV trajectory tracking amidst bounded disturbances. This tube-based MPC approach includes two distinct optimization tasks: the first tackles a standard MPC problem for the nominal system, establishing a reference state trajectory, while the second strives to confine the perturbed system’s state within a tube encircling this reference trajectory, thereby ensuring the robust stability of the closed-loop systems. Research work in [
27] designed a trajectory tracking scheme using the sliding mode control technique in order to be robust against bounded disturbances. In order to align the planned position, [
27] made strict assumptions on the reference state, i.e., that position tracking error converged while velocity error converged.
In summary, the existing research on path planning and trajectory tracking for AUVs has the following limitations. For path planning, EMPC is often only used for local obstacle avoidance during trajectory tracking (e.g., [
16,
17]) rather than the entire path planning process, which we believe provides a limited reduction in computational burden during planning. Furthermore, path planning with safety as a consideration focuses excessively on the degree of the collision hazard itself (e.g., [
28,
29]) while neglecting the changing rate of the collision hazard, which is often more closely related to environmental changes. For trajectory tracking, the use of sliding mode control methods (e.g., [
27]) requires constraints on the reference trajectory that needs position tracking error to converge while velocity error converges, and due to the inherent chattering phenomenon in sliding mode control, the actual tracking performance is not ideal. Moreover, the chattering phenomenon in sliding mode control has multifaceted impacts on the system, including degrading system stability, affecting dynamic performance, damaging system hardware, reducing control accuracy, and limiting the scope of system application [
30]. Existing LMPC methods typically use backstepping or linear sliding mode control (e.g., [
25]) as auxiliary control laws and are mostly applied to other models rather than AUVs (e.g., [
31]).
To address these issues, for path planning, this paper proposes an EMPC-based path planning algorithm operating during the entire process while leveraging the environmental changes as a triggering mechanism to reduce the computational burden. Additionally, we innovatively incorporate a collision hazard evaluation function with a threshold based on the changing rate of collision hazard, enabling the AUV to make selections between reachability and safety. For trajectory tracking, to avoid the chattering phenomenon caused by sliding mode control while ensuring the stability of MPC, this paper constructs an LMPC method for trajectory tracking using integral sliding mode control as an auxiliary control law. This approach not only guarantees stability but also ensures tracking performance, and there is no need to make additional assumptions on the reference state.
The main contributions of the paper are as follows:
- (1)
An EMPC leveraging environmental changes as a triggering mechanism is developed for path planning. Compared to [
1,
12], our approach, by accepting feasible solutions according to environmental change not only ensures reachability but also imposes a lower computational burden.
- (2)
We innovatively introduce the changing rate of the collision hazard as the condition for safety penalty. Considering the possibility that the environment itself is at a high risk, traditional methods such as [
28,
29] that solely rely on hazard evaluation functions to determine collision avoidance actions may cause AUsV to constantly attempt to flee in such an environment. The proposed method takes safety into account based on environmental changes, which allows a larger feasible area while balancing safety and reachability.
- (3)
The LMPC augmented with integral sliding mode control as an auxiliary control law has been employed for the trajectory tracking of AUV. Compared to [
27], the proposed tracking approach not only ensures tracking performance but also eliminates the chattering phenomenon without making any assumptions on the reference path (position tracking error is converged while velocity error converged).
The remaining part of this paper is organized as follows: In
Section 2, we introduce our path planning method. In
Section 3, we first design the auxiliary control law and then incorporate it into our LMPC-based trajectory tracking method. Feasibility and stability are proven rigorously. In
Section 4, we introduce our integrated path planning and trajectory tracking method. In
Section 5, we explain our simulation results. In
Section 6, we draw a conclusion of our work.
2. Path Planning Method Based on EMPC and Changing Rate of Collision Hazard
In this section, we will first explain the environment and then establish a kinematic model that describes the movement of AUV, and subsequently, we will present an approach to path planning utilizing EMPC as the foundation.
2.1. Environment Description
To better illustrate our path planning methods, we will first explain the working environment of AUVs in this part. Thus, the following two assumptions are needed.
Assumption 1. The environmental information obtained from the sensor of AUV is comprehensive.
Remark 1. Assumption 1 differs from the limit that “Global information is known”. Here, we require that all environmental information within the sensor’s range is known, rather than requiring that the entire global information is previously known.
Assumption 1 imposes a prerequisite for path planning, The credibility of sensor information typically serves as the foundation for path planning and even the motion design of AUVs, as exemplified in [
1,
32]. Therefore, to pursue more precise path planning, the formulation of Assumption 1 is reasonable. Meanwhile, to facilitate the description of obstacle constraints, we make the following assumption.
Assumption 2. All obstacles can be completely enclosed by circular areas.
Assumption 2 requires that circular areas be able to enclose obstacles. In fact, existing methods often use polygons or circles to enclose irregular obstacles such as [
33]. In the real situation, the enclosed areas of these rules will simplify the description of obstacle constraints and simplify the calculation of MPC [
33]. Here, for the sake of convenience, we uniformly use circular areas to enclose obstacles.
Assumptions 1 and 2 limit the path planning problem to an environment where sensor detection is possible and obstacle information can be formulated, laying a foundation for us to subsequently develop a path planning method based on MPC.
2.2. AUV Kinematics
Given the AUV’s sway velocity being substantially lower than its surge velocity, the nonholonomic kinematic equations effectively model the vehicle’s motion dynamics at the velocity level with sufficient accuracy, as demonstrated in [
1]. Thus, the AUV motion in the horizontal plane is
where
is the desired position and orientation;
are surge, sway, and yaw velocities;
v is the resultant tangential velocity of the vehicle; and
is the angular velocity. This can be written as
where
is the state of the system,
is the input.
MPC can be used for real-time path planning, but due to its computationally intensive nature, it is difficult to ensure the efficiency of online planning when using MPC. Additionally, existing path planning methods often prioritize reachability as the primary goal, but in hazardous situations, safety should be the primary consideration.
Therefore, the main objective in path planning of this paper is as follows: Plan a collision-free path for AUV (
2) while ensuring computational efficiency, requiring the capability of trading off the priorities of reachability and safety.
2.3. Path Planning Method Based on EMPC
In this part, we aim to outline our proposed path-planning method. Crucial components of our method will be introduced next.
Problem 1. Define as the event-triggered time. Let = denote the target point; denote the ground penalty which is the basic penalty of the environment; denote the collision hazard evaluated at , which is the risk assessment at [29], denote the total number of obstacles; and denote the threshold of changing rate of collision hazard. For , the optimal problem can be formulated aswhere the cost function is is the prediction horizon; Q, R, P and S are weighting matrices; and are, respectively, the region of state and input of AUV system (2); is the position; and is the radius of the ith obstacle. κ is a constant relevant to , let ,where , β is a positive constant to ensure that κ does not switch continuously around . The weights Q, R, P, and S are assigned based on the current objectives that need to be achieved, with a higher value assigned to the weight corresponding to the primary goal. Typically, to ensure reachability, Q is set to a relatively large value. Furthermore, to prioritize safety in critical situations, S, as the weight for the collision hazard function, needs to be set to a significantly high value. The ground penalty is used to prevent the AUV from driving closely along the boundary of obstacles. , on the other hand, is a penalty based on safety, ensuring that the AUV prioritizes safety before reachability when it is in a dangerous situation. The setting of provides a boundary for switching between reachability and safety.
2.4. Event-Triggered Mechanism
Due to the significant computational load of MPC, it is challenging to ensure the efficiency of online planning. Therefore, we decide to introduce an event-triggered mechanism to enhance computational efficiency.
Since MPC can generate a sequence of optimal moves in a single computation, we propose to only re-plan when the environment changes (i.e., re-compute MPC). If the environment remains unchanged, we will utilize the previously computed feasible solution. Thus, we first need to define what a change in the environment is.
2.4.1. Definition of Environment Change
We categorize the following main cases as belonging to the realm of environmental changes.
- (1)
New obstacles are added or existing obstacles are reduced.
- (2)
The obstacle changes from stationary to dynamic.
- (3)
The velocity or orientation of a dynamic obstacle changes (i.e., it does not match the previously predicted motion state).
Remark 2. As a supplementary note to the third point, the presence of obstacles with constant speed and heading in the environment is not considered a change in the environment, as their subsequent positions and impact on our AUV can be predicted.
In particular, there are some other special cases in which we need to recalculate the MPC.
- (1)
A new penalty term is added to the cost function (for instance, is added when ).
- (2)
All feasible solutions obtained from a single MPC computation have been exhausted.
At these special moments, a trigger is also required.
Moreover, since it is difficult to mathematically explain how to avoid the Zeno phenomenon based on environmental changes as a triggering mechanism, we make the following assumption while considering practical situations.
Assumption 3. There exists a minimal time during two changing actions of the environment.
Remark 3. From a practical perspective, this assumption is reasonable. Firstly, the finite obstacles present in the environment are unlikely to be constantly altering their navigation postures. Additionally, in real-world scenarios, there exists a sampling time, where is equal to this sampling time, such as [16,17]. 2.4.2. Determination of Environmental Change
Based on the conclusions of [
28,
29], the collision hazard evaluation function typically takes into account factors such as the relative speed, angle, and distance between ships. Therefore, the collision hazard evaluation function can effectively describe environmental information in a mathematical manner.
According to [
28,
29], the collision hazard evaluation function can be formulated as
where
represents the minimum safe distance,
represents the distance between AUV and the
jth obstacle,
is the relative speed,
is a positive constant, and
.
Therefore, we can predict the value of
at a fixed point regardless of our AUV to determine if the environment has changed, as shown in
Figure 1.
A is a fixed point;
,
,
are stationary obstacles; and
,
are dynamic obstacles whose velocities are
and
, respectively.
At time t, we can predict in the future N steps of fixed point A, written as , which is relevant to .
Compare the predicted collision hazard with the actual calculated collision hazard. If they are not equal, it indicates that the environment has changed, and the MPC needs to be recalculated. Therefore, the event-triggered mechanism based on environment changes can be given by
where
The upper and lower bounds for the inter-event time
are specified as
, respectively. Thus, Zeno behavior can be avoided because of the designed event-triggered mechanism (
5).
2.5. Design of Changing Rate-Based Collision Hazard Function
The existing literature, such as [
28,
29], often advocates for determining collision avoidance based on whether a ship is in a high-risk situation (i.e., checking if
exceeds a certain collision hazard threshold). However, ships may inherently operate in high-risk environments, resulting in
continuously being treated as a penalty in the cost function. This defeats the original purpose of setting a threshold. The threshold is intended to enable the system to switch between reachability and safety. If
is constantly included, the AUV will first escape to a safe area and then re-ensure reachability in a safer environment, significantly increasing the path length and consequently energy consumption. Moreover, paths in high-risk environments are not necessarily unusable. If the AUV always prioritizes safety, it may become overly conservative, losing its ability to operate in complex scenarios.
Therefore, this paper proposes using the rate of change in risk as a condition to determine whether to include as a penalty, which is analogous to how an airbag in a car deploys based on acceleration.
Calculation of
As shown in
Figure 2, if we consider the most extreme scenario, then the AUV and the dynamic obstacle are at the minimum safe distance from each other, and they are moving towards each other. The following equation can be satisfied.
where
is the resultant velocity.
At this time,
where
is the extreme collision hazard.
The extreme changing rate of the collision hazard is when it changes from 0 to
within a sampling step. And if we consider all the obstacles, it can be formulated as
where
is the resultant velocity with the
jth obstacle.
Specifically, when the obstacle is stationary, the resultant velocity is the velocity of our AUV.
2.6. Algorithm of the Proposed Path Planning Method
The path planning algorithm is summarized as Algorithm 1.
Algorithm 1 Path Planning Method based on EMPC and changing rate of collision hazard |
- Input:
v (resultant tangential velocity), (yaw velocity) - Output:
(position) - 1:
Given the initial state of AUV and total duration - 2:
Calculate the changing rate threshold by (8) - 3:
Set a fixed point A to judge if the environment changes - 4:
- 5:
while do - 6:
Evaluate the collision hazard of environment by (4) - 7:
Calculate the changing rate of collision hazard - 8:
Predict the collision hazard of A in the future steps, written as - 9:
if then - 10:
Use as cost function - 11:
else - 12:
Use as cost function - 13:
Solve Problem 1 to obtain the input sequence - 14:
Use as input at time t - 15:
, - 16:
for do - 17:
if then - 18:
Use as input at time - 19:
- 20:
else - 21:
- 22:
break - 23:
Record the position and input
|
Remark 4. In contrast to [29,32,34], which solely emphasizes the magnitude of collision hazard, the path planning method presented in this paper innovatively incorporates the changing rate of collision hazard. This additional consideration enables a more comprehensive assessment of potential hazards and allows a larger feasible area for planning since our method can work in a high collision hazard environment but [29,32,34] attempts to flee from such environment. Furthermore, by employing EMPC to balance between feasible and optimal solutions, our approach enhances planning efficiency. Consequently, the paths generated by our method are not only safer but also more rational, as they account for both the current risk level and its trend over time. 5. Simulation Results
5.1. Path Planning
First, we will validate path planning. We set the parameters as follows: the sampling time is s, the prediction horizon in path planning is , and the weighting matrices are as follows: , , , , maximal velocity is m/s, minimal velocity is , maximal angular velocity is rad/s, and minimal angular velocity is . The changing rate threshold is set as (m/s)2, where .
As shown in
Figure 4,
Figure 4a represents the initial environment, where there are two dynamic obstacles. However, they are both in a static state for an initial period of time. When they start moving, their speeds are as follows: One of them moves solely horizontally with a speed of
m/s, while the other moves both horizontally with a speed of
m/s and vertically with a velocity of
m/s. The red line segments indicate the predicted sequence of the path generated by the MPC path planner. In the period when the dynamic obstacle is initially stationary, MPC is not updated (i.e., only utilizing the feasible solution without recalculation), AUV will follow the entire predicted sequence of the path, instead of strictly utilizing the first item of the predicted sequence, as illustrated in
Figure 4b. When the dynamic obstacle begins to move, MPC recalculates, and at this point, instead of utilizing a feasible solution, it employs the optimal solution as input, as depicted in
Figure 4c, where the black hollow circles represent the initial position of dynamic obstacles.
Figure 4d–f, illustrate the collision avoidance process of AUV with dynamic obstacles. Since the speed and direction of the obstacle remained unchanged, feasible solutions were used throughout the entire process.
Figure 4g demonstrates that AUV has successfully completed the collision avoidance, and
Figure 4h shows that AUV has finally reached its target point.
Figure 5 shows the velocity and angular velocity changes during path planning.
Other situations are shown in
Figure 6 and
Figure 7. Meanwhile, in order to verify the improvement of computing efficiency by an event-triggered mechanism, we recorded the number of MPC calculations in the above three environments, as shown in
Table 1. It can be seen that under the action of an event-triggered mechanism, the calculation time of MPC decreases significantly.
5.2. Trajectory Tracking
In this section, we will validate the proposed trajectory tracking algorithm using the sine function and conduct a comparative analysis with the straightforward approach of employing integral sliding mode control. The parameter of LMPC settings are outlined as follows: , , , the initial state is set as , the prediction horizon is , the sampling time is s, the total duration is set to s, the weighting matrices are , , and I is unit matrix. The parameters of integral sliding mode control are , , , , , , , , , , , , which is the same as the auxiliary controller.
Figure 8 illustrates the simulation results of trajectory tracking using both integral sliding mode control and LMPC. In this figure, the blue line represents the reference trajectory, the green line denotes the tracking trajectory achieved by integral sliding mode control, and the red line indicates the tracking trajectory by LMPC. It can be seen that the chattering phenomenon occurs when the sliding mode control is used, which affects the tracking effect. However, LMPC outperforms the straightforward application of integral sliding mode control in terms of tracking performance. Moreover, LMPC exhibits a significantly faster convergence speed in the initial stage compared to integral sliding mode control, and it is capable of avoiding the chattering phenomenon that may occur in integral sliding mode control.
Figure 9 depicts the state tracking performance, and it can be observed that the integral sliding mode control exhibits relatively large tracking errors in the initial stage, requiring a longer convergence time to the reference trajectory compared to LMPC. Furthermore, the chattering phenomenon significantly impacts the tracking performance across each dimension.
Figure 10 shows the simulation results of the input signals.
5.3. Integrated Path Planning and Trajectory Tracking
In this section, we will conduct simulation validation for the proposed integrated path planning and trajectory tracking approach. All parameter selections are identical to those previously defined, and the environment setup for the path planning component remains unchanged from previous discussions.
Figure 11 presents the simulation results, demonstrating that the proposed method can effectively accomplish the task of integrated path planning and trajectory tracking with satisfactory performance.
Figure 12 illustrates the input results for this simulation.