Distributed Adaptive Optimization Algorithm for High-Order Nonlinear Multi-Agent Stochastic Systems with Lévy Noise

Yang, Hui; Sun, Qing; Yuan, Jiaxin

doi:10.3390/e26100834

Open AccessArticle

Distributed Adaptive Optimization Algorithm for High-Order Nonlinear Multi-Agent Stochastic Systems with Lévy Noise

by

Hui Yang

,

Qing Sun

and

Jiaxin Yuan

^*

College of Air Transportation, Shanghai University of Engineering Science, Shanghai 201620, China

^*

Author to whom correspondence should be addressed.

Entropy 2024, 26(10), 834; https://doi.org/10.3390/e26100834

Submission received: 31 July 2024 / Revised: 26 September 2024 / Accepted: 29 September 2024 / Published: 30 September 2024

(This article belongs to the Special Issue Information Theory in Control Systems, 2nd Edition)

Download

Browse Figures

Versions Notes

Abstract

:

An adaptive neural network output-feedback control strategy is proposed in this paper for the distributed optimization problem (DOP) of high-order nonlinear stochastic multi-agent systems (MASs) driven by Lévy noise. On the basis of the penalty-function method, the consensus constraint is removed and the global objective function (GOF) is reconstructed. The stability of the system is analyzed by combining the generalized Itô’s formula with the Lyapunov function method. Moreover, the command filtering mechanism is introduced to solve the “complexity explosion” problem in the process of designing virtual controller, and the filter errors are compensated by introducing compensating signals. The proposed algorithm has been proved that the outputs of all agents converge to the optimal solution of the DOP with bounded errors. The simulation results demonstrate the effectiveness of the proposed approach.

Keywords:

stochastic multi-agent systems; adaptive backstepping control; command filter; distributed optimization problem; Lévy noise

1. Introduction

Research of stochastic systems has gained considerable attention in recent years. For example, Liang [1] constructed a common output feedback controller, independent of the switching signal, using the backstepping method to solve the global output feedback probability stability problem for a class of switching random nonlinear systems under arbitrary switching. Fang [2] explored a novel adaptive optimal control strategy for a class of sophisticated discrete-time nonlinear Markov jump systems via Takagi–Sugeno fuzzy models and reinforcement learning techniques. Furthermore, the usual sense stochastic noise is just driven by continuous Brownian motion [3,4], which may be a description of continuous stochastic volatility.

However, numerous types of discontinuous noise exist in many physical systems, for instance, random faults, abrupt changes, and sudden disruptions [5]. Furthermore, yet, there is a distinct type of noise, namely “Lévy noise”, which can characterize both Brownian motion and Poisson jump processes [6]. To date, there have been advancements in filed of control the stochastic systems with Lévy noise [7,8,9,10], but there has been no research of the consensus problem for MASs with Lévy noise. Furthermore, consensus control algorithms for MASs with stochastic noise which is not Lévy noise have developed rapidly and a substantial body of literature has been produced. For example, Refs. [11,12,13] discussed the stochastic linear MASs. Ref. [11] developed a two-step algorithm for each agent in order to dynamically estimate the states of its neighbors. Controllers based on the error between the estimated states of the neighbors and the complete state of the agent were designed by [12], aimed at resolving the limited consensus problem in continuous-time linear MASs with additive systems and communication noise. Ref. [13] introduced the innovative notion of the sub-accessibility of the sliding motion approaching a particular sliding surface for generic MASs which are driven by Brownian motion. References [14,15,16,17] discussed the stochastic nonlinear MASs. In detail, Ref. [14] investigated the consensus tracking problem for the MASs which have outputs, partial state constraints, and saturated inputs, where the unmodeled dynamics are evaluated by the use of RBFNN. In [15], the authors explored a class of fuzzy adaptive leader–follower tracking control problems for MASs with stochastic noise which have unknown dead-zone inputs, where the stochastic disturbances and uncertain functions of the MAS are approximated by introducing a fuzzy logic system. Furthermore, to ensure that all agents reach a consensus within a limited time frame, Ref. [16] has developed a distributed control algorithm based on stochastic stability theorems in finite-time and has integrated a power integrator technique. To achieve the consensus of stochastic nonlinear MASs under directional communication topology, Ref. [17] put forward a distributed adaptive fuzzy control scheme, which employs the integral mean theorem and the approximation properties of fuzzy logic systems.

In the above works, the research of MASs only focus on reaching the basic consensus behavior. However, distributed optimization is commonly preferred for real applications. The DOP is an expansion of the MASs consensus problem and refers to addressing the DOP on the basis of consensus. The main goal of DOP is to achieve the minimization of the GOF, which is the summary of all the local objective functions [18]. The result is that all of the agents are cooperating in trying to achieve the optimal value of the GOF. One of the key goals of DOP for MASs is the design of adequately distributed controllers [19], so that all MASs can converge cooperatively under a certain communication topology, and achieve the optimal solution of distributed optimization after convergence.

There have been several work applications to the study of designing distributed optimization algorithms based on first-order MASs and second-order MASs for solving the DOP [20,21,22,23,24]. Based on event-triggered strategies, Ref. [20] designed a distributed optimization algorithm to solve the DOP of continuous-time first-order MASs with external disturbances and discrete communication. Ref. [23] proposed an improved distributed continuous-time algorithm to design an event-triggered algorithm for the solution of a generalized DOP. However, numerous realistic systems, like manipulators and helicopters, cannot be represented by these low-order dynamics. Therefore, the DOP of high-order nonlinear MASs has attracted the attention of some scholars. For example, Ref. [25] constructed a bounded local control law for achieving global optimal consensus in MASs under the assumption that all agents reach an agreement on the condition of minimizing the summary of all agents’ objective function. An adaptive Lyapunov-based backstepping method was proposed by [26] to decompose the DOP of high-order MASs into an optimization or control problem for solving multiple first-order subsystems. Ref. [27] looked into the subject of the optimal output consensus and proposed an embedded control system which using an optimal signal generation technique. Ref. [28] investigated a distributed optimization algorithm of bipartite containment control for high-order MASs with state constraints. However, DOP research for high-order nonlinear stochastic systems has not yet been conducted.

Motivated by the above analysis, an adaptive NNs backstepping controller which is based on the command filter is developed in this article to solve the DOP of high-order nonlinear MASs which contain the Lévy stochastic noise. The significant contributions of this work in comparison to previous research are listed below.

(1): In contrast to [29,30,31], which only applied the adaptive control method based on observer to address the consensus problem, we focus on resolving the DOP for MASs with unmeasurable states. A distributed optimal adaptive controller is introduced to solve this problem, which utilizes the penalty function and the negative gradient. The objective of this controller is to ensure that the outputs of all the agents will progressively arrive at the optimal value of the GOF.
(2): All of Refs. [32,33,34] only solved the low-order MASs consensus problem with stochastic noise; in this article, a distributed optimal backstepping controller is proposed to solve the DOP for high-order MASs with unmeasurable states and Lévy noise.
(3): Different from our study, Refs. [25,27,35,36,37] did not combine neural networks (NNs), observers, and command filtering to solve the DOP for MASs with Lévy noise. NNs are used for approximation of unknown nonlinear functions and stochastic noise, and observers are used to obtain unmeasured states. We combine the command-filtered control technology with the error compensation technology to solve the problem of “explosion of complexity” and eliminate the effect of filtering errors.

2. Preliminaries

2.1. Graph Theory

Consider the MASs involving n agents, we take an undirected graph

Q = (M, Z, \bar{A})

to represent the relationship between agents, where

M = \{m_{1}, \dots, m_{N}\}

is a node set,

Z = \{(m_{i}, m_{j})\} \in M \times M

stands for the edge set and

\bar{A} = \{a_{i j}\} \in R^{N \times N}

is the adjacency matrix. An edge

(m_{i}, m_{j}) \notin Z

, if and only if

a_{i j} = 0

. Denote

N_{i} = \{j |(m_{i}, m_{j}) \in Z\}

as the neighbor set of node ß and the matrix

D = d i a g (d_{1}, \dots, d_{N})

,

d_{i} = \sum_{j \in n = N_{i}} a_{i j}

as the degree matrix. The Laplacian matrix

L = D - \bar{A}

.

2.2. Multi-Agent System

Take the MASs involving n agents and the dynamic for agent i with Lévy noise is:

\begin{matrix} \{\begin{matrix} d x_{i, m} (t) & = [x_{i, m + 1} + h_{i, m} (X_{i, m})] d t + F_{i, n} (X_{i, n} (t), t) d w (t) \\ + \int_{R} G_{i, m} (X_{i, n} (t), t, ζ) N (d t, d ζ) \\ d x_{i, n} (t) & = [u_{i} (t) + h_{i, n} (X_{i, n})] d t + F_{i, n} (X_{i, n} (t), t) d w (t) \\ + \int_{R} G_{i, n} (X_{i, n} (t), t, ζ) N (d t, d ζ) \\ y_{i} (t) & = x_{i, 1} (t) \end{matrix} \end{matrix}

(1)

where

m = 1, \dots, n

,

X_{i, m} = {(x_{i, 1}, x_{i, 2}, \dots, x_{i, m})}^{T} \in R^{m}

are the system states vectors,

u_{i}

is the control input of the system,

y_{i}

represent system outputs, and

h_{i, m} (X_{i, m})

are unknown nonlinear functions.

Assume that

(Ω, F_{t}, {\{F\}}_{t \geq 0}, P)

is a complete probability space. Furthermore,

w (t)

be a one-dimensional

F_{t}

-adapted Brownian motion.

N (t, ζ)

is a

F_{t}

-adapted Poisson random measure defined on with intensity measure

π

and a compensator

\tilde{N}

. We decide that N is not related to B and

ϑ

is a Lévy measure with

\tilde{N} (d t, d ζ) : = N (d t, d ζ) - ϑ (d ζ) d t

, where the pair

(B, N)

is called a Lévy noise.

2.3. Problem Formulation

Communication topology for undirected connected, DOP is portrayed as:

\begin{matrix} min_{x \in R^{N}} £_{i} (x_{i}), s . t . L x = 0_{N} \end{matrix}

(2)

where

x = {[x_{1}, \dots, x_{N}]}^{T}

. The approximate optimization problem is formulated using the principles of penalty function theory, which is described below:

\begin{matrix} min_{x \in R^{N}} \sum_{i = 1}^{N} £_{i} (x_{i}) + \frac{1}{2} η x^{T} L x \end{matrix}

(3)

where

η > 0

is a constant penalty parameter and

\frac{1}{2} η x^{T} L x

is the penalty term for a violation of the consensus constraint

L x = 0_{N}

.

This article deals with DOP, the GOF

£ : R^{N} \to R

is defined as the sum of the strictly convex objective function

£_{i}

:

£ (x_{1}) = \sum_{i = 1}^{N} £_{i} (x_{i, 1}) .

(4)

where

x_{1} = {[x_{1, 1} x_{2, 1} \dots x_{N, 1}]}^{T}

. According to [38],

1_{N}

is the eigenvector of the Laplacian matrix for the eigenvalue 0; when

α \in R

, if

x_{1} = α \cdot 1_{N}

, we can get:

L x_{1} = 0 .

(5)

\begin{matrix} x_{1}^{T} L x_{1} = 0 . \end{matrix}

(6)

Then, based on (4) and (6), we can define the penalty function as:

P (x_{1}) = \sum_{i = 1}^{N} £_{i} (x_{i, 1}) + x_{1}^{T} L x_{1} .

(7)

This paper intends to develop a control input

u_{i}

in order that each agent

{lim}_{t \to \infty} x_{i, 1} (t) \to x_{i, 1}^{*}

. Let the optimal solution

x_{1}^{*} = (x_{1, 1}^{*}, \dots, x_{N, 1}^{*})

. Define the optimal solution

x_{i, 1}^{*}

for agent i as:

(x_{1, 1}^{*}, \dots, x_{N, 1}^{*}) = \underset{(x_{1, 1}, \dots, x_{N, 1})}{arg min} P (x_{1}) .

(8)

According (7) and (8), when the MASs receive the optimal solution

x_{1}^{*}

we will get that all agents will achieve consensus and synchronously arrive at the optimal solution.

Then, we can develop the local objective function for agent i as:

\begin{matrix} £_{i} (x_{i, 1}) = m_{i} x_{i, 1}^{2} + τ_{i} x_{i, 1} + n_{i} \end{matrix}

(9)

where

m_{i} > 0

and

n_{i}

and

τ_{i}

are constants, with

1 \leq i \leq N

.

Remark 1.

From (7), we can see the penalty function consists of two parts. Where

\sum_{i = 1}^{N} £_{i} (x_{i, 1})

is the GOF and

x_{1}^{T} L x_{1}

is the penalty term which can make all agents to the consensus. The purpose of this article is to construct a controller that can be satisfied to minimize the penalty function while minimizing the GOF and ensure the agents achieve consensus.

The following lemmas are used to facilitate the calculation.

Lemma 1

([10]). Let

V (x, t) \in C^{2, 1} (R^{n} \times R^{+}; R)

, that can be continuously differentiated twice in x and once in t, and then we can design the operator

L V

as follows:

\begin{matrix} L V (x, t) = & V_{t} (x, t) + V_{x} (x, t) τ (x, t) \\ + \frac{1}{2} t r (σ^{T} (x, t) V_{x x} (x, t) σ (x, t)) \\ + \int_{R} (V (x + κ (x, t, ζ)) - V (x, t)) ϑ (d ζ) \end{matrix}

(10)

where

\begin{matrix} V_{x} (x, t) = (\frac{\partial V (x, t)}{\partial x_{1}}, \dots, \frac{\partial V (x, t)}{\partial x_{n}}) \\ V_{x x} (x, t) = {(\frac{\partial^{2} V (x, t)}{\partial x_{k} \partial x_{l}})}_{n \times n} \\ V_{t} (x, t) = \frac{\partial V (x, t)}{\partial t} \end{matrix}

and

τ (x, t)

represent the parameters of drift term,

σ (x, t)

represent the Brownian motion term, and

κ (x, t, ζ)

for the Poisson jump term.

Lemma 2

([39]). The command filter is defined as:

\begin{matrix} {\dot{\bar{a}}}_{i, 1} = κ_{n} {\bar{a}}_{i, 2} \\ {\dot{\bar{a}}}_{i, 2} = - 2 ς κ_{n} {\bar{a}}_{i, 2} - κ_{n} ({\bar{a}}_{i, 1} - α_{i}) \end{matrix}

(11)

where

ς \in (0, 1]

and

κ_{n} > 0

are the positive parameters to be designed and

{\bar{a}}_{i, 1}

and

α_{i}

express the command filter output signal and input signal.

{\bar{a}}_{i, 1} (0) = α_{i} (0), {\bar{a}}_{i, 2} (0) = 0 .

Lemma 3

([40]). Any

H_{1}, H_{2} \in R^{n}

, satisfied that:

{H_{1}}^{T} H_{2} \leq \frac{λ^{ϖ}}{ϖ} {∥H_{1}∥}^{ϖ} + \frac{1}{ψ λ^{ψ}} {∥H_{2}∥}^{ψ}

(12)

where

ϖ > 1

,

λ > 0

,

ψ > 1

, and

(ϖ - 1) (ψ - 1) = 1

.

Lemma 4

([41]). A function

V (x, t) \in C^{2}

, two functions

Υ_{1}

and

Υ_{2}

\in K_{\infty}

, two positive constant ℑ and

I

,

t > t_{0}

, we can get that:

\begin{matrix} Υ_{1} (∥x∥) \leq V (x) \leq Υ_{2} (∥x∥) \\ L V (x, t) \leq - ℑ V (x, t) + I \end{matrix}

(13)

for

\forall x \in R^{n}

. Thereafter, the system exists as a distinct solution and is

E [V (t)] \leq V (0) e^{- A t} + B / A

compliant. Then, all signals have probability bounds.

Lemma 5.

Under [7], the experimental solution of the (1) is said to be:

1.: At the origin it is almost certainly globally stable when $℘ \in (0, 1)$ , a class $K_{\infty}$ -function κ exists let $P \{∥X_{i, l} (t)∥ < κ (∥X_{i, l} (t_{0})∥) + ℘_{0}\} > 1 - ℘$ , for all $t \in [t_{0}, \infty)$ and $X_{i, l} (t_{0}) \in R^{n}$ , $℘_{0}$ is a nonnegative constant.
2.: At the origin, it is almost certainly the global real $K_{\infty}$ -index that is stable and $∥X_{i, l} (t)∥ ⩽ κ (∥X_{i, l} (t_{0})∥) e^{- ℘ (t - t_{0})} + ℘_{0}$ , for all $X_{i, l} (t_{0}) \in R^{n}$ , κ is a category $K_{\infty}$ -function, ℘ is a positive constant, and $℘_{0}$ is a nonnegative constant.

3. Main Results

3.1. Observer Design

Since only

y_{i}

is measurable in the system, we need a state observer to estimate the unmeasurable state. Before designing the observer, the system is rewritten for agent i as:

\begin{matrix} d X_{i, n} & = [A_{i} X_{i, n} + K_{i} y_{i} + \sum_{l = 1}^{n} B_{i, l} [h_{i, l} (X_{i, l})] + B_{i} u_{i} (t) y_{i}] d t \\ + B_{i, l} {F_{i} (X_{i, l} (t), t) d w (t) + \int_{R} G_{i} (X_{i, l} (t), t, ζ) N (d t, d ζ)} \\ y_{i} (t) & = C_{i} X_{i, n} \end{matrix}

(14)

where

A_{i} = [\begin{matrix} - ι_{i, l} \\ ⋮ & I_{n - 1} \\ - ι_{i, n} & 0 & \dots & 0 \end{matrix}]

,

K_{i} = [\begin{matrix} ι_{i, 1} \\ ⋮ \\ ι_{i, n} \end{matrix}]

,

B_{i} = [\begin{matrix} 0 \\ ⋮ \\ 1 \end{matrix}]

,

B_{i, l} = {[0 \dots 1 \dots 0]}^{T}

,

C_{i} = [1 0 \dots 0]

.

There subsists a positive matrix

P_{i}^{T} = P_{i}

which satisfies

A_{i}^{T} P_{i} + P_{i} A_{i} = - 2 Q_{i}

with respect to the given positive matrix

Q_{i}^{T} = Q_{i}

.

Since the nonlinear functions

h_{i, l} (X_{i, l})

are unknown, we can take a lemma as follows.

Lemma 6

([42]). Being an excellent means of approximating continuous functions, the paper makes use of RBFNN to compensate for the nonlinear functions

h_{i, l}

,

i = 1, \dots, n

. The unknown function may be displayed as follows:

h_{i, l} (X_{i, l} | ψ_{i, l}) = ψ_{i, l}^{T} φ_{i, l} (X_{i, l})

(15)

where

X_{i, l}

is the input vector,

1 \leq i \leq n

,

φ_{i, l} (X_{i, l})

is the Gaussian basis function vector, and

ψ_{i, l}

is the ideal constant vector.

The state of the MASs (1) are assumed not to be provided in this article. Therefore, agent i’s states must be estimated by an observer. Under these condition, we can define the observer:

\begin{matrix} {\dot{\hat{X}}}_{i, n} & = A_{i} {\hat{X}}_{i, n} + K_{i} y_{i} + \sum_{l = 1}^{n} B_{i, l} [{\hat{h}}_{i, l} ({\hat{X}}_{i, l} | ψ_{i, l})] + B_{i} u_{i} (t) \\ \hat{y_{i}} & = C_{i} {\hat{X}}_{i, n} \end{matrix}

(16)

where

{\hat{X}}_{i, l}

represents the estimated values of

X_{i, l}

.

According to (14) and (16), we will get:

\begin{matrix} d e_{i} (t) \\ = [A_{i} e_{i} + \sum_{l = 1}^{n} B_{i, l} [h_{i, l} ({\hat{X}}_{i, l}) - {\hat{h}}_{i, l} ({\hat{X}}_{i, l} |ψ_{i, l}) + Δ h_{i, l}]] d t \\ + B_{i, l} (F_{i} (X_{i, l} (t), t) d w (t) + \int_{R} G_{i} (X_{i, l} (t), t, ζ) N (d t, d ζ)) \end{matrix}

(17)

where

Δ h_{i, l} = h_{i, l} (X_{i, l}) - h_{i, l} ({\hat{X}}_{i, l})

and

e_{i} = X_{i, n} - {\hat{X}}_{i, n}

is the error in observing the state of the system (1).

According to lemma 6, we can get that:

{\hat{h}}_{i, l} ({\hat{X}}_{i, l} |ψ_{i, l}) = ψ_{i, l}^{T} φ_{i, l} ({\hat{X}}_{i, l}) .

(18)

The optimal parameter vectors are defined as:

ψ_{i, l}^{*} = arg min_{θ_{i, l} \in Ω_{i, l}} [sup_{{\hat{X}}_{i, l} \in U_{i, l}} |{\hat{h}}_{i, l} ({\hat{X}}_{i, l} |ψ_{i, l}) - h_{i, l} ({\hat{X}}_{i, l})|]

(19)

Define the parameter estimation error

{\tilde{ψ}}_{i, l}

and parameter estimation

ε_{i, l}

as:

\begin{matrix} {\tilde{ψ}}_{i, l} & = ψ_{i, l}^{*} - ψ_{i, l}, l = 1, 2, \dots, n . \\ ε_{i, l} & = h_{i, l} ({\hat{X}}_{i, l}) - {\hat{h}}_{i, l} ({\hat{X}}_{i, l} |θ_{i, l}^{*}) \end{matrix}

(20)

Assumption 1

([43,44]). The errors of the optimal approximation remain bounded and there are positive constants

ε_{i 0}

satisfying

|ε_{i, l}| \leq ε_{i 0}

.

Assumption 2.

There exist some known constants

γ_{i}

which are related as follows:

|h_{i, l} (X_{i, l}) - h_{i, l} ({\hat{X}}_{i, l})| \leq γ_{i, l} ∥X_{i, l} - {\hat{X}}_{i, l}∥ .

(21)

By Equations (16) and (17), we have:

\begin{matrix} d e_{i} (t) & = [A_{i} e_{i} + Δ h_{i} + ε_{i} + \sum_{l = 1}^{n} B_{i, l} [{\tilde{ψ}}_{i, l}^{T} φ_{i, l} ({\hat{X}}_{i, l})]] d t \\ + B_{i, l} (F_{i} (X_{i, l} (t), t) d w (t) \\ + \int_{R} G_{i} (X_{i, l} (t), t, ζ) N (d t, d ζ)) \end{matrix}

(22)

where

ε_{i} = {[ε_{i, 1}, \dots, ε_{i, n}]}^{T}

,

Δ h_{i} = {[Δ h_{1}, \dots, Δ h_{n}]}^{T}

.

The first Lyapunov function be constructed:

V_{0} = \sum_{i = 1}^{N} V_{i, 0} = \sum_{i = 1}^{N} \frac{1}{2} e_{i}^{T} P_{i} e_{i} .

(23)

By Lemma 1, we obtain:

\begin{matrix} L V_{0} & \leq \sum_{i = 1}^{N} {\frac{1}{2} e_{i}^{T} (P_{i} A_{i}^{T} + A_{i} P_{i}) e_{i} + e_{i}^{T} P_{i} (ε_{i} + Δ h_{i}) + \sum_{l = 1}^{n} e_{i}^{T} P_{i} B_{i, l} [{\tilde{ψ}}_{i, l}^{T} φ_{i, l}] + B_{i, l} (\frac{1}{2} t r (F_{i}^{T} P_{i} F_{i}) \\ + \frac{1}{2} \int_{R} (G_{i}^{T} P_{i} G_{i} + 2 e_{i}^{T} P_{i} G_{i}) ϑ (d ζ))} \leq \sum_{i = 1}^{N} {- e_{i}^{T} Q_{i} e_{i} + e_{i}^{T} P_{i} (ε_{i} + Δ h_{i}) \\ + e_{i}^{T} P_{i} \sum_{l = 1}^{n} B_{i, l} {\tilde{ψ}}_{i, l}^{T} φ_{i, l} + B_{i, l} (\frac{1}{2} t r (F_{i}^{T} P_{i} F_{i}) + \frac{1}{2} \int_{R} (G_{i}^{T} P_{i} G_{i} + 2 e_{i}^{T} P_{i} G_{i}) ϑ (d ζ))} \end{matrix}

(24)

Through Lemma 3 and Assumption 2, we obtain:

\begin{matrix} e_{i}^{T} P_{i} (ε_{i} + Δ h_{i}) \\ \leq |e_{i}^{T} P_{i} ε_{i}| + |e_{i}^{T} P_{i} Δ g_{i}| \\ \leq {∥e_{i}∥}^{2} + \frac{1}{2} {∥P_{i} ε_{i}∥}^{2} + \frac{1}{2} {∥P_{i}∥}^{2} \sum_{l = 1}^{n} {|Δ h_{i, l}|}^{2} \\ \leq {∥e_{i}∥}^{2} + \frac{1}{2} {∥e_{i}∥}^{2} {∥P_{i}∥}^{2} \sum_{l = 1}^{n} γ_{i, l}^{2} + \frac{1}{2} {∥P_{i} ε_{i}∥}^{2} \\ \leq {∥e_{i}∥}^{2} (1 + \frac{1}{2} {∥P_{i}∥}^{2} \sum_{l = 1}^{n} γ_{i, l}^{2}) + \frac{1}{2} {∥P_{i} ε_{i}∥}^{2} \end{matrix}

(25)

and

\begin{matrix} e_{i}^{T} P_{i} \sum_{l = 1}^{n} B_{i, l} {\tilde{ψ}}_{i, l}^{T} φ_{i, l} ({\hat{X}}_{i, l}) \\ \leq \frac{1}{2} e_{i}^{T} P_{i}^{T} P_{i} e_{i} + \frac{1}{2} \sum_{l = 1}^{n} {\tilde{ψ}}_{i, l}^{T} φ_{i, l} ({\hat{X}}_{i, l}) φ_{i, l}^{T} ({\hat{X}}_{i, l}) {\tilde{ψ}}_{i, l} \\ \leq \frac{1}{2} λ_{i, max}^{2} (P_{i}) {∥e_{i}∥}^{2} + \frac{1}{2} \sum_{l = 1}^{n} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} \end{matrix}

(26)

where

λ_{i, m a x} (P_{i})

is the maximum eigenvalue of the positive matrix

P_{i}

. According to (24)–(26), we can obtain that:

\begin{matrix} L V_{0} & \leq \sum_{i = 1}^{N} {- q_{i, 0} {∥e_{i}∥}^{2} + \frac{1}{2} {∥P_{i} ε_{i}^{q}∥}^{2} + \frac{1}{2} \sum_{l = 1}^{n} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} \\ + \frac{1}{2} (t r (F_{i}^{T} P_{i} F_{i}) + \int_{R} (G_{i}^{T} P_{i} G_{i} + 2 e_{i}^{T} P_{i} G_{i}) ϑ (d ζ))} \end{matrix}

(27)

Assumption 3

([6,9]). There are two known constants

μ_{1}

,

μ_{2}

, such that the stochastic noise parameters

F_{i}

,

G_{i}

satisfy:

\begin{matrix} t r (F_{i}^{T} (X_{i, l}, t) F_{i} (X_{i, l}, t)) & \leq μ_{1} {∥X_{i, l}∥}^{2} \\ \int_{R} G_{i}^{T} (X_{i, l}, t, ζ) G_{i} (X_{i, l}, t, ζ) ϑ (d ζ) & \leq μ_{2} {∥X_{i, l}∥}^{2} \end{matrix}

(28)

According to Lemma 3 and Assumption 3, we can get that:

\begin{matrix} \frac{1}{2} t r (F_{i}^{T} P_{i} F_{i}) + \frac{1}{2} \int_{R} (G_{i}^{T} P_{i} G_{i} + 2 e_{i}^{T} P_{i} G_{i}) ϑ (d ζ) \\ \leq \frac{μ_{1} λ_{i, max}^{2} (P_{i})}{2} {∥X_{i, l}∥}^{2} + μ_{2} λ_{i, max}^{2} (P_{i}) {∥X_{i, l}∥}^{2} \\ + \frac{1}{2} λ_{i, max}^{2} (P_{i}) {∥e_{i}∥}^{2} \end{matrix}

(29)

so we have:

\begin{matrix} L V_{0} & \leq \sum_{i = 1}^{N} {- q_{i, 0} {∥e_{i}∥}^{2} + \frac{1}{2} {∥P_{i} ε_{i}∥}^{2} + \frac{1}{2} \sum_{l = 1}^{n} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} \\ + μ λ_{i, max} (P_{i}) {∥X_{i, l}∥}^{2}} \end{matrix}

(30)

where

q_{i, 0} = λ_{i, min} (P_{i}) - (1 + \frac{1}{2} {∥P_{i}∥}^{2} \sum_{l = 1}^{n} γ_{i, l}^{2})

and

μ = \frac{μ_{1}}{2} + μ_{2}

.

Under Lemma 5, we can get that:

\begin{matrix} ∥X_{i, l}∥ ⩽ κ (∥X_{i, l} (t_{0})∥) e^{- ℘ (t - t_{0})} + ℘_{0} \end{matrix}

(31)

3.2. Controller Design

Theorem 1.

With Assumptions 1–3, the system (1), and the development of state observer (16), virtual control laws (37)–(39), together with adaptive laws (41)–(43), compensation items (33)–(36) and an adaptive neural network controller through command-filtered method (40), all signals

x_{i, 1}

in the MASs remain semi-global uniformly ultimately bounded (SGUUB) and the errors between outputs and the optimal solution are adequately small.

Proof.

Define the error variables:

\begin{matrix} s_{i, 1} = 2 m_{i} (x_{i, 1} - b_{i}) + \sum_{j \in N_{i}} a_{i j} (x_{i, 1} - x_{j, 1}) \\ s_{i, l} = {\hat{x}}_{i, l} - {\bar{a}}_{i, l} \\ z_{i, l} = s_{i, l} - ξ_{i, l} \end{matrix}

(32)

where

b_{i} = - 1 / 2 m_{i} τ_{i}

,

s_{i, l}

represents the tracking error,

{\bar{a}}_{i, l}

is the command filter output which is relative to the virtual controller

a_{i, l}

, and

ξ_{i, l}

is the error compensation signal, designed as:

\begin{matrix} {\dot{ξ}}_{i, 1} = d_{i} (ξ_{i, 2} + {\bar{a}}_{i, 2} - a_{i, 1}) - c_{i, 1} ξ_{i, 1} - ξ_{i, 1} \end{matrix}

(33)

{\dot{ξ}}_{i, 2} = {\bar{a}}_{i, 3} - a_{i, 2} - d_{i} ξ_{i, 1} + ξ_{i, 3} - c_{i, 2} ξ_{i, 2} - \frac{3}{2} ξ_{i, 2} .

(34)

{\dot{ξ}}_{i, m} = {\bar{a}}_{i, m} - a_{i, m} - ξ_{i, m - 1} + ξ_{i, m + 1} - c_{i, m} ξ_{i, m} - \frac{3}{2} ξ_{i, m}

(35)

{\dot{ξ}}_{i, n} = - ξ_{i, n - 1} - c_{i, n} ξ_{i, n} - \frac{3}{2} ξ_{i, n}

(36)

The structure of the virtual controllers and the control input are as follows:

\begin{matrix} a_{i, 1} & = \frac{1}{d_{i}} (2 m_{i} {\dot{b}}_{i} - c_{i, 1} s_{i, 1} - s_{i, 1} \\ + \sum_{j \in N_{i}} a_{i j} ({\hat{x}}_{j, 2} + θ_{j, 1}^{T} φ_{j, 1})) - θ_{i, 1}^{T} φ_{i, 1} \end{matrix}

(37)

a_{i, 2} = {\dot{\bar{a}}}_{i, 2} - d_{i} s_{i, 1} - c_{i, 2} s_{i, 2} - \frac{3}{2} s_{i, 2} - ψ_{i, 2}^{T} φ_{i, 2} ({\hat{X}}_{i, 2})

(38)

a_{i, m} = {\dot{\bar{a}}}_{i, m} - s_{i, m - 1} - c_{i, m} s_{i, m} - \frac{3}{2} s_{i, m} - ψ_{i, m}^{T} φ_{i, m} ({\hat{X}}_{i, m})

(39)

u_{i} = {\dot{\bar{a}}}_{i, n} - s_{i, n - 1} - c_{i, n} s_{i, n} - \frac{3}{2} s_{i, n} - ψ_{i, n}^{T} φ_{i, n} ({\hat{X}}_{i, n})

(40)

where

d_{i} = 2 m_{i} + \sum_{j \in N_{i}} a_{i j}

, and

c_{i, l}, 1 \leq l \leq n

are the parameters that need to be designed.

Design the adaptive laws as:

\begin{matrix} {\dot{ψ}}_{i, 1} = r_{i, 1} d_{i} φ_{i, 1} z_{i, 1} - {\bar{r}}_{i, 1} ψ_{i, 1} \end{matrix}

(41)

\begin{matrix} {\dot{ψ}}_{j, 1} = - r_{j, 1} φ_{j, 1} z_{j, 1} - {\bar{r}}_{j, 1} ψ_{j, 1} \end{matrix}

(42)

{\dot{ψ}}_{i, l} = r_{i, l} φ_{i, l} z_{i, l} - {\bar{r}}_{i, l} ψ_{i, l} .

(43)

where

2 \leq l \leq n

,

r_{i, 1}

,

{\bar{r}}_{i, 1}

,

r_{j, 1}

,

{\bar{r}}_{j, 1}

,

r_{i, l}

and

{\bar{r}}_{i, l}

are positive design constants.

□

3.2.1. Step 1

Firstly, according to (7), the gradient of the penalty function can be calculated:

\frac{\partial P (x_{1})}{\partial x_{1}} = v e c (\frac{\partial £_{i} (x_{i, 1} (t))}{\partial x_{i, 1}}) + L x_{1}

(44)

where

v e c (\frac{\partial £_{i} (x_{i, 1} (t))}{\partial x_{i, 1}})

is a column vector. The optimal solution

x_{1}^{*}

satisfies:

\begin{matrix} \frac{\partial P (x_{1}^{*})}{\partial x_{1}^{*}} = 0 . \end{matrix}

So, for agent i:

\frac{\partial £_{i} (x_{i, 1}^{*} (t))}{\partial x_{i, 1}^{*}} + \sum_{j \in N_{i}} a_{i j} (x_{i, 1}^{*} - x_{j, 1}^{*}) = 0 .

(45)

Under (9) and (45),we will obtain that:

2 m_{i} (x_{i, 1}^{*} - b_{i}) + \sum_{j \in N_{i}} a_{i j} (x_{i, 1}^{*} - x_{j, 1}^{*}) = 0 .

(46)

Then, combine the (32) with (46), we can get:

\begin{matrix} \frac{\partial P (x_{1})}{\partial x_{i, 1}} & = \frac{\partial £_{i} (x_{i, 1} (t))}{\partial x_{i, 1}} + \sum_{j \in N_{i}} a_{i j} (x_{i, 1} - x_{j, 1}) \\ = 2 m_{i} (x_{i, 1} - b_{i}) + \sum_{j \in N_{i}} a_{i j} (x_{i, 1} - x_{j, 1}) \\ = s_{i, 1} \end{matrix}

(47)

The Lyapunov function be constructed as:

\begin{matrix} V_{1} = V_{0} + \sum_{i = 1}^{N} (\frac{1}{2} z_{i, 1}^{2} + \frac{1}{2 r_{i, 1}} {\tilde{ψ}}_{i, 1}^{T} {\tilde{ψ}}_{i, 1} + \frac{1}{2} \sum_{j \in N_{i}} a_{i j} \frac{1}{r_{j, 1}} {\tilde{ψ}}_{j, 1}^{T} {\tilde{ψ}}_{j, 1}) \end{matrix}

(48)

where

z_{1} = {[z_{1, 1} \dots z_{N, 1}]}^{T}

,

r_{i, 1}

and

r_{j, 1}

are designed parameters. According to (1), (16) and (32), we have:

\begin{matrix} d s_{i, 1} = & ((2 m_{i} + \sum_{j \in N_{i}} a_{i j}) d x_{i, 1} - \sum_{j \in N_{i}} a_{i j} (x_{j, 2} + h_{j, 1} (x_{j, 1}))) d t \\ + (2 m_{i} + \sum_{j \in N_{i}} a_{i j}) (F_{i, 1} d w (t) + \int_{R} G_{i, 1} N (d t, d ζ)) \\ - \sum_{j \in N_{i}} a_{i j} (F_{j, 1} d w (t) + \int_{R} G_{j, 1} N (d t, d ζ)) - 2 m_{i} b_{i} d t \\ = & (d_{i} (z_{i, 2} + ξ_{i, 2} + e_{i, 2} + {\bar{a}}_{i, 2}) - 2 m_{i} b_{i} d t + d_{i} h_{i, 1} (x_{i, 1}) \\ - \sum_{j \in N_{i}} a_{i j} ({\hat{x}}_{j, 2} + e_{j, 2}) - \sum_{j \in N_{i}} a_{i j} h_{j, 1} (x_{j, 1})) d t \\ + d_{i} (F_{i, 1} d w (t) + \int_{R} G_{i, 1} N (d t, d ζ)) \\ - \sum_{j \in N_{i}} a_{i j} (F_{j, 1} d w (t) + \int_{R} G_{j, 1} N (d t, d ζ)) \end{matrix}

(49)

where

d_{i} = 2 m_{i} + \sum_{j \in N_{i}} a_{i j}

. By (48), we can get

{\dot{z}}_{i, 1} = {\dot{s}}_{i, 1} - {\dot{ξ}}_{i, 1}

. Then, according to (48), (49) and Lemma 1, we can obtain:

\begin{matrix} L V_{1} = & L V_{0} + \sum_{i = 1}^{N} {z_{i, 1} (d_{i} (z_{i, 2} + ξ_{i, 2} + e_{i, 2} + {\bar{a}}_{i, 2}) - 2 m_{i} b_{i} d t - {\dot{ξ}}_{i, 1} + d_{i} h_{i, 1} (x_{i, 1}) \\ - \sum_{j \in N_{i}} a_{i j} ({\hat{x}}_{j, 2} + e_{j, 2} + h_{j, 1} (x_{j, 1}))) - \frac{1}{r_{i, 1}} {\tilde{ψ}}_{i, 1}^{T} {\dot{ψ}}_{i, 1} - \sum_{j \in N_{i}} a_{i j} \frac{1}{r_{j, 1}} {\tilde{ψ}}_{j, 1}^{T} {\dot{ψ}}_{j, 1} + \frac{d_{i}}{2} (t r (F_{i, 1}^{T} F_{i, 1}) \\ + \int_{R} G_{i, 1}^{T} G_{i, 1} ϑ (d ζ)) - \frac{\sum_{j \in N_{i}} a_{i j}}{2} (t r (F_{j, 1}^{T} F_{j, 1}) + \int_{R} G_{j, 1}^{T} G_{j, 1} ϑ (d ζ))} \end{matrix}

(50)

Applying Lemma 3, the following inequality holds:

\begin{matrix} z_{i, 1} d_{i} e_{i, 2} + z_{i, 1} \sum_{j \in N_{i}} (- a_{i j} e_{j, 2}) \\ \leq z_{i, 1}^{2} + \frac{d_{i}^{2}}{2} {∥e_{i, 2}∥}^{2} + \frac{{(\sum_{j \in N_{i}} a_{i j})}^{2}}{2} {∥e_{j, 2}∥}^{2} \end{matrix}

(51)

From Assumption 3, there are four of known constants

η_{1}

,

η_{2}

,

η_{3}

,

η_{4}

such that the parameters of stochastic noises

F_{i, 1}

,

G_{i, 1}

,

F_{j, 1}

,

G_{j, 1}

satisfy:

\begin{matrix} t r (F_{i, 1}^{T} (X_{i, l}, t) F_{i, 1} (X_{i, l}, t)) & ⩽ η_{1} {∥X_{i, l}∥}^{2} \\ \int_{R} G_{i, 1}^{T} (X_{i, l}, t, ζ) G_{i, 1} (X_{i, l}, t, ζ) ϑ (d ζ) & ⩽ η_{2} {∥X_{i, l}∥}^{2} \\ t r (F_{j, 1}^{T} (X_{j, l}, t) F_{j, 1} (X_{j, l}, t)) & ⩽ η_{3} {∥X_{j, l}∥}^{2} \\ \int_{R} G_{j, 1}^{T} (X_{j, l}, t, ζ) G_{j, 1} (X_{j, l}, t, ζ) ϑ (d ζ) & ⩽ η_{4} {∥X_{j, l}∥}^{2} \end{matrix}

(52)

Based on the first virtual controller

a_{i, 1}

(37), the error compensation signal

ξ_{i, 1}

(33) and update law

ψ_{i, 1}

(41),

ψ_{j, 1}

(42), we will get that:

\begin{matrix} L V_{1} & \leq - q_{1} | | e | |^{2} + \sum_{i = 1}^{N} {\frac{1}{2} | | P_{i} ε_{i} | |^{2} + \frac{1}{2} \sum_{l = 1}^{n} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} + μ λ_{i, max} (P_{i}) | | X_{i, l} | |^{2} \\ + d_{i} z_{i, 1} z_{i, 2} - c_{i, 1} z_{i, 1}^{2} + \frac{{\bar{r}}_{i, 1}}{r_{i, 1}} {\tilde{ψ}}_{i, 1}^{T} ψ_{i, 1} + \sum_{j \in N_{I}} a_{i j} \frac{{\bar{r}}_{j, 1}}{r_{j, 1}} {\tilde{ψ}}_{j, 1}^{T} ψ_{j, 1} + D_{i, 1}} \end{matrix}

(53)

From Young’s inequality, we will get:

\begin{matrix} {\tilde{ψ}}_{i, 1}^{T} ψ_{i, 1} & \leq - \frac{1}{2} {\tilde{ψ}}_{i, 1}^{T} {\tilde{ψ}}_{i, 1} + \frac{1}{2} ψ_{i, 1}^{* T} ψ_{i, 1}^{*} \\ {\tilde{ψ}}_{j, 1}^{T} ψ_{j, 1} & \leq - \frac{1}{2} {\tilde{ψ}}_{j, 1}^{T} {\tilde{ψ}}_{j, 1} + \frac{1}{2} ψ_{j, 1}^{* T} ψ_{j, 1}^{*} \end{matrix}

(54)

Therefore, rewrite (53) as:

\begin{matrix} L V_{1} & \leq - q_{1} | | e | |^{2} + \sum_{i = 1}^{N} {\frac{1}{2} | | P_{i} ε_{i} | |^{2} + \frac{1}{2} \sum_{l = 1}^{n} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} + μ λ_{i, max} (P_{i}) | | X_{i, l} | |^{2} \\ + d_{i} z_{i, 1} z_{i, 2} - c_{i, 1} z_{i, 1}^{2} - \frac{{\bar{r}}_{i, 1}}{2 r_{i, 1}} {\tilde{ψ}}_{i, 1}^{T} {\tilde{ψ}}_{i, 1} - \sum_{j \in N_{I}} a_{i j} \frac{{\bar{r}}_{j, 1}}{2 r_{j, 1}} {\tilde{ψ}}_{j, 1}^{T} {\tilde{ψ}}_{j, 1} + D_{i, 1}} \end{matrix}

(55)

where

e = {[e_{1}^{T}, \dots, e_{n}^{T}]}^{T}

,

q_{1} = \sum_{i = 1}^{N} (q_{i, 0} - \frac{1}{2} d_{i}^{2} - \frac{{(\sum_{j \in N_{i}} a_{i j})}^{2}}{2})

and

D_{i, 1} = \frac{d_{i}}{2} (η_{1} + η_{2}) {∥X_{i, l}∥}^{2} + \frac{\sum_{j \in N_{i}} a_{i j}}{2} (η_{3} + η_{4}) {∥X_{j, l}∥}^{2} + \frac{{\bar{r}}_{i, 1}}{2 r_{i, 1}} ψ_{i, 1}^{* T} ψ_{i, 1}^{*} + \sum_{j \in N_{I}} a_{i j} \frac{{\bar{r}}_{j, 1}}{2 r_{j, 1}} ψ_{j, 1}^{* T} ψ_{j, 1}^{*}

.

3.2.2. Step 2

In accordance with (32), we take

z_{i, 2} = s_{i, 2} - ξ_{i, 2}

. After (16) and (18), we have:

\begin{matrix} {\dot{z}}_{i, 2} & = {\dot{s}}_{i, 2} - {\dot{ξ}}_{i, 2} \\ = z_{i, 3} + ξ_{i, 3} + {\bar{a}}_{i, 3} - {\dot{\bar{a}}}_{i, 2} - {\dot{ξ}}_{i, 2} + ι_{i, 2} e_{i, 1} \\ + ψ_{i, 2}^{T} φ_{i, 2} + {\tilde{ψ}}_{i, 2}^{T} φ_{i, 2} + ε_{i, 2} + Δ h_{i, 2} \end{matrix}

(56)

Construct the Lyapunov function:

\begin{matrix} V_{2} = V_{1} + \sum_{i = 1}^{N} (\frac{1}{2} z_{i, 2}^{2} + \frac{1}{2 r_{i, 2}} {\tilde{ψ}}_{i, 2}^{T} {\tilde{ψ}}_{i, 2}) \end{matrix}

(57)

Then, we have:

\begin{matrix} L V_{2} = L V_{1} + \sum_{i = 1}^{N} (z_{i, 2} {\dot{z}}_{i, 2} + \frac{1}{r_{i, 2}} {\tilde{ψ}}_{i, 2}^{T} {\dot{\tilde{ψ}}}_{i, 2}) \end{matrix}

(58)

where

r_{i, 2}

is a positive designed parameter. Based (56) and (58), we can obtain that:

\begin{matrix} L V_{2} & = L V_{1} + \sum_{i = 1}^{N} {z_{i, 2} {\dot{z}}_{i, 2} + \frac{1}{r_{i, 2}} {\tilde{ψ}}_{i, 2}^{T} {\dot{\tilde{ψ}}}_{i, 2}} \\ = L V_{1} + \sum_{i = 1}^{N} {z_{i, 2} (z_{i, 3} + ξ_{i, 3} + {\bar{a}}_{i, 3} - {\dot{\bar{a}}}_{i . 2} \\ - {\dot{ξ}}_{i, 2} + ι_{i, 2} e_{i, 1} + ψ_{i, 2}^{T} φ_{i, 2} + {\tilde{ψ}}_{i, 2}^{T} φ_{i, 2} + ε_{i, 2} \\ + Δ h_{i, 2}) - \frac{1}{r_{i, 2}} {\tilde{ψ}}_{i, 2}^{T} {\dot{ψ}}_{i, 2}} \end{matrix}

(59)

According to Lemma 3, we obtain:

ι_{i, 2} z_{i, 2} e_{i, 1} \leq \frac{1}{2} z_{i, 2}^{2} + \frac{1}{2} ι_{i, 2}^{2} {∥e_{i, 1}∥}^{2}

(60)

z_{i, 2} (ε_{i, 2} + Δ h_{i, 2}) \leq z_{i, 2}^{2} + \frac{1}{2} {∥ε_{i, 2}∥}^{2} + \frac{1}{2} γ_{i, 2}^{2} {∥e_{i, 2}∥}^{2}

(61)

Substituting the second virtual controller

a_{i, 2}

(38), the error compensation signal

ξ_{i, 2}

(34) and update laws

ψ_{i, 2}

(43) and the inequalities (60), (61) into (59), we will calculate that:

\begin{matrix} L V_{2} & \leq L V_{1} + \sum_{i = 1}^{N} {z_{i, 2} z_{i, 3} - c_{i, 2} z_{i . 2}^{2} - d_{i} z_{i, 1} z_{i, 2} \\ + \frac{1}{2} (ι_{i, 2}^{2} {∥e_{i, 1}∥}^{2} + {∥ε_{i, 2}∥}^{2} + γ_{i, 2}^{2} {∥e_{i, 2}∥}^{2}) + \frac{{\bar{r}}_{i, 2}}{r_{i, 2}} {\tilde{ψ}}_{i, 2}^{T} ψ_{i, 2}} \\ \leq - q_{2} {∥e∥}^{2} + \sum_{i = 1}^{N} {\frac{1}{2} {∥P_{i} ε_{i}∥}^{2} + \frac{1}{2} \sum_{l = 1}^{n} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} + μ λ_{i, max} (P_{i}) {∥X_{i, l}∥}^{2} \\ + z_{i, 2} z_{i, 3} - \sum_{l = 1}^{2} c_{i, l} z_{i . l}^{2} + \sum_{l = 1}^{2} \frac{{\bar{r}}_{i, l}}{r_{i, l}} {\tilde{ψ}}_{i, l}^{T} ψ_{i, l} + D_{i, 2}} \end{matrix}

(62)

From Young’s inequality, we will get:

\begin{matrix} {\tilde{ψ}}_{i, l}^{T} ψ_{i, l} \leq - \frac{1}{2} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} + \frac{1}{2} ψ_{i, l}^{* T} ψ_{i, l}^{*} \end{matrix}

(63)

Therefore, rewrite (62) as:

\begin{matrix} L V_{2} & \leq - q_{2} {∥e∥}^{2} + \sum_{i = 1}^{N} {\frac{1}{2} {∥P_{i} ε_{i}∥}^{2} + \frac{1}{2} \sum_{l = 1}^{n} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} + μ λ_{i, max} (P_{i}) {∥X_{i, l}∥}^{2} \\ + z_{i, 2} z_{i, 3} - \sum_{l = 1}^{2} c_{i, l} z_{i . l}^{2} - \sum_{l = 1}^{2} \frac{{\bar{r}}_{i, l}}{2 r_{i, l}} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} + D_{i, 2}} \end{matrix}

(64)

where

q_{2} = q_{1} - \sum_{i = 1}^{N} \frac{1}{2} (ι_{i, 2}^{2} + γ_{i, 2}^{2})

,

D_{i, 2} = D_{i, 1} + \frac{1}{2} {∥ε_{i, 2}∥}^{2} + \frac{{\bar{r}}_{i, 2}}{2 r_{i, 2}} ψ_{i, 2}^{* T} ψ_{i, 2}^{*}

.

3.2.3. Step m

According to (32), we can get:

\begin{matrix} {\dot{z}}_{i, m} & = z_{i, m + 1} + ξ_{i, m + 1} + {\bar{a}}_{i, m + 1} - {\dot{\bar{a}}}_{i, m} - {\dot{ξ}}_{i, m} + ι_{i, m} e_{i, 1} \\ + ψ_{i, m}^{T} φ_{i, m} + {\tilde{ψ}}_{i, m}^{T} φ_{i, m} + ε_{i, m} + Δ h_{i, m} \end{matrix}

(65)

Put forward the Lyapunov function:

\begin{matrix} V_{m} = V_{m - 1} + \sum_{i = 1}^{N} (\frac{1}{2} z_{i, m}^{2} + \frac{1}{2 r_{i, m}} {\tilde{ψ}}_{i, m}^{T} {\tilde{ψ}}_{i, m}) \end{matrix}

(66)

where

r_{i, m}

is a positive designed parameter. After derivation:

\begin{matrix} L V_{m} = L V_{m - 1} + \sum_{i = 1}^{N} (z_{i, m} {\dot{z}}_{i, m} + \frac{1}{r_{i, m}} {\tilde{ψ}}_{i, m}^{T} {\dot{\tilde{ψ}}}_{i, m}) \end{matrix}

(67)

Substituting (66) into (67), we can get:

\begin{matrix} L V_{m} & = L V_{m - 1} + \sum_{i = 1}^{N} {z_{i, m} (z_{i, m + 1} + ξ_{i, m + 1} + {\bar{a}}_{i, m + 1} \\ - {\dot{\bar{a}}}_{i, m} - {\dot{ξ}}_{i, m} + ι_{i, m} e_{i, 1} + ψ_{i, m}^{T} φ_{i, m} + {\tilde{ψ}}_{i, m}^{T} φ_{i, m} \\ + ε_{i, m} + Δ h_{i, m}) + \frac{1}{r_{i, m}} {\tilde{ψ}}_{i, m}^{T} {\dot{\tilde{ψ}}}_{i, m}} \end{matrix}

(68)

According to Lemma 3, we obtain:

ι_{i, m} e_{i, 1} z_{i, m} \leq \frac{1}{2} z_{i, m}^{2} + \frac{1}{2} ι_{i, m}^{2} {∥e_{i, 1}∥}^{2}

(69)

z_{i, m} (ε_{i, m} + Δ h_{i, m}) \leq z_{i, m}^{2} + \frac{1}{2} {∥ε_{i, m}∥}^{2} + \frac{1}{2} γ_{i, m}^{2} {∥e_{i, m}∥}^{2}

(70)

On the basis of the m-th virtual controller

a_{i, m}

(39), the error compensation signal

ξ_{i, m}

(35) the update laws

ψ_{i, m}

(43) the above inequalities, we will obtain that:

\begin{matrix} L V_{m} & \leq - q_{m} {∥e∥}^{2} + \sum_{i = 1}^{N} {\frac{1}{2} {∥P_{i} ε_{i}∥}^{2} + \frac{1}{2} \sum_{l = 1}^{n} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} + μ λ_{i, max} (P_{i}) {∥X_{i, l}∥}^{2} \\ + z_{i, m} z_{i, m + 1} - \sum_{l = 1}^{m} c_{i, l} z_{i, l}^{2} + \sum_{l = 1}^{m} \frac{{\bar{r}}_{i, l}}{r_{i, l}} {\tilde{ψ}}_{i, l}^{T} ψ_{i, l} + D_{i, m}} \end{matrix}

(71)

From Young’s inequality, we will get:

\begin{matrix} {\tilde{ψ}}_{i, l}^{T} ψ_{i, l} \leq - \frac{1}{2} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} + \frac{1}{2} ψ_{i, l}^{* T} ψ_{i, l}^{*} \end{matrix}

(72)

Therefore, rewrite (71) as:

\begin{matrix} L V_{m} & \leq - q_{m} {∥e∥}^{2} + \sum_{i = 1}^{N} {\frac{1}{2} {∥P_{i} ε_{i}∥}^{2} + \frac{1}{2} \sum_{l = 1}^{n} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} + μ λ_{i, max} (P_{i}) {∥X_{i, l}∥}^{2} \\ + z_{i, m} z_{i, m + 1} - \sum_{l = 1}^{m} c_{i, l} z_{i, l}^{2} - \sum_{l = 1}^{m} \frac{{\bar{r}}_{i, l}}{2 r_{i, l}} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} + D_{i, m}} \end{matrix}

(73)

where

q_{m} = q_{m - 1} - \sum_{i = 1}^{N} \frac{1}{2} (ι_{i, m}^{2} + γ_{i, m}^{2})

,

D_{i, m} = D_{i, m - 1} + \frac{1}{2} {∥ε_{i, m}∥}^{2} + \frac{{\bar{r}}_{i, m}}{2 r_{i, m}} ψ_{i, m}^{* T} ψ_{i, m}^{*}

.

3.2.4. Step n

According to (32), we can get:

\begin{matrix} {\dot{z}}_{i, n} & = u_{i} + ι_{i, n} e_{i, 1} + ψ_{i, n}^{T} φ_{i, n} + {\tilde{ψ}}_{i, n}^{T} φ_{i, n} \\ + ε_{i, n} + Δ h_{i, n} - {\dot{\bar{a}}}_{i . n} - {\dot{ξ}}_{i, n} \end{matrix}

(74)

The Lyapunov function can be constructed as:

\begin{matrix} V_{n} = V_{n - 1} + \sum_{i = 1}^{N} {\frac{1}{2} z_{i, n}^{2} + \frac{1}{2 r_{i, n}} {\tilde{ψ}}_{i, n}^{T} {\tilde{ψ}}_{i, n}} \end{matrix}

(75)

where

r_{i, n}

is a positive designed parameter. Under (74) and (75), we obtain:

\begin{matrix} L V_{n} & = L V_{n - 1} + \sum_{i = 1}^{N} {z_{i, n} {\dot{z}}_{i, n} + \frac{1}{r_{i, n}} {\tilde{ψ}}_{i, n}^{T} {\dot{\tilde{ψ}}}_{i, n}} \\ = L V_{n - 1} + \sum_{i = 1}^{N} {z_{i, n} (u_{i} + ι_{i, n} e_{i, 1} + ψ_{i, n}^{T} ψ_{i, n} \\ + {\tilde{ψ}}_{i, n}^{T} ψ_{i, n} + ε_{i, n} + Δ h_{i, n} - {\dot{\bar{a}}}_{i . n} - {\dot{ξ}}_{i, n}) \\ + \frac{1}{r_{i, n}} {\tilde{ψ}}_{i, n}^{T} {\dot{\tilde{ψ}}}_{i, n}} \end{matrix}

(76)

Under Lemma 3, we will taken the inequality relation as follows:

ι_{i, n} e_{i, 1} z_{i, n} \leq \frac{1}{2} z_{i, n}^{2} + \frac{1}{2} ι_{i, n}^{2} {∥e_{i, 1}∥}^{2}

(77)

z_{i, n} (ε_{i, n} + Δ h_{i, n}) \leq z_{i, n}^{2} + \frac{1}{2} {∥ε_{i, n}∥}^{2} + \frac{1}{2} γ_{i, n}^{2} {∥e_{i, n}∥}^{2}

(78)

Under (77) and (78), substituting the n-th control controller

u_{i}

(40), the error compensation signal

ξ_{i, n}

(36), and the update laws

ψ_{i, n}

(43) into (76), we will get that:

\begin{matrix} L V_{n} & \leq - q_{n} {∥e∥}^{2} + \sum_{i = 1}^{N} {\frac{1}{2} {∥P_{i} ε_{i}∥}^{2} + \frac{1}{2} \sum_{l = 1}^{n} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} + μ λ_{i, max} (P_{i}) {∥X_{i, l}∥}^{2} \\ - \sum_{l = 1}^{n} c_{i, l} z_{i, l}^{2} + \sum_{l = 1}^{n} \frac{{\bar{r}}_{i, l}}{r_{i, l}} {\tilde{ψ}}_{i, l}^{T} ψ_{i, l} + D_{i, n}} \end{matrix}

(79)

From Young’s inequality, we will get:

\begin{matrix} {\tilde{ψ}}_{i, l}^{T} ψ_{i, l} \leq - \frac{1}{2} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} + \frac{1}{2} ψ_{i, l}^{* T} ψ_{i, l}^{*} \end{matrix}

(80)

Therefore, rewrite (77) as:

\begin{matrix} L V_{n} & \leq - q_{n} {∥e∥}^{2} + \sum_{i = 1}^{N} {- \sum_{l = 1}^{n} c_{i, l} z_{i, l}^{2} + \frac{1}{2} \sum_{l = 1}^{n} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} - \sum_{l = 1}^{n} \frac{{\bar{r}}_{i, l}}{2 r_{i, l}} {\tilde{ψ}}_{i, l}^{T} {\tilde{ψ}}_{i, l} \\ + μ λ_{i, max} (P_{i}) {∥X_{i, l}∥}^{2} + \frac{1}{2} {∥P_{i} ε_{i}∥}^{2} + D_{i, n}} \end{matrix}

(81)

where

q_{n} = q_{n - 1} - \sum_{i = 1}^{N} \frac{1}{2} (ι_{i, n}^{2} + γ_{i, n}^{2})

,

D_{i, n} = D_{i, n - 1} + \frac{1}{2} {∥ε_{i, n}∥}^{2} + \frac{{\bar{r}}_{i, n}}{2 r_{i, n}} ψ_{i, n}^{* T} ψ_{i, n}^{*}

.

3.3. Stability Analysis

From (81), we will note that

I = D_{i, n - 1} + \frac{1}{2} {∥ε_{i, n}∥}^{2} + μ λ_{i, max} (P_{i}) {∥X_{i, l}∥}^{2} + \frac{{\bar{r}}_{i, m}}{2 r_{i, m}} ψ_{i, m}^{* T} ψ_{i, m}^{*} + \frac{1}{2} {∥P_{i} ε_{i}∥}^{2}

, and according to Lemma 5, we will obtain

∥X_{i, l}∥ ⩽ κ (∥X_{i, l} (t_{0})∥) e^{- ℘ (t - t_{0})} + ℘_{0}

.

Define

ℑ = min {2 q_{n} / λ_{i, max} (P_{i}), 2 \sum_{l = 1}^{n} c_{i, l}, \sum_{l = 1}^{n} (\frac{{\bar{r}}_{i, l}}{r_{i, l}} + \frac{1}{2})}

, Equation (81) then becomes:

L V (x, t) \leq - ℑ V (x, t) + I .

(82)

Therefore, we can further write (82) as:

d (E [V (x, t)]) / d t = E [L [V]] ⩽ - ℑ E [V] + I

(83)

where

E [•]

is the probability expectation. Let

ℑ > I / K

,

E [V] = K

; later, we will obtain that

d (E [V]) / d t < 0

. Accordingly,

V ⩽ K

which is an invariant set, which means that if

E [V (x (t_{0}), t_{0})] ⩽ K

, later

E [V (x, t)] ⩽ K

for all time

t \in [t_{0}, t_{ρ}]

. Hence, (83) with regards to any

V (x (t_{0}), t_{0}) < K

, and all time

t \in [t_{0}, t_{ρ}]

. Moreover, it holds that:

E [V (x (t_{ρ}), t_{ρ})] ⩽ E V (x (t_{0}), t_{0}) + I (t_{ρ} - t_{0}) ⩽ \overset{⌣}{c} e^{t},

(84)

where

\overset{⌣}{c} = (E V (x (t_{0}), t_{0}) + I) / e^{t_{0}}

, then, we will get that:

\begin{matrix} E V (x (t), t) & \leq e^{- ℑ (t - t_{0})} E V (x (t_{0}), t_{0}) + I / ℑ - I / ℑ e^{- ℑ (t - t_{0})} \\ \leq E V (x (t_{0}), t_{0}) + I / ℑ \end{matrix}

(85)

According to Lemma 4, we can rewrite (85) as:

0 ⩽ E [L V (x, t)] ⩽ e^{- ℑ t} V (x (t_{0}), t_{0}) + I / ℑ .

(86)

Based on (86), it can be inferred that

E [V (x, t)]

is ultimately bounded by

I / ℑ

, we obtain:

lim_{t \to \infty} E [V (x, t)] ⩽ I / ℑ .

(87)

After that, we can get that all the variables, such as

x_{i, n}

, e,

s_{i, l}

,

z_{i, l}

, the virtual control

a_{i, l}

, and the control inputs

u_{i}

are bounded in probability on the basis of the Lyapunov function. Likewise, we can sum up that all signals of MASs (1) remain SGUUB in the closed-loop system and the errors between outputs and the optimal value is sufficiently small.

Remark 2.

Compared to [45], in which the DOP is investigated for MASs with the nonlinear function, the high-order MASs in this paper contain stochastic noise, which implies that the control protocol designed will be incorporated into many commercial engineering applications, such as marine surface vehicles, unmanned aerial vehicles, and wheeled multi-mobile robots.

4. Simulation

To illustrate the proposed method, simulations are performed in this section. Figure 1 shows the block diagram of the designed control system.

Through this example, the MAS consisting of five agents is considered, whose topology of the communication graph as shown in Figure 2. The model is as follows:

\{\begin{matrix} d x_{i, 1} & = [x_{i, 2} + h_{i, 1} (X_{i, 1})] d t + F_{i, 1} d w + \int_{R} G_{i, 1} N (d t, d ζ) \\ d x_{i, 2} & = [u_{i} + h_{i, 2} (X_{i, 2})] d t + F_{i, 2} d w + \int_{R} G_{i, 2} N (d t, d ζ) \\ y_{i} & = x_{i, 1} \end{matrix}

(88)

where

i = 1, 2, 3, 4, 5

, the Brownian motion term and Poisson jump term are

F = - \frac{π}{2} x_{1}^{2}

and

G = (x_{2} - x_{1}) ζ

, respectively, and the initial states are selected as

x_{1} (0) = [0.1, 0.1]

,

x_{2} (0) = [0.2, 0.2]

,

x_{3} (0) = [0.3, 0.3]

,

x_{4} (0) = [0.4, 0.4]

, and

x_{5} (0) = [0.5, 0.5]

. The unknown functions in system (88) are:

\begin{matrix} \begin{matrix} h_{1, 1} & = h_{2, 1} = h_{3, 1} = h_{4, 1} = h_{5, 1} = 0 \\ h_{1, 2} & = x_{1, 1} - 0.25 x_{1, 2} - x_{1, 1}^{3} \\ h_{2, 2} & = x_{2, 1} - 0.25 x_{2, 2} - x_{2, 1}^{3} + 0.1 {(x_{2, 1}^{2} + x_{2, 2}^{2})}^{1 / 2} \\ h_{3, 2} & = x_{3, 1} - 0.25 x_{3, 2} - x_{3, 1}^{3} + 0.2 {(x_{3, 1}^{2} + 2 x_{3, 2}^{2})}^{1 / 2} \\ h_{4, 2} & = x_{4, 1} - 0.25 x_{4, 2} - x_{4, 1}^{3} + 0.2 {(2 x_{4, 1}^{2} + 2 x_{4, 2}^{2})}^{1 / 2} \\ h_{5, 2} & = x_{5, 1} - x_{5, 2} + 0.5 {(x_{5, 1}^{2} + x_{5, 2}^{2})}^{1 / 2} \end{matrix} \end{matrix}

Each of the five agents has the following local objective functions:

\begin{matrix} \begin{matrix} £_{1} (x_{1, 1}) = x_{1, 1}^{2} - 2 x_{1, 1} + 2 \\ £_{2} (x_{2, 1}) = x_{2, 1}^{2} - 4 x_{2, 1} + 6 \\ £_{3} (x_{3, 1}) = x_{3, 1}^{2} - 6 x_{3, 1} + 12 \\ £_{4} (x_{4, 1}) = x_{4, 1}^{2} - 8 x_{4, 1} + 20 \\ £_{5} (x_{5, 1}) = x_{5, 1}^{2} - 10 x_{5, 1} + 30 \end{matrix} \end{matrix}

Then, the penalty function is defined as (7), and the following conditions must be met to obtain the optimum solution for DOP:

\begin{matrix} \begin{matrix} \frac{\partial P (x_{1}^{*})}{\partial x_{1}^{*}} = 0 . \end{matrix} \end{matrix}

According to Equations (33), (36), (37) and (40)–(43), design the parameters update laws, the error compensation signal, the virtual control law and the control input as follows:

\begin{matrix} a_{i, 1} & = \frac{1}{d_{i}} (2 m_{i} {\dot{b}}_{i} - c_{i, 1} s_{i, 1} - s_{i, 1} + \sum_{j \in N_{i}} a_{i j} ({\hat{x}}_{j, 2} \\ + ψ_{j, 1}^{T} φ_{j, 1})) - ψ_{i, 1}^{T} φ_{i, 1} \\ {\dot{ξ}}_{i, 1} & = d_{i} (ξ_{i, 2} + {\bar{a}}_{i, 2} - a_{i, 1}) - c_{i, 1} ξ_{i, 1} - ξ_{i, 1} \\ {\dot{ψ}}_{i, 1} & = r_{i, 1} d_{i} φ_{i, 1} z_{i, 1} - {\bar{r}}_{i, 1} ψ_{i, 1} \\ {\dot{ψ}}_{j, 1} & = - r_{j, 1} φ_{j, 1} z_{j, 1} - {\bar{r}}_{j, 1} ψ_{j, 1} \\ u_{i} & = {\dot{\bar{a}}}_{i, 2} - d_{i} s_{i, 1} - c_{i, 2} s_{i, 2} - \frac{3}{2} s_{i, 2} - ψ_{i, 2}^{T} φ_{i, 2} ({\hat{X}}_{i, 2}) \\ {\dot{ξ}}_{i, 2} & = - d_{i} ξ_{i, 1} - c_{i, 2} ξ_{i, 2} - \frac{3}{2} ξ_{i, 2} \\ {\dot{ψ}}_{i, 2} & = r_{i, 2} φ_{i, 2} z_{i, 2} - {\bar{r}}_{i, 2} ψ_{i, 2} \end{matrix}

(89)

In addressing the entirety of the design parameters,

c_{i, 1}

,

c_{i, 2}

stand out as pivotal to the system’s performance. Its direct correlation with the system’s convergence accuracy establishes it as the paramount tuning element. Concurrently, the configuration of neural network parameters

r_{i, 1}

,

r_{i, 2}

,

r_{j, 1}

,

{\bar{r}}_{i, 1}

,

{\bar{r}}_{i, 2}

, and

{\bar{r}}_{j, 1}

merits equal attention, given their significant influence on the control inputs. Furthermore, the observer’s parameters

ι_{1, 1}

,

ι_{2, 1}

,

ι_{3, 1}

,

ι_{4, 1}

,

ι_{5, 1}

,

ι_{1, 2}

,

ι_{2, 2}

,

ι_{3, 2}

,

ι_{4, 2}

, and

ι_{5, 2}

are equally essential, serving as the critical intermediate variables within the framework of the virtual control law.

Regarding the control parameters

c_{i, 1}

,

c_{i, 2}

, they exert a direct influence on the system’s control input. While higher values can enhance the rate of convergence, excessively high values might result in overly large control inputs, which could negatively impact the system’s overall performance. Consequently, we have selected moderate values for

c_{i, 1}

and

c_{i, 2}

to ensure a swift and stable response from the system. The parameters are chosen as

c_{i, 1} = 4

,

c_{i, 2} = 3

.

When adjusting the adaptive law parameters

r_{i, 1}

,

r_{i, 2}

, and

r_{j, 1}

, we found that increasing the values of

r_{i, 1}

,

r_{i, 2}

, and

r_{j, 1}

can amplify the system’s output jitter, while decreasing them might prevent the neural network from outputting effectively and adapting to changes in nonlinear functions. Therefore, we selected moderate values

r_{i, 1} = r_{i, 2} = r_{j, 1} = 1

to balance the system’s stability with the neural network’s responsiveness. Additionally, increasing the values of

{\bar{r}}_{i, 1}

,

{\bar{r}}_{i, 2}

, and

{\bar{r}}_{j, 1}

may speed up convergence, but excessively high values could cause the neural network to converge directly to zero, thereby losing functionality. Therefore, we have chosen an appropriate value for

{\bar{r}}_{i, 1}

, and

{\bar{r}}_{i, 2}

,

{\bar{r}}_{j, 1}

to ensure that the neural network can operate effectively. Here,

{\bar{r}}_{i, 1} = {\bar{r}}_{i, 2} = {\bar{r}}_{j, 1} = 80

.

With regard to the observer parameters, we recognize that an increase in these parameters can reduce the observer error. However, we must take into account the constraints of practical applications, so we cannot increase these parameters indefinitely. We have chosen an appropriate value to ensure the performance of the observer and the stability of the system. So the design parameters for the observer are selected as

ι_{1, 1} = ι_{2, 1} = ι_{3, 1} = ι_{4, 1} = ι_{5, 1} = 5

,

ι_{1, 2} = ι_{2, 2} = ι_{3, 2} = ι_{4, 2} = ι_{5, 2} = 15

and the initial states are designed as

{\hat{x}}_{1} = [0.2, 0.2]

,

{\hat{x}}_{2} = [0.3, 0.3]

,

{\hat{x}}_{3} = [0.4, 0.4]

,

{\hat{x}}_{4} = [0.5, 0.5]

, and

{\hat{x}}_{5} = [0.6, 0.6]

.

In this example, we give the first agent

b_{1} = 1

, the second agent

b_{2} = 2

, the third agent

b_{3} = 3

, the fourth agent

b_{4} = 4

, and the fifth agent

b_{5} = 5

. Through calculation, we can see that the optimal value of the five agents is

x_{i, 1}^{*} = 3

. From the Figure 3, there is evidence that the five agents will eventually converge to the optimal value.

The simulation results in this simulation are shown in Figure 3, Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8. Figure 3 indicates that the output of each agent are consistent with the optimal solution, and a certain extent of error in the figure. Figure 4 shows the tracking error trajectories

s_{i, 1}

, which clearly shows that

s_{i, 1}

converge to zero quickly. Through Figure 5, we put the output of the agent 1 as an example to make a comparison between the true and estimated values. The control input

u_{i}

is presented in Figure 6. Figure 7 displays the value of the penalty function, and we may conclude that the proposed control protocol will minimize the penalty function. Figure 8 displays the value of gradient, which can be clearly established the gradient value is converging well towards zero.

Under the above simulation results, the proposed algorithm will guarantee that all agents reach the optimal solution in the MASs with dynamic uncertainty and stochastic noise. Tracking errors are converging towards a tiny region of source in a short period of time, and all agents eventually tend to optimize. Controllers designed with this approach not only make sure that all agents provide excellent tracking performance for the system containing nonlinear uncertainties and random noise, it also considers the DOP. At the same time, the value of the penalty function successfully trades down to the minimal value.

5. Conclusions

This paper studies the DOP for high-order MASs with nonlinear functions and Lévy stochastic noise. The penalty function is built using the properties of the undirected communication graph and GOF to ensure that all agents achieve the optimal value of DOP while reaching consensus. To avoid “complexity explosion”, we take the command-filtered into account to design the adaptive NNs backstepping control, and the error compensation mechanism is applied to remove the influence of the filtering errors. The stability of the system is analyzed by combining the generalized Itô’s formula with the Lyapunov function method. Simulation results demonstrate that the developed algorithm can make the outputs of all agents reach the optimal value with bounded errors.

This study holds significant implications in practical application fields. Compared the research in to [9], which is confined to the analysis of master–slave systems involving only two agents, this paper expands the scope of research by applying stochastic systems to MASs and successfully addressing the DOP, enabling the research outcomes to be applied to a more diverse range of practical scenarios. However, this paper does not cover MASs with full-state constraints, as discussed in [46]. To further deepen the research, this paper plans to draw on the research methods of [47] and adopt an innovative Policy Iteration (PI) algorithm to explore the online adaptive optimal control problem of nonlinear multi-agent systems.

Author Contributions

Conceptualization, H.Y. and J.Y.; Methodology, Q.S.; Software, Q.S.; Resources, H.Y. and J.Y.; Writing—original draft, Q.S.; Writing—review & editing, H.Y. and J.Y.; Supervision, J.Y.; Funding acquisition, H.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Liang, X.L.; Hou, M.Z.; Duan, G.R. Output feedback stabilization of switched stochastic nonlinear systems under arbitrary switchings. Int. J. Autom. Comput. 2013, 10, 571–577. [Google Scholar] [CrossRef]
Fang, H.; Tu, Y.; Wang, H.; He, S.; Liu, F.; Ding, Z.; Cheng, S.S. Fuzzy-based adaptive optimization of unknown discrete-time nonlinear Markov jump systems with off-policy reinforcement learning. IEEE Trans. Fuzzy Syst. 2022, 30, 5276–5290. [Google Scholar] [CrossRef]
Luo, S.; Deng, F.; Chen, W.H. Unified dwell time–based stability and stabilization criteria for switched linear stochastic systems and their application to intermittent control. Int. J. Robust Nonlinear Control 2018, 28, 2014–2030. [Google Scholar] [CrossRef]
Li, M.; Deng, F. Necessary and sufficient conditions for consensus of continuous-time multiagent systems with markovian switching topologies and communication noises. IEEE Trans. Cybern. 2019, 50, 3264–3270. [Google Scholar] [CrossRef] [PubMed]
Imzegouan, C. Stability for Markovian switching stochastic neural networks with infinite delay driven by Lévy noise. Int. J. Dyn. Control 2019, 7, 547–556. [Google Scholar] [CrossRef]
Li, M.; Deng, F.; Liu, X. Almost sure stability of second-order nonlinear stochastic system with Lévy noise via sliding mode control. Int. J. Robust Nonlinear Control 2019, 29, 6053–6063. [Google Scholar] [CrossRef]
Do, K. Backstepping control design for stochastic systems driven by Lévy processes. Int. J. Control 2022, 95, 68–80. [Google Scholar] [CrossRef]
Zhou, L.; Zhu, Q.; Wang, Z.; Zhou, W.; Su, H. Adaptive exponential synchronization of multislave time-delayed recurrent neural networks with Lévy noise and regime switching. IEEE Trans. Neural Netw. Learn. Syst. 2016, 28, 2885–2898. [Google Scholar] [CrossRef] [PubMed]
Yuan, J.; Zhang, C.; Chen, T. Command filtered adaptive neural network synchronization control of nonlinear stochastic systems with levy noise via event-triggered mechanism. IEEE Access 2021, 9, 146195–146202. [Google Scholar] [CrossRef]
Yang, J.; Zhou, W.; Shi, P.; Yang, X.; Zhou, X.; Su, H. Adaptive synchronization of delayed Markovian switching neural networks with Lévy noise. Neurocomputing 2015, 156, 231–238. [Google Scholar] [CrossRef]
Wang, T.; Hu, M.; Zhao, Y. Consensus of linear multi-agent systems with stochastic noises and binary-valued communications. Int. J. Robust Nonlinear Control 2020, 30, 4863–4879. [Google Scholar] [CrossRef]
Wang, Z.; Sun, H.; Zhang, H.; Liu, X. Bounded consensus control for stochastic multi-agent systems with additive noises. Neurocomputing 2020, 408, 72–79. [Google Scholar] [CrossRef]
Zhao, B.; Peng, Y.; Deng, F. Consensus tracking for general linear stochastic multi-agent systems: A sliding mode variable structure approach. IET Control Theory Appl. 2017, 11, 2910–2915. [Google Scholar] [CrossRef]
Zhao, Y.; Yu, H.; Xia, X. Event-triggered adaptive consensus for stochastic multi-agent systems with saturated input and partial state constraints. Inf. Sci. 2022, 603, 16–41. [Google Scholar] [CrossRef]
Guo, X.; Liang, H.; Pan, Y. Observer-based adaptive fuzzy tracking control for stochastic nonlinear multi-agent systems with dead-zone input. Appl. Math. Comput. 2020, 379, 125269. [Google Scholar] [CrossRef]
Zhao, L.; Jia, Y. Finite-time consensus for second-order stochastic multi-agent systems with nonlinear dynamics. Appl. Math. Comput. 2015, 270, 278–290. [Google Scholar] [CrossRef]
Wang, F.; Zhang, Y.; Zhang, L.; Zhang, J.; Huang, Y. Finite-time consensus of stochastic nonlinear multi-agent systems. Int. J. Fuzzy Syst. 2020, 22, 77–88. [Google Scholar] [CrossRef]
Zheng, Y.; Liu, Q. A review of distributed optimization: Problems, models and algorithms. Neurocomputing 2022, 483, 446–459. [Google Scholar] [CrossRef]
Li, S.; Nian, X.; Deng, Z. Distributed optimization of general linear multi-agent systems with external disturbance. J. Frankl. Inst. 2021, 358, 5951–5970. [Google Scholar] [CrossRef]
Kia, S.S.; Cortés, J.; Martínez, S. Periodic and event-triggered communication for distributed continuous-time convex optimization. In Proceedings of the 2014 American Control Conference, Portland, OR, USA, 4–6 June 2014; IEEE: Piscataway, NJ, USA, 2014. [Google Scholar]
Deng, Z.; Wang, X.; Hong, Y. Distributed optimisation design with triggers for disturbed continuous-time multi-agent systems. IET Control Theory Appl. 2017, 11, 282–290. [Google Scholar] [CrossRef]
Xu, G.H.; Guan, Z.H.; He, D.X.; Chi, M.; Wu, Y.H. Distributed tracking control of second-order multi-agent systems with sampled data. J. Frankl. Inst. 2014, 351, 4786–4801. [Google Scholar] [CrossRef]
Yi, X.; Yao, L.; Yang, T.; George, J.; Johansson, K.H. Distributed optimization for second-order multi-agent systems with dynamic event-triggered communication. In Proceedings of the 2018 IEEE Conference on Decision and Control (CDC), Miami, FL, USA, 17–19 December 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 3397–3402. [Google Scholar]
Mo, L.; Hu, H.; Yu, Y.; Ren, G. Distributed optimization without boundedness of gradients for second-order multi-agent systems over unbalanced network. Inf. Sci. 2021, 565, 177–195. [Google Scholar] [CrossRef]
Xie, Y.; Lin, Z. Global optimal consensus for higher-order multi-agent systems with bounded controls. Automatica 2019, 99, 301–307. [Google Scholar] [CrossRef]
Qin, Z.; Liu, T.; Jiang, Z.P. Distributed optimization of nonlinear uncertain systems: An adaptive backstepping design. IFAC-PapersOnLine 2020, 53, 5653–5658. [Google Scholar] [CrossRef]
Tang, Y.; Deng, Z.; Hong, Y. Optimal output consensus of high-order multiagent systems with embedded technique. IEEE Trans. Circuits Syst. I Regul. Pap. 2018, 49, 1768–1779. [Google Scholar] [CrossRef] [PubMed]
Yao, Y.; Yuan, J.; Chen, T.; Yang, X.; Yang, H. Distributed convex optimization of bipartite containment control for high-order nonlinear uncertain multi-agent systems with state constraints. Math. Biosci. Eng. 2023, 20, 17296–17323. [Google Scholar] [CrossRef]
Wu, W.; Tong, S. Observer-based fixed-time adaptive fuzzy consensus DSC for nonlinear multiagent systems. IEEE Trans. Cybern. 2022, 53, 5881–5891. [Google Scholar] [CrossRef] [PubMed]
Luo, S.; Xu, X.; Liu, L.; Feng, G. Leader-following consensus of heterogeneous linear multiagent systems with communication time-delays via adaptive distributed observers. IEEE Trans. Cybern. 2021, 52, 13336–13349. [Google Scholar] [CrossRef]
Zhang, Y.W.; Zhao, X.W.; Zhang, J.; Lai, Q. Adaptive Leader-Following Consensus of Multiagent Systems with Unknown Disturbances and Switching Topologies. IEEE Trans. Circuits Syst. II Express Briefs 2023, 70, 2944–2948. [Google Scholar] [CrossRef]
Cao, X.; Zhang, C.; Zhao, D.; Sun, B.; Li, Y. Event-triggered consensus control of continuous-time stochastic multi-agent systems. Automatica 2022, 137, 110022. [Google Scholar] [CrossRef]
Xiao, G.; Wang, J.; Liao, Y. Finite-time adaptive consensus of stochastic multi-agent systems with node-based and edge-based adaptive law design methods. Int. J. Adapt. Control Signal Process. 2022, 36, 2920–2937. [Google Scholar] [CrossRef]
Cao, Y.; Li, B.; Wen, S.; Huang, T. Consensus tracking of stochastic multi-agent system with actuator faults and switching topologies. Inf. Sci. 2022, 607, 921–930. [Google Scholar] [CrossRef]
Ma, Q.; Meng, Q.; Xu, S. Distributed optimization for uncertain high-order nonlinear multiagent systems via dynamic gain approach. IEEE Trans. Syst. Man Cybern. Syst. 2023, 53, 4351–4357. [Google Scholar] [CrossRef]
Liu, D.; Shen, M.; Jing, Y.; Wang, Q.G. Distributed optimization of nonlinear multiagent systems via event-triggered communication. IEEE Trans. Circuits Syst. II Express Briefs 2022, 70, 2092–2096. [Google Scholar] [CrossRef]
Li, G.; Wang, X.; Li, S. Finite-time distributed approximate optimization algorithms of higher order multiagent systems via penalty-function-based method. IEEE Trans. Syst. Man Cybern. Syst. 2022, 52, 6174–6182. [Google Scholar] [CrossRef]
Li, Z.; Duan, Z. Cooperative Control of Multi-Agent Systems: A Consensus Region Approach; CRC Press: Boca Raton, FL, USA, 2017. [Google Scholar]
Yuan, J.; Zhang, C.; Qiu, Z.; Wang, F. Chaos and control of ships with water on deck under periodic excitation with Lévy noise. Fluct. Noise Lett. 2017, 16, 1750027. [Google Scholar] [CrossRef]
Liu, Y.; Zhu, Q.; Zhao, N.; Wang, L. Adaptive fuzzy backstepping control for nonstrict feedback nonlinear systems with time-varying state constraints and backlash-like hysteresis. Inf. Sci. 2021, 574, 606–624. [Google Scholar] [CrossRef]
Zhao, X.; Wang, X.; Zhang, S.; Zong, G. Adaptive neural backstepping control design for a class of nonsmooth nonlinear systems. IEEE Trans. Syst. Man Cybern. Syst. 2018, 49, 1820–1831. [Google Scholar] [CrossRef]
Li, Y.; Tong, S. Adaptive neural networks decentralized FTC design for nonstrict-feedback nonlinear interconnected large-scale systems against actuator faults. IEEE Trans. Neural Netw. Learn. Syst. 2016, 28, 2541–2554. [Google Scholar] [CrossRef]
Chen, B.; Liu, X.; Liu, K.; Lin, C. Direct adaptive fuzzy control of nonlinear strict-feedback systems. Automatica 2009, 45, 1530–1535. [Google Scholar] [CrossRef]
Li, K.; Li, Y. Adaptive neural network finite-time dynamic surface control for nonlinear systems. IEEE Trans. Neural Netw. Learn. Syst. 2020, 32, 5688–5697. [Google Scholar] [CrossRef] [PubMed]
Zhou, T.; Wu, H.; Cao, J. Distributed optimization in predefined-time for multi-agent systems over a directed network. Inf. Sci. 2022, 615, 743–757. [Google Scholar] [CrossRef]
Liang, X.; Ge, S.S.; Li, D. Coordinated tracking control of multi agent systems with full-state constraints. J. Frankl. Inst. 2023, 360, 12030–12054. [Google Scholar] [CrossRef]
He, S.; Fang, H.; Zhang, M.; Liu, F.; Ding, Z. Adaptive optimal control for a class of nonlinear systems: The online policy iteration approach. IEEE Trans. Neural Netw. Learn. Syst. 2019, 31, 549–558. [Google Scholar] [CrossRef]

Figure 1. The block diagram of the designed control system.

Figure 2. Topology of the communication graph.

Figure 3. The system state

x_{i, 1} (i = 1, \dots, 5)

.

Figure 3. The system state

x_{i, 1} (i = 1, \dots, 5)

.

Figure 4. The error

s_{i, 1} (i = 1, \dots, 5)

.

Figure 4. The error

s_{i, 1} (i = 1, \dots, 5)

.

Figure 5.

x_{1, 2}

and its estimation.

Figure 5.

x_{1, 2}

and its estimation.

Figure 6. Control input

u_{i}

.

Figure 6. Control input

u_{i}

.

Figure 7. The value of the penalty function.

Figure 8. The value of the gradient.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yang, H.; Sun, Q.; Yuan, J. Distributed Adaptive Optimization Algorithm for High-Order Nonlinear Multi-Agent Stochastic Systems with Lévy Noise. Entropy 2024, 26, 834. https://doi.org/10.3390/e26100834

AMA Style

Yang H, Sun Q, Yuan J. Distributed Adaptive Optimization Algorithm for High-Order Nonlinear Multi-Agent Stochastic Systems with Lévy Noise. Entropy. 2024; 26(10):834. https://doi.org/10.3390/e26100834

Chicago/Turabian Style

Yang, Hui, Qing Sun, and Jiaxin Yuan. 2024. "Distributed Adaptive Optimization Algorithm for High-Order Nonlinear Multi-Agent Stochastic Systems with Lévy Noise" Entropy 26, no. 10: 834. https://doi.org/10.3390/e26100834

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Distributed Adaptive Optimization Algorithm for High-Order Nonlinear Multi-Agent Stochastic Systems with Lévy Noise

Abstract

1. Introduction

2. Preliminaries

2.1. Graph Theory

2.2. Multi-Agent System

2.3. Problem Formulation

3. Main Results

3.1. Observer Design

3.2. Controller Design

3.2.1. Step 1

3.2.2. Step 2

3.2.3. Step m

3.2.4. Step n

3.3. Stability Analysis

4. Simulation

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI