Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Next Article in Journal
Robust Variable Selection with Exponential Squared Loss for the Spatial Error Model
Previous Article in Journal
S-Embedding of Lie Superalgebras and Its Implications for Fuzzy Lie Algebras
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Player Non-Cooperative Game Strategy of a Nonlinear Stochastic System with Time-Varying Parameters

1
College of Mathematics and Systems Science, Shandong University of Science and Technology, Qingdao 266590, China
2
College of Electronic and Information Engineering, Shandong University of Science and Technology, Qingdao 266590, China
3
College of Electrical Engineering and Automation, Shandong University of Science and Technology, Qingdao 266590, China
*
Author to whom correspondence should be addressed.
Submission received: 25 September 2023 / Revised: 27 November 2023 / Accepted: 30 November 2023 / Published: 20 December 2023

Abstract

:
This paper discusses the multi-player non-cooperative game of nonlinear stochastic time-varying systems described by Itô-type differential equations in a finite time interval. Multi-player non-cooperative game problems are represented by multi-objective Pareto (MOP) control problems to describe the fact that each player has their own goals. By applying Hamilton–Jacobi inequalities (HJIs), the criterion of upper bounds of the MOP boundary is obtained for nonlinear stochastic systems, and the corresponding strategies are designed for such games, so the MOP problem is transformed into a HJI-constrained MOP problem. In order to overcome the difficulty of solving HJIs, a global linearization method is proposed to approximate the nonlinear systems. By the proposed global linearization method, multi-player non-cooperative game problems are transformed into Riccati equation-constrained MOP problems, and the approximate solutions of HJI-constrained MOP problems are obtained. Finally, a practical example is given to illustrate the effectiveness of the proposed method.

1. Introduction

Non-cooperative games are widely applied in various fields of natural and social sciences, such as economics, finance, and engineering technology [1,2,3]. Compared to cooperative games, the behavior and decision making of participants in non-cooperative games are independent of each other. Under proper conditions, there exists a Nash equilibrium point in the non-cooperative games [4]. To resolve the problem of multiple Nash equilibria for non-cooperative games, the concept of Pareto undominated Nash equilibrium is raised and is studied by many researchers. For example, by combining the subset of non-cooperative players with that of aggregate ones, Goldman and Shier proposed weaker conditions to solve the equilibrium-point solutions of the game and given the approximate versions to obtain the equilibrium points [5]. Using a modified fixed point theorem, Zhao, Hong and Li achieved an existence theorem of extended Nash equilibria of the nonmonetized noncooperative game [6]. Ye and Hu employed an extremum-seeking method to non-cooperative games and offered a non-model-based seeking scheme to achieve the time-varying Nash equilibrium [7]. Using the Hamilton–Jacobi–Bellman equations, Aduba and Won proposed the necessary and sufficient conditions for the Nash equilibrium solution of nonzero-sum dynamic statistical Nash games with N-player m-th cost cumulant optimization [8]. Bressan and Nguyen studied a kind of non-cooperative game in the case of infinite time horizon with exponentially discounted quadratic costs [9]. Dianetti and Ferrari established the existence of Nash equilibria for a class of N-player stochastic games with a monotone-follower type, in which the Markovian setting is not necessary and the approximation of the equilibrium values is also provided [10]. Furthermore, theories of non-cooperative or cooperative games such as vector-valued games [11], a Pareto undominated mixed-strategy [2], Nash–Stackelberg–Nash games [12] and multio-bjective games [13] have also been developed, and many excellent results have been obtained.
In practice, many economic, financial, biological or engineering systems can be described by ordinary differential equations [14,15,16,17,18] and stochastic differential equations [19,20,21,22,23,24,25] in continuous time and by difference equations [26,27,28] in a discrete-time case. For example, in order to demonstrate the interplay among interest rate, investment demand and price, the following ordinary different equation are proposed:
ξ ˙ ( t ) = ζ ( t ) + ( η ( t ) a ) ξ ( t ) η ˙ ( t ) = 1 b η ( t ) ξ 2 ( t ) ζ ˙ ( t ) = ξ ( t ) c ζ ( t ) ,
where ξ ( t ) is the interest rate of the financial market, η ( t ) is the investment demand of market participants, and ζ ( t ) is the reasonable market price. By solving the following equation,
ζ + ( η a ) ξ = 0 1 b η ξ 2 = 0 ξ c ζ = 0 ,
it’s easy to find that one of the equilibria of System (1) is
[ ξ e q , η e q , ζ e q ] = [ ± 1 b c a b , 1 c + a , 1 c 1 b c a b ] .
In the real word, the equilibrium points of an economic systems or financial systems are changing at different times, which implies that parameters a , b and c in System (1) change over time. Moreover, taking into account inherent stochastic fluctuations and external disturbances, the following stochastic dynamic system is constructed in reference [23]:
d ξ ( t ) = [ ζ ( t ) + ( η ( t ) a ) ξ ( t ) + u 1 ( t ) + b 12 u 2 ( t ) + b 31 u 3 ( t ) + v 1 ( t ) ] d t + ξ ( t ) ] d W ( t ) d η ( t ) = [ 1 b η ( t ) ξ 2 ( t ) + b 21 u 1 ( t ) + u 2 ( t ) + b 23 u 3 ( t ) + v 2 ( t ) ] d t + η ( t ) d W ( t ) d ζ ( t ) = [ ξ ( t ) c ζ ( t ) + b 31 u 1 ( t ) + b 32 u 2 ( t ) + u 3 ( t ) + v 3 ( t ) ] d t + ζ ( t ) d W ( t ) ,
where u i ( t ) denotes strategies ( i = 1 , 2 , 3 ), v ( t ) = [ v 1 ( t ) , v 2 ( t ) , v 3 ( t ) ] T is the exogenous disturbance, W ( t ) is the one-dimensional standard Brownian motion. In this model, the influence of government control strategy, investment strategy of bank consortium and investment strategy of the public on the stability of the financial system are considered to obtain the non-cooperative investment strategies for financial markets. According to the global linearization technique [20,29], the multi-objective problem (MOP) with the Hamilton–Jacobi inequality (HJI) constraint is transformed into MOP with an equivalent Linear matrix inequality (LMI) constraint to solve the non-cooperative game problems discussed in [23] in which the financial systems are described by time-invariant stochastic differential equations. These results are based on the help of Riccati equations or Riccati inequalities for linear systems [30,31].
Against the above -mentioned background, the time-varying systems are used to describe such systems as follows:
d ξ ( t ) = [ ζ ( t ) + ( η ( t ) a ( t ) ) ξ ( t ) + u 1 ( t ) + b 12 u 2 ( t ) + b 13 u 3 ( t ) + v 1 ( t ) ] d t + ξ ( t ) ] d W ( t ) d η ( t ) = [ 1 b ( t ) η ( t ) ξ 2 ( t ) + b 21 u 1 ( t ) + u 2 ( t ) + b 3 u 3 ( t ) + v 2 ( t ) ] d t + η ( t ) d W ( t ) d ζ ( t ) = [ ξ ( t ) c ( t ) ζ ( t ) + b 31 u 1 ( t ) + b 32 u 2 ( t ) + u 3 ( t ) + v 3 ( t ) ] d t + ζ ( t ) d W ( t ) .
So conditions such as Hamilton–Jacobi equations or inequalities and Riccati equations for corresponding non-cooperative games of time-varying systems are also different from those of the time-invariant ones, which needs further study. This paper follows the line of [23] to discuss the non-cooperative games for nonlinear stochastic time-varying systems, and, in comparison to [23], this paper has the following novelty and innovation: (1) The systems discussed in this paper have more general forms which consider the time-varying case. (2) Because the Riccati equations, the Hamilton-Jacobi equations and matrices used in this paper are in the time-varying form, the global linear methods are more complex than the methods given by [23]. This paper is organized as follows: In Section 2, some theories of stochastic differential equations are reviewed, and the non-cooperative games for stochastic time-varying systems are described. In Section 3, the multi-party non-cooperative investment strategies for nonlinear time-varying stochastic financial systems are designed with the help of solving Hamilton–Jacobi inequalities for nonlinear systems and Riccati equations for linear systems. In Section 4, a global linearization method for a nonlinear stochastic financial system with time-varying parameters is proposed, which transforms an HJI-constrained MOP to an LMI-constrained MOP. In Section 5, a practical financial example is illustrated to verify the effectiveness of the proposed method.
Notations: T : finite time interval [ 0 , T ] , 0 < T < ; n v : dimension of vector v; A T or x T : transport of matrix A or vector x; | x | : Euclidean norm of vector x; E [ X ] : expectation of random variable X; A > 0 ( A 0 ) : symmetric real matrix A is a (semi-) positive definite matrix.

2. Preliminary

Suppose ( Ω , F , P ) is a complete probability space. An economic or financial system with m-person decision makers can be described by the following time-varying stochastic differential equations:
d ξ ( t ) = [ f ( t , ξ ( t ) ) + i = 1 m g i ( t , ξ ( t ) ) u i ( t ) + h ( t , ξ ( t ) ) v ( t ) ] d t + [ l ( t , ξ ( t ) ) + h 1 ( t , ξ ( t ) ) v ( t ) ] d W ( t ) ξ ( 0 ) = ξ 0 R n , t T = [ 0 , T ] , 0 < T < ,
where f : ( T , R n ) R n , g i : ( T , R n ) R m × n u i , h : ( T , R n ) R n × n v , l : ( T , R n ) R n , h 1 : ( T , R n ) R n × n v x = [ ξ 1 ( t ) , , ξ i ( t ) , ξ n ( t ) ] T denotes the state vector of economic or financial system, u i ( t ) denotes the ith agent’s strategies or policies, v ( t ) is the external disturbance, W ( t ) is a one-dimensional normal Brownian motion.
The goal of the ith agent is to maximize their profit returns or to minimize their return risk. The ith agent’s game strategy u i is decided under the background of the strategies of other m 1 agents with u 1 ( t ) , , u i 1 ( t ) , u i + 1 ( t ) , u m ( t ) which are unknown to the ith agent. Since external disturbance v ( t ) also plays an important role and is not available to every participant including the ith agent, its impact on the financial system must be considered from the worst case. Each agent tends to minimize the worst-case effects of external distractions.
In a non-cooperative financial system, for the ith agent, it is difficult to obtain all other strategies and external distractions. Thus, each game strategy u i ( t ) of an ith player tries to minimize the worst case effects of both a combination of competitive strategies and external distractions to achieve the desired target, ξ d i . So the goals of the ith agent can be modeled by
r i * = min u i ( t ) max u i ( t ) E 0 T [ ( ξ ( t ) ξ d i ) T Q i ( ξ ( t ) ξ d i ) + u i T ( t ) R i u i ( t ) ] d t E ( ξ ( 0 ) ξ d i ) T ( ξ ( 0 ) ξ d i ) + E 0 T | u i | 2 ( t ) d t + E 0 T | v ( t ) | 2 d t i = 1 , 2 , , m ,
where u i ( t ) = [ u 1 T ( t ) , , u i 1 T ( t ) , u i + 1 T ( t ) , , u m T ( t ) ] T , Q i > 0 and R i > 0 are weight matrices.
Denote ξ ^ i ( t ) = ξ i ( t ) ξ d i and ξ ^ ( t ) = ξ ( t ) ξ d i , then the goals of (6) can be simplified as
r i * = min u i ( t ) max u i ( t ) E 0 T [ ξ ^ ( t ) T Q i ξ ^ ( t ) + u i T ( t ) R i u i ( t ) ] d t E [ | ξ ^ ( 0 ) | 2 ] + E 0 T | u i | 2 ( t ) d t + E 0 T | v ( t ) | 2 d t i = 1 , 2 , , m .
under constraint
d ξ ^ ( t ) = [ f ^ ( t , ξ ^ ( t ) ) + i = 1 m g ^ i ( t , ξ ^ ( t ) ) u i ( t ) + h ^ ( t , ξ ^ ( t ) ) v ( t ) ] d t + [ l ^ ( t , ξ ^ ( t ) ) + h ^ 1 ( t , ξ ^ ( t ) ) v ( t ) ] d W ( t ) ξ ^ ( 0 ) = ξ 0 ξ d i R n , t T = [ 0 , T ] , 0 < T < ,
where φ ^ ( t , ξ ^ ) = φ ( t , ξ ^ + ξ d i ) , φ = f , g , h , l , h 1 .
So the m-agent noncooperative game strategy ( u 1 * , u 2 * , , u m * ) of multi-objective Pareto problems (5)–(6) or (7)–(8) can be seen as a Nash equilibrium if and only if [4]
( r 1 * , r 2 * , , r m * ) ( r 1 , r 2 * , , r m * ) ( r 1 * , , r i * , , r m * ) ( r 1 * , , r i 1 * , r i , r i + 1 * , , r m * ) ( r 1 * , r 2 * , , ρ m 1 * , r m * ) ( r 1 * , r 2 * , , ρ m 1 * , r m ) ,
where ( r 1 * , r 2 * , , r m * ) is the Pareto boundary of multi-objective problems (MOPs) (5) and (6), i.e.,
( r 1 * , , r i * , , r m * ) = min u ( r 1 , , r i , , r m ) .
Because stochastic Systems (5) and (8) are presented by Itô-type differential equations, the existence and uniqueness of solutions of such systems can be obtained by the following Lemma 1.
Lemma 1.
Suppose the Itô-type stochastic differential equations (SDEs) driven by a one-dimensional Brownian motion are given by
d ξ ( t ) = b ( t , ξ ( t ) ) d t + σ ( t , ξ ( t ) ) d W ( t ) ξ ( 0 ) = ξ 0 R n , t [ 0 , T ] .
Coefficients  b ( t , ξ )  and  σ ( t , ξ )  satisfy Lipschitzian conditions
| b ( t , ξ 1 ) b ( t , ξ 1 ) | + | σ ( t , ξ 1 ) σ ( t , ξ 2 ) | C 1 | ξ 1 ξ 2 | , ξ 1 , ξ 2 R n ,
and
| b ( t , ξ ) | + | σ ( t , ξ ) | C 2 | ξ | , ξ R n ,
  for some constant   C 1  and  C 2 . Then, SDE (11) exists as a unique solution.
The following Itô formula of SDEs will be used in the proofs of our main results.
Lemma 2.
Suppose V ( t , ξ ) C 1 , 2 ( [ 0 , T ] , R n ) and { ξ ( t ) , t [ 0 , T ] } is the solution of SDE (11), then the following Itô formula is obtained:
d V ( t , ξ ( t ) ) = [ V t ( t , ξ ( t ) ) + V ξ T ( t , ξ ( t ) ) f ( t , ξ ( t ) ) + 1 2 σ T ( t , ξ ( t ) ) V x x σ ( t , ξ ( t ) ) ] d t + V ξ T ( t , ξ ( t ) ) σ ( t , x ) d W ( t ) ,
where V t and V ξ are first-order partial derivatives w.r.t. t and x, respectively, with
V ξ = V ξ 1 , V ξ 2 , , V ξ n T ,
and V x x is the second-order partial derivatives w.r.t. x defined as
V x x = 2 V ξ 1 2 2 V ξ 1 ξ 2 2 V ξ 1 ξ n 2 V ξ 2 ξ 1 2 V ξ 2 2 2 V ξ 2 ξ n 2 V ξ n ξ 1 2 V ξ n ξ 2 2 V ξ n 2 .

3. Multi-Party Non-Cooperative Investment Strategies for Time-Varying Stochastic Financial Systems

To simplify the calculation, we let the desired target ξ d i = 0 , i = 1 , 2 , , m . So the solutions of (5) of (8) are equivalent, i.e., ξ ^ i ( t ) = ξ ( t ) . The MOP problems of (5)–(6) and (7)–(8) convert to the following multi-objective problem:
r i * = min u i ( t ) max u i ( t ) E 0 T [ ξ ( t ) T Q i ξ ( t ) + u i T ( t ) R i u i ( t ) ] d t E ξ ( 0 ) T ξ ( 0 ) + E 0 T | u i | 2 ( t ) d t + E 0 T | v ( t ) | 2 d t , i = 1 , 2 , , m .
subject to
d ξ ( t ) = [ f ( t , ξ ( t ) ) + i = 1 m g i ( t , ξ ( t ) ) u i ( t ) + h ( t , ξ ( t ) ) v ( t ) ] d t + [ l ( t , ξ ( t ) ) + h 1 ( t , ξ ( t ) ) v ( t ) ] d W ( t ) ξ ( 0 ) = ξ 0 R n , f ( t , 0 ) 0 , l ( t , 0 ) 0 , t [ 0 , T ] , 0 < T < .
In System (14), x = 0 is the equilibrium point. Namely, only the stabilization issue is considered in traditional non-cooperative game strategies. In order to obtain the MOP solution ( r 1 * , r 2 * , , r m * ) and strategies ( u 1 * , u 2 * , , u m * ) , for every i, we first offer r i > 0 as the upper bound of r i * , i.e., r i * r i with u i = u i * . So the following inequality can be obtained when u i = u i * :
max u i ( t ) E 0 T [ ξ ( t ) T Q i ξ ( t ) + u i T ( t ) R i u i ( t ) ] d t E ξ ( 0 ) T ξ ( 0 ) + E 0 T | u i | 2 ( t ) d t + E 0 T | v ( t ) | 2 d t r i .
So, for every u i ( t ) and v ( t ) , there exists
E 0 T [ ξ ( t ) T Q i ξ ( t ) + u i T ( t ) R i u i ( t ) ] d t E ξ ( 0 ) T ξ ( 0 ) + E 0 T | u i | 2 ( t ) d t + E 0 T | v ( t ) | 2 d t r i
which equals to
E 0 T [ ξ ( t ) T Q i ξ ( t ) + u i T ( t ) R i u i ( t ) ] d t r i E | ξ ( 0 ) | 2 + 0 T | u i | 2 ( t ) d t + 0 T | v ( t ) | 2 d t .
Let
J i ( u , v ) = E 0 T [ ξ ( t ) T Q i ξ ( t ) + u i T ( t ) R i u i ( t ) r i | u i ( t ) | 2 r i 0 T | v ( t ) | 2 ] d t r i | ξ ( 0 ) | 2 .
Then, r i is the upper bound of r i * with u i = u i * if and only if, for every u i and v, there exists
J i ( u i * , u i , v ) 0 .
Theorem 1.
For a given r i > 0 , if there exists V ( t , x ) C 1 , 2 ( [ 0 , T ] , R n ) , it satisfies the following Hamilton–Jacobi inequality (HJI):
H ( V ) ( t , x ) : = V t + V x T f ( t , x ) + 1 2 l T ( t , x ) V x x l ( t , x ) + x T Q i x 1 4 V x T g i ( t , x ) R i 1 g i T ( t , x ) V ξ + 1 4 r i V x T g i ( t , x ) g i T ( t , x ) V ξ + 1 4 ( V x T h ( t , x ) + l T ( t , x ) V x x h 1 ( t , x ) ) × r i I 1 2 h 1 T ( t , x ) V x x h 1 ( t , x ) 1 × ( h T ( t , x ) V x T + h 1 T ( t , x ) V x x l ( t , x ) ) 0 , r i I 1 2 h 1 T ( t , x ) V x x h 1 ( t , x ) > 0 , ( t , x ) [ 0 , T ] × R n , V ( T , x ) 0 , V ( 0 , x ) r i | x | 2 x R n ,
then r i is a upper bound of ρ * of MOP problems (13) and (14) with
u i * ( t ) = 1 2 R i 1 g i T ( t , ξ ( t ) ) V ξ ( t , ξ ( t ) ) .
Proof. 
Applying the Itô formula to V ( t , ξ ( t ) ) , we obtain
d V ( t , ξ ( t ) ) = [ V t ( t , ξ ( t ) ) + V x T ( t , ξ ( t ) ) ( f ( t , ξ ( t ) ) + i = 1 m g i ( t , ξ ( t ) ) u i ( t ) + h ( t , ξ ( t ) ) v ( t ) ) + 1 2 ( v T ( t ) h 1 T ( t , ξ ( t ) ) + l T ( t , ξ ( t ) ) ) V x x ( t , ξ ( t ) ) ( l ( t , ξ ( t ) ) + h 1 ( t , ξ ( t ) ) v ( t ) ) ] d t + V x T ( t , ξ ( t ) ) ( l ( t , ξ ( t ) ) + h 1 ( t , ξ ( t ) ) v ( t ) ) d W ( t ) .
Integrating on [ 0 , T ] , then taking expectation on both sides, we obtain
E [ V ( T , ξ ( t ) ) ] E [ V ( 0 , ξ ( 0 ) ) ] = E 0 T [ V t ( t , ξ ( t ) ) + V x T ( t , ξ ( t ) ) ( f ( t , ξ ( t ) ) + i = 1 m g i ( t , ξ ( t ) ) u i ( t ) + h ( t , ξ ( t ) ) v ( t ) ) + 1 2 ( v T ( t ) h 1 T ( t , ξ ( t ) ) + l T ( t , ξ ( t ) ) ) × V x x ( t , ξ ( t ) ) ( l ( t , ξ ( t ) ) + h 1 ( t , ξ ( t ) ) v ( t ) ) ] d t .
Since V ( T , x ) 0 , we have
0 = E 0 T [ V t ( t , ξ ( t ) ) + V x T ( t , ξ ( t ) ) ( f ( t , ξ ( t ) ) + i = 1 m g i ( t , ξ ( t ) ) u i ( t ) + h ( t , ξ ( t ) ) v ( t ) ) + 1 2 ( v T ( t ) h 1 T ( t , ξ ( t ) ) + l T ( t , ξ ( t ) ) ) V x x ( t , ξ ( t ) ) × ( l ( t , ξ ( t ) ) + h 1 ( t , ξ ( t ) ) v ( t ) ) ] d t E [ V ( 0 , ξ ( 0 ) ) ] .
So
J i ( u , v ) = E 0 T [ V t ( t , ξ ( t ) ) + V x T ( t , ξ ( t ) ) ( f ( t , ξ ( t ) ) + 1 2 l T ( t , ξ ( t ) ) V x x ( t , ξ ( t ) ) l ( t , ξ ( t ) ) + x T ( t ) Q i ξ ( t ) + V x T ( t , ξ ( t ) ) g i ( t , ξ ( t ) ) u i ( t ) + u i T ( t ) R i u i ( t ) + V x T ( t , ξ ( t ) ) j i m g j ( t , ξ ( t ) ) u j ( t ) r i | u i | 2 + V x T ( t , ξ ( t ) ) h ( t , ξ ( t ) ) v ( t ) + 1 2 v T ( t ) h 1 T ( t , ξ ( t ) ) V x x ( t , ξ ( t ) ) h 1 ( t , ξ ( t ) ) v ( t ) + l T ( t , ξ ( t ) ) V x x ( t , ξ ( t ) ) h 1 ( t , ξ ( t ) ) v ( t ) r i | v ( t ) | 2 ] d t E [ V ( 0 , ξ ( 0 ) ) ] .
Completing the square w.r.t. u i ( t ) , u i ( t ) and v ( t ) , respectively, on the integral part of the right side, we have
J i ( u i * , u i , v ) = E 0 T [ H ( V ) ( t , ξ ( t ) ) + u i + 1 2 R i 1 g i T ( t , ξ ( t ) ) V ξ ( t , ξ ( t ) ) R i 2 u i 1 2 r i g i T ( t , ξ ( t ) ) V ξ ( t , ξ ( t ) ) r i 2 v 1 2 ( r i I 1 2 h 1 T ( t , ξ ( t ) ) V x x ( t , ξ ( t ) ) h 1 T ( t , ξ ( t ) ) ) 1 ( h T ( t , ξ ( t ) ) ) H 2 ] d t ,
where notation y A 2 : = y T A y with A > 0 and H = r i I 1 2 h 1 T ( t , ξ ( t ) ) V x x ( t , ξ ( t ) ) h 1 T ( t , ξ ( t ) ) . Keeping Hamilton–Jacobi inequality (16), when u ( t ) = u * ( t ) , for every u i and v, we obtain
J i ( u i * , u i , v ) 0 .
So r i is an upper bound of ρ * of MOP problems (13)–(14). □
We let
V = ( V , r i ) | V ( t , x ) C 1 , 2 ( [ 0 , T ] , R n ) , V   a n d   r i > 0   s a t i s f i e s   H J I ( 16 ) ;
then, the Pareto boundary r i * of MOP (13) and (14) can be obtained by following form:
r i * = min ( V , r i ) V r i .
So the MOP problems of (13) for nonlinear System (14) can be transformed to an HJI-constraint MOP,
( r 1 * , , r i * , , r m * ) = min ( V , r i ) V ( r 1 , , r i , , r m ) .
Now, a linear stochastic system is considered, which is given as the following:
d ξ ( t ) = [ A ( t ) ξ ( t ) + i = 1 m B i ( t ) u ( t ) + C ( t ) v ( t ) ] + [ A 1 ( t ) ξ ( t ) + C 1 ( t ) v ( t ) ] d W ( t ) ξ ( 0 ) = ξ 0 R n ,
with multi-objective Problem (13); then, the results of Theorem 1 described by the Hamilton equation can be obtained as the following proposition with a Riccati equation form.
Proposition 1.
For a given r i > 0 , if there exists symmetric matrix P t , it satisfies the following Riccati equation:
P ˙ t = P t A ( t ) + A T ( t ) P t + A 1 T ( t ) P t A 1 ( t ) + Q i P t B i ( t ) R i 1 B i T ( t ) P t + 1 r i P t B i ( t ) B i T ( t ) P t + Q i P t B i ( t ) R i 1 B i T ( t ) P t + 1 r i P t B i ( t ) B i T ( t ) P t + P t C ( t ) + A 1 T ( t ) P t C 1 ( t ) r i I C 1 T ( t ) P t C 1 ( t ) 1 C T ( t ) P t + C 1 T ( t ) P t A 1 ( t ) r i I C 1 T ( t ) P t C 1 ( t ) > 0 , r i I P 0 0 ;
then, r i is an upper bound of r i * of MOP problems (13) and (20) with
u i * ( t ) = R i 1 B i T ( t , ξ ( t ) ) P t ξ ( t ) .
Proof. 
We let V ( t , x ) = x T P t x . We note the following facts:
V t ( t , x ) = x T P ˙ t x , V ξ ( t , x ) = 2 P t x , V x x ( t , x ) = 2 P t ,
and the coefficients of nonlinear System (14) are specialized by
f ( t , x ) = A ( t ) x , g i ( t , x ) = B i ( t ) , h ( t , x ) = C ( t ) , l ( t , x ) = A 1 ( t ) x , h 1 ( t , x ) = C 1 ( t ) .
It’s easy to show that V ( t , x ) = x T P t x satisfies HJIs (16). By Theorem 1, r i is an upper bound of r i * and the optimal controller u i * ( t ) given by (22). □
Remark 1.
Compared to Hamilton–Jacobi inequalities and Riccati equations applied in [23], the Hamilton–Jacobi inequalities (16) and Riccati equations (21) are all time-varying forms.

4. The Global Linearization Method for a Nonlinear Stochastic Financial System with Time-Varying Parameters

Generally, it is difficult to solve HJIs (16). In this section, a global linearization method is proposed to obtain an approximate solution of multi-objective Problem (14) with the constrain of nonlinear stochastic financial System (14). We suppose the global linearized systems of a nonlinear stochastic financial system in (14) are bounded by a polytope given as follows:
f ( t , x ) x g i ( t , x ) x h ( t , x ) x h 1 ( t , x ) x C 0 A 1 ( t ) B i 1 ( t ) C 1 ( t ) A 1 ( 1 ) ( t ) C 1 ( 1 ) ( t ) , A 2 ( t ) B i 2 ( t ) C 2 ( t ) A 2 ( 1 ) ( t ) C 2 ( 1 ) ( t ) , , A J ( t ) B i J ( t ) C J ( t ) A j ( 1 ) ( t ) C J ( 1 ) ( t ) , t [ 0 , T ] .
The state trajectory of a nonlinear financial system of (14) can be represented by the combination of state trajectories of the following local linearized systems derived from the vertices of the polytope in (23):
d ξ j ( t ) = [ A j ( t ) ξ j ( t ) + i = 1 m B i j ( t ) u i ( t ) + C j ( t ) v ( t ) ] d t + [ A j ( 1 ) ( t ) ξ i ( t ) + C j ( 1 ) v ( t ) ] d W ( t ) , ξ j ( 0 ) = ξ 0 R n , j = 1 , 2 , , J .
According to the global linearization theory [29], the trajectory of the shifted financial system in (14) can be represented by a convex combination of the trajectories of these local linearized financial systems in (24) as follows:
d ξ ( t ) = j = 1 J α j ( t , ξ ( t ) ) [ A j ( t ) ξ ( t ) + i = 1 m B i j ( t ) u i ( t ) + C j ( t ) v ( t ) ] d t + j = 1 J α j ( t , ξ j ( t ) ) [ A j ( 1 ) ( t ) ξ i ( t ) + C j ( 1 ) v ( t ) ] d W ( t ) , ξ j ( 0 ) = ξ 0 R n , j = 1 , 2 , , J ,
where α j ( t , ξ ( t ) ) denotes the interpolation functions defined as
α j ( t , ξ t ) = 1 / | ξ ( t ) ξ j ( t ) | 2 j = 1 J ( 1 / | ξ ( t ) ξ j ( t ) | 2 ) , ξ ( t ) ξ j ( t ) 1 , ξ ( t ) = ξ j ( t ) .
In practice, we can replace the trajectory of the nonlinear stochastic financial system in (14) by the trajectory of the interpolated financial system in (25).
Proposition 2.
For a given r i > 0 , if there exists symmetric matrix P t that satisfies the following Riccati equations,
P ˙ t = P t A j ( t ) + A j T ( t ) P t + A 1 j T ( t ) P t A 1 j ( t ) + Q i P t B i j ( t ) R i 1 B i j T ( t ) P t + 1 r i P t B i j ( t ) B i j T ( t ) P t + Q i P t B i j ( t ) R i 1 B i j T ( t ) P t + 1 r i P t B i j ( t ) B i j T ( t ) P t + P t C j ( t ) + A 1 j T ( t ) P t C j 1 ( t ) r i I C 1 j T ( t ) P t C 1 j ( t ) 1 × C j T ( t ) P t + C 1 j T ( t ) P t A 1 j ( t ) r i I C 1 j T ( t ) P t C 1 j ( t ) > 0 , r i I P 0 0 . ,
then r i is a upper bound of ρ * of MOP problems (13) and (25) with
u i * ( t ) = j = 1 J α j ( t , ξ ( t ) ) R i 1 B i T ( t , ξ ( t ) ) P t ξ ( t ) .
Remark 2.
Compared to the Riccati equations constructed in [23], the Riccati equations (26) appllied in Proposition 2 are all given by ordinary differential equations with time-varying forms other than algebra equations proposed in [23].
We let
P = ( P t , r i ) | s y m m e t r i c   P t   a n d   r i > 0   s a t i s f i e s   R i c c a t i   e q u a t i o n ( 26 ) ;
then, the Pareto boundary r i * of MOP (13) and (25) can be obtained by following form:
r i * = min ( P t , r i ) P r i .
Using J local linear financial systems in (24) to interpolate nonlinear stochastic financial System (14), the HJI-constraint MOP of a time-varying nonlinear financial system in (16) is transformed into the following Riccati equation-constraint MOP:
( r 1 * , , r i * , , r m * ) = min ( P t , r i ) P ( r 1 , , r i , , r m ) .
Remark 3.
The following steps can help us to obtain the values of r i * :
Step 1: Take a positive r i ( 0 ) > 0 as an initial value.
Step 2: For a given r i ( 0 ) , solve Riccati equations (26).
Step 3: If Riccati equations (26) have a solution for r i ( 0 ) , take another positive r i ( 1 ) smaller than r i ( 0 ) , i.e., r i ( 1 ) < r i ( 0 ) ; else, take r i ( 1 ) larger than r i ( 0 ) , i.e., r i ( 1 ) > r i ( 0 ) .
Step 4: Update value r i ( 0 ) by r i ( 1 ) ; repeat Steps 1 to 4.
Step 5: Continue until for r i ( 0 ) , the Riccati equations of (26) have solutions but the ones of r i ( 1 ) do not, and the error between r i ( 0 ) and r i ( 1 ) is lesser than the given accuracy δ > 0 , i.e., | r i ( 0 ) r i ( 1 ) | < δ . Take ρ i ( 0 ) as the approximation of Pareto boundary r i * .

5. Examples and Simulations

Inherent random fluctuations and external disturbances of actual environmental conditions and investors’ investment strategies play a very important role in the financial system in real life. In this example, let Wiener process W ( t ) represent the inherent random fluctuations, let v ( t ) represent external disturbance. The financial model is described by the following Itô-type stochastic differential equations:
d ξ ( t ) = [ ζ ( t ) + ( η ( t ) a ( t ) ) ξ ( t ) + u 1 ( t ) + 0.1 v ( t ) ] d t + [ ζ ( t ) + ( η ( t ) a ( t ) ) ξ ( t ) + 0.2 v 1 ( t ) ] d W ( t ) d η ( t ) = [ 1 b ( t ) η ( t ) x 2 ( t ) + 0.1 v ( t ) ] d t + u 2 ( t ) + 0.1 v ( t ) ] d t + [ 1 b ( t ) η ( t ) x 2 ( t ) + 0.3 v 2 ( t ) ] d W ( t ) d ζ ( t ) = [ ξ ( t ) c ( t ) ζ ( t ) + u 3 ( t ) + 0.1 v ( t ) ] d t ξ ( t ) + c ( t ) ζ ( t ) + 0.2 v 3 ( t ) ] d W ( t ) ,
where W ( t ) is the Wiener process, v ( t ) represents the corresponding external interference, u 1 ( t ) , u 2 ( t ) and u 3 ( t ) denote the investment strategies of government, bank consortium and public, respectively.
The goal of the multi-objective problem for System (31) is given by the following:
r i * = min u i ( t ) max u i ( t ) E 0 T [ | ξ ( t ) | 2 + | η ( t ) | 2 + | ζ ( t ) | 2 + | u i ( t ) | 2 ] d t E 0 T | u i | 2 ( t ) d t + E 0 T | v ( t ) | 2 d t . i = 1 , 2 , , m .
Take the coefficients in global linearization System (25) corresponding to nonlinear System (31) with the following forms:
A j ( t ) = Ξ 1 ( j ) ( t ) 0 1 Ξ 2 ( j ) ( t ) b ( t ) 0 1 0 c ( t ) , B 1 j ( t ) = 1 0 0 , B 2 j ( t ) = 0 1 0 , B 3 j ( t ) = 0 0 1 , C j ( t ) = 1 1 1 , Q i = 1 0 0 0 1 0 0 0 1 , R i = 1 ,
where Ξ 1 ( j ) ( t ) = η ( t ) + 1 / b ( t ) a ( t ) , Ξ 2 ( j ) ( t ) = ξ ( t ) , and A 1 ( j ) ( t ) = σ A j ( t ) , C 1 ( j ) ( t ) = C j ( t ) , σ = 0.4 . Q i and R i are the weighted matrices of [ ξ ( t ) , η ( t ) , ζ ( t ) ] T and u i , respectively.
Figure 1a illustrates the performance of points ( r 1 , r 2 , r 3 ) , where r 1 , r 2 and r 3 are the upper bounds of r 1 * , r 2 * and r 3 * , respectively. In order to display the upper bound and the Pareto boundary more clearly, Figure 1b–d show the pairwise relationships between r 1 , r 2 and r 3 with their Pareto boundaries r 1 * , r 2 * and r 3 * .
( r 1 * , r 2 * , r 3 * ) = min ( P t , r i ) P ( r 1 , r 2 , r 3 ) .
Figure 2 shows the surface of the Pareto boundary, which verifies the relationships between ( r 1 * , r 2 * , r 3 * ) and ( r 1 , r 2 , r 3 ) described by (33).
Figure 3 shows the profile of Brownian motion W ( t ) , and Figure 4 illustrates the trajectories of ξ ( t ) , η ( t ) and ζ ( t ) in System (31) driven by a Brownian motion. Figure 5 illustrates trajectories of u 1 ( t ) , u 2 ( t ) and u 3 ( t ) , and they represent the investment strategies of government, bank consortium and public, respectively.

6. Conclusions

The non-cooperative game strategy for a nonlinear stochastic time-varying parameter financial system is studied. The proposed non-cooperative game is robust enough to achieve the desired goal of each player. Different from the traditional iterative method, it can be solved by all participants at the same time, and can realize the Nash equilibrium solution. When the value of a parameter in the system changes with time, the entire property changes with time. Finally, a practical example of investment and international capital flow is given to verify the performance of the proposed non-cooperative game strategy design for nonlinear stochastic time-varying parameter systems with continuous fluctuations.

Author Contributions

Conceptualization, R.Z. and X.L.; methodology, X.L. and T.Z.; software, X.L. and R.Z.; validation, X.L. and M.L.; formal analysis, M.L., X.L. and W.Z.; investigation, W.Z.; resources, X.L. and T.Z.; data curation, X.L.; writing—original draft preparation, X.L. and T.Z.; writing—review and editing, X.L. and R.Z.; visualization, W.Z.; supervision, W.Z.; project administration, X.L.; funding acquisition, X.L. and W.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant numbers 62273212, 62373229; the Natural Science Foundation of Shandong Province of China, grant numbers ZR2020MF062, ZR2021MF052.

Data Availability Statement

Data is contained within the article.

Acknowledgments

We thank the anonymous reviewers for their constructive suggestions to improve the quality of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Reny, P.J. Nash equilibrium in discontinuous games. Annu. Rev. Econ. 2020, 12, 439–470. [Google Scholar] [CrossRef]
  2. Fu, H.F. On the existence of Pareto undominated mixed-strategy Nash equilibrium in normal-form games with infinite actions. Econ. Lett. 2021, 201, 109771. [Google Scholar] [CrossRef]
  3. Poveda, J.I.; Krstic, M.; Basar, T. Fixed-time Nash equilibrium seeking in time-varying networks. IEEE Trans. Autom. Control 2023, 68, 1954–1969. [Google Scholar] [CrossRef]
  4. Nash, J. Non-cooperative games. Ann. Math. 1951, 54, 286–295. [Google Scholar] [CrossRef]
  5. Goldman, A.J.; Shier, D.R. Player aggregation in noncooperative games. J. Res. Natl. Bur. Stand. 1980, 86, 391–396. [Google Scholar] [CrossRef] [PubMed]
  6. Zhao, C.X.; Hong, S.H.; Li, C.H. Existence of extended Nash equilibriums of nonmonetized noncooperative games. Fixed Point Theory Appl. 2015, 2015, 65. [Google Scholar] [CrossRef]
  7. Ye, M.J.; Hu, G.Q. Distributed seeking of time-varying Nash equilibrium for non-cooperative games. IEEE Trans. Autom. Control 2015, 60, 3000–3005. [Google Scholar] [CrossRef]
  8. Aduba, C.; Won, C.H. N-player statistical Nash game control: m-th cost cumulant optimization. IEEE Trans. Autom. Control 2016, 61, 2688–2694. [Google Scholar] [CrossRef]
  9. Bressan, A.; Nguyen, K.T. Stability of feedback solutions for infinite horizon noncooperative differential games. Dyn. Games Appl. 2018, 8, 42–78. [Google Scholar] [CrossRef]
  10. Dianetti, J.; Ferrari, G. Nonzero-sum submodular monotone-follower games: Existence and approximation of Nash equilibria. SIAM J. Control Optim. 2020, 58, 1257–1288. [Google Scholar] [CrossRef]
  11. Zhang, Y.; Chang, S.S.; Chen, T. Existence and generic stability of strong noncooperative equilibria of vector-valued games. Mathematics 2022, 9, 3158. [Google Scholar] [CrossRef]
  12. Zhang, Y.F.; Liu, F.; Wang, Z.J.; Chen, Y.; Feng, S.L.; Wu, Q.W.; Hou, Y.H. On Nash-Stackelberg-Nash games under decision-dependent uncertainties: Model and equilibrium. Automatica 2022, 142, 110401. [Google Scholar] [CrossRef]
  13. Setiawan, R.; Salmah; Endrayanto, I.; Indarsih. Analysis of the n-person noncooperative supermodular multiobjective games. Oper. Res. Lett. 2023, 51, 278–284. [Google Scholar] [CrossRef]
  14. Ma, J.H.; Chen, Y.S. Study for the bifurcation topological structureand the global complicated character of a kind of nonlinear financesystem. Appl. Math. Mech. 2001, 22, 1240–1251. [Google Scholar] [CrossRef]
  15. Cont, R.; Tankov, P. Financial Modelling with Jump Processes; CRC Press: Boca Raton, FL, USA, 2004. [Google Scholar]
  16. Wang, L. Dynamic model of stock prices based on technical tradingrules—Part I: The models. IEEE Trans. Fuzzy Syst. 2015, 23, 787–801. [Google Scholar] [CrossRef]
  17. Wang, L. Dynamical model of stock prices based on technical trading rules—Part II: Analysis of the model. IEEE Trans. Fuzzy Syst. 2015, 23, 1127–1141. [Google Scholar] [CrossRef]
  18. Wang, L. Dynamic model of stock prices based on technical tradingrules—Part III: Application to Hong Kong stocks. IEEE Trans. Fuzzy Syst. 2015, 23, 1680–1697. [Google Scholar] [CrossRef]
  19. Stoyanov, J. Stochastic financial models. J. Roy. Stat. Soc. A (Stat. Soc.) 2001, 174, 510–511. [Google Scholar] [CrossRef]
  20. Chen, B.S.; Chen, W.H.; Wu, H.L. Robust H2/H global linearization filter design for nonlinear stochastic systems. IEEE Trans. Circuits Syst. 2009, 56, 1441–1454. [Google Scholar] [CrossRef]
  21. Wu, C.F.; Chen, B.S.; Zhang, W. Multiobjective investment policy for a nonlinear stochastic financial system: A fuzzy approach. IEEE Trans. Fuzzy Syst. 2017, 25, 460–474. [Google Scholar] [CrossRef]
  22. Chen, B.S.; Yeh, C.H. Stochastic noncooperative and cooperative evolutionary game strategies of a population of biological network under natural selection. Bio. Syst. 2017, 162, 90–118. [Google Scholar] [CrossRef] [PubMed]
  23. Chen, B.S.; Chen, W.Y.; Young, C.T.; Yan, Z. Noncooperative game strategy in cyber-financial systems with Wiener and Poisson random fluctuations: LMIs-constrained MOEA approach. IEEE Trans. Cybern. 2018, 48, 3323–3336. [Google Scholar] [CrossRef] [PubMed]
  24. Chen, B.S.; Lee, M.Y. Noncooperative and cooperative strategy designs for nonlinear stochastic jump diffusion systems with external disturbance: T-S fuzzy approach. IEEE Trans. Fuzzy Syst. 2020, 28, 2437–2451. [Google Scholar] [CrossRef]
  25. Tan, C.; Zhang, S.; Wong, W.; Zhang, Z. Feedback stabilization of uncertain networked control systems over delayed and fading channels. IEEE Trans. Control Netw. Syst. 2021, 8, 260–268. [Google Scholar] [CrossRef]
  26. Tan, C.; Yang, L.; Zhang, F.; Zhang, Z.; Wong, W. Stabilization of discrete time stochastic system with input delay and control dependent noise. Syst. Control Lett. 2019, 123, 62–68. [Google Scholar] [CrossRef]
  27. Zhang, T.; Deng, F.; Sun, Y.; Shi, P. Fault estimation and fault-tolerant control for linear discrete time-varying stochastic systems. Sci. China Inf. Sci. 2021, 64, 200201. [Google Scholar] [CrossRef]
  28. Jiang, X.; Zhao, D. Event-triggered fault detection for nonlinear discrete-time switched stochastic systems: A convex function method. Sci. China Inf. Sci. 2021, 64, 200204. [Google Scholar] [CrossRef]
  29. Boyd, S.; El Ghaoui, L.; Feron, E.; Balakrishnan, V. Linear Matrix Inequalities in System and Control Theory; SIAM: Philadelphia, PA, USA, 1994. [Google Scholar]
  30. Tan, C.; Yang, L.; Wong, W. Learning based control policy and regret analysis for online quadratic optimization with asymmetric information structure. IEEE Trans. Cybern. 2022, 52, 4797–4810. [Google Scholar] [CrossRef]
  31. Tan, C.; Zhang, H.; Wong, W. Delay-dependent algebraic Riccati equation to stabilization of networked control systems: Continuous-time case. IEEE Trans. Cybern. 2018, 48, 2783–2794. [Google Scholar] [CrossRef]
Figure 1. Performance of ( r 1 , r 2 , r 3 ) and Pareto boundary of ( r 1 * , r 2 * ) , ( r 2 * , r 3 * ) , ( r 3 * , r 1 * ) .
Figure 1. Performance of ( r 1 , r 2 , r 3 ) and Pareto boundary of ( r 1 * , r 2 * ) , ( r 2 * , r 3 * ) , ( r 3 * , r 1 * ) .
Axioms 13 00003 g001
Figure 2. Performance of Pareto boundary ( r 1 * , r 2 * , r 3 * ) .
Figure 2. Performance of Pareto boundary ( r 1 * , r 2 * , r 3 * ) .
Axioms 13 00003 g002
Figure 3. Profiles of Brownian motion.
Figure 3. Profiles of Brownian motion.
Axioms 13 00003 g003
Figure 4. Trajectories of ξ ( t ) , η ( t ) and ζ ( t ) .
Figure 4. Trajectories of ξ ( t ) , η ( t ) and ζ ( t ) .
Axioms 13 00003 g004
Figure 5. Trajectories of controllers u 1 ( t ) , u 2 ( t ) and u 3 ( t ) .
Figure 5. Trajectories of controllers u 1 ( t ) , u 2 ( t ) and u 3 ( t ) .
Axioms 13 00003 g005
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lin, X.; Zhang, T.; Li, M.; Zhang, R.; Zhang, W. Multi-Player Non-Cooperative Game Strategy of a Nonlinear Stochastic System with Time-Varying Parameters. Axioms 2024, 13, 3. https://doi.org/10.3390/axioms13010003

AMA Style

Lin X, Zhang T, Li M, Zhang R, Zhang W. Multi-Player Non-Cooperative Game Strategy of a Nonlinear Stochastic System with Time-Varying Parameters. Axioms. 2024; 13(1):3. https://doi.org/10.3390/axioms13010003

Chicago/Turabian Style

Lin, Xiangyun, Tongtong Zhang, Meilin Li, Rui Zhang, and Weihai Zhang. 2024. "Multi-Player Non-Cooperative Game Strategy of a Nonlinear Stochastic System with Time-Varying Parameters" Axioms 13, no. 1: 3. https://doi.org/10.3390/axioms13010003

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop