Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Next Article in Journal
Solvability, Approximation and Stability of Periodic Boundary Value Problem for a Nonlinear Hadamard Fractional Differential Equation with p-Laplacian
Next Article in Special Issue
A Lower Bound for the Volatility Swap in the Lognormal SABR Model
Previous Article in Journal
Tsallis Entropy for the Past Lifetime Distribution with Application
Previous Article in Special Issue
Uncovering the Impact of Local and Global Interests in Artists on Stock Prices of K-Pop Entertainment Companies: A SHAP-XGBoost Analysis
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Equilibrium Strategy for Target Benefit Pension Plans with a Longevity Trend and Partial Information

College of Mathematics and System Sciences, Xinjiang University, Urumqi 830017, China
*
Author to whom correspondence should be addressed.
Axioms 2023, 12(8), 732; https://doi.org/10.3390/axioms12080732
Submission received: 21 June 2023 / Revised: 18 July 2023 / Accepted: 25 July 2023 / Published: 27 July 2023
(This article belongs to the Special Issue Mathematical and Computational Finance Analysis)

Abstract

:
This paper considers the problem of portfolio selection and adjustment for target benefit plans (TBP) with longevity trends and partial information. The longevity trends are modeled by a time-varying force function. The financial market consists of risk-free assets and stocks, in which the return rate of stocks is a stochastic process and cannot be completely observed. This paper adopts the mean-variance utility model as an optimization criterion. The aim is to maximize the terminal value of the pension fund and the excess pension benefit after the participant’s retirement. The optimization equations are developed in game theory to obtain explicit solutions for the equilibrium strategies. Finally, the influence of the longevity trend on the internal structure of the pension system and the sensitivity of the equilibrium strategies to the related parameters are explored by numerical analysis. The conclusion shows that this model’s results can provide stable and adequate retirement benefits for participants.

1. Introduction

Pension financialization refers to investing pension funds in financial markets to achieve higher returns, thereby providing more support for pension systems facing the challenges of population aging and sustainability. The goal of pension financialization is to enhance the value of pension funds by investing in diverse financial instruments such as stocks, bonds, and real estate, to consequently increase the pension returns and alleviate government fiscal pressure. Nevertheless, this approach has risks and challenges due to the volatility and uncertainty of the financial markets [1,2]. Therefore, effective risk management strategies and regulatory measures must be developed to ensure the safe and stable operation of pension funds.
Typically, pension plans are categorized as either defined-benefit (DB) pensions or defined-contribution (DC) pensions, based on the varying arrangements of contributions and benefits. The DB pension plan provides participants with stable retirement benefits, but the plan sponsor bears all the risks. In DC pension plans, the risks are entirely borne by the plan members, but the benefits for the retired members mainly depend on the investment returns, and their future benefits are highly uncertain. Due to the increased risk of longevity and demographic shifts, the traditional DB and DC pensions have gradually revealed their limitations, and policymakers are actively searching for a more up-to-date pension system, hoping to further optimize the existing pension system. This includes the collective DC pension plan [3,4] from the Netherlands, the risk-sharing DB plan [5,6] from Japan, and the target benefit plan (TBP) [7,8] from Canada.
The target benefit pension plan is a type of hybrid pension plan that encompasses the advantages of both the DB and DC plans. The plan risks are no longer borne by individuals but shared by plan participants of different generations (including active individuals, retirees, and members not yet in the pension plan). The plan enables risk transfer and sharing among its members, so as to maintain a stable level of pension benefits across generations and ensure the sustainability of the pension plan. Gollier (2008) [9] and Cui (2011) [10] showed that compared with a traditional DB pension or an individual DC pension, the hybrid pension plan with the characteristics of risk sharing can improve the benefit level of retirees. In the management of the target return pension plan, Wang et al. (2018) [11] controlled the cumulative retirement return in the optimization goal and obtained the optimal strategy by minimizing the discontinuous risk and retirement return risk of the pension plan to ensure stable and safe returns for participants over time. Wang et al. (2019) [12] used an S-shaped utility to describe the risk preference of the insured and considered the TBP model based on the risk preference. Zhao and Wang (2022) [13] adopted a binary utility function to express sponsors’ preference for benefit and wealth, to obtain the optimal investment strategy and dynamic adjustment strategy of the retirement income, ensuring adequate retirement benefits for retirees, while maintaining intergenerational risk sharing equity.
In pension management problems, the existing literature can be divided into three categories according to different optimization objectives: utility maximization/minimization [14,15], minimization of the deviation of account assets from the desired target [16,17], and maximization of the terminal value under the mean-variance (MV) criterion [18,19]. Among them, the first type of optimization objective generally expects to maximize the terminal wealth and focuses more on the return of the account funds. The second type of optimization objective sets an expected target for the account assets, according to the participants’ retirement needs, and considers the strategy that can achieve the expected target, focusing on the safety of the pension investment. The third type of optimization criterion is to obtain the optimal strategy that takes into account both the pension benefits and risks. For pension managers, each of these three optimization objectives has its advantages and disadvantages, and they can only choose the appropriate optimization criteria for different preferences. In the management problem of TBP plans, many scholars have studied the optimal management problem under the first two optimization objectives and obtained the desired results. Unfortunately, the third type of mean-variance objective has yet to be studied, and there is much room for research.
The MV portfolio selection theory proposed by Markowitz [20] manages risk without unduly restricting investment. It is considered the beginning of modern finance, and the theory has become one of the foundations of modern finance. Due to the failure of the MV criterion to satisfy the iterated expectation property, it cannot be utilized directly in dynamic programming principles to derive a time-consistent optimal strategy. To address this limitation, Basak and Chabakauri (2010) [21] solved this problem by defining a time-consistent equilibrium strategy in a game theoretic framework. Similarly, Björk and Murgoci (2010) [22] applied this method to solve the more general Markovian time-inconsistent problem. Based on Björk and Murgoci (2010) [22]’s theory, scholars have solved various time-inconsistent problems by solving the extended Hamilton–Jacobi–Bellman (HJB) system of equations [23,24]. In recognition of the significant influence of consumption on investment decisions in real-life situations, Kronborg and Steffensen (2015) [25] attempted to consider the consumption choices in the MV criterion to explore the optimal asset allocation strategy. However, numerical simulations revealed that the consumption choices obtained by Kronborg and Steffensen (2015) [25] were discontinuous and did not correspond to reality. To address this issue, Yang et al. (2021) [26] put forward a novel category of the mean-variance utility (MVU) model to study the investment–consumption portfolio problem. The equilibrium strategy was obtained in the game theory framework, which overcame the shortcomings of Kronborg and Steffensen (2015) [25]. In addition, Kang (2021) [27] investigated the equilibrium strategy in which the prices of risky assets were controlled by the Heston stochastic volatility model, based on Yang et al. (2021) [26]. Based on the above research, we introduce the MVU model into the management problem of TBP plans. This criterion integrates both the return and risk of pension wealth at the terminal moment and the utility of the deviations of benefit payments from the expected target over the entire time horizon. The MVU enables decision makers to identify the optimal investment and benefit payment strategy that balances the enhancement of wealth and the fulfillment of benefits, maximizes the interests and needs of participants, and hence accomplishes the long-term financial planning and goals.
The accelerating rate of population aging due to the worldwide mortality reduction and the global fertility decline has put tremendous pressure on traditional pension plans. In recent years, governments, financial institutions, and individuals have been affected by longevity risk, and it has become a common challenge worldwide. In 2004, European investment banks tried to issue longevity bonds, and capital markets issued many derivatives, such as longevity swaps, to transfer longevity risk. Some of the literature (Wong et al. (2014) [28], Kwok et al. (2016) [29]) has considered the use of longevity-related assets to hedge against longevity risk in the market. In addition, another approach proposed by scholars to deal with longevity risk is the natural hedge approach, in which lower mortality increases annuity product payments while reducing life insurance product expenses. Combining the two creates a natural hedge to reduce losses from longevity risk. Whichever method is chosen, an accurate forecast of future mortality is required. Earlier mortality models were static, where mortality did not change over time and is no longer applicable. Therefore, scholars have proposed dynamic mortality prediction models that add a time factor to the static mortality models. Furthermore, Strulik and Vollmer (2013) [30] showed that the increase in the average life expectancy from the second half of the 20th century onwards was mainly due to the increase in the maximum age of the population. Therefore, Knell (2018) [31] assumed in his article that the maximum age of the population increased linearly with time. Devolder and Melis (2015) [32] argued that the mortality force is dependent on the age and the time of a specific population. On the one hand, due to medical advances and economic development, people’s mortality rates are gradually decreasing. On the other hand, as people age, the force of mortality increases. Inspired by Knell (2018) [31], who assumed that the maximum age of survival was an increasing function of time, Fu (2023) [33] also applied an age- and time-dependent mortality force, called the modified Makeham’s Law, to describe the longevity risk.
Until now, most studies have assumed that investors can fully grasp the economic structure and market conditions, but this assumption is difficult to satisfy in practice. For example, in portfolio problems, because the distribution nature of risk asset prices related to portfolio decisions is usually unobtainable, decision makers cannot accurately obtain the rate of return of risk assets. They can only estimate it based on all the available information. In addition, many studies have confirmed the predictability of the return rate of risk assets [34,35]. Wang et al. (2021) [36] analyzed the variance between optimal investment strategies for managing DC pensions in scenarios of partial and complete information. The results showed that partial information could affect the investment proportion of stocks. Applying filtering technology and the stochastic dynamic programming method, Wang et al. (2022) [37] derived the optimal investment strategy for a DC pension plan with stock return predictability and inflation risk. The study highlighted that the predictability of the partially observable stock returns yield plays a crucial role in the optimal investment strategy for inflation-indexed bonds.
This paper considers a TBP model with overlapping generations. In the continuous-time case, the insured’s contribution is determined. At the same time, the payment of the insured is the target payment, and the income risk of the fund account is shared by the insured of different generations. Since longevity risk exists inside the pension system and directly affects the size of the number of people and accounts in the pension system, the longevity trend factor is considered in constructing the pension model in this paper. The financial market consists of a risk-free asset and a stock, in which the return rate of stock is random, subject to a mean recovery process, and cannot be completely observed. The ultimate goal is to design a new risk-sharing plan for pension planners according to the participants’ risk preferences and objectives. Here, to simultaneously measure the benefits and risks of the pension plan in the investment process and make the benefits of retired members reach or exceed the preset target benefits during the whole distribution period, the mean-variance utility objective model is adopted. To maximize the terminal wealth of pension accounts, minimize the investment risks, and maximize the pension excess benefits after the retirement of the insured persons, the optimal equation was established under the thought of game theory, and the balanced investment strategy and benefit adjustment strategy were obtained. Finally, the sensitivity of the equalization strategy to the related parameters is explored through numerical analysis. The contributions of this paper are as follows: (1) this paper introduces the mean-variance utility model into the optimal management of the target benefit plan for the first time. It obtains the equilibrium investment strategy and benefit payment adjustment strategy. (2) This paper considers the longevity trend phenomenon in the model’s construction and explores the impact of the longevity trend on the pension system through numerical simulation. (3) This paper considers partial information factors in controlling the target benefit pension plan for the first time, deduces the investment and return adjustment strategy under the fully observable stock return, and compares it with the strategy under the partially observable condition. (4) This paper deduces the equilibrium benefit adjustment strategy under different utility functions.
The rest of this paper is arranged as follows: In Section 2, some necessary assumptions are given, the model framework is described, and the optimization problem is proposed. Section 3 presents the model, giving the theoretical results and the conclusion under special circumstances. In Section 4, the sensitivity analysis of the relevant parameters is carried out by numerical simulation, and the corresponding analysis is presented. Section 5 gives the main conclusions of this paper.

2. Model Formulation

Let ( Ω , F , F , P ) be a complete probability space satisfying the usual conditions, where P is a probability measure on Ω , F : = { F t } t [ 0 , T ] is a complete and right-continuous filtration, and F t represents the sum of all information obtained up to time t, i.e., the decision made at any time t is based on the filtration F t . T > 0 presents the investment horizon of the pension. L F 2 ( 0 , T , R ) denotes the set of all R-valued measurable stochastic processes f ( s ) adapted to { F s } s [ t , T ] , where E [ 0 T f 2 ( t ) d t ] < .

2.1. Demographic and Economic Formulation

This paper considers a target benefit pension system consisting of working-stage and retirement-stage participants. Let a, b, and m ( t ) be the entrance age, the retirement age, and the maximal survival age of the participants. It is noteworthy that most scholars in the literature generally assume the maximum survival age to be constant; however, here it is assumed to be a function of time, because the maximum survival age varies continuously with time t. As medical care and people’s living standards continue to improve, the problem of population aging is becoming more and more serious. In order to reflect the trend of population aging in the model, a modified version of the de Moivre function in Knell (2018) [31] is used as a survival function to study the impact of this phenomenon on the internal system of pensions.
s ( t , x ) = 1 x m ( t ) λ , a < x < m ( t ) , m ( t ) = m ( 0 ) + φ t ,
where φ 0 is the increase factor of the maximum survival age, and λ 0 represents the adjustment coefficient of the survival function. In particular, when λ = 1 , the survival function s ( t , x ) is the original linear de Moivre survival function.
Let p ( t ) indicate the number of people newly enrolled in the pension scheme at time t. Then, the number of the cohort at x-year-old at time t is
p ( t ( x a ) ) s ( t , x ) , x > a .
The total number of members in the working and the retirement stages are as follows, respectively:
Θ ( t ) = a b s ( t , x ) p ( t ( x a ) ) d x , Ξ ( t ) = b m ( t ) s ( t , x ) p ( t ( x a ) ) d x .
L ( 0 ) is the average annual salary of the newly retired, which increases at the rate of α . Therefore, the average annual salary of a retiree at age b at time t is L ( t ) = L ( 0 ) e α t . The average salary for a retirees aged x at time t is L ( t ) e α ( x b ) , denoted by L ¯ ( t , x ) .
The proportion of the pension to the average salary of the insured is an important standard to measure the pension management performance and reflects the degree of the replacement of the pension after retirement to the pre-retirement salary and the degree of the maintenance of consumption. Similar to Wang et al. (2018) [11], we suppose the annual benefit payment rate of a pension is a certain percentage of the average annual salary at the time of retirement, denoted b ( t ) . The annual payment rate of the pension for a retiree aged x at time t is
B ( t , x ) = b ( t ) L ¯ ( t , x ) e θ ( x b ) = b ( t ) L ( t ) e ( α θ ) ( x b ) , 0 t T , x b ,
where θ represents the interest rate. The actual total cumulative benefit payment at time t is
B ( t ) = b m ( t ) s ( t , x ) p ( t x + a ) b ( t ) L ( t ) e ( α θ ) ( x b ) d x , 0 t T , x b ,
denoted by I ( t ) = b m ( t ) s ( t , x ) p ( t x + a ) e ( α θ ) ( x b ) d x ; then, B ( t ) = I ( t ) L ( t ) b ( t ) .
It is assumed that B * is the total target retirement benefit at time 0, with the interest rate β increasing. Therefore, the total target benefit at time t is B * e β t .
Let C 0 be the instantaneous contribution rate of the working member at time 0, and the contribution rate increases exponentially with rate α . Then, the total contribution rate C ( t ) of all working members is
C ( t ) = a b s ( t , x ) p ( t x + a ) C 0 e α t d x , 0 t T ,
denoted C 1 ( t ) = a b s ( t , x ) p ( t x + a ) C 0 d x = C 0 Θ ( t ) ; then, C ( t ) = C 1 ( t ) e α t .

2.2. Financial Market

The price process of a risk-free asset is
d S 0 ( t ) = r S 0 ( t ) d t , S 0 ( 0 ) = s 0 > 0 .
The price process of a stock S ( t ) is
d S ( t ) S ( t ) = μ ( t ) d t + σ s d W s ( t ) , S ( 0 ) = s 1 > 0 ,
where μ ( t ) represents the random rate of return of the stock, subject to the following process
d μ ( t ) = k ( δ μ ( t ) ) d t + σ s μ d W s ( t ) + σ μ d W μ ( t ) ,
where k > 0 is the mean regression parameter, δ > 0 is the long-term mean of the variable μ ( t ) , and σ s μ > 0 and σ μ > 0 are the volatility index.
In the financial market, it is assumed that pension plan managers can only observe the stock price process S ( t ) and cannot directly observe the variable μ ( t ) . Therefore, decisions made by managers must be adapted to F t S = σ { S ( u ) : u [ 0 , t ] } ( F t ) , i.e., the filtering generated by the stock price process. Since W μ ( t ) does not adapt to F t S , the plan manager lacks information about W μ ( t ) , which is called partial information.
Suppose that π ( t ) is the dollar amount invested in the stock at time t; then, { ( π ( s ) , b ( s ) ) } s [ t , T ] represents the strategy adopted by the plan manager from time t to T. The dynamic process of pension fund X π , b ( t ) is satisfied by
d X π , b ( t ) = π ( t ) d S ( t ) S ( t ) + ( X ( t ) π ( t ) ) d S 0 ( t ) S 0 ( t ) + ( C ( t ) B ( t ) ) d t = [ π ( t ) ( μ ( t ) r ) + X ( t ) r + C ( t ) I ( t ) L ( t ) b ( t ) ] d t + σ s π ( t ) d W s ( t ) .

2.3. The Optimization Problem

The mean-variable utility model is used to study the optimal management of the TBP plan. This objective function maximizes the cumulative expected utility of the retirees’ benefits during the whole distribution period while ensuring the pension fund maximizes the total wealth and minimizes the investment risk at the final moment.
max π ( · ) , b ( · ) E t , x , μ [ X ( T ) ] γ 1 2 Var t , x , μ [ X ( T ) ] + γ 2 E t , x , μ t T e r ( s t ) U ( B ( s ) B * e β s ) d s , s u c h t h a t π ( · ) , b ( · ) L F 2 ( 0 , T ; R ) , ( X ( · ) , π ( · ) , b ( · ) ) satisfy Equation ( 1 ) ,
where γ 1 > 0 is the risk avoidance parameter of the investment process, and γ 2 > 0 is the preference factor, which is used to measure how much attention fund managers attach to pension returns. U ( · ) is the utility function of pension excess benefits. For writing purposes, we denote E t , x , μ [ · ] = E [ · | X π , b ( t ) = x , μ ( t ) = μ ] , Var t , x , μ [ · ] = Var [ · | X π , b ( t ) = x , μ ( t ) = μ ] .

3. Equilibrium Investment and Benefit Payment Adjustment Strategy

3.1. Filtration Problem

We define μ ^ ( t ) = E [ μ ( t ) | F t S ] and Σ ( t ) = E [ ( μ ( t ) μ ^ ( t ) ) 2 | F t S ] as the conditional expectation and conditional variance of μ ( t ) , respectively. According to Theorem 12.7 in Liptser and Shiryayev (2001) [38], there is
d μ ^ ( t ) = k ( δ μ ^ ( t ) ) d t + σ s μ σ s + Σ ( t ) σ s 2 d S ( t ) S ( t ) μ ^ ( t ) d t , d Σ ( t ) d t = 2 k Σ ( t ) + σ μ 2 + σ s μ 2 σ s μ σ s + Σ ( t ) σ s 2 .
Let d Z ˜ ( t ) = σ s 1 d S ( t ) S ( t ) . We define the innovations process B s ( t ) as
d B s ( t ) = d Z ˜ ( t ) σ s 1 μ ^ ( t ) d t .
B s ( t ) is a standard Brownian motion with respect to F t S t [ 0 , T ] . Therefore, Z ˜ ( t ) has a semi-martingale representation with respect to the filtration F t S t [ 0 , T ] ,
d Z ˜ ( t ) = d B s ( t ) + σ s 1 μ ^ ( t ) d t .
We have
d S ( t ) = S ( t ) [ μ ^ ( t ) d t + σ s d B s ( t ) ] ;
thus, the estimation process of the stock return rate is
d μ ^ ( t ) = k ( δ μ ^ ( t ) ) d t + Σ ( t ) σ s + σ s μ d B s ( t ) , d Σ ( t ) d t = σ μ 2 2 ( k + σ s μ σ s ) Σ ( t ) Σ 2 ( t ) σ s 2 .
The dynamic equations are given by
d S ( t ) S ( t ) = μ ^ ( t ) d t + σ s d B s ( t ) , d X π , b ( t ) = π ( t ) d S ( t ) S ( t ) + ( X ( t ) π ( t ) ) d S 0 ( t ) S 0 ( t ) + ( C ( t ) B ( t ) ) d t = [ π ( t ) ( μ ^ ( t ) r ) + X ( t ) r + C ( t ) I ( t ) L ( t ) b ( t ) ] d t + σ s π ( t ) d B s ( t ) .
Using the Kalman filtering theory, the optimization Problem (2) is as follows:
max π ( · ) , b ( · ) E t , x , μ ^ [ X ( T ) ] γ 1 2 Var t , x , μ ^ [ X ( T ) ] + γ 2 E t , x , μ ^ t T e r ( s t ) U ( B ( s ) B * e β s ) d s , s u c h t h a t π ( · ) , b ( · ) L F 2 ( 0 , T ; R ) , ( X ( · ) , π ( · ) , b ( · ) ) satisfy Equation ( 4 ) .

3.2. Equilibrium Problem

According to the composition of optimization model (5), the following notation is defined
y π , b : = y π , b ( t , x , μ ^ ) = E t , x , μ ^ [ X ( T ) ] , z π , b : = z π , b ( t , x , μ ^ ) = E t , x , μ ^ [ ( X ( T ) ) 2 ] , w π , b : = w π , b ( t , x , μ ^ ) = E t , x , μ ^ t T e r ( s t ) U ( B ( s ) B * e β s ) d s .
Therefore, the value function of the optimization problem (5) can be rewritten in the following form
f π , b ( t , x , μ ^ , y π , b , z π , b , w π , b ) = y π , b γ 1 2 [ z π , b ( y π , b ) 2 ] + γ 2 w π , b .
Definition 1.
(Equilibrium strategy) For a strategy ( π * ( t ) , b * ( t ) ) L F 2 ( 0 , T ; R ) with any initial state ( t , x , μ ^ ) [ 0 , T ] × R × R + , we define the following strategy
( π ˜ ε ( s ) , b ˜ ε ( s ) ) = ( π , b ) , t s < t + ε , ( π * ( s ) , b * ( s ) ) , t + ε s < T ,
where ε > 0 . If
lim inf ε 0 1 ε ( f π * , b * ( t , x , μ ^ , y π * , b * , z π * , b * , w π * , b * ) f π ˜ ε , b ˜ ε ( t , x , μ ^ , y π ˜ ε , b ˜ ε , z π ˜ ε , b ˜ ε , w π ˜ ε , b ˜ ε ) ) 0 ,
for all ( π , b ) , where f is the objective function, then ( π * ( t ) , b * ( t ) ) is called the equilibrium strategy, and the equilibrium value function V ( t , x , μ ^ ) is defined as
V ( t , x , μ ^ ) = f π * , b * ( t , x , μ ^ , y π * , b * , z π * , b * , w π * , b * ) .
C 1 , 2 , 2 ( [ 0 , T ] × R × R + ) = { ϕ ( t , x , μ ^ ) | ϕ ( t , · , · ) is once continuously differentiable on [ 0 , T ] , and ϕ ( · , x , μ ^ ) is twice continuously differentiable on R × R + }. For any ϕ ( t , x , μ ^ ) C 1 , 2 , 2 ( [ 0 , T ] × R × R + ) , we define an operator Λ π , b as
Λ π , b ϕ ( t , x , μ ^ ) = ϕ t + [ π ( μ ^ r ) + x r + C I L b ] ϕ x + k ( δ μ ^ ) ϕ μ ^ + 1 2 π 2 σ s 2 ϕ x x + 1 2 ( Σ σ s + σ s μ ) 2 ϕ μ ^ μ ^ + σ s π ( Σ σ s + σ s μ ) ϕ x μ ^ .
Lemma 1. 
Assume that the functions Y = Y ( t , x , μ ^ ) , Z = Z ( t , x , μ ^ ) , and W = W ( t , x , μ ^ ) satisfy the following system of equations
Λ π , b Y ( t , x , μ ^ ) = 0 , Y ( T , x , μ ^ ) = x ,
Λ π , b Z ( t , x , μ ^ ) = 0 , Z ( T , x , μ ^ ) = x 2 ,
Λ π , b W ( t , x , μ ^ ) = r W ( t , x , μ ^ ) U ( B ( t ) B * e β t ) , W ( T , x , μ ^ ) = 0 ;
then, Y ( t , x , μ ^ ) = y π , b ( t , x , μ ^ ) , Z ( t , x , μ ^ ) = Z π , b ( t , x , μ ^ ) , and W ( t , x , μ ^ ) = W π , b ( t , x , μ ^ ) .
Proof. 
See Appendix A. □
Lemma 2. 
If there exists a function F = F ( t , x , μ ^ ) that satisfies
F t = inf π , b Π { [ x r + C B + π ( μ ^ r ) ] ( F x Q 1 ) k ( δ μ ^ ) ( F μ ^ Q 2 ) 1 2 π 2 σ s 2 ( F x x U 1 π , b ) 1 2 ( Σ σ s + σ s μ ) 2 ( F μ ^ μ ^ U 2 π , b ) π σ s ( Σ σ s + σ s μ ) ( F x μ ^ U 3 π , b ) + J π , b } , F ( T , x , μ ^ ) = f π , b ( T , x , μ ^ , x , x 2 , 0 ) ,
where Q 1 = f x π , b , Q 2 = f μ ^ π , b , F ( 1 ) = y π , b ( t , x , μ ^ ) , F ( 2 ) = z π , b ( t , x , μ ^ ) , F ( 3 ) = w π , b ( t , x , μ ^ ) ,
J π , b = f t π , b + ( r F ( 3 ) U ( B ( t ) B * e β t ) ) f w π , b ,
U 1 π , b = f x x π , b + f y y π , b ( F x ( 1 ) ) 2 + f z z π , b ( F x ( 2 ) ) 2 + f w w π , b ( F x ( 3 ) ) 2 + 2 f x y π , b F x ( 1 ) + 2 f x z π , b F x ( 2 ) + 2 f x w π , b F x ( 3 ) + 2 f y z π , b F x ( 1 ) F x ( 2 ) + 2 f y w π , b F x ( 1 ) F x ( 3 ) + 2 f w z π , b F x ( 2 ) F x ( 3 ) ,
U 2 π , b = f μ ^ μ ^ π , b + f y y π , b ( F μ ^ ( 1 ) ) 2 + f z z π , b ( F μ ^ ( 2 ) ) 2 + f w w π , b ( F μ ^ ( 3 ) ) 2 + 2 f μ ^ y π , b F μ ^ ( 1 ) + 2 f μ ^ z π , b F μ ^ ( 2 ) + 2 f μ ^ w π , b F μ ^ ( 3 ) + 2 f y z π , b F μ ^ ( 1 ) F μ ^ ( 2 ) + 2 f y w π , b F μ ^ ( 1 ) F μ ^ ( 3 ) + 2 f w z π , b F μ ^ ( 2 ) F μ ^ ( 3 ) ,
U 3 π , b = f x μ ^ π , b + f x y π , b F μ ^ ( 1 ) + f x z π , b F μ ^ ( 2 ) + f x w π , b F μ ^ ( 3 ) + f μ ^ y π , b F x ( 1 ) + f μ ^ z π , b F x ( 2 ) + f μ ^ w π , b F x ( 3 ) + f y y π , b F x ( 1 ) F μ ^ ( 1 ) + f y z π , b F x ( 1 ) F μ ^ ( 2 ) + f y z π , b F x ( 2 ) F μ ^ ( 1 ) + f y w π , b F x ( 1 ) F μ ^ ( 3 ) + f y w π , b F x ( 3 ) F μ ^ ( 1 ) + f z z π , b F x ( 2 ) F μ ^ ( 2 ) + f z w π , b F x ( 2 ) F μ ^ ( 3 ) + f z w π , b F x ( 3 ) F μ ^ ( 2 ) + f w w π , b F x ( 3 ) F μ ^ ( 3 ) ,
and Lemma 1 holds for any strategy ( π , b ) L F 2 ( 0 , T ; R ) , then the equilibrium investment and benefit payment adjustment strategy ( π * ( t ) , b * ( t ) ) is given by
( π * ( t ) , b * ( t ) ) = arg inf π , b Π { F t [ x r + C B + π ( μ ^ r ) ] ( F x Q 1 ) k ( δ μ ^ ) ( F μ ^ Q 2 ) 1 2 π 2 σ s 2 ( F x x U 1 π , b ) 1 2 ( Σ σ s + σ s μ ) 2 ( F μ ^ μ ^ U 2 π , b ) π σ s ( Σ σ s + σ s μ ) ( F x μ ^ U 3 π , b ) + J π , b } ,
and F ( t , x , μ ^ ) = V ( t , x , μ ^ ) .
Proof. 
See Appendix B. □

3.3. Solution to the Equilibrium Problem

According to Lemma 2 and the first-order optimality condition, the following equations are obtained
π π ( μ ^ r ) ( F x Q 1 ) 1 2 π 2 σ s 2 ( F x x U 1 π , b ) π σ s ( Σ σ s + σ s μ ) ( F x μ ^ U 3 π , b ) = 0 , b B ( F x Q 1 ) f w π , b U ( I L b B * e β t ) = 0 .
It can be obtained by further solving
π * ( t ) = ( μ ^ ( t ) r ) ( F x Q 1 ) + ( Σ + σ s σ s μ ) ( F x μ ^ U 3 π * , b * ) σ s 2 ( F x x U 1 π * , b * ) , b * ( t ) = I 1 ( t ) L 1 ( t ) U B 1 F x Q 1 f w π * , B * + B * e β t ,
where [ f ] 1 ( · ) is the inverse function of f. It is assumed that the inverse exists. According to (6), we have f π , b ( t , x , μ ^ , y π , b , z π , b , w π , b ) ; thus,
f y π , b = 1 + γ 1 y π , b ,   f z π , b = γ 1 2 ,   f w π , b = γ 2 ,   f y y π , b = γ 1 ,   f t π , b = f z z π , b = f w w π , b = f x y π , b = f x z π , b = f x w π , b = f y z π , b = f y w π , b = f z w π , b = 0 .
Substituting the above partial derivatives into Lemma 2, we obtain
Q 1 = Q 2 = 0 , U 1 π * , b * = γ 1 ( F x ( 1 ) ) 2 , U 2 π * , b * = γ 1 ( F μ ^ ( 1 ) ) 2 , U 3 π * , b * = γ 1 F x ( 1 ) F μ ^ ( 1 ) , J π * , b * = γ 2 ( r F ( 3 ) U ( I L b * B * e β t ) ) .
Inspired by Yang [26] and Kang [27], we assume that F, F ( 1 ) , and F ( 3 ) have the following forms
F ( t , x , μ ^ ) = A ( t ) x + D ( t ) μ ^ 2 + G ( t ) μ ^ + H ( t ) , F ( 1 ) ( t , x , μ ^ ) = a ( t ) x + d ( t ) μ ^ 2 + g ( t ) μ ^ + h ( t ) , F ( 3 ) ( t , x , μ ^ ) = a ˜ ( t ) x + d ˜ ( t ) μ ^ 2 + g ˜ ( t ) μ ^ + h ˜ ( t ) ,
where
A ( T ) = 1 , D ( T ) = G ( T ) = H ( T ) = 0 , a ( T ) = 1 , d ( T ) = g ( T ) = h ( T ) = 0 , a ˜ ( T ) = d ˜ ( T ) = g ˜ ( T ) = h ˜ ( T ) = 0 .
Therefore, F ( 2 ) can be written as
F ( 2 ) ( t , x , μ ^ ) = 2 γ 1 F ( 1 ) + γ 2 F ( 3 ) F + ( F ( 1 ) ) 2 .
Inserting (19) and (21) into (18), we have
U 1 π * , b * = γ 1 ( a ( t ) ) 2 , U 2 π * , b * = γ 1 ( 2 d ( t ) μ ^ + g ( t ) ) 2 , U 3 π * , b * = γ 1 a ( t ) ( 2 d ( t ) μ ^ + g ( t ) ) , J π * , b * = γ 2 ( r F ( 3 ) U ( I L b * B * e β t ) ) .
The equilibrium strategy ( π * , b * ) satisfies
π * ( t ) = ( μ ^ ( t ) r ) A ( t ) γ 1 a ( t ) ( Σ + σ s σ s μ ) ( 2 d ( t ) μ ^ + g ( t ) ) γ 1 σ s 2 ( a ( t ) ) 2 , b * ( t ) = I 1 ( t ) L 1 ( t ) U B 1 A ( t ) γ 2 + B * e β t .
Substituting (19), (21), (22), and (23) into (9), (10), and (12), we obtain
x A t + r A + λ a γ 2 r a ˜ + μ ^ 2 D t 2 k D γ 2 r d ˜ + 1 2 γ 1 σ s 2 A 2 a 2 4 γ 1 ( Σ + σ s σ s μ ) A d a + μ ^ G t k G + 2 δ k D γ 2 r g ˜ + 1 γ 1 σ s 2 r A 2 a 2 γ 1 ( Σ + σ s σ s μ ) ( g 2 r d ) A a + { H t + δ k G + ( C I L b * ) A γ 2 r h ˜ + γ 2 U ( I L b * B * e β t ) + 1 γ 1 σ s 2 ( r A ) 2 2 a 2 + γ 1 r ( Σ + σ s σ s μ ) A g a + γ 1 ( Σ + σ s σ s μ ) 2 D } = 0 ,
x a t + a r + μ ^ 2 d t 2 k d + 1 γ 1 σ s 2 A a 2 γ 1 ( Σ + σ s σ s μ ) d + μ ^ { g t k g + 2 δ k d 1 γ 1 σ s 2 2 r A a + γ 1 ( Σ + σ s σ s μ ) ( g 2 r d ) } + { h t + δ k g + ( C I L b * ) a + 1 γ 1 σ s 2 r 2 A a + γ 1 r ( Σ + σ s σ s μ ) g + γ 1 ( Σ + σ s σ s μ ) 2 d } = 0 ,
x a ˜ t + μ ^ 2 d ˜ t ( 2 k + r ) d ˜ + 1 γ 1 σ s 2 A a ˜ a 2 2 γ 1 ( Σ + σ s σ s μ ) a ˜ d a + μ ^ { g ˜ t ( k + r ) g ˜ + 2 δ k d ˜ 1 γ 1 σ s 2 2 r A a ˜ a 2 + γ 1 ( Σ + σ s σ s μ ) ( g 2 r d ) a ˜ a } + { h ˜ t r h ˜ + δ k g ˜ + ( C I L b * ) a ˜ + U ( I L b * B * e β t ) + 1 γ 1 σ s 2 r 2 A a ˜ a 2 + γ 1 r ( Σ + σ s σ s μ ) a ˜ g a + γ 1 ( Σ + σ s σ s μ ) 2 d ˜ } = 0 .
Combined with the boundary conditions (20) for further calculation, we obtain
A ( t ) = a ( t ) = e r ( T t ) , a ˜ ( t ) = d ˜ ( t ) = g ˜ ( t ) = 0 , d ( t ) = 1 γ 1 2 k σ s 2 + 2 ( Σ + σ s σ s μ ) 1 e 2 k + 2 ( Σ + σ s σ s μ ) σ s 2 ( T t ) , D ( t ) = t T e 2 k ( t s ) 1 4 γ 1 ( Σ + σ s σ s μ ) d ( s ) 2 γ 1 σ s 2 d s , g ( t ) = t T e ( k + ( Σ + σ s σ s μ ) σ s 2 ) ( t s ) 2 k δ d ( s ) + 2 r γ 1 ( Σ + σ s σ s μ ) d ( s ) 1 γ 1 σ s 2 d s , G ( t ) = t T e k ( t s ) 2 δ k D ( s ) r + γ 1 ( Σ + σ s σ s μ ) ( g ( s ) 2 r d ( s ) ) γ 1 σ s 2 d s , h ( t ) = t T h 1 ( s ) d s , H ( t ) = t T H 1 ( s ) d s , h ˜ ( t ) = t T e r ( t s ) U ( I ( s ) L ( s ) b * ( s ) B * e β s ) d s ,
where
h 1 ( s ) = δ k g ( s ) + ( C ( s ) I ( s ) L ( s ) b * ( s ) ) a ( s ) + r 2 + γ 1 r ( Σ + σ s σ s μ ) g ( s ) + γ 1 ( Σ + σ s σ s μ ) 2 d ( s ) γ 1 σ s 2 , H 1 ( s ) = δ k G ( s ) + ( C ( s ) I ( s ) L ( s ) b * ( s ) ) A ( s ) γ 2 r h ˜ ( s ) + γ 2 U ( I ( s ) L ( s ) b * ( s ) B * e β s ) + r 2 + 2 γ 1 r ( Σ + σ s σ s μ ) g ( s ) + 2 γ 1 ( Σ + σ s σ s μ ) 2 D ( s ) 2 γ 1 σ s 2 .
Theorem 1. 
The equilibrium strategy of the optimization problem (5) is
π * ( t ) = μ ^ ( t ) r γ 1 ( Σ + σ s σ s μ ) ( 2 d ( t ) μ ^ + g ( t ) ) γ 1 σ s 2 a ( t ) , b * ( t ) = I 1 ( t ) L 1 ( t ) U B 1 A ( t ) γ 2 + B * e β t .
The equilibrium value function is
V ( t , x , μ ^ ) = F ( t , x , μ ^ ) = A ( t ) x + D ( t ) μ ^ 2 + G ( t ) μ ^ + H ( t ) ,
and the expected terminal wealth under the equilibrium strategy is
E t , x , μ ^ [ X ( T ) ] = F ( 1 ) ( t , x , μ ^ ) = a ( t ) x + d ( t ) μ ^ 2 + g ( t ) μ ^ + h ( t ) .
Remark 1. 
Different optimal benefit payment adjustment strategies can be obtained by choosing different utility functions in the optimization problem (5).
(i) With a logarithmic utility function, i.e., U ( B ( t ) B * e β t ) = log ( B ( t ) B * e β t ) , the optimal benefit payment strategy satisfies
b * = I 1 ( t ) L 1 ( t ) γ 2 A ( t ) + B * e β t .
(ii) With a power utility function, i.e., U ( B ( t ) B * e β t ) = ( B ( t ) B * e β t ) θ / θ , the optimal benefit payment strategy becomes
b * = I 1 ( t ) L 1 ( t ) A ( t ) γ 2 1 θ 1 + B * e β t .
(iii) With an exponential utility function, i.e., U ( B ( t ) B * e β t ) = exp { η ( B ( t ) B * e β t ) } /η, the optimal benefit payment strategy can be obtained as
b * = I 1 ( t ) L 1 ( t ) 1 η ln γ 2 A ( t ) + B * e β t .
Remark 2. 
(Complete Information Case)  For the optimization problem with complete information, i.e., μ ( t ) is completely observable, the equilibrium strategy is
π ¯ * ( t ) = μ ( t ) r γ 1 σ s σ s μ ( 2 d 0 ( t ) μ ( t ) + g 0 ( t ) ) γ 1 σ s 2 a ( t ) , b ¯ * ( t ) = I 1 ( t ) L 1 ( t ) U B 1 A ( t ) γ 2 + B * e β t .
The value function is
V ¯ ( t , x , μ ) = F ¯ ( t , x , μ ) = A ( t ) x + D 0 ( t ) μ 2 + G 0 ( t ) μ + H 0 ( t ) .
The expected terminal wealth under the equilibrium strategy is
E t , x , μ [ X ( T ) ] = F ¯ ( 1 ) ( t , x , μ ) = a ( t ) x + d 0 ( t ) μ 2 + g 0 ( t ) μ + h 0 ( t ) ,
where
d 0 ( t ) = 1 2 γ 1 σ s k σ s + σ s μ 1 e 2 k + σ s μ σ s ( T t ) , D 0 ( t ) = t T e 2 k ( t s ) 2 γ 1 σ μ 2 d 0 2 ( s ) + 1 2 γ 1 σ s 2 2 σ s μ d 0 ( s ) σ s d s , g 0 ( t ) = t T e ( k + σ s μ σ s ) ( t s ) 2 k δ d 0 ( s ) + 2 r γ 1 σ s σ s μ d 0 ( s ) 1 γ 1 σ s 2 d s , G 0 ( t ) = t T e k ( t s ) 2 δ k D 0 ( s ) 2 γ 1 σ μ 2 g 0 ( s ) d 0 ( s ) r + γ 1 σ s σ s μ ( g 0 ( s ) 2 r d 0 ( s ) ) γ 1 σ s 2 d s , h ˜ 0 ( t ) = t T e r ( t s ) U ( I ( s ) L ( s ) b ¯ * ( s ) B * e β s ) d s , h 0 ( t ) = t T h ¯ 1 ( s ) d s , H 0 ( t ) = t T H ¯ 1 ( s ) d s , h ¯ 1 ( t ) = δ k g 0 ( t ) + ( C ( t ) I ( t ) L ( t ) b ¯ * ( t ) ) a ( t ) + ( σ μ 2 + σ s μ 2 ) d 0 ( t ) + r 2 + γ 1 r σ s σ s μ g 0 ( t ) γ 1 σ s 2 , H ¯ 1 ( t ) = δ k G 0 ( t ) + ( C ( t ) I ( t ) L ( t ) b ¯ * ( t ) ) A ( t ) + ( σ μ 2 + σ s μ 2 ) D 0 ( t ) r γ 2 h ˜ 0 ( t ) 1 2 γ 1 σ μ 2 g 0 ( t ) + γ 2 U ( I ( t ) L ( t ) b ¯ * ( t ) B * e β t ) + r 2 + 2 r γ 1 σ s σ s μ g 0 ( t ) 2 γ 1 σ s 2 .
Equation (27) represents the equilibrium strategy in the partial information case, where the expected stock returns are only partially observable. In contrast, Equation (28) represents the equilibrium strategy in the complete information case, where the expected stock returns are fully observable. This disparity in the observability of the expected stock returns has significant implications for decision making and portfolio optimization. In the complete information case, with the complete observability of the expected returns, investors have access to more accurate and reliable information. This enables them to make more informed investment decisions and construct optimal portfolios based on the precise estimations of future stock performance. However, in the partial information case where the expected stock returns are only partially observable, investors face uncertainty and must rely on imperfect information. This introduces challenges in estimating future returns and constructing optimal portfolios, as there is a higher level of ambiguity and risk involved.

4. Numerical Example

We study the influence of various factors on the equilibrium investment and benefit payment adjustment strategy. Similar to [11,23,24], we make the following assumptions.
  • The time horizon T is set to 30 years.
  • The age when the pension participants join the pension plan is 30, the retirement age is 65, the maximum survival age at time 0 is 95, and we have a = 30 , b = 65 , m ( 0 ) = 100 .
  • Let the maximum survival age growth rate of the pension participants φ be 0.2 and adjust the survival function parameter λ = 0.5 . In the model set out in this paper, when the pension plan ends, the maximum survival age of the participant is 106 years old, which is within the reasonable maximum survival age range.
  • The number of people newly enrolled in the pension scheme at time t is 20.
  • The average annual salary L ( 0 ) of the newly retired at time 0 is 1, and the expected annual salary increase rate α is 0.03.
  • The target benefit B * of pension is 100, and the growth rate β = 0.02 . The cost of living adjustment rate θ = 0.02 .
  • The instantaneous payment rate C 0 at time 0 is set to 0.3, and the increase rate α = 0.03 .
  • The risk-free interest rate r = 0.04 , the stock price volatility σ s = 0.2 , and the values of the parameters related to the stock yield μ ( t ) are k = 0.043 , δ = 0.2 , σ s μ = 0.09 , and σ μ = 0.1 .
  • The risk avoidance parameter γ 1 = 1 , and the preference factor γ 2 = 1 .

4.1. Equilibrium Benefit Adjustment Strategy

This section shows the change in the payment adjustment strategy b * ( t ) in the time interval, gives the influence relationship between the various parameters and b * ( t ) , and provides a reasonable explanation.
Figure 1 shows the change in the equalization payment B ( t ) . As expected, B ( t ) increased with time. In addition, we added the track of the target payment B * e β t as a reference in the figure, and we see that although the actual payment was slightly less than the target payment B * e β t in the initial stage, the gap narrowed with the increase in the time, and the actual payment was able to reach the target payment in the later stage. This indicates that the equilibrium benefit adjustment strategy obtained in this paper can meet the requirements of the target benefit pension and provide stable retirement benefits for retired members.
Figure 2 shows the influence of the preference factor γ 2 on the payment adjustment strategy b * ( t ) . When γ 2 increases, the payment adjustment strategy increases. This is because γ 2 represents the manager’s attention to the benefit. With the increase in γ 2 , the manager is more inclined to realize the payment target.
Figure 3 describes the trajectory of the equilibrium payment adjustment strategy b * ( t ) under different risk aversion coefficients η when the manager selects the exponential utility function. As can be seen from the figure, with the increase in η , the manager chose a larger payment ratio, and the adjustment range was amaller, which was finally maintained around 0.197.
It can be seen from Figure 4 that the growth rate β of the target payment was positively correlated with the equilibrium payment adjustment strategy b * ( t ) . The explanation for this phenomenon is that B * e β t increases with β . At this time, the manager increases the current payment ratio in order to achieve the target benefit or keep the benefit stable.
Figure 5 shows the relationship between the adjustment rate θ and the equalization benefit adjustment strategy b * ( t ) . As can be seen from the figure, with the increase in θ , the adjustment range of the payment ratio b * ( t ) became smaller. The reason for this result is that when θ increases, the actual benefit of the plan participant increases accordingly. However, during this process, the target benefit set by the target benefit pension plan remains unchanged; so, the plan manager can easily achieve the target benefit without an excessive adjustment of the benefit ratio.

4.2. Equilibrium Investment Strategy

This section presents the relationship between the various parameters and the equilibrium investment strategy π * ( t ) and provides a reasonable explanation.
Figure 6 shows the influence of the risk aversion factor γ 1 on π * ( t ) . Given the risk aversion parameter, π * ( t ) , gradually decreased with time t. This is because over time, as pension funds accumulate wealth, investment in stocks gradually becomes unnecessary, and managers reduce their investment in stocks. In addition, it can be seen from Figure 6 that π * ( t ) decreased with the increase in γ 1 . When the risk aversion coefficient γ 1 increases, it means that the model is more risk averse. At this time, the plan manager has a more conservative attitude towards risky assets, and she/he is more willing to buy risk-free assets, which leads to the corresponding decrease in π * ( t ) .
Figure 7 describes the relationship between the volatility coefficient of the stock price σ s and π * ( t ) . Strategy π * ( t ) decreases with the increase in σ s . The increase in σ s means that the risk of investing in stocks is greater, and managers tend to be more conservative at this time.
With the increase in σ s , the amount of money invested in stocks decreased. The increase in σ s means that the risk of investing in stocks is greater, and then managers tend to invest in a conservative way.
Figure 8 analyzes the influence of the parameters k and σ s μ on π * ( t ) . The figure shows that π * ( t ) increased with k and decreased with σ s μ . The greater the speed of the mean-reversion k, the stronger the predictability of the stock returns, and the lower the risk, the more managers understand the information of stock returns, and there is a corresponding increase in the stock investment funds. When σ s μ is larger, the stock returns are more volatile, and the risk is greater; hence, the manager reduces the funds invested in the stock.

4.3. Influence of the Longevity Trend on the Equalization Benefit Adjustment Strategy

This section presents and compares the equalization payment adjustment strategies b * ( t ) with and without the longevity trend.
We set the maximum survival age of the pension participants as a constant m ( 0 ) when disregarding the longevity trends. As can be seen from Figure 9, when the maximum survival age of the pension participants increased with time, that is, when there was a longevity trend, the equilibrium benefit adjustment strategy b * ( t ) was lower than the equilibrium benefit adjustment strategy without considering the longevity trend, and with the growth of time, the difference in the equilibrium benefit adjustment strategy in the two cases gradually increased. For pension managers, the rise in the maximum survival age has put enormous pressure on the pension system. To ensure the stability and sustainability of the pension system, administrators reduce the adjustment of pension benefits b * ( t ) , reducing benefits for retirees.

5. Conclusions

This paper studied the optimal investment and benefit adjustment of target benefit pension plans with longevity trends and partial information. The model in this paper assumes that there is longevity risk inside the pension system, and the fund account income risk is shared by different generations of insured persons. The mean-variance utility model was adopted to maximize the terminal wealth of the pension accounts, minimize the investment risk, and maximize the pension excess benefit.
To solve the partial information problems, we first used the Kalman filter theory to estimate the stock return rate that cannot be completely observed. Since the optimization objective adopted in this paper was time-inconsistent, it was similar to [26,27], and we used the thought of game theory to solve it to obtain the equilibrium investment and benefit adjustment payment strategy. In addition, we compared the equilibrium strategies in the partially measurable case and the fully measurable case and also gave the equilibrium payment adjustment strategies under different utility functions. Finally, we provided some numerical analysis to verify our theoretical results. The conclusion shows that our model results can provide stable and sufficient retirement benefits to the participants. With the effect of longevity risk, pension fund managers will ensure the asset stability of the fund account by reducing the benefit payment rate.
However, there were some limitations to this study. For instance, we did not consider inflation, which is a widespread economic variable. Future research can explore the optimal management of target benefit pension plans in further depth.

Author Contributions

Writing—original draft, N.L.; Writing—review & editing, W.L. and A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China (Nos: 11961064).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors are very grateful to the editors and the anonymous referees for their constructive and valuable comments and suggestions, which led to the greatly improved version of our manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof. 
To simplify the writing, we denote f ( t , X π , b ( t ) , μ ^ ( t ) ) by f ( t , X π , b , μ ^ ( t ) ) . According to the Itô lemma, one has
Y ( t , X π , b , μ ^ ( t ) ) = Y ( T , X π , b , μ ^ ( T ) ) t T d Y ( s , X , μ ^ ( s ) ) = Y ( T , X π , b , μ ^ ( T ) ) t T { Y s ( s , X π , b , μ ^ ( s ) ) + [ r X ( s ) + π ( s ) ( μ ^ ( s ) r ) + C ( s ) B ( s ) ] Y x ( s , X π , b , μ ^ ( s ) ) + k ( δ μ ^ ( s ) ) Y μ ^ ( s , X π , b , μ ^ ( s ) ) + 1 2 π 2 ( s ) σ s 2 Y x x ( s , X π , b , μ ^ ( s ) ) + 1 2 ( Σ σ s + σ s μ ) 2 Y μ ^ μ ^ ( s , X π , b , μ ^ ( s ) ) + π ( s ) σ s ( Σ σ s + σ s μ ) Y x μ ^ ( s , X π , b , μ ^ ( s ) ) } d s t T π ( s ) σ s Y x ( s , X π , b , μ ^ ( s ) ) d B s ( s ) t T ( Σ σ s + σ s μ ) Y μ ^ ( s , X π , b , μ ^ ( s ) ) d B s ( s ) = Y ( T , X π , b , μ ^ ( T ) ) t T Λ π , b Y ( s , X π , b , μ ^ ( s ) ) d s t T π ( s ) σ s Y x ( s , X π , b , μ ^ ( s ) ) d B s ( s ) t T ( Σ σ s + σ s μ ) Y μ ^ ( s , X π , b , μ ^ ( s ) ) d B s ( s ) = X π , b ( T ) t T π ( s ) σ s Y x ( s , X π , b , μ ^ ( s ) ) d B s ( s ) t T ( Σ σ s + σ s μ ) Y μ ^ ( s , X π , b , μ ^ ( s ) ) d B s ( s ) .
Taking the conditional expectation for (A1), we have
Y ( t , X π , b , μ ^ ( t ) ) = E t , x , μ ^ [ X π , b ( T ) ] = y π , b ( t , x , μ ^ ) .
Similarly, applying Itô lemma to Z ( t , X π , b ( t ) , μ ^ ( t ) ) yields
Z ( t , X π , b , μ ^ ( t ) ) = Z ( T , X π , b , μ ^ ( T ) ) t T d Z ( s , X , μ ^ ( s ) ) = Z ( T , X π , b , μ ^ ( T ) ) t T { Z s ( s , X π , b , μ ^ ( s ) ) + [ r X ( s ) + π ( s ) ( μ ^ ( s ) r ) + C ( s ) B ( s ) ] Z x ( s , X π , b , μ ^ ( s ) ) + k ( δ μ ^ ( s ) ) Z μ ^ ( s , X π , b , μ ^ ( s ) ) + 1 2 π 2 ( s ) σ s 2 Z x x ( s , X π , b , μ ^ ( s ) ) + 1 2 ( Σ σ s + σ s μ ) 2 Z μ ^ μ ^ ( s , X π , b , μ ^ ( s ) ) + π ( s ) σ s ( Σ σ s + σ s μ ) Z x μ ^ ( s , X π , b , μ ^ ( s ) ) } d s t T π ( s ) σ s Z x ( s , X π , b , μ ^ ( s ) ) d B s ( s ) t T ( Σ σ s + σ s μ ) Z μ ^ ( s , X π , b , μ ^ ( s ) ) d B s ( s ) = Z ( T , X π , b , μ ^ ( T ) ) t T Λ π , b Z ( s , X π , b , μ ^ ( s ) ) d s t T π ( s ) σ s Z x ( s , X π , b , μ ^ ( s ) ) d B s ( s ) t T ( Σ σ s + σ s μ ) Z μ ^ ( s , X π , b , μ ^ ( s ) ) d B s ( s ) = ( X π , b ( T ) ) 2 t T π ( s ) σ s Z x ( s , X π , b , μ ^ ( s ) ) d B s ( s ) t T ( Σ σ s + σ s μ ) Z μ ^ ( s , X π , b , μ ^ ( s ) ) d B s ( s ) .
Taking the expectation for (A2), we have
Z ( t , X π , b , μ ^ ( t ) ) = E t , x , μ ^ [ ( X π , b ( T ) ) 2 ] = z π , b ( t , x , μ ^ ) .
Finally, letting W ˜ ( t , x , μ ^ ) = e r t W ( t , x , μ ^ ) , then W ( t , x , μ ^ ) = e r t W ˜ ( t , x , μ ^ ) , and substituting it into (11) yields
Λ π , b W ˜ ( t , x , μ ^ ) = e r t U ( B ( t ) B * e β t ) , W ˜ ( T , x , μ ^ ) = 0 .
According to the Itô lemma, it further yields
W ˜ ( t , X π , b , μ ^ ( t ) ) = W ˜ ( T , X π , b , μ ^ ( T ) ) t T d W ˜ ( s , X , μ ^ ( s ) ) = W ˜ ( T , X π , b , μ ^ ( T ) ) t T { W ˜ s ( s , X π , b , μ ^ ( s ) ) + [ r X ( s ) + π ( s ) ( μ ^ ( s ) r ) + C ( s ) B ( s ) ] W ˜ x ( s , X π , b , μ ^ ( s ) ) + k ( δ μ ^ ( s ) ) W ˜ μ ^ ( s , X π , b , μ ^ ( s ) ) + 1 2 π 2 ( s ) σ s 2 W ˜ x x ( s , X π , b , μ ^ ( s ) ) + 1 2 ( Σ σ s + σ s μ ) 2 W ˜ μ ^ μ ^ ( s , X π , b , μ ^ ( s ) ) + π ( s ) σ s ( Σ σ s + σ s μ ) W ˜ x μ ^ ( s , X π , b , μ ^ ( s ) ) } d s t T π ( s ) σ s W ˜ x ( s , X π , b , μ ^ ( s ) ) d B s ( s ) t T ( Σ σ s + σ s μ ) W ˜ μ ^ ( s , X π , b , μ ^ ( s ) ) d B s ( s ) = W ˜ ( T , X π , b , μ ^ ( T ) ) t T Λ π , b W ˜ ( s , X π , b , μ ^ ( s ) ) d s t T π ( s ) σ s W ˜ x ( s , X π , b , μ ^ ( s ) ) d B s ( s ) t T ( Σ σ s + σ s μ ) W ˜ μ ^ ( s , X π , b , μ ^ ( s ) ) d B s ( s ) = t T e r s U ( B ( s ) B * e β s ) d s t T π ( s ) σ s W ˜ x ( s , X π , b , μ ^ ( s ) ) d B s ( s ) t T ( Σ σ s + σ s μ ) W ˜ μ ^ ( s , X π , b , μ ^ ( s ) ) d B s ( s ) .
Taking the expectation for (A3), it is straightforward that
W ˜ ( t , X π , b , μ ^ ( t ) ) = E [ t T e r s U ( B ( s ) B * e β s ) d s | X ( t ) = x , μ ^ ( t ) = μ ^ ] ;
thus,
W ( t , x , μ ^ ) = e r t W ˜ ( t , x , μ ^ ) = w π , b ( t , x , μ ^ ) .

Appendix B

Proof. 
This proof process is divided into three steps. To simplify the writing, we denote f π , b ( t , X π . b ( t ) , μ ^ π , b ( t ) , y ( t , X π , b ( t ) , μ ^ π , b ( t ) ) , z ( t , X π , b ( t ) , μ ^ π , b ( t ) ) , w ( t , X π , b ( t ) , μ ^ π , b ( t ) ) ) by f π , b ( t , X π . b , μ ^ π , b ( t ) , y π , b , z π , b , w π , b ) .
Step 1. Applying the Itô lemma, we have
f π , b ( t , X π . b , μ ^ π , b ( t ) , y π , b , z π , b , w π , b ) = f π , b ( T , X π . b , μ ^ π , b ( T ) , y π , b , z π , b , w π , b ) t T d f π , b ( s , X π . b , μ ^ π , b ( s ) , y π , b , z π , b , w π , b ) = f π , b ( T , X π . b , μ ^ π , b ( T ) , y π , b , z π , b , w π , b ) t T ( f s π , b + f y π , b Y s + f z π , b Z s + f w π , b W s ) d s + ( f x π , b + f y π , b Y x + f z π , b Z x + f w π , b W x ) d X π , b ( s ) + ( f μ ^ π , b + f y π , b Y μ ^ + f z π , b Z μ ^ + f w π , b W μ ^ ) d μ ^ ( s ) + 1 2 π 2 ( s ) σ s 2 [ f x x π , b + f y π , b Y x x + f z π , b Z x x + f w π , b W x x + f y y π , b ( Y x ) 2 + f z z π , b ( Z x ) 2 + f w w π , b ( W x ) 2 + 2 f x y π , b Y x + 2 f x z π , b Z x + 2 f x w π , b W x + 2 f y z π , b Y x Z x + 2 f y w π , b Y x W x + 2 f z w π , b Z x W x ] d s + 1 2 ( Σ σ s + σ s μ ) 2 [ f μ ^ μ ^ π , b + f y π , b Y μ ^ μ ^ + f z π , b Z μ ^ μ ^ + f w π , b W μ ^ μ ^ + f y y π , b ( Y μ ^ ) 2 + f z z π , b ( Z μ ^ ) 2 + f w w π , b ( W μ ^ ) 2 + 2 f μ ^ y π , b Y μ ^ + 2 f μ ^ z π , b Z μ ^ + 2 f μ ^ w π , b W μ ^ + 2 f y z π , b Y μ ^ Z μ ^ + 2 f y w π , b Y μ ^ W μ ^ + 2 f z w π , b Z μ ^ W μ ^ ] d s + π ( s ) σ s ( Σ σ s + σ s μ ) [ f x μ ^ π , b + f y π , b Y x μ ^ + f z π , b Z x μ ^ + f w π , b W x μ ^ + f x y π , b Y μ ^ + f x z π , b Z μ ^ + f x w π , b W μ ^ + f y μ ^ π , b Y x + f z μ ^ π , b Z x + f w μ ^ π , b W x + f y y π , b Y x Y μ ^ + f y z π , b Y x Z μ ^ + f y w π , b Y x W μ ^ + f z z π , b Z x Z μ ^ + f z y π , b Z x Y μ ^ + f z w π , b Z x W μ ^ + f w w π , b W x W μ ^ + f w y π , b W x Y μ ^ + f w z π , b W x Z μ ^ ] d s .
Using (9)–(11) further yields
f π , b ( t , X π . b , μ ^ π , b ( t ) , y π , b , z π , b , w π , b ) = f π , b ( T , X π . b , μ ^ π , b ( T ) , y π , b , z π , b , w π , b ) t T f s π , b d s + f y π , b { [ π ( s ) ( μ ^ ( s ) r ) + X π , b ( s ) r + C ( s ) B ( s ) ] Y x k ( δ μ ^ ( s ) ) Y μ ^ 1 2 π 2 ( s ) σ s 2 Y x x 1 2 ( Σ σ s + σ s μ ) 2 Y μ ^ μ ^ σ s π ( s ) ( Σ σ s + σ s μ ) Y x μ ^ } d s + f z π , b { [ π ( s ) ( μ ^ ( s ) r ) + X π , b ( s ) r + C ( s ) B ( s ) ] Z x k ( δ μ ^ ( s ) ) Z μ ^ 1 2 π 2 ( s ) σ s 2 Z x x 1 2 ( Σ σ s + σ s μ ) 2 Z μ ^ μ ^ σ s π ( s ) ( Σ σ s + σ s μ ) Z x μ ^ } d s + f w π , b { r W U ( B ( s ) B * e β s ) [ π ( s ) ( μ ^ ( s ) r ) + X π , b ( s ) r + C ( s ) B ( s ) ] W x k ( δ μ ^ ( s ) ) W μ ^ 1 2 π 2 ( s ) σ s 2 W x x 1 2 ( Σ σ s + σ s μ ) 2 W μ ^ μ ^ σ s π ( s ) ( Σ σ s + σ s μ ) W x μ ^ } d s + ( f x π , b + f y π , b Y x + f z π , b Z x + f w π , b W x ) { [ π ( s ) ( μ ^ ( s ) r ) + X π , b ( s ) r + C ( s ) B ( s ) ] d s + π ( s ) σ s d B s ( s ) } + ( f μ ^ π , b + f y π , b Y μ ^ + f z π , b Z μ ^ + f w π , b W μ ^ ) [ k ( δ μ ^ ( s ) ) d s + ( Σ σ s + σ s μ ) d B s ( s ) ] + 1 2 π 2 ( s ) σ s 2 [ f x x π , b + f y π , b Y x x + f z π , b Z x x + f w π , b W x x + f y y π , b ( Y x ) 2 + f z z π , b ( Z x ) 2 + f w w π , b ( W x ) 2 + 2 f x y π , b Y x + 2 f x z π , b Z x + 2 f x w π , b W x + 2 f y z π , b Y x Z x + 2 f y w π , b Y x W x + 2 f z w π , b Z x W x ] d s + 1 2 ( Σ σ s + σ s μ ) 2 [ f μ ^ μ ^ π , b + f y π , b Y μ ^ μ ^ + f z π , b Z μ ^ μ ^ + f w π , b W μ ^ μ ^ + f y y π , b ( Y μ ^ ) 2 + f z z π , b ( Z μ ^ ) 2 + f w w π , b ( W μ ^ ) 2 + 2 f μ ^ y π , b Y μ ^ + 2 f μ ^ z π , b Z μ ^ + 2 f μ ^ w π , b W μ ^ + 2 f y z π , b Y μ ^ Z μ ^ + 2 f y w π , b Y μ ^ W μ ^ + 2 f z w π , b Z μ ^ W μ ^ ] d s + π ( s ) σ s ( Σ σ s + σ s μ ) [ f x μ ^ π , b + f y π , b Y x μ ^ + f z π , b Z x μ ^ + f w π , b W x μ ^ + f x y π , b Y μ ^ + f x z π , b Z μ ^ + f x w π , b W μ ^ + f y μ ^ π , b Y x + f z μ ^ π , b Z x + f w μ ^ π , b W x + f y y π , b Y x Y μ ^ + f y z π , b Y x Z μ ^ + f y w π , b Y x W μ ^ + f z z π , b Z x Z μ ^ + f z y π , b Z x Y μ ^ + f z w π , b Z x W μ ^ + f w w π , b W x W μ ^ + f w y π , b W x Y μ ^ + f w z π , b W x Z μ ^ ] d s .
Therefore,
f π , b ( t , X π . b , μ ^ π , b ( t ) , y π , b , z π , b , w π , b ) = f π , b ( T , X π . b , μ ^ π , b ( T ) , y π , b , z π , b , w π , b ) t T [ f s π , b + ( r W U ( B ( s ) B * e β s ) ) f w π , b ] d s + f x π , b [ π ( s ) ( μ ^ ( s ) r ) + X π , b ( s ) r + C ( s ) B ( s ) ] d s + f μ ^ π , b k ( δ μ ^ ( s ) ) d s + ( f x π , b + f y π , b Y x + f z π , b Z x + f w π , b W x ) π ( s ) σ s d B s ( s ) + ( f μ ^ π , b + f y π , b Y μ ^ + f z π , b Z μ ^ + f w π , b W μ ^ ) ( Σ σ s + σ s μ ) d B s ( s ) + 1 2 π 2 ( s ) σ s 2 [ f x x π , b + f y y π , b ( Y x ) 2 + f z z π , b ( Z x ) 2 + f w w π , b ( W x ) 2 + 2 f x y π , b Y x + 2 f x z π , b Z x + 2 f x w π , b W x + 2 f y z π , b Y x Z x + 2 f y w π , b Y x W x + 2 f z w π , b Z x W x ] d s + 1 2 ( Σ σ s + σ s μ ) 2 [ f μ ^ μ ^ π , b + f y y π , b ( Y μ ^ ) 2 + f z z π , b ( Z μ ^ ) 2 + f w w π , b ( W μ ^ ) 2 + 2 f μ ^ y π , b Y μ ^ + 2 f μ ^ z π , b Z μ ^ + 2 f μ ^ w π , b W μ ^ + 2 f y z π , b Y μ ^ Z μ ^ + 2 f y w π , b Y μ ^ W μ ^ + 2 f z w π , b Z μ ^ W μ ^ ] d s + π ( s ) σ s ( Σ σ s + σ s μ ) [ f x μ ^ π , b + f x y π , b Y μ ^ + f x z π , b Z μ ^ + f x w π , b W μ ^ + f y μ ^ π , b Y x + f z μ ^ π , b Z x + f w μ ^ π , b W x + f y y π , b Y x Y μ ^ + f y z π , b Y x Z μ ^ + f y w π , b Y x W μ ^ + f z z π , b Z x Z μ ^ + f z y π , b Z x Y μ ^ + f z w π , b Z x W μ ^ + f w w π , b W x W μ ^ + f w y π , b W x Y μ ^ + f w z π , b W x Z μ ^ ] d s .
We define
F ( 1 ) = y π , b ( t , x , μ ^ ) , F ( 2 ) = z π , b ( t , x , μ ^ ) , F ( 3 ) = w π , b ( t , x , μ ^ ) .
It follows from (13)–(16) that
f π , b ( t , X π . b , μ ^ π , b ( t ) , y π , b , z π , b , w π , b ) = f π , b ( T , X π . b , μ ^ π , b ( T ) , y π , b , z π , b , w π , b ) t T { J π , b ( s ) + f x π , b [ π ( s ) ( μ ^ ( s ) r ) + X π , b ( s ) r + C ( s ) B ( s ) ] + f μ ^ π , b k ( δ μ ^ ( s ) ) } d s + ( f x π , b + f y π , b Y x + f z π , b Z x + f w π , b W x ) π ( s ) σ s d B s ( s ) + ( f μ ^ π , b + f y π , b Y μ ^ + f z π , b Z μ ^ + f w π , b W μ ^ ) ( Σ σ s + σ s μ ) d B s ( s ) + [ 1 2 π 2 ( s ) σ s 2 U 1 π , b ( s ) + 1 2 ( Σ σ s + σ s μ ) 2 U 2 π , b ( s ) + π ( s ) σ s ( Σ σ s + σ s μ ) U 3 π , b ( s ) ] d s .
Step 2. Applying the Itô lemma to F ( t , x , μ ^ ) yields
F ( t , X π , b , μ ^ ( t ) ) = F ( T , X π , b , μ ^ ( T ) ) t T d F ( s , X π , b , μ ^ ( s ) ) = F ( T , X π , b , μ ^ ( T ) ) t T F s d s + F x d X π , b ( s ) + F μ ^ d μ ^ ( s ) + 1 2 π 2 ( s ) σ s 2 F x x d s + 1 2 ( Σ σ s + σ s μ ) 2 F μ ^ μ ^ d s + π ( s ) σ s ( Σ σ s + σ s μ ) F x μ ^ d s .
Since F ( t , x , μ ^ ) satisfies (12) for any arbitrary strategy ( π , b ) , we have
F t [ x r + C B + π ( μ ^ r ) ] ( F x Q 1 ) k ( δ μ ^ ) ( F μ ^ Q 2 ) 1 2 π 2 σ s 2 ( F x x U 1 π , b ) 1 2 ( Σ σ s + σ s μ ) 2 ( F μ ^ μ ^ U 2 π , b ) π σ s ( Σ σ s + σ s μ ) ( F x μ ^ U 3 π , b ) + J π , b .
Setting x = X π , b ( t ) and μ ^ = μ ^ ( t ) into (A6) and using (A5) and (9)–(11),
F ( t , X π , b , μ ^ ( t ) ) F ( T , X π , b , μ ^ ( T ) ) t T { { [ X π , b ( s ) r + C ( s ) B ( s ) + π ( s ) ( μ ^ ( s ) r ) ] ( F x Q 1 ) k ( δ μ ^ ( s ) ) ( F μ ^ Q 2 ) 1 2 π 2 ( s ) σ s 2 ( F x x U 1 π , b ) 1 2 ( Σ σ s + σ s μ ) 2 ( F μ ^ μ ^ U 2 π , b ) π ( s ) σ s ( Σ σ s + σ s μ ) ( F x μ ^ U 3 π , b ) + J π , b } d s + F x [ X π , b ( s ) r + C ( s ) B ( s ) + π ( s ) ( μ ^ ( s ) r ) ] d s + π ( s ) σ s d B s ( s ) + F μ ^ [ k ( δ μ ^ ( s ) ) d s + ( Σ σ s + σ s μ ) d B s ( s ) ] + 1 2 π 2 ( s ) σ s 2 F x x d s + 1 2 ( Σ σ s + σ s μ ) 2 F μ ^ μ ^ d s + π ( s ) σ s ( Σ σ s + σ s μ ) F x μ ^ d s } = f π , b ( T , X π , b ( T ) , μ ^ ( T ) , X π , b ( T ) , ( X π , b ( T ) ) 2 , 0 ) t T { { [ X π , b ( s ) r + C ( s ) B ( s ) + π ( s ) ( μ ^ ( s ) r ) ] Q 1 + k ( δ μ ^ ( s ) ) Q 2 + 1 2 π 2 ( s ) σ s 2 U 1 π , b + 1 2 ( Σ σ s + σ s μ ) 2 U 2 π , b + π ( s ) σ s ( Σ σ s + σ s μ ) U 3 π , b + J π , b } d s + F x π ( s ) σ s d B s ( s ) + F μ ^ ( Σ σ s + σ s μ ) d B s ( s ) . }
Substituting (A4) into the above inequality, we obtain
F ( t , X π , b , μ ^ ( t ) ) f π , b ( t , X π , b , μ ^ ( t ) , y π , b , z π , b , w π , b ) + t T { ( f x π , b + f y π , b Y x + f z π , b Z x + f w π , b W x F x ) π ( s ) σ s d B s ( s ) + ( f μ ^ π , b + f y π , b Y μ ^ + f z π , b Z μ ^ + f w π , b W μ ^ F μ ^ ) ( Σ σ s + σ s μ ) d B s ( s ) } .
Taking the expectation for the above inequality, conditional upon X ( t ) = x , leads to
F ( t , x , μ ^ ) f π , b ( t , x , μ ^ , Y π , b , Z π , b , W π , b ) .
Therefore,
F ( t , X π , b , μ ^ ( t ) ) sup { π , b } Π f π , b ( t , x , μ ^ , y π , b , z π , b , w π , b ) .
Step 3. The last step proves that ( π * , b * ) is an equilibrium strategy. Suppose the strategy ( π * , b * ) satisfies the infimum in (12), one has
F t = [ x r + C I L b * + π * ( μ ^ r ) ] ( F x f x π * , b * ) k ( δ μ ^ ) ( F μ ^ f μ ^ π * , b * ) 1 2 ( π * ) 2 σ s 2 ( F x x U 1 π * , b * ) 1 2 ( Σ σ s + σ s μ ) 2 ( F μ ^ μ ^ U 2 π * , b * ) π π * , b * σ s ( Σ σ s + σ s μ ) ( F x μ ^ U 3 π * , b * ) + J π * , b * .
According to the derivation of (A8), we have
F ( t , X π , b , μ ^ ( t ) ) = F ( t , X π * , b * , μ ^ ( t ) ) = f π * , b * ( t , X π * , b * , μ ^ ( t ) , Y π * , b * , Z π * , b * , W π * , b * ) + t T { ( f x π * , b * + f y π * , b * Y x + f z π * , b * Z x + f w π * , b * W x F x ) π ( s ) σ s d B s ( s ) + ( f μ ^ π * , b * + f y π * , b * Y μ ^ + f z π * , b * Z μ ^ + f w π * , b * W μ ^ F μ ^ ) ( Σ σ s + σ s μ ) d B s ( s ) } .
Taking the expectation for the above equality yields
F ( t , x , μ ^ ) = f π * , b * ( t , x , μ ^ , y π * , b * , z π * , b * , w π * , b * ) sup { π , b } Π f π , b ( t , x , μ ^ , y π , b , z π , b , w π , b ) .
Combining (A9) and (A11), we obtain
F ( t , x , μ ^ ) = sup { π , b } Π f π , b ( t , x , μ ^ , y π , b , z π , b , w π , b ) .
For any strategy ( π ˜ ε ( s ) , b ˜ ε ( s ) ) defined by Definition 1, we have f π ˜ h , b ˜ h ( T ) = f π * , b * ( T ) . Equations (A4), (A7), and (A11) yield
f π * , b * ( t , x , μ ^ , y π * , b * , z π * , b * , w π * , b * ) f π ˜ h , b ˜ h ( t , x , μ ^ , y π ˜ h , b ˜ h , z π ˜ h , b ˜ h , w π ˜ h , b ˜ h ) = F ( t , x , μ ^ ) f π ˜ h , b ˜ h ( t , x , μ ^ , y π ˜ h , b ˜ h , z π ˜ h , b ˜ h , w π ˜ h , b ˜ h ) t T { { [ X π ˜ h , b ˜ h ( s ) r + C ( s ) I ( s ) L ( s ) b ˜ ( s ) + π ˜ h ( s ) ( μ ^ ( s ) r ) ] f x π ˜ h , b ˜ h [ X π * , b * ( s ) r + C ( s ) I ( s ) L ( s ) b * ( s ) + π * ( s ) ( μ ^ ( s ) r ) ] f x π * , b * + k ( δ μ ^ ( s ) ) ( f μ ^ π ˜ h , b ˜ h f μ ^ π * , b * ) + 1 2 π ˜ h 2 ( s ) σ s 2 U 1 π ˜ h , b ˜ h 1 2 ( π * ( s ) ) 2 σ s 2 U 1 π * , b * + 1 2 ( Σ σ s + σ s μ ) 2 ( U 2 π ˜ h , b ˜ h U 2 π * , b * ) + π ˜ h ( s ) σ s ( Σ σ s + σ s μ ) U 3 π ˜ h , b ˜ h π * ( s ) σ s ( Σ σ s + σ s μ ) U 3 π * , b * + J π ˜ h , b ˜ h ( s ) J π * , b * ( s ) } d s + [ ( f x π ˜ h , b ˜ h + f y π ˜ h , b ˜ h Y x + f z π ˜ h , b ˜ h Z x + f w π ˜ h , b ˜ h W x ) π ˜ h ( s ) σ s F x π * ( s ) σ s ] d B s ( s ) + ( f μ ^ π ˜ h , b ˜ h + f y π ˜ h , b ˜ h Y μ ^ + f z π ˜ h , b ˜ h Z μ ^ + f w π ˜ h , b ˜ h W μ ^ F μ ^ ) ( Σ σ s + σ s μ ) d B s ( s ) } = t t + h { [ X π ˜ h , b ˜ h ( s ) r + C ( s ) I ( s ) L ( s ) b ˜ ( s ) + π ˜ h ( s ) ( μ ^ ( s ) r ) ] f x π ˜ h , b ˜ h [ X π * , b * ( s ) r + C ( s ) I ( s ) L ( s ) b * ( s ) + π * ( s ) ( μ ^ ( s ) r ) ] f x π * , b * + k ( δ μ ^ ( s ) ) ( f μ ^ π ˜ h , b ˜ h f μ ^ π * , b * ) + 1 2 π ˜ h 2 ( s ) σ s 2 U 1 π ˜ h , b ˜ h 1 2 ( π * ( s ) ) 2 σ s 2 U 1 π * , b * + 1 2 ( Σ σ s + σ s μ ) 2 ( U 2 π ˜ h , b ˜ h U 2 π * , b * ) + π ˜ h ( s ) σ s ( Σ σ s + σ s μ ) U 3 π ˜ h , b ˜ h π * ( s ) σ s ( Σ σ s + σ s μ ) U 3 π * , b * + J π ˜ h , b ˜ h ( s ) J π * , b * ( s ) } d s + t T { [ ( f x π ˜ h , b ˜ h + f y π ˜ h , b ˜ h Y x + f z π ˜ h , b ˜ h Z x + f w π ˜ h , b ˜ h W x ) π ˜ h ( s ) σ s F x π * ( s ) σ s ] + ( f μ ^ π ˜ h , b ˜ h + f y π ˜ h , b ˜ h Y μ ^ + f z π ˜ h , b ˜ h Z μ ^ + f w π ˜ h , b ˜ h W μ ^ F μ ^ ) ( Σ σ s + σ s μ ) } d B s ( s ) .
Taking the conditional expectation on the above inequality yields
f π * , b * ( t , x , μ ^ , y π * , b * , z π * , b * , w π * , b * ) f π ˜ h , b ˜ h ( t , x , μ ^ , y π ˜ h , b ˜ h , z π ˜ h , b ˜ h , w π ˜ h , b ˜ h ) E t [ t t + h { [ X π ˜ h , b ˜ h ( s ) r + C ( s ) I ( s ) L ( s ) b ˜ ( s ) + π ˜ h ( s ) ( μ ^ ( s ) r ) ] f x π ˜ h , b ˜ h [ X π * , b * ( s ) r + C ( s ) I ( s ) L ( s ) b * ( s ) + π * ( s ) ( μ ^ ( s ) r ) ] f x π * , b * + k ( δ μ ^ ( s ) ) ( f μ ^ π ˜ h , b ˜ h f μ ^ π * , b * ) + 1 2 π ˜ h 2 ( s ) σ s 2 U 1 π ˜ h , b ˜ h 1 2 ( π * ( s ) ) 2 σ s 2 U 1 π * , b * + 1 2 ( Σ σ s + σ s μ ) 2 ( U 2 π ˜ h , b ˜ h U 2 π * , b * ) + π ˜ h ( s ) σ s ( Σ σ s + σ s μ ) U 3 π ˜ h , b ˜ h π * ( s ) σ s ( Σ σ s + σ s μ ) U 3 π * , b * + J π ˜ h , b ˜ h ( s ) J π * , b * ( s ) } d s ] .
Therefore,
lim h 0 inf 1 h f π * , b * ( t , x , μ ^ , y π * , b * , z π * , b * , w π * , b * ) f π ˜ h , b ˜ h ( t , x , μ ^ , y π ˜ h , b ˜ h , z π ˜ h , b ˜ h , w π ˜ h , b ˜ h ) lim h 0 inf 1 h E t [ t t + h { [ X π ˜ h , b ˜ h ( s ) r + C ( s ) I ( s ) L ( s ) b ˜ ( s ) + π ˜ h ( s ) ( μ ^ ( s ) r ) ] f x π ˜ h , b ˜ h [ X π * , b * ( s ) r + C ( s ) I ( s ) L ( s ) b * ( s ) + π * ( s ) ( μ ^ ( s ) r ) ] f x π * , b * + k ( δ μ ^ ( s ) ) ( f μ ^ π ˜ h , b ˜ h f μ ^ π * , b * ) + 1 2 π ˜ h 2 ( s ) σ s 2 U 1 π ˜ h , b ˜ h 1 2 ( π * ( s ) ) 2 σ s 2 U 1 π * , b * + 1 2 ( Σ σ s + σ s μ ) 2 ( U 2 π ˜ h , b ˜ h U 2 π * , b * ) + π ˜ h ( s ) σ s ( Σ σ s + σ s μ ) U 3 π ˜ h , b ˜ h π * ( s ) σ s ( Σ σ s + σ s μ ) U 3 π * , b * + J π ˜ h , b ˜ h ( s ) J π * , b * ( s ) } d s ] = [ X π * , b * ( t ) r + C ( t ) I ( t ) L ( t ) b * ( t ) + π * ( t ) ( μ ^ ( t ) r ) ] ( f x π * , b * f x π * , b * ) + k ( δ μ ^ ( t ) ) ( f μ ^ π * , b * f μ ^ π * , b * ) + 1 2 ( π * ( t ) ) 2 σ s 2 ( U 1 π * , b * U 1 π * , b * ) + 1 2 ( Σ σ s + σ s μ ) 2 ( U 2 π * , b * U 2 π * , b * ) + π * ( t ) σ s ( Σ σ s + σ s μ ) ( U 3 π * , b * U 3 π * , b * ) + J π * , b * ( t ) J π * , b * ( t ) = 0 .
It can be seen from the above equation that ( π * ( t ) , b * ( t ) ) satisfies 1. Thus, ( π * ( t ) , b * ( t ) ) is an equilibrium strategy, and
F ( t , x , μ ^ ) = f π * , b * ( t , x , μ ^ , y π * , b * , z π * , b * , w π * , b * ) = V ( t , x , μ ^ ) .

References

  1. Meramveliotakis, G. Understanding Money & Credit in Contemporary Capitalism: Back to the Future of Marx’s Theory of Fetishism and Alienation. Critique 2011, 50, 307–324. [Google Scholar]
  2. Hassel, A.; Naczyk, M.; Wiß, T. The political economy of pension financialisation: Public policy responses to the crisis. J. Eur. Public Policy 2019, 26, 483–500. [Google Scholar] [CrossRef]
  3. Niels, K. The ‘defined ambition’ pension plan: A dutch interpretation. Rotman Int. J. Pension Manag. 2013, 6, 6–11. [Google Scholar]
  4. Bovenberg, A.L.; Mehlkopf, R.; Nijman, T.E. The Promise of Defined Ambition Plans: Lessons for the United States: The Next 40 Years; Oxford University Press: Oxford, UK, 2016; pp. 215–246. [Google Scholar]
  5. Pugh, C.; Yermo, J. Funding regulations and risk sharing. In OECD Working Papers on Insurance and Private Pensions; OECD: Paris, France, 2008. [Google Scholar] [CrossRef]
  6. Turner, J.A. Hybrid pensions: Risk sharing arrangements for pension plan sponsors and participants. Soc. Actuar. 2014, 1–44. [Google Scholar]
  7. Canadian Institute of Actuaries. Report of the Task Force on Target Benefit Plans. 2015. Available online: https://www.cia-ica.ca/docs/default-source/2015/215043e.pdf (accessed on 18 July 2023).
  8. Munnell, A.H.; Sass, A.S. New Brunswick’s new shared risk pension plan. In State and Local Pension Plans Briefs; Center For Retirement Research: Boston, MA, USA, 2013; No. 33. [Google Scholar]
  9. Gollier, C. Intergenerational risk-sharing and risk-taking of a pension fund. J. Public Econ. 2008, 92, 1463–1485. [Google Scholar] [CrossRef]
  10. Cui, J.; De Jong, F.; Ponds, E. Intergenerational risk sharing within funded pension schemes. J. Pension Econ. Financ. 2011, 10, 1–29. [Google Scholar] [CrossRef]
  11. Wang, S.X.; Lu, Y.; Sanders, B. Optimal investment strategies and intergenerational risk sharing for target benefit pension plans. Insur. Math. Econ. 2018, 80, 1–14. [Google Scholar] [CrossRef]
  12. Wang, S.X.; Rong, X.M.; Zhao, H. Optimal investment and benefit payment strategy under loss aversion for target benefit pension plans. Appl. Math. Comput. 2019, 346, 205–218. [Google Scholar] [CrossRef]
  13. Zhao, H.; Wang, S. Optimal investment and benefit adjustment problem for a target benefit pension plan with Cobb-Douglas utility and Epstein-Zin recursive utility. Eur. J. Oper. Res. 2022, 301, 1166–1180. [Google Scholar] [CrossRef]
  14. Battocchio, P.; Menoncin, F. Optimal pension management in a stochastic framework. Insur. Math. Econ. 2004, 34, 79–95. [Google Scholar] [CrossRef]
  15. Cairns, A.J.G.; Blake, D.; Dowd, K. Stochastic lifestyling: Optimal dynamic asset allocation for defined contribution pension plans. J. Econ. Dyn. Control 2005, 30, 843–877. [Google Scholar] [CrossRef]
  16. Bordley, R.; LiCalzi, M. Decision Analysis Using Targets Instead of Utility Functions. Decis. Econ. Financ. 2000, 23, 53–74. [Google Scholar] [CrossRef]
  17. Vigna, E.; Haberman, S. Optimal Investment Strategy for Defined Contribution Pension Scheme. Insur. Math. Econ. 2001, 28, 233–262. [Google Scholar] [CrossRef]
  18. Li, D.P.; Rong, X.M.; Zhao, H. Time-Consistent Investment Strategy for DC Pension Plan with Stochastic Salary Under CEV Model. J. Syst. Sci. Complex. 2016, 29, 428–454. [Google Scholar] [CrossRef]
  19. Zhang, L.; Li, D.P.; Lai, Y.Z. Equilibrium investment strategy for a defined contribution pension plan under stochastic interest rate and stochastic volatility. J. Comput. Appl. Math. 2020, 368, 112536. [Google Scholar] [CrossRef]
  20. Markowitz, H. Portfolio selection. J. Financ. 1952, 7, 77–91. [Google Scholar]
  21. Basak, S.; Chabakauri, G. Dynamic mean—Variance asset allocation. Rev. Financ. Stud. 2010, 23, 2970–3016. [Google Scholar] [CrossRef]
  22. Bjo¨rk, T.; Murgoci, A. A General Theory of Markovian Time Inconsistent Stochastic Control Problems; Working Paper; Stockholm School of Economics: Stockholm, Sweden, 2010. [Google Scholar]
  23. Long, J.; Zeng, S. Equilibrium Time-Consistent Strategy for Corporate International Investment Problem with Mean-Variance Criterion. Math. Probl. Eng. 2016, 2016, 3295041.1–3295041.20. [Google Scholar] [CrossRef]
  24. Guan, G. Equilibrium and Precommitment Mean-Variance Portfolio Selection Problem with Partially Observed Price Index and Multiple Assets. Methodol. Comput. Appl. Probab. 2020, 22, 1–23. [Google Scholar] [CrossRef]
  25. Kronborg, M.T.; Steffensen, M. Inconsistent investment and consumption problems. Appl. Math. Optim. 2015, 71, 473–515. [Google Scholar] [CrossRef]
  26. Yang, B.Z.; He, X.J.; Zhu, S.P. Continuous time mean–variance–utility portfolio problem and its equilibrium strategy. Optimization 2021, 71, 4213–4241. [Google Scholar] [CrossRef]
  27. Kang, J.; Wang, M.; Huang, N. Equilibrium strategy for mean–variance–utility portfolio selection under Heston’s SV model. J. Comput. Appl. Math. 2021, 392, 113490. [Google Scholar] [CrossRef]
  28. Wong, T.W.; Chiu, M.C.; Wong, H.Y. Time-consistent mean-variance hedging of longevity risk: Effect of cointegration. Insur. Math. Econ. 2014, 56, 56–67. [Google Scholar] [CrossRef]
  29. Kwok, K.Y.; Chiu, M.C.; Wong, H.Y. Demand for longevity securities under relative performance concerns: Stochastic differential games with cointegration. Insur. Math. Econ. 2016, 71, 353–366. [Google Scholar] [CrossRef]
  30. Strulik, H.; Vollmer, S. Long-run trends of human aging and longevity. J. Popul. Econ. 2013, 26, 1303–1323. [Google Scholar] [CrossRef]
  31. Knell, M. Increasing life expectancy and NDC pension systems. J. Pension Econ. Financ. 2018, 17, 1–30. [Google Scholar] [CrossRef]
  32. Devolder, P.; Melis, R. Optimal mix between pay as you go and funding for pension liabilities in a stochastic framework. Astin Bull. 2015, 45, 551–575. [Google Scholar] [CrossRef]
  33. Fu, K.; Rong, X.; Zhao, H. Optimal investment problem for a hybrid pension with intergenerational risk-sharing and longevity trend under model uncertainty. Working Paper, Tianjin University. arXiv 2023, arXiv:2302.02351. [Google Scholar]
  34. Fama, E.F.; French, K.R. Permanent and temporary components of stock prices. J. Political Econ. 1988, 96, 246–273. [Google Scholar] [CrossRef]
  35. Boudoukh, J.; Michaely, R.; Richardson, M.; Roberts, M.R. On the implications of measuring payout yield: Implications for empirical asset pricing. J. Financ. 2007, 62, 877–915. [Google Scholar] [CrossRef]
  36. Wang, P.; Shen, Y.; Zhang, L.; Kang, Y. Equilibrium investment strategy for a DC pension plan with learning about stock return predictability. Insur. Math. Econ. 2021, 100, 384–407. [Google Scholar] [CrossRef]
  37. Wang, P.; Zhang, L.; Li, Z. Asset allocation for a DC pention plan with learning about stock return predictability. J. Ind. Manag. Optim. 2022, 18, 3847–3877. [Google Scholar] [CrossRef]
  38. Liptser, R.S.; Shiryayev, A.N. Statistics of Random Processes, II-Applications; Springer: Berlin/Heidelberg, Germany; New York, NY, USA, 2001. [Google Scholar]
Figure 1. Comparison of the equilibrium benefit payment and the target benefit payment.
Figure 1. Comparison of the equilibrium benefit payment and the target benefit payment.
Axioms 12 00732 g001
Figure 2. Effect of γ 2 on b * ( t ) .
Figure 2. Effect of γ 2 on b * ( t ) .
Axioms 12 00732 g002
Figure 3. Effect of η on b * ( t ) .
Figure 3. Effect of η on b * ( t ) .
Axioms 12 00732 g003
Figure 4. Effect of β on b * ( t ) .
Figure 4. Effect of β on b * ( t ) .
Axioms 12 00732 g004
Figure 5. Effect of θ on b * ( t ) .
Figure 5. Effect of θ on b * ( t ) .
Axioms 12 00732 g005
Figure 6. Effect of γ 1 on π * ( t ) .
Figure 6. Effect of γ 1 on π * ( t ) .
Axioms 12 00732 g006
Figure 7. Effect of σ s on π * ( t ) .
Figure 7. Effect of σ s on π * ( t ) .
Axioms 12 00732 g007
Figure 8. Effects of k and σ s μ on π * ( t ) .
Figure 8. Effects of k and σ s μ on π * ( t ) .
Axioms 12 00732 g008
Figure 9. The effect of the longevity trend on b * ( t ) .
Figure 9. The effect of the longevity trend on b * ( t ) .
Axioms 12 00732 g009
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, W.; Li, N.; Muhammadhaji, A. An Equilibrium Strategy for Target Benefit Pension Plans with a Longevity Trend and Partial Information. Axioms 2023, 12, 732. https://doi.org/10.3390/axioms12080732

AMA Style

Liu W, Li N, Muhammadhaji A. An Equilibrium Strategy for Target Benefit Pension Plans with a Longevity Trend and Partial Information. Axioms. 2023; 12(8):732. https://doi.org/10.3390/axioms12080732

Chicago/Turabian Style

Liu, Wei, Na Li, and Ahmadjan Muhammadhaji. 2023. "An Equilibrium Strategy for Target Benefit Pension Plans with a Longevity Trend and Partial Information" Axioms 12, no. 8: 732. https://doi.org/10.3390/axioms12080732

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop