Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Next Article in Journal
The Application of the Piecewise Linear Method for Non-Linear Programming Problems in Ride-Hailing Assignment Based on Service Level, Driver Workload, and Fuel Consumption
Previous Article in Journal
A Multi-Objective Non-Dominated Sorting Gravitational Search Algorithm for Assembly Flow-Shop Scheduling of Marine Prefabricated Cabins
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bayesian Estimation of the Semiparametric Spatial Lag Model

1
College of Economics and Management, Fujian Agriculture and Forestry University, Fuzhou 350002, China
2
School of Economics and Management, Fuzhou University, Fuzhou 350108, China
*
Author to whom correspondence should be addressed.
Mathematics 2024, 12(14), 2289; https://doi.org/10.3390/math12142289
Submission received: 7 June 2024 / Revised: 4 July 2024 / Accepted: 20 July 2024 / Published: 22 July 2024
(This article belongs to the Section Probability and Statistics)

Abstract

:
This paper proposes a semiparametric spatial lag model and develops a Bayesian estimation method for this model. In the estimation of the model, the paper combines Reversible Jump Markov Chain Monte Carlo (RJMCMC) algorithm, random walk Metropolis sampler, and Gibbs sampling techniques to sample all the parameters. The paper conducts numerical simulations to validate the proposed Bayesian estimation theory using a numerical example. The simulation results demonstrate satisfactory estimation performance of the parameter part and the fitting performance of the nonparametric function under different spatial weight matrix settings. Furthermore, the paper applies the constructed model and its estimation method to an empirical study on the relationship between economic growth and carbon emissions in China, illustrating the practical application value of the theoretical results.

1. Introduction

Econometric models can be divided into parametric regression models and nonparametric regression models based on the structural aspect of the regression object. Parametric models assume that the regression object follows a known parameter structure, while nonparametric models do not make specific structural assumptions about the regression object but instead rely on mild smoothness conditions to effectively avoid the risk of misspecification. However, in practical applications, researchers often have limited knowledge about the exact functional form of the impact of explanatory variables on the dependent variable. In such cases, combining both models into a single model, known as a semiparametric regression model, is necessary to address real-world problems effectively. Therefore, the semiparametric regression model can be seen as a bridge between parametric and nonparametric regression models, allowing for the utilization of information contained in the data. It establishes a parametric model for the part of the explanatory variables that have a clear relationship with the response variable and a nonparametric model for the parts with less explicit relationships (such as spatial factors and temporal factors). The advantage of this model lies in its ability to combine the interpretability of linear regression with the robustness of nonparametric regression, while addressing the “curse of dimensionality” issue associated with nonparametric regression, making it easier to estimate. Additionally, as the model contains both parametric and nonparametric components, it provides a closer approximation to reality and better utilizes the information provided by the data when describing real-world problems. Therefore, the semiparametric model is practical in both theoretical research and practical applications, and it holds significant importance.
Due to the rapid development of spatial econometrics in recent years, spatial econometric models have gained widespread attention and extensive application in various fields (see [1,2,3]). Particularly in the fields of economics, management, and environmental science, variables are not only influenced by regional factors (exhibiting spatial correlation) but also often exhibit both linear and nonlinear relationships. Therefore, this paper considers incorporating spatial factor and spatial weight matrix into the semiparametric model, expanding it into a spatial semiparametric model.
Given the practicality of the spatial semiparametric model, there have been increasing applications and theoretical advancements in this area in recent years. Su [4] applied the quasi-maximum likelihood method to estimate partially linear spatial autoregressive model, and subsequently, Su [5] further discussed the estimation issues of the semiparametric spatial autoregressive model. Malikov and Sun [6] discussed the estimation and testing problems of the smooth coefficient semiparametric spatial autoregressive model, and later, Malikov et al. [7] applied the semiparametric spatial quantile autoregressive model to the study of residential housing values. Li and Chen [8] employed the maximum likelihood method to estimate the semiparametric varying coefficient spatial lag model. Sun [9] discussed the estimation and application of the semiparametric spatial lag model for panel data.
From the existing research results, it can be observed that in terms of theory, the main research interest in the semiparametric spatial model lies in the estimation of the coefficients in the parametric part and the unknown functions in the nonparametric part. Since this model contains both parametric and nonparametric components, the methods used to study this model combine the approaches used in linear regression and nonparametric regression, but it is not a simple combination of the two methods. The existing literature mainly employs traditional frequentist methods such as (quasi-) maximum likelihood estimation and generalized method of moments to estimate the semiparametric spatial model. In comparison to traditional methods, the Bayesian estimation method has the distinctive feature of utilizing information. It not only uses sample information but also incorporate prior information (even though acquiring precise prior information can be challenging), whereas traditional methods rely solely on sample information for estimation. Additionally, the Bayesian estimation method can obtain consistent and efficient estimates under more relaxed conditions. Particularly, the application of Bayesian methods, especially simulation-based Bayesian inference in the semiparametric model can effectively alleviate the “curse of dimensionality” issue. Therefore, in recent years, the Bayesian method has been successfully applied in semiparametric regression and spatial models. Gelfand et al. [10] applied Bayesian methods to discuss the estimation of the nonparametric spatial model. Han and Lee [11] studied the Bayesian estimation and model selection problems of the spatial durbin error model. Kang and Cressie [12] investigated Bayesian inference for the spatial random effects model. Fang and Qian [13] and Fang [14] applied Bayesian methods to estimate the nonparametric spatial lag model and general spatial lag model respectively. Lesage [15] used Bayesian method to study statistical inference problems in the panel data spatial econometric model. Louzada et al. [16] focused on the progressive development and content analysis of the Bayesian spatial models. Thapa et al. [17] employed Bayesian statistics using the nested-sampling algorithm to compare and rank multiple models of ergodic diffusion (including anomalous diffusion) as well as to assess their optimal parameters for in silico-generated and real time-series.
Due to the advantages of the Bayesian method, this paper will explore the Bayesian estimation problem of the spatial semiparametric model. Depending on the treatment of spatial dependence, spatial econometric models can be divided into spatial lag model and spatial error model. However, the spatial error model can essentially be viewed as a traditional econometric model with a special heteroscedastic structure, and it can be transformed into a generalized form of the spatial lag model known as the spatial durbin model. Therefore, in a sense, the spatial lag model is a more general spatial model. Hence, the following content of this paper will apply the Bayesian method to estimate the semiparametric spatial lag model.
The rest of this paper is organized as follows. Section 2 presents a model analysis. Section 3 develops prior distributions. Section 4 describes posterior distributions. Section 5 introduces the sampling process. Section 6 provides numerical simulation. Section 7 shows an application example. We conclude the paper in Section 7 with a conclusion.

2. Model Analysis

For a cross-sectional semiparametric spatial lag model with n spatial units:
Y = ρ W Y + Z β + H + ε
where ε ~ N ( 0 , σ 2 I n ) , Y = y 1 , y 2 , , y n , H = h x 1 , h x 2 , , h x n , h x i are unknown link functions, Z is an n × l matrix, β = β 1 , β 2 , , β l , and W represents the spatial weight matrix. To estimate model (1), it is necessary to estimate the spatial lag coefficient ρ , the parameter coefficient vector β , the variance of the random disturbance term σ 2 , and the unknown nonparametric function h x i , i = 1 , 2 , , n .
Regarding the estimation of the nonparametric function h x i , this paper adopts a m spline method, which involves a linear combination of given spline bases and spline coefficients. The spline function estimation of the unknown function h x i is given by:
h ^ x i = j = 1 1 + m + k B j x i γ ^ j = B x i γ
where B ( x i ) = B 1 ( x i ) , B 2 ( x i ) , , B 1 + m + k ( x i ) is a given set of spline bases, γ = γ 1 , γ 2 , , γ 1 + m + k is the spline coefficient, and r = r 1 , r 2 , , r k are the k spline nodes with a , b as the two boundary nodes. Therefore, to estimate the unknown function h x i , it is necessary to estimate the number of spline nodes k , the positions of the nodes r = r 1 , r 2 , , r k , and the spline coefficients γ . Finally, the unknown function h x i is estimated using the average value of the Markov chain E h x i k , r , Y , which is denoted as h ^ x i = E h x i Y = E E h x i k , r , Y .
Based on the above analysis, using matrix equations, model (1) can be written in the following form:
Y = ρ W Y + Z β + X r γ + ε
where
X r = B 1 x 1 B 2 x 1 B 1 + m + k x 1 B 1 x 2 B 2 x 2 B 1 + m + k x 2 B 1 x n B 2 x n B 1 + m + k x n n × 1 + m + k
If we let X = Z , X r , ζ = β γ , then Equation (3) can be further written as:
Y = ρ W Y + X ζ + ε
Therefore, assuming that ε follows a normal distribution, given the number and positions of the spline nodes k and r , the likelihood function of model (4) is as follows:
L ( Y k , r , β , γ , σ 2 , ρ ) 2 π σ 2 n / 2 A exp 1 2 σ 2 A Y Z β X r γ T A Y Z β X r γ
where I n is the n matrix and A = ( I n ρ W ) .

3. Prior Distributions

Since the number k and positions r of the spline nodes are both random variables and unknown, they also need to be estimated. First, the prior distributions of the number k and positions r of the spline nodes are set as follows:
π k = λ k k ! e λ , k = 0 , 1 , 2 ,
π r k = k ! b a k Δ k
where Δ k = I a < r 1 < r 2 < < r k < b is the indicator function.
Second, the number of spline nodes k , the prior distributions of the parameter vector ζ , and the variance σ 2 are set as multivariate normal distribution and inverse gamma distribution, respectively:
σ 2 k I G υ 1 2 , υ 2 2
ζ σ 2 , k N 1 + l + m + k 0 , σ 2 Σ 0
That is,
π σ 2 k = 1 Γ υ 1 2 υ 2 2 υ 1 / 2 σ 2 υ 1 2 + 1 e υ 2 2 σ 2
π ζ k , σ 2 = 2 π σ 2 1 + l + m + k 2 Σ 0 1 2 exp 1 2 σ 2 ζ T Σ 0 1 ζ
where
Σ 0 = d i a g I l , V 1 + m + k
Finally, the prior distribution of the spatial lag coefficient ρ is set as a uniform distribution:
ρ U λ min 1 , λ max 1
That is,
π ρ = 1 λ max 1 λ min 1
where λ min 1 and λ max 1 are the minimum and maximum eigenvalues of the spatial weight matrix W .

4. Posterior Distributions

First, we combine the prior distribution of all parameters with the likelihood function to obtain the joint distribution of all variables:
P k , r , ζ , σ 2 , ρ , Y = L Y k , r , ζ , σ 2 , ρ π k π r k π σ 2 k π ζ k , σ 2 π ρ
By Bayes’ theorem, the joint posterior density function of all parameters k , r , ζ , σ 2 , ρ can be obtained according to the above formula:
P k , r , ζ , σ 2 , ρ Y = P k , r , ζ , σ 2 , ρ , Y P k , r , ζ , σ 2 , ρ , Y d ζ d σ d ρ d k d r
The denominator P k , r , ζ , σ 2 , ρ , Y d ζ d σ d ρ d k d r in the above formula is independent of the parameters k , r , ζ , σ 2 , ρ , so in application, only the numerator is considered, and it is called the kernel of the posterior density. It can be seen in the following formula, where the symbol “ ” means proportional to.
P k , r , ζ , σ 2 , ρ Y P k , r , ζ , σ 2 , ρ , Y
Then, the prior distribution density function and likelihood function of all variables are substituted into the above formula to obtain the joint posterior distribution of all variables. It is shown in the following formula:
P k , r , ζ , σ 2 , ρ Y 2 π σ 2 n / 2 A exp 1 2 σ 2 A Y Z β X r γ T A Y Z β X r γ × λ k k ! e λ k ! b a k Δ k 1 Γ υ 1 2 υ 2 2 υ 1 / 2 σ 2 υ 1 2 + 1 e υ 2 2 σ 2 × 2 π σ 2 1 + l + m + k 2 Σ 0 1 2 exp 1 2 σ 2 ζ T Σ 0 1 ζ 1 λ max 1 λ min 1 λ k k ! e λ k ! b a k Δ k 2 π 1 + l + m + k 2 σ 2 n + υ 1 + l + m + k + 3 / 2 Σ 0 1 2 1 λ max 1 λ min 1 × exp 1 2 σ 2 A Y X ζ T A Y X ζ + ζ T Σ 0 1 ζ + υ 2 A λ k k ! e λ k ! b a k Δ k S n + υ 1 2 Σ 1 2 Σ 0 1 2 × S n + υ 1 2 σ 2 n + υ 1 + 2 / 2 exp S 2 σ 2 × 2 π σ 2 1 + l + m + k 2 Σ 1 2 exp 1 2 σ 2 ζ ζ ¯ r T Σ 1 ζ ζ ¯ r × A 1 λ max 1 λ min 1
where Σ = Σ 0 1 + X T X 1 , S = υ 2 + Y T A T A Y Y T A T X Σ 0 1 + X T X 1 X T A Y , ζ ¯ r = Σ 0 1 + X T X 1 X T A Y , and note that X = Z , X r , ζ = β γ , Σ 0 = d i a g I l , V 1 + m + k .
Furthermore, based on Equation (14), the conditional posterior distributions of each parameter can be obtained:
P k , r ζ , σ 2 , ρ , Y λ k k ! e λ k ! b a k Δ k S n + υ 1 2 Σ 1 2 Σ 0 1 2
P σ 2 k , r , ζ , ρ , Y I G n + υ 1 2 , S 2
P ζ k , r , σ 2 , ρ , Y N 1 + l + m + k ζ ¯ r , σ 2 Σ
P ρ k , r , ζ , σ 2 , Y A ρ exp 1 2 σ 2 A ρ Y X ζ T A ρ Y X ζ 1 λ max 1 λ min 1
where A ρ = ( I n ρ W ) .

5. Sampling

We first make a description of the Gibbs sampling method and the Metropolis–Hastings algorithm as well as the RJMCMC algorithm. The details are as follows.
Markov Monte Carlo simulation involves simulating a Markov process on a set of parameters, which converges to a stationary distribution, and this method to obtain a stationary distribution is called the MCMC method. Therefore, the basic idea of the MCMC method is to obtain samples of π x by establishing a Markov chain of stationary distribution π x , based on which various statistical inferences can be made.
There are two popular MCMC methods in Bayesian analysis: Gibbs sampling (sampler) method and Metropolis–Hastings (MH) algorithm.
Gibbs sampling is the simplest and most widely used MCMC method. It was proposed by Geman in 1984. It was initially used for the analysis of large and complex data such as image processing analysis, artificial intelligence, and neural networks, and then introduced into the Bayesian model research by Gelfand and Smith [18]. This has a profound impact on the practical application of Bayesian methods. The success of Gibbs sampling is that it uses a fully conditional distribution to reduce a complex problem with multiple related parameters to a simpler problem with only one parameter at a time. Therefore, the most essential function of Gibbs sampling is to generate random numbers of joint posterior density, so it is the key of the Gibbs sampler to derive all conditional posterior density.
Another MCMC method that is more general than Gibbs sampling is the Metropolis–Hastings method. Metropolis et al. [19] proposed a method for transferring kernels, and Hastings [20] later extended it to form the Metropolis–Hastings method.
The core of the MH method is to find a suitable transfer probability density function, and then construct a nonperiodic irreducible normal return Markov chain, so that the stationary distribution of the Markov chain is the joint posterior distribution we need. The purpose of approximate sampling from joint posterior distribution is realized by using the property that limits distribution as stationary distribution.
In fact, the Metropolis–Hastings sampling method is convenient for numerical operations in lower dimensions. However, at higher dimensions, it is not easy to select the appropriate proposed distribution, whereas Gibbs sampling only needs to know the full conditional distribution, so it has an advantage for integration in higher dimensions. Gibbs sampling is essentially a special case of Metropolis–Hastings sampling when the acceptance rate is all 1. In addition, it should be noted that if all conditional distributions are easy to sample directly, the Gibbs sampling method is the most suitable, because there is no need to subjectively select the proposed distribution, and the acceptance rate is 100%. If the fully conditional distribution is not easy to sample directly, the Metropolis–Hastings algorithm can be used. It is more common to use the Gibbs sampler and the Metropolis–Hastings algorithm in a fully conditional distribution where one part is easy to sample directly and the other part is not.
The traditional MCMC sampling algorithm introduced above is usually carried out on the basis of the same probability density dimension of parameter joint distribution, while the RJMCMC (Reversible Jump Markov Chain Monte Carlo) sampling algorithm is a complex MCMC algorithm. It is extended based on the Metropolis–Hastings algorithm to allow Markov chains to jump in variable-dimensional parameter space models. This method is often applied to Bayes models and variable selection problems where parameter spaces may not be consistent.
Therefore, according to the specific characteristics of all parameters, this paper combined the RJMCMC algorithm, random walk Metropolis sampler, and Gibbs sampling technology to design the entire sampling process.
Regarding the estimation of parameters ζ and σ 2 , we use the Gibbs sampling, and for the estimation of ρ , the Metropolis–Hastings algorithm with random walk is employed. For the sampling of k , r , Reversible Jump Markov Chain Monte Carlo (MCMC) method is used, which constructs a mixture sampler by removing a node, adding a node, and moving a node. The following section will introduce the updating process of each parameter.

5.1. Updating the Number k and Positions r of Spline Nodes

Assuming that the positions of the current k spline nodes are r = r 1 , r 2 , , r k , a node is randomly selected from the current nodes to be deleted, added, or to change its position with probability d k , b k , m k ( m k = 1 d k b k ) , respectively. Let k , r denote the proposed number and positions of the nodes after the change.
When adding a node, we first randomly select a sub-interval r l 1 , r l , and then randomly generate a value (e.g., ξ ) from the distribution function B e t a α , α on the interval r l 1 , r l as the position of the new node. Because the location of the new node has a specific interval range, the proposed distribution of the location of the new node is generally set to uniform distribution or B e t a distribution B e t a α , α . In particular, when α = 1 , the B e t a distribution becomes uniform, i.e., B e t a 1 , 1 = U 0 , 1 . Therefore, this paper sets the proposed distribution for the location of the new node as B e t a distribution.
If a node is to be removed, we randomly select one from the existing k nodes (e.g., r l , l = 1 , 2 , , k ), and delete it.
When moving a node, we randomly select one (e.g., r l ) from the current nodes r 1 , r 2 , , r k , change its position, and the new position r l is generated from the B e t a distribution function B e t a α , α on the interval r l 1 , r l + 1 .
Therefore, the total acceptance rate from state k , r to state k , r is:
R a t i o = min 1 , R k S S n + υ 1 2 Σ Σ 1 2 Σ 0 Σ 0 1 2
where,
R k = b k 1 d k b a λ B e t a α , α r l r l 1 α 1 r l + 1 r l α 1 r l + 1 r l 1 2 α 1 ,        k = k 1 d k + 1 b k λ B e t a α , α b a r l r l 1 2 α 1 ξ r l 1 α 1 r l ξ α 1 ,           k = k + 1 r l r l 1 r l + 1 r l r l r l 1 r l + 1 r l α 1 ,                                      k = k

5.2. Updating Parameter ρ

According to the expression of the conditional posterior distribution of parameter ρ , it is known that the density function of the conditional posterior distribution of parameter ρ is complex and cannot be directly sampled. Therefore, the random walk Metropolis–Hastings algorithm is used to handle it. Assuming the current value of ρ is ρ t ( P ( ρ t k , r , ζ , σ 2 , Y ) > 0 ), the candidate value ρ is generated from the proposal distribution F ( ρ ρ t ) = f ( ρ ρ t ) (where f ( ) represents the density function), and the transition process is: ρ = ρ t + c z , z N ( 0 , 1 ) .   c is the transition parameter. The acceptance probability of ρ is then A ( ρ ρ t ) = min 1 , R , where
R = P ( ρ k , r , ζ , σ 2 , Y ) F ( ρ t ρ ) P ( ρ t k , r , ζ , σ 2 , Y ) F ( ρ ρ t )       = A ρ exp 1 2 σ 2 A ρ Y X ζ T A ρ Y X ζ A ρ t exp 1 2 σ 2 A ρ t Y X ζ T A ρ t Y X ζ
Finally, based on the conditional posterior distributions of the other parameters ζ , σ 2 in the model, Gibbs sampling is directly used to sample them.
Therefore, the estimation process of parameter k , r , ρ , ζ , σ 2 can be summarized into the following four steps:
Step 1: Assign initial values to all parameters k , r , ρ , ζ , σ 2 , denoted as k 0 , r 0 , ρ 0 , ζ 0 , σ 0 2 .
Step 2: Fix the initial value of ρ , ζ , σ 2 , update parameter k , r using the RJMCMC algorithm, and denote the updated value as k 1 , r 1 .
Step 3: For each updated state k 1 , r 1 of k , r , fix the value of ζ , σ 2 , update parameter ρ using the Metropolis–Hastings algorithm, and denote the updated value as ρ 1 .
Step 4: Fix the values of k 1 , r 1 , and ρ 1 , use Gibbs sampling to sequentially update parameters ζ , σ 2 , and denote the updated parameters as ζ 1 , σ 1 2 .
After these steps, the transition from the initial state k 0 , r 0 , ρ 0 , ζ 0 , σ 0 2 to the next state k 1 , r 1 , ρ 1 , ζ 1 , σ 1 2 is completed. Then, we use k 1 , r 1 , ρ 1 , ζ 1 , σ 1 2 as the new initial state and repeat steps 2, 3, and 4 until convergence. Finally, we estimate all unknown parameters using the posterior sample mean. The estimation of the unknown function h x i can be approximated by the average of the Markov chain E h x i k , r , Y . That is h ^ x i = E h x i Y = E E h x i k , r , Y .

6. Numerical Simulation

In the following, a numerical example is used to simulate the estimation method proposed in the previous sections. In the numerical simulation, this study uses truncated power spline functions to fit the unknown function, with m = 1 . In the prior distribution of β , V = n I 1 + m + k , where n and I 1 + m + k are the sample capacity and the 1 + m + k order identity matrix, respectively. In the Poisson prior, λ = 1 , and in the prior distribution of σ 2 , υ 1 = υ 2 = 0.001 . In the sampling process, b k = c 0 min 1 , π k + 1 n π k , d k = c 0 min 1 , n π k 1 π k , and m k = 1 b k d k , where c 0 = 0.4 , b 0 = 1 , and d 0 = 0 . The hyperparameter is α = 1 and transition parameter is c = 0.5 .
For the initial values of the Markov chain, this study makes the following selections: the number of spline nodes k = 2 , k max = 10 , and the initial values of the node positions r are directly generated from the corresponding prior distribution D i r i c h l e t 1 , 1 , , 1 . The initial values of the spline coefficients are β 0 = ( 0.1 , 0.1 , , 0.1 ) 1 + m + k . The initial values of the parameter ρ and variance σ 2 are set to their true values.
Regarding the estimation of the unknown function, the goodness of fit of the fitted function is measured by the mean square error (MSE) on 50 grid points within the interval 2 , 2 , i.e.,
M S E _ h = 1 50 i = 1 100 h ^ u i h u i 2
Considering the semiparametric spatial lag model:
Y = ρ W Y + Z β + H + ε ,   ε ~ N ( 0 , σ 2 I n )
where Y = y 1 , y 2 , , y n , H = h x 1 , h x 2 , , h x n , Z is an n × l matrix, and β = β 1 , β 2 , , β l . In this example, we set β = 1 , 1 / 2 , 1 / 3 , h x i = cos ( 2 x i 1 ) , ρ = 0.3 , σ 2 = 0.5 , and the spatial weight matrix W is the Rook adjacency spatial weight matrix, which is randomly generated.
Using random methods and with the aid of MATLAB software, 625 sample points are generated, and the estimation method proposed in the previous sections is iterated 7000 times. The average of the posterior values from the last 5000 observations is taken as the estimate of the posterior mean.
Table 1 presents the true values, posterior means, posterior medians, standard deviations, and 95% highest posterior density intervals (HPD) of the model parameters. The simulation processes and results for each parameter are detailed in Figure 1. The left side of the figure shows the boxplots of the estimates of the components of parameter β , and the right side shows the boxplots of parameter ρ and the MSE between the function h and its true value. Observing Table 1 and Figure 1, it can be seen that the estimated values of each parameter in the numerical example are very close to the true values, and the corresponding confidence intervals are relatively small and contain the true values, indicating accurate estimation results.
To examine the performance of the proposed algorithm, this study also presents the running status of each parameter and the MSE of function h in Figure 2. Figure 3 depicts the fitting of the function h , where the solid line represents the actual curve, the dotted line represents the fitted function, and the two dashed lines are the 95% confidence limit curves. The mean MSE between the function h and its true value is 0.0509, indicating a very good fit.

7. Application Examples

China’s economy is currently undergoing a transition from extensive development to high-quality development, and greening and low carbonization are basic requirements and main manifestations of high-quality development. In this context, it is particularly important to explore the relationship between China’s economic growth and the environment, and the study of the interaction mechanism between the two has outstanding practical significance. Based on these considerations, this study will empirically examine the relationship between urban economic growth and carbon emissions in China using samples of prefecture-level and above cities.

7.1. Empirical Research Design

7.1.1. Empirical Model Specification

The I P A T equation is a commonly used method for assessing the environmental impact of human activities, first proposed by Commoner [21] and Ehrlich and Holdren [22]. It channels the impact of human activities on the environment ( I ) into three aspects: population ( P ), affluence ( A ), and technology ( T ), i.e.,
I = P A T
The I P A T equation is just an accounting identity and cannot be used for hypothesis testing. To overcome this limitation, Dietz and Rosa [23] extended the I P A T equation into a stochastic form and proposed the S T I R P A T equation, whose mathematical definition is as follows:
I = a P b A c T d e
where a , b , c , and d are the coefficients to be estimated, and e represents the random disturbance term. Taking the logarithm of both sides of the equation yields a multivariate linear regression model:
ln I = α + b ln P + c ln A + d ln T + ln e
At this point, the parameters b , c , and d represent the ecological elasticity of population, affluence, and technology, respectively.
Since there are many technological factors that affect the environment, it is generally difficult to find suitable proxy variables in practical applications. York et al. [24] believed that, without violating the product principle of the I P A T equation (i.e., T can be expressed as the product of its influencing factors), any variable that affects the level of technology can be added to Equation (25). In other words, variables other than population and affluence can be added to the regression model according to the concept of the technological multiplier, as long as these additional variables meet the concept of the technological multiplier.
To explore the impact of China’s urban economic growth on carbon emissions, this study extends the S T I R P A T model mentioned above. In addition to population and affluence factors, we also consider energy consumption factors and incorporate industrial structure and innovation input as technological factors into the model. At the same time, considering that carbon emissions between neighboring cities may affect each other and that there may be a nonlinear relationship between economic growth and carbon emissions (such as the Environmental Kuznets Curve, which indicates a U-shaped nonlinear relationship between economic growth and environmental quality), we construct the following empirical model based on the semiparametric spatial lag model proposed earlier:
ln C O 2 i t = ρ j = 1 n w i j ln C O 2 j t + β 1 ln p o p i t + β 2 i n s t r i t + β 3 i n o v i t + f ( ln p g d p i t ) + ε i t
where the subscripts i , t represent cities and years, respectively; C O 2 is carbon emissions; p g d p represents per capita income; p o p , e n e r , i n s t r , i n o v represent population, energy consumption, industrial structure, and innovation input, respectively; f ( ) is an unknown function; ε i t is a random disturbance; ρ is the spatial correlation coefficient; w i j is the ( i , j ) element of the spatial weight matrix. We choose the distance threshold weight matrix as the spatial weight matrix; that is, first determine a minimum distance based on the criterion that each city has “neighbors” and then find all “neighbors” of each city based on this distance. If two cities are neighbors, their weight relative to each other is 1; otherwise, it is 0. In the empirical analysis, we normalize the weight matrix by rows.

7.1.2. Variable Measurement and Data Sources

The measurement methods for each variable in Equation (26) are as follows:
(1)
Carbon emissions ( ln C O 2 ): Measured by the logarithm of the total carbon emissions of a city. First, estimate the national total carbon emissions C O 2 using the following method:
C O 2 = j = 1 n c j × λ j
where c j is the total consumption of the j type of fossil fuel, and λ j is the carbon emission coefficient of the j type of fossil fuel. Then, following the approach of Fan et al. [25], we match the total carbon emissions C O 2 to each city using the light-matching method to obtain the total carbon emissions of each city C O 2 .
(2)
Population ( ln p o p ): Measured by the logarithm of the ratio of the total population of a prefecture-level city to the total area of its administrative region. It represents the population factor affecting the environment.
(3)
Per capita income ( ln p g d p ): Measured by the logarithm of the per capita GDP of a city. It represents the wealth factor affecting the environment.
(4)
Industrial structure ( i n s t r ): Measured by the proportion of the value added of the secondary industry to GDP in a prefecture-level city. It represents the industrial factor affecting the environment.
(5)
Innovation input ( i n o v ): Measured by the proportion of scientific expenditure in fiscal expenditure of a prefecture-level city to GDP. It represents the technological factor affecting the environment.
In the empirical analysis, this study uses panel data from Chinese prefecture-level cities spanning from 2009 to 2019. The original data are sourced from the annual China City Statistical Yearbook. Descriptive statistics of the main variables are presented in Table 2, where N, Mean, Median, SD, Min, and Max, respectively, denote sample size, sample mean, sample median, sample standard deviation, minimum value, and maximum value.

7.2. Model Estimation Results and Analysis

We use the Bayesian estimation method constructed earlier to estimate the parameters and unknown function in Equation (26). Additionally, for comparison purposes, we also estimate the nonspatial linear model, nonspatial semiparametric model, and linear spatial model. The parameter results are shown in Table 3, and the nonparametric function estimation results are shown in Figure 4 and Figure 5.
From the estimation results in Table 3, it can be observed that in all four types of models, factors such as population, wealth, and industrial proportion have significant positive effects on carbon emissions. An increase in population size, wealth level, and industrial proportion will all promote an increase in carbon emissions. On the other hand, the impact of innovation input is the opposite; an increase in innovation input level helps suppress carbon emissions. Furthermore, from the estimation results of the variable coefficients, it can be concluded that population and wealth factors remain the core factors exacerbating carbon emissions, while improving the innovation level is an important means of mitigating carbon emissions. The impact of industrial structure on carbon emissions is relatively limited.
The estimation results of the linear spatial model reveal significant positive spatial clustering effects and spatial spillover effects in carbon emissions, consistent with existing literature. This also indicates that spatial correlation is an important factor that must be considered in the empirical model. Compared to the nonspatial linear model, the linear spatial model shows a significant decrease in the impact of factors such as population, wealth, and innovation input on carbon emissions, indicating that omitting spatial correlation can exaggerate the impact of these factors on carbon emissions.
From the estimation results of the nonspatial semiparametric model and the semiparametric spatial model, it can be observed that compared to the corresponding linear model, the estimation results of the parameter part in the nonlinear model are relatively close to the linear model. This suggests that the nonparametric structure of the model does not affect the identification of the parameter part and is therefore a robust setting. Based on the above analysis, it can be concluded that the semiparametric spatial lag model, which considers both spatial correlation and semiparametric structure, is a relatively robust model setting in empirical research.
Figure 4 and Figure 5 show the estimation results of the nonparametric component of two types of semiparametric models. The solid line represents the estimation of the unknown function, while the two dashed lines in between denote the 95% confidence band. From the figures, it can be observed that the impact of wealth level on carbon emissions exhibits an “inverted U” shape, indicating the presence of an Environmental Kuznets Curve effect for carbon emissions. Moreover, this effect has a “tail” characteristic, especially before reaching the turning point, where the promoting effect of wealth level on carbon emissions appears to be relatively flat. This suggests that using a simple linear model for estimation would miss the identification of the Environmental Kuznets Curve effect for carbon emissions. Additionally, when comparing the estimation results of the unknown function in the two types of models, it can be observed that the shapes of the unknown functions in both models are highly similar. This indicates that considering spatial correlation does not affect the identification of the nonparametric part of the model. However, the overall unknown function in the spatial model is larger than that in the nonspatial model, suggesting that ignoring spatial correlation would underestimate the unknown function. This further emphasizes that the semiparametric spatial model is a more robust model setting.
The empirical results above indicate that in China, there exists a complex and close relationship between economic growth and carbon dioxide emissions. The empirical research conclusions demonstrate the significant impact of factors such as population, wealth, industrial proportion, and innovation input on carbon emissions, revealing direct connections between certain aspects of economic growth and carbon emissions.
First, the increase in population size and wealth level has been found to have a promoting effect on carbon emissions, closely related to China’s current economic development stage and industrial structure. As the economy grows rapidly, the processes of industrialization and urbanization accelerate, leading to increased energy consumption and carbon emissions.
Second, the industrial proportion has a significant impact on carbon emissions, indicating that the industrial sector is one of the main sources of carbon emissions. This also reflects the environmental pressure posed by China’s industrial-based economic development model.
Third, the increase in innovation input helps to restrain the growth of carbon emissions, indicating that technological innovation plays a crucial role in slowing down the increase in carbon emissions. This demonstrates that China, while promoting economic growth, is actively seeking ways to reduce carbon emissions through technological innovation.
Furthermore, there is a nonlinear “inverted U” relationship between wealth level and carbon emissions, known as the Environmental Kuznets Curve effect. This suggests that with further economic growth, carbon emissions may reach a turning point and tend to decline, providing hope for the future decoupling of economic development and carbon emissions.

8. Conclusions

This paper proposes a semiparametric spatial lag model and develops a Bayesian estimation method for this model by using polynomial spline functions to fit the unknown link function. In the Bayesian estimation approach, the paper combines the Reversible Jump Markov Chain Monte Carlo method, Metropolis–Hastings algorithm with random walk, and Gibbs sampling techniques to sample all the parameters. The paper also conducts numerical simulations to demonstrate the Bayesian estimation theory using a numerical example, where the spatial weight matrix W is constructed using both Rook adjacency and Case adjacency spatial weight matrices. The numerical simulation results show that under both types of spatial weight matrices, the estimation performance of the parameter part and the nonparametric function is satisfactory, indicating that the proposed Bayesian estimation method has high estimation accuracy and robustness. Finally, the paper applies the proposed theoretical method to an empirical study on the nonlinear relationship between per capita GDP and carbon emissions in China, further demonstrating the applicability and value of the proposed Bayesian estimation method in empirical research.
The semiparametric spatial lag model and its Bayesian estimation method developed in this paper, by comprehensively considering the nonlinear relationships between variables and spatial correlations, provide a more accurate and robust analytical framework for understanding the complex relationship between China’s economic growth and carbon emissions, offering strong empirical support for policy-making. Firstly, the primary drivers of economic growth such as population growth, wealth accumulation, and industrial development significantly contribute to the increase in China’s carbon emissions, highlighting a direct link between economic development and environmental pressure. However, increased investment in innovation effectively mitigates this trend, indicating that technological innovation is key to balancing economic growth and environmental protection. Secondly, the paper reveals spatial agglomeration and spillover effects of carbon emissions, emphasizing the mutual influence of carbon emissions behavior between regions and suggesting the importance of considering regional collaborative governance in policy-making to address cross-regional environmental challenges. Furthermore, the study finds a nonlinear “inverted U” relationship between wealth levels and carbon emissions, known as the Environmental Kuznets Curve, suggesting that as economic growth progresses further, carbon emissions may reach a turning point and begin to decline, offering hope for decoupling future economic development from carbon emissions.
In comparison to traditional frequentist methods, the advantage of Bayesian estimation lies in its ability to better handle situations with small sample sizes, more complex models, and diverse parameters. The superior estimation capability of the Bayesian method in small sample situations allows the model and its estimation method proposed in this paper to be applicable to a wider range of empirical analyses. However, in Bayesian analysis, the construction of prior distributions is not a straightforward task, making the specification of prior distributions for model parameters generally challenging. Furthermore, in Bayesian analysis, typically only the kernel of the posterior distribution density function is known, making it difficult to obtain the specific density function and to find numerical quantiles of the cumulative distribution function. Therefore, the derivation of the posterior distribution is often challenging.

Author Contributions

Conceptualization, K.L. and L.F.; methodology, K.L. and L.F.; software, K.L.; validation, K.L. and L.F.; formal analysis, L.F.; investigation, K.L. and L.F.; resources, L.F.; data curation, K.L.; writing—original draft preparation, K.L. and L.F.; writing—review and editing, K.L. and L.F.; visualization, L.F.; supervision, L.F.; project administration, L.F.; funding acquisition, K.L. and L.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Social Science Fund of China, grant number 22BTJ026; National Natural Science Foundation of China, grant number 72003034; Natural Science Foundation of Fujian Province in China, grant number 2021J01113; Science and Technology Innovation Special Fund Project of Fujian Agricultural and Forestry University, grant numbers CXZX2022026, CXZX2023026, and KFb22106XA; and Outstanding Young Research Talent Program of Fujian Agriculture and Forestry University, grant number xjg201821.

Data Availability Statement

The data will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Long, X.; Zhu, Y.; Cai, W.; Li, S. An empirical analysis of spatial tax competition among Chinese counties based on spatial econometric models. Econ. Res. J. 2014, 49, 41–53. (In Chinese) [Google Scholar]
  2. Li, K.; Fang, L.; He, L. How urbanization affects China’s energy efficiency: A spatial econometric analysis. J. Clean. Prod. 2018, 200, 1130–1141. [Google Scholar] [CrossRef]
  3. Li, K.; Fang, L.; He, L. The impact of energy price on CO2 emissions in China: A spatial econometric analysis. Sci. Total Environ. 2020, 706, 135942. [Google Scholar] [CrossRef]
  4. Su, L.; Jin, S. Profile Quasi-maximum likelihood estimation of partially linear spatial autoregressive models. J. Econom. 2010, 157, 18–33. [Google Scholar] [CrossRef]
  5. Su, L. Semiparametric estimation of spatial autoregressive models. J. Econom. 2012, 167, 543–560. [Google Scholar] [CrossRef]
  6. Malikov, E.; Sun, Y. Semiparametric estimation and testing of smooth coefficient spatial autoregressive models. J. Econom. 2017, 199, 12–34. [Google Scholar] [CrossRef]
  7. Malikov, E.; Sun, Y.; Hite, D. (Under) Mining local residential property values: A semiparametric spatial quantile autoregression. J. Appl. Econom. 2019, 34, 82–109. [Google Scholar] [CrossRef]
  8. Li, K.; Chen, J. Profile maximum likelihood estimation of semi-parametric varying coefficient spatial lag model. J. Quant. Technol. Econ. 2013, 30, 85–98. (In Chinese) [Google Scholar]
  9. Sun, L. Estimation and Application of Panel Data Semiparametric Spatial Lag Models. Ph.D. Thesis, Xiamen University, Xiamen, China, 2015. (In Chinese). [Google Scholar]
  10. Gelfand, A.; Kottas, A.; MacEachern, S. Bayesian nonparametric spatial modeling with dirichlet process mixing. J. Am. Stat. Assoc. 2005, 100, 1021–1035. [Google Scholar] [CrossRef]
  11. Han, X.; Lee, L. Bayesian estimation and model selection for spatial durbin error model with finite distributed lags. Reg. Sci. Urban Econ. 2013, 43, 816–837. [Google Scholar] [CrossRef]
  12. Kang, E.; Cressie, N. Bayesian inference for the spatial random effects model. Publ. Am. Stat. Assoc. 2011, 106, 972–983. [Google Scholar] [CrossRef]
  13. Fang, L.; Qian, Z. Bayesian estimation of non-parametric spatial lag model. J. Quant. Technol. Econ. 2013, 30, 72–84. (In Chinese) [Google Scholar]
  14. Fang, L. Bayesian Estimation of the spatial lag model. Stat. Res. 2014, 31, 102–106. (In Chinese) [Google Scholar]
  15. Lesage, J. Spatial econometric panel data model specification: A Bayesian approach. Soc. Sci. Electron. Publ. 2014, 9, 122–145. [Google Scholar] [CrossRef]
  16. Louzada, F.; Diego, C.; Osafu, A. Spatial statistical models: An overview under the Bayesian approach. Axioms 2021, 10, 307. [Google Scholar] [CrossRef]
  17. Thapa, S.; Lomholt, M.; Krog, J.; Cherstvy, A.; Metzler, R. Bayesian analysis of single-particle tracking data using the nested-sampling algorithm: Maximum-likelihood model selection applied to stochastic-diffusivity data. Phys. Chem. Chem. Phys. 2018, 20, 29018–29037. [Google Scholar] [CrossRef]
  18. Gelfand, A.; Smith, A. Sampling-based approaches to calculating marginal densities. J. Am. Stat. Assoc. 1990, 85, 398–409. [Google Scholar] [CrossRef]
  19. Metropolis, N.; Rosenbluth, A.; Rosenbluth, M.; Augusta, H.; Edward, T. Equation of state calculations by fast computing machines. J. Chem. Phys. 1953, 21, 1087–1092. [Google Scholar] [CrossRef]
  20. Hastings, W. Monte Carlo sampling methods using Markov chains and their applications. Biometrika 1970, 57, 97–109. [Google Scholar] [CrossRef]
  21. Commoner, B. The Closing Circle, 1st ed.; Random House Inc.: New York, NY, USA, 1971. [Google Scholar]
  22. Ehrlich, P.; Holdren, J. A Bulletin dialogue on the ‘closing circle’: Critique: One dimensional ecology. Bull. At. Sci. 1972, 28, 16–27. [Google Scholar] [CrossRef]
  23. Dietz, T.; Rosa, E. Rethinking the environmental impacts of population, affluence and technology. Hum. Ecol. Rev. 1994, 1, 277–300. [Google Scholar]
  24. York, R.; Rosa, E.; Dietz, T. STIRPAT, IPAT and ImPACT: Analytic tools for unpacking the driving forces of environmental impacts. Ecol. Econ. 2003, 46, 351–365. [Google Scholar] [CrossRef]
  25. Fan, Z.; Peng, F.; Liu, C. Political connections and economic growth: Evidence from the DMSP/OLS satellite data. Econ. Res. J. 2016, 51, 114–126. (In Chinese) [Google Scholar]
Figure 1. Simulation results for Example 1. The left side shows the boxplots of the estimates of the components of parameter β , and the right side shows the boxplots of parameter ρ and the MSE between the function h and its true value.
Figure 1. Simulation results for Example 1. The left side shows the boxplots of the estimates of the components of parameter β , and the right side shows the boxplots of parameter ρ and the MSE between the function h and its true value.
Mathematics 12 02289 g001
Figure 2. Running status of the parameters ρ , β , and M S E _ h in Example 1.
Figure 2. Running status of the parameters ρ , β , and M S E _ h in Example 1.
Mathematics 12 02289 g002
Figure 3. Fitting of the function h in Example 1, where the solid line represents the actual curve, the dotted line represents the fitted function, and the two dashed lines are the 95% confidence limit curves.
Figure 3. Fitting of the function h in Example 1, where the solid line represents the actual curve, the dotted line represents the fitted function, and the two dashed lines are the 95% confidence limit curves.
Mathematics 12 02289 g003
Figure 4. Estimation result of the unknown function f ( ) in the nonspatial semiparametric.
Figure 4. Estimation result of the unknown function f ( ) in the nonspatial semiparametric.
Mathematics 12 02289 g004
Figure 5. Estimation result of the unknown function f ( ) in the semiparametric spatial model.
Figure 5. Estimation result of the unknown function f ( ) in the semiparametric spatial model.
Mathematics 12 02289 g005
Table 1. Estimated results of the model parameters.
Table 1. Estimated results of the model parameters.
ParametersTrueMeanMedianStandard
Deviation
95% HPD
ρ 0.30.27430.27630.0521(0.1723, 0.3764)
β 1 11.03281.02890.0982(0.8404, 1.2251)
β 2 1 / 2 0.75050.74750.0930(0.5682, 0.9328)
β 3 1 / 3 0.56070.55980.0905(0.3834, 0.7380)
Table 2. Descriptive statistics of the main variables.
Table 2. Descriptive statistics of the main variables.
VariableNMeanMedianSDMinMax
lnc31001.3571.3660.3230.4632.363
lnpop31002.5612.5670.2891.3083.538
lnpgdp31004.5964.5840.2703.6525.670
instr310047.48547.81510.34210.68082.240
ino31000.2550.1810.2610.0136.334
Table 3. Model parameter estimation results.
Table 3. Model parameter estimation results.
VariablesNonspatial ModelLinear Spatial ModelNonspatial
Semiparametric Model
Semiparametric Spatial Model
Parameter Estimation Results95% Confidence Interval
lnpop0.719 ***0.671 ***0.723 ***0.675(0.653, 0.697)
(0.012)(0.011)(0.012)
lnpgdp0.571 ***0.456 ***
(0.014)(0.014)
instr0.001 ***0.001 *0.001 ***0.001(−0.000, −0.0012)
(0.000)(0.000)(0.000)
ino−0.143 ***−0.127 ***−0.140 ***−0.127(−0.153, −0.101)
(0.014)(0.013)(0.015)
Constant−3.130 ***−3.126 ***
(0.068)(0.062)
W*lnc 0.495 *** 0.495(0.458, 0.532)
(0.019)
Standard errors in parentheses *** p < 0.01, * p < 0.1.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, K.; Fang, L. Bayesian Estimation of the Semiparametric Spatial Lag Model. Mathematics 2024, 12, 2289. https://doi.org/10.3390/math12142289

AMA Style

Li K, Fang L. Bayesian Estimation of the Semiparametric Spatial Lag Model. Mathematics. 2024; 12(14):2289. https://doi.org/10.3390/math12142289

Chicago/Turabian Style

Li, Kunming, and Liting Fang. 2024. "Bayesian Estimation of the Semiparametric Spatial Lag Model" Mathematics 12, no. 14: 2289. https://doi.org/10.3390/math12142289

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop