Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                

A Hybrid Framework for Spatial Interpolation:
Merging Data-driven with Domain Knowledge

Cong Zhang 1    Shuyi Du 1,2    Hongqing Song 1,2, Correspondence: songhongqing@ustb.edu.cn    Yuhe Wang 1,3, Correspondence: yuhe.wang@tamu.edu
(September 6, 2024)
Abstract

Estimating spatially distributed information through the interpolation of scattered observation datasets often overlooks the critical role of domain knowledge in understanding spatial dependencies. Additionally, the features of these datasets are typically limited to the spatial coordinates of the scattered observation locations. In this paper, we propose a hybrid framework that integrates data-driven spatial dependency feature extraction with rule-assisted spatial dependency function mapping to augment domain knowledge. We demonstrate the superior performance of our framework in two comparative application scenarios, highlighting its ability to capture more localized spatial features in the reconstructed distribution fields. Furthermore, we underscore its potential to enhance nonlinear estimation capabilities through the application of transformed fuzzy rules and to quantify the inherent uncertainties associated with the observation datasets. Our framework introduces an innovative approach to spatial information estimation by synergistically combining observational data with rule-assisted domain knowledge.

Keywords: domain knowledge integration, spatial interpolation, spatial dependency correlation, neuro-fuzzy system

11footnotetext: National & Local Joint Engineering Lab for Big Data Analysis and Computing Technology, Beijing 100190, China.22footnotetext: School of Civil and Resource Engineering, University of Science and Technology Beijing, Beijing 100083, China.33footnotetext: Institute for Scientific Computation, Texas A&M University, College Station, Texas 77843, USA.

1 Introduction

Spatially dependent properties are widespread across various fields including subsurface resource exploitation [1, 2, 3, 4, 5], water resources management [6, 7], traffic engineering [8, 9], and environmental studies [10, 11]. These spatially varying attributes are typically recorded at a limited number of observed points or monitoring stations, often insufficient to represent the entire and multiscale spatial distribution accurately [12, 13, 5]. Due to the impracticality of monitoring every part of a domain to fully capture its attribute field, spatial interpolation is commonly employed to estimate values at unobserved locations, utilizing observed data and underlying spatial dependency correlations [14, 15].

Numerous spatial interpolation methods exist, as reviewed by Li and Heap [16], who suggest that these methods often rely on similar principles, such as inverse distance weighting [17] and Kriging [18]. The inverse distance weighting approach, which is built on a straightforward distance-based method, is generally unsuitable for spatially aggregated observations. Kriging, on the other hand, functions as an unbiased linear estimator under the assumption of a stationary normal distribution. Since it utilizes the conditional distribution of a Gaussian process, Kriging is essentially a specific case of Gaussian process regression [19, 20]. However, all these methods are founded on simplified assumptions, including stationarity, linearity, and normality, which limit their applicability in real-world scenarios [20, 21, 22]. In particular, their reliance on linear interpolation makes them inadequate for handling complex nonlinear spatial dependency relationships, often resulting in suboptimal estimations for continuous attribute fields. Therefore, there is a clear need for alternative approaches that can construct non-stationary, nonlinear, and non-Gaussian spatial dependency functions for spatial interpolation.

Given their versatile nonlinear approximation capabilities, the emerging data-driven approaches are promising to provide an alternative framework for spatial information modeling [11, 23, 24, 25, 26]. Various machine learning-based methods, such as random forests [27, 28, 29], artificial neural networks [30, 31, 32], radial basis function networks [21, 33], long short-term memory networks [34, 35], convolutional neural networks [36, 37], conditional generative adversarial networks [22, 38, 39], and ensemble learning [40, 41], have been successfully adapted and applied to spatial interpolation. However, without significant reengineering, these original machine learning methods are not inherently designed for spatial interpolation and often fail to account for the available spatial configuration information. In other words, directly applying these machine learning models may pose challenges and may not be suitable for most spatial interpolation scenarios [42, 43].

While methods involving convolutional neural networks and generative adversarial networks are well-suited for imagery or regularly gridded datasets, such as terrain elevation images, observational data are often irregularly sparse and scattered [44, 45]. Traditional deep learning methods may not be ideal for extracting spatial dependency features from such sparse observations, as they typically only rely on the spatial coordinates of the observation locations as input features [46]. To address this limitation, modified approaches have been developed. For example, Wu et al. proposed an inductive graph neural network Kriging model that better leverages distance information [47], while Ma et al. introduced a geo-layer into long short-term memory networks to integrate spatial correlations from monitoring stations [48]. In essence, effectively extracting spatial dependency features from irregularly sparse observed data necessitates considering both individual observations and their neighboring spatial configurations.

In addition to the data-driven approaches, the family of rule-based fuzzy inference systems can effectively model nonlinear functions by leveraging inherent reasoning paradigms [49, 50, 51, 52, 53]. Unlike data-driven methods, fuzzy inference systems can directly and broadly incorporate expert knowledge, making them valuable for enhancing spatial interpolation [52]. For example, the first law of geography, which states that “near things are more related than distant things” [54], can be articulated as a fuzzy IF-THEN rule—IF two spatial observations are closer, THEN their spatial dependency correlations are stronger. This rule underpins many existing interpolation methods that rely on concepts of distance and neighborhood. Similarly, there is a wealth of domain-specific knowledge related to both general spatial dependencies and particular interpolation scenarios. For instance, Yesilkanat et al. [55] demonstrated the successful application of domain expertise for spatial interpolation of environmental radioactivity using fuzzy IF-THEN rules, either collected or self-defined. However, their fuzzy rulesets were tailored to specific domains rather than forming a general rule base for spatial dependency extraction. Therefore, there is a need to develop a more general rule-based spatial interpolation framework that can effectively exploit and utilize latent domain knowledge. It is important to note that domain knowledge related to spatial dependency is often difficult to collect and represent in the form of fuzzy IF-THEN rules. Moreover, the lack of standardized approaches to transforming domain knowledge into rule bases limits the effective use of such knowledge. Consequently, it is both necessary and advantageous to systematically and automatically generate fuzzy rulesets to incorporate domain knowledge. Another advantage of rule-based fuzzy inference systems is their ability to tolerate inaccurate information. Observation data and monitoring records are often subject to errors from various sources [56], which can negatively impact interpolation accuracy. A robust spatial interpolation approach should account for these uncertainties associated with the observed data. Notably, fuzzy logic is well-suited for handling such inherent uncertainties, thereby enhancing the performance of spatial interpolation [57, 58].

In this paper, we present a hybrid framework merging data-driven and domain knowledge for interpolating spatially dependent properties. The framework extracts latent spatial dependency basis by leveraging observation data and neighboring information. By embedding a fuzzy inference system within our adaptive network, we can automatically transform domain knowledge into fuzzy rulesets. This framework capitalizes on the advantages of fuzzy reasoning, particularly its ability to tolerate inaccurate information and manage nonlinear spatially dependent functions. We validate our framework in two scenarios: subsurface formation parameter estimation and air quality mapping. Additionally, we conduct comparative studies to assess the estimation performance quantitatively and qualitatively against conventional interpolation techniques, including ordinary Kriging, inverse distance weighting, and Gaussian process regression.

2 Hybrid data-driven and rule-assisted learning framework

To address the limitations mentioned above, our framework combines the extraction of spatial dependency features from observation data with the transformation of latent domain knowledge into fuzzy rulesets. For constructing a fuzzy inference system, it is ideal to automatically convert domain knowledge into fuzzy IF-THEN rules. The Adaptive-Network-based Fuzzy Inference System (ANFIS) developed by J.S. Jang [49] provides a solid foundation for this task. However, the number of rules generated by ANFIS can become intractable as the number of input features increases. Specifically, if the number of membership functions assigned to each input dimension is fixed and the dimensionality increases linearly, the number of generated rules grows exponentially. This exponential growth can significantly limit the applicability of ANFIS when incorporating both spatial coordinates and relevant neighboring information as input features. Inspired by ANFIS, we propose an enhanced architecture to overcome this issue by input feature decomposition. As shown in Figure 1, our architecture integrates a data-driven approach with rule-based assistance. The network consists of an intrinsic input layer, a ruleset layer, a T-Norm operation layer, a normalization layer, a consequent layer, and a summation layer. Additionally, it includes a newly designed spatial dependency layer to exploit sparse observation data and their neighboring information. Essentially, our architecture retains the ANFIS learning mechanism for constructing fuzzy IF-THEN rules and approximating the estimation function.

Refer to caption
Figure 1: The network architecture of the hybrid framework based on ANFIS

The detailed architecture of our hybrid framework is illustrated in Figure 2. It has two main components: 1) data-driven Spatial Dependency Basis (SDB) extraction, and 2) rule-assisted spatial dependency function approximation. The SDB serves as a critical link between these two components, significantly reducing the number of generated rules. In the following sections, we outline the main steps and their corresponding implementation.

Refer to caption
Figure 2: The detailed architecture of the hybrid framework

2.1 Spatial Dependency Basis (SDB) extraction

We apply the reduced-rank approach for obtaining SDB [59]. The SDB is extracted from spatial observations to represent fixed spatial features. Rather than using the direct input features from these spatial observations, we use SDB as the input for the rule-based ANFIS. This approach aims to minimize the number of automatically generated fuzzy IF-THEN rules while fully leveraging the available spatial observations. As illustrated in Figure 2, the extraction of the SDB begins with constructing nearest neighboring spatial covariates.

2.1.1 Nearest neighboring spatial covariates

We use nearest neighboring spatial covariates [27, 42], which is the combination of the environmental covariates with the spatial covariates, to comprehensively describe the spatial dependency relationship from spatial observations. Compared to approaches that consider only spatial coordinates as input features, these combined covariates better characterize spatial correlations by fully accounting for spatial coordinates, observation data, and neighboring configurations. The neighboring configurations are established by constructing a neighboring graph for each observation location. We use nearest neighboring algorithm to select the m nearest neighbors for each observation location based on Euclidean distance, see Equation (1).

dij=𝐎𝐛𝐬i𝐎𝐛𝐬j2subscript𝑑𝑖𝑗subscriptnormsubscript𝐎𝐛𝐬𝑖subscript𝐎𝐛𝐬𝑗2d_{ij}=\|\mathbf{Obs}_{i}-\mathbf{Obs}_{j}\|_{2}italic_d start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = ∥ bold_Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - bold_Obs start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT (1)

where 𝐎𝐛𝐬isubscript𝐎𝐛𝐬𝑖\mathbf{Obs}_{i}bold_Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and 𝐎𝐛𝐬jsubscript𝐎𝐛𝐬𝑗\mathbf{Obs}_{j}bold_Obs start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT are the i𝑖iitalic_i-th and j𝑗jitalic_j-th observed points, respectively. In 2D, 𝐎𝐛𝐬i=(xi,yi)subscript𝐎𝐛𝐬𝑖subscript𝑥𝑖subscript𝑦𝑖\mathbf{Obs}_{i}=(x_{i},y_{i})bold_Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), and in 3D, if necessary, 𝐎𝐛𝐬i=(xi,yi,zi)subscript𝐎𝐛𝐬𝑖subscript𝑥𝑖subscript𝑦𝑖subscript𝑧𝑖\mathbf{Obs}_{i}=(x_{i},y_{i},z_{i})bold_Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ). The value dijsubscript𝑑𝑖𝑗d_{ij}italic_d start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT represents the Euclidean distance between 𝐎𝐛𝐬isubscript𝐎𝐛𝐬𝑖\mathbf{Obs}_{i}bold_Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and 𝐎𝐛𝐬jsubscript𝐎𝐛𝐬𝑗\mathbf{Obs}_{j}bold_Obs start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT.

As illustrated in Figure 3, the observation locations and their corresponding neighbors form a neighboring graph. This graph is then used to construct the associated features, allowing the features at a specific observation location to encompass both its own data and the information from its neighbors.

Refer to caption
Figure 3: An illustration of Neighboring graph

We can express the neighboring graph using nearest neighboring spatial covariates to generate more valuable features for each observation. For a given observation Obsi=(xi,yi)subscriptObs𝑖subscript𝑥𝑖subscript𝑦𝑖\text{Obs}_{i}=(x_{i},y_{i})Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = ( italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) with observation value of ΦisubscriptΦ𝑖\Phi_{i}roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and the m𝑚mitalic_m nearest neighbors, the corresponding neighboring spatial covariates are written as:

[Obsi]=[xi,yi,xi1,yi1,Φi1,xi2,yi2,Φi2,,xij,yij,Φij,,xim,yim,Φim]1×(3m+2)\text{[}{Obs}_{i}]=[x_{i},y_{i},x_{i}^{1},y_{i}^{1},\Phi_{i}^{1},x_{i}^{2},y_{% i}^{2},\Phi_{i}^{2},\dots,x_{i}^{j},y_{i}^{j},\Phi_{i}^{j},\dots,x_{i}^{m},y_{% i}^{m},\Phi_{i}^{m}]_{1\times(3m+2)}[ italic_O italic_b italic_s start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ] = [ italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT , roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT ] start_POSTSUBSCRIPT 1 × ( 3 italic_m + 2 ) end_POSTSUBSCRIPT (2)

where xisubscript𝑥𝑖x_{i}italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and yisubscript𝑦𝑖y_{i}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are the spatial coordinates of the i𝑖iitalic_i-th observed point, and i=1,2,,N𝑖12𝑁i=1,2,\dots,Nitalic_i = 1 , 2 , … , italic_N if there are N𝑁Nitalic_N observations in total. For the case with N𝑁Nitalic_N observed points, Obs1,Obs2,,Obsi,,ObsNsubscriptObs1subscriptObs2subscriptObs𝑖subscriptObs𝑁\text{Obs}_{1},\text{Obs}_{2},\dots,\text{Obs}_{i},\dots,\text{Obs}_{N}Obs start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , Obs start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , … , Obs start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT, we can construct a matrix of nearest neighboring spatial covariates as shown in Equation (3). The superscript j𝑗jitalic_j represents the j𝑗jitalic_j-th nearest neighbor of the i𝑖iitalic_i-th observed point, and j=1,2,,m𝑗12𝑚j=1,2,\dots,mitalic_j = 1 , 2 , … , italic_m.

[𝑂𝑏𝑠1𝑂𝑏𝑠2𝑂𝑏𝑠i𝑂𝑏𝑠N]=[x1y1x11y11Φ11x12y12Φ12x1jy1jΦ1jx1my1mΦ1mx2y2x21y21Φ21x22y22Φ22x2jy2jΦ2jx2my2mΦ2mxiyixi1yi1Φi1xi2yi2Φi2xijyijΦijximyimΦimxNyNxN1yN1ΦN1xN2yN2ΦN2xNjyNjΦNjxNmyNmΦNm]matrixsubscript𝑂𝑏𝑠1subscript𝑂𝑏𝑠2subscript𝑂𝑏𝑠𝑖subscript𝑂𝑏𝑠𝑁delimited-[]subscript𝑥1subscript𝑦1superscriptsubscript𝑥11superscriptsubscript𝑦11superscriptsubscriptΦ11superscriptsubscript𝑥12superscriptsubscript𝑦12superscriptsubscriptΦ12superscriptsubscript𝑥1𝑗superscriptsubscript𝑦1𝑗superscriptsubscriptΦ1𝑗superscriptsubscript𝑥1𝑚superscriptsubscript𝑦1𝑚superscriptsubscriptΦ1𝑚subscript𝑥2subscript𝑦2superscriptsubscript𝑥21superscriptsubscript𝑦21superscriptsubscriptΦ21superscriptsubscript𝑥22superscriptsubscript𝑦22superscriptsubscriptΦ22superscriptsubscript𝑥2𝑗superscriptsubscript𝑦2𝑗superscriptsubscriptΦ2𝑗superscriptsubscript𝑥2𝑚superscriptsubscript𝑦2𝑚superscriptsubscriptΦ2𝑚subscript𝑥𝑖subscript𝑦𝑖superscriptsubscript𝑥𝑖1superscriptsubscript𝑦𝑖1superscriptsubscriptΦ𝑖1superscriptsubscript𝑥𝑖2superscriptsubscript𝑦𝑖2superscriptsubscriptΦ𝑖2superscriptsubscript𝑥𝑖𝑗superscriptsubscript𝑦𝑖𝑗superscriptsubscriptΦ𝑖𝑗superscriptsubscript𝑥𝑖𝑚superscriptsubscript𝑦𝑖𝑚superscriptsubscriptΦ𝑖𝑚subscript𝑥𝑁subscript𝑦𝑁superscriptsubscript𝑥𝑁1superscriptsubscript𝑦𝑁1superscriptsubscriptΦ𝑁1superscriptsubscript𝑥𝑁2superscriptsubscript𝑦𝑁2superscriptsubscriptΦ𝑁2superscriptsubscript𝑥𝑁𝑗superscriptsubscript𝑦𝑁𝑗superscriptsubscriptΦ𝑁𝑗superscriptsubscript𝑥𝑁𝑚superscriptsubscript𝑦𝑁𝑚superscriptsubscriptΦ𝑁𝑚\begin{bmatrix}\mathit{Obs}_{1}\\ \mathit{Obs}_{2}\\ \vdots\\ \mathit{Obs}_{i}\\ \vdots\\ \mathit{Obs}_{N}\\ \end{bmatrix}=\left[\begin{array}[]{cccccccccccccccc}x_{1}&y_{1}&x_{1}^{1}&y_{% 1}^{1}&\Phi_{1}^{1}&x_{1}^{2}&y_{1}^{2}&\Phi_{1}^{2}&\dots&x_{1}^{j}&y_{1}^{j}% &\Phi_{1}^{j}&\dots&x_{1}^{m}&y_{1}^{m}&\Phi_{1}^{m}\\ x_{2}&y_{2}&x_{2}^{1}&y_{2}^{1}&\Phi_{2}^{1}&x_{2}^{2}&y_{2}^{2}&\Phi_{2}^{2}&% \dots&x_{2}^{j}&y_{2}^{j}&\Phi_{2}^{j}&\dots&x_{2}^{m}&y_{2}^{m}&\Phi_{2}^{m}% \\ \vdots&\vdots&\vdots&\vdots&\vdots&\vdots&\vdots&\vdots&\ddots&\vdots&\vdots&% \vdots&\ddots&\vdots&\vdots&\vdots\\ x_{i}&y_{i}&x_{i}^{1}&y_{i}^{1}&\Phi_{i}^{1}&x_{i}^{2}&y_{i}^{2}&\Phi_{i}^{2}&% \dots&x_{i}^{j}&y_{i}^{j}&\Phi_{i}^{j}&\dots&x_{i}^{m}&y_{i}^{m}&\Phi_{i}^{m}% \\ \vdots&\vdots&\vdots&\vdots&\vdots&\vdots&\vdots&\vdots&\ddots&\vdots&\vdots&% \vdots&\ddots&\vdots&\vdots&\vdots\\ x_{N}&y_{N}&x_{N}^{1}&y_{N}^{1}&\Phi_{N}^{1}&x_{N}^{2}&y_{N}^{2}&\Phi_{N}^{2}&% \dots&x_{N}^{j}&y_{N}^{j}&\Phi_{N}^{j}&\dots&x_{N}^{m}&y_{N}^{m}&\Phi_{N}^{m}% \end{array}\right][ start_ARG start_ROW start_CELL italic_Obs start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL italic_Obs start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_Obs start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_CELL end_ROW end_ARG ] = [ start_ARRAY start_ROW start_CELL italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL … end_CELL start_CELL italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_CELL start_CELL … end_CELL start_CELL italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL … end_CELL start_CELL italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_CELL start_CELL … end_CELL start_CELL italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL … end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_CELL start_CELL … end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL end_ROW start_ROW start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋱ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL start_CELL ⋮ end_CELL end_ROW start_ROW start_CELL italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL … end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT end_CELL start_CELL … end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY ]

(3)

For each observation location, the dimension of its nearest neighbor spatial covariates is 3m+23𝑚23m+23 italic_m + 2. Assigning two membership functions to each dimension results in the adaptive neural networks automatically generating 23m+2superscript23𝑚22^{3m+2}2 start_POSTSUPERSCRIPT 3 italic_m + 2 end_POSTSUPERSCRIPT fuzzy IF-THEN rules. This can lead to a significant number of generated rules, causing computational challenges in tuning and updating the hyperparameters during training to accurately map the nonlinear spatial dependency function. To address this, we further extract primary features that can characterize the spatial correlations using SDB.

2.1.2 Reduced-rank basis extraction

We use Uniform Manifold Approximation and Projection (UMAP) [60] to decompose the nearest neighboring spatial covariates into several fixed reduced-rank bases specifically targeting spatial dependency basis. According to Equation (3), the dimension of the nearest neighboring Spatial Covariates (SC) is 3m+23𝑚23m+23 italic_m + 2. If we want to reduce the dimensionality to d𝑑ditalic_d, we can write such reduction as in Equations (4) and (5) to extract the Spatial Dependency Basis (SDB).

SCi=𝑂𝑏𝑠i=[xiyixi1yi1Φi1xi2yi2Φi2ximyimΦim]3m+2𝑆subscript𝐶𝑖subscript𝑂𝑏𝑠𝑖delimited-[]subscript𝑥𝑖subscript𝑦𝑖superscriptsubscript𝑥𝑖1superscriptsubscript𝑦𝑖1superscriptsubscriptΦ𝑖1superscriptsubscript𝑥𝑖2superscriptsubscript𝑦𝑖2superscriptsubscriptΦ𝑖2superscriptsubscript𝑥𝑖𝑚superscriptsubscript𝑦𝑖𝑚superscriptsubscriptΦ𝑖𝑚superscript3𝑚2SC_{i}=\mathit{Obs}_{i}=\left[\begin{array}[]{cccccccccccc}x_{i}&y_{i}&x_{i}^{% 1}&y_{i}^{1}&\Phi_{i}^{1}&x_{i}^{2}&y_{i}^{2}&\Phi_{i}^{2}&\dots&x_{i}^{m}&y_{% i}^{m}&\Phi_{i}^{m}\end{array}\right]\in\mathbb{R}^{3m+2}italic_S italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = [ start_ARRAY start_ROW start_CELL italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL … end_CELL start_CELL italic_x start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL start_CELL roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT end_CELL end_ROW end_ARRAY ] ∈ blackboard_R start_POSTSUPERSCRIPT 3 italic_m + 2 end_POSTSUPERSCRIPT

(4)
SDBi=[SDBi1SDBi2SDBid]d𝑆𝐷subscript𝐵𝑖matrix𝑆𝐷superscriptsubscript𝐵𝑖1𝑆𝐷superscriptsubscript𝐵𝑖2𝑆𝐷superscriptsubscript𝐵𝑖𝑑superscript𝑑SDB_{i}=\begin{bmatrix}SDB_{i}^{1}&SDB_{i}^{2}&\dots&SDB_{i}^{d}\end{bmatrix}% \in\mathbb{R}^{d}italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT end_CELL start_CELL italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_CELL start_CELL … end_CELL start_CELL italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT end_CELL end_ROW end_ARG ] ∈ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT (5)

Where i𝑖iitalic_i denotes the i𝑖iitalic_i-th observation location.

Then we construct a local relationship graph [61] based on the SC by converting the differences between neighbors into weights or probabilities using Equation (6).

Wij=exp(d(SCi,SCj)piσi)subscript𝑊𝑖𝑗𝑑𝑆subscript𝐶𝑖𝑆subscript𝐶𝑗subscript𝑝𝑖subscript𝜎𝑖W_{ij}=\exp\left(-\frac{d(SC_{i},SC_{j})-p_{i}}{\sigma_{i}}\right)italic_W start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = roman_exp ( - divide start_ARG italic_d ( italic_S italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_S italic_C start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) - italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) (6)

Where d(SCi,SCj)𝑑𝑆subscript𝐶𝑖𝑆subscript𝐶𝑗d(SC_{i},SC_{j})italic_d ( italic_S italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_S italic_C start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) is the distance between the i𝑖iitalic_i-th and j𝑗jitalic_j-th observation.

Apparently, a shorter distance implies a stronger spatial dependency relationship. Hence, d(SCi,SCj)𝑑𝑆subscript𝐶𝑖𝑆subscript𝐶𝑗d(SC_{i},SC_{j})italic_d ( italic_S italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_S italic_C start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) is considered as the spatial dependency between these two observations. pisubscript𝑝𝑖p_{i}italic_p start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the local spatial dependency between the i𝑖iitalic_i-th observation and its adjacent neighbor. σisubscript𝜎𝑖\sigma_{i}italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the local spatial dependency between the i𝑖iitalic_i-th observation and its m𝑚mitalic_m nearest neighbors, which is regularized as jWij=log2(m)subscript𝑗subscript𝑊𝑖𝑗subscript2𝑚\sum_{j}W_{ij}=\log_{2}(m)∑ start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT italic_W start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = roman_log start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_m ).

We then build an embedding spatial dependency basis which can preserve the structure of the local relationship graph between the nearest neighboring spatial covariates. This basis is optimized to minimize the difference between SC and SDB with respect to fuzzy set cross entropy in Equations (7) and (8).

C(Wij,μij)=ij[Wijlog(Wijμij)+(1Wij)log(1Wij1μij)]𝐶subscript𝑊𝑖𝑗subscript𝜇𝑖𝑗subscript𝑖𝑗delimited-[]subscript𝑊𝑖𝑗subscript𝑊𝑖𝑗subscript𝜇𝑖𝑗1subscript𝑊𝑖𝑗1subscript𝑊𝑖𝑗1subscript𝜇𝑖𝑗C(W_{ij},\mu_{ij})=\sum_{ij}{}\left[W_{ij}\log\left(\frac{W_{ij}}{\mu_{ij}}% \right)+(1-W_{ij})\log\left(\frac{1-W_{ij}}{1-\mu_{ij}}\right)\right]italic_C ( italic_W start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT [ italic_W start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT roman_log ( divide start_ARG italic_W start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG italic_μ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG ) + ( 1 - italic_W start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT ) roman_log ( divide start_ARG 1 - italic_W start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_μ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT end_ARG ) ] (7)
μij=[1+ad(SDBi,SDBj)2b]1subscript𝜇𝑖𝑗superscriptdelimited-[]1𝑎𝑑superscript𝑆𝐷subscript𝐵𝑖𝑆𝐷subscript𝐵𝑗2𝑏1\mu_{ij}=\left[1+a\cdot d(SDB_{i},SDB_{j})^{2b}\right]^{-1}italic_μ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT = [ 1 + italic_a ⋅ italic_d ( italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_S italic_D italic_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 italic_b end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT (8)

The local relationship graph between extracted SDB is characterized by the weights μijsubscript𝜇𝑖𝑗\mu_{ij}italic_μ start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT. d(SDBi,SDBj)𝑑𝑆𝐷subscript𝐵𝑖𝑆𝐷subscript𝐵𝑗d(SDB_{i},SDB_{j})italic_d ( italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_S italic_D italic_B start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) denotes the distance between the i𝑖iitalic_i-th observation location and the j𝑗jitalic_j-th observation location with respect to the dataset of SDB. a𝑎aitalic_a and b𝑏bitalic_b are adjusted by non-linear least squares fitting.

Further implementation of UMAP, such as the fuzzy topological representation and parameter calculation, can be found in [60, 61].

2.2 Spatial dependency estimation

Estimating a parameter field involves approximating the spatial dependency function using both the Spatial Dependency Basis (SDB) and domain knowledge, particularly in the form of automatically generated fuzzy IF-THEN rules. As illustrated in Figure 2, our rule-assisted adaptive network transforms domain knowledge related to spatial dependency into a rule base.

ANFIS combines the nonlinear function approximation capabilities of neural networks with the knowledge utilization strengths of fuzzy inference systems. In this context, we adapt ANFIS to model the nonlinear spatial dependency function by generating latent knowledge from the dataset of spatial dependency bases. For example, in a 4-dimensional case, the transformed knowledge in the form of IF-THEN rules can be expressed as:

IF SDBi1 is 𝑅𝑢𝑙𝑒11 and SDBi2 is 𝑅𝑢𝑙𝑒21 and SDBi3 is 𝑅𝑢𝑙𝑒31 and SDBi4 is 𝑅𝑢𝑙𝑒41𝑆𝐷subscript𝐵𝑖1 is subscript𝑅𝑢𝑙𝑒11 and 𝑆𝐷subscript𝐵𝑖2 is subscript𝑅𝑢𝑙𝑒21 and 𝑆𝐷subscript𝐵𝑖3 is subscript𝑅𝑢𝑙𝑒31 and 𝑆𝐷subscript𝐵𝑖4 is subscript𝑅𝑢𝑙𝑒41\displaystyle SDB_{i1}\text{ is }\mathit{Rule}_{11}\text{ and }SDB_{i2}\text{ % is }\mathit{Rule}_{21}\text{ and }SDB_{i3}\text{ is }\mathit{Rule}_{31}\text{ % and }SDB_{i4}\text{ is }\mathit{Rule}_{41}italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i 1 end_POSTSUBSCRIPT is italic_Rule start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT and italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i 2 end_POSTSUBSCRIPT is italic_Rule start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT and italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i 3 end_POSTSUBSCRIPT is italic_Rule start_POSTSUBSCRIPT 31 end_POSTSUBSCRIPT and italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i 4 end_POSTSUBSCRIPT is italic_Rule start_POSTSUBSCRIPT 41 end_POSTSUBSCRIPT (9)
THEN Φi=CiSDBisubscriptΦ𝑖subscript𝐶𝑖𝑆𝐷subscript𝐵𝑖\displaystyle\Phi_{i}=\overrightarrow{C_{i}}\cdot\overrightarrow{SDB_{i}}roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = over→ start_ARG italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ⋅ over→ start_ARG italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG

where SDBi=[SDBi1,SDBi2,SDBi3,SDBi4,1]𝑆𝐷subscript𝐵𝑖𝑆𝐷subscript𝐵𝑖1𝑆𝐷subscript𝐵𝑖2𝑆𝐷subscript𝐵𝑖3𝑆𝐷subscript𝐵𝑖41\overrightarrow{SDB_{i}}=\left[SDB_{i1},SDB_{i2},SDB_{i3},SDB_{i4},1\right]over→ start_ARG italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG = [ italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i 1 end_POSTSUBSCRIPT , italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i 2 end_POSTSUBSCRIPT , italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i 3 end_POSTSUBSCRIPT , italic_S italic_D italic_B start_POSTSUBSCRIPT italic_i 4 end_POSTSUBSCRIPT , 1 ] includes the 4-dimensional spatial dependency basis and one as a constant. Cisubscript𝐶𝑖C_{i}italic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the consequent parameter matrix to be discussed later, and ΦisubscriptΦ𝑖\Phi_{i}roman_Φ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the estimated value at an unknown location. 𝑅𝑢𝑙𝑒11,𝑅𝑢𝑙𝑒21,𝑅𝑢𝑙𝑒31,and 𝑅𝑢𝑙𝑒41subscript𝑅𝑢𝑙𝑒11subscript𝑅𝑢𝑙𝑒21subscript𝑅𝑢𝑙𝑒31and subscript𝑅𝑢𝑙𝑒41\mathit{Rule}_{11},\mathit{Rule}_{21},\mathit{Rule}_{31},\text{and }\mathit{% Rule}_{41}italic_Rule start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_Rule start_POSTSUBSCRIPT 21 end_POSTSUBSCRIPT , italic_Rule start_POSTSUBSCRIPT 31 end_POSTSUBSCRIPT , and italic_Rule start_POSTSUBSCRIPT 41 end_POSTSUBSCRIPT are the corresponding defined linguistic labels of each spatial dependency basis, respectively.

A complete IF-THEN rule consists of both the antecedent clause in the IF part and the consequent clause in the THEN part. Typically, linguistic labels are often defined such as “Close,” “Medium,” and “Fair.” In this study, we refer to these linguistic labels as 𝑅𝑢𝑙𝑒ijsubscript𝑅𝑢𝑙𝑒𝑖𝑗\mathit{Rule}_{ij}italic_Rule start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT for convenience. 𝑅𝑢𝑙𝑒ijsubscript𝑅𝑢𝑙𝑒𝑖𝑗\mathit{Rule}_{ij}italic_Rule start_POSTSUBSCRIPT italic_i italic_j end_POSTSUBSCRIPT represents the j𝑗jitalic_j-th linguistic label of the i𝑖iitalic_i-th spatial dependency basis. Additionally, it is characterized by a bell membership function as shown in Equation (10):

μ𝑅𝑢𝑙𝑒=11+[(sge)2]fsubscript𝜇𝑅𝑢𝑙𝑒11superscriptdelimited-[]superscript𝑠𝑔𝑒2𝑓\mu_{\mathit{Rule}}=\frac{1}{1+\left[\left(\frac{s-g}{e}\right)^{2}\right]^{f}}italic_μ start_POSTSUBSCRIPT italic_Rule end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG 1 + [ ( divide start_ARG italic_s - italic_g end_ARG start_ARG italic_e end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ] start_POSTSUPERSCRIPT italic_f end_POSTSUPERSCRIPT end_ARG (10)

where s𝑠sitalic_s is the input data into the rules-set layer (shown in Figure 2), 𝑅𝑢𝑙𝑒𝑅𝑢𝑙𝑒\mathit{Rule}italic_Rule is the linguistic label, and e,f,g𝑒𝑓𝑔e,f,gitalic_e , italic_f , italic_g are premise parameters to be adjusted during the training process. The bell membership function is important as it can tolerate imprecise information by assigning a certain degree (between 0 and 1) of membership to each input s𝑠sitalic_s based on fuzzy logic. This quantifies the uncertainties associated with the observation data.

Within our ANFIS architecture, the rules-set layer is predefined to set the number of membership functions (or the number of linguistic labels) directly, which determines the number of the finally generated IF-THEN rules. Each node in this layer is given by Equation (11):

Ol,kRuleset=μ𝑅𝑢𝑙𝑒l,k(x)superscriptsubscript𝑂𝑙𝑘𝑅𝑢𝑙𝑒𝑠𝑒𝑡subscript𝜇subscript𝑅𝑢𝑙𝑒𝑙𝑘𝑥O_{\mathit{l,k}}^{Rule-set}=\mu_{\mathit{Rule}_{l,k}}(x)italic_O start_POSTSUBSCRIPT italic_l , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R italic_u italic_l italic_e - italic_s italic_e italic_t end_POSTSUPERSCRIPT = italic_μ start_POSTSUBSCRIPT italic_Rule start_POSTSUBSCRIPT italic_l , italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_x ) (11)

where Ol,kRulesetsuperscriptsubscript𝑂𝑙𝑘𝑅𝑢𝑙𝑒𝑠𝑒𝑡O_{\mathit{l,k}}^{Rule-set}italic_O start_POSTSUBSCRIPT italic_l , italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_R italic_u italic_l italic_e - italic_s italic_e italic_t end_POSTSUPERSCRIPT is the k𝑘kitalic_k-th node or membership function of the l𝑙litalic_l-th dimensional input spatial dependency basis that makes the SDBl𝑆𝐷subscript𝐵𝑙SDB_{l}italic_S italic_D italic_B start_POSTSUBSCRIPT italic_l end_POSTSUBSCRIPT fuzzy between 0 and 1, where l=1,2,,d𝑙12𝑑l=1,2,\dots,ditalic_l = 1 , 2 , … , italic_d and d𝑑ditalic_d is the dimensionality of the extracted spatial dependency basis. In this study, we assign two membership functions for each input, so k=1,2𝑘12k=1,2italic_k = 1 , 2. As a result, we can write the premise parameters in P=[e11,f11,g11,e12,f12,g12,,ed1,fd1,gd1,ed2,fd2,gd2]1×6d𝑃subscriptsubscript𝑒11subscript𝑓11subscript𝑔11subscript𝑒12subscript𝑓12subscript𝑔12subscript𝑒𝑑1subscript𝑓𝑑1subscript𝑔𝑑1subscript𝑒𝑑2subscript𝑓𝑑2subscript𝑔𝑑216𝑑\vec{P}=[e_{11},f_{11},g_{11},e_{12},f_{12},g_{12},\dots,e_{d1},f_{d1},g_{d1},% e_{d2},f_{d2},g_{d2}]_{1\times 6d}over→ start_ARG italic_P end_ARG = [ italic_e start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT 11 end_POSTSUBSCRIPT , italic_e start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT 12 end_POSTSUBSCRIPT , … , italic_e start_POSTSUBSCRIPT italic_d 1 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT italic_d 1 end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT italic_d 1 end_POSTSUBSCRIPT , italic_e start_POSTSUBSCRIPT italic_d 2 end_POSTSUBSCRIPT , italic_f start_POSTSUBSCRIPT italic_d 2 end_POSTSUBSCRIPT , italic_g start_POSTSUBSCRIPT italic_d 2 end_POSTSUBSCRIPT ] start_POSTSUBSCRIPT 1 × 6 italic_d end_POSTSUBSCRIPT.

The T-Norm layer performs a multiplication operation on the signals from the previous layer. It outputs the weight ωtsubscript𝜔𝑡\omega_{t}italic_ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT or the firing strength of a rule, according to Equation (12):

OtTNorm=ωt=μ𝑅𝑢𝑙𝑒l1k1(s1)μ𝑅𝑢𝑙𝑒l2k2(s2)μ𝑅𝑢𝑙𝑒l3k3(s3)μ𝑅𝑢𝑙𝑒l4k4(s4)subscript𝑂superscript𝑡𝑇𝑁𝑜𝑟𝑚subscript𝜔𝑡subscript𝜇subscript𝑅𝑢𝑙𝑒subscript𝑙1subscript𝑘1subscript𝑠1subscript𝜇subscript𝑅𝑢𝑙𝑒subscript𝑙2subscript𝑘2subscript𝑠2subscript𝜇subscript𝑅𝑢𝑙𝑒subscript𝑙3subscript𝑘3subscript𝑠3subscript𝜇subscript𝑅𝑢𝑙𝑒subscript𝑙4subscript𝑘4subscript𝑠4O_{\mathit{t}^{TNorm}}=\omega_{t}=\mu_{\mathit{Rule}_{l_{1}k_{1}}}(s_{1})\cdot% \mu_{\mathit{Rule}_{l_{2}k_{2}}}(s_{2})\cdot\mu_{\mathit{Rule}_{l_{3}k_{3}}}(s% _{3})\cdot\mu_{\mathit{Rule}_{l_{4}k_{4}}}(s_{4})italic_O start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT italic_T italic_N italic_o italic_r italic_m end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = italic_ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT italic_Rule start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ⋅ italic_μ start_POSTSUBSCRIPT italic_Rule start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) ⋅ italic_μ start_POSTSUBSCRIPT italic_Rule start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) ⋅ italic_μ start_POSTSUBSCRIPT italic_Rule start_POSTSUBSCRIPT italic_l start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT italic_k start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_s start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ) (12)

where OtTNormsuperscriptsubscript𝑂𝑡𝑇𝑁𝑜𝑟𝑚O_{\mathit{t}}^{TNorm}italic_O start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_T italic_N italic_o italic_r italic_m end_POSTSUPERSCRIPT is the t𝑡titalic_t-th weight and t=1,2,3,,2d𝑡123superscript2𝑑t=1,2,3,\dots,2^{d}italic_t = 1 , 2 , 3 , … , 2 start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT, l1l2l3l4subscript𝑙1subscript𝑙2subscript𝑙3subscript𝑙4l_{1}\neq l_{2}\neq l_{3}\neq l_{4}italic_l start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≠ italic_l start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≠ italic_l start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ≠ italic_l start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT, and k1,k2,k3,k4=1,2formulae-sequencesubscript𝑘1subscript𝑘2subscript𝑘3subscript𝑘412k_{1},k_{2},k_{3},k_{4}=1,2italic_k start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , italic_k start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT = 1 , 2.

The normalized layer calculates the ratio of the t𝑡titalic_t-th firing strength to the sum of all rules’ firing strengths using Equation (13):

OtNormalized=ωt¯=ωtωt,t=1,2,3,,2dformulae-sequencesuperscriptsubscript𝑂𝑡𝑁𝑜𝑟𝑚𝑎𝑙𝑖𝑧𝑒𝑑¯subscript𝜔𝑡subscript𝜔𝑡subscript𝜔𝑡𝑡123superscript2𝑑O_{\mathit{t}}^{Normalized}=\overline{\omega_{t}}=\frac{\omega_{t}}{\sum\omega% _{t}},\quad t=1,2,3,\dots,2^{d}italic_O start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N italic_o italic_r italic_m italic_a italic_l italic_i italic_z italic_e italic_d end_POSTSUPERSCRIPT = over¯ start_ARG italic_ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG = divide start_ARG italic_ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG start_ARG ∑ italic_ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG , italic_t = 1 , 2 , 3 , … , 2 start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT (13)

The consequent layer computes the result of the consequent clause of the IF-THEN rules following Equation (14). The parameters in this layer, known as consequent parameters, are adjusted during the training process using least squares estimation:

OtConsequent=ωt¯ft=ωt¯(CSDB)=ωt¯(p1SDB1+p2SDB2++pdSDBd+pd+1)superscriptsubscript𝑂𝑡𝐶𝑜𝑛𝑠𝑒𝑞𝑢𝑒𝑛𝑡¯subscript𝜔𝑡subscript𝑓𝑡¯subscript𝜔𝑡𝐶𝑆𝐷𝐵¯subscript𝜔𝑡subscript𝑝1𝑆𝐷subscript𝐵1subscript𝑝2𝑆𝐷subscript𝐵2subscript𝑝𝑑𝑆𝐷subscript𝐵𝑑subscript𝑝𝑑1O_{\mathit{t}}^{Consequent}=\overline{\omega_{t}}f_{t}=\overline{\omega_{t}}(% \overrightarrow{C}\cdot\overrightarrow{SDB})=\overline{\omega_{t}}(p_{1}SDB_{1% }+p_{2}SDB_{2}+\dots+p_{d}SDB_{d}+p_{d+1})italic_O start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_C italic_o italic_n italic_s italic_e italic_q italic_u italic_e italic_n italic_t end_POSTSUPERSCRIPT = over¯ start_ARG italic_ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = over¯ start_ARG italic_ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG ( over→ start_ARG italic_C end_ARG ⋅ over→ start_ARG italic_S italic_D italic_B end_ARG ) = over¯ start_ARG italic_ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG ( italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_S italic_D italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT + italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT italic_S italic_D italic_B start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT + ⋯ + italic_p start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT italic_S italic_D italic_B start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT + italic_p start_POSTSUBSCRIPT italic_d + 1 end_POSTSUBSCRIPT ) (14)

where SDB=[SDB1,SDB2,,SDBd,1]𝑆𝐷𝐵𝑆𝐷subscript𝐵1𝑆𝐷subscript𝐵2𝑆𝐷subscript𝐵𝑑1\overrightarrow{SDB}=[SDB_{1},SDB_{2},\dots,SDB_{d},1]over→ start_ARG italic_S italic_D italic_B end_ARG = [ italic_S italic_D italic_B start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_S italic_D italic_B start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_S italic_D italic_B start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT , 1 ] and C=[p1,p2,,pd,pd+1]𝐶subscript𝑝1subscript𝑝2subscript𝑝𝑑subscript𝑝𝑑1\overrightarrow{C}=[p_{1},p_{2},\dots,p_{d},p_{d+1}]over→ start_ARG italic_C end_ARG = [ italic_p start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_p start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT , italic_p start_POSTSUBSCRIPT italic_d + 1 end_POSTSUBSCRIPT ].

The summation layer calculates the overall output by summing all the incoming signals from the previous layer, as shown in Equation (15):

O𝑠𝑢𝑚𝑚𝑎𝑡𝑖𝑜𝑛=OtConsequent=ωt¯ftsubscript𝑂𝑠𝑢𝑚𝑚𝑎𝑡𝑖𝑜𝑛superscriptsubscript𝑂𝑡𝐶𝑜𝑛𝑠𝑒𝑞𝑢𝑒𝑛𝑡¯subscript𝜔𝑡subscript𝑓𝑡O_{\mathit{summation}}=\sum O_{t}^{Consequent}=\sum\overline{\omega_{t}}f_{t}italic_O start_POSTSUBSCRIPT italic_summation end_POSTSUBSCRIPT = ∑ italic_O start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_C italic_o italic_n italic_s italic_e italic_q italic_u italic_e italic_n italic_t end_POSTSUPERSCRIPT = ∑ over¯ start_ARG italic_ω start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_ARG italic_f start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT (15)

2.3 Implementation of the proposed hybrid learning framework

We outline the key implementation steps of our hybrid framework. First, we obtain the nearest neighboring spatial covariates by constructing neighboring graph for each observation point. We then use these covariates to extract the latent spatial dependency basis as the main input features for our rule-assisted adaptive networks. Next, we model and train the spatial dependency function using the rule-based ANFIS. Finally, we approximate the nonlinear spatial function, which allows us to perform spatial interpolation of a specific attribute field. Table 1 lists the corresponding pseudo code.

Table 1: Pseudo code of hybrid data-driven and rule-assisted learning procedure
Algorithm: Hybrid data-driven and rule-assisted learning procedure
Input: Observation record 𝑂𝑏𝑠=[𝑂𝑏𝑠1,𝑂𝑏𝑠2,,𝑂𝑏𝑠i,,𝑂𝑏𝑠N]𝑂𝑏𝑠subscript𝑂𝑏𝑠1subscript𝑂𝑏𝑠2subscript𝑂𝑏𝑠𝑖subscript𝑂𝑏𝑠𝑁\mathit{Obs}=\left[\mathit{Obs}_{1},\mathit{Obs}_{2},\dots,\mathit{Obs}_{i},% \dots,\mathit{Obs}_{N}\right]italic_Obs = [ italic_Obs start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_Obs start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , … , italic_Obs start_POSTSUBSCRIPT italic_N end_POSTSUBSCRIPT ]
Output: Premise parameters P𝑃\vec{P}over→ start_ARG italic_P end_ARG and consequent parameters C𝐶\vec{C}over→ start_ARG italic_C end_ARG of fuzzy IF-THEN rules for establishing the approximated spatial dependency function
Step 1: Obtain nearest neighboring Spatial Covariates (SC) for each observation point
Step 1.1: Construct neighboring graph for each observation point
Function: NeighboringGraph(Obs,m𝑂𝑏𝑠𝑚Obs,mitalic_O italic_b italic_s , italic_m)
For each observation 𝑂𝑏𝑠iObssubscript𝑂𝑏𝑠𝑖𝑂𝑏𝑠\mathit{Obs}_{i}\in Obsitalic_Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_O italic_b italic_s:
 # Return m𝑚\mathit{m}italic_m nearest neighbors of 𝑂𝑏𝑠isubscript𝑂𝑏𝑠𝑖\mathit{Obs}_{i}italic_Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT in all observations
 Neighbors \leftarrow NN(𝑂𝑏𝑠i,Obs,msubscript𝑂𝑏𝑠𝑖𝑂𝑏𝑠𝑚\mathit{Obs}_{i},Obs,mitalic_Obs start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_O italic_b italic_s , italic_m)  # nearest neighbors algorithm
 NeighborsMatrix.append(Neighbors)
Return NeighborsMatrix
Step 1.2: Build SC from NeighborsMatrix
Step 2: Extract SDB from SC using UMAP
 # the extraction is performed by minimizing cross-entropy loss
Step 3: Model and train the spatial dependency function using rule-based ANFIS
Function: ANFIS(SDB,Φ𝑆𝐷𝐵ΦSDB,\Phiitalic_S italic_D italic_B , roman_Φ)
Step 3.1: Update premise parameters by the gradient descent method
PPη(error)P𝑃𝑃𝜂error𝑃\vec{P}\leftarrow\vec{P}-\eta\frac{\partial(\text{error})}{\partial\vec{P}}over→ start_ARG italic_P end_ARG ← over→ start_ARG italic_P end_ARG - italic_η divide start_ARG ∂ ( error ) end_ARG start_ARG ∂ over→ start_ARG italic_P end_ARG end_ARG  # adjust premise parameters
Step 3.2: Update consequent parameters by least squares estimation
C=(SDBTSDB)1SDBTΦ𝐶superscript𝑆𝐷superscript𝐵𝑇𝑆𝐷𝐵1𝑆𝐷superscript𝐵𝑇Φ\vec{C}=(SDB^{T}SDB)^{-1}SDB^{T}\Phiover→ start_ARG italic_C end_ARG = ( italic_S italic_D italic_B start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_S italic_D italic_B ) start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_S italic_D italic_B start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT roman_Φ  # using Kalman filtering algorithm to calculate C𝐶\vec{C}over→ start_ARG italic_C end_ARG
Return P𝑃\vec{P}over→ start_ARG italic_P end_ARG and C𝐶\vec{C}over→ start_ARG italic_C end_ARG

3 Applications

To evaluate the general applicability and practical performance of the proposed framework, we apply it to two different cases involving the estimation of spatially dependent properties using interpolation of scattered observation data. The first case addresses a classic challenge in subsurface formation characterization [1], where the goal is to estimate the distribution of formation properties, such as rock porosity and/or permeability, based on very scattered observations, and produce a continuous property distribution map for subsurface resource exploitation. This is a particularly relevant use case as only limited observation data are available spatially in general [14, 15]. The second case focuses on estimating the distribution of air pollutants by interpolating data collected from scattered observation stations.

3.1 Spatial interpolation for oil reservoir porosity mapping

We use an oil reservoir example as a representing case. We aim to generate the formation porosity distribution using limited porosity measurements at scattered spatial observation locations. The benchmark model is provided in Figure 4. This model contains 50-by-50-by-1 grid blocks.

Refer to caption
Figure 4: Oil reservoir model and its porosity distribution

We irregularly sample the porosity field, as shown in Figure 4 to mimic the scattered observation locations with porosity measurement data, see Figure 5. We then randomly choose 100 observation locations as the input for the proposed hybrid learning framework. By modeling a proper spatial dependency function, we obtain the porosity estimation for unsampled locations in this model with 2500 grid blocks.

Refer to caption
Figure 5: Scattered porosity data at randomly chosen observation locations

As shown in Figure 6, the estimated porosity distribution map closely recaptures the original spatial pattern in the benchmark model. The high porosity channel and the low porosity channel along the lower-left diagonal are apparently reconstructed. Additionally, several dispersed regions with significant porosity variations are also qualitatively estimated, such as the high porosity region in the upper-right corner and the low porosity zone in the lower-left corner.

Refer to caption
Figure 6: Porosity distribution constructed using our proposed hybrid learning framework

We further compare the estimated porosity values with the actual benchmark values using line plots as shown in Figure 7. The red dotted line represents the estimated porosity for each grid block, while the black solid line shows the actual porosity. Clearly, our estimation aligns well with the true porosity data; however, upon closer inspection of the curves, some minor discrepancies between the actual and predicted porosity values appear. To address these differences, we perform a quantitative analysis of the estimation performance using some evaluation metrics.

The evaluation metrics include Mean Square Error (MSE), Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and coefficient of determination (R2superscript𝑅2R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT), as given in the following Equations.

MSE =1ni=1n(yiy^i)2absent1𝑛superscriptsubscript𝑖1𝑛superscriptsubscript𝑦𝑖subscript^𝑦𝑖2\displaystyle=\frac{1}{n}\sum_{i=1}^{n}(y_{i}-\hat{y}_{i})^{2}= divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT (16)
RMSE =1ni=1n(yiy^i)2absent1𝑛superscriptsubscript𝑖1𝑛superscriptsubscript𝑦𝑖subscript^𝑦𝑖2\displaystyle=\sqrt{\frac{1}{n}\sum_{i=1}^{n}(y_{i}-\hat{y}_{i})^{2}}= square-root start_ARG divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG (17)
MAE =1ni=1n|yiy^i|absent1𝑛superscriptsubscript𝑖1𝑛subscript𝑦𝑖subscript^𝑦𝑖\displaystyle=\frac{1}{n}\sum_{i=1}^{n}|y_{i}-\hat{y}_{i}|= divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | (18)
MAPE =1ni=1n|yiy^i|max(yi)absent1𝑛superscriptsubscript𝑖1𝑛subscript𝑦𝑖subscript^𝑦𝑖subscript𝑦𝑖\displaystyle=\frac{1}{n}\sum_{i=1}^{n}\frac{|y_{i}-\hat{y}_{i}|}{\max(y_{i})}= divide start_ARG 1 end_ARG start_ARG italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT divide start_ARG | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT | end_ARG start_ARG roman_max ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) end_ARG (19)
R2superscript𝑅2\displaystyle R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT =1i=1n(yiy^i)2i=1n(yiy¯)2absent1superscriptsubscript𝑖1𝑛superscriptsubscript𝑦𝑖subscript^𝑦𝑖2superscriptsubscript𝑖1𝑛superscriptsubscript𝑦𝑖¯𝑦2\displaystyle=1-\frac{\sum_{i=1}^{n}(y_{i}-\hat{y}_{i})^{2}}{\sum_{i=1}^{n}(y_% {i}-\bar{y})^{2}}= 1 - divide start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over¯ start_ARG italic_y end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG (20)

Where y^isubscript^𝑦𝑖\hat{y}_{i}over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and yisubscript𝑦𝑖y_{i}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT are the estimated and actual porosity of the i𝑖iitalic_i-th grid, respectively. i=1,2,3,,n𝑖123𝑛i=1,2,3,\dots,nitalic_i = 1 , 2 , 3 , … , italic_n, and n𝑛nitalic_n is the total number of grid blocks in the model. The symbol |||\cdot|| ⋅ | represents the absolute value, and y¯¯𝑦\bar{y}over¯ start_ARG italic_y end_ARG is the mean of actual porosities.

Refer to caption
Figure 7: comparative curve between actual value and estimated predictions for each grid

The calculated metrics are given in Table 2. The errors are very low in terms of MSE, RMSE, MAE, and MAPE. The coefficient of determination (R2superscript𝑅2R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT) is high at 0.9283. Furthermore, we demonstrate the fitted straight line between estimated and actual porosities for a better illustration of R2superscript𝑅2R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT in Figure 8. The regression line y=x𝑦𝑥y=xitalic_y = italic_x indicates that the estimation closely matches the actual porosity, underscoring the reliability of our proposed spatial interpolation method.

Table 2: The evaluation metrics for our proposed method
MSE RMSE MAE MAPE R2superscript𝑅2R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
0.00011236 0.0106 0.0086 0.0391 0.9283
Refer to caption
Figure 8: Interpolation performance of each grid using scatter plot along with regression straight line

3.2 Spatial interpolation for ozone value mapping

The second case is to estimate ozone value distribution based on sparse measurement data at scattered air quality monitoring stations. The ozone data used in this study is publicly available from the U.S. Environmental Protection Agency (EPA) Air Quality System (https://aqs.epa.gov/aqsweb/airdata/). Those ozone values are based on the U.S. national ambient air quality standards, specifically in the form of the “annual fourth-highest daily maximum 8-hour concentration”. In our application, we select the observed ozone data recorded at the same time across various monitoring stations as our test dataset, as illustrated in Figure 9. We then conduct spatial interpolation across the entire continental United States to estimate ozone levels in areas without direct observations. This allows for an assessment of air quality, even in regions with limited monitoring infrastructures.

The map generated using our proposed method captures the higher ozone areas in the west and lower areas in the east, as clearly and intuitively visualized in Figure 10. The reconstructed ozone distribution demonstrates the capabilities in capturing spatial variations, effectively distinguishing between highly contaminated regions and less affected areas. This is of practical use for representing the whole air quality situation in sparsely monitored regions, where fewer monitoring stations are available. The ability to reflect these variations in regions with limited data highlights the robustness and practical utility of our spatial interpolation method for air quality monitoring and assessment.

Refer to caption
Figure 9: Ozone value at scattered monitoring station in continental United States
Refer to caption
Figure 10: Ozone value distribution interpolated using our hybrid framework

In addition to visualizing the spatial distribution patterns of ozone levels, we also provide quantitative results using the evaluation metrics mentioned earlier. Unlike the first case, where actual values at unknown locations are available for comparison, the actual ozone values at non-monitoring stations in this scenario are not accessible. Therefore, we cannot directly compare the estimated values with actual measurements at those locations. We employ the 10-fold cross-validation to conduct the comparison. We divide the observed data into a training dataset and a validation dataset to assess the model’s performance. We then calculate the average values for MSE, RMSE, MAE, MAPE, and R2superscript𝑅2R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT across the 10 validation datasets. The results of these evaluations are listed in Table 3.

Table 3: The evaluation metrics of 10-fold validation
datasets using our proposed method
MSE RMSE MAE MAPE R2superscript𝑅2R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT
0.00002601 0.0051 0.0037 0.0792 0.7626

4 Comparison and Discussion

We compare our hybrid framework with several established interpolation methods, including Inverse Distance Weight (IDW) [17], ordinary kriging [18], and gaussian process [62]. These three comparative methods are widely used for spatial interpolation.

4.1 Comparison in terms of evaluation metrics

Table 4 and Table 5 provide the respective comparative results. Our proposed method outperforms in terms of MSE, RMSE, MAE, MAPE, and R2superscript𝑅2R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT for both cases.

Table 4: Comparative evaluation metrics of different interpolation methods for reservoir porosity mapping
Metrics Ordinary Kriging IDW Gaussian Process Our Method
MSE 0.00046976 0.00113792 0.00030674 0.00011236
RMSE 0.02167388 0.03373304 0.03373304 0.0106
MAE 0.01617806 0.02701021 0.01324166 0.0086
MAPE 0.15395796 0.2807828 0.13766530 0.0391
R2superscript𝑅2R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT 0.71519912 0.31011110 0.81403229 0.9283
Table 5: Comparative evaluation metrics of different interpolation methods for ozone value mapping
Metrics Ordinary Kriging IDW Gaussian Process Our Method
MSE 0.00003324 0.00002602 0.00002665 0.00002601
RMSE 0.00576572 0.00510121 0.00516196 0.0051
MAE 0.00447331 0.00411962 0.00411051 0.0037
MAPE 0.13360340 0.12824334 0.12676602 0.0792
R2superscript𝑅2R^{2}italic_R start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT 0.29874051 0.45106761 0.43791603 0.7626

4.2 Comparison in terms of spatial feature reconstruction

In addition to quantitative analysis in terms of error and accuracy, we also compare the spatial distribution patterns from both global and local perspectives. The comparative spatial distribution maps of porosity are shown in Figure 11. It is evident that all three traditional methods can reconstruct the global trends and features. Global features here mean the prominent spatial distribution patterns that are easily recognizable across the entire map. These global patterns include the high porosity channel or region in the lower-left diagonal and upper-right direction, as well as the low porosity channel along the diagonal.

However, when it comes to capturing spatially local features, all three traditional methods fall short. Local features refer to the finer details and variations within smaller regions that contribute to the overall spatial heterogeneity. The inability of these methods to accurately reconstruct these local features highlights a limitation in their capacity to fully represent complex spatial variability. This comparison underscores the importance of methods that can address both global and local spatial patterns to provide a more comprehensive and detailed understanding of the spatial distribution. As demonstrated in Figure 11(d), the circled regions highlight the local features that reflect variations within the porosity field. Compared to other commonly used techniques, our proposed method captures and represents these local features more effectively, providing a more detailed and nuanced depiction of the spatial variability. This enhanced ability to preserve local variations underscores the strength of our approach in handling complex spatial patterns.

Refer to caption
Figure 11: Comparative results of porosity interpolation generated from different methods

Similarly, the comparative spatial distribution maps of ozone values are shown in Figure 12. All methods can predict global trends to some extent. For instance, the highly polluted region in the southwest and the less polluted region in the east are consistently identified across all methods. However, when it comes to reconstructing local features, our proposed method outperforms the other three interpolation techniques, as indicated by the dashed circles.

It should be noted that the three commonly used interpolation techniques tend to produce overly smooth interpolations. While they may capture general trends, it often comes at the cost of losing important local variations within the spatial field. Excessively smooth interpolation can negatively impact the accuracy of spatial predictions by overlooking these critical local features or variations in the spatial dependency field. Our proposed method, by contrast, strikes a better balance between capturing global trends and preserving local details, leading to more accurate and nuanced spatial interpolations.

Refer to caption
Figure 12: Comparative results of ozone interpolation generated from different methods

4.3 Sensitive analysis with respect to the number of nearest neighbors

Although our proposed method offers advantages over traditional techniques in terms of accuracy and the preservation of local features in both cases, its performance in interpolating ozone is slightly less effective compared to its success in reconstructing the porosity field. The primary difference between these two cases is the density of observed data within the interpolation space. The denser data in the porosity field allows for more precise predictions, whereas the sparser data in the ozone case makes it more challenging to capture local variations accurately.

Compared to the first porosity field case, the observation density in the ozone case is less uniform than in our randomly selected porosity data. This nonuniform observation density tends to be more sensitive to the number of nearest neighbors m when constructing nearest neighboring spatial covariates in our proposed method. In other words, the choice of the number of nearest neighbors has certain impact on the spatial distribution patterns and the estimation accuracy of the reconstructed attribute fields. This sensitivity underscores the importance of carefully selecting the number of nearest neighbors to ensure high quality spatial interpolation, particularly in cases with uneven observation densities.

To further illustrate this point, we focus on the ozone case to discuss the sensitivity of the number of nearest neighbors m. We compare and analyze different hyperparameter settings, as demonstrated in Figure 13. This comparison highlights how varying the number of nearest neighbors affects the spatial interpolation results, allowing us to better understand the impact of this parameter on the accuracy and reliability of our estimated predictions.

Refer to caption
Figure 13: Reconstructed spatial distribution patterns with different number of nearest neighbors

Clearly, different values of m have significant impact on the interpolated spatial distribution and its corresponding accuracy. We choose to use 10 nearest neighbors to achieve the desired accuracy. However, it is important to note that the optimal value of m is highly dependent on the specific observed sampling data. Considerable time and effort are required to carefully tune the hyperparameter m to achieve higher interpolation quality. This tuning process is crucial for adapting the model to different datasets and ensuring reliable interpolation results.

Additionally, a larger value of m tends to preserve global spatial patterns, while a smaller value of m tends to emphasize local features. For instance, when m is set to 30, the estimated high-ozone value region in the middle appears more expansive. Conversely, when m is reduced to 5, the results reveal some sparse low-ozone value areas within the high-ozone value region. This occurs because a larger m means that a broader neighborhood is considered when estimating unknown locations, leading to results that are more influenced by higher attribute values within that neighborhood. On the other hand, a smaller m often overlooks the general trend due to its limited neighborhood size, resulting in a focus on local variations. This, however, makes the interpolations more susceptible to outliers or extreme values.

Therefore, neither a large nor a small value of m is ideal for constructing nearest neighboring spatial covariates. Careful tuning of the hyperparameter m is essential when implementing our hybrid data-driven and rule-assisted learning framework for spatial interpolation. Finding the right balance in m allows for a better representation of both global and local patterns, ensuring reliable and robust estimations.

5 Conclusions

In this study, we develop a hybrid data-driven and rule-assisted learning framework that combines spatial feature extraction with domain knowledge integration via fuzzy rule sets to enhance the interpolation of spatially dependent properties. By decomposing nearest neighbor spatial covariates into multiple spatial dependency bases and utilizing fuzzy IF-THEN rules within an adaptive network framework, our method accommodates imprecise information, considers data uncertainties, and improves spatial interpolation accuracy. Validated through applications in subsurface formation characterization and air quality assessment, our approach surpasses traditional techniques like ordinary Kriging, inverse distance weighting, and Gaussian processes by achieving lower error metrics and better capturing local spatial features. However, the method’s performance is sensitive to the number of nearest neighbors used, which influences the balance between global trends and local variations. While our approach is parametric and requires careful tuning, future research could explore non-parametric methods that dynamically adjust the number of nearest neighbors, offering a promising avenue for further improving spatial interpolation techniques.

References

  • [1] Jeffrey M Yarus and Richard L Chambers. Practical geostatistics-an armchair overview for petroleum reservoir engineers. Journal of Petroleum Technology, 58(11):78–86, 2006.
  • [2] Mobarakeh Mohammadpour, Hamid Roshan, Mehrdad Arashpour, and Hossein Masoumi. Machine learning assisted kriging to capture spatial variability in petrophysical property modelling. Marine and Petroleum Geology, 167:106967, 2024.
  • [3] Min Wang, Siu Wun Cheung, Eric T Chung, Maria Vasilyeva, and Yuhe Wang. Generalized multiscale multicontinuum model for fractured vuggy carbonate reservoirs. Journal of Computational and Applied Mathematics, 366:112370, 2020.
  • [4] Bicheng Yan, Lidong Mi, Zhi Chai, Yuhe Wang, and John E Killough. An enhanced discrete fracture network model for multiphase flow in fractured reservoirs. Journal of Petroleum Science and Engineering, 161:667–682, 2018.
  • [5] Lidong Mi, Bicheng Yan, Hanqiao Jiang, Cheng An, Yuhe Wang, and John Killough. An enhanced discrete fracture network model to simulate complex fracture distribution. Journal of Petroleum Science and Engineering, 156:484–496, 2017.
  • [6] Jin Li and Andrew D Heap. A review of spatial interpolation methods for environmental scientists. 2008.
  • [7] Hüseyin Yavuz and Saffet Erdoğan. Spatial analysis of monthly and annual precipitation trends in turkey. Water resources management, 26:609–621, 2012.
  • [8] Benedict Shamo, Eric Asa, and Joseph Membah. Linear spatial interpolation and analysis of annual average daily traffic data. Journal of Computing in Civil Engineering, 29(1):04014022, 2015.
  • [9] Michael Lowry. Spatial interpolation of traffic counts based on origin–destination centrality. Journal of Transport Geography, 36:98–105, 2014.
  • [10] Yan Hong, Henry A Nix, Mike F Hutchinson, and Trevor H Booth. Spatial interpolation of monthly mean climate data for china. International Journal of Climatology: A Journal of the Royal Meteorological Society, 25(10):1369–1379, 2005.
  • [11] Wei Li, Shengyu Kang, Yueqiang Sun, Weihua Bai, Yuhe Wang, and Hongqing Song. A machine learning approach for air-quality forecast by integrating gnss radio occultation observation and weather modeling. Atmosphere, 14(1):58, 2022.
  • [12] F Zhang, M An, B Yan, Y Wang, and Y Han. A novel hydro-mechanical coupled analysis for the fractured vuggy carbonate reservoirs. Computers and Geotechnics, 106:68–82, 2019.
  • [13] Janka Lengyel, Seraphim Alvanides, and Jan Friedrich. Modelling the interdependence of spatial scales in urban systems. Environment and Planning B: Urban Analytics and City Science, 50(1):182–197, 2023.
  • [14] Jerry Jensen. Statistics for petroleum engineers and geoscientists, volume 2. Gulf Professional Publishing, 2000.
  • [15] Pedram Masoudi. Analysing spatialized data: applications of geo-datasciences in geophysics and petroleum industry. PhD thesis, Université Paris-Saclay, 2024.
  • [16] Jin Li and Andrew D Heap. Spatial interpolation methods applied in the environmental sciences: A review. Environmental Modelling & Software, 53:173–189, 2014.
  • [17] Donald Shepard. A two-dimensional interpolation function for irregularly-spaced data. In Proceedings of the 1968 23rd ACM national conference, pages 517–524, 1968.
  • [18] Georges Matheron. Principles of geostatistics. Economic geology, 58(8):1246–1266, 1963.
  • [19] Wanfang Chen, Yuxiao Li, Brian J Reich, and Ying Sun. Deepkriging: Spatially dependent deep neural networks for spatial prediction. arXiv preprint arXiv:2007.11972, 2020.
  • [20] Haoyu Wang, Yawen Guan, and Brain Reich. Nearest-neighbor neural networks for geostatistics. In 2019 international conference on data mining workshops (ICDMW), pages 196–205. IEEE, 2019.
  • [21] Chao Shi and Yu Wang. Non-parametric machine learning methods for interpolation of spatially varying non-stationary and non-gaussian geotechnical properties. Geoscience Frontiers, 12(1):339–350, 2021.
  • [22] Di Zhu, Ximeng Cheng, Fan Zhang, Xin Yao, Yong Gao, and Yu Liu. Spatial interpolation using conditional generative adversarial neural networks. International Journal of Geographical Information Science, 34(4):735–758, 2020.
  • [23] Christopher Kadow, David Matthew Hall, and Uwe Ulbrich. Artificial intelligence reconstructs missing climate information. Nature Geoscience, 13(6):408–413, 2020.
  • [24] Shuyi Du, Ruifei Wang, Chenji Wei, Yuhe Wang, Yuanchun Zhou, Jiulong Wang, and Hongqing Song. The connectivity evaluation among wells in reservoir utilizing machine learning methods. IEEE access, 8:47209–47219, 2020.
  • [25] Shuyi Du, Jingyan Zhang, Ming Yue, Chiyu Xie, Yuhe Wang, and Hongqing Song. A novel sequential-based hybrid approach incorporating physical modeling and deep learning for multiphase subsurface flow simulation. Gas Science and Engineering, 118:205093, 2023.
  • [26] Qitao Zhang, Chenji Wei, Yuhe Wang, Shuyi Du, Yuanchun Zhou, and Hongqing Song. Potential for prediction of water saturation distribution in reservoirs utilizing machine learning methods. Energies, 12(19):3597, 2019.
  • [27] Aleksandar Sekulić, Milan Kilibarda, Dragutin Protić, and Branislav Bajat. A high-resolution daily gridded meteorological dataset for serbia made by random forest spatial interpolation. Scientific Data, 8(1):123, 2021.
  • [28] Hosang Han and Jangwon Suh. Spatial prediction of soil contaminants using a hybrid random forest–ordinary kriging model. Applied Sciences, 14(4):1666, 2024.
  • [29] Margot Geerts, Seppe vanden Broucke, and Jochen De Weerdt. Georf: a geospatial random forest. Data Mining and Knowledge Discovery, pages 1–35, 2024.
  • [30] MM Korjani, AS Popa, E Grijalva, S Cassidy, and I Ershaghi. Reservoir characterization using fuzzy kriging and deep learning neural networks. In SPE Annual Technical Conference and Exhibition?, page D031S038R001. SPE, 2016.
  • [31] Marijan Šapina. A comparison of artificial neural networks and ordinary kriging depth maps of the lower and upper pannonian stage border in the bjelovar subdepression, northern croatia. Rudarsko-geološko-naftni zbornik, 31(3):75–86, 2016.
  • [32] Sunayana, Komal Kalawapudi, Ojaswikrishna Dube, and Renuka Sharma. Use of neural networks and spatial interpolation to predict groundwater quality. Environment, Development and Sustainability, 22(4):2801–2816, 2020.
  • [33] Yurou Liang, Ping Duan, Jiajia Liu, Mingguo Wang, and Jie Zhang. Study on the space field reconstruction method of the radial basis function of electromagnetic radiation under optimal parameters. Electromagnetic Biology and Medicine, 43(1-2):19–30, 2024.
  • [34] Ryota Otake, Jun Kurima, Hiroyuki Goto, and Sumio Sawada. Deep learning model for spatial interpolation of real-time seismic intensity. Seismological Society of America, 91(6):3433–3443, 2020.
  • [35] Qian Yu, Hong-wu Yuan, Zhao-long Liu, and Guo-ming Xu. Spatial weighting emd-lstm based approach for short-term pm2. 5 prediction research. Atmospheric Pollution Research, 15(10):102256, 2024.
  • [36] Riku Hashimoto and Katsuya Suto. Sicnn: Spatial interpolation with convolutional neural networks for radio environment mapping. In 2020 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), pages 167–170. IEEE, 2020.
  • [37] Katsuya Suto, Shinsuke Bannai, Koya Sato, Kei Inage, Koichi Adachi, and Takeo Fujii. Image-driven spatial interpolation with deep learning for radio map construction. IEEE Wireless Communications Letters, 10(6):1222–1226, 2021.
  • [38] Hengnian Yan, Qiang Zheng, and Lingzao Zeng. Conditional generative adversarial networks for groundwater contamination characterization and source identification. Journal of Hydrology, 632:130900, 2024.
  • [39] Herbert Rakotonirina, Ignacio Guridi, Paul Honeine, Olivier Atteia, and Antonin Van Exem. Spatial interpolation and conditional map generation using deep image prior for environmental applications. Mathematical Geosciences, pages 1–26, 2024.
  • [40] Alvaro Egana, Felipe Navarro, Mohammad Maleki, Francisca Grandon, Francisco Carter, and Fabian Soto. Ensemble spatial interpolation: A new approach to natural or anthropogenic variable assessment. Natural Resources Research, 30:3777–3793, 2021.
  • [41] Georgia Papacharalampous, Hristos Tyralis, Nikolaos Doulamis, and Anastasios Doulamis. Uncertainty estimation in spatial interpolation of satellite precipitation with ensemble learning. arXiv preprint arXiv:2403.10567, 2024.
  • [42] Aleksandar Sekulić, Milan Kilibarda, Gerard BM Heuvelink, Mladen Nikolić, and Branislav Bajat. Random forest spatial interpolation. Remote Sensing, 12(10):1687, 2020.
  • [43] Glen T Nwaila, Steven E Zhang, Julie E Bourdeau, Hartwig E Frimmel, and Yousef Ghorbani. Spatial interpolation using machine learning: from patterns and regularities to block models. Natural Resources Research, 33(1):129–161, 2024.
  • [44] Petre Stoica, Prabhu Babu, and Jian Li. New method of sparse parameter estimation in separable models and its use for spectral analysis of irregularly sampled data. IEEE Transactions on Signal Processing, 59(1):35–47, 2010.
  • [45] Florian Beiser, Håvard Heitlo Holm, and Jo Eidsvik. Comparison of ensemble-based data assimilation methods for sparse oceanographic data. Quarterly Journal of the Royal Meteorological Society, 150(759):1068–1095, 2024.
  • [46] Grigorios Tsagkatakis, Anastasia Aidini, Konstantina Fotiadou, Michalis Giannopoulos, Anastasia Pentari, and Panagiotis Tsakalides. Survey of deep-learning approaches for remote sensing observation enhancement. Sensors, 19(18):3929, 2019.
  • [47] Yuankai Wu, Dingyi Zhuang, Aurelie Labbe, and Lijun Sun. Inductive graph neural networks for spatiotemporal kriging. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 35, pages 4478–4485, 2021.
  • [48] Jun Ma, Yuexiong Ding, Jack CP Cheng, Feifeng Jiang, and Zhiwei Wan. A temporal-spatial interpolation and extrapolation method based on geographic long short-term memory neural network for pm2. 5. Journal of Cleaner Production, 237:117729, 2019.
  • [49] J-SR Jang. Anfis: adaptive-network-based fuzzy inference system. IEEE transactions on systems, man, and cybernetics, 23(3):665–685, 1993.
  • [50] Ebrahim H Mamdani. Application of fuzzy algorithms for control of simple dynamic plant. In Proceedings of the institution of electrical engineers, volume 121, pages 1585–1588. IET, 1974.
  • [51] Ebrahim H Mamdani and Sedrak Assilian. An experiment in linguistic synthesis with a fuzzy logic controller. International journal of man-machine studies, 7(1):1–13, 1975.
  • [52] Qiangqiang Mao, Xiaohua Ma, and Yuhe Wang. A decision support engine for infill drilling attractiveness evaluation using rule-based cognitive computing under expert uncertainties. Journal of Petroleum Science and Engineering, 208:109671, 2022.
  • [53] Maria IS Guerra, Fábio MU de Araújo, João T de Carvalho Neto, and Romênia G Vieira. Survey on adaptative neural fuzzy inference system (anfis) architecture applied to photovoltaic systems. Energy Systems, 15(2):505–541, 2024.
  • [54] Waldo R Tobler. A computer movie simulating urban growth in the detroit region. Economic geography, 46(sup1):234–240, 1970.
  • [55] Cafer Mert Yeşilkanat, Yaşar Kobya, Halim Taşkın, and Uğur Çevik. Spatial interpolation and radiological mapping of ambient gamma dose rate by using artificial neural networks and fuzzy logic methods. Journal of environmental radioactivity, 175:78–93, 2017.
  • [56] Bardia Bayat, Mohsen Nasseri, and Eric Delmelle. Uncertainty-based rainfall network design using a fuzzy spatial interpolation method. Applied Soft Computing, 106:107296, 2021.
  • [57] Evdokia Tapoglou, George P Karatzas, Ioannis C Trichakis, and Emmanouil A Varouchakis. A spatio-temporal hybrid neural network-kriging model for groundwater level simulation. Journal of hydrology, 519:3193–3203, 2014.
  • [58] Xiaoxi Zhao, Andrei S Popa, Iraj Ershaghi, Fred Aminzadeh, Yuanjun Li, and Steve D Cassidy. Reservoir geostatistical estimates of imprecise information using fuzzy-kriging approach. SPE Reservoir Evaluation & Engineering, 23(01):001–012, 2020.
  • [59] Federico Amato, Fabian Guignard, Sylvain Robert, and Mikhail Kanevski. A novel framework for spatio-temporal prediction of environmental data using deep learning. Scientific reports, 10(1):22243, 2020.
  • [60] Leland McInnes, John Healy, and James Melville. Umap: Uniform manifold approximation and projection for dimension reduction. arXiv preprint arXiv:1802.03426, 2018.
  • [61] Tim Sainburg, Leland McInnes, and Timothy Q Gentner. Parametric umap embeddings for representation and semisupervised learning. Neural Computation, 33(11):2881–2907, 2021.
  • [62] Joaquin Quinonero-Candela and Carl Edward Rasmussen. A unifying view of sparse approximate gaussian process regression. The Journal of Machine Learning Research, 6:1939–1959, 2005.