1. Introduction
Impervious surfaces are land cover composed of structural materials that hinder the natural penetration of water into the soil including building roofs, asphalt pavements, and hardened grounds [
1]. Within the city, impervious surfaces are land cover other than vegetation and bare land. Relevant studies show that the continuous expansion of impervious surface areas has caused social, ecological, and environmental problems such as the heat island effect and traffic jams [
2]. At the same time, impervious surface information is also an essential factor and indicator to reflect urban distribution patterns, regional expansion intensity, suitable living environments, and urbanization levels [
3]. In addition, large, vegetated areas are converted into impervious areas resulting in a reduction in urban green space. Therefore, these are also the primary data for studying urban flood disasters, biodiversity loss, and other ecological issues [
4,
5]. At present, the rapid expansion of global cities has brought more comprehensive challenges to regional, national, and global development [
6]. Compared with other regions, Asia is in a period of rapid urban population accumulation and accelerated urbanization, and the impervious surfaces are expanding rapidly. Therefore, impervious surface mapping for the Asian region can provide more real-time and accurate basic data for studying urban expansion patterns, climate change, and sustainable development.
In the past two decades, remote sensing images of various spatial resolutions have been used to extract impervious surface information [
7]. For high spatial resolution images (≤10 m), object-oriented classification methods are usually used to extract the impervious surfaces [
8]. For example, Sebari et al. proposed a fuzzy-object-based analysis method for extracting impervious surface information from IKONOS [
9,
10]. These methods can comprehensively integrate geometric, textural, spectral, and contextual information, improving spatial resolution and classification accuracy. However, due to the high cost and complex processing, large-scale impervious surface extraction still has challenges. Benefiting from good data availability, researchers tend to use medium-resolution images (10–100 m) [
11,
12]. The spectral mixture analysis method was proposed for impervious surface inversion using Landsat [
13]. However, this method is limited by the choice of pure end members and the complexity of the model, which increases the difficulty of extracting large-scale impervious surface information. Therefore, more methods are used to extract impervious surface information directly by using medium spatial resolution images for classification [
14,
15,
16,
17]. For example, Liu et al. made a global impervious surface product based on random forests using Landsat-8 and Sentinel-1 [
18] images. Chen et al. proposed a pixel-object-knowledge-based method based on Landsat [
19]. However, classification-based methods usually require a classifier supported by training samples. In large-scale information extraction, there are also problems such as unbalanced sample selection and poor generalization ability of the classifier, resulting in large regional differences in classification accuracy and low overall accuracy. Moreover, medium and high spatial resolution sensors usually have a low temporal resolution, which results in the spatial–temporal discontinuity of remote sensing data, further limiting large-scale impervious surface information extraction [
20,
21,
22]. Therefore, it is necessary to comprehensively utilize multi-source and multi-temporal data based on medium and high-resolution images. This greatly increases the difficulty of data preprocessing and model building.
In contrast, coarse spatial resolution data (≥100 m) have been widely used for large-scale information extraction, including impervious surface percentage estimation, due to its higher temporal resolution and spatial coverage. Researchers have used simple and fast index features to distinguish impervious and pervious surface information [
23]. However, coarse spatial resolution image pixels are usually dominated by mixed pixels, and the simple threshold method cannot accurately reflect the actual situation. Therefore, based on the basic assumption of mixed pixels, there are more studies that tend to establish the relationship between the impervious surface percentage and remote sensing features through linear or nonlinear regression models [
24]. For instance, Guo et al. combined multi-temporal MODIS NDVI and VIIRS DNB data to estimate impervious surface percentage in China through linear regression [
25]. Lu et al. fused the time-series MODIS NDVI and DMSP/OLS data to establish a stepwise regression model to predict the impervious surface percentage [
26]. This method is simple to operate and has a great predictive effect. Effective feature variables and reasonable regression models are conducive to the rapid realization of large-scale impervious surface percentage inversion [
27]. With the release of nighttime light (NTL) remote sensing data, many works began to focus on impervious surface information extraction based on the nighttime light index [
28,
29]. Among these studies, NTL combined other data to build new indexes such as VANUI (Vegetation-Adjusted NTL Urban Index) and TVANUI (Temperature- and Vegetation-Adjusted NTL Urban Index) [
30,
31]. Generally, coarse spatial resolution images have better temporal and spatial continuity, providing a reliable data source in large regions. However, with the expansion of the mapping region, impervious surface extraction will be affected by regional land surface differences. In addition, due to regional climates, there are also obvious spatial differences in remote sensing information. For nighttime light data, some issues, such as data oversaturation and unbalanced regional development, directly affect their ability to describe the impervious surface information. Therefore, it remains a challenge to use coarse spatial resolution images to accurately estimate the large-scale impervious surface percentage [
32,
33].
Spatiotemporal heterogeneity often affects the accuracy of the extraction of large-scale impervious surface information. For temporal sequences, the same object shows obvious differences in images from different periods, and the impervious surface extraction is different in different seasons [
34,
35]. For example, in the northern summer or vegetation growing season, the spectral difference between the vegetation and impervious surface is large, and the extraction results are better than in other seasons. Therefore, applying multi-temporal data requires consideration of the effects of the temporal differences. In terms of spatial scale, small-scale regional extraction usually does not need to consider the spatial differences [
36]. However, for large-scale information extraction affected by differences in terrain and natural ecosystems, the spatial and spectral characteristics of ground objects in remote sensing images show significant differences, which seriously restricts the extraction accuracy. Therefore, researchers have considered a regional division strategy to address the spatial inconsistencies. Gong et al. divided the study area into geographical grids and extracted impervious surfaces for each grid to reduce the spatial differences [
37]. However, the grid size is usually set based on the researcher’s experience. In addition, the method needs to provide accurate and effective training data for each grid [
19,
38]. To balance the data volume and efficiency, Zhang et al. extracted the global impervious surface based on the local adaptive scheme of the grid and adjacent grid [
18]. However, the inevitable differences in surface cover form different distribution patterns of impervious surfaces, which affect the extraction accuracy such as areas with sparse vegetation and areas with abundant vegetation. Considering the regional landscape differences, Gong et al. used the global biome distribution to divide the world into arid and non-arid regions based on grids [
39]. Huang et al. divided the world into 1221 hexagons, used different DEM data to adapt to the elevation and slope features in different latitudes, and considered adding light features to suppress misclassification in arid regions [
40]. Sun et al. improved impervious surface extraction performance by using finer-scale global biome maps in arid and semi-arid regions [
41]. However, the differentiation between arid and non-arid regions does not express global landscape differences adequately [
42]. A richer regional division strategy is needed to adapt to the extraction of impervious surface information in different regions. To this end, GlobeCover2009 considered the ecological landscape and regional climate change factors and divided the world into 22 ecological regions [
43]. Furthermore, Schneider et al. proposed a global urban ecoregion stratification scheme combining global climate characteristics and thematic data [
44]. Through this scheme, the NUACI (Normalized Urban Areas Composite Index) obtained the threshold for each ecological region [
45]. Those previous studies demonstrated that the regional divisions effectively overcome the impact of spatial heterogeneity of climate and ecology on a large scale. However, most of the proposed division principles are based on regular shapes such as grids and hexagons, ignoring the irregularity of the real boundaries, which significantly increases the sample requirements and operating time based on numerous subregions. In addition, although the combination of landscape patterns and ecological elements improves the pertinence of regional divisions, this process does not consider the temporal and spatial differences of the data itself, which affects the stability of regional information extraction.
Above all, remote sensing data with good spatiotemporal continuity, such as MODIS and NTL, have obvious advantages in estimating the large-scale impervious surface percentage. However, in large-scale regions where land cover exhibits significant spatial differences and seasonally inconsistent dynamics, obtaining accurate and balanced estimates remains a challenge. To this end, this study is motivated by solving the spatiotemporal inconsistencies in the estimation of large-scale impervious surface percentages, and attempts to combine regional climate classification information with the time-series features of remote sensing images to construct a more reasonable regional division method to overcome the influence of regional differences. Based on ensuring the differences in the landscape, a regional division strategy was constructed to realize the rational optimization of the spatial distribution and reduce the regional redundancies. At the same time, an adaptive selection and feature construction model of regional remote sensing image time series is developed to achieve more stable impervious surface features for different regions. Finally, the large-scale impervious surface percentage estimation is realized based on the construction of different regional features and feature set combinations.