Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Next Article in Journal
Impact of Insect Foliar Herbivory on Soil N₂O Emission and Nitrogen Dynamics in Subtropical Tree Species
Previous Article in Journal
Shifts in the Soil Microbial Community and Enzyme Activity Under Picea crassifolia Plantations and Natural Forests
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Studying Forest Species Classification Methods by Combining PolSAR and Vegetation Spectral Indices

1
School of Geomatics, Liaoning Technical University, Fuxin 123000, China
2
Collaborative Innovation Institute of Geospatial Information Service, Liaoning Technical University, Fuxin 123000, China
3
Fuxin Ecological Group Digital Technology Research Institute Co., Ltd., Fuxin 123000, China
*
Author to whom correspondence should be addressed.
Forests 2025, 16(1), 15; https://doi.org/10.3390/f16010015
Submission received: 15 November 2024 / Revised: 20 December 2024 / Accepted: 23 December 2024 / Published: 25 December 2024
(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)

Abstract

:
Tree species are important factors affecting the carbon sequestration capacity of forests and maintaining the stability of ecosystems, but trees are widely distributed spatially and located in complex environments, and there is a lack of large-scale regional tree species classification models for remote sensing imagery. Therefore, many studies aim to solve this problem by combining multivariate remote sensing data and proposing a machine learning model for forest tree species classification. However, satellite-based laser systems find it difficult to meet the needs of regional forest species classification characters, due to their unique footprint sampling method, and SAR data limit the accuracy of species classification, due to the problem of information blending in backscatter coefficients. In this work, we combined Sentinel-1 and Sentinel-2 data to construct a machine learning tree classification model based on optical features, vegetation spectral features, and PolSAR polarization observation features, and propose a forest tree classification feature selection method featuring the Hilbert–Huang transform for the problem of mixed information on the surface of SAR data. The PSO-RF method was used to classify forest species, including four temperate broadleaf forests, namely, aspen (Populus L.), maple (Acer), peach tree (Prunus persica), and apricot tree (Prunus armeniaca L.), and two coniferous forests, namely, Chinese pine (Pinus tabuliformis Carrière) and Mongolian pine (Pinus sylvestris var. mongolica Litv.). In this study, some experiments were conducted using two Sentinel-1 images, four Sentinel-2 images, and 550 measured forest survey sample data points pertaining to the forested area of Fuxin District, Liaoning Province, China. The results show that the fusion model constructed in this study has high accuracy, with a Kappa coefficient of 0.94 and an overall classification accuracy of 95.1%. In addition, this study shows that PolSAR data can play an important role in forest tree species classification. In addition, by applying the Hilbert–Huang transform to PolSAR data, other feature information that interferes with the perceived vertical structure of forests can be suppressed to a certain extent, and its role in the classification of forest species, combined with PolSAR, should not be ignored.

1. Introduction

Tree species surveying is an important link in forest biomass estimation, which can provide data support for forest resource management, and at the same time, it is important for assessments of the ecological service value of vegetation [1,2,3]. The traditional forest species survey methods, represented by sample plot surveys, are difficult to apply to large-scale forest species survey tasks, because of difficulties related to data acquisition and spatial distribution, as well as their high costs [4,5]. With the development of remote sensing technology, remote sensing satellites have the advantages of a wide sensing range, continuous spatial distribution, and high temporal resolution, which facilitate tasks related to the large-scale investigation of tree species [4,5,6]. The spectral reflectance of multispectral images is widely used in forest tree species classification studies due to its ability to express the biochemical information of tree canopies [7,8,9]. However, multispectral imagery lacks the ability to perceive the vertical structure information of trees due to its insufficient transmittance, thus limiting its accuracy in tree species classification. Airborne LIDAR and satellite lasers detect the vertical structure of vegetation and understory topography by actively transmitting laser pulses, and are therefore able to capture differences in horizontal and vertical structural characteristics, such as canopy height and canopy width, between different tree species [10,11,12]. However, due to the small data coverage of the former and the footprint-type spot measurement of the latter, the remote sensing data are still spatially discontinuous, making it difficult to meet the needs of large-scale forest species survey missions over hundreds of square kilometers, or even larger areas. The Synthetic Aperture Radar (SAR) technique has become an effective remote sensing technique for obtaining the biophysical parameters of forests at regional and global scales, due to its imaging capability under all-weather and all-day conditions, and its sensitivity to the geometric and physical properties of the target [13,14,15,16,17]. The method realizes the transmission to the vegetation area, as well as a certain depth from the ground surface, by actively transmitting microwaves, which can realize the perception of the vertical structure information of the vegetation in a large-scale area. Therefore, this method has been widely applied to the classification of crop planting structure [18,19], forest species classification [13,20,21], and other related studies. In recent years, studies examining forest species classification by combining optical remote sensing images and SAR data have achieved notable results due to this method’s ability to combine the perception of the biochemical information of tree canopies and the vertical structure of trees [22,23]. Since the backscattering coefficients of SAR images contain topographic information, canopy information, and other noise information that interferes with the vertical structure perception of trees, the classification accuracy is limited to a certain extent. The Hilbert–Huang transform (HHT), with the advantages of having an adaptive decomposition property, no requirement for basis function selection, and its time–frequency distribution, can analyze smooth electromagnetic wave signals and nonlinear non-smooth electromagnetic wave signals [24,25]. Good research results have been achieved in seismic data analysis [26,27,28], gravity detection [29,30,31], and bridge deformation [32,33,34]. In our study, we propose a tree species classification method for multi-source remote sensing images combined with HHT. The method decomposes the electromagnetic wave information of SAR images into multiple intrinsic modal functions (IMFs) based on HHT, aiming to reduce to some extent the limiting effect of the coupling of the backscatter coefficient and multi-class information of SAR images on the accuracy of tree species classification, so as to emphasize the contribution of SAR data in tree species classification research. On this basis, we realized forest species classification using optical and synthetic aperture radar (SAR) images, to take into account the vertical structure of forest tree species and differences in vegetation spectral indices. The objectives of this study were (1) to investigate the degree of contribution of different orders of eigenmode functions to forest species classification, (2) to select the best eigenmode function of SAR image information and combine the reflectance information of multispectral images and spectral indices of multiple types of vegetation, and (3) to construct a wide-ranging forest species classification model based on machine learning.

2. Study Area and Data

2.1. Overview of the Study Area

The Fuxin region is located in the northwest of Liaoning Province, China, with an area of 10,355 km2 and a forest cover of about 2901 km2. The northern part of the Fuxin region is the Horqin Sandland, and the eastern part is in the Liaohe Plain. It is a transition zone between the Inner Mongolian steppe and the stony mountainous terrain of North China. The geomorphological pattern of the region is high in the northern part of the west and low in the southern part of the east, as shown in Figure 1. The Fuxin region’s climate is a semi-arid continental monsoon climate characteristic of the north temperate zone; the main feature of the climate is precipitation, but the spatial and temporal distribution is extremely uneven. The rain and heat simultaneously aid the growth of vegetation in the region, providing favorable conditions. The region is characterized by a variety of natural environments and a rich variety of forest species. It is worth noting that the region is at the junction of the semi-humid and semi-arid zones and that the forests of the Fuxin region are also an important part of the windbreak forests in North China, resulting in the distribution of both natural and planted forests, as well as coniferous and broadleaf forests, in the region [35]. Therefore, the classification of forest species in Fuxin is of great significance to the study of carbon storage, the carbon cycle, and desertification control in the region.

2.2. Research Data

Two Sentinel-1B images and four Sentinel-2 images covering the study area in 2021 were selected as data sources for our forest species classification study. Sentinel-1B and Sentinel-2 are multispectral imaging sensors operated by the European Space Agency and provide open, freely accessible data. (https://dataspace.copernicus.eu/, accessed on 29 August 2021). The Sentinel-1B image has a spatial resolution of 10 m, and the data have been processed by polarization decomposition, refined Lee filtering, and terrain radiation correction. The ground truth for the data used in the study was obtained by the researchers from the annual field survey, which took place in August 2021. The survey was based on 20 m × 20 m square plots covering a range of information on species composition, structural attributes, and health, and a total of 550 sample data points were measured for machine learning model training and validation, which were uniformly distributed within the forest cover of the study area and included all dominant tree species in the study area. According to the survey results, there were six dominant tree species in the study area: aspen (Populus L.), Chinese pine (Pinus tabuliformis Carrière), maple (Acer), Mongolian pine (Pinus sylvestris var. mongolica Litv.), peach tree (Prunus persica), and apricot tree (Prunus armeniaca L.).

2.3. Polarized Synthetic Aperture Radar Observations

In the polarization decomposition theory, the scattering matrix is the basis for the study of scattering of ground objects, containing both amplitude and phase information, and is a complex matrix linking the electric field strengths of the incident wave and the scattered return wave. However, the scattering matrix can only describe the scattering signature of pure scatterers, and it cannot describe that of distributed scatterers. The scattering of distributed scatterers can only be described statistically. In order to reduce the effect of noise, the scattering of distributed scatterers is commonly analyzed in dual-polarization SAR systems using the covariance matrix ( C 2 matrix), which is a second-order scattering information quantity generated by the spatial averaging of the target vector of the dual-polarization scattering matrix, k [36], which is generated by spatial averaging, as shown in Equation (1), and the C 2 matrix as shown in Equation (2) [37].
k = S V V S V H T
C 2 = C 11 C 12 C 21 C 22 = S V V 2 S V V S V H * S V H S V V * S V H 2
where * denotes the complex conjugate, and according to the scattering symmetry, there is S V V S V H * = S V H S V V * = 0. Equation (2) can be rewritten as Equation (3). In Equation (3), σ = S V V 2 , ε = S V H 2 / S V V 2 .
C 2 = σ σ 0 0 ε
Based on Equation (3), the C 2 matrix is a semi-positive definite Hermite matrix that can be decomposed into a weighted sum of two mutually orthogonal matrices, as shown in Equation (4). In Equation (4), λi and ξi denote the real eigenvalues and eigenvectors of the C 2 matrix, respectively, and Ci denotes the independent covariance matrices of rank 1, denoting a scattering mechanism and the corresponding eigenvalues λi. The expression of the eigenvalues of the C 2 matrix is shown in Equation (5). Based on Equation (5), the dual-polarization decomposition can be expressed as shown in Equation (6).
C 2 = i = 1 2 λ i C i = λ 1 ξ 1 ξ 1 * + λ 2 ξ 2 ξ 2 *
C 2 λ 1 = σ λ 2 = σ ε
C 2 = λ 1 C 1 + λ 2 C 2 = σ 1 0 0 0 + σ ε 0 0 0 1
From Equation (6), we can see that C 1 is the other scattering, which is the result of the joint action of odd and even scattering, and λ 1 represents the intensity of this other scattering, while C 2 is only related to VH and represents the multiple scattering process. λ 2 is used to represent the intensity of multiple scattering. In order to characterize the disorder in the scattering process of the feature, the scattering entropy (H) is introduced, which is defined as Equation (7). It is important to note that P i = λ i λ 1 + λ 2 .
H = i = 1 2 P i log 2 P i
The scattering entropy (H) characterizes the degree of randomness of target scattering. When H = 0, it indicates that the system is in a fully polarized state with only one scattering mechanism. When H tends to 0, the system is close to a fully polarized state, with one of the scattering mechanisms dominating; when H approaches 1, the λ 1 and λ 2 eigenvalues are close in size, and the system is close to a completely unpolarized state; when H = 1, the system is in a completely unpolarized state, and the target scattering is completely random noise. The scattering angle represents the average scattering mechanism from surface scattering to dihedral angle scattering and is defined as Equation (8).
a ¯ = P 1 a 1 + P 2 a 2
where a ¯ is the scattering angle. When a ¯ = 0°, it corresponds to isotropic surface scattering, and when a ¯ = 90°, it corresponds to isotropic dihedral scattering. A Sentinel-1 Single Look Complex (SLC) can generate the C 2 matrix, and then based on the C 2 matrix, we can choose the H-alpha decomposition. Then, we can finally obtain the three polarization features of the scattering angle, entropy, and anisotropy.

2.4. Spectral Index of Vegetation in Multispectral Images

The multispectral images used in this study were selected from Sentinel-2 L2A level data in July 2021, during the period of lush vegetation in the study area with less than 10% cloud cover. The Sentinel-2 imagery has 13 bands, which can provide data at up to a 10 m spatial resolution. In order to ensure the consistency of the spatial resolution of the images used in this study, we sampled all the bands to 10 m and went through the band fusion and region cropping preprocessing steps. Among the spectral features, the band information can reflect the differences between the absorption and reflectance intensities of the spectra of the leaf blades of different species [38], and several vegetation indices, including chlorophyll content [39], leaf moisture content [40], and vegetative soil conditions [41], have also been proven to be able to characterize the differences in the branching of the branches and leaves of different tree species. In our study, we selected 11 vegetation spectral features, including the Normalized Vegetation Index (NDVI), Normalized Humidity Index (NHI), Optimized Soil-Adjusted Vegetation Index (OSAVI), Chlorophyll Index (CGI), Green Index (GI), Moisture Stress Index (MSI), Red Edge Normalized Vegetation Index (reNDVI), Enhanced Vegetation Index (EVI), and Ratio Vegetation Index (RVI), along with the Chlorophyll Content Index (LCCI) and Normalized Difference Red Edge Index (NDRESWIR), as shown in Table 1. B1–B12 represent the spectral values of Sentinel-2 satellite images in band 1 and band 12, respectively; meanwhile, B1–B12 as optical features were also used as input data for forest species classification study.

3. Forest Species Classification Methods

3.1. The Hilbert–Huang Transform

The Hilbert–Huang transform (HHT) is a signal processing method that mainly consists of two parts, empirical modal decomposition (EMD) and the Hilbert transform (HT) [42]. The empirical modal decomposition can decompose the microwave signals of SAR data into several intrinsic mode function (IMF) components, and each IMF component is arranged in order, from high frequency to low frequency. The Hilbert transform (HT) is applied to each IMF component on the basis of EMD to obtain the amplitude and frequency of each IMF component. The HHT is based on the concept of intrinsic mode functions and the idea that any SAR data microwave signal can be decomposed into several IMFs, so that the instantaneous frequency has a real physical meaning. The HHT can adaptively decompose the signal according to its own time scale characteristics, and there are no human factors in the process of decomposition, meaning that the characteristics of the data itself are retained in the decomposition process, and the resulting individual IMF components contain the relevant information of the original signal.

3.1.1. Empirical Modal Decomposition

The empirical modal decomposition method, used to decompose SAR data microwave signals, should be based on the following three assumptions [43]: First, the SAR data microwave signal has at least two extreme points (one extremely large and one extremely small). Second, the feature scale is defined as the time interval between neighboring extreme points. Finally, if the signal has no extreme points but only “curved points”, then before decomposing it, one must first differentiate it one or more times to obtain the extreme points, and then integrate all the results to obtain the corresponding components. For this, the following steps are required [44]: The signal is known as x ( t ) . To determine all the extreme values of the microwave signal and fit them using the cubic spline interpolation algorithm, the upper envelope is obtained based on the fitted maximum and minimum values, respectively, x m a x ( t ) and lower envelope x m i n ( t ) . Then, we calculate the upper and lower mean values of the envelope, denoted as m 1 ( t ) , as shown in Equation (9). Using SAR data microwave signal data series x ( t ) , subtract the upper and lower mean values of the envelope m 1 ( t ) to obtain a new sequence h 1 ( t ) , as shown in Equation (10).
m 1 t = x m a x t + x m i n t / 2
h 1 t = x t m 1 ( t )
If the h 1 ( t ) IMF condition is satisfied, then h 1 ( t ) is the first IMF component. If the h 1 t IMF condition is not satisfied, then treat h 1 ( t ) as the original signal sequence, and then calculate the h 1 ( t ) and the mean value of the envelope of m1.1(t), and calculate the h 1 ( t ) and m1.1(t), and the difference between the two, to obtain h 1.1 t = h 1 t m 1.1 ( t ) . Repeat the above steps k times until h 1 . k t satisfies the IMF definition and the first eigenmode function is obtained, denoted as c 1 ( t ) . Usually, c 1 ( t ) represents the high-frequency part of the microwave signal data sequence of the SAR data, and it can be obtained by subtracting the original signal x ( t ) . Subtract c 1 ( t ) to obtain the remaining sequence r 1 t , that is, r 1 t = x ( t ) c 1 ( t ) . Take r 1 ( t ) as a new original sequence and follow the above steps to extract up to the nth eigenmode function, c n ( t ) . After extracting the last eigenmode function, the sequence r n ( t ) becomes a monotonic sequence. By integrating the decomposed components, the original sequence is obtained x ( t ) , as shown in Equation (11).
x t = i = 1 n c i t + r n ( t )

3.1.2. Hilbert Transform

For any continuous real signal X t , its Hilbert transform is shown in Equation (12), which is defined as a generalized integral over the real number field with respect to τ , the generalized integral, where τ is the integration variable used to traverse all time points in the integration process. Equation (13) represents the Hilbert inverse transform.
Y t = 1 π X ( τ ) t τ d τ
X t = 1 π Y ( τ ) t τ d τ
Y t and X t can form a set of conjugate complex pairs, which, in turn, leads to the analytic signal Z t , as shown in Equation (14), where a t is the instantaneous amplitude, and θ t is the phase.
Z t = X t + Y t i = a t e θ t i a t = [ X t 2 + Y t 2 ] θ t = a r c t a n Y t X t
Furthermore, the continuous real signal X t can be expressed as Equation (15), which, in turn, leads to the instantaneous frequency f ( t ) .
X t = R e j = 1 n a j t e θ t i = R e j = 1 n a j t e i ω j t d t f t = 1 2 d θ ( t ) d t
where Re denotes taking the real part, the expansion is called the Hilbert spectrum, and the residual component is omitted in the r n ( t ) equation. From the above analysis, it can be seen that the HHT method is a combination of EMD and the Hilbert transform.

3.2. Forest Species Classification Models

Considering the complexity and nonlinearity of forest species classification based on multi-source remote sensing data, it is difficult for existing generalized machine learning models to achieve the accurate classification of forest species in a natural state. Therefore, we proposed a multi-source remote sensing forest species classification method based on the above vegetation spectral features and the Hilbert–Huang transform of PolSAR signals. The model consists of two parts, as shown in Figure 2: (1) the feature transformation module—used to transform PolSAR features and multispectral remote sensing reflectance to obtain new features that can more accurately represent tree information, such as the vegetation-based spectral variables, etc.—and (2) the stochastic forest classifier module. Because the number of data samples is limited by the amount of measured forestry survey data in our study, the stochastic forest classifier, with its excellent classification effect under small-sample conditions, was chosen. Additionally, the random forest classifier, which has an excellent classification effect under small-sample conditions, was chosen as the base model, and hyperparameter optimization was based on the particle swarm algorithm [45].
The decision tree method is a decision classification method based on tree architecture, which mainly includes features such as selection, branch generation, and pruning [46], as shown in Figure 2. The random forest classifier section consists of several decision trees. The decision tree process starts from the root node, for partitioning the solution and performing multiple decompositions at the nodes. Different decision trees will have different criteria for dividing the data samples into two subsets. We iterate repeatedly according to the above logic until the termination of splitting is achieved. A wide variety of decision tree models are currently being developed and improved, while the CART decision tree algorithm has been widely used due to its ease of implementation and the relatively small amount of expertise and experience required [47]. Random forest is a parallel algorithm based on CART decision trees; it has a strong learning ability. Random forest trains multiple CART decision trees and then learns by summarizing the resultant experience of each decision tree in a certain combination. A random forest consists of many decision tree classifiers. Each decision tree classifier can be defined as h ( x , θ i ) , where x represents the input features, and θ i represents an independent set of identically distributed features. Each decision tree of the random forest votes independently for classification, and the final classification result is obtained via the combined vote [48]. The process of random forest classification is shown in Equation (16).
H = h 1 ,   h 2 ,   h 3 ,   ,   h k h x , θ i : X Y , i = 1 , 2 , , k
In the random forest classification process, the result with the highest number of votes in the decision tree is used as the classification result, which is calculated as in Equation (17). Due to random forest classification’s random nature, the risk of overfitting can be reduced by averaging the decision trees, making it highly resistant to overfitting. Random forest classification also has strong stability; even in the midst of a large fraction of feature loss, as long as no error occurs in more than half of the decision tree classifiers, accuracy can still be maintained, including in cases where a new data point has been added, as the presence of this new data point in the dataset will likely affect only one decision tree, hardly affecting all decision trees [49]. For unbalanced data, such classification has an ability to balance error. However, because random forests require the construction of multiple decision trees, they are more time-consuming and costly to compute [50]. The combination of the above features and the fact that the trees are independent of each other during the training of the random forest gives this classification process the ability to parallelize power and be relatively fast in remote sensing image dataset classification problems, having some applicability [51,52,53,54]. Thus, we selected random forest classification as the base model for tree classification.
y * = a r g m a x y Y , j H I ( f x t = y )
Due to the complex nonlinear relationship between the vegetation features of the multi-source remote sensing images selected in this study and the actual tree classification, we use the particle swarm algorithm (PSO) to find optimal solutions for key parameters such as n_estimators, max_features, max_depth, etc., on the random forest base model. The method, which uses population fitness data to find the optimal solution of a given problem, can be used for the optimization of nonlinear problems [55,56]. The algorithm treats each solution to the optimization problem as a random particle, and the position and velocity of each particle are random. During the iteration process, each particle records and updates its current position and velocity, recording its own optimal position and the optimal position of the population [57,58]. For example, to find an optimal problem in n dimensional space, there are m particles forming a population; the position of the 1st particle is written as x i = ( x 1 , x 2 , , x n ) , and the velocity is denoted as v i = ( v 1 , v 2 , , v n ) , while the iterative formula for updating the velocity and position of each particle in the population is as follows [59,60]:
v i ( t + 1 ) = ω v i ( t ) + c 1 r 1 ( p i ( t ) x i ( t ) ) + c 2 r 2 ( p g ( t ) x i ( t ) )
x i ( t + 1 ) = x i ( t ) + v i ( t + 1 )
where i = 1, 2, 3, …, m; P i is the individual pole position; P g is the global pole position; ω is the initial value of inertia weight; C 1 and C 2 are the acceleration coefficients; and r 1 and r 2 are random numbers between 0 and 1. For the RF model, the inclusion of the particle swarm algorithm with global convergence can better ensure the rationality of parameter optimization [59].

4. Experiments and Results

4.1. Forest Species Classification Results

In the regional forest species classification study, we accurately classified the forest species in the study area based on multi-source remote sensing forest species classification methods [61,62]. The model input variables were divided into two parts, namely, multi-source remote sensing data and measured forest survey data. Among them, the multi-source remote sensing data included the Normalized Vegetation Index (NDVI), the Normalized Humidity Index (NHI), the Optimized Soil Conditioning Vegetation Index (OSAVI), the Chlorophyll Index (CGI), the Greenness Index (GI), the Moisture Stress Index (MSI), the Red Edge Normalized Vegetation Index (reNDVI), the Enhanced Vegetation Index (EVI), and the Ratio Vegetation Index (RVI), along with the Chlorophyll Content Index (LCCI), the Normalized Difference Red Edge Index (NDRESWIR), and 11 other spectral vegetation features, in addition to the reflectance of multispectral data and second-order eigenvalues of five polarization observation variables, such as the main diagonal elements of the C2 matrix (C11 and C22) [63], and the H-alpha decomposition parameter (alpha, anisotropy, and entropy) modal function (IMF) [64]. Before random forest species classification, we first performed feature selection based on random forest importance ranking, as shown in Figure 3. In Figure 3, reNDVI and MSI were significantly less important than other features in forest species classification, so we removed these two features. We input the tree species information obtained from the 550 measured forest survey sites described in Section 2.2 and the 26 multi-source remote sensing features mentioned above into the tree species classification module at a ratio of 0.7:0.3, before optimizing the random forest model for parameter tuning based on the particle swarm algorithm. Among them, the tree species information is the type of tree species in the measured sample sites, and the multi-source remote sensing information includes the B1–B12 bands, the vegetation-based spectral variables shown in Table 1, and the five PolSAR features mentioned above. Then, we selected the optimal random forest model parameters for making inferences regarding the tree species in the study area. Regarding the model parameters, n_estimators are 100, max_features is all features. Table 2 represents the classification accuracy of multi-source remote sensing forest species classification methods, with overall classification accuracy and the Kappa coefficient as the evaluation criteria. As shown in Table 2, the overall classification accuracy of the model is 0.951, and the Kappa coefficient is 0.94. This shows the effectiveness of multi-source remote sensing forest species classification methods in the task of classifying forest species in the region.

4.2. Spatial Distribution of Forest Tree Species in the Study Area

In our tree species classification study, based on the optimal input feature combinations and training hyperparameters obtained in Section 4.1, as shown in Figure 4, the white-colored part of the Fuxin area is the non-forested area, consisting of towns, waters, and farmland. The non-forested area was not involved in our forest species classification based on expert visual interpretation. In addition, in order to highlight the differences between different tree species, we display the six dominant tree species in different colors in the forest species distribution map of the study area, shown in Figure 3, including aspen, Chinese pine, maple, Mongolian pine, peach tree, and apricot trees.
As analyzed in Figure 5, the forests in the study area are mainly distributed in the southwest, while the northeast has a small amount distributed, and the forests in the rest of the area are sporadically distributed. The above six dominant tree species in the study area are generally mixed forests, among which maple, aspen, and Chinese pine have a wider distribution area. Particularly noteworthy is that, in the northeastern forest area of the study area, Mongolian pine is the dominant tree species, and this area is also the area with the largest distribution of Mongolian pine in the study area. This is due to the fact that this area is a plantation forest, which is an important part of China’s “Three-North” protection forest, where a large number of Mongolian pine are planted in the windbreak forest, forming a unique Mongolian pine forest area in Fuxin.

5. Discussion

In Section 5.1, we discuss our first set of experiments, conducted to assess the extent to which individual remote sensing image features contributed to the forest tree species classification task. We provide a discussion of our second set of experiments, based on the Hilbert–Huang transform, in Section 5.2. We used these experiments to assess the change in the degree of contribution of polarized observational variables to the forest tree species classification task before and after the application of the Hilbert–Huang transform, as well as the interaction of multiple factors. Finally, in Section 5.3, we discuss the limitations of our proposed forest tree species classification method, and future research.

5.1. Ablation Study

In our study, in order to highlight the effects of various types of features on forest tree species classification, we designed a feature ablation experiment to quantitatively characterize each feature’s contribution to classification accuracy. In the feature space constructed by the optimal feature set, all the features were merged into three categories according to the differences in the feature descriptions on the image, specifically optical features, vegetation spectral features, and PolSAR features, where, optical features are the reflectance of the bands B1–B12 included in the Sentinel-2 image. Vegetation spectral features are the eleven vegetation-based spectral variables mentioned in Table 1. PolSAR features are the five polarization-based observational variables mentioned in Section 4.1. Under the same experimental conditions used to construct the model, featuring the experimental parameters and dataset division mentioned in Section 4.1, the three classes of features were cyclically deleted from the feature variables of the constructed model. The objective was to quantify the contribution of three remote sensing image features in forest species classification studies. If the classification accuracy was severely reduced after subtracting a certain type of remote sensing image feature, it indicated that this feature contributed more to the forest species classification task. It is worth noting that in this section of the experiment, no feature selection was performed in order to quantify the degree of contribution of optical features, vegetation spectral features, or PolSAR features. A total of 28 features were involved in the ablation study. The recognition accuracies of the three models and the changes in model accuracy before and after the deletion of features are shown in Figure 6. The model accuracy is most affected when no vegetation spectral features are introduced, as shown in Figure 6e,f. The overall classification accuracy decreased from 93.3% to 88.48%; from Figure 6e, it can be seen that the discriminative abilities of the peach tree, apricot tree, and oil pine are all greatly affected by the vegetation spectral features. This is due to the fact that there is a large gap between the peach tree, apricot tree, and camphor pine in the vegetation spectral indices, and such features can reflect these divergence gaps. The overall classification accuracy of our method is only second to that of when the PolSAR features extracted from SAR imagery are deleted. The decrease in overall classification accuracy after deleting the PolSAR features from the SAR images is only second to the vegetation spectral features, with a decrease of 1.78%. It can be seen that the canopy vertical structure feature extracted from the SAR imagery makes an important contribution to the classification results.

5.2. The Validity of the Hilbert–Huang Transform

The SAR image polarization observation variable contains terrain information, canopy information, soil moisture, and other information, so some noise information that interferes with the perception of the vertical structure of the tree limits the accuracy of the tree species classification to a certain extent. The Hilbert–Huang transform is usually used in the decomposition of time series signals, but in the processing of non-time series electromagnetic wave signals, a series of intrinsic modal functions that retain the original signal characteristics is generated by the decomposition and transformation of original electromagnetic wave signals, as shown in Figure 7. An electromagnetic wave is decomposed and transformed to generate a series of eigenmode functions that retain the original signal characteristics. Furthermore, some specific orders of eigenmode functions of the polarized observed signals will make a better contribution than the original signals in the tree species classification task. The reason for this phenomenon is that the eigenmode functions of specific orders play a role in suppressing the topographic information and soil moisture, etc., while preserving the characteristics of the original signal, which is more conducive to machine learning-based forest tree species classification. In order to investigate the effect of different orders of eigenmode functions in forest species classification, we used different orders of eigenmode functions to conduct forest species classification experiments in the study area of Fuxin comprising mixed coniferous and broad forest, as shown in Figure 7.
As shown in Figure 7, the first layer of each subplot is the original-input PolSAR signal. From the second layer onwards, there is the first–sixth-order intrinsic modal function (IMF), and the last layer is the residual signal. As can be seen from the results, we find that as the order of the intrinsic modal function increases, starting from the third-order intrinsic modal function (IMF), the signal waveforms appear to differ greatly from the original signal. Therefore, in our study, only the first- and second-order intrinsic modal functions were used to compare the original PolSAR signals, to explore the effectiveness of the Hilbert–Huang transform in tree classification studies. In view of this, based on the measured forest survey sample data and the PolSAR data of corresponding spatial locations mentioned in Section 2.2, we performed tree species classification experiments, respectively, and the results are shown in Table 3. From the results, it can be seen that the random forest model based on particle swarm optimization acquires higher tree species classification accuracy than the second-order IMF, which shows the effectiveness of our proposed tree species classification method based on the Hilbert–Huang transform.

5.3. Limitations and Future Research

We selected a particle swarm optimization-based random forest (PSO-RF) model using the Normalized Vegetation Index (NDVI), the Normalized Humidity Index (NHI), the Optimized Soil Conditioning Vegetation Index (OSAVI), the Chlorophyll Index (CGI), the Greenness Index (GI), the Enhanced Vegetation Index (EVI), the Ratio Vegetation Index (RVI), the Chlorophyll Content Index (LCCI), the Normalized Difference Red Edge Index (NDRESWIR), the reflectance of the multispectral data, and the third-order eigenmode function of the diagonal elements of the C2 matrix (C11 and C22), along with the H-alpha decomposition parameters (alpha, anisotropy, and entropy) (IMFs), vegetation spectral features, and polarization observation variables to classify the forest species in the study area. Since different multi-source remote sensing features contribute differently in the task of forest species classification [65], we explored the contribution of different types of remote sensing image features in the study of tree species classification based on ablation experiments. Meanwhile, in this study, we proposed a machine learning-based classification method (multi-source remote sensing forest species classification methods) using Sentinel-1 polarized observation data and Sentinel-2 multispectral data for forest tree species classification. In this study, a forest species classification experiment was conducted in the Fuxin area of Liaoning Province, China, as the study area. Some of our findings, such as the spatial distribution of forests and the species composition of forests in the northeastern part of the Fuxin area can be corroborated in the previously reported study by Wang et al. [35]. In addition, Yan et al. utilized MODIS images from 2012 to achieve forest classification in Northeast China by constructing a vegetation spectral index. In their results, the distribution of coniferous and broadleaf forests in the Fuxin area had some similarities with our study [66]. In a recently reported study, Liu et al. used GF-1 optical and Sentinel-1 images to map the distribution of coniferous forests in the northwest of Liaoning Province, China. Among them, the distribution of coniferous forests in the Fuxin area is very similar to the two coniferous forest distributions in our study [67]. However, the method in this paper still has some limitations. The method relies on a large amount of actual forest survey data and remote sensing data. In our case, although the number of samples is large, the proportions between samples of different tree species are not completely balanced. This may affect the accuracy of model calibration and estimation. In addition, natural conditions such as soil moisture and vegetation foliar water content may also affect the estimation results, and although we suppressed these signals to a certain extent based on the Hilbert–Huang transform, the effects still exist. Future work will focus on testing the method’s ability to classify other vegetation types (e.g., crops, etc.) as well as other types of forests (e.g., tropical rainforests). The use of other radar frequencies (e.g., LT-1 in the L-band and PolSAR data from the airborne radar in the P-band) and high-resolution multispectral data (e.g., GF-series satellites and ZY-series satellites) will also be investigated to test and analyze the method in different locations in future studies.

6. Conclusions

In this study, we selected optical features, vegetation spectral indices, and PolSAR polarization observation variables and applied them to temperate mixed conifer forests in the Fuxin area of Liaoning Province, China, to analyze their contributions to the classification of forest species in this area. Based on this, we realized forest species classification in the study area via the use of a stochastic forest classification model of particle swarm optimization (PSO). Through our study, we found that the PolSAR polarization observation variable can provide valuable information for forest species classification. The polarization decomposition of backscatter coefficients of SAR images to obtain canopy vertical structure information in feature space analysis experiments proved to be an important contribution to the task of forest tree species classification, demonstrating the effectiveness of the PolSAR polarization observation variable in forest species classification. In addition, by applying the Hilbert–Huang transform to PolSAR polarization observation variables, other feature information that plays an interfering role in perceiving the vertical structure of forests can be suppressed to a certain extent, and its role in forest tree species classification, along with that of PolSAR, should not be neglected. The model constructed shows high accuracy in forest species classification experiments, with an overall accuracy of 95.1% and a Kappa coefficient of 0.94. The multi-source remote sensing forest species classification approach integrates multiple sources of remote sensing data and significantly improves the accuracy of tree species identification compared to classification models derived from a single data source. Therefore, the method is suitable for large-scale forest species surveys of temperate mixed conifer forests based on remote sensing images.

Author Contributions

Conceptualization, H.Z. and W.S.; methodology, H.Z.; software, H.Z. and E.L.; validation, B.Z. and W.Z.; formal analysis, H.Z.; investigation, H.Z. and E.L.; resources, W.S. and J.D.; data curation, W.Z. and E.L.; writing—original draft preparation, H.Z.; writing—review and editing, W.S. and B.Z.; visualization, H.Z. and E.L.; supervision, W.S. and J.D.; project administration, H.Z., W.S. and Z.H.; funding acquisition, W.S. and B.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Basic Research Program of Liaoning Provincial Department of Education: LJ212410147065 and LJ212410147015 and the National Natural Science Foundation of China, under Grants 42071343 and 42204031 and Joint Fund Project of Liaoning Provincial Natural Science Foundation in 2024.

Data Availability Statement

The original data presented in this study are openly available via the [European Space Agency] at (https://dataspace.copernicus.eu/, accessed on 29 August 2021).

Acknowledgments

We are very grateful to all the reviewers, institutions, and researchers for their help and advice, which helped improve our work.

Conflicts of Interest

Author Zhongchao Hu was employed by the company Fuxin Ecological Group Digital Technology Research Institute Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

  1. Jenkins, J.C.; Chojnacky, D.C.; Heath, L.S.; Birdsey, R.A. National Scale Biomass Estimators for United States Tree Species. For. Sci. 2003, 49, 12–35. [Google Scholar] [CrossRef]
  2. Domke, G.M.; Woodall, C.W.; Smith, J.E.; Westfall, J.A.; McRoberts, R.E. Consequences of alternative tree-level biomass estimation procedures on U.S. forest carbon stock estimates. For. Ecol. Manag. 2012, 270, 108–116. [Google Scholar] [CrossRef]
  3. Suganuma, H.; Abe, Y.; Taniguchi, M.; Taniguchi, H.; Utsugi, H.; Kojima, T.; Yamada, K. Stand biomass estimation method by canopy coverage for application to remote sensing in an and area of Western Australia. For. Ecol. Manag. 2006, 222, 75–87. [Google Scholar] [CrossRef]
  4. Fassnacht, F.E.; Latifi, H.; Stereńczak, K.; Modzelewska, A.; Lefsky, M.; Waser, L.T.; Straub, C.; Ghosh, A. Review of studies on tree species classification from remotely sensed data. Remote Sens. Environ. 2016, 186, 64–87. [Google Scholar] [CrossRef]
  5. Leckie, D.G. Advances in remote sensing technologies for forest surveys and management. Can. J. For. Res. 1990, 20, 464–483. [Google Scholar] [CrossRef]
  6. Nagendra, H. Using remote sensing to assess biodiversity. Int. J. Remote Sens. 2001, 22, 2377–2400. [Google Scholar] [CrossRef]
  7. Wang, S.; Liu, C.; Wu, H.; Li, W. Urban tree species classification by combining satellite-based laser and multispectral images. J. Tongji Univ. Nat. Sci. Ed. 2024, 52, 970–981. [Google Scholar]
  8. Zhang, B.; Zhu, H.; Song, W.; Zhu, J.; Dai, J.; Zhang, J.; Li, C. A Multi-Baseline Forest Height Estimation Method Combining Analytic and Geometric Expression of the RVoG Model. Forests 2024, 15, 1496. [Google Scholar] [CrossRef]
  9. Zhu, H.; Zhang, B.; Song, W.; Xie, Q.; Chang, X.; Zhao, R. Forest Canopy Height Estimation by Integrating Structural Equation Modeling and Multiple Weighted Regression. Forests 2024, 15, 369. [Google Scholar] [CrossRef]
  10. Ghosh, A.; Fassnacht, F.E.; Joshi, P.K.; Koch, B. A framework for mapping tree species combining hyperspectral and LiDAR data: Role of selected classifiers and sensor across three spatial scales. Int. J. Appl. Earth Obs. Geoinf. 2014, 26, 49–63. [Google Scholar] [CrossRef]
  11. Vatandaslar, C.; Lee, T.; Bettinger, P.; Ucar, Z.; Stober, J.; Peduzzi, A. Mapping percent canopy cover using individual tree- and area-based procedures that are based on airborne LiDAR data: Case study from an oak-hickory-pine forest in the USA. Ecol. Indic. 2024, 167, 112710. [Google Scholar] [CrossRef]
  12. Ming, L.; Liu, J.; Quan, Y.; Li, M.; Wang, B.; Wei, G. Mapping tree species diversity in a typical natural secondary forest by combining multispectral and LiDAR data. Ecol. Indic. 2024, 159, 111711. [Google Scholar] [CrossRef]
  13. Zhang, B.; Zhu, H.; Xu, W.; Xu, S.; Chang, X.; Song, W.; Zhu, J. A Fourier–Legendre Polynomial Forest Height Inversion Model Based on a Single-Baseline Configuration. Forests 2024, 15, 49. [Google Scholar] [CrossRef]
  14. Kumar, S.; Sara, R.; Singh, J.; Agrawal, S.; Kushwaha, S.P. Spaceborne PolInSAR and ground-based TLS data modeling forcharacterization of forest structural and biophysical parameters. Remote Sens. Appl. Soc. Environ. 2018, 11, 241–253. [Google Scholar]
  15. Musthafa, M.; Kumar, P.; Alsulamy, S.; Khedher, K.; Avtar, R. Development of L-band fully polarimetric SAR algorithm for forest biomass retrieval using 7SD and random forest regression. Phys. Chem. Earth Parts A B C 2024, 136, 103688. [Google Scholar]
  16. Kovacs, J.M.; Lu, X.X.; Flores-Verdugo, F.; Zhang, C.; de Santiago, F.F.; Jiao, X. Applications of ALOS PALSAR for monitoring biophysical parameters of a degraded black mangrove (Avicennia germinans) forest. ISPRS J. Photogramm. Remote Sens. 2013, 82, 102–111. [Google Scholar] [CrossRef]
  17. Suzuki, R.; Kim, Y.; Ishii, R. Sensitivity of the backscatter intensity of ALOS/PALSAR to the above-ground biomass and other biophysical parameters of boreal forest in Alaska. Polar Sci. 2013, 7, 100–112. [Google Scholar] [CrossRef]
  18. Krishnan, V.; Asaithambi, M. Semi-empirical SAR vegetation index for crop discrimination based on biomass in semi-arid region: A case study in Perambalur district, India. Remote Sens. Appl. Soc. Environ. 2023, 32, 101064. [Google Scholar] [CrossRef]
  19. Shaik Salma, N.; Keerthana, B.M. Dodamani, Target decomposition using dual-polarization sentinel-1 SAR data: Study on crop growth analysis. Remote Sens. Appl. Soc. Environ. 2022, 28, 100854. [Google Scholar] [CrossRef]
  20. Avtar, R.; Malik, R.; Musthafa, M.; Rathore, V.S.; Kumar, P.; Singh, G. Forest plantation species classification using Full-Pol-Time-Averaged SAR scattering powers. Remote Sens. Appl. Soc. Environ. 2023, 29, 100924. [Google Scholar] [CrossRef]
  21. CSchulz, C.; Förster, M.; Vulova, S.V.; Rocha, A.D.; Kleinschmit, B. Spectral-temporal traits in Sentinel-1 C-band SAR and Sentinel-2 multispectral remote sensing time series for 61 tree species in Central Europe. Remote Sens. Environ. 2024, 307, 114162. [Google Scholar] [CrossRef]
  22. Koyama, C.N.; Watanabe, M.; Hayashi, M.; Ogawa, T.; Shimada, M. Mapping the spatial-temporal variability of tropical forests by ALOS-2 L-band SAR big data analysis. Remote Sens. Environ. 2019, 233, 111372. [Google Scholar] [CrossRef]
  23. Lankoande, O.; Hayat, M.M.; Santhanam, B. Speckle Reduction of SAR Images Based on a Combined Markov Random Field and Statistical Optics Approach (Version1); The University of New Mexio: Albuquerque, NM, USA, 2022. [Google Scholar]
  24. Rudi, J.; Pabel, R.; Jager, G.; Koch, R.; Kunoth, A.; Bogena, H. Multiscale analysis of hydrologic time series data using the Hilbert–Huang transform. Vadose Zone J. 2010, 9, 925–942. [Google Scholar] [CrossRef]
  25. Xiang, Y.; Wang, X.; He, L.; Wang, W.; Moran, W. Spatial-temporal analysis of environmental data of north Beijing district using Hilbert-Huang transform. PLoS ONE 2016, 11, e0167662. [Google Scholar] [CrossRef]
  26. Wang, Q.; Geng, P.; Chen, J.; He, C. Dynamic discrimination method of seismic damage in tunnel portal based on improved wavelet packet transform coupled with Hilbert-Huang transform. Mech. Syst. Signal Process. 2023, 188, 110023. [Google Scholar] [CrossRef]
  27. Skeberis, C.; Zaharis, Z.D.; Xenos, T.D.; Spatalas, S.; Arabelos, D.N.; Contadakis, M.E. Time–frequency analysis of VLF for seismic-ionospheric precursor detection: Evaluation of Zhao-Atlas-Marks and Hilbert-Huang Transforms. Phys. Chem. Earth Parts A B C 2015, 85, 174–184. [Google Scholar] [CrossRef]
  28. Zhou, Y.; Chen, W.; Gao, J.; He, Y. Application of Hilbert–Huang transform based instantaneous frequency to seismic reflection data. J. Appl. Geophys. 2012, 82, 68–74. [Google Scholar] [CrossRef]
  29. Al-Greer, M.; Bashir, I.; Demirel, A.; Keysan, O. Autonomous fault detection and diagnosis for permanent magnet synchronous motors using combined variational mode decomposition, the Hilbert-Huang transform, and a convolutional neural network. Comput. Electr. Eng. 2023, 110, 108894. [Google Scholar] [CrossRef]
  30. Zhang, Y.; Yang, X.; Zhu, F.; Li, Y.; Duan, C.; Yang, J.; Dong, L.; Zhao, Y. Characteristics of non–linear dynamics and energy transfer in a vibration gas–solid fluidized bed by using Hilbert–Huang transform. Powder Technol. 2019, 344, 970–980. [Google Scholar] [CrossRef]
  31. Sahoo, S.K.; Katlamudi, M.; Barman, C.; Lakshmi, G.U. Identification of earthquake precursors in soil radon-222 data of Kutch, Gujarat, India using empirical mode decomposition based Hilbert Huang Transform. J. Environ. Radioact. 2020, 222, 106353. [Google Scholar] [CrossRef]
  32. Zhang, R.R.; King, R.; Olson, L.; Xu, Y.L. Dynamic response of the Trinity River Relief Bridge to controlled pile damage: Modeling and experimental data analysis comparing Fourier and Hilbert–Huang techniques. J. Sound Vib. 2005, 285, 1049–1070. [Google Scholar] [CrossRef]
  33. Rouillard, V.; Lamb, M.J. Road vehicle shock detection algorithm using the Hilbert envelope. Comput. Methods Appl. Mech. Eng. 2024, 419, 116637. [Google Scholar] [CrossRef]
  34. Kunwar, A.; Jha, R.; Whelan, M.; Janoyan, K. Damage detection in an experimental bridge model using Hilbert–Huang transform of transient vibrations. Struct. Control. Health Monit. 2013, 20, 1–15. [Google Scholar] [CrossRef]
  35. Wang, F.; Yang, F.; Wang, Z. A Study on the Evolution of Forest Landscape Patterns in the Fuxin Region of China Combining SC-UNet and Spatial Pattern Perspectives. Sustainability 2024, 16, 7067. [Google Scholar] [CrossRef]
  36. Lee, J.S.; Potier, E. Polarimetric Radar Imaging: Basics to Applications, Second Edition; CRC Press: Boca Raton, FL, USA, 2016. [Google Scholar]
  37. Mandal, D.; Kumar, V.; Ratha, D.; Dey, S.; Bhattacharya, A.; Lopez-Sanchez, J.M.; McNairn, H.; Rao, Y.S. Dual polarimetric radar vegetation index for crop growth monitoring using sentinel-1 SAR data. Remote Sens. Environ. 2020, 247, 111954. [Google Scholar] [CrossRef]
  38. Pu, R. Mapping tree species using advanced remote sensing technologies: A state-of-the-art review and perspective. J. Remote Sens. 2021, 2021, 9812624. [Google Scholar] [CrossRef]
  39. Datt, B. A new reflectance index for remote sensing of chlorophyll content in higher plants: Tests using Eucalyptus leaves. J. Plant Physiol. 1999, 154, 30. [Google Scholar] [CrossRef]
  40. Zhang, Z.; Tang, B.H.; Li, Z.L. Retrieval of leaf water content from remotely sensed data using a vegetation index model constructed with shortwave infrared reflectances. Int. J. Remote Sens. 2019, 40, 2313–2323. [Google Scholar] [CrossRef]
  41. Huete, A.R. A soil-adjusted vegetation index (SAVI). Remote Sens. Environ. 1988, 25, 295. [Google Scholar] [CrossRef]
  42. Shi, C.X.; Luo, Q.F. Hilbert-Huang transform and wavelet analysis of time history signal. Acta Seismol. Sin. 2003, 16, 422–429. [Google Scholar]
  43. He, L. Research on the Application of Hilbert-Yellow Transform in Power Harmonic Analysis. Ph.D. Thesis, Central South University, Changsha, China, 2009; pp. 6–7. [Google Scholar]
  44. Yan, R.; Gao, R.X. Hilbert–Huang transform-based vibration signal analysis for machine health monitoring. IEEE Trans. Instrum. Meas. 2006, 55.6, 2320–2329. [Google Scholar] [CrossRef]
  45. Rodriguez-Galiano, V.F.; Ghimire, B.; Rogan, J.; Chica-Olmo, M.; Rigol-Sanchez, J.P. An assessment of the effectiveness of a random forest classifier for land-cover classification. ISPRS J. Photogramm. Remote Sens. 2012, 67, 93–104. [Google Scholar] [CrossRef]
  46. Charbuty, B.; Abdulazeez, A. Classification based on decision tree algorithm for machine learning. J. Appl. Sci. Technol. Trends 2021, 2, 20–28. [Google Scholar] [CrossRef]
  47. Sun, Z.; Wang, G.; Li, P.; Wang, H.; Zhang, M.; Liang, X. An improved random forest based on the classification accuracy and correlation measurement of decision trees. Expert Syst. Appl. 2024, 237, 121549. [Google Scholar] [CrossRef]
  48. Adugna, T.; Xu, W.; Fan, J. Comparison of random forest and support vector machine classifiers for regional land cover mapping using coarse resolution FY-3C images. Remote Sens. 2022, 14, 574. [Google Scholar] [CrossRef]
  49. Yeom, S.; Giacomelli, I.; Fredrikson, M.; Jha, S. Privacy risk in machine learning: Analyzing the connection to overfitting. In Proceedings of the 2018 IEEE 31st Computer Security Foundations Symposium (CSF), Oxford, UK, 9–12 July 2018; pp. 268–282. [Google Scholar]
  50. Chabalala, Y.; Adam, E.; Ali, K.A. Exploring the effect of balanced and imbalanced multi-class distribution data and sampling techniques on fruit-tree crop classification using different machine learning classifiers. Geomatics 2023, 3, 70–92. [Google Scholar] [CrossRef]
  51. Bao, F.; Wu, S.; Gao, J.; Yuan, S.; Liu, Y.; Huang, K. Continental aerosol properties and absorption retrieval using random forest machine learning method specific to geostationary remote sensing. Remote Sens. Environ. 2024, 311, 114275. [Google Scholar] [CrossRef]
  52. Hou, W.; Chen, J.; He, M.; Ren, S.; Fang, L.; Wang, C.; Jiang, P.; Wang, W. Evolutionary trends and analysis of the driving factors of Ulva prolifera green tides: A study based on the random forest algorithm and multisource remote sensing images. Mar. Environ. Res. 2024, 198, 106495. [Google Scholar] [CrossRef]
  53. Aghabalaei, A.; Ebadi, H.; Maghsoudi, Y. Forest height estimation based on the RVoG inversion model and the PolInSAR decomposition technique. Int. J. Remote Sens. 2020, 41, 2684–2703. [Google Scholar] [CrossRef]
  54. Chang, X.; Xing, Y.; Gong, W.; Yang, C.; Guo, Z.; Wang, D.; Wang, J.; Yang, H.; Xue, G.; Yang, S. Evaluating gross primary productivity over 9 China Flux sites based on random forest regression models, remote sensing, and eddy covariance data. Sci. Total Environ. 2023, 875, 162601. [Google Scholar] [CrossRef]
  55. Soleimani, H.; Kannan, G. A hybrid particle swarm optimization and genetic algorithm for closed-loop supply chain network design in large-scale networks. Appl. Math. Model. 2015, 39, 3990–4012. [Google Scholar] [CrossRef]
  56. Fadhillah, M.F.; Lee, S.; Lee, C.-W.; Park, Y.-C. Application of Support Vector Regression and Metaheuristic Optimization Algorithms for Groundwater Potential Mapping in Gangneung-si, South Korea. Remote Sens. 2021, 13, 1196. [Google Scholar] [CrossRef]
  57. Hosseini, M.; Naeini, S.; Dehghani, A.; Khaledian, Y. Estimation of soil mechanical resistance parameter by using particle swarm optimization, genetic algorithm and multiple regression methods. Soil Tillage Res. 2016, 157, 32–42. [Google Scholar] [CrossRef]
  58. Fu, C.; Gan, S.; Yuan, X.; Xiong, H.; Tian, A. Determination of Soil Salt Content Using a Probability Neural Network Model Based on Particle Swarm Optimization in Areas Affected and Non-Affected by Human Activities. Remote Sens. 2018, 10, 1387. [Google Scholar] [CrossRef]
  59. Shahana, K.; Ghosh, S.; Jeganathan, C. A survey of particle swarm optimization and random forest based land cover classification. In Proceedings of the 2016 International Conference on Computing, Communication and Automation (ICCCA), Greater Noida, India, 29–30 April 2016; pp. 241–245. [Google Scholar]
  60. Wei, Q.; Nurmemet, I.; Gao, M.; Xie, B. Inversion of Soil Salinity Using Multisource Remote Sensing Data and Particle Swarm Machine Learning Models in Keriya Oasis, Northwestern China. Remote Sens. 2022, 14, 512. [Google Scholar] [CrossRef]
  61. Mwakipunda, G.C.; Komba, N.A.; Kouassi, A.K.F.; Ayimadu, E.T.; Mgimba, M.M.; Ngata, M.R.; Yu, L. Prediction of hydrogen solubility in aqueous solution using modified mixed effects random forest based on particle swarm optimization for underground hydrogen storage. Int. J. Hydrogen Energy 2024, 87, 373–388. [Google Scholar] [CrossRef]
  62. Luo, W.; Yuan, D.; Jin, D.; Lu, P.; Chen, J. Optimal Control of Slurry Pressure during Shield Tunnelling Based on Random Forest and Particle Swarm Optimization. CMES—Comput. Model. Eng. Sci. 2021, 128, 109–127. [Google Scholar] [CrossRef]
  63. Altmann, Y.; Tobin, R.; Maccarone, A.; Ren, X.; McCarthy, A.; Buller, G.; McLaughlin, S. Bayesian restoration of reflectivity and range profiles from subsampled single-photon multispectral Lidar data. In Proceedings of the 2017 25th European Signal Processing Conference (EUSIPCO), Kos, Greece, 28 August–2 September 2017. [Google Scholar] [CrossRef]
  64. Pottier, E.; Cloude, S.R. Application of the H/A/alpha polarimetric decomposition theorem for land classification. Proc. SPIE—Int. Soc. Opt. Eng. 1997, 3120, 132–143. [Google Scholar] [CrossRef]
  65. Hu, B.; Li, Q.; Hall, G.B. A decision-level fusion approach to tree species classification from multi-source remotely sensed data. ISPRS Open J. Photogramm. Remote Sens. 2021, 1, 100002. [Google Scholar] [CrossRef]
  66. Yan, E.; Wang, G.; Lin, H.; Xia, C.; Sun, H. Phenology-based classification of vegetation cover types in Northeast China using MODIS NDVI and EVI time series. Int. J. Remote Sens. 2015, 36, 489–512. [Google Scholar] [CrossRef]
  67. Liu, L.; Zhang, Q.; Guo, Y.; Li, Y.; Wang, B.; Chen, E.; Li, Z.; Hao, S. Mapping Coniferous Forest Distribution in a Semi-Arid Area Based on Multi-Classifier Fusion and Google Earth Engine Combining Gaofen-1 and Sentinel-1 Data: A Case Study in Northwestern Liaoning, China. Forests 2024, 15, 288. [Google Scholar] [CrossRef]
Figure 1. Geographic location map of Fuxin area.
Figure 1. Geographic location map of Fuxin area.
Forests 16 00015 g001
Figure 2. The structure of the multi-source remote sensing forest species classification methods.
Figure 2. The structure of the multi-source remote sensing forest species classification methods.
Forests 16 00015 g002
Figure 3. Random forest importance ranking chart.
Figure 3. Random forest importance ranking chart.
Forests 16 00015 g003
Figure 4. Distribution of forest species in Fuxin region in 2021, determined based on multi-source remote sensing forest species classification methods.
Figure 4. Distribution of forest species in Fuxin region in 2021, determined based on multi-source remote sensing forest species classification methods.
Forests 16 00015 g004
Figure 5. Map of localized forest species distribution in the study area. (a) Map of forest species distribution in the southwestern part of the study area. (b) Map of forest species distribution in the northeastern part of the study area.
Figure 5. Map of localized forest species distribution in the study area. (a) Map of forest species distribution in the southwestern part of the study area. (b) Map of forest species distribution in the northeastern part of the study area.
Forests 16 00015 g005
Figure 6. Results of feature ablation experiments. ((a) is the producer accuracy of the PolSAR feature ablation experiment; (b) is the user accuracy of the PolSAR feature ablation experiment; (c) is the producer accuracy of the optical feature ablation experiment; (d) is the user accuracy of the optical feature ablation experiment; (e) is the producer accuracy of the vegetation spectral feature ablation experiment; (f) is the user accuracy of the vegetation spectral feature ablation experiment; and (g) is the overall accuracy of the three overall accuracies of the feature ablation experiments).
Figure 6. Results of feature ablation experiments. ((a) is the producer accuracy of the PolSAR feature ablation experiment; (b) is the user accuracy of the PolSAR feature ablation experiment; (c) is the producer accuracy of the optical feature ablation experiment; (d) is the user accuracy of the optical feature ablation experiment; (e) is the producer accuracy of the vegetation spectral feature ablation experiment; (f) is the user accuracy of the vegetation spectral feature ablation experiment; and (g) is the overall accuracy of the three overall accuracies of the feature ablation experiments).
Forests 16 00015 g006
Figure 7. Plot of Hilbert–Huang transform results. ((a) Hilbert–Huang transform result for C11; (b) Hilbert–Huang transform result for C22; (c) Hilbert–Huang transform result for alpha; (d) Hilbert–Huang transform result for anisotropy; and (e) Hilbert–Huang transform result for entropy).
Figure 7. Plot of Hilbert–Huang transform results. ((a) Hilbert–Huang transform result for C11; (b) Hilbert–Huang transform result for C22; (c) Hilbert–Huang transform result for alpha; (d) Hilbert–Huang transform result for anisotropy; and (e) Hilbert–Huang transform result for entropy).
Forests 16 00015 g007
Table 1. Vegetation-based spectral variables [7].
Table 1. Vegetation-based spectral variables [7].
Vegetation Spectral CharacteristicsDescriptive
Normalized Vegetation Index (NDVI) N D V I = B 8 B 4 B 8 + B 4
Normalized Humidity Index (NHI) N H I = B 11 B 3 B 11 + B 3
Optimized Soil-Adjusted Vegetation Index (OSAVI) O S A V I = B 8 B 4 B 8 + B 4 + 0.16
Chlorophyll Index (CGI) C G I = B 8 B 3 + B 4 B 5
Green Index (GI) G I = B 3 B 4
Moisture Stress Index (MSI) M S I = B 11 B 8
Red Edge Normalized Vegetation Index (reNDVI) r e N D V I = B 8 B 4 + B 5 B 8 + B 4 B 5
Enhanced Vegetation Index (EVI) E V I = 2.5 ( B 8 B 4 ) B 8 + 6 B 4 7.5 B 2 + 1
Ratio Vegetation Index (RVI) R V I = B 8 B 4
Chlorophyll Content Index (LCCI) L C C I = B 4 B 7 B 4 B 5
Normalized Difference Red Edge Index (NDRESWIR) N D R E S W I R = B 4 B 6 B 12 B 4 B 6 B 12
Spectral BandB1–B12
Table 2. A confusion matrix for the classification of tree species for multi-source remote sensing forest species classification methods.
Table 2. A confusion matrix for the classification of tree species for multi-source remote sensing forest species classification methods.
Forecast CategoryReference CategoryUser Accuracy
AspenMapleMongolian PinePeach TreeChinese PineApricot Tree
Aspen42010000.976
Maple12710010.900
Mongolian pine00343000.918
Peach tree00015001.000
Chinese pine00002201.000
Apricot tree01000170.944
Producer accuracy0.9760.9640.9440.8331.0000.944
Kappa0.94
Overall classification accuracy0.951
Table 3. Results of PolSAR based on Hilbert–Huang transform in forest tree species classification.
Table 3. Results of PolSAR based on Hilbert–Huang transform in forest tree species classification.
Experimental
Data
Average User
Accuracy
Average Producer
Accuracy
Kappa Coefficient
(i.e., Height of Force)
Original PolSAR0.5640.5540.488
IMF10.5040.5670.459
IMF20.7920.8790.779
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhu, H.; Song, W.; Zhang, B.; Lu, E.; Dai, J.; Zhao, W.; Hu, Z. Studying Forest Species Classification Methods by Combining PolSAR and Vegetation Spectral Indices. Forests 2025, 16, 15. https://doi.org/10.3390/f16010015

AMA Style

Zhu H, Song W, Zhang B, Lu E, Dai J, Zhao W, Hu Z. Studying Forest Species Classification Methods by Combining PolSAR and Vegetation Spectral Indices. Forests. 2025; 16(1):15. https://doi.org/10.3390/f16010015

Chicago/Turabian Style

Zhu, Hongbo, Weidong Song, Bing Zhang, Ergaojie Lu, Jiguang Dai, Wei Zhao, and Zhongchao Hu. 2025. "Studying Forest Species Classification Methods by Combining PolSAR and Vegetation Spectral Indices" Forests 16, no. 1: 15. https://doi.org/10.3390/f16010015

APA Style

Zhu, H., Song, W., Zhang, B., Lu, E., Dai, J., Zhao, W., & Hu, Z. (2025). Studying Forest Species Classification Methods by Combining PolSAR and Vegetation Spectral Indices. Forests, 16(1), 15. https://doi.org/10.3390/f16010015

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop