1. Introduction
Forest biomass monitoring is important for several reasons: the critical role of forests on carbon stock and flux, quantification of the effect of deforestation on greenhouse gas emissions, and its use as an indicator of land cover change in UN-FAO’s national statistics [
1,
2,
3]. Remote sensing has been widely used in past decades to estimate biomass, monitor deforestation, and carbon stock management [
4,
5,
6].
Remote sensing for estimating biomass can be divided into two-dimensional and three-dimensional methods. Two-dimensional methods use optical reflectance or SAR backscatter and relate it to biomass using regression [
7,
8,
9]. Satellites such as LANDSAT and MODIS have been used for this purpose [
10,
11]. The main problem with these methods is that the signal is insensitive to the increase of biomass above a certain level, i.e., the saturation effect. Therefore, they can only be used for areas with low biomass such as savannas [
9]. In the case of using SAR data, they can be extended to medium-level biomass if L and P band or multi-temporal SAR data is used [
12,
13,
14,
15]. In contrast, biomass estimation using three-dimensional methods is more accurate since forest biomass is strongly related to the vertical structure of the trees [
16,
17,
18,
19]. Airborne and space-borne Lidar can provide an accurate height map of the forested areas that later can be converted into a biomass map [
16,
20,
21]. In some studies, ground-based Lidar has been used to obtain biomass map as well [
17,
22]. Other means of acquiring a height map are photogrammetry, Interferometric SAR (InSAR) and Polarimetric Interferometric SAR (PolInSAR) [
23,
24]. PolInSAR records the reflected SAR signal from the same objects from two different points in two different times [
25]. InSAR and PolInSAR give a height map of a lower accuracy than Lidar, but they cover large areas and can be used under all weather conditions [
26,
27]. InSAR can produce a tree height map in two ways, e.g., by subtracting a DEM from InSAR height or by using dual-wavelength InSAR i.e., TanDEM-X and E-SAR [
19,
28,
29].
More advanced techniques such as TomoSAR and fusion of LiDAR and InSAR have also been used to estimate biomass [
30,
31,
32]. These techniques provide more accurate results as compared to InSAR. The height estimation accuracy of PolInSAR and TomoSAR is directly affected by the presence of temporal decorrelation [
33,
34,
35]. Temporal decorrelation is the change in SAR signal reflected from the objects changes in the position or attributes during the image acquisition time [
36]. We may thus expect that the accuracy of biomass estimation improves after mitigating the temporal decorrelation. Recently, the Random-Motion-over-Ground (RMoG) model has been proposed to compensate the effect of temporal decorrelation on height estimation using PolInSAR. This model is based upon the RVoG model combined with modeling motion of trees in the vertical direction. It has shown promising results in improving height estimation accuracy up to 20% [
33,
37]. We have proposed the Random-Motion-over-Ground-Legendre (RMoG
) model based on the RMoG model, but with the Fourier–Legendre series as the structure function instead of the simple exponential function used in the RMoG model [
38].
The main objective of this paper is to analyze the accuracy of biomass estimation after correcting for temporal decorrelation. To do so, the first step is to develop a model to estimate biomass from PolInSAR height maps resulting from the Random-Volume-over-Ground (RVoG), RMoG, and RMoG
models. We have selected the Remningstorp forest as a study area as it has been studied during BioSAR2010 campaign [
39] as the test site. Both linear and non-linear models were developed to clarify the relation between PolInSAR height and biomass. The second step is to apply different strategies for train and test dataset selection to make sure that the chosen model is general and accurate enough [
40]. At the final step, for evaluating the results both field data and Lidar data were used.
This paper has been organized in the following sections: first, a short introduction of biomass estimation using remote sensing in general and SAR data specifically is given. Second, the dataset and study area is characterized. In the third section, explanation of the methodology is given. In the fourth section, the results are presented followed by the discussion and the conclusion sections.
2. Materials and Methods
A summary of the methodology used in this study is presented in
Figure 1.
Hereafter, the methodology is described in details.
2.1. Study Area
A complete report about the BioSAR2010 campaign and Remningstorp area has been published previously [
39]. Here, a summary is provided that is relevant for this study. The Remningstorp forest (
N,
E) is located in the southwestern part of Sweden and has an area of approximately 1200 ha. The forest type is hemi-boreal, which intermediates between boreal and temperate forest types [
41]. Dominant forest species are
Norway Spruce and
Scots pine, and it is completed by a mixture of
oak,
birch, and
aspen. The elevation range of this area is between 120 and 145 m above sea level.
2.2. Field and Lidar Data
Field observations used in this study have been collected resulting in 214 circular plots with a radius of 10 m [
39]. The location of these field plots is shown in
Figure 2.
The area has been divided into several stands based upon the homogeneity of forest cover type. Field plots have been defined in such a way that they fall into one of these stands completely to minimize the effect of heterogeneity within the plots.
Measurements inside each plot according to the Heureka forestry system [
42] include an H100 height, defined as the basal area weighted average of the 10 highest trees in each plot [
18], Diameter at Breast Height (D
), i.e., the diameter at 1.3 m above ground, and the dominant tree species. According to the recorded information, 60% of the field plots is covered by spruce, 30% covered by pine, and 10% dominated by oak, birch, and aspen (mixed deciduous) trees. Additionally, dry aboveground biomass, including stems, branches, bark, and needles, has been measured [
39]. For all 10 m radius field plots, biomass was between 6 and 250 ton ha
with an average of 105 ton ha
. The allometric equation used for measuring biomass is based upon height and D
[
11]. Allometric equations are the equations that relate biophysical parameters to biomass. At the plot level, the general form of this equation equals
where
B is the above-ground biomass,
N is the number of trees per area unit,
H is the tree height,
is the species-related wood density, and
f is a form factor. These equations have been developed for different forests according to the FAO standard procedure [
43]. Developing these equations requires excessive field work and the data are only valid for a 5-year period. After this period, the density and the factor
f change, so the measurements are repeated every five years. In this paper, we used the previously developed allometric equations for the test site [
39].
Lidar data of the study area were acquired with an average density of 69 returns m
. The airborne Lidar data has been used for two main purposes. First for evaluating height estimation models i.e., the RVoG, RMoG, and RMoG
models. Second, the biomass map produced from Lidar data [
20,
39,
41] was used for the assessment of generated biomass maps from PolInSAR heights beside the field data. The predicted biomass map from Lidar height is presented in
Figure 3.
Besides predicting the biomass map, we used the Lidar Digital Surface Model (DSM) with a cell size of 0.5 × 0.5 m, as the reference height map. Since we have the canopy height map, the Lidar DSM is called the Canopy-Height-Model (CHM) hereafter. This map was derived from the differences between the first and the last returns of the Lidar pulses and is displayed in
Figure 4. Both the predicted biomass map and the CHM have been used as the reference data sets.
The areas with zero height values on the Lidar CHM are masked on the predicted biomass map. The areas with taller trees on the Lidar CHM correspond to areas with the highest biomass values. The two maps show a clear positive correspondence between tree height and biomass.
2.3. PolInSAR Data
During the BioSAR 2010 campaign, 10 PolInSAR images were obtained. These images were collected with the ONERA SETHI airborne sensor, developed by the Office National d’Etudes et de Recherches Aérospatiales (ONERA) [
39]. This device can acquire full Polarimetric Interferometric SAR images in both L and P bands.
Three pairs of P-band images, acquired specifically for PolInSAR analysis, were chosen for this study. Their spatial baseline is equal to 30 m, the heading angles equal 199, 178, and 270, respectively, and the sensor height is approximately equal to 4000 m. The images have been delivered in SLC (Single Look Complex) format and have four polarization channels: , , , and with as the system is mono-static. The ground-range resolution of the InSAR data is 0.5 × 0.5 m.
Next, after correcting for the flat earth phase [
44], the complex coherence was obtained according to [
25]. The optimized coherence channels plus linear polarizations (
,
, and
), circular polarization (
,
, and
), and the Pauli basis polarization (
and
) channels [
45] served as input for the RVoG, RMoG, and RMoG
models. These models are applied on each pair of P-band images separately and the final height maps are generated by mosaicing the resulting images.
2.4. Tree Height Estimation Using PolInSAR Data
Since the Polarimetric Interferometric SAR systems illuminates an area from two different positions at two different times, any change in the objects is represented in the form of signal decorrelation. Thus, the height of the trees, like other properties, can be retrieved by reversing the process of interferograms generation, e.g., quantifying the decorrelation sources. In the past decade, the models used for obtaining tree properties were only taking volumetric decorrelation into account. Volumetric decorrelation is the inconsistency of the signal caused by the vertical structure of the trees. In the presence of natural phenomena in particular wind, changes in the position of the scatterers cause temporal decorrelation. This should be taken into account when estimating tree height. The following models have been applied to estimate height without and with accounting for temporal decorrelation.
2.4.1. The RVoG Model
The RVoG model has been popular over the past decade for estimating forest height [
46,
47,
48]. It relates the observed complex coherence to the height of the vegetation layer with the assumption that only volumetric decorrelation is present. It expresses the volumetric coherence
, as
where
is the ground phase,
is the vegetation height, and
is the ground-to-volume-ratio [
47]. The parameter
is added to compensate for the surface scattering mechanism effects on the observed coherence. It can have any value between 0 and ∞ [
47]. The parameters
and
are defined as
where
is the mean wave extinction coefficient (
),
is the average look angle (
),
is the terrain slope angle (
), and
is the vertical wave number (
). The final form of complex coherence by the RVoG model
is
To solve Equation (
5), we need to estimate
, so we need a polarization channel as the volume and another channel as the ground scattering representation. Solving Equations (2)–(5) is done with a multi-stepwise procedure following [
47].
As the RVoG model ignores temporal decorrelation, it provides biased results if the temporal decorrelation is high. This can happen due to wind or precipitation. To tackle this limitation, the RMoG model has been proposed [
37], which models the induced motion caused by wind in the vertical direction.
2.4.2. The RMoG Model
The RMoG model assumes that the vegetation layer consists of randomly distributed vertical scattering objects located on a rough dielectric layer. The main difference with the RVoG model here is to assume that both layers have random movements along the vertical axis [
33]. The complex coherence using the RMoG model
is obtained as
where
is defined as
Here,
is the reference height, which is an arbitrary constant value [
33],
is the SAR system wavelength, and
and
are the surface and canopy layer motion standard deviations, respectively. The function
is the structure function, which defines the vertical structure of the vegetation layer. Similar to the RVoG model, the RMoG model assumes a Gaussian function to represent the structure of the trees. The procedure of solving Equation (
6) is explained in details in [
33].
Here, the primary assumption is that the maximum of the structure function occurs at the top of the canopy layer and the volume backscatter and from ground components are distinguishable. This assumption leads to biased height estimation especially in complex and multi-layer forests [
37]. To improve the accuracy of the RMoG model, and inspired by [
48], we have proposed the RMoG
model [
38].
2.4.3. The RMoG Model
The RMoG
model is similar to the RMoG model, but instead of an exponential function, it uses a finite Fourier–Legendre series as the structure function when considering the motion of scatterers along the vertical axis [
38]. Thus, the complex coherence of the RMoG
model
equals
Here,
,
and
[
38]. The
are normalized Legendre coefficients, and the
are Fourier functions of different orders [
48] defined as
where the
,
are Legendre polynomials [
49]. If we assume that
according to the short time interval between image acquisition times, the final form of complex coherence obtained by the RMoG
model equals
The number of terms in
depends upon the vegetation layer; for more complex vegetation layers, several terms are needed, whereas a single layer can be modeled by fewer terms. Note that, for estimating more than two Legendre coefficients, we need multi-baseline SAR data. It is argued in [
48] that using the first and second terms of the series should be sufficient in most cases; the accuracy, however, depends upon the complexity of the vegetation layer. According to the relatively simple structure of the hemi-boreal forest and the availability of PolInSAR data, we fixed the number of terms to two. Therefore, parameters to be estimated are
,
,
,
, and
. The way of solving Equation (
10) is described in [
38]. The solving method is similar to the RMoG model, but some extra steps should be taken in order to deal with more number of unknown parameters.
2.5. Biomass Estimation Using PolInSAR Data
The most important outcome of the RVoG, RMoG, and RMoG models is a height map of the area. Averaged, plot-wise height values are then extracted from these height maps. In this study, we applied a buffer zone of 10 m to reduce the border effect.
We used a robust regression model to retrieve biomass from PolInSAR height. The used algorithm is non-linear least square fitting with LAR (Least Absolute Resiuals) algorithm for providing the prediction bounds within a 95% of confidence interval [
50,
51]. According to past studies, there are different assumptions about the relation between biomass and PolInSAR height. The first and most common assumption is that the biomass can be obtained from PolInSAR height by a linear equation [
9]. In other studies, this relation has been shown to be non-linear [
41]. Therefore, to find the best model for estimating biomass, we started by a polynomial model to relate biomass to PolInSAR height. The general form of this model is
where
n is the polynomial degree,
B is biomass,
H is the PolInSAR height obtained from the RVoG, RMoG, and RMoG
models, and the
are coefficients to be estimated. The parameters
B and
H are standardized and thus are dimensionless. We determined the number of coefficients by trial and error, and the final form arrived at a value of
. The polynomial model is
We stopped to increase
n if
and consequently
according to [
52]. As is evident from Equation (
12), the relation between biomass and PolInSAR is non-linear.
Another common assumption is that biomass increases proportionally to the increase of height. Therefore, as a further examination, the one-term and two-term exponential equations were tested. Furthermore, we tested whether the power series can describe the relation between two variables. The general form of the selected exponential model equals
and for the power series it equals
where the
and
,
are the coefficients to be estimated.
The reliability of Equations (12)–(14) is based upon the assumption that the observed height values have a normal distribution of errors. In our case, according to [
39,
41], outliers occur in measured biomass values. Robust regression [
53] was therefore applied to avoid the effect of outliers on estimated parameters. It assigns a weight value to each data point with weights being adjusted iteratively. In the first iteration, one value, in our case 0.5, is assigned to all data points, and by using normal least squares the coefficients are estimated. In the next iterations, the weights are re-estimated to place less emphasis on the points at a larger distance from model predictions. Next, the model coefficients are recalculated using weighted least squares. This procedure proceeds until the difference between each estimated coefficient in two different iterations is below a thereshold, selected as 0.005 in this study [
41].
2.5.1. Data Splitting
The selection of a training and a test data set directly affects the reliability of estimated parameters. Different strategies exist for solving the so-called “
data splitting” problem [
40,
54]. It is important to choose a good subset of data for training that preserves the generality of the model while having high accuracy. These two aspects, however, are conflicting. For determining the best training subset, we applied two procedures. First, we selected 75%, 50%, and 25% of the data points randomly as the training set and carried out the regression. Second, the field plots were divided into three groups based upon measured biomass value as low (<120 ton ha
, medium (120–240 ton ha
), and high (>240 ton ha
). Half of the data points from each category was then used as training data and the other half as test data. These two approaches, called the random and stratified methods, respectively, were compared and the best method was selected. The modeling procedure was repeated for mean diameter and basal area to understand how structural properties of trees are related to the height obtained from PolInSAR images.
2.6. Accuracy Assessment
To compare PolInSAR height with ground truth data, the first step is to convert the height maps from slant-range to ground-range. This was done using a precise DEM available in the campaign data and using the suggested method in [
39]. We used both field and Lidar data as the reference height map. For evaluating the accuracy of three different height estimation models, the average height inside each plot was compared with the corresponding values on Lidar CHM and with the H100 value of each plot. Different metrics were used for evaluating the RVoG, RMoG, and RMoG
models. To test the distribution of measured and obtained biomass maps and for comparing them, the Kolmogorov–Smirnov (KS) test was applied [
55]. A one-sample KS test examined the null hypothesis that the data distribution is a hypothesized distribution against the alternative that it does not follow such distribution. The test statistic is the maximum absolute difference between the empirical Cumulative Distribution Function (CDF) calculated from the dataset
and the hypothesized CDF of
, in our case the normal distribution:
A two-sample KS test checks if the two datasets have similar distributions. The alternative is that the two tested datasets have different distributions.
3. Results
A summary of the most important biophysical parameters measured within field plots is provided in
Table 1 and a histogram in
Figure 5. These values are reported for the whole dataset, whereas the regression analyses results are based on the selected training data as described in
Section 2.5.1.
According to
Table 1, the measured biomass varies from 1.2 to 315.96 ton ha
. The mean value is 120 ton ha
and the median value is 105 ton ha
, showing that the distribution of the field data deviates from the normal distribution. Other measured biophysical parameters i.e., basal area and mean diameter also have a wide range of variation.
The histogram is skewed and shows several peaks. This makes it difficult to fit a specific distribution function to biomass values. Most plots have a biomass around 100–130 ton ha
, whereas the second peak occurs between 200 and 230 ton ha
. This multi-modal histogram validates our choice to divide the data into different categories and to select a training data set for each category. The relation between measured biomass and biomass predicted by Lidar is presented in
Figure 6. The relative error is approximately equal to 12.7%.
The plot shows a positive correlation between measured biomass and predicted biomass by Lidar height. Two points marked with circles are most likely measurement errors.
After processing SLC PolInSAR images and obtaining complex coherence [
25], height maps using the RVoG, RMoG, and RMoG
models were obtained. The height map generated by the RMoG
model is shown in
Figure 7.
Figure 7 shows a good agreement with the Lidar CHM of
Figure 4. For shorter trees, we see that the RMoG
model overestimates tree heights, while for taller trees the height is underestimated. Plot-wise averaged height values from three different models were compared to the corresponding H100 values. The results showed that
for the RVoG model,
for the RMoG model, and
for the RMoG
mode. The weak correlation between PolInSAR height and H100 is because H100 is measured based on the tallest trees inside each field plot while the PolInSAR height represents the average of canopy height. Similarly, height maps obtained from the RVoG, RMoG, and RMoG
models were compared with Lidar CHM, and results are shown in
Figure 8. The Lidar and PolInSAR heights show a positive correlation. In the case of the RMoG model, the noise has decreased and the data points are closer to the least square line. We also observe that the PolInSAR height estimation error increases with increasing tree heights.
Next, we selected training and test sets for performing regression analyses. The stratified sampling leads to more accurate biomass estimation. All results hereafter have been obtained using stratified sampling. The relation between PolInSAR height and biomass was examined using different models. First, we fitted a polynomial model to relate the PolInSAR height and biomass. The results are listed in
Table 2.
The highest
and lowest RMSE values were obtained by the RMoG
model. Another important observation is that, if we set
, then
. The
B value shows the error of the fitted polynomial model for estimating biomass from PolInSAR height. We can force a no-intercept regression to estimate biomass, but the RMSE value then becomes high (
ton ha
). Next, we correlated height obtained by PolInSAR and biomass via an exponential model and a power series. The results of fitting the exponential model are listed in
Table 3.
Similar to polynomial regression, the RMoG
model is the best model, with the lowest RMSE and highest
value. Previous studies [
52] suggest using an exponential model to estimate biomass from height. We observed that the RMSE is higher than when using the polynomial model, and
is lower. In contrast with the third-degree polynomial, the exponential model passes through the origin, hence a zero biomass value for
. In addition, the power series was tested to estimate biomass, and the results are presented in
Table 4.
We obtain higher RMSE values by using power series in comparison with the polynomial and exponential models. According to the variation of the data set and to provide further comparison, the piece-wise regression was also tested. In finding the break points of the dataset, the target is to find
y given the number of break points
x, which minimizes the following function
Here,
is the interpolation function,
is the break point, and
is the interpolation point for the
[
56]. The results of piece-wise linear regression relating biomass to the RMoG
height are listed in
Table 5.
The piece-wise regression showed good results as compared to the power-series and polynomial models listed in
Table 2 and
Table 3. for
m, the slope of the line decreases significantly, showing the saturation effect. For both exponential and piece-wise regression models, the saturation effect happens when biomass reaches approximately 300 ton ha
.
The polynomial, exponential, power series, and piece-wise linear regression curves for the RMoG
model are shown in
Figure 9.
We notice a saturation effect for
m in
Figure 9b–d.
Figure 8a does not show the saturation point clearly, which is in line with other studies [
52].
According to
Table 2,
Table 3,
Table 4 and
Table 5 and
Figure 9, the most accurate model is the exponential model, while the piece-wise regression also showed good results. Since it is more straightforward to fit a single curve to the whole dataset, we selected the exponential to produce the biomass map. For accuracy assessment, the remaining 52 plots out of 214 field plots were used as the test data set. Different metrics were employed for this purpose [
57,
58]. These metrics included bias
, co-variance
, root-mean-square-error (
RMSE), and the coefficient of determination
. The results are listed in
Table 6.
Biomass values of the test dataset were compared to corresponding predicted biomass by the Lidar data, and the results are plotted in
Figure 10.
Furthermore, the relation between PolInSAR height and mean diameter on the one hand and basal area on the other have been examined to gain insight into how these biophysical parameters affect height measurements. The results are shown in
Figure 11. The results showed a significantly better performance of the RMoG
model on test data.
The results of applying two-dimensional KS-test between predicted biomass by PolInSAR and Lidar data are listed in
Table 7, whereas the biomass map resulting from the RMoG
model is shown in
Figure 12. The results showed a positive linear relation between mean diameter, basal area, and PolInSAR height, whereas the relation between mean diameter and basal area are stronger. This shows that inclusion of the mean diameter in obtaining biomass estimation models increases the accuracy of the biomass estimation.
The parameter
h equals 0 if the null hypothesis of having similar CDFs is not rejected and equals 1 if the CDFs are not similar at the 0.05 significance level. The parameter
D is the distance between two dataset distribution functions and shows the goodness of fit. From
Table 7, we can see the
D decreased by 50% by applying the RMoG
model.
To provide a better comparison, the predicted biomass by PolInSAR height was also compared to the measured biomass of the field plots. The results are shown in
Figure 13.
The results shows a positive linear relation between PolInSAR predicted biomass and field biomass for all three height estimation models.
Figure 13c shows better performance of the RMoG
model, while the other models show good correlation as well.
4. Discussion
In this paper, we investigated whether correcting for temporal decorrelation improves biomass estimation accuracy. From the BioSAR2010 campaign, we selected plots with a 10 m radius, in line with other studies [
41]. Recorded values for these plots showed that biomass and other structural parameters had a broad range of varieties. This feature enabled us to examine how the PolInSAR height is related to different levels of biomass. Moreover, the effect of saturation was observed.
For evaluating three different height estimation models, the H100 and Lidar CHM were used as reference data. Neither Lidar nor PolInSAR height showed a significant correlation with the H100 parameter. The reason is that H100 represents the tallest trees, while the PolInSAR and Lidar height in each pixel represents the average of tree heights. In contrast, the Lidar and PolInSAR height values showed a strong relation. As we expected and as was shown previously, the relative error of height estimation decreased from 35% for the RVoG model to 18.6% for the RMoG models.
Since there is no agreement in previous studies whether the correlation between biomass and tree height is linear or not, we examined different regression models. The outcome of the training sets should be a model that can predict the values for the test data set accurately and meanwhile is able to predict new data. If the model is over-trained, the accuracy will be good but the model would not be able to produce good results with new data. These two aspects, accuracy and generality, is known as the bias and variance dilemma [
54]. It becomes important when dealing with a high volume of data and when the developed model is being used in estimating biomass at the global scale. In this study, we tried two different strategies for data splitting to evaluate the effect of training set data selection on estimating biomass. The results showed a better performance of stratified sampling, but the computational time increased threefold. The best result is achieved by fitting an exponential model. The exponential curve and the piece-wise regression show a saturation effect for a biomass level above 300 ton ha
. The saturation level increases from 100 ton ha
when using SAR backscatter [
59] to 300 ton ha
by using PolInSAR height as the main predictor. In tropical forests, it is important to consider the saturation effect since biomass is mostly higher than 300 ton ha
.
The RMSE of biomass estimation decreased from 46% when using height resulting from the RVoG model to 30% for the RMoG model. As expected, there is a considerable improvement in estimating biomass after modeling temporal decorrelation. The results validated the hypothesis of the improvement of biomass estimation after correcting for temporal decorrelation.
Our results were compatible with previous studies for the same test site. For example, InSAR images at C-band were employed to estimate stem volume and achieved an RMSE value of 27% [
60]. Similar studies using L and P-bands estimated biomass with RMSE values between 30–40% for L-band and between 20–30% for P-band [
57,
61]. Interestingly, recent research in Remningstorp area using TanDEM-X interferometric heights showed an RMSE value around 16.5% [
41]. This improvement can be due to a larger number of baselines and a high number of field plots. Other studies in tropical [
62], temperate [
63,
64], boreal, and hemi-boreal forests [
65] showed an average error between 2 and 5 m with and average of 3.5 m in estimating tree heights using PolInSAR. This shows a PolInSAR height can be used to estimate biomass in different forest types.
The RMSE only, however, cannot be enough to show the model performance. Based on
Table 6, the RVoG model has a significant bias, which, after correcting for temporal decorrelation, has decreased by about 50%. Moreover, the correlation increased from 0.45 for the RVoG model to 0.82 for the RMoG
model. For the RMoG model, the results were close to the RMoG
model. The difference is because of the more accurate structure function in the RMoG
model.
The relation between the RMoG height and two other structural parameters, namely basal area and mean diameter, was also high. This is due to the capability of P-band in penetrating into the vegetation layer as well as the Fourier–Legendre series employed in the RMoG model to represent the structure of trees. This is an interesting finding since it shows the importance of taking different vegetation structures into account for estimating biomass. Involving other biophysical parameters for example the D in estimating biomass can have a huge effect on improving accuracy.
The result of the one-dimensional KS test showed a different distribution of measured biomass and predicted biomass by all height estimation models and Lidar data. The two-dimensional KS test showed similar distribution functions for predicted values by Lidar and PolInSAR data. The main reason is that, at the plot level, biomass is measured for individual trees, while remotely sensed data measure the canopy of trees that are spread around the tree trunk.
There is a requirement for methods that can estimate biomass at broader scales. The method and dataset we used here are limited because we used one pair of PolInSAR images. By using multi-baseline tomographic PolInSAR images, we can add more terms to the structure function of the RMoG model. Therefore, reconstruction of the tree structure has a higher resolution. We expect that this leads to more accurate biomass maps, but with an increased computation time.
There are other biophysical parameters of trees i.e., diameter and basal area that affect the SAR signal. The ideal case would be to develop a model based on height, diameter, and the basal area of the trees. In the case of using PolInSAR data, the only observable parameter is tree height. One possibility is to use the field data together with PolInSAR height, but in doing so the developed model would only be applicable to this area and we lose any generality. Thus, we chose to develop a model based on only PolInSAR height to maintain the general applicability of the model [
41,
52].
Our findings showed a strong relation between PolInSAR height and above-ground biomass. The accuracy of biomass estimation improved considerably after correcting for temporal decorrelation. This means that the accuracy of height estimation is directly related to the biomass estimation accuracy. This shows the importance of mitigating temporal decorrelation in estimating height using PolInSAR since such mitigationaffects the further products of height maps such as biomass. At present, the P-band images, on which we tested our proposed model, are only available in airborne sensors. Although they will also be available on spaceborne sensors in the near future, it would be of great value to examine the effect of the correction of temporal decorrelation on biomass estimation accuracy using other wavelengths, especially the L-band. As a sequel to this study, we suggest developing a unique structure function for each tree type according to the availability of the forest classification map. This will lead to more accurate biomass maps [
66].
5. Conclusions
This study showed that, after mitigation of temporal decorrelation , the accuracy of biomass estimation improved by approximately 10%. Therefore, for producing biomass maps of a desirable accuracy, the temporal decorrelation factor should be considered. The best model selected for estimating biomass from PolInSAR was the exponential curve. Furthermore, we showed, in forests with diverse levels of biomass, that it is a good approach to divide the area according to the different height classes and use different regression lines for each class. By using PolInSAR height, the saturation level increases from 150 to 300 ton ha. This is important in tropical forests where biomass is high and saturation becomes an issue.
A weak relation was observed between remotely sensed height maps and H100 values. This is due to the sensitivity of remote sensing sensors to the average height of the tree canopy versus the H100, which represents the tallest trees inside each plot. Basal area and mean diameter also affect the SAR signals, and consequently PolInSAR height estimates
The presented results demonstrate the capability of PolInSAR to obtain biomass maps. This is important since, in the near future, the satellite BIOMASS mission aims to produce biomass maps with wall-to-wall coverage of important forested areas.