Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Next Article in Journal
Using Adjoint-Based Forecast Sensitivity to Observation to Evaluate a Wind Profiler Data Assimilation Strategy and the Impact of Data on Short-Term Forecasts
Next Article in Special Issue
Confidence-Aware Ship Classification Using Contour Features in SAR Images
Previous Article in Journal
Broadband Waveguide Chip Design with Phase Measurement Function for Enhancing Optical Interferometric Imaging
Previous Article in Special Issue
NDVI or PPI: A (Quick) Comparison for Vegetation Dynamics Monitoring in Mountainous Area
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Learn from Simulations, Adapt to Observations: Super-Resolution of Isoprene Emissions via Unpaired Domain Adaptation

by
Antonio Giganti
1,2,*,
Sara Mandelli
1,2,
Paolo Bestagini
1,2 and
Stefano Tubaro
1,2
1
Dipartimento di Elettronica, Informazione e Bioingegneria, Politecnico di Milano, 20133 Milan, Italy
2
Image and Sound Processing Lab (ISPL), Politecnico di Milano, 20133 Milan, Italy
*
Author to whom correspondence should be addressed.
Remote Sens. 2024, 16(21), 3963; https://doi.org/10.3390/rs16213963
Submission received: 18 September 2024 / Revised: 21 October 2024 / Accepted: 22 October 2024 / Published: 24 October 2024

Abstract

:
Plants emit biogenic volatile organic compounds (BVOCs), such as isoprene, significantly influencing atmospheric chemistry and climate. BVOC emissions estimated from bottom-up (BU) approaches (derived from numerical simulations) usually exhibit denser and more detailed spatial information compared to those estimated through top-down (TD) approaches (derived from satellite observations). Moreover, numerically simulated emissions are typically easier to obtain, even if they are less reliable than satellite acquisitions, which, being derived from actual measurements, are considered a more trustworthy instrument for performing chemistry and climate investigations. Given the coarseness and relative lack of satellite-derived measurements, fine-grained numerically simulated emissions could be exploited to enhance them. However, simulated (BU) and observed (TD) emissions usually differ regarding value range and spatiotemporal resolution. In this work, we present a novel deep learning (DL)-based approach to increase the spatial resolution of satellite-derived isoprene emissions, investigating the adoption of efficient domain adaptation (DA) techniques to bridge the gap between numerically simulated emissions and satellite-derived emissions, avoiding the need for retraining a specific super-resolution (SR) algorithm on them. For this, we propose a methodology based on the cycle generative adversarial network (CycleGAN) architecture, which has been extensively used for adapting natural images (like digital photographs) of different domains. In our work, we depart from the standard CycleGAN framework, proposing additional loss terms that allow for better DA and emissions’ SR. We extensively demonstrate the proposed method’s effectiveness and robustness in restoring fine-grained patterns of observed isoprene emissions. Moreover, we compare different setups and validate our approach using different emission inventories from both domains. Eventually, we show that the proposed DA strategy paves the way towards robust SR solutions even in the case of spatial resolution mismatch between the training and testing domains and in the case of unknown testing data.

1. Introduction

All plants emit diverse volatile compounds, collectively known as biogenic volatile organic compounds (BVOCs). Over the past two decades, BVOC emissions have attracted growing scientific interest due to their high reactivity and their ability to significantly alter atmospheric chemistry and physics [1,2,3,4,5,6,7,8,9]. Annually, BVOCs contribute about 1.1 petagrams (Pg) of carbon to the atmosphere, a figure comparable to global methane emissions. These emissions are closely linked to carbon assimilation, particularly in forest and agroforestry species, which are major BVOC sources [10,11,12].
Despite their beneficial role in absorbing atmospheric CO2 through photosynthesis, which helps mitigate climate change by sequestering approximately 90% of global biomass carbon, BVOCs have a contrasting, harmful effect on atmospheric chemistry [13,14]. They actively contribute to the formation of tropospheric ozone and aerosols by reacting with both natural and anthropogenic compounds, especially nitrogen oxides, leading to the production of ozone and photochemical smog [7,15,16,17]. Moreover, BVOCs influence the atmospheric lifetime of other greenhouse gases, like methane, and can promote the formation of secondary aerosols, a key component of PM10 in the atmosphere [18,19].
This highlights the significance of understanding biogenic emissions and emphasizes the need for detailed inventories of BVOC emissions, which are crucial for atmospheric research. Additionally, the dual significance of BVOC emissions and carbon assimilation in biosphere–atmosphere interactions means that the impacts of climate change, i.e., rising CO2 levels and increasing temperatures, on the carbon cycle and atmospheric chemistry are likely to be profound. As a result, understanding the mechanisms of BVOC synthesis and how plants and ecosystems respond to climate change is crucial for accurately predicting future emission trends.
To perform these investigations, quantitative estimations of BVOC emissions are required [20]. For this reason, different approaches have been proposed to estimate such emissions, which can be categorized into two main classes: (i) bottom-up (BU) and (ii) topdown (TD) approaches.
The BU approaches estimate BVOC emissions from numerical simulations based on biological and environmental factors at ground level [13,20]. These methods offer high spatial and temporal resolution, often down to a few kilometers and hourly timescales [21,22,23]. However, these models also come with intrinsic uncertainties relative to biological processes’ complexities, plant emissions’ variability, and the need to generalize from limited data points to broader regions. As a result, BU estimates can vary significantly, mainly when extrapolated to larger scales, introducing a degree of uncertainty into emission inventories. We denote with S the emissions obtained with these approaches since they are numerically S imulated.
In contrast, the TD approaches estimate BVOC emissions by exploiting their oxidation products, whose presence can be identified from satellite observations [14,24]. These approaches typically suffer from coarse spatial and temporal resolutions. For instance, TD methods at a global scale may provide data at resolutions of several kilometers [24,25,26,27], with temporal resolutions that are daily or even less frequent [28,29]. This broader scope can make it difficult to capture the fine-scale and rapidly changing emissions that BU models can detect, reducing the number of available observations. Nonetheless, TD approaches are crucial for providing a more comprehensive view of BVOC emissions over large areas, and they can serve as a powerful tool for validating and refining BU models [30,31]. We denote with O the emissions derived with these approaches since they are obtained from O bserved satellite measurements.
For example, Figure 1 depicts two BVOC emission maps (isoprene species) related to the same geographical area but from different domains, i.e., S (BU—numerical simulations) and O (TD—satellite observations). A smoother behavior of the O emissions is noticeable with respect to the S emissions, which present higher spatial frequency content. Notice also the differences in the dynamic ranges.
For these reasons, the higher spatial frequency content and greater availability of S emissions from BU approaches can play a crucial role in enhancing the spatial resolution of O emissions coming from TD approaches. However, this necessarily requires a preliminary adaptation phase between the two domains [32,33]. The refinement process typically involves using high-resolution (HR) S emissions to improve the resolution of low-resolution (LR) O satellite measurements [32], effectively filling in the spatial gaps and providing more detailed BVOC emission maps.
In this direction, super-resolution (SR) techniques [34,35] can be applied to BVOC data to generate denser emission maps by starting from coarser maps [36]. Recently, some methods have been proposed to super-resolve BVOC emission by leveraging deep learning (DL) models [36,37,38]. Despite their promising performance, the main limitation of the currently available approaches is that they require training and testing on BVOC inventories with similar characteristics [32]. Indeed, super-resolving TD emissions using an SR algorithm trained on BU emissions results in unreliable reconstructions due to the huge data domain mismatch [32,36,38].
This work addresses the SR task in data domain mismatch, exploiting numerically simulated emissions ( S domain) to super-resolve satellite-derived emissions ( O domain) without the need of retraining a specific SR algorithm on them. As reported above, O data are typically too few to train effective SR models with good generalization capabilities.
The differences between the temporal and spatial domains make the SR difficult (we provide further details in Section 4.1). For these reasons, we investigate the use of domain adaptation (DA) [39,40] strategies to bridge these discrepancies by transferring knowledge from S data to O data. To the best of our knowledge, no prior studies have tackled DA techniques in the context of isoprene emission SR.
Specifically, we employ DA techniques based on image-to-image translation algorithms, precisely, the cycle generative adversarial network (CycleGAN) [41]. The CycleGAN has been designed to transform images from one domain to another, aligning images with different characteristics. This process involves training generative adversarial network (GAN) models to learn the relationships between the two domains. In our proposed work, we use the CycleGAN in conjunction with SR algorithms to super-resolve O emission maps from the knowledge of detailed S data. In particular, we deviate from the conventional CycleGAN framework by introducing additional loss terms that improve DA and emission SR. This enhanced integration of data sources could support better environmental assessments and more informed decision making [42,43], as well as improving the system’s robustness and generalization capabilities in performing the SR task [32].
Among BVOCs, isoprene is by far the most important in terms of global emissions and atmospheric impact [12,18]. For this reason, we extensively demonstrate the proposed method’s effectiveness and robustness in restoring fine-grained patterns of isoprene emissions. We evaluate various setups and validate our method using different isoprene emission inventories from both domains. Finally, we demonstrate that the proposed DA strategy enables robust SR solutions, even when there is a spatial resolution mismatch between the training and testing domains or when the testing data are unknown.
The proposed method enables the synthesis of high resolution (HR) emission maps, making a significant step toward fulfilling the wishes of atmospheric chemical and climate modeling communities. Our main contributions are as follows:
  • We present a novel approach that addresses the SR task under data domain mismatch, using simulated emissions ( S domain) to super-resolve satellite-derived emissions ( O domain) without retraining specific SR models.
  • We propose a DA strategy to transfer knowledge from the S domain to the O domain, addressing the challenges of limited observed data and differences in spatial/temporal domains.
  • Our proposed DA strategy deviates from the original CycleGAN framework, enhancing it with additional loss terms to improve SR and DA performance.
  • We demonstrate the effectiveness and robustness of the proposed method on isoprene emissions by restoring fine-grained patterns across different setups.
  • We show that our DA strategy enables robust SR solutions even under spatial resolution mismatch or when testing data are unknown.
The remainder of this work is structured as follows. Section 2 presents some related works in the field of SR and DA for climate data. Section 3 introduces the backbone structure of our proposed DA method. Section 4 explains the overall methodology, network architecture, and the loss design. Section 5 introduces the emission inventories adopted to validate our methodology. Section 6 illustrates the training setup and the evaluation metrics used in this work. Section 7 reports an extensive experimental campaign to validate our proposed methodology. Section 8 investigates two actual challenges in emission SR concerning the system’s generalization capabilities. Section 9 draws some conclusions and future directions.

2. Related Works

2.1. Super-Resolution of Climate Data

The high costs and societal implications of atmospheric modeling and observational systems make recent advances in SR techniques particularly promising, because they offer the potential to enhance the resolution of climate data at a lower cost. DL-based SR methods have already shown great promise in Earth science, improving the spatial and temporal resolution of climate data from satellite observations and chemical transport model simulations [32,44]. As a matter of fact, these methods are now more effective than traditional statistical approaches in capturing local-scale patterns, driving advancements across various research areas [32,45,46].
For instance, several DL-based approaches have been proposed to super-resolve precipitation fields [47,48,49,50,51,52], wind fields [53], and other surface variables [54,55,56,57]. Other approaches focus on enhancing atmospheric chemicals and emissions [32,55,58]. For example, the authors in [59] proposed increasing the resolution of NO2 emission maps from tropospheric monitoring instruments, combining low resolution (LR) target data with in situ observations, with additional geographical information and climate variables. Similarly, in [60], the authors enhanced the spatial resolution of several air pollutants by exploiting graph convolution to simulate spatiotemporal dynamics of air pollutants, encoding their local spreading characteristics. Other approaches also enforce the adherence to known physical laws within the SR algorithm for climate variables [44,61]. Recently, the adoption of a generative diffusion model to super-resolve several climate variables from the ECMWF Reanalysis v5 (ERA5) product was proposed in [62].
Despite these advances, there are still some gaps in the SR of climate data, such as the limited focus on BVOCs emissions [36,37,38]. This offers the potential for investigations in atmospheric chemistry and climate modeling [32].

2.2. Domain Adaptation for Climate Data

In the literature, several approaches have been proposed to solve the adaptation to different data domains [40,59,63,64]. Most of the available works are related to standard 8-bit photographs [65,66,67]. Moreover, extensive research has been conducted on different remote sensing products and tasks, like land cover classification [63,68], change detection [69], semantic segmentation [63,70], and estimation of climate variables in data-poor regions [71,72].
A few works use DA techniques (like fine-tuning) to enhance the performance of domain-shifted climate data [32,38,73,74]. All these works address the out-of-distribution challenges that arise when the training and testing data follow different distributions. Most of the proposed techniques focus on adapting the SR algorithm to handle unobserved historical and future climate scenarios [32,75,76,77,78]. The aim of DA in this case is not only to increase the accuracy of the SR model but also to enforce the efficiency, robustness, and generalization capabilities of the model [33,72].
However, to the best of our knowledge, there are no prior works that exploit DA strategies for BVOCs emissions, and in general, to increase the robustness of a pretrained SR algorithm. For this reason, in this study, we propose a DA approach to increase the robustness in restoring fine-grained patterns of isoprene emissions derived from satellite observations, exploiting an SR network trained on an extensive simulated emissions inventory.

3. CycleGAN Basics

This section describes the backbone architecture employed in our DA framework for the SR task.
The CycleGAN [41] is a powerful generative model designed for image-to-image translation tasks, where the goal is to convert images from one domain (e.g., horses) into another one (e.g., zebras). Unlike traditional approaches that require aligned image pairs [79], CycleGAN learns to perform this transformation using unpaired datasets. This capability makes it highly valuable when paired data are difficult or impossible to obtain, such as enhancing images, converting paintings to photographs, or translating styles between different artistic mediums.

3.1. Structure

The original CycleGAN architecture consists of two main neural network (NN) components: generators and discriminators. There are two generators, G and F , and two discriminators, D X and D Y , forming a symmetric structure, as reported in Figure 2a.
Generators. (i) G translates images from domain X to domain Y ; (ii) F translates images from domain Y to domain X ;
Discriminators. (i) D Y distinguishes between real images in domain Y and those generated by X ; (ii) D X distinguishes between real images in domain X and those generated by Y .
The interplay between these components drives the learning process, where each generator attempts to produce images that can fool the corresponding discriminator. In contrast, each discriminator learns to differentiate between real and generated images better.

3.2. Training Objective

The effectiveness of CycleGAN is governed by carefully designed loss functions, each serving a specific purpose in guiding the network’s training.
GAN loss. The GAN loss is the core of the GAN framework. It encourages the generators to produce images indistinguishable from real images of the target domain. At the same time, each discriminator improves its ability to classify real images as real and generated images as fake. There are two GAN losses, one for each generator–discriminator pair. For instance, in the case of the mapping function G : X Y and its discriminator D Y , the GAN loss is
L GAN ( G , D Y , X , Y ) = E y p ( y ) [ log D Y ( y ) ] + E x p ( x ) [ log ( 1 D Y ( G ( x ) ) ) ] ,
where E x p ( x ) and E y p ( y ) denote the expected values of the X and Y data distributions, respectively. This loss shows an adversarial dynamic, where the generator G minimizes the discriminator’s ability D Y to distinguish between real and fake, while the discriminator maximizes its ability to differentiate them. In other words, G and D Y play the following two-player minimax game [80]: min G max D Y L GAN ( G , D Y , X , Y ) . A similar loss is defined for the mapping function F : Y X and its discriminator D X , i.e., min F max D X L GAN ( F , D X , Y , X ) . This loss pushes the generators to create realistic images while the discriminators improve their ability to distinguish real from fake images.
Cycle consistency loss. A key innovation in CycleGAN is the cycle consistency loss, which ensures that the learned mappings are consistent. Specifically, if an image is translated to the other domain and then back, it should return to its original form (Figure 2b):
L Cyc ( G , F ) = E x p ( x ) [ F ( G ( x ) ) x 1 ] + E y p ( y ) [ G ( F ( y ) ) y 1 ] ,
where · 1 represents the L1 norm. This loss prevents the generators from making drastic or unrealistic changes during translation, ensuring that the mapping is meaningful and reversible.
Identity loss. An optional identity loss is used to preserve the pixel intensity and structural integrity of images when they are already close to the target domain. This loss penalizes the generator if it alters an image that belongs to the target domain (Figure 2c):
L Id ( G , F ) = E y p ( y ) [ G ( y ) y 1 ] + E x p ( x ) [ F ( x ) x 1 ] .
The identity loss helps maintain the consistency of the output, especially in cases where the input is already well-aligned with the desired output domain.

4. Proposed Domain Adaptation Methodology

4.1. Motivations

We aim to super-resolve LR isoprene emission maps derived from satellite observations. We achieve this by exploiting information from emission maps obtained from numerical simulations. As explained in Section 1, we denote with S the simulated and with O the observed data domains. The former is obtained through numerical simulations (BU approaches), while the latter is derived from satellite observations (TD approaches).
In our previous work [36], we addressed the problem of recovering HR emission maps starting from LR versions, considering only numerically simulated S emissions. We estimated HR emissions I ^ HR s by starting from LR emissions I LR s :
I ^ HR s = T s 1 ( N s ( T s ( I LR s ) ) ) ,
where the T s operator is a data transformation applied to LR emissions [81,82], N s is an NN architecture for SR [83], and T s 1 is the inverse transformation operator of T s . The T s , T s 1 , and N s operators were learned from a training set of S -only emissions. This means the network was tailored to the domain of data under investigation, i.e., the domain S .
In this work, we propose to super-resolve emission maps derived from satellite observations (domain O ) by leveraging data obtained through numerical simulations (domain S ). This is paramount since having very diverse and complete data from numerical simulation models enables addressing a wide range of scenarios where the lack of acquired real data does not allow an NN to be trained. This holds for remote sensing (RS) data derived from satellite observations since they are usually restricted to a specific mission in space and time, preventing the possibility of generating a diverse and large dataset for robust DL training. In light of this, exploiting knowledge from simulated emissions could mitigate this lack of information and enhance the performance of DL algorithms, which are known to be very data-consuming.
Regarding the SR of isoprene emissions, an adaptation between the two domains ( S and O ) is necessary to maximize the information derived from numerically simulated emissions. Since the N s is trained on S -domain data, a domain-transfer technique is required to adapt the O and S domains. To super-resolve isoprene emission maps coming from the O domain by exploiting useful information from the S domain, we need to adapt the input (LR) and output (HR) maps of the SR operator N s :
  • Input-level adaptation ( O S ): N s requires as input an LR simulated map I LR s , since it is originally trained on the S domain. Therefore, an algorithm is required to transform the emission map from the O domain ( I LR o ) into an emission that “seems like” belonging to the S domain, thus performing I LR o I LR s ˜ . The s ˜ has the same meaning as in Figure 2: it indicates that, by applying the domain translation, we are estimating just an approximation of the S domain distribution.
  • Output-level adaptation ( S O ): the operator N s performs the SR task in the S domain, returning I ^ HR s ˜ . Therefore, another algorithm is required to transform the super-resolved map in the S domain ( I ^ HR s ) back to the O domain, thus performing I ^ HR s ˜ I ^ HR o ˜ . In this case, the o ˜ also indicates an estimation of the observed O domain distribution.
More formally, we propose to estimate the HR observed emissions I ^ HR o ˜ as
I ^ HR o ˜ = T o 1 ( O ( N s ( S ( T o ( I LR o ) ) ) ) ) .
where O and S are two different mapping functions in charge of performing the emission domain adaptation, and specifically,
O : S O and S : O S .
Instead, T o , T o 1 , and N s are the same data transformation and SR algorithm used in Equation (4), except for the fact that T o and its inverse T o 1 are estimated from O emissions.
Figure 3 shows a simplified pipeline of our proposed method.
We propose to solve this task by adopting a GAN-based image-to-image translation framework. Specifically, our pipeline relies on the CycleGAN architecture [41], introduced in Section 3, which has two essential features for our task: (i) unpaired image-to-image translation and (ii) robust adaptation.
When working with RS products like isoprene emission maps, it is very uncommon to have a perfect correspondence between the O and S data. Indeed, the two domains might report different characteristics in terms of dynamic range due to different temporal aggregations. Numerically simulated data are often aggregated over short periods to capture brief variability, which tends to smooth out longer-term characteristics [22,43]. In contrast, satellite observations are typically aggregated over extended periods to minimize the impact of random errors and improve the signal-to-noise ratio [21,25,26]. Additionally, atmospheric monitoring satellites are usually polar-orbiting, limiting the temporal resolution of measurements to the satellite’s revisit time. Meteorological conditions can further complicate data collection in certain regions, preventing information acquisition over short intervals. These distinctive factors collectively contribute to a domain shift between O and S data.
Motivated by these reasons, we design the O and S mapping functions as two different generative models in an adversarial framework. We provide more details on our proposed training and deployment steps in the following section.

4.2. Proposed Methodology

Our proposed DA methodology is based on five main steps (see Figure 3):
  • Emission transformation. Since isoprene emissions exhibit a spatially sparse pattern and a wide dynamic range in both the O and S domains [24,43], we adopt the approach originally proposed in [36], where a non-linear data transformation T based on prior information derived from statistical analysis of the HR data are used to increase the robustness to outliers and local maxima. This transformation adapts the emission dynamic into more feasible values, required for numerical stability when training the NNs. Therefore, we define the transformed emissions as T LR o = T o ( I LR o ) , T HR o = T o ( I HR o ) , T LR s = T s ( I LR s ) , T HR s = T s ( I HR s ) . T o and T s are the non-linear data transformations exploiting statistics from the O and the S domains, respectively (see [36] for more details).
  • Emission adaptation O S . The LR transformed emission maps from the O domain are fed into a DA network, which performs the mapping function S : O S . This mapping enables passing from T LR o to T LR s ˜ = S ( T LR o ) . The superscript s ˜ in T LR s ˜ indicates that by applying the adaptation function S we are finding an approximation of the simulated S domain distribution. This DA step is needed since the SR network expects data with a distribution compatible with S -domain emissions. For the emission maps in Figure 3, we can notice the significant spatial and numerical difference between the O (source) and S (target) domains of the S generator.
  • Emission super-resolution. The domain-adapted emission T LR s ˜ is the input of the operator N s in charge of performing the actual SR task. Therefore, the SR task can be formulated as
    T ^ HR s ˜ = N s ( T LR s ˜ ) .
    In this case, both T LR s ˜ and T ^ HR s ˜ are emission maps that “seem like” they belong to the S domain, hence the tilde. However, notice that they actually belong to the O domain.
  • Emission adaptation S O . After the SR in the S domain, we perform an inverse adaptation operation to convert HR emissions in the O domain. Therefore, the super-resolved emission T ^ HR s ˜ is processed by applying an inverse mapping function O : S O . Computing T ^ HR o ˜ = O ( T ^ HR s ˜ ) , we return to the initial domain of the emission map ( O ). In this case, o ˜ also indicates an approximation of the O domain. For emission maps in Figure 3, we can notice the spatial and numerical difference between the S (source) and O (target) domains of the O generator.
  • Emission back-transformation. At this point, the T ^ HR o ˜ emission is back-transformed to its original data range by applying the T o 1 data transformation, resulting in I ^ HR o ˜ = T o 1 ( T ^ HR o ˜ ) . This represents our estimation of the HR emission map in the O domain.
Both O and S are NNs, whose parameters are estimated during training. Specifically, we propose implementing them through specific modifications of the CycleGAN framework [41].
In the original implementation of the CycleGAN, the authors propose the adoption of two GANs, each consisting of two networks (generator and discriminator) that are trained simultaneously. The first GAN is trained to map a source domain into the target one; the second deals with inverse mapping, and thus, maps the target domain into the source one. Even if the parameters of the two GANs are different, their mapping functions are learned from the same data, switching source and target domains according to their needs (see Section 3 for more details).
In our work, we map the observed O domain (source) into the simulated S one (target) and vice versa, as in the original CycleGAN. However, in this specific task, we have another element of data diversity that needs to be considered. As a matter of fact, we work with HR and LR maps from both domains. After preliminary investigations, we found that the estimated mapping function is very dependent on the geographical spatial coverage of each map’s pixel, and mapping HR maps using a CycleGAN trained with only LR emissions (or vice versa) leads to poor domain translation in value ranges and spatial patterns.
For this reason, we propose doubling the number of CycleGAN networks involved, considering one CycleGAN for each spatial resolution (HR and LR). This means we consider two CycleGAN frameworks:
  • The first CycleGAN is used at step 2 of our pipeline and converts LR data from the O to the S domain (Figure 4a). We refer to the mapping function derived from its generator as S LR : O LR S LR ;
  • The second CycleGAN is used at step 4 of our pipeline and converts HR data from the S to the O domain (Figure 4b). We refer to the mapping function derived from its generator as O HR : S HR O HR .
The two CycleGANs are trained by considering different loss components detailed in the next section. Instead, the SR operator N s remains fixed as it depends on simulated data only and can be trained separately from our DA pipeline. As motivated above, we are interested in finding a suitable DA strategy for the observed data without retraining a specific SR network on them.

4.3. Training Loss Components

Our proposed DA architecture relies on multiple losses, each addressing a specific aspect of our task. For clarity, in this section, we refer to the training pipeline reported in Figure 5.

4.3.1. CycleGAN (CG) Loss

Being inspired by the original CycleGAN architecture, the CG loss L CG is composed of four different terms: the GAN loss L GAN , the cycle loss L Cyc and the identity loss L Id :
L CG = L GAN + L Cyc + L Id .
Since we have two different CycleGAN architectures (working on LR and HR data), each term of our final CG loss is the sum of the contributions coming from the two CycleGANs. Refer to Appendix A for more details about the formal definition of these losses.
In particular, the operator O LR is part of the CycleGAN architecture used at step 2 of our proposed pipeline. As specified in Section 4.2, two GANs are needed for implementing a single CycleGAN framework. Therefore, even if from a practical viewpoint we need only the S LR and O HR operators for mapping LR data from the O to the S domain (step 2 of our proposed pipeline) and HR data from the S to the O domain (step 4 of our proposed pipeline), we must also include the O LR operator (mapping S to O ) and S HR (mapping O to S ) in loss implementation (for a more in-depth explanation, see Appendix A).

4.3.2. Feature Alignment (FeA) Loss

To enforce the map translation consistency between the two domains, inspired by [65], we design an additional loss component that acts directly on the latent representation of the emission maps [65,84]. With this loss, we aim to minimize the feature distribution shift between the O and S domains at different spatial resolutions. Specifically, the feature alignment (FeA) loss L FeA is designed to force feature alignment between the two domains before and after the SR process. This is achieved by minimizing the distance between features extracted from the discriminators associated with the GAN architectures included in our model. We refer to these elements as feature extractors in Figure 5 since the feature extraction is performed from the next-to-last layer of the discriminators. The color coding of these modules emphasizes the additional data diversity of the two CycleGANs (the one that deals with the LR and the other with HR emissions).
The FeA loss L FeA is composed of two main terms: L FeA 1 and L FeA 2 .
For the L FeA 1 , we extract four different features:
Ξ LR o = D LR o ( T LR s ) , Ξ LR o ˜ = D LR o ( T LR s ˜ ) , Ξ LR s = D LR s ( T LR s ) and Ξ LR s ˜ = D LR s ( T LR s ˜ ) ,
where D LR o and D LR s are the discriminators of the CycleGAN employed at step 2 of our pipeline, and depicted in Figure 4a. The L FeA 1 is defined as a balanced contribution of the mean squared errors (MSEs) between the features:
L FeA 1 = γ MSE ( Ξ LR o , Ξ LR o ˜ ) + ( 1 γ ) MSE ( Ξ LR s , Ξ LR s ˜ ) ,
where the scalar coefficient γ can be seen as a domain-related parameter, and it is responsible for balancing the error between discriminator networks associated with the O ( γ = 1 ) or S ( γ = 0 ) domain. In Figure 6, we show the entire pipeline for obtaining the L FeA 1 term. The L FeA 1 term constrains the features of domain-adapted satellite-observed maps T LR s ˜ to be similar to the original simulated maps T LR s . This operation makes the domain-adapted maps T LR s ˜ into a suitable input for the SR network N s , which works on the S domain.
Similarly, for the L FeA 2 we extract four different features:
Ξ HR o = D HR o ( T HR o ) , Ξ HR o ˜ = D HR o ( T ^ HR o ˜ ) , Ξ HR s = D HR s ( T HR o ) and Ξ HR s ˜ = D HR s ( T ^ HR o ˜ ) ,
where D HR o and D HR s belong to the CycleGAN used at step 4, and depicted in Figure 4b. The L FeA 2 is defined as
L FeA 2 = γ MSE ( Ξ HR o , Ξ HR o ˜ ) + ( 1 γ ) MSE ( Ξ HR s , Ξ HR s ˜ ) .
In Figure 7, we show the entire pipeline for obtaining the L FeA 2 term. Noticeably, the L FeA 2 term constrains the features of super-resolved and domain-adapted maps T ^ HR o ˜ to be similar to the original satellite-observed maps T HR o .
Given these two loss terms, we formulate the L FeA as
L FeA = δ L FeA 1 + ( 1 δ ) L FeA 2 ,
where the coefficient δ balances the contribution between L FeA 1 ( δ = 1 ) and L FeA 2 ( δ = 0 ), thus the feature alignment of the LR emissions (before SR, i.e., before step 3) and the HR emissions (after SR, i.e., after step 3), respectively.
By acting on the latent representation of the emissions, the L FeA loss can be seen as a sort of regularization term, working at the feature level instead of the emission map level [65]. It is important to consider that with L FeA 2 , we could, in principle, have a perfect feature alignment since T HR o is the ground-truth emission of T ^ HR o ˜ , and the loss is designed to minimize the distance between them in a feature space. This is also noticeable from Figure 7, where the T HR o and T ^ HR o ˜ emission maps are matching for what regards their reported geographical areas. Considering L FeA 1 instead, perfect feature alignment cannot be achieved since we do not have paired emissions at our disposal due to system design. As we can see from Figure 6, T LR s is not the ground-truth emission of T LR s ˜ . Indeed, T LR s ˜ is just the domain-adapted version of T LR o in the S domain, i.e., T LR s ˜ = S LR ( T LR o ) . Therefore, L FeA 1 only addresses a domain-shift reduction thanks to our imposed feature matching.

4.3.3. Emission Consistency (EmC) Loss

As a final loss term, we design the emission consistency (EmC) loss L EmC to ensure that localized emission values in the estimated super-resolved map ( T ^ HR o ˜ ) are consistent with the original low-resolution map ( T LR o ), helping to preserve the overall value distribution.
From both T LR o and T ^ HR o ˜ , we extract N x and N y squared patches in both the x and y dimensions (longitude and latitude), respectively. In the case of T LR o , patches have size P × P pixels, while for T ^ HR o ˜ patches have size Q × Q pixels, with Q = α P being a multiple of P according to the chosen SR factor α . In the patch extraction procedure, we take care that the pair of patches P LR i j and P HR i j , i [ 0 , N x 1 ] and j [ 0 , N y 1 ] , have the same spatial extent and cover the same geographical area. Considering a single emission map T LR o , we define as P ¯ LR i j the average value of a single patch P LR i j extracted from it:
P ¯ LR i j = 1 P 2 m = 0 P 1 n = 0 P 1 P LR i j ( m , n ) .
In the same way, we define as P ¯ HR i j the average value of a single patch P HR i j extracted from T ^ HR o ˜ :
P ¯ HR i j = 1 Q 2 m = 0 Q 1 n = 0 Q 1 P HR i j ( m , n ) .
The EmC loss L EmC can be formulated as follows:
L EmC = 1 N x · N y i = 0 N x 1 j = 0 N y 1 P ¯ LR i j P ¯ HR i j ,
where · represents the absolute value operator. Figure 8 draws a sketch of the elements and parameters involved in the EmC loss L EmC computation.
In our experiments, we prove that the EmC loss enforces the conservation of the average emission values when performing the SR task, minimizing the value discrepancies between T LR o and its super-resolved version T ^ HR o ˜ . This loss term becomes paramount, especially in scenarios with a mismatch in the spatial resolution of O and S data, as we later show in our analysis.

4.3.4. Full Training Objective

Finally, in our training phase, we aim at minimizing the following loss term, which can be written as
L Train = L CG + λ 1 L FeA + λ 2 L EmC ,
where λ 1 and λ 2 are loss weights, representing the contributions of the L FeA and L EmC terms, respectively.

4.4. Deployment Stage

After training the proposed architecture, we exploit it to estimate HR isoprene emissions I ^ HR o ˜ starting from their LR versions I LR o . The pipeline to be followed is explained in Section 4.2 and reported in Figure 3, apart from a last refinement phase.
In the initial phase, the LR I LR o map is transformed to obtain T LR o . This map is then mapped into the S domain and super-resolved, obtaining T ^ HR s ˜ . The super-resolved emission T ^ HR s ˜ is mapped back into the O domain, resulting in T ^ HR o ˜ . At this point, the T ^ HR o ˜ emission is back-transformed to its original value range, resulting in I ^ HR o ˜ .
As a final refinement step, we include a histogram-matching operation that enforces the emission distribution of I ^ HR o ˜ to follow that of I LR o . This operation helps reduce even more the potential mismatch in the dynamic range of the estimated emissions due to further errors that can be introduced in the entire SR process.

5. Emission Inventories

For our experiments, we use four different isoprene emissions inventories, two from the simulated domain S (BU approaches) and two from the observed domain O (TD approaches). In the following lines, we list all the considered datasets. Table 1 reports detailed information on them, while Figure 9 shows examples of emission maps from all the inventories.

5.1. Simulated Domain

Isoprene emissions from this domain are computed through simulations from Model of Emissions of Gases and Aerosols from Nature (MEGAN) [13,20], a well-known semi-empirical modeling framework extensively used in the atmospheric modeling community to simulate emissions of various BVOC species.
The BU-MEG-050 [23] inventory includes isoprene emissions computed using the MEGANv3.2 model, covering the entire Earth’s surface from 2001 to 2020, with a 0.50 ° × 0.50 ° spatial resolution. Emissions are reported as hourly profiles and are averaged monthly.
The BU-MEG-025 [22] inventory includes isoprene emissions computed using the MEGANv2.1 model, covering the entire Earth’s surface from 2000 to 2019, with a 0.25 ° × 0.25 ° spatial resolution. Emissions are reported as hourly profiles and are averaged monthly.

5.2. Observed Domain

Isoprene emissions from this domain are derived from atmospheric satellite observations. The approach is based on inverse modeling based on formaldehyde (HCHO) columns observed from space [14,24,30,86,87].
The TD-OMI-050 [27] inventory uses the Ozone Monitoring Instrument (OMI) on the NASA-Aura (EOS/Chem-1) satellite to measure HCHO vertical columns. The resulting isoprene emissions cover the entire Earth’s surface from 2005 to 2014, with a 0.50 ° × 0.50 ° spatial resolution. Emissions are reported as daily averaged profiles.
The TD-GOME2-050 [85] inventory uses the Global Ozone Monitoring Experiment-2 (GOME-2) on the EUMETSAT-MetOp-A satellite to measure HCHO vertical columns. The resulting isoprene emissions cover the entire Earth’s surface from 2007 to 2012, with the exact spatial resolution of the TD-OMI-050 inventory, i.e., 0.50 ° × 0.50 ° . Emissions are reported as daily averaged profiles.

5.3. Experimental Dataset

To make a meaningful performance comparison between all the inventories, we consider emissions from the same acquisition years, considering the temporal coverage of the shortest one, i.e., the TD-GOME2-050, thus from 2007 to 2012. Then, to perform realistic evaluations, we never include emission maps from the same temporal coverage in the training and testing stages. To achieve this, we split the emission patches considering the years 2007–2010 for training, 2011 for validation, and 2012 for testing.
The isoprene emission maps of BU-MEG-050, TD-OMI-050, and TD-GOME2-050 present a grid of 720 × 360 cells. The emission maps of BU-MEG-025 instead have the same geographical extent but comprise 1440 × 720 cells, since the spatial resolution is twice that of the other maps. We slice the emission maps for each dataset to obtain smaller maps of 32 × 32 cells, following the steps presented in our previous work [36]. These become our ground-truth HR emission maps. The total numbers of HR emission maps ( I HR s and I HR o ) are almost 209 K/52 K/51 K, 211 K/53 K/53 K, and 65 K/17 K/16 K for the BU-MEG-050, BU-MEG-025, and TD-OMI-050 train/validation/test sets, respectively. Note the reduced number of emission maps from the O domain with respect to the S one. The TD-GOME2-050 is used only in the test phase. Thus, we extract only its test partition, which has almost 16 K emission patches.
We generate LR emission maps ( I LR s and I LR o ) by performing bicubic downsampling, obtaining maps of 16 × 16 cells. Our goal is to estimate HR patches from the O domain ( I ^ HR o ˜ I HR o ) with 0.50 ° × 0.50 ° spatial resolution starting from their LR counterparts ( I LR o ) with 1.0 ° × 1.0 ° spatial resolution, thus with a scale factor α = 2 .

6. Experimental Setup

6.1. Training Setup

We can divide our proposed framework into two different parts that are trained separately: (i) the SR part and (ii) the DA part.
The SR part includes the N s network, that is trained using only emissions of the S domain by following the exact method proposed in [36]. In particular, we select the BU-MEG-050 dataset for the S domain to match the spatial resolution of the O dataset, i.e., 0.50 ° × 0.50 ° . In Section 8, we use the BU-MEG-025 dataset for S emissions, investigating the spatial resolution mismatch scenario between the two domains. We do not provide details on the SR training procedure, as this does not represent the main goal of our proposed work. We refer the interested reader to our original work for more information [36].
The DA part considers emissions from the S domain (BU-MEG-050 dataset) in conjunction with those from the O domain, thus from the TD-OMI-050 dataset (the TD-GOME2-050 is used only in evaluation phase). This part is trained in an adversarial way, and it is in charge of adapting (translating) emissions from the O domain into emissions that appear to be from S , and vice versa. It includes the two CycleGAN networks described in detail in Section 4.2.
Our DA framework is trained in an end-to-end fashion. For all the experiments, we set λ 1 = 1 and λ 2 = 10 ; these values have been manually selected after experimental validation. For the optimizer we use Adam, with β 1 = 0.9 , β 2 = 0.99 , and maximum and minimum learning rates of 10 3 and 10 7 , respectively. We divide the learning rate by 10 if the validation loss does not improve in 10 epochs until the minimum learning rate is reached. We stop the learning phase if the validation loss does not improve after 50 epochs.
Since the final objective for the generators and the discriminators of our four GANs are different, we use two optimizers (and their associated learning rate schedulers) for the generators and the discriminators. We experimentally verified that this leads to a more stable training process, which is visible from the different convergence rates of the two architectures.
We use a batch size of 128. Each batch element comprises four different emission maps, namely, ( I LR o , I LR s , I HR o , I HR s ). Since this is an unaligned image-to-image translation and the number of S maps is greater than that of O maps, we select 128 O emissions, and then, we randomly pick the same number of S elements to form the final batch. A simplified version of our training setup is shown in Figure 5.
In our experiments, we use an NVIDIA Titan RTX 24 GB GPU running on an Intel i9-9980XE CPU equipped with 128 GB of RAM. Please refer to the released implementation code for additional training and network details.

6.2. Evaluation Metrics

For a quantitative evaluation of the proposed method’s effectiveness, we conduct a comparison between the original emissions I HR o and their super-resolved versions I ^ HR o ˜ by adopting several metrics commonly used in the SR literature [34,88,89,90]. We evaluate our method in terms of structural similarity index measure (SSIM) [91], normalized mean squared error (NMSE) in dB, Universal Image Quality Index (UIQI) [92], spatial correlation coefficient (SCC) [93], and maximum absolute error (MaxAE). The best values for these metrics are 1 for SSIM, UIQI, and SCC. The lower the better for NMSE and MaxAE. Each metric highlights a specific peculiarity of the quality of the final super-resolved emission map. Refer to Appendix B for more details.

7. Experimental Results

This section reports the results using the dataset BU-MEG-050 for the simulated S domain and TD-OMI-050 emissions for the observed O domain. In this way, we consider emission maps with the exact spatial resolution for both domains, i.e., 0.50 ° × 0.50 ° . In Section 7.1, we investigate the role of the γ and δ coefficients of the proposed L FeA loss term. In Section 7.2, we perform an ablation study on the different loss terms of our full training objective L Train .

7.1. FeA Loss Coefficient Analysis

To evaluate the contribution of our proposed FeA loss L FeA , we perform an analysis by varying the γ and δ coefficients between 0 and 1. In Table 2, we report the results of this study, using as a final objective L Train = L CG + λ 1 L FeA . For the scope of this study, we omit the EmC loss L EmC from the full training objective L Train , as we only want to focus on the contribution of L FeA .
We can notice that considering δ = 0 in (11), thus forcing the feature alignment at the end of the pipeline, benefits the performance. Indeed, considering only L FeA 2 leads to good performance for all the tested γ values (0, 0.5, 1.0) compared to the other δ values (0.5, 1.0). In addition, using γ = 0.5 in (10), thus considering an equal balance between domains, leads to better results in all the considered metrics.
We conjecture that considering γ = 0.5 and δ = 0 helps to guide the generators S HR and O HR to generate more realistic emission maps of their respective target domain, at the same time, trying even more to fool their respective discriminators. Since γ is responsible for balancing the error of discriminator networks associated with the O ( γ = 1 ) and S ( γ = 0 ) domains (as explained in Section 4.3.2), training with γ = 0.5 implies giving equal importance to the discriminator’s performance of the two different domains. This balancing proves crucial in an adversarial framework such as ours. From now on, we consider this configuration for the following experiments.

7.2. Loss Term Contributions

To evaluate the effectiveness of individual components of our proposed loss, we perform an ablation study by comparing the outcomes of diverse setups. Therefore, we gradually incorporate different terms of the proposed loss L Train to determine their significance. We report all the results in Table 3, considering different loss terms in the final objective L Train . This study considers the best case for the L FeA , thus γ = 0.5 , δ = 0 .
We can see that considering only the L CG term (first row) or just combining L CG with the L EmC (third row) does not provide acceptable results for all the considered metrics. On the contrary, when L Train = L CG + λ 1 L FeA or L Train = L CG + λ 1 L FeA + λ 2 L EmC , we notice a substantial increase in performance. The results for both configurations are similar, so it is not easy to establish the best configuration by only looking at the mean values of each metric. However, considering the case in which all the loss terms are included (last row), we experience a boost in the performance for the metrics related to the structural part of the super-resolved emission. In particular, if we compare the second row with the last one, the latter exhibits a more relevant improvement in terms of SCC (from 0.585 to 0.642 ) and in SSIM (from 0.806 to 0.812 ) if compared to the slight decreases in the NMSE (from 10.763 to 10.744 ) and UIQI (from 0.721 to 0.716 ).
In addition, we have noticed in our experiments that SCC is more sensitive to upsampling artifacts if compared to the SSIM. Even in the presence of such artifacts, the SSIM values do not decrease considerably, while SCC leads to poor results. This different behavior could be attributed to the fact that the SSIM was originally proposed to assess the perceptual quality of an image [91]. For this reason, the SSIM may return acceptable results even if only luminance or contrast is preserved in the upsampling process. Contrarily, SCC is not influenced by perceptual factors, as it consists of mere spatial correlations between the compared emission maps. Since we do not want to consider perceptual factors in our analysis, we choose SCC over SSIM as a more representative metric for our final task.
Therefore, combining a feature-level loss ( L FeA ) with an emission-level one ( L EmC ) looks promising to provide good results in maintaining the structural properties and values of the emission. The L FeA is the key element for helping the adversarial framework in adapting the two domains. In addition, L EmC enforces the emission consistency between the LR and the super-resolved HR emission, allowing the system to preserve the original structure even more. In the following experiments, we use this last configuration, i.e., Table 3—last row, where all the proposed loss terms are considered in the final training objective of our system.

7.3. Comparative Studies

In this section, we compare our proposed methodology with two alternative strategies that do not rely on an adversarial framework, showing the superiority of the proposed method concerning the recovery of HR spatial details of isoprene emission maps.

7.3.1. Strategy I—Training the SR with O Emissions

In this first configuration, we use the O emissions to directly train from scratch an SR network operator N , as we have in our previous works [36,38]. Therefore, we train an SR network with several ( I LR o , I HR o ) emission pairs for estimating the HR emission I ^ HR o . In this case, we use the same approach presented in [36]: it relies on (4), but it considers only emissions and operators estimated from the O domain, i.e.,
I ^ HR o = T o 1 ( N o ( T o ( I LR o ) ) ) ,
where all the considered operators, i.e., T o 1 , N o , and T o , are trained using HR O emissions.
Though this strategy might seem the best choice for obtaining good SR results on the O domain, we later show that this is not the most suitable option in general. Our experiments reveal that this procedure leads to good average reconstructions but with excessive variability over the dataset. Indeed, the limited number of satellite-derived measurements compared to those from numerical simulations hinders creating a large and diverse dataset, necessary for robust DL training. We refer to this strategy as case I.

7.3.2. Strategy II—Straightforward Data Transformation

In this second configuration, similarly to our proposed approach, we still exploit the SR network operator N s trained on S emissions. However, we do not estimate the T o transformation from HR data of the O domain differently from all the previous strategies. In this case, we estimate the T o transformation using LR observed emissions. This strategy allows it to be completely independent of HR observed data, enabling it to work in zero-knowledge scenarios, i.e., when no HR observed data are available. Therefore, we obtain a super-resolved emission as
I ^ HR o = T o 1 ( N s ( T o ( I LR o ) ) ) ,
where the operators T o 1 and T o are estimated using LR O emissions; the N s instead is trained using S emissions, as in Section 7.3.1. Our experimental results also show that this straightforward strategy provides acceptable reconstructions on average, even though a few samples yield extremely poor results, indicating a partially stable solution. We refer to this strategy as case II.

7.3.3. Our Proposed Strategies

In addition to case I and case II, we consider three different strategies proposed by us in Section 7.2. In particular, we investigate the cases where L Train = L CG (case III, Table 3 first row), L Train = L CG + L FeA (case IV, Table 3 second row) and L Train = L CG + L FeA + L EmC (case V, Table 3 last row). We exclude from the analysis the case where L Train = L CG + L EmC (Table 3 third row), since the performance is similar to case III, thus it is not significant for the comparisons.

7.3.4. Discussion

We start our comparison by analyzing the distributions of the MaxAE and the SCC, reported in Figure 10 and Figure 11, respectively, and computed between the super-resolved emissions and their ground-truth, considering all the five cases.
By analyzing the MaxAE histogram in Figure 10, we can easily notice a multimodal behavior for case I (Section 7.3.1) and case II (Section 7.3.2), the cases that do not use our proposed DA strategy. Focusing on case I (blue bars), training from scratch an SR network leads to a consistent number of emission maps that exhibit a very high MaxAE. Regarding case II (purple bars), the straightforward data transformation leads to even worse performance. Indeed, in this case, we also have a non-negligible number of emission maps with a high MaxAE. For our configurations, i.e., cases III, IV, and V, we can notice that considering the complete full training objective L Train = L CG + λ 1 L FeA + λ 2 L EmC (case V, green bars) leads to a more compact histogram compared to the other cases, indicating the robustness of our proposed methodology.
We can also notice interesting trends considering the SCC histogram in Figure 11. We can see how case II (Section 7.3.2, purple bars) and case III (Section 7.3.3, orange bars) show unacceptable performance. Specifically, case II exhibits a unimodal histogram with an average SCC of 0.399 . For case III, we have an average SCC of 0.008 , indicating no correlation between the super-resolved emissions and the ground-truth emissions. For the other configurations, i.e., cases I, IV, and V, we can notice that, also for this metric, considering the complete full objective (case V, green bars) increases the stability of reconstructions, leading to higher quality SR.
For completeness, in Table 4 we report the average (avg) and the standard deviation (std) values of the metrics considered in this study.
To better investigate the robustness of our method, we consider the distributions of NMSE (Figure 12) and MaxAE (Figure 13) as separate histograms for all the considered cases. In Figure 12a, we consider the NMSE related to case I, i.e., when we directly train an SR network on O emissions. For this evaluation, we set a threshold of 5 dB (red vertical line), above which the associated SR emission maps are considered poorly super-resolved. This study aims to visualize how these poorly super-resolved emission maps are super-resolved in the other considered cases. In blue, we denote the metric distribution related to the considered case. In gray, we denote the elements showing an NMSE > 5 dB in case I (Figure 12a), i.e., the ones that are super-resolved poorly.
Starting from case II (Figure 12b), we notice that the mitigation is not so effective, resulting in a wide spread of poorly super-resolved emissions. For case III (Figure 12c), we observe a noticeable performance deterioration, indicating that the network is not learning the final task. Performance is noticeably different in cases IV (Figure 12d) and V (Figure 12e). For these two cases, we can see the effect of the additional loss terms on the final objective, i.e., the L FeA and the L FeA + L EmC for case IV and case V, respectively. Indeed, comparing these cases with case I, we notice a significant performance shift towards lower NMSEs. Furthermore, in case V, adding the L EmC in the final objective leads to more effective artifact mitigation, obtaining a more squeezed and narrow histogram over lower NMSEs.
In Figure 13, we report the same study considering the MaxAE. In this case, we set a threshold of 6 × 10 5 (red vertical line). We consider poorly super-resolved emissions those that in case I report a MaxAE > 6 × 10 5 (Figure 13a). Starting from case II (Figure 13b), we notice that the mitigation is ineffective, as previously seen for the NMSE. Some of the low-quality emissions are improved, although a substantial number of emissions are still poorly super-resolved. From case III to case V (Figure 13c–e) instead, we can notice a trend towards more accurate emission reconstruction, with fewer artifacts than the previous cases. In particular, case V exhibits a narrower error distribution, with a pronounced trend toward lower MaxAEs.
For a visual comparison, in Figure 14 we report six different super-resolved emission examples I ^ HR o ˜ for all the considered configurations, including their related I LR o (first column, input) and I HR o (last column, ground-truth) emissions. We can see how case V produces more accurate emissions if compared to all the other cases. It is worth noting that for case II, even if on average we achieve good performance for some metrics (see Table 4, second row), the actual super-resolved emissions exhibit a blurry pattern, symptomatic of poor generalization in performing the restoration process for emissions from a different domain.
These results again confirm the importance of each element of our proposed architecture and loss design regarding robustness in the final SR task.

8. Towards Generalization in Emission Super-Resolution

In this section, we address two desirable generalization properties for SR of isoprene emissions. In particular, in Section 8.1 we investigate the case of spatial resolution mismatch; in Section 8.2, we investigate the possibility of super-resolving emissions coming from previously-unseen datasets. For brevity, we only evaluate the best-performing configuration setup of our system, i.e., Table 4case V.

8.1. Spatial Resolution Generalization

The results obtained previously considered emission maps with the exact spatial resolution for both the S (BU-MEG-050) and the O (TD-OMI-050) domains, i.e., 0.50 ° × 0.50 ° . In this section, we investigate the possibility of performing the DA and the associated SR task using S emissions with a higher spatial resolution than the O emissions. This is motivated by the fact that using more spatially detailed isoprene emission maps might, in principle, increase the performance of the emission-to-emission translation in conjunction with a more accurate SR.
We compare two different scenarios, considering different spatial resolutions for the S emission maps:
  • Matched: This scenario is the one adopted in all the previous results, where we have a matching between the spatial resolution of S and O emissions, i.e., BU-MEG-050 ( 0.50 ° × 0.50 ° ) for S emissions and TD-OMI-050 ( 0.50 ° × 0.50 ° ) for the O emissions;
  • Unmatched: This scenario uses different spatial resolutions between the S and the O emissions, i.e., BU-MEG-025 ( 0.25 ° × 0.25 ° ) for S emissions and TD-OMI-050 ( 0.50 ° × 0.50 ° ) for the O emissions.
In our previous work [36], we investigated how to directly super-resolve isoprene emission maps with a different spatial resolution to the training one without performing DA on the data. In all our experiments, we obtained a significant drop in both SSIM and NMSE with respect to the matched scenario.
Table 5 shows the results of this comparative study. The first row is related to the Matched scenario (we report the results of Table 4, case V); the second row refers to the Unmatched scenario. It is worth noting that the Unmatched scenario, considering emission maps with a higher spatial resolution for S , benefits the final task. Indeed, we obtain a significant increase for almost all metrics, with a substantial gain for the NMSE and UIQI.
For a more in-depth comparison, in Figure 15 we report the NMSE, SCC, and UIQI value distributions using violin plots. We observe that the results obtained using the BU-MEG-025 inventory as the S domain (Unmatched scenario) are more compact than those obtained adopting the BU-MEG-050 inventory (Matched scenario), as indicated by its narrower width and shorter range. This behavior is more pronounced for the SCC and UIQI metrics.
In addition, Figure 16 reports six isoprene emission map examples (first column), their related super-resolved versions from the two scenarios (second and third columns), and their relative squared absolute error maps (fourth and fifth columns). From the error maps on the right, we can appreciate a superiority in the SR task for the Unmatched scenario (BU-MEG-025), obtaining error patterns with less spatial variability. This is also visible when comparing the second (Matched—BU-MEG-050) and the third (Unmatched—BU-MEG-025) columns of Figure 16 with the ground-truth emissions I HR o in the first column.
These results highlight the ability of our model to not only adapt the two domains ( S and O ) but also the spatial resolution adopted by the SR network. Our system learns these two mappings simultaneously, benefiting from the improved SR network quality trained on spatially denser and more accurate emissions. This is relevant since, in principle, one could use an SR network trained on very different spatial resolutions by just adapting the input.

8.2. Emission Inventory Generalization

In the previous sections, we tested our system in super-resolving O maps from the TD-OMI-050 inventory. Notably, even if not from the same years of the testing set, data from the TD-OMI-050 dataset were part of our training set. In this section, we test the generalization capabilities of our system in super-resolving unseen data, i.e., data never used in training. We select emission maps from the TD-GOME2-050 inventory for this.
Table 6 compares the SR performance on the TD-GOME2-050 inventory considering both the Matched and Unmatched configurations (see Section 8.1). As in the previous generalization study, also in this case, we notice higher performance when dealing with emissions with a higher spatial resolution, i.e., Unmatched scenario, BU-MEG-025. Moreover, comparing these results with Table 5, where we show those achieved on TD-OMI-050, we do not observe a substantial drop in performance, even though this dataset was not used in the training phase. This result is promising, since our method could be a valid solution to generalize the SR task of isoprene emissions derived from different satellite onboard equipment.

9. Conclusions and Future Research

This work presents a novel approach to enhance the spatial resolution of isoprene emission maps estimated from satellite observations. For this, we exploit the fine-grained information of emission maps derived from numerical simulations, which are typically easier to retrieve and more detailed than satellite-derived acquisitions.
To accomplish this task, we develop a framework that integrates super-resolution (SR) and domain adaptation (DA) techniques to enhance low-resolution satellite-derived emission maps, i.e., from the observed domain, by exploiting the knowledge of numerically simulated emissions, i.e., from the simulated domain. By leveraging an adversarial framework based on cycle generative adversarial network (CycleGAN), we propose a reliable and robust emission-to-emission translation system, adapting satellite data to the simulated domain at low and high resolutions.
Even in a significant domain-shift scenario, the proposed method is a valid solution for capturing fine-scale emission patterns and super-resolving emissions. From an extensive experimental analysis, we find that, by acting at feature and emission levels, we enforce the DA and the SR process, effectively reducing the domain’s distribution shift and minimizing the emission discrepancy between emission maps with different spatial resolutions.
Generalization studies also find that using simulated emissions with a higher spatial resolution than the one that has to be super-resolved benefits the performance of the final SR task. In this case, our proposed method is not only able to adapt the two domains (simulated and observed) but also their different spatial resolutions. Moreover, our method proves robust even on unseen testing data.
The benefits of this research are manifold. By providing more accurate and detailed isoprene emission maps, our study contributes to a better understanding of atmospheric processes and climate dynamics driven by biogenic volatile organic compound (BVOC) emissions. For instance, increasing the spatial resolution of isoprene emissions would allow for more accurate identification of emission hotspots at a local scale. This would help refining the knowledge of how biogenic emissions contribute to ozone and secondary organic aerosol formation. In addition, providing spatially detailed biogenic maps could also enhance climate models by providing better data on BVOC-driven feedback processes like cloud formation and radiative forcing, influencing regional climate predictions. Policymakers could use these refined data to create localized emission regulations and better land-use planning, such as regulating high-BVOC-emitting plants in urban spaces.
Given the promising results, future research will explore the application of this framework to other types of BVOCs species, pollutants, and, in general, other Remote sensing (RS) products. This would test the versatility and robustness of the method across different environmental datasets and further enhance our understanding of various atmospheric processes. Additionally, further refinement of the DA techniques could improve the robustness and accuracy of the proposed framework, making it applicable to a broader range of environmental monitoring and assessment tasks.

Author Contributions

Conceptualization, A.G., S.M. and P.B.; methodology, A.G., S.M. and P.B.; software, A.G.; validation, A.G. and S.M.; formal analysis, A.G. and S.M.; investigation, A.G. and S.M.; resources, A.G.; data curation, A.G.; writing—original draft preparation, A.G. and S.M.; writing—review and editing, A.G., S.M. and P.B.; visualization, A.G.; supervision, S.M., P.B. and S.T.; project administration, S.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The implementation code of the presented methodology is available at https://github.com/polimi-ispl/sr-da-bvoc, accessed on 11 September 2024. The TD-OMI-050 and TD-GOME2-050 emission inventories are available at https://emissions.aeronomie.be/, accessed on 11 September 2024. The BU-MEG-025 emission inventory is available at https://permalink.aeris-data.fr/CAMS-GLOB-BIO, accessed on 11 September 2024. The BU-MEG-050 emission inventory is available at https://doi.org/10.57760/sciencedb.iap.00008, accessed on 11 September 2024. No restrictions apply to the availability of these data.

Acknowledgments

This work was supported by the Italian Ministry of University and Research (MUR) and the European Union (EU) under the PON/REACT project.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. CG Loss Formulation

The mathematical formulation of the adversarial term L CG of our final training objective L Train relies on the original objective of the CycleGAN framework [41] (see Section 3). However, as explained in Section 4.2, we have to consider two different CycleGAN frameworks, i.e., one for the low resolution (LR) and the other for the hr emission maps, as shown in Figure 4. The two CycleGANs are trained together, end-to-end. In the following sections, we formulate all the loss terms related to the three components of our CG Loss L CG , i.e., the L GAN , L Cyc , and L Id terms.
GAN loss. Since we have two CycleGAN frameworks, we have four different GAN losses, one for each generator-discriminator pair. The complete L GAN term is the sum of these four components, i.e.,
L GAN = L GAN s LR + L GAN o LR + L GAN s HR + L GAN o HR .
The L GAN s LR term is related to the mapping function S LR : O LR S LR , used at step 2 of our pipeline, i.e., Figure 4a, and is defined as
L GAN s LR ( S LR , D LR s , O , S ) = E s p ( s ) [ log D LR s ( s ) ] + E o p ( o ) [ log ( 1 D LR s ( S LR ( o ) ) ) ] .
The same CycleGAN also exploits the L GAN o LR term, that is related the inverse mapping function O LR : S LR O LR , and is defined as
L GAN o LR ( O LR , D LR o , S , O ) = E o p ( o ) [ log D LR o ( o ) ] + E s p ( s ) [ log ( 1 D LR o ( O LR ( s ) ) ) ] .
Considering the second CycleGAN, the L GAN s HR term is related the mapping function S HR : O HR S HR , and is defined as
L GAN s HR ( S HR , D HR s , O , S ) = E s p ( s ) [ log D HR s ( s ) ] + E o p ( o ) [ log ( 1 D HR s ( S HR ( o ) ) ) ] .
The same CycleGAN also exploits the L GAN o HR term, which is related to the inverse mapping function O HR : S HR O HR , used at step 4 of our pipeline, i.e., Figure 4b, and is defined as
L GAN o HR ( O HR , D HR o , S , O ) = E o p ( o ) [ log D HR o ( o ) ] + E s p ( s ˜ ) [ log ( 1 D HR o ( O HR ( s ) ) ) ] ,
where s ˜ denotes the distribution over the estimated S domain.
Cycle consistency loss. We have two different cycle consistency losses in our system; the complete L Cyc term is the sum of these two, i.e.,
L Cyc = L Cyc LR + L Cyc HR ,
where
L Cyc LR ( S LR , O LR ) = E o p ( o ) [ O LR ( S LR ( o ) ) o 1 ] + E s p ( s ) [ S LR ( O LR ( s ) ) s 1 ] ,
and
L Cyc HR ( S HR , O HR ) = E o p ( o ) [ O HR ( S HR ( o ) ) o 1 ] + E s p ( s ˜ ) [ S HR ( O HR ( s ) ) s 1 ] ,
where s ˜ denotes the distribution over the estimated S domain.
Identity loss. Also, in this case, we have two different identity losses in our system; the complete L Id term is the sum of these two, i.e.,
L Id = L Id LR + L Id HR
where
L Id LR ( S LR , O LR ) = E s p ( s ) [ S LR ( s ) s 1 ] + E o p ( o ) [ O LR ( o ) o 1 ] ,
and
L Id HR ( S HR , O HR ) = E s p ( s ˜ ) [ S HR ( s ) s 1 ] + E o p ( o ) [ O HR ( o ) o 1 ] ,
where s ˜ denotes the distribution over the estimated S domain.

Appendix B. Evaluation Metrics

The structural similarity index measure (SSIM) [91] evaluates the similarity between two images by measuring the perceptual difference between a ground-truth image I HR and its super-resolved version I ^ HR based on their intensity, contrast, and structural information. The similarity score is between −1 and 1, with 1 indicating perfect similarity, 0 indicating no similarity, and −1 indicating perfect anti-correlation or dissimilarity.
The normalized mean squared error (NMSE) is the ratio of the power of the error introduced by the super-resolution process to the power of the original image and is defined as
NMSE ( I ^ HR , I HR ) = 10 log 10 MSE ( I ^ HR , I HR ) Avg ( I HR 2 )
where Avg extracts the mean value of the image. Lower NMSE, in dB, indicates a better super-resolution quality.
The Universal Image Quality Index (UIQI) [92] measures the quality of images by comparing the super-resolved image I ^ HR with the ground-truth I HR . It evaluates three components: luminance, contrast, and structural information. The index ranges from −1 to 1, where 1 indicates perfect similarity, 0 indicates no similarity, and −1 indicates perfect negative similarity.
The spatial correlation coefficient (SCC) [93] measures how closely the super-resolved image I ^ HR matches the spatial patterns of the ground-truth I HR . It ranges from −1 to 1, where 1 indicates perfect correlation, 0 indicates no correlation, and −1 indicates perfect anti-correlation or dissimilarity. It is computed by applying a high-pass filter to both the predicted and target images, calculating local variances and covariances, and then, analyzing the correlation of these features.
The maximum absolute error (MaxAE) quantifies the worst-case discrepancy between the super-resolved image I ^ HR and ground-truth I HR by identifying the maximum absolute difference at any pixel location of a 2D image, and it is defined as
MaxAE ( I ^ HR , I HR ) = max m , n I ^ HR ( m , n ) I HR ( m , n ) ,
where m and n denote the pixel coordinates of the considered image. Therefore, MaxAE measures the largest error in the super-resolved image I ^ HR compared to the ground-truth I HR .

References

  1. Ciccioli, P.; Silibello, C.; Finardi, S.; Pepe, N.; Ciccioli, P.; Rapparini, F.; Neri, L.; Fares, S.; Brilli, F.; Mircea, M.; et al. The potential impact of biogenic volatile organic compounds (BVOCs) from terrestrial vegetation on a Mediterranean area using two different emission models. Agric. For. Meteorol. 2023, 328, 109255. [Google Scholar] [CrossRef]
  2. McGenity, T.J.; Crombie, A.T.; Murrell, J.C. Microbial cycling of isoprene, the most abundantly produced biological volatile organic compound on Earth. ISME J. 2018, 12, 931–941. [Google Scholar] [CrossRef]
  3. Wang, B.; Li, Z.; Liu, Z.; Sun, Y.; Wang, C.; Xiao, Y.; Lu, X.; Yan, G.; Xu, C. Characteristics, Secondary Transformation Potential and Health Risks of Atmospheric Volatile Organic Compounds in an Industrial Area in Zibo, East China. Atmosphere 2023, 14, 158. [Google Scholar] [CrossRef]
  4. DiMaria, C.A.; Jones, D.B.A.; Worden, H.; Bloom, A.A.; Bowman, K.; Stavrakou, T.; Miyazaki, K.; Worden, J.; Guenther, A.; Sarkar, C.; et al. Optimizing the Isoprene Emission Model MEGAN With Satellite and Ground-Based Observational Constraints. J. Geophys. Res. Atmos. 2023, 128, e2022JD037822. [Google Scholar] [CrossRef]
  5. Zhang, S.; Lyu, Y.; Yang, X.; Yuan, L.; Wang, Y.; Wang, L.; Liang, Y.; Qiao, Y.; Wang, S. Modeling Biogenic Volatile Organic Compounds Emissions and Subsequent Impacts on Ozone Air Quality in the Sichuan Basin, Southwestern China. Front. Ecol. Evol. 2022, 10, 924944. [Google Scholar] [CrossRef]
  6. Hewitt, C.N.; Langford, B.; Possell, M.; Karl, T.; Owen, S.M. Quantification of VOC emission rates from the biosphere. TrAC Trends Anal. Chem. 2011, 30, 937–944. [Google Scholar] [CrossRef]
  7. Weber, J.; Archer-Nicholls, S.; Abraham, N.L.; Shin, Y.M.; Griffiths, P.; Grosvenor, D.P.; Scott, C.E.; Archibald, A.T. Chemistry-driven changes strongly influence climate forcing from vegetation emissions. Nat. Commun. 2022, 13, 7202. [Google Scholar] [CrossRef]
  8. Wang, H.; Welch, A.M.; Nagalingam, S.; Leong, C.; Czimczik, C.I.; Tang, J.; Seco, R.; Rinnan, R.; Vettikkat, L.; Schobesberger, S.; et al. High temperature sensitivity of Arctic isoprene emissions explained by sedges. Nat. Commun. 2024, 15, 6144. [Google Scholar] [CrossRef]
  9. Guenther, A.; Hewitt, C.N.; Erickson, D.; Fall, R.; Geron, C.; Graedel, T.; Harley, P.; Klinger, L.; Lerdau, M.; Mckay, W.A.; et al. A global model of natural volatile organic compound emissions. J. Geophys. Res. 1995, 100, 8873–8892. [Google Scholar] [CrossRef]
  10. Mircea, M.; Borge, R.; Finardi, S.; Briganti, G.; Russo, F.; de la Paz, D.; D’Isidoro, M.; Cremona, G.; Villani, M.G.; Cappelletti, A.; et al. The Role of Vegetation on Urban Atmosphere of Three European Cities. Part 2: Evaluation of Vegetation Impact on Air Pollutant Concentrations and Depositions. Forests 2023, 14, 1255. [Google Scholar] [CrossRef]
  11. Silibello, C.; Finardi, S.; Pepe, N.; Baraldi, R.; Ciccioli, P.; Mircea, M.; Ciccioli, P. Modelling of Biogenic Volatile Organic Compounds Emissions Using a Detailed Vegetation Inventory Over a Southern Italy Region. In Air Pollution Modeling and Its Application XXVIII; Springer: Cham, Switzerland, 2022; pp. 279–285. [Google Scholar]
  12. Ashworth, K.; Boissard, C.; Folberth, G.; Lathière, J.; Schurgers, G. Global Modelling of Volatile Organic Compound Emissions; Springer: Cham, Switzerland, 2013; pp. 451–487. [Google Scholar]
  13. Guenther, A.; Jiang, X.; Shah, T.; Huang, L.; Kemball-Cook, S.; Yarwood, G. Model of Emissions of Gases and Aerosol from Nature Version 3 (MEGAN3) for Estimating Biogenic Emissions. In Air Pollution Modeling and Its Application XXVI; Springer: Cham, Switzerland, 2020; pp. 187–192. [Google Scholar]
  14. Müller, J.F.; Stavrakou, T.; Peeters, J. Chemistry and deposition in the Model of Atmospheric composition at Global and Regional scales using Inversion Techniques for Trace gas Emissions (MAGRITTE v1.1)—Part 1: Chemical mechanism. Geosci. Model Dev. 2019, 12, 2307–2356. [Google Scholar] [CrossRef]
  15. Calfapietra, C.; Fares, S.; Manes, F.; Morani, A.; Sgrigna, G.; Loreto, F. Role of Biogenic Volatile Organic Compounds (BVOC) emitted by urban trees on ozone concentration in cities: A review. Environ. Pollut. 2013, 183, 71–80. [Google Scholar] [CrossRef]
  16. Peñuelas, J.; Staudt, M. BVOCs and global change. Trends Plant Sci. 2010, 15, 133–144. [Google Scholar] [CrossRef]
  17. Laothawornkitkul, J.; Taylor, J.E.; Paul, N.D.; Hewitt, C.N. Biogenic volatile organic compounds in the Earth system. New Phytol. 2009, 183, 27–51. [Google Scholar] [CrossRef] [PubMed]
  18. Cai, M.; An, C.; Guy, C. A scientometric analysis and review of biogenic volatile organic compound emissions: Research hotspots, new frontiers, and environmental implications. Renew. Sustain. Energy Rev. 2021, 149, 111317. [Google Scholar] [CrossRef]
  19. Tani, A.; Mochizuki, T. Review: Exchanges of volatile organic compounds between terrestrial ecosystems and the atmosphere. J. Agric. Meteorol. 2021, 77, 66–80. [Google Scholar] [CrossRef]
  20. Guenther, A.B.; Jiang, X.; Heald, C.L.; Sakulyanontvittaya, T.; Duhl, T.; Emmons, L.K.; Wang, X. The Model of Emissions of Gases and Aerosols from Nature version 2.1 (MEGAN2.1): An extended and updated framework for modeling biogenic emissions. Geosci. Model Dev. 2012, 5, 1471–1492. [Google Scholar] [CrossRef]
  21. Wang, H.; Wu, Q.; Guenther, A.B.; Yang, X.; Wang, L.; Xiao, T.; Li, J.; Feng, J.; Xu, Q.; Cheng, H. A long-term estimation of biogenic volatile organic compound (BVOC) emission in China from 2001–2016: The roles of land cover change and climate variability. Atmos. Chem. Phys. 2021, 21, 4825–4848. [Google Scholar] [CrossRef]
  22. Sindelarova, K.; Markova, J.; Simpson, D.; Huszar, P.; Karlicky, J.; Darras, S.; Granier, C. High-resolution biogenic global emission inventory for the time period 2000–2019 for air quality modelling. Earth Syst. Sci. Data 2022, 14, 251–270. [Google Scholar] [CrossRef]
  23. Wang, H.; Liu, X.; Wu, C.; Lin, G. Regional to global distributions, trends, and drivers of biogenic volatile organic compound emission from 2001 to 2020. Atmos. Chem. Phys. 2024, 24, 3309–3328. [Google Scholar] [CrossRef]
  24. Opacka, B.; Müller, J.F.; Stavrakou, T.; Bauwens, M.; Sindelarova, K.; Markova, J.; Guenther, A.B. Global and regional impacts of land cover changes on isoprene emissions derived from spaceborne data and the MEGAN model. Atmos. Chem. Phys. 2021, 21, 8413–8436. [Google Scholar] [CrossRef]
  25. De Smedt, I.; Van Roozendael, M.; Stavrakou, T.; Müller, J.F.; Lerot, C.; Theys, N.; Valks, P.; Hao, N.; van der A, R. Improved retrieval of global tropospheric formaldehyde columns from GOME-2/MetOp-A addressing noise reduction and instrumental degradation issues. Atmos. Meas. Tech. 2012, 5, 2933–2949. [Google Scholar] [CrossRef]
  26. Bauwens, M.; Stavrakou, T.; Müller, J.F.; De Smedt, I.; Van Roozendael, M.; van der Werf, G.R.; Wiedinmyer, C.; Kaiser, J.W.; Sindelarova, K.; Guenther, A. Nine years of global hydrocarbon emissions based on source inversion of OMI formaldehyde observations. Atmos. Chem. Phys. 2016, 16, 10133–10158. [Google Scholar] [CrossRef]
  27. Stavrakou, T.; Müller, J.F.; Bauwens, M.; De Smedt, I.; Van Roozendael, M.; De Mazière, M.; Vigouroux, C.; Hendrick, F.; George, M.; Clerbaux, C.; et al. How consistent are top-down hydrocarbon emissions based on formaldehyde observations from GOME-2 and OMI? Atmos. Chem. Phys. 2015, 15, 11861–11884. [Google Scholar] [CrossRef]
  28. Fu, D.; Millet, D.B.; Wells, K.C.; Payne, V.H.; Yu, S.; Guenther, A.; Eldering, A. Direct retrieval of isoprene from satellite-based infrared measurements. Nat. Commun. 2019, 10, 3811. [Google Scholar] [CrossRef] [PubMed]
  29. Oomen, G.M.; Müller, J.F.; Stavrakou, T.; De Smedt, I.; Blumenstock, T.; Kivi, R.; Makarova, M.; Palm, M.; Röhling, A.; Té, Y.; et al. Weekly derived top-down volatile-organic-compound fluxes over Europe from TROPOMI HCHO data from 2018 to 2021. Atmos. Chem. Phys. 2024, 24, 449–474. [Google Scholar] [CrossRef]
  30. Wolfe, G.M.; Kaiser, J.; Hanisco, T.F.; Keutsch, F.N.; de Gouw, J.A.; Gilman, J.B.; Graus, M.; Hatch, C.D.; Holloway, J.; Horowitz, L.W.; et al. Formaldehyde production from isoprene oxidation across NOx regimes. Atmos. Chem. Phys. 2016, 16, 2597–2610. [Google Scholar] [CrossRef]
  31. Kaiser, J.; Jacob, D.J.; Zhu, L.; Travis, K.R.; Fisher, J.A.; González Abad, G.; Zhang, L.; Zhang, X.; Fried, A.; Crounse, J.D.; et al. High-resolution inversion of OMI formaldehyde columns to quantify isoprene emission on ecosystem-relevant scales: Application to the southeast US. Atmos. Chem. Phys. 2018, 18, 5483–5497. [Google Scholar] [CrossRef]
  32. Rampal, N.; Hobeichi, S.; Gibson, P.B.; Baño-Medina, J.; Abramowitz, G.; Beucler, T.; González-Abad, J.; Chapman, W.; Harder, P.; Gutiérrez, J.M. Enhancing Regional Climate Downscaling through Advances in Machine Learning. Artif. Intell. Earth Syst. 2024, 3, 230066. [Google Scholar] [CrossRef]
  33. Weiss, K.; Khoshgoftaar, T.M.; Wang, D. A survey of transfer learning. J. Big Data 2016, 3, 9. [Google Scholar] [CrossRef]
  34. Sdraka, M.; Papoutsis, I.; Psomas, B.; Vlachos, K.; Ioannidis, K.; Karantzalos, K.; Gialampoukidis, I.; Vrochidis, S. Deep Learning for Downscaling Remote Sensing Images: Fusion and super-resolution. IEEE Geosci. Remote Sens. Mag. 2022, 10, 202–255. [Google Scholar] [CrossRef]
  35. Chen, C.H. (Ed.) Signal and Image Processing for Remote Sensing, 3rd ed.; Signal and Image Processing of Earth Observations; Routledge: London, UK, 2024. [Google Scholar]
  36. Giganti, A.; Mandelli, S.; Bestagini, P.; Marcon, M.; Tubaro, S. Super-Resolution of BVOC Maps by Adapting Deep Learning Methods. In Proceedings of the IEEE International Conference on Image Processing (ICIP), Kuala Lumpur, Malaysia, 8–11 October 2023; pp. 1650–1654. [Google Scholar]
  37. Giganti, A.; Mandelli, S.; Bestagini, P.; Marcon, M.; Tubaro, S. Multi-BVOC Super-Resolution Exploiting Compounds Inter-Connection. In Proceedings of the European Signal Processing Conference (EUSIPCO), Helsinki, Finland, 4–8 September 2023; pp. 1315–1319. [Google Scholar]
  38. Giganti, A.; Mandelli, S.; Bestagini, P.; Marcon, M.; Tubaro, S. Super-Resolution of Bvoc Emission Maps Via Domain Adaptation. In Proceedings of the IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Athens, Greece, 7–12 July 2024; pp. 738–741. [Google Scholar]
  39. Ben-David, S.; Blitzer, J.; Crammer, K.; Kulesza, A.; Pereira, F.; Vaughan, J.W. A theory of learning from different domains. Mach. Learn. 2010, 79, 151–175. [Google Scholar] [CrossRef]
  40. Xu, M.; Wu, M.; Chen, K.; Zhang, C.; Guo, J. The Eyes of the Gods: A Survey of Unsupervised Domain Adaptation Methods Based on Remote Sensing Data. Remote Sens. 2022, 14, 4380. [Google Scholar] [CrossRef]
  41. Zhu, J.Y.; Park, T.; Isola, P.; Efros, A.A. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 2242–2251. [Google Scholar]
  42. Crippa, M.; Guizzardi, D.; Pagani, F.; Schiavina, M.; Melchiorri, M.; Pisoni, E.; Graziosi, F.; Muntean, M.; Maes, J.; Dijkstra, L.; et al. Insights into the spatial distribution of global, national, and subnational greenhouse gas emissions in the Emissions Database for Global Atmospheric Research (EDGAR v8.0). Earth Syst. Sci. Data 2024, 16, 2811–2830. [Google Scholar] [CrossRef]
  43. Bauwens, M.; Stavrakou, T.; Müller, J.F.; Van Schaeybroeck, B.; De Cruz, L.; De Troch, R.; Giot, O.; Hamdi, R.; Termonia, P.; Laffineur, Q.; et al. Recent past (1979–2014) and future (2070–2099) isoprene fluxes over Europe simulated with the MEGAN–MOHYCAN model. Biogeosciences 2018, 15, 3673–3690. [Google Scholar] [CrossRef]
  44. Geiss, A.; Silva, S.J.; Hardin, J.C. Downscaling atmospheric chemistry simulations with physically consistent deep learning. Geosci. Model Dev. 2022, 15, 6677–6694. [Google Scholar] [CrossRef]
  45. Siddique, M.A.; Naseer, E.; Usama, M.; Basit, A. Estimation of Surface-Level NO2 Using Satellite Remote Sensing and Machine Learning: A review. IEEE Geosci. Remote Sens. Mag. 2024, 12, 8–34. [Google Scholar] [CrossRef]
  46. Sokhi, R.S.; Moussiopoulos, N.; Baklanov, A.; Bartzis, J.; Coll, I.; Finardi, S.; Friedrich, R.; Geels, C.; Grönholm, T.; Halenka, T.; et al. Advances in air quality research – current and emerging challenges. Atmos. Chem. Phys. 2022, 22, 4615–4703. [Google Scholar] [CrossRef]
  47. Baño Medina, J.; Manzanas, R.; Gutiérrez, J.M. Configuration and intercomparison of deep learning neural models for statistical downscaling. Geosci. Model Dev. 2020, 13, 2109–2124. [Google Scholar] [CrossRef]
  48. Sha, Y.; II, D.J.G.; West, G.; Stull, R. Deep-Learning-Based Gridded Downscaling of Surface Meteorological Variables in Complex Terrain. Part II: Daily Precipitation. J. Appl. Meteorol. Climatol. 2020, 59, 2075–2092. [Google Scholar] [CrossRef]
  49. Chiang, C.H.; Huang, Z.H.; Liu, L.; Liang, H.C.; Wang, Y.C.; Tseng, W.L.; Wang, C.; Chen, C.T.; Wang, K.C. Climate Downscaling: A Deep-Learning Based Super-resolution Model of Precipitation Data with Attention Block and Skip Connections. arXiv 2024, arXiv:2403.17847. [Google Scholar]
  50. Vandal, T.; Kodra, E.; Ganguly, S.; Michaelis, A.; Nemani, R.; Ganguly, A.R. DeepSD: Generating High Resolution Climate Change Projections through Single Image Super-Resolution. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Halifax, NS, Canada, 13–17 August 2017; pp. 1663–1672. [Google Scholar]
  51. Passarella, L.S.; Mahajan, S.; Pal, A.; Norman, M.R. Reconstructing High Resolution ESM Data Through a Novel Fast Super Resolution Convolutional Neural Network (FSRCNN). Geophys. Res. Lett. 2022, 49, e2021GL097571. [Google Scholar] [CrossRef]
  52. Wang, J.; Liu, Z.; Foster, I.; Chang, W.; Kettimuthu, R.; Kotamarthi, V.R. Fast and accurate learned multiresolution dynamical downscaling for precipitation. Geosci. Model Dev. 2021, 14, 6355–6372. [Google Scholar] [CrossRef]
  53. Brecht, R.; Bakels, L.; Bihlo, A.; Stohl, A. Improving trajectory calculations by FLEXPART 10.4+ using single-image super-resolution. Geosci. Model Dev. 2023, 16, 2181–2192. [Google Scholar] [CrossRef]
  54. Lloyd, D.T.; Abela, A.; Farrugia, R.A.; Galea, A.; Valentino, G. Optically Enhanced Super-Resolution of Sea Surface Temperature Using Deep Learning. IEEE Trans. Geosci. Remote Sens. (TGRS) 2022, 60, 1–14. [Google Scholar] [CrossRef]
  55. Park, S.; Singh, K.; Nellikkattil, A.; Zeller, E.; Mai, T.; Cha, M. Downscaling Earth System Models with Deep Learning. In Proceedings of the ACM SIGKDD Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, 14–18 August 2022; pp. 3733–3742. [Google Scholar]
  56. Tian, T.; Cheng, L.; Wang, G.; Abraham, J.; Wei, W.; Ren, S.; Zhu, J.; Song, J.; Leng, H. Reconstructing ocean subsurface salinity at high resolution using a machine learning approach. Earth Syst. Sci. Data 2022, 14, 5037–5060. [Google Scholar] [CrossRef]
  57. Nguyen, B.M.; Tian, G.; Vo, M.T.; Michel, A.; Corpetti, T.; Granero-Belinchon, C. Convolutional Neural Network Modelling for MODIS Land Surface Temperature Super-Resolution. In Proceedings of the European Signal Processing Conference (EUSIPCO), Belgrade, Serbia, 29 August–2 September 2022; pp. 1806–1810. [Google Scholar]
  58. Quesada-Chacón, D.; Baño-Medina, J.; Barfus, K.; Bernhofer, C. Downscaling CORDEX Through Deep Learning to Daily 1 km Multivariate Ensemble in Complex Terrain. Earth’s Future 2023, 11, e2023EF003531. [Google Scholar] [CrossRef]
  59. Yu, M.; Liu, Q. Deep learning-based downscaling of tropospheric nitrogen dioxide using ground-level and satellite observations. Sci. Total Environ. 2021, 773, 145145. [Google Scholar] [CrossRef]
  60. Li, L.; Wang, J.; Franklin, M.; Yin, Q.; Wu, J.; Camps-Valls, G.; Zhu, Z.; Wang, C.; Ge, Y.; Reichstein, M. Improving air quality assessment using physics-inspired deep graph learning. Npj Clim. Atmos. Sci. 2023, 6, 152. [Google Scholar] [CrossRef]
  61. Oyama, N.; Ishizaki, N.N.; Koide, S.; Yoshida, H. Deep generative model super-resolves spatially correlated multiregional climate data. Sci. Rep. 2023, 13, 5992. [Google Scholar] [CrossRef]
  62. Mardani, M.; Brenowitz, N.; Cohen, Y.; Pathak, J.; Chen, C.Y.; Liu, C.C.; Vahdat, A.; Nabian, M.A.; Ge, T.; Subramaniam, A.; et al. Residual Corrective Diffusion Modeling for Km-scale Atmospheric Downscaling. arXiv 2024, arXiv:2309.15214. [Google Scholar]
  63. Peng, J.; Huang, Y.; Sun, W.; Chen, N.; Ning, Y.; Du, Q. Domain Adaptation in Remote Sensing Image Classification: A Survey. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. (JSTAR) 2022, 15, 9842–9859. [Google Scholar] [CrossRef]
  64. Guan, H.; Liu, M. Domain Adaptation for Medical Image Analysis: A Survey. IEEE Trans. Biomed. Eng. 2022, 69, 1173–1185. [Google Scholar] [CrossRef] [PubMed]
  65. Wang, W.; Zhang, H.; Yuan, Z.; Wang, C. Unsupervised Real-World Super-Resolution: A Domain Adaptation Perspective. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, BC, Canada; 2021; pp. 4298–4307. [Google Scholar]
  66. Soh, J.W.; Cho, S.; Cho, N.I. Meta-Transfer Learning for Zero-Shot Super-Resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 13–19 June 2020; pp. 3516–3525. [Google Scholar]
  67. Wei, Y.; Gu, S.; Li, Y.; Timofte, R.; Jin, L.; Song, H. Unsupervised Real-World Image Super Resolution via Domain-Distance Aware Training. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 20–25 June 2021; pp. 13385–13394. [Google Scholar]
  68. Kim, J.; Jo, H.W.; Kim, W.; Jeong, Y.; Park, E.; Lee, S.; Kim, M.; Lee, W.K. Application of the domain adaptation method using a phenological classification framework for the land-cover classification of North Korea. Ecol. Inform. 2024, 81, 102576. [Google Scholar] [CrossRef]
  69. Soto Vega, P.J.; da Costa, G.A.O.P.; Feitosa, R.Q.; Ortega Adarme, M.X.; de Almeida, C.A.; Heipke, C.; Rottensteiner, F. An unsupervised domain adaptation approach for change detection and its application to deforestation mapping in tropical biomes. ISPRS J. Photogramm. Remote Sens. 2021, 181, 113–128. [Google Scholar] [CrossRef]
  70. Marsocci, V.; Gonthier, N.; Garioud, A.; Scardapane, S.; Mallet, C. GeoMultiTaskNet: Remote sensing unsupervised domain adaptation using geographical coordinates. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17–24 June 2023; pp. 2075–2085. [Google Scholar]
  71. Yadav, N.; Sorek-Hamer, M.; Von Pohle, M.; Asanjan, A.A.; Sahasrabhojanee, A.; Suel, E.; Arku, R.E.; Lingenfelter, V.; Brauer, M.; Ezzati, M.; et al. Using deep transfer learning and satellite imagery to estimate urban air quality in data-poor regions. Environ. Pollut. 2024, 342, 122914. [Google Scholar] [CrossRef]
  72. Wang, M.; Franklin, M.; Li, L. Generating Fine-Scale Aerosol Data through Downscaling with an Artificial Neural Network Enhanced with Transfer Learning. Atmosphere 2022, 13, 255. [Google Scholar] [CrossRef]
  73. Gibson, P.B.; Chapman, W.E.; Altinok, A.; Delle Monache, L.; DeFlorio, M.J.; Waliser, D.E. Training machine learning models on climate model output yields skillful interpretable seasonal precipitation forecasts. Commun. Earth Environ. 2021, 2, 159. [Google Scholar] [CrossRef]
  74. Baño-Medina, J.; Iturbide, M.; Fernández, J.; Gutiérrez, J.M. Transferability and Explainability of Deep Learning Emulators for Regional Climate Model Projections: Perspectives for Future Applications. Artif. Intell. Earth Syst. 2024, 3, e230099. [Google Scholar] [CrossRef]
  75. Rasp, S.; Thuerey, N. Data-Driven Medium-Range Weather Prediction With a Resnet Pretrained on Climate Simulations: A New Model for WeatherBench. J. Adv. Model. Earth Syst. 2021, 13, e2020MS002405. [Google Scholar] [CrossRef]
  76. Nguyen, T.; Brandstetter, J.; Kapoor, A.; Gupta, J.K.; Grover, A. ClimaX: A Foundation Model for Weather and Climate. In Proceedings of the International Conference on Machine Learning (ICML), Honolulu, HI, USA, 23–29 July 2023. [Google Scholar]
  77. Ham, Y.G.; Kim, J.H.; Luo, J.J. Deep learning for multi-year ENSO forecasts. Nature 2019, 573, 568–572. [Google Scholar] [CrossRef] [PubMed]
  78. Materia, S.; García, L.P.; van Straaten, C.; O, S.; Mamalakis, A.; Cavicchia, L.; Coumou, D.; de Luca, P.; Kretschmer, M.; Donat, M. Artificial intelligence for climate prediction of extremes: State of the art, challenges, and future perspectives. WIREs Clim. Change 2023, e914. [Google Scholar] [CrossRef]
  79. Isola, P.; Zhu, J.Y.; Zhou, T.; Efros, A.A. Image-to-Image Translation with Conditional Adversarial Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 5967–5976. [Google Scholar]
  80. Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative Adversarial Nets. In Advances in Neural Information Processing Systems; Springer: Cham, Switzerland, 2014; Volume 27. [Google Scholar]
  81. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  82. Peterson, R.A.; Cavanaugh, J.E. Ordered quantile normalization: A semiparametric transformation built for the cross-validation era. J. Appl. Stat. 2020, 47, 2312–2327. [Google Scholar] [CrossRef]
  83. Dai, T.; Cai, J.; Zhang, Y.; Xia, S.T.; Zhang, L. Second-Order Attention Network for Single Image Super-Resolution. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019. [Google Scholar]
  84. Liu, T.; Chen, Z.; Li, Q.; Wang, Y.; Zhou, K.; Xie, W.; Fang, Y.; Zheng, K.; Zhao, Z.; Liu, S.; et al. MDA-SR: Multi-level Domain Adaptation Super-Resolution for Wireless Capsule Endoscopy Images. In Medical Image Computing and Computer Assisted Intervention (MICCAI); Springer: Cham, Switzerland, 2023; pp. 518–527. [Google Scholar]
  85. Stavrakou, T.; Müller, J.F.; Bauwens, M.; De Smedt, I.; Van Roozendael, M.; Guenther, A.; Wild, M.; Xia, X. Isoprene emissions over Asia 1979–2012: Impact of climate and land-use changes. Atmos. Chem. Phys. 2014, 14, 4587–4605. [Google Scholar] [CrossRef]
  86. Müller, J.F.; Stavrakou, T.; Oomen, G.M.; Opacka, B.; De Smedt, I.; Guenther, A.; Vigouroux, C.; Langerock, B.; Aquino, C.A.B.; Grutter, M.; et al. Bias correction of OMI HCHO columns based on FTIR and aircraft measurements and impact on top-down emission estimates. Atmos. Chem. Phys. 2024, 24, 2207–2237. [Google Scholar] [CrossRef]
  87. De Smedt, I.; Stavrakou, T.; Hendrick, F.; Danckaert, T.; Vlemmix, T.; Pinardi, G.; Theys, N.; Lerot, C.; Gielen, C.; Vigouroux, C.; et al. Diurnal, seasonal and long-term variations of global formaldehyde columns inferred from combined OMI and GOME-2 observations. Atmos. Chem. Phys. 2015, 15, 12519–12545. [Google Scholar] [CrossRef]
  88. Li, X.; Dong, W.; Wu, J.; Li, L.; Shi, G. Superresolution Image Reconstruction: Selective milestones and open problems. IEEE Signal Process. Mag. 2023, 40, 54–66. [Google Scholar] [CrossRef]
  89. Donini, E.; Bruzzone, L.; Bovolo, F. Super-Resolution of Radargrams with a Generative Deep Learning Model. IEEE Trans. Geosci. Remote Sens. (TGRS) 2024, 62. [Google Scholar] [CrossRef]
  90. Carbone, A.; Restaino, R.; Vivone, G.; Chanussot, J. Model-Based Super-Resolution for Sentinel-5P Data. IEEE Trans. Geosci. Remote Sens. (TGRS) 2024, 62. [Google Scholar] [CrossRef]
  91. Wang, Z.; Bovik, A.; Sheikh, H.; Simoncelli, E. Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process (TIP) 2004, 13, 600–612. [Google Scholar] [CrossRef] [PubMed]
  92. Wang, Z.; Bovik, A. A universal image quality index. IEEE Signal Process. Lett. 2002, 9, 81–84. [Google Scholar] [CrossRef]
  93. Zhou, J.; Civco, D.L.; Silander, J.A. A wavelet transform method to merge Landsat TM and SPOT panchromatic data. Int. J. Remote Sens. 1998, 19, 743–757. [Google Scholar] [CrossRef]
Figure 1. Comparison between isoprene emission maps corresponding to the same geographical area but to different domains: (a) simulated (BU approach), (b) observed (TD approach). Emissions are reported as (a) k g m 2 · s and (b) k g a r e a c e l l · d a y . Emission flux magnitudes and patterns differ according to the data aggregation strategies adopted [23,27].
Figure 1. Comparison between isoprene emission maps corresponding to the same geographical area but to different domains: (a) simulated (BU approach), (b) observed (TD approach). Emissions are reported as (a) k g m 2 · s and (b) k g a r e a c e l l · d a y . Emission flux magnitudes and patterns differ according to the data aggregation strategies adopted [23,27].
Remotesensing 16 03963 g001
Figure 2. Graphical representation of the original CycleGAN architecture presented in [41]: (a) the architecture design and relationships; (b) the cycle loss L Cyc ; (c) the identity loss L Id . (b,c) refer only to the loss terms related to the X domain; refer to (2) and (3) for more details. In (b), Y ˜ indicates that, by applying the G mapping function to x X , we aim to find an approximation of the Y -domain distribution, i.e., y ˜ = G ( x ) . The same holds for x ˜ , where the F mapping function aims at finding a proper approximation of the X -domain distribution. The optimal G and F mapping functions translate the domains X and Y to domains Y ˜ and X ˜ that exhibit the same distributions of Y and X , respectively.
Figure 2. Graphical representation of the original CycleGAN architecture presented in [41]: (a) the architecture design and relationships; (b) the cycle loss L Cyc ; (c) the identity loss L Id . (b,c) refer only to the loss terms related to the X domain; refer to (2) and (3) for more details. In (b), Y ˜ indicates that, by applying the G mapping function to x X , we aim to find an approximation of the Y -domain distribution, i.e., y ˜ = G ( x ) . The same holds for x ˜ , where the F mapping function aims at finding a proper approximation of the X -domain distribution. The optimal G and F mapping functions translate the domains X and Y to domains Y ˜ and X ˜ that exhibit the same distributions of Y and X , respectively.
Remotesensing 16 03963 g002
Figure 3. The proposed deployment pipeline of our DA framework for emission SR. The colors of the S LR and O HR highlight the different spatial resolutions of the data they work with and with which they are trained.
Figure 3. The proposed deployment pipeline of our DA framework for emission SR. The colors of the S LR and O HR highlight the different spatial resolutions of the data they work with and with which they are trained.
Remotesensing 16 03963 g003
Figure 4. The elements that compose the CycleGANs adopted in our method and their respective relationships: (a) the CycleGAN that handles LR emissions; (b) the CycleGAN that handles HR emissions.
Figure 4. The elements that compose the CycleGANs adopted in our method and their respective relationships: (a) the CycleGAN that handles LR emissions; (b) the CycleGAN that handles HR emissions.
Remotesensing 16 03963 g004
Figure 5. The proposed training pipeline of our DA framework for emission SR. The colors of the S LR and O HR and the feature extractors highlight the different spatial resolutions of the data they work with and are trained with. The SR network N s is frozen. Instead, the DA networks ( S LR and O HR ) and the feature extractors are fully trainable. * For the sake of clarity, we only show the emissions that serve to explain our proposed method. The complete set of emissions needed for training is ( I LR o , I LR s , I HR o , I HR s ).
Figure 5. The proposed training pipeline of our DA framework for emission SR. The colors of the S LR and O HR and the feature extractors highlight the different spatial resolutions of the data they work with and are trained with. The SR network N s is frozen. Instead, the DA networks ( S LR and O HR ) and the feature extractors are fully trainable. * For the sake of clarity, we only show the emissions that serve to explain our proposed method. The complete set of emissions needed for training is ( I LR o , I LR s , I HR o , I HR s ).
Remotesensing 16 03963 g005
Figure 6. The proposed pipeline for the L FeA 1 term of the FeA loss L FeA .
Figure 6. The proposed pipeline for the L FeA 1 term of the FeA loss L FeA .
Remotesensing 16 03963 g006
Figure 7. The proposed pipeline for the L FeA 2 term of the FeA loss L FeA .
Figure 7. The proposed pipeline for the L FeA 2 term of the FeA loss L FeA .
Remotesensing 16 03963 g007
Figure 8. Graphical representation of the elements and parameters involved in the EmC loss L EmC computation. The purple squared boxes in the right bottom corner highlight the pixel spatial coverage differences between emissions T LR o and T ^ HR o ˜ . For example, for a scale factor α = 2 , a single T LR o pixel covers the same geographical area of four T ^ HR o ˜ pixels.
Figure 8. Graphical representation of the elements and parameters involved in the EmC loss L EmC computation. The purple squared boxes in the right bottom corner highlight the pixel spatial coverage differences between emissions T LR o and T ^ HR o ˜ . For example, for a scale factor α = 2 , a single T LR o pixel covers the same geographical area of four T ^ HR o ˜ pixels.
Remotesensing 16 03963 g008
Figure 9. Example emission maps from the same geographical area from all the inventories adopted in this work: (a) BU-MEG-050; (b) BU-MEG-025; (c) TD-OMI-050; (d) TD-GOME2-050. Notice the different value ranges (flux) between the simulated domain, i.e., (a,b) k g m 2 · s , and the observed domain, i.e., (c,b) k g a r e a c e l l · d a y . The reported emission maps all refer to June 2010 (monthly average).
Figure 9. Example emission maps from the same geographical area from all the inventories adopted in this work: (a) BU-MEG-050; (b) BU-MEG-025; (c) TD-OMI-050; (d) TD-GOME2-050. Notice the different value ranges (flux) between the simulated domain, i.e., (a,b) k g m 2 · s , and the observed domain, i.e., (c,b) k g a r e a c e l l · d a y . The reported emission maps all refer to June 2010 (monthly average).
Remotesensing 16 03963 g009
Figure 10. MaxAE histograms for all the considered strategies, i.e., cases I–V. Vertical dashed lines denote the mean value of the related case.
Figure 10. MaxAE histograms for all the considered strategies, i.e., cases I–V. Vertical dashed lines denote the mean value of the related case.
Remotesensing 16 03963 g010
Figure 11. SCC histogram for all the considered strategies, i.e., cases I–V. Vertical dashed lines denote the mean value of the related case. The mean values of case I and case V are almost equivalent.
Figure 11. SCC histogram for all the considered strategies, i.e., cases I–V. Vertical dashed lines denote the mean value of the related case. The mean values of case I and case V are almost equivalent.
Remotesensing 16 03963 g011
Figure 12. NMSE histograms for all the considered strategies, i.e., cases I–V. Red dashed lines identify the NMSE threshold above which the associated SR emission maps are considered super-resolved poorly, i.e., NMSE > 5 dB. Blue histograms refer to the case under investigation. Gray histograms refer to the elements with an NMSE > 5 dB in case I.
Figure 12. NMSE histograms for all the considered strategies, i.e., cases I–V. Red dashed lines identify the NMSE threshold above which the associated SR emission maps are considered super-resolved poorly, i.e., NMSE > 5 dB. Blue histograms refer to the case under investigation. Gray histograms refer to the elements with an NMSE > 5 dB in case I.
Remotesensing 16 03963 g012
Figure 13. MaxAE histograms for all the considered strategies, i.e., cases I–V. Red dashed lines identify the MaxAE threshold above which the associated SR emission maps are considered super-resolved poorly, i.e., MaxAE > 6 × 10 5 . Blue histograms refer to the case under investigation. Gray histograms refer to the elements with an MaxAE > 6 × 10 5 in case I.
Figure 13. MaxAE histograms for all the considered strategies, i.e., cases I–V. Red dashed lines identify the MaxAE threshold above which the associated SR emission maps are considered super-resolved poorly, i.e., MaxAE > 6 × 10 5 . Blue histograms refer to the case under investigation. Gray histograms refer to the elements with an MaxAE > 6 × 10 5 in case I.
Remotesensing 16 03963 g013
Figure 14. Super-resolved emission map examples ( I ^ HR o ˜ ) for all the five strategies, i.e., cases I–V; the first column shows the LR emission input I LR o ; the last column shows the ground-truth HR emission I HR o . Emissions are reported as k g a r e a c e l l · d a y .
Figure 14. Super-resolved emission map examples ( I ^ HR o ˜ ) for all the five strategies, i.e., cases I–V; the first column shows the LR emission input I LR o ; the last column shows the ground-truth HR emission I HR o . Emissions are reported as k g a r e a c e l l · d a y .
Remotesensing 16 03963 g014
Figure 15. Violin plots for the NMSE (a), SCC (b), and UIQI (c) metrics, comparing the SR performance of our system for two simulated emission inventories in the training phase, i.e., BU-MEG-050 ( 0.50 ° × 0.50 ° Matched scenario) and BU-MEG-025 ( 0.25 ° × 0.25 ° Unmatched scenario).
Figure 15. Violin plots for the NMSE (a), SCC (b), and UIQI (c) metrics, comparing the SR performance of our system for two simulated emission inventories in the training phase, i.e., BU-MEG-050 ( 0.50 ° × 0.50 ° Matched scenario) and BU-MEG-025 ( 0.25 ° × 0.25 ° Unmatched scenario).
Remotesensing 16 03963 g015
Figure 16. Super-resolved emission map examples ( I ^ HR o ˜ ) for two different simulated emission inventories in training phase, i.e., BU-MEG-050 ( 0.50 ° × 0.50 ° , Matched scenario) and BU-MEG-025 ( 0.25 ° × 0.25 ° , Unmatched scenario). The first column shows the ground-truth HR emission I HR o ; the last two columns report the squared absolute error related to these two scenarios. Emissions are reported as k g a r e a c e l l · d a y .
Figure 16. Super-resolved emission map examples ( I ^ HR o ˜ ) for two different simulated emission inventories in training phase, i.e., BU-MEG-050 ( 0.50 ° × 0.50 ° , Matched scenario) and BU-MEG-025 ( 0.25 ° × 0.25 ° , Unmatched scenario). The first column shows the ground-truth HR emission I HR o ; the last two columns report the squared absolute error related to these two scenarios. Emissions are reported as k g a r e a c e l l · d a y .
Remotesensing 16 03963 g016
Table 1. Detailed information of the biogenic emission inventories adopted in this study. We report the BU-MEG-050 and BU-MEG-025 specifics for the simulated domain, and TD-OMI-050 and TD-GOME2-050 specifics for the observed domain.
Table 1. Detailed information of the biogenic emission inventories adopted in this study. We report the BU-MEG-050 and BU-MEG-025 specifics for the simulated domain, and TD-OMI-050 and TD-GOME2-050 specifics for the observed domain.
ParameterBiogenic Inventory
BU-MEG-050 BU-MEG-025 TD-OMI-050 TD-GOME2-050
Domain S S O O
MethodMEGANv3.2MEGAN2.1OMI-based InversionGOME-2-based Inversion
VariableIsoprene FluxIsoprene FluxIsoprene FluxIsoprene Flux
Reference[23][22][27][85]
Spatial CoverageGlobalGlobalGlobalGlobal
Spatial Resolution 0.50 ° × 0.50 ° 0.25 ° × 0.25 ° 0.50 ° × 0.50 ° 0.50 ° × 0.50 °
Temporal Coverage2001–20202000–20192005–20142007–2012
Temporal ResolutionHourly (Monthly Avg)Hourly (Monthly Avg)DailyDaily
Num. Emission Maps5760576036522192
Emission Range [min, max] [ 0 , 8.0 × 10 9 ] k g m 2 · s [ 0 , 6.6 × 10 9 ] k g m 2 · s [ 0 , 2.3 × 10 6 ] k g a r e a c e l l · d a y [ 0 , 3.5 × 10 6 ] k g a r e a c e l l · d a y
Table 2. Performance analysis of the FeA loss L FeA coefficients γ and δ . In bold, we denote the best-performing average value.
Table 2. Performance analysis of the FeA loss L FeA coefficients γ and δ . In bold, we denote the best-performing average value.
γ δ SSIM ↑NMSE ↓UIQI ↑SCC ↑
0.00.0 0.769 10.152 0.642 0.345
0.50.0 0.806 10.763 0.721 0.585
1.00.0 0.620 6.522 0.468 0.323
0.00.5 0.411 4.200 0.210 0.049
0.50.5 0.524 5.743 0.343 0.162
1.00.5 0.305 2.930 0.109 0.007
0.01.0 0.234 1.718 0.072 0.004
0.51.0 0.279 2.201 0.109 0.016
1.01.0 0.309 2.495 0.122 0.016
Arrow direction indicates the correct direction of the metric.
Table 3. Comparative study of the contribution of each loss term in the final training objective L Train . In bold, we denote the best-performing average value. Avg and std denote the average and standard deviation of the specific metric, respectively.
Table 3. Comparative study of the contribution of each loss term in the final training objective L Train . In bold, we denote the best-performing average value. Avg and std denote the average and standard deviation of the specific metric, respectively.
L CG L FeA L EmC SSIM ↑NMSE ↓UIQI ↑SCC ↑MaxAE ↓
avgstdavgstdavgstdavgstdavgstd
0.257 0.201 1.460 2.522 0.065 0.099 0.008 0.040 57,533.93291,992.264
0.806 0.173 10.763 3.984 0.721 0.179 0.585 0.194 34,207.77063,232.346
0.294 0.206 2.099 2.416 0.096 0.098 0.018 0.067 55,961.35189,127.842
0.812 0.133 10.744 3.723 0.716 0.154 0.642 0.196 30,228.78057,173.855
Arrow direction indicates the correct direction of the metric.
Table 4. Comparative study of all the considered strategies, i.e., cases I–V. Avg and std denote the average and standard deviation of the specific metric, respectively.
Table 4. Comparative study of all the considered strategies, i.e., cases I–V. Avg and std denote the average and standard deviation of the specific metric, respectively.
CaseSSIM ↑NMSE ↓UIQI ↑SCC ↑MaxAE ↓
avg std avg std avg std avg std avg std
I 0.913 0.111 13.875 6.348 0.860 0.125 0.641 0.162 101,071.325228,636.166
II 0.887 0.065 13.53 3.931 0.810 0.084 0.399 0.047 60,317.179160,407.888
III 0.257 0.201 1.460 2.522 0.065 0.099 0.008 0.040 57,533.93291,992.264
IV 0.806 0.173 10.763 3.984 0.721 0.179 0.585 0.194 34,207.77063,232.346
V 0.812 0.133 10.744 3.723 0.716 0.154 0.642 0.196 30,228.78057,173.855
Arrow direction indicates the correct direction of the metric.
Table 5. Spatial resolution generalization results, comparing the SR performance of our system for two simulated emission inventories in the training phase, i.e., BU-MEG-050 ( 0.50 ° × 0.50 ° ) and BU-MEG-025 ( 0.25 ° × 0.25 ° ). Avg and std denote the average and standard deviation of the specific metric, respectively. In bold, we denote the best-performing average value.
Table 5. Spatial resolution generalization results, comparing the SR performance of our system for two simulated emission inventories in the training phase, i.e., BU-MEG-050 ( 0.50 ° × 0.50 ° ) and BU-MEG-025 ( 0.25 ° × 0.25 ° ). Avg and std denote the average and standard deviation of the specific metric, respectively. In bold, we denote the best-performing average value.
Train S InventorySSIM ↑NMSE ↓UIQI ↑SCC ↑MaxAE ↓
avg std avg std avg std avg std avg std
BU-MEG-050 0.812 0.133 10.744 3.723 0.716 0.154 0.642 0.196 30,228.78057,173.855
BU-MEG-025 0.849 0.120 12.071 3.731 0.773 0.139 0.647 0.163 32,889.44963,612.669
Arrow direction indicates the correct direction of the metric.
Table 6. Emission inventory generalization results: testing the performance of our system in super-resolving unseen emissions from TD-GOME2-050, considering also the Matched (BU-MEG-050) and the Unmatched (BU-MEG-025) scenarios from the previous study. Avg and std denote the average and standard deviation of the specific metric, respectively. In bold, we denote the best-performing average value.
Table 6. Emission inventory generalization results: testing the performance of our system in super-resolving unseen emissions from TD-GOME2-050, considering also the Matched (BU-MEG-050) and the Unmatched (BU-MEG-025) scenarios from the previous study. Avg and std denote the average and standard deviation of the specific metric, respectively. In bold, we denote the best-performing average value.
Train S InventorySSIM ↑NMSE ↓UIQI ↑SCC ↑MaxAE ↓
avg std avg center avg std avg std avg std
BU-MEG-050 0.809 0.134 10.511 3.582 0.712 0.153 0.636 0.193 37,153.54467,031.764
BU-MEG-025 0.843 0.120 11.359 3.228 0.757 0.138 0.641 0.162 42,370.64875,528.448
Arrow direction indicates the correct direction of the metric.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Giganti, A.; Mandelli, S.; Bestagini, P.; Tubaro, S. Learn from Simulations, Adapt to Observations: Super-Resolution of Isoprene Emissions via Unpaired Domain Adaptation. Remote Sens. 2024, 16, 3963. https://doi.org/10.3390/rs16213963

AMA Style

Giganti A, Mandelli S, Bestagini P, Tubaro S. Learn from Simulations, Adapt to Observations: Super-Resolution of Isoprene Emissions via Unpaired Domain Adaptation. Remote Sensing. 2024; 16(21):3963. https://doi.org/10.3390/rs16213963

Chicago/Turabian Style

Giganti, Antonio, Sara Mandelli, Paolo Bestagini, and Stefano Tubaro. 2024. "Learn from Simulations, Adapt to Observations: Super-Resolution of Isoprene Emissions via Unpaired Domain Adaptation" Remote Sensing 16, no. 21: 3963. https://doi.org/10.3390/rs16213963

APA Style

Giganti, A., Mandelli, S., Bestagini, P., & Tubaro, S. (2024). Learn from Simulations, Adapt to Observations: Super-Resolution of Isoprene Emissions via Unpaired Domain Adaptation. Remote Sensing, 16(21), 3963. https://doi.org/10.3390/rs16213963

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop