1. Introduction
Evapotranspiration (ET) is a critical component of the hydrological cycle, deeply influencing water availability in ecosystems and agricultural landscapes [
1]. Yet, predicting ET on a large scale is quite challenging, given the myriad of physical drivers at play. Factors such as wind patterns, cloud cover, soil moisture, surface albedo, precipitation, radiation scattering, and surface heterogeneity introduce considerable variability. Moreover, the intricate interplay of crop biophysical characteristics, including canopy structure, vegetation abundance, phenology, and energy partitioning, further compounds this variability. This complexity is especially pronounced in data-sparse regions and heterogeneous land use systems, such as the Mediterranean region [
2]. This region experiences a complex interplay of factors, including diverse land cover, heterogeneous topography, and variable climatic conditions, making ET estimation particularly challenging [
3]. Moreover, ET and precipitation are the principal mechanisms of the water balance in the Mediterranean Basin. Due to precipitation decline, Mediterranean ecosystems in specific arid and semi-arid production systems are more vulnerable to water stress leading to growing hydrological imbalances in the basin [
4]. The accelerating near-surface temperatures and declining near-surface soil moisture exacerbate the need for precise ET assessments [
5]. Our understanding of Mediterranean ecosystems’ reactions to stressed hydro-climatological conditions is still limited. This information gap demands strong monitoring and modeling capabilities in terrestrial water flux quantification (evapotranspiration, runoff, and groundwater storage), especially ET, which is a principal water-extracting component in arid regions. ET’s accurate quantification with precipitation is becoming critical in the basin with the growing water and food scarcity [
6,
7]. Since many ET modeling schemes exist, it is meaningful to find the best approach for the Mediterranean region through which we can manage and control water stress, seasonal water shortages, and the expansion of irrigated agriculture.
While various remote sensing and modeling approaches have been developed to estimate ET, their effectiveness in capturing the complexities of the Mediterranean region is still negotiable. Remote sensing-based models often rely on empirical relationships and simplifications, neglecting the influence of sub-surface hydrology and complex land use patterns. ET by these models is derived from the integration of radiometric land surface temperature (LST), and biophysical variables, i.e., leaf area index (LAI), photosynthetically active radiation (PAR), fractional vegetation cover (FVC), clumping index, leaf angles, canopy height, leaf abundance, ecosystem hierarchy, etc. [
8,
9]. Some other techniques are based on net radiation and temperature differentials [
10,
11], or an inferential approach constrained by crop coefficients [
12,
13,
14]. These simple generic models do not account for the horizontal movement of soil energy fluxes and water and heat storage [
15,
16,
17,
18]. Some examples are the Mapping ET at high Resolution with Internalized Calibration-Earth Engine Evapotranspiration Flux (METRIC-EEFlux), surface energy balance index (SEBI), Operational Simplified Surface Energy Balance (SSEBop), Atmosphere-Land Exchange Inverse algorithm (ALEXI), and Atmosphere Land Exchange Inverse Disaggregation algorithm (DisALEXI). In these models, LST is used as a proxy for soil moisture conditions of the upper soil layer. Surface resistances are modeled as a function of abstracted hydrology following the aerodynamic theory. The accurate estimation of aerodynamic resistance is crucial, due to complexities in atmospheric conditions and surface characteristics. Temperature gradients in the planetary boundary layer influence the rate of evaporation, and changes in surface roughness occur as soil moisture decreases and vegetation wilt, further affecting the exchange of moisture between the surface and the atmosphere. Though these models have made ET determination easier are more flexible in time and scale. With their generic settings, they have been applied over large continents in recent years such as Water Productivity Open-access portal version 2 (WaPORII) for Africa, the Global Daily Evapotranspiration Portal-Atmosphere-Land Exchange Inverse algorithm (GLODET-ALEXI) for the Mediterranean region, and OpenET-Ensemble for the United States Continent [
19,
20]. The lack of uniformity in the current model schemes made ET estimates more uncertain. LST-driven models do not have soil–water balancing and are preferred to use in crop ET monitoring due to their short temporal frequency and compatibility with crop phenology. Where ET in the semi-arid Mediterranean environment is very sensitive to soil moisture conditions, sub-surface hydrology in the soil plays a significant role in ET regulation [
21,
22,
23,
24,
25]. ET methods need to increase their modeling precision in stress parameters and better space/time resolutions with good-quality data to improve the quality of predictions in water-stressed environments.
Physically based soil–vegetation–atmosphere transfer (SVAT) models offer a more mechanistic representation of hydrological processes, coupling soil moisture dynamics, root zone interactions, and energy balance components. However, their application to diverse land use systems, such as rotational cropping, demands extensive parameterization and a large variable space [
26]. Large-scale applications require multi-calibration techniques to enhance ET accuracy within a hydrological model because SVAT models often struggle to accurately capture ET in complex and heterogeneous landscapes [
27]. The weak correlation between root zone soil moisture and canopy temperature under water-stressed conditions further emphasizes the challenges associated with these models [
28]. Some SVAT modeling schemes use coupled Penman-Monteith and water balance approaches in a spatially explicit manner and simulate the coupled interactions between water and energy regimes. Such schemes connect deep soil hydrology with atmospheric fluxes realistically, they often require trial and error-based intensive parameterization with a comparatively large number of datasets [
29,
30]. This study intends to explore which ET estimation approach is most suitable for the Mediterranean region in capturing spatial variability of ET across different ecosystems. With that aim, models based on energy balance, water balance, and coupled approaches are tested in which the first large-scale application of the SVAT model, STEPS, to the Mediterranean region is introduced with its comparison with other established models, i.e., SCOPE and TSEB.
STEPS is a process-based model that dynamically calculates lateral water movement, runoff, soil moisture content, water table depth, and snowmelt, and integrates it all together and estimates ET on a daily scale. It adopts a “four-leaf” approach to stratify leaves based on moisture and sunlit conditions and solves water balance equations for each depth of the soil layer to represent diverse ecological and hydrological constraints in a spatially explicit manner [
31]. Simulating the coupled behavior of water and energy cycle and accounting for subsurface lateral flux movement differentiates it from SCOPE and TSEB. In contrast, the SCOPE, being a radiative transfer model with energy balance calculations, first simulates spectral reflectance and transmittance of vegetation canopy and soil to derive fluxes of radiation, heat, water vapor, and carbon dioxide in the soil–atmosphere–vegetation continuum. It uses empirical and semi-empirical relationships and parameterizes the effects of soil moisture and LAI on canopy conductance and photosynthesis. However, it does not address the topography or canopy-clumping effects that STEPS do [
32]. TSEB, in comparison, is a simple dual-source energy balance model that estimates two-layer evaporation separately for soil and canopy from the satellite-based LST in which wind speed and canopy height are used to compute aerodynamic and surface resistances. It simulates turbulent fluxes in the loop of given boundary layer conditions and uses empirical and semi-empirical relationships to parameterize the effects of surface temperature, vegetation cover, and aerodynamic resistance on the energy fluxes. In TSEB, the choice of soil resistance formulation can affect the accuracy of ET estimation. Different studies proposed and evaluated different soil resistance formulations [
33]. Apart from others, it has the flexibility of choosing different ET methods, i.e., the Priestley–Taylor equation, dual-time difference approach, or component soil and canopy temperatures approach; all these methods require different climatic inputs, such as the maximum and the minimum air temperature, solar radiation, relative humidity, wind speed, etc. All these models are unique in their ET determination methods, input data, output frequency, assumptions, and sources of uncertainty.
This study evaluates the performances of these ET models (STEPS, SCOPE, and TSEB) across the Mediterranean region and compares them with those of existing global ET products (MOD16A2, GLEAM, and TerraClimate) and ground-based measurements, to identify their strengths, weaknesses, and potential for ET estimation in this complex environment. In the literature, mostly a comparison of annual and monthly estimates has been provided. This study provides a deeper insight into the modeled ET at a sub-regional scale to see if the modeled ET is reliable for all regions. This is a concern because low accuracy can change the water budget estimate if ET is not modeled accurately, and if some environmental constraints are ignored especially in arid and humid climates. Thus, our research contributes to a better understanding of ET dynamics in the Mediterranean region to support informed water resource management.
3. Results
3.1. Spatiotemporal Comparison among New and Existing Modeled Evaporative Fluxes
Figure 2a presents the annual evapotranspiration (ET) estimates across the study region from all models. As expected, the total annual ET varies based on climatic conditions. Lower values (0–500 mm) are observed over Turkey, the Middle East–North African belt, and Spain, reflecting their aridity. Conversely, regions like Italy, the Balkans, France, and Portugal exhibit higher ET (>500 mm), indicating a more humid climate with a larger contribution of ET (
Figure 2a). Interestingly, the SCOPE model shows comparatively higher ET estimates for semi-arid regions like Turkey compared to other models. This discrepancy highlights the importance of model selection and the influence of input data on ET estimates. In the case of SCOPE, the lack of explicit water availability constraints may be leading to overestimations in semi-arid regions. The spatial patterns of ET in cultivated regions like the Nile Delta in Egypt are generally well captured by STEPS, TSEB, and SCOPE, but not by GLEAM and TerraClimate. This highlights the difference in how these models estimate ET. Studies have shown that average ET rates in the Nile Delta range from 705 mm/year for croplands to 1099 mm/year for forests [
47]. Our findings of comparable ET patterns over the Nile with STEPS, TSEB, and SCOPE are consistent with these observations.
The discrepancies in the annual pattern between models and products highlight the difference of approaches for ET estimation. Models like STEPS, SCOPE, TSEB, and MOD16A2 rely on LAI- and LST-driven schemes. These models perform well in capturing ET over cultivated semi-arid regions. However, models like GLEAM and TerraClimate take precipitation as the main input, which can lead to significant underestimations in the arid areas. The variability in ET in agroecosystems is largely influenced by vegetation characteristics like species composition and LAI [
48]. For example, the spatial pattern of STEPS-ET closely resembles MOD16A2, particularly over southern Europe. This similarity is likely due to their use of a common LAI dataset and the Penman–Monteith approach. In contrast, TSEB and SCOPE, which incorporate LST, canopy temperature, and radiative responses, show greater spatial variability in ET across irrigated and non-irrigated zones.
This study represents the first combined regional-scale evaluation of these models (STEPS, SCOPE, TSEB) for ET estimation in the Mediterranean. The results provide some confidence in the ability of these models, particularly those incorporating LAI or LST, to capture ET over cultivated areas. However, limitations were also identified in areal coverage. Models relying solely on LAI exhibited missing values in arid regions of North Africa. Using precipitation as the main input in the case of GLEAM and TerraClimate also showed negligible ET estimates over irrigated zones.
Figure 2b highlights the differential responses of the models to different land cover types across the Mediterranean. TSEB’s estimates show minimal variation across land cover types at the 10 km resolution, likely due to pixel heterogeneity leading to uncertainty in canopy parameterization. This results in smaller gaps in ET between different land cover types. In contrast, SCOPE, due to its lack of explicit water limitations and its land cover-independent parameterization, produces higher ET values across all land cover types. However, LAI plays a dominant role in differentiating canopy response. Irrigated crops can also show higher ET than grasslands or shrublands, it is determined by SCOPE and TSEB. These models account for several surface characteristics, such as albedo, roughness, and vegetation properties, which influence energy balance calculations and can lead to higher ET estimates from irrigated crops. Other models show a larger gap of more than 100 mm in annual ET between forests and other land cover types, with a decreasing sequence of forest > shrubland > grassland > cropland. Forests typically have higher LAI, more complex canopies, and greater transpiration rates, leading to higher overall ET compared to other land covers. This finding aligns with observations from eddy covariance sites as well, where forests generally exhibit higher daily ET rates in the Mediterranean region. So, forests can show the highest cumulative ET in the Mediterranean region. Overall, the observed variations in annual ET patterns across land cover types highlight the need for accurate land cover parameterization and corresponding boundary conditions for reliable ET estimation.
3.2. Comparison of Seasonal Evapotranspiration Sums
Seasonal ET sums for 2019 from each model (STEPS, SCOPE, and TSEB) are also compared with reference datasets (MOD16, GLEAM, and TerraClimate). A spatial correlation was applied to examine the degree of agreement between the models and other products. The chosen period, March to November, captures the growing season and peak ET activity. Spatial variability among ET products is most prominent in summer (
Figure 3a) when atmospheric water demand is typically high in the Mediterranean region. This reflects the models’ ability to capture the spatial dynamics of ET linked to seasonal variations in water demand. As expected, ET estimates are proportionally higher in summer, ranging from 0 to 350 mm in spring, from 150 to over 650 mm in summer, and declining to 0 to 150 mm in autumn. This seasonal pattern aligns well with daily ET trends reported in many Mediterranean site-scale measurements.
During summer, when precipitation is minimal, GLEAM exhibited the lowest cumulative ET (<350 mm) compared to other models across all seasons. Other products showed some ET variability over the Balkans and France. Spatial autocorrelation analysis provided further insights into model agreement. STEPS showed a strong correlation with TSEB, MOD16A2, and GLEAM, with the strongest correlation observed between STEPS, TSEB, and GLEAM in summer. In contrast, SCOPE exhibited a generally lower correlation with other products, indicating that its spatial behavior is distinct. As expected, SCOPE also showed relatively high ET levels (>200–350 mm) over the Nile region, where ET is negligible in GLEAM’s estimates throughout the year (consistent with annual findings). On the other hand, STEPS, TSEB, and MOD16A2 exhibited good agreement across all seasons.
The likely reason for these spatial differences is how models are trained on closing water and energy balance when imbalances are caused by the radiation budget, latent heat flux, ground and sensible heat fluxes, or soil moisture dynamics. Additionally, differences in model parameterizations have influenced their outputs. The high correlation observed among STEPS, TSEB, and GLEAM in summer, and with MOD16A2 in spring, highlights their consistency and potential for intercomparison, particularly during the growing season.
3.3. Comparison of Total ET Estimates and Evaporative Stress Index over AOIs
Hourly potential evapotranspiration (PET), a derived dataset from the ASCE’s standardized reference ET model is used for countrywide atmospheric energy demand comparisons. Zonal statistics were applied to calculate the average precipitation (influx) and PET (outflux) for each country over a year.
Figure 4 highlights the hydro-climatic disparities for the year 2019 in the Mediterranean countries. Precipitation is surplus in some countries (e.g., Montenegro, Bosnia, Albania, Italy, and Croatia) and deficits in others. This suggests favorable conditions for water availability, agriculture, and ecosystems in some southeast Mediterranean countries with precipitation surpluses. In contrast, the majority of Middle Eastern and North African (MENA) countries received less than 500 mm of annual rainfall, with the exceptions of Lebanon and Cyprus. Notably, Egypt is one of the driest countries, where PET reached above 1500 mm. In Egypt and other MENA countries (Libya, Tunisia, Algeria, Morocco) rainfall is low but evaporation demand is high, and agriculture is irrigation dependent. This makes water demand a dominant factor controlling water availability in the region. However, irrigation supplements the water deficit and allows agricultural activities to persist. The dependence on irrigation systems also highlights these regions’ fragility to changes in climate variability. Strong climate variability over southern Europe, Turkey, and the western Balkans has also been observed in recent years, affecting the water budget, and making their land use systems more vulnerable to water stress, which makes the two distinct climate clusters of hydrological imbalances more intense [
4], as shown in
Figure 4 with a dotted borderline.
The comparison of ET across different land cover types of seven AOIs reveals more distinct model behaviors in arid regions.
Figure 5a presents the area-averaged annual ET sums from all models across seven AOIs, in which SCOPE exhibits the highest values over croplands in Turkey, Morocco, Egypt, and the grasslands of Spain. In these areas with high PET near 1500 mm, indicating a high atmospheric water demand, GLEAM and TerraClimate show considerably lower ET there compared to other models, indicating their limitation in the arid cultivated regions. These model discrepancies highlight the importance of not relying on a single ET product for water use assessment. A complementary approach using multiple ET datasets is recommended for the arid regions in the Middle East and North Africa (MENA). However, differences in case of France, Italy, and the Balkans are less pronounced. Here, TSEB shows relatively low ET estimates over the forest ecosystems.
The Evaporative Stress Index (ESI) is employed to further assess how different ET products can influence the assessment of water stress (
Figure 5b). An ESI value of −1.5 or lower indicates severe water stress, while +1.5 or higher suggests excessive soil moisture [
49]. By comparing ESI derived from different models’ ET outputs, we observed variations in model performance over AOIs. For example, over croplands in Egypt, STEPS-GLEAM indicated water stress in June and July, whereas SCOPE-TSEB indicated earlier stress in May. It shows their effectiveness for irrigated croplands, assessing water demand earlier. Over grasslands in Spain, TSEB, MOD16A2, GLEAM, and TerraClimate showed a summer deficit, while STEPS and SCOPE did not, showing usability of the former over Spain. In the Balkan Forest, all models behaved entirely differently, making it challenging to decipher the uniform ET pattern of this region. The response of models over croplands in France was more consistent. STEPS, GLEAM, and TerraClimate, indicated similar trends, with only TerraClimate showing water stress in July, TSEB in August, and STEPS indicating stress throughout the summer. In contrast, all models agreed on water stress for croplands in Morocco during the same period, with SCOPE and TSEB again showing earlier stress as observed in Egypt. The overlapping responses by models in Morocco are likely due to the dominant role of high atmospheric demand and dry conditions. Ground ET measurements in Morocco would be valuable for verifying this uniform response by all models.
The following is inferred:
Over Spain, TSEB, MOD16A2, GLEAM, and TerraClimate are showing consistent trends, giving robust signals of water stress in spring and summer, but this seasonal trend is absent in the case of STEPS and SCOPE, which means they need better parameterization for this climatic region, which is also observed in their ground validations over shrubland sites.
Over Egypt, SCOPE’s and MOD16A2’s behaviors are comparable, whereas models that account for precipitation have collectively shown July as the most stressed month for Egypt (although not severe for STEPS, TSEB, GLEAM, and TerraClimate)
Over France, similarity was found between SCOPE-GLEAM, TSEB-MOD16A2, and between STEPS-TerraClimate. These models also showed good accuracy for forest sites.
A robust agreement among all models was observed over croplands in Morocco, which may be due to the dominance of the crop ecosystem or the strong influence of atmospheric demand and dry conditions in summer.
Based on these results, the feasibility of using ensemble ET outputs from these models is likely supported for croplands in Morocco. ET estimation remains challenging in the Balkans, where irregular precipitation patterns further complicate the determination of ET trends with the least definite ET seasonality (
Figure 4). This evaluation highlights the highly diverse nature of ET across the Mediterranean region, suggesting that a single approach may not be optimal for modeling ET flux, particularly in regions where both energy and water limitations coexist. A pragmatic approach is needed with a combination of methods, necessitating careful consideration of climate clustering or regional segregation during model parameterization (as in
Figure 4).
Unlike many studies that focus on broad comparisons of annual and monthly ET estimates, this study provides deeper insights into modeled ET at sub-regional, seasonal, and ecosystem scales through its application to water stress assessment via evaporative stress index. Our findings suggest that modeled ET may be unreliable for specific regions, such as the Balkans and Turkey. Significant improvements in model parameterizations are needed in these areas. Given the limited ground observations of ET in North Africa and the eastern Mediterranean region, model validation over this region remains understudied, particularly for desert and oasis ecosystems [
14]. We need ongoing model refinement, validation, and integration with ground-based observations to address these limitations and improve ET estimates across the region.
3.4. Daily Time-Series: Modeled ET vs. Eddy Covariance ET
To assess the accuracy of the models on the ground, daily time series data for 2019 were compared with eddy covariance observations from 17 sites across southern Europe and 1 site in Israel (IL-Yat). These sites encompass a diverse range of ecosystems, including evergreen needle forests, deciduous boreal forests, evergreen boreal forests, croplands, grasslands, shrublands, and savannahs covering mostly ecosystems in southern Europe.
The results have shown daily ET varies across land cover types. Forests in France and Italy exhibited the highest daily ET (>6 mm/day), followed by croplands (<6 mm/day), and shrublands/savannahs (around 4 mm/day). This trend in higher ET in Mediterranean forests was consistent across some models (mainly STEPS) (
Figure 3b). However, seasonal ET patterns differed among models across sites. For instance, STEPS and SCOPE overestimated summer ET in Spanish shrublands, while GLEAM and TSEB captured the right ET pattern observed there. In contrast, all models agreed on seasonal patterns in some Italian and French croplands and forests. Discrepancies in boreal forests and savannahs highlight the need for improvements in capturing the phenological response of these ecosystems (e.g., STEPS, SCOPE, TSEB).
Forest and grassland sites generally showed the strongest agreement between modeled and eddy covariance (EC) ET observations (r
2 = 0.6–0.8), with low root mean square error (RMSE) and mean bias error (MBE) (
Figure 6). IT_MBo exhibited the highest coefficient of determination (r
2) scores, indicating strong agreement between modeled and ET observations in the range of 0.6–0.8. Their corresponding root mean square error (RMSE) and mean bias error (MBE) were also low (
Figure 6), showing the high accuracy of these models in these two ecosystems (forests and grasslands). SCOPE and STEPS also showed good agreement in some forest, grassland, and cropland sites, while GLEAM performed best in shrublands. GLEAM, along with other sites, showed the highest congruence in shrubland sites (ES_LM1, ES_LM2, and ES_LMa), whereas only TSEB showed high congruence afterward.
Five out of eighteen sites (mostly in France and Italy) showed significant agreement with both STEPS and SCOPE models (FR_HES, IT_Ren, FR_LGt, IT_MBo, and IT_Tor) (
Figure 7). However, several sites exhibited lower accuracies. The Israeli forest (IL_Yat), Italian cropland (IT_BCi), and Spanish shrublands (ES_LJu and ES_Agu) showed lower agreement with modeled ET. Similarly, all models showed poor performance in evergreen boreal forest sites (ES_Cnd, FR_Pue, and IT_CP2), suggesting a need for improved model parameterization for boreal ecosystems, which are prone to wildfire in the Mediterranean region in the context of heat waves and water stress.
In summary, SCOPE demonstrates good performance with grasslands, GLEAM shows better in savannahs, and STEPS shows good performance in certain forest and crop sites. The performance of these products was checked with both unadjusted fluxes’ original ET and corrected fluxes for energy balance closure (ETebc); accuracy results were the same. Diego Salazar et al., 2022, also noticed, when evaluating with the original unadjusted ET flux, that GLEAM and ALEXI showed the highest coefficient of determination (r
2), smallest percent bias (PBIAS), and best root mean square error (RMSE) with unadjusted ET [
50].
Weak agreement with other EC sites may stem from seasonal fluctuations in meteorological, soil, and land conditions, as well as the limitations of some models in capturing heterogeneity in ecosystems. Some models cannot perform well in heterogeneous ecosystems. This limitation was noticed by Burchard-Levine et al. (2020) first in TSEB, and they recommended seasonal adaptations of TSEB for a dominant vegetation cover for a given period to accurately estimate ET at a large spatial scale [
51]. This validation lacks ground-based ET data over North Africa, limiting our ability to validate model responses over irrigated croplands in countries like Egypt, Morocco, and other North African countries. These regions exhibit significant seasonal variability in irrigated ET as observed in the ET magnitude and spatial patterns above, ground validation is major limitation for building confidence of using these models.
4. Discussion
The presented study highlights the complexities and uncertainties surrounding ET estimation in the Mediterranean region. Different models rely on distinct approaches. Some, like MOD16A2 and TSEB, focus solely on the energy balance approach, while others like STEPS and GLEAM couple energy and water balance approaches. This inherent bias can lead to contrasting ET patterns even with identical meteorological data, as we used for our models. This study delves deeper into how each model’s underlying assumptions influence ET calculations across diverse landscapes in the Mediterranean region. The lesson learned is that the choice of model should be driven by the specific objective. Short-term crop water requirement estimations benefit from process-based models that incorporate a wider range of factors beyond just climate and vegetation cover. Conversely, long-term and seasonal assessments might find energy balance models more suitable [
52,
53]. Traditional approaches that do not consider topographic and radiative influences on ET also surpass water or energy limits due to differences in the available energy −−should not be used for arid regions. As observed in the case of the TerraClimate and GLEAM estimates over Egypt, it is highly recommended to use near-surface air temperature with precipitation in dry ecosystems to derive patterns of ET [
54].
ET can have a lot of uncertainties due to the misinterpretation of the models, types of input, differences in scale and temporal/spatial resolution, and data coverage. This study rightly emphasizes the need for the careful evaluation of models before applying their output in critical areas like budget estimation, water productivity assessment, water use assessments, etc. [
55]. A rigorous comparison of various models’ accuracy with established ground-truth data or other reliable sources is crucial. We identified the strengths and weaknesses of these models across diverse Mediterranean ecosystems to guide researchers in selecting the most appropriate tool for each specific scenario. Revising STEPS and SCOPE model parameterization is recommended for dry ecosystems, i.e., rainfed, shrublands, savannahs, etc. The comparison showed a contrasting difference in ET patterns: the annual and seasonal patterns showed that SCOPE’s response was closer to potential ET across the region, and STEPS is more aligned with MOD16A2-GLEAM compared to SCOPE-TSEB. A fundamental difference in the modeled ET estimates stems from their underlying approach: some are driven solely by the energy balance approach (such as MOD16A2 and TSEB) [
33,
43], radiative forcing and energy balancing (SCOPE), while others couple energy and water balance approaches (like STEPS and GLEAM). This disparity might have led to different patterns over southern Europe and MENA, and even the use of common meteorological forcings has no effect. Surface condition parameters (e.g., LST, LAI, LULC) have significantly influenced ET calculations. LST and LAI, being major components of these models, are not temporally consistent with climatic parameters, and this is another limitation. This might have also influenced the accuracy of our daily ET. Similar behavior is observed in the comparison of Landflux, GLDAS, GLEAM, and ERA-Interim, which all failed to capture evaporative flux over irrigated croplands, so high-resolution LST and LAI forcings were suggested [
56]. In addition, some recent products, i.e., CSIRO MODIS ReScaled EvapoTranspiration (CMRSET), SSEBop, and WaPOR are found as less biased than MOD16A2 and GLEAM with an improved land surface parameterization for dry ecosystems. Integrating fractional vegetation cover and climatic water deficit into algorithms can also reduce model uncertainties [
57,
58].
The choice of variables in each model affects how they capture and represent the complex processes involved in ET, which affects the accuracy of ET estimates [
59]. LAI is the most influential parameter in our models, as in SCOPE and STEPS, it determines photosynthetic foliage and influences the interception of rainfall and the subsequent evaporation from the vegetation canopy. In TSEB, it determines the fraction of incoming solar radiation intercepted by the vegetation canopy. Other models like GLEAM emphasize the role of vegetation dynamics and precipitation in driving ET by relying on VOD and precipitation, their comparison was challenging due to differences in input data and estimation algorithms; however, this revealed that accuracy can be increased over the region by improving inputs and modifying parameters for dry ecosystems. The area-specific parameterization can improve ET models’ performance in a given area or basin [
60,
61]. The “one-size-fits-all” approach for model parameters might not be optimal for the entire Mediterranean with its diverse climate clustering. Future research should prioritize area/climate-specific parameterization. This involves customizing model parameters based on unique regional conditions of soil types, vegetation characteristics, and prevailing climatic patterns. Such fine-tuning can significantly enhance model performance in specific regions within the Mediterranean basin considering the extremely arid and humid conditions. Future research should focus on parameterizing and training models to handle these extremes effectively. This is particularly critical given the increasing atmospheric demand leading to flash droughts and floods across the region.
The assimilation of high-resolution remote sensing data with advanced modeling techniques holds immense potential for generating robust ET estimates. By addressing these areas, we can ensure a more robust scientific foundation for ET estimation in the Mediterranean. This will not only improve the accuracy of water resource management but also enhance our understanding of the complex hydrological processes within this diverse and ecologically sensitive region. This inter-comparison somehow facilitated comprehending the shortcomings of each model, and how they can be modified for their future adaptation to the heterogeneous Mediterranean region.
5. Conclusions
While many studies broadly compare annual and monthly ET estimates, this study provides deeper insights into modeled ET at the sub-regional, seasonal, and ecosystem scales with its implication in seasonal stress prediction. Errors in ET estimates can have profound implications for regional water budget assessments and evaluations of agro-ecological drought risk. Therefore, it is imperative to continue refining model parameterization and enhancing the accuracy of ET simulations. Comparing models like STEPS, SCOPE, and TSEB reveals inconsistencies in their ET estimations for similar meteorological conditions. This exposes inherent biases within each model’s algorithms and assumptions for Mediterranean conditions. These inter-model comparisons revealed the significant influence of factors like LST, LAI, and subsurface hydrology on regional-scale ET calculations. This knowledge can help improve datasets on these factors during model improvement and parameter refining.
Key insights are as follows:
STEPS encountered challenges in effectively modulating the seasonality of ET over different ecosystems. SCOPE gave higher evapotranspiration (ET) estimates during summer, and TSEB yielded lower ET estimates over forest ecosystems.
TSEB showed larger uncertainty in its current parameterization routine, particularly when applied at a scale of 10 km. However, studies have shown that its less than 15% discrepancies in validation against ground observations when used at high resolution. To enhance its applicability across the large Mediterranean region, a seasonal adaptation of the model while considering dominant vegetation cover for a given period is recommended [
62,
63], and more specific vegetation parameterization, especially for forests, is required.
GLEAM could not perform well over irrigated Nile Egypt. GLEAM’s limitations in irrigated areas suggest the need for an improved representation of crop phenology. However, it showed good results in semi-arid ecosystems of Spain and France, with potential use of microwave satellite-based products.
STEPS showed some spatial similarity with existing global datasets and produced satisfactory simulation results, over some forest and crop sites. However, for large-scale applications, it requires extensive sub-regional scale parameterization, as this generic parameterization has yielded unreliable results in some locations.
The evaporative stress index across different regions has shown robust agreement and reliability among these models in some areas, i.e., Morocco, Spain, and Egypt, where ground data were missing but the absence of significant trends over Italy, Balkans, and Turkey indicates the need for region-specific model calibration and validation for all models.
None of the models could holistically capture ET patterns across the entire region, and each has limitations and weaknesses in particular climates and ecosystems. This comparison, however, indicates areas of improvement in these models. Further investigation into model physics and forcing errors can enhance the overall robustness of ET simulations. Generic parameterizations used in some models (e.g., STEPS, TSEB, and SCOPE) can lead to unreliable results in certain locations. We should prioritize creating parameter sets tailored to specific regions and ecosystems within the Mediterranean. This will lead to more accurate model representations of local ET dynamics. Integrating high-resolution remote sensing data using advanced data assimilation offers significant potential for improving model accuracy. Tuning models for accurately representing extremely humid and arid environments within the Mediterranean will enhance their applicability in water management and environmental planning across the Middle East, North Africa, and Scandinavian regions.