1. Introduction
Precipitation is one of the most complex natural process in the hydrological cycle that undergoes momentous variability at both the spatial and temporal scales. The acquisition of accurate precipitation measurements is crucial since it is the main input in a wide range of applications such as climate change prediction, environmental studies, hydrological modelling, flood forecasting, drought monitoring, and water resources assessment. In addition, precipitation measurements at a high spatial and temporal resolution are crucial to properly simulate the hydrological states of natural systems. Precipitation characteristics, such as the rainfall pattern, intensity, probability distribution of rainfall and return periods, are considered the basis for studying the hydrological behavior of any catchment [
1,
2,
3,
4,
5,
6].
In the last three decades, meteorologists and hydrologists were attracted by advancements in satellite information technology; primarily their focus was on developing algorithms to retrieve precipitation data from information remotely collected by these satellites. These algorithms estimate precipitation amounts from the characteristics of clouds as interpreted from infrared (IR), visible (VI), and microwave (MW) satellite images [
7]. Generally, passive microwave (PMW) measurements have demonstrated high performance at the global scale compared to algorithms that were based on IR and VI, while precipitation estimations that are based on IR have a higher temporal resolution than others [
8,
9]. Currently, most of the recognized satellite-based precipitation products rely on a combination of MW and IR to benefit from their complementary strengths. For instance, the Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks method (PERSIANN) generates rainfall estimates by deriving relations between MW data and IR data [
10]. Furthermore, other methods such as Climate Prediction Centre Morphing Method CMORPH generates rainfall estimates that are derived from MW data, whereas IR data are used to propagate rain pixels by a tracking approach to derive a cloud’s motion field [
11].
The first devoted meteorological precipitation satellite is the Multi-satellite Precipitation Analysis (TMPA) 3B42R that was produced by the Tropical Rainfall Measuring Mission (TRMM). The National Aeronautics and Space Administration (NASA) launched the successful satellite in late 1997. The TRMM products were used widely, and it provided two products: the post-processed product (3B42-V7) and the near-real-time product (3B42RT). TMPA applies an estimation method that relies on using IR-calibrated estimates with MW estimates. The TMPA product provides estimations of tropical precipitation with good accuracy [
12,
13,
14,
15]. Recently, in 2014, NASA and Japan Aerospace Exploration Agency (JAXA) cooperated to launch the Global Precipitation Measurement (GPM) satellite, after the impressive success of TRMM. It consists of one main observatory satellite and ten other partner satellites, carrying an up-to-date Dual-frequency Precipitation Radar (DPR), GPM Microwave Imager (GMI), and other innovative instruments [
12,
16]. The satellite is anticipated to perform efficiently in the prediction of flood hazards and reduce uncertainties in estimating short-term precipitation as it has a high spatiotemporal resolution [
17,
18]. According to NASA, the GPM provides four levels of data which are Level-0, Level-1, Level-2 and Level-3. The Level-3 product is the Integrated Multi-satellitE Retrievals for GPM (IMERG), released in early 2015, and has since gained more attention and recommendations from researchers and practitioners. IMERG products have a high resolution (spatially 0.1° latitude × 0.1° longitude) and multiple temporal resolutions (ranging from half-hourly, up to monthly basis). It includes three modes of output namely an, early, late, and final run product based on latency and accuracy.
One of the often-discussed challenges for satellite precipitation retrievals is estimating precipitation over areas of complex topography, where precipitation has high spatiotemporal variation [
9]. It is not common to find rain gauges in mountainous regions, particularly in Saudi Arabia, due to accessibility issues. In addition, since most developments exist in lowlands, it follows that most of the rain gauges are concentrated in lowlands, while the highlands are left under-represented. Since this under-representation can be augmented with satellite products, researchers have started focusing on evaluating satellite precipitation products over complex topography [
15,
19,
20,
21,
22,
23]. These efforts represent a good start, but more research is needed to cover different topographic and climatic regions around the globe. Results from previous studies show that the main sources of error in satellite precipitation measurements are from IR and PMW retrievals [
3,
15,
19,
24]. In addition to complex topography, low rainfall intensity (light rainfall) events represent another challenge for satellite precipitation products. Although light rain covers vast areas of the globe, particularly the subtropics, to date limited studies are attempting to evaluate the accuracy of satellite products in the detection of light rain. Previously, TRMM Precipitation Radar TRMM PR was only sensitive to precipitation exceeding 0.5 mm/h. This is particularly relevant to the evaluation of GPM precipitation products, since one of its main missions is to improve the monitoring of snowfall and light rain by using highly sensitive PMW sensors, which are sensitive to precipitation as low as 0.2 mm/h [
25].
In the past decade, several studies were conducted to evaluate the detection accuracy of satellite rainfall products in arid and semiarid regions. One study evaluated seven satellite rainfall products over an arid and semiarid region that spans from North-Western Africa to North-Western India [
26]. These satellite precipitation products included CMORPH, GSMaP-MVK, TRMM (3B42), TRMM (3B42RT), African rainfall climatology, and African rainfall estimation algorithm. Their findings showed that satellite precipitation measurements had an overall positive detection performance, particularly during the wet season. Few studies, however, specifically focused on evaluating the performance of GPM IMERG products over arid regions. The latest studies [
27,
28] assessed the performance of GPM IMERG rainfall products over the United Arab Emirates and Egypt, respectively. Both are considered arid countries with high rainfall scarcity. Mahmoud et al. [
27] found that the calibrated IMERG product outperformed the near-real-time products, giving the highest detection accuracy and the lowest errors. Moreover, Nashwan et al. [
28] evaluated two products other than IMERG, namely: Global Satellite Mapping of Precipitation (GSMaP), and Climate Hazards Group InfraRed Precipitation with Station data (CHIRPS). The results showed the superiority of IMERG products in rainfall occurrence detection and its capability to represent rainfall spatial variability during extreme events. Moreover, two earlier studies evaluated the accuracy of the TRMM products over Saudi Arabia and found that there was considerable variation in the accuracy of the products for different events and sub-regions; the conclusion was that TRMM cannot be the only source of data for hydrological applications as it provides limited input information [
16,
29]. A more recent study validated the IMERG products over Saudi Arabia, and the findings showed that the final-run product had a better performance than other products in detecting and estimating precipitation in the study area [
4]. This study was, however, limited to only one rainy season, and did not investigate the impact of varying topographical regions and rainfall intensity on the performance of satellite measurements. A more comprehensive study is thus needed to address these research gaps and investigate the ability of GPM satellite instruments in overcoming the impact of complex topography, and verifying the claim that it can detect low rainfall intensity events, which has not been addressed in arid and semi-arid regions—such as Saudi Arabia—to date.
The main purpose of this study is to evaluate the performance of GPM IMERG V05B products, namely early, late, and final run products, over Saudi Arabia. The evaluation was conducted by using ground observation data acquired from 275 rain gauge stations at daily scale over the period from March 2014 to June 2018. The evaluation of IMERG satellite products in this study was threefold: (1) evaluate the impact of spatial characteristics (topography) on IMERG products; (2) evaluate the variation in IMERG performance in different seasons (temporal); and (3) evaluate the performance of IMERG products in detecting rainfall of different intensities. More importantly, for the first time in arid and semi-arid regions, this study evaluated the performance of IMERG precipitation products across the five rainfall intensity categories (light, moderate, heavy, storm, and strong storm) with a focus on light rain detection. The outcome of this analysis could verify the utility of IMERG products as a source of rainfall data over arid regions such as Saudi Arabia. This analysis could also highlight the use of GPM data in forecasting and early warning against potentially hazardous rainfall events in less prepared arid regions.
2. Materials and Methods
The primary objective of this study is to assess the capability of the IMERG products for detecting the rainfall under low intensity, and over various topography over Saudi Arabia. The evaluation was carried out in the following main steps: preparation of rain gauge data, processing of the satellite data, and performing the spatiotemporal and intensity-based evaluation of the GPM satellite data versus the gauged-based data using a set of statistical indices.
2.1. Study Area
This study is focused on the country of Saudi Arabia that occupies an area of about 2,250,000 km
2, which is just under eighty percent of the Arabian Peninsula. The country has complex topographical features and falls between 34°–55°E and 16°–32°N, as represented in
Figure 1. Its vast area (with a wide latitude expanse) combined with its topographical variation resulted in diverse rainfall rates over the area [
4,
30]. The country has thirteen administrative regions and about 400 rainfall stations [
31].
Although the country has a wide climatic range due to spatial and temporal temperature variability, it is considered one of the driest countries in the world [
30]. Based on the aridity index, the majority of the country’s area is classified as a desert climate, where precipitation is infrequent, and temperatures are high, with the exception of the mountainous region as it is considered a semiarid region. Saudi Arabia is sandwiched between the massive continental land of Africa and Asia and is at the same time close to the circum-global latitudinal belt, that has high atmospheric pressure. These factors make Saudi Arabia one of the hottest and lowest humidity countries in the world, except for its coastal lands [
30,
32].
In general, precipitation over Saudi Arabia is variable. In the northern half of the country, the rainy season starts in October and ends by April while there is almost no precipitation for the remainder of the year. Rain in this area results from the feeble weather originating from the Mediterranean or North Africa. The southwestern region, on the other hand, experiences a different precipitation pattern resulting from a mountain range that extends to western Yemen in a north–south orientation along the Red Sea, reaching heights of more than 1500 m. These mountains cause an uplifting of the Indian monsoon and the occurrence of heavy rainfall in the region. Overall, this part of Saudi Arabia is characterized by rainfall throughout the whole year due to convective and orographic rain driven by topography [
30,
32]. Overall, in the desert areas, the mean annual precipitation is less than 100 mm while in mountain areas it ranges between 250 and 300 mm [
30,
31,
33].
Figure 1b shows the annual rainfall (mm) over Saudi Arabia during the period 1979–2009 [
30].
Table 1 provides summary statistics of annual rainfall based on rainfall events for the period from March 2014 to June 2018.
Appendix A shows descriptive statistics of large rainfall events that occurred during the study period (2014 to 2018).
2.2. Rainfall Datasets
2.2.1. Rain Gauge Dataset
Ideally, well-distributed and sufficiently dense rain gauge stations should be used for such a study. Rainfall data were obtained by special request from the Ministry of Environment, Water, and Agriculture (MEWA) of Saudi Arabia, the data were provided in the form of daily rainfall records. MEWA is the authority that owns, operates, and maintains the rain gauges used in this study [
4]. The data included 275 gauges distributed over the country (see
Figure 1) and extended for the period from March 2014 to June 2018. Overall, the data covers most of the study area with a slight variation in distribution. It can be observed that the rain gauge stations are concentrated in the western and southwestern parts of the country as well as in the middle part, which has the country’s capital city Riyadh. The eastern part also has a good distribution of the gauges. On the other hand, the northern part and the southeastern parts of the country have a sparse distribution of rain gauge stations.
In the past, Saudi Arabia had two meteorological authorities that observed and provided the precipitation data, namely MEWA and The General Authority of Meteorology and Environmental Protection. Currently, both datasets are merged under MEWA; however, in this study, we excluded the data of The General Authority of Meteorology and Environmental Protection because this data is used to produce the Global Precipitation Climatology Centre (GPCC) product, which is used to calibrate the GPM IMERG-F. Thus, to make this study independent, we excluded these stations.
2.2.2. IMERG Dataset
The IMERG V05B data are available from NASA at a spatial scale of approximately 11 × 11 km
2 and between 60°S–60°N with different temporal scales. In this study, the finest temporal resolution (half-hourly) data were used. It includes three modes of runs, near real-time: early-run (IMERG-E) and late-run (IMERG-L), and post-real-time: final-run (IMERG-F). The differences between the three products are the time of release and the calibration process. The near-real-time products are pure satellite products, which are released 4 h and 12 h after a real-time, respectively; while the post-real-time IMERG-F is calibrated with the GPCC data and released after about 2 months. The IMERG products were requested and collected from NASA’s website through the link (
https://pmm.nasa.gov/data-access/downloads/gpm).
2.3. Data Preparation and Processing
Data preparation was carried out in several steps. The first step involved sorting and storing ground precipitation data in a database. Then IMERG data were processed and converted to an ASCII format. This step required processing of 75,480 files to cover the period from March 2014 to June 2018. The processed satellite data were then adjusted to match Saudi Arabia’s local time (time was converted from Universal Time Coordinated (UTC) to Local Standard Time (LST), which is GMT +3) and aggregated from half-hourly to daily to sustain the homogeneity in comparison with gauge-based data, which were available only at a daily temporal resolution.
Several modules (scripts) were developed to carry on the analysis and further steps in data processing and preparation (
Figure 2). The first module determines large rainfall events that occurred during the study period based on ground observations. Due to the scarcity of the rain events over Saudi Arabia, the data analysis and evaluation process in this study were based on identified rain events and not on the time series rainfall data. Thus, the false positives cannot be analyzed as most of the data in the time series indicate zero rainfall. The second module deals with data mining, which is a process of specifying the nearest GPM grid point that represents the rain gauge station (point to point analysis). Thus, the outputs of the second module are two datasets that include rain gauge data and their corresponding IMERG estimates. The final module handles data analysis and visualization by applying statistical performance measures to evaluate the accuracy of the IMERG products.
Figure 2 demonstrates the structure of each module and how each part of the analysis was implemented.
2.4. Statistical Evaluation Indices
Quantitative statistical indices were used to evaluate the accuracy of the GPM IMERG products against ground station observations. In this study, statistical measures were divided into three main indices: the detection accuracy indicator, error and bias indices, and the correlation coefficient index (
Table 2). The probability of detection (POD) measures the ratio of ground observations that were correctly detected by the IMERG estimates. The second group measures the level of error and bias in IMERG products; it includes a mean absolute error (MAE), root-mean-square error (RMSE) and relative bias (RBIAS). Usually, RMSE is either larger than or equal to the MAE. However, the main difference between MAE and RMSE is that the latter correlates with the variance in the individual errors in an analyzed dataset. Finally, the consistency of the IMERG products with ground observations was tested using the correlation coefficient (CC). The equations of the statistical indices presented in
Table 2 are based on earlier peer-reviewed studies in the field of satellite products evaluation [
17,
34].
Where is number of records, is the detected precipitation value by the satellite, is the observed rainfall value by ground stations, and are mean values of and , is the number of observed and detected events by both satellite and rain gauge, is the number of events that are observed by rain gauge but not detected by the satellite, is the number of events that are not observed by the rain gauge but detected by the satellite.
2.5. Evaluation Techniques
To comprehensively evaluate the accuracy of GPM IMERG precipitation products, three evaluation scales were adopted, namely seasonal evaluation (temporal evaluation), rainfall intensity-based evaluation, and topographical (spatial) evaluation.
2.5.1. Seasonal Evaluation
The seasonal evaluation was conducted by analyzing rainfall on a seasonal basis throughout the study period. This evaluation intended to investigate the capability of IMERG products to accurately detect rainfall in each season and determine which product has the capability to represent the variability in rainfall distribution within each season. Ground observations and IMERG datasets were prepared and aggregated to represent the four seasons of Saudi Arabia, which are December to February (winter), March to May (spring), June to August (summer), and September to November (fall).
2.5.2. Rainfall Intensity-Based Evaluation
The objective of this evaluation was to explore the accuracy of GPM satellite products in detecting different precipitation intensities ranging from light rain to large storm events. This article will focus more on the detection of light rain since it is the most frequent type of precipitation in arid regions such as Saudi Arabia. Data aggregation was neither strictly temporal nor spatial, but rather a combination of both. The analysis encompassed the evaluation of the detectability of events that had fallen within the study area, and throughout the entire study period, categorized only by precipitation intensity. The approach required imposing various thresholds to classify precipitation intensities (
Table 3), these were adopted from the classification by the Chinese Meteorological department [
35].
2.5.3. Topographical (Spatial) Evaluation
The spatial evaluation was conducted based on the topographical characteristics of the study area. This evaluation is intended to assess the influence of topography on the performance of the IMERG precipitation measurements. This is particularly important since Saudi Arabia has a complex and highly varying topography. Data were aggregated for each analyzed topographical region, and the statistical measures were applied to compare between the two aggregated datasets formulated by point to point matching (rain gauge observation and corresponding satellite estimates). For example, if a set of rain gauges fall within one region, the observations from these rain gauges will form one dataset to be compared to the dataset formed from the matching (nearest) satellite point precipitation estimates. The study area was divided into five topographical regions as shown in
Figure 3, these regions were specified in previous studies such as [
36]. The topography in Saudi Arabia varies from low altitudes in the coastal areas (0 up to 100 m) to high altitudes in the mountainous areas (more than 2000 m).
Table 4 shows the distribution of ground stations and the number of observations (precipitation events) across the different topographical regions. It can be observed that there are few stations located in the high mountain regions, which resulted in a very low station density, whereas the inland region has the highest number of rain gauges.
5. Conclusions
This study assessed the performance of the GPM IMERG V05B products, including early, late, and final-run products, over Saudi Arabia. Ground observations from 275 rain gauge stations over the period March 2014 to June 2018 were used as a reference. The performance of IMERG satellite precipitation products was assessed using three evaluation techniques: seasonal-based (temporal) assessment, topographical (spatial) evaluation, and rainfall intensity-based evaluation. Quantitative statistical indices were used to quantify the performance of the IMERG products. The main conclusions of this study can be summarized as follows:
The seasonal analysis showed an improvement in the performance from IMERG-E, to IMERG-L, to IMERG-F. Nevertheless, all IMERG products showed very weak correlations with ground observations throughout all the seasons.
Spring and summer are the most detected seasons by IMERG products. This leads to a conclusion that IMERG products have the capability to detect seasonal rainfall with both the highest (maximum daily rainfall observed on spring) and the lowest precipitation.
It was interesting to observe the high performance of IMERG products across the various rainfall intensity classes. According to the calculated classical statistical indices, the light rain had the lowest detection errors by IMERG products. However, higher rainfall intensities exhibited higher detection errors in the IMERG products. The detectability of rainfall, as indicated by POD, was excellent for midrange classes of rainfall, whereas that for extreme rainfall intensities (light rain and large storms) was slightly lower. This finding is particularly promising for the applicability of IMERG products in arid regions dominated by light rain events.
Even though the CC values are generally low for different rainfall intensities, large storm events showed significantly higher CCs (0.5 to 0.7) compared to lower intensity events. This is probably induced by the rarity of such large storms over arid regions such as Saudi Arabia.
Topographical features had a significant influence on the performance of IMERG products. The detectability (POD) was improved significantly in higher altitudes (mountains and foothills regions), particularly for IMERG-F. However, the areas adjacent to the coasts showed a significant reduction in the estimation errors of IMERG-F, whereas the highest estimation errors were observed in coastal regions, foothills, and mountainous regions.
In conclusion, IMERG products have great potential for improving the temporal resolution of rainfall data. In addition, it could play a significant role in complementing or filling the spatial gaps in rainfall observations, especially for extremely arid regions such as Saudi Arabia. Furthermore, the consistency of the IMERG products in detecting high rainfall intensities could provide significant information on managing and mitigating hazardous events such as flash floods, which are characteristic of arid regions. Finally, due to the errors associated with the IMERG precipitations estimates, it is recommended to use sub-daily rainfall data to calibrate the satellite algorithms for better rainfall estimation over arid and semiarid regions. In addition, further investigations are needed to assess the impact of rain gauge density on the evaluation of satellite precipitation products [
41].
It is recommended that future studies incorporate uncertainty analyses into the investigation of satellite performance. Sources of uncertainty may include errors from ground observations (such as instrumental problems, weather conditions, and data gaps); or errors from the satellite instrument itself. Several factors could affect the satellite estimation of precipitation such as the brightness of clouds in the visible spectrum, and the misidentification of clouds. In addition, there are uncertainties due to error propagation through the IMERG algorithms into the applications relying on IMERG products, such as hydrological models. Thus, a comprehensive study on the uncertainties of the IMERG precipitation products and the propagation of error is highly recommended.