Evaluation of Satellite Precipitation Products and Their Potential Influence on Hydrological Modeling over the Ganzi River Basin of the Tibetan Plateau

1State Key Laboratory of Hydrology-Water Resources and Hydraulic Engineering, College of Hydrology and Water Resources, Hohai University, Nanjing 210098, China 2Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou, Gansu 730000, China 3Research Center of Fluid Machinery & Engineering, National Research Center of Pumps, Water Saving Irrigation, Jiangsu University, Zhenjiang 212013, China


Introduction
Water resources management in remote regions or heterogeneous terrains, particularly for the mountain river basins, is one of the most important challenges facing decisionmakers and hydrologists due to the extreme scarcity of in situ monitoring stations, leading to dramatic effects on the ecological, agricultural, and economic activities [1].However, hydrological modeling can play a helpful role in effective water resources management under the absence of hydrological data [2], but there is still an urgent need to overcome the problematic lack of in situ meteorological data (i.e., precipitation, temperature, and wind speed) for many hydrological, hydrometeorological, and climatological applications.
Among these components, precipitation is one of the necessary forcing inputs of the global water cycle that cannot be exempt and is essential in order to satisfy requirements for calculating various land surface hydrological models.Additionally, the precision of precipitation measurements at spatiotemporal representations has a great influence on the effective predictions of hydrological models [3,4], as highly accurate representations may reduce the uncertainties in simulating the hydrological processes of watersheds [5,6].
A conventional approach in estimation of precipitation amount involves meteorological radar observations and/or there are no thorough investigations in mesoscale basins, especially so in the ungauged regions of the southeast Tibetan Plateau, which may be associated with poor performance of satellite precipitation estimates in hydrological simulations as compared to those of large basins [5,28].Additionally, the applicability of the latest satellite precipitation datasets for hydrological modeling framework, including 3B42, 3B42RT, PERSIANN-CDR, CMORPH-CRT, seems to lack adequate coverage in the literature, especially for the Ganzi River Basin (GRB) in the southeast Tibetan Plateau.
Considering these issues, the specific research objectives of this study are to (i) evaluate and compare the capability of the latest four satellite precipitation products to characterize precipitation patterns and capture the magnitude of precipitation events over GRB; (ii) investigate the use potential and limitations of satellite precipitation estimates as forcing inputs in driving the HEC-HMS model; (iii) investigate their influence on the simulation of daily hydrological processes at the basin scale.
The rest of the article is structured as follows: Section 2 presents a brief overview of the study area and datasets used.Section 3 provides a description of the HEC-HMS hydrological model, followed by a detailed discussion of the calibration procedures and simulation scenarios for the hydrologic model and then lists statistical criteria of its performance evaluation.The results of the comprehensive evaluations of the four satellite estimates and their hydrological utilities are compared and discussed in Section 4. Finally, the conclusions and summary of this study are given together with some advice for future studies in Section 5.

Study Area.
The region under study focuses on the GRB located at upper part of the Yalong River in southeastern Qinghai-Tibetan Plateau, China, as shown in Figure 1(a).The GRB covers a drainage area of approximately 32925 Km 2 and extends across the geographical range from 31.5 ∘ to 34.25 ∘ northern latitude and from 96.75 ∘ to 100 ∘ eastern longitude.The average altitude of the drainage area is around 4500 m with the highest elevation of 6102 m in the western and upper part of the basin and lowest elevation of 3394 m in the southeastern plains.
The basin is a cold and dry climate zone characterized with a dry winter and a rainy summer.The average annual maximum and minimum temperature within the region are about 15 ∘ C and −10 ∘ C, respectively.The highest temperature of 31 ∘ C is detected in July, and the lowest temperature of −28.9 ∘ C in January.The average annual precipitation reaches 650 mm, 90% of which occurs during rainy season (from May to October) and 10% in the rest of the year.The snowfall contribution of the total annual precipitation ranges from 50% in the relatively high areas to less than 30% in the low lying areas.The basin considered in this study is affected by solid precipitation when the temperature is less than or equal to 0 ∘ C. The average annual discharge at the GBR's outlet is about 290.82 m 3 s −1 with a maximum discharge amounting to 1820 m 3 s −1 , which occurred in the period between 1 January 2000 and 31 December 2012.
The land use and land cover classes consist of various types in the GBR, with croplands, herbaceous vegetation, grassland, forest, and bare areas being the main types.The distribution of land use ranges from croplands in the central and lower areas of the GBR to grassland and herbaceous vegetation in the upper areas (Figure 1(b)).Soil types of the watershed are dominated by Leptosols (82%), Gleysols (7%), Cambisols (5%), and Greyzems (3%).Other types include Histosols, Luvisols, Phaeozems, and Glaciers, and ice covers 3% of the total area (Figure 1(c)).

Datasets Description.
In the current study, due to restrictions on the availability of the rain gauge observations over the GRB, we have selected the derived datasets from satellites and ground gauges over a six-year period from 1st January 2008 to 31st December 2013.The choice of this period is attributed to the availability of both satellite and gauged precipitation datasets.

Gauge-Based Synthesis Datasets.
The historical records of daily datasets were collected from three meteorological ground-based stations, which consist of maximum and minimum air temperature, precipitation, and evapotranspiration; in particular the gauged precipitation estimates are first used as the reference datasets to evaluate the satellite-based precipitation products.Figure 1 shows the distribution of these stations over the GRB, and Table 1 summarizes their basic information involving latitude, longitude, altitude, average annual precipitation, and average annual min/max temperature.Besides meteorological observations, daily observed streamflow records at the outlet of the GRB were collected from the Ganzi hydrological station.Besides meteorological observations, daily observed streamflow records at the outlet of the GRB were collected from the Ganzi hydrological station.Besides meteorological observations, daily observed streamflow records at the outlet of the GRB were collected from the Ganzi hydrological station.
In this study, all meteorological and hydrological datasets were reported during the period 2008-2013, which are used as forcing inputs into the HEC-HMS model to generate streamflow simulations.It is also noteworthy that, not only to ensure effective and efficient comparison of the four satellite precipitation products versus gauge-based precipitation data but also to satisfy the requirement of the HEC-HMS hydrologic model inputs, the daily basin averaged precipitation was conducted by using the Thiessen polygon method, which is recommended as one of the simplest and most robust interpolation methods by Grayson and Bloschl [29].

Satellite-Based Precipitation Datasets.
In this section, we discuss the main characteristics of the four different satellite-based precipitation products used and investigate their suitability to capture precipitation patterns and extremes over GRB.These precipitation datasets were considered to be appropriate in this study because of their high spatial and temporal resolution, coverage domain, and periods availability.A description of these products is summarized as follows.

Hydrological station Meteorological station Boundary River Elevation (m)
High: 6102 Low: 3344  In particular, CMORPH is one type of near real-time satellite products, for which estimates are available about 18 hours past real-time.Data are available at various temporal and spatial resolutions, including 30 minute at ∼8 km resolution, 3-hourly at 0.25 ∘ lat/lon resolution, and daily at 0.25 ∘ lat/lon resolution.
Recently, the NOAA-CPC has produced new satellite precipitation datasets, called CMORPH Version 1.0.The main differences between the old version 0.x and the latest version 1.0 can be summarized as follows: The fixed algorithm and satellite precipitation datasets of fixed versions over the entire TRMM/GPM era (1998-present) were used, especially to ensure best possible homogeneity, in the latest version 1.0, whereas since 2002 the old version 0.x has been established using varied improving algorithms and changing versions of satellite-based precipitation products inputs [20].
Moreover, the CMORPH Version 1.0 datasets are available in the form of three products: a pure satellite precipitation product (CMORPH-RAW) as well as bias-corrected product (CMORPH-CRT), and gauge-satellite blended product (CMORPH-BLD), while the old version 0.x exclusively contains satellite only precipitation product [31].
Hence, 1-daily 0.25 ∘ × 0.25 ∘ CMORPH-CRT product is validated over GRB using gauged precipitation observations.The datasets in this study were freely obtained from the agency of the NOAA-CPC website (ftp://ftp.cpc.ncep.noaa.gov/precip/CMORPHV1.0/).The main feature of the CMORPH-CRT is that the probability density function (PDF) matching was used to conduct bias correction through adjusting the original CMORPH satellite precipitation estimates against the CPC unified daily gauge analysis over land and the pentad GPCP analysis over ocean [32].A full description of the CMORPH-CRT analysis and its applications has been provided by Xie et al. [31] and Xie et al. [32].
(ii) PERSIANN-CDR.The original PERSIANN, first established by Hsu et al. [33] at the University of California, Irvine, is one of the popular satellite-based precipitation algorithms for estimating historical precipitation from March 2000 to present.The PERSIANN algorithm has been developed by combining IR and PMW observations from Geostationary Earth Orbiting (GEO) and Low Earth Orbit (LEO) satellite imagery, respectively, for global precipitation estimation.Depending on an Artificial Neural Network (ANN) model in the PERSIANN algorithm, the local cloud textures provided by the geostationary satellite longwave infrared images (∼ 10.2-11.2m) approach are used to estimate surface rainfall rates, and it updates its network parameters based on the TMI2A12 product from the low-inclination orbiting TRMM satellite [34].
Compared to its existing product, the latest PERSIANN-CDR (PERSIANN-Climate Data Record) product was newly developed by Ashouri et al. [21] as a multisatellite, highresolution, and posttime precipitation product.The PER-SIANN-CDR product first used the archive of the Gridded Satellite (GridSat-B1) IR data [35] as the input to the trained PERSIANN model; then the biases in the PERSIANNestimated precipitation were adjusted by the Global Precipitation Climatology Project (GPCP) monthly 2.5 ∘ product version 2.2 [21].
Since no PMW is used in PERSSIAN-CDR product, the ANN model parameters were pretrained using the National Centers for Environmental Prediction (NCEP) stage IV hourly precipitation data; then the model is run using the full historical record of GridSat-B1 IR data with fixed model parameters as those reported in a calibration scheme by Ashouri et al. [21].
Currently, this version of PERSIANN-CDR is only available with high spatial resolution of 0.25 ∘ × 0.25 ∘ and daily temporal resolution.The precipitation datasets, from 1st January 1983 to present, have been collected and distributed by the NOAA's National Climatic Data Center (NCDC), as well as the Center for Hydrometeorology and Remote Sensing (CHRS) at the University of California, Irvine.In this study, the PERSIANN-CDR product datasets of the entire study period were downloaded from ftp://data.ncdc.noaa.gov/cdr/persiann/files/.More information on PERSIANN-CDR product can be found in [21,36]; thus this article only gives a brief description.
(iii) TMPA 3B42V7 and 3B42RT.The TMPA product [37] was launched in November 1997, jointly between the National Space Development Agency (NASDA) of Japan and the National Aeronautics and Space Administration (NASA) of the United States.It is designed primarily to monitor and study tropical precipitation for weather and climate research.
According to Huffman et al. [22], the accurate precipitation estimates of satellite TMPA were produced by merging three types of observations such as PMW, IR, and PR from multiple LEO and Geo satellites and ground observations at a spatial resolution of 0.25 ∘ × 0.25 ∘ , with a temporal resolution of 3-hourly; however, PR, PMW, and IR operate within the global latitude belt from over 35 ∘ N to 35 ∘ S, 40 ∘ N to 40 ∘ S, and 50 ∘ N to 50 ∘ S, respectively.Thus, the TMPA algorithm could be executed using the following three steps: first, the PMW precipitation estimates are calibrated and combined to generate the most accurate estimation from PMW; second, the calibrated PMW are used to create IR precipitation estimates; and finally, both the PMW and IR precipitation estimates are merged to provide the best TMPA precipitation estimates.
In this research, the TMA product's latest version 7 was employed, which was released in May 2012 by the NASA Goddard Earth Sciences Data and Information Services Center (GES DISC).The substantial improvement of the TMPA version 7 product is attributed to a combination of factors including the following: (1) The inclusion of additional satellite datasets, such as the Microwave Humidity Sounder (MHS), the entire operational Special Sensor Microwave Imager/Sounder (SSM/IS), and the Grisat-B1 IR datasets [38]; (2) using the uniformly reprocessed input data, surface precipitation gauge analysis, and a latitude-band calibration scheme for all satellites [39].
It is known that the TMPA version 7 consists of two standard products: near real-time version (3B42RT, hereafter) and post-real-time version (3B42, hereafter), which are summarized here.The 3B42 provides the bias-corrected precipitation estimates by inclusion ground gauge observations from the Global Precipitation Climatology Center (GPCC) and the Climate Assessment and Monitoring System (CAMS) at the monthly scale.The 3B42 is therefore available about 10-15 days after the end of each month, while the 3B42RT is available approximately nine hours after realtime observations, making it especially suitable to flood forecasting researches.The main difference is that the TRMM Combined Instrument (TCI) dataset has been used in 3B42 for calibration; however, it was replaced by the TRMM Microwave Imager (TMI) dataset in 3B42RT.The 3B42 and 3B42RT products have been utilized since January 1998 and March 2003, respectively, with a 0.25 ∘ × 0.25 ∘ spatial resolution and a 3-hourly temporal resolution.For detailed information regarding the 3B42 and 3B42RT products, the reader is referred to Huffman and Bolvin [40,41].
However, both the 3B42 and 3B42RT data used in this study were freely obtained from the following website: https://giovanni.sci.gsfc.nasa.gov/giovanni/.It is worthwhile to mention that both the 3B42 and 3B42RT precipitation datasets were aggregated into daily scale before use due to the hydrologic model requiring daily input data as well as for being needed for comparison objectives.Overall, the four satellite products (CMORPH-CRT, PERSIANN-CDR, 3B42RT, and 3B42) were considered in this study, since they have not been previously evaluated and tested over the GRB.Further details on the nature of the four satellites are presented in Table 2.

HEC-HMS Hydrologic
Model.The hydrologic model used in this study is the Hydrologic Engineering Center-Hydrologic Modeling System (HEC-HMS) model [42,43], which is well known as the semidistributed conceptual hydrologic model.The model has been extensively and successfully documented in much of the hydrological watersheds under climate conditions (humid, tropical, subtropical, etc.), since its development in the 1989s [44][45][46].
This study used HEC-HMS version 4 which was recently developed in 2013 [42].To set up HEC-HMS to simulate hydrologic processes for GRB, the model input requires a Digital Elevation Model (DEM), meteorological data, soil type, and land use.DEM data with spatial resolution of 30 m was generated from the U.S. Geological Survey (USGS) (https://gdex.cr.usgs.gov/gdex/).Land use data was obtained from the GlobCover 2009 land cover map (http://due.esrin.esa.int/page globcover.php).Soil map was derived from the soil and terrain (SOTER) database developed by FAO and the International Soils References and Information Center (ISRIC) (http://www.isric.org/data/soil-and-terraindatabase-china).
In the Geospatial Hydrologic Modeling Extension (HEC-GeoHMS) version 10.1 with ArcHydro extension in ArcGIS 10.2, the GRB was divided into fifty-one subbasins connected by a stream network (twenty-five reaches and twenty-five junctions) based on available DEM data, and all initial values of the model parameters were calculated, which were further used as the hydrologic input files for an HEC-HMS project.Figure 2 shows the schematic diagram of HEC-HMS of the GRB after delineating, using HEC-GeoHMS.For a more detailed description of HEC-GeoHMS, see the user's manual for HEC-GeoHMS version 10.1 [47].
There are three main components in HEC-HMS including basin model, meteorological model, and control specifications.The physical characteristics of the watershed and connectivity data are described in the basin model.The meteorological boundary conditions such as precipitation, snowmelt, and evapotranspiration data for the subbasins are prepared in the meteorological model.The time period of a simulation run is determined in the control specifications which include a starting date and time, an ending date and time, and a computation time step.The hydrologic simulation process requires a combination of datasets for each model.For a detailed description of the HEC-HMS simulation mechanisms, the reader is referred to HEC-HMS user's manual [42] and the Technical Reference Manual [43]; therefore, only a brief description is provided here.
The streamflow simulations of watershed in the HEC-HM model are carried out in four different runoff components: rainfall loss component; surface runoff component; baseflow component; and river routing component.In this study, the following methods are applied: (i) the deficit and constant loss (DCL) method for computing the actual infiltration losses, (ii) the Clark unit hydrograph method for transforming direct runoff of excess precipitation on a watershed, (iii) the exponential recession method for specifying the subsurface flow rates, and (iv) the Muskingum routing method for flow routing in each reach.
Additionally, the temperature index snowmelt method was used to measure runoff from snowfall by determining the volume of snow water equivalent (SWE), which is the depth of water from melting a unit column of the snowpack.This method included in the meteorological model of HEC-HMS has been considered as the simplest mathematical method for modeling the amount of snowmelt runoff in many previous studies [48,49].

HEC-HMS Calibration and Simulation Scenarios.
In the previous section, we have identified methods that are included in HEC-HMS for runoff estimates; however, each method requires a specific number of parameters, which can be evaluated directly from the characteristics of the subbasin and channel and GIS maps [44,46,49], as well as recommendations from HEC-HMS developers [42].
Even with the recent suggestions in the selection of HEC-HMS parameters values, Wheater et al. [50] found that reliable estimation of model parameters through a calibration process can lead to reduced model uncertainty and improve streamflow simulations.As the literature review [51] shows, necessity calibration for the temperature index snowmelt parameters can improve the efficiency simulating snowmelt runoff.Finally, considering these issues, a total of 13 HEC-HMS parameters related to rainfall loss, surface runoff, baseflow, river routing, and snowmelt runoff were chosen for model calibration at a daily time step.The physical meanings of each parameter and the recommended initial ranges for the parameters are presented in Table 3.
Despite the HEC-HMS model providing the automatic optimization algorithms (Nelder-Mead simplex method and the univariate gradient search), previous numerous studies explained that these methods are still not able to deal with all parameters, which can affect the model accuracy.Hence, to overcome this problem, the HEC-HMS for GRB was calibrated manually using trial and error method to yield best fit between observed and simulated hydrographs as evidenced by Gyawali and Watkins [48].The available observed data including daily precipitation, streamflow, air temperature, and evapotranspiration were divided into two 3-year periods (2008 to 2010 and 2011 to 2013); thus the HEC-HMS of the study area was calibrated for the first three years and then was further validated for the last three years.
Most importantly, Jiang et al. [52] highlighted that the streamflow simulation results of different precipitation inputs in both spatiotemporal resolutions and accuracy could be somewhat similar through model calibration with each of the input data.As a consequence, in the present study, to investigate the effect of the four satellite precipitation products on streamflow simulations, we performed the proposed calibration scheme by creating two simulation scenarios.In the first scenario, the HEC-HMS model was calibrated and validated using gauged precipitation datasets, and then the model was run again using forcing inputs from the four satellite-driven precipitation products with unaltered calibrated parameter values of gauge observations.In the second scenario, the four satellite precipitation products were used as the forcing inputs to recalibrate the HEC-HMS model and then for validation as in the same periods of scenario I.It is important to note that the first scenario is usually conducted because it provides better comparison of the streamflow simulation accuracy between the satellite products and the gauged precipitation observations, whereas the second scenario is aimed at examining the influence of satellite precipitation datasets uncertainty on streamflow simulations as reported in [11,14].

Statistical Criteria of Performance Evaluation.
In our research, for checking the accuracy of four satellite-based precipitation estimates, three different verification methods were adopted, which are detailed below.
where  obs, and  obs, are, respectively, the individual and averaged observed precipitation provided by ground-based data,  est, and  est, are, respectively, the individual and averaged estimated precipitation provided by satellite-derived products, and  is the total number of time steps.The observed and estimated precipitation are considered fully consistent without uncertainty of satellite rainfall products if the values of ME, MAE, RMSE, and BIAS = 0 and value of CC = 1.More detailed information about the continuous indices are referenced in [53].

Categorical Statistical Indices.
To better analyze the capacity of the satellite-based precipitation estimates in detecting the observed occurrence of precipitation at different precipitation thresholds, the authors adopted four categorical indices, as proposed by Wilks [54] and followed by various authors [27,55,56], depending on 2 × 2 contingency table including the frequency bias index (FBI), probability of detection (POD), false-alarm rate (FAR), and critical success index (CSI).These are calculated as in ( 6), ( 7), (8), and ( 9), respectively: where ,  , and  represent the total number of hits (observed precipitation correctly detected), misses (observed precipitation not detected), and false alarms (precipitation detected but not observed), respectively.The computed values of 1 for FBI, POD, and CSI indicate better accuracy in representation, while 0 is the optimal value of FAR.The explanations of these categorical criteria and other details are well described in [54][55][56].

Hydrologic Model Evaluation Indices.
In order to assess the impact of satellite precipitation products on streamflow simulation over the GRB by the HEC-HMS model, four statistical criteria were selected to measure the goodness-offit of HEC-HMS model simulations, which are the Nash-Sutcliffe Efficiency ( NS ), determination coefficient ( 2 ), and index of agreement () and the relative bias ratio.The relevant equations ( 10), ( 11), ( 12) and ( 13) are, respectively, given as follows: in which  obs, ,  sim, ,  obs, , and  sim, are, respectively, the observed streamflow, the simulated streamflow, the average observed streamflow, and the average simulated streamflow at any given time step , and  is the total number of time steps.The best consistency between simulated and observed streamflow is judged on the basis of the indicator values of ( NS ,  2 , and ), which should be close to 1, and a lower  bias value.The ranges of these indicators have been selected in this study according to the Moriasi et al. [53] recommendations.

Evaluation and Comparison of Satellite-Gauged Precipitation
Estimates.In the following sections, we evaluate the accuracy of the satellite precipitation estimates against gauged precipitation datasets over the study region because of the negative effects of their associated uncertainties on the hydrological modeling process.The comparative analysis was conducted over 6 years (2008-2013) and seasonal daily average datasets to investigate and characterize precipitation patterns and error quantification of the four satellite products over the GRB as described below.

Annual Spatial Patterns Analysis of Satellite Precipitation
Estimates.The spatial distribution of daily average precipitation from CMORPH-CRT, PERSIANN-CDR, 3B42RT, and 3B42 for six years (2008-2013) on a 0.25 ∘ grid over GRB is compared and illustrated in Figure 3. Visual inspection of Figure 3 shows that the average annual precipitation ranges are between 0.7 and 2.1, 2.5 and 2.8, 3.4 and 4, and 1.1 and 2.0 for CMORPH-CRT, PERSIANN-CDR, 3B42RT, and 3B42, respectively.
It should be noted that the spatial distribution of all satellite precipitation estimates is clearly differentiated, with the precipitation intensities gradually decreasing from the western part of the basin to the eastern part.Also, the spatial variability analysis reveals that the low-altitude regions of the basin are characterized by higher spatial variability of precipitation in comparison with the high mountainous regions, which are greater than 4500 m in altitude.Despite the fact that the precipitation amount is noticeably lower with CMORPH-CRT, it is noted that the differences of spatial precipitation distribution patterns between CMORPH-CRT and 3B42 are relatively small, as both of them showed overestimation and underestimation precipitation amounts at the lower and upper regions of the basin, respectively.The highest precipitation amount was observed for 3B42RT, while the lowest spatial variability of precipitation was found for PERSIANN-CDR.On the whole, the precipitation patterns derived by CMORPH-CRT and 3B42 are somewhat more visually compatible than those retrieved from PERSIANN-CDR and 3B42RT.

Annual Intercomparison of Satellite-Gauged Precipitation Datasets.
The Quantile-Quantile (Q-Q) plot technique and scatterplots were adopted to illustrate more insight into the nature of the differences between the four satellites (CMORPH-CRT, PERSIANN-CDR, 3B42RT, and 3B42) and the gauged precipitation datasets for the total precipitation from 2008 to 2013 over the GRB (Figure 4).When looking at Figure 4, the Q-Q plots analysis shows that the daily precipitation amounts obtained by satellite and gauged datasets are significantly different.As shown, the differences in daily average precipitation estimates between satellite and gauged datasets become larger as the precipitation amount increases.Additionally, PERSSIANN-CDR and 3B42RT have the highest daily average precipitation estimates over the GRB compared with those obtained by the gauged observations of CMORPH-CRT and 3B42.However, at the same time, the precipitation estimates of CMORPH-CRT and 3B42 are lower than the gauged observations.Overall, the CMORPH-CRT, PERSIANNN-CDR, and 3B42 provide the best agreement with gauged observations than the precipitation estimates from 3B42RT, except some noteworthy biases in the middle and upper portions of the distributions.
In addition, the five criteria, ME, MAE, RMSE, CC, and  bias , have been also included in Figure 4

CMORPH-CRT PERSIANN-CDR 3B42RT 3B42
Winter CC = 0.31 from PERSIANN-CDR.These values indicate that both CMORPH-CRT and 3B42 have better agreements with gauged datasets in this area.As seen from the results of statistical analysis in Figure 4, the 3B42RT exhibits the largest positive percentage of biases which are relatively larger than PERSIANN-CDR.However, the underestimation of the precipitation rates occurs only with CMORPH-CRT and 3B42, which leads to significantly lower biases of PERSIANN-CDR and 3B42RT.Although 3B42RT strongly overestimates precipitation by 107.98, 3B42 slightly underestimates it (less than 3.84).The results suggest that both CMORPH-CRT and 3B42 have more reasonable performance than 3B42RT and PERSIANN-CDR in terms of all criteria over GRB region.

Seasonal Spatial Patterns Analysis of Satellite Precipitation Estimates.
Figure 5 shows the seasonal spatial patterns of daily average CMORPH-CRT, PERSIANN-CDR, 3B42RT, and 3B42 precipitation estimates in four seasons, spring (March to May), summer (June to August), autumn (September to November), and winter (December to February), for the period January 2008-December 2013.Clearly, the spatial patterns of CMORPH-CRT and 3B42 precipitation estimates are significantly identical, as well as the precipitation intensity increasing from the northeast to the southwest over the GRB, which is especially consistent with spatial patterns of precipitation regardless of the season.In contrast, the spatial patterns of precipitation across the four seasons in PERSIANN-CDR and 3B42RT are very distinct with different intensity distributions of precipitation.
On the other hand, both 3B42RT and PERSIANN-CDR showed the highest volume of precipitation through the four seasons in comparison with 3B42 and CMORPH-CDR.
In addition, the largest values of precipitation amounts for CMORPH-CRT, 3B42RT, and 3B42 were mainly observed during the wet period (summer and autumn), while PERSIANN-CDR recorded highest values during the dry period (winter and spring).The important feature from Figure 5 is that the seasonal spatial variability for CMORPH-CRT, PERSIANN-CDR, and 3B42 gradually increases from northern parts to the southern parts of the basin.However, this relationship is not clear for 3B42RT, where the spatial variability of precipitation is more scattered and the highest along the GRB.

Seasonal Intercomparison of Satellite-Gauged
Precipitation Datasets.The seasonal differences of precipitation estimates derived from the CMORPH-CRT, PERSIANN-CDR, 3B42RT, and 3B42 against gauged observations were analyzed using Q-Q plots and scatter plots (Figure 6).
An inspection of Figure 6 reveals that CMORPH-CRT and 3B42 exhibit the lowest average precipitation estimates over the GRB compared to gauged observations when compared to others regardless of the season.The precipitation estimates from 3B42RT are higher than gauged observations, except during the winter, which are relatively closer to gauged observations in the estimation of high precipitation.The percent agreement between satellite and gauged datasets increases for the wet seasons (summer and autumn) and decreases for the other seasons.However, it is notable that the differences between precipitation estimates from satellite and gauged datasets become larger during the winter, especially for the PERSIANN-CDR product.In addition, the biases in the estimation of the precipitation amount are much higher in the middle parts of the probability distributions, with relatively medium daily average precipitation events regardless of seasonal changes, except 3B42RT which showed higher medium and high precipitation biases in every season.
Besides Q-Q plots and scatterplots analysis, the five criteria ME, MAE, CC, RMSE, and Rbias were used to quantify a comparison of satellite and gauge precipitation datasets in four seasons, as shown in Figure 6.It is evident that the seasonal fluctuations have noteworthy influence on the accuracy of the satellite estimates.Although there is a slight difference between CMORPH-CRT and 3B42, PERSIANN-CRT and 3B42RT show remarkable dissimilarity in every season.The worst performance is found through PERSIANN-CDR and 3B42RT with the lowest CC and the largest ME, MAE, RMSE, and Rbias (the highest Rbias being 1635.15 and 246.63, resp.), especially during the winter, which can be attributed to substantial overestimation of precipitation.
In contrast, CMORPH-CRT and 3B42 perform better and are comparable among the other products, based on increased CC and reduced ME, MAE, RMSE, and Rbias.Broadly, CMORPH-CRT has the highest accuracy in terms of MAE and RMSE, whereas 3B42 outperforms others in terms of ME and Rbias in every season.The CC values of all estimates are higher in the wet period and reach 0.54 and 0.65 for summer and autumn, respectively, with 3B42; however, CMORPH-CRT and 3B42 have the highest correlations with gauge observations compared to others.Additionally, it is obviously seen that all products performed worst in winter with the lowest CC value of PERSIANN-CDR (−0.01), followed by 3B42RT (0.03), then CMORPH-CDR (0.05), and finally 3B42 (0.1).These are analogous to the findings by Wang et al. [55], who pointed out that the CMORPH-CRT, 3B42RT, and 3B42 cannot perform well in the winter season over basins of the Southeast Tibetan Plateau.

Evaluation of Satellite-Gauged Precipitation Datasets
at Different Thresholds.The intensity distributions of the daily precipitation at different precipitation thresholds along with their relative contributions to the total precipitation are plotted in Figure 7.The figure clearly shows that there are remarkable differences between satellite and gauged datasets for precipitation occurrence under different precipitation classes, which begins to decrease when precipitation intensity ranges are greater than 5 mm/day, except for 3B42RT in the high rainfall class (>20 mm/day).The largest intensity occurrence of gauge data is nonrainy days, which occur approximately 60% of the total days, but for satellite datasets, the largest precipitation occurrence is 0 < rainfall ≤ 1 mm/day estimated by CMORPH-CRT, 3B42RT, and 3B42, accounting for 35-50% of the total precipitation, while the 1 < rainfall ≤ 5 mm/day is the largest class for the PERSIANN-CDR data, occurring about 35% of the total days.While CMORPH-CRT and 3B42 tend to underestimate the occurrences of the precipitation class ranges of 5 < rainfall ≤ 20 mm/day and rainfall > 20 mm/day, PERSIANN-CDR and 3B42RT overestimate the occurrence of all precipitation classes except nonrainy days.In addition, the precipitation class of 1 < rainfall ≤ 5 mm/day has the largest precipitation contribution rates for CMORPH-CRT, PERSIANN-CDR, and 3B42, which contributes about 30-40% of the total rainfall for all three datasets.While the dominant precipitation classes for gauge data and 3B42RT are 5 < rainfall ≤ 10 mm/day and rainfall > 20 mm/day, respectively, both classes contribute more than 35% of the total precipitation amounts for 3B42RT and gauge datasets.
The occurrences of the first two classes, that is, nonrainy and light precipitation classes of 0 < rainfall ≤ 1 mm/day from all satellite datasets, are smaller than the gauge observations (maximal about 68% for CMORPH-CRT data), although their contributions to the total rainfall are larger than those from gauge data (maximal 14% for CMORPH-CRT data).The number of occurrences of recorded middle rainfall class range 1 < rainfall ≤ 15 mm/day from all satellite datasets is larger than that of gauge data, accounting for as high as 30-50% of the total days.But the contribution rates are larger for CMORPH-CRT and 3B42 and smaller for PERSIANN-CDR and 3B42RT than that of rain gauge rainfall.Notably, the occurrences and contribution rates for the precipitation class of 5 < rainfall ≤ 10 mm/day are almost equivalent between the 3B42 and gauge datasets.For high precipitation class range rainfall > 15 mm/day, CMORPH-CRT and 3B42 slightly underestimate the occurrence by 0.13% and 0.27% of total days, respectively, and their precipitation contribution rates are dramatically lower than those of PERSIANN-CDR, 3B42RT, and gauged data.For PERSIANN-CDR and 3B42RT, the occurrences are both below 6% of the total days, while the contribution rates to the total rainfall amounts are as high as 20 and 40%, respectively.Overall, when comparing with gauge data for light and middle class range rainfall, both the occurrences and contribution rates of CMORPH-CRT and 3B42 have better agreement with gauge data than PERSIANN-CRT and 3B42RT.
In this study, the detection analysis of precipitation events was also performed to examine the ability of CMORPH-CRT, PERSIANN-CDR, 3B42RT, and 3B42 to make estimates over the GRB using contingency statistics (FBI, POD, FAR, and CSI) at different precipitation thresholds of 1 mm/day, 5 mm/day, 10 mm/day, 15 mm/day, and 20 mm/day (Figure 8).It should be noted that all index values of CMORPH-CRT and 3B42 are comparable to each other with exception of the precipitation threshold value equal to 1 mm/day, while index values for PERSIANN-CDR and 3B42RT show relatively high variance.However, for FBI, both CMORPH-CRT and 3B42 have a slight tendency to overestimate frequency of precipitation events for thresholds of 5 mm/day, 10 mm/day, 15 mm/day, and 20 mm/day, whereas PERSIANN-CDR and 3B42RT incline to underestimate precipitation events across all thresholds, especially the 3B42RT which shows the largest underestimation of precipitation frequency for all thresholds except 1 mm/day.On the other hand, the FBI values of PERSIANN-CDR and 3B42RT products increase with an increase in precipitation intensity, and FBI values from CMORPH-CRT and 3B42 decrease for the precipitation thresholds up to 5 mm/day, meaning the detection skill of satellite products is better for intense precipitation events.Overall, 3B42RT shows the lowest skill in capturing the magnitude of precipitation events.
It evident that all satellite products exhibit poor scores for the precipitation threshold of 1 mm/day, exhibiting lower POD and CSI and higher FAR.Despite the fact that the POD and CSI scores of CMORPH-CRT are larger than the other products, it seems to have relatively equivalent performance with 3B42 in all precipitation thresholds.This implies that the algorithms of both CMORPH-CRT and 3B42 satellite products not only are more effective but also incur a more positive effect on POD and CSI indices, compared to other precipitation products.Among all products, 3B42RT shows lower FAR scores during light and moderate precipitation, while all products yield comparable values for thresholds of 15 mm/day and 20 mm/day.This indicates that the PDF matching adopted by CMORPH-CRT does not excel on the monthly gauge adjustment scheme used in TRMM products to remove biases [56].Compared with 3B42RT, 3B42 has somewhat more falsely alarmed precipitation events, leading to less effective and more uncertain FAR scores.Although 3B42 uses gauge corrections and histogram matching, suggesting the monthly bias adjusted method used by 3B42 still needs to be improved in order to avoid the defects of FAR precipitation events which exist for the 3B42RT.Hence, PERSSIAN-CDR demonstrates worse FAR scores than CMORPH-CRT.Generally, all satellite products show improved performance with increasing precipitation thresholds, meaning an increased ability to capture the magnitude of intense precipitation events.

Evaluation and Comparison of Streamflow Simulation
Scenarios.For examining the efficacy of the four satellites' CMORPH-CRT, PERSIANN-CDR, 3B42RT, and 3B42 precipitation estimates in simulating streamflow over GRB basin, we analyze their effects on the streamflow simulation using HEC-HMS model on daily time steps under two scenarios as detailed below.The hydrographs and the exceedance probability between observed daily and simulated streamflow by the HEC-HMS model, based on the four satellites and gauged precipitation datasets as precipitation forcing in the GRB during simulation time period (2008)(2009)(2010)(2011)(2012)(2013), are illustrated in Figure 9.It can be seen that the simulated streamflow hydrograph obtained with the gauged precipitation dataset fits well against the observed streamflow, especially for the calibration period and the relatively low discharges during the dry seasons, as shown in Figure 9(a).It is also observed that the streamflow simulation tends to underestimate the high peak discharges in wet seasons and overestimate some values of the hydrograph recession curves for the years 2010, 2011, and 2003.Overall, the HEC-HMS model is capable of capturing the timing and magnitude of the daily observed streamflow quite well.Figure 9(f) shows that the exceedance probabilities of daily streamflows display systematic underestimation of simulated streamflows at high and low observed streamflows and overestimation at moderate streamflows; the simulations show better estimation for the calibration period than for the validation period.
Subsequently, the skill indices of HEC-HMS simulations were carried out to evaluate the model performance, and their findings for the calibration and validation periods are listed in Table 4.It can be seen from this table that the values of  NS ,  bias ,  2 , and  are 0.80, −9.29, 0.81, and 0.94, respectively, for the calibration period but 0.63, −7.67, 0.67, and 0.89, respectively, for the validation period.These skill indices reveal that although the performance of streamflow simulation can be considered satisfactory during the validation period, it is not as good as that found for the calibration period, which exhibits good performance.It is apparent from these results that HEC-HMS model could successfully streamflow simulation at a daily time scale in the GRB, implying that the model is reliable to investigate the hydrological usefulness of satellite precipitation products in the GRB, specifically for streamflow simulation.Afterwards, the simulation streamflow hydrographs by the gauged precipitation data-forced HEC-HMS model are then compared to those forced by the four satellites' estimates of calibration and validation periods, as shown in Figures 9(b)-9(e).None of the CMORPH-CRT, PERSIANN-CDR, 3B42RT, and 3B42-driven model simulations resulted in improvements in the timing and magnitude of the observed streamflow for neither calibration nor validation periods in comparison with streamflow simulation forced by gauged precipitation data.
However, the simulations forced by 3B42RT largely overestimate peak discharges for the years 2010, 2011, and 2013 but could capture the peak discharges during other From Figure 9(f), it can be clearly seen that 3B42RT shows the largest overestimation at high discharges, especially in the validation period with exceedance probabilities of up to 45%.PERSIANN-CDR, on the contrary, consistently underestimates most of the observed streamflow series in the validation period but somewhat underestimates high discharges with exceedance probabilities under 1% during calibration period.Both CMORPH-CRT and 3B42 yield slight underestimation at large peak discharges with exceedance probabilities less than 10% and tend to overestimate moderate discharges.However, all four simulations exhibit significant underestimation for low discharges in the calibration and validation periods.Overall, the exceedance probabilities of simulations forced by CMORPH-CRT and 3B42 are comparatively similar to each other in the calibration and validation periods.
As it can be seen from Table 4, the model driven by gauged precipitation data indicates the best performance with the highest skill  NS of 0.80,  2 of 0.81, and  of 0.94 in the calibration period, compared to the simulations forced by the four satellite estimates under scenario I, as expected.Nevertheless, the simulation forced by 3B42 exhibits good performance ( NS of 0.77,  2 of 0.78, and  of 0.93) and is relatively close to the gauged data of the calibration period.Additionally, in the validation period, the performance of the hydrologic model is better than those based on satellite and gauged precipitation datasets as inputs, with the highest  NS of 0.64,  2 of 0.70, and  of 0.91.Meanwhile, CMORPH-CRT and 3B42RT derived simulations gave relatively similar performances in terms of  NS (0.68 and 0.67),  2 (0.69 and 0.72), and  (0.90 and 0.91) for the calibration period and  NS (0.50 and 0.50),  2 (0.57 and 0.67), and  (0.84 and 0.89) for the validation period.It is also evident that the simulation forced by PERSIANN-CRT produces the worst overall performance among these four satellite datasets, exhibiting the smallest values for  NS of 0.45 and 0.48,  2 of 0.49 and 0.56, and  of 0.82 and 0.83 in the calibration and validation periods, respectively.However, all simulations based on satellite and gauged precipitation datasets exhibit slight negative biases in the calibration period, specifically for  bias of −3.75% corresponding with PERSSIANN-CDR, which can be considered negligible.On the other hand, in the validation period,  bias significantly decreases to −7.67% and −1.85% in simulations forced by gauge datasets and 3B42, respectively, while the opposite is true for  bias of −20.04%, −24.77%, and 14.56% in CMORPH-CRT, PERSIANN-CDR, and 3B42RT, respectively.
The above comparison reveals that 3B42 has great potential to be used alternatively for the datasets of gauged observations for hydrological simulations over mountainous watershed in the GRB, while CMORPH-CRT, PERSIANN-CDR, and 3B42RT products have limited ability for streamflow simulations and are not recommended for direct use over this region, especially the PERSIANN-CDR which is not suitable to simulate daily streamflow in this study area, though it does not show the worst results in precipitation datasets evaluation.These findings reveal that the best streamflow simulation is not necessarily required to correspond with a better satellite precipitation product, possibly due to the interaction between the precipitation datasets and streamflows.This behavior is consistent with the conclusions of Qi et al. [57].

Scenario II: Calibration Procedure Using Individual
Satellite Datasets.In this study, scenario II is exclusively used to further analyze the effect of CMORPH-CRT, PERSIANN-CDR, 3B42RT, and 3B42 products' uncertainty on streamflow simulation.The HEC-HMS model is recalibrated with each of the four satellite estimates as forcing inputs, and the calibration and validation periods of scenario I are kept unaltered within scenario II.The results are reported in Figure 10.
Figures 10(a)-10(e) demonstrate that all model simulations forced by satellite estimates match the observed hydrographs relatively well and show a greater tendency to adequately capture peak flows as compared to those of scenario I.For further illustration, in Figure 10(f) only the results from the exceedance probabilities of simulated and observed discharges are shown.It is important to note that all simulations yield slight underestimation at large peak discharges with exceedance probabilities less than 10%.In contrast to scenario I, the exceedance probabilities of scenario II simulations tend to be almost similar for both the calibration and validation periods.However, Figure 10 shows that the parameter recalibration in scenario II causes an appreciable increase in enhancement in high and low streamflow simulations.Furthermore, the skill metrics of the simulation results for scenario II are summarized in Table 5.It is important to note that all satellite estimates, except PERSIANN-CDR in the validation period, show remarkably improved results with increased values of  NS ,  2 , and  and slightly decreased values of  bias relative to those in scenario I as explained in Table 4.The simulation of 3B42 achieves the best performance in terms of all metrics skill when compared to other satellite products, which surpass the gauged precipitation driven simulation, especially during the validation period.Similarly for scenario I, the simulation performances of CMORPH-CRT and 3B42RT are relatively identical for both calibration and validation.At the same time the simulations of PERSIANN-CDR in scenario II show a slight improvement only during calibration period, while there is a decline in performance during the verification period as compared to scenario I.This indicates that the PERSIANN-CDR driven HEC-HMS model may still be less suitable to simulate streamflow in this study area even with the parameter recalibration in scenario II.It is obvious that all satellite precipitation forced simulations have less bias compared to scenario I, suggesting a significant decrease in the uncertainty of streamflow simulations for output satellite products under scenario II.In general, similar findings by previous studies [11,52,56] reported increasing the effectiveness of hydrological simulation with input-specific recalibration.

Effectiveness Analysis of Calibration Procedure on
Hydrologic Processes.In addition to model predictions of streamflow, the parameter recalibration results should be examined based on their capability to compensate biases in the four satellite estimates for other hydrologic processes, including snow water equivalent (SWE), base flow (BF), melt rate ATI (MR-ATI), liquid water content (LWC), and moisture deficit (MD) as shown in Figures 11 and 12.
Not surprisingly, all hydrologic components are affected by precipitation datasets as forcing inputs during dry and wet year.It is evident that PERSIANN-CDR presents larger differences in terms of SWE, MR-ATI, and LWC as compared to those of other precipitation products.Compared to scenario I, PERSIANN-CDR and 3B42RT show higher differences regarding BF and MD in dry and wet seasons, which may be due to the precipitation estimates of PERSIANN-CDR and 3B42RT being higher than those of CMORPH-CRT and 3B42.In addition, one can also see that there are slight differences in the SWE, MR-ATI, and LWC under both scenarios, especially in dry year.Unlike CMORPH-CRT and 3B42 products, there are relatively high values of SWE, MR-ATI, and LWC with respect to PERSIANN-CDR and 3B42RT under scenario II in dry year.While the values are relatively reduced in wet year, this causes a remarkable increase of streamflow for PERSIANN-CDR and 3B42RT than it does for CMORPH-CRT and 3B42 during the autumn and summer months.Furthermore, it can be seen that CMORPH-CRT and 3B42 for all hydrological components are nearly the same within scenario I, while noticeable differences between them can be observed in scenario II.These results reveal that scenario II tends to compensate the errors of the satellite products through an increase or decrease in hydrological components values, which may limit the ability of the model to simulate snowmelt processes over the GRB.Nevertheless, this problem is a common occurrence in hydrological communities, especially in the Alpine region, which are attributed to the unrealistic representation of the model parameters under scenario II.Based on this information, more efforts should be made to choose suitable precipitation products as forcing inputs in snowmelt processes modeling.

Summary and Conclusions
In this paper, we have made a comprehensive analysis of the applicability and reliability of the latest satellite-based precipitation datasets (CMORPH-CRT, PERSIANN-CDR, 3B42, and 3B42RT) against gauge-based datasets in the mountainous GRB of the southeast Tibetan Plateau in China.To extensively find out weaknesses and strengths of the latest four satellite products, within this context we first focused on the evaluation and comparison of different precipitation estimates and then highlighted the usefulness and suitability of each product' driven hydrological simulations with the HEC-HMS model at a daily time step.The key findings of this study can be listed as the following: (i) Overall, the spatial patterns of CMORPH-CRT and 3B42 precipitation estimates are more similar than those retrieved from PERSIANN-CDR and 3B42RT at annual and seasonal scales.Also, the spatial variation for the four products obviously depends on the basin's elevation, and this variation decreases in the high mountainous areas with altitudes greater than 4500 m.Apparently, 3B42RT shows more scattering and the highest spatial fluctuations, especially on the seasonal scale.
(ii) Comparing satellite and gauged-based precipitation datasets at an annual scale, both CMORPH-CRT and 3B42 demonstrate better performance than 3B42RT and PERSIANN-CDR in terms of all criteria over GRB region.In addition, CMORPH-CRT

Figure 1 :
Figure 1: (a) (left) Location of GRB in China with meteorological and hydrological stations and (right) the DEM map of GRB; (b) land use; and (c) soil type.

1 Figure 4 :
Figure 4: Q-Q plots (green) and scatterplots (black) of basin averaged precipitation from gauge observations versus satellite-based estimates during the period 2008 to 2013.
and calculated based on the daily basin averaged precipitation of satellite and gauged datasets during the period 2008-2013.Generally, the best values of MAE = 1.88 and RMSE = 3.73 are found for CMORPH-CRT, whereas 3B42 observed the best value of ME = −0.22,with a similar CC = 0.51 value for both products.In contrast, the 3B42RT shows the poorest values of ME = 1.90, MAE = 3.53, and RMSE = 7.59, except for

Figure 7 :
Figure 7: The intensity distribution of daily precipitation in different precipitation classes and their relative contributions to the total precipitation during the period 2008 to 2013.

4. 2 . 1 .
Scenario I: Calibration Procedure Using Gauged Precipitation Datasets.As discussed in Methodology, gauged precipitation data was first used to drive the HEC-HMS model and optimize parameters against observed streamflow at the outlet in the GRB for the period of 1 January 2008-31 December 2010, while the period of 1 January 2011-31 December 2013 was subsequently used for model validation.The model was then forced by CMORPH-CDR, PERSIANN-CDR, 3B42RT, and 3B42 as inputs for six years (2008-2013) with model

Figure 8 :
Figure 8: FBI (a), POD (b), FAR (c), and CSI (d) of daily average between the four satellite estimates and gauge observations over the GRB.

Table 1 :
The main characteristics of meteorological ground-based stations across GRB.

Table 2 :
Summary of the selected satellite-based precipitation products used in the present study.

Table 3 :
Calibration parameters for hydrologic simulation of HEC-HMS model.

Table 4 :
Skill indices of HEC-HMS simulations for the calibration and validation periods under scenario I.

Table 5 :
Skill indices of HEC-HMS simulations for the calibration and validation periods under scenario II.