Spatiotemporal Assessment of Temperature Data Products for the Detection of Warming Trends and Abrupt Transitions over the Largest Irrigated Area of Pakistan

Reliable and accurate temperature data acquisition is not only important for hydroclimate research but also crucial for the management of water resources and agriculture. Gridded data products (GDPs) offer an opportunity to estimate and monitor temperature indices at a range of spatiotemporal resolutions; however, their reliability must be quantified by spatiotemporal comparison against in situ records. Here, we present spatial and temporal assessments of temperature indices (Tmax, Tmin, Tmean, and DTR) products against the reference data during the period of 1979–2015 over Punjab Province, Pakistan. This region is considered as a center for agriculture and irrigated farming. Our study is the first spatiotemporal statistical evaluation of the performance and selection of potential GDPs over the study region and is based on statistical indicators, trend detection, and abrupt change analysis. Results revealed that the CRU temperature indices (Tmax, Tmin, Tmean, and DTR) outperformed the other GDPs as indicated by their higher CC and R2 but lower bias and RMSE. Furthermore, trend and abrupt change analysis indicated the superior performances of the CRU Tmin and Tmean products. However, the Tmax and DTR products were less accurate for detecting trends and abrupt transitions in temperature. The tested GDPs as well as the reference data series indicate significant warming during the period of 1997–2001 over the study region. Differences between GDPs revealed discrepancies of 1-2°C when compared with different products within the same category and with reference data. The accuracy of all GDPs was particularly poor in the northern Punjab, where underestimates were greatest. This preliminary evaluation of the different GDPs will be useful for assessing inconsistencies and the capabilities of the products prior to their reliable utilization in hydrological and meteorological applications particularly over arid and semiarid regions.


Introduction
Future estimates of global climate patterns are directly concomitant with climate variations at regional scale [1]. e regional variation in climate parameters and assessing their statistical importance are rudimentary tools in detection of climate change [2]. e reliable climate statistics can play a dynamic role for climate change adaptation and mitigation at regional levels [3,4]. However, the consequence of climate change is large, especially in the regions of vulnerable population [5], which leads to the reduction in agricultural productivity due to high temperature, severe drought, and flood conditions [6]. Despite the importance of other climate parameters, the long-term trend in temperature (hereafter T mean ) is critical for the quantification of climate changes and their possible impacts on the environment [7,8]. erefore, understanding the spatiotemporal variation in T mean on regional scales is of great importance in climate monitoring and in hydroclimate studies [9]. However, the variations and increasing trend in T mean which eventually lead to changing climate patterns could be attributed to significant variations in specific temperature indices (hereafter T max , T min , and diurnal temperature range, DTR) [10][11][12]. Several studies have reported the spatiotemporal variations of temperature indices and effects of climate change for different regions of the world [13][14][15][16]. ese accurate and reliable air temperature records underpin our knowledge of regional and global climate changes as well as their possible impacts on water resources and agriculture [17,18]. e stations record may be considered as the most reliable source for retrieving meteorological data so far but, unfortunately, scarce gauge records and poor data processing quality are major obstacles in conducting such assessments [19]. Even if gauge data are available and reliable, the irregular distribution and poor spatial coverage hinder their use [20].
In recent decades, the development of temperature gridded data products (hereafter GDPs) has been proven to be reliable and cost-effective for retrieving gridded data at various scales across the globe [21]. ese global temperature data products are derived from the nationwide meteorological stations located all over the world. Individual station data are geographically interpolated over onto different grid sizes using several algorithms and computational techniques, considering the physical characteristics (slope and elevation) of different regions. ese multisource data products are often applied as climatological inputs for hydroclimate simulations to fill gaps in sparse observation data at regional scales [22], and the considerable increase in the use of these products is attributed to their easy accessibility, good spatiotemporal coverage, fine resolution, and continuous observations [23]. As defined by the World Meteorological Organization (WMO), at least 30 years of data are necessary for rigorous climate studies [1]. Most GDPs provide the historical records of temperature suitable for meteorological studies, whereas the records acquired from satellite products are restricted by their short duration and missing data under specific conditions such as cloud cover [24]. e evaluation of climatic GDPs has been proven to be useful for quantifying trends, variability, and various other hydroclimate applications for different regions across the globe [25]. e application of GDPs has grown rapidly with advances in their reliability, resolution, and latency. However, to date, uncertainty remains as a major concern for GDPs due to the variable spatiotemporal coverage, lack of in situ observations, relocation of gauges, and data processing practices; these constraints have required comprehensive research studies related to the assessment of GDPs at the regional scale [26]. erefore, the reliability and accuracy of GDPs vary with time and regional climate [27], such that the assessment and evaluation of the performance and capability of the GDPs at regional scales are of great importance: in particular in the arid to semiarid regions, which are sensitive to nonsignificant variations in climate variables due to their insubstantial ecosystems [28]. Such regions are characterized by very diverse hydrological cycle that often reveals extreme behaviors, such as severe floods and prolonged droughts [29]. e predominantly arid to semiarid climate, as well as geographical location in a region of accelerated temperature rise, has placed Pakistan among the most vulnerable countries to a warming climate. Moreover, most of the local population are engaged in the agriculture sector, which is extremely vulnerable to changes in climate, yet having limited resources with which to acclimatize [30].
In recent years, few studies have reported the spatiotemporal variation in climate with limited literature focused on evaluation and assessment of global climate products with in situ records over the different regions of the country. Reference [1] reported the spatial and temporal increase in temperature indices with highest significant warming trend in T min and T mean and insignificant T max variations over the different irrigation zones of Punjab Province. Similarly, [31] reported the spatial trends in temperature extremes using the Berkeley Earth Surface Temperature (BEST) data at 1°× 1°spatial resolution over Pakistan. Furthermore, [32] assessed the accuracy assessment of different precipitation GDPs over the arid region of Pakistan. However, the aims of these studies are either focused on assessment of precipitation data products or the detection of observed trends in temperature and precipitation. Meanwhile, the spatiotemporal accuracy of GDP-derived temperature indices across the agricultural region of the country has not yet been studied. erefore, this is the first study aiming to address this gap in knowledge by providing a detailed evaluation and assessment of the spatiotemporal uncertainties of GDPderived temperature indices in Punjab Province, Pakistan.
is region is of great significance for countries' economic growth as it harvests the country's major agriculture commodities, yet the region is highly vulnerable to changes in climate pattern [1], has a high frequency of meteorological hazards, and is highly susceptible to climate change [33]. e outcomes of such studies could be useful for further improvement of temperature GDPs as well as for meteorological applications across the study region [34].
In this study, we aim to evaluate the performance of the widely used GDP temperature indices (T max , T min , T mean , and DTR) against reference data during the period of 1979-2015 in Punjab Province. We also focus on the evaluation and comparison of temporal changes in trends and abrupt changes of GDPs. e GDPs assessed here are the Global Historical Climatology Network-Monthly (GHCN), Center for Climatic Research-University of Delaware (UDEL), Asian Precipitation Highly Resolved Observational Data Integration towards Evaluation (APHRODITE), Climate Prediction Centre (CPC), University of Princeton, Global meteorological Forcing dataset (PGF), and Climatic Research Unit (CRU). We consider the usefulness of this study as multidirectional because our findings could be used as a reference for the selection of potential GDPs in many different hydroclimate studies across Punjab Province.

Site Description.
Pakistan is located in SW Asia. e country has an area of 8 × 10 6 km 2 , including diverse landscapes ranging from the Karakoram and Himalayan mountains in the north and northwest to the agricultural plains of the Indus River basin in the center and the Arabian Sea along the southern coast [35]. Punjab Province is Pakistan's second largest province, with geographical coordinates of 31.17°N and 72.70°E ( Figure 1). Additionally, the province has the largest population and is the agricultural hub of the country, producing more than 50% of the country's agricultural commodities [36]. e province includes five major rivers, namely, the Jhelum, Chenab, Ravi, Bias, and Sutlej.
ere are seven irrigation zones in Punjab ( al, Sargodha, Lahore, Multan, Faisalabad, Dera Ghazi (D.G.) Khan, and Bahawalpur). e annual mean precipitation ranges from >800 mm in the northern part to <300 mm in the southern part [37]. ere are two precipitation seasons in this region: the monsoon season (July-September) and winter (December-March) [38]. e annual mean temperature varies from 23 to 26°C, with T min of 16-19°C and T max of 29-33°C. Overall, the northern part of the region is dominated by humid and subhumid climates, while the central and southern parts are dominated by tropical and coastal climates.

Stations Data and Processing.
Historical variations in annual air temperature indices (T max , T min , T mean , and DTR) were investigated as reference data for the evaluation of GDPs over the study region. Long-term records of temperature indices from 1979 to 2015 from 20 meteorological gauges in Punjab were provided by the Pakistan Meteorological Department (PMD) (Figure 1). ese gauges were selected on the basis of their maximum available time coverage, uniformity, and completeness of the series records. e in situ records and GDPs were investigated and evaluated on an annual scale. GDPs were developed using quality-controlled stations data. erefore quality-controlled reference data were required for the evaluation of selected GDPs. e annual gauge and GDP data series were annual means of monthly averages. e quality control of the gauge records (such as detection of outliers and procedure for missing gaps) is of primary importance. For quality assurance, outliers were fixed with neighboring gauge records and gaps were obtained from nearby gauges [39]. Missing values in the records were filled by interpolation technique using a time-based approach: e.g., the mean value of that month over a period of ±2 years surrounding the missing value [40].
Numerous methods have been used to detect the outliers and inhomogeneities in the gauge records [41]. Here, the double-mass curve method was applied to the stations records [42]. e adjustment and detection of inconsistencies in station records was achieved by concomitant its variability with other relatively stable records [43]. Nonlinearity or bends can indicate relocation of the gauges or installation of a new instrument [44]. e result of the double-mass curve indicated a straight line, with no evident break points, which confirms a high temporal uniformity in the station records. After performing standard quality control checks, the spatial interpolation thin-plate smoothing splines (ANUSPLIN) method was used to convert the gauge data onto grids of 0.5degree resolution for spatial evaluation of the GDPs. e original thin-plate spline fitting technique is described by [45], while [46] provides a theoretical description of its application to surface climate variables. e spline interpolation method is robust in areas where there are irregularly spaced gauges and a prior estimation of the spatial autocovariance structure is not required [46]. e degree or significance of autocorrelation was checked in the observed data series of air temperature indices by using time series autocorrelation technique prior to detecting the trends significance, trend magnitude, and abrupt transition over time by using Mann-Kendall (MK), Sen's slope, and Sequential Mann-Kendall (SQMK) methods [1]. e possibility of identifying a significant trend increases with the escalation in autocorrelation, which would influence the outcomes of the trend tests [47]. Consequently, the occurrence of autocorrelation should be checked prior to applying the MK tests [48]. Our analysis found no significant autocorrelation in annual time series of T max , T min , T mean , and DTR at lag-1. erefore, the station records are independent and the trend test is applicable to the original station records. e detailed procedure for autocorrelation analysis is reported in [49].

Gridded Data Products (GDPs).
In this study, six temperature index data products (GHCN, UDEL, APHRO-DITE, CPC, CRU, and PGF) were evaluated against the reference data ( Table 1). Details of the datasets are described as follows.

CRU.
e Climate Research Unit (CRU) TS V4 product was developed by the University of East Anglia, UK, and is continuously updated with the support from National Centre for Atmospheric Science (NCAS) and Natural Environment Research Council (NERC), UK. e product comprises several climate variables including temperature indices (T max , T min , T mean , and DTR), precipitation, cloud cover, vapor pressure, wet-day counts, potential evapotranspiration, and wet-day frequency. e product is prevalent because of its relatively long history (1901-present) and its horizontal spatial resolution of 0.5 degrees. e product has been developed from more than 11,800 meteorological stations around the world [50]. e main sources used for construction of monthly datasets were acquired from the national meteorological agencies (NMAs), the World Meteorological Organization (WMO), National Climatic Data Center, NCDC, USA, University of East Anglia CRU, the Food and Agriculture Organization (FAO), Centro Internacional de Agricultura Tropical, and others. In this study, we used monthly datasets of temperature indices (T max , T min , T mean , and DTR) for the period of 1979-2015 at 0.5-degree resolution.

UDEL.
e UDEL V5.01 product was developed by the University of Delaware, USA. e primary data of air surface temperature and precipitation were acquired from various sources including Global Historical Climatology Network (GHCN2, daily GHCN) at the National Centers for Advances in Meteorology Environmental Information, Atmospheric Environment Services Canada, Institute of Hydrometeorology in St. Petersburg, daily records from Greenland Climate Network data, daily records from the Global Surface Summary of the Day, the National Center for Atmospheric Research (NCAR) India, and Nicholson's precipitation data archive of Africa, station records from South America, and monthly records from the Automatic Weather Station Project Greenland [51].
e product provides monthly values for the period of 1900-2017 at 0.5-degree spatial resolution. In this study, we used monthly datasets of T mean for the period of 1979-2015 at 0.5-degree resolution.

CPC.
e CPC gauge-based product is the first product covering both land and ocean data from the CPC Unified Precipitation Project at the National Oceanic and Atmospheric Administration (NOAA). e CPC acquired data and constructed various climate parameters on daily and monthly scales; these include precipitation, temperature, snow cover, and degree-days. e product collected data reports from 30,000 stations, including reports from the Global System of Telecommunication (GTS) data, Cooperative Observer Network (COOP), and other national meteorological agencies (NMAs) [52]. e CPC product database covers the period from 1979 till the present at 0.5degree spatial resolution. In this study, we used monthly datasets of temperature indices (T max , T min , and T mean ) for the period of 1979-2015 at 0.5-degree resolution.

APHRODITE. Asian Precipitation-Highly Resolved
Observational Data Integration towards Evaluation of Water Resources (APHRODITE) products V1808 and V1101 of  daily surface air temperature and precipitation were developed by the Meteorological Research Institute of the Meteorological Agency and Japan Institute for Humanity and Nature. e gauge dataset is first interpolated at 0.05degree resolution. Further datasets are generated after regridding to 0.25-and 0.5-degree resolution by considering the local attributes, with the aid of improved algorithms for the weighting function [53]. e product provides daily values for the period of 1961-2015. e primary datasets were acquired from national meteorological agencies (NMAs), Global Telecommunication System (GTS) data, International Center for Integrated Mountain Development (ICIMOD), International Water Management Institute (IWMI), and other international projects on Asian climate. In this study, we used the monthly datasets of T mean for the period of 1979-2015 at 0.5-degree resolution.

GHCN.
e Global Historical Climatological Network-monthly (GHCN) data product V3 was developed by the National Climatic Data Center (NCDC) and National Centers for Environmental Information. It provides the temperature monthly mean data for 7280 stations in 226 countries [54]. e product covers the period from the early 1900s to present, with horizontal spatial resolutions of 5 and 0.5 degrees. e primary datasets are acquired from national meteorological agencies (NMAs), Global Telecommunication System (GTS) data, the National Center for Atmospheric Research (NCAR), World Weather Records, and other data sources. In this study, we used the monthly T mean product at 0.5-degree resolution from 1979 to 2015.

PGF.
e product was developed by assimilating the National Centers for Environmental Prediction-National Center for Atmospheric Research (NCEP-NCAR) reanalysis datasets with several global observed databases [55]. e product comprises different climate variables, including air temperature indices (T max and T min ), precipitation, downward short and longwave radiation, surface pressure, specific humidity, and wind speed. e product is available at 3hourly, daily, and monthly resolution from 1949 to the present with horizontal spatial resolutions of 1 and 0.5 degrees. In this study, we used the monthly datasets of T max and T min for the period of 1979-2015 at 0.5-degree resolution.

Statistical Evaluation.
In the present study, the spatiotemporal performance of each GDP was assessed against reference or gauge data by using significant statistical indicators. Several quantitative evaluation metrics, including the Pearson correlation coefficient (CC), root mean square error (RMSE), and standard deviation, were applied using a Taylor diagram, which is an accurate method of quantifying the degree of agreement between GDPs and reference data [56]. Moreover, the coefficient of determination (R-squared) was also used for each GDP against the reference data to quantify their respective linear relationships between the gauge and GDPs. e percentage bias (rBias) was then calculated to indicate the level of over-or underestimation of GDPs against observation data on a spatiotemporal scale. e equations for these statistical metrics are as follows: where G i and G refer to the gauge data and mean of gauge data or observation data, P i and P refer to the gridded products and mean of gridded products, respectively, and n is the total number of observations. According to [57], the statistical estimates are considered appropriate if their values satisfy CC > 0.7, RMSE and R-squared near 0, and rBias in the range of −10 to 10%. Positive values of rBias indicate underestimation of temperature by the GDP and vice versa for negative values [58].

Mann-Kendall (MK) Test.
e nonparametric MK trend analysis was used to evaluate the GDPs against reference data series. e MK test is robust against missing values and outliers [52] and is less sensitive to the abrupt disruption points and inhomogeneous data series [43]. e test has been widely used for the detection of hydrological and meteorological variations and trends in sequential data series [59,60]. e governing equations for statistics (S), variance Vas (S), and standardized (Z) statistics of MK test for identical and independent distributed datasets are as follows:

Advances in Meteorology
In these equations, n and x p , x j are defined as the series length and data sequential values, respectively. t h is the size of the pt h tied group and m is the number of tied groups. e standardized (Z) statistics follow the symmetric distribution with a null hypothesis (H 0 ) of no trend in the data series [61]. However, the alternative hypothesis was that there has been a significant trend in the time series data [62]. e negative and positive values of the (Z) statistics represent decreasing and increasing trends, respectively.

eil and Sen's Slope (TSS).
e nonparametric TSS technique is used to quantify the magnitude of slope in linear trends [63] and is widely acceptable and used for the investigation of hydrological and meteorological data series [64]. e TSS employs regression method and is broadly used for estimating the rate of slope magnitude in linear trends of temporal data series [65]. e TSS computed slope magnitude is not affected by the inconsistencies in the temporal data series [30]. e basic equation for the computation of trends magnitude is defined as follows: where N is the length of temporal data series and x j and x p are the data values at times j and p, respectively. Furthermore, the TSS equations for the even and odd length of temporal data series are as follows: if N is an even number,

Sequential Mann-Kendall (SQMK).
e SQMK trend test is used to detect significant positive or negative turning point in a time series data [48]. e trend test sets up two data series, that is, retrograde series (RS) and progressive series (PS). e statistics value of progressive series is the same as (Z) statistics, which ranges from initial to end data point. Conversely, the retrograde series is estimated backward and originates from end to initial data point in a temporal data series. If the two data series cross each other and attain explicit threshold values at a certain point, then there is significant positive or negative trend. e abrupt turning point shows the beginning of statistically significant trend in a time series [43]. e increasing and decreasing abrupt changes in PS and RS data series indicate positive and negative trends, respectively [66]. e following procedure is adopted for the estimation of abrupt changes in a given time series. e SQMK test statistics (T j ) are defined as follows: where 'n j ' is computed by comparing the magnitude of annual averages of sequential time series x j (j � 1, . . ., n) with x p (p � 1, . . ., j), as well as number of times x j > x p are counted. e mean (E) and variance (Var) of T j are computed by Subsequently, the test statistics, PS and RS, are calculated by the following equation: Similarly, RS(T) is estimated starting from end to initial data point. e hypothesis H 0 in the test would be accepted at a corresponding level of significance, if PS(T) ≤ PS(T) 1−α/2 , where PS(T) 1−α/2 is the captious value of a symmetric distribution with a value of probability above α/2. e level of α is set at 0.05 in the study. e abrupt point is contemplated to be significant at 95% confidence level (i.e., ±1.96).

Temporal and Spatial Dynamics.
e annual averages and temporal variability of temperature indices (T max , T min , T mean , and DTR) as estimated from reference and GDPs over Punjab are shown in Figures 2 and 3. e results indicated that all the GDPs of temperature indices overestimated the temperature when compared with the reference data. However, the degree of overestimation varied by 1-2°C between different products. e most representative temporal trends in the temperature indices were found in the CRU, PGF, and UDEL products. All other products also demonstrated their ability to capture the temporal variability over the study region but with notable overestimation. Overall, the performance of the CRU products was better in terms of temporal variability and magnitude. Figure 4 illustrates the spatial distribution pattern of GDPs and reference data. e spatial pattern of gauge-based temperature indices indicated a higher to lower south-north temperature gradient in Punjab, with magnitudes of 34.02-22.5°C, 19.03-11.45°C, 15.84-11.28°C, and 28.48-14.53°C, respectively. e GDPs exhibited similar patterns to those of the reference data with notable overestimation in all the products. e spatial pattern was best matched by the T mean products. Overall, the most accurate spatial patterns were achieved by the CRU, UDEL, and PGF products, while the lowest accuracies were achieved by the CPC product for temperature extremes (T max and T min ) and GHCNM T mean when compared with reference data over the study region. e performances of GDPs relative to the reference data were further evaluated based on statistical metrics as presented in Table 2.
e results again indicate that all the products overestimated temperature, with notable values of rBias in the CPC and GHCNM products. However, CRU products T max , T min , DTR, and T mean performed better in 32   e performance of other GDPs also showed moderate agreements but with higher error metrics when compared with reference data. To further compare the different GDPs against reference data, Taylor diagram was plotted [56], which quantifies agreement between the reference and GDPs in terms of their correlation coefficients (CC), standard deviation (SD), and root mean square error (RMSE), as shown in Figures 5(a)-5(d). e Taylor diagram illustrates the pattern and degree of similarity of GDPs and shows how far away the GDPs plot is from the reference data [67]. In the diagram, CC is shown by blue lines perpendicular to the parabolic scale; SD is shown by radii of the black circles and RMSE is shown by radii of the green circles.

Advances in Meteorology
e results indicate that the CRU GDP outperformed the other products, as indicated by the points plotting closer to the reference data while also having higher CC and lower RMSE values. e PGF and UDEL products also showed reasonable agreement with the reference data. Meanwhile, the CPC and DTR of CRU products showed relatively poor performance when reproducing the patterns in the reference data.  Advances in Meteorology e spatial distributions of Bias, RMSE, and CC used to evaluate T max , T min , DTR, and T mean of the GDPs against the reference data over the Punjab region during the whole research period are shown in Figure 6. e statistical metrics highlight discrepancies in the spatial patterns of GDPs against the reference data [67]. e spatial metrics indicate that the estimation of CRU, UDEL, and PGF T min products achieved the best agreements when compared with reference data. e relative GDPs of T max , T min , T mean , and DTR showed poor performances as indicated by the magnitudes of Bias, RMSE, and CC. Furthermore, intercomparison of the temperature indices demonstrates similar spatial patterns among the CRU, UDEL, APHRODITE, PGF, and CPC T mean products. However, the CPC T min and GHCNM T mean products yielded patterns that were the inverse of relative products with large overestimations.
Meanwhile, the CPC temperature indices were the least accurate among all the products when capturing the spatial distribution pattern. Most products showed similar patterns, with positive Bias in the northern part of the study region, indicating that temperatures were underestimated in the high altitude of northern Punjab. e temperature index products were less able to capture the spatial distribution pattern, particularly in northern Punjab. is could be attributed to topographic effects and the relatively coarse spatial resolution of GDPs when attempting to capture variability in high altitude areas. Overall, the range of statistical parameters in these areas demonstrates the importance of the bias correction of GDPs before their use in climate studies [67]. Overall, the CRU products showed the most consistent spatial pattern and best agreement with the reference data, particularly in central Punjab, with comparatively higher values of CC and lower values of Bias and RMSE.

Trend Detection.
e annual trends of GDPs and reference data series acquired by the MK and TSS approaches at 95% confidence interval (CI) during the study period of 1979-2015 are presented in Figures 7(a)-7(d), showing that the T max and T min GDPs overestimated and underestimated the trends when compared with their respective reference data series, respectively. T max from the CRU, PGF, and CPC products overestimated the insignificant trend magnitude by rates of 0.13, 0.15, and 0.15°C.decade −1 , respectively. However, the reference T max showed an insignificant increasing trend of 0.03°C.decade −1 . e observed trend in T max is consistent with the previous findings [1], where an insignificant positive trend was observed with a magnitude of 0.01°C.decade −1 during the period of 1967-2017. In the case of T min , the reference data showed a significant positive trend with a rate of 0.31°C.decade −1 , while CRU, PGF, and CPC products underestimated the significant increasing trend and yielded magnitudes of 0.23, 0.13, and 0.21°C.decade −1 , respectively. Similarly, the reference DTR indicated a significant negative trend with a rate of −0.32°C.decade −1 , whereas the CRU product underestimated the insignificant negative trend with a slope rate of −0.11°C.decade −1 . e study outcomes indicated that the slope magnitude of reference T min increased at a faster rate than T max , which resulted in higher rate of decrease in DTR. e conspicuous escalation in T min trends over the study region were well captured by the respective GDPs. e present results are well concurred with the findings of [31,68], which reported the faster rate of increase in T min than that of T max data series. However, the corresponding slope magnitude is different, which could be the result of different region, time period, and computational algorithms of GDPs.
Furthermore, the APHRODITE, CRU, GHCNM, UDEL, and CPC T mean products indicated positive trends with magnitudes of 0.16, 0.19, 0.14, 0.07, and 0.17°C.decade −1 , respectively. e slopes of the APHRODITE, CRU, and CPC products revealed significant trends at 95% CI, while insignificant increasing trends were observed in the GHCNM and UDEL products. e highest and lowest accuracies to detect the trends magnitude were observed in CRU and UDEL products as compared with reference T mean data series.
e comparative results of CRU T min and T mean products showed the best performance against reference data for capturing the significance and magnitude of trends. However, the GDPs for T max , T min , and DTR were comparatively less accurate when assessing the significance and magnitudes of respective trends over the study region. Overall, the reference and GDPs T min and T mean data series indicated the significant warming over the study region. However, the trends in T max showed insignificant changes during the study period. Almost all T max GDPs overestimated the magnitude as compared with reference data. Consequently, the T max products were less capable of detecting the trends significance and slope magnitude over the study region.

Abrupt Changes.
Abrupt changes in climate data series reveal the transition from one climate state to another, due to some external factors, at a rate determined by the climate system [69,70]. e Sequential Mann-Kendall (SQMK) test was applied to detect changes in the temperature index trends when evaluating and comparing the different GDPs with reference data series during the period of 1979-2015 over the Punjab region. e retrograde and progressive trend series were attained at the 0.05 significance level. e results of annual transition plots of T max , T min , DTR, and T mean GDPs and the reference data series are shown in Figures 8(a)-8(d). Figure 8(a) for T max of the GDPs (CRU, CPC, and PGF) shows similar trends in the progressive series when compared with the reference data. e reference series showed five transition points during the whole study period. However, all the T max products failed to capture the exact transition points in the T max data series.
e results of abrupt transition in the T min GDPs and reference data series are shown in Figure 8(b) and indicate that the CRU outperformed the other products when compared with the reference data. e CRU and reference data both showed similar patterns of gentle upward drive during the period of 1984. However, the CPC and PGF products were found to be relatively less accurate at detecting transitions in the temperature trend. e transition results of the DTR product and reference data series during the period of 1967-2017 are shown in Figure 8(c), indicating similar patterns with negative shifts in both the CRU and reference DTR series. However, the negative shift was stronger in the reference data series when compared with the CRU product. Moreover, no mutation point was detected in CRU and reference data series throughout the study period. e detection of rapid change points and comparison of T mean in the GDPs (APHRODITE, GHCNM, UDEL, CRU, and CPC) with those of the reference data are shown in Figure 8(d). e results show that the CRU and GHCNM products outperformed the other GDPs at capturing the abrupt transition points (negative during 1997-98 and positive during 2000-01) when compared with reference data. All the other GDPs also showed a reasonable ability to detect the patterns in progressive and retrograde series. However, the transition points of the APHRODITE, UDEL, and CPC T mean products were different from those of the reference data series. Overall, the CRU and GHCNM T mean products showed the best performance in detecting abrupt changes in temperature, when compared with the reference data series. e positive shift in temperature during  1979-2001 revealed that study region experienced a relatively hot period during this time. e results agree well with previous studies that have found hot, dry periods over several parts of Pakistan [71,72]. e transitions in temperature during this period were better captured by the CRU and GHCNM products than by the other GDPs.

Discussion
Accurate and reliable spatiotemporal temperature data acquisition is not only important for studies of climate variations but also crucial for the water resources and agriculture management [73]. e present study used in situ records to evaluate global temperature indices (T max , T min , T mean , and DTR) available as GDPs during the period of 1979-2015 in Punjab Province, Pakistan, by using several spatial and temporal statistical metrics (including trend and abrupt change analyses). e in situ records selected for the evaluation may also be subject to physical and nonphysical changes, and efforts were made to reduce their effects. Similarly, GDPs were selected for the evaluation as they are widely used, have longer time series, and are available at higher spatial resolution. e length of the in situ records and the spatial and temporal scales of the GDPs satisfy the conditions required for evaluation of climate forcing data for hydrological modelling at the regional level as prescribed by the WMO [74].
Results showed that the spatial and temporal performances of the CRU temperature indices outperformed the other GDPs as indicated by their high values of CC and R 2 but lower values of rBias and RMSE. Furthermore, the significance and abrupt transitions in temperature trends of the reference data series were well captured by the CRU T min and T mean products. However, the T max and DTR products were less able to detect the respective trends and abrupt changes. is is the first attempt to evaluate the performance of temperature GDPs over the Punjab region; therefore, the outcomes of this study could be used as a reference point for future studies with similar scope. e results of our investigation were found to be consistent with equivalent findings in other countries and with some previous studies of temperature variability over different parts of the country. e present findings of the superior performance of CRU products were well concurred with the findings of [31], which reported the promising results of CRU product against reference data in terms of higher correlation and lower bias over the Italy. Similarly, the current results are well associated with the findings of [75] that reported the moderate accuracy of CRU air temperature products against the station records by using Taylor statistic approach over the Canadian arctic during the period of 1950-2010. On the contrary, [76] reported the satisfactory behavior of GHCN and UDEL air temperature products by using different statistical metrics over the southwestern region of Brazil.

Advances in Meteorology 13
Moreover, the findings of [77,78] revealed the better performance of APHRODITE and UDEL air temperature GDPs in India and Middle East-North Africa region in terms of higher agreement in correlation, R 2 value, and NSE range between −0.5 and 0.5, respectively. e variation in GDPs performances pointed out the importance of global products evaluation at regional level as it fluctuates with time and region by using different statistical metrics [74]. Furthermore, the current results indicated significant warming trends during the period of 1979-2015 over the study region, which could be attributed to rapid urbanization, deforestation, or population growth across the study region [79]. According to [36], the population of the study region has been expanded exponentially in recent years, which may affect the long-term trends in temperature. e current findings showed significant positive trends in T min with a faster rate than T max , which is consistent with the findings of [31], which reported the significant increase in T min compared to T max by using gridded Berkeley Earth Surface Temperature (BEST) data of spatial resolution of 1°× 1°over the whole country. Similarly, the significant warming and conspicuous increase in T min data series were also reported by [43] for Iran and [48,68] for Italy and India, respectively. However, the difference in magnitude could be associated with the different study period, stations, and region. Moreover, the abrupt change results showed an abrupt transition of temperature in the reference data series during the period of 1997-2001. Several studies have reported severe droughts and hot conditions in the country during this period [80][81][82]. e significant warming during this period, which could be related to the extreme droughts and hot conditions over the study region, was well captured by the CRU GDPs. However, the T max GDPs were less accurate to detect the rate of change and abrupt transition over the study region.
Overall, we found considerable spread in the magnitude and temporal variability among the different GDPs over the study region, with differences reaching 1-2°C between different products within the same category as well as between GDPs and the reference data. e range of uncertainties was more notable in the extreme temperature (T max and T min ) products than in the T mean products. Furthermore, the spatial variability and range of uncertainty in the GDPs over northern Punjab were more prominent than those over central and southern Punjab. However, most of the GDPs underestimated the temperature indices in most of their pixels across northern Punjab. is may have been due to interannual weather variability, fewer gauges, or orographic effects in the northern part of the province. Several studies have documented such effects in climate studies of the northern belt of Pakistan [67,83]. Central and southern Punjab was covered by a relatively smaller number of pixels and showed a weaker ability of GDPs to capture the spatial variability, perhaps due to urbanization in these areas. Most of the gauge networks are installed in nonurban domains, which could impact the estimation accuracy in urban areas. Several studies related to climate variability have indicated that urbanization affects the spatial variability in climate [84]. e spatial and temporal performances of the datasets depend on a number of factors related to the processing of the GDPs, for example, data sources, interpolation techniques, temporal domain, missing data, topography, and spatial resolution [85]. Similarly, the quality, number of stations, and time scale of the reference data used for the comparison of global GDPs are also very important for the identification of potential climate GDPs in specific regions [32]. e detection of autocorrelation in the datasets is another important test to ensure the accuracy of trend detection and abrupt change analyses in the climate data series. However, the presence of autocorrelation was more significant for precipitation products, particularly for highaltitude regions above 4000 m [67]. e superior performance of the CRU temperature index (T max , T min , T mean , and DTR) GDPs over the Punjab region might be due to their better data processing procedures, higher number of gauge stations, and superior interpolation techniques. e CRU product acquired data from 11,800 stations worldwide, covering relatively long historical records and higher spatial resolution (1901-present). e quality of the CRU data is checked through a two-stage process for better consistency and reliability; for more details, see [50,86]. e GDPs used different numbers of stations in different years when calculating the spatial gridded data [32], so it is difficult to estimate the number of stations used by GDPs each year over the study region. e GDPs, even with their intrinsic biases and limitations, are still an important source of information related to climate variability on various spatial and temporal scales.
e global GDPs are also important for climate studies when there is a lack of funding or resources available for collecting field observations. Caution must be exercised when comparing and using the GDPs, since there may be large uncertainties where gauge density is low [87].
Our study summarizes and compares some potential GDPs for temperature indices over Punjab Province. e evaluation results can improve our understanding of the use of GDPs in arid and semiarid regions like Punjab Province. Meanwhile, spatial and temporal discrepancies were also identified, which will be useful in the further application of these GDPs in hydrometeorological applications. e reference data were used as a standard for the assessment of different global products. However, different systematic errors related to the reference data compromise the quality of evaluation process. us, the regions with less number of meteorological stations available with sufficient geographic distribution and the use of global products could be proved satisfactory [76]. Considering the significant biases in all six GDPs tested in the study region, we recommend establishing a correction factor for each dataset before use in further climate studies in the region. Further steps should also be considered, for example, the selection of station networks and homogenization to remove urbanization affects while constructing the GDPs in populated countries such as Pakistan. Moreover, higher-quality reference data should be considered once more accurate gauge data and a denser network are available for the evaluation of spatiotemporal GDPs over Punjab Province.

Conclusion
rough comparison with reference data, this study highlights the spatial and temporal strengths and weaknesses of different temperature GDPs; this will help in the selection of potential GDPs for Punjab Province, Pakistan. Notable differences and similarities in the bias, trends, and abrupt transition in temperature were identified for these different GDPs over the target region. e core findings of the study are listed below. e spatial and temporal performances of the CRU product were better than those of the other GDPs in terms of the higher values of CC and R 2 but lower values of Bias and RMSE. Furthermore, the trends and abrupt change analyses (using MK and SQMK tests) indicate the superior performance of the CRU T min and T mean products in terms of the trend significance, similar spatial patterns, and similar transition points when compared with reference data series during the whole study period and over the entire study region. However, the T max and DTR products were less able to detect the respective trends and abrupt changes. Transition points in the GDP and reference data series indicated significant warming during the period of 1997-2001, which could be the outcome of drought and hot condition in the country during this period [72]. Moreover, GDPs and reference data series revealed a significant change in T min as compared to T max and, as a consequence, higher reduction in DTR over the study region. e present findings are broadly consistent with the studies conducted at local, regional, and neighboring countries [31,43,77]. However, the rate of change and trends magnitude of GDPs data series are different from each other due to different data sources and computational algorithms. Overall, GDPs overestimated the temperature when compared with the reference data series. Nevertheless, the uncertainties were more notable in the extreme temperature (T max and T min ) products than in the T mean products. e temperature index GDPs showed similar patterns and moderate correlation with the reference data.
e spatial accuracy of all the GDPs was poor in northern Punjab, where the underestimation was strongest. e underestimation of air temperature in the northern parts of the country were also reported by different studies [88,89], which simulated the air temperature by using WRF model at the upper Indus Basin and Himalaya ranges, respectively. However, the prior studies reported the different ranges of temperature underestimations, which could be the outcome of different study periods, gauges, and resolution.
In conclusion, this research provides an inclusive comparison of the widely used temperature indices GDPs and enumerates the spatial and temporal inconsistencies in selected GDPs. e results and associated procedures are useful for assessing potential GDPs and for improving our Advances in Meteorology 15 understanding of their application over the study region. Despite finding that the CRU product performed better than other GDPs, uncertainties remain when applying GDPs over semiarid and arid regions like Punjab, as demonstrated by the ranges of RMSE and Bias. e spatial and temporal comparisons of GDPs with reference data showed large discrepancies, with temperature differences of up to 1-2°C between different products within the same category. ese results highlight the need for a further improvement in GDPs and for better accuracy over the arid and semiarid regions. It will also be important to note the number of stations used at each grid scale by GDPs for different regions across the globe. e magnitude of Bias for the GDPs, particularly in northern Punjab, demonstrates the importance of bias corrections before using GDPs in hydroclimate studies.
is evaluation of GDPs in the study area was limited to annual time scales; future studies should focus on higher temporal and spatial resolutions. Our evaluation of the different GDPs for temperature indices will be useful when assessing potential products and their weaknesses before their reliable utilization in hydrological and meteorological applications.

Data Availability
Data used in this study are available from the corresponding author upon request.

Conflicts of Interest
e authors declare no conflicts of interest.

Authors' Contributions
Xin Li and Yingying Chen supervised and designed this study. Zain Nawaz conducted the research and wrote the manuscript. Yanlong Guo and Xufeng Wang assisted in formatting and data analysis. Kun Zhang, Naima Nawaz, and Akynbekkyzy Meerzhan helped in editing the manuscript.