Regime-Switching Temperature Dynamics Model for Weather Derivatives

Weather is a key production factor in agricultural crop production and at the same time the most significant and least controllable source of peril in agriculture. These effects of weather on agricultural crop production have triggered a widespread support for weather derivatives as a means of mitigating the risk associated with climate change on agriculture. However, these products are faced with basis risk as a result of poor design and modelling of the underlying weather variable (temperature). In order to circumvent these problems, a novel time-varying mean-reversion L\'evy regime-switching model is used to model the dynamics of the deseasonalized temperature dynamics. Using plots and test statistics, it is observed that the residuals of the deseasonalized temperature data are not normally distributed. To model the non-normality in the residuals, we propose using the hyperbolic distribution to capture the semi-heavy tails and skewness in the empirical distributions of the residuals for the shifted regime. The proposed regime-switching model has a mean-reverting heteroskedastic process in the base regime and a L\'evy process in the shifted regime. By using the Expectation-Maximization algorithm, the parameters of the proposed model are estimated. The proposed model is flexible as it modelled the deseasonalized temperature data accurately.


Introduction
From tilling of the farmland to selling of the output of the crop yield, farmers around the world make countless decisions that affect their performance. Yet, there is one very important factor that they cannot control, climate. The world's climate keeps on changing and this change will persist at rates that are projected to be out of the ordinary for some centuries [1]. Africa is no exception of these extreme climate changes across the world. Extreme climate events cause strain on food security, water resources, and human health in Africa. Ordinarily, it is the cause of limited economic growth and obstructs poverty reduction efforts for most countries in Africa [2]. With agriculture being the major contributing factor of the gross domestic product (GDP) growth of most countries in Africa [3] and climate conditions having extensive and causal correlation with the production variables [4], there should be an effective management technique to hedge against agricultural production risk. Most agricultural producers have encountered crop failures because of extreme weather conditions due to changes in climate. As a result, most farmers in Africa have developed their own traditional ways to improve the effect of extreme weather changes.
As an institutional response to weather changes, the Chicago Mercantile Exchange (CME) introduced the weather derivative (WD). WD has been in existence in most developed countries (Canada, Europe, USA, and Japan). However, most farmers in Africa have rarely heard about this effective hedging tool. WD, if introduced in Africa, will be more viable, reliable, and efficient to the agricultural industry and can hedge against the increasing weather changes that affect agriculture since it is devoid of factors like loss adjustments, moral hazards, adverse selections, high premiums, and complex information requirements. To avoid 2 International Journal of Stochastic Analysis basis risk associated with WD, there should be an efficient model for the underlying weather variable used in pricing WD. The weather variable considered in this study is temperature. Temperature controls and influences other elements of weather like clouds, humidity, air pressure, and precipitation that affect crops during and after crop production (https://en.wikibooks.org/wiki/Basic Geography/Climate/ Climate Elements, accessed on 26/01/2018.) In the last decade, there has been empirical literatures on modelling the dynamics of temperature. Dischel [5] was the first to propose a continuous stochastic model for temperature. He modelled temperature as a mean-reverting process by adapting directly the Hull-White model. The noise process in his model was driven by two Wiener processes corresponding to the distribution of the temperature and the distribution of the changes in temperature. Thereafter, McIntyre and Doherty [6] proposed a meanreverting SDE with a constant volatility daily average temperature at Heathrow airport in the United Kingdom (UK). Dornier and Queruel [7] disagreed with the direct use of the Hull-White model adopted by Dischel. They rather used a conventional Autoregressive Moving Average model rather than the AR(1) model proposed by Dischel. By replacing Brownian motion with fractional Brownian motion, Brody et al. [8] modelled the evolution of temperature that allowed the integration of a long memory effect. Other researchers [9][10][11] used different kinds of mean-reverting OU model driven by Brownian motion. In contrast to other researchers using Brownian motion to capture the residuals, Benth anď Saltytė -Benth [12] proposed an OU model that incorporate seasonal volatility and mean. In the model of Benth anď Saltytė -Benth, the residuals were driven by the generalized hyperbolic Lévy process rather than Brownian motion. The process they employed was an adjustable class of Lévy process that captured the skewness and semiheavy tails properties of the residuals.
Clearly, it can be observed that most of these researchers assumed no changes in state of the dynamics of temperature and hence modelled temperature dynamics as a single regime. The above methods of modelling temperature may lead to intractable pricing techniques for temperature derivatives. As noted by Brockett et al. [13], temperature time-series data shows sudden changes due to artificial and natural factors. By employing regime-switching models, the researcher can capture such sudden and discrete shifts in the temperature dynamics. Regime-switching models do capture most of the stylized facts of temperature accurately more than the single stochastic differential equation model, hence the need for different stochastic model for each switching state.
With a mean-reverting process as their base regime and a Brownian motion with mean different from zero as their shifted regime, Elias et al. [14] presented a constant volatility two-state MRS model for temperature dynamics at the city of Toronto, Canada. The model of Elias et al. (from hence we will call the model developed by Elias et al. as Elias' Model) failed to capture the fact that volatility of temperature varies with varying temperature as it goes through discrete changes between the states of the regime process. Evarest et al. [15] improved on Elias' model by capturing the fact that volatility of temperature varies as temperature goes through discrete changes between the states of the regime. They priced weather derivatives contracts based on the daily temperature dynamics. They used their model to calculate the future contract of HDD, CDD, and CAT indices. The introduction of the local volatility in the base regime helped in capturing well the dynamics of the underlying process. This led to a better pricing process as compared to Elias' model. However, they failed to capture the extreme and fat tail characteristics of temperature data in their model. In his seminal thesis, Cui [16] modelled and priced temperature derivative. He modelled the dynamics of temperature by a standard mean-reverting Ornstein-Uhlenbeck process with a general Lévy process as the driving noise. He extended his model by proposing a continuous-time autoregressive (CAR) model driven by a general Lévy process which he calibrated to the Canadian data. The two models he proposed were used in deriving futures price on HDD, CDD, and CAT. He later developed a two-state MRS model with a "normal" regime and a "jump" regime. The "normal" regime depended on a standard OU process. For the "jump" regime, he used different noise process (Brownian motion with more extreme drift and volatility) to drive the abnormal positive or negative "jumps" in the temperature dynamics. However, he failed to capture the changes in volatility of temperature during the MRS model but rather assumed a constant volatility in both regimes.
Several models have been formulated over time to capture the stylized facts of temperature; however these models proposed in literature have failed to capture well the stylized features of temperature, thus affecting the pricing models of WD. Inaccurate representation of the dynamics of temperature affects the pricing of WD. WD also relies on accurate extensive long-term time-series data [17]. However, there is lack of accessible, accurate, complete, and usable weather data in most African countries. Calibrating the MRS model is not trivial because the regimes are not clearly observable but latent. To outwit these problems, we use Expectation-Maximization (EM) algorithm to estimate the parameters in the model.
From the above literatures presented, Brownian motion has been replaced with a fractional Brownian motion and subsequently by a generalized hyperbolic Lévy processes. Nevertheless, it will be interesting to explore both Brownian motion and Lévy processes in a MRS model that incorporates "normal" temperatures and "extremes" in temperature. The contribution of this paper is twofold; firstly we developed a mathematically tractable temperature dynamics model for the African farmer by using regime-switching model and secondly we showed that Gaussian distribution cannot capture the dynamics of reallife temperature. To the best of our knowledge, the twostate regime-switching model developed is the first kind of model that can be used to price futures and options on futures.
International Journal of Stochastic Analysis 3

Daily Temperature Dynamics
The most widely used temperature indices in most industries (energy consumers, energy industry, travel, transportation, agriculture, government, retailing, and construction) are the cumulative average temperature (CAT), cooling degree days (CDD), and heating degree days (HDD). Nevertheless, in this research, we use the CAT and growing degree days (GDD) since they are the dominant indices that affect agriculture in Africa [18,19]. GDD is the measure of the suitability for a crop to grow in relation to the standard temperature.
Definition 1. For a given single temperature weather station, let ( ) and ( ) represent the daily maximum and minimum temperature (the temperatures used in this research are measured in degrees Celsius) recorded at day , respectively. We define the daily average temperature at day as . (1) Definition 2. Assume the daily average temperature (DAT) ( ) at time ≥ 0, and then the and generated at a specific location over a specific measurement period [ 1 , 2 ] are defined as 2.1. Stylized Facts of Temperature. Temperature has clear characteristics which differs largely from commodities and other financial assets. The most palpable characteristics of temperature are the following.
(i) Seasonality Feature. Temperature exhibits annual (365 days) seasonal movements. The DAT ( ) at time ≥ 0 is defined as the sum of the deseasonalized temperaturẽ( ) and deterministic seasonal component ( ) given as To model the variations of temperature without the deterministic seasonality, the seasonal component in (4) will be removed to obtain the deseasonalized temperaturẽ( ). The deterministic seasonal model at time , ( ), is defined as where 0 and 1 represent the constant and coefficient in the linear seasonal trend of the raw data, respectively, 2 captures the amplitude of the variation, and is the phase angle.
(ii) Mean-Reverting Feature. It is practically impossible for daily temperature to deviate from the mean temperature over a long period. Daily temperature reverts toward the mean, a feature that is common to other commodities. As observed by Alaton et al. [9], long-term changes may be as a result different factors which includes but are not limited to global warming, green-house effects, and urbanization.
(iii) Extreme Feature. Temperature data have extremal data points. These extremal data points are "abnormal" movement caused by abrupt changes in temperature. In contrast to stocks which usually exhibit jumps in their price movements, daily temperature can show some signs of spikes which are normally short-lived and of very extreme size.
(iv) Locality Feature. Temperature has a strongly localized response in temperature modelling and as such requires caution in making generalization, hence the need for different models to capture these different characteristics at different locations.
(v) Volatility. In their two-state regime switching model formulation, Elias et al. [14] considered a constant volatility in either sate of his model. But this assumption might not be a reality since a shift in temperature residuals from one state to the other causes a change in the volatility from one state to the other. Extremal data points in temperature residues have greater volatility effects than is the case when there are no spikes or sudden increase in temperature. This is ascertained in the Engle test performed to check for heteroscedasticity in the temperature residuals (see Table 2). In model (9), the volatility is assumed to be dependent on the current deseasonalized temperaturẽ( ). More precisely, the higher the deseasonalized temperature level, the larger the changes in the deseasonalized daily average temperature. Hence, in this study we will propose a model whose volatility differ with each regime and underlying process.

Markov Regime-Switching (MRS) Model
The Markov switching model developed by Hamilton [20] and Hamilton [21] inferred that the distribution of a variable is known, conditional on the occurrence of a specific regime/state. The switching process between the regimes is Markovian and is determined by an unobserved random variable. The underlying regimes, however, do not necessarily have to be Markovian but should be independent. The daily temperatures do change from day to day and these changes are not directly observable but latent. Therefore, statistical inference with regard to the likelihood of occurrence of each of the regimes at any time should be drawn. MRS has been used effectively in modelling the behaviour of the stock market and spot price of electricity [22][23][24][25]. Chevallier and Goutte [26] used sixteen international stock markets to compare the performance of regime-switching Lévy models. Chevallier and Goutte [27] developed an estimation methodology that provided a better fit for electricity and 2 market prices by using mean-reverting Lévy jump processes.
In temperature modelling, it is typical to assume that there are different regimes that can capture distinct principal weather condition or the localized weather behaviour. In our study, the daily temperature is assumed to be latent with two possible regimes, either in the base regime ("normal or mean-reverting regime" = 1) or in the shifted regime ("extreme" regime = 2). Suppose that each regime in the regime-switching model undergoes discrete shifts between the regimes of the process, and then follows a first-order Markov process with the transition matrix: The transition probabilities of our temperature process in (6) is given as 0 ≤ ≤ 1 Due to the Markov property of the states at any given time , the future state of the underlying process (temperature) +1 is independent of the past state −1 of the underlying process given the present state of the underlying process.

Modelling Daily Temperature Dynamics.
To efficiently model the dynamics of temperature, it is assumed that the deseasonalized temperature is either under base regime or shifted regime and each regime is independent and parallel to the other regimes. The deseasonalized temperaturẽ( ) is assumed to be driven by two sources of randomness: a Markov process and Lévy process. We assume a constant mean-reversion rate in the base regime. Based on the stylized facts of temperature, a regime-switching stochastic model that describes the dynamics of temperature is formulated. This model can be used to price weather derivatives. The base regime model is assumed to follow a mean-reverting stochastic process with a time-varying volatility. The residuals of the base regime are assumed to be generated by a Brownian process.
To effectively capture the nonnormality of the temperature residuals (see Figures 3,5,6, and 7 and Table 5), the residuals of the shifted regime are captured by a Lévy process. By comparing the generalized hyperbolic distribution to its subclasses (normal-inverse Gaussian, Hyperbolic, and Variance-Gamma), we were able to find the best distribution that can model the asymmetry and heavy tails of the residuals data. As our first regime-switching model, we call it time-varying mean-reversion Lévy (TML) regime-switching model. This proposed model is distinctly appropriate to capture the dynamics of temperature. In sequel, the propose TML model for the deseasonalized temperature dynamics is given as wherẽis deseasonalized daily volatility of the base through time and is the volatility of the shifted regimes and is the mean-reversion rate of the deseasonalized temperature in the base regime which reverses the deseasonalized temperature to the long-term equilibrium level after the deseasonalized temperature has drifted from this equilibrium.
∼ (0, ) is the standard Brownian motion. is a Lévy process which is cádlág, adapted, real-valued general Lévy process with independent, stationary increments and stochastically continuous, and̃is the deseasonalized temperature at time .

Proposition 3.
If the deseasonalized daily average tempera-turẽ( ) follows model (9), then the explicit solution is given bỹ( Proof. Determining the stochastic integral of the base regime process demands a variation of parameters approach to spell out a new function [ ( ), ] =̃( ) − . By Itô's lemma, the derivative of the new function can be found.
For the shifted regimẽ

Analysis of Temperature Data
The daily maximum and minimum surface temperature data were taken from the weather measurement stations at Bole and Tamale. Bole and Tamale are located in the International Journal of Stochastic Analysis 5 The raw data is checked for missing data to avoid gaps in the historical data. Depending on the size of the missing data (the proportion of the missing data should not be more than 10%), the missing data is filled using the method of combined average. The combined average is calculated using two distinct averages: the average of 7 days (d) after and before the missing day, and the average of that missing day across previous years, The missing values in the dataset are filled by averaging the calculated value in (13) and (14). In Table 1, the descriptive statistics for the daily average temperature of the two measurement stations (Bole and Tamale) are presented. The values of the median, mean, maximum, and minimum temperature for both towns are consistent and this can be attributed to the fact that the geographical locations of these two measurement stations are not distant apart. The amount of variation (std) is relatively small but vary between two measurement stations. With a skewness value of 0.41 and 0.31 for Bole and Tamale, respectively, the empirical distribution of these two towns are asymmetrical. With a negative excess kurtosis for both towns, it can be explained that the distribution of the DAT data is more outlier-prone than the normal distribution. We present the values 2 -statistics of Pearson's criteria of goodness-of-fit with its P values of the DAT time-series data (see Table 1). From the values of the 2 goodness-of-fit test and at a 1% -level of significance, the null hypothesis (DAT data is normally distributed) can be rejected. With a Hurst exponent (H) greater than 0.5 for the two towns, there is a strong trend in the DAT data. However, the trend in Bole DAT data is more predictable than that of Tamale.

Seasonal Component.
Generally, temperature follows a seasonal pattern. These seasonal patterns can be decomposed into a seasonal trend and linear trend depending on the region where the data was taken and the number of years of temperature data used. The seasonal component in the DAT time-series data is captured in model (5). In order to calibrate the DAT time-series data to the proposed model (9), the DAT time series will be deseasonalized. Figure 2 shows the deseasonalized data of the daily average temperature for Bole and Tamale.
From Figure 1, there is a strong seasonality in the DAT time-series data. Temperature exhibits a seasonal trend in Africa; higher temperatures during the dry seasons; and lower temperatures during the rainy seasons. The seasonal trend is captured in model (5). The seasonal component of model (5) is given as 2 sin((2 /360)( − )). The DAT data is calibrated to the model and, using the least square method of estimation, the parameters are estimated.
Even though the linear trend in the data is weak, a close observation (see Figure 1) shows an increasing trend in the DAT time-series data. The linear trend in our data may be as a result of the longer period of years taken. Also, as observed by Alaton et al. [9], this increasing trend of DAT over some decades at this location is as a result of many factors which include but are not limited to global warming, green-house effect, and urbanization. The linear component of model (5) is given as 0 + 1 . Figure 4 shows the squared residual plot of the deseasonalized daily average temperature data of Bole and Tamale. There is evidence of continuous variation in the variance of noise, an indication of seasonal heteroscedasticity. This shows that volatility of the deseasonalized DAT residuals is not constant as assumed by Elias et al. [14]. To further validate this result, we use the Engle test of residual heteroscedasticity to test for conditional heteroscedasticity. From Table 2, we can reject the null hypothesis of no conditional heteroscedasticity and conclude that there are significant heteroscedasticity effects in the residual series of both towns. With a p value of 0 and at a 1% -level of significance for both towns, there is a strong evidence to reject the null hypothesis of no conditional heteroscedasticity effects.

Normality Test of Temperature Data Residuals.
In this section, statistical and graphical methods are applied to the DAT residuals time-series data to test for the normality of the residuals.
Most conventional literatures [9,14,15] assumed that the residuals are independent and identically and normally distributed. It should however be noted that inaccurate choice of the distribution of this residuals can cause model error and   mispricing when pricing weather derivatives. To model the residuals of the DAT data, it is important to test the residuals for normality using different goodness-of-fit test. Table 3 shows the normal distribution 2 -statistics of the residuals. The values of 2 for Bole and Tamale are significant at 99% confidence level implying that the residuals of Bole and Tamale are not normally distributed. The descriptive statistics of the residuals of Bole and Tamale are presented in Table 4. From the Hurst exponent in Table 4, the residuals of Bole and Tamale DAT have strong trend and are more predictable. The skewness and kurtosis in Table 4 further show that the residuals are not normally distributed. These results are congruous to the Q-Q plot for normal distribution  Figure 5) and the normal fit plot ( Figure 6). Figure 7 shows that there are cases of outliers in the residuals of the daily average temperature of Bole and Tamale. Additionally, using the Jacque-Bera (JB) goodness-of-fit test and the Anderson-Darling (AD) test for normality (see Table 5), we can reject the null hypothesis that the residuals 8 International Journal of Stochastic Analysis  are normally distributed at a 99% confidence level. From the test statistics (Table 5) and graphical representations ( Figures  5-7), there is enough evidence to state that the deseasonalized DAT residuals of Bole and Tamale do not follow the normal distribution. Thus, it can be concluded that it is not efficient to model the random noise with a Gaussian process. Due to the inability of the normal distribution to capture well the residuals of the deseasonalized DAT data, we propose using the Lévy process to model the residuals of the shifted regime. We use the generalized hyperbolic (GH) distribution and its subclasses (normal-inverse Gaussian (NIG), hyperbolic (HYP), and the variance-gamma (VG)) to capture the skewness and semiheavy tails in the residuals data. We test for the best fit for our residuals data using the above named distributions.

Generalized Hyperbolic (GH) Distribution.
We model the residuals of the shifted regime by the generalized hyperbolic (GH) distribution and the subclasses which are relevant for applications.
Definition 4. The one-dimensional generalized hyperbolic (GH) distribution was introduced by Barndorff-Nielsen [28] and its probability density function is defined as where , V is the modified Bessel function of the third kind of order V, and ∈ R Each parameter in (V, , , , )) distribution has different effect on the shape of the distribution: V ∈ R determines the characterization of subclasses of the GH distribution, > 0 controls the steepness around the peak (the larger , the steeper the density), with 0 ≤ | | ≤ is the asymmetry parameter ( = 0 gives a symmetric distribution around and the skewness of the density increases as increases), is the location of the distribution, and > 0 is the scaling. A proper choice of these parameters can help in describing different shapes of the distribution.
Suppose a random variable follows a generalized hyperbolic distribution, and then the characteristics function (moment generating function (MGF) or the cumulant function) is given as The subclasses of the GH distribution include the normalinverse Gaussian (NIG) distribution, the hyperbolic (HYP) distribution, and the variance-gamma (VG) distribution.
Definition 5. For V = −0.5 in the GH distribution, we obtain the normal-inverse Gaussian (NIG). The probability density function (pdf) of a NIG distribution ( , , , ) of a random variable is an infinitely divisible distribution which is given as where 0 ≤ | | ≤ , 0 ≤ , and , ∈ R. V is the modified Bessel function.
The NIG distribution was introduced in finance literature in 1997 by Barndorff-Nielsen [29]. NIG distribution has a heavier tail than the normal distribution and can take different kinds of shapes. The MGF of NIG distribution is The NIG distribution have the following properties: (1) Conditioned that = 0, → ∞, and / = 2 , the NIG distribution will approach the normal distribution ( , 2 ).
International Journal of Stochastic Analysis 9 (2) Assuming that ∼ ( , , , ), then, ∀ ∈ R + and ∀ ∈ R, we have that = + ∼ ( / , / , + , ) Definition 6. The Hyperbolic Distribution (HYP) is a subclass of the GH distribution when V = 1. Given a hyperbolic random variable , the pdf is given as where , ∈ R, 0 ≤ , | | < and 1 is t the Bessel function of the third kind with index 1 The hyperbolic distribution can also capture semiheavy tails. The MGF of the hyperbolic distribution is The hyperbolic distribution is not closed under convolution.
The Variance-Gamma density was introduced by Madan et al. [30]. The tails of VG distribution decrease slowly than the normal distribution. The class of VG distributions is closed under convolution. The MGF of VG distribution is To find the numerical values in model (5), the transformed deterministic seasonality process (model (23)) is fitted to the DAT data. The parameters are estimated using the least square methods.

Estimation of the TML Parameters.
Estimating the parameters of TML model is not trivial. The parameters of the model are estimated using the Expectation-Maximization algorithm developed by Dempster et al. [31]. The vector of unknown parameters will be estimated by two steps' iterative algorithm: an expectation-(E-) step and a maximization-(M-) step.
Discretization. Even though temperature is a continuous process, its data is not recorded continuously but rather recorded in discrete time points. Therefore, estimating the parameters in a continuous time will be computationally costly. Before the parameters of the TML model are estimated, they will have to be calibrated to the deseasonalized DAT data and this is possible by transforming the model from its continuous form to a discrete form. The discretized form of model (9) for the base and shifted regimes is given as where ( ) and ( ) are the Weiner residual and Lévy residual, respectively.

E-
Step. Assume the length of the DAT historical data is + 1 and = 0, 1, 2, 3, . . . , where represents a specific time that the DAT is recorded and is the computed vector of parameters in the ℎ iteration. The conditional distribution of the regimes for time update values of = 0, 1, 2, 3, . . . , will be calculated. Suppose where Θ = { 1 , 2 } and (̃( ) | = ; F̃( ) −1 ) is the density of the underlying regime process at time conditional that the underlying process was in regime . The conditional probability density function for the base and shifted regimes will be calculated from the CDF.
From (25) The probability density functions (pdf) of the base and shifted regimes based on their diffusion and drift coefficient is, respectively, given as

M-
Step. By maximizing the expected log-likelihood function Θ ( +1) = argmax (Θ | Θ ( ) ), the maximum likelihood (ML) estimate Θ ( +1) for the vector of unknown parameters will be calculated. Also, in this step the transition probabilities of the regime-switching model will be estimated. The log-likelihood function from the conditional probability density function of both regimes is, respectively, given as log By maximizing the log-likelihood function presented in (30), the vector of parameters ( +1) 1 can be estimated.
Proof. The first derivative of (30) with respect to 1 is By maximizing (34), The derivative of (30) with respect to is By maximizing (36), Also, the vector of unknowns of ( +1) 2 can be estimated by maximizing (31): Proof. By finding the derivative of (31) with respect to 2 , Maximizing (40) gives The derivative of (31) with respect to is By maximizing (42) International Journal of Stochastic Analysis The transition probabilities are estimated by making use of the formula proposed by Kim [32]:

Discussion and Results
By inserting the estimated parameters into the transformed deterministic seasonality process (model (23)), the deterministic seasonal DAT is obtained for Bole and Tamale. The linear trend in the model is evidently very small. However, the linear trend of Bole DAT is more evident than that of Tamale DAT.
Using the parameter estimates values in Table 6, seasonal sine graph is fitted to our Bole and Tamale DAT data (see Figure 8).
The estimated parameters of normal, HYP, GH, NIG, and VG distributions are presented in the Table 7.
To test for the goodness-of-fit of the distributions, two distance measures are used; the Kolmogorov-Smirnov (K-S) and the Anderson-Darling (A-D) statistic are used. The K-S test statistic and A-D test statistic are used to summarize the difference between the fitted cumulative density function (cdf) and the empirical cdf. Comparative to Kolmogorov-Smirnov, the A-D test statistics is more powerful because it incorporates integration over the entire range of data by paying more attention to the tail distances. The lower the value of K-S and A-D test statistics, the better the fit of that distribution.
Regardless of the test used, the distance between the fitted hyperbolic distribution and its empirical distribution is lower compared to the other distributions and their empirical. This affirms that the hyperbolic distribution does fit well to our Bole and Tamale random noise. A series of random numbers for GH, NIG, HYP, and VG are generated using the parameters estimated in Table 7. The Q-Q plots of the quantiles of the residuals versus the randomly generated quantiles of the GH, NIG, HYP, and VG distributions are plotted (see Figure 9). The straight line (in red) shows how the residual data would behave if it is perfectly distributed with the GH, NIG, HYP, and VG. From the illustrated figures (Figure 9), it is evident that the hyperbolic distribution fits plausibly well for our Bole and Tamale random components than the normal, GH, NIG, HYP, and VG distributions. This is consistent with the A-D and K-S goodness-of-fit test in Table 9.
The deseasonalized DAT̃and the conditional probability of being in the extreme regime P( = 2) for the historical deseasonalized DAT are shown in Figure 10. The deseasonalized DAT that are categorize as "extremes," that is, with P( = 2) > 0.8, are represented by red dots.
The estimates of the TML model for deseasonalized temperature is presented in Table 8. The speed of the meanreversion is fairly low for both Bole and Tamale. However, the mean-reversion rate of Tamale is higher than that of Bole. The Markov probability 11 of the deseasonalized DAT to stay in the "normal" regime of the TML model at Bole and Tamale is higher than the Markov probability 22 of the deseasonalized DAT to stay in the "extreme" regime at Bole and Tamale. We can conclude that the "normal" regime of the TML model at both Bole and Tamale is relatively stable comparative to the "extreme" regime of the TML model. However, it is evident that there are instances that the temperatures of Bole and Tamale are at their extremes since the probability of staying in the "extreme" regime is significant even though it is small comparative to the probability of staying in the "normal" regime.

Conclusion
In this research, a novel time-varying mean-reversion Lévy regime-switching (TML) temperature dynamics model which captures the normal variations and extreme variations in temperature is developed to characterize the stochastic dynamics of temperature. The model includes a time-varying volatility and a Lévy process giving rise to the innovations. The parameters of the TML model is estimated by using a robust estimation method called the Expectation-Maximization (EM) algorithm.
A study of the Bole and Tamale historical DAT data showed that the deseasonalized DAT data has the meanreversion property. Using plots and test statistics, it was observed that the residuals of the deseasonalized data are not normally distributed. To model the nonnormality in the residuals, we employed the generalized hyperbolic, normalinverse Gaussian, and hyperbolic distributions to capture the skewness and semiheavy tails in the residuals. The hyperbolic  distribution was found to be the best distribution that can capture the semiheavy tails and skewness in the empirical distributions of the residuals in the shifted ("extreme") regime. The introduction of the generalized hyperbolic and its subclasses led us to use the Lévy process in the shifted regime of the TML model for the deseasonalized temperature. The proposed regime-switching model is flexible as it modelled the deseasonalized temperature data reasonably well. Also, it was observed that there are instances that the temperature at both measurement stations are at their extremes as stated in the introduction.
From our results, it is evident that, due to the changes in volatility of the daily average temperature dynamics, conventional models that depend on Gaussian distribution will be ineffective when pricing weather derivatives in the derivative market. Our model however can be used to price weather derivative contracts written on temperature indices (CAT and GDD) for farmers in Africa since it incorporates the stylized facts of temperature in the model.

Data Availability
The data used to support the findings of this study are available from the corresponding author upon request.