Modelling Trends of Climatic Variability and Malaria in Ghana Using Vector Autoregression

Malaria is considered endemic in over hundred countries across the globe. Many cases of malaria and deaths due to malaria occur in Sub-Saharan Africa. The disease is of great public health concern since it affects people of all age groups more especially pregnant women and children because of their vulnerability. This study sought to use vector autoregression (VAR) models to model the impact of climatic variability on malaria. Monthly climatic data (rainfall, maximum temperature, and relative humidity) from 2010 to 2015 were obtained from the Ghana Meteorological Agency while data on malaria for the same period were obtained from the Ghana Health Service. Results of the Granger and instantaneous causality tests led to a conclusion that malaria is influenced by all three climatic variables. The impulse response analyses indicated that the highest positive effect of maximum temperature, relative humidity, and rainfall on malaria is observed in the months of September, March, and October, respectively. The decomposition of forecast variance indicates varying degree of malaria dependence on the climatic variables, with as high as 12.65% of the variability in the trend of malaria which has been explained by past innovations in maximum temperature alone. This is quite significant and therefore, policy-makers should not ignore temperature when formulating policies to address malaria.


Introduction
Malaria is considered endemic in over hundred countries across the globe and many cases of malaria and deaths due to malaria occur in Sub-Saharan Africa. The disease is of great public health concern since it affects people of all age groups more especially pregnant women and children because of their vulnerability [1,2]. Malaria is caused by blood parasites transmitted from person to person through the bites of infected mosquitoes [3,4]. In the absence of prompt and effective treatment, malaria remains a life-threatening disease and about 3.2 billion people, which are almost half of the world's population, are at risk of it [4]. As such, malaria has been described as an entrenched global health challenge. However, to a lesser extent, Asia, Latin America, the Middle East, and parts of Europe are also affected. Malaria can thus be said to affect people of all regions, both poor and rich [5].
Malaria is not only the burden of the health sector in various countries, but it also permeates every aspect of the social as well as economic lives of their people [6]. Indeed, attempts to control malaria in many Sub-Saharan African countries began decades ago. The aim was to reduce the malaria disease burden until it was no longer of public health significance. It was also recognized that malaria could not be controlled by the health sector alone; therefore multiple strategies were also put in place with other health related sectors. In view of this, a lot of interventions were put in place to help in the control of the deadly disease, but malaria continues to be the leading cause of morbidity (illness) in Ghana [7]. According to Rogers and Randolph [8], malaria is an extremely climate sensitive disease and Haines et al. [9] indicated that observations of short-term variations in climate or weather show that even small temperature increases and precipitation changes can result in measurable impacts on malaria. Current evidence, according to Githeko et al. [10], shows that interannual and interdecadal climate variability have influence on the epidemiology of vector borne diseases. At the continental level, this evidence has been assessed to help determine the possible consequences of climate change in the future. Climate change was defined by Palut and Canziani [11] in the Fourth Intergovernmental Panel on Climate Change (IPCC) Assessment Report as change in the state of the climate that can be identified (e.g., by using statistical test) by changes in mean and/or variability of its properties, and that persists for an extended period typically decades or longer. Climate change has therefore been a key issue around which global development policy is being framed especially in African countries and it is considered as one of the most serious environmental and human threats.
Adu-Prah and Tetteh [12] are of the opinion that generating relevant information on the role of temperature, rainfall, and humidity on malaria prevalence at different geographic scales is critical to efforts to combat the burden of prevalence. For effective climate change adaptation strategies which involves planning and the necessary implementation of disease control interventions appropriately, a better understanding of the relationship between climatic variables and malaria is needed.
Studies have been carried out that link climatic variability and incidence of malaria. For instance, Sena et al. [13] carried out studies which showed on the overall that monthly mean rainfall correlated positively but weakly at lag months two to four and relative humidity also correlated positively but weakly from zero to four months lag with occurrence of malaria in Southwest Ethiopia. Similarly, Weli and Efe [14] conducted a study that examined the effect of climate on malaria occurrence in Port Harcourt. The results showed that prevalence of malaria is significantly dependent on the increase in rainfall and decrease in temperature. Another study carried out by Omonjo et al. [15] indicated that an increase in air temperature and sea surface temperature was associated with monthly malaria occurrence in the derived savannah and humid forest zones at various percentages. Time series plots revealed that malaria cases are common to both zones of the state and it occurs throughout the year but more prevalent at the beginning of rainy season. Haque et al. [16] also carried out studies to determine the effect of climatic variability on malaria incidence in Bangladeshi Highlands. They however did not find any association between temperature, humidity, rainfall, and malaria cases.
Meanwhile, the results from a study by Asare and Amekudzi [17] to assess climate driven malaria variability indicated that the intra-and inter-agro ecological variability in terms of intensity and duration of malaria transmission are predominantly controlled by rainfall. Also, Darkoh et al. [18] investigated the effects of climatic variables on malaria incidence in Ghana. A time series analysis was carried out. From their analysis, they detected that mean minimum and maximum monthly temperatures lagged at three months were significant predictors of malaria. However, rainfall was found not to be significant. In a study carried out by Klutse et al. [19], the transition and coastal savannah zones of Ghana were chosen and used as study sites. Their results showed that maximum temperature was a better predictor of malaria trends than minimum temperature or precipitation especially in the transition zone. Krefis et al. [20] modelled the relationship between precipitation and malaria incidence in children from a Holoendemic Area in Ghana. They conducted a time series analysis using cross-correlation function and autoregressive modelling on weekly basis in two village clusters in the study site. They observed that the regression model showed that the level of rainfall predicted the malaria incidence after a time lag of nine weeks for both clusters. Further, the cross-correlation functions showed a strong association between rainfall and the malaria incidence one or two weeks later dependent on the village cluster. This implies that the past period values for the cross-correlation function are much shorter than the past period values for the regression models although both tests implied there is a relationship between rainfall and malaria incidence. Their results indicated that malaria incidence can be predicted directly by high-resolution (large amount of) precipitation data in highly endemic areas indicating that a strong temporal relationship between rainfall and incidence of malaria exists.
Within Ghana, there are variations in climatic conditions during any one season or period since the northern part is mostly savannah, the middle belt tropical forest, and the coastal area savannah. These differences will have direct links to mosquito habitats and therefore the transmission of malaria. Therefore, a knowledge of climatic variability and its influence on the incidence of malaria can be used to predict malaria occurrences and hence provide early warning system information to health administrators and decision-and policy-makers for the control of the disease. In relation to modelling malaria and climatic variability, the use of multivariate models, especially, vector autoregression models, is however limited. Little has been done in the area of developing multivariate models that combines climatic variability to predict the seasonal prevalence of malaria cases in the country and inform the development of warning systems. Most studies used univariate methods in predicting malaria incidence. This study will develop a multivariate model in the form: where ∏ i are (n×n) coefficient matrices and t is an (n×1) unobservable zero mean white noise vector process (serially uncorrelated or independent) with time invariant covariance matrix Σ. The objective of this study is to model the trends of climatic variability and malaria in Ghana using vector autoregression. The study area falls within tropical forest zone in Ghana and is dominated by the maritime air from the south and this results in two rainy seasons throughout the year. The first begins in April and ends in July whereas the second begins in September and ends in November. These two rainy seasons are both followed by the dry season.

Data Collection.
The quantitative data consisted basically of secondary data from the Ghana Meteorological Agency and Health Service. The meteorological variables (2010-2015) include rainfall, maximum temperature, and relative humidity and were on mean monthly basis. The malaria data were also obtained based on monthly cases for the years 2010-2015 in the Kumasi Metropolitan Area. [21] has been extensively used in econometrics for the analysis of multivariate time series. It is a natural extension of the univariate autoregressive model to dynamic multivariate time series and has superior forecast ability compared with those of univariate time series model. It also determines how each endogenous variable responds over time to a shock in its own value and in every other variable and allows the data to guide the researcher.

Vector Autoregression Model. Vector autoregression (VAR) model developed by Sims
The basic form of the VAR model of order p suggested by Sims [21] has the form For this study, = ( , , , ) , where Malaria represent the number of malaria cases per month, Tmax represent the maximum temperature, Rain is the amount of rainfall (mm), and RH is the relative humidity (%). contains all deterministic variables which may consist of a constant, linear trend or seasonal dummy variables. is a K-dimensional unobservable zero mean white noise process with positive definite covariance matrix ( ) = Σ . and C are parameter matrices of suitable dimension on which you can impose various restrictions.
The parameters in the model are estimated by generalised least squares.

Augmented Dickey Fuller Test. The Augmented Dickey
Fuller (ADF) test is a unit root test for stationarity. The null hypothesis for the ADF test is that there is a unit root while the alternative hypothesis differs slightly according to the equation used. The simplest alternative hypothesis is that the time series is stationary. Unit roots can cause unpredictable results in time series analysis.

Optimal Lag Length Selection Criteria.
For a range of lag orders the individual equations of the system are estimated by OLS. The optimal lag order is selected by minimizing one of the following Information Criteria: Hannan-Quinn criterion, ( ) Final Prediction Error criterion, where∑ ( ) is estimated by −1 ∑ =1̂̂a nd * is the total number of parameters in each equation of the model when is the lag order of the endogenous variables.
2.6. Structural Analysis. Despite the fact that VAR coefficients capture the anticipated impact of a variable, there are often a lot of coefficients to interpret. It is usually more common to examine the model's residuals which represent unforeseen contemporaneous events. The next subsections provide relatively nontechnical explanations of some of the common techniques used for structural analysis of VAR models.

Causality Analysis.
Both the Granger-causality and instantaneous causality were investigated. For both tests, the vector of endogenous variables is divided into two subvectors, 1 2 , with dimensions 1 2 , respectively, so that = 1 + 2 . The subvector 1 is said to be Granger-causal for 2 if the past of 1 significantly helps predicting the future of 2 via the past of 1 alone [22]. For testing this property, a model of the form Therefore, this null hypothesis is tested against the alternative that at least one of the 21, is nonzero. An F-test statistic which is distributed as ( 1 2 , − * ) is used for testing the restrictions. Here * is the total number of parameters in the system including the parameters of the deterministic term [23]. The role of 1 2 can be reversed to test Grangercausality from 2 to 1 .
Instantaneous causality is characterized by nonzero correlation of 1 2 . Thus, the null hypothesis is tested against the alternative of nonzero covariance between the two error vectors in testing for instantaneous causality. A Wald test statistic is used to test this hypothesis.

Impulse Response Analysis.
In impulse response analysis, the exogenous and deterministic variables are treated as fixed and may therefore be dropped from the system. The adjusted endogenous variables are now denoted by . If the process is stationary (I(0)), it has a Wold moving average (MA) representation Malaria Research and Treatment 5 where Φ 0 = and the Φ can be computed recursively as with Φ 0 = and A = 0 > . The coefficients of this representation may be interpreted as reflecting the responses to impulses hitting the system. The ( , ) ℎ elements of the matrices Φ , regarded as a function of , trace out the expected response of , + to a unit change in holding constant all past values of .

Forecast Error Variance Decomposition.
Denoting the ( , ) ℎ element of the orthogonalized impulse response coefficient matrix Ψ by , , the variance of the forecast error ( , +ℎ − , +ℎ| ) is The term ( 2 ,0 +⋅ ⋅ ⋅+ 2 ,ℎ−1 ) is interpreted as the contribution of variable to the ℎ-step forecast error variance of variable . Dividing the above terms by 2 (ℎ) gives the percentage contribution of variable to the ℎ-step forecast error variance of variable ,

Mean Absolute Percentage
Error. The accuracy of forecasts generated by the fitted model was evaluated using mean absolute percentage error (MAPE) values.
where is the forecasted value and is the observed value.

Description of the Data.
The data for this study comprises time series of monthly cases of malaria for the study area and some climatic variables including rainfall (Rain), maximum temperature (Tmax), and relative humidity (RH). The climatic data were obtained from the Ghana Meteorological Agency for January 2010 to December 2015 while the malaria data were obtained from the Ghana Health Service, also for January 2010 to December 2015.  Figure 2 which indicates an obvious seasonality in both malaria and the climatic variables considered. The first difference of the data was obtained and plotted in Figure 3 which gives an indication of stationarity in both malaria and the climatic variables. Each variable in the differenced data was tested for the presence of a unit root using a test suggested by Dickey and Fuller [24]. The test, presented in Tables 2(a) and 2(b), rejected the assumption of a unit root for all time series considered, implying that the relationships among the various variables analyzed below are not spurious.

VAR Estimation.
A VAR model of the monthly malaria and climatic variables was estimated with 12 lags for each variable in each equation. Each equation has 4x12 unrestricted coefficients plus one coefficient for a constant and one for a trend. The number of lags was chosen based on four tests: the Final Prediction Error (FPE) test [25], the Hannan Quinne (HQ) test [26], and the Information Criteria suggested by Akaike (AIC) [27] and by Schwarz (SC) [28]. All four tests indicated that as many as 12 lagged monthly values may be sufficient. A lag length of 12 ensures that all the dynamics in the data are captured and are used in this analysis (Table 3).

Granger and Instantaneous Causality
Tests. The instantaneous and Granger-causality tests [22] are simple ways to ascertain whether a particular variable is affected by innovations in other variables. These tests indicate if innovations in one variable help forecast a one-step ahead figure in another variable. An important advantage of these tests is that they are unaffected by the ordering of the VAR system. The test statistics are summarized in Tables 4 and 5. Each column contains the values of F-statistics testing the marginal effect of inclusion of lagged values of the climatic variables in the row on malaria.
Both tests indicate that malaria is influenced by all three climatic variables (relative humidity, rainfall, and maximum temperature). An important caveat about the above F-Tests must be noted. Although they indicate whether other variables Granger-cause malaria, it is still possible that other variables can influence malaria through other equations in the system. For this reason, we turn to the decomposition of the variances of forecast errors.

Decomposition of Variance and Residual Correlations.
Forecast error variance decomposition (FEVD) is popular in interpreting VAR models. Results for the FEVD for both malaria and the climatic variables are presented in Figure 4. The results reveal that on the average, while about 73.67% of the variability in the trend of malaria has been explained by past innovations in malaria cases, a significant proportion (about 12.65%) of the variability in the trend of malaria has been explained by past innovations in maximum temperature. Also, whereas about 11.10% of the variability in the trend of malaria has been explained by past innovations in rainfall figures, only 2.58% of the variability in the trend of malaria has been explained by past innovations in relative humidity.

Impulse
Response. Impulse response analysis was utilized to analyze the dynamic interactions between malaria and the climatic variables of the VAR (12) process. The orthogonal impulse response of malaria to the climatic variables is presented in Figure 5. The response of malaria has an obvious fluctuation; there is a highest positive effect of maximum temperature on malaria in the ninth month (September) and     Figure 6 which is converted to the original level in Table 6. These forecasts  3.7. Forecast Accuracy. The mean absolute percentage error (MAPE) provides an indication of the average size of forecasting error expressed as a percentage of the relevant observed value irrespective of whether that forecasting error is positive or negative [29]. The MAPE for the fitted model is calculated to be 12.95% using (14) which is an indication of potentially very good forecasts by the fitted model [29].

Discussion
In this study, it was discovered that all three climatic variables influence malaria in the Kumasi Metropolitan Area. The highest positive effect of maximum temperature on malaria is in September, whereas the lowest negative effect is in March. Also, the highest positive effect of relative humidity on malaria is observed in March while the highest positive effect of rainfall on malaria is observed in October.
Further results indicate that on the average, while a greater percentage of the variability in the trend of malaria has been explained by past innovations in malaria cases, a significant proportion (about 12.65%) of the variability in the trend of malaria has been explained by past innovations in maximum temperature. Also, about 11.10% of the variability in the trend of malaria has been explained by past innovations in rainfall figures and only 2.58% of the variability in the trend of malaria has been explained by past innovations in relative humidity. This further indicates that the three climatic variables have varying effect on malaria with maximum temperature having a greater effect, followed by rainfall and then relative humidity. Therefore, modelling malaria and the climatic variables together will improve the forecast of malaria. These results are in line with studies conducted by Darkoh et al. [18] and Klutse et al. [19]. The results from the study conducted by Klutse et al. [19] showed that maximum temperature was a better predictor of malaria trends than minimum temperature or precipitation especially in the transition zone, whereas results obtained by Darkoh et al. [18] indicate that rainfall is not significant. However, minimum and maximum temperature are better predictors. Other studies include that of Adu-Prah and Tetteh [12] who discovered that temperature and relative humidity both have influence on malaria. However, the effect of rainfall was found to be less significant. Komen et al. [30] also discovered that temperature was more important in influencing malaria in the Limpopo District. They also discovered that rainfall has an influence on temperature and vice versa. However, the findings in this study contradict those obtained by Asare and Amekudzi [17]. Their results indicated that the intraand inter-agro ecological variability in terms of intensity and duration of malaria transmission are predominantly controlled by rainfall. The variation in results obtained could be due to various reasons such as the study site chosen and the time frame of the study. The VAR (12) model developed can also be used as a predictive model for making forecasts of future malaria cases. The forecasted values obtained indicate an increasing trend of malaria for the first half of 2016 in the Kumasi Metropolitan Area.
One limitation of this study is the nonavailability of data over a longer period of time. For time series models, the rule of thumb is that one should have at least fifty (50) to sixty (60) data points but preferably more than hundred (100) observations [31]. It is therefore suggested that future studies in this area of interest should consider more than hundred data points.

Conclusions
This paper develops and estimates a vector autoregression (VAR) model of the monthly malaria cases and some important climatic variables including rainfall, maximum temperature, and relative humidity in the Kumasi Metropolitan Area. The model is used to investigate the dynamic linkages between malaria and climatic variability. The model is also used to simulate the responses of malaria to innovations in climatic variability.
Results of the Granger and instantaneous causality tests lead to a conclusion that malaria is influenced by all three climatic variables. The impulse response analyses indicate that the highest positive effect of maximum temperature, relative humidity, and rainfall on malaria is observed in the ninth, third, and tenth months, respectively. The decomposition of forecast variance indicates varying degree of malaria dependence on the climatic variables, with as high as 12.65% of the variability in the trend of malaria being explained by past innovations in maximum temperature alone. Results obtained from this study are useful for policy-makers as this will help come up with policies knowing the effects of climatic variability on malaria incidence in the Kumasi Metropolitan Area.

Disclosure
The founding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the

Conflicts of Interest
The authors declare no conflicts of interest.