Hybrid Models Based on Singular Values and Autoregressive Methods for Multistep Ahead Forecasting of Traffic Accidents

The traffic accidents occurrence urges the intervention of researchers and society; the human losses and material damage could be abated with scientific studies focused on supporting prevention plans. In this paper prediction strategies based on singular values and autoregressive models are evaluated for multistep ahead traffic accidents forecasting. Three time series of injured people in traffic accidents collected in Santiago de Chile from 2000:1 to 2014:12 were used, which were previously classified by causes related to the behavior of drivers, passengers, or pedestrians and causes not related to the behavior as road deficiencies, mechanical failures, and undetermined causes. A simplified form of Singular Spectrum Analysis (SSA), combined with the autoregressive linear (AR) method, and a conventional Artificial Neural Network (ANN) are proposed. Additionally, equivalent models that combine Hankel Singular Value Decomposition (HSVD), AR, and ANN are evaluated. The comparative analysis shows that the hybrid models SSA-AR and SSA-ANN reach the highest accuracy with an average MAPE of 1.5% and 1.9%, respectively, from 1to 14-step ahead prediction. However, it was discovered that HSVD-AR shows a higher accuracy in the farthest horizons, from 12to 14-step ahead prediction, which reaches an average MAPE of 2.2%.


Introduction
Traffic accidents with fatalities and severely injured people are a socioeconomic problem of focus.According to the WHO [1], the traffic accidents cost between 1% and 3% of the GDP of a country, regardless of invaluable emotional damage for the victims and their families.Several studies have been developed to explain the nature of the problem, most through classification; diverse methods have been used to detect the key factors that influence the incidents severity.Abellán et al. (2013) used decision trees to extract some key patterns of severe accidents in Granada; the rules were defined in function of the variables related to atmospheric factors, driver characteristics, road conditions, or a factor combination [2].Chang and Chien applied a method based on nonparametric classification and regression tree to establish the empirical relationship between injury severity and driver/vehicle characteristics in truck-involved accidents [3].De Oña et al. (2013) use Latent Class Clustering and Bayesian networks to identify the variables involved in traffic accidents; the accident type and sight distance were detected in all the traffic accidents on rural highways in Granada [4].Recently, Shiau et al. (2015) presented Fuzzy Robust Principal Component Analysis (FRPCA) combined with Backpropagation Neural Network (BPNN) to identify the relationships between the variables of road designs, rule-violation items, and accident types; the results showed an 85.89% classification accuracy [5].Lin et al. (2015) used real time traffic accidents in a highway in Virginia; they found that the best model was based on Frequent Pattern and a Bayesian network; the model predicted 61.1% of accidents while having a false alarm rate of 38.16% [6].The risk indicating variables of traffic accidents have been identified taking into consideration diverse factors; some factors are related to road design parameters [7,8], environment conditions [9], traffic signs, or interactions among some factors [10].
The data provided by the Chilean Commission of Traffic Safety (CONASET) [11] shows a high and increasing rate of fatalities and injuries in traffic accidents from 2000 to 2014.Santiago is the most populated Chilean region, whose time series of injured people is used in this analysis.The abundance of information and research about traffic accidents, severities, risks, and occurrence factors, among others, might appear somewhat unapproachable in terms of the prevention plans.
In this case study, with categorical causes defined by CONASET and the ranking method, the potential causes of injuries in traffic accidents, which can be counteracted through campaigns directed to the change of attitude in drivers, passengers, and pedestrians toward road safety, have been identified.Two groups have been created based on the primary and secondary causes which are present in around 75% of injured people, and a third group was created with the remaining 25%.At first, nonstationary and nonlinear characteristics were found in the time series, turning the forecasting into a difficult task.Some researches exploit the potentialities of the SSA technique to extract components in a time series; SSA commonly has been used to extract trend, seasonality, and/or noise [12]; the extracted components are used to explain the complex behavior of some time series in diverse ambit, from nature [13,14] to industrial process [15], or economic indicators [16].
On the other hand, the combination of SSA and autoregressive linear and nonlinear methods is a recent alternative which has demonstrated robustness and universal capacities in terms of short-term forecasting [17][18][19].SSA combined with artificial intelligence techniques was used by Xiao et al. (2014) for monthly air transport demand forecasting [20]; a similar combination was made by Abdollahzade et al. (2015) with a nonlinear and chaotic time series [21].
To our knowledge [22], one-step ahead forecasting based on HSVD combined with ARIMA and neural networks reaches high accuracy in traffic accidents prediction.HSVD has similarities with respect to SSA in the steps of embedding, decomposition, and grouping.The main difference is in the last step previous to the component extraction; HSVD avoids diagonal averaging; instead, a particular extraction process is proposed.Although this difference exists, in terms of computational complexity both SSA and HSVD have equal floating point operations, which is due to the Rbidiagonalization (RSVD) [23] implemented in both cases.For that reason, comparisons in terms of implementation and forecasting results are presented.This forecasting approach is described in two stages, preprocessing and prediction.In the first stage Singular Spectrum Analysis and Singular Value Decomposition (SVD) of Hankel are implemented to obtain the components of low and high frequency from the observed time series; an additive component of low frequency is extracted, whereas the component of high frequency is computed by subtraction.The result is a pair of smoothed time series which can be predicted robustly by computing low-order autoregressive methods in the second stage.The direct method is applied in the second stage to develop multistep ahead prediction, with multi-input and single-output.The inputs are the components lagged values, whose optimal number is identified through the Autocorrelation Function.
Conventional SSA is put into practice in four steps, embedding, decomposition, grouping, and diagonal averaging over all the elementary matrices [24].In this work, the SSA implementation is simplified in three steps: embedding, decomposition, and diagonal averaging; only one elementary matrix is needed and it is computed with the first SVD eigentriple.The time series of length  is embedded in a trajectory matrix of dimension  × , where  is the window length.The general rule used to delimit the windows length is 2 ≤  ≤ /2; a large decomposition is given with a high value of , while a short decomposition is the opposite.During the literature review, some strategies have been used to select the window length; some instances are  = /4 [25], weighted correlation [26], and extreme of autocorrelation [27].The method used in this work to select the optimal window length is based on the Shannon entropy of the singular values.The second step of SSA is Singular Value Decomposition of the matrix obtained in the embedding; with the first eigentriple an elementary matrix is computed.Finally, by diagonal averaging over the elementary matrix, the elements of low frequency component are extracted.
The contribution is an accurate multistep ahead forecasting methodology based on singular values decomposition and autoregressive models through the comparison of four hybrid models.The prediction is focused on causes of the traffic accidents with injured people, which is oriented to support prevention plans of the government and police.The paper is organized as follows.Section 2 describes the Methodology.Section 3 shows efficiency criteria to evaluate the prediction accuracy.Section 4 characterizes the Case Study.Section 5 presents the Empirical Research Result.Finally Section 6 concludes the paper.

Methodology
Initially, the ranking technique is applied to find the potential causes of at least 75% of the events registered in the historical time series of injured people in traffic accidents in Santiago de Chile.The causes related to drivers, pedestrians, and passengers behavior were prioritized.
The forecasting methodology applied in the analyzed hybrid models is described in two stages, preprocessing and prediction, as Figure 1 illustrates.In the preprocessing stage Singular Spectrum Analysis and Singular Value Decomposition of Hankel are used to extract an additive component of low frequency from the observed time series, and by simple subtraction between the observed time series and the component of low frequency, the component of high frequency is obtained.In the prediction stage, linear and nonlinear models are implemented.Conventional SSA is implemented in four steps, embedding, decomposition, grouping, and diagonal averaging [24].In this work, SSA is simplified in three steps: embedding, decomposition, and diagonal averaging.
The embedding step maps the time series  of length  to a sequence of multidimensional lagged vectors; the Hankel matrix structure is used in the embedding: where the elements (, ) =  +−1 .The window length  has an important role in the forecasting model; it has an initial value of  = /2, and  is computed as follows: The Singular Value Decomposition of the real matrix  has the form where each   is the th eigenvalue of the matrix  =  ⊤ arranged in decreasing order of magnitudes. 1 , . . .,   is the corresponding orthonormal eigenvectors system of the matrix .
Standard SVD terminology calls √  the th singular value of the matrix ;   is the th left singular vector and   is the th right singular vector of .The collection √      is called th eigentriple of the SVD.
The computation of the optimal window length  is based on the eigenvalues differential entropy.The first  eigenvalues obtained with  = /2 contain a high spread of energy; therefore, the window length is evaluated in the range [ = 2, . . ., ] by means of the differential entropy as follows: where Δ  is the th differential entropy,   is the th Shannon entropy, and   is the th normalized eigenvalue also known as eigenvalue energy.The embedding step is executed again with this decomposition that reaches a high energy spread and lower differential entropy.With the optimal  the embedding and decomposition are computed again.
The first eigentriple is used to obtain the elementary matrix , which will be used in the extraction of the low frequency component: The step of diagonal averaging is applied over  to extract the elements of the component   ; the process is shown below: Once   is obtained, the component   is computed with (1).

Hankel Singular Value Decomposition. The preprocessing based on Singular
Value Decomposition of Hankel is implemented in three steps: embedding, decomposition, and extraction.HSVD implements the steps of embedding and decomposition as SSA (presented in Section 2.1).The elementary matrix  is also computed with the first eigentriple obtained in the decomposition step (as ( 6)).
In the extraction step, the elements of the low frequency component   are obtained from the first row and the last column of the matrix , which has the same structure as matrix  (trajectory matrix); therefore, the elements of   are where  is a  ×  matrix.

Prediction with the Autoregressive Method.
The prediction is the second stage of the traffic accidents forecasting methodology (illustrated in Figure 1).In order to obtain the traffic accidents prediction x, during the preprocessing stage the low frequency component   and the high frequency component   were obtained.The components are estimated through the autoregressive method and the addition of the components is computed to obtain the prediction as follows: where  represents the time instant and ℎ represents the horizon, with values ℎ = 1, . . ., .The component   is used as exogenous variable in the computation of the   , due to a high influence of   over   .
The predicted components via AR model are defined with where  is the number of lagged values and   and   are the coefficients of   and   , respectively.The coefficients estimation is based on linear Least Square Method (LSM); the components   and   are defined with the linear relationship expressed in matrix form: where  and  are the regressor matrices of   and   , respectively;  and  are the coefficients vectors of  and , respectively.The coefficients are computed with the Moore-Penrose pseudoinverse matrices,  † and  † , as follows:

Prediction with the Autoregressive Neural Network. In this case study, a single hidden layer Autoregressive Neural
Network is used to approach each component; the ANN has a standard multilayer perceptron (MLP) structure of three layers [28].The training subset is iteratively used to adjust the connections weights via learning algorithm; the ANN with the lowest error is selected to implement the solution with the testing subset.The nonlinear inputs are the lagged terms, which are contained in the regressor matrix ; at hidden layer is applied the sigmoid transfer function, and at output layer is obtained the prediction.The ANN output is where x is the predicted value,  is the time instant, V  and   are the linear and nonlinear weights of the ANN connections, respectively; the sigmoid transfer function is computed with The ANN structure for   prediction is denoted with ANN(, , 1), with  inputs,  hidden nodes, and 1 output ĉ while the ANN structure for   is denoted with ANN(2, , 1), with 2 inputs,  hidden nodes, and 1 output ĉ .Levenberg-Marquardt is the learning algorithm applied for weight updating in both neural networks [29].

Efficiency Criteria
The forecasting accuracy is evaluated with conventional metrics and an improved evaluation metric.The conventional metrics are Mean Absolute Percentage Error (MAPE), Root Mean Square Error (RMSE), Determination Coefficient  2 , and Relative Error (RE).The Modified Nash-Sutcliffe Efficiency (MNSE) metric is computed in order to improve the evaluation criteria, which is sensitive to significant overfitting or underfitting [30].
where  is the observed signal, x is the predicted signal,   is the testing sample size, and var is the variance.Furthermore, two statistical tests are computed to evaluate the differences and superiorities of either model, the Wilcoxon test and Pitman's correlation test, respectively.
The Wilcoxon () signed rank test evaluates the pairwise differences in the squares of each multistep ahead residuals; the differences are ranked in ascending order, with no regard to the sign, and the ranks are assigned from one to the number of the forecast errors available for comparison.The sum of the ranks of positive differences is then computed to obtain  [31].The probability  of finding a test statistic as or more extreme than the observed value under the null hypothesis is found using the -statistics given by Pitman's correlations test is applied to identify the superiority of a model in pairwise comparisons [32]; the test is based on the computation of the correlation  between Υ and Ψ as follows: where cov is the covariance and  = 1, .

Case Study
The Chilean police (Carabineros de Chile) collects the features of the traffic accidents, and CONASET records the data.The regional population is estimated in 6,061,185 inhabitants, equivalent to 40.1% of the national population.Santiago shows a high rate of the events with severe injuries from 2000 to 2014, with 260,074 injured people.The entire series of 783 registers is shown in Figure 2, which have been collected in the mentioned period from January to December, with weekly sampling.The highest number of injured people was observed in weeks 84, 184, 254, and 280, while the lowest number of injured people was observed in weeks 344, 365, 426, 500, and 756.
One hundred causes of traffic accidents have been defined by CONASET and grouped into categories.In this case study, three analysis groups have been created.In groups 1 and 2, those causes directly related to behavior of drivers, passengers, or pedestrians have been prioritized.Group 3 involves the rest of the causes.Figures 3(a), 4(a), and 5(a) show the observed time series of the groups, Injured-G1, Injured-G2, and Injured-G3, respectively; the values have been normalized via division by the maximum value in each time series.
The categories of groups 1 and 2 are as follows: reckless driving, recklessness in passenger, recklessness in pedestrian, alcohol in driver, alcohol in pedestrian, and disobedience to signal.In Table 1 are shown 20 causes of groups 1 and 2, which cover 75% of the events with injured people.The causes are listed sequentially; the cause with the highest importance has value 1 (with the highest number of injured people), and the cause with minor importance has value 20.
From Table 1, the first three causes of injuries in traffic accidents are as follows: unwise distance, inattention to traffic conditions, and disrespect to red light.The cause with the lowest importance for injuries in traffic accidents is drunk pedestrian.The categories with the highest number of causes are as follows: imprudent driving and disobedience to signal.
Two groups of analysis were formed from the information presented in Table 1; the first group labeled with Injured-G1 corresponds to the first ten primary causes, and the second group labeled with Injured-G2 corresponds to the ten secondary causes.The first group overspreads around 60% of injured people in traffic accidents, whereas the second group overspreads around 15%.The third group, labeled with Injured-G3, corresponds to the categories road deficiencies, mechanical failures, and undetermined/noncategorized causes; this group overspreads around 25% of injured people.
Complementary information was observed about traffic accidents conditions with high rate of injured people.With regard to vehicles, automobiles are involved in 54% of events, followed by vans and trucks with 19%, bus and trolley with 16%, motorcycles and bicycles with 8%, and others with 3%.With regard to environmental conditions, 85% of events was observed with cloudless conditions.Additionally, 97% of the traffic accidents with injured people have taken place in urban areas, whereas 3% correspond to rural area.With regard to relative position, 46% of the events have been produced in intersections controlled by traffic signals or police officers with 46%, followed by accidents that happened in straight sections with 37%, and other relative positions with 17%.

Empirical Research Result
The results of the methodology implementation with linear and nonlinear models are described by stages: components extraction and prediction.

Components Extraction. The methodology presented in
Section 2 describes the preprocessing stage and the prediction stage.The preprocessing stage is based on two types of methods, Singular Spectrum Analysis and Singular Value Decomposition of Hankel.
Both preprocessing techniques SSA and HSVD embed the time series in a structure of two dimensions; the initial window length used is  = /2.Once matrix  is obtained, the decomposition is computed.The differential energy of the eigenvalues is obtained with (5a).A high energy content was observed in the first  = 20 eigenvalues; the lowest differential energy was observed in decompositions based on values of 15, 17, and 16 of window length, for Injured-G1, Injured-G2, and Injured-G3, respectively, and these values were set as window length.
The embedding process is implemented again with the optimal window , and the decomposition is recomputed.The first elementary matrix  is used by SSA and HSVD to   The nonstationary trend in the signals was verified for Injured-G1, Injured-G2, and Injured-G3 through Kwiatkowski, Phillips, Schmidt, and Shin test (KPSS) [33].The test assesses the null hypothesis that the signals are trend stationary; it was rejected at 5% significance level; consequently, nonstationary unit-root processes are present in the signals.
The time series Injured-G1 is observed in Figure 3(a); similar dynamic is observed with respect to full series, taking into consideration that this group contains the predominant causes.The   components extracted by SSA and HSVD are shown in Figure 3(b); long-memory periodicity features were observed.The   resultant components are presented in Figure 3(c); short-term periodic fluctuations were identified.The components obtained with SSA and HSVD are similar; however, a slight difference is observed in the components of low frequency; SSA extracts smoother components than HSVD.
Both techniques SSA and HSVD show that the principal ten causes (in Table 1 with importance 1 to 10) present the highest incidence between years 2002 and 2005 (weeks 106 to 312); it descends from 2006 until half 2012 (around week 710); an increment is observed between weeks 711 and 732 (second semester of 2013 and first semester of 2014).
Figure 4 shows the time series Injured-G2, the low frequency component, and the high frequency component.As previous series, the components of low frequency show similar dynamic of slow fluctuations with decreasing trend (  ), while the high frequency shows fast fluctuations.In this group is also observed   via SSA smoother than   via HSVD.Both techniques SSA and HSVD show that the secondary ten causes (described in Table 1 with importance 11 to 20) present the highest incidence of injured people in years 2000, 2003, and 2004; it descends from 2005; therefore, forward downtrend is observed in the number of injured people in traffic accidents due to the 10 secondary causes.
Figure 5 shows Injured-G3 and its components   and   ; as in previous analysis the components of low frequency show long-memory periodicity features, whereas the components of high frequency show short-term periodic fluctuations, and   via SSA is smoother than   via HSVD.
Both techniques SSA and HSVD show that the causes are related to road deficiencies, mechanical failures, and undetermined/noncategorized causes (with 25% of incidence).The highest incidence is observed in year 2001, from year 2002 it presents strong decay until 2006, and forward uptrend is observed with a temporal decrease in 2009.
Prevention plans and punitive laws have been implemented in Chile during the analyzed period, via education, drivers licensing reforms, zero tolerance law, Emilia's law, and transit law reforms, among others.The effect of a particular preventive or punitive action is not analyzed in this work; however, the proposed short-term prediction methodology based on observed causes and intrinsic components is a contribution to government and society in preventive plans formulation, its implementation, and the consequent evaluation.

Prediction. The prediction is implemented by means of the autoregressive models, linear (AR) or nonlinear (ANN).
The models use the lagged terms of the components   and   ; the optimal number of the lagged terms was fixed in  = 32 weeks, which was found through the computation of the Autocorrelation Function over the observed time series of injured people due to all causes.
The models based on Singular Spectrum Analysis (SSA-AR and SSA-ANN) receive the components of SSA preprocessing stage, whereas the models based on Singular Value Decomposition of Hankel (HSVD-AR and HSVD-ANN) receive the components of HSVD preprocessing stage.
The ANN has single hidden layer structure (32, 1, 1), with 32 inputs, 1 hidden node, and 1 output.The LM algorithm was used iteratively to adjust the linear and nonlinear weights.
The direct method was used to develop multistep ahead forecasting; in Tables 2 and 3 are presented the average prediction results with hybrid models SSA-AR, SSA-ANN, HSVD-AR, and HSVD-ANN with the three time series.The arithmetic mean of the resultant metrics is presented in Tables 2 and 3; the results shows that the accuracy decreases as the time horizon increases; therefore, the best accuracy was obtained for the nearest weeks, and the lowest accuracy was obtained for the farthest weeks.The best mean accuracy was reached by using SSA-AR model, with MNSE of 92.6%,  2 of 99.3%, MAPE of 1.5%, and RMSE of 0.7%.The lowest mean accuracy was obtained with HSVD-ANN model, with MNSE of 85.6%,  2 of 98.2%, MAPE of 2.9%, and RMSE of 1.4%.The second best average accuracy was reached by SSA-ANN, and the third best accuracy was reached by HSVD-AR.
The highest gain in average MNSE from 1-to 14-step ahead prediction is 7.6%, while average MAPE is 93.3%.However, it was observed that HSVD-AR shows a higher accuracy in farthest horizons, from 12-to 14-step ahead prediction, which reaches these average results: MNSE of 89.4%,  2 of 98.9%, MAPE of 2.2%, and of RMSE 1.0%; the gain in average MNSE from 12-to 14-step ahead prediction is 12.1%, and on average MAPE is 83.3%.
Prediction horizons higher than 14 weeks provide inaccurate results.
From previous tables, similar accuracy was identified in the prediction through SSA-AR, HSVD-AR, and SSA-ANN, for 11-step ahead prediction.The predicted signals are shown in Figures 6, 7, and 8, whereas the metrics residuals are presented in Tables 4, 5, and 6.
The results for 11-step ahead prediction of Injured-G1 are shown in Figures 6(a), 6(c), 6(e), and Table 4. From these figures and metrics, a good fit is observed; the highest accuracy was reached via SSA-AR with MNSE of 87.6%,  2 of 98.6%, MAPE of 2.0%, and RMSE of 1.1%.
In Figures 6(b), 6(d), and 6(f), the Relative Error of Injured-G2 prediction is shown.The model SSA-AR shows 94.7% of the predicted points with Relative Error lower than ±5%, HSVD-AR 86.7%, and SSA-ANN 83.1%.The gain was computed by means of residual metrics and the two best models (SSA-AR and HSVD-AR); the highest gain was observed in MAPE with 25%.
The results for 11-step ahead prediction of Injured-G2 are shown in Figures 7(a), 7(c), and 7(e) and Table 5; all models achieve a good fit; SSA-AR and HSVD-AR reach the highest and similar accuracy.In Figures 7(b), 7(d), and 7(f), the Relative Error is shown; SSA-AR presents 85.8% of the predicted points with a Relative Error lower than ±5%, HSVD-AR 85.3%, and SSA-ANN 80%.The gain was computed based on residual metrics and the two best models; the highest gain was observed in MAPE with 10.7%.
The results for 11-step ahead prediction of Injured-G3 are presented in Figures 8(a), 8(c), and 8(e) and Table 6; all models achieve also a good fit and similar accuracy.In Figures 8(b), 8(d), and 8(f), the Relative Error is illustrated; SSA-AR presents 92.4% of predicted points with a Relative Error lower than ±5%, HSVD-AR 94.2%, and SSA-ANN 91.6%.The gain was computed based on residual metrics and the two best models; the highest gain was observed in RMSE with 7.7%.
In the next section the differences and/or superiorities of either linear model SSA-AR or HSVD-AR are identified through the application of the statistical tests.

Models Statistical Tests.
The performance of the linear hybrid models SSA-AR and HSVD-AR is evaluated with the Wilcoxon hypothesis test and with Pitman's correlations test (( 16)-(17a), (17b), and (17c)).The Wilcoxon hypothesis test results are shown in Table 7, and Pitman's correlation test results are shown in Table 8.
From Table 7, in 37 comparisons between residuals of SSA-AR and HSVD-AR, the test rejects the null hypothesis that there is no difference in the prediction at 5% significance level.In the remaining 5 comparisons the null hypothesis that there is no difference in the prediction is accepted.In this case, there is no difference in 12-step ahead prediction for time series Injured-G1; the same situation was found in 10and 11-step ahead prediction for time series Injured-G2 and Injured-G3.Pitman's correlations test is applied with the residual values to identify the superiority of SSA-AR over HSVD-AR or the opposite.The correlations between Υ and Ψ are shown in Table 8.
The null hypothesis of Pitman's correlation is true at 5% significance level if || > 1.96/√  , where   = 225 testing samples.The results of the correlations are shown in Table 8.From Table 8, the results present similarities with respect to Wilcoxon test.In 5 predictions there is no superiority of either

Conclusions
In this paper has been developed multistep ahead traffic accidents forecasting approach based on singular values and autoregressive models.The nonstationary and nonlinear time series of injured people in traffic accidents of Santiago de Chile was used.Before the models methodology stages, ranking was applied to detect the relevant causes of injuries in traffic accidents; causes related to behavior of drivers, pedestrians, or passengers are predominant.Unwise distance, inattention to traffic conditions, and disrespect to red light are the first important causes of injuries in traffic accidents in concordance with previous studies that determine disrespect towards the road signs as a principal cause of traffic accidents.Complementary information was observed about traffic accidents conditions with high rate of injured people, automobiles type, environmental conditions, and relative position, among others.This approach was described in two stages, preprocessing and prediction; in the first stage two methods for components extraction were developed, Singular Spectrum Analysis and Singular Value Decomposition of Hankel, whereas in the second stage the linear autoregressive model and an Autoregressive Neural Network with Levenberg-Marquardt algorithm were used.
Four hybrid models were implemented: SSA-AR, HSVD-AR, SSA-ANN, and HSVD-ANN.The models were evaluated for 14-week ahead forecasting; comparative analysis shows that the proposed models SSA-AR and SSA-ANN achieved the highest accuracy with an average MNSE of 92.6% and 90.3%, respectively; the highest gain in average MNSE achieved by SSA-AR is 7.6%.However, it was observed that HSVD-AR shows a higher accuracy in farthest horizons from 12 to 14 steps, which reaches an average MNSE of 89.4%; in this case the highest gain achieved by HSVD-AR in MNSE is 12%.
The statistical tests application through Wilcoxon and Pitman has shown that SSA-AR is superior to HSVD-AR in 30 of 42 comparisons of resultant efficiency criteria (at nearest horizon) at 5% significance level, 5 comparisons show equivalence, and 7 comparisons show the superiority of HSVD-AR over SSA (at farthest horizon).
In further works, more strategies of components extraction will be explored; spectral analysis could help to explain the nature of traffic accidents in other geographic zones.Detailed work focused on the causes of traffic accidents will be done to support prevention plans aimed at promoting good habits on roads and highways.

Figure 2 :
Figure 2: Injured people in traffic accidents.
. .,   . 1 is the residual vector obtained with model 1, and  2 is the residual vector obtained with model 2. Model 1 would show superiority in front of model 2 at 5% significance level if || > 1.96/√  .

Table 1 :
Causes of injuries in traffic accidents (group 1 and group 2).
model (SSA-AR and HSVD-AR).SSA-AR shows superiority with respect to HSVD-AR in 30 predictions (when || > 0.123 for nearest horizons), whereas the opposite is observed