Intelligent Optimized Combined Model Based on GARCH and SVM for Forecasting Electricity Price of New South Wales, Australia

Daily electricity price forecasting plays an essential role in electrical power system operation and planning. The accuracy of forecasting electricity price can ensure that consumers minimize their electricity costs and make producers maximize their profits and avoid volatility. However, the fluctuation of electricity price depends on other commodities and there is a very complicated randomization in its evolution process. Therefore, in recent years, although large number of forecasting methods have been proposed and researched in this domain, it is very difficult to forecast electricity price with only one traditional model for different behaviors of electricity price. In this paper, we propose an optimized combined forecasting model by ant colony optimization algorithm (ACO) based on the generalized autoregressive conditional heteroskedasticity (GARCH) model and support vector machine (SVM) to improve the forecasting accuracy. First, both GARCH model and SVM are developed to forecast short-term electricity price of New South Wales in Australia. Then, ACO algorithm is applied to determine the weight coefficients. Finally, the forecasting errors by three models are analyzed and compared. The experiment results demonstrate that the combined model makes accuracy higher than the single models.


Introduction
It is a challenging task and significant role to forecast electricity price in competitive electricity market. However, electricity price has distinct characteristics from other commodities, which is due to its nonlinearity, nonstationarity, time variance, and uncertain bidding strategy of the market participants. All these characteristics can be attributed to the following reasons, which distinguish electricity from other commodities: (i) nonstorable nature of electrical energy, (ii) the requirement of maintaining constant balance between demand and supply, (iii) inelastic nature of demand over short time period, and (iv) oligopolistic generation side [1]. According to an accurate daily price forecasting in the electricity market, the power suppliers can reduce the cost of electricity production, optimize the allocation of resources, reduce the uncertainty of production, maximize profits, and ensure the dominant position in the market competition. At the same time, the consumers can also make a plan to maximize their utilities and minimize their costs using the electricity purchased from the pool or using self-production to protect themselves against high prices.
In the current power forecasting researches, the forecasting of electricity demand and price has emerged as one of the major research fields in electrical engineering [2]. A lot of researchers and academicians are engaged in the activity of developing tools and algorithms for load and price forecasting [1]. Whereas load forecasting has reached advanced stage of development and load forecasting algorithms with mean absolute percentage error (MAPE) below 3% are available [3,4], price-forecasting techniques, which are being applied, are still in their early stages of maturity. Although a few attempts have already been made in this direction, only qualitative aspects of price forecasting have been addressed. So the importance and complexity of electricity price forecasting motivate many researches in this area, especially in the recent years [5].
The electricity price prediction can be classified into two categories. One is the detailed market simulation that requires plenty of market information. The most popular approach is the artificial neural network (ANN) technique. ANN technique which possesses excellent robustness and error tolerance is an effective way to solve the complex nonlinear mapping problem. But the ANN contains a great many parameters. These parameters are always judged by experience, so the model is hard to be established [6]. Besides, it has been observed that while the neural network (NN) gives small error for training patterns, the error for testing patterns is usually of larger order [7]; in other words, when this method is applied to practical system, the accuracy is not good.
The other forecasting technology refers to some mathematical approaches without a thorough market modeling, but which attempt to discover the relation between some known inputs and the electricity price. Arciniegas and Arciniegas Rueda [8] applied a Takagi-Sugeno-Kang (TSK) fuzzy inference system to forecast the one-day-ahead real-time peak price of the Ontario Electricity Market. TSK's improvements in forecasting with respect to ANN and ARMAX are above 10%, and it has considerable value to forecast one-day-ahead peak price. Fuzzy system does not need to establish a precise mathematical model, but its adaptive capacity is limited, and a steady-state error may cause oscillations. Lei and Feng [9] proposed a novel grey model to forecast short-term electricity price for Nord Pool, California, and Ontario power markets. The experiment results showed that the forecasting error has decreased 1%∼6% compared with other grey models. Although grey theory needs very little data, it is more suitable to forecast linear data than nonlinear data. Wang et al. [10] used a combined model based on seasonal adjustment and chaotic theory to forecast electricity price of Austria power market. A phase space was reconstructed from the time series representing the data's chaotic characteristics. Particle swarm optimization (PSO) algorithm was also employed to determine the parameters of chaotic system. The forecasting performance illustrated that the combined model is better than single models. The most popular approach is the time series algorithm; stationary time series models such as autoregressive (AR) [11], dynamic regression and transfer function [12][13][14], and autoregressive integrated moving average (ARIMA) and nonstationary time series models like generalized autoregressive conditional heteroskedastic (GARCH) have been proposed for this purpose. Specifically, Contreras et al. [15] applied ARIMA model to predict the next-day electricity price of the Spanish electricity market. However, when facing a particularly big fluctuation time sequence, especially heteroscedastic time series prediction whose variance changes over time, ARIMA model does not work well. Comparing with the traditional ARIMA model, GARCH model can commendably predict the condition heteroscedastic time series, so it is widely applied in the field of finance. Garcia et al. [16] presented GARCH model to forecast hourly prices in the electricity markets of Spain and California.
Although time series models like ARIMA and GARCH are nonlinear predictors that can meet the condition of power price, behavior of price signal may not be completely captured by the time series techniques. To solve this problem, some other artificial methods can be proposed. Artificial intelligence (AI) approaches such as neural network and support vector machine (SVM), which have been successfully applied in load forecast, are also suitable for price forecast [17]. Unlike most of the traditional neural network models which implement the empirical risk minimization principle, SVM adopts the structural risk minimization principle which seeks to minimize an upper bound of the generalization error rather than minimize the training error [18]. SVM possesses a concise mathematical form and good generalization ability; it can well solve the practical problems of the small sample size, nonlinearity, high dimension, and local minimum point.
For the forecasting models, single model has its own independent information of the electric power system, and the proper selection of the individual model can lessen the systemic information loss. Although traditional single time series models like AR, MA, and ARMA are suitable to forecast stable and linear sequences and ARIMA model can be used to forecast nonstationary and nonlinear time series, the forecasting performance is not satisfied. In addition, the fluctuation of electricity price depends on many complicated random factors in its evolution process, and the combined forecasting model can make full use of the characteristic of single models and reduce the sensitivity of the poorer prediction method. Therefore, an optimized combined model by ant colony optimization algorithm (ACO) based on GARCH model and SVM model is proposed to predict electricity price. Section 2 introduces the combined forecasting model theory, GARCH model, SVM model, and an intelligent optimization algorithm called ACO. In Section 3, a case study about forecasting electricity price of New South Wales in Australia is demonstrated. Correspondingly, GARCH model and SVM model are utilized to forecast electricity price. Then, ACO algorithm can determine the weight coefficients. The results have shown that the optimized combined method is more reliable than the individual forecasting models. Section 4 concludes this study.
So, it is still an essential need to find more accurate and robust approaches for daily electricity price prediction and an overall assessment of the price-forecasting algorithms is still required.

The Optimized Combined Forecasting Model by ACO
Algorithm. The combined forecasting theory states that if there exist kinds of models to solve a certain forecasting problem, with properly selected weight coefficients, several forecasting methods' results can be added up. Assume that ( = 1, 2, . . . , ) is the actual time series data, is the number of sample points, and | ( = 1, 2, . . . , ) is the weight coefficient for the th forecasting model, the mathematical model of the combined forecasting model can be expressed as Abstract and Applied Analysis 3 wherêis the estimated value of and̂is the combined forecasting value. Determination of the weight coefficients for each individual model is the key step in constructing of a combined forecasting model. This can be achieved by solving an optimization problem which minimizes the mean absolute percentage error (MAPE) for the combined model. This objective function using can be expressed as The optimization process can employ ACO algorithm to minimize objective function .

GARCH
Model. The ARCH model was initially introduced by Engle [19], in order to account for the presence of heteroscedasticity in economic and financial time series. In an ARCH (q) process, the volatility at time is a function of the observed data at − 1, − 2, . . . , − . But in the practical application, the ARCH model often needs a very long condition variance equation, and in order to avoid negative variance parameter estimation, it often needs to forcibly demand a fixed hysteresis structure [20]. On this occasion, in order to make ARCH model have long-term memorizing ability and more flexible lag structure, it is essential to extend ARCH model. Later, Bollerslev [20] introduced the generalized ARCH (GARCH) process, where conditional variance not only depends on the squared error term, but also depends on the previous conditional variance [21]. A GARCH process of orders and , denoted as GARCH (p, q), (GARCH model is used in EVIEWS soft) can be described as follows [22]: where > 0, 0 > 0, and ≥ 0 for = 1, 2, . . . and ≥ 0 for = 1, 2, . . . . Again, the conditions 0 > 0, ≥ 0, and ≥ 0 are needed to guarantee that the conditional variance ℎ ≥ 0. This study uses GARCH (1, 1) model and the (1, 1) in parenthesis indicates that one length of ARCH log( 1 ) and one length of GARCH log( 1 ) are used.

Support Vector Machine (SVM).
According to the statistical learning theory and development, SVM is based on structural risk minimization (SRM) principle to minimize the generalization error; the general view is to minimize the training error and at the same time minimize the model complexity, which has become the cornerstone of modern intelligent algorithm [23]. The characteristics of SVM make it a good candidate model to apply in predicting defect-prone modules as such conditions are typically encountered. SVM principle is as follows: given the training sample {( , ) : ∈ , ∈ {−1, +1}} =1 , then the two-class pattern recognition problem can be cast as the primal problem of finding a hyperplane: T + = 1, where is a d-dimensional normal vector, such that these two classes can be separated by two margins both parallel to the hyperplane; that is, for each , = 1, 2, . . . , where ≥ 0, = 1, 2, . . . , , are slack variables and is the bias. This primal problem can easily be cast as the following quadratic optimization problem [24]: where = ( 1 , 2 , . . . , ).
The objective of a SVM is to determine the optimal w and optimal bias b such that the corresponding hyperplane separates the positive and negative training data with maximum margin and it produces the best generation performance. This hyperplane is called an optimal separating hyperplane [25].

Ant Colony Optimization Algorithm (ACO).
The ant colony optimization (ACO), developed by Dorigo et al. [26,27], is a metaheuristic method that aims to find approximate solutions to optimization problems. The original idea behind ant algorithms came from the observations of the foraging behavior of ants and stigmergy. Stigmergy is a term that refers to the indirect communication amongst a selforganizing emergent system by individuals modifying their local environment [28]. The detailed steps of ACO algorithm [29] can be described as follows.
Step 1 (initializing some parameters). The algorithm starts by initializing some specific variables such as the maximum of allowed iterations NCHO and the number of ants ANT and the initial point which is randomly selected.

Abstract and Applied Analysis
Step 3 (initializing the pheromone concentration). Each ant has pheromone, and the initial pheromone concentration of jth ant in ith subregion is defined as = (− ( )) = 1, 2, . . . , part, = 1, 2, . . . , , where represents the location of the jth ant in ith subregion and ( ) is the objective function. If the pheromone is greater, the function value is smaller.
Then based on (10), the pheromone of the global elite ant (the global optimal ant) can be got Step 5 (updating the ant's location of each group). If | − | ≥ (| max − min |/10), then If | − | < (| max − min |/10), then let = | − | where in this generation ( ∈ [1, ]) is the location of the elite ant, is the pheromone of the elite ant, ( ∈ [1, ]) is the location of the common ant, and is the pheromone of the common ant.

Evaluation Criterion of Forecasting Performance.
Two loss functions can be served as the criteria to evaluate the prediction performance relative to electricity price value, including mean absolute error (MAE) and mean absolute percentage error (MAPE); the forecasting effect is better when the loss function value is smaller. The two loss functions are expressed as follows: where and̂represent actual and forecasting electricity price at time and the value of in our study is 48.

Simulation and Analysis of Results.
The proposed optimized combined method is tested using a case study about forecasting electric price of New South Wales, Australia. The detailed forecasting procedure can be seen in Figure 1. The electricity price data were collected on a half-hourly basis (48 data points per day, starting from 0:00 AM to 23:30 PM) for 5 Mondays of electricity price values from February 12, 2012, to March 11, 2012, to predict 1 Monday of electricity price in March 18, 2012. The actual electricity price data can be seen in Figure 2. GARCH model is operated in EVIEWS soft to forecast electricity price series of New South Wales. Before doing work, this paper judges whether we can apply GARCH (1, 1) model by ARCH Lagrangian Multipliers Test (ARCH LM Test). At the beginning, the least square method is used to estimate electricity price data; then, ARCH test is performed over the residuals by observing the values of the F-statistics, which is not strong. Figure 1 illustrates these procedures. Let the confidence level be set to 0.05. If probability P is less than 0.05, the residual sequences will have ARCH effect. In other words, it is suitable to use GARCH model. Specifically, the results of the ARCH test are presented in Table 1 and it is found that GARCH (1, 1) model can be applied to forecast electricity price.
The SVM model, which is skilled in dealing with small simple and nonlinear data, will be used in the next step. The number of actual data is 240 (5 Mondays), so the number of is 240 and the detailed values of are seen in Figure 3. The value of the bias = −0.2874. After simulation, the forecasted values can be presented in Table 2.
However, single traditional model has some limitations which cannot present the characteristics of data well. Therefore, many combined models usually are used to forecast electricity price in power system. In this study, an optimized combined model based on GARCH and SVM can be proposed. Correspondingly, ACO algorithm is presented to optimize and determine the weight coefficients. In ACO algorithm, the experiment uses some parameters as follows: = 1, = 36, = 100, = 0.5, and = 0.5. In fact, we are not only interested in simulation of ant    colonies, but also in the use of artificial ant colonies as a one-dimensional optimization tool. The objective function is Its aim is to optimizêso as to minimize the objective function . The algorithm would stop when the number of the maximum iteration is 100. Through simulation and calculation, we can get̂1 = 0.8058,̂2 = 0.1942. So the estimated values of the combined model are written aŝ= 0.8058 ×̂1 + 0.1942 ×̂2.
The concrete values can be seen in Table 2.
In addition, we produce whisker plot with three boxes which have lines at the lower quartile, median, and upper quartile values of prediction electricity price value by GARCH (1, 1), SVM model, and the optimized combined model in Figure 4. It is easy to find that each of the boxes includes a notch in the position of the median value. The    Table 2. It is clear that MAE and MAPE using GARCH (1, 1) model or SVM model are higher than the combined model. Although the forecasted values of SVM model are close to the actual values before 7:00, the differences are great large after 7:00. Also, it has been observed that the proposed optimized combined model leads to 0.18 $AU reductions in total mean MAE and 0.67% reductions in total mean MAPE, respectively, in comparison with traditional individual GARCH (1, 1) model and results in 1.8 $AU reductions in total mean MAE and approximately 7% reductions in total mean MAPE, respectively, compared to the single SVM model. Consequently, the results obtained from the optimized combined model agree with the actual electricity price exceptionally well. In other words, the forecasting model using optimized combined model can yield better results than using GARCH (1, 1) and SVM model.

Conclusions
The development of industries, agriculture, and infrastructure depends on the electric power system; electricity consumption is relative to modern life; consumers want to minimize their electricity costs and producers expect to maximize their profits and avoid volatility; therefore, it is important for accurate electricity price prediction.
There are four advantages of the optimized combined method. At first, the optimized combined model by ACO algorithm based on GARCH (1, 1) model and SVM method for forecasting half-hourly real-time electricity prices of New South Wales creates commendable improvements that are relatively satisfactorily for current research. Second, the individual model has its own independent characteristic in the power system. The proper selection of traditional single model can lessen the systemic information loss. The optimized combined forecasting model can make full use of single models and it is less sensitive to the poorer forecasting approach. On the basis there is no doubt that the improved combined forecasting model performs better than conventional single model. Third, an intelligent optimization algorithm called ant colony optimization is used to determine the weight coefficients. Finally, the optimized combined model is essentially automatic and does not require to make complicated decision about the explicit form of models for each particular case. The combined forecasting procedure gives the minimum MAE and MAPE.
The final result is that the optimized model has high prediction accuracy and good prediction ability. With proper characteristic selection, redundant information is overlooked or even eliminated; a more efficient and straightforward model is got. To sum up, it is clear that the improved combined model is more effective than the existing individual models for the electricity price forecasting.