The Weighted Support Vector Machine Based on Hybrid Swarm Intelligence Optimization for Icing Prediction of Transmission Line

Not only can the icing coat on transmission line cause the electrical fault of gap discharge and icing flashover but also it will lead to the mechanical failure of tower, conductor, insulators, and others. It will bring great harm to the people’s daily life and work. Thus, accurate prediction of ice thickness has important significance for power department to control the ice disaster effectively. Based on the analysis of standard support vector machine, this paper presents a weighted support vector machine regressionmodel based on the similarity (WSVR). According to the different importance of samples, this paper introduces the weighted support vector machine and optimizes its parameters by hybrid swarm intelligence optimization algorithm with the particle swarm and ant colony (PSO-ACO), which improves the generalization ability of the model. In the case study, the actual data of ice thickness and climate in a certain area of Hunan province have been used to predict the icing thickness of the area, which verifies the validity and applicability of this proposed method. The predicted results show that the intelligent model proposed in this paper has higher precision and stronger generalization ability.


Introduction
Safe and reliable power supply is the security to ensure sound and rapid development of the national economy; as the main part of power transmission in the grid, normal and safe operation of transmission lines is an important guarantee for the grid to avoid a serious accident, while the icing on the transmission line will lead to excessive tension, conductor galloping, tripping, and break accident of transmission line.It will also cause the interruption of power supply, affecting the stability and security of power system operation seriously.Moreover, due to reverse distribution of China's resources and productivity, our country needs to vigorously promote construction of outgoing channel of power base.It will increase the possibility of the icing when transmission lines go through extremely harsh complex area of the contamination, high altitude, snow, strong acid rain, and fog [1,2].
In the early 30s of last century, Britain, Japan, Canada, and American had some reports on transmission line ice coating, which had caused safety accident and brought the huge economic loss [3].As one of the serious transmission line icing countries, the probability of occurrence of ice disaster accident of transmission line in China stays the forefront in the world [4].Icing of transmission lines has become one of the important factors affecting the safe operation of power grid in the world.In view of great harm to the power system operation brought by icing, in recent years, the related research work on conductor icing has gradually become a hot research at home and abroad.And they have made certain achievements in the aspect of the formation mechanism of transmission line icing, ice prevention measures, Icing image monitoring, and conductor icing prediction model.The methods used for the icing prediction include empirical models, statistical refinement model [5,6], and the intelligent models, such as neural network [7][8][9], support vector machine [10][11][12][13] and so on.The multivariate linear regression model is the most widely used in the statistical theory model.But the influence factors considered in the model are not 2 Mathematical Problems in Engineering comprehensive enough, and the model is required to meet the various statistical assumptions.There are restrictions on the types of influence factors, which limit the range of application [14].Experience refined model is set out to establish a model from the physical essence of the ice cover and it is relatively simple; the generalization ability is weak.The outputs of different meteorological have high volatility [15]; neural network has the great ability of approaching nonlinearity but is easy to fall into local optimum and appear over learning situation.This model also has the disadvantages of low efficiency and poor generalization ability.It is difficult to guarantee the prediction accuracy of models [16].As the climate factors have the characteristics of greater volatility and randomness, SVM can take comprehensive consideration of multiple factors of ice thickness and has better ability of nonlinear mapping and generalization [17].And the model has the advantages of repeated training and the faster speed of convergence [18]; it can solve practical problems of the small sample, nonlinearity, local extreme value, and so on.It is widely used in many fields such as machine control and speech recognition and has achieved good forecasting effects [19].
The standard SVM has the same punishment for deviation and accuracy requirements for different samples.However, in practical applications, we often find that some samples have large correlation degree which requires a smaller training error, while some has small correlation degree which has the permissible of relatively large error [20].At this point, the traditional SVM cannot obtain an accurate prediction result [21].In view of this problem, given comprehensive consideration of the factors of the environmental temperature, relative humidity, wind speed, wind direction, and elevation effects, combined with statistical prediction model and icing meteorological parameters model used for icing prediction, this paper proposes a weighted support vector (WSVR) regression algorithm for icing prediction [22].We determine the weights of different samples by calculating the correlation coefficient between samples and optimize the parameters  and  using hybrid optimization algorithm of particle swarm and ant colony (PSO-ACO) to improve the model's generalization ability.In the empirical analysis, ACO-SVM, BP neural network and linear regression methods are used for comparison.By contrast, the model proposed in this paper has higher precision and can accurately predict the ice thickness.The establishment of the model has practical significance for power department to effectively control the ice disaster and improve the safe and reliable operation of power grid [23].

The Hybrid Cluster Intelligent Optimization
2.1.1.Swarm Intelligence Optimization Algorithm.The basic theory of swarm intelligent optimization algorithm is to simulate exchanges and cooperation of the actual biological group lives between each individual, with a simple and limited individual behavior and intelligence to form the overall capability of the whole population of inestimable through interaction.Various organisms in a swarm intelligent optimization algorithm are handled by artificial, and individuals do not have the volume and quality of the actual biological.Its behavior has the necessary processing based on people's needs to solve the problem [24].
Swarm intelligence optimization algorithm is a search of probability essentially.It does not need gradient information of questions and has the following characteristics which are different from the traditional optimization algorithm.
(1) The interaction of individuals in the population is distributed, and there is no direct central control.Individual failure will not affect to solution of the problem.It has the strong robustness.(2) Each individual can only perceive the local information and the individual's ability to follow the rules.So the swarm intelligence method is simple and convenient.(3) The computing time is less and the platform is easy to expand.(4) Self-organization, namely, the complex behavior of community is via a simple individual interaction which exhibits a high degree of intelligence.
Swarm intelligence optimization algorithm theory mainly aims to study the algorithm characteristics and improve the shortage and performance.The research mainly includes two aspects: one is to study its own characteristics of this algorithm to improve its performance; the other is to combine the swarm intelligence optimization with other algorithm to produce a new hybrid intelligent algorithm through the fusion of different algorithms.

Hybrid Swarm Intelligence Optimization Algorithm.
Currently, the thinking of swarm intelligence is receiving increasing attention, which shows great characteristics in solving problems, especially optimization problems.There are many algorithms based on the group, such as genetic algorithm, differential evolution, ant colony optimization, particle swarm optimization, and evolutionary programming, which can be grouped into swarm intelligence algorithms.As the most commonly used swarm intelligence optimization algorithms, ant colony optimization and particle swarm optimization have greater optimization features.Ant colony optimization, which is the simulation of ant colony foraging process, has been successfully applied to many discrete optimization problems.Particle swarm optimization, which is the simulation of birds foraging process, is an efficient parallel search algorithm in continuous optimization field.
Ant colony optimization uses pheromone to transmit information, while particle swarm optimization uses three pieces of information of the information of its own, individual extreme information, and global extreme information to guide the particle to the next iteration.Using the organic combination of the positive feedback principle and some heuristic algorithms, ant colony optimization is easy to run into prematurity and fall into local optimum.The organic combination of those two algorithms can overcome the shortcomings of them effectively and improve the computational efficiency significantly.According to the mixing characteristics of ant colony optimization and particle swarm optimization, this paper proposes an improved ACO PSO hybrid algorithm.
(1) Ant Colony Optimization.ACO is inspired by Italy scholar Dorigo M from the foraging behavior of real ant colony in nature.He found that an individual ant does not have much wisdom or master the nearby geographic information.But the colony can find an optimal path from nest to food sources.Established on the findings, Dorigo M and other researchers proposed ACO theory in 1991, attracting research enthusiasm of many scholars.The basic ACO model consists of the following three equations: If the ant marked NO,  passes by the path from  to , where  is the number of ants,  is the number of iterations,  is the position of ants,  is the position where ants can reach, Λ is the set of the position where ants can reach,   is the heuristic information, which means the visibility of the path from  to , named   = 1/  ,   is the objective function,   is the pheromone intensity of the path from  to ,    is the number of pheromone left by ants on the path from  to ,  is the weight of the path,  is the weight of the heuristic information,  is the evaporation factor of the number of pheromone on the path,  is coefficient of the pheromone quality, and    denotes the transition probabilities of the NO  ant moving from  to .
(2) Particle Swarm Optimization.Particle swarm optimization (PSO) is a kind of evolutionary algorithm, derived from the observation of the birds' behavior of searching for food.The conversion process of the motion of whole flock from disorderly to orderly comes from information shared by each individual bird in the flock, so as to find food [25].
The method used by PSO to solve optimization problems is to initialize a group of random particles and find the optimal solution through several iterations (Figure 1).In the process of iterations, each particle updates its direction and position constantly according to two extreme values.The first one is the optimal solution found by the particle itself, called individual extreme , and the other one is the current optimal solution found by the entire particle swarm, named global extreme .At the beginning of the iterations, the position of each initialized particle is the individual extreme, while the best position of the particle swarm is the global extreme.After all of the particles in the swarm complete the first iteration, we should compare the position front and rear of each particle and update the individual extreme with the optimal solution in this iteration if the new position is better speed The current

The current position
The optimal position of individual ant The optimal position of ant colony The next position than the previous one.Then, we need to get the optimal solution throughout individual extremes of all particles in the swarm as global extreme by comparison and update the global extreme if the new one is better than the old one.The final global extreme obtained through these cycle iteration operations determines the optimal solution [26].
After obtaining individual extreme and global extreme in the process, each particle needs to update its velocity and position according to the following formulas: where V , denotes the velocity of the  particle after  iterations,  , means the position of the  particle after  iterations,  , and  , are on behalf of the individual extreme and global extreme of the  particle after  iterations,  is the inertia weight of the updated speed to the speed of pre update, random() is a random number within (0, 1), and  1 and  2 are learning factors within (0, 2].The velocity of particles in the swarm is limited in (0, V max ), and the updated value should be replaced with V max if it exceeds maximum V max in the process of iterations.
In the process of particle swarm optimization algorithm, particle shares global extreme value to other particles within the group.This one-way flow of shared information and data makes the whole search process follow the group within the current optimal solution.Therefore, the initial particle swarm optimization algorithm has fast global convergence capability.

ACO PSO Hybrid Optimization Algorithm.
Researches show that the initial search process of ant colony optimization operates slowly because of lack of initial pheromone, and the set continuous parameters , , and  are mainly determined by experimental methods, which makes it difficult to achieve the optimal value because of the relation of the accuracy, the calculation speed, the performance, and other aspects of the method to the experience of laboratory assistants.PSO algorithm performs excellently in optimizing problems of continuous space, but when we get local optimal through  iterations, the change in rate of particles is mainly affected by V , , while  is in the range of 0 and 1.That is to say, along with the increase in the number of iterations, the speed of the particles will become smaller and smaller, and tends to 0 gradually, which makes it easy to fall into local convergence.To overcome the disadvantages of the two algorithms, this paper will integrate and improve them and propose a hybrid algorithm based on particle swarm and ant colony.First, we use the particle swarm optimization algorithm for fast global search to determine the parameters of ant colony and transform the better value into initial information pheromone.Next, we use ant colony algorithm for path searching, at the same time we put the length of optimal solution, the running time, and the numbers of iterations calculated by this set of parameters into the PSO algorithm, and update the velocity and position of each particle according to formula until we get the optimal solution for ant colony algorithm [27,28].
The basic steps of ACO-PSO hybrid algorithm are shown in Figure 2.

Basic Theory of SVM.
The principle of SVM is to transform the samples which cannot be separated in linear low-dimensional space to high dimensional feature space by nonlinear mapping algorithm to make the samples linearly separable and then analyze linearly the samples transformed.SVM regression is to create a nonlinear mapping and map the data to high dimensional feature space for the linear regression work later [21].
Assuming that the samples are ( 1 ,  1 ), ( 2 ,  2 ), . . ., (  ,   ) ∈   ×  where,   ∈   is an input parameter,   ∈  is the corresponding output parameter, and  is the number of samples.Regression function can be expressed by the following formula: where  is the threshold and  is the weight vector.
According to the theory of statistics, we define SVM regression function based on the target of minimization and build the following goal programming: where   and  *  are nonnegative slack variables;  is the penalty factor, which indicates the degree of punishment for out of error samples and also a compromise between empirical risk and the complexity of the model;  is an insensitive loss function parameter.
By using Lagrange method to solve the constrained optimization problems described above, the original problem can be converted to its dual problem: where (  ,   ) = [(  )⋅(  )] is the kernel function of SVM and   and  *  are the Lagrange coefficients.The SVM regression function is shown as follows: where radial basis function (RBF) is used as the kernel function and  is the width of the kernel function.

The Weighted Support Vector Machine
Algorithm.The traditional support vector machine algorithm applies to the cases where the sample data obeys identically distribution and the samples are independent of each other, and it has same punishment for penalty parameters and error request parameters of different sample.In this case, the model can ensure an accuracy result.However, the actual data is more complex and we often find that some important samples demand lower training error while others allow a certain size of training error.Therefore, to get a more accurate regression estimate, we should assign different error requirements and penalty coefficients to each sample data when describing the optimization problem.To solve this problem, the weighted support vector machine regression algorithm [29] will be used to predict in this work.
Assume that the variance of random error term has the following properties: In circumstances of different  2  , it can be difficult for the traditional support vector machine regression model to achieve an ideal result.In the process of optimization, the status of each parameter  and  * in the model is the same, which makes it easy for the regression line to be pulled to the larger variance items while the smaller variance items have relatively poor fitting degree.In this paper, to control the effects caused by differences in error term, we will adjust the impact of items on the summary of model by introducing the weight   .The optimization model is where   and   are the weighted coefficients of the th training sample for parameters  and , respectively.Literature [30] has pointed out that the weighting for penalty parameter  of samples can greatly improve system performance, but the improvement is not so significant in the case of weighting for error requirements parameter ; therefore, only optimal weighting for   will be executed in this paper.The basic idea of gray correlation degree is to determine the degree of association according to similarities between curves.Therefore, to determine the weights, this paper introduces gray correlation degree to calculate the similarities between samples.The gray correlation degree is calculated as follows: The gray correlation degree ( 0 ,   ) is The weight   is defined as where  = 1, . . ., ,  is the number of independent variables,  is a relatively small number, and   () is the data after the normalization.In practical application, each weight may be selected according to the actual situation.

Case Study and Results Analysis
Among the numerous factors influencing the ice coating on the transmission line, meteorological conditions are the most important factor, including temperature, humidity, wind speed, wind direction and other external climate.A lot of scholars have studied the impact factors preliminary.Literature [3] summarizes the three necessary conditions for the formation of icing by reviewing the results of research of predecessors.Air relative humidity must be above 85%; wind speed should be greater than 1 m/s; the temperature have to reach 0 ∘ C and below.On this basis, a pole called "Fuwaixian" in Hunan province is as a case study to demonstrate the effectiveness of the proposed approach.The historical meteorological data and ice thickness data from January 10, 2008, to March 21, 2008, are taken as data base.The forecasting model is solved through Matlab on a single core of a 32-bit Lenovo workstation running on Windows7 with 2 dual-core 2.60 GHz CPU and 4.0 GB of RAM.We extract rules from the past information to forecast the ice thickness.The main factors considered here are average temperature, relative humidity, wind speed, choosing average temperature, relative humidity and wind speed of forecasting day, the ice thickness, temperature, and relative humidity of the day before as input factors of support vector machine to predict the ice thickness of the transmission line.The original data chart is shown in Figure 3.
Before training samples, we must screen and normalize the raw data, deleting the cases of small relative humidity, high temperature, and small wind speed which lead to ice thickness close to 0. Finally, we retain 289 groups of data, of which the first 200 groups are the training set and the latter 90 groups are a test set, to prove the validity of the model.
The formula of normalization is as follows: After normalization, the values of each variable are between [0, 1], which eliminates the influence of dimension.
As the climate factors have the characteristics of greater volatility and randomness, SVM can take comprehensive consideration of multiple factors of ice thickness and has better ability of nonlinear mapping and generalization.As the selection of kernel parameter  and penalty factor  have great influence on the accuracy of the model of SVM regression, this paper selects Gauss radial basis function as kernel function of support vector machine model; that is, This paper will adopt PSO-ACO hybrid cluster intelligent optimization algorithm to optimize the parameters  and  to find an optimal parameters to improve the accuracy of prediction.
This paper selects the average percentage error of predictive value and the actual value as the objective function to search for the target of the minimum value of the objective function.The optimal solution of ant colony corresponding to the global minimum is the parameters  and  of SVM when the iteration terminates.We predict the icing thickness after putting the optimized parameter into the SVM regression model.
Figure 4 shows the process of optimization iterative convergence of PSO-ACO parameters; we can see that MAPE presents the ladder convergence to the stationary after 100 iterations, the dynamic adjustment of SVM parameters.In the process of evolution, the minimum MAPE presenting in the tenth iteration convergence is 2.53%; the corresponding optimal kernel function  and the penalty factor  are 3.145 and 41.916.Putting the obtained parameters into the weighted SVM model, we get the prediction of ice thickness shown in Figure 5.
In order to assess the reasonableness of the model proposed in this paper, we select support vector machine optimized by ant colony (ACO-SVM), SVM model, and linear regression model as the model for comparison.The prediction results of the three algorithms are shown in Figure 6 and the error is shown in Figure 7.This paper selects the mean absolute percentage error index as quantitative evaluation of the result of prediction: Error distributions of different methods are shown in Figure 7.
It can be seen from comparison of the prediction curve and the actual load curve that the results of four algorithms of ice thickness have approximation to actual curve.Among them, weighted support vector machine regression (WSVR) based on PSO-ACO hybrid intelligent optimization has the best fitting effect, and the mean absolute percentage error prediction result of this proposed method is 2.533% while ACO-SVM, the traditional SVM and linear regression method are 6.47%, 10.62%, 12.24%.The error is much higher than the method proposed in this paper.Error evaluation results show that weighted support vector machine optimized by the hybrid algorithm has a higher prediction accuracy than the single optimization SVM algorithm, the hybrid optimization algorithm makes up for the defects of single algorithm, and weighted support vector machine makes full use of the sample information, which improve the accuracy and the generalization ability of the model and make better prediction effect.Compared with other methods, the proposed method has obvious advantages and can be used for the ice thickness prediction of transmission line.

Conclusions
This paper proposes an intelligent model of weighted support vector machine (WSVR) based on PSO-ACO hybrid optimization.It is used to predict the ice thickness of transmission line, which has extremely important realistic meaning for the power sector to control ice coating effectively.This intelligent model is combined with the advantages of dealing with small samples, nonlinearity, and local optimum of SVM.The PSO-ACO hybrid optimization algorithm can overcome shortcomings of the local optimum in particle swarm algorithm and lack of pheromone in ant colony algorithm; at the same time, weighted support vector machine can make full use of sample information, which improves the generalization ability of the mode and promote the actual application scope of the prediction model.During the process of thickness forecasting, we give full consideration of the impact factors of climate and select the key factors, which makes the model more credible and scientific.This paper selects the icing data of "Fuwaixian" tower in a certain region of Hunan province from January 10, 2008, to March 21, 2008, for calculation.Empirical results show that the accuracy and generalization ability of improved model are improved.

Figure 3 :
Figure 3: Original data chart of temperature, relative humidity, wind's speed, and ice thickness.

Figure 6 :Figure 7 :
Figure 6: Comparison of the prediction results of different algorithms.