An Optimized Grey Dynamic Model for Forecasting the Output of High-Tech Industry in China

The grey dynamic model by convolution integral with the first-order derivative of the 1-AGO data and n series related, abbreviated as GDMC(1, n), performs well in modelling and forecasting of a grey system. To improve the modelling accuracy of GDMC(1, n), n interpolation coefficients (taken as unknown parameters) are introduced into the background values of the n variables. The parameters optimization is formulated as a combinatorial optimization problem and is solved collectively using the particle swarm optimization algorithm. The optimized result has been verified by a case study of the economic output of high-tech industry in China. Comparisons of the obtainedmodelling results from the optimizedGDMC(1, n)model with the traditional one demonstrate that the optimal algorithm is a good alternative for parameters optimization of the GDMC(1, n) model. The modelling results can assist the government in developing future policies regarding high-tech industry management.


Introduction
Grey system theory [1,2], a rising interdiscipline in uncertainty, takes the uncertain systems of "small samples and incomplete information" with "partial known information and partial unknown information" as the research object. This theory extracts valuable information mainly by generating and developing the "partial" known information. Then, the accurate demonstration of the system running behavior and the evolvement rule is done and effective monitoring is achieved. Grey system theory can be used like this to deal with the practical problems in a situation where the sample data size is not large enough to demonstrate the statistical law or the deterministic law of the system behavior.
At present, the most widely used grey model GM(1, 1) [1, 2]-a first-order one-variable grey differential equationis proposed based on the aforementioned principle. Its modelling principle does not depend on distribution information from the raw data but on the application of a first-order accumulative generation operation (1-AGO), to make the sequence display grey exponent law behavior as a whole. Based on this, a first-order grey differential equation is constructed and solved. The forecast values are then derived from the first-order inverse accumulative generating operation . Due to the fact that the construction of GM(1, 1) does not require a large sample and it is easy to build and calculate, GM(1, 1) and its improved variant models have been widely used [3][4][5][6].
The GM(1, ) model [1], with −1 relative variables acting as an associated series besides the predicted series, is a basic grey multivariable model. The forecasts of a time series (0) 1 may be considerably improved by using information coming from some associated series (0) , = 2, 3, . . . , . This is particularly true if changes in (0) 1 tend to be anticipated by changes in (0) , = 2, 3, . . . , . Nevertheless, the solution of the whitening differential equation for GM(1, ) is rough [7]-it can easily result in large errors in actual forecasting applications. Thus, for a long time, this model has been little used. In fact, the only existing research applications have proceeded on the basis of improved models [8][9][10]. The grey model with convolution integral, GMC(1, ), has been proposed by Tien [11]. This is a new model which seeks to improve the traditional GM(1, ) model. Theoretically, the modelling values produced by GMC (1, ) are the exact solution of the traditional GM(1, ) model and the grey control parameter 2 Mathematical Problems in Engineering (similar to that of the GM(1, 1) model) introduced into the model. When the number of variables in the model = 1, the GMC(1, ) model reduces to GM(1, 1). Using GMC(1, ) greatly improves the forecasting accuracy of the multivariable grey model and has been successfully applied to various different fields [11][12][13]. The multivariable grey model GDMC(1, ) [14] is based on the model GMC(1, ). The higher frequency components of the 1-AGO data of the associated series were significant for the grey prediction system and the first-order derivatives of 1-AGO data of the associated series were added into the GDMC(1, ) model.
In this study, interpolation coefficients are introduced into the background values of the variables in GDMC(1, ) model. Then, by aiming to minimize the modelling error, the optimal parameters are solved using the particle swarm optimization algorithm. Thereby, the adaptability and forecasting accuracy of GDMC(1, ) to real data are enhanced. The remainder of this paper is organized as follows. Section 2 explains the modelling methods, including the traditional GDMC(1, ) and the optimization method. In Section 3, the traditional GDMC(1, ) and the optimized GDMC(1, ) are applied to forecast the output of high-tech industry in China. Finally, conclusions are presented in Section 4. 1 from some dynamic system. The existing GDMC(1, ) modelling process [14] is carried out as follows. Consider the original predicted series

Methodology
Then the 1-AGO data for (0) 1 , (0) 2 , . . . , (0) are given by the following equations, respectively: The grey forecasting model based on the predicted 1-AGO series and the associated 1-AGO series (1) (2) , . . . , (1) is given by the following differential equation: (1) where 1 and 2 are the developmental coefficient and the grey control parameter, respectively, and ( 2 −1 , 2 ) are the associated coefficients corresponding to the associated series (0) , = 2, 3, . . . , , respectively, is the number of entries for model building, is a period of delay, and 1 , 2 , . . . , 2 are model parameters to be estimated. Equation (7) is called the grey dynamic model by convolution integral with the first-order derivative of the 1-AGO data and series related, abbreviated as GDMC(1, )-the "1" represents the first-order derivative of the 1-AGO series of (1) 1 and the " " represents the total of relative series introduced into the grey differential equation.
The least-squares solution for the model parameters in the GMC(1, ) in (7) for t from 1 to r is given by where Mathematical Problems in Engineering 2.3. The Evaluation of̂( 0) 1 . In summary, in the right-hand side of (7), the discrete function ( ) can be obtained as The 1-AGO modelling values of the predicted series can be derived using the initial condition̂( 1) The second term of the right-hand side in (13) can be evaluated approximately by the two-point Gauss numerical integration [14] with the linear assumption on ( ) between any two neighboring times; we havê (1) where the coefficients 1 and 2 are both equal to 1, the nodes 1 and 2 are equal to −1/ √ 3 and 1/ √ 3, respectively, and ( − 2) is the unit step function. Applying 1-IAGO to (15) yields the following modelled values together with the forecasts: where is the number of entries to be forecasted or indirectly measured.
Assuming the system parameters in (7) to be constants in the postsampling period, then, by using the postsampling data combined with the data given for the corresponding associated series as a new input series, the corresponding forecasts or values of indirect measurement for the predicted series can be derived.
It is obvious that, when the number of associated series is zero, that is, when = 1, (7) reduces to the grey single variable forecasting model GM(1, 1).
The particle swarm optimization (PSO) algorithm [15] is a population-based heuristic algorithm that simulates the social behavior as birds flocking to a promising position to achieve precise objectives in a multidimensional space. Each particle is a potential solution to the optimization problem. A particle represents a point in an -dimensional space, and its status is characterized through its position and velocity. The position for the particle can be represented as = ( 1 , 2 , . . . , ). The velocity of this particle can be represented by another -dimensional vector V = (V 1 , V 2 , . . . , V ). The best previous position of the th particle can be represented as = ( 1 , 2 , . . . , ). The best position in the entire swarm is denoted as = ( 1 , 2 , . . . , ). To search for the optimal solution, each particle changes its velocity and position according to the following two formulas: where means the number of iterations. is called the inertia weight that controls the impact of previous velocity of particle on its current one. In a general way, we let min = 0.4, max = 0.9. 1 and 2 are positive constant parameters called acceleration factors which control the maximum step size. Typical values for 1 and 2 are 1 = 2 = 2. rand(⋅) is a random number between zero and one.
To avoid the phenomenon of "shock" when particles near global optimal solution, we can employ a linear gradient strategy for the inertia weight: where max is the total number of iterations. This study uses the PSO algorithm to solve the optimal background value coefficients for GDMC(1, ) model. The specific steps are as follows.
Step 1. Generate particles randomly in the -dimensional space. The position and velocity for the particle can be represented as Step 2. Set the initial position of the particles as 1 = (0, 0, . . . , 0) and generate an initial velocity vector V 1 randomly in the interval (0, 1.1]. Step 3. According to (10), substitute into the matrix and establish the fitness function Step 4. Calculate the fitness value for each particle.
Step 5. Compare each particle's fitness to the global best position best to adapt to the value; if it is the optimum, it can be taken as the best position; if not, turn to Step 6.
Step 6. Update particles velocity and position according to the evolution equation according to (19).
Step 7. If stopping criteria are met, show the output and its fitness, which corresponds to the optimal parameters and the RMSPEPR, or else go back to Step 3.

Forecasting the Output of High-Tech Industry in China
Forecasting the output of high-tech industry is essential for the development of projects and policy making. As Chinese high-tech industry is neonatal and so the data relating to existing economic indices are limited, it is difficult to apply existing statistical methods of analysis and forecasting to them. As a result, little research has been conducted on quantitative forecasting of Chinese high-tech industry. In this section, the advantage of the optimized GDMC(1, ) model (abbreviated as OGDMC(1, )) over the traditional one is demonstrated by a real case study of high-tech industry in China.

Variables and Data.
In economics theory, human resources and capital investment are the crucial factors of the economic system output [16]. The economic output of hightech industry is taken as the predicted series and is denoted as 1 . And the annual average employment and investment are adopted as the relative variables of the output of hightech industry and are denoted as 2 and 3 , respectively. The original data relating to the output, average employment, and investment of the high-tech industry in China for 2003-2010 are shown in Table 1. These data are all collected from the China Statistics Yearbook on High Technology Industry (2004-2011). GDMC(1, 3) and OGDMC (1,3). In the following, the industrial output value 1 is used as the forecasting variable. At the same time, with the average employment 2 and the investment 3 as relative variables, multivariable forecasting models GDMC(1, 3) and OGDMC(1, 3) models are established. The data of 1 , 2 , and 3 from 2003 to 2010  Table 2: The values of parameters , , and in (7), the estimates of model parameters 1 , 2 , and ( 2 −1 , 2 ) in (10), and the value of RMSPEPR in (17).  is employed as the original modelling series to construct GDMC(1, 3) and OGDMC(1, 3) models. (1,3). Applying the GDMC(1, 3) model given by (7)- (16), the values of the parameters , , and in (7) and the estimates of the model parameters 1 , 2 , and ( 2 −1 , 2 ) in (10) can be obtained ( Table 2). The resulting GDMC(1, 2) model from (7) has the form

Parameter
To sum up the right-hand side of (7), the discrete function ( ) in (12) for the OGDMC(1, 2) model is obtained and tabulated ( Table 5). The values of RMSPEPR in (17) are also listed in Table 4.

Evaluation of the Grey Forecasting Models.
In this study, the most commonly used four measurements in grey theory, including mean relative error (MRE), absolute degree of grey incidence (ADGI), ratio of standard deviation (RSD), and probability of small error (PSE) [2,17], are used to evaluate the accuracy of the models involved. The levels of accuracy and their critical values are given in Table 6 [17]. When all  Figure 1 demonstrates the degree of closeness between the modelling results of the models and the real values of the output of high-tech industry in China. Clearly, the model OGDMC(1, 3) is closer to the actual data than the traditional one. Moreover, as can be seen from a histogram of the modelling residual errors (Figure 2), the residual error from the OGDMC(1, 3) model is notably smaller than that of the GDMC (1, 3). The reason for this lies in the fact that, through optimization of the background value, the modelling ability of the GDMC(1, 3) model can be further improved in the OGDMC(1, 3) model. The model coefficients 3 , 4 , 5 , and 6 can reveal the role and importance of employment and investment and their impact on the output of high-tech industry.

Conclusions
This study presents a PSO-algorithm-based optimization method for the grey dynamic model by convolution integral with the first-order derivative of the 1-AGO data and series related. According to empirical modelling results of the high-tech industry in China, the modelling accuracy of the  traditional GDMC(1, ) model can be effectively increased using a background value optimization method as proposed in this study. This is important in practice as the government of high-tech industry needs to make decisions based on traditional parameter solutions which may be constrained by an incorrect local optimal solution.
Due to the lack of more additional data published by China's Statistic Department at present, the out-of-sample and the time-delayed forecasting results of this study have not fully been validated. Therefore, in future work, the outof-sample and the time-delayed forecasting using both the GDMC(1, ) and the OGDMC(1, ) needs to be taken into account after more data is released by China's Statistic Department. In addition, the grey forecasting model established in this study merely considers two basic input factors, that is, capital and labor, while ignoring the technical level. The main reason lies in the fact that sample data are rather limited in the short development of China's high-tech industries. Moreover, it is challengeable to estimate the technology level of high-tech industries using small sample data. To address this problem, further researches are also needed.