Multiple Feature Vectors Based Fault Classification for WSN Integrated Bearing of Rolling Mill

For rolling mill machines, the operating status of the bearing is closely related to process safety and production effectiveness, so reliable fault diagnosis and classification are indispensable. Traditional methods usually characterize a fault with a single feature vector, which may fail to reveal the full influence of faults caused by complex process disturbances and may lead to poor classification accuracy. To solve these problems, a feature extraction method is put forward to extract multiple feature vectors, and a classification model is then developed. First, to collect sufficient data, a data acquisition system based on a wireless sensor network is constructed to replace the traditional wired system, which may introduce hazards during production. Second, the measured signal is filtered by a morphological average filtering algorithm to remove process noise, and the empirical mode decomposition method is then applied to extract the intrinsic mode functions (IMFs) that contain the fault information. On the basis of the IMFs, a time domain index (energy) and a frequency domain index (singular values) are proposed through Hilbert envelope analysis. The energy index and the singular value matrix are then used for fault classification modeling based on an enhanced extreme learning machine (ELM), in which the bat algorithm optimizes the input weights and the thresholds of the hidden layer nodes. In comparison with fault classification methods based on SVM and ELM, the experimental results show that the proposed method has higher classification accuracy and better generalization ability.


Introduction
As a key part of a rolling mill, the bearing operates in an environment of high temperature, high humidity, and heavy dust. Moreover, the bearing bears the largest impact force and load during production, so it is prone to failure under these conditions. Thus, monitoring the bearing and classifying its faults into the correct types in a timely manner are of great significance.
Recently, data-driven fault monitoring and classification methods have attracted more and more attention [1]. In fact, the diversity and quality of the modeling data influence the effectiveness of fault classification. At present, the operating data of rolling mills are collected through wired communication, which is costly and hard to deploy. However, with the development of wireless communication technology, the wireless sensor network (WSN) has been widely applied in industrial processes because it has the advantages of low power consumption, low cost, wireless communication, and so forth [2]. To the best of the authors' knowledge, studies on fault classification of bearings based on WSN technology are rarely reported. Using well-developed networking technologies, data transmission and information exchange within and between systems become more efficient, fast, and reliable [3,4].
After data collection, two crucial points should be addressed for fault classification of bearings: (1) how to extract the fault features from the collected signal; (2) how to improve the classification accuracy of the fault identification model. Liu and Pan [5] extracted bearing fault features in the time domain based on an analysis of the data characteristics. Similarly, Shuang and Meng [6] analyzed the vibration signal of a rolling bearing using principal component analysis (PCA) and extracted the principal elements of the data as a reflection of the main characteristics of the fault case. However, vibration signals usually contain a large number of nonlinear components, and PCA is not capable of coping with nonlinear characteristics. Moreover, the above-mentioned methods judge the status of rolling bearings from a single fault feature only, which cannot give a comprehensive representation of the fault case. For multiple-fault-feature extraction, Malhi and Gao [7] proposed a method to construct a mixed domain feature set based on wavelet decomposition. However, the features are extracted by wavelet decomposition in the frequency domain only, and time domain features are not considered. Lei et al. [8], in turn, extracted features from the frequency domain instead of the time domain from six aspects, which include the original vibration signal and its spectrum and the signals filtered and demodulated by wavelet packet. Qin et al. [9] developed a fault classification model based on an improved extreme learning machine (ELM) method. However, to better describe a fault, more fault features are needed to measure its influence. That is, multiple fault features should be extracted rather than one single fault vector.
As for fault classification, intelligent algorithms have gradually been applied to bearings, such as the artificial neural network [10] and least squares support vector machines [11]. The traditional back propagation neural network (BPNN) suffers from a slow convergence rate, difficulty in configuring tunable parameters, and a tendency to fall into local optima. In comparison with BPNN, the support vector machine (SVM) has improved generalization performance. However, its kernel function and parameters are usually chosen according to human experience. ELM [12] is a more recently developed single hidden layer feed-forward neural network that does not require iterative adjustment of the hidden layer parameters. In ELM, the iterative parameter optimization process of the traditional neural network is replaced by solving a system of linear equations, and the minimum norm least squares solution is employed as the output weights of the network. Therefore, the network is trained in one pass without iterations. Compared with BPNN and SVM, ELM greatly improves the training speed and generalization ability, and it has been successfully applied in areas such as pattern recognition [13][14][15][16][17][18]. This paper introduces ELM into fault classification for bearings. However, the input weights and hidden layer thresholds of ELM are selected randomly, which cannot guarantee their validity. Because the structure parameters of the fault identification model determine its classification ability, obtaining the optimal structure parameters is the key to improving the classification accuracy of the model.
To solve the above-mentioned problems, a comprehensive and effective fault feature extraction and classification algorithm is proposed in this paper. First, a WSN is constructed on the gearbox to collect the vibration signal. Second, to overcome the influence of disturbances, a morphological average filtering algorithm is used to filter the collected signals, and the intrinsic mode functions (IMFs) are then obtained from the denoised signal through empirical mode decomposition (EMD) [19]. The IMFs carry the fault features, and the IMF components that have large correlation coefficients are used to calculate the energy index in the time domain. In addition, the envelope spectra of these IMFs are obtained by Hilbert envelope spectrum analysis, and the singular values are obtained by singular value decomposition of the envelope spectrum matrix. The singular value matrix and the energy index together form the multiple feature vectors used for classification. Third, the bat algorithm [20], which uses dynamic control of global and local search to avoid falling into local optima, is utilized to optimize the input weights and hidden layer thresholds of ELM. Finally, the fault classification model is developed from these feature vectors and the optimized ELM algorithm. The major contributions of this article are summarized as follows: (1) A data acquisition system based on a wireless sensor network is constructed to replace the traditional wired system. (2) A multiple fault feature decomposition method is proposed to explain the fault influences using two indices with physical significance. (3) A bat algorithm optimized ELM is proposed to determine the parameters and achieve better classification accuracy.
The remainder of this article is organized as follows. In Section 2, the rolling bearing and the developed WSN are briefly described. Next, the extraction of multiple feature vectors is proposed in Section 3. In Section 4, the fault classification method is formulated based on the enhanced ELM algorithm. Experimental results in the last section demonstrate the efficacy and feasibility of the proposed method.

Description of Rolling Bearing and the Developed Wireless Sensor Network
Rolling bearings are widely used in industry and are mainly composed of four parts: inner ring, outer ring, rolling body, and holder. Figure 1 shows the physical structure of a rolling bearing. Its main function is to transform sliding friction between the shaft and the seat into rolling friction. The bearing studied in this paper is from Baotou Iron and Steel Group; its type is S-21062-C, produced by the SOR Company, US.
To collect data from the rolling bearing, a wireless sensor network (WSN) is constructed. A WSN usually consists of a number of sensor nodes, cluster head nodes, and sink nodes. It forms a multihop ad hoc network through wireless communication, which can be used to receive, send, and process information about monitored objects within the covered area [17]. Considering that the object is a low speed and heavy load mill, we design a battery-powered WSN with low power consumption, low cost, and rapid deployment to monitor the vibration of the rolling mill.
Figure 2 shows the developed network topology for the CSP-F1 rolling mill gearbox, which consists of sensor nodes, cluster head nodes, and sink nodes. Each sensor node collects vibration signals and sends them to the cluster head node at the given sampling interval; meanwhile, it also receives commands from the cluster head node. The cluster head node collects the measured vibration data from the sensor nodes and transmits the data to the sink node. In addition, the cluster head node has three roles: sending commands to the vibration measurement nodes in its cluster, receiving the convergence order, and maintaining time synchronization. The main functions of the sink node are collecting data from the cluster head nodes and transmitting the data to the monitoring system.

The Extraction of Multiple Feature Vectors
In order to effectively extract sufficient information under different statuses of the rolling bearing, the signals collected by the WSN are denoised by a morphological average filter. Figure 3 shows the proposed procedure of feature extraction after denoising. Specifically, it includes three parts: (1) employing the EMD method to get the intrinsic mode functions (IMFs) that have large correlation coefficients; (2) calculating the energy index based on the obtained IMFs; (3) performing Hilbert envelope analysis on the IMFs to obtain the envelope spectra and their singular values. Through the above steps, the calculated energy index and singular values are employed as multiple feature vectors for bearing fault classification.

Denoising of Original Signals.
To remove the noise contained in the data collected from the WSN system, mathematical morphology (MM) and an average filtering algorithm [18] are used for filtering. The idea of MM is to use structural elements of certain shapes to measure and extract the corresponding shapes in an image, thereby achieving the purpose of image analysis. Based on the geometric characteristics of the signal, the MM based average filter can cope with nonlinear signal noise through morphological operations between the structural elements and the original signal. The proposed filter inherits the advantages of MM, including simple operation and analysis in the time domain. Therefore, it is advantageous for the processing of mechanical fault signals.
Opening operator ∘ and closing operator • are two basic operations of MM. For a signal $f(n)$ and a structural element $g(m)$, they are defined, respectively, as

$$(f \circ g)(n) = (f \,\Theta\, g \oplus g)(n), \qquad (1)$$

$$(f \bullet g)(n) = (f \oplus g \,\Theta\, g)(n), \qquad (2)$$

where $\Theta$ is the erosion operation given in (3) and $\oplus$ is the dilation operation given in (4):

$$(f \,\Theta\, g)(n) = \min_{m}\,[f(n+m) - g(m)], \qquad (3)$$

$$(f \oplus g)(n) = \max_{m}\,[f(n-m) + g(m)], \qquad (4)$$

where $n = 0, 1, \ldots, N-1$ and $m = 0, 1, \ldots, M-1$ index the sampling times of the signal and of the structural element, with $N$ larger than $M$.
The linear combination of (1) and (2) can be used to construct the average filter (AVG):

$$y(n) = \tfrac{1}{2}\,[(f \circ g)(n) + (f \bullet g)(n)]. \qquad (5)$$

In this way, positive and negative impulses of the signal are suppressed. Besides, the filter smooths the signal and reduces the signal noise.
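The opening, closing, and averaging operations of the morphological filter can be sketched as follows. This is a minimal illustration that assumes the standard erosion and dilation definitions with windows truncated at the signal boundaries; the function names are hypothetical and not taken from the paper.

```python
def erode(signal, element):
    """Erosion: minimum of signal[n + m] - element[m] over the valid offsets m."""
    size = len(signal)
    return [
        min(signal[n + m] - element[m]
            for m in range(len(element)) if n + m < size)
        for n in range(size)
    ]


def dilate(signal, element):
    """Dilation: maximum of signal[n - m] + element[m] over the valid offsets m."""
    return [
        max(signal[n - m] + element[m]
            for m in range(len(element)) if n - m >= 0)
        for n in range(len(signal))
    ]


def opening(signal, element):
    """Opening: erosion followed by dilation; removes narrow positive impulses."""
    return dilate(erode(signal, element), element)


def closing(signal, element):
    """Closing: dilation followed by erosion; removes narrow negative impulses."""
    return erode(dilate(signal, element), element)


def morphological_average_filter(signal, element):
    """Average of the opening and the closing of the signal."""
    opened = opening(signal, element)
    closed = closing(signal, element)
    return [(o + c) / 2 for o, c in zip(opened, closed)]


flat_element = [0, 0, 0]  # flat three-point structural element
filtered = morphological_average_filter([0, 0, 0, 5, 0, 0, 0], flat_element)
```

With a flat element, the opening removes an isolated positive impulse while the closing keeps it, so in this sketch the impulse of height 5 is halved to 2.5 rather than removed entirely.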

Extraction of Multiple
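Denote the denoised signal still by $x(t)$ for brevity. The EMD algorithm is employed to extract the inherent characteristics of the signal by decomposing it as

$$x(t) = \sum_{i=1}^{n} c_i(t) + r_n(t), \qquad (6)$$

where $c_i(t)$, $i \in [1, n]$, is the $i$th IMF, $r_n(t)$ is the residual of the signal, and $t$ is the index of sampling time.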
In fact, different IMFs have different significance with respect to the original signal $x(t)$, and this significance can be evaluated by a correlation coefficient. Inspired by the definition of the cross-correlation function, the correlation coefficient $\rho_{x c_i}$ between the original signal $x(t)$ and the IMF $c_i(t)$ is defined in normalized form as

$$\rho_{x c_i} = \frac{\sum_{t=1}^{N} x(t)\, c_i(t)}{\sqrt{\sum_{t=1}^{N} x^2(t)\, \sum_{t=1}^{N} c_i^2(t)}}, \qquad (7)$$

where $x(t)$ is the denoised signal, $c_i(t)$ is the $i$th IMF, and $N$ is the number of sampling times.
A large value of the coefficient means that the corresponding IMF is relevant to the original signal. In this way, the interference components are eliminated, and the intrinsic mode components that contain the most information of the original signal are obtained.
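The relevance ranking of the IMFs can be illustrated with a short sketch. It assumes that the correlation coefficient takes the usual normalized cross-correlation form and that the IMFs are already available as lists of samples; the function names are hypothetical.

```python
import math


def correlation_coefficient(x, c):
    """Normalized cross-correlation between a signal x and one IMF c."""
    numerator = sum(a * b for a, b in zip(x, c))
    denominator = math.sqrt(sum(a * a for a in x) * sum(b * b for b in c))
    return numerator / denominator if denominator else 0.0


def rank_imfs(x, imfs):
    """Return (imf_index, coefficient) pairs sorted from most to least relevant."""
    coefficients = [correlation_coefficient(x, c) for c in imfs]
    return sorted(enumerate(coefficients), key=lambda pair: pair[1], reverse=True)


signal = [1.0, 2.0, 3.0, 4.0]
candidate_imfs = [[1.0, -1.0, 1.0, -1.0], [1.0, 2.0, 3.0, 4.0]]
ranking = rank_imfs(signal, candidate_imfs)
```

In this toy case the second component is identical to the signal and is therefore ranked first, while the alternating component is only weakly related to it.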

Energy Index.
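The values of $\rho_{x c_i}$ calculated from (7) are sorted in descending order. Then 0.1 is defined as the threshold of the correlation coefficient, and the first $k$ IMFs whose coefficients are larger than the threshold are selected. On this basis, the energy index of the $i$th selected IMF can be calculated as follows:

$$E_i = \sum_{t=1}^{N} |c_i(t)|^2, \quad i = 1, 2, \ldots, k. \qquad (8)$$

After that, an energy eigenvector $T = [E_1, E_2, \ldots, E_k]$ is developed. For easy comparison and processing, $T$ is normalized as follows:

$$T' = \left[\frac{E_1}{E}, \frac{E_2}{E}, \ldots, \frac{E_k}{E}\right], \qquad (9)$$

where

$$E = \left(\sum_{i=1}^{k} |E_i|^2\right)^{1/2}.$$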
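A minimal sketch of the threshold selection and the normalized energy vector is given below. It assumes that the energy of an IMF is the sum of its squared samples and that the energy vector is normalized by its Euclidean norm; the function names are hypothetical.

```python
import math


def select_by_threshold(coefficients, threshold=0.1):
    """Indices of the IMFs whose coefficient exceeds the threshold, best first."""
    ranked = sorted(range(len(coefficients)),
                    key=lambda i: coefficients[i], reverse=True)
    return [i for i in ranked if coefficients[i] > threshold]


def energy_index(imf):
    """Energy of one IMF: the sum of its squared samples."""
    return sum(value * value for value in imf)


def normalized_energy_vector(imfs):
    """Energy of each IMF divided by the Euclidean norm of the energy vector."""
    energies = [energy_index(c) for c in imfs]
    norm = math.sqrt(sum(e * e for e in energies))
    if norm == 0:
        return energies  # all-zero input: nothing to normalize
    return [e / norm for e in energies]


kept = select_by_threshold([0.05, 0.8, 0.3])
features = normalized_energy_vector([[1.0, 1.0, 1.0], [2.0, 0.0, 0.0]])
```

Here the two toy IMFs have energies 3 and 4, so the normalized vector is approximately [0.6, 0.8] and has unit length.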

IMF Based Hilbert Envelope Spectrum Analysis.
The IMFs $c_1, c_2, \ldots, c_k$ calculated from Section 3.2.2 are taken to perform the Hilbert transform according to

$$H[c_i(t)] = \frac{1}{\pi} \int_{-\infty}^{\infty} \frac{c_i(\tau)}{t - \tau}\, d\tau. \qquad (10)$$

Combined with (10), the envelope spectrum of each IMF is calculated as follows:

$$B_i(t) = \sqrt{c_i^2(t) + H^2[c_i(t)]}. \qquad (11)$$

Finally, the envelope spectra of the IMFs construct a matrix $B$. By performing singular value decomposition [18] on $B$, we obtain

$$B = U S V^{T}, \qquad (12)$$

where $S = \mathrm{diag}(\sigma_1, \sigma_2, \ldots, \sigma_k)$ contains the singular values of the matrix $B$.
By processing each group of signals under different statuses according to the above steps, we obtain the IMF Hilbert envelope spectrum singular value matrix and combine these singular value matrices and the energy features as multiple feature vectors to classify the faults of the rolling bearing. The multiple feature vectors are employed to train the classification model of the rolling bearing based on the bat algorithm (BA) optimized ELM, which will be given in the following section.
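The envelope analysis and the singular value features can be sketched with NumPy as follows. This illustration assumes that the envelope is the magnitude of the FFT-based analytic signal and that the envelope spectrum is the magnitude spectrum of the mean-removed envelope; the function names are hypothetical and the paper may compute these quantities differently.

```python
import numpy as np


def analytic_signal(x):
    """FFT-based analytic signal x + j*H[x] of a real one-dimensional signal."""
    x = np.asarray(x, dtype=float)
    n = x.size
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0
        weights[1:n // 2] = 2.0        # double the positive frequencies
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(np.fft.fft(x) * weights)


def envelope_spectrum(imf):
    """Magnitude spectrum of the mean-removed Hilbert envelope of one IMF."""
    envelope = np.abs(analytic_signal(imf))
    envelope = envelope - envelope.mean()
    return np.abs(np.fft.rfft(envelope)) / envelope.size


def singular_value_features(imfs):
    """Stack the envelope spectra row by row and return the singular values."""
    matrix = np.vstack([envelope_spectrum(c) for c in imfs])
    return np.linalg.svd(matrix, compute_uv=False)
```

For an amplitude-modulated test tone, the largest peak of the envelope spectrum falls at the modulation frequency, which is the kind of fault-related frequency the envelope analysis is meant to expose.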

Enhanced ELM Algorithm for Fault Classification
The accuracy of fault classification depends on the intelligent model used in the machine learning process.
In comparison with the BP method and the SVM method, ELM only needs the number of hidden layer nodes to be determined before training the network. Besides, it has the advantages of high efficiency, fast learning speed, and a unique solution. However, two structure parameters of ELM, that is, the input weights and the hidden layer thresholds, are randomly given, which may result in poor accuracy. Because BA offers dynamic control of the conversion between global and local search and avoids falling into local optima, it is employed to optimize these two structure parameters of ELM. Thus, BA optimized ELM is proposed in the developed rolling bearing fault classification model to improve the precision and generalization ability.

The Establishment of Fault Classification Model.
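In this part, the fault classification model is developed based on ELM. Figure 4 shows the proposed method. Once the number of neurons in the hidden layer is determined, ELM randomly generates the connection weights between the input layer and the hidden layer and the thresholds of the hidden layer neurons, and it can then obtain the unique optimal solution. Assuming that the number of samples is $N$, the number of nodes of the hidden layer is $L$, and the activation function is $g(x)$, the mathematical model of ELM is defined as follows:

$$\sum_{i=1}^{L} \beta_i\, g(w_i \cdot x_j + b_i) = o_j, \quad j = 1, 2, \ldots, N, \qquad (13)$$

where $w_i = [w_{i1}, w_{i2}, \ldots, w_{in}]$ is the connection weight vector between the input nodes and the $i$th node of the hidden layer; $b_i$ is the threshold of the $i$th node in the hidden layer; $\beta_i$ is the output weight of the $i$th hidden node; and $x_j$ and $o_j$ are the input and the network output for the $j$th sample. In (13), a feed-forward neural network model with a single hidden layer is developed, whose output can approximate the targets $t_j$ with zero error:

$$\sum_{j=1}^{N} \| o_j - t_j \| = 0.$$

Consequently, the parameters $\beta_i$, $w_i$, and $b_i$ satisfy the following relationship:

$$\sum_{i=1}^{L} \beta_i\, g(w_i \cdot x_j + b_i) = t_j, \quad j = 1, 2, \ldots, N. \qquad (14)$$

And (14) can be further simplified as $H\beta = T$, in which $H$ is the output matrix of the hidden layer and $H(j, i)$ stands for the output of the $j$th training data in the $i$th hidden node.
The goal of adjustment is to find a set of optimal parameters $\hat{w}_i$, $\hat{b}_i$, $\hat{\beta}_i$ that minimize $\| H(\hat{w}_i, \hat{b}_i)\,\hat{\beta}_i - T \|$.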
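The basic ELM training step described here can be sketched with NumPy. The sketch assumes a sigmoid activation, input weights and thresholds drawn uniformly from [-1, 1], and one-hot targets; these choices and the function names are illustrative assumptions rather than the paper's settings.

```python
import numpy as np


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def train_elm(X, T, n_hidden, rng):
    """Draw random input weights and thresholds, then solve H beta = T."""
    W = rng.uniform(-1.0, 1.0, size=(n_hidden, X.shape[1]))  # input weights w_i
    b = rng.uniform(-1.0, 1.0, size=n_hidden)                # hidden thresholds b_i
    H = sigmoid(X @ W.T + b)                                 # hidden layer output matrix
    beta = np.linalg.pinv(H) @ T                             # minimum norm least squares solution
    return W, b, beta


def predict_elm(X, W, b, beta):
    """Network output for the rows of X."""
    return sigmoid(X @ W.T + b) @ beta


# Toy two-class data with one-hot targets (hypothetical, for illustration only).
X = np.array([[0.0, 0.0], [0.1, 0.2], [0.2, 0.1],
              [1.0, 1.0], [0.9, 1.1], [1.1, 0.9]])
labels = np.array([0, 0, 0, 1, 1, 1])
T = np.eye(2)[labels]
W, b, beta = train_elm(X, T, n_hidden=10, rng=np.random.default_rng(0))
predicted = predict_elm(X, W, b, beta).argmax(axis=1)
```

Only `beta` is computed from the data; `W` and `b` stay at their random values, which is the part that an optimizer such as BA would tune instead.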

Enhanced ELM Based on BA.
The randomly generated input weights and hidden layer thresholds might be zero, which may render some hidden nodes ineffective. Thus, the number of hidden layer nodes has to be increased to achieve higher classification accuracy. However, this may lead to poor adaptability and low generalization capacity for testing data. To solve this problem, BA is employed to optimize the input weights and hidden layer thresholds of ELM. In this way, the classification accuracy and generalization ability will be improved. Figure 5 shows the specific process.
BA is a heuristic algorithm proposed by Yang et al. [21] that has the advantages of fast convergence speed and high convergence precision. It finds the optimal solution of a problem by simulating the foraging behavior of bats. The specifics are as follows:

(1) Initialize the location $x_i^t$ and speed $v_i^t$ of each bat in the population ($i = 1, 2, \ldots, N$), in which $t$ is the time index. Define the pulse frequency $f_i$ of the $i$th bat at position $x_i$. Then initialize the pulse firing rate $r_i$ and loudness $A_i$. Calculate the fitness value (mean square error) of each individual and, according to the fitness value, determine the current optimal solution $x^*$.

(2) Update the pulse frequency, speed, and position of each bat according to (17) through (19), respectively:

$$f_i = f_{\min} + (f_{\max} - f_{\min})\,\beta, \qquad (17)$$

$$v_i^t = v_i^{t-1} + (x_i^{t-1} - x^*)\, f_i, \qquad (18)$$

$$x_i^t = x_i^{t-1} + v_i^t, \qquad (19)$$

where $\beta \in [0, 1]$ is a uniformly distributed random number; $v_i^t$ and $v_i^{t-1}$ are the speeds at times $t$ and $t-1$; $x_i^t$ and $x_i^{t-1}$ represent the positions of the bat at times $t$ and $t-1$.

(3) Generate a uniformly distributed random number $\mathrm{rand}_1$. If $\mathrm{rand}_1 > r_i$, a new solution is produced by random perturbation, and the new solution is then checked against the search boundaries.

(4) Generate a uniformly distributed random number $\mathrm{rand}_2$. If $\mathrm{rand}_2 < A_i$ and $f(x_i) < f(x^*)$, the solution of Step (3) is accepted. Then update $r_i$ and $A_i$ according to

$$A_i^{t+1} = \alpha A_i^t, \qquad (20)$$

$$r_i^{t+1} = r_i^0\,[1 - \exp(-\gamma t)], \qquad (21)$$

where $\alpha$ is the loudness attenuation coefficient and $\gamma$ is the pulse enhancement coefficient.

(5) Sort the fitness values of all bats and find the optimal solution.

(6) Repeat Steps (2)-(5) until a solution that meets the termination condition is found or the largest number of iterations is reached.
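The steps of the bat algorithm can be sketched in plain Python as below. This is an illustration under stated assumptions, not the authors' implementation: it follows Yang's standard formulation (a new solution is accepted when a random number is below the loudness), minimizes a generic objective instead of the ELM mean square error, and uses hypothetical names and default values.

```python
import math
import random


def bat_algorithm(objective, dim, lower, upper, n_bats=20, n_iter=200,
                  f_min=0.0, f_max=2.0, loudness0=1.6, pulse_rate0=0.5,
                  alpha=0.9, gamma=0.99, seed=0):
    """Minimize `objective` over the box [lower, upper]^dim."""
    rng = random.Random(seed)

    def clip(value):
        return min(max(value, lower), upper)

    # Step (1): initialize positions, speeds, loudness, and pulse rates.
    x = [[rng.uniform(lower, upper) for _ in range(dim)] for _ in range(n_bats)]
    v = [[0.0] * dim for _ in range(n_bats)]
    fitness = [objective(p) for p in x]
    loudness = [loudness0] * n_bats
    pulse_rate = [0.0] * n_bats
    best = min(range(n_bats), key=lambda i: fitness[i])
    best_x, best_f = list(x[best]), fitness[best]
    history = [best_f]

    for t in range(1, n_iter + 1):
        mean_loudness = sum(loudness) / n_bats
        for i in range(n_bats):
            # Step (2): frequency, speed, and position updates.
            freq = f_min + (f_max - f_min) * rng.random()
            v[i] = [v[i][d] + (x[i][d] - best_x[d]) * freq for d in range(dim)]
            candidate = [clip(x[i][d] + v[i][d]) for d in range(dim)]
            # Step (3): random perturbation around the best solution, kept in bounds.
            if rng.random() > pulse_rate[i]:
                candidate = [clip(best_x[d] + rng.uniform(-1.0, 1.0) * mean_loudness)
                             for d in range(dim)]
            candidate_f = objective(candidate)
            # Step (4): accept, then lower the loudness and raise the pulse rate.
            if rng.random() < loudness[i] and candidate_f < best_f:
                x[i], fitness[i] = candidate, candidate_f
                loudness[i] *= alpha
                pulse_rate[i] = pulse_rate0 * (1.0 - math.exp(-gamma * t))
            # Step (5): keep track of the best solution found so far.
            if candidate_f < best_f:
                best_x, best_f = list(candidate), candidate_f
        history.append(best_f)  # Step (6): iterate until n_iter is reached
    return best_x, best_f, history


def sphere(point):
    return sum(c * c for c in point)


best_x, best_f, history = bat_algorithm(sphere, dim=2, lower=-5.0, upper=5.0, n_iter=50)
```

To tune an ELM with such a routine, each bat position would hold one candidate set of input weights and hidden thresholds, and `objective` would return the resulting mean square error.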

Data Preparations.
The application object of this article is a mill located in Baotou Iron and Steel Group, China. Figure 6 shows the gearbox of the mill, which is the source of power; its operating status greatly affects the whole production line. A data collection system based on WSN is constructed so that the vibration signal can be collected online. In general, there are three types of fault: rolling bearing fault, inner ring fault, and outer ring fault. Together with the normal status, Figure 7 shows the four kinds of signal collected for analysis.
A morphological average filter is used to denoise the above signals. A linear structural element is selected, and each structural element value is 0, namely, $g = \{0, 0, 0\}$. With this structural element, the noise in the signals of the four states is filtered by the morphological average filter, as shown in Figure 8. From Figures 7(a) and 8(a), it can be observed that the noise of the normal signal is significantly reduced after morphological average filtering. Similar phenomena can be observed for the other three fault cases.
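For each operation status, the experiment was performed 30 times, and each experiment contains 2048 data points. Then, EMD is used to decompose the samples under different statuses. According to the rule given in Section 3.2.1, four IMFs are retained. Figure 9 shows the decomposition of one experiment under normal status. The correlation coefficient between each retained IMF and the original signal is evaluated for each state, and Table 1 summarizes the results. Taking the Hilbert envelope of these four IMFs gives the results shown in Figure 10. It is observed that the approximate fault frequencies of different conditions differ greatly. The energy index and the singular values of the Hilbert envelope spectrum are then plotted, respectively. The normal state has the highest energy value, followed by the inner ring fault and the outer ring fault, and the last one is the rolling bearing fault. However, the outer ring fault presents the highest singular value of the Hilbert envelope, followed by the inner ring fault, the rolling bearing fault, and the normal status. At the same time, it can be seen that under different conditions the two indices discriminate well between the states.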

The Development of Classification
For the proposed fault classification model, the initial parameter values of BA optimized ELM are as follows: the population number is 20; the range of pulse frequency is [0, 2]; the initial pulse frequency is 0.0001; the biggest voice loudness is 1.6; the loudness attenuation coefficient is 0.9; the pulse enhancement coefficient is 0.99; and the largest number of iterations is set to 200. In total, the experiment is repeated thirty times under each condition. Twenty of the resulting samples are used as training data and the remaining ten are used as testing data. Using the energy index and the Hilbert envelope spectrum singular value index as the input, the fault classification model based on the BA-ELM algorithm is developed. As shown in Figure 13, the fault classification accuracy of the BA-ELM model for the testing samples is 97.5%. The value on the $y$-axis stands for the operation status: 1 stands for the normal condition, and 2, 3, and 4 stand for the inner ring fault, outer ring fault, and rolling bearing fault, respectively. To better illustrate the performance of the proposed method, SVM and the traditional ELM method are employed for comparison. Figure 14 shows the results of SVM and Figure 15 shows the results of ELM. These results are summarized in Table 2 for clear comparison. In summary, the proposed method has higher classification accuracy.
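The data split and the accuracy figure can be made concrete with a small sketch. It assumes that the first twenty samples of each status are used for training (the paper does not state how the split is drawn) and uses placeholder samples and hypothetical function names.

```python
def split_per_class(samples_by_class, n_train):
    """Split each class into the first n_train samples and the remainder."""
    train, test = [], []
    for label, samples in samples_by_class.items():
        train += [(sample, label) for sample in samples[:n_train]]
        test += [(sample, label) for sample in samples[n_train:]]
    return train, test


def accuracy(true_labels, predicted_labels):
    """Fraction of samples whose predicted label matches the true label."""
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


# Four statuses (1 = normal, 2 = inner ring, 3 = outer ring, 4 = rolling bearing),
# thirty placeholder samples each.
data = {label: list(range(30)) for label in (1, 2, 3, 4)}
train, test = split_per_class(data, n_train=20)

true_labels = [label for _, label in test]
predicted = list(true_labels)
predicted[0] = 2  # one misclassified testing sample, for illustration
test_accuracy = accuracy(true_labels, predicted)
```

With ten testing samples for each of the four statuses, the testing set has 40 samples, so an accuracy of 97.5% corresponds to a single misclassified sample.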

Conclusion
To solve the problems of data acquisition and fault classification for rolling bearings, several crucial points are addressed in this paper. First, a data acquisition system based on a wireless sensor network is constructed to replace the traditional wired system and collect sufficient data. Because the rolling bearing works in a complex environment, the collected vibration signal is always polluted by noise. To remove the noise effectively, a morphological average filtering algorithm is proposed. Then the empirical mode decomposition method is performed on the filtered data to obtain multiple feature vectors, including a frequency domain index and a time domain index, and these two indices are used as inputs for fault modeling. Finally, the fault classification model is developed based on an enhanced extreme learning machine, in which the bat algorithm optimizes the input weights and the thresholds of the hidden layer nodes. In comparison with fault classification methods based on the support vector machine and the traditional extreme learning machine, the experimental results show that the proposed method has higher classification accuracy and better generalization ability.

Figure 3: Flow chart of the extraction of fault feature vectors.

Figure 4: Flow chart of intelligent fault classification.

Figure 5: The flow chart of BA optimized ELM algorithm.


Figure 6: The gearbox of rolling mill.

Figure 9: The results of EMD for normal condition.

Figure 14: Classification of testing data based on SVM.

Table 1: Correlation coefficients between IMFs and the original signal in four cases.