Application in Feature Extraction of AE Signal for Rolling Bearing in EEMD and Cloud Similarity Measurement

Owing to its strong denoising ability, the EEMD algorithm is widely applied to feature extraction from fault signals of rolling bearings. However, whether the sensitive IMFs are selected correctly after decomposition directly affects the correctness of the extracted fault features. To solve this problem, the paper first proposes a new method for selecting sensitive IMFs based on Cloud Similarity Measurement. A simulation experiment comparing this method with the traditional mutual information method shows that the proposed method overcomes the misjudgment of the traditional method and achieves higher accuracy. In the experiments, AE signals of the inner ring of a rolling bearing in the normal, damage, and fracture fault states are collected as samples and decomposed by the EEMD algorithm. Cloud Similarity Measurement is then used to select the sensitive IMFs that reflect the fault features. Finally, the Multivariate Multiscale Entropy (MME) of the sensitive IMFs is taken as the eigenvalue of the original signal and classified by an SVM to determine the fault type. The experimental results show that selecting sensitive IMFs by Cloud Similarity Measurement is effective and helps to improve the accuracy of feature extraction and fault diagnosis.


Introduction
Rolling bearings are among the most common parts of rotating machinery, and about 30% of mechanical faults are caused by rolling bearings, so the detection and diagnosis of rolling bearing faults have long been a hot research topic worldwide. Over the past ten years, acoustic emission (AE) technology has become important in monitoring the state of rolling bearings; it can effectively detect early symptoms of pitting corrosion defects and crack initiation, and thus helps to avoid disastrous consequences [1,2]. Elforjani has carried out extensive research and experiments on natural defects of bearings; the results show that AE technology can successfully detect and monitor natural crack initiation and propagation in slow-speed rotating bearings [3,4]. Al-Ghamdi et al. compared the RMS value, amplitude, and kurtosis of AE signals and vibration signals for outer race faults of rolling bearings and concluded that AE is more effective than vibration analysis in early fault diagnosis [5]. Researchers in China and abroad have extensively explored feature extraction methods for AE signals of bearings. Choudhury and Tandon applied two kinds of parametric analysis, based on ring-down counts and peaks, to analyze failures of the rollers and the inner ring of bearings under varying load, fault size, and speed [6]. Beck et al. studied the physical properties of material fracturing; the test results indicate a linear relationship between AE energy and the fracture area or depth [7]. In bearing tests, Kilundu et al. recorded cyclostationary AE signals; the results indicate that indicators related to the cyclic spectrum are more sensitive than traditional time-domain indicators for continuous monitoring of the fracture [8].
Because mechanical equipment often works under harsh conditions, AE signals collected on site are usually contaminated by various kinds of noise. To remove the noise, some researchers have introduced wavelet denoising into the feature extraction of AE signals and achieved good results. Zhao et al. decomposed the AE signal into different frequency bands by wavelet analysis and reconstructed it, thereby removing noise and clearly separating the faint AE signal [9]. Lilin et al. extracted eigenvalues of AE signals from the wavelet packet energy spectrum, which helps to improve the SNR of the signal [10]. However, these methods face difficulties in selecting the wavelet basis and determining the threshold. Empirical Mode Decomposition (EMD) has no fixed basis function, which avoids the difficulty of selecting a wavelet basis; it is more effective for denoising nonstationary signals than wavelet transform methods [11]. But EMD suffers from mode mixing, which can distort the decomposed Intrinsic Mode Functions (IMFs). To solve the mode mixing problem in EMD denoising, Wu and Huang proposed Ensemble Empirical Mode Decomposition (EEMD), which makes the denoising more thorough [12]. Only some of the IMFs obtained by EEMD are closely related to the original fault information. Properly selecting the sensitive IMFs that are closely related to the fault information is therefore significant for improving the accuracy of fault feature extraction. In [13][14][15], a sensitivity evaluation algorithm, the cross-correlation coefficient, and mutual information (MI) are used to select from all IMFs the sensitive IMFs that reflect the fault features. These methods help to remove illusive components, but misjudgment still occurs.
In view of the above, this paper proposes a solution based on Cloud Similarity Measurement (CSM). CSM, which derives from the cloud model, describes the difference between clouds [16]. In data mining, CSM applied to the similarity measurement of two time series overcomes the disadvantages of classical measures such as Euclidean distance, Dynamic Time Warping (DTW) distance, and pattern distance, and thus achieves better measurement accuracy [17]. CSM is also extensively used in e-business, biomedicine, and watermarking technology, but it has never been reported in the field of feature extraction of AE signals of rolling bearings. In this paper, CSM is adopted for the first time to select, from the IMFs obtained by EEMD, the sensitive IMFs that reflect the fault features. It helps to overcome the misjudgment of traditional methods and improves the accuracy of sensitive IMF selection. The Multivariate Multiscale Entropy (MME) of the selected sensitive IMFs is then calculated to obtain the eigenvalues. In this way, the fault characteristic information of the AE signal is extracted effectively and the accuracy of fault diagnosis is improved.
(2) Perform $N$ EMD trials after adding white noise, as follows:

AE Signal
(i) A random Gaussian white noise $n_i(t)$ is added to the input signal $x(t)$ to obtain the signal $x_i(t)$.
(ii) $x_i(t)$ is decomposed by EMD to obtain $c_{i,j}$, the $j$th IMF obtained in the $i$th decomposition ($j = 1, 2, \ldots, J_i$), where $J_i$ is the number of IMFs in the $i$th decomposition.
(iv) The minimum number of IMFs obtained over the $N$ decompositions is taken as the final number of IMFs in the overall average.
(3) Each IMF is averaged over the $N$ decompositions: $c_j = \frac{1}{N} \sum_{i=1}^{N} c_{i,j}$.
(4) $c_j$ is output as the $j$th IMF obtained by the EEMD decomposition. The added white noise $n_i(t)$ is generated randomly in each trial. The larger $N$ is, the closer the overall average of the added Gaussian white noise is to zero.
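The ensemble procedure described above can be sketched in Python as follows. This is only an illustration under stated assumptions: `emd_fn` is a hypothetical callable standing in for a real EMD routine, `toy_decomposer` is a simple moving-average split used solely to make the sketch runnable (it is not EMD), and the names `n_trials` and `noise_ratio` are introduced here for illustration.

```python
import random
from statistics import pstdev


def eemd(signal, emd_fn, n_trials=200, noise_ratio=0.01, seed=0):
    """Average the components returned by emd_fn over noisy copies of signal.

    emd_fn is a hypothetical callable: list[float] -> list of components,
    each component being a list with the same length as the input.
    """
    rng = random.Random(seed)
    noise_std = noise_ratio * pstdev(signal)  # noise level relative to signal std
    trials = []
    for _ in range(n_trials):
        # (i) add a fresh Gaussian white noise series to the input signal
        noisy = [x + rng.gauss(0.0, noise_std) for x in signal]
        # (ii) decompose the noisy copy
        trials.append(emd_fn(noisy))
    # (iv) keep only as many components as every trial produced
    n_components = min(len(components) for components in trials)
    # (3) average each component over all trials
    averaged = []
    for j in range(n_components):
        averaged.append([
            sum(components[j][t] for components in trials) / n_trials
            for t in range(len(signal))
        ])
    return averaged


def toy_decomposer(x):
    """Stand-in for EMD (NOT real EMD): residual plus 3-point moving average."""
    n = len(x)
    trend = [(x[max(i - 1, 0)] + x[i] + x[min(i + 1, n - 1)]) / 3.0
             for i in range(n)]
    detail = [a - b for a, b in zip(x, trend)]
    return [detail, trend]
```

Because the noise series are independent across trials, their contribution to the averaged components shrinks as the number of trials grows, which is the property the last step of the procedure relies on.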

Cloud Similarity Measurement.
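Cloud Similarity Measurement (CSM) combines the backward cloud generator algorithm with the included angle cosine of cloud eigenvectors. The inputs are the sample point $X_i = (x_1, x_2, \ldots, x_n)$ and the sample point $X_j = (y_1, y_2, \ldots, y_m)$, where $n$ and $m$ are the numbers of elements of $X_i$ and $X_j$, respectively. The steps are as follows [19].
(5) Calculate the cloud vector $\vec{V}_i = (Ex_i, En_i, He_i)$ of sample point $X_i$ and the cloud vector $\vec{V}_j = (Ex_j, En_j, He_j)$ of sample point $X_j$. The similarity of any two samples $X_i$ and $X_j$ may be expressed by the included angle cosine between $\vec{V}_i$ and $\vec{V}_j$, as follows:
$\cos(\vec{V}_i, \vec{V}_j) = \dfrac{\vec{V}_i \cdot \vec{V}_j}{\lVert \vec{V}_i \rVert \, \lVert \vec{V}_j \rVert}$
The similarity threshold is set to 0.95: IMF$_i$ is retained when $\cos(\vec{V}, \vec{V}_i) \ge 0.95$, and the other IMFs are removed.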
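A minimal Python sketch of this similarity measure is given below. The intermediate steps of the backward cloud generator are not legible in the text above, so the sketch assumes the widely used version without certainty degrees: the expectation is the sample mean, the entropy is $\sqrt{\pi/2}$ times the mean absolute deviation, and the hyper-entropy is the square root of the absolute difference between the sample variance and the squared entropy. The function names are introduced here for illustration only.

```python
import math


def backward_cloud(samples):
    """Estimate the cloud characteristics (Ex, En, He) of a sample sequence.

    Assumed estimator: backward cloud generator without certainty degrees.
    """
    n = len(samples)
    ex = sum(samples) / n                                  # expectation Ex
    mean_abs_dev = sum(abs(x - ex) for x in samples) / n
    en = math.sqrt(math.pi / 2.0) * mean_abs_dev           # entropy En
    variance = sum((x - ex) ** 2 for x in samples) / (n - 1)
    he = math.sqrt(abs(variance - en ** 2))                # hyper-entropy He
    return (ex, en, he)


def cloud_similarity(a, b):
    """Included angle cosine between the cloud vectors of two sequences."""
    va, vb = backward_cloud(a), backward_cloud(b)
    dot = sum(p * q for p, q in zip(va, vb))
    norm_a = math.sqrt(sum(p * p for p in va))
    norm_b = math.sqrt(sum(q * q for q in vb))
    # Undefined (division by zero) if either cloud vector is all zeros.
    return dot / (norm_a * norm_b)


def select_sensitive(signal, imfs, threshold=0.95):
    """Return the 1-based indices of the IMFs whose similarity reaches threshold."""
    return [k for k, imf in enumerate(imfs, start=1)
            if cloud_similarity(signal, imf) >= threshold]
```

The two sequences may have different lengths, since only their three-element cloud vectors are compared.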

The Comparison of EEMD-CSM Algorithm.
Mitraković et al. [20] and other scholars simulate the AE signal with damped exponential signals that have three different frequencies and attenuation coefficients. In the signal model, $A_i$, $\alpha_i$, $t_i$, and $f_i$ are the amplitude, attenuation coefficient, peak instant, and characteristic frequency of the $i$th harmonic component, respectively. The parameters of a typical AE signal take the following values: $A_i = 2$ ($i = 1, 2, 3$), $\alpha_1 = 6.24 \times 10^8$, $\alpha_2 = 1.56 \times 10^8$, $\alpha_3 = 2.79 \times 10^8$, $t_1 = 0.4$ ms, $t_2 = 0.6$ ms, $t_3 = 0.8$ ms, $f_1 = 70$ kHz, $f_2 = 60$ kHz, and $f_3 = 80$ kHz. In practice, the noise is mainly white noise. Therefore, with the sampling frequency set to 500 kHz, band-limited white noise is added to the simulated AE signal, as shown in Figure 1. The EEMD-CSM algorithm proposed in this paper is applied to the simulated AE signal to validate the effectiveness and accuracy of the sensitive IMFs selected by CSM.
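The simulated signal can be generated with a short Python sketch. The model equation itself is not legible in the text above, so the form used here is an assumption: each component is a cosine at its characteristic frequency with a Gaussian-shaped damping envelope centred on its peak instant, $A_i \exp[-\alpha_i (t - t_i)^2] \cos[2\pi f_i (t - t_i)]$. The signal duration and the noise level are also illustrative choices, and plain Gaussian white noise is used in place of band-limited noise.

```python
import math
import random

# (amplitude, attenuation coefficient, peak instant [s], frequency [Hz]),
# values taken from the parameter list in the text
COMPONENTS = [
    (2.0, 6.24e8, 0.4e-3, 70e3),
    (2.0, 1.56e8, 0.6e-3, 60e3),
    (2.0, 2.79e8, 0.8e-3, 80e3),
]
FS = 500e3  # sampling frequency [Hz]


def simulated_ae(duration=1.2e-3, noise_std=0.0, seed=0):
    """Return (times, samples) of the assumed three-component AE model.

    duration and noise_std are illustrative parameters, not from the source.
    """
    rng = random.Random(seed)
    n = int(round(duration * FS))
    times = [k / FS for k in range(n)]
    samples = []
    for t in times:
        clean = sum(
            a * math.exp(-alpha * (t - tp) ** 2)
            * math.cos(2.0 * math.pi * f * (t - tp))
            for a, alpha, tp, f in COMPONENTS
        )
        samples.append(clean + rng.gauss(0.0, noise_std))
    return times, samples
```

With these parameter values the envelopes decay within roughly 0.1 to 0.2 ms of each peak instant, so the three bursts are well separated in time.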
In the EEMD algorithm, the amplitude of the added noise is 0.01 times the standard deviation of the signal, and the number of ensemble averaging trials is 200. Decomposing the simulated AE signal by EEMD yields IMF1∼IMF4, as shown in Figure 2.
As can be seen in Figure 2, IMF1 and IMF2 of the EEMD decomposition are actual components of the original signal from which the noise interference has been eliminated, whereas IMF3 and IMF4 are meaningless illusive components that should be excluded from subsequent analysis. The mutual information and the cloud similarity between each IMF and the original signal are compared in Table 1. According to [21], the calculated mutual information (MI) threshold is 0.0357, so IMF1, IMF2, and IMF3 are considered actual components and only IMF4 is considered illusive. The mutual information method therefore misjudges IMF3 as an actual component. With the CSM threshold of 0.95, in contrast, IMF3 and IMF4 are identified as illusive, while the two actual components are retained. Selecting sensitive IMFs by CSM is therefore effective: it is more accurate than the mutual information method and overcomes the misjudgment.
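For comparison, the mutual information between an IMF and the original signal can be estimated as in the Python sketch below. The estimator used in [21] is not described here, so a plain equal-width histogram estimate is assumed; the function name and the number of bins are illustrative choices, and the resulting values depend on the binning.

```python
import math
from collections import Counter


def mutual_information(x, y, bins=16):
    """Histogram estimate of the mutual information (in nats) of two sequences.

    x and y must have the same length; bins is an illustrative choice.
    """
    def bin_index(values):
        lo, hi = min(values), max(values)
        width = (hi - lo) / bins or 1.0   # fall back to 1.0 for constant input
        return [min(int((v - lo) / width), bins - 1) for v in values]

    bx, by = bin_index(x), bin_index(y)
    n = len(x)
    joint = Counter(zip(bx, by))          # joint bin counts
    px, py = Counter(bx), Counter(by)     # marginal bin counts
    mi = 0.0
    for (i, j), count in joint.items():
        p_ij = count / n
        # p_ij / (p_i * p_j) written with counts: count * n / (px * py)
        mi += p_ij * math.log(count * n / (px[i] * py[j]))
    return mi
```

An IMF would then be kept when its estimated mutual information with the original signal exceeds the chosen threshold, which is the decision rule that the comparison in Table 1 evaluates against the cloud similarity rule.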

Feature Extraction Method Based on EEMD-CSM-MME.
In summary, the feature extraction method based on EEMD-CSM-MME consists of the following steps.
(3) Obtain the cloud vector $\vec{V}_i = (Ex_i, En_i, He_i)$ of each IMF$_i$ by using the backward cloud generator algorithm.
(4) Determine the similarity between the cloud vector $\vec{V}$ of the signal and the cloud vector $\vec{V}_i$ of IMF$_i$; with the similarity threshold set to 0.95, IMF$_i$ is retained as a sensitive IMF when $\cos(\vec{V}, \vec{V}_i) \ge 0.95$.
(5) Calculate the entropy of the selected sensitive IMFs by the MME algorithm and take the obtained multivariate sample entropy values as the eigenvalues. The parameters of the MME calculation [22] include the embedding vector $M = [m_1, m_2, \ldots, m_p]$, the time lag vector $\tau = [\tau_1, \tau_2, \ldots, \tau_p]$, and the tolerance level $r$.

Instrumentation.
The test instrument used in the experiment is a four-channel PCI-2 signal acquisition system produced by Physical Acoustics Corporation (PAC), USA. The acoustic sensor is an R15, whose response frequency is 60∼500 kHz and whose service temperature is −20∼80 °C. The sensor is fixed to the support, close to the bearing, by an M20 magnetic fixture. It is connected to the data acquisition system through a preamplifier (40 dB) with an output impedance of 50 Ω and a working frequency of 10 kHz∼2 MHz. The data acquisition system uses the AEwin3.5 software for data acquisition and analysis.

Feature Extraction of AE Signal.
AE signals of the bearing inner ring in three states (normal, fracture, and damage) are acquired on the fault experiment platform; their time-domain waveforms are shown in Figures 6-8, respectively. Taking the damage fault of the inner ring as an example, the process of feature extraction from the AE signal is illustrated as follows. First, the AE signal of the inner ring damage fault is decomposed by the EEMD algorithm, in which the amplitude of the added noise is 0.01 times the standard deviation of the signal and the number of ensemble averaging trials is 200. The decomposition result is shown in Figure 9. The cloud similarity values between the original signal and IMF1∼IMF7 are then calculated by CSM and are listed in Table 2.
The threshold of cloud similarity is set to 0.95 according to the algorithm proposed in this paper. As can be seen in Table 2, the cloud similarity values between the original signal and IMF1∼IMF3 are greater than 0.95, which indicates that IMF1∼IMF3 are closely related to the original signal. The cloud similarity values between the remaining components and the original signal are smaller than 0.95, which indicates that they are not closely related to the original signal; they are judged to be illusive and removed. To further examine the distribution of the cloud similarity values and the generality of the threshold, 20 groups of damage samples are decomposed by EEMD and the cloud similarity values between the original signal and IMF1∼IMF7 are calculated; the fitting curves of the cloud similarity values of all damage samples are shown in Figure 10.
In Figure 10, curves A∼D are the fitting curves of the cloud similarity values between the original signal and IMF1∼IMF4 of each sample, respectively, and curve E is the fitting curve of the cloud similarity values between the original signal and IMF5∼IMF7 of each sample. As can be seen in Figure 10, the cloud similarity values of curves A, B, and C fluctuate somewhat, but all of them are greater than 0.95; this shows that IMF1, IMF2, and IMF3 are sensitive IMFs that reflect the characteristic information of the original signal. The cloud similarity values of curves D and E are smaller than 0.95, which shows that IMF4∼IMF7 are not closely related to the original signal and need to be removed.
Similarly, 20 groups of samples are taken in each of the normal and fracture failure states, and the cloud similarity values are calculated for each state; the resulting fitting curves are shown in Figures 11 and 12. As shown in Figures 11 and 12, the cloud similarity values in these two states follow the same distribution as those of the damage samples: the values between the original signal and IMF1, IMF2, and IMF3 are greater than 0.95, and the others are smaller than 0.95. It is therefore suitable to select 0.95 as the threshold.
For the multivariate data composed of the sensitive IMFs, the MME algorithm is used to calculate the eigenvalues, with $m_k = 2$ and $\tau_k = 1$, that is, $M = [2, 2, 2]$, $\tau = [1, 1, 1]$, and $r = 0.15 \times$ (normalized standard deviation of the time series). Three groups of AE signal samples, taken separately under the three states of the bearing inner ring, give the MME curves shown in Figure 13. As shown in Figure 13, the multivariate sample entropy of the fracture signal is always the smallest over the 20 scales, which means that the complexity of the AE signal is lowest when a fracture fault occurs. The multivariate sample entropy values of the normal and damage states over the 20 scales are relatively larger, indicating higher complexity of the AE signal. The multivariate sample entropy values of the three states differ clearly, so the states can be distinguished easily.
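The scale dependence in a multiscale entropy analysis comes from coarse-graining the data before the entropy is computed. The Python sketch below shows only that first step, assuming the usual non-overlapping window average applied to each channel (here, each sensitive IMF); the multivariate sample entropy estimator itself is not reproduced, and the function name is illustrative.

```python
def coarse_grain(channels, scale):
    """Average consecutive non-overlapping windows of length scale per channel.

    channels is a list of equal-length sequences (one per sensitive IMF);
    trailing samples that do not fill a whole window are dropped.
    """
    n_windows = len(channels[0]) // scale
    return [
        [sum(ch[w * scale:(w + 1) * scale]) / scale for w in range(n_windows)]
        for ch in channels
    ]


# Coarse-grained series for the 20 scales considered in the text would be
# obtained as: [coarse_grain(channels, s) for s in range(1, 21)]
```

At scale 1 the data are unchanged, and at scale 20 each channel is 20 times shorter, so the record must be long enough for the entropy estimate at the largest scale to remain meaningful.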

Comparison of the Fault Diagnosis Results.
One-way analysis of variance shows that the multivariate sample entropies at scale factors 9 and 15 are the optimal ones; they are therefore taken as the eigenvalues and classified by the support vector machine (SVM) algorithm. There are 180 eigenvalue samples in total for the signals in the normal, fracture failure, and damage states, of which 120 are training samples and 60 are testing samples. The classification results fall into three categories: "1" represents the normal state, "0" represents the damage state, and "−1" represents the fracture failure state. With the penalty factor set to 150 and the kernel parameter set to 1, the fault diagnosis result is shown in Figure 14(a). For comparison, the MI algorithm is used to select the sensitive IMFs from all IMFs, the MME of these sensitive IMFs is calculated to obtain the eigenvalues, and the eigenvalues are classified by the SVM algorithm; the fault diagnosis result is shown in Figure 14(b).
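How the analysis of variance is used to pick the scale factors is not detailed, so the following Python sketch is only one plausible reading: for each scale, the one-way F statistic is computed across the three states, and the scales are ranked by it, on the assumption that a larger F means better separation. The function names and the input layout are illustrative.

```python
def one_way_f(groups):
    """One-way ANOVA F statistic for a list of groups of observations."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]
    # Between-group and within-group sums of squares
    ss_between = sum(len(g) * (m - grand_mean) ** 2
                     for g, m in zip(groups, means))
    ss_within = sum(sum((v - m) ** 2 for v in g)
                    for g, m in zip(groups, means))
    return (ss_between / (k - 1)) / (ss_within / (n_total - k))


def rank_scales(entropy_by_scale):
    """Sort scale factors by decreasing F statistic.

    entropy_by_scale maps a scale factor to a list of groups, one group of
    entropy values per bearing state (hypothetical input layout).
    """
    return sorted(entropy_by_scale,
                  key=lambda s: one_way_f(entropy_by_scale[s]),
                  reverse=True)
```

The top-ranked scales would then supply the two-dimensional eigenvalue passed to the classifier.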
As shown in Figure 14, the accuracy of the EEMD-CSM-MME-SVM algorithm is
To further illustrate the effect of the EEMD-CSM-MME-SVM algorithm, a certain number of testing samples are added to the original experiments. Both EEMD-CSM-MME-SVM and EEMD-MI-MME-SVM are then used to classify the faults, and the fault diagnosis results are listed in Table 3.
As shown in Table 3, the accuracy of fault diagnosis by the EEMD-CSM-MME-SVM algorithm can reach 99%, whereas the accuracy of the EEMD-MI-MME-SVM algorithm is only 78%. After comparison, it can be

Conclusion
In this paper, Cloud Similarity Measurement is used to select the sensitive IMFs. The simulation experiments show that this method is more accurate than the mutual information method and overcomes the misjudgment that the latter causes. AE signals in the normal, damage, and fracture failure states are collected on the experiment platform, and the sensitive IMFs are selected by Cloud Similarity Measurement. The fault eigenvalues are then calculated by Multivariate Multiscale Entropy and classified by the SVM. The experimental results show that the sensitive IMFs selected by CSM are effective and that the method improves the accuracy of feature extraction and fault diagnosis.
This study should be regarded as a first investigative step, since it concerns a single application of the method to a specific type of bearing and to unique specimens; its effectiveness therefore has to be confirmed by further investigations.

Table 2: Cloud similarity values between the original signal and the IMFs.

The rotating speed is 14000 r/min and the sampling rate is 1 MSPS during the experiments. A schematic diagram of the data acquisition process is shown in Figure 5.

Figure 5: Schematic of the data acquisition system.

Figure 10: The fitting curves of cloud similarity values of 20 groups of damage samples.

Figure 12: The fitting curves of cloud similarity values of 20 groups of fracture samples.

Figure 14: Fault diagnosis results of the two methods.

Table 1: Comparison between the mutual information and cloud similarity of each IMF and the original signal.

Table 3: Comparison of fault diagnosis results.
concluded that this EEMD-CSM-MME method of feature extraction is better, and it can improve the accuracy of fault diagnosis of rolling bearings.

Table 4: Notation list of the mathematical symbols used in the analysis.