By focusing on the issue of rolling element bearing (REB) performance degradation assessment (PDA), a solution based on variational mode decomposition (VMD) and Gath-Geva clustering time series segmentation (GGCTSS) has been proposed. VMD is a new decomposition method. Since it is different from the recursive decomposition method, for example, empirical mode decomposition (EMD), local mean decomposition (LMD), and local characteristic-scale decomposition (LCD), VMD needs a priori parameters. In this paper, we will propose a method to optimize the parameters in VMD, namely, the number of decomposition modes and moderate bandwidth constraint, based on genetic algorithm. Executing VMD with the acquired parameters, the BLIMFs are obtained. By taking the envelope of the BLIMFs, the sensitive BLIMFs are selected. And then we take the amplitude of the defect frequency (ADF) as a degradative feature. To get the performance degradation assessment, we are going to use the method called Gath-Geva clustering time series segmentation. Afterwards, the method is carried out by two pieces of run-to-failure data. The results indicate that the extracted feature could depict the process of degradation precisely.
National Natural Science Foundation of China515415061. Introduction
Rolling element bearings (REBs) are critical components of “rotor-bearing” system in rotating machinery. Since the cruel working condition, the REBs are vulnerable. So it is important to monitor their condition to avoid catastrophic accident in modern industry. There are many ways to monitor bearings, such as vibration [1], acoustic emission [2], oil-debris [3], and ultrasound [4]. Among them, the method based on vibration signal analysis is extensively used for a bearing.
Recently, the prediction of the residual (or remaining) useful life (RUL) is a hotspot issue. To get a better bearing fault prediction result, the so-called performance degradation assessment (PDA) is a premise. The PDA contains two important aspects. One is to extract proper features that can reflect the process of degradation. The other one is to use a method to assess the REB’s performance. Feature extraction is the fundamental of PDA. The feature must reveal the real performance of the REB and be sensitive to the degradation. The types of features are usually classified into three categories, time domain features, frequency domain features, and time-frequency domain features. The time-frequency domain features are always based on time-frequency analysis, combined with the concept of spectrum, entropy, and complexity, for example, the Rényi entropy [5], the permutation entropy [6], and the general mathematical morphology particle [7]. In general, mechanical equipment undergoes a complete degradation from normal stage to failure. With the increasing of the running time and deepening of degradation, the amplitude of defect frequency (ADF) is raising therewith. So the ADF can reflect the degradation of REB directly.
To get the ADF, the first thing is to locate the defect frequency. In extraction of ADF, the common method is to use EMD and select appropriate intrinsic mode function (IMF) components, taking the envelope spectrum, and finally get the ADF [8]. However, EMD remains an exclusively empirical algorithm, and it lacks a solid mathematical foundation. Despite numerous attempts to improve its performance, EMD is still of low efficiency and has endpoint effect and mode mixing problems. In 2014, Dragomiretskiy and Zosso [9] proposed a method called variational mode decomposition (VMD) as an alternative to EMD, which can adaptively decompose a multicomponent signal into a number of quasiorthogonal IMFs. It has been verified that VMD outperforms EMD with regard to tone detection and separation as well as noise robustness [10]. It has a good performance and high operation efficiency. Particularly, it has solid foundation for a mathematical theory. Now the method has been applied to the abnormal ECG signal detection [11], the stock market forecasting [12], wind power prediction [13], power quality classification [14], and so on.
However, the parameters of VMD are selected by experience. Jun Zhu presented a method to optimize the parameters based on the kurtosis index through artificial fish swarm algorithm [15]. Tang and Wang used Shannon entropy as an index to optimize parameters by panicle swarm algorithm [16]. In this paper, we proposed a method to optimize the parameters based on BLIMFs themselves using genetic algorithm. As to PDA, Pan et al. [17] proposed a bearing degradation assessment method based on lifting wavelet packet decomposition and fuzzy C-means clustering. Wang et al. [18] proposed a method for PDA based on mathematical morphology fractal dimension and fuzzy C-means clustering. However, when using clustering, they only considered the distance factor, and no time parameter was recommended. The clusters should be contiguous in time. From this point of view, after extraction of ADF from the raw signals, the assessment of bearing performance is carried out by GGCTSS.
The remainder of this paper is organized as follows. In Section 1, the theory of VMD is briefly introduced. The optimization of the parameters is proposed in Section 2. In Section 3, by carrying out two run-to-failure data, we have extracted the ADF and used GGCTSS for PDA. Our conclusions are presented in Section 4.
2. VMD and Its Parameters2.1. Variational Mode Decomposition
VMD can nonrecursively decompose a real-valued multicomponent signal into a discrete number of quasiorthogonal band-limited subsignals uk with specific sparsity properties in the spectral domain. Each mode is compacted around a center pulsation ωk. For convenience, let us call these modes obtained by VMD as band-limited IMFs (BLIMFs). The VMD technique is essentially written as a constrained variational problem in [9](1)min∑k=1K∂tσt+jπt∗ukte-jwkt22Subject to∑k=1Kuk=f,where uk=u1,…,uk and ωk=ω1,…,ωk are the decomposed BLIMFs for the set of all K modes and their estimated center frequencies, respectively. The VMD process can be briefly described as follows; for further details, refer to [9].
2.2. Parameters Involved in VMD
There are six parameters for VMD, where K represents the numbers of decomposition modes and α represents moderate bandwidth constraint; they are two parameters that significant influence on the decomposition results. In general, the parameter τ is set up to zero, which means that the Lagrangian multiplier is effectively shut off. The parameter init is set to be one, which suggests that center frequencies of all the modes are initialized in the uniform distribution. The parameter DC is set to zero, for no DC part imposed. And the parameter tol for tolerance is set as default value 1e-6.
2.3. The Optimization of VMD Parameters
Figure 1 shows the impact of the parameter α. It can be seen that the parameter α seems to have an inverse ratio of the decomposition modes’ bandwidth. In Figure 1(a), there is no high frequency mode. It seems to be a better one with α=200 in Figure 1(b). Because the parameter init is set to be one, the bandwidth of each mode is almost the same. To optimize the parameter K and α, A criterion must be raised. We have proposed a criterion η in formula (2).(2)η=∑iKl-l0fmax,where l0 is the supposed frequency band of each mode and l is the actual frequency band of each mode. For example, in Figure 1(a), l0 is 0–2000 Hz, 2000–4000 Hz, 4000–6000 Hz, 6000–8000 Hz, and 8000–10000 Hz, and the actual frequency bands of BLIMFs are 0–1396 Hz, 1426–2549 Hz, 2900–4014 Hz, 3686–4620 Hz, and 3926–5376 Hz with the y-axis threshold 0.001. η is calculated to 0.4021, since K must be set as integer, but not for parameter α. We can find the optimized α using genetic algorithm of each mode. The settings of genetic algorithm are using default in MATLAB and finally get the best K and α. The minimum of K is 2. EMD can be used to find the maximum value of K. There are 10 IMFs of the signal decomposed by EMD, so the maximum of K is set up to 10.
The impact of the parameter α.
α=2000
α=200
3. The Experimental Verification3.1. Experiment Illustration
The run-to-failure REB’ test data are derived from IMS center [19]. The bearing test rig hosted four test bearings on a shaft. The shaft was driven by an AC motor. The rotation speed was kept constant at 2000 rpm. A radial load of 6000 lbs was added to the shaft and bearing by a spring mechanism. All the bearings are force oil-lubricated. Four Rexnord ZA-2115 double row bearings were installed on the shaft as shown in Figure 2. The bearings had 16 rollers in each row, a pitch diameter of 2.815 inch, roller diameter of 0.331 inch, and a tapered contact angle of 15.17°. A high sensitivity accelerometer was installed on each bearing’s housing. The data sampling rate was 20 kHz and the data length was 20480 points. The sampling interval was ten minutes. We use the Bearing1_1’s data of set number 2 which exhibits outer race defect as shown in Figure 3. The number of the data files is 982 (the original files number is 984, for the last 2 files it is vague and distinctive).
Bearing test rig.
Outer race defect of set number 2.
For the theoretical value of outer race defect frequency, we can use the formula as follows and then have the theoretical frequency of 236.4 Hz:(3)fop=Z21-dDcosθfr.
From Figure 4, we could not find the defect frequency of #600 (where # means the number of files). The range of α is set from 1000 to 10000. Now, by using genetic algorithm, the optimized parameter α of K mode decomposition is presented in Table 1.
The optimized parameter α of each K mode decomposition.
K
The optimized parameter α
Criterion η
2
1382
0.9502
3
1044
1.1704
4
9534
0.9302
5
9651
1.2129
6
9736
1.5024
7
1423
1.5122
8
2321
1.6714
9
1537
2.2026
10
9286
1.5204
The vibration wave and frequency spectrum of #600.
From Table 1, the optimized parameter K is for 4 and α for 9534. The decomposition result by optimized parameters is shown in Figure 5.
The BLIMFs and frequency spectrum of #600.
As shown in the figure, the signal is decomposed into 4 BLIMFs. The spectrum is in accordance with the order from low to high; that is, VMD has the role of band pass filtering. The BLIMFs do not show the phenomenon of mode mixing. The contrast group EMD, as shown in Figure 6, is decomposed for 11 IMFs. As we can see, EMD has a characteristic that, with the order from low to high, IMF is narrowed approximate dichotomy, and each IMF has low frequency components, and LCD and LMD have similar characteristics. At the same time, the computing time of VMD is far less than EMD, because there is no recursive iteration process of VMD.
The IMFs and frequency spectrum of #600.
The signal of #600 is decomposed into 4 BLIMFs by optimized VMD and the envelope spectrum is shown in Figure 7.
The envelope spectrum of BLIMF1 to BLIMF4 of #600.
BLIMF1’s envelope
BLIMF2’s envelope
BLIMF3’s envelope
BLIMF4’s envelope
It can be seen that the BLIMF2 and BLIMF3 have the peak of 230.7 Hz, which is close to the theoretical value 236.4 Hz. So the BLIMF2 and BLIMF3 are the right mode which have the defect frequency. By extracting the amplitude of all BLIMF2 and BLIMF3’s defect frequency, an ADF of all REB’s data is shown in Figure 8.
The ADF of the data.
It can be seen from Figure 8 that, before #520, the amplitude is close to zero. That indicates that there is no obvious defect before #520. From #520 to #700, the ADF increases in a linear way. Guess that spalling is formed and become larger, and the REB is in the slight fault stage. About #700, there is a sudden change, and we speculate that it is caused by a bulge or a crack. From #700 to #850, the amplitude decreases and rises again. The fluctuating trend can be explained by the nature of the propagating process of the damage, when the spalling formed and later smoothed by the continuous rolling contact. As the damage spread over a broader area, the vibration level raises again. This is called “healing” phenomenon and has been stated in [1, 20, 21]. Form #850 to the end #982, the condition of REB is becoming fierce. The “healing” phenomenon expands, and the variances enlarge. Overall, the ADF is increasing. If the ADF is segmented in 4 parts, it is rational for #1 to #520 for normal stage, #521 to #699 for slight fault stage, #700 to #850 for serious stage, and #851 to end for fault stage.
3.2. The PDA Using GGCTSS
As mentioned before, fuzzy C-means clustering has its drawback for PDA. As shown in Figure 9. The four clustering results of the ADF are conducted by fuzzy C-means (the data have been normalized).
The clustering result of the ADF proposed by fuzzy C-means.
According to the clustering order from left to right, the first two clustering centers are in the normal state of the REB. The third center is in the slight fault state. The fourth one is almost close to the boundary of severely fault and failure. The result is unacceptable. References [17, 18] both used fuzzy C-means to realize PDA, Where owing to the degradation features which they have proposed is roughly monotonic, the cluster results of fuzzy C-means are basically distributed in time growth order. If the degradation features are inconspicuous or existing oscillation, we could not get a good clustering result. Therefore, it is necessary to import the concept of time series segmentation.
A sequence of N times observed data, T=x1,x2,…,xN, is called time series. Time series segmentation is to find a partitioning of T into c segments that are internally homogeneous. After a segmentation, T=x1,x2,…,xN can be partitioned into SN1,SN2,…,SNc, where SN1=S1,t1, SNn=Stn-1+1,tn, and 0<t1<⋯<tn-1<⋯<tc=N. The times t1,…,tn-1,…,tc are called segment boundaries or breaks, and the number c is called the order of the segmentation. This segmentation is a crisp one. A time series segmentation problem can convert into an optimization. Himberg et al. [22] have defined the cost function. Under the definition, Kehagias et al. [23] proposed a dynamic programming (DP) segmentation procedure and Gedikli et al. [24] improved it with modified dynamic programming (mDP). Both methods are based on minimization of the summation of cost functions followed by distances between actual values and constant or linear fittings.
Expanding and improving the crisp bounds of the segmentation, Abonyi et al. [25] developed an algorithm for dividing time series into fuzzy segments, which considered time series segmentation as Gath-Geva clustering with time parameter as an additional variable. The GGCTSS used local probabilistic principal component analysis (PPCA) models to measure the homogeneity of the segments and fuzzy sets to represent segments in time. The function to be optimized can be written as(4)J=∑i=1c∑k=1Nμi,kmD2zk,ηi,where the data point zk=tk,xkTT can be effectively modeled as a mixture of multivariate Gaussian distribution and μi,k stands for the ith segment membership of zk. The term m is weighing exponent of fuzzy degree, m∈1,∞. For general, m=2. The procedure of the GGCTSS can be written as follows:
(1) Initial the order segmentation c, iteration number s1, and the threshold for compatible cluster merging (0.75 as default).
(2) Input the number of principal components q and make sure that the sum of the anterior q eigenvalues account for more than 98% accuracy of the sum of all eigenvalues.
(3) Finish the iteration and give the results of segmentation.
The results include the fuzzy segmentation of a time series, βkti, which is defined as(5)βkti=Akti∑k=1KAkti∈0,1,where Akti is Gaussian membership function given by (6)Akti=exp-12ti-μk2σk2.See [22] for more details of the method.
The mDP algorithm is used to treat the ADF for the fourth-order segmentations, and the results are shown in Figure 10.
The optimal fourth-order segmentations of ADF by mDP algorithm. (a) Constant fitting and (b) linear fitting.
It can be seen that the segmentations by mDP algorithm are not good. So, we use GGCTSS for PDA. The process of life was divided into 4 segments. Set the iteration to be 100 times. It is about 70 iterations that the result is basically unchanged. Figure 11 shows the result.
The result of the ADF based on GGCTSS.
It can be seen that the four segmentations are partitioned according to the order of time. The normal stage is from #0 to #522, the slight stage is from #522 to #700, the severe stage is from #700 to #900, and the failure stage is from #900 to #982. It is possible to show that GGCTSS can be a good one for PDA.
3.3. Another Experiment Verification
IEEE PHM 2012 Prognostic Challenge data is another REBs’ data. Figure 12 shows the experiment rig, the PRONOSTIA platform, where the data included three different loads for 1800 rpm and 4000 N, 1650 rpm and 4200 N, and 1500 rpm and 5000 N. The sampling frequency was 25.6 kHz, and 2560 samples were recorded each 10 seconds. There were 6 pieces of run-to-failure data which were the learning sets, and 11 test bearings were truncated so that participants were supposed to predict the remaining life (see Table 2) and thereby preformed RUL estimating. The learning set was quite small while the spread of the life duration of all bearings was very wide. Performing good estimates was thereby difficult and this made the challenge more exciting. More details can be seen in [26].
Datasets of PHM 2012 Prognostic Challenge.
Datasets
Operating conditions
Conditions 1(1800 rpm and 4000 N)
Conditions 1(1650 rpm and 4200 N)
Conditions 1(1500 rpm and 5000 N)
Learning set
Bearing1_1
Bearing2_1
Bearing3_1
Bearing1_2
Bearing2_2
Bearing3_2
Test set
Bearing1_3
Bearing2_3
Bearing3_3
Bearing1_4
Bearing2_4
Bearing1_5
Bearing2_5
Bearing1_6
Bearing2_6
Bearing1_7
Bearing2_7
Overview of PRONOSTIA.
The data is difficult to analysis because we know nothing about the nature and the origin of the degradation: inner or outer races. We are going to use Bearing1_1’s data and the characteristics of the tested bearing are shown in Table 3. And the characteristics’ frequency of REB is 168.3 Hz for outer race, 221.7 Hz for inner race, 107.7 Hz for ball race, and 12.9 Hz for cage.
Characteristics of tested bearings.
Z (ball numbers of the REB)
13
d (diameter of the REB)
3.5 mm
Dm (bearing mean diameter)
25.6 mm
r (rotation rate)
1800 r/min
The numbers of Bearing1_1 are 2803, thus taking #2600 for analysis, since the REB is almost run to failure. Figure 13 shows the signal.
The signal of #2600.
Now, by using genetic algorithm, we have the optimized K and α, and K is for 3 and α for 4032. The signal is decomposed into 3 BLIMFs and the envelope spectrum is shown in Figure 14. It is obvious that BLIMF1 and BLIMF2 have the frequency of inner defect. So Bearing1_1 gets inner defect finally.
The envelope spectrum of BLIMF1 to BLIMF3 of #2600.
BLIMF1’s envelope
BLIMF2’s envelope
BLIMF3’s envelope
By extracting the amplitude of all BLIMF1 and BLIMF2’ defect frequency, an ADF of all Bearing1_1’s data is shown in Figure 15.
The ADF of the Bearing1_1.
From Figure 15, there are no evident boundaries of each degradation stage. At the end of the bearing’s life, there is a sudden increase. If we set the segment number as 3, the result is shown in Figure 16. The normal stage is from #0 to #1432, slight stage is from #1433 to #2264, and severe stage is from #2265 to #2803.
The result of Bearing1_1’s ADF based on GGCTSS.
4. Conclusions
This paper has proposed a method to assess the performance degradation of REB using VMD and GGCTSS. We have given a novel method to optimize the VMD parameter. By using GGCTSS, we can effectively segment the degradation process. From the above analysis, we can conclude that
the ADF extracted by VMD can commendably reflect the degradation development of REBs;
compared with crisp time series segmentation, the GGCTSS is more suitable for PDA of REBs.
However, there are also some problems that should be focused on in the future study, for example, to optimize the selection of GGCTSS segment number.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Acknowledgments
This project is supported by National Natural Science Foundation of China (Grant no. 51541506). The authors also appreciate the Centre for Intelligent Maintenance System and FEMTO-ST Institute for providing the experimental data.
El-ThaljiI.JantunenE.A summary of fault modelling and predictive health monitoring of rolling element bearings201560-6125227210.1016/j.ymssp.2015.02.0082-s2.0-84925964775CaesarendraW.KosasihB.TieuA. K.ZhuH.MoodieC. A. S.ZhuQ.Acoustic emission-based condition monitoring methods: review and application for low speed slew bearing201672-7313415910.1016/j.ymssp.2015.10.0202-s2.0-84955654386IwaiY.HondaT.MiyajimaT.YoshinagaS.HigashiM.FuwaY.Quantitative estimation of wear amounts by real time measurement of wear debris in lubricating oil2010431-23883942-s2.0-7154912294310.1016/j.triboint.2009.06.019TaylorS.WigginsK.Diagnosis of bearing defects using a heterodyning ultrasound detector201515447047310.1007/s11668-015-9975-y2-s2.0-84937978046BoškoskiP.GašperinM.PetelinD.JuričićĐ.Bearing fault prognostics using Rényi entropy based features and Gaussian process models201552-5332733710.1016/j.ymssp.2014.07.011YanR. Q.LiuY. B.GaoR. X.Permutation entropy: a nonlinear statistical measure for status characterization of rotary machines201229547448410.1016/j.ymssp.2011.11.0222-s2.0-84859427324LiH.WangY.WangB.SunJ.LiY.The application of a general mathematical morphological particle as a novel indicator for the performance degradation assessment of a bearing201482149050210.1016/j.ymssp.2016.05.0382-s2.0-84971597325PanM.TsaoW.Using appropriate IMFs for envelope analysis in multiple fault diagnosis of ball bearings2013691141242-s2.0-8487583344910.1016/j.ijmecsci.2013.01.035DragomiretskiyK.ZossoD.Variational mode decomposition201462353154410.1109/TSP.2013.2288675MR31602932-s2.0-84893418756WangY.MarkertR.Filter bank property of variational mode decomposition and its applications201612050952110.1016/j.sigpro.2015.09.0412-s2.0-84946569715TripathyR. K.SharmaL. N.DandapatS.Detection of shockable ventricular arrhythmia using variational mode decomposition2016404, article no. 7911310.1007/s10916-016-0441-52-s2.0-84955119590LahmiriS.Intraday stock price forecasting based on variational mode decomposition20161223272-s2.0-8495415693510.1016/j.jocs.2015.11.011ZhangY.LiuK.QinL.AnX.Deterministic and probabilistic interval prediction for short-term wind power generation based on variational mode decomposition and machine learning methods20161122082192-s2.0-8495525975410.1016/j.enconman.2016.01.023AbdoosA. A.MianaeiP. K.GhadikolaeiM. R.Combined VMD-SVM based feature selection method for classification of power quality events20163863764610.1016/j.asoc.2015.10.0382-s2.0-84946434294ZhuJ.WangC.HuZ.KongF.LiuX.Adaptive variational mode decomposition based on artificial fish swarm algorithm for fault diagnosis of rolling bearings201723146356542-s2.0-8501158640110.1177/0954406215623311TangG.WangX.Parameter optimized variational mode decomposition method with application to incipient fault diagnosis of rolling bearing2015495738110.7652/xjtuxb2015050122-s2.0-84941755036PanY.ChenJ.LiX.Bearing performance degradation assessment based on lifting wavelet packet decomposition and fuzzy c-means20102425595662-s2.0-7035076482410.1016/j.ymssp.2009.07.012WangB.LiH.-R.ChenQ.-H.XuB.-H.Rolling bearing performance degradative state recognition based on mathematical morphological fractal dimension and fuzzy center means201536101982199010.3969/j.issn.1000-1093.2015.10.0222-s2.0-84954520816QiuH.LeeJ.LinJ.YuG.Wavelet filter-based weak signature detection method and its application on rolling element bearing prognostics20062894-51066109010.1016/j.jsv.2005.03.0072-s2.0-33644547646WilliamsT.RibadeneiraX.BillingtonS.KurfessT.Rolling element bearing diagnostics in run-to-failure lifetime testing20011559799932-s2.0-003545592810.1006/mssp.2001.1418El-ThaljiI.JantunenE.A descriptive model of wear evolution in rolling bearings2014452042242-s2.0-8490487198110.1016/j.engfailanal.2014.06.004HimbergJ.KorpiahoK.MannilaH.TikanmäkiJ.ToivonenH. T. T.Time series segmentation for context recognition in mobile devicesProceedings of the 1st IEEE International Conference on Data Mining (ICDM '01)December 20012032102-s2.0-33745467111KehagiasA.NidelkouE.PetridisV.A dynamic programming segmentation procedure for hydrological and environmental time series2006201-2779410.1007/s00477-005-0013-6MR2227458Zbl1096.620842-s2.0-30344487253GedikliA.AksoyH.UnalN. E.KehagiasA.Modified dynamic programming approach for offline segmentation of long hydrometeorological time series20102455475572-s2.0-7795505270310.1007/s00477-009-0335-xAbonyiJ.FeilB.NemethS.ArvaP.Modified Gath-GEVa clustering for fuzzy segmentation of multivariate time-series20051491395610.1016/j.fss.2004.07.008MR2113205Zbl1071.685432-s2.0-9644270328NectouxP.GouriveauR.MedjaherK.An experimental platform for bearings accelerated life testProceedings of the IEEE International Conference on Prognostics and Health Management2012Denver, CO, USA