DCA-Based Real-Time Residual Useful Life Prediction for Critical Faulty Component Funa

Residual useful life (RUL) prediction is significant for condition-based maintenance. Traditional data-driven RUL prediction method can only predict fault trend of the system rather than RUL of a specific system component. Thus it cannot tell the operator which component should bemaintained.The innovation of this paper is as follows: (1)Wavelet filtering basedmethod is developed for early detection of slowly varying fault. (2) Designated component analysis is introduced as a feature extraction tool to define the fault precursor of a specific component. (3) Exponential life predictionmodel is established by nonlinear fitting of the historical RUL and the fault size characterized by the statistics used. Once online detection statistics is obtained, real-time RUL of the critical component can be predicted online. Simulation shows the effectiveness of this algorithm.


Introduction
With the rapid development of modern industrial technology, reliability, maintainability, and security of largescale system have widely received attention [1][2][3][4].Abnormal detection, root cause identification, and RUL prediction are the stages for efficient system monitoring.
Abnormality is expected to be detected as quickly as possible to prevent major accidents and reduce loss of downtime and maintenance.In the past decades, multivariate statistical analysis techniques such as principal component analysis (PCA) and partial least squares (PLS) [5][6][7] have been widely used in complex system monitoring [8].However, abnormal detection and fault diagnosis usually answer the question of "having trouble," not "how long it will happen." Detection and maintenance after system failure are usually too late for system with higher security requirement and expensive downtime loss, such as power station, power transmission, petrochemical industry, large-scale iron, steel enterprise, and other industry fields [9].
If condition-based maintenance rather than breakdown maintenance is used, severe faults leading to shutdown can be avoided, and a lot of maintenance cost can be saved [10].Fault prediction technology is a critical step of condition-based maintenance.In recent years, fault prediction has become a hot topic in the field of system monitoring [11].Artificial neural network (ANN) [12][13][14][15][16][17], autoregressive model (AR) [18,19], support vector machine (SVM) [20], vector autoregressive model (VAR), and so forth [21] are commonly used fault prediction methods.But these pure data-driven methods without reference to expert experience can only predict fault trend rather than directly predict RUL online.
An important stage of RUL prediction strategy is how to properly assess fault size that reflects the degradation process.If a damage precursor is available, damage precursor based method for RUL prediction is an objective choice [22].So fault damage precursor as well as the control limits should be determined before online RUL prediction.Existing method for establishing the fault damage precursor can be categorized into 2 classes: data-driven method and model-based method.Accurate fault propagation model to establish fault damage precursor is usually unavailable.However, thanks to the rapid development of sensor technology and condition monitoring technology, large amount of observation data reflecting status of the system can be used to establish damage precursor by using data-driven method [9,13].Therefore, data-based method is gradually attracting the attention of researchers.Li et al. use historical normal data to establish PCA model, and 1norm of the residual vector is used to define the fault damage precursor [23,24].RUL can be estimated by multistep recursive prediction using AR model with forgetting factor [25,26].Ma et al. [27] use square prediction error based fault reconstruction method to estimate fault size in the first step.Then RUL can be estimated by multilevel recursive prediction.Li et al. [24] propose a fault prediction method for industry process with performance degradation by using multiscale PCA.Fault size computed by fault reconstruction can be used to establish a prediction model based on exponential smoothing technique.These methods share the same deficiency that fault direction is computed by PCA related method [27].But pattern compounding problem of PCA makes it impossible to establish 1-1 correspondence between the failure component and the reconstructed fault direction [28].Thus PCA based RUL prediction cannot tell the operator which critical component should be maintained.
Designated component analysis (DCA) is a knowledgeguided data-driven feature extraction method which can be used to diagnose root cause of the abnormality [29].In this paper, DCA is introduced as a feature extraction tool for establishing fault damage precursor and life prediction model.Since small fault is usually buried in noise, pretreatment of filtering is developed for early detection of slowly varying small fault.
Remark 1.It is assumed that failure of critical component defined by expert has disastrous impact on the system.So RUL prediction of critical component is much significant in the sense that unnecessary surplus maintenance can be reduced to save maintenance cost.
The remainder of this paper is organized as follows: Section 2 describes principal component analysis and designated component analysis.DCA-based real-time prediction for critical component is developed in Section 3. Simulation analysis in Section 4 shows the efficiency of the method proposed.Conclusions and further research are given in the last section of the paper.

Review of PCA and DCA
2.1.PCA Modeling.Assume  0 ∈  × is the observation data matrix collected in the normal operation conditions, where  is the number of observation variables and  is the number of samples.Firstly,  0 is standardized via where  = [ 0 (1)  0 (2) ⋅ ⋅ ⋅  0 ()] is the mean of  0 ,  0 () is the variable of  0 ,  = [ 1  2 ⋅ ⋅ ⋅   ] is the standard deviation of  0 . 0 can be decomposed into two parts as follows [11]: where   ∈  ×1 is the score vector,   ∈  ×1 is the loading vector, V (V < ) is the number of significant principal components, and  = ∑  =V+1      is the residual error.
After establishing PCA modeling under normal operating conditions, multivariate statistics called squared prediction error (SPE), can be used for fault detection and diagnosis.SPE is described as follows: where   ( = 1, 2, . . ., ) represents a sample of online observation data, and  is a matrix composed of the first V loading vectors.SPE control limit can be determined via where   is the eigenvalues of the covariance matrices and   is the threshold of hypothesis testing to normal distribution with confidence level .
The system is considered to be abnormal if 2.2.DCA.DCA is a knowledge-guided multivariate statistical feature extraction method [29].The basic idea is to define a designated pattern via the fault-symptom relation.First, different from the loading vector   ∈  ×1 of PCA, the definition of designated pattern   is defined by a knowledge-guided means via where   ( = 1, 2, . . ., ) can be taken as 0 or 1 according to the fault-symptom relation, and   = 1 means that the symptom  is shown in the th designated pattern; otherwise   = 0. Table 1 lists the typical fault-symptom relation of an air compressor.There are 17 typical faults and 19 observation parameters included in the table.Insufficient supply of lubricating oil or oil blockage, lubricant contamination, motor drive failure, and so on are the typical faults listed in Table 1.First-stage exhaust temperature, intercooler exhaust temperature, two-stage exhaust temperature, oil cooling outlet temperature and motor current, and so forth are the observed parameters [29].
Then, project the observation variable  to designated patterns   to obtain the designated component   : Similarly to (2), the observation matrix  0 can be decomposed as where  is the number of designated patterns,   = [  (1),   (2), . . .,   ()] ∈  1× , and  is the residual matrix.Finally, Shewhart chart of each designated component is used to implement fault diagnosis.

DCA-Based Real-Time Prediction for Critical Component
DCA-based RUL prediction method needs to determine critical component by expert experience.In this paper, the component corresponding to designated pattern  1 is assumed to be a critical component.

Knowledge-Guided Data-Driven Feature Extraction.
Project historical normal observation data  0 to the designated pattern   to obtain the designated component vector  0 : The designated component  0 is the feature extracted from  0 .In general, statistical distribution of  0 is normal distribution as long as the observation of each observation variable is normal distribution.

Data-Driven Fault Damage
Precursor.In the case when there is no fault damage precursor determined by expert or by physical fault propagation model, it is necessary to establish a data-driven fault damage precursor for describing the fault evolution process.For this purpose, the failure control limit  to determine the failure time and the fault trend control limit  1 for early detection of slowly varying fault should first be determined.Specific steps are as follows.
Step 1 (failure control limit and fault trend control limit).As it is analyzed in Section 3.1.1, 0 has the normal distribution.By 3 criteria of hypothesis testing to normal distribution, the critical designated component  01 falls into interval [−3, 3] with confidence level 99.7% [30,31]: So the failure control limit  can be determined via where  is the standard deviation of  01 .
For the sake of determining fault trend control limit  1 , wavelet filtering is firstly introduced as a preprocessing tool to historical normal observation data, and then DCA is used to perform feature extraction to the filtered normal observation.The filtering process can be described as follows [32,33].
First, discrete wavelet transform (DWT) for the observation vector of the th observation variable  ()  0 can be depicted via where  = 0, 1, 2, . . .,  − 1;  is the number of discrete sampling points;  () () is the scaling coefficient for signal on the scale ;  () () is the wavelet coefficient for signal on the scale ; ℎ is a low-pass filter,  is a high-pass filter, and  is the length of the filter.
After DWT, the next step is filtering.How to select the threshold is the key problem because it will directly affect the result of wavelet filter.In this paper, we choose the soft threshold method shown in where  is the filtering threshold.Finally, inverse discrete wavelet transform (IDWT) is implemented to obtain the filtered observation: Based on the filtered normal observation, fault trend control limit  1 can be determined via (19): where  1 is the standard deviation of  01 .
Step 2 (fault trend detection point and the failure point).Project historical faulty observation data  1 to the designated pattern  1 to get the designated component  1 : The extracted feature  1 can characterize the fault evolution process of the critical component.The failure point   can be calculated as follows: Similarly, the filtered designated component can be computed via (22).
Once  1 and fault trend control limit  1 have been obtained, the definition of fault trend detection point   can be expressed as Step 3 (fault damage precursor).Smoothing technique descripted in ( 24) is required to get a more smooth fault damage precursor: where  is the size of the smoothing window;  1 () ( = 1, 2, . . ., ) is the element of vector  1 .The fault evolution process in the period from   to   can be defined as fault damage precursor, denoted by :  ( −   + 1) =  1 () , ( =   ,   + 1, . . .,   ) .(25)
Once the fault precursor defined in ( 26) is established, the algorithm of establishing RUL prediction model can be divided into three steps.
Step 1. Computerize the deviation between fault damage precursor and the control limit denoted by : Step 2. Once the deviation  and the historical RUL have been obtained, historical data can be determined as follows: Step The smoothed designated component is denoted by   .Shewhart charts based method is used in this paper to implement fault diagnosis.

Online Life Prediction.
A key step of online RUL prediction for the critical component is to judge whether () is positive or negative.If () is positive, it means that the system is abnormal.Real-time RUL prediction results can be obtained via exponential life prediction model:

Simulation
where  = 10 is the number of designated patterns,  1 ,  7 represent fault patterns, and the rest are normal random disturbance patterns.  is the sample vector of designated component for simulation.In MATLAB,   can be defined by the function of "randn" and some linear operations.

Data Feature Extraction
Based on DCA. Figure 1 shows SPE chart of PCA.The control limit with confidence level  = 0.003, which can be calculated by (4).From Figure 1, the abnormally can be detected from the 411th sample time.It can be concluded from Figure 1 that the detection time using PCA is not consistent with the failure point of any component.Therefore, DCA is introduced to overcome the pattern compounding deficiency of PCA.DCA-based faults diagnosis results are depicted in Figure 2. The dotted red line is the control limit.As shown in Figure 2, the Shewhart charts correspond to  1 ,  7 beyond the upper control limit from 451 and 501, respectively.The results indicate that DCA is an effective multiple faults diagnosis method for multiple faults diagnosis.

Early Fault Feature Extraction Based on DCA.
In this part, wavelet based denoising technique is used to extract early fault feature.The filtered SPE chart of PCA is shown in Figure 3.The blue line denotes the filtered SPE (abbreviated as F-SPE in the following); the black point line denotes the filtered SPE after smoothing (abbreviated as F-S-SPE in the following).The fault trend can be detected from the 154th sample point.
Figure 4 shows the faults results of DCA preprocessed by wavelet filter (WF-DCA).The blue line denotes the filtered designated components (the F-dcs); the black line denotes the filtered dcs after smoothing (the F-S-dcs).It can be seen from Figure 4 that the fault trend of the 1st dc can be detected at 161.The 7th dc is abnormal from 326.These simulation results imply that the early fault detection can be achieved by wavelet filtering which can increase the signal-to-noise ratio (SNR) of the fault signal.This paper determines the fault trend detection point using wavelet filtering technique.

RUL Prediction Model.
In this paper, the system component corresponding to  1 is assumed to be a critical component.In order to get more precise RUL prediction model, smoothing technique is used to postprocess the fault precursor.The smoothed evolution precursor between fault trend detection point and failure point can be defined as the damage precursor to establish RUL prediction model.
For establishing RUL prediction model based on WF-PCA, the failure time   = 411 can be determined from Figure 1.The fault trend detection point   = 154 can be obtained from Figure 3.The F-S-SPE between 154 and 411 can be defined as the fault damage precursor.Use RUL =  ×   as the fitting function to establish RUL prediction model, where  is the deviation between the value of F-S-SPE and the control limit.Figure 5 shows the fitted RUL prediction model based on WF-PCA.
Figure 6 shows the fitted RUL prediction model of the critical component based on WF-DCA.The first F-S-dc between 161 and 451 can be defined as fault damage precursor, where   and   can be obtained from Figures 2 and 4, respectively.

Online Life Prediction and Fault Diagnosis Based on DCA.
Once online data is obtained, we can predict the online RUL based on the aforementioned prediction model.Figure 7 depicts the online fault detection results by online F-S-SPE.It can be seen from Figure 7 that the fault trend can be detected from the 152nd sample point.
To further demonstrate the superiority of the proposed method, two commonly used prediction algorithms are employed for comparison, namely, AR and BPNN.For BPNN and AR, fault trend prediction and recursive RUL prediction are implemented in two separate stages which makes them unavailable to predicting the RUL online.The parameters of BPNN are shown in Table 2.In this paper, the order of AR model is 1, and the regression parameter is .The recursive RUL prediction process can be formulated as follows: where  is the number of recursive steps and  is the failure control limit.Figure 8 shows the online RUL prediction result of PCA.The blue line is the real RUL.The dotted red line is the predicted RUL by the method proposed in this paper.The dotted black line is the predicted RUL by AR.The dotted blue line is the predicted RUL by BPNN.As we can see from Figure 8, the prediction accuracy of the dotted red line is higher than that of the dotted black line and the dotted blue line.Extra recursive steps are required for both AR and BPNN based RUL prediction model.The method proposed in this paper can avoid this problem and reduce the prediction error.Figure 9 depicts the online faults diagnosis results based on WF-DCA.From Figure 9, the fault trends corresponding to  1 and  7 can be detected from 165 and 324, respectively.Figure 10 shows the online prediction result of the critical component.The blue line is the real RUL.The dotted red   10 that fitted RUL prediction model is more efficient than other models.The mean of RUL prediction error is listed in Table 3.It can be concluded from Table 3 that the proposed method is a good choice for the RUL prediction of the critical component.

Conclusions
Traditional RUL prediction is based on autoregression which cannot ensure a real-time RUL prediction since necessary computation time is required for online recursive prediction.In addition in order to implement RUL prediction for critical component that has disastrous impact on the system, DCA is introduced as a fault feature extraction tool for a certain system component to overcome the pattern compounding problem of PCA.The fault feature extracted by DCA can be used to define the fault precursor.For the sake of establishing RUL prediction model, wavelet filter technology based preprocessing is used for early detection of those slowly varying faults.Finally, exponential fitting based online RUL

4. 1 .
Simulation Data.In this section,  = 20,  = 1000 are used for simulation.Normal observation data is the compound of ten coexisting change patterns:

Figure 5 :Figure 6 :
Figure 5: The fitted RUL prediction model based on PCA.

Figure 7 :
Figure 7: Online early fault detection based on WF-PCA.

Figure 9 :Figure 10 :
Figure 9: Online early fault diagnosis based on DCA.

Table 1 :
Typical fault-symptom relation of air compressor.
Firstly, online observation data after pretreatment of wavelet filtering is denoted by   .Project   to the designated patterns   to obtain the designated component vector   :   =      ,  = 1, 2, . . ., .
3. Exponential life prediction model is established by nonlinear fitting of the historical data.Fitting equation is shown as follows: RUL =  +  ×   .(29) 3.2.Online Fault Diagnosis.

Table 3 :
Mean of RUL prediction error.The dotted black line is the predicted RUL by AR.The dotted blue line is the predicted RUL by BPNN.It can be seen from Figure