Fault Diagnosis of Rotating Machinery Based on Multisensor Information Fusion Using SVM and Time-Domain Features

Multisensor information fusion, when applied to fault diagnosis, expands the time-space scope and the quantity of information beyond what a single sensor can acquire, so the diagnostic object can be described more comprehensively. This paper presents a methodology for fault diagnosis of rotating machinery using multisensor information fusion, in which all features are calculated from time-domain vibration data to constitute a fused vector and a support vector machine (SVM) is used for classification. The effectiveness of the presented methodology is tested by three case studies: diagnosis of a faulty gear, diagnosis of a faulty rolling bearing, and identification of a rotor crack. For each case study, the sensitivities of the features are analyzed. The results indicate that the peak factor is the most sensitive of the twelve time-domain features for identifying gear defects, and that the mean, amplitude square, root mean square, root amplitude, and standard deviation are all comparatively sensitive for identifying gear, rolling bearing, and rotor crack defects.


Introduction
Typical rotating machinery systems such as water turbines, steam turbines, wind turbines, and rotary kilns are critical core equipment supporting important industries of the national economy [1,2]. The safety, reliability, efficiency, and performance of rotating machinery are major concerns in industry, so condition monitoring and fault diagnosis of rotating machinery are significant tasks [3]. The common mechanical defects of rotating machinery are divided into three categories: (1) rotor body defects, such as unbalance, misalignment, rubbing, and rotor crack; (2) rotor support bearing defects, such as inner race, outer race, or ball defects of rolling bearings, and oil whirl or oil whip of sliding bearings; (3) transmission gear defects, such as chipped tooth or missing tooth defects. In-process monitoring and diagnostics of rotating machinery require reasoning about defect and process states from sensor readings. Often the relationship between the sensor readings and the process states is complex and nondeterministic. For a complex system, a single sensor is incapable of collecting enough data for accurate condition monitoring and fault diagnosis, so multiple sensors are needed. When multiple sensors are used, data collected from different sensors may contain different partial information about the same machine condition, and the diagnostic object can be described more comprehensively [4][5][6]. Compared with a single sensor, the time-space scope and the quantity of information are expanded, and the diagnostic accuracy and reliability can be improved. Multisensor information fusion can be categorized into three levels [7,8]: data-level fusion, feature-level fusion, and decision-level fusion.
At data-level fusion, all sensor data from a measured object are combined directly and features are then calculated from the fused data. Fusion of data at this level retains the most information and can deliver good results. However, the sensors used at this level must be commensurate; that is, the measurements have to be of the same or similar physical quantities or phenomena. In the most popular data-level fusion methodologies, such as weighted fusion [9], the weights of the multisensor signals are difficult to determine. As a consequence, data-level applications are limited in real environments. At feature-level fusion, features are calculated from each sensor according to the type of raw data. Then, the features from these noncommensurate sensors are combined at the feature level. All features are combined into a single larger feature set, which is then used in a classification model such as an artificial neural network (ANN), a support vector machine (SVM), or a clustering algorithm for decisions [10]. Feature-level fusion is a compromise between data-level fusion and decision-level fusion. Its data alignment requirements are not as strict as those of data-level fusion, so heterogeneous sensors are allowed, and its information loss is less serious than that of decision-level fusion while it still achieves good information compression. As a consequence, feature-level applications are flexible and popular. At decision-level fusion, feature calculation and pattern recognition are applied in sequence to the single-source data obtained from each sensor. The decision vectors are then fused using decision-level fusion techniques such as voting strategy, the Bayesian method, behavior-knowledge space, and Dempster-Shafer theory [11]. Relatively speaking, the information loss is greatest at the decision level.
This paper proposes a feature-level fusion method for rotating machinery fault diagnosis. In the present literature, feature-level fusion for mechanical condition monitoring and fault diagnosis is generally executed on heterogeneous information. For example, Barad et al. developed an ANN-based model for condition monitoring of a power turbine that blends parameters belonging to performance, vibration, and lubrication [8]; Loutas et al. combined vibration, acoustic emission, and oil debris monitoring of rotating machinery [6]. The condition of a mechanical system may be described in more detail by heterogeneous information fusion, but this process needs several classes of sensors and their matching data acquisition systems, which leads to higher monitoring costs and inconvenient data acquisition in real environments. ANN and SVM are the most popular classification models for making decisions at feature-level fusion [12,13]. The main difference between ANN and SVM is in their risk minimization. SVM is based on the structural risk minimization principle, whereas ANN is based on the traditional empirical risk minimization principle. This difference gives SVM better generalization performance than ANN [14,15]. SVM is powerful for problems with small samples, nonlinearity, and high dimensionality in machinery condition classification. In this paper, the proposed feature-level fusion method belongs to homologous information fusion: the raw data all come from vibration sensors, so only a vibration testing system is needed to collect the raw signals, which makes the process simpler. In this method, time-domain features are calculated from each vibration signal to compose a multidimensional feature set, and the SVM is selected as the classification model to perform the information fusion. In order to verify the effectiveness of the proposed method, three fault diagnostic cases are tested: fault diagnosis of rolling bearing (identifying normal, inner race defect, outer race defect, and ball defect), fault diagnosis of gear (identifying normal, chipped tooth, and missing tooth), and fault diagnosis of rotor crack (identifying normal, crack depth of 3 mm, and crack depth of 5 mm). For each case study, the sensitivities of the features are analyzed.

Support Vector Machine (SVM).
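The SVM is a machine learning method based on statistical learning theory and the structural risk minimization principle. Consider a two-category sample set $(x_i, y_i)$, with $x_i \in R^d$, $y_i \in \{-1, +1\}$, and $i = 1, 2, \ldots, n$, where $n$ is the number of samples. The optimal hyperplane separating the data can be obtained as a solution to the following optimization problem [15,16]:
$$\min_{w, b, \xi} \; \frac{1}{2}\|w\|^2 + C \sum_{i=1}^{n} \xi_i \quad \text{s.t.} \quad y_i[(w \cdot x_i) + b] \geq 1 - \xi_i, \; \xi_i \geq 0, \; i = 1, 2, \ldots, n, \tag{1}$$
where $w$ is the weight vector, $b$ is a scalar, $\xi_i$ is the slack variable, and $C$ is the error penalty.
The dual quadratic optimization problem can be obtained by using the Kuhn-Tucker conditions to convert the problem into the equivalent Lagrangian dual problem:
$$\max_{\alpha} \; \sum_{i=1}^{n} \alpha_i - \frac{1}{2} \sum_{i=1}^{n} \sum_{j=1}^{n} \alpha_i \alpha_j y_i y_j (x_i \cdot x_j), \tag{2}$$
where $\alpha_i$ is the Lagrange coefficient, which must meet the following conditions:
$$\sum_{i=1}^{n} \alpha_i y_i = 0, \quad 0 \leq \alpha_i \leq C, \quad i = 1, 2, \ldots, n. \tag{3}$$
A support vector is a sample with nonzero $\alpha_i$, which satisfies the equation $y_i[(w \cdot x_i) + b] = 1 - \xi_i$. This reveals that the samples at the edge of the distribution are essential for classification. This leads to the optimal classification decision function:
$$f(x) = \operatorname{sign}\left[\sum_{i=1}^{m} \alpha_i y_i (x_i \cdot x) + b\right], \tag{4}$$
where $m$ is the number of support vectors.
In the linearly inseparable case, the samples $(x_i, y_i)$ ($x_i \in R^d$; $y_i \in \{-1, +1\}$; $i = 1, 2, \ldots, n$) in the input space are mapped through the nonlinear mapping $\Phi : R^d \to H$ into a high-dimensional space $H$, where the optimal classification surface can be established. The nonlinear mapping $\Phi$ is usually difficult to determine explicitly, but kernel functions $K(x_i, x_j)$ meeting the Mercer conditions can be used to avoid computing it. The kernel function is described as follows:
$$K(x_i, x_j) = \Phi(x_i) \cdot \Phi(x_j). \tag{5}$$
The optimal classification decision function for linearly inseparable samples is obtained by substituting (5) into (4):
$$f(x) = \operatorname{sign}\left[\sum_{i=1}^{m} \alpha_i y_i K(x_i, x) + b\right]. \tag{6}$$
The common kernel functions include the linear kernel function, polynomial kernel function, radial basis function (RBF) kernel function, and sigmoid kernel function.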
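As a minimal illustration of the kernel form of the optimal classification decision function, the following Python sketch evaluates the sign of the kernel-weighted sum over the support vectors plus the bias. It assumes the commonly used RBF form exp(-gamma * ||u - v||^2); the support vectors, coefficients, bias, and gamma value are made-up numbers for illustration, not results of this study.

```python
import math

def rbf_kernel(u, v, gamma):
    """RBF kernel exp(-gamma * ||u - v||^2) (assumed common form)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(u, v))
    return math.exp(-gamma * sq_dist)

def decision(x, support_vectors, alphas, labels, b, gamma):
    """Return +1 or -1 from the kernel decision function."""
    s = sum(a * y * rbf_kernel(sv, x, gamma)
            for sv, a, y in zip(support_vectors, alphas, labels))
    return 1 if s + b >= 0 else -1

# Hypothetical toy model with one support vector per class.
# The coefficients satisfy sum(alpha_i * y_i) = 0.
svs = [(0.0, 0.0), (2.0, 2.0)]
alphas = [1.0, 1.0]
labels = [-1, +1]
b = 0.0
gamma = 0.5

print(decision((0.2, 0.1), svs, alphas, labels, b, gamma))  # close to the -1 support vector
print(decision((1.9, 2.2), svs, alphas, labels, b, gamma))  # close to the +1 support vector
```

Only the support vectors enter the sum, which is why the samples at the edge of the distribution determine the classifier.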
The traditional SVM was originally designed for binary classification problems. However, many practical problems in the fault diagnosis field are multiclass problems. Several effective multiclass support vector machines have been proposed, including "one-against-one," "one-against-all," directed acyclic graph (DAG), and so on [15]. Hsu et al. compared these methods and pointed out that the "one-against-one" method is more suitable for practical use than the other methods [17,18].
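The one-against-one scheme can be sketched as follows: one binary classifier is trained for every pair of classes, giving k(k-1)/2 classifiers for k classes, and a sample is assigned to the class that collects the most pairwise votes. In this sketch the binary classifiers are hypothetical one-dimensional threshold rules standing in for trained binary SVMs, and the tie-breaking rule (first class to reach the top count) is a simplification.

```python
from itertools import combinations
from collections import Counter

def one_against_one_predict(x, classes, binary_classifiers):
    """Majority vote over all pairwise classifiers.

    binary_classifiers maps a class pair (a, b) to a function that
    returns either a or b for the sample x.
    """
    votes = Counter()
    for pair in combinations(classes, 2):
        votes[binary_classifiers[pair](x)] += 1
    # Ties are resolved by first-seen order here (a simplification).
    return votes.most_common(1)[0][0]

classes = ["normal", "chipped tooth", "missing tooth"]
# Hypothetical threshold rules on a scalar feature, for illustration only.
classifiers = {
    ("normal", "chipped tooth"): lambda x: "normal" if x < 1.0 else "chipped tooth",
    ("normal", "missing tooth"): lambda x: "normal" if x < 1.5 else "missing tooth",
    ("chipped tooth", "missing tooth"): lambda x: "chipped tooth" if x < 2.0 else "missing tooth",
}

print(len(classifiers))  # 3 pairwise classifiers for 3 classes
for x in (0.5, 1.7, 2.5):
    print(x, one_against_one_predict(x, classes, classifiers))
```

For the four-class rolling bearing problem the same scheme needs six pairwise classifiers.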

Time-Domain Features.
When the running condition of rotating machinery deviates from the normal condition, the time-domain statistical features of the vibration signal differ from those in the normal condition. Furthermore, the time-domain statistical features also differ between defect models. Therefore, the time-domain statistics contain abundant defect information and can be used as sensitive indicators for fault diagnosis of rotating machinery. The time-domain statistical features used in this study are shown in Table 1.
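A minimal Python sketch of the twelve time-domain statistics named in this paper is given below. Because the formulas of Table 1 are not reproduced in this text, the definitions used here are common textbook forms and should be read as assumptions; in particular, the choice of the (n - 1) denominator and the use of the mean absolute value in the waveform and pulse factors may differ from the authors' definitions.

```python
import math

def time_domain_features(x):
    """Twelve time-domain statistics of a discrete signal x (list of floats).

    All definitions are assumed common forms, not taken from Table 1.
    """
    n = len(x)
    mean = sum(x) / n
    abs_mean = sum(abs(v) for v in x) / n
    amp_square = sum(v * v for v in x) / n                    # mean square amplitude
    rms = math.sqrt(amp_square)
    root_amp = (sum(math.sqrt(abs(v)) for v in x) / n) ** 2
    std = math.sqrt(sum((v - mean) ** 2 for v in x) / (n - 1))
    peak = max(abs(v) for v in x)
    skewness = sum((v - mean) ** 3 for v in x) / ((n - 1) * std ** 3)
    kurtosis = sum((v - mean) ** 4 for v in x) / ((n - 1) * std ** 4)
    return {
        "mean": mean,
        "amplitude square": amp_square,
        "root mean square": rms,
        "root amplitude": root_amp,
        "standard deviation": std,
        "peak": peak,
        "skewness": skewness,
        "kurtosis": kurtosis,
        "waveform factor": rms / abs_mean,
        "peak factor": peak / rms,
        "pulse factor": peak / abs_mean,
        "margin factor": peak / root_amp,
    }

# Synthetic test signal: five full periods of a unit sine wave.
signal = [math.sin(2 * math.pi * 5 * k / 1000) for k in range(1000)]
feats = time_domain_features(signal)
for name, value in feats.items():
    print(f"{name}: {value:.4f}")
```

The dimensionless ratios (waveform, peak, pulse, and margin factors) are insensitive to signal amplitude, whereas the mean, amplitude square, root mean square, root amplitude, standard deviation, and peak scale with it.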

Multisensor Information Fusion Model.
The model of multisensor information fusion used in this study is shown in Figure 1. The same feature is extracted from each of the different sensors to constitute a multidimensional vector, and the SVM is used for pattern recognition. Twelve different time-domain features are analyzed one by one.
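The construction of the fused vector can be sketched in a few lines of Python: one feature function is applied to the signal of every sensor, and the results are stacked into a vector with one component per sensor. The root mean square is used as the example feature, and the eight synthetic signals are placeholders rather than measured data.

```python
import math

def rms(x):
    """Root mean square of a discrete signal."""
    return math.sqrt(sum(v * v for v in x) / len(x))

def fused_vector(sensor_signals, feature):
    """Apply the same feature to each sensor signal; one component per sensor."""
    return [feature(sig) for sig in sensor_signals]

# Hypothetical signal segments from 8 sensors (amplitude grows with sensor index).
sensor_signals = [
    [(s + 1) * math.sin(0.1 * k) for k in range(200)]
    for s in range(8)
]

sample = fused_vector(sensor_signals, rms)
print(len(sample))  # one eight-dimensional fault sample
print([round(v, 3) for v in sample])
```

Each fault sample passed to the classifier is one such vector; repeating the procedure with a different feature function yields the sample set for that feature.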

Case Studies
Data Acquisition.
Experiments were performed on the machinery fault simulator (MFS) from SpectraQuest, Inc., shown in Figure 2. It can simulate most of the faults that commonly occur in rotating machinery, such as rotor body defects, bearing defects, and gearbox defects. The shaft rotating speed was measured by a laser speedometer. Acceleration signals were collected using a Dewetron 16-channel data acquisition system and IMI 608A11 accelerometers.
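In the vibration testing experiments for rolling bearing fault diagnosis, the simulator is composed of a motor, a coupling, a testing rolling bearing fitted on the left of the shaft near the motor, a working rolling bearing on the other side, a bearing load, and a shaft. The MFS provides a rolling bearing fault kit consisting of one normal bearing, one with an inner race defect, one with an outer race defect, one with a ball defect, and one with a combination of defects for performing experiments and studying bearing fault diagnosis. The acquisition frequency is 10 kHz. The sensor layout is depicted schematically in Figure 2(a); a total of 8 sensors, 1 to 8, are used.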
In the vibration testing experiments for gear fault diagnosis, the drive from the motor is transmitted to the gearbox through the bearing-rotor system and a belt. The gearbox consists of a two-stage parallel shaft with rolling bearings, helical gears, and a magnetic brake. The simplified diagram of the gearbox transmission is shown in Figure 3, where gear 1 is the testing gear. The MFS provides a gear fault kit consisting of one normal gear, one with a chipped tooth, and one with a missing tooth for performing experiments and studying gear fault diagnosis. The acquisition frequency is 20 kHz. The sensor layout is depicted schematically in Figure 2(b); a total of 8 sensors, 1 to 8, are used.
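In the vibration testing experiments for rotor crack fault diagnosis, the rotor-bearing system is driven by the motor. In order to simulate the propagation of a crack, crack faults were introduced into the test rotor by electrodischarge machining. The defect with a crack width of 0.12 mm and a crack depth of 3 mm represents a slight defect, and that with a crack width of 0.12 mm and a crack depth of 5 mm represents a serious defect. The acquisition frequency is 10 kHz. The sensor layout is depicted schematically in Figure 2(a); a total of 4 sensors, 1 to 4, are used.

Figure 1: The multisensor information fusion process model.

Fault Diagnostic Case of Gear.
Vibration signals of gear with three fault models, namely normal, chipped tooth, and missing tooth, are taken for analysis. A given time-domain feature is calculated from eight sensors (1 to 8) to constitute an eight-dimensional vector as a fault sample. One hundred and ten fault samples from each model, three hundred and thirty samples in total, constitute the fault sample set. Sixty fault samples from each model, one hundred and eighty samples in total, are selected randomly as training samples, and the others are used as testing samples. Twelve time-domain statistics are analyzed one by one.
LibSVM-mat-2.9 is chosen for SVM calculation. LibSVM, developed by Chih-Jen Lin in Taiwan [19], is a simple and easy-to-use SVM tool for classification. The RBF kernel function is chosen as the kernel function, as follows:
$$K(x_i, x_j) = \exp\left(-\gamma \|x_i - x_j\|^2\right), \tag{7}$$
where $\gamma$ is the kernel parameter.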
Cross-validation combined with grid search is used to find the best parameters: the error penalty $C$ of the SVM and the kernel parameter $\gamma$ of the RBF. One-against-one multiclassification is chosen for pattern recognition. The diagnostic results of gear using different time-domain features are listed in Table 2. Table 2 shows that the highest diagnostic accuracy, 93.33%, is obtained by using the peak factor as the feature to constitute the fused vector for gear fault diagnosis. Because the same SVM classifier is used throughout, diagnostic accuracy indicates the sensitivity of a feature; the peak factor is therefore the most sensitive of the twelve time-domain features for identifying gear defects, followed by the amplitude square, root amplitude, mean, root mean square, standard deviation, and peak. The diagnostic accuracy is above 80% for all of these features. The skewness, kurtosis, waveform factor, and margin factor are comparatively less sensitive; the diagnostic accuracy is under 70% for all of them. Table 2 also shows that the accuracy for normal testing samples is above 90% with any feature. During the analysis, we also found that chipped tooth and missing tooth samples are easily misclassified as each other, but defect samples are seldom mistaken for normal samples, so it can be deduced that normal and defective gears are easy to distinguish.
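The distinction drawn above between overall accuracy, per-class accuracy, and confusion between specific defect classes can be made concrete with a short Python sketch. The label lists below are invented for illustration and do not reproduce the entries of Table 2.

```python
from collections import Counter

def confusion_counts(y_true, y_pred):
    """Count (true label, predicted label) pairs."""
    return Counter(zip(y_true, y_pred))

def per_class_accuracy(y_true, y_pred):
    """Fraction of correctly classified samples for each true class."""
    totals = Counter(y_true)
    correct = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {c: correct[c] / totals[c] for c in totals}

# Hypothetical test labels: 5 samples per gear condition.
y_true = ["normal"] * 5 + ["chipped"] * 5 + ["missing"] * 5
y_pred = (["normal"] * 5
          + ["chipped"] * 4 + ["missing"]        # one chipped sample taken for missing
          + ["missing"] * 3 + ["chipped"] * 2)   # two missing samples taken for chipped

overall = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(overall)
print(per_class_accuracy(y_true, y_pred))
print(confusion_counts(y_true, y_pred))
```

In this invented example every error is a confusion between the two defect classes and none involves the normal class, which mirrors the qualitative pattern described for the gear case.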
In order to compare with a single sensor for gear fault diagnosis, eight features from a single sensor are taken to constitute an eight-dimensional vector as a fault sample. The eight features are the peak factor, amplitude square, root amplitude, mean, root mean square, standard deviation, peak, and pulse factor, which are the eight most sensitive features for identifying gear defects according to the above analysis. In order to avoid differences in order of magnitude between features, the feature vector is normalized before being input to the SVM. In the proposed multisensor analysis, by contrast, the fault sample is constituted by the same feature from multiple sensors, so no such difference in order of magnitude arises and normalization is not needed. The sensors 1 to 8 are analyzed one by one.
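The normalization step for the single-sensor vectors can be sketched as follows. The paper does not state which normalization was used, so min-max scaling of each feature to [0, 1], with the range taken from the training samples, is an assumption made here for illustration; the numbers are invented.

```python
def fit_min_max(train):
    """Per-feature minimum and maximum over the training samples."""
    cols = list(zip(*train))
    return [min(c) for c in cols], [max(c) for c in cols]

def apply_min_max(samples, lo, hi):
    """Scale each feature to [0, 1] using the training range."""
    return [
        [(v - l) / (h - l) if h > l else 0.0 for v, l, h in zip(row, lo, hi)]
        for row in samples
    ]

# Hypothetical two-feature samples whose features differ by orders of magnitude.
train = [[0.5, 1200.0], [1.5, 800.0], [1.0, 1000.0]]
lo, hi = fit_min_max(train)
scaled = apply_min_max(train, lo, hi)
print(scaled)
```

Without such scaling, a feature with large numeric values would dominate the distance inside a distance-based kernel. When every component is the same feature measured by different sensors, the components share a common scale, which is why the fused vectors are used unscaled.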
The diagnostic results of gear using different single sensors are listed in Table 3.
Comparing Tables 2 and 3, it can be found that the multisensor information fusion method gives higher diagnostic accuracy overall than the single-sensor method.

Fault Diagnostic Case of Rolling Bearing.
Vibration signals of rolling bearing with four fault models, namely normal, inner race defect, outer race defect, and ball defect, are taken for analysis. A given time-domain feature is calculated from eight sensors (1 to 8) to constitute an eight-dimensional vector as a fault sample. One hundred and ten fault samples from each model, four hundred and forty samples in total, constitute the fault sample set. Fifty fault samples from each model, two hundred samples in total, are selected randomly as training samples, and the others are used as testing samples. Twelve time-domain statistics are analyzed one by one.
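The random per-class split described above can be sketched in Python as follows. Integer identifiers stand in for the eight-dimensional fault samples, and the fixed random seed is an assumption added only to make the sketch reproducible.

```python
import random

def split_per_class(samples_by_class, n_train, seed=0):
    """Randomly pick n_train samples per class for training; the rest are for testing."""
    rng = random.Random(seed)
    train, test = [], []
    for label, samples in samples_by_class.items():
        idx = list(range(len(samples)))
        rng.shuffle(idx)
        train += [(samples[i], label) for i in idx[:n_train]]
        test += [(samples[i], label) for i in idx[n_train:]]
    return train, test

classes = ["normal", "inner race", "outer race", "ball"]
# Placeholder samples: 110 identifiers per class instead of real feature vectors.
samples_by_class = {c: list(range(110)) for c in classes}

train, test = split_per_class(samples_by_class, n_train=50)
print(len(train), len(test))  # 200 training and 240 testing samples
```

Splitting within each class keeps the training set balanced, with the same number of samples for every fault model.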
LibSVM-mat-2.9 is chosen for SVM calculation. The Gaussian (RBF) kernel function is chosen as the kernel function. Cross-validation combined with grid search is used to find the parameters $C$ and $\gamma$. One-against-one multiclassification is chosen for pattern recognition. The diagnostic results of rolling bearing using different time-domain features are listed in Table 4. Table 4 shows that the mean, amplitude square, root mean square, root amplitude, and standard deviation are the five most sensitive features for identifying rolling bearing defects; the diagnostic accuracy is 100% with each of them. Comparing Tables 4 and 2, the proposed information fusion method gives higher diagnostic accuracy overall for rolling bearing fault diagnosis than for gear fault diagnosis. The main cause is that the transmission path from the defect position of the rolling bearing to the sensor installation position is shorter and simpler than that from the defect position of the gear.
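The cross-validated parameter search mentioned above (commonly called grid search) can be outlined in Python as below. The sketch only shows the search loop and the fold construction: `toy_cv_accuracy` is a hypothetical stand-in for "train an SVM on all folds but one, test on the held-out fold, and average the accuracy", and the five folds and base-2 exponent ranges are assumptions, since the paper does not state the values it used.

```python
import itertools
import math

def k_fold_indices(n_samples, k):
    """Split indices 0..n_samples-1 into k nearly equal contiguous folds."""
    folds, start = [], 0
    for f in range(k):
        size = n_samples // k + (1 if f < n_samples % k else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds

def grid_search(cv_accuracy, c_exponents, gamma_exponents):
    """Return (best accuracy, C, gamma) over a base-2 exponential grid."""
    best = None
    for ce, ge in itertools.product(c_exponents, gamma_exponents):
        C, gamma = 2.0 ** ce, 2.0 ** ge
        acc = cv_accuracy(C, gamma)
        if best is None or acc > best[0]:
            best = (acc, C, gamma)
    return best

def toy_cv_accuracy(C, gamma):
    """Hypothetical score surface peaking at C = 8, gamma = 0.5 (not a real SVM)."""
    return 1.0 / (1.0 + abs(math.log2(C) - 3.0) + abs(math.log2(gamma) + 1.0))

folds = k_fold_indices(200, 5)
best_acc, best_C, best_gamma = grid_search(
    toy_cv_accuracy, range(-5, 16, 2), range(-15, 4, 2)
)
print([len(f) for f in folds])
print(best_acc, best_C, best_gamma)
```

In practice the scoring function would call the SVM library for each fold, and the selected pair would then be used to retrain on the full training set before evaluating the testing samples.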
In order to compare with a single sensor for rolling bearing fault diagnosis, eight features from a single sensor are taken to constitute an eight-dimensional vector as a fault sample. The eight features are the mean, amplitude square, root mean square, root amplitude, standard deviation, peak, kurtosis, and waveform factor, which are the eight most sensitive features for identifying rolling bearing defects according to the above analysis. In order to avoid differences in order of magnitude between features, the feature vector is normalized before being input to the SVM. The sensors 1 to 8 are analyzed one by one. The diagnostic results of rolling bearing using different single sensors are listed in Table 5.
Comparing Tables 4 and 5, it can be found that the multisensor information fusion method gives higher diagnostic accuracy overall than the single-sensor method.

Fault Diagnostic Case of Rotor Crack.
Vibration signals of rotor crack with three fault models, namely normal, crack depth of 3 mm, and crack depth of 5 mm, are taken for analysis. A given time-domain feature is calculated from four sensors (1 to 4) to constitute a four-dimensional vector as a fault sample. One hundred fault samples from each model, three hundred samples in total, constitute the fault sample set. Fifty fault samples from each model, one hundred and fifty samples in total, are selected randomly as training samples, and the others are used as testing samples. Twelve time-domain statistics are analyzed one by one.
LibSVM-mat-2.9 is chosen for SVM calculation. The Gaussian (RBF) kernel function is chosen as the kernel function. Cross-validation combined with grid search is used to find the parameters $C$ and $\gamma$. One-against-one multiclassification is chosen for pattern recognition. The diagnostic results of rotor crack using different time-domain features are listed in Table 6.
Table 6 shows that the mean, amplitude square, root mean square, root amplitude, and standard deviation are the five most sensitive features for identifying rotor crack defects. The diagnostic accuracy is 90% with each of these features. The result is similar to that of the rolling bearing fault diagnostic case.

Conclusion
In this paper, a feature-level information fusion methodology is proposed in which all features are calculated from time-domain vibration data to constitute a fused vector and the SVM is used for classification. Only a vibration testing system is needed to collect the raw signals in this method, so the process is simpler. The effectiveness of the proposed methodology is tested with examples of gear, rolling bearing, and rotor crack fault diagnosis. The sensitivities of the twelve time-domain features are discussed in each case study. The results indicate that the peak factor is the most sensitive of the twelve time-domain features for identifying gear defects, but it is not very sensitive for identifying rolling bearing and rotor crack defects. The mean, amplitude square, root mean square, root amplitude, and standard deviation are all comparatively sensitive for identifying gear, rolling bearing, and rotor crack defects.
The features used and discussed in this paper are all in the time domain; however, frequency-domain features can also be used for fault diagnosis of rotating machinery, and their sensitivities for identifying rolling bearing, gear, and rotor defects are worth studying in the future.

Table 1 :
The statistical features in time domain. In the table, the signal is a discrete time series.

Table 2 :
Diagnostic results of gear by using different features for fusion.

Table 3 :
Diagnostic results of gear by using different single sensors.

Table 4 :
Diagnostic results of rolling bearing by using different features for fusion.

Table 5 :
Diagnostic results of rolling bearing by using different single sensors.

Table 6 :
Diagnostic results of rotor crack by using different features for fusion.