Gearbox Fault Diagnosis Based on a Sparse Principal Component-Generalized Regression Neural Network

e fault diagnosis of urban rail transit gearboxes has the characteristics of complex vibration signals and large amounts of data. e daily scheduled maintenance cannot meet the needs of gearbox maintenance, so it is necessary to predict the fault types in advance. In this paper, a method of gearbox fault prediction based on sparse principal component analysis and a generalized regression neural network is presented, and the result of fault prediction can provide a reference for making maintenance plans. Based on principal component analysis (PCA), a sparse principal component is obtained by adding the LASSO penalty term, which reduces the risk of overtting of PCA while obtaining a sparse solution. en, the sparse reduced dimension principal component is input into the generalized regression neural network model for fault diagnosis. e results show that the fault diagnosis method based on the sparse principal component-generalized regression neural network model has high accuracy and is time-consuming.


Introduction
e gearbox is a key component of the urban rail transit walking arrangement, and its failure can threaten the normal operation of the train and bring safety problems. Fault diagnosis allows the maintainer to predict the gearbox fault types to be repaired, organize the maintenance strategy in advance, determine the maintenance time, and reduce the maintenance cost and the loss caused by the fault event.
e progress of arti cial intelligence technology makes the application of neural network models in device diagnosis research common. e literature [1] applied the short-term memory (LSTM) recurrent neural network to aircraft fault prediction. Reference [2] used a CNN-GRU neural network model to predict short-term transformer faults. Reference [3] predicted the possible faults of power systems using a deep convolutional neural network model. e literature [4] used the convolutional neural network method for fault determination of characteristic turnout characteristic pattern. Reference [5] used the PCA-NARA neural network model for the degradation trend assessment and prediction of wind turbine gearboxes. Reference [6] constructed a GRU recurrent neural network-based fault prediction model for cloud data center applications. When performing datacentric fault diagnosis and prediction, data noise and redundancy can a ect the accuracy of fault type determination. Based on this, this paper adopts sparse principal component analysis for sparse processing of fault vibration data on the basis of traditional principal component analysis and combines a generalized regression neural network algorithm to establish a fault diagnosis model, which can be used for gearbox fault diagnosis and provide guidance on the development of maintenance plans for urban rail transit vehicles.

Sparse Principal Component Analysis
is method simplifies the data structure by constructing a new set of variables and uses fewer eigenvectors to approximate the information of the original data [8]. e calculation process is as follows: (1) After normalizing the original data set with i observations and j process variables with different dimensions, the matrix X is obtained, X ∈ R i×j . (2) e eigenvalue decomposition of X is performed; that is, the covariance solution is performed, and the score matrix T is calculated. at is, where Λ is the diagonal matrix of eigenvalues, Λ ∈ R j×j . V is the matrix [9], andV ∈ R j×j . P is the load matrix, constructed from the top n columns of V, V ∈ R j×k (3) By projecting T back into the j × i-dimensional space, the new matrix X after dimensionality reduction can be obtained, and the cumulative contribution rate v t can be calculated. Usually, the top n principal components corresponding to indicators with a v t value greater than 85% are selected [10] which is PCA has an obvious effect on the dimensionality reduction of fault data, but it also has disadvantages; that is, it is difficult to use for nonlinear faults, and it is difficult to explain the characteristics of the principal component load of nonzero vectors [11] characteristic. Sparse dynamic principal component analysis (SDPCA) can be used to compensate for the shortcomings of the PCA method. Based on PCA, SPCA obtains the sparse principal components of data by adding LASSO penalty items [12]. is paper takes Zou [13]'s sparse method as an example to introduce the solution process of the sparse principal component. (4) Based on the principal component analysis method, the residual vector E is calculated, and the residual vector is the difference between the original matrix X and the new matrix after dimensionality reduction.
e principal component solution is to make the residual vector as small as possible and retain as much information as possible about the principal component in the original data, so the principal component solution can be performed using the following equation, where Q is the solution of the principal component matrix P; that is, (5) Adding the LASSO penalty term to sparse the obtained principal component reduces the risk of overfitting of the PCA solution while obtaining a sparse solution [13]. at is, where A and B are the load matrix and sparse load matrix, respectively. A s , B s is the solution of A, B when the right side of the equation takes the minimum value. β is the load vector after sparsity. λ, λ 1,m is the coefficient of the penalty terms, λ > 0.
SΛ is the diagonal array of explained variance after sparsity. P s is the load vector matrix after sparsity.

GRNN Fault Prediction Model
A generalized regression neural network (GRNN) consists of an input layer, a pattern layer, a summation layer, and an output layer [14]. GRNN has a good fitting effect on nonlinear data when the data accuracy is poor or the amount of data is small. e structure of the GRNN network is shown in Figure 1. e vector X � [x 1 , x 2 , . . ., x n ] T is input from the input layer, and the number of original characteristic parameters is n. e pattern layer accepts the data output from the input layer, which is transformed into P � [p 1 , p 2 , . . ., p n ] T by the neuron transfer function and passed to the summation layer. σ in the GRNN model represents the smoothing factor, and the ideal σ value can usually be calculated according to the distribution probability of the sample. at is, ere are two types of neurons in the summation layer, the denominator unit and numerator unit [15], and the S D and S Nj (j � 1,2, . . ., k, where k is the dimension of the output vector) are obtained after the output of the hidden layer is multiplied by the corresponding weight and then added and summed, and the output vector is Y � [y 1 ,y 2 , . . ., y n ] T ; that is, Suppose x and y are two random variables. e joint probability density is f(x, y), in which x 0 is the observed value of x, and the regression relationship between y and x is Applying the Parzen nonparametric estimation, the probability density function f(x 0 , y) can be estimated from the sample data set. at is, In Equation (10), n is the sample size, and p is the dimensionality of the random variable x. Substituting this into Equation (9), exchange the order of integration and summation and simplify it to obtain:

Data Preprocessing.
e gearbox contains mechanical parts, such as gears, shafts, rolling bearings, and boxes, which are common power transmission devices in urban rail vehicles. Its working principle is that the rotating shaft drives the sector gears in the gearbox and transmits torque vertically to the other shaft. According to the data [16], gearbox faults occur more often in the gears and rolling bearings, 59% and 20%, respectively. is paper takes the transmission system diagnostic simulator (DDS) as the data acquisition object, the speed is 2000 r/min, and the sampling frequency is 5120 Hz. Four typical faults of the gearbox were selected for fault prediction, including bearing ball error, bearing inner and outer ring fault, gear missing teeth, and gear surface cracks. e gearbox parameters are shown in Table 1.
e collected vibration signals were taken as a sample set every 1024 points, and 30 sample sets were extracted under five working conditions: normal, bearing ball misalignment, bearing inner and outer ring fault, gear missing teeth, and gear surface cracks. A total of 29 indicators in the time domain and frequency domain were extracted from each sample set, which can comprehensively describe the faults of gears and bearings in the gearbox. However, characteristics in the frequency domain were obtained through a Fourier transform on the basis of the time domain, so there is information redundancy. In addition, some indicators are sensitive to faults, and some indicators are not sensitive to faults, so if not processed, it will cause inaccurate prediction results. is paper uses PCA and SPCA to reduce the dimensionality of 29 characteristic indicators, eliminate the indicators with low fault sensitivity, eliminate data redundancy, and improve the fault prediction accuracy. e 150 gearbox vibration data sets under five working conditions are used as input for PCA and SPCA analysis, and the dimensionality reduction results obtained are shown in Table 2.
In Table 2, PC1∼PC6 are the first six principal components after dimensionality reduction. It can be seen from the dimensionality reduction results that although the cumulative contribution rate of SPCA is 85.2% lower than that of PCA, 88.3%, there are zero elements in the principal component load extracted by the SPCA dimensionality reduction method, and the number of the corresponding zero elements are 2, 4, 8, 7, 8, and 10, which reduces the complexity of variable dimensionality reduction. e first six Mathematical Problems in Engineering 3 principal components of SPCA can represent 85.2% of the original information, the first six principal components of PCA represent 88.3% of the original information, all greater than 85%, and the first six principal components can be used as input to represent the original information. e partial load matrix of the first six principal components of SPCA is shown in Table 3.

Comparison of Fault Prediction
Results. e 150 groups of fault characteristic sets composed of five working conditions and 29 characteristic indicator values were transformed into a fault characteristic set composed of 6 characteristic indicators through PCA and SPCA, and 24 groups and 6 sets of fault characteristics of each working condition were randomly selected. e set consists of 120 training sets and 30 testing sets. To compare the fault prediction effects of the GRNN, PCA-GRNN, and SPCA-GRNN models, 29 time-domain and frequency-domain fault characteristic sets were used as the input of the GRNN model, and the fault characteristic sets transformed by the PCA and SPCA analysis were used as input to the PCA-GRNN and SPCA-GRNN models. e value of the smoothing factor in the GRNN model determines the prediction result. In this paper, the smoothing factor corresponding to the highest accuracy rate is determined by inputting the training samples and the changed smoothing factor value. e interval of the smoothing factor value is [0. 1,1]. When this value is 0.7, the maximum accuracy is 96%. e main parameters of the models were determined based on the input indicator characteristics of the three models and the type of faults predicted, as shown in Table 4.
By inputting the training set for model training and inputting the test set to test the performance of the model, the fault prediction results of the gearbox are obtained, as shown in Figures 2-4, and the comparison of the fault prediction results of the three models is shown in Table 5.   When the original data before dimension reduction are used as input for fault prediction, the accuracy rate is 43.3%, and the prediction time is 10.23 s. e fault prediction accuracy of the generalized regression neural network based on principal component analysis can reach 90%, and the prediction time is 2.63 s. Although the SPCA-GRNN model, which extracted six sparse principal elements for generalized regression neural network prediction, had the lowest cumulative contribution of the three models, the prediction accuracy reached 96.6%, and the prediction time was 82% shorter than the GRNN model. Compared with the PCA-GRNN model, it was 30% shorter and the model predicted the best results.

Conclusion
(1) SPCA reduces the dimension of sample data on the basis of PCA by adding the LASSO penalty item, and the six sparse principal components obtained contain more than 85% of the information of the original data, which improves the interpretability of the principal component while reducing the computational complexity. (2) Aiming at the gearbox fault prediction problem, this paper adopts the prediction method combining sparse principal component analysis and a generalized regression neural network. After comparing the fault prediction results of the three models of GRNN, PCA-GRNN, and SPCA-GRNN, it is concluded that the prediction ability is SPCA-GRNN > PCA-GRNN > GRNN.      Mathematical Problems in Engineering (3) In this paper, sparse principal component analysis is introduced into the research of fault vibration data processing, and combined with GRNN, it is used for gearbox fault prediction. Future research will consider the feasibility of integrating kernel principal component analysis with other machine learning and fault prediction effects.

Data Availability
e labeled dataset used to support the findings of this study is available from the corresponding author upon request.

Conflicts of Interest
e author declares no conflicts of interest.