Comprehensive Evaluation Model of Bearing Transportation Protection Effect of Bogie Traction Motor under Data Imbalance

Under different transportation protection, the sample data of bogie traction motor bearings of urban rail vehicles are seriously unbalanced, and the fault diagnosis ability and generalization effect are poor, which makes it difficult to evaluate the protection effect of bearings effectively. In this paper, a multimeasure hybrid evaluation model based on compressed sensing is proposed to evaluate the effect of bearing transportation protection under data imbalance. Firstly, bearing vibration signals under different transport protection conditions were compressed and sampled, and the original high-Witt collection in time domain, frequency domain, and time-frequency domain was extracted. Then, a multimeasure mixed feature evaluation model of correlation, distance, and signal was constructed, and the optimal multimeasure combination strategy was optimized by using comprehensive sensitivity score evaluation index. Finally, an evaluation model of bearing protection effect based on unified feature index was constructed by using the best feature subset evaluated, and the unified indicator was quantified to characterize the protection effect of different protection states. The experimental results show that the model can effectively evaluate bearings under different transport protection.


Introduction
As an important part of the walking part of urban rail transit vehicles, the bogie plays the role of motion orientation, bearing, and vibration reduction and is also the ultimate executor of traction and braking and plays an important role in the safe driving of the train [1]. At the present stage, the bogies put into use in China are mainly based on welding. Due to the characteristics of the welding process itself, the weld position of the bogies is easy to transmit residual stress and deformation will have influence, and the traction and driving device of the vehicle will be directly related to the quality of force transmission, and whether the structure is safety and reliable will directly affect the operation safety of the train [2]. Traction motor is the key part of urban rail vehicle driving device, its operation state will directly affect the train performance and transportation efficiency, including rolling bearing which is one of the most widely used parts of traction motor, and the relevant research results show that the most prone to failure parts of traction motor is rolling bearing; bearing damage accounts for about 44% of traction motor failure [3]. Therefore, ensuring the quality of the bogie traction motor bearing is one of the important factors to ensure the safe, stable, and comfortable operation of the urban rail vehicles. Urban rail vehicles are often transported to the owner's site [4]. If the bearings are not protected during transportation, pseudocloth marks will occur due to the road turbulence, abnormal bearing sound during site operation, and eventually mass replacement, which seriously affects the manufacturer's product quality of the bearing and causes serious hidden dangers to the safe operation of urban rail trains.
The protection effect assessment of the traction motor bearing of urban rail vehicles under different transportation protection has two main difficulties: (1) In the signal collection of the urban rail vehicle steering gear traction motor bearing, there is a serious data imbalance phenomenon. The main reasons of this phenomenon are as follows. (a) With the vehicle running speed and the complex environment, the data flow accelerates, and the amount of state data increases. Due to the development and deterioration of the equipment state, the shortening of the monitoring period will also lead to the increase of the state data volume. (b) The bearing produces vibration signals containing impact attenuation components under different transportation protection, all of which have obvious nonlinear behavior, which eventually leads to the spectral bandwidth of the signal, excessive number of data collection, and massive data bring great pressure on data transmission, storage, and processing. (c) Under the actual working condition, the number of bearing fault signal samples is generally far less than the normal state signal.
In conclusion, the mentioned factors will cause unbalanced bearing dataset (2) The fault diagnosis ability of the bearing is poor, and the protective effect assessment is difficult to be effectively evaluated. The reasons of this phenomenon are as follows: the complex rotating mechanical structure and bad operating environment make the fault mode usually show strong complexity [5], and the "underlearning" problem of fault feature sensitivity in the widely used single measure evaluation model and the common feature parameter extraction of different analysis fields can only state the information of mechanical equipment characterized by vibration signal be described from different angles. The above situation causes the failure to obtain the status information in the bearing fault diagnosis, so as not to provide an accurate data basis for the pattern recognition and then affect the subsequent classification performance and effect assessment

Related Research Work
The application object of previous research is generally limited to the error data with similar distribution under constant and stable conditions. Unfortunately, this restriction has little effect in real-life scenarios. The working conditions are complex and changeable, which makes it difficult to extract effective fault diagnosis representation in practical application by previous deep representation learning methods [6,7]. To solve the serious data imbalance of bearing vibration signal under continuous high sampling, this paper adopts compression sensing (compressive sensing (CS)) theory also called compression sampling theory. A compressed sensing theory proposed by Donoho et al. [8][9][10], whose signal compression theory breaks through the Nyquist limit, can achieve less measurement compression sampling and complete high probability accurate reconstruction. Although the observation data is reduced, it contains enough original signal information for signal recovery to realize the "compression sampling" of the signal. The concept of this theory can be described as follows: for a collected time-domain target signal, as long as the signal is compression on a certain sparse transformation basis, a linear measurement matrix unrelated to the sparse transformation base and select an appropriate reconstruction algorithm to accurately reconstruct the signal based on the low-dimensional compression measurement. Currently, compressed sensing hotspots focus on three key problems: sparse representation, signal observation, and signal reconstruction [11,12]. Early studies performed limited sampling studies using the compressibility of signals and sampled continuous signals using a fixed structural basis function at twice the information rate rather than twice the sampling frequency [13]. The notion of the uncertainty principles of sparse representation is first proposed by Donoho et al. [14]. Based on this result, El Ad and Bruckstein [15] further discuss the uniqueness of the sparse representation and prove the boundary conditions for the exact sparse reconstruction. Tropp [16] proposed more general conditions, unifying the reconstruction conditions for the sparse representation problem of the l 0 and l 1 norm constraints. Candes et al. [16] propose the exact reconstruction principle that further demonstrates the uniqueness of the sparse representation problem and discuss the stability, robustness, and the extension of the algorithm. Baraniuk et al. [17,18] proposed that constrained isometries provide a theoretical basis for the observation matrix design and signal reconstruction. In recent years, CS has made some progress in image compression, face recognition, radar imaging, communication, and other fields [19], but at present, the research in the field of mechanical equipment fault diagnosis is still widely involved, and little research is applied in the field of urban rail vehicles. The fault feature sensitivity "underlearning" problem in the single measure evaluation model and the feature parameter characterization information extracted by different single analysis domain often have significant undercompleteness. It is not difficult to find that extracting the multidomain multifeature parameter is a necessary guarantee to comprehensively describe the equipment fault state mode. Meanwhile, the feature weighted has rich feature sensitivity information to the original feature set with eigenweights without losing any feature parameter, giving the weighted feature set with better category discrimination ability. According to the principle of feature weighting, the calculation of its weight coefficient is closely related to various types of feature evaluation criteria, from which various characteristic weighting methods based on typical characteristic evaluation measures or their improved versions are born, for example, fault feature weighting based on compensation distance evaluation technology [20], nuclear space distance measure [21], feature weighted scheme based on entropy, mean variance and mutual information, feature weighted based on joint Laplacian score, and feature weighting based on Fisher linear discrimination analysis. Moreover, Sáez et al. [22] proposed a novel feature weighted scheme based on interpolation method and Kolmogorov-Smirnov parametric statistical test for the performance improvement limited to noise-containing, redundancy, and weak correlation features; Ismail and Frigui [23] proposed a robust unsupervised learning algorithm for finite generalized Dirichlet hybrid model for realizing fuzzy clustering and feature weighted of noise-containing high-dimensional data. Similar to the selection of the optimal feature subset in feature selection, the determination of feature-weighted weight coefficient can be treated as a class of combinatorial optimization problems, so various evolutionary algorithms (including taboo search, harmonic search, and multiobjective optimization algorithm) [24] have been introduced into the construction of feature-weighted framework. However, most of the existing weight coefficient calculation methods based on feature evaluation criterion learn the sensitivity of features from a single measure such as distance, information, and correlation, which often causes poor learning results of fault feature sensitivity, which is not conducive to the improvement of feature clustering and classification performance. Based on the research of scholars, this paper proposes a multimeasure mixed evaluation model for accurate and comprehensive evaluation of bearings.
This paper studies the bogie traction motor bearing in urban rail vehicles and evaluates the reasonable bearing transportation protective measures during road transportation. A multimeasure hybrid evaluation model based on compression perception is proposed and applied to the transportation protection example of Ningbo Line 3 bearings, to verify the feasibility and technical advantages of the proposed method, evaluate the best protection measures to reduce the failure rate of bearings, provide theoretical basis and technical reference for the characterization of the transportation protection effect of urban rail vehicles, and then provide certain guarantee for the safe operation of urban rail vehicles. The overall technical route for evaluating the protection effect of urban rail vehicle bogie traction motor bearings under different transportation protection is shown in Figure 1

Compression Sampling of Bearing Vibration
Signal in the Field of Rail Transit The status data of the mechanical equipment can reflect the real-time operation state of the equipment, which is an important basis for obtaining the changing trend of the equipment status and analyzing the root cause of the equipment failure. In the fault diagnosis of rail transit field, status data is an indispensable and important resource related to the operation, monitoring, and management of urban rail vehicles. In order to accurately grasp the health state of the equipment, it is the urgent problem to solve the imbalance of bearing dataset. Compression sensing theory provides new ideas for signal acquisition, analysis, and feature extrac-tion, which is currently rarely involved in fault diagnosis in the field of rail transit. Compact data is used to obtain compressed data and deal with the storage of massive data, analyze compressed data from compressed data, reduce analysis and calculation links, save computational resources, and improve the efficiency of monitoring and diagnosis. The application of compression sensing theory to the signal sampling of equipment in rail vehicles provides an effective solution to the problem of data imbalance and its application in the storage, transmission, and analysis of urban rail vehicle equipment data. CS first performs the sparse transformation processing of the original signal and then measures the sparse data with the observation matrix, obtains the observation value far lower than the original signal dimension, and then restores the signal reconstruction algorithm of the compression perception theory framework to obtain the reconstruction signal with very little error with the original signal. The specific process is shown in Figure 2. CS has been widely studied in many areas, but relatively little in the field of mechanical troubleshooting. This paper uses CS technology for the vibration signal of vehicle key components in the field of urban rail transit. In the whole evaluation process of the traction motor bearing of urban rail vehicle bogie, the processing process of compression sensing from the processing process of bearing vibration signal is mainly divided into two parts: one, acquisition and compression of signal; two, data reconstruction and analysis. The processing process of the bearing signal of the traction motor is shown in Figure 3.

Sparse Representation of the Signals in the Compressed
Sampling. Sparse representation of signal in perception observation problem: compressed perception theory is the inverse problem of sparse representation theory, whose theory foundation implies the basis of signal sparsity. Shown in Figure 4 is a schematic diagram of sparse representation theory. The sparse representation of the signal is to transform the signal to a particular sparse space, yielding a small value of most of the transformation coefficients, namely, the resulting transformation coefficients are sparse or approximately sparse. Finding a base or compact frame Ψ makes the signal x sparse on Ψ and finding the transform coefficients: S = ΨXwhere S is an equivalent or approximated sparse representation of X. The choice of transform basis Ψ can be some kind of basis that has been widely used, such as wavelet base, Fourier base, and local Fourier base. In addition, a compact framework can be used to make a sparse representation of signals such as curve waves and contour waves, two classes of transform bases with better directivity [25]. The prerequisite for compressed sensing is that the signal must be sparingly sparse. For most signals, it is usually not sparse, but it is always able to find the appropriate base for the sparse transformation of the signal.

Selection of the Observation Matrix in the Compressed
Sampling. Main content of the signal observation research: how to design the M × N-dimensional observation matrix Φ, unrelated to the sparse basis to ensure that after the signal x decreases from N dimension length to M dimension, the 3 Journal of Sensors main information is still not lost, and the original signal x can be recovered from the dimension-dimensional signal y. The core of signal observation: to study sampling protocols that can effectively extract useful information in the original signal and to obtain as few sampling points as possible. The need x to design a stable m × n dimensional observation matrix Φ is required to obtain the S set Y = ΦS = ΦΨ T X. This process can also be represented as a signal x for nonadaptive observations via matrix A cs : Y = A CS X (where A CS = ΦΨ T is called the CS information operator). The concern is the selection of the observation matrix Φ, which needs to guarantee that the important information is not broken when the sparse vector S drops from n dimension to m dimension. In compressed sensing theory, an important criterion for judging whether a matrix can become a measurement matrix is limited isometric property (RIP). For example, for the k sparse vector S ∈ R N , when it satisfies formula (1), the measurement matrix Φ satisfies the RIP. Most random matrices satisfy RIP, such as Gaussian and Bernoulli stochastic measurements [17].      Journal of Sensors 3.3. Design of the Reconstruction Algorithm in the Compression Sampling. The reconstructed recovery signal mainly solves the owed problem under a given sparse constraint and recovers the original signal with high probability from partially sampled data. The reconstruction of signals refers to the reconstruction of the original signal y ∈ R M×1 from the dimension reduction signal x ∈ R N×1 . The key problem is how to design efficient, robust, and stable reconstruction algorithms. At the present stage, the CS reconstruction algorithm can be roughly divided into four categories: the first class: greedy iterative algorithm, aiming for the combinatorial optimization problem, which mainly take the link between signal and atomic dictionary as a more effective or nonzero way to measure atoms (coefficients). The fundamental principle is to find the support set of sparse vectors in an iterative way and to reconstruct the signal using a constrained support least squares estimation. The second class: convex optimization algorithm or optimization approximation methods, which find approximations of the signal by transforming nonconvex problems into convex problem solutions. The third kind of algorithm: based on the reconstruction algorithm proposed by the Bayesian framework, this algorithm takes into account the signal temporal correlation, especially when the signal has a strong temporal correlation, which can provide superior reconstruction accuracy than other reconstruction algorithms. Class 4: other algorithms: these methods support rapid signal sampling and reconstruction by packet testing, such as Fourier sampling, chain tracking, and Heavy Hitters on Steroids (HHS) tracking.

Evaluation Model Based on Multimeasure Mixed Evaluation Based on Compression Sensing
4.1. Compression Sensing of the Bearing Vibration Signal of the Bogie Traction Motor. The observation matrix dimension required to reconstruct the signal in the compressed sensing theory process is independent of the data sampling rate and the maximum frequency of the signal and is only directly related to the sparsity. To achieve compression sampling, the necessary sparsity processing of the signal is necessary to obtain the sparse signal in a specific space. In real life, the signal data is mostly nonsparse and requires some spatial transformation methods. In this paper, we analyze the sparse representation performance of bearings using three common discrete cosine transform (DCT) [17], discrete Fourier transform (DFT) [22], and discrete wavelet transform (DWT) [26]. In order to intuitively analyze the sparse performance from the perspective of the quantization indicators, 2% of the threshold ε, data x peak value is first set as the threshold interval. That is, ε = 2%jmax ðx i − min ðx i ÞÞj, assumes the data points between data values as ½−ε, ε to 0, and then exhausters the number of nonzero elements in N 0 data points in x, defines the sparse ratio η 2% , η 2% = N 0 /N, where N 0 represents the number of data with numerical nonzero, the less sparse than η 2% , the better the sparse ability of the signal. The generalized orthogonal matching tracking algorithm (GOMP) is chosen as the reconstruction algorithm, enter the observation matrix Φ P×l , probational measurements obtained from the observation matrix y p×l , sparsity degree k, select the number of atoms S, ðS ≤ K&S ≤ p/KÞ the output is the reconstructed signalx. GOMP algorithm training procedure: first initialize the residual value of r 0 = y, index set Γ 0 = ∅, number of changes k = 1 (line 1); continuous search for S matching atoms, k = k + 1, search out the S atoms with the largest residual product change its corresponding index value added, ; update the candidate support set as well as the residual and correlation coefficients. The corresponding support set Φ Γ K is obtained using the updated candidate atomic index set Γ K . Calculate the updated residual and correlation coefficients, If the iterative stop condition meets k < min ðK, p/SÞ, the iterative process returns to step 2; otherwise, the iteration stops with the reconstruction signalx = x Γ k , specifically, as shown in Algorithm 1.  ,⋯,T 10 Þ (respectively: peak, mean, variance, harmonic average, margin index, cliff index, wavedness index, pulse index, peak index, and deviation index). The frequency domain characteristic parameters form the original frequency domain characteristic sets ðF 1 , F 2 ,⋯,F 5 Þ (respectively: mean frequency, center of gravity frequency, root mean square frequency, standard difference frequency, and cliff frequency). Time-frequency characteristic parameters are based on the adaptive decomposition of the original fault signal using EMD. Energy features IMF = ½IMF 1 , IMF 2 , ⋯,IMF 5 extract the first five eigenmode components containing useful information and Lempel-Ziv complexity features L = ½L 1 , L 2 ,⋯,L 5 .

Comprehensive Characteristic Index of Bearing Vibration
Signal Based on Multimeasure Mixed Evaluation Model. The extraction of multicategory fault characteristic parameters is usually regarded as the basic guarantee for accurate fault diagnosis. However, if it is applied improperly, it is difficult to improve the fault diagnosis accuracy, because the sensitivity of the characteristic parameters is different. If the original Gavett collection is directly used as the input of the subsequent classifier, the advantage of sensitive characteristics on fault classification cannot be outstanding, and the correlation between nonsensitive characteristics will weaken the classification effect and improve the fault diagnosis accuracy. Characteristic evaluation just provides an effective solution for the above problems. On the basis of certain evaluation criteria, the sensitivity of each characteristic parameter is learned, and the original characteristic set is filtered or weighted according to the learning results, which thus plays the purpose of improving the characteristic clustering and classification performance. Most of the widely used feature weighted methods use the weight coefficient (weight coefficient) as the core content [22] and screen the features and weighting of high Vite solicitation based on a single sensitivity evaluation model. At present, the feature evaluation model uses more distance, consistency, information, correlation measures, etc. [26] Common distance measures are pasteurized distance, Euclidean distance, and Marchhedean distance; information measure mainly includes mutual information and information gain; consistency measure is the main indicator; correlation measure includes Pearson (Pearson) correlation coefficient [27], minimum square regression error, probability error, Fisher score, Laplacian score, linear classification and analysis, and high calculation efficiency. The above single measurement feature evaluation methods can screen deredundancy features from a specific perspective, but their limitations are strong, which cannot be comprehensive screening features and fully reflect the protection information. The existing feature-weighted methods are realizing the learning and weighted processing of feature sensitivity based on a feature evaluation model of a single measure, ignoring the complementary effect of the multimeasure evaluation model and the composite gain effect on high-dimensional feature screening. Therefore, the experimental mixed measure feature evaluation model is composed of a random combination of n typical feature evaluation models in each single measure index benchmark, and the specific implementation process is shown in Figure 5. Different measure combination mixed strategies form Q = C 2 n + C 3 n + ⋯C n n and then select the optimal measure combination strategy with the maximum variation coefficient. Secondly, the multimeasure optimal mixed 1: Initialize: residual value r 0 = y, index set Γ 0 = ∅, number of changes k = 1 2: for k training iterations do 3: for k d iterations do 4: Search for S matching atoms continuously 5: The S atoms with the largest inner product of residuals are searched and added with their corresponding index value 6: Γ K = Γ K−1 ∪ P 8: end for 9: for k p iterations do 10: update Φ j , r j , Γ j 11: The updated candidate atomic index set is used to obtain the corresponding support set, and the updated residuals and correlation coefficients are calculated. 12:  Journal of Sensors evaluation model is calculated as the combined feature weight and the original eigenvalue. The detailed flow is shown in Figure 5. Ten common feature evaluation models are selected as alternative subevaluation models for the mixed measure strategy model. It includes information measurement, correlation measure, and distance measure feature evaluation model. Due to the large number of mixed measure models with random free combination of different single measures, a large operational workload and too cumbersome mixed measurement strategy will produce a preferred comparison analysis process. Considering that the sensitivity of each single measure to the high-Witt solicitation varies, different feature subset dimensions remain when feature-weighted and filtered, cases where a measure removes only less nonsensitive features. The feature subsets are still too redundant, and it may even be difficult to select more sensitive features to form feature subsets for classification. Thus, initial screening and filtering of the submodel constituting the mixed measure model are required. A single measurement feature evaluation model with good sensitivity discrimination is retained. Avoid the difficulty of learning the poor compre-hensive performance of a large number of mixed measure combination strategy sensitivity, resulting in the optimization process. Therefore, a preliminary selection of a single measure feature evaluation model before a mixed combination of a single measure feature evaluation model is done, and several single measure feature evaluation models with outstanding sensitivity are selected for the subsequent random mixing strategy and then the principle of the largest discrete coefficient of the mixed measure sensitivity and the largest cliff to select the best measure combination strategy.
Starting from ten common single measure feature evaluation models, the sensitivity score of time domain, frequency domain, and time-frequency domain of three different protective states of bogie is obtained of ten single measure models. It is found that different measures have different sensitivity to the feature set, some single measures are very sensitive to the feature set, and it is almost impossible to get a reduced subset of good features. Some single measures have good sensitivity learning effect to the feature set, high sensitivity features are more prominent, and nonsensitivity features are weakened, making the feature sensitivity differentiation more obvious. This section preliminarily selects  7 Journal of Sensors ten single measure evaluation models and is selected as the submodel of the subsequent mixed measure feature evaluation model. Because the dimension of each measure is inconsistent, but the dimension of the feature on each single measure is comparable, a uniformly defined threshold σ i = ð∑ j l λ i j Þ/j, λ i j is the type j feature sensitivity score of a single measure which is set for each single measure. If the sensitivity score of more than 10 characteristics of a single measure is greater than the mean value σ i , the sensitivity score learning effect of the single measure is poor, and if the single measure is abandoned, the submodel for the subsequent evaluation model of mixed measurement characteristics is retained. Secondly, construct a mixed measure feature sensitivity learning model with cumulative effect composed of individual measures, whose mathematical expression is as follows: In the formula, w H j represents the comprehensive measure sensitivity value of the j feature of the sample feature set in the H hybrid strategy; w 1 j , w 2 j , ⋯, w n j , respectively, represents the 1, 2, 3, ⋯, n kind single measure sensitivity value result of a sensitivity learned j feature in the feature set of a single measure feature evaluation model of w 1 j ∼ w p j which is a sensitivity learning value that is positively    9 Journal of Sensors avoid the diversity of sensitivity learning results of different single measure feature evaluation models, affecting the sensitivity learning of mixed measure evaluation models, the sensitivity score value w i j of each single measure model should be normalized. The formula for the normalization of the comprehensive measure sensitivity score w i j of the type j feature in the type i hybrid model is as follows:w i j = 1 + 9 After product mixing, the comprehensive measure sensitivity learning results w i j are obtained. The significance is to compare and evaluate the comprehensive measure sensitivity scores w i j obtained from the product effects of different scales under the same scale coefficient, which also makes the sensitivity of high-sensitivity features more prominent and weakens the effect on classification and clustering in the nonsensitivity features. Thus, the clustering effect of the optimal feature subset is improved.
In order to evaluate the multimeasure mixed evaluation model after a single measure feature evaluation model, the multimeasure evaluation model for the multimeasure evaluation model with the maximum discrete coefficient and the maximum cliff and extract the sensitive feature subset of the best combination strategy for subsequent clustering and evaluation. The core idea is a dimensionless parameter index that reflects the degree of data discretization according to the discretization coefficient and cliff of the time series. The specific criterion of Ψ m = ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi V m × K m p is that the greater the cliff value K m and the greater the discrete coefficient V m of the mixed characteristic sensitivity score sequence, the higher the discrete degree of the characteristic data sequence and the greater the Ψ m value. The reaction performance in the feature concentration is strengthening the feature sensitivity learning effect with high sensitivity and weakening the learning effect of nonsensitivity features. Make the high-sensitivity features stand out in the mixed measure strategy. Thus, it can be considered that the higher the cliff value of the eigenvalue and the greater the discrete   Journal of Sensors coefficient of the mixed characteristic sensitivity score value, the hybrid measure is best effective in sensitivity learning, and the more streamlined the subset of eigenvalue used for clustering classification. The formula for calculating the discrete coefficient of the comprehensive feature sensitivity score sequence defining V m as the m kind mixed measure feature evaluation model is as follows: In the formula, S m is the standard difference in the sequence of mixed feature sensitivity scores of type and w H−m is the mean of the mixed feature sensitivity score series of the m kind mixed measure feature evaluation model: In the formula, x m , x m , and σ m are the data series, mean, and standard difference of the mixed measure feature evaluation model of type m, respectively.
The specific criterion of Ψ m = ffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi ffi V m × K m p is based on the greater the cliff value K m and the greater the discrete coefficient V m of the mixed characteristic sensitivity score sequence, the higher the characteristic data sequence and the greater the Ψ m value. The reaction performance in the feature concentration is strengthening the feature sensitivity learning effect with high sensitivity and weakening the learning effect of nonsensitivity features. Make the high sensitivity features stand out in the mixed measure strategy. Thus, it can be considered that the higher the cliff value of the eigenvalue and the greater the discrete coefficient of the mixed characteristic sensitivity score value, the hybrid measure is best effective in sensitivity learning and the more streamlined the subset of eigenvalue used for clustering classification.
The traditional feature evaluation model with a single measure usually considers only from a single angle when extracting the optimal feature subset, which makes it difficult to evaluate the vibration signals in the whole time domain under different protection conditions. In order to better evaluate the protection effect of traction motor bearings under different protection conditions during transportation, this chapter proposes a multimeasure mixed evaluation model, which can extract multicategory and multimeasure characteristic parameters as feature sets. The novelty lies in that based on the feature index, sensitivity learning is carried out for the features in the multianalysis domain, a multimeasure mixed evaluation model is established based on the sin-gle measure feature evaluation feature index, and the optimal feature subset that is more conducive to evaluating the effects of different protection states is optimized by using the comprehensive feature evaluation index. Then, the unified feature index was established based on the optimal feature subset to comprehensively evaluate the protection effect under different protection conditions.

Build a Multimeasure Hybrid Evaluation Model
Based on Compression Sensing. In this paper, based on the actual operating conditions of bogie traction motor bearings in urban rail vehicles, because the bearings are difficult to be effectively evaluated under different transportation protection, a multimeasure hybrid evaluation model based on compressive sensing is proposed. Firstly, vibration signals of traction motor bearing are sampled and compressed based on compressive sensing theory. Secondly, an optimal hybrid model feature evaluation framework with single measures such as distance, correlation, and information is constructed to learn feature sensitivity from the original feature set composed of time-frequency, frequency-domain, and frequency-domain feature parameters. At the same time, to design comprehensive features based on sensitivity score sequence variation coefficient of the multimeasure portfolio strategy optimization method and mixed with the optimal combination strategy measure corresponding comprehensive evaluation model to the original fault feature set sensitivity study and then the optimal comprehensive score of each feature sensitivity for each feature weights, build a new weighted feature set. Finally, the proposed feature sensitivity learning method is applied to the transportation of bogie traction motor bearings of Ningbo Line 3 urban rail vehicles, so as to verify the feasibility and technical advantages of the proposed new method. This method provides a technical basis for the difficulties existing in the traction motor bearings of transit sentence frame in the field of rail transit and provides a new idea for the data collection and reduction of vibration signals of bearings in the process of transportation, as well as reasonable transportation protection. The test object is the power bogie of Ningbo Line 3. The bogie wheel pair is supported and fixed with iron shoes. In the transportation process, the bogie two guided motors adopt different protection methods to test the vibration acceleration of bogie and motor bearings in Zhuzhou to Ningbo section. In the process of road transportation, motor bearings are protected as shown in Figure 6. Protection status 1 and protection status 2 are used for the two motors, respectively, during the departure, and protection status 1 remains unchanged while protection status 2 is changed to protection status 3 during the return, so as to study the protective effects of the three protection status on traction motor bearings. Figure 6 shows the defense mode in defense state 1, Figure 7 shows the defense mode in defense state 2, and Figure 8 shows the overall layout in defense state. In 13 Journal of Sensors addition, the coordinate directions of the three-way acceleration sensor are defined. Since the vertical direction along the Z direction is most affected by road turbulence, the analysis of vertical vibration acceleration signal is mainly carried out. Table 1 shows three settings of the same protection state.

Measuring Point Arrangement of Traction Motor
Bearing. A total of 8 acceleration sensors are arranged on the bogie, and the acceleration sensors at each measuring point are fixed by glue and magnetic suction seat. According to the designed sensor point position and sensor type, the sensor is arranged on the bogie traction motor. According to the position of the traction motor bearing in the bogie, in order to better collect the bearing vibration signal in the process of traction motor in the road, the arrangement of vibration sensor measuring points is shown in Figure 9. Figures 10(a)-10(h) correspond to sensors numbers 1-8.

Bearing Data Compression Perception.
After the experimental test, the experimental data was up to 254 GB and was difficult for subsequent data analysis. This paper proposes a method to solve the problems of data mass and redundant data. Firstly, the vibration signal of a transport bearing is randomly selected to conduct compression sensing sparse representation analysis, and the DCT, DFT, and DWT transformation of the experimental signal are performed. In Figure 11, the sparse ratio of the original vibration signal is 91.4%. Figure 12 intuitively shows that the sparse ratio is 27.6% after the DCT transformation, and the sparsity of the vibration signal after the DCT transformation is higher than that of the original signal. Figure 13 is the result of the vibration signal after the DFT transformation, and the sparse ratio of the DFT coefficient is 24.4% and 25.2%, respectively, which is more prominent compared with the DCT transformation. Figure 14 shows that the sparsity of the vibration signal is well improved compared with the original signal. The sparse score of Daubechies wavelet 14 Journal of Sensors lengths 8, 12, and 16 is 41.8%, 39.6%, and 37%, and the change of Daubechies wavelet length has little impact on the sparsity after the coefficient transformation. Next, consider the influence of different sparse dictionary transformations on the compression effect of the bearing vibration signal, by setting three different sparse transformation dictionaries (DCT, DFT, and DWT), in which the Daubechies wavelet length of DWT is set to 16, selecting Gaussian random observation matrix and GOMP reconstruction algorithm. Using these three different sparse dictionaries to perform data compression measurement and reconstruct the experimental signal, we obtain the parameter indicators of the signal reconstruction performance of three different sparse dictionaries. Comparing Figures 15-17 shows that the signal reconstruction performance index of the base DFT is superior to that of DCT and DWT; in three different sparse transformation methods, DFT has better sparse performance and performance in data compression.

Quantitative Characterization of the Bearing Protection
Effect Based on the Unified Characteristic Index. The original high-Witt collection including time domain features, frequency domain features, energy features based on EMD, and Lempel-Ziv complexity features of EMD is extracted. In order to further realize the dimensionality reduction of high-Witt collection and improve the performance of cluster classification, ten sensitivity scores as described above are learned, and the 3, 6, 7, and 10 single measure feature evaluation models are selected as the submodels of mixed multimeasure feature evaluation models according to the sensitivity learning results. They are intraclass and interclass integrated distance model, Pearson correlation coefficient model, Fisher score model, and Laplacian scoring model. The best sensitivity feature subset sensitivity score distribution under the four single measurement feature evaluation indicators is shown in Figure 18. A hybrid measure feature evaluation model is constructed for the four preferred measure evaluation submodels. Since four random combinations into mixed measure models have 11 combination methods, the 11 mixed measure feature evaluation models learn the sensitivity, obtain the comprehensive sensitivity score sequence, and obtain the discrete coefficient, cliff value, and comprehensive evaluation index of the sensitivity learning results based on the comprehensive sensitivity score evaluation index Ψ m . The results are shown in Table 2.
It can be seen from the table above that the discrete coefficient, cliff value, and comprehensive sensitivity score evaluation indicators of the mixed measure feature evaluation model after the random combination of four single measures are best performed when the 3, 6, 7, and 10 measures are fully combined. It can therefore be argued that the hybrid strategy exhibits outstanding performance in strengthening the sensitive features, weakening the nonsensitive features, and a subset of reduced optimal features. Features with large sensitivity scores are extracted as subfeatures of the subset of optimal feature subsets. Its feature number is 1, 2, 3, 7, 9, and 16.
To further verify that the extracted sensitivity scores perform well for the clustering effect of a subset of optimal features, principal component analysis (PCA [28]) of the best subsets of the optimal evaluation model is applied. To facilitate the application of PCA, visual analysis is performed. Figure 19 shows a visual analysis diagram of the first characteristic subset of the 3, 6, 7, and 10 single measure evaluation models. It can be seen that the best characteristic subset characteristics of the 3 and 6 single measures cannot be classified and clustered by data samples regardless of the serious travel overlap between the distance and different categories. The best characteristic subset corresponding to the 7 and 10 single measure evaluation models can cluster different protective state data samples effectively, but the distribution is scattered and the inner class range is large. From the perspective of interclass distance, the clustering effect is not prominent, and the optimal multimeasure mixed evaluation model is far better than the other four single measure subsets in classification clustering. A Cartesian product is performed for the preferred optimal subfeature subset based on a multimeasure mixed eval-uation model. The Cartesian product calculation expressions defining the unified feature metrics are

Journal of Sensors
In the formula, v 1 ∼ v p protection is negative, and v p+1 ∼ v n is the positive value with the protection effect, where Q is a unified feature indicator; 1 ≤ p ≤ n, 1 ≤ n ≤ 6 . The characteristics of the characteristic subset are negatively related to the protection effect, that is, the greater the characteristics, the worse the protection effect, so the comprehensive characteristic indicators. And the larger the unified characteristic indicators, the worse the protection effect. Figure 20 shows the distribution of different protective states in the time domain and the comprehensive evaluation from a single feature. Figure 21 is the comprehensive characteristic index of bogie bearing vibration signal after characteristic combination. The unified characteristic index based on the full time domain is the most significant in the protection state 3 (no protection) and the worst protection effect, while the unified characteristic index of protection state 1 is not significant; its peak is far less than the protection states 17 Journal of Sensors 2 and 3, the best protection effect, and the protection effect of protection state 2 is secondary.

Conclusion
In this paper, the state characterization of bogie traction motor bearings under different transportation protection is studied, and the multimeasure hybrid characterization model based on compressive sensing theory is proposed, which is verified by the highway transportation example of Ningbo Line 3 urban rail vehicle bogie traction motor bearings. Experiment: when the protection state is undecoupled + the screw is not jacked (protection state 3), the bearing damage is the most serious. The second is when the protection state is decoupling + screw jacking (protection state 2). In the case that the protection state is undecoupled + binding + screw top (protection state 1), the bearing has almost no damage. That is, the protection effect is protection state 1 > protection state 2 > protection status 3.

Data Availability
The data and the Matlab programs used to support the findings of this study are available from the corresponding author upon request.