A Fatigue Life Prediction Method for the Drive System of Wind Turbine Using Internet of Things

-e wind turbine drive system is one of the key components in converting wind energy into electrical energy.-e life prediction of drive system is very important for themaintenance of wind turbine.With increasing capacity, the wind turbine system has become more complicated. Consequently, for the life prediction of drive system, it is necessary to consider the problems of multiinformation fusion of big data, quantification of time-varying dynamic loads, and analysis of multiple-damage coupling. In order to solve the above challenges, the fatigue life analysis and evaluation method considering the interaction of coupled multiple damages are proposed in this study. -e hierarchical Bayesian theory with fault physics technology is introduced to deal with the uncertainty of wind turbine drive system. -en, a time-varying performance analysis model is established based on the multipledamage coupling competition failure mechanism. Moreover, the Internet of-ings (IoT) technology is introduced and combined with the proposed model. -rough the data collection by IoT, the time-stress curve of drive system can be obtained. A case study about the remaining fatigue life estimation of drive system is utilized to illustrate the effectiveness of the proposed method.


Introduction
Wind energy is a common clean and renewable energy. e effective utilization of wind energy can reduce the energy crisis and protect the environment. e early development of wind energy is mainly carried out on land. Engineers have successfully built many onshore wind turbines. Compared with offshore wind farms, onshore wind farms are more mature in technology and management [1]. Generally, the offshore wind farms are located on the ocean and have very little impact on humans. erefore, offshore wind power will account for more proportion in the future wind power industry.
Uncertain environment has restricted the development of offshore wind turbines and threatened the wind turbine drive system security [2]. e design requirements for offshore wind farms are more difficult than those for onshore wind farms [3][4][5][6]. Risk assessment can improve the reliability of systems [7][8][9][10][11]. For wind turbine drive system, it is important to study the failure mechanism, fault diagnosis, and life prediction of key components. Accordingly, recent research involves multiple subjects like dynamics, material mechanics, fatigue, reliability, and signal analysis [12,13].
Taking the wind turbine planetary gearbox as an example, the dynamic analysis mainly focuses on the characteristics of gears and bearings. e gear dynamics model mainly analyzes the gear mesh stiffness characteristics [14], the tooth surface contact force [15,16], and the influence of assembly and machining errors [17]. e bearing dynamics model mainly analyzes the interaction force between the rolling element and the cage [18,19] and the influence of the cage parameters on the bearing characteristics [20]. During the operation of the gearbox, the tooth surfaces and roots are subjected to alternating and impact loads, which leads to fatigue damage of the gears. Many uncertainties also cause the failure of the bearing.
us, the dynamic model is proposed for failure mechanism analysis, fault diagnosis, and life prediction. e research of gear failure mechanism starts from the tooth root crack [21], the tooth surface pitting [15], and the relationship between crack change and meshing stiffness change [14]. e research of bearing failure mechanism starts from the change of bearing rolling element clearance [22,23], the number of pressure rolling elements [24], the lubrication characteristics [25] and the cage flexibility characteristics [26]. Recent research results include rolling bearing-rotor systems [27], gear transmission systems [28], gear-rotor-bearing system [29], and gearbox bearing system [30]. e main working mode of wind turbine planetary gearbox is that the load changes with the wind speed. erefore, the fault diagnosis is the core technology in research process. Many methods have been proposed, such as using instantaneous power spectrum and genetic programming methods to identify fault feature parameters [31]; fault diagnosis of rotating machinery under variable operating conditions based on wavelet packet coefficients and entropy information indicators [32]; and fault information extraction based on the Vold-Kalman method [33].
In the process of fault diagnosis, it is necessary to analyze the internal and external load characteristics. However, the current research mainly focuses on the relationship between a certain fault state and system dynamics. ese research works are short on developing dynamic fault response mechanism. is paper proposes the coupling fault diagnosis mechanism and the fatigue life prediction method based on wind farm complex conditions.
Many scholars have carried out research on product remaining life prediction methods, such as model-based methods, data-driven methods, and hybrid methods [34][35][36][37]. e product monitoring data have the characteristics of large quantity and variety. Consequently, it is necessary to achieve automatic real-time analysis of product monitoring data by the Internet of ings (IoT). Deep learning methods are also effective ways to solve this problem. However, most of them cannot solve the product remaining life prediction problems. Hence, the product remaining life prediction based on the Internet of ings has a good research prospect.
In this paper, a remaining life prediction method based on deep learning is proposed, and it can be operated under multiple failure models.
is method can integrate the multisensor monitoring data with the dynamic working condition data into the fusion data. rough the usage of bidirectional neural network in the deep learning, the time correlation model of the fusion data can be obtained. en, this method can achieve nonlinear regression analysis of the fusion data and product remaining life under multiple failure models. is paper takes the remaining life prediction of wind turbine drive system as a case to prove the effectiveness of the method.

The Construction of Internet of Things (IoT) in the Drive System of Wind Turbine
e working environment of wind turbine drive system is hard, so it is necessary to monitor working conditions and environmental information. erefore, a sensor is used to collect information as data support for the system. e sensor can convert physical signals into electrical signals so as to satisfy the actual demand [38]. Its accuracy affects the performance of the system, so the selection of sensors should take the system requirements and environment factors into consideration. e state monitoring and life prediction system of the wind turbine drive system based on the Internet of ings can be divided into three functional levels. (a) Perception layer: e perception layer includes transmission equipment, sensors, and wireless sensor nodes. Its function is to perceive the working conditions of the transmission system. (b) Transmission layer: e transmission layer uses the CAN communication protocol to achieve wired communication between devices and uses the ZigBee communication protocol to build a wireless sensor network. Its function is to deliver the working condition information of the transmission system. (c) Application layer: e application layer is composed of PC/104, operating system, software, and life prediction algorithm. Its function is to achieve acceptance, storage, display, and intelligent processing of the working condition information and manage the entire life cycle of the wind turbine. Figure 1 shows the system hardware framework. e wireless sensor network collects speed and temperature information of the transmission system and delivers them to the PC/104 for processing. At the same time, the ARM processor collects fault signals, status signals, current signals, and vibration signals of the transmission system and delivers them to the PC/104 for processing. e PC/104 processes the collected information and displays the results and status monitoring information on the LCD screen. e PC/104 can share the information with the centralized control center through the Ethernet communication module. en, the centralized control center can monitor the transmission system in real time.

Data-Driven Fatigue Life Prediction Based on IoT
A large amount of data has been accumulated during the sensor operation. After being processed, the data still contain unused information. e data-driven multivariate statistical algorithm can use these data to carry out process monitoring, fault monitoring and diagnosis, and quality and life prediction [39][40][41]. e product remaining life is the time interval between the current moment and the failure moment. Specifically, the product monitoring information at this moment is compared with the historical information to get the comparison conclusion and predict the product remaining life.

Data Collection.
e degradation status information of products during operation is monitored by multiple sensors. en, the data is defined as online measurement data. e monitoring data of training products are recorded until failure and saved to s i , y i L i�1 . s i ∈ R M×p represents the online monitoring data of the i-th training product. e data are monitored by p sensors. Each sensor employs M data points. L is the total number of training products. y i represents the degradation state of the i-th training product and needs to be detected offline. e monitoring data of texting products are recorded and saved to s j C j�1 . s j ∈ R M×p represents the online real-time monitoring data of the j-th testing product. C is the total number of testing products. e information about the degradation status and remaining life of the testing product is unknown, and it needs to be further estimated based on s j C j�1 .

Data Preprocessing.
e data preprocessing can be divided into three sections, namely, feature extraction, data regularization processing, and sliding time window processing. e feature extraction of the monitoring data can reduce the data size and increase work efficiency. e noisy and abnormal data points will affect prediction accuracy of the method. e feature extraction can reduce the abovementioned adverse effects. First, the time domain features of the monitoring data can be extracted, such as mean, standard deviation, root mean square, skewness, and kurtosis. Second, the frequency domain features of the monitoring data can be extracted by the fast Fourier transform technique, such as average frequency, spectral skewness, spectral kurtosis, and spectral power. ird, the time-frequency domain features can be extracted by the empirical mode decomposition and wavelet change technique.
After the above steps, the monitoring data s i L i�1 and s j C j�1 can be converted into set x i L i�1 and x j C j�1 . x i ∈ R 1×N and x j ∈ R 1×N represent the initial features of the i-th training product s i and the j-th testing product s j . N represents the number of initial features. However, the extracted features have different data size, so the z-score regularization method is used to normalize data, as follows: where x k d represents the d-th feature quantity for the k-th sample, ⌢ x k d represents the result after normalization, and μ d and σ d represent the mean and standard deviation of the d-th feature in set.

Life Prediction Method Based on Particle Filter Algorithm.
e network structure based on the bidirectional long shortterm memory (Bi-LSTM) model can be divided into two sections: (1) extracting the time-dependent features of normalized data by the Bi-LSTM network; (2) processing the time-dependent features and outputting the prediction result of product degradation indicators. A schematic diagram of the correlation between the two sections is shown in Figure 2.
More details can be found as follows: Step 1: the training set X k , y k K k�1 can be obtained by processing monitoring data.
represents the data of the k-th training feature. t TW represents the length of the data. x k t ∈ R 1×N represents the normalized feature data at time t. y k represents the product degradation state of X k . e value of y k is the data of the offline degradation state. X k is normalized and its time-dependent features are extracted by the Bi-LSTM network, as follows: where f 1 represents the hidden layer function of the Bi-LSTM network. e parameter set of the Bi-LSTM network is described by (Θ → BI−LSTM , Θ ← BI−LSTM ). All output features of the Bi-LSTM network are given in (2). However, only the output features of the last time point of the second layer are used for the fully connected layer; the calculation process is as follows: where h → t TW and h ← t TW represent the output features of the forward and reverse Bi-LSTM networks at the last moment, and ⊕ represents the sum operation point-by-point.
Step 2: e output feature h t TW of the Bi-LSTM network is used in the fully connected layer to further extract the advanced features. e calculation process is as follows:

Advances in Materials Science and Engineering
where o k is the output features of the fully connected layer and Θ FC is the parameter set of the fully connected layer, including the weight matrix W F and the bias vector b F . en, o k is used to predict the degradation state of the product. e process can be described by the following equation: where y k is the prediction of the product degradation index and W R is the weight vector of the last linear regression layer.
Step 1 and Step 2 both use single-layer bidirectional long short-term memory neural network expression and fully connected neural network expression. In actual engineering, more layers can be stacked to form a deep structure. However, the deep neural network structure has a more complex model and needs more training data. e calculation process of the model forward propagation has been given earlier. en, the model loss function based on mean variance can be defined as follows: where y k represents the normalized value of the degradation indicators in the shutdown state, y k � f Θ (X k ) represents the prediction of the product degradation indicators X k , and f Θ (•) is defined by (2)-(5). For some complex products, the degradation state data is difficult to obtain online directly because of actual engineering limitations. Hence, online monitoring techniques are used as auxiliary means to obtain indirect data and complete the estimation of the degradation state. In order to define the problem of product degradation state estimation, firstly, the relationship between the degradation state and time is given as follows: where k represents the time point, f(•) represents the product state transition model, y k−1 represents the internal degradation state of the product at time k-1, and v k−1 represents an independent and identically distributed process noise. At any time point k, the online measurement data s k ∈ R M×p of the product can be obtained, and the nonlinear relationship between s k and y k is given as follows: where h(·) is the observation model and w k is an independent and identically distributed measurement noise. e online measurement data is collected by multiple sensors at very high sampling frequency, and its signal to noise ratio is low, so it is difficult to obtain the display analytical model between the online monitoring data and the internal degradation state. erefore, the online measurement data is processed earlier, including feature extraction, normalization, and serialization. en, the online monitoring data can be converted into a set of normalized data . Finally, the model between the extracted feature x k and the internal degradation state y k is established based on the Bi-LSTM network and is given as follows: where z k represents the predicted output value of the product degradation index, y k represents the internal degradation state of the product in the shutdown state, and w k is the measurement noise and follows a normal distribution.
In the framework of Bayesian learning, the main task is to calculate the posterior probability distribution p(y k+1 | s k+1 ) of the product degradation state when the realtime online measurement data s k+1 is known [42][43][44][45][46][47]. s k+1 is processed and transmitted to the Bi-LSTM network to obtain the product degradation index predicted output value Figure 2: e diagram of the correlation between the two sections.
us, the posterior probability distribution p(y k+1 | z k+1 ) is actually obtained. If p(y k | z k ) is known at time k, the probability density function p(y k+1 | z k ) based on the Chapman-Kolmogorov equation is given as follows: p y k+1 z k � p y k+1 y k y k z k dy k . (10) At time k + 1, the new online measurement data s k+1 is processed and transmitted to the Bi-LSTM network to obtain the product degradation index prediction output z k+1 . en, the posterior probability distribution p(y k+1 | z k+1 ) can be updated based on the Bayesian criterion as follows: where p(z k+1 | y k+1 ) represents the likelihood function at time k + 1 and can be defined by (9), the probability density distribution p(y k+1 | z k ) can be calculated through the prediction, and p(z k+1 | z k ) represents the normalization factor. e calculation process is as follows: p z k+1 z k � p z k+1 y k+1 y k+1 z k dy k+1 . (12) Because of the nonlinear features, it is difficult to obtain exact solutions of above equations, so particle filtering is introduced to solve these problems. e structure of particle filtering is actually the Monte Carlo method (which refers to the probability of the event by the frequency of occurrence at a certain time) with a layer of importance sampling proposed in it. e basic idea of this method is to use a set of samples (or particles) to approximate the posterior probability distribution of the system, and then use this approximate representation to estimate the state of the nonlinear system. Using this idea, particle filtering can handle any form of probability in the filtering process. At the same time, p(y k+1 | z k+1 ) is approximated by Bayesian filtering method based on Monte Carlo simulation. If p(y k | z k ) is known at time k and can be described by discrete values y 1 k , . . . , y i k , . . . , y N k and their weights ω 1 k , . . . , ω i k , . . . , ω N k , the calculation process is given as follows: where N i�1 ω i k � 1, δ(·) represents the Dirac δ function. Equation (10) can be rewritten as follows: where i represents the i-th particle, N represents the total number of particles, and p(y k+1 | y k ) can be calculated based on the state transition model. e weight value ω i k of (9) can be updated by the importance sampling method, and the update process is given as follows: where p(z k+1 | y i k ) represents the likelihood function of the predicted output z k+1 .
In order to avoid the particle degradation during the operation of the particle filtering algorithm, the resampling method based on the inverse cumulative function is used to obtain particles with equal weights. e steps are as follows: Step 1. Construct the cumulative density function based on the likelihood function p(z k+1 | y i k ).
Step 2. Sample a random value from the uniform distribution U (0, 1) and use it as the input of the cumulative density function in Step 1.
Step 3. Take out the particle whose value is most similar to the output of CDF in Step 2 and consider it as the result after resampling.
Step 4. Repeat the calculation process of Steps 1-3 N times and obtain N equal weight resampling particles.
en, use these particles to approximate the posterior probability density distribution p(y k+1 | z k+1 ).

Doubly Fed Wind Turbine Drive System.
e wind turbine drive system can transmit torque and change rotating speed. Its structure has a great influence on the layout of wind turbine. At present, the wind turbine has two types: doubly fed and direct-drive. e difference between them is the structure of the drive system. e drive system of the doubly fed wind turbine has a speedincreasing gearbox, while the drive system of the directdrive wind turbine does not have one. e object of this paper is the doubly fed wind turbine drive system. e wheel hub of the doubly fed wind turbine is connected to one side of the low-speed main shaft. e other side is connected to the input of the speed-increasing gearbox. e output of the gearbox (high-speed shaft) is connected to the generator rotor through the flexible coupling. e hydraulic brake disc is installed on the highspeed shaft. e entire drive system is installed on the main frame. e thrust and radial load generated by the wind wheel are transmitted to the main frame first and then to the tower.
Because of the speed-increasing gearbox, the generator rotor can reach a high rotating speed and the requirement for magnetic poles is low. e gearbox calls for a high speedincreasing ratio and use multistage planetary gears combined with fixed-shaft gears. e characteristics of the doubly fed wind turbine drive system are as follows: (1) e operating conditions change with the wind speed and lead to constant change of vibration signal amplitude and frequency. As a result, the signal contains a large number of uncertainties. (2) e drive system is located in the top of the tower. Strong wind will cause the body shake. Hence, the signal will contain noise interference. (3) e mechanical structure of wind turbine drive system is complex. It calls for high standard of related technologies. (4) Wind turbines are Advances in Materials Science and Engineering severely affected by environmental conditions and have high requirements for the reliability and anti-interference ability of the test system.

Dynamic Modeling of Wind Turbine Drive System.
is paper uses the dynamic analysis process of the wind turbine gearbox drive system as shown in Figure 3.
First of all, the external and internal incentives of the gearbox are obtained through the monitoring of the Internet of ings technology. ey are transmitted as dynamic incentives to the analysis model of gearbox system. Second, the analysis model of gearbox system processes the input dynamic incentives through centralized parameter method, finite element method, hybrid modeling method, and bong graph theory. After processing, the inherent features and dynamic responses are output, and they are used as the dynamic responses of the analysis system. Finally, the particle filter algorithm is used for condition monitoring and life prediction applications. e above is the main application process of the method proposed in this paper for the wind turbine drive system. e external excitation of the gearbox mainly includes the dynamic input torque, the nontorsional loads, the generator feedback torque, the braking torque, and the impact load. e internal excitation includes time-varying stiffness excitation, time-varying error excitation, meshing shocks, and tooth backlash.
is paper uses dynamic response analysis method, dynamic stability analysis method, and inherent characteristic analysis method to optimize the system structure. Based on the IoT technology, the dynamic model has been proposed to monitor the gearbox status and provide theoretical basis of fault diagnosis and life prediction.

Method Validation.
e gearbox wear tests are recorded as A1, A2, and A3. Each test includes online monitoring data from six signal channels, and each test collected 600 sets of online monitoring data during transmission process. In order to describe the online monitoring data clearly, 50 equidistant data monitoring points are used for each online monitoring data group.
In order to reduce the data size and improve prediction efficiency, the method extracts the time domain feature first and then the frequency domain feature. e BL-STM model has been used to simplify the feature processing flow.
During the experiment, A1 is used as the test sample, and A2 and A3 are used as the training samples. e prediction result of the final transmission pair loss is shown in Table 1. According to the experiment results, the top five features with the highest scores can achieve the best prediction performance. e influence of the feature fusion step on the final transmission pair loss prediction result is shown in Table 2.
e optimal feature set used in Table 2 is composed of the top eight features with the highest scores. It can be inferred from Table 2 that the fusion feature with the highest scores can bring the best prediction performance.
Combined with the state transfer model of transmission pair wear and its observation model based on Bi-LSTM, the particle filter algorithm is used to predict the evolution of the transmission pair degradation index. When the index exceeds the failure threshold, the remaining useful life (RUL) of transmission pair can be obtained.

Conclusions
Based on the bidirectional long short-term memory neural network and particle filter technology, the remaining life prediction method for wind turbine drive system is proposed. Compared with the traditional method, it simplifies the feature processing flow by using deep learning technology. rough the engineering examples, the effectiveness of the method proposed in this paper has been proved. e research results in this paper will further promote the engineering application of deep learning technology to the product fatigue life prediction higher safety requirements.

Data Availability
All data used to support the findings of this study are included within the article.

Conflicts of Interest
e authors declare that they have no conflicts of interest regarding the publication of this paper.