Aircraft Gas Turbine Engine Health Monitoring System by Real Flight Data

Modern condition monitoring-based methods are used to reduce maintenance costs, increase aircraft safety, and reduce fuel consumption. In the literature, parameters such as engine fan speeds, vibration, oil pressure, oil temperature, exhaust gas temperature (EGT), and fuel ﬂ ow are used to determine performance deterioration in gas turbine engines. In this study, a new model was developed to get information about the gas turbine engine ’ s condition. For this model, multiple regression analysis was carried out to determine the e ﬀ ect of the ﬂ ight parameters on the EGT parameter and the arti ﬁ cial neural network (ANN) method was used in the identi ﬁ cation of EGT parameter. At the end of the study, a network that predicts the EGT parameter with the smallest margin of error has been developed. An interface for instant monitoring of the status of the aircraft engine has been designed in MATLAB Simulink. Any performance degradation that may occur in the aircraft ’ s gas turbine engine can be easily detected graphically or by the engine performance deterioration value. Also, it has been indicated that it could be a new indicator that informs the pilots in the event of a fault in the sensor of the EGT parameter that they monitor while ﬂ ying.


Introduction
Aircraft engines produce the power needed for aircraft.For this reason, aircraft engines are very important for flight safety [1].Today's complex and advanced technology systems require advanced and expensive maintenance strategies [2].Maintenance services are costly for airline companies.For manufacturers, maintenance is a source of revenue.According to Dennis and Kambil, though after-sales service and parts sales constitute 25% of the manufacturer's income, it makes up 40-50% of company profits [3].
Because of the high cost of maintenance, gas turbine engines must be operated within specified physical limits [4].Today's aircraft engines are made safer by increasing the number of control parameters and sensors [5].The engines have a complex mechanical system.Because aircraft engines operate at high temperatures, high pressures, and high speeds, there are lots of possibilities of various faults in the aircrafts [6].
Gas turbine engines show the effects of wear and tear over time.A small fault during the flight does not prevent the engine from running, but if this fault is not detected, it could lead to a bigger fault.If these bigger faults cannot be prevented in the aircraft, it can lead to high maintenance costs and accidents.When aircrafts are taken for maintenance, the condition of the gas turbine engine is investigated by various tests and measurements [7].Condition-based maintenance (CBM) is being performed to provide effective and efficient maintenance in today's maintenance services.In the literature, there are major developments in the CBM method with studies on machine condition monitoring and fault diagnostics [8].A typical CBM program consists of three steps [9]: (i) Data acquisition step (ii) Data processing step (iii) Maintenance decision-making step In a CBM program, we can group two important parts as diagnostics and prognostics.With diagnostic monitoring, malfunctions or abnormal conditions are detected when a system or its subsystems are in operation [10].With the prognostics approach, the current state of the motor and possible future failures can be predicted.The following diagnostic systems have been applied on an aircraft: the typical diagnostic procedure consists of three steps: observation, comparison, and diagnosis, which are shown in Figure 1 [11].
In the literature, advanced engine condition monitoring methods can be classified as a model-based method (such as equality equation, parameter prediction, gas path analysis, nonlinear least square, and Kalman filter) and soft computing method (such as artificial neural networks, fuzzy logic, rough sets, decision tree, and expert systems).
In the first phase of the study, some sample studies on failure prediction in gas turbine engines have been mentioned.In the second phase, the main operating parameters of the aircraft gas turbine engines and the importance of the EGT parameter are discussed.In the third phase, multiple regression analysis is performed between the EGT parameter and the aircraft performance parameters using the SPSS program.In the next phases, N1 speed, N2 speed, pitch, angle of attack, roll, vertical acceleration, total air temperature, and ground/air variables from aircraft performance parameters are taken as input parameters to the artificial neural networks, and prediction of EGT parameters has been done in MATLAB Simulink environment to monitor the condition of an aircraft gas turbine engine.

Literature
There are many studies in the literature where different methods are used to monitor the condition of gas turbine engines used on airplanes.Zhang et al. predicted failure of aircraft engines using nonlinear adaptive estimators [12], Pourbabaee et al. predicted sensor failures in gas turbine engines using multiple model-based methods [13], Zhao et al. made a prediction of failure in the gas turbine engines using fuzzy matrix and the principle of maximum membership degree [14], Afghan et al. have designed an expert system to detect faults for gas turbine combustion chambers [15], and Zaidan et al. have made failure prediction in gas turbine engines with Bayesian hierarchical models [8].Lu and his colleagues performed a turbofan engine status monitoring with extended Kalman filter [16], Amozegar et al. have made failure prediction in gas turbine engines using dynamic neural network [17], Zhou et al. used the support vector machine method to diagnose gas turbine engine failures [18], and Chacartegui et al. have worked on real-time simulations of medium-sized gas turbines [19].

Main Operation Parameters of the Engine
There are many electronic indicators and systems that help pilots on board during flight.The pilots monitor the indicators, have knowledge of the current status of the aircraft systems, and carry out the flight.When the values appearing on the indicators indicate that they are out of the normal working range or that there is a malfunction in the systems, the pilots are able to request help from the operation services via the communication systems, or in cases where they are not considered to be very serious, pilots record them in the flight book and let them be added to the maintenance program.Pilots must monitor the main operating parameters while controlling the aircraft and making decisions.In Figure 2, fan speed (N1 speed) and exhaust gas temperature (EGT), the gas turbine engine's main operating parameters, are shown.The fan speed is usually used as a push indicator, while the EGT is the parameter that allows the gas turbine engine health to be monitored.In some engine models, engine pressure ratio (EPR) and N2/N3 speed are used to monitor thrust [5].The importance of EGT parameter is given below.
3.1.Exhaust Gas Temperature (EGT) and EGT Margin.The EGT is the temperature of the gas turbine engine's exhaust in degrees and indicates the performance status of the gas turbine engine's design.High EGT values result in further deterioration and wear in the engines.Table 1 shows which failures and causes will lead to high EGT values in engines.
In addition, pilots should pay attention to the EGT value, especially at takeoff, because exceeding the EGT limit may cause damage to the gas turbine engine components or reduce the service life of components.It is therefore important to keep the EGT as low as possible [20].At the maximum EGT temperature, the gas turbine engines can reach at takeoff and has a certificate showing a red reference line.
An example is shown in Figure 3.The difference between the highest EGT temperature reached during the takeoff and the EGT determined by the red reference line is called EGT margin (Figure 3) [21].EGT margin is used to obtain information about motor performance.
EGT margin deterioration is due largely to engine components.Compressor fouling, seal leakage, increased tip clearances, and airfoil erosion are among the deterioration reasons.EGT margin deterioration on airplanes causes the increase in engine thrust, the decrease in engine usage, the decrease in average flight distance, and the difficulty of the working environment.Figure 4 shows some sample faults that may increase the EGT value.
The EGT margin value is a parameter that is controlled by experts only when in maintenance at the takeoff and is used to evaluate the condition of the gas turbine engine.The more the difference between the max EGT value   International Journal of Aerospace Engineering reached at takeoff and the EGT redline determined by the manufacturer is, the better the performance of the gas turbine engine.This assessment is valid only for takeoff not for other flight stages.

Method
In this study, by comparing the EGT value that should be at the takeoff, cruise, and landing phase of the aircraft and the EGT value obtained from the sensor, RMS value is obtained, and this value displays the deterioration in performance.For this purpose, multiple regression analysis (MRA) and artificial neural networks (ANN) have been chosen to predict the EGT values obtained from real flight data.MATLAB program for artificial intelligence analysis and SPSS Statistics 22 package program for statistical regression analysis were used.
In MRA and modeling with ANN, 12,935 samples were used.70% for training, 15% for validation, and 15% for testing were used from these samples.The data used for modeling was recorded during the flight of the aircraft, and no changes were made to the data.Training, validation, and test data were randomly selected from the dataset.The same training, validation, and test data were used in both analyses to evaluate the comparison and modeling.The training, test, and validation data statistical parameters used in the prediction of EGT parameters are given in Tables 2-4, respectively.

Statistical Regression Analysis.
In the regression, the goal is to find relationships between the descriptive (independent) variables and the described (dependent) variable.As y is a dependent variable, x 1 , x 2 ,…, x k are independent variables, the method which expresses the causal relationship between variables as a mathematical model is called a multiple linear regression model [22].In multivariable linear regression analysis, the general formula of the regression equation that gives the shape of the relationship between variables can be expressed by where y, x 1 , x 2 , … , x k represent the observable values and β j , j = 0, 1, … , k, as the regression coefficients β k parameters as Table 1: Gas turbine engine failures and causes [28].3 International Journal of Aerospace Engineering the expected change in y for the unit changes in x k 's and e error.The model investigated for regression analysis in this study is given in Table 5.
Where x i values are, respectively, N1 speed, N2 speed, pitch, angle of attack, roll, vertical acceleration, total air temperature, ground/air, and β 0 , β 1 , β 2 , β 3 , β 4 , β 5 , β 6 , β 7 , β 8 are unknown parameters.The investigation of whether the coefficients of the regression model are significantly different is carried out by the t-test.When the significance value (Sig.) of the t-statistic is less than 0.05, the assumption is accepted and it is concluded that the coefficients in the model are meaningful.Sig.value is the value used with the purpose of determining the existence of statistical significance and the level of proof of the difference, if any.The level of significance is usually taken as 0.05 and 0.01.A level of 5% significance means a confidence interval of 95%, and a level of In ANN, there are many cells and many bonds between inputs and outputs.ANN can be divided into several subclusters.These small groups are called layers.The network is formed by hierarchically connecting layers.In ANN, learning takes place in two stages.In the first step, the random weight values are taken and the net output is determined for the sample shown to the network.The weights are regenerated by feedback or forward feed according to the response suitability.In the second step, the weight values are changed by showing different samples to the network in order to find the best weight value that the correct output can obtain.
There are different algorithms used in the learning process.The backpropagation algorithm is the most common.In the backpropagation algorithm, it tried to reduce backward errors.The training algorithm specifies how to adapt the learning rule to ANN according to the nature of the problem at hand.The samples selected during training should be selected from each region of the problem space to represent the problem space.The input and corresponding output vector are used to train the network.The output obtained from each iteration result is compared to the target, and network training is continued with the weight renewal process or the training is terminated depending on the given error.After the weights, providing the best output data is determined in artificial neural networks, the processes carried out to evaluate the learning status of the network are defined as testing the network.In this phase, the output is produced by using the most suitable weight values determined after the training by introducing not seen samples to the network.This output reveals the learning success of the network.The more successful the result, the better the training performance of the network.

Measurement Accuracy of Prediction Models.
The accuracy of a prediction model depends on how close the prediction values are to the actual observation value.If the model is successful in predicting actual values, the prediction error will be relatively low.The methods used in this study were analyzed by three different error performance methods.The first of these is the R 2 determination coefficient and can be expressed by where n is the number of observations, y is the actual values, x is the predicted value, and y is the average actual value.The other error analysis method is expressed by the mean absolute percentage error (MAPE).

MAPE =
〠 n i=1 e t /y t n 100 % 3 In this formula, e t = y t − x t , y t is the actual observation value, x t the prediction value, n is number of observations in the prediction period, and e t is the prediction error in t period.Another error analysis method is expressed by the mean square error (MSE).
where x i is the predicted value of the ith unit, and y i is the actual value of the ith unit.The closer the predicted values are to the actual values, the smaller the MSE becomes; the further away from the actual values the greater the MSE is.Root mean square error (RMSE) in ( 5) is a frequently used measure of the differences between values (sample values) predicted by a model or an estimator and the values actually observed.

Determination of Effective Aircraft Performance Parameters and Prediction by Multiple Regression Analysis.
Multiple regression analysis was performed to determine the effect of independent variables on dependent variable EGT parameter.When the significance value (Sig.) of t-statistic is greater than 0.05 in multiple regression models, the analysis is repeated by subtracting the meaningless variable from the model.A new model has been derived by subtracting meaningless variables from the data recorded during flight of the aircraft.In the obtained model, N1 speed, N2 speed, pitch, angle of attack, roll, vertical acceleration, total air temperature, and ground/air parameters were found to be important parameters affecting EGT parameter (Table 7).Here, EGT is a dependent variable, N1 speed, N2 speed, pitch, angle of attack, roll, vertical acceleration, total air temperature, and ground/air are independent variables,

International Journal of Aerospace Engineering
B is unregulated regression coefficient, beta is standardized regression coefficient, Sig. is significance level, R is correlation coefficient, and R 2 is the determination coefficient.According to Table 8, N1 speed, N2 speed, pitch, angle of attack, roll, vertical acceleration, total air temperature, and ground/air parameters are highly correlated with dependent variable EGT parameter (R = 0.969).Independent variables explain 93.9% (adjusted R square = 0.939) of the total variance (change) in the dependent variable EGT (see Table 8).According to the standardized regression coefficient (beta), the relative importance of the independent variables over the EGT parameter is N1 speed, total air temperature, pitch, roll, angle of attack, vertical acceleration, ground/air, and N2 speed.When the t-test results on the significance of regression coefficients are examined, it is seen that N1 speed parameter is the most important (significant) independent variable on EGT.According to the results of regression analysis, multiple regression equation's mathematical model for EGT prediction is expressed by The analysis of the linear multiple regression model is shown in Table 7.The fact that Sig.values of the parameters used in this model are less than 0.01 means that it is a highly significant model according to Table 6.
As a result of the correlation analysis, whether there is a linear relationship and the degree of this relation if there is any is calculated by a correlation coefficient, the correlation coefficient takes values between −1 and +1.If there is a positive correlation, the value of the variable y increases as the value of the variable x increases, or the value of the variable y tends to decrease as the value of the variable x decreases.Negative correlation (negative relationship)  When the values in Table 7 are interpreted according to Table 9, it could be seen that there is a strong, positive, and significant relationship between the pitch parameter and the EGT parameter and that there is a strong positive correlation between the N1 speed and N2 speed parameters and the EGT parameter.It could also be seen that there is a very weak, positive, and significant relationship between total air temperature and ground/air parameters and EGT parameters and that the angle of attack, roll, and vertical acceleration seem to be very weak and negative and have a significant relationship.
The fact that the Sig.value shown in Table 10 (Sig.= 0.000) is smaller than 0.001 indicates that the developed model is very highly significant.According to 6, EGT parameter, training, test, and validation data are predicted by multiple regression analysis.The error performance values between the measured values and the predicted values for the training, test, and validation stages are determined according to 2, 3, 4, and 5.The comparison of EGT values predicted by multiple regression analysis and EGT value from the aircraft sensor at training, test, validation stage is shown in Table 11.

Prediction with Artificial Neural
Network.In the ANN model; N1 speed, N2 speed, pitch, angle of attack, Roll, vertical acceleration, total air temperature, and ground/air are given as the input data to the network and EGT is predicted as the output data.All the data have been presented to the network as it was received from the aircraft without any change.
The limit values of the input and output parameters of the ANN model are shown in Tables 2-4.12,935 data obtained from the flight of the aircraft were used in the MATLAB program with "dividerand" command 70% of which for training, 15% of which for testing, and 15% of which for validation.The feed forward network type was chosen as the artificial neural network model to be used in practice.In the feed forward calculation, the most important event is trying to predict the output values with the lowest error margin.The model shown in Figure 5 is obtained by changing the number of iterations and the number of neurons in the hidden layer for the optimal network architecture for predicting the EGT parameter with the experiments performed.The best results of 11 different training functions to predict the EGT parameter with the smallest error are shown in Table 12.For experiments, a computer with Intel (R) Core (TM) i5-4590 CPU @ 3.30 GHz 4 GB RAM was used.
When the results in Table 12 are compared against the test MSE value, it is determined that the Levenberg-Marquardt backpropagation function (trainlm) is the best resultant.In Figures 6-8, it illustrates neural network regression plots for training, validation, and test.The correlation coefficient of the EGT value from the aircraft sensor and the predicted values of training, test, and validation phases are higher (R > 0.99).In order to evaluate the performance of the developed network, the error performance values between measured values and predicted values for training, validation, and test phases were determined according to 2, 3, 4, and 5 and the results are presented in Table 13.In addition, the network characteristics developed are given in Table 14.
When the results are evaluated, determination coefficient (R 2 > 0.99) could explain more than 99% of EGT parameter at training, testing, and the validation stage with N1 speed, N2 speed, pitch, angle of attack, roll, vertical acceleration, total air temperature, ground/air parameters.The closer the R 2 parameter is to 1, the better the performance of the network.Lewis has classified the models with a MAPE value of less than 10% as "very good," models between 10% and 20% as "good," models between 20% and 50% as "acceptable," and models higher than 50% as "wrong and faulty" [24,25].When we look at the MAPE values of the enhanced network, it is suggested that the network is predicated on high accuracy since MAPE is <1% during the training, testing, and validation phases.It also indicates that the smaller the RMSE parameter, the higher the performance of the network.

Modeling of the Engine Health Monitoring Program in MATLAB Simulink
Environment.In order for our system to work instantly, we need to design an interface in the MATLAB Simulink environment with the best ANN model we have obtained as a result of the experiments.Figure 9 shows the structure of the ANN model modeled in MATLAB Simulink environment.
Babbar et al. evaluated the difference between EGT temperature and flight parameters of two symmetric engines at various stages of the flight to examine the condition of the aircraft [26].By differential analysis, it is aimed to compare EGT value obtained from ANN with real EGT value coming from the sensor of the aircraft and to easily notice when there is any problem.In this study, it is necessary to instantly compare the values in order to evaluate the condition of the gas turbine engine.In this analysis, EGT.1 was determined as the actual EGT value from the sensor of aircraft and EGT.2 was determined as the parameter predicted by ANN.The similarity scale used to determine the status of the gas turbine engine is illustrated in Using EGT differences of each value, 8 is used to calculate the mean square root of the given flight phase.

RMS
The RMS value is shown as THE aircraft's performance deterioration value in the interface program.Differential analysis model needs to be modeled in MATLAB Simulink environment.The interface designed for the aircraft's engine health monitoring in the MATLAB Simulink environment is shown in Figure 10.
When the graph shown in Figure 11 is examined, the values shown in the pink color represent the data obtained from the sensor of the aircraft, the values shown in the yellow color represent the EGT values predicted by the ANN, and the values shown in green indicate the motor performance.
The limit determined for the alarm level is shown in red.The alarm level assumed for this aircraft engine is EGT margin value.In order to make a system monitoring, values obtained from the system needs to be used.Two types of data are used in the applications: (1) Data gathered from normal and faulty operation conditions.
(2) Data gathered only from monitoring of normal conditions.Collecting faulty data from gas turbine engines like systems are very difficult and expensive.Due to aviation rules like EASA Part-M, aircrafts with only a completed maintenance and with a Certificate of Release to Service are allowed to fly.
An aircraft with a completed maintenance means a collection of systems with a healthy operation.Therefore, condition monitoring systems are constructed only for normal flight data.
Since we used data from a real aircraft, only normal operation data is in hand.It is apparent from simulation results that instantaneous performance of the gas turbine engine can be monitored from takeoff to landing.In this study, we compare EGT values one from instant prediction we did and another gathered from the EGT sensor of the aircraft.Even though there exists a difference in between, this is evaluated as an abnormal situation.
In Figure 12, a fault detection example is shown.Gas turbine engines are composed of various components.A failure on a single or multiple components will result in a performance defection in those engines.Corrosion, oxidation, damaged bearings, high blade clearance, damaged or burned turbine blade and vanes, damaged combustors, damaged fuel nozzles, and broken rotor discs and blades are some of the failures that lead to performance degradation.Performance degradation will result a change in EGT value.This study is not supposed to determine where and why a failure has occurred.Exceeding EGT margin (+100 °C) is considered as an abnormal situation.In this study, it is shown that the performance of the aircraft can be monitored from takeoff to landing.
Pilots follow various parameters during operation of the aircraft.EGT parameter is one controlled during the takeoff.Engine manufacturers determine maximum EGT thresholds during a takeoff.If the pilot does not track EGT, there is no a secondary system that generates warnings.If the engine exceeds the maximum allowed EGT values, it will experience serious failures with high probability.The system developed in this study will warn the pilot in case he misses the EGT value from the engine and warning display system.

Figure 5 :
Figure 5: Architecture of the proposed neural model.

Figure 7 :
Figure 7: Neural network regression plots for validation.

Figure 6 :
Figure 6: Neural network regression plots for training.

Figure 8 :
Figure 8: Neural network regression plots for test.

Figure 12 :
Figure 12: Graphical representation of an unusual EGT increase.

Table 2 :
Training data range used in the prediction of EGT.

Table 3 :
Test data range used in the prediction of EGT.

Table 4 :
Validation data range used in the prediction of EGT.
4.2.Artificial Neural Networks.ANN are nonlinear modalities that are composed of artificial neurons connected with each other and that contain an input set and a single output.

Table 7 :
Multiple regression analysis results for EGT training data.
a Dependent variable: Exhaust gas temperature.

Table 10 :
ANOVA results for EGT training.

Table 11 :
MRA error performance values.

Table 12 :
Comparison of the feedforward neural network training algorithms for their test MSE value.