Fault Diagnosis of Batch Reactor Using Machine Learning Methods

Fault diagnosis of a batch reactor gives the early detection of fault and minimizes the risk of thermal runaway. It provides superior performance and helps to improve safety and consistency. It has becomemore vital in this technical era. In this paper, support vector machine (SVM) is used to estimate the heat release (Q r ) of the batch reactor both normal and faulty conditions.The signature of the residual, which is obtained from the difference between nominal and estimated faultyQ r values, characterizes the different natures of faults occurring in the batch reactor. Appropriate statistical and geometric features are extracted from the residual signature and the total numbers of features are reduced using SVM attribute selection filter and principle component analysis (PCA) techniques. artificial neural network (ANN) classifiers like multilayer perceptron (MLP), radial basis function (RBF), and Bayes net are used to classify the different types of faults from the reduced features. It is observed from the result of the comparative study that the proposed method for fault diagnosis with limited number of features extracted from only one estimated parameter (Q r ) shows that it is more efficient and fast for diagnosing the typical faults.


Introduction
Batch and semibatch reactors are extensively used in fine chemicals, biochemicals, pharmaceuticals, and food industries for the production of small amounts of products with high added value.As these processes have become more automated and more flexible, the demands on their efficiency have increased, creating more complexity in operation and control.However, the frequency of accidents increased involving important consequences on the human, natural, and economic environment [1,2].Therefore, fault diagnosis has become a major research topic.Batch processes usually involve a lot of variables that interrelate with one another.When any of these variables diverge away from their specified limits, a fault occurs.There are a plenty of studies on fault diagnosis varying from analytical methods to artificial intelligence and statistical approaches.The approaches of fault detection are based on the threshold checking in the past.Venkatasubramanian and Chan [3] proposed fault diagnosis to the continuous stirred tank reactors with neural network and identified six kinds of faults.The application of neural networks in fault diagnosis of chemical process focuses on the following aspects such as using as a classifier, using as a dynamic forecast model, and combining with other diagnostic methods.Later, pattern classification and model identification [4], EKF based fault detection [5], and fault diagnosis of ball bearing using machine learning method [6] were used.
In this paper, SVM model is used to generate the residual images.Fault classification has been done from the extracted image features.SVM is a novel machine learning method based on statistical learning theory.The SVM utilizes a hyperplane with maximum margin to produce a good generalization performance by separating different classes.As a result, SVM has been widely used for many applications, such as time series forecasting, fault detection, modelling of nonlinear dynamic systems, one-class SVM for machine fault detection and classification [7], SVM for copper clad laminate defects classification [8], and SVM for fault diagnosis of a steam turbine [9].
For this study, different major fault types like actuator fault, sensor fault, and process fault are considered.Hence the fault residuals are generated from the output of the nominal model and this faulty model.Data based fault diagnosis method requires a large amount of historical data.To make this method be more efficient, the first step is feature extraction [10].From the residuals signature, the most appropriate statistical and geometrical features are extracted and among these 15 features are selected.Through this characteristic extraction, the data can be transformed to the prior information of fault diagnosis system.SVM attribute filter is employed to rank the features, in order to reduce the input data dimension which makes the better performance of the classifier.And its performance is compared based on PCA feature reduction also.
This paper is mainly focused on identifying fault classification of batch reactor from the residual features using artificial intelligent classifiers such as multilayer perceptron (MLP), radial basis function (RBF), and Bayes net.The paper is organized as follows: Section 2 describes machine learning methods used in this work; Section 3 explains the case study of the well-known batch reactor; Section 4 gives fault identification method; Section 5 shows and discusses the simulation results.Finally, the conclusion from this work is presented in Section 6.

Machine Learning Methods
In this paper, the machine learning methods like SVM, ANN like MLP, and RBF are used.SVM is a supervised learning method, which is motivated in maximizing the ability to generalize well from a small number of training samples by mapping the original space into a high dimensional inner product space called feature space via a kernel.The SVM formulation follows structural risk minimization (SRM) principle in which an upper bound on the generalization error is minimized, whereas the error risk minimization (ERM) minimizes the prediction error on the training data.This equips the SVM with a greater potential to generalize the input-output relationship learnt during its training phase for making good predictions for new input data [11].
SVM's solutions are characterized by convex optimization problems.So it can be applied in settling pattern recognition problems with small samples, nonlinearity, and higher dimensions.SVM can easily be introduced into learning problems such as function estimation.
In the batch reactor, the heat released by the reaction affects the dynamics of the reactor temperature.This term depends on the initial concentration, which is usually measurable at a very low sampling rate, not suitable for real time control or estimation.So any fault occurring in the batch reactor will have the impact on the heat release of the reactor.So a model developed based on the heat release of the reactor (  ) which can predict the type of fault occurred.But the quantity of   is not directly measurable one.So an estimator is required to estimate it.Here SVM is used to build the estimator model.
Similarly, for classification of different faults, the usual ANN classifiers are used in this work.There are various neural network architectures but the application considered in this work has used MLP with back propagation learning algorithm, RBF, and the Bayes net [12][13][14][15][16].

Case Study
The proposed fault diagnosis scheme is applied to the batch reactor by Cott and Macchietto [17], Aziz et al. [18], Mujtaba et al. [19], and Sujatha and Pappa [20].The complex reaction scheme of a batch reactor is a representative of many industrial reactions as shown in Figure 1.The batch reactor is inherently a dynamic process.The reactions I and II of the batch reactor are given in the following equation: where ,  are the raw materials,  is the desired product, and  is the waste product.This benchmark model is given on the basis of reaction equation (1), competent of simulating the reactions like the reactor temperature, jacket temperature, and heat release of reactor under nominal operating conditions and also under various faulty conditions.
These simulations are run under closed loop control with generic model controller (GMC) [20,21].The total running time of the batch process is 120 minutes (2 hours).The batch reactor model equations are given below: ( The initial values of the above mentioned process parameters of [  ,   ,   ,   ,   ,   ] are [12.0,12.0, 0.0, 0.0, 20.0, 20.0] at  = 0.The reactor temperature is used as the control variable and is bounded between 20 ∘ and 100 ∘ C and the jacket temperature is the manipulated variable and it is bounded between 20 ∘ and 120 ∘ C. All the nominal parameters and constant values used in the model equations ( 2) are given in Table 1.

Fault Diagnosis of the Batch Reactor
Fault can be defined as any nonpermitted deviation of process behaviour from an acceptable one.So fault diagnosis is the problem of identifying and isolating unanticipated changes in a process.Diagnosis is a challenging problem due to several factors such as monitoring the number of variables, occurrences of the process complexity, and the variety of process failures.The failures can be broadly categorized into three types such as actuator faults, sensor faults, and process faults.The fault types are inspired from the relevant literature [22] presented in Figure 2.  Faults can be defined as follows.
(i) Process faults: processes that occur below a certain level of detail are generally represented as lumped parameters in process models.An example of such a lumped parameter in this batch reactor is the fouling factor.Changes in these parameters are termed as process or parametric faults.
(ii) Sensor failure: while all sensors have random errors, sensor failures refer to gross errors such as bias, measurement with noise, and frozen sensors.
Controller and actuator fault: the actuator faults are mostly caused by the nonlinear characteristics of the control valve by hysteresis, stiction, friction, and poor controller tuning.Actuator action in the presence of fault can be represented as   () = () +   () where   () is the actuator fault vector.Abrupt constant bias has been given via the vector   () = , so that the actuator action becomes T r (k − 1) T r (k)   () = () +  as shown in Figure 3. Similarly the freezing of the actuator at certain time has been shown as   () = −(), so that the actuator action is   () = 0 as shown in Figure 4.The faults that occurred in the batch reactor have a relationship with the heat release of the reactor.

SVM Estimator Model Based Fault Detection.
Model based fault detection method is developed based on the assumption that the developed model is replica of the real plant dynamics.The input-output data are obtained by simulating the batch reactor with nominal operating conditions.The different faults have been introduced in the reactor through simulation by using MATLAB software.From the simulated input and output data, SVM estimator model is developed using LIBSVM toolbox.The heat release of the reactor (  ) which is not a measurable parameter is estimated through the SVM model.

Training and Testing of the Estimator.
Training the SVM estimator as shown in Figure 6 is an iterative process in which the SVM is given inputs along with the desired outputs.In this work, the SVM estimates the heat release of the reactor (  ).The input and output mapping of the estimator is shown in Figure 5 where the past and present values of the reactor temperature and jacket temperature and the past values of the   are considered as the input data.The estimator model is developed by selecting the SVM parameters such as  = 90,  = 100 and the Radial Basis Function (RBF) as kernel to build and train the estimator as shown in Figure 6.Without any fault, the response of the estimator is shown in Figure 7.
The estimator models are developed under both normal and faulty conditions.The difference between faulty and nominal model is called residual (in terms of   ) which is the important part of the fault diagnosing method as shown in Figure 8.Based on the residual patterns, the faults are identified through ANN classifiers.

Different Faults.
Three different types of faults such as process fault, sensor fault, and actuator fault are introduced and data is collected for estimation of heat release of the reactor (  ).Each fault is introduced through simulation and the respective plant input and output data are collected every time.Table 2 shows the assigned fault for this work.

Process Fault
(i) Change in heat transfer coefficient (): because of the fouling effect present in the heat exchanger,  changes from its nominal value from batch to batch.Variation  in   residual for 10%, 20%, 30%, and 40% increase and decrease in  are shown in Figure 9.
(ii) Change in heat of reaction (Δ): the actual value of heat of reaction may not be available in the open literature and subsequently it can be the basic for model mismatch due to change in unmeasured parameter.The variation in   residual for the heat of the reaction is reduced and increased by 10% and 25% from the nominal value as shown in Figure 10.
(iii) Change in initial charge of reactants (  ,   ): a change in product demand and accidental failure of the charging system or scale-up issues at the design stage will be the causes for the change in the operating conditions.The variation in   residual for 10%, 20%, 25%, and 30% decrease and increase in initial charge are as shown in Figure 11.
In the heat release of the reactor residual patterns for the process fault, the differences in sign of magnitude, sluggishness of the response, decrease of the magnitude, and change of the starting position of the curve are observed.

Actuator Fault.
The actuator fault considered here is as follows.
(i) The addition of bias in the actuator shows the sticking nature of the actuator as shown in Figure 3. (ii) The actuator freezing occurred in the time interval 80 to 100 minutes as shown in Figure 4.
The residuals from the actuator faults as freezing at the different time intervals and biasing are shown in Figures 12  and 13.
From the actuator fault of both freezing and biasing signatures, abrupt change is identified at the moment of fault occurring and based on the duration; the magnitude and the pattern of the residual vary.

Sensor Faults.
The following sensor faults are considered in this work: (i) jacket temperature and reactor temperature measurements with the white noise; (ii) abrupt bias at sensor; (iii) abrupt zero at sensor.Variation in   residual for the sensor faults is shown in Figure 14.
The signature of the sensor fault pattern is varying with respect to the sources of different components and at the time of fault.
From the fault signatures, the relevant statistical and geometrical features are extracted.Here, 15 features such as area, mean, standard deviation, skew, kurtosis, fractional area, Feret's diameter, integrated density, and raw integrated density are extracted by using Image J software.Few of the features are explained below.
(i) Mean: average value of a signal is termed as mean value as given in (ii) Standard deviation: it is a measure of energy content in the fault signature shown in (iii) Skewness: it is a measure of symmetry or more precisely the lack of symmetry as expressed in (iv) Kurtosis: it is a measure of whether the data are peaked or flat relative to the normal distribution as given in where  is the sample size and  is the standard deviation.

Results and Discussion
The extracted features from the 24 simulated fault instances are listed in Table 3.The instance means the number of data points in the input data.The listed features are reduced to decrease the computational complexity.The feature reduction as well as fault classification is carried out by using WEKA software.

SVM Attribute Filter
Based Feature Selection.SVM attribute filter is used to reduce the number of features by ranking method.Intuitively, the SVM model is a representation of the points in space mapped, so that the features of separate categories are divided by a clear gap.The new features are then mapped into the same space and predicted to belong to a category based on which side of the gap they fall on.The features are given as the input to the SVM attribute filter.The attributes are ranked by the square of the weight assigned by the SVM.An attribute selection for multiclass problem is handled by ranking attributes for each class separately using a one-versus-all method.Based on the attribute ranking Table 4, the number of input features is selected for the classifier to improve the performance of it.

Classifier Performance.
After selecting the fault as an attribute for class, the classification is carried out with the ANN classifiers.The output of the WEKA software gives    3 0 0 0 0 0 3 0 0 0 0 0 a = FAULT1 2 0 0 0 3 0 0 5 0 0 0 0 0 50 0 0 0 b = FAULT2 0 0 4 0 0 0 0 0 4 0 0 0 0 0 4 0 0 0 c = FAULT3 0 0 0 3 1 0 0 0 0 4 0 0 0 0 0 4 0 0 d = FAULT4 0 0 0 0 5 0 0 0 0 0 5 0 0 0 0 0 5 0 e = FAULT5 0 0 0 0 0 2 0 0 0 0 0 2 0 0 0 0 0 2 f = FAULT6 the confusion matrix, which shows a detailed accuracy of the classifier based on the class and evaluation of the success of the numeric prediction.In the multiple classification, the outcome of this test data displays a two-dimensional confusion matrix with a row and column for each class.Each matrix element shows the number of test examples for which the actual class is the row and the predicted class is the column.The confusion matrix shows each classifier's ability to achieve in the classification of faults as shown in Table 5.From this, the radial basis function and multilayer perceptron have given better results as compared with the Bayes net classifier.
The training parameters for various classifiers for varying the number of features as 15 features, 9 features, and 5 features set as the input for the different classifiers are listed in Table 6, which denotes the values of the true positive (TP rate), false positive (FP rate), precision, recall, F-measure, and ROC area.
The values of the TP rate, precision, recall, F-measure, and ROC area are "1" for RBF and MLP as compared with the Bayes net.
Classifiers performance criteria are shown in Table 7 for three sets of training pattern.From this table, MLP and RBF classified all instances correctly (100%) compared with Bayes net.Comparing MLP and RBF, the mean absolute error, root mean square error, relative absolute error, and root relative  for correlation is 1 and then the patterns are similar.If the correlation is "0" or negative then the patterns are dissimilar.The diagonal values of the correlation matrix shown in Table 8 are "1" which indicates that the similarity measure is calculated for the image with itself (autocorrelation).The other entries denote the cross correlation that is computed.
The eigenvectors of a square matrix are the nonzero vectors after being multiplied by the matrix, remaining proportional to the original vector (i.e., change only in magnitude, not in direction).The eigenvectors corresponding to the features are displayed in Table 9.For each eigenvector, the corresponding eigenvalue is the factor by which the eigenvector changes when multiplied by the matrix as they represent the characteristic values shown in Table 10.
Table 11 shows the classifier performance based on the SVM with 9 features and PCA with full data as two attributes are more or less same for the RBF classifier.But for MLP and also for Bayes net, the SVM attribute filter based classification results show its better nature than PCA.

Conclusion
This paper proposes a method for classification of various faults of the batch reactor by using artificial neural network (ANN) classifiers such as multilayer perceptron (MLP), radial basis function (RBF), and Bayes net from fault signatures.It is based on the relationship of each fault's impact on heat release (  ) of the reactor, which is the immeasurable parameter, estimated by SVM estimator.The fault signature trend is varying from fault to fault.Most appropriate features are selected from the variety of fault signatures and the numbers of attributes are reduced based on the SVM attribute filter by using ranking method and PCA, to reduce the computational difficulty of classifiers.It is observed that the classification accuracy of RBF and MLP is better than the Bayes net from the classifiers performance during training.Based on the performance criteria, the RBF performs well compared to MLP and Bayes net within 24 instances under the assumption that in practical situation less historical data are available.The results verify that the capability of the proposed fault diagnosis scheme is efficient and fast by using general ANN

Figure 1 :
Figure 1: Schematic diagram of a jacketed batch reactor.

Figure 3 :Figure 4 :
Figure 3: Simulated actuator fault by introducing bias at the time of 80 minutes.

Figure 5 :
Figure 5: Input/output mapping of the SVM heat release estimator.

Figure 6 :Figure 7 :
Figure 6: Training method for the heat release estimator.

Figure 8 :
Figure 8: Block diagram of fault diagnosis process.

Figure 9 :
Figure 9: Residual (in terms of   ) due to change in heat transfer.(a) Residual (in terms of   ) due to 10% sudden increase in the heat transfer coefficient ().(b) Residual (in terms of   ) due to 20% reduction in the heat transfer coefficient ().(c) Residual (in terms of   ) due to 40% increase in the heat transfer coefficient ().(d) Residual (in terms of   ) due to 30% increase in the heat transfer coefficient ().

Figure 10 :
Figure 10: Residual (in terms of   ) due to change in heat of reaction.(a) Residual (in terms of   ) due to 25% decrease in heat of reaction.(b) Residual (in terms of   ) due to 25% increase in heat of reaction.(c)Residual (in terms of   ) due to 10% increase in heat of reaction.

Figure 11 :
Figure 11: Residual (in terms of   ) due to change in initial values of the raw material (  ,   ).(a) Residual (in terms of   ) due to 25% increase in the initial values of the raw material (  ,   ).(b) Residual (in terms of   ) due to 30% decrease in the initial values of the raw material (  ,   ).(c) Residual (in terms of   ) due to 20% decrease in the initial values of the raw material (  ,   ).(d) Residual (in terms of   ) due to 10% decrease in the initial values of the raw material (  ,   ).

Figure 13 :
Figure 13: Residual (in terms of   ) due to actuator biasing at different intervals.(a) Residual (in terms of   ) due to actuator biasing at the time interval (80 min).(b) Residual (in terms of   ) due to actuator biasing at the time interval (80 min).(c) Residual (in terms of   ) due to actuator biasing at the time interval (60 min).(d) Residual (in terms of   ) due to actuator biasing at the time interval (90 min).

Table 1 :
Nominal values of the parameters.

Table 3 :
Extracted features from

Table 5 :
Confusion matrix for different classifier.

Table 6 :
Results for training of fault features.

Table 7 :
Performance criteria of the classifiers.

Table 9 :
Eigen vectors for each feature.