Fault Detection for Multimodal Process Using Quality-Relevant Kernel Neighborhood Preserving Embedding

A newmethod named quality-relevant kernel neighborhood preserving embedding (QKNPE) has been proposed. Quality variables have been considered for the first time in kernel neighborhood preserving embedding (KNPE)method for monitoringmultimodal process. In summary, the whole algorithm is a two-step process: first, to improve manifold structure and to deal with multimodal nonlinearity problem, the neighborhood preserving embedding technique is introduced; and second to monitoring the complete production process, the product quality variables are added in the objective function. Compared with the conventional monitoring method, the proposed method has the following advantages: (1) the hidden manifold which related to the character of industrial process has been embedded to a low dimensional space and the identifying information of the different mode of the monitored system has been extracted; (2) the product quality as an important factor has been considered for the first time inmanifoldmethod. In the experiment section, we applied this method to electrofused magnesia furnace (EFMF) process, which is a representative case study. The experimental results show the effectiveness of the proposed method.


Introduction
Product quality monitoring has attracted a lot of attention like process monitoring in nearly two decades.Scholars and enterprises have made great efforts and obtained many worthy achievements [1].Data-based process monitoring is an important means in solving some such issue [2][3][4].Typical data-based process monitoring methods are used to deal with the input variable space, such as principal component analysis (KPCA) [5][6][7] or independent component analysis (KICA) [8][9][10].Partial least squares (PLS) [11][12][13][14] or kernel partial least squares (KPLS) [15][16][17][18] can build the model of input/process variables and output/quality variables; however, the effect is not ideal for multimodal process [19,20].Because root causes of potential quality problems are in process variables [21], synthetically considering the whole variables is a subject worthy of study.
Large-scale data can be obtained and stored from industrial process, which contain process variables and quality variables [22][23][24].Storing and processing these received data are a difficult problem; we need the mathematical transformation and dimensionality reduction.Moreover, because process variable is usually measured directly, it does not cause big time delay generally.However, quality variable is measured by many complex means, such as chemical test; it will cause big time delay.Sample frequencies of process variables and quality variables are usually not synchronized [21].This problem may increase the false alarm rate and missed alarm rate in the monitoring of multimodal process using the above traditional method.Manifold-based method, such as locally linear embedding [25][26][27][28], is a kind of method which keeps relative relationship among the data points from a high dimension space to a low dimension space.As a manifold learning framework, it calculates the Euclidean distance between every data point and its neighbors and then keeps this relationship to a lower dimensional manifold in the process of dimension reduction.Time serials of sample will not affect the effectiveness of the previous mentioned method.However, for strong-nonlinearity problems, it may not be used directly, because some data points may lay on nonmanifold surfaces.In this case, kernel technique was introduced [20], which projects the raw data into a higher dimensional linear space, that is, the feature space.The manifold structure of data is improved by this transformation.In addition, since change of feed, adjustment of production plan, and switching set value by human, industrial production processes have multimodal characteristics [29,30].The traditional monitoring methods are not available for multimodal processes monitoring as previously stated.The quality variables of each mode and the product quality variables were unnoticed in the existing manifold-based monitoring methods.
This paper proposed a new manifold-based monitoring method used to detect fault in multimodal process, which considered the quality variables for the first time.We can decompose the process into two steps as follows: first, using KNPE technique to improve manifold structure and to deal with multimodal nonlinearity problem; second, adding preprocessed quality variables in the objective function to monitoring the complete production process.The proposed method has the following advantages: Firstly, the hidden manifold has been embedded in a low dimensional space and the identifying information of the different mode has been extracted.Secondly, the product quality has been considered in manifold method for the first time.To verify the effectiveness of the proposed method, EFMF is introduced in this paper.EFMF process will be described in the experiment section, which is a typical industrial object with characteristic of multimode and nonlinearity [31,32].Through experimental examples, detection capability of the algorithm proposed will be verified.
The main parts of this paper are organized as follows.In Section 2, KNPE algorithm and statistic are illustrated.In Section 3, we show how QKNPE works in process monitoring.An important example of EFMF process is studied to verify the effectiveness of the proposed method.

Kernel Neighborhood Preserving
Embedding Algorithm and Statistic 2.1.The Theory of Kernel Neighborhood Preserving Embedding Algorithm.Nonlinear characteristics of complex industrial process limit the application of many linear processes monitoring method.The basic principle of kernel function method is as follows.We assume process data after standardization as X  = [x 1 , x 2 , . . ., x  ] ∈ R × ,  is sample size obtained by sampling, and  is dimension of measure variable.We use a nonlinear mapping Φ(⋅) to map input data into a high dimensional feature space and then process the data in high dimensional feature space.We assume that the data points in the neighborhood fit locally linearly in the feature space.As shown in Figure 1, it can convert the computation of nonlinear high dimensional feature space after inner product operation into kernel function operation of the original data space by kernel function K(X  , X  ) = Φ  (X  )Φ(X  ).The kernel function used commonly in process monitoring areas is radial basis kernel function K(X  , X  ) = exp(−‖X  − X  ‖ 2 /), where  =  2 ,  is parameter,  is the dimensions of the input data, and  2 is variance [33].Φ(X) = [Φ(x 1 ), Φ(x 2 ), . . ., Φ(x  )] ∈ R V is data of high dimensional feature space.V is the dimensions of the high dimensional feature space.The Euclidean distance between two points in high dimensional feature space can be calculated according to the following formula: And then, the following weight matrix of KNPE W can be calculated according to the following formula [34]: Mathematical Problems in Engineering 3 Solving (2), ) . ( The global public information Φ(X  ) = [Φ(x ,1 ), . . ., Φ(x , )] ∈ R  can be extracted by keeping local structure of the input data.The cost error of global public subspace is defined as where M = M  = (I − W)  (I − W) and I  is  order unit matrix.
Using the Lagrange multiplier method to deduce the global public space Φ(X  ), the Lagrange function is Setting Lagrange function as Φ(X  ) and partial derivative of  as 0, we can obtain The minimal solution Φ(X  ) satisfies this form: So the global public space Φ(X  ) is the first  minimum eigenvalue feature vector of M.

Calculations of 𝑇 2 and SPE Statistic for Kernel Neighborhood Preserving
Embedding.The method based on KNPE process monitoring can reduce , the false alarm rate and missed alarm rate, and improve the accuracy by separating the data space into global public space and the local special subspace.
We project the matrix from the high dimension space to low dimension space as Φ(X  ) = G  Φ(X).According to (4), we can obtain where G is mapping projection matrix from high dimension space to low dimension space.The constraint condition for the projection mapping matrix is G  G = I  .Lagrange multiplier can be used to solve it as follows: Setting G = Φ(X)S, coefficient matrix is S = [s 1 , . . ., s  ] ∈ R  .Equation ( 9) can be transformed to Perform eigenvalue decomposition of matrix MK.And then the projection mapping matrix S from high dimension space to low dimension space can be gotten by reconstructing eigenvector referring to the  smallest eigenvalue.
Based on the subspace KNPE separation modeling process is as follows: where Φ(X  ) is global public subspace, Φ(X  ) is each local special subspace, Φ(X , ) ∈ R V ,  = 1, . . ., , and  is the number of modes in the multiple mode process.
For the new sample data Φ(X new ), the global public subspace Hotelling  2 statistics and SPE statistics of each local special subspace are as follows: where Λ = Φ(X  )Φ  (X  )/−1 is covariance matrix of global public subspace on the training set.

Quality-Relevant Kernel Neighborhood Preserving Embedding Method
is the total length of various models.The relationship of global public subspace and quality variable information is fully extracted.The extraction of the global public space will meet requirements of complex industrial process and improve the accuracy of the multimode process monitoring.The model using QKNPE method to extract the global public subspace is as follows: where min tr(Φ(x , )MΦ  (x , )) corresponds to (4) and min −Φ(x , )Φ  (X)r  − Φ(x , )Y  c  means maximizing the projection of global public data along the score directions of Φ  (X) and Y  .Thus, we can make the point Φ(x , ) in the global public subspace establish the relations with quality variables.Φ(x , ) is obtained by using the Lagrange multiplier method.The Lagrange function is as follows: Setting the partial derivative of Lagrange function for Φ(x , ), r  , c  , and   as 0, It is available by deriving The only least solution Φ(x , ) will meet the following form: So the feature decomposition of matrix )) is made.The eigenvectors which the  smallest eigenvalue corresponds to are the best refactoring point matrix Φ(X  ).The load vector of each model input data and output quality variables is

Process Monitoring Based on Quality-Relevant Kernel
Neighborhood Preserving Embedding.According to (12), we set Φ(x , ) = G   Φ(X) and where G = [G 1 , G 2 , . . ., G  ] ∈ R V is mapping matrix from high dimension space to low dimension space.The constraint condition is G   G  = 1.And then Whether SPE or T 2 of the sampling data Let G  = Φ(X)S  and let S = [S 1 , . . ., S  ] ∈ R  ; (20) can be transformed as The minimal solution of S  meets the following form: So the feature decomposition for matrix (2MK+KK/ 2 + Y  YK/ 3 ) is made.The eigenvector corresponding to the  smallest eigenvalue is projection mapping matrix S  from high dimension space to low dimension space.
The flow chart of the whole multimode process monitoring method is shown in Figure 2.

The Experimental Results
The EFMF is one type of the main equipment for the production of magnesia, and it belongs to mine heat electric arc furnace.Arc is heat source.It can well smelt magnesia by concentrating heat.Overall equipment of fused magnesia furnace generally includes transformer, circuit short net, electrode lifting gear, electrode, and furnace.Furnace shell is commonly rounded and slightly tapered.For the convenience of easing shell molten lead, the ring is welded in a furnace shell wall.The mobile car is equipped below furnace, so that clinker after complete melting can be moved to fixed station and cooled [20].
The EFMF produces high temperature to complete melting process by introducing large current.The temperature of area near the electrode is high.The temperature of area far away from the electrode is low.Once the temperature of the area around electrode is too high, it is easy to cause security incident.However, once the temperature of area far away from the electrode is too low, lots of material will be wasted.It will seriously affect the product yield and quality.It is required to timely detect abnormal and failure in the process: therefore, the process monitoring for working process of fused magnesia furnace is very necessary and meaningful.
The difference of magnesite stone raw materials and the difference of charging operation are corresponding to change of characteristics of the magnesium furnace smelting process and process data has strong nonlinear characteristic.This paper selects the process data obtained through the two conditions with and without charging smelting process of the massive magnesite, mixed massive magnesite, and powder magnesite of smelting process as the six different operation modes to model.For simplifying process dividing the model, this section assumes that each model is isometric.The definition of different modes in EFMF is shown in Table 1.We use process monitoring method based on the proposed QKNPE subspace separation to monitor the work process.To show good effectiveness of the proposed method, LLE and KPLS are used for comparison.According to the comparing results, we can verify the efficiency and accuracy of the proposed QKNPE subspace separation method for monitoring the multimodal process.
Firstly, we select sampling data from six modes of EFMF for offline modeling, and all of these sampling data are obtained under normal working condition.Each group of data contains three phase current values and three key variables.The modeling data set contains 1500 samples.For validating the monitoring performance of the proposed method for multimode process, we use 6 groups which contain 1500 samples data with fault.Faults 1, 2, and 3 happen from 200th, 400th, and 700th samples, respectively.The reason is excessive heating.Faults 4, 5, and 6, which are caused by actuator stuck fault, start from 900th, 1200th, and 1400th samples, respectively.
Using multimodal process modeling and monitoring methods based on LLE, KPLS, and QKNPE subspace separation for fault 1 to fault 3,  2 and SPE statistics are shown in the diagrams in Figures 3-5.According to the results, it can be found that the local special subspace of LLE and KPLS has high nonresponse rates and low accuracy compared with QKNPE.Global public space monitoring of KPLS and QKNPE and local special subspace monitoring have high accuracy and low nonresponse rates, while QKNPE has lower rate of false positives.For faults 4-6,  2 and SPE are shown in Figures 6-8.We can find that the local special subspace of LLE monitoring for faults 4-6 has high nonresponse rates and low accuracy and the global public subspace monitoring, global public space monitoring of KPLS and QKNPE, and local special subspace monitoring have high accuracy and low nonresponse rates, while QKNPE has lower rate of false positives.
In fused magnesia furnace multimodal process, the monitoring accuracy, the rate of false positives, and nonresponse rates of six faults based on LLE, KPLS, and QKNPE subspace separation method are shown in Table 2.The experimental results show the process monitoring method based on QKNPE has very good monitoring effectiveness.

Conclusion
In this paper, the method based on QKNPE is proposed and applied to multimodal process monitoring.This method can solve the problem of nonlinear multimodal process monitoring.KPLS method can solve the problems that manifold method cannot directly be applied to nonlinear multimodal process monitoring and KPLS changes did not explain ability of quality variables.In monitoring fused magnesia furnace smelting process, QKNPE method can use subspace separation method and considering the advantages of the relationship between input variable and output variable quality, it can improve the monitoring accuracy, can reduce false positives and omission, and can be effective to detect the fault during the process of multimode.

Figure 2 :
Figure 2: The monitoring flow of multimode processes based on QKNPE.
Monitoring based on QKNPE for fault 1

2 (
c) Monitoring based on QKNPE for fault 2
Monitoring based on QKNPE for fault 6

14 Mathematical
Problems in Engineering ) = [Φ(x ,1 ), . . ., Φ(x , )] ∈ R  to keep the local structural information of high dimension space of input data and at the same time make the covariance information of score matrix of output quality variable Y = [y 1 , . . ., y  ] ∈ R  after standardization largest.T = [t 1 , . . ., t  ] ∈ R  is score matrix of input data variable Φ  (X) of each mode, where T Calculate the sampling data SPE or T 2

Table 1 :
The definition of different modes in EFMF.

Table 2 :
The monitoring accuracy rate, false alarm rate, missing alarm rate based on LLE, KPLS and QKNPE methods.