Gas Classification Using Combined Features Based on a Discriminant Analysis for an Electronic Nose

This paper proposes a gas classification method for an electronic nose (e-nose) system, for which combined features that have been configured through discriminant analysis are used. First, each global feature is extracted from the entire measurement section of the data samples, while the same process is applied to the local features of the section that corresponds to the stabilization, exposure, and purge stages. The discriminative information amounts in the individual features are then measured based on the discriminant analysis, and the combined features are subsequently composed by selecting the features that have a large amount of discriminative information. Regarding a variety of volatile organic compound data, the results of the experiment show that, in a noisy environment, the proposed method exhibits classification performance that is relatively excellent compared to the other feature types.


Introduction
An electronic nose (e-nose) is an electronic system that uses a sensor to classify the types of specified gas [1][2][3][4].As it becomes easily tired, the human olfactory function does not recognize different smells continuously, and it is therefore limited in the types of smells that it can recognize.In contrast, an e-nose can monitor smells constantly and this capability is especially advantageous to humans in terms of the detection of harmful gases [5,6].For early e-nose systems, a calorimetric sensor was used to express the measured gas values in the form of a color array [7]; additionally, this system requires the completion of a number of complex analyses, as well as precision equipment such as a gas-chromatography system for gas classification or a mass spectrometer that combines complex machine intelligences, thereby limiting the service environment [8,9].In terms of its real-world application, the calorimetric-sensor-based system, it is therefore limited to the specific applications; however, with the development of electrochemical sensors and IT (both hardware and software), electronic systems became more intelligent and portable [10,11].Due to these technological developments, an increasingly diverse array of e-nose applications continues to gradually emerge and includes medical check-ups [12], pollution and gas leak searches [13], environmental protection [14], and cosmetics industry [4,15,16].
The e-nose system is composed of sensor arrays that consist of multiple channels and computing systems [2,17].Since each channel of the sensor array has a different characteristic, it responds differently depending on the type (class) of gas, and an electronic interface converts the sensor array response into vector-type numerical data.In the computing system, the features that are useful for the gas are extracted from the converted data, and a classifier is used to discern the gas classes.Sensor array sensitivity, feature extraction, and classifier performance are all important factors in a performance assessment of an e-nose system.
Among the variety of sensor array types that are used in an electronic system, conducting polymer composites, intrinsically conducting polymers, and metal oxides are often used for the detection of substances in conductivity sensors [6,16,18,19].Despite the drawbacks of polymer composites such as sensor drift, a limit to sensor life, and sensitivity to temperature or humidity, their use has been more frequently compared to other substances [20] because of the corresponding advantages such as a wide range of available polymetric materials, an inexpensive cost, a stable operation capability at room temperature, and a lesser amount of electricity consumption.Sensor measurement largely consists of the following three stages: stabilization, exposure, and purge.Each sensor channel shows different responses depending on the type of gas, and the reactions of each stage are recorded as a single vector [2,11].
Computing systems use a variety of pattern-recognition techniques to discern the data classes that are entered through a sensor array.Gas classification processes can be divided into the following two stages: the extraction of useful features from the gas for classification and the use of the extracted features to design a classifier.Since the data samples that are obtained through the sensor arrays in an electronic system are high-dimensional data that consist of the measured values per sampling point for each channel, a variety of different dimension-reduction methods can be employed to increase the degree of classification accuracy and to effectively and simultaneously reduce the amount of data processing [2,11,21,22].PCA [23], one of the typical dimension-reduction methods, performs classification by projecting data samples on the low-dimensional feature space that allows for the maximal dispersion of the data samples.LDA [24,25] is a method that focuses on finding an optimal linear discriminant function and generally shows a more effective performance compared to PCA when it is applied to classification problems.LDA composes a feature space that narrows distribution within the same class and widens a distance between the means of different classes by using the class information of training data.In [21,22], the two OCA variants CC-PCA and CC-CPCA were used, respectively, in an e-nose system.In [6,11], the CLDA method, which is used to develop the LDA method so that high-dimensional data with high correlations are implemented between the input variables, was used to classify the gas.
The above methods represent the entirety of the measured values from the gas-sample stabilization stage to the purge stage as a single vector and compose a feature space through statistical analysis.The measured value of the sensor array, however, contains different information according to each of the measurement sections, so it could be effective to use the local information together with the global information.In this paper, the combination of the global features that are extracted from the entire measurement section and the local features that are extracted from each local section was therefore used for the gas classification classifier.For this purpose, the ReliefF method [26] was used to evaluate the individual global and local features in terms of the usefulness of not only each feature that classifies the gas but also the combined features that are used for the classifier of an electronic system, whereby only those features with useful classification information were screened.
The proposed method is greatly advantageous in noisy environments because only the features that maintain the discernment information from the variety of sections through the feature selection processes are screened and used, even if the problem occurs during the data measurement process or if noise is mixed with the measured value.The results of the experiment for a variety of volatile organic compound types [20] confirm that the proposed combined features exhibited a sound classification performance in a comparison with the other features that are obtained from the use of the feature extraction method.
This paper is composed as follows: Section 2 explains the discriminant feature extraction method that can be used in an e-nose system and proposes a method for the configuration of combined features that is based on a discriminant analysis, whereby the discernment of the basic features that are extracted from the entire measurement section as well as the local samples sections is measured; Section 3 presents the results of the gas classification experiment; and, finally, Section 4 presents the conclusion.

Construction of Combined Features for Gas Classification
2.1.Gas Measurement.Figure 1 is a schematic diagram of the e-nose system that is used in this paper; additionally, the micromachined sensor array chips that were used in [20] were used for the gas measurement for this paper.The sensor array of this paper consists of 16 channels that are composed of carbon-black (CB) polymer composite sensors with an interdigitated electrode, microheater, and machined The resistance change of each polymer composite film was observed in response to the chemical-gas combination, and it was recorded every 0.1 sec and measured for a total of 200 sec.The entire measurement section can be divided into the following three sections: stabilization (30 sec), exposure (60 sec), and purge (1,100 sec).First, if a resistance signal is stabilized after a sensor array is placed in the chamber, gas is introduced for 60 sec before it discharges the residual gas, to the outside for the remaining 110 sec through the systemic flow control [27].The measured data is stored in a PC with the use of the DAQ6062E data acquisition (DAQ) board and LabVIEW (National Instrumentation, USA).A voltage divider operates from −10 V to 10 V and the gain of the 16 identical amplifiers was set to 10, which is approximately the maximum DAQ resolution [20].

Discriminant Feature Extraction.
Those features that are suitable for classification were extracted by using the LDA method, which is also one of the dimension-reduction methods for a typical classification, whereby gas data that is obtained through the sensor array was involved.When the -dimensional data samples x  ∈  ×1 ( = 1, . . ., ) that belong to one of the  classes are given, the LDA constitutes a feature space that reduces the scatter of samples in the same class and simultaneously furthers the distance between the class means.For this, the within-class scatter matrix   and the between-class scatter matrix   are first defined as follows [24]: where x  refers to the th sample that belongs to class   ;  and  are the numbers of the whole samples and classes, respectively; and   and  are the sample means that belong to class   and the whole samples, respectively.Then, the LDA method calculates the projection matrix  LDA = [w  1 , . . ., w    ] that is made up of the projection vector (w   ,  = 1, . . .,   ) from   and   , satisfying the following defined objective function: where w   s satisfy   w   =     w   that is obtained by calculating the eigenvectors of  −1    [24].In the case of high-dimensional data such as gas data (in this paper, x  ∈  32,000 ), however,  generally becomes larger than the rank of   (=  − ), and it is with respect to the SSS (Small Sample Size) problem [24] that   becomes singular.As a method to solve this problem, the PCA + LDA method [25] reduces the dimension of the data into  ≤  −  by performing the PCA method prior to the commencement of the LDA method.If the total scatter matrix   is defined as  3).To measure the usefulness of each basic feature in terms of the classification of patterns, the ReliefF [26] method, which is effective in selecting useful features to recognize a highdimensional face image like an electronic nose data [28], is used.The ReliefF method can distinguish both useful and nonuseful features depending on how effectively each feature can discern the samples that are near each other; for this purpose, the weighted vector A = [ 1 ,  2 , . . .,  4(−1) ]  that is the value of each feature is defined.For a randomly selected th sample (x  ), let the -nearest samples in the same class be Y Pool  x tot k ∈ R 32,000×1 x st k ∈ R 4,800×1 (2) Find H  s,  = 1, . . ., .In (4) of the above process, (  ) refers to the prior probability of class   [26,29].
After repeating this process for all of the training data samples x  ( = 1, 2, . . ., ), the final combined feature vector (y CF ), which will be used for the gas classification, is composed of features   's that correspond to large   .Through the previously mentioned procedure, we can obtain the combined features that robustly classify a gas even if noise is generated during the gas measurement process, because the unnecessary information in the noisy sections is removed and only the useful features selected from among the global features and local features with a high degree of discernment can be used in the classification process.The entire flow of the proposed method is presented in Figure 2.

Experimental Condition.
To ensure whether the proposed method effectively classifies gas in an e-nose, the classification performance for the VOC-measurement data includes evaluations of the following eight gas types: acetone, benzene, cyclohexane, ethanol, heptane, methanol, propanol, and toluene [20].The total of 160 samples that was used in the experiment comprises 20 collected samples for each type of gas.Each of the samples consists of the values that were measured at the 2,000 time points over 200 sec with a 10 Hz sampling rate for each channel.The measured values of the 16 channels were saved as the matrix of a 2,000 × 16 size before a lexicographic ordering operator was used to convert them into a 32,000-dimensional vector [6].To determine the effectiveness of the proposed method when noise occurs in the sensing data, the original data samples were mixed with Gaussian noises that have standard deviations of 2, 3, and 4, and the performance was evaluated (Figure 3).The classification performance of the 160 data samples was assessed using an eightfold cross-validation strategy [30], whereby a seed-value-based random-number generator was used to mix the 160 samples before they were then divided into eight sections to include samples of the same number.One sample per group, which is a total of 16 samples, was used as the test data and the remaining 140 samples were used as the training data.After performing the experiment eight times in the same way so that all of the data samples of each section were used as test data at least once, the average value of the classification rate was calculated as the final result.In this paper, to increase the statistical reliability of this eightfold cross-validation method, the classification performance was evaluated from the mean value of the results that were obtained by performing the process eight times repeatedly with use of eight different random-seed numbers.All of the data samples were normalized with the mean and standard deviation of the training data to produce a zero mean and a unit standard deviation.When the PCA + LDA method was used to extract the discriminant features, the dimension () was reduced by the PCA; the PCA method significantly affects the classification performance of the resultant discriminant features.In this experiment,  (=105) was set to take up 99% of the entire eigenvalue of   according to an investigation of the performance evaluation of different  values.The NN (Nearest Neighbor) method was used as a classifier and  2 norm was used for the calculation of the distance between two samples [6].

Classification Rate Evaluation.
First, the classification rates of the local features (y st , y ex , y pu ) that were extracted from three types of local samples (x st , x ex , x pu ) were compared.As shown in Figure 4, the noise-free original data shows high classification rates more than 97% in all of the local features.In Figures 4(b)-4(d), the sensor-based gas concentrates of the local feature of the exposure section show a relatively sound performance compared with the other sections; however, as the degree of noise for all of the local features intensified, the classification rates decreased rapidly.Figure 5 shows the comparison of the classification performances of the proposed combined features (y CF ), the FF feature (y FF ) [2], the PCA + LDA global feature (y tot ), the CC-PCA feature (y CC PCA ) [21], and the CPCA feature (y CC CPCA ) [22].The PCA method was applied to y CC PCA and y CC CPCA once more after the CC-PCA and CC-CPCA methods were performed.As shown in Figure 5(a), each method shows a sound performance more than 96% in noise-free data; however, as the degree of noise became more severe and while the classification performances of the other methods rapidly decreased, the proposed method maintained more than 90% of the classification rate, even with a Gaussian noise standard deviation of 4 (Figures 5(b)-5(d)).This maintenance level is due to the effect of the noise that is directly reflected in the classifier input when only global features or local features are used, whereas, in the proposed method, a wide variety of discriminant features were obtained by using global features in combination with a variety of local features.Through the feature selection process, only the features that are less affected by noise and have a sound discernment were selectively used as a classifier input, although some of the features are relatively polluted by noise.By using the proposed method that operates in the presence of noise more robustly than the other methods, the combined features were consequently configured.

Conclusions
Gas data measurement that uses sensors in an e-nose system can be classified into several stages according to the process.
The sensor response at each stage has different information, from which a variety of features for classification can be extracted.In this paper, three types of local features were extracted in the section that was divided into stabilization, exposure, and purge; these features were then used for the classification, whereby the combined features were configured with the global features that were extracted from the entire section.were selected, whereby the discernment of each feature was evaluated for the purpose of selection; the selected features were then used to construct the combined features.The proposed method is not limited by a particular feature extraction method.It can be used with a variety of features and the advantages of each feature can be utilized effectively.
For the eight types of gas data that were measured by a sensor array consist of 16 channels, the proposed method showed a sound classification performance; in particular, when noise is generated during the sensing process, the proposed method showed a classification performance that is more effective than those of the other methods.Given the capability to

Figure 1 :
Figure 1: Schematic diagram of the e-nose system.

𝑘
and let the -nearest samples in the different classes from Y Pool  in the feature space be "Hit" (H  ,  = 1, . . ., )

Figure 2 :( 1 )
Figure 2: Construction process flow of the combined features.

Figure 3 :
Figure 3: E-nose data without and with Gaussian noise.(a) E-nose data without noise.(b) Data with Gaussian noise (std 3).

Figure 4 :
Figure 4: Classification rates of local features for the e-nose data.(a) Original e-nose data.(b) E-nose data with Gaussian noise (std 2).(c) E-nose data with Gaussian noise (std 3).(d) E-nose data with Gaussian noise (std 4).

Figure 5 :
Figure 5: Comparison of classification rates between the proposed method and other methods.(a) Original e-nose data.(b) E-nose data with Gaussian noise (std 2).(c) E-nose data with Gaussian noise (std 3).(d) E-nose data with Gaussian noise (std 4).

Table 1 :
The CB polymer composites used in the sensor array.
membrane in each channel; in Table 1, the 16 kinds of CB polymer composites are indicated.
the projection matrix of the PCA + LDA method is  PCA+LDA =   LDA   PCA , where  PCA = arg max  |    | and  LDA = arg max  (|    PCA    PCA |/|    PCA    PCA |).If   (≤  − 1) of the projection vectors w PL  are selected according to the large eigenvalues among the projection vectors that consist of  PCA+LDA , the gas data sample x  is represented as the   -dimensional feature vector y  that consists of the discriminant features   's as follows: y  =   PCA+LDA x  = [ 1 ,  2 , . . .,    ] 600×1 are first obtained by, respectively, separating the measured values that only correspond to the only stabilization, exposure, and purge sections from the global sample x tot  ∈  32,000×1 that contains all of the measured values.The basic feature sets such as Y Pool  = [ 1 , . . .,  4(−1) ]  (= [ tot ,1 , . . .,  tot ,−1 ,  st ,1 , . . .,  st ,−1 ,  ex ,1 , . . .,  ex ,−1 ,   ) that have an element of 4( − 1) number are then composed by extracting the global features y tot  and the local features y st  , y ex  , and y .(3)2.3.Combined Feature Configuration Based on a DiscriminantAnalysis.In the proposed method, the local samples such as x st  ∈  4,800×1 , x ex  ∈  9,600×1 , and x pu  ∈  17,pu ,1 , . . .,  pu ,−1 ] pu  using ( Based on a discriminant analysis involving the individual global features and the local features, only those features with a large amount of discernment information y tot /y CC PCA /y CC CPCA /y FF /y CF ) y tot /y CC PCA /y CC CPCA /y FF /y CF ) ((