A Method for Selecting Optimal Number of Sensors to Improve the Credibility

With the development of sensors, it is now possible to embed many sensors within a confined space, which makes multisensor monitor and alarm systems feasible. Two important parameters of a monitor and alarm system are the false alarm rate and the missed alarm rate. In this work, a method for selecting the optimal number of sensors in a sensor array is presented to improve credibility. The influences of the weights, the false alarm rate and missed alarm rate of a single sensor, and the total number of sensors are discussed. An experimental setup was developed. Common monitoring strategies and the proposed optimal number of sensors strategy are compared graphically by their receiver operating characteristic curves and by the area under the receiver operating characteristic curve. The receiver operating characteristic curves show that the optimal number of sensors method performs best, and it has the highest area under the receiver operating characteristic curve (0.9631). This method may aid future users of monitor and alarm systems by providing an optimal number of sensors strategy for high credibility.


Introduction
With the development of sensor technologies, sensors have become smaller, less power hungry, and more resistant to interference, which makes it possible to embed many sensors within a confined space. Multisensor information fusion aims to process redundant or complementary information from the multiple sources provided by the sensors to achieve results that are not feasible with a single sensor [1,2]. The monitor and alarm system (MAS) based on multisensor information fusion technologies has been widely used in many fields, such as disease diagnosis [3,4], image fusion [1,5], environment monitoring [6], and security surveillance.
In security surveillance of high-risk industries, such as the traffic system [7], vibration fault diagnosis [8], recursive tracking [9], medical surveillance [10], and the monitoring of hazardous materials [11], credibility is one of the most important topics and has attracted extensive attention. A receiver operating characteristic (ROC) graph, which can combine the false alarm rate (FAR) and the missed alarm rate (MAR) into one evaluation criterion, is a graphical plot that illustrates the performance of a binary classifier system as its discrimination threshold is varied [12][13][14]. To compare classifiers, we may want to reduce the ROC performance to a single scalar value representing the expected performance. A common method is to calculate the area under the ROC curve (AUC) [15,16]. The AUC summarises the ROC curve as a single performance indicator: the higher the AUC value, the better the performance of the method. Credibility in the field of security surveillance means a low FAR and a low MAR of the MAS [17]. A false alarm (FA) means that the MAS is triggered when it should not be triggered, while a missed alarm (MA) means that the MAS is not triggered when it should be triggered [18]. A smaller FAR or a smaller MAR alone can easily be obtained with a sensor array and a suitable alarm strategy in an MAS. However, FAR and MAR are coupled, and it is difficult to minimize both simultaneously in one MAS. Obtaining a small FAR and a small MAR simultaneously is the goal of a high-credibility MAS; thus the tradeoff between FAR and MAR is a fundamental problem that has attracted extensive interest from many researchers [19][20][21][22][23].
The FAR and MAR of an MAS are determined by the total number of sensors (TNS), the false alarm rate and the missed alarm rate of each sensor, and the alarm strategy. In this work, to estimate the credibility of an MAS comprehensively, the equivalent false alarm rate (EFAR), which reflects the FAR and MAR as well as their weights, is defined. The weights denote the loss attributed to the FAR relative to the MAR. A quantitative method to select the optimal number of sensors (ONS) that minimizes the EFAR is given. The results show that the proposed ONS strategy can improve the credibility when the TNS of the MAS is fixed.

The Method for Selecting Optimal Number of Sensors
The parameters $P_{fa}$ and $P_{ma}$ denote the false alarm rate and the missed alarm rate of a single sensor in the sensor array. In ROC space, $P_{fa}$ is the false positive rate and $P_{ma}$ is one minus the true positive rate. In a practical application, the values of $P_{fa}$ and $P_{ma}$ can be estimated by averaging over the sensors, where $P_{fa}$ is the number of negatives incorrectly classified divided by the total number of negatives and $P_{ma}$ is the number of positives incorrectly classified divided by the total number of positives.
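The two ratios above can be computed directly from labelled records of one sensor. The following is a minimal sketch, assuming each record pairs a ground-truth flag with the sensor's alarm output; the function name `single_sensor_rates` and the sample data are hypothetical and are not taken from the experiment.

```python
def single_sensor_rates(truth, alarms):
    """Estimate (p_fa, p_ma) for one sensor from paired boolean records.

    truth[i]  is True when an event was really present (a positive).
    alarms[i] is True when the sensor raised an alarm for that record.
    """
    negatives = [a for t, a in zip(truth, alarms) if not t]
    positives = [a for t, a in zip(truth, alarms) if t]
    if not negatives or not positives:
        raise ValueError("need at least one negative and one positive record")
    # False alarm rate: negatives incorrectly classified / total negatives.
    p_fa = sum(1 for a in negatives if a) / len(negatives)
    # Missed alarm rate: positives incorrectly classified / total positives.
    p_ma = sum(1 for a in positives if not a) / len(positives)
    return p_fa, p_ma


# Hypothetical records: 4 negatives (1 false alarm), 5 positives (2 misses).
truth = [False, False, False, False, True, True, True, True, True]
alarms = [False, True, False, False, True, False, True, True, False]
print(single_sensor_rates(truth, alarms))
```

Averaging the per-sensor results over the array then gives the array-level estimates described in the text.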
We propose an EFAR model under the following assumption: the sum of $P_{ma}$ and $P_{fa}$ is less than 1. The expression $1 - P_{ma}$ (i.e., the true positive rate in ROC space) represents the correct alarm decision rate, so the assumption $P_{fa} < 1 - P_{ma}$ demands that each sensor in the array produces useful information for detection. In ROC space, this assumption means that the (false positive rate, true positive rate) pairs should lie in the upper left triangular region, because pairs on the diagonal correspond to random guessing and pairs in the lower right triangle perform even worse.
We choose the alarm strategy as follows: the output of the MAS is positive if at least $k$ sensors in the system give positive results. Here $k$ is the selected number of sensors (SNS). Assuming that the TNS in the MAS is $n$, $k$ ranges from 1 to $n$. Obviously, $k = 1$ gives the lowest MAR but the highest FAR, and $k = n$ gives the lowest FAR but the highest MAR. To study the optimal $k$ (the ONS) of the system, the FAR and MAR associated with $k$ are derived from Bernoulli trials as follows:

$$\mathrm{FAR}[k] = \sum_{i=k}^{n} \binom{n}{i} P_{fa}^{\,i} (1 - P_{fa})^{n-i}, \qquad \mathrm{MAR}[k] = \sum_{i=n-k+1}^{n} \binom{n}{i} P_{ma}^{\,i} (1 - P_{ma})^{n-i}, \tag{1}$$

where $\binom{n}{i}$ is the number of combinations of $i$ items selected from a set of $n$. A false alarm can occur only when the output of the MAS is in the alarm state; therefore, $\mathrm{FAR}[k]$ sums the probabilities that $k$ or more sensors are falsely in the alarm state. In contrast, a missed alarm can occur only when the output of the MAS is in the nonalarm state, and consequently $\mathrm{MAR}[k]$ sums the probabilities that more than $n - k$ sensors are wrongly in the nonalarm state, that is, that fewer than $k$ sensors are in the alarm state.
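The two binomial tail sums can be evaluated numerically. The sketch below is an illustration under the stated model (independent sensors with identical single-sensor rates, alarm when at least `k` of `n` sensors alarm); the function names `far` and `mar` and the sample values are hypothetical.

```python
from math import comb


def far(k, n, p_fa):
    """System false alarm rate: at least k of n sensors alarm falsely."""
    return sum(comb(n, i) * p_fa**i * (1 - p_fa) ** (n - i)
               for i in range(k, n + 1))


def mar(k, n, p_ma):
    """System missed alarm rate: more than n - k of n sensors miss the event."""
    return sum(comb(n, i) * p_ma**i * (1 - p_ma) ** (n - i)
               for i in range(n - k + 1, n + 1))


# Hypothetical array of 4 sensors with p_fa = 0.1 and p_ma = 0.2.
n, p_fa, p_ma = 4, 0.1, 0.2
for k in range(1, n + 1):
    print(k, round(far(k, n, p_fa), 4), round(mar(k, n, p_ma), 4))
```

The printed table makes the tradeoff visible: the FAR column falls and the MAR column rises as `k` grows from 1 to `n`.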
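The objective function and constraints are as follows:

$$\min_{k}\ \mathrm{EFAR}(k) = w_{fa}\,\mathrm{FAR}[k] + w_{ma}\,\mathrm{MAR}[k] \quad \text{s.t.} \quad w_{fa} + w_{ma} = 1,\ \ 0 < w_{fa} < 1,\ \ P_{fa} + P_{ma} < 1,\ \ k \in \{1, 2, \ldots, n\}, \tag{2}$$

where the coefficients $w_{fa}$ and $w_{ma}$ denote the weights of FAR and MAR, respectively. The EFAR is the normalized weighted sum of the FAR and MAR, with $w_{fa} = 1 - w_{ma}$. A larger $w_{fa}$ means that the FAR is more important than the MAR in the monitoring strategy of the MAS.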
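Because $k$ takes only $n$ integer values, the objective can also be minimised by direct enumeration, which is a useful cross-check on any analytical solution. This is a minimal sketch, not the solution method of the paper; the function names and the parameter values are hypothetical.

```python
from math import comb


def efar(k, n, p_fa, p_ma, w_fa):
    """Weighted sum of system FAR and MAR for the k-of-n alarm rule."""
    far = sum(comb(n, i) * p_fa**i * (1 - p_fa) ** (n - i)
              for i in range(k, n + 1))
    mar = sum(comb(n, i) * p_ma**i * (1 - p_ma) ** (n - i)
              for i in range(n - k + 1, n + 1))
    return w_fa * far + (1 - w_fa) * mar  # w_ma = 1 - w_fa


def ons_by_search(n, p_fa, p_ma, w_fa):
    """Return the k in 1..n that minimises EFAR (first one on ties)."""
    return min(range(1, n + 1), key=lambda k: efar(k, n, p_fa, p_ma, w_fa))


# Hypothetical example: 5 sensors, symmetric rates, equal weights.
k_best = ons_by_search(5, 0.2, 0.2, 0.5)
print(k_best, efar(k_best, 5, 0.2, 0.2, 0.5))
```

Enumeration costs only $n$ evaluations, so it stays practical for any realistic array size.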
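The problem in (2) is a nonlinear program with an integer variable. To solve it, consider an index $k$. If the index $k$ is the ONS of the MAS, then it ranges from 1 to $n - 1$ in the forward difference and from 2 to $n$ in the backward difference. A method based on the recurrence relations of the forward difference and the backward difference is proposed for selecting the ONS. Suppose that $\mathrm{EFAR}(k)$ denotes the solution of (2); then the expressions are as follows:

$$\mathrm{EFAR}(k+1) - \mathrm{EFAR}(k) \ge 0, \qquad \mathrm{EFAR}(k-1) - \mathrm{EFAR}(k) \ge 0. \tag{3}$$

According to the first formula in (3), the expression can be changed into

$$w_{ma} \binom{n}{n-k} P_{ma}^{\,n-k} (1 - P_{ma})^{k} - w_{fa} \binom{n}{k} P_{fa}^{\,k} (1 - P_{fa})^{n-k} \ge 0. \tag{4}$$

As $\binom{n}{k} = \binom{n}{n-k}$, (4) can be changed into

$$w_{ma} P_{ma}^{\,n-k} (1 - P_{ma})^{k} \ge w_{fa} P_{fa}^{\,k} (1 - P_{fa})^{n-k}. \tag{5}$$

Further, (5) can be changed into

$$\left( \frac{(1 - P_{ma})(1 - P_{fa})}{P_{ma} P_{fa}} \right)^{k} \ge \frac{w_{fa}}{w_{ma}} \left( \frac{1 - P_{fa}}{P_{ma}} \right)^{n}. \tag{6}$$

Taking natural logarithms on both sides, (6) can be changed into

$$k \ln \frac{(1 - P_{ma})(1 - P_{fa})}{P_{ma} P_{fa}} \ge \ln \frac{w_{fa}}{w_{ma}} + n \ln \frac{1 - P_{fa}}{P_{ma}}. \tag{7}$$

According to the assumption, the sum of $P_{ma}$ and $P_{fa}$ is less than 1, so that $1 - P_{ma} > P_{fa}$ and $1 - P_{fa} > P_{ma}$. It follows that $(1 - P_{ma})(1 - P_{fa}) / (P_{ma} P_{fa}) > 1$ and $\ln\bigl((1 - P_{ma})(1 - P_{fa}) / (P_{ma} P_{fa})\bigr) > 0$. Therefore, the result of the forward difference is as follows:

$$k \ge \frac{\ln (w_{fa} / w_{ma}) + n \ln \bigl((1 - P_{fa}) / P_{ma}\bigr)}{\ln \bigl((1 - P_{ma})(1 - P_{fa}) / (P_{ma} P_{fa})\bigr)}. \tag{8}$$

According to the second formula in (3), the expression can be changed into

$$w_{fa} \binom{n}{k-1} P_{fa}^{\,k-1} (1 - P_{fa})^{n-k+1} - w_{ma} \binom{n}{n-k+1} P_{ma}^{\,n-k+1} (1 - P_{ma})^{k-1} \ge 0. \tag{9}$$

As $\binom{n}{k-1} = \binom{n}{n-k+1}$, (9) can be changed into

$$w_{fa} P_{fa}^{\,k-1} (1 - P_{fa})^{n-k+1} \ge w_{ma} P_{ma}^{\,n-k+1} (1 - P_{ma})^{k-1}. \tag{10}$$

Further, (10) can be changed into

$$\left( \frac{(1 - P_{ma})(1 - P_{fa})}{P_{ma} P_{fa}} \right)^{k-1} \le \frac{w_{fa}}{w_{ma}} \left( \frac{1 - P_{fa}}{P_{ma}} \right)^{n}. \tag{11}$$

Taking natural logarithms on both sides, (11) can be changed into

$$(k - 1) \ln \frac{(1 - P_{ma})(1 - P_{fa})}{P_{ma} P_{fa}} \le \ln \frac{w_{fa}}{w_{ma}} + n \ln \frac{1 - P_{fa}}{P_{ma}}. \tag{12}$$

According to the assumption, the sum of $P_{ma}$ and $P_{fa}$ is less than 1, so that again $(1 - P_{ma})(1 - P_{fa}) / (P_{ma} P_{fa}) > 1$ and $\ln\bigl((1 - P_{ma})(1 - P_{fa}) / (P_{ma} P_{fa})\bigr) > 0$.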
Therefore, the result of the backward difference is as follows:

$$k \le \frac{\ln (w_{fa} / w_{ma}) + n \ln \bigl((1 - P_{fa}) / P_{ma}\bigr)}{\ln \bigl((1 - P_{ma})(1 - P_{fa}) / (P_{ma} P_{fa})\bigr)} + 1. \tag{13}$$

The results of (3) are as follows:

$$k_0 \le k \le k_0 + 1, \tag{14}$$

where $k_0$ is the real-valued threshold that determines the ONS, which is given as follows:

$$k_0 = \frac{\ln (w_{fa} / w_{ma}) + n \ln \bigl((1 - P_{fa}) / P_{ma}\bigr)}{\ln \bigl((1 - P_{ma})(1 - P_{fa}) / (P_{ma} P_{fa})\bigr)}. \tag{15}$$

Then the solutions of (2) can be divided into two situations, namely, $k_0$ is an integer or not an integer:

$$\mathrm{ONS} = \begin{cases} k_0 \ \text{or}\ k_0 + 1, & k_0 \text{ is an integer}, \\ \lceil k_0 \rceil, & k_0 \text{ is not an integer}. \end{cases} \tag{16}$$

The EFAR denotes the comprehensive effect of FAR and MAR. In what follows, a case study of the relationship between FAR, MAR, and EFAR is presented. We assume that the TNS $n$ is 10, the weight $w_{fa}$ is 0.5, and the single-sensor false alarm rate $P_{fa}$ and missed alarm rate $P_{ma}$ are 0.3 and 0.4, respectively. Figure 1 shows the FAR, MAR, and EFAR as functions of the SNS. With the increase of SNS, the FAR decreases (Figure 1(a)) and the MAR increases (Figure 1(b)). For a fixed TNS, the feasible SNS is an integer greater than or equal to 1 and less than or equal to the TNS $n$. The more sensors are selected, the smaller the FAR and the bigger the MAR of the MAS. The MAR increases because the probability that many sensors are in the alarm state at the same time is smaller than the probability that only a few are, so a bigger SNS raises the probability of an MA. The tradeoff between FAR and MAR is captured by the EFAR, which is concave upward over the feasible range (Figure 1(c)). The minimum value of the EFAR is 0.16 (Figure 1(c)), at SNS = 5, which is much less than the EFAR at SNS = 1 or SNS = 10.
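The case study can be checked numerically. The sketch below assumes the closed-form threshold takes the form $k_0 = [\ln(w_{fa}/w_{ma}) + n \ln((1 - P_{fa})/P_{ma})] / \ln((1 - P_{ma})(1 - P_{fa})/(P_{ma} P_{fa}))$ with the ONS obtained by rounding $k_0$ up, and compares it with plain enumeration over $k$. The function names are hypothetical, and clipping the result to the range 1 to $n$ is an added safeguard rather than part of the derivation.

```python
from math import ceil, comb, log


def efar(k, n, p_fa, p_ma, w_fa):
    """Weighted sum of system FAR and MAR for the k-of-n alarm rule."""
    far = sum(comb(n, i) * p_fa**i * (1 - p_fa) ** (n - i)
              for i in range(k, n + 1))
    mar = sum(comb(n, i) * p_ma**i * (1 - p_ma) ** (n - i)
              for i in range(n - k + 1, n + 1))
    return w_fa * far + (1 - w_fa) * mar


def k0(n, p_fa, p_ma, w_fa):
    """Real-valued threshold from the forward/backward difference argument."""
    w_ma = 1 - w_fa
    numerator = log(w_fa / w_ma) + n * log((1 - p_fa) / p_ma)
    denominator = log((1 - p_ma) * (1 - p_fa) / (p_ma * p_fa))
    return numerator / denominator


def ons_closed_form(n, p_fa, p_ma, w_fa):
    """Round k0 up and clip to 1..n (the clipping is an assumption)."""
    return min(max(ceil(k0(n, p_fa, p_ma, w_fa)), 1), n)


# Case-study values from the text: n = 10, w_fa = 0.5, p_fa = 0.3, p_ma = 0.4.
n, p_fa, p_ma, w_fa = 10, 0.3, 0.4, 0.5
k_closed = ons_closed_form(n, p_fa, p_ma, w_fa)
k_search = min(range(1, n + 1), key=lambda k: efar(k, n, p_fa, p_ma, w_fa))
print(k_closed, k_search, round(efar(k_closed, n, p_fa, p_ma, w_fa), 3))
```

Under these assumptions both routes select SNS = 5 with an EFAR of about 0.158, consistent with the minimum of 0.16 reported for Figure 1(c).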

The Influences of Parameters
There are four parameters in the expression of the ONS (15), namely, the weight $w_{fa}$, the false alarm rate of each sensor $P_{fa}$, the missed alarm rate of each sensor $P_{ma}$, and the total number of sensors $n$. The factors affecting the EFAR fall into three groups: the weight, the TNS, and the single-sensor rates $P_{fa}$ and $P_{ma}$.
With the TNS, $P_{fa}$, and $P_{ma}$ held constant, the relationship between the EFAR and the weight $w_{fa}$ can be obtained. An example is given with TNS = 10, $P_{fa}$ = 0.4, and $P_{ma}$ = 0.4, and the ONS and the minimum EFAR (MiEFAR) as functions of $w_{fa}$ are shown in Figure 2. Figure 2(a) shows that the ONS increases with the weight $w_{fa}$. The weight $w_{fa}$ ranges over the open interval from 0 to 1, and a larger $w_{fa}$ indicates that more attention is paid to the FAR. The increasing ONS means that, as $w_{fa}$ increases, more sensors must be in the alarm state before the MAS outputs an alarm. Figure 2(b) illustrates the MiEFAR as a function of the weight $w_{fa}$. As $w_{fa}$ increases from 0.1 to 0.9, the MiEFAR increases initially but decreases later (Figure 2(b)). The FAR and MAR of the MAS are equally important at $w_{fa}$ = 0.5. The maximum of the MiEFAR at this point shows that it is harder to reduce the FAR and the MAR simultaneously when they are equally important. Fortunately, in many real-life applications, the weights of FAR and MAR are not equal [19]. If $w_{fa}$ = 0.1 or $w_{fa}$ = 0.9, the EFAR is mainly determined by only one of the two rates, and that rate is relatively easy to reduce by adjusting the SNS. In particular, at $w_{fa}$ = 0.1, the MiEFAR is nearly a third of that at $w_{fa}$ = 0.5.
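The weight sweep in this example can be reproduced by enumeration. The following is a sketch under the k-of-n binomial model with independent, identical sensors; the function names are hypothetical, and the grid of weights from 0.1 to 0.9 is chosen to match the range quoted in the text.

```python
from math import comb


def efar(k, n, p_fa, p_ma, w_fa):
    """Weighted sum of system FAR and MAR for the k-of-n alarm rule."""
    far = sum(comb(n, i) * p_fa**i * (1 - p_fa) ** (n - i)
              for i in range(k, n + 1))
    mar = sum(comb(n, i) * p_ma**i * (1 - p_ma) ** (n - i)
              for i in range(n - k + 1, n + 1))
    return w_fa * far + (1 - w_fa) * mar


def ons_and_miefar(n, p_fa, p_ma, w_fa):
    """Return (ONS, MiEFAR) found by trying every k from 1 to n."""
    k = min(range(1, n + 1), key=lambda j: efar(j, n, p_fa, p_ma, w_fa))
    return k, efar(k, n, p_fa, p_ma, w_fa)


# Example values from the text: TNS = 10, p_fa = 0.4, p_ma = 0.4.
weights = [i / 10 for i in range(1, 10)]
results = {w: ons_and_miefar(10, 0.4, 0.4, w) for w in weights}
for w, (k, value) in results.items():
    print(w, k, round(value, 4))
```

At $w_{fa}$ = 0.5 the two candidates $k$ = 5 and $k$ = 6 give the same EFAR by symmetry, so the enumeration may report either one.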
Figure 3 shows the MiEFAR as a function of TNS. With the increase of TNS, the MiEFAR decreases. Regardless of the values of $w_{fa}$, $P_{fa}$, and $P_{ma}$, the MiEFAR decreases as the TNS increases, which indicates the usefulness of increasing the TNS of the MAS. The differences between the weights $w_{fa}$ = 0.1 and 0.5 show that paying unequal attention to FAR and MAR yields a faster decrease of the MiEFAR. The reason is that a smaller MAR reduces the EFAR more efficiently when $w_{fa}$ = 0.1, although it may cause the FAR to increase. In contrast, reducing the MAR or the FAR is equally efficient for reducing the EFAR when $w_{fa}$ = 0.5. Figure 4 shows the MiEFAR as a function of $P_{fa}$ and $P_{ma}$. With the increase of either $P_{fa}$ or $P_{ma}$, the MiEFAR increases. The parameters $P_{fa}$ and $P_{ma}$ describe the performance of a single sensor in the sensor array, and a lower value means better performance. The curve clusters in Figure 4 illustrate that if the performance of a single sensor is better (i.e., smaller $P_{fa}$ or $P_{ma}$), the sensor array will perform better.
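The dependence of the MiEFAR on the TNS can be illustrated with a short sweep. This is a sketch under the k-of-n binomial model; the parameter values reuse the Figure 1 case ($P_{fa}$ = 0.3, $P_{ma}$ = 0.4, $w_{fa}$ = 0.5) as an assumption and are not necessarily those behind Figure 3, and the function names are hypothetical.

```python
from math import comb


def efar(k, n, p_fa, p_ma, w_fa):
    """Weighted sum of system FAR and MAR for the k-of-n alarm rule."""
    far = sum(comb(n, i) * p_fa**i * (1 - p_fa) ** (n - i)
              for i in range(k, n + 1))
    mar = sum(comb(n, i) * p_ma**i * (1 - p_ma) ** (n - i)
              for i in range(n - k + 1, n + 1))
    return w_fa * far + (1 - w_fa) * mar


def miefar(n, p_fa, p_ma, w_fa):
    """Minimum EFAR over all feasible k for an array of n sensors."""
    return min(efar(k, n, p_fa, p_ma, w_fa) for k in range(1, n + 1))


# Assumed values: p_fa = 0.3, p_ma = 0.4, w_fa = 0.5, TNS from 1 to 10.
curve = [miefar(n, 0.3, 0.4, 0.5) for n in range(1, 11)]
for n, value in enumerate(curve, start=1):
    print(n, round(value, 4))
```

For these values the curve falls from 0.35 with one sensor to about 0.158 with ten, in line with the trend described for Figure 3.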

Experimental Results on Gas Sensor Array
The sensor array, which was placed in a 20 L test chamber, was composed of ten TGS 2620 metal oxide semiconductor gas sensors. The sensor resistances were acquired with a half-bridge configuration and then collected by a 34980A multifunction switch/measure unit via an electrical interface on the chamber.
A computer-supervised continuous flow system was built to generate the desired gas concentrations in a highly reproducible way. The experimental setup is shown in Figure 5. The gas cylinders provided standard dry gases; in this paper, the target gas was methane and the carrier gas was dry air.
An in-house LabVIEW program running on a PC was used to control the mass flow controller (MFC) and to collect data from the 34980A. The test concentration was set to 5000 ppm, since this concentration is usually treated as the warning concentration for methane explosion. The gas flow was set to 300 mL/min and kept constant. The measurement was divided into two steps. The first step was to estimate the parameters $P_{fa}$ and $P_{ma}$ by averaging the false alarm rates and missed alarm rates of the individual sensors.
For a single sensor, the false alarm rate $P_{fa}$ is the number of negatives incorrectly classified divided by the total number of negatives, and the missed alarm rate $P_{ma}$ is the number of positives incorrectly classified divided by the total number of positives. Second, using the theoretical values of (15) and (16), the ONS of the sensor array was obtained for each threshold (ranging from 4800 ppm to 5200 ppm).
The EFARs were tallied as the TNS changed from 1 to 10. The ONS method presented in this paper was compared with the following common monitoring strategies. (a) All negative resulting negative (ANRN) means that the output of the MAS is in the nonalarm state only if the outputs of all sensors are in the nonalarm state, which is equivalent to $k = 1$. This strategy pays more attention to reducing the MAR and may result in a high FAR. (b) All positive resulting positive (APRP) means that the output of the MAS is in the alarm state only if the outputs of all sensors are in the alarm state, which corresponds to $k = n$. This strategy pays more attention to reducing the FAR and may result in a high MAR. (c) The average strategy (AS) means that the SNS is the median of the TNS of the sensor array, which corresponds to $k = (n + 1)/2$ when the TNS is odd. When the TNS is even, there are two variants, namely, AS1 and AS2: AS1 takes the middle value as the SNS, and AS2 takes the middle value plus one. This strategy takes both FAR and MAR into account equally.
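The baseline strategies differ only in the value of $k$ they pass to the same k-of-n decision rule. The sketch below encodes that mapping; it assumes that the "middle value" for an even TNS is $n/2$, which the text does not state explicitly, and the function names are hypothetical.

```python
def selected_number(strategy, n):
    """Map a baseline strategy name to its selected number of sensors k."""
    if strategy == "ANRN":   # any single alarming sensor triggers the MAS
        return 1
    if strategy == "APRP":   # all sensors must alarm
        return n
    if strategy in ("AS1", "AS2"):
        if n % 2 == 1:       # odd TNS: both variants use the median
            return (n + 1) // 2
        # Even TNS: assumed middle value n/2 for AS1, n/2 + 1 for AS2.
        return n // 2 if strategy == "AS1" else n // 2 + 1
    raise ValueError(f"unknown strategy: {strategy}")


def mas_alarm(sensor_alarms, k):
    """MAS output: alarm when at least k sensors are in the alarm state."""
    return sum(1 for a in sensor_alarms if a) >= k


# Hypothetical snapshot of ten sensors, five of which are alarming.
snapshot = [True] * 5 + [False] * 5
for name in ("ANRN", "APRP", "AS1", "AS2"):
    k = selected_number(name, len(snapshot))
    print(name, k, mas_alarm(snapshot, k))
```

With five of ten sensors alarming, ANRN and AS1 raise the alarm while AS2 and APRP do not, which shows how strongly the outcome depends on the chosen $k$.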
Our objective is to assess the performance of the MAS under the four common monitoring strategies, namely, ANRN, APRP, AS1, and AS2, and under the proposed ONS strategy. To summarise the performance of the ONS, ROC curves and AUC values are computed over a set of different thresholds. Ten ROC curves are plotted for the threshold settings (ranging from 4800 ppm to 5200 ppm) in Figure 6(a), and five ROC curves are plotted in Figure 6(c), where the solid line is for the ONS, the dash-dotted line is for the ANRN, the dash-dot-dotted line is for the APRP, the dashed line is for the AS1, and the dotted line is for the AS2. Based on the theoretical values of (15) and (16), the optimised parameters of the ONS vary across thresholds, while those of the other strategies do not. As a result, the true positive rate of the ONS method increases much faster than those of the other four methods in this experiment. Therefore, the AUC of the ONS method is the highest among the five strategies. A higher AUC value indicates a better classification performance. Figure 6(c) shows graphically that the ONS method performs best, and Figure 6(d) shows that the ONS method has the highest AUC value (0.9631).
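An AUC value can be obtained from a set of ROC operating points with the trapezoidal rule. The sketch below is a generic illustration, not the computation used for Figure 6; the function name `auc_trapezoid` and the sample points are hypothetical.

```python
def auc_trapezoid(points):
    """Area under an ROC curve given (false positive rate, true positive rate) pairs."""
    # Anchor the curve at the trivial operating points (0, 0) and (1, 1),
    # then sort so the false positive rate is non-decreasing.
    pts = sorted(set(points) | {(0.0, 0.0), (1.0, 1.0)})
    return sum((x1 - x0) * (y0 + y1) / 2
               for (x0, y0), (x1, y1) in zip(pts, pts[1:]))


# Hypothetical operating points, one per alarm threshold.
roc_points = [(0.1, 0.6), (0.3, 0.9)]
print(auc_trapezoid(roc_points))
```

A curve that hugs the upper left corner yields an AUC close to 1, while the diagonal of random guessing yields 0.5.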

Conclusions
The development of sensor technologies makes higher credibility possible by increasing the number of sensors in the MAS. Credibility is one of the most important properties of the MAS, especially in high-risk industries. Using a sensor array is a potentially efficient approach to enhancing the credibility, but it should be combined with an efficient monitoring strategy. This work provides an analytical method to select the ONS for alarm in the MAS based on four parameters, namely, the false alarm rate of one sensor $P_{fa}$, the missed alarm rate of one sensor $P_{ma}$, the weight $w_{fa}$, and the TNS $n$. A suitable ONS in the monitoring strategy releases the potential of the MAS with the sensor array to improve the credibility. The results show the effectiveness of the credibility enhancement in the MAS. The weight $w_{fa}$ adjusts the ONS: when more attention is paid to the FAR, a bigger ONS is taken if the TNS is constant. With the increase of TNS, a smaller EFAR can be obtained, which shows the effectiveness of using a larger number of sensors together with the proposed approach in situations demanding higher credibility. An experimental setup containing ten gas sensors was developed. The ROC curves and the AUC values of ANRN, APRP, AS1, AS2, and the proposed approach show the effectiveness of the method. The results of this work have potential application in providing the ONS of a monitoring sensor array, especially in high-risk industries.

Figure 1: False alarm rate (FAR), missed alarm rate (MAR), and equivalent false alarm rate (EFAR) as a function of the selected number of sensors (SNS). (a) The curve of the false alarm rate (FAR); (b) the curve of the missed alarm rate (MAR); (c) the curve of the equivalent false alarm rate (EFAR).

Figure 2: Optimal number of sensors (ONS) and minimum equivalent false alarm rate (MiEFAR) as a function of the weight $w_{fa}$. (a) Optimal number of sensors (ONS); (b) minimum equivalent false alarm rate (MiEFAR).

Figure 5: The experimental setup: ten metal oxide semiconductor gas sensors TGS 2620 were installed in the middle of the test chamber.