Improving the Performance Metric of Wireless Sensor Networks with Clustering Markov Chain Model and Multilevel Fusion

The paper proposes a performance metric evaluation for a distributed detection wireless sensor network with respect to IEEE 802.15.4 standard. A distributed detection scheme is considered with presence of the fusion node and organized sensors into the clustering and non-clustering networks. Sensors are distributed in clusters uniformly and nonuniformly and network hasmultilevel fusion centers. Fusion centers act as heads of clusters for decisionmaking based onmajority-like received signal strength (RSS) with comparison the optimized value of the common threshold. IEEE 802.15.4 Markov chain model derived the performance metric of proposed network architecture with MAC, PHY cross-layer parameters, and Channel State Information (CSI) specifications while it is including Path-loss, Modulation, Channel coding and Rayleigh fading. Simulation results represent significant enhancement on performance of network in terms of reliability, packet failure, average delay, power consumption, and throughput.


Introduction
In the recent years, employments of wireless sensor networks (WSNs) have increased in many aspects of modern lifestyle.Those applications have motivated the researchers around the world to attempt into this field and investigate Quality of Service (QoS) and improve performance and efficiency of network.Usually, wireless sensor networks are supposed to be in harsh environments; consequently, performance metric evaluation at the real situation is difficult, where human intervention for evaluating process, even maintenance, repair, or fix purposes are in jeopardy.Hence, performance evaluation based on the mathematical model of network and simulation is highly considered.Sometimes controlling a process in the large scale needs sensing a unique phenomenon of interest with several sensors.An actuator reacts precisely in relation to decision which is made based on received signals from sensors.Fusion of multiple sensing signals makes a decision more accurate than just one sensor and consequently increases system efficiency.
To address problem, a novel performance evaluation framework would be proposed.Mathematical model framework of a decentralized distributed detection is studied in cluster-based network with a Markov chain model for IEEE 802.15.4 Medium Access Control (MAC) with respect to CSMA/CA mechanism interplay by physical layer and channel state information.The framework investigates appropriated strategies by configuration of wireless sensor nodes based on the optimal tuning of IEEE 802.15.4 MAC and PHY layer key parameters [1].Head node of each cluster is called Fusion Center (FC).Decision making at fusion node performs with respect to majority-like reception of RSS with Maximum-Likelihood Test.
Performance metric is evaluated for a clustering network topology with respect to a Markov chain model for CSMA/CA medium access control which proposed in [2] for a single node.Model describes a generalized analytical of the slotted CSMA/CA mechanism of beacon-enabled IEEE 802.15.4 with retry limits for each packet transmission.Behavior of the Markov model proposed at [2] is Mathematical Problems in Engineering describing CSMA/CA algorithm for a single node within star network with  sensor nodes whereas our attempt updates performance metric equations with clustering topology and is accompanied by FCs.Model in [2] is only considered to packet collision probability as case of loss.Nevertheless, physical layer and channel state are provoking factors to loss indeed [3].Therefore, physical-layer and CSI specification such as modulation and channel coding are utilized through the equations as a probability that denotes with ( csi ).Network is supposed to be high data rate generation for assessment of performance.Simulation is carried out to represent probability of decision error at FC (  ) in a clustered network with significant enhancement on performance metric in terms of reliability (  ), packet failure (  ), average delay (E (av)   ), power consumption with considering different operation modes, idle (  ), sensing ( sc ), transmission (  ) and receiving (  ), and also Network aggregation throughput (  ).

Related Work
In the literature, for instance see [4] and the references therein, wireless sensor network is studied with a small amount of sensors and low signal to noise ratio (SNR), distributed detection, and decision making fusion rules carried out on multi-bit knowledge of local detecting sensors with Monte-Carlo simulation methods.The performance of proposed decision fusion rules is integrated with parameters such as channel Rayleigh fading and adaptive Gaussian noise.In [5,6], the authors with respect to similar field of efforts in [7], proposed a simulation-based analysis impact of data fusion mechanisms in a Zigbee sensor network.It is used to monitor a particular constant binary phenomenon and evaluated performance indicators of interest, for example, Bit Error Rate (BER) and networking oriented (delay and aggregate throughput).In [8,9] a distributed detection (DD) system is considered for multiple sensors/detectors work, collaboratively and the fusion center is responsible for the final decision-making task based on information gathered from local sensors; moreover, the integration of wireless channel conditions in algorithm design is also taken into the account (also see [10,11]).In [12], an important channel dynamic is well defined; their studies are represented by the behavior of a real link impact in low-power wireless networks.In particular, there is a large transitional region in wireless link quality which is characterized by significant levels of unreliability and asymmetry, significantly impacting on performance of higher-layer protocols.In [3], the authors used the first way to better understand IEEE 802.15.4 standard.Indeed, they provided a comprehensive model, able more faithfully to mimic the functionalities of this standard at the PHY and MAC layers.They have proposed a combination of two relevant models for the two layers.The PHY layer behavior is reproduced by a mathematical framework, which is based on radio and channel models, in order to quantify link reliability.In [2,13,14] the authors proposed a generalized analysis of the IEEE 802.15.4 medium access control (MAC) protocol with focus on CSMA/CA algorithm in terms of reliability, delay, and energy consumption (for more see [15,16]).The rest of this paper is as follows.In Section 3, we will describe the analytical framework to evaluate performance metric.This section consists of the several subsections.In Section 4 we will represent simulation results and finally Section 5 would conclud the paper.

Problem Framework
In this section, we investigate the problem of decentralized distribution detection particularly when the sensor nodes detect a constant binary phenomenon.Sensing data packages and forwards to access point (AP) through intermediate fusion center (FC).Decision making fusion rule performs at FC with majority-like signal power level reception compared to an optimized threshold.Two ideal and noisy (nonideal) channels assume and channel state information (CSI) considers with its impacts on decision-making fusion rule Probability of decision error measures at FC versus signal to noise ratio with modulation and channel coding influences.Sensor nodes distribution at each cluster is supposed to be uniform and nonuniform.
The rest of section is organized as Sections 3.1 and 3.2 depict sensing model and distributed detection in Parallel Fusion Architecture, respectively and Section 3.3 describes distributed detection in clustered Sensor Networks.Section 3.4 comprises communication channel state information such as The Rayleigh fading, path-loss and modulation, and channel coding.Section 3.5 describes medium access control role on clustered network and its performance metric equations with presence FC and impacts of CSI.

The Sensing Model.
According to the stochastic geometry of sensing model, distribution of the nodes over the observing region A can be modelled by a homogeneous Poisson point process (PPP) with intensity .Sensing model is a isotropic signal source model for detecting phenomena of interest (PoI) with path loss factor  depends on distance of sensor from PoI and type of signal (chemical contamination, sound, radioactive radiation, etc.) [17].Here, we assume  is equal to 1 and sensor distance from PoI is  = 1 meter.Due to sensors are integrated with transmitters as a element of a WSN, thus, the received detection signal strength to sensor with a distance  away from the PoI is given by: where   is a Poisson r.v. with mean   = E{  } = |A|, whereas  is intensity of distribution nodes over observing a finite region of phenomenon with size |A|.We suppose the nodes sensing periodically independent condition whether PoI is absent or present.Particularly, while the PoI is present, observations are not similar between nodes belong into the Sensor n Area of binary phenomenon same group of sensors.In this case, observation independently remarks at each sensor node after proper sampling and processing is given by where  = 1, 2, . . .,   ,   is an independent observation Gaussian distribution noise with zero-mean and variance  2  .(  ) is the received signal strength at the th node with a distance   far from the PoI given by (1).Thus, problem status could be defined as follows: Information is gathered from observers of PoI, located in center region  (environment of observed PoI); hence, equal probability is assumed in term of present or absent PoI, where  0 = P{ =  0 }, P{⋅} being the probability of a given PoI.

Distributed Detection in Parallel Fusion Architecture.
Sensor nodes are organized within Parallel Fusion Architecture (PFA) which is represented on Figure 1.Each sensor independently detects the event under observation and generates information and sends to FC through an ideal communication link.Information could be sequence of bits as symbol of present or absent PoI.According to (2) and (3), sensors send 1 bit unit information to FC for decision making.A basic equation derived for received sensor observation signal at the FC from the th sensor node is given by: where   = √    and   is a channel noise modeled zero-mean Gaussian distribution with variance  0 /2 and across the nodes there is independent identical distribution (i.i.d).  is transmission energy per bit and  is up-link path loss coefficient between sensor node and FC.Assume  is identical for all nodes.The   is quantized local decision for observation of an event and characterized with two levels of unit function as follow: whereas H(  ) is the decision that made at the th node [17].The FC would be synchronized with whole nodes in the region A because of FC sends a beacon periodically when we want to retrieve observation data.All nodes exactly trigger and send observing data to corresponding fusion node at region A. With hypothesis ideal communication channels, decision is made at FC with Likelihood Raito Test (LRT) level of received signal by comparison an optimized common threshold value which denotes by .Threshold level could be adapted and trained during detection period according to level of transmission signal power.
Area of binary phenomenon

LRT with Neyman-Pearson Hypothesis Testing.
Here, observing signal received to fusion node might be affected by many factors in an unforeseen manner, hence, the decisionmaking would be doing necessarily statistical.This formulates with a decision rule based on optimality criterion.Normally, optimal criteria are using three major methods, the Bayes risk criterion, the min-max criterion, and the Neyman-Pearson (NP) criterion.LRT is performed regarding NP criterion.Under NP criterion, the optimal decision rule derives from an LRT choosen based on the null and alternative hypotheses conditional probabilities: whereas data vector r is given under the alternative as P{r |  1 } and data vector r under the null hypothesis as P{r |  0 }.FC decision performs based on the   received observations of nodes.The vector r denotes as a gain of received signal in ideal Binary Symmetric Channels (BSCs).This is corresponding to   specified in (5).Nevertheless, the  for simplicity is adapted with √ SNR/2 where SNR =   / 0 is received signal energy per bit per noise power spectral density, can be expressed using signal to noise (SNR), to FC from each sensor node through communication channel.The received signals vector from   sensor nodes is considered as follow: With the Bayesian approach, a priori probabilities of the absent or present hypothesis PoI are P{ 0 } and P{ 1 } at fusion center, respectively.Probability of decision error is defined at fusion center as follow:

Distributed Detection in Clustered Sensor Networks.
A network with  sensors observes a common binary phenomenon whose status is defined at (4) with  0 = P{ =  0 }, P{⋅} denotes the probability of given PoI.The  sensors might be organized into several clusters whereas number of cluster is   <  sensor nodes.Sensors belong to a cluster working as a RFD (Reduce Function Device) just communicates with corresponding FC which is a FFD (Full Function Device).Each cluster with collection of sensors is a PFA represented in Section 3.2 and Figure 2 shows   cluster-based architecture [6].The sensors are distributed in each cluster uniformly or nonuniformly.Initially, the channel between the sensors and fusion center is supposed to be an ideal communication link such as a Binary Symmetric Channels (BSCs) with probability  cross-over, memoryless communication.To continue, wireless channel also would be a non-ideal with respect to CSI specification.

Communication Channel State Information.
In this section, channel rules will be explained in interplaying with decision-making at fusion.Generated packet bits from detected event sequentially, bit to bit would be sent to fusion node through a communication channel.The impact of channel condition or channel state information (CSI) is significant on decision which would be made at fusion node.In addition to sensor observation quality, probability  Therefore, new element is taken into the account as CSI probability of channel which is denoted by  csi .Impact of  csi will investigate decision-making accuracy.Here, the sensor network is modeled with no interference impact (orthogonal transmission) because of an exact scheduling between the sensors and fusion node or AP.A beacon message transmits periodically for synchronization to each sensor node when FC and AP are ready for PoI sample reception.
3.4.1.The Rayleigh Fading.Equation ( 5) with Rayleigh fading is given by: where   is a random variable with Rayleigh distribution which is perfectly coherent demodulation and   ∈ {0, 1} is the symbol transmitted from a sensor,   is an information bit from sensor nodes [9].The total number of transmission in sensor network is  +  whereas,  is number of sensors and  is bits according to the parity-check equations of the Hamming code.The   is the energy per coded bit whereas   ≜     .  denotes the energy per bit information and   = 1/ being code rate that interpreted as a system embedding a repetition code at each sensor when  is consecutive and independent observations of the same phenomenon for a sensor network with multiple observations [18].A systematic block channel code hypothesizing that each sensor makes a single observation, by using Hamming systematic block code, generates parity bits and sends them to the FC or AP.For  =  = 4 observer sensors generate  =  −  = 3 bits according to the parity-check equations.
The total number of transmission acts in the proposed sensor network is  + .  is computed in this distributed coded scheme   = /( + ) = 4/7.Bit Error Rate (BER) with QPSK modulation at fusion node for Rayleigh fading channel is given by: where   ≜   / 0 is SNR received at Fusion node or AP [9].

Pathloss.
According to channel model distance () between transmitter and receiver (FC or AP), the received power   in dB is as follow: where   is the output power,  is the pathloss exponent which takes the rate of signal attenuation based on different environment obtains with empirical measurement [12].(0, ) is a Gaussian random variable with mean 0 and variance  (standard deviation due to multipath shadowing effects).( 0 ) is power attenuation at source with distance  0 with frequency  = /,  is velocity light and  is wavelength.Equation ( 13) is an isotropic transmission.SNR in dB( dB ) as a function of distance (meter) is: where   is noise floor, more details see [12].With substitute consequently,

Modulation and Channel
Coding.The QPSK Modulation and NRZ (non-return zero) channel coding impact, respectively, are: where   = 10  dB /10 and   is noise bandwidth and  is bit data rate with channel coding given by, where  is Preamble length,  is frame length, for more details see [12].Rewriting (12) with channel state probability for QPSK modulation and NRZ channel coding we get Probability of decision error   at Fusion or AP given in [6,9] and updated with  Rayleigh csi is where , and ℓ = {1, . . .,   }.

Medium Access Control Role on Clustered Network.
Basically, Markov chain and performance metric expression proposed in [2,14] are considered with fusion and clustered network (also see [19]).Three major parameters which reformed into scenario are the probability of a node attempts a first carrier sensing (CCA1) in randomly chosen time slot is denoted with  and given by where approximation of state probability is b0,0,0 ≈ and,   , probability of transmitted packet encounter collision when  is number of whole nodes, is given by also,  0 =  0  0 /(1 −  0 ) whereas,  0 is the idle state length without generating packets and,  0 is the probability of going back to the idle state.Consider The busy channel probabilities (CCA1) and (CCA2) are , , respectively, given as follows: where with While the Markov chain just declared the probability of collision   as cause of loss, we bring  csi which is derived in (18), into the account as another possibility of loss due to different SNR, modulation, and channel coding.Probability of failure is defined as where   is given in (23) as probability of packet collision.
Cluster network could be modeled with binominal random variable with independent th clusters  ()  , where  = 1, . . .,   ,  ()   is referring to cluster size, denotes a probability  mac ( ()   ) corresponding to th cluster.Performance metric expression that has been extracted from Markov model could be updated according to our assumptions: where I denotes possible variable which could be computed by Using Markov chain performance metric equations, we will be obtaining the following.

Reliability.
The probability of successful delivery of packets R as a clustering topology network, regarding reliability in [2], (31), and (30) redefining the probability of successful delivery of packets majority sensors per cluster which satisfy majority-like fusion strategy, is: where  = ⌊ (ℓ)  /2⌋ + 1, ℓ = {1, . . .,   }.Two-level fusion at FC and AP,   ℓ  is given as the probability of successful delivery distributed sensors in first level fusion; the probability of successful delivery FC to AP has similarity by assuming as a cluster with   sensors for second level fusion.Hence, reliability equation for both levels of fusion at FC and AP is remarked with R  given by, where  ()  is obtained from ( 30) and (31) with  ()  =   for second level.

Average Delay.
It is noted that communication delays can deteriorate the performance of the network and even can destabilize the systems when they are not considered in the design (see [20,21]).Therefore, the average delay for clustering with two-level fusion is defined as average delay of successfully received packet as the time interval from the instant the packet is at the head of its MAC queue and ready to be transmitted, until the transmission is successful and the ACK is received from both level of fusion nodes, respectively.According to [2], in framework except the constants (frame length, Ack length, etc.), MAC parameters have only two terms, (  |   ) and P(  |   ) that could be computed based on (30) and (31).However, initially , ,  should be calculated with respect to a given topology at clusters and also , , and   with term (29).Obviously, MAC parameters are similar for all equations with optimal tune.Framework concerning majority-like mechanism should be taken into account when encountered with  number sensors in original Markov chain equations that are replaced by ⌊  /2⌋ + 1, . . .,   for each cluster by corresponding sensors.So far, average delay is described for first level fusion of each cluster separately.For second fusion level, it is acting as a cluster with   sensors.Average delay of whole network is proposed by: where, }, first term is average delay for which packets arriving for first level fusion at FC clusters head, and second term for which packets arriving second level fusion at AP.Because of synchronized network, transmission happens at the same time and concurrently; hence, Max and Min are computed regarding to cluster size and parameters.

Network Aggregate Throughput.
Network aggregate throughput would be computed for minimum effective number of nodes each cluster network with two-level fusion and data rate (bps) is given by: where ℎ = ∑   ℓ=1 ⌊ (ℓ)  /2⌋ + 1, R  is computed at (33) and  = 80 bit/0.32ms is a normalization constant to convert to bps.

Average Power Consumption.
The average power consumption equations are proposed in [2,14] taken into consideration by the clustering framework with two-level fusion.Constant values given in Table 2 are used for first level fusion; however, for second level fusion they are valid except   ≈ 0 because of assuming fusion center does not have ideal state at second level, also hypothesis  sc sensing power constant at sensor is corresponding with power of decision-making at fusion node and assumed same computation term.

Simulation Results
This section represents the results of simulation based on problem framework.Basically, simulations are figured out   with "32" nodes as detector of an event of interest, each node generates high traffic data rates.Performance metric is evaluated with probability of decision error and developed equations of Markov model.Rest of section is organized into two subsections based on those evaluations.

Probability of Decision Error.
Simulation results shows for evaluating probability of decision error in fusion center based on described framework.Probability of decision error is considered at fusion node with respect to clustering topology as long as presence of uniform and non-uniform distributions of "32" sensors.Figure 4 shows the probability of decision error with presence of  csi and fading effect.Impact of  csi and fading effect are measured by attenuation on level of signal to change probability in order to increscent decision error due to channel influence.According to literatures of Monte Carlo simulation of corresponding expressions in given framework has confirmed our simulation.MAC and PHY parameters values used for  csi are shown in Table 3.

Performance Metric Evaluation Based on Markov Chain.
Impact of MAC appraised on proposed framework.Simulation of performance metric equations is carried out with MAC and PHY-layers parameters denoted in Tables 3 and 4.

Reliability.
Reliability is obtained for non-uniform and uniform topology which is supposed to be with different   probability of decision error   also proving this improvement at FC.However, packet failure is increased versus packet generation rate; therefore, we can expect high packet generation rate more effective than increment of SNR ratio in failure term.

Average Delay.
Measurement of average delay is explained in framework.Simulation performs with high traffic regime with two SNRs 3 dB and 12 dB, see Figure 7. Important issue here is synchronizing between nodes by specifying a time slot from FC to nodes for retrieve data.Obviously, this time slot is corresponding to size of each cluster, therefor, time slot for cluster with 8 sensors is four    times greater than time slot for cluster with 2 sensors because of preventing collision in each cluster during transmitting and each node of cluster has its own time slot to send.Clusters are independent from each other and transmit in their appropriated bandwidth.IEEE 802.15.4 has 16 channels in 2.4 GHz, based on simulation with maximum 4 clusters there is not any constraint in bandwidth scheduling; hence, each cluster works in a unique bandwidth.Slotted Markov chain model specification satisfies the condition.Non-clustering topology has more average delay.That delay is imaginable because time slot scheduling scenario for "32" nodes need longer time slot length.Uniformed distribution 8.8.8.8 needs a time slot with 8 portions at each cluster for retrieving data process.Clusters that have more nodes need at least a time slot longer than with 8 portions.Basically, effect of number sensors on  and  and  are important exact contribution of less sensors causes increasing probability of access channel and directly reduces delays.

Average Power
Consumption.An increment of mean power consumption with higher data generation rate obviously is illustrated in Figure 8. Basically, data transmission consumes more power rather than computational matter in sensor module.Nevertheless, number of sensors and fusion level have critical roles to achieve power consumption.Topology without clustering "32" sensors contribute in decision-making in fusion node coordinates at least half plus one received bits stream signal.Average power is increased with 12 dB signal to noise ratio due to transmission power consumption,   .While power consumption is a very critical issue to wireless sensor network, increment of power consumption is unwilling with respect to restriction on battery capability.On the other view, preciseness of packet receipt sometimes has privilege to power consumption.

Network Aggregate Throughput.
Network aggregate throughput is shown in Figure 9 as function of data generation rate with two SNR ratios.Throughput relation with reliability is explained in framework description.All issues represent improvement in higher signal to noise ratio.Throughput reduction happens when data generation rate is up to 900 bits per seconds in each node.

Conclusion
This paper considered a distributed detection in cluster sensor network with fusion node as a decision maker head of each cluster.We utilized a Markov chain model for evaluation network performance.Generally speaking, network clusterbased topology with data fusion has better performance with aim of data accuracy.Presence of clustering with balance distribution of sensors is acting more efficiently than nonuniform clustering with more number of distributed sensors.Number of distribution sensors directly impacts average delay in clusters; hence, a topology should be selected for less delay achievement.Throughput has better outcome in cluster-based with balance distribution sensors.Power consumption has been acting better in uniformly distributed topology instead of non-uniformly as well as clustering and non-clustering.The main reason of this difference between sensor arrangements is scheduling and timing issue on network.Those issues influence directly on average delay and power consumption.However, it can affect packet failure and also reliability of system.
Based on the results in the paper, interesting future research may be prospective as follows: (1) optimized sensor arrangement in cluster and network state estimation could be considered; (2) fault detection and time delays in the network with Markovian jump systems under partially known
(  ) at FC completely is related to channel condition and Received Signal Strength Indication (RSSI).

Figure 3 :
Figure 3: Probability of decision error as a function of SNR,  = 32 sensors with AWGN.

Figure 7 :
Figure 7: Average delay for two-level decision.

Figure 8 :
Figure 8: Average power consumption for two-level decision.
Model.Decision is made at fusion node and carries out with majority-like mechanism.In some literature this method is called consensus flooding or voting mechanism.Basically, this mechanism is based on majority similar received signal from sensors on the same cluster and event under observe in precise time.According to Figure2two-level fusion is shown; in first level, each cluster contains   distributed sensors uniformly and   is number of clusters, thus,  =   ×   is number of all sensors in network.=[/2]+ 1 is acceptable floor of majority-like for first level of fusion.In second level, decision-making is performed at access Point (AP) similarly with assuming FCs as   sensors.Obviously, AP accepts mechanism with at least   = [  /2]+1 majority-likes.Non-uniform distribution of sensors is defined as unequal number of sensors for each cluster.It denotes clusters size vector by  ≜ {(1) , (2) , ...,  (  )  }, where () is the number sensors in the th cluster ( = 1, 2, . . .,   ) and ∑

Table 1 :
Possible configuration of c , for   = 4 clusters.

Table 2 :
Power consumption of different operation modes.
According to various sensors distribution, Figure3is shown that non-clustering is worst case with respect to our scenario; the decision is made at fusion based on vector received signals on majority-like strategy.Hence, in case of non-clustering at least 17 sensors similar to record as correct decision should be received but for clustering this limitation reduces to [  /2] + 1. Number of sensors at each cluster for example in 16.8.4.4 design by 4 clusters have 16, 8, 4, 4 sensors at each cluster; therefore, fusion node at head of clusters

Table 3 :
Parameters value for physical layer.

Table 4 :
Parameters value for MAC layer.
to 12 dB in solid line by comparison with 3 dB in dots line.Reliability is enhanced in order to increment signal to noise ratio.Direct relation between probability of success packet reception or reliability, with probability of packet failure shown in Figure6, has consequent improvement on reliability.Increasing SNR from 3 dB to 12 dB causes less failure packet reception at fusion node.Result represents in