An Enhanced Distributed Scheme for WSNs

This paper investigates data processing schemes that define how decision making is distributed, which affects system accuracy and energy consumption. There exist two typical schemes: centralized and distributed. In a centralized scheme, nodes collect samples and send them to a “fusion center.” This scheme provides optimal decision accuracy; however, it consumes considerable energy. In contrast, distributed schemes allow nodes to make local 1-bit decisions, which are sent to the fusion center to make the final decision. In a hybrid scheme, the network specifies the level of accuracy required for the whole system. This can be achieved by manipulating the scheme to work interchangeably as centralized or distributed. Most of the energy consumed is in the transmission process; therefore, this paper proposes an energy-saving hybrid scheme that focuses on optimizing transmission energy. In this proposed scheme, each node is able to alternate between the centralized and distributed schemes according to its location and path length. To validate the proposed approach, it is simulated and the results are compared with the hybrid scheme.


Introduction
The main goal of WSNs is to detect certain events in the environment. It is important to achieve maximum detection accuracy and to minimize false alarms. At the same time, the availability of WSN resources and accessibility limitations should be considered. The solution is a trade-off between two factors: accuracy and energy efficiency. A tremendous societal benefit is achieved when sensing capabilities are integrated into available devices, machines, and environments. They can help to avoid infrastructure failure disasters, protect precious natural resources, enhance security, and enable new "smart" applications such as context-aware systems and home technologies. Wireless sensor networks are based on numerous advanced technologies such as very large scale integration (VLSI), microelectromechanical systems (MEMS), and wireless communications. The development of these technologies is contributing to a wider application of WSNs. For example, with the enhancement of MEMS technology, sensors are becoming smaller, and developments in semiconductor technologies are producing smaller microprocessors with higher processing capacities. The improvement of computing and sensing technologies is enabling the development of flexible WSNs, which can be widely applied [1,2].
Monitoring environmental changes and detecting specified events is the main function of sensor networks. This function is achieved through four basic components of a sensor network [3]: distributed or localized sensors; an interconnecting network (most often wireless based); a central point of information clustering; and a set of computing resources at the central point or network core to handle data collecting, event trending, status querying, and data mining. WSNs use centralized fusion centers (sinks), which work as cluster gateways, and many distributed sensors (motes) [4]. These sensors sense and send observations to the centralized unit. The centralized unit decides whether an event has occurred.
Most of the power consumed in a network is used in processing, transmitting, and sensing. Until now, the limited power resources of sensors have been the main constraint in WSNs. It is very important to reduce sensor power consumption while maintaining acceptable detection accuracy according to application requirements. Many researchers have focused on the above three processes [5], attempting to enhance the power consumption efficiency of the sensors for each of them. Some schemes enhance the operating system and reduce the required processing cycles; other schemes optimize the RF part, including collision space and noise filtering. This paper focuses on schemes that address decision processing and transmission. These schemes define how to collect observations (sampling rate), where to process them (locally or centrally), and which data to send from the nodes to the fusion center, all of which affect the degree of data loss and the accuracy [6]. The main goal of this paper is to produce an optimum controlling scheme that extends network/sensor lifetime by reducing power consumption and maximizing network efficiency and accuracy. Our proposed scheme balances the reduction of data transmission against processing by distributing these two activities among the nodes and the central unit (sink).
Existing wireless communication protocols are shown in The main WSN solutions such as ZigBee, WirelessHART, 6LoWPAN, and ISA-100 are based on this standard, which offers a complete networking solution by developing the remaining upper communication layers.
ZigBee. ZigBee [7-9] is a simple, low-cost, low-power wireless technology used in LR-PAN embedded applications. ZigBee provides the network layer and the framework for the application layer. The MAC sublayer and lower layers are based on the IEEE 802.15.4 standard (see Figure 1).
The ZigBee network layer supports star, tree, and mesh topologies. It is utilized in three types of devices: ZigBee coordinators, ZigBee routers, and end devices.
WirelessHART. Based on the IEEE 802.15.4 standard and the highway addressable remote transducer (HART) foundation, WirelessHART [10-12] was developed for low-power 2.4 GHz operation. Similar to ZigBee, WirelessHART specifies four principal devices (Figure 2): network manager, gateways, field devices, and handhelds, as well as adapters, which allow existing HART field devices to be integrated into the network.

Related Work
As mentioned earlier, each node has the responsibility of collecting, processing, transmitting, and receiving data. The common functions of every detection scheme are that (1) nodes collect observations and (2) the fusion center (sink) takes the final decision.
There are two traditional detection schemes: the centralized detection scheme and the distributed detection scheme, the methodologies of which will be covered in detail. In our approach, we use a tree topology where nodes are "independently and identically distributed" (i.i.d.). They are connected to the fusion center (FC) through a multihop route, where nodes also act as hops that receive data from child nodes and forward it to the FC without any processing, encryption, or encoding. The focus here is only on accuracy and energy consumption, and it is assumed that lower layers are working perfectly and that there are no efficiency problems caused by RF or by packet collisions; that is, there is no data retransmission.

Bayesian Decision Theory
2.1.1. Binomial Distribution. A Bernoulli trial can result in a success with probability $p$ and a failure with probability $q = 1 - p$. Equation (1) gives the probability distribution of the binomial random variable $X$, the number of successes in $n$ independent trials:

$$b(x; n, p) = \binom{n}{x} p^{x} q^{\,n-x}, \qquad x = 0, 1, \ldots, n. \tag{1}$$

The probability that two events, $A$ and $B$, will both occur is $P(A \cap B) = P(B \cap A) = P(A)P(B \mid A) = P(B)P(A \mid B)$. From this formula the main equation (2) of Bayes' rule can be derived:

$$P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B)}. \tag{2}$$

From Figure 3 we find $P(A \cap B) = P(A)P(B \mid A) = (2/3)(3/50) = 0.04$. The distribution of $H$ given $D$, which is called the posterior distribution, where $P(D)$ is the marginal distribution of $D$, is given by

$$P(H \mid D) = \frac{P(D \mid H)\,P(H)}{P(D)}.$$

The binary hypothesis is defined by $H$, which represents event occurrence: $H = H_1$ if an event happens and $H = H_0$ if not. $H$ is the actual event status; $\hat{H}$ is the final decision; $K$ is the node count; $x$ is the collected samples; $T$ is the sample count at each node; and $h$ is the node path length. Each sample $x_{kj}$ is collected at node $k$, where $k$ varies from 1 to $K$ and $j$ from 1 to $T$.

Bayes Decision. Choose "event happened" if $P(H_1 \mid x) \ge P(H_0 \mid x)$; otherwise choose "not happened."
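The decision rule above can be illustrated with a minimal Python sketch. It assumes, for illustration only, that a node observes a count of 1s among its samples and that each sample equals 1 with probability `p1` under $H_1$ and `p0` under $H_0$; the function names, `p1`, `p0`, and `prior_h1` are hypothetical and are not taken from the paper.

```python
from math import comb


def binomial_pmf(x: int, n: int, p: float) -> float:
    """Probability of exactly x successes in n independent Bernoulli(p) trials."""
    return comb(n, x) * p**x * (1 - p) ** (n - x)


def posterior_h1(x: int, n: int, p1: float, p0: float, prior_h1: float) -> float:
    """P(H1 | x) via Bayes' rule; p1 and p0 are assumed per-sample probabilities."""
    joint_h1 = binomial_pmf(x, n, p1) * prior_h1
    joint_h0 = binomial_pmf(x, n, p0) * (1 - prior_h1)
    return joint_h1 / (joint_h1 + joint_h0)  # denominator is the marginal P(x)


def bayes_decide(x: int, n: int, p1: float, p0: float, prior_h1: float = 0.5) -> int:
    """Return 1 (event happened) if P(H1 | x) >= P(H0 | x), else 0."""
    return 1 if posterior_h1(x, n, p1, p0, prior_h1) >= 0.5 else 0


if __name__ == "__main__":
    # Illustrative values only: 4 ones out of 5 samples.
    print(bayes_decide(4, 5, p1=0.8, p0=0.2))
```

Because the two posteriors sum to one, comparing $P(H_1 \mid x)$ with 0.5 is the same test as $P(H_1 \mid x) \ge P(H_0 \mid x)$.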

Centralized Detection Schemes.
In centralized detection schemes the network has $K$ nodes. These nodes collect $T$ samples of observations from the environment every specific period and send all $T$ samples together at the end of the period. At the fusion center, $x = [K \times T]$ samples are received.
According to Bayes' rule, the final decision is made at the fusion center from all received samples, and the probability of error follows from that decision. To calculate power consumption for the whole network in (6), the following quantities are used: $E$ = total energy, $E_t$ = transmission energy, $E_r$ = receiving energy, and $E_p$ = processing energy.
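As a rough illustration of the centralized scheme, the following Python sketch pools all $K \times T$ binary samples at the fusion center and applies the Bayes test in log-likelihood-ratio form. The i.i.d. binary sample model and the names `centralized_decision`, `p1`, `p0`, and `prior_h1` are assumptions made for this sketch, not definitions from the paper.

```python
from math import log


def centralized_decision(samples, p1, p0, prior_h1=0.5):
    """Fusion-center decision from raw samples.

    samples: list of K lists, each holding T binary observations (0 or 1).
    p1, p0:  assumed P(sample = 1 | H1) and P(sample = 1 | H0).
    Returns 1 if P(H1 | x) >= P(H0 | x), else 0.
    """
    ones = sum(sum(node) for node in samples)
    total = sum(len(node) for node in samples)
    zeros = total - ones
    # Log posterior ratio = log likelihood ratio + log prior ratio.
    llr = (ones * log(p1 / p0)
           + zeros * log((1 - p1) / (1 - p0))
           + log(prior_h1 / (1 - prior_h1)))
    return 1 if llr >= 0 else 0


def samples_sent_centralized(k, t):
    """Every node forwards all T samples, so the FC receives K * T samples."""
    return k * t


if __name__ == "__main__":
    observations = [[1, 1, 0], [1, 0, 1]]  # K = 2 nodes, T = 3 samples each
    print(centralized_decision(observations, p1=0.8, p0=0.2))
    print(samples_sent_centralized(2, 3))
```

The sample count returned by `samples_sent_centralized` is what drives the transmission cost of this scheme.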

Distributed Detection Scheme.
In this scheme nodes collect data, make local decisions according to these observations, and report the presence of the event as a 1-bit result. This result is sent to the fusion center, which makes a final decision from the 1-bit results collected from all nodes. In this scheme some data accuracy is lost between the nodes and the fusion center.
We assume $K$ nodes; these nodes collect $T$ samples of environmental observations ($n_k$ = number of 1s for node $k$) and send a 1-bit local decision every specific period.
At the fusion center, $x = [1 \times K]$ samples are received.
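A minimal Python sketch of the two-stage distributed decision is given here, assuming the same i.i.d. binary sample model at every node. The helper names, `p1`, `p0`, and `prior_h1` are illustrative assumptions; the sketch also assumes the resulting local detection and false-alarm rates lie strictly between 0 and 1.

```python
from math import comb, log


def _llr_decision(ones, total, p_one_h1, p_one_h0, prior_h1):
    """Bayes test on a count of 1s, written as a log posterior ratio."""
    llr = (ones * log(p_one_h1 / p_one_h0)
           + (total - ones) * log((1 - p_one_h1) / (1 - p_one_h0))
           + log(prior_h1 / (1 - prior_h1)))
    return 1 if llr >= 0 else 0


def local_decision(n_k, t, p1, p0, prior_h1=0.5):
    """1-bit decision at node k from n_k ones out of t samples."""
    return _llr_decision(n_k, t, p1, p0, prior_h1)


def local_rates(t, p1, p0, prior_h1=0.5):
    """Probability that a node reports 1 under H1 (detection) and H0 (false alarm)."""
    def rate(p):
        return sum(comb(t, n) * p**n * (1 - p) ** (t - n)
                   for n in range(t + 1)
                   if local_decision(n, t, p1, p0, prior_h1) == 1)
    return rate(p1), rate(p0)


def fusion_decision(bits, t, p1, p0, prior_h1=0.5):
    """Final decision at the FC from the K one-bit local decisions."""
    p_d, p_f = local_rates(t, p1, p0, prior_h1)
    return _llr_decision(sum(bits), len(bits), p_d, p_f, prior_h1)


if __name__ == "__main__":
    print(fusion_decision([1, 1, 0, 1], t=5, p1=0.8, p0=0.2))
```

Only one bit per node reaches the fusion center, which is why the scheme saves transmission energy at the cost of the detail contained in the raw samples.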
According to Bayes' rule, each node computes a local decision from its own observations; from (1) and (4), the local decision can be expressed in terms of the number of 1s, $n_k$, observed at node $k$. For the final decision, the fusion center counts the local decisions that are equal to 1 and, again from (1) and (4), applies the same rule to that count. The probability of error follows from the final decision rule. To calculate power consumption for the whole network, (12) can be used, where $E$ = total energy, $E_t$ = transmission energy, $E_r$ = receiving energy, and $E_p$ = processing energy.

Hybrid Detection Scheme.
Neither the centralized nor the distributed detection scheme is flexible enough for designers to choose between detection accuracy and energy consumption. Yu et al. [16] proposed a hybrid scheme that balances detection accuracy and total energy consumption. According to a defined level of accuracy, the nodes vary between sending all collected data and sending a 1-bit result.
In this scheme, assume there are $K$ nodes. These nodes collect $T$ samples of environmental observations ($n_k$ = number of 1s for node $k$). There are lower and upper bounds $N_0$ and $N_1$, where $0 \le N_0 < N_1 \le T$. The node sends 0 if the number of 1s is less than or equal to $N_0$, in other words, if the number of 0s collected is greater than or equal to $T - N_0$. The node sends 1 if the number of 1s is greater than or equal to $N_1$. Otherwise all the collected data are sent, as shown in (17):

$$u_k = \begin{cases} 0, & n_k \le N_0, \\ 1, & n_k \ge N_1, \\ \{x_{k1}, \ldots, x_{kT}\}, & N_0 < n_k < N_1. \end{cases} \tag{17}$$

From (12), assume that out of $K$ sensor nodes, $K_1$ nodes send 1s, $K_0$ nodes send 0s, and $K - K_1 - K_0$ nodes send all their observations. The total data sent, $\Omega$, will be $\Omega = \{1, \ldots, 1;\ x_1, \ldots, x_{K - K_1 - K_0};\ 0, \ldots, 0\}$.
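The three-way node rule can be sketched in Python as follows. The boundary convention used here (send 0 when the count of 1s is at most $N_0$, send 1 when it is at least $N_1$) is an assumption chosen to agree with the restatement in terms of 0s and with the later remark that $N_1 - N_0 \le 1$ makes a node purely distributed; the function name is hypothetical.

```python
def hybrid_node_output(samples, n0, n1):
    """Return what a node transmits under the hybrid scheme.

    samples: the T binary observations collected by the node.
    n0, n1:  lower and upper bounds with 0 <= n0 < n1 <= T.
    Returns [0] or [1] for a 1-bit local decision, or the full
    sample list when the count of 1s falls strictly between the bounds.
    """
    t = len(samples)
    if not 0 <= n0 < n1 <= t:
        raise ValueError("bounds must satisfy 0 <= n0 < n1 <= T")
    ones = sum(samples)
    if ones <= n0:          # assumed boundary convention, see lead-in
        return [0]
    if ones >= n1:
        return [1]
    return list(samples)    # ambiguous case: behave as centralized


if __name__ == "__main__":
    print(hybrid_node_output([0, 0, 0, 1, 0], n0=1, n1=4))  # 1-bit result
    print(hybrid_node_output([1, 0, 1, 0, 0], n0=1, n1=4))  # all samples
```

Widening the gap between the bounds makes the "send everything" branch more likely, which moves the node toward centralized behavior.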
From (12) we can derive the corresponding probability. Following from (16) and (17), the final probability of error can be determined. To calculate power consumption for the whole network, (16) can be used, where $E$ is the total energy.

System Model (Telos)
In order to evaluate and develop our scheme, the behavior of WSN sensors should be understood, and a power consumption model should be defined [17]. For this purpose the typical operating conditions of TelosB (Figure 4) have been selected as a basis for our power model [18,19]. We use the datasheet of Telos working at 250 kbps and 2.4 GHz (an ultralow-power IEEE 802.15.4 compliant wireless sensor module), which is one of the most widely known WSN nodes, and we use the power details from Table 2.
From the power consumption model in Figure 5, it can be concluded that transmission is responsible for a large share of node power consumption. So far, only the behavior of a single node has been discussed. Now the power consumption of all the nodes in the network should be addressed.
The overall energy consumed while sending a single bit from the source (sensor node) to the destination (fusion center) over $h$ hops equals the energy required to send and receive this bit at each of the $h$ hops, where $E$ = total energy, $E_t$ = sending energy, and $E_r$ = receiving energy:

$$E = h\,(E_t + E_r).$$

Each hop therefore costs $E_t + E_r$, so, for the special case where a node is directly connected to the FC, energy used = $E_t + E_r$.
Thus, the length of a route affects the overall power consumed in the network and the network lifetime as well.
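The dependence of energy on route length can be sketched in Python as below. The sketch assumes each hop costs one transmission plus one reception per bit, so a node directly connected to the FC has a path length of 1; the function names and the numeric values in the example are placeholders, not TelosB datasheet figures.

```python
def path_energy_per_bit(hops, e_tx, e_rx):
    """Energy to move one bit over `hops` links (one send + one receive per link)."""
    if hops < 1:
        raise ValueError("a node needs at least one hop to reach the FC")
    return hops * (e_tx + e_rx)


def network_energy(path_lengths, bits_per_node, e_tx, e_rx):
    """Total transmission/reception energy for all nodes.

    path_lengths:  hop count from each node to the FC.
    bits_per_node: number of bits each node sends in one period.
    """
    return sum(bits * path_energy_per_bit(hops, e_tx, e_rx)
               for hops, bits in zip(path_lengths, bits_per_node))


if __name__ == "__main__":
    # Placeholder per-bit energies in arbitrary units.
    print(path_energy_per_bit(1, e_tx=2.0, e_rx=3.0))
    print(network_energy([1, 2, 3], [1, 1, 1], e_tx=2.0, e_rx=3.0))
```

Under this model a node three hops away pays three times the per-bit cost of a directly connected node, which is the motivation for letting distant nodes send fewer bits.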

Power Calculation.
We use the TelosB model to calculate power consumption [20,21].

Simulation Network Design.
In order to derive more realistic data from our simulation model, we apply TelosB properties in our network, with maximum coverage = 100 m. A total of $K$ sensors are randomly deployed in the region of interest (ROI), which is a square area of $b^2$. We have selected $b$ to be equal to 300 m; hence the ROI = 0.09 km$^2$. The locations of the sensors are unknown before deployment time. However, it is known that all sensors are i.i.d., and every sensor location $(x, y)$ follows a uniform distribution in the ROI.
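The deployment described above can be sketched in Python as follows. The 300 m side and 100 m coverage come from the text; the function names, the use of a seeded generator, and the straight-line distance test for radio reachability are assumptions made for illustration.

```python
import math
import random


def deploy_nodes(k, side_m=300.0, seed=None):
    """Place k sensors uniformly at random in a square ROI of side `side_m` metres."""
    rng = random.Random(seed)
    return [(rng.uniform(0.0, side_m), rng.uniform(0.0, side_m)) for _ in range(k)]


def in_range(a, b, coverage_m=100.0):
    """True if two positions are within the assumed maximum radio coverage."""
    return math.dist(a, b) <= coverage_m


def roi_area_km2(side_m=300.0):
    """Area of the square ROI in square kilometres."""
    return (side_m ** 2) / 1_000_000.0


if __name__ == "__main__":
    nodes = deploy_nodes(50, seed=1)
    print(len(nodes), roi_area_km2())
```

Node pairs that satisfy `in_range` are the candidate links from which multihop routes to the FC can later be built.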

Enhanced Hybrid Detection Scheme
The hybrid scheme adjusts the behavior of the network to vary between centralized and distributed schemes and establishes the $N_0$ and $N_1$ parameters to define that behavior, where $n$ is the number of observations equal to 1. We propose to enhance the hybrid scheme by choosing the $N_0$, $N_1$ parameters dynamically instead of keeping them static.
In the hybrid scheme, if $n$ is between $N_0$ and $N_1$ the node acts as centralized; otherwise it acts as distributed, as shown in Figure 6. For the special case $N_1 - N_0 \le 1$ the node always acts as distributed, and the node acts more centralized as $N_1 - N_0$ becomes larger (until $N_0 = 0$ and $N_1 = \mathrm{MAX}$).
Mobile Information Systems 7
However, $N_0$ and $N_1$ can be made dynamic, with a preference for a distributed orientation for nodes with longer route paths; the remaining nodes can remain more centralized, as in the original hybrid scheme, according to the requirements of the application.
For every sensor $k$, varied from 1 to $K$, we assign a specific $\{N_0, N_1\}$. These sensors are classified according to the route path. Since $0 \le N_0 < N_1 \le T$, we have a finite number of combinations for $N_0$ and $N_1$. These combinations, or groups of them, should be mapped to all sensors, depending on sensor path weight.
The above mapping is a normal many-to-one mapping problem, which in our case can be solved experimentally by testing it in different deployed wireless sensor applications.

Network Deployment.
Similar to most wireless sensor networks, the FC is deployed in an accessible location, while the sensor nodes are deployed randomly in the targeted area. This can be considered as a mesh network. After deploying the nodes, discovering the network is the first action to be taken. The FC broadcasts discovery packets to all nodes in the covered area. These nodes then broadcast to further nodes until all the nodes are covered and the routing paths, next hops, and node configurations are defined.
To simplify the problem, nodes discover paths to the FC based on SPF (shortest path first) and select the subsequent hops to which received packets are forwarded. This information is crucial.
Since the nodes have already collected the broadcast packets, which include source, destination, and path, the nodes are able to select their routes based on the path length.
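The discovery phase can be approximated by a breadth-first search from the FC, which yields the shortest hop count and next hop for every reachable node. This Python sketch assumes the link graph is already known as an adjacency mapping; the function name, the node labels, and the data layout are hypothetical, and real discovery would happen through the broadcast packets described above.

```python
from collections import deque


def discover_paths(adjacency, fc):
    """Breadth-first discovery from the fusion center.

    adjacency: dict mapping each node to the nodes within radio range.
    Returns (hops, next_hop): hops[node] is the shortest path length to the
    FC, and next_hop[node] is the neighbor to forward packets to.
    """
    hops = {fc: 0}
    next_hop = {fc: None}
    queue = deque([fc])
    while queue:
        current = queue.popleft()
        for neighbor in adjacency.get(current, ()):
            if neighbor not in hops:           # first visit is the shortest path
                hops[neighbor] = hops[current] + 1
                next_hop[neighbor] = current   # forward toward the FC via `current`
                queue.append(neighbor)
    return hops, next_hop


if __name__ == "__main__":
    links = {"FC": ["a", "b"], "a": ["FC", "c"], "b": ["FC"], "c": ["a"]}
    hops, next_hop = discover_paths(links, "FC")
    print(hops["c"], next_hop["c"])
```

The resulting hop count is the path length that each node later uses to pick its $(N_0, N_1)$ pair.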

Nodes Configuration.
For each node there are fixed configurations such as Tx/Rx power, frequencies, and calculation process. Some parameters need to be configured by the manufacturer or the user in order to optimize network efficiency. On the other hand, some parameters may be configured by the network itself.
In our scheme, every node has certain parameters that are defined during the manufacturing phase or by the developer during network deployment, that is, sampling rate ($T$), $\mathrm{CP}_{\min}$, and $\mathrm{CP}_{\max}$.
CP is the probability that the system works as centralized, which can be calculated from $N_0$ and $N_1$. In our scheme each node selects $N_0$ and $N_1$ depending on its path length. $N_0$ and $N_1$ should generate a CP where $\mathrm{CP}_{\min} \le \mathrm{CP}_{N_0, N_1} \le \mathrm{CP}_{\max}$.
From those inputs $\{T, N_0, N_1, \max(\text{path length}), \mathrm{CP}_{\min}, \mathrm{CP}_{\max}\}$ each node is able to calculate the $N_0$ and $N_1$ that satisfy application requirements.
From $T$ we can find the number of $(N_0, N_1)$ combinations, $2^{T}$, which is our ROT. Every node should be able to map the proper $(N_0, N_1)$ from both $(\mathrm{CP}_{\min}, \mathrm{CP}_{\max})$ and $\max(\text{path length})$, as can be seen in Figure 7. We can map these ranges by means of a simple 1-to-many mapping technique, where we index the initial range and assign the remaining ones equally to the next range, as applied in Algorithm 1.
Figure 7: Mapping $(N_0, N_1)$ combinations to available path length.

4.3. $N_0$, $N_1$ Selection. To be able to select $N_0$ and $N_1$ at each node, the problem is divided into simpler problems as follows:
(1) Define all combinations of $N_0$ and $N_1$ and calculate the equivalent CP.
(2) Define valid $N_0$ and $N_1$ based on $\mathrm{CP}_{\min}$ and $\mathrm{CP}_{\max}$.

4.3.1. All $N_0$, $N_1$ Combinations. A selection of $N_0$ and $N_1$ defines when the node acts as more centralized or more distributed. For $T$ collected samples we have $0 \le N_0 < N_1 \le T$. We can have $C$ different combinations for those $N_0$ and $N_1$, where $C = T!/((T - 2)!\,2!)$.
For each combination we can calculate its $\mathrm{CP}_{\mathrm{CENT}}$, which gives a list of $C$ CP values that should be sorted by value.
$\mathrm{CP}_{\mathrm{CENT}}$ is calculated from the cumulative distribution function (cdf) of the number of 1s among the $T$ samples. For example, for $T = 5$ and $P[x = 1] = 0.45$, we generate all possible combinations of $(N_0, N_1)$ as shown in Table 3.
First of all we find the valid $(N_0, N_1)$ combinations. The selection of $\mathrm{CP}_{\min} = 0.50$ and $\mathrm{CP}_{\max} = 0.90$ gives us 4 valid combinations (Table 4). We have here a random WSN where maximum path = 6, which means 6 different zones. Figure 8 shows an example of WSN nodes classified into zones. We define minimum $\mathrm{CP}_{\min}$ and maximum $\mathrm{CP}_{\max}$, where $\mathrm{CP}_{\max}$ is the maximum CP combination that can be assigned for zone 1, and $\mathrm{CP}_{\min}$ is the minimum CP that can be assigned to the next zones (where all CP ≤ $\mathrm{CP}_{\min}$).
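The example above can be reproduced with a short Python sketch. It assumes that a node acts as centralized exactly when its count of 1s lies strictly between the bounds, so that $\mathrm{CP} = \mathrm{cdf}(N_1 - 1) - \mathrm{cdf}(N_0)$ with a binomial cdf; this formula and the function names are assumptions of the sketch, chosen because they yield four valid combinations for $T = 5$, $P[x = 1] = 0.45$, and the range $[0.50, 0.90]$, which agrees with the count reported in the text.

```python
from math import comb


def binomial_cdf(k, t, p):
    """P(number of 1s <= k) for t i.i.d. samples with P(sample = 1) = p."""
    return sum(comb(t, x) * p**x * (1 - p) ** (t - x) for x in range(0, k + 1))


def centralized_probability(n0, n1, t, p):
    """Assumed CP: probability that the count of 1s lies strictly between n0 and n1."""
    return binomial_cdf(n1 - 1, t, p) - binomial_cdf(n0, t, p)


def valid_combinations(t, p, cp_min, cp_max):
    """All (n0, n1, CP) with cp_min <= CP <= cp_max, sorted by CP, largest first."""
    combos = []
    for n0 in range(0, t + 1):
        for n1 in range(n0 + 1, t + 1):
            cp = centralized_probability(n0, n1, t, p)
            if cp_min <= cp <= cp_max:
                combos.append((n0, n1, cp))
    return sorted(combos, key=lambda c: c[2], reverse=True)


if __name__ == "__main__":
    for n0, n1, cp in valid_combinations(5, 0.45, 0.50, 0.90):
        print(n0, n1, round(cp, 4))
```

Pairs with $N_1 - N_0 = 1$ receive a CP of zero under this convention, which matches the statement that such nodes always act as distributed.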
For every zone $z$, we assign $\mathrm{CP}(z) = \mathrm{CP}_i$ from (23), which can be used to map between zones and $(N_0, N_1)$ combinations (Table 5):

$$i = \left\lceil \frac{z}{\left\lceil \dfrac{\text{count of all zones}}{\text{count of CPs between } \mathrm{CP}_{\min} \text{ and } \mathrm{CP}_{\max}} \right\rceil} \right\rceil. \tag{23}$$

In this scheme we find that $(N_0, N_1)$ are dynamically selected by the nodes, depending on the path length, and the entire procedure described above requires minimal extra processing at the node, since the whole process is part of the network discovery phase.
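A minimal Python sketch of the zone-to-combination mapping follows. It assumes the index is obtained with two nested ceilings, $i = \lceil z / \lceil Z / M \rceil \rceil$, where $Z$ is the number of zones and $M$ the number of valid CP values, and that the valid combinations are sorted with the largest CP first so that zone 1 is the most centralized. Both the formula reading and the function names are assumptions of this sketch.

```python
from math import ceil


def cp_index_for_zone(zone, zone_count, cp_count):
    """1-based index into the sorted list of valid CP combinations for a zone."""
    if not 1 <= zone <= zone_count:
        raise ValueError("zone must be between 1 and zone_count")
    zones_per_cp = ceil(zone_count / cp_count)  # zones sharing one combination
    return ceil(zone / zones_per_cp)


def assign_combinations(zone_count, sorted_combos):
    """Map every zone to one combination from `sorted_combos` (largest CP first)."""
    return {
        zone: sorted_combos[cp_index_for_zone(zone, zone_count, len(sorted_combos)) - 1]
        for zone in range(1, zone_count + 1)
    }


if __name__ == "__main__":
    # Six zones and four valid combinations, as in the example in the text.
    print([cp_index_for_zone(z, 6, 4) for z in range(1, 7)])
```

With this reading, nodes closer to the FC receive combinations with a higher CP, and nodes on longer paths are pushed toward 1-bit reporting.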
Distributed. Similar to the centralized scheme, the most power-consuming process in the RF scheme is the transmission part (Figure 10).
Enhanced Hybrid. We define CP as the probability that a node behaves as centralized; the CP values are obtained from the possible combinations of $N_0$ and $N_1$. Figure 12 shows the power consumption of the enhanced hybrid scheme, where we assign $\mathrm{CP}_{\min} = 0.25$ and $\mathrm{CP}_{\max} = 0.70$.
All Energy Schemes. The performance of the four schemes is shown in Figure 13; the enhanced scheme outperforms the other schemes.
The performance of the four schemes is shown in Figure 15; the enhanced scheme outperforms the other schemes. In comparison to the hybrid scheme, our scheme gives a very close level of accuracy, as illustrated in Figure 15.

Conclusions
The main purpose of this paper is to enhance the event collection and detection capability of current WSN schemes. Two of the available schemes, the centralized and distributed schemes, are basic and have no flexibility. The third scheme, the hybrid scheme, uses the two previous schemes to balance accuracy and energy; nevertheless, all the nodes keep a fixed configuration with limited flexibility. In contrast, the proposed scheme is designed to be more flexible in order to balance the power consumption and the detection accuracy at the node level. Every node is flexible in deciding how to behave, that is, whether to be more centralized or more distributed.
To compare the tradeoff between accuracy and power consumption in these schemes, the model used should be realistic. This is one of the most significant weaknesses of the previous research, in which all power consumption calculations have been based solely on an assumed model. In the proposed scheme this weakness has been overcome by using a real WSN model, which produces realistic results.
As can be seen from the simulation results, our scheme saves a substantial amount of energy compared to the hybrid scheme, while retaining accuracy to almost the same degree. In addition, our scheme deals more efficiently with larger network areas and denser node deployments. Although the focus of this paper has been on reducing overall power consumption, other future enhancements are also possible, for example, enhancing overall network lifetime by improving $(N_0, N_1)$ selection techniques, where the following factors can be considered: (i) individual node power level, (ii) event occurrence probability at each node, and (iii) next-hop distance and real Tx and Rx power.

Figure 2: Elements of a typical WirelessHART installation.

Figure 4: TelosB by the University of California.

Figure 14: Comparison of three schemes in detection accuracy.

Table 4: $(N_0, N_1)$ valid combination list.
$\mathrm{CP}_{\min}$ and $\mathrm{CP}_{\max}$. All nodes are classified according to path length into zones.