A Hybrid Optimized Weighted Minimum Spanning Tree for the Shortest Intrapath Selection in Wireless Sensor Network

,


Introduction
Wireless sensor network (WSN) is a cooperative collection of sensor nodes, each having processing capability.Routing in WSN is different from conventional fixed network routing by several ways.WSNs are infrastructureless, have unreliable wireless links, contain sensor nodes that might fail, and its routing protocols face rigorous energy saving requirements [1].WSN is a distributed real-time system and many routing algorithms have been proposed in literature [2][3][4].In the earlier research for distributed systems, it was assumed that wired systems had unlimited power.They had user interfaces, had fixed resources, treated each system node as important, and were location independent.WSNs in contrast are wireless systems with limited power, are constrained in energy consumption, and are real time, with dynamically varying resources [5,6].Routing in WSN utilizing minimal energy has been proposed in the literature [7][8][9][10][11].The power management solutions, at the software level, aim at reducing communications as broadcasting or listening to messages uses up energy.Minimizing message numbers cuts costs and a good MAC protocol ensures reduced collisions and retries.Better routing minimizes the number of messages sent by the use of short paths and congestion avoidance.Factors like efficient neighbor detection, localization, time synchronization, flooding, and query dissemination reduce the number of messages and increase the life of the network.There are varied solutions for scheduling sleep/wake-up patterns [12,13] with most trying to keep up minimum nodes labeled sentries.The latter provides sensing coverage and allows others to sleep.
The clustering process divides a network into interconnected substructures, called clusters, with each cluster having many Sensor Node (SN) led by a Cluster Head (CH) which is the coordinator in this substructure [14] as seen in Figure 1.The CH is also a temporary base station which keeps in touch with other CHs.Nodes have four possible states: normal, isolated, cluster head, and gateway.Basically, nodes are in an isolated state with each maintaining the neighbor table where neighbour node information is stored.Electing the CH is the basic step in clustering.Clustering is widely applied in WSN for managing power efficiently [15][16][17].
A clustering architecture in WSN environment enables features such as network scalability, communication overhead reduction, and fault tolerance.Cluster formation benefits routing as the cluster head and cluster gateways are responsible for the intercluster routing, thus, restricting, creating, and spreading of routing information.Local changes such as nodes changing cluster are updated only in the corresponding clusters and no update is required by the whole network.This significantly reduces information stored by each mobile node.
The major problem in WSNs is experienced by the sensor nodes nearest to the base station that require transmitting more number of packets than that of faraway nodes.Generally clustering algorithms use two methods for extending the lifetime.In the first method, the cluster heads with high residual energy are selected, and in the second method, to distribute the energy utilization between nodes in all clusters, the cluster heads are rotated periodically [18].To choose the cluster heads, various techniques have been proposed in literature.
Other than clustering technique, tree based routing has been popularly used in WSN due to its energy efficiency [19][20][21].Tree based techniques use the concept of selecting a root node before data transmission.A tree-like hierarchical path of nodes is constructed to connect the nodes.The WSN nodes construct a tree which can either be Minimum Spanning Tree (MST) or be optimal tree through which data is transmitted till it reaches the sink.The root node uses tree traversal algorithm to gather data about the children nodes.There are three methods in WSN tree routing protocols [19].While the first does not create clusters the other two mix the clustering strategy with a tree routing algorithm.The latter strategy reduces low latency, while the tree routing algorithm improves energy efficiency [22,23].In cluster based routing strategy, the sensor nodes in the network divide into several clusters.Each cluster then chooses a cluster head randomly or by cluster head election algorithm.The cluster head is responsible for gathering the data from the sensor nodes in its cluster.The aggregated data collected by the cluster head is then transmitted to the sink.Generally, a MST is created to connect the nodes and this reduces energy consumption either by transmitting packets through smaller distances or by reducing number of packets transmitted or by both techniques.
A modified Kruskal's Minimum Spanning Tree (MST) search algorithm based on distributed search by hierarchical clusters was proposed to search the network for small balanced weight routing spanning trees [5].The proposed technique provided spanning trees with low maximum degree and larger diameter to balance energy consumption in WSN's routing.Based on energy matrix transmission the results proved that this approach extended WSN functional life by more than three times with respect to sensor transmission energy.
An energy efficient spanning tree (EESR) was proposed for multihop routing to increase the lifetime of the network [24].The EESR provides location of the sensor nodes and base station and produces a sequence of routing paths consisting of suitable number of rounds.The results obtained by simulation reveal that the EESR method outperforms the other existing methods in relation to increasing the lifetime of the network.
A trajectory clustering technique was proposed for the purpose of selecting the cluster heads [25].In this algorithm, the cluster heads are selected on the basis of traffic and they are periodically rotated.The cluster heads are selected using the trajectory based clustering technique and thus network lifetime is extended.
Guangyan et al. [26] proposed Dynamic Minimal Spanning Tree Routing Protocol (DMSTRP), an innovative cluster-based routing protocol, that enhanced Base Station Controlled Dynamic Clustering Protocol (BCDCP) by means of initiating MSTs rather than clubbing for the purpose of connecting nodes in clusters.When compared to LEACH and BCDCP, the DMSTRP performed well even in large network in terms of network lifetime and delay.
A distributed topology control technique was proposed in [27] to enhance energy efficiency and reduce radio interference in WSNs.Each network node makes local decisions about transmission power.These decisions conclude in a network topology preserving global connectivity.The fundamental control technique is the novel Smart Boundary Yao Gabriel Graph (SBYaoGG) and optimization ensures that all network links are symmetric and energy efficient.This technique was effective as compared to other approaches to topology control.
A combined algorithm (COM), a generalization of the MST and SPT, was proposed in [28] which dealt with the issue of executing the operation of Data Aggregation enhanced Convergecast (DAC) in an energy and latency efficient manner.The valuable portion of the total data gathered is approximated by assuming that each and every node in the network consists of a data item and a known application dependent data compression factor.
Multiple Cluster Heads Routing Protocol (MCHRP) [29] was proposed to address cluster head overload.This method improved LEACH by incorporating a decision function which is based on the cluster head's remaining energy, location, and frequency.The decision function selects the main cluster heads and the alternative cluster heads used for data acquisition, data fusion, and data transmission.
Cluster-based and Tree-based Power Efficient Data Collection and Aggregation (CTPEDCDA) protocol by Wang et al. [30] was based on clustering and MST to minimize energy consumption in WSN.MSTs are built by connecting the cluster heads to improve the transmission routing mechanism.Chhabra and Sharma [31] improved the power consumption by improving the first node death.This method combined both the cluster-based and the tree-based protocol to improve evenness of dissipated network energy.
Kumrai et al. [32] proposed evolutionary algorithm heuristically that optimizes the sensing coverage area and the installation cost in WSN by considering the sensor network connectivity as a constraint.The algorithm uses a population of individuals, each of which represents a set of wireless sensor nodes types and positions and evolves them via the proposed genetic operators.The proposed mutation and constraint-domination operators were designed to quickly seek the optimal solutions that meet the WSN installation requirements.Simulation result shows that the sensing coverage and installation cost were improved.
Karimi et al. [33] proposed two algorithms such as GP-Leach and HS-Leach.The energy consumption was improved by partitioning the network and using evolutionary algorithms for optimized cluster head selection considering WSN nodes position information and residual energy.The simulation results performed in MATLAB show that the proposed algorithms were more efficient and they increased the lifetime of network.
In this work, a Weighted Minimum Spanning Tree, Bee Algorithm-Simulated Annealing (BASA-WMST) algorithm is proposed.Cluster heads are selected based on the proposed optimization technique and WMST is used to find the shortest intrapath selection within the cluster.The proposed method computes the distance-based minimum spanning tree of the weighted graph for the multihop network.During route selection, the weights are dynamically adjusted based on the mobility, energy level, and distance of each sensor.Section 2 presents the problem formulation, Section 3 deals with the proposed methodology in detail, Section 4 shows the experimental results, and Section 5 concludes the paper.

Problem Formulation
Tree based routing has the advantage of lower control packet overheads but suffers from approximation error compared to cluster based routing.Cluster based routing provides better energy savings compared to tree based techniques.In this work it is proposed to combine the features of cluster based routing for cluster formation and cluster head selection and use minimum spanning tree for intracluster communication.Ideal clusters are formed when the network parameters like energy spent, lifetime, Packet Delivery Ratio, and end to end delay are optimized.Since most of the network parameters are additive in nature the optimization problem is NP hard.Several metaheuristic techniques including genetic algorithm have been proposed in literature.In this paper, bee algorithm in combination with simulated annealing was chosen due to its faster convergence and its capability to avoid local minima problem.
Since node mobility, delay, and remaining energy are used as the edge in the graph, the objective function can be formulated as in min   () =  (min (mobility)) +  (min (delay)) The following assumptions are made for the sensor network.
(2) The energy of sensor nodes is limited and uniform initially.
(4) The transmitting power of the nodes varies depending on the distance to the receiver.
(5) Approximate distance is estimated based on the received signal strength.

Methodology
The node energy model is based on [35].The energy dissipated to transmit  bit is given in =  (Energy dissipated tranmitter electronics + (Energy dissipated transmitter amplifier * distance squared)) . ( The energy dissipated to receive  bit is given in Power consumed for a given time period  can be computed by dividing the dissipated energy by time and is given by The mobility of a node is estimated using the Free Space Path Loss (FSPL) model.The relation between FSPL, frequency of radio signal, and distance between the transmitter and receiver is given by where  is the distance,  is the frequency, and log is the logarithm to base 10.  is a constant and is equal to 32.44 when frequency is measured in Mhz and distance is measured in Kilometer.Another method to compute the FSPL is using the fade margin and it is given by FSPL = Energy dissipated Tx electronics + Energy dissipated Tx amplifier Using the two FSPL equations ( 8) and ( 9), the distance can be computed by To find the distance travelled by nodes  and  with respect to each other during time , the distance between the nodes is computed at time  and  =  if high mobility increases the reclustering process and increases the energy consumption.
The objective is to form clusters based on low mobility which leads to lower energy consumption and lower delays due to lower link breakages.The mobility of the node can be computed by 5 implies high mobility ≤ 0.5 implies normal mobility 0 implies no mobility < 0 implies nodes converging Each node stores in its neighborhood table the information about its neighbors, as shown in Table 1.Each node broadcasts the Ech Msg, at the beginning of each round which contains residual energies, within radio range .All nodes within the cluster range of one node are considered as the neighbors of this node.On receiving the Ech Msg nodes update the neighborhood table.
The flow chart of the proposed technique is shown in Figure 2. Bee algorithm-simulated annealing algorithm is proposed to avoid the local minima problem faced by bee algorithm and to select the best cluster heads by forming ideal clusters.Clustering is achieved by dividing arbitrarily organized sensors into the best possible number of selfdetermining clusters with cluster head and optimal route to form the initial population.The edge weights between nodes are computed and the objective function is computed.These initial solutions become the initial food source in the proposed BASA algorithm.Once the initial population is found, the bee algorithm is initiated.Each node broadcasts its ID along with its weight   to the neighboring nodes and stores the weights   of the other nodes within its transmission range.
Bee's algorithm is a population-based search algorithm inspired by bees foraging behaviour [36].The algorithm starts with search space being populated by worker bees being placed randomly at the location of the initial food source.The fitness of sites visited by worker bees is evaluated and bees with the best fitness continue to be worker bees.Bees which have visited sites with lower fitness value are delegated to onlooker bees.The location of food source with the best fitness becomes the new search location for better solutions.Effectively a cluster with CH can either add new nodes to increase the cluster or remove some nodes to decrease the cluster.Similarly, the CH can be rotated within the cluster.This is achieved by searching neighborhoods of selected sites by assigning scout bees to search near the best  sites.Neighborhood searches of the best  sites are made detailed by recruiting more bees other than the selected bees to follow them.This differential recruitment along with scouting is a key bee's algorithm operation.
The probability   of selecting a food source  can be determined by using  and the fitness is given as in where fit  is the fitness value of th solution which represents the nectar amount at food source at th position and SN is the number of employed bees and also a number of food sources.The process is iterated till the termination criteria are reached or till the improvement in the fitness does not increase by more than 0.001 in the last 10 iterations.If there is no improvement in the solution, the algorithm could have struck in the local minima.Simulated annealing is starting to climb out of the local minima problem.
The simulated annealing (SA) was introduced in 1983 which is based on the ideas formulated in the early 1950s [37].Simulated annealing is a relatively straight forward algorithm which includes metropolis Monte Carlo method.The metropolis Monte Carlo algorithm is well suited for simulated annealing, since only energetically feasible states can be sampled at any given temperature.Therefore the simulated annealing algorithm starts at a high temperature with simulation of metropolis Monte Carlo algorithm.The temperature is slowly reduced such that the search space becomes smaller for the metropolis simulation, and when the temperature is low then the system has hopefully settled into the most favorable state.Simulated annealing can also be used for searching the optimum solution of the problems by properly determining the initial (high) and final (low) effective temperatures which are used in place of  (where  is a Boltzmann's constant) in the acceptance checking and deciding what constitutes a Monte Carlo step [38].
Simulated annealing is a probabilistic method [39] to find global minimum of a cost function that can have several local minima.Simulated annealing emulates the physical process wherein a solid is slowly cooled so that when eventually its structure is frozen, this happens at a minimum energy configuration.Simulated annealing to compute the probability of acceptance: where Δ is the difference between the solution error after it has perturbed and the solution error before it was perturbed,  is the current temperature, and  is a suitable constant.On identifying the potential cluster heads, the MST algorithm is used for tree construction to find the intracluster routes.Suppose that  points are given in different dimensions, then a tree spanning to these points is a set of straight line segments joining pairs of points [40], so that (1) there are no closed loops, (2) a line visits each point at least once, and (3) the tree is connected.Figure 3 shows an example of a tree of integer segment lengths.If, for example, vertices N3 and N7 are joined, a closed loop is formed and the result would not be a tree.The length of a tree is the sum of its segments lengths.When a set of  points and the lengths of all (/2) segments are given, a spanning tree of minimum length (MST) is required.The MST is computed using reaching the base station.With traditional MST algorithms, construction cost of a minimum spanning tree is ( log ), where  is the number of graph edges and  is the number of vertices [41].The weight of a tree edge is computed by Euclidean distance between two end points.The average weight  of MST edges based on remaining energy and delay is computed.Any edge with a weight  >  avg is removed leading to a set of disjoint subtrees  = { 1 ,  2 , . . .  }.
The so formed routes are optimal from the spatial perspective since the cluster heads are uniformly distributed over the imperfectly formed wireless sensor network.

Experimental Set-Up and Results
Experiments were conducted with different number of mobile sensor nodes, spread over an area of 1000 m by 1000 m with the Base Station being stationary at location (500, 500).The simulation parameters and bee algorithm parameters are shown in Table 2.
Experiments were conducted to simulate the proposed technique and are compared to cluster based routing, GA based cluster formation, and ABC based cluster optimization.Figure 4 shows the number of clusters formed.The proposed BASA WMST technique increases the average number of  clusters across by 10.41% compared to cluster based routing for varying number of nodes in the network.ABC produces better clusters compared to GA. Figure 5 shows the average end to end delay obtained in the network for different number of nodes.As the number of nodes increases, the performance improvement of the proposed technique is in par with GA based technique showing a small marginal average improvement of 1.27% compared to GA based technique.
However, both GA and proposed BASA-WMST show significant decrease in end to end delay as the number of nodes is increased.End to end delay decreased over 14% when compared to cluster based technique.This becomes significant for WSN used in streaming applications.Figure 6 shows the Packet Delivery Ratio obtained under different number of nodes.
The average PDR in the proposed BASA WMST improved by 6.49% when compared to cluster based routing for various numbers of nodes in the network.Compared to GA the PDR improvement was significant by 4.98% and by 1.47% compared to ABC based technique.Figure 7 shows the life time of the network.ABC and the proposed technique significantly improve the life of the network compared to GA based technique.Figure 8 shows the average remaining energy in the nodes.The energy savings are significant in the proposed technique compared to GA based technique.Average energy to the tune of 21.67% is observed compared to GA.

Conclusion
WSN routing protocols must perform efficiently under mobility and energy constraints.In this paper, clustering is achieved through a hybrid algorithm that divides arbitrarily organized sensors into the best possible number of selfdetermining clusters with cluster head and optimal route to base station using a novel optimization BASA-WMST.Bee algorithm is incorporated to increase the information exchange among bees and SA is used to escape local optima.Intracluster route is selected from the optimal trees based on weights described.The proposed routing was simulated and compared with conventional cluster based routing and other optimization techniques showing improvements in the QOS.

Figure 1 :
Figure 1: Block diagram of WSN deployment with cluster heads.

Figure 2 :
Figure 2: Flowchart of the proposed technique.

Table 1 :
Information maintained in the neighborhood table.  A neighbor node in cluster range of   RE   Residual energy of   Dis   Distance between   and

Table 2 :
The parameters of the network simulation.