Elitist Genetic Algorithm Based Energy Balanced Routing Strategy to Prolong Lifetime of Wireless Sensor Networks

Wireless sensor networks have gained worldwide attention in recent years due to the advances made in wireless communication. Unequal energy dissipation causes the nodes to fail. The factors causing the unequal energy dissipation are, firstly, the distance between the nodes and base station and, secondly, the distance between the nodes themselves. Using traditional methods, it is difficult to obtain the high precision of solution as the problem is NP hard. The routing in wireless networks is a combinatorial optimization problem; hence, genetic algorithms can provide optimized solution to energy efficient shortest path. The proposed algorithm has its inherent advantage that it keeps the elite solutions in the next generation so as to quickly converge towards the global optima also during path selection; it takes into account the energy balance of the network, so that the life time of the network can be prolonged. The results show that the algorithm is efficient for finding the optimal energy constrained route as they can converge faster than other traditional methods used for combinatorial optimization problems.


Introduction
A wireless sensor network (WSN) consists of randomly/ manually deployed sensors that sense the physical or environmental events and send the data to the base station. A large number of small, inexpensive, disposable, and autonomous sensor nodes are generally deployed in an ad hoc manner in vast geographical areas for remote operations. Sensor nodes in a WSN are constrained in storage capacity, computation power, bandwidth, and power supply [1][2][3]. The development of low-cost, low-power, multifunctional sensor has received increasing attention from various industries. Sensor nodes are smaller in size and capable of sensing, gathering, and processing data. They also communicate with other nodes in the network, via radio frequency (RF) channel. The areas of applications of WSNs vary from civil, healthcare, and environmental to military [4].
Recent advances in WSN have led to searching for new routing schemes for wireless sensors where energy awareness is essential consideration. Traditional networks aim to achieve high quality of service (QoS) provisions; thus sensor network schemes must focus primarily on power conservation. Though there are some similarities between the WSN and ad hoc network, like both are multihop communications, they differ in many ways. Some of the power-aware routing protocols proposed for ad hoc networks can be examined for energy constraints, but ad hoc routing techniques proposed in the literature are not effective, feasible and do not usually fit the requirements of the sensor networks. The applications and routing requirements for the two are different in various aspects; first, the typical mode of communication in a sensor network is from multiple data sources to a data recipient/sink rather than communication between any pair of nodes. Second, since the data being collected by multiple sensors is based on common phenomena, there is likely to be some redundancy in the data being communicated by the various sources in sensor networks. Third, in most envisioned scenarios the sensors are not mobile, so the nature of the dynamics in the two networks is different. Finally, the single major resource constraint in sensor networks is that of energy. The situation is much worse than in traditional wireless networks, where the communicating devices handled by human users can be replaced or recharged relatively often. The scale of sensor networks and the necessity of unattended operation for months at a time mean that energy resources have to be managed even more carefully. This, in turn, precludes high data rate communication and demands energy-efficient routing protocols. Hence, it is necessary to design a special multihop wireless routing technique between the sensor nodes and the sink node with a focus on energy efficiency.
A typical energy-efficient routing protocol can be described in the following three phases as shown in Figure 1. In the first phase, sink broadcasts the interest or sensor nodes broadcast an advertisement for the available data and wait for a response. In the second phase, if the observations of some sensor nodes are matched with the interest list propagated by the sink, then the nodes forward data packets containing the required information to the sink using a certain routing scheme. In the third phase, the sink infrequently initiates a localized flooding in the network in order to keep all paths alive. The second phase is primarily considered due to the fact that data forwarding consumes much more energy than the two other phases.

Literature Review
Energy-efficient routing in wireless sensor networks has gained a lot of attraction from the researchers in the recent years. A summary of recent research results on energy efficient data routing in sensor networks is discussed in [5,6]. To increase the network lifetime, the design of efficient routing protocol for communication is very important. An illustration of the existing routing protocol for wireless sensor network using data centric approach and present performance analysis of these protocols is done in [7], which may have been further improved by considering optimization of other parameters like cost, distance, and so forth along with energy optimization. Evolutionary Algorithms can be used effectively to find the energy efficient path in wireless sensor networks [8]. A simple approach to minimize the average path length is proposed in [9] where they considered the wireless network of transceiver nodes with a known spatial distribution using a genetic algorithm optimization method. Each of the nodes consists of a relatively simple a transceiver (antennas, a receiver, and a transmitter). The goal of the optimization is to minimize the average path length from source to destination to minimize the transmitted power. This may sometimes result in early depletion of energy in some nodes in the shortest path selected for data communication. Further, a method proposed in [10] has used a multipath routing protocol for wireless sensor networks that enhances the reliability of WSN by using multipath routing. It is useful for delivering data in unreliable environments. The idea is to define many paths from source to sink and send through them the same subpackets. This implies that the traffic will increase significantly, at the same time it will increase the reliability of the network. The idea is to split the original data packet into subpackets through each path. This can offer at the end, even with the loss of subpackets, the reconstruction of the original message. The only drawback of this strategy is that it is not energy aware and provides more reliability at the cost of more energy depletion. Energy aware consideration is not done which may reduce the lifetime of the sensor network. The energy awareness in multipath routing is done in [11][12][13][14] with consideration of maximum lifetime routing in wireless sensor networks. This protocol routes data through a path whose nodes have the largest residual energy. The path is switched whenever a better path is discovered. The primary path will be used until its energy is below the energy of the backup path. By means of this approach, the nodes in the primary route will not exhaust their energy by the continual use of the same path, thus achieving longer lifetime.
Recent advances in WSN have led to searching for new routing schemes for wireless sensors where energy awareness is essential consideration [15]. The aim of networks is to achieve high quality of service (QoS) communication; thus, sensor network schemes must focus primarily on power conservation. Ad hoc routing techniques proposed in the literature do not usually fit the requirements of the sensor networks. Hence, it is necessary to design a special multihop wireless routing technique between the sensor nodes and the sink node with a focus on energy efficiency. A clustering method is proposed in [16] where a fixed number of cluster heads are selected based on maximum residual energy of the nodes. Each cluster head is associated with a group of nodes based on the minimum distance among them. In such scheduling, all the nodes dissipate uniform energy and subsequently remain alive for long time. The energy load balancing strategy used in this paper not only ensures the balanced energy consumption of the nodes in the network so that the network lifetime is prolonged, but it also considers the minimum cost path towards the sink so that energy can be further saved and elitism-based GA ensures faster convergence.
The rest of the paper is organized as follows. Section 3 describes the network model used in this paper, Section 4 talks about the modified GA approach with elitism used for WSN routing, Section 5 provides and analysis of the results obtained using the proposed scheme, and Section 6 is the conclusion.

The Proposed Network Model
The model considered in this paper realizes a powerful Base Station which has adequate source of energy supply and it is located far away from the sink. The sensor nodes are homogeneous and have limited energy. All the nodes have uniform initial energy allocation. The first-order radio model is used for calculation of the energy dissipation for data communication operations like transmission and reception. The energy spent by a node in transmitting a k-bit packet to another node meters away is given by and the energy spent in receiving the packet is Chinese Journal of Engineering 3 Step 1: adv. broadcast  The energy spent in transmitting a packet from node to node is the same as that from node to node for any given value of SNR; that is, there is a symmetric channel. The computational energy is negligible in comparison to communicational energy and can be ignored. A routing path or chain is an ordered sequence of the nodes in the network that transmit the data message towards the sink node. A time interval in which a node generates a packet and transmits it towards its neighbour is called a data collecting round. After receiving a packet from its previous node, every node relays it to the next node in the chain. The task is to minimize the energy dissipation in the nodes, by the formation of an optimal data collecting chain. Considering nodes in the network, the total energy expended in a typical data gathering round is the summation of the energy dissipated by the individual sensor nodes. Total energy spent can be calculated for a packet size of as follows: where denotes the distance between the th node and the ( + 1)th node in the data collecting chain. A threshold value is imposed on the communication radius between the nodes as th. This is determined by the communicating radius limit of the sensors as the sensor nodes are not able to transmit or receive reliably beyond this limit. This ensures reliable communication in between the nodes reducing unwanted noise and packet loss probability also this is necessary to reduce the energy consumption.

Improved Genetic Algorithm
Genetic algorithms (GAs) are stochastic search mechanisms. They are mainly a probabilistic search and optimization technique inspired by the mechanics of (Darwinian) natural selection and genetics [17]. It has been shown [18][19][20] that genetic algorithm performs well in solving the problem of finding path optimization for WSN. It works on a group of optimal or suboptimal solutions during the search and gives the optimal search path within a short time.
The algorithm works by storing useful information about the individuals from the current population either implicitly through redundant representations [21][22][23]. The elitist strategy ensures that the best individual will not be destroyed. Instead, the best individual will take place of the worst individual after the operations of crossover and mutation. If the energy of a particular node in the routing reaches below a predefined level, the node is replaced in the routing chain based on some probability. This is done, to prolong the lifetime of the network. Also new individuals are inserted into the population after every generation so that the diversity is maintained and GA does not get converge into local optima as shown in Figure 2. The steps of the modified GA are as follows.
(i) Population Initialization. In the first step individuals (chromosomes) are produced randomly and the evolutionary generation starts with iteration 0. The distance threshold (communication radius) is initialized.
(ii) Fitness Calculation. The fitness function is designed to increase the lifetime of the network, which evaluates whether a particular chromosome increases lifetime of the network or not. The algorithm preserves the historically obtained best chromosome; that is, with the highest fitness value this is called elitism. The fitness of each chromosome is calculated by The above equation calculates the energy of a chromosome containing genes and denotes the distance between the ( +1)th node (or, gene) and the th node in the data gathering chain. A greater value of the chromosome energy indicates a longer data gathering chain which means an inferior solution. The energy balance for each individual is computed by the sum of the residual energy of all the nodes in the individual divided by the total number of nodes: The fitness function which takes into account the cost (proportional to the distance between the nodes) between the nodes as well as the energy balance is thus given by (iii) Selection. This step selects the better individuals according to the corresponding selection operator. The selection operator used here is tournament selection where by a mating pool of the individuals with above average fitness value is maintained and two parents are randomly selected for crossover.
(iv) Crossover. This step produces the new individuals to complete the global search according to the corresponding crossover operator. Two-point crossover operator is used in this paper. In two-point crossover two points are randomly selected in a chromosome and all the genes from one point to another point are copied from the first chromosome to the second chromosome.
(v) Repair. This step removes the routing loop. The genes values which are repeated in the chromosome are deleted and thus the looping is removed. If a child violates the imposed constraints (distance threshold), then it is rejected and again the crossover is performed.
(vi) Mutation. This step produces the new individuals to maintain the diversity in the population. In mutation, a node is randomly selected from the best chromosome obtained in the past generation and a gene value is changed randomly to produce a new offspring. If the new offspring violates the constraints like distance threshold, it is rejected and mutation is performed again using repair function. The mutation operation can help the search to exit out of the local optima and search for global optima.
(vii) Elitism. In this step the best solutions from the population are maintained in the next generation so that the convergence is faster.
(viii) Checking the Terminating Criteria. If the maximum number of generations has elapsed or the energy of the network has fallen below predefined level, the algorithm is terminated.
(ix) Producing the Result. The program outputs the best chromosome found so far as the result.

Simulation Results
The energy dissipated per bit (in (3)) to run the radio electronics is elec (50 nJ/bit) and amp (10 pJ/bit/m 2 ) is the energy required by the transmit amplifier to maintain an acceptable signal to noise ratio (SNR) in order to transfer data messages reliably. is called the path loss exponent, whose value enhances with increasing channel nonlinearity (usually, 2.0 ≤ ≤ 4.0). The value of is 2 for free space transmission. The simulations are done under the MATLAB programming environment. Custom code is written to simulate the wireless sensor network. The number of sensor nodes varies from 10 to 50. The sensors are randomly distributed in the area of 100 m × 100 m. The sink node is at (100, 100). The initial energy of sensor nodes is 1 J. Every node transmits ten 4000bits messages every round. The number of individuals in the population is the same as that of total number of nodes in the network, and the maximum number of generations for genetic algorithm is ten times as that of total number of nodes in the network. The paper shows the comparison of different energyefficient routing schemes for WSN: (a) direct transmission between sensor nodes and the sink, (b) genetic algorithm based routing with least average energy consumption (ELGA) [17], (c) genetic algorithm based routing with energy balance being taken into account (EBGA) [17], and (d) Elitism based GA routing which takes into account the cost (distance) between the nodes as well as the energy balance of the individual (routing). Figure 3 shows the comparison of the following four situations mentioned above. The algorithms are tested for up to maximum of 50 nodes. It is observed that there is much improvement in the network lifetime using the proposed GA with elitism. There is 6% improvement in the network lifetime with the proposed scheme for 50 nodes, as compared to EBGA, whereas with direct method it is 208% improvement which is more significant. Figure 4 shows the residual energy of twenty nodes for 100 m × 100 m network area when the first dead node appears. The average residual energy of the network using direct, ELGA, and EBGA method is 0.6972 J, 0.3791 J, and 0.0184 J [17]. The average energy of the network using the proposed algorithm is 0.1530 J which is 16.8478% improvement over the EBGA. This shows that the proposed algorithm better utilizes the network energy which increases the network   lifetime significantly. When finding the optimized routing, the elitism-based GA using the fitness criteria mentioned in (6) converges much faster as compared to simple GA for the same fitness criteria. This is because it preserves the best solution for the next generation which improves the GA to a greater extent as shown in Figure 5. The simple GA converges in about 70 generations whereas the elitism-based GA takes about 20 generations to converge, which is much faster.

Conclusion
In this paper, the basic genetic algorithm is improved using the elitism concept to provide the solution to combinatorial optimization problem of finding the energy efficient shortest routing for the wireless sensor networks. The modified GA uses elitist strategy so that the best individual is preserved and carried to the next generation so that there is significant improvement in the convergence. The node energy falling below predefined level is restrained from the path based on some probability to increase the overall lifetime of the network. The energy load balancing strategy prevents uneven energy dissipation and thus the network lifetime is prolonged considerably.