Optimization for Service Routes of Pallet Service Center Based on the Pallet Pool Mode

Service routes optimization (SRO) of pallet service center should meet customers' demand firstly and then, through the reasonable method of lines organization, realize the shortest path of vehicle driving. The routes optimization of pallet service center is similar to the distribution problems of vehicle routing problem (VRP) and Chinese postman problem (CPP), but it has its own characteristics. Based on the relevant research results, the conditions of determining the number of vehicles, the one way of the route, the constraints of loading, and time windows are fully considered, and a chance constrained programming model with stochastic constraints is constructed taking the shortest path of all vehicles for a delivering (recycling) operation as an objective. For the characteristics of the model, a hybrid intelligent algorithm including stochastic simulation, neural network, and immune clonal algorithm is designed to solve the model. Finally, the validity and rationality of the optimization model and algorithm are verified by the case.


Introduction
In the pallet pool mode, the usage rate of pallet can be improved and logistics cost can be reduced by duty-cycle operation. However, this will increase the liquidity of the pallets. After pallet service center is determined, the pallets are circulated in the circulation of service center, each demand point and freight station by the scheduling and coordination of service center and according to the actual supply and demand situation.
The purpose of pallet service route optimization is to determine the optimal service path, so that the vehicle can provide services (delivery and recovery operation) for each demand point efficiently. This is a problem of solving the optimal routes for multipoints. Now, a few researches for the route optimization of pallet service center are reported in the literature. Through analyzing the purpose of pallet service route optimization, we find that the pallet service route optimization is similar to logistics distribution vehicle routing optimization (VRP), so we can draw lessons from the relevant theory research of VRP.
Vehicle routing optimization problem is defined as follows: in the case of warehouse and when the customer's position is fixed, the distance, the demand, and other information are all known. Take the customer's various demand as the restraint condition, and seek the best plan to achieve the shortest path (the province with the highest expense, the least time, etc.) [1]. Due to the complexity of the actual demand, the vehicle routing problem with the related constraints based on the basic vehicle routing optimization problem is generated. Capacitated vehicle routing problem (CVRP) takes into account the vehicle loading capacity; each vehicle can only be loaded with a certain amount of goods, and vehicles' capacity must be increased when the goods exceed the vehicle loading capacity. CVRP is the most common vehicle routing problem. Vehicle routing problem with time windows (VRPTW) mainly researches the service time according to the requirement of customers; service time is limited by 2 Computational Intelligence and Neuroscience customer; the vehicle can only arrive at destination within the specified time; the service is not accepted or the service is punished with penalty at other times. The vehicle routing problem with time windows is a common problem in reality, especially for urban customers or when there are many random transportation processes [2]. Vehicle routing problem with pickups and deliveries (VRPPD) considers the demand of delivery and recovery for each customer; delivery and recovery services need to be completed at the same time for one customer; this can save the vehicle transportation cost for the logistics enterprises. It is a subproblem with economic efficiency and operating efficiency of the vehicle routing optimization problem, but it does not consider the time constraint [3]. Psaraftis [4,5] presented a dynamic vehicle routing problem. Since then, many scholars have done a lot of research in this field. Larsen [6] studied the method of dynamic degree feature index of DVRP. Brotcorne et al. [7] studied the dynamic scheduling problem of emergency vehicles, which is mainly about the emergence of new customers online. Taniguchi and Shimamoto [8] and Fleischmann et al. [9] studied the problem of real-time traffic information of DVRP. Melachrinoudis et al. [10] considered the impact of the special environment of medical service for the DVRP and constructed the corresponding model. Khouadjia et al. [11] and Ferrucci et al. [12], respectively, studied DVRP with dynamic customer, and the corresponding solving strategy was put forward. Albareda-Sambola et al. [13] verified the feasibility of dividing DVRP by time slots. Ghannadpour et al. [14] studied DVRP of multiobjective.
In addition, the delivery and recovery operation of pallet service center is similar to the operation of sending and receiving mails, so the route optimization can also learn from the Chinese postman problem. The Chinese postman problem as an important research field of operational research was first proposed by Professor Guan [15]; the problem is described as follows: the postman executes delivery task from the post office; he is required to select one of the shortest routes and must pass through the streets at least once and then return to the post office. Subsequently, the Chinese postman problem drew the attention of other scholars around the world [16]. Wang [17] for the first time put forward the "multipostman problem"; several mathematical models of the Chinese postman problem for multipostman were constructed; an effective algorithm for limiting conditions and the computational complexity of the Chinese postman problem for multipostman under normal circumstances were given. Rong et al. [18] put forward the model of multistage decision process conversion algorithm, so as to solve the dynamic programming model of the Chinese postman problem for multipostman. Zhu [19] proposed a single center Royd algorithm of the Chinese postman problem for multipostman and iterative diagram of computer program. "Multipostman problem" is the extension of the Chinese postman problem; it can be described as follows: a post office has ( ≥ 2) postmen; they were set out at the same time to send and receive messages of each street and finally returned to the post office. It is required that the total delivery route is the shortest by the reasonable allocation of the delivery route for each postman.

Problem Description and Model Construction
For any pallet service area, the supply points, demand points, and service centers have fixed distribution. When providing supply and demand service (delivery and recovery operation), it is necessary that the distribution vehicle traverses the demand points along the optimal route in order to achieve high efficiency of service operation with the minimum cost. This is consistent with the Chinese postman problem; it can be described using a mathematical language: if the road is expressed with edge ( , V), the length of the road is expressed with edge weight ( , V), and the pallet service center, freight station, and pallet demand enterprise are all expressed with point, so a pallet service area constitutes an undirected graph connected with edge weight. Routes optimization of pallet service center can be described with the language of graph theory, that is, how to find a loop in an undirected graph connected with edge weight which passes each edge at least once, where the length of is the shortest and is the best service route in the service area. Different from the logistics distribution, pallet service includes delivery and recovery. Generally speaking, the pallets used by customers cannot be provided to other customers directly; they should be recovered to the service center and be inspected, cleaned, and repaired. Finally, they are sent to the customer. Therefore, in order to make the total service routes the shortest, in every service work, we should first determine the type and amount of the pallets and vehicles according to pallet demand and recovery information and then find the optimal route for each vehicle.

Optimization Method of Service Route.
Based on the analysis of the previous section, we use solution ideas of the Chinese postman problem and draw the vehicle routing problem to solve the route optimization problem of pallet service center. Before using the Chinese postman problem to optimize the route, it is necessary to describe concepts of the Chinese postman problem. Euler loop is the graph that starts from a vertex and goes back to the starting point along each edge that is not repeated.
In the Chinese postman problem, there are ( − 1)! ways of arranging for delivery points without the starting point; each arrangement is delivery route planning. With the increase of , the amount of computation shows exponential growth, and the time spent by the enumeration method is difficult to bear.
Based on the related literatures, the Chinese postman problem can be converted to 0-1 programming.
First, we determine whether each edge is included in the delivery line. ,V ( ̸ = V) is 0-1 integer variable; ,V = 1 indicates that the route is from to V, that is, ( , V) in the route; ,V = 0 means ( , V) is not in the route.
The objective function is as follows: where ,V is the cost of vehicle running in the route ( , V). The constraint of visit frequency for demand points is as follows: (1) Starting from the delivery point once and only once (to the other delivery points), it is expressed as (2) From a delivery point to V once and only once, it is expressed as The model established above is similar to the model of assignment problem; it is only a necessary condition for the Chinese postman problem, but not sufficient condition.
For example, there are six points connected by the routes (as shown in Figure 1); they meet the above two constraints, but there are two unreasonable subcircuits, so they cannot constitute the whole circuit.
For this situation, we need to consider increasing the sufficient conditions to avoid the subcircuit.
Add a variable , = 1, 2, . . . , (its size can be taken as an integer, e.g., the delivery point = 2, which is achieved by the vehicle from start point).
An additional constraint is as follows: We prove the constraint as follows: (1) Any line that contains the subcircuit cannot meet the constraint (no matter what the value of is).
(2) The whole circuit without the subcircuit can meet the constraint (as long as takes the proper value).
We use apagoge to prove (1), if there are subcircuits, at least two. Then, at least one subcircuit does not include the starting point 1, as shown in Figure 1; then, there will be Add these three inequalities and get ≤ − 1; this is obviously incorrect, so the assumption cannot be established. For the whole circuit, because of V ≥ 2 in the additional constraints, it does not contain the starting point 1, so it will not be contradictory.
For the whole circuit, as long as takes the appropriate value, it can meet the constraints: (1) For the edge of the whole circuit, ,V = 1, can take the order number of access point ; then there will be − V = −1, and constraint condition − V + ,V ≤ − 1 becomes −1 + ≤ − 1; this is inevitable.
(2) For the edge of nonwhole circuit, because of , In summary, the constraint condition only restricts the subcircuit and does not have other effects. So, the Chinese postman problem is transformed into a mixed integer linear programming problem.

Assumptions for Modeling.
Before modeling, the following preconditions and assumptions are given: (1) The influence of pallet type for delivery and recovery is ignored. (2) Each pallet service center is only responsible for pallet delivery and recovery within its own jurisdiction range. (3) Each vehicle has only one transport route; the start and end points of each route are the same pallet service center. (4) Each vehicle can provide service for more than one demand point, but each demand point can only be provided by one vehicle. For the demand of demand point more than a single vehicle loading capacity, we should determine the number of vehicles required to load the pallets and only take the amount of pallets which is less than one vehicle as the optimization target.  Computational Intelligence and Neuroscience (6) In the decision period, the amount of pallets waiting to be delivered and recovered is known and is random.
(2) Parameters and Decision Variables. The number of demand points serviced by vehicle is , and ∑ =1 = , so = { | = 1, 2, . . . , }; V is the amount of pallets that need to be distributed to the demand point V; V is the amount of pallets that need to be recycled from the demand point V; is the amount of loading after the vehicle runs steps; is the amount of pallets readied for distribution of pallet service center in the decision period; is the amount of pallets readied for recovering of pallet service center in the decision period; is the maximum loading of the vehicle ; V is the distance from the service center to the demand points; ,V is the shortest path between and V; V is the time of vehicle arriving at the demand point V; 0 is the departure time of vehicle from the service center; ,V is the theoretical traveling time of vehicle from the point to V; V is the service time of the vehicle at the point V; [ V , V ] is the time window of servicing required by point V; V and V denote the start and end time, respectively.
represents the distance of vehicle traveling from the service center to the first demand point and from the last demand point to the service center; represents the distance between the first demand point and the last demand point of vehicle that arrived; ,V is the 0-1 decision variable; it represents the vehicle at the step reaching the demand point V; subscript represents the number of steps that vehicle passed; and V indicates the arriving demand point. ,V = 1 expresses that the loading and unloading points of vehicle that arrived at the step are demand point V; otherwise, ,V = 0.

Model Building.
First, it is necessary to use the fewest vehicles in the process of pallets delivering and recovering; the number of vehicles is determined according to the amount of pallets waiting for distribution and recycling. The formula is = [max( , )/ ], and [⋅] expresses the rounding operation.
For example, the total number of pallets waiting for distribution is = 1760, and the total number of pallets waiting for recycling from the demand points is = 1400; each vehicle can accommodate up to 355 pieces; the minimum number of vehicles required is = [max(1760, 1400)/355] = 5.
The number of service routes is determined by the above method. Vehicles start from the service center, passing some demand points to load and unload, and finally come back to the service center.
The demand points within the service area are considered as a whole; the number of service routes is determined. The demand points of the vehicles passing are unknown; by reasonably arranging the transport routes of the vehicle, the vehicles are determined corresponding to the routes. Under the premise of meeting the transportation capacity, take the total mileage of the routes minimum as the objective. The problem is equivalent to the multipostman problem with constraints.
Based on the above analysis, the objective function and the constraint conditions of service routes optimization are given.
(1) Objective Function. Take the total mileage minimum of all vehicles completing a delivery and recovery operation in the decision period as an objective; the function is where (2) Constraint Conditions. (1) For any vehicle in any one step arriving at one and only one demand point, (2) For any demand point, there must be a vehicle arriving once and only once: (3) Loading constraint: in the process of delivering and recycling for pallets, the loading quantity exhibits a dynamic change in transport process; considering the uncertainty of damage and demand in the transport process, the stochastic demand is added for pallets distribution, in order to deal with the uncertainty of demand. Take the vehicle as an example; the loading quantity at the time of departure is where V is random demand. After the operation of loading and unloading at the first demand point, the loading quantity is Computational Intelligence and Neuroscience 5 In the process of delivering and recycling, the recursive formula of the loading quantity is The constraint of loading quantity is (4) Time constraints: the departure time of vehicle from service center is The time of arriving at the first demand point is In the process of delivering and recycling, the recursive formula of vehicle arriving at each demand point is According to the above analysis, V should meet the following conditions: Due to the uncertainty of vehicles in the transport process, they cannot arrive at the point in the required time window completely; there will be more or less some deviations, this is expressed as follows: where + V and − V are time variables of random arrival, + V indicates a positive deviation from the target value V of vehicle arriving at the demand point V, and − V indicates a negative deviation from the target value V of vehicle arriving at the demand point V.

Solution to the Model
The traditional method for solving the constrained programming model is by converting the opportunity constraint into its equivalent form and then solving the equivalent model. Combining the theory of stochastic constraints, constraints (12) and (17) can be transformed into the chance constraints [20]. Due to the random demand, for the vehicle, the initial loading quantity containing the random demand should be no more than the maximum loading capacity, and each vehicle needs to meet the loading constraints at the specified confidence level . In addition, each demand point is served by a confidence level V within its specified time window 6 Computational Intelligence and Neuroscience To meet the above conditions, the following chance constraints were given: Because the deterministic processing of stochastic constraint programming is only suitable for special cases, the hybrid intelligent algorithm based on stochastic simulation, neural network, and immune clonal algorithm is used to solve this model. Compared with genetic algorithm, immune clonal algorithm can promote or inhibit the generation of antibodies; it embodies the self-regulation function of the immune system and ensures the diversity of individual.
The solving steps of hybrid intelligent algorithm are as follows.
Step 1. Employ stochastic simulation technique to generate input-output data for the uncertain function.
Step 2. Train a neural network to approximate the uncertain function according to the input and output data generated.
Step 3. Initialize the antibody population and use the trained neural network to test the feasibility of antibody.
Step 4. Use immune gene operation to update antibody and examine the feasibility of offspring using the trained neural network.
Step 5. Use the trained neural network to calculate the target values of all antibodies.
Step 6. The fitness of each antibody is calculated according to the target values.
Step 7. Select antibodies by clonal selection.
Step 8. Repeat Step 4 to Step 7 until the end condition is met.

Antibody Description.
,V is decision variable of service route optimization. It signifies vehicle arriving at the demand point V at the step. When using hybrid intelligent algorithm to solve the optimization model of pallet service routes, we make the alternative routes as encoding of solution, as shown in Figure 2.
If it is proved that the antibody v = (x, y, t) is feasible, it is acceptable; otherwise, repeat the above process until a viable antibody is obtained.

Clone Operation.
Perform clone operation for antibody populations ; update to : x, y, t) , k 2 (x, y, t) , . . . , k (x, y, t)) , where k (x, y, t) = (k (x, y, t)) = × k (x, y, t), = 1, 2, . . . , , and is the -dimensional row vector where elements are 1, namely, clone of antibody k (x, y, t). For the single antibody k (x, y, t), the clone size should be properly adjusted according to the affinity between antibodies and the affinity between antibody and antigen. According to formula (21), the clone size is calculated: , y, t)) , y, t)) ⋅ ) , where ceil(⋅) is the function of rounding up and > is the setting value associated with the clone size. (k (x, y, t)) Computational Intelligence and Neuroscience 7 · · · · · · · · · · · · · · · P 1  is affinity between antibody k (x, y, t) and antigen (fitness); is the affinity between antibody and other antibodies; the value can be calculated according to the following formula: where ‖ ⋅ ‖ is Euclidean distance after normalization; that is, 0 ≤ ‖ ⋅ ‖ ≤ 1. Obviously, the lower the antibody affinity and the higher the similarity degree, the stronger the inhibition among antibodies and the smaller the value of .
For gene t, a mutation direction d is randomly generated in ; for the prespecified step size , t + ⋅ d is not in ; then, set as a random number between 0 and , until it meets the time window [ , ] . If the gene t cannot be generated within the specified number of iterations through the above process, then take = 0. Finally, the parent gene t is replaced by offspring t = t+ ⋅d.

Clone Selection.
Through crossover and mutation of high frequency, we choose an optimum individual of the highest fitness from parent individuals and offspring as the next generation individuals [21]. The mixture of parent individuals and offspring individuals can avoid degradation of the algorithm. The realization process is shown in Figure 3.
In Figure 3, 1 , . . . , are parent individuals and 1 and were copied as and ; we perform mutation of high frequency for duplicate individuals; 1 and are offspring individuals after cloning and selection.

Case Study
Now, there are two pallet service centers in the region; they provide pallet delivery and recovery service for each pallet demand point according to the demand in the decision period (as shown in Figure 4). There have direct routes 8 Computational Intelligence and Neuroscience Table 1: Delivery and recovery of demand points.  Table 2: Distance between service center and the points (km).  Tables 2 and 3). The number of pallets that will be distributed to the demand point and recovered from the demand points is shown in Table 1.
Arrival time of vehicle required by the demand point and the time matrix of vehicle traveling on the road are shown in Tables 4, 5, and 6, and the operation time of vehicles at the point of demand is shown in Table 7. The maximum loading per vehicle is 450 pieces, and the transportation cost of one vehicle is 5 yuan/km. The departure time of the vehicles for each service center is 8:00∼14:00 (integral point). Time variables + V and − V are all obeying normal distribution (10, 1). The stochastic demand V obeys normal distribution ( , ) (as shown in Table 6). The confidence level at which demand points can be serviced in the service window is assumed to be 90%; then, the chance constraint is The confidence level at which all demand points can be satisfied is 85%; then, the chance constraint is First, we calculate the number of vehicles; the result shows that there need to be 3 vehicles to complete the service for the pallet service center 2 . Then, use hybrid intelligent algorithm to solve the problem (3000 cycle simulations, 1000 clone operations); the algorithm was coded in C++ and run on a Core i5-560M CPU with 2.67 GHz computer. On average, the best solutions were obtained with a population size of 300, and the best result can be found in a very short time. Figure 5 depicts the convergence diagram of the proposed hybrid intelligent algorithm. The scheme of optimal service routes is as follows (Figure 6): The departure time of the first vehicle is 9:00, and walking route is 2 -11 -2 -9 -1 -2 .
The departure time of the second vehicle is 9:00, and walking route is The departure time of the third vehicle is 10:00, and walking route is 2 -5 -13 -1 -2 .
The total length of three vehicles running is 235 km, the total time is 803 minutes, and the total transportation cost is 1175 yuan.
For the pallet service center 5 , it only needs 1 vehicle to complete the service; the scheme of optimal service route is as follows (Figure 6): The departure time of vehicle is 14:00, and walking route is 5 -7 -12 -6 -3 -8 -10 -5 .
The total length of vehicle running is 106 km, the total time is 329 minutes, and the total transportation cost is 530 yuan.
In addition, if we do not consider the stochastic conditions and time, the scheme of optimal service routes will be as follows.
For the pallet service center 5 , the walking route of vehicle is 5 -7 -8 -10 -3 -6 -12 -5 ; the total length of vehicle running is 106 km, and the total transportation cost is 530 yuan.
Compared with optimization results without consideration of time and stochastic conditions (Figure 7), the optimization method which considers time and stochastic conditions in the paper costs more time and money, but this can meet the demands of customers. It is more suitable for reality.

Conclusions
The purpose of route optimization is to find the shortest path of vehicle traveling under the premise of meeting the customers' demand. It has a great impact on sustainability of supply chains. Based on the vehicle routing problem and the Chinese postman problem (CPP), the service route of pallet service center was optimized in this research. Taking full account of various factors that impact the route optimization, a new optimization model with stochastic constraints was formulated and applied to find the best routes of vehicles. Customer demand and transportation time were considered to be stochastic. This assumption adds a more realistic element in the transportation. Because it cannot be solved by optimization software directly, according to the feature of the chance constrained programming model, a novel hybrid intelligent algorithm based on stochastic simulation, neural network, and immune clonal algorithm was proposed for solving the model. Compared with genetic algorithm, the individual affinity of immune clone algorithm includes both the affinity between antibody and antigen (matching degree) and the affinity between antibody and antibody (similarity), so the individual evaluation is more comprehensive and the selection mode for individuals is more reasonable. The validity of the model and algorithm was verified by a case.
However, in actual service, the vehicle loading capacity of pallet service center may be different, and the differences of pallet types have a certain influence on vehicle loading capacity. In addition, the service mode of first delivering and then recycling in the service process might be better. Although the