A Study of Network Violator Interception Based on a Reliable Game Model

This study focuses on planning interceptor locations in a general transportation network to maximize the expected benefits from catching violators mixing in public traveler flow. Two reliability-related characteristics are also integrated into the planning model to make it more practical. One is that each interceptor (e.g., a sensor or a checkpoint) has a failure probability. The second is the existence of a “game” between the interceptor planner and violators. A nonlinear nonconvex binary integer programming model is presented. We develop a simulated annealing (SA) algorithm to solve this model, and numerical experiments are conducted to illustrate the computational efficiency of the proposed algorithm. We also analyze the sensitivity of the disruption probability of interceptors to optimal objective function values and discuss how to determine the values of these parameters in a violator route choice model.


Introduction
There currently exist many types of violators in daily life networks (such as ground transportation networks, commercial airline networks, and communication networks), any of whom may pose a serious threat to public safety.A typical example is that of the drunk drivers who frequently emerge in an urban road network [1][2][3][4].This is a difficult, real, and common problem troubling many local governments in many countries.Car accidents caused by drunk driving claim the lives of more than 15,000 people in the US every year and injure many more.More intuitively, as concluded by authorities [5], (1) one-third of all Americans will be involved in drunk driving accidents at some point in their lifetimes; (2) someone is injured in an alcohol-related car accident approximately every 60 seconds; (3) someone is killed in an alcohol-related car accident approximately every 40 minutes; and (4) drunk driving car accidents cost the US an average of $114 billion annually.This phenomenon also exists in China, where drunk driving and consequent traffic accidents are even more prevalent and serious.Eliminating drunk driving has long been a major focus of traffic safety professionals at the federal, state, and local levels.Terrorists, another type of violators, aim to attack airline networks [6][7][8][9].They mix in with the public flow of passengers and watch for an opportunity to launch an attack.As further examples, viruses and illegal data may exist within the public information flow in a local intranet or a world-wide internet.Viruses, in particular, may steal or violate a public user's private information or cause disruption to public internet users.Means of intercepting and eliminating illegal viruses is an important subject in the field of network security.
A common problem faced by decision makers who want to protect networks is how to catch violators mixing in the public flow as soon as possible by setting up interceptors (which could be DUI checkpoints for catching drunk drivers, or a virtual network police force that screens for viruses and illegal information).A network cannot have unlimited numbers of interceptors due to budgetary or logistical limits (e.g., a DUI checkpoint for drunk driving at one location will cause delays because vehicles need to stop and be checked, limiting the number of checkpoints that can be established).Furthermore, an unlimited number of interceptors is not necessary.One can always attempt to find an optimal interceptor layout with less than a given number of interceptors to obtain the largest expected benefit.This interceptor location problem (ILP) is very similar to "network interdiction" problems.The deterministic network interdiction problem has been studied by Zenklusen [10], Granata et al. [11], Rad and Kakhki [12], and Yates and Sanjeevi [13] with military applications and with applications to the interdiction of illegal drugs and precursor chemicals.In this problem, given a capacitated network, a source s and a sink t, the aim is to interdict arcs in the network to minimize the maximum flow from s to t, subject to constraints on the number of prohibited arcs.The stochastic version of this problem studied by Cormican et al. [14] and Ramirez-Marquez and Rocco S. [15] assumes that interdictions on arcs do not always succeed but may be either completely unsuccessful, partially successful, or completely successful.
All network interdiction problems are rooted in the wellknown max-flow-min cut theorem.These problems consider violators a type of commodity and attempt to break arcs to leave as few as possible arc-disjoint paths remaining for violators to use.However, for the ILP, we treat violators as a minority mixing in with public travelers, who are the majority.We attempt to discriminate violators from innocent public travelers as soon as possible, while they share the same network at the same time.ILP is not a max-flow-min problem in nature, as the purpose of network interdiction problems is to intercept as many violators as possible regardless of how long the violator will travel in the network or how many public travelers will be threatened by the violator.Additionally, in practice, we implement a "game" between interceptor planners and violators.After planners instantiate a specific interceptor layout, violators will adjust their route choices as much as possible to avoid being caught by interceptors.Thus, we expect that a robust solution to the interceptor layout will achieve equilibrium between planners and violators.
For the ILP, which is different from network interdiction research, we propose a reliable interceptor location problem (RILP).We assume that, in a general network, two types of flow exist: one is the public traveler flow, composed of innocent travelers, and the other is the flow of violators, who are mixed in with the public traveler flow and pose a threat to public travelers.We aim to effectively identify violators from the public flow by setting up interceptors along routes of flow.The installed interceptors are assumed to have a given probability of failure at capturing violators as they pass by.Additionally, "game" behaviors between planners and violators are integrated into the proposed model.To the best of our knowledge, our attempt is the first attempt to study ILP in a reliable and gamified manner.
This paper is organized as follows.Section 2 presents the model formulation.Section 3 discusses the algorithms for this proposed model.Section 4 presents some numerical tests to investigate the computational performance of the proposed model and algorithms.Section 5 concludes the paper and discusses future research directions.

Model Formulation
To formulate RILP, a network should first be classified into a series of origin-destination (OD) pairs between which daily traffic demand (including violators) is generated.We use  to represent the set of all OD pairs,   to represent the set of available routes between a specified OD pair  ∈  and  to represent the set of all possible routes between OD pairs.Let   be the set of candidate locations along route  ∈   for interceptors and let  be the set of all candidate interceptor locations.  represents the expected number of violators generated in OD pair  ∈  during any specified time interval.
For any OD flow, an interceptor will check passengers if and only if the flow passes the interceptor.Any violators in the flow will be caught.In this case, we say that the flow is covered by the interceptor or that we have flow coverage.The benefit obtained from setting up an interceptor along a specific route, which we denote by path coverage, depends on not only the inspected passenger volume but also the lengths of the covered OD paths.For simplicity, this study adopts a vehicle-mile coverage measure [16] such that the path coverage benefit for an OD path is proportional to both its traffic volume and the covered length.
The existence of violators will pose a safety threat to the traveling public.The magnitude of the threat is dependent on the public traveler flow encountered and the length of the path traveled by the violators before their interception.For a specific route , setting up an interceptor at candidate location  ∈   results in a corresponding potential benefit   stemming from the public's protection from violators on that route.Therefore, in a network with link set , the path coverage benefit   can be defined as follows: where   and   are, respectively, the length and the public traveler flow on link  ∈  and parameter    is defined as follows: 1, link  is on path  and downstream of location  0, otherwise.
In practice, due to the performance limit of interceptors, violators may not always be caught when they pass by interceptors.Even when interceptors are fully competent to discriminate violators from the public flow, we should not assume that all passing passenger flows are checked by interceptors because it is not practical to stop the flow to check every passenger (i.e., checking for drunk drivers at a DUI checkpoint will cause traffic delays if the sampling rate is high).This is similar to the concept of "sensor failure" that is widely studied in the literature [17][18][19][20][21].In this paper, we define this failure as "interceptor failure." Complementary to the introduction of interceptor failure, the expected intercepting benefit from setting up an interceptor at candidate location  along route  is related to the "head level" of that interceptor.Supposing that there are   interceptors installed on route .We see that once the locations with installations on  are given (i.e., {  1 ,   2 , . . .,     } ordered from upstream to downstream), their head levels are defined as follows.
Definition 1 (head level).An interceptor at location    is the level- head interceptor along route .
The primal decision variables  = {  } determine where to install DUI sites, where 1, a interceptor is installed at candidate location  0, otherwise.
Given , the auxiliary variables ℎ = {ℎ   } decide how interceptors are assigned to paths, where 1, a interceptor is installed at candidate location  along route  and assigned to head level  0, otherwise. ( Assume that each interceptor fails independently with an identical probability 0 ≤  < 1.The path coverage benefit from setting up interceptors at location  along route  is   .As interceptor  functions to catch violators with probability (1 − ) −1 , its expected contribution to catching violators is (1 − ) −1 ℎ     .Additionally, each route  ∈   has a probability   of being selected by violators, such that ∑ ∈    = 1,∀ ∈ .This route selecting probability should not be defined statically but should instead be related to the layout of interceptors.For example, locating a DUI checkpoint at a certain location one night may induce drunk drivers to avoid this location later.To account for this "game" between interceptor planners and violators, we assume that violators will select a route between a specific OD pair according to a Logit model, in which the utility of violators to select a specific route is determined by two aspects: the numbers of installed interceptors along all possible OD routes and the travel lengths of these routes.We assume that violators always try to find a route that possesses as few interceptors as possible and a short travel distance.We define   as follows: where =1 ℎ   is the number of installed interceptors along route ,   is the travel length of route , and  1 ,  2 are two coefficients corresponding to interceptor number and travel length, respectively, in the utility function.
Taking into account the discussion above, the expected intercepting benefits across an exponential number of failure scenarios and the game behavior between planners and violators can be consolidated into a compact expression, and RILP can be written as follows: Objective function (6) aims to maximize the total expected benefit from all interceptors.Constraint (7) enforces the budget limit, which is that no more than  interceptors are allowed to be installed.Constraint (8) ensures that each installed interceptor is assigned to each of its corresponding paths at one and only one head level.Constraint (9a) indicates that no more than one head interceptor is assigned to each path at each level.Constraint (9b) implies that, for each path , all the implemented head assignment levels { | ∑ ∈  ℎ   = 1} start from 1 and form a consecutive sequence.Constraint (10) is the definition of the head level number along a specific route.Constraint (11) defines the binary variables.

Solution Algorithms
Solving the model of RILP is relatively difficult, as it is a nonlinear, nonconvex binary integer programming model, the objective function of which contains exponential, fractional forms of decision variables.We will introduce simulated annealing (SA) to solve this model.
SA is a generic probabilistic metaheuristic for the global optimization problem of locating a good approximation to the global optimum of a given function in a large search space.SA is often used when the search space is discrete (e.g., all tours that visit a given set of cities).For certain problems, simulated annealing may be more efficient than exhaustive enumeration, provided that the goal is merely to find an acceptably good solution in a fixed amount of time, rather than to find the best possible solution.The algorithm is well Step 1. Initialization.
Step 2. For the given , perform the following: (2.1) Set the inner iteration counter to  = 1.
(2.2) Randomly select an element from  (selected candidate locations) and  −  (unselected candidate locations), respectively, and exchange them to produce a new solution x. described by Liu [22].A brief summary of the algorithm is given below.
(1) Initialization.An initial solution is generated randomly from the feasible region.The initial temperature should be high enough to allow all candidate solutions to be accepted.
(2) Markov Length.The iteration number  is used in each temperature.This number should be set appropriately high such that the objective function values reach a Boltzmann distribution.
(3) Cooling Schedule.The cooling schedule is the rate at which the temperature is reduced.In this paper, 0.8 is used for the first 12th temperature reductions.A cooling schedule of 0.8 means the temperature of the next stage is 0.8 times the current temperature.A cooling schedule of 0.5 is used after the 12th temperature reduction.
Step size at each move should be decreased along with the reduction in temperature.Feasible solutions at lower temperatures are close to the optimal solution.When the temperature is low, a stochastic search tends to be a deterministic search.If the step size is too large, at low temperature, some feasible solutions will be rejected, thereby wasting computation time.
(5) Neighboring Solutions.Neighboring solutions are the set of feasible solutions that can be generated from the current solution.Each feasible solution can be directly reached from the current solution by a move and the resulting neighboring solution.
(6) Stopping Criteria.The algorithm stops when the number of temperature transitions reaches a prespecified number, or when the temperature is reduced to a threshold or when the neighboring solution was not improved after a given period of time.
The algorithm of SA on RILP is summarized in Algorithm 1.

Case Study
In this section, we consider a special type of violator, drunk drivers who blend in with public travelers in an urban road network and aim to find a reliable layout of interceptors (DUI checkpoints) to catch them and consequently protect public innocent travelers.Several sets of numerical experiments are performed to test the computational performance of the proposed model and solution algorithm.All algorithms are coded in MATLAB R2010b and run on a desktop computer with 2.70 GHz CPU and 4.00 GB memory.
The proposed models are applied to the Sioux Falls network, which includes 24 nodes and 76 links, as shown in Figure 1.Every node represents a zone.The nodes with red/green colors are assumed to be the origins/destinations of drunk drivers.All OD trip data and network data, including the link lengths, link free flow travel times, link capacity, link characteristics (i.e., BPR function), and link indices, are downloaded from Bar-Gera [23].Using network data and OD trip data, the traffic assignment of user equilibrium (UE) criteria [24] is implemented to obtain the link flow   , ∀ ∈ .We then use   to calculate   with formula (1).
The difficulty of implementing RILP lies in the exponential number of routes between each OD pair.This will be exacerbated for larger networks.However, it can reasonably be assumed that not every route will be used by drunk drivers.Based on this fact, we select a feasible and reasonable number of routes between each OD pair for drunk drivers.We only use those routes whose probabilities of being taken by drunk drivers are no less than a certain small value (e.g., 0.1).The probability that each route between a given OD pair is taken by drunk drivers is computed by the following Logit-like formula: where   ,   , and   are the travel times of routes , , and  between OD pair , respectively, which can be obtained by the implementation of UE.The loop-less -shortest path algorithm presented by Yen [25] can find an arbitrary number of ranked shortest paths between each OD pair.We use that algorithm to successively find and add a new route to the set of k paths until the selected probability, computed by formula (12), is smaller than a predefined small value.
In our example, there are a total of 30 OD pairs (with 5 origins and 6 destinations) for generating and absorbing drunk drivers.The "demand" of drunk drivers between each OD pair is randomized between 1 and 100 in Excel 2010.The total number of routes generated by the -shortest path algorithm is 99.In this example, all 24 nodes in the Sioux Falls network are assumed to be candidate locations for interceptors.

Solution Algorithm Performance.
To analyze the performance of the SA algorithm in solving our RILP model, we run a series of instances for  = 0.1,  1 = 1000,  2 = 2.5,  ∈ {5, 10},  0 = 500, V ℎ = 100,  = 0.8, and  = 0.005.In each instance, the initial solution for the interceptor layout is randomized by the MATLAB function "Randperm".The results are shown in Figures 2 and 3.As observed, the best objective function value always stabilizes after several iterations of SA in each run, and the nearoptimal result is very similar across all runs for a specified  ( = 5 or  = 10).This indicates that the proposed SA algorithm for RILP is nonsensitive to the initial solution  and performs stably.Additionally, each run almost stabilizes at close to 20 iterations, and in each iteration 100 (because Markov length = 100) computations are necessary to obtain objective function values.Thus, SA uses approximately 22 × 100 computations to approach an optimal solution, while an enumeration algorithm may cost  5  24 = 42504 computations.Therefore, the proposed SA algorithm is more effective at solving RILP than enumeration.

Sensitivity Analysis of Disruption Probability 𝑝. In this set of experiments, we aim to investigate changes in objective values and interceptor deployment solutions under different disruption scenarios.
Using four different disruption probability () values, we use SA to solve RILP with  values from 1 to 24.For each , we implement SA 4 times and use the average objective value as the final result.Figure 4 shows the benefits obtained for each  over varying numbers of installed interceptors.Overall, objective values approach stability when the number of installed interceptors is more than 5 under each disruption scenario.We call  = 5 the "critical point".This is a reasonable result because we assume there are only 5 potential origins for violators, and if these 5 origins all contain installed interceptors, all of the first nodes of each violator route will be covered.Before the number of installed interceptors achieved at the "critical point", the potential improvement of the disruption probability, the sharper the increase in the objective value before the "critical point" and the slower the increase after the "critical point".Figure 4 also indicates that while  is determined, the decrease in objective values is not linear across different disruption probabilities, although we attempt to linearly increase the probability.The difference in objective values between any two neighbor disruption scenarios is exacerbated sequentially.Table 1 presents interceptor solutions under four different disruption scenarios (with only  = 3, 4, 5 shown as examples).We can see that when the disruption probability varies, the "optimal" interceptor solution also changes.

Additional Discussions.
In practice,  1 represents the resultant benefit from a one unit increase in the probability of a drunk driver avoiding an interceptor, and  2 represents the cost incurred by increasing the travel distance by one unit.One should be careful in determining the values of these two parameters because they will cause significant differences in how drunk drivers select routes and, consequently, differences in the optimal interceptor layout.If a government increases the amount of attention it pays to eliminating drunk driving,  1 should be very large (e.g., in China, drunk driving leads to high fines and also may lead to suspended driver's licenses and jail time).

Conclusion
This paper studies an interceptor location problem in an arc-node network.The interceptor identifies violators from network traffic and reduces the safety risk for the general public.We propose a reliable interceptor location model (RILP) to account for interceptor disruption and the "game" played between interceptor planners and violators.The RILP is a nonlinear, nonconvex binary integer programming model that is difficult to solve.We develop a simulated annealing (SA) algorithm to solve the RILP, and numerical case studies are conducted to test the algorithm.Various managerial insights are also drawn from the numerical results.We find that the proposed algorithm is able to effectively and efficiently solve the RILP.Sensitivity analysis of the disruption probability shows that (i) the total benefit of the system decreases as the disruption probability increases and (ii) there exists a "critical point" for the number of installed interceptors.We further discussed the physical meanings of two parameters in a Logit model for the selection of routes by drunk drivers and how to determine these parameters.
In this study, we consider a heuristic algorithm, SA.Because of the uncertainty inherent in heuristic methods, future studies should aim to develop global optimal algorithms to solve this nonlinear programming model.

Table 1 :
Interceptor deployment solutions under four disruption scenarios.