UAV-Assisted Data Dissemination in Delay-Constrained VANETs

Due to the high mobility of vehicles, the frequent path failures caused by dynamic network topology, and a variety of obstructions, efficient data dissemination with delay constraint in vehicular ad hoc networks (VANETs) is a challenging issue. To address these problems, a novel mobile relaying technique by employing unmanned aerial vehicles (UAVs) is considered to facilitate data dissemination in vehicular environments where the communication infrastructures are not available or the network connectivity is poor. +is paper studies and formulates the throughput maximization problem in UAV-assisted VANETs, which aims to achieve high throughput while guarantee the delay constraint of data flows to the vehicles in the area. To maximize the network throughput, the maximization problem tries to find an optimal delivery strategy for data dissemination by optimizing the transmission rate. To solve the problem, the knapsack problem can be reduced to the maximization problem, which is proved NPhard. A polynomial time approximation scheme is proposed to achieve an approximate solution. Detailed theoretical analysis including time complexity and approximation ratio of the proposed algorithm is presented. Simulation results demonstrate the effectiveness of the proposed algorithm.


Introduction
As important components of Intelligent Transportation System (ITS), vehicular ad hoc networks (VANETs) are large-scale mobile ad hoc networks composed of vehicles with communication functions and roadside infrastructures, which aim to provide services for autonomous driving and high-speed information sharing [1,2].In VANETs, drivers mainly obtain real-time road conditions and safety information sent by other vehicles through wireless communication technology.In this way, traffic accidents and road congestion can be effectively avoided while travel time and energy consumption can be reduced.Meanwhile, VANETs can provide information services, such as news and entertainment, which can add fun to the boring journey.
However, VANETs have some unique characteristics that other ad hoc networks do not share, such as high vehicle mobility, dynamic network topology, and intermittent network connectivity.
ese features bring a variety of challenges to data dissemination.To deal with the issues, unmanned aerial vehicles (UAVs) can be utilized to cooperate with VANETs.Compared to traditional terrestrial wireless communications, UAV-enabled communications are significantly less affected by channel impairments such as shadowing and fading and in general possess more reliable air-to-ground channels due to higher possibility of having line-of-sight (LoS) links with ground users [3].Additionally, in the areas where the infrastructures are difficult or too costly to install and maintain to provide ideal network coverage, UAVs can serve as a viable option, as they can collect information from an area of interest and transmit the information to ground VANETs [4,5].ey can also act as relays to ground networks when direct multihop communications are not available.
Considering the advantages of UAVs, a cooperative hybrid network framework is proposed, integrating UAVs with ground vehicles for data dissemination in VANETs.In the studied scenario, a vehicle carries a message and aims to transmit the message to a target area where exist a number of vehicles and UAVs.To complete the transmission, the message can be either transmitted over vehicle-to-vehicle (V2V) links, vehicle-to-infrastructure (V2I), or air-toground (A2G) communication links.To improve the performance of data dissemination, it should transmit data as much as possible in a specific period, which means to maximize the network throughput.Transmission rate and transmission delay of the links over which data is transmitted are utilized to reflect the throughput.erefore, to achieve the maximum throughput is equivalent to maximize the sum of transmission rate of selected links on data delivery path.As there may exist more than one path from source to destination, the study aims to select a path with the maximum throughput while satisfying a predefined delay threshold.
Graph theory is applied to abstract the network as a connected graph, then the well-known 0/1 knapsack problem can be reduced to the throughput maximization problem.Due to the property of transmission in VANETs, the problem is regarded as the graph knapsack problem which is one of the classical NP-complete problems [6].
en, a polynomial time approximation algorithm for the graph knapsack problem is derived based on the approximation scheme for subset sum problem [7].Since the throughput maximization problem can be reduced from the graph knapsack problem, the proposed approximation algorithm can be applied to the maximization problem and to obtain an end-to-end path with the maximum throughput.
e main contributions of this paper are described as below.
(i) A throughput maximization problem in delayconstrained UAV-assisted VANETs is formulated, which considers the tradeoff of data transmission rate and transmission delay.en, a multiedge graph knapsack problem is constructed based on 0/1 knapsack problem and reduced to the throughput maximization problem, which is proved to be NP-hard.(ii) A polynomial time approximation scheme is developed for the multiedge graph knapsack problem to obtain the approximate solution.In the proposed scheme, the edges and vertices are assigned with values to indicate their weight.To select a path with the maximum weight, a trim procedure is applied to remove the unnecessary values.eoretical analysis proves that the algorithm runs in polynomial time with a bound which is polynomial in the size of the input and 1/ϵ, where ϵ denotes the approximation parameter.Additionally, the approximation ratio caused by trimming the unnecessary edges in path selection is also derived as 1 + ϵ. e results can be applied to general graph knapsack problem.(iii) An efficient data dissemination algorithm based on the approximate scheme for the graph knapsack problem is proposed to solve the throughput maximization problem.e values of edges in the graph knapsack problem correspond to the transmission rate and delay of the links.Considering the approximation in the knapsack problem, the proposed algorithm for the maximization problem has a quadratic approximation, which is the combination of the approximation to obtain the optimal transmission rate of links and the approximation to trim the unnecessary edges when selecting the path with the maximum throughput.e time complexity and approximation ratio of the proposed algorithm are also given.e remainder of the paper is organized as follows.Section 2 overviews the related work.Section 3 describes system model and problem formulation.Section 4 develops a polynomial time approximation scheme for the graph knapsack problem, based on which an algorithm for the throughput maximization problem is proposed.Performance evaluation is presented in Section 5. Section 6 concludes the paper.Finally, Section 7 discusses the tradeoff between the benefit and cost of employing UAVs and gives the direction of future work.

Related Work
Lots of research has been done to achieve data dissemination with high efficiency in vehicular networks, most of which is devoted to analyzing the performance of delay, throughput and utility of data dissemination [8].In this section, data dissemination in ground VANETs and UAV-assisted VANETs is mainly discussed.

Data Dissemination in Ground
VANETs.Tan et al. [9] proposed an analytical model to characterize the downlink average throughput and distribution achieved for each vehicle during the sojourn time by the Markov reward model.Zhang et al. [10] proposed an analytical model to facilitate the real-time data delivery as well as delay-tolerant data delivery, in which the theoretical per-vehicle throughput was derived.Lin et al. [11] developed an analytical model that accurately characterized the maximum throughput rate performance achievable under a prescribed outage probability constraint.As the first study on reliable transmission for bulk or stream-like data in DTNs (delay tolerant networks) [12], Zeng et al. proposed a dynamic segmented network coding scheme to efficiently exploit the transmission opportunity.Xing et al. [13] formulated the multimedia scheduling problem to maximize the utility and designed a heuristic algorithm.As continuous research, the authors [14] investigated multimedia dissemination for large-scale VANETs considering the tradeoff of delivery delay, the quality of service (QoS) of delivered data, and the storage cost.
As an emergent paradigm, some research work has applied SDN (Software Defined Network) to support applications in DTNs while reduce the operating costs [15] as it separates the control and data communication layers to simplify the network management.Liu et al. [16] described the application of SDN concept in VANETs and studied data scheduling problem.Nobre et al. [17] defined an architecture that adapted SDN to battlefield networking (BN), which integrated BN and SDN into dynamic and heterogeneous 2 Mobile Information Systems network-centric environments.Zacarias et al. [18] combined SDN and DTN concepts to address the needs of tacticaloperational networks, which could support the diverse range of strict requirements for applications.

UAV-Assisted Wireless Communications.
To provide wireless communications to a given geographical area, Mozaffari et al. [19] analyzed the deployment of an UAV as a flying base station and derived an analytical framework for the coverage and rate analysis for the device-to-device communication network.en they investigated the optimal 3D deployment of multiple UAVs [20] to maximize the downlink coverage performance with a minimum transmit power.Orfanus et al. [21] utilized the self-organizing paradigm to design efficient UAV relay networks, to provide robust connections to the devices on the military field.
Oubbati et al. [22] proposed a UAV-assisted routing protocol to assist data dissemination and improve the reliability of data delivery by filling the communication gap.Wang et al. [23] studied hybrid VANETs that utilized on-vehicle drones and proposed a distributed location-based routing protocol.Xiao et al. [24] employed UAVs to improve network performance against smart jammers and formulated the interaction between UAVs and jammers as an antijamming UAV relay game.Seliem et al. [25] proposed a mathematical framework to obtain the minimum drone density, which was equivalent to the maximum separation distance between two adjacent drones, to limit the worst delay of vehicle-drone packet transmissions.Shilin et al. [26] considered a drone-aided communication network model in an isolated VANET segment to enhance network connectivity.Fawaz et al. [27] developed a mathematical model that utilized drones to evaluate the impact of non-cooperative vehicles on forwarding path availability.Most of the related work did not consider maximizing the network throughput taking consideration of delay constraint in UAV-aided vehicular networks, which motivates this research.

System Model and Problem Formulation
In this section, the system model is presented while the maximization problem is formulated.

System Model.
To improve the reliability and efficiency of data dissemination in VANETs, UAVs are employed to form a cooperative air-to-ground network.By exploiting the UAV-aided VANETs, UAVs can help ground vehicles explore the area of interest and enhance network connectivity.
As stated in [22], most urban applications that use UAVs like small Quad-Copters do not fly at high altitudes [28].
us, this study assumes that UAVs have a low and constant altitude during the flight in order to communicate with vehicles on the ground.IEEE 802.11pMAC protocol is adopted for both V2V and A2G communications.UAVs in the network use a large transmission range (i.e., up to 1000 m [29]) and have a global view of the network.Vehicles and UAVs are equipped with GPS and digital maps to obtain their geographical positions.UAVs can also act as relay nodes to forward data packets when direct multihop V2V links are not available.
e cooperative network architecture of the UAVassisted VANETs is depicted as Figure 1, which is composed of the UAV network and the ground vehicular network.e scenario includes A2G and V2V communication links, which is a hybrid mode that allows the network to apply both A2G and V2V communications for data dissemination in VANETs.
e network can be abstracted as an edge-weighted graph G(V, E) (see Definition 1), where V is a set of vehicles and UAVs, E is a set of edges to indicate the communication links for data dissemination.e weight of each edge is represented by the transmission condition of the corresponding link.
Definition 1.Given a weighted network graph G(V, E), where V is the set of vertices and E is the set of edges.Let (w e , d e ) denotes the value of edge e, where w e indicates the transmission rate of e, and d e indicates the transmission delay.e tuple (W i , D i ) denotes the value of node i ∈ G • V, where W i and D i indicates the total transmission rate and transmission delay from source node s to i, respectively.
Note that G(V, E) only considers the edges over which two nodes can communicate, which means no silent edges are included.

Problem Formulation.
Assume a packet with size K carried by vehicle s needs to be transmitted to a specific area.
ere may exist more than one end-to-end path from source node s to other nodes in V, which can be denoted by P s and p indicates a path in P s .It is important to note that the paths may only exist among vehicles through V2V links or they may contain a hybrid of A2G and V2V communications.For simplicity, the A2G and V2V links are considered as common links with different properties hereinafter.e differences of the links are reflected by their transmission rate and delay.
As the network throughput can be mapped by the transmission rate of the end-to-end path, this study discusses how to optimize the transmission rate of each individual path to achieve the maximum throughput.To guarantee the real-time transmission, the end-to-end delay is limited to a predefined threshold.A continuous convex function f(r l ) � log r l with the transmission rate as parameter is utilized to depict the throughput, where l denotes a link on path p and r l denotes the transmission rate of link l.
e reason to consider f(r l ) instead of r l is that the logarithmic utility function log r l can better reflect the transmission rate of the delivery path and guarantee the maximum transmission rate.Meanwhile, the logarithm is concave and, hence, has diminishing returns.Here, it seeks a utility for that naturally achieves the maximum throughput and some level of fairness among the links.
To optimize the transmission rate of links and improve the network throughput, the throughput maximization problem can be formulated as below: where r l ∈ (1, c l ], c l indicates the maximum capacity of link l, d l is the transmission delay of link l, and δ is the predefined delay threshold.If link l is selected, x l is equal to 1, otherwise, x l is 0. Transmission delay d l can be calculated by the following equation according to the channel model [30]: e problem can be stated as follows.Given a delay threshold δ and n pairs of positive values (r l , d l ) to indicate the transmission rate and transmission delay of link l, it aims to select a delivery path which contains a few links to maximize the transmission rate while satisfying the delay constraint.e well-known 0/1 knapsack problem can be reduced to the throughput maximization problem.
en, a polynomial time approximation scheme is proposed to solve the problem.

An
Example.An example is given to illustrate how to derive an approximation solution for the maximization problem.An undirected graph G(V, E) with six vertices is shown in Figure 2. In graph G, each edge has a pair of values (w e , d e ) and each node has (W i , D i ) as its weight.e goal is to find a path from source node s to node d, such that the path has the maximum transmission rate W d while the transmission delay D d does not exceed δ. e procedure to obtain the path utilizing the approximation method is described.
First, the weight of each edge is given as (w sa , d sa ), (w sb , d sb ), (w ab , d ab ), (w ac , d ac ), (w be , d be ), (w ae , d ae ), (w ce , d ce ), (w cd , d cd ), and (w ed , d ed ).
e initial value of node s is (0, 0) and other nodes is (0, ∞). e values of the nodes are recorded, and a list of values for each node will be generated.ere might be quite a few values if there are a large number of nodes.To eliminate redundant values, a trim procedure will be executed if two values in L are close to each other since there is no need to keep both of them.More accurately, a trimming parameter α is utilized such that 0 < α < 1.When trimming a list by α, remove as many elements as possible, in such a way if L ′ is the result of trimming L, then for every element y that was removed from L, there is still an element z still in L ′ that approximates y, that is, rough the trim procedure, the approximate values (W d , D d ) of node d can be obtained; thus, an approximate path from s to d will be achieved.
Before explaining the details of the proposed scheme, a list of variables that will be used throughout this research is provided as Table 1.

Proposed Solution
In this section, the knapsack problem is reduced to the throughput maximization problem first.en, a polynomial time approximation scheme is proposed to solve the graph knapsack problem, which can return an approximate solution.Finally, a throughput maximization algorithm is presented based on the approximate scheme for graph knapsack.e approximation ratio and the running time of the proposed algorithm are also analyzed.

Problem Reduction.
To maximize the network throughput, this study optimizes the transmission rate r l while the sum of d l does not exceed δ is satisfied.e relation between the transmission rate r l and transmission delay d l is presented as d l � K/(c l − r l ), subject to  l∈L d l ≤ δ.From the equation, it can be seen that the transmission delay d l will increase when r l increases.
ere should exist an optimal transmission rate r * l with corresponding d * l , so the selected path could achieve the maximum throughput while the total delay is within delay constraint.To reduce the complexity of obtaining the optimal values, the approximate values for r l , d l are derived.

Mobile Information Systems
According to the equation while the range of r l is (1, c l ].To achieve the approximate values of r l and d l , let r l increase 1 + ϵ each time until it reaches the largest value, where 0 < ϵ < 1 is the parameter used for approximation.en a list of values for r l is obtained, which is shown as 1, 1 erefore, t pairs of (r l , d l ) can be derived.After f(r l ), t pairs of corresponding values (w l , d l ) are generated.Consequently, the approximation ratio to obtain the approximate value of r l is 1 + ϵ.
Different pairs of values for each link can be treated as different weights of corresponding edges between two nodes.Accordingly, there may exist multiple edges between two nodes.en Definition 2 is described.Definition 2. Given a weighted graph G(V, E), there may exist multiple tuples of values for each edge, which can be treated that there are multiple edges with different values between the corresponding nodes.Accordingly, G(V, E) becomes a multiedge-weighted graph.
As each link is represented by an edge of G(V, E), the graph knapsack problem can be reduced to the throughput maximization problem.e study aims to select a set of links over which the maximum throughput can be achieved while the delay constraint is satisfied.

Approximation Scheme for the Graph Knapsack Problem.
As there may exist more than one path from source node s to node i, node i could have different pairs of weighted values (W i , D i ).Let Y denote the set of values.Assume Y is sorted into monotonically increasing order of W i .A procedure trim() (see Algorithm 1) is designed to remove unnecessary values of node i, based on the idea of approximation.e procedure scans the elements of Y in monotonically increasing order.An element is appended onto the returned list Y ′ only if it is the first element of Y or if it cannot be represented by the most recent values placed into Y ′ .e output of the procedure trim() described as Algorithm 1 is a trimmed, sorted list.
Given the trim procedure, a polynomial time approximation scheme can be constructed for the graph knapsack problem, which is described as Algorithm 2. e approximation procedure takes as input a set of values for node u, (in arbitrary order), the delay threshold δ, and the approximation parameter ϵ.Algorithm 2 calls Algorithm 1 to trim the input list.An approximate solution denoted by P within a 1 + ϵ factor of the optimal solution will be returned by the scheme.Lemma 1 is developed to prove that the proposed scheme runs in polynomial time.Meanwhile, eorem 1 is derived to show that there is a polynomial time approximation algorithm for the multiedge graph knapsack problem.Now the running time of trim process will be analyzed.Assume W * is the optimal weighted transmission rate of link l and y.W < y.W * .After trimming, successive elements y and Y ′ of Y have the relationship Y ′ .W/y.W > 1 + ϵ/2n; that is, they differ by a factor of at least 1 + ϵ/2n.us, each list contains possibly the value 1 and up to log 1+ϵ/2n W * values.It can be deduced that the number of elements in each list Y is at most In summary, the overall running time of the algorithm is O(3n 2 m ln W * /ϵ). is bound is polynomial in the size of the input n and 1/ϵ.

□ Theorem 1.
e proposed algorithm for the multiedge graph knapsack problem is a polynomial time approximation scheme with an approximation ratio 1 + ϵ.

Mobile Information Systems
Proof.Let P * denote the optimal solution of the problem.From the proposed scheme, it is easily seen that P ≤ P * .It needs to show that P * /P ≤ 1 + ϵ.After trimming, successive tuples y and y ′ of Y ′ have the relationship y • W/y • W > 1 + ϵ/2n, where n indicates the number of nodes in G. Scan all the edges, find the path from source node s to destination node u, and let Due to the trim process executed at the receiver node of each edge, there exists a (1 + ϵ/2n) factor approximation.
As to v i , there are i − 1 edges between s to v i ; therefore, the approximation ratio should be (1 + ϵ/2n) i−1 .
en, v i reaches v i+1 through edge v i v i+1 , the approximation ratio at v i after trimming should be (1 + ϵ/2n) i−1 • (1 + ϵ/2n), which is equal to (1 + ϵ/2n) i .Since there are totally n nodes, the number of edges on the path is at most n − 1.From the induction of the above procedure, an overall approximation ratio can be expressed as (1 + ϵ/2n) n−1 , which can be presented as below: (5) Now, it needs to show that P * /P ≤ 1 + ϵ, by proving monotonically increasing, which means the function increases with n as it approaches the limit e ϵ/2 .us, the following inequality stands: Combine with P * /P ≤ (1 + ϵ/2n) n−1 , it has and the analysis of the approximation ratio is completes.Combined with Lemma 1, it proves that the proposed scheme is a polynomial time approximation scheme.Input: Q: a list of (W u , D u ) for node u, every u ∈ G.V; δ: the predefined delay threshold Output: approximation solution P (1) INITIALIZE G(V, E), set up the value of (w e , d e ) for corresponding link; end for (9) Trim(Y v ); (10) end for (11) end for (12) return P, which contains the set of links selected ALGORITHM 2: Polynomial time approximation scheme for graph knapsack.6 Mobile Information Systems maximization algorithm is proposed based on the approximation scheme for the graph knapsack problem in this section.
Given n items, the ith item is worth w i and d i pounds in weight.e 0/1 knapsack problem aims to find a subset of items that the total value is maximum while the total weight is limited to a value.Assume that d i is at most δ and the items are indexed in monotonically increasing order of their values, that is, w 1 ≤ w 2 ≤ • • • ≤ w n .eorem 2 is derived to show that the throughput maximization problem with delay constraint is NP-hard.

Theorem 2.
e throughput maximization problem is NP-hard.
Proof.Reduce the 0/1 knapsack problem to the throughput maximization problem.Consider Q is a list of n items, denoted by , where s and t denote the source node and destination node, respectively.For every node v i ∈ V, there is a pair of values (w i , d i ) for every edge (v, v i ) that goes from v to v i for any v ≠ v i and v ≠ t. ere is a pair of values (0, 0), for every edge (v, t) that goes from v to t for any v ≠ s and v ≠ t. e knapsack problem aims to select a subset of items such that  v i ∈U w i is maximized and the total weight  v i ∈U d i ≤ δ.A subset U is a feasible solution for the knapsack problem if and only if there is a path that goes from s to the vertex v i with v i ∈ U and then to t.It is easy to see that the time of construction is in polynomial time.
Select items satisfying the required conditions and add them to the knapsack, which is also the way to select the path for the problem.erefore, if there exists a solution for the knapsack problem, the maximization problem can be solved.Vice Versa, existence of a solution to the maximization problem means there is a solution to the knapsack problem.us, the maximization problem is NP-hard.
After reducing the graph knapsack problem to the throughput maximization problem, a roughput Maximization algoRithm (TMR) is proposed based on Algorithm 2, shown as Algorithm 3. Assume source node s intends to disseminate information to a specific area, an approximate delivery path is desired to achieve the maximum throughput and satisfy the delay constraint.
To be more clearer, a detailed description on how the proposed TMR works on the throughput maximization problem to solve the path-finding issue is presented as follows.
In the initialization process, let N[v] represent a set of v's neighbors.Starting from source node s, execute the following steps to each edge e ∈ G • E.
(1) To find the neighbor vehicles N [v] is with larger transmission rate but also with larger delay compared with Y v (j), a trim procedure will be executed to determine whether to remove an element.
After the iterative operations, paths containing a set of selected links are obtained.If there are more than one path from the source to the destination, choose the one with the largest transmission rate, which is the approximate solution intended to achieve for data delivery.eorem 3 is presented to show that the proposed TMR is a polynomial approximation algorithm with an approximation ratio of 1 + c, where 0 < c < 1.

Theorem 3.
e throughput maximization algorithm can achieve an approximation ratio 1 + c within running time O(n 2 m ln C/c), where m denotes the number of edges, C �  m i�1 c i , c i indicates the transmission capacity of link i.
Proof. e input of the proposed maximization algorithm is w l , d l .As stated in 4.1, the value of w l , d l is within a 1 + ϵ factor approximation of the optimal value.Considering the approximation in the multiedge knapsack problem, the throughput maximization problem should have a quadratic approximation.According to theorem 1, the comprehensive approximation ratio is Assume c � ϵ/3, the inequality Hence, an approximation solution within a 1 + c factor of the optimal solution can be achieved.
As r l ∈ (1, c l ], assume there are m edges on the path.Let C �  i∈m c i , then ∀ y ∈ (m, C).From Lemma 1, it has erefore, the total running time is O(n 2 m ln C/c).□

Performance Evaluation
In this section, simulation settings and results are presented and analyzed.

Simulation Settings.
To evaluate the performance of the proposed algorithm, TMR is implemented and compared with other algorithms.In the simulations, the following default settings are used.e simulations select a 2000 m × 2000 m rectangle street area on the map of Los Angeles and extract the area using openstreetmap [31], the satellite map of which is presented as Figure 3(a).en, Simulation of Urban Mobility (SUMO) [32] is used to convert the extracted area to the road topology layout, shown in Figure 3(b).e realistic mobility trace of vehicles is generated by the open-source microscopic space-continuous and time-discrete vehicular traffic generator package SUMO.SUMO uses a collision-free carfollowing model to determine the speeds and the positions of the vehicles.
e simulations deploy a number of UAVs which can cooperatively form a full coverage of the simulated area.e speed of UAVs varies from 0 to 15 m/s, and the UAVs maintain a constant altitude that does not exceed 200 m during the flight.e random walk mobility model is applied for the UAVs covering the area.Table 2 gives a list of simulation parameters.
e simulations implement the proposed algorithm and two other algorithms which are UVAR [33] and VBN [11], respectively.Extensive simulations are conducted to thoroughly investigate the efficiency of the proposed algorithm in aspect of delivery ratio, throughput, and number of hops when the number of vehicles varies and the deployed UAVs are set to 20.Additionally, the performance of the proposed algorithm with different UAV densities is evaluated when the number of vehicles is set to 300.A comparison between the proposed solution and optimal solution in terms of throughput is also presented.

Impact of Number of Vehicles on Delivery Ratio.
Delivery ratio is defined as the percentage of packets that are successfully delivered, that is, the ratio of the total number of data packets received by the target destinations to the total number of data packets generated from the sources.A higher delivery ratio means better performance.
In Figure 4, the delivery ratio under different number of vehicles for the compared algorithms is compared.As shown in the figure, the evaluated schemes achieve higher delivery ratio when there are more vehicles in the network.Besides, it can be seen that the proposed scheme and UVAR have better delivery ratio, due to the advantage of UAVs that applied to maintain better network connectivity and guarantee a significant accuracy of path selection.VBN mainly chooses the delivery paths based on cooperation among RSUs and vehicles, which cannot be accurate all the time and may select the paths that are not appropriate for data transmission, resulting in lower delivery ratio.

Impact of Number of Vehicles on roughput.
An important performance indicator of the algorithms is the throughput of the path from the source to the destination nodes.
In Figure 5, the throughput is plotted versus the number of vehicle nodes.It is observed that the proposed algorithm TMR outperforms the other two algorithms and has the highest network throughput.When the number of vehicles is 50, the throughput of TMR is 1.48 Mbps, higher than that of UVAR and VBN.It also shows that all the compared algorithms achieve higher network throughput as the vehicle density level increases.As the number of vehicles increases to 400, the corresponding throughput of the three schemes increases to 3.45, 3.3, and 3.1 Mbps.
Input: A list V containing all the vehicles and UAVs in the network Output: e selected path (1) Let N[v] denote the neighbor of node v; (2) a � 0, which denotes the number of executions; (3) for each node v do (4) Send a request to its neighbors; (5) Receive the channel information (CI) from its neighbors; (6) Calculate c l according to CI; (7) Calculate the approximate values of r l and d l according to d l � K/(c l − r l ); (8) Obtain the transmission rate and delay of the link between each neighbor and node v, denoted as (w e , d e ); (9) end for (10)  is is because that with the help of employed UAVs, available delivery paths from the source node to the destination area can be quickly found by avoiding unnecessary transmissions among vehicle nodes, such that the number of hops consumed is smaller.As the network size becomes larger, the number of hops increases for all the compared schemes.
is is mainly because that as the number of vehicles in the target area increases, more hops are needed to deliver the message to the vehicles and complete the dissemination.

Evaluating the roughput Maximization
Algorithm.Figure 7 shows the impact of the number of UAVs on data delivery ratio and delivery delay.As more UAVs participate in data transmission, the delivery ratio increases while the delivery delay tends to decrease.is is because that the UAVs can serve relay nodes in data dissemination when there are no available vehicles to carry and forward the data.When more UAVs participate in the communications, the vehicle nodes could select a better UAV relay with a higher probability, which results in the changing data delivery delay and delivery ratio shown in the figure.
To show the di erence between the proposed solution and the best possible solution, Figure 8 compares the proposed and optimal solutions in terms of throughput, considering the UAV-assisted vehicular environment.Comparing the simulation throughput with the optimal throughput in Figure 8, it can be seen that the simulation result consists with the optimal throughput to a great extent.Meanwhile, the approximation ratio is smaller than 1.1, which also veri es the e ectiveness of the proposed algorithm.Observing the changing trends of the throughput, it is easy to nd that the system throughput improves when the number of nodes in the network increases.

Conclusion
In this paper, e cient data dissemination in cooperative UAV-assisted VANETs is investigated.To optimize the network throughput, this study formulates a network throughput maximization problem to nd the best delivery strategy and select the optimal paths for data delivery, with consideration of the transmission rate of links and the delay constraint for data dissemination.en reduce the graph knapsack problem to the throughput maximization problem, and a polynomial time approximation scheme is proposed to solve the graph knapsack problem.As to the maximization problem, a throughput maximization algorithm is developed based on the approximation scheme.roughput comparison when the number of nodes increases from 50 to 400. 10 Mobile Information Systems eoretical analysis including the approximation ratio and running time of the proposed solution is provided.Finally, simulations are conducted to evaluate the performance of the proposed algorithm.

Discussions and Future Work
While the utilization of UAVs brings significant advantages, it also faces the cost problem.UAV communications are subjected to the additional energy consumption to fly at high altitudes, which is more significant than the communication energy consumption due to signal processing.Nevertheless, the limited on-board energy due to high propulsion energy consumption of UAVs poses critical limits on their communication performance and endurance.
It can be seen that there exists a fundamental tradeoff between the achievable utility benefit and system cost in UAV-assisted communication networks.Using UAVs can increase the network throughput and improve quality of service, which is important to users, especially to the applications with high quality of service requirements.Although the use of UAVs increases the system cost, UAVs have significant advantages over common roadside infrastructure.
e tradeoff between the benefit and cost can be achieved by energy-efficient design to enhance the performance of UAVassisted communication, which is a promising future work direction, such that the deployment and trajectory of UAVs can be carefully designed to save the energy consumption and improve the quality of communications (improved transmission rate and transmission delay).
Despite the contributions presented in this work, many challenges remain to be solved by academia and industry.Future work will focus on the frequent handover problem and interference caused by the high mobility of UAVs and vehicles.Also, energy efficiency of the UAVs remains a relevant topic to be explored to achieve fully utilization of UAVs and improve data dissemination in cooperative network.Further, the integration of the proposed architecture with the concept of SDN and the development of envisaged applications which can adapt to more complicated scenarios might be considered as another future work direction.

Figure 1 :Figure 2 :
Figure 1: An overview of the cooperative air-to-ground network architecture.
and t is the largest integer satisfying the inequality, which means (1 + ϵ) t+1 > c l .d l may have different values according to different r l calculated by d l � K/(c l − r l ), satisfying the condition that d l ∈ (K/(c l − 1), δ].

Lemma 1 .
e algorithm for the multiedge graph knapsack problem runs in O(3n 2 m ln W * /ϵ) time, where m, n denotes the number of edges and vertices of graph G, respectively.Proof.It will show that the running time of the proposed scheme is polynomial in both 1/ϵ and the size of the input.e first part of the algorithm runs in time O(nm), since the initialization in line 1 takes Θ(n) time, each of the |V| − 1 passing over the edges takes Θ(m) time, where n � |V|, m � |E|.

Figure 6 :
Figure 6: Consumed hops comparison when the number of nodes increases from 50 to 400.

Figure 7 :
Figure 7: Impact of number of UAVs on data delivery e ciency, illustrated by delivery ratio and delivery delay.

Figure 8 :
Figure8: roughput comparison between the proposed solution and the optimal solution, the approximation ratio re ects the throughput di erence and the e ectiveness of the proposed solution.

Table 1 :
Variables used in this paper.
To obtain the links connecting node v with its neighbors, calculate the channel capacity c l according to the channel condition.en,get the transmission rate and delay of the corresponding link.Calculate (W v , D v ) ∈ Y v for v by adding the value of its neighbor u, (W u , D u ) with (w e , d e ) of the corresponding link e. erefore, a list Y v for v can be achieved.(3)According to the previous step, several pairs of values (W v , D v ) may exist for node v. First, remove the values with delay that are larger than the delay threshold δ. en, if there are values with the same delay, keep those with larger transmission rate.Also, remove the values with larger delay and smaller transmission rate.In the following case, such as for node v, exchange information and obtain the corresponding values (W N[i] , D N[i] ) of the neighbors.(2) (11)at(11)Calculate the transmission rate and delay for each node v in the network, by adding the neighbor's corresponding values to (w e , d e ), denoted as W v , D v ; (12) Apply the trim procedure to remove unnecessary values of node v;