Game-Based Channel Selection for UAV Services in Mobile Edge Computing

Computation offloading is a hot research topic in mobile edge computing (MEC). Computation offloading among multiedge nodes in heterogeneous networks can help reduce offloading cost. In addition, the unmanned aerial vehicles (UAVs) play a key role in MEC, where UAVs in the air communicate with ground base stations to improve the network performance. However, limited channel resources can lead to the increase of transmission delay and the decline of communication quality. Effective channel selectionmechanisms can help address those issues by improving transmission rate and ensuring communication quality. In this paper, we study channel selection during communication betweenmultiple UAVs and base stations in anMEC systemwith heterogeneous networks. To maximize the transmission rate of each UAV user, we formulate a channel selection problem and model it as a noncooperative game. %en, we prove the existence of Nash equilibrium (NE). In addition, we design a multiple UAV-enabled transmission channel selection (UTCS) algorithm to obtain the equilibrium strategy profile of all the UAV users. Experimental results validate that UTCS algorithm can converge after a finite number of iterations and it outperforms random transmission algorithm (RTA) and sequential transmission algorithm (STA).


Introduction
With the development of mobile computing, the number of mobile users has soared. However, the network resources are limited. To reduce the communication and computing delay, MEC is proposed as a promising paradigm [1,2]. e computing and storage capacities are provided at edge nodes, providing services for terminal tasks and effectively reducing data processing delay and energy consumption of terminal devices. 5G-enabled MEC [3] is composed of heterogeneous base stations, where small cell base stations (SBSs) [4] reuse the channel resources with macro cell base station (MBS). Users of MBS and SBSs use the same set of communication resources for data transmission, which can effectively improve the utilization of communication resources.
With the development of unmanned aerial vehicles (UAVs) technologies, smaller and cheaper UAVs are available. As such, UAVs are no longer only used in specific fields (such as military domain) but are also introduced into civil networks [5]. Due to the high altitude flight characteristics, UAVs have been used as a key component of MEC [6], providing aerial network services As the number of computing tasks brought by UAVs increases, the demand for communication resources keeps growing. However, channel resources for data transmission are still heavily limited. When the number of users is too large, the channel is multiplexed by deploying micro base stations. In this case, channel multiplexing would bring the issue of interference, resulting in a decrease in the transmission rate. In order to improve the data transmission rate, it is necessary to design a reasonable channel selection strategy to reduce the interference of data transmission. An effective channel selection strategy can improve the utilization of channel resources and ensure the high transmission rate of each computing task.
During network rescue, UAVs can communicate with user terminals and base stations. Many existing studies merely focused on the communication between UAVs and users, and the communication between multiple UAVs and ground base stations had not been explored widely. us, we study the communication between UAVs and ground base stations. UAVs transmit computing data to the aerial and ground integrated wireless networks and process the task with the help of the computing resources [12][13][14] of the ground base stations. As the number of tasks increases, the transmission rate [15] of each task decreases. at is mainly because of the interference caused by channel multiplexing. Considering the characteristics of tasks, each computing task [16] selects the appropriate channel for data transmission, which can effectively reduce channel multiplexing. We aim to propose a multi-UAV transmission channel selection method to achieve distributed and high-quality data transmission.
In this paper, aiming at the offloading problem in edge computing, we study channel selection strategies for communication between multi-UAV users and base stations in MEC with heterogeneous networks. e following are our main contributions: (i) We consider the MEC with heterogeneous networks consisting of an MBS and multiple SBSs, in which each SBS is assigned an MEC server to provide computing resources for relevant users. In addition, the UAV users of SBSs use channel resources of the MBS for data transmission. We formulate a noncooperative game to model and analyze the multi-UAV channel selection problem in such a heterogeneous network. In particular, our model considers the interference generated by channel multiplexing to ensure high-quality data transmission for each UAV user. (ii) e channel selection strategies of users are coupled to each other, so it is difficult to optimize each user's data transmission rate simultaneously. To show the existence of Nash equilibrium (NE) in the formulated noncooperative game, we prove its equivalence to an exact potential game which has at least one NE. (iii) To reach the NE of the formulated noncooperative game, we propose a multiple UAV-enabled transmission channel selection (UTCS) algorithm. e proposed algorithm operates in a completely distributed manner; that is, each user does not know the selection strategies of other users and independently adjusts the channel selection strategy according to their prior experience. (iv) We perform extensive numerical simulations to validate UTCS algorithm to compute the NE solution (i.e., the equilibrium channel selection and the equilibrium data transmission rate of each user). Experimental results show that UTCS algorithm can converge quickly through a finite number of iterations. Compared with the random transmission algorithm (RTA) and sequential transmission algorithm (STA), UTCS algorithm can obtain NE channel selection strategy and each user can achieve a higher data transmission rate. e remainder of this paper is organized as follows. In Section 2, we briefly summarize the related researches. We model the communication between UAV users and base stations based on heterogeneous network scenarios and establish a noncooperative game model in Section 3. In Section 4, we analyze the existence of NE and design UTCS algorithm to obtain the equilibrium strategy. In addition, we illustrate the performance of UTCS algorithm through parameter analysis, convergence analysis, and comparison experiments in Section 5 and conclude our research work in Section 6.

Related Work
Some researches have been done on UAV network [17][18][19][20][21][22]. Zhang et al. [19] considered two communication modes, UAV to base station and UAV to UAV, and divided the problem into three subproblems to optimize channel allocation and UAV speed, respectively. Gu et al. [20] considered the network scenario of multi-UAV collaborative work in the context of environment awareness and studied resource allocation and task scheduling of multi-UAV collaborative work based on reinforcement learning. Berate et al. [21] regarded the UAV as an aerial base station and proposed a user cache framework involving multiple UAVs.
en, QoE was used as an indicator to study the user cache for multiple UAV base station deployment and the optimal caching strategy was obtained. Zhao et al. [22] considered a network scenario in which UAV and base station (BS) cooperate to serve ground users and investigated the problem of transmission rate optimization through joint optimization of UAV trajectory and NOMA precoding. e above studies introduced UAVs into different network scenarios to optimize the data transmission process from different perspectives of network communication. We introduce UAVs into 5G heterogeneous networks, which can greatly improve the power efficiency and spectrum efficiency.
Channel selection is a hot issue in mobile network communication. Gour et al. [23] selected underlay D2D network and proposed D2D channel allocation and power allocation schemes with and without quality of service constraints and transformed the problem into a nonconvex mixed-integer nonlinear programming problem. Shattal et al. [24] focused on a new vehicle-mounted ad hoc network (VANETs) architecture in which nodes continuously and autonomously selected one of three channel selection strategies and applied evolutionary games to solve the channel selection problem. Ko et al. [25] studied the joint optimization problem of LTE channel selection and frame scheduling to maximize LTE throughput and proposed heuristic algorithms to solve the problem. However, these works do not consider the cochannel transmission interference factor, which leads to the result that channel selection is not accurate enough.
us, considering the transmission interference in the same channel, we study the channel selection decision problem for multi-UAV users.
Game theory is a good tool to solve the problem of multiplayer competitive decision-making. Cui et al. [26] considered the dynamics and uncertainties in the environment for modeling, constructed the long-term resource allocation problem as a stochastic game to maximize the expected return, and applied the reinforcement learning theory to design algorithm to solve the problem. In order to solve the optimization problem of relay selection in UAV network, Liu et al. [27] proposed a matching game classification method based on the competitive relationship between players and constructed a basic preliminary model of UAV relay model. However, these works do not highlight the competitive relationship between users. In this paper, we consider the communication resource competition in the process of data transmission by multi-UAV users, and a noncooperative game model is established to describe the channel resource competition in multi-UAV communication.
is paper considers a heterogeneous network of multiple types of multiple base stations communicating with multiple UAV users. Since different base station users select the same channel for transmission and cause interference, we apply the noncooperative game method to construct the communication model between users and base stations. In order to maximize the transmission rate of each user, a scheme of user equilibrium channel selection strategy is presented. en, we prove the existence of equilibrium strategy. Combining the strategy selection probability, we design UTCS algorithm to ensure that multi-UAV users can obtain their own equilibrium strategy in the 5G heterogeneous networks. Finally, the convergence and effectiveness of UTCS algorithm are verified by experiments.

The UAV-Enabled Heterogeneous Network
Model for MEC and the Game Problem Formulation

UAV-Enabled Heterogeneous Network for MEC.
We consider a UAV-enabled heterogeneous network as shown in Figure 1, where multiple SBSs are distributed in the macro cell. Each SBS owns an MEC server to provide computing resources for corresponding users. In this network, the UAV user transmits the data to the base station for task computation. e set of UAV users is defined as N � 1, 2, . . . , N { }, which can be further divided into macro cell base station users (MUs) and small cell base station users (SUs). Specifically, the MU is the user outside of the service area of the SBS and is served by MBS, and the set of MUs is defined as N 0 � 1, 2, . . . , N 0 . In addition, the set of SUs is e main symbols are given in Table 1.
We define the channel set of the MBS as S � 1, 2, . . . , S { }. SBSs are deployed in the service area macro cell and the channels of the MBS are reused for data transmission. In addition, with dense deployment of SBS, it is considered that the number of users in each base station is no more than the number of channels, and users within the same base station do not need to reuse channels.
Due to the multiplexing of channels, there is interference when the user transmits the data, including the interference from MUs to SUs and the interference from SUs to SUs. Interference affects the data transmission rate, and users obtain efficient data transmission strategies by the analysis of the interference. d i denotes the channel selection strategy of user i, and d i ∈ S. e channel selection strategy profile of all the users is represented as D � d 1 , d 2 , . . . , d N . In addition, the users in the same base station do not reuse the channels; that is, When data is transmitted from UAV user to base station, the UAV is in a hover state and U i � [x i , y i , H i ] T represents the location information of user i. e data transmission rate is affected by the channel gain which is related to the distance  Different from ordinary users on the ground, the UAV and the base station may transmit data through the LoS link [28]. e path loss for LoS link transmission α LoS in and the path loss for NLoS link transmission α NLoS in between UAV user i and base station n are expressed, respectively, as follows: where ε LoS represents the average value of excessive path loss in LoS link and ε NLoS is the average value of excessive path loss in NLoS link. f c is the carrier frequency, c is the speed of light, and ‖Q n − U i ‖ is the distance between UAV user i and base station n. It is worth noting that base station n is either an MSB or an SBS. e probability of data transmission between UAV user i and base station n through the LoS link is given as where b and ς are environmental parameters determined by the deployment environment of UAVs. arcsin( is the horizontal angle between UAV user i and base station n. e average path loss between UAV user i and base station n is denoted as and the channel gain h 0 i of MU i can be calculated as follows: where ‖Q 0 − U i ‖ represents the distance between MU i and MBS, and β 0 is the channel power of MBS. e channel gain h m i between SU i and SBS m is defined as where ‖Q m − U i ‖ is the transmission distance between SU i and SBS m. β m represents the channel power of SBS m. Based on the channel gain and multiplexing, users can attain the channel selection strategy. We define a s i to indicate whether user i selects channel s for data transmission. Specifically, when user i selects channel s for data transmission, a s i � 1; otherwise, a s i � 0. e SINR of MU i is given as follows: where p i and p j are the transmission powers of MU i and SU j, respectively. σ 2 represents the noise power. m∈M m j∈N m h m j p j a s j represents the interference to MU i by SU j that selects the same channel for transmission as MU i.
For SUs, we calculate the interference between the MBS and the SBS and the interference between SBSs. e SINR of SU i is as follows: where j∈N o h 0 j p j a s j denotes the interference of MU j to SU i, and m∈M m j∈N m h m j p j a s j represents the interference of other SBS users to SU i.
Each user considers the impact of strategies for other users on themselves when obtaining channel selection strategy. We define the channel selection strategy profile of all the users other than user i as follows: When user i selects channel s for data transmission, the transmission rate of user i can be given by where B is the bandwidth of the base station. e bandwidth of each base station is the same, and the base station allocates the bandwidth resource evenly to each user in its coverage. In addition, not only is the data transmission rate affected by  Security and Communication Networks the user's own channel selection strategy, but also it depends on other users' strategies.

e Game Problem Formulation.
In this model, we regard UAV users as rational users. Each user is a selfish decision-maker, and they only care about their own benefits. Specifically, UAV users select the channel with the highest data transmission rate, considering their locations and channel reuse. erefore, the objective function of user i is given as follows: Because of channel multiplexing, the channel selection strategies of users affect each other. If there are too many users selecting the same channel, interference will be more severe among these users, and the transmission rate of these users will decrease. erefore, each user prefers to select the channel selected by a small number of users in pursuit of the maximum transmission rate. However, in real multiuser data transmission, users do not know the channel selection strategies of other users when making the transmission decision. erefore, users are independent decision-makers and can only follow their prior experience to make decisions.
In the process of pursuing their own benefits maximization, users form a competitive relationship with each other. We can describe this process of user channel selection as a noncooperative game model. en, the game can be formulated as follows: where N is the set of game players, which include both MUs and SUs. S is the feasible strategy profile of the participant and also is the channel set. Note that the set of feasible strategies is the same for each user. In addition, R i is the benefit of participant i's strategy, which is the transmission rate. e transmission rate of each user varies as the channel selection strategy changes. In Γ, each user adjusts its channel selection strategy to gain more benefits. After a certain number of iterations, all the users reach a state in which they can no longer improve their benefits by changing their strategies. erefore, all the users would keep the strategy unchanged. In this state, there is an equilibrium between all the users.

Analysis of Nash Equilibrium and Decision
Algorithm Design

Proof of the Existence of NE.
We consider an equilibrium state. Since users can no longer improve their own benefits by adjusting their strategies in this equilibrium state, the channel selection strategies of all users are no longer changed. e equilibrium state makes a tradeoff among all the users; that is, considering the satisfaction of all the users, the benefits of each user reach the relative best state. In other words, the equilibrium state satisfies the goal of maximizing the transmission rate of each user in Γ, which is called a Nash equilibrium (NE), and we give the definition of the NE of Γ in the following.
Definition 1. For a noncooperative game Γ, if there is a channel selection strategy profile D * � d * 1 , d * 2 , . . . , d * N , and no user is willing to unilaterally change its channel selection strategy to improve its transmission rate in this environment, then and D * is the NE strategy profile of Γ. When all the users select the equilibrium strategy for data transmission, they have no intention to further change the channel selection strategy because each user reaches the maximum benefit in the current environment. erefore, NE is a stable state and a solution that satisfies the goal of Γ. In the following, we discuss the existence of NE. e exact potential game has the finite improvement property (FIP), which indicates that the exact potential game can reach NE through a finite number of iterations. erefore, we further prove the existence of NE in Γ by proving that Γ is an exact potential game. en, we give the definition of the exact potential game.
where Γ is an exact potential game, and there is at least a pure strategy NE. In Γ, we consider two types of users and they are independent of each other. erefore, we prove the existence of NE for two kinds of users, respectively. In the following, we first analyze the case for MU.

Lemma 1. For Γ, if user i is the MU, there is a function
Φ 0 (D) as follows: so that the relationship between Φ 0 (D) and R i (d i , d −i ) satisfies the following condition: Proof. When user i is an MU, the utility function of MU i, that is, the transmission rate, can be given as We analyze the change of the utility function when the channel selection strategy of user i changes. Specifically, when the transmission channel of user i changes from channel s to channel s ′ , that is, from a s i to a s′ i , the utility function changes as follows: en, we show that when the channel selection strategy of user i changes, the potential function changes as follows: rough the above proof, we can obtain Lemma 1. □

Lemma 2. In Γ, when user i is a small cell base station user, there is a potential function Φ m (D):
When the channel selection strategy of user i changes, the following condition is satisfied between Φ m (D) and Proof. If user i is an SU, according to (9), the utility function of SU i is equivalent to where ξ 0 � j∈N 0 h 0 j p j a s j represents the interference of MUs to user i.
When the channel selection strategy of user i changes, the utility function changes as follows: e relationship between the potential function and the utility function when the channel selection strategy changes is as follows:

Security and Communication Networks
We can obtain that when user i is an SU, Γ is an exact potential game. us, Lemma 2 is proved.
As shown above, we respectively prove the relationship between the potential function and the utility function of two types of users. Now we give the potential function of the whole Γ. □ Theorem 1. Γ is an exact potential game, and the potential function is as follows: Proof. In Γ, we consider two types of users, MUs and SUs. In Lemmas 1 and 2, we theoretically prove whether the game is an exact potential game in two user cases. In addition, the two types of users are independent of each other. Specifically, a user will not be both an MU and an MU. us, the feasible strategy profiles of users do not affect each other. From Lemmas 1 and 2, we derive the following: According to the above proof, we conclude that, for each user i ∈ N, when the channel selection strategy changes, the utility function and potential function satisfy the equality relation given in (25). us, Γ is an exact potential game, and eorem 1 is proved.

Multiple UAV-Enabled Transmission Channel Decision Algorithm (UTCD).
Because the exact potential game has FIP property, users in Γ can reach NE after a finite number of iterations. We design the algorithm to reach the NE. Specifically, we consider how the user selects the transmission channel in each iteration. Designing a proper channel selection mechanism can make the game reach equilibrium quickly. How to design effective methods to solve network game problems has been widely studied, for example, adopting the idea of best response. In the idea of best response, each user, in accordance with the principle of maximizing their own interests, continuously adjusts their strategies in each iteration to achieve equilibrium. Each user obtains the channel selection strategy by traversing its feasible strategy profile based on the current game environment and calculates the benefits respectively for comparison. It is a strategy iterative process. In each iteration, the user calculates the benefits of all the feasible strategies to obtain the most profitable strategy.
In the above solution process, the default strategies of different users can be the same. However, in the heterogeneous network, users of the same base station service cannot reuse the same channel for data transmission. erefore, we introduce the channel selection probability and combine the channel selection probability with the best response [29] idea to design our channel selection algorithm. Considering the particularity of the scenario, we propose a channel strategy selection algorithm applicable to the UAV-enabled game model in Algorithm 1, that is, the multiple UAV-enabled transmission channel decision (UTCD) algorithm. Specifically, each user's feasible strategy profile is accompanied by a selection probability vector. For a user, the probability of selecting the feasible channel to transmit is different, and the change of probability is related to the corresponding benefits of the strategy. If the corresponding benefits of the selected strategy are higher, the selection probability of the strategy will be higher. However, the total probability for each user's feasible strategy profile is constant. e specific update method of the strategy selection probability is shown in Algorithm 2.
e probability of all users' strategy selection constitutes the strategy selection matrix, which is used to update the game environment. e selection of probability affects the benefits. When we update the game environment, we select the strategy with the maximal probability and at the same time ensure that users of the same base station service select different strategies. e main steps of UTCS algorithm are given as follows: Strategy selection probability initialization. Each user's strategy profile corresponds to a selection probability vector. erefore, the number of elements in the vector is equal to the number of elements in the strategy profile. e probability is initially evenly allocated to each strategy. e probability vector of initial strategy selection for user i is given as follows: Game environment initialization. For users of a base station service, we assign the initial channel strategy to users in order of channel set S. e initial channel selection for users of base station m is E m (t � 0) � 1, 2, . . . , N m , E m ⊆S. e initial channel selection of all the users constitutes the initial game environment. Benefit calculation. In each iteration, the user adjusts the strategy selection probability by calculating the benefits. e benefits are calculated by (9), and the current game environment is considered in the calculation process. In addition, in order to satisfy the fairness and simultaneity of the user's decision, the user, after adjusting the probability vector, keeps the current strategy unchanged until the game environment updates. Selection probability update. e selection probability is updated according to the calculated benefits of the selected strategy. e probability after the update is related to the current benefits and the probability before the update, and the specific calculation method is as follows: where ε is the update step size applied to control the overall change rate of the probability and e t d i is the unit vector whose d i -th element is 1. In addition, r t i is the utility, which is obtained by r t i � η i R t i , and η i ≤ 1/max R i . After the selection probability of all the users has been updated, the next iteration is entered. Game environment update. e user follows the principle of selecting the strategy of maximum probability; that is, However, in the base station m, in order to avoid channel multiplexing, the users' strategy selections are different. Specifically, N m elements are selected with different rows and columns in the following probability matrix: e probability of the strategy is generated according to the amount of the strategy benefits, and the greater the benefits are, the greater the probability increases. erefore, when the maximum selection probability strategies of different users are the same, the user strategy with the highest probability value is selected. Termination. Each user selects a transmission strategy for each iteration. When the selections of all the users are no longer changed, the iteration ends, and the strategy profile is the desired equilibrium strategy profile as follows: As shown above, all the users select strategies in the same game environment, which can ensure the synchronization of all the user decisions, and more in line with the reality.

Performance Evaluation
In this section, we first analyze the influence of the values of two parameters on the model and the convergence of the algorithm.
en, the performance of UTCS algorithm is analyzed by comparison experiment.
(1) Initialize: N, S, U i , Q m , Q 0 , β 0 , β m , α 0 , α m , p i , p j , σ 2 , a s i , B; (2) According to channel set S, UAV users of the same base station service are allocated the initial transmission channel in order; (3) e initial game environment is constituted as follows, E(t � 0) � (d 1 , d 2 , . . . , d N ) ; (4) e update strategy probability matrix of users is obtained from Algorithm 2 as follows, E(t) � (5) According to E(t), the strategy with the highest probability of each UAV user i is selected to form a new game environment. (6) e selection method is, d i (t) � arg max P(d i , d −i ), ∀d i ∈ S; (7) While satisfying the maximization principle, the selection of the channel selection strategy should ensure that there is no channel multiplexing between users of the same base station service; (8) When all the users' selection strategies no longer change, the update is stopped; Output: e equilibrium strategy profile, ALGORITHM 1: e multiple UAV-enabled transmission channel selection (UTCS) algorithm.
(1) e strategy selection probability of each MU i is set according to the fairness principle, P i (t � 0) � ((1/S), (1/S), . . . , (1/S)); (2) for each MU i ∈ N do (3) e transmission rate are calculated according to 10; (4) Each UAV user's interference calculation is based on the current game environment; (5) e following is applied to update the stratgy selection probability of user i; P i (t + 1) � P i (t) + εr t i (e t d i − P i (t)); (6) end for (7) e probability of strategy selection for all UAV users constitutes the strategy selection matrix; Output: e update strategy probability matrix of users is, E(t) � ALGORITHM 2: e transmission strategy update algorithm.

Security and Communication Networks
We consider that there are 50 UAV users in a UAV-enabled heterogeneous network with one MBS and 5 SBSs deployed. e user's location information is randomly generated within the service scope of the base station to which it belongs. For ease of calculation, we assume that all the UAVs hover at a fixed height of 20 m. For the MBS, the altitude is 10 m, and the altitude of SBSs is 5 m. e power gain of the MBS is −50 db, and the power gain of the SBSs is generated randomly from [−30, −40] dB. In addition, the transmission power for each UAV user is randomly generated from [100, 500] mW. e bandwidth is 5 MHz, and the bandwidth is evenly distributed among users. e background noise is −100 dbm. e initial values for the main parameters are set in Table 2.

Parameter
Analysis. α is the path loss exponent in data transmission between the UAV user and the base station. We set different path loss exponents (i.e., 2.5, 2.6, and 2.7) to analyze the effect of path loss on the transmission rate. Figure 2 shows the impact of path loss exponent on channel gain at different transmission distances. We can see that, with the increase of distance, the overall transmission rate tends to decline, and the greater the distance is, the slower the transmission rate decreases. When the transmission distance is the same, the smaller α is, the higher the channel gain will be. In addition, Figure 3 shows the impact of path loss exponent on the channel gain with base stations of different channel power gains. Obviously, when the channel power gain increases, the channel gain tends to increase. e smaller the loss coefficient is, the higher the channel gain is. According to Figures 2 and 3, the higher the path loss is, the slower the channel gain is. e path loss index is determined by the actual environment in which the data is transmitted. Specifically, when the transmission environment is complex, the transmission loss will be high. Conversely, in a simple environment, the transmission loss is low; and the channel gain affects the transmission rate, so that the UAV hover location would be selected in a simple transmission environment to increase the transmission rate. Figure 4 shows the impact of the update step size on the change rate of the selection probability of a strategy. ε is the update step size of the selection probability for the strategy. As the transmission rate increases, the selection probability increases from 0.1. Moreover, the larger the update step size is, the faster the probability grows. In Figure 5, we show the effect of update step size on the convergence rate of transmission rate. As can be seen in the figure, with the increase of the number of iterations, the transmission rate of the UAV user gradually reaches a convergence state. e reason is that all users obtain the optimal transmission rate. Meanwhile, no user has the will to change strategy. e transmission rate's convergence is the fastest and the user's transmission rate is the largest when all situations have converged after 11 iterations, which is 0.6. erefore, the selection of the appropriate update step size can affect the convergence rate and the user's equilibrium rate simultaneously. Figure 6 shows the convergence of user transmission strategies.

Convergence Analysis.
ree users (i.e., user 9, user 33, and user 39) are randomly selected to observe the trend of their channel selection strategies as the number of iterations increased. eoretically, we prove the existence of NE in the game in Section 3. erefore, it can be seen from Figure 6 that the channel selection strategies of these users show a convergence trend after a finite number of iterations. Specifically, user 9 converges after 4 iterations and finally selects channel 5 for data transmission. In the 10th iteration, the channel selection strategy of user 33 no longer changes. After 8 iterations, user 39 selects channel 6 for data transmission. Due to the different transmission power, channel gain, and other factors, different users finally reach the convergence of the strategy after different iterations.
When the user's channel selection strategy no longer changes, the user's transmission rate may not reach the constant state due to the influence of other users' adjustment strategies. erefore, the convergence state of user's transmission rate can show the convergence rate more accurately. Figure 7 shows the convergence of user transmission rates. We randomly select user 2, user 21, and user 45 from 50 UAV users and analyze the change of user transmission rate as the number of iterations increases. As shown in Figure 7, with the increase of the number of iterations, the user's transmission rate gradually converges and eventually remains unchanged. Specifically, after 8 iterations of the UAV user, the user's transmission rate no longer changes. Due to channel multiplexing, there is interference between users. erefore, the strategy changes of other users can affect the transmission rate of the user, and the transmission rates of all the users converge at the same time.

Comparison Analysis.
Compared with other algorithms, we analyze the performance of UTCS algorithm. In the random transmission algorithm (RTA), each UAV user randomly selects a channel for data transmission. In the sequential transmission algorithm (STA), each user allocates channel resources in the order of the channel set. In UTCS algorithm, users adjust channel selection strategies with the goal of maximizing their own transmission rate and obtain the final channel selections through multiple iterations. Figure 8 shows the change in the total transmission rate for all the users as the number of iterations increases. Obviously, the total transmission rate of RTA is unstable due to its randomness, and the STA's fixed channel selection strategy keeps the total transmission rate unchanged. Notably, the transmission rate obtained by UTCS algorithm converges after 11 iterations. In the convergent state, we compare the total transmission rates obtained by three algorithms. Specifically, the total transmission rate obtained by UTCS algorithm is 2.15% higher than the maximum transmission rate of RTA and 4.3% higher than STA. It is worth noting that the random channel selection of RTA has a certain probability to make the total transmission rate reach the global optimal state; that is, the total transmission rate obtained by RTA is higher than that of UTCS algorithm. However, the probability of applying RTA to reach the global optimal transmission rate is very low and the global optimal state does not consider the maximum of each user's transmission rate, which does not conform to the goal of the model. erefore, UTCS algorithm can perform the best. In heterogeneous network, there is no channel multiplexing among MUs. erefore, channel interference does not exist among MUs. e data transmission of SUs needs to reuse channel resources, so we analyze the impact of the number of SBSs deployed on the average transmission rate of users. In Figure 9, with the increase of the SBSs, the average transmission rate of users shows a downward trend after rising first. Because the number of MUs is less than the number of channels, channel resources are underutilized. When SUs reuse channel resources for data transmission,   some channel resources are idle. erefore, the interference is relatively low, and the average rate of users increases. In peak state, channel reaches saturation of resource utilization, and interference increases at a faster rate, so the average transmission rate of users goes down. When the number of SBSs reaches 5, the average transmission rate obtained by UTCS algorithm is 7.36% higher than that of RTA and 4.63% higher than that of STA. erefore, UTCS algorithm can effectively reduce interference and slow down the rate of decline of the average transmission rate.

Conclusion
In this paper, we study the channel selection for 5G heterogeneous networks to maximize the transmission rate of each user. We apply a noncooperative game method to construct the communication model and prove the existence of NE. A multiple UAV-enabled transmission channel selection (UTCS) algorithm has been proposed to obtain the equilibrium strategy profile of all the UAV users. Experimental results demonstrate that the UTCS algorithm can converge and can perform the best.

Data Availability
e data used to support the findings of this study are available from the corresponding author upon request. The total transmission rate 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 RTA algorithm STA algorithm UTCS algorithm

Conflicts of Interest
e authors declare that they have no conflicts of interest.