Interference Mitigation Framework for Cellular Mobile Radio Networks

For today’s cellular mobile communication networks, the needed capacity is hard to realize without much more of (expensive) bandwidth. Thus new standards like LTE were developed. LTE advanced is in discussion as the successor of LTE and cooperative multipoint transmission (CoMP) is one of the hot topics to increase the system’s capacity. System simulations often show only weak gains of the signal-to-interference ratio due to high interference from noncooperating cells in the downlink. This paper presents an interference mitigation framework to overcome the hardest issue, that is, the low penetration rate of mobile stations that can be served from a cluster composed of their strongest cells in the network. The results obtained from simulation tools are discussed with values resulting from testbed on the TU Dresden. They show that the theoretical ideas can be transferred into gains on real systems.


Introduction
From the beginning of mobile radio communication there is a constant effort to reduce intercell interference to an acceptable level.Since the success of the worldwide deployed Global System Mobile (GSM) [1] the enormous increased number of users together with over proportionally increasing traffic demands have put this topic into the main focus of research.
Wireless transmitters generally spread power over large areas, and at very low frequencies even worldwide communication is possible [2].In current cellular deployments, carrier frequencies ranging from a few hundred MHz to several GHz are being used.At these frequency bands, the radiation of a single cell will typically cover smaller areas, but cell diameters of 100 km and more are still possible in rural areas.In urban areas intersite distance (ISD) of few 100 m to few km are quite common, leading to a so-called interference limited scenario, where the achievable performance is mainly bounded by the intercell interference (ICI) and not the receiver noise [3].
Interference power decreases with distance, which in the beginning of cellular communications has been the main means for overcoming intercell interference, leading to the well-known frequency reuse concept.Important enhancements were the introduction of frequency reuse 1 systems by code division multiple access (CDMA) in the so-called 3rd generation of mobile systems.On the downside the performance of this scheme suffers from the near far effect and a variable soft capacity of the system [4].
In the standardization of the current 3GPP standard named LTE, CDMA was replaced in the downlink by orthogonal frequency division multiple access (OFDMA), being again a frequency reuse 1 system, but providing additionally multiuser scheduling gains by fast adaptation to frequency selective channel conditions [5].This concept allows also fast adaptation to interference conditions and thereby approaches the performance of the so-called inter-cell interference coordination (ICIC) schemes.ICIC harmonizes resources usage between cell edge and cell centre users over neighbouring cells [6] depending on their mutual interference.Note ICIC has generated also a lot of interest under the name of fractional frequency partitioning (FFP) with a different power allocation for cell center and cell edge users on different frequency subbands.

International Journal of Antennas and Propagation
In long-term evolution (LTE) Release 10 and now 11 there is a lot of discussions about the so-called enhanced ICIC (eICIC) schemes in the context of heterogeneous networks comprising pico-and macro-or femto-and macro-stations.By transmitting the so-called almost blank subframes (ABS)-almost blank because the common reference signals (CRS) have to be transmitted continuously for backward compatibility-in time domain subframes with different level of interference are generated, providing some options for enhanced scheduling.
In 2000, Baier et al. [7] proposed the idea of cooperative transmission systems-also known as network multiple input multiple output (MIMO) or for example in 3rd generation partnership project (3GPP) as cooperative multi point transmission (CoMP).This is a much more fundamental solution, partly being inspired from the research in the area of MIMO systems.Many groups predicted large theoretical performance gains with spectral efficiencies being several factors higher than that of conventional mobile radio systems.For example for 2 and 4 antennas per cell (BS) and mobile station (MS) 10th of bits/s/Hz/cell have been promised at signal to noise and interference ratios (SINRs) of less than 10 to 20 dB [8][9][10].
Common to all these approaches is that a network wide precoder over all cells including all MSs is assumed, which eliminates any interference after proper precoding.In practical mobile radio systems, network wide precoding is unrealistic as networks might span a whole city or even a whole country.Neither an according backhaul network can be implemented, nor channel estimation as well as the reporting of all channel components is feasible.For that reason it is necessary to partition the network into so-called cooperation areas (CAs) connecting a limited number of cells.Unfortunately, this step turns the interference free system back into an interference limited one, because of the remaining intercooperation area interference.For quite sometime a lot of research concentrated on the area of clustering [11], applying different types of optimization algorithms to this nonconvex optimization problem.
Despite the research effort, there is still a significant gap between the theoretical predicted gains and the results observed from system level simulations so far [12].One main activity in this area is the so-called 3GPP LTE CoMP study item, which started in LTE Release 10, has been re-invoked for LTE Release 11 and finally has been converted into a 3GPP LTE working item in September, 2011 [13].While CoMP includes many different kinds of cooperation like coordinated scheduling (CS), coordinated beamforming (CB), dynamic or fast cell selection (DCS), and so forth, in our contribution the focus is mainly on joint precoding (JP) or more specifically on joint transmission (JT) CoMP.JT CoMP is generally accepted to be the scheme with the highest potential gains, as signals being interferers in noncooperative scenarios are turned into useful signal power for JT CoMP.This is the main benefit over pure interference avoidance strategies such as CS/CB.Reference [14] contains the best so far achieved results for the main 3GPP CoMP schemes.In case of JT CoMP the best performance for a 4 × 2 MIMO system doing joint transmission over 9 cells was 4.028 bit/s/Hz/cell, being far away from the projected 10 bit/s/Hz/cell from theoretical analysis.
So one main issue to be tackled in this contribution is to reveal and analyze a fundamental limit of practical and realizable CoMP schemes, mainly due to the required clustering and/or due to different assumptions for the radio channels.Note, for theoretical analysis Rayleigh channels are often assumed, which are known to have an optimal eigenvalue distribution, while macrocellular radio channels like the spatial channel model extended (SCME) as being used in 3GPP [15] may suffer from correlation of collocated antenna elements and certain line of sight (LOS)/nonline of sight (NLOS) probabilities.
The important question is whether the optimization problem is limited by fundamental reasons, for example, esteeming from clustering, or just due to the dimension of the optimization problem.The first case would be in combination with its significant complexity almost a killing point against JT CoMP, while in the second case it would be just a matter of doing things right.
We have no final answer to this question yet, but in this contribution we propose an overall framework for interference mitigation of which JT CoMP is just one of the building blocks.Based on an in depth analysis of interference conditions, scheduling effects, and so forth different concepts are combined to get much closer to the theoretical results of network wide cooperation with clustered cooperation areas in macrocellular radio environments.
Reference [16] mentions the importance of interference and provides an optimal power control provided for two opportunistic cooperative base stations.However the authors mention that the algorithm does not bring performance benefits if there are only cell edge users.Besides these ideas [17], suggests a concept to improve the bandwidth efficiency with two-way relay nodes where these nodes have multiple antennas.Upper bounds regarding symbol error probability are derived as well as the order of diversity of the two proposed schemes.
The proposed framework is more than CoMP and includes techniques like interference rejection combining (IRC), multiuser MIMO (MU-MIMO), interference floor shaping, wideband beamforming, antenna tilting, power adaptation, clustering, user grouping, effective precoding including  diversity, per cell and per cooperation area scheduling, optimum channel state information (CSI) estimation, and according to low rate but accurate feedback links.
Implementation of specific issues like the just mentioned CSI estimation and feedback as well as intersite synchronization, backhaul overhead, and so forth are investigated in more detail in the EU founded project Artist4G [18], while this contribution has its main focus on maximizing system level gains under ideal conditions.In the end, both parts have to be combined appropriately to form an overall system concept.
Some items like partial CoMP, cover shifts or interference floor shaping by cooperation wide vertical beam steering have been partly described in [19,20], while the CoMP scheduler results and verification of main parts measurements in the real world CoMP testbed from TUD [21] are presented here for the first time.
The main outcome of the overall proposed concept is a factor of 2.4 higher spectral efficiency than achieved so far by MU MIMO and about 1.8 times higher than for the up to date best 3GPP JT CoMP scheme as being given in [13].This we think is really a great progress.

Contribution of This Paper
In the introduction, an overview about the issues relevant for interference mitigation based on JP CoMP was given.This chapter presents an interference mitigation framework designed to overcome the low penetration rate of MS that can be served by their individual set of network wide strongest BSs.To reach that goal, several concepts are combined to tackle each issue that hinders a higher penetration rate of MSs with high SIRs.These concepts can be roughly categorized into the following points.
(i) Oversized Clusters.Oversized clusters provide a higher probability that MSs find all of their strongest BSs in a single cluster.
(ii) Partial User Centric Assignment.It is sufficient if MS has only a subset of its serving BSs as its network wide strongest ones.This increases the probability to find multiple MSs to be served jointly from the same BS cluster.
(iii) Overlapping Clustering.The clusters are overlapping such that, around the border of two clusters, a third one is added to serve the cluster-edge-MSs as center MSs.
(iv) Interference Floor Shaping.Residual intercooperation area interference is reduced by a suitable combination of wideband (WB) beamforming and per WB beam individual vertical antenna tilting.This is done utilizing the multiple antenna elements (AEs) mounted at BSs.

Considered Network and Signal Model
We consider a downlink scenario within a cellular network with transmitting BSs and receiving MSs.The cells are organized in sites with three cells per site each covering a sector of 120 ∘ as illustrated in Figure 2 consisting of three adjacent sites.Each BS covers a single cell.The mathematical notation is listed in Table 2. First, a simple single input single outpout (SISO) radio channel between MS  and BS , each with only one AE, is introduced as complex scalar value ℎ  .If MSs and BSs have multiple AEs, denoted by   and   , respectively, the channel ℎ  becomes a matrix H  ∈ C   ×  .All of these values represent the channel for a certain set of 12 subcarriers in the base band, accordingly physical resource block (PRB) in case of LTE.Radio channels from multiple BSs, denoted by a set, for example, B = { 1 , . . .,   }, are concatenated horizontally.Similar, the radio channels to multiple MSs in a set (e.g., M = { 1 , . . .,   }) are concatenated vertically.This and the concatenation to a matrix H MB is shown in (1) 3.1.Joint Precoding/Transmission.In the following, M  is the set of MSs that are jointly served by BS .We assume further that MSs have only one receive antenna, that is   = 1. Thus, To separate all jointly served MSs in the set M  , BS  uses a pre- Interference between these MSs mostly depends on the precoding algorithm, like for example, MMSE or zero forcing.These algorithms strive to suppress interference power below the noise or in the case of zero forcing to remove it completely.
In reality an interference floor remains, due to impairments such as channel estimation errors and inter-carrier interference.The error floor depends also on the precoding algorithm's "robustness." For instance, the zero forcing algorithm is quite sensitive against precoding errors and more robust solutions are available [22,23], especially for multiple user separation [24,25].We focus on methodical gains achieved by cooperation between multiple, distributed BSs and not on the joint precoding itself.Thus, we assume that joint precoding algorithms, channel prediction and estimation, and so forth are already applied in an optimal manner [26][27][28].
International Journal of Antennas and Propagation

Precoding Normalization Loss. Data streams of all MSs
M  served by BS  are transmitted in parallel.The transmit power used at that BS is limited by  BS max , yielding to a sum power constraint for all signals transmitted in parallel.Furthermore, each single AE at the BS has also a maximum transmit power given by  AE max .Hence, we have a second sum power constraint.These power limitations can either be considered within the joint precoding algorithm as presented, for example, in [29,30] or by an additional normalization between all jointly transmitted data streams.Hence, the precoding vectors, like v   in this article, are normalized.The application of a separated precoding and normalization is typically suboptimal and results in reduced SINR.This precoding normalization loss (PNL) is denoted by (, ): . (4)

Signal-to-Interference Ratio.
This article is about interference mitigation.Thus, we assume an interference limited system with noise power  2  being negligible compared to the interference power.Hence, the signal to interference ratio (SIR) is used in the following.The SIR seen at MS  which is served by BS  can be clearly described using the aforementioned notation:

Cooperation between Base
The SIR defined in ( 5) is now extended: Assuming that a certain MS  is assigned to BS  1 , the BS indices  1 ,  2 , . . .fulfil a special sorting of the BSs.
(i) The BS to which MS  would be connected without any cooperation is denoted by  1 .Typically this is the BS with the strongest radio channel for MS .
(ii) The set of all BSs at the same site B site with BS  1 being one of them is denoted by B site 1 , consisting of BSs { 1 ,  2 ,  3 }.Note that BS  1 is connected to MS  because it has the strongest radio channel to it, but BSs  2 ,  3 have in general an arbitrary connectivity to MS , since they are only connected to MS  now because they are located at the same site as BS .(iii) A CA is a special cluster consisting of three adjacent sites, that is, B site 1 , B site 2 , B site 3 like illustrated in Figure 2. Thus, the comprehensive set C contains 9 BSs { 1 , . . .,  9 }.All BSs   with  > 9 and sites B site  with  > 3 are not in that CA and cause interference at MS .Note that, in general, B site 2 and B site 3 , that is { 4 ,  5 ,  6 } and { 7 ,  8 ,  9 } have also arbitrary channel conditions as already explained above for BS  2 ,  3 .
Using that numbering scheme, the SIR for intra-and intersite cooperation is given by

The Interference Mitigation Framework
In general, cooperation between different BSs can be done in different ways.In this article, it is assumed that JP between BSs in a CA separates all jointly served MSs in the spatial domain, using the AEs of all cooperating BSs as one single and distributed antenna array.The precoding itself can be done in a so-called central unit (CU) like the one sketched in Figure 9 with   = 2 AEs for each of the three BSs in the cluster.Now, we introduce the interference mitigation framework step by step.The specific parameter settings given below are an example for the explanation of the main concepts.The values like the size of the CAs, and so forth have been chosen based on practical considerations, but might be further optimized or adapted in case of other scenarios.

User Centric Assignment.
For high SIRs, the clustering should be organized such that a cluster contains the network wide  strongest BSs for each of its served MSs.This individual list of network wide strongest BSs is referred to as an MS's "wish list" in the following.The assignment is called "user centric" because it depends on the users, that is the MSs and their wish lists.In contrast, the assignment is network centric if cooperation areas are predefined on network level without adaptation to any wish lists.For user centric assignment the interference from adjacent clusters is weak as the strongest BSs belong to the serving cluster and all other BSs have lower power than the serving BSs.In realistic scenarios, user centric assignment is hard to realize because the BSs according to an MS's wish list often do not belong to a single predefined cluster and vary between different MSs.Since in scenarios with intrasite cooperation only BSs on the same site can cooperate, MSs will often have strong BSs at adjacent sites that cause strong inter site interference.Above all, this motivates the introduction of the aforementioned CAs because then, more of the strongest BSs might be in a single cluster, that is a CA.
Regarding the SIR expressions as expressed in (8) the goal of user centric assignments is to minimize the right side of the denominator by maximizing the connectivity ‖h B ‖ of an MS's serving cluster B: Within the optimization in ( 9), the number  of requested strongest BS in the serving cluster is equal to the total amount Ω of BSs in a cluster.If the argument of the max-function is-by chance-identical to the 'th site B site  , intrasite cooperation fulfils these requirements for user centric assignment with  = 3, but often the strongest  BSs are not collected at a single site.In that case, for example, two sites with 6 BSs must cooperate to have at least the strongest  BSs in the serving CA.This case is referred to as partial user centric assignment and formulated in However, the strongest BSs are often distributed over a larger area than just between two adjacent sites within a single CA.This illustrates the challenge of (partial) user centric assignment.Recalling the indexing introduced in Section 3.4, there is the following "user centric condition" concerning the connectivity: Equation ( 10) fulfils the user centric condition (11).With  = Ω, also (9) does so.In the following, the latter case with  = Ω is referred to as "full user centric assignment" and  < Ω is called "partial user centric assignment." Due to the allocation of the strongest  BS, the SIR depends not only on the cardinality of B. For example, recall the indexing from Section 3.
The example above makes clear that large cooperation gains can only be realized if all of the relevant interferers are in the serving cluster and contribute to the wanted signal for an MS, that is, the signal gain alone will not result in strong SIR gains.Thus, the most important issue is to combine the relevant interferers, that is the network wide strongest BSs into the serving cluster for each MS.The SIR can be increased more by interference mitigation of all relevant interferers than due to a signal gain.In the following, three concepts are presented that exploit these issues.
(i) Oversized Clusters.Partial user centric assignment is explicitly permitted.The conditions for user centric assignment are reduced to  < Ω.The MSs should find now their  strongest BSs within a single cluster of size Ω.This is the notion behind the introduction of CAs because in their central area, MSs are surrounded with cooperating BSs.
(ii) Partial Channel Reporting.To realize such large clusters, only channels of the  strongest BSs are reported from the MSs and used as transmitters for the desired signal.The channel coefficients to all unreported BSs are assumed to be zero.This concept is referred to as "partial channel reporting (PCR)" in the following.
(iii) Overlapping Clusters.For MSs positioned on the border between two CAs, a third (overlapping) CA with its center area at that border is added.Since it operates on orthogonal resources, for example, another frequency sub-band, interference towards the first two CAs is completely avoided.This overlapping scenario is illustrated in Figure 7.

Oversized Clustering.
Note that PCR will introduce intra-CA interference at signal power level as it would also appear from adjacent and strong BSs that do not belong to the serving CA, but for an MS, it is unimportant whether interference comes from an unreported BS inside of its serving cluster or any other BSs in the network.The advantage lies in the relaxed condition that only  out of Ω BSs must be the strongest ones for the MS.This increases the so-called "penetration rate, " that is, the percentage of MSs that can be served that way.Note that the penetration rate also depends on the probability to find enough MSs with identical wish lists, an issue which is addressed later.One reason for the very low probability that MSs can be served in a full user centric assignment is the huge number of possibilities for wish lists of each single MS.Considering a network of  BS BSs, the number of possibilities to select a group consisting of Ω BSs is given by the binomial coefficient: Neglecting pathloss and assuming an equal distribution of the connectivity between each BS and MS, the probability that a certain group of Ω BSs is the optimum serving set of an arbitrary MS can be estimated just by the inverted binomial coefficient ( . Thus, the probability that an arbitrary set of BSs is selected and belongs to any of  C clusters, that is, the penetration rate, is given by . The penetration rate is increased using the relaxed assigning condition  < Ω according to the PCR concept.Thus, there are ( Ω  ) possibilities that an MS has its strongest  BSs within a cluster consisting of Ω BSs.Furthermore, the fact that in general multiple MSs with an identical wish list are needed to form a group of jointly served MSs would drastically reduce the penetration rate, but the aforementioned relaxed condition  < Ω allows grouping MSs with different wish lists since only their  strongest BSs must be in the serving cluster of size Ω.
Figure 3 contains results from a system level simulation (details in Table 1) and shows the penetration rate of MSs with 3, 6, and 9 of their strongest BSs in their serving CA.A significant increased penetration rate can already be expected if MSs should have their strongest 6 BSs in a single CA.If  is decreased further to 3, in combination with CAs of cardinality Ω = 9, we have penetration rates of more than 90%.

Overlapping
Clustering.If each possible cluster would be realizable, we have  C → (  BS Ω ) and the penetration rate approaches one, but with a nonoverlapping clustering, the number of clusters that can be defined in parallel from  BS BSs is limited to  BS /Ω.Thus, overlapping clusters allow a higher probability that the wish list of a certain MS is identical to one of these clusters.On a first glance, the idea of overlapping clustering seems similar to fractional frequency partitioning, but the overlapping clustering is chosen such that each BS contributes to multiple CAs that overlap each other.The frequency subbands are assigned to these overlapping clusters such that each BS offers the complete system bandwidth distributed between all clusters it is contributing to.Thus, the MSs must be divided into subgroups served only by a fraction of the system's bandwidth.If these subbands are chosen big enough that each MS can use as much subcarriers as needed for its purposes, the MSs will not suffer from any bandwidth limitation.Of course, the multiplexing gain might be reduced by the reduced bandwidth here.
The overlapping clustering consists of different clustering layers, so called "cover shifts" built from a basic clustering that consists of nonoverlapping CAs as illustrated in Figure 2.Each of these clustering layers is "shifted" into different directions to be located directly on the borders between two adjacent CAs from the basic clustering.As aforementioned, overlapping clusters use different frequency subbands to avoid interference between different cover shifts.The shape of a CA according to Figure 2 allows to built 6 of such cover shifts as illustrated in Figure 4 from left to right.The triangles have their edges on the three sites forming a CA.The center area of each CA is highlighted within the triangles.From left to the right, Figure 4 depicts the basic clustering with more and more cover shifts being added to fill all gaps between the CAs of the basic clustering.With all six cover shifts, the overlapping is such that each border between two CAs in, for example, cover shift "A" is the centre area of another cover shift "B" as illustrated in Figure 7.

Partial Channel Reporting.
As aforementioned, the high cooperation overhead might become an issue in CAs due to intersite cooperation.For full channel knowledge within a cluster, each channel coefficient on each subcarrier for each served MS must be known at each of the cooperating BSs.Furthermore, signals to be transmitted jointly according to Figure 9 must first be distributed between cooperating sites.The PCR concept reduces this overhead significantly, but since the full channel matrix between all transmit and receive antennas is needed for JP, the CU assumes all unreported radio channel coefficients simply to be zero.
A channel coefficient of zero means that no power can be transmitted.Thus, the joint precoding is calculated with wrong assumptions and MSs will see unknown interference from all unreported BSs.
As an example, we consider a channel matrix H M B B between a cluster B = {1, . . ., 5} and the jointly served MSs in M B = {1, . . ., 5}.In addition, we assume one AE at each BS and MS and that the first MS has only reported the channels corresponding to BSs 1, 2, and 5. Thus, the resulting precoding matrix V B contains errors such that the underlined entries (corresponding to the AEs from unreported BSs) contain interference for the first MS: ) .(14) Since this kind of interference appears also in the data stream for the first MS, that is, the first column in V, the PCR concept bounds the achievable SIR, but in noncooperative scenarios the unreported BSs would be interferers as well.Thus, the SIR bound is not really limiting the capacity in real world scenarios.
It is important that each BS in the CA is reported by at least one MS to make sure that the channel matrix does not contain any rows which are completely zero.This can be guaranteed if the scheduling applies a simple rule.
(i) The set M B is composed such that each BS in B has at least one MS of which it is the strongest BS in B. (ii) Each MS reports at least its strongest BS in the CA.This is easy to fulfil because MSs are generally connected to their strongest BS in the network.If each BS contributing to B adds one of "its" MSs to M B , the wish lists from this set of MSs will span all BSs in B. To increase the system's capacity, also more than one MS per BS can be served.This is of course limited by the number of AEs locally mounted at BSs.
There are a lot of different approaches how an MS can select the BS whose radio channels are reported.In this article, two of them are investigated: a "top-" and a threshold-based selection.
(i) A top  selection means to select the strongest  =  BSs in the serving cluster to be reported.(ii) For threshold-based selection, the MS reports all BSs of the serving CA that have a radio channel connectivity stronger than a certain threshold below the strongest BS.
While Top  selection yields to a variable SIR gain with a fixed cooperation overhead, the threshold-based selection has a variable cooperation overhead, but tries to keep a certain cooperation gain which is only limited by inter-CA interference.
With the same simulations as used for Figure 3 (see Table 1 for details) PCR in combination with an overlapping clustering according to Figure 4 has been simulated.Especially for MSs with a low SINR, the gains due to partial CoMP are significant and higher than those of intrasite cooperation.

Scheduling between Cover Shifts. The division into cover shifts with frequency subbands motivates a load balancing between them and the presented clustering in combination
with PCR allows a certain degree of freedom there, because the scheduling of MSs between cover shifts can also be reduced to the reported BS.
With  = 3 and the top- selection approach, the reported BSs can be distributed between one, two, or three different sites.If all of the reported BSs belong to the same site, the corresponding MS can be scheduled to each cover shift because the overlapping clustering as illustrated in Figure 4 was designed such that each site contributes to a CA in each cover shift.If the three strongest BSs are distributed between two sites, there are still two cover shifts left where these MSs could be scheduled to.The cover shift is only fixed if all three reported BSs belong to different sites.Figure 8 sketches the possible cover shifts for different locations of an MS's three network wide strongest BSs, denoted by the three dark hexagons.Of course, the number of sites that are included in a MS's set of reported BSs depends also on .Until now,  = 3 with a top- selection approach was assumed, but if the selection is done threshold based,  only limited by Ω.
Figure 6 compares the top- and threshold-based selection approaches.It comes out that a threshold of about 10 dB results in intrasite cooperation for 80% of all MS.Vice versa this means that the SIR is limited to 10 dB for 80% of all MSs due to intersite interference.This coincides with the low gains that are often seen in such simulations as already mentioned in the introduction of this article.The first idea for load balancing is to schedule MSs that can be scheduled only to one single cover shift first.Second, MSs with two "allowed" cover shifts are distributed between the different cover shifts with the goal in mind to equalize the number of MSs in each of these cover shifts.In a third step, all MSs that are free in their choice are added.As long as there are enough of the latter type of MSs to schedule, an equal load between all cover shifts can be reached.This technique is described more in detail in [31].More powerful scheduling ideas are sketched briefly later in this article.4.6.Interference Floor Shaping.Both, overlapping clusters and PCR exploit the fact that the main cooperation gain arises from a significant interference reduction and less from a signal gain.Despite oversized and overlapping clusters there is still a considerable amount of inter-CA interference, especially for CA edge users.In the following, we focus on the mitigation of that interference part.
Multiple antennas at BSs can be used for a static wideband beamforming before joint preprocessing is applied.This allows steering the transmit power more towards the center of a CA for mitigating inter-CA interference.The concept is presented in Figure 1.The depicted beams are divided into the so-called "centre beams" and "edge-beams." Each BS has for example, 4 AEs and uses them to provide 2 wideband beams.Additionally to the horizontally beamforming, edge beams have a stronger down tilt than centre beams and are reduced in their transmit power to further mitigate the interference power to adjacent CAs.Regarding the channel matrix, the effective channels resulting from the wideband beams are used instead of the physical antenna coefficients.Thus, the number of columns in a channel matrix between a CA and its served MSs is equal to the number of wideband beamformers used per BS and the wideband beams are referred to as virtual antenna elements (vAEs).In order to be specific, the term "physical antenna elements (pAEs)" instead of AEs is used from now on.
Only vAEs are used for joint precoding.The corresponding matrix is denoted by Ṽ.Thus, each antenna weight in Ṽ must be applied in the same manner at all pAEs, the corresponding vAE consists of.
Assuming a uniform linear array with a /2 antenna spacing ( = wavelength), 4 pAEs at each BS and using sin(30 deg) = 0.5, the (normalized) beamforming vectors for each BS, directed to the left and right are given by According to Figure 1 the wideband beams should also have an individual down tilt.Different down tilts can be realized if each single pAE of the uniform linear antenna arrays is extended to a vertical antenna array, again with, for example, /2 spacing.The influences of the vertical antenna array-basically describing a so-called "active antenna"-can be included easily into the mathematical model if all rows of the (now) two-dimensional antenna array are concatenated to one long row which leads to increased lengths of Elements corresponding to different rows of the antenna array must have an additional phase offset according to the down tilt of center and edge beams.Furthermore,  power adaptation for cell edge beams is included by a −6 dB powerdown-scaling of the corresponding beamforming vectors.In     (edge beams).Figure 12 shows the positions of all these wideband beamformers.To keep the notation clear, the length extension of wideband beamformers is suppressed in the following since it has no relevant influence on the mathematical model.Furthermore, the down tilt could also be realized by two different pAEs with different down tilts, but would increase the hardware complexity.
In Figure 12, the first 6 wideband beams of a CA are depicted and numbered.The corresponding beamforming vectors are concatenated to a block diagonal matrix: ) .
(15)  The complete MIMO signal model can be described by The central joint processing (CJP) calculates the joint precoding matrix V based on a channel matrix that is a result of the product between two matrices, that is, H and V. Thus, H is changed by Ṽ.These changes can also decrease the correlation between the individual elements in H compared to H.In other words the rank of a matrix consisting of vAEs might be higher compared to the case when the original pAEs are used as matrix elements.This improvement can occur in particular if Ṽ is of reduced size.Figure 10 depicts a possible realization of virtual AEs: The virtual antenna elements with tildes are to be applied on top of the physical antenna weights on the BSs by physical beamformers.(Here:, three BSs are used).
Figure 11 illustrates the results comparing SINR values utilizing the WB beams with and without CA-specific down tilt and power adaptation.In the latter case, a lower SINR can be observed compared to the CoMP simulations presented earlier in this article.Results presented in [32,33] indicate that this effect comes from an increased inter-CA interference because the pathloss is not as dominant here compared to the CoMP reference scenario without any WB-beamforming.
If the WB-beam down tilt and power adaptation are activated, the results in Figure 11 illustrate the effect of interference floor shaping.Compared to the CoMP reference, a significant SINR gain can be observed, confirming the performance of the proposed interference shaping concept.Beamforming only (no down-tilt, no power red.) Beamforming, -tilting, and -power adaptation The blue line (diamonds) is the reference scenario as in Figure 5. Red (crosses) denotes the wide band (WB) beamforming switched on, but without any additional downtilt or power adaptation.Green (circles) contains the results for additional downtilt and power adaptation.

Real World Issues
The accuracy of the simulation results that were presented in the previous section is limited by the accuracy of the channel models that have been applied.However, the 3GPP 3D antenna model does not accurately describe the vertical dimension of a cellular system since it is constantly considered to have LOS characteristics.A more solid modelling  would require the consideration of multiple reflection clusters as considered in the horizontal dimension.In order to validate the assumptions and results of the previous sections, we, thus, used extensive large scale field trials with regard to these most critical aspects.As depicted in Figure 13, the testbed consists of a total of 16 base stations that are deployed at seven sites.It covers a large part of the down town of the German city Dresden, an area with a typical urban building morphology that is characterized by large apartments of 20-50 m height, a soccer stadium, a train station, railway tracts, In a first field trial, we were interested in the impact of the downtilt on pathloss and coverage.In order to investigate this aspect, the same measurement route was traversed twice.For the first trial a downtilt of 6 ∘ and for the second trial a downtilt of 12 ∘ were used.During the trial a single UE transmitted OFDM pilots with transmit power of 37 dBm over a bandwidth of 5.4 MHz at a carrier frequency of 2.6 GHz.The received signal at all base stations was synchronously captured every 10 s and evaluated for channel and SNR estimation.For further details on the evaluation procedure, we refer the reader to [21,34].The measurement locations are marked in Figure 13 as well, using coloured dots that indicate the SNR to the strongest base station for each location.In total 479 measurements were taken in each trial.For technical reasons, the measurement locations for both trials are not exactly the same which, however, does not effect the following results since the data is evaluated statistically.In Figure 14, we show how the average SNR depends on the distance to the base stations for the two different downtilt settings.
In order to evaluate the relationship between SNR and distance to the base station, we determined the distance to all 16 base stations for each measurement location.Figure 14 shows the average SNR that was measured at different distances to the base stations.However, for each location only the SNR of those base stations was considered which antennas were facing to the measurement location, with a tolerance of 30 ∘ .Figure 14 also shows that the SNR in dB decreases linearly with distance with an average SNR degradation (pathloss increase) of 2.1 dB and 3.7 dB every 100 m and for a downtilt of 6 ∘ and 12 ∘ , respectively.The height of the base stations has a rather small effect on the average pathloss in our field trial.The results are useful to validate the results of our system level simulations and they show that an appropriate setting of the downtilts in a cover shift allows for separation of cooperation areas in a way that BSs that are facing outside the cooperation areas should apply a larger downtilt in order to decrease the interference to other cooperation areas.As described previously, coverage holes that would occur due to this strong isolation of cooperation areas would be closed by using different cover shifts on several layers that are separated by using different resources for each layer.Layer specific cover shifts could be realized by vertical beamforming.Our goal in the field trial that is described in the following was to explore how well different cooperation areas could be separated using these techniques.Since vertical beamforming was not available in the employed field trial system, we concentrated on the separation of a single cooperation area from neighboring base stations.The general field trial setup is the same as shown in Figure 14.The cooperation area of interest was made up of the three sites Hbf-Süd, Hbf, and Lennéplatz.Thus, in order to achieve a good coverage and large cooperation gains inside of the cooperation area a rather low downtilt of 7 ∘ was chosen at base station 2, 6, 8 that are facing inward.At all other base stations of the cooperation area a larger downtilt of 17 ∘ was chosen in order to minimize the interference to other cooperation areas.The downtilt at all surrounding base stations was 17 ∘ as well as those would be interferers from surrounding cooperation clusters that transmit on the same resources.Figure 15 shows the result of a field trial where a single transmitting UE traversed through a wide area of the testbed.The colour of the measurement locations indicates the maximum SNR measured at any base station of the cooperation cluster.A black dot is shown at measurement locations where the SNR to all base station in the cooperation cluster is below 0 dB or where a base station that is not part of the cooperation cluster has a higher SNR than all base stations in the cooperation cluster.Compared to the previous trial in Figure 13, the transmit power was reduced to 18 dBm, thus the SNR is clearly lower as it would be achievable in a cellular downlink with macrobase stations.
An inherent feature of the interference mitigation concept being proposed is joint transmission of base stations in a cooperation cluster.The statistics for the SNR that is instantaneously achieved at several base stations for all measurement locations in the cooperation area is shown in Figure 16.For example the blue curve shows the CDF of SNR of the strongest base station at each location.The other solid curves show the CDF of the minimum SNR of instantaneously achieved at several base stations.For example at 40% of the measurement locations the SNR was larger than 10 dB at two base stations of the cooperation cluster instantaneously.Considering that the transmit power at a typical base station would be about 30 dB higher as that used in this field trial shows that there would be definitely a strong benefit from cooperation.The dashed curves show the SNR CDF of the strongest interfering base station.Thus, a partial feedback (and precoding) scheme would typically use cooperation of no more than four base stations.In order to increase this number, the presented framework incorporates the option for   power control as presented in Section 4.6.Figure 17 shows the impact of power setting, where base stations that face out of the cooperation area have a transmit power that is reduced by 6 dB.Since this field trial was done in the uplink, the impact of a reduced transmit power was obtained by a reduction of the measured SNR.While the reduced transmit power has only small effect on the SNR in the cooperation cluster, the strongest interferer is reduced by 6 dB resulting in a significant increase of the average SINR.Figure 17: The same as Figure 15 but with power control.

Cooperative Scheduling Strategies-Converting SINR into Spectral Efficiency
Applying the above described framework increases the SINR per UE significantly for the single UE per cell scenario as obvious from Figure 11.This is a big step into the direction of noticeable system level performance gains, but leaves a main topic unsolved, that is, the optimum multiuser CoMP scheduling.
CoMP scheduling is-similar as for MU-MIMO-a challenge, requiring ideally an exhaustive search in all potential scheduling and precoding dimensions.The goal is to maximize the spectral efficiency while keeping a predefined degree of fairness or even better improving the degree of fairness.This is equivalent to maximizing the constrained sum rate over all simultaneously served data streams per CA, where the constraint depends on the intended degree of fairness.
For JP CoMP the search space generally comprise the following search dimensions.
(i) Setup of CAs with the goal to serve as many as possible MSs from their most appropriate cells, as being explained above.
(ii) In case of overlapping clusters according to the cover shift concept the scheduler has to consider different allocations of MSs to cover shift.
(iii) The grouping of simultaneously served MSs per CA is a very important step for finding the mutually most orthogonal set of users for a certain subcarrier or PRB.Independent of the specifically used linear precoding technique, like optimum Wiener filtering, zero forcing, signal to noise and leakage minimization, and so forth the PNL  as being introduced in (4) will explode for a badly conditioned channel matrix H.
(iv) User grouping is closely related with determining the optimum number of data streams per CA, that is how many users are served with how many data streams.Generally there is a tradeoff.With increasing number of data streams the sum rate increases up to the point, where the precoding normalization loss  overcomes the multistream throughput gains. combines a splitting loss due the partitioning of the  power to more streams and the loss for orthogonalization of the data streams.
(v) For each user group and rank the optimum and beamformers for the BS and MSs have to be derived.
(vi) As one of the last steps frequency dependent scheduling of MS sets to PRBs or subcarriers has to maximize overall capacity, while ensuring at least proportional fairness for all MSs.Taking all these 6 dimensions into account, one can formulate the overall optimization which must be done comprising Obviously, the overall search space easily explodes even for small number of MSs requiring a powerful heuristic scheduling strategy.
A completely new score based scheduler has been implemented taking advantage of the concepts presented in last sections, that is, being limited to one single CA.The description of all applied scheduling strategies is beyond the scope of this paper, but so far achieved results look very promising.It was possible to serve 3 MSs per cell with reasonable data rates, leading to a mean throughput of about 7 bit/s/Hz/cell for an urban macro 4 × 2 MIMO scenario.This is more than a factor of 2 larger than what has been reported in 3GPP for the same scenario as outcome of the CoMP study item (3.1 bits/s/Hz/cell).

Conclusion
Interference mitigation in cellular mobile radio systems is a challenging problem for high number of interferers that are potentially wide spread between far off BSs.Network wide cooperation is well known to be extremely powerful in theory, but practical feasible systems have to divide the network into smaller cooperation areas (i.e., clusters).As a consequence there will be inter cooperation area interference, spoiling most of the gains observed for network wide interference free transmission.To get back to similar performance as for network wide cooperation we propose a framework for interference mitigation, including oversized clusters and International Journal of Antennas and Propagation overlapping cooperation areas, interference floor shaping by means of wideband beamformers, and a novel cooperative score based scheduler.The performance of all that concepts has been investigated by a software simulation framework written in Matlab and by real world field trials made at the TU-Dresden-LTE-Testbed.
The results indicate that the ideas can really be applied in future networks with significant gains regarding the SIR/ SINR.For the future, further fine tuning is planned to reach even higher gains.
Stations.A BS cluster is described by a set B = { 1 , . . .,  Ω } of Ω cooperating BSs with cardinality |B| = Ω.The set M B = { 1 , . . .,   } of  MSs is served jointly by all BSs in B. Equivalently as M  is the set of all MSs served by BS , M B is defined as the union of all served MSs from the cooperating BSs in B:

Figure 1 :
Figure 1: The complete interference mitigation framework consists of three adjacent sites.Multiple antennas at each of the cooperating BSs perform wideband beamforming to steer the signal more towards the CA's center.

Figure 2 :
Figure 2: A CA consists of three adjacent sites.

Figure 3 :
Figure 3: Penetration rate of user centric served MSs.

Figure 4 :
Figure 4: The basic clustering is illustrated on the most left picture.From left to the right, more and more cover shifts are added.

Figure 5 :
Figure 5: The partial CoMP concepts have a significant gain compared to intrasite cooperation scenarios, especially in the low-SINR regions.

Figure 6 :
Figure6: The number of involved sites in the set of reported BS depends on the number of reported BSs and thus on the threshold if that selection approach is chosen.

Figure 8 : 3 Figure 9 :
Figure 8: Dark hexagons denote the three network wide strongest BSs.Depending on their distribution, an MS can be scheduled into one, two, or six different cover shifts.

Figure 10 :
Figure10: Each element of ṽ must be spread to all pAEs locally mounted at the BSs.Thus, there is much less cooperation overhead on the backhaul.

Figure 11 :
Figure 11:  The blue line (diamonds) is the reference scenario as in Figure5.Red (crosses) denotes the wide band (WB) beamforming switched on, but without any additional downtilt or power adaptation.Green (circles) contains the results for additional downtilt and power adaptation.

Figure 12 :
Figure 12: Numbering scheme for wideband beams at first site in a CA.

Figure 13 :
Figure 13: Field trial setup and measurement locations (colored dots) for the first field trial.The color indicates the SNR to the strongest base station for a downtilt of 6 ∘ .

Figure 14 :
Figure 14: Average SNR (dB) over distance from base stations.Average over results at 16 BSs.The dashed lines show a linear fit.

Figure 15 :
Figure 15: SNR in cooperation area using the tortoise concepts.

Figure 16 :
Figure 16: CDF of SNR at the base stations in the CA.The dashed curve shows the CDF of the SNR of the first interferer from outside of the CA.We see that typically 4 base stations in the CA have a higher SNR than the first interferer.
each MS  ∈ M  .The precoding vectors are columns from the joint precoding matrix V  , used at BS .It is calculated based on the channel matrix H M   and could be, for example, the moore penrose pseudo-inverse of H M   , denoted by H † M   .Intracell interference received at MS  is caused by all other data streams transmitted by BS  for MSs  with  ∈ M  \ .Finally, an additive white and Gaussian distributed noise signal  with power  2  is assumed.If MS  is served just from BS  that serves all MSs in set M  , the signal   received at MS  can be expressed by   = h  v    + ∑ , . ..,  6 and  7 , . ..,  9 belong to B site 2 and B site 3 , respectively.Furthermore it is assumed that the connectivity from BSs that are not members of { 1 , . ..,  9 } is not relevant, that is the interference floor from BSs that are located further away from MS .Now, we compare the SIRs resulting from two different serving clusters for MS  under some further assumptions:     h , 4:  1 , . . .,  3 belongs to the site cluster B site 1 while  4