Angular Domain Data-Assisted Channel Estimation for Pilot Decontamination in Massive MIMO

Powered by TCPDF (www.tcpdf.org) This material is protected by copyright and other intellectual property rights, and duplication or sale of all or part of any of the repository collections is not permitted, except that material may be duplicated by you for your research use or educational purposes in electronic or print form. You must obtain permission for any other use. Electronic or print copies may not be offered, whether for sale or otherwise to anyone who is not an authorised user. Beyene, Yihenew; Ruttik, Kalle; Jäntti, Riku


Introduction
Catering to throughput/data-rate demands of users in very densely populated areas is costly using legacy solutions.This would typically require operators to have very dense mobile networks cell sites with increase in cost of backhauling, powering, maintaining, and securing the sites.This is particularly critical in emerging markets which will increasingly have the most densely populated areas [1] but low Average Revenues Per Users (ARPUs) [2].The extreme Mobile BroadBand (eMBB) capabilities envisioned in 5G provide an opportunity for operators to accommodate highly scalable throughput demands in very densely populated areas through use of advanced radio technologies, one of the most promising being large Multiple-Input-Multiple-Output (MIMO) system, usually referred to as Massive MIMO (M-MIMO) [3].M-MIMO is considered as one of enabling technologies for future cellular systems [4][5][6].Studies have shown that M-MIMO is able to suppress the impacts of additive noise and uncorrelated intercell interference [3,7].However, gains from MIMO are highly dependent on the quality of available Channel State Information (CSI) [8].In dense concentration of Users/User Equipment (UE), M-MIMO suffers from pilot contamination.
Optimal MIMO precoding requires CSI between each user and each Base Station (BS).A natural way to achieve this information is to use DownLink-(DL-) UpLink (UL) reciprocity of a TDD system [9][10][11].The channel estimated from UL pilots can be used for precoding DL signal (and vice versa).However, M-MIMO system is characterized not only by large number of antennas but also by a large number of users.The set of orthogonal pilot sequences is usually limited, and for a large number of users we have to reuse the sequences [12].Two users with the same pilot sequence contaminate each other's channel estimations.Pilot contamination due to nonorthogonal training pilots has been shown to be the main capacity limiting factor of a M-MIMO system [7].
Since pilot contamination occurs due to the reuse of same pilot sequences a way to combat it is to reorthogonalize the sequences.In various papers this has been along different dimensions: such as time and space [13].Superimposed data and pilot transmission were proposed in [14].Coordination 2 Mobile Information Systems among BSs allows for joint processing [15] and pilot assignment [16] in order to minimize the interference.
Minimum Mean-Squared Error (MMSE) estimator is able to suppress pilot contamination from interfering channels if channel statistics of the interfering users are available.This requirement can be avoided by simply weighting pilot sequence with user specific channel coefficients that are estimated from reciprocal TDD channel [17].The method assumes long enough coherence time for estimating the reciprocal channels in the training phase and later arranging transmission of pilots.A subspace projection using a singular value decomposition can also be used for filtering (cleaning) the interference [18,19].Those methods assume that receiving antennas have uncorrelated channels.Hence, the desired and the interfering signals can be projected into different subspaces based on eigenvalue decomposition of received signal matrix.The subspace based separation improves the channel estimation quality unboundedly as the number of antennas increases.For sufficiently sparse channels, simple Discrete Fourier Transform (DFT) projection can be used to remove the interference [20].
In this paper we propose data-assisted channel covariance matrix estimation algorithm for angular domain MMSE estimate in linear antenna array.In M-MIMO, such data-assisted estimation can reduce the impact of pilot contamination regardless of sparsity of the channels.Instead of statistical averages, the algorithm uses instantaneous channel power information that is extracted from the data.The algorithm works relatively well with only first data-aided steps and does not need to be iterative.However, the proposed estimator can be used as initial estimate for computationally intensive iterative algorithms such as [21].
We studied the channel estimation problem in multicell system where BSs have massive antenna arrays.The channel coherence time is allowed to be smaller than the number of BS antennas.More importantly, we assume that the BSs do not cooperate and have no explicit knowledge of channel secondorder statistics.Finite-path channel model for linear antenna array [22] is considered in this work.The channel is assumed to have finite number of reflections.Angle-of-Arrivals (AoA) of multipath components are random and do not need to be orthogonal.In realistic scenario there might be very large number of dominant reflections compared to the number of receiving antennas.We introduce a codebook that projects the channel into finite (and not necessarily orthogonal) quantized beams, called angle bins.The motivation behind this approach is that the projection exposes angular sparsity of the channels.Different channels will have different power distributions over the angle bins.Channel estimates over these angle bins can be combined in such a way that pilot contamination is minimized.This is done by applying MMSE criterion to projected channel.
The paper is organized as follows.In Section 2, multicell TDD based system model is presented.Section 3 is devoted for detailed description of proposed data-aided channel estimator where practical estimation algorithms are presented.Comparison of performances of channel estimators based on numerical results is presented in Section 4. Finally, conclusions are made in Section 5.
Notation.Bold face uppercase and lowercase letters are used to denote matrices and vectors, respectively, where I  denotes an  ×  identity matrix.Transpose, conjugate, and hermitian transpose operators are denoted by (⋅)  , (⋅) * , and (⋅) † , respectively.E[⋅] denotes expectation and tr(A) and row(A) denote trace and row space of matrix A, respectively.vec(A) denotes vector formed by concatenating columns of matrix A. h[] denotes  th element of h, H[ :] denotes  th row of H, and H[, ] denotes entry of H at  th row and  th column.| ⋅ | and ‖ ⋅ ‖ denote absolute value and Frobenius norm, respectively.≜ represents definition, ⊗ denotes Kronecker product, and diag( 1 ,  2 , . . .,   ) denotes a diagonal matrix with entries  1 ,  2 , . . .,   .Variables with bar below correspond to angular domain representations: where G is projection matrix.

System Model
Consider an Orthogonal Frequency Division Multiplexing-(OFDM-) based multicell system with  BSs that are using the same time and frequency resources.Each BS has  antennas serving  users equipped with single antenna.All BSs are synchronized and operate in TDD fashion.Users in the same BSs use orthogonal pilot codebook; S × = [s (1) , . . ., s () ], where s () is the pilot sequence used by the  th user.The same pilot codebook is reused in each BS.The uplink and downlink channels are reciprocal, and they are estimated from uplink pilots.

Uplink Training.
While users in a cell have orthogonal pilot sequences, the same pilot sequences are reused in other cells.Received frequency-domain signal at the BS of cell 1 is given as where h ()    ∈ C ×1 is the uplink channel between the  th user in cell  and BS in cell 1 and W ∈ C × is complex Additive White Gaussian Noise (AWGN) having entries with zero-mean and variance  2  .We assume that ‖s () ‖ 2 = .Let us vectorize (2) as where y = vec(Y), S () = s () ⊗ I  , and w = vec(W).

Angular Domain Channel Estimation
Spatial MMSE (SMMSE) estimator for MIMO systems [7,16] requires prior knowledge of covariance matrices of the desired and interfering channels which is a difficult task.The Scaled Least-Squares (SLS) estimate [23] that needs only estimate of the Signal-to-Interference-plus-Noise Ratio (SINR) does not discriminate contaminating pilots.We propose angular domain MMSE channel estimator for antenna arrays.
The angular domain channel covariance matrix is estimated by the aid of data symbols.The covariance estimation is done every Transmission Time Interval (TTI) without the need for prior information such as long-term statistics of the channel.Therefore, it is suitable for fast-fading channels.
where h  = Gh  , y = vec(GY), S = s⊗I  , and w = vec(GW) ∼ CN(0,  2  I  ).The angular MMSE (AMMSE) estimate of h 1 is where This implies that the channel has independent entries, and hence its covariance matrix is diagonal.
For channel estimation, we approximate the covariance matrix with its M-MIMO limit (11).Therefore, where ĥ1 LS is the Least-Squares (LS) channel estimate and Λ ≜ diag( 1 , . . .,   ) is an angle bin weighting matrix which corresponds to the ratio of signal power to the total received power; we call it Fractional Signal Power (FSP).Unlike the SLS approximation of SMMSE, the AMMSE exposes the angular sparsity of the channel as illustrated in Figure 1.

Data
This done by exploiting data symbols in the estimation process.In other words, we estimate the FSP from the transmitted data symbols.The main challenge in this approach is the fact that data symbols are unknown prior to channel estimation.We solve this problem by having a two-stage channel estimation.The first stage involves a simple LS channel estimation from the strongest beam.We assume that the desired signal is stronger than interfering signals, and hence, the strongest beam is relatively less contaminated.Our algorithm uses the channel estimate from this beam to get initial estimate of data symbols.While these data symbols are likely to be erroneous, they can be used to estimate the instantaneous signal power (and hence FSP) in each beam.The accuracy of FSP estimation improves as the number of data symbols increases.Data estimation and FSP estimation make up the two stages of our algorithm.These steps can be repeated iteratively.

Phase 1.
The goal of this phase is to have initial softestimate of data symbols which will be used for computing channel power.Data symbols are used due to their large number compared to pilot symbols.At initial stage, channel estimate is not available.Therefore, we rely on the strongest beam.The channel response for the strongest beam is estimated using LS.This allows us to have initial data symbols.After beam-steering, received pilot and data symbols, respectively, are where x  is vector of  data symbols transmitted from the user in cell  th .The data symbols from the desired user, x 1 , will be decoded from the strongest beam using (17).We assume that the desired channel is stronger than other interfering channels.Hence, with high probability, sum of absolute values of each row of T is maximized where h 1 has largest entry.The angle bin where largest entry of h 1 falls is found as follows.Compute c, where (15) such that  max ≜ arg max  c[] is the index of h 1 with maximum amplitude.Channel estimate for the  th max angle bin becomes Now we can have initial estimate of data symbols where x 1 = T[ max :]/ ĥmax is equalized data.

Phase 2.
We employ the initial data estimate, x1 , to find FSP.The FSP estimation is split into two parts: signal power and interference-plus-noise power estimation.The former is done by correlating the estimated data x1 with received signal on each angle bin.Correlation of x1 with received symbols on  th angle bin ( th row of T) becomes where is the estimation error which vanishes as  → ∞ such that we take |]  | 2 as estimate of signal power on  th angle bin.
Now, consider LS channel estimate from pilots Subtracting ( 18) from ( 20) leaves the interfering channels and the noise such that we can estimate interference-plus-noise power as The approximate FSP on  th angle bin is given as Hence, we can construct the weighting matrix for the estimator (12) as Λ ≜ diag(ρ 1 , . . ., ρ ).The corresponding Data-Aided AMMSE (DA-AMMSE) becomes

Numerical Simulation
We evaluated performances of channel estimators for a hexagonal cellular structure with one tier of neighboring cells (see Figure 2).In order to study the impact of pilot contamination, we assume that one user in each cell located at 2/3 of the cell radius transmits the same pilot sequence.All users synchronously transmit pilot symbols followed by data symbols within the channel coherence time and coherence bandwidth.The BS in the central cell receives signals from the desired user as well as interference from users of other cells.All the interfering users are three times as far from the BS as the desired user such that   = 3 −/2  1 ,  = 2, 3, Unless mentioned explicitly, simulation parameters shown in Table 1 are used.We used normalized MSE and average uplink rate with Maximum-Ratio-Combining (MRC) as performance metrics.

MRC.
MRC is a linear detector, and hence we consider only one data symbol transmission such that the received signal is expressed as where w  ∼ CN(0,  2  I  ) is AWGN and   is the symbol transmitted from the user in  th cell.The ergodic uplink rate for MRC receiver is given as In the Appendix we show that the uplink rate has the following upper bound: 4.2.Results.For a large linear antenna array, interfering channels lie on orthogonal subspace of the signaling channel if their AoAs do not overlap [24].In this case, MMSE estimate is interference-free.On the other hand, for overlapping AoAs, MMSE estimate is corrupted due to pilot contamination.In practical scenarios, the AoAs of desired and interfering channels can overlap.To examine both extremes, we consider two types of AoA distributions: (i) uniform: AoAs of all the users are independent and uniformly distributed over [0, ) and (ii) directed: AoAs a user's channel are concentrated in a narrow beam that has a width of 30 ∘ such that   ∼ U(  − /12,   + /12),  = 1, . . ., , where   ∼ U(0, ).When all the AoAs of all the users are narrow (directed), there is less chance of overlap between users.In Figure 3 the normalized MSE of DA-AMMSE with single and multiple iterations is illustrated.SMMSE uses channel covariance knowledge that is not available at the receiver and therefore is just a bound.DA-AMMSE performs much better than the SLS approach.The gap between DA-AMMSE and the ideal SMMSE is small.The average uplink rate shown in Figures 4 and 5 confirms this claim.The uplink rate for DA-AMMSE assuming known FSP indicates that the loss due to FSP estimation error is small.Figure 5 further reveals that when the number of reflections is large, DA-AMMSE performs almost as good as SMMSE.
Pilot contamination problem becomes worse in a rich scattering environment where covariance matrices of the desired and other interfering channels span overlapping subspaces.As can be seen from Figures 6 and 7, the uplink rate for uniform AoA distribution is far from ideal due to channel estimation error.When the number of antennas is sufficiently larger than the number of reflections, DA-AMMSE has slightly poor performance compared to the SMMSE accounting for covariance estimation error.Interestingly, when there are more reflections than the spatial dimension, DA-AMMSE outperforms the SMMSE.This indicates that the gain from instantaneous signal power based angular covariance estimation is higher than the loss due  to covariance estimation error.Moreover, simulation results prove that DA-AMMSE converges in single iteration.

Conclusion
We studied the impact of pilot contamination in a multicell environment with noncooperative BSs having large number of antenna arrays.

Figure 1 :
Figure 1: Channel sparsity in spatial domain versus angular domain. = 100,  = 10, and  = 7. Cell-edge SNR = 10 dB.The desired and interfering channel powers (a) are spread across all antenna elements and are difficult to separate.After angular transformation (b), channel is more sparse and different beams can be combined with the corresponding FSP weights (d) to suppress pilot contamination effect.

Figure 2 :
Figure 2: A hexagonal grid of single-user cells.The blue and red dotted lines depict uplink transmissions from desired and interfering users, respectively.