Iterative Multiuser Equalization for Subconnected Hybrid mmWave Massive MIMO Architecture

Millimeter waves and massive MIMO are a promising combination to achieve the multi-Gb/s required by future 5G wireless systems. However, fully digital architectures are not feasible due to hardware limitations, which means that there is a need to design signal processing techniques for hybrid analog-digital architectures. In this manuscript, we propose a hybrid iterative block multiuser equalizer for subconnected millimeter wave massive MIMO systems. The low complexity user-terminals employ pureanalog random precoders, each with a single RF chain. For the base station, a subconnected hybrid analog-digital equalizer is designed to remove multiuser interference. The hybrid equalizer is optimized using the average bit-error-rate as a metric. Due to the coupling between the RF chains in the optimization problem, the computation of the optimal solutions is too complex. To address this problem, we compute the analog part of the equalizer sequentially over the RF chains using a dictionary built from the array response vectors.The proposed subconnected hybrid iterative multiuser equalizer is compared with a recently proposed fully connected approach.The results show that the performance of the proposed scheme is close to the fully connected hybrid approach counterpart after just a few iterations.


Introduction
A new generation of cellular network (fifth generation, 5G) is coming, and some innovative technologies are needed to ensure better performance and quality of service (QoS).Two enabling technologies have been considered to meet the QoS requirements for future wireless communication, massive MIMO (mMIMO), and millimeter wave (mmWave) communications [1].By using mmWave bands, several tens of GHz of bandwidth become available for wireless systems [2], while the mMIMO allows the continued increasing demand of higher data rates for future wireless networks [3].Comparing the mMIMO with conventional MIMO approaches, the mMIMO can scale up the conventional MIMO by orders of magnitude [4].A survey on mMIMO, also identified as large-scale MIMO, with channel modelling, applications scenarios, and physical/networking techniques can be seen in [5].The use of mmWave with mMIMO is very promising, because smaller wavelength compared to conventional communication systems allows the same volume to pack more antennas [6], which means that the terminals can support a large number of antennas.
The mmWave mMIMO combination can be used to exploit new efficient spatial processing techniques, such as beamforming/precoding and spatial multiplexing, at both transmitter and/or receiver terminals [7].These techniques are different than those used for sub-6 GHz bands due to limitations in hardware [8].In these systems, it is not practical to have one fully dedicated radio frequency (RF) chain by antenna [9] as in sub-6 GHz conventional MIMO systems [10] due to the power consumption and the high cost of mmWave mixed-signal components.Another issue is the mmWave propagation characteristics, which are quite different from sub-6 GHz because the mmWave channels are not so rich in multipath propagation effects [11,12], which should be taken into consideration in the techniques design for these systems.To overcome the limitation of the RFs chains number, purely analog beamforming can use phase shifters [8], with some schemes proposed in [13,14], where statistical channel knowledge is used through phase shifters, 2 Wireless Communications and Mobile Computing to optimally adjust the arrays response in space, applying a beam steering solution.
The performance of the pure-analog techniques is limited by constraints on the amplitudes of phase shifters and due to the phases of the ones quantized.Therefore, analog beamforming is usually limited to single-stream transmission [15].These limitations are overcome by doing some signal processing at an analog level and the rest at the digital level.These architectures are called hybrid analog/digital architectures and have been addressed in [15,16].Precoding and/or combining/equalization designs for single-user systems have been addressed for fully connected hybrid architectures in [17][18][19].In these architectures, each of the RF chains is connected to all receive-and-transmit antennas.In [17], a hybrid spatially sparse precoding/combining approach was designed for mmWave mMIMO systems.The spatial structure of mmWave channels was used to transform the single-user multistream precoding and combining scheme into a sparse reconstruction problem.In [18], joint turbo-like beamforming was designed to compute transmit/receiving analog beamforming coefficients; however the digital processing part was not considered.In [19], codebook design approaches were addressed for single-stream transmissions through an analog beamforming structure.For multiuser systems, some beamforming approaches have been proposed for fully connected hybrid architectures [20][21][22].The authors of [20] proposed uplink receiving beamforming where they assume only single antenna user-terminals (UTs), and at both stages analog and digital ones dealt with multiuser interference.Reference [21] proposed for the downlink a limited feedback analog/digital two-stage precoding and combining algorithm.Transmit/receiving analog beamforming are jointly computed in the first stage to maximize the power of the desired signal, and then the interference is explicitly mitigated using conventional linear zero-forcing (ZF) precoding in the second stage, that is, in the digital domain.An efficient hybrid iterative block space-time multiuser equalizer was proposed in [22].This equalizer was designed based on the iterative block decision feedback equalization (IB-DFE) principle [23].IB-DFE was originally proposed in [24].It does not need the feedback loop of the channel decoder output, and it can be considered as a low complexity turbo equalizer.IB-DFE has been extended to several scenarios, like diversity scenarios and conventional and cooperative MIMO systems, among many others [25][26][27][28][29][30].
In addition to the fully connected architectures, there are subconnected architectures that allow us to reduce the number of phase shifters from  RF to , when compared with fully connected counterparts, where  and  RF are the number of antennas and the number of RF chains [31].Thus, the power consumption used to excite and to compensate the insertion loss of phase shifters is reduced, and the computational complexity is also lower [31].There are two types of subconnected architectures, dynamic and fixed [32].In the dynamic subconnected case, each RF chain can dynamically connect to a different set of antennas, and in the fixed subconnected one, each RF chain is always physically connected to the same set of antennas.Precoding schemes for dynamic subconnected hybrid architectures have been proposed in [32,33].Reference [32] uses a relaxation of the mutual information maximization problem to design a technique that adapts the subarray structure according to the channel covariance matrix for frequency selective channels.The authors of [33] proposed a two-step algorithm for single-user narrowband systems that iteratively optimizes the hybrid precoder for spectral efficiency maximization, obtaining an extra data stream via the index of the active antenna set without any extra RF chain.Fixed subconnected hybrid architectures were addressed in [34][35][36][37].In [34], precoder and combiner schemes for narrowband singleuser systems are proposed, where a two-layer optimization method jointly exploiting the interference alignment and fractional programming was employed.First, the analog precoder and combiner are optimized via the alternatingdirection optimization method, and then the digital precoder and combiner are optimized based on an effective channel matrix.The authors of [35] designed a precoder and a combiner for a wideband single-user system where the overall spectral efficiency is maximized considering a power budget constraint for each subcarrier.The works in [36,37] are focused on multiuser downlink systems.In [36], the total achievable rate optimization problem with nonconvex constraints is decomposed to a series of subrate optimization problems for each subantenna array, and then an algorithm is implemented to perform a successive interference cancelation-based hybrid precoding.Precoding and combining schemes are performed in [37] for downlink, where virtual path selection maximizes the channel gain of the analog effective channel, and then a zero-forcing precoding in the digital domain is applied to mitigate the interference.
In this manuscript, we propose an efficient hybrid iterative block multiuser equalizer for subconnected mmWave mMIMO systems.The limitation that each RF chain is only physically connected to a subset of antennas makes the design of the proposed subconnected hybrid iterative multiuser equalizer harder than for the fully connected based approaches.To the best of our knowledge, iterative block detection designed for subconnected mmWave mMIMO architectures has not been addressed in the literature.We propose low complexity UTs without access to the Channel State Information (CSI), using a single RF chain and pure-analog random precoding.A time encoder/precoder is applied to guarantee that the transmit signal and thus the noise plus interference at the receiver side are Gaussian distributed.This not only simplifies the receiver optimization problem but also increases the diversity effects on the mmWave mMIMO system.The designed subconnected hybrid equalizer is optimized by using the average bit-error-rate (BER) as a metric.We compute the analog part of the equalizer sequentially over the RF chains using a dictionary built by the array response vectors.Finally, we show that the BER performance of the proposed subconnected hybrid iterative multiuser equalizer tends to the BER performance of a fully connected hybrid equalizer as the number of iterations increases.
This paper is organized as follows.Section 2 presents the subconnected hybrid multiuser mmWave mMIMO system

System Characterization
In this section, we describe the channel model, the UTs, and the receiver signals for the mmWave mMIMO system.We consider a multiuser system with  users, each one with  tx transmit antennas, that sends one data stream per time slot to the base station with  rx receiving antennas.

Channel Model.
We consider a -sized block fading channel, that is, a channel that remains constant during a block but varies independently between blocks.The channel follows the clustered sparse mmWave channel model discussed in [17] where H  ∈ C  rx × tx is the channel matrix, which contributes the sum of  cl clusters, each one with the contribution of  ray propagation paths.The channel matrix may be expressed to the th user as = √( tx  rx )/( cl  ray ) is a normalization factor, and  , is the complex gain of the th ray in the th scattering cluster.The functions Λ rx, ( rx,  ,  rx,  ) and Λ tx, ( tx,  ,  tx,  ) represent transmit and receiving antenna element gain for  rx,  ( rx,  ) and  tx,  ( tx,  ), that is, the azimuth (elevation) angles of arrival and departure.The vectors a rx, ( rx,  ,  rx,  ) and a tx, ( tx,  ,  tx,  ) represent the normalized receiving and transmit array response vectors for the corresponding angles.Reference [17] addressed the random distributions used to generate the path gains and the angles of channel, such that

User-Terminal Model Description.
We assume that each user has only a single RF chain and sends only one data stream per time slot over the  tx transmit antennas, as shown in Figure 1.We also consider that the UTs have no access to CSI, simplifying the overall system design.The analog precoder of th UT at the instant  is mathematically modelled by f ,, ∈ C  tx and is physically realized using a vector of analog phase shifters, where all elements of vector f ,, have equal norm (|f ,, ()| 2 =  −1 tx ).Therefore, the analog precoder vector of the th UT at the instant  is generated randomly according to where  ,  with  ∈ {1, . . .,  tx },  ∈ {1, . . ., } and  ∈ {1, . . ., } are i.i.d.uniform random variables such that  ,  ∈ [0, 1].
To guarantee that the transmit signal and then the noise plus interference are Gaussian distributed at the receiver side, the transmit signal X  = [x ,1 , . . ., x , ] is built by using a space-time block code (STBC).A Discrete Fourier Transform (DFT) performs the time and the analog precoder performs the space encoding.The STBC simplifies receiver optimization and increases the diversity of the mmWave mMIMO system.Mathematically, the operation is expressed by where  selected from a QAM constellation with , where ∑  =1  2  = .The transmitter total power constraint is ‖X  ‖ 2  = .For the sake of simplicity, and without loss of generality, in this manuscript, only QPSK constellations are considered.

Receiver Model Description.
For a given -sized block, the received signal is given by where y  ∈ C  rx denotes the received signal vector and n  ∈ C  rx is the zero mean Gaussian noise vector with variance  2  .This signal is processed at the receiver by an iterative block decoder based in subconnected hybrid architecture, as seen in Figure 2. First, the received signal y  is processed through the phase shifters in the analog part, modelled by the vector g ,, ∈ C  , where  ∈ {1, . . .,  RF rx } represents the index of the RF chain connected to a set of  =  rx / RF rx antennas.The global analog matrix G , ∈ C  RF rx × rx has a block-diagonal structure: Baseband processing follows that contains  RF rx chains, each one connected to a subset of  antennas, and a closedloop comprising a digital forward and feedback path.All elements of the vectors g ,, have equal norms (|g ,, ()| 2 =  −1 rx ).For the digital forward path, the signal first passes by a linear filter G , ∈ C × RF rx and then follows a time equalizer and decoding.In the digital feedback path, the recovered data from the forward path first passes through the time precoder and then the feedback matrix B , = [b ,1, , . . ., b ,, ] ∈ C × .The time encoding and decoding of the data symbols obey (4) and where the code-word matrix is a soft estimate of transmitted symbols.Both feedback and feedforward paths are combined; the signal output of the feedback path is subtracted from the filtered received signal G , G , y  .This proposed receiver structure is a subconnected hybrid iterative feedback multiuser equalizer, where the main difference from the conventional iterative block decision feedback based equalizers is the analog front-end of constant amplitude phase shifters.The limitation that all elements of each analog vector must have the same norm makes the design of the proposed equalizer harder than for the conventional fully digital one.On the other hand, it assumes a subconnected hybrid architecture, where only a set of antennas is connected to each RF chain, which simplifies the physical implementation when compared with the fully connected hybrid architecture.In the following sections the analog and digital forward matrices, G , and G , , are designed, as well as the digital feedback matrix, B , .

Iterative Receiver Design
In this section, the proposed hybrid iterative multiuser equalizer is derived for subconnected mmWave mMIMO based systems, discussed in the previous section.We assume a decoupled joint transmitter-receiver optimization problem, with the focus on the design of the subconnected hybrid multiuser equalizer.

Design of Subconnected Hybrid Iterative Multiuser Equalizer.
We start by considering the optimal case, that is, a full digital linear feedforward filter, without the sparsity constraint.We design the iterative multiuser equalizer based on the IB-DFE principles, and we use the average BER as the metric.The average BER minimization, according to [22], is equivalent to the MSE minimization: ((g where g () ,, is the linear feedforward filter, b () ,, the feedback filter, and z  = [1, 1, . . ., 1]  ∈ C  a vector with  ones.The solution to optimization problem ( 18) is (see the Appendix) ,, ) The previous optimization problem of (18) does not reflect the analog domain constraints.Designate G  as the set of vectors with constant-magnitude entries, that is, the set of feasible RF equalizers.Thus, the reformulated optimization problem for the subconnected hybrid iterative equalizer is as follows: ((g ()  ,, ) opt , ( The feedback equalizer for the subconnected hybrid iterative equalizer is analogous to the fully digital iterative equalizer referred to in the previous section, because the analog domain constraints do not impose any restriction on the vector b ()  ,, and thus are given by From ( 17) and ( 21), the MSE expression is Because the feasible set G  has a nonconvex nature, an analytical solution to problem (20) is very hard to obtain.However, an approximate solution is possible by assuming that the vector g ()  ,, is a version of vector a rx, ( 1,1 ), . . ., a rx, ( rx,  cl , ray )] ∈ C  rx × cl  ray , forming the matrix with the array response vectors of the receiver, corresponding to user .As each RF chain is not connected to all antennas, in the selection process of the vector for the th RF chain, the entries corresponding to the unconnected antennas are removed.For the th RF chain we use the dictionary A  rx = [A rx (, )] (−1)+1≤≤,1≤≤ cl  ray  ∈ C × cl  ray  , which is a submatrix of A rx .Then, the vector g ()  ,, is selected from dictionary A  rx .Therefore, optimization problem (20) The constraint ‖ ...

g (𝑖)
,, is nonzero, representing the sparsity constraint.The optimum digital feedforward vector ( ()  ,, ) opt is computed from the solution ( ,, ) opt of the optimization problem in (24), by removing the zero elements.The optimum analog feedforward vector (g ()  ,, ) opt is given by the selected column from (A  rx )  , corresponding to the nonzero element of ...
An iterative greedy method is used to select the best column of the dictionary A  rx allowing enforcement of the sparsity constraint.Due to the mmWave channel nature, the number of the paths is small and, consequently, the complexity of this selection process is low.
Denote  () ,, and (g () ,, ) opt as the digital and analog terms of the feedforward vector, corresponding to the selected indices from ...

g (𝑖)
,, .For example, if the selected indices are in the first positions, then ...

Performance Results
In this section, we show the performance of the proposed hybrid iterative block multiuser equalizer for subconnected millimeter wave massive MIMO systems.The BER is considered the performance metric, presented as a function of   / 0 , where   is the average bit energy and  0 is the onesided noise power spectral density.The power of each UT is normalized to  2 1 = ⋅ ⋅ ⋅ =  2  = 1 and the channel matrix, as previously mentioned, is normalized, such that E[‖H  ‖ 2  ] =  rx  tx .Then, the average   / 0 for all users  ∈ {1, . . ., } is identical and given by   / 0 =  2  /(2 2  ) =  −2  /2.The carrier frequency was set to 72 GHz and was considered a clustered narrowband channel model [40] for each user with  cl = 8 clusters, all with the same average power, and  ray = 4 rays per cluster.The azimuth angles of arrival and departure of the channel model are Laplacian distributed as in [21].We assumed an angle spread equal to 8 ∘ at both the transmitter and receiver and uniform linear arrays (ULAs) with antenna element spacing equal to half-wavelength; however, the subconnected hybrid equalizer developed in this paper can be applied to any antenna arrays.The channel remains constant during a block with size  but varies independently between them.We assume perfect synchronization and CSI knowledge at the receiver side.All results were obtained for a QPSK modulation.We present results for two main scenarios, all with  tx = 8.The other parameters are as follows.(2.b)  rx = 16 ( = 4).
Here  =  rx / RF rx represents the number of antennas per RF chain.In both scenarios, we assume the full load case (worst case), where the number of users and RF chains is equal.To compute the proposed subconnected hybrid equalizer, we consider that f ,, , the analog precoder for the th user, is generated according to (2).As mentioned before, the motivation is to keep the UTs with very low complexity without the knowledge of CSI before transmission.The proposed subconnected hybrid iterative multiuser equalizer is referred to here as subconnected.The results are compared with both fully digital, referred to here as digital, and fully connected, referred to as fully connected, which was recently proposed in [22], approaches.
Start with the results obtained for scenario (1.b) presented in Figures 3 and 4. In Figure 3, we present results for iterations 1, 2, 3, and 4 of the proposed subconnected hybrid iterative multiuser equalizer.It can be seen that the performance improves as the number of iterations increases, as expected.
Additionally, the gaps between the 1st and 2nd iterations are much higher than between the 3rd and 4th iterations.This occurs because most of the residual intersymbol and multiuser interferences are removed from the 1st to the 2nd iteration.For this scenario, the results with the fully digital and fully connected hybrid multiuser equalizer are presented in Figure 4.In this figure, we verify that the subconnected performance is worse than both digital and fully connected approaches, mainly for the first iteration.However, for the 4th iteration we can see that the performance of the proposed subconnected hybrid multiuser equalizer is close to the digital and fully connected counterparts.Therefore, we can argue that the dictionary approximation and the sequential optimization are quite precise.Compare scenario (1.b) with scenario (2.b) presented in Figure 5, where the number of users and RF chains ( =  RF rx ) changes, but the number of antennas per RF chain is the same; that is,  = 4.In scenario (1.b), at a target BER of 10 −3 , the penalties of the proposed subconnected equalizer for the fully connected and digital approaches are 2.3 dB and 3.8 dB, respectively, at the 4th iteration.In scenario (2.b), for the same target BER and iteration, the penalties for fully connected and digital are approximately 1.6 dB and 3.2 dB, slightly decreasing for lower  =  RF rx .We can also see that for scenario (1.b) a lower   / 0 is needed to achieve a BER of 10 −5 compared with scenario (2.b).This is because scenario (1.b) can achieve higher diversity and array gain given by the larger dimension of the receiving antenna array.When we reduce the number of antennas per RF chain by half, that is, for  = 2, which corresponds to scenario (1.a) presented in Figure 6, we verify that the penalty for fully connected and digital approaches is approximately 1.2 dB and 1.9 dB, thus decreasing for lower .This occurs because when we reduce , the subconnected architecture tends toward the fully digital case.In the extreme case where  = 1, we have one antenna per RF chain, that is, the digital architecture.As expected, by reducing  the degrees of freedom of the subconnected architecture increase, and the gap with the fully connected and digital approaches decreases.For the digital equalizer the curve for iteration 1 is not presented in Figures 4,  5, and 6 to improve the intelligibility since it almost overlaps with the curve for iteration 4 of the proposed scheme.In Figures 7 and 8, we compare the performance of the subconnected approach for Scenarios 1 and 2 by considering  = 2, 4, and 6 antennas per RF chain.Although the penalty of the proposed subconnected multiuser equalizer relative to the fully connected and digital increases with , as previously verified, the BER performance of the subconnected multiuser equalizer improves with  because the diversity and antenna gain are higher due to a larger number of receiving antennas.Notably, the above was considered the worst case, where  =  RF rx .Improved BER performance would be achieved for  <  RF rx , that is, for a number of users (streams) lower than the number of RF chains.For that case, the performance gaps between the proposed subconnected hybrid multiuser equalizer and the digital/fully connected approaches would be even smaller.

Conclusions
We designed a new hybrid iterative multiuser equalizer for a subconnected mmWave massive MIMO architecture.We considered low complexity UTs employing randomly analogonly precoding and a single RF chain.First, it was verified that the proposed hybrid iterative multiuser equalizer converges and that it achieves a performance close to the digital and fully connected counterparts, requiring very few iterations.This structure efficiently manages multiuser interference; the dictionary approximation and the sequential optimization are quite precise.We observed that when we reduce the number of receiving antennas per RF chain, that is,  =  rx / RF rx , the BER performance gets worse due to the reduction of both the antenna gain and diversity.We also observed that the gap between the proposed subconnectedscheme and the digital/fully connected approaches decreases when  decreases, because the architecture becomes closer to the digital one.
The performance of the proposed receiver structure shows that it is interesting for practical mmWave massive MIMO based systems, that is, the systems where hardware constraints are considered.This means that the number of transmit/receiving RF chains is much lower than the number of transmit/receiving antennas, and each RF chain is only connected to a reduced number of antennas, ensuring good performance at a low cost.

Figure 4 :
Figure 4: Performance comparison of the proposed subconnected hybrid multiuser equalizer with a fully connected approach for scenario (1.b).

Figure 5 :Figure 6 :
Figure 5: Performance comparison of the proposed subconnected hybrid multiuser equalizer with a fully connected approach for scenario (2.b).