SVD-Aided Beamforming and Power Allocation Algorithm for Multiuser Turbo-BLAST System Uplink with Imperfect CSI

The SVD-aided joint transmitter and receiver design for the uplink of CDMA-based synchronous multiuser Turbo-BLAST systems is proposed in the presence of channel state information (CSI) imperfection. At the transmitter, the beamforming and power allocation schemes are developed to maximize the capacity of the desired user. At the receiver, a suboptimal decorrelating scheme is first proposed to mitigate the multiuser interference (MUI) and decouple the detection of different users with imperfect CSI, and then the iterative detecting algorithm that takes the channel estimation error into account is designed to cancel the coantenna interference (CAI) and enhance the bit error rate (BER) results further. Simulation results show that the proposed uplink CDMA-based multiuser Turbo-BLAST model is effective, the detection from every user is completely independent to each other after decorrelating, and the system performance can be enhanced by the proposed beamforming and power allocation schemes. Furthermore, BER performance can be enhanced by the modified iterative detection. The effect of CSI imperfection is evaluated, which is proved to be a useful tool to assess the system performance with imperfect CSI.


Introduction
Multiple-input multiple-output (MIMO) communication systems provide a significant capacity increment over the conventional one through appropriate space-time processing [1,2].Bell-Labs Layered Space-Time (BLAST) has shown to be a promising MIMO communication technique that can achieve tremendous bandwidth efficiencies [3].The combination of BLAST and turbo-decoding principle is called Turbo-BLAST [4,5], where the coantenna interference (CAI) caused by the major source of channel impairment can be removed by a serially concatenated iterative decoding algorithm.
Although Turbo-BLAST is considered only for singleuser application, we show herein that it naturally extends to a multiuser code division multiple access (CDMA) scenario [6], which is called CDMA-based multiuser Turbo-BLAST system, where a unique signature code is assigned to each other and used to identify the user and to spread the spectrum of the user's data.Since all the users "simultaneously" occupy the same spectrum, they create multiuser interference (MUI) to one another because of the nonzero cross correlation of their signatures [7,8].The MUI not only limits the system capacity but also increases the system error performance.One of the most promising approaches to meet these challenges is the use of adaptive beamforming and power control [9][10][11][12][13].
In the uplink (UL) of CDMA-based multiuser Turbo-BLAST systems, both the multiple MUI and CAI have to be mitigated.Adaptive interference suppression techniques based on multiuser detection (MUD) and antenna array processing have recently been considered as powerful methods for increasing the quality, capacity, and coverage of these systems.However, the method of optimal MUD has a very high computation cost due to its nonlinear nature [14][15][16].
Moreover, the performance of multiuser Turbo-BLAST system is closely related to the channel state information (CSI).Practically, CSI at the receiver is subject to the error performance because of the non-real-time data processing, quantization error, and imperfect channel estimations, and so forth [17].This gives rise to significant challenges to system design and analysis.Spreader u Regarding these problems, the SVD-assisted beamforming and power allocation schemes are first developed to maximize the capacity of every active user, and a suboptimal decorrelating scheme for the uplink of multiuser Turbo-BLAST system is proposed to mitigate the MUI, then the detection is performed independently for different users after decorrelating.Based on this, an improved iterative detection algorithm is adopted to cancel the CAI.We assume that the spreading structures of all the users are perfectly known to both the transmitter and the receiver at the instant of transmission and reception, and the channel estimation is imperfect, only the channel estimation matrix and the statistical characteristic of channel estimation error are known to the system.Besides, in this paper, the common assumption that true channel matrix and channel estimation error are complex Gaussian is employed, and a CDMA Frequency Division Duplexing (FDD) system is considered.
Based on the above analysis, in this paper, we derive a beamforming and power allocation algorithm for multiuser Turbo-BLAST system in the presence of imperfect CSI, which avoids that most of the existing schemes are based on perfect CSI and designed for single-user case.Moreover, by using the obtained beamforming and power allocation, an improved iterative detection scheme is also presented.With this detection scheme, the system performance is improved further.Simulation results verify the validity of the presented schemes.
This paper is organized as follows: after the introduction in Section 1, the basic system is briefly described in Section 2. Section 3 presents an equivalent model for multiuser Turbo-BLAST system, the SVD-aided beamforming and power allocation scheme, and a suboptimal decorrelating algorithm, as well as the iterative detecting method which are described respectively.The simulation results are analyzed in Section 4, and Section 5 makes the conclusion of the paper.

Channel Model and Basic System Description
2.1.Transmitter Model.We consider an uplink (UL) of synchronous CDMA-based multiuser Turbo-BLAST system with U active users operating in a frequency-nonselective Rayleigh fading channel, that is, a narrowband multiuser Turbo-BLAST system is considered, where CDMA is mainly employed as a multiple access technique to differentiate different users and the corresponding spreading gain is not considered.At the transmitter, a data stream of each user is first encoded, mapped into a symbol stream that is subsequently demultiplexed into n T substreams, and then preprocessed by the right singular matrix, of channel estimation matrix, where each stream is allocated with proper power, finally the preprocessed streams are spread and transmitted by the MS each of which employs n T transmit antennas.The schematic transmit structure of the uth (u = 1, 2, . . ., U) mobile station (MS) is shown in Figure 1, where V (u) represents the transmitter's beamforming matrix formulated for the transmission of the uth MS's data and P (u) is the power allocation matrix for the uth MS under the constrain of total transmit power.

Imperfect Channel Model and Singular
Value Decomposition.Consider a multiuser Turbo-BLAST system where the base station (BS) employs n R antennas and each user is equipped with n T antennas (n T ≤ n R ).We introduce an imperfect channel model in this section [17].For the uth user, let H (u) ∈ C nR×nT denote the true complex channel matrix and H (u) ∈ C nR×nT denote the complex channel estimation matrix.Note that the channel estimation matrix and the true channel matrix are different from one another for the uplink.Based on the imperfect channel model, the true complex matrix H (u) can be formulated as where Ξ (u) ∈ C nR×nT is a complex matrix related to the imperfect CSI for the uth user.The true channel matrix H (u)  and channel estimation errors Ξ (u) are complex Gaussian, which leads the estimated channel matrix H (u) to be also complex Gaussian.Hence their statistical distributions are shown as e (u) i j ∼ CN (0, σ 2 e ), h (u) i j ∼ CN (0, 1 − σ 2 e ), and h (u)   i j ∼ CN (0, 1), which are all distributed by complex Gaussian law, and σ 2 e indicates the inaccuracy of the CSI.Since the practical system cannot detect the degree of the imperfection of the CSI, the channel estimation matrix H (u) is treated as the true channel matrix, which can be decomposed as where (•) H denotes a matrix conjugate transpose, Φ and V (u) are the unitary matrices with left and right singular vectors of H (u)   2, where the uth user is assumed as the desired user.The decorrelating scheme and The receiver structure of the multiuser Turbo-BLAST system.
the iterative detection algorithm are derived based on the availability of the spreading structure of both the desired and interfering users, as well as the CSI of the desired user.
At the receiver, the received signal is decorrelated and then the signal of the desired user can be extracted after decorrelating through the use of the orthogonal spreading codes.The decoupled signal is orthogonally converted by , the left singular matrix of H (u) .Finally an iterative detection strategy based on the "turbo" principles is used for the symbol detection after orthogonally converting.For the iterative decoding, the optimal decoding process can be separated into two stages: soft-input/soft-output (SISO) channel detector and SISO channel decoder, which mutually exchange the extrinsic information sent from one stage to the other iteratively until the decoding process converges.
Let n T data streams be transmitted by the uth MS to the BS hosted by a vector expressed as x (u) = [x (u)  1 , . . ., x (u)  nT ] T , u = 1, 2, . . ., U, {s (u) (l)} L l=1 denotes the spreading codes of the length L assigned to the desired user, and n(l) = [n 1 (l), . . ., n nT (l)] T is a column vector of additive Gaussian noise variables during the lth epoch, where each component is independent and identically distributed (i.i.d) zero-mean complex Gaussian variables with the variance of σ 2 n .Under these assumptions, the baseband discrete-time signal received by n R receive antennas at the BS can be expressed as where P (u) = diag( P (u) 1 , . . ., P (u) nT ) is the diagonal transmit power matrix for the desired user with total power constraint nT k=1 P (u)  k = P total = n T .

Equivalent System Model and Receiver Design
3.1.Equivalent System Model.In this subsection, an equivalent system model is proposed for the uplink of the multiuser Turbo-BLAST system with imperfect CSI.
For a synchronous multiuser Turbo-BLAST system, at a sampling instant, the received signal can be formulated as where n(l) is the equivalent additive noise at the lth instant, which is consisted of the interference caused by the channel estimation errors of all the interfering users and the complex Gaussian noise, respectively.The variance of the equivalent noise n(l) is calculated as follows.
First, let G (u) = Ξ (u) V (u) ; the mean and variance of G (u) can be evaluated as Clearly, the components of G (u) are i.i.d zero-mean complex Gaussian variables with the variance of σ 2 e , thus statistical International Journal of Antennas and Propagation distributions can be expressed as g (u)   i j ∼ CN (0, σ 2 e ) and ε{g (u)  ki [g ; the mean and variance of f(l) can be evaluated as . . .
Clearly, f(l) is a zero-mean complex Gaussian column vector with its component f k (l) . ., L S , whose variance can be evaluated as Since the normalized spreading codes are used, that is, 10) can be written as Because n(l) = f(l) + n(l), the component of n(l) is given as by the aforementioned definition with the variance calculated as Each component of n(l) is i.i.d zero-mean complex Gaussian variables with the variance of When σ 2 e = 0, that is, the channel matrix is estimated perfectly, the variance of the equivalent noise is σ 2 n = σ 2 n , which is equal to the variance of the scenario where the channel is estimated perfectly.

Decorrelating Scheme.
Optimal coherent and noncoherent detection based on maximum likelihood (ML) processing can be readily derived from (4) [14,15].These optimal detectors perform joint detection for all users and incur an exponential complexity in both the number of active users and that of transmit antennas.The enormous complexity of the optimum receivers renders them mainly of theoretical interest.
In this part, a suboptimal decorrelating receiver is presented which decouples the detection of different users with a linear complexity in the number of active users [16].
In order to express it compactly, we define Y y(1), . . ., y(l), . . ., y(L S ) , A A (1) , . . ., A (u) , . . ., A (U) . ( Then, ( 4) is expressed as The variance of any component of Since the information about all the transmitted symbols is contained in A, the key to the differential and coherent receivers introduced in this paper is to obtain an initial estimate of A, which is used for the subsequent detection.
When we assume that SS H has a full rank, the ML estimate of A, which is conditioned on { H (u) } U u=1 , { V (u) } U u=1 , P (u) and {S (u) } U u=1 , is given by The equation of ( 15) is suitable for all the spreading structure, but if the nonorthogonal spreading structures, such as GOLD code, are used, it is very difficult to get the variance of the equivalent noise.We use normalized orthogonal WALSH code in this paper, which satisfies SS H = I U .
The ML estimate R is effectively the output of a decorrelator with the input being the received matrix Y.The decorrelator mitigates the MUI and decouples the detection of different users.
Then we study the impact of the decorrelation on the variance of the equivalent noise, via (14), (15) which can be rewritten as where N = NS H is the equivalent noise after decorrelating and the variance of N is calculated as follows: that is, the variance of equivalent noise is unchanged after decorrelating.
Due to the block structure of A, the ML estimate of the desired user denoted by r (u) can be easily obtained from R, which is the uth column of R where n e /L S +σ 2 n )I nR , that is, the variance is increased with the number of active users.
The received signal of the desired user is achieved after decorrelating, corresponding to the preprocessing at the transmitter, and the decoupled signal is first orthogonally converted by Φ (u) , which is denoted as where n the variance of the equivalent noise is calculated as follows: International Journal of Antennas and Propagation Clearly, the components of n (u) are also i.i.d zero-mean Gaussian variables with the variance of (Un T σ 2 e /L S + σ 2 n ).

SVD-Aided Beamforming and Power Allocation Scheme.
Given the proposed equivalent multiuser Turbo-BLAST system model, we give an SVD-aided beamforming and transmit power allocation scheme in this part, which aims at maximizing the capacity performance of the desired user in the presence of imperfect CSI.
When CSI is imperfect, the right singular matrix of H (u) is used for beamforming and the channel capacity of the desired user with SVD-aided beamforming and power allocation can be expressed as [1,18] where λ nT ] is the diagonal matrix with the eigenvalues of H (u) [ H (u) ] H and P (u)  k is the transmit power allocated for the kth subchannel of the desired user, which satisfies the total transmit power constrain.The capacity subject to an optimization problem is given as below The Lagrange multiplier method is employed to find the optimum power allocation matrix that can maximize the capacity with total power constraint.After solving the equations in (23), we find that the optimal power allocated for the kth subchannel is where (x) + max{x, 0} and μ is the Lagrange multiplier restricted by nT In a frequency-nonselective, Rayleigh fading MIMO channel, the channel capacity is the statistical mean of λ (u) approximately in the presence of CSI imperfection.
Because { λ (u) k } nT k=1 are i.i.d variables and all of them have the same probability density p[ λ (u)  k ], formula (25) can be expressed as Since singular values are random variables, it is hard to achieve the function of probability density straightly.The channel capacity can be calculated by averaging (26) over a large number of channel realizations, which is expressed as where N is the number of channel realizations, which is used for simulation, {P (u) k,i } nT k=1 are the transmit power sequence for ith channel matrix H (u)  i , { λ (u) , and σ 2 e,i and σ 2 n,i are the variances of channel estimation error and AWGN, respectively.

Iterative Detection Algorithm for the Uplink of Multiuser
Turbo-BLAST Systems.In this section, an iterative detection strategy is employed to enhance the error performance [4] in the presence of imperfect CSI.Because the orthonormal spreading codes are used, the MUI is removed absolutely after decorrelating, so the iterative detection of every user is completely independent from each other.
Let x (u) k be the kth transmitted signal of desired user at a sampling epoch, by (19); the received symbol vector r (u)  which is corrupted by the channel noise and interferences, can be written as the sum of desired response, the CAI, and the equivalent noise: k is the kth column of the matrix Λ (u) , P (u) k is the transmit power allocated to the kth subchannel of the desired user, and Λ (u) k , x (u)  k , P (u)  k is defined as follows: (29) By ( 29), the decision statistic of kth substream of the uth user using a linear filter w (u)  k can be expressed as where q (u) k , d (u) k , and z (u) k are the desired response obtained by the linear filter, the coantenna interference, and the phaserotated equivalent noise, respectively.
The CAI can be removed from y (u)   k by the proposed iterative detector and soft interference cancellation based on mean square error (MMSE) principle.The improved estimation of the transmitted symbol x (u)  k can be formulated as where k is the linear combination of the interfering substreams.The estimation error is defined as k .The weighted vector w (u)  k and the interference combination d (u)   k are optimized by minimizing the mean-square estimation error Δx (u)  k between each substream and the related estimation, by the following cost function: where the expectation is taken over the equivalent noise n and the statistics of the data sequence x (u) .
We use standard minimization techniques to solve the optimization problem formulated in (32).In order to arrive at the solution, we write the cost function as where International Journal of Antennas and Propagation Via (34), the cost function in (33) can be further evaluated as The linear combination of interfering substreams d (u) k and the weighted vector where Therefore, for the desired user, the weighted vector w (u)  k is used for the iterative detection in the presence of imperfect CSI.Thus (31) can be rewritten as For the first iteration, we assume ε[x (u)  k ] = 0 and thus the linear MMSE detection for the kth substream of the desired user becomes Next, we assume that ε[x (u)  k ] → x (u)  k with the increasing number of iterations, the CAI interference canceller for the kth substream can be reduced to the linear MMSE receiver expressed as (40), where the SVD-assisted beamforming and power allocation scheme are used The performance of iterative scheme depends on the veracity of channel estimation matrix and the power allocation scheme

Simulation Results
In this section, we compare the capacity and BER performance of the traditional and modified multiuser Turbo-BLAST systems, where the traditional multiuser Turbo-BLAST systems adopt the equal power allocated strategy and without the SVD-aided preprocessing and postprocessing, denoted by "EPA", and the modified multiuser Turbo-BLAST systems employ the proposed SVD-aided beamforming and transmit power allocation scheme, denoted by "BF-TPA", respectively.
At the transmitter, the data stream is first encoded by a rate-1/2 convolutional code with generator (7,5), modulated by 8-PSK modulation scheme, and spread by WALSH codes, whose capacity performance over a large number of channel realizations is exhibited in Figures 3-5, extra capacity, which is defined as the capacity gap between the modified multiuser Turbo-BLAST system and the conventional one, is given in Figure 6, and finally, BER performance is presented in Figures 7-8 with different inaccuracy of channel estimation and different number of active users.4 active users are considered in Figure 3 and Figures 5-7.
Figure 3 shows capacity performance of traditional and modified multiuser Turbo-BLAST systems with four transmit and four receive antennas under the conditions of different channel estimation errors.Figure 3 indicates that the proposed SVD-aided beamforming and transmit power allocation strategy is an effective means to improve capacity performance even with the imperfect CSI, and the larger capacity gains can be achieved for lower E b /N 0 and/or higher σ 2 e .For example, at E b /N 0 = 10 dB, there are 0.4 bit/s/Hz) and 0.8 (bit/s/Hz) capacity gains for σ 2 e = 0.1 and σ 2 e = 0.4, respectively.
The capacity performance of the modified multiuser Turbo-BLAST systems with different number of active users in the presence of imperfect CSI are given in Figure 4, where U denotes the number of active users.It is clear that from Figure 4, in the presence of imperfect CSI, the capacity is lower for larger number of active users, and that the impact of the number of active users on the capacity performance is higher for higher E b /N 0 .For instance, at E b /N 0 = 0 dB, the capacity of desired user is almost the same for different number of active users.As E b /N 0 becomes high, the capacity degradation becomes large with the number of active users; for example, at E b /N 0 = 30 dB, there are 17 (bit/s/Hz) capacity degradation for the desired user when system with 64 active users is compared to that with only one active user.This is due to the existence of the channel estimation error, which will make the equivalent noise of the system large, especially for large U (which can be seen from ( 17) and ( 21)).This large noise will degrade the system performance greatly because it can affect  the following iterative detection and the resultant capacity calculation.
Figure 5 shows the capacity performance of the modified multiuser Turbo-BLAST systems in different values of E b /N 0 with imperfect CSI.The simulation results are obtained for five different E b /N 0 values of 10 dB, 15 dB, 20 dB, 25 dB, and 30 dB, and under various degrees of CSI imperfection.From Figure 5, we see that for CSI imperfection with σ 2 e < 10 −3 , the actual capacity is almost constant, as the CSI imperfection gets more severe; in particular, σ 2 e > 10 −2 ; the capacity becomes very sensitive to the CSI imperfection.
And we see that the CSI imperfection on the capacity performance is higher for higher E b /N 0 because for higher E b /N 0 , that is, σ 2 n → 0, the variance of the equivalent noise mainly depends on the error of channel estimation, and therefore, the capacity performance becomes sensitive to the CSI imperfection.
In Figure 6, we give the extra capacity results, which are defined as the capacity gap between modified and traditional multiuser Turbo-BLAST systems with sixteen receive antennas and various transmit antennas configuration under the conditions of imperfect CSI (σ 2 e = 0.01) for five different  E b /N 0 values of 0 dB, 5 dB, 10 dB, 15 dB, and 20 dB.It is clear that from Figure 6, we can see the extra capacity is higher for lower E b /N 0 and/or large number of transmit antennas in the presence of CSI imperfection.For instance, at E b /N 0 = 0 dB, there are 16 (bit/s/Hz) capacity gains for the multiuser Turbo-BLAST system with sixteen transmit antennas, while only 3 (bit/s/Hz) capacity gains can be achieved for twelve transmit antennas.BER performance of traditional and modified multiuser Turbo-BLAST systems under the condition of different inaccuracy of channel estimation is shown in Figure 7, where I denotes the number of iterative detection.It is observed from Figure 7 that as the BER results are concerned, the modified multiuser Turbo-BLAST system outperforms the traditional one under the same conditions, regardless of the status of CSI.This implies that the proposed SVDaided beamforming and power allocation strategy are also valid to improve BER performance even with the imperfect CSI.For example, at a BER of 10 −3 , we can see that there are 4 dB gains for the modified system over the traditional one in the 1st detection iteration under the condition of imperfect CSI, that is, σ 2 e = 0.01.Moreover, the BER results are quickly lowered with the increasing detection iterations.
For instance, at a BER of 10 −3 , there are 2.5 dB extra gains attained for the modified system, where BF-TPA scheme is adopted through 3rd iterative detection under the condition of σ 2 e = 0.01, which indicates that the proposed iterative detection scheme is an effective means to enhance the BER performance further even with imperfect CSI.
Figure 8 gives the BER performance of the modified Turbo-BLAST system with different number of active users under the condition of imperfect CSI (σ 2 e = 0.01), where U denotes the number of active users and I denotes the number of iterative detection.Figure 8 shows that at the same inaccuracy of channel estimation and for the identical number of iterative detection, the BER performance gets worse for large number of active users.For example, at a BER of 10 −4 , comparing the system that has sixty-four active users with the one having only one active user, there is 4.5 dB degradation of E b /N 0 .This is because the variance of each components of the equivalent noise is Un T σ 2 e /L S +σ 2 n ; when the multiuser system was with more active users; the variance is approximately linear to the number of active users, so the BER results get worse with the increase of the number of active users.

Conclusions
In this paper, we first introduce a CDMA-based multiuser Turbo-BLAST system model, based on the imperfect channel model, the equivalent system model, and the variance of the equivalent noise.In order to maximize the capacity performance of the desired user, the SVD-assisted beamforming and power allocation strategy are theoretically obtained and the Lagrange multiplier method is then employed to find the optimized power allocation matrix, which is subject to the total transmit power constraint.And then, to circumvent the complexity of multiuser detection, the suboptimal decorrelating scheme is proposed to decouple the detection of different users.Finally, the iterative detection technique is adopted after postprocessing to enhance the BER results further.The complexity of the suboptimal receivers is exponential only in the transmission rate, similarly to singleuser Turbo-BLAST receivers.Numerical results show that the newly introduced method is effective to enhance the capacity results and the BER performance of multiuser Turbo-BLAST systems in the presence of imperfect CSI.

Figure 1 :
Figure 1: Transmitter structure of the uth mobile station.

Figure 5 :Figure 6 :
Figure 5: Capacity performance of the modified multiuser Turbo-BLAST system in different values of E b /N 0 with imperfect CSI (U = 4).

Figure 7 :
Figure 7: BER performance of traditional and modified Turbo-BLAST systems under the conditions of different inaccuracy of channel estimation (U = 4).

Figure 8 :
Figure 8: BER performance of modified Turbo-BLAST systems with different number of active users under the conditions of imperfect CSI (σ 2 e = 0.01).
as their columns, Λ