On Matrix Completion-Based Channel Estimators for Massive MIMO Systems

Ding, Mingjun; Yang, Xiaodong; Hu, Rui; Xiao, Zhitao; Tong, Jun; Xi, Jiangtao

doi:10.3390/sym11111377


by Mingjun Ding ¹,², Xiaodong Yang ¹,²,*, Rui Hu ³,*, Zhitao Xiao ¹,², Jun Tong ³ and Jiangtao Xi ³

¹ Tianjin Key Laboratory of Optoelectronic Detection Technology and System, Tianjin Polytechnic University, Tianjin 300387, China

² School of Electronics and Information Engineering, Tianjin Polytechnic University, Tianjin 300387, China

³ School of Electrical, Computer and Telecommunications Engineering, University of Wollongong, Wollongong, NSW 2522, Australia

* Authors to whom correspondence should be addressed.

Symmetry 2019, 11(11), 1377; https://doi.org/10.3390/sym11111377

Submission received: 19 October 2019 / Revised: 4 November 2019 / Accepted: 4 November 2019 / Published: 6 November 2019


Abstract: Large-scale symmetric arrays such as uniform linear arrays (ULA) have been widely used in wireless communications for improving spectrum efficiency and reliability. Channel state information (CSI) is critical for optimizing massive multiple-input multiple-output (MIMO)-based wireless communication systems. The acquisition of CSI for massive MIMO faces challenges such as training shortage and high computational complexity. For millimeter wave MIMO systems, the low-rankness of the channel can be utilized to address the challenge of training shortage. In this paper, we compared several channel estimation schemes based on matrix completion (MC) for symmetrical arrays. Performance and computational complexity are discussed and compared. By comparing the performance in different scenarios, we concluded that the generalized conditional gradient with alternating minimization (GCG-Alt) estimator provided a low-cost, robust solution, while the alternating direction method of multipliers (ADMM)-based hybrid methods achieved the best performance when the array response was perfectly known.

Keywords: low-rankness; massive MIMO; matrix completion; compressive sensing


1. Introduction

Millimeter-wave (mmWave) wireless communications have drawn great attention in industry and academia [1] thanks to the large bandwidth available in the 30–300 GHz band. To compensate for the significant path loss in this band, and enabled by the short wavelength, massive MIMO has been suggested for mmWave systems. In particular, large-scale symmetric antenna arrays, such as uniform linear arrays (ULAs) and uniform planar arrays (UPAs), have been extensively considered for transmitters and receivers due to their neat structures and high gains for directional transmissions [2,3,4]. However, the coherence time in mmWave systems is expected to be short, and the complexity of channel estimation grows with the number of antennas. Therefore, it is challenging to acquire instantaneous channel state information (CSI) for mmWave massive MIMO systems.

MmWave channels are often dominated by a small number of propagation paths, indicating that the channel is sparse in the angular domain [5]. The channel matrix can be expressed in terms of dictionary matrices, which are formed by the transmitting and receiving array response vectors, and path gains. Compressive sensing (CS) [6] can then be applied to search for the dominant paths [7,8,9,10,11,12,13]. Different measurement matrices can be obtained by choosing the precoders and combiners, and various recovery algorithms such as orthogonal matching pursuit (OMP) [7,14] and adaptive CS [8,9,10] can be applied. In general, the above-mentioned CS schemes require knowledge of the array response, which depends on the array geometry and calibration of the antenna arrays. Such knowledge can be inaccurate when there are unknown hardware impairments, e.g., due to phase and gain errors, and imperfect calibration of the antenna arrays.

In the meantime, a small number of propagation paths also indicates that the channel is low-rank [15] and can be represented in a low-dimensional subspace. Furthermore, such low-rankness is independent of the array response and calibration errors. In [16,17], both the sparsity and the low-rank property of the mmWave channel are exploited to enhance CS-based channel estimators. In [16], a two-stage estimator is proposed, where the low-rankness is exploited at the first stage while sparsity in the angular domain is exploited at the second stage. In [17], an improved alternating direction method of multipliers (ADMM) method [18] is applied to exploit the low-rankness and sparsity jointly and thereby enhance the performance. These estimators, however, still require knowledge of the array response vectors and can suffer from performance loss when there are uncertainties in the array response.

To achieve robust channel estimation, matrix completion (MC) methods exploiting only the low-rank property of the mmWave channel have recently been proposed [19,20]. The analysis in [21] shows that the rank of the channel matrix is generally very low and usually much smaller than the antenna dimension. In [19], the singular value projection (SVP) algorithm [22] is adopted to solve the mmWave channel estimation problem. Later, the GCG-Alt method was developed in [20] by applying the generalized conditional gradient (GCG) framework [23] together with the alternating minimization (AltMin) method [24]. There are also other widely studied MC algorithms that can be used for mmWave channel estimation, such as the singular value thresholding (SVT) algorithm [25] and the fixed point continuation (FPC) algorithm [26]. In this paper, we discuss several mmWave channel estimators based on MC, focusing on their performance and complexity compared with alternative methods based on CS. We aim to examine the pros and cons of several MC estimators and the factors that influence their performance.

The rest of this paper is organized as follows. Section 2 introduces the mmWave MIMO channel model. Section 3 reviews CS-based channel estimation. Section 4 formulates the MC-based channel estimation problem and introduces channel estimators based on MC, including their detailed implementation. Section 5 presents simulation results in terms of the mean squared error (MSE) and computational complexity. The final section concludes the paper.

Notation: $A^{T}$, $A^{*}$, and $A^{H}$ denote the transpose, conjugate, and conjugate transpose of matrix $A$, respectively. ${∥ A ∥}_{1}$ denotes the $l_{1}$ norm. $I$ and $0$ represent the identity matrix and the zero matrix/vector, respectively. $A \otimes B$ and $A ⊙ B$ denote the Kronecker product and the Hadamard product, respectively. $tr \{A\}$ is the trace of $A$ and $〈 A, C 〉 = tr \{A^{H} C\}$ denotes the inner product of matrices $A$ and $C$. $E [\cdot]$ denotes the statistical expectation and $abs (\cdot)$ represents taking the element-wise absolute value. ∇ denotes the gradient of a function. For a matrix $A \in C^{M \times N}$, $vec (A) \in C^{M N \times 1}$ denotes the vectorization of $A$ and ${vec}^{- 1} (A) \in C^{M \times N}$ denotes the inverse of vectorization. $R (\cdot)$ and $I (\cdot)$ denote the real and imaginary part of a number or vector, respectively. $CN (a, b^{2})$ denotes the complex Gaussian distribution with mean $a$ and variance $b^{2}$.

2. System Model

Consider a point-to-point, switch-based mmWave hybrid MIMO system, with the receiver at the mobile station (MS) shown in Figure 1. For simplicity and clarity, this paper assumes switch-based mmWave systems to investigate MC-based channel estimators. MC-based estimators can also be applied to phase shifter-based mmWave systems when the hybrid precoders/combiners are properly designed, as shown in [20]. Therefore, the discussion in this paper can be easily extended to phase shifter-based mmWave systems. At the receiver, each of the $N_{MS}$ antennas is equipped with a switch used to select one of the $N_{RF}$ RF chains. The base station (BS) has the same structure with $N_{BS}$ antennas and $N_{RF}$ RF chains. Assume that $N_{s}$ data streams are transmitted, with $N_{s} \leq N_{RF} \leq min (N_{BS}, N_{MS})$ [14,27]. The switching operation can be represented as a precoder $F$, where the nonzero entries indicate the entries of the channel matrix that are sampled. A symbol $s \in C^{N_{s} \times 1}$ with $E [s s^{H}] = \frac{1}{N_{s}} I$ is precoded, resulting in the transmitted signal $x = F s$. We consider a narrow-band flat fading channel whose channel matrix $H \in C^{N_{MS} \times N_{BS}}$ satisfies $E [{∥ H ∥}_{F}^{2}] = N_{MS} N_{BS}$. The received signal is expressed as:

r = \sqrt{ρ} H F s + \hat{n},

(1)

where $ρ$ indicates the average received power and $\hat{n} \in C^{N_{MS} \times 1}$ is a noise vector with i.i.d. entries distributed as $CN (0, σ_{n}^{2})$. Applying a combiner $W$ to the received signal at the MS, the processed received signal is given by:

y = \sqrt{ρ} W^{H} H F s + W^{H} \hat{n} .

(2)

In the switch-based system, the combiner $W$ has a structure similar to that of the precoder $F$.

Following [8], the ray-clustering model of $H$ is given as:

H = \frac{1}{\sqrt{R}} \sum_{c = 1}^{C} \sum_{r = 1}^{R} g_{c r} a_{MS} (ϕ_{c r}^{MS}) a_{BS}^{H} (ϕ_{c r}^{BS}),

(3)

where $C \sim \max \{Poisson (λ), 1\}$ is the number of clusters with $λ$ as the mean of the Poisson distribution, and $R$ is the number of rays in each cluster. The complex small-scale fading gain on the $r$-th ray of the $c$-th cluster is $g_{c r} \sim CN (0, γ_{c})$, where $γ_{c}$ is the sub-power of the $c$-th cluster. In Equation (3), $a_{MS} (ϕ_{c r}^{MS})$ and $a_{BS} (ϕ_{c r}^{BS})$ represent the array response vectors of the receiver and transmitter, respectively, where $ϕ_{c r}^{MS}$ and $ϕ_{c r}^{BS}$ represent the corresponding azimuth angle of arrival (AoA) and angle of departure (AoD), which follow the Laplacian distribution [28].

Considering a uniform linear array (ULA) with distance $d$ between adjacent antennas, the array response is given by:

a_{BS} (ϕ_{c r}^{BS}) = \frac{1}{\sqrt{N_{BS}}} {[1, e^{j \frac{2 π}{λ_{c}} d \sin (ϕ_{c r}^{BS})}, \dots, e^{j (N_{BS} - 1) \frac{2 π}{λ_{c}} d \sin (ϕ_{c r}^{BS})}]}^{T}

(4)

where $λ_{c}$ is the wavelength of the carrier wave. The array response $a_{MS} (ϕ_{c r}^{MS})$ is constructed in the same manner as $a_{BS} (ϕ_{c r}^{BS})$.
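As an illustration of Equations (3) and (4), the following NumPy sketch draws one channel realization. The half-wavelength spacing, the uniform distribution of the cluster center angles, the unit cluster power, and the function names are assumptions made for this example only; no extra scaling is applied to enforce the power normalization of $H$.

```python
import numpy as np

def ula_response(n_antennas, angle, spacing_over_wavelength=0.5):
    """Normalized ULA response of Equation (4) for one azimuth angle in radians."""
    n = np.arange(n_antennas)
    phase = 2.0 * np.pi * spacing_over_wavelength * np.sin(angle)
    return np.exp(1j * n * phase) / np.sqrt(n_antennas)

def ray_cluster_channel(n_ms, n_bs, n_clusters, n_rays, rng,
                        angle_std=np.deg2rad(15.0)):
    """Draw one channel matrix following the ray-cluster model of Equation (3)."""
    H = np.zeros((n_ms, n_bs), dtype=complex)
    scale = angle_std / np.sqrt(2.0)  # Laplace scale giving the requested std
    for _ in range(n_clusters):
        # Assumption: cluster centers are uniform in [-pi/2, pi/2).
        center_ms, center_bs = rng.uniform(-np.pi / 2, np.pi / 2, size=2)
        for _ in range(n_rays):
            # Unit-variance complex Gaussian gain (cluster power assumed 1).
            gain = (rng.standard_normal() + 1j * rng.standard_normal()) / np.sqrt(2.0)
            a_ms = ula_response(n_ms, rng.laplace(center_ms, scale))
            a_bs = ula_response(n_bs, rng.laplace(center_bs, scale))
            H += gain * np.outer(a_ms, a_bs.conj())
    return H / np.sqrt(n_rays)

rng = np.random.default_rng(0)
H = ray_cluster_channel(n_ms=32, n_bs=128, n_clusters=2, n_rays=5, rng=rng)
```

Because $H$ is a sum of $C R$ rank-one terms, its rank is at most $C R$ (here 10), which is far below the antenna dimensions.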

3. Compressive Sensing-Based Channel Estimation

It has been shown that, without considering quantization errors, the mmWave channel estimation problem can be formulated as a sparse recovery problem by modeling the channel as [29,30,31]:

H = A_{MS} H_{v} A_{BS}^{H},

(5)

where $A_{MS} = [a_{MS} (ϕ_{1}^{MS}), \dots, a_{MS} (ϕ_{N_{1}}^{MS})]$ is a unitary dictionary matrix when $N_{1} = N_{MS}$ and an overcomplete dictionary matrix when $N_{1} > N_{MS}$, and $A_{BS} = [a_{BS} (ϕ_{1}^{BS}), \dots, a_{BS} (ϕ_{N_{2}}^{BS})]$ with $N_{2} \geq N_{BS}$. Each column in $A_{BS}$ and $A_{MS}$ consists of a predefined array response vector. $H_{v} \in C^{N_{1} \times N_{2}}$ is a sparse matrix with only $L$ non-zero values, each of which corresponds to the complex gain of a channel path. Vectorization of the channel matrix (5) produces:

vec (H) = (A_{BS}^{*} \otimes A_{MS}) x,

(6)

where $x = vec (H_{v})$ is an $N_{1} N_{2} \times 1$ sparse vector with $L$ non-zero values. We define $Ψ = (A_{BS}^{*} \otimes A_{MS})$ as an $N_{BS} N_{MS} \times N_{1} N_{2}$ dictionary matrix. Sparse recovery schemes can then be used to estimate the channel, which transforms the task of estimating $H$ into estimating the non-zero coefficients in $x$.

A widely used method to estimate $x$ from the received signal is orthogonal matching pursuit (OMP) [7,14]. Using OMP, $L$ path directions are selected from the $N_{1} N_{2}$ candidates in the dictionary. The computational complexity of the OMP method is approximately $O (N L N_{1} N_{2})$, where $N$ is the length of the received signal. In general, a larger dictionary leads to better performance but also higher computational complexity.
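The vectorization step from Equation (5) to Equation (6) relies on the identity $vec (A X B^{H}) = (B^{*} \otimes A) vec (X)$ with column-stacking vectorization. The short NumPy check below uses small random matrices in place of real array dictionaries; the sizes and variable names are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
n_ms, n_bs, n1, n2 = 4, 6, 5, 7

def crandn(*shape):
    """Complex standard normal samples of the given shape."""
    return rng.standard_normal(shape) + 1j * rng.standard_normal(shape)

# Random stand-ins for the dictionaries A_MS (n_ms x n1) and A_BS (n_bs x n2).
A_ms, A_bs = crandn(n_ms, n1), crandn(n_bs, n2)

# Sparse beamspace channel with L = 2 non-zero path gains.
H_v = np.zeros((n1, n2), dtype=complex)
H_v[1, 2], H_v[3, 0] = crandn(2)

H = A_ms @ H_v @ A_bs.conj().T            # Equation (5)
Psi = np.kron(A_bs.conj(), A_ms)          # dictionary of Equation (6)

def vec(M):
    """Column-stacking vectorization."""
    return M.reshape(-1, order="F")

identity_holds = np.allclose(vec(H), Psi @ vec(H_v))
```

Note that `Psi` has $N_{BS} N_{MS}$ rows and $N_{1} N_{2}$ columns, so its size grows quickly with the dictionary resolution, which is the source of the complexity trade-off mentioned above.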

The above-mentioned CS approach uses a discretized approximation of the channel. It may suffer from the off-grid issue if the physical propagation paths are off the assumed grid of angles. In this case, the number of non-zero entries in the beamspace channel $H_{v}$ may not be exactly equal to $L$, leading to power leakage. Another challenge is that knowledge of the array response is required, which may be imperfect in practice due to unknown hardware impairments and imperfect calibration.

4. Matrix Completion-Based Channel Estimation

In this section, we introduce MC-based estimation methods for the mmWave channel by exploiting the low-rankness of the channel matrix. By appropriately choosing the training scheme with proper precoders and combiners, the received signal provides noisy observations of a subset of the entries of $H$:

{[\tilde{Y}]}_{i, j} = \begin{cases} {[\tilde{H}]}_{i, j}, & (i, j) \in Ω \\ 0, & otherwise \end{cases}

(7)

where $\tilde{H} = H + N$ is the perturbed channel matrix, $N$ is a noise matrix, $Ω$ denotes the sampling domain, and ${[\tilde{H}]}_{i, j}$ is the $(i, j)$-th entry of $\tilde{H}$. Define $p = N / (N_{BS} N_{MS})$ as the sampling density, where $N$ is the total number of samples observed.

It is discussed in [19] that when the mmWave channel matrix has strong non-coherent characteristics, it can be recovered from a subset of its entries. We can thus formulate the channel estimation problem as a low-rank matrix completion problem:

min_{\hat{H}} rank (\hat{H}), \quad s.t. \quad {∥ P_{Ω} (\hat{H}) - P_{Ω} (\tilde{H}) ∥}_{F}^{2} \leq δ_{n}^{2},

(8)

where $δ_{n}^{2}$ is the error tolerance parameter and the sampling operator $P_{Ω} (\cdot)$ is defined as:

{[P_{Ω} (\tilde{H})]}_{i, j} = \begin{cases} {[\tilde{H}]}_{i, j}, & (i, j) \in Ω \\ 0, & otherwise \end{cases}

(9)

where ${[\tilde{H}]}_{i, j}$ denotes the $(i, j)$-th entry of $\tilde{H}$. The sampling operator $P_{Ω}$ significantly influences the performance of the algorithm [32]. Bernoulli and uniform sampling models are proposed in [32], and a uniform spatial sampling (USS) model, which improves the performance, is proposed in [33]. With USS, $N / N_{BS}$ samples are taken for each column of the target matrix.
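A minimal sketch of the sampling operator in Equation (9) with a column-balanced mask is given below. It assumes that only the per-column sample count matters and draws the sampled rows uniformly at random; the actual USS scheme in [33] may impose further structure. The function names are hypothetical.

```python
import numpy as np

def uss_mask(n_ms, n_bs, samples_per_column, rng):
    """Boolean mask with the same number of sampled rows in every column."""
    mask = np.zeros((n_ms, n_bs), dtype=bool)
    for j in range(n_bs):
        rows = rng.choice(n_ms, size=samples_per_column, replace=False)
        mask[rows, j] = True
    return mask

def sample(M, mask):
    """P_Omega of Equation (9): keep the entries in Omega, zero elsewhere."""
    return np.where(mask, M, 0)

rng = np.random.default_rng(2)
mask = uss_mask(n_ms=32, n_bs=128, samples_per_column=8, rng=rng)
p = mask.mean()  # sampling density N / (N_BS * N_MS)
```

With 8 samples in each of the 128 columns of a 32 x 128 matrix, $N = 1024$ and the sampling density is $p = 0.25$.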

4.1. MC Estimators

In the following, we discuss several MC estimators that can be used to solve the problem in Equation (8).

4.1.1. SVT Estimator

Before presenting the SVT algorithm, let us define the matrix shrinkage operator:

S_{τ} (X) = U_{X} S_{τ} (Σ_{X}) V_{X}^{H},

(10)

where $X = U_{X} Σ_{X} V_{X}^{H}$ is the SVD of $X$, $Σ_{X}$ denotes the singular value matrix of $X$, and $S_{τ} (Σ_{X})$ is the element-wise shrinkage operator:

S_{τ} (x) = \begin{cases} 0, & x \leq τ \\ x - τ, & x > τ \end{cases}

(11)

where $τ$ is the threshold.

The SVT algorithm [25] can be applied to provide a heuristic solution to Equation (8). It consists of two major steps:

\begin{cases} {\hat{H}}^{k} = S_{τ} (X^{k - 1}) \\ X^{k} = X^{k - 1} + δ P_{Ω} (\tilde{H} - {\hat{H}}^{k}) \end{cases}

(12)

where $τ > 0$, $δ$ is a step size, and $k = 1, 2, \dots$. The iteration is stopped when a stopping criterion is met or a maximum number of iterations $J_{SVT}$ is reached. A singular value decomposition (SVD) is required at each iteration. Some comments regarding the application of the SVT algorithm to the mmWave channel estimation problem are as follows:

The threshold is set as $τ = 5 \sqrt{N_{MS} N_{BS}}$ following [25];
The step size is set as $δ = 1.2 / p$ ;
With the initialization $X^{0} = 0$ , we have ${\hat{H}}^{k} = 0$ for small $k < k_{0}$ . As such, $X^{k} = k δ P_{Ω} (\tilde{H}), k = 1, \dots, k_{0}$ . The algorithm can begin with computing ${\hat{H}}^{k_{0}}$ to save work. From [25], the integer $k_{0}$ is determined by

$\frac{τ}{δ {∥P_{Ω} (\tilde{H})∥}_{2}} \in (k_{0} - 1, k_{0}] .$

The SVT estimator is summarized in Algorithm 1:

Algorithm 1 SVT Estimator

Require: $P_{Ω} (\tilde{H})$ , $δ$ , $ϵ$ , $τ$ , $J_{SVT}$ , $k_{0}$ .

1: Set $X^{0} = k_{0} δ P_{Ω} (\tilde{H})$ ;
2: for $k = 1$ to $J_{SVT}$ do
3: Set ${\hat{H}}^{k} = S_{τ} (X^{k - 1})$
4: Set $X^{k} = X^{k - 1} + δ P_{Ω} (\tilde{H} - {\hat{H}}^{k})$
5: if $\frac{∥ P_{Ω} ({\hat{H}}^{k} - \tilde{H}) ∥_{F}}{∥ P_{Ω} (\tilde{H}) ∥_{F}} \leq ϵ$ then break;
6: end if
7: end for
8: return $\hat{H} = {\hat{H}}^{k}$

From [25], SVT is effective for completing large matrices with low ranks. Its performance degrades as the rank increases. The computational complexity of the SVT algorithm mainly arises from Step 3.
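The steps of Algorithm 1 can be sketched in NumPy as follows. This is a minimal illustration that uses a dense SVD for the shrinkage step rather than a partial SVD; the function names `shrink` and `svt` are chosen for this example, and `Y` stands for the zero-filled observation $P_{Ω} (\tilde{H})$.

```python
import numpy as np

def shrink(X, tau):
    """Matrix shrinkage of Equation (10): soft-threshold the singular values."""
    U, s, Vh = np.linalg.svd(X, full_matrices=False)
    return (U * np.maximum(s - tau, 0.0)) @ Vh

def svt(Y, mask, tau, delta, eps=1e-4, max_iter=100):
    """SVT iteration of Equation (12); Y is zero outside the boolean mask."""
    norm_y = np.linalg.norm(Y)
    # k0 is the smallest integer with k0 * delta * ||Y||_2 >= tau.
    k0 = int(np.ceil(tau / (delta * np.linalg.norm(Y, 2))))
    X = k0 * delta * Y
    H_hat = np.zeros_like(Y)
    for _ in range(max_iter):
        H_hat = shrink(X, tau)
        X = X + delta * np.where(mask, Y - H_hat, 0)
        # Relative residual on the sampled entries (Step 5).
        if np.linalg.norm(np.where(mask, H_hat - Y, 0)) <= eps * norm_y:
            break
    return H_hat
```

A typical call would be `svt(Y, mask, tau=5 * np.sqrt(Y.size), delta=1.2 / mask.mean())`, mirroring the parameter choices listed above.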

4.1.2. FPC Estimator

The FPC algorithm [26] reformulates the MC problem using the nuclear norm, which is the sum of the singular values:

min_{\hat{H}} μ {∥\hat{H}∥}_{*} + \frac{1}{2} {∥ P_{Ω} (\tilde{H}) - P_{Ω} (\hat{H}) ∥}_{F}^{2},

(13)

where $μ > 0$ is the regularization parameter. The algorithm consists of two steps similar to SVT, namely a gradient step on the data-fit term followed by singular value shrinkage:

\begin{cases} Y^{k} = {\hat{H}}^{k} - δ P_{Ω} ({\hat{H}}^{k} - \tilde{H}) \\ {\hat{H}}^{k + 1} = S_{δ μ_{m}} (Y^{k}) \end{cases}

(14)

where the threshold of the singular value thresholding operator is the variable $δ μ_{m}$ rather than a fixed value as in SVT. A continuation strategy [34] is used to accelerate the convergence by adapting $μ_{m}$. The details are presented in Algorithm 2. Some comments are as follows:

We can set the step size $δ \in (0, 2 / λ_{max} (P_{Ω} {(\tilde{H})}^{H} P_{Ω} (\tilde{H})))$ according to [26], where $λ_{max}$ is the maximum eigenvalue;
$μ_{m + 1}$ decreases as

$μ_{m + 1} = max \{μ_{m} η_{μ}, μ_{final}\}, m = 1, 2, \dots, M,$

where M, determined by $(μ_{final}, η_{μ})$ , controls the step size and the estimation accuracy, $μ_{final}$ is small (e.g., $μ_{final} = 0.01$ ), and the parameter $0 < η_{μ} < 1$ determines the decreasing rate for consecutive $μ_{m}$ .

Algorithm 2 FPC Estimator

Require: $P_{Ω} (\tilde{H})$ , $ϵ$ , $J_{FPC}$ , $δ$ , $μ_{final}$ , and $η_{μ}$

1: Initialization: ${\hat{H}}^{0} = 0$ , $m = 0$ , $μ_{m} = {∥P_{Ω} (\tilde{H})∥}_{2}$
2: while $μ_{m} > μ_{final}$ do
3: $μ_{m} = max (μ_{m} η_{μ}, μ_{final})$
4: for $k = 0 : J_{FPC} - 1$ do
5: $Y^{k} = {\hat{H}}^{k} - δ P_{Ω} ({\hat{H}}^{k} - \tilde{H})$
6: ${\hat{H}}^{k + 1} = S_{δ μ_{m}} (Y^{k})$
7: if $\frac{{∥{\hat{H}}^{k + 1} - {\hat{H}}^{k}∥}_{F}}{{∥{\hat{H}}^{k}∥}_{F}} < ϵ$ then break;
8: end if
9: end for
10: end while
11: return $\hat{H} = {\hat{H}}^{k + 1}$

The main computational cost of the FPC algorithm is in Step 6 of Algorithm 2 due to the SVD. In addition, the FPC algorithm needs to choose the step size $δ$ by calculating the maximum eigenvalue of $P_{Ω} {(\tilde{H})}^{H} P_{Ω} (\tilde{H})$ and has a higher computational complexity per iteration than the SVT algorithm.
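A compact NumPy sketch of the FPC iteration with continuation is shown below. It takes a descent step along the gradient $P_{Ω} (\hat{H} - \tilde{H})$ of the data-fit term and warm-starts each continuation level from the previous estimate. The default step size `delta=1.0` and the function names are assumptions of this example, and `Y` again denotes $P_{Ω} (\tilde{H})$.

```python
import numpy as np

def shrink(X, tau):
    """Soft-threshold the singular values of X by tau."""
    U, s, Vh = np.linalg.svd(X, full_matrices=False)
    return (U * np.maximum(s - tau, 0.0)) @ Vh

def fpc(Y, mask, delta=1.0, mu_final=0.01, eta_mu=0.25, eps=1e-4, max_inner=100):
    """Fixed point continuation for Equation (13); Y is zero outside the mask."""
    H_hat = np.zeros_like(Y)
    mu = np.linalg.norm(Y, 2)
    while mu > mu_final:
        mu = max(mu * eta_mu, mu_final)   # continuation: decrease mu geometrically
        for _ in range(max_inner):
            grad = np.where(mask, H_hat - Y, 0)   # gradient of the data-fit term
            H_new = shrink(H_hat - delta * grad, delta * mu)
            change = np.linalg.norm(H_new - H_hat)
            scale = max(np.linalg.norm(H_hat), 1e-12)
            H_hat = H_new
            if change / scale < eps:
                break
    return H_hat
```

Each inner iteration costs one SVD, and the inner loop is repeated for every value of $μ_{m}$, which is why the overall cost exceeds that of a single SVT run with a comparable iteration budget.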

4.1.3. SVP Estimator

The SVP algorithm [22] is based on projected gradients and is detailed in Algorithm 3. This algorithm requires the rank $L$ of the channel matrix to be known. The step size can be chosen empirically as $η = 1 / ((1 + δ_{0}) p)$ with $0 < δ_{0} < 1 / 3$. The stopping criterion is based on the norm of the residual on the sampled entries of the channel matrix, where the small threshold $0 < ϵ < 1 / 2$ can be set, for example, as $ϵ = 10^{- 3}$. The SVP algorithm also needs to calculate the SVD in Step 4, which is the most computationally expensive step of the algorithm.

Algorithm 3 SVP Estimator

Require: $P_{Ω} (\tilde{H})$ , $L$ , $η$ , $ϵ$

1: Initialization: ${\hat{H}}^{0} = 0, t = 0$
2: while $∥ P_{Ω} ({\hat{H}}^{t} - \tilde{H}) ∥_{F} > ϵ$ do
3: $X^{t + 1} \leftarrow {\hat{H}}^{t} - η (P_{Ω} ({\hat{H}}^{t} - \tilde{H}))$
4: Compute the L principal singular values and vectors of $X^{t + 1}$ : $U_{L}, Σ_{L}, V_{L}$ .
5: ${\hat{H}}^{t + 1} = U_{L} Σ_{L} V_{L}^{H}$
6: $t = t + 1$ ;
7: end while
8: return $\hat{H} = {\hat{H}}^{t}$
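Algorithm 3 can be sketched in a few lines of NumPy. The version below is an illustration only: it uses a dense SVD followed by truncation instead of a partial SVD, adds an iteration cap `max_iter` that does not appear in the algorithm, and treats `Y` as the zero-filled observation $P_{Ω} (\tilde{H})$.

```python
import numpy as np

def svp(Y, mask, rank, eta, eps=1e-3, max_iter=500):
    """Singular value projection: gradient step, then rank-L truncation."""
    H_hat = np.zeros_like(Y)
    for _ in range(max_iter):
        residual = np.where(mask, H_hat - Y, 0)
        if np.linalg.norm(residual) <= eps:       # stopping rule of Step 2
            break
        X = H_hat - eta * residual                # Step 3
        U, s, Vh = np.linalg.svd(X, full_matrices=False)
        H_hat = (U[:, :rank] * s[:rank]) @ Vh[:rank]   # Steps 4 and 5
    return H_hat
```

With sampling density `p = mask.mean()`, the empirical step size from the text corresponds to `eta = 1 / ((1 + delta0) * p)` for some `delta0` between 0 and 1/3.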

4.1.4. GCG-Alt Estimator

In [20], a generalized conditional gradient framework with alternating minimization (GCG-Alt) is developed for the MC problem. The nuclear norm is used to promote low-rankness as:

min_{\hat{H}} \frac{1}{2} {∥ P_{Ω} (\hat{H} - \tilde{H}) ∥}_{F}^{2} + μ {∥ \hat{H} ∥}_{*}

(15)

where $μ > 0$ is a regularization factor. Let:

f (\hat{H}) ≜ \frac{1}{2} {∥P_{Ω} (\hat{H}) - P_{Ω} (\tilde{H})∥}_{F}^{2} .

(16)

At the $k$-th iteration, the update direction is obtained from the gradient of $f (\hat{H})$ as [23]:

X_{k} = u_{k - 1} v_{k - 1}^{H},

(17)

where $(u_{k - 1}, v_{k - 1})$ is the top singular vector pair of $\nabla f ({\hat{H}}_{k - 1}) = P_{Ω} ({\hat{H}}_{k - 1} - \tilde{H})$, which can be found iteratively. The channel matrix is updated as:

{\hat{H}}_{k} = (1 - η_{k}) {\hat{H}}_{k - 1} + θ_{k} X_{k},

(18)

where $η_{k} \in [0, 1]$ is the step size and $θ_{k}$ is adaptively chosen.

By using a property of the nuclear norm [20], the optimization problem can be reformulated as the minimization of:

\tilde{ϕ} (U, V) ≜ f (U V^{H}) + \frac{1}{2} μ ({∥ U ∥}_{F}^{2} + {∥ V ∥}_{F}^{2}),

(19)

where $U \in C^{N_{MS} \times \hat{r}}$ and $V \in C^{N_{BS} \times \hat{r}}$, with $\hat{r}$ being the rank of $\hat{H}$. Alternating minimization can then be used to optimize Equation (19). The details of solving the alternating minimization problem can be found in [20]. The overall algorithm is summarized in Algorithm 4.

Algorithm 4 GCG-Alt Estimator

Require: $P_{Ω} (\tilde{H})$ , $μ$ , $ϵ$ , $ϵ_{a}$

1: Initialization: $U_{0} = ⌀, V_{0} = ⌀, k = 0, {\hat{H}}_{k} = 0, ϵ_{0} = \infty$ , $δ_{k}^{2} = {∥P_{Ω} (\tilde{H})∥}_{F}^{2}$
2: while $δ_{k}^{2} > (N + \sqrt{8 N}) σ^{2}$ do
3: Compute the top singular vector pair $(u_{k}, v_{k})$ of $P_{Ω} ({\hat{H}}_{k} - \tilde{H})$
4: $k = k + 1$
5: $η_{k} \leftarrow 2 / (k + 1)$
6: $x_{k Ω} = vec (P_{Ω} (u_{k - 1} v_{k - 1}^{H}))$
7: ${\tilde{h}}_{Ω} = vec (P_{Ω} (\tilde{H}))$
8: ${\hat{h}}_{k Ω} = vec (P_{Ω} ({\hat{H}}_{k - 1}))$
9: $θ_{k} = \frac{2 R (x_{k Ω}^{H} {\tilde{h}}_{Ω}) - (1 - η_{k}) x_{k Ω}^{H} {\hat{h}}_{k Ω} - 2 μ}{2 x_{k Ω}^{H} x_{k Ω}}$
10: $U_{k} = [\sqrt{1 - η_{k}} U_{k - 1}, \sqrt{θ_{k}} u_{k - 1}]$
11: $V_{k} = [\sqrt{1 - η_{k}} V_{k - 1}, \sqrt{θ_{k}} v_{k - 1}]$
12: Initialization: $i = 0, ϵ_{k}^{0} = \infty$ , $(U_{k}^{0}, V_{k}^{0}) \leftarrow (U_{k}, V_{k})$ .
13: while $ϵ_{k}^{i} > ϵ_{a}$ do
14: $i = i + 1$
15: Find $V_{k}^{i}$ that minimizes Equation (19) given $U = U_{k}^{i - 1}$ ;
16: Find $U_{k}^{i}$ that minimizes Equation (19) given $V = V_{k}^{i}$ ;
17: Calculate $ϵ_{k}^{i} = \frac{\tilde{ϕ} (U_{k}^{i - 1}, V_{k}^{i - 1}) - \tilde{ϕ} (U_{k}^{i}, V_{k}^{i})}{\tilde{ϕ} (U_{k}^{i - 1}, V_{k}^{i - 1})}$
18: end while
19: $(U_{k}, V_{k}) \leftarrow (U_{k}^{i}, V_{k}^{i})$ , ${\hat{H}}_{k} = U_{k} V_{k}^{H}$
20: Calculate $ϵ_{k} = \frac{{∥{\hat{H}}_{k}∥}_{F}^{2} - {∥{\hat{H}}_{k - 1}∥}_{F}^{2}}{{∥{\hat{H}}_{k - 1}∥}_{F}^{2}}$
21: $δ_{k}^{2} = {∥P_{Ω} ({\hat{H}}_{k} - \tilde{H})∥}_{F}^{2}$
22: end while
23: return $\hat{H} = U_{k} V_{k}^{H}$

The above MC methods have different computational complexities. SVT, SVP, and FPC all need the SVD, which can be implemented using PROPACK [35], based on the iterative Lanczos bidiagonalization algorithm with partial reorthogonalization. FPC has a higher complexity because the SVD is repeated for different values of $μ_{m}$. GCG-Alt has the lowest complexity because the full SVD is not required. SVP is effective for large matrix completion problems with very low ranks, while the FPC, SVT, and GCG-Alt estimators allow higher ranks. The SVP estimator requires rank knowledge, while the FPC and GCG-Alt estimators implicitly determine the rank through the choice of regularization parameters or thresholds.
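The reformulation in Equation (19) rests on a standard variational form of the nuclear norm: ${∥ H ∥}_{*}$ equals the minimum of $\frac{1}{2} ({∥ U ∥}_{F}^{2} + {∥ V ∥}_{F}^{2})$ over all factorizations $H = U V^{H}$. The NumPy check below illustrates this numerically on a random low-rank matrix; the sizes and the rescaling matrix `G` are arbitrary choices made for this example.

```python
import numpy as np

rng = np.random.default_rng(3)
left = rng.standard_normal((6, 2)) + 1j * rng.standard_normal((6, 2))
right = rng.standard_normal((2, 8)) + 1j * rng.standard_normal((2, 8))
H = left @ right                                   # rank-2 test matrix

U_s, s, Vh = np.linalg.svd(H, full_matrices=False)
nuclear_norm = s.sum()

# Balanced factors U = U_s sqrt(S), V = V_s sqrt(S), so that H = U V^H.
U = U_s * np.sqrt(s)
V = Vh.conj().T * np.sqrt(s)
balanced_cost = 0.5 * (np.linalg.norm(U) ** 2 + np.linalg.norm(V) ** 2)

# A rescaled factorization of the same H is never cheaper.
G = np.diag([2.0, 0.5] + [1.0] * (len(s) - 2))     # invertible rescaling
U2, V2 = U @ G, V @ np.linalg.inv(G).conj().T
unbalanced_cost = 0.5 * (np.linalg.norm(U2) ** 2 + np.linalg.norm(V2) ** 2)
```

The balanced factors attain the nuclear norm exactly, which is why penalizing the Frobenius norms of $U$ and $V$ promotes low rank without computing an SVD of $\hat{H}$.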

4.2. MC-Based Hybrid Estimators

Next we discuss two MC-based hybrid methods that jointly exploit the sparsity and low-rankness of the channel.

4.2.1. ADMM Estimator

In [17], the low-rankness of $\tilde{H}$ and the sparsity of the beamspace channel ${\tilde{H}}_{v}$ are jointly exploited and an ADMM method is proposed. Leveraging the side information that $H$ has a sparse virtual representation given by Equation (5), the channel estimation problem is formulated following [36] as:

\begin{matrix} min_{\hat{H}, {\hat{H}}_{v}} τ_{L} {∥ \hat{H} ∥}_{*} + τ_{S} {∥ {\hat{H}}_{v} ∥}_{1} \\ s.t. \quad P_{Ω} (\hat{H}) = P_{Ω} (\tilde{H}) \ and \ \hat{H} = A_{MS} {\hat{H}}_{v} A_{BS}^{H} \end{matrix}

(20)

where the nuclear norm and the $l_{1}$-norm, together with the regularization parameters $τ_{L}$ and $τ_{S}$, are used to promote low-rankness and sparsity, respectively. The above problem is then reformulated by incorporating the constraints as penalty terms:

\begin{matrix} min_{\hat{H}, E, {\hat{H}}_{v}, C} τ_{L} {∥ \hat{H} ∥}_{*} + τ_{S} {∥{\hat{H}}_{v}∥}_{1} + \frac{1}{2} {∥ C ∥}_{F}^{2} + \frac{1}{2} {∥P_{Ω} (E - \tilde{H})∥}_{F}^{2} \\ s.t. \quad \hat{H} = E \ and \ C = E - A_{MS} {\hat{H}}_{v} A_{BS}^{H}, \end{matrix}

(21)

where $E \in C^{N_{MS} \times N_{BS}}$ and $C$ are two auxiliary matrix variables. The data-fit penalty compares the sampled entries of $E$ with the observations $P_{Ω} (\tilde{H})$, consistent with the first constraint in Equation (20) and with the term $A^{H} vec (P_{Ω} (\tilde{H}))$ in Step 4 of Algorithm 5. This problem is then solved by using ADMM, which involves iterative updates of the variables and Lagrangian multipliers. The augmented Lagrangian function of Equation (21) is given by:

\begin{array}{rl} L_{1} (\hat{H}, E, {\hat{H}}_{v}, C, Z_{1}, Z_{2}) ≜ & τ_{L} {∥ \hat{H} ∥}_{*} + τ_{S} {∥ {\hat{H}}_{v} ∥}_{1} + \frac{1}{2} {∥ C ∥}_{F}^{2} + \frac{1}{2} {∥P_{Ω} (E - \tilde{H})∥}_{F}^{2} \\ & + tr (Z_{1}^{H} (\hat{H} - E)) + \frac{t}{2} {∥ \hat{H} - E ∥}_{F}^{2} \\ & + tr (Z_{2}^{H} (E - A_{MS} {\hat{H}}_{v} A_{BS}^{H} - C)) \\ & + \frac{t}{2} {∥E - A_{MS} {\hat{H}}_{v} A_{BS}^{H} - C∥}_{F}^{2}, \end{array}

(22)

where $Z_{1}$ and $Z_{2} \in C^{N_{MS} \times N_{BS}}$ are dual variables (the Lagrange multipliers) and $t > 0$ is the step size. The estimator is summarized in Algorithm 5, where:

$τ = ρ_{1} ∥P_{Ω} (\tilde{H})∥$ with $ρ_{1} = \frac{3 N}{N_{BS} N_{MS}}$ in Step 3;
$z_{i}$ denotes the vectorization of $Z_{i}$ , and similarly for other variables;
$A ≜ \sum_{i = 1}^{N_{MS}} diag {({[Ω_{*}]}_{i})}^{T} \otimes I_{i i}$ , where $Ω_{*} \in \{0, 1\}^{N_{MS} \times N_{BS}}$ is composed of $N$ ones and $N_{BS} N_{MS} - N$ zeros, a value of 1 indicates the position of a sample from the channel matrix, ${[Ω_{*}]}_{i}$ denotes the $i$-th row of $Ω_{*}$ , and $I_{i i}$ is the $N_{MS} \times N_{MS}$ matrix whose $(i, i)$ -th entry is 1 and whose remaining entries are 0 [17];
The parameters in Equation (20) are chosen empirically as $τ_{L} = t {∥P_{Ω} (\tilde{H})∥}_{2}$ and $τ_{S} = \frac{0.1}{1 - 10 log (σ_{n}^{2})}$ , where $σ_{n}^{2}$ is the noise power.

The computational cost of the ADMM algorithm is mainly due to the SVD in Step 3 and the matrix operations in Steps 4, 5, 6, and 7.

Algorithm 5 ADMM Estimator

Require: $P_{Ω} (\tilde{H})$ , $t$ , $τ_{L}$ , $τ_{S}$ , $J_{ADMM}$ , $A$ , $Ψ$ , $τ$ .

1: Initialization: ${\hat{H}}^{0} = {\hat{H}}_{v}^{0} = C^{0} = E^{0} = Z_{2}^{0} = Z_{1}^{0} = 0$ ; their vectorizations ${\hat{h}}^{0} = {\hat{h}}_{v}^{0} = c^{0} = e^{0} = z_{2}^{0} = z_{1}^{0} = 0$ ;
2: for $ℓ = 0, 1, \dots, J_{ADMM} - 1$ do
3: Update ${\hat{H}}^{ℓ + 1} = U_{A} S_{τ} (S_{A}) V_{A}^{H}$ , where $U_{A} S_{A} V_{A}^{H}$ is the SVD of $E^{ℓ} - \frac{1}{t} Z_{1}^{ℓ}$
4: Update $e^{ℓ + 1} = {(A^{H} A + 2 t I)}^{- 1} (z_{1}^{ℓ} + t {\hat{h}}^{ℓ + 1} + A^{H} vec (P_{Ω} (\tilde{H})) + z_{2}^{ℓ} + t c^{ℓ} + t Ψ {\hat{h}}_{v}^{ℓ})$
5: Update ${\hat{h}}_{v}^{ℓ + 1} = sign (R (v^{ℓ + 1})) \circ max (|R (v^{ℓ + 1})| - τ_{S}^{'}, 0) + j \, sign (I (v^{ℓ + 1})) \circ max (|I (v^{ℓ + 1})| - τ_{S}^{'}, 0)$ , with $v^{ℓ + 1} = vec (V^{ℓ + 1})$ , $V^{ℓ + 1} ≜ A_{MS}^{H} (\frac{1}{t} Z_{2}^{ℓ} - C^{ℓ} + E^{ℓ + 1}) A_{BS}$ and $τ_{S}^{'} ≜ τ_{S} / t$ .
6: Update $C^{ℓ + 1} = \frac{t}{t + 1} (E^{ℓ + 1} - A_{MS} {\hat{H}}_{v}^{ℓ + 1} A_{BS}^{H} + \frac{1}{t} Z_{2}^{ℓ})$
7: Update $Z_{1}^{ℓ + 1} = Z_{1}^{ℓ} + t (P_{Ω} (E^{ℓ + 1} - \tilde{H}))$ and $Z_{2}^{ℓ + 1} = Z_{2}^{ℓ} + t (E^{ℓ + 1} - A_{MS} {\hat{H}}_{v}^{ℓ + 1} A_{BS}^{H} - C^{ℓ + 1})$
8: end for;
9: return $\hat{H} = {\hat{H}}^{J_{ADMM}}$
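The sparsity-promoting update in Step 5 applies soft-thresholding separately to the real and imaginary parts of $v^{ℓ + 1}$. A minimal NumPy sketch is given below; it assumes that the thresholded imaginary part is recombined with the factor $j$ so that the result stays complex, and the function names are hypothetical.

```python
import numpy as np

def soft(x, tau):
    """Real-valued soft-thresholding: sign(x) * max(|x| - tau, 0)."""
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)

def soft_threshold_re_im(v, tau):
    """Threshold the real and imaginary parts of v independently."""
    return soft(v.real, tau) + 1j * soft(v.imag, tau)

v = np.array([3.0 + 0.2j, -0.5 - 2.0j])
h_v = soft_threshold_re_im(v, tau=1.0)   # components below the threshold vanish
```

In Algorithm 5 the threshold would be $τ_{S}^{'} = τ_{S} / t$, so a larger $τ_{S}$ or a smaller $t$ yields a sparser beamspace estimate.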

4.2.2. Two-Stage Estimator

The two-stage estimator [16] also exploits the sparsity and low-rankness of the channel matrix. In the first stage, MC is applied to provide a denoised channel estimate based on the low-rankness of the channel matrix. The second stage then employs CS to refine the estimate based on the array response and the virtual representation of the channel matrix.

The SVT algorithm [25] introduced above solves the low-rank matrix completion problem:

min_{\hat{H}} rank (\hat{H}), \quad s.t. \quad {∥ P_{Ω} (\hat{H}) - P_{Ω} (\tilde{H}) ∥}_{F}^{2} \leq δ_{n}^{2} .

(23)

The fast iterative shrinkage-thresholding algorithm (FISTA) proposed in [37] is used to solve the sparse vector recovery problem:

min_{vec ({\hat{H}}_{v})} {∥ Ψ vec ({\hat{H}}_{v}) - vec (\hat{H}) ∥}_{F}^{2} + λ {∥ vec ({\hat{H}}_{v}) ∥}_{1} .

(24)

The estimator is summarized in Algorithm 6, where:

The parameters of the first stage, SVT, are the same as in Algorithm 1;
$Ψ = A_{BS}^{*} \otimes A_{MS}$ ;
$λ$ is a constant regularization parameter, e.g., $λ = 0.001$ ;
$λ_{max}$ is the top eigenvalue of $Ψ^{H} Ψ$ .

The complexity of SVT in the first stage has been analyzed above. The cost of the FISTA algorithm mainly consists of applying the sensing matrix in Step 4, which has a complexity of $O (N_{BS}^{2} N_{MS}^{2})$.

Algorithm 6 Two-Stage Estimator

Require: $P_{Ω} (\tilde{H})$

1: Use SVT to recover $\hat{H}$ as $min_{\hat{H}} {∥ \hat{H} ∥}_{*} \ s.t. \ {∥ P_{Ω} (\hat{H}) - P_{Ω} (\tilde{H}) ∥}_{F}^{2} \leq δ_{n}^{2}$

Require: $Ψ, vec (\hat{H}), λ, J_{FISTA}$ , the top eigenvalue $λ_{max}$ of $Ψ^{H} Ψ$

2: Initialize $y_{1} = x_{1} = 0$ , $t_{1} = 1$
3: for $i = 1 : J_{FISTA}$ do
4: $c_{i} = y_{i} - (1 / λ_{max}) Ψ^{H} (Ψ y_{i} - vec (\hat{H}))$
5: $x_{i + 1} = (max (abs (c_{i}) - 2 λ λ_{max}, 0)) sign (c_{i})$
6: $t_{i + 1} = \frac{1 + \sqrt{1 + 4 t_{i}^{2}}}{2}$
7: $y_{i + 1} = x_{i + 1} + (\frac{t_{i} - 1}{t_{i + 1}}) (x_{i + 1} - x_{i})$
8: $x_{i} \leftarrow x_{i + 1}$
9: $ϵ_{i + 1} = {∥ Ψ x_{i} - vec (\hat{H}) ∥}_{2}$
10: if $abs (ϵ_{i + 1} - ϵ_{i}) \leq 10^{- 6}$ then
11: Break
12: end if
13: end for
14: return $\hat{H} = A_{MS} {vec}^{- 1} (x_{i}) A_{BS}^{H}$
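For reference, the second stage can be sketched with a textbook FISTA iteration. The version below is an assumption-laden illustration rather than a transcription of Algorithm 6: it minimizes $\frac{1}{2} {∥ Ψ x - b ∥}_{2}^{2} + λ {∥ x ∥}_{1}$, so the gradient step is $1 / λ_{max}$ and the threshold is $λ / λ_{max}$, which differs from the scaling written in Step 5. It also runs a fixed number of iterations instead of the early-stopping test.

```python
import numpy as np

def fista_l1(Psi, b, lam, n_iter=100):
    """Textbook FISTA for 0.5 * ||Psi x - b||_2^2 + lam * ||x||_1 (complex x)."""
    L = np.linalg.norm(Psi, 2) ** 2           # top eigenvalue of Psi^H Psi
    x = np.zeros(Psi.shape[1], dtype=complex)
    y, t = x.copy(), 1.0
    for _ in range(n_iter):
        c = y - Psi.conj().T @ (Psi @ y - b) / L          # gradient step
        # Complex soft-thresholding: shrink the magnitude, keep the phase.
        mag = np.maximum(np.abs(c), 1e-300)
        x_new = c * np.maximum(1.0 - (lam / L) / mag, 0.0)
        t_new = (1.0 + np.sqrt(1.0 + 4.0 * t * t)) / 2.0
        y = x_new + ((t - 1.0) / t_new) * (x_new - x)     # momentum step
        x, t = x_new, t_new
    return x
```

The magnitude-based shrinkage avoids calling a sign function on complex numbers, whose definition depends on the NumPy version. Given the output `x`, the refined channel estimate would be obtained by reshaping `x` column-wise into an $N_{1} \times N_{2}$ matrix and multiplying by the dictionaries as in Step 14.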

5. Numerical Results

Consider switch-based MIMO systems over a mmWave channel at 90 GHz. Unless otherwise specified, the number of clusters is $C \sim max (Poisson (1.8), 1)$ and the number of rays is $R \sim U [1, 20]$ (the total number of paths is $L = C R$); the AoDs and AoAs follow Laplace distributions with a standard deviation of $15^{\circ}$ [38]; the sub-power of the clusters is $γ_{c} = 1$; the ULAs at the MS and BS have $N_{MS} = 32$ and $N_{BS} = 128$ antennas, respectively; and $N_{RF} = 4$ RF chains are used. Here, the CS method based on OMP is compared with the MC-based estimators. The parameter settings are as follows:

OMP: The unitary dictionary is set with $N_{1} = N_{MS} = 32$ and $N_{2} = N_{BS} = 128$ . The stopping threshold is set as $ϵ_{OMP} = 0.025 σ^{2}, 0.05 σ^{2}, 0.075 σ^{2}, 0.1 σ^{2}, 0.125 σ^{2}, 0.15 σ^{2}$ for $PNR = 0, 5, 10, 15, 20, 25$ dB, respectively [20];
SVT: $δ = 3.2$ , $ϵ = 10^{- 4}$ , $τ = 5 \sqrt{N_{MS} N_{BS}}$ , $J_{SVT} = 100$ , and $k_{0} = 5$ ;
SVP: $η = 0.5$ and $ϵ = 10^{- 4}$ ;
FPC: $μ_{final} = 0.01$ , $J_{FPC} = 100$ , $ϵ = 10^{- 4}$ , $η_{μ} = 0.25$ , and $δ = 1.99$ ;
GCG-Alt: $μ = σ_{n}^{2}$ , $ϵ = 0.01$ , and $ϵ_{a} = 0.1$ ;
ADMM: $τ_{L} = t {∥ P_{Ω} (\tilde{H}) ∥}_{2}$ , $t = 0.005$ , $τ_{S} = \frac{0.1}{1 - 10 log (σ_{n}^{2})}$ , $δ = 3.2$ ;
Two-Stage: $J_{FISTA} = 100$ , $λ = 0.001$ .

The normalized MSE (NMSE) is defined as:

NMSE = \frac{{∥ \hat{H} - H ∥}_{F}^{2}}{{∥ H ∥}_{F}^{2}}

where $\hat{H}$ is an estimate of the channel matrix $H$.
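The NMSE metric is straightforward to compute. The helper below is a minimal sketch; the decibel conversion `nmse_db` is added here for convenience because the comparisons that follow are quoted in dB.

```python
import numpy as np

def nmse(H_hat, H):
    """Normalized MSE: squared Frobenius error over squared Frobenius norm."""
    return np.linalg.norm(H_hat - H) ** 2 / np.linalg.norm(H) ** 2

def nmse_db(H_hat, H):
    """NMSE expressed in decibels."""
    return 10.0 * np.log10(nmse(H_hat, H))

H_true = np.ones((2, 2))
H_est = 1.1 * H_true          # 10% amplitude error on every entry
```

For this toy example the NMSE is 0.01, that is, -20 dB.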

5.1. Comparison of NMSE When There Are No Hardware Impairments

Figure 2 compares the estimators in terms of NMSE at different PNRs, where the PNR is defined as $PNR = 10 \log_{10} (\frac{ρ}{σ_{n}^{2}})$, for $N_{MS} = 32$, $N_{BS} = 128$, and $N_{RF} = 4$. The SVP algorithm performed worse than the others. The ADMM algorithm performed best, with an NMSE at least 5 dB better than that of the other estimators. The NMSE with $N_{BS} = 64$, $N_{MS} = 64$, and $N_{RF} = 8$ is shown in Figure 3. Figure 2 and Figure 3 show that the relative performance of the estimators is similar for different numbers of antennas.
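
To reproduce a given PNR in a simulation, the definition $PNR = 10 \log_{10}(ρ / σ_{n}^{2})$ can be inverted to obtain the noise variance. The Python sketch below is a minimal illustration; it assumes circularly symmetric complex Gaussian noise and a pilot power $ρ$ that defaults to 1, and all function names are hypothetical.

```python
import numpy as np

def noise_variance_from_pnr(pnr_db, rho=1.0):
    """Invert PNR = 10*log10(rho / sigma2) to obtain sigma2."""
    return rho / (10.0 ** (pnr_db / 10.0))

def pnr_db(rho, sigma2):
    """PNR in dB for pilot power rho and noise variance sigma2."""
    return 10.0 * np.log10(rho / sigma2)

def complex_noise(rng, shape, sigma2):
    """Circularly symmetric complex Gaussian noise with variance sigma2 (assumed noise model)."""
    re = rng.standard_normal(shape)
    im = rng.standard_normal(shape)
    return np.sqrt(sigma2 / 2.0) * (re + 1j * im)

# The PNR grid used in this section: 0, 5, ..., 25 dB
sigma2_grid = [noise_variance_from_pnr(p) for p in range(0, 30, 5)]
```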

We next show the performance with different numbers of channel paths. Here, we assume that the number of clusters is $C = 1$ and the number of rays $R$ is varied from 1 to 22, so that the total number of channel paths is $L = R$. From Figure 4, as the number of paths $L$ increases, the NMSE of the different algorithms degrades. The SVP algorithm is the most sensitive to the number of paths, while the OMP estimator is the most robust. The sampling ratio is also critical for performance. From Figure 5, the NMSE improves substantially with increasing sampling ratio. CS-based schemes such as the OMP and the two-stage algorithms are more advantageous when the sampling ratio is low.

5.2. NMSE Comparison When There Are Hardware Impairments

In practice, impairments of the antenna elements are inevitable and are typically time-varying, e.g., due to temperature changes or hardware aging [39]. Therefore, the array response may be severely impacted. In addition, because of mechanical tolerances and uncertainty regarding the precise position of the antenna phase center, the actual antenna positions may deviate from the assumed ideal array shape. Following [20], we define the gain and phase error vector at the BS as:

e_{BS} = {[β_{1} e^{j ω_{1}}, β_{2} e^{j ω_{2}}, \dots, β_{N_{BS}} e^{j ω_{N_{BS}}}]}^{T}

(25)

where $ω_{i}$ represents the phase error and $β_{i}$ denotes the amplitude gain of the $i$-th antenna element. The gain and phase error vector $e_{MS}$ at the MS is defined similarly to $e_{BS}$. Considering such errors, the received signal in Equation (2) can be expressed as:

\tilde{y} = W^{H} E_{MS} H E_{BS}^{H} F s + W^{H} E_{MS} \hat{n}

(26)

where $E_{MS}$ is a diagonal matrix with the entries of $e_{MS}$ as its diagonal elements, and $E_{BS}$ is defined similarly.

We carry out simulations to examine the robustness of the different approaches when there are phase and gain errors in the array response. Following [20], these errors are assumed to be uniformly distributed within a certain range and are characterized by the level of phase errors and the level of gain errors, respectively. The MC estimators were not affected by the phase or gain errors because they are independent of the array response vectors. The sparsity-based methods use the array response, which depends on the phase and gain information, and can therefore yield poor channel estimates when unknown gain and phase errors are present. These observations were validated by simulations. Figure 6 and Figure 7 compare the NMSE achieved with different levels of phase and gain errors: the MC estimators based on SVP, FPC, GCG-Alt, and SVT were insensitive to phase and gain errors, whereas the estimators exploiting CS, such as the one based on OMP and the hybrid methods (the ADMM and two-stage estimators), were more sensitive to them.
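
The impairment model of Equations (25) and (26) can be sketched in Python as follows. The uniform ranges used here (gains in $[1 - g, 1 + g]$ and phases in $[-p, p]$) are assumptions made for illustration, since the exact ranges are specified in [20] rather than in this section; the function names are hypothetical. The sketch also checks that multiplying by the diagonal matrices leaves the rank of the channel unchanged whenever all gains are nonzero, which is consistent with the observed insensitivity of the low-rank MC estimators.

```python
import numpy as np

def gain_phase_errors(rng, n, gain_level, phase_level_rad):
    """Error vector [beta_i * exp(j*omega_i)] for n antennas (assumed uniform ranges)."""
    beta = rng.uniform(1.0 - gain_level, 1.0 + gain_level, size=n)
    omega = rng.uniform(-phase_level_rad, phase_level_rad, size=n)
    return beta * np.exp(1j * omega)

def impaired_channel(H, e_ms, e_bs):
    """Compute E_MS H E_BS^H for diagonal E_MS = diag(e_ms), E_BS = diag(e_bs)."""
    # Row i is scaled by e_ms[i]; column j is scaled by conj(e_bs[j])
    return (e_ms[:, None] * H) * e_bs.conj()[None, :]

rng = np.random.default_rng(0)
n_ms, n_bs, rank = 8, 16, 2
# Toy rank-2 channel standing in for a low-rank mmWave channel
U = rng.standard_normal((n_ms, rank)) + 1j * rng.standard_normal((n_ms, rank))
V = rng.standard_normal((n_bs, rank)) + 1j * rng.standard_normal((n_bs, rank))
H = U @ V.conj().T
e_ms = gain_phase_errors(rng, n_ms, gain_level=0.2, phase_level_rad=0.3)
e_bs = gain_phase_errors(rng, n_bs, gain_level=0.2, phase_level_rad=0.3)
H_imp = impaired_channel(H, e_ms, e_bs)
```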

5.3. Computational Complexity

Finally, we compare the computational complexity of the different algorithms in Figure 8 and Figure 9. In general, the hybrid estimators (the ADMM and two-stage estimators) exhibited higher complexity because both involve SVD and apply sensing matrices of large size. In particular, the ADMM algorithm had the highest complexity. The MC estimators had moderate complexity that did not vary significantly with the PNR, and FPC exhibited higher complexity than SVT and SVP because it required more SVD operations. The GCG-Alt algorithm had the lowest complexity because it converges quickly and avoids the SVD operations used in the other MC algorithms such as SVP, SVT, and FPC. The computational complexity of the OMP estimator increased with the PNR because the number of paths recovered by the OMP increased with the PNR.

5.4. Performance with Line of Sight (LoS) Propagation

So far, we have focused on channels in which the different paths have the same average power. In practical applications, there may be line-of-sight (LoS) propagation, where a single path contributes a significant portion of the power gain [40,41]. In this case, the channel model in Equation (3) can be modified to:

H = \frac{1}{\sqrt{R}} \sum_{c = 1}^{C} \sum_{r = 1}^{R} g_{c r} a_{MS} (ϕ_{c r}^{MS}) a_{BS}^{H} (ϕ_{c r}^{BS}) + β a_{MS} (ϕ_{LoS}^{MS}) a_{BS}^{H} (ϕ_{LoS}^{BS}) .

(27)

In our simulations, we set $g_{c r} \sim CN (0, 0.5)$ and $β = \sqrt{C / 2}$. The NMSE results are shown in Figure 10. With a LoS path that dominates the power gain, the effective rank of the channel matrix may be reduced. The channel estimators considered are more effective in this case, leading to a lower NMSE than in the case where the LoS path is absent. The results also demonstrate the effectiveness of the MC-based methods in realistic scenarios.
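
A channel following Equation (27) could be generated as in the Python sketch below. It is an illustration only: the steering vector `ula_response` assumes a half-wavelength ULA with unit-modulus entries, which may differ in normalization from the array response defined earlier in the paper, and the function names and angle inputs are hypothetical. With $g_{cr} \sim CN(0, 0.5)$ and $β = \sqrt{C/2}$, the LoS term carries on average the same power as all the other paths combined, since the other paths contribute $\frac{1}{R} \cdot C R \cdot 0.5 = C/2$.

```python
import numpy as np

def ula_response(n, phi):
    """Half-wavelength ULA steering vector with unit-modulus entries (assumed form)."""
    return np.exp(1j * np.pi * np.arange(n) * np.sin(phi))

def los_channel(rng, n_ms, n_bs, aoa, aod, aoa_los, aod_los):
    """Channel of Equation (27); aoa and aod are (C, R) arrays of ray angles in radians."""
    C, R = aoa.shape
    # g_cr ~ CN(0, 0.5): real and imaginary parts each have variance 0.25
    g = 0.5 * (rng.standard_normal((C, R)) + 1j * rng.standard_normal((C, R)))
    H = np.zeros((n_ms, n_bs), dtype=complex)
    for c in range(C):
        for r in range(R):
            a_ms = ula_response(n_ms, aoa[c, r])
            a_bs = ula_response(n_bs, aod[c, r])
            H += g[c, r] * np.outer(a_ms, a_bs.conj())
    H /= np.sqrt(R)
    # LoS term with deterministic gain beta = sqrt(C / 2)
    beta = np.sqrt(C / 2.0)
    H += beta * np.outer(ula_response(n_ms, aoa_los),
                         ula_response(n_bs, aod_los).conj())
    return H
```

The resulting matrix has rank at most $CR + 1$, and when the LoS term dominates, most of the energy lies in a single singular value.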

6. Conclusions

In this paper, we compared the performance of several MC-based channel estimators for mmWave massive MIMO systems. The hybrid ADMM algorithm, which jointly exploits the low-rank property of the channel matrix and its sparsity in the angular domain, exhibited the best performance in general. However, it also exhibited the highest complexity among the estimators compared. The MC-based estimators (using GCG-Alt, SVT, SVP, or FPC) were robust against array impairments as they do not rely on the array response vectors. Among them, the GCG-Alt estimator exhibited the lowest complexity and favorable performance, and thus provides a competitive solution when the arrays are not perfectly calibrated.

In this work, we considered a point-to-point mmWave system. The estimators could also be applied to multiuser systems when orthogonal training schemes are deployed. The comparison of these methods when nonorthogonal training is used would be an interesting study for future work.

Author Contributions

Conceptualization, M.D., X.Y., and J.T.; Methodology, M.D. and X.Y.; Validation, R.H. and J.T.; Formal analysis, X.Y., R.H., and J.T.; Investigation, M.D., X.Y., and R.H.; Resources, M.D., Z.X. and J.X.; Data curation, X.Y. and R.H.; Writing—original draft preparation, X.Y. and R.H.; Writing—review and editing, Z.X., J.T., and J.X.; Visualization, X.Y.; Supervision, Z.X., J.T., and J.X.; Project administration, Z.X. and J.X.; Funding acquisition, Z.X., J.T., and J.X.

Funding

This research was funded by NSFC under Grant 61601325.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Rangan, S.; Rappaport, T.S.; Erkip, E. Millimeter-wave cellular wireless networks: Potentials and challenges. Proc. IEEE 2014, 102, 366–385.
2. Rebeiz, G. Large-scale millimeter-wave phased arrays for 5G systems. In Proceedings of the IEEE 16th Topical Meeting Silicon Monolithic Integrated Circuits in RF Systems (SiRF), Austin, TX, USA, 24–27 January 2016.
3. Rappaport, T.S.; Heath, R.W.; Daniels, R.C.; Murdock, J. Millimeter Wave Wireless Communications; Prentice-Hall: Englewood Cliffs, NJ, USA, 2014.
4. Heath, R.W.; González-Prelcic, N.; Rangan, S.; Roh, W.; Sayeed, A.M. An Overview of Signal Processing Techniques for Millimeter Wave MIMO Systems. IEEE J. Sel. Top. Signal Process. 2016, 10, 436–453.
5. Wei, L.; Hu, R.; Qian, Y.; Wu, G. Key elements to enable millimeter wave communications for 5G wireless systems. IEEE Wirel. Commun. 2014, 21, 136–143.
6. Eldar, Y.C.; Kutyniok, G. Compressed Sensing; Cambridge University Press: Cambridge, UK, 2009.
7. Lee, J.; Gil, G.T.; Lee, Y.H. Channel estimation via orthogonal matching pursuit for hybrid MIMO systems in millimeter wave communications. IEEE Trans. Commun. 2016, 64, 2370–2386.
8. Alkhateeb, A.; Ayach, O.E.; Leus, G.; Heath, R.W. Channel estimation and hybrid precoding for millimeter wave cellular systems. IEEE J. Sel. Top. Signal Process. 2014, 8, 831–846.
9. Gao, Z.; Dai, L.; Wang, Z.; Chen, S. Spatially common sparsity based adaptive channel estimation and feedback for FDD massive MIMO. IEEE Trans. Signal Process. 2015, 63, 6169–6183.
10. Sun, S.; Rappaport, T.S. Millimeter wave MIMO channel estimation based on adaptive compressed sensing. In Proceedings of the 2017 IEEE International Conference on Communications Workshops (ICC Workshops), Paris, France, 21–25 May 2017; pp. 47–53.
11. Xiao, Z.; Yin, C.; Xia, P.; Xia, X. Codebook design for millimeter-wave channel estimation with hybrid precoding structure. IEEE Trans. Wirel. Commun. 2017, 16, 141–153.
12. Zhu, D.; Choi, J.; Heath, R.W. Auxiliary beam pair enabled AoD and AoA estimation in closed-loop large-scale millimeter-wave MIMO systems. IEEE Trans. Wirel. Commun. 2017, 16, 4770–4785.
13. Gurbuz, A.C.; Yapici, Y.; Guvenc, I. Sparse channel estimation in millimeter-wave communications via parameter perturbed OMP. In Proceedings of the 2018 IEEE International Conference on Communications Workshops (ICC Workshops), Kansas City, MO, USA, 20–24 May 2018.
14. Ayach, O.E.; Rajagopal, S.; Abu-Surra, S.; Pi, Z.; Heath, R.W. Spatially sparse precoding in millimeter wave MIMO systems. IEEE Trans. Wirel. Commun. 2014, 13, 1499–1513.
15. Eliasi, P.A.; Rangan, S.; Rappaport, T.S. Low-rank spatial channel estimation for millimeter wave cellular systems. IEEE Trans. Wirel. Commun. 2017, 16, 2748–2759.
16. Li, X.; Fang, J.; Li, H.; Wang, P. Millimeter wave channel estimation via exploiting joint sparse and low-rank structures. IEEE Trans. Wirel. Commun. 2018, 17, 1123–1133.
17. Vlachos, E.; Alexandropoulos, G.C.; Thompson, J. Massive MIMO channel estimation for millimeter wave systems via matrix completion. IEEE Signal Process. Lett. 2018, 25, 1675–1679.
18. Sun, J.; Lu, J.; Liang, G.; Bi, J. A sparse interactive model for matrix completion with side information. Adv. Neural Inf. Process. Syst. 2016, 29, 4071–4079.
19. Hu, R.; Tong, J.; Xi, J.; Guo, Q.; Yu, Y. Robust channel estimation for switch-based mmWave MIMO systems. In Proceedings of the 2017 9th International Conference on Wireless Communications and Signal Processing (WCSP), Nanjing, China, 11–13 October 2017; pp. 1–7.
20. Hu, R.; Tong, J.; Xi, J.; Guo, Q.; Yu, Y. Matrix completion-based channel estimation for MmWave communication systems with array-inherent impairments. IEEE Access 2018, 6, 62915–62931.
21. Akdeniz, M.R.; Liu, Y.; Samimi, M.K.; Sun, S.; Rangan, S.; Rappaport, T.S.; Erkip, E. Millimeter wave channel modeling and cellular capacity evaluation. IEEE J. Sel. Areas Commun. 2014, 32, 1164–1179.
22. Jain, P.; Meka, R.; Dhillon, I.S. Guaranteed rank minimization via singular value projection. In Proceedings of the 23rd International Conference on Neural Information Processing Systems, Vancouver, BC, USA, 6–9 December 2010; pp. 937–945.
23. Yu, Y.; Carbonell, J.; Yu, A.W.; Ma, W.; Sra, S. Efficient structured matrix rank minimization. In Proceedings of the 28th Annual Conference on Neural Information Processing Systems (NIPS 2014), Montreal, QC, USA, 8–13 December 2014; pp. 1350–1358.
24. Yu, X.; Shen, J.C.; Zhang, J.; Letaief, K.B. Alternating minimization algorithms for hybrid precoding in millimeter wave MIMO systems. IEEE J. Sel. Top. Signal Process. 2016, 10, 485–500.
25. Cai, J.F.; Candès, E.J.; Shen, Z. A singular value thresholding algorithm for matrix completion. SIAM J. Optim. 2010, 20, 1956–1982.
26. Ma, S.; Goldfarb, D.; Chen, L. Fixed point and Bregman iterative methods for matrix rank minimization. Math. Program. 2009, 128, 321–353.
27. Xia, P.; Yong, S.K.; Oh, J.; Ngo, C. A practical SDMA protocol for 60 GHz millimeter wave communications. In Proceedings of the 2008 42nd Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, 26–29 October 2008; pp. 2019–2023.
28. Forenza, A.; Love, D.J.; Heath, R.W. Simplified spatial correlation models for clustered MIMO channels with different array configurations. IEEE Trans. Veh. Technol. 2007, 56, 1924–1934.
29. Rial, R.M.; Rusu, C.N.; Prelcic, G.; Alkhateeb, A.; Heath, R.W. Hybrid MIMO architectures for millimeter wave communications: Phase shifters or switches? IEEE Access 2016, 4, 247–267.
30. Bajwa, W.U.; Haupt, J.; Sayeed, A.M.; Nowak, R. Compressed channel sensing: A new approach to estimating sparse multipath channels. Proc. IEEE 2010, 98, 1058–1076.
31. Brady, J.; Behdad, N.; Sayeed, A.M. Beamspace MIMO for millimeter-wave communications: System architecture, modeling, analysis, and measurements. IEEE Trans. Antennas Propag. 2013, 61, 3814–3827.
32. Candes, E.J.; Plan, Y. Matrix completion with noise. Proc. IEEE 2010, 98, 925–936.
33. Weng, Z.; Wang, X. Low-rank matrix completion for array signal processing. In Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Kyoto, Japan, 25–30 March 2012; pp. 2697–2700.
34. Hale, E.T.; Yin, W.; Zhang, Y. Fixed-point continuation for ℓ₁-minimization: Methodology and convergence. SIAM J. Optim. 2008, 19, 1107–1130.
35. Larsen, R.M. Propack-Software for Large and Sparse SVD Calculations. Available online: http://sun.stanford.edu/~rmunk/PROPACK/ (accessed on 1 October 2019).
36. Chiang, K.Y.; Hsieh, C.; Dhillon, I.S. Matrix completion with noisy side information. Adv. Neural Inf. Proc. Syst. 2015, 28, 3447–3455.
37. Beck, A.; Teboulle, M. A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM J. Imag. Sci. 2009, 2, 183–202.
38. Park, S.; Park, J.; Yazdan, A.; Heath, R.W. Exploiting spatial channel covariance for hybrid precoding in massive MIMO systems. IEEE Trans. Signal Process. 2017, 65, 3818–3832.
39. Groschel, P.; Zarei, S.; Carlowitz, C.; Lipka, M.; Sippel, E.; Ali, A.; Weigel, R.; Schober, R.; Vossiek, M. A system concept for online calibration of massive MIMO transceiver arrays for communication and localization. IEEE Trans. Microw. Theory Technol. 2017, 65, 1735–1750.
40. Liu, L.; Hong, W.; Wang, H.; Yang, G.; Zhang, N.; Zhao, H.; Chang, J.; Yu, C.; Yu, X.; Tang, H.; et al. Characterization of Line-of-Sight MIMO Channel for Fixed Wireless Communications. IEEE Antennas Wirel. Prop. Lett. 2007, 6, 36–39.
41. Xue, C.; He, S.; Ou, F.; Wei, M.; Huang, Y.; Yang, L. Asymmetric Subarray Structure Design for mmWave LoS MIMO Communication Systems. In Proceedings of the 2016 IEEE/CIC International Conference on Communications in China (ICCC), Chengdu, China, 27–29 July 2016.

Figure 1. Switch-based mmWave (millimeter-wave) receiver.

Figure 2. NMSE (normalized mean square error) of the channel estimation in the ULA (uniform linear array) system with $N_{BS} = 128$, $N_{MS} = 32$, $N_{RF} = 4$, the sampling ratio $p = 0.375$, different PNRs (pilot-to-noise ratios), and without array impairments.

Figure 3. NMSE of the channel estimation in the ULA system with $N_{BS} = 64$, $N_{MS} = 64$, $N_{RF} = 8$, $p = 0.3281$, different PNRs, and without array impairments.

Figure 4. NMSE of the channel estimation in the ULA system with $N_{BS} = 128$, $N_{MS} = 32$, $N_{RF} = 4$, $p = 0.375$, $PNR = 20 dB$, different numbers of paths, and without array impairments.

Figure 5. NMSE of the channel estimation in the ULA system with $N_{BS} = 128$, $N_{MS} = 32$, $N_{RF} = 4$, $PNR = 20 dB$, different sampling ratios, and without array impairments.

Figure 6. NMSE of the channel estimation in the ULA system with $N_{BS} = 128$, $N_{MS} = 32$, $N_{RF} = 4$, $p = 0.375$, $PNR = 20 dB$, and different levels of phase errors.

Figure 7. NMSE of the channel estimation in the ULA system with $N_{BS} = 128$, $N_{MS} = 32$, $N_{RF} = 4$, $p = 0.375$, $PNR = 20 dB$, and different levels of gain errors.

Figure 8. Complexity comparison for the ULA system with different PNRs, $N_{BS} = 128$, $N_{MS} = 32$, $N_{RF} = 4$, and $p = 0.375$.

Figure 9. Zoomed-in section of Figure 8 with different PNRs, $N_{BS} = 128$, $N_{MS} = 32$, $N_{RF} = 4$, and $p = 0.375$.

Figure 10. NMSE of the channel estimation in the ULA system with $N_{BS} = 128$, $N_{MS} = 32$, $N_{RF} = 4$, the sampling ratio $p = 0.375$, different PNRs, and without array impairments.

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

