Loading web-font TeX/Math/Italic
Is NOMA Efficient in Multi-Antenna Networks? A Critical Look at Next Generation Multiple Access Techniques | IEEE Journals & Magazine | IEEE Xplore

Is NOMA Efficient in Multi-Antenna Networks? A Critical Look at Next Generation Multiple Access Techniques


Abstract:

In the past few years, a large body of literature has been created on downlink Non-Orthogonal Multiple Access (NOMA), employing superposition coding and Successive Interf...Show More

Abstract:

In the past few years, a large body of literature has been created on downlink Non-Orthogonal Multiple Access (NOMA), employing superposition coding and Successive Interference Cancellation (SIC), in multi-antenna wireless networks. Furthermore, the benefits of NOMA over Orthogonal Multiple Access (OMA) have been highlighted. In this paper, we take a critical and fresh look at the downlink Next Generation Multiple Access (NGMA) literature. Instead of contrasting NOMA with OMA, we contrast NOMA with two other multiple access baselines. The first is conventional Multi-User Linear Precoding (MU-LP), as used in Space-Division Multiple Access (SDMA) and multi-user Multiple-Input Multiple-Output (MIMO) in 4G and 5G. The second, called Rate-Splitting Multiple Access (RSMA), is based on multi-antenna Rate-Splitting (RS). It is also a non-orthogonal transmission strategy relying on SIC developed in the past few years in parallel and independently from NOMA. We show that there is some confusion about the benefits of NOMA, and we dispel the associated misconceptions. First, we highlight why NOMA is inefficient in multi-antenna settings based on basic multiplexing gain analysis. We stress that the issue lies in how the NOMA literature, originally developed for single-antenna setups, has been hastily applied to multi-antenna setups, resulting in a misuse of spatial dimensions and therefore loss in multiplexing gains and rate. Second, we show that NOMA incurs a severe multiplexing gain loss despite an increased receiver complexity due to an inefficient use of SIC receivers. Third, we emphasize that much of the merits of NOMA are due to the constant comparison to OMA instead of comparing it to MU-LP and RS baselines. We then expose the pivotal design constraint that multi-antenna NOMA requires one user to fully decode the messages of the other users. This design constraint is responsible for the multiplexing gain erosion, rate and spectral efficiency loss, ineffectiveness to serve...
Page(s): 1310 - 1343
Date of Publication: 10 June 2021
Electronic ISSN: 2644-125X

Funding Agency:


CCBY - IEEE is not the copyright holder of this material. Please follow the instructions via https://creativecommons.org/licenses/by/4.0/ to obtain full-text articles and stipulations in the API documentation.
SECTION I.

Introduction

Multiple access is a crucial part of any communication system and refers to techniques that make use of the resources (e.g., time, frequency, power, antenna, code) to serve multiple users, ideally in the most efficient way. In contrast to Orthogonal Multiple Access (OMA) that assigns users to orthogonal dimensions (e.g., Time-Division Multiple Access - TDMA, Frequency-Division Multiple Access - FDMA), (power-domain) Non-Orthogonal Multiple Access (NOMA)1 superposes users in the same time-frequency resource and distinguishes them in the power domain [1]–​[5]. By doing so, NOMA has been promoted as a solution for 5G and beyond to deal with the vast throughput, access (serving a large number of users), and Quality-of-Service (QoS) requirements that are projected to grow exponentially for the foreseeable future.

In the downlink, NOMA refers to communication schemes where at least one user is forced to fully decode the message(s) of other co-scheduled user(s). This operation is commonly performed through the use of transmit-side Superposition Coding (SC) and receiver-side Successive Interference Cancellation (SIC) in downlink multi-user communications. Such techniques have been studied for years before being branded with the NOMA terminology. NOMA has indeed been known in the information theory and wireless communications literature for several decades, under the terminology of superposition coding with successive interference cancellation (denoted in short as SC–SIC), as the strategy that achieves (and has been used in achievability proofs for) the capacity region of the Single-Input Single-Output (SISO) (Gaussian) Broadcast Channel (BC) [6]. The superiority of NOMA over OMA was shown in the seminal paper by Cover in 1972. It is indeed well known that the capacity region of the SISO BC (achieved by NOMA) is larger than the rate region achieved by OMA (i.e., contains the achievable rate region of OMA as a subset) [6], [8], [9]. The use of SIC receivers is a major difference between NOMA and OMA, although it should be mentioned that SIC has also been studied for a long time in the 3G and 4G research phases in the context of interference cancellation and receiver designs [10]. Unfortunately, despite the existence of well-established textbooks on the topic in the past few decades [7]–​[9], the recent literature on NOMA has been the subject of some confusion, misunderstandings, and misconceptions [11].

In today’s wireless networks, access points commonly employ more than one antenna, which opens the door to the spatial domain and multi-antenna processing. The key building block of the downlink of multi-antenna networks is the multi-antenna (Gaussian) BC. Contrary to the SISO BC that is degraded and where users can be ordered based on their channel strengths, the multi-antenna BC is nondegraded and users cannot be ordered based on their channel strengths [8], [12]. This is why SC–SIC/NOMA is not capacity-achieving in this case,2 and Dirty Paper Coding (DPC) is the only known strategy that achieves the capacity region of the multi-antenna (Gaussian) BC with perfect Channel State Information at the Transmitter (CSIT) [12]. Due to the high computational burden of DPC, linear precoding is often considered the most attractive alternative to simplify the transmitter design [13]–​[17]. Interestingly, in a multi-antenna BC, Multi-User Linear Precoding (MU–LP) relying on treating the residual multi-user interference as noise, although suboptimal, is often very useful since the interference can be significantly reduced by spatial precoding. This is the reason why it has received significant attention in the past twenty years and it is the basic principle behind numerous 4G and 5G techniques such as Space-Division Multiple Access (SDMA) and multi-user (potentially massive) Multiple-Input Multiple-Output (MIMO) [17].

In view of the benefits of NOMA over OMA and multi-antenna over single-antenna, numerous attempts have been made in recent years to combine multi-antenna and NOMA schemes [1]–​[5], [18]–​[44] (and references therein). Although there are a few contributions comparing NOMA with MU–LP schemes, such as Zero-Forcing Beamforming (ZFBF) or DPC [29], [41]–​[44], much emphasis is put on comparing (single/multi-antenna) NOMA and OMA, and showing that NOMA outperforms OMA. But there is a lack of emphasis on contrasting multi-antenna NOMA to other multi-user multi-antenna baselines developed for the multi-antenna BC, such as MU–LP (or other forms of multi-user MIMO techniques) and Rate-Splitting Multiple Access (RSMA) [45]. RSMA is a form of (power-domain) non-orthogonal transmission strategy based on multi-antenna Rate-Splitting (RS). RS designed for the multi-antenna BC also relies on SIC and has been developed in parallel and independently from NOMA [45]–​[51]. Such a comparison is essential to assess the benefits and the efficiency of NOMA, since all these communication strategies can be viewed as different achievable schemes for the multi-antenna BC and all aim in their own way for the same objective, namely to meet the throughput, reliability, QoS, and connectivity requirements of beyond-5G multi-antenna wireless networks.

In this paper, we take a critical look at multi-antenna NOMA and Next Generation Multiple Access (NGMA) techniques for the downlink of communication systems and ask the important questions “Is multi-antenna NOMA an efficient strategy?” and “What are the important design principles for NGMA techniques?” To answer those questions, we go beyond the conventional NOMA vs. OMA comparison, and contrast multi-antenna NOMA with MU–LP and RS strategies. This allows us to highlight some misconceptions and shortcomings of multi-antenna NOMA. Explicitly, we show that in most scenarios the short answer to the first question is no, and demonstrate based on first principles and numerical performance evaluations why this is the case. Our discussions and results unveil the scenarios where MU–LP outperforms NOMA and vice versa, and demonstrate that multi-antenna NOMA is inefficient compared to RS. By contrasting multi-antenna NOMA to MU–LP and RS, we show that there is some confusion about multi-antenna NOMA and its merits and expose major misconceptions. Our results and discussions also reveal new insights and perspectives for the design of NGMA techniques.

The contributions of this paper are summarized as follows.

First, we analytically derive both the sum multiplexing gain as well as the max-min fair multiplexing gain of multi-antenna NOMA and compare them to those of MU–LP and RS. The scenarios considered are very general and include multi-antenna transmitter with single/multi-antenna receivers, perfect and imperfect CSIT, and underloaded and overloaded regimes. On the one hand, multi-antenna NOMA can achieve gains, but can also incur losses compared to MU–LP. On the other hand, multi-antenna NOMA always leads to a waste of multiplexing gain compared to RS. This multiplexing gain loss translates in a spectral efficiency loss and in an inability to serve a large number of users. The multiplexing gain analysis provides a firm theoretical ground to infer that multi-antenna NOMA is not as efficient as RS in exploiting the spatial dimensions and the available CSIT, and in serving a large number of users. This analysis is instrumental to identify the scenarios where the multiplexing gain gaps among NOMA, MU–LP, and RS are the smallest/largest, therefore highlighting deployments that are suitable/unsuitable for the different multiple access strategies.

Second, we show that multi-antenna NOMA leads to a high receiver complexity due to the inefficient use of SIC. For instance, we show that the higher the number of SIC operations (and therefore the higher the receiver complexity) in multi-antenna NOMA, the lower the sum multiplexing gain (and therefore the lower the sum-rate at high Signal-to-Noise Ratio SNR). Comparison with MU–LP and RS show that higher multiplexing gains can be achieved and a larger number of users can be served at a lower receiver complexity and a reduced number of SIC operations. Indeed, our results show that NOMA requires $K-1$ SIC layers to support $K$ users with $M$ transmit antennas, while RS can support $M-1+K$ users with only one SIC layer.

Third, we show that most of the misconceptions behind NOMA are due to the prevalent comparison to OMA instead of comparing to MU–LP and RS. We show and explain that the misconceptions, the multiplexing gain reduction, and the inefficient use of SIC receivers in both underloaded and overloaded multi-antenna settings relying on both perfect and imperfect CSIT originate from a limitation of the multi-antenna NOMA design philosophy, namely that one user is forced to fully decode the messages of the other users. Hence, while forcing a user to fully decode the messages of the other users is an efficient approach in single-antenna degraded BC, it may not be an efficient approach in multi-antenna networks.

Fourth, we stress that an efficient design of non-orthogonal transmission and multiple access/NGMA strategies ensures that the use of SIC never leads to a performance loss but rather leads to a performance gain over MU–LP. We show that such non-orthogonal solutions based on RS exist and truly benefit from the multi-antenna multiplexing gain and from the use of SIC receivers in both underloaded and overloaded regimes relying on perfect and imperfect CSIT. In fact, multi-antenna RS completely resolves the design limitations of multi-antenna NOMA. Consequently, RS with only one SIC layer can achieve higher spectral efficiency and support a larger number of users than NOMA with multiple SIC layers.

Fifth, we depart from the multiplexing gain analysis and design the transmit precoders to maximize the sum-rate and max-min rate for multi-antenna NOMA, followed by numerically comparing the sum-rate and the max-min fair rate of NOMA to those of MU–LP and RS. We show that the multiplexing gain analysis is accurate and instrumental to predict the rate performance of the multiple access strategies considered.

Sixth, our numerical simulations confirm the inefficiency of multi-antenna NOMA in general settings. Multi-antenna NOMA is shown to lead to performance gains over MU–LP in some settings but also to losses in other settings despite the use of SIC receivers and a higher receiver complexity. Our results also highlight the significant benefits, performance-wise and receiver complexity-wise, of RSMA and multi-antenna RS over multi-antenna NOMA. It is indeed possible to achieve a significantly better performance than MU–LP and NOMA with just one layer of SIC by adopting RS so as to partially decode messages of other users (instead of fully decoding them as in NOMA).

Organization: The remainder of this paper is organized as follows. Section II introduces two-user Multiple-Input Single-Output (MISO) NOMA (with single-antenna receivers) as a basic building block (and toy example) for our subsequent studies, compares to MU–LP, and raises some questions about the efficiency of NOMA. Section III studies the multiplexing gain of $K$ -user MISO NOMA with perfect CSIT. Sections IV and V extend the discussion to imperfect CSIT and MIMO NOMA, respectively. Section VI and Section VII study the multiplexing gains of the baseline schemes considered, namely MU–LP and RS, respectively. Section VIII compares the multiplexing gains of all multiple access schemes considered and exposes the misconceptions and shortcomings of multi-antenna NOMA. Section IX provides simulation results. Section X concludes this paper, discusses future research and pathways to 6G standardization. An overview of the paper is shown in Table 1.

TABLE 1 Overview of the Paper
Table 1- 
Overview of the Paper

Notation: $|\cdot |$ refers to the absolute value of a scalar or to the cardinality of a set depending on the context. $\|\cdot \|$ refers to the $l_{2}$ -norm of a vector. $\max \{a_{1},\ldots, a_{n}\}$ refers to the maximum value between $a_{1}$ to $a_{n}$ . $\mathbf {a}^{H}$ denotes the Hermitian transpose of vector $\mathbf {a}$ . $\textrm {Tr}(\mathbf {Q})$ refers to the trace of matrix $\mathbf {Q}$ . $\mathbf {I}$ is the identity matrix. $P\nearrow $ means as $P$ grows large. $\mathcal {CN}(0,\sigma ^{2})$ denotes the circularly symmetric complex Gaussian distribution with zero mean and variance $\sigma ^{2}$ . ~ stands for “distributed as”. $O(\cdot)$ refers to the big O notation. $\mathbb {E}\{\cdot \}$ denotes statistical expectation. $A\cap B$ and $A\cup B$ refer to the intersection ($A$ and $B$ have to be satisfied) and the union ($A$ or $B$ to be satisfied) of two sets/events $A$ and $B$ , respectively.

SECTION II.

Two-User MISO NOMA With Perfect CSIT: The Basic Building Block

We commence by studying two-user MISO NOMA and show that, by comparing NOMA to MU–LP instead of to OMA, the potential merits of NOMA are less obvious. Limited to two single-antenna users with perfect CSIT, this system model illustrates the simplest though fundamental building block of multi-antenna NOMA.

A. System Model

We consider a downlink single-cell multi-user multi-antenna scenario with $K=2$ users, also known as two-user MISO BC, consisting of one transmitter with $M\geq 2$ antennas3 communicating with two single-antenna users. The transmitter aims to transmit simultaneously two messages $W_{1}$ and $W_{2}$ intended for user-1 and user-2, respectively.

The transmitter adopts the so-called multi-antenna NOMA or MISO NOMA strategy, illustrated in Fig. 1, that encodes one of the two messages using a codebook shared by both users4 so that it can be decoded and cancelled from the received signal at the co-scheduled user (following the same principle as superposition coding for the degraded BC). Consider $W_{2}$ is encoded into $s_{2}$ using the shared codebook and $W_{1}$ is encoded into $s_{1}$ . The two streams are then linearly precoded by $M \times 1$ precoders5$\mathbf {p}_{1}$ and $\mathbf {p}_{2}$ and superposed at the transmitter so that the transmit signal is given by \begin{equation*} \mathbf {x}=\mathbf {p}_{1} s_{1}+\mathbf {p}_{2} s_{2}.\tag{1}\end{equation*} View SourceRight-click on figure for MathML and additional features. Defining $\mathbf {s}=[s_{1},s_{2}]^{T}$ and assuming that $\mathbb {E}[\mathbf {s}\mathbf {s}^{H}]=\mathbf {I}$ , the average transmit (sum) power constraint is written as $P_{1}+P_{2}\leq P$ where $P_{k}=\|\mathbf {p}_{k}\|^{2}$ with $k=1,2$ .

FIGURE 1. - Two-user system architecture with NOMA (decoding order: user-
$2\rightarrow $
user-1).
FIGURE 1.

Two-user system architecture with NOMA (decoding order: user-$2\rightarrow $ user-1).

The channel vector for user $k$ is denoted by $\mathbf {h}_{k}$ , and the received signal at user $k$ can be written as $y_{k}=\mathbf {h}_{k}^{H} \mathbf {x}+n_{k}$ , $k=1,2$ , where $n_{k}\sim \mathcal {CN}(0,1)$ is Additive White Gaussian Noise (AWGN). We assume perfect CSIT and perfect Channel State Information at the Receivers (CSIR).

At both users, stream $s_{2}$ is decoded first into6$\widehat {W}_{2}$ by treating the interference from $s_{1}$ as noise. Using SIC at user-1, $\widehat {W}_{2}$ is re-encoded, precoded, and subtracted from the received signal, such that user-1 can decode its stream $s_{1}$ into $\widehat {W}_{1}$ . Assuming proper Gaussian signaling and perfect SIC,7 the achievable rates of the two streams with MISO NOMA are given by8\begin{align*} R_{1}^{(\text {N})}=&\log _{2}\left ({1+\left |{\mathbf {h}_{1}^{H} \mathbf {p}_{1}}\right |^{2}}\right), \tag{2}\\ R_{2}^{(\text {N})}=&\min \left ({\log _{2}\left ({1+A}\right),\log _{2}\left ({1+B}\right)}\right), \tag{3}\end{align*} View SourceRight-click on figure for MathML and additional features. where \begin{equation*} A=\frac {\left |{\mathbf {h}_{1}^{H} \mathbf {p}_{2}}\right |^{2}}{1+\left |{\mathbf {h}_{1}^{H} \mathbf {p}_{1}}\right |^{2}}, B=\frac {\left |{\mathbf {h}_{2}^{H} \mathbf {p}_{2}}\right |^{2}}{1+\left |{\mathbf {h}_{2}^{H} \mathbf {p}_{1}}\right |^{2}}.\tag{4}\end{equation*} View SourceRight-click on figure for MathML and additional features. In (3), $\log _{2}(1+A)$ is the rate supportable by the channel of user-1 when user-1 decodes $s_{2}$ and treats its own stream $s_{1}$ as noise. Similarly, $\log _{2}(1+B)$ is the rate supportable by the channel of user-2 when user-2 decodes its own stream $s_{2}$ while treating stream $s_{1}$ of user-1 as noise. The $\min $ in (3) is due to the fact that $s_{2}$ , though carrying message $W_{2}$ intended to user-2, is decoded by both users and is therefore transmitted at a rate decodable by both users.

The most common performance metric of a multi-user system is the sum-rate. In this two-user MISO NOMA system model, the sum-rate is defined as $R_{\text {s}}^{(\text {N})}=R_{1}^{(\text {N})}+R_{2}^{(\text {N})}$ and can be upper bounded9 as \begin{align*} R_{\text {s}}^{(\text {N})}\leq&\log _{2}\left ({1+\frac {\left |{\mathbf {h}_{1}^{H} \mathbf {p}_{2}}\right |^{2}}{1+\left |{\mathbf {h}_{1}^{H} \mathbf {p}_{1}}\right |^{2}}}\right)+\log _{2}\left ({1+\left |{\mathbf {h}_{1}^{H} \mathbf {p}_{1}}\right |^{2}}\right), \\=&\log _{2}\left ({1+\left |{\mathbf {h}_{1}^{H} \mathbf {p}_{2}}\right |^{2}+\left |{\mathbf {h}_{1}^{H} \mathbf {p}_{1}}\right |^{2}}\right). \tag{5}\end{align*} View SourceRight-click on figure for MathML and additional features.

It is important to note that (5) can be interpreted as the sum-rate of a two-user multiple access channel (MAC) with a single-antenna receiver. Indeed, user-1 acts as the receiver of a two-user MAC whose effective SISO channels for both links are given by $\mathbf {h}_{1}^{H} \mathbf {p}_{2}$ and $\mathbf {h}_{1}^{H} \mathbf {p}_{1}$ , respectively. This observation will be revisited in the next few sections, and will be shown very helpful to explain the performance of multi-antenna NOMA.

A drawback of the sum-rate is that it does not capture rate fairness among the users. Another popular system performance metric is the Max-Min Fair (MMF) rate or symmetric rate defined as $R_{\text {mmf}}^{(\text {N})}=\min _{k=1,2}R_{k}^{(\text {N})}$ . The MMF metric provides uniformly good QoS since it aims for maximizing the minimum rate among all users.

Throughout the manuscript, we will focus on the sum-rate and the MMF rate as two very different metrics to assess the system performance. We choose these two metrics as they are commonly used in wireless networks, and in the NOMA literature in particular (see, e.g., [20], [24], [30], [32], [33] for the sum-rate and [34], [35], [39], [53], [54] for the MMF rate). They are representative for two very different operational regimes, with the former focusing on high system throughput and the latter on user fairness.

In the sequel, we introduce some useful definitions and then make some observations based on this two-user system model.

B. Definition of Multiplexing Gain

Throughout the paper, we will often refer to the multiplexing gain to quantify how well a communication strategy can exploit the available spatial dimensions. We define the multiplexing gain, also referred to as Degrees-of-Freedom (DoF), of user-$k$ achieved with communication strategy10$j$ as \begin{equation*} d_{k}^{(j)}=\lim _{P\rightarrow \infty }\frac {R_{k}^{(j)}(P)}{\log _{2}(P)},\tag{6}\end{equation*} View SourceRight-click on figure for MathML and additional features. and the sum multiplexing gain as \begin{equation*} d_{\text {s}}^{(j)}=\lim _{P\rightarrow \infty }\frac {R_{\text {s}}^{(j)}(P)}{\log _{2}(P)}=\sum _{k=1}^{K} d_{k}^{(j)},\tag{7}\end{equation*} View SourceRight-click on figure for MathML and additional features. where $R_{\text {s}}^{(j)}=\sum _{k=1}^{K} R_{k}^{(j)}$ is the sum-rate. We also define the MMF multiplexing gain as \begin{equation*} d_{\text {mmf}}^{(j)}=\lim _{P\rightarrow \infty }\frac {R_{\text {mmf}}^{(j)}(P)}{\log _{2}(P)}=\min _{k=1,\ldots, K} d_{k}^{(j)},\tag{8}\end{equation*} View SourceRight-click on figure for MathML and additional features. where $R_{\text {mmf}}^{(j)}=\min _{k=1,\ldots, K} R_{k}^{(j)}$ is the MMF rate.

The multiplexing gain $d_{k}^{(j)}$ is a first-order approximation of the rate of user-$k$ at high SNR. $d_{k}^{(j)}$ can be viewed as the pre-log factor of the rate of user-$k$ at high SNR and be interpreted as the number or fraction of interference-free stream(s) that can be simultaneously communicated to user-$k$ by employing communication strategy $j$ . The larger $d_{k}^{(j)}$ , the faster the rate of user-$k$ increases with the SNR. Hence, ideally a communication strategy should achieve the highest multiplexing gain possible.

The sum multiplexing gain $d_{\text {s}}^{(j)}$ is a first-order approximation of the sum-rate at high SNR and therefore the pre-log factor of the sum-rate and can be interpreted as the total number of interference-free data streams that can be simultaneously communicated to all $K$ users by employing communication strategy $j$ . In other words, $R_{\text {s}}^{(j)}$ scales as $d_{\text {s}}^{(j)} \log _{2}(P)+\delta $ where $\delta $ is a term that scales slowly with SNR such that $\lim _{P\rightarrow \infty }\frac {\delta }{\log _{2}(P)}=0$ (e.g., $O(1) $ , $O(\log _{2}(\log _{2}(P)))$ or $O(\sqrt {\log _{2}(P)})$ ), and the larger $d_{\text {s}}^{(j)}$ , the faster the sum-rate increases with the SNR.

The MMF multiplexing gain $d_{\text {mmf}}^{(j)}$ , also referred to as symmetric multiplexing gain, corresponds to the maximum multiplexing gain that can be simultaneously achieved by all users, and reflects the pre-log factor of the MMF rate at high SNR. In other words, $R_{\text {mmf}}^{(j)}$ scales as $d_{\text {mmf}}^{(j)} \log _{2}(P)+\delta $ , and the larger $d_{\text {mmf}}^{(j)}$ , the faster the MMF rate increases with the SNR.

Remark 1:

Much of the analysis and discussion in this paper emphasizes the (sum and MMF) multiplexing gain as a metric to assess the capability of a strategy to exploit multiple antennas. As it becomes plausible from its definition, the multiplexing gain is an asymptotic metric valid in the limit of high SNR, and hence, does not precisely reflect specific finite-SNR rates. Nevertheless, it provides firm theoretical grounds for performance comparisons and has been used in the MIMO literature for two decades [55]. Furthermore, the multiplexing gain also impacts the performance at finite SNRs as shown in numerous papers [47], [48], [56] and in our simulation results in Section IX. Moreover, it enables to gain deep insights into the performance limits and to guide the design of efficient communications strategies, as we will see throughout this paper. The notion of multiplexing gain is illustrated in Fig. 2 where the strategy characterized by the red curves is preferred over the other strategy in blue. A larger slope/multiplexing gain is indeed offered by the red strategy at high SNR which translates into rate gains at finite SNR.

FIGURE 2. - Illustration of the notion of multiplexing gain/DoF.
FIGURE 2.

Illustration of the notion of multiplexing gain/DoF.

Remark 2:

In this manuscript, we will derive the sum multiplexing gain $d_{\text {s}}^{(j)}$ and the MMF multiplexing gain $d_{\text {mmf}}^{(j)}$ for strategy $j\in \{\text {N},\text {M},\text {R}\}$ . The corresponding proofs rely on obtaining an upper bound (i.e., converse) on the (sum/MMF) multiplexing gain and then showing that this upper bound is tight since it is achievable by the strategy under study. In other words, we show that $d_{\text {s/mmf}}^{(j)} \leq a$ (upper bound) and then $d_{\text {s/mmf}}^{(j)} \geq a$ (achievability). Consequently, this paper characterizes the exact (sum/MMF) multiplexing gains achieved by each strategy, i.e., $d_{\text {s/mmf}}^{(j)} = a$ . We confirm the multiplexing gains by numerical simulations in evaluations.

C. Discussions

Note that (2) and (3), respectively, suggest that $s_{1}$ is received interference-free at user-1, and that $s_{2}$ is always decoded in the presence of interference from $s_{1}$ . We can now draw some important conclusions from (2), (3), and (5).

The sum-rate bound (5) of this two-user MISO NOMA strategy and user ordering user-$2\rightarrow $ user-1 can be further upper bounded as \begin{equation*} R_{\text {s}}^{(\text {N})}\leq \log _{2}\left ({1+\left \|{\mathbf {h}_{1}}\right \|^{2} P}\right), \tag{9}\end{equation*} View SourceRight-click on figure for MathML and additional features. where the equality in (9) is achieved (i.e., upper bound is tight) by choosing $\mathbf {p}_{1}=\sqrt {P_{1}}\mathbf {h}_{1}/\|\mathbf {h}_{1}\|$ and $\mathbf {p}_{2}=\sqrt {P_{2}}\mathbf {h}_{1}/\|\mathbf {h}_{1}\|$ with $P_{1}+P_{2}= P$ . Note that the right hand side of (9) is the rate achieved by OMA when serving user-1. In other words, (9) is not just an upper bound on the sum-rate of MISO NOMA but is actually the maximum achievable sum-rate of MISO NOMA. This maximum achievable sum-rate of MISO NOMA is the same as that of OMA (when serving user-1).

Had we considered the other decoding order where the shared codebook is used to encode $W_{1}$ and user-2 decodes $s_{1}$ , the role of user-1 and user-2 in Fig. 1 would have been switched (user-$1\rightarrow $ user-2) and we would have obtained \begin{equation*} R_{\text {s}}^{(\text {N})}\leq \log _{2}\left ({1+\left \|{\mathbf {h}_{2}}\right \|^{2} P}\right). \tag{10}\end{equation*} View SourceRight-click on figure for MathML and additional features. This sum-rate upper bound is achievable by choosing $\mathbf {p}_{1}=\sqrt {P_{1}}\mathbf {h}_{2}/\|\mathbf {h}_{2}\|$ and $\mathbf {p}_{2}=\sqrt {P_{2}}\mathbf {h}_{2}/\|\mathbf {h}_{2}\|$ with $P_{1}+P_{2}= P$ and the maximum achievable sum-rate of MISO NOMA with decoding order user-$1\rightarrow $ user-2 is the same as that of OMA (when serving user-2 only).

Hence, from (9) and (10), the sum-rate of MISO NOMA considering adaptive decoding order is upper bounded as \begin{equation*} R_{\text {s}}^{(\text {N})}\leq \log _{2}\left ({1+\max \left \{{\left \|{\mathbf {h}_{1}}\right \|^{2}, \left \|{\mathbf {h}_{2}}\right \|^{2}}\right \} P}\right). \tag{11}\end{equation*} View SourceRight-click on figure for MathML and additional features. This sum-rate is again achievable and is the same as that of OMA when serving the strongest of the two users $\arg \max _{k=1,2}\|\mathbf {h}_{k}\|$ .

Importantly, (9), (10), and (11) reveal the strong result that the sum-rate of MISO NOMA is actually no higher than that of OMA for any SNR! This fact is not surprising in the SISO case ($M=1$ ) since it is well known that to achieve the sum capacity of the SISO BC, one can simply transmit to the strongest user all the time (i.e., OMA) [57]. The above result shows that this also holds for the two-user MISO NOMA basic building block.

Considering the high SNR regime, (9), (10), (11) all scale at most as $\log _{2}(P)$ , i.e., \begin{equation*} R_{\text {s}}^{(\text {N})}\stackrel {P\nearrow }{=} \log _{2}\left ({P}\right) + \delta, \tag{12}\end{equation*} View SourceRight-click on figure for MathML and additional features. which highlights that the sum multiplexing gain of two-user MISO NOMA (irrespectively of the decoding order) is (at most) one, i.e., $d_{\text {s}}^{(\text {N})}=1$ . Hence, MISO NOMA limits the sum multiplexing gain to $d_{\text {s}}^{(\text {N})}=d_{1}^{(\text {N})}+d_{2}^{(\text {N})}=1$ , i.e., the same as OMA.

The sum multiplexing gain of one can be further split equally between the two users, which leads to an MMF multiplexing gain of two-user MISO NOMA given by $d_{\text {mmf}}^{(\text {N})}=\frac {1}{2}$ . This is achieved by scaling the power allocated to user-1 as $O(P^{1/2})$ and that to user-2 as $O(P)$ . In other words, the MMF rate of this two-user MISO NOMA scales at most as $\frac {1}{2}\log _{2}(P)$ at high SNR.

The above contrasts with the optimal sum multiplexing gain $d_{\text {s}}^{(\star)}$ of the two-user MISO BC, that is equal to 2, i.e., two interference-free streams can be transmitted.11 This can be achieved by performing conventional MU–LP, illustrated in Fig. 3. Recall the MU–LP system model where $W_{1}$ and $W_{2}$ are independently encoded into streams $s_{1}$ and $s_{2}$ and respectively precoded by $\mathbf {p}_{1}$ and $\mathbf {p}_{2}$ such that the transmit signal is given by \begin{equation*} \mathbf {x}=\mathbf {p}_{1} s_{1}+\mathbf {p}_{2} s_{2}.\tag{13}\end{equation*} View SourceRight-click on figure for MathML and additional features. At the receivers, $y_{k}=\mathbf {h}_{k}^{H} \mathbf {x}+n_{k}$ , $k=1,2$ , and $s_{1}$ and $s_{2}$ are respectively decoded by user-1 and user-2 by treating any residual interference as noise, leading to MU–LP rates \begin{equation*} R_{1}^{(\text {M})}=\log _{2}\left ({1+C}\right), R_{2}^{(\text {M})}=\log _{2}\left ({1+B}\right), \tag{14}\end{equation*} View SourceRight-click on figure for MathML and additional features. with \begin{equation*} C=\frac {\left |{\mathbf {h}_{1}^{H} \mathbf {p}_{1}}\right |^{2}}{1+\left |{\mathbf {h}_{1}^{H} \mathbf {p}_{2}}\right |^{2}},\tag{15}\end{equation*} View SourceRight-click on figure for MathML and additional features. and $B$ as specified in (4). It is then indeed sufficient12 to transmit two streams using uniform power allocation and Zero-Forcing Beamforming (ZFBF), so that $\mathbf {h}_{1}^{H} \mathbf {p}_{2}=\mathbf {h}_{2}^{H} \mathbf {p}_{1}=0$ , to reap the sum multiplexing gain $d_{\text {s}}^{(\text {M})}=d_{\text {s}}^{(\star)}=2$ and the MMF multiplexing gain $d_{\text {mmf}}^{(\text {M})}=d_{\text {mmf}}^{(\star)}=1$ (i.e., each user gets one full interference-free stream). Indeed, with MU–LP, the sum-rate scales as $2\log _{2}(P)$ and the MMF rate as $\log _{2}(P)$ at high SNR [58]–​[60]. Such sum-rate and MMF rate would always strictly outperform that of NOMA (and OMA) at high SNR. Since both OMA and NOMA achieve only half the (sum/MMF) multiplexing gain of MU–LP in the two-user MISO BC considered, it is not clear whether (and under what conditions) multi-antenna NOMA can outperform MU–LP and other forms of multi-user multi-antenna communication strategies, and if it does, whether multi-antenna NOMA is worth the associated increase in receiver complexity. The above discussion exposes some weaknesses of multi-antenna NOMA and highlights the uncertainty regarding the potential benefits of multi-antenna NOMA. Hence, in the following sections, we derive the multiplexing gains of generalized $K$ -user multi-antenna NOMA, so as to better assess its potential.

FIGURE 3. - Two-user system architecture with MU–LP/SDMA.
FIGURE 3.

Two-user system architecture with MU–LP/SDMA.

Remark 3:

It appears from (1) and (13) that the transmit signal vectors for 2-user MISO NOMA and 2-user MU–LP are the same, therefore giving the impression that NOMA is the same as MU–LP. This is obviously incorrect. Recall the major differences in the encoding and the decoding of NOMA and MU–LP:

  • Encoding: In NOMA, $W_{1}$ is encoded into $s_{1}$ and $W_{2}$ is encoded into $s_{2}$ at a rate such that $s_{2}$ is decodable by both users, while $W_{1}$ and $W_{2}$ are independently encoded into streams $s_{1}$ and $s_{2}$ in MU–LP.

  • Decoding: User-1 decodes $s_{1}$ and $s_{2}$ and user-2 decodes $s_{2}$ by treating $s_{1}$ as noise in NOMA while $s_{1}$ is decoded by user-1 by treating $s_{2}$ as noise and $s_{2}$ is decoded by user-2 by treating $s_{1}$ as noise in MU–LP.

Consequently the rate expressions (2), (3) and (14) are different, which therefore suggests that the best pair of precoders $\mathbf {p}_{1}$ and $\mathbf {p}_{2}$ that maximizes a given objective function (e.g., sum-rate, MMF rate) would be different for NOMA and MU–LP. Choosing $\mathbf {p}_{1}$ and $\mathbf {p}_{2}$ according to ZFBF would commonly work reasonably well for MU–LP but would lead to $R_{2}^{(\text {N})}=0$ in (3) for NOMA. Nevertheless, the above discussion on multiplexing gain loss of MISO NOMA always holds, even in the event where MISO NOMA is implemented with the best choice of precoders, since the above analysis for MISO NOMA is based on an upper bound.

SECTION III.

$K$ -User MISO NOMA With Perfect CSIT

We now study $K$ -user MISO NOMA relying on perfect CSIT and derive the sum and MMF multiplexing gains.

A. MISO NOMA System Model

We consider a $K$ -user MISO NOMA scenario where a single transmitter equipped with $M$ transmit antennas serves $K$ single-antenna users indexed by $\mathcal {K}=\{1,\ldots,K\}$ . The $K$ users are grouped into $1\leq G< K$ groups13 with groups indexed by $\mathcal {G}=\{1,\ldots,G\}$ . There are $g$ users per group, i.e., we therefore assume for simplicity that $K=gG$ . Users in group $i$ are indexed by $\mathcal {K}_{i}=\{ig-g+1,\ldots,ig\}$ . Hence, $\mathcal {K}=\bigcup _{i\in \mathcal {G}}\mathcal {K}_{i}$ and $|\mathcal {K}_{i}|=g$ . Without loss of generality, we assume that users 1, $g+1$ , $2g+1$ , ${\dots }$ , $K-g+1$ are the “strong users”14 respectively in group 1 to $G$ , and perform $g-1$ layers of SIC to fully decode the messages (and therefore remove interference) from the other $g-1$ users within the same group. Similarly, the second user in each group (i.e., $ig-g+2$ in group $i$ ) performs $g-2$ layers of SIC to fully decode messages from other $g-2$ users within the same group, and so on. The two most popular MISO NOMA strategies employ either $G=1$ [20]–​[23] or $G=K/2$ [26]–​[31] but we keep here the scenario general for any value of $1\leq G< K$ . The general architecture of MISO NOMA is illustrated in Fig. 4. The two-user building block in Section II can be viewed as a particular instance with $K=2$ and $G=1$ .

FIGURE 4. - 
$K$
-user system architecture with MISO NOMA (containing 
$G$
 user groups and 
$g$
 users within each group).
FIGURE 4.

$K$ -user system architecture with MISO NOMA (containing $G$ user groups and $g$ users within each group).

At the transmitter, the messages $W_{1}$ to $W_{K}$ intended for user-1 to user-$K$ , respectively, are encoded into $s_{1}$ to $s_{K}$ . However, some of the messages in each group have to be encoded using codebooks shared by a subset of the users in that group so that they can be decoded and cancelled from the received signals at the co-scheduled users in that group. In particular, taking group 1 as an example, $W_{2}$ to $W_{g}$ are encoded using codebooks shared with user-1 such that user-1 can decode all of these $g-1$ messages. After encoding, the $K$ streams are linearly precoded by precoders15$\mathbf {p}_{1}$ to $\mathbf {p}_{K}$ , where $\mathbf {p}_{k}\in \mathbb {C}^{M}$ is the precoder of $s_{k}$ , and superposed at the transmitter. The resulting transmit signal is \begin{equation*} \mathbf {x}=\sum _{k=1}^{K}\mathbf {p}_{k} s_{k}.\tag{16}\end{equation*} View SourceRight-click on figure for MathML and additional features. Defining $\mathbf {s}=[s_{1},\ldots, s_{K}]^{T}$ and assuming that $\mathbb {E}[\mathbf {s}\mathbf {s}^{H}]=\mathbf {I}$ , the average transmit power constraint is written as $\sum _{k=1}^{K} P_{k}\leq P$ , where $P_{k}=\|\mathbf {p}_{k}\|^{2}$ .

At the receiver side, the signal received at user-$k$ is $y_{k}=\mathbf {h}_{k}^{H} \mathbf {x}+n_{k}$ , $k\in \mathcal {K}$ , where $\mathbf {h}_{k}$ is the channel vector16 of user-$k$ , perfectly known at the transmitter and that user, and $n_{k}\sim \mathcal {CN}(0,1)$ is the AWGN. By employing SIC, user-$j$ in group $i$ (i.e., $j\in \mathcal {K}_{i}$ ) decodes the messages of users-$\{k{\mid } k\geq j,k\in \mathcal {K}_{i}\}$ within the same user group in a descending order of the user index while treating the interference from users in different groups as noise. Under the assumption of Gaussian signaling and perfect SIC, the rate at user-$j, j\in \mathcal {K}_{i}$ , to decode the message of user-$k, k\geq j,k\in \mathcal {K}_{i}$ , is given by \begin{equation*} R_{j,k}=\log _{2}\left ({1+\frac {\left |{\mathbf {h}_{j}^{H}\mathbf {p}_{k}}\right |^{2}}{I_{j,k}^{(in)} +I_{j,k}^{(ou)}+1}}\right),\tag{17}\end{equation*} View SourceRight-click on figure for MathML and additional features. where \begin{equation*} I_{j,k}^{(in)}=\sum _{m< k,m\in \mathcal {K}_{i}}\left |{\mathbf {h}_{j}^{H}\mathbf {p}_{m}}\right |^{2}, I_{j,k}^{(ou)}=\sum _{l\neq i, l\in \mathcal {G}}\sum _{m\in \mathcal {K}_{l}} \left |{\mathbf {h}_{j}^{H}\mathbf {p}_{m}}\right |^{2}\tag{18}\end{equation*} View SourceRight-click on figure for MathML and additional features. are the intra-group interference and inter-group interference received at user-$k$ , respectively. As the message of user-$k, k\in \mathcal {K}_{i}$ , has to be decoded by users-$\{j| j\leq k,j\in \mathcal {K}_{i}\}$ , to ensure decodability, the rate of user-$k$ should not exceed \begin{equation*} R_{k}^{(\text {N})}=\min _{j\leq k, j\in \mathcal {K}_{i}}R_{j,k}.\tag{19}\end{equation*} View SourceRight-click on figure for MathML and additional features.

In the next subsection, we study the sum multiplexing gain and the MMF multiplexing gain of $K$ -user MISO NOMA.

B. Multiplexing Gains

The following proposition provides the sum multiplexing gain of MISO NOMA for perfect CSIT.

Proposition 1:

The sum multiplexing gain of $K$ -user MISO NOMA with $M$ transmit antennas, $G$ groups of $g=K/G$ users, and perfect CSIT is $d_{\text {s}}^{(\text {N})}=\min (M,G)$ .

Proof:

The proof is obtained by showing that an upper bound on the sum multiplexing gain is achievable. The upper bound is obtained by applying the MAC argument (used in (5)) to the strong user in each group and noticing that the sum-rate in groups 1 to $G$ is upper bounded as \begin{align*} \sum _{k=1}^{g} R_{k}^{(\text {N})}\leq&\log _{2}\left ({1+\sum _{k=1}^{g}\left |{\mathbf {h}_{1}^{H} \mathbf {p}_{k}}\right |^{2}}\right), \tag{20}\\ \sum _{k=g+1}^{2g} R_{k}^{(\text {N})}\leq&\log _{2}\left ({1+\sum _{k=g+1}^{2g}\left |{\mathbf {h}_{g+1}^{H} \mathbf {p}_{k}}\right |^{2}}\right), \tag{21}\\ &\!\!\!\!\!\!\!\! \vdots \\ \sum _{k=K-g+1}^{K} R_{k}^{(\text {N})}\leq&\log _{2}\left ({1+\sum _{k=K-g+1}^{K}\left |{\mathbf {h}_{K-g+1}^{H} \mathbf {p}_{k}}\right |^{2}}\right). \tag{22}\end{align*} View SourceRight-click on figure for MathML and additional features. Note that the left-hand sides of (20), (21), and (22) refer to the sum of the rates of the messages in group 1, 2, and $G$ , respectively, but can also be viewed as the total rate to be decoded by user 1, $g+1$ , and $K-g+1$ (since those users decode all the messages in their respective group). We now notice that the right-hand sides of (20), (21), and (22) scale as $\log _{2}(P)+ \delta $ for large $P$ (following the same argument as in the two-user case). This implies that each group $i$ achieves at most a (group) sum multiplexing gain $d_{\text {s},i}^{(\text {N})}=\sum _{k=ig-g+1}^{ig}d_{k}^{(N)}$ of 1, i.e., at most one interference-free stream can be transmitted to each group. Summing up all inequalities, we obtain in the limit of large $P$ that \begin{equation*} R_{\text {s}}^{(\text {N})}=\sum _{k=1}^{K} R_{k}^{(\text {N})} \stackrel {P\nearrow }{\leq } G \log _{2}(P) +\delta,\tag{23}\end{equation*} View SourceRight-click on figure for MathML and additional features. which shows that $d_{\text {s}}^{(\text {N})}=\sum _{i=1}^{G}d_{\text {s},i}^{(\text {N})}\leq G$ . Additionally, since $d_{\text {s}}^{(\text {N})}\leq d_{\text {s}}^{\star }=\min (M,K)$ , we have $d_{\text {s}}^{(\text {N})}\leq \min (M,G)$ .

The achievability part shows that $d_{\text {s}}^{(\text {N})}\geq \min (M,G)$ . To this end, it is indeed sufficient to perform ZFBF and transmit $\min (M,G)$ interference-free streams to $\min (M,G)$ of the $G$ “strong users”. Combining the upper bound and achievability leads to the conclusion that $d_{\text {s}}^{(\text {N})}=\min (M,G)$ .

The following result derives the MMF multiplexing gain of MISO NOMA with perfect CSIT.

Proposition 2:

The MMF multiplexing gain of $K$ -user MISO NOMA with $M$ transmit antennas, $G$ groups of $g=K/G$ users and perfect CSIT is \begin{align*} d_{\text {mmf}}^{(\text {N})}=\begin{cases} \frac {1}{g},& M\geq K-g+1, \\ 0, & M < K-g+1. \end{cases}\tag{24}\end{align*} View SourceRight-click on figure for MathML and additional features. For $G=1$ , i.e., $g=K$ , $d_{\text {mmf}}^{(\text {N})}=\frac {1}{K}$ .

Proof:

Let us first consider $M\geq K-g+1$ . The MMF multiplexing gain is always upperbounded by ignoring the inter-group interference, i.e., the $G$ groups are non-interfering. Following again the MAC argument, the sum multiplexing gain of one in each group can then be further split equally among the $g$ users, which leads to an upper bound on the MMF multiplexing gain of $\frac {1}{g}$ . Achievability is simply obtained by designing the precoders using ZFBF to eliminate all inter-group interference, and allocating the power similarly to Section II-C. Indeed let us consider group 1 for simplicity, and allocate the power to user $k=1,\ldots,g$ as $O(P^{k/g})$ . This leads to an SINR for user-$k$ scaling as $O(P^{1/g})$ and to an achievable MMF multiplexing gain of $\frac {1}{g}$ . For $G=1$ , one can simply allocate the power to user $k=1,\ldots,K$ as $O(P^{k/K})$ , which leads to an achievable MMF multiplexing gain of $\frac {1}{K}$ .

Let us now consider $M < K-g+1$ . Take $M=K-g$ (any smaller $M$ cannot improve the multiplexing gain). Precoder $\mathbf {p}_{k}$ of any user-$k$ can be made orthogonal to the channel of at most $K-g-1$ co-scheduled users and will therefore cause interference to at least one user in another group. As a result, the MMF multiplexing gain collapses to 0.

Remark 4:

For the MMF multiplexing gain analysis, it should be noted that we consider one-shot transmission schemes with no time-sharing between strategies. This is suitable for systems with rigid scheduling and/or tight latency constraints, and also allows for simpler designs. This assumption is also commonly used in the NOMA literature [34], [35], [39], [53], [54].

SECTION IV.

$K$ -User MISO NOMA With Imperfect CSIT

We now go one step further and extend the multiplexing gain analysis to the imperfect CSIT setting. The results in this section therefore generalize the results in the previous section (with perfect CSIT being a particular case of imperfect CSIT). In this section, the achievable rates are defined in the ergodic sense in a standard Shannon theoretic fashion, and the corresponding sum and MMF mutiplexing gains are defined similarly to Section II-B using ergodic rates. We first introduce the CSIT error model before deriving the multiplexing gains of MISO NOMA relying on imperfect CSIT.

A. CSIT Error Model

For each user, the transmitter acquires an imperfect estimate of the channel vector $\mathbf {h}_{k}$ , denoted as $\hat {\mathbf {h}}_{k}$ . The CSIT imperfection is modelled by \begin{equation*} \mathbf {h}_{k}=\hat {\mathbf {h}}_{k}+\tilde {\mathbf {h}}_{k},\tag{25}\end{equation*} View SourceRight-click on figure for MathML and additional features. where $\tilde {\mathbf {h}}_{k}$ denotes the corresponding channel estimation error at the transmitter. For compactness, we define $\mathbf {H}=[\mathbf {h}_{1} \ldots \mathbf {h}_{K}]$ , $\hat {\mathbf {H}}=[\hat {\mathbf {h}}_{1} \ldots \hat {\mathbf {h}}_{K}]$ , and $\tilde {\mathbf {H}}=[\tilde {\mathbf {h}}_{1} \ldots \tilde {\mathbf {h}}_{K}]$ , which implies $\mathbf {H}=\hat {\mathbf {H}}+\tilde {\mathbf {H}}$ . The joint fading process is characterized by the joint distribution $f_{\mathbf {H},\hat {\mathbf {H}}}(\mathbf {H},\hat {\mathbf {H}})$ of $\{\mathbf {H},\hat {\mathbf {H}}\}$ , assumed to be stationary and ergodic. The joint distribution $f_{\mathbf {H},\hat {\mathbf {H}}}(\mathbf {H},\hat {\mathbf {H}})$ is continuous and known to the transmitter. The ergodic rates capture the long-term performance over a long sequence of channel uses $\{\mathbf {H},\hat {\mathbf {H}}\}$ spanning almost all possible joint channel states.

For each user-$k$ , we define the average channel (power) gain as $\Gamma _{k}=\mathbb {E}\{\|\mathbf {h}_{k}\|^{2}\}$ . Similarly, we define $\hat {\Gamma }_{k}=\mathbb {E}\{\|\hat {\mathbf {h}}_{k}\|^{2}\}$ and $\tilde {\Gamma }_{k}=\mathbb {E}\{\|\tilde {\mathbf {h}}_{k}\|^{2}\}$ . For many CSIT acquisition mechanisms [61], $\hat {\mathbf {h}}_{k}$ and $\tilde {\mathbf {h}}_{k}$ are uncorrelated according to the orthogonality principle [62]. By further assuming that $\hat {\mathbf {h}}_{k}$ and $\tilde {\mathbf {h}}_{k}$ have zero means, we have $\Gamma _{k}=\hat {\Gamma }_{k}+\tilde {\Gamma }_{k}$ , based on which we can write $\hat {\Gamma }_{k}=(1-\sigma _{e,k}^{2})\Gamma _{k}$ and $\tilde {\Gamma }_{k}=\sigma _{e,k}^{2}\Gamma _{k}$ for some $\sigma _{e,k}^{2}\in [{0,1}]$ . Note that $\sigma _{e,k}^{2}$ is the normalized estimation error variance for user-$k$ ’s CSIT, e.g., $\sigma _{e,k}^{2}=1$ corresponds to no instantaneous CSIT, while $\sigma _{e,k}^{2}=0$ represents perfect instantaneous CSIT.

For simplicity, we assume identical normalized CSIT error variances for all users, i.e., $\sigma _{e,k}^{2}=\sigma _{e}^{2}$ for all $k=1,\ldots,K$ . To facilitate the multiplexing gain analysis, we assume that $\sigma _{e}^{2}$ scales with SNR as $\sigma _{e}^{2}=P^{-\alpha }$ for some CSIT quality parameter $\alpha \in [0,\infty$ ) [46], [47], [60], [63], [64]. This is a convenient and tractable model extensively used in the information theoretic literature that allows us to assess the performance of the system in a wide range of CSIT quality conditions. Indeed, the larger $\alpha $ , the faster the CSIT error decreases with the SNR. The two extreme cases, $\alpha =0$ and $\alpha =\infty $ , correspond to no or constant CSIT (i.e., that does not scale or improve with SNR) and perfect CSIT, respectively. As far as the multiplexing gain analysis is concerned, however, we may truncate the CSIT quality parameter as $\alpha \in [{0,1}]$ , where $\alpha =1$ amounts to perfect CSIT in the multiplexing gain sense. The regime $\alpha \in (0,1)$ corresponds to partial CSIT, resulting from imperfect CSI acquisition. The CSIT quality $\alpha $ can be interpreted in many different ways, but a plausible interpretation of $\alpha $ is related to the number of feedback bits, where $\alpha =0$ corresponds to a fixed number of feedback bits for all SNRs, $\alpha =\infty $ corresponds to an infinite number of feedback bits, and $0< \alpha < \infty $ reflects how quickly the number of feedback bits increases with the SNR. As a reference, a system like 4G and 5G use $\alpha =0$ when limited feedback (or codebook-based feedback) is used to report the CSI, since the number of feedback bits is constant and does not scale with SNR, e.g., 4 bits of CSI feedback in 4G LTE for $M=4$ .

B. Multiplexing Gains

The following result quantifies the sum multiplexing gain of MISO NOMA for imperfect CSIT.

Proposition 3:

The sum multiplexing gain of $K$ -user MISO NOMA with $M$ transmit antennas, $G$ groups of $g=K/G$ users, and CSIT quality $0\leq \alpha \leq 1$ is $d_{\text {s}}^{(\text {N})}=\max (1,\min (M,G)\alpha)$ .

Proof:

Similar to the proof of Proposition 1, let us look at the $G$ strong users since they have to decode all messages. We recall that $d_{\text {s},i}^{(\text {N})}=\sum _{k=ig-g+1}^{ig}d_{k}^{(N)}$ reflects the multiplexing gain of the total rate to be decoded by the strong user $ig-g+1$ in group $i$ as a consequence that this user decodes all $g$ messages in group $i$ . Making use of the results of MU–LP in the $G$ -user MISO BC with imperfect CSIT [47].17 we obtain $d_{\text {s}}^{(\text {N})}=\sum _{i=1}^{G} d_{\text {s},i}^{(\text {N})}=\sum _{k=1}^{K} d_{k}^{(N)}\leq \max (1,\min (M,G)\alpha)$ .

The achievability part shows that $d_{\text {s}}^{(\text {N})}\geq \max (1,\min (M,G)\alpha)$ . It is indeed sufficient to perform ZFBF and transmit $\min (M,G)$ streams, each at a power level of $P^{\alpha }/\min (M,G)$ , to $\min (M,G)$ of the $G$ “strong users”. If $\min (M,G)\alpha < 1$ , one can simply transmit a single stream (i.e., perform OMA) and reap a sum multiplexing gain of 1. Combining the upper bound and achievability leads to the conclusion that we have $d_{\text {s}}^{(\text {N})}=\max (1,\min (M,G)\alpha)$ .

For $\alpha =1$ (perfect CSIT from a multiplexing gain perspective), Proposition 3 boils down to the perfect CSIT result in Proposition 1.

The following proposition provides the MMF multiplexing gain of MISO NOMA with imperfect CSIT.

Proposition 4:

The MMF multiplexing gain of $K$ -user MISO NOMA with $M$ transmit antennas, $G$ groups of $g=K/G$ users, and CSIT quality $0\leq \alpha \leq 1$ is \begin{align*} d_{\text {mmf}}^{(\text {N})}=\begin{cases} \frac {\alpha }{g}, & G>1 \:\textrm {and}\: M\geq K-g+1, \\ 0, & G>1 \:\textrm {and}\: M < K-g+1,\\ \frac {1}{K}, & G=1. \end{cases}\tag{26}\end{align*} View SourceRight-click on figure for MathML and additional features.

The proof is relegated to Appendix A.

It is interesting to note that the sensitivity of the multiplexing gain of MISO NOMA to the CSIT quality $\alpha $ is different for $G>1$ and $G=1$ . Indeed the sum and MMF multiplexing gains of MISO NOMA with $G>1$ decay as $\alpha $ decreases, while the multiplexing gains of MISO NOMA with $G=1$ are not affected by $\alpha $ . This can be interpreted in two different ways. On the one hand, this suggests that MISO NOMA $G=1$ is inherently robust to CSIT imperfection since the multiplexing gains are not affected by $\alpha < 1$ . On the other hand, this also reveals that MISO NOMA with $G=1$ is unable to exploit the presence of CSIT since its multiplexing gains are the same as in the absence of CSIT ($\alpha =0$ ).

SECTION V.

MIMO NOMA

We now consider multi-antenna receivers and extend the two-user MISO NOMA toy example of Section II to a two-user MIMO NOMA setting with perfect CSIT.

We consider a two-user MIMO BC, consisting of one transmitter with $M$ antennas and two users equipped with $N$ antennas each. The signal vector $\mathbf {y}_{k}\in \mathbb {C}^{N\times 1}$ received at user-$k$ is written as $\mathbf {y}_{k}=\mathbf {H}_{k}^{H}\mathbf {x}+\mathbf {n}_{k}$ , where $\mathbf {H}_{k}\in \mathbb {C}^{M\times N}$ is the channel matrix18 of user-$k$ and $\mathbf {n}_{k}$ is the AWGN vector at user-$k$ . Following the NOMA principle, the transmit signal vector $\mathbf {x}$ is generated such that the messages intended for user-2 are encoded using a shared codebook so as to be decodable by user-1. Defining the transmit covariance matrix of user-$k$ as $\mathbf {Q}_{k}$ subject to the average transmit power constraint $\textrm {Tr}(\mathbf {Q}_{1})+\textrm {Tr}(\mathbf {Q}_{2})\leq P$ , and assuming Gaussian signaling, the achievable rates of both users are given by \begin{align*} R_{1}^{(\text {N})}=&\log _{2}\det \left ({\mathbf {I}_{N}+\mathbf {H}_{1}^{H} \mathbf {Q}_{1} \mathbf {H}_{1}}\right), \tag{27}\\ R_{2}^{(\text {N})}=&\min \left ({\log _{2}\det \left ({\mathbf {I}_{N}+\mathbf {R}_{1}}\right), \log _{2}\det \left ({\mathbf {I}_{N}+\mathbf {R}_{2}}\right)}\right), \tag{28}\end{align*} View SourceRight-click on figure for MathML and additional features. where $\mathbf {R}_{k}=\mathbf {H}_{k}^{H} \mathbf {Q}_{2} \mathbf {H}_{k}(\mathbf {I}_{N}+\mathbf {H}_{k}^{H} \mathbf {Q}_{1} \mathbf {H}_{k})^{-1}$ , $k=1,2$ .

The sum-rate $R_{s}^{(\text {N})}$ of the two-user MIMO NOMA can then be bounded as \begin{align*} R_{\text {s}}^{(\text {N})}\leq&\log _{2}\det \left ({\mathbf {I}_{N}+\mathbf {R}_{1}}\right)+ \log _{2}\det \left ({\mathbf {I}_{N}+\mathbf {H}_{1}^{H} \mathbf {Q}_{1} \mathbf {H}_{1}}\right), \\=&\log _{2}\det \left ({\mathbf {I}_{N}+\mathbf {H}_{1}^{H} \mathbf {Q}_{1} \mathbf {H}_{1}+\mathbf {H}_{1}^{H} \mathbf {Q}_{2} \mathbf {H}_{1}}\right). \tag{29}\end{align*} View SourceRight-click on figure for MathML and additional features. The sum-rate bound achieved with this two-user MIMO NOMA strategy can be further upper bounded as \begin{align*} R_{\text {s}}^{(\text {N})}\leq&\log _{2}\det \left ({\mathbf {I}_{N}+\mathbf {H}_{1}^{H} \mathbf {Q}_{1}^{\star } \mathbf {H}_{1}}\right), \\\stackrel {P\nearrow }{=}&\min \left ({M,N}\right)\log _{2}\left ({P}\right) + O(1), \tag{30}\end{align*} View SourceRight-click on figure for MathML and additional features. where $\mathbf {Q}_{1}^{\star }$ refers to the optimal covariance matrix for user-1 in a single-user (OMA) setup with $\textrm {Tr}(\mathbf {Q}_{1})= P$ , i.e., obtained by transmitting along the dominant eigenvector of $\mathbf {H}_{1}\mathbf {H}_{1}^{H}$ and allocating power $P$ according to the water-filling solution.

Similarly to the MISO case, the other decoding order could also be considered and a similar analysis can be obtained. Ultimately, the sum-rate of MIMO NOMA (irrespective of the decoding order) is actually no larger than that of OMA at any SNR. The sum multiplexing gain is limited by $d_{\text {s}}^{(\text {N})}=\min (M,N)$ , which is smaller than the optimal sum multiplexing gain of the MIMO BC $d_{\text {s}}^{(\star)}=\min (M,2N)$ , achieved by conventional MU-MIMO/MU–LP precoding [59]. This analysis highlights that MIMO NOMA incurs a sum multiplexing loss whenever $N< M$ , i.e., when the number of receive antennas at each device is smaller than the number of transmit antennas at the base station, which would occur in most realistic and practical MIMO deployments. Similarly, the MMF multiplexing gain is also affected since $d_{\text {mmf}}^{(\text {N})}=\min (M,N)/2$ , obtained by equally splitting the sum multiplexing gain amongst the two users, which again incurs a loss whenever $N< M$ . Taking for instance $M=6$ and $N=4$ leads to $d_{\text {s}}^{(\text {N})}=4$ and $d_{\text {mmf}}^{(\text {N})}=2$ , though one could easily transmit using multi-user MIMO (e.g., block diagonalization [65], [66]) 6 interference-free streams with 3 streams per user.

Recall that the above MIMO NOMA scheme and analysis were based on the principle that one user is forced to fully decode the messages of the other co-scheduled user. Nevertheless other MIMO NOMA schemes have recently appeared that may not satisfy this definition of MIMO NOMA and may therefore achieve different (and hopefully superior) sum and MMF multiplexing gains [67], [68].

SECTION VI.

Baseline Scheme I: Conventional Multi-User Linear Precoding

The first baseline to assess the performance of multi-antenna NOMA is conventional Multi-User Linear Precoding. In the sequel, we recall the multiplexing gains achieved by MU–LP.

A. MU–LP System Model

Following Section III-A, we consider a $K$ -user MISO BC with one transmitter equipped with $M$ transmit antennas and $K$ single-antenna users. As per Fig. 5, the messages $W_{1},\ldots,W_{K}$ respectively for user-1 to user-$K$ are independently encoded into $s_{1}$ to $s_{K}$ , which are then mapped to the transmit antennas through precoders $\mathbf {p}_{1},\ldots,\mathbf {p}_{K}$ . The resulting transmit signal is $\mathbf {x}=\sum _{k=1}^{K}\mathbf {p}_{k} s_{k}$ .

FIGURE 5. - 
$K$
-user system architecture with MU–LP. Receiver architecture is illustrated for user-
$k$
 though the same applies to other users, i.e., all 
$K$
 users are equipped with a decoder that maps the received signal into an estimated message by treating residual interference as noise.
FIGURE 5.

$K$ -user system architecture with MU–LP. Receiver architecture is illustrated for user-$k$ though the same applies to other users, i.e., all $K$ users are equipped with a decoder that maps the received signal into an estimated message by treating residual interference as noise.

The signal received at user-$k$ is $y_{k}=\mathbf {h}_{k}^{H} \mathbf {x}+n_{k}$ with $n_{k}\sim \mathcal {CN}(0,1)$ . Each user-$k$ directly decodes the intended message $W_{k}$ by treating the interference from the other users as noise. Under the assumption of Gaussian signaling, the rate of user-$k$ for $k\in \mathcal {K}$ is given by \begin{equation*} R_{k}^{(\text {M})}=\log _{2}\left ({1+\frac {\left |{\mathbf {h}_{k}^{H} \mathbf {p}_{k}}\right |^{2}}{1+\sum _{q\neq k}\left |{\mathbf {h}_{k}^{H} \mathbf {p}_{q}}\right |^{2}}}\right).\tag{31}\end{equation*} View SourceRight-click on figure for MathML and additional features. The sum-rate of MU–LP is therefore $R_{\mathrm {s}}^{(\text {M})}=\sum _{k=1}^{K} R_{k}^{(\text {M})}$ , and the MMF rate of MU–LP is given as $R_{\mathrm {mmf}}^{(\text {M})}=\min _{k=1,\ldots,K}R_{k}^{(\text {M})}$ .

B. Multiplexing Gains With Perfect CSIT

We recall the sum multiplexing gain and the MMF multiplexing gain of MU–LP with perfect CSIT from [56] and [59], respectively.

Proposition 5:

The sum multiplexing gain of $K$ -user MU–LP with $M$ transmit antennas and perfect CSIT is $d_{\text {s}}^{(\text {M})}=\min (M,K)$ .

This result19 is simply achieved by choosing the MU–LP precoders based on ZFBF and transmitting $\min (M,K)$ interference-free streams. Note that $\min (M,K)$ is also the optimal20 sum multiplexing gain of the $K$ -user MISO BC21 [59]. In other words, $d_{\text {s}}^{(\text {M})}=d_{\text {s}}^{(\star)}=\min (M,K)$ .

Proposition 6:

The MMF multiplexing gain of the $K$ -user MU–LP with $M$ transmit antennas and perfect CSIT is \begin{align*} d_{\text {mmf}}^{(\text {M})}=\begin{cases} 1, & M\geq K, \\ 0, & M < K. \end{cases}\tag{32}\end{align*} View SourceRight-click on figure for MathML and additional features.

When $M\geq K$ , ZFBF can be used to fully eliminate interference. On the other hand, for $M < K$ interference cannot be eliminated anymore and $d_{\text {mmf}}^{(\text {M})}$ collapses, therefore leading to a rate saturation at high SNR.

C. Multiplexing Gains With Imperfect CSIT

We use the CSIT error model introduced in Section IV-A. We recall the sum multiplexing gain and the MMF multiplexing gain of MU–LP with imperfect CSIT from [47] and [48], [69], respectively.

Proposition 7:

The sum multiplexing gain of the $K$ -user MU–LP with $M$ transmit antennas and CSIT quality $0\leq \alpha \leq 1$ is $d_{\text {s}}^{(\text {M})}=\max (1,\min (M,K)\alpha)$ .

This result is simply achieved by choosing the MU–LP precoders based on ZFBF and transmitting $\min (M,K)$ streams, each with power level $P^{\alpha }/\min (M,K)$ . This enables each stream to reap a multiplexing gain of $\alpha $ and therefore a sum multiplexing gain of $\min (M,K)\alpha $ . If $\min (M,K)\alpha < 1$ , one can simply transmit a single stream (i.e., perform OMA) and reap a sum multiplexing gain of 1.

Comparing Propositions 5 and 7, we note that imperfect CSIT leads to a reduction of the sum multiplexing gain. For $\alpha =1$ (perfect CSIT in a multiplexing gain sense), Proposition 7 matches Proposition 5. Importantly, in contrast to the $K$ -user MISO BC with perfect CSIT setting where MU–LP achieves the information theoretic optimal sum multiplexing gain $d_{\text {s}}^{(\text {M})}=d_{\text {s}}^{(\star)}$ , in the imperfect CSIT setting, MU–LP does not achieve the information theoretic optimal sum multiplexing gain [47], [63].

Proposition 8:

The MMF multiplexing gain of the $K$ -user MU–LP with $M$ transmit antennas and CSIT quality $0\leq \alpha \leq 1$ is \begin{align*} d_{\text {mmf}}^{(\text {M})}=\begin{cases} \alpha, & M\geq K, \\ 0, & M < K. \end{cases}\tag{33}\end{align*} View SourceRight-click on figure for MathML and additional features.

This is achieved by performing ZFBF when $M \geq K$ . When $M< K$ , rate saturation occurs (similarly to the perfect CSIT setting).

SECTION VII.

Baseline Scheme II: Rate-Splitting

The second baseline to assess multi-antenna NOMA performance is Rate-Splitting Multiple Access (RSMA), based on multi-antenna Rate-Splitting (RS), for the multi-antenna BC [45]–​[51]. This approach leverages and extends the concept of RS, originally developed in [70] for the two-user single-antenna interference channel, to design multi-antenna non-orthogonal transmission strategies for the multi-antenna BC.

A. Rate-Splitting System Model

We consider again a MISO BC consisting of one transmitter with $M$ antennas and $K$ single-antenna users. As per Fig. 6, the architecture relies on rate-splitting of messages $W_{1}$ to $W_{K}$ intended for user-1 to user-$K$ , respectively. To that end, message $W_{k}$ of user-$k$ is split into a common part $W_{\mathrm {c},k}$ and a private part $W_{\mathrm {p},k}$ . The common parts $W_{\mathrm {c},1},\ldots, W_{\mathrm {c},K}$ of all users are combined into the common message $W_{\mathrm {c}}$ , which is encoded into the common stream $s_{\mathrm {c}}$ using a codebook shared by all users. Hence, $s_{\mathrm {c}}$ is a common stream required to be decoded by all users and contains parts of messages $W_{1}$ to $W_{K}$ intended for user-1 to user-$K$ , respectively. The private parts $W_{\mathrm {p},1},\ldots,W_{\mathrm {p},K}$ , respectively containing the remaining parts of messages $W_{1}$ to $W_{K}$ , are independently encoded into private stream $s_{1}$ for user-1 to $s_{K}$ for user-$K$ . From the $K$ messages, $K+1$ streams $s_{\mathrm {c}}, s_{1},\ldots, s_{K}$ are therefore created. The streams are linearly precoded such that the transmit signal is given by \begin{equation*} \mathbf {x}=\mathbf {p}_{\mathrm {c}} s_{\mathrm {c}}+\sum _{k=1}^{K}\mathbf {p}_{k} s_{k}.\tag{34}\end{equation*} View SourceRight-click on figure for MathML and additional features. Defining $\mathbf {s}=[s_{\mathrm {c}},s_{1},\ldots,s_{K}]^{T}$ and assuming that $\mathbb {E}[\mathbf {s}\mathbf {s}^{H}]=\mathbf {I}$ , the average transmit power constraint is written as $P_{\mathrm {c}}+\sum _{k=1}^{K} P_{k}\leq P$ , where $P_{\mathrm {c}}=\|\mathbf {p}_{\mathrm {c}}\|^{2}$ and $P_{k}=\|\mathbf {p}_{k}\|^{2}$ .

FIGURE 6. - 
$K$
-user system architecture with 1-layer rate-splitting. Receiver architecture is illustrated for user-
$k$
 though the same applies to other users.
FIGURE 6.

$K$ -user system architecture with 1-layer rate-splitting. Receiver architecture is illustrated for user-$k$ though the same applies to other users.

At each user-$k$ , the common stream $s_{\mathrm {c}}$ is first decoded into $\widehat {W}_{\mathrm {c}}$ by treating the interference from the private streams as noise. Using SIC, $\widehat {W}_{\mathrm {c}}$ is re-encoded, precoded, and subtracted from the received signal, such that user-$k$ can decode its private stream $s_{k}$ into $\widehat {W}_{\mathrm {p},k}$ by treating the remaining interference from the other private stream as noise. User-$k$ reconstructs the original message by extracting $\widehat {W}_{\mathrm {c},k}$ from $\widehat {W}_{\mathrm {c}}$ , and combining $\widehat {W}_{\mathrm {c},k}$ with $\widehat {W}_{\mathrm {p},k}$ into $\widehat {W}_{k}$ . Assuming proper Gaussian signaling, the rate of the common stream is given by \begin{equation*} R_{\mathrm {c}}=\min _{k=1,\ldots, K} \log _{2}\left ({1+\frac {\left |{\mathbf {h}_{k}^{H} \mathbf {p}_{\mathrm {c}}}\right |^{2}}{1+\sum _{q=1}^{K}\left |{\mathbf {h}_{k}^{H} \mathbf {p}_{q}}\right |^{2}}}\right).\tag{35}\end{equation*} View SourceRight-click on figure for MathML and additional features. Assuming perfect SIC, the rates of the private streams are obtained as \begin{equation*} R_{k}=\log _{2}\left ({1+\frac {\left |{\mathbf {h}_{k}^{H} \mathbf {p}_{k}}\right |^{2}}{1+\sum _{q\neq k}\left |{\mathbf {h}_{k}^{H} \mathbf {p}_{q}}\right |^{2}}}\right).\tag{36}\end{equation*} View SourceRight-click on figure for MathML and additional features. The rate of user-$k$ is given by $R_{k}+R_{\mathrm {c},k}$ where $R_{\mathrm {c},k}$ is the rate of the common part of the $k$ th user’s message, i.e., $W_{\mathrm {c},k}$ , and satisfies $\sum _{k=1}^{K} R_{\mathrm {c},k}=R_{\mathrm {c}}$ . The sum-rate is therefore simply written as $R_{\mathrm {s}}^{(\text {R})}=\sum _{k=1}^{K} (R_{k}+R_{\mathrm {c},k})=R_{\mathrm {c}}+\sum _{k=1}^{K} R_{k}$ , and the MMF rate is written as $R_{\mathrm {mmf}}^{(\text {R})}=\min _{k=1,\ldots,K}R_{k}+R_{\mathrm {c},k}$ .

The above RS architecture is called 1-layer RS since it only relies on a single common stream and a single layer of SIC at each user as illustrated in Fig. 6.

B. Multiplexing Gains With Perfect CSIT

We here summarize the sum and MMF multiplexing gains achieved by 1-layer RS with perfect CSIT.

Proposition 9:

The sum multiplexing gain of $K$ -user 1-layer RS with $M$ transmit antennas and perfect CSIT is $d_{\text {s}}^{(\text {R})}=\min (M,K)$ .

Proof:

Since MU–LP is a subscheme of 1-layer RS,22 it is sufficient23 to design the private precoders using ZFBF and allocate zero power to the common stream at high SNR. Note that $d_{\text {s}}^{(\text {R})}=d_{\text {s}}^{(\text {M})}=d_{\text {s}}^{(\star)}=\min (M,K)$ .

Proposition 10:

The MMF multiplexing gain of the $K$ -user 1-layer RS with $M$ transmit antennas and perfect CSIT is \begin{align*} d_{\text {mmf}}^{(\text {R})}=\begin{cases} 1, & M\geq K \\ \frac {1}{1+K-M}, & M < K.\\ \end{cases}\tag{37}\end{align*} View SourceRight-click on figure for MathML and additional features.

The MMF multiplexing gain of 1-layer RS was derived and proved in [56],24 under the same assumption as in Remark 4. Readers are referred to [56] for more details of the proof of Proposition 10.

C. Multiplexing Gains With Imperfect CSIT

Again, we use the CSIT error model introduced in Section IV-A. We recall the sum multiplexing gain of RS with imperfect CSIT from [47].

Proposition 11:

The sum multiplexing gain of $K$ -user 1-layer RS with $M$ transmit antennas and CSIT quality $0\leq \alpha \leq 1$ is $d_{\text {s}}^{(\text {R})}=1+(\min (M,K)-1)\alpha $ .

Achievability of $d_{\text {s}}^{(\text {R})}$ in Proposition 11 is obtained by using random precoding to design $\mathbf {p}_{c}$ with power level $P_{c}=O(P)$ , transmitting $\min (M,K)$ private streams and using ZFBF to design the precoders of those $\min (M,K)$ private streams, each with power level $P_{k}=O(P^{\alpha })$ . From the SINR expressions at the right-hand side of (35), it follows that the received SINR of the common stream at each user scales as $O(P^{1-\alpha })$ , leading to the multiplexing gain of $1-\alpha $ achieved by the common stream $s_{c}$ . By performing ZFBF, the transmitter transmits $\min (M,K)$ interference-free private streams. The received SINR of each private stream scales as $O(P^{\alpha })$ leading to multiplexing gain $\alpha $ . Hence, we obtain the sum multiplexing gain of $1+(\min (M,K)-1)\alpha $ .

Importantly, for the underloaded regime $M \geq K$ , 1-layer RS achieves the information theoretic optimal sum multiplexing gain $d_{\text {s}}^{(\text {M})}=d_{\text {s}}^{(\star)}$ in the imperfect CSIT setting [47], [63]. Hence, 1-layer RS attains the optimal sum multiplexing gain for both perfect CSIT and imperfect CSIT (underloaded regime). Actually, for $M \geq K$ , 1-layer RS is optimal, achieving the maximum multiplexing gain region of the underloaded $K$ -user MISO BC25 with imperfect CSIT [71], [72].

This optimality of RS (including 1-layer RS), shown through multiplexing gain analysis, is very significant since it implies that one cannot find any other scheme achieving a better multiplexing gain region in the multi-antenna BC. As a consequence of this optimality, MU–LP and multi-antenna NOMA will always incur a multiplexing gain loss or at best will achieve the same multiplexing gain as RS for both perfect and imperfect CSIT.

Proposition 12:

The MMF multiplexing gain of $K$ -user 1-layer RS with $M$ transmit antennas and CSIT quality $0\leq \alpha \leq 1$ is \begin{align*} d_{\text {mmf}}^{(\text {R})}=\begin{cases} \frac {1+(K-1)\alpha }{K}, & M\geq K \\ \frac {1+(M-1)\alpha }{K}, & M < K \:\text {and}\: \alpha \leq \frac {1}{1+K-M}\\ \frac {1}{1+K-M}, & M < K \:\text {and}\: \alpha > \frac {1}{1+K-M}. \end{cases}\tag{38}\end{align*} View SourceRight-click on figure for MathML and additional features.

The MMF multiplexing gain of 1-layer RS with imperfect CSIT was derived in [69] (by considering the specific case where there is a single user per group), under the same assumption as in Remark 4. Readers are referred to [69] for more details of the proof of Proposition 12.

This highlights that when $M< K$ , the CSIT quality $\alpha $ can be reduced to $\frac {1}{1+K-M}$ without impacting the MMF multiplexing gain of 1-layer RS.

Following our discussion of Proposition 11, when $M\geq K$ , the respective multiplexing gains of the common and each private streams are $1-\alpha $ and $\alpha $ . The MMF multiplexing gain when $M\geq K$ is achieved by evenly sharing the common stream among users and is the sum of the evenly allocated multiplexing gain of the common stream $\frac {1-\alpha }{K}$ and the multiplexing gain of one private stream $\alpha $ , yielding $\frac {1+(K-1)\alpha }{K}$ .

When $M< K$ , the achievability is obtained by partitioning users into two subsets $\mathcal {K}_{1}$ and $\mathcal {K}_{2}$ with set sizes of $|\mathcal {K}_{1}|=M$ and $|\mathcal {K}_{2}|=K-M$ . Users in $\mathcal {K}_{1}$ are served via the common and private streams while users in $\mathcal {K}_{2}$ are served using the common stream only. Random precoding and ZFBF are respectively used for the common stream and the private streams with power allocation $P_{c}=O(P)$ and $P_{k}=O(P^{\beta }), \forall k\in \mathcal {K}_{1}$ . It may be readily shown that the respective multiplexing gains of the common stream and each private stream are given by $1-\beta $ and $\min \{\alpha,\beta \}$ , respectively. By further introducing a fraction $z\in [{0,1}]$ to specify the fraction of the rate of the common stream allocated to the users in the two subsets, we obtain that the respective sum multiplexing gains of the common stream for the users in $\mathcal {K}_{1}$ and $\mathcal {K}_{2}$ are $z(1-\beta)$ and $(1-z)(1-\beta)$ , respectively. By equally dividing the multiplexing gain of the common stream between the users in the two subsets, the multiplexing gain of each user in $\mathcal {K}_{2}$ is $d_{k,2}=\frac {(1-z)(1-\beta)}{K-M}$ , and the multiplexing gain of each user in $\mathcal {K}_{1}$ is $d_{k,1}=\min \{\alpha,\beta \}+\frac {z(1-\beta)}{M}$ . The MMF multiplexing gain of the users is $\max _{z}\min \{d_{k,1}, d_{k,2}\}$ . When $\beta =\alpha $ , the optimal rate allocation factor $z^{\star }$ is obtained when $\frac {(1-z)(1-\alpha)}{K-M}= \frac {z(1-\alpha)}{M}+\alpha $ . We have $z^{\star }=\frac {(1-\alpha -\alpha K+\alpha M)M}{(1-\alpha)K}$ and the optimal MMF multiplexing gain is $\frac {1+(M-1)\alpha }{K}$ . As $z^{\star }\in [{0,1}]$ , we have $1-\alpha -\alpha K+\alpha M\geq 0$ . Hence, when $\alpha \leq \frac {1}{1+K-M}$ , $d_{\text {mmf}}^{(\text {R})}=\frac {1+(M-1)\alpha }{K}$ . When $\beta < \alpha $ and $z=0$ , the optimal power allocation $\beta ^{\star }$ is obtained when $\frac {1-\beta }{K-M}=\beta $ . We have $\beta ^{\star }=\frac {1}{1+K-M}$ and the optimal MMF multiplexing gain is $\frac {1}{1+K-M}$ . Hence, when $\alpha > \frac {1}{1+K-M}$ , $d_{\text {mmf}}^{(\text {R})}=\frac {1}{1+K-M}$ .

For $\alpha =1$ , the results in Propositions 11 and 12 boil down to the perfect CSIT results in Propositions 9 and 10, respectively.

SECTION VIII.

Shortcomings and Misconceptions of Multi-Antenna NOMA

In this section, we first compare the multiplexing gains of multi-antenna NOMA to those of the MU–LP and 1-layer RS baselines. The sum and MMF multiplexing gains of multi-antenna NOMA, MU–LP, and 1-layer RS for both perfect and imperfect CSIT are summarized in Table 2. The objective of this section is to identify under which conditions NOMA provides performance gains/losses over the two baselines. We then use these comparisons to reveal several misconceptions and shortcomings of multi-antenna NOMA.

TABLE 2 Comparison of Sum and MMF Multiplexing Gains of Different Strategies With Perfect and Imperfect CSIT
Table 2- 
Comparison of Sum and MMF Multiplexing Gains of Different Strategies With Perfect and Imperfect CSIT

A. NOMA vs. Baseline I (MU–LP)

We show in the following corollaries that MISO NOMA can achieve a performance gain over MU–LP but it may also incur a performance loss, depending on the values of $M$ , $K$ , $G$ , and $\alpha $ .

The performance (expressed in terms of multiplexing gain) gain/loss of multi-antenna NOMA vs. MU–LP is obtained by comparing Propositions 3 and 7 (for sum multiplexing gain), and Propositions 4 and 8 (for MMF multiplexing gain), and is summarized in Corollaries 1, and 2 ($G=1$ ), and 3 ($G>1$ ), respectively. For the MMF multiplexing gain with imperfect CSIT, we consider $G=1$ and $G>1$ in two different corollaries.

Corollary 1:

The sum multiplexing gain comparison between MISO NOMA and MU–LP is summarized in (39), at the bottom of the next page.

MISO NOMA never achieves a sum multiplexing gain higher than MU–LP.

Corollary 1 shows that MISO NOMA can achieve a lower or the same sum multiplexing gain compared to MU–LP, but cannot outperform MU–LP.

If $\alpha =1$ (perfect CSIT), Corollary 1 boils down to $d_{\text {s}}^{(\text {N})}< d_{\text {s}}^{(\text {M})}$ whenever $M>G$ , and $d_{\text {s}}^{(\text {N})}=d_{\text {s}}^{(\text {M})}$ whenever $M\leq G$ . This is instrumental as it says that the slope of the sum-rate of MISO NOMA at high SNR will be strictly lower than that of MU–LP (i.e., the sum-rate of MISO NOMA will grow more slowly than that of MU–LP) whenever the number of transmit antennas is larger than the number of groups, and hence in this case, MU–LP is guaranteed to outperform MISO NOMA at high SNR. Consequently, in the massive MIMO regime where $M$ grows large, MISO NOMA would achieve a sum multiplexing gain strictly lower than MU–LP (and the role of NOMA in massive MIMO is therefore questionable as highlighted in [76]). If $G=1$ as in, e.g., [20]–​[23], MISO NOMA always incurs a sum multiplexing gain loss compared to MU–LP irrespective of $M$ (except in single-antenna systems when $M=1$ ). In other words, from a sum multiplexing gain perspective, one cannot find any multi-antenna configuration at the transmitter, i.e., any value of $M$ , that would motivate the use MISO NOMA with $G=1$ compared to MU–LP. If $G=K/2$ as in [26]–​[30], MISO NOMA incurs a sum multiplexing gain loss compared to MU–LP whenever $M>K/2$ . In other words, from a sum multiplexing gain perspective, the only multi-antenna deployments for which MISO NOMA with $G=K/2$ would not incur a multiplexing gain loss (but no improvement either) over MU–LP is when $M\leq K/2$ . Note that these conclusions are not limited to MISO NOMA. From Section V, we note that two-user MIMO NOMA incurs a sum multiplexing gain loss compared to two-user MU–LP whenever $M>N$ . If $M\leq N$ , MIMO NOMA and MU–LP achieve the same sum multiplexing gain.

If $\alpha < 1$ (imperfect CSIT), a sum multiplexing gain loss of MISO NOMA over MU–LP occurs in two different scenarios: 1) medium CSIT quality setting with $\frac {1}{\min (M,K)}< \alpha < \frac {1}{\min (M,G)}$ or 2) sufficiently large number of antennas and high CSIT quality with $M>G$ and $\alpha \geq \frac {1}{\min (M,G)}$ . In other scenarios where the CSIT quality is poor $\alpha \leq \frac {1}{\min (M,K)}$ or the CSIT quality is good $\alpha \geq \frac {1}{\min (M,G)}$ but the number of transmit antennas is low $M\leq G$ , MISO NOMA and MU–LP achieve the same sum multiplexing gains.

Corollary 2:

The MMF multiplexing gain comparison between MISO NOMA with $G=1$ and MU–LP is summarized as follows\begin{align*} d_{\text {mmf}}^{\left ({\text {N}}\right)}-d_{\text {mmf}}^{\left ({\text {M}}\right)} \begin{cases} < 0, & \textrm {if }~ \left ({M \geq K}\right)\cap \left ({\alpha >\frac {1}{K}}\right) \\ =0, & \textrm {if } ~\left ({M\geq K}\right)\cap \left ({\alpha =\frac {1}{K}}\right)\\ >0, & \textrm {if }~ \left ({M< K}\right)\cup \left ({\left ({M \geq K}\right)\cap \left ({\alpha < \frac {1}{K}}\right)}\right). \end{cases} \\ {}\tag{40}\end{align*} View SourceRight-click on figure for MathML and additional features.

Corollary 3:

The MMF multiplexing gain comparison between MISO NOMA with $G>1$ and MU–LP is summarized as follows \begin{align*} d_{\text {mmf}}^{(\text {N})}-d_{\text {mmf}}^{(\text {M})}\begin{cases} < 0, & \textrm {if }~ M \geq K \\ =0, & \textrm {if }~ M < K-g+1 \\ >0, & \textrm {if }~ K > M\geq K-g+1. \end{cases}\tag{41}\end{align*} View SourceRight-click on figure for MathML and additional features.

Corollaries 2 and 3 show that MISO NOMA can achieve either a higher or a lower MMF multiplexing gain compared to MU–LP, depending on the values of $M$ , $G$ , $K$ , and $\alpha $ .

If $\alpha =1$ (perfect CSIT), with $G=1$ as in, e.g., [20]–​[23], $d_{\text {mmf}}^{(\text {N})}>d_{\text {mmf}}^{(\text {M})}$ whenever $M< K$ , and incurs an MMF multiplexing loss otherwise ($M\geq K$ ). With $G=K/2$ as in [26]–​[30], $d_{\text {mmf}}^{(\text {N})}< d_{\text {mmf}}^{(\text {M})}$ whenever $M\geq K$ , and $d_{\text {mmf}}^{(\text {N})}>d_{\text {mmf}}^{(\text {M})}$ whenever $K>M\geq K-1$ , and $d_{\text {mmf}}^{(\text {N})}=d_{\text {mmf}}^{(\text {M})}$ whenever $M< K-1$ . In other words, from an MMF multiplexing gain perspective, the multi-antenna deployments for which MISO NOMA with $G=1$ and $G=K/2$ can outperform or achieve the same performance as MU–LP when $M< K$ .

If $\alpha < 1$ (imperfect CSIT), we note from Corollary 3, that for $G>1$ , CSIT quality $\alpha $ does not affect the operational regimes where MISO NOMA outperforms/incurs a loss compared to MU–LP. This is different from $G=1$ where the condition for $d_{\text {mmf}}^{(\text {N})}< d_{\text {mmf}}^{(\text {M})}$ is a function of $\alpha $ in Corollary 2. MISO NOMA incurs an MMF multiplexing loss whenever the number of antenna and the CSIT quality are sufficiently large, i.e., $M \geq K$ and $\alpha >\frac {1}{K}$ .

B. NOMA vs. Baseline II (RS)

We show in the following corollaries that, for all $M$ , $K$ , $\alpha $ , 1-layer RS (that relies on a single SIC at each user) achieves the same or higher (sum and MMF) multiplexing gains than the best of the MISO NOMA schemes (i.e., whatever $G$ and the number of SICs). In other words, 1-layer RS outperforms (multiplexing gain-wise) MISO NOMA and simultaneously requires fewer SICs (only one) than MISO NOMA. Hence, employing MISO NOMA over 1-layer RS can only cause a multiplexing gain loss and/or a complexity increase at the receiver.

The performance loss of MISO NOMA vs. RS is obtained by comparing Propositions 3 and 11 (for the sum multiplexing gain), and Propositions 4 and 12 (for the MMF multiplexing gain), and is summarized in Corollaries 4, and 5 ($G=1$ ), and 6 ($G>1$ ), respectively.

Corollary 4:

The sum multiplexing gain comparison between MISO NOMA and 1-layer RS is summarized as follows \begin{align*} d_{\text {s}}^{\left ({\text {N}}\right)}- d_{\text {s}}^{\left ({\text {R}}\right)}\begin{cases} < 0, & \textrm {if} \left ({0< \alpha < 1}\right)\cup \left ({\left [{\alpha >0}\right]\cap \left [{M>G}\right]}\right) \\ =0, & \textrm {if} \left ({\alpha =0}\right)\cup \left ({\left [{\alpha =1}\right]\cap \left [{M\leq G}\right]}\right). \end{cases}\!\!\! \\ {}\tag{42}\end{align*} View SourceRight-click on figure for MathML and additional features. MISO NOMA never achieves a sum multiplexing gain higher than 1-layer RS.

If $\alpha =1$ (perfect CSIT), Corollary 4 boils down to $d_{\text {s}}^{(\text {N})}< d_{\text {s}}^{(\text {R})}$ , whenever $M>G$ , and $d_{\text {s}}^{(\text {N})}=d_{\text {s}}^{(\text {R})}$ whenever $M\leq G$ .

Corollary 5:

The MMF multiplexing gain comparison between MISO NOMA with $G=1$ and 1-layer RS is summarized as follows \begin{align*} d_{\text {mmf}}^{(\text {N})}-d_{\text {mmf}}^{(\text {R})}\begin{cases} < 0, & \textrm {if } (\alpha >0)\cap (M>1) \\ =0, & \textrm {if } (\alpha =0)\cup (M = 1). \end{cases}\tag{43}\end{align*} View SourceRight-click on figure for MathML and additional features. MISO NOMA with $G=1$ never achieves an MMF multiplexing gain higher than 1-layer RS.

Corollary 6:

The MMF multiplexing gain comparison between MISO NOMA with $G>1$ and 1-layer RS is summarized in (44) as shown at the bottom of the equation.

MISO NOMA with $G>1$ never achieves an MMF multiplexing gain larger than 1-layer RS.

If $\alpha =1$ (perfect CSIT), Corollaries 5 and 6 simply boil down to $d_{\text {mmf}}^{(\text {N})}< d_{\text {mmf}}^{(\text {R})}$ , whenever $M \neq K-g+1$ , and $d_{\text {mmf}}^{(\text {N})}=d_{\text {mmf}}^{(\text {R})}$ , whenever $M = K-g+1$ .

We recall again from [71]–​[74] that RS achieves the optimal multiplexing gain region in the multi-antenna BC with imperfect CSIT and multi-antenna NOMA (and MU–LP/MU-MIMO) will therefore always incur a multiplexing gain loss compared to RS.

C. Misconceptions of Multi-Antenna NOMA

The comparisons with the MU–LP and 1-layer RS baselines reveal that depending on the particular setting NOMA may incur a multiplexing gain loss at the additional expense of an increased receiver complexity, as detailed in the following.

First, NOMA is an inefficient strategy to exploit the spatial dimensions. This issue could already be observed from the two-user MISO case with perfect CSIT, where NOMA limits the sum multiplexing gain to one, same as OMA, which is only half of the sum multiplexing gain obtained with MU–LP. Moreover, even when considering a fair metric such as MMF, NOMA limits the MMF multiplexing gain to $\frac {1}{2}$ , which is again only half of the MMF multiplexing gain obtained by MU–LP. Similarly, in the two-user MIMO case, NOMA limits the sum multiplexing gain to $\min (M,N)$ , again the same as OMA, and the MMF multiplexing gain to $\frac {\min (M,N)}{2}$ , which are lower than what is achievable with MU–LP.

In the general $K$ -user case, it is clear from Corollaries 1 and 4 that NOMA incurs a loss in sum multiplexing gain in most scenarios, and the best NOMA can achieve is the same sum multiplexing gain as the baselines in some specific configurations. NOMA with $G=1$ achieves $d_{\text {s}}^{(\text {N})}=1$ irrespectively of the number of transmit antennas $M$ , i.e., it achieves the same sum multiplexing gain as OMA and the same as a single-antenna transmitter (hence, wasting the transmit antenna array). NOMA with $G=K/2$ achieves $d_{\text {s}}^{(\text {N})}=\min (M,K/2)$ with $\alpha =1$ . On the other hand, MU–LP and 1-layer RS achieve the full sum multiplexing gain $d_{\text {s}}^{(\text {M})}=\min (M,K)$ with $\alpha =1$ .

Considering the MMF multiplexing gain of the general $K$ -user case, the situation appears to be better for NOMA. Assuming $\alpha =1$ , from Corollaries 2 and 3, we observe that NOMA incurs a loss compared to MU–LP in the underloaded regime $M \geq K$ but outperforms MU–LP in the overloaded regime. In particular, NOMA with $G=1$ achieves a higher MMF multiplexing gain than NOMA with $G=K/2$ and MU–LP whenever $M< K-1$ . Hence, though the receiver complexity increase of NOMA does not pay off in the underloaded regime, it appears to pay off in the overloaded regime (since $G=1$ with more SICs outperforms $G=K/2$ with fewer SICs). Nevertheless, the MMF multiplexing gain of NOMA with $G=1$ is independent of $M$ , suggesting again that the spatial dimensions are not properly exploited. This can indeed be seen from Corollary 5 where NOMA is consistently outperformed by 1-layer RS, i.e., the increase in MMF multiplexing gain attained by NOMA ($G=1$ ) over MU–LP is actually marginal in light of the complexity increase, and is much lower than what can be achieved by 1-layer RS with just a single SIC operation. In other words, while NOMA has some merits over MU–LP in the overloaded regime, NOMA makes inefficient use of the multiple antennas, and fails to boost the MMF multiplexing gain compared to the 1-layer RS baseline.

We note that the above observations hold for both the perfect and imperfect CSIT settings. Nevertheless, it is interesting to stress that the sensitivity to the CSIT quality $\alpha $ differs largely between MU–LP, NOMA with $G>1$ , NOMA with $G=1$ , and 1-layer RS. Indeed the sum and MMF multiplexing gains of MU–LP, NOMA with $G>1$ , and 1-layer RS decay as $\alpha $ decreases, while the multiplexing gains of NOMA with $G=1$ are not affected by $\alpha $ . This can be interpreted in two different ways. On the one hand, this implies that NOMA with $G=1$ is inherently robust to CSIT imperfections since the multiplexing gains are unchanged. On the other hand, this means that NOMA with $G=1$ is unable to exploit the available CSIT since the resulting multiplexing gain is the same as in the absence of CSIT ($\alpha =0$ ). One can indeed see from the above Propositions and Corollaries that the sum and MMF multiplexing gains for 1-layer RS with imperfect CSIT are clearly larger than those of MU–LP and NOMA. In other words, NOMA and MU–LP are inefficient in fully exploiting the available CSIT in multi-antenna settings.

We conclude from the theoretical results and above discussions that NOMA fails to efficiently exploit the multiplexing gain of the multi-antenna BC and is an inefficient strategy to exploit the spatial dimensions and the available CSIT, especially compared to the 1-layer RS baseline. The first misconception behind NOMA is to believe that because NOMA is capacity achieving in the single-antenna BC, NOMA is an efficient strategy for multi-antenna settings. As a consequence, the single-antenna NOMA principle has been applied to multi-antenna settings without recognizing that such a strategy would waste the primary benefit of using multiple antennas, namely the capability of transmitting multiple interference-free streams. In contrast to NOMA, other non-orthogonal transmission strategies such as 1-layer RS do not lead to any sum multiplexing gain loss. On the contrary, 1-layer RS achieves the information theoretic optimal sum multiplexing gain in both perfect and imperfect CSIT scenarios (and therefore has the capability of transmitting the optimal number of interference-free streams). 1-layer RS also achieves higher MMF multiplexing gains than NOMA and MU–LP.

Second, the multiplexing gain loss of NOMA is encountered despite the increased receiver complexity.26 In the two-user MISO BC with perfect CSIT, MU–LP does not require any SIC receiver to achieve the optimal sum multiplexing gain of two (assuming $M>1$ ) and an MMF multiplexing gain of one, while NOMA requires one SIC and only provides half the (sum and MMF) multiplexing gains of MU–LP. This is surprising since one would expect a performance gain from an increased architecture complexity. Here instead, NOMA causes a complexity increase at the receivers and a (sum and MMF) multiplexing gain loss compared to MU–LP, therefore highlighting that the SIC receiver is inefficiently exploited.

This inefficient use of SIC in NOMA also persists in the general $K$ -user scenario. Recall that NOMA with $G$ groups requires $g-1$ layers of SIC at the receivers. Among the two popular NOMA architectures $G=1$ and $G=K/2$ , the former requires an even higher number of SIC layers than the latter (namely $K-1$ for $G=1$ and 1 for $G=K/2$ ) and has an even lower sum multiplexing gain ($d_{\text {s}}^{(\text {N})}=1$ for $G=1$ and $d_{\text {s}}^{(\text {N})}=\min (M,K/2)$ for $G=K/2$ with $\alpha =1$ ). On the other hand, MU–LP achieves the full sum multiplexing gain $d_{\text {s}}^{(\text {M})}=\min (M,K)$ with $\alpha =1$ without any need for SIC. This highlights the inefficient (and detrimental) use of SIC receivers in NOMA: the higher the number of SICs, the lower the sum multiplexing gain!

Comparing to the 1-layer RS baseline further highlights the inefficient use of SIC in NOMA. We note that 1-layer RS causes a complexity increase at the receivers (due to the one SIC needed) but also an increase in the (sum and MMF) multiplexing gains compared to MU–LP (i.e., it is easy to see from Propositions 7, 8, 11, and 12 that the sum and MMF multiplexing gains with RS are always either identical to or higher than those with MU–LP). Hence, in contrast to NOMA, the SIC in 1-layer RS is beneficial since it boosts the (sum and MMF) multiplexing gains and therefore introduces a performance gain compared to (or at least maintains the same performance as) MU–LP. Actually, 1-layer RS achieves the information theoretic optimal sum multiplexing gain for imperfect CSIT, and does so with a single SIC per user. This shows that to achieve the information theoretic optimality, it is sufficient to use a single SIC per user.27 This is in contrast to NOMA whose sum multiplexing gain is far from optimal and for which the sum multiplexing gain decreases as the number of SICs increases. The inefficient use of SIC in NOMA is also obvious from the MMF multiplexing gain. Indeed, from Propositions 2 and 10 and Corollary 5, the single SIC in 1-layer RS achieves a much larger MMF multiplexing gain than the $K-1$ layers of SIC needed for NOMA with $G=1$ . This again illustrates how inefficient the use of SIC in NOMA often is. It also shows that there exists a non-orthogonal transmission strategy based on RS with better performance and lower receiver complexity requiring just a single SIC per user.

We conclude from the theoretical analysis and above discussion that NOMA often does not make efficient use of the SIC receivers compared to the considered baselines. The second misconception regarding multi-antenna NOMA is to believe that adopting SIC receivers always boosts the rate since the interference is fully cancelled at the receiver. Considering the two-user toy example, and comparing (2) and (14), the interference power term $|\mathbf {h}_{1}^{H} \mathbf {p}_{2}|^{2}$ appearing in the SINR of user-1 in the MU–LP rate has indeed disappeared in NOMA thanks to the SIC receiver, such that $R_{\text {M},1}\leq R_{\text {N},1}$ . However, this comes at the cost of a reduced rate for user-2 since $R_{\text {N},2}=\min (\log _{2}(1+A),R_{\text {M},2})\leq R_{\text {M},2}$ . In other words, for a given pair of precoders $\mathbf {p}_{1}$ and $\mathbf {p}_{2}$ , NOMA increases the rate (or maintains the same rate) of user-1 but decreases the rate (or maintains the same rate) of user-2 compared to MU–LP.

Third, reflecting on the above two misconceptions, the NOMA design philosophy does not leverage the extensive research in multi-user MIMO, which has been fundamental to 4G and 5G in achieving the optimal sum multiplexing gain of the multi-antenna BC with perfect CSIT and low-complexity transmitter and receiver architectures. The third misconception behind multi-antenna NOMA is to believe that, since NOMA is routinely compared to OMA in SISO BC, it is also sufficient to compare NOMA to OMA in multi-antenna settings to demonstrate its merits. In fact, the Corollaries in Sections VIII-A and VIII-B show that NOMA is far from being an efficient strategy if NOMA is compared to alternative baselines. Unfortunately, simply comparing with OMA has led the NOMA literature to the misleading conclusion that multi-antenna NOMA is an efficient strategy. It should therefore be stressed that comparing NOMA to OMA does not demonstrate the merits of NOMA in multi-antenna settings and most importantly, the baseline for any multi-antenna NOMA design, optimization, and evaluation should be MU–LP and RS, not simply OMA28! In contrast to MISO NOMA, the gain of 1-layer RS over MU–LP is guaranteed, i.e., the rate of 1-layer RS is equal to or higher than that of MU–LP, since MU–LP is a particular instance of RS when no power is allocated to the common stream.

Fourth, the SISO BC is naturally overloaded (more users than the number of transmit antennas, namely one), and NOMA was therefore concluded to be suitable for overloaded scenarios. The fourth misconception behind multi-antenna NOMA is to believe that MISO NOMA is an efficient strategy for overloaded regimes, namely whenever $K > M$ . The Corollaries in Sections VIII-A and VIII-B nevertheless expose that this is incorrect. It is clear that NOMA incurs a sum multiplexing gain erosion compared to MU–LP and 1-layer RS whenever $M>G$ . Such a loss can occur also in the overloaded regime, namely whenever we have $K > M > G$ . Moreover, NOMA incurs an MMF multiplexing gain loss compared to 1-layer RS whenever $M\neq K-g+1$ . Here again, such a loss occurs also in the overloaded regime. In contrast to NOMA (and MU–LP), 1-layer RS is an efficient strategy for both the underloaded and overloaded regimes. Though NOMA with $G=1$ was shown in Proposition 2 to achieve a non-vanishing MMF multiplexing gain of $1/K$ in the overloaded regime, this MMF multiplexing gain is considerably smaller than that of 1-layer RS, therefore highlighting the inefficiency of NOMA in the overloaded regime. In particular, we note that the MMF multiplexing gain of 1-layer RS increases with $M$ in contrast to that of NOMA with $G=1$ which is constant regardless of $M$ .

D. Illustration of the Misconceptions With an Example

To illustrate the above discussion and make the statements more explicit based on numbers, we consider a MISO BC with $K=6$ , and compare in Table 3 the sum multiplexing gains $d_{\text {s}}^{(\text {N})}$ of NOMA with $G=1$ and $G=3$ and the sum multiplexing gain of MU–LP $d_{\text {s}}^{(\text {M})}$ and 1-layer RS $d_{\text {s}}^{(\text {R})}$ (recall that $d_{\text {s}}^{(\text {M})}=d_{\text {s}}^{(\text {R})}=d_{\text {s}}^{(\star)}$ ) as a function of $M$ for perfect CSIT. We observe that NOMA incurs a sum multiplexing gain reduction (highlighted in red in Table 3) in the underloaded regime but also in the overloaded regime depending on the values of $M$ and $G$ . Specifically, in this example with $K=6$ , $G=1$ incurs a sum multiplexing erosion compared to MU–LP and 1-layer RS whenever $M\geq 2$ and $G=3$ whenever $M\geq 4$ . This shows that in an overloaded regime associated with $M< K$ , although $M$ is the limiting factor of the sum multiplexing gain in MU–LP and 1-layer RS, $\min (M,G)$ is the limiting factor in NOMA. Morever, Table 3 clearly illustrates that the higher the number of SICs in NOMA, the lower the sum multiplexing gain. NOMA with $G=1$ requires 5 layers of SIC to achieve a multiplexing gain $d_{\text {s}}^{(\text {N})}=1$ , NOMA with $G=3$ requires 1 layer of SIC and achieves at most $d_{\text {s}}^{(\text {N})}=3$ . On the other hand, MU–LP does not require any SIC and achieves the optimal sum multiplexing gain $d_{\text {s}}^{(\star)}$ (that can be as high as 6). 1-layer RS achieves the same (and optimal) sum multiplexing gain as MU–LP.

TABLE 3 Sum Multiplexing Gain With $K=6$ - Perfect CSIT
Table 3- 
Sum Multiplexing Gain With 
$K=6$
 - Perfect CSIT

Table 4 highlights the MMF multiplexing gains of NOMA, MU–LP, and 1-layer RS for $K=6$ with perfect CSIT and stresses the significant benefit of 1-layer RS over NOMA and MU–LP. The entries highlighted in red relate to configurations for which 1-layer RS provides a multiplexing gain strictly higher than that of NOMA and MU–LP. Recall that 1-layer RS provides these multiplexing gains with a single SIC per user!

TABLE 4 MMF Multiplexing Gain With $K=6$ - Perfect CSIT
Table 4- 
MMF Multiplexing Gain With 
$K=6$
 - Perfect CSIT

In Fig. 7, we further illustrate the tradeoff between the multiplexing gains and the number of SIC layers for $M=4$ , $K=6$ and perfect CSIT. We observe that 1-layer RS enables higher performance and lower receiver complexity compared to NOMA, stressing that the non-orthogonal transmission enabled by RS is much more efficient than NOMA. We see that NOMA with different $G$ is suited for very different settings in this $M=4$ , $K=6$ configuration, namely NOMA with $G=3$ performs better in terms of sum multiplexing gain, whereas NOMA with $G=1$ achieves a higher MMF multiplexing gain. The baseline 1-layer RS achieves a higher performance for both metrics and entails a lower receiver complexity.29

FIGURE 7. - Multiplexing gains with single-antenna receivers and perfect CSIT vs. number of SIC layers for 
$M=4$
, 
$K=6$
.
FIGURE 7.

Multiplexing gains with single-antenna receivers and perfect CSIT vs. number of SIC layers for $M=4$ , $K=6$ .

Though the above example was provided for perfect CSIT ($\alpha =1$ ), it is easy to calculate from the above propositions the multiplexing gains for the imperfect CSIT setting for a given CSIT quality $\alpha $ . For imperfect CSIT, the strict superiority of 1-layer RS over MU–LP and NOMA will become much more apparent, as illustrated in Fig. 8 for $\alpha =0.5$ .

FIGURE 8. - Multiplexing gains with single-antenna receivers and imperfect CSIT vs. number of SIC layers for 
$M=4$
, 
$K=6$
, 
$\alpha =0.5$
.
FIGURE 8.

Multiplexing gains with single-antenna receivers and imperfect CSIT vs. number of SIC layers for $M=4$ , $K=6$ , $\alpha =0.5$ .

In Figs. 9 and 10, the sum and MMF multiplexing gains are illustrated for $M=6$ when we vary the number of users $K$ under the assumption of perfect and imperfect CSIT. Results here again confirm that NOMA achieves a lower sum multiplexing gain than MU-LP and 1-layer RS and a lower MMF multiplexing gain than 1-layer RS.

FIGURE 9. - Sum multiplexing gain vs. number of users 
$K$
 for 
$M=6$
.
FIGURE 9.

Sum multiplexing gain vs. number of users $K$ for $M=6$ .

FIGURE 10. - MMF multiplexing gain vs. number of users 
$K$
 for 
$M=6$
.
FIGURE 10.

MMF multiplexing gain vs. number of users $K$ for $M=6$ .

Recall that the MMF multiplexing gain reflects how fast the minimum rate among all $K$ users increases with SNR. A zero MMF multiplexing gain means that the rate of the worst user does not scale with the SNR, which is something to avoid if one wants to simultaneously serve many users and maintain fairness and QoS among users. Hence, we can also interpret the results in Fig. 10 differently in terms of the number of users that a given strategy can serve while maintaining a target MMF multiplexing gain (and hence a certain QoS). We note from Fig. 10 that NOMA ($G=1$ ) with its $K-1$ SIC layers is more suitable than NOMA ($G=K/2$ ) and MU-LP to serve a large number of users when $K>M$ . Indeed for $K \geq 8$ , the MMF multiplexing gains of NOMA ($G=K/2$ ) and MU-LP collapse (are equal to 0), while that of NOMA ($G=1$ ) is strictly positive. However, it is still outperformed by 1-layer RS which can support a larger number of users than any other strategy (and any combination thereof) despite using one single SIC layer. Indeed, assuming perfect CSIT and taking for instance a target MMF multiplexing gain of 0.1, NOMA ($G=1$ ) can serve at most 10 users by using 9 SIC layers while 1-layer RS can serve 15 users with just 1 SIC layer. This can be indeed inferred from Table 2. Indeed, considering perfect CSIT and a target MMF multiplexing gain $d_{\text {mmf}}$ , NOMA ($G=1$ ) can serve $K=1/d_{\text {mmf}}$ users while 1-layer RS can serve $K=M-1+1/d_{\text {mmf}}$ users (assuming $M< K$ ). Hence, 1-layer RS with one SIC layer can serve $M-1$ extra users compared to NOMA ($G=1$ ) with $K-1$ SIC layers while guaranteeing the same MMF multiplexing gain. As the target $d_{\text {mmf}}$ decreases and both strategies can accommodate more users, NOMA requires an increasing number of SIC layers while 1-layer RS can still operate with a single SIC layer. In conclusion, 1-layer RS is significantly more efficient than NOMA since RS with only one SIC layer can support a larger number of users than NOMA with many SIC layers. This demonstrates the inefficiency of NOMA to support a large number of users.

E. Shortcomings of Multi-Antenna NOMA

The previous subsections have highlighted that comparing multi-antenna NOMA to MU–LP and 1-layer RS, instead of OMA, provides a completely different picture of the actual merits of multi-antenna NOMA. In view of the previous results highlighting the waste of multiplexing gain and the inefficient use of the SIC receivers by multi-antenna NOMA, we can ask ourselves multiple questions, which help to pinpoint the shortcomings and limitations of the multi-antenna NOMA design philosophy.

The first question is “What prevents multi-antenna NOMA from reaping the multiplexing gain of the system?” The answer lies in (5), and similarly in (20), (21), and (22). Equation (5) can be interpreted as the sum-rate of a two-user MAC with a single antenna receiver. Indeed, in (5), user-1 acts as the receiver of a two-user MAC whose effective SISO channels of both links are given by $\mathbf {h}_{1}^{H} \mathbf {p}_{2}$ and $\mathbf {h}_{1}^{H} \mathbf {p}_{1}$ . Similarly, in (20), user-1 acts as the receiver of a $g$ -user MAC whose effective SISO channels of the $g$ links are given by $\mathbf {h}_{1}^{H} \mathbf {p}_{k}$ for $k=1,\ldots,g$ . Such a MAC is well known to have a sum multiplexing gain of one [8], [17]. The multiplexing gain losses compared to the MU–LP and 1-layer RS baselines therefore come from forcing one user to fully decode all streams in a group, i.e., its intended stream and the co-scheduled streams in the group. This is radically different from MU–LP where streams are encoded independently and each receiver decodes its intended stream treating any residual interference as noise. By contrast, in 1-layer RS, no user is forced to fully decode the co-scheduled streams since all private streams are encoded independently and each receiver decodes its intended private stream treating any residual interference from the other private streams as noise.

The second question is “Does an increase in the number of SICs always come with a reduction in the sum multiplexing gain?” The answer is clearly no. This anomaly is deeply rooted in the way MISO NOMA was developed by applying the single-antenna NOMA principle to multi-antenna settings. The proof of Proposition 1 indeed tells us that the fundamental principle of NOMA consisting in forcing one user in each group to fully decode the messages of $g-1$ co-scheduled users is an inefficient design in multi-antenna settings that leads to a sum multiplexing gain reduction in each group.

The third question is “Are non-orthogonal transmission strategies inefficient for multi-antenna settings?” The answer is no. As we have seen, there exist frameworks of non-orthogonal transmission strategies also relying on SIC, such as RS, that do not incur the limitations of multi-antenna NOMA and make efficient use of the non-orthogonality and SIC receivers in multi-antenna settings. The key for the design of such non-orthogonal strategies is not to fall into the trap of blindly applying the SISO NOMA principle to multi-antenna settings, and therefore constraining the strategy to always fully decode the message of other users. Non-orthogonal transmission strategies and multiple access need to be re-thought for multi-antenna settings and one such strategy is based on the multi-antenna Rate-Splitting (RS) and Rate-Splitting Multiple Access (RSMA) literature for the multi-antenna BC.

The fourth question is “Since NOMA and RS both rely on SIC, is there any relationship between NOMA and RS?” The answer is yes in a two-user setting, but not necessarily in the general $K$ -user case as it would depend on the specifc RS scheme used. In the two-user case, 1-layer RS is a superset of MU–LP, NOMA, and multicasting, i.e., MU–LP, NOMA, and multicasting are particular instances of 1-layer RS, as shown in [77] and in Table 5 and Fig. 11. Indeed, MU–LP is obtained as a special case from 1-layer RS by allocating no power to the common stream ($P_{\mathrm {c}}=0$ ) such that $W_{k}$ is encoded directly into $s_{k}$ . No interference is decoded at the receiver using the common message, and the interference between $s_{1}$ and $s_{2}$ is fully treated as noise. NOMA is obtained by encoding $W_{2}$ entirely into $s_{\mathrm {c}}$ (i.e., $W_{\mathrm {c}}=W_{2}$ ) and $W_{1}$ into $s_{1}$ , and turning off $s_{2}$ ($P_{2}=0$ ).30 In this way, user-1 fully decodes the interference created by the message of user-2. OMA is a sub-strategy of MU–LP and NOMA, which is encountered when only user-1 (with the stronger channel gain) is scheduled ($P_{\mathrm {c}}=0, P_{2}=0$ ). Multicasting is obtained when both $W_{1}$ and $W_{2}$ are entirely encoded into $s_{\mathrm {c}}$ . In the $K$ -user case, 1-layer RS is a superset of MU–LP since by turning off (i.e., allocating no power to) the common stream, 1-layer RS boils down to MU–LP. On the other hand, 1-layer RS is not a superset of NOMA. 1-layer RS and NOMA are particular instances/schemes of the RSMA framework based on the generalized RS relying on multiple layers of SIC at each receiver [50], [51], [78], [89],31 as illustrated in Fig. 12. As stated in the introduction, NOMA refers to communication schemes where at least one user is forced to fully decode the message(s) of other co-scheduled user(s). MU-LP and RSMA do not do that since they both do not force users to fully decode the messages of other co-scheduled users. MU-LP actually treats any residual interference as noise, and RSMA is built upon the principle of splitting the messages so as to partially treat interference as noise and partially decode the remaining interference. Consequently, RSMA is a superset of MU-LP and NOMA as per Fig. 12.

TABLE 5 Messages-to-Streams Mapping in Two-User MISO BC
Table 5- 
Messages-to-Streams Mapping in Two-User MISO BC
FIGURE 11. - The relationship between existing strategies and 1-layer RS in two-user case. Each set illustrates the optimization space of the corresponding communication strategy. The optimization space of 1-layer RS is larger such that MU–LP, NOMA, and multicasting are just subsets.
FIGURE 11.

The relationship between existing strategies and 1-layer RS in two-user case. Each set illustrates the optimization space of the corresponding communication strategy. The optimization space of 1-layer RS is larger such that MU–LP, NOMA, and multicasting are just subsets.

FIGURE 12. - The relationship between existing strategies and the 
$K$
-user RSMA framework.
FIGURE 12.

The relationship between existing strategies and the $K$ -user RSMA framework.

The fifth question is “How does 1-layer RS achieve simultaneously higher multiplexing gains and a lower receiver complexity than NOMA?” In view of the previous sections, the key is to build non-orthogonal transmission strategies upon MU–LP (and therefore SDMA/multi-user MIMO) such that the performance benefits (including sum multiplexing gain) of MU–LP are guaranteed but extra performance (e.g., in MMF multiplexing gain) is observed by the use of SIC receivers. Indeed, a performance gain over MU–LP should be expected from a more complex receiver architecture in the multi-antenna BC. To achieve this, one should enable the flexibility at the transmitter to encode messages such that parts of them can be decoded by all users using SIC while the remaining parts are decoded by their intended receivers and treated as noise by non-intended receivers. Hence, we provide the flexibility to partially decode interference and partially treat the remaining interference as noise. This contrasts with MU–LP where interference is always treated as noise, and with NOMA where interference is fully decoded. This flexibility is achieved by extending the concept of RS, originally developed in [70] for the two-user single-antenna interference channel, to the multi-antenna BC. To manage multi-user interference by partially decoding the interference and treating the remaining interference as noise, RS facilitates a complete message-to-streams mapping flexibility for each user to have part of its message transmitted in the common stream and the remaining part in one of the $K$ private streams. By adjusting the power levels of the common and private streams, one can adjust the amount of interference caused to the private streams such that its level is weak enough to be treated as noise. This contrasts with MU–LP where the communication strategy is fundamentally constrained such that the messages are mapped to private streams only (i.e., there is no common stream, and multi-user interference between private streams is treated as noise even when its level is not weak enough to be treated as noise), and with NOMA where the constraint is that the entire message of one of the users is mapped onto a common stream (e.g., $W_{2}$ mapped to $s_{2}$ decoded by both user-1 and user-2 in Section II). These constraints imposed by MU–LP and NOMA are well illustrated by the message-to-stream mapping in Table 5 [77] and by the following example.

Example 1:

To further illustrate the split of the messages and the flexibility of RS, let us consider a two-user scenario. Let us imagine that the message of user-$1\,\,W_{1}=(a_{1}\:a_{2}\:a_{3}\:a_{4})\in \mathcal {W}_{1}=\{0000,0001,0010,\ldots,1111\}$ , where $|\mathcal {W}_{1}|=16$ . Similarly, the message of user-2 is $W_{2}=(b_{1}\:b_{2}\:b_{3})\in \mathcal {W}_{2}=\{000,001,010,\ldots,111\}$ , where $|\mathcal {W}_{2}|=8$ . In SDMA/MU–LP, $W_{1}$ would be encoded into $s_{1}$ and $W_{2}$ into $s_{2}$ . In NOMA, $W_{1}$ would be encoded into $s_{1}$ and $W_{2}$ into $s_{\mathrm {c}}$ . In RS, we split user-1’s message in, e.g., $W_{\mathrm {c},1}=(a_{1}\:a_{2})$ , $W_{\mathrm {p},1}=(a_{3}\:a_{4})$ , and user-2’s message in, e.g., $W_{\mathrm {c},2}=(b_{1})$ , $W_{\mathrm {p},2}=(b_{2}\:b_{3})$ . The common message is then constructed as $W_{\mathrm {c}}=(W_{\mathrm {c},1}\:W_{\mathrm {c},2})=(a_{1}\:a_{2}\:b_{1})$ , which is then encoded into $s_{\mathrm {c}}$ . $W_{\mathrm {p},1}$ and $W_{\mathrm {p},2}$ are encoded into $s_{1}$ and $s_{2}$ , respectively.

A consequence of the above flexibility is that by decreasing the amount of power allocated to the common stream, $K$ -user 1-layer RS progressively converges to $K$ -user MU–LP and in the limit where no power is allocated to the common stream, $K$ -user 1-layer RS swiftly boils down to $K$ -user MU–LP. Hence, 1-layer RS really builds upon MU–LP and MU–LP is a subscheme of 1-layer RS, which provides a guarantee to 1-layer RS that its rate and multiplexing gains are always the same or better than those of MU–LP. This is completely different from MISO NOMA. MISO NOMA does not build upon MU–LP. With $G$ groups, $K$ -user MISO NOMA can boil down to $G$ -user MU–LP by turning off the power to the weaker users in each group, but $K$ -user MISO NOMA can mathematically never boil down to $K$ -user MU–LP (recall footnote 13). The rate/multiplexing gains of $K$ -user MISO NOMA can therefore be worse than that of $K$ -user MU–LP.

Another interpretation arises by noting that MU–LP (and other forms of multi-user MIMO), as one extreme, can be viewed as a full transmit-side interference management strategy. On the other extreme, NOMA can be seen as a full receiver-side interference cancellation strategy. In between stands RS that can be viewed as a smart combination of transmit-side and receive-side interference management/cancellation strategies where the contribution of the common stream is adjusted according to the level of interference that can be canceled by the receiver.

Consequently, RS is an enabler of a general class of communication strategies and can cover a wider set of communication strategies than SDMA and NOMA, which leads to significant multiplexing gain and complexity reduction benefits.

The sixth question is “Can we use other types of receivers than SIC for NOMA and RS and would the multiplexing gains be improved?” We can indeed use other types of receivers but the multiplexing gains will not improve. Instead of using stream-by-stream SIC, we can use any other joint (Maximum Likelihood) decoder. Hence, a strong user in NOMA could use a joint decoder to decode its intended stream jointly with all other streams intended for its co-scheduled users in the same group. The multiplexing gains would not improve since the strong user would still act as the receiver of an effective MAC (as discussed in relationship with (5), (20), (21), and (22) and the first question) which limits the multiplexing gains. Similarly, in 1-layer RS, each user could use a joint decoder to decode its private stream jointly with the common stream and the multiplexing gains would not improve (recall that 1-layer RS already achieves the information theoretic optimal multiplexing gain region, hence any other scheme, receiver or multi-layer RS would not increase the multiplexing gains any further).

The seventh question is “When does it make sense to use NOMA?” As we have seen from the multiplexing gain analysis, RS achieves the same or higher multiplexing gains than NOMA with a lower number of SIC layers. Hence, it is difficult to motivate the use of NOMA based on the above analysis. Nevertheless, recall that our analysis relies on having the concatenated matrix of the user channels being full rank, or in other words that the user channels are not aligned, as per footnotes 11 and 16. Whenever the channels are aligned (though aligned channels are unlikely to occur in real wireless settings) and CSIT is perfect, NOMA can achieve the same performance as DPC, and could therefore be used as an alternative to DPC32 in that outlier scenario [41], [42], [44]. This should not appear as a surprise since a multi-antenna setting with aligned channel vectors can effectively be seen as a SISO setting where users are distinguished only based on their channel strengths. In such a SISO setting (i.e., degraded BC), it is well-known that both NOMA and DPC are capacity achieving [8], [9], [17].

Once the channels are not aligned, our results show that NOMA generally incurs a multiplexing gain loss. This corroborates our previous results [77] that showed that in a 2-user MISO BC, RS always outperforms NOMA. In particular, RS was shown to boil down to NOMA and achieve the same rate performance as NOMA whenever the following conditions are met: 1) the SNR is low, 2) the channels are closely aligned, 3) there is a sufficiently large disparity of channel gains, and 4) the CSIT is perfect. In this regime, all NOMA, RS, and DPC schemes achieve very similar performance (if not the same performance). As we depart from that regime, NOMA incurs a loss over RS (and DPC) due to its inferior multiplexing gain.

SECTION IX.

Numerical Results

Through numerical evaluation, we illustrate the misconceptions and the shortcomings of MISO NOMA. Moreover, we show that, by adopting 1-layer RS, the optimal sum multiplexing gain of the MISO BC is guaranteed in both underloaded and overloaded deployments for both perfect and imperfect CSIT scenarios. Furthermore, results also demonstrate that the MMF multiplexing gain (and MMF rate) is significantly enhanced when using 1-layer RS compared to MU–LP and MISO NOMA, and the complexity of the receivers is reduced compared to MISO NOMA. In other words, our evaluations show that 1-layer RS makes a more efficient use of the spatial dimensions (multiplexing gains) and of the SIC receivers than MISO NOMA, and it is more robust to CSIT inaccuracy.

The following two precoder optimization problems are solved in the simulation for the $K$ -user MISO NOMA system model specified in Section III-A. The first problem is maximizing the sum-rate of MISO NOMA subject to the transmit power constraint, which is given by \begin{align*} \max _{\mathbf {P}}&\sum _{k\in \mathcal {K}}R_{k}^{(\text {N})}\tag{45a}\\ \text {s.t.}&~\text {tr}(\mathbf {P}\mathbf {P}^{H})\leq P\tag{45b}\end{align*} View SourceRight-click on figure for MathML and additional features. where $R_{k}^{(\text {N})}$ is the rate of user-$k$ in the MISO NOMA system as specified in (17)–​(19). The second problem is maximizing the minimum rate subject to the transmit power constraint, which is formulated as \begin{align*} \max _{\mathbf {P}}&\min _{k\in \mathcal {K}}R_{k}^{(\text {N})}\tag{46a}\\ \text {s.t.}&~\text {tr}(\mathbf {P}\mathbf {P}^{H})\leq P.\tag{46b}\end{align*} View SourceRight-click on figure for MathML and additional features. The Weighted Minimum Mean Square Error (WMMSE) optimization framework proposed in [80] (originally developed for MU–LP) is extended to solve both problems (45) and (46). The details of the algorithm are specified in Appendix B. The optimization problems requiring interior-point methods are solved using the CVX toolbox [79].

We will assume $K=6$ in the simulations, so as to be able to relate the numerical results to the theoretical results of Tables 3 and 4. The channel $\mathbf {h}_{k}$ of user-$k$ has i.i.d. complex Gaussian entries drawn from the distribution $\mathcal {CN}(0,\sigma _{k}^{2})$ . The presented results are averaged over 100 channel realizations.

The following five strategies are compared and analyzed for both perfect and imperfect CSIT:

  • MISO NOMA ($G={3}$ ): MISO NOMA ($G=3$ ) is the MISO NOMA strategy specified in Section III-A when $G=3$ . Each user requires $\frac {K}{3}-1=1$ layer of SIC (since each user can be selected as the “strong user” in the corresponding user group). Ideally, the sum-rate (or max-min) rate is maximized by solving (45) (or (46)) for all possible user grouping methods and decoding orders within each group. Due to the high computational complexity of jointly optimizing the precoders, grouping, and decoding order, we assume that the user grouping is fixed33 while the decoding order in each group $i$ is the ascending order of users’ channel strength $\|\mathbf {h}_{k}\|, \forall k\in \mathcal {K}_{i}$ in the following results. To keep aligned with the system model in Section III-A, user indices are updated within each group such that $\|\mathbf {h}_{k}\|\leq \|\mathbf {h}_{j}\|, \forall k< j$ and $k,j\in \mathcal {K}_{i}$ . When the CSIT is imperfect, the decoding order follows the same method but based on $\|\hat {\mathbf {h}}_{k}\|, \forall k\in \mathcal {K}_{i}$ .

  • MISO NOMA ($G={1}$ ): MISO NOMA ($G=1$ ) is the MISO NOMA strategy in Section III-A when $G=1$ . Each user requires $K-1=5$ layers of SIC (since each user can potentially be selected as the “strong user”). There is no user grouping optimization issue at the transmitter since all users are assumed to be in the same user group. However, the decoding order at users should be jointly optimized with the precoders in order to maximize the sum-rate (or the max-min rate), which however, is computationally prohibitive. Following the literature of single-cell MISO NOMA [22], [23], we assume that the decoding order is the ascending order of the users’ channel strength $\|\mathbf {h}_{k}\|, \forall k\in \mathcal {K}$ . User indices are updated such that $\|\mathbf {h}_{k}\|\leq \|\mathbf {h}_{j}\|, \forall k< j$ and $k,j\in \mathcal {K}$ . Similarly, the decoding order follows the same method but based on $\|\hat {\mathbf {h}}_{k}\|, \forall k\in \mathcal {K}$ when the CSIT is imperfect.

  • MU–LP: MU–LP is the baseline strategy studied in Section VI. Each user directly decodes the intended stream by fully treating the interference as noise. The WMMSE algorithm specified in Appendix B can be applied and extended to solve the corresponding sum-rate and max-min problems of MU–LP [56], [80]. The transmitter and receiver complexity of MU–LP is low since there is no SIC deployed at each user and no user grouping and decoding order optimization issue at the transmitter.

  • Orthogonal Multiple Access (OMA): This is the single-user transmission where only the user with the highest channel strength is served.

  • 1-layer RS: 1-layer RS is the RS strategy we specified in Section VII. The corresponding sum-rate and max-min rate maximization problems are solved by using the WMMSE algorithm proposed in [47], [56]. Compared with MISO NOMA, the transmitter and receiver complexities of 1-layer RS are much reduced. Similarly to MU–LP, no user grouping and decoding order optimization is needed. Each user only requires a single layer of SIC.

A. Perfect CSIT

Following [47], the initialization of the precoding matrix $\mathbf {P}$ in Algorithm 1 is designed by using Maximum Ratio Transmission (MRT) combined with Singular Value Decomposition (SVD). Specifically, the precoder for the message to be decoded by a group of users is designed based on the SVD of the channel matrix formed by the channel vectors of the corresponding users while the precoder for the message to be decoded by a single user is designed based on MRT. For example, when considering MISO NOMA ($G=3$ ), the message for user-$k, k\in \mathcal {K}_{i}$ , is decoded by users-$\{j{\mid } j\leq k,j\in \mathcal {K}_{i}\}$ . The precoders are initialized as $\mathbf {p}_{k}=\sqrt {p_{k}}\widehat {\mathbf {p}}_{k}$ , where $\widehat {\mathbf {p}}_{k}$ is the largest left singular vector of the channel estimate ${\mathbf {H}}_{k}$ formed by channels $\{{\mathbf {h}}_{j}{\mid } j\leq k,j\in \mathcal {K}_{i}\}$ . The precoder $\mathbf {p}_{k}$ of the stream to be decoded at last in each group is initialized as $\mathbf {p}_{k}=\sqrt {p_{k}}\frac {{\mathbf {h}}_{k}}{||{\mathbf {h}}_{k}||}$ , where $p_{k}$ is the power allocated to the corresponding precoder $\mathbf {p}_{k}$ and it satisfies that $\sum _{k=1}^{K}p_{k}=P$ .

Algorithm 1: - AO Algorithm
Algorithm 1:

AO Algorithm

Fig. 13 illustrates the sum-rate vs. SNR comparison of the five strategies considered when there are $K=6$ users and the number of transmit antennas is $M=3$ and $M=6$ . In Fig. 13(a) and Fig. 13(b), all users have equal channel variances, i.e., $\sigma _{k}^{2}=1,\forall k\in \mathcal {K}$ while the users’ channel variances are randomly generated from [0.1, 1] in Fig. 13(c) and Fig. 13(d), i.e., $\sigma _{k}^{2}\in [{0.1, 1}],\forall k\in \mathcal {K}$ . In other words, the average channel strength disparities among users are randomly generated between 0 and 10 dB34 in Fig. 13(c) and Fig. 13(d). In the high SNR regime of each subfigure, the multiplexing gains of all strategies are found to match the theoretical sum multiplexing gains specified in Table 3. Specifically, when $M=3, K=6$ , the sum multiplexing gains of 1-layer RS, MU–LP, and MISO NOMA ($G=3$ ) in Fig. 13(a) and Fig. 13(c) approach $d_{\text {s}}^{(\star)}=3$ (which is optimal). In Fig. 13(c) and Fig. 13(d) where $M=K=6$ , the sum multiplexing gains of 1-layer RS and MU–LP are $d_{\text {s}}^{(\star)}=6$ . The sum multiplexing gain of MISO NOMA ($G=3$ ) remains 3. The sum multiplexing gains of MISO NOMA ($G=1$ ) and OMA are limited to 1 in all subfigures of Fig. 13. Therefore, MISO NOMA has a reduced sum multiplexing gain, inefficiently makes use of the available multiple antennas, and incurs a significant rate loss, especially at medium and high SNRs. It is not an efficient strategy for multi-antenna settings. The first misconception behind multi-antenna NOMA is confirmed.

FIGURE 13. - Sum-rate vs. SNR comparison of different strategies, 
$K=6$
.
FIGURE 13.

Sum-rate vs. SNR comparison of different strategies, $K=6$ .

As pointed out earlier in this section, the complexity of MISO NOMA at both the transmitter and the receiver is the highest among all strategies studied in this work. At the transmitter, the scheduling complexity is high since the user grouping and decoding order are required to be jointly optimized with the precoders. At the receivers, each user requires multiple layers of SIC and the number of SIC layers at each user increases with the number of users $K$ in the system. In addition to such a high complexity, as evident from Fig. 13, the sum-rate performance of MISO NOMA is worse than that of MU–LP35 which exhibits a much lower complexity at the transmitter and each receiver. Adopting SIC receivers does not always boost the rate performance. On the contrary, an inefficient and inappropriate use of SIC as in MISO NOMA can make the rate performance worse than simply not using SIC (as in MU–LP). This illustrates the second misconception behind multi-antenna NOMA.

We also observe from Fig. 13 that the sum-rate performance of OMA and MISO NOMA ($G=1$ ) is the worst, which is also reflected in their sum multiplexing gains. Hence, comparing MISO NOMA with OMA is not sufficient in multi-antenna settings. Both MU–LP and 1-layer RS should be considered as the baselines for all MISO NOMA schemes. This verifies the third misconception behind multi-antenna NOMA.

In Fig. 14 and Fig. 15, we focus on the MMF rate performance when there are $K=6$ users and the number of transmit antennas is varied from $M=3$ to $M=6$ . All users have equal channel variances in Fig. 14 while the users’ channel variances are randomly generated from [0.1, 1] in Fig. 15. The MMF multiplexing gains of all the strategies in both Fig. 14 and Fig. 15 match the corresponding theoretical MMF multiplexing gain results specified in Table 4. In the overloaded regime when $M=3/4/5$ , the corresponding MMF multiplexing gains of MISO NOMA ($G=3$ ) and MISO NOMA ($G=1$ ) are $d_{\text {mmf}}^{(\text {N,G=3})}=0/0/\frac {1}{2}$ , and $d_{\text {mmf}}^{(\text {N,G=1})}=\frac {1}{6}/\frac {1}{6}/\frac {1}{6}$ , respectively. In contrast, the MMF multiplexing gain of 1-layer RS is $d_{\text {mmf}}^{(\text {R})}=\frac {1}{4}/\frac {1}{3}/\frac {1}{2}$ when $M=3/4/5$ , which is significantly higher. The low MMF multiplexing gains of the MISO NOMA strategy translates into a poor MMF rate performance as illustrated in Fig. 14 and Fig. 15. Though MISO NOMA has been promoted as a strategy to enhance user fairness and to deal with overloaded regimes, its MMF rate in the overloaded regime is actually worse than that of 1-layer RS. MISO NOMA is not an efficient strategy for overloaded regimes. This underscores the validity of the fourth misconception behind multi-antenna NOMA.

FIGURE 14. - Max-min rate vs. SNR comparison of different strategies, 
$K=6$
, 
$\sigma _{k}^{2}=1, \forall k\in \mathcal {K}$
.
FIGURE 14.

Max-min rate vs. SNR comparison of different strategies, $K=6$ , $\sigma _{k}^{2}=1, \forall k\in \mathcal {K}$ .

FIGURE 15. - Max-min rate vs. SNR comparison of different strategies, 
$K=6$
, 
$\sigma _{k}^{2}\in [{0.1, 1}], \forall k\in \mathcal {K}$
.
FIGURE 15.

Max-min rate vs. SNR comparison of different strategies, $K=6$ , $\sigma _{k}^{2}\in [{0.1, 1}], \forall k\in \mathcal {K}$ .

B. Imperfect CSIT

Let us now consider ergodic sum-rate and minimum ergodic rate maximization problems when the CSIT is imperfect. The two problems are solved by extending the WMMSE algorithm specified in Section X to the corresponding imperfect CSIT setting [47]. This is achieved by using the Sample Average Approximation (SAA) method [82] to transform the original ergodic problem to its deterministic counterpart and then using WMMSE to solve the corresponding deterministic problem. In the following results, for a given channel estimate $\hat {\mathbf {h}}_{k}, k\in \mathcal {K}$ , $M=1000$ channel samples are generated. The ergodic sum-rate or max-min ergodic rate is obtained by averaging over 100 channel estimates. The channel estimate $\hat {\mathbf {h}}_{k}$ and channel estimation error $\tilde {\mathbf {h}}_{k}$ have i.i.d. complex Gaussian entries respectively drawn from the distributions $\mathcal {CN}(0,\sigma _{k}^{2}-\sigma _{e,k}^{2})$ , $\mathcal {CN}(0,\sigma _{e,k}^{2})$ , where $\sigma _{e,k}^{2}=\sigma _{k}^{2}P^{-\alpha }$ . As only channel estimate $\hat {\mathbf {h}}_{k}, k\in \mathcal {K}$ , is known at the transmitter, the precoders are initialized using the same method as in the perfect CSIT scenario but based on realistic channel estimates. Figs. 16, 17, and 18 are the imperfect CSIT results corresponding to Fig. 13, 14, and 15, respectively. The unspecified parameters in this subsection remain the same as the corresponding ones used for perfect CSIT.

FIGURE 16. - Sum-rate vs. SNR comparison of different strategies with imperfect CSIT, 
$\alpha =0.5$
, 
$K=6$
.
FIGURE 16.

Sum-rate vs. SNR comparison of different strategies with imperfect CSIT, $\alpha =0.5$ , $K=6$ .

FIGURE 17. - Max-min rate vs. SNR comparison of different strategies with imperfect CSIT, 
$\alpha =0.5$
, 
$K=6$
, 
$\sigma _{k}^{2}=1, \forall k\in \mathcal {K}$
.
FIGURE 17.

Max-min rate vs. SNR comparison of different strategies with imperfect CSIT, $\alpha =0.5$ , $K=6$ , $\sigma _{k}^{2}=1, \forall k\in \mathcal {K}$ .

FIGURE 18. - Max-min rate vs. SNR comparison of different strategies with imperfect CSIT, 
$\alpha =0.5$
, 
$K=6$
, 
$\sigma _{k}^{2}\in [{0.1, 1}], \forall k\in \mathcal {K}$
.
FIGURE 18.

Max-min rate vs. SNR comparison of different strategies with imperfect CSIT, $\alpha =0.5$ , $K=6$ , $\sigma _{k}^{2}\in [{0.1, 1}], \forall k\in \mathcal {K}$ .

Fig. 16 illustrates the sum-rate vs. SNR comparison of the five strategies for imperfect CSIT. The sum multiplexing gains of all strategies in Fig. 16 match the theoretical sum multiplexing gains in Table 2. When $\alpha =0.5$ and $M=3/6$ , the sum multiplexing gains of the five strategies are $d_{\text {s}}^{(\text {R})}=2/3.5$ for 1-layer RS, $d_{\text {s}}^{(\text {M})}=1.5/3$ for MU–LP, $d_{\text {s}}^{(\text {N,G=3})}=1.5/1.5$ for MISO NOMA ($G=3$ ), and $d_{\text {s}}^{(\text {N,G=1})}=d_{\text {s}}^{(\text {O})}=1/1$ for MISO NOMA ($G=1$ ) and OMA. As suggested by the multiplexing gain results, where MISO NOMA ($G=1$ ) has the lowest multiplexing gain, we also observe from Fig. 16 that though MISO NOMA ($G=1$ ) has the highest receiver complexity, its ergodic sum rate performance is the worst even in the preferred NOMA overloaded setting when the users have channel strength disparities. MISO NOMA ($G=1$ ) always achieves a worse sum-rate than MU–LP. It is not beneficial for enhancing the sum-rate of multi-antenna scenarios regardless of whether perfect or imperfect CSIT is used. In comparison, 1-layer RS achieves explicit sum multiplexing gains and sum-rate improvement over all other strategies.

Figs. 17 and 18 illustrate the MMF ergodic rate results. In general, the MMF multiplexing gains of all strategies in both figures match the theoretical MMF multiplexing gain results specified in Table 2. When $M=3/4/5/6$ , the corresponding MMF multiplexing gains of MISO NOMA ($G=3$ ) and MISO NOMA ($G=1$ ) when $\alpha =0.5$ are $d_{\text {mmf}}^{(\text {N,G=3})}=0/0/\frac {1}{4}/\frac {1}{4}$ and $d_{\text {mmf}}^{(\text {N,G=1})}=\frac {1}{6}/\frac {1}{6}/\frac {1}{6}/\frac {1}{6}$ , respectively, and the corresponding MMF multiplexing gain of MU–LP and RS are $d_{\text {mmf}}^{(\text {M})}=0/0/0/\frac {1}{2}$ , and $d_{\text {mmf}}^{(\text {R})}=\frac {1}{4}/\frac {1}{3}/\frac {1}{2}/\frac {1}{2}$ . We observe that 1-layer RS achieves significantly higher multiplexing gains, which is also reflected in the MMF ergodic rate performance in Figs. 17 and 18. In both the perfect and imperfect CSIT settings, user fairness cannot be improved by MISO NOMA. The MMF ergodic rate performance of MISO NOMA is much worse than that of 1-layer RS.

Therefore, the four misconceptions behind multi-antenna NOMA are further verified for imperfect CSIT. Higher sum-rate and MMF rate gaps between RS and MU–LP/multi-antenna NOMA are generally observed by comparing the corresponding perfect and imperfect CSIT results. By partially decoding the interference and treating the remaining interference as noise, 1-layer RS is more robust to CSIT inaccuracy. The large performance gain of RS makes it an appealing strategy for application in future communication networks.

C. Discussions

The presented simulations fully validate the theoretical multiplexing gain analysis and confirm the inefficiency of MISO NOMA. We therefore conclude that the fundamental design principle of NOMA, namely forcing one user to decode the message(s) of other user(s), should be reconsidered or very carefully used for multi-antenna settings.

Thanks to its ability to partially decode interference and partially treat interference as noise, 1-layer RS achieves equal or higher sum-rate and MMF rate performance than all other strategies in both underloaded and overloaded regimes, especially when it comes to metrics that favor user fairness (e.g., MMF rate) in an overloaded regime. This is due to the fact that the inter-user interference becomes stronger in the setting when all users are active and the number of transmit antennas is limited. The superiority of 1-layer RS in managing multi-user interference becomes more pronounced when users suffer from stronger interference. Most importantly, 1-layer RS requires no user grouping and decoding order optimization at the transmitter and only one layer of SIC at each user. Compared with MISO NOMA, the sum-rate and MMF rate performance gain of RS comes at a much reduced transmitter and receiver complexity. 1-layer RS enables a better trade-off between the rate performance gains and the number of SIC layers. Hence, we conclude that 1-layer RS is a more powerful and promising strategy for multi-antenna networks.

Though the evaluations have been limited to 1-layer RS as the basic RSMA scheme, further rate enhancements over 1-layer RS can be obtained with multi-layer RS where the message of a user is split multiple times and multiple SIC layers are implemented at the receivers, as demonstrated in [50], [51], [78], [89], [90].

SECTION X.

Conclusions, Future Research, and Pathways to 6G Standardization

This paper provides a broad, different, and useful perspective on multi-antenna NOMA and non-orthogonal transmission to the community working on NOMA and multiple access, and to the future generations of researchers working on multi-user multi-antenna communications. Although NOMA in single-antenna settings has been well understood for a long time, the paper shows that the design of non-orthogonal transmission strategies for multi-antenna settings should be done with care so as to benefit from the multi-antenna dimensions and SIC receivers.

The paper showed in Section II that two-user multi-antenna NOMA increases the receiver complexity and at the same time incurs a loss in multiplexing gain (and therefore rate at high SNR) compared to conventional multiuser precoding (as in used in 4G and 5G), therefore raising concerns on the efficiency of multi-antenna NOMA. Subsequently, a general $K$ -user setting with perfect CSIT and imperfect CSIT were studied in Section III and Section IV, respectively and various multiplexing gains of multi-antenna NOMA were derived. Then, we introduced two baseline schemes, namely $K$ -user conventional multiuser precoding in Section VI and $K$ -user multi-antenna rate-splitting in Section VII, and studied the multiplexing gains of those schemes. Section VIII compares the multiplexing gains of all considered schemes and provides strong theoretical grounds for performance comparisons among all schemes. In particular, it identifies the scenarios where NOMA incurs a gain and a loss compared to multiuser linear precoding and demonstrates how NOMA always leads to lower multiplexing gains than rate-splitting though it makes use of a larger number of SIC layers at the receivers. This section is instrumental and exposes various misconceptions and shortcomings of multi-antenna NOMA. Simulation results are then used in Section IX to confirm our findings and predictions from the multiplexing gain analysis.

Our results show that NOMA is not an efficient solution to cope with the high throughput, reliability, heterogeneity of QoS, and connectivity requirements of the downlink of future 5G and beyond multi-antenna wireless networks. This is due to the fact that the fundamental principle of NOMA consisting in forcing one user in each group to fully decode the messages of other co-scheduled users is an inefficient design in multi-antenna settings. Consequently, the benefits to the research community and future standards and networks of multi-antenna NOMA for downlink communications (e.g., MISO/MIMO techniques for NOMA, NOMA for massive MIMO and cell-free massive MIMO, multi-antenna NOMA for millimeter and terahertz communications, NOMA for multi-beam satellite communications, multi-antenna NOMA in reconfigurable intelligent surfaces, multi-antenna in Multiuser Superposition Transmission (MUST) in 3GPP, etc) are questionable and should be considered carefully in light of the results in this paper.

Instead, non-orthogonal transmission strategies for multi-antenna settings should be designed such that interference is partially decoded and partially treated as noise based on the rate-splitting (multiple access) literature so as to truly benefit from multi-antenna transmitters (and potentially multi-antenna receivers) and SIC receivers.

In this paper, we limited the multiplexing gain analysis and the numerical evaluations to two metrics, namely sum multiplexing gain/sum-rate and MMF multiplexing gain/MMF rate, and to the MISO BC. Nevertheless, the results can be extended to other metrics such as the weighted sum-rate (WSR) and to other scenarios. Readers are invited to check [51] that confirms the inefficiency of NOMA and the superiority of RS from a WSR perspective, and are encouraged to consult the growing literature on RS (and RSMA) demonstrating the superiority of RS over NOMA and MU-LP in numerous scenarios and applications, namely multi-user multi-antenna communications [51], [77], [83], multigroup multicast [39], [56], energy efficiency [78], [84], [85], multi-cell joint transmission [86], non-orthogonal unicast and multicast transmission [78], wireless information and power transfer [87], cooperative communication with user relaying [88], imperfect CSIT [89], [90], link-level simulations [91], C-RAN [92], secrecy rate [93], [94], aerial networks [85], [95], imperfect CSIT and CSIR [96], visible light communications [97], [98], network performance analysis [99], reconfigurable intelligent surface [100]. It would also be of interest for future work to understand how more recent MIMO NOMA schemes such as [67], [68] compare to RS [73], [75].

The emphasis of the paper was on downlink multi-user communications. Results suggest that future downlink multi-user multi-antenna communications would strongly benefits from RSMA. Indeed, RSMA achieves higher multiplexing gains and rates. It is capable of serving a larger number of users and is more robust to user deployments, network loads and inaccurate CSI. Moreover, RSMA has a lower receiver complexity than NOMA. RSMA is a gold mine of research problems for academia and industry with issues spanning numerous areas: RSMA to achieve the fundamental limits of wireless networks; RSMA for multi-user/multi-cell multi-antenna networks; RSMA-based robust interference management; RSMA in MU-MIMO, coordinated multi-point (CoMP), Massive MIMO, millimeter wave and higher frequency bands, relay, cognitive radio, caching, physical layer security, cooperative communications, cloud/fog-enabled platforms and Radio Access Networks (RAN) (such as cloud-RAN and fog-RAN), intelligent reflecting surfaces; RSMA to unify, generalize and outperform SDMA and NOMA; physical layer design of RSMA-based network; coding and modulation for RSMA; cross-layer design, optimization and performance analysis of RSMA; implementation and standardization of RSMA; RSMA in B5G services such as enhanced Mobile Broadband (eMBB), enhanced Ultra-Reliable Low Latency Communications (URLLC), enhanced Machine-Type Communications (MTC), massive MTC, massive Internet-of-Things (IoT), Vehicle-to-everything (V2X), cellular, Unmanned Aerial Vehicle (UAV) and satellite networks, wireless powered communications, integrated communications and sensing, etc.

RSMA can also be used in the uplink, as originally shown for single-antenna systems in [101]. The key benefit of RSMA in the uplink is its ability to achieve the capacity region of the MAC without the need for time sharing. Nevertheless, much is left to be done to identify the benefits of RSMA for general uplink multi-user multi-antenna communications. The performance benefits of RSMA vs. NOMA vs. OMA vs. other multiple access techniques in the uplink, beyond the existing NOMA vs. OMA comparison [102], is also much worth investigating. It should however be mentioned that thanks to the polymatroid structure of the Gaussian MAC capacity region, the solution to the max weighted sum rate problem is always at a vertex of the original region, i.e., RS is not needed.

Standardization is very important for a widespread adoption of multiple access techniques. MU-LP has been heavily discussed and standardized in 4G and 5G as part of MU-MIMO and Massive MIMO. NOMA was also investigated as part of a study item in 5G but was not considered any further in 5G because its gains compared to MU-MIMO were not found convincing [103]. Hence, in 5G New Radio (NR), NOMA was seen as a competing technology to MU-MIMO and an unwanted add-on technology. The standardization of RSMA has not been considered by 3GPP yet but is receiving a growing interest from academia and industry.36 Parts of the features required by RSMA are already being studied, discussed and developed. Some current work items and features in 5G, i.e., MU-MIMO/Massive MIMO/CoMP, multiuser superposition transmission (MUST), network-assisted interference cancellation and suppression (NAICS), multicast functionality can be leveraged for RSMA. However, some more work is needed to realize the full potential of RSMA. On-going activities consist in investigating the potential benefits of RSMA for 6G [104] and demonstrating the significant benefits of RSMA over 5G NR design [105].

Appendix A

Proof of Proposition 4

Let us first consider $G>1$ and $M\geq K-g+1$ . Recalling from the proof of Proposition 3 that the sum multiplexing gain of $G\alpha $ can be split equally among the $G$ groups so that each group gets a (group) sum multiplexing gain of $\alpha $ , and following the MAC argument, the (group) sum multiplexing gain of $\alpha $ in each group can then be further split equally among the $g$ users, which leads to an upper bound on the MMF multiplexing gain of $\frac {\alpha }{g}$ . Achievability is obtained by designing precoders using ZFBF, and allocating power (consider group 1 for simplicity) to user $k=1,\ldots,g$ as $O\left({P^{1-\frac {g-k}{g}\alpha }}\right)$ , which causes the SINR for user-$k$ to scale as $O(P^{\alpha /g})$ and an achievable MMF multiplexing gain of $\frac {\alpha }{g}$ .

To illustrate the achievability in more detail, we consider a simple example for $K=4$ , $G=2$ , $g=2$ , and $M\geq 3$ . First, we design the precoders $\mathbf {p}_{1}$ and $\mathbf {p}_{2}$ in group 1 to be orthogonal to the channel estimates $\hat {\mathbf {h}}_{3}$ and $\hat {\mathbf {h}}_{4}$ of users 3 and 4. Similarly, $\mathbf {p}_{3}$ and $\mathbf {p}_{4}$ in group 2 are made orthogonal to $\hat {\mathbf {h}}_{1}$ and $\hat {\mathbf {h}}_{2}$ . Second, allocate power $O(P^{b})$ with $b=1-\alpha /2$ to users 1 and 3, and $O(P-P^{b})=O(P)$ to users 2 and 4. Using these precoders and power allocations, the received signals in group 1 can be written as \begin{align*} y_{1}=&\underbrace {\mathbf {h}_{1}^{H}\mathbf {p}_{1} s_{1}}_{P^{b}}+\underbrace {\mathbf {h}_{1}^{H}\mathbf {p}_{2} s_{2}}_{P}+\underbrace {\tilde {\mathbf {h}}_{1}^{H}\mathbf {p}_{3} s_{3}}_{P^{b-\alpha }}+\underbrace {\tilde {\mathbf {h}}_{1}^{H}\mathbf {p}_{4} s_{4}}_{P^{1-\alpha }}+\underbrace {n_{1}}_{P^{0}}, \tag{47}\\ y_{2}=&\underbrace {\mathbf {h}_{2}^{H}\mathbf {p}_{1} s_{1}}_{P^{b}}+\underbrace {\mathbf {h}_{2}^{H}\mathbf {p}_{2} s_{2}}_{P}+\underbrace {\tilde {\mathbf {h}}_{2}^{H}\mathbf {p}_{3} s_{3}}_{P^{b-\alpha }}+\underbrace {\tilde {\mathbf {h}}_{2}^{H}\mathbf {p}_{4} s_{4}}_{P^{1-\alpha }}+\underbrace {n_{2}}_{P^{0}}, \tag{48}\end{align*} View SourceRight-click on figure for MathML and additional features. where the quantities under the brackets refer to how the power level of each term scales. From (47) and (48), $s_{2}$ can be decoded at an SINR level scaling as $\frac {P}{P^{b}+P^{1-\alpha }+P^{b-\alpha }+P^{0}}=\frac {P}{P^{b}}=P^{\alpha /2}$ (since $b\geq 1-\alpha \geq b-\alpha $ and $b\geq 0$ ). Using SIC, $s_{2}$ is cancelled in (47), and $s_{1}$ can be decoded at an SINR level scaling as $\frac {P^{b}}{P^{1-\alpha }+P^{b-\alpha }+P^{0}}=P^{\alpha /2}$ . Similar expressions hold for group 2, and we note that all four streams have an SINR scaling as $P^{\alpha /2}$ , therefore achieving an MMF multiplexing gain of $\frac {\alpha }{2}$ .

Let us now consider $G>1$ and $M < K-g+1$ . Since the MMF multiplexing gain collapses to 0 in the perfect CSIT setting, the same holds for imperfect CSIT.

Let us now consider $G=1$ . The situation here is the same as in the perfect CSIT setting. There is no inter-group interference and the sum multiplexing gain of one in the single group can be split equally among the $K$ users, which leads to an upper bound on the MMF multiplexing gain of $\frac {1}{K}$ . Achievability is obtained by choosing the powers of users $k=1,\ldots,K$ as $O(P^{k/K})$ , which causes the SINR of user-$k$ to scale as $O(P^{1/K})$ and results in an achievable MMF multiplexing gain of $\frac {1}{K}$ .

Appendix B

WMMSE Optimization Framework

The WMMSE optimization framework to solve both problems (45) and (46) is specified as follows.

At user-$j, j\in \mathcal {K}_{i}$ , equalizer $g_{j,k}$ is employed to decode stream $s_{k}, k\in \{k{\mid } k\geq j,k\in \mathcal {K}_{i}\}$ . The estimate of $s_{k}$ at user-$j$ is obtained as $\widehat {s}_{j,k}=g_{j,k}y_{j,k}$ , where $y_{j,k}=\sum _{m\leq k, m\in \mathcal {K}_{i}}\mathbf {h}_{j}^{H}\mathbf {p}_{m}s_{m}+\sum _{l\neq i, l\in \mathcal {G}}\sum _{m\in \mathcal {K}_{l}}\mathbf {h}_{j}^{H}\mathbf {p}_{m}s_{m}+n_{j}$ is the signal received at user-$j$ after removing the streams decoded before $s_{k}$ . The corresponding Mean Square Error (MSE) is given by \begin{align*} \varepsilon _{j,k}=&\mathbb {E}\left \{{\left |{\widehat {s}_{j,k}-s_{k}}\right |^{2}}\right \} \\=&\left |{g_{j,k}}\right |^{2}T_{j,k}-2 \Re \left \{{g_{j,k}\mathbf {h}_{j}^{H}\mathbf {p}_{k}}\right \}+1,\tag{49}\end{align*} View SourceRight-click on figure for MathML and additional features. where $T_{j,k}=|\mathbf {h}_{j}^{H}\mathbf {p}_{k}|^{2}+I_{j,k}^{(in)}+I_{j,k}^{(ou)}$ is the power received at user-$j$ when decoding $s_{k}$ . Furthermore, $I_{j,k}^{(in)}$ and $I_{j,k}^{(ou)}$ are respectively the intra-group and inter-group interference power defined in (18).

By solving $\frac {\partial \varepsilon _{j,k}}{\partial g_{j,k}}=0$ , the optimal Minimum MSE (MMSE) equalizer is calculated as \begin{equation*} g_{j,k}^{\textrm {MMSE}}=\mathbf {p}_{k}^{H}\mathbf {h}_{j}\left ({{T}_{j,k}}\right)^{-1}.\tag{50}\end{equation*} View SourceRight-click on figure for MathML and additional features.

Substituting (50) back to (49), the corresponding MMSE is then obtained as \begin{equation*} \varepsilon _{j,k}^{\textrm {MMSE}}=\min _{g_{j,k}} \varepsilon _{j,k}=T_{j,k}^{-1}\left ({I_{j,k}^{(in)}+I_{j,k}^{(ou)}}\right).\tag{51}\end{equation*} View SourceRight-click on figure for MathML and additional features. With the introduced $\varepsilon _{j,k}^{\textrm {MMSE}}$ , the rate at user-$j$ to decode the message of user-$k$ in (17) is equivalently written as $R_{j,k}=-\log _{2}(\varepsilon _{j,k}^{\textrm {MMSE}})$ . Defining the Weighted MSE (WMSE) of $\varepsilon _{j,k}$ with a weight $u_{j,k}>0$ as \begin{equation*} \xi _{j,k}= u_{j,k}\varepsilon _{j,k}-\log _{2}\left ({u_{j,k}}\right),\tag{52}\end{equation*} View SourceRight-click on figure for MathML and additional features. and defining its Weighted MMSE (WMMSE) by minimizing $\xi _{j,k}$ over $u_{j,k}$ and $g_{j,k}$ as \begin{equation*} \xi _{j,k}^{\textrm {MMSE}}= \min _{u_{j,k},g_{j,k}}\xi _{j,k},\tag{53}\end{equation*} View SourceRight-click on figure for MathML and additional features. we then establish the rate-WMMSE relationship, which is given by \begin{equation*} \xi _{j,k}^{\textrm {MMSE}}=1-R_{j,k}.\tag{54}\end{equation*} View SourceRight-click on figure for MathML and additional features. The rate-WMMSE relation in (54) is obtained as follows. The optimum equalizer is calculated as $g_{j,k}^{*}=g_{j,k}^{\textrm {MMSE}}$ from $\frac {\partial \xi _{j,k}}{\partial g_{j,k}}=0$ . Substituting $g_{j,k}^{\textrm {MMSE}}$ back to (52) yields $\xi _{j,k}(g_{j,k}^{\textrm {MMSE}})=u_{j,k}\varepsilon _{j,k}^{\textrm {MMSE}}-\log _{2}(u_{j,k})$ . By solving $\frac {\partial \xi _{j,k}(g_{j,k}^{\textrm {MMSE}})}{\partial g_{j,k}}=0$ , we then obtain the optimal MMSE weight, which is given as \begin{equation*} u_{j,k}^{*}=u_{j,k}^{\textrm {MMSE}}=\left ({\varepsilon _{j,k}^{\textrm {MMSE}}}\right)^{-1}.\tag{55}\end{equation*} View SourceRight-click on figure for MathML and additional features. Substituting $u_{j,k}^{\textrm {MMSE}}$ back to $\xi _{j,k}(g_{j,k}^{\textrm {MMSE}})$ , we have $\min _{u_{j,k},g_{j,k}}\xi _{j,k}=1-R_{j,k}$ . Following (53), we obtain (54).

Motivated by the rate-WMMSE in (54), we find that the achievable rate of user-$k$ in (19) is equal to $R_{k}=1-\xi _{k}^{\textrm {MMSE}}$ , where $\xi _{k}^{\textrm {MMSE}}=\max _{j\leq k, j\in \mathcal {K}_{i}}\xi _{j,k}^{\textrm {MMSE}}$ . By defining the WMSE of user-$k$ as \begin{equation*} \xi _{k}=\max _{j\leq k, j\in \mathcal {K}_{i}}\xi _{j,k},\tag{56}\end{equation*} View SourceRight-click on figure for MathML and additional features. and the respective set of equalizers and weights as $\mathbf {g}=\{g_{j,k}{\mid } j\leq k, k,j\in \mathcal {K}_{i}, i\in \mathcal {G}\}$ , $\mathbf {u}=\{u_{j,k}{\mid } j\leq k, k,j\in \mathcal {K}_{i}, i\in \mathcal {G}\}$ , the sum-rate WMMSE problem is formulated as \begin{align*} \min _{\mathbf {P}, \mathbf {u}, \mathbf {g}}&\sum _{k\in \mathcal {K}}\xi _{k}\tag{57a}\\ \text {s.t.}&~\text {tr}\left ({\mathbf {P}\mathbf {P}^{H}}\right)\leq P.\tag{57b}\end{align*} View SourceRight-click on figure for MathML and additional features. Following the proof of [47], we find that the MMSE solutions of the equalizers $\mathbf {g}^{\textrm {MMSE}}=\{g_{j,k}^{\textrm {MMSE}}{\mid } j\leq k, k,j\in \mathcal {K}_{i}, i\in \mathcal {G}\}$ and weights $\mathbf {u}^{\textrm {MMSE}}=\{u_{j,k}^{\textrm {MMSE}}{\mid } j\leq k, k,j\in \mathcal {K}_{i}, i\in \mathcal {G}\}$ satisfy the KKT optimality conditions of (57). Substituting ($\mathbf {g}^{\textrm {MMSE}},\mathbf {u}^{\textrm {MMSE}}$ ) back to (57) with affine transformations applied to the objective function, (57) boils down to (45). In fact, for any point ($\mathbf {P}^{*}, \mathbf {u}^{*}, \mathbf {g}^{*}$ ) satisfying the KKT optimality conditions of (57), the solution $\mathbf {P}^{*}$ satisfies the KKT optimality conditions of (45). Hence, (57) yields a solution for (45).

Although the transformed problem (57) is still non-convex, it is block-wise convex with respect to $\mathbf {P}$ and ($\mathbf {g}, \mathbf {u}$ ). For a given $\mathbf {P}$ , the optimal solution of the weights and equalizers are $\mathbf {g}^{\textrm {MMSE}}(\mathbf {P}), \mathbf {u}^{\textrm {MMSE}}(\mathbf {P})$ . When ($\mathbf {g}, \mathbf {u}$ ) are fixed, problem (57) becomes convex and can be solved by interior-point methods. Motivated by the block-wise convexity, we use the Alternating Optimization (AO) algorithm as illustrated in Algorithm 1 to solve (57). In each iteration, the equalizers and weights are first updated by ($\mathbf {g}^{\textrm {MMSE}}(\mathbf {P}), \mathbf {u}^{\textrm {MMSE}}(\mathbf {P})$ ) for a given $\mathbf {P}$ . The updated equalizers and weights ($\mathbf {g}^{\textrm {MMSE}}(\mathbf {P}), \mathbf {u}^{\textrm {MMSE}}(\mathbf {P})$ ) are substituted back to (57). Precoder $\mathbf {P}$ is then updated by solving (57). $\mathbf {P}$ and ($\mathbf {g}, \mathbf {u}$ ) are updated in an alternating manner until the convergence of the sum-rate. Algorithm 1 is guaranteed to converge and it converges to the KKT solution of problem (45). Readers are referred to [47] for the proof.

Following the same procedure, we are able to obtain the transformed WMMSE problem for max-min rate maximization, which is given by \begin{align*} \min _{\mathbf {P},\mathbf {u}, \mathbf {g}}&\max _{k\in \mathcal {K}}\,\xi _{k}\tag{58a}\\ \text {s.t.}&~\text {tr}\left ({\mathbf {P}\mathbf {P}^{H}}\right)\leq P.\tag{58b}\end{align*} View SourceRight-click on figure for MathML and additional features. By substituting problem (57) in Algorithm 1 with problem (58), we obtain the corresponding AO Algorithm to achieve the KKT solution of the max-min rate problem (46).

References

References is not available for this document.