Introduction
Orthogonal frequency division multiplexing (OFDM), presented by Robert W. Chang of Bell Labs in 1966, operates as a FDM system [1]. In OFDM, the complete channel is subdivided into numerous narrow-band subchannels, which are concurrently transmitted to uphold high-data-rate transmission [2]. Moreover, OFDM effectively mitigates intersymbol interference (ISI) induced by the frequency selectivity of the wireless channel. Due to this capability, OFDM has emerged as the predominant multicarrier transmission technique in wireless communications and has been widely adopted as an essential component of IEEE 802.16 standards [3]. However, the orthogonality of subchannels is compromised due to the rapid fluctuations of the wireless channel throughout the OFDM block transmission for the frequency-selective fading channels where mobile terminals reach high vehicular speeds. Therefore, it is a challenging task to design an OFDM system that operates efficiently under high mobility conditions, as mobility support stands out as a crucial feature in next-generation broadband wireless communication systems [4], [5]. Due to its low spectral decay rate, traditional OFDM is inadequate for accommodating asynchronous access, customized subcarrier spacing, spectrum aggregation, and symbol period, all of which are anticipated as requirements for the fifth-generation (5G) wireless networks [6].
OFDM with index modulation (OFDM-IM) seems to be an attractive candidate for next-generation wireless communication to transmit information compared to traditional communication systems because of their appealing benefits in terms of high energy efficiency, superior bit error rate (BER) performance, and less hardware complexity [7], [8], [9]. It uses the corresponding communication systems’ building block indices to transmit extra information bits [10]. IM can map information bits by adjusting the ON/OFF state of the transmission entities. OFDM-IM schemes possess the capability to transmit information in a more energy-efficient way by selectively deactivating specific elements of the system compared to traditional OFDM systems [11].
Many studies related to OFDM-IM were researched in the past few years [3], [12], [13], [14], [15], [16], [17], [18], [19]. In [3], low complex near-optimal detectors for OFDM-IM were introduced to address the challenge posed by the very high computational complexity of maximum likelihood (ML) detection, primarily attributed to the presence of index bits. For the family of OFDM subcarrier IM systems, the authors provided a signaling technique supported by compressed sensing (CS) in [18]. After considering the joint ML detection, they suggested a low-complexity detection approach for the CS-assisted index-modulated (CSIM) and the conventional APM symbols. Despite its high complexity, this algorithm is called the iterative residual check (IRC)-based detector. In [13], the authors presented a greedy detector (GD) utilizing energy detection to estimate the performance of OFDM-IM with very low complexity. However, it has failed to achieve near-optimal performance. The log-likelihood ratio (LLR) detector was introduced in [14]. It can achieve near-ML performance, but it comes with the drawback that the received signal noise power spectral density must be known.
Recently, numerous studies have been conducted utilizing deep learning (DL) in the field of wireless communication [20], [21], [22], [23], [24], [25], [26]. DL-based detectors have been explored in various experiments to decrease complexity and attain near-optimal performance in OFDM-IM systems [27], [28], [29], [30], [31], [32]. In [27], the author proposed a deep learning-based detector called DeepIM for OFDM-IM. This DeepIM consists of a fully connected neural network (FNN). The authors used both Rectified Linear Units (ReLU) and Hyperbolic Tangents (Tanh) as activation functions to detect the information bits. The performance of DeepIM is near optimal for imperfect channel state information (CSI), and it outperforms the GD method. Although this model provides very low complexity and a short runtime, it has some performance gaps with ML for perfect CSI. To minimize the performance disparity, a bi-directional long short-term memory (Bi-LSTM)-based Y-BLSTM detector was proposed in [28]. The Y-BLSTM architecture employs two parallel sub-neural networks to independently learn the constellation and information regarding active indices, respectively. In [29], the authors introduced a detector based on convolutional neural networks (CNN), known as CNN-IM. The received symbols in the CNN-IM framework are converted to polar coordinates to help the neural network determine the indices of the subcarriers that have been activated. Both the CNN and Y-BLSTM models outperformed the GD and Deep-IM detectors. However, they are not able to attain a bit error rate (BER) performance that is nearly optimal. A transformer-based detector known as TransIM was proposed in [33]. TransIM functions with a mid-level modulation order of 16QAM produce soft probabilities for various transmitted symbols. Due to the structure of the transformer, the complexity of TransIM is high, but it demonstrates enhanced BER performance compared to DeepIM and CNN-IM. A deep-based detector named IMNet was proposed in [34]. The proposed IMNet model is applied to detect the transmitted signal in an OFDM-IM for multiple input, multiple output (MIMO) systems. The IMNet model contains two CNNs referred to as the antenna detection (AD) subnet and signal detection (SD) subnet, respectively. The AD subnet consists of four layers of CNN, and they choose a state-of-the-art denoising network commonly used in image processing fields as their SD subnet. The BER performance of IMNet is better than ML and LLR, but the model complexity is higher than other DL models. A dual mode (DM) OFDM-IM detector was proposed in [30] called DeepDM. Both CNN and deep neural network (DNN) models are utilized in this paper, where CNN is used as IndexNet to detect the index bit and DNN is used as CarrierNet to detect the carrier bit. The authors in [35], introduced a TSIMNet detector-based two-stage index-modulated- universal filtered multi-carrier (TSIM-UFMC) system aimed at improving performance and introducing the emerging UFMC technology to underwater acoustic (UWA) communications.
LSTM is a recurrent neural network (RNN) designed to process sequence data with long-term dependencies [36]. Many studies have been done by utilizing LSTM in wireless communication [37], [38], [39]. LSTM architectures add gating mechanisms, such as the forget gate, which enable them to regulate the information flow and gradients throughout the network. This aids in preventing gradients from diminishing significantly during the training process [40]. Also, LSTMs can effectively tackle the vanishing gradient problem generated by backpropagation [41]. Due to the sequential nature of subcarrier activations, OFDM-IM signals usually display temporal dependence. Modeling the temporal dynamics found in OFDM-IM signals is a useful use of LSTM networks because of their prowess at capturing and analyzing data flows. A lot of subcarriers and complicated channel circumstances can be present in OFDM-IM systems. Systems may be efficiently modeled thanks to LSTMs, which provide a balance between performance and complexity [42]. LSTMs are resistant to changes in channel conditions and signal characteristics because they may modify their internal states based on the input data and prior states. Operating in dynamic and noisy wireless environments, this adaptability is essential for OFDM-IM systems. So, LSTMs are well-suited for the particular difficulties presented by OFDM-IM communication systems because of their capacity to capture sequential dependencies, preserve long-term memory, manage complexity, adapt to changing conditions, and take advantage of parallel processing [43]. Motivated by the above-mentioned advantages of DL and literature, in this paper, we propose an LSTM-based detector for the OFDM-IM system. The proposed LSTM-IM detector can achieve better performance than existing manually designed detectors. The proposed model contains only one non-linear LSTM unit, which has a Tanh activation layer to detect the received signal under the Rayleigh fading channel efficiently. The key contributions of this paper can be outlined as follows:
An LSTM-based OFDM-IM detector is proposed in this paper, which can extract features very efficiently by capturing information from earlier time steps and retaining it for an extended period to process sequence data with long-term dependencies. Furthermore, the number of nodes in the hidden layer can be dynamically adjusted to detect the received signal with a suitable balance between complexity and performance.
Before being fed to the LSTM-IM, the received signal and channel data are pre-processed based on the OFDM-IM domain knowledge. This mechanism improves the detection accuracy of LSTM to identify the indices of the activated subcarriers.
We estimate the suggested LSTM-IM detector’s BER performance at various signal-to-noise ratios (SNR). The results confirm that the suggested LSTM-IM detector can achieve better detection performance under both perfect and imperfect channel conditions.
The remainder of the paper is structured as follows: section II presents the system modeling. Section III elaborates on the proposed model, including offline training and online testing procedures. Section IV demonstrates the simulation results, and Section VI presents the conclusions.
System Model
In OFDM-IM, the information bits are not only conveyed by the standard amplitude phase modulation (APM) symbol but also by activated subcarrier indices [44]. We consider that the total transmitted bandwidth is divided into the G groups. Each group contains N subcarriers. So, the total transmitted subcarrier is
The frequency domain received signal at the receiver is represented by \begin{equation*} \mathbf {y}=\mathbf {H}\odot \mathbf {x}+\mathbf {w}, \tag {1}\end{equation*}
LSTM-IM Based Detection Framework
First of all, we describe the structure of the LSTM-IM model in this section. Then we present the proposed model of offline training and online testing procedures with the generated data.
A. LSTM-IM Model Structure
A general structure of the proposed LSTM-IM model is shown in Fig. 2. Comparable to the existing detection methods utilized in OFDM-IM, it is presumed that the channel information will be known to the receiver. Therefore, channel H and the received signal y are regarded as the preliminary inputs to the LSTM-IM model. For the imperfect CSI conditions, we study an actual system that experiences problems due to the receiver’s imprecise CSI calculation. By using \begin{equation*} h\left ({{\alpha }}\right )=\hat {h}\left ({{\alpha }}\right )+e\left ({{\alpha }}\right ), \tag {2}\end{equation*}
Domain knowledge describes how to use IM to efficiently assign subcarriers based on the data to be conveyed and to activate or deactivate subcarriers to express additional information. Furthermore, domain knowledge denotes an awareness of the properties of the communication channel, including impacts like noise, interference, and multipath fading [46]. Before feeding as input to the LSTM-IM model, we pre-process y and H based on the OFDM-IM domain knowledge. Specifically, to obtain an equalized received signal vector, the widely recognized zero-forcing (ZF) equalizer is utilized in the first place such as
Our proposed model is constructed with an LSTM layer and a fully connected layer (FC) with a sigmoid activation. The pre-processed data D feeds as input to the LSTM-IM input layer. The received data is in a 3-dimensional (3D) shape. This 3D data is reshaped according to the LSTM layer input shape by the following function:\begin{equation*} \mathbf {D}= reshape (\mathbf {y}, [-1, N, 3]). \tag {3}\end{equation*}
Then this reshaped data is fed to the LSTM hidden layer. The internal function of the LSTM layer is shown in Fig. 3. It consists of input get, forget get, and output get [47]. Each gate calculates its value utilizing a fully connected layer assisted by a sigmoid activation function. The values of each gate fall within the range of (\begin{align*} \mathbf {i_{t}}& = {\sigma }_{s}({\mathbf {D}}_{t}{\mathbf {W}_{di}}+{\mathbf {h}}_{t-1}{\mathbf {W}_{hi}}+\mathbf {b}_{i}), \tag {4}\\ \mathbf {f_{t}}& = {\sigma }_{s}({\mathbf {D}}_{t}{\mathbf {W}_{df}}+{\mathbf {h}}_{t-1}{\mathbf {W}_{hf}}+\mathbf {b}_{f}), \tag {5}\\ \mathbf {o_{t}}& = {\sigma }_{s}({\mathbf {D}}_{t}{\mathbf {W}_{do}}+{\mathbf {h}}_{t-1}{\mathbf {W}_{ho}}+\mathbf {b}_{o}), \tag {6}\end{align*}
Now, the model performs the memory cell and hidden state operations by utilizing the input nodes at time step t with the following function:\begin{align*}\check {\mathbf {c_{t}}}& = tanh({\mathbf {D}}_{t}{\mathbf {W}_{dc}}+{\mathbf {h}}_{t-1}{\mathbf {W}_{hc}}+\mathbf {b}_{c}), \tag {7}\\ {\mathbf {c_{t}}}& = {\mathbf {f}}_{t}{\odot }{\mathbf {c}_{t-1}}+{\mathbf {i}}_{t}{\odot }\check {\mathbf {c_{t}}}, \tag {8}\\ {\mathbf {H_{t}}}& = {\mathbf {o}}_{t}{\odot }tanh({\mathbf {c}_{t}}), \tag {9}\end{align*}
The output of the LSTM hidden layer is fed as the input to the FC layer. In this layer, Sigmoid activation function \begin{equation*}\mathbf {b}_{g}= \mathbf {f}_{Sigmoid}({\mathbf {W}}_{s}\times {\mathbf {H}_{t}}+ \mathbf {b}_{s}), \tag {10}\end{equation*}
B. Offline Training Process
Before employing the proposed LSTM-IM detector, it is necessary to train the model in the offline phase using simulation data. Specifically, a corresponding set of transmitting vectors is produced to generate multiple sequences of p bits
The parameters used to train the proposed model are shown in Table 1. In all experimental setups under consideration, the proposed LSTM-IM model is trained with 50 epochs. Each epoch contains 20 batches with a 5000 batch size. Since there are a total of 5000000 data samples used to train the model for 100000 batches. We employ the adaptive moment estimation (Adam) optimizer. This optimizer is readily implementable on many commercial DL platforms, including Tensorflow and Keras. LSTM-IM training requires careful selection of the SNR level because the model’s performance is highly dependent on it. More especially, it is essential to select the optimal learning rate so that the model can perform in different ranges of SNR. We apply different training SNRs for different training sequences. We train our model for different learning rates and compare them to evaluate the best performance. As we discussed earlier, the total subcarrier is divided into several groups. Each group contains
The LSTM-IM model is trained with the collected data to minimize the disparity between the true bit and predicted bit, and the BER. In this paper, we apply the means square error function (MSE) to calculate the training loss as follows:\begin{equation*}\mathcal {L}({\mathbf {b}_{g}},{\hat {\mathbf {b}_{g}}};\theta )= \frac {1}{P} ||({\mathbf {b}_{g}}-{\hat {\mathbf {b}}_{g}})||^{2}, \tag {11}\end{equation*}
\begin{equation*} \theta ^{+} \mathrel {\mathrel {\mathop :}\hspace {-0.0672em}=}\theta -\eta \nabla \mathcal {L}\left ({{\mathbf {b}_{g},\hat {\mathbf {b}_{g}};\theta }}\right ), \tag {12}\end{equation*}
The training performances of the LSTM-IM model are shown in Fig. 5 and Fig. 6 using the MSE function. The structure and depth of the LSTM hidden layers are important factors in capturing temporal relationships, and they can affect the model’s capacity to reduce the MSE loss during training. We calculate the training loss for different LSTM-IM hidden layers with 10 dB training SNR and 1 active subcarrier. From Fig. 5, we can see that the estimated loss is decreased with increasing LSTM-IM hidden layers. In Fig. 6, we compare our model loss with CNN-IM and Deep-IM models with 3 active subcarriers. To reduce the discrepancy between the model’s predictions and the actual target values, as shown by the MSE loss function, the LSTM hidden layers are adjusted by backpropagation during the training process [42]. Optimizing the model’s performance in minimizing this loss involves iteratively updating the activations and weights of the hidden layers. According to Fig. 6, at epoch 45, the DeepIM model has a loss of around 0.06, while the CNN-IM model exhibits a loss of approximately 0.045. In contrast, our proposed model achieves a lower loss of nearly 0.035. The loss of the proposed model is comparatively low, and it is stable after 40 epochs.
The proposed LSTM-IM training loss for different numbers of Q with SNR= 10 dB and K = 1.
The comparison of training loss for the proposed LSTM-IM with CNN-IM, and Deep-IM models for SNR = 10 dB and K = 3.
C. Online Testing Process
After completion of offline training, employ the model for online OFDM-IM signal detection with optimized parameters
Simulation Results
The learning rate is an important hyperparameter that determines how much the model is adjusted based on the estimated error for each update of the model weights. Batch size is another important hyperparameter that has a direct impact on training efficiency. Fig. 7 (a) compares the performance of the LSTM-IM model for different batch sizes and Fig. 7 (b) compares the performance for different learning rates. The LSTM-IM model performs well at a 5000 batch size. With increasing or decreasing the batch size, performance can be decreased. In the case of Fig. 7 (b), we estimate the model performance for 0.001, 0.01, and 0.02 learning rates. The LSTM-IM model demonstrates good BER performance with a learning rate of 0.02 at lower SNR levels. However, its performance significantly is deteriorated at higher SNR levels. Specifically, the model exhibits nearly 0.9 dB worse performance with a learning rate of 0.001 compared to a learning rate of 0.01 at an SNR of 25 dB. Based on the given information, it appears that the LSTM-IM model performs well when it is trained with a learning rate of 0.01.
Comparetive result of the LSTM-IM model for different batch size in (a) and different learning rates in (b) with Q =128 and K =1.
We estimate the BER for different values of K and different training SNRs, as shown in Fig. 8 (a) and Fig. 8(b) respectively. From Fig. 8 (a), we can see that, the LSTM-IM detectors perform very well when trained for K =1. In this study, K =1 means that only one subcarrier is activated out of 4 subcarriers. When a small number of subcarriers is activated, transmission is completed at lower data rates. Similarly, when the number of active subcarriers is increased, data transmission rates also increase. Higher data rates correspond to larger p or, equivalently, a larger number of classes involved in LSTM-IM. Consequently, the BER performance decreases with an increase in the value of K and the data rate, as observed in the figure. From Fig. 8(a), it is evident that the BER performance for K =2 and K =3 is very poor compared to the performance with K =1. Thus, it is clear from the results that the proposed model performs better with K =1, i.e., at lower data transmission rates. In the case of Fig 8 (b), the model is trained with K =1. From Fig. 8(b), it is evident that the model demonstrates good performance when trained with a 10 dB SNR. However, increasing or decreasing the value of the training SNR can lead to a decrease in performance. Specifically, in the case of a 5 dB training SNR, it performs better at lower SNR levels but exhibits very poor performance at higher SNR levels. For a training SNR of 20 dB, the LSTM-IM model exhibits approximately 0.8 dB worse performance compared to a training SNR of 10 dB at a point 20 dB SNR. Similarly, for a training SNR of 25 dB, the model demonstrates nearly 2 dB worse performance than a training SNR of 10 dB. So, it is clear that the LSTM-IM model shows better performance at 10 dB training SNR and K =1, i.e., for the
Comparetive BER of the LSTM-IM detector for different K in (a) and different training SNR in (b).
We investigate the proposed model’s BER performance with respect to SNR for different numbers of hidden units Q, as shown in Fig. 9. In this case, we train the model with 10 dB SNR and a
Simulation result of the proposed model for different numbers of Q with SNR = 10 dB and K = 1.
In Fig. 10, we compare the performance of the LSTM-IM model with DeepIM (Relu and Tanh) [27], Y-BLSTM [28], GD, and ML models under perfect CSI conditions with the same parameter settings. This comparison is specifically designed for the
BER comparison of the proposed detector with reference detectors under the perfect CSI condition at K = 1.
Figure 11 compares the performance among LSTM-IM, CNN-IM [29], DeepIM [27], and Y-BLSTM [28] detectors for perfect CSI. In this comparison, the LSTM-IM model is trained with
Comparison of BER performance of the LSTM-IM model with reference models under the perfect CSI condition at K = 3.
We compare the BER of LSTM-IM with other competing schemes in Fig. 12 under uncertain CSI. Specifically, we use the variable imperfect CSI model based on MMSE as described in [45]. In this model, the CSI error variance
Evaluation of BER performance of the LSTM-IM detector with other detectors for the imperfect channel at K = 1.
Computational Complexity
We calculate the computational complexity of the proposed model and compare it with other models, as shown in Table. 2. Addition, multiplication, and other real number operations are included in the total number of real floating-point operations (flops) that are counted [48]. The first layer of our proposed model is the LSTM layer. The LSTM layer takes
Conclusion
This paper outlines our effort to develop an LSTM-based detection scheme to detect the OFDM-IM symbols in wireless communication systems. In order to effectively extract features from the OFDM-IM symbols, the LSTM layer is employed. In this paper, pre-processing is done to improve the detection accuracy based on the structure of the OFDM-IM symbol. The LSTM-IM detector can recover data bits very efficiently by utilizing the pre-processed received signal and channel vector based on domain knowledge. The simulation results validate that the proposed detector outperforms the CNN-IM, Y-BLSTM, DeepIM, GD, and ML detectors under both perfect and imperfect CSI conditions. We believe that the proposed framework will play an important role in future wireless communication systems.