
Bidirectional Deep Learning Decoder for Polar Codes in Flat Fading Channels




Abstract:

One of the main challenges facing future wireless communications is ultra-reliable and low-latency communication. Polar codes are well-suited for such applications, and recent advancements in deep learning have shown promising results in enhancing polar code decoding performance. We propose a robust decoder based on a bidirectional long short-term memory (Bi-LSTM) network, which processes sequences in both forward and backward directions simultaneously. This approach leverages the strengths of bidirectional recurrent neural networks to improve the decoding of polar-coded short packets. Our study focuses on packet transmission over frequency-flat quasi-static Rayleigh fading channels, using a simple codebook originally designed for additive white Gaussian noise channels. We evaluate the packet error rate for various signal-to-noise ratio levels using different modulation schemes. The simulation results demonstrate that the proposed Bi-LSTM-based decoder closely approaches the theoretical outage performance and achieves significant coding gains in fading channels. Furthermore, the proposed decoder outperforms convolutional neural network and deep neural network-based decoders, validating its superiority in decoding polar codes for short packet transmission in challenging wireless environments.
Published in: IEEE Access ( Volume: 12)
Page(s): 149580 - 149592
Date of Publication: 09 October 2024
Electronic ISSN: 2169-3536

SECTION I.

Introduction

Recently, designing capacity-achieving codes that provide high performance over fading channels has been a central focus and challenge of digital communication. To achieve channel capacity for discrete memoryless channels with symmetric binary input, Erdal Arikan presented polar codes, the first error-correcting codes proven to do so [1]. Polar codes are highly valued for their remarkable performance and adaptability in a range of communication contexts, making them essential in the field of error-correction coding. Another important requirement of future wireless communications is ultra-reliable and low-latency communication (URLLC) [2]. Latency can be minimized by using short packets; however, this leads to a significant reduction in channel coding gain. On the other hand, ensuring reliability typically demands more resources, such as employing robust channel codes with greater redundancy or incorporating retransmission techniques, which in turn increase latency. Low-rate channel codes are commonly used for transmitting data in URLLC scenarios. Low-density parity-check (LDPC) codes, turbo codes, and polar codes are often evaluated and considered for these applications. LDPC and polar codes demonstrate comparable performance at large packet sizes, whereas turbo codes outperform both [3]. However, polar codes demonstrate better performance than LDPC and turbo codes at small packet sizes [4]. Moreover, polar codes outperform LDPC and turbo codes mainly due to their simpler encoding and decoding processes [5]. This adaptability makes them suitable for a wide range of applications requiring both high throughput and low latency, including fifth-generation (5G) wireless networks, optical communication, satellite communication, and storage systems [6]. Polar codes have also been incorporated into a number of communication protocols, such as 5G New Radio, demonstrating their significance and broad adoption in the telecom sector [7]. Their alignment with contemporary communication standards and feasibility for hardware implementation further highlight their practical utility. In summary, polar codes are important because they can deliver nearly optimal error-correction performance with minimal complexity, making them an attractive choice for a wide range of communication applications in both present-day and future wireless systems.

Polar codes were originally designed to achieve capacity over binary-input symmetric channels, which makes them attractive for coding over a variety of channels. As mentioned in [8], one capacity-achieving polar coding method is intended for additive white Gaussian noise (AWGN) channels. In [9], the author designed polar coding for the memoryless AWGN channel. A nonlinear large-kernel polar coding technique over the AWGN channel is applied in [10]. Polar codes rely on reliable channel state information (CSI) to adapt their encoding and decoding strategies effectively. These robust and efficient codes encounter notable challenges when used over fading channels, especially when the channel response varies substantially across the transmitted signal’s bandwidth; in flat fading, by contrast, all frequencies experience comparable fading characteristics [11]. Fading can degrade the error-correction capabilities of polar codes since they assume a more uniform noise distribution. Moreover, the performance of polar codes is significantly affected when deployed over fading channels due to the loss of code-rate efficiency and the complexity of adaptive coding [12]. Consequently, the recommended approaches for constructing polar codes often lead to complicated designs. For example, the author in [13] suggested constructing the code using the estimated error probabilities of subchannels, which are created by polarizing a fading channel and then transforming it into another fading channel with multiplicative and additive Gaussian noise. For short-packet transmission systems, such approximations may not be accurate, which can result in poor packet error rate performance. The Monte Carlo approach recommended in [1] and [14] was used to determine the information set during code construction under the Rayleigh fading channel. Designing polar codes for a fading channel is therefore a challenging task, and addressing these challenges often requires a careful balance of system design and adaptive algorithms.

Decoding plays an important role in realizing the strong error-correction capability and efficiency of polar codes. Polar codes use sophisticated decoding techniques to extract the original data from noisy channels. Decoding involves complex calculations to accurately estimate the transmitted bits, which is essential for ensuring reliable communication. Important traditional algorithms such as belief propagation (BP), successive cancellation (SC), successive cancellation list (SCL), SC-Flip (SCF), and SC-stack (SCS) are significant approaches for polar decoding. SC decoding has low complexity, but when dealing with long polar codes, it suffers from poor throughput and excessive latency [15]. Various studies have aimed to decrease the decoding latency of SC-based algorithms with minimal complexity overhead. Exploiting the recursive construction of polar codes, the methods outlined in [16] and [17] identified certain subcodes within their structure and proposed fast decoders for these subcodes that can be applied to SC decoding. To further minimize SC-based decoding delay, the author presented a generalized method for fast polar code decoding in [18]. By combining Fano sequential decoding with SC decoding, the author of [19] offered an alternative enhancement to SC decoding. SCL decoding offers better error-correction capability compared to SC decoding, especially in channels with high noise levels. The list decoding approach maintains a list of candidate codewords, which helps in recovering from errors more effectively [20], [21], [22], [23]. In [22], a simplified SCL decoder was introduced by eliminating unnecessary computations, and this simplified approach was shown to be equivalent in performance to conventional SCL algorithms. An adaptive SCL decoder was proposed in [20], and the author demonstrated that it can reduce complexity. The BP algorithm provides significant advantages over SC-based decoding: it supports efficient parallelization, allowing high-throughput and low-latency implementations, and it inherently enables soft-in/soft-out decoding, which aids joint iterative detection and decoding. Many studies have applied BP decoding to polar codes [24], [25]. However, enhancements are required to attain the intended performance with fewer iterations. Deep learning (DL) approaches have been used to overcome the remaining issues of polar channels and decoders, since decoding can be framed as a classification problem.

DL holds significant importance across various fields due to its capability to handle complex tasks that are traditionally challenging. DL algorithms can automatically learn intricate hierarchical representations of data, enabling them to extract meaningful features from raw inputs without explicit human intervention. Moreover, DL models often achieve state-of-the-art performance in many tasks, and they can handle large-scale datasets and learn from massive amounts of data to continuously improve accuracy [26], [27], [28], [29], [30]. DL has been widely employed in wireless communication to leverage its advantages and to enhance various aspects of the technology [31], [32], [33], [34], [35], [36], [37]. Notably, some recent works have focused on unsupervised DL models [38], exploring their potential in this domain.

A decoder for polar codes based on DL utilizes neural networks (NNs) to improve decoding efficiency and performance. These decoders leverage DL techniques to enhance error-correction capabilities and adapt to different channel conditions, potentially leading to faster and more accurate decoding compared to conventional methods [39]. Recently, DL methods have been applied in numerous studies for polar decoding [7], [40], [41], [42], [43], [44]. Convolutional neural networks (CNNs) are exceptionally powerful in computer vision and image processing due to their ability to autonomously acquire hierarchical features and to manage extensive datasets effectively through parameter sharing and sparse connectivity [45], [46]. On the other hand, deep neural networks (DNNs) are versatile for a variety of tasks because of their multilayer learning capabilities, global information processing, and adaptability to a wide range of input structures. This makes them useful in fields where obtaining high-performance results depends on comprehending a larger context and capturing abstract representations [47], [48]. Another type of NN is the recurrent neural network (RNN), which uses the output from the preceding step as the input for the current step. The hidden state of an RNN, which retains some information about a sequence, is its primary and most significant feature and is also known as the memory state [49], [50]. Long short-term memory (LSTM) is an example of an RNN. LSTMs are adept at managing long-term relationships in sequential data through their gated design, enabling them to choose whether to retain or discard information as needed. They effectively address the vanishing-gradient issue encountered in traditional DNNs and RNNs and have proven highly effective in diverse sequence-modeling applications, showcasing superior performance [51], [52]. Bidirectional LSTMs (Bi-LSTMs) are a key advancement over LSTM models, capable of capturing contextual information from both past and future parts of a sequence. This bidirectional processing enhances their ability to understand input sequences comprehensively, improving prediction and classification accuracy. By modeling intricate relationships across entire sequences, Bi-LSTMs excel in tasks that require comprehension of complex temporal data [53], [54], [55].

A deep NN decoder and a multiple-scaled BP method were proposed in [7] to improve performance while reducing latency and complexity. The authors of [5] described a DL technique that improves the performance of the polar BP decoder using a one-bit quantizer, achieving higher performance and faster training convergence. The suggested DL-based decoder with a one-bit quantizer outperforms traditional BP NN decoders operating over an AWGN channel in terms of learning efficiency and error performance in a zero-delay system model. In [56], the authors introduced modified log-likelihood ratios (LLRs) for the free-space optical (FSO) channel, illustrating them as an instance of the neural successive cancellation (NSC) decoder. This NSC decoder demonstrated robustness across a broad spectrum of turbulence conditions, having been trained specifically for high and medium turbulence environments. In [57], the authors devised an RNN-based polar BP decoder that utilizes weight quantization via a codebook. This approach aims to overcome the additional memory requirements and computational challenges associated with DNN-based BP methods. In [58], the author formulated SCL decoding as a maze-traversing game, which is solved using deep reinforcement learning (DRL). While the game-based method demonstrates lower complexity, its frame error rate (FER) performance is comparable to that of state-of-the-art SCL decoding processes. A DNN-aided SCL decoder with shifted pruning was designed in [59], eliminating the need for costly transcendental function calculations. A DNN-aided adaptive dynamic SCL flip (D-SCLF) decoder was proposed in [60], where the author introduced an approximation scheme to reduce computational complexity; in that work, the bit error rate (BER) performance is improved by up to 0.35 dB and the average complexity is reduced by up to 57.65%. A new artificial intelligence-based framework utilizing a multilayer perceptron (MLP) for adaptive polar coding under the SCL decoder was proposed in [61]. In [62], the author introduced a DL-aided SCF decoding technique that uses LSTM networks to precisely identify the erroneous bits under the binary symmetric channel (BSC). Accordingly, a two-phase training procedure that integrates supervised and reinforcement learning was suggested for the LSTM network, and the block error rate (BLER) was evaluated for different signal-to-noise ratios (SNRs).

Polar codes, known for their capacity-achieving properties in communication systems, benefit from Bi-LSTM models, which analyze forward and backward bit dependencies. This improves the decoding process, allowing for more accurate error correction compared to traditional methods. By leveraging LLRs, Bi-LSTMs make more informed decisions, enhancing noise resilience and accuracy. This approach makes Bi-LSTM-based decoders particularly effective for real-time communication systems requiring URLLC, ensuring efficient handling of complex data. The structure of a Bi-LSTM model with input, forward, backward, activation, and output layers is shown in Fig. 1. In this paper, we propose a robust polar decoding technique over flat fading channels based on the Bi-LSTM model, leveraging its advantages to overcome the challenges posed by fading channels and the decoding issues mentioned earlier. In this study, we evaluate the packet error rate (PER) across average SNR for CNN and Bi-LSTM models with similar configurations and compare the results with other learning models. The key contributions of this paper are summarized below:

  • We study packet transmission over frequency-flat quasi-static Rayleigh fading channels, which allow efficient resource allocation and simplify equalization, scheduling, and transmission planning techniques. Consequently, the channel coefficient changes from packet to packet but remains constant within each packet.

  • In this paper, we propose a Bi-LSTM network for decoding polar codes, taking advantage of its bidirectional processing to manage extended dependencies and fluctuating channel conditions, thus improving the accuracy and decoding performance of the proposed systems.

  • We calculate the PER for the proposed system across various SNR levels. Simulation results indicate that the proposed model achieves coding gain in fading channels. The evaluation includes different modulation orders and optimizers, where the Adam optimizer shows superior performance compared to the others. Additionally, the results confirm that the proposed Bi-LSTM model outperforms both CNN and DNN models.

FIGURE 1. The Bi-LSTM model structure with forward and backward layers.

Table 1 summarizes the novelty and contributions of this paper relative to recent work. The rest of the paper follows this structure: Section II introduces the system model. Section III explains the polar code generation methodology. Section IV provides a detailed explanation of the proposed model, including descriptions of the offline training and online testing procedures. Section V presents the simulation results, and Section VI concludes with the findings.

TABLE 1. Summary of Some Recent Works on DL-Based Polar Coding

SECTION II.

System Model

A simple block diagram of the polar code transmission process is shown in Fig. 2. Using the polar encoder, information bits are converted into codewords. To transmit the encoded bits over the physical communication channel, the codeword is first mapped onto modulation symbols. This mapping ensures compatibility with the channel’s characteristics and facilitates efficient transmission of the encoded information. The modulated symbols are then transmitted through the channel, where they may encounter noise, interference, and other impairments that can introduce errors. At the receiver, the incoming signals are processed to retrieve the original information bits. Decoding techniques are applied to estimate the transmitted codewords and deduce the information bits. A single-carrier system is considered in this experiment, where the channel encoder encodes the $j$ information bits $u_{i}$, $i \in \{1,2,\ldots,j\}$, to generate a codeword of length $N$. The codeword $d_{k}$, $k \in \{1,2,\ldots,N\}$, is then mapped to binary phase shift keying (BPSK) symbols $\mathbf{x} \in \Re^{N}$. The BPSK symbols are transmitted over a frequency-flat quasi-static Rayleigh fading channel, whose complex channel coefficient $H$ remains constant throughout the transmission of each packet but varies from one packet to another. This coefficient is complex-valued, encompassing both amplitude and phase components, and changes slowly packet by packet over time due to the quasi-static nature of the fading channel. Perfect channel state information (CSI) at the receiver is assumed in this experiment. The capacity of a flat fading channel with complete channel knowledge at the receiver can be expressed as follows:\begin{equation*} C = \log_{2}\left({ 1 + \frac{P_{t}}{\sigma^{2}} |H|^{2} }\right), \tag{1}\end{equation*}where $P_{t}$ denotes the average input power at the transmit antenna and $\sigma^{2}$ the channel noise power. The received signal $\mathbf{y} \in \mathbb{C}^{N}$ at the receiving terminal is given by\begin{equation*} \mathbf{y} = H \cdot \mathbf{x} + \mathbf{s}, \tag{2}\end{equation*}where $\mathbf{s}$ is a zero-mean complex Gaussian noise vector with variance $\sigma^{2}/2$ per dimension. At the receiver, signal detection is performed by de-rotating the received signal as follows:\begin{equation*} \hat{\mathbf{y}} = \frac{H^{*}}{|H|^{2}} \cdot \mathbf{y}, \tag{3}\end{equation*}where $H^{*}$ is the complex conjugate of $H$ and $(\cdot)^{*}$ denotes the complex conjugation operator.
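For concreteness, the quasi-static fading channel in (2) and the de-rotation in (3) can be simulated as below. This is a minimal NumPy sketch under the stated assumptions (BPSK mapping, one complex coefficient per packet, unit symbol energy, perfect CSI); the function name and SNR parameterization are illustrative and not the authors' simulation code.

```python
import numpy as np

rng = np.random.default_rng(0)

def transmit_packet(codeword, snr_db):
    """Send one BPSK-modulated packet over a frequency-flat quasi-static
    Rayleigh fading channel and de-rotate at the receiver (Eqs. (2)-(3))."""
    x = 1.0 - 2.0 * codeword                         # BPSK mapping: 0 -> +1, 1 -> -1
    # One complex coefficient per packet (quasi-static, unit average power)
    H = (rng.standard_normal() + 1j * rng.standard_normal()) / np.sqrt(2)
    sigma2 = 10.0 ** (-snr_db / 10.0)                # noise power for unit symbol energy
    s = np.sqrt(sigma2 / 2.0) * (rng.standard_normal(x.shape)
                                 + 1j * rng.standard_normal(x.shape))
    y = H * x + s                                    # Eq. (2)
    y_hat = np.conj(H) / np.abs(H) ** 2 * y          # de-rotation, Eq. (3)
    return y_hat, sigma2
```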

FIGURE 2. Block diagram for the polar code transmission system.

LLRs are utilized in polar codes to incorporate soft information into the decoding process, thereby improving error-correction performance, especially in challenging channel conditions where hard decisions alone may be insufficient. They quantify the reliability of each bit position based on the likelihood of the received signal under both possible bit values, calculated using a combination of a priori bit probabilities and received channel values [64]. Under a Gaussian assumption on the received signal, the LLR value of each element is determined by the following equation:\begin{equation*} \mathrm{LLR}(\hat{\mathbf{y}}) = \ln\frac{P(y = 0 \mid \hat{y})}{P(y = 1 \mid \hat{y})} = \frac{2}{\sigma^{2}}\,\mathrm{Re}(\hat{y}), \tag{4}\end{equation*}where $P(y = 0 \mid \hat{y})$ and $P(y = 1 \mid \hat{y})$ are the probabilities that the transmitted bit is 0 or 1, respectively, given $\hat{y}$, and $\mathrm{Re}(\cdot)$ returns the real part of a complex number. The LLR values are fed as input to the decoder and are finally decoded by the Bi-LSTM network to obtain $\hat{\mathbf{u}}$.
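The LLR computation in (4) then reduces to scaling the real part of the de-rotated signal; a minimal sketch, assuming the de-rotated packet and noise power returned by the channel sketch above:

```python
import numpy as np

def llr(y_hat, sigma2):
    """Per-bit LLRs of Eq. (4) under the Gaussian assumption."""
    return (2.0 / sigma2) * np.real(y_hat)

# Example: feed the de-rotated packet from the channel sketch above.
# llr_values = llr(y_hat, sigma2)   # shape (N,), input to the Bi-LSTM decoder
```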

SECTION III.

Polar Codes

In this experiment, rather than constructing codes for fading channels, we use a straightforward code optimized for the AWGN channel [65] and extract the Bhattacharyya parameters [66] of its synthetic channels. To derive the codeword $\mathbf{d}$, the input vector $\mathbf{u}$ is multiplied by the $n$-fold Kronecker product of the matrix $\mathbf{F}$, i.e., by $\mathbf{F}^{\otimes n}$. Therefore, the generator matrix $\mathbf{G}_{N}$ that generates a polar code of packet size $N = 2^{n}$ is given by [65]\begin{equation*} \mathbf{G}_{N} = \mathbf{F}^{\otimes n}, \tag{5}\end{equation*}

where the basic encoding matrix is $\mathbf{F} = \begin{bmatrix} 1 & 0 \\ 1 & 1 \end{bmatrix}$. The vector $\mathbf{u}$ comprises $j$ information bits, and the remaining $N-j$ positions are frozen. The information bits are positioned according to the set $\mathcal{F}$, which is determined using the channel’s Bhattacharyya parameter as shown in [1].
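As an illustration of (5), the encoding step can be sketched in a few lines of NumPy. The information positions used below are placeholders, since the paper selects them from the Bhattacharyya parameters of the AWGN synthetic channels:

```python
import numpy as np

def polar_generator(n):
    """Generator matrix G_N = F Kronecker-powered n times, for N = 2^n (Eq. (5))."""
    F = np.array([[1, 0], [1, 1]], dtype=np.uint8)
    G = F
    for _ in range(n - 1):
        G = np.kron(G, F)
    return G

def polar_encode(info_bits, info_positions, n):
    """Place the j information bits at the chosen positions, freeze the
    remaining N - j positions to zero, and encode over GF(2)."""
    u = np.zeros(2 ** n, dtype=np.uint8)
    u[info_positions] = info_bits
    return (u @ polar_generator(n)) % 2

# Example with N = 8, j = 4; the information positions are illustrative,
# not the Bhattacharyya-based selection used in the paper.
d = polar_encode(np.array([1, 0, 1, 1], dtype=np.uint8), [3, 5, 6, 7], n=3)
```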

SECTION IV.

Proposed Bi-Directional Learning Framework

The structure of the proposed DL framework is depicted in Fig. 3. Our proposed DL model comprises three Lambda layers without trainable parameters, three Bi-LSTM layers, and an output layer. The first Lambda layer serves as the mapping layer, with both input and output features having dimension N. The second Lambda layer acts as the fading layer, receiving N-dimensional input from the mapping layer and providing an output of the same dimension. The last Lambda layer functions as the LLR layer, whose operation is given in (4). The final functional unit is the decoder, which comprises three Bi-LSTM layers followed by one dense layer with a sigmoid activation function. In this experiment, the dense layer serves as the output layer. The structure of the proposed Bi-LSTM based decoder is shown in Fig. 4.

FIGURE 3. Proposed deep learning framework for polar decoding.

FIGURE 4. The structure of the Bi-LSTM based decoder.

The output of the LLR layer, with shape $(N,)$, serves as the input to the proposed decoder. This input is first reshaped by a reshape layer to match the input requirements of the Bi-LSTM layer, using the following function:\begin{equation*} \hat{\mathbf{x}} = \mathrm{reshape}((1, N))(\hat{\mathbf{y}}). \tag{6}\end{equation*}

The reshaped data is fed as input to the Bi-LSTM model, which includes two LSTM layers (forward and backward layers) in its hidden layer, as illustrated in Fig. 1. Each LSTM cell within these layers consists of three gates: the input gate, the output gate, and the forget gate. The internal cell structure of an LSTM layer is depicted in Fig. 5. Additionally, an LSTM has two primary states: the cell state $c_{t-1}$ and the hidden state $h_{t-1}$ . The cell state $c_{t-1}$ acts as a memory unit where information from previous inputs is retained. The hidden state $h_{t-1}$ is utilized in computing the output. Here, t denotes the current time step, and $\hat {x_{t}}$ represents the current input. During each time step, the Bi-LSTM cell can modify the contents of the cell state based on the activity of its gates. Specifically, the forget gate regulates the amount of information in the cell state that should be discarded, while the input gate determines how much of the updated information should be stored in the cell state. Algorithm 1 provides a concise overview of the internal operations of a Bi-LSTM model [67]. Table 2 displays the notation for each parameter utilized in Algorithm 1. In our proposed model, the first Bi-LSTM layer operates with 512 hidden units, the second layer with 256 hidden units, and the third layer with 128 hidden units, respectively.
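A minimal Keras sketch of the decoder described above, with three bidirectional LSTM layers of 512, 256, and 128 hidden units followed by a sigmoid dense output layer; details beyond those stated in the text (e.g., default activations) are assumptions:

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_bilstm_decoder(N, j):
    """Bi-LSTM decoder sketch: LLR vector of length N in, j soft bit estimates out."""
    llr_in = layers.Input(shape=(N,))
    x = layers.Reshape((1, N))(llr_in)                          # reshape of Eq. (6)
    x = layers.Bidirectional(layers.LSTM(512, return_sequences=True))(x)
    x = layers.Bidirectional(layers.LSTM(256, return_sequences=True))(x)
    x = layers.Bidirectional(layers.LSTM(128))(x)
    u_hat = layers.Dense(j, activation="sigmoid")(x)            # output layer, Eq. (7)
    return models.Model(llr_in, u_hat)
```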

TABLE 2. The Notation of Algorithm 1 Parameters

FIGURE 5. The internal cell structure of an LSTM.

SECTION Algorithm 1

Internal Operations of the Bi-LSTM Model

1: Start
2: The function of the forget gate: $Fg_{t} = \sigma_{\psi}(Q_{Fg}\hat{x}_{t} + T_{Fg}h_{t-1} + b_{Fg})$
3: The function of the input gate: $Ig_{t} = \sigma_{\psi}(Q_{Ig}\hat{x}_{t} + T_{Ig}h_{t-1} + b_{Ig})$
4: The function of the output gate [67]: $Og_{t} = \sigma_{\psi}(Q_{Og}\hat{x}_{t} + T_{Og}h_{t-1} + b_{Og})$
5: The function of the candidate gate: $cn_{t} = \sigma_{tanh}(Q_{cn}\hat{x}_{t} + T_{cn}h_{t-1} + b_{cn})$
6: Updating the previous cell state $c_{t-1}$ to the current cell state: $ud_{t} = (c_{t-1} \odot Fg_{t}) + (Ig_{t} \odot cn_{t})$
7: At any time step $t$, the function of the hidden state: $h_{t} = Og_{t} \odot \sigma_{tanh}(ud_{t})$
8: The operation of the Bi-LSTM layer’s output hidden state [67]: $H_{t} = \sigma_{f}(Q_{H\overrightarrow{T}}\overrightarrow{T}_{t} + Q_{H\overleftarrow{T}}\overleftarrow{T}_{t} + b_{z})$
9: Stop
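For readers who prefer code to pseudocode, steps 2 to 7 of Algorithm 1 can be written as a single NumPy time step; the dictionary-based weight layout below is purely illustrative, and in a Bi-LSTM this step runs once in each direction before the outputs are combined as in step 8:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, U, b):
    """One LSTM cell update following steps 2-7 of Algorithm 1.
    W, U, b are dicts of weights/biases keyed by gate: 'Fg', 'Ig', 'Og', 'cn'."""
    Fg = sigmoid(W["Fg"] @ x_t + U["Fg"] @ h_prev + b["Fg"])   # forget gate
    Ig = sigmoid(W["Ig"] @ x_t + U["Ig"] @ h_prev + b["Ig"])   # input gate
    Og = sigmoid(W["Og"] @ x_t + U["Og"] @ h_prev + b["Og"])   # output gate
    cn = np.tanh(W["cn"] @ x_t + U["cn"] @ h_prev + b["cn"])   # candidate state
    c_t = c_prev * Fg + Ig * cn                                # cell-state update
    h_t = Og * np.tanh(c_t)                                    # hidden state
    return h_t, c_t
```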

The output of the final Bi-LSTM layer is fed as input to the dense layer, which produces an output vector of shape $(j,)$. The sigmoid activation function $f_{Sigmoid}(x) = 1/(1+e^{-x})$ is utilized in this layer to map each element of the output vector into the interval (0, 1). The output vector $\hat{\mathbf{u}}$ of the output layer can be expressed as follows:\begin{equation*} \hat{\mathbf{u}} = \mathbf{f}_{Sigmoid}(\mathbf{Q}_{s}\times\mathbf{H}_{t} + \mathbf{b}_{s}), \tag{7}\end{equation*}where $\mathbf{Q}_{s}$ denotes the weight matrix and $\mathbf{b}_{s}$ the bias vector of the output layer.

We also calculate the PER value for the CNN model. This CNN model is designed with three one-dimensional convolutional (Conv1D) layers, each followed by a batch normalization layer. After these layers, we utilize a one-dimensional global max pooling layer (GlobalMaxPool1D). Finally, we add a dense layer with a sigmoid activation function as the output layer.
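For reference, the CNN baseline can be sketched as follows; the filter counts and kernel sizes are assumptions, since the text only specifies the layer types:

```python
from tensorflow.keras import layers, models

def build_cnn_decoder(N, j):
    """CNN baseline sketch: three Conv1D + BatchNorm blocks, global max pooling,
    and a sigmoid output layer, as described in the text."""
    llr_in = layers.Input(shape=(N, 1))
    x = llr_in
    for filters in (128, 64, 32):               # assumed filter counts
        x = layers.Conv1D(filters, kernel_size=3, padding="same",
                          activation="relu")(x)
        x = layers.BatchNormalization()(x)
    x = layers.GlobalMaxPool1D()(x)
    u_hat = layers.Dense(j, activation="sigmoid")(x)
    return models.Model(llr_in, u_hat)
```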

A. Training and Testing Process

Before deploying the Bi-LSTM decoder, the model must be trained offline using simulation data. In the proposed supervised learning approach, a training set comprising known input-output pairs is essential. The polar code encoder transforms the input information bits into codewords. After encoding, $2^{j}$ codewords, each of size N bits, are generated, corresponding to the $2^{j}$ possible packets of j bits. These encoded codewords are then modulated using a chosen modulation scheme and transmitted over a flat Rayleigh fading channel. The objective of the Bi-LSTM model is to decode the noisy, faded signal and recover the original information bits from the received signal. During training, the model’s input is the received signal (processed through LLR calculation), while the labels are the original information bits, which serve as the ground truth for the model to predict.

We train the proposed Bi-LSTM decoder in epochs, where each epoch represents a complete pass over the training data. The Bi-LSTM decoder is trained over multiple epochs across various experimental configurations. During an epoch, the network undergoes backpropagation to compute the gradients of the loss function with respect to its parameters. This iterative process allows the Bi-LSTM decoder to gradually learn how to map inputs to the desired outputs based on the training data.

For optimal weight and bias values, the loss function should be convex. This property ensures that gradient descent can effectively find the global minimum of the loss function, leading to improved convergence and performance during training. We use the Adam optimizer to compute the gradient-based updates of the loss function. This optimizer is widely supported on commercial DL platforms such as TensorFlow and Keras [68]. The Bi-LSTM model is trained on the collected data to reduce the difference between the true and predicted bits, as well as the PER. In this study, we utilize the mean squared error (MSE) function to compute the training loss as follows:\begin{equation*} \mathcal{L}(\hat{\mathbf{y}},\hat{\mathbf{u}};\theta) = \frac{1}{N}\,\|\hat{\mathbf{y}}-\hat{\mathbf{u}}\|^{2}, \tag{8}\end{equation*}where $\theta$ denotes the weights and biases of the model. The gradients are used to update the parameters $\theta$ in the direction that minimizes the loss, typically employing an optimization algorithm such as gradient descent on randomly selected batches from the data samples as follows:\begin{equation*} \theta^{+} := \theta - \eta\nabla\mathcal{L}\left({\hat{\mathbf{y}},\hat{\mathbf{u}};\theta}\right), \tag{9}\end{equation*}where $\eta$ represents the learning rate.
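A minimal sketch of this training configuration (Adam optimizer, MSE loss, and the learning rate and batch size adopted later in Section V), assuming the `build_bilstm_decoder` helper from the decoder sketch above and placeholder training pairs in place of the full encode-fade-LLR pipeline:

```python
import numpy as np
import tensorflow as tf

N, j = 16, 8
# Placeholder training pairs with the shapes used in the paper; in practice
# X_train holds LLR vectors from the encode -> fade -> LLR pipeline and
# U_train the corresponding information bits (labels).
X_train = np.random.randn(4096, N).astype("float32")
U_train = np.random.randint(0, 2, size=(4096, j)).astype("float32")

model = build_bilstm_decoder(N, j)   # decoder sketched in the previous section
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=5e-4),
              loss="mse")            # MSE loss of Eq. (8)
model.fit(X_train, U_train, batch_size=256, epochs=16,
          validation_split=0.2)      # the paper trains for far more epochs
```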

Table 3 outlines the specific parameters used for training and testing the proposed model. The model is trained with varying numbers of information bits $j$ and code lengths $N$. The training and validation losses of the proposed model are shown in Fig. 6, with $j = 8$, $N = 16$, and a training SNR of 0 dB. Fig. 6 (a) illustrates the loss over $2^{8}$ epochs, Fig. 6 (b) over $2^{12}$ epochs, and Fig. 6 (c) over $2^{16}$ epochs. From the figure, it is evident that the training and validation losses follow a similar trend. Additionally, the figure demonstrates that both training and validation losses decrease as the number of epochs increases, reaching a stable condition after approximately 10,000 epochs. The choice of training SNR significantly influences both the convergence behavior and block error rate (BLER) performance of the proposed model. In this study, the convergence results were obtained with a training SNR of 0 dB, under which the model demonstrated robust convergence characteristics. While our proposed model exhibits strong performance at positive SNR levels, its performance degrades notably when trained under negative SNR conditions. The learning rate is another crucial parameter for optimizing the performance of the DL model. In this experiment, we carefully select the learning rate to perform well across different ranges of SNR. Our training includes various SNR levels and learning rates to comprehensively evaluate model performance. When training deep learning models, the batch size is crucial for balancing efficiency and performance. We experiment with different batch sizes to find the setting that speeds up training while improving overall model accuracy. The CNN model is also trained and tested using the same parameters as the Bi-LSTM model.

TABLE 3. The Parameters for Simulating the Proposed System

FIGURE 6. Training and validation loss for the proposed model with $j = 8$ and $N = 16$ setup.

SECTION V.

Simulation Results

Reliability is characterized by the probability of successfully transmitting n packets within the specified user plane latency under specific channel conditions. In this study, we employ PER as a metric to evaluate and compare the effectiveness of various parameter adjustments in terms of reliability. The construction of polar codes follows the method described in [1], employing AWGN with a design-SNR set to 0 dB.
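Since a packet is counted as erroneous when any of its j bits is decoded incorrectly, the PER metric can be computed from the decoder's soft outputs as in the short sketch below (the 0.5 threshold corresponds to the hard decision on the sigmoid outputs):

```python
import numpy as np

def packet_error_rate(u_true, u_soft, threshold=0.5):
    """A packet is in error if any of its j bits is decoded incorrectly."""
    u_hat = (u_soft >= threshold).astype(int)
    return np.mean(np.any(u_hat != u_true, axis=1))
```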

In DL, selecting the appropriate batch size is essential to strike a balance between computing efficiency and model convergence. Larger batch sizes exploit parallelism to speed up training, but generalization may suffer as a result. Smaller batches produce more accurate gradient updates per example, which may improve generalization at the expense of longer training times. Determining the ideal batch size involves trade-offs that significantly impact both the model’s overall performance and its training speed [69]. In this experiment, we calculate the PER for different batch sizes and compare them, as shown in Fig. 7. This result is obtained for $2^{16}$ epochs with $N=8$ and $j=4$. From the evaluation, we observe that the proposed model performs well with smaller batch sizes. Specifically, at a batch size of 256, it shows better PER performance. However, as the batch size increases or decreases beyond this value, the performance of the Bi-LSTM model degrades. Batch sizes 128 and 512 exhibit almost identical performance, but at 1024 the performance is very poor. According to Fig. 7, the proposed Bi-LSTM model demonstrates approximately 0.8 dB better performance at a batch size of 256 compared to batch sizes 128 and 512. Therefore, all experimental setups in this paper use a batch size of 256.

FIGURE 7. The comparative PER performance of the proposed Bi-LSTM decoder for various batch sizes, with $j = 4$ and $N = 8$.

The learning rate is an important hyperparameter for the DL model since it determines the gradient-descent step size, which affects the convergence and performance of the model. A carefully selected learning rate strikes a balance between training speed and model accuracy, enabling quicker convergence to optimal solutions [69]. The PER performance for different learning rates is depicted in Fig. 8. We evaluated the PER for learning rates of 0.005, 0.001, 0.0005, and 0.0001. The model exhibits superior PER performance when trained with a learning rate of 0.0005. Deviating from this rate, either higher or lower, results in decreased performance. The evaluation indicates nearly 1 dB better performance at a learning rate of 0.0005 compared to 0.0001 and 0.001. Furthermore, the model demonstrates approximately 1.8 dB better performance compared to the 0.005 learning rate.

FIGURE 8. The comparative result of the proposed Bi-LSTM decoder for different learning rates, with $j = 4$ and $N = 8$.

Another critical component for signal processing tasks is the training SNR, which has a direct impact on the way models learn and generalize from data. Models trained at suitable SNR values can discriminate between signal and noise, which improves training convergence and performance on unseen data. The PER performance of the proposed Bi-LSTM decoder for various training SNRs is illustrated in Fig. 9. This analysis considers $2^{16}$ epochs, with $j = 4$ and $N = 8$. As the training SNR increases from 0 dB to 10 dB, the performance decreases. Similarly, performance also drops when the model is trained with SNRs lower than 0 dB. At -2 dB training SNR, the Bi-LSTM model shows about a 0.4 dB degradation compared to 0 dB, and more than a 2 dB degradation at 20 dB training SNR. Therefore, from Fig. 9, it is clear that the model performs well at 0 dB training SNR, and hence we designed all our experimental setups with 0 dB training SNR.

FIGURE 9. The comparison of PER performance for the proposed model across different training SNRs, with $j = 4$ and $N = 8$.

The DL optimizer plays a key role in effectively tuning model parameters during training, which affects generalization across different tasks and datasets as well as convergence speed and stability. Selecting the right optimizer is crucial for maximizing model performance and achieving the learning objectives [68]. Fig. 10 illustrates the PER results for different optimizer algorithms: Adam, RMSProp, Adamax, and SGD. Among them, the SGD optimizer exhibits poor performance, while Adam, Adamax, and RMSProp demonstrate good performance. However, the Adam optimizer performs better than all the others. Therefore, we have used the Adam optimizer to calculate all PER values in this paper.

FIGURE 10. The evaluation of the PER performance of the Bi-LSTM based decoder using various optimizers, with parameters $j = 4$ and $N = 8$ at $2^{16}$ epochs.

The PER performance of the proposed model is illustrated in Fig. 11 for different numbers of epochs. We began evaluating the model with $2^{4}$ epochs, where performance was notably poor. As we continuously increased the number of epochs, performance also improved. From the figure, it is evident that the proposed model exhibits very good performance at $2^{16}$ epochs. However, when we further increased the number of epochs to $2^{18}$, the model's performance decreased. Therefore, Fig. 11 demonstrates that the Bi-LSTM decoder performs best at $2^{16}$ epochs.

FIGURE 11. The PER performance comparison of the Bi-LSTM decoder across varying epochs, with $j = 4$ and $N = 8$.

In Fig. 12, we compare the PER performance of the proposed Bi-LSTM model with the CNN and DNN [63] models. The results for all these models are obtained using the same setup: $2^{16}$ epochs, a learning rate of 0.0005, a training SNR of 0 dB, and a batch size of 256. From the results, it is clear that the Bi-LSTM decoder outperforms both the DNN [63] and CNN models. Initially, the DNN model significantly outperforms the CNN model; however, beyond 10 dB SNR, the PER performance of CNN and DNN becomes almost the same. The comparative results demonstrate that the Bi-LSTM model outperforms the CNN and DNN models by approximately 1 dB and 0.8 dB, respectively, at 20 dB SNR.

FIGURE 12. The PER performance evaluation of the Bi-LSTM-based decoder compared to other decoders at $2^{16}$ epochs, with $j = 4$ and $N = 8$.

In Fig. 13, we conduct a comparative analysis of the Bi-LSTM and DNN [63] models across various numbers of epochs, maintaining $j = 8$ and $N = 16$. The results indicate that the Bi-LSTM model exhibits superior performance compared to the DNN model at $2^{8}$ and $2^{16}$ epochs. At $2^{12}$ epochs, while both models perform closely, the Bi-LSTM model shows a slight edge over the DNN model. Fig. 13 also demonstrates that the Bi-LSTM model achieves coding gain with $2^{8}$ epochs, whereas the DNN model fails to achieve coding gain at $2^{8}$ epochs. It should be noted that coding gain typically indicates the decoder's ability to effectively mitigate errors and enhance signal reliability [70].

FIGURE 13. The comparison of PER performance between the Bi-LSTM model and DNN model, along with theoretical results, for different numbers of epochs with $j = 8$ and $N = 16$.

Fig. 14 compares the PER performance of the Bi-LSTM, DNN [63], and CNN models with $j = 8$ and $N = 16$, alongside theoretical results [70]. The Bi-LSTM and CNN models are evaluated using a learning rate of 0.0005, $2^{16}$ epochs, a training SNR of 0 dB, and a batch size of 256, while the DNN model also uses $2^{16}$ epochs but a -2 dB training SNR. From Fig. 14, it is observed that the CNN model initially exhibits improving PER performance with SNR, demonstrating a coding gain, but loses this advantage beyond 15 dB SNR. In contrast, the Bi-LSTM consistently achieves a coding gain and outperforms both the CNN and DNN models. Specifically, the proposed Bi-LSTM model surpasses the uncoded performance by almost 4 dB.

FIGURE 14. Evaluation of PER performance of the Bi-LSTM-based decoder with other decoders at $2^{16}$ epochs, under the condition where $j = 8$ and $N = 16$.

With the exception of Fig. 15, all results presented in this paper are derived from simulations using BPSK modulation. In Fig. 15, however, we extend the evaluation of the proposed model to higher-order modulation, specifically 4QAM, to assess its performance. In the case of 4QAM, two bits are mapped to each symbol, while higher-order QAM schemes such as 16QAM and 64QAM map four and six bits to each symbol, respectively. This increases the spectral efficiency but also increases the noise sensitivity [71], [72]. For this experiment, we adopt a configuration of $j = 2$ and $N = 4$, while keeping all other parameters consistent with the BPSK setup. The performance results indicate that the model achieves favorable outcomes when trained for $2^{8}$ epochs. However, when the number of epochs increases to $2^{12}$, the model exhibits satisfactory performance in the low-SNR regime but fails to maintain consistent performance at SNR values beyond 10 dB. A similar degradation is observed when the model is trained for $2^{4}$ epochs. Additionally, we explored the impact of using longer polar codes, such as $j = 4$ or $j = 8$ with $N = 8$ or $N = 16$; in these configurations, the Bi-LSTM-based model fails to decode the polar codes effectively. This observation suggests that the current model architecture may not generalize well to higher-order modulations in these configurations. While the neural network architecture remains largely the same, the LLR computation must account for the more intricate constellation points in higher-order QAM, and increased model complexity may also be required to ensure effective decoding of these higher-order schemes. Given these results, we chose to focus on BPSK modulation, consistent with prior studies in this area, as summarized in Table 1. This approach ensures a more reliable comparison of the model's performance.

FIGURE 15. The performance of the proposed model for 4QAM modulation, under the condition of $j = 2$ and $N = 4$.

SECTION VI.

Conclusion

This study proposes a Bi-LSTM decoder for short packet transmission over a flat fading channel. We evaluated the PER of the system across different SNR levels, and the simulation results demonstrate that the proposed technique achieves coding gains in fading channels, even when using a basic codebook designed for AWGN channels without fading. The best performance was observed with a learning rate of 0.0005, a training SNR of 0 dB, and a batch size of 256. In general, improvements were achieved by increasing the number of training epochs; specifically, the Bi-LSTM decoder showed significant coding gains when trained for $2^{8}$ epochs. We also compared several optimizers and found that the Adam optimizer provided the best performance. Although the proposed system performed well with both BPSK and 4QAM modulation schemes, 4QAM lost its consistency at higher SNR values; hence, BPSK was chosen as the focus for comparison. The results further demonstrate that the Bi-LSTM decoder outperforms the CNN and DNN models under the same conditions.
