Convolutional Neural Network-Based Polar Decoding

In this work, we extend the capability of convolutional neural network (CNN) to polar code decoding. Previous work has shown that a multi-layer perceptron (MLP), which is a basic form of deep neural network (DNN), can achieve high decoding accuracy and speed for polar code when the block length is very short. However, its performance drops drastically for longer codes, due to the bulky network structure. In this work, we design and implement a CNN for polar decoding. In order to improve the decoding accuracy, we introduce auxiliary labels into CNN output based on the encoding structure of polar code. In addition, we propose to prune the CNN to preserve the decoding accuracy of wider network while reducing the computation and the parameters. With these two innovations, the decoding accuracy of original CNN can be improved. We carry out extensive simulations to compare our designed CNN decoder with MLP decoder. Results show that when the code length is 64, our model is 60 times smaller than the MLP decoder, and the accuracy of our model increases with model size, while MLP reaches saturation. Additional results show that our proposed method outperforms original CNN with regard to the BER performance under marginal parameter increase.

[1]  Erdal Arikan,et al.  Channel Polarization: A Method for Constructing Capacity-Achieving Codes for Symmetric Binary-Input Memoryless Channels , 2008, IEEE Transactions on Information Theory.

[2]  Warren J. Gross,et al.  Neural offset min-sum decoding , 2017, 2017 IEEE International Symposium on Information Theory (ISIT).

[3]  Sreeram Kannan,et al.  Communication Algorithms via Deep Learning , 2018, ICLR.

[4]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[5]  Yair Be'ery,et al.  Learning to decode linear codes using deep learning , 2016, 2016 54th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[6]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Stephan ten Brink,et al.  On deep learning-based channel decoding , 2017, 2017 51st Annual Conference on Information Sciences and Systems (CISS).

[8]  Stephan ten Brink,et al.  Scaling Deep Learning-Based Decoding of Polar Codes via Partitioning , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[9]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[10]  Song Han,et al.  Deep Compression: Compressing Deep Neural Network with Pruning, Trained Quantization and Huffman Coding , 2015, ICLR.

[11]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[12]  Julie Ann A. Salido,et al.  Using Deep Learning for Melanoma Detection in Dermoscopy Images , 2018 .

[13]  Fernando A. Mujica,et al.  An Empirical Evaluation of Deep Learning on Highway Driving , 2015, ArXiv.

[14]  Kiran Karra,et al.  Learning to communicate: Channel auto-encoders, domain specific regularizers, and attention , 2016, 2016 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT).

[15]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[16]  Alexander Vardy,et al.  List decoding of polar codes , 2011, 2011 IEEE International Symposium on Information Theory Proceedings.

[17]  Cong Shen,et al.  An Iterative BP-CNN Architecture for Channel Decoding , 2017, IEEE Journal of Selected Topics in Signal Processing.

[18]  Hanan Samet,et al.  Pruning Filters for Efficient ConvNets , 2016, ICLR.

[19]  Rüdiger L. Urbanke,et al.  Polar Codes for Channel and Source Coding , 2009, ArXiv.