Compression and Acceleration of Neural Networks for Communications

DL has achieved great success in signal processing and communications and has become a promising technology for future wireless communications. Existing works mainly focus on exploiting DL to improve the performance of communication systems. However, the high memory requirement and computational complexity constitute a major hurdle for the practical deployment of DL-based communications. In this article, we investigate how to compress and accelerate the neural networks (NNs) in communication systems. After introducing the deployment challenges for DL-based communication algorithms, we discuss some representative NN compression and acceleration techniques. Afterwards, two case studies for multiple-input-multiple- output (MIMO) communications, including DL-based channel state information feedback and signal detection, are presented to show the feasibility and potential of these techniques. We finally identify some challenges on NN compression and acceleration in DL-based communications and provide a guideline for subsequent research.

[1]  Jakob Hoydis,et al.  An Introduction to Deep Learning for the Physical Layer , 2017, IEEE Transactions on Cognitive Communications and Networking.

[2]  Cong Shen,et al.  An Iterative BP-CNN Architecture for Channel Decoding , 2017, IEEE Journal of Selected Topics in Signal Processing.

[3]  Qiang Chen,et al.  Network In Network , 2013, ICLR.

[4]  Shi Jin,et al.  Deep Learning for Massive MIMO CSI Feedback , 2017, IEEE Wireless Communications Letters.

[5]  Ami Wiesel,et al.  Learning to Detect , 2018, IEEE Transactions on Signal Processing.

[6]  Qiang Liu,et al.  Acquisition of channel state information in heterogeneous cloud radio access networks: challenges and research directions , 2015, IEEE Wireless Communications.

[7]  Robert W. Heath,et al.  Non-Stationarities in Extra-Large-Scale Massive MIMO , 2019, IEEE Wireless Communications.

[8]  Tao Jiang,et al.  Deep learning for wireless physical layer: Opportunities and challenges , 2017, China Communications.

[9]  Vivienne Sze,et al.  Efficient Processing of Deep Neural Networks: A Tutorial and Survey , 2017, Proceedings of the IEEE.

[10]  Geoffrey Ye Li,et al.  Convolutional Neural Network-Based Multiple-Rate Compressive Sensing for Massive MIMO CSI Feedback: Design, Simulation, and Analysis , 2019, IEEE Transactions on Wireless Communications.

[11]  Geoffrey Ye Li,et al.  Deep Learning-Based CSI Feedback Approach for Time-Varying Massive MIMO Channels , 2018, IEEE Wireless Communications Letters.

[12]  Forrest N. Iandola,et al.  SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[13]  Rich Caruana,et al.  Model compression , 2006, KDD '06.

[14]  Geoffrey Ye Li,et al.  Model-Driven Deep Learning for Physical Layer Communications , 2018, IEEE Wireless Communications.

[15]  Biing-Hwang Juang,et al.  Deep Learning in Physical Layer Communications , 2018, IEEE Wireless Communications.