Communication-Efficient Edge AI Inference Over Wireless Networks

Given the fast growth of intelligent devices, it is expected that a large number of high-stake artificial intelligence (AI) applications, e.g., drones, autonomous cars, tactile robots, will be deployed at the edge of wireless networks in the near future. As such, the intelligent communication networks will be designed to leverage advanced wireless techniques and edge computing technologies to support AI-enabled applications at various end devices with limited communication, computation, hardware and energy resources. In this article, we shall present the principles of efficient deployment of model inference at network edge to provide low-latency and energy-efficient AI services. This includes the wireless distributed computing framework for low-latency device distributed model inference as well as the wireless cooperative transmission strategy for energy-efficient edge cooperative model inference. The communication efficiency of edge inference systems is further improved by building up a smart radio propagation environment via intelligent reflecting surface.

[1]  Wei Chen,et al.  The Roadmap to 6G: AI Empowered Wireless Networks , 2019, IEEE Communications Magazine.

[2]  Vivienne Sze,et al.  Designing Energy-Efficient Convolutional Neural Networks Using Energy-Aware Pruning , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Mehdi Bennis,et al.  Wireless Network Intelligence at the Edge , 2018, Proceedings of the IEEE.

[4]  T. P. Dinh,et al.  Convex analysis approach to d.c. programming: Theory, Algorithm and Applications , 1997 .

[5]  Yong Zhou,et al.  Reconfigurable Intelligent Surface for Green Edge Inference , 2019, IEEE Transactions on Green Communications and Networking.

[6]  Zhi Zhou,et al.  Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing , 2019, IEEE Transactions on Wireless Communications.

[7]  K. B. Letaief,et al.  Mobile Edge Intelligence and Computing for the Internet of Vehicles , 2019, Proceedings of the IEEE.

[8]  Xu Chen,et al.  Edge Intelligence: Paving the Last Mile of Artificial Intelligence With Edge Computing , 2019, Proceedings of the IEEE.

[9]  Xiao Lu,et al.  Towards Smart Radio Environment for Wireless Communications via Intelligent Reflecting Surfaces: A Comprehensive Survey , 2019, ArXiv.

[10]  Xiao Lu,et al.  Toward Smart Wireless Communications via Intelligent Reflecting Surfaces: A Contemporary Survey , 2019, IEEE Communications Surveys & Tutorials.

[11]  Meixia Tao,et al.  Generalized Signal Alignment: On the Achievable DoF for Multi-User MIMO Two-Way Relay Channels , 2015, IEEE Transactions on Information Theory.

[12]  Massoud Pedram,et al.  JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services , 2018, IEEE Transactions on Mobile Computing.

[13]  Mohammad Ali Maddah-Ali,et al.  Coding for Distributed Fog Computing , 2017, IEEE Communications Magazine.

[14]  Wei Yu,et al.  Multi-Cell MIMO Cooperative Networks: A New Look at Interference , 2010, IEEE Journal on Selected Areas in Communications.

[15]  A. Salman Avestimehr,et al.  A Fundamental Tradeoff Between Computation and Communication in Distributed Computing , 2016, IEEE Transactions on Information Theory.

[16]  Qingqing Wu,et al.  Accessing From the Sky: A Tutorial on UAV Communications for 5G and Beyond , 2019, Proceedings of the IEEE.

[17]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[18]  Zhi Ding,et al.  Energy-Efficient Processing and Robust Wireless Cooperative Transmission for Edge Inference , 2019, IEEE Internet of Things Journal.

[19]  K. B. Letaief,et al.  A Survey on Mobile Edge Computing: The Communication Perspective , 2017, IEEE Communications Surveys & Tutorials.

[20]  Xiaojun Yuan,et al.  Reconfigurable-Intelligent-Surface Empowered Wireless Communications: Challenges and Opportunities , 2020, IEEE Wireless Communications.

[21]  Meixia Tao,et al.  Exploiting Computation Replication for Mobile Edge Computing: A Fundamental Computation-Communication Tradeoff Study , 2019, IEEE Transactions on Wireless Communications.

[22]  Sami Haddadin,et al.  Tactile Robots as a Central Embodiment of the Tactile Internet , 2019, Proceedings of the IEEE.

[23]  Zhi Ding,et al.  Data Shuffling in Wireless Distributed Computing via Low-Rank Optimization , 2018, IEEE Transactions on Signal Processing.