Energy-Efficient Processing and Robust Wireless Cooperative Transmission for Edge Inference

Edge machine learning can deliver low-latency, private artificial intelligence (AI) services to mobile devices by leveraging computation and storage resources at the network edge. This article presents an energy-efficient edge processing framework for executing deep learning inference tasks at edge computing nodes whose wireless connections to mobile devices are subject to channel uncertainty. To minimize the sum of computation and transmission power consumption under probabilistic quality-of-service (QoS) constraints, we formulate a joint inference tasking and downlink beamforming problem characterized by a group sparse objective function. We provide a statistical learning-based robust optimization approach that approximates the highly intractable probabilistic QoS constraints by nonconvex quadratic constraints, which are then reformulated as matrix inequalities with a rank-one constraint via matrix lifting. We design a reweighted power minimization approach that alternates between solving an iteratively reweighted $\ell_{1}$ minimization problem with difference-of-convex-functions (DC) regularization and updating the weights: the reweighting enhances group sparsity, whereas the DC regularization induces rank-one solutions. Numerical results demonstrate that the proposed approach outperforms other state-of-the-art approaches.
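The two ingredients named in the abstract, a DC regularizer that induces rank-one solutions and a weight update that enhances group sparsity, can be illustrated with a minimal NumPy sketch. It rests on assumptions the abstract does not state: the DC term is taken to be the commonly used form $\mathrm{Tr}(X) - \|X\|_2$, which is zero for a positive semidefinite $X$ exactly when $\mathrm{rank}(X) \le 1$, and the weight rule $w_g = 1/(\|v_g\|_2 + \epsilon)$ is the standard reweighted-$\ell_1$ heuristic. The function names and the value of `eps` are hypothetical, and this is not the article's algorithm.

```python
import numpy as np


def dc_rank_one_gap(X):
    """Return Tr(X) - ||X||_2 for a Hermitian PSD matrix X.

    Assumed DC form: the trace and the spectral norm are both convex,
    and their difference vanishes for PSD X exactly when rank(X) <= 1.
    """
    eigvals = np.linalg.eigvalsh(X)  # real eigenvalues, ascending order
    # For PSD X: trace = sum of eigenvalues, spectral norm = largest one.
    return float(eigvals.sum() - eigvals[-1])


def reweight_groups(groups, eps=1e-3):
    """Standard reweighted-l1 update (an assumption, not the paper's rule).

    Groups with a small norm receive a large weight, which pushes them
    toward exactly zero in the next weighted minimization.
    """
    return np.array([1.0 / (np.linalg.norm(g) + eps) for g in groups])


# A lifted rank-one matrix v v^T has zero gap; the identity does not.
v = np.array([1.0, 2.0])
gap_rank_one = dc_rank_one_gap(np.outer(v, v))
gap_identity = dc_rank_one_gap(np.eye(2))

# A near-inactive group is weighted far more heavily than an active one.
weights = reweight_groups([np.zeros(2), np.array([3.0, 4.0])])
```

In a full solver, each outer iteration would minimize the weighted objective plus the linearized DC term subject to the lifted constraints, then refresh the weights from the new solution. That convex subproblem is omitted here because its exact form is not given in the abstract.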
