Rateless Codes for Low-Latency Distributed Inference in Mobile Edge Computing

We consider a mobile edge computing scenario where users want to perform a linear inference operation Wx on local data x for some network-side matrix W . The inference is performed in a distributed fashion over multiple servers at the network edge. For this scenario, we propose a coding scheme that combines a rateless code to provide resiliency against straggling servers—hence reducing the computation latency—and an irregular-repetition code to provide spatial diversity—hence reducing the communication latency. We further derive a lower bound on the total latency—comprising computation latency, communication latency, and decoding latency. The proposed scheme performs remarkably close to the bound and yields significantly lower latency than the scheme based on maximum distance separable codes recently proposed by Zhang and Simeone.

[1]  Meixia Tao,et al.  Exploiting Computation Replication for Mobile Edge Computing: A Fundamental Computation-Communication Tradeoff Study , 2019, IEEE Transactions on Wireless Communications.

[2]  Peter Vary,et al.  Analysis of LT Codes over Finite Fields under Optimal Erasure Decoding , 2013, IEEE Communications Letters.

[3]  Amin Shokrollahi,et al.  Raptor codes , 2011, IEEE Transactions on Information Theory.

[4]  R. Yavne,et al.  An economical method for calculating the discrete Fourier transform , 1899, AFIPS Fall Joint Computing Conference.

[5]  Min Chen,et al.  Task Offloading for Mobile Edge Computing in Software Defined Ultra-Dense Network , 2018, IEEE Journal on Selected Areas in Communications.

[6]  Min Sheng,et al.  Mobile-Edge Computing: Partial Computation Offloading Using Dynamic Voltage Scaling , 2016, IEEE Transactions on Communications.

[7]  Tony Q. S. Quek,et al.  Offloading in Mobile Edge Computing: Task Allocation and Computational Frequency Scaling , 2017, IEEE Transactions on Communications.

[8]  Mohammad Ali Maddah-Ali,et al.  Polynomial Codes: an Optimal Design for High-Dimensional Coded Matrix Multiplication , 2017, NIPS.

[9]  Osvaldo Simeone,et al.  Fundamental Limits of Cloud and Cache-Aided Interference Management With Multi-Antenna Edge Nodes , 2019, IEEE Transactions on Information Theory.

[10]  Gerhard Bauch,et al.  Inactivation Decoding of LT and Raptor Codes: Analysis and Code Design , 2017, IEEE Transactions on Communications.

[11]  Meixia Tao,et al.  Coded Computing and Cooperative Transmission for Wireless Distributed Matrix Multiplication , 2021, IEEE Transactions on Communications.

[12]  Sergio Barbarossa,et al.  Joint Optimization of Radio and Computational Resources for Multicell Mobile-Edge Computing , 2014, IEEE Transactions on Signal and Information Processing over Networks.

[13]  Scott Shenker,et al.  Usenix Association 10th Usenix Symposium on Networked Systems Design and Implementation (nsdi '13) 185 Effective Straggler Mitigation: Attack of the Clones , 2022 .

[14]  Osvaldo Simeone,et al.  On Model Coding for Distributed Inference and Transmission in Mobile Edge Computing Systems , 2019, IEEE Communications Letters.

[15]  Kannan Ramchandran,et al.  Speeding Up Distributed Machine Learning Using Codes , 2015, IEEE Transactions on Information Theory.

[16]  Osvaldo Simeone,et al.  Improved Latency-communication Trade-off for Map-shuffle-reduce Systems with Stragglers , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[17]  Albin Severinson,et al.  A Droplet Approach Based on Raptor Codes for Distributed Computing With Straggling Servers , 2018, 2018 IEEE 10th International Symposium on Turbo Codes & Iterative Information Processing (ISTC).

[18]  Gregory W. Wornell,et al.  Using Straggler Replication to Reduce Latency in Large-scale Parallel Computing , 2015, PERV.

[19]  Qianbin Chen,et al.  Computation Offloading and Resource Allocation in Wireless Cellular Networks With Mobile Edge Computing , 2017, IEEE Transactions on Wireless Communications.

[20]  Luiz André Barroso,et al.  The tail at scale , 2013, CACM.

[21]  Alfred Kobsa,et al.  The Adaptive Web, Methods and Strategies of Web Personalization , 2007, The Adaptive Web.

[22]  Albin Severinson,et al.  Block-Diagonal and LT Codes for Distributed Computing With Straggling Servers , 2017, IEEE Transactions on Communications.

[23]  Zdenek Becvar,et al.  Mobile Edge Computing: A Survey on Architecture and Computation Offloading , 2017, IEEE Communications Surveys & Tutorials.

[24]  Malhar Chaudhari,et al.  Rateless codes for near-perfect load balancing in distributed matrix-vector multiplication , 2018, Proc. ACM Meas. Anal. Comput. Syst..

[25]  Michael Luby,et al.  LT codes , 2002, The 43rd Annual IEEE Symposium on Foundations of Computer Science, 2002. Proceedings..

[26]  Giuliano Garrammone On Decoding Complexity of Reed-Solomon Codes on the Packet Erasure Channel , 2013, IEEE Communications Letters.

[27]  Li Zhou,et al.  Energy-Latency Tradeoff for Energy-Aware Offloading in Mobile Edge Computing Networks , 2018, IEEE Internet of Things Journal.

[28]  Ke Zhang,et al.  Energy-Efficient Offloading for Mobile Edge Computing in 5G Heterogeneous Networks , 2016, IEEE Access.