Coded Federated Computing in Wireless Networks with Straggling Devices and Imperfect CSI

Distributed computing platforms typically assume the availability of reliable and dedicated connections among the processors. This work considers an alternative scenario, relevant for wireless data centers and federated learning, in which the distributed processors, operating on generally distinct coded data, are connected via shared wireless channels accessed via full-duplex transmission. The study accounts for both wireless and computing impairments, including interference, imperfect Channel State Information, and straggling processors, and it assumes a Map-Shuffle-Reduce coded computing paradigm. The total latency of the system, obtained as the sum of computing and communication delays, is studied for different shuffling strategies revealing the interplay between distributed computing, coding, and cooperative or coordinated transmission.

[1]  Paul de Kerret,et al.  On the degrees of freedom of the K-user time correlated broadcast channel with delayed CSIT , 2013, 2013 IEEE International Symposium on Information Theory.

[2]  Osvaldo Simeone,et al.  Improved Latency-communication Trade-off for Map-shuffle-reduce Systems with Stragglers , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3]  Mohammad Ali Maddah-Ali,et al.  A Unified Coding Framework for Distributed Computing with Straggling Servers , 2016, 2016 IEEE Globecom Workshops (GC Wkshps).

[4]  Li Zhou,et al.  Energy-Latency Tradeoff for Energy-Aware Offloading in Mobile Edge Computing Networks , 2018, IEEE Internet of Things Journal.

[5]  Sheng Yang,et al.  A Storage-Computation-Communication Tradeoff for Distributed Computing , 2018, 2018 15th International Symposium on Wireless Communication Systems (ISWCS).

[6]  A. Salman Avestimehr,et al.  A Fundamental Tradeoff Between Computation and Communication in Distributed Computing , 2016, IEEE Transactions on Information Theory.

[7]  Joonhyuk Kang,et al.  Wireless Map-Reduce Distributed Computing with Full-Duplex Radios and Imperfect CSI , 2019, 2019 IEEE 20th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC).

[8]  Bruno Clerckx,et al.  Sum-Rate Maximization for Linearly Precoded Downlink Multiuser MISO Systems With Partial CSIT: A Rate-Splitting Approach , 2016, IEEE Transactions on Communications.

[9]  Amir Salman Avestimehr,et al.  Lagrange Coded Computing: Optimal Design for Resiliency, Security and Privacy , 2018, AISTATS.

[10]  Farzin Haddadpour,et al.  On the optimal recovery threshold of coded matrix multiplication , 2017, 2017 55th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[11]  Peter Richtárik,et al.  Federated Learning: Strategies for Improving Communication Efficiency , 2016, ArXiv.

[12]  Kannan Ramchandran,et al.  Speeding Up Distributed Machine Learning Using Codes , 2015, IEEE Transactions on Information Theory.

[13]  Natalie D. Enright Jerger,et al.  Outstanding Research Problems in NoC Design: System, Microarchitecture, and Circuit Perspectives , 2009, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[14]  Osvaldo Simeone,et al.  Fog-Aided Wireless Networks for Content Delivery: Fundamental Latency Tradeoffs , 2016, IEEE Transactions on Information Theory.

[15]  Mehdi Bennis,et al.  Wireless Network Intelligence at the Edge , 2018, Proceedings of the IEEE.

[16]  Khaled Ben Letaief,et al.  Dynamic Computation Offloading for Mobile-Edge Computing With Energy Harvesting Devices , 2016, IEEE Journal on Selected Areas in Communications.

[17]  Mohamed-Slim Alouini,et al.  Wireless Data Center Networks: Advances, Challenges, and Opportunities , 2018, ArXiv.

[18]  Fan Li,et al.  Wireless MapReduce Distributed Computing , 2018, 2018 IEEE International Symposium on Information Theory (ISIT).