Load scheduling for distributed edge computing: A communication-computation tradeoff

Due to the intensive computation requirements of emerging applications and the limited computational capability of edge computing servers, the computation task must be executed on multiple edge servers in a distributive and cooperative manner. However, the large amount of information exchanged among the edge servers is a major obstacle for improving the computing performance. By utilizing the excess computational resource, coded MapReduce provides an effective approach to reduce the communication load. In this paper, we develop a stochastic load scheduling framework to complete the computation tasks with coded MapReduce considering the intrinsic tradeoff between the communication and computation loads. Our goal is to minimize the communication load under time-varying excess computational resources. We first reduce this problem to a task scheduling problem by exploiting the property of the computing repetition in the coded MapReduce framework. Since the task scheduling problem is still a stochastic optimization problem, it is generally difficult to solve. In the offline setting, we obtain the optimal computation load scheduling algorithm by adopting the augmented Lagrangian method. In the online setting, we derive a worst-case performance bound of the online equal task scheduling (ETS) algorithm by using competitive analysis. Furthermore, we make full use of past state information of computing resources for pre-planing and propose an improved algorithm based on the ETS algorithm in a learning manner. Finally, our proposed algorithm is evaluated by simulation to demonstrate that the proposed algorithms are superior over the conventional algorithms, and the performance gap between the online and offline algorithms is fairly small.

[1]  Feng Wang,et al.  Joint offloading and computing optimization in wireless powered mobile-edge computing systems , 2017, ICC.

[2]  Zhenni Li,et al.  Tology-Aware Optimal Data Placement Algorithm for Network Traffic Optimization , 2016, IEEE Transactions on Computers.

[3]  Domenico Talia,et al.  How distributed data mining tasks can thrive as knowledge services , 2010, Commun. ACM.

[4]  Lei Ying,et al.  Map task scheduling in MapReduce with data locality: Throughput and heavy-traffic optimality , 2013, INFOCOM.

[5]  Mohammad Ali Maddah-Ali,et al.  Fundamental tradeoff between computation and communication in distributed computing , 2016, ISIT.

[6]  Zhisheng Niu,et al.  Tasks scheduling and resource allocation in heterogeneous cloud for delay-bounded mobile edge computing , 2017, 2017 IEEE International Conference on Communications (ICC).

[7]  Daniel M. Batista,et al.  A Survey of Large Scale Data Management Approaches in Cloud Environments , 2011, IEEE Communications Surveys & Tutorials.

[8]  Eytan Modiano,et al.  A Calculus Approach to Energy-Efficient Data Transmission With Quality-of-Service Constraints , 2009, IEEE/ACM Transactions on Networking.

[9]  V. Lalitha,et al.  Locality-aware hybrid coded MapReduce for server-rack architecture , 2017, 2017 IEEE Information Theory Workshop (ITW).

[10]  Wei Wang,et al.  A POMDP-based optimal spectrum sensing and access scheme for cognitive radio networks with hardware limitation , 2012, 2012 IEEE Wireless Communications and Networking Conference (WCNC).

[11]  Wei Wang,et al.  Delay-Constrained Hybrid Computation Offloading With Cloud and Fog Computing , 2017, IEEE Access.

[12]  Xiaoqiao Meng,et al.  Coupling task progress for MapReduce resource-aware scheduling , 2013, 2013 Proceedings IEEE INFOCOM.

[13]  Murali S. Kodialam,et al.  Joint scheduling of processing and Shuffle phases in MapReduce systems , 2012, 2012 Proceedings IEEE INFOCOM.

[14]  Richard M. Karp,et al.  On-Line Algorithms Versus Off-Line Algorithms: How Much is it Worth to Know the Future? , 1992, IFIP Congress.

[15]  Sanjay Ghemawat,et al.  MapReduce: a flexible data processing tool , 2010, CACM.

[16]  Albert G. Greenberg,et al.  Scarlett: coping with skewed content popularity in mapreduce clusters , 2011, EuroSys '11.

[17]  Cristina L. Abad,et al.  Pandas: Robust Locality-Aware Scheduling With Stochastic Delay Optimality , 2017, IEEE/ACM Transactions on Networking.

[18]  Allan Borodin,et al.  On the Competitive Theory and Practice of Portfolio Selection (Extended Abstract) , 2000, LATIN.

[19]  Mohammad Ali Maddah-Ali,et al.  Coding for Distributed Fog Computing , 2017, IEEE Communications Magazine.

[20]  Robert Birke,et al.  Optimizing Energy, Locality and Priority in a MapReduce Cluster , 2015, 2015 IEEE International Conference on Autonomic Computing.

[21]  Mohammad Ali Maddah-Ali,et al.  Coded MapReduce , 2015, 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[22]  Wei Ni,et al.  Optimal Schedule of Mobile Edge Computing for Internet of Things Using Partial Information , 2017, IEEE Journal on Selected Areas in Communications.

[23]  Jie Wu,et al.  Optimizing MapReduce Based on Locality of K-V Pairs and Overlap between Shuffle and Local Reduce , 2015, 2015 44th International Conference on Parallel Processing.

[24]  Ling Liu,et al.  Purlieus: Locality-aware resource allocation for MapReduce in a cloud , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[25]  Michael I. Jordan,et al.  Managing data transfers in computer clusters with orchestra , 2011, SIGCOMM 2011.

[26]  A. Salman Avestimehr,et al.  A Fundamental Tradeoff Between Computation and Communication in Distributed Computing , 2016, IEEE Transactions on Information Theory.

[27]  Jianwei Niu,et al.  Bandwidth Aware Application Partitioning for Computation Offloading on Mobile Devices , 2011, GreeNets.

[28]  Tony Q. S. Quek,et al.  Offloading in Mobile Edge Computing: Task Allocation and Computational Frequency Scaling , 2017, IEEE Transactions on Communications.

[29]  Allan Borodin,et al.  Online computation and competitive analysis , 1998 .

[30]  Amir Salman Avestimehr,et al.  Coded computation over heterogeneous clusters , 2017, 2017 IEEE International Symposium on Information Theory (ISIT).

[31]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[32]  Amir Salman Avestimehr,et al.  On Heterogeneous Coded Distributed Computing , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[33]  Guenter Klas,et al.  Edge Computing and the Role of Cellular Networks , 2017, Computer.

[34]  Lin Chen,et al.  Heterogeneous Spectrum Aggregation: Coexistence From a Queue Stability Perspective , 2018, IEEE Transactions on Wireless Communications.

[35]  Richard Wang,et al.  OpenFlow-Based Server Load Balancing Gone Wild , 2011, Hot-ICE.

[36]  Mohammad Ali Maddah-Ali,et al.  Coded distributed computing: Fundamental limits and practical challenges , 2016, 2016 50th Asilomar Conference on Signals, Systems and Computers.

[37]  Victor C. M. Leung,et al.  Joint computation and communication resource allocation in mobile-edge cloud computing networks , 2016, 2016 IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC).

[38]  Scott Shenker,et al.  Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling , 2010, EuroSys '10.

[39]  Boon Thau Loo,et al.  Performance Modeling of MapReduce Jobs in Heterogeneous Cloud Environments , 2013, 2013 IEEE Sixth International Conference on Cloud Computing.