Network-aware task selection to reduce multi-application makespan in cloud

Abstract One new metric that plays a vital role in evaluating the cloud service is the multi-application makespan. There are usually multiple applications without a deadline in the cloud, while the makespan of each application should be minimized, such as parallel applications for training neural networks. However, the previous scheduling rule, called moving the computation tasks of applications closer to data, fails to reduce multi-application makespan. Because it is unknown which application the task belongs to, the number of optimized tasks (close-to-data tasks) is unbalanced among multiple applications, which causes a big gap between the makespan of multiple applications. To address this issue, we propose a cooperative scheduler that considers the application-task relationship, whose goal is reducing the multi-application makespan. Firstly, we propose a cost sharing game model to guide the balance of optimized task selection between multiple applications, in which the cost is referred to as the makespan. Specifically, we develop a network-aware method that can accurately estimate the makespan with a given task selection strategy. In the end, we evaluate the cooperative scheduler in the actual Hadoop cluster with diverse network environments. Experimental results demonstrate that the gap between multi-application makespan decreases. Moreover, compared to the baseline schedulers, the longest makespan decreases by 61.5%, and the network traffic is saved by more than 50%.

[1]  Carlo Curino,et al.  Apache Hadoop YARN: yet another resource negotiator , 2013, SoCC.

[2]  Ling Liu,et al.  Purlieus: Locality-aware resource allocation for MapReduce in a cloud , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[3]  Jieun Choi,et al.  Data-Locality Aware Scientific Workflow Scheduling Methods in HPC Cloud Environments , 2016, International Journal of Parallel Programming.

[4]  Atul Negi,et al.  A data locality based scheduler to enhance MapReduce performance in heterogeneous environments , 2019, Future Gener. Comput. Syst..

[5]  Deep Medhi,et al.  Building Application-Aware Network Environments using SDN for Optimizing Hadoop Applications , 2016, SIGCOMM.

[6]  Dependency-Aware Data Locality for MapReduce , 2018, IEEE Transactions on Cloud Computing.

[7]  Federico Alvarez,et al.  An Edge-to-Cloud Virtualized Multimedia Service Platform for 5G Networks , 2019, IEEE Transactions on Broadcasting.

[8]  Dan Wu,et al.  TADOOP: Mining Network Traffic Anomalies with Hadoop , 2015, SecureComm.

[9]  Francis C. M. Lau,et al.  BAShuffler: Maximizing Network Bandwidth Utilization in the Shuffle of YARN , 2016, HPDC.

[10]  Ruini Xue,et al.  BOLAS+: Scalable Lightweight Locality-Aware Scheduling for Hadoop , 2016, 2016 IEEE Trustcom/BigDataSE/ISPA.

[11]  Huaxi Gu,et al.  Improving Cloud-Based IoT Services Through Virtual Network Embedding in Elastic Optical Inter-DC Networks , 2019, IEEE Internet of Things Journal.

[12]  Jesús Carretero,et al.  A data‐aware scheduling strategy for workflow execution in clouds , 2017, Concurr. Comput. Pract. Exp..

[13]  Deep Medhi,et al.  Application-Aware Network Design for Hadoop MapReduce Optimization Using Software-Defined Networking , 2017, IEEE Transactions on Network and Service Management.

[14]  Jia Wang,et al.  Task scheduling for MapReduce in heterogeneous networks , 2015, Cluster Computing.

[15]  C. Newcombe,et al.  How Amazon web services uses formal methods , 2015, Commun. ACM.

[16]  Lei Ying,et al.  MapTask Scheduling in MapReduce With Data Locality: Throughput and Heavy-Traffic Optimality , 2013, IEEE/ACM Transactions on Networking.

[17]  Youlong Luo,et al.  Cost-effective replication management and scheduling in edge computing , 2019, J. Netw. Comput. Appl..

[18]  Helen D. Karatza,et al.  Scheduling a Job Mix of Bag-of-Tasks and Bag-of-Task-Chains on Distributed Resources , 2020, 2020 11th International Conference on Information and Communication Systems (ICICS).

[19]  Franck Le,et al.  Phurti: Application and Network-Aware Flow Scheduling for Multi-tenant MapReduce Clusters , 2016, 2016 IEEE International Conference on Cloud Engineering (IC2E).

[20]  Feng Wang,et al.  On Datacenter-Network-Aware Load Balancing in MapReduce , 2015, 2015 IEEE 8th International Conference on Cloud Computing.

[21]  Jordi Torres,et al.  Deadline-Based MapReduce Workload Management , 2013, IEEE Transactions on Network and Service Management.

[22]  Jian Xiao,et al.  QKnober: A Knob-Based Fairness-Efficiency Scheduler for Cloud Computing with QoS Guarantees , 2018, ICSOC.

[23]  Geoffrey G. Xie,et al.  Hadoop MapReduce for Mobile Clouds , 2019, IEEE Transactions on Cloud Computing.

[24]  Christof Fetzer,et al.  EHadoop: Network I/O Aware Scheduler for Elastic MapReduce Cluster , 2015, 2015 IEEE 8th International Conference on Cloud Computing.

[25]  Zhuo Tang,et al.  An Optimal Locality-Aware Task Scheduling Algorithm Based on Bipartite Graph Modelling for Spark Applications , 2020, IEEE Transactions on Parallel and Distributed Systems.

[26]  Yi Liu,et al.  DScheduler: Dynamic Network Scheduling Method for MapReduce in Distributed Controllers , 2016, 2016 IEEE 22nd International Conference on Parallel and Distributed Systems (ICPADS).

[27]  Jianping Wu,et al.  Joint optimization of tasks placement and routing to minimize Coflow Completion Time , 2019, J. Netw. Comput. Appl..

[28]  Song Wang,et al.  A Deadline-Aware Coflow Scheduling Approach for Big Data Applications , 2018, 2018 IEEE International Conference on Communications (ICC).

[29]  Frédéric Giroire,et al.  When Network Matters: Data Center Scheduling with Network Tasks , 2019, IEEE INFOCOM 2019 - IEEE Conference on Computer Communications.

[30]  Cong Xu,et al.  Optimal construction of virtual networks for Cloud-based MapReduce workflows , 2017, Comput. Networks.

[31]  Ravindra K. Ahuja,et al.  Network Flows: Theory, Algorithms, and Applications , 1993 .

[32]  Lei Yu,et al.  Probabilistic Network-Aware Task Placement for MapReduce Scheduling , 2016, 2016 IEEE International Conference on Cluster Computing (CLUSTER).

[33]  Namgi Kim,et al.  An Enhanced Data-Locality-Aware Task Scheduling Algorithm for Hadoop Applications , 2018, IEEE Systems Journal.

[34]  Scott Shenker,et al.  Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling , 2010, EuroSys '10.

[35]  Michael Lang,et al.  Load‐balanced and locality‐aware scheduling for data‐intensive workloads at extreme scales , 2016, Concurr. Comput. Pract. Exp..

[36]  Marcelo Veiga Neves,et al.  Application-aware software-defined networking to accelerate mapreduce applications (Application-aware software-defined networking to accelerate mapreduce applications) , 2015 .

[37]  Min Li,et al.  CAM: a topology aware minimum cost flow based resource manager for MapReduce applications in the cloud , 2012, HPDC '12.

[38]  Heeseung Jo,et al.  Enhancing network I/o performance for a virtualized Hadoop cluster , 2017, Concurr. Comput. Pract. Exp..

[39]  Jie Wu,et al.  Optimizing MapReduce Based on Locality of K-V Pairs and Overlap between Shuffle and Local Reduce , 2015, 2015 44th International Conference on Parallel Processing.

[40]  Jiangchuan Liu,et al.  vLocality: Revisiting Data Locality for MapReduce in Virtualized Clouds , 2017, IEEE Network.

[41]  Minyi Guo,et al.  OFScheduler: A Dynamic Network Optimizer for MapReduce in Heterogeneous Cluster , 2013, International Journal of Parallel Programming.

[42]  Olivier Beaumont,et al.  Performance analysis and optimality results for data-locality aware tasks scheduling with replicated inputs , 2020, Future Gener. Comput. Syst..