Network-Aware Task Assignment for MapReduce Applications in Shared Clusters

Running MapReduce applications in shared clusters is becoming increasingly compelling to improve the cluster utilization. However, the network sharing across diverse applications can make the network bandwidth for MapReduce applications constrained and heterogeneous, which inevitably increases the severity of network hotspots in racks, and makes the existing task assignment policies that focus on the data locality no longer effective. To deal with this issue, this paper proposes a lightweight networkaware task assignment strategy for MapReduce applications in shared clusters. By analyzing the relationship between job completion time and the assignment of both map and reduce tasks across racks, it devises and integrates two simple yet effective greedy heuristics, which can minimize the completion time of map phase and reduce phase, respectively. With extensive prototype experiments on a 12-node 3-rack MapReduce cluster and complementary large-scale simulations driven by Facebook job traces, we demonstrate that our network-aware strategy can shorten the completion time of MapReduce jobs, in comparison to the state-of-the-art task assignment strategies, yet with an acceptable computational overhead.

[1]  Athanasios V. Vasilakos,et al.  Managing Performance Overhead of Virtual Machines in Cloud Computing: A Survey, State of the Art, and Future Directions , 2014, Proceedings of the IEEE.

[2]  Dinan Gunawardena,et al.  Chatty Tenants and the Cloud Network Sharing Problem , 2013, NSDI.

[3]  Mohammad Hammoud,et al.  Locality-Aware Reduce Task Scheduling for MapReduce , 2011, 2011 IEEE Third International Conference on Cloud Computing Technology and Science.

[4]  Ling Liu,et al.  Purlieus: Locality-aware resource allocation for MapReduce in a cloud , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[5]  Anand Raghunathan,et al.  ShuffleWatcher: Shuffle-aware Scheduling in Multi-tenant MapReduce Clusters , 2014, USENIX Annual Technical Conference.

[6]  Archana Ganapathi,et al.  The Case for Evaluating MapReduce Performance Using Workload Suites , 2011, 2011 IEEE 19th Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems.

[7]  Xiaobo Zhou,et al.  Improving MapReduce performance in heterogeneous environments with adaptive task tuning , 2014, Middleware.

[8]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[9]  Randy H. Katz,et al.  Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center , 2011, NSDI.

[10]  Antony I. T. Rowstron,et al.  Bridging the tenant-provider gap in cloud services , 2012, SoCC '12.

[11]  Harold P. Benson Multi-objective Optimization: Pareto Optimal Solutions, Properties , 2009, Encyclopedia of Optimization.

[12]  David A. Maltz,et al.  Network traffic characteristics of data centers in the wild , 2010, IMC '10.

[13]  Hai Jin,et al.  Boosting MapReduce with Network-Aware Task Assignment , 2013, CloudComp.

[14]  Changjun Jiang,et al.  FlexSlot: Moving Hadoop Into the Cloud with Flexible Slot Management , 2014, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.

[15]  Murali S. Kodialam,et al.  Joint scheduling of processing and Shuffle phases in MapReduce systems , 2012, 2012 Proceedings IEEE INFOCOM.

[16]  Hari Balakrishnan,et al.  Choreo: network-aware task placement for cloud applications , 2013, Internet Measurement Conference.

[17]  M. Fischer,et al.  Assigning Tasks for Efficiency in Hadoop ∗ , 2010 .

[18]  Michael J. Fischer,et al.  Assigning tasks for efficiency in Hadoop: extended abstract , 2010, SPAA '10.

[19]  Geoffrey C. Fox,et al.  Investigation of Data Locality in MapReduce , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[20]  Scott Shenker,et al.  Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling , 2010, EuroSys '10.

[21]  Bo Li,et al.  iAware: Making Live Migration of Virtual Machines Interference-Aware in the Cloud , 2014, IEEE Transactions on Computers.

[22]  Albert G. Greenberg,et al.  Reining in the Outliers in Map-Reduce Clusters using Mantri , 2010, OSDI.

[23]  Cheng-Zhong Xu,et al.  Interference and locality-aware task scheduling for MapReduce applications in virtual clusters , 2013, HPDC.

[24]  Jasbir S. Arora,et al.  Survey of multi-objective optimization methods for engineering , 2004 .

[25]  Michael I. Jordan,et al.  Managing data transfers in computer clusters with orchestra , 2011, SIGCOMM.

[26]  Shalini Arora,et al.  A variant of time minimizing assignment problem , 1998, Eur. J. Oper. Res..

[27]  Hai Jin,et al.  Falloc: Fair network bandwidth allocation in IaaS datacenters via a bargaining game approach , 2013, 2013 21st IEEE International Conference on Network Protocols (ICNP).