A Profit-Maximum Resource Allocation Approach for Mapreduce in Data Centers

Resource allocation for Mapreduce data processing poses difficult challenges to system administrators in data centers. The extreme scale of Mapreduce applications require an efficiently profitable resource allocation algorithm that minimizes the energy consumption cost while maintaining the highest level of performance. In this paper, we propose a profit-maximum model that minimizes the cost of energy consumption and makespan. By adopting a minimum-weight b-matching rounding algorithm (MBRA) to find an integer solution, then assign map/reduce tasks to individual slots to build a complete resource allocation. Finally, we perform experiments on real workload to evaluate the profit-maximum model and analyze the performance of our proposed algorithm. The results show that MBRA is able to find a near-optimal integer solution that maximizes the profit per unit time in a lower runtime, and it is up to 30%~70% in profit that is better than the current heuristic scheduling algorithm and the rounding algorithm.

[1]  Keke Chen,et al.  Towards Optimal Resource Provisioning for Running MapReduce Programs in Public Clouds , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[2]  Klaus Jansen,et al.  Improved Approximation Schemes for Scheduling Unrelated Parallel Machines , 2001, Math. Oper. Res..

[3]  Li Shan Seadown: SLA-Aware Size-Scaling Power Management in Heterogeneous MapReduce Cluster , 2013 .

[4]  Anthony A. Maciejewski,et al.  Energy-Aware Profit Maximizing Scheduling Algorithm for Heterogeneous Computing Systems , 2014, 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[5]  Weisong Shi,et al.  Energy-Aware Scheduling of MapReduce Jobs for Big Data Applications , 2015, IEEE Transactions on Parallel and Distributed Systems.

[6]  Yuping Wang,et al.  Energy-efficient Task Scheduling Model based on MapReduce for Cloud Computing using Genetic Algorithm , 2012, J. Comput..

[7]  Bert Huang,et al.  Fast b-matching via Sufficient Selection Belief Propagation , 2011, AISTATS.

[8]  Anthony A. Maciejewski,et al.  Deadline and energy constrained dynamic resource allocation in a heterogeneous computing environment , 2011, 2011 40th International Conference on Parallel Processing Workshops.

[9]  Andreas Thor,et al.  Load Balancing for MapReduce-based Entity Resolution , 2011, 2012 IEEE 28th International Conference on Data Engineering.

[10]  Deying Li,et al.  Minimizing makespan and total completion time in MapReduce-like systems , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[11]  Weisong Shi,et al.  Workload Analysis, Implications, and Optimization on a Production Hadoop Cluster: A Case Study on Taobao , 2014, IEEE Transactions on Services Computing.

[12]  Fang-Yie Leu,et al.  Impact of MapReduce Policies on Job Completion Reliability and Job Energy Consumption , 2015, IEEE Transactions on Parallel and Distributed Systems.

[13]  Éva Tardos,et al.  An approximation algorithm for the generalized assignment problem , 1993, Math. Program..

[14]  Anirban Dasgupta,et al.  On scheduling in map-reduce and flow-shops , 2011, SPAA '11.

[15]  Lei Ying,et al.  MapTask Scheduling in MapReduce With Data Locality: Throughput and Heavy-Traffic Optimality , 2013, IEEE/ACM Transactions on Networking.

[16]  Ling Liu,et al.  Cost-Effective Resource Provisioning for MapReduce in a Cloud , 2015, IEEE Transactions on Parallel and Distributed Systems.

[17]  Xuejie Zhang,et al.  A Task-Type-Based Algorithm for the Energy-Aware Profit Maximizing Scheduling Problem in Heterogeneous Computing Systems , 2015, 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[18]  Roy H. Campbell,et al.  Two Sides of a Coin: Optimizing the Schedule of MapReduce Jobs to Minimize Their Makespan and Improve Cluster Performance , 2012, 2012 IEEE 20th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[19]  Bu-Sung Lee,et al.  Dynamic Job Ordering and Slot Configurations for MapReduce Workloads , 2016, IEEE Transactions on Services Computing.

[20]  Vasudeva Varma,et al.  Dynamic energy efficient data placement and cluster reconfiguration algorithm for MapReduce framework , 2012, Future Gener. Comput. Syst..