An architecture for scheduling with the capability of minimum share to heterogeneous Hadoop systems

Job scheduling in Hadoop has been thus far investigated in several studies. However, some challenges including minimum share (min-share), heterogeneous cluster, execution time estimation, and scheduling program size facing Hadoop clusters have received less attention. Accordingly, one of the most important algorithms with regard to min-share is that presented by Facebook Inc., i.e., FAIR scheduler, based on its own needs, in which an equal min-share has been considered for users. In this article, an attempt has been made to make the proposed method superior to existing methods through automation and configuration, performance optimization, fairness and data locality. A high-level architectural model is designed. Then a scheduler is defined on this architectural model. The provided scheduler contains four components. Three components schedule jobs and one component distributes the data for each job among the nodes. The given scheduler will be capable of being executed on heterogeneous Hadoop clusters and running jobs in parallel, in which disparate min-shares can be assigned to each job or user. Moreover, an approach is presented for each problem associated with min-share, cluster heterogeneity, execution time estimation, and scheduler program size. These approaches can be also utilized on its own to improve the performance of other scheduling algorithms. The scheduler presented in this paper showed acceptable performance compared with First-In, First-Out (FIFO), and FAIR schedulers.

[1]  Archana Ganapathi,et al.  The Case for Evaluating MapReduce Performance Using Workload Suites , 2011, 2011 IEEE 19th Annual International Symposium on Modelling, Analysis, and Simulation of Computer and Telecommunication Systems.

[2]  Shengchao Zhou,et al.  A random-keys genetic algorithm for scheduling unrelated parallel batch processing machines with different capacities and arbitrary job sizes , 2018, Appl. Math. Comput..

[3]  Chan-Hyun Youn,et al.  Multihybrid job scheduling for fault-tolerant distributed computing in policy-constrained resource networks , 2015, Comput. Networks.

[4]  Christian Trudeau,et al.  Stability and fairness in the job scheduling problem , 2019, Games Econ. Behav..

[5]  Hassan Ghasemzadeh,et al.  Big vs little core for energy-efficient Hadoop computing , 2019, J. Parallel Distributed Comput..

[6]  Faten Hamad,et al.  An Overview of Hadoop Scheduler Algorithms , 2018, Modern Applied Science.

[7]  Jordi Pereira,et al.  Minimizing the number of machines with limited workload capacity for scheduling jobs with interval constraints , 2019, Applied Mathematical Modelling.

[8]  Shanlin Yang,et al.  Algorithms for scheduling incompatible job families on single batching machine with limited capacity , 2014, Comput. Ind. Eng..

[9]  Atul Negi,et al.  A data locality based scheduler to enhance MapReduce performance in heterogeneous environments , 2019, Future Gener. Comput. Syst..

[10]  Yang Hu,et al.  Concurrent container scheduling on heterogeneous clusters with multi-resource constraints , 2020, Future Gener. Comput. Syst..

[11]  Shoney Sebastian,et al.  Comparative study of Job Schedulers in Hadoop Environment , 2017 .

[12]  Xiao Qin,et al.  Research on Scheduling Scheme for Hadoop Clusters , 2013, ICCS.

[13]  Matei Zaharia,et al.  Job Scheduling for Multi-User MapReduce Clusters , 2009 .

[14]  Gabriel Antoniu,et al.  Enabling fast failure recovery in shared Hadoop clusters: Towards failure-aware scheduling , 2017, Future Gener. Comput. Syst..

[15]  Florin Pop,et al.  Deadline scheduling algorithm for sustainable computing in Hadoop environment , 2018, Comput. Secur..

[16]  M. Kumar,et al.  Tolhit – A Scheduling Algorithm for Hadoop Cluster , 2016 .

[17]  N. P. Gopalan,et al.  An Optimal Task Selection Scheme for Hadoop Scheduling , 2014 .

[18]  Yong Chen,et al.  CARS: A contention-aware scheduler for efficient resource management of HPC storage systems , 2019, Parallel Comput..

[19]  Michael J. Freedman,et al.  SLAQ: quality-driven scheduling for distributed machine learning , 2017, SoCC.

[20]  Yunhao Liu,et al.  Big Data: A Survey , 2014, Mob. Networks Appl..

[21]  Maozhen Li,et al.  MRSim: A discrete event based MapReduce simulator , 2010, 2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery.

[22]  Douglas G. Down,et al.  COSHH: A classification and optimization based scheduler for heterogeneous Hadoop systems , 2014, Future Gener. Comput. Syst..

[23]  T. Breur Statistical Power Analysis and the contemporary “crisis” in social sciences , 2016 .

[24]  Cees Witteveen,et al.  ThroughputScheduler: Learning to Schedule on Heterogeneous Hadoop Clusters , 2013, ICAC.

[25]  Min Chen,et al.  Job schedulers for Big data processing in Hadoop environment: testing real-life schedulers using benchmark programs , 2017, Digit. Commun. Networks.