MapReduce task scheduling based on deadline constraints —A study

In the data world, it is not easy to calculate the total capacity of records deposited electronically, in the unit of zeta bytes, yotta bytes stated as Big Data. Hadoop system is used to process huge datasets efficiently and inexpensively. MapReduce program is used to assemble data as per the request, and simple parallel programming model designed for scalability and work distribution on commodity hardware in a consistent approach. To achieve greater performance proper scheduling is required. This paper reports the survey work on Task Scheduling under Deadline Constraint using Map Reduce programming framework. Also this article presented various existing tactics, implementation idea, advantages and disadvantages on Deadline Constraints.

[1]  Shankar Ganesh Manikandan,et al.  Big Data Analysis Using Apache Hadoop , 2014, 2014 International Conference on IT Convergence and Security (ICITCS).

[2]  Garima Sharma,et al.  Performance evaluation of fair and capacity scheduling in Hadoop YARN , 2015, 2015 International Conference on Green Computing and Internet of Things (ICGCIoT).

[3]  Jia Wang,et al.  Task scheduling for MapReduce in heterogeneous networks , 2015, Cluster Computing.

[4]  Liu Yan,et al.  Load Balancing Task Scheduling Algorithm in Hadoop Platform , 2015, 2015 Seventh International Conference on Measuring Technology and Mechatronics Automation.

[5]  Swati Yadav,et al.  Efficient & Accurate Scheduling Algorithm for Cloudera Hadoop , 2015, 2015 International Conference on Computational Intelligence and Communication Networks (CICN).

[6]  Kenli Li,et al.  A self-adaptive scheduling algorithm for reduce start time , 2015, Future Gener. Comput. Syst..

[7]  Kemafor Anyanwu,et al.  Scheduling Hadoop Jobs to Meet Deadlines , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[8]  R. Saravanan,et al.  A comprehensive survey on big data analytics tools , 2016, 2016 Online International Conference on Green Engineering and Technologies (IC-GET).

[9]  P. Dhavachelvan,et al.  Big Data and Hadoop-a Study in Security Perspective , 2015 .

[10]  Xiangming Dai,et al.  Scheduling for response time in Hadoop MapReduce , 2016, 2016 IEEE International Conference on Communications (ICC).

[11]  J. Geetha,et al.  Hadoop Scheduler with Deadline Constraint , 2014, CloudCom 2014.

[12]  Feng Li,et al.  SLA-aware energy-efficient scheduling scheme for Hadoop YARN , 2015, 2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems.

[13]  Yi Yang,et al.  A MapReduce Task Scheduling Algorithm for Deadline-Constraint in Homogeneous Environment , 2014, 2014 Second International Conference on Advanced Cloud and Big Data.

[14]  Zhiping Jia,et al.  SLA-Aware Energy-Efficient Scheduling Scheme for Hadoop YARN , 2015, HPCC/CSS/ICESS.

[15]  Indranil Gupta,et al.  WOHA: Deadline-Aware Map-Reduce Workflow Scheduling Framework over Hadoop Clusters , 2014, 2014 IEEE 34th International Conference on Distributed Computing Systems.

[16]  Weishan Zhang,et al.  A genetic algorithm-based job scheduling model for big data analytics , 2016, EURASIP Journal on Wireless Communications and Networking.

[17]  Kenli Li,et al.  A MapReduce task scheduling algorithm for deadline constraints , 2013, Cluster Computing.

[18]  Yi Yao,et al.  LsPS: A Job Size-Based Scheduler for Efficient Task Assignments in Hadoop , 2015, IEEE Transactions on Cloud Computing.