Survey on job scheduling mechanisms in grid environment

Grid systems provide geographically distributed resources for both computational intensive and data-intensive applications.These applications generate large data sets.However, the high latency imposed by the underlying technologies; upon which the grid system is built (such as the Internet and WWW), induced impediment in the effective access to such huge and widely distributed data.To minimize this impediment, jobs need to be scheduled across grid environments to achieve efficient data access.Scheduling multiple data requests submitted by grid users onto the grid environment is NP-hard.Thus, there is no best scheduling algorithm that cuts across all grids computing environments.Job scheduling is one of the key research area in grid computing.In the recent past many researchers have proposed different mechanisms to help scheduling of user jobs in grid systems.Some characteristic features of the grid components; such as machines types and nature of jobs at hand means that a choice needs to be made for an appropriate scheduling algorithm to march a given grid environment.The aim of scheduling is to achieve maximum possible system throughput and to match the application needs with the available computing resources.This paper is motivated by the need to explore the various job scheduling techniques alongside their area of implementation.The paper will systematically analyze the strengths and weaknesses of some selected approaches in the area of grid jobs scheduling.This helps researchers better understand the concept of scheduling, and can contribute in developing more efficient and practical scheduling algorithms.This will also benefit interested researchers to carry out further work in this dynamic research area.

[1]  Jemal H. Abawajy,et al.  Data Replication Approach with Consistency Guarantee for Data Grid , 2014, IEEE Transactions on Computers.

[2]  Kavitha Ranganathan,et al.  Decoupling computation and data scheduling in distributed data-intensive applications , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[3]  Anindya Jyoti Pal,et al.  Deadline Based Performance Evaluation of Job Scheduling Algorithms , 2012, 2012 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery.

[4]  Jing Wang,et al.  A Heuristic Algorithm for Scheduling on Grid Computing Environment , 2012, 2012 Seventh ChinaGrid Annual Conference.

[5]  Lakshmi Ravi Anikode,et al.  Integrating Scheduling and Replication in Data Grids with Performance Guarantee , 2011, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.

[6]  Albert Y. Zomaya,et al.  Survey on Grid Resource Allocation Mechanisms , 2014, Journal of Grid Computing.

[7]  Abbas Horri,et al.  A Novel Job Scheduling Algorithm for Improving Data Grid's Performance , 2011, 2011 International Conference on P2P, Parallel, Grid, Cloud and Internet Computing.

[8]  S. Rajkumar,et al.  Hybrid approach for monitoring and scheduling the job in heterogeneous system , 2014, International Conference on Information Communication and Embedded Systems (ICICES2014).

[9]  Albert Y. Zomaya,et al.  Intelligent Scheduling and Replication in Datagrids: a Synergistic Approach , 2007, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07).

[10]  Shirin Dehghani Zahedani,et al.  A hybrid batch job scheduling algorithm for grid environment , 2014, 2014 4th International Conference on Computer and Knowledge Engineering (ICCKE).

[11]  Guanfeng Liu,et al.  Evaluation of nine heuristic algorithms with data-intensive jobs and computing-intensive jobs in a dynamic environment , 2015, IET Softw..

[12]  Manpreet Singh,et al.  JS2DR2: An Effective Two-Level Job Scheduling Algorithm and Two-Phase Dynamic Replication Strategy for Data Grid , 2009, 2009 International Conference on Advances in Computing, Control, and Telecommunication Technologies.

[13]  Luciano Serafini,et al.  Towards an Economy-Based Optimisation of File Access and Replication on a Data Grid , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[14]  K. Kousalya,et al.  An Enhanced Adaptive Scoring Job Scheduling algorithm for minimizing job failure in heterogeneous grid network , 2014, 2014 International Conference on Recent Trends in Information Technology.

[15]  Lathigara Amit Maheshbhai Job scheduling based on reliability, time and cost constraints under Grid environment , 2011, 2011 Nirma University International Conference on Engineering.

[16]  Hanene Chettaoui,et al.  An Efficient Replication Strategy for Dynamic Data Grids , 2010, 2010 International Conference on P2P, Parallel, Grid, Cloud and Internet Computing.

[17]  Pangfeng Liu,et al.  Data-bandwidth-aware Job Scheduling in Grid and Cluster Environments , 2009, 2009 15th International Conference on Parallel and Distributed Systems.

[18]  Hafida Belbachir,et al.  Dynamic threshold for replicas placement strategy , 2010, 2010 International Conference on Machine and Web Intelligence.

[19]  Fábio Coutinho,et al.  GGreen: A Greedy Energy-Aware Scheduling Algorithm on Grid Systems , 2014, 2014 IEEE 17th International Conference on Computational Science and Engineering.

[20]  Ibm Redbooks Introduction to Grid Computing With Globus , 2003 .

[21]  Hanene Chettaoui,et al.  A Decentralized Periodic Replication Strategy Based on Knapsack Problem , 2012, 2012 ACM/IEEE 13th International Conference on Grid Computing.

[22]  Soonwook Hwang,et al.  Improvement of Data Grid's Performance by Combining Job Scheduling with Dynamic Replication Strategy , 2007, Sixth International Conference on Grid and Cooperative Computing (GCC 2007).

[23]  Xinyu Shao,et al.  A comparative analysis of job scheduling algorithm , 2011, MSIE 2011.

[24]  Sanjeev K. Aggarwal,et al.  Multi-objective Evolution Based Dynamic Job Scheduler in Grid , 2014, 2014 Eighth International Conference on Complex, Intelligent and Software Intensive Systems.

[25]  Floriano Zini,et al.  Evaluating scheduling and replica optimisation strategies in OptorSim , 2003, Proceedings. First Latin American Web Congress.

[26]  Quan Liu,et al.  Grouping-Based Fine-Grained Job Scheduling in Grid Computing , 2009, 2009 First International Workshop on Education Technology and Computer Science.

[27]  Yaw-Ling Lin,et al.  Performance issues of grid computing based on different architecture cluster computing platforms , 2005, 19th International Conference on Advanced Information Networking and Applications (AINA'05) Volume 1 (AINA papers).

[28]  Gao Gai-Mei,et al.  Design and Simulation of Dynamic Replication Strategy for the Data Grid , 2012, 2012 International Conference on Industrial Control and Electronics Engineering.

[29]  Tao Xie,et al.  FIRE: A File Reunion Based Data Replication Strategy for Data Grids , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[30]  Wuu-Yee Chen,et al.  Job Schedule Model Based on Grid Environment , 2007, 2009 Fifth International Joint Conference on INC, IMS and IDC.

[31]  Ruay-Shiung Chang,et al.  A dynamic weighted data replication strategy in data grids , 2008, 2008 IEEE/ACS International Conference on Computer Systems and Applications.

[32]  Albert Y. Zomaya,et al.  Intelligent scheduling and replication: a synergistic approach , 2009 .

[33]  M. Zarina,et al.  Job scheduling for dynamic data replication strategy in heterogeneous federation data grid systems , 2013, 2013 Second International Conference on Informatics & Applications (ICIA).

[34]  Hamidah Ibrahim,et al.  A new load balancing scheduling model in data grid application , 2008, 2008 International Symposium on Information Technology.

[35]  Manoj Kumar Mishra,et al.  An analysis of various job scheduling strategies in grid computing , 2010, 2010 2nd International Conference on Signal Processing Systems.

[36]  Wei Du,et al.  Reputation-Aware Scheduling for Storage Systems in Data Grids , 2009, 2009 Fourth International Conference on Frontier of Computer Science and Technology.