HSGA: a hybrid heuristic algorithm for workflow scheduling in cloud systems

In heterogeneous distributed computing systems like cloud computing, the problem of mapping tasks to resources is a major issue which can have much impact on system performance. For some reasons such as heterogeneous and dynamic features and the dependencies among requests, task scheduling is known to be a NP-complete problem.In this paper, we proposed a hybrid heuristic method (HSGA) to find a suitable scheduling for workflow graph, based on genetic algorithm in order to obtain the response quickly moreover optimizes makespan, load balancing on resources and speedup ratio.At first, the HSGA algorithm makes tasks prioritization in complex graph considering their impact on others, based on graph topology. This technique is efficient to reduction of completion time of application. Then, it merges Best-Fit and Round Robin methods to make an optimal initial population to obtain a good solution quickly, and apply some suitable operations such as mutation to control and lead the algorithm to optimized solution. This algorithm evaluates the solutions by considering efficient parameters in cloud environment.Finally, the proposed algorithm presents the better results with increasing number of tasks in application graph in contrast with other studied algorithms.

[1]  V. A. Kostenko,et al.  A parallel algorithm of simulated annealing for multiprocessor scheduling , 2008 .

[2]  Fatma A. Omara,et al.  Genetic algorithms for task scheduling problem , 2010, J. Parallel Distributed Comput..

[3]  Jaspal Singh Efficient Tasks scheduling for heterogeneous multiprocessor using Genetic algorithm with Node duplication , 2011 .

[4]  Sabine Van Huffel,et al.  On the Design of a Web-Based Decision Support System for Brain Tumour Diagnosis Using Distributed Agents , 2006, 2006 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology Workshops.

[5]  Richard P. Brent,et al.  Efficient implementation of the first-fit strategy for dynamic storage allocation , 1989, TOPL.

[6]  Albert Y. Zomaya,et al.  Author manuscript, published in "Journal of Parallel and Distributed Computing (2011)" A Parallel Bi-objective Hybrid Metaheuristic for Energy-aware Scheduling for Cloud Computing Systems , 2011 .

[7]  Bo Zhang,et al.  Research on the Resource Monitoring Model Under Cloud Computing Environment , 2010, WISM.

[8]  Eui-in Choi,et al.  A Scheduling Middleware for Data Intensive Applications on a Grid , 2006, KES.

[9]  Arash Ghorbannia Delavar,et al.  A Synthetic Heuristic Algorithm for Independent Task Scheduling in Cloud Systems , 2011 .

[10]  Kenli Li,et al.  Reliability-aware scheduling strategy for heterogeneous distributed computing systems , 2010, J. Parallel Distributed Comput..

[11]  Ewa Deelman,et al.  The cost of doing science on the cloud: the Montage example , 2008, HiPC 2008.

[12]  Rajkumar Buyya,et al.  Workflow scheduling algorithms for grid computing , 2008 .

[13]  Henri Casanova,et al.  On cluster resource allocation for multiple parallel task graphs , 2010, J. Parallel Distributed Comput..

[14]  Kenli Li,et al.  List scheduling with duplication for heterogeneous computing systems , 2010, J. Parallel Distributed Comput..

[15]  Adnan Fida,et al.  Workflow scheduling for service oriented cloud computing , 2008 .

[16]  Meikang Qiu,et al.  Feedback Dynamic Algorithms for Preemptable Job Scheduling in Cloud Systems , 2010 .

[17]  Rajkumar Buyya,et al.  Optimizing the makespan and reliability for workflow applications with reputation and a look-ahead genetic algorithm , 2011, Future Gener. Comput. Syst..

[18]  C. Ribeiro,et al.  A Tabu Search Approach to Task Scheduling on Heterogeneous Processors under Precedence Constraints , 1995, Int. J. High Speed Comput..

[19]  Jong Hyuk Park,et al.  SPHINX: a scheduling middleware for data intensive applications on a grid , 2006, Int. J. Internet Protoc. Technol..

[20]  Sanaz Litkouhi,et al.  A Scheduling Algorithm for Increasing the Quality of the Distributed Systems by using Genetic Al g orithm , 2011 .

[21]  Richard Wolski,et al.  The Eucalyptus Open-Source Cloud-Computing System , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[22]  Myungryun Yoo,et al.  Real-time task scheduling by multiobjective genetic algorithm , 2009, J. Syst. Softw..

[23]  Rajkumar Buyya,et al.  Decentralized Overlay for Federation of Enterprise Clouds , 2008, ArXiv.

[24]  Suraj Pandey,et al.  Scheduling and management of data intensive application workflows in grid and cloud computing environments , 2010 .

[25]  Jean-Marc Nicod,et al.  A Genetic Algorithm to Schedule Workflow Collections on a SOA-Grid with Communication Costs , 2011 .