A hybrid genetic algorithm for optimization of scheduling workflow applications in heterogeneous computing systems

Workflow scheduling is a key component behind the process for an optimal workflow enactment. It is a well-known NP-hard problem and is more challenging in the heterogeneous computing environment. The increasing complexity of the workflow applications is forcing researchers to explore hybrid approaches to solve the workflow scheduling problem. The performance of genetic algorithms can be enhanced by the modification in genetic operators and involving an efficient heuristic. These features are incorporated in the proposed Hybrid Genetic Algorithm (HGA). A solution obtained from a heuristic is seeded in the initial population that provides a direction to reach an optimal (makespan)solution. The modified two fold genetic operators search rigorously and converge the algorithm at the best solution in less amount of time. This is proved to be the strength of the HGA in the optimization of fundamental objective (makespan) of scheduling. The proposed algorithm also optimizes the load balancing during the execution side to utilize resources at maximum. The performance of the proposed algorithm is analyzed by using synthesized datasets, and real-world application workflows. The HGA is evaluated by comparing the results with renowned and state of the art algorithms. The experimental results validate that the HGA outperforms these approaches and provides quality schedules with less makespans. Proposed HGA (hybrid GA) to schedule workflow in heterogeneous environment.Simulation results are presented with synthesized and real-world workflows.HGA identifies lesser length of the workflows.HGA also improves load balancing.Significant improvement in schedule lengths as compared to existing work.

[1]  Óscar Corcho,et al.  Data-intensive architecture for scientific knowledge discovery , 2012, Distributed and Parallel Databases.

[2]  Kenli Li,et al.  A DAG scheduling scheme on heterogeneous computing systems using double molecular structure-based chemical reaction optimization , 2013, J. Parallel Distributed Comput..

[3]  Thorsten Meinl,et al.  KNIME - the Konstanz information miner: version 2.0 and beyond , 2009, SKDD.

[4]  Stefano Giordani,et al.  A fast metaheuristic for scheduling independent tasks with multiple modes , 2009, Comput. Ind. Eng..

[5]  Rajkumar Buyya,et al.  A taxonomy of scientific workflow systems for grid computing , 2005, SGMD.

[6]  Hamid Arabnejad,et al.  List Scheduling Algorithm for Heterogeneous Systems by an Optimistic Cost Table , 2014, IEEE Transactions on Parallel and Distributed Systems.

[7]  Frumkin,et al.  NAS Grid Benchmarks Version 1.0 , 2002 .

[8]  Daniel S. Katz,et al.  Swift: A language for distributed parallel scripting , 2011, Parallel Comput..

[9]  Nawwaf N. Kharma,et al.  A hybrid heuristic-genetic algorithm for task scheduling in heterogeneous processor networks , 2011, J. Parallel Distributed Comput..

[10]  Daniel S. Katz,et al.  Pegasus: A framework for mapping complex scientific workflows onto distributed systems , 2005, Sci. Program..

[11]  Albert Y. Zomaya,et al.  A Bee Colony based optimization approach for simultaneous job scheduling and data replication in grid environments , 2013, Comput. Oper. Res..

[12]  Ehsan Ullah Munir,et al.  SDBATS: A Novel Algorithm for Task Scheduling in Heterogeneous Computing Systems , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.

[13]  Homayun Motameni,et al.  Task scheduling using NSGA II with fuzzy adaptive operators for computational grids , 2014, J. Parallel Distributed Comput..

[14]  Ian J. Taylor,et al.  Workflows and e-Science: An overview of workflow system features and capabilities , 2009, Future Gener. Comput. Syst..

[15]  Bertram Ludäscher,et al.  Scientific workflow management and the Kepler system: Research Articles , 2006 .

[16]  Ann L. Chervenak,et al.  Characterizing and profiling scientific workflows , 2013, Future Gener. Comput. Syst..

[17]  Kenli Li,et al.  A genetic algorithm for task scheduling on heterogeneous computing systems using multiple priority queues , 2014, Inf. Sci..

[18]  Moacir Godinho Filho,et al.  Literature review regarding Ant Colony Optimization applied to scheduling problems: Guidelines for implementation and directions for future research , 2013, Eng. Appl. Artif. Intell..

[19]  David E. Smith,et al.  Integrating Policy with Scientific Workflow Management for Data-Intensive Applications , 2012, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[20]  Carole A. Goble,et al.  Taverna: a tool for building and running workflows of services , 2006, Nucleic Acids Res..

[21]  Ehsan Ullah Munir,et al.  PEGA: A Performance Effective Genetic Algorithm for Task Scheduling in Heterogeneous Systems , 2012, 2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems.

[22]  Edward A. Lee,et al.  Scientific workflow management and the Kepler system , 2006, Concurr. Comput. Pract. Exp..

[23]  Peter Brezany,et al.  The Data Bonanza: Improving Knowledge Discovery in Science, Engineering, and Business , 2013 .

[24]  Salim Hariri,et al.  Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing , 2002, IEEE Trans. Parallel Distributed Syst..

[25]  Yogesh L. Simmhan,et al.  The Trident Scientific Workflow Workbench , 2008, 2008 IEEE Fourth International Conference on eScience.

[26]  Chuan Wang,et al.  A Hybrid Heuristic-Genetic Algorithm for Task Scheduling in Heterogeneous Multi-core System , 2012, ICA3PP.