An Ant-Colony-Optimization Based Approach for Determination of Parameter Significance of Scientific Workflows

In the process of a scientific experiment a workflow is executed multiple times using various values of the parameters of activities. For real-world workflows that may contain hundreds of activities, each having several parameters, it is practically not feasible to conduct a parameter sensitivity study by simply following a ”brute-force approach” (that is experimental evaluation of all possible cases). We believe that a heuristic-guided approach enables to find a near-optimal solution using a reasonable amount of resources without the need for the evaluation of all possibilities. In this paper we present a novel methodology for determination of parameter significance of scientific workflows that is based on Ant Colony Optimization (ACO). We refer to our methodology, which is a customization of ACO for Parameter Significance determination, as ACO4PS. We use ACO4PS to identify (1) which parameter strongly affects the overall result of the workflow and (2) for which combination of parameter values we obtain the expected result. ACO4PS generates a list of all workflow parameters sorted by significance as well as is capable of generating a subset of significant parameters. We empirically evaluate our methodology using a real-world scientific workflow that deals with the Non-Invasive Glucose Measurement.

[1]  Lennart Ljung,et al.  System Identification: Theory for the User , 1987 .

[2]  N. Biggs THE TRAVELING SALESMAN PROBLEM A Guided Tour of Combinatorial Optimization , 1986 .

[3]  Manuel López-Ibáñez,et al.  Ant colony optimization , 2010, GECCO '10.

[4]  Luca Maria Gambardella,et al.  MACS-VRPTW: a multiple ant colony system for vehicle routing problems with time windows , 1999 .

[5]  Luca Maria Gambardella,et al.  HAS-SOP: Hybrid Ant System for the Sequential Ordering Problem , 1997 .

[6]  Frank Leymann,et al.  Modeling Stateful Resources with Web Services , 2004 .

[7]  Marco Dorigo,et al.  Distributed Optimization by Ant Colonies , 1992 .

[8]  Fakhri Alam Khan,et al.  Estimation of Parameters Sensitivity for Scientific Workflows , 2009, 2009 International Conference on Parallel Processing Workshops.

[9]  S V Zwaan,et al.  ANT COLONY OPTIMISATION FOR JOB SHOP SCHEDULING , 1998 .

[10]  Ivan Janciak,et al.  Advanced Data Mining and Integration Research for Europe , 2009 .

[11]  Fakhri Alam Khan,et al.  Grid-Enabled Non-Invasive Blood Glucose Measurement , 2008, ICCS.

[12]  Rajkumar Buyya,et al.  A Linear Programming Driven Genetic Algorithm for Meta-Scheduling on Utility Grids , 2008, 2008 16th International Conference on Advanced Computing and Communications.

[13]  Ivona Brandic,et al.  Specification, Planning, and Execution of QoS‐Aware Grid Workflows , 2009 .

[14]  Dennis Gannon,et al.  Workflows for e-Science, Scientific Workflows for Grids , 2014 .

[15]  Giovanni Maciocia CAc The Foundations of Chinese Medicine: A Comprehensive Text for Acupuncturists and Herbalists , 1989 .

[16]  W. Gutjahr On the Finite-Time Dynamics of Ant Colony Optimization , 2006 .

[17]  Christos H. Papadimitriou,et al.  Local Search for the Asymmetric Traveling Salesman Problem , 1980, Oper. Res..

[18]  Luca Maria Gambardella,et al.  Solving symmetric and asymmetric TSPs by ant colonies , 1996, Proceedings of IEEE International Conference on Evolutionary Computation.

[19]  Dorit S. Hochba,et al.  Approximation Algorithms for NP-Hard Problems , 1997, SIGA.

[20]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[21]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[22]  Radu Prodan,et al.  Dynamic scheduling of scientific workflow applications on the grid: a case study , 2005, SAC '05.

[23]  P Bourgine,et al.  Towards a Practice of Autonomous Systems , 1992 .

[24]  Fakhri Alam Khan,et al.  Provenance Support for Grid-Enabled Scientific Workflows , 2008, 2008 Fourth International Conference on Semantics, Knowledge and Grid.

[25]  Marc Gravel,et al.  Comparing an ACO algorithm with other heuristics for the single machine scheduling problem with sequence-dependent setup times , 2002, J. Oper. Res. Soc..

[26]  Naresh K. Sinha,et al.  System identification - Theory for the user : Lennart Ljung , 1989, Autom..

[27]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[28]  Ivan Janciak,et al.  Workflow enactment engine for WSRF-compliant services orchestration , 2008, 2008 9th IEEE/ACM International Conference on Grid Computing.

[29]  Plamenka Borovska,et al.  Comparison of parallel metaheuristics for solving the TSP , 2008, CompSysTech.