Pewss: A platform of extensible workflow simulation service for workflow scheduling research

This paper presents a Platform of Extensible Workflow Simulation Service (Pewss), which we have developed to provide a cloud service for aiding research work in workflow scheduling. The simulation has been a major tool for performance evaluation and comparison in workflow scheduling research. However, researchers usually have to develop their own simulation programs with limited functionality, simply outputting summarized performance results. Pewss has been developed for easing and improving current practices in conducting performance simulations during studying of existing workflow scheduling algorithms or designing of new scheduling algorithms. Pewss has been designed based on the Software as a Service (SaaS) model, adopting a multiuser Web‐based client/server architecture. Conducting simulation experiments on Pewss, researchers simply have to implement the scheduling algorithm under study instead of a whole simulation environment, allowing them to focus on their research work without spending unnecessary efforts on the simulation implementation details. Pewss provides the visualization of a workflow execution schedule based on simulation results, offering a convenient way for researchers to gain an insight into the effectiveness, characteristics, and performance bottleneck of scheduling algorithms. As a multiuser environment, Pewss also provides functionality for researchers to facilitate comparative performance analysis and collaborative research works effectively. Pewss has been used in our research work on task‐parallel workflow scheduling and has been planned to be extended to support other types of workflow scheduling research problems, eg, mixed‐parallel workflows.

[1]  Roy Fielding,et al.  Architectural Styles and the Design of Network-based Software Architectures"; Doctoral dissertation , 2000 .

[2]  Kuo-Chan Huang,et al.  Adaptive dual-criteria task group allocation for clustering-based multi-workflow scheduling on parallel computing platform , 2015, The Journal of Supercomputing.

[3]  Joshua R. Smith,et al.  LIGO: The laser interferometer gravitational-wave observatory , 2006, QELS 2006.

[4]  Kuo-Chan Huang,et al.  Scheduling online mixed-parallel workflows of rigid tasks in heterogeneous multi-cluster environments , 2016, Future Gener. Comput. Syst..

[5]  Rajkumar Buyya,et al.  ContainerCloudSim: An environment for modeling and simulation of containers in cloud data centers , 2017, Softw. Pract. Exp..

[6]  Rajkumar Buyya,et al.  iFogSim: A toolkit for modeling and simulation of resource management techniques in the Internet of Things, Edge and Fog computing environments , 2016, Softw. Pract. Exp..

[7]  Salim Hariri,et al.  Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing , 2002, IEEE Trans. Parallel Distributed Syst..

[8]  Helen D. Karatza,et al.  Scheduling multiple task graphs in heterogeneous distributed real-time systems by exploiting schedule holes with bin packing techniques , 2011, Simul. Model. Pract. Theory.

[9]  Hamid Arabnejad,et al.  List Scheduling Algorithm for Heterogeneous Systems by an Optimistic Cost Table , 2014, IEEE Transactions on Parallel and Distributed Systems.

[10]  Kuo-Chan Huang,et al.  An iterative expanding and shrinking process for processor allocation in mixed-parallel workflow scheduling , 2016, SpringerPlus.

[11]  Michael Pinedo,et al.  Scheduling: Theory, Algorithms, and Systems , 1994 .

[12]  Dong Zhou,et al.  Translation techniques in cross-language information retrieval , 2012, CSUR.

[13]  Kuo-Chan Huang,et al.  Task Ranking and Allocation Heuristics for Efficient Workflow Schedules , 2016, 2016 International Computer Symposium (ICS).

[14]  Kuo-Chan Huang,et al.  Moldable Job Scheduling for HPC as a Service , 2014 .

[15]  George S. Fishman,et al.  Discrete-Event Simulation : Modeling, Programming, and Analysis , 2001 .

[16]  RobertYves,et al.  A survey of pipelined workflow scheduling , 2013 .

[17]  Li Zhao,et al.  SCEC CyberShake Workflows - Automating Probabilistic Seismic Hazard Analysis Calculations , 2007, Workflows for e-Science, Scientific Workflows for Grids.

[18]  Kuo-Chan Huang,et al.  Revenue maximisation for scheduling deadline-constrained mouldable jobs on high performance computing as a service platforms , 2018, Int. J. High Perform. Comput. Netw..

[19]  Rizos Sakellariou,et al.  DAG Scheduling Using a Lookahead Variant of the Heterogeneous Earliest Finish Time Algorithm , 2010, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing.

[20]  Rajkumar Buyya,et al.  CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms , 2011, Softw. Pract. Exp..

[21]  Bharat Jayaraman,et al.  Compact visualization of Java program execution , 2017, Softw. Pract. Exp..

[22]  L. F. Bittencourt,et al.  A Path Clustering Heuristic for Scheduling Task Graphs onto a Grid , 2013 .

[23]  Ewa Deelman,et al.  WorkflowSim: A toolkit for simulating scientific workflows in distributed environments , 2012, 2012 IEEE 8th International Conference on E-Science.

[24]  George S. Fishman,et al.  Discrete-event simulation , 2001 .

[25]  Kuo-Chan Huang,et al.  Task ranking and allocation in list-based workflow scheduling on parallel computing platform , 2014, The Journal of Supercomputing.

[26]  Thomas Fahringer,et al.  Predicting Workflow Task Execution Time in the Cloud Using A Two-Stage Machine Learning Approach , 2020, IEEE Transactions on Cloud Computing.

[27]  Kuo-Chan Huang,et al.  Task Clustering Heuristics for Efficient Execution Time Reduction in Workflow Scheduling , 2017 .

[28]  Ann L. Chervenak,et al.  Characterizing and profiling scientific workflows , 2013, Future Gener. Comput. Syst..

[29]  Ewa Deelman,et al.  Community Resources for Enabling Research in Distributed Scientific Workflows , 2014, 2014 IEEE 10th International Conference on e-Science.

[30]  Jan Sedivý,et al.  Modifying CloudSim to accurately simulate interactive services for cloud autoscaling , 2017, Concurr. Comput. Pract. Exp..

[31]  Kuo-Chan Huang,et al.  Online scheduling of workflow applications in grid environments , 2011, Future Gener. Comput. Syst..

[32]  Luiz Fernando Bittencourt,et al.  Towards the Scheduling of Multiple Workflows on Computational Grids , 2010, Journal of Grid Computing.

[33]  Yves Robert,et al.  A survey of pipelined workflow scheduling: Models and algorithms , 2013, CSUR.