Evaluating Energy-Aware Scheduling Algorithms for I/O-Intensive Scientific Workflows

Improving energy efficiency has become necessary to enable sustainable computational science. At the same time, scientific workflows are key in facilitating distributed computing in virtually all domain sciences. As data and computational requirements increase, I/O-intensive workflows have become prevalent. In this work, we evaluate the ability of two popular energy-aware workflow scheduling algorithms to provide effective schedules for this class of workflow applications, that is, schedules that strike a good compromise between workflow execution time and energy consumption. These two algorithms make decisions based on a widely used power consumption model that simply assumes linear correlation to CPU usage. Previous work has shown this model to be inaccurate, in particular for modeling power consumption of I/O-intensive workflow executions, and has proposed an accurate model. We evaluate the effectiveness of the two aforementioned algorithms based on this accurate model. We find that, when making their decisions, these algorithms can underestimate power consumption by up to 360%, which makes it unclear how well these algorithm would fare in practice. To evaluate the benefit of using the more accurate power consumption model in practice, we propose a simple scheduling algorithm that relies on this model to balance the I/O load across the available compute resources. Experimental results show that this algorithm achieves more desirable compromises between energy consumption and workflow execution time than the two popular algorithms.

[1]  Henri Casanova,et al.  Developing accurate and scalable simulators of production workflow management systems with WRENCH , 2020, Future Gener. Comput. Syst..

[2]  Laurent Lefèvre,et al.  A survey on techniques for improving the energy efficiency of large-scale distributed systems , 2014, ACM Comput. Surv..

[3]  Bin Luo,et al.  Cost and Energy Aware Scheduling Algorithm for Scientific Workflows with Deadline Constraint in Clouds , 2018, IEEE Transactions on Services Computing.

[4]  Malcolm P. Atkinson,et al.  Using simple PID-inspired controllers for online resilient resource management of distributed scientific workflows , 2019, Future Gener. Comput. Syst..

[5]  Rizos Sakellariou,et al.  Workflow Scheduling on Power Constrained VMs , 2015, 2015 IEEE/ACM 8th International Conference on Utility and Cloud Computing (UCC).

[6]  Manojit Ghose,et al.  Energy Efficient Scheduling of Scientific Workflows in Cloud Environment , 2017, 2017 IEEE 19th International Conference on High Performance Computing and Communications; IEEE 15th International Conference on Smart City; IEEE 3rd International Conference on Data Science and Systems (HPCC/SmartCity/DSS).

[7]  Hua Wang,et al.  An energy-aware scheduling algorithm for budget-constrained scientific workflows based on multi-objective reinforcement learning , 2019, The Journal of Supercomputing.

[8]  Rizos Sakellariou,et al.  Energy-Constrained Provisioning for Scientific Workflow Ensembles , 2013, 2013 International Conference on Cloud and Green Computing.

[9]  Shantenu Jha,et al.  Next Generation Workload Management System For Big Data on Heterogeneous Distributed Computing , 2015 .

[10]  Xiaoming Chen,et al.  Delay-cost tradeoff for virtual machine migration in cloud data centers , 2017, J. Netw. Comput. Appl..

[11]  Rizos Sakellariou,et al.  A characterization of workflow management systems for extreme-scale applications , 2016, Future Gener. Comput. Syst..

[12]  Xuyun Zhang,et al.  EnReal: An Energy-Aware Resource Allocation Method for Scientific Workflow Executions in Cloud Environment , 2016, IEEE Transactions on Cloud Computing.

[13]  Ewa Deelman,et al.  WorkflowHub: Community Framework for Enabling Scientific Workflow Research and Development , 2020, 2020 IEEE/ACM Workflows in Support of Large-Scale Science (WORKS).

[14]  Henri Casanova,et al.  Characterizing, Modeling, and Accurately Simulating Power and Energy Consumption of I/O-intensive Scientific Workflows , 2020, J. Comput. Sci..

[15]  Wei Zhang,et al.  Investigation of the evaluation system of SMEs' industrial cluster management performance based on wireless network development , 2019, EURASIP J. Wirel. Commun. Netw..

[16]  Xiao Liu,et al.  Soft error-aware energy-efficient task scheduling for workflow applications in DVFS-enabled cloud , 2018, J. Syst. Archit..

[17]  Rizos Sakellariou,et al.  Energy-Aware Workflow Scheduling Using Frequency Scaling , 2014, 2014 43rd International Conference on Parallel Processing Workshops.

[18]  Miron Livny,et al.  Pegasus, a workflow management system for science automation , 2015, Future Gener. Comput. Syst..

[19]  Esther Pacitti,et al.  Data-Intensive Workflow Management: For Clouds and Data-Intensive and Scalable Computing Environments , 2019, Data-Intensive Workflow Management.

[20]  Honghao Gao,et al.  An IoT-based task scheduling optimization scheme considering the deadline and cost-aware scientific workflow for cloud computing , 2019, EURASIP Journal on Wireless Communications and Networking.

[21]  Henri Casanova,et al.  Accurately Simulating Energy Consumption of I/O-Intensive Scientific Workflows , 2019, ICCS.