Accurately Simulating Energy Consumption of I/O-Intensive Scientific Workflows

While distributed computing infrastructures can provide infrastructure-level techniques for managing energy consumption, application-level energy consumption models have also been developed to support energy-efficient scheduling and resource provisioning algorithms. In this work, we analyze the accuracy of a widely-used application-level model that have been developed and used in the context of scientific workflow executions. To this end, we profile two production scientific workflows on a distributed platform instrumented with power meters. We then conduct an analysis of power and energy consumption measurements. This analysis shows that power consumption is not linearly related to CPU utilization and that I/O operations significantly impact power, and thus energy, consumption. We then propose a power consumption model that accounts for I/O operations, including the impact of waiting for these operations to complete, and for concurrent task executions on multi-socket, multi-core compute nodes. We implement our proposed model as part of a simulator that allows us to draw direct comparisons between real-world and modeled power and energy consumption. We find that our model has high accuracy when compared to real-world executions. Furthermore, our model improves accuracy by about two orders of magnitude when compared to the traditional models used in the energy-efficient workflow scheduling literature.

[1]  Tomoya Enokido,et al.  An Integrated Power Consumption Model for Distributed Systems , 2013, IEEE Transactions on Industrial Electronics.

[2]  Rodney S. Tucker,et al.  Green Cloud Computing: Balancing Energy in Processing, Storage, and Transport , 2011, Proceedings of the IEEE.

[3]  Albert Y. Zomaya,et al.  Energy-aware parallel task scheduling in a cluster , 2013, Future Gener. Comput. Syst..

[4]  Hai Jin,et al.  Performance and energy modeling for live migration of virtual machines , 2011, Cluster Computing.

[5]  Tomoya Enokido,et al.  An Extended Power Consumption Model for Distributed Applications , 2012, 2012 IEEE 26th International Conference on Advanced Information Networking and Applications.

[6]  Ann L. Chervenak,et al.  Characterizing and profiling scientific workflows , 2013, Future Gener. Comput. Syst..

[7]  Shantenu Jha,et al.  The anatomy of successful ECSS projects: lessons of supporting high-throughput high-performance ensembles on XSEDE , 2012, XSEDE '12.

[8]  Dzmitry Kliazovich,et al.  DENS: data center energy-efficient network-aware scheduling , 2010, Cluster Computing.

[9]  Rizos Sakellariou,et al.  Energy-Aware Workflow Scheduling Using Frequency Scaling , 2014, 2014 43rd International Conference on Parallel Processing Workshops.

[10]  Laurent Lefèvre,et al.  A survey on techniques for improving the energy efficiency of large-scale distributed systems , 2014, ACM Comput. Surv..

[11]  Miron Livny,et al.  Pegasus, a workflow management system for science automation , 2015, Future Gener. Comput. Syst..

[12]  Manojit Ghose,et al.  Energy Efficient Scheduling of Scientific Workflows in Cloud Environment , 2017, 2017 IEEE 19th International Conference on High Performance Computing and Communications; IEEE 15th International Conference on Smart City; IEEE 3rd International Conference on Data Science and Systems (HPCC/SmartCity/DSS).

[13]  Rizos Sakellariou,et al.  Workflow Scheduling on Power Constrained VMs , 2015, 2015 IEEE/ACM 8th International Conference on Utility and Cloud Computing (UCC).

[14]  Jiming Chen,et al.  Energy-Efficient Capture of Stochastic Events under Periodic Network Coverage and Coordinated Sleep , 2012, IEEE Transactions on Parallel and Distributed Systems.

[15]  Bin Luo,et al.  Cost and Energy Aware Scheduling Algorithm for Scientific Workflows with Deadline Constraint in Clouds , 2018, IEEE Transactions on Services Computing.

[16]  Douglas Thain,et al.  Practical Resource Monitoring for Robust High Throughput Computing , 2015, 2015 IEEE International Conference on Cluster Computing.

[17]  Albert Y. Zomaya,et al.  A Taxonomy and Survey of Energy-Efficient Data Centers and Cloud Computing Systems , 2010, Adv. Comput..

[18]  Albert Y. Zomaya,et al.  Energy efficient utilization of resources in cloud computing systems , 2010, The Journal of Supercomputing.

[19]  Christine Morin,et al.  Energy Consumption Models and Predictions for Large-Scale Systems , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.

[20]  Henri Casanova,et al.  WRENCH: A Framework for Simulating Workflow Management Systems , 2018, 2018 IEEE/ACM Workflows in Support of Large-Scale Science (WORKS).

[21]  Xiao Liu,et al.  Soft error-aware energy-efficient task scheduling for workflow applications in DVFS-enabled cloud , 2018, J. Syst. Archit..

[22]  Rajkumar Buyya,et al.  Energy-aware simulation with DVFS , 2013, Simul. Model. Pract. Theory.

[23]  Miron Livny,et al.  Online Task Resource Consumption Prediction for Scientific Workflows , 2015, Parallel Process. Lett..

[24]  Emmanuel Jeannot,et al.  Adding Virtualization Capabilities to the Grid'5000 Testbed , 2012, CLOSER.

[25]  Laurent Lefèvre,et al.  Towards Energy Aware Reservation Infrastructure for Large-Scale Experimental Distributed Systems , 2009, Parallel Process. Lett..