Characterizing and Modeling Power and Energy for Extreme-Scale In-Situ Visualization

Plans for exascale computing have identified power and energy as looming problems for simulations running at that scale. In particular, writing to disk all the data generated by these simulations is becoming prohibitively expensive due to the energy consumption of the supercomputer while it idles waiting for data to be written to permanent storage. In addition, the power cost of data movement is also steadily increasing. A solution to this problem is to write only a small fraction of the data generated while still maintaining the cognitive fidelity of the visualization. With domain scientists increasingly amenable towards adopting an in-situ framework that can identify and extract valuable data from extremely large simulation results and write them to permanent storage as compact images, a large-scale simulation will commit to disk a reduced dataset of data extracts that will be much smaller than the raw results, resulting in a savings in both power and energy. The goal of this paper is two-fold: (i) to understand the role of in-situ techniques in combating power and energy issues of extreme-scale visualization and (ii) to create a model for performance, power, energy, and storage to facilitate what-if analysis. Our experiments on a specially instrumented, dedicated 150-node cluster show that while it is difficult to achieve power savings in practice using in-situ techniques, applications can achieve significant energy savings due to shorter write times for in-situ visualization. We present a characterization of power and energy for in-situ visualization; an application-aware, architecturespecific methodology for modeling and analysis of such in-situ workflows; and results that uncover indirect power savings in visualization workflows for high-performance computing (HPC).

[1]  Kwan-Liu Ma Runtime volume visualization for parallel CFD , 1995 .

[2]  Robert Haimes,et al.  Concurrent distributed visualization and solution steering , 1996 .

[3]  Bernd Hamann,et al.  ProteinShop: A tool for interactive protein manipulation and steering , 2004, J. Comput. Aided Mol. Des..

[4]  David Maier,et al.  From databases to dataspaces: a new abstraction for information management , 2005, SGMD.

[5]  Karsten Schwan,et al.  Flexible IO and integration for scientific codes through the adaptable IO system (ADIOS) , 2008, CLADE '08.

[6]  John Shalf,et al.  Exascale Computing Technology Challenges , 2010, VECPAR.

[7]  Ray W. Grout,et al.  Ultrascale Visualization In Situ Visualization for Large-Scale Combustion Simulations , 2010 .

[8]  Jeremy S. Meredith,et al.  Parallel in situ coupling of simulation with a fully featured visualization system , 2011, EGPGV '11.

[9]  Kesheng Wu,et al.  Scientific Discovery at the Exascale , 2011 .

[10]  Michael E. Papka,et al.  Toward simulation-time data analysis and I/O acceleration on leadership-class systems , 2011, 2011 IEEE Symposium on Large Data Analysis and Visualization.

[11]  Kenneth Moreland,et al.  Sandia National Laboratories , 2000 .

[12]  Fan Zhang,et al.  Combining in-situ and in-transit processing to enable extreme-scale scientific analysis , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[13]  Alexander Aiken,et al.  Legion: Expressing locality and independence with logical regions , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[14]  Fan Zhang,et al.  Enabling In-situ Execution of Coupled Scientific Workflow on Multi-core Platform , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium.

[15]  John Biddiscombe,et al.  Parallel Computational Steering for HPC Applications Using HDF5 Files in Distributed Shared Memory , 2012, IEEE Transactions on Visualization and Computer Graphics.

[16]  Manish Parashar,et al.  Exploring energy and performance behaviors of data-intensive scientific workflows on systems with deep memory hierarchies , 2013, 20th Annual International Conference on High Performance Computing.

[17]  Mahidhar Tatineni,et al.  In-situ visualization for global hybrid simulations , 2013, XSEDE.

[18]  Scott Pakin,et al.  Exploring power behaviors and trade-offs of in-situ data analytics , 2013, 2013 SC - International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[19]  Gokcen Kestor,et al.  Quantifying the energy cost of data movement in scientific applications , 2013, 2013 IEEE International Symposium on Workload Characterization (IISWC).

[20]  Philip W. Jones,et al.  A multi-resolution approach to global ocean modeling , 2013 .

[21]  Sabela Ramos,et al.  Exploring energy-performance-quality tradeoffs for scientific workflows with in-situ data analyses , 2014, Computer Science - Research and Development.

[22]  Torsten Wilde,et al.  A power-measurement methodology for large-scale, high-performance computing , 2014, ICPE.

[23]  James P. Ahrens,et al.  An Image-Based Approach to Extreme Scale in Situ Visualization and Analysis , 2014, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.

[24]  Wu-chun Feng,et al.  On the Greenness of In-Situ and Post-Processing Visualization Pipelines , 2015, 2015 IEEE International Parallel and Distributed Processing Symposium Workshop.

[25]  James P. Ahrens Increasing Scientific Data Insights about Exascale Class Simulations under Power and Storage Constraints , 2015, IEEE Computer Graphics and Applications.

[26]  Hans Hagen,et al.  In Situ Eddy Analysis in a High-Resolution Ocean Climate Model , 2016, IEEE Transactions on Visualization and Computer Graphics.

[27]  Frank Mueller,et al.  Power tuning HPC jobs on power-constrained systems , 2016, 2016 International Conference on Parallel Architecture and Compilation Techniques (PACT).

[28]  Valerio Pascucci,et al.  Evaluation of In-Situ Analysis Strategies at Scale for Power Efficiency and Scalability , 2016, 2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid).

[29]  Michael Lang,et al.  Power usage of production supercomputers and production workloads , 2016, Concurr. Comput. Pract. Exp..