The READEX formalism for automatic tuning for energy efficiency

Energy efficiency is an important aspect of future exascale systems, mainly due to rising energy cost. Although High performance computing (HPC) applications are compute centric, they still exhibit varying computational characteristics in different regions of the program, such as compute-, memory-, and I/O-bound code regions. Some of today’s clusters already offer mechanisms to adjust the system to the resource requirements of an application, e.g., by controlling the CPU frequency. However, manually tuning for improved energy efficiency is a tedious and painstaking task that is often neglected by application developers. The European Union’s Horizon 2020 project READEX (Runtime Exploitation of Application Dynamism for Energy-efficient eXascale computing) aims at developing a tools-aided approach for improved energy efficiency of current and future HPC applications. To reach this goal, the READEX project combines technologies from two ends of the compute spectrum, embedded systems and HPC, constituting a split design-time/runtime methodology. From the HPC domain, the Periscope Tuning Framework (PTF) is extended to perform dynamic auto-tuning of fine-grained application regions using the systems scenario methodology, which was originally developed for improving the energy efficiency in embedded systems. This paper introduces the concepts of the READEX project, its envisioned implementation, and preliminary results that demonstrate the feasibility of this approach.

[1]  Emilio Luque,et al.  Modeling Master/Worker applications for automatic performance tuning , 2006, Parallel Comput..

[2]  Alena Vasatová,et al.  Solving Contact Mechanics Problems with PERMON , 2015, HPCSE.

[3]  Ananta Tiwari,et al.  Online Adaptive Code Generation and Tuning , 2011, 2011 IEEE International Parallel & Distributed Processing Symposium.

[4]  Dirk Schmidl,et al.  Score-P: A Joint Performance Measurement Run-Time Infrastructure for Periscope, Scalasca, TAU, and Vampir , 2011, Parallel Tools Workshop.

[5]  Alejandro Duran,et al.  Ompss: a Proposal for Programming Heterogeneous Multi-Core Architectures , 2011, Parallel Process. Lett..

[6]  Robert Schöne,et al.  Integrating performance analysis and energy efficiency optimizations in a unified environment , 2013, Computer Science - Research and Development.

[7]  Henk Corporaal,et al.  System-scenario-based design of dynamic embedded systems , 2009, TODE.

[8]  Luca Benini,et al.  The ANTAREX approach to autotuning and adaptivity for energy efficient HPC systems , 2016, Conf. Computing Frontiers.

[9]  Francky Catthoor,et al.  Exploration of energy efficient memory organisations for dynamic multimedia applications using system scenarios , 2013, Des. Autom. Embed. Syst..

[10]  Chun Chen,et al.  A scalable auto-tuning framework for compiler optimization , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[11]  Thomas Ilsche,et al.  The shift from processor power consumption to performance variations: fundamental implications at scale , 2016, Computer Science - Research and Development.

[12]  Ondrej Meca,et al.  Massively Parallel Hybrid Total FETI (HTFETI) Solver , 2016, PASC.

[13]  Wolfgang E. Nagel,et al.  Extending the Functionality of Score-P through Plugins: Interfaces and Use Cases , 2017 .

[14]  Franz Franchetti,et al.  Automatic Application Tuning for HPC Architectures (Dagstuhl Seminar 13401) , 2013, Dagstuhl Reports.

[15]  Allen D. Malony,et al.  Event-based performance perturbation: a case study , 1991, PPOPP '91.

[16]  Wolfgang E. Nagel,et al.  HDEEM: High Definition Energy Efficiency Monitoring , 2014, 2014 Energy Efficient Supercomputing Workshop.

[17]  Cédric Augonnet,et al.  PEPPHER: Efficient and Productive Usage of Hybrid Computing Systems , 2011, IEEE Micro.