ELASTIC: A large scale dynamic tuning environment

The spectacular growth in the number of cores in current supercomputers poses design challenges for the development of performance analysis and tuning tools. To be effective, such analysis and tuning tools must be scalable and be able to manage the dynamic behaviour of parallel applications. In this work, we present ELASTIC, an environment for dynamic tuning of large- scale parallel applications. To be scalable, the architecture of ELASTIC takes the form of a hierarchical tuning network of nodes that perform a distributed analysis and tuning process. Moreover, the tuning network topology can be configured to adapt itself to the size of the parallel application. To guide the dynamic tuning process, ELASTIC supports a plugin architecture. These plugins, called ELASTIC packages, allow the integration of different tuning strategies into ELASTIC. We also present experimental tests conducted using ELASTIC, showing its effectiveness to improve the performance of large-scale parallel applications.

[1]  B.P. Miller,et al.  MRNet: A Software-Based Multicast/Reduction Network for Scalable Tools , 2003, ACM/IEEE SC 2003 Conference (SC'03).

[2]  Jeffrey K. Hollingsworth,et al.  An API for Runtime Code Patching , 2000, Int. J. High Perform. Comput. Appl..

[3]  Anna Sikora,et al.  How to Scale Dynamic Tuning to Large Parallel Applications , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.

[4]  G D Riley,et al.  Towards performance control on the Grid , 2005, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[5]  Michael Gerndt,et al.  PERISCOPE: An Online-Based Distributed Performance Analysis Tool , 2009, Parallel Tools Workshop.

[6]  Anna Sikora,et al.  How to Determine the Topology of Hierarchical Tuning Networks for Dynamic Auto-Tuning in Large-Scale Systems , 2013, ICCS.

[7]  Anna Sikora,et al.  AutoTune: A Plugin-Driven Approach to the Automatic Tuning of Parallel Applications , 2012, PARA.

[8]  Daniel A. Reed,et al.  The Autopilot Performance-Directed Adaptive Control System , 1997 .

[9]  Tomàs Margalef,et al.  Design and implementation of a dynamic tuning environment , 2007, J. Parallel Distributed Comput..

[10]  Andrea Martínez Trujillo Dynamic Tuning for Large-Scale Parallel Applications , 2013 .

[11]  Allen D. Malony,et al.  The Tau Parallel Performance System , 2006, Int. J. High Perform. Comput. Appl..

[12]  I-Hsin Chung,et al.  Active Harmony: Towards Automated Performance Tuning , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[13]  Brian J. N. Wylie,et al.  Performance measurement and analysis tools for extremely scalable systems , 2010, ISC 2010.