Dynamic Compilation for Reducing Energy Consumption of I/O-Intensive Applications

Tera-scale high-performance computing has enabled scientists to tackle very large and computationally challenging scientific problems, making the advancement of scientific discovery at a faster pace. However, as computing scales to levels never seen before, it also becomes extremely data intensive, I/O intensive, and energy consuming. Amongst these, I/O is becoming a major bottleneck, impeding the expected pace of scientific discovery and analysis of data. Furthermore, the applications are becoming increasingly dynamic in terms of their computation patterns as well as data access patterns to cope with larger problems and data sizes. Due to the complexities of systems and applications and their high energy consumptions, it is, therefore, very important to address research issues and develop dynamic techniques at the level of run-time systems and compilers to scale I/O in the right proportions. This paper presents the details of a dynamic compilation framework developed specifically for I/O-intensive large-scale applications. Our dynamic compilation framework includes a set of powerful I/O optimizations designed to minimize execution cycles and energy consumption, and generates results that are competitive with hand-optimized codes in terms of energy consumption.

[1]  Mahmut T. Kandemir,et al.  APRIL: A Run-Time Library for Tape-Resident Data , 2000, IEEE Symposium on Mass Storage Systems.

[2]  Frederica Darema,et al.  Dynamic Data Driven Applications Systems: A New Paradigm for Application Simulations and Measurements , 2004, International Conference on Computational Science.

[3]  R. A. Coyne,et al.  The high performance storage system , 1993, Supercomputing '93.

[4]  Rakesh Krishnaiyer,et al.  PASSION: Parallel And Scalable Software for Input-Output , 1994 .

[5]  Jeffrey S. Chase,et al.  Balance of Power: Energy Management for Server Clusters , 2001 .

[6]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[7]  Rajeev Thakur,et al.  Passion: Optimized I/O for Parallel Applications , 1996, Computer.

[8]  Marianne Winslett,et al.  Server-Directed Collective I/O in Panda , 1995, Proceedings of the IEEE/ACM SC95 Conference.

[9]  Arie Shoshani,et al.  Optimizing tertiary storage organization and access for spatio-temporal datasets , 1994 .

[10]  Jack Dongarra,et al.  Computational Science - ICCS 2005, 5th International Conference, Atlanta, GA, USA, May 22-25, 2005, Proceedings, Part I , 2005, International Conference on Computational Science.

[11]  Stephen Gilmore,et al.  Evaluating the Performance of Skeleton-Based High Level Parallel Programs , 2004, International Conference on Computational Science.

[12]  Frederica Darema,et al.  Dynamic Data Driven Applications Systems: New Capabilities for Application Simulations and Measurements , 2005, International Conference on Computational Science.

[13]  Jeffrey S. Chase,et al.  Energy management for server clusters , 2001, Proceedings Eighth Workshop on Hot Topics in Operating Systems.

[14]  Rodney Van Meter,et al.  Network attached storage architecture , 2000, CACM.

[15]  Arie Shoshani,et al.  Efficient organization and access of multi-dimensional datasets on tertiary storage systems , 1995, Inf. Syst..