Using multiple energy gears in MPI programs on a power-scalable cluster

Recently, system architects have built low-power, high-performance clusters, such as Green Destiny. The idea behind these clusters is to improve the energy efficiency of nodes. However, these clusters save power at the expense of performance. Our approach is instead to use high-performance cluster nodes that are frequency- and voltage-scalable; energy can than be saved by scaling down the CPU. Our prior work has examined the costs and benefits of executing an entire application at a single reduced frequency.This paper presents a framework for executing a single application in several frequency-voltage settings. The basic idea is to first divide programs into phases and then execute a series of experiments, with each phase assigned a prescribed frequency. During each experiment, we measure energy consumption and time and then use a heuristic to choose the assignment of frequency to phase for the next experiment.Our results show that significant energy can be saved without an undue performance penalty; particularly, our heuristic finds assignments of frequency to phase that is superior to any fixed-frequency solution. Specifically, this paper shows that more than half of the NAS benchmarks exhibit a better energy-time tradeoff using multiple gears than using a single gear. For example, IS using multiple gears uses 9% less energy and executes in 1% less time than the closest single-gear solution. Compared to no frequency scaling, multiple gear IS uses 16% less energy while executing only 1% longer.

[1]  Ken Kennedy,et al.  Automatic data layout for distributed-memory machines , 1998, TOPL.

[2]  Flavius Gruian Hard real-time scheduling for low-energy using stochastic data and DVS processors , 2001, ISLPED '01.

[3]  Yuanyuan Zhou,et al.  Reducing Energy Consumption of Disk Storage Using Power-Aware Cache Management , 2004, 10th International Symposium on High Performance Computer Architecture (HPCA'04).

[4]  Rolf Rabenseifner,et al.  Automatic Profiling of MPI Applications with Hardware Performance Counters , 1999, PVM/MPI.

[5]  J. Flinn,et al.  Energy-aware adaptation for mobile applications , 1999, SOSP.

[6]  E. N. Elnozahy,et al.  Energy Conservation Policies for Web Servers , 2003, USENIX Symposium on Internet Technologies and Systems.

[7]  Rong Ge,et al.  Performance-constrained Distributed DVS Scheduling for Scientific Applications on Power-aware Clusters , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[8]  David K. Lowenthal,et al.  Accurate data redistribution cost estimation in software distributed shared memory systems , 2001, PPoPP '01.

[9]  Michael C. Huang,et al.  Positional adaptation of processors: application to energy reduction , 2003, ISCA '03.

[10]  Robin Kravets,et al.  Power management techniques for mobile communication , 1998, MobiCom '98.

[11]  Fred Douglis,et al.  Adaptive Disk Spin-Down Policies for Mobile Computers , 1995, Comput. Syst..

[12]  Paul Horton,et al.  A Quantitative Analysis of Disk Drive Power Management in Portable Computers , 1994, USENIX Winter.

[13]  Scott Shenker,et al.  Scheduling for reduced CPU energy , 1994, OSDI '94.

[14]  E. N. Elnozahy,et al.  Energy-Efficient Server Clusters , 2002, PACS.

[15]  Enrique V. Carrera,et al.  Load balancing and unbalancing for power and performance in cluster-based systems , 2001 .

[16]  Michael Kistler,et al.  The case for power management in web servers , 2002 .

[17]  Sandhya Dwarkadas,et al.  Dynamic adaptation to available resources for parallel computing in an autonomous network of workstations , 2001, PPoPP '01.

[18]  Surendar Chandra Wireless network interface energy consumption , 2003, Multimedia Systems.

[19]  Krisztián Flautner,et al.  Automatic Performance Setting for Dynamic Voltage Scaling , 2001, MobiCom '01.

[20]  Karthick Rajamani,et al.  Energy Management for Commercial Servers , 2003, Computer.

[21]  Luca Benini,et al.  Compilers and Operating Systems for Low Power , 2012, Springer US.

[22]  Mahmut T. Kandemir,et al.  Scheduler-based DRAM energy management , 2002, DAC '02.

[23]  Trevor Pering,et al.  Energy Efficient Voltage Scheduling for Real-Time Operating Systems , 1998 .

[24]  Ulrich Kremer,et al.  The design, implementation, and evaluation of a compiler algorithm for CPU energy reduction , 2003, PLDI '03.

[25]  Hari Balakrishnan,et al.  Minimizing Energy for Wireless Web Access with Bounded Slowdown , 2002, MobiCom '02.

[26]  Gang Quan,et al.  Energy efficient fixed-priority scheduling for real-time systems on variable voltage processors , 2001, DAC '01.

[27]  Amin Vahdat,et al.  ECOSystem: managing energy as a first class operating system resource , 2002, ASPLOS X.

[28]  Kevin Skadron,et al.  Power-aware QoS management in Web servers , 2003, RTSS 2003. 24th IEEE Real-Time Systems Symposium, 2003.

[29]  Mahadev Satyanarayanan,et al.  PowerScope: a tool for profiling the energy usage of mobile applications , 1999, Proceedings WMCSA'99. Second IEEE Workshop on Mobile Computing Systems and Applications.

[30]  Kang Li,et al.  Client-centered energy and delay analysis for TCP downloads , 2004, Twelfth IEEE International Workshop on Quality of Service, 2004. IWQOS 2004..

[31]  Mary Baker,et al.  Non-volatile memory for fast, reliable file systems , 1992, ASPLOS V.

[32]  Remzi H. Arpaci-Dusseau,et al.  Architectural Requirements and Scalability of the NAS Parallel Benchmarks , 1999, ACM/IEEE SC 1999 Conference (SC'99).

[33]  David F. Heidel,et al.  An Overview of the BlueGene/L Supercomputer , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[34]  Karsten Schwan,et al.  Power-aware communication for mobile computers , 1999, 1999 IEEE International Workshop on Mobile Multimedia Communications (MoMuC'99) (Cat. No.99EX384).

[35]  James E. Smith,et al.  Comparing program phase detection techniques , 2003, Proceedings. 36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003. MICRO-36..

[36]  Hal Wasserman,et al.  Comparing algorithm for dynamic speed-setting of a low-power CPU , 1995, MobiCom '95.

[37]  Soonhoi Ha,et al.  Dynamic voltage scheduling technique for low-power multimedia applications using buffers , 2001, ISLPED '01.

[38]  Johan A. Pouwelse,et al.  Energy priority scheduling for variable voltage processors , 2001, ISLPED '01.

[39]  Y. Charlie Hu,et al.  Program counter based techniques for dynamic power management , 2004, 10th International Symposium on High Performance Computer Architecture (HPCA'04).

[40]  Amin Vahdat,et al.  Application-specific Network Management for Energy-Aware Streaming of Popular Multimedia Formats , 2002, USENIX Annual Technical Conference, General Track.

[41]  Ricardo Bianchini,et al.  Conserving disk energy in network servers , 2003, ICS '03.

[42]  Wu-chun Feng,et al.  High-Density Computing: A 240-Processor Beowulf in One Cubic Meter , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[43]  Thomas D. Burd,et al.  The simulation and evaluation of dynamic voltage scaling algorithms , 1998, Proceedings. 1998 International Symposium on Low Power Electronics and Design (IEEE Cat. No.98TH8379).

[44]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[45]  Kang G. Shin,et al.  Real-time dynamic voltage scaling for low-power embedded operating systems , 2001, SOSP.

[46]  Yann-Hang Lee,et al.  Voltage-clock-scaling adaptive scheduling techniques for low power in hard real-time systems , 2000, Proceedings Sixth IEEE Real-Time Technology and Applications Symposium. RTAS 2000.

[47]  Amin Vahdat,et al.  Every joule is precious: the case for revisiting operating system design for energy efficiency , 2000, ACM SIGOPS European Workshop.

[48]  Margaret Martonosi,et al.  Power-performance simulation: design and validation strategies , 2004, PERV.

[49]  Feng Pan,et al.  Exploring the energy-time tradeoff in MPI programs on a power-scalable cluster , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[50]  Trevor Mudge,et al.  Dynamic voltage scaling on a low-power microprocessor , 2001 .

[51]  Carla Schlatter Ellis,et al.  The case for higher-level power management , 1999, Proceedings of the Seventh Workshop on Hot Topics in Operating Systems.

[52]  Darrell D. E. Long,et al.  A dynamic disk spin-down technique for mobile computing , 1996, MobiCom '96.

[53]  Ricardo Bianchini,et al.  Application transformations for energy and performance-aware device management , 2002, Proceedings.International Conference on Parallel Architectures and Compilation Techniques.

[54]  Mahadev Satyanarayanan,et al.  Agile application-aware adaptation for mobility , 1997, SOSP.

[55]  Jason Flinn,et al.  Self-Tuning Wireless Network Power Management , 2003, MobiCom '03.

[56]  Philip Levis,et al.  Policies for dynamic clock scheduling , 2000, OSDI.

[57]  Vijay Kumar,et al.  Adaptive broadcast protocols to support power conservant retrieval by mobile users , 1997, Proceedings 13th International Conference on Data Engineering.

[58]  Michael L. Scott,et al.  Energy efficiency through burstiness , 2003, 2003 Proceedings Fifth IEEE Workshop on Mobile Computing Systems and Applications.

[59]  Ricardo Bianchini,et al.  Dynamic cluster reconfiguration for power and performance , 2003 .

[60]  Luca Benini,et al.  Operating-system directed power reduction , 2000, ISLPED '00.

[61]  Vincent W. Freeh,et al.  Dynamic Power Management using Feedback , 2002 .

[62]  Mark Horowitz,et al.  Energy dissipation in general purpose microprocessors , 1996, IEEE J. Solid State Circuits.

[63]  Alvin R. Lebeck,et al.  Power aware page allocation , 2000, SIGP.

[64]  Alan Jay Smith,et al.  Software strategies for portable computer energy management , 1998, IEEE Wirel. Commun..

[65]  David K. Lowenthal,et al.  Client-centered energy savings for concurrent HTTP connections , 2004, NOSSDAV '04.

[66]  Mahmut T. Kandemir,et al.  Reducing Disk Power Consumption in Servers with DRPM , 2003, Computer.

[67]  Alan Jay Smith,et al.  Improving dynamic voltage scaling algorithms with PACE , 2001, SIGMETRICS '01.

[68]  Surendar Chandra,et al.  Wireless network interface energy consumption implications of popular streaming formats , 2001, IS&T/SPIE Electronic Imaging.

[69]  Mahmut T. Kandemir,et al.  DRPM: dynamic speed control for power management in server class disks , 2003, 30th Annual International Symposium on Computer Architecture, 2003. Proceedings..

[70]  Amin Vahdat,et al.  Currentcy: Unifying Policies for Resource Management , 2002 .