Energy efficiency and performance management of parallel dataflow applications

Parallelizing software is a popular way of achieving high energy efficiency since parallel applications can be mapped on many cores and the clock frequency can be lowered. Perfect parallelism is, however, not often reached and different program phases usually contain different levels of parallelism due to data dependencies. Applications have currently no means of expressing the level of parallelism, and the power management is mostly done based on only the workload. In this work, we provide means of expressing QoS and levels of parallelism in applications for more tight integration with the power management to obtain optimal energy efficiency in multi-core systems. We utilize the dataflow framework PREESM to create and analyze program structures and expose the parallelism in the program phases to the power management. We use the derived parameters in a NLP (Non Linear Programming) solver to determine the minimum power for allocating resources to the applications.

[1]  Miodrag Potkonjak,et al.  Power optimization of variable voltage core-based systems , 1998, Proceedings 1998 Design and Automation Conference. 35th DAC. (Cat. No.98CH36175).

[2]  Henry Hoffmann,et al.  Application heartbeats for software performance and health , 2010, PPoPP '10.

[3]  Jean-François Nezan,et al.  An Open Framework for Rapid Prototyping of Signal Processing Applications , 2009, EURASIP J. Embed. Syst..

[4]  Hannu Tenhunen,et al.  Energy-aware-task-parallelism for efficient dynamic voltage, and frequency scaling, in CGRAs , 2013, 2013 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS).

[5]  Carl Tim Kelley,et al.  Iterative methods for optimization , 1999, Frontiers in applied mathematics.

[6]  Dakai Zhu,et al.  Energy-aware task replication to manage reliability for periodic real-time applications on multicore platforms , 2013, 2013 International Green Computing Conference Proceedings.

[7]  Brad Calder,et al.  Discovering and Exploiting Program Phases , 2003, IEEE Micro.

[8]  Chia-Lin Yang,et al.  Smart cache: an energy-efficient D-cache for a software MPEG-2 video decoder , 2003, Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint.

[9]  Siti Arpah Ahmad Parallel approach of Sobel Edge Detector on Multicore Platform , 2011 .

[10]  Luca Benini,et al.  Single-Chip Cloud Computer thermal model , 2011, 2011 17th International Workshop on Thermal Investigations of ICs and Systems (THERMINIC).

[11]  Christoph Meinel,et al.  Accurate Mutlicore Processor Power Models for Power-Aware Resource Management , 2011, 2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure Computing.

[12]  Pascal Frossard,et al.  Markov Decision Process Based Energy-Efficient On-Line Scheduling for Slice-Parallel Video Decoders on Multicore Systems , 2013, IEEE Transactions on Multimedia.

[13]  Michael A. Saunders,et al.  SNOPT: An SQP Algorithm for Large-Scale Constrained Optimization , 2002, SIAM J. Optim..

[14]  Bronis R. de Supinski,et al.  Adagio: making DVS practical for complex HPC applications , 2009, ICS.

[15]  Simon Holmbacka,et al.  QoS Manager for Energy Efficient Many-Core Operating Systems , 2013, 2013 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing.

[16]  Edward A. Lee,et al.  Static Scheduling of Synchronous Data Flow Programs for Digital Signal Processing , 1989, IEEE Transactions on Computers.

[17]  Mahmut T. Kandemir,et al.  Leakage Current: Moore's Law Meets Static Power , 2003, Computer.

[18]  Zhi-bo Du,et al.  The Impact of the Clock Frequency on the Power Analysis Attacks , 2011, 2011 International Conference on Internet Technology and Applications.

[19]  Alexandra I. Cristea,et al.  Speed-up opportunities for ANN in a time-share parallel environment , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[20]  Rami G. Melhem,et al.  On the Interplay of Parallelization, Program Performance, and Energy Consumption , 2010, IEEE Transactions on Parallel and Distributed Systems.

[21]  Simon Holmbacka,et al.  Thermal influence on the energy efficiency of workload consolidation in many-core architectures , 2013, 2013 24th Tyrrhenian International Workshop on Digital Communications - Green ICT (TIWDC).

[22]  Thomas Rauber,et al.  Energy-Aware Execution of Fork-Join-Based Task Parallelism , 2012, 2012 IEEE 20th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[23]  Omar Hammami,et al.  Parallel programming and speed up evaluation of a NoC 2-ary 4-fly , 2010, 2010 International Conference on Microelectronics.

[24]  Philippe Codognet,et al.  Prediction of Parallel Speed-Ups for Las Vegas Algorithms , 2012, 2013 42nd International Conference on Parallel Processing.