Sharing Resources for Performance and Energy Optimization of Concurrent Streaming Applications

We aim at finding optimal mappings for concurrent streaming applications. Each application consists of a linear chain with several stages, and processes successive data sets in pipeline mode. The objective is to minimize the energy consumption of the whole platform, while satisfying given performance-related bounds on the period and latency of each application. The problem is to decide which processors to enroll, at which speed (or mode) to use them, and which stages they should execute. We distinguish two mapping categories, interval mappings without reuse, and fully arbitrary general mappings. On the theoretical side, we establish complexity results for this tri-criteria mapping problem (energy, period, latency). Furthermore, we derive an integer linear program that provides the optimal solution in the most general case. On the experimental side, we design polynomial-time heuristics, and assess their absolute performance thanks to the linear program. One main goal is to evaluate the impact of processor sharing on the quality of the solution.

[1]  Yves Robert,et al.  Scheduling algorithms for workow optimization , 2009 .

[2]  Mitsuhisa Sato,et al.  Profile-based optimization of power performance by using dynamic voltage scaling on a PC cluster , 2006, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium.

[3]  Rami G. Melhem,et al.  Minimizing expected energy in real-time embedded systems , 2005, EMSOFT.

[4]  Rami G. Melhem,et al.  Minimizing expected energy consumption in real-time systems through dynamic voltage scaling , 2007, TOCS.

[5]  Lothar Thiele,et al.  Energy-Efficient Task Partition for Periodic Real-Time Tasks on Platforms with Dual Processing Elements , 2008, 2008 14th IEEE International Conference on Parallel and Distributed Systems.

[6]  Yves Robert,et al.  Scheduling algorithms for linear workflow optimization , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[7]  Andrew A. Chien,et al.  A heuristic algorithm for mapping communicating tasks on heterogeneous resources , 2000, Proceedings 9th Heterogeneous Computing Workshop (HCW 2000) (Cat. No.PR00556).

[8]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[9]  Chase Qishi Wu,et al.  Supporting Distributed Application Workflows in Heterogeneous Computing Environments , 2008, 2008 14th IEEE International Conference on Parallel and Distributed Systems.

[10]  Viktor K. Prasanna,et al.  Bandwidth-aware resource allocation for heterogeneous computing systems to maximize throughput , 2003, 2003 International Conference on Parallel Processing, 2003. Proceedings..

[11]  Rami G. Melhem,et al.  On the Interplay of Parallelization, Program Performance, and Energy Consumption , 2010, IEEE Transactions on Parallel and Distributed Systems.

[12]  S. Sitharama Iyengar,et al.  Self-Adaptive Configuration of Visualization Pipeline Over Wide-Area Networks , 2008, IEEE Transactions on Computers.

[13]  Ben H. H. Juurlink,et al.  Leakage-Aware Multiprocessor Scheduling , 2009, J. Signal Process. Syst..

[14]  Petru Eles,et al.  Overhead-conscious voltage selection for dynamic and leakage energy reduction of time-constrained systems , 2004, Proceedings Design, Automation and Test in Europe Conference and Exhibition.

[15]  Hiroto Yasuura,et al.  Voltage scheduling problem for dynamically variable voltage processors , 1998, Proceedings. 1998 International Symposium on Low Power Electronics and Design (IEEE Cat. No.98TH8379).

[16]  Yves Robert,et al.  Performance and energy optimization of concurrent pipelined applications , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[17]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[18]  Yves Robert,et al.  Mapping pipeline skeletons onto heterogeneous platforms , 2007, J. Parallel Distributed Comput..

[19]  Yves Robert,et al.  Scheduling algorithms for workflow optimization , 2009 .

[20]  Krzysztof Kuchcinski,et al.  LEneS: task scheduling for low-energy systems using variable supply voltage processors , 2001, ASP-DAC '01.

[21]  David P. Bunde Power-aware scheduling for makespan and flow , 2006, SPAA '06.

[22]  Tai-Yi Huang,et al.  A Near-optimal Solution for the Heterogeneous Multi-processor Single-level Voltage Setup Problem , 2007, 2007 IEEE International Parallel and Distributed Processing Symposium.

[23]  Yves Robert,et al.  Mapping Linear Workflows with Computation/Communication Overlap , 2008, 2008 14th IEEE International Conference on Parallel and Distributed Systems.

[24]  Radu Marculescu,et al.  Communication-Aware Task Scheduling and Voltage Selection for Total Systems Energy Minimization , 2003, ICCAD.

[25]  Ian T. Foster,et al.  MPICH-G2: A Grid-enabled implementation of the Message Passing Interface , 2002, J. Parallel Distributed Comput..

[26]  Füsun Özgüner,et al.  Precedence-Constrained Task Allocation onto Point-to-Point Networks for Pipelined Execution , 1999, IEEE Trans. Parallel Distributed Syst..

[27]  Iain S. Duff,et al.  On Algorithms For Permuting Large Entries to the Diagonal of a Sparse Matrix , 2000, SIAM J. Matrix Anal. Appl..