Evaluating orthogonality between application auto-tuning and run-time resource management for adaptive OpenCL applications

The ever increasing number of processing units integrated on the same many-core chip delivers computational power that can exceed the performance requirements of a single application. The number of chips (and related power consumption) can thus be reduced to serve multiple applications - a practice which is called resource consolidation. However, this solution requires techniques to partition and assign resources among the applications and to manage unpredictable dynamic workloads.

[1]  Jürgen Teich,et al.  Invasive Computing: An Overview , 2011, Multiprocessor System-on-Chip.

[2]  Giuseppe Massari,et al.  A RTRM proposal for multi/many-core platforms and reconfigurable applications , 2012, 7th International Workshop on Reconfigurable and Communication-Centric Systems-on-Chip (ReCoSoC).

[3]  Vittorio Zaccaria,et al.  ARTE: An Application-specific Run-Time managEment framework for multi-cores based on queuing models , 2013, Parallel Comput..

[4]  Chantal Ykman-Couvreur,et al.  Run-time resource management based on design space exploration , 2012, CODES+ISSS.

[5]  Massoud Pedram,et al.  Resource allocation and consolidation in a multi-core server cluster using a Markov decision process model , 2013, International Symposium on Quality Electronic Design (ISQED).

[6]  Henry Hoffmann,et al.  Power Optimization in Embedded Systems via Feedback Control of Resource Allocation , 2013, IEEE Transactions on Control Systems Technology.

[7]  Kay Weckemann,et al.  Remote Utilization of OpenCL for Flexible Computation Offloading using Embedded ECUs, CE Devices and Cloud Servers , 2011, PARCO.

[8]  Henry Hoffmann,et al.  Managing performance vs. accuracy trade-offs with loop perforation , 2011, ESEC/FSE '11.

[9]  Henry Hoffmann,et al.  Application heartbeats: a generic interface for specifying program performance and goals in autonomous computing environments , 2010, ICAC '10.

[10]  George Kurian,et al.  Self-aware computing in the Angstrom processor , 2012, DAC Design Automation Conference 2012.

[11]  Aaftab Munshi,et al.  The OpenCL specification , 2009, 2009 IEEE Hot Chips 21 Symposium (HCS).

[12]  Jürgen Teich,et al.  Game-theoretic analysis of decentralized core allocation schemes on many-core systems , 2013, 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[13]  Chantal Ykman-Couvreur,et al.  Linking run-time resource management of embedded multi-core platforms with automated design-time exploration , 2011, IET Comput. Digit. Tech..

[14]  Gauthier Lafruit,et al.  Cross-Based Local Stereo Matching Using Orthogonal Integral Images , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Henry Hoffmann,et al.  Dynamic knobs for responsive power-aware computing , 2011, ASPLOS XVI.