Exploring manycore architectures for next-generation HPC systems through the MANGO approach

Abstract The Horizon 2020 MANGO project aims at exploring deeply heterogeneous accelerators for use in High-Performance Computing systems running multiple applications with different Quality of Service (QoS) levels. The main goal of the project is to exploit customization to adapt computing resources to reach the desired QoS. For this purpose, it explores different but interrelated mechanisms across the architecture and system software. In particular, in this paper we focus on the runtime resource management, the thermal management, and support provided for parallel programming, as well as introducing three applications on which the project foreground will be validated.

[1]  David Atienza,et al.  Neural Network-Based Thermal Simulation of Integrated Circuits on GPUs , 2012, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[2]  Guilherme Corrêa,et al.  Complexity scalability for real-time HEVC encoders , 2013, Journal of Real-Time Image Processing.

[3]  Giovanni Agosta,et al.  OpenCL performance portability for general‐purpose computation on graphics processor units: an exploration on cryptographic primitives , 2015, Concurr. Comput. Pract. Exp..

[4]  Giovanni Agosta,et al.  Optimizing Memory Management in Deeply Heterogeneous HPC Accelerators , 2017, 2017 46th International Conference on Parallel Processing Workshops (ICPPW).

[5]  Muhammad Shafique,et al.  TONE: Adaptive temperature optimization for the next generation video encoders , 2014, 2014 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED).

[6]  Giovanni Agosta,et al.  Towards Transparently Tackling Functionality and Performance Issues across Different OpenCL Platforms , 2014, 2014 Second International Symposium on Computing and Networking.

[7]  Giuseppe Massari,et al.  Effective Runtime Resource Management Using Linux Control Groups with the BarbequeRTRM Framework , 2015, TECS.

[8]  Vikram S. Adve,et al.  LLVM: a compilation framework for lifelong program analysis & transformation , 2004, International Symposium on Code Generation and Optimization, 2004. CGO 2004..

[9]  Bernhard Preim,et al.  Visual Computing for Medicine: Theory, Algorithms, and Applications , 2007 .

[10]  Robert G. Gallager,et al.  Low-density parity-check codes , 1962, IRE Trans. Inf. Theory.

[11]  Frank Kienle,et al.  A novel LDPC decoder for DVB-S2 IP , 2009, 2009 Design, Automation & Test in Europe Conference & Exhibition.

[12]  Somayeh Sardashti,et al.  The gem5 simulator , 2011, CARN.

[13]  David Atienza,et al.  3D-ICE: Fast compact transient thermal modeling for 3D ICs with inter-tier liquid cooling , 2010, 2010 IEEE/ACM International Conference on Computer-Aided Design (ICCAD).

[14]  Radford M. Neal,et al.  Near Shannon limit performance of low density parity check codes , 1996 .

[15]  Mario Kovac,et al.  E-Health Demystified: An E-Government Showcase , 2014, Computer.

[16]  Alessandro Cilardo,et al.  MANGO: Exploring Manycore Architectures for Next-GeneratiOn HPC Systems , 2017, 2017 Euromicro Conference on Digital System Design (DSD).

[17]  Wolfgang Ziegler,et al.  Implementing a “one-stop-shop” providing SMEs with integrated HPC simulation resources using Fortissimo resources , 2014, eChallenges e-2014 Conference Proceedings.

[18]  Jackson Braz Marcinichen,et al.  Two-phase mini-thermosyphon electronics cooling: Dynamic modeling, experimental validation and application to 2U servers , 2017 .

[19]  Jung Ho Ahn,et al.  McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures , 2009, 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[20]  William Fornaciari,et al.  PowerProbe: Run-time power modeling through automatic RTL instrumentation , 2018, 2018 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[21]  Giovanni Agosta,et al.  Managing Heterogeneous Resources in HPC Systems , 2018, PARMA-DITAM '18.

[22]  Alessandro Cilardo,et al.  Enabling HPC for QoS-sensitive applications: The MANGO approach , 2016, 2016 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[23]  Muhammad Usman Karim Khan,et al.  Power efficient and workload balanced tiling for parallelized high efficiency video coding , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[24]  Mihaela van der Schaar,et al.  Big-Data Streaming Applications Scheduling Based on Staged Multi-Armed Bandits , 2016, IEEE Transactions on Computers.