Artificial bee colony-inspired run-time task management for many-core systems

Efficient resource and application management is one of the most complex and challenging tasks in high performance computing. Large-scale computing systems that contain hundreds, thousands or even millions of cores demand solutions that can operate in a distributed, robust, and scalable fashion. However, while hardware parallelism is relatively straight forward to achieve, this is not generally the case for software. This leads to under-utilization of the hardware parallelism as well as imbalanced load distribution causing inefficiency and hotspots. In response to this challenge, this paper introduces a novel distributed and decentralized run-time management algorithm. The proposed method is guided by an optimization model inspired by artificial bee colonies (ABC). While ABC have proven useful for optimizing large sets of numerical test functions, this is the first time they are applied in the context of many-core system management. The initial result shows that, the ABC model is promising in context of run-time management for many-core systems. It is also anticipated that the algorithms bio-inspired foundations will inherently enable scalability, reliability, and adaptation. We are showing initial experiments, where the initial results indicate the capability of our model to improve the thermal distribution across the system.

[1]  Cosimo Birtolo,et al.  Modeling an Artificial Bee Colony with Inspector for Clustering Tasks , 2014, EvoCOP.

[2]  Bashir M. Al-Hashimi Hardware reliability of embedded systems: Are we there yet? , 2013, 2013 23rd International Workshop on Power and Timing Modeling, Optimization and Simulation (PATMOS).

[3]  Dervis Karaboga,et al.  A novel clustering approach: Artificial Bee Colony (ABC) algorithm , 2011, Appl. Soft Comput..

[4]  Amit Kumar Singh,et al.  Mapping on multi/many-core systems: Survey of current and emerging trends , 2013, 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC).

[5]  Wolfgang Schröder-Preikschat,et al.  DistRM: Distributed resource management for on-chip many-core systems , 2011, 2011 Proceedings of the Ninth IEEE/ACM/IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS).

[6]  Steve B. Furber,et al.  The SpiNNaker Project , 2014, Proceedings of the IEEE.

[7]  Timothy Mattson,et al.  A 48-Core IA-32 message-passing processor with DVFS in 45nm CMOS , 2010, 2010 IEEE International Solid-State Circuits Conference - (ISSCC).

[8]  Medhat A. Tawfeek,et al.  A comparative study into swarm intelligence algorithms for dynamic tasks scheduling in cloud computing , 2015, 2015 IEEE Seventh International Conference on Intelligent Computing and Information Systems (ICICIS).

[9]  Radu Marculescu,et al.  User-Aware Dynamic Task Allocation in Networks-on-Chip , 2008, 2008 Design, Automation and Test in Europe.

[10]  Martin Middendorf,et al.  Searching for a new home—scouting behavior of honeybee swarms , 2007 .

[11]  G.E. Moore,et al.  Cramming More Components Onto Integrated Circuits , 1998, Proceedings of the IEEE.

[12]  Anthony P. Reeves,et al.  Strategies for Dynamic Load Balancing on Highly Parallel Computers , 1993, IEEE Trans. Parallel Distributed Syst..

[13]  John Doe Load balancing Strategies in Parallel Computing : Short Survey , 2015 .

[14]  Martin Trefzer,et al.  Social-insect-inspired adaptive task allocation for many-core systems , 2016, 2016 IEEE Congress on Evolutionary Computation (CEC).

[15]  Leandro Soares Indrusiak,et al.  Deadline, Energy and Buffer-Aware Task Mapping Optimization in NoC-Based SoCs Using Genetic Algorithms , 2017, 2017 VII Brazilian Symposium on Computing Systems Engineering (SBESC).

[16]  Ranga Vemuri,et al.  optiMap: a tool for automated generation of NoC architectures using multi-port routers for FPGAs , 2006, Proceedings of the Design Automation & Test in Europe Conference.

[17]  C. Grüter,et al.  Informational conflicts created by the waggle dance , 2008, Proceedings of the Royal Society B: Biological Sciences.

[18]  Geoff V. Merrett,et al.  Workload-Aware Runtime Energy Management for HPC Systems , 2018, 2018 International Conference on High Performance Computing & Simulation (HPCS).

[19]  Pedro B. Campos,et al.  XL-STaGe: A cross-layer scalable tool for graph generation, evaluation and implementation , 2016, 2016 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS).

[20]  Fernando Gehm Moraes,et al.  Evaluation of Algorithms for Low Energy Mapping onto NoCs , 2007, 2007 IEEE International Symposium on Circuits and Systems.

[21]  Dervis Karaboga,et al.  AN IDEA BASED ON HONEY BEE SWARM FOR NUMERICAL OPTIMIZATION , 2005 .