Appropriate allocation of workloads on performance asymmetric multicore architectures via deep learning algorithms

Abstract Asymmetric multicore processors (AMP) have become popular in both high-end and low-end computing systems due to its flexibility and high performance. A performance asymmetric multicore architecture (P-AMP) is the subcategory of AMP, which integrates the different micro-architecture cores in the same chip. Due to the heterogeneity nature of cores and applications, recognizing an optimal hardware configuration in terms of core, voltage-frequency pair for each application is still an NP-hard problem. Optimization of energy-delay product (EDP) is an additional challenging task in such architectures. To address these challenges, we developed a novel core prediction model called lightweight-deep neural network (LW-DNN) for asymmetric multicore processors. The proposed LW-DNN includes three phases, feature selection, feature optimization, and core prediction module. In the first and second phases, workload characteristics are extracted and optimized using the pre-processing algorithm and in the third phase, it predicts the appropriate cores for each workload at runtime to enhance the energy-efficiency and performance. We modeled a deep learning neural network using scikit-learn python library and evaluated in ODROID XU3 ARM big-Little performance asymmetric multicore platform. The embedded benchmarks we considered are MiBench, IoMT, Core-Mark workloads. The proposed LW-DNN prediction module compared with other traditional algorithms in terms of accuracy, execution time, energy consumption, and energy-delay product. The experimental results illustrate that accuracy achieved up to 97% in core prediction, and the average improvement in minimization of energy consumption is 33%, 35% in energy-delay product, 33% minimized in execution time correspondingly.

[1]  Vivienne Sze,et al.  Efficient Processing of Deep Neural Networks: A Tutorial and Survey , 2017, Proceedings of the IEEE.

[2]  Abdoulaye Gamatié,et al.  Empirical model-based performance prediction for application mapping on multicore architectures , 2019, J. Syst. Archit..

[3]  Trevor Mudge,et al.  MiBench: A free, commercially representative embedded benchmark suite , 2001 .

[4]  Keqin Li,et al.  Energy-Efficient Scheduling Algorithms for Real-Time Parallel Applications on Heterogeneous Distributed Embedded Systems , 2017, IEEE Transactions on Parallel and Distributed Systems.

[5]  Juan Carlos Saez,et al.  Towards completely fair scheduling on asymmetric single-ISA multicore processors , 2017, J. Parallel Distributed Comput..

[6]  Osman S. Unsal,et al.  A Machine Learning Approach for Performance Prediction and Scheduling on Heterogeneous CPUs , 2017, 2017 29th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD).

[7]  Samee Ullah Khan,et al.  An Energy-Efficient Task Scheduling Algorithm in DVFS-enabled Cloud Environment , 2015, Journal of Grid Computing.

[8]  Margaret Martonosi,et al.  Thread criticality predictors for dynamic performance, power, and resource management in chip multiprocessors , 2009, ISCA '09.

[9]  Daniel Mossé,et al.  Exploring Machine Learning for Thread Characterization on Heterogeneous Multiprocessors , 2017, OPSR.

[10]  Rupesh Nasre,et al.  Optimizing Graph Algorithms in Asymmetric Multicore Processors , 2018, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[11]  Mark P. Wachowiak,et al.  Adaptive Particle Swarm Optimization with Heterogeneous Multicore Parallelism and GPU Acceleration , 2017, IEEE Transactions on Parallel and Distributed Systems.

[12]  Xu Jiang,et al.  ReLeTA: Reinforcement Learning for Thermal-Aware Task Allocation on Multicore , 2019, ArXiv.

[13]  Lieven Eeckhout,et al.  Scheduling heterogeneous multi-cores through performance impact estimation (PIE) , 2012, 2012 39th Annual International Symposium on Computer Architecture (ISCA).

[14]  Gang Li,et al.  Ant Colony Optimization Task Scheduling Algorithm for SWIM Based on Load Balancing , 2019, Future Internet.

[15]  Tosiron Adegbija,et al.  HERMIT: A Benchmark Suite for the Internet of Medical Things , 2018, IEEE Internet of Things Journal.

[16]  Habib Izadkhah,et al.  Learning Based Genetic Algorithm for Task Graph Scheduling , 2019, Appl. Comput. Intell. Soft Comput..