Leakage aware resource management approach with machine learning optimization framework for partially reconfigurable architectures

Developing a comprehensive framework for integrating Genetic Algorithm and Machine Learning techniques to optimize our Leakage-aware RMA: from generating data to building predictive models and predicting Pareto fronts for new Task Graphs.Building a Linear Regression model describing the dependency between the range of Pareto front and Task Graphs features.Applying Density-based Clustering Algorithm to generate near-Pareto-optimal design points.Conducting experiments to examine the performance and efficiency of the proposed ML Optimization Framework. Shrinking size of transistors has enabled us to integrate more and more logic elements into FPGA chips leading to higher computing power. However, it also brings a serious concern to the leakage power dissipation of the FPGA devices. One of the major reasons for leakage power dissipation in FPGA is the utilization of prefetching technique to minimize the reconfiguration overhead (delay) in Partially Reconfigurable (PR) FPGAs. This technique creates delays between the reconfiguration and execution parts of a task, which may lead up to 38% leakage power of FPGA since the SRAM-cells containing reconfiguration information cannot be powered down. In this work, a resource management approach (RMA) containing scheduling, placement and post-placement stages has been proposed to address the aforementioned issue. In scheduling stage, a leakage-aware priority function is derived to cope with the leakage power. The placement stage uses a cost function that allows designers to determine the desired trade-off between performance and leakage-saving. The post-placement stage employs a heuristic approach to close the gaps between reconfiguration and execution of tasks, hence further reduce leakage waste. To further examine the trade-off between performance (schedule length) and leakage waste, we propose a framework to utilize the Genetic Algorithm (GA) for exploring the design space and obtaining Pareto optimal design points. Addressing the time-consuming limitation of GA, we apply Regression technique and Clustering algorithm to build predictive models for the Pareto fronts using a training task graph dataset. Experiments show that our approach can achieve large leakage savings for both synthetic and real-life applications with acceptable extended deadline. Furthermore, different variants of the proposed approach can reduce leakage power by 4065% when compared to a performance-driven approach and by 1543% when compared to state-of-the-art works. Its also proven that our Machine Learning Optimization framework can estimate the Pareto front for new coming task graphs 10x faster than well-established GA approach with only 10% degradation in quality.

[1]  Renato J. O. Figueiredo,et al.  MALMOS: Machine Learning-Based Mobile Offloading Scheduler with Online Training , 2015, 2015 3rd IEEE International Conference on Mobile Cloud Computing, Services, and Engineering.

[2]  Lothar Thiele,et al.  Multiobjective Optimization Using Evolutionary Algorithms - A Comparative Case Study , 1998, PPSN.

[3]  Sujit Dey,et al.  High-Level Power Analysis and Optimization , 1997 .

[4]  Muhammad Shafique,et al.  REMiS: Run-time energy minimization scheme in a reconfigurable processor with dynamic power-gated instruction set , 2009, 2009 IEEE/ACM International Conference on Computer-Aided Design - Digest of Technical Papers.

[5]  Luca Benini,et al.  Row-Based Power-Gating: A Novel Sleep Transistor Insertion Methodology for Leakage Power Optimization in Nanometer CMOS Circuits , 2011, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[6]  Jonathan Rose,et al.  Measuring the Gap Between FPGAs and ASICs , 2007, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[7]  Carla E. Brodley,et al.  Predictive application-performance modeling in a computational grid environment , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[8]  Oliver Sinnen,et al.  Task Scheduling for Parallel Systems (Wiley Series on Parallel and Distributed Computing) , 2007 .

[9]  P.T. Balsara,et al.  Exploiting temporal idleness to reduce leakage power in programmable architectures , 2005, Proceedings of the ASP-DAC 2005. Asia and South Pacific Design Automation Conference, 2005..

[10]  Domenik Helms,et al.  Modelling the Impact of High Level Leakage Optimization Techniques on the Delay of RT-Components , 2007, PATMOS.

[11]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[12]  Heike Trautmann,et al.  On the properties of the R2 indicator , 2012, GECCO '12.

[13]  Ishfaq Ahmad,et al.  Benchmarking and Comparison of the Task Graph Scheduling Algorithms , 1999, J. Parallel Distributed Comput..

[14]  Jason Cong,et al.  Energy efficient multiprocessor task scheduling under input-dependent variation , 2009, 2009 Design, Automation & Test in Europe Conference & Exhibition.

[15]  Wayne H. Wolf,et al.  TGFF: task graphs for free , 1998, Proceedings of the Sixth International Workshop on Hardware/Software Codesign. (CODES/CASHE'98).

[16]  Enrico Macii,et al.  Design Techniques and Architectures for Low-Leakage SRAMs , 2012, IEEE Transactions on Circuits and Systems I: Regular Papers.

[17]  Hans-Peter Kriegel,et al.  OPTICS: ordering points to identify the clustering structure , 1999, SIGMOD '99.

[18]  Pascal Bouvry,et al.  A Survey of Evolutionary Computation for Resource Management of Processing in Cloud Computing [Review Article] , 2015, IEEE Computational Intelligence Magazine.

[19]  Jordi Torres,et al.  Power-Aware Multi-data Center Management Using Machine Learning , 2013, 2013 42nd International Conference on Parallel Processing.

[20]  Oliver Sinnen,et al.  Task Scheduling for Parallel Systems , 2007, Wiley series on parallel and distributed computing.

[21]  A.M. Rahmani,et al.  A Novel Intelligent Algorithm for Fault-Tolerant Task Scheduling in Real-Time Multiprocessor Systems , 2008, 2008 Third International Conference on Convergence and Hybrid Information Technology.

[22]  Balázs Kégl,et al.  Utility-Based Reinforcement Learning for Reactive Grids , 2008, 2008 International Conference on Autonomic Computing.

[23]  Andrea Lodi,et al.  Low leakage circuit design for FPGAs , 2004, Proceedings of the IEEE 2004 Custom Integrated Circuits Conference (IEEE Cat. No.04CH37571).

[24]  Bharadwaj Veeravalli,et al.  Reinforcement learning-based inter- and intra-application thermal optimization for lifetime improvement of multicore systems , 2014, 2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC).

[25]  James Kao,et al.  Subthreshold leakage modeling and reduction techniques , 2002, ICCAD 2002.

[26]  Lothar Thiele,et al.  Thermally optimal stop-go scheduling of task graphs with real-time constraints , 2011, 16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011).

[27]  Pier Luca Lanzi,et al.  Ant Colony Heuristic for Mapping and Scheduling Tasks and Communications on Heterogeneous Embedded Systems , 2010, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[28]  Albert Y. Zomaya,et al.  Toward Energy-Aware Scheduling Using Machine Learning , 2012 .

[29]  Yuan-Hao Chang,et al.  An enhanced leakage-aware scheduler for dynamically reconfigurable FPGAs , 2011, 16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011).

[30]  Jürgen Teich,et al.  A Dynamic Scheduling and Placement Algorithm for Reconfigurable Hardware , 2004, ARCS.

[31]  Bo-Cheng Lai,et al.  Leakage power analysis of a 90nm FPGA , 2003, Proceedings of the IEEE 2003 Custom Integrated Circuits Conference, 2003..

[32]  Scott Hauck,et al.  Configuration prefetch for single context reconfigurable coprocessors , 1998, FPGA '98.

[33]  Anantha Chandrakasan,et al.  Design methodology for fine-grained leakage control in MTCMOS , 2003, ISLPED '03.

[34]  Chia-Lin Yang,et al.  Leakage-aware task scheduling for partially dynamically reconfigurable FPGAs , 2009, TODE.

[35]  Chittaranjan Tripathy,et al.  Fault tolerant scheduling of hard real-time tasks on multiprocessor system using a hybrid genetic algorithm , 2014, Swarm Evol. Comput..

[36]  Nikil D. Dutt,et al.  Physically-aware HW-SW partitioning for reconfigurable architectures with partial dynamic reconfiguration , 2005, Proceedings. 42nd Design Automation Conference, 2005..

[37]  José Manuel Moya,et al.  Leakage and temperature aware server control for improving energy efficiency in data centers , 2013, 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[38]  Fei Li,et al.  Field Programmability of Supply Voltages for FPGA Power Reduction , 2007, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[39]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[40]  Mohammad Kazem Akbari,et al.  Using of Machine Learning into Cloud Environment (A Survey): Managing and Scheduling of Resources in Cloud Systems , 2012, 2012 Seventh International Conference on P2P, Parallel, Grid, Cloud and Internet Computing.

[41]  Marco Platzner,et al.  Heuristics for Onine Scheduling Real-Time Tasks to Partially Reconfigurable Devices , 2003, FPL.

[42]  Mahmut T. Kandemir,et al.  Reducing leakage energy in FPGAs using region-constrained placement , 2004, FPGA '04.