Analysis and Control of Power–Temperature Dynamics in Heterogeneous Multiprocessors

Virtually, all electronic systems try to optimize a fundamental tradeoff between higher performance and lower power consumption. The latter becomes critical in mobile computing systems, such as smartphones, which rely on passive cooling. Otherwise, the heat concentrated in a small area drives both the junction and skin temperatures up. High junction temperatures degrade the reliability, while skin temperature deteriorates the user experience. Therefore, there is a strong need for a formal analysis of power consumption-temperature dynamics and predictive thermal management algorithms. This article presents a theoretical power–temperature analysis of multiprocessor systems that are modeled as multi-input multi-output dynamic systems. We analyze the conditions under which the system converges to a stable steady-state temperature. Then, we use these models to design a control algorithm that manages the temperature of the system without affecting the performance of the application. Experiments on the Odroid-XU3 board show that the control algorithm is able to regulate the temperature with a minimal loss in performance compared with the default thermal governors.

[1]  Carla E. Brodley,et al.  Heat stroke: power-density-based denial of service in SMT , 2005, 11th International Symposium on High-Performance Computer Architecture.

[2]  Li Shang,et al.  System-Level Dynamic Thermal Management for High-Performance Microprocessors , 2008, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[3]  Sung Woo Chung,et al.  On the Thermal Attack in Instruction Caches , 2010, IEEE Transactions on Dependable and Secure Computing.

[4]  K. Skadron,et al.  Potential thermal security risks , 2005, Semiconductor Thermal Measurement and Management IEEE Twenty First Annual IEEE Symposium, 2005..

[5]  Kevin Skadron,et al.  HotSpot: a compact thermal modeling methodology for early-stage VLSI design , 2006, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[6]  Umit Y. Ogras,et al.  Algorithmic Optimization of Thermal and Power Management for Heterogeneous Mobile Platforms , 2018, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[7]  Lawrence T. Pileggi,et al.  Efficient full-chip thermal modeling and analysis , 2004, IEEE/ACM International Conference on Computer Aided Design, 2004. ICCAD-2004..

[8]  Ümit Y. Ogras,et al.  Predictive dynamic thermal and power management for heterogeneous mobile platforms , 2015, 2015 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[9]  W. Rudin Principles of mathematical analysis , 1964 .

[10]  Heba Khdr,et al.  Thermal Safe Power (TSP): Efficient Power Budgeting for Heterogeneous Manycore Systems in Dark Silicon , 2017, IEEE Transactions on Computers.

[11]  Carl D. Meyer,et al.  Matrix Analysis and Applied Linear Algebra , 2000 .

[12]  Norman P. Jouppi,et al.  Heterogeneous chip multiprocessors , 2005, Computer.

[13]  Arnold Neumaier,et al.  Introduction to Numerical Analysis , 2001 .

[14]  Giovanni De Micheli,et al.  A control theory approach for thermal balancing of MPSoC , 2009, 2009 Asia and South Pacific Design Automation Conference.

[15]  Stephen P. Boyd,et al.  Processor Speed Control With Thermal Constraints , 2009, IEEE Transactions on Circuits and Systems I: Regular Papers.

[16]  Onur Sahin,et al.  QScale: Thermally-efficient QoS management on heterogeneous mobile platforms , 2016, 2016 IEEE/ACM International Conference on Computer-Aided Design (ICCAD).

[17]  Ümit Y. Ogras,et al.  Power-Temperature Stability and Safety Analysis for Multiprocessor Systems , 2017, ACM Trans. Embed. Comput. Syst..

[18]  Saibal Mukhopadhyay,et al.  Leakage current mechanisms and leakage reduction techniques in deep-submicrometer CMOS circuits , 2003, Proc. IEEE.

[19]  Alberto Leva,et al.  Event-Based Power/Performance-Aware Thermal Management for High-Density Microprocessors , 2018, IEEE Transactions on Control Systems Technology.

[20]  Siddharth Garg,et al.  Energy-efficient computing from systems-on-chip to micro-server and data centers , 2015, 2015 Sixth International Green and Sustainable Computing Conference (IGSC).

[21]  Krste Asanovic,et al.  Reducing power density through activity migration , 2003, ISLPED '03.

[22]  Sachin S. Sapatnekar,et al.  A high efficiency full-chip thermal simulation algorithm , 2005, ICCAD-2005. IEEE/ACM International Conference on Computer-Aided Design, 2005..

[23]  Dilip Krishnaswamy,et al.  PROMETHEUS: A Proactive Method for Thermal Management of Heterogeneous MPSoCs , 2013, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[24]  Lei He,et al.  Coupled Power and Thermal Simulation with Active Cooling , 2003, PACS.

[25]  Luca Benini,et al.  An Effective Gray-Box Identification Procedure for Multicore Thermal Modeling , 2014, IEEE Transactions on Computers.

[26]  M. Sachdev,et al.  Thermal runaway in integrated circuits , 2006, IEEE Transactions on Device and Materials Reliability.

[27]  Sudhakar Yalamanchili,et al.  Temperature regulation in multicore processors using adjustable-gain integral controllers , 2015, 2015 IEEE Conference on Control Applications (CCA).

[28]  Ümit Y. Ogras,et al.  Power and Thermal Analysis of Commercial Mobile Platforms: Experiments and Case Studies , 2019, 2019 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[29]  Sherief Reda,et al.  Thermal prediction and adaptive control through workload phase detection , 2013, TODE.

[30]  Trevor Mudge,et al.  MiBench: A free, commercially representative embedded benchmark suite , 2001 .

[31]  Oguz Ergin,et al.  User-specific skin temperature-aware DVFS for smartphones , 2015, 2015 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[32]  Radu Marculescu,et al.  Design and Management of Voltage-Frequency Island Partitioned Networks-on-Chip , 2009, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.