On-line learning based dynamic thermal management for multicore systems

Power consumption of a high-end microprocessor increases very rapidly. High power consumption will lead to rapid increase in chip temperature as well. If temperature reaches beyond a certain level, chip operation becomes either slow or unreliable. Therefore various approaches for dynamic thermal management (DTM) have been proposed. In this paper, we propose a new application-oriented learning-based dynamic thermal management (LDTM) technique for a multi-core system. From repetitive executions of an application, we learn the thermal patterns of the chip, and we control the future temperature through DTM. When the predicted temperature may rise above a threshold value, we reduce the temperature by decreasing the operation frequency of the corresponding core. We implement our learning-based thermal management on an Intel's dual core system which is equipped with digital thermal sensors (DTS). The dynamic frequency scaling (DFS) is implemented to have three frequency steps on a Linux kernel. We carried out experiments using Phoronix Test Suite benchmarks for Linux. The peak temperature has been reduced by on average 7degC using our LDTM, and the overall average temperature reduced from 72degC to 65degC.

[1]  T. N. Vijaykumar,et al.  Heat-and-run: leveraging SMT and CMP to manage power density through the operating system , 2004, ASPLOS XI.

[2]  Li Shang,et al.  HybDTM: a coordinated hardware-software approach for dynamic thermal management , 2006, 2006 43rd ACM/IEEE Design Automation Conference.

[3]  Jaeheon Lee,et al.  A 26mW dual-mode RF receiver for GPS/Galileo with L1/L1F and L5/E5a bands , 2008, 2008 International SoC Design Conference.

[4]  Narayanan Vijaykrishnan,et al.  Interconnect and thermal-aware floorplanning for 3D microprocessors , 2006, 7th International Symposium on Quality Electronic Design (ISQED'06).

[5]  Stefan Schaal,et al.  Locally Weighted Projection Regression : An O(n) Algorithm for Incremental Real Time Learning in High Dimensional Space , 2000 .

[6]  Jinwook Burm,et al.  Design of optical fiber sensor driving IC for structure monitoring system , 2008, 2008 International SoC Design Conference.

[7]  Kevin Skadron,et al.  Temperature-aware microarchitecture: Modeling and implementation , 2004, TACO.

[8]  Stephen H. Gunther,et al.  Managing the Impact of Increasing Microprocessor Power Consumption , 2001 .

[9]  Krste Asanovic,et al.  Reducing power density through activity migration , 2003, ISLPED '03.

[10]  Kevin Skadron,et al.  Compact thermal modeling for temperature-aware design , 2004, Proceedings. 41st Design Automation Conference, 2004..

[11]  Stefan Schaal,et al.  Incremental Online Learning in High Dimensions , 2005, Neural Computation.

[12]  Yiannakis Sazeides,et al.  A study of thread migration in temperature-constrained multicores , 2007, TACO.

[13]  Margaret Martonosi,et al.  Dynamic thermal management for high-performance microprocessors , 2001, Proceedings HPCA Seventh International Symposium on High-Performance Computer Architecture.

[14]  Eun Jung Kim,et al.  Predictive dynamic thermal management for multicore systems , 2008, 2008 45th ACM/IEEE Design Automation Conference.