A computer-architecture approach to thermal management in computer systems: opportunities and challenges

Cooling costs for notebook, desktop, and server computer systems are rising exponentially as power densities in high-performance chips continue to double every three years. Research has produced a range of advances in the modeling and design of thermal packaging and circuit boards. Yet one major front has been missing: thermal design in the computer-architecture domain, which controls processor utilization, the interleaving of different computation processes, and the flow of instructions through the CPU. This domain offers a rich opportunity for thermal management that complements those advances by controlling temperature at runtime, in response to the dynamic behavior of the computer's current workload. This paper describes a compact modeling algorithm appropriate for this domain, presents several architectural techniques for runtime thermal management, and outlines interesting problems that remain open.
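The abstract gives no details of the modeling algorithm or of the management techniques, so the following is only an illustrative sketch of what "controlling temperature at runtime" can look like, not the paper's method. It assumes a single lumped thermal node (one thermal resistance and one thermal capacitance to ambient) and a simple trigger-based throttle. All names and parameter values (`r_th`, `c_th`, `ambient_c`, `trigger_c`, `throttle`) are hypothetical and chosen only for illustration.

```python
import math

def step_temperature(temp_c, power_w, dt_s, r_th=0.8, c_th=60.0, ambient_c=45.0):
    """Advance one lumped thermal-RC node by dt_s seconds.

    Uses the exact solution of C*dT/dt = P - (T - T_amb)/R for constant power.
    r_th (K/W), c_th (J/K), and ambient_c (deg C) are assumed values.
    """
    steady = ambient_c + power_w * r_th           # temperature reached if power never changed
    decay = math.exp(-dt_s / (r_th * c_th))       # fraction of the gap to steady state that remains
    return steady + (temp_c - steady) * decay

def simulate(workload_w, dt_s=0.1, trigger_c=85.0, throttle=0.5, start_c=45.0):
    """Run a power trace through the model with a trigger-based throttle.

    When the modeled temperature is at or above trigger_c, the power applied
    in the next interval is scaled by `throttle` (a stand-in for any
    architectural slowdown mechanism). Returns a list of (temp, throttled).
    """
    temp = start_c
    trace = []
    for power in workload_w:
        throttled = temp >= trigger_c
        applied = power * throttle if throttled else power
        temp = step_temperature(temp, applied, dt_s)
        trace.append((temp, throttled))
    return trace

if __name__ == "__main__":
    hot_workload = [60.0] * 2000                  # 200 s of a constant 60 W load
    managed = simulate(hot_workload)
    unmanaged = simulate(hot_workload, trigger_c=1000.0)  # trigger never reached
    print("peak with throttling:   ", round(max(t for t, _ in managed), 2))
    print("peak without throttling:", round(max(t for t, _ in unmanaged), 2))
```

With these assumed parameters the unthrottled node heads toward a steady state above the trigger temperature, while the throttled run hovers near the trigger. A real architecture-level model would track many on-chip blocks and their coupling rather than a single node.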
