Multi-Objective Optimization to Improve Both Thermal and Device Performance of a Nonuniformly Powered Micro-Architecture

Integration of different functional components such as level two (L2) cache memory, high-speed I/O interfaces, and memory controller has enhanced microprocessor performance. In this architecture, certain functional units on the microprocessor dissipate a significant fraction of the total power while other functional units dissipate little or no power. This highly nonuniform power distribution results in a large temperature gradient with Localized hot spots that may have detrimental effects on computer performance, product reliability, and yield. Moving the functional units may reduce the junction temperature but can affect performance by a factor as much as 30%. In this paper, a multi-objective optimization is performed to minimize the junction temperature without significantly altering the computer performance. The analysis was performed for 90 nm Pentium IV Northwood architecture operating at 3 GHz clock speed. Each functional unit on the die has a specific role, so functional units with similar roles were grouped together. Thus, the actual Pentium IV die was divided into four groups (front end, execution cores, bus and L2, and out-of-order engine). Repositioning constraints were determined using circuit delay models of major functional units in a micro-architectural simulator. Thus, depending on the scenario, relocating functional units can result in virtually no performance loss (less than 2% is assumed to be minimal and is reported as 0%) to as much as 30% performance loss. From the results, the minimum and the maximum temperatures were 56.6°C and 62.2°C. This ΔT corresponds to thermal design power of 60.2 W For microprocessors with higher thermal design power (115 W) and operating at higher clock speed, higher ΔT can be realized. Based on this paper's analysis, the optimized scenario resulted in a junction temperature of 56.6 ° C at the cost of a 14% performance loss.

[1]  Donald J. Patterson,et al.  Computer organization and design: the hardware-software interface (appendix a , 1993 .

[2]  Wei Wu,et al.  A systematic method for functional unit power estimation in microprocessors , 2006, 2006 43rd ACM/IEEE Design Automation Conference.

[3]  D. Marr,et al.  Hyper-Threading Technology Architecture and MIcroarchitecture , 2002 .

[4]  K. Sikka,et al.  An analytical temperature prediction method for a chip power map , 2005, Semiconductor Thermal Measurement and Management IEEE Twenty First Annual IEEE Symposium, 2005..

[5]  Shekhar Y. Borkar,et al.  Design challenges of technology scaling , 1999, IEEE Micro.

[6]  Guoping Xu,et al.  Substantiation of numerical analysis methodology for CPU package with non-uniform heat dissipation and heat sink with simplified fin modeling , 2004, The Ninth Intersociety Conference on Thermal and Thermomechanical Phenomena In Electronic Systems (IEEE Cat. No.04CH37543).

[7]  D. Agonafer,et al.  Design rule for minimizing thermal resistance in a non~uniformly powered microprocessor , 2006, Twenty-Second Annual IEEE Semiconductor Thermal Measurement And Management Symposium.

[8]  Chia-Pin Chiu,et al.  Cooling a Microprocessor Chip , 2006, Proceedings of the IEEE.

[9]  Kevin Skadron,et al.  Temperature-aware microarchitecture , 2003, ISCA '03.

[10]  Dereje Agonafer,et al.  Thermal Based Optimization of Functional Block Distributions in a Non-Uniformly Powered Die , 2005 .

[11]  Bahgat Sammakia,et al.  Development of a Numerical Model for Non-Uniformly Powered Die to Improve Both Thermal and Device Clock Performance , 2009 .

[12]  Hui Zeng,et al.  MPTLsim: a cycle-accurate, full-system simulator for x86-64 multicore architectures with coherent caches , 2009, CARN.

[13]  Teck Joo Goh Thermal methodology for evaluating the performance of microelectronic devices with non-uniform power dissipation , 2002, 4th Electronics Packaging Technology Conference, 2002..

[14]  Robert W. Dutton,et al.  Fast placement-dependent full chip thermal simulation , 2001, 2001 International Symposium on VLSI Technology, Systems, and Applications. Proceedings of Technical Papers (Cat. No.01TH8517).

[15]  Norman P. Jouppi,et al.  Introduction to the special issue on the 2008 workshop on design, analysis, and simulation of chip multiprocessors (dasCMP'08) , 2009, CARN.