Event-Driven Thermal Management in SMP Systems

Actions usually taken to prevent processors from overheating, such as decreasing the frequency or stopping the execution flow, also degrade performance. Multiprocessor systems, however, offer the possibility of moving the task which caused a CPU to overheat away to some other, cooler CPU, so throttling becomes only a last resort taken if all of a system’s processors are hot. Additionally, the different energy characteristics that different tasks are showing can be exploited and hot tasks as well as cool tasks can be distributed evenly among all CPUs. This work presents a mechanism for determining the energy characteristics of tasks by means of event monitoring counters, and an energy–aware scheduling policy, which strives to assign tasks to CPUs in a way that avoids overheating individual CPUs. We implemented energy–aware scheduling for the Linux kernel. Evaluations show that the overhead incurred by additional task migrations is negligible compared to the benefit of avoiding throttling.

[1]  Peter Druschel,et al.  Resource containers: a new facility for resource management in server systems , 1999, OSDI '99.

[2]  Margaret Martonosi,et al.  Wattch: a framework for architectural-level power analysis and optimizations , 2000, Proceedings of 27th International Symposium on Computer Architecture (IEEE Cat. No.RS00201).

[3]  Frank Bellosa,et al.  The Case for Event-Driven Energy Accounting , 2001 .

[4]  Frank Bellosa,et al.  Process cruise control: event-driven clock scaling for dynamic power management , 2002, CASES '02.

[5]  Niraj K. Jha,et al.  High-level energy macromodeling of embedded software , 2002, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[6]  Frank Bellosa,et al.  Dynamic Thermal Management for Distributed Systems , 2002 .

[7]  Frank Bellosa,et al.  Event-Driven Energy Accounting for Dynamic Thermal Management , 2002 .

[8]  Margaret Martonosi,et al.  Runtime Power Monitoring in High-End Processors: Methodology and Empirical Data , 2003, MICRO.

[9]  Kevin Skadron,et al.  Temperature-aware microarchitecture , 2003, ISCA '03.

[10]  Krste Asanovic,et al.  Reducing power density through activity migration , 2003, ISLPED '03.

[11]  T. N. Vijaykumar,et al.  Heat-and-run: leveraging SMT and CMP to manage power density through the operating system , 2004, ASPLOS XI.

[12]  Kevin Skadron,et al.  Compact thermal modeling for temperature-aware design , 2004, Proceedings. 41st Design Automation Conference, 2004..

[13]  Kevin Skadron,et al.  Using performance counters for runtime temperature sensing in high-performance processors , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.