Accurate Temperature Estimation Using Noisy Thermal Sensors for Gaussian and Non-Gaussian Cases

Multicore system-on-chips (SOCs) rely on runtime thermal monitoring using on-chip thermal sensors for dynamic thermal management (DTM). However, on-chip sensors are highly susceptible to noise due to fabrication randomness, VDD fluctuations, etc. This causes discrepancy between the actual temperature and the one observed by thermal sensor. In this paper, we address the problem of estimating the accurate temperature of on-chip thermal sensor when the sensor reading has been corrupted by noise. We present statistical techniques for the following: 1) when the underlying randomness exhibits jointly-Gaussian characteristics we present the optimal solution for temperature estimation; 2) for close to Gaussian cases we give a heuristic based on Moment Matching; 3) when the underlying randomness is non-Gaussian a hypothesis testing framework is used to predict the sensor temperatures. The previous three techniques are investigated in both single sensor and multisensor scenarios, respectively. The latter tries to estimate the actual temperatures for several sensors simultaneously while exploiting the correlations in temperature and circuit parameters among different sensors. The experiments showed that using our estimation schemes the root mean square (RMS) error can be reduce (with very small runtime overhead) by 71.5% as compared to blindly trusting the sensors to be noise-free.

[1]  Tajana Simunic,et al.  Accurate Temperature Estimation for Efficient Thermal Management , 2008, 9th International Symposium on Quality Electronic Design (isqed 2008).

[2]  Greg Hamerly,et al.  SimPoint 3.0: Faster and More Flexible Program Analysis , 2005 .

[3]  B. Cline,et al.  Analysis and modeling of CD variation for statistical static timing , 2006, ICCAD '06.

[4]  S. Nassif,et al.  Delay variability: sources, impacts and trends , 2000, 2000 IEEE International Solid-State Circuits Conference. Digest of Technical Papers (Cat. No.00CH37056).

[5]  S. M. Sze Physics of semiconductor devices /2nd edition/ , 1981 .

[6]  Charlie Chung-Ping Chen,et al.  3-D Thermal-ADI: a linear-time chip level transient thermal simulator , 2002, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[7]  Yufu Zhang,et al.  Accurate temperature estimation using noisy thermal sensors , 2009, 2009 46th ACM/IEEE Design Automation Conference.

[8]  Lawrence T. Pileggi,et al.  Efficient full-chip thermal modeling and analysis , 2004, IEEE/ACM International Conference on Computer Aided Design, 2004. ICCAD-2004..

[9]  Kevin Skadron,et al.  Temperature-aware microarchitecture , 2003, ISCA '03.

[10]  Margaret Martonosi,et al.  Dynamic thermal management for high-performance microprocessors , 2001, Proceedings HPCA Seventh International Symposium on High-Performance Computer Architecture.

[11]  Alan J. Weger,et al.  Thermal-aware task scheduling at the system software level , 2007, Proceedings of the 2007 international symposium on Low power electronics and design (ISLPED '07).

[12]  Karim Arabi,et al.  Built-in temperature sensors for on-line thermal monitoring of microelectronic structures , 1997, Proceedings International Conference on Computer Design VLSI in Computers and Processors.

[13]  Margaret Martonosi,et al.  Wattch: a framework for architectural-level power analysis and optimizations , 2000, Proceedings of 27th International Symposium on Computer Architecture (IEEE Cat. No.RS00201).

[14]  Robert G. Meyer,et al.  Modeling and analysis of substrate coupling in integrated circuits , 1996 .

[15]  W. Burleson,et al.  Low-power and robust on-chip thermal sensing using differential ring oscillators , 2007, 2007 50th Midwest Symposium on Circuits and Systems.

[16]  Ying-Yen Chen,et al.  Extraction of Statistical Timing Profiles Using Test Data , 2007, 2007 44th ACM/IEEE Design Automation Conference.

[17]  Seda Ogrenci Memik,et al.  Optimizing Thermal Sensor Allocation for Microprocessors , 2008, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[18]  Jinjun Xiong,et al.  Non-Linear Statistical Static Timing Analysis for Non-Gaussian Variation Sources , 2007, 2007 44th ACM/IEEE Design Automation Conference.

[19]  Ali M. Niknejad,et al.  Numerically stable Green function for modeling and analysis of substrate coupling in integrated circuits , 1998, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[20]  Anand Sivasubramaniam,et al.  Dynamic Thermal Management for High-Performance Storage Systems , 2012, Handbook of Energy-Aware and Green Computing.

[21]  A. Sedra Microelectronic circuits , 1982 .

[22]  Brad Calder,et al.  SimPoint 3.0: Faster and More Flexible Program Phase Analysis , 2005, J. Instr. Level Parallelism.

[23]  Massoud Pedram,et al.  A stochastic local hot spot alerting technique , 2008, 2008 Asia and South Pacific Design Automation Conference.

[24]  Tajana Simunic,et al.  Proactive temperature management in MPSoCs , 2008, Proceeding of the 13th international symposium on Low power electronics and design (ISLPED '08).

[25]  Sailes K. Sengijpta Fundamentals of Statistical Signal Processing: Estimation Theory , 1995 .

[26]  Kevin Skadron,et al.  Temperature-aware microarchitecture: Modeling and implementation , 2004, TACO.

[27]  Sachin S. Sapatnekar,et al.  High-Efficiency Green Function-Based Thermal Simulation Algorithms , 2007, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[28]  R. Mukherjee,et al.  Thermal Sensor Allocation and Placement for Reconfigurable Systems , 2006, 2006 IEEE/ACM International Conference on Computer Aided Design.

[29]  S. M. Sze,et al.  Physics of semiconductor devices , 1969 .

[30]  Taewhan Kim,et al.  Optimal allocation and placement of thermal sensors for reconfigurable systems and its practical extension , 2008, 2008 Asia and South Pacific Design Automation Conference.