Modeling Energy Dissipation in Low Power Caches

Modern microprocessors employ one or two levels of on–chip caches that are implemented using static RAM and take up a large portion of the Silicon real estate, consuming a significant amount of power. We present detailed analytical models for estimating the energy dissipated in conventionally–organized caches as well as caches that are organized to have reduced energy dissipations. We also validate the accuracy of these analytical models by comparing the analytically obtained power dissipations with the dissipations obtained using a detailed simulator called CAPE (CAache Power Estimator) for the simulated execution of many SPEC 95 benchmarks. Our analytical models for the energy dissipation in caches use counts for hits, misses, as well as the number of reads and writes, which are obtained from an architectural simulator, and assumes transitions due to the actual values of the address and data bits to be stochastically distributed. We show that the analytical models for conventional caches estimate the overall power dissipation within less than 10% error (compared to the energy estimated from actual transition counts). On the average, the analytical model overpredict the energy dissipations by about 3% to 15%. However, for some of the energy efficient cache organizations, the analytical model over–predict the energy dissipations by as much as 20%. We compare the results of our analysis with actual transition–count based power estimations obtained from CAPE to show that the inaccuracies in the analytical models are due to correlated transitions, a factor that actually plays a significant role in making some cache organizations energy efficient.

[1]  T. Wada,et al.  An analytical access time model for on-chip cache memories , 1992 .

[2]  P. Boyle,et al.  A 300-MHz 115-W 32-b bipolar ECL microprocessor , 1993 .

[3]  Jim Handy,et al.  The cache memory book , 1993 .

[4]  Mircea R. Stan,et al.  Limited-weight codes for low-power I/O , 1994 .

[5]  Norman P. Jouppi,et al.  WRL Research Report 93/5: An Enhanced Access and Cycle Time Model for On-chip Caches , 1994 .

[6]  Mircea R. Stan,et al.  Bus-invert coding for low-power I/O , 1995, IEEE Trans. Very Large Scale Integr. Syst..

[7]  Uming Ko,et al.  Energy optimization of multi-level processor cache architectures , 1995, ISLPED '95.

[8]  David A. Rennels,et al.  Reducing the frequency of tag compares for low power I-cache design , 1995, ISLPED '95.

[9]  Ruben W. Castelino,et al.  Internal Organization of the Alpha 21164, a 300-MHz 64-bit Quad-issue CMOS RISC Microprocessor , 1995, Digit. Tech. J..

[10]  Paul D. Franzon,et al.  Energy consumption modeling and optimization for SRAM's , 1995, IEEE J. Solid State Circuits.

[11]  Richard T. Witek,et al.  A 160 MHz 32 b 0.5 W CMOS RISC microprocessor , 1996, 1996 IEEE International Solid-State Circuits Conference. Digest of TEchnical Papers, ISSCC.

[12]  K. Yelick,et al.  The Energy Efficiency Of Iram Architectures , 1997, Conference Proceedings. The 24th Annual International Symposium on Computer Architecture.

[13]  Kanad Ghose,et al.  Energy-efficiency of VLSI caches: a comparative study , 1997, Proceedings Tenth International Conference on VLSI Design.

[14]  Kanad Ghose,et al.  Analytical energy dissipation models for low-power caches , 1997, ISLPED '97.

[15]  K. Kavi Cache Memories Cache Memories in Uniprocessors. Reading versus Writing. Improving Performance , 2022 .

[16]  Bhanu Kapoor,et al.  Low power memory architectures for video applications , 1998, Proceedings of the 8th Great Lakes Symposium on VLSI (Cat. No.98TB100222).

[17]  Taewhan Kim,et al.  Bus-invert coding for low-power I/O - a decomposition approach , 2000, Proceedings of the 43rd IEEE Midwest Symposium on Circuits and Systems (Cat.No.CH37144).