Static energy efficient cache reconfiguration for dynamic NUCA in tiled CMPs

Rapid growth in semiconductor technology permits to integrate multiple number of processor cores with multi-level on-chip caches. Integration of more on-chip components increases the on-chip power density. As per recent studies, on-chip caches are the principal contributors to the total power consumed by the chip. This cache power consumption can be divided into two major parts: dynamic power and static power. Dynamic power is consumed during cache accesses and static power is referred to as the leakage power of the cache. This increased power consumption increases effective chip-temperature which in turn increases the leakage power. In this paper we attempt to reduce the static power consumption by powering off cache ways from the cache banks of a Tiled DNUCA cache. We use a bank utilisation based criteria for the way shutdown decision. The number of ways to be turned off from a bank is chosen based on bank's usage statistics. The contents of the powered off cache ways are written back to main memory. Thus, depending upon the application's working set size and data distribution, a controlled number of ways from a set of banks can be dynamically shutdown to save leakage power dissipation. For a 4MB 8 way L2 Tiled DNUCA cache, experimental analysis shows 17% reduction in EDP and 33% reduction in the static power. The powered-off ways are also aligned, simplifying the gating circuitry.

[1]  Norman P. Jouppi,et al.  CACTI 6.0: A Tool to Model Large Caches , 2009 .

[2]  Jung Ho Ahn,et al.  McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures , 2009, 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[3]  Fredrik Larsson,et al.  Simics: A Full System Simulation Platform , 2002, Computer.

[4]  Julio Sahuquillo,et al.  Drowsy cache partitioning for reduced static and dynamic energy in the cache hierarchy , 2013, 2013 International Green Computing Conference Proceedings.

[5]  Sharad Malik,et al.  Orion: a power-performance simulator for interconnection networks , 2002, MICRO.

[6]  Milo M. K. Martin,et al.  Multifacet's general execution-driven multiprocessor simulator (GEMS) toolset , 2005, CARN.

[7]  Subramanian Ramaswamy,et al.  Improving cache efficiency via resizing + remapping , 2007, 2007 25th International Conference on Computer Design.

[8]  N. Muralimanohar,et al.  CACTI 6 . 0 : A Tool to Understand Large Caches , 2007 .

[9]  Kaushik Roy,et al.  Gated-Vdd: a circuit technique to reduce leakage in deep-submicron cache memories , 2000, ISLPED '00.

[10]  Pierfrancesco Foglia,et al.  A workload independent energy reduction strategy for D-NUCA caches , 2013, The Journal of Supercomputing.

[11]  Alessandro Bardine,et al.  Way adaptable D-NUCA caches , 2010, Int. J. High Perform. Syst. Archit..

[12]  Eric Rotenberg,et al.  Adaptive mode control: A static-power-efficient cache design , 2003, TECS.

[13]  Shirshendu Das,et al.  Exploration of Migration and Replacement Policies for Dynamic NUCA over Tiled CMPs , 2015, 2015 28th International Conference on VLSI Design.

[14]  Shirshendu Das,et al.  Performance Constrained Static Energy Reduction Using Way-Sharing Target-Banks , 2015, 2015 IEEE International Parallel and Distributed Processing Symposium Workshop.

[15]  Niraj K. Jha,et al.  GARNET: A detailed on-chip network model inside a full-system simulator , 2009, 2009 IEEE International Symposium on Performance Analysis of Systems and Software.

[16]  Kai Ma,et al.  Cache Latency Control for Application Fairness or Differentiation in Power-Constrained Chip Multiprocessors , 2012, IEEE Transactions on Computers.

[17]  Shirshendu Das,et al.  Static energy reduction by performance linked cache capacity management in tiled CMPs , 2015, SAC.

[18]  Bharadwaj Amrutur,et al.  Adaptive Power Optimization of On-chip SNUCA Cache on Tiled Chip Multicore Architecture Using Remap Policy , 2011, 2011 Second Workshop on Architecture and Multi-Core Applications (wamca 2011).

[19]  Alessandro Bardine,et al.  Analysis of static and dynamic energy consumption in NUCA caches: initial results , 2007, MEDEA '07.

[20]  Sparsh Mittal,et al.  A survey of architectural techniques for improving cache power efficiency , 2014, Sustain. Comput. Informatics Syst..

[21]  Ann Gordon-Ross,et al.  A survey on cache tuning from a power/energy perspective , 2013, CSUR.

[22]  Norman P. Jouppi,et al.  Multi-Core Cache Hierarchies , 2011, Multi-Core Cache Hierarchies.

[23]  Kai Li,et al.  The PARSEC benchmark suite: Characterization and architectural implications , 2008, 2008 International Conference on Parallel Architectures and Compilation Techniques (PACT).

[24]  Mahmut T. Kandemir,et al.  Leakage Current: Moore's Law Meets Static Power , 2003, Computer.