Asymmetric-access aware optimization for STT-RAM caches with process variations

STT-RAM (Spin Transfer Torque Random Access Memory) has been extensively researched as a potential replacement of SRAM (Static RAM) as on-chip caches. Prior work has shown that STT-RAM caches can improve performance and reduce power consumption because of its advantages of high density, fast read speed, low standby power, etc. However, under the impact of process variations, using worst-case design can induce significant performance and power overhead in STT-RAM caches. In order to overcome the problem of process variations, we propose to apply the variable-latency access method to STT-RAM caches by introducing a variation-aware LRU (Least Recently Used) policy. Moreover, we show that simply applying traditional variable-latency access method is inefficient due to the read/write asymmetry. First, we demonstrate that a write-oriented data migration is preferred. Second, a block remapping is necessary to prevent some cache sets from being significantly affected by process variations. After using our techniques, the experimental results show that the performance can be improved by 13.8% and power consumption can be reduced by 14.1% compared to a prior approach [3].

[1]  Hidetoshi Onodera,et al.  A Detailed Vth-Variation Analysis for Sub-100-nm Embedded SRAM Design , 2006, 2006 IEEE International SOC Conference.

[2]  Jung Ho Ahn,et al.  Matching cache access behavior and bit error pattern for high performance low Vcc L1 cache , 2011, 2011 48th ACM/EDAC/IEEE Design Automation Conference (DAC).

[3]  David Blaauw,et al.  Statistical Timing Analysis for Intra-Die Process Variations with Spatial Correlations , 2003, ICCAD 2003.

[4]  Yu Wang,et al.  Improving energy efficiency of write-asymmetric memories by log style write , 2012, ISLPED '12.

[5]  Wenqing Wu,et al.  Multi retention level STT-RAM cache designs with a dynamic refresh scheme , 2011, 2011 44th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[6]  Madhu Mutyam,et al.  Block remap with turnoff: A variation-tolerant cache design technique , 2008, 2008 Asia and South Pacific Design Automation Conference.

[7]  Csaba Andras Moritz,et al.  Data Memory Subsystem Resilient to Process Variations , 2008, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[8]  Yu Wang,et al.  PS3-RAM: A fast portable and scalable statistical STT-RAM reliability analysis method , 2012, DAC Design Automation Conference 2012.

[9]  Rajendran Panda,et al.  Statistical delay computation considering spatial correlations , 2003, ASP-DAC '03.

[10]  Yehea I. Ismail,et al.  Variable latency caches for nanoscale processor , 2007, Proceedings of the 2007 ACM/IEEE Conference on Supercomputing (SC '07).

[11]  Cong Xu,et al.  Modeling and design exploration of FBDRAM as on-chip memory , 2012, 2012 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[12]  Aviral Shrivastava,et al.  LA-LRU: A Latency-Aware Replacement Policy for Variation Tolerant Caches , 2011, 2011 24th Internatioal Conference on VLSI Design.

[13]  Cong Xu,et al.  NVSim: A Circuit-Level Performance, Energy, and Area Model for Emerging Nonvolatile Memory , 2012, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[14]  Hai Li,et al.  Process variation aware data management for STT-RAM cache design , 2012, ISLPED '12.