Exploiting Soft Redundancy for Error-Resilient On-Chip Memory Design

Memory design is facing the upcoming challenges due to a combination of technology scaling and higher levels of integration and system complexity. In particular, memory circuits become vulnerable to transient (soft) errors caused by particle strikes and process spread. In this paper, we propose a new error-tolerance technique referred to as the soft redundancy for on-chip memory design. Program runtime variations in memory spatial locality cause wasted memory spaces occupied by the irrelevant data. The proposed soft-redundancy allocated memory exploits these wasted memory spaces to achieve efficient memory access and effective error protection in a coherent manner. Simulation results on the SPEC CPU2000 benchmarks demonstrate 73.7% average error protection coverage ratio on the 23 benchmarks, with average of 52% and 48.3% reduction in memory miss rate and bandwidth requirement, respectively, as compared to the existing techniques

[1]  Aneesh Aggarwal,et al.  Increasing the cache efficiency by eliminating noise , 2006, The Twelfth International Symposium on High-Performance Computer Architecture, 2006..

[2]  Rajesh K. Gupta,et al.  Adapting cache line size to application behavior , 1999, ICS '99.

[3]  Joel S. Emer,et al.  The soft error problem: an architectural perspective , 2005, 11th International Symposium on High-Performance Computer Architecture.

[4]  Kazuaki Murakami,et al.  Dynamically variable line-size cache exploiting high on-chip memory bandwidth of merged DRAM/logic LSIs , 1999, Proceedings Fifth International Symposium on High-Performance Computer Architecture.

[5]  Pinaki Mazumder,et al.  A physical design tool for built-in self-repairable RAMs , 2001, IEEE Trans. Very Large Scale Integr. Syst..

[6]  Lloyd W. Massengill,et al.  Impact of scaling on soft-error rates in commercial microprocessors , 2002 .

[7]  Peter Hazucha,et al.  Characterization of soft errors caused by single event upsets in CMOS processes , 2004, IEEE Transactions on Dependable and Secure Computing.

[8]  M. Calvet,et al.  Simulation of nucleon-induced nuclear reactions in a simplified SRAM structure: scaling effects on SEU and MBU cross sections , 2001 .

[9]  M. Milenkovic,et al.  Exploiting streams in instruction and data address trace compression , 2003, 2003 IEEE International Conference on Communications (Cat. No.03CH37441).

[10]  H. Hughes,et al.  Radiation effects and hardening of MOS technology: devices and circuits , 2003 .

[11]  Frank Vahid,et al.  Energy benefits of a configurable line size cache for embedded systems , 2003, IEEE Computer Society Annual Symposium on VLSI, 2003. Proceedings..

[12]  Konstantinos Konstantinides,et al.  Design and evaluation of an architecture for a digital signal processor for instrumentation applications , 1990, IEEE Trans. Acoust. Speech Signal Process..

[13]  Tuvi Etzion,et al.  Optimal codes for single-error correction, double-adjacent-error detection , 2000, IEEE Trans. Inf. Theory.