A Low Cost Multi-Tiered Approach to Improving the Reliability of Multi-Level Cell Pram

Phase change RAM (PRAM) is a promising memory technology because of its fast read access time, very low standby power and high storage density. Multi-level Cell (MLC) PRAM, which has been introduced to further improve the storage density, comes at a price of lower reliability. This paper focuses on a cost-effective solution for improving the reliability of MLC-PRAM. As a first step, we study in detail the causes of hard and soft errors and develop error models to capture these effects. Next we propose a multi-tiered approach that spans architecture, circuit and system levels to increase the reliability. At the architecture level, we use a combination of Gray code encoding and 2-bit interleaving to partition the errors so that a lower strength error control coding (ECC) scheme can be used for half of the bits. We use subblock flipping and threshold resistance tuning to reduce the number of errors in the remaining bits. For even higher reliability, we use a simple BCH based ECC on top of these techniques. We show that the proposed multi-tiered approach enables us to use ECC with 2-error correction capability (t = 2) instead of one with t = 8 to achieve a block failure rate (BFR) of 10−8. We propose to use a non-iterative algorithm to implement the BCH t = 2 decoder because of its small latency. We evaluate the latency and energy overhead of the proposed scheme using CACTI and the IPC performance using GEM5. We show that for SPEC CINT 2006 and DaCapo benchmarks, the proposed system can achieve BFR = 10−8 with 2.2 % IPC reduction and 7 % additional energy compared to a memory without any error correction capability.

[1]  Wei Liu,et al.  VLSI Implementation of BCH Error Correction for Multilevel Cell NAND Flash Memory , 2010, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[2]  Kinarn Kim,et al.  Reliability investigations for manufacturable high density PRAM , 2005, 2005 IEEE International Reliability Physics Symposium, 2005. Proceedings. 43rd Annual..

[3]  Chaitali Chakrabarti,et al.  Multi-Tiered Approach to Improving the Reliability of Multi-Level Cell PRAM , 2012, 2012 IEEE Workshop on Signal Processing Systems.

[4]  Yuan Xie,et al.  PCRAMsim: System-level performance, energy, and area modeling for Phase-Change RAM , 2009, 2009 IEEE/ACM International Conference on Computer-Aided Design - Digest of Technical Papers.

[5]  Vijayalakshmi Srinivasan,et al.  Scalable high performance main memory system using phase-change memory technology , 2009, ISCA '09.

[6]  C. Wayne Walker New Formulas for Solving Quadratic Equations over Certain Finite Fields , 1999, IEEE Trans. Inf. Theory.

[7]  Cong Xu,et al.  NVSim: A Circuit-Level Performance, Energy, and Area Model for Emerging Nonvolatile Memory , 2012, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[8]  Robert H. Deng,et al.  Decoding of DBEC-TBED Reed-Solomon Codes , 1987, IEEE Transactions on Computers.

[9]  Lin Li,et al.  Scaling analysis of Phase Change Memory (PCM) driving devices , 2008, 2008 IEEE International Conference on Electron Devices and Solid-State Circuits.

[10]  H.-S. Philip Wong,et al.  Phase Change Memory , 2010, Proceedings of the IEEE.

[11]  Tajana Simunic,et al.  PDRAM: A hybrid PRAM and DRAM main memory system , 2009, 2009 46th ACM/IEEE Design Automation Conference.

[12]  Guangyu Sun,et al.  A Hybrid solid-state storage architecture for the performance, energy consumption, and lifetime improvement , 2010, HPCA 2010.

[13]  D. Elmhurst,et al.  A 1.8-V 128-Mb 125-MHz multilevel cell memory with flexible read while write , 2003 .

[14]  Guido Torelli,et al.  A Bipolar-Selected Phase Change Memory Featuring Multi-Level Cell Storage , 2009, IEEE Journal of Solid-State Circuits.

[15]  Chaitali Chakrabarti,et al.  Hierarchical modeling of Phase Change memory for reliable design , 2012, 2012 IEEE 30th International Conference on Computer Design (ICCD).

[16]  K. Gopalakrishnan,et al.  Phase change memory technology , 2010, 1001.1164.

[17]  D. Ielmini,et al.  Reliability Impact of Chalcogenide-Structure Relaxation in Phase-Change Memory (PCM) Cells—Part II: Physics-Based Modeling , 2009, IEEE Transactions on Electron Devices.

[18]  Norman P. Jouppi,et al.  FREE-p: Protecting non-volatile memory against both hard and soft errors , 2011, 2011 IEEE 17th International Symposium on High Performance Computer Architecture.

[19]  Hsien-Hsin S. Lee,et al.  SAFER: Stuck-At-Fault Error Recovery for Memories , 2010, 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture.

[20]  Shyue-Win Wei,et al.  High-speed hardware decoder for double-error-correcting binary BCH codes , 1989 .

[21]  Karin Strauss,et al.  Use ECP, not ECC, for hard failures in resistive memories , 2010, ISCA.

[22]  Byung-Gil Choi,et al.  A 0.1-$\mu{\hbox {m}}$ 1.8-V 256-Mb Phase-Change Random Access Memory (PRAM) With 66-MHz Synchronous Burst-Read Operation , 2007, IEEE Journal of Solid-State Circuits.

[23]  Three-Dimensional Numerical Simulation of Switching Dynamics for Cylindrical-Shaped Phase Change Memory , 2008, 2008 11th IEEE International Conference on Computational Science and Engineering - Workshops.

[24]  Dhiraj K. Pradhan,et al.  A Routing-Aware ILS Design Technique , 2011, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[25]  Wei Xu,et al.  A Time-Aware Fault Tolerance Scheme to Improve Reliability of Multilevel Phase-Change Memory in the Presence of Significant Resistance Drift , 2011, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.