An Integrated Approach for Managing Read Disturbs in High-Density NAND Flash Memory

The read-disturb problem is emerging as one of the main reliability issues in high-density NAND flash memory. A read-disturb error, which causes data loss, occurs to a page when a large number of reads are performed to its neighboring pages. In this paper, we propose a novel integrated approach for managing the read-disturb problem. Our approach is based on our key observations from the NAND physics that the read disturbance to neighboring pages is a function of the read voltage and the read time. Since the read disturbance has an exponential dependence on the read voltage, lowering the read voltage can improve the read-disturb resistance of a NAND block. By modifying NAND chips to support multiple read modes with different read voltages, our approach allows a flash translation layer module to exploit the tradeoff between the read disturbance and write speed. Since the read disturbance is also proportional to the read time, our approach exploits the difference in the read time among different NAND pages so that frequently read pages can be less intensively read-disturbed using fast page reads. By intelligently relocating read-intensive data to read-disturb resistant blocks and pages, our approach can reduce a large portion of the time overhead from managing read-disturb errors. We also propose a proactive data migration technique which is effective in reducing large variations in I/O response times of the existing on-demand read reclaim (RR) technique. Our experimental results show that our proposed techniques can reduce the execution time overhead by 73% over the existing read-disturb management technique while reducing I/O response time fluctuations during RR activations.

[1]  Sungjin Lee,et al.  FlashBench: A workbench for a rapid development of flash-based storage devices , 2012, 2012 23rd IEEE International Symposium on Rapid System Prototyping (RSP).

[2]  Mahmut T. Kandemir,et al.  Revisiting widely held SSD expectations and rethinking system-level implications , 2013, SIGMETRICS '13.

[3]  Calton Pu,et al.  Response Time Reliability in Cloud Environments: An Empirical Study of n-Tier Applications at High Resource Utilization , 2012, 2012 IEEE 31st Symposium on Reliable Distributed Systems.

[4]  A. Goda,et al.  25nm 64Gb 130mm² 3bpc NAND Flash Memory , 2011, 2011 3rd IEEE International Memory Workshop (IMW).

[5]  M. Lenzlinger,et al.  Fowler‐Nordheim Tunneling into Thermally Grown SiO2 , 1969 .

[6]  Joon-Sung Yang,et al.  Bit-error rate improvement of TLC NAND Flash using state re-ordering , 2012, IEICE Electron. Express.

[7]  Nikolaos Papandreou,et al.  Using adaptive read voltage thresholds to enhance the reliability of MLC NAND flash memory systems , 2014, GLSVLSI '14.

[8]  Jihong Kim,et al.  A read-disturb management technique for high-density NAND flash memory , 2013, APSys.

[9]  Nanning Zheng,et al.  LDPC-in-SSD: making advanced error correction codes work effectively in solid state drives , 2013, FAST.

[10]  Myounggon Kang,et al.  Improving Read Disturb Characteristics by Self-Boosting Read Scheme for Multilevel NAND Flash Memories , 2009 .

[11]  M. Lenzlinger,et al.  Fowler-Nordheim tunneling into thermally grown SiO 2 , 1968 .

[12]  C. Hu,et al.  Random telegraph noise in flash memories - model and technology scaling , 2007, 2007 IEEE International Electron Devices Meeting.

[13]  Tong Zhang,et al.  Exploiting Memory Device Wear-Out Dynamics to Improve NAND Flash Memory System Performance , 2011, FAST.

[14]  Sungjin Lee,et al.  Improving NAND Endurance by Dynamic Program and Erase Scaling , 2013, HotStorage.

[15]  Onur Mutlu,et al.  Threshold voltage distribution in MLC NAND flash memory: Characterization, analysis, and modeling , 2013, 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[16]  Steven Swanson,et al.  The bleak future of NAND flash memory , 2012, FAST.

[17]  Yuan-Hao Chang,et al.  Read leveling for flash storage systems , 2015, SYSTOR.

[18]  Yeong-Taek Lee,et al.  A Zeroing Cell-to-Cell Interference Page Architecture With Temporary LSB Storing and Parallel MSB Program Scheme for MLC NAND Flash Memories , 2008, IEEE Journal of Solid-State Circuits.

[19]  Dong Wook Lee,et al.  The Operation Algorithm for Improving the Reliability of TLC (Triple Level Cell) NAND Flash Characteristics , 2011, 2011 3rd IEEE International Memory Workshop (IMW).

[20]  Y. Charlie Hu,et al.  Program-Counter-Based Pattern Classification in Buffer Caching , 2004, OSDI.

[21]  Onur Mutlu,et al.  Read Disturb Errors in MLC NAND Flash Memory: Characterization, Mitigation, and Recovery , 2015, 2015 45th Annual IEEE/IFIP International Conference on Dependable Systems and Networks.

[22]  Young-Hyun Jun,et al.  A new 3-bit programming algorithm using SLC-to-TLC migration for 8MB/s high performance TLC NAND flash memory , 2012, 2012 Symposium on VLSI Circuits (VLSIC).

[23]  Andrew A. Chien,et al.  Moore's Law: The First Ending and a New Beginning , 2013, Computer.