A mean field model for a class of garbage collection algorithms in flash-based solid state drives

Garbage collection (GC) algorithms play a key role in reducing the write amplification in flash-based solid state drives, where the write amplification affects the lifespan and speed of the drive. This paper introduces a mean field model to assess the write amplification and the distribution of the number of valid pages per block for a class C of GC algorithms. Apart from the Random GC algorithm, class C includes two novel GC algorithms: the d-Choices GC algorithm, that selects d blocks uniformly at random and erases the block containing the least number of valid pages among the $d$ selected blocks, and the Random++ GC algorithm, that repeatedly selects another block uniformly at random until it finds a block with a lower than average number of valid blocks. Using simulation experiments we show that the proposed mean field model is highly accurate in predicting the write amplification (for drives with $N=50000$ blocks). We further show that the d-Choices GC algorithm has a write amplification close to that of the Greedy GC algorithm even for small d values, e.g., d = 10, and offers a more attractive trade-off between its simplicity and its performance than the Windowed GC algorithm introduced and analyzed in earlier studies. The Random++ algorithm is shown to be less effective as it is even inferior to the FIFO algorithm when the number of pages $b$ per block is large (e.g., for b ≥ 64).

[1]  Xiaodong Zhang,et al.  Understanding intrinsic characteristics and system implications of flash memory based solid state drives , 2009, SIGMETRICS '09.

[2]  Werner Bux,et al.  Performance of greedy garbage collection in flash-based solid-state drives , 2010, Perform. Evaluation.

[3]  M. Benaïm,et al.  A class of mean field interaction models for computer and communication systems , 2008, 2008 6th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks and Workshops.

[4]  Jean-Yves Le Boudec,et al.  A class of mean field interaction models for computer and communication systems , 2008, Perform. Evaluation.

[5]  Ramesh K. Sitaraman,et al.  The power of two random choices: a survey of tech-niques and results , 2001 .

[6]  Peter Desnoyers,et al.  Analytic modeling of SSD write performance , 2012, SYSTOR '12.

[7]  John T. Robinson Analysis of steady-state segment storage utilizations in a log-structured file system with least-utilized segment cleaning , 1996, OPSR.

[8]  Jaishankar Moothedath Menon,et al.  A performance comparison of RAID-5 and log-structured arrays , 1995, Proceedings of the Fourth IEEE International Symposium on High Performance Distributed Computing.

[9]  Steven Swanson,et al.  The bleak future of NAND flash memory , 2012, FAST.

[10]  Bruno Gaujal,et al.  Markov chains with discontinuous drifts have differential inclusions limits , 2012 .

[11]  Brian M. Kurkoski,et al.  An improved analytic expression for write amplification in NAND flash , 2011, 2012 International Conference on Computing, Networking and Communications (ICNC).

[12]  Sang-Won Lee,et al.  SFS: random write considered harmful in solid state drives , 2012, FAST.

[13]  Evangelos Eleftheriou,et al.  Write amplification analysis in flash-based solid state drives , 2009, SYSTOR '09.

[14]  John C. S. Lui,et al.  Stochastic modeling of large-scale solid-state storage systems: analysis, design tradeoffs and optimization , 2013, SIGMETRICS '13.

[15]  Marcus Marrow,et al.  A closed-form expression for write amplification in NAND Flash , 2010, 2010 IEEE Globecom Workshops.

[16]  Benny Van Houdt,et al.  Performance of garbage collection algorithms for flash-based solid state drives with hot/cold data , 2013, Perform. Evaluation.

[17]  Heeseung Jo,et al.  A superblock-based flash translation layer for NAND flash memory , 2006, EMSOFT '06.

[18]  R. L. Dobrushin,et al.  Queueing system with selection of the shortest of two queues: an assymptotic approach , 1996 .

[19]  Eli Upfal,et al.  Balanced Allocations , 1999, SIAM J. Comput..

[20]  Mendel Rosenblum,et al.  The design and implementation of a log-structured file system , 1991, SOSP '91.