Minimizing Read Seeks for SMR Disk

Log-structured storage systems are file or block storage systems which write data in temporal order, rather using e.g. LBA or other spatial information to determine physical location. They are widely used as translation layers for flash and now Shingled Magnetic Recording (SMR) disk, performing the out-of-place write function needed for media where direct overwriting of data is not possible. Although certain aspects of log-structured storage system performance (i.e. write amplification) have been extensively studied, few or no efforts have examined the effect of seek overhead on workloads in this context. We examine seek overheads due to logstructured writing across a range of well-known and more recent block I/O traces, demonstrating that while some workloads are relatively unaffected, others suffer significantly. In addition we propose two novel mechanisms,opportunistic defragmentation and translation aware selective caching, for seek reduction in log-structured systems, as well as an application of existing disk read-ahead techniques (which we name translation aware look-ahead-behind prefetching), and demonstrate that these mechanisms greatly reduce or even eliminate increases in read seek cost in almost all workloads examined. We evaluate our proposed techniques using a metric we define, seek amplification factor. Results show up to 4x, 4x and 18x improvement of seek amplification factor among the studied workloads for opportunistic defragmentation, translation aware selective caching and translation aware selective caching respectively.

[1]  Sang-Won Lee,et al.  A log buffer-based flash translation layer using fully-associative sector translation , 2007, TECS.

[2]  Zvonimir Bandic,et al.  Shingled file system host-side management of Shingled Magnetic Recording disks , 2012, 2012 IEEE International Conference on Consumer Electronics (ICCE).

[3]  Mansour Shafaei,et al.  Virtual Guard: A Track-Based Translation Layer for Shingled Disks , 2017, HotStorage.

[4]  Larry J. Stockmeyer,et al.  An Age-Threshold Algorithm for Garbage Collection in Log-Structured Arrays and File Systems , 1998 .

[5]  Chao Jin,et al.  HiSMRfs: A high performance file system for shingled storage array , 2014, 2014 30th Symposium on Mass Storage Systems and Technologies (MSST).

[6]  Antony I. T. Rowstron,et al.  Write off-loading: Practical power management for enterprise storage , 2008, TOS.

[7]  Fred Douglis,et al.  Beating the I/O bottleneck: a case for log-structured file systems , 1989, OPSR.

[8]  P. Desnoyers,et al.  Skylight—A Window on Shingled Disk Operation , 2015, FAST.

[9]  Jongmoo Choi,et al.  Model and Validation of Block Cleaning Cost for Flash Memory , 2007, SAMOS.

[10]  Benny Van Houdt,et al.  A mean field model for a class of garbage collection algorithms in flash-based solid state drives , 2013, Queueing Systems.

[11]  Peter Desnoyers,et al.  Analytic Models of SSD Write Performance , 2014, TOS.

[12]  Garth A. Gibson,et al.  Caveat-Scriptor: Write Anywhere Shingled Disks , 2015, HotStorage.

[13]  Evangelos Eleftheriou,et al.  Write amplification analysis in flash-based solid state drives , 2009, SYSTOR '09.

[14]  Mahmut T. Kandemir,et al.  An Evaluation of Different Page Allocation Strategies on High-Speed SSDs , 2012, HotStorage.

[15]  Kanad Ghose,et al.  yFS: A Journaling File System Design for Handling Large Data Sets with Reduced Seeking , 2003, FAST.

[16]  David Hung-Chang Du,et al.  SMaRT: An Approach to Shingled Magnetic Recording Translation , 2017, FAST.

[17]  Jun Wang,et al.  WOLF - A Novel Reordering Write Buffer to Boost the Performance of Log-Structured File Systems , 2002, FAST.

[18]  Brian M. Kurkoski,et al.  An improved analytic expression for write amplification in NAND flash , 2011, 2012 International Conference on Computing, Networking and Communications (ICNC).

[19]  Irfan Ahmad,et al.  Efficient MRC Construction with SHARDS , 2015, FAST.

[20]  Peter Desnoyers,et al.  Performance models of flash-based solid-state drives for real workloads , 2011, 2011 IEEE 27th Symposium on Mass Storage Systems and Technologies (MSST).

[21]  Mendel Rosenblum,et al.  The design and implementation of a log-structured file system , 1991, SOSP '91.

[22]  John T. Robinson Analysis of steady-state segment storage utilizations in a log-structured file system with least-utilized segment cleaning , 1996, OPSR.

[23]  Zvonimir Bandic,et al.  Indirection systems for shingled-recording disk drives , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[24]  Youngjae Kim,et al.  DFTL: a flash translation layer employing demand-based selective caching of page-level address mappings , 2009, ASPLOS.

[25]  David R. Cheriton,et al.  Log files: an extended file service exploiting write-once storage , 1987, SOSP '87.

[26]  Abutalib Aghayev,et al.  Modeling Drive-Managed SMR Performance , 2017, ACM Trans. Storage.

[27]  Ethan L. Miller,et al.  SMRDB: key-value data store for shingled magnetic recording disks , 2015, SYSTOR.