Geriatrix: Aging what you see and what you don't see. A file system aging approach for modern storage systems

File system performance on modern primary storage devices (Flash-based SSDs) is greatly affected by aging of the free space, much more so than were mechanical disk drives. We introduce Geriatrix, a simple-to-use profile driven file system aging tool that induces target levels of fragmentation in both allocated files (what you see) and remaining free space (what you don't see), unlike previous approaches that focus on just the former. This paper describes and evaluates the effectiveness of Geriatrix, showing that it recreates both fragmentation effects better than previous approaches. Using Geriatrix, we show that measurements presented in many recent file systems papers are higher than should be expected, by up to 30% on mechanical (HDD) and up to 80% on Flash (SSD) disks. Worse, in some cases, the performance rank ordering of file system designs being compared are different from the published results. Geriatrix will be released as open source software with eight built-in aging profiles, in the hopes that it can address the need created by the increased performance impact of file system aging in modern SSD-based storage.

[1]  E. I. Organick,et al.  The Multics Input/Output system , 1971, SOSP '71.

[2]  Douglas Comer,et al.  Ubiquitous B-Tree , 1979, CSUR.

[3]  Samuel J. Leffler,et al.  A Fast File System for UNIX (Revised July 27, 1983) , 1983 .

[4]  Ken Thompson,et al.  The UNIX time-sharing system , 1974, CACM.

[5]  J. Howard Et El,et al.  Scale and performance in a distributed file system , 1988 .

[6]  Mendel Rosenblum,et al.  The design and implementation of a log-structured file system , 1991, SOSP '91.

[7]  James Lau,et al.  File System Design for an NFS File Server Appliance , 1994, USENIX Winter.

[8]  Srinivasan Seshan,et al.  A Statistical Study for File System Meta Data On High Performance Computing Sites Submitted in partial fulfillment of the requirements for the degree of Master of Science in Information Networking , 1994 .

[9]  Margo Seltzer,et al.  File Layout and File System Performance , 1994 .

[10]  Sara McMains,et al.  File System Logging versus Clustering: A Performance Comparison , 1995, USENIX.

[11]  Margo I. Seltzer,et al.  Heuristic Cleaning Algorithms in Log-Structured File Systems , 1995, USENIX.

[12]  Anna R. Karlin,et al.  A study of integrated prefetching and caching strategies , 1995, SIGMETRICS '95/PERFORMANCE '95.

[13]  Jim Zelenka,et al.  Informed prefetching and caching , 1995, SOSP.

[14]  Margo I. Seltzer,et al.  File system aging—increasing the relevance of file system benchmarks , 1997, SIGMETRICS '97.

[15]  Jeanna Neefe Matthews,et al.  Improving the performance of log-structured file systems with adaptive methods , 1997, SOSP.

[16]  William J. Bolosky,et al.  A large-scale study of file-system contents , 1999, SIGMETRICS '99.

[17]  David A. Thompson,et al.  The future of magnetic data storage technology , 2000, IBM J. Res. Dev..

[18]  Thomas E. Anderson,et al.  A Comparison of File System Workloads , 2000, USENIX Annual Technical Conference, General Track.

[19]  D. A. Thompson,et al.  The Future of Magnetic Data Storage Technology , 2000 .

[20]  Jun Wang,et al.  WOLF - A Novel Reordering Write Buffer to Boost the Performance of Log-Structured File Systems , 2002, FAST.

[21]  Kanad Ghose,et al.  yFS: A Journaling File System Design for Handling Large Data Sets with Reduced Seeking , 2003, FAST.

[22]  Erez Zadok,et al.  Tracefs: A File System to Trace Them All , 2004, FAST.

[23]  Tzi-cker Chiueh,et al.  TBBT: scalable and accurate trace replay for file server evaluation , 2005, SIGMETRICS '05.

[24]  Nikolai Joukov,et al.  Accurate and efficient replaying of file system traces , 2005, FAST'05.

[25]  Koji Sato,et al.  The Linux implementation of a log-structured file system , 2006, OPSR.

[26]  Jacob R. Lorch,et al.  A five-year study of file-system metadata , 2007, TOS.

[27]  Mark D. Corner,et al.  TFS: A Transparent File System for Contributory Storage , 2007, FAST.

[28]  David R. O'Hallaron,et al.  //TRACE: Parallel Trace Replay with Approximate Causal Events , 2007, FAST.

[29]  Kai Li,et al.  Avoiding the Disk Bottleneck in the Data Domain Deduplication File System , 2008, FAST.

[30]  Bin Zhou,et al.  Scalable Performance of the Panasas Parallel File System , 2008, FAST.

[31]  Milo Polte,et al.  Enabling Enterprise Solid State Disks Performance , 2009 .

[32]  Randal C. Burns,et al.  CA-NFS: A congestion-aware network file system , 2009, TOS.

[33]  Andrea C. Arpaci-Dusseau,et al.  Generating realistic impressions for file-system benchmarking , 2009, TOS.

[34]  David Flynn,et al.  DFS: A file system for virtualized flash storage , 2010, TOS.

[35]  Cezary Dubnicki,et al.  HydraFS: A High-Throughput File System for the HYDRAstor Content-Addressable Storage System , 2010, FAST.

[36]  Dutch T. Meyer,et al.  A study of practical deduplication , 2011, TOS.

[37]  Michael Vrable,et al.  BlueSky: a cloud-backed file system for the enterprise , 2012, FAST.

[38]  Hai Huang,et al.  Understanding performance implications of nested file systems in a virtualized environment , 2012, FAST.

[39]  Sang-Won Lee,et al.  SFS: random write considered harmful in solid state drives , 2012, FAST.

[40]  Dinan Gunawardena,et al.  ZZFS: a hybrid device and cloud file system for spontaneous users , 2012, FAST.

[41]  Garth A. Gibson,et al.  PRObE: A Thousand-Node Experimental Cluster for Computer Systems Research , 2013, login Usenix Mag..

[42]  Andrea C. Arpaci-Dusseau,et al.  ROOT: replaying multithreaded traces with resource-oriented ordering , 2013, SOSP.

[43]  Josef Bacik,et al.  BTRFS: The Linux B-Tree Filesystem , 2013, TOS.

[44]  Wei Wang,et al.  ReconFS: a reconstructable file system on flash storage , 2014, FAST.

[45]  P. Desnoyers,et al.  Skylight—A Window on Shingled Disk Operation , 2015, FAST.

[46]  Joo Young Hwang,et al.  F2FS: A New File System for Flash Storage , 2015, FAST.

[47]  Michael A. Bender,et al.  BetrFS: A Right-Optimized Write-Optimized File System , 2015, FAST.

[48]  Jihong Kim,et al.  Application-Managed Flash , 2016, FAST.

[49]  Jian Xu,et al.  NOVA: A Log-structured File System for Hybrid Volatile/Non-volatile Main Memories , 2016, FAST.

[50]  An-I Wang,et al.  The Composite-file File System: Decoupling the One-to-One Mapping of Files and Metadata for Better Performance , 2016, FAST.

[51]  Michael A. Bender,et al.  Optimizing Every Operation in a Write-optimized File System , 2016, USENIX Annual Technical Conference.

[52]  Thomas E. Anderson,et al.  Strata: A Cross Media File System , 2017, SOSP.

[53]  Michael A. Bender,et al.  How to Fragment Your File System , 2017, login Usenix Mag..

[54]  Michael A. Bender,et al.  File Systems Fated for Senescence? Nonsense, Says Science! , 2017, FAST.