Coerced Cache Eviction and Discreet Mode Journaling: Dealing with Misbehaving Disks

We present Coerced Cache Eviction (CCE), a new method to force writes to disk in the presence of a disk cache that does not properly obey write-cache configuration or flush requests. We demonstrate the utility of CCE by building a new journaling mode within the Linux ext3 file system. When mounted in this discreet mode, ext3 uses CCEs to ensure that writes are properly ordered and thus maintains file system integrity despite the presence of an improperly behaving disk. We show that discreet mode journaling operates with acceptable overheads for most workloads.
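The idea of coercing eviction can be illustrated with a toy simulation. The sketch below is not the paper's implementation: it assumes a write-back disk cache with a fixed capacity and strict FIFO replacement, and every name in it (`LyingDisk`, `coerced_eviction`, `scratch_start`) is hypothetical. Real drives may use other replacement policies, in which case more filler writes, or a different write pattern, would likely be needed.

```python
from collections import OrderedDict


class LyingDisk:
    """Toy model of a disk whose write-back cache ignores flush requests.

    Assumption: the cache holds a fixed number of blocks and evicts in
    FIFO order. This is an illustrative model, not a real drive.
    """

    def __init__(self, cache_blocks):
        self.cache_blocks = cache_blocks
        self.cache = OrderedDict()  # block number -> data, oldest first
        self.media = {}             # blocks that actually reached the media

    def write(self, block, data):
        # Rewriting a cached block moves it to the newest position.
        self.cache.pop(block, None)
        self.cache[block] = data
        # When the cache overflows, the oldest block is written to media.
        while len(self.cache) > self.cache_blocks:
            old_block, old_data = self.cache.popitem(last=False)
            self.media[old_block] = old_data

    def flush(self):
        # Misbehaving disk: reports success but writes nothing back.
        return True

    def is_durable(self, block):
        return block in self.media


def coerced_eviction(disk, scratch_start, count):
    """Write `count` filler blocks to a scratch region so that older
    cached writes are pushed out of the cache and onto the media."""
    for i in range(count):
        disk.write(scratch_start + i, b"\0")


disk = LyingDisk(cache_blocks=4)
disk.write(100, b"journal-commit")
disk.flush()                         # acknowledged, but has no effect
before = disk.is_durable(100)        # block 100 is still only in the cache

coerced_eviction(disk, scratch_start=1000, count=4)
after = disk.is_durable(100)         # filler writes pushed block 100 out
```

In this model, writing as many filler blocks as the cache holds is enough to guarantee that every earlier write has left the cache, which is the ordering property a journaling file system needs before it writes a commit record. The extra filler traffic is also where the overhead of such a scheme would come from.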
