Smoke and Mirrors: Reflecting Files at a Geographically Remote Location Without Loss of Performance

The Smoke and Mirrors File System (SMFS) mirrors files at geographically remote datacenter locations with negligible impact on file system performance at the primary site, and minimal degradation as a function of link latency. It accomplishes this goal using wide-area links that run at extremely high speeds, but have long round-trip-time latencies--a combination of properties that poses problems for traditional mirroring solutions. In addition to its raw speed, SMFS maintains good synchronization: should the primary site become completely unavailable, the system minimizes loss of work, even for applications that simultaneously update groups of files. We present the SMFS design, then evaluate the system on Emulab and the Cornell National Lambda Rail (NLR) Ring testbed. Intended applications include wide-area file sharing and remote backup for disaster recovery.

[1]  Mendel Rosenblum,et al.  The design and implementation of a log-structured file system , 1991, SOSP '91.

[2]  Mary Baker,et al.  The LOCKSS peer-to-peer digital preservation system , 2005, TOCS.

[3]  David A. Patterson,et al.  Serverless network file systems , 1995, SOSP.

[4]  Luigi Rizzo,et al.  RMDP: an FEC-based reliable multicast protocol for wireless environments , 1998, MOCO.

[5]  Bianca Schroeder,et al.  Disk Failures in the Real World: What Does an MTTF of 1, 000, 000 Hours Mean to You? , 2007, FAST.

[6]  Dorian J. Cougias The Backup Book: Disaster Recovery from Desktop to Data Center , 2003 .

[7]  M. Factor,et al.  Advanced functions for storage subsystems: Supporting continuous availability , 2003, IBM Syst. J..

[8]  Dirk Beyer,et al.  Designing for Disasters , 2004, FAST.

[9]  Mike Hibler,et al.  An integrated experimental environment for distributed systems and networks , 2002, OPSR.

[10]  John Wilkes,et al.  Seneca: remote mirroring done write , 2003, USENIX Annual Technical Conference, General Track.

[11]  John Kubiatowicz,et al.  Antiquity: exploiting a secure log for wide-area distributed storage , 2007, EuroSys '07.

[12]  Van Nostrand,et al.  Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm , 1967 .

[13]  Kenneth P. Birman,et al.  Maelstrom: Transparent Error Correction for Lambda Networks , 2008, NSDI.

[14]  Donald F. Towsley,et al.  Parity-based loss recovery for reliable multicast transmission , 1997, TNET.

[15]  Luigi Rizzo,et al.  Effective erasure codes for reliable computer communication protocols , 1997, CCRV.

[16]  GhemawatSanjay,et al.  The Google file system , 2003 .

[17]  Michael Williams,et al.  Replication in the harp file system , 1991, SOSP '91.

[18]  Darrell D. E. Long,et al.  Deep Store: an archival storage system architecture , 2005, 21st International Conference on Data Engineering (ICDE'05).

[19]  Zhen Zhang,et al.  Distributed Source Coding for Satellite Communications , 1999, IEEE Trans. Inf. Theory.

[20]  Jeanna Neefe Matthews,et al.  Improving the performance of log-structured file systems with adaptive methods , 1997, SOSP.

[21]  John H. Hartman,et al.  The Zebra striped network file system , 1995, TOCS.

[22]  Michael Luby,et al.  A digital fountain approach to reliable distribution of bulk data , 1998, SIGCOMM '98.

[23]  Marcos K. Aguilera,et al.  Improving Recoverability in Multi-tier Storage Systems , 2007, 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN'07).

[24]  Shankar Pasupathy,et al.  An analysis of latent sector errors in disk drives , 2007, SIGMETRICS '07.

[25]  James Lau,et al.  File System Design for an NFS File Server Appliance , 1994, USENIX Winter.

[26]  Amar Phanishayee,et al.  Ricochet: Lateral Error Correction for Time-Critical Multicast , 2007, NSDI.

[27]  Andrea C. Arpaci-Dusseau,et al.  Improving file system reliability with I/O shepherding , 2007, SOSP.

[28]  H. Apte,et al.  Serverless Network File Systems , 2006 .

[29]  Christian Huitema,et al.  The case for packet level FEC , 1996, Protocols for High-Speed Networks.

[30]  Sharon E. Perl,et al.  Myriad: Cost-Effective Disaster Tolerance , 2002, FAST.

[31]  Mary Baker,et al.  A fresh look at the reliability of long-term digital storage , 2005, EuroSys.

[32]  Chandramohan A. Thekkath,et al.  Petal: distributed virtual disks , 1996, ASPLOS VII.

[33]  Dirk Beyer,et al.  On the road to recovery: restoring data after disasters , 2006, EuroSys '06.

[34]  Alessandro Bassi,et al.  Managing Data Storage in the Network , 2001, IEEE Internet Comput..

[35]  Chandramohan A. Thekkath,et al.  Frangipani: a scalable distributed file system , 1997, SOSP.

[36]  Kimberly Keeton,et al.  A framework for evaluating storage system dependability , 2004, International Conference on Dependable Systems and Networks, 2004.

[37]  Jiwu Shu,et al.  An Implementation of Semi-synchronous Remote Mirroring System for SANs , 2004, GCC Workshops.

[38]  Steve R. Kleiman,et al.  SnapMirror: File-System-Based Asynchronous Mirroring for Disaster Recovery , 2002, FAST.