Application-Tailored Cache Consistency for Wide-Area File Systems

The inability to perform optimizations based on application-specific information presents a hurdle to the deployment of pervasive LAN file systems across WAN environments. This paper proposes a novel approach addressing this problem through application-tailored caching and consistency in widearea file systems. It leverages widely available Network File System (NFS) deployments without any modifications to kernels nor applications, and employs middleware to dynamically establish Grid-wide Virtual File System (GVFS) sessions with application-tailored cache consistency. Two consistency models are discussed in this paper: a relaxed model based on invalidation polling, and a stronger model based on delegation and callback. Experimental evaluation based on microbenchmarks and scientific applications show that with application-tailored cache consistency, GVFS is able to both improve application runtimes and reduce server load significantly, compared to kernel-level NFS in WAN.

[1]  Balachander Krishnamurthy,et al.  Study of Piggyback Cache Validation for Proxy Caches in the World Wide Web , 1997, USENIX Symposium on Internet Technologies and Systems.

[2]  Steven Tuecke,et al.  The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration , 2002 .

[3]  Brent Callaghan,et al.  NFS Illustrated , 1999 .

[4]  Renato J. O. Figueiredo,et al.  VMPlants: Providing and Managing Virtual Machine Execution Environments for Grid Computing , 2004, Proceedings of the ACM/IEEE SC2004 Conference.

[5]  Beng-Hong Lim,et al.  Virtualizing I/O Devices on VMware Workstation's Hosted Virtual Machine Monitor , 2001, USENIX Annual Technical Conference, General Track.

[6]  Renato J. O. Figueiredo,et al.  A case for grid computing on virtual machines , 2003, 23rd International Conference on Distributed Computing Systems, 2003. Proceedings..

[7]  Jing Xu,et al.  Towards Autonomic Grid Data Management with Virtualized Distributed File Systems , 2006, 2006 IEEE International Conference on Autonomic Computing.

[8]  Andrea C. Arpaci-Dusseau,et al.  Explicit Control in the Batch-Aware Distributed File System , 2004, NSDI.

[9]  David E. Culler,et al.  PlanetLab: an overlay testbed for broad-coverage services , 2003, CCRV.

[10]  Xiaomin Zhu,et al.  From virtualized resources to virtual computing grids: the In-VIGO system , 2005, Future Gener. Comput. Syst..

[11]  David Mazières,et al.  Fast and secure distributed read-only file system , 2000, TOCS.

[12]  Mahadev Satyanarayanan,et al.  Internet suspend/resume , 2002, Proceedings Fourth IEEE Workshop on Mobile Computing Systems and Applications.

[13]  Renato J. O. Figueiredo,et al.  Distributed File System Virtualization Techniques Supporting On-Demand Virtual Machine Environments for Grid Computing , 2006, Cluster Computing.

[14]  Mark Carson,et al.  NIST Net: a Linux-based network emulation tool , 2003, CCRV.

[15]  Renato J. O. Figueiredo,et al.  Support for data-intensive, variable-granularity grid applications via distributed file system virtualization - a case study of light scattering spectroscopy , 2004, Proceedings of the Second International Workshop on Challenges of Large Applications in Distributed Environments, 2004. CLADE 2004..

[16]  Andrew S. Grimshaw,et al.  LegionFS: A Secure and Scalable File System Supporting Cross-Domain High-Performance Applications , 2001, International Conference on Software Composition.

[17]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[18]  ZhangJian,et al.  From virtualized resources to virtual computing grids , 2005 .

[19]  Renato J. O. Figueiredo,et al.  The PUNCH virtual file system: seamless access to decentralized storage services in a computational grid , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[20]  David Brumley,et al.  Virtual Appliances for Deploying and Maintaining Software , 2003, LISA.

[21]  Matt Blaze,et al.  A cryptographic file system for UNIX , 1993, CCS '93.

[22]  Jeffrey C. Mogul,et al.  Spritely NFS: experiments with cache-consistency protocols , 1989, SOSP '89.

[23]  David Mazières,et al.  A Toolkit for User-Level File Systems , 2001, USENIX Annual Technical Conference, General Track.

[24]  Magnus Karlsson,et al.  Taming aggressive replication in the Pangaea wide-area file system , 2002, OPSR.

[25]  Tal Garfinkel,et al.  Traps and Pitfalls: Practical Problems in System Call Interposition Based Security Tools , 2003, NDSS.

[26]  Rick Macklem,et al.  Not Quite NFS, Soft Cache Consistency for NFS , 1994, USENIX Winter.

[27]  Jeffrey Katcher,et al.  PostMark: A New File System Benchmark , 1997 .

[28]  Ben Y. Zhao,et al.  OceanStore: an architecture for global-scale persistent storage , 2000, SIGP.

[29]  Renato J. O. Figueiredo,et al.  Supporting application-tailored grid file system sessions with WSRF-based services , 2005, HPDC-14. Proceedings. 14th IEEE International Symposium on High Performance Distributed Computing, 2005..

[30]  J. Howard Et El,et al.  Scale and performance in a distributed file system , 1988 .

[31]  Chris J. Scheiman,et al.  UFO: a personal global file system based on user-level extensions to the operating system , 1998, TOCS.

[32]  Jeff Dike,et al.  A user-mode port of the Linux kernel , 2000, Annual Linux Showcase & Conference.