Distributed File System Virtualization Techniques Supporting On-Demand Virtual Machine Environments for Grid Computing

This paper presents a data management solution which allows fast Virtual Machine (VM) instantiation and efficient run-time execution to support VMs as execution environments in Grid computing. It is based on novel distributed file system virtualization techniques and is unique in that: (1) it provides on-demand cross-domain access to VM state for unmodified VM monitors; (2) it enables private file system channels for VM instantiation by secure tunneling and session-key based authentication; (3) it supports user-level and write-back disk caches, per-application caching policies and middleware-driven consistency models; and (4) it leverages application-specific meta-data associated with files to expedite data transfers. The paper reports on its performance in wide-area setups using VMware-based VMs. Results show that the solution delivers performance over 30% better than native NFS and with warm caches it can bring the application-perceived overheads below 10% compared to a local-disk setup. The solution also allows a VM with 1.6 GB virtual disk and 320 MB virtual memory to be cloned within 160 seconds for the first clone and within 25 seconds for subsequent clones.

[1]  Mahadev Satyanarayanan,et al.  Scale and performance in a distributed file system , 1988, TOCS.

[2]  Jeff Dike,et al.  A user-mode port of the Linux kernel , 2000, Annual Linux Showcase & Conference.

[3]  Monica S. Lam,et al.  The collective: a cache-based system management architecture , 2005, NSDI.

[4]  David A. Patterson,et al.  Computer Architecture: A Quantitative Approach , 1969 .

[5]  Andrea C. Arpaci-Dusseau,et al.  Flexibility, manageability, and performance in a Grid storage appliance , 2002, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing.

[6]  Renato J. O. Figueiredo,et al.  Seamless Access to Decentralized Storage Services in Computational Grids via a Virtual File System , 2004, Cluster Computing.

[7]  Renato J. O. Figueiredo,et al.  A case for grid computing on virtual machines , 2003, 23rd International Conference on Distributed Computing Systems, 2003. Proceedings..

[8]  Chris J. Scheiman,et al.  UFO: a personal global file system based on user-level extensions to the operating system , 1998, TOCS.

[9]  Carl Smith,et al.  NFS Version 3: Design and Implementation , 1994, USENIX Summer.

[10]  David Robinson,et al.  NFS version 4 Protocol , 2000, RFC.

[11]  Mahadev Satyanarayanan,et al.  Scale and performance in a distributed file system , 1987, SOSP '87.

[12]  Robert P. Goldberg,et al.  Survey of virtual machine research , 1974, Computer.

[13]  Beng-Hong Lim,et al.  Virtualizing I/O Devices on VMware Workstation's Hosted Virtual Machine Monitor , 2001, USENIX Annual Technical Conference, General Track.

[14]  Monica S. Lam,et al.  Optimizing the migration of virtual computers , 2002, OPSR.

[15]  Mahadev Satyanarayanan,et al.  Andrew: a distributed personal computing environment , 1986, CACM.

[16]  Andrew S. Grimshaw,et al.  Grid-based file access: the Legion I/O model , 2000, Proceedings the Ninth International Symposium on High-Performance Distributed Computing.

[17]  Renato J. O. Figueiredo,et al.  Single sign-on in In-VIGO: role-based access via delegation mechanisms using short-lived user identities , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[18]  Sugih Jamin,et al.  RTTometer: measuring path minimum RTT with confidence , 2003, Proceedings of the 3rd IEEE Workshop on IP Operations & Management (IPOM 2003) (IEEE Cat. No.03EX764).

[19]  David A. Patterson,et al.  Computer Architecture - A Quantitative Approach, 5th Edition , 1996 .

[20]  Ian T. Foster,et al.  The anatomy of the grid: enabling scalable virtual organizations , 2001, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid.

[21]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[22]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[23]  H. Wilson,et al.  Flexibility , 1911, Encyclopedia of Autism Spectrum Disorders.

[24]  Renato J. O. Figueiredo,et al.  Grid-computing portals and security issues , 2003, J. Parallel Distributed Comput..

[25]  Xiaomin Zhu,et al.  From virtualized resources to virtual computing grids: the In-VIGO system , 2005, Future Gener. Comput. Syst..

[26]  Renato J. O. Figueiredo,et al.  VMPlants: Providing and Managing Virtual Machine Execution Environments for Grid Computing , 2004, Proceedings of the ACM/IEEE SC2004 Conference.

[27]  Douglas Thain,et al.  The Kangaroo approach to data movement on the Grid , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[28]  Renato J. O. Figueiredo,et al.  Enhancing the scalability and usability of computational grids via logical user accounts and virtual file systems , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[29]  Renato J. O. Figueiredo,et al.  Supporting application-tailored grid file system sessions with WSRF-based services , 2005, HPDC-14. Proceedings. 14th IEEE International Symposium on High Performance Distributed Computing, 2005..

[30]  Ian T. Foster,et al.  GASS: a data movement and access service for wide area computing systems , 1999, IOPADS '99.

[31]  Douglas Thain,et al.  Parrot: Transparent User-Level Middleware for Data-Intensive Computing , 2005, Scalable Comput. Pract. Exp..

[32]  David Mazières,et al.  Separating key management from file system security , 1999, SOSP.

[33]  Ian T. Foster,et al.  Secure, Efficient Data Transport and Replica Management for High-Performance Data-Intensive Computing , 2001, 2001 Eighteenth IEEE Symposium on Mass Storage Systems and Technologies.

[34]  Brent Callaghan,et al.  NFS Illustrated , 1999 .