Adaptive virtual distributed environments for shared cyberinfrastructures

A shared distributed cyberinfrastructure is formed by federating computation resources from multiple domains. Such shared infrastructures are increasing in popularity and are providing massive amounts of aggregated computation resources to large numbers of users. Meanwhile, virtualization technologies, at both the machine and network levels, are maturing and enabling mutually isolated virtual computation environments for executing unmodified parallel and distributed applications on top of such a shared physical cyberinfrastructure. In this dissertation, we go one step further by supporting runtime adaptation of virtual computation environments as integrated, active entities. More specifically, driven by both the dynamic availability of infrastructure resources and dynamic application resource demand, a virtual computation environment is able to automatically relocate itself across the infrastructure and adjust its share of infrastructure resources. Such runtime adaptation is transparent to both the users of a virtual environment and the applications running inside it, giving both the look and feel of a private, well-provisioned computation environment. This dissertation presents the design, implementation, and evaluation of a middleware system, VIOLIN, which enables adaptive virtual computation environments. Each VIOLIN virtual environment is composed of a virtual network of virtual machines and is capable of live migration across a multi-domain physical infrastructure. This research effort includes the development of the enabling mechanisms for virtual environment creation and adaptation, as well as policies for effectively utilizing these mechanisms. Through this effort, we have found that combining virtual environments with adaptation policies and mechanisms can provide significant increases in both the usability and performance of shared computation infrastructures.
Further, the VIOLIN middleware is currently deployed on a production cyberinfrastructure, called the nanoHUB, as a major computational facility used to execute parallel and distributed nanotechnology simulations.
