Resource-Aware Migratory Services in Wide-Area Shared Computing Environments

This paper presents the design and evaluation of a system for deploying highly available, migratable services in shared infrastructures such as PlanetLab, where the resource capacities available at a node can fluctuate significantly. A migratable service monitors its operating conditions and autonomously relocates itself to another node when the resource capacities available at its current node fall below acceptable limits. We build such services using the autonomous mobile agent paradigm: agents can monitor their operating conditions and follow various migration policies. We investigate mechanisms for service relocation and client-side protocols for accessing migratory services. For services that must be highly available, the "blackout periods", i.e., the times during which clients are unable to access a migrating service, need to be minimized and kept within tolerable limits. We first present the design of a migratable service implemented using a mobile agent and evaluate its performance in terms of blackout periods and the service agent's ability to migrate autonomously in the network. We then replicate service agents to reduce the blackout periods and develop coordination protocols for autonomous agent migration within a group of service agents. Finally, we present our work on monitoring the available resource capacities of PlanetLab nodes, which assists a migratory service in selecting a target node for relocation.
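As an illustration of the threshold-based relocation described above, the following is a minimal sketch, not the system's actual implementation. The resource fields (free CPU fraction, free memory, bandwidth), the `Limits` type, and the rule of preferring the acceptable candidate with the most free CPU are all assumptions made for this example; the abstract does not specify which resources are monitored or how a target node is ranked.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class NodeResources:
    """Hypothetical snapshot of the capacities available at one node."""
    name: str
    cpu_free: float        # fraction of CPU available, 0.0 to 1.0
    mem_free_mb: float     # free memory in megabytes
    bandwidth_kbps: float  # available network bandwidth


@dataclass(frozen=True)
class Limits:
    """Hypothetical minimum capacities the service considers acceptable."""
    cpu_free: float
    mem_free_mb: float
    bandwidth_kbps: float


def meets_limits(node: NodeResources, limits: Limits) -> bool:
    """Return True if every monitored capacity is at or above its limit."""
    return (
        node.cpu_free >= limits.cpu_free
        and node.mem_free_mb >= limits.mem_free_mb
        and node.bandwidth_kbps >= limits.bandwidth_kbps
    )


def select_target(
    current: NodeResources,
    candidates: Iterable[NodeResources],
    limits: Limits,
) -> Optional[NodeResources]:
    """Pick a relocation target, or None if the service should stay put.

    The service stays where it is while the current node meets the limits,
    and also when no candidate node meets them. Ranking by free CPU is an
    assumption of this sketch.
    """
    if meets_limits(current, limits):
        return None
    acceptable = [
        node for node in candidates
        if node.name != current.name and meets_limits(node, limits)
    ]
    if not acceptable:
        return None
    return max(acceptable, key=lambda node: node.cpu_free)
```

In this sketch a service agent would call `select_target` periodically with fresh monitoring data and migrate only when it returns a node. Returning `None` for both "no need to move" and "nowhere to move" keeps the example short; a fuller policy would probably distinguish the two cases.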
