Disruption-aware service composition and recovery in dynamic networking environments

The dynamic and heterogeneous natures of large-scale systems pose fundamental challenges to the design of service composition methods with minimum service disruptions. Improving reliability has long been a topic of extensive research in large-scale systems. Little existing work, however, has considered service deliveries spanning multiple components and taken both failure duration and frequency into account. This paper proposes a new service composition and recovery framework designed to achieve minimum service disruptions. The framework consists of two-tiers: service routing, which selects the service components, and network routing, which finds the network path that connects these service components. Our framework is based on a novel concept: disruption index, which characterizes different aspects of service disruptions, including frequency and duration. We formulate the problem of minimum-disruption service composition and recovery (MDSCR) as a dynamic programming problem and give its optimal solution under the assumption of complete knowledge of future failure distribution. We then present our MDSCR heuristic, which approximates the optimal solution with one-step lookahead prediction, where service link lifetime is predicted through statistical regression. We present the preliminary performance results of our algorithm via simulation study.

[1]  Leandros Tassiulas,et al.  Service discovery in mobile ad hoc networks: an overall perspective on architectural choices and network layer support issues , 2004, Ad Hoc Networks.

[2]  Sam Malek,et al.  Improving Availability in Large, Distributed Component-Based Systems Via Redeployment , 2005, Component Deployment.

[3]  Klara Nahrstedt,et al.  Dynamic QoS-aware multimedia service configuration in ubiquitous computing environments , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[5]  Leandros Tassiulas,et al.  Network layer support for service discovery in mobile ad hoc networks , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[6]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[7]  Scott Shenker,et al.  Minimizing churn in distributed systems , 2006, SIGCOMM.

[8]  Aniruddha S. Gokhale,et al.  The design and performance of component middleware for QoS-enabled deployment and configuration of DRE systems , 2007, J. Syst. Softw..

[9]  Douglas C. Schmidt,et al.  Minimum Disruption Service Composition and Recovery over Mobile Ad Hoc Networks , 2007, MobiQuitous.

[10]  Aniruddha S. Gokhale,et al.  DAnCE: A QoS-Enabled Component Deployment and Configuration Engine , 2005, Component Deployment.

[11]  J.-P. Hubaux,et al.  Enforcing service availability in mobile ad-hoc WANs , 2000, 2000 First Annual Workshop on Mobile and Ad Hoc Networking and Computing. MobiHOC (Cat. No.00EX444).