Self-Adaptive Resilient Service Composition

One aspect that permeates all large scale systems is the occurrence of failures. Continually, on any data center, failures are happening, either caused by malfunctioning disks, memories, network connections, or software bugs. Large scale failures - possibly caused be a ripple effect of smaller failures - are obviously even worse. The fact that failures are extremely hard or even impossible to predict makes them particularly challenging to cope with. A better alternative to predicting failures is creating systems that can cope with failures and autonomously adapt. In this paper, we investigate a decentralized self-adaptive approach to a resilient system for service composition. Our approach is based on an agent coordination mechanism known as "delegate MAS", which is particularly suited for large-scale coordination of systems. We thoroughly evaluate this approach through large and huge scale experiments of composite services. The results from these experiments show that it is possible to create service compositions which are resilient to large scale failures.

[1]  Johan A. K. Suykens,et al.  QoS prediction for web service compositions using kernel-based quantile estimation with online adaptation of the constant offset , 2014, Inf. Sci..

[2]  Schahram Dustdar,et al.  Monitoring, Prediction and Prevention of SLA Violations in Composite Services , 2010, 2010 IEEE International Conference on Web Services.

[3]  Danny Weyns,et al.  A Decentralized Approach for Anticipatory Vehicle Routing Using Delegate Multiagent Systems , 2011, IEEE Transactions on Intelligent Transportation Systems.

[4]  Radu Calinescu,et al.  Dynamic QoS Management and Optimization in Service-Based Systems , 2011, IEEE Transactions on Software Engineering.

[5]  Anand S. Rao,et al.  BDI Agents: From Theory to Practice , 1995, ICMAS.

[6]  Minjie Zhang,et al.  Trustworthy Stigmergic Service Compositionand Adaptation in Decentralized Environments , 2016, IEEE Transactions on Services Computing.

[7]  Marcos K. Aguilera,et al.  Failure detection and consensus in the crash-recovery model , 2000, Distributed Computing.

[8]  Fuyuki Ishikawa,et al.  Towards network-aware service composition in the cloud , 2012, WWW.

[9]  Ambuj K. Singh,et al.  Analyzing Performance of Lease-Based Schemes under Failures , 2011, 2011 IEEE 30th International Symposium on Reliable Distributed Systems.

[10]  Nicholas R. Jennings,et al.  Robust Execution of Service Workflows Using Redundancy and Advance Reservations , 2011, IEEE Transactions on Services Computing.

[11]  Christos Faloutsos,et al.  Graphs over time: densification laws, shrinking diameters and possible explanations , 2005, KDD '05.

[12]  Schahram Dustdar,et al.  A survey on web services composition , 2005, Int. J. Web Grid Serv..

[13]  Michael P. Georgeff,et al.  Commitment and Effectiveness of Situated Agents , 1991, IJCAI.

[14]  I-Ling Yen,et al.  QoS-Driven Service Composition with Reconfigurable Services , 2013, IEEE Transactions on Services Computing.

[15]  Michael Stonebraker,et al.  A Formal Model of Crash Recovery in a Distributed System , 1983, IEEE Transactions on Software Engineering.

[16]  Valeria Cardellini,et al.  Designing a Broker for QoS-driven Runtime Adaptation of SOA Applications , 2010, 2010 IEEE International Conference on Web Services.

[17]  Tom Holvoet,et al.  Beliefs, desires and intentions through the environment , 2006, AAMAS '06.

[18]  Håkan Sivencrona,et al.  Byzantine Fault Tolerance, from Theory to Reality , 2003, SAFECOMP.