Reinforcement Learning Techniques for Decentralized Self-adaptive Service Assembly

This paper proposes a self-organizing fully decentralized solution for the service assembly problem, whose goal is to guarantee a good overall quality for the delivered services, ensuring at the same time fairness among the participating peers. The main features of our solution are: (i) the use of a gossip protocol to support decentralized information dissemination and decision making, and (ii) the use of a reinforcement learning approach to make each peer able to learn from its experience the service selection rule to be followed, thus overcoming the lack of global knowledge. Besides, we explicitly take into account load-dependent quality attributes, which lead to the definition of a service selection rule that drives the system away from overloading conditions that could adversely affect quality and fairness. Simulation experiments show that our solution self-adapts to occurring variations by quickly converging to viable assemblies maintaining the specified quality and fairness objectives.

[1]  Kensuke Fukuda,et al.  Adaptive agent selection in large-scale multi-agent systems , 2006 .

[2]  Márk Jelasity,et al.  PeerSim: A scalable P2P simulator , 2009, 2009 IEEE Ninth International Conference on Peer-to-Peer Computing.

[3]  Bradley R. Schmerl,et al.  On Patterns for Decentralized Control in Self-Adaptive Systems , 2010, Software Engineering for Self-Adaptive Systems.

[4]  Kurt Rothermel,et al.  Adaptive Composition of Distributed Pervasive Applications in Heterogeneous Environments , 2013, TAAS.

[5]  Carlo Ghezzi,et al.  QoS Driven Dynamic Binding in-the-many , 2010, QoSA.

[6]  Moshe Tennenholtz,et al.  Adaptive Load Balancing: A Study in Multi-Agent Learning , 1994, J. Artif. Intell. Res..

[7]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[8]  Ching-Lai Hwang,et al.  Multiple attribute decision making : an introduction , 1995 .

[9]  Ann Nowé,et al.  Adaptive load balancing of parallel applications with multi-agent reinforcement learning on heterogeneous systems , 2004, Sci. Program..

[10]  John N. Tsitsiklis,et al.  Congestion-dependent pricing of network services , 2000, TNET.

[11]  Rogério de Lemos,et al.  08031 -- Software Engineering for Self-Adaptive Systems: A Research Road Map , 2008, Software Engineering for Self-Adaptive Systems.

[12]  Vincenzo Grassi,et al.  GoPrime: A Fully Decentralized Middleware for Utility-Aware Service Assembly , 2016, IEEE Transactions on Software Engineering.

[13]  Raj Jain,et al.  A Quantitative Measure Of Fairness And Discrimination For Resource Allocation In Shared Computer Systems , 1998, ArXiv.

[14]  Anne-Marie Kermarrec,et al.  Gossip-based peer sampling , 2007, TOCS.

[15]  Márk Jelasity,et al.  A Robust and Scalable Peer-to-Peer Gossiping Protocol , 2003, AP2PC.

[16]  Guozhi Xu,et al.  Modeling and analysis of software aging and software failure , 2007, J. Syst. Softw..

[17]  Faruk Polat,et al.  A layered approach to learning coordination knowledge in multiagent environments , 2007, Applied Intelligence.

[18]  Kristina Lerman,et al.  Resource allocation games with changing resource capacities , 2003, AAMAS '03.