Integrating Reinforcement Learning with Multi-Agent Techniques for Adaptive Service Composition

Service-oriented architecture is a widely used software engineering paradigm to cope with complexity and dynamics in enterprise applications. Service composition, which provides a cost-effective way to implement software systems, has attracted significant attention from both industry and research communities. As online services may keep evolving over time and thus lead to a highly dynamic environment, service composition must be self-adaptive to tackle uninformed behavior during the evolution of services. In addition, service composition should also maintain high efficiency for large-scale services, which are common for enterprise applications. This article presents a new model for large-scale adaptive service composition based on multi-agent reinforcement learning. The model integrates reinforcement learning and game theory, where the former is to achieve adaptation in a highly dynamic environment and the latter is to enable agents to work for a common task (i.e., composition). In particular, we propose a multi-agent Q-learning algorithm for service composition, which is expected to achieve better performance when compared with the single-agent Q-learning method and multi-agent SARSA (State-Action-Reward-State-Action) method. Our experimental results demonstrate the effectiveness and efficiency of our approach.

[1]  J. Nash,et al.  NON-COOPERATIVE GAMES , 1951, Classics in Game Theory.

[2]  Karl Tuyls,et al.  An Overview of Cooperative and Competitive Multiagent Learning , 2005, LAMAS.

[3]  Eyhab Al-Masri,et al.  Discovering the best web service , 2007, WWW '07.

[4]  Craig Boutilier,et al.  Planning, Learning and Coordination in Multiagent Decision Processes , 1996, TARK.

[5]  Xiang Zhou,et al.  Adaptive Service Composition Based on Reinforcement Learning , 2010, ICSOC.

[6]  Kwang Mong Sim,et al.  Agent-Based Service Composition in Cloud Computing , 2010, FGIT-GDC/CA.

[7]  Zibin Zheng,et al.  Integrating On-policy Reinforcement Learning with Multi-agent Techniques for Adaptive Service Composition , 2014, ICSOC.

[8]  Ying Chen,et al.  Partial Selection: An Efficient Approach for QoS-Aware Web Service Composition , 2014, 2014 IEEE International Conference on Web Services.

[9]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[10]  Tatsuya Suda,et al.  Automated generation of composite web services based on functional semantics , 2009, J. Web Semant..

[11]  Zakaria Maamar,et al.  Toward an agent-based and context-oriented approach for Web services composition , 2005, IEEE Transactions on Knowledge and Data Engineering.

[12]  Huaglory Tianfield,et al.  Decentralized multi-agent service composition , 2013, Multiagent Grid Syst..

[13]  Hans-Arno Jacobsen,et al.  Whitening SOA Testing via Event Exposure , 2013, IEEE Transactions on Software Engineering.

[14]  Jörg Hoffmann,et al.  SAP Speaks PDDL , 2010, AAAI.

[15]  Schahram Dustdar,et al.  Domain-Specific Service Selection for Composite Services , 2012, IEEE Transactions on Software Engineering.

[16]  Vincenzo Grassi,et al.  MOSES: A Framework for QoS Driven Runtime Adaptation of Service-Oriented Systems , 2012, IEEE Transactions on Software Engineering.

[17]  Craig Boutilier,et al.  The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[18]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[19]  Stephan Reiff-Marganiec,et al.  Markov-HTN Planning Approach to Enhance Flexibility of Automatic Web Service Composition , 2009, 2009 IEEE International Conference on Web Services.

[20]  Luciano Baresi,et al.  Self-Supervising BPEL Processes , 2011, IEEE Transactions on Software Engineering.

[21]  Zibin Zheng,et al.  Investigating QoS of Real-World Web Services , 2014, IEEE Transactions on Services Computing.

[22]  MengChu Zhou,et al.  A Multilevel Index Model to Expedite Web Service Discovery and Composition in Large-Scale Service Repositories , 2016, IEEE Transactions on Services Computing.

[23]  Bart De Schutter,et al.  A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[24]  Natalya Keberle,et al.  Towards a Framework for Agent-Enabled Semantic Web Service Composition , 2004, Int. J. Web Serv. Res..

[25]  Minjie Zhang,et al.  Multi-Objective Service Composition Using Reinforcement Learning , 2013, ICSOC.

[26]  Andrew W. Moore,et al.  Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[27]  Marco Aiello,et al.  Continual Planning with Sensing for Web Service Composition , 2011, AAAI.

[28]  Xin Chen,et al.  Integrating Gaussian Process with Reinforcement Learning for Adaptive Service Composition , 2015, ICSOC.

[29]  Haiyan Zhao,et al.  A Multi-agent Learning Model for Service Composition , 2012, 2012 IEEE Asia-Pacific Services Computing Conference.

[30]  Xiaofeng Wang,et al.  Reinforcement Learning to Play an Optimal Nash Equilibrium in Team Markov Games , 2002, NIPS.

[31]  Boualem Benatallah,et al.  A Petri Net-based Model for Web Service Composition , 2003, ADC.

[32]  Pascal Poizat,et al.  Self-Adaptive Service Composition Through Graphplan Repair , 2010, 2010 IEEE International Conference on Web Services.

[33]  H. Young,et al.  The Evolution of Conventions , 1993 .

[34]  L. Shapley,et al.  Fictitious Play Property for Games with Identical Interests , 1996 .

[35]  Lijuan Wang,et al.  Facilitating an ant colony algorithm for multi-objective data-intensive service provision , 2015, J. Comput. Syst. Sci..

[36]  Thomas Risse,et al.  Selecting skyline services for QoS-based web service composition , 2010, WWW '10.

[37]  Shiwei Tang,et al.  Web Service Composition Using Markov Decision Processes , 2005, WAIM.

[38]  MengChu Zhou,et al.  Automatic Web Service Composition Based on Uncertainty Execution Effects , 2016, IEEE Transactions on Services Computing.

[39]  Fuyuki Ishikawa,et al.  SanGA: A Self-Adaptive Network-Aware Approach to Service Composition , 2014, IEEE Transactions on Services Computing.

[40]  Marco Saerens,et al.  Dynamic Web Service Composition within a Service-Oriented Architecture , 2007, IEEE International Conference on Web Services (ICWS 2007).

[41]  Israel Ben-Shaul,et al.  Dynamic Adaptation and Deployment of Distributed Components In Hadas , 2001, IEEE Trans. Software Eng..

[42]  Prashant Doshi,et al.  Dynamic workflow composition using Markov decision processes , 2004 .

[43]  Bernd Bruegge,et al.  Object-Oriented Software Engineering Using UML, Patterns, and Java , 2009 .

[44]  Ville Könönen,et al.  Asymmetric multiagent reinforcement learning , 2003, Web Intell. Agent Syst..

[45]  Hongbing Wang,et al.  A Novel Approach to Large-Scale Services Composition , 2013, APWeb.

[46]  Sean Luke,et al.  Cooperative Multi-Agent Learning: The State of the Art , 2005, Autonomous Agents and Multi-Agent Systems.

[47]  Michael L. Littman,et al.  Value-function reinforcement learning in Markov games , 2001, Cognitive Systems Research.

[48]  Michael L. Littman,et al.  Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[49]  Bernd Bruegge,et al.  Object-Oriented Software Engineering: Using UML, Patterns and Java, Second Edition , 2003 .

[50]  Danilo Ardagna,et al.  Adaptive Service Composition in Flexible Processes , 2007, IEEE Transactions on Software Engineering.

[51]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[52]  Annapaola Marconi,et al.  Automated Composition of Web Services by Planning at the Knowledge Level , 2005, IJCAI.

[53]  Soundar R. T. Kumara,et al.  Effective Web Service Composition in Diverse and Large-Scale Service Networks , 2008, IEEE Transactions on Services Computing.

[54]  Zibin Zheng,et al.  Adaptive and Dynamic Service Composition via Multi-agent Reinforcement Learning , 2014, 2014 IEEE International Conference on Web Services.

[55]  Tommi S. Jaakkola,et al.  Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms , 2000, Machine Learning.

[56]  Piergiorgio Bertoli,et al.  Automated composition of Web services via planning in asynchronous domains , 2005, Artif. Intell..

[57]  Tiziana Margaria,et al.  Automated service composition with adaptive planning , 2019 .

[58]  Thomas Vogel,et al.  Model-Driven Engineering of Self-Adaptive Software with EUREMA , 2014, ACM Trans. Auton. Adapt. Syst..

[59]  Hongbing Wang,et al.  Effective service composition using multi-agent reinforcement learning , 2016, Knowl. Based Syst..

[60]  Hongbing Wang,et al.  A Multi-agent Reinforcement Learning Model for Service Composition , 2012, 2012 IEEE Ninth International Conference on Services Computing.

[61]  Zaki Brahmi QoS-aware Automatic Web Service Composition based on Cooperative Agents , 2013, 2013 Workshops on Enabling Technologies: Infrastructure for Collaborative Enterprises.

[62]  Gwen Salaün,et al.  Adaptation of Service Protocols Using Process Algebra and On-the-Fly Reduction Techniques , 2008, IEEE Transactions on Software Engineering.

[63]  Ladan Tahvildari,et al.  Self-adaptive software: Landscape and research challenges , 2009, TAAS.

[64]  Shensheng Zhang,et al.  A Distributed Algorithm for Web Service Composition Based on Service Agent Model , 2011, IEEE Transactions on Parallel and Distributed Systems.

[65]  Manuela M. Veloso,et al.  Multiagent Systems: A Survey from a Machine Learning Perspective , 2000, Auton. Robots.

[66]  Michael P. Wellman,et al.  Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm , 1998, ICML.

[67]  Yao Li A Multi-agent Cooperative Reinforcement Learning Algorithm Based on Team Markov Game , 2004 .