论文信息 - Integrating Reinforcement Learning with Multi-Agent Techniques for Adaptive Service Composition

Integrating Reinforcement Learning with Multi-Agent Techniques for Adaptive Service Composition

Service-oriented architecture is a widely used software engineering paradigm to cope with complexity and dynamics in enterprise applications. Service composition, which provides a cost-effective way to implement software systems, has attracted significant attention from both industry and research communities. As online services may keep evolving over time and thus lead to a highly dynamic environment, service composition must be self-adaptive to tackle uninformed behavior during the evolution of services. In addition, service composition should also maintain high efficiency for large-scale services, which are common for enterprise applications. This article presents a new model for large-scale adaptive service composition based on multi-agent reinforcement learning. The model integrates reinforcement learning and game theory, where the former is to achieve adaptation in a highly dynamic environment and the latter is to enable agents to work for a common task (i.e., composition). In particular, we propose a multi-agent Q-learning algorithm for service composition, which is expected to achieve better performance when compared with the single-agent Q-learning method and multi-agent SARSA (State-Action-Reward-State-Action) method. Our experimental results demonstrate the effectiveness and efficiency of our approach.

[1] J. Nash,et al. NON-COOPERATIVE GAMES , 1951, Classics in Game Theory.

[2] Karl Tuyls,et al. An Overview of Cooperative and Competitive Multiagent Learning , 2005, LAMAS.

[3] Eyhab Al-Masri,et al. Discovering the best web service , 2007, WWW '07.

[4] Craig Boutilier,et al. Planning, Learning and Coordination in Multiagent Decision Processes , 1996, TARK.

[5] Xiang Zhou,et al. Adaptive Service Composition Based on Reinforcement Learning , 2010, ICSOC.

[6] Kwang Mong Sim,et al. Agent-Based Service Composition in Cloud Computing , 2010, FGIT-GDC/CA.

[7] Zibin Zheng,et al. Integrating On-policy Reinforcement Learning with Multi-agent Techniques for Adaptive Service Composition , 2014, ICSOC.

[8] Ying Chen,et al. Partial Selection: An Efficient Approach for QoS-Aware Web Service Composition , 2014, 2014 IEEE International Conference on Web Services.

[9] Martin L. Puterman,et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[10] Tatsuya Suda,et al. Automated generation of composite web services based on functional semantics , 2009, J. Web Semant..

[11] Zakaria Maamar,et al. Toward an agent-based and context-oriented approach for Web services composition , 2005, IEEE Transactions on Knowledge and Data Engineering.

[12] Huaglory Tianfield,et al. Decentralized multi-agent service composition , 2013, Multiagent Grid Syst..

[13] Hans-Arno Jacobsen,et al. Whitening SOA Testing via Event Exposure , 2013, IEEE Transactions on Software Engineering.

[14] Jörg Hoffmann,et al. SAP Speaks PDDL , 2010, AAAI.

[15] Schahram Dustdar,et al. Domain-Specific Service Selection for Composite Services , 2012, IEEE Transactions on Software Engineering.

[16] Vincenzo Grassi,et al. MOSES: A Framework for QoS Driven Runtime Adaptation of Service-Oriented Systems , 2012, IEEE Transactions on Software Engineering.

[17] Craig Boutilier,et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems , 1998, AAAI/IAAI.

[18] Peter Dayan,et al. Q-learning , 1992, Machine Learning.

[19] Stephan Reiff-Marganiec,et al. Markov-HTN Planning Approach to Enhance Flexibility of Automatic Web Service Composition , 2009, 2009 IEEE International Conference on Web Services.

[20] Luciano Baresi,et al. Self-Supervising BPEL Processes , 2011, IEEE Transactions on Software Engineering.

[21] Zibin Zheng,et al. Investigating QoS of Real-World Web Services , 2014, IEEE Transactions on Services Computing.

[22] MengChu Zhou,et al. A Multilevel Index Model to Expedite Web Service Discovery and Composition in Large-Scale Service Repositories , 2016, IEEE Transactions on Services Computing.

[23] Bart De Schutter,et al. A Comprehensive Survey of Multiagent Reinforcement Learning , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[24] Natalya Keberle,et al. Towards a Framework for Agent-Enabled Semantic Web Service Composition , 2004, Int. J. Web Serv. Res..

[25] Minjie Zhang,et al. Multi-Objective Service Composition Using Reinforcement Learning , 2013, ICSOC.

[26] Andrew W. Moore,et al. Reinforcement Learning: A Survey , 1996, J. Artif. Intell. Res..

[27] Marco Aiello,et al. Continual Planning with Sensing for Web Service Composition , 2011, AAAI.

[28] Xin Chen,et al. Integrating Gaussian Process with Reinforcement Learning for Adaptive Service Composition , 2015, ICSOC.

[29] Haiyan Zhao,et al. A Multi-agent Learning Model for Service Composition , 2012, 2012 IEEE Asia-Pacific Services Computing Conference.

[30] Xiaofeng Wang,et al. Reinforcement Learning to Play an Optimal Nash Equilibrium in Team Markov Games , 2002, NIPS.

[31] Boualem Benatallah,et al. A Petri Net-based Model for Web Service Composition , 2003, ADC.

[32] Pascal Poizat,et al. Self-Adaptive Service Composition Through Graphplan Repair , 2010, 2010 IEEE International Conference on Web Services.

[33] H. Young,et al. The Evolution of Conventions , 1993 .

[34] L. Shapley,et al. Fictitious Play Property for Games with Identical Interests , 1996 .

[35] Lijuan Wang,et al. Facilitating an ant colony algorithm for multi-objective data-intensive service provision , 2015, J. Comput. Syst. Sci..

[36] Thomas Risse,et al. Selecting skyline services for QoS-based web service composition , 2010, WWW '10.

[37] Shiwei Tang,et al. Web Service Composition Using Markov Decision Processes , 2005, WAIM.

[38] MengChu Zhou,et al. Automatic Web Service Composition Based on Uncertainty Execution Effects , 2016, IEEE Transactions on Services Computing.

[39] Fuyuki Ishikawa,et al. SanGA: A Self-Adaptive Network-Aware Approach to Service Composition , 2014, IEEE Transactions on Services Computing.

[40] Marco Saerens,et al. Dynamic Web Service Composition within a Service-Oriented Architecture , 2007, IEEE International Conference on Web Services (ICWS 2007).

[41] Israel Ben-Shaul,et al. Dynamic Adaptation and Deployment of Distributed Components In Hadas , 2001, IEEE Trans. Software Eng..

[42] Prashant Doshi,et al. Dynamic workflow composition using Markov decision processes , 2004 .

[43] Bernd Bruegge,et al. Object-Oriented Software Engineering Using UML, Patterns, and Java , 2009 .

[44] Ville Könönen,et al. Asymmetric multiagent reinforcement learning , 2003, Web Intell. Agent Syst..

[45] Hongbing Wang,et al. A Novel Approach to Large-Scale Services Composition , 2013, APWeb.

[46] Sean Luke,et al. Cooperative Multi-Agent Learning: The State of the Art , 2005, Autonomous Agents and Multi-Agent Systems.

[47] Michael L. Littman,et al. Value-function reinforcement learning in Markov games , 2001, Cognitive Systems Research.

[48] Michael L. Littman,et al. Markov Games as a Framework for Multi-Agent Reinforcement Learning , 1994, ICML.

[49] Bernd Bruegge,et al. Object-Oriented Software Engineering: Using UML, Patterns and Java, Second Edition , 2003 .

[50] Danilo Ardagna,et al. Adaptive Service Composition in Flexible Processes , 2007, IEEE Transactions on Software Engineering.

[51] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[52] Annapaola Marconi,et al. Automated Composition of Web Services by Planning at the Knowledge Level , 2005, IJCAI.

[53] Soundar R. T. Kumara,et al. Effective Web Service Composition in Diverse and Large-Scale Service Networks , 2008, IEEE Transactions on Services Computing.

[54] Zibin Zheng,et al. Adaptive and Dynamic Service Composition via Multi-agent Reinforcement Learning , 2014, 2014 IEEE International Conference on Web Services.

[55] Tommi S. Jaakkola,et al. Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms , 2000, Machine Learning.

[56] Piergiorgio Bertoli,et al. Automated composition of Web services via planning in asynchronous domains , 2005, Artif. Intell..

[57] Tiziana Margaria,et al. Automated service composition with adaptive planning , 2019 .

[58] Thomas Vogel,et al. Model-Driven Engineering of Self-Adaptive Software with EUREMA , 2014, ACM Trans. Auton. Adapt. Syst..

[59] Hongbing Wang,et al. Effective service composition using multi-agent reinforcement learning , 2016, Knowl. Based Syst..

[60] Hongbing Wang,et al. A Multi-agent Reinforcement Learning Model for Service Composition , 2012, 2012 IEEE Ninth International Conference on Services Computing.

[61] Zaki Brahmi. QoS-aware Automatic Web Service Composition based on Cooperative Agents , 2013, 2013 Workshops on Enabling Technologies: Infrastructure for Collaborative Enterprises.

[62] Gwen Salaün,et al. Adaptation of Service Protocols Using Process Algebra and On-the-Fly Reduction Techniques , 2008, IEEE Transactions on Software Engineering.

[63] Ladan Tahvildari,et al. Self-adaptive software: Landscape and research challenges , 2009, TAAS.

[64] Shensheng Zhang,et al. A Distributed Algorithm for Web Service Composition Based on Service Agent Model , 2011, IEEE Transactions on Parallel and Distributed Systems.

[65] Manuela M. Veloso,et al. Multiagent Systems: A Survey from a Machine Learning Perspective , 2000, Auton. Robots.

[66] Michael P. Wellman,et al. Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm , 1998, ICML.

[67] Yao Li. A Multi-agent Cooperative Reinforcement Learning Algorithm Based on Team Markov Game , 2004 .