Reinforcement Learning for Interactive QoS-Aware Services Composition

An important and challenging research problem in web of things is how to select an appropriate composition of concrete services in a dynamic and unpredictable environment. The main goal of this article is to select from all possible compositions the optimal one without knowing a priori the users' quality of service (QoS) preferences. From a theoretical point of view, we give bounds on the problem search space. As the QoS user's preferences are unknown, we propose a vector-valued MDP approach for finding the optimal QoS-aware services composition. The algorithm alternatively solves MDP with dynamic programming and learns the preferences via direct queries to the user. An important feature of the proposed algorithm is that it is able to get the optimal composition and, at the same time, limits the number of interactions with the user. Experiments on a real-world large size dataset with more than 3500 web services show that our algorithm finds the optimal composite services with around 50 interactions with the user.

[1]  Jinjun Chen,et al.  Combining Local Optimization and Enumeration for QoS-aware Web Service Composition , 2010, 2010 IEEE International Conference on Web Services.

[2]  Abdelghani Chibani,et al.  Energy-Centered and QoS-Aware Services Selection for Internet of Things , 2016, IEEE Transactions on Automation Science and Engineering.

[3]  Quan Z. Sheng,et al.  Quality driven web services composition , 2003, WWW '03.

[4]  Qingsheng Zhu,et al.  QoS-Aware Multigranularity Service Composition: Modeling and Optimization , 2016, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[5]  Ying Chen,et al.  A Partial Selection Methodology for Efficient QoS-Aware Service Composition , 2015, IEEE Transactions on Services Computing.

[6]  Minjie Zhang,et al.  Multi-Objective Service Composition in Uncertain Environments , 2015 .

[7]  Jun Gu,et al.  QoS and Preference Based Web Service Evaluation Approach , 2009, 2009 Eighth International Conference on Grid and Cooperative Computing.

[8]  Wei-Tek Tsai,et al.  Dependence-Guided Service Composition for User-Centric SOA , 2014, IEEE Systems Journal.

[9]  Yann Chevaleyre,et al.  Advantage based value iteration for Markov decision processes with unknown rewards , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[10]  Rajdeep Niyogi,et al.  Web service selection with global constraints using modified gray wolf optimizer , 2016, 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[11]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[12]  W. Liu,et al.  Trustworthy service selection and composition - reducing the entropy of service-oriented Web , 2005, INDIN '05. 2005 3rd IEEE International Conference on Industrial Informatics, 2005..

[13]  Maude Manouvrier,et al.  Web services composition: Complexity and models , 2015, Discret. Appl. Math..

[14]  Wolfgang Nejdl,et al.  A hybrid approach for efficient Web service composition with end-to-end QoS constraints , 2012, TWEB.

[15]  Thomas Risse,et al.  Selecting skyline services for QoS-based web service composition , 2010, WWW '10.

[16]  Nicolai M. Josuttis,et al.  Soa In Practice The Art Of Distributed System Design , 2007 .

[17]  Yixin Chen,et al.  QoS-Aware Dynamic Composition of Web Services Using Numerical Temporal Planning , 2014, IEEE Transactions on Services Computing.

[18]  Karim Djouani,et al.  A Distributed Agent-Based Approach for Optimal QoS Selection in Web of Object Choreography , 2018, IEEE Systems Journal.

[19]  Athman Bouguettaya,et al.  Efficient Service Skyline Computation for Composite Service Selection , 2013, IEEE Transactions on Knowledge and Data Engineering.

[20]  Zhaohui Wu,et al.  Top-${\rm k}$ Automatic Service Composition: A Parallel Method for Large-Scale Service Sets , 2014, IEEE Transactions on Automation Science and Engineering.

[21]  M. Shamim Hossain,et al.  Big Data-Driven Service Composition Using Parallel Clustered Particle Swarm Optimization in Mobile Environment , 2016, IEEE Transactions on Services Computing.

[22]  Bo Yang,et al.  Web Service Composition Based on Reinforcement Learning , 2015, 2015 IEEE International Conference on Web Services.

[23]  Mohamed Graiet,et al.  A Genetic-Based Adaptive Approach for Reliable and Efficient Service Composition , 2018, IEEE Systems Journal.

[24]  S. Govindarajan,et al.  Weighted Quality of Service based Ranking of Web Services , 2016 .

[25]  J. Leon Zhao,et al.  Service Selection for Composition with QoS Correlations , 2016, IEEE Transactions on Services Computing.

[26]  T.V. Prabhakar,et al.  Dynamic selection of Web services with recommendation system , 2005, International Conference on Next Generation Web Services Practices (NWeSP'05).

[27]  Wei Zhao,et al.  Integrating recurrent neural networks and reinforcement learning for dynamic service composition , 2020, Future Gener. Comput. Syst..

[28]  Anne H. H. Ngu,et al.  QoS-aware middleware for Web services composition , 2004, IEEE Transactions on Software Engineering.

[29]  Hongbing Wang,et al.  Automatic Hierarchical Reinforcement Learning for Efficient Large-Scale Service Composition , 2016, 2016 IEEE International Conference on Web Services (ICWS).

[30]  Baskaran Ramachandran,et al.  An Automatic Web Service Composition Framework Using QoS-Based Web Service Ranking Algorithm , 2015, TheScientificWorldJournal.

[31]  Qiang He,et al.  Alliance-Aware Service Composition Based on Quotient Space , 2016, 2016 IEEE International Conference on Web Services (ICWS).

[32]  Zibin Zheng,et al.  Web Service Recommendation via Exploiting Location and QoS Information , 2014, IEEE Transactions on Parallel and Distributed Systems.

[33]  Siobhán Clarke,et al.  Goal-Driven Service Composition in Mobile and Pervasive Computing , 2018, IEEE Transactions on Services Computing.

[34]  MengChu Zhou,et al.  Web Service Configuration Under Multiple Quality-of-Service Attributes , 2009, IEEE Transactions on Automation Science and Engineering.

[35]  Bruno Zanuttini,et al.  Interactive Value Iteration for Markov Decision Processes with Unknown Rewards , 2013, IJCAI.

[36]  Zibin Zheng,et al.  Investigating QoS of Real-World Web Services , 2014, IEEE Transactions on Services Computing.

[37]  Xianzhong Xie,et al.  An Actor-Critic Deep Reinforcement Learning Approach for Transmission Scheduling in Cognitive Internet of Things Systems , 2020, IEEE Systems Journal.

[38]  Athanasios V. Vasilakos,et al.  Web services composition: A decade's overview , 2014, Inf. Sci..

[39]  Dimitris Sacharidis,et al.  Ranking and Clustering Web Services Using Multicriteria Dominance Relationships , 2010, IEEE Transactions on Services Computing.

[40]  Hang Xu,et al.  A Reinforcement Learning Method for Constraint-Satisfied Services Composition , 2020, IEEE Transactions on Services Computing.

[41]  Manuel Mucientes,et al.  An Integrated Semantic Web Service Discovery and Composition Framework , 2015, IEEE Transactions on Services Computing.

[42]  Oguzhan Alagoz,et al.  Markov Decision Processes: A Tool for Sequential Decision Making under Uncertainty , 2010, Medical decision making : an international journal of the Society for Medical Decision Making.

[43]  R. Venkata Rao,et al.  Teaching-learning-based optimization: A novel method for constrained mechanical design optimization problems , 2011, Comput. Aided Des..

[44]  Valérie Issarny,et al.  Set-Based Bi-level Optimisation for QoS-Aware Service Composition in Ubiquitous Environments , 2015, 2015 IEEE International Conference on Web Services.

[45]  Ju Bin Song,et al.  QoS-aware device selection using user preferences for tasks in ubiquitous environments , 2011, 2011 7th International Conference on Emerging Technologies.

[46]  Li-Ning Xing,et al.  Large-scale and adaptive service composition based on deep reinforcement learning , 2019, J. Vis. Commun. Image Represent..

[47]  Xiang Zhou,et al.  Adaptive Service Composition Based on Reinforcement Learning , 2010, ICSOC.

[48]  Hongbing Wang,et al.  Combining quantitative constraints with qualitative preferences for effective non-functional properties-aware service composition , 2017, J. Parallel Distributed Comput..