Mining and clustering service goals for RESTful service discovery

In recent years, RESTful services that are mainly described using short texts are becoming increasingly popular. The keyword-based discovery technology adopted by existing service registries usually suffers from low recall and is insufficient to retrieve accurate RESTful services according to users’ functional goals. Moreover, it is often difficult for users to specify queries that can precisely represent their requirements due to the lack of knowledge on their desired service functionalities. Toward these issues, we propose a RESTful service discovery approach by leveraging service goal (i.e., service functionality) knowledge mined from services’ textual descriptions. The approach first groups the available services into clusters using probabilistic topic models. Then, service goals are extracted from the textual descriptions of services and also clustered based on the topic modeling results of services. Based on service goal clusters, we design a mechanism to recommend semantically relevant service goals to help users refine their initial queries. Relevant services are retrieved by matching user selected service goals with those of candidate services. To improve the recall of the goal-based service discovery approach, we further propose a hybrid approach by integrating it with two existing service discovery approaches. A series of experiments conducted on real-world services crawled from a publicly accessible registry, ProgrammableWeb, demonstrate the effectiveness of the proposed approaches.

[1]  Keqing He,et al.  An On-Demand Services Discovery Approach Based on Topic Clustering , 2014 .

[2]  Nando de Freitas,et al.  An Introduction to MCMC for Machine Learning , 2004, Machine Learning.

[3]  Martti Juhola,et al.  Stemming and lemmatization in the clustering of finnish text documents , 2004, CIKM '04.

[4]  Beatrice Santorini,et al.  Part-of-Speech Tagging Guidelines for the Penn Treebank Project (3rd Revision) , 1990 .

[5]  Ralf Steinmetz,et al.  Adaptive matchmaking for RESTful services based on hRESTS and MicroWSMO , 2010, WEWST '10.

[6]  Jia Zhang,et al.  A Unified RGPS-Based Approach Supporting Service-Oriented Process Customization , 2014, Web Services Foundations.

[7]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[8]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[9]  Roy Fielding,et al.  Architectural Styles and the Design of Network-based Software Architectures"; Doctoral dissertation , 2000 .

[10]  M. S. Rajasree,et al.  RESTDoc: Describe, Discover and Compose RESTful Semantic Web Services using Annotated Documentations , 2013 .

[11]  Marcelo R. Campo,et al.  A Survey of Approaches to Web Service Discovery in Service-Oriented Architectures , 2011, J. Database Manag..

[12]  Omar Boucelma,et al.  Flexible Matchmaking for RESTful Web Services , 2013, OTM Conferences.

[13]  Zibin Zheng,et al.  WT-LDA: User Tagging Augmented LDA for Web Service Clustering , 2013, ICSOC.

[14]  Andrew McCallum,et al.  Efficient methods for topic model inference on streaming document collections , 2009, KDD.

[15]  Mark Stevenson,et al.  Comparing Information Extraction Pattern Models , 2006 .

[16]  Keqing He,et al.  WSGM-SD: An Approach to RESTful Service Discovery Based on Weighted Service Goal Model , 2016 .

[17]  S. Swamynathan,et al.  Content Based Service Discovery in Semantic Web Services Using WordNet , 2011, ADCONS.

[18]  Andrzej Bargiela,et al.  Probabilistic Topic Models for Learning Terminological Ontologies , 2010, IEEE Transactions on Knowledge and Data Engineering.

[19]  Dieter Fensel,et al.  WSMO-Lite and hRESTS: Lightweight semantic annotations for Web services and RESTful APIs , 2015, J. Web Semant..

[20]  Barbara Pernici,et al.  URBE: Web Service Retrieval Based on Similarity Evaluation , 2009, IEEE Transactions on Knowledge and Data Engineering.

[21]  Hui Xiong,et al.  Semantics-Based Automated Service Discovery , 2012, IEEE Transactions on Services Computing.

[22]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Yutao Ma,et al.  Mining Domain Knowledge on Service Goals from Textual Service Descriptions , 2020, IEEE Transactions on Services Computing.

[24]  Willem-Jan van den Heuvel,et al.  Leveraging Web Services Discovery with Customizable Hybrid Matching , 2006, ICSOC.

[25]  Matthias Klusch,et al.  OWLS-MX: A hybrid Semantic Web service matchmaker for OWL-S services , 2009, J. Web Semant..

[26]  Matthias Klusch,et al.  Hybrid Adaptive Web Service Selection with SAWSDL-MX and WSDL-Analyzer , 2009, ESWC.

[27]  Colette Rolland,et al.  Guiding Goal Modeling Using Scenarios , 1998, IEEE Trans. Software Eng..

[28]  Jia Zhang,et al.  Leveraging Incrementally Enriched Domain Knowledge to Enhance Service Categorization , 2012, Int. J. Web Serv. Res..

[29]  Matthias Klusch,et al.  WSMO-MX: A hybrid Semantic Web service matchmaker , 2009, Web Intell. Agent Syst..

[30]  Klaus Moessner,et al.  Probabilistic Matchmaking Methods for Automated Service Discovery , 2014, IEEE Transactions on Services Computing.

[31]  Tomas Vitvar,et al.  hRESTS: An HTML Microformat for Describing RESTful Web Services , 2008, 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology.

[32]  Mathias Lux,et al.  How Do Users Express Goals on the Web? - An Exploration of Intentional Structures in Web Search , 2007, WISE Workshops.

[33]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[34]  Ewan Klein,et al.  Natural Language Processing with Python , 2009 .

[35]  Fangfang Liu,et al.  Measuring Similarity of Web Services Based on WSDL , 2010, 2010 IEEE International Conference on Web Services.

[36]  Wei Jiang,et al.  Large-Scale Longitudinal Analysis of SOAP-Based and RESTful Web Services , 2012, 2012 IEEE 19th International Conference on Web Services.

[37]  Shang-Pin Ma,et al.  Web Service Discovery Using Lexical and Semantic Query Expansion , 2013, 2013 IEEE 10th International Conference on e-Business Engineering.

[38]  Cheng Zeng,et al.  Towards Services Discovery Based on Service Goal Extraction and Recommendation , 2013, 2013 IEEE International Conference on Services Computing.

[39]  Xiaohui Hu,et al.  Time Aware and Data Sparsity Tolerant Web Service Recommendation Based on Improved Collaborative Filtering , 2015, IEEE Transactions on Services Computing.

[40]  Eleni Stroulia,et al.  Flexible interface matching for Web-service discovery , 2003, Proceedings of the Fourth International Conference on Web Information Systems Engineering, 2003. WISE 2003..

[41]  Qi Yu,et al.  An LDA-SVM Active Learning Framework for Web Service Classification , 2016, 2016 IEEE International Conference on Web Services (ICWS).

[42]  Michael R. Lyu,et al.  Learning to suggest questions in social media , 2015, Knowledge and Information Systems.

[43]  John Domingue,et al.  Investigating Web APIs on the World Wide Web , 2010, 2010 Eighth IEEE European Conference on Web Services.

[44]  Athman Bouguettaya,et al.  Web Service Classification Using Support Vector Machine , 2010, 2010 22nd IEEE International Conference on Tools with Artificial Intelligence.

[45]  Yanchun Zhang,et al.  Efficiently finding web services using a clustering semantic approach , 2008, CSSSIA '08.

[46]  Yoonsung Cho,et al.  Automatic Tagging of Functional-Goals for Goal-Driven Semantic Service Discovery , 2013, 2013 IEEE Seventh International Conference on Semantic Computing.

[47]  Zibin Zheng,et al.  WTCluster: Utilizing Tags for Web Services Clustering , 2011, ICSOC.

[48]  David Ruiz,et al.  Improving semantic web services discovery using SPARQL-based repository filtering , 2012, J. Web Semant..

[49]  Keqing He,et al.  A Web Service Discovery Approach Based on Common Topic Groups Extraction , 2017, IEEE Access.

[50]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[51]  Marin Lujak,et al.  Service discovery acceleration with hierarchical clustering , 2015, Inf. Syst. Frontiers.