Semantic Sparse Service Discovery Using Word Embedding and Gaussian LDA

Nowadays, a growing number of web services are offered in API marketplaces browsed by service developers or third-party registries. Under this situation, API marketplaces’ users greatly rely on a search engine to find suitable web services. However, due to the fact that functional attributes of web services are usually described in short texts, the search engine-based discovery approach suffers from the semantic sparsity problem, which hinders the effect of service discovery. To address this issue, we propose a novel web service discovery approach using word embedding and Gaussian latent Dirichlet allocation (Gaussian LDA). Unlike most existing service discovery approaches, our approach first uses context information generated by word embedding to enrich the semantics of service descriptions and users’ queries. Then, the enriched service description is loaded into the Gaussian LDA model to acquire service description representation. Finally, the services are ranked by considering the relevance between the extended user’s query and service description representation. The experiments conducted on a real-world web service dataset and the results demonstrate that the proposed approach achieves superior effectiveness on web service discovery.

[1]  Mohamed Quafafou,et al.  Leveraging Formal Concept Analysis with Topic Correlation for Service Clustering and Discovery , 2014, 2014 IEEE International Conference on Web Services.

[2]  Keqing He,et al.  Web service discovery based on goal-oriented query expansion , 2018, J. Syst. Softw..

[3]  Rajarshi Das,et al.  Gaussian LDA for Topic Models with Word Embeddings , 2015, ACL.

[4]  Nan Sun,et al.  Exploiting internal and external semantics for the clustering of short texts using world knowledge , 2009, CIKM.

[5]  Mohamed Quafafou,et al.  Correlated Topic Model for Web Services Ranking , 2013 .

[6]  Ning Zhang,et al.  Virtual Vehicle Coordination for Vehicles as Ambient Sensing Platforms , 2018, IEEE Access.

[7]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.

[8]  Patrick Martin,et al.  Clustering WSDL Documents to Bootstrap the Discovery of Web Services , 2010, 2010 IEEE International Conference on Web Services.

[9]  Keqing He,et al.  An approach to business process registration for enterprise collaboration: using BPEL as an example , 2015, Int. J. Bus. Process. Integr. Manag..

[10]  Richi Nayak,et al.  Web Service Discovery with additional Semantics and Clustering , 2007, IEEE/WIC/ACM International Conference on Web Intelligence (WI'07).

[11]  Joe Tekli,et al.  An Overview on XML Semantic Disambiguation from Unstructured Text to Semi-Structured Data: Background, Applications, and Ongoing Challenges , 2016, IEEE Transactions on Knowledge and Data Engineering.

[12]  Yuanyuan Qiao,et al.  A Survey on Machine Learning-Based Mobile Big Data Analysis: Challenges and Applications , 2018, Wirel. Commun. Mob. Comput..

[13]  Zibin Zheng,et al.  Cloud Service Reliability Enhancement via Virtual Machine Placement Optimization , 2017, IEEE Transactions on Services Computing.

[14]  Zibin Zheng,et al.  WTCluster: Utilizing Tags for Web Services Clustering , 2011, ICSOC.

[15]  Geoffrey Zweig,et al.  Linguistic Regularities in Continuous Space Word Representations , 2013, NAACL.

[16]  Mingdong Tang,et al.  WE-LDA: A Word Embeddings Augmented LDA Model for Web Services Clustering , 2017, 2017 IEEE International Conference on Web Services (ICWS).

[17]  Zibin Zheng,et al.  WT-LDA: User Tagging Augmented LDA for Web Service Clustering , 2013, ICSOC.

[18]  Enhong Chen,et al.  Word Embedding Revisited: A New Representation Learning and Explicit Matrix Factorization Perspective , 2015, IJCAI.

[19]  Keqing He,et al.  An Approach of Service Discovery Based on Service Goal Clustering , 2016, 2016 IEEE International Conference on Services Computing (SCC).

[20]  Yanchun Zhang,et al.  Algorithm for web services matching , 2004 .

[21]  M. de Rijke,et al.  Short Text Similarity with Word Embeddings , 2015, CIKM.

[22]  Keqing He,et al.  A Web Service Discovery Approach Based on Common Topic Groups Extraction , 2017, IEEE Access.

[23]  Qiang Yang,et al.  Transferring topical knowledge from auxiliary long texts for short text clustering , 2011, CIKM '11.

[24]  Chengqi Zhang,et al.  Boosting for graph classification with universum , 2016, Knowledge and Information Systems.

[25]  Manuel Mucientes,et al.  An Integrated Semantic Web Service Discovery and Composition Framework , 2015, IEEE Transactions on Services Computing.

[26]  Matthias Klusch,et al.  Automated semantic web service discovery with OWLS-MX , 2006, AAMAS '06.

[27]  Keqing He,et al.  ST-LDA: High Quality Similar Words Augmented LDA for Service Clustering , 2018, ICA3PP.

[28]  Denivaldo Lopes,et al.  Semantic-based Clustering of Web Services , 2015, J. Web Eng..

[29]  Wray L. Buntine Estimating Likelihoods for Topic Models , 2009, ACML.

[30]  Wilson Wong,et al.  Web service clustering using text mining techniques , 2009, Int. J. Agent Oriented Softw. Eng..

[31]  Alejandro Zunino,et al.  Exploiting named entity recognition for improving syntactic-based web service discovery , 2018, J. Inf. Sci..

[32]  Fakhri Karray,et al.  Short-Text Clustering using Statistical Semantics , 2015, WWW.

[33]  Alejandro Zunino,et al.  Word embeddings for improving REST services discoverability , 2017, 2017 XLIII Latin American Computer Conference (CLEI).

[34]  Zibin Zheng,et al.  Clustering Web services to facilitate service discovery , 2013, Knowledge and Information Systems.

[35]  Philip S. Yu,et al.  Service Recommendation Based on Topics and Trend Prediction , 2016, CollaborateCom.

[36]  Lei Yu,et al.  Semantically structured service community discovery: based on relationship and functionality , 2016, Int. J. Comput. Sci. Eng..

[37]  Klaus Moessner,et al.  Probabilistic Methods for Service Clustering , 2010, SMRR@ISWC.