A scalable automatic service discovery approach based on probabilistic topic model

Current service discovery approaches mainly focus on syntax matchmaking, which contains little semantic information to discover services automatically. This paper proposes a scalable automatic service discovery approach based on probabilistic topic model. Specifically, a novel service description model PTWSDM is proposed. With this model, heterogeneous service descriptions can be represented in a topic vector form on the same homogeneous plane. For the scarcity of word co-occurrence patterns in service functional descriptions, Biterm topic model is introduced to extract latent topics. Finally, a stream algorithm for topic model updating is introduced in order that the proposed approach is scalable and adaptable for large-scale dynamic registry. Experimental results confirm that the proposed approach outperforms the state-of-the-art solutions in terms of precision and normalised discounted cumulative gain values. It also has good time performance and scalability.

[1]  Mohamed Quafafou,et al.  Probabilistic Topic Models for Web Services Clustering and Discovery , 2013, ESOCC.

[2]  Ye Lei and Zhang Bin A Method of Web Service Discovery Based on Functional Semantics , 2007 .

[3]  Wei Deng,et al.  Web Service Discovery by Integrating Structure and Reference Features of Description Documents , 2011 .

[4]  Manuel Mucientes,et al.  An Integrated Semantic Web Service Discovery and Composition Framework , 2015, IEEE Transactions on Services Computing.

[5]  George Karypis,et al.  Hierarchical Clustering Algorithms for Document Datasets , 2005, Data Mining and Knowledge Discovery.

[6]  Sandro Morasca,et al.  Supporting the semi-automatic semantic annotation of web services: A systematic literature review , 2015, Inf. Softw. Technol..

[7]  Chi-Chun Lo,et al.  A trustworthy QoS-based mechanism for web service discovery based on collaborative filtering , 2013, 2013 Fifth International Conference on Ubiquitous and Future Networks (ICUFN).

[8]  Yu Jianjun,et al.  Topic model based structural Web services discovery , 2008 .

[9]  Shi Zhong Reasoning About Semantic Web Services with an Approach Based on Dynamic Description Logics , 2008 .

[10]  Peng Wang,et al.  A robust framework for short text categorization based on topic model and integrated classifier , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[11]  Joonho Kwon,et al.  Scalable and efficient web services composition based on a relational database , 2011, J. Syst. Softw..

[12]  Frank Siqueira,et al.  Discovery of Semantic Web Services Compositions Based on SAWSDL Annotations , 2012, 2012 IEEE 19th International Conference on Web Services.

[13]  Jiafeng Guo,et al.  BTM: Topic Modeling over Short Texts , 2014, IEEE Transactions on Knowledge and Data Engineering.

[14]  Jinpeng Huai,et al.  A Probabilistic Approach for Web Service Discovery , 2013, 2013 IEEE International Conference on Services Computing.

[15]  Zhao Yun,et al.  An approach to discover semantic web services in distributed environment based on Chord , 2010, 2010 IEEE International Conference on Intelligent Systems and Knowledge Engineering.

[16]  Chen Wu WSDL term tokenization methods for IR-style Web services discovery , 2012, Sci. Comput. Program..

[17]  Szu-Yin Lin,et al.  A trustworthy QoS-based collaborative filtering approach for web service discovery , 2014, J. Syst. Softw..

[18]  Mohamed Quafafou,et al.  Leveraging Formal Concept Analysis with Topic Correlation for Service Clustering and Discovery , 2014, 2014 IEEE International Conference on Web Services.

[19]  Qi He,et al.  TwitterRank: finding topic-sensitive influential twitterers , 2010, WSDM '10.

[20]  Ma Heng-tai A Semantic Matchmaking System Mechanism for Web Service Discovery Based on OWL-S , 2010 .

[21]  Sun Da,et al.  Big Data Stream Computing: Technologies and Instances , 2014 .

[22]  Ke Chang Research on Constraint-Oriented Web Service Discovery , 2012 .

[23]  Mara Nikolaidou,et al.  A Specialized Search Engine for Web Service Discovery , 2012, 2012 IEEE 19th International Conference on Web Services.

[24]  Brian D. Davison,et al.  Empirical study of topic modeling in Twitter , 2010, SOMA '10.

[25]  Klaus Moessner,et al.  Probabilistic Matchmaking Methods for Automated Service Discovery , 2014, IEEE Transactions on Services Computing.

[26]  Cao Juan,et al.  A Method of Adaptively Selecting Best LDA Model Based on Density , 2008 .