Multilabel Learning for Automatic Web Services Tagging

Recently, some web services portals and search engines as Biocatalogue and Seekda!, have allowed users to manually annotate Web services using tags. User Tags provide meaningful descriptions of services and allow users to index and organize their contents. Tagging technique is widely used to annotate objects in Web 2.0 applications. In this paper we propose a novel probabilistic topic model (which extends the CorrLDA model - Correspondence Latent Dirichlet Allocation-) to automatically tag web services according to existing manual tags. Our probabilistic topic model is a latent variable model that exploits local correlation labels. Indeed, exploiting label correlations is a challenging and crucial problem especially in multi-label learning context. Moreover, several existing systems can recommend tags for web services based on existing manual tags. In most cases, the manual tags have better quality. We also develop three strategies to automatically recommend the best tags for web services. We also propose, in this paper, WS-Portal; An Enriched Web Services Search Engine which contains 7063 providers, 115 sub-classes of category and 22236 web services crawled from the Internet. In WS-Portal, severals technologies are employed to improve the effectiveness of web service discovery (i.e. web services clustering, tags recommendation, services rating and monitoring). Our experiments are performed out based on real-world web services. The comparisons of Precision@n, Normalised Discounted Cumulative Gain (NDCGn) values for our approach indicate that the method presented in this paper outperforms the method based on the CorrLDA in terms of ranking and quality of generated tags.

[1]  Thomas L. Griffiths,et al.  Probabilistic Topic Models , 2007 .

[2]  Wilson Wong,et al.  Web service clustering using text mining techniques , 2009, Int. J. Agent Oriented Softw. Eng..

[3]  Natallia Kokash,et al.  A Comparison of Web Service Interface Similarity Measures , 2006, STAIRS.

[4]  Thomas L. Griffiths,et al.  The Author-Topic Model for Authors and Documents , 2004, UAI.

[5]  Zibin Zheng,et al.  Modeling and exploiting tag relevance for Web service mining , 2013, Knowledge and Information Systems.

[6]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine-mediated learning.

[7]  Zibin Zheng,et al.  QoS-Aware Web Service Recommendation by Collaborative Filtering , 2011, IEEE Transactions on Services Computing.

[8]  Zibin Zheng,et al.  WT-LDA: User Tagging Augmented LDA for Web Service Clustering , 2013, ICSOC.

[9]  Jun Zhang,et al.  Simlarity Search for Web Services , 2004, VLDB.

[10]  Thomas Hofmann,et al.  Collaborative filtering via gaussian probabilistic latent semantic analysis , 2003, SIGIR.

[11]  Mohamed Quafafou,et al.  WS-Portal an Enriched Web Services Search Engine , 2014, ICSOC Workshops.

[12]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[13]  Martin F. Porter,et al.  An algorithm for suffix stripping , 1997, Program.

[14]  Mathias Weske,et al.  Light-Weight Semantic Service Annotations Through Tagging , 2006, ICSOC.

[15]  Lu Fang,et al.  Towards Automatic Tagging for Web Services , 2012, 2012 IEEE 19th International Conference on Web Services.

[16]  Mohamed Quafafou,et al.  Leveraging Formal Concept Analysis with Topic Correlation for Service Clustering and Discovery , 2014, 2014 IEEE International Conference on Web Services.

[17]  Mohamed Quafafou,et al.  Correlated Topic Model for Web Services Ranking , 2013 .

[18]  Gustavo Alonso,et al.  Web Services: Concepts, Architectures and Applications , 2009 .

[19]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[20]  Maria Ganzha,et al.  WSColab: Structured Collaborative Tagging for Web Service Matchmaking , 2010, WEBIST.

[21]  Patrick Martin,et al.  Clustering WSDL Documents to Bootstrap the Discovery of Web Services , 2010, 2010 IEEE International Conference on Web Services.

[22]  Naonori Ueda,et al.  Probabilistic latent semantic visualization: topic model for visualizing documents , 2008, KDD.

[23]  Mohamed Quafafou,et al.  Encoding Local Correspondence in Topic Models , 2013, 2013 IEEE 25th International Conference on Tools with Artificial Intelligence.

[24]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[25]  Chouki Tibermacine,et al.  Automatic Web Service Tagging Using Machine Learning and WordNet Synsets , 2010, WEBIST.

[26]  Michael I. Jordan,et al.  Modeling annotated data , 2003, SIGIR.

[27]  Maria Ganzha,et al.  Evaluation of Structured Collaborative Tagging for Web Service Matchmaking , 2012, Semantic Web Services, Advancement through Evaluation.

[28]  Mohamed Quafafou,et al.  Probabilistic Topic Models for Web Services Clustering and Discovery , 2013, ESOCC.