An automatic approach for tagging Web services using machine learning techniques

Web services have become popular and increasingly important in e-business and e-commerce applications especially in large scale distributed systems. As a result, an increasing number of Web services have been developed. However, this huge collection of Web services makes the task of locating a suitable one more challenging as well as more difficult. Automatic clustering of Web services groups services with similar functions together. Clustering could greatly boost the power of Web service search engines and generate tags to improve the search accuracy of tag-based service recommendation. In this paper, we propose a Web service clustering technique based on Carrot search clustering and K-means to group similar services together. These clustered groups are then tagged. We also develop a tag-based service recommendation for WSDL documents using naive bayes algorithm to classify Web services into different tags. We demonstrate that the proposed clustering approach is effective for Web service discovery through two sets of real data.

[1]  Francisco Moo-Mena,et al.  Use of Classification Algorithms for Semantic Web Services Discovery , 2013, J. Comput..

[2]  Dawid Weiss,et al.  A concept-driven algorithm for clustering search results , 2005, IEEE Intelligent Systems.

[3]  Patrick Martin,et al.  Clustering WSDL Documents to Bootstrap the Discovery of Web Services , 2010, 2010 IEEE International Conference on Web Services.

[4]  Dawid Weiss,et al.  Carrot2: Design of a Flexible and Efficient Web Information Retrieval Framework , 2005, AWIC.

[5]  Esko Ukkonen,et al.  On-line construction of suffix trees , 1995, Algorithmica.

[6]  San-Yih Hwang,et al.  A Hybrid Approach to Automatic Web Services Discovery , 2011, 2011 International Joint Conference on Service Sciences.

[7]  Wilson Wong,et al.  Web service clustering using text mining techniques , 2009, Int. J. Agent Oriented Softw. Eng..

[8]  Natallia Kokash,et al.  A Comparison of Web Service Interface Similarity Measures , 2006, STAIRS.

[9]  Nicholas Kushmerick,et al.  Learning to Attach Semantic Metadata to Web Services , 2003, International Semantic Web Conference.

[10]  Zibin Zheng,et al.  Modeling and exploiting tag relevance for Web service mining , 2013, Knowledge and Information Systems.

[11]  Samir Tata,et al.  A Web Service Recommender System Using Vector Space Model and Latent Semantic Indexing , 2011, 2011 IEEE International Conference on Advanced Information Networking and Applications.

[12]  Jian Yin,et al.  Text Clustering Using a Suffix Tree Similarity Measure , 2011, J. Comput..

[13]  Meng Li,et al.  CoWS: An Internet-Enriched and Quality-Aware Web Services Search Engine , 2011, 2011 IEEE International Conference on Web Services.

[14]  Schahram Dustdar,et al.  A vector space search engine for Web services , 2005, Third European Conference on Web Services (ECOWS'05).

[15]  Mohamed Quafafou,et al.  Probabilistic Topic Models for Web Services Clustering and Discovery , 2013, ESOCC.

[16]  Mathias Weske,et al.  Light-Weight Semantic Service Annotations Through Tagging , 2006, ICSOC.

[17]  Athman Bouguettaya,et al.  Web Service Classification Using Support Vector Machine , 2010, 2010 22nd IEEE International Conference on Tools with Artificial Intelligence.

[18]  Yanchun Zhang,et al.  Efficiently finding web services using a clustering semantic approach , 2008, CSSSIA '08.

[19]  Zibin Zheng,et al.  Clustering Web services to facilitate service discovery , 2013, Knowledge and Information Systems.

[20]  Zibin Zheng,et al.  WT-LDA: User Tagging Augmented LDA for Web Service Clustering , 2013, ICSOC.

[21]  Chouki Tibermacine,et al.  Automatic Tag Identification in Web Service Descriptions , 2010, WEBIST.

[22]  Zhaoyun Ding,et al.  A Web Service Discovery Method Based on Tag , 2010, 2010 International Conference on Complex, Intelligent and Software Intensive Systems.

[23]  Maria Ganzha,et al.  WSColab: Structured Collaborative Tagging for Web Service Matchmaking , 2010, WEBIST.

[24]  Feng Deng Web service matching based on semantic classification , 2012 .

[25]  Zibin Zheng,et al.  Titan: a system for effective web service discovery , 2012, WWW.

[26]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[27]  Y. Amghar,et al.  Enhancing Web Service Discovery by Using Collaborative Tagging System , 2008, 2008 4th International Conference on Next Generation Web Services Practices.

[28]  C. A. Murthy,et al.  Application of rough ensemble classifier to web services categorization and focused crawling , 2010, Web Intell. Agent Syst..

[29]  Lu Fang,et al.  Towards Automatic Tagging for Web Services , 2012, 2012 IEEE 19th International Conference on Web Services.

[30]  Mingdong Tang,et al.  An Effective Web Service Recommendation Method Based on Personalized Collaborative Filtering , 2011, 2011 IEEE International Conference on Web Services.

[31]  Chouki Tibermacine,et al.  Automatic Web Service Tagging Using Machine Learning and WordNet Synsets , 2010, WEBIST.

[32]  Chen Wu,et al.  An Empirical Approach for Semantic Web Services Discovery , 2008 .

[33]  Oren Etzioni,et al.  Clustering web documents: a phrase-based method for grouping search engine results , 1999 .

[34]  Jun Zhang,et al.  Simlarity Search for Web Services , 2004, VLDB.

[35]  Zibin Zheng,et al.  WTCluster: Utilizing Tags for Web Services Clustering , 2011, ICSOC.

[36]  Oren Etzioni,et al.  Web document clustering: a feasibility demonstration , 1998, SIGIR '98.

[37]  Mukesh Kumar,et al.  Web Service Discovery Research: A Study of Existing Approaches , 2011 .

[38]  Jing Li,et al.  Suffix Tree Based Incremental Web Services Clustering Method , 2013, J. Networks.

[39]  Ying Zou,et al.  An Approach for Context-Aware Service Discovery and Recommendation , 2010, 2010 IEEE International Conference on Web Services.

[40]  Wen Gao,et al.  Two-phase Web site classification based on hidden Markov tree models , 2003, Proceedings IEEE/WIC International Conference on Web Intelligence (WI 2003).

[41]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[42]  Dawid Weiss,et al.  Lingo: Search Results Clustering Algorithm Based on Singular Value Decomposition , 2004, Intelligent Information Systems.

[43]  Mara Nikolaidou,et al.  A Specialized Search Engine for Web Service Discovery , 2012, 2012 IEEE 19th International Conference on Web Services.

[44]  Ying Zou,et al.  An Approach for Service Discovery and Recommendation Using Contexts , 2014, Web Services Foundations.

[45]  Dawid Weiss,et al.  Carrot and Language Properties in Web Search Results Clustering , 2003, AWIC.

[46]  Dawid Weiss,et al.  Carrot2: Making Sense of the Haystack , 2008, ERCIM News.