Automatic Web Service Tagging Using Machine Learning and WordNet Synsets

The importancy of Web services comes from the fact that they are an important means to realize SOA applications. Their increasing popularity caused the emergence of a fairly huge number of services. Therefore, finding a particular service among this large service space can be a hard task. User tags have proven to be a useful technique to smooth browsing experience in large document collections. Some service search engines proposes the facility of service tagging. It is usually done manually by the providers and the users of the services, which can be a fairly tedious and error prone task. In this paper we propose an approach for tagging Web services automatically. It adapts techniques from text mining and machine learning to extract tags from WSDL descriptions. Then it enriches these tags by extracting relevant synonyms using WordNet. We validated our approach on a corpus of 146 services extracted from Seekda.

[1]  Schahram Dustdar,et al.  A vector space search engine for Web services , 2005, Third European Conference on Web Services (ECOWS'05).

[2]  Helmut Schmidt,et al.  Probabilistic part-of-speech tagging using decision trees , 1994 .

[3]  Gilad Mishne,et al.  Learning domain ontologies for Web service descriptions: an experiment in bioinformatics , 2005, WWW '05.

[4]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[5]  Marcelo R. Campo,et al.  AWSC: An approach to Web service classification based on machine learning techniques , 2008, Inteligencia Artif..

[6]  Eric Bouillet,et al.  A Folksonomy-Based Model of Web Services for Discovery and Automatic Composition , 2008, 2008 IEEE International Conference on Services Computing.

[7]  Yijun Yu,et al.  Web Service Search: Who, When, What, and How , 2007, WISE Workshops.

[8]  Hui Guo,et al.  Learning Ontologies to Improve the Quality of Automatic Web Service Matching , 2007, IEEE International Conference on Web Services (ICWS 2007).

[9]  Jun Zhang,et al.  Simlarity Search for Web Services , 2004, VLDB.

[10]  Lerina Aversano,et al.  Using Concept Lattices to Support Service Selection , 2006, Int. J. Web Serv. Res..

[11]  Aoying Zhou,et al.  Management and Retrieval of Web Services Based on Formal Concept Analysis , 2005, The Fifth International Conference on Computer and Information Technology (CIT'05).

[12]  Yi-fang Brook Wu,et al.  Domain-specific keyphrase extraction , 2005, CIKM '05.

[13]  Yanchun Zhang,et al.  Efficiently finding web services using a clustering semantic approach , 2008, CSSSIA '08.

[14]  Gordon W. Paynter,et al.  Automatic extraction of document keyphrases for use in digital libraries: Evaluation and applications , 2002, J. Assoc. Inf. Sci. Technol..

[15]  Ewa Orlowska,et al.  Service-Oriented Computing - ICSOC 2003 , 2003, Lecture Notes in Computer Science.

[16]  Pinar Yolum,et al.  Structural and Semantic Similarity Metrics for Web Service Matchmaking , 2007, EC-Web.

[17]  Gosse Bouma,et al.  Mapping Metadata for SWHi: Aligning Schemas with Library Metadata for a Historical Ontology , 2007, WISE Workshops.

[18]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[19]  Peter D. Turney Coherent Keyphrase Extraction via Web Mining , 2003, IJCAI.

[20]  Peter D. Turney Learning Algorithms for Keyphrase Extraction , 2000, Information Retrieval.

[21]  Nicholas Kushmerick,et al.  Learning to Attach Semantic Metadata to Web Services , 2003, International Semantic Web Conference.

[22]  Chi-Hong Leung,et al.  A Statistical Learning Approach to Automatic Indexing of Controlled Index Terms , 1997, J. Am. Soc. Inf. Sci..

[23]  Gordon W. Paynter,et al.  Human evaluation of Kea, an automatic keyphrasing system , 2001, JCDL '01.

[24]  Marcelo R. Campo,et al.  Query by example for web services , 2008, SAC '08.

[25]  Chouki Tibermacine,et al.  WSPAB: A Tool for Automatic Classification & Selection of Web Services Using Formal Concept Analysis , 2008, 2008 Sixth European Conference on Web Services.

[26]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[27]  Susan T. Dumais,et al.  Inductive learning algorithms and representations for text categorization , 1998, CIKM '98.

[28]  Eleni Stroulia,et al.  Semantic Structure Matching for Assessing Web-Service Similarity , 2003, ICSOC.

[29]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .