Collecting, Annotating, and Classifying Public Web Services

The limitations of the traditional SOA operational model, such as the lack of rich service descriptions, weaken the role of service registries. Removing registries from the model, however, violates the basic principles of SOA, namely dynamic binding and loose coupling. Currently, most service providers publish their Web Services on their own websites instead of in service registries. This results in poor usability of these Web Services, especially with respect to service discovery and service composition. To address this problem, we propose to increase the usability of public Web Services by collecting them automatically from their providers' websites using web crawling techniques. The collected services are then annotated with descriptions extracted from the crawled web pages and with tags generated from the same pages. These annotations are used to classify the Web Services into different application domains. In this paper, we present the details of our approach and show its practical feasibility through several evaluation experiments.
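
The annotation and classification steps described above can be illustrated with a minimal sketch. It is not the implementation evaluated in the paper: it assumes that tags are simply the most frequent non-stop-words of a crawled page and that a domain is chosen by keyword overlap. The stop-word list, the domain vocabularies, and the function names `generate_tags` and `classify` are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical stop-word list; a real system would use a fuller one.
STOP_WORDS = {"the", "a", "an", "and", "or", "of", "to", "for",
              "in", "on", "is", "this", "that", "with", "by", "our"}

# Hypothetical domain vocabularies, invented for illustration only.
DOMAIN_KEYWORDS = {
    "finance": {"currency", "exchange", "stock", "payment", "rate"},
    "weather": {"weather", "forecast", "temperature", "rain"},
}


def generate_tags(page_text, max_tags=5):
    """Return the most frequent non-stop-words of a page as tags."""
    words = re.findall(r"[a-z]+", page_text.lower())
    counts = Counter(w for w in words if w not in STOP_WORDS and len(w) > 2)
    return [word for word, _ in counts.most_common(max_tags)]


def classify(tags):
    """Pick the domain whose vocabulary overlaps most with the tags.

    Returns None when no domain keyword matches any tag.
    """
    tag_set = set(tags)
    scores = {domain: len(keywords & tag_set)
              for domain, keywords in DOMAIN_KEYWORDS.items()}
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


if __name__ == "__main__":
    page = ("This service returns the currency exchange rate. "
            "The exchange rate is updated daily for each currency.")
    tags = generate_tags(page)
    print(tags, classify(tags))
```

In this sketch the tags serve as the only features for classification; descriptions extracted from the page could be added to the same bag of words before scoring.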
