Pollock: automatic generation of virtual web services from web sites

As the use of Web Services grows rapidly, new tools that help generate them quickly are needed. In this paper, we propose a methodology that automatically generates Web Services from the FORM-based query interfaces of a web site. Because the majority of web data is "hidden" behind such FORM interfaces, we believe that turning these human-oriented query interfaces into machine-oriented Web Services is an important problem. Toward this goal, we adopt the wrapper technology successfully developed and deployed in the database community, and demonstrate how to generate Web Services components (e.g., WSDL, UDDI, SOAP) automatically. We present the overall architecture of our prototype and a few showcases based on real web sites.
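To make the idea of turning a FORM interface into a machine-oriented operation concrete, the following is a minimal sketch, not the Pollock implementation. It assumes that a first step would be to read a form's action URL, HTTP method, and named input fields, which could then serve as the operation name and parameters of a generated service description. The names `FormSignatureParser`, `extract_operations`, and the sample form are hypothetical and chosen for illustration only.

```python
from html.parser import HTMLParser


class FormSignatureParser(HTMLParser):
    """Collects the action, method, and named fields of each <form>."""

    def __init__(self):
        super().__init__()
        self.forms = []       # one dict per completed form
        self._current = None  # the form currently being parsed, if any

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "form":
            self._current = {
                "action": attrs.get("action", ""),
                "method": (attrs.get("method") or "get").upper(),
                "params": [],
            }
        elif self._current is not None and tag in ("input", "select", "textarea"):
            name = attrs.get("name")
            # Skip unnamed controls and buttons; they carry no query parameter.
            if name and attrs.get("type", "text") not in ("submit", "reset", "button"):
                self._current["params"].append(name)

    def handle_endtag(self, tag):
        if tag == "form" and self._current is not None:
            self.forms.append(self._current)
            self._current = None


def extract_operations(html):
    """Return one operation signature (a dict) per form found in `html`."""
    parser = FormSignatureParser()
    parser.feed(html)
    parser.close()
    return parser.forms


# Hypothetical query form, standing in for a real site's search page.
SAMPLE = """
<form action="/search" method="post">
  <input type="text" name="title">
  <select name="year"><option>2004</option></select>
  <input type="submit" value="Go">
</form>
"""

if __name__ == "__main__":
    for op in extract_operations(SAMPLE):
        print(op["method"], op["action"], op["params"])
```

Each returned dictionary is only a signature. Mapping it to WSDL, and wrapping the HTML result page into structured output, are separate steps that this sketch does not attempt.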
