Towards Extensible Information Brokers Based on XML

The exponential growth in the number and size of information services available on the internet has created an urgent need for information agents that act as brokers: agents that can autonomously search, gather and integrate information on behalf of a user. Moreover, the inherent volatility of the internet and the wide range of information processing tasks to be carried out call for a framework that facilitates both the construction and the evolution of such information brokers. This paper proposes such a framework, named XIB (eXtensible Information Brokers). Based on descriptions of the relevant information services, XIB supports the interactive generation of an integrated query interface, dynamically generates a wrapper for each information service, and returns the composed result of a query to the user. XIB depends heavily on XML-related techniques. More specifically, we use DTDs to model the input and output of each service, and XML elements to denote the input and output values. With this representation, service integration is investigated in the form of DTD integration, and query decomposition is studied in the form of XML element decomposition. Within the proposed framework, it is easy to add information services on the internet to a broker or to remove them from it, which facilitates maintenance, evolution and customization.
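As an illustration of query decomposition viewed as XML element decomposition, the following minimal sketch splits one integrated query element into one sub-query element per service. It is not the XIB implementation: the service names, the field names, the `SERVICE_INPUTS` table (a stand-in for the element declarations of each service's input DTD) and the `decompose` function are all assumptions made for this example.

```python
import xml.etree.ElementTree as ET

# Hypothetical service descriptions: for each service, the input element
# names it accepts. In the framework described above this information
# would come from the service's input DTD; here it is hard-coded.
SERVICE_INPUTS = {
    "bookstore": ["title", "author"],
    "library": ["title", "isbn"],
}


def decompose(query, service_inputs):
    """Split an integrated query element into one sub-query per service.

    Each sub-query keeps only the child elements that the service
    declares as inputs; children the service does not know are dropped.
    """
    sub_queries = {}
    for service, fields in service_inputs.items():
        sub = ET.Element(service + "_query")
        for name in fields:
            child = query.find(name)
            if child is not None:
                ET.SubElement(sub, name).text = child.text
        sub_queries[service] = sub
    return sub_queries


# An integrated query carrying values for the union of all input fields.
integrated = ET.fromstring(
    "<query><title>XML</title><author>A. Writer</author><isbn>42</isbn></query>"
)

for service, element in decompose(integrated, SERVICE_INPUTS).items():
    print(service, ET.tostring(element, encoding="unicode"))
```

Under these assumptions, adding or removing a service amounts to adding or removing one entry in the description table, which mirrors the extensibility argument made above.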
