TheaterLoc: a case study in building an information integration application

Although much has been written about various information integration technologies, little has been said about how to combine them to build a complete application. In this paper, we describe the design and implementation of TheaterLoc, an information integration application that lets users retrieve information about theaters and restaurants in cities in the Los Angeles area, including an interactive map showing their relative locations and video trailers of the movies playing at the area theaters. The data retrieved by TheaterLoc comes from five distinct, heterogeneous, distributed sources. The enabling technology for the integration includes the Ariadne information mediator, wrappers for each of the web-based data sources, and a video server that streams the movie trailers. We focus in detail on the mediator technologies, such as data modeling, source axiom compilation, and query planning. We also describe how the wrappers present an interface for querying data on web sites, which supports the information retrieval performed during data integration. Finally, we discuss some of the major integration problems we encountered and our plans for dealing with them.
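
To make the mediator-and-wrapper arrangement described above more concrete, the following is a minimal sketch in Python. It is not Ariadne's interface: the class names, the `query` method, the field names, and the sample records are all hypothetical placeholders, and the in-memory lists stand in for the page retrieval and extraction that a real wrapper would perform. The mediator here uses a fixed nested-loop join rather than generating a query plan.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List

Row = Dict[str, str]


@dataclass
class Wrapper:
    """Presents one source as a queryable relation (hypothetical interface)."""
    name: str
    fetch: Callable[[], Iterable[Row]]  # stands in for page retrieval + extraction

    def query(self, **conditions: str) -> List[Row]:
        # Return the rows whose fields equal every requested condition.
        return [row for row in self.fetch()
                if all(row.get(key) == value for key, value in conditions.items())]


class Mediator:
    """Answers a query over a single view by combining several wrappers."""

    def __init__(self, wrappers: Iterable[Wrapper]) -> None:
        self.wrappers = {w.name: w for w in wrappers}

    def theaters_with_showings(self, city: str) -> List[Row]:
        # Fixed plan: select theaters by city, then look up showings per theater.
        results: List[Row] = []
        for theater in self.wrappers["theaters"].query(city=city):
            for showing in self.wrappers["showtimes"].query(theater=theater["name"]):
                results.append({**theater, "movie": showing["movie"]})
        return results


# Placeholder records, invented purely for illustration.
THEATERS = [
    {"name": "Example Cinema", "city": "Santa Monica"},
    {"name": "Sample Theater", "city": "Pasadena"},
]
SHOWTIMES = [
    {"theater": "Example Cinema", "movie": "Movie A"},
    {"theater": "Example Cinema", "movie": "Movie B"},
    {"theater": "Sample Theater", "movie": "Movie A"},
]

mediator = Mediator([
    Wrapper("theaters", lambda: THEATERS),
    Wrapper("showtimes", lambda: SHOWTIMES),
])

if __name__ == "__main__":
    for row in mediator.theaters_with_showings("Santa Monica"):
        print(row["name"], "-", row["movie"])
```

The point of the sketch is the division of labor: each wrapper hides how its source is accessed behind a uniform query interface, so the mediator can combine sources without knowing their page formats.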
