Selectively materializing data in mediators by analyzing source structure, query distribution and maintenance cost

We present an approach to selecting data to materialize in Web based information mediators by analyzing multiple factors. An issue in building Web based information mediators is how to improve the query response time given the high response time for retrieving data from remote Web sources. We had earlier presented a framework for optimizing the performance of information mediators by selectively materializing data. In this paper we describe our approach for automatically selecting the portion of data that must be materialized by analyzing a combination of several factors, namely the distribution of user queries, the structure of sources and the update cost.

[1]  Michael R. Genesereth,et al.  Infomaster: an information integration system , 1997, SIGMOD '97.

[2]  Craig A. Knoblock,et al.  Modeling Web Sources for Information Integration , 1998, AAAI/IAAI.

[3]  K. Selçuk Candan,et al.  Query caching and optimization in distributed mediator systems , 1996, SIGMOD '96.

[4]  Jennifer Widom,et al.  Information translation, mediation, and mosaic-based browsing in the TSIMMIS system , 1995, SIGMOD '95.

[5]  Craig A. Knoblock,et al.  Intelligent caching: selecting, representing, and reusing data in an information server , 1994, CIKM '94.

[6]  Vipul Kashyap,et al.  InfoSleuth: Semantic Integration of Information in Open and Dynamic Environments (Experience Paper) , 1997, SIGMOD Conference.

[7]  Oren Etzioni,et al.  A softbot-based interface to the Internet , 1994, CACM.

[8]  Robert M. MacGregor,et al.  A Deductive Pattern Matcher , 1988, AAAI.

[9]  Zachary G. Ives,et al.  An adaptive query execution engine for data integration , 1999 .

[10]  Craig A. Knoblock,et al.  Selectively materializing data in mediators by analyzing user queries , 1999, Proceedings Fourth IFCIS International Conference on Cooperative Information Systems. CoopIS 99 (Cat. No.PR00384).

[11]  Peter Scheuermann,et al.  WATCHMAN : A Data Warehouse Intelligent Cache Manager , 1996, VLDB.

[12]  Alon Y. Halevy,et al.  An adaptive query execution system for data integration , 1999, SIGMOD '99.

[13]  Craig A. Knoblock,et al.  Intelligent Caching for Information Mediators: A KR Based Approach , 1998, KRDB.