Query caching and optimization in distributed mediator systems

Query processing and optimization in mediator systems that access distributed non-proprietary sources pose many novel problems. Cost-based query optimization is hard because the mediator has no access to the sources' statistics, and the performance of a source may not be easy to model. At the same time, querying remote sources may be very expensive because of high connection overhead, long computation time, financial charges, and temporary unavailability. We propose a cost-based optimization technique that caches statistics of actual calls to the sources and then estimates the cost of each possible execution plan from this statistics cache. We investigate issues pertaining to the design of the statistics cache and experimentally analyze various tradeoffs. We also present a query result caching mechanism that lets us use the results of prior queries effectively when a source is not readily available. We employ a novel invariants mechanism, which shows how semantic information about data sources may be used to discover cached query results of interest.
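As a rough illustration of the statistics-cache idea, the sketch below records the observed cost of each source call and estimates a plan's cost as the sum of the average observed costs of its calls. The names (`StatisticsCache`, `record`, `estimate`, `plan_cost`, `cheapest_plan`), the `(source, function)` key, the use of a plain mean, and the fallback default cost are all assumptions made for illustration; they are not the paper's actual design.

```python
from collections import defaultdict
from statistics import mean


class StatisticsCache:
    """Hypothetical cache of observed call costs, keyed by (source, function)."""

    def __init__(self, default_cost=1.0):
        # Assumed fallback used when a call has never been observed.
        self.default_cost = default_cost
        self._observations = defaultdict(list)

    def record(self, source, function, cost):
        """Store the measured cost (e.g. elapsed seconds) of one actual call."""
        self._observations[(source, function)].append(cost)

    def estimate(self, source, function):
        """Estimate the cost of a call as the mean of its past observations."""
        samples = self._observations.get((source, function))
        return mean(samples) if samples else self.default_cost


def plan_cost(cache, plan):
    """Estimated cost of a plan, modeled here as a list of (source, function) calls."""
    return sum(cache.estimate(source, function) for source, function in plan)


def cheapest_plan(cache, plans):
    """Pick the candidate plan with the lowest estimated cost."""
    return min(plans, key=lambda plan: plan_cost(cache, plan))


if __name__ == "__main__":
    cache = StatisticsCache(default_cost=5.0)
    cache.record("A", "lookup", 2.0)
    cache.record("A", "lookup", 4.0)
    candidates = [
        [("A", "lookup"), ("B", "scan")],    # one call with no statistics yet
        [("A", "lookup"), ("A", "lookup")],  # only calls with cached statistics
    ]
    print(cheapest_plan(cache, candidates))
```

A real mediator would likely need a richer cost model (for example, costs that depend on call arguments or result sizes); the additive model here is only the simplest choice that makes the caching idea concrete.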
