Using Horizontal-Vertical Decompositions to Improve Query Evaluation

We investigate how relational restructuring may be used to improve query performance. Our approach parallels recent research extending semantic query optimization (SQO), which uses knowledge about the instance to achieve more e cient query processing. Our approach di ers, however, in that the instance does not govern whether the optimization may be applied; rather, the instance governs whether the optimization yields more e cient query processing. It also di ers in that it involves an explicit decomposition of the relation instance. We use approximate functional dependencies as the conceptual basis for this decomposition and develop query rewriting techniques to exploit it. We present experimental results using both synthetic and real-world data. These results lead to a characterization of a well-de ned class of queries for which improved processing time is observed.

[1]  K. K. Nambiar,et al.  Some Analytic Tools for the Design of Relational Database Systems , 1980, VLDB.

[2]  Clement T. Yu,et al.  Semantic Query Optimization for Tree and Chain Queries , 1994, IEEE Trans. Knowl. Data Eng..

[3]  Z. Meral Özsoyoglu,et al.  Design and Implementation of a Semantic Query Optimizer , 1989, IEEE Trans. Knowl. Data Eng..

[4]  Matthias Jarke,et al.  An optimizing prolog front-end to a relational query system , 1984, SIGMOD '84.

[5]  Clement T. Yu,et al.  Automatic Knowledge Acquisition and Maintenance for Semantic Query Optimization , 1989, IEEE Trans. Knowl. Data Eng..

[6]  Hamid Pirahesh,et al.  Extensible/rule based query rewrite optimization in Starburst , 1992, SIGMOD '92.

[7]  John Grant,et al.  Logic-based approach to semantic query optimization , 1990, TODS.

[8]  David Toman,et al.  Logics for Databases and Information Systems , 1998 .

[9]  Heikki Mannila,et al.  Dependency Inference , 1987, VLDB.

[10]  Craig A. Knoblock,et al.  Using Inductive Learning To Generate Rules for Semantic Query Optimization , 1996, Advances in Knowledge Discovery and Data Mining.

[11]  Tony T. Lee,et al.  An Infornation-Theoretic Analysis of Relational Databases—Part I: Data Dependencies and Information Metric , 1987, IEEE Transactions on Software Engineering.

[12]  Calisto Zuzarte,et al.  Exploiting constraint-like data characterizations in query optimization , 2001, SIGMOD '01.

[13]  Hamid Pirahesh,et al.  A rule engine for query transformation in Starburst and IBM DB2 C/S DBMS , 1997, Proceedings 13th International Conference on Data Engineering.

[14]  Michael Pittarelli,et al.  The Theory of Probabilistic Databases , 1987, VLDB.

[15]  Jean-Marc Petit,et al.  Efficient Discovery of Functional Dependencies and Armstrong Relations , 2000, EDBT.

[16]  Abraham Silberschatz,et al.  Database System Concepts , 1980 .

[17]  Hannu Toivonen,et al.  Efficient discovery of functional and approximate dependencies using partitions , 1998, Proceedings 14th International Conference on Data Engineering.

[18]  Heikki Mannila,et al.  Approximate Dependency Inference from Relations , 1992, ICDT.

[19]  Francesco M. Malvestuto Theory of random observables in relational data bases , 1983, Inf. Syst..

[20]  Shashi Shekhar,et al.  Learning Transformation Rules for Semantic Query Optimization: A Data-Driven Approach , 1993, IEEE Trans. Knowl. Data Eng..

[21]  Mehmet M. Dalkilic,et al.  Information dependencies , 2000, PODS '00.

[22]  Frank Wm. Tompa,et al.  Exploiting functional dependence in query optimization , 2000 .

[23]  Siegfried Bell Deciding Distinctness of Query Results by Discovered Constraints , 2003 .

[24]  Heikki Mannila,et al.  Discovering functional and inclusion dependencies in relational databases , 1992, Int. J. Intell. Syst..

[25]  Michael Stonebraker,et al.  Implementation of integrity constraints and views by query modification , 1975, SIGMOD '75.

[26]  Stanley B. Zdonik,et al.  Knowledge-Based Query Processing , 1980, VLDB.

[27]  Per-Åke Larson,et al.  Exploiting uniqueness in query optimization , 1994, Proceedings of 1994 IEEE 10th International Conference on Data Engineering.

[28]  Jonathan J. King QUIST: A System for Semantic Query Optimization in Relational Databases , 1981, VLDB.

[29]  Z. Meral Özsoyoglu,et al.  A system for semantic query optimization , 1987, SIGMOD '87.

[30]  Rosine Cicchetti,et al.  FUN: An Efficient Algorithm for Mining Functional and Embedded Dependencies , 2001, ICDT.

[31]  Yehoshua Sagiv Quadratic Algorithms for Minimizing Joins in Restricted Relational Expressions , 1983, SIAM J. Comput..