Optimisation Strategies for Functional Queries in a Distributed Environment

In this chapter we describe the architecture of two distributed systems based on the Functional Data Model (FDM): the P/FDM database management system and the P/FDM mediator. The two systems are closely related and share several source modules. In describing them we focus on their modular design and on the different kinds of optimisation that can be performed when processing queries and constraints. Both systems use an internal intermediate code for queries and constraints that is based on ZF-expressions. We describe how different query processing strategies can be adopted to improve performance with different kinds of storage module. In describing the architecture of the P/FDM mediator we explain how modules from the Daplex compiler have been reused in the mediator's own architecture, and we describe the different kinds of optimisation that can be performed on queries that will be executed in a federated system.
