SchemaSQL: An extension to SQL for multidatabase interoperability

We provide a principled extension of SQL, called SchemaSQL, that offers the capability of uniform manipulation of data and schema in relational multidatabase systems. We develop a precise syntax and semantics of SchemaSQL in a manner that extends traditional SQL syntax and semantics, and demonstrate the following. (1) SchemaSQL retains the flavor of SQL while supporting querying of both data and schema. (2) It can be used to transform data in a database in a structure substantially different from original database, in which data and schema may be interchanged. (3) It also permits the creation of views whose schema is dynamically dependent on the contents of the input instance. (4) While aggregation in SQL is restricted to values occurring in one column at a time, SchemaSQL permits "horizontal" aggregation and even aggregation over more general "blocks" of information. (5) SchemaSQL provides a useful facility for interoperability and data/schema manipulation in relational multidatabase systems. We provide many examples to illustrate our claims. We clearly spell out the formal semantics of SchemaSQL that accounts for all these features. We describe an architecture for the implementation of SchemaSQL and develop implementation algorithms based on available database technology that allows for powerful integration of SQL based relational DBMS. We also discuss the applicability of SchemaSQL for handling semantic heterogeneity arising in a multidatabase system.

[1]  Renée J. Miller Using schematically heterogeneous structures , 1998, SIGMOD '98.

[2]  Jeffrey Scott Vitter,et al.  Scalable mining for classification rules in relational databases , 1998 .

[3]  Laks V. S. Lakshmanan,et al.  On Efficiently Implementing SchemaSQL on an SQL Database System , 1999, VLDB.

[4]  Won Kim,et al.  Schema Architecture of the UniSQL/M Multidatabase System , 1995, Modern Database Systems.

[5]  Diego Calvanese,et al.  Information integration: conceptual modeling and reasoning support , 1998, Proceedings. 3rd IFCIS International Conference on Cooperative Information Systems (Cat. No.98EX122).

[6]  Jan Chomicki,et al.  Declarative Definition of Object-Oriented Multidatabase Mappings , 1992, IWDOM.

[7]  Arnon Rosenthal,et al.  Using semantic values to facilitate interoperability among heterogeneous information systems , 1994, TODS.

[8]  Michael J. Carey,et al.  XPERANTO: Middleware for Publishing Object-Relational Data as XML Documents , 2000, VLDB.

[9]  Nick Roussopoulos,et al.  Interoperability of multiple autonomous databases , 1990, CSUR.

[10]  Sunita Sarawagi,et al.  Integrating Mining with Relational Database Systems: Alternatives and Implications. , 1998, SIGMOD 1998.

[11]  Vishu Krishnamurthy,et al.  Performance Challenges in Object-Relational DBMSs , 1999, IEEE Data Eng. Bull..

[12]  A. Zeroual,et al.  MSQL: A Multidatabase Language , 1989, Inf. Sci..

[13]  Silvana Castano,et al.  Semantic dictionary design for database interoperability , 1997, Proceedings 13th International Conference on Data Engineering.

[14]  Weimin Du,et al.  The Pegasus heterogeneous multidatabase system , 1991, Computer.

[15]  Laks V. S. Lakshmanan,et al.  Tables as a paradigm for querying and restructuring (extended abstract) , 1996, PODS '96.

[16]  Laks V. S. Lakshmanan,et al.  nD-SQL: A Multi-Dimensional Language for Interoperability and OLAP , 1998, VLDB.

[17]  Rakesh Agrawal,et al.  Storage and Querying of E-Commerce Data , 2001, VLDB.

[18]  Ravi Krishnamurthy,et al.  RBE: Rendering by example , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[19]  Hamid Pirahesh,et al.  Efficiently publishing relational data as XML documents , 2001, The VLDB Journal.

[20]  Kenneth A. Ross Relations with relation names as arguments: algebra and calculus , 1992, PODS '92.

[21]  Jeffrey F. Naughton,et al.  Query size estimation by adaptive sampling (extended abstract) , 1990, PODS.

[22]  Peter Bernus,et al.  Query Transformation for Accessing Heterogeneous Databases , 1992, Workshop on Deductive Databases, JICSLP.

[23]  Sunita Sarawagi,et al.  Integrating association rule mining with relational database systems: alternatives and implications , 1998, SIGMOD '98.

[24]  Laks V. S. Lakshmanan,et al.  SchemaSQL - A Language for Interoperability in Relational Multi-Database Systems , 1996, VLDB.

[25]  Jeffrey F. Naughton,et al.  Practical selectivity estimation through adaptive sampling , 1990, SIGMOD '90.

[26]  Jeffrey F. Naughton,et al.  On the Computation of Multidimensional Aggregates , 1996, VLDB.

[27]  Renée J. Miller,et al.  The Use of Information Capacity in Schema Integration and Translation , 1993, VLDB.

[28]  Silvana Castano,et al.  Semantic integration of semistructured and structured data sources , 1999, SGMD.

[29]  Volume 22 , 1998 .

[30]  Amit P. Sheth,et al.  Management of heterogeneous and autonomous database systems , 1998 .

[31]  Arie Shoshani,et al.  On the Semantic Completeness of Macro-Data Operators for Statistical Aggregation , 1992, SSDBM.

[32]  Fereidoon Sadri,et al.  Optimization of schemaSQL queries , 2001, Proceedings 2001 International Database Engineering and Applications Symposium.

[33]  Paolo Missier,et al.  Extending a Multidatabase Manipulation Language to Resolve Schema and Data Conflicts , 1995, DS-6.

[34]  Frédéric Gingras nD-SQL : extending schemaSQL towards multidimensional databases and OLAP , 1998 .

[35]  David J. DeWitt,et al.  Relational Databases for Querying XML Documents: Limitations and Opportunities , 1999, VLDB.

[36]  Jeffrey Scott Vitter,et al.  Scalable mining for classification rules in relational databases , 1998, Proceedings. IDEAS'98. International Database Engineering and Applications Symposium (Cat. No.98EX156).

[37]  Gultekin Özsoyoglu,et al.  A language and a physical organization technique for summary tables , 1985, SIGMOD Conference.

[38]  Anthony C. Klug Equivalence of Relational Algebra and Relational Calculus Query Languages Having Aggregate Functions , 1982, JACM.

[39]  Michael Kifer,et al.  Logical foundations of object-oriented and frame-based languages , 1995, JACM.

[40]  Kenneth A. Ross,et al.  Noodle: A Language for Declarative Querying in an Object-Oriented Database , 1993, DOOD.

[41]  Jeffrey F. Naughton,et al.  Query Size Estimation by Adaptive Sampling , 1995, J. Comput. Syst. Sci..

[42]  Alberto Giretti,et al.  A Case-Based Approach to Information Integration , 2000, VLDB.

[43]  Shamim A. Naqvi,et al.  Towards a Real Horn Clause Language , 1988, VLDB.

[44]  Fereidoon Sadri,et al.  Implementation of SchemaSQL - A Language for Relational Multi-Database Systems , 1997 .

[45]  Arie Shoshani,et al.  OLAP and statistical databases: similarities and differences , 1997, PODS '97.

[46]  LAKS V.S. LAKSHMANAN,et al.  Logic and Algebraic Languages for Interoperability in Multidatabase Systems , 1997, J. Log. Program..

[47]  Amit P. Sheth,et al.  Semantic Issues in Multidatabase Systems - Preface by the Special Issue Editor , 1991, SIGMOD Rec..

[48]  Laks V. S. Lakshmanan,et al.  On the Logical Foundations of Schema Integration and Evolution in Heterogeneous Database Systems , 1993, DOOD.

[49]  Daniela Florescu,et al.  Storing and Querying XML Data using an RDMBS , 1999, IEEE Data Eng. Bull..

[50]  Laks V. S. Lakshmanan,et al.  Languages for multi-database interoperability , 1997, SIGMOD '97.

[51]  Ravi Krishnamurthy,et al.  Language features for interoperability of databases with schematic discrepancies , 1991, SIGMOD '91.

[52]  Renée J. Miller,et al.  DataWeb: Customizable Database Publishing for the Web , 1997, IEEE Multim..

[53]  David Beech Collections of Objects in SQL3 , 1993, VLDB.

[54]  Gultekin Özsoyoglu,et al.  Query processing techniques in the summary-table-by-example database query language , 1989, TODS.

[55]  Venkataraman Ramesh,et al.  Management of Heterogeneous and Autonomous Database Systems , 1999 .

[56]  Dennis McLeod,et al.  An Approach to Resolving Semantic Heterogenity in a Federation of Autonomous, Heterogeneous Database Systems , 1993, Int. J. Cooperative Inf. Syst..

[57]  Michael Kifer,et al.  Querying object-oriented databases , 1992, SIGMOD '92.