Local transformations and conjunctive-query equivalence

Over the past several decades, the study of conjunctive queries has occupied a central place in the theory and practice of database systems. In recent years, conjunctive queries have played a prominent role in the design and use of schema mappings for data integration and data exchange tasks. In this paper, we investigate several different aspects of conjunctive-query equivalence in the context of schema mappings and data exchange. In the first part of the paper, we introduce and study a notion of a local transformation between database instances that is based on conjunctive-query equivalence. We show that the chase procedure for GLAV mappings (that is, schema mappings specified by source-to-target tuple-generating dependencies) is a local transformation with respect to conjunctive-query equivalence. This means that the chase procedure preserves bounded conjunctive-query equivalence, that is, if two source instances are indistinguishable using conjunctive queries of a sufficiently large size, then the target instances obtained by chasing these two source instances are also indistinguishable using conjunctive queries of a given size. Moreover, we obtain polynomial bounds on the level of indistinguishability between source instances needed to guarantee indistinguishability between the target instances produced by the chase. The locality of the chase extends to schema mappings specified by a second-order tuple-generating dependency (SO tgd), but does not hold for schema mappings whose specification includes target constraints. In the second part of the paper, we take a closer look at the composition of two GLAV mappings. In particular, we break GLAV mappings into a small number of well-studied classes (including LAV and GAV), and complete the picture as to when the composition of schema mappings from these various classes can be guaranteed to be a GLAV mapping, and when they can be guaranteed to be conjunctive-query equivalent to a GLAV mapping. We also show that the following problem is decidable: given a schema mapping specified by an SO tgd and a GLAV mapping, are they conjunctive-query equivalent? In contrast, the following problem is known to be undecidable: given a schema mapping specified by an SO tgd and a GLAV mapping, are they logically equivalent?

[1]  Emanuel Sallinger,et al.  On the Undecidability of the Equivalence of Second-Order Tuple Generating Dependencies , 2015, AMW.

[2]  Ronald Fagin,et al.  Schema Mapping Evolution Through Composition and Inversion , 2011, Schema Matching and Mapping.

[3]  G KolaitisPhokion,et al.  Composing schema mappings , 2005 .

[4]  Ronald Fagin,et al.  Data exchange: semantics and query answering , 2003, Theor. Comput. Sci..

[5]  Jaroslav Nesetril,et al.  The core of a graph , 1992, Discret. Math..

[6]  David Maier,et al.  Testing implications of data dependencies , 1979, SIGMOD '79.

[7]  Marcelo Arenas,et al.  Inverting Schema Mappings: Bridging the Gap between Theory and Practice , 2009, Proc. VLDB Endow..

[8]  Ronald Fagin,et al.  Locally consistent transformations and query answering in data exchange , 2004, PODS '04.

[9]  Emanuel Sallinger,et al.  Relaxed Notions of Schema Mapping Equivalence Revisited , 2011, ICDT '11.

[10]  Jayant Madhavan,et al.  Composing Mappings Among Data Sources , 2003, VLDB.

[11]  Benjamin Rossman,et al.  Homomorphism preservation theorems , 2008, JACM.

[12]  Marcelo Arenas,et al.  The recovery of a schema mapping: bringing exchanged data back , 2008, TODS.

[13]  Ashok K. Chandra,et al.  Optimal implementation of conjunctive queries in relational data bases , 1977, STOC '77.

[14]  Ronald Fagin,et al.  Composing schema mappings: second-order dependencies to the rescue , 2004, PODS '04.

[15]  Ronald Fagin,et al.  Composition with target constraints , 2010, ICDT '10.

[16]  Renée J. Miller,et al.  Composing local-as-view mappings: closure and applications , 2010, ICDT '10.

[17]  Marcelo Arenas,et al.  Relational and XML Data Exchange , 2010, Relational and XML Data Exchange.

[18]  Ronald Fagin,et al.  Towards a theory of schema-mapping optimization , 2008, PODS.

[19]  Alfred V. Aho,et al.  The theory of joins in relational data bases , 1977, 18th Annual Symposium on Foundations of Computer Science (sfcs 1977).

[20]  J. Nesetril,et al.  From Sparse Graphs to Nowhere Dense Structures: Decompositions, Independence, Dualities and Limits , 2010 .

[21]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[22]  Ronald Fagin,et al.  On Monadic NP vs. Monadic co-NP , 1995, Inf. Comput..

[23]  Ronald Fagin Inverting schema mappings , 2007 .

[24]  Phokion G. Kolaitis Schema mappings, data exchange, and metadata management , 2005, PODS.