Simplifying schema mappings

A schema mapping is a formal specification of the relationship holding between the databases conforming to two given schemas, called source and target, respectively. While in the general case a schema mapping is specified in terms of assertions relating two queries in some given language, various simplified forms of mappings, in particular LAV and GAV, have been considered, based on desirable properties that these forms enjoy. Recent works propose methods for transforming schema mappings to logically equivalent ones of a simplified form. In many cases, this transformation is impossible, and one might be interested in finding simplifications based on a weaker notion, namely logical implication, rather than equivalence. More precisely, given a schema mapping M, find a simplified (LAV, or GAV) schema mapping M' such that M' logically implies M. In this paper we formally introduce this problem, and study it in a variety of cases, providing techniques and complexity bounds. The various cases we consider depend on three parameters: the simplified form to achieve (LAV, or GAV), the type of schema mapping considered (sound, or exact), and the query language used in the schema mapping specification (conjunctive queries and variants over relational databases, or regular path queries and variants over graph databases). Notably, this is the first work on comparing schema mappings for graph databases.

[1]  Robert L. Berger The undecidability of the domino problem , 1966 .

[2]  Diego Calvanese,et al.  Rewriting of regular expressions and regular path queries , 1999, PODS '99.

[3]  Georg Gottlob,et al.  Normalization and Optimization of Schema Mappings , 2009, Proc. VLDB Endow..

[4]  Divesh Srivastava,et al.  Answering Queries Using Views. , 1999, PODS 1995.

[5]  Peter Buneman,et al.  Semistructured data , 1997, PODS.

[6]  Diego Calvanese,et al.  Containment of Conjunctive Regular Path Queries with Inverse , 2000, KR.

[7]  Georg Gottlob,et al.  Distributed XML Design , 2011, J. Comput. Syst. Sci..

[8]  Nicolás Marín,et al.  Review of Data on the Web: from relational to semistructured data and XML by Serge Abiteboul, Peter Buneman, and Dan Suciu. Morgan Kaufmann 1999. , 2003, SGMD.

[9]  Filippo Furfaro Querying semi-structured data with graph grammars , 2002, Proceedings. International Conference on Information Technology: Coding and Computing.

[10]  Ronald Fagin,et al.  Towards a theory of schema-mapping optimization , 2008, PODS.

[11]  Divesh Srivastava,et al.  Answering Queries Using Views. , 1999, PODS 1995.

[12]  Ronald Fagin,et al.  Locally consistent transformations and query answering in data exchange , 2004, PODS '04.

[13]  Serge Abiteboul,et al.  Complexity of answering queries using materialized views , 1998, PODS.

[14]  Marcelo Arenas,et al.  XML data exchange: consistency and query answering , 2005, PODS '05.

[15]  Ronald Fagin,et al.  Composing schema mappings: second-order dependencies to the rescue , 2004, PODS '04.

[16]  Alberto O. Mendelzon,et al.  Tableau Techniques for Querying Information Sources through Global Schemas , 1999, ICDT.

[17]  Diego Calvanese,et al.  Query processing using views for regular path queries with inverse , 2000, PODS 2000.

[18]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[19]  Mihalis Yannakakis,et al.  Equivalences Among Relational Expressions with the Union and Difference Operators , 1980, J. ACM.

[20]  Alberto O. Mendelzon,et al.  A graphical query language supporting recursion , 1987, SIGMOD '87.

[21]  Pablo Barceló Logical foundations of relational data exchange , 2009, SGMD.

[22]  Diego Calvanese,et al.  View Synthesis from Schema Mappings , 2010, ArXiv.

[23]  Marcelo Arenas,et al.  Foundations of schema mapping management , 2010, PODS '10.

[24]  Phokion G. Kolaitis Schema mappings, data exchange, and metadata management , 2005, PODS '05.

[25]  Sergio Tessaris,et al.  The logic of RDF and SPARQL: a tutorial , 2006, PODS '06.

[26]  Philip A. Bernstein,et al.  Model Management and Schema Mappings: Theory and Practice , 2007, VLDB.

[27]  Ronald Fagin,et al.  Composition with target constraints , 2010, ICDT '10.

[28]  Thomas Schwentick,et al.  Schema design for XML repositories: complexity and tractability , 2010, PODS '10.

[29]  Ondřej Klobušník,et al.  ArXiv.org e-print archive , 2004 .

[30]  Georg Gottlob,et al.  Normalization and optimization of schema mappings , 2009, The VLDB Journal.

[31]  Anthony Widjaja Lin,et al.  Expressive Languages for Path Queries over Graph-Structured Data , 2012, TODS.

[32]  Phokion G. Kolaitis,et al.  Structural characterizations of schema-mapping languages , 2009, ICDT '09.

[33]  Jayant Madhavan,et al.  Composing Mappings Among Data Sources , 2003, VLDB.

[34]  Jean-Éric Pin,et al.  Syntactic Semigroups , 1997, Handbook of Formal Languages.

[35]  Cristina Sirangelo,et al.  Data exchange and schema mappings in open and closed worlds , 2011, J. Comput. Syst. Sci..

[36]  Renée J. Miller,et al.  Composing local-as-view mappings: closure and applications , 2010, ICDT '10.

[37]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[38]  Ralf Küsters,et al.  Unification in a Description Logic with Transitive Closure of Roles , 2001, LPAR.

[39]  Diego Calvanese,et al.  Reasoning on regular path queries , 2003, SGMD.

[40]  Ronald Fagin,et al.  Quasi-inverses of schema mappings , 2007, PODS '07.

[41]  Dan Suciu,et al.  Data on the Web: From Relations to Semistructured Data and XML , 1999 .

[42]  Jeffrey D. Ullman,et al.  Information integration using logical views , 1997, Theor. Comput. Sci..

[43]  Ronald Fagin,et al.  Data exchange: semantics and query answering , 2003, Theor. Comput. Sci..

[44]  Diego Calvanese,et al.  View-based query processing for regular path queries with inverse , 2000, PODS '00.

[45]  Joann J. Ordille,et al.  Data integration: the teenage years , 2006, VLDB.

[46]  Ashok K. Chandra,et al.  Optimal implementation of conjunctive queries in relational data bases , 1977, STOC '77.

[47]  Marcelo Arenas,et al.  The recovery of a schema mapping: bringing exchanged data back , 2008, TODS.

[48]  Phokion G. Kolaitis,et al.  Peer data exchange , 2005, PODS '05.

[49]  Vasilis Vassalos,et al.  Answering Queries Using Views , 2009, Encyclopedia of Database Systems.

[50]  Ronald Fagin,et al.  Reverse data exchange: Coping with nulls , 2009, TODS.