Certain answers for XML queries

The notion of certain answers arises when one queries incompletely specified databases, e.g., in data integration and exchange scenarios, or databases with missing information. While in the relational case this notion is well understood, there is no natural analog of it for XML queries that return documents. We develop an approach to defining certain answers for such XML queries, and apply it in the settings of incomplete information and XML data exchange. We first revisit the relational case, and show how to present the key concepts related to certain answers in a new model-theoretic language. This new approach naturally extends to XML. We prove a number of generic, application-independent results about the computability and complexity of the certain answers it produces. We then turn our attention to a pattern-based XML query language with trees as outputs, and present a technique for computing certain answers that relies on the notion of a basis of a set of trees. We show how to compute such bases for documents with nulls and for documents arising in data exchange scenarios, and provide complexity bounds. While the complexity of query answering in XML data exchange can be high in general, we exhibit a natural class of XML schema mappings for which not only query answering but also many static analysis problems can be solved efficiently.
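
For readers unfamiliar with the relational notion that the abstract takes as its starting point, the following is a minimal sketch, not the paper's construction. It assumes the standard textbook definition: the certain answers to a query over an incomplete table are the tuples returned in every possible world. The function names `possible_worlds` and `certain_answers`, the use of `None` for a null, and the finite `domain` are all assumptions made for illustration; in the usual theory nulls range over an infinite domain.

```python
from itertools import product

def possible_worlds(table, domain):
    """Yield every completion of `table`: each null (None) is replaced,
    independently of the others, by some value from the finite `domain`."""
    null_positions = [(i, j)
                      for i, row in enumerate(table)
                      for j, value in enumerate(row)
                      if value is None]
    for values in product(domain, repeat=len(null_positions)):
        rows = [list(row) for row in table]
        for (i, j), value in zip(null_positions, values):
            rows[i][j] = value
        yield {tuple(row) for row in rows}

def certain_answers(query, table, domain):
    """Intersect the query results over all possible worlds."""
    answers = None
    for world in possible_worlds(table, domain):
        result = query(world)
        answers = result if answers is None else answers & result
    return answers if answers is not None else set()

# Hypothetical data: bob's department is unknown.
employees = [("alice", "db"), ("bob", None)]
departments = ["db", "ai"]

def works_in_db(world):
    return {name for (name, dept) in world if dept == "db"}

def all_names(world):
    return {name for (name, _dept) in world}

in_db = certain_answers(works_in_db, employees, departments)
names = certain_answers(all_names, employees, departments)
```

Here `in_db` contains only alice, because bob is in the `db` department in just one of the two worlds, while `names` contains both employees. The intersection works because the answers are sets of tuples; for queries that return XML documents there is no such obvious operation, which is the gap the abstract describes.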
