On the finite controllability of conjunctive query answering in databases under open-world assumption

In this paper we study queries over relational databases with integrity constraints (ICs). The main problem we analyze is OWA query answering, i.e., query answering over a database with ICs under open-world assumption. The kinds of ICs that we consider are inclusion dependencies and functional dependencies, in particular key dependencies; the query languages we consider are conjunctive queries and unions of conjunctive queries. We present results about the decidability of OWA query answering under ICs. In particular, we study OWA query answering both over finite databases and over unrestricted databases, and identify the cases in which such a problem is finitely controllable, i.e., when OWA query answering over finite databases coincides with OWA query answering over unrestricted databases. Moreover, we are able to easily turn the above results into new results about implication of ICs and query containment under ICs, due to the deep relationship between OWA query answering and these two classical problems in database theory. In particular, we close two long-standing open problems in query containment, since we prove finite controllability of containment of conjunctive queries both under arbitrary inclusion dependencies and under key and foreign key dependencies. The results of our investigation are very relevant in many research areas which have recently dealt with databases under an incomplete information assumption: e.g., data integration, data exchange, view-based information access, ontology-based information systems, and peer data management systems.

[1]  Alin Deutsch,et al.  The chase revisited , 2008, PODS.

[2]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[3]  David S. Johnson,et al.  Testing containment of conjunctive queries under functional and inclusion dependencies , 1982, J. Comput. Syst. Sci..

[4]  Jie Zhao,et al.  Schema Mediation in Peer Data Management Systems , 2011, Int. J. Cooperative Inf. Syst..

[5]  Alon Y. Halevy,et al.  Recursive Query Plans for Data Integration , 2000, J. Log. Program..

[6]  Wenfei Fan,et al.  Integrity constraints for XML , 2000, PODS.

[7]  Maurizio Lenzerini,et al.  PODS '08 : proceedings of the twenty-seventh ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, Vancouver, BC, Canada, June 9-11, 2008 , 2008 .

[8]  Catriel Beeri,et al.  A Proof Procedure for Data Dependencies , 1984, JACM.

[9]  Ronald Fagin,et al.  Composing schema mappings: second-order dependencies to the rescue , 2004, PODS '04.

[10]  Ronald Fagin,et al.  Inclusion Dependencies and Their Interaction with Functional Dependencies , 1984, J. Comput. Syst. Sci..

[11]  John C. Mitchell The Implication Problem for Functional and Inclusion Dependencies , 1984, Inf. Control..

[12]  Boris Motik,et al.  Data Complexity of Reasoning in Very Expressive Description Logics , 2005, IJCAI.

[13]  Mark Levene,et al.  How to Prevent Interaction of Functional and Inclusion Dependencies , 1999, Inf. Process. Lett..

[14]  Moshe Y. Vardi The implication and finite implication problems for typed template dependencies , 1982, PODS '82.

[15]  Riccardo Rosati On the decidability and finite controllability of query processing in databases with incomplete information , 2006, PODS '06.

[16]  Alberto O. Mendelzon,et al.  Notions of dependency satisfaction , 1982, PODS '82.

[17]  Moshe Y. Vardi,et al.  Polynomial-time implication problems for unary inclusion dependencies , 1990, JACM.

[18]  Gunter Saake,et al.  Logics for databases and information systems , 1998 .

[19]  R. V. D. Meyden The complexity of querying indefinite information: defined relations, recursion and linear order , 1992 .

[20]  James Bailey,et al.  Decidability and undecidability results for the termination problem of active database rules , 1998, PODS '98.

[21]  Ronald Fagin,et al.  Data exchange: semantics and query answering , 2003, Theor. Comput. Sci..

[22]  Z. Meral Özsoyoglu,et al.  Implication and Referential Constraints: A New Formal Reasoning , 1997, IEEE Trans. Knowl. Data Eng..

[23]  Jan Chomicki,et al.  Consistent query answers in inconsistent databases , 1999, PODS '99.

[24]  Jianwen Su,et al.  Conjunctive Query Containment with Respect to Views and Constraints , 1996, Inf. Process. Lett..

[25]  Andrea Calì,et al.  On the decidability and complexity of query answering over inconsistent and incomplete databases , 2003, PODS.

[26]  Andrea Calì,et al.  Taming the Infinite Chase: Query Answering under Expressive Relational Constraints , 2008, Description Logics.

[27]  Moshe Y. Vardi On the integrity of databases with incomplete information , 1985, PODS.

[28]  Wenfei Fan,et al.  On XML integrity constraints in the presence of DTDs , 2001, JACM.

[29]  Riccardo Rosati,et al.  The Limits of Querying Ontologies , 2007, ICDT.

[30]  Ron van der Meyden,et al.  Logical Approaches to Incomplete Information: A Survey , 1998, Logics for Databases and Information Systems.

[31]  Serge Abiteboul,et al.  Complexity of answering queries using materialized views , 1998, PODS.

[32]  Catriel Beeri,et al.  Formal Systems for Tuple and Equality Generating Dependencies , 1984, SIAM J. Comput..

[33]  Moshe Y. Vardi,et al.  The Implication Problem for Functional and Inclusion Dependencies is Undecidable , 1985, SIAM J. Comput..

[34]  Ron van der Meyden The Complexity of Querying Indefinite Data about Linearly Ordered Domains , 1997, J. Comput. Syst. Sci..

[35]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[36]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[37]  James Bailey,et al.  Decidability of First-Order Logic Queries over Views , 1999, ICDT.

[38]  Anthony C. Klug On conjunctive queries containing inequalities , 1988, JACM.