Optimizing the Computation of Approximate Certain Query Answers over Incomplete Databases

In many database applications there is the need of extracting information from incomplete data. In such scenarios, certain answers are the most widely adopted semantics of query answering. Unfortunately, the computation of certain query answers is a coNP-hard problem. To make query answering feasible in practice, recent research has focused on developing polynomial time algorithms computing sound (but possibly incomplete) sets of certain answers. In this paper, we propose a novel technique that allows us to improve recently proposed approximation algorithms, obtaining a good balance between running time and quality of the results. We report experimental results confirming the effectiveness of the new technique.

[1]  Witold Lipski On relational algebra with marked nulls preliminary version , 1984, PODS '84.

[2]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[3]  Sergio Greco,et al.  Incomplete Data and Data Dependencies in Relational Databases , 2012, Incomplete Data and Data Dependencies in Relational Databases.

[4]  Magdalena Ortiz,et al.  Ontology-Mediated Query Answering with Data-Tractable Description Logics , 2015, Reasoning Web.

[5]  Andrea Calì,et al.  A general Datalog-based framework for tractable query answering over ontologies , 2012, J. Web Semant..

[6]  Maurizio Lenzerini,et al.  On reconciling data exchange, data integration, and peer data management , 2007, PODS '07.

[7]  Sergio Greco,et al.  Computing Approximate Certain Answers over Incomplete Databases , 2017, AMW.

[8]  Witold Lipski On Relational Algebra with Marked Nulls. , 1984, PODS 1984.

[9]  Sergio Greco,et al.  A three-valued semantics for querying and repairing inconsistent databases , 2007, Annals of Mathematics and Artificial Intelligence.

[10]  Leopoldo E. Bertossi,et al.  Database Repairing and Consistent Query Answering , 2011, Database Repairing and Consistent Query Answering.

[11]  Leonid Libkin Certain answers as objects and knowledge , 2016, Artif. Intell..

[12]  Leonid Libkin,et al.  Making SQL Queries Correct on Incomplete Databases: A Feasibility Study , 2016, PODS.

[13]  Sergio Greco,et al.  ACID: A System for Computing Approximate Certain Query Answers over Incomplete Databases , 2018, SIGMOD Conference.

[14]  Tomasz Imielinski,et al.  Incomplete Information in Relational Databases , 1984, JACM.

[15]  Sergio Greco,et al.  Approximation algorithms for querying incomplete databases , 2019, Inf. Syst..

[16]  Leonid Libkin,et al.  How to Define Certain Answers , 2015, IJCAI.

[17]  Leonid Libkin,et al.  Approximations and Refinements of Certain Answers via Many-Valued Logics , 2016, KR.

[18]  Jef Wijsen,et al.  The Data Complexity of Consistent Query Answering for Self-Join-Free Conjunctive Queries Under Primary Key Constraints , 2015, ACM Trans. Database Syst..

[19]  Jan Chomicki,et al.  Consistent query answers in inconsistent databases , 1999, PODS '99.