Causes for query answers from databases: Datalog abduction, view-updates, and integrity constraints

Causality has been recently introduced in databases, to model, characterize, and possibly compute causes for query answers. Connections between QA-causality and consistency-based diagnosis and database repairs (wrt. integrity constraint violations) have already been established. In this work we establish precise connections between QA-causality and both abductive diagnosis and the view-update problem in databases, allowing us to obtain new algorithmic and complexity results for QA-causality. We also obtain new results on the complexity of view-conditioned causality, and investigate the notion of QA-causality in the presence of integrity constraints, obtaining complexity results from a connection with view-conditioned causality. The abduction connection under integrity constraints allows us to obtain algorithmic tools for QA-causality.

[1]  Neil Immerman A Characterization of the Complexity of Resilience and Responsibility for Conjunctive Queries , 2015 .

[2]  Georg Gottlob,et al.  Complexity and expressive power of logic programming , 1997, Proceedings of Computational Complexity. Twelfth Annual IEEE Conference.

[3]  David Poole,et al.  Logic programming, abduction and probability , 1993, New Generation Computing.

[4]  John Grant,et al.  Logic-based approach to semantic query optimization , 1990, TODS.

[5]  Joseph Y. Halpern,et al.  Causes and Explanations: A Structural-Model Approach. Part I: Causes , 2000, The British Journal for the Philosophy of Science.

[6]  Jianzhong Li,et al.  On the Complexity of View Update Analysis and Its Application to Annotation Propagation , 2012, IEEE Transactions on Knowledge and Data Engineering.

[7]  James Cheney,et al.  Provenance in databases , 2009, SIGMOD '07.

[8]  Pierre Marquis,et al.  A Knowledge Compilation Map , 2002, J. Artif. Intell. Res..

[9]  Letizia Tanca,et al.  Logic Programming and Databases , 1990, Surveys in Computer Science.

[10]  Bert Van Nuffelen,et al.  Coherent Integration of Databases by Abductive Logic Programming , 2004, J. Artif. Intell. Res..

[11]  Raymond Reiter,et al.  Towards a Logical Reconstruction of Relational Database Theory , 1982, On Conceptual Modelling.

[12]  Babak Salimi,et al.  Query-Answer Causality in Databases and Its Connections with Reverse Reasoning Tasks in Data and Knowledge Management , 2016 .

[13]  Leopoldo E. Bertossi,et al.  Query-Answer Causality in Databases: Abductive Diagnosis and View Updates , 2015, ACI@UAI.

[14]  Benny Kimelfeld,et al.  A dichotomy in the complexity of deletion propagation with functional dependencies , 2012, PODS '12.

[15]  Dan Suciu,et al.  A formal approach to finding explanations for database queries , 2014, SIGMOD Conference.

[16]  Grigoris Karvounarakis,et al.  Semiring-annotated data: queries and provenance? , 2012, SGMD.

[17]  Georg Gottlob,et al.  Abduction from Logic Programs: Semantics and Complexity , 1997, Theor. Comput. Sci..

[18]  Pietro Torasso,et al.  On the Relationship between Abduction and Deduction , 1991, J. Log. Comput..

[19]  Dan Suciu,et al.  The Complexity of Causality and Responsibility for Query Answers and non-Answers , 2010, Proc. VLDB Endow..

[20]  Dan Suciu,et al.  Probabilistic databases , 2011, SIGA.

[21]  Carlo Zaniolo,et al.  A formal approach to the definition and the design of conceptual schemata for databased systems , 1982, TODS.

[22]  Robert A. Kowalski,et al.  Integrity Checking in Deductive Databases , 1987, VLDB.

[23]  Joseph Y. Halpern Appropriate Causal Models and Stability of Causation , 2014, KR.

[24]  Georg Gottlob,et al.  Bounded treewidth as a key to tractability of knowledge representation and reasoning , 2006, Artif. Intell..

[25]  V. Vianu,et al.  Edinburgh Why and Where: A Characterization of Data Provenance , 2017 .

[26]  Jan Vondrák,et al.  Maximizing conjunctive views in deletion propagation , 2012, TODS.

[27]  Guy Van den Broeck,et al.  Quantifying Causal Effects on Query Answering in Databases , 2016, TaPP.

[28]  J. Pearl,et al.  Causal inference in statistics , 2016 .

[29]  Martin van Hees,et al.  Degrees of Causation , 2009 .

[30]  Joseph Y. Halpern,et al.  Responsibility and Blame: A Structural-Model Approach , 2003, IJCAI.

[31]  Renée J. Miller,et al.  Reexamining Some Holy Grails of Data Provenance , 2011, TaPP.

[32]  Tobias Gerstenberg,et al.  Finding fault: Causality and counterfactuals in group attributions , 2012, Cognition.

[33]  Georg Gottlob,et al.  Tractable database design and datalog abduction through bounded treewidth , 2010, Inf. Syst..

[34]  Sanjeev Khanna,et al.  Edinburgh Research Explorer On the Propagation of Deletions and Annotations through Views , 2013 .

[35]  Georg Gottlob,et al.  Hypothesis Classification, Abductive Diagnosis and Therapy , 1990, Expert Systems in Engineering.

[36]  Leopoldo E. Bertossi,et al.  Database Repairing and Consistent Query Answering , 2011, Database Repairing and Consistent Query Answering.

[37]  Pietro Torasso,et al.  A spectrum of logical definitions of model‐based diagnosis 1 , 1991, Comput. Intell..

[38]  Leopoldo E. Bertossi,et al.  Achieving Data Privacy through Secrecy Views and Null-Based Virtual Updates , 2011, IEEE Transactions on Knowledge and Data Engineering.

[39]  David Poole,et al.  Representing diagnosis knowledge , 1994, Annals of Mathematics and Artificial Intelligence.

[40]  Daniele Theseider Dupré,et al.  The role of abduction in database view updating , 1995, Journal of Intelligent Information Systems.

[41]  Leopoldo E. Bertossi,et al.  Complexity of Consistent Query Answering in Databases Under Cardinality-Based and Incremental Repair Semantics , 2006, ICDT.

[42]  Leopoldo E. Bertossi,et al.  Causes for Query Answers from Databases, Datalog Abduction and View-Updates: The Presence of Integrity Constraints , 2016, FLAIRS Conference.

[43]  Miroslaw Truszczynski,et al.  Answer set programming at a glance , 2011, Commun. ACM.

[44]  Jennifer Widom,et al.  Run-Time Translation of View Tuple Deletions Using Data Lineage , 2001 .

[45]  Neil Immerman,et al.  The Complexity of Resilience and Responsibility for Self-Join-Free Conjunctive Queries , 2015, Proc. VLDB Endow..

[46]  Paolo Mancarella,et al.  Abductive Logic Programming , 1992, LPNMR.

[47]  Phokion G. Kolaitis,et al.  Repair checking in inconsistent databases: algorithms and complexity , 2009, ICDT '09.

[48]  Joseph Y. Halpern Cause, Responsibility, and Blame: oA Structural-Model Approach , 2014, ArXiv.

[49]  Paolo Mancarella,et al.  Database Updates through Abduction , 1990, VLDB.

[50]  Dan Suciu,et al.  Causality in Databases , 2010, IEEE Data Eng. Bull..

[51]  Dan Suciu,et al.  Knowledge Compilation Meets Database Theory: Compiling Queries to Decision Diagrams , 2011, Theory of Computing Systems.

[52]  Suman Nath,et al.  Tracing data errors with view-conditioned causality , 2011, SIGMOD '11.

[53]  Raymond Reiter,et al.  What should a database know? , 1988, PODS '88.

[54]  John Wylie Lloyd,et al.  Foundations of Logic Programming , 1987, Symbolic Computation.

[55]  Oded Goldreich Computational Complexity , 2008 .

[56]  Joseph Y. Halpern A Modification of the Halpern-Pearl Definition of Causality , 2015, IJCAI.

[57]  Raymond Reiter,et al.  A Theory of Diagnosis from First Principles , 1986, Artif. Intell..

[58]  Peter Struss,et al.  Model-based Problem Solving , 2008, Handbook of Knowledge Representation.

[59]  Leopoldo E. Bertossi,et al.  Unifying Causality, Diagnosis, Repairs and View-Updates in Databases , 2014, ArXiv.

[60]  Georg Gottlob,et al.  The Complexity of Logic-Based Abduction , 1993, STACS.

[61]  Babak Salimi,et al.  From Causes for Database Queries to Repairs and Model-Based Diagnosis and Back , 2014, Theory of Computing Systems.

[62]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[63]  Joseph Y. Halpern,et al.  Actual causation and the art of modeling , 2011, ArXiv.

[64]  Antonis C. Kakas,et al.  Abduction in logic programming , 2002 .