Sampling Query Feedback Restricted Repairs of Functional Dependency Violations: Complexity and Algorithm

An inconsistent database is a database instance violating integrity constraints. A repair of an inconsistent database is a maximal consistent subset. Sampling from the repair space is an alternative approach meeting the needs of many applications. In this paper, we introduce a new class of repair, query feedback restricted repair, based on the feedback on user’s witness query. We first map out a complete picture of both data and combined complexities of repair existence problems under different cases to identify the intractable cases. Especially, we show that if the query is a projection or a union query, then the decision problem is NP-complete; Even worse, if the query is a conjunctive query, the decision problem becomes \(\Sigma_{2}^{\mathrm{P}}\)-complete. At last, we provide a random repair sampling algorithm when the witness query is a selection-join query, and it is still polynomial even under the combined complexity.

[1]  Leopoldo E. Bertossi,et al.  Complexity of Consistent Query Answering in Databases Under Cardinality-Based and Incremental Repair Semantics , 2006, ICDT.

[2]  Wenfei Fan,et al.  Conditional Functional Dependencies for Data Cleaning , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[3]  Jianzhong Li,et al.  On the Complexity of View Update Analysis and Its Application to Annotation Propagation , 2012, IEEE Transactions on Knowledge and Data Engineering.

[4]  Lukasz Golab,et al.  Sampling from repairs of conditional functional dependency violations , 2014, The VLDB Journal.

[5]  David P. Woodruff,et al.  Multi-Tuple Deletion Propagation: Approximations and Complexity , 2013, Proc. VLDB Endow..

[6]  Nicolas Spyratos,et al.  Update semantics of relational views , 1981, TODS.

[7]  Gottfried Vossen,et al.  On the computation of relational view complements , 2003, TODS.

[8]  Benjamin C. Pierce,et al.  Relational lenses: a language for updatable views , 2006, PODS '06.

[9]  Jef Wijsen,et al.  Condensed Representation of Database Repairs for Consistent Query Answering , 2003, ICDT.

[10]  Jennifer Widom,et al.  Run-Time Translation of View Tuple Deletions Using Data Lineage , 2001 .

[11]  Gabriel M. Kuper,et al.  Structural Properties of XPath Fragments , 2003, ICDT.

[12]  Rajeev Rastogi,et al.  A cost-based model and effective heuristic for repairing constraints by value modification , 2005, SIGMOD '05.

[13]  Moshe Y. Vardi The complexity of relational query languages (Extended Abstract) , 1982, STOC '82.

[14]  Jie Liu,et al.  Propagating functional dependencies with conditions , 2008, VLDB 2008.

[15]  Umeshwar Dayal,et al.  On the correct translation of update operations on relational views , 1982, TODS.

[16]  Anthony C. Klug,et al.  Determining View dependencies using tableaux , 1982, TODS.

[17]  Anthony C. Klug Calculating constraints on relational expression , 1980, TODS.

[18]  Phokion G. Kolaitis,et al.  Repair checking in inconsistent databases: algorithms and complexity , 2009, ICDT '09.

[19]  Jan Chomicki,et al.  Consistent query answers in inconsistent databases , 1999, PODS '99.

[20]  Jan Chomicki,et al.  Prioritized repairing and consistent query answering in relational databases , 2012, Annals of Mathematics and Artificial Intelligence.

[21]  Arthur M. Keller,et al.  Algorithms for translating view updates to database updates for views involving selections, projections, and joins , 1985, PODS.

[22]  Renée J. Miller,et al.  A unified model for data and constraint repair , 2011, 2011 IEEE 27th International Conference on Data Engineering.

[23]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[24]  Jan Chomicki,et al.  Minimal-change integrity maintenance using tuple deletions , 2002, Inf. Comput..

[25]  Helmut Seidl,et al.  Exact XML Type Checking in Polynomial Time , 2007, ICDT.

[26]  Loreto Bravo,et al.  Efficient Approximation Algorithms for Repairing Inconsistent Databases , 2007, 2007 IEEE 23rd International Conference on Data Engineering.

[27]  Shuai Ma,et al.  Improving Data Quality: Consistency and Accuracy , 2007, VLDB.