Recognizing Determinism in Prioritized Repairing of Inconsistent Databases

A repair of an inconsistent database is traditionally defined as a consistent database that differs from the inconsistent one in a “minimal way.” Because there are often reasons to prefer one repair over another, researchers have introduced and investigated the framework of preferred repairs, in which a priority relation between facts is lifted to a priority relation between consistent databases, and repairs are restricted to those that are optimal in the lifted sense. In this paper we describe our recent results on the complexity of deciding whether the priority relation suffices to clean the database unambiguously, that is, whether there is exactly one optimal repair. In particular, we show that different conventional semantics of priority lifting entail markedly different complexities.
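To make the decision problem concrete, the following is a minimal brute-force sketch, not the algorithms studied in the paper. It rests on several assumptions that the abstract does not state: constraints are modeled as binary conflicts between facts, a repair is a maximal consistent subset, and priorities are lifted using the Pareto semantics (one consistent subset improves another if some fact it gains is preferred to every fact it loses). All function names are hypothetical, and the enumeration is exponential in the number of facts.

```python
from itertools import combinations


def is_consistent(facts, conflicts):
    """A set of facts is consistent if it contains no conflicting pair."""
    return not any(a in facts and b in facts for a, b in conflicts)


def consistent_subsets(db, conflicts):
    """Enumerate every consistent subset of db (exponential; illustration only)."""
    return [frozenset(s)
            for r in range(len(db) + 1)
            for s in combinations(sorted(db), r)
            if is_consistent(set(s), conflicts)]


def is_pareto_improvement(better, worse, priority):
    """Assumed Pareto lifting: some gained fact is preferred to every lost fact.

    priority is a set of pairs (f, g) meaning "f is preferred to g".
    """
    gained, lost = better - worse, worse - better
    return any(all((f, g) in priority for g in lost) for f in gained)


def optimal_repairs(db, conflicts, priority):
    """Repairs (maximal consistent subsets) that admit no Pareto improvement."""
    subsets = consistent_subsets(db, conflicts)
    maximal = [s for s in subsets if not any(s < t for t in subsets)]
    return [r for r in maximal
            if not any(is_pareto_improvement(t, r, priority) for t in subsets)]


def cleans_unambiguously(db, conflicts, priority):
    """True if there is exactly one optimal repair."""
    return len(optimal_repairs(db, conflicts, priority)) == 1


# Hypothetical instance: fact b conflicts with both a and c.
db = {"a", "b", "c"}
conflicts = {("a", "b"), ("b", "c")}

print(cleans_unambiguously(db, conflicts, {("a", "b")}))  # a preferred to b
print(cleans_unambiguously(db, conflicts, {("b", "a")}))  # b preferred to a
```

In this instance the two repairs are {a, c} and {b}. Preferring a to b makes {a, c} an improvement of {b}, so a single optimal repair remains. Preferring b to a is not enough under the assumed semantics, because b is not also preferred to c, so both repairs stay optimal and the cleaning is ambiguous.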
