Querying Incomplete Data with Logic Programs: ER Strikes Back

Since Chen's Entity-Relationship (ER) model, conceptual modelling has been playing a fundamental role in relational data design. In this paper we consider an extended ER model enriched with cardinality constraints, disjunction assertions, and is-a relations among both entities and relationships; we present a framework in which the data underlying an ER schema can be directly queried through the schema by using suitable predicates. In this setting, we consider the case of incomplete data, which is likely to happen, for instance, when data from different sources are integrated. We address the problem of providing correct answers to conjunctive queries by reasoning on the schema. Based on previous results about decidability of the problem, we provide a query answering algorithm based on rewriting the initial query into a recursive Datalog query, in which the information about the schema is encoded.We finally give some complexity results, and we show extensions to more general settings.

[1]  Phokion G. Kolaitis,et al.  Conjunctive-query containment and constraint satisfaction , 1998, PODS.

[2]  Georg Gottlob,et al.  Complexity and expressive power of logic programming , 2001, CSUR.

[3]  David S. Johnson,et al.  Testing Containment of Conjunctive Queries under Functional and Inclusion Dependencies , 1984, J. Comput. Syst. Sci..

[4]  David Maier,et al.  Testing implications of data dependencies , 1979, SIGMOD '79.

[5]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[6]  Andrea Calì,et al.  Data integration under integrity constraints , 2004, Inf. Syst..

[7]  Diego Calvanese,et al.  Characterizing Data Complexity for Conjunctive Query Answering in Expressive Description Logics , 2006, AAAI.

[8]  Andrea Calì,et al.  Accessing Data Integration Systems through Conceptual Schemas , 2001, ER.

[9]  Jan Chomicki,et al.  Consistent query answers in inconsistent databases , 1999, PODS '99.

[10]  Andrea Calì,et al.  Query rewriting and answering under constraints in data integration systems , 2003, IJCAI.

[11]  Peter P. Chen The entity-relationship model: toward a unified view of data , 1975, VLDB '75.

[12]  Moshe Y. Vardi Inferring multivalued dependencies from functional and join dependencies , 2004, Acta Informatica.

[13]  Andrea Calì Containment of Conjunctive Queries over Conceptual Schemata , 2006, DASFAA.

[14]  Raymond Reiter On Closed World Data Bases , 1977, Logic and Data Bases.

[15]  Michael R. Genesereth,et al.  Answering recursive queries using views , 1997, PODS '97.

[16]  Diego Calvanese,et al.  Dwq : Esprit Long Term Research Project, No 22469 on the Decidability of Query Containment under Constraints on the Decidability of Query Containment under Constraints , 2022 .

[17]  Diego Calvanese,et al.  DL-Lite: Tractable Description Logics for Ontologies , 2005, AAAI.

[18]  Salvatore J. Stolfo,et al.  Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem , 1998, Data Mining and Knowledge Discovery.

[19]  Andrea Calì,et al.  On the decidability and complexity of query answering over inconsistent and incomplete databases , 2003, PODS.

[20]  E. F. Codd,et al.  A relational model of data for large shared data banks , 1970, CACM.

[21]  Mihalis Yannakakis,et al.  On the Complexity of Testing Implications of Functional and Join Dependencies , 1981, JACM.

[22]  Edward P. F. Chan Containment and minimization of positive conjunctive queries in OODB's , 1992, PODS.

[23]  Johann A. Makowsky,et al.  Identifying Extended Entity-Relationship Object Structures in Relational Schemas , 1990, IEEE Trans. Software Eng..

[24]  Sushil Jajodia,et al.  Conceptual Modeling — ER 2001 , 2001, Lecture Notes in Computer Science.

[25]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[26]  Diego Calvanese,et al.  Data Complexity of Query Answering in Description Logics , 2006, Description Logics.