Incomplete and uncertain information in relational databases

In real life it is very often the case that the available knowledge is imperfect in the sense that it represents multiple possible states of the external world, yet it is unknown which state corresponds to the actual situation of the world. Imperfect knowledge can be of two different categories. Knowledge is incomplete if it represents different states, one of which is true in the external world. On the contrary, knowledge is uncertain if it represents different states which may be satisfied or are likely to be true in the external world.Imperfect knowledge can be considered under two different perspectives: using either an algebraic or a logical approach. We present both approaches in relation with the standard relational model, providing the necessary background for the subsequent development.The study of imperfect knowledge has been an active area of research, in particular in the context of relational databases. However, due to the complexity of manipulating imperfect knowledge, little practical results have been obtained so far. In this thesis we provide a survey of the field of incompleteness and uncertainty in relational databases;it can be used also as an introductory tutorial for understanding the intuitive semantics and the problems encountered when representing and manipulating such imperfect knowledge. The survey concentrates in giving an unifying presentation of the different approaches and results found in the literature, thus providing a state of the art in the field.The rest of the thesis studies in detail the manipulation of one type of incomplete knowledge, namely disjunctive information, and one type of uncertain knowledge, namely probabilistic information. We study both types of imperfect knowledge using similar approaches, that is through an algebraic and a logical framework. The relational algebra operators are generalized for disjunctive and probabilistic relations, and we prove the correctness of these generalizations. In addition, disjunctive and probabilistic databases are formalized using appropriate logical theories and we give sound and complete query evaluation algorithms.A major implication of these studies is the conviction that viewing incompleteness and uncertainty as different facets of the same problem would allow to achieve a deeper understanding of imperfect knowledge, which is absolutely necessary for building information systems capable of modeling complex real-life situations.

[1]  Joachim Biskup,et al.  A foundation of CODD's relational maybe-operations , 1983, TODS.

[2]  Hector J. Levesque,et al.  Foundations of a Functional Approach to Knowledge Representation , 1984, Artif. Intell..

[3]  Arthur M. Keller,et al.  On the Use of an Extended Relational Model to Handle Changing Incomplete Information , 1985, IEEE Transactions on Software Engineering.

[4]  Johann Eder,et al.  Logic and Databases , 1992, Advanced Topics in Artificial Intelligence.

[5]  Jorge Lobo,et al.  WF³: A Semantics for Negation in Normal Disjunctive Logic Programs , 1991, ISMIS.

[6]  Yannis Vassiliou,et al.  Null values in data base management a denotational semantics approach , 1979, SIGMOD '79.

[7]  Jack Minker,et al.  Procedural Interpretation of Non-Horn Logic Programs , 1988, CADE.

[8]  Ronald Fagin,et al.  A logic for reasoning about probabilities , 1988, [1988] Proceedings. Third Annual Information Symposium on Logic in Computer Science.

[9]  Raymond Reiter,et al.  Foundations of Assumption-based Truth Maintenance Systems: Preliminary Report , 1987, AAAI.

[10]  Arcot Rajasekar Disjunctive Logic Programs and Constrain Processing , 1991, ISMIS.

[11]  Georg Gottlob,et al.  Closed World Databases Opened Through Null Values , 1988, VLDB.

[12]  Jack Minker,et al.  A Stratification Semantics for General Disjunctive Programs , 1989, NACLP.

[13]  Jeffrey D. Ullman,et al.  Principles Of Database And Knowledge-Base Systems , 1979 .

[14]  Tomasz Imielinski,et al.  Incomplete Information in Relational Databases , 1984, JACM.

[15]  Didier Dubois,et al.  Fuzzy sets and systems ' . Theory and applications , 2007 .

[16]  Tomasz Imielinski Query processing in deductive databases with incomplete information , 1986, SIGMOD '86.

[17]  J. Minker,et al.  Semantics for disjunctive logic programs , 1989 .

[18]  H. Levesque A formal treatment of incomplete knowledge bases , 1981 .

[19]  Jeffrey D. Uuman Principles of database and knowledge- base systems , 1989 .

[20]  Witold Lipski On Relational Algebra with Marked Nulls. , 1984, PODS 1984.

[21]  Tomasz Imielinski,et al.  Complexity of query processing in databases with OR-objects , 1989, PODS '89.

[22]  Didier Dubois,et al.  Theorem Proving Under Uncertainty - A Possibility Theory-based Approach , 1987, IJCAI.

[23]  John Grant,et al.  Answering Queries in Indefinite Databases and the Null Value Problem , 1986, Adv. Comput. Res..

[24]  David Maier,et al.  The Theory of Relational Databases , 1983 .

[25]  Joseph R. Shoenfield,et al.  Mathematical logic , 1967 .

[26]  E. F. Codd,et al.  A relational model of data for large shared data banks , 1970, CACM.

[27]  E. F. Codd,et al.  Extending the database relational model to capture more meaning , 1979, ACM Trans. Database Syst..

[28]  Joseph Y. Halpern An Analysis of First-Order Logics of Probability , 1989, IJCAI.

[29]  Alain Pirotte,et al.  Generalized joins , 1976, SGMD.

[30]  Lotfi A. Zadeh,et al.  A Theory of Approximate Reasoning , 1979 .

[31]  Jorge Lobo,et al.  Extending the Semantics of Logic Programs to Disjunctive Logic Programs , 1989, ICLP.

[32]  Stefan Brass,et al.  Generalized Bottom-Up Query Evaluation , 1992, EDBT.

[33]  Johan de Kleer,et al.  Extending the ATMS , 1986, Artif. Intell..

[34]  D. Dubois,et al.  Fuzzy sets in approximate reasoning, part 2: logical approaches , 1991 .

[35]  Teodor C. Przymusinski,et al.  The extended closed world assumption and its relationship to parallel circumscription , 1985, PODS '86.

[36]  John Grant,et al.  Null Values in a Relational Data Base , 1977, Inf. Process. Lett..

[37]  Rajshekhar Sunderraman,et al.  On representing indefinite and maybe information in relational databases , 1988, Proceedings. Fourth International Conference on Data Engineering.

[38]  John L. Bell,et al.  A course in mathematical logic , 1977 .

[39]  A framework for comparison of update semantics , 1988, PODS '88.

[40]  Werner Kießling,et al.  Database Support for Problematic Knowledge , 1992, EDBT.

[41]  Tomasz Imielinski,et al.  Incomplete information and dependencies in relational databases , 1983, SIGMOD '83.

[42]  Witold Lipski,et al.  On Databases with Incomplete Information , 1981, JACM.

[43]  Nadine Lerat Evaluation de requêtes sur une base de données logique satisfaisant un ensemble de dépendances , 1986, BDA.

[44]  Marianne Winslett,et al.  A model-based approach to updating databases with incomplete information , 1988, TODS.

[45]  Carlo Zaniolo,et al.  Database relations with null values , 1982, J. Comput. Syst. Sci..

[46]  Alexander Borgida,et al.  Language features for flexible handling of exceptions in information systems , 1985, TODS.

[47]  Joachim Biskup,et al.  A Formal Approach to Null Values in Database Relations , 1979, Advances in Data Base Theory.

[48]  Joobin Choobineh,et al.  Extension of relational views to derived relations with exceptions , 1990, Inf. Syst..

[49]  Michael Gelfond,et al.  Negation as Failure: Careful Closure Procedure , 1986, Artif. Intell..

[50]  Rajshekhar Sunderraman,et al.  Indefinite and maybe information in relational databases , 1990, TODS.

[51]  Gösta Grahne,et al.  Dependency Satisfaction in Databases with Incomplete Information , 1984, VLDB.

[52]  Jack Minker,et al.  On Indefinite Databases and the Closed World Assumption , 1987, CADE.

[53]  R. Sunderraman,et al.  A Generalized Relational Model for Indefinite and Maybe Information , 1991, IEEE Trans. Knowl. Data Eng..

[54]  Nils J. Nilsson,et al.  Probabilistic Logic * , 2022 .

[55]  Raymond Reiter,et al.  Towards a Logical Reconstruction of Relational Database Theory , 1982, On Conceptual Modelling.

[56]  Ronald Fagin,et al.  Uncertainty, belief, and probability 1 , 1991, IJCAI.

[57]  Ichiro Semba An Efficient Algorithm for Generating All k-Subsets (1 <= k <= m <= n) of the Set { 1, 2, ..., n } in Lexicographical Order , 1984, J. Algorithms.

[58]  Michael Pittarelli,et al.  The Theory of Probabilistic Databases , 1987, VLDB.

[59]  Jack Minker,et al.  Logic and Databases: A Deductive Approach , 1984, CSUR.

[60]  Raymond Reiter,et al.  A sound and sometimes complete query evaluation algorithm for relational databases with null values , 1986, JACM.

[61]  Marianne Winslett Updating Logical Databases Containing Null Values , 1986, ICDT.

[62]  Serge Abiteboul,et al.  Update Semantics for Incomplete Databases , 1985, VLDB.

[63]  Nadine Lerat Query Processing in Incomplete Logical Databases , 1986, ICDT.

[64]  Raymond Reiter,et al.  A Logic for Default Reasoning , 1987, Artif. Intell..

[65]  Eliezer L. Lozinskii Plausible World Assumption , 1989, KR.

[66]  Teodor C. Przymusinski On the Declarative Semantics of Deductive Databases and Logic Programs , 1988, Foundations of Deductive Databases and Logic Programming..

[67]  Witold Lipski,et al.  On semantic issues connected with incomplete information databases , 1979, ACM Trans. Database Syst..

[68]  E. F. Codd,et al.  Missing information (applicable and inapplicable) in relational databases , 1986, SGMD.

[69]  Serge Abiteboul,et al.  On the representation and querying of sets of possible worlds , 1987, SIGMOD '87.

[70]  Yannis Vassiliou Functional Dependencies and Incomplete Information , 1980, VLDB.

[71]  Jorge Lobo,et al.  Skeptical Reasoning and Disjunctive Programs , 1989, KR.

[72]  Arun K. Majumdar,et al.  Fuzzy Functional Dependencies and Lossless Join Decomposition of Fuzzy Relational Database Systems , 1988, ACM Trans. Database Syst..

[73]  Richard C. T. Lee,et al.  Symbolic logic and mechanical theorem proving , 1973, Computer science classics.

[74]  A. Tarski A Decision Method for Elementary Algebra and Geometry , 2023 .

[75]  Herbert B. Enderton,et al.  A mathematical introduction to logic , 1972 .

[76]  Rajshekhar Sunderraman,et al.  On representing indefinite and maybe information in relational databases: a generalization , 1990, [1990] Proceedings. Sixth International Conference on Data Engineering.

[77]  Gösta Grahne,et al.  The Problem of Incomplete Information in Relational Databases , 1991, Lecture Notes in Computer Science.

[78]  Shi-Kuo Chang,et al.  Imprecise schema: a rationale for relations with embedded subrelations , 1989, TODS.

[79]  E. F. Codd,et al.  Understanding Relations (Installment #7) , 1974, FDT Bull. ACM SIGFIDET SIGMOD.

[80]  Li-Yan Yuan,et al.  A sound and complete query evaluation algorithm for relational databases with null values , 1988, SIGMOD '88.

[81]  Jack Minker,et al.  Toward A Foundation of Disjunctive Logic Programming , 1989, NACLP.

[82]  Johan de Kleer,et al.  An Assumption-Based TMS , 1987, Artif. Intell..

[83]  Raymond Reiter,et al.  What should a database know? , 1988, PODS '88.

[84]  Alexander Borgida,et al.  Accommodating Exceptions in Databases, and Refining the Schema by Learning from them , 1985, VLDB.

[85]  J. D. Uiiman,et al.  Principles of Database Systems , 2004, PODS 2004.

[86]  Jérôme Lang Logique possibiliste : aspects formels, deduction automatique, et applications , 1991 .

[87]  Tore Risch,et al.  A functional approach to integrating database and expert systems , 1988, CACM.

[88]  R. Reiter On Closed World Data Bases , 1987, Logic and Data Bases.

[89]  Hector Garcia-Molina,et al.  A Probalilistic Relational Data Model , 1990, EDBT.