Inferring null join dependencies in relational databases

The inference problem for data dependencies in relational databases is the problem of deciding whether a set of data dependencies logically implies another data dependency. For join dependencies (JDs), the inference problem has been extensively studied by utilising the well-known chase procedure. We generalise JDs to null join dependencies (NJDs) that hold in relations which may contain null values. In our model for incomplete information we allow only a single unmarked null value denoted bynull. This allows us to solve the inference problem for NJDs by extending the chase procedure to the or-chase procedure. In order to define the or-chase procedure we generalise relations with nulls to or-relations which contain a limited form of disjunctive information. The main result of the paper shows that the inference problem for NJDs, including embedded NJDs (which are a special case of NJDs), is decidable; this is realised via the or-chase procedure.

[1]  Rajshekhar Sunderraman,et al.  Indefinite and maybe information in relational databases , 1990, TODS.

[2]  David Maier,et al.  Testing implications of data dependencies , 1979, SIGMOD '79.

[3]  Catriel Beeri,et al.  On the Properties of Join Dependencies , 1979, Advances in Data Base Theory.

[4]  Moshe Y. Vardi,et al.  Notions of dependency satisfaction , 1986 .

[5]  David Maier,et al.  Relaxing the universal relation scheme assumption , 1985, ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems.

[6]  David A. Schmidt,et al.  Denotationaisemantics: a methodology for language development , 1986 .

[7]  Witold Lipski,et al.  Nonapplicable Nulls , 1986, Theor. Comput. Sci..

[8]  Jeffrey D. Ullman,et al.  Principles Of Database And Knowledge-Base Systems , 1979 .

[9]  Mark Levene,et al.  The Nested Universal Relation Database Model , 1992, Lecture Notes in Computer Science.

[10]  Y. Edmund Lien Multivalued Dependencies With Null Values In Relational Data Bases , 1979, Fifth International Conference on Very Large Data Bases, 1979..

[11]  John Grant,et al.  Incomplete Information in a Relational Database , 1980, Fundamenta Informaticae.

[12]  Carlo Zaniolo,et al.  Database relations with null values , 1982, J. Comput. Syst. Sci..

[13]  Yehoshua Sagiv,et al.  Subset Dependencies and a Completeness Result for a Subclass of Embedded Multivalued Dependencies , 1982, JACM.

[14]  Tomasz Imielinski,et al.  Incomplete Information in Relational Databases , 1984, JACM.

[15]  Saharon Shelah,et al.  Fixed-point extensions of first-order logic , 1985, 26th Annual Symposium on Foundations of Computer Science (sfcs 1985).

[16]  Ronald Fagin,et al.  Degrees of acyclicity for hypergraphs and relational database schemes , 1983, JACM.

[17]  Paolo Atzeni,et al.  A New Interpretation for Null Values in the Weak Instance Model , 1990, J. Comput. Syst. Sci..

[18]  Sushil Jajodia,et al.  Lossless outer joins with incomplete information , 1990, BIT Comput. Sci. Sect..

[19]  Catriel Beeri,et al.  A Proof Procedure for Data Dependencies , 1984, JACM.

[20]  Edward Sciore,et al.  A Complete Axiomatization of Full Join Dependencies , 1982, JACM.