A Formal Approach to Null Values in Database Relations

We study the problem of null values. By this we mean that an attribute is applicable but its value at present is unknown and also that an attribute is applicable but its value is arbitrary. We adopt the view that tuples denote statements of predicate logic about database relations. Then, a null value of the first kind, respectively second kind, corresponds to an existentially quantified variable, respectively universally quantified variable. For instance if r is a database relation without null values and X is a range declaration for r then the tuple (a, ∀,b, ∃) ∈ R is intended to mean “there exists an x ∈ X such that for all y ∈ X: (a,y,b,x) ∈ r”. We extend basic operations of the well-known relational algebra to relations with null values. Using formal notions of correctness and completeness (adapted from predicate logic) we show that our extensions are meaningful and natural. Furthermore we reexamine the generalized join within our framework. Finally we investigate the algebraic structure of the class of relations with null values under a partial ordering which can be interpreted as a kind of logical implication.

[1]  Michael Adamowicz,et al.  Data base systems , 1976 .

[2]  Jack Minker,et al.  Logic and Data Bases , 1978, Springer US.

[3]  Raymond Reiter On Closed World Data Bases , 1977, Logic and Data Bases.

[4]  Jean E. Sammet,et al.  An information algebra: phase 1 report—language structure group of the CODASYL development committee , 1962, CACM.

[5]  Hans Hermes Einführung in die Verbandstheorie , 1955 .

[6]  Witold Lipski,et al.  On semantic issues connected with incomplete information databases , 1979, ACM Trans. Database Syst..

[7]  Ronald Fagin,et al.  A normal form for relational databases that is based on domains and keys , 1981, TODS.

[8]  Jean-Marie Nicolas,et al.  Data Base: Theory vs. Interpretation , 1977, Logic and Data Bases.

[9]  Jack Minker,et al.  An Overview and Introduction to Logic and Data Bases , 1977, Logic and Data Bases.

[10]  E. F. Codd,et al.  Extending the database relational model to capture more meaning , 1979, ACM Trans. Database Syst..

[11]  Alfred V. Aho,et al.  The theory of joins in relational data bases , 1977, 18th Annual Symposium on Foundations of Computer Science (sfcs 1977).

[12]  Yannis Vassiliou,et al.  Null values in data base management a denotational semantics approach , 1979, SIGMOD '79.

[13]  Ann Yasuhara,et al.  Recursive function theory and logic , 1971, Computer science and applied mathematics.

[14]  SHAMKANT B. NAVATHE,et al.  On the implementation of a conceptual schema model within a three-level DBMS architecture* , 1979, 1979 International Workshop on Managing Requirements Knowledge (MARK).

[15]  John Grant,et al.  Null Values in a Relational Data Base , 1977, Inf. Process. Lett..

[16]  Catriel Beeri,et al.  A Sophisticate's Introduction to Database Normalization Theory , 1978, VLDB.

[17]  Stephen Todd,et al.  An algebra of relations for machine computation , 1975, POPL '75.

[18]  J. A. Robinson,et al.  A Machine-Oriented Logic Based on the Resolution Principle , 1965, JACM.

[19]  Richard C. T. Lee,et al.  Symbolic logic and mechanical theorem proving , 1973, Computer science classics.

[20]  Alfred V. Aho,et al.  The Theory of Joins in Relational Data Bases (Extended Abstract) , 1977, FOCS.

[21]  Alain Pirotte,et al.  Generalized joins , 1976, SGMD.

[22]  Carlo Zaniolo,et al.  Analysis and design of relational schemata for database systems. , 1976 .

[23]  R. Bosak An information Algebra , 1961 .

[24]  E. F. Codd,et al.  Understanding relations , 1973, SGMD.