Armstrong axioms and Boyce-Codd-Heath Normal Form under bag semantics

The theory of functional dependencies is based on relations, i.e. sets of tuples. Over relations, the class of functional dependencies subsumes the class of keys. Commercial database systems permit the storage of bags of tuples where duplicate tuples can occur. Over bags, keys and functional dependencies interact differently from how they interact over relations. We establish finite ground axiomatizations of keys and functional dependencies over bags, and show a strong correspondence to goal and definite clauses in classical propositional logic. We define a syntactic Boyce-Codd-Heath Normal Form condition, and show that the condition characterizes schemata that will never have any redundant data value occurrences in their instances. The results close the gap between the existing set-based theory of data dependencies and database practice where bags are permitted.

[1]  Wenfei Fan,et al.  Keys for XML , 2002, Comput. Networks.

[2]  Ronald Fagin,et al.  Functional Dependencies in a Relational Data Base and Propositional Logic , 1977, IBM J. Res. Dev..

[3]  Sebastian Link,et al.  Empirical evidence for the usefulness of Armstrong relations in the acquisition of meaningful functional dependencies , 2010, Inf. Syst..

[4]  E. F. Codd,et al.  Recent Investigations in Relational Data Base Systems , 1974, ACM Pacific.

[5]  Lucja Kot,et al.  Characterization of the Interaction of XML Functional Dependencies with DTDs , 2007, ICDT.

[6]  Sven Hartmann,et al.  Efficient reasoning about a robust XML key fragment , 2009, TODS.

[7]  Jean H. Gallier,et al.  Linear-Time Algorithms for Testing the Satisfiability of Propositional Horn Formulae , 1984, J. Log. Program..

[8]  I. J. Heath Unacceptable file operations in a relational data base , 1971, SIGFIDET '71.

[9]  Gianfranco Lamperti,et al.  On Multisets in Database Systems , 2000, WMP.

[10]  Joel Berman,et al.  Positive Boolean Dependencies , 1988, Inf. Process. Lett..

[11]  Millist W. Vincent,et al.  Semantic foundations of 4NF in relational database design , 1999, Acta Informatica.

[12]  Ernst-Rüdiger Olderog,et al.  Transformational design of real-time systems Part I: From requirements to program specifications , 1999, Acta Informatica.

[13]  Herbert B. Enderton,et al.  A mathematical introduction to logic , 1972 .

[14]  Ronald Fagin,et al.  An Equivalence Between Relational Database Dependencies and a Fragment of Propositional Logic , 1981, JACM.

[15]  Marcelo Arenas,et al.  A normal form for XML documents , 2004, TODS.

[16]  Sven Hartmann,et al.  Numerical constraints on XML data , 2010, Inf. Comput..