On the relationship of minimum and optimum covers for a set of functional dependencies

Summary. Most algorithms in relational database theory take a set of functional dependencies as input, and their efficiency depends on the size of that set. Maier introduced the notions of a minimum set (one with as few dependencies as possible) and an optimum set (one that is as short as possible). He showed that a minimum cover for a given set of dependencies can be found in polynomial time, whereas obtaining an optimum cover is an NP-complete problem. Here the relationship between these covers is explored further. It is shown that the length of a minimum set (i) cannot be bounded by a linear function of the length of an optimum cover, and (ii) is bounded by the square of the length of an optimum cover. It is also shown that, somewhat surprisingly, the NP-completeness of the optimization problem is caused solely by the difficulty of optimizing a single class of dependencies having equivalent left sides, not by the global nature of the optimality condition. This result has some practical significance, since the equivalence classes appearing in practice are short. The problem of optimizing an equivalence class is studied, and the left and right sides of a dependency are shown to behave differently. A new representation for equivalence classes based on this observation is suggested. The optimization of single dependencies is shown to be NP-complete, and a method for producing good approximations is given.
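The two size measures contrasted above can be illustrated with a small sketch. It is not taken from the paper: the helper names (`fd`, `closure`, `equivalent`, `count`, `length`) and the example sets `F` and `G` are assumptions made for illustration, and length is taken here to mean the total number of attribute occurrences on both sides of all dependencies. The sketch only checks that two sets are covers of each other and compares their sizes; it does not compute a minimum or an optimum cover.

```python
from typing import FrozenSet, Iterable, List, Tuple

# A functional dependency X -> Y as a pair of attribute sets (illustrative type).
FD = Tuple[FrozenSet[str], FrozenSet[str]]


def fd(lhs: str, rhs: str) -> FD:
    """Build a dependency from strings of single-letter attribute names."""
    return frozenset(lhs), frozenset(rhs)


def closure(attrs: Iterable[str], fds: List[FD]) -> FrozenSet[str]:
    """Attribute closure: all attributes determined by `attrs` under `fds`."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            # Fire the dependency if its left side is covered and it adds something.
            if lhs <= result and not rhs <= result:
                result |= rhs
                changed = True
    return frozenset(result)


def implies(fds: List[FD], dep: FD) -> bool:
    """True if `dep` follows from `fds`."""
    lhs, rhs = dep
    return rhs <= closure(lhs, fds)


def equivalent(f: List[FD], g: List[FD]) -> bool:
    """True if `f` and `g` are covers of each other."""
    return all(implies(g, d) for d in f) and all(implies(f, d) for d in g)


def count(fds: List[FD]) -> int:
    """Number of dependencies (the measure a minimum cover minimizes)."""
    return len(fds)


def length(fds: List[FD]) -> int:
    """Total attribute occurrences (assumed measure an optimum cover minimizes)."""
    return sum(len(lhs) + len(rhs) for lhs, rhs in fds)


# Hypothetical example: A -> C is redundant given A -> B and B -> C.
F = [fd("A", "B"), fd("B", "C"), fd("A", "C")]
G = [fd("A", "B"), fd("B", "C")]

print(equivalent(F, G))       # both sets imply the same dependencies
print(count(F), length(F))    # size of F under each measure
print(count(G), length(G))    # size of G under each measure
```

Under these assumptions, `F` and `G` are equivalent while `G` is smaller under both measures. The paper's results concern the harder case in which a set with the fewest dependencies is still not the shortest one.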
