Automated Database Schema Design Using Mined Data Dependencies

Data dependencies are used in database schema design to enforce the correctness of a database as well as to reduce redundant data. These dependencies are usually determined from the semantics of the attributes and are then enforced upon the relations. This paper describes a bottom-up procedure for discovering multivalued dependencies (MVDs) in observed data without knowing a priori the relationships amongst the attributes. The proposed algorithm is an application of the technique we designed for learning conditional independencies in probabilistic reasoning. A prototype system for automated database schema design has been implemented. Experiments were carried out to demonstrate both the effectiveness and efficiency of our method.
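
To make concrete what it means for an MVD to hold in observed data, the following is a minimal sketch that checks a single candidate MVD X ->> Y directly from its textbook definition: within each group of tuples that agree on X, the Y-values and the values of the remaining attributes must combine freely. This is a brute-force check written for illustration only, not the discovery algorithm proposed in the paper; the function name `mvd_holds`, its parameters, and the sample course/teacher/book relation are all hypothetical.

```python
from collections import defaultdict


def mvd_holds(rows, attrs, x, y):
    """Return True if the MVD X ->> Y holds in the relation `rows`.

    rows  : iterable of tuples (the observed relation)
    attrs : attribute names, in the same order as the tuple fields
    x, y  : lists of attribute names (assumed disjoint)
    """
    idx = {a: i for i, a in enumerate(attrs)}
    # Z is everything outside X and Y.
    z = [a for a in attrs if a not in x and a not in y]

    # Group the distinct tuples by their X-value, keeping (Y, Z) pairs.
    groups = defaultdict(set)
    for row in set(rows):
        key = tuple(row[idx[a]] for a in x)
        y_val = tuple(row[idx[a]] for a in y)
        z_val = tuple(row[idx[a]] for a in z)
        groups[key].add((y_val, z_val))

    # The MVD holds iff each group is the full cross product of its
    # Y-values and Z-values. Since the pairs are always a subset of that
    # product, comparing sizes is enough.
    for pairs in groups.values():
        ys = {p[0] for p in pairs}
        zs = {p[1] for p in pairs}
        if len(pairs) != len(ys) * len(zs):
            return False
    return True


# Hypothetical sample relation: teachers and books vary independently per course.
attrs = ("course", "teacher", "book")
rows = [
    ("db", "smith", "ullman"),
    ("db", "smith", "maier"),
    ("db", "jones", "ullman"),
    ("db", "jones", "maier"),
]
print(mvd_holds(rows, attrs, ["course"], ["teacher"]))
print(mvd_holds(rows[:-1], attrs, ["course"], ["teacher"]))
```

On the full sample relation the check succeeds; dropping any one tuple breaks the cross-product structure and the check fails. A discovery procedure would have to search over many candidate (X, Y) pairs, which is where an efficient method such as the one the paper describes becomes necessary.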
