The Problem of Equivalence for Entity-Relationship Diagrams

We investigate the question of when two entity-relationship diagrams (ERDs) should be considered equivalent, in the sense of representing the same information. This question is important for any database design process that uses the ERD model, and it can be interpreted in several ways. We give three natural and increasingly strict criteria for developing concepts of equivalence for ERDs. We first give a notion of "domain data compatibility", which ensures that the ERDs in question represent the same universe of data in an aggregate sense. We then define the set of functional dependencies that are naturally embedded in each ERD, and use it to develop a concept of "data dependency equivalence", which ensures that the ERDs satisfy the same constraints (functional dependencies) among the represented data. Finally, we give our strongest criterion, "instance data equivalence", which requires the ERDs to have the same power to represent instances of data. We develop several alternative forms of this third notion, including some that yield efficient tableau tests for it. For each type of equivalence, we give a polynomial-time algorithm to test for it.
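As a rough illustration of what a polynomial-time test for "data dependency equivalence" could look like, the sketch below checks whether two sets of functional dependencies imply each other, using the standard attribute-closure technique from relational theory. It is not the algorithm of this paper: how the dependencies are extracted from an ERD is not shown, and the attribute names (`emp_id`, `dept`, `manager`) and all function names are hypothetical.

```python
from typing import FrozenSet, Iterable, List, Tuple

# A functional dependency X -> Y, stored as (X, Y) with both sides as attribute sets.
FD = Tuple[FrozenSet[str], FrozenSet[str]]


def fd(lhs: Iterable[str], rhs: Iterable[str]) -> FD:
    """Build a functional dependency from two collections of attribute names."""
    return (frozenset(lhs), frozenset(rhs))


def closure(attrs: Iterable[str], fds: List[FD]) -> FrozenSet[str]:
    """Return every attribute determined by `attrs` under `fds`.

    Repeatedly applies each dependency whose left side is already covered,
    until no new attribute is added. This runs in polynomial time.
    """
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if lhs <= result and not rhs <= result:
                result |= rhs
                changed = True
    return frozenset(result)


def implies(fds: List[FD], dep: FD) -> bool:
    """True if `dep` follows from `fds`."""
    lhs, rhs = dep
    return rhs <= closure(lhs, fds)


def equivalent(f: List[FD], g: List[FD]) -> bool:
    """True if the two dependency sets imply each other."""
    return all(implies(f, d) for d in g) and all(implies(g, d) for d in f)


# Hypothetical dependency sets, assumed to have been read off two diagrams.
F = [fd(["emp_id"], ["dept"]), fd(["dept"], ["manager"])]
G = F + [fd(["emp_id"], ["manager"])]  # adds a dependency already implied by F
H = [fd(["emp_id"], ["dept"])]         # lacks dept -> manager

same_fg = equivalent(F, G)
same_fh = equivalent(F, H)
```

Here `F` and `G` impose the same constraints because the extra dependency in `G` follows by transitivity, while `H` is strictly weaker than `F`.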
