Normalisation in the Presence of Lists

In the relational data model (RDM), normal forms are conditions on relation schemata that a database design should satisfy in order to avoid processing difficulties with the database. A prime example is the Boyce-Codd normal form (BCNF), which guarantees the absence of redundancies and update anomalies caused by functional dependencies (FDs).

Many data models have been introduced over the years to capture data beyond relational structures. The success of these advanced data models will depend, in particular, on the study of normal forms. In fact, finding a unifying framework and extending the achievements of relational databases to advanced database features such as complex object types are currently two major challenges in database design (Biskup 1995, Biskup 1998).

Such a unifying approach, capturing several data models at once, can be based on the type systems underlying those data models. In the present paper, we study normalisation in the presence of base, record and finite list types. Nested lists are used as a data structure whenever order matters. List types are therefore supported by many advanced data models, such as genomic sequence, deductive and object-oriented data models, including XML.

On the basis of a finite axiomatisation of FDs in the presence of lists, the Nested List Normal Form (NLNF) is proposed as a weaker normal form than BCNF. This proposal is semantically justified by formally proving that NLNF is equivalent to the absence of redundancy. Moreover, NLNF is equivalent to the absence of strong insertion anomalies and of most forms of replacement anomalies, and it is sufficient for the absence of all types of update anomalies.

[1] Bernhard Thalheim, et al. Fundamental Concepts of Object Oriented Databases, 1993, Acta Cybern.

[2] Bernhard Thalheim, et al. Entity-relationship modeling - foundations of database technology, 2010.

[3] E. F. Codd, et al. A relational model of data for large shared data banks, 1970, CACM.

[4] Peter P. Chen. The entity-relationship model: toward a unified view of data, 1975, VLDB '75.

[5] Dan Suciu, et al. Data on the Web: From Relations to Semistructured Data and XML, 1999.

[6] A. Tarski, et al. On Closed Elements in Closure Algebras, 1946.

[7] Sven Hartmann, et al. More Functional Dependencies for XML, 2003, ADBIS.

[8] Millist Walter Vincent, et al. The Semantic Justification for Normal Forms in Relational Database Design, 1994.

[9] Nicolás Marín, et al. Review of Data on the Web: from relational to semistructured data and XML by Serge Abiteboul, Peter Buneman, and Dan Suciu. Morgan Kaufmann 1999, 2003, SGMD.

[10] A Min Tjoa, et al. Transformation of Requirement Specifications Expressed in Natural Language into an EER Model, 1993, ER.

[11] Miron Livny, et al. The Design and Implementation of a Sequence Database System, 1996, VLDB.

[12] Zahir Tari, et al. Object normal forms and dependency constraints for object-oriented schemata, 1997, TODS.

[13] Joachim Biskup, et al. Database Schema Design Theory: Achievements and Challenges, 1995, CISMOD.

[14] Klaus-Dieter Schewe, et al. Reasoning about Functional and Multi-valued Dependencies in the Presence of Lists, 2004, FoIKS.

[15] Philip A. Bernstein, et al. What does Boyce-Codd Normal Form Do?, 1980, VLDB.

[16] Joachim Biskup, et al. Achievements of Relational Database Schema Design Theory Revisited, 1995, Semantics in Databases.

[17] Shamkant B. Navathe, et al. Conceptual Database Design: An Entity-Relationship Approach, 1991.

[18] Serge Abiteboul, et al. Foundations of Databases, 1994.

[19] Peter P. Chen. English Sentence Structure and Entity-Relationship Diagrams, 1983, Inf. Sci.

[20] Catriel Beeri, et al. A Sophisticate's Introduction to Database Normalization Theory, 1978, VLDB.

[21] Shamim A. Naqvi, et al. A Logical Language for Data and Knowledge Bases, 1989.

[22] Dominique Pastre, et al. Managing Complex Objects in an Extensible Relational DBMS, 1989, VLDB.

[23] Peer Kröger, et al. A Computational Biology Database Digest: Data, Data Analysis, and Data Management, 2004, Distributed and Parallel Databases.

[24] Chengfei Liu, et al. A Redundancy Free 4NF for XML, 2003, Xsym.

[25] Sven Hartmann, et al. On Functional Dependencies in Advanced Data Models, 2003, Electron. Notes Theor. Comput. Sci.

[26] Marc Gyssens, et al. The Structure of the Relational Database Model, 1989, EATCS Monographs on Theoretical Computer Science.

[27] Marcelo Arenas, et al. A normal form for XML documents, 2002, PODS '02.

[28] Sven Hartmann. Decomposing relationship types by pivoting and schema equivalence, 2001, Data Knowl. Eng.

[29] E. F. Codd, et al. Recent Investigations in Relational Data Base Systems, 1974, ACM Pacific.

[30] Bernhard Thalheim, et al. Foundations of entity-relationship modeling, 1993, Annals of Mathematics and Artificial Intelligence.

[31] Philip A. Bernstein, et al. Computational problems related to the design of normal form relational schemas, 1979, TODS.

[32] Roger King, et al. Semantic database modeling: survey, applications, and research issues, 1987, CSUR.

[33] Joel E. Richardson, et al. Supporting Lists in a Data Model (A Timely Approach), 1992, VLDB.

[34] David W. Embley, et al. A normal form for precisely characterizing redundancy in nested relations, 1996, TODS.

[35] Z. Meral Özsoyoglu, et al. A new normal form for nested relations, 1987, TODS.

[36] Ronald Fagin, et al. A normal form for relational databases that is based on domains and keys, 1981, TODS.

[37] Sven Hartmann, et al. A Membership Algorithm for Functional and Multi-valued Dependencies in the Presence of Lists, 2004, Electron. Notes Theor. Comput. Sci.