Design of statistical databases: a methodology for the conceptual step

Abstract The aim of the conceptual step in database design is to describe the data involved in the application in a formal and abstract way, without regard to the specific model and language chosen for the implementation. In statistical applications, data are described at different levels of aggregation, from elementary facts about reality to complex aggregations such as classifications, time series, and indexes. The paper describes a methodology for the conceptual design of statistical databases that provides the designer with suitable strategies for defining these levels of aggregation starting from user requirements, and for checking the completeness, coherence, and minimality of the conceptual schema at each level. The methodology uses two data models to represent data: for elementary data, the Entity-Relationship model, widely used in database applications; for summary data, a new model proposed here, designed to be an effective trade-off between expressive power and simplicity of use.
