Aggregate statistical data: models for their representation

The paper gives a review of a number of data models for aggregate statistical data which have appeared in the computer science literature in the last ten years.After a brief introduction to the data model in general, the fundamental concepts of statistical data are introduced. These are called statistical objects because they are complex data structures (vectors, matrices, relations, time series, etc) which may have different possible representations (e.g. tables, relations, vectors, pie-charts, bar-charts, graphs, and so on). For this reason a statistical object is defined by two different types of attribute (a summary attribute, with its own summary type and with its own instances, called summary data, and the set of category attributes, which describe the summary attribute). Some conceptual models of statistical data (CSM, SDM4S), some semantic models of statistical data (SCM, SAM*, OSAM*), and some graphical models of statistical data (SUBJECT, GRASS, STORM) are also discussed.

[1]  David Maier,et al.  Making smalltalk a database system , 1984, SIGMOD '84.

[2]  Zbigniew Michalewicz Statistical and Scientific Databases , 1991 .

[3]  Anthony C. Klug Equivalence of Relational Algebra and Relational Calculus Query Languages Having Aggregate Functions , 1982, JACM.

[4]  Maurizio Rafanelli,et al.  Mefisto: A Functional Model for Statistical Entities , 1993, IEEE Trans. Knowl. Data Eng..

[5]  Rowland R. Johnson,et al.  Modelling summary data , 1981, SIGMOD '81.

[6]  Gultekin Özsoyoglu,et al.  Extending relational algebra and relational calculus with set-valued attributes and aggregate functions , 1987, TODS.

[7]  Arie Shoshani,et al.  Statistical Databases: Characteristics, Problems, and some Solutions , 1982, VLDB.

[8]  Stephen E. Fienberg,et al.  Discrete Multivariate Analysis: Theory and Practice , 1976 .

[9]  Stanley Y. W. Su,et al.  SAM*: A Semantic Association Model for Corporate and Scientific/Statistical Databases , 1983, Inf. Sci..

[10]  Francesco M. Malvestuto A universal table model for categorical databases , 1989, Inf. Sci..

[11]  Hideto Sato,et al.  A Data Model, Knowledge Base, and Natural Language Processing for Sharing a Large Statistical Database , 1988, SSDBM.

[12]  David W. Shipman,et al.  The functional data model and the data languages DAPLEX , 1981, TODS.

[13]  Sakti P. Ghosh Statistical relational tables for statistical database management , 1986, IEEE Transactions on Software Engineering.

[14]  Abdullah Uz Tansel,et al.  HQUEL, a Query Language for Historical Relational Databases , 1986, SSDBM.

[15]  Dennis McLeod,et al.  Database description with SDM: a semantic database model , 1981, TODS.

[16]  E. F. Codd,et al.  Extending the database relational model to capture more meaning , 1979, ACM Trans. Database Syst..

[17]  Roger E. Cubitt Meta Data: An Experience of Its Uses and Management , 1983, SSDBM.

[18]  Francesco M. Malvestuto Answering queries in categorical databases , 1987, PODS '87.

[19]  David Maier,et al.  Development of an object-oriented DBMS , 1986, OOPSLA 1986.

[20]  Stanley B. Zdonik,et al.  A shared, segmented memory system for an object-oriented database , 1987, TOIS.

[21]  David W. Shipman The functional data model and the data language DAPLEX , 1979, SIGMOD '79.

[22]  Francesco M. Malvestuto,et al.  A universal-scheme approach to statistical databases containing homogeneous summary tables , 1993, TODS.

[23]  Maurizio Rafanelli,et al.  A model for the graphical representation of aggregate data. , 1992 .

[24]  Maurizio Rafanelli,et al.  Suppressing marginal cells to protect sensitive information in a two-dimensional statistical table (extended abstract) , 1991, PODS.

[25]  Z. Meral Özsoyoglu,et al.  A new normal form for nested relations , 1987, TODS.

[26]  Diane C. P. Smith,et al.  Database abstractions: aggregation and generalization , 1977, TODS.

[27]  David Maier,et al.  Development of an object-oriented DBMS , 1986, OOPLSA '86.

[28]  Arie Shoshani,et al.  Statistical and Scientific Database Issues , 1985, IEEE Transactions on Software Engineering.

[29]  Peter P. Chen The entity-relationship model: toward a unified view of data , 1975, VLDB '75.

[30]  Arie Shoshani,et al.  STORM: A Statistical Object Representation Model , 1990, IEEE Data Eng. Bull..

[31]  Arie Shoshani,et al.  SUBJECT: A Directory Driven System for Organizing and Accessing Large Statistical Databases , 1981, VLDB.

[32]  Maurizio Rafanelli,et al.  Proposal of a Logical Model for Statistical Data Base , 1983, SSDBM.

[33]  Carlo Batini,et al.  Design of statistical databases: a methodology for the conceptual step , 1988, Inf. Syst..

[34]  Shamkant B. Navathe,et al.  Complex Data Types and a Data Manipulation Language for Scientific and Statistical Databases , 1983, SSDBM.

[35]  Ryosuke Hotaka,et al.  Conceptual Schema for a Wide-Scope Statistical Database and its Applications , 1986, SSDBM.