Mefisto: A Functional Model for Statistical Entities

There have been numerous proposals aimed at correcting the deficiency in existing database models in order to manipulate statistical data. The manipulation of these data, such as statistical tables that are widely used in many statistical database application areas, is examined. A functional model Mefisto, which is based on a data structure called statistical entity and on a set of operations capable of manipulating this data structure, is proposed. The characteristics that an aggregate data model has are discussed and a brief survey of the main proposals in literature is made. The operators that allow statistical entities to be manipulated both from the descriptive and from the summary data point of view are discussed, and some examples are given. Each operator can be seen as a family of operators, and each is able to automatically compute the summary values of the statistical entity obtained by its application. A brief discussion regarding the limitations of the relational model for this type of data and a comparison with other proposals are presented. The advantages of the Mefisto model over those proposals are illustrated. It is shown that it is possible to define user friendly query languages based on the Mefisto model. >

[1]  Arie Shoshani,et al.  Statistical and Scientific Database Issues , 1985, IEEE Transactions on Software Engineering.

[2]  Maurizio Rafanelli,et al.  Research Topics in Statistical and Scientific Database Management: the IV SSDBM , 1988, SSDBM.

[3]  Arie Shoshani,et al.  SUBJECT: A Directory Driven System for Organizing and Accessing Large Statistical Databases , 1981, VLDB.

[4]  John L. McCarthy,et al.  Metadata Management for Large Statistical Databases , 1982, VLDB.

[5]  Gultekin Özsoyoglu,et al.  Extending relational algebra and relational calculus with set-valued attributes and aggregate functions , 1987, TODS.

[6]  Arie Shoshani,et al.  Statistical Databases: Characteristics, Problems, and some Solutions , 1982, VLDB.

[7]  Stanley Y. W. Su,et al.  SAM*: A Semantic Association Model for Corporate and Scientific/Statistical Databases , 1983, Inf. Sci..

[8]  Richard J. Orli Modeling data for the summary database , 1990, DATB.

[9]  Z. Meral Ozsoyoglu,et al.  An extension of relational algebra for summary tables , 1983 .

[10]  Shamkant B. Navathe,et al.  Complex Data Types and a Data Manipulation Language for Scientific and Statistical Databases , 1983, SSDBM.

[11]  S. B. Yao,et al.  View Modeling and Integration Using the Functional Data Model , 1982, IEEE Transactions on Software Engineering.

[12]  Albert Croker,et al.  The historical relational data model (HRDM) and algebra based on lifespans , 1986, 1987 IEEE Third International Conference on Data Engineering.

[13]  Sakti P. Ghosh Statistical relational tables for statistical database management , 1986, IEEE Transactions on Software Engineering.

[14]  Arie Shoshani,et al.  STORM: A Statistical Object Representation Model , 1990, IEEE Data Eng. Bull..

[15]  David Maier,et al.  The Theory of Relational Databases , 1983 .

[16]  Anthony C. Klug Equivalence of Relational Algebra and Relational Calculus Query Languages Having Aggregate Functions , 1982, JACM.

[17]  David W. Shipman,et al.  The functional data model and the data languages DAPLEX , 1981, TODS.

[18]  David W. Shipman The functional data model and the data language DAPLEX , 1979, SIGMOD '79.

[19]  Maurizio Rafanelli,et al.  Proposal of a Logical Model for Statistical Data Base , 1983, SSDBM.

[20]  Abdullah Uz Tansel A statistical interface for historical relational databases , 1987, 1987 IEEE Third International Conference on Data Engineering.

[21]  M. Rafanelli,et al.  Statistical Database: An Interactive Language for Logical Schema Definition by Means of a Model Based on Graphs , 1984 .

[22]  John C. Klensin,et al.  Statistical Data Management Requirements and the SQL Standards - An Evolving Comparison , 1988, SSDBM.

[23]  Hideto Ikeda,et al.  Additional Facilities of a Concentional DBMS to Support Interactive Statistical Analysis , 1981, SSDBM.

[24]  Francesco M. Malvestuto Answering queries in categorical databases , 1987, PODS '87.

[25]  E. F. Codd,et al.  A relational model of data for large shared data banks , 1970, CACM.

[26]  Rowland R. Johnson,et al.  Modelling summary data , 1981, SIGMOD '81.

[27]  G. Ozsoyoglu,et al.  Statistical Database Query Languages , 1985, IEEE Transactions on Software Engineering.

[28]  Fabrizio L. Ricci,et al.  ADAMS: an Aggregate Data Management System with Multip Interaction Techniques , 1991, DEXA.