A DBMS For Large Statistical Databases

This paper describes the approach taken at Statistics Canada to create a generalized DBMS appropriate for the management of large statistical data-bases. The authors describe the requirements for statistical database processing which differ from more traditional database applications and the methods employed in storage organization and system architecture to satisfy them. Special emphasis is given to the usefulness of the transposed physical structure for the creation of statistical databases. Finally, the applicability of the system as a basis for a relational DBMS for large scale processing is discussed.

[1]  Dennis G. Severance,et al.  A Practical Approach to Selecting Record Access Paths , 1977, CSUR.

[2]  Donald D. Chamberlin,et al.  SEQUEL: A structured English query language , 1974, SIGFIDET '74.

[3]  Don S. Batory,et al.  On searching transposed files , 1978, ACM Trans. Database Syst..

[4]  P. A. Dearnley A Model of a Self-Organising Data Management System , 1974, Comput. J..

[5]  Jeffrey Alan Hoffer A clustering approach to the generation of subfiles for the design of a computer data base. , 1975 .

[6]  Dennis G. Severance,et al.  The use of cluster analysis in physical data base design , 1975, VLDB '75.

[7]  Jair M. Babad A record and file partitioning model , 1977, CACM.

[8]  Dennis G. Severance,et al.  Mathematical Techniques for Efficient Record Segmentation in Large Shared Databases , 1976, JACM.

[9]  Randall L. Frank,et al.  CODASYL Data-Base Management Systems , 1976, CSUR.

[10]  Richard H. Day,et al.  Letter to the Editor-On Optimal Extracting from a Multiple File Data Storage System: An Application of Integer Programming , 1965 .

[11]  Alfonso F. Cardenas Analysis and performance of inverted data base structures , 1975, CACM.

[12]  Arvola Chan,et al.  Index selection in a self-adaptive data base management system , 1976, SIGMOD '76.

[13]  William D. Slysz,et al.  Remark on algorithm 434: exact probabilities for R×C contingency tables , 1974, Commun. ACM.

[14]  Dennis G. Severance,et al.  The determination of efficient record segmentations and blocking factors for shared data files , 1977, TODS.

[15]  Peter M. Stocker,et al.  Self-Organising Data Management Systems , 1973, Comput. J..