From enterprise models to dimensional models: a methodology for data warehouse and data mart design

This paper describes a method for developing dimensional models from traditional Entity Relationship models. This can be used to design data warehouses and data marts based on enterprise data models. The first step of the method involves classifying entities in the data model into a number of categories. The second step involves identifying hierarchies that exist in the model. The final step involves collapsing these hierarchies and aggregating transaction data to form dimensional models. A number of design alternatives are presented, including a flat schema, a terraced schema, a star schema and a snowflake schema. We also define a new type of schema called a star cluster schema. This is a restricted form of snowflake schema, which minimises the number of tables while avoiding overlap between different dimensional hierarchies. Individual schemas can be collected together to form constellations or galaxies. We illustrate the method using a simple example.

[1]  Ralph Kimball,et al.  The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses , 1996 .

[2]  S. Hitchman Practitioner perceptions on the use of some semantic concepts in the entity–relationship model , 1995 .

[3]  Barry Devlin,et al.  Data Warehouse: From Architecture to Implementation , 1996 .

[4]  Katherine Glassey Seducing the end user , 1998, CACM.

[5]  Laurie J. Kirsch,et al.  The Impact of Data Integration on the Costs and Benefits of Information Systems , 1992, MIS Q..

[6]  Stephen R. Gardner Building the data warehouse , 1998, CACM.

[7]  Varghese S. Jacob,et al.  Industrial-strength data warehousing , 1998, CACM.

[8]  E. F. CODD,et al.  A relational model of data for large shared data banks , 1970, CACM.

[9]  W. H. Inmon,et al.  Building the data warehouse (2nd ed.) , 1996 .

[10]  Peter P. Chen A Preliminary Framework for Entity-Relationship Models , 1981, ER.

[11]  Graeme Simsion,et al.  Justifying Investment in Information Resource Management , 1995, Australas. J. Inf. Syst..

[12]  Graeme Simsion,et al.  Data Modeling Essentials , 1994 .

[13]  G. M. Nijssen,et al.  Conceptual schema and relational database design - a fact oriented approach , 1989 .

[14]  M. T. Sager Data Centred Enterprise Modelling Methodologies - A Study of Practice and Potential , 1988, Aust. Comput. J..

[15]  Shamkant B. Navathe,et al.  Conceptual Database Design: An Entity-Relationship Approach , 1991 .

[16]  F. E. A Relational Model of Data Large Shared Data Banks , 2000 .

[17]  Bruce Love Enterprise Information Technologies: Designing the Competitive Company , 1993 .

[18]  James Martin,et al.  Strategic data-planning methodologies , 1982 .

[19]  W. H. Inmon,et al.  Building the Data Warehouse,3rd Edition , 2002 .

[20]  Robert J. Kauffman,et al.  Reuse and Productivity in Integrated Computer-Aided Software Engineering: An Empirical Study , 1991, MIS Q..