A Model Driven Modernization Approach for Automatically Deriving Multidimensional Models in Data Warehouses

Data warehouses integrate several operational sources to provide a multidimensional (MD) analysis of data. Therefore, the development of a data warehouse claims for an in-depth analysis of these data sources. Several approaches have been presented to obtain multidimensional structures from data sources in order to guide this development. However, these approaches assume that a wide documentation of the data sources is available and only provide informal guidelines to support the discovery of MD elements. Therefore, this task may become highly difficult for complex and large data sources (e.g. legacy systems). To overcome these problems, we consider the development of the data warehouse as a modernization scenario that addresses the analysis of the available data sources, thus discovering MD structures to either derive a data-driven conceptual MD model or reconcile a requirement-driven conceptual MD model with data sources. Specifically, we use concepts from Architecture Driven Modernization (ADM) in order to automatically perform the following tasks: (i) obtain a logical representation of data sources (ii) mark this logical representation with MD concepts, and (iii) derive a conceptual MD model from the marked model. Finally, we have provided a case study based on a real world project in order to exemplify the application of our approach.

[1]  Matteo Golfarelli,et al.  The Dimensional Fact Model: A Conceptual Model for Data Warehouses , 1998, Int. J. Cooperative Inf. Syst..

[2]  Grace A. Lewis,et al.  Modernizing Legacy Systems - Software Technologies, Engineering Processes, and Business Practices , 2003, SEI series in software engineering.

[3]  Jose-Norberto Mazón,et al.  Applying Transformations to Model Driven Data Warehouses , 2006, DaWaK.

[4]  Jean-Luc Hainaut,et al.  Strategy for Database Application Evolution: The DB-MAIN Approach , 2003, ER.

[5]  Tok Wang Ling,et al.  Conceptual Modeling - ER 2003 , 2003, Lecture Notes in Computer Science.

[6]  Michael Böhnlein,et al.  Deriving initial data warehouse structures from the conceptual data models of the underlying operational information systems , 1999, DOLAP '99.

[7]  Christian Soutou,et al.  Relational Database Reverse Engineering: Algorithms to Extract Cardinality Constraints , 1998, Data Knowl. Eng..

[8]  W. H. Inmon,et al.  Building the data warehouse , 1992 .

[9]  Ralph Kimball,et al.  The Data Warehouse Lifecycle Toolkit , 2009 .

[10]  Jose-Norberto Mazón,et al.  Reconciling requirement-driven data warehouses with data sources via multidimensional normal forms , 2007, Data Knowl. Eng..

[11]  Gottfried Vossen,et al.  Conceptual data warehouse modeling , 2000, DMDW.

[12]  Reda Alhajj Extracting the extended entity-relationship model from a legacy relational database , 2003, Inf. Syst..

[13]  Il-Yeol Song,et al.  A UML profile for multidimensional modeling in data warehouses , 2006, Data Knowl. Eng..

[14]  Karen C. Davis,et al.  Automating data warehouse conceptual schema design and evaluation , 2002, DMDW.

[15]  Jose-Norberto Mazón,et al.  An MDA approach for the development of data warehouses , 2008, Decis. Support Syst..

[16]  Alberto Abelló,et al.  Research in data warehouse modeling and design: dead or alive? , 2006, DOLAP '06.

[17]  David W. Embley,et al.  Conceptual Modeling - ER 2006, 25th International Conference on Conceptual Modeling, Tucson, AZ, USA, November 6-9, 2006, Proceedings , 2006, ER.

[18]  Torben Bach Pedersen,et al.  Discovering Multidimensional Structure in Relational Data , 2004, DaWaK.

[19]  Antoni Olivé,et al.  Conceptual Schema-Centric Development: A Grand Challenge for Information Systems Research , 2005, CAiSE.

[20]  Jose-Norberto Mazón,et al.  A Set of QVT Relations to Assure the Correctness of Data Warehouses by Using Multidimensional Normal Forms , 2006, ER.

[21]  Mario Piattini,et al.  Applying MDA to the development of data warehouses , 2005, DOLAP '05.

[22]  Daniel L. Moody,et al.  From enterprise models to dimensional models: a methodology for data warehouse and data mart design , 2000, DMDW.

[23]  Paolo Giorgini,et al.  Goal-oriented requirement analysis for data warehouse design , 2005, DOLAP '05.

[24]  W. H. Inmon,et al.  Building the Data Warehouse,3rd Edition , 2002 .