Using OCL for Automatically Producing Multidimensional Models and ETL Processes

During the last few years, several frameworks have dealt with Data Warehousing (DW) design issues. Most of these frameworks provide partial answers that focus either on multidimensional (MD) modelling or on Extraction-Transformation-Loading (ETL) modelling. Yet, neither the study of unifying both modelling issues nor their automation have been considered thoroughly. To overcome these limits, we suggest a generic unified method that automatically integrates DW and ETL design. The framework is handled within the Model Driven Architecture (MDA). In this paper we present a unified conceptual model that describes both the DW and its ETL process using the constellation model and the Object Constraint Language (OCL). Morevoer, we give a logical model for the ETL workflow and a set of Query/View/Transformation(QVT) mapping rules from the conceptual level to the logical level and then to the physical one. At the end, we describe the implemented prototype architecture.

[1]  Atish P. Sinha,et al.  A comparison of data warehousing methodologies , 2005, CACM.

[2]  Alkis Simitsis,et al.  Mapping conceptual to logical models for ETL processes , 2005, DOLAP '05.

[3]  Matteo Golfarelli,et al.  The Dimensional Fact Model: A Conceptual Model for Data Warehouses , 1998, Int. J. Cooperative Inf. Syst..

[4]  Isabelle Comyn-Wattiau,et al.  A UML-based data warehouse design method , 2006, Decis. Support Syst..

[5]  Dimitrios Skoutas,et al.  Representation of conceptual ETL designs in natural language using Semantic Web technology , 2010, Data Knowl. Eng..

[6]  Matteo Golfarelli,et al.  A methodological framework for data warehouse design , 1998, DOLAP '98.

[7]  Tok Wang Ling,et al.  Conceptual Modeling - ER 2003 , 2003, Lecture Notes in Computer Science.

[8]  Jose-Norberto Mazón,et al.  Automatic generation of ETL processes from conceptual models , 2009, DOLAP.

[9]  Ralph Kimball,et al.  The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses , 1996 .

[10]  Alberto Abelló,et al.  Research in data warehouse modeling and design: dead or alive? , 2006, DOLAP '06.

[11]  Ronan Tournier,et al.  A Unified Model Driven Methodology for Data Warehouses and ETL Design , 2011, ICEIS.

[12]  José Barateiro,et al.  A Survey of Data Quality Tools , 2005, Datenbank-Spektrum.

[13]  Anneke Kleppe,et al.  The Object Constraint Language: Getting Your Models Ready for MDA , 2003 .

[14]  Panos Vassiliadis,et al.  Data Mapping Diagrams for Data Warehouse Design with UML , 2004, ER.

[15]  Jose-Norberto Mazón,et al.  Extending OCL for OLAP querying on conceptual multidimensional models of data warehouses , 2010, Inf. Sci..

[16]  Anneke Kleppe,et al.  MDA explained - the Model Driven Architecture: practice and promise , 2003, Addison Wesley object technology series.

[17]  Jose-Norberto Mazón,et al.  A hybrid model driven development framework for the multidimensional modeling of data warehouses! , 2009, SGMD.

[18]  Juan Trujillo,et al.  A UML Based Approach for Modeling ETL Processes in Data Warehouses , 2003, ER.

[19]  Hongjun Lu,et al.  Conceptual Modeling – ER 2004 , 2004, Lecture Notes in Computer Science.

[20]  Panos Vassiliadis A Survey of Extract-Transform-Load Technology , 2009, Int. J. Data Warehous. Min..

[21]  Alberto Abelló,et al.  A Survey of Multidimensional Modeling Methodologies , 2009, Int. J. Data Warehous. Min..

[22]  Olivier Teste,et al.  Graphical Querying of Multidimensional Databases , 2007, ADBIS.

[23]  Jordi Cabot,et al.  Specifying Aggregation Functions in Multidimensional Models with OCL , 2010, ER.

[24]  Peretz Shoval,et al.  Conceptual Modeling - ER 2010, 29th International Conference on Conceptual Modeling, Vancouver, BC, Canada, November 1-4, 2010. Proceedings , 2010, ER.