A method for the mapping of conceptual designs to logical blueprints for ETL processes

Extraction-Transformation-Loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, their cleansing, customization and insertion into a data warehouse. In previous work, we presented a modeling framework for ETL processes comprised of a conceptual model that concretely deals with the early stages of a data warehouse project, and a logical model that deals with the definition of data-centric workflows. In this paper, we describe the mapping of the conceptual model to the logical model. First, we identify how conceptual entities are mapped to logical entities. Next, we determine the execution order in the logical workflow using information adapted from the conceptual model. Finally, we provide a method for the transition from the conceptual model to the logical model.

[1]  Panos Vassiliadis,et al.  Modeling ETL activities as graphs , 2002, DMDW.

[2]  Timos K. Sellis,et al.  Optimizing ETL processes in data warehouses , 2005, 21st International Conference on Data Engineering (ICDE'05).

[3]  Verónika Peralta,et al.  Data Warehouse Logical Design from Multidimensional Conceptual Schemas , 2003 .

[4]  Matteo Golfarelli,et al.  A methodological framework for data warehouse design , 1998, DOLAP '98.

[5]  Panos Vassiliadis,et al.  A generic and customizable framework for the design of ETL scenarios , 2005, Inf. Syst..

[6]  Chuck Ballard,et al.  Data Modeling Techniques for Data Warehousing , 1999 .

[7]  Ralph Kimball,et al.  The Data Warehouse Lifecycle Toolkit , 2009 .

[8]  Michael Böhnlein,et al.  Deriving initial data warehouse structures from the conceptual data models of the underlying operational information systems , 1999, DOLAP '99.

[9]  Panos Vassiliadis,et al.  Data Mapping Diagrams for Data Warehouse Design with UML , 2004, ER.

[10]  Cassandra J Phipps Migrating an Operational Database Schema to Data Warehouse Schemas , 2002 .

[11]  Shamim A. Naqvi,et al.  A Logical Language for Data and Knowledge Bases , 1989 .

[12]  Panos Vassiliadis,et al.  Conceptual modeling for ETL processes , 2002, DOLAP '02.

[13]  Panos Vassiliadis,et al.  Optimizing ETL processes in data warehouse environments , 2005, ICDE 2005.

[14]  Ralph Kimball,et al.  The Data Warehouse Lifecycle Toolkit: Expert Methods for Designing, Developing and Deploying Data Warehouses with CD Rom , 1998 .

[15]  Maurice Bruynooghe,et al.  On the Transformation of Object-Oriented Conceptual Models to Logical Theories , 2002, ER.

[16]  Karen C. Davis,et al.  Automating data warehouse conceptual schema design and evaluation , 2002, DMDW.

[17]  Carsten Sapia,et al.  Automatically generating OLAP schemata from conceptual graphical models , 2000, DOLAP '00.

[18]  Panos Vassiliadis,et al.  A Methodology for the Conceptual Modeling of ETL Processes , 2003, CAiSE Workshops.

[19]  Ibm Redbooks Data Modeling Techniques for Data Warehousing , 1998 .

[20]  Panos Vassiliadis,et al.  A Framework for the Design of ETL Scenarios , 2003, CAiSE.

[21]  Daniel L. Moody,et al.  From enterprise models to dimensional models: a methodology for data warehouse and data mart design , 2000, DMDW.

[22]  Juan Trujillo,et al.  A UML Based Approach for Modeling ETL Processes in Data Warehouses , 2003, ER.