Assisting Data Warehousing Populating Processes Design through Modelling using Coloured Petri Nets

Data warehousing systems populating processes are responsible for loading their data repositories – the data warehouses – with information they extract from operational sources. The tasks that integrate these processes are the most complex ones that we can find in a data warehousing system. For a flawless implementation, modelling these processes previously is important so that a correct set of requirements is considered. This paper approaches conceptual modelling and simulation of the populating processes of a DWS, by applying Coloured Petri Nets in the design of independent populating tasks. We adopt a change data capture task as the case study in order to demonstrate the effective application of coloured petri nets for modelling and simulating data warehousing populating processes.

[1]  Kurt Jensen An Introduction to the Practical Use of Coloured Petri Nets , 1996, Petri Nets.

[2]  C. Petri Kommunikation mit Automaten , 1962 .

[3]  Larry P. English Improving Data Warehouse and Business Information Quality: Methods for Reducing Costs and Increasing Profits , 1999 .

[4]  Ralph Kimball,et al.  The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data , 2004 .

[5]  Stephen R. Gardner Building the data warehouse , 1998, CACM.

[6]  Laura Maruster,et al.  Encyclopedia of data warehousing and mining , 2008 .

[7]  Matteo Golfarelli DFM as a Conceptual Model for Data Warehouse , 2009, Encyclopedia of Data Warehousing and Mining.

[8]  Panos Vassiliadis,et al.  Conceptual modeling for ETL processes , 2002, DOLAP '02.

[9]  Jorge C. A. de Figueiredo,et al.  Using Coloured Petri Nets to Investigate Behavioural and Performance Issues of TCP Protocols , 1999 .

[10]  José Samos,et al.  YAM2: a multidimensional conceptual model extending UML , 2006, Inf. Syst..

[11]  Lars Michael Kristensen,et al.  Coloured Petri Nets - Modelling and Validation of Concurrent Systems , 2009 .

[12]  Matteo Golfarelli,et al.  Data Warehouse Design: Modern Principles and Methodologies , 2009 .

[13]  Jose-Norberto Mazón,et al.  Automatic generation of ETL processes from conceptual models , 2009, DOLAP.

[14]  Ludmila Cherkasova,et al.  On scalable net modeling of OLTP , 1993, Proceedings of 5th International Workshop on Petri Nets and Performance Models.