Abstract In real-time Business Analytics scenarios, like Business Activity Monitoring (BAM) or Operational Intelligence, speeding up ETL is fundamental to provide business users with up-to-date data in order to support decision-making process and to optimize business operations. Conventional ETL processes extract data from heterogeneous operational data sources, transform them according to some predefined semantic and syntactic rules, and finally load results into a new (relational or data warehouse) model, in order to be processed both for business monitoring or analysis. Usually the whole loading process is presented as a technical aspect, far from business facts. In real-time Business Analytics context the traditional ETL approach can still be valid, provided that ETL process has to be designed in a way that circumscribes individual business events and facts and makes them independent one from another. In this context we propose a control-flow- based approach to ETL process modeling, which starts from business facts identification, and represents ETL processes using BPMN notation, which is the foundation for machine-readable code. Our main contribution consists in a proposal for structuring ETL processes and related objects and in its application to a business case.
[1]
Kevin Wilkinson,et al.
Leveraging Business Process Models for ETL Design
,
2010,
ER.
[2]
Panos Vassiliadis,et al.
Data Mapping Diagrams for Data Warehouse Design with UML
,
2004,
ER.
[3]
Esteban Zimányi,et al.
BPMN-Based Conceptual Modeling of ETL Processes
,
2012,
DaWaK.
[4]
P. Mell,et al.
The NIST Definition of Cloud Computing
,
2011
.
[5]
Mario Piattini,et al.
Applying MDA to the development of data warehouses
,
2005,
DOLAP '05.
[6]
Werner Schmidt.
Business Activity Monitoring (BAM)
,
2013
.
[7]
Kevin Wilkinson,et al.
Business Processes Meet Operational Business Intelligence
,
2009,
IEEE Data Eng. Bull..
[8]
Ralph Kimball,et al.
The Data Warehouse ETL Toolkit: Practical Techniques for Extracting, Cleaning, Conforming, and Delivering Data
,
2004
.
[9]
Wil M. P. van der Aalst,et al.
Workflow Patterns
,
2004,
Distributed and Parallel Databases.
[10]
Richard Hull,et al.
Business Artifacts: A Data-centric Approach to Modeling Business Operations and Processes
,
2009,
IEEE Data Eng. Bull..