The ATDM methodology to support the design and implementation of an enterprise data warehouse

The structure and abundance of Online Resource Usage (ORU) data restricts the effective and efficient analysis of this data by decision makers. Existing enterprise data warehouse (DW) methodologies do not provide sufficient support for the full DW development lifecycle in that support for metadata documentation, logical design, physical design and implementation is often neglected. This paper proposes an extension to the Triple-Driven Data Modelling (TDM) methodology to include the addition of a design and implementation phase. The Adapted Triple-Driven Methodology (ATDM) supports the generation and documentation of semantic and technical metadata that describes an enterprise data warehouse structure at the logical, physical and implementation levels. The ATDM was successfully applied to the Information and Communication Technology Services (ICTS) department of the Nelson Mandela Metropolitan University (NMMU). The application of the ATDM to ICTS resulted in the generation and documentation of semantic and technical metadata for both the logical and physical design of an enterprise data warehouse structure. The implementation phase was applied using the Microsoft SQL Server integrated tool to obtain an implemented DW for ICTS that is described by technical metadata at an implementation level. Several experiments were conducted to benchmark the accuracy, effectiveness and efficiency of the ICTS DW against the existing operational model used for running ad-hoc queries. The results of the investigation have shown that the ATDM can be successfully applied to obtain an effective and efficient enterprise data warehouse for analysing ORU data.

[1]  Panos Vassiliadis,et al.  A Methodology for the Conceptual Modeling of ETL Processes , 2003, CAiSE Workshops.

[2]  Lakshmi S. Iyer,et al.  Knowledge warehouse: an architectural integration of knowledge management, decision support, artificial intelligence and data warehousing , 2002, Decis. Support Syst..

[3]  Martin Staudt,et al.  Metadata standards for data warehousing: open information model vs. common warehouse metadata , 2000, SGMD.

[4]  Jose-Norberto Mazón,et al.  Modelling ETL Processes of Data Warehouses with UML Activity Diagrams , 2008, OTM Workshops.

[5]  Ruth Anderson Control flow : loops , 2006 .

[6]  Ralph Kimball,et al.  The Data Warehouse Toolkit: The Complete Guide to Dimensional Modeling , 1996 .

[7]  Nenad Jukic Modeling strategies and alternatives for data warehousing projects , 2006, CACM.

[8]  Jose-Norberto Mazón,et al.  A hybrid model driven development framework for the multidimensional modeling of data warehouses! , 2009, SGMD.

[9]  Shiwei Tang,et al.  Triple-driven data modeling methodology in data warehousing: a case study , 2006, DOLAP '06.

[10]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[11]  Thilini Ariyachandra,et al.  Key organizational factors in data warehouse architecture selection , 2010, Decis. Support Syst..

[12]  Keeley Crockett,et al.  Database systems: design, implementation & management - international edition , 2008 .

[13]  Shweta Taneja,et al.  COMPARATIVE STUDY OF DATA WAREHOUSE DESIGN APPROACHES : A SURVEY , 2012 .

[14]  Panos Vassiliadis,et al.  Conceptual modeling for ETL processes , 2002, DOLAP '02.

[15]  Robert Winter,et al.  Information requirements engineering for data warehouse systems , 2004, SAC '04.

[16]  Felix Naumann,et al.  Managing ETL Processes , 2008, NTII.

[17]  Matteo Golfarelli From User Requirements to Conceptual Design in Warehouse Design: A Survey , 2010 .

[18]  Irma Becerra-Fernandez,et al.  Business Intelligence: Practices, Technologies, and Management , 2013 .

[19]  M. Golfarelli From User Requirements to Conceptual Design in Data Warehouse Design , 2009 .

[20]  Dimitri Theodoratos,et al.  Data Warehouse Back-End Tools , 2009, Encyclopedia of Data Warehousing and Mining.

[21]  Paolo Giorgini,et al.  GRAnD: A goal-oriented approach to requirement analysis in data warehouses , 2008, Decis. Support Syst..

[22]  Juan Trujillo,et al.  A UML Based Approach for Modeling ETL Processes in Data Warehouses , 2003, ER.

[23]  Edmund F. Vail Causal Architecture: Bringing the Zachman Framework to Life , 2002, Inf. Syst. Manag..

[24]  Erhard Rahm,et al.  An Integrative and Uniform Model for Metadata Management in Data Warehousing Environments , 1999, DMDW.

[25]  Robert Winter,et al.  A method for demand-driven information requirements analysis in data warehousing projects , 2003, 36th Annual Hawaii International Conference on System Sciences, 2003. Proceedings of the.

[26]  Surajit Chaudhuri,et al.  An overview of data warehousing and OLAP technology , 1997, SGMD.

[27]  Atish P. Sinha,et al.  A comparison of data warehousing methodologies , 2005, CACM.

[28]  W. H. Inmon,et al.  Building the data warehouse , 1992 .