Towards Quality-oriented Data Warehouse Usage and Evolution

As a decision support information system, a data warehouse must provide high level quality of data and quality of service. In the DWQ project we have proposed an architectural framework and a repository of metadata which describes all the data warehouse components in a set of metamodels to which is added a quality metamodel, defining for each data warehouse metaobject the corresponding relevant quality dimensions and quality factors. Apart from this static definition of quality, we also provide an operational complement, that is a methodology on how to use quality factors and to achieve user quality goals. This methodology is an extension of the Goal-Question-Metric (GQM) approach, which allows to capture (a) the inter-relationships between different quality factors and (b) to organize them in order to fulfil specific quality goals. After summarizing the DWQ quality model, this paper describes the methodology we propose to use this quality model, as well as its impact on the data warehouse evolution.

[1]  Matthias Jarke,et al.  Architecture and Quality in Data Warehouses: An Extended Repository Approach , 1999, Information Systems.

[2]  Stuart E. Madnick,et al.  Data quality requirements analysis and modeling , 2011, Proceedings of IEEE 9th International Conference on Data Engineering.

[3]  Matthias Jarke,et al.  Design and Analysis of Quality Information for Data Warehouses , 1998, ER.

[4]  Matthias Jarke,et al.  A Model for Data Warehouse Operational Processes , 2000, CAiSE.

[5]  Richard Y. Wang,et al.  A product perspective on total data quality management , 1998, CACM.

[6]  Giri Kumar Tayi,et al.  Examining data quality , 1998, CACM.

[7]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[8]  Timos K. Sellis,et al.  Designing Data Warehouses , 1999, Data Knowl. Eng..

[9]  Ken Orr,et al.  Data quality and systems theory , 1998, CACM.

[10]  Matthias Jarke,et al.  Dwq : Esprit Long Term Research Project, No 22469 Data Warehouse Quality: a Review of the Dwq Project , 2022 .

[11]  Diego Calvanese,et al.  Source integration in data warehousing , 1998, Proceedings Ninth International Workshop on Database and Expert Systems Applications (Cat. No.98EX130).

[12]  Matthias Jarke,et al.  A software process data model for knowledge engineering in information systems , 1990, Inf. Syst..

[13]  Veda C. Storey,et al.  A Framework for Analysis of Data Quality Research , 1995, IEEE Trans. Knowl. Data Eng..

[14]  Mokrane Bouzeghoub,et al.  Modeling the Data Warehouse Refreshment Process as a Workflow Application , 1999, DMDW.

[15]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[16]  Philip A. Bernstein,et al.  Microsoft Repository Version 2 and the Open Information Model , 1999, Inf. Syst..

[17]  Victor R. Basili,et al.  Representing Software Engineering Models: The TAME Goal Oriented Approach , 1992, IEEE Trans. Software Eng..

[18]  Manfred A. Jeusfeld,et al.  View maintenance and change notification for application program views , 1998, SAC '98.

[19]  Matthias Jarke,et al.  Architecture and Quality in Data Warehouses , 1998, CAiSE.

[20]  Dimitri Theodoratos,et al.  Data Currency Quality Factors in Data Warehouse Design , 1999, DMDW.

[21]  H. D. Rombach,et al.  The Goal Question Metric Approach , 1994 .

[22]  Timos K. Sellis,et al.  Designing the Global Data Warehouse with SPJ Views , 1999, CAiSE.

[23]  Timos K. Sellis,et al.  Data Warehouse Configuration , 1997, VLDB.

[24]  Christoph Quix,et al.  Repository Support for Data Warehouse Evolution , 1999, DMDW.