Proactive data quality management for data warehouse systems

Data warehousing has long captured the attention of practitioners and researchers, and data quality is one of its crucial issues. Still, ensuring a high level of data quality remains one of the most expensive and time-consuming tasks in data warehousing projects, and many such projects are discontinued because of insufficient data quality. This article describes an approach for managing data quality in data warehouse systems through a metadata-based data quality system. The results are integrated into a comprehensive management approach and are based on practical experience at a Swiss bank.
