A novel data quality metric for timeliness considering supplemental data

It is intensively discussed in both science and practice how data quality (DQ) can be assured and improved. The growing relevance of DQ has revealed the need for adequate metrics because quantifying DQ is essential for planning quality measures in an economic manner. This paper analyses how DQ can be quantified with respect to the DQ dimension timeliness. Based on an existing approach, we design a new metric to quantify timeliness in a well-founded manner that considers so-called supplemental data (supplemental data are additional data attributes that allow drawing conclusions about the timeliness of the data attribute considered). In addition, it is possible to take the values of the metric into account when calculating expected values, an advantage that in turn leads to improved and comprehensible decision support. We evaluate the presented metric briefly with regard to requirements for designing DQ metrics from literature. Then, we illustrate the metric’s applicability as well as its practical benefit. In cooperation with a financial services provider, the metric was applied in the field of customer valuation in order to support the measurement of customer lifetime values.

[1]  Markus Helfert,et al.  Modelling information manufacturing systems , 2007, Int. J. Inf. Qual..

[2]  Latif Al-Hakim,et al.  Information quality factors affecting innovation process , 2007, Int. J. Inf. Qual..

[3]  Eric R. Ziegel Managing Quality Dynamics , 1991 .

[4]  Giri Kumar Tayi,et al.  Enhancing data quality in data warehouse environments , 1999, CACM.

[5]  Graeme G. Shanks,et al.  A semiotic information quality framework: development and comparative analysis , 2005, J. Inf. Technol..

[6]  Marcus Kaiser,et al.  A Procedure to Develop Metrics for Currency and its Application in CRM , 2009, JDIQ.

[7]  Marcus Kaiser,et al.  How to Measure Data Quality? - A Metric-Based Approach , 2007, ICIS.

[8]  P. Pfeifer,et al.  Modeling customer relationships as Markov chains , 2000 .

[9]  Veda C. Storey,et al.  A Framework for Analysis of Data Quality Research , 1995, IEEE Trans. Knowl. Data Eng..

[10]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[11]  Adir Even,et al.  Utility-driven assessment of data quality , 2007, DATB.

[12]  Marcus Kaiser,et al.  Does the EU Insurance Mediation Directive Help to Improve Data Quality? A Metric- Based Analysis , 2008, ECIS.

[13]  Barbara D. Klein,et al.  A comparison of information technology professionals' and data consumers' perceptions of the importance of the dimensions of information quality , 2007, Int. J. Inf. Qual..

[14]  Donald P. Ballou,et al.  Modeling Data and Process Quality in Multi-Input, Multi-Output Information Systems , 1985 .

[15]  Sumit Sarkar,et al.  A Framework for Reconciling Attribute Values from Multiple Data Sources , 2007, Manag. Sci..

[16]  Markus Helfert,et al.  Analyzing Data Quality Investments in CRM: a Model-Based Approach , 2003, ICIQ.

[17]  Edward G. Schilling,et al.  Juran's Quality Handbook , 1998 .

[18]  Richard Y. Wang,et al.  Data quality assessment , 2002, CACM.

[19]  Marcus Kaiser,et al.  DQ Metrics: A Novel Approach To Quantify Timeliness And Its Application In CRM , 2007, ICIQ.

[20]  InduShobha N. Chengalur-Smith,et al.  The Impact of Experience and Time on the Use of Data Quality Information in Decision Making , 2003, Inf. Syst. Res..

[21]  Thomas Redman,et al.  Data quality for the information age , 1996 .

[22]  Richard Y. Wang,et al.  Modeling Information Manufacturing Systems to Determine Information Product Quality Management Scien , 1998 .

[23]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[24]  Carlo Batini,et al.  Data Quality: Concepts, Methodologies and Techniques , 2006, Data-Centric Systems and Applications.

[25]  Richard Y. Wang,et al.  Data Quality Assessment , 2002 .

[26]  Martin Bichler,et al.  Design science in information systems research , 2006, Wirtschaftsinf..

[27]  Varghese S. Jacob,et al.  Assessing data quality for information products , 1999, ICIS.

[28]  Richard Y. Wang,et al.  Data Quality , 2000, Advances in Database Systems.

[29]  Ying Wang,et al.  An empirical research on non-technical factors related to statistical data quality in China's enterprises , 2007, Int. J. Inf. Qual..

[30]  Holger Hinrichs Datenqualitätsmanagement in Data-warehouse-Systemen , 2002 .

[31]  D. W.,et al.  CUSTOMER LIFETIME VALUE: MARKETING MODELS AND APPLICATIONS , 1998 .

[32]  Varghese S. Jacob,et al.  Assessing Data Quality for Information Products: Impact of Selection, Projection, and Cartesian Product , 2004, Manag. Sci..

[33]  Richard Y. Wang,et al.  A product perspective on total data quality management , 1998, CACM.

[34]  Diane M. Strong,et al.  AIMQ: a methodology for information quality assessment , 2002, Inf. Manag..