Utility-driven assessment of data quality

Data consumers assess quality within specific business contexts or decision tasks. The same data resource may have an acceptable level of quality for some contexts but this quality may be unacceptable for other contexts. However, existing data quality metrics are mostly derived impartially, disconnected from the specific contextual characteristics. This study argues for the need to revise data quality metrics and measurement techniques to incorporate and better reflect contextual assessment. It contributes to that end by developing new metrics for assessing data quality along commonly used dimensions - completeness, validity, accuracy, and currency. The metrics are driven by data utility, a conceptual measure of the business value that is associated with the data within a specific usage context. The suggested data quality measurement framework uses utility as a scaling factor for calculating quality measurements at different levels of data hierarchy. Examples are used to demonstrate the use of utility-driven assessment in real-world data management scenarios and the broader implications for data management are discussed

[1]  Ephraim R. McLean,et al.  Information Systems Success: The Quest for the Dependent Variable , 1992, Inf. Syst. Res..

[2]  Niv Ahituv,et al.  A Systematic Approach Toward Assessing the Value of an Information System , 1980, MIS Q..

[3]  Donald P. Ballou,et al.  Modeling Completeness versus Consistency Tradeoffs in Information Decision Contexts , 2003, IEEE Trans. Knowl. Data Eng..

[4]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[5]  Robert J. Kauffman,et al.  Discovering Potential and Realizing Value from Information Technology Investments , 2000, J. Manag. Inf. Syst..

[6]  Thomas Redman,et al.  Data quality for the information age , 1996 .

[7]  Niv Ahituv,et al.  Assessing the value of information , 1989, ICIS '89.

[8]  Richard Y. Wang,et al.  Modeling Information Manufacturing Systems to Determine Information Product Quality Management Scien , 1998 .

[9]  InduShobha N. Chengalur-Smith,et al.  The Impact of Experience and Time on the Use of Data Quality Information in Decision Making , 2003, Inf. Syst. Res..

[10]  John D. C. Little,et al.  Models and Managers: The Concept of a Decision Calculus , 2016, Manag. Sci..

[11]  R. Blattberg,et al.  Database marketing , 1997 .

[12]  Dale A. Stirling,et al.  Information rules , 2003, SGMD.

[13]  W. J. Stevenson,et al.  PRODUCTION OPERATIONS MANAGEMENT , 1992 .

[14]  Diane M. Strong,et al.  Knowing-Why About Data Processes and Data Quality , 2004 .

[15]  Diane M. Strong,et al.  Data quality in context , 1997, CACM.

[16]  G. Shankaranarayan,et al.  Managing Data Quality in Dynamic Decision Environments: An Information Product Approach , 2003, J. Database Manag..

[17]  Veda C. Storey,et al.  A Framework for Analysis of Data Quality Research , 1995, IEEE Trans. Knowl. Data Eng..

[18]  Donald P. Ballou,et al.  Designing Information Systems to Optimize the Accuracy-Timeliness Tradeoff , 1995, Inf. Syst. Res..

[19]  R. C. Hanna,et al.  Optimizing time limits in retail promotions: an email application , 2005, J. Oper. Res. Soc..

[20]  Adir Even,et al.  Managing Metadata in Data Warehouses: Pitfalls and Possibilities , 2004, Commun. Assoc. Inf. Syst..

[21]  Salvatore T. March,et al.  Information Systems Research , 2004, IFIP International Federation for Information Processing.

[22]  Richard Y. Wang,et al.  Data quality assessment , 2002, CACM.

[23]  Barbara Wixom,et al.  An Empirical Investigation of the Factors Affecting Data Warehousing Success , 2001, MIS Q..

[24]  InduShobha N. Chengalur-Smith,et al.  The Impact of Data Quality Information on Decision Making: An Exploratory Analysis , 1999, IEEE Trans. Knowl. Data Eng..

[25]  Richard C. Morey,et al.  Estimating and improving the quality of information in a MIS , 1982, CACM.

[26]  Stuart E. Madnick,et al.  A Polygen Model for Developing Heterogeneous Database Systems With Source Tagging Capabilities , 1990 .

[27]  Yu Cai,et al.  Supporting data quality management in decision-making , 2006, Decis. Support Syst..

[28]  Roger W. Schmenner Production/Operations Management: From the Inside Out , 1993 .

[29]  Richard Y. Wang,et al.  Toward quality data: An attribute-based approach , 2014, Decis. Support Syst..

[30]  Ronald W. Hilton The Determinants of Information Value: Synthesizing Some General Results , 1981 .

[31]  Richard Y. Wang A Product Perspective On , 1998 .

[32]  Diane M. Strong,et al.  AIMQ: a methodology for information quality assessment , 2002, Inf. Manag..

[33]  Adir Even,et al.  THE ROLE OF PROCESS METADATA AND DATA QUALITY PERCEPTIONS IN DECISION MAKING: AN EMPIRICAL FRAMEWORK AND INVESTIGATION , 2006 .

[34]  Richard Y. Wang,et al.  A product perspective on total data quality management , 1998, CACM.

[35]  Stephanie Watts,et al.  A Relevant, Believable Approach for Data Quality Assessment , 2003, ICIQ.

[36]  Diane M. Strong,et al.  Process-Embedded Data Integrity , 2004, J. Database Manag..

[37]  D. W.,et al.  CUSTOMER LIFETIME VALUE: MARKETING MODELS AND APPLICATIONS , 1998 .