Dual Assessment of Data Quality in Customer Databases

Quantitative assessment of data quality is critical for identifying the presence of data defects and the extent of the damage due to these defects. Quantitative assessment can help define realistic quality improvement targets, track progress, evaluate the impacts of different solutions, and prioritize improvement efforts accordingly. This study describes a methodology for quantitatively assessing both impartial and contextual data quality in large datasets. Impartial assessment measures the extent to which a dataset is defective, independent of the context in which that dataset is used. Contextual assessment, as defined in this study, measures the extent to which the presence of defects reduces a dataset’s utility, the benefits gained by using that dataset in a specific context. The dual assessment methodology is demonstrated in the context of Customer Relationship Management (CRM), using large data samples from real-world datasets. The results from comparing the two assessments offer important insights for directing quality maintenance efforts and prioritizing quality improvement solutions for this dataset. The study describes the steps and the computation involved in the dual-assessment methodology and discusses the implications for applying the methodology in other business contexts and data environments.

[1]  Yu Cai,et al.  Supporting data quality management in decision-making , 2006, Decis. Support Syst..

[2]  Richard C. Morey,et al.  Estimating and improving the quality of information in a MIS , 1982, CACM.

[3]  Ephraim R. McLean,et al.  Information Systems Success: The Quest for the Dependent Variable , 1992, Inf. Syst. Res..

[4]  T. Davenport Competing on analytics. , 2006, Harvard business review.

[5]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[6]  Niv Ahituv,et al.  A Systematic Approach Toward Assessing the Value of an Information System , 1980, MIS Q..

[7]  Richard Y. Wang,et al.  Data Quality Assessment , 2002 .

[8]  Donald P. Ballou,et al.  Modeling Completeness versus Consistency Tradeoffs in Information Decision Contexts , 2003, IEEE Trans. Knowl. Data Eng..

[9]  Donald R. Lehmann,et al.  From Customer Lifetime Value to Shareholder Value , 2006 .

[10]  Adir Even,et al.  Utility-driven assessment of data quality , 2007, DATB.

[11]  Adir Even,et al.  Economics-Driven Data Management: An Application to the Design of Tabular Data Sets , 2007, IEEE Transactions on Knowledge and Data Engineering.

[12]  Thomas Redman,et al.  Data quality for the information age , 1996 .

[13]  Marcus Kaiser,et al.  How to Measure Data Quality? - A Metric-Based Approach , 2007, ICIS.

[14]  Richard Y. Wang,et al.  Modeling Information Manufacturing Systems to Determine Information Product Quality Management Scien , 1998 .

[15]  Stuart E. Madnick,et al.  The Design and Implementation of a Corporate Householding Knowledge Processor to Improve Data Quality , 2003, J. Manag. Inf. Syst..

[16]  Donald P. Ballou,et al.  Dynamically determined optimal inspection strategies for serial production processes , 1992 .

[17]  Richard J Courtheoux,et al.  Marketing data analysis and data quality management , 2003 .

[18]  Donald P. Ballou,et al.  Modeling Data and Process Quality in Multi-Input, Multi-Output Information Systems , 1985 .

[19]  Adir Even,et al.  Managing Metadata in Data Warehouses: Pitfalls and Possibilities , 2004, Commun. Assoc. Inf. Syst..

[20]  Barbara Wixom,et al.  An Empirical Investigation of the Factors Affecting Data Warehousing Success , 2001, MIS Q..

[21]  InduShobha N. Chengalur-Smith,et al.  The Impact of Data Quality Information on Decision Making: An Exploratory Analysis , 1999, IEEE Trans. Knowl. Data Eng..

[22]  G. Shankaranarayan,et al.  Managing Data Quality in Dynamic Decision Environments: An Information Product Approach , 2003, J. Database Manag..

[23]  Veda C. Storey,et al.  A Framework for Analysis of Data Quality Research , 1995, IEEE Trans. Knowl. Data Eng..

[24]  Donald P. Ballou,et al.  Designing Information Systems to Optimize the Accuracy-Timeliness Tradeoff , 1995, Inf. Syst. Res..

[25]  M. Jarke,et al.  Fundamentals of Data Warehouses , 2003, Springer Berlin Heidelberg.

[26]  Adir Even,et al.  THE ROLE OF PROCESS METADATA AND DATA QUALITY PERCEPTIONS IN DECISION MAKING: AN EMPIRICAL FRAMEWORK AND INVESTIGATION , 2006 .

[27]  Richard Y. Wang,et al.  A product perspective on total data quality management , 1998, CACM.

[28]  Diane M. Strong,et al.  Process-Embedded Data Integrity , 2004, J. Database Manag..

[29]  Ganesan Shankaranarayanan,et al.  Supporting data quality management in decision-making , 2006 .

[30]  Gordon B. Davis,et al.  Can Humans Detect Errors in Data? Impact of Base Rates, Incentives, and Goals , 1997, MIS Q..

[31]  Omar E. M. Khalil,et al.  Relationship Marketing and Data Quality Management , 1999 .

[32]  Thomas F. Gattiker,et al.  Understanding the local-level costs and benefits of ERP through organizational information processing theory , 2004, Inf. Manag..

[33]  Ralph Kimball,et al.  The Data Warehouse Lifecycle Toolkit , 2009 .

[34]  InduShobha N. Chengalur-Smith,et al.  The Impact of Experience and Time on the Use of Data Quality Information in Decision Making , 2003, Inf. Syst. Res..

[35]  Dale A. Stirling,et al.  Information rules , 2003, SGMD.

[36]  A. Herrmann,et al.  Market-driven product and service design: Bridging the gap between customer needs, quality management, and customer satisfaction , 2000 .

[37]  Robert C. Blattberg,et al.  Manage marketing by the customer equity test. , 1996, Harvard business review.

[38]  Paul D. Berger,et al.  Direct Marketing Management , 1989 .

[39]  P BallouDonald,et al.  The Impact of Data Quality Information on Decision Making , 1999 .

[40]  D. W.,et al.  CUSTOMER LIFETIME VALUE: MARKETING MODELS AND APPLICATIONS , 1998 .

[41]  R. Blattberg,et al.  Database marketing , 1997 .

[42]  Giri Kumar Tayi,et al.  An integrated production-inventory model with reprocessing and inspection , 1988 .

[43]  Lawrence A. West,et al.  Private Markets for Public Goods: Pricing Strategies of Online Database Vendors , 2000, J. Manag. Inf. Syst..