Beyond Accuracy: What Data Quality Means to Data Consumers

Poor data quality (DQ) can have substantial social and economic impacts. Although firms are improving data quality with practical approaches and tools, their improvement efforts tend to focus narrowly on accuracy. We believe that data consumers have a much broader data quality conceptualization than IS professionals realize. The purpose of this paper is to develop a framework that captures the aspects of data quality that are important to data consumers.A two-stage survey and a two-phase sorting study were conducted to develop a hierarchical framework for organizing data quality dimensions. This framework captures dimensions of data quality that are important to data consumers. Intrinsic DQ denotes that data have quality in their own right. Contextual DQ highlights the requirement that data quality must be considered within the context of the task at hand. Representational DQ and accessibility DQ emphasize the importance of the role of systems. These findings are consistent with our understanding that high-quality data should be intrinsically good, contextually appropriate for the task, clearly represented, and accessible to the data consumer.Our framework has been used effectively in industry and government. Using this framework, IS managers were able to better understand and meet their data consumers' data quality needs. The salient feature of this research study is that quality attributes of data are collected from data consumers instead of being defined theoretically or based on researchers' experience. Although exploratory, this research provides a basis for future studies that measure data quality along the dimensions of this framework.

[1]  Thomas C. Redman,et al.  Data Quality Management and Technology , 1992 .

[2]  Joseph Moses Juran Juran on leadership for quality : an executive handbook , 1989 .

[3]  F. R. Keller,et al.  Data Quality , 1990, Taxation and Labour Supply.

[4]  Blake Ives,et al.  The measurement of user information satisfaction , 1983, CACM.

[5]  N. Melone A theoretical assessment of the user-satisfaction construct in information systems research , 1990 .

[6]  John Neter,et al.  A Stochastic Model of the Internal Control System , 1973 .

[7]  Robert J. Schneider Book Review: Consumer Protection , 1939 .

[8]  Gerald L. Thompson,et al.  A Mathematical Approach to the Analysis and Design of Internal Control Systems , 1973 .

[9]  Mary J. Culnan,et al.  The dimensions of accessibility to online information: implications for implementing office information systems , 1984, TOIS.

[10]  Stuart E. Madnick,et al.  A Polygen Model for Heterogeneous Database Systems: The Source Tagging Perspective , 1990, VLDB.

[11]  Donald P. Ballou,et al.  Modeling Data and Process Quality in Multi-Input, Multi-Output Information Systems , 1985 .

[12]  Kenneth C. Laudon,et al.  Data quality and due process in large interorganizational record systems , 1986, CACM.

[13]  Sammy W. Pearson,et al.  Development of a Tool for Measuring and Analyzing Computer User Satisfaction , 1983 .

[14]  Giri Kumar Tayi,et al.  Methodology for allocating resources for data quality enhancement , 1989, Commun. ACM.

[15]  Douglas N. Ross,et al.  Quality or Else: The Revolution in World Business , 1992 .

[16]  Richard Y. Wang,et al.  Toward quality data: An attribute-based approach , 2014, Decis. Support Syst..

[17]  Erfan A. Shafey Organizational Planning and Control Systems: Theory and Technology , 1969 .

[18]  Diane M. Strong,et al.  Exceptions and exception handling in computerized information processes , 1995, TOIS.

[19]  Diane M. Strong,et al.  Decision support for exception handling and quality control in office operations , 1992, Decis. Support Syst..

[20]  Stuart E. Madnick,et al.  Integrating information from global systems: Dealing with the "on- and off-ramps"; of the information superhighway , 1995, J. Organ. Comput..

[21]  J CulnanMary The dimensions of accessibility to online information: implications for implementing office information systems , 1984 .

[22]  R. Deshpandé,et al.  The Organizational Context of Market Research Use , 1982 .

[23]  Izak Benbasat,et al.  Development of an Instrument to Measure the Perceptions of Adopting an Information Technology Innovation , 1991, Inf. Syst. Res..

[24]  Diane M. Strong,et al.  Data quality in context , 1997, CACM.

[25]  Veda C. Storey,et al.  A Framework for Analysis of Data Quality Research , 1995, IEEE Trans. Knowl. Data Eng..

[26]  Donald P. Ballou,et al.  Designing Information Systems to Optimize the Accuracy-Timeliness Tradeoff , 1995, Inf. Syst. Res..

[27]  Richard Y. Wang,et al.  Anchoring data quality dimensions in ontological foundations , 1996, CACM.

[28]  Stuart E. Madnick,et al.  Data quality requirements analysis and modeling , 2011, Proceedings of IEEE 9th International Conference on Data Engineering.

[29]  Israel Spiegler,et al.  Information as inventory: A new conceptual view , 1991, Inf. Manag..

[30]  J. Hauser,et al.  The House of Quality , 1988 .

[31]  Richard C. Morey,et al.  Estimating and improving the quality of information in a MIS , 1982, CACM.

[32]  Arnon Rosenthal,et al.  Using semantic values to facilitate interoperability among heterogeneous information systems , 1994, TODS.

[33]  Gilbert A. Churchill,et al.  Marketing Research: Methodological Foundations , 1976 .

[34]  J. E. Jackson,et al.  Factor analysis, an applied approach , 1983 .

[35]  Charles H. Kriebel,et al.  Evaluating the Quality of Information Systems , 1979 .

[36]  J. Stevens Applied Multivariate Statistics for the Social Sciences , 1986 .

[37]  W. Edwards Deming,et al.  Out of the Crisis , 1982 .

[38]  Stephen E. Arnold,et al.  Information manufacturing: the road to database quality , 1992 .

[39]  Ephraim R. McLean,et al.  Information Systems Success: The Quest for the Dependent Variable , 1992, Inf. Syst. Res..

[40]  Abbie Griffin,et al.  The Voice of the Customer , 1993 .