A general ranking strategy for data accuracy management

A series of recent studies proposed a construct named damage and a set of models for estimating the damage in a chosen class of information systems. The perception that underlies the proposed construct is that, all other things being equal, it would be beneficial to assign priority to the elimination of data errors that have a stronger negative effect on output accuracy (i.e., output accuracy is lower) over data errors that have a weaker effect. In this paper we extend the work on damage by considering its use with information systems in general, rather than a specific class of information systems. Mainly, we propose a general strategy for ranking the inputs according to the damage that errors in each input inflict on the output of the system. A major advantage of this strategy is that it focuses the ranking effort on a subcomponent of the information system that can be substantially smaller and simpler than the information system as a whole.

[1]  Carlo Batini,et al.  Brokering infrastructure for minimum cost data procurement based on quality-quantity models , 2008, Decis. Support Syst..

[2]  S. Chatterjee,et al.  Design Science Research in Information Systems , 2010 .

[3]  Amir Parssian,et al.  Managerial decision support with knowledge of accuracy and completeness of the relational aggregate functions , 2006, Decis. Support Syst..

[4]  Felix Naumann,et al.  Quality-driven Integration of Heterogenous Information Systems , 1999, VLDB.

[5]  Diane M. Strong,et al.  AIMQ: a methodology for information quality assessment , 2002, Inf. Manag..

[6]  Joseph Moses Juran Juran on planning for quality , 1988 .

[7]  Salvatore T. March,et al.  Design and natural science research on information technology , 1995, Decis. Support Syst..

[8]  Amihai Motro,et al.  Not all answers are equally good: estimating the quality of database answers , 1997 .

[9]  Donald P. Ballou,et al.  Modeling Data and Process Quality in Multi-Input, Multi-Output Information Systems , 1985 .

[10]  Irit Askira Gelman,et al.  The Asymmetric Nature of Decision Errors in Multi-Criteria, Satisficing Decisions , 2009, AMCIS.

[11]  Richard Y. Wang,et al.  Toward quality data: An attribute-based approach , 2014, Decis. Support Syst..

[12]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[13]  Irit Askira Gelman,et al.  Simulations of the Relationship Between an Information System's Input Accuracy and Its Output Accuracy , 2004, ICIQ.

[14]  Ma Janson,et al.  Data quality: The Achilles heel of end-user computing , 1988 .

[15]  S. Zeldin,et al.  Higher Order Software—A Methodology for Defining Software , 1976, IEEE Transactions on Software Engineering.

[16]  Donald P. Ballou,et al.  A Framework for the Analysis of Error in Conjunctive, Multi‐Criteria, Satisficing Decision Processes* , 1990 .

[17]  Giri Kumar Tayi,et al.  Methodology for allocating resources for data quality enhancement , 1989, Commun. ACM.

[18]  Ron Weber,et al.  An Ontological Model of an Information System , 1990, IEEE Trans. Software Eng..

[19]  Irit Askira Gelman GIGO or not GIGO: Error Propagation in Basic Information Processing Operations , 2007, AMCIS.

[20]  G. Shankaranarayan,et al.  Managing Data Quality in Dynamic Decision Environments: An Information Product Approach , 2003, J. Database Manag..

[21]  Irit Askira Gelman,et al.  A Model of Error Propagation in Conjunctive Decisions and its Application to Database Quality Management , 2012, J. Database Manag..

[22]  Craig W. Fisher,et al.  An Accuracy Metric: Percentages, Randomness, and Probabilities , 2009, JDIQ.

[23]  Irit Askira Gelman,et al.  Setting priorities for data accuracy improvements in satisficing decision-making scenarios: A guiding theory , 2010, Decis. Support Syst..

[24]  Richard Y. Wang,et al.  Modeling Information Manufacturing Systems to Determine Information Product Quality Management Scien , 1998 .

[25]  Varghese S. Jacob,et al.  Assessing Data Quality for Information Products: Impact of Selection, Projection, and Cartesian Product , 2004, Manag. Sci..

[26]  Irit Askira Gelman,et al.  Simulations of Error Propagation for Prioritizing Data Accuracy Improvements in Multi-Criteria Satisficing Decision Making Scenarios , 2009, ICIS.

[27]  Adir Even,et al.  Utility-driven assessment of data quality , 2007, DATB.