A Model of Error Propagation in Conjunctive Decisions and its Application to Database Quality Management

This study centers on the accuracy dimension of information quality and models the relationship between input accuracy and output accuracy in a popular class of applications. These applications consist of dichotomous decisions that are implemented through conjunction of selected criteria. In particular, the model provides the tools for quantifying the effect of errors in each decision input on the accuracy of a decision. Application of the model is illustrated through the example of a residential real estate database, where users' preferences are captured by conjunctive decision rules. This example demonstrates how the new model can be utilized for quantifying the aggregate damage that errors in different database attributes inflict on property selection decisions. Finally, this paper reports on an initial empirical validation of the proposed model through a series of Monte Carlo simulations. Numerical estimates of the model that have been developed through this inquiry can be useful for data and information quality assessments or policy-making purposes. Mainly, they can be integrated into cost-benefit analyses that assess alternative data accuracy enhancements or process and system designs.

[1]  R. Olshavsky,et al.  Task Complexity and Contingent Processing in Brand Choice , 1979 .

[2]  Carlo Batini,et al.  Brokering infrastructure for minimum cost data procurement based on quality-quantity models , 2008, Decis. Support Syst..

[3]  R. Marks The Satisficer's Curse , 2008 .

[4]  John W. Payne,et al.  Task complexity and contingent processing in decision making: An information search and protocol analysis☆ , 1976 .

[5]  Donald P. Ballou,et al.  Modeling Data and Process Quality in Multi-Input, Multi-Output Information Systems , 1985 .

[6]  Richard Y. Wang,et al.  Toward quality data: An attribute-based approach , 2014, Decis. Support Syst..

[7]  Eric J. Johnson,et al.  The adaptive decision maker , 1993 .

[8]  InduShobha N. Chengalur-Smith,et al.  Sample-based quality estimation of query results in relational database environments , 2006, IEEE Transactions on Knowledge and Data Engineering.

[9]  G. Shankaranarayan,et al.  Managing Data Quality in Dynamic Decision Environments: An Information Product Approach , 2003, J. Database Manag..

[10]  Varghese S. Jacob,et al.  Assessing Data Quality for Information Products: Impact of Selection, Projection, and Cartesian Product , 2004, Manag. Sci..

[11]  William O. Stratton,et al.  ACCOUNTING SYSTEMS: THE RELIABILITY APPROACH TO INTERNAL CONTROL EVALUATION , 1981 .

[12]  Irit Askira Gelman,et al.  Setting priorities for data accuracy improvements in satisficing decision-making scenarios: A guiding theory , 2010, Decis. Support Syst..

[13]  Graeme Shanks,et al.  Data Quality and Decision Making , 2008 .

[14]  Ronald L. Wasserstein,et al.  Monte Carlo: Concepts, Algorithms, and Applications , 1997 .

[15]  Hillel J. Einhorn,et al.  Expert measurement and mechanical combination , 1972 .

[16]  Irit Askira Gelman,et al.  Simulations of Error Propagation for Prioritizing Data Accuracy Improvements in Multi-Criteria Satisficing Decision Making Scenarios , 2009, ICIS.

[17]  Alex Mintz,et al.  How Do Leaders Make Decisions? , 2004, Contributions to Conflict Management, Peace Economics and Development.

[18]  G. Owen,et al.  Thirteen theorems in search of the truth , 1983 .

[19]  Donald P. Ballou,et al.  A Framework for the Analysis of Error in Conjunctive, Multi‐Criteria, Satisficing Decision Processes* , 1990 .

[20]  Nicolas de Condorcet Essai Sur L'Application de L'Analyse a la Probabilite Des Decisions Rendues a la Pluralite Des Voix , 2009 .

[21]  InduShobha N. Chengalur-Smith,et al.  The Impact of Data Quality Information on Decision Making: An Exploratory Analysis , 1999, IEEE Trans. Knowl. Data Eng..

[22]  Robert L. Winkler,et al.  Limits for the Precision and Value of Information from Dependent Sources , 1985, Oper. Res..

[23]  H. J. Einhorn The use of nonlinear, noncompensatory models in decision making. , 1970, Psychological bulletin.

[24]  John W. Payne,et al.  The adaptive decision maker: Name index , 1993 .

[25]  Alan G. Phipps,et al.  Utility Function Switching During Residential Search , 1983 .

[26]  Ludmila I. Kuncheva,et al.  Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy , 2003, Machine Learning.

[27]  Richard Y. Wang,et al.  Modeling Information Manufacturing Systems to Determine Information Product Quality Management Scien , 1998 .

[28]  Felix Naumann,et al.  Assessment Methods for Information Quality Criteria , 2000, IQ.

[29]  Amihai Motro,et al.  Not all answers are equally good: estimating the quality of database answers , 1997 .

[30]  John Neter,et al.  A Stochastic Model of the Internal Control System , 1973 .

[31]  Thomas M. Cover,et al.  The Best Two Independent Measurements Are Not the Two Best , 1974, IEEE Trans. Syst. Man Cybern..

[32]  G. A. Miller THE PSYCHOLOGICAL REVIEW THE MAGICAL NUMBER SEVEN, PLUS OR MINUS TWO: SOME LIMITS ON OUR CAPACITY FOR PROCESSING INFORMATION 1 , 1956 .

[33]  H. J. Einhorn Use of nonlinear, noncompensatory models as a function of task and amount of information , 1971 .

[34]  Varghese S. Jacob,et al.  Assessing data quality for information products , 1999, ICIS.

[35]  Richard Y. Wang,et al.  Data Quality , 2000, Advances in Database Systems.

[36]  Amir Parssian,et al.  Managerial decision support with knowledge of accuracy and completeness of the relational aggregate functions , 2006, Decis. Support Syst..

[37]  Diane M. Strong,et al.  Beyond Accuracy: What Data Quality Means to Data Consumers , 1996, J. Manag. Inf. Syst..

[38]  H. Simon,et al.  Models Of Man : Social And Rational , 1957 .