Not all answers are equally good: estimating the quality of database answers

With more and more electronic information sources becoming widely available, the issue of the quality of these often-competing sources has become germane. We propose a standard for rating information products with respect to their quality, and we show how to estimate the quality of answers issued by databases from the quality specifications that have been assigned to these databases. The annotation of answers with their quality provides valuable information to users and is an important new kind of cooperative behavior in databases. We report on preliminary simulations that were carried out to test the validity of our methods.
