A Certainty-Based Model for Uncertain Databases

This paper considers relational databases containing uncertain attribute values, in the setting where some knowledge is available about the more or less certain value (or disjunction of values) that a given attribute in a tuple may take. We propose a possibility-theory-based model suited to this context and extend the operators of relational algebra to handle such relations in a compact, and thus efficient, way. We show that the model is a representation system for the whole relational algebra. An important result is that the data complexity of the extended operators in this context is the same as in the classical database case, which makes the approach highly scalable.
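To make the idea of a certainty-qualified attribute value more concrete, here is a minimal sketch. It is an illustration under assumptions, not the paper's definitions: the names `Uncertain`, `Row`, and `select` are hypothetical, each uncertain attribute is assumed to be stored as a set of candidate values (a disjunction) with a certainty degree in [0, 1], and selection is assumed to keep a tuple only when every candidate satisfies the predicate, with the result certainty taken as the minimum of the tuple and attribute degrees.

```python
from dataclasses import dataclass
from typing import Callable, Dict, FrozenSet, List


@dataclass(frozen=True)
class Uncertain:
    """Hypothetical encoding: a disjunction of candidate values with a certainty degree."""
    candidates: FrozenSet[str]
    certainty: float = 1.0  # assumed to lie in [0, 1]


@dataclass(frozen=True)
class Row:
    """Hypothetical tuple: attribute name -> uncertain value, plus a tuple-level certainty."""
    values: Dict[str, Uncertain]
    certainty: float = 1.0


def select(rows: List[Row], attr: str, pred: Callable[[str], bool]) -> List[Row]:
    """Assumed selection rule: keep a row only if all candidates satisfy pred.

    The row is visited once and no possible world is enumerated, so the cost
    stays linear in the number of rows, as for a classical selection.
    """
    result = []
    for row in rows:
        value = row.values[attr]
        if all(pred(v) for v in value.candidates):
            # Combine degrees with min, the usual conjunction in possibility theory.
            result.append(Row(row.values, min(row.certainty, value.certainty)))
    return result


# Illustrative data only (not from the paper).
FRENCH_CITIES = {"Paris", "Lyon", "Nice"}
rows = [
    Row({"city": Uncertain(frozenset({"Paris", "Lyon"}), 0.8)}),
    Row({"city": Uncertain(frozenset({"Paris", "Rome"}), 0.9)}),
    Row({"city": Uncertain(frozenset({"Nice"}), 1.0)}, certainty=0.6),
]
in_france = select(rows, "city", lambda c: c in FRENCH_CITIES)
certainties = [r.certainty for r in in_france]  # [0.8, 0.6]
```

In this sketch the second row is dropped because one of its candidates ("Rome") fails the predicate, so under the stated assumption nothing can be asserted with positive certainty about that row satisfying the condition.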
