Probabilistic object bases

Although there are many applications where an object-oriented data model is a good way of representing and querying data, current object database systems are unable to handle objects whose attributes are uncertain. In this article, we extend previous work by Kornatzky and Shimony to develop an algebra to handle object bases with uncertainty. We propose concepts of consistency for such object bases, together with an NP-completeness result, and classes of probabilistic object bases for which consistency is polynomially checkable. In addition, as certain operations involve conjunctions and disjunctions of events, and as the probability of conjunctive and disjunctive events depends both on the probabilities of the primitive events involved as well as on what is known (if anything) about the relationship between the events, we show how all our algebraic operations may be performed under arbitrary probabilistic conjunction and disjunction strategies. We also develop a host of equivalence results in our algebra, which may be used as rewrite rules for query optimization. Last but not least, we have developed a prototype probabilistic object base server on top of ObjectStore. We describe experiments to assess the efficiency of different possible rewrite rules.

[1]  Joseph Y. Halpern An Analysis of First-Order Logics of Probability , 1989, IJCAI.

[2]  Laks V. S. Lakshmanan,et al.  ProbView: a flexible probabilistic database system , 1997, TODS.

[3]  Stanley B. Zdonik,et al.  A query algebra for object-oriented databases , 1990, [1990] Proceedings. Sixth International Conference on Data Engineering.

[4]  Norbert Fuhr,et al.  A probabilistic relational algebra for the integration of information retrieval and database systems , 1997, TOIS.

[5]  Hector Garcia-Molina,et al.  The Management of Probabilistic Data , 1992, IEEE Trans. Knowl. Data Eng..

[6]  V. S. Subrahmanian,et al.  Hybrid Probabilistic Programs , 2000, J. Log. Program..

[7]  Solomon Eyal Shimony,et al.  A Probabilistic Object-Oriented Data Model , 1994, Data Knowl. Eng..

[8]  Fereidoon Sadri Modeling Uncertainty in Object-Oriented Databases , 1993, SOFTEKS Workshop on Incompleteness and Uncertainty in Information Systems.

[9]  Michael Pittarelli,et al.  An Algebra for Probabilistic Databases , 1994, IEEE Trans. Knowl. Data Eng..

[10]  Won Kim,et al.  Introduction to Object-Oriented Databases , 1991, Computer systems.

[11]  Laks V. S. Lakshmanan,et al.  A Parametric Approach to Deductive Databases with Uncertainty , 2001, IEEE Trans. Knowl. Data Eng..

[12]  George Boole,et al.  The Laws of Thought , 2003 .

[13]  Laks V. S. Lakshmanan,et al.  Modeling Uncertainty in Deductive Databases , 1994, DEXA.

[14]  Kathryn B. Laskey,et al.  Network Fragments: Representing Knowledge for Constructing Probabilistic Models , 1997, UAI.

[15]  François Bancilhon,et al.  Building an Object-Oriented Database System, The Story of O2 , 1992 .

[16]  Avi Pfeffer,et al.  Object-Oriented Bayesian Networks , 1997, UAI.

[17]  Ronald Fagin,et al.  A logic for reasoning about probabilities , 1988, [1988] Proceedings. Third Annual Information Symposium on Logic in Computer Science.

[18]  Sumit Sarkar,et al.  PSQL: A Query Language for Probabilistic Relational Data , 1998, Data Knowl. Eng..

[19]  Norbert Fuhr,et al.  A probabilistic NF2 relational algebra for integrated information retrieval and database systems , 1996 .

[20]  Sumit Sarkar,et al.  A probabilistic relational model and algebra , 1996, TODS.

[21]  Curtis E. Dyreson,et al.  Supporting valid-time indeterminacy , 1998, TODS.

[22]  Laks V. S. Lakshmanan,et al.  Probabilistic Deductive Databases , 1994, ILPS.

[23]  Elisa Bertino,et al.  Object-Oriented Database Systems: Concepts and Architectures , 1993 .

[24]  Peter Haddawy,et al.  Generating Bayesian Networks from Probablity Logic Knowledge Bases , 1994, UAI.

[25]  Richard S. Varga,et al.  Proof of Theorem 5 , 1983 .

[26]  Goetz Graefe The Cascades Framework for Query Optimization , 1995, IEEE Data Eng. Bull..

[27]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[28]  Henry E. Kyburg,et al.  Set-based Bayesianism , 1996, IEEE Trans. Syst. Man Cybern. Part A.

[29]  P. Venkat Rangan,et al.  Handbook of Multimedia Information Management , 1997 .

[30]  Michael Pittarelli,et al.  The Theory of Probabilistic Databases , 1987, VLDB.

[31]  Werner Kießling,et al.  New direction for uncertainty reasoning in deductive databases , 1991, SIGMOD '91.

[32]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[33]  Thomas Lukasiewicz,et al.  Extension of the Relational Algebra to Probabilistic Complex Values , 2000, FoIKS.

[34]  David J. DeWitt,et al.  The Object-Oriented Database System Manifesto , 1994, Building an Object-Oriented Database System, The Story of O2.

[35]  Werner Kießling,et al.  Database Support for Problematic Knowledge , 1992, EDBT.

[36]  Michael Kifer,et al.  On the Semantics of Rule-Based Expert Systems with Uncertainty , 1988, ICDT.