Probabilistic Object Bases Probabilistic Object Bases Revised Version, April 2001

Though there are many applications where an object oriented data model is a good way of representing and querying data, current object database sy t ms are unable to handle objects whose attributes are uncertain. In this paper, we extend pre vious pioneering work by Kornatzky and Shimony to develop an algebra to handle object bases with unc ertainty. We propose concepts of consistency for such object bases, together with an NP-complet eness result, and classes of probabilistic object bases for which consistency is polynomially checkab le. In addition, as certain operations involve conjunctions and disjunctions of events, and as the probability of conjunctive and disjunctive events depends both on the probabilities of the primiti ve events involved as well as on what is known (if anything) about the relationship between the ev nts, we show how all our algebraic operations may be performed under arbitrary probabilistic conjunction and disjunction strategies. We also develop a host of equivalence results in our algebra, which may be used as rewrite rules for query optimization. Last but not least, we have develope d a prototype probabilistic object base server on top of ObjectStore. We describe experiments to ass ess the efficiency of different possible rewrite rules. 1Institut und Ludwig Wittgenstein Labor für Informationss ysteme, Technische Universität Wien, Favoritenstraße 9–11, A-1040 Wien, Austria. E-mail: feiter, lukasiewicz g@kr.tuwien.ac.at 2Department of Computer Science, Bucknell University, Lewi sburg, PA 17837, USA. E-mail: jameslu@bucknell.edu 3Institute for Advanced Computer Studies, Institute for Sys tems Research and Department of Computer Science, University of Maryland, College Park, Maryland 20742, USA. E-mail: vs@cs.umd.edu Acknowledgements: This work were supported by a TASC/DARPA grant J09301S9806 1, by the Army Research Laboratory under contract number DAAL01-97-K013 5, by an NSF Young Investigator award IRI93-57756 and an NSF grant CCR9731893, and by the Austrian Sci ence Fund under project N Z29-INF. Copyright c 2001 by the authors INFSYS RR 1843-99-11 I

[1]  George Boole,et al.  The Laws of Thought , 2003 .

[2]  Editors , 2003 .

[3]  L. Lakshmanan,et al.  A Parametric Approach to Deductive Databases with Uncertainty , 1996, IEEE Trans. Knowl. Data Eng..

[4]  Extension of the Relational Algebra to Probabilistic Complex Values , 2000, FoIKS.

[5]  Sumit Sarkar,et al.  PSQL: A Query Language for Probabilistic Relational Data , 1998, Data Knowl. Eng..

[6]  Curtis E. Dyreson,et al.  Supporting valid-time indeterminacy , 1998, TODS.

[7]  V. S. Subrahmanian,et al.  Hybrid Probabilistic Programs , 2000, J. Log. Program..

[8]  Laks V. S. Lakshmanan,et al.  ProbView: a flexible probabilistic database system , 1997, TODS.

[9]  Kathryn B. Laskey,et al.  Network Fragments: Representing Knowledge for Constructing Probabilistic Models , 1997, UAI.

[10]  Avi Pfeffer,et al.  Object-Oriented Bayesian Networks , 1997, UAI.

[11]  P. Venkat Rangan,et al.  Handbook of Multimedia Information Management , 1997 .

[12]  Norbert Fuhr,et al.  A probabilistic relational algebra for the integration of information retrieval and database systems , 1997, TOIS.

[13]  Norbert Fuhr,et al.  A probabilistic NF2 relational algebra for integrated information retrieval and database systems , 1996 .

[14]  Sumit Sarkar,et al.  A probabilistic relational model and algebra , 1996, TODS.

[15]  Henry E. Kyburg,et al.  Set-based Bayesianism , 1996, IEEE Trans. Syst. Man Cybern. Part A.

[16]  G. Graefe The Cascades Framework for Query Optimization , 1995, IEEE Data Eng. Bull..

[17]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[18]  Laks V. S. Lakshmanan,et al.  Probabilistic Deductive Databases , 1994, ILPS.

[19]  Laks V. S. Lakshmanan,et al.  Modeling Uncertainty in Deductive Databases , 1994, DEXA.

[20]  Peter Haddawy,et al.  Generating Bayesian Networks from Probablity Logic Knowledge Bases , 1994, UAI.

[21]  Michael Pittarelli,et al.  An Algebra for Probabilistic Databases , 1994, IEEE Trans. Knowl. Data Eng..

[22]  Solomon Eyal Shimony,et al.  A Probabilistic Object-Oriented Data Model , 1994, Data Knowl. Eng..

[23]  Fereidoon Sadri Modeling Uncertainty in Object-Oriented Databases , 1993, SOFTEKS Workshop on Incompleteness and Uncertainty in Information Systems.

[24]  Elisa Bertino,et al.  Object-Oriented Database Systems: Concepts and Architectures , 1993 .

[25]  Hector Garcia-Molina,et al.  The Management of Probabilistic Data , 1992, IEEE Trans. Knowl. Data Eng..

[26]  Werner Kießling,et al.  Database Support for Problematic Knowledge , 1992, EDBT.

[27]  François Bancilhon,et al.  Building an Object-Oriented Database System, The Story of O2 , 1992 .

[28]  Werner Kießling,et al.  New Directions For Uncertainty Reasoning In Deductive Databases , 1991, SIGMOD Conference.

[29]  Won Kim,et al.  Introduction to Object-Oriented Databases , 1991, Computer systems.

[30]  Stanley B. Zdonik,et al.  A query algebra for object-oriented databases , 1990, [1990] Proceedings. Sixth International Conference on Data Engineering.

[31]  David J. DeWitt,et al.  The Object-Oriented Database System Manifesto , 1994, Building an Object-Oriented Database System, The Story of O2.

[32]  Joseph Y. Halpern An Analysis of First-Order Logics of Probability , 1989, IJCAI.

[33]  Michael Kifer,et al.  On the Semantics of Rule-Based Expert Systems with Uncertainty , 1988, ICDT.

[34]  Ronald Fagin,et al.  A logic for reasoning about probabilities , 1988, [1988] Proceedings. Third Annual Information Symposium on Logic in Computer Science.

[35]  Michael Pittarelli,et al.  The Theory of Probabilistic Databases , 1987, VLDB.

[36]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[37]  R. Stephenson A and V , 1962, The British journal of ophthalmology.