A Probability Data Model and its Semantics

As database systems are increasingly being used in advanced applications, it is becoming common that data in these applications contain some elements of uncertainty. These arise from many factors, such as measurement errors and cognitive errors. As such, many researchers have focused on defining comprehensive uncertainty data models of uncertainty database systems. However, existing uncertainty data models do not adequately support some applications. Moreover, very few works address uncertainty tuple calculus. In this paper we advocate a probabilistic data model for representing uncertain information. In particular, we establish a probabilistic tuple calculus language and its semantics to meet the corresponding probabilistic relational algebra.

[1]  Zili Zhang,et al.  Temporal constraint satisfaction in matrix method , 2003, Appl. Artif. Intell..

[2]  Chengqi Zhang,et al.  Discovering Associations in Very Large Databases by Approximating , 2003, Acta Cybern..

[3]  Michael Pittarelli,et al.  An Algebra for Probabilistic Databases , 1994, IEEE Trans. Knowl. Data Eng..

[4]  C No Anytime Mining for Multi-User Applications , 2002 .

[5]  Shichao Zhang,et al.  Product hierarchy-based customer profiles for electronic commerce recommendation , 2002, Proceedings. International Conference on Machine Learning and Cybernetics.

[6]  Eugene Wong,et al.  A statistical approach to incomplete information in database systems , 1982, TODS.

[7]  Chengqi Zhang,et al.  Discovering causality in large databases , 2002, Appl. Artif. Intell..

[8]  Madan M. Gupta,et al.  Analysis and management of uncertainty : theory and applications , 1992 .

[9]  Xindong Wu,et al.  Multi-Database Mining , 2003, IEEE Intell. Informatics Bull..

[10]  Haim Mendelson,et al.  Incomplete information costs and database design , 1986, TODS.

[11]  Chengqi Zhang,et al.  A Hybrid Model for Sharing Information Between Fuzzy, Uncertain and Default Reasoning Models in Multi-agent Systems , 2002, Int. J. Uncertain. Fuzziness Knowl. Based Syst..

[12]  Bill P. Buckles,et al.  Information-theoretical characterization of fuzzy relational databases , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[13]  Chengqi Zhang,et al.  Data preparation for data mining , 2003, Appl. Artif. Intell..

[14]  Arun K. Majumdar,et al.  Fuzzy Functional Dependencies and Lossless Join Decomposition of Fuzzy Relational Database Systems , 1988, ACM Trans. Database Syst..

[15]  Frederick E. Petry,et al.  Extending the fuzzy database with fuzzy numbers , 1984, Inf. Sci..

[16]  Michael Pittarelli,et al.  The Theory of Probabilistic Databases , 1987, VLDB.

[17]  Moshe Y. Vardi Querying Logical Databases , 1986, J. Comput. Syst. Sci..

[18]  Li Liu,et al.  Mining Dynamic databases by Weighting , 2003, Acta Cybern..

[19]  Laks V. S. Lakshmanan,et al.  ProbView: a flexible probabilistic database system , 1997, TODS.

[20]  Jeffrey D. Ullman,et al.  Principles Of Database And Knowledge-Base Systems , 1979 .

[21]  Chengqi Zhang,et al.  Post-mining: maintenance of association rules by weighting , 2003, Inf. Syst..

[22]  Henri Prade,et al.  Generalizing Database Relational Algebra for the Treatment of Incomplete/Uncertain Information and Vague Queries , 1984, Inf. Sci..

[23]  Hector Garcia-Molina,et al.  The Management of Probabilistic Data , 1992, IEEE Trans. Knowl. Data Eng..

[24]  Arbee L. P. Chen,et al.  Answering heterogeneous database queries with degrees of uncertainty , 2005, Distributed and Parallel Databases.

[25]  Geoffrey I. Webb,et al.  Identifying Approximate Itemsets of Interest in Large Databases , 2004, Applied Intelligence.

[26]  Xindong Wu,et al.  Mining Both Positive and Negative Association Rules , 2002, ICML.

[27]  Sumit Sarkar,et al.  A probabilistic relational model and algebra , 1996, TODS.

[28]  Laks V. S. Lakshmanan,et al.  Probabilistic Deductive Databases , 1994, ILPS.

[29]  V. S. Subrahmanian,et al.  Stable Semantics for Probabilistic Deductive Databases , 1994, Inf. Comput..

[30]  Zili Zhang,et al.  An agent-based hybrid framework for database mining , 2003, Appl. Artif. Intell..

[31]  Norbert Fuhr,et al.  A probabilistic relational algebra for the integration of information retrieval and database systems , 1997, TOIS.

[32]  Chengqi Zhang,et al.  Propagating temporal relations of intervals by matrix , 2002, Appl. Artif. Intell..

[33]  Abraham Kandel,et al.  Implementing Imprecision in Information Systems , 1985, Inf. Sci..

[34]  Xindong Wu,et al.  Synthesizing High-Frequency Rules from Different Data Sources , 2003, IEEE Trans. Knowl. Data Eng..

[35]  Chengqi Zhang,et al.  Anytime mining for multiuser applications , 2002, IEEE Trans. Syst. Man Cybern. Part A.

[36]  Tomasz Imielinski,et al.  Incomplete Information in Relational Databases , 1984, JACM.