Probabilistic Models for Relational Data

We introduce a graphical language for relational data called the probabilistic entity-relationship (PER) model. The model is an extension of the entity-relationship model, a common model for the abstract representation of database structure. We concentrate on the directed version of this model, the directed acyclic probabilistic entity-relationship (DAPER) model. The DAPER model is closely related to the plate model and the probabilistic relational model (PRM), two existing models for relational data. It is more expressive than either of them, and it also helps to demonstrate their similarity. In addition to describing the new language, we discuss important facets of modeling relational data, including the use of restricted relationships, self relationships, and probabilistic relationships. Many examples are provided.
