An Edge Deletion Semantics for Belief Propagation

Iterative belief propagation is an influential method for approximate inference in probabilistic graphical models, perhaps the most influential method of the last decade. Given its wide-spread applicability in various domains, there has been a great interest in developing semantics for this method to both characterize and control the quality of its approximations. We present in this paper a new semantics for belief propagation, formalizing it as a method of exact inference on a simplified model that has been obtained by deleting edges from the original. When we delete an edge, however, we lose a model dependency, which we can compensate for by adding auxiliary parameters to the model. We show that the iterations of belief propagation are searching for such auxiliary parameters in a model which results from deleting every network edge. This semantics leads to a number of questions: Can we delete fewer than every model edge? Further, which edges should we delete and which should we let remain? The answers to these questions lead to a generalization of belief propagation based on edge deletion, which we present here, allowing one to trade approximation quality with computational resources. This edge deletion perspective sheds new light on belief propagation approximations, and further enables an effective procedure for finding improved approximations through a simple process of edge recovery.

[1]  Adnan Darwiche,et al.  Approximating the Partition Function by Deleting and then Correcting for Model Edges , 2008, UAI.

[2]  Michael I. Jordan,et al.  Loopy Belief Propagation for Approximate Inference: An Empirical Study , 1999, UAI.

[3]  Christopher Meek,et al.  A Variational Inference Procedure Allowing Internal Structure for Overlapping Clusters and Deterministic Constraints , 2006, J. Artif. Intell. Res..

[4]  Daniel P. Huttenlocher,et al.  Efficient Belief Propagation for Early Vision , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[5]  Adnan Darwiche,et al.  A differential approach to inference in Bayesian networks , 2000, JACM.

[6]  Adnan Darwiche,et al.  On Bayesian Network Approximation by Edge Deletion , 2005, UAI.

[7]  Adnan Darwiche,et al.  On the Revision of Probabilistic Beliefs using Uncertain Evidence , 2003, IJCAI.

[8]  Ian McGraw,et al.  Residual Belief Propagation: Informed Scheduling for Asynchronous Message Passing , 2006, UAI.

[9]  Jian Sun,et al.  Symmetric stereo matching for occlusion handling , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[10]  Tom Minka,et al.  A family of algorithms for approximate Bayesian inference , 2001 .

[11]  Martin J. Wainwright,et al.  Tree-based reparameterization framework for analysis of sum-product and related algorithms , 2003, IEEE Trans. Inf. Theory.

[12]  Michael I. Jordan,et al.  Exploiting Tractable Substructures in Intractable Networks , 1995, NIPS.

[13]  David J. Spiegelhalter,et al.  Local computations with probabilities on graphical structures and their application to expert systems , 1990 .

[14]  M. Opper,et al.  Comparing the Mean Field Method and Belief Propagation for Approximate Inference in MRFs , 2001 .

[15]  Adnan Darwiche,et al.  Focusing Generalizations of Belief Propagation on Targeted Queries , 2008, AAAI.

[16]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[17]  Hilbert J. Kappen,et al.  Approximate Inference and Constrained Optimization , 2002, UAI.

[18]  William T. Freeman,et al.  Understanding belief propagation and its generalizations , 2003 .

[19]  William T. Freeman,et al.  Constructing free-energy approximations and generalized belief propagation algorithms , 2005, IEEE Transactions on Information Theory.

[20]  Rina Dechter,et al.  Bucket elimination: A unifying framework for probabilistic inference , 1996, UAI.

[21]  Adnan Darwiche,et al.  A Variational Approach for Approximating Bayesian Networks by Edge Deletion , 2006, UAI.

[22]  Michael I. Jordan,et al.  A generalized mean field algorithm for variational inference in exponential families , 2002, UAI.

[23]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[24]  Wim Wiegerinck,et al.  Variational Approximations between Mean Field Theory and the Junction Tree Algorithm , 2000, UAI.

[25]  Tommi S. Jaakkola,et al.  Tutorial on variational approximation methods , 2000 .

[26]  Uffe Kjærulff,et al.  Reduction of Computational Complexity in Bayesian Networks Through Removal of Weak Dependences , 1994, UAI.

[27]  Brendan J. Frey,et al.  Sequentially Fitting "Inclusive" Trees for Inference in Noisy-OR Networks , 2000, NIPS.

[28]  Riccardo Zecchina,et al.  Survey propagation: An algorithm for satisfiability , 2002, Random Struct. Algorithms.

[29]  Max Welling,et al.  On the Choice of Regions for Generalized Belief Propagation , 2004, UAI.

[30]  Adnan Darwiche,et al.  Many-Pairs Mutual Information for Adding Structure to Belief Propagation Approximations , 2008, AAAI.

[31]  Adnan Darwiche,et al.  Encoding CNFs to Empower Component Analysis , 2006, SAT.

[32]  Yee Whye Teh,et al.  Structured Region Graphs: Morphing EP into GBP , 2005, UAI.

[33]  Volker Tresp,et al.  Model-independent mean-field theory as a local method for approximate propagation of information. , 1999, Network.

[34]  Henri Jacques Suermondt,et al.  Explanation in Bayesian belief networks , 1992 .

[35]  Nevin Lianwen Zhang,et al.  Exploiting Causal Independence in Bayesian Network Inference , 1996, J. Artif. Intell. Res..

[36]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[37]  R. van Engelen,et al.  Approximating Bayesian belief networks by arc removal , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[38]  Yuan Qi,et al.  Tree-structured Approximations by Expectation Propagation , 2003, NIPS.

[39]  Michael I. Jordan,et al.  Factorial Hidden Markov Models , 1995, Machine Learning.

[40]  Adnan Darwiche,et al.  Recursive conditioning , 2001, Artif. Intell..

[41]  Jung-Fu Cheng,et al.  Turbo Decoding as an Instance of Pearl's "Belief Propagation" Algorithm , 1998, IEEE J. Sel. Areas Commun..

[42]  S. Aji,et al.  The Generalized Distributive Law and Free Energy Minimization , 2001 .

[43]  Adnan Darwiche,et al.  An Edge Deletion Semantics for Belief Propagation and its Practical Impact on Approximation Quality , 2006, AAAI.

[44]  Rina Dechter,et al.  Iterative Join-Graph Propagation , 2002, UAI.

[45]  Richard Szeliski,et al.  A Comparative Study of Energy Minimization Methods for Markov Random Fields , 2006, ECCV.

[46]  Adnan Darwiche,et al.  A differential semantics for jointree algorithms , 2002, Artif. Intell..

[47]  Brendan J. Frey,et al.  A Revolution: Belief Propagation in Graphs with Cycles , 1997, NIPS.