Prioritized Asynchronous Belief Propagation

Message scheduling is shown to be very eective in belief propagation (BP) algorithms. However, most existing scheduling algorithms use xed heuristics regardless of the structure of the graphs or properties of the distribution. On the other hand, designing dierent scheduling heuristics for all graph structures are not feasible. In this paper, we propose a reinforcement learning based message scheduling framework (RLBP) to learn the heuristics automatically which generalizes to any graph structures and distributions. In the experiments, we show that the learned problem-specic heuristics largely outperform other baselines in speed.

[1]  Richard D. Wesel,et al.  LDPC Decoders with Informed Dynamic Scheduling , 2010, IEEE Transactions on Communications.

[2]  John W. Fisher,et al.  Loopy Belief Propagation: Convergence and Effects of Message Errors , 2005, J. Mach. Learn. Res..

[3]  I JordanMichael,et al.  Graphical Models, Exponential Families, and Variational Inference , 2008 .

[4]  W. Freeman,et al.  Generalized Belief Propagation , 2000, NIPS.

[5]  Martin J. Wainwright,et al.  Tree-based reparameterization for approximate inference on loopy graphs , 2001, NIPS.

[6]  Michael I. Jordan,et al.  Graphical Models, Exponential Families, and Variational Inference , 2008, Found. Trends Mach. Learn..

[7]  A. Hasman,et al.  Probabilistic reasoning in intelligent systems: Networks of plausible inference , 1991 .

[8]  Yishay Mansour,et al.  Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.

[9]  Ian McGraw,et al.  Residual Belief Propagation: Informed Scheduling for Asynchronous Message Passing , 2006, UAI.

[10]  Michael I. Jordan,et al.  Loopy Belief Propagation for Approximate Inference: An Empirical Study , 1999, UAI.

[11]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[12]  Marc P. C. Fossorier,et al.  Reduced latency iterative decoding of LDPC codes , 2005, GLOBECOM '05. IEEE Global Telecommunications Conference, 2005..

[13]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[14]  Martin L. Puterman,et al.  Markov Decision Processes: Discrete Stochastic Dynamic Programming , 1994 .

[15]  R. Bellman A Markovian Decision Process , 1957 .