Active Inference, Belief Propagation, and the Bethe Approximation

When modeling goal-directed behavior in the presence of various sources of uncertainty, planning can be described as an inference process. A solution to the problem of planning as inference was previously proposed within the active inference framework, in the form of an approximate inference scheme based on variational free energy. However, this scheme relied on the mean-field approximation, which assumes statistical independence of hidden variables, is known to produce overconfident beliefs, and may converge to local minima of the free energy. To better capture the spatiotemporal properties of an environment, we reformulated the approximate inference process using the so-called Bethe approximation. Importantly, the Bethe approximation allows pairwise statistical dependencies to be represented. Under these assumptions, minimizing the variational free energy yields the belief propagation algorithm, which is commonly used in machine learning. To illustrate the differences between the mean-field approximation and the Bethe approximation, we simulated agent behavior in a simple goal-reaching task with different types of uncertainty. Overall, the Bethe agent achieves higher success rates in reaching goal states. We attribute the better performance of the Bethe agent to more accurate predictions about the consequences of its own actions. Consequently, active inference based on the Bethe approximation extends the range of application of active inference to more complex behavioral tasks.
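The contrast between the two approximations can be illustrated on a toy problem. The following is a minimal sketch, not the agent or the goal-reaching task described above: it assumes a small hidden Markov chain with made-up parameters (`p0`, `A`, `lik`) and hypothetical function names, and compares state marginals from belief propagation (sum-product, which is exact on a chain) and from a fully factorized mean-field update against brute-force enumeration.

```python
import itertools
import numpy as np

def bp_marginals(p0, A, lik):
    """Sum-product (forward-backward) marginals; exact on a chain, which is a tree."""
    T, K = lik.shape
    alpha, beta = np.zeros((T, K)), np.ones((T, K))
    alpha[0] = p0 * lik[0]
    alpha[0] /= alpha[0].sum()
    for t in range(1, T):                      # forward messages
        alpha[t] = lik[t] * (alpha[t - 1] @ A)
        alpha[t] /= alpha[t].sum()
    for t in range(T - 2, -1, -1):             # backward messages
        beta[t] = A @ (lik[t + 1] * beta[t + 1])
        beta[t] /= beta[t].sum()
    post = alpha * beta
    return post / post.sum(axis=1, keepdims=True)

def mean_field_marginals(p0, A, lik, n_iter=200):
    """Coordinate-ascent updates for a fully factorized q(s_1..s_T) = prod_t q_t(s_t)."""
    T, K = lik.shape
    q = np.full((T, K), 1.0 / K)
    log_A = np.log(A)
    for _ in range(n_iter):
        for t in range(T):
            log_q = np.log(lik[t])
            if t == 0:
                log_q = log_q + np.log(p0)
            else:
                log_q = log_q + q[t - 1] @ log_A   # expected log transition into s_t
            if t < T - 1:
                log_q = log_q + log_A @ q[t + 1]   # expected log transition out of s_t
            log_q -= log_q.max()                   # numerical stability
            q[t] = np.exp(log_q) / np.exp(log_q).sum()
    return q

def exact_marginals(p0, A, lik):
    """Brute-force posterior marginals by enumerating every state sequence."""
    T, K = lik.shape
    post = np.zeros((T, K))
    for seq in itertools.product(range(K), repeat=T):
        p = p0[seq[0]] * lik[0, seq[0]]
        for t in range(1, T):
            p *= A[seq[t - 1], seq[t]] * lik[t, seq[t]]
        for t, s in enumerate(seq):
            post[t, s] += p
    return post / post.sum(axis=1, keepdims=True)

# Illustrative parameters (assumptions, not taken from the paper).
p0 = np.array([0.5, 0.5])                          # prior over the first state
A = np.array([[0.9, 0.1], [0.1, 0.9]])             # A[i, j] = p(s_{t+1}=j | s_t=i)
lik = np.array([[0.7, 0.3], [0.6, 0.4],
                [0.4, 0.6], [0.3, 0.7]])           # lik[t, k] = p(o_t | s_t=k)

exact = exact_marginals(p0, A, lik)
bp = bp_marginals(p0, A, lik)
mf = mean_field_marginals(p0, A, lik)
bp_err = np.abs(bp - exact).max()
mf_err = np.abs(mf - exact).max()
print("max abs error, belief propagation:", bp_err)
print("max abs error, mean-field:        ", mf_err)
```

On a chain, the belief propagation marginals should agree with enumeration up to floating-point error, whereas the mean-field marginals generally do not, because the factorized posterior discards the dependency between neighboring states.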
