Top-Down Feedback in an HMAX-Like Cortical Model of Object Perception Based on Hierarchical Bayesian Networks and Belief Propagation

Hierarchical generative models, such as Bayesian networks, and belief propagation have been shown to provide a theoretical framework that can account for perceptual processes, including feedforward recognition and feedback modulation. The framework explains both psychophysical and physiological experimental data and maps well onto the hierarchical distributed cortical anatomy. However, the complexity required to model cortical processes makes inference, even using approximate methods, very computationally expensive. Thus, existing object perception models based on this approach are typically limited to tree-structured networks with no loops, use small toy examples, or fail to account for certain perceptual aspects such as invariance to transformations or feedback reconstruction. In this study we develop a Bayesian network with an architecture similar to that of HMAX, a biologically inspired hierarchical model of object recognition, and use loopy belief propagation to approximate the model operations (selectivity and invariance). Crucially, the resulting Bayesian network extends the functionality of HMAX by including top-down recursive feedback. Thus, the proposed model not only achieves successful feedforward recognition invariant to noise, occlusions, and changes in position and size, but is also able to reproduce modulatory effects such as illusory contour completion and attention. Our novel and rigorous methodology covers key aspects such as learning using a layerwise greedy algorithm, combining feedback information from multiple parents and reducing the number of operations required. Overall, this work extends an established model of object recognition to include high-level feedback modulation, based on state-of-the-art probabilistic approaches. The methodology employed, consistent with evidence from the visual cortex, can potentially be generalized to build models of hierarchical perceptual organization that include top-down and bottom-up interactions, for example, in other sensory modalities.
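
As a rough illustration of how top-down feedback can modulate a lower-level belief under belief propagation, the sketch below combines a bottom-up likelihood message with a top-down message at a single node, following Pearl's standard rule (belief proportional to the product of the two messages). It is a minimal toy example and not the model described above: the two-state variables, the conditional probability table `cpt`, and all numeric values are assumptions chosen only for illustration.

```python
def normalize(values):
    """Scale a list of non-negative numbers so that it sums to 1."""
    total = sum(values)
    return [v / total for v in values]


def top_down_message(parent_message, cpt):
    """pi(x) = sum_u P(x | u) * parent_message(u), with cpt[u][x] = P(x | u)."""
    n_states = len(cpt[0])
    return [
        sum(parent_message[u] * cpt[u][x] for u in range(len(cpt)))
        for x in range(n_states)
    ]


def belief(pi, lam):
    """Pearl-style belief: normalized elementwise product of pi and lambda."""
    return normalize([p * l for p, l in zip(pi, lam)])


# Hypothetical node with states [contour absent, contour present].
# Rows index the parent state: [object absent, object present].
cpt = [
    [0.9, 0.1],  # object absent: contour unlikely
    [0.2, 0.8],  # object present: contour likely
]

# Ambiguous bottom-up evidence, e.g. no physical edge in the image.
lam = [0.5, 0.5]

# Same evidence under an uninformative and a confident high-level message.
neutral = belief(top_down_message([0.5, 0.5], cpt), lam)
confident = belief(top_down_message([0.05, 0.95], cpt), lam)

print("contour belief, neutral feedback:  ", round(neutral[1], 3))
print("contour belief, confident feedback:", round(confident[1], 3))
```

With identical bottom-up evidence, the belief in the "contour present" state rises when the higher level is confident that an object is present, which is the qualitative effect that illusory contour completion relies on. For simplicity, the sketch passes a single parent message and leaves out loops and multiple parents.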
