Probabilistic Models and Generative Neural Networks: Towards an Unified Framework for Modeling Normal and Impaired Neurocognitive Functions

Connectionist models can be characterized within the more general framework of probabilistic graphical models, which allow to efficiently describe complex statistical distributions involving a large number of interacting variables. This integration allows building more realistic computational models of cognitive functions, which more faithfully reflect the underlying neural mechanisms at the same time providing a useful bridge to higher-level descriptions in terms of Bayesian computations. Here we discuss a powerful class of graphical models that can be implemented as stochastic, generative neural networks. These models overcome many limitations associated with classic connectionist models, for example by exploiting unsupervised learning in hierarchical architectures (deep networks) and by taking into account top-down, predictive processing supported by feedback loops. We review some recent cognitive models based on generative networks, and we point out promising research directions to investigate neuropsychological disorders within this approach. Though further efforts are required in order to fill the gap between structured Bayesian models and more realistic, biophysical models of neuronal dynamics, we argue that generative neural networks have the potential to bridge these levels of analysis, thereby improving our understanding of the neural bases of cognition and of pathologies caused by brain damage.

[1]  Danny Keogan,et al.  Distributed hierarchical processing , 2002, Photomask Japan.

[2]  M. Corbetta,et al.  Control of goal-directed and stimulus-driven attention in the brain , 2002, Nature Reviews Neuroscience.

[3]  Karl J. Friston,et al.  Structural and Functional Brain Networks: From Connections to Cognition , 2013, Science.

[4]  Marco Zorzi,et al.  Paying Attention through Eye Movements: A Computational Investigation of the Premotor Theory of Spatial Attention , 2012, Journal of Cognitive Neuroscience.

[5]  Wei Ji Ma,et al.  Bayesian inference with probabilistic population codes , 2006, Nature Neuroscience.

[6]  Danielle S. Bassett,et al.  Cognitive Network Neuroscience , 2015, Journal of Cognitive Neuroscience.

[7]  P. Dayan,et al.  States versus Rewards: Dissociable Neural Prediction Error Signals Underlying Model-Based and Model-Free Reinforcement Learning , 2010, Neuron.

[8]  James L. McClelland Integrating probabilistic models of perception and interactive neural networks: a historical and tutorial review , 2013, Front. Psychol..

[9]  Geoffrey E. Hinton Learning multiple layers of representation , 2007, Trends in Cognitive Sciences.

[10]  U Długoń,et al.  [Charles Bonnet syndrome]. , 2000, Psychiatria polska.

[11]  V. Menon Large-Scale Brain Networks in Cognition: Emerging Principles , 2010 .

[12]  J. A. Scott Kelso,et al.  Multistability and metastability: understanding dynamic coordination in the brain , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[13]  Marco Zorzi,et al.  The Role of Semantic and Symbolic Representations in Arithmetic Processing: Insights from Simulated Dyscalculia in a Connectionist Model , 2004, Cortex.

[14]  Sophie Denève,et al.  Bayesian Spiking Neurons I: Inference , 2008, Neural Computation.

[15]  Ivilin Peev Stoianov,et al.  Number skills are maintained in healthy ageing , 2014, Cognitive Psychology.

[16]  Alan D. Lopez,et al.  The Global Burden of Disease Study , 2003 .

[17]  Marco Zorzi,et al.  Emergence of a 'visual number sense' in hierarchical generative models , 2012, Nature Neuroscience.

[18]  Pierre Baldi,et al.  Bayesian surprise attracts human attention , 2005, Vision Research.

[19]  Geoffrey E. Hinton,et al.  The Recurrent Temporal Restricted Boltzmann Machine , 2008, NIPS.

[20]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[21]  S. Dehaene,et al.  Cultural Recycling of Cortical Maps , 2007, Neuron.

[22]  O. Sporns,et al.  Towards the virtual brain: network modeling of the intact and the damaged brain. , 2010, Archives italiennes de biologie.

[23]  Marco Zorzi,et al.  Deep generative learning of location-invariant visual word recognition , 2013, Front. Psychol..

[24]  J. Elman,et al.  Rethinking Innateness: A Connectionist Perspective on Development , 1996 .

[25]  Nir Friedman,et al.  Probabilistic Graphical Models - Principles and Techniques , 2009 .

[26]  Alessandro Sperduti,et al.  Learning Orthographic Structure With Sequential Generative Neural Networks , 2016, Cogn. Sci..

[27]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[28]  David C. Plaut,et al.  Deep Dyslexia: A Case Study of , 1993 .

[29]  James L. McClelland,et al.  Why there are complementary learning systems in the hippocampus and neocortex: insights from the successes and failures of connectionist models of learning and memory. , 1995, Psychological review.

[30]  G. Humphreys,et al.  Hierarchies, similarity, and interactivity in object recognition: “Category-specific” neuropsychological deficits , 2001, Behavioral and Brain Sciences.

[31]  G. Goranović,et al.  Theory and simulation. , 1996, Current opinion in structural biology.

[32]  Wolfgang Maass,et al.  Bayesian Computation Emerges in Generic Cortical Microcircuits through Spike-Timing-Dependent Plasticity , 2013, PLoS Comput. Biol..

[33]  Nikolaus Kriegeskorte,et al.  Deep Supervised, but Not Unsupervised, Models May Explain IT Cortical Representation , 2014, PLoS Comput. Biol..

[34]  Wolfgang Maass,et al.  Neural Dynamics as Sampling: A Model for Stochastic Computation in Recurrent Networks of Spiking Neurons , 2011, PLoS Comput. Biol..

[35]  Rajesh P. N. Rao Bayesian Computation in Recurrent Neural Circuits , 2004, Neural Computation.

[36]  D. Plaut,et al.  Complementary neural representations for faces and words: A computational exploration , 2011, Cognitive neuropsychology.

[37]  Juhan Nam,et al.  Multimodal Deep Learning , 2011, ICML.

[38]  Marcel A. J. van Gerven,et al.  Deep Neural Networks Reveal a Gradient in the Complexity of Neural Representations across the Ventral Stream , 2014, The Journal of Neuroscience.

[39]  D. J. Felleman,et al.  Distributed hierarchical processing in the primate cerebral cortex. , 1991, Cerebral cortex.

[40]  G. Humphreys,et al.  Bridging the gap between physiology and behavior: evidence from the sSoTS model of human visual attention. , 2011, Psychological review.

[41]  Michele De Filippo De Grazia,et al.  Deep Unsupervised Learning on a Desktop PC: A Primer for Cognitive Scientists , 2013, Front. Psychol..

[42]  Karl J. Friston,et al.  The Dynamic Brain: From Spiking Neurons to Neural Masses and Cortical Fields , 2008, PLoS Comput. Biol..

[43]  Vinod Menon,et al.  Functional connectivity in the resting brain: A network analysis of the default mode hypothesis , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Thomas L. Dean,et al.  Neural Networks and Neuroscience-Inspired Computer Vision , 2014, Current Biology.

[45]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[46]  Geoffrey E. Hinton,et al.  A Learning Algorithm for Boltzmann Machines , 1985, Cogn. Sci..

[47]  Tai Sing Lee,et al.  Hierarchical Bayesian inference in the visual cortex. , 2003, Journal of the Optical Society of America. A, Optics, image science, and vision.

[48]  E. Brannon,et al.  Monotonic Coding of Numerosity in Macaque Lateral Intraparietal Area , 2007, PLoS biology.

[49]  Geoffrey E. Hinton,et al.  The "wake-sleep" algorithm for unsupervised neural networks. , 1995, Science.

[50]  P. Berkes,et al.  Statistically Optimal Perception and Learning: from Behavior to Neural Representations , 2022 .

[51]  W. Singer,et al.  Dynamic predictions: Oscillations and synchrony in top–down processing , 2001, Nature Reviews Neuroscience.

[52]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[53]  Joshua B. Tenenbaum,et al.  Human-level concept learning through probabilistic program induction , 2015, Science.

[54]  J. Barendregt,et al.  Global burden of disease , 1997, The Lancet.

[55]  Jeffrey S Bowers,et al.  On the biological plausibility of grandmother cells: implications for neural network theories in psychology and neuroscience. , 2009, Psychological review.

[56]  M. Raichle The restless brain: how intrinsic activity organizes brain function , 2015, Philosophical Transactions of the Royal Society B: Biological Sciences.

[57]  G. Hankey,et al.  The global and regional burden of stroke. , 2013, The Lancet. Global health.

[58]  Rajat Raina,et al.  Large-scale deep unsupervised learning using graphics processors , 2009, ICML '09.

[59]  A. Pouget,et al.  Relating unilateral neglect to the neural coding of space , 2000, Current Opinion in Neurobiology.

[60]  Charles Kemp,et al.  Bayesian models of cognition , 2008 .

[61]  Geoffrey E. Hinton,et al.  Lesioning an attractor network: investigations of acquired dyslexia. , 1991, Psychological review.

[62]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[63]  S. Bressler,et al.  Large-scale brain networks in cognition: emerging methods and principles , 2010, Trends in Cognitive Sciences.

[64]  O. Sporns,et al.  Complex brain networks: graph theoretical analysis of structural and functional systems , 2009, Nature Reviews Neuroscience.

[65]  F. Andermann,et al.  Complex visual hallucinations. Clinical and neurobiological insights. , 1998, Brain : a journal of neurology.

[66]  Sidney J. Segalowitz,et al.  Neural Networks and Neuroscience , 1997 .

[67]  Karl J. Friston The free-energy principle: a unified brain theory? , 2010, Nature Reviews Neuroscience.

[68]  Alberto Testolin,et al.  Modeling language and cognition with deep unsupervised learning: a tutorial overview , 2013, Front. Psychol..

[69]  V. Feigin,et al.  Global and regional burden of stroke during 1990–2010: findings from the Global Burden of Disease Study 2010 , 2014, The Lancet.

[70]  Wulfram Gerstner,et al.  Theory and Simulation in Neuroscience , 2012, Science.

[71]  Wolfgang Maass,et al.  Probabilistic Inference in General Graphical Models through Sampling in Stochastic Networks of Spiking Neurons , 2011, PLoS Comput. Biol..

[72]  C. Gilbert,et al.  Brain States: Top-Down Influences in Sensory Processing , 2007, Neuron.

[73]  Charles Kemp,et al.  How to Grow a Mind: Statistics, Structure, and Abstraction , 2011, Science.

[74]  Klaus Podoll,et al.  Complex visual hallucinations , 2007 .

[75]  A. Clark Whatever next? Predictive brains, situated agents, and the future of cognitive science. , 2013, The Behavioral and brain sciences.

[76]  Marco Zorzi,et al.  Space coding for sensorimotor transformations can emerge through unsupervised learning , 2012, Cognitive Processing.

[77]  M. Botvinick,et al.  Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective , 2009, Cognition.

[78]  A. Sillito,et al.  Always returning: feedback and sensory processing in visual cortex and thalamus , 2006, Trends in Neurosciences.

[79]  A. Barabasi,et al.  Universal resilience patterns in complex networks , 2016, Nature.

[80]  Marc'Aurelio Ranzato,et al.  Building high-level features using large scale unsupervised learning , 2011, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[81]  Zoubin Ghahramani,et al.  Probabilistic machine learning and artificial intelligence , 2015, Nature.

[82]  J. Martinerie,et al.  The brainweb: Phase synchronization and large-scale integration , 2001, Nature Reviews Neuroscience.

[83]  Gustavo Deco,et al.  Resting brains never rest: computational insights into potential cognitive architectures , 2013, Trends in Neurosciences.

[84]  J. G. Snodgrass,et al.  A standardized set of 260 pictures: norms for name agreement, image agreement, familiarity, and visual complexity. , 1980, Journal of experimental psychology. Human learning and memory.

[85]  C. Mathers Global Burden of Disease , 2008 .

[86]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[87]  G. Kane Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol 1: Foundations, vol 2: Psychological and Biological Models , 1994 .

[88]  Florent Meyniel,et al.  The Neural Representation of Sequences: From Transition Probabilities to Algebraic Patterns and Linguistic Trees , 2015, Neuron.

[89]  T. Shallice,et al.  Deep Dyslexia: A Case Study of , 1993 .

[90]  D. Schacter,et al.  The Brain's Default Network , 2008, Annals of the New York Academy of Sciences.

[91]  Michael von Aster,et al.  Optimized voxel-based morphometry in children with developmental dyscalculia , 2008, NeuroImage.

[92]  Mark S. Seidenberg,et al.  Impairments in verb morphology after brain injury: a connectionist model. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[93]  Peggy Seriès,et al.  Charles Bonnet Syndrome: Evidence for a Generative Model in the Cortex? , 2013, PLoS Comput. Biol..

[94]  anonymous,et al.  Visual agnosia , 2012, BMJ : British Medical Journal.

[95]  Hillol Kargupta,et al.  Graphical Models: Foundations of Neural Computation , 2016, Pattern Analysis and Applications.

[96]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[97]  Scott Kirkpatrick,et al.  Optimization by Simmulated Annealing , 1983, Sci..

[98]  Marco Zorzi,et al.  Computational Modeling of Numerical Cognition , 2004 .

[99]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[100]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[101]  Willem E. Frankenhuis,et al.  Bayesian Models of , 2016 .