A Spiking Neural Network Model of Multi-modal Language Processing of Robot Instructions

Presented is a spiking neural network architecture of human language instruction recognition and robot control. The network is based on a model of a leaky Integrate-And-Fire (lIAF) spiking neurone with Active Dendrites and Dynamic Synapses (ADDS) [1,2,3]. The architecture contains several main modules associating information across different modalities: an auditory system recognising single spoken words, a visual system recognising objects of different colour and shape, motor control system for navigation and motor control and a working memory. The main focus of this presentation is the working memory module whose function is sequential processing of word from a language instruction, task and goal representation and cross-modal association of objects and actions. We test the model with a robot whose goal is to recognise and execute language instructions. The work demonstrates the potential of spiking neurons for processing spatio-temporal patterns and the experiments present spiking neural networks as a paradigm which can be applied for modelling sequence detectors at word level for robot instructions.

[1]  S. Shimojo,et al.  Sensory modalities are not separate modalities: plasticity and interactions , 2001, Current Opinion in Neurobiology.

[2]  R. Desimone,et al.  The Role of Neural Mechanisms of Attention in Solving the Binding Problem , 1999, Neuron.

[3]  D. V. Essen,et al.  Neural mechanisms of form and motion processing in the primate visual system , 1994, Neuron.

[4]  D. C. Essen,et al.  Neural responses to polar, hyperbolic, and Cartesian gratings in area V4 of the macaque monkey. , 1996, Journal of neurophysiology.

[5]  Clayton T. Morrison,et al.  Information Theory and Representation in Associative Word Learning , 2003 .

[6]  Keiji Tanaka,et al.  Neuronal selectivities to complex object features in the ventral visual pathway of the macaque cerebral cortex. , 1994, Journal of neurophysiology.

[7]  Friedemann Pulvermüller,et al.  The Neuroscience of Language: On Brain Circuits of Words and Serial Order , 2003 .

[8]  A. Roskies The Binding Problem , 1999, Neuron.

[9]  J. H. Jacksons On affections of speech from disease of the brain , 1878 .

[10]  Paavo Alku,et al.  Memory Traces for Words as Revealed by the Mismatch Negativity , 2001, NeuroImage.

[11]  Ted Briscoe,et al.  Linguistic Evolution through Language Acquisition: Formal and Computational Models. , 2002 .

[12]  Geoffrey M. Ghose,et al.  Specialized Representations in Visual Cortex A Role for Binding? , 1999, Neuron.

[13]  Michael A. Arbib,et al.  Computing the brain : a guide to neuroinformatics , 2001 .

[14]  F. Fazio,et al.  Brain processing of native and foreign languages , 1996, NeuroImage.

[15]  F. Pulvermüller,et al.  The Concept of Transcortical Cell Assemblies: a Key to the Understanding of Cortical Lateralization and Interhemispheric Interaction , 1996, Neuroscience & Biobehavioral Reviews.

[16]  A. Syrota,et al.  The Cortical Representation of Speech , 1993, Journal of Cognitive Neuroscience.

[17]  N. Chater,et al.  Computational models and Rethinking innateness , 1999, Journal of Child Language.

[18]  David C. Sterratt,et al.  Does Morphology Influence Temporal Plasticity? , 2002, ICANN.

[19]  W. Singer,et al.  Visuomotor integration is associated with zero time-lag synchronization among cortical areas , 1997, Nature.

[20]  T. A. Cartwright,et al.  Distributional regularity and phonotactic constraints are useful for segmentation , 1996, Cognition.

[21]  M. Meredith,et al.  On the neuronal basis for multisensory convergence: a brief overview. , 2002, Brain research. Cognitive brain research.

[22]  H. Barbas Connections underlying the synthesis of cognition, memory, and emotion in primate prefrontal cortices , 2000, Brain Research Bulletin.

[23]  Christoph von der Malsburg,et al.  The Correlation Theory of Brain Function , 1994 .

[24]  Christina M. Krause,et al.  Early and Late Mismatch Negativity Elicited by Words and Speech-Like Stimuli in Children , 2001, Brain and Language.

[25]  Jeffrey Mark Siskind,et al.  Learning Word-to-Meaning Mappings , 1997 .

[26]  Michael D Rugg,et al.  Further study of the electrophysiological correlates of lexical decision , 1983, Brain and Language.

[27]  G. Rizzolatti,et al.  Hearing Sounds, Understanding Actions: Action Representation in Mirror Neurons , 2002, Science.

[28]  J E Lisman,et al.  Storage of 7 +/- 2 short-term memories in oscillatory subcycles , 1995, Science.

[29]  E. Lenneberg Biological Foundations of Language , 1967 .

[30]  G. Rizzolatti,et al.  The organization of the cortical motor system: new concepts. , 1998, Electroencephalography and clinical neurophysiology.

[31]  A. Treisman Solutions to the Binding Problem Progress through Controversy and Convergence , 1999, Neuron.

[32]  G. Rizzolatti,et al.  Neurophysiological mechanisms underlying the understanding and imitation of action , 2001, Nature Reviews Neuroscience.

[33]  D. Pandya,et al.  Architecture and Connections of Cortical Association Areas , 1985 .

[34]  R. Frackowiak,et al.  Demonstrating the implicit processing of visually presented words and pseudowords. , 1996, Cerebral cortex.

[35]  Stefan Wermter,et al.  Spike-Timing Dependent Competitive Learning of Integrate-and-Fire Neurons with Active Dendrites , 2002, ICANN.

[36]  G. Rizzolatti,et al.  Parietal cortex: from sight to action , 1997, Current Opinion in Neurobiology.

[37]  A. Norman Redlich,et al.  Redundancy Reduction as a Strategy for Unsupervised Learning , 1993, Neural Computation.

[38]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[39]  C. Rovee-Collier,et al.  Advances in infancy research , 1981 .

[40]  Sigmund Freud,et al.  Zur Auffassung der Aphasien , 1891 .

[41]  L. Fogassi,et al.  Audiovisual mirror neurons and action recognition , 2003, Experimental Brain Research.

[42]  E. Clark,et al.  The Child's Path to Spoken Language. , 1994 .

[43]  G. Rizzolatti,et al.  Localization of grasp representations in humans by PET: 1. Observation versus execution , 1996, Experimental Brain Research.

[44]  Marco Idiart,et al.  Short-Term Memory as a Single Cell Phenomenon , 1995 .

[45]  J. Hughlingsjackson,et al.  ON AFFECTIONS OF SPEECH FROM DISEASE OF THE BRAIN. , 1879 .

[46]  M. Arbib Rana computatrix to human language: towards a computational neuroethology of language evolution , 2003, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[47]  D. Norman Learning and Memory , 1982 .

[48]  Joy Hirsch,et al.  Distinct cortical areas associated with native and second languages , 1997, Nature.

[49]  K. Dautenhahn,et al.  Imitation in Animals and Artifacts , 2002 .

[50]  T. Deacon Cortical connections of the inferior arcuate sulcus cortex in the macaque brain , 1992, Brain Research.

[51]  O Jensen,et al.  Novel lists of 7 +/- 2 known items can be reliably stored in an oscillatory short-term memory network: interaction with long-term memory. , 1996, Learning & memory.

[52]  O. Bertrand,et al.  Oscillatory gamma activity in humans and its role in object representation , 1999, Trends in Cognitive Sciences.

[53]  J. Locke,et al.  Babbling and early speech: continuity and individual differences , 1989 .

[54]  D. V. van Essen,et al.  Corticocortical connections of visual, sensorimotor, and multimodal processing areas in the parietal lobe of the macaque monkey , 2000, The Journal of comparative neurology.

[55]  T. Poggio,et al.  Are Cortical Models Really Bound by the “Binding Problem”? , 1999, Neuron.

[56]  G. Ojemann Cortical organization of language , 1991, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[57]  P. Goldman-Rakic Architecture of the Prefrontal Cortex and the Central Executive , 1995, Annals of the New York Academy of Sciences.

[58]  S. Dehaene,et al.  Anatomical variability in the cortical representation of first and second language , 1997, Neuroreport.

[59]  M. Sur,et al.  Visual behaviour mediated by retinal projections directed to the auditory pathway , 2000, Nature.

[60]  S. Bressler,et al.  Episodic multiregional cortical coherence at multiple frequencies during visual task performance , 1993, Nature.

[61]  M. Goldsmith,et al.  Statistical Learning by 8-Month-Old Infants , 1996 .

[62]  J. Lisman,et al.  Hippocampal sequence-encoding driven by a cortical multi-item working memory buffer , 2005, Trends in Neurosciences.

[63]  F. Pulvermüller,et al.  Words in the brain's language , 1999, Behavioral and Brain Sciences.

[64]  F. Pulvermüller A brain perspective on language mechanisms: from discrete neuronal ensembles to serial order , 2002, Progress in Neurobiology.

[65]  C. Price,et al.  A functional imaging study of translation and language switching. , 1999, Brain : a journal of neurology.

[66]  M. Arbib,et al.  Language within our grasp , 1998, Trends in Neurosciences.

[67]  J. Fletcher,et al.  The changing nervous system : neurobehavioral consequences of early brain disorders , 1999 .

[68]  F. Aboitiz,et al.  The evolutionary origin of the language areas in the human brain. A neuroanatomical perspective , 1997, Brain Research Reviews.

[69]  Zellig S. Harris,et al.  From Phoneme to Morpheme , 1955 .

[70]  Nick Chater,et al.  Models of Language Acquisition: Inductive and Deductive Approaches , 2000 .

[71]  Friedemann Pulvermüller,et al.  Neurophysiological evidence of memory traces for words in the human brain , 2002, Neuroreport.

[72]  J. Fuster,et al.  Visuo-tactile cross-modal associations in cortical somatosensory cells. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[73]  F Pulvermüller,et al.  The Right Hemisphere’s Role in Action Word Processing: a Double Case Study. , 2001, Neurocase.

[74]  E. Bates,et al.  Linguistic , Cognitive and Affective Development in Children with Pre-and Perinatal Focal Brain Injury : A Ten-Year Overview from the San Diego Longitudinal Project , 2004 .

[75]  Elizabeth Bates,et al.  Neural mediation of language development: Perspectives from lesion studies of infants and children , 1999 .

[76]  F. Pulvermüller,et al.  Distributed neuronal networks for encoding category‐specific semantic information: the mismatch negativity to action words , 2004, The European journal of neuroscience.

[77]  Luc Steels,et al.  Bootstrapping grounded word semantics , 1999 .

[78]  James K. Kroger,et al.  Cross-modal and cross-temporal association in neurons of frontal cortex , 2000, Nature.

[79]  A. Goldman,et al.  Mirror neurons and the simulation theory of mind-reading , 1998, Trends in Cognitive Sciences.

[80]  A. Treisman,et al.  Illusory conjunctions in the perception of objects , 1982, Cognitive Psychology.

[81]  P. Broca Remarques sur le siège de la faculté du langage articulé, suivies d'une observation d'aphémie (perte de la parole) , 1861 .

[82]  W. Marslen-Wilson,et al.  The temporal structure of spoken language understanding , 1980, Cognition.

[83]  Luc Steels,et al.  Aibo''s first words. the social learning of language and meaning. Evolution of Communication , 2002 .

[84]  Vittorio Gallese,et al.  Mirror neurons: A sensorimotor representation system , 2001 .

[85]  J. Wolfe,et al.  The Psychophysical Evidence Review for a Binding Problem in Human , 1999 .

[86]  J. A. Shafer,et al.  Understanding aphasia. , 1954, Archives of Physical Medicine and Rehabilitation.

[87]  R. Desimone,et al.  Selective attention gates visual processing in the extrastriate cortex. , 1985, Science.

[88]  Dora E Angelaki,et al.  Control of eye orientation: where does the brain's role end and the muscle's begin? , 2004, The European journal of neuroscience.

[89]  F. Pulvermüller Brain reflections of words and their meaning , 2001, Trends in Cognitive Sciences.

[90]  Elizabeth Bates,et al.  From first words to grammar in children with focal brain injury , 1997 .

[91]  M. Lappe,et al.  Chapter 23 Information transfer between sensory and motor networks , 2001 .

[92]  G. Rizzolatti,et al.  Action recognition in the premotor cortex. , 1996, Brain : a journal of neurology.

[93]  Paul R. Cohen,et al.  Grounding knowledge in sensors: unsupervised learning for language and planning , 2001 .

[94]  M. Arbib From monkey-like action recognition to human language: An evolutionary framework for neurolinguistics , 2005, Behavioral and Brain Sciences.

[95]  A. A. Mullin,et al.  Principles of neurodynamics , 1962 .

[96]  M. Arbib The mirror system, imitation, and the evolution of language , 2002 .

[97]  Stefan Wermter,et al.  Temporal sequence detection with spiking neurons: towards recognizing robot language instructions , 2006, Connect. Sci..

[98]  Friedemann Pulverm Uuml,et al.  Words in the brain's language , 1999 .

[99]  J P Rauschecker,et al.  Hemispheric specialization for English and ASL: left invariance‐right variability , 1998, Neuroreport.

[100]  P. Goldman-Rakic,et al.  Dissociation of object and spatial processing domains in primate prefrontal cortex. , 1993, Science.

[101]  Michael I. Posner,et al.  Flexible neural circuitry in word processing , 1999, Behavioral and Brain Sciences.

[102]  O Jensen,et al.  Theta/gamma networks with slow NMDA channels learn sequences and encode episodic memory: role of NMDA channels in recall. , 1996, Learning & memory.

[103]  Stefan Wermter,et al.  Spike-timing-dependent synaptic plasticity: from single spikes to spike trains , 2004, Neurocomputing.

[104]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[105]  J. Fuster Memory in the cerebral cortex , 1994 .

[106]  Michael A. Arbib,et al.  Language evolution: neural homologies and neuroinformatics , 2003, Neural Networks.