Vision, action and language unified through embodiment

Increasing evidence shows that vision, action and language should not be regarded as a set of disembodied processes. Instead, they form a closely integrated and highly dynamic system that is attuned to the constraints of its bodily implementation as well as to the constraints coming from the world with which this body interacts. One consequence of such embodiment of cognition is that seeing an object, even when there is no intention to handle it, activates plans for actions directed toward it (e.g., Tucker & Ellis, 1998, 2001; Fischer & Dahl, 2007). Using object names induces similar action planning effects as seeing the objects themselves (Tucker & Ellis, 2004; Borghi, Glenberg & Kaschak, 2004). Depending on linguistic context, different object features can be activated for action planning, as indicated by facilitated manual responses or ‘‘affordance effects’’ (e.g., Borghi, 2004; Glenberg & Robertson, 2000; Zwaan, 2004). Similarly, different action intentions direct attention differently to object features for processing (e.g., Bekkering & Neggers, 2002; Fischer & Hoellen, 2004; Symes, Tucker, Ellis, Vainio, & Ottoboni, 2008). Eye movements during visually guided actions shed further light on the close relationship between vision, action and language (Land & Furneaux, 1997; Johansson, Westling, Backstrom, & Flanagan, 2001). For example when humans interact with objects, their eyes move ahead of their hands to support the on-line control of grasping (e.g., Bekkering & Neggers, 2002). These behavioral results are supported by brain imaging studies of object affordances in humans (e.g., Grezes, Tucker, Armony, Ellis, & Passingham, 2003) and single cell recordings in monkeys (e.g., Sakata, Taira, Mine, & Murata, 1992; Fadiga, Fogassi, Gallese, & Rizzolatti, 2000). Together, these behavioral and neuroscientific studies have recently begun to inform computational models of embodied cognition. For example, Tsiotas, Borghi and Parisi (2005) devised an artificial life simulation to give an evolutionary account of some affordance effects, and Caligiore, Borghi, Parisi, and Baldassarre (2010) proposed a computational model to account for several affordance-related effects in grasping, reaching, and language. The neuroscientific constraints implemented in the design of the model allow its authors to investigate the neural mechanisms underlying affordance selection and control. The present special issue brings together recent developments at the intersection between behavioral, neuroscientific, and computational approaches to embodied cognition. Strong support for the close link between vision, action and language comes from studies which highlight how language processing and comprehension make use of neural systems ordinarily used for perception and action (Lakoff, 1987; Zwaan, 2004; Barsalou, 1999; Glenberg & Robertson, 1999; Gallese, 2008; Glenberg, 2010). For example, when humans process the word ‘‘cup’’ they seem to reenact (and therefore internally simulate) many of the perceptual, motor and affective representations related to a cup (Barsalou, 1999). In a similar way sentences and abstract words are understood by creating a simulation of the actions underlying them (Glenberg and Kaschak, 2002; D. Caligiore (&) Laboratory of Computational Embodied Neuroscience, Istituto di Scienze e Tecnologie della Cognizione, Consiglio Nazionale delle Ricerche (LOCEN-ISTC-CNR), Roma, Italy e-mail: daniele.caligiore@istc.cnr.it

[1]  Zenon W. Pylyshyn,et al.  What the Mind’s Eye Tells the Mind’s Brain: A Critique of Mental Imagery , 1973 .

[2]  J. Akinsanya The knowledge base. , 1981, Nursing times.

[3]  J. Fodor,et al.  Connectionism and cognitive architecture: A critical analysis , 1988, Cognition.

[4]  G. Lakoff,et al.  Women, Fire, and Dangerous Things: What Categories Reveal about the Mind , 1988 .

[5]  Jason W. Brown,et al.  Enhancement of naming in nonfluent aphasia through gesture , 1990, Brain and Language.

[6]  H. Sakata,et al.  Hand-Movement-Related Neurons of the Posterior Parietal Cortex of the Monkey: Their Role in the Visual Guidance of Hand Movements , 1992 .

[7]  Roberto Caminiti,et al.  Control of arm movement in space : neurophysiological and computational approaches , 1992 .

[8]  Julie C. Sedivy,et al.  Subject Terms: Linguistics Language Eyes & eyesight Cognition & reasoning , 1995 .

[9]  M F Land,et al.  The knowledge base of the oculomotor system. , 1997, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[10]  Luc Steels,et al.  Grounding adaptive language games in robotic agents , 1997 .

[11]  M A Arbib,et al.  From visual affordances in monkey parietal cortex to hippocampo-parietal interactions underlying rat navigation. , 1997, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[12]  B. MacWhinney Models of the emergence of language. , 1998, Annual review of psychology.

[13]  Michael A. Arbib,et al.  Modeling parietal-premotor interactions in primate control of grasping , 1998, Neural Networks.

[14]  R. O’Reilly Six principles for biologically based computational models of cortical cognition , 1998, Trends in Cognitive Sciences.

[15]  R. Ellis,et al.  On the relations between seen objects and components of potential actions. , 1998, Journal of experimental psychology. Human perception and performance.

[16]  M. Arbib,et al.  Language within our grasp , 1998, Trends in Neurosciences.

[17]  Arthur M. Glenberg,et al.  Indexical understanding of instructions , 1999 .

[18]  L. Barsalou,et al.  Whither structured representation? , 1999, Behavioral and Brain Sciences.

[19]  A. Glenberg,et al.  Symbol Grounding and Meaning: A Comparison of High-Dimensional and Embodied Theories of Meaning , 2000 .

[20]  G. Rizzolatti,et al.  Visuomotor neurons: ambiguity of the discharge or 'motor' perception? , 2000, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[21]  M. Gentilucci,et al.  Grasp with hand and mouth: a kinematic study on healthy subjects. , 2001, Journal of neurophysiology.

[22]  R. Ellis,et al.  The potentiation of grasp types during visual object categorization , 2001 .

[23]  R. Johansson,et al.  Eye–Hand Coordination in Object Manipulation , 2001, The Journal of Neuroscience.

[24]  G. Rizzolatti,et al.  Action observation activates premotor and parietal areas in a somatotopic manner: an fMRI study , 2001, The European journal of neuroscience.

[25]  Luciano Fadiga,et al.  Hand action preparation influences the responses to hand pictures , 2002, Neuropsychologia.

[26]  Michael P. Kaschak,et al.  Grounding language in action , 2002, Psychonomic bulletin & review.

[27]  Angelo Cangelosi,et al.  Simulating the Evolution of Language , 2002, Springer London.

[28]  G. Humphreys,et al.  Modelling direct perceptual constraints on action selection: The Naming and Action Model (NAM) , 2002 .

[29]  H. Bekkering,et al.  Visual Search Is Modulated by Action Intentions , 2002, Psychological science.

[30]  Michael A. Arbib,et al.  Schema design and implementation of the grasp-related mirror neuron system , 2002, Biological Cybernetics.

[31]  Rolf A. Zwaan The Immersed Experiencer: Toward An Embodied Theory Of Language Comprehension , 2003 .

[32]  Maurizio Gentilucci,et al.  ß Federation of European Neuroscience Societies SHORT COMMUNICATION Grasp observation influences speech production , 2022 .

[33]  L. Steels Evolving grounded communication for robots , 2003, Trends in Cognitive Sciences.

[34]  T. Paus,et al.  Seeing and hearing speech excites the motor system involved in speech production , 2003, Neuropsychologia.

[35]  Dirk van Rijn,et al.  Proceedings of the 31st annual conference of the Cognitive Science Society , 2003 .

[36]  Scott T. Grafton,et al.  Actions or Hand-Object Interactions? Human Inferior Frontal Cortex and Action Observation , 2003, Neuron.

[37]  R. Passingham,et al.  Objects automatically potentiate action: an fMRI study of implicit processing , 2003, The European journal of neuroscience.

[38]  A. Borghi Object concepts and action: extracting affordances from objects parts. , 2004, Acta psychologica.

[39]  F. Binkofski,et al.  The mirror neuron system and action recognition , 2004, Brain and Language.

[40]  R. Ellis,et al.  Action priming by briefly presented objects. , 2004, Acta psychologica.

[41]  I. Johnsrude,et al.  Somatotopic Representation of Action Words in Human Motor and Premotor Cortex , 2004, Neuron.

[42]  M. H. Fischer,et al.  Space- and object-based attention depend on motor intention. , 2004, The Journal of general psychology.

[43]  Michael P. Kaschak,et al.  Putting words in perspective , 2004, Memory & cognition.

[44]  Ping Li,et al.  Early lexical development in a self-organizing neural network , 2004, Neural Networks.

[45]  G. Rizzolatti,et al.  The mirror-neuron system. , 2004, Annual review of neuroscience.

[46]  Anna M. Borghi,et al.  Objects and Affordances: An Artificial Life Simulation , 2005 .

[47]  Vittorio Gallese,et al.  Listening to Action-related Sentences Activates Fronto-parietal Motor Circuits , 2005, Journal of Cognitive Neuroscience.

[48]  G. Rizzolatti,et al.  Parietal Lobe: From Action Organization to Intention Understanding , 2005, Science.

[49]  Friedemann Pulvermüller,et al.  Brain mechanisms linking language and action , 2005, Nature Reviews Neuroscience.

[50]  Umberto Castiello,et al.  Language in shadow , 2006, Social neuroscience.

[51]  Angelo Cangelosi,et al.  Language Acquisition and Symbol Grounding Transfer with Neural Networks and Cognitive Robots , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[52]  Mark D. Humphries,et al.  A robot model of the basal ganglia: Behavior and intrinsic processing , 2006, Neural Networks.

[53]  Angelo Cangelosi,et al.  An Embodied Model for Sensorimotor Grounding and Grounding Transfer: Experiments With Epigenetic Robots , 2006, Cogn. Sci..

[54]  Martin H. Fischer,et al.  The time course of visuo-motor affordances , 2006, Experimental Brain Research.

[55]  Michael A. Arbib,et al.  Extending the mirror neuron system model, I , 2007, Biological Cybernetics.

[56]  Michael A. Arbib,et al.  Vision and Action in the Language-Ready Brain: From Mirror Neurons to SemRep , 2007, BVAI.

[57]  K. Jellinger Motor Cognition What Actions Tell the Self , 2007 .

[58]  Roel M. Willems,et al.  Neural evidence for the interplay between language, gesture, and action: A review , 2007, Brain and Language.

[59]  V. Gallese Mirror neurons and the social nature of language: The neural exploitation hypothesis , 2008, Social neuroscience.

[60]  Friedemann Pulvermüller,et al.  A neuroanatomically grounded Hebbian-learning model of attention–language interactions in the human brain , 2008, The European journal of neuroscience.

[61]  L. Vainio,et al.  Grasp preparation improves change detection for congruent objects. , 2008, Journal of experimental psychology. Human perception and performance.

[62]  Anna M. Borghi,et al.  Words as tools and the problem of abstract words meanings , 2009 .

[63]  Gerald M. Edelman,et al.  An Embodied Approach to Linking Nervous System Structure and Function to Behavior , 2009 .

[64]  P. Strick,et al.  Cerebellum and nonmotor function. , 2009, Annual review of neuroscience.

[65]  Karl J. Friston Causal Modelling and Brain Connectivity in Functional Magnetic Resonance Imaging , 2009, PLoS biology.

[66]  Jason Fleischer,et al.  Brain-based devices , 2009, IEEE Robotics & Automation Magazine.

[67]  Thomas W. Schubert,et al.  Embodiment as a unifying perspective for psychology , 2009 .

[68]  Dana H. Ballard,et al.  Credit Assignment in Multiple Goal Embodied Visuomotor Behavior , 2010, Front. Psychology.

[69]  K. Plunkett,et al.  A neurocomputational account of taxonomic responding and fast mapping in early word learning. , 2010, Psychological review.

[70]  Tom Ziemke,et al.  Sentence Processing: Linking Language to Motor Chains , 2010, Front. Neurorobot..

[71]  K. McRae,et al.  The Mechanics of Embodiment , 2010 .

[72]  G. Baldassarre,et al.  Modelling Perception with Artificial Neural Networks: The interplay of Pavlovian and instrumental processes in devaluation experiments: a computational embodied neuroscience model tested with a simulated rat , 2010 .

[73]  Bernhard Hommel,et al.  Games with(Out) Frontiers: Toward an Integrated Science of Human Cognition , 2010, Front. Psychology.

[74]  D. Parisi,et al.  TRoPICALS: a computational embodied neuroscience model of compatibility effects. , 2010, Psychological review.

[75]  Kalanit Grill-Spector,et al.  Sparsely-distributed organization of face and limb activations in human ventral temporal cortex , 2010, NeuroImage.

[76]  A. Glenberg Embodiment as a unifying perspective for psychology. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[77]  L. Fadiga,et al.  Active perception: sensorimotor circuits as a cortical basis for language , 2010, Nature Reviews Neuroscience.

[78]  Michael A. Arbib,et al.  Extending the mirror neuron system model, II: what did I just do? A new role for mirror neurons , 2010, Biological Cybernetics.

[79]  Kalanit Grill-Spector,et al.  Not one extrastriate body area: Using anatomical landmarks, hMT+, and visual field maps to parcellate limb-selective activations in human lateral occipitotemporal cortex , 2011, NeuroImage.

[80]  Angelo Cangelosi,et al.  The Mechanics of Embodiment: A Dialog on Embodiment and Computational Modeling , 2011, Front. Psychology.

[81]  Punit Shah Toward a Neurobiology of Unrealistic Optimism , 2012, Front. Psychology.

[82]  John Bridgeman,et al.  Bodies and other visual objects: the dialectics of reaching toward objects , 2013, Psychological research.

[83]  Hideyuki Ando,et al.  Experimental study on co-evolution of categorical perception and communication systems in humans , 2012, Psychological Research.

[84]  M. de Vega,et al.  The comprehension of action-related sentences may cause interference rather than facilitation on matching actions , 2013, Psychological research.

[85]  Vittorio Gallese,et al.  How the motor system handles nouns: a behavioral study , 2011, Psychological Research.

[86]  A. Cangelosi,et al.  How affordances associated with a distractor object affect compatibility effects: A study with the computational model TRoPICALS , 2013, Psychological research.

[87]  K. Grill-Spector,et al.  Neural representations of faces and limbs neighbor in human high-level visual cortex: evidence for a new organization principle , 2011, Psychological Research.

[88]  Claudia Gianelli,et al.  Acting in perspective: the role of body and language as social tools , 2013, Psychological research.