Reading visually embodied meaning from the brain: Visually grounded computational models decode visual-object mental imagery induced by written text

Embodiment theory predicts that mental imagery of object words recruits neural circuits involved in object perception. The degree of visual imagery present in routine thought and how it is encoded in the brain is largely unknown. We test whether fMRI activity patterns elicited by participants reading objects' names include embodied visual-object representations, and whether we can decode the representations using novel computational image-based semantic models. We first apply the image models in conjunction with text-based semantic models to test predictions of visual-specificity of semantic representations in different brain regions. Representational similarity analysis confirms that fMRI structure within ventral-temporal and lateral-occipital regions correlates most strongly with the image models and conversely text models correlate better with posterior-parietal/lateral-temporal/inferior-frontal regions. We use an unsupervised decoding algorithm that exploits commonalities in representational similarity structure found within both image model and brain data sets to classify embodied visual representations with high accuracy (8/10) and then extend it to exploit model combinations to robustly decode different brain regions in parallel. By capturing latent visual-semantic structure our models provide a route into analyzing neural representations derived from past perceptual experience rather than stimulus-driven brain activity. Our results also verify the benefit of combining multimodal data to model human-like semantic representations.

[1]  A. Ishai,et al.  Recollection- and Familiarity-Based Decisions Reflect Memory Strength , 2008, Frontiers in systems neuroscience.

[2]  Tom Michael Mitchell,et al.  Predicting Human Brain Activity Associated with the Meanings of Nouns , 2008, Science.

[3]  Rolf A. Zwaan,et al.  Language Comprehenders Mentally Represent the Shapes of Objects , 2002, Psychological science.

[4]  Nikolaus Kriegeskorte,et al.  The Emergence of Semantic Meaning in the Ventral Temporal Pathway , 2014, Journal of Cognitive Neuroscience.

[5]  Lawrence W. Barsalou,et al.  Language and simulation in conceptual processing , 2008 .

[6]  Keiji Tanaka,et al.  Matching Categorical Object Representations in Inferior Temporal Cortex of Man and Monkey , 2008, Neuron.

[7]  Nancy Kanwisher,et al.  fMRI evidence for objects as the units of attentional selection , 1999, Nature.

[8]  Andrew Zisserman,et al.  Video Google: a text retrieval approach to object matching in videos , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[9]  A. Ishai,et al.  Distributed and Overlapping Representations of Faces and Objects in Ventral Temporal Cortex , 2001, Science.

[10]  Rainer Goebel,et al.  Information-based functional brain mapping. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Thomas Serre,et al.  A feedforward architecture accounts for rapid categorization , 2007, Proceedings of the National Academy of Sciences.

[12]  Cordelia Schmid,et al.  Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[13]  Andrew Zisserman,et al.  The devil is in the details: an evaluation of recent feature encoding methods , 2011, BMVC.

[14]  Curt Burgess,et al.  Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[15]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[16]  D. Hassabis,et al.  Using Imagination to Understand the Neural Basis of Episodic Memory , 2007, The Journal of Neuroscience.

[17]  Nicu Sebe,et al.  In the eye of the beholder: employing statistical analysis and eye tracking for analyzing abstract paintings , 2012, ACM Multimedia.

[18]  Elia Bruni,et al.  VSEM: An open library for visual semantics representation , 2013, ACL.

[19]  Masaaki Kawahashi,et al.  Renovation of Journal of Visualization , 2010, J. Vis..

[20]  G. Aguirre,et al.  Different spatial scales of shape similarity representation in lateral and ventral LOC. , 2009, Cerebral cortex.

[21]  Elia Bruni,et al.  Multimodal Distributional Semantics , 2014, J. Artif. Intell. Res..

[22]  Tom Michael Mitchell,et al.  A Neurosemantic Theory of Concrete Noun Representation Based on the Underlying Brain Codes , 2010, PloS one.

[23]  J. Taylor,et al.  Episodic retrieval activates the precuneus irrespective of the imagery content of word pair associates. A PET study. , 1999, Brain : a journal of neurology.

[24]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[25]  Stephen Clark,et al.  Vector Space Models of Lexical Meaning , 2015 .

[26]  Alfonso Caramazza,et al.  Perception, action, and word meanings in the human brain: the case from action verbs , 2011, Annals of the New York Academy of Sciences.

[27]  A. Ishai,et al.  Distributed neural systems for the generation of visual images , 2000, NeuroImage.

[28]  M. Tarr,et al.  Visual Object Recognition , 1996, ISTCS.

[29]  Nikolaus Kriegeskorte,et al.  Frontiers in Systems Neuroscience Systems Neuroscience , 2022 .

[30]  Patrick Pantel,et al.  From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..

[31]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[32]  John A. Pyles,et al.  Comparing visual representations across human fMRI and computational vision. , 2013, Journal of vision.

[33]  Dermot Lynott,et al.  Principles of Representation: Why You Can't Represent the Same Concept Twice , 2014, Top. Cogn. Sci..

[34]  P. Dupont,et al.  Similarity of fMRI Activity Patterns in Left Perirhinal Cortex Reflects Semantic Similarity between Words , 2013, The Journal of Neuroscience.

[35]  A. A. Wijers,et al.  Visual semantic features are activated during the processing of concrete words: event-related potential evidence for perceptual semantic priming. , 2000, Brain research. Cognitive brain research.

[36]  B. Mesquita,et al.  Adjustment to Chronic Diseases and Terminal Illness Health Psychology : Psychological Adjustment to Chronic Disease , 2006 .

[37]  Li Fei-Fei,et al.  Neural mechanisms of rapid natural scene categorization in human visual cortex , 2009, Nature.

[38]  Thomas E. Nichols,et al.  Thresholding of Statistical Maps in Functional Neuroimaging Using the False Discovery Rate , 2002, NeuroImage.

[39]  Gabriella Vigliocco,et al.  Integrating experiential and distributional data to learn semantic representations. , 2009, Psychological review.

[40]  Rajeev D. S. Raizada,et al.  What Makes Different People's Representations Alike: Neural Similarity Space Solves the Problem of Across-subject fMRI Decoding , 2012, Journal of Cognitive Neuroscience.

[41]  Trevor Darrell,et al.  The pyramid match kernel: discriminative classification with sets of image features , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[42]  Chris I. Baker,et al.  Disentangling visual imagery and perception of real-world objects , 2012, NeuroImage.

[43]  Johan Wagemans,et al.  Perceived Shape Similarity among Unfamiliar Objects and the Organization of the Human Object Vision Pathway , 2008, The Journal of Neuroscience.

[44]  A. Caramazza,et al.  Brain Regions That Represent Amodal Conceptual Knowledge , 2013, The Journal of Neuroscience.

[45]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[46]  J. H. Steiger Tests for comparing elements of a correlation matrix. , 1980 .

[47]  Lawrence W Barsalou,et al.  Simulation, situated conceptualization, and prediction , 2009, Philosophical Transactions of the Royal Society B: Biological Sciences.

[48]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[49]  Thomas Serre,et al.  Reading the mind's eye: Decoding category information during mental imagery , 2010, NeuroImage.

[50]  M. Louwerse,et al.  Neurological Evidence Linguistic Processes Precede Perceptual Simulation in Conceptual Processing , 2012, Front. Psychology.

[51]  Naokazu Goda,et al.  Transformation from image-based to perceptual representation of materials along the human ventral visual pathway , 2011, NeuroImage.

[52]  Marina Schmid,et al.  Imagery And Verbal Processes , 2016 .

[53]  Rutvik H. Desai,et al.  The neurobiology of semantic memory , 2011, Trends in Cognitive Sciences.

[54]  F. Pulvermüller How neurons make meaning: brain mechanisms for embodied and abstract-symbolic semantics , 2013, Trends in Cognitive Sciences.

[55]  Michael N. Jones,et al.  Perceptual Inference Through Global Lexical Similarity , 2012, Top. Cogn. Sci..

[56]  I. Johnsrude,et al.  Somatotopic Representation of Action Words in Human Motor and Premotor Cortex , 2004, Neuron.

[57]  J. Duncan,et al.  Top-Down Activation of Shape-Specific Population Codes in Visual Cortex during Mental Imagery , 2009, The Journal of Neuroscience.

[58]  J. S. Guntupalli,et al.  The Representation of Biological Classes in the Human Brain , 2012, The Journal of Neuroscience.

[59]  Friedemann Pulvermüller,et al.  The Word Processing Deficit in Semantic Dementia: All Categories Are Equal, but Some Categories Are More Equal than Others , 2010, Journal of Cognitive Neuroscience.

[60]  John Duncan,et al.  Shape-specific preparatory activity mediates attention to targets in human visual cortex , 2009, Proceedings of the National Academy of Sciences.

[61]  Alessandro Lenci,et al.  Distributional semantics in linguistic and cognitive research , 2008 .

[62]  Anna Korhonen,et al.  Using fMRI activation to conceptual stimuli to evaluate methods for extracting conceptual representations from corpora , 2010, HLT-NAACL 2010.

[63]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[64]  Kenneth Ward Church,et al.  Word Association Norms, Mutual Information, and Lexicography , 1989, ACL.

[65]  Natalie M. Trumpp,et al.  Losing the sound of concepts: Damage to auditory association cortex impairs the processing of sound-related concepts , 2013, Cortex.

[66]  Leslie G. Ungerleider,et al.  Distributed Neural Systems for the Generation of Visual Images , 2000, Neuron.

[67]  Tom M. Mitchell,et al.  Selecting Corpus-Semantic Models for Neurolinguistic Decoding , 2012, *SEMEVAL.

[68]  M. Kiefer,et al.  The Sound of Concepts: Four Markers for a Link between Auditory and Conceptual Brain Systems , 2008, The Journal of Neuroscience.

[69]  P. Duygulu,et al.  Visual categorization with bags of keypoints , 2002, eccv 2002.

[70]  Massimo Poesio,et al.  Of Words, Eyes and Brains: Correlating Image-Based Distributional Semantic Models with Neural Representations of Concepts , 2013, EMNLP.

[71]  William W. Graves,et al.  Where is the semantic system? A critical review and meta-analysis of 120 functional neuroimaging studies. , 2009, Cerebral cortex.

[72]  Bradford Z. Mahon,et al.  Concepts and categories: a cognitive neuropsychological perspective. , 2009, Annual review of psychology.

[73]  L. Tyler,et al.  Representational Similarity Analysis Reveals Commonalities and Differences in the Semantic Processing of Words and Objects , 2013, The Journal of Neuroscience.

[74]  Anna C. Nobre,et al.  Imagery for shapes activates position-invariant representations in human visual cortex , 2011, NeuroImage.

[75]  M. Louwerse Embodied relations are encoded in language , 2008, Psychonomic bulletin & review.

[76]  S. Kosslyn,et al.  When is early visual cortex activated during visual mental imagery? , 2003, Psychological bulletin.

[77]  W. Glaser Picture naming , 1992, Cognition.

[78]  Nikolaus Kriegeskorte,et al.  Deep Supervised, but Not Unsupervised, Models May Explain IT Cortical Representation , 2014, PLoS Comput. Biol..