USING ENRICHED SEMANTIC REPRESENTATIONS IN PREDICTIONS OF HUMAN BRAIN ACTIVITY

There have been many different theoretical proposals for ways of representing word meaning in a distributed fashion. We ourselves have put forward a framework for expressing aspects of lexical semantics in terms of patterns of word co-occurrences measured in large linguistic corpora. Recent advances in the modelling of fMRI measures of brain activity have started to examine patterns of activation across the cortex rather than averaging activity across a sub-volume. Mitchell et al. [11] have shown that simple linear models can successfully predict fMRI data from patterns of word co-occurrence for a task where participants mentally generate properties for presented word-picture pairs. Using their MRI data, we replicate their models and extend them to use our independently optimised co-occurrence patterns to demonstrate that enriched representations of word/concept meaning produce significantly better predictions of brain activity. We also explore several aspects of the parameter space underlying the supervised learning techniques used in these models.

[1]  Tom Michael Mitchell,et al.  Predicting Human Brain Activity Associated with the Meanings of Nouns , 2008, Science.

[2]  J. Bullinaria Semantic Categorization Using Simple Word Co-occurrence Statistics , 2022 .

[3]  L. K. Tyler,et al.  Conceptual Structure and the Structure of Concepts: A Distributed Account of Category-Specific Deficits , 2000, Brain and Language.

[4]  Curt Burgess,et al.  Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[5]  J. Bullinaria,et al.  Extracting semantic representations from word co-occurrence statistics: A computational study , 2007, Behavior research methods.

[6]  G. Karypis,et al.  Criterion Functions for Document Clustering ∗ Experiments and Analysis , 2001 .

[7]  Shahram Khadivi,et al.  WordNet Based Features for Predicting Brain Activity associated with meanings of nouns , 2010, HLT-NAACL 2010.

[8]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[9]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[10]  Francisco Pereira,et al.  Learning semantic features for fMRI data from definitional text , 2010, HLT-NAACL 2010.

[11]  Robert M. French,et al.  Four Problems with Extracting Human Semantics from Large Text Corpora , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[12]  Anna Korhonen,et al.  Using fMRI activation to conceptual stimuli to evaluate methods for extracting conceptual representations from corpora , 2010, HLT-NAACL 2010.