Bridging the theoretical gap between semantic representation models without the pressure of a ranking: some lessons learnt from LSA

In recent years, latent semantic analysis (LSA) has reached a level of maturity at which its presence is ubiquitous in technology as well as in simulation of cognitive processes. In spite of this, in recent years there has been a trend of subjecting LSA to some criticisms, usually because it is compared to other models in very specific tasks and conditions and sometimes without having good knowledge of what the semantic representation of LSA means, and without exploiting all the possibilities of which LSA is capable other than the cosine. This paper provides a critical review to clarify some of the misunderstandings regarding LSA and other space models. The historical stability of the predecessors of LSA, the representational structure of word meaning and the multiple topologies that could arise from a semantic space, the computation of similarity, the myth that LSA dimensions have no meaning, the computational and algorithm plausibility to account for meaning acquisition in LSA (in contrast to others models based on online mechanisms), the possibilities of spatial models to substantiate recent proposals, and, in general, the characteristics of classic vector models and their ease and flexibility to simulate some cognitive phenomena will be reviewed. The review highlights the similarity between LSA and other techniques and proposes using long LSA experiences in other models, especially in predicting models such as word2vec. In sum, it emphasizes the lessons that can be learned from comparing LSA-based models to other models, rather than making statements about “the best.”

[1]  Marco Marelli,et al.  Vector-Space Models of Semantic Representation From a Cognitive Perspective: A Discussion of Common Misconceptions , 2019, Perspectives on psychological science : a journal of the Association for Psychological Science.

[2]  H. Kaiser The varimax criterion for analytic rotation in factor analysis , 1958 .

[3]  Albert Costa,et al.  The effect of foreign language in fear acquisition , 2018, Scientific Reports.

[4]  Adam Tauman Kalai,et al.  Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings , 2016, NIPS.

[5]  Arthur C. Graesser,et al.  Strengths, Limitations, and Extensions of LSA , 2007 .

[6]  Erkki Sutinen,et al.  Applying Latent Dirichlet Allocation to Automatic Essay Grading , 2006, FinTAL.

[7]  Ronan Collobert,et al.  Rehabilitation of Count-Based Models for Word Vector Representations , 2015, CICLing.

[8]  Ricardo Olmos,et al.  Transforming LSA space dimensions into a rubric for an automatic assessment and feedback system , 2016, Inf. Process. Manag..

[9]  Guillermo Jorge-Botana,et al.  Visualizing polysemy using LSA and the predication algorithm , 2010, J. Assoc. Inf. Sci. Technol..

[10]  Geraint A. Wiggins,et al.  Re-Representing Metaphor: Modeling Metaphor Perception Using Dynamically Contextual Distributional Semantics , 2019, Front. Psychol..

[11]  Michael N. Jones,et al.  Redundancy in Perceptual and Linguistic Experience: Comparing Feature-Based and Distributional Models of Semantic Representation , 2010, Top. Cogn. Sci..

[12]  Refractor Vision , 2000, The Lancet.

[13]  W. Kintsch,et al.  Metaphor Comprehension: What Makes a Metaphor Difficult to Understand? , 2002 .

[14]  Riitta Salmelin,et al.  Reconstructing meaning from bits of information , 2018, Nature Communications.

[15]  Jon A. Willits,et al.  Models of Semantic Memory , 2015 .

[16]  M. T. Argilaga Construcción de modelos en psicologí­a , 1977 .

[17]  Joshua B. Tenenbaum,et al.  The Large-Scale Structure of Semantic Networks: Statistical Analyses and a Model of Semantic Growth , 2001, Cogn. Sci..

[18]  Thomas K. Landauer,et al.  Word Maturity: Computational Modeling of Word Knowledge , 2011, ACL.

[19]  W. Kintsch,et al.  A Computational Theory of Complex Problem Solving Using Latent Semantic Analysis , 2019, Proceedings of the Twenty-Fourth Annual Conference of the Cognitive Science Society.

[20]  Vipul Jain,et al.  A journey from normative to behavioral operations in supply chain management: A review using Latent Semantic Analysis , 2015, Expert Syst. Appl..

[21]  W. Kintsch,et al.  The construction-integration model: A framework for studying memory for text. , 1991 .

[22]  Thomas Mandl Tolerant and Adaptive Information Retrieval with Neural Networks , 2001 .

[23]  Walter Kintsch,et al.  Predication , 2001, Cogn. Sci..

[24]  Paul Douglas,et al.  International Conference on Information Technology : Coding and Computing , 2003 .

[25]  Danielle S. McNamara,et al.  Handbook of latent semantic analysis , 2007 .

[26]  V. Kshirsagar,et al.  Face recognition using Eigenfaces , 2011, 2011 3rd International Conference on Computer Research and Development.

[27]  Riitta Salmelin,et al.  Reconstructing meaning from bits of information , 2018 .

[28]  Fritz Günther,et al.  Latent semantic analysis cosines as a cognitive similarity measure: Evidence from priming studies , 2016, Quarterly journal of experimental psychology.

[29]  Paul Broek,et al.  A computational modeling of semantic knowledge in reading comprehension: Integrating the landscape model with latent semantic analysis , 2016, Behavior Research Methods.

[30]  Michael L. Littman,et al.  Measuring praise and criticism: Inference of semantic orientation from association , 2003, TOIS.

[31]  Ricardo Olmos,et al.  The representation of polysemy through vectors: some building blocks for constructing models and applications with LSA , 2011 .

[32]  Michael N. Jones,et al.  In Defense of Spatial Models of Lexical Semantics , 2011, CogSci.

[33]  Bruce D. McCandliss,et al.  Development of neural systems for reading. , 2007, Annual review of neuroscience.

[34]  Margareta Friman,et al.  What Drives Them to Drive?—Parents' Reasons for Choosing the Car to Take Their Children to School , 2017, Front. Psychol..

[35]  Yasar Tonta,et al.  Diffusion of latent semantic analysis as a research tool: A social network analysis approach , 2010, J. Informetrics.

[36]  L. Wittgenstein Philosophical investigations = Philosophische Untersuchungen , 1958 .

[37]  T. Shultz Computational Developmental Psychology , 2003 .

[38]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[39]  Walter Kintsch,et al.  Comprehension: A Paradigm for Cognition , 1998 .

[40]  Richard P. Cooper,et al.  On the Relation Between Marr's Levels: A Response to Blokpoel (2017) , 2018, Top. Cogn. Sci..

[41]  Danielle S. McNamara,et al.  Age of Exposure: A Model of Word Learning , 2016, AAAI.

[42]  Christian Biemann,et al.  Simple Co-Occurrence Statistics Reproducibly Predict Association Ratings , 2018, Cogn. Sci..

[43]  Xiaoping P. Hu,et al.  fMRI evidence for word association and situated simulation in conceptual processing , 2008, Journal of Physiology-Paris.

[44]  John Burnheim,et al.  Philosophical Investigations: Philosophische Untersuchungen , 1954 .

[45]  Markus J. Hofmann,et al.  If You Don't Have Valence, Ask Your Neighbor: Evaluation of Neutral Words as a Function of Affective Semantic Associates , 2017, Front. Psychol..

[46]  G. Denhière,et al.  A Computational Model of Children's Semantic Memory , 2004 .

[47]  Brian D. Haig Analogical modeling: a strategy for developing theories in psychology , 2013, Front. Psychol..

[48]  Simona Balbi,et al.  COMPARING ADVERTISING CAMPAIGNS BY MEANS OF TEXTUAL DATA ANALYSIS WITH EXTERNAL INFORMATION , 2000 .

[49]  José A. Martínez-Huertas,et al.  Analyzing Two Automatic Latent Semantic Analysis (LSA) Assessment Methods (Inbuilt Rubric vs. Golden Summary) in Summaries Extracted from Expository Texts , 2018 .

[50]  L. DeLisi,et al.  An automated method to analyze language use in patients with schizophrenia and their first-degree relatives , 2010, Journal of Neurolinguistics.

[51]  Danielle S. McNamara,et al.  Computational Methods to Extract Meaning From Text and Advance Theories of Human Cognition , 2011, Top. Cogn. Sci..

[52]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[53]  Walter Kintsch,et al.  Symbol systems and perceptual representations , 2008 .

[54]  Max M. Louwerse,et al.  Symbol Interdependency in Symbolic and Embodied Cognition , 2011, Top. Cogn. Sci..

[55]  Georgiana Dinu,et al.  Don’t count, predict! A systematic comparison of context-counting vs. context-predicting semantic vectors , 2014, ACL.

[56]  Michael N Jones,et al.  Representing word meaning and order information in a composite holographic lexicon. , 2007, Psychological review.

[57]  Christoph Lofi,et al.  Measuring Semantic Similarity and Relatedness with Distributional and Knowledge- based Approaches , 2015 .

[58]  Gabriella Vigliocco,et al.  Integrating experiential and distributional data to learn semantic representations. , 2009, Psychological review.

[59]  Walter Kintsch Similarity as a function of semantic distance and amount of knowledge. , 2014, Psychological review.

[60]  Thomas Mandl Efficient Preprocessing for Information Retrieval with Neural Networks , 1999 .

[61]  Lucian L. Visinescu,et al.  Text-mining the voice of the people , 2012, Commun. ACM.

[62]  Jácint Szabó,et al.  A Comparative Analysis of Latent Variable Models for Web Page Classification , 2008, 2008 Latin American Web Conference.

[63]  Paul K. Feyerabend Contra el método , 2002 .

[64]  Larry P. Heck,et al.  Learning deep structured semantic models for web search using clickthrough data , 2013, CIKM.

[65]  David A. Medler,et al.  Distinct Brain Systems for Processing Concrete and Abstract Concepts , 2005, Journal of Cognitive Neuroscience.

[66]  Gabriel Recchia,et al.  Reproducing affective norms with lexical co-occurrence statistics: Predicting valence, arousal, and dominance , 2015, Quarterly journal of experimental psychology.

[67]  Barbara Kaup,et al.  Symbol Grounding Without Direct Experience: Do Words Inherit Sensorimotor Activation From Purely Linguistic Context? , 2018, Cognitive science.

[68]  A. Mendrek,et al.  Event-related brain potentials to emotional images and gonadal steroid hormone levels in patients with schizophrenia and paired controls , 2014, Front. Psychol..

[69]  Allan Paivio,et al.  IMAGERY AND LANGUAGE , 1971 .

[70]  P. Kwantes Using context to build semantics , 2005, Psychonomic bulletin & review.

[71]  Ricardo Olmos,et al.  Word maturity indices with latent semantic analysis: why, when, and where is Procrustes rotation applied? , 2018, Wiley interdisciplinary reviews. Cognitive science.

[72]  Christian Balkenius,et al.  Spaces in the Brain: From Neurons to Meanings , 2016, Front. Psychol..

[73]  Alessandro Lenci,et al.  One Distributional Memory, Many Semantic Spaces , 2009, Proceedings of the Workshop on Geometrical Models of Natural Language Semantics - GEMS '09.

[74]  Sonia Bergamaschi,et al.  Comparing LDA and LSA Topic Models for Content-Based Movie Recommendation Systems , 2014, WEBIST.

[75]  Fan Jiang,et al.  Learning a Language-Independent Representation for Terms from a Partially Aligned Corpus , 1998, ICML.

[76]  John R. Skoyles,et al.  Autism, Context/Noncontext Information Processing, and Atypical Development , 2011, Autism research and treatment.

[77]  Dominic Widdows,et al.  Using LSA and Noun Coordination Information to Improve the Recall and Precision of Automatic Hyponymy Extraction , 2003, CoNLL.

[78]  Mark Steyvers,et al.  Topics in semantic representation. , 2007, Psychological review.

[79]  Sudeep Bhatia Associative Judgment and Vector Space Semantics , 2017, Psychological review.

[80]  L. T. Robertson Memory and the brain. , 2002, Journal of dental education.

[81]  Arthur M. Jacobs,et al.  Features of word similarity , 2018, ArXiv.

[82]  Simona Balbi,et al.  Rotated Canonical Correlation Analysis for Multilingual Corpora , 2006 .

[83]  James L. McClelland,et al.  Semantic Cognition: A Parallel Distributed Processing Approach , 2004 .

[84]  Susan T. Dumais,et al.  Statistical semantics: How can a computer use what people name things to guess what things people mean when they name things? , 1982, CHI '82.

[85]  P. van den Broek,et al.  The role of textual semantic constraints in knowledge-based inference generation during reading comprehension: A computational approach , 2015, Memory.

[86]  Shimon Edelman,et al.  Representation, similarity, and the chorus of prototypes , 1993, Minds and Machines.

[87]  A. C. Rosander,et al.  The Measurement of Social Attitudes , 1937 .

[88]  M. Louwerse,et al.  Neurological Evidence Linguistic Processes Precede Perceptual Simulation in Conceptual Processing , 2012, Front. Psychology.

[89]  Yoed N. Kenett,et al.  Semantic organization in children with cochlear implants: computational analysis of verbal fluency , 2013, Front. Psychol..

[90]  M. Engelmann The Philosophical Investigations , 2013 .

[91]  Omer Levy,et al.  Improving Distributional Similarity with Lessons Learned from Word Embeddings , 2015, TACL.

[92]  Costin-Gabriel Chiru,et al.  Comparison between LSA-LDA-Lexical Chains , 2014, WEBIST.

[93]  Pablo Gamallo,et al.  Is singular value decomposition useful for word similarity extraction? , 2011, Lang. Resour. Evaluation.

[94]  W. Kintsch,et al.  Strategies of discourse comprehension , 1983 .

[95]  I. Johnsrude,et al.  Somatotopic Representation of Action Words in Human Motor and Premotor Cortex , 2004, Neuron.

[96]  Rifat Ozcan,et al.  Concept-based information access , 2005, International Conference on Information Technology: Coding and Computing (ITCC'05) - Volume II.

[97]  Peter Wiemer-Hastings,et al.  Latent semantic analysis , 2004, Annu. Rev. Inf. Sci. Technol..

[98]  Ricardo Olmos,et al.  Latent Semantic Analysis Parameters for Essay Evaluation using Small-Scale Corpora* , 2010, J. Quant. Linguistics.

[99]  Omer Levy,et al.  Neural Word Embedding as Implicit Matrix Factorization , 2014, NIPS.

[100]  Arthur M. Glenberg,et al.  Constraint on covariation: It's not meaning , 2008 .

[101]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[102]  A. Martin,et al.  Structural Basis of Semantic Memory , 2008 .

[103]  Keith K. Millis,et al.  Applying the Construction-Integration Framework to Aesthetic Responses to Representational Artworks , 2008 .

[104]  Walter Kintsch,et al.  The Construction of Meaning , 2011, Top. Cogn. Sci..

[105]  C. Schunn The Presence and Absence of Category Knowledge in LSA , 2020, Proceedings of the Twenty First Annual Conference of the Cognitive Science Society.

[106]  A. Tversky Features of Similarity , 1977 .

[107]  Lorenz Sichelschmidt,et al.  Reconceptualizing Latent Semantic Analysis in Terms of Complex Network Theory , 2006 .

[108]  Emmanuel M. Pothos,et al.  Towards a Quantum Probability Theory of Similarity Judgments , 2014, QI.

[109]  A. Wierzbicka Semantics: Primes and Universals , 1996 .

[110]  Ren-Hao Pan,et al.  ADDING NEW CONCEPTS ON THE DOMAIN ONTOLOGY BASED ON SEMANTIC SIMILARITY , 2006 .

[111]  Max M. Louwerse,et al.  Knowing the Meaning of a Word by the Linguistic and Perceptual Company It Keeps , 2018, Top. Cogn. Sci..

[112]  Matthew A. Lambon Ralph,et al.  Semantic Diversity Accounts for the “Missing” Word Frequency Effect in Stroke Aphasia: Insights Using a Novel Method to Quantify Contextual Variability in Meaning , 2011, Journal of Cognitive Neuroscience.

[113]  J. Townsend,et al.  The Oxford Handbook of Computational and Mathematical Psychology , 2015 .

[114]  Omer Levy,et al.  word2vec Explained: deriving Mikolov et al.'s negative-sampling word-embedding method , 2014, ArXiv.

[115]  Penny M. Pexman,et al.  The role of embodiment in conceptual development , 2019 .

[116]  Kuan-Yu Chen,et al.  Leveraging Relevance Cues for Improved Spoken Document Retrieval , 2011, INTERSPEECH.

[117]  Emmanuel M. Pothos,et al.  Similarity Judgments: From Classical to Complex Vector Psychological Spaces , 2016 .

[118]  Yves Bestgen,et al.  Checking and bootstrapping lexical norms by means of word similarity indexes , 2012, Behavior Research Methods.

[119]  Verónica Dahl,et al.  Meaning in Context , 2005, CONTEXT.

[120]  Benoît Lemaire,et al.  A computational model for simulating text comprehension , 2006, Behavior research methods.

[121]  Jerome R Busemeyer,et al.  Can quantum probability provide a new direction for cognitive modeling? , 2013, The Behavioral and brain sciences.

[122]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[123]  Javier Sainz Conceptos naturales y conceptos artificiales , 1991 .

[124]  Victor R. Prybutok,et al.  Latent Semantic Analysis: five methodological recommendations , 2012, Eur. J. Inf. Syst..

[125]  Tom Michael Mitchell,et al.  Predicting Human Brain Activity Associated with the Meanings of Nouns , 2008, Science.

[126]  Ricardo Olmos,et al.  Transforming Selected Concepts Into Dimensions in Latent Semantic Analysis , 2014 .

[127]  Lawrence W. Barsalou,et al.  Language and simulation in conceptual processing , 2008 .

[128]  Patrick F. Reidy An Introduction to Latent Semantic Analysis , 2009 .

[129]  Xue Li,et al.  A Correlation Analysis on LSA and HAL Semantic Space Models , 2004, CIS.

[130]  Rolf A. Zwaan,et al.  Spatial iconicity affects semantic relatedness judgments , 2003, Psychonomic bulletin & review.

[131]  David M. Blei,et al.  Probabilistic topic models , 2012, Commun. ACM.

[132]  Peter W. Foltz,et al.  Category fluency, latent semantic analysis and schizophrenia: a candidate gene approach , 2014, Cortex.

[133]  Ricardo Olmos,et al.  Could LSA become a "Bifactor" model? Towards a model with general and group factors , 2019, Expert Syst. Appl..

[134]  Ricardo Olmos,et al.  The role of domain knowledge in cognitive modeling of information search , 2017, Information Retrieval Journal.

[135]  Massimiliano Palmiero,et al.  The format of mental imagery: from a critical review to an integrated embodied representation approach , 2019, Cognitive Processing.

[136]  Tat-Jun Chin,et al.  Incremental kernel SVD for face recognition with image sets , 2006, 7th International Conference on Automatic Face and Gesture Recognition (FGR06).

[137]  Geoff Hollis,et al.  The principals of meaning: Extracting semantic dimensions from co-occurrence models of semantics , 2016, Psychonomic Bulletin & Review.

[138]  Andy P Field,et al.  The verbal information pathway to fear and heart rate changes in children. , 2007, Journal of child psychology and psychiatry, and allied disciplines.

[139]  C. Krumhansl Concerning the Applicability of Geometric Models to Similarity Data : The Interrelationship Between Similarity and Spatial Density , 2005 .

[140]  M. Brysbaert,et al.  Explaining human performance in psycholinguistic tasks with models of semantic similarity based on prediction and counting : A review and empirical validation , 2017 .

[141]  Geoff Hollis,et al.  Extrapolating human judgments from skip-gram vector representations of word meaning , 2017, Quarterly journal of experimental psychology.

[142]  N. Foo Conceptual Spaces—The Geometry of Thought , 2022 .

[143]  Arthur C. Graesser,et al.  Pooling Word Vector Representations Across Models , 2017, CICLing.

[144]  P. Gärdenfors Conceptual Spaces as a Basis for Cognitive Semantics , 1996 .

[145]  Ricardo Olmos,et al.  Predicting Word Maturity from Frequency and Semantic Diversity: A Computational Study , 2017 .

[146]  T. Rogers,et al.  Semantic diversity: A measure of semantic ambiguity based on variability in the contextual usage of words , 2012, Behavior Research Methods.

[147]  Paul K. Feyerabend,et al.  Contra el método: esquema de una teoría anarquista del conocimiento , 1981 .

[148]  Thomas K. Landauer Latent Semantic Analysis (LSA), a disembodied learning machine, acquires human word meaning vicariously from language alone , 1999 .

[149]  Curt Burgess,et al.  Producing high-dimensional semantic spaces from lexical co-occurrence , 1996 .

[150]  Douglas L. Hintzman,et al.  MINERVA 2: A simulation model of human memory , 1984 .

[151]  Peter W. Foltz,et al.  Models of Vocabulary Acquisition: Direct Tests and Text-Derived Simulations of Vocabulary Growth , 2014 .

[152]  R. Olmos,et al.  How lexical ambiguity distributes activation to semantic neighbors: Some possible consequences within a computational framework , 2014 .

[153]  R. Shepard,et al.  Second-order isomorphism of internal representations: Shapes of states ☆ , 1970 .

[154]  Rolf A. Zwaan,et al.  Language Encodes Geographical Information , 2009, Cogn. Sci..

[155]  S. McDonald,et al.  Models of high-dimensional semantic space predict language-mediated eye movements in the visual world. , 2006, Acta psychologica.

[156]  J. Elman Learning and development in neural networks: the importance of starting small , 1993, Cognition.

[157]  Thomas K. Landauer,et al.  Word Maturity: A New Metric for Word Knowledge , 2011 .

[158]  H. Kolk,et al.  Three-step priming in lexical decision , 2002, Memory & cognition.

[159]  Alessandro Lenci,et al.  Distributional semantics in linguistic and cognitive research , 2008 .

[160]  Peter M. Wiemer-Hastings,et al.  How Latent is Latent Semantic Analysis? , 1999, IJCAI.

[161]  Duane T. Wegener,et al.  Evaluating the use of exploratory factor analysis in psychological research. , 1999 .

[162]  W. Kintsch Metaphor comprehension: A computational theory , 2000, Psychonomic bulletin & review.

[163]  Thomas T. Hills,et al.  Small Worlds and Semantic Network Growth in Typical and Late Talkers , 2011, PloS one.

[164]  Mariano Sigman,et al.  Comparative study of LSA vs Word2vec embeddings in small corpora: a case study in dreams database , 2016, ArXiv.

[165]  James L. McClelland,et al.  Concepts, Control, and Context: A Connectionist Account of Normal and Disordered Semantic Cognition , 2018, Psychological review.

[166]  Robert F. Hadley On The Proper Treatment of Semantic Systematicity , 2004, Minds and Machines.

[167]  Gabriel Recchia,et al.  In defense of spatial models of semantic representation , 2017, New Ideas in Psychology.