Distributional Formal Semantics

Abstract Natural language semantics has recently sought to combine the complementary strengths of formal and distributional approaches to meaning. More specifically, proposals have been put forward to augment formal semantic machinery with distributional meaning representations, thereby introducing the notion of semantic similarity into formal semantics, or to define distributional systems that aim to incorporate formal notions such as entailment and compositionality. However, given the fundamentally different ‘representational currency’ underlying formal and distributional approaches—models of the world versus linguistic co-occurrence—their unification has proven extremely difficult. Here, we define a Distributional Formal Semantics that integrates distributionality into a formal semantic system on the level of formal models. This approach offers probabilistic, distributed meaning representations that are also inherently compositional, and that naturally capture fundamental semantic notions such as quantification and entailment. Furthermore, we show how the probabilistic nature of these representations allows for probabilistic inference, and how the information-theoretic notion of “information” (measured in terms of Entropy and Surprisal) naturally follows from it. Finally, we illustrate how meaning representations can be derived incrementally from linguistic input using a recurrent neural network model, and how the resultant incremental semantic construction procedure intuitively captures key semantic phenomena, including negation, presupposition, and anaphoricity.

[1]  Matthew W. Crocker,et al.  Semantic Entropy in Language Comprehension , 2019, Entropy.

[2]  Katrin Erk,et al.  Vector Space Models of Word Meaning and Phrase Meaning: A Survey , 2012, Lang. Linguistics Compass.

[3]  C. R.,et al.  On referring , 1950 .

[4]  Alessandro Lenci,et al.  Distributional Models of Word Meaning , 2018 .

[5]  Shalom Lappin,et al.  当代语义理论指南 = The Handbook of Contemporary Semantic Theory , 2015 .

[6]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[7]  Roger C. Schank,et al.  Scripts, plans, goals and understanding: an inquiry into human knowledge structures , 1978 .

[8]  Matthew W. Crocker,et al.  Expectation-based Comprehension: Modeling the Interaction of World Knowledge and Linguistic Experience , 2019 .

[9]  ASA KASHER,et al.  ON THE SEMANTICS AND PRAGMATICS OF SPECIFIC AND NON-SPECIFIC INDEFINITE EXPRESSIONS , 1976 .

[10]  Emily M. Bender,et al.  Linguistic Fundamentals for Natural Language Processing II: 100 Essentials from Semantics and Pragmatics , 2019, Linguistic Fundamentals for Natural Language Processing II.

[11]  R. Levy Expectation-based syntactic comprehension , 2008, Cognition.

[12]  Andrew Y. Ng,et al.  Semantic Compositionality through Recursive Matrix-Vector Spaces , 2012, EMNLP.

[13]  Mirella Lapata,et al.  A Comparison of Vector-based Representations for Semantic Composition , 2012, EMNLP.

[14]  Christopher Potts,et al.  A large annotated corpus for learning natural language inference , 2015, EMNLP.

[15]  D. Davidson Truth and meaning , 2004, Synthese.

[16]  Mehrnoosh Sadrzadeh,et al.  Concrete Models and Empirical Evaluations for the Categorical Compositional Distributional Model of Meaning , 2015, CL.

[17]  Mirella Lapata,et al.  Composition in Distributional Models of Semantics , 2010, Cogn. Sci..

[18]  J. Mandler Language comprehension. , 1979, Science.

[19]  Stephen Clark,et al.  RELPRON: A Relative Clause Evaluation Data Set for Compositional Distributional Semantics , 2016, CL.

[20]  Nicholas Asher,et al.  Integrating Type Theory and Distributional Semantics: A Case Study on Adjective–Noun Compositions , 2016, CL.

[21]  Katrin Erk,et al.  Representing Meaning with a Combination of Logical and Distributional Models , 2015, CL.

[22]  Matthew W. Crocker,et al.  A Framework for Distributional Formal Semantics , 2019, WoLLIC.

[23]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.

[24]  Ioannis Korkontzelos,et al.  Estimating Linear Models for Compositional Distributional Semantics , 2010, COLING.

[25]  Katrin Erk,et al.  A Formal Approach to Linking Logical Form and Vector-Space Lexical Semantics , 2014 .

[26]  Gottlob Frege,et al.  Begriffsschrift, eine der arithmetischen nachgebildete Formelsprache des reinen Denkens , 1879 .

[27]  Johan Bos,et al.  The Groningen Meaning Bank , 2013, JSSP.

[28]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[29]  G. Fitzgerald,et al.  'I. , 2019, Australian journal of primary health.

[30]  Mathieu Koppen,et al.  Modeling knowledge-based inferences in story comprehension , 2003, Cogn. Sci..

[31]  Matthew W. Crocker,et al.  Connectionist Semantic Systematicity in Language Production , 2016, CogSci.

[32]  Scripts , 2019, The SAGE Encyclopedia of Human Communication Sciences and Disorders.

[33]  Stephen Clark,et al.  Vector Space Models of Lexical Meaning , 2015 .

[34]  I. Rooij,et al.  Connectionist semantic systematicity , 2009, Cognition.

[35]  Richard M. Golden,et al.  A parallel distributed processing model of story comprehension and recall , 1993 .

[36]  Matthew Richardson,et al.  Markov logic networks , 2006, Machine Learning.

[37]  Gemma Boleda,et al.  Formal Distributional Semantics: Introduction to the Special Issue , 2016, CL.

[38]  Marco Marelli,et al.  Spicy Adjectives and Nominal Donkeys: Capturing Semantic Deviance Using Compositionality in Distributional Spaces , 2017, Cogn. Sci..

[39]  Katrin Erk,et al.  What do you know about an alligator when you know the company it keeps , 2016 .

[40]  Marco Baroni,et al.  Frege in Space: A Program of Compositional Distributional Semantics , 2014 .

[41]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[42]  Uwe Reyle,et al.  From Discourse to Logic - Introduction to Modeltheoretic Semantics of Natural Language, Formal Logic and Discourse Representation Theory , 1993, Studies in linguistics and philosophy.

[43]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[44]  P. Strawson III.—ON REFERRING , 1950 .

[45]  John Hale,et al.  A Probabilistic Earley Parser as a Psycholinguistic Model , 2001, NAACL.

[46]  Emmanuele Chersoni,et al.  Towards a Distributional Model of Semantic Complexity , 2016, CL4LC@COLING 2016.

[47]  David C. Plaut,et al.  A connectionist model of sentence comprehension and production , 2002 .

[48]  Noortje J. Venhuizen,et al.  Neurobehavioral Correlates of Surprisal in Language Comprehension: A Neurocomputational Model , 2021, Frontiers in Psychology.

[49]  Patrick Pantel,et al.  From Frequency to Meaning: Vector Space Models of Semantics , 2010, J. Artif. Intell. Res..

[50]  Johan Bos,et al.  Discourse Semantics with Information Structure , 2018, J. Semant..

[51]  Marco Baroni,et al.  Nouns are Vectors, Adjectives are Matrices: Representing Adjective-Noun Constructions in Semantic Space , 2010, EMNLP.

[52]  Matthew W. Crocker,et al.  A Neurocomputational Model of the N400 and the P600 in Language Processing , 2016, Cognitive science.

[53]  Rudolf Carnap,et al.  Meaning and Necessity , 1947 .

[54]  John Hale,et al.  Uncertainty About the Rest of the Sentence , 2006, Cogn. Sci..

[55]  Stephen Clark,et al.  Mathematical Foundations for a Compositional Distributional Model of Meaning , 2010, ArXiv.