Modeling the acquisition of quantifier semantics: a case study in function word learnability

This paper studies the acquisition of quantifier meanings as a case study of function word learnability. We suggest that learners construct semantic representations of quantifiers and other function words using a compositional statistical learning mechanism that operates over a small set of domaingeneral cognitive primitives. We present a simple cross-situational learning model that provably solves key learning problems in this domain, using a developmentally-plausible amount of data. We additionally evaluate the utility of proposed constraints on quantifier meaning, and show that learning in an unrestricted space of meanings is not substantially more difficult than learning in highly-constrained frameworks.

[1]  P. Schlenker Maximize Presupposition and Gricean reasoning , 2012 .

[2]  J. Tenenbaum,et al.  Word learning as Bayesian inference. , 2007, Psychological review.

[3]  Kenny Smith,et al.  Cross-Situational Learning: A Mathematical Approach , 2006, EELC.

[4]  Robin Clark,et al.  Learning First Order Quantifier Denotations An Essay in Semantic Learnability , 1996 .

[5]  P. Smolensky The Initial State and 'Richness of the Base' in Optimality Theory , 1996 .

[6]  Jacob Feldman,et al.  Minimization of Boolean complexity in human concept learning , 2000, Nature.

[7]  David Warden,et al.  The Influence of Context on Children's Use of Identifying Expressions and References. , 1976 .

[8]  J. Tenenbaum,et al.  The learnability of abstract syntactic principles , 2011, Cognition.

[9]  Noah D. Goodman Learning and the language of thought , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[10]  Hans Joerg Tiede,et al.  Identifiability in the Limit of Context-Free Generalized Quantifiers , 1999 .

[11]  J. Tenenbaum,et al.  Probabilistic models of cognition: exploring representations and inductive biases , 2010, Trends in Cognitive Sciences.

[12]  Joshua B. Tenenbaum,et al.  Theory Acquisition as Stochastic Search , 2010 .

[13]  Mark Steedman,et al.  Computational Grammar Acquisition from CHILDES Data Using a Probabilistic Parsing Model , 2009 .

[14]  K. Fintel Would You Believe It? The King of France is Back! (Presuppositions and Truth-Value Intuitions) , 2001 .

[15]  L. M. M.-T. Theory of Probability , 1929, Nature.

[16]  Francis Jeffry Pelletier,et al.  Representation and Inference for Natural Language: A First Course in Computational Semantics , 2005, Computational Linguistics.

[17]  Stephen Crain,et al.  Navigating negative quantificational space , 2000 .

[18]  B. MacWhinney The CHILDES project: tools for analyzing talk , 1992 .

[19]  Thomas Roeper,et al.  The Acquisition Path of Quantifiers: Two Kinds of Spreading , 2004 .

[20]  Dana Angluin,et al.  Learning Regular Sets from Queries and Counterexamples , 1987, Inf. Comput..

[21]  Edward L. Keenan,et al.  A semantic characterization of natural language determiners , 1986 .

[22]  James Jay Horning,et al.  A study of grammatical inference , 1969 .

[23]  E. Mark Gold,et al.  Language Identification in the Limit , 1967, Inf. Control..

[24]  Irene Heim,et al.  Semantics in generative grammar , 1998 .

[25]  R. Weale Vision. A Computational Investigation Into the Human Representation and Processing of Visual Information. David Marr , 1983 .

[26]  Joshua B. Tenenbaum,et al.  Beyond Boolean logic: exploring representation languages for learning complex concepts , 2010 .

[27]  Ming Li,et al.  An Introduction to Kolmogorov Complexity and Its Applications , 1997, Texts in Computer Science.

[28]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[29]  Richard Montague,et al.  The Proper Treatment of Quantification in Ordinary English , 1973 .

[30]  Chen Yu,et al.  A unified model of early word learning: Integrating statistical and social cues , 2007, Neurocomputing.

[31]  Charles Reiss,et al.  The Subset Principle in Phonology: Why the tabula can't be rasa , 2003 .

[32]  F. Kessel The Emergence of Set-Relational Quantifiers in Early Childhood , 2013 .

[33]  Yarden Katz,et al.  Modeling Semantic Cognition as Logical Dimensionality Reduction , 2008 .

[34]  Hagit Borer,et al.  The Maturation of Syntax , 1987 .

[35]  M. Rita Manzini,et al.  Parameters and Learnability in Binding Theory , 1987 .

[36]  C. Hanlon Acquisition of set-relational quantifiers in early childhood , 1987 .

[37]  Noah D. Goodman,et al.  Knowledge and implicature: Modeling language understanding as social cognition , 2012, CogSci.

[38]  Jeffrey Lidz,et al.  The Meaning of ‘Most’: Semantics, Numerosity and Psychology , 2009 .

[39]  J. Barwise,et al.  Generalized quantifiers and natural language , 1981 .

[40]  Nina Gierasimczuk,et al.  The Problem of Learning the Semantics of Quantifiers , 2007, TbiLLC.

[41]  Arnim von Stechow,et al.  Artikel und Definitheit Articles and Definiteness , 1991 .

[42]  Joshua B. Tenenbaum,et al.  Learning and using relational theories , 2007, NIPS.

[43]  Thomas L. Griffiths,et al.  A Rational Analysis of Rule-Based Concept Learning , 2008, Cogn. Sci..

[44]  William Churchill Houston Philip Event quantification in the acquisition of universal quantification , 1995 .

[45]  Dag Westerståhl,et al.  Generalized Quantifiers in Linguistics and Logic , 1997, Handbook of Logic and Language.

[46]  Noah D. Goodman,et al.  Learning a theory of causality. , 2011, Psychological review.

[47]  Marcin Mostowski,et al.  Computational semantics for monadic quantifiers , 1998, J. Appl. Non Class. Logics.

[48]  J. Benthem Essays in Logical Semantics , 1986 .

[49]  Heejeong Ko,et al.  Semantic universals and variation in L2 article choice.Proceedings of the 9th Generative approaches to Second Language Acquisition Conference (GASLA 2007). , 2008 .

[50]  Annette Karmiloff-Smith,et al.  A Functional Approach to Child Language: A Study of Determiners and Reference , 1979 .

[51]  Roger Levy,et al.  That's what she (could have) said: How alternative utterances affect language use , 2012, CogSci.

[52]  A. Mostowski On a generalization of quantifiers , 1957 .

[53]  J. Siskind A computational study of cross-situational techniques for learning word-to-meaning mappings , 1996, Cognition.

[54]  Meaning and Dyslexia: A study on pronouns, aspect, and quantification , 2010 .

[55]  David Barner,et al.  Finding one’s meaning: A test of the relation between quantifiers and integers in language development , 2009, Cognitive Psychology.

[56]  N. Chater,et al.  Simplicity: a unifying principle in cognitive science? , 2003, Trends in Cognitive Sciences.

[57]  Mark Steedman,et al.  The syntactic process , 2004, Language, speech, and communication.

[58]  Rosalind Thornton,et al.  Investigations in universal grammar: A guide to experiments on the acquisition of syntax and semantics , 1998 .

[59]  Siobhan Chapman Logic and Conversation , 2005 .

[60]  Nick Chater,et al.  The probabilistic analysis of language acquisition: Theoretical, computational, and experimental analysis , 2010, Cognition.

[61]  Christophe Costa Florêncio Learning generalized quantifiers , 2002 .

[62]  Noah D. Goodman,et al.  A Bayesian Model of the Acquisition of Compositional Semantics , 2008 .

[63]  Paul M. B. Vitányi,et al.  ‘Ideal learning’ of natural language: Positive results about learning from positive evidence , 2007 .

[64]  J. Hintikka On denoting what? , 2005, Synthese.

[65]  Noah D. Goodman,et al.  Bootstrapping in a language of thought: A formal model of numerical concept learning , 2012, Cognition.

[66]  Jeroen Groenendijk,et al.  Studies in discourse, representation theory and the theory of generalized quantifiers , 1987 .

[67]  Martin Hackl,et al.  On the grammar and processing of proportional quantifiers: most versus more than half , 2009 .

[68]  Tim Hunter,et al.  Children ’ s Restrictions on the Meanings of Novel Determiners : An Investigation of Conservativity , 2008 .

[69]  Julien Musolino,et al.  On the Semantics of the Subset Principle , 2006 .

[70]  Linda B. Smith,et al.  Rapid Word Learning Under Uncertainty via Cross-Situational Statistics , 2007, Psychological science.

[71]  Anne Bezuidenhout,et al.  Descriptions and Beyond , 2004 .

[72]  Nick Chater,et al.  The Logical Problem of Language Acquisition: A Probabilistic Perspective , 2010, Cogn. Sci..

[73]  Adult L2-learners Lack the Maximality Presupposition, Too!* , 2005 .

[74]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[75]  D. Madigan,et al.  Model Selection and Accounting for Model Uncertainty in Graphical Models Using Occam's Window , 1994 .

[76]  Hye-kyung Kang Quantifier Spreading by English and Korean Children , 2022 .

[77]  Luke S. Zettlemoyer,et al.  Learning to Map Sentences to Logical Form: Structured Classification with Probabilistic Categorial Grammars , 2005, UAI.

[78]  J. Tenenbaum A Bayesian framework for concept learning , 1999 .

[79]  Andrea Gualmini,et al.  Solving Learnability Problems in the Acquisition of Semantics , 2009, J. Semant..

[80]  V. M. Holmes,et al.  Syntactic presupposition in sentence comprehension , 1979, Cognition.

[81]  Raj Singh,et al.  Maximize Presupposition! and local contexts , 2011 .

[82]  Christopher C. Cummins,et al.  Scalar quantifiers: Logic, acquisition, and processing , 2010 .

[83]  P. Strawson III.—ON REFERRING , 1950 .

[84]  Paul Vogt,et al.  learning colour words is slow: a cross-situational learning account , 2005, Behavioral and Brain Sciences.

[85]  Robert C. Berwick,et al.  The acquisition of syntactic knowledge , 1985 .

[86]  M. Maratsos The Use of Definite and Indefinite Reference in Young Children: An Experimental Study of Semantic Acquisition , 1976 .

[87]  K. Wexler,et al.  Semantic and Pragmatic LanguageDevelopment: Children Know 'That' Better , 2007 .

[88]  Kenneth Wexler,et al.  The Maturation of Grammatical Principles: Evidence from Russian Unaccusatives , 2001, Linguistic Inquiry.

[89]  Donald Geman,et al.  Gibbs distributions and the bayesian restoration of images , 1984 .

[90]  M. Maratsos,et al.  Preschool children's use of definite and indefinite articles , 1974 .

[91]  Michael C. Frank,et al.  A Bayesian Framework for Cross-Situational Word-Learning , 2007, NIPS.