Underspecified quantification

Many noun phrases in text are ambiguously quantified: syntax doesn't explicitly tell us whether they refer to a single entity or to several and, in main clauses, what portion of the set denoted by the subject Nbar actually takes part in the event expressed by the verb. For instance, when we utter the sentence Cats are mammals, it is only world knowledge that allows our hearer to infer that we mean All cats are mammals, and not Some cats are mammals. This ambiguity effect is interesting at several levels. Theoretically, it raises cognitive and linguistic questions. To what extent does syntax help humans resolve the ambiguity? What problem-solving skills come into play when syntax is insufficient for full resolution? How does ambiguous quantification relate to the phenomenon of genericity, as described by the linguistic literature? From an engineering point of view, the resolution of quantificational ambiguity is essential to the accuracy of some Natural Language Processing tasks. We argue that the quantification ambiguity phenomenon can be described in terms of underspecification and propose a formalisation for what we call underquantified subject noun phrases. Our formalisation is motivated by inference requirements and covers all cases of genericity. Our approach is then empirically validated by human annotation experiments. We propose an annotation scheme that follows our theoretical claims with regard to underquan-tification. Our annotation results strengthen our claim that all noun phrases can be analysed in terms of quantification. The produced corpus allows us to derive a gold standard for quantification resolution experiments and is, as far as we are aware, the first attempt to analyse the distribution of null quantifiers in English. We then create a baseline system for automatic quantification resolution, using syntax to provide discriminating features for our classification. We show that results are rather poor for certain classes and argue that some level of pragmatics is needed, in combination with syntax, to perform accurate resolution. We explore the use of memory-based learning as a way to approximate the problem-solving skills available to humans at the level of pragmatic understanding. 3 4 Acknowledgments I would like to thank.... ... my supervisor Dr Ann Copestake, who supported me in more ways than will fit on this page: in particular for mentioning the phenomenon of genericity in the first place, for reading and re-reading countless papers and drafts and always providing the most enlightening comments on my work, for pushing me …

[1]  Sarah-Jane Leslie,et al.  Do ducks lay eggs? How people interpret generic assertions , 2007 .

[2]  Stephen Pulman,et al.  Using the Framework , 1996 .

[3]  J. Fleiss Measuring nominal scale agreement among many raters. , 1971 .

[4]  Mitchell P. Marcus,et al.  Adding Semantic Annotation to the Penn TreeBank , 1998 .

[5]  Gregory Norman Carlson,et al.  Reference to kinds in English , 1977 .

[6]  B. Carpenter,et al.  Think Generic!: The Meaning and Use of Generic Sentences , 1999 .

[7]  Noel Burton-Roberts,et al.  Generic Sentences and Analyticity , 1977 .

[8]  "Rices" and "Waters": The Mass-Count Distinction in Modern Persian , 2003 .

[9]  Manfred Krifka,et al.  Bare NPs: Kind-referring, Indefinites, Both, or Neither? , 2003 .

[10]  Massimo Poesio,et al.  Discourse Annotation and Semantic Annotation in the GNOME corpus , 2004, Proceedings of the 2004 ACL Workshop on Discourse Annotation - DiscAnnotation '04.

[11]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[12]  Drew McDermott,et al.  Non-Monotonic Logic I , 1987, Artif. Intell..

[13]  Godehard Link Algebraic semantics in language and philosophy , 1997 .

[14]  Gerhard Heyer,et al.  Semantics and Knowledge Representation in the Analysis of Generic Descriptions , 1990, J. Semant..

[15]  Benjamin Kuipers,et al.  ON REPRESENTING COMMONSENSE KNOWLEDGE , 1979 .

[16]  Ann Copestake,et al.  Annotating genericity: How do humans decide? (A case study in ontology extraction) , 2009 .

[17]  Lenhart K. Schubert,et al.  Problems in the representation of the logical form of generics, plurals, and mass nouns , 1987 .

[18]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[19]  Wolfgang Sternefeld,et al.  Syntax: An International Handbook of Contemporary Research , 1993 .

[20]  Nicholas Asher,et al.  Commonsense Entailment: A Modal Theory of Non-monotonic Reasoning , 1991, IJCAI.

[21]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[22]  Aurélie Herbelot,et al.  Finding Word Substitutions Using a Distributional Similarity Baseline and Immediate Context Overlap , 2009, EACL.

[23]  Ted Briscoe,et al.  The Second Release of the RASP System , 2006, ACL.

[24]  Walter Daelemans,et al.  Forgetting Exceptions is Harmful in Language Learning , 1998, Machine Learning.

[25]  Siobhan Chapman Logic and Conversation , 2005 .

[26]  Graeme Hirst,et al.  Lexical chains as representations of context for the detection and correction of malapropisms , 1995 .

[27]  Carl Vogel,et al.  Metaphor is generic , 2008 .

[28]  Johanna Völker,et al.  Acquisition of OWL DL Axioms from Lexical Resources , 2007, ESWC.

[29]  A. Feinstein,et al.  High agreement but low kappa: I. The problems of two paradoxes. , 1990, Journal of clinical epidemiology.

[30]  Ido Dagan,et al.  The Distributional Inclusion Hypotheses and Lexical Entailment , 2005, ACL.

[31]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[32]  Jon Star,et al.  Children's interpretation of generic noun phrases. , 2002, Developmental psychology.

[33]  Push Singh,et al.  The Public Acquisition of Commonsense Knowledge , 2002 .

[34]  ダンコヴ スヴェトスラヴ Commonsense and context: a novel approach for automatic extraction of generic statements , 2008 .

[35]  Patrick Pantel,et al.  VerbOcean: Mining the Web for Fine-Grained Semantic Verb Relations , 2004, EMNLP.

[36]  G. Carlson,et al.  1 Truth-Conditions of Generic Sentences : Two Contrasting Views , 1988 .

[37]  Susan A. Gelman,et al.  Learning Words for Kinds: Generic Noun Phrases in Acquisition. , 2004 .

[38]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[39]  Christopher D. Manning,et al.  Modeling Semantic Containment and Exclusion in Natural Language Inference , 2008, COLING.

[40]  Timothy Baldwin,et al.  Multiword Expressions: A Pain in the Neck for NLP , 2002, CICLing.

[41]  Doug Downey,et al.  Web-scale information extraction in knowitall: (preliminary results) , 2004, WWW '04.

[42]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[43]  Eduard H. Hovy,et al.  Learning surface text patterns for a Question Answering System , 2002, ACL.

[44]  Klaus Krippendorff,et al.  Content Analysis: An Introduction to Its Methodology , 1980 .

[45]  Nicholas Asher,et al.  Generics and Defaults , 1997, Handbook of Logic and Language.

[46]  F. Landman Groups, I , 1989 .

[47]  Ramanathan V. Guha,et al.  Cyc: toward programs with common sense , 1990, CACM.

[48]  Sangweon Suh,et al.  Extracting Generic Statements for the Semantic Web , 2006 .

[49]  John McCarthy,et al.  Programs with common sense , 1960 .

[50]  H. Cartwright Some remarks about mass nouns and plurality , 1975, Synthese.

[51]  Berit Brogaard,et al.  The But Not All: A Partitive Account of Plural Definite Descriptions , 2007 .

[52]  Jean Carletta,et al.  Squibs: Reliability Measurement without Limits , 2008, CL.

[53]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[54]  Ted Briscoe,et al.  Semi-productive Polysemy and Sense Extension , 1995, J. Semant..

[55]  Dekang Lin,et al.  DIRT – Discovery of Inference Rules from Text , 2001 .

[56]  Simone Teufel,et al.  An annotation scheme for citation function , 2009, SIGDIAL Workshop.

[57]  Robust Minimal Recursion Semantics , 2006 .

[58]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[59]  S. Glucksberg,et al.  Conceptual and linguistic distinctions between singular and plural generics , 2009 .

[60]  Raymond Reiter,et al.  A Logic for Default Reasoning , 1987, Artif. Intell..

[61]  Barbara B. Levin,et al.  English verb classes and alternations , 1993 .

[62]  Sandeep Prasada,et al.  Principled and statistical connections in common sense conception , 2006, Cognition.

[63]  David S. Touretzky,et al.  A Skeptical Theory of Inheritance in Nonmonotonic Semantic Networks , 1987, Artif. Intell..

[64]  Ernest Lepore,et al.  What model theoretic semantics cannot do? , 1983, Synthese.

[65]  Leila Behrens,et al.  Genericity from a Cross-Linguistic Perspective , 2005 .

[66]  Stanley Peters,et al.  Quantifiers in language and logic , 2006 .

[67]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[68]  J. Peregrin LINGUISTICS AND PHILOSOPHY , 1998 .

[69]  Ann A. Copestake Some Notes on Mass Terms and Plurals , 1989 .

[70]  Carlo Strapparava,et al.  Direct Word Sense Matching for Lexical Substitution , 2006, ACL.

[71]  G. Miller,et al.  Contextual correlates of semantic similarity , 1991 .

[72]  Shalom Lappin,et al.  An Intensional Parametric Semantics For Vague Quantifiers , 2000 .

[73]  Claudio Giuliano,et al.  Instance Based Lexical Entailment for Ontology Population , 2007, EMNLP-CoNLL.

[74]  Andrei Cimpian,et al.  Preschool children’s use of cues to generic meaning , 2008, Cognition.

[75]  Godehard Link The Logical Analysis of Plurals and Mass Terms: A Lattice‐theoretical Approach , 2008 .

[76]  Anna Ritchie,et al.  Compatible RMRS representations from RASP and the ERG , 2006 .

[77]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[78]  S. Glucksberg,et al.  Syllogistic reasoning with generic premises: The generic overgeneralization effect , 2008 .

[79]  Michelle L. McGillion,et al.  GENERICITY IS CONCEPTUAL, NOT SEMANTIC , 2002 .

[80]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[81]  Sheila Glasbey,et al.  Bare plurals in object position: which verbs fail to give existential readings, and why? , 2006 .

[82]  Ewan Klein,et al.  Extracting Common Sense Knowledge from Wikipedia , 2006 .

[83]  Ido Dagan,et al.  Instance-based Evaluation of Entailment Rule Acquisition , 2007, ACL.

[84]  Sarah-Jane Leslie,et al.  GENERICS AND THE STRUCTURE OF THE MIND , 2007 .

[85]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classication , 2008 .

[86]  Sarah-Jane Leslie,et al.  Generics: Cognition and Acquisition , 2008 .

[87]  Danushka Bollegala,et al.  Measuring semantic similarity between words using web search engines , 2007, WWW '07.

[88]  Nomi Erteschik-Shir,et al.  Topic, Focus, and the Interpretation of Bare Plurals , 2002 .

[89]  Alice ter Meulen,et al.  Genericity: An Introduction , 1995 .

[90]  Aurélie Herbelot,et al.  Acquiring Ontological Relationships from Wikipedia Using RMRS , 2006 .

[91]  Roger C. Schank,et al.  Scripts, plans, goals and understanding: an inquiry into human knowledge structures , 1978 .

[92]  A. Cohen Relative Readings of Many, Often, and Generics , 2001 .

[93]  Willard Van Orman Quine,et al.  Word and Object , 1960 .

[94]  Carl Vogel,et al.  Inheritance reasoning : psychological plausibility, proof theory and semantics , 1995 .

[95]  H. Kamp A Theory of Truth and Semantic Representation , 2008 .

[96]  Dan Flickinger,et al.  Minimal Recursion Semantics: An Introduction , 2005 .

[97]  Ido Dagan,et al.  Scaling Web-based Acquisition of Entailment Relations , 2004, EMNLP.

[98]  Patrick Pantel,et al.  Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations , 2006, ACL.

[99]  Barbara Di Eugenio,et al.  Squibs and Discussions: The Kappa Statistic: A Second Look , 2004, CL.

[100]  Stan Szpakowicz,et al.  Roget's thesaurus and semantic similarity , 2012, RANLP.

[101]  Fahiem Bacchus A Modest, but Semantically Well Founded, Inheritance Reasoner , 1989, IJCAI.

[102]  David L. Waltz,et al.  Toward memory-based reasoning , 1986, CACM.

[103]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .