论文信息 - Underspecified quantification

Underspecified quantification

Many noun phrases in text are ambiguously quantified: syntax doesn't explicitly tell us whether they refer to a single entity or to several and, in main clauses, what portion of the set denoted by the subject Nbar actually takes part in the event expressed by the verb. For instance, when we utter the sentence Cats are mammals, it is only world knowledge that allows our hearer to infer that we mean All cats are mammals, and not Some cats are mammals. This ambiguity effect is interesting at several levels. Theoretically, it raises cognitive and linguistic questions. To what extent does syntax help humans resolve the ambiguity? What problem-solving skills come into play when syntax is insufficient for full resolution? How does ambiguous quantification relate to the phenomenon of genericity, as described by the linguistic literature? From an engineering point of view, the resolution of quantificational ambiguity is essential to the accuracy of some Natural Language Processing tasks. We argue that the quantification ambiguity phenomenon can be described in terms of underspecification and propose a formalisation for what we call underquantified subject noun phrases. Our formalisation is motivated by inference requirements and covers all cases of genericity. Our approach is then empirically validated by human annotation experiments. We propose an annotation scheme that follows our theoretical claims with regard to underquan-tification. Our annotation results strengthen our claim that all noun phrases can be analysed in terms of quantification. The produced corpus allows us to derive a gold standard for quantification resolution experiments and is, as far as we are aware, the first attempt to analyse the distribution of null quantifiers in English. We then create a baseline system for automatic quantification resolution, using syntax to provide discriminating features for our classification. We show that results are rather poor for certain classes and argue that some level of pragmatics is needed, in combination with syntax, to perform accurate resolution. We explore the use of memory-based learning as a way to approximate the problem-solving skills available to humans at the level of pragmatic understanding. 3 4 Acknowledgments I would like to thank.... ... my supervisor Dr Ann Copestake, who supported me in more ways than will fit on this page: in particular for mentioning the phenomenon of genericity in the first place, for reading and re-reading countless papers and drafts and always providing the most enlightening comments on my work, for pushing me …

Aurélie Herbelot | Aurélie Herbelot

[1] Sarah-Jane Leslie,et al. Do ducks lay eggs? How people interpret generic assertions , 2007 .

[2] Stephen Pulman,et al. Using the Framework , 1996 .

[3] J. Fleiss. Measuring nominal scale agreement among many raters. , 1971 .

[4] Mitchell P. Marcus,et al. Adding Semantic Annotation to the Penn TreeBank , 1998 .

[5] Gregory Norman Carlson,et al. Reference to kinds in English , 1977 .

[6] B. Carpenter,et al. Think Generic!: The Meaning and Use of Generic Sentences , 1999 .

[7] Noel Burton-Roberts,et al. Generic Sentences and Analyticity , 1977 .

[8] "Rices" and "Waters": The Mass-Count Distinction in Modern Persian , 2003 .

[9] Manfred Krifka,et al. Bare NPs: Kind-referring, Indefinites, Both, or Neither? , 2003 .

[10] Massimo Poesio,et al. Discourse Annotation and Semantic Annotation in the GNOME corpus , 2004, Proceedings of the 2004 ACL Workshop on Discourse Annotation - DiscAnnotation '04.

[11] Jacob Cohen. A Coefficient of Agreement for Nominal Scales , 1960 .