Lexical Clustering and Definite Description Interpretation

We present preliminary results on using lexical clustering algorithms to acquire the lexical knowledge needed to resolve definite descriptions, in particular those we call 'inferential' descriptions. We tested the hypothesis that the antecedent of an inferential description is identified primarily on the basis of its semantic distance from the description, and we compared several variants of the clustering algorithm. The choice of parameters has a clear effect, and the best results are obtained when the distance between lexical vectors is measured with the cosine measure. We also found, however, that factors other than semantic distance play the main role in the majority of cases. In the cases where the kind of lexical knowledge we acquired is the main factor, the algorithms performed reasonably well. Several open problems are discussed.
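The idea of choosing an antecedent by its semantic distance from the description can be illustrated with a minimal sketch. It assumes each word is already represented by a vector of co-occurrence counts; the function names, the toy vectors, and the example words are hypothetical and are not taken from the experiments reported here.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (1.0 = same direction)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen for this sketch: a zero vector matches nothing
    return dot / (norm_u * norm_v)


def closest_antecedent(description_vec, candidates):
    """Return the candidate word whose vector is most similar to the description's.

    `candidates` maps each candidate head noun to its lexical vector.
    """
    return max(
        candidates,
        key=lambda word: cosine_similarity(description_vec, candidates[word]),
    )


# Toy co-occurrence counts, invented purely for illustration.
door = [4, 1, 0]
candidates = {
    "house": [3, 2, 0],
    "idea": [0, 1, 5],
}

best = closest_antecedent(door, candidates)
```

With these toy vectors, the description "the door" is closer to "house" than to "idea". As the abstract notes, such a distance-based choice covers only the cases in which lexical knowledge is the deciding factor.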
