Textual context analysis for information retrieval

We describe QUESCOT, a program which analyses and quantifies textual contexts in documents with reference to the WordNet database, and hence determines the dominance of topics in a document. Our analysis is based on previous work in lexical cohesion, a feature of texts which contributes to their functioning as a coherent unit. The applications are diverse, but all pertain to information retrieval. Whilst our results suggest that QUESCOT is not well suited to word sense disambiguation and text segmentation, our [...] component produces promising results. We also used QUESCOT representations to automatically generate a resource to supplement WordNet, based on collocational relations between concepts in a document collection. We conclude that QUESCOT is suited to applications based on document-level descriptions, where the degree of granularity allows inaccuracies to be smoothed out.
