Gravity wells of meaning: detecting information-rich passages in scientific texts

Four term‐weighting schemes are used to detect information‐rich passages in texts and the results are compared. It is demonstrated that word categories and frequency‐derived weights have a close correlation but that weighting according to the first mention theory or the cue‐method shows no correlation with frequency‐based weights.

[1]  H. P. Edmundson,et al.  New Methods in Automatic Extracting , 1969, JACM.

[2]  M. E. Maron,et al.  An evaluation of retrieval effectiveness for a full-text document-retrieval system , 1985, CACM.

[3]  Graeme Hirst,et al.  Lexical Cohesion Computed by Thesaural relations as an indicator of the structure of text , 1991, CL.

[4]  David C. Blair STAIRS redux: thoughts on the STAIRS evaluation, ten years after , 1996 .

[5]  Carolyn J. Crouch,et al.  An analysis of approximate versus exact discrimination values , 1988, Inf. Process. Manag..

[6]  Kui-Lam Kwok,et al.  A Document-Document Similarity Measure Based on Cited Titles and Probability Theory, and Its Application to Relevance Feedback Retrieval , 1984, SIGIR.

[7]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[8]  Cyril W. Cleverdon,et al.  The significance of the Cranfield tests on index languages , 1991, SIGIR '91.

[9]  Chris D. Paice,et al.  Constructing literature abstracts by computer: Techniques and prospects , 1990, Inf. Process. Manag..

[10]  Christian Plaunt,et al.  Subtopic structuring for full-length document access , 1993, SIGIR.

[11]  James Allan,et al.  Approaches to passage retrieval in full text information systems , 1993, SIGIR.

[12]  Gerard Salton,et al.  Research and Development in Information Retrieval , 1982, Lecture Notes in Computer Science.

[13]  Peter Willett,et al.  An improved algorithm for the calculation of exact term discrimination values , 1988, Inf. Process. Manag..

[14]  Clement T. Yu,et al.  A theory of term importance in automatic text analysis , 1974, J. Am. Soc. Inf. Sci..

[15]  Penelope Sibun,et al.  A Practical Part-of-Speech Tagger , 1992, ANLP.

[16]  David E. Kieras,et al.  Thematic Processes in the Comprehension of Technical Prose. , 1982 .

[17]  Lois L. Earl,et al.  Experiments in automatic extracting and indexing , 1970, Inf. Storage Retr..

[18]  Y. Zhang,et al.  Enhancement of text representations using related document titles , 1986, Inf. Process. Manag..

[19]  Marti A. Hearst TileBars: visualization of term distribution information in full text information access , 1995, CHI '95.

[20]  Sally Jo Cunningham,et al.  Applications of machine learning in information retrieval , 1999 .

[21]  James P. Callan,et al.  Passage-level evidence in document retrieval , 1994, SIGIR '94.

[22]  Mary Hart,et al.  Automatic indexing using selective NLP and first-order thesauri , 1991, RIAO.

[23]  L. R. Rasmussen,et al.  In information retrieval: data structures and algorithms , 1992 .

[24]  Peter Willett,et al.  An algorithm for the calculation of exact term discrimination values , 1985, Inf. Process. Manag..

[25]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[26]  Gerard Salton,et al.  Automatic text structuring and retrieval-experiments in automatic encyclopedia searching , 1991, SIGIR '91.

[27]  E. Michael Keen,et al.  Term position ranking: some new test results , 1992, SIGIR '92.