Paper Information - Document Retrieval, Automatic

Document retrieval is the computerized process of producing a relevance-ranked list of documents in response to an inquirer's request by comparing the request to an automatically produced index of the documents in the system. Such systems are in everyday use today in the form of Web-based search engines. Although the field has grown from a fairly small discipline in the 1940s into a large, profitable industry, it has maintained a healthy research focus, supported by test collections and large-scale annual comparative evaluations of systems. A document retrieval system is composed of three core modules: a document processor, a query analyzer, and a matching function. Document retrieval systems are based on several theoretical models: Boolean, vector-space, probabilistic, and language models.
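
The three modules named in the abstract can be illustrated with a minimal sketch of the vector-space model. This is not the article's implementation: the function names (`build_index`, `analyze_query`, `search`), the TF-IDF weighting with a `log(N/df) + 1` term, cosine similarity as the matching function, and the sample documents are all assumptions chosen for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Document processor: turn each document into a TF-IDF weighted vector."""
    term_counts = {doc_id: Counter(tokenize(text)) for doc_id, text in docs.items()}
    # Document frequency: in how many documents each term occurs.
    doc_freq = Counter()
    for counts in term_counts.values():
        doc_freq.update(counts.keys())
    n_docs = len(docs)
    # Assumed weighting: log(N / df) + 1, so terms found in every document keep some weight.
    idf = {term: math.log(n_docs / df) + 1.0 for term, df in doc_freq.items()}
    vectors = {
        doc_id: {term: tf * idf[term] for term, tf in counts.items()}
        for doc_id, counts in term_counts.items()
    }
    return vectors, idf


def analyze_query(query, idf):
    """Query analyzer: map the query into the same weighted term space.

    Terms that never occur in the collection are dropped.
    """
    counts = Counter(tokenize(query))
    return {term: tf * idf[term] for term, tf in counts.items() if term in idf}


def cosine(a, b):
    """Matching function: cosine similarity between two sparse vectors."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(query, vectors, idf):
    """Return (doc_id, score) pairs with a nonzero score, best match first."""
    query_vector = analyze_query(query, idf)
    scored = [(doc_id, cosine(query_vector, vec)) for doc_id, vec in vectors.items()]
    ranked = sorted(scored, key=lambda pair: pair[1], reverse=True)
    return [(doc_id, score) for doc_id, score in ranked if score > 0.0]


# Hypothetical three-document collection, used only to exercise the sketch.
docs = {
    "d1": "Boolean retrieval matches documents with logical operators",
    "d2": "The vector space model ranks documents by cosine similarity",
    "d3": "Language models estimate the probability of a query",
}
vectors, idf = build_index(docs)
results = search("vector space ranking", vectors, idf)
```

In this sketch only `d2` shares terms with the query, so it is the only document returned. A Boolean, probabilistic, or language-model system would keep the same three-module shape and change mainly the weighting and the matching function.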

Elizabeth D. Liddy
