Exemplary documents: a foundation for information retrieval design

Documents are generally represented for retrieval either by extracting index terms from them or by creating an external set of candidate terms and selecting from it. There are many procedures for doing this, and work continues along both lines, but there have been relatively few attempts to change this basic process. Of particular importance is the creation of indexing schemes for retrieval systems in nonlibrary contexts. Here, the cost of developing an indexing scheme independent of the documents to be retrieved is often considered too high. As a result, simple full-text retrieval or, to a lesser extent, automatic extractive or associative indexing predominates in nonlibrary contexts. This paper suggests an alternative document representation method based on what we call exemplary documents: documents that describe or exhibit the intellectual structure of a particular field of interest. In so doing, they provide both an indexing vocabulary for that area and, more importantly, a narrative context in which the indexing terms have a clearer meaning. Further, it is much easier to develop an indexing scheme by using exemplary documents than it is to do so from scratch.
