论文信息 - A file organization for cluster-based retrieval

A file organization for cluster-based retrieval

A file organization for cluster-based retrieval is presented and tested. This file organization is based on the bottom-up search which, in contrast to the more usual top-down search, starts at the lowest level of a cluster hierarchy (the documents) and looks at progressively larger clusters. This approach enables most of the efficiency problems previously associated with clustered file organizations to be avoided. There are two parts to this file organization - a compact cluster hierarchy representation which does not store cluster representatives and a compact inverted file which is used to provide a starting point for the bottom-up search. Retrieval experiments show that the bottom-up search using this file organization can be more effective than a serial search, especially if high precision results are required.

W. Bruce Croft

[1] Gerard Salton,et al. Dynamic information and library processing , 1975 .

[2] Van Rijsbergen,et al. Automatic information structuring and retrieval. , 1972 .

[3] Robin Sibson,et al. SLINK: An Optimally Efficient Algorithm for the Single-Link Cluster Method , 1973, Comput. J..

[4] Jon Louis Bentley,et al. Multidimensional binary search trees used for associative searching , 1975, CACM.

[5] Walter A. Burkhard. Partial match retrieval , 1976 .

[6] Elizabeth D. Barraclough. ON‐LINE SEARCHING IN INFORMATION RETRIEVAL , 1977 .

[7] Sakti P. Ghosh,et al. File Organization Schemes Based on Finite Geometries , 1968, Inf. Control..

[8] Daniel Mcclure Murray,et al. Document Retrieval Based on Clustered Files , 1972 .