Reducing block accesses in inverted files by partial clustering

Abstract

The number of block accesses needed to answer a query can be reduced by clustering the data records on the blocks of the secondary storage device, so that records likely to be retrieved by the same query occupy as few blocks as possible. This paper presents heuristic methods that reorganize the data file in linear time, and reports simulation results.
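To make the cost measure concrete, the following minimal sketch counts how many distinct blocks a query touches under a given placement of records on blocks. It is an illustration only and does not reproduce the heuristics of this paper: the function names, the record identifiers, the block size, the two queries, and the hand-picked "clustered" layout are all assumptions chosen for the example.

```python
def block_accesses(layout, query, block_size):
    """Count the distinct blocks that hold at least one record of `query`.

    `layout` lists record ids in storage order; consecutive runs of
    `block_size` records are assumed to share one block.
    """
    position = {rec: i for i, rec in enumerate(layout)}
    return len({position[rec] // block_size for rec in query})


def total_accesses(layout, queries, block_size):
    """Sum the block accesses over a set of queries."""
    return sum(block_accesses(layout, q, block_size) for q in queries)


# Hypothetical data: 8 records, 2 records per block, and two queries,
# each given as the set of record ids it retrieves.
BLOCK_SIZE = 2
queries = [{0, 4}, {1, 5, 7}]

original = [0, 1, 2, 3, 4, 5, 6, 7]    # records stored in id order
clustered = [0, 4, 1, 5, 7, 2, 3, 6]   # co-retrieved records placed together

print(total_accesses(original, queries, BLOCK_SIZE))   # 5
print(total_accesses(clustered, queries, BLOCK_SIZE))  # 3
```

In this toy case, placing co-retrieved records next to each other lowers the total from 5 block accesses to 3. The clustered order here was chosen by hand; finding such an order efficiently for a real file is the problem the heuristic methods address.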