A Backend Text Retrieval Machine for Signature-Based Document Ranking

We discuss key issues of implementing a ranking strategy based on signature files. The main contribution of our method is the ability to represent term frequencies and obtain inverse document frequencies without explicitly storing them. This reduces the storage overhead significantly but without increasing the processing time. We then describe the design of a hardware signature processor for implementing the ranking strategy.

[1]  Dik Lun Lee A word-parallel, bit-serial signature processor for superimposed coding , 1986, 1986 IEEE Second International Conference on Data Engineering.

[2]  Sudhir Ahuja,et al.  An associative/parallel processor for partial match retrieval using superimposed codes , 1980, ISCA '80.

[3]  Dik Lun Lee,et al.  Text Retrieval Machines , 1985 .

[4]  Gerard Salton,et al.  Parallel text search methods , 1988, CACM.

[5]  M. E. Maron,et al.  An evaluation of retrieval effectiveness for a full-text document-retrieval system , 1985, CACM.

[6]  Craig Stanfill,et al.  Parallel free-text search on the connection machine system , 1986, CACM.

[7]  Andrew Doswell,et al.  Office Automation , 1983 .

[8]  Christos Faloutsos,et al.  Access methods for text , 1985, CSUR.

[9]  Christos Faloutsos,et al.  Description and performance analysis of signature file methods for office filing , 1987, TOIS.

[10]  Dik Lun Lee,et al.  Signature file methods for implementing a ranking strategy , 1990, Inf. Process. Manag..

[11]  Dik Lun Lee,et al.  Design and Performance Evaluation of an Associative Memory with Distributed Control , 1990, J. Parallel Distributed Comput..

[12]  Nassrin Tavakoli,et al.  An architecture for parallel search of large, full-text databases , 1990, Proceedings. PARBASE-90: International Conference on Databases, Parallel Architectures, and Their Applications.

[13]  Harold S. Stone,et al.  Parallel Querying of Large Databases: A Case Study , 1987, Computer.

[14]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[15]  W. Bruce Croft,et al.  Implementing ranking strategies using text signatures , 1988, TOIS.

[16]  Dik Lun Lee,et al.  Optimal weight assignment for signature generation , 1992, TODS.