Inverted files for text search engines

The technology underlying text search engines has advanced dramatically in the past decade. The development of a family of new index representations has led to a wide range of innovations in index storage, index construction, and query evaluation. While some of these developments have been consolidated in textbooks, many specific techniques are not widely known or the textbook descriptions are out of date. In this tutorial, we introduce the key techniques in the area, describing both a core implementation and how the core can be enhanced through a range of extensions. We conclude with a comprehensive bibliography of text indexing literature.

[1]  Hans Peter Luhn,et al.  A Statistical Approach to Mechanized Encoding and Searching of Literary Information , 1957, IBM J. Res. Dev..

[2]  M. E. Maron,et al.  On Relevance, Probabilistic Indexing and Information Retrieval , 1960, JACM.

[3]  H. P. Edmundson,et al.  Automatic abstracting and indexing—survey and recommendations , 1961, CACM.

[4]  Solomon W. Golomb,et al.  Run-length encodings (Corresp.) , 1966, IEEE Trans. Inf. Theory.

[5]  S. Golomb Run-length encodings. , 1966 .

[6]  Evan Leon Ivie Search procedures based on measures of relatedness between documents. , 1966 .

[7]  F. W. Matthews,et al.  Weighted Term Search: A Computer Program for an Inverted Coordinate Index on Magnetic Tape , 1967 .

[8]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[9]  Gerard Salton,et al.  Dynamic document processing , 1972, CACM.

[10]  Peter Elias,et al.  Universal codeword sets and representations of the integers , 1975, IEEE Trans. Inf. Theory.

[11]  David C. van Voorhis,et al.  Optimal source codes for geometrically distributed integer alphabets (Corresp.) , 1975, IEEE Trans. Inf. Theory.

[12]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[13]  Alfonso F. Cardenas Analysis and performance of inverted data base structures , 1975, CACM.

[14]  Ernst J. Schuegraf Compression of large inverted files with hyperbolic term distribution , 1976, Inf. Process. Manag..

[15]  R. M. Bird,et al.  Associative/parallel processors for searching very large textual data bases , 1977, CAW '77.

[16]  Terry Noreault,et al.  Automatic ranked output from boolean searches in SIRE , 1977, J. Am. Soc. Inf. Sci..

[17]  Dennis G. Severance,et al.  A Practical Approach to Selecting Record Access Paths , 1977, CSUR.

[18]  William H. Stellhorn,et al.  An Inverted File Processor for Information Retrieval , 1977, IEEE Transactions on Computers.

[19]  Ken J. McDonell An Inverted Index Implementation , 1977, Comput. J..

[20]  Jukka Teuhola,et al.  A Compression Method for Clustered Bit-Vectors , 1978, Inf. Process. Lett..

[21]  Matti Jakobsson Huffman Coding in Bit-Vector Compression , 1978, Inf. Process. Lett..

[22]  共立出版株式会社 コンピュータ・サイエンス : ACM computing surveys , 1978 .

[23]  R. M. Bird,et al.  Text file inversion: An evaluation , 1978, CAW '78.

[24]  H. S. Heaps,et al.  Information retrieval, computational and theoretical aspects , 1978 .

[25]  Robert F. Rice,et al.  Some practical universal noiseless coding techniques , 1979 .

[26]  Roger L. Haskin Hardware for searching very large text databases , 1980, CAW '80.

[27]  Alan F. Smeaton,et al.  The nearest neighbour problem in information retrieval: an algorithm using upperbounds , 1981, Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.

[28]  C. J. van Rijsbergen,et al.  The nearest neighbour problem in information retrieval: an algorithm using upperbounds , 1981, SIGIR '81.

[29]  Gerard Salton,et al.  Research and Development in Information Retrieval , 1982, Lecture Notes in Computer Science.

[30]  Caroline M. Eastman Current Practice in The Evaluation of Multikey Search Algorithms , 1983, SIGIR.

[31]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[32]  Peter Willett,et al.  A review of the use of inverted files for best match searching in information retrieval systems , 1983 .

[33]  Aviezri S. Fraenkel,et al.  Novel Compression of Sparse Bit-Strings — Preliminary Report , 1985 .

[34]  Chris Buckley,et al.  Optimization of inverted vector searches , 1985, SIGIR '85.

[35]  Christos Faloutsos,et al.  Signature files: design and performance comparison of some signature extraction methods , 1985, SIGMOD Conference.

[36]  Christos Faloutsos,et al.  Access methods for text , 1985, CSUR.

[37]  Ellen M. Voorhees,et al.  The efficiency of inverted index and cluster searches , 1986, SIGIR '86.

[38]  Fausto Rabitti,et al.  Proceedings of the 9th annual international ACM SIGIR conference on Research and development in information retrieval , 1986 .

[39]  Shmuel Tomi Klein,et al.  Improved hierarchical bit-vector compression in document retrieval systems , 1986, SIGIR '86.

[40]  Shmuel Tomi Klein,et al.  Improved techniques for processing queries in full-text systems , 1987, SIGIR '87.

[41]  Kotagiri Ramamohanarao,et al.  Multikey access methods based on superimposed coding techniques , 1987, TODS.

[42]  Patrick Martin,et al.  Strategies for building distributed information retrieval systems , 1987, Inf. Process. Manag..

[43]  Shmuel Tomi Klein,et al.  Compression of concordances in full-text retrieval systems , 1988, SIGIR '88.

[44]  W. Bruce Croft,et al.  Implementing ranking strategies using text signatures , 1988, TOIS.

[45]  Gerard Salton,et al.  Parallel text search methods , 1988, CACM.

[46]  Dario Lucarella,et al.  A document retrieval system based on nearest neighbour searching , 1988, J. Inf. Sci..

[47]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[48]  Craig Stanfill Partitioned posting files: a parallel inverted file structure for information retrieval , 1989, SIGIR '90.

[49]  Dik Lun Lee,et al.  Partitioned signature files: design issues and performance evaluation , 1989, TOIS.

[50]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[51]  Shmuel Tomi Klein,et al.  Storing text retrieval systems on CD-ROM: compression and encryption considerations , 1989, SIGIR '89.

[52]  David L. Waltz,et al.  A parallel indexed algorithm for information retrieval , 1989, SIGIR '89.

[53]  Jan O. Pedersen,et al.  Optimization for dynamic inverted index maintenance , 1989, SIGIR '90.

[54]  Forbes J. Burkowski Surrogate subsets: a free space management strategy for the index of a text retrieval system , 1989, SIGIR '90.

[55]  Jean-Luc Vidick Proceedings of the 13th annual international ACM SIGIR conference on Research and development in information retrieval , 1989, SIGIR 1989.

[56]  Peter Willett,et al.  Parallel text searching in serial files using a processor farm , 1989, SIGIR '90.

[57]  Donna Harman,et al.  Retrieving Records from a Gigabyte of Text on a Minicomputer Using Statistical Ranking. , 1990 .

[58]  Patrick Martin,et al.  A case study of caching strategies for a distributed full text retrieval system , 1990, Inf. Process. Manag..

[59]  Shmuel Tomi Klein,et al.  Using bitmaps for medium sized information retrieval systems , 1990, Inf. Process. Manag..

[60]  Kotagiri Ramamohanarao,et al.  A Signature File Scheme Based on Multiple Organizations for Indexing Very Large Text Databases. , 1990 .

[61]  Ian H. Witten,et al.  Text Compression , 1990, 125 Problems in Text Algorithms.

[62]  Patrick Martin,et al.  Data caching strategies for distributed full text retrieval systems , 1991, Inf. Syst..

[63]  Shmuel Tomi Klein,et al.  Compression of correlated bit-vectors , 1991, Inf. Syst..

[64]  Pavel Zezula,et al.  Dynamic partitioning of signature files , 1991, TOIS.

[65]  A. Bookstein,et al.  Flexible compression for bitmap sets , 1991, [1991] Proceedings. Data Compression Conference.

[66]  Ian H. Witten,et al.  Models for compression in full-text retrieval systems , 1991, [1991] Proceedings. Data Compression Conference.

[67]  Edward A. Fox,et al.  FAST-INV: A Fast Algorithm for building large inverted files , 1991 .

[68]  Shmuel Tomi Klein,et al.  Generative models for bitmap sets with compression applications: (extended abstract) , 1991, SIGIR '91.

[69]  S. F. Reddaway High speed text retrieval from large databases on a massively parallel processor , 1991, Inf. Process. Manag..

[70]  Donna K. Harman,et al.  Prototyping a distributed information retrieval system that uses statistical ranking , 1991, Inf. Process. Manag..

[71]  Alistair Moffat,et al.  Economical Inversion of Large Text Files , 1992, Comput. Syst..

[72]  L. R. Rasmussen,et al.  In information retrieval: data structures and algorithms , 1992 .

[73]  Alistair Moffat,et al.  Coding for compression in full-text retrieval systems , 1992, Data Compression Conference, 1992..

[74]  Alistair Moffat,et al.  Parameterised compression for sparse bitmaps , 1992, SIGIR '92.

[75]  Ricardo Baeza-Yates,et al.  Information Retrieval: Data Structures and Algorithms , 1992 .

[76]  Ron Sacks-Davis,et al.  An e cient indexing technique for full-text database systems , 1992, VLDB 1992.

[77]  Alistair Moffat,et al.  An Efficient Indexing Technique for Full Text Databases , 1992, Very Large Data Bases Conference.

[78]  Shmuel Tomi Klein,et al.  Models of Bitmap Generation: A Systematic Approach to Bitmap Compression , 1992, Inf. Process. Manag..

[79]  Edward A. Fox,et al.  Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval , 1992, Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.

[80]  Donna K. Harman,et al.  Ranking Algorithms , 1992, Information Retrieval: Data Structures & Algorithms.

[81]  Christos Faloutsos,et al.  Hybrid Index Organizations for Text Databases , 1992, EDBT.

[82]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[83]  Pavel Zezula,et al.  Estimating accesses in partitioned signature file organizations , 1993, TOIS.

[84]  Ian H. Witten,et al.  Data Compression in Full-Text Retrieval Systems , 1993, J. Am. Soc. Inf. Sci..

[85]  Craig Stanfill,et al.  Compression of indexes with full positional information in very large text databases , 1993, SIGIR.

[86]  Alistair Moffat,et al.  Searching Large Lexicons for Partially Specified Terms using Compressed Inverted Files , 1993, VLDB.

[87]  Ian H. Witten,et al.  Data compression in full-text retrieval systems , 1993 .

[88]  Alistair Moffat,et al.  Storage Management for Files of Dynamic Records , 1993, Australian Database Conference.

[89]  Dik Lun Lee,et al.  Implementations of Partial Document Ranking Using Inverted Files , 1993, Information Processing & Management.

[90]  Hector Garcia-Molina,et al.  Performance of inverted indices in shared-nothing distributed text document information retrieval systems , 1993, [1993] Proceedings of the Second International Conference on Parallel and Distributed Information Systems.

[91]  Hector Garcia-Molina,et al.  Incremental updates of inverted lists for text document retrieval , 1994, SIGMOD '94.

[92]  Hector Garcia-Molina,et al.  Synthetic workload performance analysis of incremental updates , 1994, SIGIR '94.

[93]  Charles L. A. Clarke,et al.  Fast Inverted Indexes with On-Line Update , 1994 .

[94]  Alistair Moffat,et al.  Memory Efficient Ranking , 1994, Inf. Process. Manag..

[95]  Dik Lun Lee,et al.  An analysis of performance and cost factors in searching large text databases using parallel search systems , 1994 .

[96]  W. Bruce Croft,et al.  Fast Incremental Indexing for Full-Text Information Retrieval , 1994, VLDB.

[97]  Mukesh Singhal,et al.  An Analysis of Performance and Cost Factors in Searching Large Text Databases Using Parallel Search Systems , 1994, Journal of the American Society for Information Science.

[98]  Udi Manber,et al.  GLIMPSE: A Tool to Search Through Entire File Systems , 1994, USENIX Winter.

[99]  Ian H. Witten,et al.  Managing Gigabytes: Compressing and Indexing Documents and Images , 1999 .

[100]  Fazli Can On the Efficiency of Best-Match Cluster Searches , 1994, Inf. Process. Manag..

[101]  Dalia Motzkin,et al.  On High Performance of Updates Within an Efficient Document Retrieval System , 1994, Inf. Process. Manag..

[102]  W. Bruce Croft,et al.  Supporting Full-Text Information Retrieval with a Persistent Object Store , 1994, EDBT.

[103]  Eric W. Brown,et al.  Fast evaluation of structured queries for information retrieval , 1995, SIGIR '95.

[104]  Alistair Moffat,et al.  In Situ Generation of Compressed Inverted Files , 1995, J. Am. Soc. Inf. Sci..

[105]  Alistair Moffat,et al.  In Situ Generation of Compressed Inverted Files , 1995, J. Am. Soc. Inf. Sci..

[106]  Alistair Moffat,et al.  Efficient Retrieval of Partial Documents , 1995, Inf. Process. Manag..

[107]  Douglas W. Oard,et al.  A survey of information retrieval and filtering methods , 1995 .

[108]  Dik Lun Lee,et al.  Efficient Signature File Methods for Text Retrieval , 1995, IEEE Trans. Knowl. Data Eng..

[109]  Howard R. Turtle,et al.  Query Evaluation: Strategies and Optimizations , 1995, Inf. Process. Manag..

[110]  Byeong-Soo Jeong,et al.  Inverted File Partitioning Schemes in Multiple Disk Systems , 1995, IEEE Trans. Parallel Distributed Syst..

[111]  Hans-Peter Frei,et al.  SIGIR '96 : proceedings of the 19th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, August 18-22, 1996, Zurich, Switzerland , 1996 .

[112]  Chris Buckley,et al.  Pivoted Document Length Normalization , 1996, SIGIR Forum.

[113]  Kathryn S. McKinley,et al.  Performance evaluation of a distributed architecture for information retrieval , 1996, SIGIR '96.

[114]  W. Bruce Croft,et al.  Integrating INQUERY with an RDBMS to Support Text Retrieval. , 1996 .

[115]  Sharad Mehrotra,et al.  The Gold Text Indexing Engine , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[116]  Hector Garcia-Molina,et al.  Performance Issues in Distributed Shared-Nothing Information-Retrieval Systems , 1996, Inf. Process. Manag..

[117]  Khalid Sayood,et al.  Introduction to Data Compression , 1996 .

[118]  Kotagiri Ramamohanarao,et al.  Guidelines for presentation and comparison of indexing techniques , 1996, SGMD.

[119]  Justin Zobel,et al.  Filtered Document Retrieval with Frequency-Sorted Indexes , 1996, J. Am. Soc. Inf. Sci..

[120]  Pavel Zezula,et al.  Declustering of key-based partitioned signature files , 1996, TODS.

[121]  Kyoungro Yoon,et al.  Index structures for structured documents , 1996, DL '96.

[122]  Alistair Moffat,et al.  Self-indexing inverted files for fast text retrieval , 1996, TOIS.

[123]  Ricardo A. Baeza-Yates,et al.  Block addressing indices for approximate text retrieval , 1997, CIKM '97.

[124]  David Hawking Scalable Text Retrieval for Large Digital Libraries , 1997, ECDL.

[125]  S. Robertson The probability ranking principle in IR , 1997 .

[126]  Elisa Bertino,et al.  Indexing Techniques for Advanced Database Systems , 1997, The Springer International Series on Advances in Database Systems.

[127]  Fazli Can,et al.  Vertical Framing of Superimposed Signature Files Using Partial Evaluation of Queries , 1997, Inf. Process. Manag..

[128]  David Salomon,et al.  Data Compression: The Complete Reference , 2006 .

[129]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[130]  W. Bruce Croft,et al.  Proceedings of the 21th annual international acm sigir conference on research and development in inf , 1998 .

[131]  Donald H. Kraft,et al.  Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval , 1998, SIGIR 2002.

[132]  David Hawking,et al.  Efficiency/effectiveness trade-offs in query processing (from theory into practice workshop, 1998 SIGIR conf.) , 1998, SIGF.

[133]  Alistair Moffat,et al.  Exploring the similarity space , 1998, SIGF.

[134]  J. Kleinberg,et al.  Authoritative Soueces in a Hyper-linked Environment , 1998, SODA 1998.

[135]  Alistair Moffat,et al.  Compressed inverted files with reduced decoding overheads , 1998, SIGIR '98.

[136]  Berthier A. Ribeiro-Neto,et al.  Query performance for tightly coupled distributed digital libraries , 1998, DL '98.

[137]  Divesh Srivastava,et al.  Interaction of query evaluation and buffer management for information retrieval , 1998, SIGMOD '98.

[138]  Alistair Moffat,et al.  Methodologies for distributed information retrieval , 1998, Proceedings. 18th International Conference on Distributed Computing Systems (Cat. No.98CB36183).

[139]  Kotagiri Ramamohanarao,et al.  Inverted files versus signature files for text indexing , 1998, TODS.

[140]  J Allan,et al.  Readings in information retrieval. , 1998 .

[141]  Fredric C. Gey,et al.  Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval , 1999, SIGIR 1999.

[142]  Hugh E. Williams,et al.  Compressing Integers for Fast File Access , 1999, Comput. J..

[143]  Berthier A. Ribeiro-Neto,et al.  Efficient distributed algorithms to build inverted files , 1999, SIGIR '99.

[144]  Ron Sacks-Davis,et al.  Efficient passage ranking for document databases , 1999, TOIS.

[145]  Ian H. Witten,et al.  Managing gigabytes (2nd ed.): compressing and indexing documents and images , 1999 .

[146]  Hugh E. Williams,et al.  What's Next? Index Structures for Efficient Phrase Querying , 1999, Australasian Database Conference.

[147]  Alistair Moffat,et al.  Effective document presentation with a locality-based similarity heuristic , 1999, SIGIR '99.

[148]  Jon Kleinberg,et al.  Authoritative sources in a hyperlinked environment , 1999, SODA '98.

[149]  Amanda Spink,et al.  Selected results from a large study of Web searching: the Excite study , 2000, Inf. Res..

[150]  Kathryn S. McKinley,et al.  Evaluating the performance of distributed architectures for information retrieval using a variety of workloads , 2000, TOIS.

[151]  Khalid Sayood,et al.  Introduction to data compression (2nd ed.) , 2000 .

[152]  Charles L. A. Clarke,et al.  Shortest-substring retrieval and ranking , 2000, TOIS.

[153]  Charles L. A. Clarke,et al.  Relevance ranking for one to three term queries , 1997, Inf. Process. Manag..

[154]  Stephen E. Robertson,et al.  A probabilistic model of information retrieval: development and comparative experiments - Part 2 , 2000, Inf. Process. Manag..

[155]  Stephen E. Robertson,et al.  Parallel search using partitioned inverted files , 2000, Proceedings Seventh International Symposium on String Processing and Information Retrieval. SPIRE 2000.

[156]  Koichi Takeda,et al.  Information retrieval on the web , 2000, CSUR.

[157]  Ricardo Baeza-Yates,et al.  Block addressing indices for approximate text retrieval , 2000 .

[158]  Wagner Meira,et al.  Rank-preserving two-level caching for scalable search engines , 2001, SIGIR '01.

[159]  Alistair Moffat,et al.  Vector-space ranking with effective early termination , 2001, SIGIR '01.

[160]  H. Garcia-Molina,et al.  Building a distributed full-text index for the web , 2001, TOIS.

[161]  Ricardo A. Baeza-Yates,et al.  Distributed Query Processing Using Partitioned Inverted Files , 2001, SPIRE.

[162]  Sriram Raghavan,et al.  Searching the Web , 2001, ACM Trans. Internet Techn..

[163]  N. Ziviani,et al.  Distributed query processing using partitioned inverted files , 2001, Proceedings Eighth Symposium on String Processing and Information Retrieval.

[164]  Sriram Raghavan,et al.  Building a distributed full-text index for the Web , 2001, WWW '01.

[165]  Amanda Spink,et al.  Searching the Web: the public and their queries , 2001 .

[166]  Ronald Fagin,et al.  Static index pruning for information retrieval systems , 2001, SIGIR '01.

[167]  Hans-Jörg Schek,et al.  PowerDB-IR: information retrieval on top of a database cluster , 2001, CIKM '01.

[168]  Hugh E. Williams,et al.  Compression of inverted indexes For fast query evaluation , 2002, SIGIR '02.

[169]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[170]  Jim Segesta,et al.  Harley Tillitt and Computerized Library Searching , 2002, IEEE Ann. Hist. Comput..

[171]  Alistair Moffat,et al.  Searching large text collections , 2002 .

[172]  Edward A. Fox,et al.  Hybrid Partition Inverted Files: Experimental Validation , 2002, ECDL.

[173]  Guy E. Blelloch,et al.  Index compression through document reordering , 2002, Proceedings DCC 2002. Data Compression Conference.

[174]  Alistair Moffat,et al.  Impact transformation: effective and efficient web retrieval , 2002, SIGIR '02.

[175]  Susan T. Dumais,et al.  Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval , 2004, SIGIR 2004.

[176]  Jeffrey Scott Vitter,et al.  Dynamic maintenance of web indexes using landmarks , 2003, WWW '03.

[177]  Gonzalo Navarro,et al.  (S, C)-Dense Coding: An Optimized Compression Code for Natural Language Text Databases , 2003, SPIRE.

[178]  Tien-Fu Chen,et al.  Inverted file compression through document identifier reassignment , 2003, Inf. Process. Manag..

[179]  Justin Zobel,et al.  Efficient single-pass index construction for text databases , 2003, J. Assoc. Inf. Sci. Technol..

[180]  W. Bruce Croft,et al.  Language Modeling for Information Retrieval , 2010, The Springer International Series on Information Retrieval.

[181]  Tien-Fu Chen,et al.  A Tree-Based inverted File for Fast Ranked-Document Retrieval , 2003, IKE.

[182]  Luiz André Barroso,et al.  Web Search for a Planet: The Google Cluster Architecture , 2003, IEEE Micro.

[183]  Wann-Yun Shieh,et al.  An Inverted File Cache for Fast Information Retrieval , 2003, J. Inf. Sci. Eng..

[184]  Shlomo Moran,et al.  Predictive caching and prefetching of query results in search engines , 2003, WWW '03.

[185]  Andrew Trotman,et al.  Compressing Inverted Files , 2004, Information Retrieval.

[186]  Shlomo Moran,et al.  Optimizing result prefetching in web search engines with segmented indices , 2002, TOIT.

[187]  Hugo Zaragoza,et al.  Information Retrieval: Algorithms and Heuristics , 2002, Information Retrieval.

[188]  Ophir Frieder,et al.  Information Retrieval: Algorithms and Heuristics (The Kluwer International Series on Information Retrieval) , 2004 .

[189]  Alistair Moffat,et al.  Inverted Index Compression Using Word-Aligned Binary Codes , 2004, Information Retrieval.

[190]  Kathryn S. McKinley,et al.  Partial Collection Replication for Information Retrieval , 2003, Information Retrieval.

[191]  Fabrizio Silvestri,et al.  Assigning identifiers to documents to enhance the clustering property of fulltext indexes , 2004, SIGIR '04.

[192]  Ricardo A. Baeza-Yates,et al.  Adding Compression to Block Addressing Inverted Indexes , 2000, Information Retrieval.

[193]  Alistair Moffat,et al.  Binary Interpolative Coding for Effective Index Compression , 2000, Information Retrieval.

[194]  Steven Garcia,et al.  Access-Ordered Indexes , 2004, ACSC.

[195]  Shlomo Moran,et al.  Competitive caching of query results in search engines , 2004, Theor. Comput. Sci..

[196]  Alistair Moffat,et al.  SEFT: a search engine for text , 2004, Softw. Pract. Exp..

[197]  Rudolf Bayer,et al.  Organization and maintenance of large ordered indexes , 1972, Acta Informatica.

[198]  Alistair Moffat,et al.  What Does It Mean to "Measure Performance"? , 2004, WISE.

[199]  Shmuel Tomi Klein,et al.  Simple Bayesian Model for Bitmap Compression , 2004, Information Retrieval.

[200]  Hugh E. Williams,et al.  Fast phrase querying with combined indexes , 2004, TOIS.

[201]  Iadh Ounis,et al.  Performance Analysis of Distributed Architectures to Index One Terabyte of Text , 2004, ECIR.

[202]  Alistair Moffat,et al.  Compression and Coding Algorithms , 2005, IEEE Trans. Inf. Theory.

[203]  Ellen M. Voorhees,et al.  TREC: Experiment and Evaluation in Information Retrieval (Digital Libraries and Electronic Publishing) , 2005 .

[204]  W. Bruce Croft,et al.  Optimization strategies for complex queries , 2005, SIGIR '05.

[205]  Charles L. A. Clarke,et al.  Indexing time vs. query time: trade-offs in dynamic information retrieval systems , 2005, CIKM '05.

[206]  Alistair Moffat,et al.  Space-Limited Ranked Query Evaluation Using Adaptive Pruning , 2005, WISE.

[207]  J. Shane Culpepper,et al.  Enhanced Byte Codes with Restricted Prefix Properties , 2005, SPIRE.

[208]  Wann-Yun Shieh,et al.  A statistics-based approach to incrementally update inverted files , 2005, Inf. Process. Manag..

[209]  Alistair Moffat,et al.  Fast on-line index construction by geometric partitioning , 2005, CIKM '05.

[210]  Hugh E. Williams,et al.  Efficient online index maintenance for contiguous inverted lists , 2006, Inf. Process. Manag..

[211]  Alistair Moffat,et al.  A pipelined architecture for distributed text query evaluation , 2007, Information Retrieval.

[212]  José Luis Vicedo González,et al.  TREC: Experiment and evaluation in information retrieval , 2007, J. Assoc. Inf. Sci. Technol..