Latent Semantic Indexing (LSI) and TREC-2

Latent Semantic Indexing (LSI) is an extension of the vector retrieval method (e.g., Salton & McGill, 1983) in which the dependencies between terms are explicitly taken into account in the representation and exploited in retrieval. This is done by simultaneously modeling all the interrelationships among terms and documents. We assume that there is some underlying or "latent" structure in the pattern of word usage across documents, and use statistical techniques to estimate this latent structure. Terms, documents, and user queries are then described by this underlying "latent semantic" structure rather than by surface-level word choice, and that description is used for representing and retrieving information. One advantage of the LSI representation is that a query can be very similar to a document even when they share no words.
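The last point can be illustrated with a minimal sketch. It assumes the latent structure is estimated with a truncated singular value decomposition of a term-document matrix, as in the LSI literature cited here, and it uses raw term counts with no term weighting. The toy vocabulary, the four documents, the choice of k = 2, and the helper names (`term_document_matrix`, `lsi_fit`, `fold_in`, `cosine`) are all hypothetical and chosen only for illustration; this is not the configuration used for TREC-2.

```python
import numpy as np

# Hypothetical toy collection: two "vehicle" documents, two "flower" documents.
TERMS = ["car", "automobile", "engine", "flower", "petal"]
DOCS = ["car engine", "automobile engine", "flower petal", "flower"]


def term_document_matrix(docs, terms):
    """Raw term counts, one row per term and one column per document."""
    index = {t: i for i, t in enumerate(terms)}
    A = np.zeros((len(terms), len(docs)))
    for j, doc in enumerate(docs):
        for word in doc.split():
            A[index[word], j] += 1.0
    return A


def lsi_fit(A, k):
    """Keep the k largest singular triplets of A (truncated SVD)."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    # Rows of the returned third array are the k-dimensional document vectors.
    return U[:, :k], s[:k], Vt[:k, :].T


def fold_in(query, terms, U_k, s_k):
    """Project a query's term vector into the k-dimensional space."""
    q = np.zeros(len(terms))
    for word in query.split():
        if word in terms:
            q[terms.index(word)] += 1.0
    return (q @ U_k) / s_k


def cosine(a, b):
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


A = term_document_matrix(DOCS, TERMS)
U_k, s_k, doc_vecs = lsi_fit(A, k=2)
q_hat = fold_in("car", TERMS, U_k, s_k)

# The query "car" and document 1 ("automobile engine") share no words.
raw_query = np.array([1.0, 0.0, 0.0, 0.0, 0.0])
raw_sim = cosine(raw_query, A[:, 1])   # similarity in the original term space
lsi_sim = cosine(q_hat, doc_vecs[1])   # similarity in the latent space

print(f"term-space cosine: {raw_sim:.3f}, LSI cosine: {lsi_sim:.3f}")
```

In this toy collection, "car" and "automobile" both co-occur with "engine", so the two vehicle documents fall on the same latent dimension. The query "car" therefore has zero similarity to "automobile engine" in the original term space but a high similarity to it in the reduced space, while staying dissimilar to the flower documents.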

[1] Stephen I. Gallant, et al. TIPSTER Panel - HNC's MatchPlus System, 1992, TREC.

[2] Susan T. Dumais, et al. The Relevance Density Method for Multi-Topic Queries in Information Retrieval, 1992.

[3] James Allan, et al. Automatic Retrieval With Locality Information Using SMART, 1992, TREC.

[4] Gregory Grefenstette, et al. CLARIT TREC Design, Experiments, and Results, 1992, TREC.

[5] Susan T. Dumais, et al. Iterative Searching in an Online Database, 1991.

[6] Susan T. Dumais, et al. Improving the retrieval of information from external sources, 1991.

[7] Ellen M. Voorhees, et al. On Expanding Query Vectors with Lexically Related Words, 1993, TREC.

[8] James Allan, et al. Automatic Routing and Ad-hoc Retrieval Using SMART: TREC 2, 1993, TREC.

[9] Lisa F. Rau, et al. A Boolean Approximation Method for Query Construction and Topic Assignment in TREC, 1992, TREC.

[10] Richard A. Harshman, et al. Indexing by Latent Semantic Analysis, 1990, J. Am. Soc. Inf. Sci.

[11] J. Cullum, et al. Lanczos algorithms for large symmetric eigenvalue computations, 1985.

[12] Paul E. Nelson. Site Report for the Text REtrieval Conference, 1992, TREC.

[13] Susan T. Dumais, et al. Personalized information delivery: an analysis of information filtering methods, 1992, CACM.

[14] Donna Harman. The First Text REtrieval Conference (TREC-1), 1993.

[15] Michael McGill, et al. Introduction to Modern Information Retrieval, 1983.

[16] Susan T. Dumais, et al. LSI meets TREC: A Status Report, 1992, TREC.