TREC 7 Ad Hoc, Speech, and Interactive tracks at MDS/CSIRO

In TREC-5 we used document retrieval based on arbitrary passages [8, 9], or fixed-length passages that could start at any word position. Although far from the best runs in TREC-5, these results were promising, in particular for long documents. In TREC-6 we continued with arbitrary passages, but our main emphasis was on comprehensive factor analysis of successful automatic query expansion and refinement methods in the context of the vector space model [5]. This year we have refined the MG retrieval system to include Rocchio-based relevance feedback. Also, phrase matching has been added. We have continued to use arbitrary passages and combination of evidence for document retrieval.
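The idea of arbitrary passages described above (fixed-length windows that may start at any word position) can be illustrated with a minimal sketch. The function names, the whitespace tokenisation, and the simple query-term-count score are assumptions made for illustration only; they are not the similarity measure used in MG or in the reported runs.

```python
def arbitrary_passages(words, length):
    """Yield (start, passage) for every fixed-length window of `words`.

    A document shorter than `length` yields itself once as a single passage.
    """
    if len(words) <= length:
        yield 0, words
        return
    for start in range(len(words) - length + 1):
        yield start, words[start:start + length]


def best_passage(words, query_terms, length):
    """Return (score, start) of the highest-scoring passage.

    The score is a plain count of query-term occurrences (an assumed,
    deliberately simple stand-in for a real similarity measure).
    Ties are resolved in favour of the earliest passage.
    """
    query = {t.lower() for t in query_terms}
    best_score, best_start = -1, 0
    for start, passage in arbitrary_passages(words, length):
        score = sum(1 for w in passage if w.lower() in query)
        if score > best_score:
            best_score, best_start = score, start
    return best_score, best_start


doc = "the cat sat on the mat while the dog chased the cat around the mat".split()
score, start = best_passage(doc, ["dog", "cat"], length=4)
print(score, start, doc[start:start + 4])
```

In this sketch a document is ranked by its best passage, which is one way long documents with a short relevant region can still score well.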

[1] Justin Zobel, et al. Passage retrieval revisited, 1997, SIGIR '97.

[2] Julie Beth Lovins, et al. Development of a stemming algorithm, 1968, Mech. Transl. Comput. Linguistics.

[3] Karen Spärck Jones, et al. Retrieving spoken documents by combining multiple index sources, 1996, SIGIR '96.

[4] Ross Wilkinson, et al. MDS TREC6 Report, 1997, TREC.

[5] Victor Zue, et al. Subword unit representations for spoken document retrieval, 1997, EUROSPEECH.

[6] M. Boughanem, et al. Okapi at TREC-6 Automatic ad hoc, VLC, routing, filtering and, 1997.

[7] I. G. BONNER CLAPPISON Editor, 1960, The Electric Power Engineering Handbook - Five Volume Set.

[8] L. R. Rasmussen, et al. In information retrieval: data structures and algorithms, 1992.

[9] Steve Young, et al. The HTK book, 1995.

[10] Marti A. Hearst, et al. Reexamining the cluster hypothesis: scatter/gather on retrieval results, 1996, SIGIR '96.

[11] Gerard Salton, et al. The SMART Retrieval System: Experiments in Automatic Document Processing, 1971.

[12] Joel L Fagan, et al. Experiments in Automatic Phrase Indexing For Document Retrieval: A Comparison of Syntactic and Non-Syntactic Methods, 1987.

[13] Ross Wilkinson, et al. The MDS Experiments for TREC5, 1996, TREC.

[14] Ian H. Witten, et al. Managing Gigabytes: Compressing and Indexing Documents and Images, 1999.

[15] James Allan, et al. The effect of adding relevance information in a relevance feedback environment, 1994, SIGIR '94.