Amberfish at the TREC 2004 Terabyte Track

The TREC 2004 Terabyte Track evaluated information retrieval in largescale text collections, using a set of 25 million documents (426 GB). This paper gives an overview of our experiences with this collection and describes Ambersh, the text retrieval software used for the experiments.