We present the participation of the University of Ottawa in the Cross-Language Spoken Document Retrieval task at CLEF 2005. In order to translate the queries, we combined the results of several online Machine Translation tools. For the Information Retrieval component we used the SMART system, with several weighting schemes for indexing the documents and the queries. One scheme in particular lead to better results than other combinations. We present the results of the submitted runs and of many unofficial runs. We compare the effect of several translations from each language. We present results on phonetic transcripts of the collection and queries and on the combination of text and phonetic transcripts. We also include the results when the manual summaries and keywords are indexed.
[1]
Bhuvana Ramabhadran,et al.
Building an information retrieval test collection for spontaneous conversational speech
,
2004,
SIGIR '04.
[2]
James Allan,et al.
Automatic Retrieval With Locality Information Using SMART
,
1992,
TREC.
[3]
Gerard Salton,et al.
Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer
,
1989
.
[4]
Gerard Salton,et al.
Term-Weighting Approaches in Automatic Text Retrieval
,
1988,
Inf. Process. Manag..
[5]
Diana Inkpen,et al.
Semantic Similarity for Detecting Recognition Errors in Automatic Speech Transcripts
,
2005,
HLT.