PHAST: Spoken document retrieval based on sequence alignment

This paper presents a new approach to spoken document information retrieval for spontaneous speech corpora. Classical approach to this problem is the use of an automatic speech recognizer (ASR) combined with standard information retrieval techniques, based on terms or n-grams. However, state-of-the-art large vocabulary continuous ASRs produce transcripts of spontaneous speech with a word error rate of 25% or higher, which is a drawback for retrieval techniques based on terms or n-grams. In order to overcome such a limitation, our method is based on a sequence alignment algorithm drawn from the field of bioinformatics to search

[1]  Gareth J. F. Jones,et al.  Investigating Cross-Language Speech Retrieval for a Spontaneous Conversational Speech Collection , 2006, HLT-NAACL.

[2]  Stephen E. Robertson,et al.  GatfordCentre for Interactive Systems ResearchDepartment of Information , 1996 .

[3]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[4]  Diana Inkpen,et al.  University of Ottawa's Participation in the CL-SR Task at CLEF 2006 , 2006, CLEF.

[5]  Grzegorz Kondrak,et al.  A New Algorithm for the Alignment of Phonetic Sequences , 2000, ANLP.

[6]  Graeme Hirst,et al.  Algorithms for language reconstruction , 2002 .

[7]  Gerald Salton,et al.  Automatic text processing , 1988 .

[8]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[9]  Dragutin Petkovic,et al.  Phonetic confusion matrix based spoken document retrieval , 2000, SIGIR '00.

[10]  Richard Sproat,et al.  Lattice-Based Search for Spoken Utterance Retrieval , 2004, NAACL.

[11]  Jianqiang Wang,et al.  CLEF-2005 CL-SR at Maryland: Document and Query Expansion using Side Collections and Thesauri , 2005, CLEF.

[12]  Brett Kessler,et al.  Phonetic comparison algorithms , 2005 .

[13]  C. J. van Rijsbergen,et al.  Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.

[14]  Ryen W. White,et al.  Overview of the CLEF-2005 Cross-Language Speech Retrieval Track , 2005, CLEF.

[15]  Diana Inkpen,et al.  Using Various Indexing Schemes and Multiple Translations in the CL-SR Task at CLEF 2005 , 2005, CLEF.

[16]  Ryen W. White,et al.  Overview of the CLEF-2006 Cross-Language Speech Retrieval Track , 2006, CLEF.

[17]  Ke Zhang,et al.  Dublin City University at CLEF 2006: Cross-Language Speech Retrieval (CL-SR) Experiments , 2006, CLEF.

[18]  Ellen M. Voorhees,et al.  The TREC Spoken Document Retrieval Track: A Success Story , 2000, TREC.

[19]  Jeffrey D. Ullman,et al.  Introduction to Automata Theory, Languages and Computation , 1979 .

[20]  Vladimir I. Levenshtein,et al.  Binary codes capable of correcting deletions, insertions, and reversals , 1965 .

[21]  Sanda Harabagiu,et al.  High-performance, open-domain question answering from large text collections , 2001 .