论文信息 - Two-stage vocabulary-free spoken document retrieval - subword identification and re-recognition of the identified sections

Two-stage vocabulary-free spoken document retrieval - subword identification and re-recognition of the identified sections

A query word for retrieval systems is liable to be a special term not included in a speech recognizer dictionary. Spoken document retrieval (SDR) systems must therefore be vocabulary-free to deal with arbitrary query words. This paper proposes a new method for vocabulary-free spoken document retrieval. The method exploits two-stage tactics. First, when a query word is submitted, the query word is transformed to a subword sequence according to conversion rules. The subword sequence is searched for spoken documents previously transcribed to a subword sequence by subword recognition. The identified sections are extracted according to the distance between the subword sequences of the query and the identified sections. Second, each identified section is re-recognized using a grammar that includes the query subword sequence. Retrieval experiments were conducted with an actual TV program and the results demonstrated that the proposed method improved SDR performance without long delays in retrieval.

[1] Shi-wook Lee,et al. Robust Spoken Document Retrieval Based on Multilingual Subphonetic Segment Recognition , 2004, ICEIS.

[2] Fabio Crestani,et al. Combination of similarity measures for effective spoken document retrieval , 2003, J. Inf. Sci..

[3] Katunobu Itou,et al. Evaluating Speech-Driven IR in the NTCIR-3 Web Retrieval Task , 2002, NTCIR.

[4] Kiyohiro Shikano,et al. Julius - an open source real-time large vocabulary recognition engine , 2001, INTERSPEECH.

[5] Kazuyo Tanaka,et al. A speech recognition method with a language-independent intermediate phonetic code , 2000, INTERSPEECH.

[6] Dragutin Petkovic,et al. Phonetic confusion matrix based spoken document retrieval , 2000, SIGIR '00.

[7] Jonathan G. Fiscus,et al. Automatic Language Model Adaptation for Spoken Document Retrieval , 2000, RIAO.

[8] Ellen M. Voorhees,et al. The TREC Spoken Document Retrieval Track: A Success Story , 2000, TREC.

[9] Shuichi Itahashi,et al. JNAS: Japanese speech corpus for large vocabulary continuous speech recognition research , 1999 .

[10] Richard P. Lippmann,et al. Techniques for information retrieval from voice messages , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.