Paper information - An integration method of retrieval results using plural subword models for vocabulary-free spoken document retrieval

Spoken document retrieval (SDR) systems must be vocabulary-free to handle arbitrary query words: users often search for the section in which a query word is spoken, and query words tend to be specialized terms that are not included in a speech recognizer’s dictionary. We previously proposed new subword models, such as the 1/2 phone model, the 1/3 phone model, and the sub-phonetic segment (SPS) model, and confirmed their effectiveness for SDR [1]. These models are more finely resolved along the time axis than phoneme models such as the triphone model. This paper proposes a method for integrating the multiple retrieval results obtained from the individual subword models and demonstrates the resulting performance improvement through experiments on an actual presentation speech corpus.
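The abstract does not state how the per-model results are combined, so the following is only a minimal sketch of one common late-fusion scheme (min-max normalization followed by a weighted sum), not the paper's method. The function names, the model labels, the `{segment_id: score}` layout, and the assumption that higher scores mean better matches are all hypothetical.

```python
from collections import defaultdict


def min_max_normalize(scores):
    """Rescale one model's scores to [0, 1] so that models become comparable."""
    lo, hi = min(scores.values()), max(scores.values())
    if hi == lo:
        # All segments scored equally; treat them as equally good.
        return {seg: 1.0 for seg in scores}
    return {seg: (s - lo) / (hi - lo) for seg, s in scores.items()}


def integrate_results(results_by_model, weights=None):
    """Fuse per-model {segment_id: score} dicts with a weighted sum.

    Assumption: a higher score means a better match. Segments missing
    from a model's result list contribute 0 for that model.
    """
    if weights is None:
        # Hypothetical default: weight every subword model equally.
        weights = {m: 1.0 / len(results_by_model) for m in results_by_model}
    fused = defaultdict(float)
    for model, scores in results_by_model.items():
        if not scores:
            continue
        for seg, s in min_max_normalize(scores).items():
            fused[seg] += weights[model] * s
    # Best-scoring segments first.
    return sorted(fused.items(), key=lambda kv: kv[1], reverse=True)


# Illustrative scores only; these are not results from the paper.
results = {
    "half_phone": {"a": 1.0, "b": 0.0},
    "third_phone": {"a": 0.5, "b": 1.0, "c": 0.0},
}
ranked = integrate_results(results)
```

With equal weights, a segment that several models rank highly rises above one that only a single model favors. Whether the paper uses score-level or rank-level integration cannot be determined from the abstract.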

[1] Ellen M. Voorhees, et al. The TREC Spoken Document Retrieval Track: A Success Story, 2000, TREC.

[2] Katunobu Itou, et al. Evaluating Speech-Driven IR in the NTCIR-3 Web Retrieval Task, 2002, NTCIR.

[3] Richard P. Lippmann, et al. Techniques for information retrieval from voice messages, 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[4] Kazuyo Tanaka, et al. A speech recognition method with a language-independent intermediate phonetic code, 2000, INTERSPEECH.

[5] Jonathan G. Fiscus, et al. Automatic Language Model Adaptation for Spoken Document Retrieval, 2000, RIAO.

[6] Shi-wook Lee, et al. Open-vocabulary spoken document retrieval based on new subword models and subword phonetic similarity, 2006, INTERSPEECH.

[7] Shi-wook Lee, et al. Two-stage vocabulary-free spoken document retrieval - subword identification and re-recognition of the identified sections, 2006, INTERSPEECH.

[8] Shuichi Itahashi, et al. JNAS: Japanese speech corpus for large vocabulary continuous speech recognition research, 1999.