论文信息 - Towards Speech-Driven Question Answering: Experiments Using the NTCIR-3 Question Answering Collection

Towards Speech-Driven Question Answering: Experiments Using the NTCIR-3 Question Answering Collection

We developed a method for producing statistical language models for speech-driven question answering, which recognizes spoken questions with high accuracy. Our method uses a target collection (i.e., a document set from which answers are derived) to extract N-grams, and adapts them to the questionanswering task by way of frozen patterns typically used in interrogative questions. In addition, our method magnifies N-gram statistics corresponding to frozen patterns in the original N-gram. For the purpose of experiments, we used dictated questions in the NTCIR-3 QAC test collection, and showed that our method outperformed a conventional language model adaptation method in terms of the speech recognition accuracy.

Katunobu Itou | Tetsuya Ishikawa | Atsushi Fujii | Tomoyosi Akiba

[1] Ellen M. Voorhees,et al. The TREC-8 Question Answering Track Evaluation , 2000, TREC.

[2] Katunobu Itou,et al. Speech-Driven Text Retrieval: Using Target IR Collections for Statistical Language Model Adaptation in Speech Recognition , 2001, SIGIR Workshop: Information Retrieval Techniques for Speech Applications.

[3] Marcello Federico,et al. Bayesian estimation methods for n-gram language model adaptation , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[4] Pascale Fung,et al. The estimation of powerful language models from small and large corpora , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5] Eric K. Ringger,et al. Rapid language model development for new task domains , 1998 .

[6] Kiyohiro Shikano,et al. Julius - an open source real-time large vocabulary recognition engine , 2001, INTERSPEECH.