Towards Speech-Driven Question Answering: Experiments Using the NTCIR-3 Question Answering Collection

We developed a method for producing statistical language models for speech-driven question answering, which recognizes spoken questions with high accuracy. Our method uses a target collection (i.e., a document set from which answers are derived) to extract N-grams, and adapts them to the questionanswering task by way of frozen patterns typically used in interrogative questions. In addition, our method magnifies N-gram statistics corresponding to frozen patterns in the original N-gram. For the purpose of experiments, we used dictated questions in the NTCIR-3 QAC test collection, and showed that our method outperformed a conventional language model adaptation method in terms of the speech recognition accuracy.

[1]  Ellen M. Voorhees,et al.  The TREC-8 Question Answering Track Evaluation , 2000, TREC.

[2]  Katunobu Itou,et al.  Speech-Driven Text Retrieval: Using Target IR Collections for Statistical Language Model Adaptation in Speech Recognition , 2001, SIGIR Workshop: Information Retrieval Techniques for Speech Applications.

[3]  Marcello Federico,et al.  Bayesian estimation methods for n-gram language model adaptation , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[4]  Pascale Fung,et al.  The estimation of powerful language models from small and large corpora , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Eric K. Ringger,et al.  Rapid language model development for new task domains , 1998 .

[6]  Kiyohiro Shikano,et al.  Julius - an open source real-time large vocabulary recognition engine , 2001, INTERSPEECH.