Language Modeling for Spoken Dialogue System based on Filtering using Predicate-Argument Structures

We present a novel scheme of language modeling for a spoken dialogue system by effectively filtering query sentences collected via a Web site of wisdom of crowds. Our goal is a speechbased information navigation system by retrieving from backend documents such as Web news. Then, we expect that users make queries that are relevant to the backend documents. The relevance measure can be defined with cross-entropy or perplexity by the language model generated from the documents in a conventional manner. In this article, we propose a novel criteria that considers semantic-level information. It is based on predicate-argument (P-A) pairs and their relevance to the documents (or topic) is defined by a naive Bayes score. Experimental evaluations demonstrate that the proposed relevance measure effectively selects relevant sentences used for a language model, resulting in significant reduction of the word error rate of speech recognition as well as the semantic-level error rate.

[1]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[2]  Andreas Stolcke,et al.  Web resources for language modeling in conversational speech recognition , 2007, TSLP.

[3]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[4]  Tatsuya Kawahara,et al.  Spoken Dialogue System based on Information Extraction using Similarity of Predicate Argument Structures , 2011, SIGDIAL Conference.

[5]  Kiyohiro Shikano,et al.  Julius - an open source real-time large vocabulary recognition engine , 2001, INTERSPEECH.

[6]  Ralph Grishman Discovery methods for information extraction , 2003 .

[7]  Tatsuya Kawahara,et al.  A bootstrapping approach for developing language model of new spoken dialogue systems by selecting web texts , 2006, INTERSPEECH.

[8]  Maria T. Pazienza,et al.  Information Extraction , 2002, Lecture Notes in Computer Science.

[9]  Ruhi Sarikaya,et al.  Rapid language model development using external resources for new spoken dialog domains , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[10]  Francoise Beaufays,et al.  Google Search by Voice: A Case Study , 2010 .

[11]  Panayiotis G. Georgiou,et al.  Building topic specific language models from webdata using competitive models , 2005, INTERSPEECH.

[12]  Tatsuya Kawahara,et al.  Bayes Risk-based Dialogue Management for Document Retrieval System with Speech Interface , 2008, COLING.

[13]  Tatsuya Kawahara New perspectives on spoken language understanding: Does machine need to fully understand speech? , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.