Language modeling for spoken dialogue system based on sentence transformation and filtering using predicate-argument structures

We present a novel scheme of language modeling for a spoken dialogue system by effectively exploiting the back-end documents the system uses for information navigation. The proposed method first converts sentences in the document, which are written and plain style, into spoken question-style queries, which are expected in spoken dialogue. In this process, we conduct dependency analysis to extract verbs and relevant phrases to generate natural sentences by applying transformation rules. Then, we select sentences which have useful information relevant to the target domain and thus are more likely to be queried. For this purpose, we define predicate-argument (P-A) templates based on a statistical measure in the target document. An experimental evaluation shows that the proposed method outperforms the conventional method in ASR performance, and the sentence selection based on the P-A templates is effective.

[1]  Tatsuya Kawahara New perspectives on spoken language understanding: Does machine need to fully understand speech? , 2009, 2009 IEEE Workshop on Automatic Speech Recognition & Understanding.

[2]  Maria T. Pazienza,et al.  Information Extraction , 2002, Lecture Notes in Computer Science.

[3]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[4]  Kentaro Torisawa,et al.  Similarity Based Language Model Construction for Voice Activated Open-Domain Question Answering , 2011, IJCNLP.

[5]  Ralph Grishman Discovery methods for information extraction , 2003 .

[6]  Panayiotis G. Georgiou,et al.  Building topic specific language models from webdata using competitive models , 2005, INTERSPEECH.

[7]  Tatsuya Kawahara,et al.  Spoken Dialogue System based on Information Extraction using Similarity of Predicate Argument Structures , 2011, SIGDIAL Conference.

[8]  R. Iida,et al.  Learning-Based Argument Structure Analysis of Event-Nouns in Japanese , 2007 .

[9]  益子 真由美 Argument Structure , 1993, The Lexicon.

[10]  Andreas Stolcke,et al.  Web resources for language modeling in conversational speech recognition , 2007, TSLP.

[11]  Ruhi Sarikaya,et al.  Rapid language model development using external resources for new spoken dialog domains , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[12]  Tatsuya Kawahara,et al.  A bootstrapping approach for developing language model of new spoken dialogue systems by selecting web texts , 2006, INTERSPEECH.

[13]  Tatsuya Kawahara,et al.  Bayes Risk-based Dialogue Management for Document Retrieval System with Speech Interface , 2008, COLING.