SAIL-GRS: Grammar Induction for Spoken Dialogue Systems using CF-IRF Rule Similarity

The SAIL-GRS system is based on a widely used approach originating from information retrieval and document indexing, the TF -IDF measure. In this implementation for spoken dialogue system grammar induction, rule constituent frequency and inverse rule frequency measures are used for estimating lexical and semantic similarity of candidate grammar rules to a seed set of rule pattern instances. The performance of the system is evaluated for the English language in three different domains, travel, tourism and finance and in the travel domain, for Greek. The simplicity of our approach makes it quite easy and fast to implement irrespective of language and domain. The results show that the SAIL-GRS system performs quite well in all three domains and in both languages.

[1]  Tatsuya Kawahara,et al.  Spoken Dialogue System based on Information Extraction using Similarity of Predicate Argument Structures , 2011, SIGDIAL Conference.

[2]  Xin Liu,et al.  Generic text summarization using relevance measure and latent semantic analysis , 2001, SIGIR '01.

[3]  David Traum,et al.  First Steps Towards Dialogue Modelling from an Un-annotated Human-Human Corpus , 2007 .

[4]  Martin Dillon,et al.  FASIT: A fully automatic syntactically based indexing system , 1983, J. Am. Soc. Inf. Sci..

[5]  Marie-Francine Moens,et al.  Representations for multi-document event clustering , 2012, Data Mining and Knowledge Discovery.

[6]  Alexander I. Rudnicky,et al.  Unsupervised induction and filling of semantic slots for spoken dialogue systems using frame-semantic parsing , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[7]  Chin-Hui Lee,et al.  Metrics for measuring domain independence of semantic classes , 2001, INTERSPEECH.

[8]  Alexandros Potamianos,et al.  A soft-clustering algorithm for automatic induction of semantic classes , 2007, INTERSPEECH.

[9]  Wayne H. Ward,et al.  The CMU Air Travel Information Service: Understanding Spontaneous Speech , 1990, HLT.

[10]  Helen M. Meng,et al.  Semiautomatic Acquisition of Semantic Structures for Understanding Domain-Specific Natural Language Queries , 2002, IEEE Trans. Knowl. Data Eng..

[11]  Arne Jönsson,et al.  IJCAI 2003 workshop on Knowledge and Reasoning in Practical Dialogue Systems , 2003 .

[12]  Stephanie Seneff,et al.  TINA: A Natural Language System for Spoken Language Applications , 1992, Comput. Linguistics.