Specialized language models using dialogue predictions

Analyses language modeling in spoken dialogue systems for accessing a database. The use of several language models obtained by exploiting dialogue predictions gives better results than the use of a single model for the whole dialogue interaction. For this reason, several models have been created, each one for a specific system question, such as the request for or the confirmation of a parameter. The use of dialogue-dependent language models increases the performance both at the recognition level and at the understanding level, especially on answers to system requests. Moreover, using other methods to increase the performance, like the automatic clustering of vocabulary words or the use of better acoustic models during recognition, does not affect the improvements given by dialogue-dependent language models. The system used in our experiments is Dialogos, the Italian spoken dialogue system used for accessing railway timetable information over the telephone. The experiments were carried out on a large corpus of dialogues collected using Dialogos.

[1]  Morena Danieli,et al.  Managing dialogue in a continuous speech understanding system , 1993, EUROSPEECH.

[2]  David M. Carter,et al.  Improving Language Models by Clustering Training Sentences , 1994, ANLP.

[3]  Sheryl R. Young,et al.  The MINDS System: Using Context and Dialog to Enhance Speech Recognition , 1989, HLT.

[4]  Günther Görz,et al.  Towards understanding spontaneous speech: word accuracy vs. concept accuracy , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[5]  François Andry Static and dynamic predictions : a method to improve speech understanding in cooperative dialogues , 1992, ICSLP.

[6]  Norbert Reithinger,et al.  Utilizing Statistical Dialogue Act Processing in Verbrnobil , 1995, ACL.

[7]  Heinrich Niemann,et al.  Combining stochastic and linguistic language models for recognition of spontaneous speech , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[8]  Egidio P. Giachin,et al.  Automatic clustering of words for probabilistic language models , 1995, EUROSPEECH.

[9]  Morena Danieli,et al.  Dialogos: a robust system for human-machine spoken dialogue on the telephone , 1996, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Paolo Baggia,et al.  Partial parsing as a robust parsing strategy , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Roberto Gemello,et al.  Recurrent network automata for speech recognition: a summary of recent work , 1994, Proceedings of IEEE Workshop on Neural Networks for Signal Processing.