Paper information - Scalable and portable web-based multimodal dialogue interaction with geographical databases

We describe work towards developing a scalable and portable framework for enabling map-based multimodal dialogue interaction over the web. Working in the context of a restaurant-guide system, we show how large information databases harvested from the web can be accommodated in our speech recognizer, parser, and web-based GUI. We compare two dynamic language modeling techniques, which calculate context-dependent weights for the large sets of proper nouns associated with geographical entities such as restaurants and streets. We show that the more fine-grained approach results in a 7.8% reduction in concept error rate.

Index Terms: multimodal dialogue system, language modeling, restaurants, maps, world wide web

Stephanie Seneff | Chao Wang | Alexander Gruenstein
