论文信息 - Décodage conceptuel et apprentissage automatique : application au corpus de dialogue Homme-Machine MEDIA

Décodage conceptuel et apprentissage automatique : application au corpus de dialogue Homme-Machine MEDIA

Within the framework of the French evaluation program MEDIA on spoken dialogue systems, this paper presents the methods proposed at the LIA for the robust extraction of basic conceptual constituents (or concepts) from an audio message. The conceptual decoding model proposed follows a stochastic paradigm and is directly integrated into the Automatic Speech Recognition (ASR) process. This approach allows us to keep the probabilistic search space on words produced by the ASR module and to project it to a probabilistic search space of concepts. The experiments carried on on the MEDIA corpus show that the performance reached by our approach is state of the art on manual transcriptions of dialogues. By partitioning the training corpus according to different sizes, one can measure the impact of the training corpus on the decoding performance and therefore estimate the minimal as well as the optimal number of dialogue examples needed. Finally we detail how a priori knowledge can be integrated in our models in order to increase their coverage and therefore lowering, for the same level of performance, the amount of training corpus needed.

Frédéric Béchet | Christophe Servan

[1] Frédéric Béchet,et al. On the use of finite state transducers for semantic interpretation , 2006, Speech Commun..

[2] Stephanie Seneff,et al. TINA: A Natural Language System for Spoken Language Applications , 1992, Comput. Linguistics.

[3] Martin Rajman,et al. Lattice Parsing for Speech Recognition , 1999 .

[4] Sophie Rosset,et al. Semantic annotation of the French media dialog corpus , 2005, INTERSPEECH.

[5] Brendan J. Frey,et al. Combination of statistical and rule-based approaches for spoken language understanding , 2002, INTERSPEECH.

[6] Eugene Charniak,et al. Equations for Part-of-Speech Tagging , 1993, AAAI.

[7] H. Bonneau-Maynard,et al. A 2+1-level stochastic understanding model , 2005, IEEE Workshop on Automatic Speech Recognition and Understanding, 2005..

[8] Jérôme Goulian,et al. Quand le TAL robuste s’attaque au langage parlé : analyse incrémentale pour la compréhension de la parole spontanée , 2003, JEPTALNRECITAL.

[9] Brian Roark,et al. Markov Parsing: Lattice Rescoring with a Statistical Parser , 2002, ACL.

[10] Fernando Pereira,et al. Weighted finite-state transducers in speech recognition , 2002, Comput. Speech Lang..

[11] Roberto Pieraccini,et al. Concept-based spontaneous speech understanding system , 1995, EUROSPEECH.