Maximum entropy models for speech confidence estimation

In this work we implement a confidence estimation system based on a naive Bayes classifier, using the maximum entropy paradigm. The model draws on several information sources, including a set of scores that have proved useful in confidence estimation tasks. Two approaches are modeled: first, a basic model that takes advantage of smoothing techniques used in previous work; second, an optimized model designed to retain only a few essential characteristics of the model without a decrease in performance. The optimized model has considerably fewer parameters than the basic model. Both models are evaluated on two different corpora and compared with a previously developed model.
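As a rough illustration of the general idea, and not the system described in this work, the sketch below trains a two-class maximum entropy model that maps discretized word-level scores to a probability that the recognized word is correct. With two classes the maximum entropy model reduces to logistic regression over the active binary features. The feature names (`post=high`, `dur=short`, and so on), the toy samples, the L2 penalty standing in for smoothing, and the training settings are all assumptions made for the example.

```python
import math

def maxent_prob(weights, features):
    """P(word is correct | active features) under a two-class log-linear model."""
    score = sum(weights.get(f, 0.0) for f in features)
    return 1.0 / (1.0 + math.exp(-score))

def train(samples, epochs=200, lr=0.5, l2=0.01):
    """Batch gradient ascent on the L2-penalized conditional log-likelihood.

    samples: list of (active_feature_names, label), label 1 = correct word.
    The L2 penalty is an assumed stand-in for smoothing, not the paper's method.
    """
    weights = {}
    for _ in range(epochs):
        grad = {}
        for features, label in samples:
            # Gradient of the log-likelihood: observed minus expected label.
            err = label - maxent_prob(weights, features)
            for f in features:
                grad[f] = grad.get(f, 0.0) + err
        for f, g in grad.items():
            w = weights.get(f, 0.0)
            weights[f] = w + lr * (g / len(samples) - l2 * w)
    return weights

# Hypothetical toy data: discretized posterior score and word duration.
samples = [
    (["bias", "post=high", "dur=long"], 1),
    (["bias", "post=high", "dur=short"], 1),
    (["bias", "post=low", "dur=short"], 0),
    (["bias", "post=low", "dur=long"], 0),
]
weights = train(samples)
p_high = maxent_prob(weights, ["bias", "post=high", "dur=short"])
p_low = maxent_prob(weights, ["bias", "post=low", "dur=short"])
```

In this toy setting a word would be accepted when its estimated probability exceeds a chosen threshold; the threshold value and the feature discretization would need tuning on held-out data.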
