Improving Automatic Call Classification using Machine Translation

Utterance classification is an important task in spoken-dialog systems. The response of the system is dependent on category assigned to the speaker's utterance by the classifier. However, often the input speech is spontaneous and noisy which results in high word error rates. This results in unsatisfactory system performance. In this paper we describe a method to improve the natural language call classification task using statistical machine translation (SMT). We utilize the translation model in SMT to capture the relation between truth and the ASR transcribed text. The model is trained using the human transcribed text and the ASR transcribed text. During deployment SMT is used to sanitize the ASR transcribed text. Our experiments with IBM model 2 shows significant improvement in call classification accuracy.

[1]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[2]  Chin-Hui Lee,et al.  Discriminative training in natural language call routing , 2000, INTERSPEECH.

[3]  Chin-Hui Lee,et al.  Natural language call routing: towards combination and boosting of classifiers , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..

[4]  Robert E. Schapire,et al.  Boosting with prior knowledge for call classification , 2005, IEEE Transactions on Speech and Audio Processing.

[5]  Xiang Li,et al.  Improving end-to-end performance of call classification through data confusion reduction and model tolerance enhancement , 2005, INTERSPEECH.

[6]  Tanja Schultz,et al.  Document driven machine translation enhanced ASR , 2005, INTERSPEECH.

[7]  N. Tyson,et al.  Improved lsi-based natural language call routing using speech recognition confidence scores , 2004, Second IEEE International Conference on Computational Cybernetics, 2004. ICCC 2004..

[8]  Olivier Siohan,et al.  Multiple classifiers by constrained minimization , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[9]  Bob Carpenter,et al.  Vector-based Natural Language Call Routing , 1999, Comput. Linguistics.