Statistical Error Correction Methods for Domain-Specific ASR Systems

Whenever an ASR company promises to deliver error-proof transcripts to the end user, manual verification and correction of the raw ASR transcripts cannot be avoided. This manual post-editing process systematically generates new and correct domain-specific data which can be used to incrementally improve the original ASR system. This paper proposes a statistic, SMT-based ASR error correction method, which takes advantage of the past corrected ASR errors to automatically post-process its future transcripts. We show that the proposed method can bring more than 10% WER improvements using only 2000 user-corrected sentences.

[1]  Eric K. Ringger,et al.  A fertility channel model for post-correction of continuous speech recognition , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[2]  Hitoshi Iida,et al.  A Method for Correcting Errors in Speech Recognition Using the Statistical Features of Character Co-occurence , 1998, COLING-ACL.

[3]  Horia Cucu,et al.  Investigating the role of machine translated text in ASR domain adaptation: Unsupervised and semi-supervised methods , 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding.

[4]  David D. Palmer,et al.  Context-based Speech Recognition Error Detection and Correction , 2004, NAACL.

[5]  Horia Cucu,et al.  Enhancing Automatic Speech Recognition for Romanian by Using Machine Translated and Web-based Text Corpora , 2011 .

[6]  Roland Kuhn,et al.  Rule-Based Translation with Statistical Phrase-Based Post-Editing , 2007, WMT@ACL.

[7]  Horia Cucu,et al.  ASR domain adaptation methods for low-resourced languages: Application to Romanian language , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[8]  Gary Geunbae Lee,et al.  Speech recognition error correction using maximum entropy language model , 2004, INTERSPEECH.

[9]  Mukund Padmanabhan,et al.  Error corrective mechanisms for speech recognition , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[10]  Michel Simard,et al.  Statistical Phrase-Based Post-Editing , 2007, NAACL.

[11]  Mei-Yuh Hwang,et al.  Unsupervised learning from users' error correction in speech dictation , 2004, INTERSPEECH.

[12]  Francisco Casacuberta,et al.  Statistical Post-Editing of a Rule-Based Machine Translation System , 2009, NAACL.