Improving on-line handwritten recognition in interactive machine translation

On-line handwriting text recognition (HTR) could be used as a more natural way of interaction in many interactive applications. However, current HTR technology is far from developing error-free systems and, consequently, its use in many applications is limited. Despite this, there are many scenarios, as in the correction of the errors of fully-automatic systems using HTR in a post-editing step, in which the information from the specific task allows to constrain the search and therefore to improve the HTR accuracy. For example, in machine translation (MT), the on-line HTR system can also be used to correct translation errors. The HTR can take advantage of information from the translation problem such as the source sentence that is translated, the portion of the translated sentence that has been supervised by the human, or the translation error to be amended. Empirical experimentation suggests that this is a valuable information to improve the robustness of the on-line HTR system achieving remarkable results. Graphical abstractThis work presents an e-pen enabled system where handwriting is used to amend the errors of a machine translation system. Handwriting recognition is performed in such a way that the contextual information (source, prefix, translation, and error) is integrated to improve the final recognition accuracy.Display Omitted HighlightsWe present a specific on-line HTR system for editing machine translation (MT) output.We leverage information from different sources in MT to constrain the HTR search.All the proposed systems outperform the baseline.The use of information from the translation models achieves remarkable results.Finally, we propose a system to amend HTR errors with a 75% typing effort reduction.

[1]  Hermann Ney,et al.  On the Estimation of 'Small' Probabilities by Leaving-One-Out , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Hermann Ney,et al.  Statistical Approaches to Computer-Assisted Translation , 2009, CL.

[3]  F. Casacuberta,et al.  Thot: a Toolkit To Train Phrase-based Statistical Translation Models , 2005, MTSUMMIT.

[4]  Wayne H. Ward,et al.  Speech recognition , 1997 .

[5]  Antonio L. Lagarda,et al.  An iterative multimodal framework for the transcription of handwritten historical documents , 2014, Pattern Recognit. Lett..

[6]  Isabelle Guyon,et al.  UNIPEN project of on-line data exchange and recognizer benchmarks , 1994, Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5).

[7]  Hermann Ney,et al.  Discriminative Training and Maximum Entropy Models for Statistical Machine Translation , 2002, ACL.

[8]  Robert L. Mercer,et al.  The Mathematics of Statistical Machine Translation: Parameter Estimation , 1993, CL.

[9]  Francisco Casacuberta,et al.  Online Learning for Interactive Statistical Machine Translation , 2010, NAACL.

[10]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[11]  Alejandro Héctor Toselli,et al.  Multimodal interactive transcription of text images , 2010, Pattern Recognit..

[12]  Mohamed Cheriet,et al.  Error handling approach using characterization and correction steps for handwritten document analysis , 2011, International Journal on Document Analysis and Recognition (IJDAR).

[13]  Moisés Pastor,et al.  iATROS: A SPEECH AND HANDWRITING RECOGNITION SYSTEM , 2008 .

[14]  Francisco Casacuberta,et al.  Computer-assisted translation using speech recognition , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[15]  Francisco Casacuberta,et al.  Multimodal Interactive Pattern Recognition and Applications , 2011 .

[16]  Alejandro Héctor Toselli,et al.  Writing speed normalization for on-line handwritten text recognition , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[17]  Venu Govindaraju,et al.  Phrase-based correction model for improving handwriting recognition accuracies , 2009, Pattern Recognit..

[18]  Francisco Casacuberta,et al.  Improving On-line Handwritten Recognition using Translation Models in Multimodal Interactive Machine Translation , 2011, ACL.

[19]  Lambert Schomaker,et al.  Online Handwriting Recognition , 1999 .

[20]  Germán Sanchis-Trilles,et al.  Improving Interactive Machine Translation via Mouse Actions , 2008, EMNLP.

[21]  Hermann Ney,et al.  Confidence measures for large vocabulary continuous speech recognition , 2001, IEEE Trans. Speech Audio Process..

[22]  Pascual Martínez-Gómez,et al.  On multimodal interactive machine translation using speech recognition , 2011, ICMI '11.

[23]  William M. Campbell,et al.  Discriminative Keyword Selection Using Support Vector Machines , 2007, NIPS.

[24]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[25]  Alexander H. Waibel,et al.  Online handwriting recognition: the NPen++ recognizer , 2001, International Journal on Document Analysis and Recognition.

[26]  Jungpil Shin,et al.  A statistical handwriting model for style-preserving and variable character synthesis , 2012, International Journal on Document Analysis and Recognition (IJDAR).

[27]  Hermann Ney,et al.  Phrase-Based Statistical Machine Translation , 2002, KI.

[28]  Barry Haddow,et al.  Interactive Assistance to Human Translators using Statistical Machine Translation Methods , 2009, MTSUMMIT.

[29]  Hermann Ney,et al.  Generation of Word Graphs in Statistical Machine Translation , 2002, EMNLP.

[30]  Rohit Prasad,et al.  Statistical Machine Translation as a Language Model for Handwriting Recognition , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[31]  Hermann Ney,et al.  Integration of Speech Recognition and Machine Translation in Computer-Assisted Translation , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[32]  Philip Koehn,et al.  Statistical Machine Translation , 2010, EAMT.

[33]  Francisco Casacuberta,et al.  Multimodal interactive machine translation , 2010, ICMI-MLMI '10.

[34]  Hermann Ney,et al.  Improved backing-off for M-gram language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[35]  Antonio L. Lagarda,et al.  A Multimodal Approach to Dictation of Handwritten Historical Documents , 2011, INTERSPEECH.

[36]  Pierre Isabelle,et al.  Target-Text Mediated Interactive Machine Translation , 2004, Machine Translation.

[37]  Slava M. Katz,et al.  Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[38]  Frederick Jelinek,et al.  Statistical methods for speech recognition , 1997 .