An Interactive Approach with Off-Line and On-Line Handwritten Text Recognition Combination for Transcribing Historical Documents

Automatic transcription of historical documents is becoming an important research topic, specially because of the increasing number of digitised historical documents that libraries and archives are publishing. However, state-of-the-art handwritten text recognition systems are far from being perfect. Therefore, to have perfect transcriptions, human expert revision is required to really produce a transcription of standard quality. In this context, an interactive assistive scenario, where the automatic system and the human transcriber cooperate to generate the perfect transcription, would allow for a more effective approach. In this paper we present a multimodal interactive transcription system where user feedback is provided by means of touchscreen pen strokes, traditional keyboard and mouse operations. The combination of both the main and the feedback data stream is based on the use of Confusion Networks derived from the output of the on-line and off-line handwritten text recognition systems. The use of the proposed combination help to optimise overall performance and usability.

[1]  Alfons Juan-Císcar,et al.  Balancing error and supervision effort in interactive-predictive handwriting recognition , 2010, IUI '10.

[2]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[3]  Carlos D. Martínez-Hinarejos,et al.  Multimodal Output Combination for Transcribing Historical Handwritten Documents , 2015, CAIP.

[4]  Camino Vera Combining Handwriting and Speech Recognition for Transcribing Historical Handwritten Documents , 2015 .

[5]  Hiromitsu Nishizaki,et al.  Effect of Confusion Network Combination on Speech Recognition System for Editing , 2011 .

[6]  Andreas Stolcke,et al.  SRILM - an extensible language modeling toolkit , 2002, INTERSPEECH.

[7]  Jian Xue,et al.  Improved confusion network algorithm and shortest path search from word lattice , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[8]  Moisés Pastor,et al.  iATROS: A SPEECH AND HANDWRITING RECOGNITION SYSTEM , 2008 .

[9]  Alfons Juan-Císcar,et al.  The RODRIGO Database , 2010, LREC.

[10]  Alejandro Héctor Toselli,et al.  On-Line Handwriting Recognition System for Tamil Handwritten Characters , 2007, IbPRIA.

[11]  Alejandro Héctor Toselli Rossi,et al.  Multimodal Interactive Handwritten Text Transcription , 2012, Series in Machine Perception and Artificial Intelligence.

[12]  Sudeept Mohan,et al.  STATE , 2016, Intell. Serv. Robotics.

[13]  Verónica Romero,et al.  Interactive Off-Line Handwritten Text Transcription Using On-Line Handwritten Text as Feedback , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[14]  Volkmar Frinken,et al.  Automatic Transcription of Handwritten Medieval Documents , 2009, 2009 15th International Conference on Virtual Systems and Multimedia.

[15]  Hermann Ney,et al.  Integrated Handwriting Recognition And Interpretation Using Finite-State Models , 2004, Int. J. Pattern Recognit. Artif. Intell..

[16]  Juan Miguel Vilar,et al.  State,: an assisted document transcription system , 2009, ICMI-MLMI '09.

[17]  Hermann Ney,et al.  Improved backing-off for M-gram language modeling , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.