Bi-modal Handwritten Text Recognition (BiHTR) ICPR 2010 Contest Report

Handwritten text is generally captured through two main modalities: off-line and on-line. Each modality has advantages and disadvantages, but it seems clear that smart approaches to handwritten text recognition (HTR) should make use of both modalities in order to take advantage of the positive aspects of each one. A particularly interesting case where the need of this bi-modal processing arises is when an off-line text, written by some writer, is considered along with the online modality of the same text written by another writer. This happens, for example, in computer-assisted transcription of old documents, where on-line text can be used to interactively correct errors made by a main off-line HTR system. In order to develop adequate techniques to deal with this challenging bi-modal HTR recognition task, a suitable corpus is needed. We have collected such a corpus using data (word segments) from the publicly available off-line and on-line IAM data sets. In order to provide the Community with an useful corpus to make easy tests, and to establish baseline performance figures, we have proposed this handwritten bi-modal contest. Here is reported the results of the contest with two participants, one of them achieved a 0% classification error rate, whilst the other participant achieved an interesting 1.5%.

[1]  Horst Bunke,et al.  A full English sentence database for off-line handwriting recognition , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[2]  Alejandro Héctor Toselli,et al.  Computer Assisted Transcription of Handwritten Text Images , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[3]  Salvador España Boquera,et al.  Improving Offline Handwritten Text Recognition with Hybrid HMM/ANN Models , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Alejandro Héctor Toselli,et al.  Computer Assisted Transcription of Text Images and Multimodal Interaction , 2008, MLMI.

[5]  Salvador España Boquera,et al.  Handwritten Text Normalization by using Local Extrema Classification , 2008, PRIS.

[6]  Marcus Liwicki,et al.  IAM-OnDB - an on-line English sentence database acquired from handwritten text on a whiteboard , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[7]  Michael Perrone,et al.  Combining online and offline handwriting recognition , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[8]  Alejandro Héctor Toselli,et al.  Writing speed normalization for on-line handwritten text recognition , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[9]  Alfons Juan-Císcar,et al.  Spontaneous handwriting recognition and classification , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[10]  Johansson. Stig,et al.  Manual of information to accompany the Lancaster-Oslo : Bergen Corpus of British English, for use with digital computers , 1978 .

[11]  Andrei Popescu-Belis,et al.  Machine Learning for Multimodal Interaction , 4th International Workshop, MLMI 2007, Brno, Czech Republic, June 28-30, 2007, Revised Selected Papers , 2008, MLMI.

[12]  Stefan Knerr,et al.  The IRESTE On/Off (IRONOFF) dual handwriting database , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[13]  Horst Bunke,et al.  The IAM-database: an English sentence database for offline handwriting recognition , 2002, International Journal on Document Analysis and Recognition.

[14]  Marcus Liwicki,et al.  Combining on-line and off-line bidirectional long short-term memory networks for handwritten text line recognition , 2008 .

[15]  Francisco Casacuberta,et al.  A Bi-modal Handwritten Text Corpus: Baseline Results , 2010, 2010 20th International Conference on Pattern Recognition.