Learning to Sort Handwritten Text Lines in Reading Order through Estimated Binary Order Relations

Recent advances in Handwritten Text Recognition and Document Layout Analysis make it possible to extract information from digitized documents and make them accessible beyond the archive shelves. But the reading order of the elements in those documents still is an open problem that has to be solved in order to provide that information with the correct structure. Most of the studies on the reading order task are rule-base approaches that focus on printed documents, while less attention has been paid to handwritten text documents. In this work we propose a new approach to automatically determine the reading order of text lines in handwritten text documents. The task is approached as a sorting problem where the order-relation operator is learned directly from examples. We demonstrate the effectiveness of our method on three different datasets.

[1]  Lorenzo Quirós,et al.  Multi-Task Handwritten Document Layout Analysis , 2018, ArXiv.

[2]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[3]  Volkmar Frinken,et al.  HMM word graph based keyword spotting in handwritten document images , 2016, Inf. Sci..

[4]  Frédéric Kaplan,et al.  dhSegment: A Generic Deep-Learning Approach for Document Segmentation , 2018, 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[5]  Michelangelo Ceci,et al.  Machine Learning for Reading Order Detection in Document Image Understanding , 2008, Machine Learning in Document Analysis and Recognition.

[6]  Johannes Michael,et al.  A two-stage method for text line detection in historical documents , 2018, International Journal on Document Analysis and Recognition (IJDAR).

[7]  Théodore Bluche,et al.  Deep Neural Networks for Large Vocabulary Handwritten Text Recognition , 2015 .

[8]  Joan Puigcerver I Pérez,et al.  A Probabilistic Formulation of Keyword Spotting , 2018 .

[9]  Alejandro Héctor Toselli,et al.  Probabilistic Indexing and Search for Information Extraction on Handwritten German Parish Records , 2018, 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[10]  Alejandro Héctor Toselli,et al.  A set of benchmarks for Handwritten Text Recognition on historical documents , 2019, Pattern Recognit..

[11]  Enrique Vidal,et al.  Handwritten Text Recognition for Historical Documents , 2011 .

[12]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[13]  Alejandro Héctor Toselli,et al.  Making Two Vast Historical Manuscript Collections Searchable and Extracting Meaningful Textual Features Through Large-Scale Probabilistic Indexing , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[14]  Moisés Pastor Text baseline detection, a single page trained system , 2019, Pattern Recognit..

[15]  Hyeran Byun,et al.  Automatic generation of structured hyperdocuments from document images , 2002, Pattern Recognit..

[16]  Brian A. Davey,et al.  An Introduction to Lattices and Order , 1989 .

[17]  Sergei Vassilvitskii,et al.  Generalized distances between rankings , 2010, WWW '10.

[18]  M. Kendall A NEW MEASURE OF RANK CORRELATION , 1938 .

[19]  Thomas M. Breuel,et al.  High Performance Document Layout Analysis , 2003 .

[20]  Joel Nothman,et al.  Article Segmentation in Digitised Newspapers with a 2D Markov Model , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[21]  Jean-Luc Meunier,et al.  Versatile Layout Understanding via Conjugate Graph , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).