Page Segmentation for Historical Handwritten Document Images Using Conditional Random Fields

In this paper, we present a Conditional Random Field (CRF) model to deal with the problem of segmenting handwritten historical document images into different regions. We consider page segmentation as a pixel-labeling problem, i.e., each pixel is assigned to one of a set of labels. Features are learned from pixel intensity values with stacked convolutional autoencoders in an unsupervised manner. The features are used for the purpose of initial classification with a multilayer perceptron. Then a CRF model is introduced for modeling the local and contextual information jointly in order to improve the segmentation. For the purpose of decreasing the time complexity, we perform labeling at superpixel level. In the CRF model, graph nodes are represented by superpixels. The label of each pixel is determined by the label of the superpixel to which it belongs. Experiments on three public datasets demonstrate that, compared to previous methods, the proposed method achieves more accurate segmentation results and is much faster.

[1]  Yuan Qi,et al.  Contextual recognition of hand-drawn diagrams with conditional random fields , 2004, Ninth International Workshop on Frontiers in Handwriting Recognition.

[2]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Kozaburo Hachimura,et al.  Character segmentation and retrieval for learning support system of Japanese historical books , 2013, HIP '13.

[4]  Marcus Liwicki,et al.  Robust Text Line Segmentation for Historical Manuscript Images Using Color and Texture , 2014, 2014 22nd International Conference on Pattern Recognition.

[5]  Richard Szeliski,et al.  A Comparative Study of Energy Minimization Methods for Markov Random Fields , 2006, ECCV.

[6]  Michele Alberti,et al.  N-light-N Read The Friendly Manual , 2016 .

[7]  Masaki Nakagawa,et al.  Development of Nom character segmentation for collecting patterns from historical document pages , 2011, HIP '11.

[8]  Jihad El-Sana,et al.  Robust text and drawing segmentation algorithm for historical documents , 2013, HIP '13.

[9]  Andrew McCallum,et al.  Exploring the use of conditional random field models and HMMs for historical handwritten document recognition , 2006, Second International Conference on Document Image Analysis for Libraries (DIAL'06).

[10]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[11]  Jihad El-Sana,et al.  Layout Analysis for Arabic Historical Document Images Using Machine Learning , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[12]  Cheng-Lin Liu,et al.  Contextual text/non-text stroke classification in online handwritten notes with conditional random fields , 2014, Pattern Recognit..

[13]  Marcus Liwicki,et al.  Ground truth model, tool, and dataset for layout analysis of historical documents , 2015, Electronic Imaging.

[14]  Marcus Liwicki,et al.  Page Segmentation for Historical Handwritten Document Images Using Color and Texture Features , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[15]  Thierry Paquet,et al.  Document Image Segmentation Using a 2D Conditional Random Field Model , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[16]  Marcus Liwicki,et al.  Page segmentation of historical document images with convolutional autoencoders , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[17]  Venu Govindaraju,et al.  Text - image separation in Devanagari documents , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[18]  Marcus Liwicki,et al.  Page Segmentation for Historical Document Images Based on Superpixel Classification with Unsupervised Feature Learning , 2016, 2016 12th IAPR Workshop on Document Analysis Systems (DAS).

[19]  Martial Hebert,et al.  Discriminative random fields: a discriminative framework for contextual interaction in classification , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.