Creating an Atlas over Handwritten Script Signs

A framework for interactive visualization of script characteristics, as present in the form of handwritten letters, is proposed in this work. The basic idea behind this investigation is to lay the foundations for creating a comprehensive atlas over letter forms extracted from a large collection of handwritten documents, with minimal human guidance. The visualization of the results is based on the atlas metaphor and uses the t-SNE visualization method for creating island-like clusters that can be investigated using the proposed visualization framework. By changing a scale parameter one can investigate the dataset on different levels, i.e different sizes of the clusters.

[1]  Ernest Valveny,et al.  A Coarse-to-Fine Approach for Handwritten Word Spotting in Large Scale Historical Documents Collection , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[2]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[3]  Anders Hast,et al.  Automatic Document Image Binarization using Bayesian Optimization , 2017, HIP@ICDAR.

[4]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[5]  Anders Hast,et al.  On-the-fly Historical Handwritten Text Annotation , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).