Pattern localization in historical document images via template matching

Template matching is a classical and essential step in many pattern recognition, object detection or video tracking systems. This paper aims at integrating and evaluating different template matching methods in the context of pattern spotting in historical document images — i.e. the search for occurrences of a given visual pattern in document images. Given a query image, our pattern spotting system first computes the similarity score between the query signature and the signatures of a few regions provided by a region proposal algorithm. The top ranked regions are then selected for further processing. Template Matching is then applied in the neighborhood of the selected regions to precisely locate and rank the candidate windows that maximize the matching score. This paper builds upon popular template matching approaches and provides a unified testing framework for historical document image pattern spotting. The experimentation offers an insight on how to choose the most promising techniques for historical document images. This paper also proposes an improvement over these standard template matching approaches to significantly increase the overall performance.

[1]  J. P. Lewis,et al.  Fast Template Matching , 2009 .

[2]  Caroline Petitjean,et al.  Segmentation-free pattern spotting in historical document images , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[3]  Alexander Sibiryakov,et al.  Fast and high-performance template matching method , 2011, CVPR 2011.

[4]  Yacov Hel-Or,et al.  Fast template matching in non-linear tone-mapped images , 2011, 2011 International Conference on Computer Vision.

[5]  William J. Christmas,et al.  Fast robust correlation , 2005, IEEE Transactions on Image Processing.

[6]  Haim Schweitzer,et al.  A near optimal acceptance-rejection algorithm for exact cross-correlation search , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[7]  Cordelia Schmid,et al.  Aggregating Local Image Descriptors into Compact Codes , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  S.-D. Wei,et al.  Robust and Efficient Image Alignment Based on Relative Gradient Matching , 2006, IEEE Transactions on Image Processing.

[9]  Caroline Petitjean,et al.  A scalable pattern spotting system for historical documents , 2016, Pattern Recognit..

[10]  Eamonn J. Keogh,et al.  Mother Fugger: Mining Historical Manuscripts with Local Color Patches , 2010, 2010 IEEE International Conference on Data Mining.

[11]  Philip H. S. Torr,et al.  BING: Binarized normed gradients for objectness estimation at 300fps , 2019, Computational Visual Media.

[12]  Laurent Heutte,et al.  Spot It! Finding Words and Patterns in Historical Documents , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[13]  Michael Werman,et al.  Asymmetric Correlation: A Noise Robust Similarity Measure for Template Matching , 2013, IEEE Transactions on Image Processing.

[14]  William T. Freeman,et al.  Best-Buddies Similarity for robust template matching , 2015, CVPR.

[15]  Michael Werman,et al.  Robust Real-Time Pattern Matching Using Bayesian Sequential Hypothesis Testing , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Ju Jang Lee,et al.  Fast and robust template matching algorithm in noisy image , 2007, 2007 International Conference on Control, Automation and Systems.

[17]  Eamonn J. Keogh,et al.  Establishing the provenance of historical manuscripts with a novel distance measure , 2013, Pattern Analysis and Applications.

[18]  Josep Lladós,et al.  Efficient segmentation-free keyword spotting in historical document collections , 2015, Pattern Recognit..

[19]  Federico Tombari,et al.  Performance Evaluation of Full Search Equivalent Pattern Matching Algorithms , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Shai Avidan,et al.  FasT-Match: Fast Affine Template Matching , 2013, CVPR.

[21]  Frédéric Jurie,et al.  Region Proposal for Pattern Spotting in Historical Document Images , 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[22]  Yong-Sheng Chen,et al.  Fast algorithm for robust template matching with M-estimators , 2003, IEEE Trans. Signal Process..

[23]  Jianguo Zhang,et al.  The PASCAL Visual Object Classes Challenge , 2006 .

[24]  Arif Mahmood,et al.  Exploiting Transitivity of Correlation for Fast Template Matching , 2010, IEEE Transactions on Image Processing.