Document Image Recognition Based on Template Matching of Component Block Projections

Document Image Recognition (DIR), a very useful technique in office automation and digital library applications, is to find the most similar template for any input document image in a prestored template document image data set. Existing methods use both local features and global layout information. In this paper, we propose a novel algorithm based on the global matching of Component Block Projections (CBP), which are the concatenated directional projection vectors of the component blocks of a document image. Compared to those existing methods, CBP-based template-matching methods possess two major advantages: (1) The spatial relationship among the component blocks of a document image is better represented, hence a very high matching accuracy can be obtained even for a large template set and seriously distorted input images; and (2) the effective matching distance of each template and the triangle inequality are proposed to significantly reduce the computational cost. Our experimental results confirm these advantages and show that the CBP-based template-matching methods are very suitable for DIR applications.

[1]  James Lee Hafner,et al.  Efficient Color Histogram Indexing for Quadratic Form Distance Functions , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Majid Ahmadi,et al.  Document registration using projective geometry , 1997, IEEE Trans. Image Process..

[3]  Donald A. Adjeroh,et al.  On ratio-based color indexing , 2001, IEEE Trans. Image Process..

[4]  Andrew D. Bagdanov,et al.  Projection profile based skew estimation algorithm for JBIG compressed images , 1998, International Journal on Document Analysis and Recognition.

[5]  John C. Dalton,et al.  Hierarchical browsing and search of large image databases , 2000, IEEE Trans. Image Process..

[6]  Hans-Peter Kriegel,et al.  Multiple Similarity Queries: A Basic DBMS Operation for Mining in Metric Databases , 2001, IEEE Trans. Knowl. Data Eng..

[7]  Kuo-Chin Fan,et al.  Form document identification using line structure based features , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[8]  Jianying Hu,et al.  Document image layout comparison and classification , 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318).

[9]  Francesca Cesarini,et al.  INFORMys: A Flexible Invoice-Like Form-Reader System , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Q. Gan,et al.  SOCR 1.03: A handwritten data form producing and reading system , 2000 .

[11]  Jonathan J. Hull Document image similarity and equivalence detection , 1998, International Journal on Document Analysis and Recognition.

[12]  Daniel P. Lopresti,et al.  String techniques for detecting duplicates in document databases , 2000, International Journal on Document Analysis and Recognition.

[13]  Wan-Chi Siu,et al.  Document image matching based on component blocks , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[14]  Keinosuke Fukunaga,et al.  A Branch and Bound Algorithm for Computing k-Nearest Neighbors , 1975, IEEE Transactions on Computers.

[15]  Wan-Chi Siu,et al.  Document image template matching based on component block list , 2001, Pattern Recognit. Lett..

[16]  David S. Doermann,et al.  The detection of duplicates in document image databases , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[17]  Rung Ching Chen,et al.  The recognition of form documents based on three types of line segments , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[18]  Shigeyoshi Shimotsuji,et al.  Form identification based on cell structure , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[19]  Joachim M. Buhmann,et al.  Empirical evaluation of dissimilarity measures for color and texture , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[20]  Toyohide Watanabe,et al.  Layout Recognition of Multi-Kinds of Table-Form Documents , 1995, IEEE Trans. Pattern Anal. Mach. Intell..