Faxed form identification using histogram of the Hough-space

Ordering and charging goods have been increasingly treated with faxed forms. Although the FAXOCR system for specified forms is used practically, the performance to unspecified forms is not enough, because of the effect of noise on the faxed forms during the facsimile transmission. The final target of this study is to construct a practical FAXOCR system for unspecified forms. As the first stage, an identification method for unspecified faxed forms is proposed in this paper. In our approach, character separation and position adjustment are performed in the Hough-Space as pre-processing. Then the form identification is carried out by using vote histogram in the Hough-space. The performance of the proposed technique is verified experimentally by using actual faxed forms.

[1]  S.W. Lam,et al.  Anatomy of a form reader , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[2]  Lawrence O'Gorman,et al.  The Document Spectrum for Page Layout Analysis , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  Yasuto Ishitani,et al.  Flexible and Robust Model Matching based on Association Graph for Form Image Understanding , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[4]  U. Bohnacker,et al.  Matching form lines based on a heuristic search , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[5]  Richard O. Duda,et al.  Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.