Address block localization based on graph theory

An efficient mail sorting system is mainly based on an accurate optical recognition of the addresses on the envelopes. However, the localizing of the address block (ABL) should be done before the OCR recognition process. The location step is very crucial as it has a great impact on the global performance of the system. Currently, a good localizing step leads to a better recognition rate. The limit of current methods is mainly caused by modular linear architectures used for ABL: their performances greatly depend on each independent module performance. We are presenting in this paper a new approach for ABL based on a pyramidal data organization and on a hierarchical graph coloring for classification process. This new approach presents the advantage to guarantee a good coherence between different modules and reduces both the computation time and the rejection rate. The proposed method gives a very satisfying rate of 98% of good locations on a set of 750 envelope images.

[1]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[2]  Sargur N. Srihari,et al.  Object recognition in visually complex environments: an architecture for locating address blocks on mail pieces , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[3]  C. Viard-Gaudin,et al.  A multi-resolution approach to extract the address block on flat mail pieces , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[4]  Jiangying Zhou,et al.  Page segmentation and classification , 1992, CVGIP Graph. Model. Image Process..

[5]  Olivier Déforges,et al.  A fast multiresolution text line and non text-line structures extraction and discrimination scheme for document image analysis , 1994, Proceedings of 1st International Conference on Image Processing.

[6]  Adnan Amin,et al.  Page segmentation and classification utilising a bottom-up approach , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[7]  Shin-Ywan Wang,et al.  Block selection: a method for segmenting a page image of various editing styles , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[8]  Matti Pietikäinen,et al.  Adaptive document binarization , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[9]  Anil K. Jain,et al.  Address block location on complex mail pieces , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[10]  Stefan Agne,et al.  Benchmarking of document page segmentation , 1999, Electronic Imaging.

[11]  Shahram Latifi,et al.  An Algorithm with Reduced Operations for Connected Components Detection in ITU-T Group 3/4 Coded Images , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Hamamache Kheddouci,et al.  The b-chromatic number of power graphs , 2003, Discret. Math. Theor. Comput. Sci..

[13]  David Menotti,et al.  Postal envelope address block location by fractal-based approach , 2004, Proceedings. 17th Brazilian Symposium on Computer Graphics and Image Processing.

[14]  Venu Govindaraju,et al.  Line separation for complex document images using fuzzy runlength , 2004, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings..

[15]  Yun-Seok Nam,et al.  Locating destination address block in Korean mail images , 2004, ICPR 2004.

[16]  Seung Ick Jang,et al.  Locating destination address block in Korean mail images , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[17]  Mario Valencia-Pabon,et al.  On Approximating the B-Chromatic Number , 2003, Discret. Appl. Math..

[18]  Véronique Eglin,et al.  Contribution to the Automatic Recognition of Business Documents , 2006 .

[19]  Hamamache Kheddouci,et al.  A Distributed Algorithm for a b-Coloring of a Graph , 2006, ISPA.

[20]  Mohand-Said Hacid,et al.  A New Clustering Approach for Symbolic Data: Algorithms and Application to Healthcare Data , 2006, BDA.