Robust approach of address block localization in business mail by graph coloring

An efficient mail sorting system relies mainly on accurate optical character recognition (OCR) of the addresses on envelopes. Before recognition can run, however, the address block must be located. This localization step is crucial because it strongly affects the overall performance of the system: accurate localization leads to a better recognition rate. Current methods are limited by the modular, linear architectures they use for address block localization (ABL), in which overall performance depends on the performance of each independent module. In this paper we present a new approach to ABL based on hierarchical graph coloring and a pyramidal data organization. This approach has the advantage of ensuring good coherence between the different modules, which reduces both the computation time and the rejection rate. The proposed method correctly locates the address block in 98% of cases on a set of 750 envelope images.
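To illustrate the general idea of grouping envelope components by graph coloring, the following is a minimal sketch and not the method of the paper. It assumes that text components are already available as bounding boxes, links two boxes with an edge when they are too far apart to belong to the same block, and then applies a plain greedy coloring so that each color class forms one candidate block. The function names, the `max_gap` threshold, and the sample boxes are all hypothetical; the hierarchical coloring and the pyramidal organization described in the abstract are not reproduced here.

```python
from itertools import combinations

def box_gap(a, b):
    """Gap in pixels between two boxes (x0, y0, x1, y1); 0 if they overlap."""
    dx = max(a[0] - b[2], b[0] - a[2], 0)
    dy = max(a[1] - b[3], b[1] - a[3], 0)
    return max(dx, dy)

def build_dissimilarity_graph(boxes, max_gap):
    """Connect two boxes when their gap exceeds max_gap (an assumed threshold)."""
    adjacency = {i: set() for i in range(len(boxes))}
    for i, j in combinations(range(len(boxes)), 2):
        if box_gap(boxes[i], boxes[j]) > max_gap:
            adjacency[i].add(j)
            adjacency[j].add(i)
    return adjacency

def greedy_coloring(adjacency):
    """Give each vertex the smallest color not used by its colored neighbors."""
    colors = {}
    for v in sorted(adjacency):
        used = {colors[u] for u in adjacency[v] if u in colors}
        color = 0
        while color in used:
            color += 1
        colors[v] = color
    return colors

# Hypothetical text-line boxes: three lines in one block, two in another.
boxes = [
    (10, 10, 50, 20), (10, 24, 60, 34), (12, 38, 55, 48),
    (300, 200, 340, 210), (300, 214, 350, 224),
]
graph = build_dissimilarity_graph(boxes, max_gap=20)
colors = greedy_coloring(graph)

# Boxes sharing a color are never adjacent, so they are pairwise close.
blocks = {}
for index, color in colors.items():
    blocks.setdefault(color, []).append(index)
print(blocks)
```

Because adjacent vertices never share a color, every pair of boxes within a color class lies within `max_gap` of each other, which is what makes a color class usable as a candidate block. A greedy coloring does not guarantee the fewest possible classes, and its result depends on the vertex order.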
