Automatic Chinese Postal Address Block Location Using Proximity Descriptors and Cooperative Profit Random Forests

Locating the destination address block is key to automated sorting of mails. Due to the characteristics of Chinese envelopes used in mainland China, we here exploit proximity cues in order to describe the investigated regions on envelopes. We propose two proximity descriptors encoding spatial distributions of the connected components obtained from the binary envelope images. To locate the destination address block, these descriptors are used together with cooperative profit random forests (CPRFs). Experimental results show that the proposed proximity descriptors are superior to two component descriptors, which only exploit the shape characteristics of the individual components, and the CPRF classifier produces higher recall values than seven state-of-the-art classifiers. These promising results are due to the fact that the proposed descriptors encode the proximity characteristics of the binary envelope images, and the CPRF classifier uses an effective tree node split approach.

[1]  Shigeki Sugano,et al.  Human-robot-contact-state identification based on tactile recognition , 2005, IEEE Transactions on Industrial Electronics.

[2]  Junyu Dong,et al.  Segmentation of Chinese Postal Envelope Images for Address Block Location , 2009, ISVC.

[3]  R. Radha,et al.  A novel approach for Postal address block and Pincode segmentation , 2014 .

[4]  Chudong Tong,et al.  Decentralized Monitoring of Dynamic Processes Based on Dynamic Feature Selection and Informative Fault Pattern Dissimilarity , 2016, IEEE Transactions on Industrial Electronics.

[5]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[6]  Antonio Criminisi,et al.  Decision Forests: A Unified Framework for Classification, Regression, Density Estimation, Manifold Learning and Semi-Supervised Learning , 2012, Found. Trends Comput. Graph. Vis..

[7]  S. D. Yanowitz,et al.  A new method for image segmentation , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[8]  Norbert Wiener,et al.  Extrapolation, Interpolation, and Smoothing of Stationary Time Series , 1964 .

[9]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[10]  Jin Li,et al.  Using cooperative game theory to optimize the feature selection problem , 2012, Neurocomputing.

[11]  Eli Shechtman,et al.  Matching Local Self-Similarities across Images and Videos , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[12]  Díbio Leandro Borges,et al.  Segmentation of envelopes and address block location by salient features and hypothesis testing , 2007 .

[13]  Che-Seung Cho,et al.  Development of real-time vision-based fabric inspection system , 2005, IEEE Transactions on Industrial Electronics.

[14]  Ralf Steinmetz,et al.  Extraction of Address Data from Unstructured Text using Free Knowledge Resources , 2013, i-Know '13.

[15]  Anil K. Jain,et al.  Address block location on complex mail pieces , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[16]  Pedro M. Domingos,et al.  On the Optimality of the Simple Bayesian Classifier under Zero-One Loss , 1997, Machine Learning.

[17]  Anil K. Jain,et al.  Address block location on envelopes using Gabor filters , 1992, Pattern Recognit..

[18]  Chee Kheong Siew,et al.  Universal Approximation using Incremental Constructive Feedforward Networks with Random Hidden Nodes , 2006, IEEE Transactions on Neural Networks.

[19]  Jinhua Xu,et al.  Address block localization for Chinese postal envelopes with clutter background , 2014, 2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD).

[20]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[21]  Hanan Samet,et al.  A general approach to connected-component labeling for arbitrary image representations , 1992, JACM.

[22]  Xiang Bai,et al.  Scene text detection and recognition: recent advances and future trends , 2015, Frontiers of Computer Science.

[23]  Qian Huang,et al.  Improving Automatic Detection of Defects in Castings by Applying Wavelet Technique , 2006, IEEE Transactions on Industrial Electronics.

[24]  Xiang Bai,et al.  Symmetry-based text line detection in natural scenes , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Jacques Facon,et al.  Postal envelope segmentation by 2-D histogram clustering through watershed transform , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[26]  Mike J. Chantler,et al.  The Importance of Long-Range Interactions to Texture Similarity , 2013, CAIP.

[27]  Subhadip Basu,et al.  A novel framework for automatic sorting of postal documents with multi-script address blocks , 2010, Pattern Recognit..

[28]  L. S. Shapley,et al.  17. A Value for n-Person Games , 1953 .

[29]  Venu Govindaraju,et al.  Postal address block location by contour clustering , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[30]  Véronique Eglin,et al.  A New Pyramidal Approach for the Address Block Location Based on Hierarchical Graph Coloring , 2007, ICIAR.

[31]  Bryan Nousain,et al.  Wavelet Fingerprinting of Radio-Frequency Identification (RFID) Tags , 2012, IEEE Transactions on Industrial Electronics.

[32]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[33]  Rui Zhang,et al.  Location and interpretation of destination addresses on handwritten Chinese envelopes , 2001, Pattern Recognit. Lett..

[34]  Hiroshi Sako,et al.  Address-block extraction by Bayesian rule , 2004, ICPR 2004.

[35]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .

[36]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[37]  Jonathan R. M. Hosking,et al.  Partitioning Nominal Attributes in Decision Trees , 1999, Data Mining and Knowledge Discovery.

[38]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Michael Wooldridge,et al.  Computational Aspects of Cooperative Game Theory , 2011, KES-AMSTA.

[40]  Deyong You,et al.  WPD-PCA-Based Laser Welding Process Monitoring and Defects Diagnosis by Using FNN and SVM , 2015, IEEE Transactions on Industrial Electronics.

[41]  David S. Doermann,et al.  Text Detection and Recognition in Imagery: A Survey , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.