CSM neural network for degraded printed character optical recognition

Abstract In pattern recognition applications, the classification power of a system can be improved by combining several classifiers. Obviously performance of the system cannot be improved if the individual classifiers make all the same mistakes, thus it is important to use different features and different structures in the individual classifiers. In this context, we propose a two subnets neural network called CSM net. The first subnet, or similarity layer, is operating as a similarity measure neural network; it is based on the complementary similarity measure method (CSM). The second subnet is a competitive neural network (CNN) based on the winner takes all algorithm (WTA) that is used for the classification. In the proposed neural architecture, the statistical CSM method is analyzed, and implemented in the form of a feed forward neural network, it is named “similarity measure neural network” (SMNN). We show that the resulting SMNN synaptic weights are modified versions of the model patterns used in the training set, and that they can be considered as a memory network. We introduce a relative distance data calculated from the SMNN output, and we use it as a quality measurement tool of the degraded characters, what makes the SMNN classifier very powerful, and very well-suited for features rejections. This relative distance is used by the SMNN and compared to a first rejection threshold to accept, or reject, the incoming characters. In order to guarantee a higher recognition and reliability rates for the cascaded method, the SMNN is combined with a second subnet based on the WTA for classification using a second specific rejection threshold. These two submits combination (CSM net) boost the performance of the SMNN classifier. This is resulting in a robust multiple classifiers that can be used for setting the entire rejection threshold. The experimental results that we introduce are related to the proposed method, but the tests are introduced with various impulse noise levels, as well as the tests with broken and manually corrupted characters, and characters with various levels of additive Gaussian noise. The experiments show the effective ability of the model to yield relevant and robust recognition on poor quality printed checks, and show that the CSM net outperforms the previous works, both in efficiency and accuracy.

[1]  Tin Kam Ho,et al.  Enhancing degraded document images via bitmap clustering and averaging , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[2]  Abderrahmane Namane,et al.  Degraded dot matrix character recognition using CSM-based feature extraction , 2010, DocEng '10.

[3]  Abderrezak Guessoum,et al.  CSM-autossociators combination for degraded machine printed character recognition , 2007, 2007 9th International Symposium on Signal Processing and Its Applications.

[4]  Yingbai Yan,et al.  Postprocessing algorithm for the optical recognition of degraded characters , 1999, Electronic Imaging.

[5]  Patrick van der Smagt A Comparative Study of Neural Network Algorithms Applied to Optical Character Recognition , 1990, IEA/AIE.

[6]  Berrin A. Yanikoglu,et al.  Pitch-based segmentation and recognition of dot-matrix text , 2000, International Journal on Document Analysis and Recognition.

[7]  R. Lippmann,et al.  An introduction to computing with neural nets , 1987, IEEE ASSP Magazine.

[8]  Chew Lim Tan,et al.  Imaged Document Text Retrieval Without OCR , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Laurence Likforman-Sulem,et al.  Recognition of degraded characters using dynamic Bayesian networks , 2008, Pattern Recognit..

[10]  Anna Tonazzini,et al.  Analysis and recognition of highly degraded printed characters , 2003, Document Analysis and Recognition.

[11]  Ching Y. Suen,et al.  Error-Correcting Output Coding for the Convolutional Neural Network for Optical Character Recognition , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[12]  Amit Kumar Gupta,et al.  Analysis of Hamming Network and MAXNET of Neural Network Method in the String Recognition , 2011, 2011 International Conference on Communication Systems and Network Technologies.

[13]  Norihiro Hagita,et al.  Text-Line Extraction and Character Recognition of Document Headlines With Graphical Designs Using Complementary Similarity Measure , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Emmanuel Augustin,et al.  Industrial bank check processing: the A2iA CheckReaderTM , 2001, International Journal on Document Analysis and Recognition.

[15]  Seongjun Lee,et al.  Discrimination of Li-ion batteries based on Hamming network using discharging–charging voltage pattern recognition for improved state-of-charge estimation , 2011 .

[16]  Manish Kumar,et al.  Degraded character recognition based on gradient pattern , 2010, International Conference on Digital Image Processing.

[17]  Andreas Vrabl,et al.  Fast real-time recognition and quality inspection of printed characters via point correlation , 2001, IS&T/SPIE Electronic Imaging.

[18]  Sebastiano Impedovo,et al.  Automatic Bankcheck Processing: A New Engineered System , 1997, Int. J. Pattern Recognit. Artif. Intell..

[19]  Hadar I. Avi-Itzhak,et al.  High Accuracy Optical Character Recognition Using Neural Networks with Centroid Dithering , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  B. Kapralos,et al.  I An Introduction to Digital Image Processing , 2022 .

[21]  Abderrezak Guessoum,et al.  Hopfield-multilayer-perceptron serial combination for accurate degraded printed character recognition , 2006 .

[22]  Jérôme Darbon,et al.  Enhancement of historical printed document images by combining Total Variation regularization and Non-local Means filtering , 2011, Image Vis. Comput..

[23]  Giovanni Soda,et al.  A serial combination of connectionist-based classifiers for OCR , 2001, International Journal on Document Analysis and Recognition.

[24]  Manish Kumar,et al.  Manish Kumar , 2015 .

[25]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[26]  Mountassar Maamoun,et al.  CSM-based feature extraction for degraded machine printed character recognition , 2010, 2010 IEEE 9th International Conference on Cyberntic Intelligent Systems.