Recognition of Handwritten Characters in Chinese Legal Amounts by Stacked Autoencoders

Handwritten Characters Recognition has long been a tough problem in pattern recognition and machine learning. Some special tasks, such as automatic check preprocessing, require Handwritten Chinese Legal Amounts recognition as a prerequisite. Since we expect to utilize machine instead of human to process bank checks, the recognition rate in such task must reach a relatively high rate. This paper proposes to use deep learning, auto-encoder as an effective approach for obtaining hierarchical representations of Isolated Handwritten Chinese Legal Amounts. Experiments show such representations are highly abstractive and can be used in character recognition. Meanwhile, a novel way by combining multiple Neural Networks in doing the work is proposed which proves to be able to improve the recognition rate significantly. This method reports a 0.64% error rate on a large test set over 10,000 samples and outperforms traditional methods using hand-crafted features and convolutional neural network committees (another deep learning model), narrowing the gap to human performance.

[1]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[2]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[3]  Dong Liu,et al.  A Prototype System of Courtesy Amount Recognition for Chinese Bank Checks , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[4]  Ching Y. Suen,et al.  Recognition of unconstrained legal amounts handwritten on Chinese bank checks , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[5]  Marc'Aurelio Ranzato,et al.  Efficient Learning of Sparse Representations with an Energy-Based Model , 2006, NIPS.

[6]  Hermann Ney,et al.  Deformation Models for Image Recognition , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Paul C. K. Kwok,et al.  Segmentation and recognition of Chinese bank check amounts , 2001, International Journal on Document Analysis and Recognition.

[8]  Fei Yin,et al.  ICDAR 2013 Chinese Handwriting Recognition Competition , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[9]  Xia Shaowei,et al.  A Chinese bank check recognition system based on the fault tolerant technique , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[10]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[11]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[12]  Thomas Hofmann,et al.  Greedy Layer-Wise Training of Deep Networks , 2007 .

[13]  Jianfeng Gao,et al.  Scalable training of L1-regularized log-linear models , 2007, ICML '07.

[14]  Cheng-Lin Liu,et al.  Normalization-Cooperated Gradient Feature Extraction for Handwritten Character Recognition , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Balázs Kégl,et al.  Boosting products of base classifiers , 2009, ICML '09.

[16]  Shaoping Ma,et al.  Feature extraction by hierarchical overlapped elastic meshing for handwritten Chinese character recognition , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[17]  Luca Maria Gambardella,et al.  Convolutional Neural Network Committees for Handwritten Character Classification , 2011, 2011 International Conference on Document Analysis and Recognition.

[18]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[19]  Meng Shi,et al.  Handwritten numeral recognition using gradient and curvature of gray scale image , 2002, Pattern Recognit..

[20]  Shutao Li,et al.  Extraction of Filled-In Items from Chinese Bank Check Using Support Vector Machines , 2007, ISNN.

[21]  Yoshua. Bengio,et al.  Learning Deep Architectures for AI , 2007, Found. Trends Mach. Learn..