Multi-font printed Chinese character recognition using multi-pooling convolutional neural network

Although previous studies have achieved effective printed Chinese character recognition (PCCR) in the case a single font or a few different fonts, large scale multi-font PCCR remains a major challenge owing to the wide variety in the shape, layout, and grey-level distribution of single Chinese characters across different font styles. This paper applies multi-pooling and data augmentation with non-linear transformation to a convolutional neural network (CNN) for multi-font PCCR. We propose a multi-pooling layer on top of the final convolutional layer; this approach is found to be robust to spatial layout variations and deformations in multi-font printed Chinese characters. Experimental results show that multi-pooling significantly improves CNN performance. In addition, we adopt a distorted sample generation technique by applying non-linear warping functions along an original font image, which distorts the local density of image-based Chinese character strokes. We find that CNN performance is further boosted by the distorted samples technique. An input character image is transformed into four distorted images and the CNN learns the original image as well as the distorted samples to classify 3755 classes (level-1 set of GB2312-80) of printed Chinese characters in 280 widely varying fonts and 120 manually selected fonts. Outstanding recognition rates of 94.38% and 99.74% are achieved in the former and latter cases, respectively, which indicates the effectiveness of the proposed methods.

[1]  Jiřı́ Matas,et al.  Real-time scene text localization and recognition , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Ning Xu,et al.  Printed Chinese character recognition via the cooperative block neural networks , 1992, [1992] Proceedings of the IEEE International Symposium on Industrial Electronics.

[3]  Ching Y. Suen,et al.  Error-Correcting Output Coding for the Convolutional Neural Network for Optical Character Recognition , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[4]  Jian Sun,et al.  Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Jun Guo,et al.  Learning Pattern Generation for Handwritten Chinese Character using Pattern Transform Method with Cosine Function , 2006, 2006 International Conference on Machine Learning and Cybernetics.

[6]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[7]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[8]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[9]  Christof Koch,et al.  AdaBoost for Text Detection in Natural Scene , 2011, 2011 International Conference on Document Analysis and Recognition.

[10]  Jürgen Schmidhuber,et al.  Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Fei Yin,et al.  Online and offline handwritten Chinese character recognition: Benchmarking on new databases , 2013, Pattern Recognit..

[12]  Luca Maria Gambardella,et al.  Convolutional Neural Network Committees for Handwritten Character Classification , 2011, 2011 International Conference on Document Analysis and Recognition.

[13]  Ka-Chung Leung,et al.  Recognition of Handwritten Chinese Characters by Combining Regularization, Fisher's Discriminant and Distorted Sample Generation , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[14]  Gang Lv Recognition of Multi-Fontstyle Characters Based on Convolutional Neural Network , 2011, 2011 Fourth International Symposium on Computational Intelligence and Design.

[15]  Ning Wang Printed Chinese Character Recognition Based on Pixel Distribution Probability of Character Image , 2008, 2008 International Conference on Intelligent Information Hiding and Multimedia Signal Processing.

[16]  Fei Yin,et al.  ICDAR 2011 Chinese Handwriting Recognition Competition , 2011, 2011 International Conference on Document Analysis and Recognition.

[17]  Patrice Y. Simard,et al.  Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[18]  N.M. Memon,et al.  Recognition of printed Chinese characters by using Neural Network , 2006, 2006 IEEE International Multitopic Conference.