Online and offline handwritten Chinese character recognition: A comprehensive study and new benchmark

Recent deep learning based methods have achieved the state-of-the-art performance for handwritten Chinese character recognition (HCCR) by learning discriminative representations directly from raw data. Nevertheless, we believe that the long-and-well investigated domain-specific knowledge should still help to boost the performance of HCCR. By integrating the traditional normalization-cooperated direction-decomposed feature map (directMap) with the deep convolutional neural network (convNet), we are able to obtain new highest accuracies for both online and offline HCCR on the ICDAR-2013 competition database. With this new framework, we can eliminate the needs for data augmentation and model ensemble, which are widely used in other systems to achieve their best results. This makes our framework to be efficient and effective for both training and testing. Furthermore, although directMap+convNet can achieve the best results and surpass human-level performance, we show that writer adaptation in this case is still effective. A new adaptation layer is proposed to reduce the mismatch between training and test data on a particular source layer. The adaptation process can be efficiently and effectively implemented in an unsupervised manner. By adding the adaptation layer into the pre-trained convNet, it can adapt to the new handwriting styles of particular writers, and the recognition accuracy can be further improved consistently and significantly. This paper gives an overview and comparison of recent deep learning based approaches for HCCR, and also sets new benchmarks for both online and offline HCCR.

[1]  Tianqi Chen,et al.  Empirical Evaluation of Rectified Activations in Convolutional Network , 2015, ArXiv.

[2]  Jun Sun,et al.  Handwritten Character Recognition by Alternately Trained Relaxation Convolutional Neural Network , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[3]  Trevor Darrell,et al.  One-Shot Adaptation of Supervised Deep Convolutional Models , 2013, ICLR.

[4]  Xiang Bai,et al.  An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[6]  Cheng-Lin Liu,et al.  High Accuracy Handwritten Chinese Character Recognition Using Quadratic Classifiers with Discriminative Feature Extraction , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[7]  Cheng-Lin Liu,et al.  Evaluation of weighted Fisher criteria for large category dimensionality reduction in application to Chinese handwriting recognition , 2013, Pattern Recognit..

[8]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Cheng-Lin Liu,et al.  Handwritten digit recognition: benchmarking of state-of-the-art techniques , 2003, Pattern Recognit..

[10]  Cheng-Lin Liu,et al.  Pseudo two-dimensional shape normalization methods for handwritten Chinese character recognition , 2005, Pattern Recognit..

[11]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[12]  Alex Graves,et al.  Supervised Sequence Labelling , 2012 .

[13]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[14]  Ching Y. Suen,et al.  The State of the Art in Online Handwriting Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Kazuhiko Yamamoto,et al.  On-line handwriting character recognition method with directional features and direction-change features , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[16]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[17]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[18]  Feng Tian,et al.  Handwritten Chinese/Japanese Text Recognition Using Semi-Markov Conditional Random Fields , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Zhuowen Tu,et al.  Deeply-Supervised Nets , 2014, AISTATS.

[20]  Lianwen Jin,et al.  An Investigation of Imaginary Stroke Techinique for Cursive Online Handwriting Chinese Character Recognition , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[21]  Lianwen Jin,et al.  Recognition confidence analysis of handwritten Chinese character with CNN , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[22]  Cheng-Lin Liu,et al.  Online Japanese Character Recognition Using Trajectory-Based Normalization and Direction Feature Extraction , 2006 .

[23]  Cheng-Lin Liu,et al.  Normalization-Cooperated Gradient Feature Extraction for Handwritten Character Recognition , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Lianwen Jin,et al.  High performance offline handwritten Chinese character recognition using GoogLeNet and directional feature maps , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[25]  Wentai Liu,et al.  Optical recognition of handwritten Chinese characters: Advances since 1980 , 1993, Pattern Recognit..

[26]  LiuCheng-Lin,et al.  Online Recognition of Chinese Characters , 2004 .

[27]  Lianwen Jin,et al.  DropSample: A New Training Method to Enhance Deep Convolutional Neural Networks for Large-Scale Unconstrained Handwritten Chinese Character Recognition , 2015, Pattern Recognit..

[28]  Alex Graves,et al.  Supervised Sequence Labelling with Recurrent Neural Networks , 2012, Studies in Computational Intelligence.

[29]  Liangrui Peng,et al.  Gaussian process style transfer mapping for historical Chinese character recognition , 2015, Electronic Imaging.

[30]  Hiroshi Sako,et al.  Effects of classifier structures and training regimes on integrated segmentation and recognition of handwritten numeral strings , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Cheng-Lin Liu,et al.  Writer Adaptation with Style Transfer Mapping , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Anil K. Jain,et al.  Writer Adaptation for Online Handwriting Recognition , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  Cheng-Lin Liu,et al.  Lexicon-Driven Segmentation and Recognition of Handwritten Character Strings for Japanese Address Reading , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  Zhen-Long Bai,et al.  A study on the use of 8-directional features for online handwritten Chinese character recognition , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[35]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[36]  Y. J. Liu,et al.  CHINESE CHARACTER RECOGNITION , 1990 .

[37]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[38]  Jürgen Schmidhuber,et al.  Multi-column deep neural networks for image classification , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[39]  Wei-Yun Yau,et al.  Multiview Face Detection and Registration Requiring Minimal Manual Intervention , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Lianwen Jin,et al.  Character-level Chinese Writer Identification using Path Signature Feature, DropStroke and Deep CNN , 2015, ArXiv.

[41]  Sargur N. Srihari,et al.  On-Line and Off-Line Handwriting Recognition: A Comprehensive Survey , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[42]  Hiroshi Sako,et al.  Handwritten Chinese character recognition: alternatives to nonlinear normalization , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[43]  LiuCheng-Lin,et al.  Online and offline handwritten Chinese character recognition , 2013 .

[44]  Xiaohui Xie,et al.  Handwritten Hangul recognition using deep convolutional neural networks , 2014, International Journal on Document Analysis and Recognition (IJDAR).

[45]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Xin Li,et al.  An MQDF-CNN Hybrid Model for Offline Handwritten Chinese Character Recognition , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[47]  Liangrui Peng,et al.  Historical Chinese Character Recognition Method Based on Style Transfer Mapping , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[48]  Benjamin Graham,et al.  Spatially-sparse convolutional neural networks , 2014, ArXiv.

[49]  Fei Yin,et al.  Handwritten Chinese Text Recognition by Integrating Multiple Contexts , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[51]  Lianwen Jin,et al.  Improved deep convolutional neural network for online handwritten Chinese character recognition using domain-specific knowledge , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[52]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[53]  Cheng-Lin Liu,et al.  Locally Smoothed Modified Quadratic Discriminant Function , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[54]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[55]  Luca Maria Gambardella,et al.  Flexible, High Performance Convolutional Neural Networks for Image Classification , 2011, IJCAI.

[56]  Andrew L. Maas Rectifier Nonlinearities Improve Neural Network Acoustic Models , 2013 .

[57]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[58]  Jérôme Louradour,et al.  Segmentation-free handwritten Chinese text recognition with LSTM-RNN , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[59]  Fei Yin,et al.  Online and offline handwritten Chinese character recognition: Benchmarking on new databases , 2013, Pattern Recognit..

[60]  Andrew Zisserman,et al.  Deep Features for Text Spotting , 2014, ECCV.

[61]  Baihua Xiao,et al.  Chinese character recognition: history, status and prospects , 2007, Frontiers of Computer Science in China.

[62]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[63]  Fumitaka Kimura,et al.  Modified Quadratic Discriminant Functions and the Application to Chinese Character Recognition , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[64]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[65]  Hiroshi Sako,et al.  Discriminative learning quadratic discriminant function for handwriting recognition , 2004, IEEE Transactions on Neural Networks.

[66]  Zong Chen,et al.  Handwritten Digits Recognition , 2018, IPCV.

[67]  Meng Wang,et al.  Recognition of Handwritten Characters in Chinese Legal Amounts by Stacked Autoencoders , 2014, 2014 22nd International Conference on Pattern Recognition.

[68]  Masaki Nakagawa,et al.  Evaluation of prototype learning algorithms for nearest-neighbor classifier in application to handwritten character recognition , 2001, Pattern Recognit..

[69]  Yoshua Bengio,et al.  LeRec: A NN/HMM Hybrid for On-Line Handwriting Recognition , 1995, Neural Computation.

[70]  Lianwen Jin,et al.  Chinese character-level writer identification using path signature feature, DropStroke and deep CNN , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[71]  Benjamin Graham,et al.  Sparse arrays of signatures for online character recognition , 2013, ArXiv.

[72]  Razvan Pascanu,et al.  Theano: new features and speed improvements , 2012, ArXiv.

[73]  J. Tsukumo,et al.  Classification of handprinted Chinese characters using nonlinear normalization and correlation methods , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[74]  Hiromichi Fujisawa,et al.  Forty years of research in character and document recognition - an industrial perspective , 2008, Pattern Recognit..

[75]  Pan He,et al.  Reading Scene Text in Deep Convolutional Sequences , 2015, AAAI.

[76]  Dai Ruwei,et al.  Chinese character recognition: history, status and prospects , 2007 .

[77]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[78]  Satoshi Naoi,et al.  Beyond human recognition: A CNN-based framework for handwritten character recognition , 2015, 2015 3rd IAPR Asian Conference on Pattern Recognition (ACPR).

[79]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[80]  T. Munich,et al.  Offline Handwriting Recognition with Multidimensional Recurrent Neural Networks , 2008, NIPS.

[81]  David A. Forsyth,et al.  Representation Learning , 2015, Computer.

[82]  Fei Yin,et al.  Chinese Handwriting Recognition Contest 2010 , 2010, 2010 Chinese Conference on Pattern Recognition (CCPR).

[83]  Dan Ciresan,et al.  Multi-Column Deep Neural Networks for offline handwritten Chinese character classification , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[84]  Frans C. A. Groen,et al.  The box-cox metric for nearest neighbour classification improvement , 1997, Pattern Recognit..

[85]  Masaki Nakagawa,et al.  'Online recognition of Chinese characters: the state-of-the-art , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[86]  M. Berthod,et al.  Automatic recognition of handprinted characters—The state of the art , 1980, Proceedings of the IEEE.

[87]  Fei Yin,et al.  ICDAR 2011 Chinese Handwriting Recognition Competition , 2011, 2011 International Conference on Document Analysis and Recognition.

[88]  Michael I. Jordan,et al.  Learning Transferable Features with Deep Adaptation Networks , 2015, ICML.

[89]  Jun Du,et al.  A discriminative linear regression approach to adaptation of multi-prototype based classifiers and its applications for Chinese OCR , 2013, Pattern Recognit..

[90]  Fei Yin,et al.  CASIA Online and Offline Chinese Handwriting Databases , 2011, 2011 International Conference on Document Analysis and Recognition.

[91]  George Nagy,et al.  Style consistent classification of isogenous patterns , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.