CNN Based Transfer Learning for Historical Chinese Character Recognition

Historical Chinese character recognition has been suffering from the problem of lacking sufficient labeled training samples. A transfer learning method based on Convolutional Neural Network (CNN) for historical Chinese character recognition is proposed in this paper. A CNN model L is trained by printed Chinese character samples in the source domain. The network structure and weights of model L are used to initialize another CNN model T, which is regarded as the feature extractor and classifier in the target domain. The model T is then fine-tuned by a few labeled historical or handwritten Chinese character samples, and used for final evaluation in the target domain. Several experiments regarding essential factors of the CNNbased transfer learning method are conducted, showing that the proposed method is effective.

[1]  Cheng-Lin Liu,et al.  Writer Adaptation with Style Transfer Mapping , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Ivan Laptev,et al.  Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Honggang Zhang,et al.  2009 10th International Conference on Document Analysis and Recognition HCL2000—A Large-scale Handwritten Chinese Character Database for Handwritten Character Recognition , 2022 .

[4]  Xin Li,et al.  An MQDF-CNN Hybrid Model for Offline Handwritten Chinese Character Recognition , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[5]  Liangrui Peng,et al.  Historical Chinese Character Recognition Method Based on Style Transfer Mapping , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[6]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[7]  Jürgen Schmidhuber,et al.  Transfer learning for Latin and Chinese characters with Deep Neural Networks , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[8]  Trevor Darrell,et al.  Deep Domain Confusion: Maximizing for Domain Invariance , 2014, CVPR 2014.

[9]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[10]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[11]  Michael I. Jordan,et al.  Learning Transferable Features with Deep Adaptation Networks , 2015, ICML.

[12]  Liangrui Peng,et al.  Gaussian process style transfer mapping for historical Chinese character recognition , 2015, Electronic Imaging.