论文信息 - Self-supervised Learning of Orc-Bert Augmentor for Recognizing Few-Shot Oracle Characters

Self-supervised Learning of Orc-Bert Augmentor for Recognizing Few-Shot Oracle Characters

This paper studies the recognition of oracle character, the earliest known hieroglyphs in China. Essentially, oracle character recognition suffers from the problem of data limitation and imbalance. Recognizing the oracle characters of extremely limited samples, naturally, should be taken as the few-shot learning task. Different from the standard few-shot learning setting, our model has only access to large-scale unlabeled source Chinese characters and few labeled oracle characters. In such a setting, meta-based or metric-based few-shot methods are failed to be efficiently trained on source unlabeled data; and thus the only possible methodologies are self-supervised learning and data augmentation. Unfortunately, the conventional geometric augmentation always performs the same global transformations to all samples in pixel format, without considering the diversity of each part within a sample. Moreover, to the best of our knowledge, there is no effective self-supervised learning method for few-shot learning. To this end, this paper integrates the idea of self-supervised learning in data augmentation. And we propose a novel data augmentation approach, named Orc-Bert Augmentor pretrained by self-supervised learning, for few-shot oracle character recognition. Specifically, Orc-Bert Augmentor leverages a self-supervised BERT model pre-trained on large unlabeled Chinese characters datasets to generate sample-wise augmented samples. Given a masked input in vector format, Orc-Bert Augmentor can recover it and then output a pixel format image as augmented data. Different mask proportion brings diverse reconstructed output. Concatenated with Gaussian noise, the model further performs point-wise displacement to improve diversity. Experimentally, we collect two large-scale datasets of oracle characters and other Chinese ancient characters for few-shot oracle character recognition and Orc-Bert Augmentor pre-training. Extensive experiments on few-shot learning demonstrate the effectiveness of our Orc-Bert Augmentor on improving the performance of various networks in the few-shot oracle character recognition. ⋆ corresponding author.

[1] Hugo Larochelle,et al. Optimization as a Model for Few-Shot Learning , 2016, ICLR.

[2] C. V. Aravinda,et al. Oracle Bone Inscription Detector Based on SSD , 2019, ICIAP Workshops.

[3] Andrew Zisserman,et al. Reading Text in the Wild with Convolutional Neural Networks , 2014, International Journal of Computer Vision.

[4] Yoshua Bengio,et al. Online and offline handwritten Chinese character recognition: A comprehensive study and new benchmark , 2016, Pattern Recognit..

[5] Graham W. Taylor,et al. Dataset Augmentation in Feature Space , 2017, ICLR.

[6] Canjie Luo,et al. Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Fei Yin,et al. Handwritten Chinese character recognition with spatial transformer and deep residual networks , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[8] Heng Tao Shen,et al. Sequence-To-Sequence Domain Adaptation Network for Robust Text Image Recognition , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9] Hang Li,et al. Meta-SGD: Learning to Learn Quickly for Few Shot Learning , 2017, ArXiv.

[10] Hongyang Chao,et al. Building Hierarchical Representations for Oracle Character and Sketch Recognition , 2016, IEEE Transactions on Image Processing.

[11] Ming-Wei Chang,et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[12] Yanwei Fu,et al. Instance Credibility Inference for Few-Shot Learning , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[14] Lianwen Jin,et al. OBC306: A Large-Scale Oracle Bone Character Recognition Dataset , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[15] Wenju Liu,et al. Robust offline handwritten character recognition through exploring writer-independent features under the guidance of printed data , 2018, Pattern Recognit. Lett..

[16] Oriol Vinyals,et al. Matching Networks for One Shot Learning , 2016, NIPS.

[17] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Martial Hebert,et al. Image Deformation Meta-Networks for One-Shot Learning , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[19] Constantine Bekas,et al. BAGAN: Data Augmentation with Balancing GAN , 2018, ArXiv.

[20] Lin Meng,et al. Recognition of Oracle Bone Inscriptions by Extracting Line Features on Image Processing , 2017, ICPRAM.

[21] Cheng-Lin Liu,et al. Oracle Character Recognition by Nearest Neighbor Classification with Deep Metric Learning , 2019, 2019 International Conference on Document Analysis and Recognition (ICDAR).

[22] Joshua B. Tenenbaum,et al. Meta-Learning for Semi-Supervised Few-Shot Classification , 2018, ICLR.

[23] Partha Pratim Roy,et al. Handwriting Recognition in Low-Resource Scripts Using Adversarial Learning , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[24] Ankush Gupta,et al. Synthetic Data for Text Localisation in Natural Images , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Jing Xiong,et al. Oracle bone inscription detection: a survey of Oracle bone inscription detection based on deep learning algorithm , 2019, AIIPCC '19.

[26] Yoshua Bengio,et al. How transferable are features in deep neural networks? , 2014, NIPS.

[27] Peter Corcoran,et al. Smart Augmentation Learning an Optimal Data Augmentation Strategy , 2017, IEEE Access.

[28] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[29] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30] Nikos Komodakis,et al. Wide Residual Networks , 2016, BMVC.

[31] Amos J. Storkey,et al. Assume, Augment and Learn: Unsupervised Few-Shot Meta-Learning via Random Labels and Data Augmentation , 2019, ArXiv.

[32] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[33] Yu-Gang Jiang,et al. Sketch-BERT: Learning Sketch Bidirectional Encoder Representation From Transformers by Self-Supervised Learning of Sketch Gestalt , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[34] Tomas Pfister,et al. Learning from Simulated and Unsupervised Images through Adversarial Training , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35] Vincent Christlein,et al. Spatio-Temporal Handwriting Imitation , 2020, ECCV Workshops.

[36] Quoc V. Le,et al. AutoAugment: Learning Augmentation Strategies From Data , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[37] Liu Yi-feng,et al. Graphs,Words,and Meanings:Three Reference Works for Shang Oracle-Bone Studies , 2007 .

[38] Feng Gao,et al. Oracle-Bone Inscription Recognition Based on Deep Convolutional Neural Network , 2018, J. Comput..

[39] Nuno Vasconcelos,et al. Feature Space Transfer for Data Augmentation , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.