Few-Shot Transfer Learning for Text Classification With Lightweight Word Embedding Based Models

Many deep learning architectures have been employed to model semantic compositionality in text sequences, but they require large amounts of supervised data to train their parameters, which makes them impractical when annotated samples are scarce or unavailable. In contrast to such data-hungry deep models, lightweight word embedding-based models can represent text sequences in a plug-and-play way owing to their parameter-free property. In this paper, a modified hierarchical pooling strategy over pre-trained word embeddings is proposed for text classification in a few-shot transfer learning setting. The model leverages and transfers knowledge obtained from source domains to recognize and classify unseen text sequences in the target domain with only a handful of support examples. Extensive experiments on five datasets covering both English and Chinese text demonstrate that simple word embedding-based models (SWEMs) with parameter-free pooling operations are able to abstract and represent the semantics of text. The proposed modified hierarchical pooling method achieves significantly better classification performance on few-shot transfer learning tasks than the alternative methods.
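The abstract describes hierarchical pooling over pre-trained word embeddings combined with support-example-based few-shot classification. Below is a minimal sketch of that general idea, not the paper's actual implementation: the `hierarchical_pool` and `classify_few_shot` functions, the window size, and the nearest-prototype (cosine similarity) classifier are all illustrative assumptions of how such a parameter-free pipeline could look.

```python
import numpy as np

def hierarchical_pool(word_vectors, window=3):
    """Hierarchical pooling sketch: average-pool over local windows of
    pre-trained word embeddings, then max-pool over the window averages.
    word_vectors: (seq_len, emb_dim) array of word embeddings."""
    seq_len, _ = word_vectors.shape
    if seq_len <= window:
        return word_vectors.mean(axis=0)
    # Local average pooling over sliding windows preserves some word order.
    window_means = np.stack([
        word_vectors[i:i + window].mean(axis=0)
        for i in range(seq_len - window + 1)
    ])
    # Global max pooling over the window-level features.
    return window_means.max(axis=0)

def classify_few_shot(query_vecs, support_vecs, support_labels):
    """Illustrative nearest-prototype classifier (an assumption, not the
    paper's exact transfer procedure): each class prototype is the mean of
    its pooled support representations; a query text is assigned to the
    prototype with the highest cosine similarity."""
    classes = sorted(set(support_labels))
    prototypes = np.stack([
        np.mean([v for v, y in zip(support_vecs, support_labels) if y == c], axis=0)
        for c in classes
    ])
    q = query_vecs / np.linalg.norm(query_vecs, axis=1, keepdims=True)
    p = prototypes / np.linalg.norm(prototypes, axis=1, keepdims=True)
    return [classes[i] for i in (q @ p.T).argmax(axis=1)]
```

Because both steps are parameter-free, the only learned components are the pre-trained embeddings themselves, which is what allows the representation to be reused across source and target domains with only a few support examples.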
