BertGCN: Transductive Text Classification by Combining GNN and BERT

In this work, we propose BertGCN, a model that combines large-scale pretraining and transductive learning for text classification. BertGCN constructs a heterogeneous graph over the dataset and represents documents as nodes using BERT representations. By jointly training the BERT and GCN modules within BertGCN, the proposed model is able to leverage the advantages of both worlds: large-scale pretraining, which takes advantage of massive amounts of raw data, and transductive learning, which jointly learns representations for both training data and unlabeled test data by propagating label influence through graph convolution. Experiments show that BertGCN achieves state-of-the-art (SOTA) performance on a wide range of text classification datasets.
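The joint prediction described above can be sketched in a few lines. This is a minimal, hedged illustration, not the authors' implementation: document node features `X` stand in for frozen BERT [CLS] embeddings, the graph is propagated through two symmetrically normalized GCN layers, and the GCN and BERT-head predictions are interpolated with an assumed mixing weight `lam` (the function names, weight matrices, and the specific interpolation coefficient are all illustrative assumptions).

```python
import numpy as np

def normalize_adj(A):
    # Symmetric normalization with self-loops: D^{-1/2} (A + I) D^{-1/2}
    A_hat = A + np.eye(A.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    return A_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def softmax(x):
    e = np.exp(x - x.max(axis=1, keepdims=True))
    return e / e.sum(axis=1, keepdims=True)

def bertgcn_forward(A, X, W1, W2, W_bert, lam=0.7):
    """Hedged sketch of a BertGCN-style prediction.

    A: (n, n) document/word graph adjacency; X: (n, d) node features
    (standing in for BERT embeddings of document nodes); W1, W2: GCN
    layer weights; W_bert: linear classification head over X; lam:
    assumed interpolation weight between the two predictions.
    """
    A_norm = normalize_adj(A)
    H = np.maximum(A_norm @ X @ W1, 0.0)   # GCN layer 1 + ReLU
    logits_gcn = A_norm @ H @ W2           # GCN layer 2
    logits_bert = X @ W_bert               # BERT classification head
    # Interpolate the two probability distributions per node.
    return lam * softmax(logits_gcn) + (1 - lam) * softmax(logits_bert)
```

In the full model the BERT encoder is trained jointly with the GCN, so gradients flow into both modules; the sketch above only shows the forward interpolation on fixed features.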
