Text Graph Transformer for Document Classification

Text classification is a fundamental problem in natural language processing. Recent studies have applied graph neural network (GNN) techniques to capture global word co-occurrence in a corpus. However, previous work does not scale to large corpora and ignores the heterogeneity of the text graph. To address these problems, we introduce a novel Transformer-based heterogeneous graph neural network, the Text Graph Transformer (TG-Transformer). Our model learns effective node representations by capturing both the structure and the heterogeneity of the text graph. We propose a mini-batch text graph sampling method that significantly reduces computation and memory costs, allowing the model to handle large corpora. Extensive experiments on several benchmark datasets demonstrate that TG-Transformer outperforms state-of-the-art approaches on the text classification task.
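The abstract does not spell out the mini-batch sampling step, but the core idea of sampling a small subgraph around each target document from a heterogeneous word-document graph can be illustrated concretely. The following is a minimal sketch, not the paper's actual algorithm: the toy corpus, the fixed fanouts, and every name (corpus, sample_subgraph, doc_to_words, word_to_docs) are hypothetical, introduced here only to show how such a sampler might draw a word-then-document neighborhood per target node.

    import random
    from collections import defaultdict

    # Hypothetical toy corpus: each document is a list of tokens.
    corpus = {
        "doc0": ["graph", "neural", "network"],
        "doc1": ["text", "graph", "classification"],
        "doc2": ["neural", "text", "model"],
    }

    # Build a heterogeneous word-document graph: document nodes link to
    # the words they contain; word nodes link back to the documents
    # containing them.
    doc_to_words = {d: set(ws) for d, ws in corpus.items()}
    word_to_docs = defaultdict(set)
    for d, ws in doc_to_words.items():
        for w in ws:
            word_to_docs[w].add(d)

    def sample_subgraph(doc_id, num_words=2, num_docs=2, seed=None):
        """Sample a small context subgraph around one document node:
        a few of its word neighbors, then a few other documents
        sharing each sampled word."""
        rng = random.Random(seed)
        words = rng.sample(sorted(doc_to_words[doc_id]),
                           min(num_words, len(doc_to_words[doc_id])))
        docs = set()
        for w in words:
            neighbors = sorted(word_to_docs[w] - {doc_id})
            docs.update(rng.sample(neighbors, min(num_docs, len(neighbors))))
        return {"target": doc_id, "words": words, "context_docs": sorted(docs)}

    # One mini-batch = sampled subgraphs for a batch of target documents;
    # each subgraph would then be fed to the Transformer-based GNN.
    batch = [sample_subgraph(d, seed=i) for i, d in enumerate(corpus)]
    for sg in batch:
        print(sg)

Capping the fanout per node (num_words, num_docs) is what bounds the per-batch computation and memory: each training step touches only a constant-size subgraph rather than the full corpus graph, which is the property the abstract credits for scalability.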
