Graph-Bert: Only Attention is Needed for Learning Graph Representations

The dominant graph neural networks (GNNs) over-rely on graph links, and several serious performance problems arising from this reliance have already been observed, e.g., the suspended animation problem and the over-smoothing problem. Moreover, the inherently inter-connected nature of a graph precludes parallelization within it, which becomes critical for large graphs, as memory constraints limit batching across nodes. In this paper, we introduce a new graph neural network, namely GRAPH-BERT (Graph-based BERT), based solely on the attention mechanism, without any graph convolution or aggregation operators. Instead of feeding GRAPH-BERT the complete large input graph, we propose to train it on sampled linkless subgraphs drawn from nodes' local contexts. GRAPH-BERT can be learned effectively in a standalone mode; meanwhile, a pre-trained GRAPH-BERT can also be transferred to other application tasks directly, or with necessary fine-tuning when supervised label information or a particular application-oriented objective is available. We have tested the effectiveness of GRAPH-BERT on several graph benchmark datasets. Based on a GRAPH-BERT pre-trained with the node attribute reconstruction and structure recovery tasks, we further fine-tune GRAPH-BERT on node classification and graph clustering tasks. The experimental results demonstrate that GRAPH-BERT can outperform existing GNNs in both learning effectiveness and efficiency.
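
To make the linkless-subgraph idea concrete, the following is a minimal, illustrative PyTorch sketch rather than the authors' implementation: it selects a target node's top-k context nodes from a precomputed intimacy score vector (e.g., a personalized PageRank column) and encodes the target together with its sampled context using a plain transformer encoder, with no adjacency matrix or graph convolution involved. The class names, dimensions, and scoring scheme are assumptions made purely for illustration.

```python
import torch
import torch.nn as nn


class LinklessSubgraphEncoder(nn.Module):
    """Sketch of an attention-only node encoder over a sampled, linkless context.

    Assumptions (not taken from the paper text): the hidden size, layer count,
    and the use of torch.nn.TransformerEncoder as the attention stack.
    """

    def __init__(self, feat_dim: int, hidden_dim: int = 64,
                 num_layers: int = 2, num_heads: int = 4):
        super().__init__()
        self.input_proj = nn.Linear(feat_dim, hidden_dim)
        layer = nn.TransformerEncoderLayer(d_model=hidden_dim, nhead=num_heads,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=num_layers)

    def forward(self, context_feats: torch.Tensor) -> torch.Tensor:
        # context_feats: (batch, k + 1, feat_dim), target node first, then its
        # k sampled context nodes; no adjacency information is passed in.
        h = self.encoder(self.input_proj(context_feats))
        return h[:, 0]  # representation of the target node


def sample_context(scores: torch.Tensor, k: int) -> torch.Tensor:
    """Pick the k most 'intimate' context nodes for one target node.

    `scores` is a precomputed per-node intimacy vector (e.g. from personalized
    PageRank); the exact scoring scheme here is an illustrative assumption.
    """
    return torch.topk(scores, k).indices


# Toy usage: 100 nodes with 16-dim attributes, a 10-node context for node 0.
feats = torch.randn(100, 16)
scores = torch.rand(100)
scores[0] = float("-inf")                  # exclude the target node itself
idx = sample_context(scores, k=10)
context = torch.cat([feats[:1], feats[idx]], dim=0).unsqueeze(0)  # (1, 11, 16)
model = LinklessSubgraphEncoder(feat_dim=16)
print(model(context).shape)                # torch.Size([1, 64])
```

Because each target node only ever sees its fixed-size sampled context, batches of such subgraphs fit in memory regardless of the full graph's size, which is the parallelization benefit the abstract alludes to.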
