Paper Information - Inductive Representation Learning on Large Graphs

Low-dimensional embeddings of nodes in large graphs have proved extremely useful in a variety of prediction tasks, from content recommendation to identifying protein functions. However, most existing approaches require that all nodes in the graph are present during training of the embeddings; these previous approaches are inherently transductive and do not naturally generalize to unseen nodes. Here we present GraphSAGE, a general, inductive framework that leverages node feature information (e.g., text attributes) to efficiently generate node embeddings for previously unseen data. Instead of training individual embeddings for each node, we learn a function that generates embeddings by sampling and aggregating features from a node's local neighborhood. Our algorithm outperforms strong baselines on three inductive node-classification benchmarks: we classify the category of unseen nodes in evolving information graphs based on citation and Reddit post data, and we show that our algorithm generalizes to completely unseen graphs using a multi-graph dataset of protein-protein interactions.
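The sample-and-aggregate idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a plain mean aggregator with no learned weights or nonlinearity, uniform sampling with replacement, and simple concatenation of a node's own representation with the aggregated neighborhood. The names `sample_neighbors`, `mean_vector`, and `embed`, and the toy graph, are hypothetical and chosen only for illustration.

```python
import random


def sample_neighbors(graph, node, k, rng):
    """Draw k neighbors uniformly with replacement (assumption: an
    isolated node falls back to sampling itself)."""
    neighbors = graph.get(node, [])
    if not neighbors:
        return [node] * k
    return [rng.choice(neighbors) for _ in range(k)]


def mean_vector(vectors):
    """Element-wise mean of equally sized vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def embed(graph, features, node, depth, k, rng):
    """Build an embedding from a node's features and its sampled
    neighborhood, recursing `depth` hops outward."""
    if depth == 0:
        return list(features[node])
    own = embed(graph, features, node, depth - 1, k, rng)
    sampled = sample_neighbors(graph, node, k, rng)
    neigh = mean_vector(
        [embed(graph, features, n, depth - 1, k, rng) for n in sampled]
    )
    # Untrained stand-in: a learned model would apply a weight matrix
    # and a nonlinearity to this concatenation.
    return own + neigh


# Toy graph: node "d" stands for a node that was absent during training.
graph = {"a": ["b", "c"], "b": ["a"], "c": ["a"], "d": ["a"]}
features = {
    "a": [1.0, 0.0],
    "b": [0.0, 1.0],
    "c": [1.0, 1.0],
    "d": [0.0, 0.0],
}

embedding = embed(graph, features, "d", depth=2, k=3, rng=random.Random(0))
```

Because the embedding is computed from features and local structure rather than looked up in a per-node table, the same function applies to node "d" without retraining, which is the inductive property the abstract emphasizes.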

Jure Leskovec | William L. Hamilton | Zhitao Ying

[1] Duncan J. Watts, et al. Collective dynamics of ‘small-world’ networks, 1998, Nature.

[2] Jure Leskovec, et al. Diachronic Word Embeddings Reveal Statistical Laws of Semantic Change, 2016, ACL.

[3] Joan Bruna, et al. Spectral Networks and Locally Connected Networks on Graphs, 2013, ICLR.

[4] Steven Skiena, et al. DeepWalk: online learning of social representations, 2014, KDD.

[5] Mingzhe Wang, et al. LINE: Large-scale Information Network Embedding, 2015, WWW.

[6] Jun Zhu, et al. Stochastic Training of Graph Convolutional Networks, 2017, ICML 2018.

[7] Ruslan Salakhutdinov, et al. Revisiting Semi-Supervised Learning with Graph Embeddings, 2016, ICML.

[8] Philip S. Yu, et al. Embedding Identity and Interest for Social Networks, 2017, WWW.

[9] Le Song, et al. Stochastic Training of Graph Convolutional Networks with Variance Reduction, 2017, ICML.

[10] Omer Levy, et al. Neural Word Embedding as Implicit Matrix Factorization, 2014, NIPS.

[11] Pablo Tamayo, et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles, 2005, Proceedings of the National Academy of Sciences of the United States of America.

[12] Ah Chung Tsoi, et al. The Graph Neural Network Model, 2009, IEEE Transactions on Neural Networks.

[13] Michael I. Jordan, et al. On Spectral Clustering: Analysis and an algorithm, 2001, NIPS.

[14] Jimmy Ba, et al. Adam: A Method for Stochastic Optimization, 2014, ICLR.

[15] Max Welling, et al. Variational Graph Auto-Encoders, 2016, ArXiv.

[16] Petr Sojka, et al. Software Framework for Topic Modelling with Large Corpora, 2010.

[17] Alán Aspuru-Guzik, et al. Convolutional Networks on Graphs for Learning Molecular Fingerprints, 2015, NIPS.

[18] Jure Leskovec, et al. node2vec: Scalable Feature Learning for Networks, 2016, KDD.

[19] Max Welling, et al. Semi-Supervised Classification with Graph Convolutional Networks, 2016, ICLR.

[20] Mathias Niepert, et al. Learning Convolutional Neural Networks for Graphs, 2016, ICML.

[21] Leonidas J. Guibas, et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation, 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22] Martín Abadi, et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems, 2016, ArXiv.

[23] Le Song, et al. Discriminative Embeddings of Latent Variable Models for Structured Data, 2016, ICML.

[24] Jure Leskovec, et al. Predicting multicellular function through multi-layer tissue networks, 2017, Bioinform.

[25] Richard S. Zemel, et al. Gated Graph Sequence Neural Networks, 2015, ICLR.

[26] S. Siegel, et al. Nonparametric Statistics for the Behavioral Sciences, 2022, The SAGE Encyclopedia of Research Design.

[27] Rajeev Motwani, et al. The PageRank Citation Ranking: Bringing Order to the Web, 1999, WWW 1999.

[28] Qiongkai Xu, et al. GraRep: Learning Graph Representations with Global Structural Information, 2015, CIKM.

[29] Wenwu Zhu, et al. Structural Deep Network Embedding, 2016, KDD.

[30] Jian Sun, et al. Identity Mappings in Deep Residual Networks, 2016, ECCV.

[31] Jian Pei, et al. Community Preserving Network Embedding, 2017, AAAI.

[32] F. Scarselli, et al. A new model for learning in graph domains, 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005.

[33] Sanjeev Arora, et al. A Simple but Tough-to-Beat Baseline for Sentence Embeddings, 2017, ICLR.

[34] Gaël Varoquaux, et al. Scikit-learn: Machine Learning in Python, 2011, J. Mach. Learn. Res.

[35] Jeffrey Dean, et al. Distributed Representations of Words and Phrases and their Compositionality, 2013, NIPS.

[36] Xavier Bresson, et al. Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering, 2016, NIPS.

[37] Jürgen Schmidhuber, et al. Long Short-Term Memory, 1997, Neural Computation.

[38] Kurt Hornik, et al. Approximation capabilities of multilayer feedforward networks, 1991, Neural Networks.

[39] J. Kruskal. Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis, 1964.