论文信息 - GraphMix: Regularized Training of Graph Neural Networks for Semi-Supervised Learning

GraphMix: Regularized Training of Graph Neural Networks for Semi-Supervised Learning

We present GraphMix, a regularization technique for Graph Neural Network based semi-supervised object classification, leveraging the recent advances in the regularization of classical deep neural networks. Specifically, we propose a unified approach in which we train a fully-connected network jointly with the graph neural network via parameter sharing, interpolation-based regularization, and self-predicted-targets. Our proposed method is architecture agnostic in the sense that it can be applied to any variant of graph neural networks which applies a parametric transformation to the features of the graph nodes. Despite its simplicity, with GraphMix we can consistently improve results and achieve or closely match state-of-the-art performance using even simpler architectures such as Graph Convolutional Networks, across three established graph benchmarks: the Cora, Citeseer and Pubmed citation network datasets, as well as three newly proposed datasets : Cora-Full, Co-author-CS and Co-author-Physics.

[1] Xiao-Ming Wu,et al. Deeper Insights into Graph Convolutional Networks for Semi-Supervised Learning , 2018, AAAI.

[2] Joan Bruna,et al. Spectral Networks and Locally Connected Networks on Graphs , 2013, ICLR.

[3] Jonathan Masci,et al. Geometric Deep Learning on Graphs and Manifolds Using Mixture Model CNNs , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Zhiyuan Liu,et al. Graph Neural Networks: A Review of Methods and Applications , 2018, AI Open.

[5] Ah Chung Tsoi,et al. The Graph Neural Network Model , 2009, IEEE Transactions on Neural Networks.

[6] Stephan Günnemann,et al. Deep Gaussian Embedding of Graphs: Unsupervised Inductive Learning via Ranking , 2017, ICLR.

[7] Ruslan Salakhutdinov,et al. Revisiting Semi-Supervised Learning with Graph Embeddings , 2016, ICML.

[8] Tatsuya Harada,et al. Between-Class Learning for Image Classification , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9] Jian Tang,et al. InfoGraph: Unsupervised and Semi-supervised Graph-Level Representation Learning via Mutual Information Maximization , 2019, ICLR.

[10] R Devon Hjelm,et al. On Adversarial Mixup Resynthesis , 2019, NeurIPS.

[11] Pietro Liò,et al. Deep Graph Infomax , 2018, ICLR.

[12] Mikhail Belkin,et al. Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[13] Wenwu Zhu,et al. Disentangled Graph Convolutional Networks , 2019, ICML.

[14] Dong-Hyun Lee,et al. Pseudo-Label : The Simple and Efficient Semi-Supervised Learning Method for Deep Neural Networks , 2013 .

[15] Pietro Liò,et al. Graph Attention Networks , 2017, ICLR.

[16] Sanjeev Khudanpur,et al. Audio augmentation for speech recognition , 2015, INTERSPEECH.

[17] Stephan Günnemann,et al. Pitfalls of Graph Neural Network Evaluation , 2018, ArXiv.

[18] Quoc V. Le,et al. SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition , 2019, INTERSPEECH.

[19] Ken-ichi Kawarabayashi,et al. Representation Learning on Graphs with Jumping Knowledge Networks , 2018, ICML.

[20] Zoubin Ghahramani,et al. Learning from labeled and unlabeled data with label propagation , 2002 .

[21] Alexander Zien,et al. Semi-Supervised Learning , 2006 .

[22] Geoffrey E. Hinton,et al. Visualizing Data using t-SNE , 2008 .

[23] Xavier Bresson,et al. Convolutional Neural Networks on Graphs with Fast Localized Spectral Filtering , 2016, NIPS.

[24] Quoc V. Le,et al. DropBlock: A regularization method for convolutional networks , 2018, NeurIPS.

[25] Ioannis Mitliagkas,et al. Manifold Mixup: Better Representations by Interpolating Hidden States , 2018, ICML.

[26] Christos Faloutsos,et al. Edge Weight Prediction in Weighted Signed Networks , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[27] Christos Faloutsos,et al. REV2: Fraudulent User Prediction in Rating Platforms , 2018, WSDM.

[28] Yoshua Bengio,et al. Semi-supervised Learning by Entropy Minimization , 2004, CAP.

[29] Steven Skiena,et al. DeepWalk: online learning of social representations , 2014, KDD.

[30] Jun Zhu,et al. Batch Virtual Adversarial Training for Graph Convolutional Networks , 2019, AI Open.

[31] Avrim Blum,et al. The Bottleneck , 2021, Monopsony Capitalism.

[32] Yoshua Bengio,et al. GMNN: Graph Markov Neural Networks , 2019, ICML.

[33] David Berthelot,et al. MixMatch: A Holistic Approach to Semi-Supervised Learning , 2019, NeurIPS.

[34] Jure Leskovec,et al. Inductive Representation Learning on Large Graphs , 2017, NIPS.

[35] Hossein Mobahi,et al. Deep Learning via Semi-supervised Embedding , 2012, Neural Networks: Tricks of the Trade.

[36] Jason Weston,et al. Deep learning via semi-supervised embedding , 2008, ICML '08.

[37] Shin Ishii,et al. Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38] Graham W. Taylor,et al. Improved Regularization of Convolutional Neural Networks with Cutout , 2017, ArXiv.

[39] Max Welling,et al. Semi-Supervised Classification with Graph Convolutional Networks , 2016, ICLR.

[40] Timo Aila,et al. Temporal Ensembling for Semi-Supervised Learning , 2016, ICLR.

[41] Zoubin Ghahramani,et al. Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[42] Hongyi Zhang,et al. mixup: Beyond Empirical Risk Minimization , 2017, ICLR.

[43] Lise Getoor,et al. Link-Based Classification , 2003, Encyclopedia of Machine Learning and Data Mining.

[44] Daniel Jurafsky,et al. Data Noising as Smoothing in Neural Network Language Models , 2017, ICLR.

[45] Jie Zhang,et al. Semi-supervised Learning on Graphs with Generative Adversarial Nets , 2018, CIKM.

[46] Max Welling,et al. Variational Graph Auto-Encoders , 2016, ArXiv.

[47] F. Scarselli,et al. A new model for learning in graph domains , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[48] Yoshua Bengio,et al. Interpolation Consistency Training for Semi-Supervised Learning , 2019, IJCAI.

[49] Tat-Seng Chua,et al. Graph Adversarial Training: Dynamically Regularizing Based on Graph Structure , 2019, IEEE Transactions on Knowledge and Data Engineering.

[50] Joan Bruna,et al. Deep Convolutional Networks on Graph-Structured Data , 2015, ArXiv.

[51] Shuiwang Ji,et al. Graph U-Nets , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[52] Samuel S. Schoenholz,et al. Neural Message Passing for Quantum Chemistry , 2017, ICML.

[53] David Berthelot,et al. Understanding and Improving Interpolation in Autoencoders via an Adversarial Regularizer , 2018, ICLR.

[54] Sina Honari,et al. Adversarial Mixup Resynthesizers , 2019, DGS@ICLR.

[55] Harri Valpola,et al. Weight-averaged consistency targets improve semi-supervised deep learning results , 2017, ArXiv.

[56] Ben Taskar,et al. Link Prediction in Relational Data , 2003, NIPS.