Distributed Optimization of Graph Convolutional Network using Subgraph Variance

In recent years, Graph Convolutional Networks (GCNs) have achieved great success in learning from graph-structured data. As the number of graph nodes and edges keeps growing, training a GCN on a single processor can no longer meet the demands on time and memory, which has spurred research into distributed GCN training frameworks. However, existing distributed GCN training frameworks incur enormous communication costs between processors, since a multitude of dependent nodes and edges must be collected from other processors and transmitted for GCN training. To address this issue, we propose a Graph Augmentation based Distributed GCN framework (GAD). GAD has two main components: GAD-Partition and GAD-Optimizer. We first propose a graph augmentation-based partition (GAD-Partition) that divides the original graph into augmented subgraphs to reduce communication by selecting and storing as few significant nodes from other processors as possible while guaranteeing the accuracy of training. In addition, we design a subgraph variance-based importance calculation formula and propose a novel weighted global consensus method, collectively referred to as GAD-Optimizer. This optimizer adaptively reduces the importance of subgraphs with large variance, thereby mitigating the effect of the extra variance introduced by GAD-Partition on distributed GCN training. Extensive experiments on four large-scale real-world datasets demonstrate that, compared to state-of-the-art methods, our framework significantly reduces communication overhead (≈ 50%), improves the convergence speed of distributed GCN training (≈ 2×), and achieves a slight gain in accuracy (≈ 0.45%) while requiring minimal redundancy.
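
To make the variance-weighted consensus idea concrete, the sketch below shows one plausible way an optimizer could down-weight high-variance subgraphs when aggregating per-processor models. The inverse-variance weighting and the helper names (subgraph_weights, weighted_consensus) are illustrative assumptions, not the paper's exact GAD-Optimizer formula.

```python
import torch

def subgraph_weights(variances: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    """Down-weight high-variance subgraphs: weights are proportional to the
    inverse of each subgraph's estimated variance and normalized to sum to 1.
    (Assumed weighting scheme for illustration only.)"""
    inv = 1.0 / (variances + eps)
    return inv / inv.sum()

def weighted_consensus(local_params: list, weights: torch.Tensor) -> torch.Tensor:
    """Form a global model as a weighted average of per-processor parameter vectors."""
    stacked = torch.stack(local_params)            # (num_subgraphs, num_params)
    return (weights.unsqueeze(1) * stacked).sum(dim=0)

# Toy usage: three workers, each holding a flattened parameter vector and an
# estimate of its subgraph's variance (e.g. variance of its local gradients).
params = [torch.randn(10) for _ in range(3)]
variances = torch.tensor([0.2, 1.5, 0.4])          # worker 1 is the noisiest
w = subgraph_weights(variances)                    # worker 1 receives the smallest weight
global_params = weighted_consensus(params, w)
```

In this reading, subgraphs whose augmented structure injects more variance contribute less to the global consensus at each aggregation step, which matches the abstract's description of adaptively reducing the importance of high-variance subgraphs.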
