Amortized Probabilistic Detection of Communities in Graphs

Learning community structures in graphs has broad applications across scientific domains. While graph neural networks (GNNs) have been successful in encoding graph structures, existing GNN-based methods for community detection are limited by requiring knowledge of the number of communities in advance, in addition to lacking a proper probabilistic formulation to handle uncertainty. We propose a simple framework for amortized community detection, which addresses both of these issues by combining the expressive power of GNNs with recent methods for amortized clustering. Our models consist of a graph representation backbone that extracts structural information and an amortized clustering network that naturally handles variable numbers of clusters. Both components combine into well-defined models of the posterior distribution of graph communities and are jointly optimized given labeled graphs. At inference time, the models yield parallel samples from the posterior of community labels, quantifying uncertainty in a principled way. We evaluate several models from our framework on synthetic and real datasets and demonstrate superior performance to previous methods. As a separate contribution, we extend recent amortized probabilistic clustering architectures by adding attention modules, which yield further improvements on community detection tasks.

[1]  Joan Bruna,et al.  Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges , 2021, ArXiv.

[2]  Chithrupa Ramesh,et al.  Attention-Based Clustering: Learning a Kernel from Context , 2020, ArXiv.

[3]  William L. Hamilton Graph Representation Learning , 2020, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[4]  Dino Ienco,et al.  Deep Multivariate Time Series Embedding Clustering via Attentive-Gated Autoencoder , 2020, PAKDD.

[5]  Graham W. Taylor,et al.  Learning Permutation Invariant Representations using Memory Networks , 2019, ECCV.

[6]  Ari Pakman,et al.  Neural Clustering Processes , 2019, ICML.

[7]  Yee Whye Teh,et al.  Deep Amortized Clustering , 2019, ArXiv.

[8]  Dongxiao He,et al.  Community Detection via Joint Graph Convolutional Network Embedding in Attribute Network , 2019, ICANN.

[9]  Jian Tang,et al.  vGraph: A Generative Model for Joint Community Detection and Node Representation Learning , 2019, NeurIPS.

[10]  Yee Whye Teh,et al.  Set Transformer , 2018, ICML.

[11]  Qiang Liu,et al.  A Survey of Clustering With Deep Learning: From the Perspective of Network Architecture , 2018, IEEE Access.

[12]  Daniel Cremers,et al.  Clustering with Deep Learning: Taxonomy and New Methods , 2018, ArXiv.

[13]  Xavier Bresson,et al.  Residual Gated Graph ConvNets , 2017, ArXiv.

[14]  Kevin Chen-Chuan Chang,et al.  Learning Community Embedding with Community Detection and Node Embedding on Graphs , 2017, CIKM.

[15]  Kilian Q. Weinberger,et al.  On Calibration of Modern Neural Networks , 2017, ICML.

[16]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[17]  Jure Leskovec,et al.  Inductive Representation Learning on Large Graphs , 2017, NIPS.

[18]  Joan Bruna,et al.  Supervised Community Detection with Line Graph Neural Networks , 2017, ICLR.

[19]  Emmanuel Abbe,et al.  Community Detection and Stochastic Block Models , 2017, Found. Trends Commun. Inf. Theory.

[20]  Alexander J. Smola,et al.  Deep Sets , 2017, 1703.06114.

[21]  Jian Pei,et al.  Community Preserving Network Embedding , 2017, AAAI.

[22]  Pierre Vandergheynst,et al.  Geometric Deep Learning: Going beyond Euclidean data , 2016, IEEE Signal Process. Mag..

[23]  Frank D. Wood,et al.  Inference Compilation and Universal Probabilistic Programming , 2016, AISTATS.

[24]  David M. Blei,et al.  Variational Inference: A Review for Statisticians , 2016, ArXiv.

[25]  Honglak Lee,et al.  Learning Structured Output Representation using Deep Conditional Generative Models , 2015, NIPS.

[26]  Emmanuel Abbe,et al.  Community Detection in General Stochastic Block models: Fundamental Limits and Efficient Algorithms for Recovery , 2015, 2015 IEEE 56th Annual Symposium on Foundations of Computer Science.

[27]  Emmanuel Abbe,et al.  Recovering Communities in the General Stochastic Block Model Without Knowing the Parameters , 2015, NIPS.

[28]  M. Cugmas,et al.  On comparing partitions , 2015 .

[29]  Elchanan Mossel,et al.  Consistency Thresholds for Binary Symmetric Block Models , 2014, ArXiv.

[30]  Jure Leskovec,et al.  {SNAP Datasets}: {Stanford} Large Network Dataset Collection , 2014 .

[31]  Emmanuel Abbe,et al.  Exact Recovery in the Stochastic Block Model , 2014, IEEE Transactions on Information Theory.

[32]  Jure Leskovec,et al.  Community Detection in Networks with Node Attributes , 2013, 2013 IEEE 13th International Conference on Data Mining.

[33]  Jure Leskovec,et al.  Defining and evaluating network communities based on ground-truth , 2012, Knowledge and Information Systems.

[34]  James Bailey,et al.  Information Theoretic Measures for Clusterings Comparison: Variants, Properties, Normalization and Correction for Chance , 2010, J. Mach. Learn. Res..

[35]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[36]  Thomas L. Griffiths,et al.  Learning Systems of Concepts with an Infinite Relational Model , 2006, AAAI.

[37]  Victor J. Rayward-Smith,et al.  Adapting k-means for supervised clustering , 2006, Applied Intelligence.

[38]  Thorsten Joachims,et al.  Supervised clustering with support vector machines , 2005, ICML.

[39]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[40]  Geoffrey J. McLachlan,et al.  Mixture models : inference and applications to clustering , 1989 .

[41]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[42]  Philip S. Yu,et al.  Algorithms for Estimating the Partition Function of Restricted Boltzmann Machines (Extended Abstract) , 2020 .

[43]  Noah D. Goodman,et al.  Amortized Inference in Probabilistic Reasoning , 2014, CogSci.

[44]  Ke-Lin Du,et al.  Clustering: A neural network approach , 2010, Neural Networks.

[45]  Hans-Peter Kriegel,et al.  Learning Infinite Hidden Relational Models , 2006 .