Bayesian Testing for Exogenous Partition Structures in Stochastic Block Models

Network data often exhibit block structures characterized by clusters of nodes with similar patterns of edge formation. When such relational data are complemented by additional information on exogenous node partitions, these sources of knowledge are typically included in the model to supervise the cluster assignment mechanism or to improve inference on edge probabilities. Although these solutions are routinely implemented, there is a lack of formal approaches to test if a given external node partition is in line with the endogenous clustering structure encoding stochastic equivalence patterns among the nodes in the network. To fill this gap, we develop a formal Bayesian testing procedure which relies on the calculation of the Bayes factor between a stochastic block model with known grouping structure defined by the exogenous node partition and an infinite relational model that allows the endogenous clustering configurations to be unknown, random and fully revealed by the block-connectivity patterns in the network. A simple Markov chain Monte Carlo method for computing the Bayes factor and quantifying uncertainty in the endogenous groups is proposed. This routine is evaluated in simulations and in an application to study exogenous equivalence structures in brain networks of Alzheimer's patients.

[1]  Tracy M. Sweet,et al.  Incorporating Covariates Into Stochastic Blockmodels , 2015 .

[2]  Francis Comets,et al.  École d'été de probabilités de Saint-Flour XLVI , 2017 .

[3]  Christian Tronstad,et al.  Evaluation of Hypoglycaemia with Non-Invasive Sensors in People with Type 1 Diabetes and Impaired Awareness of Hypoglycaemia , 2018, Scientific Reports.

[4]  A. Simmons,et al.  Stability of graph theoretical measures in structural brain networks in Alzheimer’s disease , 2018, Scientific Reports.

[5]  C. Stam Modern network science of neurological disorders , 2014, Nature Reviews Neuroscience.

[6]  Emmanuel Abbe,et al.  Community detection and stochastic block models: recent developments , 2017, Found. Trends Commun. Inf. Theory.

[7]  Ji Zhu,et al.  Consistency of community detection in networks under degree-corrected stochastic block models , 2011, 1110.3854.

[8]  Leto Peel,et al.  The ground truth about metadata and community detection in networks , 2016, Science Advances.

[9]  Anna Pajor,et al.  Estimating the Marginal Likelihood Using the Arithmetic Mean Identity , 2017 .

[10]  Morten Mørup,et al.  Bayesian Community Detection , 2012, Neural Computation.

[11]  Norbert Schuff,et al.  Imaging the Alzheimer brain. , 2011, Journal of Alzheimer's disease : JAD.

[12]  Morten Mørup,et al.  Nonparametric Bayesian modeling of complex networks: an introduction , 2013, IEEE Signal Processing Magazine.

[13]  Majnu John,et al.  Graph analysis of structural brain networks in Alzheimer’s disease: beyond small world properties , 2016, Brain Structure and Function.

[14]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[15]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[16]  T. Snijders,et al.  Estimation and Prediction for Stochastic Blockstructures , 2001 .

[17]  M Cieplak 蛋白質の折りたたみにおける協調性と接触秩序 | 文献情報 | J-GLOBAL 科学技術総合リンクセンター , 2004 .

[18]  M. Newton Approximate Bayesian-inference With the Weighted Likelihood Bootstrap , 1994 .

[19]  P. Pin,et al.  Assessing the relevance of node features for network structure , 2008, Proceedings of the National Academy of Sciences.

[20]  St'ephane Robin,et al.  Uncovering latent structure in valued graphs: A variational approach , 2010, 1011.1813.

[21]  Samuel J. Gershman,et al.  A Tutorial on Bayesian Nonparametric Models , 2011, 1106.2697.

[22]  Mark E. J. Newman,et al.  Stochastic blockmodels and community structure in networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  Jonathan M. Mudge,et al.  Evidence for Transcript Networks Composed of Chimeric RNAs in Human Cells , 2012, PloS one.

[24]  Marc Niethammer,et al.  Stochastic block models with multiple continuous attributes , 2018, Applied Network Science.

[25]  Michael Weiner,et al.  Breakdown of Brain Connectivity Between Normal Aging and Alzheimer's Disease: A Structural k-Core Network Analysis , 2013, Brain Connect..

[26]  Edoardo M. Airoldi,et al.  Mixed Membership Stochastic Blockmodels , 2007, NIPS.

[27]  Zoubin Ghahramani,et al.  Bayesian Cluster Analysis: Point Estimation and Credible Balls (with Discussion) , 2015, Bayesian Analysis.

[28]  Peter J. Lenk,et al.  Simulation Pseudo-Bias Correction to the Harmonic Mean Estimator of Integrated Likelihoods , 2009 .

[29]  Mark E. J. Newman,et al.  Structure and inference in annotated networks , 2015, Nature Communications.

[30]  Thomas L. Griffiths,et al.  Learning Systems of Concepts with an Infinite Relational Model , 2006, AAAI.

[31]  Mikkel N. Schmidt,et al.  Nonparametric Bayesian modeling of complex networks: an introduction , 2013, IEEE Signal Processing Magazine.

[32]  Tobias Hecking,et al.  Positional analysis in cross-media information diffusion networks , 2019, Applied Network Science.

[33]  Ash A. Alizadeh,et al.  Corrigendum: Circulating tumour DNA profiling reveals heterogeneity of EGFR inhibitor resistance mechanisms in lung cancer patients , 2016, Nature Communications.

[34]  Thomas Brendan Murphy,et al.  Mixed-membership of experts stochastic blockmodel , 2014, Network Science.

[35]  D. Aldous Exchangeability and related topics , 1985 .

[36]  Anders M. Dale,et al.  An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest , 2006, NeuroImage.

[37]  M. Newton,et al.  Estimating the Integrated Likelihood via Posterior Simulation Using the Harmonic Mean Identity , 2006 .

[38]  Daniele Durante,et al.  Extended Stochastic Block Models , 2020 .

[39]  Anirban Bhattacharya,et al.  Posterior Contraction Rates for Stochastic Block Models , 2019, Sankhya A.

[40]  Radford M. Neal Markov Chain Sampling Methods for Dirichlet Process Mixture Models , 2000 .

[41]  Darren J. Wilkinson,et al.  A review of stochastic block models and extensions for graph clustering , 2019, Applied Network Science.

[42]  D. F. Saldana,et al.  How Many Communities Are There? , 2014, 1412.1684.

[43]  Chao Gao,et al.  Community Detection in Degree-Corrected Block Models , 2016, The Annals of Statistics.

[44]  George Michailidis,et al.  Likelihood Inference for Large Scale Stochastic Blockmodels With Covariates Based on a Divide-and-Conquer Parallelizable Algorithm With Communication , 2016, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[45]  Santo Fortunato,et al.  Community detection in networks: A user guide , 2016, ArXiv.

[46]  Mohammad Khansari,et al.  Predicting brain network changes in Alzheimer's disease with link prediction algorithms. , 2017, Molecular bioSystems.

[47]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[48]  Olaf Sporns,et al.  Weighted Stochastic Block Models of the Human Connectome across the Life Span , 2018, Scientific Reports.

[49]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[50]  Anirban Bhattacharya,et al.  Probabilistic Community Detection With Unknown Number of Communities , 2016, Journal of the American Statistical Association.

[51]  Christian Tallberg A BAYESIAN APPROACH TO MODELING STOCHASTIC BLOCKSTRUCTURES WITH COVARIATES , 2004 .

[52]  A. V. D. Vaart,et al.  Bayesian Community Detection , 2016, Bayesian Analysis.

[53]  E. William Yund,et al.  Hemispherically-Unified Surface Maps of Human Cerebral Cortex: Reliability and Hemispheric Asymmetries , 2012, PloS one.

[54]  Edoardo M. Airoldi,et al.  Stochastic blockmodels with growing number of classes , 2010, Biometrika.

[55]  O. Sporns Structure and function of complex brain networks , 2013, Dialogues in clinical neuroscience.