A central limit theorem for an omnibus embedding of multiple random graphs and implications for multiscale network inference

Performing statistical analyses on collections of graphs is of import to many disciplines, but principled, scalable methods for multi-sample graph inference are few. Here we describe an "omnibus" embedding in which multiple graphs on the same vertex set are jointly embedded into a single space with a distinct representation for each graph. We prove a central limit theorem for this embedding and demonstrate how it streamlines graph comparison, obviating the need for pairwise subspace alignments. The omnibus embedding achieves near-optimal inference accuracy when graphs arise from a common distribution and yet retains discriminatory power as a test procedure for the comparison of different graphs. Moreover, this joint embedding and the accompanying central limit theorem are important for answering multiscale graph inference questions, such as the identification of specific subgraphs or vertices responsible for similarity or difference across networks. We illustrate this with a pair of analyses of connectome data derived from dMRI and fMRI scans of human subjects. In particular, we show that this embedding allows the identification of specific brain regions associated with population-level differences. Finally, we sketch how the omnibus embedding can be used to address pressing open problems, both theoretical and practical, in multisample graph inference.

[1]  H. Hotelling The Generalization of Student’s Ratio , 1931 .

[2]  Timothy O. Laumann,et al.  Functional Network Organization of the Human Brain , 2011, Neuron.

[3]  Carey E. Priebe,et al.  Community Detection and Classification in Hierarchical Stochastic Blockmodels , 2015, IEEE Transactions on Network Science and Engineering.

[4]  J. Gower Generalized procrustes analysis , 1975 .

[5]  T. W. Anderson An Introduction to Multivariate Statistical Analysis , 1959 .

[6]  János Komlós,et al.  The eigenvalues of random symmetric matrices , 1981, Comb..

[7]  Carey E. Priebe,et al.  Limit theorems for eigenvectors of the normalized Laplacian for random graphs , 2016, The Annals of Statistics.

[8]  S. Chatterjee,et al.  Matrix estimation by Universal Singular Value Thresholding , 2012, 1212.1247.

[9]  K. Mardia,et al.  Statistical Shape Analysis , 1998 .

[10]  J. Gabrieli,et al.  Hyperactivity and hyperconnectivity of the default network in schizophrenia and in first-degree relatives of persons with schizophrenia , 2009, Proceedings of the National Academy of Sciences.

[11]  C. Priebe,et al.  Perfect Clustering for Stochastic Blockmodel Graphs via Adjacency Spectral Embedding , 2013, 1310.0532.

[12]  Peter D. Hoff,et al.  Latent Space Approaches to Social Network Analysis , 2002 .

[13]  Rex E. Jung,et al.  Multimodal Neuroimaging in Schizophrenia: Description and Dissemination , 2017, Neuroinformatics.

[14]  Joel A. Tropp,et al.  An Introduction to Matrix Concentration Inequalities , 2015, Found. Trends Mach. Learn..

[15]  Carey E. Priebe,et al.  A Consistent Adjacency Spectral Embedding for Stochastic Blockmodel Graphs , 2011, 1108.2228.

[16]  Mark E. J. Newman,et al.  Stochastic blockmodels and community structure in networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[17]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[18]  Patrick J. Wolfe,et al.  Network histograms and universality of blockmodel approximation , 2013, Proceedings of the National Academy of Sciences.

[19]  Dena Marie Asta,et al.  The Geometry of Continuous Latent Space Models for Network Data , 2017, Statistical science : a review journal of the Institute of Mathematical Statistics.

[20]  C. Priebe,et al.  A nonparametric two-sample hypothesis testing problem for random dot product graphs , 2014, 1409.2344.

[21]  S. Janson,et al.  Graph limits and exchangeable random graphs , 2007, 0712.2749.

[22]  Terence Tao,et al.  Random matrices: Universal properties of eigenvectors , 2011, 1103.2801.

[23]  Linyuan Lu,et al.  Spectra of Edge-Independent Random Graphs , 2012, Electron. J. Comb..

[24]  R. Bhatia Matrix Analysis , 1996 .

[25]  C. Priebe,et al.  A Limit Theorem for Scaled Eigenvectors of Random Dot Product Graphs , 2013, Sankhya A.

[26]  Edward R. Scheinerman,et al.  Random Dot Product Graph Models for Social Networks , 2007, WAW.

[27]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[28]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[29]  W. Kahan,et al.  The Rotation of Eigenvectors by a Perturbation. III , 1970 .

[30]  R. Oliveira Concentration of the adjacency matrix and of the Laplacian in random graphs with independent edges , 2009, 0911.0600.

[31]  Mu Zhu,et al.  Automatic dimensionality selection from the scree plot via the use of profile likelihood , 2006, Comput. Stat. Data Anal..