A central limit theorem for an omnibus embedding of random dot product graphs

Performing statistical analyses on collections of graphs is of import to many disciplines, but principled, scalable methods for multisample graph inference are few. In this paper, we describe an omnibus embedding in which multiple graphs on the same vertex set are jointly embedded into a single space with a distinct representation for each graph. We prove a central limit theorem for this omnibus embedding, and we show that this simultaneous embedding into a single common space allows for the comparison of graphs without the requirement that the embedded points associated to each graph undergo cumbersome pairwise alignments. Moreover, the existence of multiple embedded points for each vertex renders possible the resolution of important multiscale graph inference goals, such as the identification of specific subgraphs or vertices as drivers of similarity or difference across large networks. The omnibus embedding achieves near-optimal inference accuracy when graphs arise from a common distribution and yet retains discriminatory power as a test procedure for the comparison of different graphs. We demonstrate the applicability of the omnibus embedding in two analyses of connectomic graphs generated from MRI scans of the brain in human subjects. We show how the omnibus embedding can be used to detect statistically significant differences, at multiple scales, across these networks, with an identification of specific brain regions that are associated with these population-level differences. Finally, we sketch how the omnibus embedding can be used to address pressing open problems, both theoretical and practical, in multisample graph inference.

[1]  H. Hotelling The Generalization of Student’s Ratio , 1931 .

[2]  T. W. Anderson An Introduction to Multivariate Statistical Analysis , 1959 .

[3]  Chandler Davis The rotation of eigenvectors by a perturbation , 1963 .

[4]  W. Kahan,et al.  The Rotation of Eigenvectors by a Perturbation. III , 1970 .

[5]  J. Gower Generalized procrustes analysis , 1975 .

[6]  János Komlós,et al.  The eigenvalues of random symmetric matrices , 1981, Comb..

[7]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[8]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[9]  R. Bhatia Matrix Analysis , 1996 .

[10]  K. Mardia,et al.  Statistical Shape Analysis , 1998 .

[11]  Peter D. Hoff,et al.  Latent Space Approaches to Social Network Analysis , 2002 .

[12]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[13]  Mu Zhu,et al.  Automatic dimensionality selection from the scree plot via the use of profile likelihood , 2006, Comput. Stat. Data Anal..

[14]  Edward R. Scheinerman,et al.  Random Dot Product Graph Models for Social Networks , 2007, WAW.

[15]  S. Janson,et al.  Graph limits and exchangeable random graphs , 2007, 0712.2749.

[16]  J. Gabrieli,et al.  Hyperactivity and hyperconnectivity of the default network in schizophrenia and in first-degree relatives of persons with schizophrenia , 2009, Proceedings of the National Academy of Sciences.

[17]  R. Oliveira Concentration of the adjacency matrix and of the Laplacian in random graphs with independent edges , 2009, 0911.0600.

[18]  Terence Tao,et al.  Random matrices: Universal properties of eigenvectors , 2011, 1103.2801.

[19]  Mark E. J. Newman,et al.  Stochastic blockmodels and community structure in networks , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  Timothy O. Laumann,et al.  Functional Network Organization of the Human Brain , 2011, Neuron.

[21]  Carey E. Priebe,et al.  A Consistent Adjacency Spectral Embedding for Stochastic Blockmodel Graphs , 2011, 1108.2228.

[22]  C. Priebe,et al.  Universally consistent vertex classification for latent positions graphs , 2012, 1212.1182.

[23]  C. Priebe,et al.  A Limit Theorem for Scaled Eigenvectors of Random Dot Product Graphs , 2013, Sankhya A.

[24]  C. Priebe,et al.  Perfect Clustering for Stochastic Blockmodel Graphs via Adjacency Spectral Embedding , 2013, 1310.0532.

[25]  Linyuan Lu,et al.  Spectra of Edge-Independent Random Graphs , 2012, Electron. J. Comb..

[26]  Patrick J. Wolfe,et al.  Network histograms and universality of blockmodel approximation , 2013, Proceedings of the National Academy of Sciences.

[27]  Tengyao Wang,et al.  A useful variant of the Davis--Kahan theorem for statisticians , 2014, 1405.0680.

[28]  Joel A. Tropp,et al.  An Introduction to Matrix Concentration Inequalities , 2015, Found. Trends Mach. Learn..

[29]  P. Bickel,et al.  Role of normalization in spectral clustering for stochastic blockmodels , 2013, 1310.1495.

[30]  S. Chatterjee,et al.  Matrix estimation by Universal Singular Value Thresholding , 2012, 1212.1247.

[31]  Vince Lyzinski,et al.  A joint graph inference case study: the C. elegans chemical and electrical connectomes , 2015, Worm.

[32]  C. Priebe,et al.  A Semiparametric Two-Sample Hypothesis Testing Problem for Random Graphs , 2017 .

[33]  Carey E. Priebe,et al.  Community Detection and Classification in Hierarchical Stochastic Blockmodels , 2015, IEEE Transactions on Network Science and Engineering.

[34]  Rex E. Jung,et al.  Multimodal Neuroimaging in Schizophrenia: Description and Dissemination , 2017, Neuroinformatics.

[35]  Carey E. Priebe,et al.  Limit theorems for eigenvectors of the normalized Laplacian for random graphs , 2016, The Annals of Statistics.