A semi-parametric statistical test to compare complex networks

The modelling of real-world data as complex networks is ubiquitous in several scientific fields, for example, in molecular biology, we study gene regulatory networks and protein–protein interaction (PPI)_networks; in neuroscience, we study functional brain networks; and in social science, we analyse social networks. In contrast to theoretical graphs, real-world networks are better modelled as realizations of a random process. Therefore, analyses using methods based on deterministic graphs may be inappropriate. For example, verifying the isomorphism between two graphs is of limited use to decide whether two (or more) real-world networks are generated from the same random process. To overcome this problem, in this article, we introduce a semi-parametric approach similar to the analysis of variance to test the equality of generative models of two or more complex networks. We measure the performance of the proposed statistic using Monte Carlo simulations and illustrate its usefulness by comparing PPI networks of six enteric pathogens.

[1]  E. Nadaraya On Estimating Regression , 1964 .

[2]  J. Dall,et al.  Random geometric graphs. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  João Ricardo Sato,et al.  Statistical Methods in Graphs: Parameter Estimation, Model Selection, and Hypothesis Test , 2016 .

[4]  Daniel Y. Takahashi,et al.  A Statistical Method to Distinguish Functional Brain Networks , 2017, Front. Neurosci..

[5]  João Ricardo Sato,et al.  Discriminating Different Classes of Biological Networks by Analyzing the Graphs Spectra Distribution , 2012, PloS one.

[6]  Markus Meringer,et al.  Fast generation of regular graphs and construction of cages , 1999, J. Graph Theory.

[7]  Charles L. Hofacre,et al.  Enumeration of Salmonella and Campylobacter spp. in Environmental Farm Samples and Processing Plant Carcass Rinses from Commercial Broiler Chicken Flocks , 2013, Applied and Environmental Microbiology.

[8]  Daniel J. Brass,et al.  Network Analysis in the Social Sciences , 2009, Science.

[9]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[10]  B. Silverman,et al.  Choosing the window width when estimating a density , 1978 .

[11]  D. W. Scott,et al.  On Locally Adaptive Density Estimation , 1996 .

[12]  Alan M. Frieze,et al.  Random graphs , 2006, SODA '06.

[13]  Ulrike von Luxburg,et al.  Two-Sample Tests for Large Random Graphs Using Network Statistics , 2017, COLT.

[14]  Carey E. Priebe,et al.  A semiparametric two-sample hypothesis testing problem for random graphs , 2017 .

[15]  Olaf Sporns,et al.  Complex network measures of brain connectivity: Uses and interpretations , 2010, NeuroImage.

[16]  O. Sporns,et al.  Complex brain networks: graph theoretical analysis of structural and functional systems , 2009, Nature Reviews Neuroscience.

[17]  B. Ahmer,et al.  Yersinia enterocolitica Inhibits Salmonella enterica Serovar Typhimurium and Listeria monocytogenes Cellular Uptake , 2013, Infection and Immunity.

[18]  Ricardo Fraiman,et al.  An ANOVA approach for statistical comparisons of brain networks , 2018, Scientific Reports.

[19]  Tom A. B. Snijders,et al.  Social Network Analysis , 2011, International Encyclopedia of Statistical Science.

[20]  Zhao Xu,et al.  Shigella Strains Are Not Clones of Escherichia coli but Sister Species in the Genus Escherichia , 2012, Genom. Proteom. Bioinform..

[21]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[22]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[23]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[24]  Andressa Cerqueira,et al.  A Test of Hypotheses for Random Graph Distributions Built from EEG Data , 2015, IEEE Transactions on Network Science and Engineering.

[25]  Marián Boguñá,et al.  Popularity versus similarity in growing networks , 2011, Nature.