Statistical Analysis of Global Connectivity and Activity Distributions in Cellular Networks

Various molecular interaction networks have been claimed to follow power-law decay for their global connectivity distribution. It has been proposed that there may be underlying generative models that explain this heavy-tailed behavior by self-reinforcement processes such as classical or hierarchical scale-free network models. Here, we analyze a comprehensive data set of protein-protein and transcriptional regulatory interaction networks in yeast, an Escherichia coli metabolic network, and gene activity profiles for different metabolic states in both organisms. We show that in all cases the networks have a heavy-tailed distribution, but most of them present significant differences from a power-law model according to a stringent statistical test. Those few data sets that have a statistically significant fit with a power-law model follow other distributions equally well. Thus, while our analysis supports that both global connectivity interaction networks and activity distributions are heavy-tailed, they are not generally described by any specific distribution model, leaving space for further inferences on generative models. Supplementary Material is available online at www.liebertonline.com.

[1]  K. Goh,et al.  Universal behavior of load distribution in scale-free networks. , 2001, Physical review letters.

[2]  Q. Vuong Likelihood Ratio Tests for Model Selection and Non-Nested Hypotheses , 1989 .

[3]  H E Stanley,et al.  Classes of small-world networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[5]  Joel S. Bader,et al.  Where Have All the Interactions Gone? Estimating the Coverage of Two-Hybrid Protein Interaction Maps , 2007, PLoS Comput. Biol..

[6]  R. Tanaka,et al.  Scale-rich metabolic networks. , 2005, Physical review letters.

[7]  T. Nakagawa,et al.  The Discrete Weibull Distribution , 1975, IEEE Transactions on Reliability.

[8]  Michel L. Goldstein,et al.  Problems with fitting to the power-law distribution , 2004, cond-mat/0402322.

[9]  M. A. de Menezes,et al.  Intracellular crowding defines the mode and sequence of substrate uptake by Escherichia coli and constrains its metabolic activity , 2007, Proceedings of the National Academy of Sciences.

[10]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[11]  R. Hart Does Size Matter ? Exploring the Small Sample Properties of Maximum Likelihood Estimation , 1999 .

[12]  R. Albert Scale-free networks in cell biology , 2005, Journal of Cell Science.

[13]  Albert-László Barabási,et al.  Internet: Diameter of the World-Wide Web , 1999, Nature.

[14]  Raya Khanin,et al.  How Scale-Free Are Biological Networks , 2006, J. Comput. Biol..

[15]  R. Milo,et al.  Network motifs in integrated cellular networks of transcription-regulation and protein-protein interaction. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Adam M. Feist,et al.  A genome-scale metabolic reconstruction for Escherichia coli K-12 MG1655 that accounts for 1260 ORFs and thermodynamic information , 2007, Molecular systems biology.

[17]  M. Stumpf,et al.  Evolution at the system level: the natural history of protein interaction networks. , 2007, Trends in ecology & evolution.

[18]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[19]  David J. Galas,et al.  A duplication growth model of gene expression networks , 2002, Bioinform..

[20]  Ting Wang,et al.  An improved map of conserved regulatory sites for Saccharomyces cerevisiae , 2006, BMC Bioinformatics.

[21]  Masanori Arita The metabolic world of Escherichia coli is not small. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Herman L Offerhaus,et al.  Accurate and unbiased estimation of power-law exponents from single-emitter blinking data. , 2006, The Journal of chemical physics.

[23]  R. Solé,et al.  Evolving protein interaction networks through gene duplication. , 2003, Journal of theoretical biology.

[24]  Ziv Bar-Joseph,et al.  Impact of the solvent capacity constraint on E. coli metabolism , 2008, BMC Systems Biology.

[25]  M. Gerstein,et al.  Protein family and fold occurrence in genomes: power-law behaviour and evolutionary model. , 2001, Journal of molecular biology.

[26]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[27]  Andrey Rzhetsky,et al.  Birth of scale-free molecular networks and the number of distinct DNA and protein domains per genome , 2001, Bioinform..

[28]  A. M. Edwards,et al.  Revisiting Lévy flight search patterns of wandering albatrosses, bumblebees and deer , 2007, Nature.

[29]  Tetsuya Yomo,et al.  Universality and flexibility in gene expression from bacteria to human. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[30]  E. O’Shea,et al.  Global analysis of protein expression in yeast , 2003, Nature.

[31]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[32]  F. Pammolli,et al.  On Size and Growth of Business Firms , 2003, cond-mat/0307074.

[33]  A. Barabasi,et al.  Global organization of metabolic fluxes in the bacterium Escherichia coli , 2004, Nature.

[34]  Piers J. Ingram,et al.  Probability models for degree distributions of protein interaction networks , 2005 .

[35]  S. Shen-Orr,et al.  Network motifs in the transcriptional regulation network of Escherichia coli , 2002, Nature Genetics.

[36]  J. Doyle,et al.  Some protein interaction data do not exhibit power law statistics , 2005, FEBS letters.

[37]  T. Ideker,et al.  Comprehensive curation and analysis of global interaction networks in Saccharomyces cerevisiae , 2006, Journal of biology.

[38]  Robert Gentleman,et al.  Estimating node degree in bait-prey graphs , 2008, Bioinform..

[39]  J. Rojo Optimality : the second Erich L. Lehmann Symposium , 2006 .