Deciphering the connectivity structure of biological networks using MixNet

BackgroundAs biological networks often show complex topological features, mathematical methods are required to extract meaningful information. Clustering methods are useful in this setting, as they allow the summary of the network's topology into a small number of relevant classes. Different strategies are possible for clustering, and in this article we focus on a model-based strategy that aims at clustering nodes based on their connectivity profiles.ResultsWe present MixNet, the first publicly available computer software that analyzes biological networks using mixture models. We apply this method to various networks such as the E. coli transcriptional regulatory network, the macaque cortex network, a foodweb network and the Buchnera aphidicola metabolic network. This method is also compared with other approaches such as module identification or hierarchical clustering.ConclusionWe show how MixNet can be used to extract meaningful biological information, and to give a summary of the networks topology that highlights important biological features. This approach is powerful as MixNet is adaptive to the network under study, and finds structural information without any a priori on the structure that is investigated. This makes MixNet a very powerful tool to summarize and decipher the connectivity structure of biological networks.

[1]  M. Hattori,et al.  Genome sequence of the endocellular bacterial symbiont of aphids Buchnera sp. APS , 2000, Nature.

[2]  Anestis Antoniadis,et al.  A Multiscale Approach for Statistical Characterization of Functional Images , 2009 .

[3]  D. Thieffry,et al.  Functional organisation of Escherichia coli transcriptional regulatory network , 2008, Journal of molecular biology.

[4]  Bradford A. Hawkins,et al.  EFFECTS OF SAMPLING EFFORT ON CHARACTERIZATION OF FOOD-WEB STRUCTURE , 1999 .

[5]  S. Shen-Orr,et al.  Network motifs in the transcriptional regulation network of Escherichia coli , 2002, Nature Genetics.

[6]  Gilles Celeux,et al.  A statistical approach for CGH microarray data analysis , 2004 .

[7]  Neo D. Martinez,et al.  Food-web structure and network theory: The role of connectance and size , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[8]  St'ephane Robin,et al.  Uncovering latent structure in valued graphs: A variational approach , 2010, 1011.1813.

[9]  Masanori Arita The metabolic world of Escherichia coli is not small. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Marc Lavielle,et al.  Using penalized contrasts for the change-point problem , 2005, Signal Process..

[11]  Oliver Ebenhöh,et al.  An environmental perspective on metabolism. , 2008, Journal of theoretical biology.

[12]  S. Strogatz Exploring complex networks , 2001, Nature.

[13]  H. Malke,et al.  Paul Buchner, Endosymbiosis of Animals with Plant Microorganisms. 909 S., 371 Abb., 5 Tab., 6 Taf. New York 1965: John Wiley & Sons, Inc.: Interscience Publ. $ 35.00 , 1967 .

[14]  J. Collado-Vides,et al.  Identifying global regulators in transcriptional regulatory networks in bacteria. , 2003, Current opinion in microbiology.

[15]  T. Snijders,et al.  Estimation and Prediction for Stochastic Blockstructures , 2001 .

[16]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[17]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[18]  H. Dawah,et al.  Structure of the parasitoid communities of grass-feeding chalcid wasps , 1995 .

[19]  P. Buchner Endosymbiosis of Animals with Plant Microorganisms , 1965 .

[20]  Peter D. Karp,et al.  The Pathway Tools software , 2002, ISMB.

[21]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[22]  Franck Picard,et al.  Assessing the Exceptionality of Network Motifs , 2007, J. Comput. Biol..

[23]  Z. N. Oltvai,et al.  Topological units of environmental signal processing in the transcriptional regulatory network of Escherichia coli , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[24]  C. Médigue,et al.  MaGe: a microbial genome annotation system supported by synteny results , 2006, Nucleic acids research.

[25]  Franck Picard,et al.  A mixture model for random graphs , 2008, Stat. Comput..

[26]  Franck Picard,et al.  A statistical approach for array CGH data analysis , 2005, BMC Bioinformatics.

[27]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[28]  O. Sporns,et al.  Identification and Classification of Hubs in Brain Networks , 2007, PloS one.

[29]  Michael I. Jordan,et al.  An Introduction to Variational Methods for Graphical Models , 1999, Machine Learning.

[30]  S H Strogatz,et al.  Random graph models of social networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[31]  E A Leicht,et al.  Mixture models and exploratory analysis in networks , 2006, Proceedings of the National Academy of Sciences.

[32]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[33]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.