Substance graphs are optimal simple-graph representations of metabolism

One approach to study the system-wide organization of biochemistry is to use statistical graph theory. In such heavily simplified methods, which disregard most of the dynamic aspects of biochemistry, one is faced with fundamental questions. One such question is how the chemical reaction systems should be reduced to a graph retaining as much functional information as possible from the original reaction system. In these graph representations, should the edges go between substrates and products, or substrates and substrates, or both? Should vertices represent substances or reactions? Different representations encode different information about the reaction system and affect network measures in different ways. This paper investigates which representation reflects the functional organization of the metabolic system in the best way, according to the defined criteria. Four different graph representations of metabolism (three where the vertices are metabolites, one where the vertices are reactions) are evaluated using data from different organisms and databases. The graph representations are evaluated by comparing the overlap between clusters (network modules) and annotated functions, and also by comparing the set of identified currency metabolites with those that other authors have identified using qualitative biological arguments. It is found that a “substance network”, where all metabolites participating in a reaction are connected, is better than others, evaluated with respect to both the functional overlap between modules and functions and to the number and identity of the identified currency metabolites.

[1]  R. Guimerà,et al.  Modularity from fluctuations in random graphs and complex networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[2]  D. Segrè,et al.  Supporting Online Material Materials and Methods Tables S1 and S2 References the Effect of Oxygen on Biochemical Networks and the Evolution of Complex Life , 2022 .

[3]  U. Alon Biological Networks: The Tinkerer as an Engineer , 2003, Science.

[4]  Eduardo Sontag,et al.  Modular cell biology: retroactivity and insulation , 2008, Molecular systems biology.

[5]  Kunihiko Kaneko,et al.  Life: An Introduction to Complex Systems Biology , 2006 .

[6]  Monica L. Mo,et al.  Global reconstruction of the human metabolic network based on genomic and bibliomic data , 2007, Proceedings of the National Academy of Sciences.

[7]  J. Coleman Relational Analysis: The Study of Social Organizations with Survey Methods , 1958 .

[8]  M. Newman,et al.  Hierarchical structure and the prediction of missing links in networks , 2008, Nature.

[9]  Jing Zhao,et al.  Complex networks theory for analyzing metabolic networks , 2006, q-bio/0603015.

[10]  D. Fell,et al.  A general definition of metabolic pathways useful for systematic organization and analysis of complex metabolic networks , 2000, Nature Biotechnology.

[11]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..

[12]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[13]  Vincent Fromion,et al.  Reconstruction and analysis of the genetic and metabolic regulatory networks of the central metabolism of Bacillus subtilis , 2008, BMC Systems Biology.

[14]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  An-Ping Zeng,et al.  Reconstruction of metabolic networks from genome data and analysis of their global structure for various organisms , 2003, Bioinform..

[16]  An-Ping Zeng,et al.  Decomposition of metabolic network into functional modules based on the global connectivity structure of reaction graph , 2004, Bioinform..

[17]  K. Kaski,et al.  Limited resolution in complex network community detection with Potts model approach , 2006 .

[18]  Petter Holme,et al.  Subnetwork hierarchies of biochemical pathways , 2002, Bioinform..

[19]  Jing Zhao,et al.  Modular co-evolution of metabolic networks , 2007, BMC Bioinformatics.

[20]  E. Almaas Biological impacts and context of network theory , 2007, Journal of Experimental Biology.

[21]  Anat Kreimer,et al.  The evolution of modularity in bacterial metabolic networks , 2008, Proceedings of the National Academy of Sciences.

[22]  D. Fell,et al.  The small world inside large metabolic networks , 2000, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[23]  Jing Zhao,et al.  Hierarchical modularity of nested bow-ties in metabolic networks , 2006, BMC Bioinformatics.

[24]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[25]  Lenwood S. Heath,et al.  Multimodal Networks: Structure and Operations , 2009, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[26]  V. Lacroix,et al.  An Introduction to Metabolic Networks and Their Structural Analysis , 2008, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[27]  Masanori Arita The metabolic world of Escherichia coli is not small. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[28]  Ulrik Brandes,et al.  On Finding Graph Clusterings with Maximum Modularity , 2007, WG.

[29]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Thomas Pfeiffer,et al.  Exploring the pathway structure of metabolism: decomposition into subnetworks and application to Mycoplasma pneumoniae , 2002, Bioinform..

[31]  Alexander E. Vinogradov,et al.  Modularity of cellular networks shows general center-periphery polarization , 2008, Bioinform..

[32]  Petter Holme,et al.  Currency and commodity metabolites: their identification and relation to the modularity of metabolic networks. , 2006, IET systems biology.

[33]  Petter Holme,et al.  Model validation of simple-graph representations of metabolism , 2008, Journal of The Royal Society Interface.

[34]  Mark E. J. Newman,et al.  Power-Law Distributions in Empirical Data , 2007, SIAM Rev..

[35]  Krin A. Kay,et al.  The implications of human metabolic network topology for disease comorbidity , 2008, Proceedings of the National Academy of Sciences.

[36]  K. Sneppen,et al.  Specificity and Stability in Topology of Protein Networks , 2002, Science.

[37]  D. Lauffenburger Cell signaling pathways as control modules: complexity for simplicity? , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[38]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.