Comparative classification of species and the study of pathway evolution based on the alignment of metabolic pathways

Background

Pathways provide topical descriptions of cellular circuitry. Comparing analogous pathways reveals intricate insights into individual functional differences among species. Previous work in the field performed genomic comparisons and evolutionary studies based on specific genes or proteins, whole genomic sequences, or even single pathways, but none described a genomic, system-level comparative analysis of metabolic pathways. Implementing such an analysis properly requires overcoming two specific challenges: how to combine the effect of many pathways under a unified framework, and how to appropriately analyze the co-evolution of pathways.

Here we present a computational approach for solving these two challenges. First, we describe a comprehensive, scalable, information-theory-based computational pipeline that calculates pathway alignment information and then compiles it in a novel manner that allows further analysis. This approach can be used for building phylogenies and for pointing out specific differences that can then be analyzed in depth. Second, we describe a new approach for comparing the evolution of metabolic pathways. This approach can be used for detecting co-evolutionary relationships between metabolic pathways.

Results

We demonstrate the advantages of our approach by applying our pipeline to data from the MetaCyc repository (which includes a total of 205 organisms and 660 metabolic pathways). Our analysis revealed several surprising biological observations. For example, we show that the different habitats in which Archaea reside are reflected by a pathway-based phylogeny. In addition, we discover two striking clusters of metabolic pathways, each of which includes pathways with very similar evolution.

Conclusion

We demonstrate that distance measures based on the topology and the content of metabolic networks are useful for studying evolution and co-evolution.
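The two ideas described in the abstract, combining many per-pathway comparisons into one organism-level distance and comparing how different pathways evolve, can be illustrated with a toy sketch. Everything in it is an assumption made for illustration: the abstract does not specify the formulas, so the element-wise average in `combine_pathway_distances` and the Pearson correlation in `coevolution_score` are simple stand-ins, not the information-theoretic measure the authors use. The function names and the three-organism matrices are hypothetical.

```python
from itertools import combinations
from math import sqrt


def upper_triangle(matrix):
    """Flatten the strict upper triangle of a square matrix into a list."""
    n = len(matrix)
    return [matrix[i][j] for i, j in combinations(range(n), 2)]


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        # Correlation is undefined for constant input; treat it as no signal.
        return 0.0
    return cov / sqrt(var_x * var_y)


def combine_pathway_distances(matrices):
    """Average per-pathway organism distance matrices entry by entry.

    ASSUMPTION: a plain mean stands in for however the pipeline actually
    compiles pathway alignment information into one framework.
    """
    n = len(matrices[0])
    return [
        [sum(m[i][j] for m in matrices) / len(matrices) for j in range(n)]
        for i in range(n)
    ]


def coevolution_score(dist_a, dist_b):
    """Correlate two pathways' organism-by-organism distance matrices.

    ASSUMPTION: a high correlation is read as 'similar evolution'; this is
    a generic distance-matrix comparison, not the paper's stated method.
    """
    return pearson(upper_triangle(dist_a), upper_triangle(dist_b))


# Hypothetical distances between three organisms for three pathways.
pathway_1 = [[0, 1, 4], [1, 0, 3], [4, 3, 0]]
pathway_2 = [[0, 2, 8], [2, 0, 6], [8, 6, 0]]  # same pattern as pathway_1, scaled
pathway_3 = [[0, 4, 1], [4, 0, 3], [1, 3, 0]]  # a different pattern

combined = combine_pathway_distances([pathway_1, pathway_2, pathway_3])
score_similar = coevolution_score(pathway_1, pathway_2)
score_different = coevolution_score(pathway_1, pathway_3)
```

In this sketch, `combined` could be handed to any distance-based tree-building method to obtain a phylogeny, and the pairwise `coevolution_score` values could be clustered to look for groups of pathways with similar evolution.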
