SubMAP: Aligning Metabolic Pathways with Subnetwork Mappings

We consider the problem of aligning two metabolic pathways. Unlike traditional approaches, we do not restrict the alignment to one-to-one mappings between the molecules of the input pathways. We follow the observation that, in nature, different organisms can perform the same or similar functions through different sets of reactions and molecules. The number and the topology of the molecules in these alternative sets often vary from one organism to another. Given two metabolic pathways of arbitrary topology, we therefore seek a mapping that maximizes the similarity between subsets of molecules of the query pathways, where each subset has size at most a given integer k. We transform this problem into an eigenvalue problem. The solution to this eigenvalue problem produces alternative mappings in the form of a weighted bipartite graph. We then convert this graph to a vertex-weighted graph. The maximum-weight independent subset of this new graph is the alignment that maximizes the alignment score while ensuring consistency. We call our algorithm SubMAP (Subnetwork Mappings in Alignment of Pathways). We evaluate its accuracy and performance on real datasets. Our experiments demonstrate that SubMAP can identify biologically relevant mappings that are missed by traditional alignment methods, and that it scales to real-size metabolic pathways. Availability: Our software and its C++ source code are available at http://bioinformatics.cise.ufl.edu/SubMAP.html
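
The final step described above, selecting a consistent set of mappings from a vertex-weighted graph, can be illustrated with a small sketch. This is not the SubMAP implementation (which is in C++); it is a minimal Python illustration under stated assumptions. It assumes that two candidate mappings conflict when they share a molecule on either side, and it swaps in a common greedy heuristic (repeatedly pick the vertex with the largest weight divided by degree plus one) in place of an exact maximum-weight independent set solver. The names `build_conflict_graph` and `greedy_independent_set`, the molecule labels, and the weights are all hypothetical.

```python
from itertools import combinations


def build_conflict_graph(mappings):
    """Connect two candidate mappings if they reuse a molecule.

    Each mapping is (subset_of_pathway_A, subset_of_pathway_B, weight).
    The conflict rule is an assumption made for this sketch.
    """
    adj = {i: set() for i in range(len(mappings))}
    for i, j in combinations(range(len(mappings)), 2):
        a1, b1, _ = mappings[i]
        a2, b2, _ = mappings[j]
        if a1 & a2 or b1 & b2:
            adj[i].add(j)
            adj[j].add(i)
    return adj


def greedy_independent_set(weights, adj):
    """Greedy heuristic: pick the vertex maximizing weight / (degree + 1),
    then discard it and its neighbors. Not guaranteed to be optimal."""
    remaining = set(adj)
    chosen = []
    while remaining:
        best = max(
            remaining,
            # Degree is counted among the vertices still in play;
            # ties are broken by the smaller index.
            key=lambda v: (weights[v] / (len(adj[v] & remaining) + 1), -v),
        )
        chosen.append(best)
        remaining -= adj[best] | {best}
    return sorted(chosen)


# Hypothetical candidate mappings with made-up similarity scores.
mappings = [
    ({"r1"}, {"s1"}, 0.9),        # one-to-one
    ({"r1", "r2"}, {"s2"}, 0.8),  # two molecules mapped to one
    ({"r2"}, {"s3"}, 0.6),
    ({"r3"}, {"s2"}, 0.5),
]
weights = [w for _, _, w in mappings]
adj = build_conflict_graph(mappings)
selected = greedy_independent_set(weights, adj)
total = sum(weights[i] for i in selected)
print(selected, round(total, 2))
```

In this toy instance the two-to-one mapping conflicts with all three others, so the heuristic keeps the three mutually compatible one-to-one mappings instead. With different weights, a subnetwork mapping could win, which is the case one-to-one aligners cannot represent.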
