Compound clustering and consensus scopes of metabolic networks

We investigate the structure of metabolic networks by identifying sets of metabolites having a similar synthesizing capacity. We measure the synthesizing capacity of a compound by determining all metabolites that can be produced from it, and call this set the scope of the compound. We then define a distance measure based on the Jaccard coefficient and apply a hierarchical clustering method. Compounds within the same cluster are chemically similar and often appear in the same metabolic pathway. For each cluster we define a consensus scope by determining a set of metabolites that is most similar to all scopes within the cluster. We find that only a few of the resulting consensus scopes are mutually disjoint while others overlap, and some consensus scopes are fully contained in others. Thus, our approach reveals a number of functional subunits of the metabolic network which are arranged into a hierarchical setting.

[1]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[2]  A. Barabasi,et al.  Network biology: understanding the cell's functional organization , 2004, Nature Reviews Genetics.

[3]  Antje Chang,et al.  BRENDA, enzyme data and metabolic information , 2002, Nucleic Acids Res..

[4]  Oliver Ebenhöh,et al.  Expanding Metabolic Networks: Scopes of Compounds, Robustness, and Evolution , 2005, Journal of Molecular Evolution.

[5]  Reinhart Heinrich,et al.  Structural analysis of expanding metabolic networks. , 2004, Genome informatics. International Conference on Genome Informatics.

[6]  A. Barabasi,et al.  Hierarchical Organization of Modularity in Metabolic Networks , 2002, Science.

[7]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[8]  R. Albert,et al.  The large-scale organization of metabolic networks , 2000, Nature.

[9]  D. Fell,et al.  The small world inside large metabolic networks , 2000, Proceedings of the Royal Society of London. Series B: Biological Sciences.