KNApSAcK family databases: integrated metabolite-plant species databases for multifaceted plant research.

A database (DB) describing the relationships between species and their metabolites would be useful for metabolomics research, because metabolomics targets the systematic analysis of enormous numbers of organic compounds with known or unknown structures. We constructed an extensive species-metabolite DB for plants, the KNApSAcK Core DB, which contains 101,500 species-metabolite relationships encompassing 20,741 species and 50,048 metabolites. We also developed a search engine within the KNApSAcK Core DB for use in metabolomics research, making it possible to search for metabolites by accurate mass, molecular formula, metabolite name, or mass spectra in several ionization modes. In addition, we have developed databases for retrieving metabolites related to plants used for a range of purposes. In our multifaceted plant usage DB, medicinal/edible plants are related to the geographic zones (GZs) where the plants are used, to their biological activities, and to formulae of Japanese and Indonesian traditional medicines (Kampo and Jamu, respectively). These data are connected to the species-metabolite relationship DB within the KNApSAcK Core DB, keyed via the species names. All databases can be accessed via the website http://kanaya.naist.jp/KNApSAcK_Family/. The KNApSAcK WorldMap DB comprises 41,548 GZ-plant pair entries, including 222 GZs and 15,240 medicinal/edible plants. The KAMPO DB consists of 336 formulae encompassing 278 medicinal plants; the JAMU DB consists of 5,310 formulae encompassing 550 medicinal plants. The Biological Activity DB consists of 2,418 biological activities and 33,706 pairwise relationships between medicinal plants and their biological activities. The current statistics of the binary relationships between individual databases were characterized by degree distribution analysis, leading to a prediction of at least 1,060,000 metabolites within all plants. In the future, the study of metabolomics will need to take this huge number of metabolites into consideration.
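
The accurate-mass search mentioned above can be illustrated with a minimal Python sketch. It is not the KNApSAcK search engine or its API: the three-compound table, the function names, the adduct labels, and the 10 ppm default tolerance are all assumptions made for illustration. The idea is to convert an observed m/z back to a neutral monoisotopic mass for the chosen ionization mode and then keep the compounds whose calculated mass lies within the tolerance.

```python
import re

# Monoisotopic masses of the most abundant isotopes (standard values, in Da).
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}
PROTON = 1.00727647  # mass of a proton, in Da

# Mass shift applied to the neutral molecule in each ionization mode.
# Only two common adducts are covered in this sketch.
ADDUCT_SHIFT = {"[M+H]+": PROTON, "[M-H]-": -PROTON}

# Hypothetical toy table of (metabolite name, molecular formula);
# a real database would hold tens of thousands of such records.
COMPOUNDS = [
    ("glucose", "C6H12O6"),
    ("caffeine", "C8H10N4O2"),
    ("quercetin", "C15H10O7"),
]

def monoisotopic_mass(formula):
    """Sum the monoisotopic masses of the atoms in a simple formula like C6H12O6."""
    mass = 0.0
    for element, count in re.findall(r"([A-Z][a-z]?)(\d*)", formula):
        mass += MONOISOTOPIC[element] * (int(count) if count else 1)
    return mass

def search_by_mass(observed_mz, adduct, tolerance_ppm=10.0):
    """Return names of compounds whose neutral mass matches the observed m/z."""
    neutral = observed_mz - ADDUCT_SHIFT[adduct]
    hits = []
    for name, formula in COMPOUNDS:
        mass = monoisotopic_mass(formula)
        error_ppm = abs(mass - neutral) / mass * 1e6
        if error_ppm <= tolerance_ppm:
            hits.append(name)
    return hits

if __name__ == "__main__":
    print(search_by_mass(195.0877, "[M+H]+"))  # positive-mode query
    print(search_by_mass(301.0354, "[M-H]-"))  # negative-mode query
```

Expressing the tolerance in ppm rather than in absolute Da keeps the search window proportional to the mass, which is the usual convention for high-resolution mass spectrometry data.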
