Compliance with minimum information guidelines in public metabolomics repositories

The Metabolomics Standards Initiative (MSI) guidelines were first published in 2007. These guidelines provided reporting standards for all stages of metabolomics analysis: experimental design, biological context, chemical analysis and data processing. Since 2012, a series of public metabolomics databases and repositories, which accept the deposition of metabolomic datasets, have arisen. In this study, the compliance of 399 public data sets, from four major metabolomics data repositories, to the biological context MSI reporting standards was evaluated. None of the reporting standards were complied with in every publicly available study, although adherence rates varied greatly, from 0 to 97%. The plant minimum reporting standards were the most complied with and the microbial and in vitro were the least. Our results indicate the need for reassessment and revision of the existing MSI reporting standards.

[1]  Nigel W. Hardy,et al.  The metabolomics standards initiative (MSI) , 2007, Metabolomics.

[2]  Yves Gibon,et al.  GMD@CSB.DB: the Golm Metabolome Database , 2005, Bioinform..

[3]  Matej Oresic,et al.  COordination of Standards in MetabOlomicS (COSMOS): facilitating integrated metabolomics data access , 2015, Metabolomics.

[4]  Matej Oresic,et al.  COordination of Standards in MetabOlomicS (COSMOS): facilitating integrated metabolomics data access , 2015, Metabolomics.

[5]  Peng Zhang,et al.  PhenoMeter: A Metabolome Database Search Tool Using Statistical Similarity Matching of Metabolic Phenotypes for High-Confidence Detection of Functional Links , 2015, Front. Bioeng. Biotechnol..

[6]  Nigel W. Hardy,et al.  The Metabolomics Standards Initiative , 2007, Nature Biotechnology.

[7]  S. Neumann,et al.  PredRet: prediction of retention time by direct mapping between multiple chromatographic systems. , 2015, Analytical chemistry.

[8]  Thomas F. Malone,et al.  The environmental context , 1971 .

[9]  Brian A. Nosek,et al.  How open science helps researchers succeed , 2016, eLife.

[10]  Kristian Fog Nielsen,et al.  Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking , 2016, Nature Biotechnology.

[11]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[12]  Hilla Peretz,et al.  The , 1966 .

[13]  Royston Goodacre Water, water, every where, but rarely any drop to drink , 2013, Metabolomics.

[14]  Feng Zhu,et al.  Performance Evaluation and Online Realization of Data-driven Normalization Methods Used in LC/MS based Untargeted Metabolomics Analysis , 2016, Scientific Reports.

[15]  Christoph Steinbeck,et al.  MetaboLights—an open-access general-purpose repository for metabolomics studies and associated meta-data , 2012, Nucleic Acids Res..

[16]  Reality check on reproducibility , 2016, Nature.

[17]  The Standard Metabolic Reporting Structures working group Summary recommendations for standardization and reporting of metabolic analyses , 2005 .

[18]  Heather A. Piwowar,et al.  Data reuse and the open data citation advantage , 2013, PeerJ.

[19]  A. Brazma,et al.  Standards for systems biology , 2006, Nature Reviews Genetics.

[20]  Ute Roessner,et al.  Minimum reporting standards for plant biology context information in metabolomic studies , 2007, Metabolomics.

[21]  Ralf Takors,et al.  Standard reporting requirements for biological samples in metabolomics experiments: microbial and in vitro biology experiments , 2007, Metabolomics.

[22]  Joachim Selbig,et al.  The Golm Metabolome Database: a database for GC-MS based metabolite profiling , 2007 .

[23]  A. Harvey Millar,et al.  The MetabolomeExpress Project: enabling web-based processing, analysis and transparent dissemination of GC/MS metabolomics datasets , 2010, BMC Bioinformatics.

[24]  Susanna-Assunta Sansone,et al.  Standard reporting requirements for biological samples in metabolomics experiments: environmental context , 2007, Metabolomics.

[25]  Christoph Steinbeck,et al.  mzML2ISA & nmrML2ISA: generating enriched ISA-Tab metadata files from metabolomics XML data , 2017, Bioinform..

[26]  Eoin Fahy,et al.  Metabolomics Workbench: An international repository for metabolomics data and metadata, metabolite standards, protocols, tutorials and training, and analysis tools , 2015, Nucleic Acids Res..

[27]  Leo L. Cheng,et al.  Standard reporting requirements for biological samples in metabolomics experiments: mammalian/in vivo experiments , 2007, Metabolomics.

[28]  Macha Nikolski,et al.  MeRy-B: a web knowledgebase for the storage, visualization, analysis and annotation of plant NMR metabolomic profiles , 2011, BMC Plant Biology.

[29]  Michael L. Turner,et al.  The influence of scaling metabolomics data on model classification accuracy , 2015, Metabolomics.

[30]  Matej Oresic,et al.  Data standards can boost metabolomics research, and if there is a will, there is a way , 2015, Metabolomics.

[31]  Nigel W. Hardy,et al.  A proposed framework for the description of plant metabolomics experiments and their results , 2004, Nature Biotechnology.

[32]  Jason E. Stewart,et al.  Minimum information about a microarray experiment (MIAME)—toward standards for microarray data , 2001, Nature Genetics.

[33]  Lennart Martens,et al.  The minimum information about a proteomics experiment (MIAPE) , 2007, Nature Biotechnology.

[34]  Catherine A Ball,et al.  Are we stuck in the standards? , 2006, Nature Biotechnology.

[35]  Andrew R. Jones,et al.  ProteomeXchange provides globally co-ordinated proteomics data submission and dissemination , 2014, Nature Biotechnology.