Analytic Correlation Filtration: A New Tool to Reduce Analytical Complexity of Metabolomic Datasets

Metabolomics generates massive and complex data. Redundant different analytical species and the high degree of correlation in datasets is a constraint for the use of data mining/statistical methods and interpretation. In this context, we developed a new tool to detect analytical correlation into datasets without confounding them with biological correlations. Based on several parameters, such as a similarity measure, retention time, and mass information from known isotopes, adducts, or fragments, the algorithm principle is used to group features coming from the same analyte, and to propose one single representative per group. To illustrate the functionalities and added-value of this tool, it was applied to published datasets and compared to one of the most commonly used free packages proposing a grouping method for metabolomics data: ‘CAMERA’. This tool was developed to be included in Galaxy and is available in Workflow4Metabolomics.

[1]  Nicola Zamboni,et al.  High-throughput discovery metabolomics. , 2015, Current opinion in biotechnology.

[2]  Karan Uppal,et al.  xMSannotator: An R Package for Network-Based Annotation of High-Resolution Metabolomics Data. , 2017, Analytical chemistry.

[3]  Torsten Reimer,et al.  Virtual Research Environment Collaborative Landscape Study , 2010 .

[4]  Erik Schultes,et al.  The FAIR Guiding Principles for scientific data management and stewardship , 2016, Scientific Data.

[5]  P. Schmitt‐Kopplin,et al.  Liquid chromatography-mass spectrometry in metabolomics research: mass analyzers in ultra high pressure liquid chromatography coupling. , 2013, Journal of chromatography. A.

[6]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[7]  Marta Díaz,et al.  AStream: an R package for annotating LC/MS metabolomic data , 2011, Bioinform..

[8]  Rawi Ramautar,et al.  Human metabolomics: strategies to understand biology. , 2013, Current opinion in chemical biology.

[9]  J. Lindon,et al.  'Metabonomics': understanding the metabolic responses of living systems to pathophysiological stimuli via multivariate statistical analysis of biological NMR spectroscopic data. , 1999, Xenobiotica; the fate of foreign compounds in biological systems.

[10]  Blandine Comte,et al.  Systems Metabolomics for Prediction of Metabolic Syndrome. , 2017, Journal of proteome research.

[11]  Daniel Jacob,et al.  Workflow4Metabolomics: a collaborative research infrastructure for computational metabolomics , 2014, Bioinform..

[12]  Roger Guimerà,et al.  CliqueMS: a computational tool for annotating in-source metabolite ions from LC-MS untargeted metabolomics data based on a coelution similarity network , 2019, Bioinform..

[13]  S. Neumann,et al.  CAMERA: an integrated strategy for compound spectra extraction and annotation of liquid chromatography/mass spectrometry data sets. , 2012, Analytical chemistry.

[14]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[15]  Yann Guitton,et al.  Create, run, share, publish, and reference your LC-MS, FIA-MS, GC-MS, and NMR data analysis workflows with the Workflow4Metabolomics 3.0 Galaxy online infrastructure for metabolomics. , 2017, The international journal of biochemistry & cell biology.

[16]  Philip Britz-McKibbin,et al.  New advances in separation science for metabolomics: resolving chemical diversity in a post-genomic era. , 2013, Chemical reviews.

[17]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[18]  Arnald Alonso,et al.  Analytical Methods in Untargeted Metabolomics: State of the Art in 2015 , 2015, Front. Bioeng. Biotechnol..