Visualization, Quantification, and Alignment of Spectral Drift in Population Scale Untargeted Metabolomics Data.

Untargeted liquid chromatography-mass spectrometry (LC-MS)-based metabolomics analysis of human biospecimens has become one of the most promising strategies for probing the underpinnings of human health and disease. Analysis of spectral data across population-scale cohorts, however, is hindered by day-to-day nonlinear drift in LC retention time and by batch effects, both of which complicate comparison of thousands of untargeted peaks. To date, there has been no efficient means of visualizing and quantitatively assessing signal drift, correcting drift when present, and automatically filtering unstable spectral features, particularly across the thousands of data files generated in population-scale experiments. Herein, we report the development of a set of R-based scripts for pre- and postprocessing of raw LC-MS data. These methods can be integrated with existing data analysis workflows: as an initial preprocessing step, they provide bulk nonlinear retention time correction at the raw-data level. As postprocessing steps, they provide visualization and quantification of peak alignment accuracy, as well as peak-reliability-based parsing of processed data through hierarchical clustering of signal profiles. In a metabolomics data set derived from ∼3000 human plasma samples, we found that applying our alignment tools substantially improved peak alignment accuracy, automated data filtering, and, ultimately, statistical power for detecting metabolite correlates of clinical measures. These tools will enable metabolomics studies of population-scale cohorts.
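To make the idea of nonlinear retention time correction concrete, the following is a minimal Python sketch, not the R scripts reported here. It assumes that a few landmark peaks have known reference retention times and warps every other retention time in a run by piecewise-linear interpolation between them. The function name `build_rt_warp` and the landmark values are hypothetical and chosen only for illustration.

```python
from bisect import bisect_right


def build_rt_warp(observed_landmarks, reference_landmarks):
    """Return a function mapping an observed RT to the reference RT scale.

    Assumption for illustration: the warp is piecewise linear between
    landmark pairs. Outside the landmark range, the shift of the nearest
    landmark is applied unchanged.
    """
    pairs = sorted(zip(observed_landmarks, reference_landmarks))
    obs = [p[0] for p in pairs]
    ref = [p[1] for p in pairs]
    if len(obs) < 2:
        raise ValueError("need at least two landmarks")

    def warp(rt):
        if rt <= obs[0]:
            return rt + (ref[0] - obs[0])
        if rt >= obs[-1]:
            return rt + (ref[-1] - obs[-1])
        # Index of the landmark at or just below rt.
        i = bisect_right(obs, rt) - 1
        frac = (rt - obs[i]) / (obs[i + 1] - obs[i])
        return ref[i] + frac * (ref[i + 1] - ref[i])

    return warp


# Hypothetical landmark retention times (minutes) for one drifting run.
observed = [1.10, 4.35, 8.90]
reference = [1.00, 4.00, 9.00]
warp = build_rt_warp(observed, reference)

# Warp a few peak retention times onto the reference scale.
corrected = [round(warp(rt), 3) for rt in [1.10, 2.725, 4.35, 10.0]]
```

Because the local shift differs between landmark intervals, the overall mapping is nonlinear even though each segment is linear. A smoother fit, such as a spline or local regression, could be substituted under the same interface.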
