IPO: a tool for automated optimization of XCMS parameters

Background
Untargeted metabolomics generates a huge amount of data. Software packages for automated data processing are crucial to successfully process these data. A variety of such software packages exist, but the outcome of data processing strongly depends on algorithm parameter settings. Poorly chosen parameter settings can easily lead to biased results, so the settings themselves require optimization. Several parameter optimization approaches have already been proposed, but a software package for parameter optimization that is free of intricate experimental labeling steps, fast and widely applicable is still missing.

Results
We implemented the software package IPO (‘Isotopologue Parameter Optimization’), which is fast, free of labeling steps, and applicable to data from different kinds of samples, from different methods of liquid chromatography - high resolution mass spectrometry, and from different instruments. IPO optimizes XCMS peak picking parameters by using natural, stable 13C isotopic peaks to calculate a peak picking score. Retention time correction is optimized by minimizing relative retention time differences within peak groups. Grouping parameters are optimized by maximizing the number of peak groups that show one peak from each injection of a pooled sample. The candidate parameter settings are generated by design of experiments, and the resulting scores are evaluated using response surface models. IPO was tested on three different data sets, each consisting of a training set and a test set. IPO resulted in an increase of reliable groups (146% - 361%), a decrease of non-reliable groups (3% - 8%) and a decrease of the retention time deviation to one third.

Conclusions
IPO was successfully applied to data derived from liquid chromatography coupled to high resolution mass spectrometry from three studies with different sample types and different chromatographic methods and devices. We were also able to show the potential of IPO to increase the reliability of metabolomics data. The source code is implemented in R, tested on Linux and Windows, and freely available for download at https://github.com/glibiseller/IPO. The training sets and test sets can be downloaded from https://health.joanneum.at/IPO.
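The idea of scoring a peak list by its natural 13C isotopic peaks can be illustrated with a minimal sketch. It is written in Python for illustration only and is not taken from the IPO source code, which is implemented in R. The Peak class, the function names, the tolerances (ppm, rt_tol), the requirement that the 13C peak be less intense than its partner, and the scoring rule (squared number of isotopologue peaks divided by the total number of peaks) are all assumptions made for this example; the abstract does not state the exact formula.

```python
from dataclasses import dataclass

# Mass difference between 13C and 12C in Da.
C13_C12_MASS_DIFF = 1.003355


@dataclass
class Peak:
    mz: float         # mass-to-charge ratio
    rt: float         # retention time in seconds
    intensity: float  # peak intensity (arbitrary units)


def count_isotopologue_peaks(peaks, ppm=10.0, rt_tol=2.0):
    """Count peaks that belong to an M / M+1 (13C) pair.

    Assumption for this sketch: a pair is accepted when the second peak
    lies one 13C mass shift above the first (within `ppm`), co-elutes
    (within `rt_tol` seconds) and is less intense than the first.
    """
    reliable = set()
    for i, mono in enumerate(peaks):
        target_mz = mono.mz + C13_C12_MASS_DIFF
        mz_tol = target_mz * ppm * 1e-6
        for j, iso in enumerate(peaks):
            if i == j:
                continue
            if (abs(iso.mz - target_mz) <= mz_tol
                    and abs(iso.rt - mono.rt) <= rt_tol
                    and iso.intensity < mono.intensity):
                reliable.update((i, j))
    return len(reliable)


def peak_picking_score(peaks, ppm=10.0, rt_tol=2.0):
    """Hypothetical score: reward isotopologue peaks, penalize the rest."""
    if not peaks:
        return 0.0
    reliable = count_isotopologue_peaks(peaks, ppm=ppm, rt_tol=rt_tol)
    return reliable ** 2 / len(peaks)


# Two toy peak lists, as two parameter settings might produce them.
setting_a = [
    Peak(180.0634, 120.0, 1.0e6),
    Peak(181.06676, 120.3, 6.5e4),  # 13C partner of the first peak
    Peak(250.1000, 300.0, 5.0e5),
    Peak(400.2000, 50.0, 2.0e5),
]
setting_b = [p for k, p in enumerate(setting_a) if k != 1]  # partner missed

score_a = peak_picking_score(setting_a)
score_b = peak_picking_score(setting_b)
```

In this toy case setting_a scores higher than setting_b because it recovers the isotopologue pair. An optimizer in the spirit of the abstract would evaluate such a score for each parameter setting in an experimental design and fit a response surface to the results.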
