Mass Spectral Feature List Optimizer (MS-FLO): A Tool To Minimize False Positive Peak Reports in Untargeted Liquid Chromatography-Mass Spectroscopy (LC-MS) Data Processing.

Untargeted metabolomics by liquid chromatography-mass spectrometry generates data-rich chromatograms in the form of m/z-retention time features. Managing such datasets is a bottleneck. Many popular data processing tools, including XCMS-online and MZmine2, yield numerous false-positive peak detections. Flagging and removing such false peaks manually is a time-consuming task and prone to human error. We present a web application, Mass Spectral Feature List Optimizer (MS-FLO), to improve the quality of feature lists after initial processing to expedite the process of data curation. The tool utilizes retention time alignments, accurate mass tolerances, Pearson's correlation analysis, and peak height similarity to identify ion adducts, duplicate peak reports, and isotopic features of the main monoisotopic metabolites. Removing such erroneous peaks reduces the overall number of metabolites in data reports and improves the quality of subsequent statistical investigations. To demonstrate the effectiveness of MS-FLO, we processed 28 biological studies and uploaded raw and results data to the Metabolomics Workbench website ( www.metabolomicsworkbench.org ), encompassing 1481 chromatograms produced by two different data processing programs used in-house (MZmine2 and later MS-DIAL). Post-processing of datasets with MS-FLO yielded a 7.8% automated reduction of total peak features and flagged an additional 7.9% of features, per dataset, for review by the user. When manually curated, 87% of these additional flagged features were verified false positives. MS-FLO is an open source web application that is freely available for use at http://msflo.fiehnlab.ucdavis.edu .

[1]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[2]  Eoin Fahy,et al.  Metabolomics Workbench: An international repository for metabolomics data and metadata, metabolite standards, protocols, tutorials and training, and analysis tools , 2015, Nucleic Acids Res..

[3]  Erik Johansson,et al.  Strategy for optimizing LC-MS data processing in metabolomics: a design of experiments approach. , 2012, Analytical chemistry.

[4]  S. Neumann,et al.  CAMERA: an integrated strategy for compound spectra extraction and annotation of liquid chromatography/mass spectrometry data sets. , 2012, Analytical chemistry.

[5]  Xiaohui Lin,et al.  Ion fusion of high-resolution LC-MS-based metabolomics data to discover more reliable biomarkers. , 2014, Analytical chemistry.

[6]  Laxman Yetukuri,et al.  Algorithms and tools for the preprocessing of LC–MS metabolomics data , 2011 .

[7]  H. C. Bertram,et al.  Time-saving design of experiment protocol for optimization of LC-MS data processing in metabolomic approaches. , 2013, Analytical chemistry.

[8]  Masanori Arita,et al.  MS-DIAL: Data Independent MS/MS Deconvolution for Comprehensive Metabolome Analysis , 2015, Nature Methods.

[9]  Alexander Goesmann,et al.  ALLocator: An Interactive Web Platform for the Analysis of Metabolomic LC-ESI-MS Datasets, Enabling Semi-Automated, User-Revised Compound Annotation and Mass Isotopomer Ratio Analysis , 2014, PloS one.

[10]  Oliver Fiehn,et al.  Toward Merging Untargeted and Targeted Methods in Mass Spectrometry-Based Metabolomics and Lipidomics. , 2016, Analytical chemistry.

[11]  Matej Oresic,et al.  MZmine 2: Modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data , 2010, BMC Bioinformatics.

[12]  Michel José Anzanello,et al.  Chemometrics and Intelligent Laboratory Systems , 2009 .