msPurity: Automated Evaluation of Precursor Ion Purity for Mass Spectrometry-Based Fragmentation in Metabolomics.

Tandem mass spectrometry (MS/MS or MS2) is a widely used approach for structural annotation and identification of metabolites in complex biological samples. The importance of assessing the contribution of the precursor ion within an isolation window for MS2 experiments has been previously detailed in proteomics, where precursor ion purity influences the quality and accuracy of matching to mass spectral libraries, but to date, there has been little attention to this data-processing technique in metabolomics. Here, we present msPurity, a vendor-independent R package for liquid chromatography (LC) and direct infusion (DI) MS2 that calculates a simple metric to describe the contribution of the selected precursor. The precursor purity metric is calculated as "intensity of a selected precursor divided by the summed intensity of the isolation window". The metric is interpolated at the recorded point of MS2 acquisition using bordering full-scan spectra. Isotopic peaks of the selected precursor can be removed, and low abundance peaks that are believed to have limited contribution to the resulting MS2 spectra are removed. Additionally, the isolation efficiency of the mass spectrometer can be taken into account. The package was applied to Data Dependent Acquisition (DDA)-based MS2 metabolomics data sets derived from three metabolomics data repositories. For the 10 LC-MS2 DDA data sets with > ±1 Da isolation windows, the median precursor purity score ranged from 0.67 to 0.96 (scale = 0 to +1). The R package was also used to assess precursor purity of theoretical isolation windows from LC-MS data sets of differing sample types. The theoretical isolation windows being the same width used for an anticipated DDA experiment (±0.5 Da). The most complex sample had a median precursor purity score of 0.46 for the 64,498 XCMS determined features, in comparison to the less spectrally complex sample that had a purity score of 0.66 for 5071 XCMS features. It has been previously reported in proteomics that a purity score of <0.5 can produce unreliable spectra matching results. With this assumption, we show that for complex samples there will be a large number of metabolites where traditional DDA approaches will struggle to provide reliable annotations or accurate matches to mass spectral libraries.

[1]  Eoin Fahy,et al.  Metabolomics Workbench: An international repository for metabolomics data and metadata, metabolite standards, protocols, tutorials and training, and analysis tools , 2015, Nucleic Acids Res..

[2]  John R Yates,et al.  Extracting Accurate Precursor Information for Tandem Mass Spectra by RawConverter. , 2015, Analytical chemistry.

[3]  E. Kujawinski,et al.  Release of ecologically relevant metabolites by the cyanobacterium Synechococcus elongates CCMP 1631. , 2015, Environmental microbiology.

[4]  Lei Zhou,et al.  Global Metabonomic and Proteomic Analysis of Human Conjunctival Epithelial Cells (IOBA-NHC) in Response to Hyperosmotic Stress. , 2015, Journal of proteome research.

[5]  Masanori Arita,et al.  MS-DIAL: Data Independent MS/MS Deconvolution for Comprehensive Metabolome Analysis , 2015, Nature Methods.

[6]  Matthias Mann,et al.  Visualization of LC‐MS/MS proteomics data in MaxQuant , 2015, Proteomics.

[7]  E. Kujawinski,et al.  Dissolved organic matter produced by Thalassiosira pseudonana , 2015 .

[8]  R. Salek,et al.  Metabolic differences in ripening of Solanum lycopersicum ‘Ailsa Craig’ and three monogenic mutants , 2014, Scientific Data.

[9]  Jennifer A Kirwan,et al.  Direct infusion mass spectrometry metabolomics dataset: a benchmark for data processing and quality control , 2014, Scientific Data.

[10]  Ben C. Collins,et al.  A tool for the automated, targeted analysis of data-independent acquisition MS-data: OpenSWATH , 2014 .

[11]  G. Patti,et al.  An untargeted metabolomic workflow to improve structural characterization of metabolites. , 2013, Analytical chemistry.

[12]  Christoph Steinbeck,et al.  MetaboLights—an open-access general-purpose repository for metabolomics studies and associated meta-data , 2012, Nucleic Acids Res..

[13]  G. Thallinger,et al.  Lipidomic analysis of lipid droplets from murine hepatocytes reveals distinct signatures for nutritional stress[S] , 2012, Journal of Lipid Research.

[14]  Natalie I. Tasman,et al.  A Cross-platform Toolkit for Mass Spectrometry and Proteomics , 2012, Nature Biotechnology.

[15]  Steffen Neumann,et al.  Nearline acquisition and processing of liquid chromatography-tandem mass spectrometry data , 2012, Metabolomics.

[16]  Ludovic C. Gillet,et al.  Targeted Data Extraction of the MS/MS Spectra Generated by Data-independent Acquisition: A New Concept for Consistent and Accurate Proteome Analysis* , 2012, Molecular & Cellular Proteomics.

[17]  Joshua D. Knowles,et al.  Procedures for large-scale metabolic profiling of serum and plasma using gas chromatography and liquid chromatography coupled to mass spectrometry , 2011, Nature Protocols.

[18]  M. Mann,et al.  More than 100,000 detectable peptide species elute in single shotgun proteomics runs but the majority is inaccessible to data-dependent LC-MS/MS. , 2011, Journal of proteome research.

[19]  Lennart Martens,et al.  mzML—a Community Standard for Mass Spectrometry Data* , 2010, Molecular & Cellular Proteomics.

[20]  N. Ahn,et al.  Quantifying the impact of chimera MS/MS spectra on peptide identification in large-scale proteomics studies. , 2010, Journal of proteome research.

[21]  Dan Golick,et al.  Database searching and accounting of multiplexed precursor and product ion spectra from the data independent analysis of simple and complex peptide mixtures , 2009, Proteomics.

[22]  Robert A. Grothe,et al.  Precursor-ion mass re-estimation improves peptide identification on hybrid instruments. , 2008, Journal of proteome research.

[23]  M. MacCoss,et al.  High-speed data reduction, feature detection, and MS/MS spectrum quality assessment of shotgun proteomics data sets using high-resolution mass spectrometry. , 2007, Analytical chemistry.

[24]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.