Ion trace detection algorithm to extract pure ion chromatograms to improve untargeted peak detection quality for liquid chromatography/time-of-flight mass spectrometry-based metabolomics data.

Able to detect known and unknown metabolites, untargeted metabolomics has shown great potential in identifying novel biomarkers. However, elucidating all possible liquid chromatography/time-of-flight mass spectrometry (LC/TOF-MS) ion signals in a complex biological sample remains challenging since many ions are not the products of metabolites. Methods of reducing ions not related to metabolites or simply directly detecting metabolite related (pure) ions are important. In this work, we describe PITracer, a novel algorithm that accurately detects the pure ions of a LC/TOF-MS profile to extract pure ion chromatograms and detect chromatographic peaks. PITracer estimates the relative mass difference tolerance of ions and calibrates the mass over charge (m/z) values for peak detection algorithms with an additional option to further mass correction with respect to a user-specified metabolite. PITracer was evaluated using two data sets containing 373 human metabolite standards, including 5 saturated standards considered to be split peaks resultant from huge m/z fluctuation, and 12 urine samples spiked with 50 forensic drugs of varying concentrations. Analysis of these data sets show that PITracer correctly outperformed existing state-of-art algorithm and extracted the pure ion chromatograms of the 5 saturated standards without generating split peaks and detected the forensic drugs with high recall, precision, and F-score and small mass error.

[1]  P. Benfey,et al.  Integrative systems biology: an attempt to describe a simple weed. , 2012, Current opinion in plant biology.

[2]  T. Dillon,et al.  Improving mass accuracy of high performance liquid chromatography/electrospray ionization time-of-flight mass spectrometry of intact antibodies , 2006, Journal of the American Society for Mass Spectrometry.

[3]  W. Windig,et al.  Chemometric analysis of complex hyphenated data. Improvements of the component detection algorithm. , 2007, Journal of chromatography. A.

[4]  Tianwei Yu,et al.  apLCMS - adaptive processing of high-resolution LC/MS data , 2009, Bioinform..

[5]  T. Rejtar,et al.  A universal denoising and peak picking algorithm for LC-MS based on matched filtration in the chromatographic time domain. , 2003, Analytical chemistry.

[6]  Vladimir Shulaev,et al.  Metabolomics technology and bioinformatics , 2006, Briefings Bioinform..

[7]  Roeland C. H. J. van Ham,et al.  Accurate mass error correction in liquid chromatography time-of-flight mass spectrometry based metabolomics , 2008, Metabolomics.

[8]  Rima Kaddurah-Daouk,et al.  Metabolomics tools for identifying biomarkers for neuropsychiatric diseases , 2009, Neurobiology of Disease.

[9]  C. Kuo,et al.  Screening and confirmation of 62 drugs of abuse and metabolites in urine by ultra-high-performance liquid chromatography-quadrupole time-of-flight mass spectrometry. , 2013, Journal of analytical toxicology.

[10]  Matej Oresic,et al.  MZmine 2: Modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data , 2010, BMC Bioinformatics.

[11]  Chris F. Taylor,et al.  A common open representation of mass spectrometry data and its application to proteomics research , 2004, Nature Biotechnology.

[12]  David S. Wishart,et al.  MetaboAnalyst: a web server for metabolomic data analysis and interpretation , 2009, Nucleic Acids Res..

[13]  P. Bork,et al.  Literature mining for the biologist: from information retrieval to biological discovery , 2006, Nature Reviews Genetics.

[14]  M. Orešič,et al.  Data processing for mass spectrometry-based metabolomics. , 2007, Journal of chromatography. A.

[15]  Ralf Tautenhahn,et al.  Meta-analysis of untargeted metabolomic data from multiple profiling experiments , 2012, Nature Protocols.

[16]  Yufei Huang,et al.  Review of Peak Detection Algorithms in Liquid-Chromatography-Mass Spectrometry , 2009, Current genomics.

[17]  Anthony K. L. Leung,et al.  Nucleolar proteome dynamics , 2005, Nature.

[18]  A. Saghatelian,et al.  Assignment of endogenous substrates to enzymes by global metabolite profiling. , 2004, Biochemistry.

[19]  H. Ressom,et al.  LC-MS-based metabolomics. , 2012, Molecular bioSystems.

[20]  Andreas Zell,et al.  Automated Label-free Quantification of Metabolites from Liquid Chromatography–Mass Spectrometry Data* , 2013, Molecular & Cellular Proteomics.

[21]  E. Deutsch mzML: A single, unifying data format for mass spectrometer output , 2008, Proteomics.

[22]  F. Moriya Urine levels of drugs for which Triage DOA screening was positive. , 2009, Legal medicine.

[23]  Wolfram Weckwerth,et al.  Metabolomics: an integral technique in systems biology. , 2010, Bioanalysis.

[24]  F Peter Guengerich,et al.  Elucidation of functions of human cytochrome P450 enzymes: identification of endogenous substrates in tissue extracts using metabolomic and isotopic labeling approaches. , 2009, Analytical chemistry.

[25]  Dong Wook Kang,et al.  Metabolomics reveals a novel vitamin E metabolite and attenuated vitamin E metabolism upon PXR activation Published, JLR Papers in Press, January 20, 2009. , 2009, Journal of Lipid Research.

[26]  Tao Zhang,et al.  Identification of potential biomarkers for ovarian cancer by urinary metabolomic profiling. , 2013, Journal of proteome research.

[27]  R. Whittal,et al.  Interferences and contaminants encountered in modern mass spectrometry. , 2008, Analytica chimica acta.

[28]  L. Samson,et al.  New immunoaffinity-LC-MS/MS methodology reveals that Aag null mice are deficient in their ability to clear 1,N6-etheno-deoxyadenosine DNA lesions from lung and liver in vivo. , 2004, DNA repair.

[29]  Laxman Yetukuri,et al.  Algorithms and tools for the preprocessing of LC–MS metabolomics data , 2011 .

[30]  W. Windig,et al.  A Noise and Background Reduction Method for Component Detection in Liquid Chromatography/Mass Spectrometry , 1996 .

[31]  J. Lindon,et al.  Metabonomics: a platform for studying drug toxicity and gene function , 2002, Nature Reviews Drug Discovery.

[32]  V. Nesatyy,et al.  Proteomics for the analysis of environmental stress responses in organisms. , 2007, Environmental science & technology.

[33]  T. Hankemeier,et al.  Microbial metabolomics: replacing trial-and-error by the unbiased selection and ranking of targets , 2005, Journal of Industrial Microbiology and Biotechnology.

[34]  Choon Nam Ong,et al.  MetaboNexus: an interactive platform for integrated metabolomics analysis , 2014, Metabolomics.

[35]  Steffen Neumann,et al.  Highly sensitive feature detection for high resolution LC/MS , 2008, BMC Bioinformatics.

[36]  Dirk Inzé,et al.  Plant cell factories in the post-genomic era: new ways to produce designer secondary metabolites. , 2004, Trends in plant science.

[37]  Willem Windig,et al.  The use of the Durbin-Watson criterion for noise and background reduction of complex liquid chromatography/mass spectrometry data and a new algorithm to determine sample differences , 2005 .

[38]  Pei Wang,et al.  Bioinformatics Original Paper a Suite of Algorithms for the Comprehensive Analysis of Complex Protein Mixtures Using High-resolution Lc-ms , 2022 .

[39]  Pan Du,et al.  Bioinformatics Original Paper Improved Peak Detection in Mass Spectrum by Incorporating Continuous Wavelet Transform-based Pattern Matching , 2022 .

[40]  Bernhard Kluger,et al.  MetExtract: a new software tool for the automated comprehensive extraction of metabolite-derived LC/MS signals in metabolomics research , 2012, Bioinform..

[41]  I. Wilson,et al.  High resolution "ultra performance" liquid chromatography coupled to oa-TOF mass spectrometry as a tool for differential metabolic pathway profiling in functional genomic studies. , 2005, Journal of proteome research.

[42]  Ching-Hua Kuo,et al.  True ion pick (TIPick): a denoising and peak picking algorithm to extract ion signals from liquid chromatography/mass spectrometry data. , 2013, Journal of mass spectrometry : JMS.

[43]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[44]  Maciek Sasinowski,et al.  What is mzXML good for? , 2005, Expert review of proteomics.

[45]  Uwe Schmitt,et al.  eMZed: an open source framework in Python for rapid and interactive development of LC/MS data analysis workflows , 2013, Bioinform..

[46]  M. Okada,et al.  Metabolomic analysis of dynamic response and drug resistance of gastric cancer cells to 5-fluorouracil , 2012, Oncology reports.

[47]  Bruce R. Kowalski,et al.  Windowed mass selection method: a new data processing algorithm for liquid chromatography–mass spectrometry data , 1999 .

[48]  Yiyu Cheng,et al.  An entropy-based method for noise reduction of liquid chromatography-mass spectrometry data. , 2008, Analytica chimica acta.

[49]  Mahlet G Tadesse,et al.  LC-MS based serum metabolomics for identification of hepatocellular carcinoma biomarkers in Egyptian cohort. , 2012, Journal of proteome research.

[50]  S. Gygi,et al.  Quantitative analysis of complex protein mixtures using isotope-coded affinity tags , 1999, Nature Biotechnology.

[51]  J Bruce German,et al.  Metabolomics and biochemical profiling in drug discovery and development. , 2002, Current opinion in molecular therapeutics.

[52]  Dean P. Jones,et al.  Hybrid feature detection and information accumulation using high-resolution LC-MS metabolomics data. , 2013, Journal of proteome research.

[53]  C. Barbas,et al.  Metabolomics in cancer biomarker discovery: current trends and future perspectives. , 2014, Journal of pharmaceutical and biomedical analysis.

[54]  Brian C. Thomas,et al.  Metabolites Associated with Adaptation of Microorganisms to an Acidophilic, Metal-Rich Environment Identified by Stable-Isotope-Enabled Metabolomics , 2013, mBio.

[55]  Julian L Griffin,et al.  Understanding mouse models of disease through metabolomics. , 2006, Current opinion in chemical biology.

[56]  Diego Orzaez,et al.  Evaluation of unintended effects in the composition of tomatoes expressing a human immunoglobulin A against rotavirus. , 2014, Journal of agricultural and food chemistry.