Mining for Peaks in LC-HRMS Datasets Using Finnee – A Case Study with Exhaled Breath Condensates from Healthy, Asthmatic, and COPD Patients

Separation techniques hyphenated to high-resolution mass spectrometry are essential in untargeted metabolomic analyses. Because the resulting datasets are large and complex, analysts rely on computer-assisted tools to mine for features that may represent a chromatographic signal. This step remains problematic, however, and often yields a high number of false positives. This work reports a novel approach in which each step is carefully controlled to decrease the likelihood of errors. Datasets are first corrected for baseline drift and background noise before the MS scans are converted from profile to centroid. A new alignment strategy that includes purity control is introduced, and features are quantified from the original data, with scans recorded as profile, rather than from the extracted features. All the algorithms used in this work are part of the freely available Finnee Matlab toolbox. The approach was validated using metabolites in exhaled breath condensates to differentiate individuals diagnosed with asthma from patients with chronic obstructive pulmonary disease. With this new pipeline, Finnee found twice as many markers as XCMS Online and nearly 50% more than MS-DIAL, two of the most popular freeware tools for untargeted metabolomics analysis.
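
The order of the first two steps (background correction, then profile-to-centroid conversion) can be illustrated with a minimal Python sketch on a single scan. This is not Finnee's code or API: the function names `subtract_baseline` and `centroid` are hypothetical, the median-based baseline is a deliberately crude stand-in for a real baseline model, and treating each contiguous run of above-threshold points as one peak is a simplifying assumption.

```python
from statistics import median


def subtract_baseline(intensities):
    """Crude background correction (assumption): subtract the scan median
    and clip negative values to zero."""
    base = median(intensities)
    return [max(0.0, y - base) for y in intensities]


def _collapse(run_mz, run_int):
    """Reduce one run of profile points to (intensity-weighted m/z, summed intensity)."""
    total = sum(run_int)
    weighted_mz = sum(m * y for m, y in zip(run_mz, run_int)) / total
    return (weighted_mz, total)


def centroid(mz, intensities, threshold=0.0):
    """Convert a profile scan to centroids.

    Each contiguous run of points with intensity above `threshold` is
    treated as one peak (simplification: overlapping peaks are merged).
    """
    peaks = []
    run_mz, run_int = [], []
    for m, y in zip(mz, intensities):
        if y > threshold:
            run_mz.append(m)
            run_int.append(y)
        elif run_int:
            peaks.append(_collapse(run_mz, run_int))
            run_mz, run_int = [], []
    if run_int:  # close a run that reaches the end of the scan
        peaks.append(_collapse(run_mz, run_int))
    return peaks


# Toy profile scan: a constant background of 1 with two peaks on top.
mz = [100.00, 100.01, 100.02, 100.03, 100.04, 100.05, 100.06, 100.07, 100.08]
raw = [1.0, 1.0, 5.0, 9.0, 5.0, 1.0, 1.0, 3.0, 1.0]

corrected = subtract_baseline(raw)
peaks = centroid(mz, corrected)
```

Correcting the background before centroiding matters in this toy case: without it, every point lies above a zero threshold and the whole scan collapses into a single spurious centroid.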
