BatMass: a Java Software Platform for LC-MS Data Visualization in Proteomics and Metabolomics.

Mass spectrometry (MS) coupled to liquid chromatography (LC) is a commonly used technique in metabolomic and proteomic research. As the size and complexity of LC-MS-based experiments grow, it becomes increasingly more difficult to perform quality control of both raw data and processing results. In a practical setting, quality control steps for raw LC-MS data are often overlooked, and assessment of an experiment's success is based on some derived metrics such as "the number of identified compounds". The human brain interprets visual data much better than plain text, hence the saying "a picture is worth a thousand words". Here, we present the BatMass software package, which allows for performing quick quality control of raw LC-MS data through its fast visualization capabilities. It also serves as a testbed for developers of LC-MS data processing algorithms by providing a data access library for open mass spectrometry file formats and a means of visually mapping processing results back to the original data. We illustrate the utility of BatMass with several use cases of quality control and data exploration.

[1]  Brendan MacLean,et al.  Bioinformatics Applications Note Gene Expression Skyline: an Open Source Document Editor for Creating and Analyzing Targeted Proteomics Experiments , 2022 .

[2]  Steffen Neumann,et al.  IPO: a tool for automated optimization of XCMS parameters , 2015, BMC Bioinformatics.

[3]  David L. Tabb,et al.  Performance Metrics for Liquid Chromatography-Tandem Mass Spectrometry Systems in Proteomics Analyses* , 2009, Molecular & Cellular Proteomics.

[4]  John T. Prince,et al.  Metriculator: quality assessment for mass spectrometry-based proteomics , 2013, Bioinform..

[5]  Robert Burke,et al.  ProteoWizard: open source software for rapid proteomics tools development , 2008, Bioinform..

[6]  Antoine H P America,et al.  Comparative LC‐MS: A landscape of peaks and valleys , 2008, Proteomics.

[7]  C. Sander,et al.  Applications of targeted proteomics in systems biology and translational medicine , 2015, Proteomics.

[8]  Fredrik Levander,et al.  Data processing methods and quality control strategies for label-free LC-MS protein quantification. , 2014, Biochimica et biophysica acta.

[9]  M. Mann,et al.  A large synthetic peptide and phosphopeptide reference library for mass spectrometry–based proteomics , 2013, Nature Biotechnology.

[10]  Barbara Frewen,et al.  Hybrid data acquisition and processing strategies with increased throughput and selectivity: pSMART analysis for global qualitative and quantitative analysis. , 2014, Journal of proteome research.

[11]  Lars Malmström,et al.  Numerical Compression Schemes for Proteomics Mass Spectrometry Data* , 2014, Molecular & Cellular Proteomics.

[12]  M. Gorenstein,et al.  Quantitative proteomic analysis by accurate mass retention time pairs. , 2005, Analytical chemistry.

[13]  Alexandra Simader,et al.  QCScreen: a software tool for data quality control in LC-HRMS based metabolomics , 2015, BMC Bioinformatics.

[14]  Johannes Griss,et al.  jmzReader: A Java parser library to process and visualize multiple text and XML-based mass spectrometry data formats , 2012, Proteomics.

[15]  Yuichiro Fujita,et al.  Mass++: A Visualization and Analysis Tool for Mass Spectrometry. , 2014, Journal of proteome research.

[16]  David L Tabb,et al.  Employing ProteoWizard to Convert Raw Mass Spectrometry Data , 2014, Current protocols in bioinformatics.

[17]  Lorenzo J. Vega-Montoto,et al.  QuaMeter: multivendor performance metrics for LC-MS/MS proteomics instrumentation. , 2012, Analytical chemistry.

[18]  D. Goodlett,et al.  Multiplexed and data-independent tandem mass spectrometry for global proteome profiling. , 2014, Mass spectrometry reviews.

[19]  Rainer Breitling,et al.  msCompare: A Framework for Quantitative Analysis of Label-free LC-MS Data for Comparative Candidate Biomarker Studies* , 2012, Molecular & Cellular Proteomics.

[20]  Chih-Chiang Tsou,et al.  DIA-Umpire: comprehensive computational framework for data-independent acquisition proteomics , 2015, Nature Methods.

[21]  Stephen L. Johnson,et al.  After the feature presentation: technologies bridging untargeted metabolomics and biology. , 2014, Current opinion in biotechnology.

[22]  David L. Tabb,et al.  QC Metrics from CPTAC Raw LC-MS/MS Data Interpreted through Multivariate Statistics , 2014, Analytical chemistry.

[23]  Ruedi Aebersold,et al.  Using data‐independent, high‐resolution mass spectrometry in protein biomarker research: Perspectives and clinical applications , 2015, Proteomics. Clinical applications.

[24]  Martin Eisenacher,et al.  PRIDE Inspector Toolsuite: Moving Toward a Universal Visualization Tool for Proteomics Data Standard Formats and Quality Assessment of ProteomeXchange Datasets , 2015, Molecular & Cellular Proteomics.

[25]  Lennart Martens,et al.  qcML: An Exchange Format for Quality Control Metrics from Mass Spectrometry Experiments , 2014, Molecular & Cellular Proteomics.

[26]  Oliver Kohlbacher,et al.  TOPPView: an open-source viewer for mass spectrometry data. , 2009, Journal of proteome research.

[27]  Lennart Martens,et al.  compomics-utilities: an open-source Java library for computational proteomics , 2011, BMC Bioinformatics.

[28]  Matej Oresic,et al.  MZmine 2: Modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data , 2010, BMC Bioinformatics.

[29]  Ludovic C. Gillet,et al.  Targeted Data Extraction of the MS/MS Spectra Generated by Data-independent Acquisition: A New Concept for Consistent and Accurate Proteome Analysis* , 2012, Molecular & Cellular Proteomics.

[30]  Gary Siuzdak,et al.  Bioinformatics: The Next Frontier of Metabolomics , 2014, Analytical chemistry.

[31]  Giovanni Scardoni,et al.  Metscape 2 bioinformatics tool for the analysis and visualization of metabolomics and gene expression data , 2012, Bioinform..

[32]  Tianwei Yu,et al.  xMSanalyzer: automated pipeline for improved feature detection and downstream analysis of large-scale, non-targeted metabolomics data , 2013, BMC Bioinformatics.

[33]  David L Tabb,et al.  Quality assessment for clinical proteomics. , 2013, Clinical biochemistry.

[34]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[35]  Masanori Arita,et al.  MS-DIAL: Data Independent MS/MS Deconvolution for Comprehensive Metabolome Analysis , 2015, Nature Methods.

[36]  Michael S Bereman,et al.  Tools for monitoring system suitability in LC MS/MS centric proteomic experiments , 2015, Proteomics.