LC-MS alignment in theory and practice: a comprehensive algorithmic review

Liquid chromatography-mass spectrometry is widely used for comparative replicate sample analysis in proteomics, lipidomics and metabolomics. Before statistical comparison, registration must be established to match corresponding analytes from run to run. Alignment, the most popular correspondence approach, consists of constructing a function that warps the content of runs to most closely match a given reference sample. To date, dozens of correspondence algorithms have been proposed, creating a daunting challenge for practitioners in algorithm selection. Yet, existing reviews have highlighted only a few approaches. In this review, we describe 50 correspondence algorithms to facilitate practical algorithm selection. We elucidate the motivation for correspondence and analyze the limitations of current approaches, which include prohibitive runtimes, numerous user parameters, model limitations and the need for reference samples. We suggest and describe a paradigm shift for overcoming current correspondence limitations by building on known liquid chromatography-mass spectrometry behavior.

[1]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[2]  Zhongqi Zhang,et al.  Retention Time Alignment of LC/MS Data by a Divide-and-Conquer Algorithm , 2012, Journal of The American Society for Mass Spectrometry.

[3]  Navdeep Jaitly,et al.  MultiAlign: a multiple LC-MS analysis tool for targeted omics analysis , 2013, BMC Bioinformatics.

[4]  Radford M. Neal,et al.  Multiple Alignment of Continuous Time Series , 2004, NIPS.

[5]  Joachim M. Buhmann,et al.  Semi-supervised LC/MS alignment for differential proteomics , 2006, ISMB.

[6]  H. Ressom,et al.  LC-MS-based metabolomics. , 2012, Molecular bioSystems.

[7]  Age K. Smilde,et al.  Optimized time alignment algorithm for LC-MS data: correlation optimized warping using component detection algorithm-selected mass chromatograms. , 2008, Analytical chemistry.

[8]  Knut Reinert,et al.  A geometric approach for the alignment of liquid chromatography - mass spectrometry data , 2007, ISMB/ECCB.

[9]  Fernando M. Maroto,et al.  ChromAlign: A two-step algorithmic procedure for time alignment of three-dimensional LC-MS chromatographic surfaces. , 2006, Analytical chemistry.

[10]  L Pattini,et al.  MassUntangler: a novel alignment tool for label-free liquid chromatography-mass spectrometry proteomic data. , 2011, Journal of chromatography. A.

[11]  Benno Schwikowski,et al.  Signal Maps for Mass Spectrometry-based Comparative Proteomics* , 2006, Molecular & Cellular Proteomics.

[12]  Magnus Palmblad,et al.  Chromatographic alignment of LC-MS and LC-MS/MS datasets by genetic algorithm feature extraction , 2007, Journal of the American Society for Mass Spectrometry.

[13]  Chih-Chiang Tsou,et al.  IDEAL-Q, an Automated Tool for Label-free Quantitation Analysis Using an Efficient Peptide Alignment Approach and Spectral Data Validation* , 2009, Molecular & Cellular Proteomics.

[14]  Hua Lin,et al.  Quantifying reproducibility for differential proteomics: noise analysis for protein liquid chromatography-mass spectrometry of human serum , 2004, Bioinform..

[15]  Rainer Bischoff,et al.  Current Technological Challenges in Biomarker Discovery and Validation , 2010, European journal of mass spectrometry.

[16]  D. Chan,et al.  Evaluation of serum protein profiling by surface-enhanced laser desorption/ionization time-of-flight mass spectrometry for the detection of prostate cancer: I. Assessment of platform reproducibility. , 2005, Clinical chemistry.

[17]  David L. Woodruff,et al.  Beam search for peak alignment of NMR signals , 2004 .

[18]  Yue Joseph Wang,et al.  Multi-profile Bayesian alignment model for LC-MS data analysis with integration of internal standards , 2013, Bioinform..

[19]  Xiang Zhang,et al.  Data pre-processing in liquid chromatography-mass spectrometry-based proteomics , 2005, Bioinform..

[20]  Sule Gündüz Ögüdücü,et al.  SAGA: A novel signal alignment method based on genetic algorithm , 2013, Inf. Sci..

[21]  Ralf J. O. Torgrip,et al.  Peak alignment using reduced set mapping , 2003 .

[22]  Tomasz Burzykowski,et al.  A strategy for the prior processing of high-resolution mass spectral data obtained from high-dimensional combined fractional diagonal chromatography. , 2009, Journal of mass spectrometry : JMS.

[23]  K. Laukens,et al.  Getting Your Peaks in Line: A Review of Alignment Methods for NMR Spectral Data , 2013, Metabolites.

[24]  Frank Suits,et al.  Two-dimensional method for time aligning liquid chromatography-mass spectrometry data. , 2008, Analytical chemistry.

[25]  Jian Yang,et al.  Metabolomics spectral formatting, alignment and conversion tools (MSFACTs) , 2003, Bioinform..

[26]  Paweł Wiczling,et al.  New supervised alignment method as a preprocessing tool for chromatographic data in metabolomic studies. , 2012, Journal of chromatography. A.

[27]  Malcolm J. McConville,et al.  Progressive peak clustering in GC-MS Metabolomic experiments applied to Leishmania parasites , 2006, Bioinform..

[28]  Steffen Neumann,et al.  Critical assessment of alignment procedures for LC-MS proteomics and metabolomics measurements , 2008, BMC Bioinformatics.

[29]  P. Eilers Parametric time warping. , 2004, Analytical chemistry.

[30]  Yongyi Mao,et al.  Informatics Platform for Global Proteomic Profiling and Biomarker Discovery Using Liquid Chromatography-Tandem Mass Spectrometry*S , 2004, Molecular & Cellular Proteomics.

[31]  Dan Ventura,et al.  A general model for continuous noninvasive pulmonary artery pressure estimation , 2013, Comput. Biol. Medicine.

[32]  P. A. Taylor,et al.  Synchronization of batch trajectories using dynamic time warping , 1998 .

[33]  B. W. Wright,et al.  High-speed peak matching algorithm for retention time alignment of gas chromatographic data for chemometric analysis. , 2003, Journal of chromatography. A.

[34]  Ruedi Aebersold,et al.  A Software Suite for the Generation and Comparison of Peptide Arrays from Sets of Data Collected by Liquid Chromatography-Mass Spectrometry*S , 2005, Molecular & Cellular Proteomics.

[35]  Erik Alm,et al.  The correspondence problem for metabonomics datasets , 2009, Analytical and bioanalytical chemistry.

[36]  Jacob D. Jaffe,et al.  PEPPeR, a Platform for Experimental Proteomic Pattern Recognition*S , 2006, Molecular & Cellular Proteomics.

[37]  K. Markides,et al.  Chromatographic alignment by warping and dynamic programming as a pre-processing tool for PARAFAC modelling of liquid chromatography-mass spectrometry data. , 2002, Journal of chromatography. A.

[38]  Pei Wang,et al.  Bioinformatics Original Paper a Suite of Algorithms for the Comprehensive Analysis of Complex Protein Mixtures Using High-resolution Lc-ms , 2022 .

[39]  Matej Oresic,et al.  MZmine 2: Modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data , 2010, BMC Bioinformatics.

[40]  Dan Ventura,et al.  Novel algorithms and the benefits of comparative validation , 2013, Bioinform..

[41]  E. Marcotte,et al.  Chromatographic alignment of ESI-LC-MS proteomics data sets by ordered bijective interpolated warping. , 2006, Analytical chemistry.

[42]  Rob Smith,et al.  Mspire-Simulator: LC-MS shotgun proteomic simulator for creating realistic gold standard data. , 2013, Journal of proteome research.

[43]  Gerard Cagney,et al.  Computational methods for the comparative quantification of proteins in label-free LCn-MS experiments , 2007, Briefings Bioinform..

[44]  Arjen Lommen,et al.  MetAlign: interface-driven, versatile metabolomics tool for hyphenated full-scan mass spectrometry data preprocessing. , 2009, Analytical chemistry.

[45]  Joachim M. Buhmann,et al.  Time-series alignment by non-negative multiple generalized canonical correlation analysis , 2007, BMC Bioinformatics.

[46]  Yuzhu Hu,et al.  A new algorithm of piecewise automated beam search for peak alignment of chromatographic fingerprints. , 2007, Journal of chromatography. A.

[47]  Lihua Zhang,et al.  A new method for alignment of LC-MALDI-TOF data , 2010, BIBM.

[48]  Radford M. Neal,et al.  Difference detection in LC-MS data for protein biomarker discovery , 2007, Bioinform..

[49]  Hua Tang,et al.  A statistical method for chromatographic alignment of LC-MS data. , 2007, Biostatistics.

[50]  T. Shaler,et al.  Quantification of proteins and metabolites by mass spectrometry without isotopic labeling or spiked standards. , 2003, Analytical chemistry.

[51]  Jimmy Eng,et al.  A platform for accurate mass and time analyses of mass spectrometry data. , 2007, Journal of proteome research.

[52]  Ullrich Köthe,et al.  SIMA: Simultaneous Multiple Alignment of LC/MS Peak Lists , 2011, Bioinform..

[53]  J. Carstensen,et al.  Aligning of single and multiple wavelength chromatographic profiles for chemometric data analysis using correlation optimised warping , 1998 .

[54]  Michael J. MacCoss,et al.  De Novo Correction of Mass Measurement Error in Low Resolution Tandem MS Spectra for Shotgun Proteomics , 2012, Journal of The American Society for Mass Spectrometry.

[55]  Richard E Higgs,et al.  Comprehensive label-free method for the relative quantification of proteins from biological samples. , 2005, Journal of proteome research.

[56]  Lukas N. Mueller,et al.  SuperHirn – a novel tool for high resolution LC‐MS‐based peptide/protein profiling , 2007, Proteomics.

[57]  Benno Schwikowski,et al.  Alignment of LC‐MS images, with applications to biomarker discovery and protein identification , 2008, Proteomics.

[58]  Getachew K. Befekadu,et al.  Probabilistic Mixture Regression Models for Alignment of LC-MS Data , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[59]  Jeffrey S. Morris,et al.  Reproducibility of SELDI-TOF protein patterns in serum: comparing datasets from different experiments , 2004, Bioinform..

[60]  Shelley Herbrich Retention time alignment algorithms for LC / MS data , 2009 .

[61]  Rasmus Bro,et al.  Automated alignment of chromatographic data , 2006 .

[62]  L. S. Shapley,et al.  College Admissions and the Stability of Marriage , 2013, Am. Math. Mon..

[63]  Matej Oresic,et al.  MZmine: toolbox for processing and visualization of mass spectrometry based molecular profile data , 2006, Bioinform..

[64]  M Daszykowski,et al.  A comparison of three algorithms for chromatograms alignment. , 2006, Journal of chromatography. A.

[65]  Hanno Steen,et al.  amsrpm: Robust Point Matching for Retention Time Aligment of LC/MS Data with R , 2007 .

[66]  Richard D. Smith,et al.  Robust algorithm for alignment of liquid chromatography-mass spectrometry analyses in an accurate mass and time tag data analysis pipeline. , 2006, Analytical chemistry.

[67]  Benno Schwikowski,et al.  Assessing Bias in Experiment Design for Large Scale Mass Spectrometry-based Quantitative Proteomics*S , 2007, Molecular & Cellular Proteomics.

[68]  B. Walczak,et al.  Fuzzy warping of chromatograms , 2005 .

[69]  Paul H. C. Eilers,et al.  Improved parametric time warping for proteomics , 2010 .