Smith-Waterman peak alignment for comprehensive two-dimensional gas chromatography-mass spectrometry

Background: Comprehensive two-dimensional gas chromatography coupled with mass spectrometry (GC × GC-MS) is a powerful technique that has gained increasing attention over the last two decades. GC × GC-MS provides greatly increased separation capacity, chemical selectivity and sensitivity for complex sample analysis, and yields more accurate information about compound retention times and mass spectra. Despite these advantages, the retention times of the resolved peaks on the two-dimensional gas chromatographic columns are always shifted by experimental variation, which complicates data processing for metabolomics analysis. The retention time variation must therefore be corrected before multiple metabolic profiles obtained under different conditions can be compared.

Results: We developed novel peak alignment algorithms for both homogeneous (acquired under identical experimental conditions) and heterogeneous (acquired under different experimental conditions) GC × GC-MS data, using modified Smith-Waterman local alignment algorithms together with mass spectral similarity. Unlike literature-reported algorithms, the proposed algorithms require neither the detection of landmark peaks nor a retention time transformation. Furthermore, an automated peak alignment software package was established by implementing a likelihood function for optimal peak alignment.

Conclusions: The proposed Smith-Waterman local alignment-based algorithms are capable of aligning both homogeneous and heterogeneous data from multiple GC × GC-MS experiments without transforming retention times or selecting landmark peaks. An optimal version of the SW-based algorithms was also established, based on the associated likelihood function, for automatic peak alignment. In analyses of experimental data from a mixture of compound standards and from a metabolite extract of mouse plasma with spiked-in compound standards, the proposed alignment algorithms outperformed the literature-reported alignment method.
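The general idea of combining Smith-Waterman local alignment with mass spectral similarity can be illustrated with a minimal sketch. This is not the authors' modified algorithm: it is the textbook Smith-Waterman recurrence applied to two ordered peak lists, with cosine similarity between spectra standing in for the substitution score. The function names, the `threshold` and `gap` parameters and their default values, and the representation of each peak as an intensity vector on a shared m/z grid are all assumptions made for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity of two intensity vectors on a shared m/z grid."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def sw_align_peaks(ref, tgt, threshold=0.8, gap=0.3):
    """Textbook Smith-Waterman over two retention-ordered peak lists.

    ref, tgt  : lists of spectra (hypothetical representation).
    threshold : assumed offset, so dissimilar spectra score negatively.
    gap       : assumed linear penalty for leaving a peak unmatched.
    Returns (best_score, matched (ref_index, tgt_index) pairs).
    """
    n, m = len(ref), len(tgt)
    # Pairwise scores: similarity above the threshold is rewarded.
    S = [[cosine(r, t) - threshold for t in tgt] for r in ref]
    H = [[0.0] * (m + 1) for _ in range(n + 1)]
    best, best_pos = 0.0, (0, 0)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            H[i][j] = max(
                0.0,                              # restart (local alignment)
                H[i - 1][j - 1] + S[i - 1][j - 1],  # pair the two peaks
                H[i - 1][j] - gap,                # skip a reference peak
                H[i][j - 1] - gap,                # skip a target peak
            )
            if H[i][j] > best:
                best, best_pos = H[i][j], (i, j)

    # Trace back from the best cell until the local score drops to zero.
    pairs = []
    i, j = best_pos
    while i > 0 and j > 0 and H[i][j] > 0:
        s = S[i - 1][j - 1]
        if math.isclose(H[i][j], H[i - 1][j - 1] + s):
            if s > 0:  # keep only pairs whose spectra actually agree
                pairs.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif math.isclose(H[i][j], H[i - 1][j] - gap):
            i -= 1
        else:
            j -= 1
    pairs.reverse()
    return best, pairs
```

For a toy reference list of three orthogonal spectra and a target list holding only the second and third, `sw_align_peaks` returns the pairs `[(1, 0), (2, 1)]`: the unmatched first reference peak is simply left out of the local alignment, with no landmark peak or retention time transformation involved.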
