ChromA: signal-based retention time alignment for chromatography–mass spectrometry data

Summary: We describe ChromA, a web-based alignment tool for chromatography–mass spectrometry data from the metabolomics and proteomics domains. Users can supply their data in open and standardized file formats for retention time alignment using dynamic time warping with different configurable local distance and similarity functions. Additionally, user-defined anchors can be used to constrain and speedup the alignment. A neighborhood around each anchor can be added to increase the flexibility of the constrained alignment. ChromA offers different visualizations of the alignment for easier qualitative interpretation and comparison of the data. For the multiple alignment of more than two data files, the center-star approximation is applied to select a reference among input files to align to. Availability: ChromA is available at http://bibiserv.techfak.uni-bielefeld.de/chroma. Executables and source code under the L-GPL v3 license are provided for download at the same location. Contact: stoye@techfak.uni-bielefeld.de Supplementary information: Supplementary data are available at Bioinformatics online.

[1]  Russ Rew,et al.  NetCDF: an interface for scientific data access , 1990, IEEE Computer Graphics and Applications.

[2]  Ute Baumann,et al.  A framework for gene expression analysis , 2007, Bioinform..

[3]  Joseph B. Kruskal,et al.  Time Warps, String Edits, and Macromolecules , 1999 .

[4]  Joseph B. Kruskall,et al.  The Symmetric Time-Warping Problem : From Continuous to Discrete , 1983 .

[5]  J. Keurentjes,et al.  Untargeted large-scale plant metabolomics using liquid chromatography coupled to mass spectrometry , 2007, Nature Protocols.

[6]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[7]  Knut Reinert,et al.  OpenMS – An open-source software framework for mass spectrometry , 2008, BMC Bioinformatics.

[8]  Mark D. Robinson,et al.  A dynamic programming approach for the alignment of signal peaks in multiple gas chromatography-mass spectrometry experiments , 2007, BMC Bioinformatics.

[9]  Knut Reinert,et al.  TOPP - the OpenMS proteomics pipeline , 2007, Bioinform..

[10]  Johan Trygg,et al.  High-throughput data analysis for detecting and identifying differences between samples in GC/MS-based metabolomic analyses. , 2005, Analytical chemistry.

[11]  Chris F. Taylor,et al.  A common open representation of mass spectrometry data and its application to proteomics research , 2004, Nature Biotechnology.

[12]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[13]  Todd Miller,et al.  ASTM Protocols for Analytical Data Interchange , 2000 .

[14]  Jan Hummel,et al.  Retention index thresholds for compound matching in GC-MS metabolite profiling. , 2008, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[15]  Johan Lindberg,et al.  Predictive metabolite profiling applying hierarchical multivariate curve resolution to GC-MS data--a potential tool for multi-parametric diagnosis. , 2006, Journal of proteome research.

[16]  E. Marcotte,et al.  Chromatographic alignment of ESI-LC-MS proteomics data sets by ordered bijective interpolated warping. , 2006, Analytical chemistry.

[17]  Jens Stoye,et al.  MeltDB: a software platform for the analysis and integration of metabolomics experiment data , 2008, Bioinform..

[18]  R. Abagyan,et al.  XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. , 2006, Analytical chemistry.

[19]  T. F. Moran,et al.  Characterization of normal human cells by pyrolysis gas chromatography mass spectrometry. , 1979, Biomedical mass spectrometry.

[20]  Mark P. Styczynski,et al.  Systematic identification of conserved metabolites in GC/MS data for metabolomics and biomarker discovery. , 2007, Analytical chemistry.