From raw ion mobility measurements to disease classification: a comparison of analysis processes

Ion mobility spectrometry (IMS) is a technology for the detection of volatile compounds in the air of exhaled breath that is increasingly used in medical applications. One major goal is to classify patients into disease groups, for example diseased versus healthy, from simple breath samples. Raw IMS measurements are data matrices in which peak regions representing the compounds have to be identified and quantified. A typical analysis process consists of pre-processing and peak detection in single experiments, peak clustering to obtain consensus peaks across several experiments, and classification of samples based on the resulting multivariate peak intensities. Recently several automated algorithms for peak detection and peak clustering have been introduced, in order to overcome the current need for human-based analysis that is slow, subjective and sometimes not reproducible. We present an unbiased comparison of a multitude of combinations of peak processing and multivariate classification algorithms on a disease dataset. The specific combination of the algorithms for the different analysis steps determines the classification accuracy, with the encouraging result that certain fully-automated combinations perform even better than current manual approaches.

[1]  Jörg Ingo Baumbach,et al.  Preprocessing of ion mobility spectra by lognormal detailing and wavelet transform , 2008 .

[2]  Sven Rahmann,et al.  An online peak extraction algorithm for ion mobility spectrometry data , 2015, Algorithms for Molecular Biology.

[3]  Jörg Ingo Baumbach,et al.  Peak finding and referencing in MCC/IMS-data , 2008 .

[4]  Jörg Ingo Baumbach,et al.  Visualisation of MCC/IMS-data , 2008 .

[5]  B. Ripley,et al.  Recursive Partitioning and Regression Trees , 2015 .

[6]  Sven Rahmann,et al.  A modular computational framework for automated peak extraction from ion mobility spectra , 2014, BMC Bioinformatics.

[7]  Sven Rahmann,et al.  Peak modeling for Ion mobility spectrometry measurements , 2012, 2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO).

[8]  Betti Maria,et al.  Analytical and Bioanalytical Chemistry - Plasma Spectrochemistry , 2007 .

[9]  Alexey Egorov,et al.  Ressourcenbeschränkte Analyse von Ionenmobilitätsspektren mit dem Raspberry Pi , 2014 .

[10]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[11]  A. Savitzky,et al.  Smoothing and Differentiation of Data by Simplified Least Squares Procedures. , 1964 .

[12]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[13]  S. Bader,et al.  PROCESSING ION MOBILITY SPECTROMETRY DATA TO CHARACTERIZE GROUP DIFFERENCES IN A MULTIPLE CLASS COMPARISON , 2005 .

[14]  Sven Rahmann,et al.  Exact and heuristic algorithms for weighted cluster editing. , 2007, Computational systems bioinformatics. Computational Systems Bioinformatics Conference.

[15]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[16]  Jan Baumbach,et al.  Peak Detection Method Evaluation for Ion Mobility Spectrometry by Using Machine Learning Approaches , 2013, Metabolites.

[17]  S. Kreuer,et al.  Ion mobility spectrometry in breath research , 2014, Journal of breath research.

[18]  J. Baumbach,et al.  Peak assignment in multi-capillary column–ion mobility spectrometry using comparative studies with gas chromatography–mass spectrometry for VOC analysis , 2010, Analytical and bioanalytical chemistry.

[19]  I. Gràcia,et al.  Review on ion mobility spectrometry. Part 1: current instrumentation. , 2015, The Analyst.

[20]  Jörg Ingo Baumbach,et al.  Detection of infectious agents in the airways by ion mobility spectrometry of exhaled breath , 2011 .

[21]  J I Baumbach,et al.  Review on ion mobility spectrometry. Part 2: hyphenated methods and effects of experimental parameters. , 2015, The Analyst.

[22]  Sebastian Böcker,et al.  Exact Algorithms for Cluster Editing: Evaluation and Experiments , 2008, Algorithmica.