Multiclass methods in the analysis of metabolomic datasets: The example of raspberry cultivar volatile compounds detected by GC–MS and PTR-MS

Multiclass sample classification and marker selection are cutting-edge problems in metabolomics. In the present study we address the classification of 14 raspberry cultivars having different levels of gray mold (Botrytis cinerea) susceptibility. We characterized raspberry cultivars by two headspace analysis methods, namely solid-phase microextraction/gas chromatography–mass spectrometry (SPME/GC–MS) and proton transfer reaction-mass spectrometry (PTR-MS). Given the high number of classes, advanced data mining methods are necessary. Random Forest (RF), Penalized Discriminant Analysis (PDA), Discriminant Partial Least Squares (dPLS) and Support Vector Machine (SVM) have been employed for cultivar classification and Random Forest-Recursive Feature Elimination (RF-RFE) has been used to perform feature selection. In particular the most important GC–MS and PTR-MS variables related to gray mold susceptibility of the selected raspberry cultivars have been investigated. Moving from GC–MS profiling to the more rapid and less invasive PTR-MS fingerprinting leads to a cultivar characterization which is still related to the corresponding Botrytis susceptibility level and therefore marker identification is still possible.

[1]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[2]  M. Polissiou,et al.  The effectiveness of plant essential oils on the growth of Botrytis cinerea, Fusarium sp. and Clavibacter michiganensis subsp. michiganensis , 2003 .

[3]  P. Shepson,et al.  Development of a proton-transfer reaction-linear ion trap mass spectrometer for quantitative determination of volatile organic compounds. , 2008, Analytical Chemistry.

[4]  Joseph Arul,et al.  Characterization and use of essential oil from Thymus vulgaris against Botrytis cinerea and Rhizopus stolonifer in strawberry fruits , 1998 .

[5]  Werner Lindinger,et al.  Proton-transfer-reaction mass spectrometry (PTR–MS): on-line monitoring of volatile organic compounds at pptv levels , 1998 .

[6]  J. Nielsen,et al.  The yeast metabolome addressed by electrospray ionization mass spectrometry: Initiation of a mass spectral library and its applications for metabolic footprinting by direct infusion mass spectrometry , 2008, Metabolomics.

[7]  M. Heinonen,et al.  Berry phenolics and their antioxidant activity. , 2001, Journal of agricultural and food chemistry.

[8]  Jörg-Peter Schnitzler,et al.  Practical approaches to plant volatile analysis. , 2006, The Plant journal : for cell and molecular biology.

[9]  C. Furlanello,et al.  Rapid and non-destructive identification of strawberry cultivars by direct PTR-MS headspace analysis and data mining techniques , 2007 .

[10]  F. Harren,et al.  Collision induced dissociation study of 10 monoterpenes for identification in trace gas measurements using the newly developed proton-transfer reaction ion trap mass spectrometer , 2007 .

[11]  Maria Liakata,et al.  Enhancement of Plant Metabolite Fingerprinting by Machine Learning1[W] , 2010, Plant Physiology.

[12]  Wanchang Lin,et al.  Original Article , 1995 .

[13]  Sterner,et al.  Signal suppression in electrospray ionization Fourier transform mass spectrometry of multi-component samples , 2000, Journal of mass spectrometry : JMS.

[14]  Y. Fujii,et al.  Antifungal Effects of Volatile Compounds from Black Zira (Bunium persicum) and Other Spices and Herbs , 2007, Journal of Chemical Ecology.

[15]  C. N. Hewitt,et al.  Effect of water vapour pressure on monoterpene measurements using proton transfer reaction-mass spectrometry (PTR-MS) , 2004 .

[16]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[17]  P. T. Palmer,et al.  Proton transfer reaction ion trap mass spectrometer. , 2003, Rapid communications in mass spectrometry : RCM.

[18]  M. Achouri,et al.  Chemical composition and antifungal activity of essential oils of seven Moroccan Labiatae against Botrytis cinerea Pers: Fr. , 2003, Journal of ethnopharmacology.

[19]  Thomas A. Gerds,et al.  The Validation and Assessment of Machine Learning: A Game of Prediction from High-Dimensional Data , 2009, PloS one.

[20]  Pablo M. Granitto,et al.  On data analysis in PTR-TOF-MS: From raw spectra to data mining , 2011 .

[21]  I. Jolliffe Principal Component Analysis , 2002 .

[22]  Chengyin Shen,et al.  Proton Transfer Reaction Mass Spectrometry (PTR‐MS) , 2012 .

[23]  E. Aprea,et al.  Characterization of 14 raspberry cultivars by solid-phase microextraction and relationship with gray mold susceptibility. , 2010, Journal of agricultural and food chemistry.

[24]  Robert Martin,et al.  Volatile composition in raspberry cultivars grown in the Pacific Northwest determined by stir bar sorptive extraction-gas chromatography-mass spectrometry. , 2008, Journal of agricultural and food chemistry.

[25]  S. Wold,et al.  PLS-regression: a basic tool of chemometrics , 2001 .

[26]  C. Furlanello,et al.  Recursive feature elimination with random forest for PTR-MS analysis of agroindustrial products , 2006 .

[27]  Melanie Hilario,et al.  Standard machine learning algorithms applied to UPLC-TOF/MS metabolic fingerprinting for the discovery of wound biomarkers in Arabidopsis thaliana , 2010 .

[28]  Mar Larrosa,et al.  Urolithins, ellagic acid-derived metabolites produced by human colonic microflora, exhibit estrogenic and antiestrogenic activities. , 2006, Journal of agricultural and food chemistry.

[29]  F. Biasioli,et al.  On quantitative determination of volatile organic compound concentrations using proton transfer reaction time-of-flight mass spectrometry. , 2012, Environmental science & technology.

[30]  D. Stewart,et al.  Towards fruitful metabolomics: high throughput analyses of polyphenol composition in berries using direct infusion mass spectrometry. , 2008, Journal of chromatography. B, Analytical technologies in the biomedical and life sciences.

[31]  Bogusław Buszewski,et al.  Human exhaled air analytics: biomarkers of diseases. , 2007, Biomedical chromatography : BMC.

[32]  Franco Biasioli,et al.  PTR-ToF-MS and data mining methods: a new tool for fruit metabolomics , 2012, Metabolomics.

[33]  F. Biasioli,et al.  Improved mass accuracy in PTR-TOF-MS: Another step towards better compound identification in PTR-MS , 2010 .

[34]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[35]  F. Biasioli,et al.  Proton transfer reaction rate coefficients between H3O+ and some sulphur compounds , 2010 .

[36]  R. Atkinson Atmospheric chemistry of VOCs and NOx , 2000 .

[37]  H. Klee Improving the flavor of fresh fruits: genomics, biochemistry, and biotechnology. , 2010, The New phytologist.

[38]  Philipp Sulzer,et al.  A high resolution and high sensitivity proton-transfer-reaction time-of-flight mass spectrometer (PTR-TOF-MS) , 2009 .

[39]  F. Biasioli,et al.  Investigation of volatile compounds in two raspberry cultivars by two headspace techniques: solid-phase microextraction/gas chromatography-mass spectrometry (SPME/GC-MS) and proton-transfer reaction-mass spectrometry (PTR-MS). , 2009, Journal of agricultural and food chemistry.

[40]  E. Ragazzi,et al.  A metabolite fingerprinting for the characterization of commercial botanical dietary supplements , 2011, Metabolomics.

[41]  Y. Elad,et al.  Botrytis: biology, pathology and control. , 2007 .

[42]  Jun Han,et al.  Mass spectrometry-based technologies for high-throughput metabolomics. , 2009, Bioanalysis.

[43]  W. Jarvis The infection of strawberry and raspberry fruits by Botrytis cinerea Fr. , 1962 .

[44]  T. Annesley Ion suppression in mass spectrometry. , 2003, Clinical chemistry.