Common mistakes in cross-validating classification models

The common mistakes of cross-validation (CV) for the development of chemometric models for Raman based biological applications were investigated. We focused on two common mistakes: the first mistake occurs when splitting the dataset into training and validation datasets improperly; and the second mistake is regarding the wrong position of a dimension reduction procedure with respect to the CV loop. For the first mistake, we split the dataset either randomly or each technical replicate was used as one fold of the CV and we compared the results. To check the second mistake, we employed two dimension reduction methods including principal component analysis (PCA) and partial least squares regression (PLS). These dimension reduction models were constructed either once for the whole training data outside the CV loop or rebuilt inside the CV loop for each iteration. We based our study on a benchmark dataset of Raman spectra of three cell types, which included nine technical replicates respectively. Two binary classification models were constructed with a two-layer CV. For the external CV, each replicate was used once as the independent testing dataset. The other replicates were used for the internal CV, where different methods of data splitting and different positions of the dimension reduction were studied. The conclusions include two points. The first point is related to the reliability of the model evaluation by the internal CV, illustrated by the differences between the testing accuracies from the external CV and the validation accuracies from the internal CV. It was demonstrated that the dataset should be split at the highest hierarchical level, which means the biological/technical replicate in this manuscript. Meanwhile, the dimension reduction should be redone for each iteration of the internal CV loop. The second point is the optimization of the performance of the internal CV, benchmarked by the prediction accuracy of the optimized model on the testing dataset. Comparable results were observed for different methods of data splitting and positions of dimension reduction in the internal CV. This means if the internal CV is used for optimizing the model parameters, the two mistakes are less influential in contrast to the model evaluation.

[1]  Jürgen Popp,et al.  The investigation of single bacteria by means of fluorescence staining and Raman spectroscopy , 2007 .

[2]  Richard L. McCreery,et al.  Raman Spectroscopy for Chemical Analysis , 2000 .

[3]  Bruce Dunn,et al.  Ensemble multivariate analysis to improve identification of articular cartilage disease in noisy Raman spectra , 2015, Journal of biophotonics.

[4]  M. Schmitt,et al.  Classification of inflammatory bowel diseases by means of Raman spectroscopic imaging of epithelium cells. , 2012, Journal of biomedical optics.

[5]  J. Popp,et al.  Raman Based Molecular Imaging and Analytics: A Magic Bullet for Biomedical Applications!? , 2016, Analytical chemistry.

[6]  F R Burden,et al.  Cross-validatory selection of test and validation sets in multivariate calibration and neural networks as applied to spectroscopy. , 1997, The Analyst.

[7]  Andreas Uhl,et al.  Evaluation of cross-validation protocols for the classification of endoscopic images of colonic polyps , 2012, 2012 25th IEEE International Symposium on Computer-Based Medical Systems (CBMS).

[8]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[9]  R. McCreery,et al.  Raman Spectroscopy for Chemical Analysis: McCreery/Raman Spectroscopy , 2005 .

[10]  Sylvain Arlot,et al.  A survey of cross-validation procedures for model selection , 2009, 0907.4728.

[11]  K. Baumann,et al.  Gaussian mixture discriminant analysis for the single-cell differentiation of bacteria using micro-Raman spectroscopy , 2009 .

[12]  Browne,et al.  Cross-Validation Methods. , 2000, Journal of mathematical psychology.

[13]  C. Kendall,et al.  Raman spectroscopy for medical diagnostics--From in-vitro biofluid assays to in-vivo cancer detection. , 2015, Advanced drug delivery reviews.

[14]  Benjamin Bird,et al.  Applications of Infrared and Raman Microspectroscopy of Cells and Tissue in Medical Diagnostics: Present Status and Future Promises , 2012 .

[15]  H. Zeng,et al.  Real‐time in vivo cancer diagnosis using raman spectroscopy , 2015, Journal of biophotonics.

[16]  E. V. Thomas Incorporating auxiliary predictor variation in principal component regression models , 1995 .

[17]  H. Abramczyk,et al.  Raman imaging in biochemical and biomedical applications. Diagnosis and treatment of breast cancer. , 2013, Chemical reviews.

[18]  E. K. Kemsley,et al.  THE USE AND MISUSE OF CHEMOMETRICS FOR TREATING CLASSIFICATION PROBLEMS , 1997 .

[19]  Kurt Hornik,et al.  Misc Functions of the Department of Statistics, ProbabilityTheory Group (Formerly: E1071), TU Wien , 2015 .

[20]  P. Filzmoser,et al.  Repeated double cross validation , 2009 .

[21]  Jürgen Popp,et al.  A comprehensive study of classification methods for medical diagnosis , 2009 .

[22]  Anita Mahadevan-Jansen,et al.  Detecting Biochemical Changes in the Rodent Cervix During Pregnancy Using Raman Spectroscopy , 2012, Annals of Biomedical Engineering.

[23]  Axel Mosig,et al.  Label-free imaging of drug distribution and metabolism in colon cancer cells by Raman microscopy. , 2014, The Analyst.

[24]  Antonella I. Mazur,et al.  Molecular pathology via IR and Raman spectral imaging , 2013, Journal of biophotonics.

[25]  Age K. Smilde,et al.  UvA-DARE ( Digital Academic Repository ) Assessment of PLSDA cross validation , 2008 .

[26]  J. Popp,et al.  Sample size planning for classification models. , 2012, Analytica chimica acta.

[27]  Jürgen Popp,et al.  Localizing and identifying living bacteria in an abiotic environment by a combination of Raman and fluorescence microscopy. , 2008, Analytical chemistry.

[28]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[29]  A. Talari,et al.  Raman Spectroscopy of Biological Tissues , 2007 .

[30]  L. Bernstein,et al.  Intraoperative brain cancer detection with Raman spectroscopy in humans , 2015, Science Translational Medicine.

[31]  B. Dietzek,et al.  Raman and coherent anti-Stokes Raman scattering microspectroscopy for biomedical applications. , 2012, Journal of biomedical optics.