Feature and model selection with discriminatory visualization for diagnostic classification of brain tumors

Machine Learning (ML) and related methods have of late made significant contributions to solving multidisciplinary problems in the field of oncology diagnosis. Human brain tumor diagnosis, in particular, often relies on the use of non-invasive techniques such as Magnetic Resonance Imaging (MRI) and Spectroscopy (MRS). In this paper, MRS data of human brain tumors are analyzed in detail. The high dimensionality of the MR spectra makes difficult both their classification and the interpretation of the obtained results, thus limiting their usability in practical medical settings. The use of dimensionality reduction techniques is therefore advisable. In this work, we apply feature selection methods and several off-the-shelf classifiers on various ^1H-MRS modalities: long and short echo times and an ad hoc combination of both. The introduction of bootstrap resampling techniques permits the obtention of mean performance estimates and their variability. Our experimental findings indicate that the feature selection process enhances the classification performance compared to using the full set of features. We also show that the use of combined information from the different echo times is a better strategy for small numbers of spectral frequencies; however, the use of ever greater numbers of short echo time frequencies permits the obtention of many models with similar performance. The final induced models offer very attractive solutions both in terms of prediction accuracy and number of involved spectral frequencies, which are also amenable to metabolic interpretation. A linear dimensionality-reduction technique that preserves class discrimination capabilities is used for visualizing the data corresponding to the selected frequencies.

[1]  Paulo J. G. Lisboa,et al.  Machine learning in cancer research: implications for personalised medicine , 2008, ESANN.

[2]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[3]  Y. Kinoshita,et al.  Proton magnetic resonance spectroscopy of brain tumors: an in vitro study. , 1994, Neurosurgery.

[4]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[5]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[6]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[7]  John W. Sammon,et al.  A Nonlinear Mapping for Data Structure Analysis , 1969, IEEE Transactions on Computers.

[8]  Lluís A. Belanche Muñoz,et al.  Gene subset selection in microarray data using entropic filtering for cancer classification , 2009, Expert Syst. J. Knowl. Eng..

[9]  Margarida Julià-Sapé,et al.  Brain tumor classification by proton MR spectroscopy: comparison of diagnostic accuracy at short and long TE. , 2004, AJNR. American journal of neuroradiology.

[10]  J. Frahm,et al.  Noninvasive differentiation of tumors with use of localized H-1 MR spectroscopy in vivo: initial experience in patients with cerebral tumors. , 1989, Radiology.

[11]  Lukasz A. Kurgan,et al.  CAIM discretization algorithm , 2004, IEEE Transactions on Knowledge and Data Engineering.

[12]  A. W. Simonetti,et al.  Development of a decision support system for diagnosis and grading of brain tumours using in vivo magnetic resonance single voxel spectra , 2006, NMR in biomedicine.

[13]  F A Howe,et al.  The clinical value of proton magnetic resonance spectroscopy in adult brain tumours. , 2007, Clinical radiology.

[14]  David G. Stork,et al.  Pattern Classification , 1973 .

[15]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[16]  Sabine Van Huffel,et al.  The effect of combining two echo times in automatic brain tumor classification by MRS , 2008, NMR in biomedicine.

[17]  K. Hara,et al.  Noninvasive evaluation of malignancy of brain tumors with proton MR spectroscopy. , 1996, AJNR. American journal of neuroradiology.

[18]  E. Ziegel,et al.  Bootstrapping: A Nonparametric Approach to Statistical Inference , 1993 .

[19]  K. Gabriel,et al.  The biplot graphic display of matrices with application to principal component analysis , 1971 .

[20]  Christophe Ludovic,et al.  Pattern Recognition Techniques for the Study of Magnetic Resonance Spectra of Brain Tumours , 2003 .

[21]  Melanie Hilario,et al.  Knowledge and Information Systems , 2007 .

[22]  M. Julià-Sapé,et al.  A Multi-Centre, Web-Accessible and Quality Control-Checked Database of in vivo MR Spectra of Brain Tumour Patients , 2006, Magnetic Resonance Materials in Physics, Biology and Medicine.

[23]  Colas Schretter,et al.  Information-Theoretic Feature Selection in Microarray Data Using Variable Complementarity , 2008, IEEE Journal of Selected Topics in Signal Processing.

[24]  Lluís A. Belanche Muñoz,et al.  Feature Selection in in vivo1H-MRS Single Voxel Spectra , 2008, KES.

[25]  Hui Wang Towards a unified framework of relevance , 1996 .

[26]  Robert Tibshirani,et al.  Bootstrap Methods for Standard Errors, Confidence Intervals, and Other Measures of Statistical Accuracy , 1986 .

[27]  Paulo J. G. Lisboa,et al.  Handling outliers in brain tumour MRS data analysis through robust topographic mapping , 2006, Comput. Biol. Medicine.

[28]  T. Kohonen Self-organized formation of topographically correct feature maps , 1982 .

[29]  C. H. Coombs,et al.  Mathematical psychology : an elementary introduction , 1970 .

[30]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[31]  M. Ng,et al.  Informative Gene Discovery for Cancer Classification from Microarray Expression Data , 2005, 2005 IEEE Workshop on Machine Learning for Signal Processing.

[32]  Paulo J. G. Lisboa,et al.  Cluster-based visualisation with scatter matrices , 2008, Pattern Recognit. Lett..

[33]  G Friedmann,et al.  Human brain tumors: spectral patterns detected with localized H-1 MR spectroscopy. , 1992, Radiology.

[34]  Duy-Dinh Le,et al.  Robust Object Detection using Fast Feature Selection from Huge Feature Sets , 2006, 2006 International Conference on Image Processing.

[35]  H. P. Friedman,et al.  On Some Invariant Criteria for Grouping Data , 1967 .

[36]  Hidehiko Kajiwara,et al.  Proton Magnetic Resonance Spectroscopy of Brain TumorsAn In Vitro Study , 1994 .

[37]  J Hennig,et al.  Human brain tumors: assessment with in vivo proton MR spectroscopy. , 1993, Radiology.