Performance comparison of machine learning algorithms and number of independent components used in fMRI decoding of belief vs. disbelief

Machine learning (ML) has become a popular tool for mining functional neuroimaging data, and there are now hopes of performing such analyses efficiently in real-time. Towards this goal, we compared accuracy of six different ML algorithms applied to neuroimaging data of persons engaged in a bivariate task, asserting their belief or disbelief of a variety of propositional statements. We performed unsupervised dimension reduction and automated feature extraction using independent component (IC) analysis and extracted IC time courses. Optimization of classification hyperparameters across each classifier occurred prior to assessment. Maximum accuracy was achieved at 92% for Random Forest, followed by 91% for AdaBoost, 89% for Naïve Bayes, 87% for a J48 decision tree, 86% for K*, and 84% for support vector machine. For real-time decoding applications, finding a parsimonious subset of diagnostic ICs might be useful. We used a forward search technique to sequentially add ranked ICs to the feature subspace. For the current data set, we determined that approximately six ICs represented a meaningful basis set for classification. We then projected these six IC spatial maps forward onto a later scanning session within subject. We then applied the optimized ML algorithms to these new data instances, and found that classification accuracy results were reproducible. Additionally, we compared our classification method to our previously published general linear model results on this same data set. The highest ranked IC spatial maps show similarity to brain regions associated with contrasts for belief > disbelief, and disbelief < belief.

[1]  Xiaoping P. Hu,et al.  Real‐time fMRI using brain‐state classification , 2007, Human brain mapping.

[2]  Don M. Tucker,et al.  A single-trial analytic framework for EEG analysis and its application to target detection and classification , 2008, NeuroImage.

[3]  Paul A. Viola,et al.  Rapid object detection using a boosted cascade of simple features , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[4]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[5]  T. H. Bø,et al.  New feature subset selection procedures for classification of expression profiles , 2002, Genome Biology.

[6]  Stephen Budiansky,et al.  The cold war experiments. , 1994, U.S. news & world report.

[7]  David D. Cox,et al.  Functional magnetic resonance imaging (fMRI) “brain reading”: detecting and classifying distributed patterns of fMRI activity in human visual cortex , 2003, NeuroImage.

[8]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[9]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[10]  Stephen M. Smith,et al.  Improved Optimization for the Robust and Accurate Linear Registration and Motion Correction of Brain Images , 2002, NeuroImage.

[11]  Michael Brady,et al.  Improved Optimization for the Robust and Accurate Linear Registration and Motion Correction of Brain Images , 2002, NeuroImage.

[12]  H. Akaike A new look at the statistical model identification , 1974 .

[13]  R. DeCharms Applications of real-time fMRI , 2008, Nature Reviews Neuroscience.

[14]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[15]  PK Douglas,et al.  Naïve Bayes Classification of Belief verses Disbelief using Event Related Neuroimaging Data , 2009, NeuroImage.

[16]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[17]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[18]  Stephen José Hanson,et al.  Combinatorial codes in ventral temporal lobe for object recognition: Haxby (2001) revisited: is there a “face” area? , 2004, NeuroImage.

[19]  Stephen M. Smith,et al.  Accurate, Robust, and Automated Longitudinal and Cross-Sectional Brain Change Analysis , 2002, NeuroImage.

[20]  R. Poldrack Can cognitive processes be inferred from neuroimaging data? , 2006, Trends in Cognitive Sciences.

[21]  B. Ripley,et al.  Pattern Recognition , 1968, Nature.

[22]  Arthur W. Toga,et al.  Automatic independent component labeling for artifact removal in fMRI , 2008, NeuroImage.

[23]  Stephen José Hanson,et al.  Decoding the Large-Scale Structure of Brain Function by Classifying Mental States Across Individuals , 2009, Psychological science.

[24]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[25]  Peter E. Hart,et al.  The condensed nearest neighbor rule (Corresp.) , 1968, IEEE Trans. Inf. Theory.

[26]  John G. Cleary,et al.  K*: An Instance-based Learner Using and Entropic Distance Measure , 1995, ICML.

[27]  Mark S. Cohen,et al.  Functional neuroimaging of belief, disbelief, and uncertainty , 2008, Annals of neurology.

[28]  Stephen C. Strother,et al.  Support vector machines for temporal classification of block design fMRI data , 2005, NeuroImage.

[29]  Lars Kai Hansen,et al.  Optimizing the fMRI data-processing pipeline using prediction and reproducibility performance metrics: I. A preliminary group analysis , 2004, NeuroImage.

[30]  Stefan Pollmann,et al.  Statistical Learning Analysis in Neuroscience: Aiming for Transparency , 2009, Frontiers in neuroscience.

[31]  Vladimir Naumovich Vapni The Nature of Statistical Learning Theory , 1995 .

[32]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[33]  Huiqing Liu,et al.  A comparative study on feature selection and classification methods using gene expression profiles and proteomic patterns. , 2002, Genome informatics. International Conference on Genome Informatics.

[34]  Stephen M Smith,et al.  Applying FSL to the FIAC data: Model‐based and model‐free analysis of voice and sentence repetition priming , 2006, Human brain mapping.

[35]  D. Wolpert,et al.  No Free Lunch Theorems for Search , 1995 .

[36]  W. K. Simmons,et al.  Circular analysis in systems neuroscience: the dangers of double dipping , 2009, Nature Neuroscience.

[37]  Tom M. Mitchell,et al.  Machine learning classifiers and fMRI: A tutorial overview , 2009, NeuroImage.

[38]  Rainer Goebel,et al.  Information-based functional brain mapping. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[39]  Janaina Mourão Miranda,et al.  The impact of temporal compression and space selection on SVM analysis of single-subject and multi-subject fMRI data , 2006, NeuroImage.

[40]  Neill W Campbell,et al.  IEEE International Conference on Computer Vision and Pattern Recognition , 2008 .

[41]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[42]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[43]  R. Turner,et al.  Characterizing Dynamic Brain Responses with fMRI: A Multivariate Approach , 1995, NeuroImage.

[44]  David W. Aha,et al.  Tolerating Noisy, Irrelevant and Novel Attributes in Instance-Based Learning Algorithms , 1992, Int. J. Man Mach. Stud..

[45]  C.W. Anderson,et al.  Comparison of linear, nonlinear, and feature selection methods for EEG signal classification , 2003, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[46]  Masa-aki Sato,et al.  Sparse estimation automatically selects voxels relevant for the decoding of fMRI activity patterns , 2008, NeuroImage.

[47]  Sean M. Polyn,et al.  Beyond mind-reading: multi-voxel pattern analysis of fMRI data , 2006, Trends in Cognitive Sciences.

[48]  Alan L. Yuille,et al.  Classification of spatially unaligned fMRI scans , 2010, NeuroImage.

[49]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[50]  Gabriel Curio,et al.  MACHINE LEARNING TECHNIQUES FOR BRAIN-COMPUTER INTERFACES , 2004 .

[51]  Josef Kittler,et al.  Feature selection for a DTW-based speaker verification system , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).