Beyond brain blobs: machine learning classifiers as instruments for analyzing functional magnetic resonance imaging data

The thesis put forth in this dissertation is that machine learning classifiers can be used as instruments for decoding variables of interest from functional magnetic resonance imaging (fMRI) data. There are two main goals in decoding: (1) Showing that the variable of interest can be predicted from the data in a statistically reliable manner (i.e. there's enough information present). (2) Shedding light on how the data encode the information needed to predict, taking into account what the classifier used can learn and any criteria by which the data are filtered (e.g. how voxels and time points used are chosen). Chapter 2 considers the issues that arise when using traditional linear classifiers and several different voxel selection techniques to strive towards these goals. It examines questions such as whether there is redundant, as well as different kinds of, information in fMRI data and how those facts complicate the task of determining whether voxel subsets encode the desired information or whether certain choices of selection method or classifier are universally preferable. All the results presented were obtained through a comparative study of five fMRI datasets. Chapter 3 and Chapter 4 introduce the Support Vector Decomposition Machine, a new algorithm that attempts to sidestep the use of voxel selection by learning a low dimensional representation of the data that is also suitable for decoding variables of interest and has the potential to allow incorporation of domain information directly, rather than through the proxy of voxel selection criteria.

[1]  N. Intrator On the combination of supervised and unsupervised learning , 1993 .

[2]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[3]  D. Hirschberg,et al.  Small sample statistics for classi cation error rates II: Con-dence intervals and signi cance tests , 1995 .

[4]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[5]  P. Good Permutation, Parametric, and Bootstrap Tests of Hypotheses , 2005 .

[6]  Thomas G. Dietterich,et al.  Solving Multiclass Learning Problems via Error-Correcting Output Codes , 1994, J. Artif. Intell. Res..

[7]  Daniel S. Hirschberg,et al.  Small Sample Statistics for Classification Error Rates I: Error Rate Measurements , 1996 .

[8]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[9]  J. V. Haxby,et al.  Spatial Pattern Analysis of Functional Brain Images Using Partial Least Squares , 1996, NeuroImage.

[10]  Yiming Yang,et al.  A Comparative Study on Feature Selection in Text Categorization , 1997, ICML.

[11]  Thomas G. Dietterich Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms , 1998, Neural Computation.

[12]  福山 秀直,et al.  SPM(statistical parametric mapping)の概要とその応用 (特集:核医学の新潮流) , 1998 .

[13]  Rich Caruana,et al.  Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[14]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[15]  L. K. Hansen,et al.  Generalizable Patterns in Neuroimaging: How Many Principal Components? , 1999, NeuroImage.

[16]  Nello Cristianini,et al.  Advances in Kernel Methods - Support Vector Learning , 1999 .

[17]  Ravi S. Menon,et al.  Spatial and temporal limits in cognitive neuroimaging with fMRI , 1999, Trends in Cognitive Sciences.

[18]  L. K. Hansen,et al.  Plurality and Resemblance in fMRI Data Analysis , 1999, NeuroImage.

[19]  L. K. Hansen,et al.  The Quantitative Evaluation of Functional Neuroimaging Experiments: The NPAIRS Data Analysis Framework , 2000, NeuroImage.

[20]  Michael I. Jordan,et al.  Feature selection for high-dimensional genomic microarray data , 2001, ICML.

[21]  A. Ishai,et al.  Distributed and Overlapping Representations of Faces and Objects in Ventral Temporal Cortex , 2001, Science.

[22]  Yuhong Yang,et al.  Nonparametric Regression with Correlated Errors , 2001 .

[23]  S. Strother,et al.  Penalized discriminant analysis of [/sup 15/O]-water PET brain images with prediction error selection of smoothness and regularization hyperparameters , 2001, IEEE Transactions on Medical Imaging.

[24]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[25]  Michael I. Jordan,et al.  Distance Metric Learning with Application to Clustering with Side-Information , 2002, NIPS.

[26]  Lars Kai Hansen,et al.  A spatially robust ICA algorithm for multiple fMRI data sets , 2002, Proceedings IEEE International Symposium on Biomedical Imaging.

[27]  David D. Cox,et al.  Functional magnetic resonance imaging (fMRI) “brain reading”: detecting and classifying distributed patterns of fMRI activity in human visual cortex , 2003, NeuroImage.

[28]  Eric P. Xing Feature Selection in Microarray Analysis , 2003 .

[29]  T. Carlson,et al.  Patterns of Activity in the Categorical Representations of Objects , 2003, Journal of Cognitive Neuroscience.

[30]  George Forman,et al.  An Extensive Empirical Study of Feature Selection Metrics for Text Classification , 2003, J. Mach. Learn. Res..

[31]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[32]  Vince D. Calhoun,et al.  ICA of functional MRI data: an overview. , 2003 .

[33]  Wei-Yin Loh,et al.  A Comparison of Prediction Accuracy, Complexity, and Training Time of Thirty-Three Old and New Classification Algorithms , 2000, Machine Learning.

[34]  Anthony Randal McIntosh,et al.  Partial least squares analysis of neuroimaging data: applications and advances , 2004, NeuroImage.

[35]  Aapo Hyvärinen,et al.  Validating the independent components of neuroimaging time series via clustering and visualization , 2004, NeuroImage.

[36]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[37]  Stephen José Hanson,et al.  Combinatorial codes in ventral temporal lobe for object recognition: Haxby (2001) revisited: is there a “face” area? , 2004, NeuroImage.

[38]  Tom M. Mitchell,et al.  Learning to Decode Cognitive States from Brain Images , 2004, Machine Learning.

[39]  Larry Wasserman,et al.  All of Statistics , 2004 .

[40]  Nathan Srebro,et al.  Learning with matrix factorizations , 2004 .

[41]  Stephen C. Strother,et al.  Support vector machines for temporal classification of block design fMRI data , 2005, NeuroImage.

[42]  F. Tong,et al.  Decoding the visual and subjective contents of the human brain , 2005, Nature Neuroscience.

[43]  Gabriele Steidl,et al.  Combined SVM-Based Feature Selection and Classification , 2005, Machine Learning.

[44]  Kilian Q. Weinberger,et al.  Distance Metric Learning for Large Margin Nearest Neighbor Classification , 2005, NIPS.

[45]  Sayan Mukherjee,et al.  Permutation Tests for Classification , 2005, COLT.

[46]  Vaidehi S. Natu,et al.  Category-Specific Cortical Activity Precedes Retrieval During Memory Search , 2005, Science.

[47]  J. Langford Tutorial on Practical Prediction Theory for Classification , 2005, J. Mach. Learn. Res..

[48]  Amir Globerson,et al.  Metric Learning by Collapsing Classes , 2005, NIPS.

[49]  G. Rees,et al.  Neuroimaging: Decoding mental states from brain activity in humans , 2006, Nature Reviews Neuroscience.

[50]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[51]  Rich Caruana,et al.  An empirical comparison of supervised learning algorithms , 2006, ICML.

[52]  Rainer Goebel,et al.  Information-based functional brain mapping. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[53]  Sean M. Polyn,et al.  Beyond mind-reading: multi-voxel pattern analysis of fMRI data , 2006, Trends in Cognitive Sciences.

[54]  Tom Mitchell,et al.  Exploring predictive and reproducible modeling with the single‐subject FIAC dataset , 2006, Human brain mapping.

[55]  Geoffrey J. Gordon,et al.  The support vector decomposition machine , 2006, ICML.

[56]  S.C. Strother,et al.  Evaluating fMRI preprocessing pipelines , 2006, IEEE Engineering in Medicine and Biology Magazine.

[57]  Stergios B. Fotopoulos,et al.  All of Nonparametric Statistics , 2007, Technometrics.

[58]  Yongyi Yang,et al.  Effect of Spatial Alignment Transformations in PCA and ICA of Functional Neuroimages , 2007, IEEE Transactions on Medical Imaging.