Generalized Sparse Regularization with Application to fMRI Brain Decoding

Many current medical image analysis problems involve learning thousands or even millions of model parameters from extremely few samples. Employing sparse models provides an effective means for handling the curse of dimensionality, but other propitious properties beyond sparsity are typically not modeled. In this paper, we propose a simple approach, generalized sparse regularization (GSR), for incorporating domain-specific knowledge into a wide range of sparse linear models, such as the LASSO and group LASSO regression models. We demonstrate the power of GSR by building anatomically-informed sparse classifiers that additionally model the intrinsic spatiotemporal characteristics of brain activity for fMRI classification. We validate on real data and show how prior-informed sparse classifiers outperform standard classifiers, such as SVM and a number of sparse linear classifiers, both in terms of prediction accuracy and result interpretability. Our results illustrate the added-value in facilitating flexible integration of prior knowledge beyond sparsity in large-scale model learning problems.

[1]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[2]  Tom Heskes,et al.  Efficient Bayesian multivariate fMRI analysis using a sparsifying spatio-temporal prior , 2010, NeuroImage.

[3]  Mark W. Schmidt,et al.  Fast Optimization Methods for L1 Regularization: A Comparative Study and Two New Approaches , 2007, ECML.

[4]  Jean-Philippe Vert,et al.  Group lasso with overlap and graph lasso , 2009, ICML '09.

[5]  Ghassan Hamarneh,et al.  Generalized Sparse Classifiers for Decoding Cognitive States in fMRI , 2010, MLMI.

[6]  Masa-aki Sato,et al.  Sparse estimation automatically selects voxels relevant for the decoding of fMRI activity patterns , 2008, NeuroImage.

[7]  Jürgen Hennig,et al.  Fully automated classification of HARDI in vivo data using a support vector machine , 2009, NeuroImage.

[8]  J. -B. Poline,et al.  Estimating the Delay of the fMRI Response , 2002, NeuroImage.

[9]  R. Tibshirani,et al.  A note on the group lasso and a sparse group lasso , 2010, 1001.0736.

[10]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[11]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[12]  Yongyi Yang,et al.  Machine Learning in Medical Imaging , 2010, IEEE Signal Processing Magazine.

[13]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[14]  A. Ravishankar Rao,et al.  Prediction and interpretation of distributed neural activity with sparse models , 2009, NeuroImage.

[15]  Tom Michael Mitchell,et al.  Predicting Human Brain Activity Associated with the Meanings of Nouns , 2008, Science.

[16]  Carlo Miniussi,et al.  The role of the prefrontal cortex in sentence comprehension: An rTMS study , 2008, Cortex.

[17]  Shuicheng Yan,et al.  Graph embedding: a general framework for dimensionality reduction , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[18]  Bertrand Thirion,et al.  Multi-Class Sparse Bayesian Regression for Neuroimaging Data Analysis , 2010, MLMI.

[19]  Jean-Baptiste Poline,et al.  Dealing with the shortcomings of spatial normalization: Multi‐subject parcellation of fMRI datasets , 2006, Human brain mapping.

[20]  Richard S. J. Frackowiak,et al.  Functional anatomy of a common semantic system for words and pictures , 1996, Nature.

[21]  Kaustubh Supekar,et al.  Sparse logistic regression for whole-brain classification of fMRI data , 2010, NeuroImage.

[22]  Tom M. Mitchell,et al.  Learning to Decode Cognitive States from Brain Images , 2004, Machine Learning.

[23]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[24]  J. Fodor The Modularity of mind. An essay on faculty psychology , 1986 .

[25]  R. Tibshirani,et al.  Sparsity and smoothness via the fused lasso , 2005 .

[26]  Michael P. Friedlander,et al.  Probing the Pareto Frontier for Basis Pursuit Solutions , 2008, SIAM J. Sci. Comput..

[27]  Yonina C. Eldar,et al.  Collaborative hierarchical sparse modeling , 2010, 2010 44th Annual Conference on Information Sciences and Systems (CISS).

[28]  Jiawei Han,et al.  Spectral Regression: A Unified Approach for Sparse Subspace Learning , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[29]  R. Tibshirani,et al.  The solution path of the generalized lasso , 2010, 1005.1971.

[30]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.