A Novel Feature Selection Approach for Analyzing High dimensional Functional MRI Data

Feature selection based on traditional multivariate methods is likely to obtain unstable and unreliable results in case of an extremely high dimensional space and very limited training samples. In order to overcome this difficulty, we introduced a novel feature selection method which combines the idea of stability selection approach and the elastic net approach to detect discriminative features in a stable and robust way. This new method is applied to functional magnetic resonance imaging (fMRI) data, whose discriminative features are often correlated or redundant. Compared with the original stability selection approach with the pure l_1 -norm regularized model serving as the baseline model, the proposed method achieves a better sensitivity empirically, because elastic net encourages a grouping effect besides sparsity. Compared with the feature selection method based on the plain Elastic Net, our method achieves the finite sample control for certain error rates of false discoveries, transparent principle for choosing a proper amount of regularization and the robustness of the feature selection results, due to the incorporation of the stability selection idea. A simulation study showed that our approach are less influenced than other methods by label noise. In addition, the advantage in terms of better control of false discoveries and missed discoveries of our approach was verified in a real fMRI experiment. Finally, a multi-center resting-state fMRI data about Attention-deficit/ hyperactivity disorder (ADHD) suggested that the resulted classifier based on our feature selection method achieves the best and most robust prediction accuracy.

[1]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[2]  Kevin J. Parker,et al.  A novel volumetric feature extraction technique with applications to MR images , 1997, IEEE Transactions on Medical Imaging.

[3]  A. Ishai,et al.  Distributed and Overlapping Representations of Faces and Objects in Ventral Temporal Cortex , 2001, Science.

[4]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[5]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[6]  Stephen C. Strother,et al.  Support vector machines for temporal classification of block design fMRI data , 2005, NeuroImage.

[7]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[8]  Lena Costaridou,et al.  Medical Image Analysis Methods , 2005 .

[9]  Janaina Mourão Miranda,et al.  Classifying brain states and determining the discriminating activation patterns: Support Vector Machine on functional MRI data , 2005, NeuroImage.

[10]  Sean M. Polyn,et al.  Beyond mind-reading: multi-voxel pattern analysis of fMRI data , 2006, Trends in Cognitive Sciences.

[11]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[12]  N. Meinshausen,et al.  Stability selection , 2008, 0809.2932.

[13]  Masa-aki Sato,et al.  Sparse estimation automatically selects voxels relevant for the decoding of fMRI activity patterns , 2008, NeuroImage.

[14]  I. Buciu,et al.  Gabor wavelet based features for medical image analysis and classification , 2009, 2009 2nd International Symposium on Applied Sciences in Biomedical and Communication Technologies.

[15]  Juan José Rodríguez Diez,et al.  Random Subspace Ensembles for fMRI Classification , 2010, IEEE Transactions on Medical Imaging.

[16]  Don Hong,et al.  Weighted Elastic Net Model for Mass Spectrometry Imaging Processing , 2010 .

[17]  Marcos Lévano,et al.  New aspects of the elastic net algorithm for cluster analysis , 2010, Neural Computing and Applications.

[18]  Jianqing Fan,et al.  A Selective Overview of Variable Selection in High Dimensional Feature Space. , 2009, Statistica Sinica.

[19]  Jieping Ye,et al.  Moreau-Yosida Regularization for Grouped Tree Structure Learning , 2010, NIPS.

[20]  Chaogan Yan,et al.  DPARSF: A MATLAB Toolbox for “Pipeline” Data Analysis of Resting-State fMRI , 2010, Front. Syst. Neurosci..

[21]  Yong He,et al.  Hemisphere- and gender-related differences in small-world brain networks: A resting-state functional MRI study , 2011, NeuroImage.

[22]  Kimberly S. Chiew,et al.  Neural correlates of recognition memory for emotional faces and scenes. , 2011, Social cognitive and affective neuroscience.

[23]  Jean-Philippe Vert,et al.  The Influence of Feature Selection Methods on Accuracy, Stability and Interpretability of Molecular Signatures , 2011, PloS one.

[24]  Rajen Dinesh Shah,et al.  Variable selection with error control: another look at stability selection , 2011, 1105.5578.

[25]  J. Haynes Brain Reading: Decoding Mental States From Brain Activity In Humans , 2011 .

[26]  Julien Mairal,et al.  Optimization with Sparsity-Inducing Penalties , 2011, Found. Trends Mach. Learn..

[27]  Lars Kai Hansen,et al.  Model sparsity and brain pattern interpretation of classification models in neuroimaging , 2012, Pattern Recognit..

[28]  W. Cheng,et al.  Individual classification of ADHD patients by integrating multiscale neuroimaging markers and advanced pattern recognition techniques , 2012, Front. Syst. Neurosci..

[29]  Luca Baldassarre,et al.  Structured Sparsity Models for Brain Decoding from fMRI Data , 2012, 2012 Second International Workshop on Pattern Recognition in NeuroImaging.

[30]  Johan Wessberg,et al.  Clustered sampling improves random subspace brain mapping , 2012, Pattern Recognit..

[31]  Gaël Varoquaux,et al.  Small-sample brain mapping: sparse recovery on spatially correlated designs with randomization and clustering , 2012, ICML.

[32]  Kaustubh Supekar,et al.  Estimation of functional connectivity in fMRI data using stability selection-based sparse partial correlation with elastic net penalty , 2012, NeuroImage.

[33]  L. Rohde,et al.  Evaluation of Pattern Recognition and Feature Extraction Methods in ADHD Prediction , 2012, Front. Syst. Neurosci..

[34]  Patricia Figueiredo,et al.  Decoding visual brain states from fMRI using an ensemble of classifiers , 2012, Pattern Recognit..

[35]  Peter A. Bandettini,et al.  Does feature selection improve classification accuracy? Impact of sample size and feature selection on classification using anatomical magnetic resonance images , 2012, NeuroImage.

[36]  Noah D. Brenowitz,et al.  Whole-brain, time-locked activation with simple tasks revealed using massive averaging and model-free analysis , 2012, Proceedings of the National Academy of Sciences.

[37]  Ivor W. Tsang,et al.  Discovering Support and Affiliated Features from Very High Dimensions , 2012, ICML.

[38]  Mary E. Meyerand,et al.  Support vector machine classification and characterization of age-related reorganization of functional brain networks , 2012, NeuroImage.

[39]  Huafu Chen,et al.  Two-stage processing in automatic detection of emotional intensity: a scalp event-related potential study , 2013, Neuroreport.

[40]  Swathi P. Iyer,et al.  Distinct neural signatures detected for ADHD subtypes after controlling for micro-movements in resting state functional connectivity MRI data , 2012, Front. Syst. Neurosci..

[41]  Sun-Yuan Kung,et al.  A classification scheme for ‘high-dimensional-small-sample-size’ data using soda and ridge-SVM with microwave measurement applications , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[42]  Shuiwang Ji,et al.  Automated identification of cell-type-specific genes in the mouse brain by image computing of expression patterns , 2014, BMC Bioinformatics.

[43]  Huafu Chen,et al.  Multivariate classification of social anxiety disorder using whole brain functional connectivity , 2013, Brain Structure and Function.

[44]  Gaël Varoquaux,et al.  Identifying Predictive Regions from fMRI with TV-L1 Prior , 2013, 2013 International Workshop on Pattern Recognition in Neuroimaging.

[45]  David Dagan Feng,et al.  Feature-Based Image Patch Approximation for Lung Tissue Classification , 2013, IEEE Transactions on Medical Imaging.

[46]  Mohamed Saoud,et al.  Disrupting pre-SMA activity impairs facial happiness recognition: an event-related TMS study. , 2013, Cerebral cortex.

[47]  Chandra Sripada,et al.  Disrupted network architecture of the resting brain in attention‐deficit/hyperactivity disorder , 2014, Human brain mapping.

[48]  Rabab Kreidieh Ward,et al.  Improved MRI reconstruction via non-convex elastic net , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[49]  Lijun Wang,et al.  Sparse models for visual image reconstruction from fMRI activity. , 2014, Bio-medical materials and engineering.

[50]  W. Art Chaovalitwongse,et al.  Voxel Selection Framework in Multi-Voxel Pattern Analysis of fMRI Data for Prediction of Neural Response to Visual Stimuli , 2014, IEEE Transactions on Medical Imaging.

[51]  Yong Zhou,et al.  Similarity of markers identified from cancer gene expression studies: observations from GEO , 2014, Briefings Bioinform..

[52]  Julien Mairal,et al.  Efficient RNA isoform identification and quantification from RNA-Seq data with network flows , 2014, Bioinform..

[53]  Zi Wang,et al.  Network-guided regression for detecting associations between DNA methylation and gene expression , 2014, Bioinform..

[54]  Feng Liu,et al.  Steady-State BOLD Response Modulates Low Frequency Neural Oscillations , 2014, Scientific Reports.

[55]  Yan Xu,et al.  Deep learning of feature representation with multiple instance learning for medical image analysis , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[56]  Kathryn R. Cullen,et al.  Developmental Resting State Functional Connectivity for Clinicians , 2014, Current Behavioral Neuroscience Reports.

[57]  M. Verleysen,et al.  Classification in the Presence of Label Noise: A Survey , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[58]  C. Sripada,et al.  Modality-Spanning Deficits in Attention-Deficit/Hyperactivity Disorder in Functional Networks, Gray Matter, and White Matter , 2014, The Journal of Neuroscience.

[59]  Richard J. Harris,et al.  Neural Responses to Expression and Gaze in the Posterior Superior Temporal Sulcus Interact with Facial Identity , 2012, Cerebral cortex.

[60]  Yilun Wang,et al.  Randomized structural sparsity via constrained block subsampling for improved sensitivity of discriminative voxel identification , 2014, NeuroImage.

[61]  Feng Liu,et al.  Steady-state BOLD Response to Higher-order Cognition Modulates Low-Frequency Neural Oscillations , 2015, Journal of Cognitive Neuroscience.

[62]  Yifeng Wang,et al.  Phase-Dependent Alteration of Functional Connectivity Density During Face Recognition in the Infra-slow Frequency Range , 2016 .

[63]  Gilles Blanchard,et al.  Extensions of stability selection using subsamples of observations and covariates , 2014, Stat. Comput..