Decoding the encoding of functional brain networks: An fMRI classification comparison of non-negative matrix factorization (NMF), independent component analysis (ICA), and sparse coding algorithms

BACKGROUND Brain networks in fMRI are typically identified using spatial independent component analysis (ICA), yet other mathematical constraints provide alternate biologically-plausible frameworks for generating brain networks. Non-negative matrix factorization (NMF) would suppress negative BOLD signal by enforcing positivity. Spatial sparse coding algorithms (L1 Regularized Learning and K-SVD) would impose local specialization and a discouragement of multitasking, where the total observed activity in a single voxel originates from a restricted number of possible brain networks. NEW METHOD The assumptions of independence, positivity, and sparsity to encode task-related brain networks are compared; the resulting brain networks within scan for different constraints are used as basis functions to encode observed functional activity. These encodings are then decoded using machine learning, by using the time series weights to predict within scan whether a subject is viewing a video, listening to an audio cue, or at rest, in 304 fMRI scans from 51 subjects. RESULTS AND COMPARISON WITH EXISTING METHOD The sparse coding algorithm of L1 Regularized Learning outperformed 4 variations of ICA (p<0.001) for predicting the task being performed within each scan using artifact-cleaned components. The NMF algorithms, which suppressed negative BOLD signal, had the poorest accuracy compared to the ICA and sparse coding algorithms. Holding constant the effect of the extraction algorithm, encodings using sparser spatial networks (containing more zero-valued voxels) had higher classification accuracy (p<0.001). Lower classification accuracy occurred when the extracted spatial maps contained more CSF regions (p<0.001). CONCLUSION The success of sparse coding algorithms suggests that algorithms which enforce sparsity, discourage multitasking, and promote local specialization may capture better the underlying source processes than those which allow inexhaustible local processes such as ICA. Negative BOLD signal may capture task-related activations.

[1]  Alan L. Yuille,et al.  Classification of spatially unaligned fMRI scans , 2010, NeuroImage.

[2]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[3]  Jack L. Gallant,et al.  Encoding and decoding in fMRI , 2011, NeuroImage.

[4]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[5]  Jeff H. Duyn,et al.  Linear Discriminant Analysis Achieves High Classification Accuracy for the BOLD fMRI Response to Naturalistic Movie Stimuli , 2016, Front. Hum. Neurosci..

[6]  Kasarapu Ramani Performance Comparison of Machine Learning Algorithms , 2018 .

[7]  Aiping Liu,et al.  A Sticky Weighted Regression Model for Time-Varying Resting-State Brain Connectivity Estimation , 2015, IEEE Transactions on Biomedical Engineering.

[8]  Jean-Franois Cardoso High-Order Contrasts for Independent Component Analysis , 1999, Neural Computation.

[9]  Andres Hoyos Idrobo,et al.  Assessing and tuning brain decoders: Cross-validation, caveats, and guidelines , 2016, NeuroImage.

[10]  Ludovica Griffanti,et al.  Automatic denoising of functional MRI data: Combining independent component analysis and hierarchical fusion of classifiers , 2014, NeuroImage.

[11]  Andrzej Cichocki,et al.  A New Learning Algorithm for Blind Signal Separation , 1995, NIPS.

[12]  M. Tarr,et al.  Visual Object Recognition , 1996, ISTCS.

[13]  H. Sebastian Seung,et al.  Algorithms for Non-negative Matrix Factorization , 2000, NIPS.

[14]  J. Cardoso,et al.  Blind beamforming for non-gaussian signals , 1993 .

[15]  S. Palmer Hierarchical structure in perceptual representation , 1977, Cognitive Psychology.

[16]  David L. Sheinberg,et al.  Visual object recognition. , 1996, Annual review of neuroscience.

[17]  Erkki Oja,et al.  Efficient Variant of Algorithm FastICA for Independent Component Analysis Attaining the CramÉr-Rao Lower Bound , 2006, IEEE Transactions on Neural Networks.

[18]  Stefan Haufe,et al.  On the interpretation of weight vectors of linear models in multivariate neuroimaging , 2014, NeuroImage.

[19]  Saeid Sanei,et al.  Fast and incoherent dictionary learning algorithms with application to fMRI , 2015, Signal Image Video Process..

[20]  Vince D. Calhoun,et al.  Group learning using contrast NMF : Application to functional and structural MRI of schizophrenia , 2008, 2008 IEEE International Symposium on Circuits and Systems.

[21]  David Ruppert,et al.  An evaluation of independent component analyses with an application to resting‐state fMRI , 2014, Biometrics.

[22]  Jennifer Bramen,et al.  Effect of bupropion treatment on brain activation induced by cigarette-related cues in smokers. , 2011, Archives of general psychiatry.

[23]  Sungho Tak,et al.  A Data-Driven Sparse GLM for fMRI Analysis Using Sparse Dictionary Learning With MDL Criterion , 2011, IEEE Transactions on Medical Imaging.

[24]  Terrence J. Sejnowski,et al.  An Information-Maximization Approach to Blind Separation and Blind Deconvolution , 1995, Neural Computation.

[25]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[26]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[27]  K. D. Singh,et al.  Negative BOLD in the visual cortex: Evidence against blood stealing , 2004, Human brain mapping.

[28]  Rajat Raina,et al.  Efficient sparse coding algorithms , 2006, NIPS.

[29]  Chih-Jen Lin,et al.  Projected Gradient Methods for Nonnegative Matrix Factorization , 2007, Neural Computation.

[30]  Alan L. Yuille,et al.  Performance comparison of machine learning algorithms and number of independent components used in fMRI decoding of belief vs. disbelief , 2011, NeuroImage.

[31]  Michael Elad,et al.  Efficient Implementation of the K-SVD Algorithm using Batch Orthogonal Matching Pursuit , 2008 .

[32]  Erkki Oja,et al.  Independent component analysis: algorithms and applications , 2000, Neural Networks.

[33]  Stephen C. Strother,et al.  Correction: An Automated, Adaptive Framework for Optimizing Preprocessing Pipelines in Task-Based Functional MRI , 2015, PloS one.

[34]  Michael Brady,et al.  Improved Optimization for the Robust and Accurate Linear Registration and Motion Correction of Brain Images , 2002, NeuroImage.

[35]  Mauro DiNuzzo,et al.  On the origin of sustained negative BOLD response. , 2012, Journal of neurophysiology.

[36]  Jie Tian,et al.  Detecting brain activations by constrained non-negative matrix factorization from task-related BOLD fMRI , 2004, SPIE Medical Imaging.

[37]  Michael W. Spratling Classification using sparse representations: a biologically plausible approach , 2013, Biological Cybernetics.

[38]  Stephen M Smith,et al.  Fast robust automated brain extraction , 2002, Human brain mapping.

[39]  Lars Kai Hansen,et al.  Model sparsity and brain pattern interpretation of classification models in neuroimaging , 2012, Pattern Recognit..

[40]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[41]  Dae-Shik Kim,et al.  Origin of Negative Blood Oxygenation Level—Dependent fMRI Signals , 2002, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[42]  I Daubechies,et al.  Independent component analysis for brain fMRI does not select for independence , 2009 .

[43]  Seong-Whan Lee,et al.  Performance evaluation of nonnegative matrix factorization algorithms to estimate task-related neuronal activities from fMRI data. , 2013, Magnetic resonance imaging.

[44]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[45]  PK Douglas,et al.  Naïve Bayes Classification of Belief verses Disbelief using Event Related Neuroimaging Data , 2009, NeuroImage.

[46]  E. Oja,et al.  Independent Component Analysis , 2013 .

[47]  L. K. Hansen,et al.  Independent component analysis of functional MRI: what is signal and what is noise? , 2003, Current Opinion in Neurobiology.

[48]  Tülay Adali,et al.  Independent Component Analysis by Entropy Bound Minimization , 2010, IEEE Transactions on Signal Processing.

[49]  P. Paatero,et al.  Source identification of bulk wet deposition in Finland by positive matrix factorization , 1995 .

[50]  Kangjoo Lee,et al.  Statistical parametric mapping of FMRI data using sparse dictionary learning , 2010, 2010 IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[51]  Karl J. Friston Modes or models: a critique on independent component analysis for fMRI , 1998, Trends in Cognitive Sciences.

[52]  Roman Filipovych,et al.  Sparse Dictionary Learning of Resting State fMRI Networks , 2012, 2012 Second International Workshop on Pattern Recognition in NeuroImaging.

[53]  Kurt Hornik,et al.  Misc Functions of the Department of Statistics (e1071), TU Wien , 2014 .

[54]  Alan L. Yuille,et al.  Non-negative matrix factorization of multimodal MRI, fMRI and phenotypic data reveals differential changes in default mode subnetworks in ADHD , 2014, NeuroImage.

[55]  S Makeig,et al.  Analysis of fMRI data by blind separation into independent spatial components , 1998, Human brain mapping.

[56]  Stephen M. Smith,et al.  Improved Optimization for the Robust and Accurate Linear Registration and Motion Correction of Brain Images , 2002, NeuroImage.

[57]  D. Chakrabarti,et al.  A fast fixed - point algorithm for independent component analysis , 1997 .

[58]  Michael W. Berry,et al.  Algorithms and applications for approximate nonnegative matrix factorization , 2007, Comput. Stat. Data Anal..

[59]  Saeid Sanei,et al.  A new spatially constrained NMF with application to fMRI , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[60]  Grigori Yourganov,et al.  Comparing within‐subject classification and regularization methods in fMRI for large and small sample sizes , 2014, Human brain mapping.

[61]  P. Paatero,et al.  Positive matrix factorization: A non-negative factor model with optimal utilization of error estimates of data values† , 1994 .

[62]  Steen Moeller,et al.  ICA-based artefact removal and accelerated fMRI acquisition for improved resting state network imaging , 2014, NeuroImage.

[63]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[64]  Mark S. Cohen,et al.  Real-Time Functional MRI Classification of Brain States Using Markov-SVM Hybrid Models: Peering Inside the rt-fMRI Black Box , 2011, MLINI.

[65]  Simon B. Laughlin,et al.  Action Potential Energy Efficiency Varies Among Neuron Types in Vertebrates and Invertebrates , 2010, PLoS Comput. Biol..

[66]  Aapo Hyvärinen,et al.  Fast and robust fixed-point algorithms for independent component analysis , 1999, IEEE Trans. Neural Networks.

[67]  M. Fukunaga,et al.  Negative BOLD-fMRI Signals in Large Cerebral Veins , 2011, Journal of cerebral blood flow and metabolism : official journal of the International Society of Cerebral Blood Flow and Metabolism.

[68]  Saeid Sanei,et al.  A constrained NMF algorithm for bold detection in fMRI , 2010, 2010 IEEE International Workshop on Machine Learning for Signal Processing.

[69]  Arnaud Delorme,et al.  EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis , 2004, Journal of Neuroscience Methods.

[70]  D. Perrett,et al.  Recognition of objects and their component parts: responses of single units in the temporal cortex of the macaque. , 1994, Cerebral cortex.

[71]  Hyunsoo Kim,et al.  Nonnegative Matrix Factorization Based on Alternating Nonnegativity Constrained Least Squares and Active Set Method , 2008, SIAM J. Matrix Anal. Appl..

[72]  Stephen M Smith,et al.  Correspondence of the brain's functional architecture during activation and rest , 2009, Proceedings of the National Academy of Sciences.

[73]  Dimitri Van De Ville,et al.  Disentangling dynamic networks: Separated and joint expressions of functional connectivity patterns in time , 2014, Human brain mapping.

[74]  Mark S. Cohen,et al.  Single trial decoding of belief decision making from EEG and fMRI data using independent components features , 2013, Front. Hum. Neurosci..

[75]  Michael Elad,et al.  Dictionaries for Sparse Representation Modeling , 2010, Proceedings of the IEEE.

[76]  Vince D. Calhoun,et al.  Independent Component Analysis Applied to fMRI Data: A Generative Model for Validating Results , 2004, J. VLSI Signal Process..

[77]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.