Exploring the latent space between brain and behaviour using eigen-decomposition methods

Machine learning methods have been successfully used to analyse neuroimaging data for a variety of applications, including the classification of subjects with different brain disorders. However, most studies still rely on the labelling of the subjects, constraining the study of several brain diseases within a paradigm of pre-defined clinical labels, which have shown to be unreliable in some cases. The lack of understanding regarding the association between brain and behaviour presents itself as an interesting challenge for more exploratory machine learning approaches, which could potentially help in the study of diseases whose clinical labels have shown limitations. The aim of this project is to explore the possibility of using eigen-decomposition approaches to find multivariate associative effects between brain structure and behaviour in an exploratory way. This thesis addresses a number of issues associated with eigen-decomposition methods, in order to enable their application to investigate brain/behaviour relationships in a reliable way. The first contribution was showing the advantages of an alternative matrix deflation approach to be used with Sparse Partial Least Squares (SPLS). The modified SPLS method was later used to model the associations between clinical/demographic data and brain structure, without relying on a priori assumptions on the sparsity of each data source. A novel multiple hold-out SPLS framework was then proposed, which allowed for the detection of robust multivariate associative effects between brain structure and individual questionnaire items. The linearity assumption of most machine learning methods used in neuroimaging might be a limitation, since these methods will not have enough flexibility to detect non-linear associations. In order to address this issue, a novel Sparse Canonical Correlation Analysis (SCCA) method was proposed, which allows one to use sparsity constraints in one data source (e.g. neuroimaging data), with non-linear transformations of the data in the other source (e.g. clinical data).

[1]  Thomas E. Nichols,et al.  The ENIGMA Consortium: large-scale collaborative analyses of neuroimaging and genetic data , 2014, Brain Imaging and Behavior.

[2]  D. Hardoon,et al.  Correlation-based multivariate analysis of genetic influence on brain volume , 2009, Neuroscience Letters.

[3]  Michael Moutoussis,et al.  Brain-behaviour modes of covariation in healthy and clinically depressed young people , 2018, bioRxiv.

[4]  Nick C Fox,et al.  A data-driven model of biomarker changes in sporadic Alzheimer's disease , 2014, Alzheimer's & Dementia.

[5]  Janaina Mourão Miranda,et al.  Multivariate Effect Ranking via Adaptive Sparse PLS , 2015, 2015 International Workshop on Pattern Recognition in NeuroImaging.

[6]  F. Bookstein,et al.  A new statistical method for testing hypotheses of neuropsychological/MRI relationships in schizophrenia: partial least squares analysis , 2002, Schizophrenia Research.

[7]  Colin Fyfe,et al.  Kernel and Nonlinear Canonical Correlation Analysis , 2000, IJCNN.

[8]  D. Tritchler,et al.  Sparse Canonical Correlation Analysis with Application to Genomic Data Integration , 2009, Statistical applications in genetics and molecular biology.

[9]  Karl J. Friston,et al.  Voxel-Based Morphometry—The Methods , 2000, NeuroImage.

[10]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[11]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[12]  P. Matthews,et al.  Multimodal population brain imaging in the UK Biobank prospective epidemiological study , 2016, Nature Neuroscience.

[13]  Vince D. Calhoun,et al.  Correspondence between fMRI and SNP data by group sparse canonical correlation analysis , 2014, Medical Image Anal..

[14]  Roman Rosipal,et al.  Overview and Recent Advances in Partial Least Squares , 2005, SLSFS.

[15]  John Ashburner,et al.  A fast diffeomorphic image registration algorithm , 2007, NeuroImage.

[16]  Tianzi Jiang,et al.  Sparse canonical correlation analysis reveals correlated patterns of gray matter loss and white matter impairment in alzheimer's disease , 2015, 2015 IEEE 12th International Symposium on Biomedical Imaging (ISBI).

[17]  Shannon L. Risacher,et al.  GN-SCCA: GraphNet Based Sparse Canonical Correlation Analysis for Brain Imaging Genetics , 2015, BIH.

[18]  J. Shawe-Taylor,et al.  Data-driven Selection of Relevant fMRI Voxels using Sparse CCA and Stability Selection , 2013 .

[19]  S. Folstein,et al.  "Mini-mental state". A practical method for grading the cognitive state of patients for the clinician. , 1975, Journal of psychiatric research.

[20]  Emil Polajnar,et al.  Using restricted CCA for cross-language information retrieval , 2017, Commun. Stat. Simul. Comput..

[21]  J. Ashburner,et al.  Prognostic and Diagnostic Potential of the Structural Neuroanatomy of Depression , 2009, PloS one.

[22]  Vince D. Calhoun,et al.  Group sparse canonical correlation analysis for genomic data integration , 2013, BMC Bioinformatics.

[23]  Steven C. R. Williams,et al.  Describing the Brain in Autism in Five Dimensions—Magnetic Resonance Imaging-Assisted Diagnosis of Autism Spectrum Disorder Using a Multiparameter Classification Approach , 2010, The Journal of Neuroscience.

[24]  Andre Altmann,et al.  Sex modifies the APOE‐related risk of developing Alzheimer disease , 2014, Annals of neurology.

[25]  Atsushi Kawaguchi,et al.  Radial Basis Function-Sparse Partial Least Squares for Application to Brain Imaging Data , 2013, Comput. Math. Methods Medicine.

[26]  R. Murray,et al.  Individualized prediction of illness course at the first psychotic episode: a support vector machine MRI study , 2011, Psychological Medicine.

[27]  Christian Gaser,et al.  Partial least squares correlation of multivariate cognitive abilities and local brain structure in children and adolescents , 2013, NeuroImage.

[28]  D. Bennett,et al.  Vitamin E and donepezil for the treatment of mild cognitive impairment. , 2005, The New England journal of medicine.

[29]  R. Tibshirani,et al.  A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. , 2009, Biostatistics.

[30]  Anil Rao,et al.  Classification of Alzheimer's Disease from structural MRI using sparse logistic regression with optional spatial regularization , 2011, 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[31]  Lisa A. Weissfeld,et al.  A variant of sparse partial least squares for variable selection and data exploration , 2014, Front. Neuroinform..

[32]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[33]  Karl J. Friston,et al.  Unified segmentation , 2005, NeuroImage.

[34]  S. Keleş,et al.  Sparse partial least squares regression for simultaneous dimension reduction and variable selection , 2010, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[35]  Philippe Besse,et al.  Sparse PLS discriminant analysis: biologically relevant feature selection and graphical displays for multiclass problems , 2011, BMC Bioinformatics.

[36]  A. Zwinderman,et al.  Statistical Applications in Genetics and Molecular Biology Quantifying the Association between Gene Expressions and DNA-Markers by Penalized Canonical Correlation Analysis , 2011 .

[37]  H. Hotelling Relations Between Two Sets of Variates , 1936 .

[38]  S. Resnick,et al.  Detection of prodromal Alzheimer's disease via pattern classification of magnetic resonance imaging , 2008, Neurobiology of Aging.

[39]  Gene H. Golub,et al.  Numerical methods for computing angles between linear subspaces , 1971, Milestones in Matrix Computation.

[40]  Lin Shi,et al.  Nonlinear Canonical Correlation Analysis of fMRI Signals Using HDR Models , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[41]  Vince D. Calhoun,et al.  Multi-set canonical correlation analysis for the fusion of concurrent single trial ERP and functional MRI , 2010, NeuroImage.

[42]  Judy H. Cho,et al.  Identification of association between disease and multiple markers via sparse partial least‐squares regression , 2011, Genetic epidemiology.

[43]  Martin Rossor,et al.  Differing patterns of temporal atrophy in Alzheimer's disease and semantic dementia. , 2002, Neurology.

[44]  Juho Rousu,et al.  Canonical Correlation Methods for Exploring Microbe-Environment Interactions in Deep Subsurface , 2015, Discovery Science.

[45]  Nikos D. Sidiropoulos,et al.  Scalable and Flexible Multiview MAX-VAR Canonical Correlation Analysis , 2016, IEEE Transactions on Signal Processing.

[46]  Andres Hoyos Idrobo,et al.  Assessing and tuning brain decoders: Cross-validation, caveats, and guidelines , 2016, NeuroImage.

[47]  C.-Y.C. Chu,et al.  Pattern recognition and machine learning for magnetic resonance images with kernel methods , 2009 .

[48]  Daniel Rueckert,et al.  Hierarchical Statistical Shape Analysis and Prediction of Sub-cortical Brain Structures , 2007 .

[49]  K. Davis,et al.  A new rating scale for Alzheimer's disease. , 1984, The American journal of psychiatry.

[50]  Alexander Gammerman,et al.  Machine learning classification with confidence: Application of transductive conformal predictors to MRI-based diagnostic and prognostic markers in depression , 2011, NeuroImage.

[51]  Lars T. Westlye,et al.  Comparison of variants of canonical correlation analysis and partial least squares for combined analysis of MRI and genetic data , 2015, NeuroImage.

[52]  Janaina Mourão Miranda,et al.  Predictive modelling using neuroimaging data in the presence of confounds , 2017, NeuroImage.

[53]  Andrew P. Holmes,et al.  Statistical issues in functional brain mapping. , 1994 .

[54]  Gereon R. Fink,et al.  fMRI Data Predict Individual Differences of Behavioral Effects of Nicotine: A Partial Least Square Analysis , 2007, Journal of Cognitive Neuroscience.

[55]  Andre F. Marquand,et al.  Estimating multivariate similarity between neuroimaging datasets with sparse canonical correlation analysis: an application to perfusion imaging , 2015, Front. Neurosci..

[56]  Janaina Mourão Miranda,et al.  Embedding Anatomical or Functional Knowledge in Whole-Brain Multiple Kernel Learning Models , 2018, Neuroinformatics.

[57]  Karl J. Friston,et al.  Statistical parametric maps in functional imaging: A general linear approach , 1994 .

[58]  John Shawe-Taylor,et al.  SCoRS—A Method Based on Stability for Feature Selection and Mapping in Neuroimaging , 2014, IEEE Transactions on Medical Imaging.

[59]  Daniela M Witten,et al.  Extensions of Sparse Canonical Correlation Analysis with Applications to Genomic Data , 2009, Statistical applications in genetics and molecular biology.

[60]  Janaina Mourão Miranda,et al.  A Comparison of Strategies for Incorporating Nuisance Variables into Predictive Neuroimaging Models , 2015, 2015 International Workshop on Pattern Recognition in NeuroImaging.

[61]  Ignacio Santamaría,et al.  Adaptive Kernel Canonical Correlation Analysis Algorithms for Nonparametric Identification of Wiener and Hammerstein Systems , 2008, EURASIP J. Adv. Signal Process..

[62]  T. Hänninen,et al.  Rey's Auditory Verbal Learning Test scores can be predicted from whole brain MRI in Alzheimer's disease , 2016, NeuroImage: Clinical.

[63]  Thomas E. Nichols,et al.  A positive-negative mode of population covariation links brain connectivity, demographics and behavior , 2015, Nature Neuroscience.

[64]  Simon J Graham,et al.  An fMRI study investigating cognitive modulation of brain regions associated with emotional processing of visual stimuli , 2003, Neuropsychologia.

[65]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[66]  H. Knutsson,et al.  Detection of neural activity in functional MRI using canonical correlation analysis , 2001, Magnetic resonance in medicine.

[67]  Robert Sekuler,et al.  Corticolimbic Interactions Associated with Performance on a Short-Term Memory Task Are Modified by Age , 2000, The Journal of Neuroscience.

[68]  John Shawe-Taylor,et al.  Sparse canonical correlation analysis , 2009, Machine Learning.

[69]  Le Song,et al.  Hilbert Space Embeddings of Hidden Markov Models , 2010, ICML.

[70]  Paul M. Thompson,et al.  Imaging genetics via sparse canonical correlation analysis , 2013, 2013 IEEE 10th International Symposium on Biomedical Imaging.

[71]  Anthony Randal McIntosh,et al.  Partial Least Squares (PLS) methods for neuroimaging: A tutorial and review , 2011, NeuroImage.

[72]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2003, ICTAI.

[73]  J. V. Haxby,et al.  Spatial Pattern Analysis of Functional Brain Images Using Partial Least Squares , 1996, NeuroImage.

[74]  W. Zheng,et al.  Facial expression recognition using kernel canonical correlation analysis (KCCA) , 2006, IEEE Transactions on Neural Networks.

[75]  Arthur W. Toga,et al.  A Probabilistic Atlas of the Human Brain: Theory and Rationale for Its Development The International Consortium for Brain Mapping (ICBM) , 1995, NeuroImage.

[76]  Brian B. Avants,et al.  Dementia induces correlated reductions in white matter integrity and cortical thickness: A multivariate neuroimaging study with sparse canonical correlation analysis , 2010, NeuroImage.

[77]  M. Healy A rotation method for computing canonical correlations , 1957 .

[78]  Philippe Besse,et al.  Sparse canonical methods for biological data integration: application to a cross-platform study , 2009, BMC Bioinformatics.

[79]  Janaina Mourão Miranda,et al.  Prediction of clinical scores from neuroimaging data with censored likelihood gaussian processes , 2016, 2016 International Workshop on Pattern Recognition in Neuroimaging (PRNI).

[80]  C. Jack,et al.  Rates of hippocampal atrophy correlate with change in clinical status in aging and AD , 2000, Neurology.

[81]  Jacob A. Wegelin,et al.  A Survey of Partial Least Squares (PLS) Methods, with Emphasis on the Two-Block Case , 2000 .

[82]  H. Wold Causal flows with latent variables: Partings of the ways in the light of NIPALS modelling , 1974 .

[83]  V. Calhoun,et al.  In Search of Multimodal Neuroimaging Biomarkers of Cognitive Deficits in Schizophrenia , 2015, Biological Psychiatry.

[84]  N. Meinshausen,et al.  Stability selection , 2008, 0809.2932.

[85]  Brian B. Avants,et al.  Sparse canonical correlation analysis relates network-level atrophy to multivariate cognitive measures in a neurodegenerative population , 2014, NeuroImage.

[86]  Janaina Mourão Miranda,et al.  Unsupervised analysis of fMRI data using kernel canonical correlation , 2007, NeuroImage.

[87]  Li Dong,et al.  Characterizing nonlinear relationships in functional imaging data using eigenspace maximal information canonical correlation analysis (emiCCA) , 2015, NeuroImage.

[88]  David A. Seminowicz,et al.  Personality influences limbic-cortical interactions during sad mood induction , 2003, NeuroImage.

[89]  B. Sahakian,et al.  Differing patterns of temporal atrophy in Alzheimer’s disease and semantic dementia , 2001, Neurology.

[90]  T. Kurosaki,et al.  Measurement of functional activities in older adults in the community. , 1982, Journal of gerontology.

[91]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[92]  M N Rossor,et al.  Patterns of temporal lobe atrophy in semantic dementia and Alzheimer's disease , 2001, Annals of neurology.

[93]  Audra E. Kosh,et al.  Linear Algebra and its Applications , 1992 .

[94]  Hao He,et al.  Three-way FMRI-DTI-methylation data fusion based on mCCA+jICA and its application to schizophrenia , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[95]  Andreas Bartels,et al.  Semi-supervised kernel canonical correlation analysis with application to human fMRI , 2011, Pattern Recognit. Lett..

[96]  Nick C Fox,et al.  Automatic classification of MR scans in Alzheimer's disease. , 2008, Brain : a journal of neurology.

[97]  B.J. Lopresti,et al.  Quantitative and statistical analyses of PET imaging studies of amyloid deposition in humans , 2004, IEEE Symposium Conference Record Nuclear Science 2004..

[98]  T. Insel,et al.  Wesleyan University From the SelectedWorks of Charles A . Sanislow , Ph . D . 2010 Research Domain Criteria ( RDoC ) : Toward a New Classification Framework for Research on Mental Disorders , 2018 .

[99]  Michael Weiner,et al.  Hippocampal Surface Mapping of Genetic Risk Factors in AD via Sparse Learning Models , 2011, MICCAI.

[100]  Tom M. Mitchell,et al.  Machine learning classifiers and fMRI: A tutorial overview , 2009, NeuroImage.

[101]  Michael I. Jordan,et al.  Kernel independent component analysis , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[102]  John Shawe-Taylor,et al.  A multimodal multiple kernel learning approach to Alzheimer's disease detection , 2016, 2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP).

[103]  Hans Knutsson,et al.  A canonical correlation approach to exploratory data analysis in fMRI , 2002 .

[104]  John Shawe-Taylor,et al.  Leveraging Clinical Data to Enhance Localization of Brain Atrophy , 2014, MLINI@NIPS.

[105]  Peter A. Bandettini,et al.  Characteristic cortical thickness patterns in adolescents with autism spectrum disorders: Interactions with age and intellectual ability revealed by canonical correlation analysis , 2012, NeuroImage.

[106]  Philippe Besse,et al.  Statistical Applications in Genetics and Molecular Biology A Sparse PLS for Variable Selection when Integrating Omics Data , 2011 .

[107]  R. Mohs,et al.  The Alzheimer's Disease Assessment Scale , 1996, International Psychogeriatrics.

[108]  C. Croux,et al.  Sparse canonical correlation analysis from a predictive point of view , 2015, Biometrical journal. Biometrische Zeitschrift.

[109]  A R McIntosh,et al.  General and specific brain regions involved in encoding and retrieval of events: what, where, and when. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[110]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[111]  A. Mechelli,et al.  Using Support Vector Machine to identify imaging biomarkers of neurological and psychiatric disease: A critical review , 2012, Neuroscience & Biobehavioral Reviews.

[112]  Lester W. Mackey,et al.  Deflation Methods for Sparse PCA , 2008, NIPS.

[113]  G. Golub,et al.  The canonical correlations of matrix pairs and their numerical computation , 1992 .

[114]  John G. Csernansky,et al.  Open Access Series of Imaging Studies: Longitudinal MRI Data in Nondemented and Demented Older Adults , 2010, Journal of Cognitive Neuroscience.

[115]  Janaina Mourão Miranda,et al.  Classifying brain states and determining the discriminating activation patterns: Support Vector Machine on functional MRI data , 2005, NeuroImage.

[116]  Tutut Herawan,et al.  Computational and mathematical methods in medicine. , 2006, Computational and mathematical methods in medicine.

[117]  Anastasia Lykou,et al.  Sparse CCA using a Lasso with positivity constraints , 2010, Comput. Stat. Data Anal..

[118]  Juho Rousu,et al.  Biomarker Discovery by Sparse Canonical Correlation Analysis of Complex Clinical Phenotypes of Tuberculosis and Malaria , 2013, PLoS Comput. Biol..

[119]  Antonio Moreno,et al.  Significant correlation between a set of genetic polymorphisms and a functional brain network revealed by feature selection and sparse Partial Least Squares , 2012, NeuroImage.