Coupled support tensor machine classification for multimodal neuroimaging data

Multimodal data arise in various applications where information about the same phenomenon is acquired from multiple sensors and across different imaging modalities. Learning from multimodal data is of great interest in machine learning and statistics research as this offers the possibility of capturing complementary information among modalities. Multimodal modeling helps to explain the interdependence between heterogeneous data sources, discovers new insights that may not be available from a single modality, and improves decision-making. Recently, coupled matrix-tensor factorization has been introduced for multimodal data fusion to jointly estimate latent factors and identify complex interdependence among the latent factors. However, most of the prior work on coupled matrix-tensor factors focuses on unsupervised learning and there is little work on supervised learning using the jointly estimated latent factors. This paper considers the multimodal tensor data classification problem. A Coupled Support Tensor Machine (C-STM) built upon the latent factors jointly estimated from the Advanced Coupled Matrix Tensor Factorization (ACMTF) is proposed. C-STM combines individual and shared latent factors with multiple kernels and estimates a maximal-margin classifier for coupled matrix tensor data. The classification risk of C-STM is shown to converge to the optimal Bayes risk, making it a statistically consistent rule. C-STM is validated through simulation studies as well as a simultaneous EEG-fMRI analysis. The empirical evidence shows that C-STM can utilize information from multiple sources and provide a better classification performance than traditional single-mode classifiers.

[1]  Terran Lane,et al.  A Framework for Multiple Kernel Support Vector Regression and Its Applications to siRNA Efficacy Prediction , 2009, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[2]  A. Babajani-Feremi,et al.  Application of advanced machine learning methods on resting-state fMRI network for identification of mild cognitive impairment and Alzheimer’s disease , 2015, Brain Imaging and Behavior.

[3]  Quefeng Li,et al.  Integrative Factor Regression and Its Inference for Multimodal Data Analysis , 2019, Journal of the American Statistical Association.

[4]  R. Turner,et al.  Dynamic magnetic resonance imaging of human brain activity during primary sensory stimulation. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Taps Maiti,et al.  Universal Consistency of Support Tensor Machine , 2019, 2019 IEEE International Conference on Data Science and Advanced Analytics (DSAA).

[6]  Klaus-Robert Müller,et al.  Efficient and Accurate Lp-Norm Multiple Kernel Learning , 2009, NIPS.

[7]  Vince D. Calhoun,et al.  Unraveling Diagnostic Biomarkers of Schizophrenia Through Structure-Revealing Fusion of Multi-Modal Neuroimaging Data , 2019, bioRxiv.

[8]  Francis R. Bach,et al.  Consistency of the group Lasso and multiple kernel learning , 2007, J. Mach. Learn. Res..

[9]  Ethem Alpaydin,et al.  Multiple Kernel Learning Algorithms , 2011, J. Mach. Learn. Res..

[10]  Philip S. Yu,et al.  Kernelized Support Tensor Machines , 2017, ICML.

[11]  Chunxiang Jiang,et al.  Prediction and classification of Alzheimer disease based on quantification of MRI deformation , 2017, PloS one.

[12]  Guillaume Flandin,et al.  Multimodal Integration of M/EEG and f/MRI Data in SPM12 , 2019, Front. Neurosci..

[13]  Shai Ben-David,et al.  Understanding Machine Learning: From Theory to Algorithms , 2014 .

[14]  Sergios Theodoridis,et al.  Fusion of EEG and fMRI via Soft Coupled Tensor Decompositions , 2018, 2018 26th European Signal Processing Conference (EUSIPCO).

[15]  Hao Yan,et al.  Multiple Tensor-on-Tensor Regression: An Approach for Modeling Processes With Heterogeneous Sources of Data , 2018, Technometrics.

[16]  Jared A. Nielsen,et al.  Functional connectivity magnetic resonance imaging classification of autism. , 2011, Brain : a journal of neurology.

[17]  Sidong Liu,et al.  Early diagnosis of Alzheimer's disease with deep learning , 2014, 2014 IEEE 11th International Symposium on Biomedical Imaging (ISBI).

[18]  Vince D. Calhoun,et al.  Discriminating schizophrenia and bipolar disorder by fusing fMRI and DTI in a multimodal CCA+ joint ICA model , 2011, NeuroImage.

[19]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..

[20]  M. Lindquist The Statistical Analysis of fMRI Data. , 2008, 0906.3662.

[21]  Mark W. Woolrich,et al.  Linked independent component analysis for multimodal data fusion , 2011, NeuroImage.

[22]  J. Pekar,et al.  Method for multimodal analysis of independent source differences in schizophrenia: Combining gray matter structural and auditory oddball functional data , 2006, Human brain mapping.

[23]  M. Teplan FUNDAMENTALS OF EEG MEASUREMENT , 2002 .

[24]  V. Calhoun,et al.  Combining fMRI and SNP data to investigate connections between brain function and genetics using parallel ICA , 2009, Human brain mapping.

[25]  Dan Schonfeld,et al.  Multilinear Discriminant Analysis for Higher-Order Tensor Data Classification , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Murat Dundar,et al.  A fast iterative algorithm for fisher discriminant using heterogeneous kernels , 2004, ICML.

[27]  Robert Oostenveld,et al.  MEG-BIDS, the brain imaging data structure extended to magnetoencephalography , 2018, Scientific Data.

[28]  R. Durrett Probability: Theory and Examples , 1993 .

[29]  Hongtu Zhu,et al.  Tensor Regression with Applications in Neuroimaging Data Analysis , 2012, Journal of the American Statistical Association.

[30]  László Györfi,et al.  A Probabilistic Theory of Pattern Recognition , 1996, Stochastic Modelling and Applied Probability.

[31]  Pedro A. Valdes-Sosa,et al.  Tensor Analysis and Fusion of Multimodal Brain Images , 2015, Proceedings of the IEEE.

[32]  Philip S. Yu,et al.  DuSK: A Dual Structure-preserving Kernel for Supervised Tensor Learning with Applications to Neuroimages , 2014, SDM.

[33]  Vince D. Calhoun,et al.  ACMTF for fusion of multi-modal neuroimaging data and identification of biomarkers , 2017, 2017 25th European Signal Processing Conference (EUSIPCO).

[34]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[35]  Gholam-Ali Hossein-Zadeh,et al.  Correlated coupled matrix tensor factorization method for simultaneous EEG-fMRI data fusion , 2020, Biomed. Signal Process. Control..

[36]  Annie Qu,et al.  Tensors in Statistics , 2021 .

[37]  David C. Zhu,et al.  Early prediction of Alzheimer’s disease using longitudinal volumetric MRI data from ADNI , 2019, Health Services and Outcomes Research Methodology.

[38]  D. Eidelberg,et al.  Network imaging biomarkers: insights and clinical applications in Parkinson's disease , 2018, The Lancet Neurology.

[39]  William Stafford Noble,et al.  Kernel methods for predicting protein-protein interactions , 2005, ISMB.

[40]  Tamara G. Kolda,et al.  All-at-once Optimization for Coupled Matrix and Tensor Factorizations , 2011, ArXiv.

[41]  Mingjun Zhong,et al.  Data Integration for Classification Problems Employing Gaussian Process Priors , 2006, NIPS.

[42]  Sabine Van Huffel,et al.  Early soft and flexible fusion of EEG and fMRI via tensor decompositions , 2020, ArXiv.

[43]  Xin Yang,et al.  Functional connectivity magnetic resonance imaging classification of autism spectrum disorder using the multisite ABIDE dataset , 2019, 2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI).

[44]  Vince D. Calhoun,et al.  Tensor-based fusion of EEG and FMRI to understand neurological changes in schizophrenia , 2016, 2017 IEEE International Symposium on Circuits and Systems (ISCAS).

[45]  Nello Cristianini,et al.  Learning the Kernel Matrix with Semidefinite Programming , 2002, J. Mach. Learn. Res..

[46]  Trevor Darrell,et al.  Bayesian Localized Multiple Kernel Learning , 2009 .

[47]  Xiaoshan Li,et al.  Tucker Tensor Regression and Neuroimaging Analysis , 2018, Statistics in Biosciences.

[48]  W. Hager,et al.  A SURVEY OF NONLINEAR CONJUGATE GRADIENT METHODS , 2005 .

[49]  J. Lancaster,et al.  Using the talairach atlas with the MNI template , 2001, NeuroImage.

[50]  Tu Bao Ho,et al.  Simple but effective methods for combining kernels in computational biology , 2008, 2008 IEEE International Conference on Research, Innovation and Vision for the Future in Computing and Communication Technologies.

[51]  P. Wolfe Convergence Conditions for Ascent Methods. II: Some Corrections , 1971 .

[52]  Manik Varma,et al.  Learning The Discriminative Power-Invariance Trade-Off , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[53]  Siam Rfview,et al.  CONVERGENCE CONDITIONS FOR ASCENT METHODS , 2016 .

[54]  R. Bro,et al.  Core consistency diagnostic in PARAFAC2 , 2013 .

[55]  M. Powell Nonconvex minimization calculations and the conjugate gradient method , 1984 .

[56]  Samir Kanaan-Izquierdo,et al.  Early Prediction of Alzheimer’s Disease Using Null Longitudinal Model-Based Classifiers , 2017, PloS one.

[57]  Alan C. Evans,et al.  A General Statistical Analysis for fMRI Data , 2000, NeuroImage.

[58]  Charles A. Micchelli,et al.  When is there a representer theorem? Vector versus matrix regularizers , 2008, J. Mach. Learn. Res..

[59]  Michael J. Martinez,et al.  Bias between MNI and Talairach coordinates analyzed using the ICBM‐152 brain template , 2007, Human brain mapping.

[60]  Kristin P. Bennett,et al.  MARK: a boosting algorithm for heterogeneous kernel models , 2002, KDD.

[61]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[62]  Jason Weston,et al.  Gene functional classification from heterogeneous data , 2001, RECOMB.

[63]  Yiming Ding,et al.  A Deep Learning Model to Predict a Diagnosis of Alzheimer Disease by Using 18F-FDG PET of the Brain. , 2019, Radiology.

[64]  J. Kruskal Three-way arrays: rank and uniqueness of trilinear decompositions, with application to arithmetic complexity and statistics , 1977 .

[65]  B. Rosen,et al.  Functional mapping of the human visual cortex by magnetic resonance imaging. , 1991, Science.

[66]  D. Tank,et al.  Brain magnetic resonance imaging with contrast dependent on blood oxygenation. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[67]  Dezhong Yao,et al.  EEG/fMRI fusion based on independent component analysis: integration of data-driven and model-driven methods. , 2012, Journal of integrative neuroscience.

[68]  A. Fagan,et al.  Pittsburgh compound B imaging and prediction of progression from cognitive normality to symptomatic Alzheimer disease. , 2009, Archives of neurology.

[69]  Xuelong Li,et al.  Supervised tensor learning , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[70]  M Filippi,et al.  MR imaging in white matter diseases of the brain and spinal cord , 2005 .

[71]  P. Sajda,et al.  Simultaneous EEG-fMRI Reveals Temporal Evolution of Coupling between Supramodal Cortical Attention Networks and the Brainstem , 2013, The Journal of Neuroscience.

[72]  Eric F. Lock,et al.  Tensor-on-Tensor Regression , 2017, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[73]  Hadi Fanaee-T,et al.  SimTensor: A synthetic tensor data generator , 2016, ArXiv.

[74]  Rasmus Bro,et al.  Structure-revealing data fusion , 2014, BMC Bioinformatics.

[75]  Xin Zhang,et al.  Covariate-Adjusted Tensor Classification in High Dimensions , 2018, Journal of the American Statistical Association.

[76]  Tong Zhang Statistical behavior and consistency of classification methods based on convex risk minimization , 2003 .

[77]  Rodolfo Abreu,et al.  EEG-Informed fMRI: A Review of Data Analysis Methods , 2018, Front. Hum. Neurosci..