NETWORK CLASSIFICATION WITH APPLICATIONS TO BRAIN CONNECTOMICS.

While statistical analysis of a single network has received a lot of attention in recent years, with a focus on social networks, analysis of a sample of networks presents its own challenges which require a different set of analytic tools. Here we study the problem of classification of networks with labeled nodes, motivated by applications in neuroimaging. Brain networks are constructed from imaging data to represent functional connectivity between regions of the brain, and previous work has shown the potential of such networks to distinguish between various brain disorders, giving rise to a network classification problem. Existing approaches tend to either treat all edge weights as a long vector, ignoring the network structure, or focus on graph topology as represented by summary measures while ignoring the edge weights. Our goal is to design a classification method that uses both the individual edge information and the network structure of the data in a computationally efficient way, and that can produce a parsimonious and interpretable representation of differences in brain connectivity patterns between classes. We propose a graph classification method that uses edge weights as predictors but incorporates the network nature of the data via penalties that promote sparsity in the number of nodes, in addition to the usual sparsity penalties that encourage selection of edges. We implement the method via efficient convex optimization and provide a detailed analysis of data from two fMRI studies of schizophrenia.

[1]  Can M. Le,et al.  Sparse random graphs: regularization and concentration of the Laplacian , 2015, ArXiv.

[2]  Dennis L. Sun,et al.  Exact post-selection inference, with application to the lasso , 2013, 1311.6238.

[3]  D. Battle,et al.  Diagnostic and Statistical Manual of Mental Disorders (DSM). , 2013, CoDAS.

[4]  Genevera I. Allen,et al.  Two Sample Inference for Populations of Graphical Models with Applications to Functional Connectivity , 2015, 1502.03853.

[5]  V. D. Calhoun,et al.  Using joint ICA to link function and structure using MEG and DTI in schizophrenia , 2013, NeuroImage.

[6]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[7]  Edward T. Bullmore,et al.  Schizophrenia, neuroimaging and connectomics , 2012, NeuroImage.

[8]  William Stafford Noble,et al.  Support vector machine , 2013 .

[9]  Jean-Philippe Vert,et al.  Group lasso with overlap and graph lasso , 2009, ICML '09.

[10]  M. Chun,et al.  Functional connectome fingerprinting: Identifying individuals based on patterns of brain connectivity , 2015, Nature Neuroscience.

[11]  Anderson Y. Zhang,et al.  Minimax Rates of Community Detection in Stochastic Block Models , 2015, ArXiv.

[12]  N. Meinshausen,et al.  High-dimensional graphs and variable selection with the Lasso , 2006, math/0608017.

[13]  M. First,et al.  Structured Clinical Interview for DSM-IV Axis I Disorders , 1997 .

[14]  Marc Teboulle,et al.  A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[15]  Hang Joon Jo,et al.  The perils of global signal regression for group comparisons: a case study of Autism Spectrum Disorders , 2013, Front. Hum. Neurosci..

[16]  P. Bickel,et al.  A nonparametric view of network models and Newman–Girvan and other modularities , 2009, Proceedings of the National Academy of Sciences.

[17]  C. O’Brien Statistical Learning with Sparsity: The Lasso and Generalizations , 2016 .

[18]  Yoshua Bengio,et al.  Non-Local Manifold Tangent Learning , 2004, NIPS.

[19]  N. Meinshausen,et al.  Stability selection , 2008, 0809.2932.

[20]  Robert Tibshirani,et al.  1-norm Support Vector Machines , 2003, NIPS.

[21]  Hisashi Kashima,et al.  Marginalized Kernels Between Labeled Graphs , 2003, ICML.

[22]  Chandra Sripada,et al.  Disrupted network architecture of the resting brain in attention‐deficit/hyperactivity disorder , 2014, Human brain mapping.

[23]  Abraham Z. Snyder,et al.  Spurious but systematic correlations in functional connectivity MRI networks arise from subject motion , 2012, NeuroImage.

[24]  Edward T. Bullmore,et al.  Network-based statistic: Identifying differences in brain networks , 2010, NeuroImage.

[25]  Wen Gao,et al.  Efficient Generalized Fused Lasso and its Application to the Diagnosis of Alzheimer's Disease , 2014, AAAI.

[26]  Rajen Dinesh Shah,et al.  Variable selection with error control: another look at stability selection , 2011, 1105.5578.

[27]  Ann K. Shinn,et al.  Default mode network abnormalities in bipolar disorder and schizophrenia , 2010, Psychiatry Research: Neuroimaging.

[28]  Axel Benner,et al.  penalizedSVM: a R-package for feature selection SVM classification , 2009, Bioinform..

[29]  Katya Scheinberg,et al.  Fast First-Order Methods for Composite Convex Optimization with Backtracking , 2014, Found. Comput. Math..

[30]  Dost Öngür,et al.  Anticorrelations in resting state networks without global signal regression , 2012, NeuroImage.

[31]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[32]  R. Tibshirani,et al.  A SIGNIFICANCE TEST FOR THE LASSO. , 2013, Annals of statistics.

[33]  Gregory A. Miller,et al.  Bilateral hippocampal dysfunction in schizophrenia , 2011, NeuroImage.

[34]  Cheng Luo,et al.  Dysfunction of Large-Scale Brain Networks in Schizophrenia: A Meta-analysis of Resting-State Functional Connectivity , 2018, Schizophrenia bulletin.

[35]  Timothy O. Laumann,et al.  Functional Network Organization of the Human Brain , 2011, Neuron.

[36]  V. Calhoun,et al.  Exploring the Psychosis Functional Connectome: Aberrant Intrinsic Networks in Schizophrenia and Bipolar Disorder , 2012, Front. Psychiatry.

[37]  S. Geer,et al.  On asymptotically optimal confidence regions and tests for high-dimensional models , 2013, 1303.0518.

[38]  Ashwin Srinivasan,et al.  The Predictive Toxicology Challenge 2000-2001 , 2001, Bioinform..

[39]  Yoram Singer,et al.  Efficient Online and Batch Learning Using Forward Backward Splitting , 2009, J. Mach. Learn. Res..

[40]  Jieping Ye,et al.  Efficient Methods for Overlapping Group Lasso , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Daniele Durante,et al.  Bayesian Inference and Testing of Group Differences in Brain Networks , 2014, 1411.6506.

[42]  D. Battle,et al.  Diagnostic and Statistical Manual of Mental Disorders (DSM). , 2013, CoDAS.

[43]  Thomas T. Liu,et al.  A component based noise correction method (CompCor) for BOLD and perfusion based fMRI , 2007, NeuroImage.

[44]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[45]  O. Sporns,et al.  Complex brain networks: graph theoretical analysis of structural and functional systems , 2009, Nature Reviews Neuroscience.

[46]  George Karypis,et al.  Frequent substructure-based approaches for classifying chemical compounds , 2003, IEEE Transactions on Knowledge and Data Engineering.

[47]  Yufeng Zang,et al.  Standardizing the intrinsic brain: Towards robust measurement of inter-individual variation in 1000 functional connectomes , 2013, NeuroImage.

[48]  M. Lindquist The Statistical Analysis of fMRI Data. , 2008, 0906.3662.

[49]  V. Menon Large-scale brain networks and psychopathology: a unifying triple network model , 2011, Trends in Cognitive Sciences.

[50]  J. Gabrieli,et al.  Hyperactivity and hyperconnectivity of the default network in schizophrenia and in first-degree relatives of persons with schizophrenia , 2009, Proceedings of the National Academy of Sciences.

[51]  Yuji Matsumoto,et al.  An Application of Boosting to Graph Classification , 2004, NIPS.

[52]  Nicolai Meinshausen,et al.  Relaxed Lasso , 2007, Comput. Stat. Data Anal..

[53]  Stephen P. Boyd,et al.  Proximal Algorithms , 2013, Found. Trends Optim..

[54]  Yong He,et al.  Disrupted small-world networks in schizophrenia. , 2008, Brain : a journal of neurology.

[55]  Daniel L. Rubin,et al.  Network Analysis of Intrinsic Functional Brain Connectivity in Alzheimer's Disease , 2008, PLoS Comput. Biol..

[56]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[57]  Brian A. Nosek,et al.  Power failure: why small sample size undermines the reliability of neuroscience , 2013, Nature Reviews Neuroscience.

[58]  Jun Li,et al.  Hypothesis Testing For Network Data in Functional Neuroimaging , 2014, 1407.5525.

[59]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[60]  Genevera I. Allen,et al.  Local‐aggregate modeling for big data via distributed optimization: Applications to neuroimaging , 2014, Biometrics.

[61]  Jonathan E. Taylor,et al.  Interpretable whole-brain prediction analysis with GraphNet , 2013, NeuroImage.

[62]  R. Tibshirani,et al.  A note on the group lasso and a sparse group lasso , 2010, 1001.0736.

[63]  Kathryn B. Laskey,et al.  Stochastic blockmodels: First steps , 1983 .

[64]  Shantanu H. Joshi,et al.  Brain connectivity and novel network measures for Alzheimer's disease classification , 2015, Neurobiology of Aging.

[65]  Francis R. Bach,et al.  Consistency of the group Lasso and multiple kernel learning , 2007, J. Mach. Learn. Res..

[66]  Thomas Gärtner,et al.  On Graph Kernels: Hardness Results and Efficient Alternatives , 2003, COLT.

[67]  Michael Angstadt,et al.  Disease prediction based on functional connectomes using a scalable and spatially-informed support vector machine , 2013, NeuroImage.

[68]  Vince D. Calhoun,et al.  A High-Throughput Pipeline Identifies Robust Connectomes But Troublesome Variability , 2017, bioRxiv.

[69]  Rex E. Jung,et al.  Multimodal Neuroimaging in Schizophrenia: Description and Dissemination , 2017, Neuroinformatics.

[70]  Thomas E. Nichols,et al.  Functional connectomics from resting-state fMRI , 2013, Trends in Cognitive Sciences.

[71]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[72]  James G. Scott,et al.  False Discovery Rate Regression: An Application to Neural Synchrony Detection in Primary Visual Cortex , 2013, Journal of the American Statistical Association.

[73]  Danielle S Bassett,et al.  Brain graphs: graphical models of the human brain connectome. , 2011, Annual review of clinical psychology.

[74]  Michael Angstadt,et al.  Volitional regulation of emotions produces distributed alterations in connectivity between visual, attention control, and default networks , 2014, NeuroImage.

[75]  Lawrence B. Holder,et al.  Graph-Based Concept Learning , 2001, FLAIRS Conference.

[76]  Gaël Varoquaux,et al.  Learning and comparing functional connectomes across subjects , 2013, NeuroImage.

[77]  Hongliang Fei,et al.  Boosting with structure information in the functional space: an application to graph classification , 2010, KDD.

[78]  Carey E. Priebe,et al.  Graph Classification Using Signal-Subgraphs: Applications in Statistical Connectomics , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[79]  George Karypis,et al.  Frequent Substructure-Based Approaches for Classifying Chemical Compounds , 2005, IEEE Trans. Knowl. Data Eng..

[80]  Can M. Le,et al.  Concentration and regularization of random graphs , 2015, Random Struct. Algorithms.

[81]  Rainer Goebel,et al.  Default Mode Network Connectivity as a Function of Familial and Environmental Risk for Psychotic Disorder , 2015, PloS one.

[82]  S. Debener,et al.  Default-mode brain dysfunction in mental disorders: A systematic review , 2009, Neuroscience & Biobehavioral Reviews.

[83]  Hans-Peter Kriegel,et al.  Protein function prediction via graph kernels , 2005, ISMB.

[84]  Ashwin Srinivasan,et al.  Theories for Mutagenicity: A Study in First-Order and Feature-Based Induction , 1996, Artif. Intell..

[85]  Sharon C. Lyter,et al.  Diagnostic and Statistical Manual of Mental Disorders: Making it Work for Social Work , 2012 .

[86]  Wei Cheng,et al.  Pattern Classification of Large-Scale Functional Brain Networks: Identification of Informative Neuroimaging Markers for Epilepsy , 2012, PloS one.

[87]  R Cameron Craddock,et al.  Disease state prediction from resting state functional connectivity , 2009, Magnetic resonance in medicine.

[88]  S. V. N. Vishwanathan,et al.  Graph kernels , 2007 .

[89]  Hongtu Zhu,et al.  Tensor Regression with Applications in Neuroimaging Data Analysis , 2012, Journal of the American Statistical Association.

[90]  Dimitri Van De Ville,et al.  Decoding brain states from fMRI connectivity graphs , 2011, NeuroImage.

[91]  C. Priebe,et al.  A Semiparametric Two-Sample Hypothesis Testing Problem for Random Graphs , 2017 .

[92]  Takashi Washio,et al.  An Apriori-Based Algorithm for Mining Frequent Substructures from Graph Data , 2000, PKDD.

[93]  Xi Chen,et al.  Smoothing proximal gradient method for general structured sparse regression , 2010, The Annals of Applied Statistics.

[94]  Juan Bustillo,et al.  Functional imaging of the hemodynamic sensory gating response in schizophrenia , 2013, Human brain mapping.

[95]  Lawrence B. Holder,et al.  Empirical comparison of graph classification algorithms , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[96]  Marina Vannucci,et al.  A spatiotemporal nonparametric Bayesian model of multi-subject fMRI data , 2016 .

[97]  Chao Gao,et al.  Achieving Optimal Misclassification Proportion in Stochastic Block Models , 2015, J. Mach. Learn. Res..

[98]  Julien Mairal,et al.  Structured sparsity through convex optimization , 2011, ArXiv.

[99]  Michael Brady,et al.  Improved Optimization for the Robust and Accurate Linear Registration and Motion Correction of Brain Images , 2002, NeuroImage.

[100]  W. Bunney,et al.  Evidence for a compromised dorsolateral prefrontal cortical parallel circuit in schizophrenia , 2000, Brain Research Reviews.