High dimensional classification for spatially dependent data with application to neuroimaging

Discriminating patients with Alzheimer's disease (AD) from healthy subjects is a crucial task in the research of Alzheimer's disease. The task can be potentially achieved by linear discriminant analysis (LDA), which is one of the most classical and popular classification techniques. However, the classification problem becomes challenging for LDA because of the high-dimensionally and the spatial dependency of the brain imaging data. To address the challenges, researchers have proposed various ways to generalize LDA into high-dimensional context in recent years. However, these existing methods did not reach any consensus on how to incorporate spatially dependent structure. In light of the current needs and limitations, we propose a new classification method, named as Penalized Maximum Likelihood Estimation LDA (PMLE-LDA). The proposed method uses $Mat\acute{e}rn$ covariance function to describe the spatial correlation of brain regions. Additionally, PMLE is designed to model the sparsity of high-dimensional features. The spatial location information is used to address the singularity of the covariance. Tapering technique is introduced to reduce computational burden. We show in theory that the proposed method can not only provide consistent results of parameter estimation and feature selection, but also generate an asymptotically optimal classifier driven by high dimensional data with specific spatially dependent structure. Finally, the method is validated through simulations and an application into ADNI data for classifying Alzheimer's patients.

[1]  R. Tibshirani,et al.  Penalized classification using Fisher's linear discriminant , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[2]  M. Steen-Adams,et al.  On Estimation and Selection of Autologistic Regression Models via Penalized Pseudolikelihood , 2013 .

[3]  K. Worsley Developments in Random Field Theory , 2003 .

[4]  Anru Zhang,et al.  Minimax rate-optimal estimation of high-dimensional covariance matrices with incomplete data , 2016, J. Multivar. Anal..

[5]  Tapabrata Maiti,et al.  Variable selection for binary spatial regression: Penalized quasi-likelihood approach. , 2016, Biometrics.

[6]  L. Fahrmeir,et al.  Spatial Bayesian Variable Selection With Application to Functional Magnetic Resonance Imaging , 2007 .

[7]  Harrison H. Zhou,et al.  Estimating structured high-dimensional covariance and precision matrices: Optimal rates and adaptive estimation , 2016 .

[8]  J. Shao,et al.  Sparse linear discriminant analysis by thresholding for high dimensional data , 2011, 1105.3561.

[9]  M. Lindquist The Statistical Analysis of fMRI Data. , 2008, 0906.3662.

[10]  K. Mardia,et al.  Maximum likelihood estimation of models for residual covariance in spatial regression , 1984 .

[11]  Carl D. Meyer,et al.  Matrix Analysis and Applied Linear Algebra , 2000 .

[12]  P. Bickel,et al.  Some theory for Fisher''s linear discriminant function , 2004 .

[13]  H. Zou,et al.  A direct approach to sparse discriminant analysis in ultra-high dimensions , 2012 .

[14]  Jun Shao,et al.  SPARSE QUADRATIC DISCRIMINANT ANALYSIS FOR HIGH DIMENSIONAL DATA , 2015 .

[15]  Perla E. Reyes,et al.  Selection of Spatial-Temporal Lattice Models: Assessing the Impact of Climate Conditions on a Mountain Pine Beetle Outbreak , 2012 .

[16]  Linjun Zhang,et al.  High dimensional linear discriminant analysis: optimality, adaptive algorithm and missing data , 2018, Journal of the Royal Statistical Society: Series B (Statistical Methodology).

[17]  Adam J. Rothman,et al.  Sparse permutation invariant covariance estimation , 2008, 0801.4837.

[18]  Charles R. Johnson,et al.  Topics in Matrix Analysis , 1991 .

[19]  Bernard W. Silverman Penalized Maximum Likelihood Estimation , 2004 .

[20]  John Hughes,et al.  Fast, fully Bayesian spatiotemporal inference for fMRI data. , 2016, Biostatistics.

[21]  Hsin-Cheng Huang,et al.  Optimal Geostatistical Model Selection , 2007 .

[22]  Xin Zhang,et al.  Parsimonious Tensor Response Regression , 2015, 1501.07815.

[23]  F. Bowman Brain Imaging Analysis. , 2014, Annual review of statistics and its application.

[24]  R. Tibshirani,et al.  Diagnosis of multiple cancer types by shrunken centroids of gene expression , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[25]  P. Bickel,et al.  Covariance regularization by thresholding , 2009, 0901.3079.

[26]  Lixing Zhu,et al.  Covariance-enhanced discriminant analysis. , 2015, Biometrika.

[27]  Jennifer A Hoeting,et al.  Model selection for geostatistical models. , 2006, Ecological applications : a publication of the Ecological Society of America.

[28]  T. Cai,et al.  A Direct Estimation Approach to Sparse Linear Discriminant Analysis , 2011, 1107.3442.

[29]  H. Zou,et al.  One-step Sparse Estimates in Nonconcave Penalized Likelihood Models. , 2008, Annals of statistics.

[30]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[31]  Xin Zhang,et al.  Covariate-Adjusted Tensor Classification in High Dimensions , 2018, Journal of the American Statistical Association.

[32]  R W Cox,et al.  AFNI: software for analysis and visualization of functional magnetic resonance neuroimages. , 1996, Computers and biomedical research, an international journal.

[33]  Owen Carmichael,et al.  Standardization of analysis sets for reporting results from ADNI MRI data , 2013, Alzheimer's & Dementia.

[34]  Noel A Cressie,et al.  Statistics for Spatial Data. , 1992 .

[35]  Yang Feng,et al.  A road to classification in high dimensional space: the regularized optimal affine discriminant , 2010, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[36]  P. Bickel,et al.  Regularized estimation of large covariance matrices , 2008, 0803.1909.

[37]  T. Maiti,et al.  Additive model building for spatial regression , 2017 .

[38]  Jun Zhu,et al.  Web-based Supplementary Materials for "On selection of spatial linear models for lattice data" , 2009 .

[39]  Jun Zhu,et al.  Penalized maximum likelihood estimation and variable selection in geostatistics , 2011, 1109.0320.

[40]  Holger Wendland,et al.  Piecewise polynomial, positive definite and compactly supported radial functions of minimal degree , 1995, Adv. Comput. Math..

[41]  Franccois Bachoc,et al.  On the smallest eigenvalues of covariance matrices of multivariate spatial processes , 2016, 1602.02882.

[42]  David C. Zhu,et al.  Early prediction of Alzheimer’s disease using longitudinal volumetric MRI data from ADNI , 2019, Health Services and Outcomes Research Methodology.

[43]  Jianqing Fan,et al.  Variable Selection via Nonconcave Penalized Likelihood and its Oracle Properties , 2001 .