Identifying Candidate Genetic Associations with MRI-Derived AD-Related ROI via Tree-Guided Sparse Learning

Imaging genetics has attracted significant interests in recent studies. Traditional work has focused on mass-univariate statistical approaches that identify important single nucleotide polymorphisms (SNPs) associated with quantitative traits (QTs) of brain structure or function. More recently, to address the problem of multiple comparison and weak detection, multivariate analysis methods such as the least absolute shrinkage and selection operator (Lasso) are often used to select the most relevant SNPs associated with QTs. However, one problem of Lasso, as well as many other feature selection methods for imaging genetics, is that some useful prior information, e.g., the hierarchical structure among SNPs, are rarely used for designing a more powerful model. In this paper, we propose to identify the associations between candidate genetic features (i.e., SNPs) and magnetic resonance imaging (MRI)-derived measures using a tree-guided sparse learning (TGSL) method. The advantage of our method is that it explicitly models the complex hierarchical structure among the SNPs in the objective function for feature selection. Specifically, motivated by the biological knowledge, the hierarchical structures involving gene groups and linkage disequilibrium (LD) blocks as well as individual SNPs are imposed as a tree-guided regularization term in our TGSL model. Experimental studies on simulation data and the Alzheimer's Disease Neuroimaging Initiative (ADNI) data show that our method not only achieves better predictions than competing methods on the MRI-derived measures of AD-related region of interests (ROIs) (i.e., hippocampus, parahippocampal gyrus, and precuneus), but also identifies sparse SNP patterns at the block level to better guide the biological interpretation.

[1]  Michael Weiner,et al.  Predicting temporal lobe volume on MRI from genotypes using L1-L2 regularized regression , 2012, 2012 9th IEEE International Symposium on Biomedical Imaging (ISBI).

[2]  Paul M. Thompson,et al.  Identification of gene pathways implicated in Alzheimer's disease using longitudinal imaging phenotypes with sparse regression☆ , 2012, NeuroImage.

[3]  Jieping Ye,et al.  Moreau-Yosida Regularization for Grouped Tree Structure Learning , 2010, NIPS.

[4]  Michael Weiner,et al.  Whole genome association study of brain-wide imaging phenotypes for identifying quantitative trait loci in MCI and AD: A study of the ADNI cohort , 2010, NeuroImage.

[5]  Karl J. Friston,et al.  Voxel-Based Morphometry , 2015 .

[6]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[7]  Hongtu Zhu,et al.  Bayesian Generalized Low Rank Regression Models for Neuroimaging Phenotypes and Genetic Markers , 2014, Journal of the American Statistical Association.

[8]  Andrew J. Saykin,et al.  Voxelwise genome-wide association study (vGWAS) , 2010, NeuroImage.

[9]  Wei Zhang,et al.  INPP5D rs35349669 polymorphism with late-onset Alzheimer's disease: A replication study and meta-analysis , 2016, Oncotarget.

[10]  H. Zou,et al.  Regularization and variable selection via the elastic net , 2005 .

[11]  Christoph Lange,et al.  Quantitative trait prediction based on genetic marker-array data, a simulation study , 2011, Bioinform..

[12]  Michael W. Weiner,et al.  Discovery and Replication of Gene Influences on Brain Structure Using LASSO Regression , 2012, Front. Neurosci..

[13]  Daoqiang Zhang,et al.  Identifying Genetic Associations with MRI-derived Measures via Tree-Guided Sparse Learning , 2014, MICCAI.

[14]  Jianfeng Feng,et al.  Imaging genetics — towards discovery neuroscience , 2013, Quantitative Biology.

[15]  Daoqiang Zhang,et al.  Tree-Guided Sparse Coding for Brain Disease Classification , 2012, MICCAI.

[16]  Nicholas G Martin,et al.  Imaging genomics. , 2010, Current opinion in neurology.

[17]  Dinggang Shen,et al.  Structured Sparse Low-Rank Regression Model for Brain-Wide and Genome-Wide Associations , 2016, MICCAI.

[18]  Martin Klein,et al.  Precuneus atrophy in early-onset Alzheimer’s disease: a morphometric structural MRI study , 2007, Neuroradiology.

[19]  P. Thompson,et al.  Multilocus Genetic Analysis of Brain Images , 2011, Front. Gene..

[20]  I. Gottesman,et al.  The endophenotype concept in psychiatry: etymology and strategic intentions. , 2003, The American journal of psychiatry.

[21]  Mark Daly,et al.  Haploview: analysis and visualization of LD and haplotype maps , 2005, Bioinform..

[22]  Yurii Nesterov,et al.  Gradient methods for minimizing composite functions , 2012, Mathematical Programming.

[23]  Michael Weiner,et al.  Boosting Power to Detect Genetic Associations in Imaging Using Multi-locus, Genome-wide Scans and Ridge Regression , 2022 .

[24]  Mert R. Sabuncu,et al.  Joint Modeling of Imaging and Genetics , 2013, IPMI.

[25]  H. Uylings,et al.  Atrophy in the parahippocampal gyrus as an early biomarker of Alzheimer’s disease , 2010, Brain Structure and Function.

[26]  Jason H. Moore,et al.  Alzheimer's Disease Neuroimaging Initiative biomarkers as quantitative phenotypes: Genetics core aims, progress, and plans , 2010, Alzheimer's & Dementia.

[27]  D. Basak,et al.  Support Vector Regression , 2008 .

[28]  Bertrand Thirion,et al.  Multiscale Mining of fMRI Data with Hierarchical Structured Sparsity , 2012, SIAM J. Imaging Sci..

[29]  Shannon L. Risacher,et al.  From phenotype to genotype: an association study of longitudinal phenotypic markers to Alzheimer's disease relevant SNPs , 2012, Bioinform..

[30]  Marc Teboulle,et al.  A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[31]  N. Tzourio-Mazoyer,et al.  Automated Anatomical Labeling of Activations in SPM Using a Macroscopic Anatomical Parcellation of the MNI MRI Single-Subject Brain , 2002, NeuroImage.

[32]  David A Bennett,et al.  CD33: increased inclusion of exon 2 implicates the Ig V-set domain in Alzheimer's disease susceptibility. , 2014, Human molecular genetics.

[33]  Shannon L. Risacher,et al.  Identifying quantitative trait loci via group-sparse multitask regression and feature selection: an imaging genetics study of the ADNI cohort , 2012, Bioinform..

[34]  D. Blacker,et al.  Systematic meta-analyses of Alzheimer disease genetic association studies: the AlzGene database , 2007, Nature Genetics.

[35]  Polina Golland,et al.  Probabilistic Modeling of Imaging, Genetics and Diagnosis , 2016, IEEE Transactions on Medical Imaging.

[36]  Nan Hu,et al.  CD33 in Alzheimer's Disease , 2013, Molecular Neurobiology.

[37]  Judy H. Cho,et al.  Comparisons of multi‐marker association methods to detect association between a candidate region and disease , 2010, Genetic epidemiology.

[38]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[39]  G. Abecasis,et al.  MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes , 2010, Genetic epidemiology.

[40]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[41]  C. Jack,et al.  Prediction of AD with MRI-based hippocampal volume in mild cognitive impairment , 1999, Neurology.

[42]  Jieping Ye,et al.  Efficient Methods for Overlapping Group Lasso , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[43]  P. Thompson,et al.  Neuroimaging endophenotypes: Strategies for finding genes influencing brain structure and function , 2007, Human brain mapping.