Discovery and Replication of Gene Influences on Brain Structure Using LASSO Regression

We implemented least absolute shrinkage and selection operator (LASSO) regression to evaluate gene effects in genome-wide association studies (GWAS) of brain images, using an MRI-derived temporal lobe volume measure from 729 subjects scanned as part of the Alzheimer’s Disease Neuroimaging Initiative (ADNI). Within each gene, LASSO selected a sparse group of SNPs, identifying an efficient set of variants that influence the data. The selected SNPs were then considered jointly when assessing their association with the neuroimaging measure. We discovered 22 genes that passed genome-wide significance for influencing temporal lobe volume, substantially more than were found with standard, univariate GWAS. These top genes are all expressed in the brain and include genes previously related to brain function or neuropsychiatric disorders, such as MACROD2, SORCS2, GRIN2B, MAGI2, NPAS3, CLSTN2, GABRG3, NRXN3, PRKAG2, GAS7, RBFOX1, ADARB2, CHD4, and CDH13. The top genes identified with this method also showed significant and widespread post hoc effects on voxelwise tensor-based morphometry (TBM) maps of the temporal lobes. The most significantly associated gene was MACROD2, an autism susceptibility gene. We replicated the effect of MACROD2 in an independent cohort of 564 healthy young adult Australian twins and siblings scanned with MRI (mean age: 23.8 ± 2.2 SD years). Our approach powerfully complements univariate techniques in detecting influences of genes on the living brain.
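
The within-gene selection step described above can be illustrated with a minimal sketch. This is not the authors' pipeline: the genotypes and phenotype are simulated, the function names (`soft_threshold`, `lasso_coordinate_descent`), the effect sizes, and the penalty value `lam=0.15` are assumptions chosen for illustration, and the subsequent joint significance test is not shown. The sketch fits a LASSO model by cyclic coordinate descent to the SNPs of one hypothetical gene and reports which SNPs keep nonzero coefficients.

```python
import random

random.seed(0)

def soft_threshold(z, gamma):
    """Shrink z toward zero by gamma; values within [-gamma, gamma] become 0."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0

def lasso_coordinate_descent(X, y, lam, n_sweeps=100):
    """Minimise (1/2n)*||y - X b||^2 + lam*||b||_1 by cyclic coordinate descent."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residual y - X @ beta, kept up to date after every update
    col_ms = [sum(row[j] ** 2 for row in X) / n for j in range(p)]
    for _ in range(n_sweeps):
        for j in range(p):
            if col_ms[j] == 0.0:
                continue  # monomorphic SNP carries no information
            # Covariance of SNP j with the residual that excludes SNP j's own term.
            rho = sum(X[i][j] * resid[i] for i in range(n)) / n + col_ms[j] * beta[j]
            new_bj = soft_threshold(rho, lam) / col_ms[j]
            delta = new_bj - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new_bj
    return beta

# Simulated data (assumption): 300 subjects, 10 SNPs in one hypothetical gene,
# each coded as a minor-allele count 0/1/2 with minor allele frequency 0.3.
n_subjects, n_snps, maf = 300, 10, 0.3
geno = [[(random.random() < maf) + (random.random() < maf) for _ in range(n_snps)]
        for _ in range(n_subjects)]

# Simulated phenotype (assumption): only SNPs 2 and 7 influence the volume measure.
y_raw = [1.0 * g[2] - 0.8 * g[7] + random.gauss(0.0, 0.5) for g in geno]

# Centre genotypes and phenotype so that no intercept term is needed.
snp_means = [sum(g[j] for g in geno) / n_subjects for j in range(n_snps)]
X = [[g[j] - snp_means[j] for j in range(n_snps)] for g in geno]
y_mean = sum(y_raw) / n_subjects
y = [v - y_mean for v in y_raw]

beta = lasso_coordinate_descent(X, y, lam=0.15)
selected = [j for j, b in enumerate(beta) if b != 0.0]
print("SNPs retained by LASSO:", selected)
```

In a gene-based analysis along these lines, the SNPs in `selected` would then enter a single regression model so that their joint association with the phenotype can be tested. The penalty would normally be tuned, for example by cross-validation, rather than fixed by hand as it is here.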
