A kernel machine method for detecting effects of interaction between multidimensional variable sets: An imaging genetics application

Measurements derived from neuroimaging data can serve as markers of disease and/or healthy development, are largely heritable, and have been increasingly utilized as (intermediate) phenotypes in genetic association studies. To date, imaging genetic studies have mostly focused on discovering isolated genetic effects, typically ignoring potential interactions with non-genetic variables such as disease risk factors, environmental exposures, and epigenetic markers. However, identifying significant interaction effects is critical for revealing the true relationship between genetic and phenotypic variables, and shedding light on disease mechanisms. In this paper, we present a general kernel machine based method for detecting effects of the interaction between multidimensional variable sets. This method can model the joint and epistatic effect of a collection of single nucleotide polymorphisms (SNPs), accommodate multiple factors that potentially moderate genetic influences, and test for nonlinear interactions between sets of variables in a flexible framework. As a demonstration of application, we applied the method to the data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) to detect the effects of the interactions between candidate Alzheimer's disease (AD) risk genes and a collection of cardiovascular disease (CVD) risk factors, on hippocampal volume measurements derived from structural brain magnetic resonance imaging (MRI) scans. Our method identified that two genes, CR1 and EPHA1, demonstrate significant interactions with CVD risk factors on hippocampal volume, suggesting that CR1 and EPHA1 may play a role in influencing AD-related neurodegeneration in the presence of CVD risks.

[1]  Mert R. Sabuncu,et al.  Statistical analysis of longitudinal neuroimage data with Linear Mixed Effects models , 2013, NeuroImage.

[2]  Xihong Lin,et al.  Semiparametric Regression of Multidimensional Genetic Pathway Data: Least‐Squares Kernel Machines and Linear Mixed Models , 2007, Biometrics.

[3]  Nick C Fox,et al.  Common variants in ABCA7, MS4A6A/MS4A4E, EPHA1, CD33 and CD2AP are associated with Alzheimer’s disease , 2011, Nature Genetics.

[4]  J Tuomilehto,et al.  Midlife vascular risk factors and Alzheimer's disease in later life: longitudinal, population based study , 2001, BMJ.

[5]  D. G. Clark,et al.  Common variants in MS4A4/MS4A6E, CD2uAP, CD33, and EPHA1 are associated with late-onset Alzheimer’s disease , 2011, Nature Genetics.

[6]  G. Abecasis,et al.  MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes , 2010, Genetic epidemiology.

[7]  Patrick F. Sullivan,et al.  Genetic architectures of psychiatric disorders: the emerging picture and its implications , 2012, Nature Reviews Genetics.

[8]  E. Zeggini,et al.  An Evaluation of Statistical Approaches to Rare Variant Analysis in Genetic Association Studies , 2009, Genetic epidemiology.

[9]  A. Caspi,et al.  Influence of Life Stress on Depression: Moderation by a Polymorphism in the 5-HTT Gene , 2003, Science.

[10]  P. Visscher,et al.  Estimating the proportion of variation in susceptibility to schizophrenia captured by common SNPs , 2012, Nature Genetics.

[11]  John Hardy,et al.  The genetic architecture of Alzheimer's disease: beyond APP, PSENs and APOE , 2012, Neurobiology of Aging.

[12]  A. Caspi,et al.  Role of Genotype in the Cycle of Violence in Maltreated Children , 2002, Science.

[13]  Xihong Lin Variance component testing in generalised linear models with random effects , 1997 .

[14]  Vijaya L. Melnick,et al.  Alzheimer’s Dementia , 1985, Contemporary Issues in Biomedicine, Ethics, and Society.

[15]  N. Laird,et al.  Maximum likelihood computations with repeated measures: application of the EM algorithm , 1987 .

[16]  M. Rieder,et al.  Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies. , 2012, American journal of human genetics.

[17]  A. Meyer-Lindenberg,et al.  Intermediate phenotypes and genetic mechanisms of psychiatric disorders , 2006, Nature Reviews Neuroscience.

[18]  N. Aronszajn Theory of Reproducing Kernels. , 1950 .

[19]  P. Bosco,et al.  Genome-wide association study identifies variants at CLU and CR1 associated with Alzheimer's disease , 2009, Nature Genetics.

[20]  Daniel J Schaid,et al.  Genomic Similarity and Kernel Methods II: Methods for Genomic Information , 2010, Human Heredity.

[21]  Paul M. Thompson,et al.  Increasing power for voxel-wise genome-wide association studies: The random field theory, least square kernel machines and fast permutation procedures , 2012, NeuroImage.

[22]  Juan Pablo Lewinger,et al.  Sample size requirements to detect gene‐environment interactions in genome‐wide association studies , 2011, Genetic epidemiology.

[23]  Debashis Ghosh,et al.  Equivalence of Kernel Machine Regression and Kernel Distance Covariance for Multidimensional Trait Association Studies , 2014, 1402.2679.

[24]  G. Wahba,et al.  Smoothing spline ANOVA for exponential families, with application to the Wisconsin Epidemiological Study of Diabetic Retinopathy : the 1994 Neyman Memorial Lecture , 1995 .

[25]  Andrew J. Saykin,et al.  Voxelwise genome-wide association study (vGWAS) , 2010, NeuroImage.

[26]  Guillén Fernández,et al.  CR1 genotype is associated with entorhinal cortex volume in young healthy adults , 2011, Neurobiology of Aging.

[27]  Xihong Lin,et al.  Rare-variant association testing for sequencing data with the sequence kernel association test. , 2011, American journal of human genetics.

[28]  Alzheimer's Disease Neuroimaging Initiative,et al.  Genome-wide association with MRI atrophy measures as a quantitative trait locus for Alzheimer's disease , 2011, Molecular Psychiatry.

[29]  L S Honig,et al.  Aggregation of vascular risk factors and risk of incident Alzheimer disease , 2005, Neurology.

[30]  P. Visscher,et al.  Five years of GWAS discovery. , 2012, American journal of human genetics.

[31]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[32]  Paul M. Ridker,et al.  On the Use of Variance per Genotype as a Tool to Identify Quantitative Trait Interaction Effects: A Report from the Women's Genome Health Study , 2010, PLoS genetics.

[33]  W. Jagust,et al.  Vascular burden and Alzheimer disease pathologic progression , 2012, Neurology.

[34]  Xihong Lin,et al.  Hypothesis testing in semiparametric additive mixed models. , 2003, Biostatistics.

[35]  Brigitte Landeau,et al.  Morphological brain plasticity induced by musical expertise is accompanied by modulation of functional connectivity at rest , 2014, NeuroImage.

[36]  Mark E. Schmidt,et al.  The Alzheimer's Disease Neuroimaging Initiative: A review of papers published since its inception , 2012, Alzheimer's & Dementia.

[37]  Peter Kraft,et al.  Exploiting Gene-Environment Interaction to Detect Genetic Associations , 2007, Human Heredity.

[38]  Daniel J Schaid,et al.  Genomic Similarity and Kernel Methods I: Advancements by Building on Mathematical and Statistical Foundations , 2010, Human Heredity.

[39]  Carolyn Hutter,et al.  Powerful Cocktail Methods for Detecting Genome‐Wide Gene‐Environment Interaction , 2012, Genetic epidemiology.

[40]  L. Kiemeney,et al.  Corrigendum: Genetic variation in the prostate stem cell antigen gene PSCA confers susceptibility to urinary bladder cancer , 2009, Nature Genetics.

[41]  Anders M. Dale,et al.  Cortical Surface-Based Analysis I. Segmentation and Surface Reconstruction , 1999, NeuroImage.

[42]  A. Dale,et al.  Whole Brain Segmentation Automated Labeling of Neuroanatomical Structures in the Human Brain , 2002, Neuron.

[43]  M. Nalls,et al.  Effect of Complement CR1 on Brain Amyloid Burden During Aging and Its Modification by APOE Genotype , 2013, Biological Psychiatry.

[44]  James Shields,et al.  Etiology of Psychosis. (Book Reviews: Schizophrenia and Genetics. A Twin Study Vantage Point) , 1972 .

[45]  A. Dale,et al.  Cortical Surface-Based Analysis II: Inflation, Flattening, and a Surface-Based Coordinate System , 1999, NeuroImage.

[46]  Xihong Lin,et al.  Test for interactions between a genetic marker set and environment in generalized linear models. , 2013, Biostatistics.

[47]  B. Winblad,et al.  Combined effects of APOE genotype, blood pressure, and antihypertensive drug use on incident AD , 2003, Neurology.

[48]  G. Schellenberg,et al.  Developmental and vascular risk factors for Alzheimer's disease , 2005, Neurobiology of Aging.

[49]  M. Pencina,et al.  General Cardiovascular Risk Profile for Use in Primary Care: The Framingham Heart Study , 2008, Circulation.

[50]  Deanne M. Taylor,et al.  Powerful SNP-set analysis for case-control genome-wide association studies. , 2010, American journal of human genetics.

[51]  W. G. Hill,et al.  Genome partitioning of genetic variation for complex traits using common SNPs , 2011, Nature Genetics.

[52]  Yuehua Cui,et al.  Gene-centric gene–gene interaction: A model-based kernel machine method , 2012, 1209.6502.

[53]  Yaakov Stern,et al.  Contribution of vascular risk factors to the progression in Alzheimer disease. , 2009, Archives of neurology.

[54]  D. Bates,et al.  Newton-Raphson and EM Algorithms for Linear Mixed-Effects Models for Repeated-Measures Data , 1988 .

[55]  H. Hendrie,et al.  Cardiovascular Risk Factors and Incident Alzheimer Disease: A Systematic Review of the Literature , 2009, Alzheimer disease and associated disorders.

[56]  Jianfeng Feng,et al.  Imaging genetics — towards discovery neuroscience , 2013, Quantitative Biology.

[57]  Epidemiology Branch Genome-Wide Meta-Analysis of Joint Tests for Genetic and Gene-Environment Interaction Effects , 2015 .

[58]  Grace Wahba,et al.  Spline Models for Observational Data , 1990 .

[59]  Mert R. Sabuncu,et al.  Spatiotemporal Linear Mixed Effects Modeling for the Mass-univariate Analysis of Longitudinal Neuroimage Data ⁎ for the Alzheimer's Disease Neuroimaging Initiative 1 , 2022 .

[60]  G. Wahba,et al.  Some results on Tchebycheffian spline functions , 1971 .

[61]  M. Kenward,et al.  Small sample inference for fixed effects from restricted maximum likelihood. , 1997, Biometrics.

[62]  Denise C. Park,et al.  Toward defining the preclinical stages of Alzheimer’s disease: Recommendations from the National Institute on Aging-Alzheimer's Association workgroups on diagnostic guidelines for Alzheimer's disease , 2011, Alzheimer's & Dementia.

[63]  Bin Zhao,et al.  Cardiovascular disease contributes to Alzheimer's disease: evidence from large-scale genome-wide association studies , 2014, Neurobiology of Aging.

[64]  Mert Sabuncu,et al.  Genetic variation and neuroimaging measures in Alzheimer disease. , 2010, Archives of neurology.

[65]  I. Gottesman,et al.  The endophenotype concept in psychiatry: etymology and strategic intentions. , 2003, The American journal of psychiatry.

[66]  Jason J. Corneveaux,et al.  CR1 is associated with amyloid plaque burden and age‐related cognitive decline , 2011, Annals of neurology.

[67]  Thomas W. Mühleisen,et al.  Genome-wide association study identifies variants at CLU and PICALM associated with Alzheimer's disease , 2013, Nature Genetics.

[68]  P. Visscher,et al.  Common polygenic variation contributes to risk of schizophrenia and bipolar disorder , 2009, Nature.

[69]  L. Kuller,et al.  Enhanced risk for Alzheimer disease in persons with type 2 diabetes and APOE epsilon4: the Cardiovascular Health Study Cognition Study. , 2008, Archives of neurology.

[70]  Chong Gu Smoothing Spline Anova Models , 2002 .

[71]  G. Wahba,et al.  Smoothing Spline ANOVA with Component-Wise Bayesian “Confidence Intervals” , 1993 .

[72]  Paul M. Thompson,et al.  Genetics of the connectome , 2013, NeuroImage.

[73]  MD,et al.  Genetic variation at CR 1 increases risk of cerebral amyloid angiopathy , 2012 .

[74]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[75]  David A. Bennett,et al.  REST and Stress Resistance in Aging and Alzheimer’s Disease , 2014, Nature.

[76]  Berislav V. Zlokovic,et al.  Neurovascular mechanisms of Alzheimer's neurodegeneration , 2005, Trends in Neurosciences.

[77]  Debashis Ghosh,et al.  Equivalence of kernel machine regression and kernel distance covariance for multidimensional phenotype association studies , 2015, Biometrics.

[78]  S. Browning,et al.  A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic , 2009, PLoS genetics.

[79]  Nick C Fox,et al.  Letter abstract - Genome-wide association study identifies variants at CLU and PICALM associated with Alzheimer's Disease , 2009 .

[80]  Nikos Makris,et al.  Automatically parcellating the human cerebral cortex. , 2004, Cerebral cortex.

[81]  R. Buckner,et al.  The association between a polygenic Alzheimer score and cortical thickness in clinically normal subjects. , 2012, Cerebral cortex.

[82]  Nick C Fox,et al.  Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer's disease , 2013, Nature Genetics.

[83]  Stephan Ripke,et al.  Estimating the proportion of variation in susceptibility to schizophrenia captured by common SNPs , 2012, Nature Genetics.

[84]  Bruce Fischl,et al.  FreeSurfer , 2012, NeuroImage.

[85]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[86]  Karl J. Friston,et al.  Psychophysiological and Modulatory Interactions in Neuroimaging , 1997, NeuroImage.

[87]  D. Harville Maximum Likelihood Approaches to Variance Component Estimation and to Related Problems , 1977 .

[88]  Bhramar Mukherjee,et al.  Exploiting Gene‐Environment Independence for Analysis of Case–Control Studies: An Empirical Bayes‐Type Shrinkage Estimator to Trade‐Off between Bias and Efficiency , 2008, Biometrics.

[89]  Mert R. Sabuncu,et al.  Event time analysis of longitudinal neuroimage data , 2014, NeuroImage.

[90]  Yuedong Wang Smoothing Spline ANOVA , 2011 .

[91]  L. Cortellini,et al.  Genetic variation at CR1 increases risk of cerebral amyloid angiopathy , 2012, Neurology.

[92]  Xihong Lin,et al.  A powerful and flexible multilocus association test for quantitative traits. , 2008, American journal of human genetics.