Gene-Based Association Testing of Dichotomous Traits With Generalized Functional Linear Mixed Models Using Extended Pedigrees: Applications to Age-Related Macular Degeneration

Abstract Genetics plays a role in age-related macular degeneration (AMD), a common cause of blindness in the elderly. There is a need for powerful methods for carrying out region-based association tests between a dichotomous trait like AMD and genetic variants on family data. Here, we apply our new generalized functional linear mixed models (GFLMM) developed to test for gene-based association in a set of AMD families. Using common and rare variants, we observe significant association with two known AMD genes: CFH and ARMS2. Using rare variants, we find suggestive signals in four genes: ASAH1, CLEC6A, TMEM63C, and SGSM1. Intriguingly, ASAH1 is down-regulated in AMD aqueous humor, and ASAH1 deficiency leads to retinal inflammation and increased vulnerability to oxidative stress. These findings were made possible by our GFLMM which model the effect of a major gene as a fixed mean, the polygenic contributions as a random variation, and the correlation of pedigree members by kinship coefficients. Simulations indicate that the GFLMM likelihood ratio tests (LRTs) accurately control the Type I error rates. The LRTs have similar or higher power than existing retrospective kernel and burden statistics. Our GFLMM-based statistics provide a new tool for conducting family-based genetic studies of complex diseases. Supplementary materials for this article, including a standardized description of the materials available for reproducing the work, are available as an online supplement.

[1]  John D. Storey,et al.  Testing for genetic associations in arbitrarily structured populations , 2014, Nature Genetics.

[2]  D. Harville,et al.  Computational aspects of likelihood-based inference for variance components. , 1990 .

[3]  Aaron Y. Lee,et al.  Genome-wide association study of advanced age-related macular degeneration identifies a role of the hepatic lipase gene (LIPC) , 2010, Proceedings of the National Academy of Sciences.

[4]  Wei Chen,et al.  Gene Level Meta-Analysis of Quantitative Traits by Functional Linear Models , 2015, Genetics.

[5]  Keyan Zhao,et al.  An Arabidopsis Example of Association Mapping in Structured Samples , 2006, PLoS genetics.

[6]  William J. Astle,et al.  Population Structure and Cryptic Relatedness in Genetic Association Studies , 2009, 1010.4681.

[7]  T. Axenovich,et al.  FFBSKAT: Fast Family-Based Sequence Kernel Association Test , 2014, PloS one.

[8]  D. Schaid Mathematical and Statistical Methods for Genetic Analysis , 1999 .

[9]  M. Stephens,et al.  Genome-wide Efficient Mixed Model Analysis for Association Studies , 2012, Nature Genetics.

[10]  M. Stephens,et al.  Efficient multivariate linear mixed model algorithms for genome-wide association studies. , 2014, Nature methods.

[11]  B. Rosner,et al.  Association of CFH Y402H and LOC387715 A69S with progression of age-related macular degeneration. , 2007, JAMA.

[12]  T. Axenovich,et al.  Region-Based Association Test for Familial Data under Functional Linear Models , 2015, PloS one.

[13]  Momiao Xiong,et al.  Smoothed functional principal component analysis for testing association of the entire allelic spectrum of genetic variation , 2012, European Journal of Human Genetics.

[14]  Ding Xu,et al.  iTRAQ-based proteomics analysis of aqueous humor in patients with dry age-related macular degeneration. , 2019, International journal of ophthalmology.

[15]  Jacob A. Tennessen,et al.  Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes , 2012, Science.

[16]  H. F. Robinson,et al.  PRECISION OF ESTIMATES OF VARIANCE COMPONENTS , 1958 .

[17]  D. A. Barondess,et al.  Uncovering Local Trends in Genetic Effects of Multiple Phenotypes via Functional Linear Models , 2016, Genetic epidemiology.

[18]  M. Rieder,et al.  Optimal unified approach for rare-variant association testing with application to small-sample case-control whole-exome sequencing studies. , 2012, American journal of human genetics.

[19]  Momiao Xiong,et al.  Genome-wide gene–gene interaction analysis for next-generation sequencing , 2015, European Journal of Human Genetics.

[20]  Momiao Xiong,et al.  Quantitative trait locus analysis for next-generation sequencing with the functional linear models , 2012, Journal of Medical Genetics.

[21]  Matti Pirinen,et al.  Efficient computation with a linear mixed model on large-scale data sets with applications to genetic studies , 2012, 1207.4886.

[22]  David Heckerman,et al.  FaST-LMM-Select for addressing confounding from spatial structure and rare variants , 2013, Nature Genetics.

[23]  Benita J. O’Colmain,et al.  Prevalence of age-related macular degeneration in the United States. , 2004, Archives of ophthalmology.

[24]  Alan M. Kwong,et al.  Family-based exome sequencing identifies rare coding variants in age-related macular degeneration , 2020, Human molecular genetics.

[25]  S. Redline,et al.  Control for Population Structure and Relatedness for Binary Traits in Genetic Association Studies via Logistic Mixed Models. , 2016, American journal of human genetics.

[26]  Adam Kiezun,et al.  Exome sequencing and the genetic basis of complex traits , 2012, Nature Genetics.

[27]  Bjarni J. Vilhjálmsson,et al.  A mixed-model approach for genome-wide association studies of correlated traits in structured populations , 2012, Nature Genetics.

[28]  Iuliana Ionita-Laza,et al.  Sequence kernel association tests for the combined effect of rare and common variants. , 2013, American journal of human genetics.

[29]  P. Visscher,et al.  GCTA: a tool for genome-wide complex trait analysis. , 2011, American journal of human genetics.

[30]  Margaret A. Pericak-Vance,et al.  Genetic variants near TIMP3 and high-density lipoprotein–associated loci influence susceptibility to age-related macular degeneration , 2010, Proceedings of the National Academy of Sciences.

[31]  T. Thornton,et al.  Case-control association testing with related individuals: a more powerful quasi-likelihood score test. , 2007, American journal of human genetics.

[32]  H. Kang,et al.  Variance component model to account for sample structure in genome-wide association studies , 2010, Nature Genetics.

[33]  A. L. Rae,et al.  The analysis of binomial data by a generalized linear mixed model , 1985 .

[34]  James Y. Zou Analysis of protein-coding genetic variation in 60,706 humans , 2015, Nature.

[35]  Peter Kraft,et al.  Quality control and quality assurance in genotypic data for genome‐wide association studies , 2010, Genetic epidemiology.

[36]  Ying Liu,et al.  FaST linear mixed models for genome-wide association studies , 2011, Nature Methods.

[37]  A. Goate,et al.  Evaluation of Gene-Based Family-Based Methods to Detect Novel Genes Associated With Familial Late Onset Alzheimer Disease , 2018, bioRxiv.

[38]  Momiao Xiong,et al.  Association studies for next-generation sequencing. , 2011, Genome research.

[39]  Iuliana Ionita-Laza,et al.  Rare Variant Analysis for Family-Based Design , 2013, PloS one.

[40]  Gabriëlle H S Buitendijk,et al.  Seven New Loci Associated with Age-Related Macular Degeneration , 2013, Nature Genetics.

[41]  Tatiana I Axenovich,et al.  Rapid variance components–based method for whole-genome association analysis , 2012, Nature Genetics.

[42]  E. Sugano,et al.  Overexpression of acid ceramidase (ASAH1) protects retinal cells (ARPE19) from oxidative stress[S] , 2018, Journal of Lipid Research.

[43]  Alexander F. Wilson,et al.  Generalized Functional Linear Models for Gene‐Based Case‐Control Association Studies , 2014, Genetic epidemiology.

[44]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[45]  D. Weeks,et al.  A full genome scan for age-related maculopathy. , 2000, Human molecular genetics.

[46]  R. Schall Estimation in generalized linear models with random effects , 1991 .

[47]  Hsing,et al.  Functional Data Analysis , 2015 .

[48]  C. Haley,et al.  Genomewide Rapid Association Using Mixed Model and Regression: A Fast and Simple Method For Genomewide Pedigree-Based Quantitative Trait Loci Association Analysis , 2007, Genetics.

[49]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[50]  D. Bates,et al.  Fitting Linear Mixed-Effects Models Using lme4 , 2014, 1406.5823.

[51]  S. Gabriel,et al.  Calibrating a coalescent simulation of human genome sequence variation. , 2005, Genome research.

[52]  C. R. Henderson Applications of linear models in animal breeding , 1984 .

[53]  J. Haines,et al.  Age-related maculopathy: a genomewide scan with continued evidence of susceptibility loci within the 1q31, 10q26, and 17q25 regions. , 2004, American journal of human genetics.

[54]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[55]  G. Abecasis,et al.  Age-related macular degeneration: genetics and biology coming together. , 2014, Annual review of genomics and human genetics.

[56]  P. Visscher,et al.  Common SNPs explain a large proportion of heritability for human height , 2011 .

[57]  Momiao Xiong,et al.  Epistasis analysis for quantitative traits by functional regression model , 2014, Genome research.

[58]  V. Bansal,et al.  Statistical analysis strategies for association studies involving rare variants , 2010, Nature Reviews Genetics.

[59]  M. Marazita,et al.  Genome-wide Association Studies , 2012, Journal of dental research.

[60]  I. Pikuleva,et al.  The Interplay between Retinal Pathways of Cholesterol Output and Its Effects on Mouse Retina , 2019, Biomolecules.

[61]  D. Weeks,et al.  Gene‐Based Association Analysis for Censored Traits Via Fixed Effect Functional Regressions , 2016, Genetic epidemiology.

[62]  Bjarni J. Vilhjálmsson,et al.  An efficient multi-locus mixed model approach for genome-wide association studies in structured populations , 2012, Nature Genetics.

[63]  Xihong Lin,et al.  Rare-variant association testing for sequencing data with the sequence kernel association test. , 2011, American journal of human genetics.

[64]  Alexander F. Wilson,et al.  A Comparison Study of Fixed and Mixed Effect Models for Gene Level Association Studies of Complex Traits , 2016, Genetic epidemiology.

[65]  Nengjun Yi,et al.  A Sequence Kernel Association Test for Dichotomous Traits in Family Samples under a Generalized Linear Mixed Model , 2015, Human Heredity.

[66]  G. Dahlberg,et al.  Genetics of human populations. , 1948, Advances in genetics.

[67]  M. Boehnke,et al.  Meta-analysis of Complex Diseases at Gene Level with Generalized Functional Linear Models , 2015, Genetics.

[68]  P. Visscher,et al.  Mixed model with correction for case-control ascertainment increases association power. , 2015, American journal of human genetics.

[69]  G. Abecasis,et al.  Meta-analysis of genome scans of age-related macular degeneration. , 2005, Human molecular genetics.

[70]  Xihong Lin,et al.  GEE‐Based SNP Set Association Test for Continuous and Discrete Traits in Family‐Based Association Studies , 2013, Genetic epidemiology.

[71]  Daniel J Schaid,et al.  Multiple Genetic Variant Association Testing by Collapsing and Kernel Methods With Pedigree or Population Structured Data , 2013, Genetic epidemiology.

[72]  Eden R Martin,et al.  A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms , 2008, Genetic epidemiology.

[73]  M. McMullen,et al.  A unified mixed-model method for association mapping that accounts for multiple levels of relatedness , 2006, Nature Genetics.

[74]  D. Heckerman,et al.  Efficient Control of Population Structure in Model Organism Association Mapping , 2008, Genetics.

[75]  Qing Lu,et al.  Functional Analysis of Variance for Association Studies , 2014, PloS one.

[76]  Zhiwu Zhang,et al.  Mixed linear model approach adapted for genome-wide association studies , 2010, Nature Genetics.

[77]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[78]  D Gianola,et al.  Technical note: an R package for fitting generalized linear mixed models in animal breeding. , 2010, Journal of animal science.

[79]  Momiao Xiong,et al.  Pleiotropy Analysis of Quantitative Traits at Gene Level by Multivariate Functional Linear Models , 2015, Genetic epidemiology.

[80]  George A. Williams,et al.  The Age-Related Eye Disease Study (AREDS): design implications. AREDS report no. 1. , 1999, Controlled clinical trials.

[81]  Eleazar Eskin,et al.  Improved linear mixed models for genome-wide association studies , 2012, Nature Methods.

[82]  Yara T. E. Lechanteur,et al.  Nature Genetics Advance Online Publication , 2022 .

[83]  Mark I McCarthy,et al.  Genomic inflation factors under polygenic inheritance , 2011, European Journal of Human Genetics.

[84]  Spencer Graves,et al.  Functional Data Analysis with R and MATLAB , 2009 .

[85]  D. Gianola,et al.  Genome-Wide Association Studies with a Genomic Relationship Matrix: A Case Study with Wheat and Arabidopsis , 2016, G3: Genes, Genomes, Genetics.

[86]  F. Ferraty,et al.  The Oxford Handbook of Functional Data Analysis , 2011, Oxford Handbooks Online.

[87]  D. Clayton,et al.  A unified stepwise regression procedure for evaluating the relative effects of polymorphisms within a gene using case/control or family data: application to HLA in type 1 diabetes. , 2002, American journal of human genetics.

[88]  P. Visscher,et al.  Advantages and pitfalls in the application of mixed-model association methods , 2014, Nature Genetics.

[89]  Iuliana Ionita-Laza,et al.  Family-based association tests for sequence data, and comparisons with population-based association tests , 2013, European Journal of Human Genetics.

[90]  Jinliang Wang,et al.  Pedigrees or markers: Which are better in estimating relatedness and inbreeding coefficient? , 2016, Theoretical population biology.

[91]  Momiao Xiong,et al.  Functional Linear Models for Association Analysis of Quantitative Traits , 2013, Genetic epidemiology.

[92]  Piotr Kokoszka,et al.  Inference for Functional Data with Applications , 2012 .