A Genomic Background Based Method for Association Analysis in Related Individuals

Background Feasibility of genotyping of hundreds and thousands of single nucleotide polymorphisms (SNPs) in thousands of study subjects have triggered the need for fast, powerful, and reliable methods for genome-wide association analysis. Here we consider a situation when study participants are genetically related (e.g. due to systematic sampling of families or because a study was performed in a genetically isolated population). Of the available methods that account for relatedness, the Measured Genotype (MG) approach is considered the ‘gold standard’. However, MG is not efficient with respect to time taken for the analysis of genome-wide data. In this context we proposed a fast two-step method called Genome-wide Association using Mixed Model and Regression (GRAMMAR) for the analysis of pedigree-based quantitative traits. This method certainly overcomes the drawback of time limitation of the measured genotype (MG) approach, but pays in power. One of the major drawbacks of both MG and GRAMMAR, is that they crucially depend on the availability of complete and correct pedigree data, which is rarely available. Methodology In this study we first explore type 1 error and relative power of MG, GRAMMAR, and Genomic Control (GC) approaches for genetic association analysis. Secondly, we propose an extension to GRAMMAR i.e. GRAMMAR-GC. Finally, we propose application of GRAMMAR-GC using the kinship matrix estimated through genomic marker data, instead of (possibly missing and/or incorrect) genealogy. Conclusion Through simulations we show that MG approach maintains high power across a range of heritabilities and possible pedigree structures, and always outperforms other contemporary methods. We also show that the power of our proposed GRAMMAR-GC approaches to that of the ‘gold standard’ MG for all models and pedigrees studied. We show that this method is both feasible and powerful and has correct type 1 error in the context of genome-wide association analysis in related individuals.

[1]  E. Boerwinkle,et al.  The use of measured genotype information in the analysis of quantitative phenotypes in man , 1986, Annals of human genetics.

[2]  C. V. van Duijn,et al.  The Effect of Genetic Drift in a Young Genetically Isolated Population , 2005 .

[3]  Saurabh Ghosh,et al.  Mapping quantitative trait loci in humans: achievements and limitations. , 2005, The Journal of clinical investigation.

[4]  Cornelia M van Duijn,et al.  The effect of genetic drift in a young genetically isolated population. , 2005, Annals of human genetics.

[5]  C. Haley,et al.  Genomewide Rapid Association Using Mixed Model and Regression: A Fast and Simple Method For Genomewide Pedigree-Based Quantitative Trait Loci Association Analysis , 2007, Genetics.

[6]  Christoph Lange,et al.  Power and design considerations for a general class of family-based association tests: quantitative traits. , 2002, American journal of human genetics.

[7]  J. Blangero,et al.  BioMed Central , 2001 .

[8]  Sebastian Zöllner,et al.  Coalescent-Based Association Mapping and Fine Mapping of Complex Trait Loci , 2005, Genetics.

[9]  K. Roeder,et al.  Genomic Control for Association Studies , 1999, Biometrics.

[10]  Kenneth Lange,et al.  Association testing with Mendel , 2005, Genetic epidemiology.

[11]  G. Abecasis,et al.  A general test of association for quantitative traits in nuclear families. , 2000, American journal of human genetics.

[12]  Kathryn Roeder,et al.  Association studies for quantitative traits in structured populations , 2002, Genetic epidemiology.

[13]  C. Sing,et al.  Role of the apolipoprotein E polymorphism in determining normal plasma lipid and lipoprotein variation. , 1985, American journal of human genetics.

[14]  N Risch,et al.  The Future of Genetic Studies of Complex Human Diseases , 1996, Science.

[15]  Yurii S. Aulchenko,et al.  BIOINFORMATICS APPLICATIONS NOTE doi:10.1093/bioinformatics/btm108 Genetics and population analysis GenABEL: an R library for genome-wide association analysis , 2022 .

[16]  Robin Thompson,et al.  ASREML user guide release 1.0 , 2002 .

[17]  D. Clayton,et al.  Population structure, differential bias and genomic control in a large-scale, case-control association study , 2005, Nature Genetics.

[18]  Bernard Prum,et al.  Estimation of the inbreeding coefficient through use of genomic data. , 2003, American journal of human genetics.

[19]  J. Witteman,et al.  Heritabilities, apolipoprotein E, and effects of inbreeding on plasma lipids in a genetically isolated population: The Erasmus Rucphen Family Study , 2007, European Journal of Epidemiology.

[20]  T. Hudson,et al.  A genome-wide association study identifies novel risk loci for type 2 diabetes , 2007, Nature.

[21]  Xin Xu,et al.  Family‐based tests for associating haplotypes with general phenotype data: Application to asthma genetics , 2004, Genetic epidemiology.

[22]  J. Gulcher,et al.  A variant in CDKAL1 influences insulin response and risk of type 2 diabetes , 2007, Nature Genetics.

[23]  Andrew P Morris,et al.  Linkage disequilibrium mapping via cladistic analysis of single-nucleotide polymorphism haplotypes. , 2004, American journal of human genetics.