Detecting rare variants for complex traits using family and unrelated data

Large genome‐wide association studies (GWAS) have been performed to detect common genetic variants involved in common diseases, but most of the variants found this way account for only a small portion of the trait variance. Furthermore, candidate gene‐based resequencing suggests that many rare genetic variants contribute to the trait variance of common diseases. Here we propose two designs, sibpair and unrelated‐case designs, to detect rare genetic variants in either a candidate gene‐based or genome‐wide association analysis. First we show that we can detect and classify together rare risk haplotypes using a relatively small sample with either of these designs, and then have increased power to test association in a larger case‐control sample. This method can also be applied to resequencing data. Next we apply the method to the Wellcome Trust Case Control Consortium (WTCCC) coronary artery disease (CAD) and hypertension (HT) data, the latter being the only trait for which no genome‐wide association evidence was reported in the original WTCCC study, and identify one interesting gene associated with HT and four associated with CAD at a genome‐wide significance level of 5%. These results suggest that searching for rare genetic variants is feasible and can be fruitful in current GWAS, candidate gene studies or resequencing studies. Genet. Epidemiol. 34: 171–187, 2010. © 2009 Wiley‐Liss, Inc.

[1]  J. Haines,et al.  Gene dose of apolipoprotein E type 4 allele and the risk of Alzheimer's disease in late onset families. , 1993, Science.

[2]  E. Lander The New Genomics: Global Views of Biology , 1996, Science.

[3]  N Risch,et al.  The Future of Genetic Studies of Complex Human Diseases , 1996, Science.

[4]  A. Chakravarti Population genetics—making sense out of sequence , 1999, Nature Genetics.

[5]  N. Risch Searching for genetic determinants in the new millennium , 2000, Nature.

[6]  K. Weiss,et al.  How many diseases does it take to map a gene with SNPs? , 2000, Nature Genetics.

[7]  J. Weber,et al.  Quantitative trait loci on chromosomes 3 and 17 influence phenotypes of the metabolic syndrome. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[8]  E. Boerwinkle,et al.  Apolipoprotein E variation at the sequence haplotype level: implications for the origin and maintenance of a major human polymorphism. , 2000, American journal of human genetics.

[9]  J. Pritchard Are rare variants responsible for susceptibility to complex diseases? , 2001, American journal of human genetics.

[10]  P. Donnelly,et al.  A new statistical method for haplotype reconstruction from population data. , 2001, American journal of human genetics.

[11]  E. Lander,et al.  On the allelic spectrum of human disease. , 2001, Trends in genetics : TIG.

[12]  R C Elston,et al.  Linkage and association analysis of angiotensin I-converting enzyme (ACE)-gene polymorphisms with ACE concentration and blood pressure. , 2001, American journal of human genetics.

[13]  G. Abecasis,et al.  Merlin—rapid analysis of dense genetic maps using sparse gene flow trees , 2002, Nature Genetics.

[14]  S. Gabriel,et al.  The Structure of Haplotype Blocks in the Human Genome , 2002, Science.

[15]  A. Chakravarti,et al.  A genome-wide scan for obesity in African-Americans. , 2002, Diabetes.

[16]  A. Luke,et al.  Linkage for BMI at 3q27 region confirmed in an African-American population. , 2003, Diabetes.

[17]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[18]  Jonathan C. Cohen,et al.  Multiple Rare Alleles Contribute to Low Plasma Levels of HDL Cholesterol , 2004, Science.

[19]  B. Nordestgaard,et al.  Genetic variation in ABC transporter A1 contributes to HDL cholesterol in the general population. , 2004, The Journal of clinical investigation.

[20]  D. Nyholt A simple correction for multiple testing for single-nucleotide polymorphisms in linkage disequilibrium with each other. , 2004, American journal of human genetics.

[21]  P. Donnelly,et al.  The effects of human population structure on large genetic association studies , 2004, Nature Genetics.

[22]  Alexander Pertsemlidis,et al.  Low LDL cholesterol in individuals of African descent resulting from frequent nonsense mutations in PCSK9 , 2005, Nature Genetics.

[23]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[24]  Xiaofeng Zhu,et al.  Haplotypes produced from rare variants in the promoter and coding regions of angiotensinogen contribute to variation in angiotensinogen levels. , 2005, Human molecular genetics.

[25]  Jonathan C. Cohen,et al.  A spectrum of PCSK9 alleles contributes to plasma levels of low-density lipoprotein cholesterol. , 2006, American journal of human genetics.

[26]  Céline Bellenguez,et al.  New susceptibility locus for hypertension on chromosome 8q by efficient pedigree-breaking in an Italian isolate. , 2006, Human molecular genetics.

[27]  Jonathan C. Cohen,et al.  Sequence variations in PCSK9, low LDL, and protection against coronary heart disease. , 2006, The New England journal of medicine.

[28]  Paul Scheet,et al.  A fast and flexible statistical model for large-scale population genotype data: applications to inferring missing genotypes and haplotypic phase. , 2006, American journal of human genetics.

[29]  Jonathan C. Cohen,et al.  Multiple rare variants in NPC1L1 associated with reduced sterol absorption and plasma low-density lipoprotein levels. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Jennifer R. Harris,et al.  Combined Genome Scans for Body Stature in 6,602 European Twins: Evidence for Common Caucasian Loci , 2007, PLoS genetics.

[31]  Kai Wang,et al.  Pathway-based approaches for analysis of genomewide association studies. , 2007, American journal of human genetics.

[32]  M. McCarthy,et al.  Replication of Genome-Wide Association Signals in UK Samples Reveals Risk Loci for Type 2 Diabetes , 2007, Science.

[33]  C. Gieger,et al.  Genomewide association analysis of coronary artery disease. , 2007, The New England journal of medicine.

[34]  Eric Boerwinkle,et al.  Population-based resequencing of ANGPTL4 uncovers variations that reduce triglycerides and increase HDL , 2007, Nature Genetics.

[35]  Marcia M. Nizzari,et al.  Genome-Wide Association Analysis Identifies Loci for Type 2 Diabetes and Triglyceride Levels , 2007, Science.

[36]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[37]  Robert C Elston,et al.  The genetic basis of complex traits: rare variants or "common gene, common disease"? , 2007, Methods in molecular biology.

[38]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[39]  B. Browning,et al.  Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. , 2007, American journal of human genetics.

[40]  S. Gabriel,et al.  Two independent alleles at 6q23 associated with risk of rheumatoid arthritis , 2007, Nature Genetics.

[41]  Tom Walsh,et al.  Ten genes for inherited breast cancer. , 2007, Cancer cell.

[42]  D. Strachan,et al.  Rheumatoid arthritis association at 6q23 , 2007, Nature Genetics.

[43]  Nazneen Rahman,et al.  The emerging landscape of breast cancer susceptibility , 2007, Nature Genetics.

[44]  M. McCarthy,et al.  Genome-wide association studies for complex traits: consensus, uncertainty and challenges , 2008, Nature Reviews Genetics.

[45]  S. Leal,et al.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. , 2008, American journal of human genetics.

[46]  C. Gieger,et al.  Identification of ten loci associated with height highlights new biological pathways in human growth , 2008, Nature Genetics.

[47]  David M. Evans,et al.  Genome-wide association analysis identifies 20 loci that influence adult height , 2008, Nature Genetics.

[48]  Bjarni V. Halldórsson,et al.  Many sequence variants affecting diversity of adult human height , 2008, Nature Genetics.

[49]  Hongyu Zhao,et al.  Rare independent mutations in renal salt handling genes contribute to blood pressure variation , 2008, Nature Genetics.

[50]  M. Rieder,et al.  Estimating coverage and power for genetic association studies using near-complete variation data , 2008, Nature Genetics.

[51]  W. Bodmer,et al.  Common and rare variants in multifactorial susceptibility to common diseases , 2008, Nature Genetics.

[52]  Peter M Visscher,et al.  Sizing up human height variation , 2008, Nature Genetics.

[53]  Wei Guo,et al.  Generalized linear modeling with regularization for detecting common disease rare haplotype association , 2009, Genetic epidemiology.