Genetic association analysis: a primer on how it works, its strengths and its weaknesses.

Currently, the most widely used approach to mapping disease genes is the genome-wide association study, which uses large samples of cases and controls and hundreds of thousands of markers spread throughout the genome. This review focuses on explaining how an association study works, its strengths and weaknesses, and the methods available to analyse the data. It specifically addresses sample size, genetic effect sizes, epistasis, replication and population stratification, all of which an investigator must take into account when planning an association study of any complex disease. Finally, we describe some special features of association studies on the Y chromosome, and we contrast the characteristics of linkage analysis and association analysis.
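
As a rough illustration of the case-control comparison at a single marker, the sketch below runs an allelic chi-square test on a 2x2 table of allele counts and compares the p-value with a Bonferroni-corrected threshold. It is one common choice of test, not necessarily the one used in the review. The function names, the allele counts, the 0.05 significance level and the figure of 500,000 markers are all assumptions made for the example.

```python
import math


def allelic_chi_square(case_a, case_b, ctrl_a, ctrl_b):
    """Pearson chi-square (1 df) on a 2x2 table of allele counts.

    Rows are cases and controls; columns are the two alleles (A and B).
    Returns the statistic, its p-value and the allelic odds ratio.
    """
    n = case_a + case_b + ctrl_a + ctrl_b
    row_case = case_a + case_b
    row_ctrl = ctrl_a + ctrl_b
    col_a = case_a + ctrl_a
    col_b = case_b + ctrl_b
    # Shortcut formula for the Pearson statistic of a 2x2 table.
    chi2 = n * (case_a * ctrl_b - case_b * ctrl_a) ** 2 / (
        row_case * row_ctrl * col_a * col_b
    )
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    odds_ratio = (case_a * ctrl_b) / (case_b * ctrl_a)
    return chi2, p_value, odds_ratio


def bonferroni_threshold(alpha, n_tests):
    """Per-marker significance level after Bonferroni correction."""
    return alpha / n_tests


if __name__ == "__main__":
    # Hypothetical counts: 1,000 cases and 1,000 controls, 2 alleles each.
    chi2, p, odds = allelic_chi_square(600, 1400, 500, 1500)
    threshold = bonferroni_threshold(0.05, 500_000)  # assumed marker count
    print(f"chi2={chi2:.3f} p={p:.2e} OR={odds:.3f} threshold={threshold:.1e}")
    print("genome-wide significant:", p < threshold)
```

With these made-up counts the marker is nominally significant but does not pass the corrected threshold, which shows why modest effect sizes call for large samples once hundreds of thousands of markers are tested.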
