The success of the genome-wide association approach: a brief story of a long struggle

The genome-wide association approach has been the most powerful and efficient study design thus far in identifying genetic variants that are associated with complex human diseases. This approach became feasible as the result of several key advancements in genetic knowledge, genotyping technologies, statistical analysis algorithms and the availability of large collections of cases and controls. With all these necessary tools in hand, many genome-wide association studies were recently completed, and many more studies which will explore the genetic basis of various complex diseases and quantitative traits are soon to come. This approach has started to reap the fruits of its labor over the past several months. Publications of genome-wide association studies in several complex diseases such as inflammatory bowel disease, type-2 diabetes, breast cancer and prostate cancer have been abundant in the first half of this year. The aims of this review are firstly, to provide a timely summary for most of the genome-wide association studies that have been published until June/July 2007 and secondly, to evaluate to what extent these results have been validated in subsequent replication studies.

[1]  P. Donnelly,et al.  The effects of human population structure on large genetic association studies , 2004, Nature Genetics.

[2]  Eric S. Lander,et al.  The common PPARγ Pro12Ala polymorphism is associated with decreased risk of type 2 diabetes , 2000, Nature Genetics.

[3]  Response to Comments on "A Common Genetic Variant Is Associated with Adult and Childhood Obesity" , 2007, Science.

[4]  A. Whittemore,et al.  Multiple regions within 8q24 independently affect risk for prostate cancer , 2007, Nature Genetics.

[5]  H. Drummond,et al.  IL23R Arg381Gln is associated with childhood onset inflammatory bowel disease in Scotland , 2007, Gut.

[6]  R. A. Bailey,et al.  Robust associations of four new chromosome regions from genome-wide analyses of type 1 diabetes , 2007, Nature Genetics.

[7]  Tim Hubbard Finishing the euchromatic sequence of the human genome , 2004 .

[8]  F. Hu,et al.  A Common Genetic Variant Is Associated with Adult and Childhood Obesity , 2006, Science.

[9]  Lisa M. Schwartz,et al.  A genetic risk factor for periodic limb movements in sleep. , 2008, The New England journal of medicine.

[10]  Jonathan C. Cohen,et al.  A Common Allele on Chromosome 9 Associated with Coronary Heart Disease , 2007, Science.

[11]  Mark Atkinson,et al.  Large-scale genetic fine mapping and genotype-phenotype associations implicate polymorphism in the IL2RA region in type 1 diabetes , 2007, Nature Genetics.

[12]  C. Gieger,et al.  Genomewide association analysis of coronary artery disease. , 2007, The New England journal of medicine.

[13]  D. Conrad,et al.  A worldwide survey of haplotype variation and linkage disequilibrium in the human genome , 2006, Nature Genetics.

[14]  A. Gylfason,et al.  A Common Variant on Chromosome 9p21 Affects the Risk of Myocardial Infarction , 2007, Science.

[15]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[16]  Xiaofeng Zhu,et al.  The Association of a SNP Upstream of INSIG2 with Body Mass Index is Reproduced in Several but Not All Cohorts , 2007, PLoS genetics.

[17]  R. Myers,et al.  Lack of replication of thirteen single-nucleotide polymorphisms implicated in Parkinson's disease: a large-scale international study , 2006, The Lancet Neurology.

[18]  R. Redon,et al.  Relative Impact of Nucleotide and Copy Number Variation on Gene Expression Phenotypes , 2007, Science.

[19]  S. O’Rahilly,et al.  Comment on "A Common Genetic Variant Is Associated with Adult and Childhood Obesity" , 2007, Science.

[20]  H. Lachman,et al.  Increase in GSK3β gene copy number variation in bipolar disorder , 2007, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[21]  Jacques Fellay,et al.  A Whole-Genome Association Study of Major Determinants for Host Control of HIV-1 , 2007, Science.

[22]  Adam S. Kibel,et al.  A common variant associated with prostate cancer in European and African populations , 2007 .

[23]  Lester L. Peters,et al.  Genome-wide association study identifies novel breast cancer susceptibility loci , 2007, Nature.

[24]  Hemant K Tiwari,et al.  Problems with Genome-Wide Association Studies , 2007, Science.

[25]  M. Daly,et al.  Transferability of tag SNPs in genetic association studies in multiple populations , 2006, Nature Genetics.

[26]  Dmitri V Zaykin,et al.  Ranks of Genuine Associations in Whole-Genome Scans , 2005, Genetics.

[27]  T. Hudson,et al.  A genome-wide association study identifies novel risk loci for type 2 diabetes , 2007, Nature.

[28]  Geoffrey B. Nilsen,et al.  Whole-Genome Patterns of Common DNA Variation in Three Human Populations , 2005, Science.

[29]  L. Feuk,et al.  Detection of large-scale variation in the human genome , 2004, Nature Genetics.

[30]  Chi Pui Pang,et al.  HTRA1 promoter polymorphism in wet age-related macular degeneration. , 2007, Science.

[31]  Michael Krawczak,et al.  A genome-wide association scan identifies the hepatic cholesterol transporter ABCG8 as a susceptibility factor for human gallstone disease , 2007, Nature Genetics.

[32]  Kuixing Zhang,et al.  Whole-genome analysis of sporadic amyotrophic lateral sclerosis. , 2007, The New England journal of medicine.

[33]  D. Conrad,et al.  Global variation in copy number in the human genome , 2006, Nature.

[34]  S. Gabriel,et al.  Risk alleles for multiple sclerosis identified by a genomewide study. , 2007, The New England journal of medicine.

[35]  D. Gudbjartsson,et al.  Common variants on chromosomes 2q35 and 16q12 confer susceptibility to estrogen receptor–positive breast cancer , 2007, Nature Genetics.

[36]  Eric Boerwinkle,et al.  Population-based resequencing of ANGPTL4 uncovers variations that reduce triglycerides and increase HDL , 2007, Nature Genetics.

[37]  P. Donnelly,et al.  New models of collaboration in genome-wide association studies: the Genetic Association Information Network , 2007, Nature Genetics.

[38]  M. Jarvelin,et al.  A Common Variant in the FTO Gene Is Associated with Body Mass Index and Predisposes to Childhood and Adult Obesity , 2007, Science.

[39]  Sonja W. Scholz,et al.  Conflicting results regarding the semaphorin gene (SEMA5A) and the risk for Parkinson disease. , 2006, American journal of human genetics.

[40]  J. Gilbert,et al.  Complement Factor H Variant Increases the Risk of Age-Related Macular Degeneration , 2005, Science.

[41]  D. Bentley,et al.  Whole-genome re-sequencing. , 2006, Current opinion in genetics & development.

[42]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[43]  Ammarin Thakkinstian,et al.  Systematic review and meta-analysis of the association between complementary factor H Y402H polymorphisms and age-related macular degeneration , 2006 .

[44]  J. Catanese,et al.  A case-control association study of the 12 single-nucleotide polymorphisms implicated in Parkinson disease by a recent genome scan. , 2006, American journal of human genetics.

[45]  C. Sabatti,et al.  Tag SNPs chosen from HapMap perform well in several population isolates , 2007, Genetic epidemiology.

[46]  Marcia M. Nizzari,et al.  Genome-Wide Association Analysis Identifies Loci for Type 2 Diabetes and Triglyceride Levels , 2007, Science.

[47]  H. Völzke,et al.  Comment on "A Common Genetic Variant Is Associated with Adult and Childhood Obesity" , 2007, Science.

[48]  Kari Stefansson,et al.  Common Sequence Variants in the LOXL1 Gene Confer Susceptibility to Exfoliation Glaucoma , 2007, Science.

[49]  R. Ophoff,et al.  ITPR2 as a susceptibility gene in sporadic amyotrophic lateral sclerosis: a genome-wide association study , 2007, The Lancet Neurology.

[50]  R. Klein,et al.  Power analysis for genome-wide association studies , 2007, BMC Genetics.

[51]  D. Gudbjartsson,et al.  Genome-wide association study identifies a second prostate cancer susceptibility variant at 8q24 , 2007, Nature Genetics.

[52]  Christian Gieger,et al.  Genome-wide association study of restless legs syndrome identifies common variants in three genomic regions , 2007, Nature Genetics.

[53]  Sonja W. Scholz,et al.  A genome-wide genotyping study in patients with ischaemic stroke: initial analysis and data release , 2007, The Lancet Neurology.

[54]  Mourad Sahbatou,et al.  Association of NOD2 leucine-rich repeat variants with susceptibility to Crohn's disease , 2001, Nature.

[55]  P. Donnelly,et al.  Replicating genotype–phenotype associations , 2007, Nature.

[56]  Kenny Q. Ye,et al.  Large-Scale Copy Number Polymorphism in the Human Genome , 2004, Science.

[57]  Philippe Froguel,et al.  TCF7L2 is reproducibly associated with type 2 diabetes in various ethnic groups: a global meta-analysis , 2007, Journal of Molecular Medicine.

[58]  Sinead B. O'Leary,et al.  Genetic variation in the 5q31 cytokine gene cluster confers susceptibility to Crohn disease , 2001, Nature Genetics.

[59]  W. Willett,et al.  A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer , 2007, Nature Genetics.

[60]  M. Daly,et al.  Genome-wide association studies for common diseases and complex traits , 2005, Nature Reviews Genetics.

[61]  Kenny Q. Ye,et al.  Strong Association of De Novo Copy Number Mutations with Autism , 2007, Science.

[62]  Pauline C Ng,et al.  Power to Detect Risk Alleles Using Genome-Wide Tag SNP Panels , 2007, PLoS genetics.

[63]  K. Gunderson,et al.  Whole genome genotyping technologies on the BeadArray™ platform , 2007 .

[64]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[65]  Sonja W. Scholz,et al.  Genome-wide genotyping in Parkinson's disease and neurologically normal controls: first stage analysis and public release of data , 2006, The Lancet Neurology.

[66]  M. McCarthy,et al.  Large-scale association studies of variants in genes encoding the pancreatic beta-cell KATP channel subunits Kir6.2 (KCNJ11) and SUR1 (ABCC8) confirm that the KCNJ11 E23K variant is associated with type 2 diabetes. , 2003, Diabetes.

[67]  J. Ott,et al.  Complement Factor H Polymorphism in Age-Related Macular Degeneration , 2005, Science.

[68]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[69]  Simon Heath,et al.  Novel Crohn Disease Locus Identified by Genome-Wide Association Maps to a Gene Desert on 5p13.1 and Modulates Expression of PTGER4 , 2007, PLoS genetics.

[70]  J. Hoh,et al.  HTRA1 promoter polymorphism predisposes Japanese to age-related macular degeneration , 2007, Molecular vision.

[71]  Judy H. Cho,et al.  A Genome-Wide Association Study Identifies IL23R as an Inflammatory Bowel Disease Gene , 2006, Science.

[72]  Philippe Froguel,et al.  FCGR3B copy number variation is associated with susceptibility to systemic, but not organ-specific, autoimmunity , 2007, Nature Genetics.

[73]  Eric E. Smith,et al.  Variants conferring risk of atrial fibrillation on chromosome 4q25 , 2007, Nature.

[74]  N Risch,et al.  The Future of Genetic Studies of Complex Human Diseases , 1996, Science.

[75]  Oliver Sieber,et al.  A genome-wide association scan of tag SNPs identifies a susceptibility variant for colorectal cancer at 8q24.21 , 2007, Nature Genetics.

[76]  S. Gabriel,et al.  Two independent alleles at 6q23 associated with risk of rheumatoid arthritis , 2007, Nature Genetics.

[77]  Tariq Ahmad,et al.  Confirmation of the role of ATG16l1 as a Crohn's disease susceptibility gene , 2007, Inflammatory bowel diseases.

[78]  K. Gunderson,et al.  A genome-wide scalable SNP genotyping assay using microarray technology , 2005, Nature Genetics.

[79]  D. Clayton,et al.  Genome-wide association studies: theoretical and practical concerns , 2005, Nature Reviews Genetics.

[80]  P. Deloukas,et al.  IL23R Variation Determines Susceptibility But Not Disease Phenotype in Inflammatory Bowel Disease , 2007, Gastroenterology.

[81]  A. Edwards,et al.  Complement Factor H Polymorphism and Age-Related Macular Degeneration , 2005, Science.

[82]  N. Camp,et al.  A Variant of the HTRA1 Gene Increases Susceptibility to Age-Related Macular Degeneration , 2006, Science.

[83]  Gonçalo R. Abecasis,et al.  Genetic variants regulating ORMDL3 expression contribute to the risk of childhood asthma , 2007, Nature.

[84]  Judy H Cho,et al.  Genome-wide association study identifies new susceptibility loci for Crohn disease and implicates autophagy in disease pathogenesis , 2007, Nature Genetics.

[85]  Thomas Lengauer,et al.  A genome-wide association scan of nonsynonymous SNPs identifies a susceptibility variant for Crohn disease in ATG16L1 , 2007, Nature Genetics.

[86]  J. Gulcher,et al.  A variant in CDKAL1 influences insulin response and risk of type 2 diabetes , 2007, Nature Genetics.

[87]  Itsik Pe'er,et al.  Evaluating potential for whole-genome studies in Kosrae, an isolated population in Micronesia , 2006, Nature Genetics.

[88]  L. Cardon,et al.  Contribution of the novel inflammatory bowel disease gene IL23R to disease susceptibility and phenotype , 2007, Inflammatory bowel diseases.

[89]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[90]  Bi Zhou,et al.  Gene copy-number variation and associated polymorphisms of complement component C4 in human systemic lupus erythematosus (SLE): low copy number is a risk factor for and high copy number is a protective factor against SLE susceptibility in European Americans. , 2007, American journal of human genetics.

[91]  Jaana M. Hartikainen,et al.  A common coding variant in CASP8 is associated with breast cancer risk , 2007, Nature Genetics.

[92]  A. Goris,et al.  No evidence for association with Parkinson disease for 13 single-nucleotide polymorphisms identified by whole-genome association screening. , 2006, American journal of human genetics.

[93]  C. Dina,et al.  Comment on "A Common Genetic Variant Is Associated with Adult and Childhood Obesity" , 2007, Science.

[94]  M. McCarthy,et al.  Replication of Genome-Wide Association Signals in UK Samples Reveals Risk Loci for Type 2 Diabetes , 2007, Science.

[95]  Mariza de Andrade,et al.  High-resolution whole-genome association study of Parkinson disease. , 2005, American journal of human genetics.

[96]  Joseph T. Glessner,et al.  A genome-wide association study identifies KIAA0350 as a type 1 diabetes gene , 2007, Nature.

[97]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[98]  Kari Stefansson,et al.  A genetic risk factor for periodic limb movements in sleep. , 2007, The New England journal of medicine.

[99]  P. Fearnhead,et al.  Genome-wide association study of prostate cancer identifies a second risk locus at 8q24 , 2007, Nature Genetics.

[100]  Lon R Cardon,et al.  Evaluating coverage of genome-wide association studies , 2006, Nature Genetics.

[101]  S. Fisher,et al.  A nonsynonymous SNP in ATG16L1 predisposes to ileal Crohn's disease and is independent of CARD15 and IBD5. , 2007, Gastroenterology.

[102]  Paul T. Groth,et al.  The ENCODE (ENCyclopedia Of DNA Elements) Project , 2004, Science.

[103]  A. Verma,et al.  Risk Alleles for Multiple Sclerosis Identified by a Genomewide Study , 2008 .

[104]  G. Abecasis,et al.  A Genome-Wide Association Study of Type 2 Diabetes in Finns Detects Multiple Susceptibility Variants , 2007, Science.

[105]  Alastair Forbes,et al.  Sequence variants in the autophagy gene IRGM and multiple other replicating loci contribute to Crohn's disease susceptibility , 2007, Nature Genetics.

[106]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[107]  G. Galbraith,et al.  TRAF1-C5 as a Risk Locus for Rheumatoid Arthritis—A Genomewide Study , 2008 .

[108]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[109]  S. P. Fodor,et al.  Large-scale genotyping of complex DNA , 2003, Nature Biotechnology.

[110]  Timothy B. Stockwell,et al.  The Sequence of the Human Genome , 2001, Science.

[111]  E S Lander,et al.  The common PPARgamma Pro12Ala polymorphism is associated with decreased risk of type 2 diabetes. , 2000, Nature genetics.

[112]  F. Crick,et al.  Molecular Structure of Nucleic Acids: A Structure for Deoxyribose Nucleic Acid , 1953, Nature.

[113]  Steven Gallinger,et al.  Genome-wide association scan identifies a colorectal cancer susceptibility locus on chromosome 8q24 , 2007, Nature Genetics.

[114]  H. Stefánsson,et al.  Variant of transcription factor 7-like 2 (TCF7L2) gene confers risk of type 2 diabetes , 2006, Nature Genetics.

[115]  J. Lupski Structural variation in the human genome. , 2007, The New England journal of medicine.

[116]  Jonathan C. Cohen,et al.  Multiple Rare Alleles Contribute to Low Plasma Levels of HDL Cholesterol , 2004, Science.

[117]  Iwona Wrobel,et al.  IL‐23 receptor (IL‐23R) gene protects against pediatric Crohn's disease , 2007, Inflammatory bowel diseases.

[118]  J. Ragoussis,et al.  Matrix-Assisted Laser Desorption/Ionisation, Time-of-Flight Mass Spectrometry in Genomics Research , 2006, PLoS genetics.

[119]  M. Daly,et al.  A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms , 2001, Nature.

[120]  J. Hoh,et al.  Systematic review and meta-analysis of the association between complement factor H Y402H polymorphisms and age-related macular degeneration. , 2006, Human molecular genetics.

[121]  Sonja W. Scholz,et al.  Genome-wide genotyping in amyotrophic lateral sclerosis and neurologically normal controls: first stage analysis and public release of data , 2007, The Lancet Neurology.

[122]  Thomas J. Liesegang,et al.  The sequence of the human genome. Venter JC,∗ Adams MD, Myers EW, et al. Science 2001;291:1304–1351. , 2001 .