Disorders: Filling the Gaps and Exploring Complexity in Genome-Wide Association Studies

Genome-wide association scans (GWASs) using single nucleotide polymorphisms (SNPs) have been completed successfully for several common disorders and have detected over 30 new associations. Given the large sample sizes and genome-wide SNP coverage of these scans, one might have expected many of the common variants underlying the genetic component of these disorders to have been identified by now. However, the studies have not evaluated the contribution of other forms of genetic variation, notably structural variation, which occurs mainly as copy number variants (CNVs). Known CNVs account for over 15% of the assembled human genome sequence. Because CNVs are not easily tagged by SNPs, can show a wide range of copy number variability, and often fall in genomic regions that are poorly covered by whole-genome arrays or were not genotyped by the HapMap project, current GWASs have largely missed the contribution of CNVs to complex disorders. Indeed, candidate gene and candidate region approaches have already found some CNVs to be associated with several complex disorders, which underscores the importance of regions not investigated in current GWASs. These findings reveal the need for new-generation arrays (some already on the market) and for tailored approaches that explore the full extent of genome variability beyond the single-nucleotide scale.
