Uncovering the roles of rare variants in common disease through whole-genome sequencing

Although genome-wide association (GWA) studies for common variants have thus far succeeded in explaining only a modest fraction of the genetic components of human common diseases, recent advances in next-generation sequencing technologies could rapidly facilitate substantial progress. This outcome is expected if much of the missing genetic control is due to gene variants that are too rare to be picked up by GWA studies and have relatively large effects on risk. Here, we evaluate the evidence for an important role of rare gene variants of major effect in common diseases and outline discovery strategies for their identification.

[1]  A. Motulsky Drug reactions enzymes, and biochemical genetics. , 1957, Journal of the American Medical Association.

[2]  Shenduo Li,et al.  Epidemiology and Etiology , 1990 .

[3]  J J Goedert,et al.  Genetic Restriction of HIV-1 Infection and Progression to AIDS by a Deletion Allele of the CKR5 Structural Gene , 1996, Science.

[4]  Steven M. Wolinsky,et al.  The role of a mutant CCR5 allele in HIV–1 transmission and disease progression , 1996, Nature Medicine.

[5]  Richard A Koup,et al.  Homozygous Defect in HIV-1 Coreceptor Accounts for Resistance of Some Multiply-Exposed Individuals to HIV-1 Infection , 1996, Cell.

[6]  Marc Parmentier,et al.  Resistance to HIV-1 infection in Caucasian individuals bearing mutant alleles of the CCR-5 chemokine receptor gene , 1996, Nature.

[7]  N Risch,et al.  The Future of Genetic Studies of Complex Human Diseases , 1996, Science.

[8]  Michael Krawczak,et al.  The human gene mutation database , 1998, Nucleic Acids Res..

[9]  J. Pritchard Are rare variants responsible for susceptibility to complex diseases? , 2001, American journal of human genetics.

[10]  E. Lander,et al.  On the allelic spectrum of human disease. , 2001, Trends in genetics : TIG.

[11]  C. Moore,et al.  Association between presence of HLA-B*5701, HLA-DR7, and HLA-DQ3 and hypersensitivity to HIV-1 reverse-transcriptase inhibitor abacavir , 2002, The Lancet.

[12]  J. Pritchard,et al.  The allelic architecture of human disease genes: common disease-common variant...or not? , 2002, Human molecular genetics.

[13]  D. Botstein,et al.  Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease , 2003, Nature Genetics.

[14]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[15]  S. Humphries,et al.  The molecular genetics of cardiovascular disease: clinical implications , 2003, Journal of internal medicine.

[16]  I. James,et al.  Predisposition to abacavir hypersensitivity conferred by HLA-B*5701 and a haplotypic Hsp70-Hom variant , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[17]  M Ingelman-Sundberg,et al.  Genetic polymorphisms of cytochrome P450 2D6 (CYP2D6): clinical consequences, evolutionary aspects and functional diversity , 2005, The Pharmacogenomics Journal.

[18]  N. Orr,et al.  A common CFH haplotype, with deletion of CFHR1 and CFHR3, is associated with lower risk of age-related macular degeneration , 2006, Nature Genetics.

[19]  M. Steinberg,et al.  Modifier genes and sickle cell anemia , 2006, Current opinion in hematology.

[20]  Don H. Anderson,et al.  Extended haplotypes in the complement factor H (CFH) and CFH-related (CFHR) family of genes protect against age-related macular degeneration: characterization, ethnic distribution and evolutionary implications. , 2006, Annals of medicine.

[21]  Sarah Barber,et al.  Oligonucleotide microarray analysis of genomic imbalance in children with mental retardation. , 2006, American journal of human genetics.

[22]  R. A. Bailey,et al.  Robust associations of four new chromosome regions from genome-wide analyses of type 1 diabetes , 2007, Nature Genetics.

[23]  Steven Gallinger,et al.  Genome-wide association scan identifies a colorectal cancer susceptibility locus on chromosome 8q24 , 2007, Nature Genetics.

[24]  Thomas Bourgeron,et al.  Mapping autism risk loci using genetic linkage and chromosomal rearrangements , 2007, Nature Genetics.

[25]  Oliver Sieber,et al.  A genome-wide association scan of tag SNPs identifies a susceptibility variant for colorectal cancer at 8q24.21 , 2007, Nature Genetics.

[26]  S. Gruber,et al.  Genetic variation in 8q24 associated with risk of colorectal cancer , 2007, Cancer biology & therapy.

[27]  M. Jarvelin,et al.  A Common Variant in the FTO Gene Is Associated with Body Mass Index and Predisposes to Childhood and Adult Obesity , 2007, Science.

[28]  T. Crow,et al.  How and why genetic linkage has not solved the problem of psychosis: review and hypothesis. , 2007, The American journal of psychiatry.

[29]  T. Frayling Genome–wide association studies provide new insights into type 2 diabetes aetiology , 2007, Nature Reviews Genetics.

[30]  J. Willson,et al.  Colon carcinoma cells harboring PIK3CA mutations display resistance to growth factor deprivation induced apoptosis , 2007, Molecular Cancer Therapeutics.

[31]  Marcia M. Nizzari,et al.  Genome-Wide Association Analysis Identifies Loci for Type 2 Diabetes and Triglyceride Levels , 2007, Science.

[32]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[33]  G. Abecasis,et al.  A Genome-Wide Association Study of Type 2 Diabetes in Finns Detects Multiple Susceptibility Variants , 2007, Science.

[34]  C Thomas Caskey,et al.  The drug development crisis: efficiency and safety. , 2007, Annual review of medicine.

[35]  Fikret Erdogan,et al.  Comparative genome hybridization suggests a role for NRXN1 and APBA2 in schizophrenia. , 2007, Human molecular genetics.

[36]  K. Squires,et al.  Increased HIV infection rate among violent deaths: a mortuary study in the Republic of Congo , 2008, AIDS (London).

[37]  Francis S Collins,et al.  A HapMap harvest of insights into the genetics of common disease. , 2008, The Journal of clinical investigation.

[38]  M. McCarthy,et al.  Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes , 2008, Nature Genetics.

[39]  R. Collins,et al.  SLCO1B1 variants and statin-induced myopathy--a genomewide study. , 2008, The New England journal of medicine.

[40]  Deletion of CFHR3 and CFHR1 genes in age-related macular degeneration. , 2008, Human molecular genetics.

[41]  Thomas W. Mühleisen,et al.  Large recurrent microdeletions associated with schizophrenia , 2008, Nature.

[42]  David M. Evans,et al.  Genome-wide association analysis identifies 20 loci that influence adult height , 2008, Nature Genetics.

[43]  K. Shianna,et al.  Tissue-Specific Genetic Control of Splicing: Implications for the Study of Complex Traits , 2008, PLoS biology.

[44]  Nancy F. Hansen,et al.  Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry , 2008, Nature.

[45]  Henry A. Nasrallah,et al.  Schizophrenia, “Just the Facts” What we know in 2008. 2. Epidemiology and etiology , 2008, Schizophrenia Research.

[46]  M. McCarthy,et al.  Genome-wide association studies: potential next steps on a genetic journey. , 2008, Human molecular genetics.

[47]  B. Maher Personal genomes: The case of the missing heritability , 2008, Nature.

[48]  Judy H. Cho,et al.  Genome-wide association defines more than 30 distinct susceptibility loci for Crohn's disease , 2008, Nature Genetics.

[49]  D. Conrad,et al.  Recurrent 16p11.2 microdeletions in autism. , 2007, Human molecular genetics.

[50]  Henry A. Nasrallah,et al.  Schizophrenia, “Just the Facts”: What we know in 2008 Part 1: Overview , 2008, Schizophrenia Research.

[51]  W. Bodmer,et al.  Common and rare variants in multifactorial susceptibility to common diseases , 2008, Nature Genetics.

[52]  Timothy B. Stockwell,et al.  Genetic Variation in an Individual Human Exome , 2008, PLoS genetics.

[53]  A. Serretti,et al.  The genetics of bipolar disorder: genome ‘hot regions,’ genes, new potential candidates and future directions , 2008, Molecular Psychiatry.

[54]  Shah Ebrahim,et al.  Common variants in the GDF5-UQCC region are associated with variation in human height , 2008, Nature Genetics.

[55]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[56]  Dawei Li,et al.  The diploid genome sequence of an Asian individual , 2008, Nature.

[57]  B. Ponder,et al.  Allele-Specific Up-Regulation of FGFR2 Increases Susceptibility to Breast Cancer , 2008, PLoS biology.

[58]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[59]  K. Dewar,et al.  Targeted screening of cis-regulatory variation in human haplotypes. , 2008, Genome research.

[60]  Kenny Q. Ye,et al.  Sensitive and accurate detection of copy number variants using read depth of coverage. , 2009, Genome research.

[61]  H. Hakonarson,et al.  Genomic Landscape of a Three-Generation Pedigree Segregating Affective Disorder , 2009, PloS one.

[62]  K. Shianna,et al.  A Genome-Wide Association Study in Chronic Obstructive Pulmonary Disease (COPD): Identification of Two Major Susceptibility Loci , 2009, PLoS genetics.

[63]  I. Tikhonova,et al.  Genetic diagnosis by whole exome capture and massively parallel DNA sequencing , 2009, Proceedings of the National Academy of Sciences.

[64]  S. Thein,et al.  Discovering the genetics underlying foetal haemoglobin production in adults , 2009, British journal of haematology.

[65]  David B. Goldstein,et al.  A Genome-Wide Investigation of SNPs and CNVs in Schizophrenia , 2009, PLoS genetics.

[66]  L. Prokunina-Olsson,et al.  No effect of cancer-associated SNP rs6983267 in the 8q24 region on co-expression of MYC and TCF7L2 in normal colon tissue , 2009, Molecular Cancer.

[67]  Hui Guo,et al.  MapView: visualization of short reads alignment on a desktop computer , 2009, Bioinform..

[68]  Jianxin Shi,et al.  Common variants on chromosome 6p22.1 are associated with schizophrenia , 2009, Nature.

[69]  Heinrich Magnus Manske,et al.  LookSeq: a browser-based viewer for deep sequencing data. , 2009, Genome research.

[70]  K. Frazer,et al.  Common vs. rare allele hypotheses for complex diseases. , 2009, Current opinion in genetics & development.

[71]  Süleyman Cenk Sahinalp,et al.  Combinatorial Algorithms for Structural Variation Detection in High Throughput Sequenced Genomes , 2009, RECOMB.

[72]  Yuan Lin,et al.  The HuRef Browser: a web resource for individual human genomics , 2008, Nucleic Acids Res..

[73]  Anna L. Gloyn,et al.  Coexpression of the Type 2 Diabetes Susceptibility Gene Variants KCNJ11 E23K and ABCC8 S1369A Alter the ATP and Sulfonylurea Sensitivities of the ATP-Sensitive K+ Channel , 2009, Diabetes.

[74]  R. Plomin,et al.  Common disorders are quantitative traits , 2009, Nature Reviews Genetics.

[75]  D. Goldstein Common genetic variation and human traits. , 2009, The New England journal of medicine.

[76]  M. Daly,et al.  HLA-B*5701 genotype is a major determinant of drug-induced liver injury due to flucloxacillin , 2009, Nature Genetics.

[77]  P. Stenson,et al.  The Human Gene Mutation Database: 2008 update , 2009, Genome Medicine.

[78]  M. Loder,et al.  Insulin Storage and Glucose Homeostasis in Mice Null for the Granule Zinc Transporter ZnT8 and Studies of the Type 2 Diabetes–Associated Variants , 2009, Diabetes.

[79]  J. Todd,et al.  Rare Variants of IFIH1, a Gene Implicated in Antiviral Responses, Protect Against Type 1 Diabetes , 2009, Science.

[80]  A. Koike,et al.  Genome-wide association of IL28B with response to pegylated interferon-α and ribavirin therapy for chronic hepatitis C , 2009, Nature Genetics.

[81]  D. Clayton Prediction and Interaction in Complex Disease Genetics: Experience in Type 1 Diabetes , 2009, PLoS genetics.

[82]  Jacques Fellay,et al.  Genetic variation in IL28B predicts hepatitis C treatment-induced viral clearance , 2009, Nature.

[83]  P. Elliott,et al.  A variant near MTNR1B is associated with increased fasting plasma glucose levels and type 2 diabetes risk , 2009, Nature Genetics.

[84]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[85]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[86]  Daniel F. Gudbjartsson,et al.  Parental origin of sequence variants associated with complex diseases , 2009, Nature.

[87]  O. Ohara,et al.  Mina, an Il4 repressor, controls T helper type 2 bias , 2009, Nature Immunology.

[88]  Gail Clement,et al.  A genome-wide study of common SNPs and CNVs in cognitive performance in the CANTAB. , 2009, Human molecular genetics.

[89]  J. Carpten,et al.  Fine mapping association study and functional analysis implicate a SNP in MSMB at 10q11 as a causal variant for prostate cancer risk. , 2009, Human molecular genetics.

[90]  Emily H Turner,et al.  Targeted Capture and Massively Parallel Sequencing of Twelve Human Exomes , 2009, Nature.

[91]  Y. Hayashizaki,et al.  NGSView: an extensible open source editor for next-generation sequencing data , 2009, Bioinform..

[92]  Tom H. Pringle,et al.  Complete Khoisan and Bantu genomes from southern Africa , 2010, Nature.

[93]  Paul D. Shaw,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2022 .

[94]  Greg Gibson,et al.  Common genetic variation and performance on standardized cognitive tests , 2010, European Journal of Human Genetics.

[95]  Robert B. Hartlage,et al.  This PDF file includes: Materials and Methods , 2009 .

[96]  M. Metzker Sequencing technologies — the next generation , 2010, Nature Reviews Genetics.

[97]  David B. Goldstein,et al.  Rare Variants Create Synthetic Genome-Wide Associations , 2010, PLoS biology.

[98]  Jacques Fellay,et al.  ITPA gene variants protect against anaemia in patients treated for chronic hepatitis C , 2010, Nature.

[99]  Joseph K. Pickrell,et al.  Understanding mechanisms underlying human gene expression variation with RNA sequencing , 2010, Nature.

[100]  P. Shannon,et al.  Exome sequencing identifies the cause of a Mendelian disorder , 2009, Nature Genetics.

[101]  P. Stankiewicz,et al.  Structural variation in the human genome and its role in disease. , 2010, Annual review of medicine.

[102]  Karen L. Mohlke,et al.  A map of open chromatin in human pancreatic islets , 2010, Nature Genetics.

[103]  Elizabeth T. Cirulli,et al.  Whole-Genome Sequencing of a Single Proband Together with Linkage Analysis Identifies a Mendelian Disease Gene , 2010, PLoS genetics.

[104]  Hongling Liao,et al.  Long-range enhancers on 8q24 regulate c-Myc , 2010, Proceedings of the National Academy of Sciences.

[105]  Jared T. Simpson,et al.  Copy number variant detection in inbred strains from short read sequence data , 2009, Bioinform..

[106]  P. Shannon,et al.  Analysis of Genetic Inheritance in a Family Quartet by Whole-Genome Sequencing , 2010, Science.

[107]  Peter Kraft,et al.  Using principal components of genetic variation for robust and powerful detection of gene-gene interactions in case-control and case-only studies. , 2010, American journal of human genetics.

[108]  M. Gerstein,et al.  Variation in Transcription Factor Binding Among Humans , 2010, Science.