Hunting human disease genes: lessons from the past, challenges for the future

The concept that a specific alteration in an individual’s DNA can result in disease is central to our notion of molecular medicine. The molecular basis of more than 3,500 Mendelian disorders has now been identified. In contrast, identifying genes for common diseases has been much more challenging. We review historical and contemporary approaches to mapping and identifying human disease genes, focusing on novel opportunities such as the use of population extremes and the identification of rare variants. While our ability to sequence DNA has advanced dramatically, assigning function to a given sequence change remains a major challenge, highlighting the need for both bioinformatic and functional approaches to interpret these data appropriately. We close by discussing future challenges and opportunities for the field.
