Inferring causality and functional significance of human coding DNA variants.

Sequencing technology enables the complete characterization of human genetic variation. Statistical genetics studies identify numerous loci linked to or associated with phenotypes of direct medical interest. The major remaining challenge is to characterize functionally significant alleles that are causally implicated in the genetic basis of human traits. Here, I review three sources of evidence for the functional significance of human DNA variants in protein-coding genes. These include (i) statistical genetics considerations such as co-segregation with the phenotype, allele frequency in unaffected controls and recurrence; (ii) in vitro functional assays and model organism experiments; and (iii) computational methods for predicting the functional effect of amino acid substitutions. In spite of many successes of recent studies, functional characterization of human allelic variants remains problematic.

[1]  Life Technologies,et al.  A map of human genome variation from population-scale sequencing , 2011 .

[2]  Joseph K. Pickrell,et al.  A Systematic Survey of Loss-of-Function Variants in Human Protein-Coding Genes , 2012, Science.

[3]  Warren C. Lathe,et al.  Prediction of deleterious human alleles. , 2001, Human molecular genetics.

[4]  Sivakumar Gowrisankar,et al.  Pattern of sequence variation across 213 environmental response genes. , 2004, Genome research.

[5]  N. Katsanis,et al.  Zebrafish assays of ciliopathies. , 2011, Methods in cell biology.

[6]  Isabelle Cleynen,et al.  Resequencing of positional candidates identifies low frequency IL23R coding variants protecting against inflammatory bowel disease , 2011, Nature Genetics.

[7]  Hongyu Zhao,et al.  Rare independent mutations in renal salt handling genes contribute to blood pressure variation , 2008, Nature Genetics.

[8]  Jana Marie Schwarz,et al.  MutationTaster evaluates disease-causing potential of sequence alterations , 2010, Nature Methods.

[9]  J. Rine,et al.  Surrogate Genetics and Metabolic Profiling for Characterization of Human Disease Alleles , 2012, Genetics.

[10]  Evan T. Geller,et al.  Patterns and rates of exonic de novo mutations in autism spectrum disorders , 2012, Nature.

[11]  A. Spurdle,et al.  Sequence variant classification and reporting: recommendations for improving the interpretation of cancer susceptibility genetic test results , 2008, Human mutation.

[12]  Joshua M. Korn,et al.  Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease , 2011, Nature Genetics.

[13]  Swapan Mallick,et al.  A direct characterization of human mutation based on microsatellites , 2012, Nature Genetics.

[14]  Michael F. Walker,et al.  De novo mutations revealed by whole-exome sequencing are strongly associated with autism , 2012, Nature.

[15]  E. Birney,et al.  Heritable Individual-Specific and Allele-Specific Chromatin Signatures in Humans , 2010, Science.

[16]  Adam Kiezun,et al.  Exome sequencing and the genetic basis of complex traits , 2012, Nature Genetics.

[17]  R. Petersen,et al.  Mutations in the colony stimulating factor 1 receptor (CSF1R) cause hereditary diffuse leukoencephalopathy with spheroids , 2011, Nature Genetics.

[18]  A. Sidow,et al.  Physicochemical constraint violation by missense substitutions mediates impairment of protein function and disease severity. , 2005, Genome research.

[19]  Joseph K. Pickrell,et al.  DNaseI sensitivity QTLs are a major determinant of human expression variation , 2011, Nature.

[20]  Andrej Sali,et al.  Functional Impact of Missense Variants in BRCA1 Predicted by Supervised Learning , 2006, PLoS Comput. Biol..

[21]  Justin C. Fay,et al.  Identification of deleterious mutations within three human genomes. , 2009, Genome research.

[22]  S. Sunyaev,et al.  Human allelic variation: perspective from protein function, structure, and evolution. , 2010, Current opinion in structural biology.

[23]  J. Shendure,et al.  De novo mutations in the actin genes ACTB and ACTG1 cause Baraitser-Winter syndrome , 2012, Nature Genetics.

[24]  A. Chakravarti,et al.  On the probability that a novel variant is a disease-causing mutation. , 2005, Genome research.

[25]  Jack T Stapleton,et al.  The Major Genetic Determinants of HIV-1 Control Affect HLA Class I Peptide Presentation , 2010, Science.

[26]  Shamil R Sunyaev,et al.  Pooled association tests for rare variants in exon-resequencing studies. , 2010, American journal of human genetics.

[27]  Jacob A. Tennessen,et al.  Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes , 2012, Science.

[28]  Ryan D. Hernandez,et al.  Simultaneous inference of selection and population growth from patterns of variation in the human genome , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[29]  S. Henikoff,et al.  Predicting deleterious amino acid substitutions. , 2001, Genome research.

[30]  J. Schuurs-Hoeijmakers,et al.  Mutations in the phospholipid remodeling gene SERAC1 impair mitochondrial function and intracellular cholesterol trafficking and cause dystonia and deafness , 2012, Nature Genetics.

[31]  Jay Shendure,et al.  Single-nucleotide evolutionary constraint scores highlight disease-causing mutations , 2010, Nature Methods.

[32]  Shamil R Sunyaev,et al.  Most rare missense alleles are deleterious in humans: implications for complex disease and association studies. , 2007, American journal of human genetics.

[33]  D. Chasman,et al.  Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: structure-based assessment of amino acid variation. , 2001, Journal of molecular biology.

[34]  J. Stamatoyannopoulos,et al.  Power of deep, all-exon resequencing for discovery of human trait genes , 2009, Proceedings of the National Academy of Sciences.

[35]  Roded Sharan,et al.  Medical sequencing at the extremes of human body mass. , 2006, American journal of human genetics.

[36]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[37]  David B. Goldstein,et al.  De novo mutations in ATP1A3 cause alternating hemiplegia of childhood , 2012, Nature Genetics.

[38]  Shane J. Neph,et al.  Systematic Localization of Common Disease-Associated Variation in Regulatory DNA , 2012, Science.

[39]  Jonathan C. Cohen,et al.  Multiple Rare Alleles Contribute to Low Plasma Levels of HDL Cholesterol , 2004, Science.

[40]  S. Sunyaev,et al.  Dobzhansky–Muller incompatibilities in protein evolution , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[41]  Olle Melander,et al.  From noncoding variant to phenotype via SORT1 at the 1p13 cholesterol locus , 2010, Nature.

[42]  Sudhir Kumar,et al.  Positional conservation and amino acids shape the correct diagnosis and population frequencies of benign and damaging personal amino acid mutations. , 2009, Genome research.

[43]  Matthew S. Lebo,et al.  Development and validation of a computational method for assessment of missense variants in hypertrophic cardiomyopathy. , 2011, American journal of human genetics.

[44]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[45]  Marek Kimmel,et al.  Prediction of missense mutation functionality depends on both the algorithm and sequence alignment employed , 2011, Human mutation.

[46]  A. Gonzalez-Perez,et al.  Improving the assessment of the outcome of nonsynonymous SNVs with a consensus deleteriousness score, Condel. , 2011, American journal of human genetics.

[47]  G. Abecasis,et al.  Exome sequencing and complex disease: practical aspects of rare variant association studies , 2012, Human molecular genetics.

[48]  X. Estivill,et al.  KLHL3 mutations cause familial hyperkalemic hypertension by impairing ion transport in the distal nephron , 2012, Nature Genetics.

[49]  Gabor T. Marth,et al.  The Allele Frequency Spectrum in Genome-Wide Human Variation Data Reveals Signals of Differential Demographic History in Three Large World Populations , 2004, Genetics.

[50]  P. Shannon,et al.  Analysis of Genetic Inheritance in a Family Quartet by Whole-Genome Sequencing , 2010, Science.

[51]  F. Galibert,et al.  PNPLA1 mutations cause autosomal recessive congenital ichthyosis in golden retriever dogs and humans , 2012, Nature Genetics.

[52]  Huanming Yang,et al.  Resequencing of 200 human exomes identifies an excess of low-frequency non-synonymous coding variants , 2010, Nature Genetics.

[53]  S. Levy,et al.  Exome sequencing supports a de novo mutational paradigm for schizophrenia , 2011, Nature Genetics.

[54]  J. Moult,et al.  Loss of protein structure stability as a major causative factor in monogenic disease. , 2005, Journal of molecular biology.

[55]  G. Schreiber,et al.  Assessing computational methods for predicting protein stability upon mutation: good on average but not in the details. , 2009, Protein engineering, design & selection : PEDS.

[56]  David B. Goldstein,et al.  Rare Variants Create Synthetic Genome-Wide Associations , 2010, PLoS biology.

[57]  Inês Barroso,et al.  Rare MTNR1B variants impairing melatonin receptor 1B function contribute to type 2 diabetes , 2012, Nature Genetics.

[58]  Eric Boerwinkle,et al.  Rare loss-of-function mutations in ANGPTL family members contribute to plasma triglyceride levels in humans. , 2008, The Journal of clinical investigation.

[59]  V. Salomaa,et al.  Excess of rare variants in genes identified by genome-wide association study of hypertriglyceridemia , 2010, Nature Genetics.

[60]  Naomi R. Wray,et al.  Synthetic Associations Created by Rare Variants Do Not Explain Most GWAS Results , 2011, PLoS biology.

[61]  J. Shendure,et al.  De novo germline and postzygotic mutations in AKT3, PIK3R2 and PIK3CA cause a spectrum of related megalencephaly syndromes , 2012, Nature Genetics.

[62]  P Bork,et al.  SNP frequencies in human genes an excess of rare alleles and differing modes of selection. , 2000, Trends in genetics : TIG.

[63]  A. Munnich,et al.  Mutations at a single codon in Mad homology 2 domain of SMAD4 cause Myhre syndrome , 2011, Nature Genetics.

[64]  Robert M. Plenge,et al.  Five amino acids in three HLA proteins explain most of the association between MHC and seropositive rheumatoid arthritis , 2011, Nature Genetics.

[65]  Monique M. Ryan,et al.  Mutations in the RNA exosome component gene EXOSC3 cause pontocerebellar hypoplasia and spinal motor neuron degeneration , 2012, Nature Genetics.

[66]  Alexey S Kondrashov,et al.  Direct estimates of human per nucleotide mutation rates at 20 loci causing mendelian diseases , 2003, Human mutation.

[67]  W. Grody,et al.  ACMG recommendations for standards for interpretation and reporting of sequence variations: Revisions 2007 , 2008, Genetics in Medicine.

[68]  James T. Elder,et al.  Rare and common variants in CARD14, encoding an epidermal regulator of NF-kappaB, in psoriasis. , 2012, American journal of human genetics.

[69]  Yves Moreau,et al.  Heterozygous missense mutations in SMARCA2 cause Nicolaides-Baraitser syndrome , 2012, Nature Genetics.

[70]  Bradley P. Coe,et al.  Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations , 2012, Nature.

[71]  Jianzhi Zhang,et al.  Gene Losses during Human Origins , 2006, PLoS biology.

[72]  M. Hauser,et al.  Mutations affecting the cytoplasmic functions of the co-chaperone DNAJB6 cause limb-girdle muscular dystrophy , 2012, Nature Genetics.

[73]  P. Shannon,et al.  Exome sequencing identifies the cause of a Mendelian disorder , 2009, Nature Genetics.

[74]  Ryan D. Hernandez,et al.  Assessing the Evolutionary Impact of Amino Acid Mutations in the Human Genome , 2008, PLoS genetics.

[75]  M. Nachman,et al.  Estimate of the mutation rate per nucleotide in humans. , 2000, Genetics.