Predicting the effects of amino acid substitutions on protein function.

Nonsynonymous single nucleotide polymorphisms (nsSNPs) are coding variants that introduce amino acid changes in their corresponding proteins. Because nsSNPs can affect protein function, they are believed to have the largest impact on human health compared with SNPs in other regions of the genome. Therefore, it is important to distinguish those nsSNPs that affect protein function from those that are functionally neutral. Here we provide an overview of amino acid substitution (AAS) prediction methods, which use sequence and/or structure to predict the effect of an AAS on protein function. Most methods predict approximately 25-30% of human nsSNPs to negatively affect protein function, and such nsSNPs tend to be rare in the population. We discuss the utility of AAS prediction methods for Mendelian and complex diseases as well as their broader applications for understanding protein function.

[1]  M. Carrington,et al.  The killer immunoglobulin-like receptor gene cluster: tuning the genome for defense. , 2006, Annual review of genomics and human genetics.

[2]  C. Stewart,et al.  The laminopathies: the functional architecture of the nucleus and its contribution to disease. , 2006, Annual review of genomics and human genetics.

[3]  Nicholas Katsanis,et al.  The ciliopathies: an emerging class of human genetic disorders. , 2006, Annual review of genomics and human genetics.

[4]  T. Mackay,et al.  Of flies and man: Drosophila as a model for human complex traits. , 2006, Annual review of genomics and human genetics.

[5]  J. Moult,et al.  Identification and analysis of deleterious human SNPs. , 2006, Journal of molecular biology.

[6]  J. Moult,et al.  Loss of protein structure stability as a major causative factor in monogenic disease. , 2005, Journal of molecular biology.

[7]  D. Bell,et al.  Single nucleotide polymorphism in transcriptional regulatory regions and expression of environmentally responsive genes. , 2005, Toxicology and applied pharmacology.

[8]  Arend Sidow,et al.  Trade-offs in detecting evolutionarily constrained sequence by comparative genomics. , 2005, Annual review of genomics and human genetics.

[9]  Modesto Orozco,et al.  PMUT: a web-based tool for the annotation of pathological mutations on proteins , 2005, Bioinform..

[10]  A. Sidow,et al.  Physicochemical constraint violation by missense substitutions mediates impairment of protein function and disease severity. , 2005, Genome research.

[11]  Joaquín Dopazo,et al.  PupasView: a visual tool for selecting suitable SNPs, with putative pathological effect in genes, for genotyping purposes , 2005, Nucleic Acids Res..

[12]  John Moult,et al.  A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction. , 2005, Current opinion in structural biology.

[13]  Chu Chen,et al.  Screening for Deleterious Nonsynonymous Single-Nucleotide Polymorphisms in Genes Involved in Steroid Hormone Metabolism and Response , 2005, Cancer Epidemiology Biomarkers & Prevention.

[14]  C. Ponting,et al.  Statistical Genetics: Usual suspects in complex disease , 2005, European Journal of Human Genetics.

[15]  Sean D. Mooney,et al.  Bioinformatics approaches and resources for single nucleotide polymorphism functional analysis , 2005, Briefings Bioinform..

[16]  G. Stormo,et al.  PolyMAPr: Programs for polymorphism database mining, annotation, and functional analysis , 2005, Human Mutation.

[17]  Leszek Rychlewski,et al.  LiveBench‐8: The large‐scale, continuous assessment of automated protein structure prediction , 2005, Protein science : a publication of the Protein Society.

[18]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt) , 2004, Nucleic Acids Res..

[19]  M. Orozco,et al.  Sequence‐based prediction of pathological mutations , 2004, Proteins.

[20]  J. Parks,et al.  High frequency of mitochondrial complex I mutations in Parkinson’s disease and aging , 2004, Neurobiology of Aging.

[21]  H. Wajcman,et al.  In silico prediction of the deleterious effect of a mutation: proceed with caution in clinical genetics. , 2004, Clinical chemistry.

[22]  P. Thomas,et al.  Coding single-nucleotide polymorphisms associated with complex vs. Mendelian disease: evolutionary evidence for differences in molecular effects. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Sivakumar Gowrisankar,et al.  Pattern of sequence variation across 213 environmental response genes. , 2004, Genome research.

[24]  Dirk Holste,et al.  Single Nucleotide Polymorphism–Based Validation of Exonic Splicing Enhancers , 2004, PLoS biology.

[25]  Jonathan C. Cohen,et al.  Multiple Rare Alleles Contribute to Low Plasma Levels of HDL Cholesterol , 2004, Science.

[26]  Alberto Riva,et al.  Bayesian approach to discovering pathogenic SNPs in conserved protein domains , 2004, Human mutation.

[27]  N. Mukhopadhyay,et al.  Genetic Polymorphisms in Human Proton-Dependent Dipeptide Transporter PEPT1: Implications for the Functional Role of Pro586 , 2004, Journal of Pharmacology and Experimental Therapeutics.

[28]  W. Foulkes,et al.  Germline E-cadherin mutations in hereditary diffuse gastric cancer: assessment of 42 new families and review of genetic screening criteria , 2004, Journal of Medical Genetics.

[29]  Joaquín Dopazo,et al.  PupaSNP Finder: a web tool for finding SNPs with putative effect at transcriptional level , 2004, Nucleic Acids Res..

[30]  I. M. Jones,et al.  Many amino acid substitution variants identified in DNA repair genes during human population screenings are predicted to impact protein function. , 2004, Genomics.

[31]  D. Guerry,et al.  Assessment of polymorphic variants in the melanocortin-1 receptor gene with cutaneous pigmentation using an evolutionary approach. , 2004, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.

[32]  A. Chakravarti,et al.  The Human MitoChip: a high-throughput sequencing microarray for mitochondrial mutation detection. , 2004, Genome research.

[33]  Albert Y Lau,et al.  Functional classification of proteins and protein variants. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[34]  International Chicken Polymorphism Map Consortium Explorer A genetic variation map for chicken with 2.8 million single-nucleotide polymorphisms , 2012 .

[35]  Simon Kasif,et al.  topoSNP: a topographic database of non-synonymous single nucleotide polymorphisms with and without known disease association , 2004, Nucleic Acids Res..

[36]  Webb Miller,et al.  Improvements in the HbVar database of human hemoglobin variants and thalassemia mutations for population and sequence variation studies , 2004, Nucleic Acids Res..

[37]  Stacy T. Knutson,et al.  Prediction of deleterious functional effects of amino acid mutations using a library of structure‐based function descriptors , 2003, Proteins.

[38]  S. Henikoff,et al.  Single-nucleotide mutations for plant functional genomics. , 2003, Annual review of plant biology.

[39]  David R. Westhead,et al.  A comparative study of machine-learning methods to predict the effects of single nucleotide polymorphisms on protein function , 2003, Bioinform..

[40]  C. Sander,et al.  The amino-acid mutational spectrum of human genetic disease , 2003, Genome Biology.

[41]  Russ B. Altman,et al.  MutDB: annotating human variation with functionally relevant data , 2003, Bioinform..

[42]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[43]  Lars Bolund,et al.  A population threshold for functional polymorphisms. , 2003, Genome research.

[44]  John M. Hancock,et al.  A phylogenetic approach to assessing the significance of missense mutations in disease genes , 2003, Human mutation.

[45]  Steven Henikoff,et al.  SIFT: predicting amino acid changes that affect protein function , 2003, Nucleic Acids Res..

[46]  P. Stenson,et al.  Human Gene Mutation Database (HGMD®): 2003 update , 2003, Human mutation.

[47]  Conrad C. Huang,et al.  Natural variation in human membrane transporter genes reveals evolutionary and functional constraints , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[48]  Russ B Altman,et al.  A functional analysis of disease-associated mutations in the androgen receptor gene. , 2003, Nucleic acids research.

[49]  S. Kasif,et al.  Structural location of disease-associated single-nucleotide polymorphisms. , 2003, Journal of molecular biology.

[50]  D. Botstein,et al.  Discovering genotypes underlying human phenotypes: past successes for mendelian disease, future approaches for complex disease , 2003, Nature Genetics.

[51]  A. Valencia,et al.  Automatic methods for predicting functionally important residues. , 2003, Journal of molecular biology.

[52]  E. Lander,et al.  Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease , 2003, Nature Genetics.

[53]  J. Potter,et al.  Understanding missense mutations in the BRCA1 gene: An evolutionary approach , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[54]  P. Stenson,et al.  Human Gene Mutation Database (HGMD , 2003 .

[55]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2002, Nucleic Acids Res..

[56]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[57]  Christopher T. Saunders,et al.  Evaluation of structural and evolutionary contributions to deleterious mutation prediction. , 2002, Journal of molecular biology.

[58]  P. Bork,et al.  Human non-synonymous SNPs: server and survey. , 2002, Nucleic acids research.

[59]  D. Cooper,et al.  Assessing the relative importance of the biophysical properties of amino acid substitutions associated with human genetic disease , 2002, Human mutation.

[60]  T. J. Brickman,et al.  Bordetella Interspecies Allelic Variation in AlcR Inducer Requirements: Identification of a Critical Determinant of AlcR Inducer Responsiveness and Construction of an alcR(Con) Mutant Allele , 2002, Journal of bacteriology.

[61]  S. Henikoff,et al.  Accounting for human polymorphisms predicted to affect protein function. , 2002, Genome research.

[62]  Andrew C R Martin,et al.  G6PDdb, an integrated database of glucose‐6‐phosphate dehydrogenase (G6PD) mutations , 2002, Human mutation.

[63]  J. Hirschhorn,et al.  A comprehensive review of genetic association studies , 2002, Genetics in Medicine.

[64]  George P Patrinos,et al.  HbVar: A relational database of human hemoglobin variants and thalassemia mutations at the globin gene server , 2002, Human mutation.

[65]  M. Orozco,et al.  Characterization of disease-associated single amino acid polymorphisms in terms of sequence and structure properties. , 2002, Journal of molecular biology.

[66]  Mouse Genome Sequencing Consortium Initial sequencing and comparative analysis of the mouse genome , 2002, Nature.

[67]  M. Miller,et al.  Understanding human disease mutations through the use of interspecific genetic variation. , 2001, Human molecular genetics.

[68]  J. Stephens,et al.  Haplotype Variation and Linkage Disequilibrium in 313 Human Genes , 2001, Science.

[69]  J. Pritchard Are rare variants responsible for susceptibility to complex diseases? , 2001, American journal of human genetics.

[70]  S. Henikoff,et al.  Predicting deleterious amino acid substitutions. , 2001, Genome research.

[71]  J. Moult,et al.  SNPs, protein structure, and disease , 2001, Human mutation.

[72]  D. Chasman,et al.  Predicting the functional consequences of non-synonymous single nucleotide polymorphisms: structure-based assessment of amino acid variation. , 2001, Journal of molecular biology.

[73]  Warren C. Lathe,et al.  Prediction of deleterious human alleles. , 2001, Human molecular genetics.

[74]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[75]  Csilla Szabo,et al.  The Breast Cancer Information Core: Database design, structure, and scope , 2000, Human mutation.

[76]  P. Bork,et al.  Towards a structural basis of human non-synonymous single nucleotide polymorphisms. , 2000, Trends in genetics : TIG.

[77]  A Chakravarti,et al.  Patterns of genetic variation in Mendelian and complex traits. , 2000, Annual review of genomics and human genetics.

[78]  N. Shen,et al.  Patterns of single-nucleotide polymorphisms in candidate genes for blood-pressure homeostasis , 1999, Nature Genetics.

[79]  E. Lander,et al.  Characterization of single-nucleotide polymorphisms in coding regions of human genes , 1999 .

[80]  M V Olson,et al.  When less is more: gene loss as an engine of evolutionary change. , 1999, American journal of human genetics.

[81]  Bryan Chan,et al.  Human immunodeficiency virus reverse transcriptase and protease sequence database , 2003, Nucleic Acids Res..

[82]  G J Pielak,et al.  A genetic approach for identifying critical residues in the fingers and palm subdomains of HIV-1 reverse transcriptase. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[83]  Patricia Rodriguez-Tomé,et al.  IARC Database of p53 gene mutations in human tumors and cell lines: updated compilation, revised formats and new visualisation tools , 1998, Nucleic Acids Res..

[84]  Heikki Lehväslaiho,et al.  The Androgen Receptor Gene Mutations Database , 1998, Nucleic Acids Res..

[85]  Francis S. Collins,et al.  Variations on a Theme: Cataloging Human DNA Sequence Variation , 1997, Science.

[86]  J H Miller,et al.  Lac repressor genetic map in real space. , 1997, Trends in biochemical sciences.

[87]  L Luzzatto,et al.  Hematologically important mutations: glucose-6-phosphate dehydrogenase. , 1996, Blood cells, molecules & diseases.

[88]  N Risch,et al.  The Future of Genetic Studies of Complex Human Diseases , 1996, Science.

[89]  S. Bouvier,et al.  Systematic mutation of bacteriophage T4 lysozyme. , 1991, Journal of molecular biology.

[90]  Marianne Manchester,et al.  Complete mutagenesis of the HIV-1 protease , 1989, Nature.