UniProt genomic mapping for deciphering functional effects of missense variants

The integration of protein function knowledge in the UniProt Knowledgebase (UniProtKB) with genomic data provides unique opportunities for biomedical research by helping scientists to rapidly comprehend complex processes in biology. UniProtKB provides expert curation of functionally characterized variants reported in the literature, many of them associated with disease. Understanding the association of genetic variation with its functional consequences in proteins is essential for the interpretation of large-scale genomics data. UniProtKB protein sequences and sequence feature annotations such as active sites, modified sites, binding domains and amino acid variation have been mapped to the current build of the human genome (GRCh38). We have also created public track hubs for the Ensembl and UCSC browsers. We examine some specific biological examples in disease-related genes and proteins, illustrating the utility of combining protein and genome annotations for the functional interpretation of variants. We also present a larger comparison of UniProtKB variant and feature curation data that collocates with clinically observed SNP data from ClinVar, illustrating how protein and gene disease annotations currently compare and discuss some additional uses that might follow with combined annotation. The combination of gene and protein annotation can assist medical geneticists with the functional interpretation of variation. The genomic track hubs are downloadable from the UniProt FTP site (http://www.uniprot.org/downloads) and directly discoverable as public track hubs at the USCS and Ensembl genome browsers.

[1]  Alan Bridge,et al.  An enhanced workflow for variant interpretation in UniProtKB/Swiss-Prot improves consistency and reuse in ClinVar , 2019, Database J. Biol. Databases Curation.

[2]  Yuan Tian,et al.  A Bayesian framework for efficient and accurate variant prediction , 2018, PloS one.

[3]  S. Boca,et al.  Future of Evidence Synthesis in Precision Oncology: Between Systematic Reviews and Biocuration. , 2018, JCO precision oncology.

[4]  Chunling Hu,et al.  Assessment of the Clinical Relevance of BRCA2 Missense Variants by Functional and Computational Approaches. , 2018, American journal of human genetics.

[5]  Bruce D. Gelb,et al.  ClinGen’s RASopathy Expert Panel Consensus Methods for Variant Interpretation , 2018, Genetics in Medicine.

[6]  R. Nussbaum,et al.  Modeling the ACMG/AMP Variant Classification Guidelines as a Bayesian Classification Framework , 2018, Genetics in Medicine.

[7]  Chunlei Liu,et al.  ClinVar: improving access to variant interpretations and supporting evidence , 2017, Nucleic Acids Res..

[8]  Subha Madhavan,et al.  ClinGen Cancer Somatic Working Group - Standardizing and democratizing access to cancer molecular diagnostic data to drive translational research , 2018, PSB.

[9]  Lily Hoffman-Andrews The known unknown: the challenges of genetic variants of uncertain significance in clinical practice , 2017, Journal of law and the biosciences.

[10]  Daniel J. Park,et al.  Variant effect prediction tools assessed using independent, functional assay-based datasets: implications for discovery and diagnostics , 2017, Human Genomics.

[11]  Keith Nykamp,et al.  Sherloc: a comprehensive refinement of the ACMG–AMP variant classification criteria , 2017, Genetics in Medicine.

[12]  Matthew S. Lebo,et al.  Using large sequencing data sets to refine intragenic disease regions and prioritize clinical variant interpretation , 2016, Genetics in Medicine.

[13]  Maria Jesus Martin,et al.  The Proteins API: accessing key integrated protein and genome information , 2017, Nucleic Acids Res..

[14]  Alessandro Vullo,et al.  Ensembl 2017 , 2016, Nucleic Acids Res..

[15]  D. Karolchik,et al.  The UCSC Genome Browser database: 2017 update , 2016, Nucleic Acids Res..

[16]  Joshua L. Deignan,et al.  A survey of current practices for genomic sequencing test interpretation and reporting processes in US laboratories , 2016, Genetics in Medicine.

[17]  D. MacArthur,et al.  Reassessment of Mendelian gene pathogenicity using 7,855 cardiomyopathy cases and 60,706 reference samples , 2016, Genetics in Medicine.

[18]  Marilyn M. Li,et al.  Standards and Guidelines for the Interpretation and Reporting of Sequence Variants in Cancer: A Joint Consensus Recommendation of the Association for Molecular Pathology, American Society of Clinical Oncology, and College of American Pathologists. , 2017, The Journal of molecular diagnostics : JMD.

[19]  Subha Madhavan,et al.  Somatic cancer variant curation and harmonization through consensus minimum variant level data , 2016, Genome Medicine.

[20]  Trevor Hastie,et al.  REVEL: An Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants. , 2016, American journal of human genetics.

[21]  Peter Szolovits,et al.  Genetic Misdiagnoses and the Potential for Health Disparities. , 2016, The New England journal of medicine.

[22]  Levi C. T. Pierce,et al.  Deep sequencing of 10,000 human genomes , 2016, Proceedings of the National Academy of Sciences.

[23]  Matthew S. Lebo,et al.  Performance of ACMG-AMP Variant-Interpretation Guidelines among Nine Laboratories in the Clinical Sequencing Exploratory Research Consortium. , 2016, American journal of human genetics.

[24]  F. Cunningham,et al.  The Ensembl Variant Effect Predictor , 2016, Genome Biology.

[25]  Daniel R. Zerbino,et al.  Ensembl 2016 , 2015, Nucleic Acids Res..

[26]  Glenn A. Maston,et al.  A Standardized DNA Variant Scoring System for Pathogenicity Assessments in Mendelian Disorders , 2015, Human mutation.

[27]  H. Rehm,et al.  Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology , 2015, Genetics in Medicine.

[28]  Colin Campbell,et al.  An integrative approach to predicting the functional effects of non-coding and coding sequence variation , 2015, Bioinform..

[29]  Elisabeth Coudert,et al.  HAMAP in 2015: updates to the protein family classification and annotation system , 2014, Nucleic Acids Res..

[30]  Jerven T. Bolleman,et al.  Genetic Variations and Diseases in UniProtKB/Swiss-Prot: The Ins and Outs of Expert Manual Curation , 2014, Human mutation.

[31]  D. G. MacArthur,et al.  Guidelines for investigating causality of sequence variants in human disease , 2014, Nature.

[32]  J. Shendure,et al.  A general framework for estimating the relative pathogenicity of human genetic variants , 2014, Nature Genetics.

[33]  Deanna M. Church,et al.  ClinVar: public archive of relationships among sequence variation and human phenotype , 2013, Nucleic Acids Res..

[34]  Ting Wang,et al.  Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser , 2013, Bioinform..

[35]  Lora J. H. Bean,et al.  Free the Data: One Laboratory's Approach to Knowledge‐Based Genomic Variant Classification and Preparation for EMR Integration of Genomic Data , 2013, Human mutation.

[36]  P. Padakannaya,et al.  Copy number variation-based polymorphism in a new pseudoautosomal region 3 (PAR3) of a human X-chromosome-transposed region (XTR) in the Y chromosome , 2013, Functional & Integrative Genomics.

[37]  Xiao-xia Liu,et al.  [Genetic and clinical study of three Chinese pedigrees with Fabry disease]. , 2013, Zhonghua yi xue yi chuan xue za zhi = Zhonghua yixue yichuanxue zazhi = Chinese journal of medical genetics.

[38]  I. Adzhubei,et al.  Predicting Functional Effect of Human Missense Mutations Using PolyPhen‐2 , 2013, Current protocols in human genetics.

[39]  Tom R. Gaunt,et al.  Predicting the Functional, Molecular, and Phenotypic Consequences of Amino Acid Substitutions using Hidden Markov Models , 2012, Human mutation.

[40]  R. Altman,et al.  Pharmacogenomics Knowledge for Personalized Medicine , 2012, Clinical pharmacology and therapeutics.

[41]  Rachael P. Huntley,et al.  The UniProt-GO Annotation database in 2011 , 2011, Nucleic Acids Res..

[42]  A. Mehta,et al.  Functional analysis of variant lysosomal acid glycosidases of Anderson-Fabry and Pompe disease in a human embryonic kidney epithelial cell line (HEK 293 T) , 2012, Journal of Inherited Metabolic Disease.

[43]  Yusuke Nakamura,et al.  IL28B but not ITPA polymorphism is predictive of response to pegylated interferon, ribavirin, and telaprevir triple therapy in patients with genotype 1 hepatitis C. , 2011, The Journal of infectious diseases.

[44]  Yusuke Nakamura,et al.  Common genetic polymorphism of ITPA gene affects ribavirin‐induced anemia and effect of peg‐interferon plus ribavirin therapy , 2011, Journal of medical virology.

[45]  D. Lockhart,et al.  A Pharmacogenetic Approach to Identify Mutant Forms of α-Galactosidase A that Respond to a Pharmacological Chaperone for Fabry Disease , 2011, Human mutation.

[46]  David McHugh,et al.  Genetics in Medicine : Official Journal of the American College of Medical Genetics , 2011 .

[47]  Cathy H. Wu,et al.  Structure-guided rule-based annotation of protein functional sites in UniProt knowledgebase. , 2011, Methods in molecular biology.

[48]  D. Selkoe Alzheimer's disease. , 2011, Cold Spring Harbor perspectives in biology.

[49]  Galt P. Barber,et al.  BigWig and BigBed: enabling browsing of large distributed datasets , 2010, Bioinform..

[50]  Xinning Jiang,et al.  Glycoproteomics analysis of human liver tissue by combination of multiple enzyme digestion and hydrazide chemistry. , 2009, Journal of proteome research.

[51]  R. Schiffmann Fabry disease. , 2009, Pharmacology & therapeutics.

[52]  A. Spurdle,et al.  Sequence variant classification and reporting: recommendations for improving the interpretation of cancer susceptibility genetic test results , 2008, Human mutation.

[53]  S. Ishii,et al.  Mutant alpha-galactosidase A enzymes identified in Fabry disease patients with residual enzyme activity: biochemical characterization and restoration of normal intracellular processing by 1-deoxygalactonojirimycin. , 2007, The Biochemical journal.

[54]  B. Morris,et al.  The Human Pseudoautosomal Region (PAR): Origin, Function and Future. , 2007, Current Genomics.

[55]  Andreas Prlic,et al.  Ensembl 2007 , 2006, Nucleic Acids Res..

[56]  R. Desnick,et al.  Detection of α‐galactosidase a mutations causing fabry disease by denaturing high performance liquid chromatography , 2005 .

[57]  K. Mills,et al.  Measurement of urinary CDH and CTH by tandem mass spectrometry in patients hemizygous and heterozygous for Fabry disease , 2005, Journal of Inherited Metabolic Disease.

[58]  R. Desnick,et al.  Detection of alpha-galactosidase a mutations causing Fabry disease by denaturing high performance liquid chromatography. , 2005, Human mutation.

[59]  F. Collins,et al.  The MLH1 D132H variant is associated with susceptibility to sporadic colorectal cancer , 2004, Nature Genetics.

[60]  D. Garboczi,et al.  The molecular defect leading to Fabry disease: structure of human alpha-galactosidase. , 2004, Journal of molecular biology.

[61]  S. Amladi,et al.  Online Mendelian Inheritance in Man 'OMIM'. , 2003, Indian journal of dermatology, venereology and leprology.

[62]  Huawei Qiu,et al.  A biochemical and pharmacological comparison of enzyme replacement therapies for the glycolipid storage disorder Fabry disease. , 2003, Glycobiology.

[63]  S. Kornfeld,et al.  Mannose 6-phosphate receptors: new twists in the tale , 2003, Nature Reviews Molecular Cell Biology.

[64]  Jim Kent,et al.  autoSQL and autoXML: code generators from the genome project , 2002 .

[65]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[66]  B. Winchester,et al.  Fabry disease: 20 novel GLA mutations in 35 families , 2001, Human mutation.

[67]  S. Younkin,et al.  The 'Arctic' APP mutation (E693G) causes Alzheimer's disease by enhanced Aβ protofibril formation , 2001, Nature Neuroscience.

[68]  H. Vanderstichele,et al.  Nonfibrillar diffuse amyloid deposition due to a gamma(42)-secretase site mutation points to an essential role for N-truncated A beta(42) in Alzheimer's disease. , 2000, Human molecular genetics.

[69]  C. Masters,et al.  Novel Leu723Pro amyloid precursor protein mutation increases amyloid β42(43) peptide levels and induces apoptosis , 2000, Annals of neurology.

[70]  D. Valle,et al.  Online Mendelian Inheritance In Man (OMIM) , 2000, Human mutation.

[71]  C. Eng,et al.  Twenty Novel Mutations in the α-Galactosidase A Gene Causing Fabry Disease , 1999, Molecular medicine.

[72]  L. Poenaru,et al.  Fabry disease: identification of novel alpha-galactosidase A mutations and molecular carrier detection by use of fluorescent chemical cleavage of mismatches. , 1999, Biochemical and biophysical research communications.

[73]  D. Campion,et al.  Unusual phenotypic alteration of β amyloid precursor protein (βAPP) maturation by a new Val-715 → Met βAPP-770 mutation responsible for probable early-onset Alzheimer’s disease , 1999 .

[74]  D. Campion,et al.  Unusual phenotypic alteration of beta amyloid precursor protein (betaAPP) maturation by a new Val-715 --> Met betaAPP-770 mutation responsible for probable early-onset Alzheimer's disease. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[75]  A. Hofman,et al.  Presenile Alzheimer dementia characterized by amyloid angiopathy and large amyloid core type senile plaques in the APP 692Ala→Gly mutation , 1998, Acta Neuropathologica.

[76]  R. Desnick,et al.  Human alpha-galactosidase A: glycosylation site 3 is essential for enzyme solubility. , 1998, The Biochemical journal.

[77]  D. Selkoe The cell biology of beta-amyloid precursor protein and presenilin in Alzheimer's disease. , 1998, Trends in cell biology.

[78]  R. Froissart,et al.  Mutation analysis in 11 French patients with Fabry disease , 1998, Human mutation.

[79]  J. Hardy,et al.  A new pathogenic mutation in the APP gene (I716V) increases the relative proportion of A beta 42(43). , 1997, Human molecular genetics.

[80]  R. Wevers,et al.  Uneven X inactivation in a female monozygotic twin pair with Fabry disease and discordant expression of a novel mutation in the alpha-galactosidase A gene. , 1996, Journal of medical genetics.

[81]  J. Rashbass Online Mendelian Inheritance in Man. , 1995, Trends in genetics : TIG.

[82]  C. Eng,et al.  Molecular basis of fabry disease: Mutations and polymorphisms in the human α‐galactosidase A gene , 1994 .

[83]  C. Eng,et al.  Molecular basis of Fabry disease: mutations and polymorphisms in the human alpha-galactosidase A gene. , 1994, Human mutation.

[84]  A. Roses,et al.  Characterization of Amyloid Familial β-Peptide in Familial Alzheimer′s Disease with APP717 Mutations , 1993 .

[85]  C. Eng,et al.  Nature and frequency of mutations in the alpha-galactosidase A gene that cause Fabry disease. , 1993, American journal of human genetics.

[86]  S. Malcolm,et al.  Mutation analysis in patients with the typical form of Anderson-Fabry disease. , 1993, Human molecular genetics.

[87]  R. Denman,et al.  A system for studying the effect(s) of familial Alzheimer disease mutations on the processing of the beta-amyloid peptide precursor. , 1993, Biochemical and biophysical research communications.

[88]  R. Loewenstein,et al.  A Survey of Current Practices , 1993 .

[89]  B. Ghetti,et al.  Characterization of amyloid fibril beta-peptide in familial Alzheimer's disease with APP717 mutations. , 1993, Biochemical and biophysical research communications.

[90]  B. Winblad,et al.  A pathogenic mutation for probable Alzheimer's disease in the APP gene at the N–terminus of β–amyloid , 1992, Nature Genetics.

[91]  A. Hofman,et al.  Presenile dementia and cerebral haemorrhage linked to a mutation at codon 692 of the β–amyloid precursor protein gene , 1992, Nature Genetics.

[92]  J. Hardy,et al.  Early-onset Alzheimer's disease caused by mutations at codon 717 of the β-amyloid precursor protein gene , 1991, Nature.

[93]  B. Ghetti,et al.  A mutation in the amyloid precursor protein associated with hereditary Alzheimer's disease. , 1991, Science.

[94]  M. Pericak-Vance,et al.  Segregation of a missense mutation in the amyloid precursor protein gene with familial Alzheimer's disease , 1991, Nature.

[95]  V. McKusick Mendelian inheritance in man , 1971 .

[96]  B. Migeon,et al.  Genetic Inactivation of the α-Galactosidase Locus in Carriers of Fabry's Disease , 1970, Science.

[97]  B. Migeon,et al.  Genetic inactivation of the alpha-galactosidase locus in carriers of Fabry's disease. , 1970, Science.