eRAM: encyclopedia of rare disease annotations for precision medicine

Abstract Rare diseases affect over a hundred million people worldwide, yet most of these patients are neither accurately diagnosed nor effectively treated. Limited knowledge of rare diseases is the biggest obstacle to improving their treatment. Detailed clinical phenotyping is considered a keystone for deciphering the underlying genes and realizing precision medicine for rare diseases. Here, we present a standardized system for various types of rare diseases, called the encyclopedia of Rare disease Annotations for Precision Medicine (eRAM). eRAM was built by text-mining nearly 10 million scientific publications and electronic medical records, and by integrating data from established databases (such as the Unified Medical Language System (UMLS), Human Phenotype Ontology, Orphanet, OMIM and GWAS). eRAM systematically incorporates currently available data on the clinical manifestations and molecular mechanisms of rare diseases and uncovers many novel associations among diseases. eRAM provides enriched annotations for 15 942 rare diseases, yielding 6147 human disease-related phenotype terms, 31 661 mammalian phenotype terms, 10 202 symptoms from UMLS, 18 815 genes and 92 580 genotypes. eRAM not only provides information about rare disease mechanisms but also helps clinicians make accurate diagnostic and therapeutic decisions for rare diseases. eRAM can be freely accessed at http://www.unimd.org/eram/.
