Human Mutation

Funding information National Institute of General Medical Sciences, Grant/Award Numbers: R01GM104436, R01GM120364; National Human Genome Research Institute, Grant/ Award Numbers: R13HG006650, U41HG007346 Abstract Precise identification of causative variants from whole‐genome sequencing data, including both coding and noncoding variants, is challenging. The Critical Assessment of Genome Interpretation 5 SickKids clinical genome challenge provided an opportunity to assess our ability to extract such information. Participants in the challenge were required to match each of the 24 whole‐genome sequences to the correct phenotypic profile and to identify the disease class of each genome. These are all rare disease cases that have resisted genetic diagnosis in a state‐of‐the‐art pipeline. The patients have a range of eye, neurological, and connective‐tissue disorders. We used a gene‐centric approach to address this problem, assigning each gene a multiphenotype‐matching score. Mutations in the top‐scoring genes for each phenotype profile were ranked on a 6‐point scale of pathogenicity probability, resulting in an approximately equal number of top‐ranked coding and noncoding candidate variants overall. We were able to assign the correct disease class for 12 cases and the correct genome to a clinical profile for five cases. The challenge assessor found genes in three of these five cases as likely appropriate. In the postsubmission phase, after careful screening of the genes in the correct genome, we identified additional potential diagnostic variants, a high proportion of which are noncoding.

[1]  Nir Yosef,et al.  Integration of multiple epigenomic marks improves prediction of variant impact in saturation mutagenesis reporter assay , 2019, Human mutation.

[2]  Aashish N. Adhikari,et al.  CAGI SickKids challenges: Assessment of phenotype and variant predictions derived from clinical and genomic data of children with undiagnosed diseases , 2019, Human mutation.

[3]  A. Ramasamy,et al.  Transcriptomic and genetic analyses reveal potential causal drivers for intractable partial epilepsy. , 2019, Brain : a journal of neurology.

[4]  Luis Serrano,et al.  FoldX 5.0: working with RNA, small molecules and a new graphical interface , 2019, Bioinform..

[5]  Kiely N. James,et al.  Biallelic mutations in valyl-tRNA synthetase gene VARS are associated with a progressive neurodevelopmental epileptic encephalopathy , 2019, Nature Communications.

[6]  Y. Gotoh,et al.  Regulation of Chromatin Structure During Neural Development , 2018, Front. Neurosci..

[7]  Michelle Giglio,et al.  Human Disease Ontology 2018 update: classification, content and workflow expansion , 2018, Nucleic Acids Res..

[8]  Gregory M. Cooper,et al.  CADD: predicting the deleteriousness of variants throughout the human genome , 2018, Nucleic Acids Res..

[9]  M. Martí-Renom,et al.  Challenges and guidelines toward 4D nucleome data and model standards , 2018, Nature Genetics.

[10]  Zornitza Stark,et al.  Meta-analysis of the diagnostic and clinical utility of genome and exome sequencing and chromosomal microarray in children with suspected genetic diseases , 2018, npj Genomic Medicine.

[11]  Marni J. Falk,et al.  MSeqDR mvTool: A mitochondrial DNA Web and API resource for comprehensive variant annotation, universal nomenclature collation, and reference genome conversion , 2018, Human mutation.

[12]  Katherine S. Pollard,et al.  Chromatin features constrain structural variation across evolutionary timescales , 2018, Proceedings of the National Academy of Sciences.

[13]  Yifeng Qi,et al.  Predicting three-dimensional genome organization with chromatin states , 2018, bioRxiv.

[14]  Timo Lassmann,et al.  A phenotype centric benchmark of variant prioritisation tools , 2018, npj Genomic Medicine.

[15]  Lipika R. Pal,et al.  CAGI4 SickKids clinical genomes challenge: A pipeline for identifying pathogenic variants , 2017, Human mutation.

[16]  Lipika R. Pal,et al.  Ensemble variant interpretation methods to predict enzyme activity and assign pathogenicity in the CAGI4 NAGLU (Human N‐acetyl‐glucosaminidase) and UBE2I (Human SUMO‐ligase) challenges , 2017, Human mutation.

[17]  Steven E Brenner,et al.  Reports from CAGI: The Critical Assessment of Genome Interpretation , 2017, Human mutation.

[18]  Olivier Lichtarge,et al.  Objective assessment of the evolutionary action equation for the fitness effect of missense mutations across CAGI‐blinded contests , 2017, Human mutation.

[19]  Predrag Radivojac,et al.  Missense variant pathogenicity predictors generalize well across a range of function‐specific prediction challenges , 2017, Human mutation.

[20]  Daniel R. Richards,et al.  Leveraging network analytics to infer patient syndrome and identify causal genes in rare disease cases , 2017, BMC Genomics.

[21]  W. Chung,et al.  Recommendations for reporting of secondary findings in clinical exome and genome sequencing, 2016 update (ACMG SF v2.0): a policy statement of the American College of Medical Genetics and Genomics , 2016, Genetics in Medicine.

[22]  David E. Kim,et al.  Simultaneous Optimization of Biomolecular Energy Functions on Features from Small Molecules and Macromolecules. , 2016, Journal of chemical theory and computation.

[23]  Trevor Hastie,et al.  REVEL: An Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants. , 2016, American journal of human genetics.

[24]  Giorgio Valentini,et al.  A Whole-Genome Analysis Framework for Effective Identification of Pathogenic Regulatory Variants in Mendelian Disease. , 2016, American journal of human genetics.

[25]  D. Lancet,et al.  VarElect: the phenotype-based variation prioritizer of the GeneCards Suite , 2016, BMC Genomics.

[26]  Sheena M. Scroggins,et al.  CADD score has limited clinical validity for the identification of pathogenic variants in non-coding regions in a hereditary cancer panel , 2016, Genetics in Medicine.

[27]  E. Boerwinkle,et al.  dbNSFP v3.0: A One‐Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice‐Site SNVs , 2016, Human mutation.

[28]  J. Buxbaum,et al.  A SPECTRAL APPROACH INTEGRATING FUNCTIONAL GENOMIC ANNOTATIONS FOR CODING AND NONCODING VARIANTS , 2015, Nature Genetics.

[29]  Ricardo Villamarín-Salomón,et al.  ClinVar: public archive of interpretations of clinically relevant variants , 2015, Nucleic Acids Res..

[30]  Damian Smedley,et al.  Next-generation diagnostics and disease-gene discovery with the Exomiser , 2015, Nature Protocols.

[31]  James Y. Zou Analysis of protein-coding genetic variation in 60,706 humans , 2015, Nature.

[32]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[33]  Alexa B. R. McIntyre,et al.  Extensive sequencing of seven human genomes to characterize benchmark reference materials , 2015, Scientific Data.

[34]  Peter N. Robinson,et al.  Phenotype-driven strategies for exome prioritization of human Mendelian disease genes , 2015, Genome Medicine.

[35]  Hui Yang,et al.  Phenolyzer: phenotype-based prioritization of candidate genes for human diseases , 2015, Nature Methods.

[36]  B. Rost,et al.  Better prediction of functional effects for sequence variants , 2015, BMC Genomics.

[37]  Kateryna D. Makova,et al.  The effects of chromatin organization on variation in mutation rates in the genome , 2015, Nature Reviews Genetics.

[38]  Juancarlos Chan,et al.  Gene Ontology Consortium: going forward , 2014, Nucleic Acids Res..

[39]  Eric Boerwinkle,et al.  In silico prediction of splice-altering single nucleotide variants in the human genome , 2014, Nucleic acids research.

[40]  Magalie S Leduc,et al.  Molecular findings among patients referred for clinical whole-exome sequencing. , 2014, JAMA.

[41]  Judith A. Blake,et al.  The Mouse Genome Database (MGD): facilitating mouse as a model for human biology and disease , 2014, Nucleic Acids Res..

[42]  Brett J. Kennedy,et al.  Phevor combines multiple biomedical ontologies for accurate identification of disease-causing alleles in single individuals and small nuclear families. , 2014, American journal of human genetics.

[43]  Yvonne M. Bradford,et al.  The zebrafish anatomy and stage ontologies: representing the anatomy and development of Danio rerio , 2014, J. Biomed. Semant..

[44]  J. Shendure,et al.  A general framework for estimating the relative pathogenicity of human genetic variants , 2014, Nature Genetics.

[45]  Melissa J. Landrum,et al.  RefSeq: an update on mammalian reference sequences , 2013, Nucleic Acids Res..

[46]  P. Stenson,et al.  The Human Gene Mutation Database: building a comprehensive mutation repository for clinical and molecular genetics, diagnostic testing and personalized genomic medicine , 2013, Human Genetics.

[47]  Michael Brudno,et al.  PhenoTips: Patient Phenotyping Software for Clinical and Research Use , 2013, Human mutation.

[48]  Mark Yandell,et al.  VAAST 2.0: Improved Variant Classification and Disease-Gene Identification Using a Conservation-Controlled Amino Acid Substitution Matrix , 2013, Genetic epidemiology.

[49]  M. Bucan,et al.  From Mouse to Human: Evolutionary Genomics Analysis of Human Orthologs of Essential Genes , 2013, PLoS genetics.

[50]  H. Carter,et al.  Identifying Mendelian disease genes with the Variant Effect Scoring Tool , 2013, BMC Genomics.

[51]  Eurie L. Hong,et al.  Annotation of functional variation in personal genomes using RegulomeDB , 2012, Genome research.

[52]  Mark Gerstein,et al.  VAT: a computational framework to functionally annotate variants in personal genomes within a cloud-computing environment , 2012, Bioinform..

[53]  Pablo Cingolani,et al.  Using Drosophila melanogaster as a Model for Genotoxic Chemical Mutational Studies with a New Program, SnpSift , 2012, Front. Gene..

[54]  J. Shendure,et al.  Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data , 2011, Nature Reviews Genetics.

[55]  Serafim Batzoglou,et al.  Identifying a High Fraction of the Human Genome to be under Selective Constraint Using GERP++ , 2010, PLoS Comput. Biol..

[56]  Tsviya Olender,et al.  GeneCards Version 3: the human gene integrator , 2010, Database J. Biol. Databases Curation.

[57]  H. Hakonarson,et al.  ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data , 2010, Nucleic acids research.

[58]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[59]  Janan T Eppig,et al.  The mammalian phenotype ontology: enabling robust annotation and comparative analysis , 2009, Wiley interdisciplinary reviews. Systems biology and medicine.

[60]  J. Moult,et al.  Identification and analysis of deleterious human SNPs. , 2006, Journal of molecular biology.

[61]  J. Moult,et al.  Loss of protein structure stability as a major causative factor in monogenic disease. , 2005, Journal of molecular biology.

[62]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[63]  L. Serrano,et al.  Prediction of water and metal binding sites and their affinities by using the Fold-X force field. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[64]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2004, Nucleic Acids Res..

[65]  MA Katharina Schwarze BSc,et al.  Are whole-exome and whole-genome sequencing approaches cost-effective? A systematic review of the literature , 2018, Genetics in Medicine.

[66]  Damian Smedley,et al.  The Human Phenotype Ontology project: linking molecular biology and disease through phenotype data , 2014, Nucleic Acids Res..

[67]  K. Pollard,et al.  Detection of nonneutral substitution rates on mammalian phylogenies. , 2010, Genome research.

[68]  S. Henikoff,et al.  Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm , 2009, Nature Protocols.

[69]  S. Daiger,et al.  Identifying retinal disease genes: how far have we come, how far do we have to go? , 2004, Novartis Foundation symposium.

[70]  W. McLaren,et al.  Bioinformatics Applications Note Databases and Ontologies Deriving the Consequences of Genomic Variants with the Ensembl Api and Snp Effect Predictor , 2022 .

[71]  Pablo Cingolani,et al.  © 2012 Landes Bioscience. Do not distribute. , 2022 .