Poison exon annotations improve the yield of clinically relevant variants in genomic diagnostic testing

Purpose Neurodevelopmental disorders (NDDs) often result from rare genetic variation, but genomic testing yield for NDDs remains around 50%, suggesting some clinically relevant rare variants may be missed by standard analyses. Here we analyze “poison exons” (PEs) which, while often absent from standard gene annotations, are alternative exons whose inclusion results in a premature termination codon. Variants that alter PE inclusion can lead to loss-of-function and may be highly penetrant contributors to disease. Methods We curated published RNA-seq data from developing mouse cortex to define 1,937 PE regions conserved between humans and mice and potentially relevant to NDDs. We then analyzed variants found by genome sequencing in multiple NDD cohorts. Results Across 2,999 probands, we found six clinically relevant variants in PE regions that were previously overlooked. Five of these variants are in genes that are part of the sodium voltage-gated channel alpha subunit family (SCN1A, SCN2A, and SCN8A), associated with epilepsies. One variant is in SNRPB, associated with Cerebrocostomandibular Syndrome. These variants have moderate to high computational impact assessments, are absent from population variant databases, and were observed in probands with features consistent with those reported for the associated gene. Conclusion With only a minimal increase in variant analysis burden (most probands had zero or one candidate PE variants in a known NDD gene, with an average of 0.77 per proband), annotation of PEs can improve diagnostic yield for NDDs and likely other congenital conditions.

[1]  V. Jobanputra,et al.  Detection of mosaic variants using genome sequencing in a large pediatric cohort , 2022, American journal of medical genetics. Part A.

[2]  D. Bick,et al.  Applying the Clinician-reported Genetic testing Utility InDEx (C-GUIDE) to genome sequencing: further evidence of validity , 2022, European Journal of Human Genetics.

[3]  Kathleen F. Mittendorf,et al.  Lessons learned and recommendations for data coordination in collaborative research: The CSER consortium experience , 2022, HGG advances.

[4]  K. Downes,et al.  Recommendations for clinical interpretation of variants found in non-coding regions of the genome , 2021, Genome Medicine.

[5]  G. Barsh,et al.  Genome sequencing as a first-line diagnostic test for hospitalized infants. , 2021, Genetics in medicine : official journal of the American College of Medical Genetics.

[6]  J. Szaflarski,et al.  Pharmacogenetic Predictors of Cannabidiol Response and Tolerability in Treatment‐Resistant Epilepsy , 2021, Clinical pharmacology and therapeutics.

[7]  Brian D. O’Connor,et al.  Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL) , 2021, bioRxiv.

[8]  R. Myers,et al.  The Therapeutic Odyssey: Positioning Genomic Sequencing in the Search for a Child’s Best Possible Life , 2021, AJOB empirical bioethics.

[9]  J. Shendure,et al.  CADD-Splice—improving genome-wide variant effect prediction using deep learning-derived splice scores , 2021, Genome Medicine.

[10]  R. Myers,et al.  Aberrant regulation of a poison exon caused by a non-coding variant in a mouse model of Scn1a-associated epileptic encephalopathy , 2021, PLoS genetics.

[11]  Thomas M. Keane,et al.  Twelve years of SAMtools and BCFtools , 2020, GigaScience.

[12]  Siobhan M. Dolan,et al.  The NYCKidSeq project: study protocol for a randomized controlled trial incorporating genomics into the clinical care of diverse New York City children , 2020, Trials.

[13]  D. Absher,et al.  Identifying rare, medically relevant variation via population-based genomic screening in Alabama: opportunities and pitfalls , 2020, Genetics in Medicine.

[14]  I. Aznarez,et al.  Antisense oligonucleotides increase Scn1a expression and reduce seizures and SUDEP incidence in a mouse model of Dravet syndrome , 2020, Science Translational Medicine.

[15]  Ryan L. Collins,et al.  The mutational constraint spectrum quantified from variation in 141,456 humans , 2020, Nature.

[16]  Jonathan M. Mudge,et al.  Re-annotation of 191 developmental and epileptic encephalopathy-associated genes unmasks de novo variants in SCN1A , 2019, npj Genomic Medicine.

[17]  Brent S. Pedersen,et al.  Somalier: rapid relatedness estimation for cancer and germline studies using efficient genome sketches , 2019, Genome Medicine.

[18]  Neil H. Parker,et al.  Diagnostic utility of transcriptome sequencing for rare Mendelian diseases , 2019, Genetics in Medicine.

[19]  Mikel Hernaez,et al.  Sentieon DNASeq Variant Calling Workflow Demonstrates Strong Computational Performance and Accuracy , 2019, Front. Genet..

[20]  S. Scherer,et al.  Meta-analysis and multidisciplinary consensus statement: exome sequencing is a first-tier clinical diagnostic test for individuals with neurodevelopmental disorders , 2019, Genetics in Medicine.

[21]  Brian E. Cade,et al.  Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program , 2019, Nature.

[22]  David G. Knowles,et al.  Predicting Splicing from Primary Sequence with Deep Learning , 2019, Cell.

[23]  G. Carvill,et al.  Aberrant Inclusion of a Poison Exon Causes Dravet Syndrome and Related SCN1A-Associated Genetic Epilepsies. , 2018, American journal of human genetics.

[24]  Marina T. DiStefano,et al.  Recommendations for interpreting the loss of function PVS1 ACMG/AMP variant criterion , 2018, Human mutation.

[25]  N. Risch,et al.  The Clinical Sequencing Evidence-Generating Research Consortium: Integrating Genomic Sequencing in Diverse and Medically Underserved Populations. , 2018, American journal of human genetics.

[26]  Christopher T. Saunders,et al.  Strelka2: fast and accurate calling of germline and somatic variants , 2018, Nature Methods.

[27]  Raymond Dalgleish,et al.  VariantValidator: Accurate validation, mapping, and formatting of sequence variation descriptions , 2017, Human mutation.

[28]  Nikhil Wagle,et al.  Clinical Sequencing Exploratory Research Consortium: Accelerating Evidence-Based Practice of Genomic Medicine. , 2016, American journal of human genetics.

[29]  F. Cunningham,et al.  The Ensembl Variant Effect Predictor , 2016, bioRxiv.

[30]  Karynne E. Patterson,et al.  The Genetic Basis of Mendelian Phenotypes: Discoveries, Challenges, and Opportunities. , 2015, American journal of human genetics.

[31]  Bale,et al.  Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology , 2015, Genetics in Medicine.

[32]  Sebastien M. Weyn-Vanhentenryck,et al.  Systematic discovery of regulated and conserved alternative exons in the mammalian brain reveals NMD modulating chromatin regulators , 2015, Proceedings of the National Academy of Sciences.

[33]  François Schiettecatte,et al.  OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders , 2014, Nucleic Acids Res..

[34]  E. Zackai,et al.  Mutations within the spliceosomal gene SNRPB affect its auto‐regulation and are causative for classic cerebro‐costo‐mandibular syndrome , 2015, Clinical genetics.

[35]  J. Shendure,et al.  A general framework for estimating the relative pathogenicity of human genetic variants , 2014, Nature Genetics.

[36]  Ting Wang,et al.  Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser , 2013, Bioinform..

[37]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[38]  Dan M. Roden,et al.  Implementing genomic medicine in the clinic: the future is here , 2013, Genetics in Medicine.

[39]  B. Barres,et al.  Rbfox proteins regulate alternative splicing of neuronal sodium channel SCN8A , 2012, Molecular and Cellular Neuroscience.

[40]  Serafim Batzoglou,et al.  Identifying a High Fraction of the Human Genome to be under Selective Constraint Using GERP++ , 2010, PLoS Comput. Biol..

[41]  Josyf Mychaleckyj,et al.  Robust relationship inference in genome-wide association studies , 2010, Bioinform..

[42]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[43]  H. Ropers,et al.  Genetics of intellectual disability. , 2008, Current opinion in genetics & development.

[44]  Terrence S. Furey,et al.  The UCSC Genome Browser Database: update 2006 , 2005, Nucleic Acids Res..

[45]  Terrence S. Furey,et al.  The UCSC Table Browser data retrieval tool , 2004, Nucleic Acids Res..

[46]  S. Waxman,et al.  Novel splice variants of the voltage‐sensitive sodium channel alpha subunit , 1998, Neuroreport.

[47]  M. Meisler,et al.  Alternative Splicing of the Sodium Channel SCN8A Predicts a Truncated Two-domain Protein in Fetal Brain and Non-neuronal Cells* , 1997, Journal of Biological Chemistry.