Next-gen sequencing identifies non-coding variation disrupting miRNA-binding sites in neurological disorders

Understanding the genetic factors underlying neurodevelopmental and neuropsychiatric disorders is a major challenge, given their prevalence and their potentially severe impact on quality of life. While large-scale genomic screens have made major advances in this area, the genetic underpinnings of many disorders remain complex and poorly understood. To date, the field has focused predominantly on protein-coding variation, but given the importance of tightly controlled gene expression both for normal brain development and in disorder, variation that affects non-coding regulatory regions of the genome is likely to play an important role in these phenotypes. Here we show the importance of 3′ untranslated region (3′UTR) non-coding regulatory variants across neurodevelopmental and neuropsychiatric disorders. We devised a pipeline for identifying and functionally validating putatively pathogenic variants from next-generation sequencing (NGS) data. We applied this pipeline to a cohort of children with severe specific language impairment (SLI) and identified a functional, SLI-associated variant affecting gene regulation in cells and in post-mortem human brain. This variant and the affected gene (ARHGEF39) represent new putative risk factors for SLI. Furthermore, we identified 3′UTR regulatory variants across autism, schizophrenia and bipolar disorder NGS cohorts, demonstrating their impact on neurodevelopmental and neuropsychiatric disorders. Our findings show the importance of investigating non-coding regulatory variants when determining risk factors contributing to these disorders. In the future, integrating such regulatory variation with protein-coding changes will be essential for uncovering the genetic causes of complex neurological disorders and the fundamental mechanisms underlying health and disease.
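As an illustration of what it means for a 3′UTR variant to disrupt a miRNA-binding site, the following is a minimal sketch and not the pipeline used in the study. It assumes the simplest canonical targeting model, in which a site is a perfect 7-nucleotide match to miRNA positions 2-8 (the seed), and it handles single-nucleotide substitutions only. The function names, the toy 3′UTR sequence and the choice of let-7a as the example miRNA are hypothetical and chosen purely for illustration.

```python
# Illustrative sketch only: perfect 7-mer seed matching, single-nucleotide
# substitutions, 0-based coordinates. Not the method of the study.

# Complement table mapping RNA or DNA bases to their DNA complement.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A", "T": "A"}

# Mature hsa-let-7a-5p sequence, used here only as a worked example.
LET7A = "UGAGGUAGUAGGUUGUAUAGUU"


def seed_match_site(mirna: str) -> str:
    """Return the DNA 7-mer in a 3'UTR that pairs with miRNA positions 2-8."""
    seed = mirna.upper()[1:8]  # positions 2-8, 1-based
    return "".join(COMPLEMENT[base] for base in reversed(seed))


def find_sites(utr: str, site: str) -> list:
    """Return 0-based start positions of every exact occurrence of site."""
    utr = utr.upper()
    width = len(site)
    return [i for i in range(len(utr) - width + 1) if utr[i:i + width] == site]


def apply_snv(utr: str, pos: int, alt: str) -> str:
    """Substitute a single base at 0-based position pos."""
    return utr[:pos] + alt + utr[pos + 1:]


def variant_effect(utr: str, pos: int, alt: str, mirna: str) -> dict:
    """Report seed-match sites lost or gained when the substitution is applied."""
    site = seed_match_site(mirna)
    ref_sites = set(find_sites(utr, site))
    alt_sites = set(find_sites(apply_snv(utr, pos, alt), site))
    return {
        "lost": sorted(ref_sites - alt_sites),
        "gained": sorted(alt_sites - ref_sites),
    }


# Toy 3'UTR fragment (hypothetical) containing one let-7a seed match.
toy_utr = "AAGGCTACCTCAGTT"
print(seed_match_site(LET7A))
print(variant_effect(toy_utr, 7, "G", LET7A))
```

A real analysis would work from annotated 3′UTR coordinates and a full miRNA catalogue, and would also consider non-canonical sites. As the abstract stresses, a predicted disruption of this kind is only a candidate until it has been functionally validated.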
