Exon level machine learning analyses elucidate novel candidate miRNA targets in an avian model of fetal alcohol spectrum disorder

Gestational alcohol exposure causes fetal alcohol spectrum disorder (FASD) and is a prominent cause of neurodevelopmental disability. Whole transcriptome sequencing (RNA-Seq) offer insights into mechanisms underlying FASD, but gene-level analysis provides limited information regarding complex transcriptional processes such as alternative splicing and non-coding RNAs. Moreover, traditional analytical approaches that use multiple hypothesis testing with a false discovery rate adjustment prioritize genes based on an adjusted p-value, which is not always biologically relevant. We address these limitations with a novel approach and implemented an unsupervised machine learning model, which we applied to an exon-level analysis to reduce data complexity to the most likely functionally relevant exons, without loss of novel information. This was performed on an RNA-Seq paired-end dataset derived from alcohol-exposed neural fold-stage chick crania, wherein alcohol causes facial deficits recapitulating those of FASD. A principal component analysis along with k-means clustering was utilized to extract exons that deviated from baseline expression. This identified 6857 differentially expressed exons representing 1251 geneIDs; 391 of these genes were identified in a prior gene-level analysis of this dataset. It also identified exons encoding 23 microRNAs (miRNAs) having significantly differential expression profiles in response to alcohol. We developed an RDAVID pipeline to identify KEGG pathways represented by these exons, and separately identified predicted KEGG pathways targeted by these miRNAs. Several of these (ribosome biogenesis, oxidative phosphorylation) were identified in our prior gene-level analysis. Other pathways are crucial to facial morphogenesis and represent both novel (focal adhesion, FoxO signaling, insulin signaling) and known (Wnt signaling) alcohol targets. Importantly, there was substantial overlap between the exomes themselves and the predicted miRNA targets, suggesting these miRNAs contribute to the gene-level expression changes. Our novel application of unsupervised machine learning in conjunction with statistical analyses facilitated the discovery of signaling pathways and miRNAs that inform mechanisms underlying FASD.

[1]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[2]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[3]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[4]  Wei Shi,et al.  featureCounts: an efficient general purpose program for assigning sequence reads to genomic features , 2013, Bioinform..

[5]  R. Natoli,et al.  Alcohol‐related deficient fracture healing is associated with activation of FoxO transcription factors in mice , 2016, Journal of orthopaedic research : official publication of the Orthopaedic Research Society.

[6]  Vinodh Narayanan,et al.  De Novo Missense Mutations in DHX30 Impair Global Translation and Cause a Neurodevelopmental Disorder. , 2018, American journal of human genetics.

[7]  M. Berres,et al.  High-throughput transcriptome sequencing identifies candidate genetic modifiers of vulnerability to fetal alcohol spectrum disorders. , 2014, Alcoholism, clinical and experimental research.

[8]  R. Miranda MicroRNAs and ethanol toxicity. , 2014, International review of neurobiology.

[9]  Xiaopan Chen,et al.  Up-regulation of Siah1 by ethanol triggers apoptosis in neural crest cells through p38 MAPK-mediated activation of p53 signaling pathway , 2016, Archives of Toxicology.

[10]  B. Laufer,et al.  Long-lasting alterations to DNA methylation and ncRNAs could underlie the effects of fetal alcohol exposure in mice , 2013, Disease Models & Mechanisms.

[11]  N. L. Battiato,et al.  Ethanol induces morphological and dynamic changes on in vivo and in vitro neural crest cells. , 2002, Alcoholism, clinical and experimental research.

[12]  K. Zarbalis,et al.  The emerging roles of ribosome biogenesis in craniofacial development , 2014, Front. Physiol..

[13]  D. Alfandari,et al.  The ectodomain of cadherin-11 binds to erbB2 and stimulates Akt phosphorylation to promote cranial neural crest cell migration , 2017, PloS one.

[14]  A. Chudley,et al.  Fetal alcohol spectrum disorder: a guideline for diagnosis across the lifespan , 2016, Canadian Medical Association Journal.

[15]  Hyungwon Choi,et al.  CDK10 Mutations in Humans and Mice Cause Severe Growth Retardation, Spine Malformations, and Developmental Delays. , 2017, American journal of human genetics.

[16]  David Tollervey,et al.  A pre-ribosome-associated HEAT-repeat protein is required for export of both ribosomal subunits. , 2004, Genes & development.

[17]  Xiaopan Chen,et al.  MiR-125b protects against ethanol-induced apoptosis in neural crest cells and mouse embryos by targeting Bak 1 and PUMA , 2015, Experimental Neurology.

[18]  R. Krumlauf,et al.  Defects in pathfinding by cranial neural crest cells in mice lacking the neuregulin receptor ErbB4 , 2000, Nature Cell Biology.

[19]  Jie Bai,et al.  MicroRNA-532 and microRNA-3064 inhibit cell proliferation and invasion by acting as direct regulators of human telomerase reverse transcriptase in ovarian cancer , 2017, PloS one.

[20]  Jianbo Wang,et al.  RNA helicase DDX5 is a p53-independent target of ARF that participates in ribosome biogenesis. , 2011, Cancer research.

[21]  Julie Josse,et al.  Principal component methods - hierarchical clustering - partitional clustering: why would we need to choose for visualizing data? , 2010 .

[22]  B. Chang,et al.  The role of FoxO4 in the relationship between alcohol-induced intestinal barrier dysfunction and liver injury. , 2013, International journal of molecular medicine.

[23]  Danilo Bzdok,et al.  Points of Significance: Statistics versus machine learning , 2018, Nature Methods.

[24]  C. Kiecker,et al.  The chick embryo as a model for the effects of prenatal exposure to alcohol on craniofacial development. , 2016, Developmental biology.

[25]  Susan M. Smith,et al.  CaMKII represses transcriptionally active β‐catenin to mediate acute ethanol neurodegeneration and can phosphorylate β‐catenin , 2014, Journal of neurochemistry.

[26]  S. Balaraman,et al.  MiR-153 targets the nuclear factor-1 family and protects against teratogenic effects of ethanol exposure in fetal neural stem cells , 2014, Biology Open.

[27]  D. Armant,et al.  Ethanol-induced cephalic apoptosis requires phospholipase C-dependent intracellular calcium signaling. , 2003, Alcoholism, clinical and experimental research.

[28]  T. Knudsen,et al.  Reprogramming of genetic networks during initiation of the Fetal Alcohol Syndrome , 2007, Developmental dynamics : an official publication of the American Association of Anatomists.

[29]  S. Mason,et al.  Epigenetic regulation of the neural transcriptome and alcohol interference during development , 2014, Front. Genet..

[30]  Manuel J. Aybar,et al.  Neurocristopathies: New insights 150 years after the neural crest discovery. , 2018, Developmental biology.

[31]  Mukesh Jain,et al.  NGS QC Toolkit: A Toolkit for Quality Control of Next Generation Sequencing Data , 2012, PloS one.

[32]  M. Berres,et al.  Alcohol‐mediated calcium signals dysregulate pro‐survival Snai2/PUMA/Bcl2 networks to promote p53‐mediated apoptosis in avian neural crest progenitors , 2019, Birth defects research.

[33]  Dvir Dahary,et al.  Biallelic SZT2 mutations cause infantile encephalopathy with epilepsy and dysmorphic corpus callosum. , 2013, American journal of human genetics.

[34]  SuzukiKoichi,et al.  The Novel Helicase helG (DHX30) Is Expressed During Gastrulation in Mice and Has a Structure Similar to a Human DExH Box Helicase , 2015 .

[35]  Hendrik C Korswagen,et al.  Functional Interaction Between ß-Catenin and FOXO in Oxidative Stress Signaling , 2005, Science.

[36]  C. Downing,et al.  Gene expression changes in C57BL/6J and DBA/2J mice following prenatal alcohol exposure. , 2012, Alcoholism, clinical and experimental research.

[37]  Dmitry Korkin,et al.  Biological classification with RNA-seq data: Can alternatively spliced transcript expression enhance machine learning classifiers? , 2018, RNA.

[38]  R. Harris,et al.  Sites of alcohol and volatile anaesthetic action on GABAA and glycine receptors , 1997, Nature.

[39]  P. Gunaratne,et al.  Identification of differentially expressed miRNAs in chicken lung and trachea with avian influenza virus infection by a deep sequencing approach , 2009, BMC Genomics.

[40]  Hui Zhou,et al.  Drastic expression change of transposon-derived piRNA-like RNAs and microRNAs in early stages of chicken embryos implies a role in gastrulation , 2012, RNA biology.

[41]  A. Hamosh,et al.  Haploinsufficiency of ZNF462 is associated with craniofacial anomalies, corpus callosum dysgenesis, ptosis, and developmental delay , 2017, European Journal of Human Genetics.

[42]  Susan M. Smith,et al.  CaMKII activation is a novel effector of alcohol’s neurotoxicity in neural crest stem/progenitor cells , 2011, Journal of neurochemistry.

[43]  W. Huber,et al.  Detecting differential usage of exons from RNA-seq data , 2012, Genome research.

[44]  C. Ampe,et al.  Beta-Actin Is Required for Proper Mouse Neural Crest Ontogeny , 2014, PloS one.

[45]  R. Pfundt,et al.  Loss-of-Function Mutations in YY1AP1 Lead to Grange Syndrome and a Fibromuscular Dysplasia-Like Vascular Disease. , 2017, American journal of human genetics.

[46]  M. Bronner,et al.  The Neural Crest Migrating into the Twenty-First Century. , 2016, Current topics in developmental biology.

[47]  J. Bezdek,et al.  VAT: a tool for visual assessment of (cluster) tendency , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[48]  Sibum Sung,et al.  RNA-seq assistant: machine learning based methods to identify more transcriptional regulated genes , 2018, BMC Genomics.

[49]  V. Pratt Biological Classification* , 1972, The British Journal for the Philosophy of Science.

[50]  M. Berres,et al.  Transcriptome Profiling Identifies Ribosome Biogenesis as a Target of Alcohol Teratogenicity and Vulnerability during Early Embryogenesis , 2017, PloS one.

[51]  M. Berres,et al.  Neural crest development in fetal alcohol syndrome. , 2014, Birth defects research. Part C, Embryo today : reviews.

[52]  Z. Ji,et al.  The novel helicase helG (DHX30) is expressed during gastrulation in mice and has a structure similar to a human DExH box helicase. , 2015, Stem cells and development.

[53]  Robert J. Moore,et al.  A microRNA catalog of the developing chicken embryo identified by a deep sequencing approach. , 2008, Genome research.

[54]  Lyubov Yevtushok,et al.  Plasma miRNA Profiles in Pregnant Women Predict Infant Outcomes following Prenatal Alcohol Exposure , 2016, PloS one.

[55]  J. Rosenfeld,et al.  De Novo Missense Mutations in DHX30 Impair Global Translation and Cause a Neurodevelopmental Disorder. , 2017, American journal of human genetics.

[56]  A. Noegel,et al.  CDK6 associates with the centrosome during mitosis and is mutated in a large Pakistani family with primary microcephaly. , 2013, Human molecular genetics.

[57]  F. Fuller-Pace The DEAD box proteins DDX5 (p68) and DDX17 (p72): multi-tasking transcriptional regulators. , 2013, Biochimica et biophysica acta.

[58]  Xianglin Shi,et al.  ErbB2 and p38γ MAPK mediate alcohol-induced increase in breast cancer stem cells and metastasis , 2016, Molecular Cancer.

[59]  W. Filipowicz,et al.  Regulation of mRNA translation and stability by microRNAs. , 2010, Annual review of biochemistry.