Integrated gene landscapes uncover multi-layered roles of repressive histone marks during mouse CNS development

A prominent aspect of most, if not all, central nervous systems (CNSs) is that anterior regions (brain) are larger than posterior ones (spinal cord). Studies in Drosophila and mouse have revealed that the Polycomb Repressor Complex 2 (PRC2), a protein complex responsible for applying key repressive histone modifications, acts by several mechanisms to promote anterior CNS expansion. However, it is unclear what the full spectrum of PRC2 action is during embryonic CNS development and how PRC2 integrates with the epigenetic landscape. We removed PRC2 function from the developing mouse CNS, by mutating the key gene Eed, and generated spatio-temporal transcriptomic data. To decode the role of PRC2, we developed a method that incorporates standard statistical analyses with probabilistic deep learning to integrate the transcriptomic response to PRC2 inactivation with epigenetic information from ENCODE. This multi-variate analysis corroborates the central involvement of PRC2 in anterior CNS expansion, and reveals layered regulation via PRC2. These findings uncover a differential logic for the role of PRC2 upon functionally distinct gene categories that drive CNS anterior expansion. To support the analysis of emerging multi-modal datasets, we provide a novel bioinformatics package that integrates transcriptomic and epigenetic datasets to identify regulatory underpinnings of heterogeneous biological processes.

[1]  Zhongming Zhao,et al.  Decoding regulatory structures and features from epigenomics profiles: a Roadmap-ENCODE Variational Auto-Encoder (RE-VAE) model. , 2021, Methods.

[2]  Kyle J. Gaulton,et al.  An atlas of dynamic chromatin landscapes in mouse fetal development , 2020, Nature.

[3]  Jaime Fern'andez del R'io,et al.  Array programming with NumPy , 2020, Nature.

[4]  Sebastian Tschiatschek,et al.  VAEM: a Deep Generative Model for Heterogeneous Mixed Type Data , 2020, NeurIPS.

[5]  Francisco S. Melo,et al.  MHVAE: a Human-Inspired Deep Hierarchical Generative Model for Multimodal Representation Learning , 2020, ArXiv.

[6]  Sonika Tyagi,et al.  Integrative computational epigenomics to build data-driven gene regulation hypotheses , 2020, GigaScience.

[7]  Ronald R. Coifman,et al.  Visualizing structure and transitions in high-dimensional biological data , 2019, Nature Biotechnology.

[8]  Bing Yao,et al.  The Dynamic Partnership of Polycomb and Trithorax in Brain Development and Diseases , 2019, Epigenomes.

[9]  Yanni Chen,et al.  An Allosteric PRC2 Inhibitor Targeting EED Suppresses Tumor Progression by Modulating the Immune Response , 2019, Cancer Research.

[10]  Zohreh Shams,et al.  Variational Autoencoders for Cancer Data Integration: Design Principles and Computational Practice , 2019, bioRxiv.

[11]  S. Thor,et al.  Anterior CNS expansion driven by brain transcription factors , 2019, eLife.

[12]  L. Nguyen,et al.  Temporal patterning of apical progenitors and their daughter neurons in the developing neocortex , 2019, Science.

[13]  S. Thor,et al.  Brain expansion promoted by polycomb-mediated anterior enhancement of a neural stem cell proliferation program , 2019, PLoS biology.

[14]  Stephan J Sanders,et al.  Integrative functional genomic analysis of human brain development and neuropsychiatric risks , 2018, Science.

[15]  Casper Kaae Sønderby,et al.  scVAE: Variational auto-encoders for single-cell gene expression data , 2018, bioRxiv.

[16]  J. Marioni,et al.  Multi‐Omics Factor Analysis—a framework for unsupervised integration of multi‐omics data sets , 2018, Molecular systems biology.

[17]  S. Thor,et al.  Evolutionarily conserved anterior expansion of the central nervous system promoted by a common PcG-Hox program , 2018, Development.

[18]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction , 2018, ArXiv.

[19]  Daniel W. Hagey,et al.  SOX2 regulates common and specific stem cell features in the CNS and endoderm derived organs , 2018, PLoS genetics.

[20]  Evan Bolton,et al.  Database resources of the National Center for Biotechnology Information , 2017, Nucleic Acids Res..

[21]  H. Bourbon,et al.  Genome Regulation by Polycomb and Trithorax: 70 Years and Counting , 2017, Cell.

[22]  Casey S. Greene,et al.  Extracting a Biologically Relevant Latent Space from Cancer Transcriptomes with Variational Autoencoders , 2017, bioRxiv.

[23]  I. Cobeta,et al.  Anterior-Posterior Gradient in Neural Stem and Daughter Cell Proliferation Governed by Spatial and Temporal Hox Control , 2017, Current Biology.

[24]  Masahiro Suzuki,et al.  Joint Multimodal Learning with Deep Generative Models , 2016, ICLR.

[25]  Alexey Sergushichev,et al.  An algorithm for fast preranked gene set enrichment analysis using cumulative statistic calculation , 2016 .

[26]  Måns Magnusson,et al.  MultiQC: summarize analysis results for multiple tools and samples in a single report , 2016, Bioinform..

[27]  A. Shilatifard,et al.  Epigenetic balance of gene expression by Polycomb and COMPASS families , 2016, Science.

[28]  V. Sartorelli,et al.  Polycomb Ezh2 controls the fate of GABAergic neurons in the embryonic cerebellum , 2016, Development.

[29]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[30]  X. de la Cruz,et al.  EZH2 regulates neuroepithelium structure and neuroblast proliferation by repressing p21 , 2016, Open Biology.

[31]  D. Arendt,et al.  From nerve net to nerve ring, nerve cord and brain — evolution of the nervous system , 2015, Nature Reviews Neuroscience.

[32]  D. Schübeler,et al.  Loss of Ezh2 promotes a midbrain-to-forebrain identity switch by direct gene derepression and Wnt-dependent regulation , 2015, BMC Biology.

[33]  R. Paro,et al.  Trithorax and Polycomb group-dependent regulation: a tale of opposing activities , 2015, Development.

[34]  Steven L Salzberg,et al.  HISAT: a fast spliced aligner with low memory requirements , 2015, Nature Methods.

[35]  C. Nielsen Larval nervous systems: true larval and precocious adult , 2015, Journal of Experimental Biology.

[36]  F. Fazal,et al.  Preconditioning with Endoplasmic Reticulum Stress Ameliorates Endothelial Cell Inflammation , 2014, PloS one.

[37]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[38]  L. Ringrose,et al.  What are memories made of? How Polycomb and Trithorax proteins mediate epigenetic memory , 2014, Nature Reviews Molecular Cell Biology.

[39]  S. Orkin,et al.  Polycomb repressive complex 2 regulates normal hematopoietic stem cell function in a developmental-stage-specific manner. , 2014, Cell stem cell.

[40]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[41]  D. Arendt,et al.  The bilaterian forebrain: an evolutionary chimaera , 2013, Current Opinion in Neurobiology.

[42]  João E. Carvalho,et al.  Evolution of bilaterian central nervous systems: a single origin? , 2013, EvoDevo.

[43]  A. Schnittger,et al.  Cell cycle control across the eukaryotic kingdom. , 2013, Trends in cell biology.

[44]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[45]  Johannes E. Schindelin,et al.  Fiji: an open-source platform for biological-image analysis , 2012, Nature Methods.

[46]  C. Nielsen How to make a protostome , 2012, Invertebrate Systematics.

[47]  Guangchuang Yu,et al.  clusterProfiler: an R package for comparing biological themes among gene clusters. , 2012, Omics : a journal of integrative biology.

[48]  Manolis Kellis,et al.  ChromHMM: automating chromatin-state discovery and characterization , 2012, Nature Methods.

[49]  Marcel Martin Cutadapt removes adapter sequences from high-throughput sequencing reads , 2011 .

[50]  Xiaoqin Xu,et al.  Genome-wide Analysis of Transcription Factor E2F1 Mutant Proteins Reveals That N- and C-terminal Protein Interaction Domains Do Not Participate in Targeting E2F1 to the Human Genome , 2011, The Journal of Biological Chemistry.

[51]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[52]  P. Gruss,et al.  Haploinsufficiency of the murine polycomb gene Suz12 results in diverse malformations of the brain and neural tube , 2009, Disease Models & Mechanisms.

[53]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[54]  S. Nishikawa,et al.  Neuroepithelial Cells Supply an Initial Transient Wave of MSC Differentiation , 2007, Cell.

[55]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[56]  N. Dyson,et al.  The E2F transcriptional network: old acquaintances with new faces , 2005, Oncogene.

[57]  S. Thor,et al.  Genetic mechanisms controlling anterior expansion of the central nervous system. , 2020, Current topics in developmental biology.

[58]  Brock C. Christensen,et al.  A New Dimension of Breast Cancer Epigenetics - Applications of Variational Autoencoders with DNA Methylation , 2018, BIOINFORMATICS.

[59]  M. Wegner,et al.  From CNS stem cells to neurons and glia: Sox for everyone , 2014, Cell and Tissue Research.

[60]  Adam B. Olshen,et al.  Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis , 2009, Bioinform..

[61]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .