Single-cell multi-scale footprinting reveals the modular organization of DNA regulatory elements

Cis-regulatory elements control gene expression and are dynamic in their structure, reflecting changes to the composition of diverse effector proteins over time1–3. Here we sought to connect the structural changes at cis-regulatory elements to alterations in cellular fate and function. To do this we developed PRINT, a computational method that uses deep learning to correct sequence bias in chromatin accessibility data and identifies multi-scale footprints of DNA-protein interactions. We find that multi-scale footprints enable more accurate inference of TF and nucleosome binding. Using PRINT with single-cell multi-omics, we discover wide-spread changes to the structure and function of candidate cis-regulatory elements (cCREs) across hematopoiesis, wherein nucleosomes slide, expose DNA for TF binding, and promote gene expression. Activity segmentation using the co-variance across cell states identifies “sub-cCREs” as modular cCRE subunits of regulatory DNA. We apply this single-cell and PRINT approach to characterize the age-associated alterations to cCREs within hematopoietic stem cells (HSCs). Remarkably, we find a spectrum of aging alterations among HSCs corresponding to a global gain of sub-cCRE activity while preserving cCRE accessibility. Collectively, we reveal the functional importance of cCRE structure across cell states, highlighting changes to gene regulation at single-cell and single-base-pair resolution.

[1]  M. A. Evans,et al.  Clonal Hematopoiesis, Somatic Mosaicism, and Age-Associated Disease. , 2022, Physiological reviews.

[2]  Y. Furukawa,et al.  Epigenetic traits inscribed in chromatin accessibility in aged hematopoietic stem cells , 2022, Nature Communications.

[3]  I. Pe’er,et al.  SEACells: Inference of transcriptional and epigenomic cellular states from single-cell genomics data , 2022, bioRxiv.

[4]  Mingbo Cheng,et al.  Chromatin-accessibility estimation from single-cell ATAC-seq data with scOpen , 2021, Nature Communications.

[5]  Howard Y. Chang,et al.  ArchR is a scalable software package for integrative single-cell chromatin accessibility analysis , 2021, Nature Genetics.

[6]  Rafael Riudavets Puig,et al.  UniBind: maps of high-confidence direct TF-DNA interactions across nine species , 2020, BMC Genomics.

[7]  T. Enver,et al.  Mitochondrial Potentiation Ameliorates Age-Related Heterogeneity in Hematopoietic Stem Cell Function. , 2020, Cell stem cell.

[8]  Fidencio J. Neri,et al.  Global reference mapping of human transcription factor footprints , 2020, Nature.

[9]  Michael J. Purcaro,et al.  Expanded encyclopaedias of DNA elements in the human and mouse genomes , 2020, Nature.

[10]  Aviv Regev,et al.  Chromatin Potential Identified by Shared Single-Cell Profiling of RNA and Chromatin , 2020, Cell.

[11]  J. Qu,et al.  The ageing epigenome and its rejuvenation , 2020, Nature Reviews Molecular Cell Biology.

[12]  J. Shendure,et al.  Towards a comprehensive catalogue of validated and target-linked human enhancers , 2020, Nature Reviews Genetics.

[13]  M. Looso,et al.  Beyond accessibility: ATAC-seq footprinting unravels kinetics of transcription factor binding during zygotic genome activation , 2019, bioRxiv.

[14]  B. Ebert,et al.  Clonal hematopoiesis in human aging and disease , 2019, Science.

[15]  Stefano Monti,et al.  hypeR: An R Package for Geneset Enrichment Workflows , 2019, bioRxiv.

[16]  S. Aerts,et al.  cisTopic: cis-regulatory topic modeling on single-cell ATAC-seq data , 2019, Nature Methods.

[17]  D. Pe’er,et al.  Characterization of cell fate probabilities in single-cell data with Palantir , 2019, Nature Biotechnology.

[18]  E. Cohen,et al.  The insulin/IGF signaling cascade modulates SUMOylation to regulate aging and proteostasis in Caenorhabditis elegans , 2018, eLife.

[19]  Christoph Hafemeister,et al.  Comprehensive integration of single cell data , 2018, bioRxiv.

[20]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection , 2018, J. Open Source Softw..

[21]  Marcel H. Schulz,et al.  Identification of transcription factor binding sites using ATAC-seq , 2018, bioRxiv.

[22]  Renee M. Brielmann,et al.  Mitochondrial Stress Restores the Heat Shock Response and Prevents Proteostasis Collapse during Aging. , 2017, Cell reports.

[23]  William J. Greenleaf,et al.  chromVAR: Inferring transcription factor-associated accessibility from single-cell epigenomic data , 2017, Nature Methods.

[24]  B. Franklin Pugh,et al.  Understanding nucleosome dynamics and their links to gene expression and DNA replication , 2017, Nature Reviews Molecular Cell Biology.

[25]  T. Yoshimori,et al.  Faculty Opinions recommendation of Autophagy maintains the metabolism and function of young and old stem cells. , 2017 .

[26]  Bin Xiong,et al.  Insights into Nucleosome Organization in Mouse Embryonic Stem Cells through Chemical Mapping , 2016, Cell.

[27]  J. Tyler,et al.  Epigenetics and aging , 2016, Science Advances.

[28]  Jean-Christophe Aude,et al.  Genome-wide nucleosome specificity and function of chromatin remodellers in ES cells , 2015, Nature.

[29]  Florian Hahne,et al.  Visualizing Genomic Data Using Gviz and Bioconductor , 2016, Statistical Genomics.

[30]  J. Dekker,et al.  Condensin-Driven Remodeling of X-Chromosome Topology during Dosage Compensation , 2015, Nature.

[31]  A. Regev,et al.  Spatial reconstruction of single-cell gene expression data , 2015 .

[32]  Gavin Sherlock,et al.  Structured nucleosome fingerprints enable high-resolution mapping of chromatin architecture within regulatory regions , 2015, bioRxiv.

[33]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[34]  Elhanan Borenstein,et al.  Conservation of trans-acting circuitry during mammalian regulatory evolution , 2014, Nature.

[35]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[36]  Steven Henikoff,et al.  High-resolution digital profiling of the epigenome , 2014, Nature Reviews Genetics.

[37]  B. Göttgens,et al.  Epigenomic profiling of young and aged HSCs reveals concerted changes during aging that reinforce self-renewal. , 2014, Cell stem cell.

[38]  A. Stark,et al.  Transcriptional enhancers: from properties to genome-wide predictions , 2014, Nature Reviews Genetics.

[39]  Shane J. Neph,et al.  Developmental Fate and Cellular Maturity Encoded in Human Regulatory DNA Landscapes , 2013, Cell.

[40]  J. Wysocka,et al.  Modification of enhancer chromatin: what, how, and why? , 2013, Molecular cell.

[41]  Shane J. Neph,et al.  Circuitry and Dynamics of Human Transcription Factor Regulatory Networks , 2012, Cell.

[42]  Linda Partridge,et al.  Ageing as a Risk Factor for Disease , 2012, Current Biology.

[43]  Shane J. Neph,et al.  An expansive human regulatory lexicon encoded in transcription factor footprints , 2012, Nature.

[44]  E. Furlong,et al.  Transcription factors: from enhancer binding to developmental control , 2012, Nature Reviews Genetics.

[45]  Joseph K. Pickrell,et al.  DNaseI sensitivity QTLs are a major determinant of human expression variation , 2011, Nature.

[46]  R. Sandberg,et al.  Sequentially acting Sox transcription factors in neural lineage development. , 2011, Genes & development.

[47]  Andrew C. Adey,et al.  Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition , 2010, Genome Biology.

[48]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[49]  C. Glass,et al.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. , 2010, Molecular cell.

[50]  Eran Segal,et al.  From DNA sequence to transcriptional behaviour: a quantitative approach , 2009, Nature Reviews Genetics.

[51]  William Stafford Noble,et al.  Global mapping of protein-DNA interactions in vivo by digital genomic footprinting , 2009, Nature Methods.

[52]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[53]  G. Blobel,et al.  Exchange of GATA factors mediates transitions in looped chromatin organization at a developmentally regulated gene locus. , 2008, Molecular cell.

[54]  T. Tullius,et al.  Footprinting protein–DNA complexes using the hydroxyl radical , 2008, Nature Protocols.

[55]  Chad A Shaw,et al.  Aging Hematopoietic Stem Cells Decline in Function and Exhibit Epigenetic Dysregulation , 2007, PLoS biology.

[56]  I. Weissman,et al.  Cell intrinsic alterations underlie hematopoietic stem cell aging. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[57]  Cynthia Kenyon,et al.  Regulation of Aging and Age-Related Disease by DAF-16 and Heat-Shock Factor , 2003, Science.

[58]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[59]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[60]  D. Galas,et al.  DNAse footprinting: a simple method for the detection of protein-DNA binding specificity. , 1978, Nucleic acids research.