Computational Methods for Assessing Chromatin Hierarchy

The hierarchical organization of chromatin is known to associate with diverse cellular functions; however, the precise mechanisms and the 3D structure remain to be determined. With recent advances in high-throughput next generation sequencing (NGS) techniques, genome-wide profiling of chromatin structures is made possible. Here, we provide a comprehensive overview of NGS-based methods for profiling “higher-order” and “primary-order” chromatin structures from both experimental and computational aspects. Experimental requirements and considerations specific for each method were highlighted. For computational analysis, we summarized a common analysis strategy for both levels of chromatin assessment, focusing on the characteristic computing steps and the tools. The recently developed single-cell level techniques based on Hi-C and ATAC-seq present great potential to reveal cell-to-cell variability in chromosome architecture. A brief discussion on these methods in terms of experimental and data analysis features is included. We also touch upon the biological relevance of chromatin organization and how the combination with other techniques uncovers the underlying mechanisms. We conclude with a summary and our prospects on necessary improvements of currently available methods in order to advance understanding of chromatin hierarchy. Our review brings together the analyses of both higher- and primary-order chromatin structures, and serves as a roadmap when choosing appropriate experimental and computational methods for assessing chromatin hierarchy.

[1]  Jacob F. Degner,et al.  Sequence and Chromatin Accessibility Data Accurate Inference of Transcription Factor Binding from Dna Material Supplemental Open Access , 2022 .

[2]  Tom Misteli,et al.  Higher-order genome organization in human disease. , 2010, Cold Spring Harbor perspectives in biology.

[3]  Zhiping Weng,et al.  High-resolution mapping of chromatin packaging in mouse embryonic stem cells and sperm. , 2014, Developmental cell.

[4]  Elzo de Wit,et al.  4C technology: protocols and data analysis. , 2012, Methods in enzymology.

[5]  Ilya M. Flyamer,et al.  Single-nucleus Hi-C reveals unique chromatin reorganization at oocyte-to-zygote transition , 2017, Nature.

[6]  Andrew C. Adey,et al.  Rapid, low-input, low-bias construction of shotgun fragment libraries by high-density in vitro transposition , 2010, Genome Biology.

[7]  Gabor T. Marth,et al.  EagleView: a genome assembly viewer for next-generation sequencing technologies. , 2008, Genome research.

[8]  William Stafford Noble,et al.  Statistical confidence estimation for Hi-C data reveals regulatory chromatin contacts , 2014, Genome research.

[9]  Nathan C. Sheffield,et al.  Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity. , 2011, Genome research.

[10]  Howard Y. Chang,et al.  ATAC-see reveals the accessible genome by transposase-mediated imaging and sequencing , 2016, Nature Methods.

[11]  Zhaoyu Li,et al.  DANPOS: Dynamic analysis of nucleosome position and occupancy by sequencing , 2013, Genome research.

[12]  Erik Splinter,et al.  Looping and interaction between hypersensitive sites in the active beta-globin locus. , 2002, Molecular cell.

[13]  Marc A. Martí-Renom,et al.  Automatic analysis and 3D-modelling of Hi-C data using TADbit reveals structural features of the fly chromatin colors , 2017, PLoS Comput. Biol..

[14]  A. Tanay,et al.  Single-cell epigenomics: techniques and emerging applications , 2015, Nature Reviews Genetics.

[15]  V. Iyer,et al.  FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) isolates active regulatory elements from human chromatin. , 2007, Genome research.

[16]  S. K. Zaidi,et al.  C‐ing the Genome: A Compendium of Chromosome Conformation Capture Methods to Study Higher‐Order Chromatin Organization , 2016, Journal of cellular physiology.

[17]  Borbala Mifsud,et al.  GOTHiC, a probabilistic model to resolve complex biases and to identify real interactions in Hi-C data , 2017, PloS one.

[18]  Kim Rutherford,et al.  Artemis: sequence visualization and annotation , 2000, Bioinform..

[19]  Dustin E. Schones,et al.  Dynamic Regulation of Nucleosome Positioning in the Human Genome , 2008, Cell.

[20]  Terrence S. Furey,et al.  F-Seq: a feature density estimator for high-throughput sequence tags , 2008, Bioinform..

[21]  Guillaume J. Filion,et al.  Systematic Protein Location Mapping Reveals Five Principal Chromatin Types in Drosophila Cells , 2010, Cell.

[22]  J. Stamatoyannopoulos,et al.  Chromatin accessibility pre-determines glucocorticoid receptor binding patterns , 2011, Nature Genetics.

[23]  D. Frishman,et al.  Inter-Chromosomal Contact Networks Provide Insights into Mammalian Chromatin Organization , 2015, PloS one.

[24]  Clifford A. Meyer,et al.  Identifying and mitigating bias in next-generation sequencing methods for chromatin biology , 2014, Nature Reviews Genetics.

[25]  Benjamin J. Raphael,et al.  Identification of hierarchical chromatin domains , 2016, Bioinform..

[26]  Felix A. Klein,et al.  FourCSeq: analysis of 4C sequencing data , 2015, bioRxiv.

[27]  S. Grewal,et al.  Heterochromatin revisited , 2007, Nature Reviews Genetics.

[28]  Neva C. Durand,et al.  A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping , 2014, Cell.

[29]  Robert Patro,et al.  Identification of alternative topological domains in chromatin , 2014, Algorithms for Molecular Biology.

[30]  Tao Zhang,et al.  Genome-Wide Nucleosome Occupancy and Positioning and Their Impact on Gene Expression and Evolution in Plants1[OPEN] , 2015, Plant Physiology.

[31]  Morgan C. Giddings,et al.  Defining functional DNA elements in the human genome , 2014, Proceedings of the National Academy of Sciences.

[32]  M. Babu,et al.  A complex network framework for unbiased statistical analyses of DNA–DNA contact maps , 2012, Nucleic acids research.

[33]  I. Albert,et al.  Translational and rotational settings of H2A.Z nucleosomes across the Saccharomyces cerevisiae genome , 2007, Nature.

[34]  Jun Song,et al.  CEAS: cis-regulatory element annotation system , 2006, Nucleic Acids Res..

[35]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[36]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[37]  James T. Robinson,et al.  Juicebox Provides a Visualization System for Hi-C Contact Maps with Unlimited Zoom. , 2016, Cell systems.

[38]  R. Kornberg Chromatin structure: a repeating unit of histones and DNA. , 1974, Science.

[39]  C. Ponting,et al.  Sequencing depth and coverage: key considerations in genomic analyses , 2014, Nature Reviews Genetics.

[40]  T. Misteli,et al.  The emerging role of nuclear architecture in DNA repair and genome maintenance , 2009, Nature Reviews Molecular Cell Biology.

[41]  Yaniv Lubling,et al.  Single-cell Hi-C for genome-wide detection of chromatin interactions that occur simultaneously in a single cell , 2015, Nature Protocols.

[42]  Istvan Albert,et al.  GeneTrack - a genomic data processing and visualization framework , 2008, Bioinform..

[43]  S. Q. Xie,et al.  Complex multi-enhancer contacts captured by Genome Architecture Mapping (GAM) , 2017, Nature.

[44]  Improved DNase-seq protocol facilitates high resolution mapping of DNase I hypersensitive sites in roots in Arabidopsis thaliana , 2015, Plant Methods.

[45]  J. Dekker,et al.  Capturing Chromosome Conformation , 2002, Science.

[46]  E. Furlong,et al.  FourCSeq : analysis of 4 C sequencing data , 2015 .

[47]  C. Nusbaum,et al.  Chromosome Conformation Capture Carbon Copy (5C): a massively parallel solution for mapping interactions between genomic elements. , 2006, Genome research.

[48]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[49]  Yun Zhu,et al.  The pluripotent genome in three dimensions is shaped around pluripotency factors , 2013, Nature.

[50]  Chee Seng Chan,et al.  CTCF-Mediated Functional Chromatin Interactome in Pluripotent Cells , 2011, Nature Genetics.

[51]  A. Tanay,et al.  Probabilistic modeling of Hi-C contact maps eliminates systematic biases to characterize global chromosomal architecture , 2011, Nature Genetics.

[52]  William Stafford Noble,et al.  Massively multiplex single-cell Hi-C , 2016, Nature Methods.

[53]  Neva C. Durand,et al.  Juicer Provides a One-Click System for Analyzing Loop-Resolution Hi-C Experiments. , 2016, Cell systems.

[54]  Terrence S. Furey,et al.  Identifying and Characterizing Regulatory Sequences in the Human Genome with Chromatin Accessibility Assays , 2012, Genes.

[55]  C. Glass,et al.  Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. , 2010, Molecular cell.

[56]  S. Henikoff,et al.  Epigenome characterization at single base-pair resolution , 2011, Proceedings of the National Academy of Sciences.

[57]  M. Sung,et al.  Quantitative analysis of genome-wide chromatin remodeling. , 2012, Methods in molecular biology.

[58]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[59]  P. Zhou,et al.  Correlation Between DNase I Hypersensitive Site Distribution and Gene Expression in HeLa S3 Cells , 2012, PloS one.

[60]  Pietro Liò,et al.  FisHiCal: an R package for iterative FISH-based calibration of Hi-C data , 2014, Bioinform..

[61]  Shuang-yong Xu,et al.  NicE-seq: high resolution open chromatin profiling , 2017, Genome Biology.

[62]  Michael Brudno,et al.  SHRiMP: Accurate Mapping of Short Color-space Reads , 2009, PLoS Comput. Biol..

[63]  Holger Karas,et al.  TRANSFAC: a database on transcription factors and their DNA binding sites , 1996, Nucleic Acids Res..

[64]  Yuval Kluger,et al.  Close proximity to Igh is a contributing factor to AID-mediated translocations. , 2012, Molecular cell.

[65]  S. Elgin,et al.  Heterochromatin and gene regulation in Drosophila. , 1996, Current opinion in genetics & development.

[66]  A. Stark,et al.  Transcriptional enhancers: from properties to genome-wide predictions , 2014, Nature Reviews Genetics.

[67]  J. Ibrahim,et al.  ZINBA integrates local covariates with DNA-seq data to identify broad and narrow regions of enrichment, even within amplified genomic regions , 2011, Genome Biology.

[68]  William Stafford Noble,et al.  Global mapping of protein-DNA interactions in vivo by digital genomic footprinting , 2009, Nature Methods.

[69]  C. Ponting,et al.  Parallel single-cell sequencing links transcriptional and epigenetic heterogeneity , 2015, Nature Methods.

[70]  S. Elgin,et al.  The contradictory definitions of heterochromatin: transcription and silencing , 2006, Chromosoma.

[71]  Thomas A. Down,et al.  A Comparison of Peak Callers Used for DNase-Seq Data , 2014, bioRxiv.

[72]  P. Jeggo,et al.  The repair and signaling responses to DNA double-strand breaks. , 2013, Advances in genetics.

[73]  Katherine S. Pollard,et al.  Features that define the best ChIP-seq peak calling algorithms , 2016, Briefings Bioinform..

[74]  William Stafford Noble,et al.  Epigenetic priors for identifying active transcription factor binding sites , 2012, Bioinform..

[75]  William B. Langdon,et al.  Performance of genetic programming optimised Bowtie2 on genome comparison and analytic testing (GCAT) benchmarks , 2015, BioData Mining.

[76]  Howard Y. Chang,et al.  Single-cell chromatin accessibility reveals principles of regulatory variation , 2015, Nature.

[77]  J. Dekker,et al.  Condensin-Driven Remodeling of X-Chromosome Topology during Dosage Compensation , 2015, Nature.

[78]  J. Dekker,et al.  Hi-C: a comprehensive technique to capture the conformation of genomes. , 2012, Methods.

[79]  James Taylor,et al.  HiFive: a tool suite for easy and efficient HiC and 5C data analysis , 2014, Genome Biology.

[80]  Christopher D. Green,et al.  Improved nucleosome-positioning algorithm iNPS for accurate nucleosome positioning from sequencing data , 2014, Nature Communications.

[81]  Boris Lenhard,et al.  Cohesin-based chromatin interactions enable regulated gene expression within preexisting architectural compartments , 2013, Genome research.

[82]  Ivan Smirnov,et al.  Epigenomic annotation of genetic variants using the Roadmap Epigenome Browser , 2015, Nature Biotechnology.

[83]  Tom Misteli,et al.  Spatial proximity of translocation-prone gene loci in human lymphomas , 2003, Nature Genetics.

[84]  Ming Hu,et al.  HiCNorm: removing biases in Hi-C data via Poisson regression , 2012, Bioinform..

[85]  James O J Davies,et al.  How best to identify chromosomal interactions: a comparison of approaches , 2017, Nature Methods.

[86]  Jesse R. Dixon,et al.  Topological Domains in Mammalian Genomes Identified by Analysis of Chromatin Interactions , 2012, Nature.

[87]  Z. Weng,et al.  High-Resolution Mapping and Characterization of Open Chromatin across the Genome , 2008, Cell.

[88]  D. S. Gross,et al.  Nuclease hypersensitive sites in chromatin. , 1988, Annual review of biochemistry.

[89]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[90]  T. Richmond,et al.  The structure of DNA in the nucleosome core , 2003, Nature.

[91]  Nathan C. Sheffield,et al.  Single-cell epigenomic variability reveals functional cancer heterogeneity , 2017, Genome Biology.

[92]  Frank Grosveld,et al.  Multiplexed chromosome conformation capture sequencing for rapid genome-scale high-resolution detection of long-range chromatin interactions , 2013, Nature Protocols.

[93]  M. Buck,et al.  Chromatin accessibility: a window into the genome , 2014, Epigenetics & Chromatin.

[94]  S. Aerts,et al.  Discovery of Transcription Factors and Regulatory Regions Driving In Vivo Tumor Development by ATAC-seq and FAIRE-seq Open Chromatin Profiling , 2015, PLoS genetics.

[95]  J. Schug,et al.  Integration of ATAC-seq and RNA-seq identifies human alpha cell and beta cell signature genes , 2016, Molecular metabolism.

[96]  Michael Q. Zhang,et al.  Updates to the RMAP short-read mapping software , 2009, Bioinform..

[97]  L. Mirny,et al.  Iterative Correction of Hi-C Data Reveals Hallmarks of Chromosome Organization , 2012, Nature Methods.

[98]  Richard Bonneau,et al.  Interpreting 4C-Seq data: how far can we go? , 2014, Epigenomics.

[99]  B. Steensel,et al.  Nuclear organization of active and inactive chromatin domains uncovered by chromosome conformation capture–on-chip (4C) , 2006, Nature Genetics.

[100]  Howard Y. Chang,et al.  Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position , 2013, Nature Methods.

[101]  William K. M. Lai,et al.  BIOINFORMATICS APPLICATIONS , 2022 .

[102]  Amos Tanay,et al.  Robust 4C-seq data analysis to screen for regulatory DNA interactions , 2012, Nature Methods.

[103]  Wolfgang Huber,et al.  Enhancer loops appear stable during development and are associated with paused polymerase , 2014, Nature.

[104]  Kairong Cui,et al.  Genome-wide approaches to determining nucleosome occupancy in metazoans using MNase-Seq. , 2012, Methods in molecular biology.

[105]  Myong-Hee Sung,et al.  DNase footprint signatures are dictated by factor dynamics and DNA sequence. , 2014, Molecular cell.

[106]  Peter J. Park,et al.  HiGlass: Web-based visual comparison and exploration of genome interaction maps , 2017 .

[107]  Ruiqiang Li,et al.  SOAP: short oligonucleotide alignment program , 2008, Bioinform..

[108]  Andrew C. Adey,et al.  Multiplex single-cell profiling of chromatin accessibility by combinatorial cellular indexing , 2015, Science.

[109]  Qi Zheng,et al.  HIPPIE: a high-throughput identification pipeline for promoter interacting enhancer elements , 2015, Bioinform..

[110]  W. Reik,et al.  Single-cell epigenomics: powerful new methods for understanding gene regulation and cell identity , 2016, Genome Biology.

[111]  Jeff Vierstra,et al.  Coupling transcription factor occupancy to nucleosome architecture with DNase-FLASH , 2013, Nature Methods.

[112]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[113]  F. Grosveld,et al.  High-resolution identification of balanced and complex chromosomal rearrangements by 4C technology , 2009, Nature Methods.

[114]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[115]  Michael C. Schatz,et al.  CloudBurst: highly sensitive read mapping with MapReduce , 2009, Bioinform..

[116]  Robert J. Schmitz,et al.  Combining ATAC-seq with nuclei sorting for discovery of cis-regulatory regions in plant genomes , 2016, Nucleic acids research.