Free energy-based model of CTCF-mediated chromatin looping in the human genome.

In recent years, high-throughput techniques have revealed considerable structural organization of the human genome with diverse regions of the chromatin interacting with each other in the form of loops. Some of these loops are quite complex and may encompass regions comprised of many interacting chain segments around a central locus. Popular techniques for extracting this information are chromatin interaction analysis by paired-end tag sequencing (ChIA-PET) and high-throughput chromosome conformation capture (Hi-C). Here, we introduce a physics-based method to predict the three-dimensional structure of chromatin from population-averaged ChIA-PET data. The approach uses experimentally-validated data from human B-lymphoblastoid cells to generate 2D meta-structures of chromatin using a dynamic programming algorithm that explores the chromatin free energy landscape. By generating both optimal and suboptimal meta-structures we can calculate both the free energy and additionally the relative thermodynamic probability. A 3D structure prediction program with applied restraints then can be used to generate the tertiary structures. The main advantage of this approach for population-averaged experimental data is that it provides a way to distinguish between the principal and the spurious contacts. The program source-code is available at https://github.com/plewczynski/looper.

[1]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[2]  Dariusz Plewczynski,et al.  Spatial chromatin architecture alteration by structural variations in human genomes at the population scale , 2019, Genome Biology.

[3]  Miao Yu,et al.  Mapping of long-range chromatin interactions by proximity ligation-assisted ChIP-seq , 2016, Cell Research.

[4]  E. Lieb,et al.  A Fundamental Property of Quantum-Mechanical Entropy , 1973 .

[5]  Peter G Wolynes,et al.  Genomic Energy Landscapes. , 2017, Biophysical journal.

[6]  R. Bellman Dynamic programming. , 1957, Science.

[7]  Matthias Reisser,et al.  Direct Observation of Cell-Cycle-Dependent Interactions between CTCF and Chromatin , 2017, Biophysical journal.

[8]  Daniel Capurso,et al.  Multiplex chromatin interactions with single-molecule precision , 2019, Nature.

[9]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[10]  W. Dawson,et al.  A physical origin for functional domain structure in nucleic acids as evidenced by cross-linking entropy: I. , 2001, Journal of theoretical biology.

[11]  J. Onuchic,et al.  Theory of Protein Folding This Review Comes from a Themed Issue on Folding and Binding Edited Basic Concepts Perfect Funnel Landscapes and Common Features of Folding Mechanisms , 2022 .

[12]  Y. Ruan,et al.  ChIP‐based methods for the identification of long‐range chromatin interactions , 2009, Journal of cellular biochemistry.

[13]  I. Tinoco,et al.  Estimation of Secondary Structure in Ribonucleic Acids , 1971, Nature.

[14]  L. Mirny,et al.  The 3D Genome as Moderator of Chromosomal Communication , 2016, Cell.

[15]  E. Lieb,et al.  Proof of the strong subadditivity of quantum‐mechanical entropy , 1973 .

[16]  Sigal Shachar,et al.  3D Chromosome Regulatory Landscape of Human Pluripotent Cells. , 2016, Cell stem cell.

[17]  Jesse R. Dixon,et al.  Topological Domains in Mammalian Genomes Identified by Analysis of Chromatin Interactions , 2012, Nature.

[18]  Wilma K Olson,et al.  Contributions of Sequence to the Higher-Order Structures of DNA. , 2017, Biophysical journal.

[19]  Yann Ponty,et al.  VARNA: Interactive drawing and editing of the RNA secondary structure , 2009, Bioinform..

[20]  A. Visel,et al.  Disruptions of Topological Chromatin Domains Cause Pathogenic Rewiring of Gene-Enhancer Interactions , 2015, Cell.

[21]  Janusz M Bujnicki,et al.  Computational modeling of protein-RNA complex structures. , 2014, Methods.

[22]  A. H. Sparrow,et al.  Correlations between nuclear volume, cell volume and DNA content in meristematic cells of herbaceous angiosperms , 1973, Experientia.

[23]  Dariusz M Plewczynski,et al.  CTCF-Mediated Human 3D Genome Architecture Reveals Chromatin Topology for Transcription , 2015, Cell.

[24]  Simona Bianco,et al.  Polymer Physics of the Large-Scale Structure of Chromatin. , 2016, Methods in molecular biology.

[25]  J. SantaLucia,et al.  The thermodynamics of DNA structural motifs. , 2004, Annual review of biophysics and biomolecular structure.

[26]  P. Flory Principles of polymer chemistry , 1953 .

[27]  Y. Kafri,et al.  Dynamics of DNA melting , 2008, Journal of physics. Condensed matter : an Institute of Physics journal.

[28]  E. Liu,et al.  Next-generation DNA sequencing of paired-end tags (PET) for transcriptome and genome analyses. , 2009, Genome research.

[29]  John F. Marko,et al.  Self-organization of domain structures by DNA-loop-extruding enzymes , 2012, Nucleic acids research.

[30]  Neva C. Durand,et al.  A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping , 2014, Cell.

[31]  J. Bujnicki,et al.  Coarse-grained modeling of RNA 3D structure. , 2016, Methods.

[32]  Niels Galjart,et al.  Choice of binding sites for CTCFL compared to CTCF is driven by chromatin and by sequence preference , 2018, Nucleic acids research.

[33]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[34]  J. McCaskill The equilibrium partition function and base pair binding probabilities for RNA secondary structure , 1990, Biopolymers.

[35]  Howard Y. Chang,et al.  Single-cell chromatin accessibility reveals principles of regulatory variation , 2015, Nature.

[36]  Kazuhiro Maeshima,et al.  Chromatin structure: does the 30-nm fibre exist in vivo? , 2010, Current opinion in cell biology.

[37]  Shihua He,et al.  Chromatin organization and nuclear microenvironments in cancer cells , 2008, Journal of cellular biochemistry.

[38]  H. Stanley,et al.  Statistical physics of macromolecules , 1995 .

[39]  J. Onuchic,et al.  Funnels, pathways, and the energy landscape of protein folding: A synthesis , 1994, Proteins.

[40]  Zhonghui Tang,et al.  Methods for comparative ChIA-PET and Hi-C data analysis. , 2019, Methods.

[41]  Dariusz Plewczynski,et al.  An integrated 3-Dimensional Genome Modeling Engine for data-driven simulation of spatial genome organization , 2016, Genome research.

[42]  A. West,et al.  Insulators: many functions, many mechanisms. , 2002, Genes & development.

[43]  J. Davie,et al.  Nuclear matrix, dynamic histone acetylation and transcriptionally active chromatin , 1997, Molecular Biology Reports.

[44]  L. Mirny,et al.  Formation of Chromosomal Domains in Interphase by Loop Extrusion , 2015, bioRxiv.

[45]  S. Zhong,et al.  3D Chromatin Architecture of Large Plant Genomes Determined by Local A/B Compartments. , 2017, Molecular plant.

[46]  Timothy C Elston,et al.  Multiscale approaches for studying energy transduction in dynein. , 2009, Physical chemistry chemical physics : PCCP.

[47]  Ilya M. Flyamer,et al.  Active chromatin and transcription play a key role in chromosome partitioning into topologically associating domains , 2016, Genome research.

[48]  Thomas G. Gilgenast,et al.  Local Genome Topology Can Exhibit an Incompletely Rewired 3D-Folding State during Somatic Cell Reprogramming. , 2016, Cell stem cell.

[49]  Shihua He,et al.  Nuclear organization and chromatin dynamics--Sp1, Sp3 and histone deacetylases. , 2008, Advances in enzyme regulation.

[50]  William Stafford Noble,et al.  Integrative annotation of chromatin elements from ENCODE data , 2012, Nucleic acids research.