Exploring the 2D and 3D structural properties of topologically associating domains

Topologically associating domains (TADs) are genomic regions with varying lengths. The interactions within TADs are more frequent than those between different TADs. TADs or sub-TADs are considered the structural and functional units of the mammalian genomes. Although TADs are important for understanding how genomes function, we have limited knowledge about their 3D structural properties. In this study, we designed and benchmarked three metrics for capturing the three-dimensional and two-dimensional structural signatures of TADs, which can help better understand TADs’ structural properties and the relationships between structural properties and genetic and epigenetic features. The first metric for capturing 3D structural properties is radius of gyration, which in this study is used to measure the spatial compactness of TADs. The mass value of each DNA bead in a 3D structure is novelly defined as one or more genetic or epigenetic feature(s). The second metric is folding degree. The last metric is exponent parameter, which is used to capture the 2D structural properties based on TADs’ Hi-C contact matrices. In general, we observed significant correlations between the three metrics and the genetic and epigenetic features. We made the same observations when using H3K4me3, transcription start sites, and RNA polymerase II to represent the mass value in the modified radius-of-gyration metric. Moreover, we have found that the TADs in the clusters of depleted chromatin states apparently correspond to smaller exponent parameters and larger radius of gyrations. In addition, a new objective function of multidimensional scaling for modelling chromatin or TADs 3D structures was designed and benchmarked, which can handle the DNA bead-pairs with zero Hi-C contact values. The web server for reconstructing chromatin 3D structures using multiple different objective functions and the related source code are publicly available at http://dna.cs.miami.edu/3DChrom/.

[1]  Kim-Chuan Toh,et al.  3D Chromosome Modeling with Semi-Definite Programming and Hi-C Data , 2013, J. Comput. Biol..

[2]  J. Dekker,et al.  Capturing Chromosome Conformation , 2002, Science.

[3]  C. Nusbaum,et al.  Chromosome Conformation Capture Carbon Copy (5C): a massively parallel solution for mapping interactions between genomic elements. , 2006, Genome research.

[4]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[5]  Ming Hu,et al.  Bayesian Inference of Spatial Organizations of Chromosomes , 2013, PLoS Comput. Biol..

[6]  M. Randic,et al.  On Characterization of 3D Molecular Structure , 2002 .

[7]  Hao Zhu,et al.  SCL: a lattice-based approach to infer 3D chromosome structures from single-cell Hi-C data , 2019, Bioinform..

[8]  M. Martí-Renom,et al.  Chromatin and RNA Maps Reveal Regulatory Long Noncoding RNAs in Mouse , 2015, Molecular and Cellular Biology.

[9]  B. Steensel,et al.  Nuclear organization of active and inactive chromatin domains uncovered by chromosome conformation capture–on-chip (4C) , 2006, Nature Genetics.

[10]  William Stafford Noble,et al.  A statistical approach for inferring the 3D structure of the genome , 2014, Bioinform..

[11]  Manolis Kellis,et al.  ChromHMM: automating chromatin-state discovery and characterization , 2012, Nature Methods.

[12]  W. Kabsch A discussion of the solution for the best rotation to relate two sets of vectors , 1978 .

[13]  Hilde van der Togt,et al.  Publisher's Note , 2003, J. Netw. Comput. Appl..

[14]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[15]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  E. Lander,et al.  The Xist lncRNA Exploits Three-Dimensional Genome Architecture to Spread Across the X Chromosome , 2013, Science.

[17]  Jesse R. Dixon,et al.  Topological Domains in Mammalian Genomes Identified by Analysis of Chromatin Interactions , 2012, Nature.

[18]  A. Tanay,et al.  Single cell Hi-C reveals cell-to-cell variability in chromosome structure , 2013, Nature.

[19]  V. Corces,et al.  CTCF: Master Weaver of the Genome , 2009, Cell.

[20]  K. Sandhu,et al.  Circular chromosome conformation capture (4C) uncovers extensive networks of epigenetically regulated intra- and interchromosomal interactions , 2006, Nature Genetics.

[21]  William Stafford Noble,et al.  Three-dimensional modeling of the P. falciparum genome during the erythrocytic cycle reveals a strong connection between genome architecture and gene expression , 2014, Genome research.

[22]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[23]  Ming Hu,et al.  Understanding spatial organizations of chromosomes via statistical analysis of Hi-C data , 2013, Quantitative Biology.

[24]  Hao Zhu SCL: A Lattice-Based Approach to Infer Three-Dimensional Chromosome Structures from Single-Cell Hi-C Data , 2019 .

[25]  O. V. Galzitskaya,et al.  Radius of gyration as an indicator of protein structure compactness , 2008, Molecular Biology.

[26]  A. Tanay,et al.  Probabilistic modeling of Hi-C contact maps eliminates systematic biases to characterize global chromosomal architecture , 2011, Nature Genetics.

[27]  Ernesto Estrada,et al.  Characterization of the folding degree of proteins , 2002, Bioinform..

[28]  Renzhi Cao,et al.  The Properties of Genome Conformation and Spatial Gene Interaction and Regulation Networks of Normal and Malignant Human Cell Types , 2013, PloS one.

[29]  L. Mirny,et al.  Iterative Correction of Hi-C Data Reveals Hallmarks of Chromosome Organization , 2012, Nature Methods.

[30]  William Stafford Noble,et al.  A Three-Dimensional Model of the Yeast Genome , 2010, Nature.