Comparing chromatin contact maps at scale: methods and insights

Comparing chromatin contact maps is an essential step in quantifying how three-dimensional (3D) genome organization shapes development, evolution, and disease. However, no gold standard exists for comparing contact maps, and even simple methods often disagree. In this study, we propose novel comparison methods and evaluate them alongside existing approaches using genome-wide Hi-C data and 22,500 in silico predicted contact maps. We also quantify the robustness of methods to common sources of biological and technical variation, such as boundary size and noise. We find that simple difference-based methods such as mean squared error are suitable for initial screening, but biologically informed methods are necessary to identify why maps diverge and propose specific functional hypotheses. We provide a reference guide, codebase, and benchmark for rapidly comparing chromatin contact maps at scale to enable biological insights into the 3D organization of the genome.
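
As an illustration of the simple difference-based screening described above, the following is a minimal sketch of mean squared error (MSE) between two contact maps. It assumes both maps are dense, equally shaped NumPy arrays on a comparable scale. The function name `contact_map_mse` and the choice to skip bins that are non-finite in either map are assumptions made for this example and are not taken from the authors' codebase.

```python
import numpy as np


def contact_map_mse(map_a, map_b):
    """Mean squared error between two equally shaped contact maps.

    Bins that are NaN or infinite in either map (for example, masked or
    unmappable bins) are excluded from the average. This masking rule is
    an assumption of this sketch.
    """
    a = np.asarray(map_a, dtype=float)
    b = np.asarray(map_b, dtype=float)
    if a.shape != b.shape:
        raise ValueError(f"shape mismatch: {a.shape} vs {b.shape}")

    # Keep only bins that carry a usable value in both maps.
    valid = np.isfinite(a) & np.isfinite(b)
    if not valid.any():
        return float("nan")

    diff = a[valid] - b[valid]
    return float(np.mean(diff ** 2))


if __name__ == "__main__":
    # Toy symmetric maps standing in for a reference and a perturbed region.
    rng = np.random.default_rng(0)
    base = rng.random((8, 8))
    reference = (base + base.T) / 2

    perturbed = reference.copy()
    perturbed[2:5, 2:5] += 0.5   # hypothetical local gain of contacts
    perturbed[0, :] = np.nan     # hypothetical masked bin (row and column)
    perturbed[:, 0] = np.nan

    print("MSE, identical maps:", contact_map_mse(reference, reference))
    print("MSE, perturbed map: ", contact_map_mse(reference, perturbed))
```

A score like this can rank many map pairs quickly, but it reports only how much two maps differ, not which feature changed, which is consistent with the abstract's point that biologically informed methods are needed to explain why maps diverge.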
