Evaluation and comparison of methods for recapitulation of 3D spatial chromatin structures

How chromosomes fold and how distal genomic elements interact with one another at a genomic scale have been actively pursued in the past decade following the seminal work describing the Chromosome Conformation Capture (3C) assay. Essentially, 3C-based technologies produce two-dimensional (2D) contact maps that capture interactions between genomic fragments. Accordingly, a plethora of analytical methods have been proposed to take a 2D contact map as input to recapitulate the underlying whole genome three-dimensional (3D) structure of the chromatin. However, their performance in terms of several factors, including data resolution and ability to handle contact map features, have not been sufficiently evaluated. This task is taken up in this article, in which we consider several recent and/or well-regarded methods, both optimization-based and model-based, for their aptness of producing 3D structures using contact maps generated based on a population of cells. These methods are evaluated and compared using both simulated and real data. Several criteria have been used. For simulated data sets, the focus is on accurate recapitulation of the entire structure given the existence of the gold standard. For real data sets, comparison with distances measured by Florescence in situ Hybridization and consistency with several genomic features of known biological functions are examined.

[1]  Shili Lin,et al.  Impact of data resolution on three-dimensional structure inference methods , 2016, BMC Bioinformatics.

[2]  Shili Lin,et al.  Statistical Inference on Three-Dimensional Structure of Genome by Truncated Poisson Architecture Model , 2015 .

[3]  Andre J. Faure,et al.  3D structure of individual mammalian genomes studied by single cell Hi-C , 2017, Nature.

[4]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[5]  A. Lesne,et al.  3D genome reconstruction from chromosomal contacts , 2014, Nature Methods.

[6]  M. Gobbi,et al.  Analysis of hundreds of cis-regulatory landscapes at high resolution in a single, high-throughput experiment , 2014, Nature Genetics.

[7]  Liang Niu,et al.  Statistical Models for Detecting Differential Chromatin Interactions Mediated by a Protein , 2014, PloS one.

[8]  Ming Hu,et al.  HiCNorm: removing biases in Hi-C data via Poisson regression , 2012, Bioinform..

[9]  Jesse R. Dixon,et al.  Topological Domains in Mammalian Genomes Identified by Analysis of Chromatin Interactions , 2012, Nature.

[10]  A. Tanay,et al.  Single cell Hi-C reveals cell-to-cell variability in chromosome structure , 2013, Nature.

[11]  William Stafford Noble,et al.  A statistical approach for inferring the 3D structure of the genome , 2014, Bioinform..

[12]  Reza Kalhor,et al.  Genome architectures revealed by tethered chromosome conformation capture and population-based modeling , 2011, Nature Biotechnology.

[13]  Shili Lin,et al.  A random effect model for reconstruction of spatial chromatin structure , 2017, Biometrics.

[14]  Enrique Blanco,et al.  3 D structure of individual mammalian genomes studied by single cell HiC , 2017 .

[15]  Kim-Chuan Toh,et al.  Inference of Spatial Organizations of Chromosomes Using Semi-definite Embedding Approach and Hi-C Data , 2013, RECOMB.

[16]  J. Dekker,et al.  Capturing Chromosome Conformation , 2002, Science.

[17]  Ming Hu,et al.  Bayesian Inference of Spatial Organizations of Chromosomes , 2013, PLoS Comput. Biol..

[18]  Mathieu Blanchette,et al.  Three-dimensional modeling of chromatin structure from interaction frequency data using Markov chain Monte Carlo sampling , 2011, BMC Bioinformatics.

[19]  Elzo de Wit,et al.  Capturing heterogeneity: single-cell structures of the 3D genome , 2017, Nature Structural &Molecular Biology.

[20]  J. Telenius,et al.  Multiplexed analysis of chromosome conformation at vastly improved sensitivity , 2015, Nature Methods.

[21]  A. Tanay,et al.  Probabilistic modeling of Hi-C contact maps eliminates systematic biases to characterize global chromosomal architecture , 2011, Nature Genetics.

[22]  William Stafford Noble,et al.  Massively multiplex single-cell Hi-C , 2016, Nature Methods.

[23]  Neva C. Durand,et al.  A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping , 2014, Cell.

[24]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .