Chromosome3D: reconstructing three-dimensional chromosomal structures from Hi-C interaction frequency data using distance geometry simulated annealing

BackgroundReconstructing three-dimensional structures of chromosomes is useful for visualizing their shapes in a cell and interpreting their function. In this work, we reconstruct chromosomal structures from Hi-C data by translating contact counts in Hi-C data into Euclidean distances between chromosomal regions and then satisfying these distances using a structure reconstruction method rigorously tested in the field of protein structure determination.ResultsWe first evaluate the robustness of the overall reconstruction algorithm on noisy simulated data at various levels of noise by comparing with some of the state-of-the-art reconstruction methods. Then, using simulated data, we validate that Spearman’s rank correlation coefficient between pairwise distances in the reconstructed chromosomal structures and the experimental chromosomal contact counts can be used to find optimum conversion rules for transforming interaction frequencies to wish distances. This strategy is then applied to real Hi-C data at chromosome level for optimal transformation of interaction frequencies to wish distances and for ranking and selecting structures. The chromosomal structures reconstructed from a real-world human Hi-C dataset by our method were validated by the known two-compartment feature of the human chromosome organization. We also show that our method is robust with respect to the change of the granularity of Hi-C data, and consistently produces similar structures at different chromosomal resolutions.ConclusionChromosome3D is a robust method of reconstructing chromosome three-dimensional models using distance restraints obtained from Hi-C interaction frequency data. It is available as a web application and as an open source tool at http://sysbio.rnet.missouri.edu/chromosome3d/.

[1]  Reza Kalhor,et al.  Genome architectures revealed by tethered chromosome conformation capture and population-based modeling , 2011, Nature Biotechnology.

[2]  A. Gronenborn,et al.  Determination of three‐dimensional structures of proteins from interproton distance data by hybrid distance geometry‐dynamical simulated annealing calculations , 1988, FEBS letters.

[3]  S. Gasser,et al.  Visualizing Chromatin Dynamics in Interphase Nuclei , 2002, Science.

[4]  Mathieu Blanchette,et al.  Three-dimensional modeling of chromatin structure from interaction frequency data using Markov chain Monte Carlo sampling , 2011, BMC Bioinformatics.

[5]  Jorge Nocedal,et al.  On the limited memory BFGS method for large scale optimization , 1989, Math. Program..

[6]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[7]  J. Lawrence,et al.  The three-dimensional folding of the α-globin gene domain reveals formation of chromatin globules , 2011, Nature Structural &Molecular Biology.

[8]  Jianlin Cheng,et al.  MOGEN: a tool for reconstructing 3D models of genomes from chromosomal conformation capturing data , 2016, Bioinform..

[9]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[10]  A. Brunger Version 1.2 of the Crystallography and NMR system , 2007, Nature Protocols.

[11]  D. Heermann,et al.  Spatially confined folding of chromatin in the interphase nucleus , 2009, Proceedings of the National Academy of Sciences.

[12]  Chenchen Zou,et al.  HSA: integrating multi-track Hi-C data for genome-scale reconstruction of 3D chromatin structure , 2016, Genome Biology.

[13]  William Stafford Noble,et al.  A statistical approach for inferring the 3D structure of the genome , 2014, Bioinform..

[14]  Jianlin Cheng,et al.  Large-scale reconstruction of 3D structures of human chromosomes from chromosomal contact data , 2014, Nucleic acids research.

[15]  A. Lesne,et al.  3D genome reconstruction from chromosomal contacts , 2014, Nature Methods.

[16]  Neva C. Durand,et al.  A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping , 2014, Cell.

[17]  T. Cremer,et al.  Dynamic genome architecture in the nuclear space: regulation of gene expression in three dimensions , 2007, Nature Reviews Genetics.

[18]  Timothy F. Havel,et al.  The theory and practice of distance geometry , 1983, Bulletin of Mathematical Biology.

[19]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[20]  Daniel Ruiz,et al.  A Fast Algorithm for Matrix Balancing , 2013, Web Information Retrieval and Linear Algebra Algorithms.

[21]  William Stafford Noble,et al.  A Three-Dimensional Model of the Yeast Genome , 2010, Nature.

[22]  Kim-Chuan Toh,et al.  3D Chromosome Modeling with Semi-Definite Programming and Hi-C Data , 2013, J. Comput. Biol..

[23]  Ming Hu,et al.  Bayesian Inference of Spatial Organizations of Chromosomes , 2013, PLoS Comput. Biol..