Producing genome structure populations with the dynamic and automated PGS software

Chromosome conformation capture technologies such as Hi-C are widely used to investigate the spatial organization of genomes. Because genome structures can vary considerably between individual cells of a population, interpreting ensemble-averaged Hi-C data can be challenging, in particular for long-range and interchromosomal interactions. We pioneered a probabilistic approach for the generation of a population of distinct diploid 3D genome structures consistent with all the chromatin-chromatin interaction probabilities from Hi-C experiments. Each structure in the population is a physical model of the genome in 3D. Analysis of these models yields new insights into the causes and the functional properties of the genome's organization in space and time. We provide a user-friendly software package, called PGS, which runs on local machines (for practice runs) and high-performance computing platforms. PGS takes a genome-wide Hi-C contact frequency matrix, along with information about genome segmentation, and produces an ensemble of 3D genome structures entirely consistent with the input. The software automatically generates an analysis report, and provides tools to extract and analyze the 3D coordinates of specific domains. Basic Linux command-line knowledge is sufficient for using this software. A typical running time of the pipeline is ∼3 d with 300 cores on a computer cluster to generate a population of 1,000 diploid genome structures at topological-associated domain (TAD)-level resolution.

[1]  A. Tanay,et al.  Three-Dimensional Folding and Functional Organization Principles of the Drosophila Genome , 2012, Cell.

[2]  L. Mirny,et al.  Iterative Correction of Hi-C Data Reveals Hallmarks of Chromosome Organization , 2012, Nature Methods.

[3]  P. Wolynes,et al.  Topology, structures, and energy landscapes of human chromosomes , 2015, Proceedings of the National Academy of Sciences.

[4]  Ming Hu,et al.  Bayesian Inference of Spatial Organizations of Chromosomes , 2013, PLoS Comput. Biol..

[5]  Ryan K. Dale,et al.  CTCF-mediated transcriptional regulation through cell type-specific chromosome organization in the β-globin locus , 2012, Nucleic acids research.

[6]  Howard Y. Chang,et al.  Single-cell chromatin accessibility reveals principles of regulatory variation , 2015, Nature.

[7]  Jinbo Xu,et al.  Inferential modeling of 3D chromatin structure , 2015, Nucleic acids research.

[8]  Jonas Paulsen,et al.  Chrom3D: three-dimensional genome modeling from Hi-C and nuclear lamin-genome contacts , 2017, Genome Biology.

[9]  Reza Kalhor,et al.  Genome architectures revealed by tethered chromosome conformation capture and population-based modeling , 2011, Nature Biotechnology.

[10]  William Stafford Noble,et al.  Three-dimensional modeling of the P. falciparum genome during the erythrocytic cycle reveals a strong connection between genome architecture and gene expression , 2014, Genome research.

[11]  Liang-Yu Fu,et al.  The sequencing bias relaxed characteristics of Hi-C derived data and implications for chromatin 3D modeling , 2013, Nucleic acids research.

[12]  Kim-Chuan Toh,et al.  3D Chromosome Modeling with Semi-Definite Programming and Hi-C Data , 2013, J. Comput. Biol..

[13]  Mario Nicodemi,et al.  Complexity of chromatin folding is captured by the strings and binders switch model , 2012, Proceedings of the National Academy of Sciences.

[14]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[15]  Jianlin Cheng,et al.  Large-scale reconstruction of 3D structures of human chromosomes from chromosomal contact data , 2014, Nucleic acids research.

[16]  Mathieu Blanchette,et al.  Three-dimensional modeling of chromatin structure from interaction frequency data using Markov chain Monte Carlo sampling , 2011, BMC Bioinformatics.

[17]  B. Chait,et al.  Determining the architectures of macromolecular assemblies , 2007, Nature.

[18]  Peng Yin,et al.  Single-molecule super-resolution imaging of chromosomes and in situ haplotype visualization using Oligopaint FISH probes , 2015, Nature Communications.

[19]  A. Lesne,et al.  3D genome reconstruction from chromosomal contacts , 2014, Nature Methods.

[20]  William Stafford Noble,et al.  A statistical approach for inferring the 3D structure of the genome , 2014, Bioinform..

[21]  Yannick G. Spill,et al.  Restraint‐based three‐dimensional modeling of genomes and genomic domains , 2015, FEBS letters.

[22]  J. Dekker,et al.  Capturing Chromosome Conformation , 2002, Science.

[23]  M. L. Le Gros,et al.  Population-based 3D genome structure analysis reveals driving forces in spatial genome organization , 2016, Proceedings of the National Academy of Sciences.

[24]  Marc A Marti-Renom,et al.  Genome structure determination via 3C-based data integration by the Integrative Modeling Platform. , 2012, Methods.

[25]  Jing Liang,et al.  Chromatin architecture reorganization during stem cell differentiation , 2015, Nature.

[26]  M. Hestenes,et al.  Methods of conjugate gradients for solving linear systems , 1952 .

[27]  Daniel Ruiz,et al.  A Fast Algorithm for Matrix Balancing , 2013, Web Information Retrieval and Linear Algebra Algorithms.

[28]  Gerd Gruenert,et al.  Chromosome positioning and the clustering of functionally related loci in yeast is driven by chromosomal interactions , 2012, Nucleus.

[29]  Frank Alber,et al.  Mining 3D genome structure populations identifies major factors governing the stability of regulatory communities , 2016, Nature Communications.

[30]  Chenchen Zou,et al.  HSA: integrating multi-track Hi-C data for genome-scale reconstruction of 3D chromatin structure , 2016, Genome Biology.

[31]  Neva C. Durand,et al.  A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping , 2014, Cell.

[32]  James T. Robinson,et al.  Juicebox Provides a Visualization System for Hi-C Contact Maps with Unlimited Zoom. , 2016, Cell systems.

[33]  Peter G Wolynes,et al.  Transferable model for chromosome architecture , 2016, Proceedings of the National Academy of Sciences.

[34]  Dario Meluzzi,et al.  Recovering ensembles of chromatin conformations from contact probabilities , 2012, Nucleic acids research.

[35]  A. Tanay,et al.  Single cell Hi-C reveals cell-to-cell variability in chromosome structure , 2013, Nature.

[36]  Jianlin Cheng,et al.  MOGEN: a tool for reconstructing 3D models of genomes from chromosomal conformation capturing data , 2016, Bioinform..

[37]  J. Dekker,et al.  Predictive Polymer Modeling Reveals Coupled Fluctuations in Chromosome Conformation and Transcription , 2014, Cell.

[38]  William Stafford Noble,et al.  A Three-Dimensional Model of the Yeast Genome , 2010, Nature.

[39]  W. Bickmore,et al.  Single-Cell Dynamics of Genome-Nuclear Lamina Interactions , 2013, Cell.

[40]  Frank Alber,et al.  The three-dimensional genome organization of Drosophila melanogaster through data integration , 2017, Genome Biology.

[41]  Ben M. Webb,et al.  Putting the Pieces Together: Integrative Modeling Platform Software for Structure Determination of Macromolecular Assemblies , 2012, PLoS biology.

[42]  Andre J. Faure,et al.  3D structure of individual mammalian genomes studied by single cell Hi-C , 2017, Nature.

[43]  C. Nusbaum,et al.  Chromosome Conformation Capture Carbon Copy (5C): a massively parallel solution for mapping interactions between genomic elements. , 2006, Genome research.

[44]  I. Amit,et al.  Comprehensive mapping of long range interactions reveals folding principles of the human genome , 2011 .

[45]  Dariusz Plewczynski,et al.  3D-GNOME: an integrated web service for structural modeling of the 3D genome , 2016, Nucleic Acids Res..